BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy10508
(555 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|322794008|gb|EFZ17246.1| hypothetical protein SINV_10936 [Solenopsis invicta]
Length = 653
Score = 560 bits (1442), Expect = e-157, Method: Compositional matrix adjust.
Identities = 276/546 (50%), Positives = 379/546 (69%), Gaps = 32/546 (5%)
Query: 3 DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLK-LIKTPDIDIKVIENY 61
D Q +F S +++ K + + + D+K+A++Q++R LL L++ +++ +E Y
Sbjct: 13 DYLQQHFKLSDVSLFQKK-CIERYANESDRKAAVDQALRDTLLVVLMENASGNVRSLETY 71
Query: 62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN 121
I+ C+ELC KD+ +++P++LL D FD TLD+CEQLF +VE NVN+WK++ FF +CKNN
Sbjct: 72 ITFCIELCRKDLATASMPVMLLGDIFDSMTLDRCEQLFTFVENNVNVWKEELFFTACKNN 131
Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
LLRMCNDLLRRLSRSQ TVFCGRILLFLAKFFPFSERSGLNI+SEFN+EN TEFG ++
Sbjct: 132 LLRMCNDLLRRLSRSQQTVFCGRILLFLAKFFPFSERSGLNIVSEFNLENHTEFGVEKAE 191
Query: 182 DVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAA 241
+ + + D + K+ ID+N Y+KFW+LQD+FRNP QCYNK+ WK+F+++A VL+A
Sbjct: 192 EDDLEQITKDDDKLESKIPIDYNLYRKFWALQDFFRNPNQCYNKMHWKVFSAHASNVLSA 251
Query: 242 FKSYKLDDVQS-------------SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQF 288
F S+KL++ +S + YFAKYLTNQKLL+LQLSD+NFRRYVLLQF
Sbjct: 252 FASFKLEEQRSYPTTCMKMDTSTEGSHKETHYFAKYLTNQKLLELQLSDSNFRRYVLLQF 311
Query: 289 LILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
LILFQY STVK + E ELK DQ +WVK TTE VY+L+ +TPPDG F++ VK ILK E
Sbjct: 312 LILFQYLNSTVKFKAETHELKPDQVEWVKTTTEQVYTLLTETPPDGPMFAETVKNILKRE 371
Query: 349 EHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKK----------------KTPELTKLW 392
EHWN WKNEGCP KRP + + ++ + K++ PELTKLW
Sbjct: 372 EHWNAWKNEGCPPFKRPASEVGGDEPRKIRKPKRRIGDMIKDAQAIGKYHMGNPELTKLW 431
Query: 393 N-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
N +NLEACKS +RDF PSLE+YFE+AI ++DPAA V+++YKKVND N+ WRALRLL+R
Sbjct: 432 NLCPNNLEACKSKDRDFLPSLETYFEDAIMELDPAAMVDDKYKKVNDGNFGWRALRLLAR 491
Query: 452 KCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQ 511
+ PHFF++ + K E++E M+K+ K++P +Q + P+E +E++
Sbjct: 492 RSPHFFVHGNNPINKLPEYLEAMIKKIAKDRPQTQSDTKLETEETPPSESTDQEFNEDVL 551
Query: 512 EEEKEE 517
++E E+
Sbjct: 552 KQESEQ 557
>gi|332017814|gb|EGI58475.1| THO complex subunit 1 [Acromyrmex echinatior]
Length = 653
Score = 558 bits (1439), Expect = e-156, Method: Compositional matrix adjust.
Identities = 281/557 (50%), Positives = 383/557 (68%), Gaps = 37/557 (6%)
Query: 29 DYDKKSALEQSIRQYLLK-LIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTF 87
D D+K+A++Q++R LL L + +++ +E+YI+ C+ELC KD+ +++P++LL D F
Sbjct: 38 DSDRKAAIDQALRDTLLVVLTENASGNVRSLESYITFCIELCRKDLATASMPVMLLGDIF 97
Query: 88 DMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
D TLD+CEQLF +VE NV +WK++ FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRILL
Sbjct: 98 DSMTLDRCEQLFTFVENNVTVWKEELFFTACKNNLLRMCNDLLRRLSRSQQTVFCGRILL 157
Query: 148 FLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYK 207
FLAKFFPFSERSGLNI+SEFN+EN TEFG ++ + + + D + K+ ID+N Y+
Sbjct: 158 FLAKFFPFSERSGLNIVSEFNLENQTEFGLEKAEEDDLEQITKDDDKSESKIPIDYNLYR 217
Query: 208 KFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQS----------SLNPS 257
KFW+LQD+FRNP QCYNK+ WK+F+++A VL+AF S+KL++ +S S+ S
Sbjct: 218 KFWALQDFFRNPNQCYNKMHWKVFSAHASNVLSAFASFKLEEQRSYPSTCIKMDTSMEDS 277
Query: 258 G---DYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEK 314
YFAKYLTNQKLL+LQLSD+NFRRYVLLQFLILFQY STVK + E ELK DQ +
Sbjct: 278 HKETHYFAKYLTNQKLLELQLSDSNFRRYVLLQFLILFQYLNSTVKFKAETHELKPDQVE 337
Query: 315 WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDK 374
WVK TE VY+L+ +TPPDG F++ VK ILK EEHWN WKNEGCP KRP + ++DE+
Sbjct: 338 WVKTITEQVYTLLTETPPDGPTFAETVKNILKREEHWNTWKNEGCPPFKRPASDVSDEEP 397
Query: 375 --------------KDEPDAKK--KKTPELTKLWN-SKDNLEACKSAERDFTPSLESYFE 417
+D K PELTKLWN +NLEACKS +RDF PSLE+YFE
Sbjct: 398 RKPRRPKRRIGDVIRDAQTVGKYHMGNPELTKLWNLCPNNLEACKSKDRDFLPSLETYFE 457
Query: 418 EAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKR 477
+AI ++DPAA V+++YKKVND N+ WRALRLL+R+ PHFF++ + K E++E M+K+
Sbjct: 458 DAIMELDPAAMVDDKYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLPEYLETMIKK 517
Query: 478 CVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEE------DWEAKADPEGDADE 531
K++P +Q + P+E +E++ ++E E+ D +A+ + +
Sbjct: 518 IAKDRPQTQSDTKLETEETPPSESTDQEFNEDVLKQESEQVDAESADIKARKLNKVTPEM 577
Query: 532 VMSVEYCYQRNWHKQAV 548
V + + +W K A+
Sbjct: 578 VAKLSESLKNDWKKLAI 594
>gi|383852527|ref|XP_003701778.1| PREDICTED: THO complex subunit 1-like [Megachile rotundata]
Length = 654
Score = 558 bits (1437), Expect = e-156, Method: Compositional matrix adjust.
Identities = 279/538 (51%), Positives = 372/538 (69%), Gaps = 42/538 (7%)
Query: 27 TCDYDKKSALEQSIRQYLLKLI-KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
T D D+K+A++Q++R LL ++ + +++++E YI+ C+ELC +D+ +++P++LL D
Sbjct: 36 TNDGDRKAAIDQALRDTLLVILSENAANNVQLLETYITFCIELCKRDLATASMPVMLLGD 95
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
FD TLD+CEQLF +VE NV +WK+ FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRI
Sbjct: 96 IFDSMTLDRCEQLFTFVENNVVVWKEDLFFSACKNNLLRMCNDLLRRLSRSQQTVFCGRI 155
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNF 205
LLFLAKFFPFSERSGLNI+SEFN+EN TEFG ++ + + + + KV ID+N
Sbjct: 156 LLFLAKFFPFSERSGLNIVSEFNLENFTEFGSEKSEGNDLEQVTKDEDKSETKVPIDYNL 215
Query: 206 YKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSLN 255
Y+KFW+LQD+FRNP QCY+K+ WK+F+++A +VL+AF S+KL++ + SS+
Sbjct: 216 YRKFWALQDFFRNPNQCYSKMYWKVFSAHASSVLSAFSSFKLEEQRTYPSTSIKLDSSME 275
Query: 256 PSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQ 312
S YFAKYLTNQKLL+LQLSD+NFRRYVLLQFLILFQY STVK + E ELK DQ
Sbjct: 276 GSHKETPYFAKYLTNQKLLELQLSDSNFRRYVLLQFLILFQYLNSTVKFKAETHELKPDQ 335
Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
WVK TTE VY+L+ +TPPDG F++ VK ILK EEHWN WKNEGCP K+P D+
Sbjct: 336 VDWVKATTEQVYALLTETPPDGPAFAETVKNILKREEHWNAWKNEGCPPFKKP-----DQ 390
Query: 373 DKKDEPDAKKKK----------------------TPELTKLWN-SKDNLEACKSAERDFT 409
D DE + +K K PELTKLWN +NLEACKS +RDF
Sbjct: 391 DSIDEEEVRKPKRPRHRIGDVIREAQAVGKYHMGNPELTKLWNLCPNNLEACKSKDRDFL 450
Query: 410 PSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSE 469
PSLE+YFEEAI Q+DP+A V+++YKKVND N+ WRALRLL+R+ PHFF++ + K E
Sbjct: 451 PSLETYFEEAIMQLDPSAMVDDEYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLPE 510
Query: 470 FIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEG 527
++E M+K+ K++P +Q + P E SE++ ++E E+ AD +
Sbjct: 511 YLETMIKKIAKDRPQTQSDTKLETEETPPPEANDQEFSEDVLKQESEQVESENADAKA 568
>gi|350398011|ref|XP_003485060.1| PREDICTED: THO complex subunit 1-like [Bombus impatiens]
Length = 655
Score = 556 bits (1434), Expect = e-156, Method: Compositional matrix adjust.
Identities = 294/586 (50%), Positives = 391/586 (66%), Gaps = 42/586 (7%)
Query: 3 DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLI-KTPDIDIKVIENY 61
D Q F S + + K + + T D D+K+A++Q++R LL ++ + ++ ++E Y
Sbjct: 13 DYLQQCFKLSDIQIFQKK-CVERCTNDSDRKAAIDQALRDTLLVILSENLSSNVPLLETY 71
Query: 62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN 121
I+ C+ELC KD+ +++P++LL D FD TLD+CEQLF +VE NV +WK+ FF +CKNN
Sbjct: 72 ITFCIELCRKDLATASMPVMLLGDIFDSMTLDRCEQLFTFVENNVVVWKEDLFFSACKNN 131
Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-E 180
LLRMCNDLLRRLSRSQ TVFCGRILLFLAKFFPFSERSGLNI+SEFN+EN TEFG ++ E
Sbjct: 132 LLRMCNDLLRRLSRSQQTVFCGRILLFLAKFFPFSERSGLNIVSEFNLENHTEFGSEKSE 191
Query: 181 MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
D + T++ D E KV ID+N Y+KFW+LQD+FRNP Q Y K+ WK+F+++A +VL+
Sbjct: 192 GDDLEHITKDDDKSET-KVPIDYNLYRKFWALQDFFRNPNQLYLKMHWKIFSTHASSVLS 250
Query: 241 AFKSYKLDD----------VQSSLNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQ 287
AF S+KL++ V+SS+ S YFAKYLTNQKLL+LQLSD+NFRRYVLLQ
Sbjct: 251 AFSSFKLEEQRNYPSTCVKVESSMEGSHKETPYFAKYLTNQKLLELQLSDSNFRRYVLLQ 310
Query: 288 FLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKG 347
FLILFQY STVK + E ELK DQ +WVK TTE VY+L+ +TPPDG F++ VK ILK
Sbjct: 311 FLILFQYLNSTVKFKAETHELKPDQVEWVKGTTEQVYALLTETPPDGPTFAETVKNILKR 370
Query: 348 EEHWNQWKNEGCPELKRPLT-SITDEDKKDEPDAKKKK-----------------TPELT 389
EEHWN WKNEGCP KRP S D ++ +P +++ PELT
Sbjct: 371 EEHWNAWKNEGCPPFKRPTPDSSADIEESKKPKRPRRRIGDVIRDAQAVGKYHMGNPELT 430
Query: 390 KLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRL 448
KLWN +NLEACKS ERDF PSLE+YFEEAI Q+DP A VE++YKKVND N+ WRALRL
Sbjct: 431 KLWNLCPNNLEACKSKERDFLPSLETYFEEAIMQLDPNAMVEDEYKKVNDGNFGWRALRL 490
Query: 449 LSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSE 508
L+R+ PHFF++ + K E++E M+K+ K++P +Q + P E +E
Sbjct: 491 LARRSPHFFVHGNYPINKLPEYLETMIKKIAKDRPQTQSDIKLETEETPPPEANDQEFNE 550
Query: 509 EIQEEEKEEDWEAKADPEG------DADEVMSVEYCYQRNWHKQAV 548
++ ++E E+ D +G D V + + +W K A
Sbjct: 551 DVLKQESEQVEAENTDTKGRKLTKVTPDMVAKLSEILKNDWKKLAT 596
>gi|307174438|gb|EFN64939.1| THO complex subunit 1 [Camponotus floridanus]
Length = 654
Score = 555 bits (1430), Expect = e-155, Method: Compositional matrix adjust.
Identities = 283/527 (53%), Positives = 370/527 (70%), Gaps = 35/527 (6%)
Query: 29 DYDKKSALEQSIRQYLLKLIKTPDID-IKVIENYISLCVELCMKDMCNSTLPIILLSDTF 87
D D+K+A++Q++R LL ++ D + +E YI+ C+ELC KD+ +++P++LL D F
Sbjct: 38 DGDRKAAIDQALRDTLLVILTENASDNVHSLEMYITFCIELCRKDLATASMPVMLLGDIF 97
Query: 88 DMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
D TLDKCEQLF +VE NV IWK+ FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRILL
Sbjct: 98 DSMTLDKCEQLFTFVENNVTIWKEDLFFTACKNNLLRMCNDLLRRLSRSQQTVFCGRILL 157
Query: 148 FLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEETDTEEVDKVKIDFNFY 206
FLAKFFPFSERSGLNI+SEFN+EN TEFG ++ E D T++ D E +K+ ID+N Y
Sbjct: 158 FLAKFFPFSERSGLNIVSEFNLENHTEFGSEKVEGDDLEQITKDDDKSE-NKIPIDYNLY 216
Query: 207 KKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSLNP 256
+KFW+LQD+FRNP QCY KV WK+F+++A VL+AF S+KL+D + SS+
Sbjct: 217 RKFWALQDFFRNPNQCYVKVHWKVFSAHASNVLSAFASFKLEDQRSYPTTCINMDSSMEG 276
Query: 257 S---GDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQE 313
S YFAKYLTNQKLL+LQLSD+NFRRYVLLQFLI+FQY STVK + E ELK DQ
Sbjct: 277 SYKETHYFAKYLTNQKLLELQLSDSNFRRYVLLQFLIVFQYLNSTVKFKAETHELKPDQV 336
Query: 314 KWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDED 373
+WVK TTE VY+L+ +TPPDG F++ VK ILK EEHWN WKNEGCP KRP + I D D
Sbjct: 337 EWVKTTTEQVYALLTETPPDGPAFAETVKNILKREEHWNAWKNEGCPPFKRPASDIIDGD 396
Query: 374 K----------------KDEPDAKKKKTP--ELTKLWN-SKDNLEACKSAERDFTPSLES 414
+ +D K ELTKLWN +NLEACKS +RDF PSLE+
Sbjct: 397 EEPRKPKRPKRRIGDVIRDAQAVGKYHMGNLELTKLWNLCPNNLEACKSKDRDFLPSLET 456
Query: 415 YFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENM 474
YFE+AI ++DPAA V+++YKKVND N+ WRALRLL+R+ PHFF++ + K E++E M
Sbjct: 457 YFEDAIMELDPAAMVDDKYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLPEYLETM 516
Query: 475 VKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEA 521
+K+ K++P +Q + P+E +E++ +++ E+D E+
Sbjct: 517 IKKIAKDRPQTQSDTKLETEETPPSESNDQEFNEDVLKQDSEQDSES 563
>gi|328784514|ref|XP_393145.3| PREDICTED: THO complex subunit 1-like isoform 1 [Apis mellifera]
Length = 654
Score = 554 bits (1428), Expect = e-155, Method: Compositional matrix adjust.
Identities = 278/492 (56%), Positives = 357/492 (72%), Gaps = 35/492 (7%)
Query: 27 TCDYDKKSALEQSIRQYLLKLIKTPDID-IKVIENYISLCVELCMKDMCNSTLPIILLSD 85
T D D+K+A++Q++R LL ++ ++ ++++E YI+ C+ELC KD+ +++P++LL D
Sbjct: 36 TNDSDRKAAIDQALRDTLLVILSENALNNVQLLETYITFCIELCRKDLATASMPVMLLGD 95
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
FD TLD+CEQLF +VE NV +WK+ FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRI
Sbjct: 96 IFDSMTLDRCEQLFTFVENNVVVWKEDLFFSACKNNLLRMCNDLLRRLSRSQQTVFCGRI 155
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEETDTEEVDKVKIDFN 204
LLFLAKFFPFSERSGLNI+SEFN+EN TEFG ++ E D T++ D E KV ID+N
Sbjct: 156 LLFLAKFFPFSERSGLNIVSEFNLENHTEFGSEKSEGDDLEQITKDDDKSET-KVPIDYN 214
Query: 205 FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSL 254
Y+KFW+LQD+FRNP QCY K+ WK+F+++A +VL+AF S+KL++ V SS+
Sbjct: 215 LYRKFWALQDFFRNPNQCYQKMHWKVFSAHASSVLSAFSSFKLEEQRNYPSTCIKVDSSI 274
Query: 255 NPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSD 311
S YFAKYLTNQKLL+LQLSD+NFRRYVLLQFLILFQY STVK + E ELK D
Sbjct: 275 EGSHKETPYFAKYLTNQKLLELQLSDSNFRRYVLLQFLILFQYLNSTVKFKAETHELKPD 334
Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLT-SIT 370
Q WVK TTE VY+L+ +TPPDG F++ VK ILK EEHWN WKNEGCP K+P S T
Sbjct: 335 QVDWVKATTEQVYALLTETPPDGPTFAETVKNILKREEHWNAWKNEGCPAFKKPAPDSST 394
Query: 371 DEDKKDEPDAKKKK-----------------TPELTKLWN-SKDNLEACKSAERDFTPSL 412
D + +P K++ PELTKLWN +NLEACKS ERDF PSL
Sbjct: 395 DIEDLRKPKRPKRRIGDVIRDAQTVGKYHMGNPELTKLWNLCPNNLEACKSKERDFLPSL 454
Query: 413 ESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIE 472
E+YFEEAI Q+DP A VE++YKKVND N+ WRALRLL+R+ PHFF++ + K E++E
Sbjct: 455 ETYFEEAIMQLDPNAMVEDEYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLPEYLE 514
Query: 473 NMVKRCVKEKPS 484
M+K+ K++P+
Sbjct: 515 TMIKKIAKDRPT 526
>gi|380029639|ref|XP_003698475.1| PREDICTED: LOW QUALITY PROTEIN: THO complex subunit 1-like [Apis
florea]
Length = 658
Score = 554 bits (1427), Expect = e-155, Method: Compositional matrix adjust.
Identities = 274/491 (55%), Positives = 354/491 (72%), Gaps = 33/491 (6%)
Query: 27 TCDYDKKSALEQSIRQYLLKLIKTPDI-DIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
T D D+K+A++Q++R LL ++ + +++++E YI+ C+ELC KD+ +++P++LL D
Sbjct: 36 TNDSDRKAAIDQALRDTLLVILSENALSNVQLLETYITFCIELCRKDLATASMPVMLLGD 95
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
FD TLD+CEQLF +VE NV +WK+ FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRI
Sbjct: 96 IFDSMTLDRCEQLFTFVENNVVVWKEDLFFSACKNNLLRMCNDLLRRLSRSQQTVFCGRI 155
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNF 205
LLFLAKFFPFSERSGLNI+SEFN+EN TEFG ++ + + D + KV ID+N
Sbjct: 156 LLFLAKFFPFSERSGLNIVSEFNLENHTEFGSEKSEGDDLEQINKDDDKSEIKVPIDYNL 215
Query: 206 YKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSLN 255
Y+KFW+LQD+FRNP QCY K+ WK+F+++A +VL+AF S+KL++ V SS+
Sbjct: 216 YRKFWALQDFFRNPNQCYQKMHWKVFSAHASSVLSAFSSFKLEEQRNYPSTCIKVDSSIE 275
Query: 256 PSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQ 312
S YFAKYLTNQKLL+LQLSD+NFRRYVLLQFLILFQY STVK + E ELK DQ
Sbjct: 276 GSHKETPYFAKYLTNQKLLELQLSDSNFRRYVLLQFLILFQYLNSTVKFKAETHELKPDQ 335
Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLT-SITD 371
WVK TTE VY+L+ +TPPDG F++ VK ILK EEHWN WKNEGCP K+P S TD
Sbjct: 336 VDWVKATTEQVYALLTETPPDGPTFAETVKNILKREEHWNAWKNEGCPAFKKPTPDSSTD 395
Query: 372 EDKKDEPDAKKKK-----------------TPELTKLWN-SKDNLEACKSAERDFTPSLE 413
+ +P K++ PELTKLWN +NLEACKS ERDF PSLE
Sbjct: 396 IEDLRKPKRPKRRIGDVIRDAQTVGKYHMGNPELTKLWNLCPNNLEACKSKERDFLPSLE 455
Query: 414 SYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIEN 473
+YFEEAI Q+DP A VE++YKKVND N+ WRALRLL+R+ PHFF++ + K E++E
Sbjct: 456 TYFEEAIMQLDPNAMVEDEYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLPEYLET 515
Query: 474 MVKRCVKEKPS 484
M+K+ K++P+
Sbjct: 516 MIKKIAKDRPT 526
>gi|307213955|gb|EFN89189.1| THO complex subunit 1 [Harpegnathos saltator]
Length = 653
Score = 553 bits (1424), Expect = e-154, Method: Compositional matrix adjust.
Identities = 270/488 (55%), Positives = 353/488 (72%), Gaps = 32/488 (6%)
Query: 29 DYDKKSALEQSIRQYLL-KLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTF 87
D D+K+A++Q++R LL L + ++ +E YI+ C+ELC KD+ +++P++LL D F
Sbjct: 38 DGDRKAAVDQALRDTLLIVLTENSSGNVHSLELYITFCIELCRKDLATASMPVMLLGDIF 97
Query: 88 DMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
D TLD+CEQLF +VE NV++WK+ FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRILL
Sbjct: 98 DSVTLDRCEQLFTFVEDNVSVWKEDLFFSACKNNLLRMCNDLLRRLSRSQQTVFCGRILL 157
Query: 148 FLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYK 207
FLAKFFPFSERSGLNI+SEFN+EN TEFG ++ + + D + +K+ ID+N Y+
Sbjct: 158 FLAKFFPFSERSGLNIVSEFNLENYTEFGSEKTEGDDLEQINKDDDKSENKIPIDYNLYR 217
Query: 208 KFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSLNPS 257
KFW+LQDYFRNP QCYNK+ WK+F+++A VL+AF S+KL++ + SS+ S
Sbjct: 218 KFWALQDYFRNPNQCYNKMHWKVFSAHASNVLSAFSSFKLEEQRSYPTTCIKMDSSMEGS 277
Query: 258 G---DYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEK 314
YFAKYLTNQKLL+LQLSD+NFRRYVLLQFLILFQY STVK + E ELK DQ +
Sbjct: 278 HKETHYFAKYLTNQKLLELQLSDSNFRRYVLLQFLILFQYLNSTVKFKAETHELKPDQVE 337
Query: 315 WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDK 374
WVK TTE VY+L+ +TPPDG F++ VK ILK EEHWN WKNEGCP KRP + I +++
Sbjct: 338 WVKTTTEQVYALLTETPPDGPAFAETVKNILKREEHWNAWKNEGCPPFKRPTSDIAADEE 397
Query: 375 ---------------KDEPDAKK--KKTPELTKLWN-SKDNLEACKSAERDFTPSLESYF 416
+D K PELTKLWN DNLEACKS +RDF PSLE+YF
Sbjct: 398 PRKPKRPRRRIGDVIRDAQTVGKYHMGNPELTKLWNLCPDNLEACKSKDRDFLPSLETYF 457
Query: 417 EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVK 476
E+AI ++DPAA ++++YKKVND N+ WRALRLL+R+ PHFF++ + K E++E M+K
Sbjct: 458 EDAIVELDPAAMIDDKYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLPEYLETMIK 517
Query: 477 RCVKEKPS 484
+ K++P+
Sbjct: 518 KIAKDRPT 525
>gi|340720907|ref|XP_003398870.1| PREDICTED: THO complex subunit 1-like [Bombus terrestris]
Length = 654
Score = 552 bits (1423), Expect = e-154, Method: Compositional matrix adjust.
Identities = 291/585 (49%), Positives = 387/585 (66%), Gaps = 41/585 (7%)
Query: 3 DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLI-KTPDIDIKVIENY 61
D Q F S + + K + + T D D+K+A++Q++R LL ++ + ++ ++E Y
Sbjct: 13 DYLQQCFKLSDIQIFQKK-CVERCTNDSDRKAAIDQALRDTLLVILSENVSSNVPLLETY 71
Query: 62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN 121
I+ C+ELC KD+ +++P++LL D FD TLD+CEQLF +VE NV +WK+ FF +CKNN
Sbjct: 72 ITFCIELCRKDLATASMPVMLLGDIFDSMTLDRCEQLFTFVENNVVVWKEDLFFSACKNN 131
Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
LLRMCNDLLRRLSRSQ TVFCGRILLFLAKFFPFSERSGLNI+SEFN+EN TEFG ++
Sbjct: 132 LLRMCNDLLRRLSRSQQTVFCGRILLFLAKFFPFSERSGLNIVSEFNLENHTEFGSEKSE 191
Query: 182 DVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAA 241
+ + D + KV ID+N Y+KFW+LQD+FRNP Q Y K+ WK+F+++A +VL+A
Sbjct: 192 GDDLEQITKDDDKSDTKVPIDYNLYRKFWALQDFFRNPNQLYLKMHWKIFSTHASSVLSA 251
Query: 242 FKSYKLDD----------VQSSLNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQF 288
F S+KL++ V+SS+ S YFAKYLTNQKLL+LQLSD+NFRRYVLLQF
Sbjct: 252 FSSFKLEEQRNYPSTCVKVESSMEGSHKETPYFAKYLTNQKLLELQLSDSNFRRYVLLQF 311
Query: 289 LILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
LILFQY STVK + E ELK DQ +WVK TTE VY+L+ +TPPDG F++ VK ILK E
Sbjct: 312 LILFQYLNSTVKFKAETHELKPDQVEWVKGTTEQVYALLTETPPDGPTFAETVKNILKRE 371
Query: 349 EHWNQWKNEGCPELKRPLT-SITDEDKKDEPDAKKKK-----------------TPELTK 390
EHWN WKNEGCP KRP S D ++ +P ++K PELTK
Sbjct: 372 EHWNAWKNEGCPPFKRPTPDSSADIEESKKPKRPRRKIGDVIRDAQAVGKYHMGNPELTK 431
Query: 391 LWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLL 449
LWN +NLEACKS ERDF PSLE+YFEEAI Q+DP A VE++YKKVND N+ WRALRLL
Sbjct: 432 LWNLCPNNLEACKSKERDFLPSLETYFEEAIMQLDPNAMVEDEYKKVNDGNFGWRALRLL 491
Query: 450 SRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEE 509
+R+ PHFF++ + K E++E M+K+ K++P +Q + P E +E+
Sbjct: 492 ARRSPHFFVHGNYPINKLPEYLETMIKKIAKDRP-TQSDIKLETEETPPPEANDQEFNED 550
Query: 510 IQEEEKEEDWEAKADPEG------DADEVMSVEYCYQRNWHKQAV 548
+ ++E E+ D +G D V + + +W K A
Sbjct: 551 VLKQESEQVEAENTDTKGRKLTKVTPDMVAKLSEILKNDWKKLAT 595
>gi|345480955|ref|XP_001607027.2| PREDICTED: THO complex subunit 1 isoform 1 [Nasonia vitripennis]
Length = 656
Score = 546 bits (1407), Expect = e-152, Method: Compositional matrix adjust.
Identities = 273/519 (52%), Positives = 361/519 (69%), Gaps = 34/519 (6%)
Query: 29 DYDKKSALEQSIRQYLLKLIKTPDID-IKVIENYISLCVELCMKDMCNSTLPIILLSDTF 87
D D+K A++Q++R L+ ++ + + +E+YI C+ELC KD+ +++P++LLSD F
Sbjct: 38 DKDRKPAIDQALRDTLVIILSNNATNNTRTLESYIKFCIELCRKDLATTSMPVMLLSDMF 97
Query: 88 DMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
D TL+ CE LF YVE NV+ WK+ FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRILL
Sbjct: 98 DALTLNLCEDLFTYVENNVSTWKEDLFFGACKNNLLRMCNDLLRRLSRSQQTVFCGRILL 157
Query: 148 FLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYK 207
FLAKFFPFSERSGLNI+SEFN+ENITEFG ++ + + E + K+ ID+N Y+
Sbjct: 158 FLAKFFPFSERSGLNIVSEFNLENITEFGSEKSEEEVIQQMEVDGEKSETKISIDYNLYR 217
Query: 208 KFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSLNPS 257
KFW+LQD+FRNP QCYNK+ WK F+++A VL+ F S+KL+D ++S L+
Sbjct: 218 KFWALQDFFRNPNQCYNKMHWKCFSTHATNVLSTFASFKLEDQKDCIIDHLKIESLLDGR 277
Query: 258 ---GDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEK 314
YFAKYLTN+KLL+LQLSD+NFRR+VLLQFLILFQY STVK + E ELK DQ +
Sbjct: 278 YKDNHYFAKYLTNEKLLELQLSDSNFRRFVLLQFLILFQYLNSTVKFKSETHELKPDQVE 337
Query: 315 WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS-ITDED 373
WV TTE VY L+ +TPPDG F++ VK ILK EE+WN WKNEGCP KRP +T+ D
Sbjct: 338 WVNTTTEQVYLLLTETPPDGAAFAEAVKHILKREEYWNSWKNEGCPAFKRPAPEPMTEAD 397
Query: 374 KKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACKSAERDFTPSLESY 415
+ +P ++K ELTKLWN +NLEACKS +RDF PSLE+Y
Sbjct: 398 EARKPKRLRRKIGDLIRDAQAVGKYYMGNQELTKLWNLCPNNLEACKSKDRDFLPSLETY 457
Query: 416 FEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMV 475
FEEAI+Q+DPAAA+E++YKKVND N+ WRALRLL+R+ PHFF++ + K E++E M+
Sbjct: 458 FEEAIEQLDPAAAIEDEYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLQEYLEIMI 517
Query: 476 KRCVKEKPSSQISGNGNGVDQDPAE-VEVDTKSEEIQEE 513
K+ K++P Q + P+E +E D E +++E
Sbjct: 518 KKIAKDRPQVQTDIKQEAGETPPSEQIEQDFNEETLKQE 556
>gi|345480957|ref|XP_003424254.1| PREDICTED: THO complex subunit 1 isoform 2 [Nasonia vitripennis]
Length = 660
Score = 542 bits (1397), Expect = e-151, Method: Compositional matrix adjust.
Identities = 274/522 (52%), Positives = 363/522 (69%), Gaps = 37/522 (7%)
Query: 29 DYDKKSALEQSIRQYLLKLIKTPDID-IKVIENYISLCVELCMKDMCNSTLPIILLSDTF 87
D D+K A++Q++R L+ ++ + + +E+YI C+ELC KD+ +++P++LLSD F
Sbjct: 38 DKDRKPAIDQALRDTLVIILSNNATNNTRTLESYIKFCIELCRKDLATTSMPVMLLSDMF 97
Query: 88 DMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
D TL+ CE LF YVE NV+ WK+ FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRILL
Sbjct: 98 DALTLNLCEDLFTYVENNVSTWKEDLFFGACKNNLLRMCNDLLRRLSRSQQTVFCGRILL 157
Query: 148 FLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYK 207
FLAKFFPFSERSGLNI+SEFN+ENITEFG ++ + + E + K+ ID+N Y+
Sbjct: 158 FLAKFFPFSERSGLNIVSEFNLENITEFGSEKSEEEVIQQMEVDGEKSETKISIDYNLYR 217
Query: 208 KFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSLNPS 257
KFW+LQD+FRNP QCYNK+ WK F+++A VL+ F S+KL+D ++S L+
Sbjct: 218 KFWALQDFFRNPNQCYNKMHWKCFSTHATNVLSTFASFKLEDQKDCIIDHLKIESLLDGR 277
Query: 258 ---GDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEK 314
YFAKYLTN+KLL+LQLSD+NFRR+VLLQFLILFQY STVK + E ELK DQ +
Sbjct: 278 YKDNHYFAKYLTNEKLLELQLSDSNFRRFVLLQFLILFQYLNSTVKFKSETHELKPDQVE 337
Query: 315 WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS-ITDED 373
WV TTE VY L+ +TPPDG F++ VK ILK EE+WN WKNEGCP KRP +T+ D
Sbjct: 338 WVNTTTEQVYLLLTETPPDGAAFAEAVKHILKREEYWNSWKNEGCPAFKRPAPEPMTEAD 397
Query: 374 KKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACKSAERDFTPSLESY 415
+ +P ++K ELTKLWN +NLEACKS +RDF PSLE+Y
Sbjct: 398 EARKPKRLRRKIGDLIRDAQAVGKYYMGNQELTKLWNLCPNNLEACKSKDRDFLPSLETY 457
Query: 416 FEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMV 475
FEEAI+Q+DPAAA+E++YKKVND N+ WRALRLL+R+ PHFF++ + K E++E M+
Sbjct: 458 FEEAIEQLDPAAAIEDEYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLQEYLEIMI 517
Query: 476 KRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEE 517
K+ K++P I+ + Q+ E SE+I+++ EE
Sbjct: 518 KKIAKDRPVC-ITNVQTDIKQEAGET---PPSEQIEQDFNEE 555
>gi|328714151|ref|XP_001950767.2| PREDICTED: THO complex subunit 1-like [Acyrthosiphon pisum]
Length = 656
Score = 504 bits (1297), Expect = e-140, Method: Compositional matrix adjust.
Identities = 256/495 (51%), Positives = 342/495 (69%), Gaps = 39/495 (7%)
Query: 29 DYDKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDT 86
D D++S L+Q +R Y+L ++ K P++ V + + S +EL +MC + +P+++LSD
Sbjct: 40 DADRRSILDQVLRCYVLDIVLSKPPNL-YDVCKMWTSFTIELVRNEMCTAIMPVVILSDM 98
Query: 87 FDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRIL 146
FD++T+D CE++F +VE NVN+ K+ FFM+CKNNLLRMCNDLLRRLSRS+NTVFCGRIL
Sbjct: 99 FDVTTMDVCEKMFDHVESNVNVLKEPQFFMACKNNLLRMCNDLLRRLSRSRNTVFCGRIL 158
Query: 147 LFLAKFFPFSERSGLNIISEFNVENITE------FGGDEEMDVSSNETEETDTEEVDKVK 200
LFLAKFFPFSERSGLNI+SEFN+EN+TE F E M S E E ++KV
Sbjct: 159 LFLAKFFPFSERSGLNIVSEFNLENVTEYLNENHFDQTECMPDSIPEDEAKSDIVIEKVN 218
Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD- 259
+D N Y KFWSLQDYFRNP QCY WK+F S+ + + + F+S+KL+ V N S +
Sbjct: 219 VDKNLYLKFWSLQDYFRNPNQCYKTEHWKLFISHTDFIFSTFQSHKLEHVNRMDNESAEQ 278
Query: 260 ---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWV 316
YF KYLTNQKLL+LQLSD NFRRY+L+QFL+LFQY + VK + + ++K +Q++W+
Sbjct: 279 NMHYFPKYLTNQKLLELQLSDVNFRRYILIQFLLLFQYLDAPVKFKLDNFKIKPEQQEWI 338
Query: 317 KDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKD 376
K++ T+Y LI TPPDG FS+VVK IL+ EE WN+WKN+GCPE+ + L + D+ D
Sbjct: 339 KNSITTIYELINNTPPDGVRFSEVVKNILQREEQWNKWKNDGCPEVTKRLPT---SDQSD 395
Query: 377 EPDAKKK--------------------KTPELTKLWNSK-DNLEACKSAERDFTPSLESY 415
P K++ P+LTKLWN K DNLEACK ERDF PS +S+
Sbjct: 396 MPSPKRRLRRNLGDVIKNMTANNKVFLGDPDLTKLWNLKPDNLEACKGPERDFLPSFDSF 455
Query: 416 FEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMV 475
F+EA +Q+DP +E QYKKVND N+ WRALRLL+RK PHFF + + K SE++ENM+
Sbjct: 456 FQEAFEQLDPELCIESQYKKVNDGNFGWRALRLLARKSPHFFAQSNNPIHKLSEYLENMI 515
Query: 476 KRCV-KEK-PSSQIS 488
K+ + KEK P +Q S
Sbjct: 516 KKTMSKEKIPINQGS 530
>gi|91088827|ref|XP_970265.1| PREDICTED: similar to Hpr1 CG2031-PA [Tribolium castaneum]
gi|270012332|gb|EFA08780.1| hypothetical protein TcasGA2_TC006470 [Tribolium castaneum]
Length = 649
Score = 492 bits (1267), Expect = e-136, Method: Compositional matrix adjust.
Identities = 263/548 (47%), Positives = 359/548 (65%), Gaps = 38/548 (6%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
D K+ L+Q+ R LL+L++ ++++ V+ + V C K++ N T+P++LLSD FD
Sbjct: 41 DLKAPLDQAFRDVLLELLEK-EVNVSVLRELVDFSVLTCRKELSNPTMPVMLLSDIFDAL 99
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
TLD C+++F YVE NV +WK+ FF CKNNLLR+CNDLLRRLSRS TVFCGRILLFLA
Sbjct: 100 TLDACQEMFLYVENNVEVWKENLFFSCCKNNLLRLCNDLLRRLSRSTATVFCGRILLFLA 159
Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFW 210
KFFPFSERSGLNI+SEFN+EN TE+G D S + + D +E V IDF FY KFW
Sbjct: 160 KFFPFSERSGLNIVSEFNLENTTEYGTDS----SQEKGGDVDGDEKKNV-IDFTFYCKFW 214
Query: 211 SLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD-----YFAKYL 265
SLQD+FRNP Q Y KV WK+F +YA +V F+ KL+ V+++ + + YF+KYL
Sbjct: 215 SLQDFFRNPNQLYTKVQWKLFCTYATSVFNTFQGLKLEHVENNGSYCDESKNKMYFSKYL 274
Query: 266 TNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYS 325
T+QKLLDLQL D NFRR VLLQFLILFQY +S+VK + E ELKSDQ++W++ +TE VYS
Sbjct: 275 TSQKLLDLQLYDVNFRRAVLLQFLILFQYLSSSVKFKSESFELKSDQKEWIQSSTEKVYS 334
Query: 326 LIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKK---DEP---D 379
L+++TPPDG+ F+++ IL EE+WN WKN+GCPELK+ + DK+ + P D
Sbjct: 335 LLRETPPDGKRFAEIASNILSREEYWNAWKNDGCPELKKSVVPTESPDKRRTEERPLLGD 394
Query: 380 AKKKKTP---------ELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAV 429
K+ + ELTKLWN DNLEACK ERDF P+LE YF +AI Q++ AV
Sbjct: 395 IIKQASSEGKYYMGSTELTKLWNLCPDNLEACKGRERDFLPTLEDYFADAIAQIESPGAV 454
Query: 430 EEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISG 489
+++ + D NY WRALRLL+++ PHFF +T + S ++E MVK+ +KP +++
Sbjct: 455 KDEDNLLKDGNYGWRALRLLAKRSPHFFTYSTVIMNPLSSYLEMMVKKIAHDKP-GRVTE 513
Query: 490 NGNGVDQDPAEVE-------VDTKSEEIQEEEKEE---DWEAKADPEGDADEVMSVEYCY 539
N Q+ AE E VD +E+I+ E EE D A+ + + + +++
Sbjct: 514 NSQESQQNDAEEENLFKDTQVDENTEDIKPSETEEFMDDKNARVEHKLNPNQLQLFSEKV 573
Query: 540 QRNWHKQA 547
NW K A
Sbjct: 574 APNWKKLA 581
>gi|357606797|gb|EHJ65225.1| hypothetical protein KGM_07180 [Danaus plexippus]
Length = 565
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 248/510 (48%), Positives = 334/510 (65%), Gaps = 56/510 (10%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
D+KSA++Q+ R KL+ + +E+Y++ C++ C + M +T+P++LL D FD
Sbjct: 41 DRKSAMDQAFRD---KLLDLLLEEPNTLESYVNFCIDSCRRQMVTATMPVVLLGDIFDAL 97
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
TL+KCE++F YVE VNIW+++ FF++CKN+LLRMCNDLLRRLSRSQNTVFCGRILLFLA
Sbjct: 98 TLNKCEKMFMYVENGVNIWREELFFVACKNHLLRMCNDLLRRLSRSQNTVFCGRILLFLA 157
Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTE-EVDKVKIDFNFYKKF 209
KFFPFSERSGLNI+SEFN+ENITEFGGD + EE E + +K+ ID+N Y +F
Sbjct: 158 KFFPFSERSGLNIVSEFNLENITEFGGDNTSTLKDVLDEEMVVEDDKNKLVIDYNLYCRF 217
Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-----------------DVQS 252
WSLQD+FRNP CYNK+ WK F +++ +VL+AF SYKL+ DV+
Sbjct: 218 WSLQDFFRNPNTCYNKLQWKTFVAHSGSVLSAFSSYKLEAVELQKSKLNILKSVNSDVEM 277
Query: 253 SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQ 312
N YFAK+LTNQKLL+LQLSD+NFRR VL+Q+LILFQY STVK + E ELKSDQ
Sbjct: 278 QENKEQHYFAKFLTNQKLLELQLSDSNFRRCVLIQYLILFQYLMSTVKFKMESQELKSDQ 337
Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
WVKDTT VY L+ +TPPDG+ F++ VK ILK EEHWN WKN+GCPE ++P + E
Sbjct: 338 IDWVKDTTALVYKLLGETPPDGKQFAECVKRILKREEHWNSWKNDGCPEFQKPKPPVQAE 397
Query: 373 DKKDEPDAKKKK------------------TPELTKLWN-SKDNLEACKSAERDFTPSLE 413
+++ + K+++ +LTKLWN DNL AC++ ERDF PSLE
Sbjct: 398 NEEVKRSRKRRRPVGDIIKEYSGTDKFFMGNNDLTKLWNLCPDNLAACRTKERDFMPSLE 457
Query: 414 SYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIEN 473
SY WRALRLL+R+ PHFF++ + + +++++
Sbjct: 458 SYMLSGDGGEGAGGW-------------GWRALRLLARRSPHFFVHTNNPIGRLPDYLDD 504
Query: 474 MVKRCVKEKPSSQISGNGNG---VDQDPAE 500
MVK+ +E ++ +S N NG V+ D A+
Sbjct: 505 MVKKITREVAANNVSNNTNGDSNVNNDKAK 534
>gi|242019012|ref|XP_002429961.1| THO complex subunit, putative [Pediculus humanus corporis]
gi|212515012|gb|EEB17223.1| THO complex subunit, putative [Pediculus humanus corporis]
Length = 597
Score = 479 bits (1232), Expect = e-132, Method: Compositional matrix adjust.
Identities = 256/514 (49%), Positives = 337/514 (65%), Gaps = 42/514 (8%)
Query: 40 IRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLF 99
+R LL L ++++K +E ++S VEL +++C+ ++PI+LLSDTFD+ TLD+CE+LF
Sbjct: 1 MRNCLLNLASNSEVELKSLEEFVSFAVELSRREICSPSIPIVLLSDTFDIFTLDQCEKLF 60
Query: 100 YYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERS 159
YVE NVNIWK+ FF CKNNLLRMCNDLLRRLSRSQNTVF GRILLFLAKFFPFSERS
Sbjct: 61 AYVENNVNIWKEDMFFSGCKNNLLRMCNDLLRRLSRSQNTVFRGRILLFLAKFFPFSERS 120
Query: 160 GLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP 219
GLNI+SEFN++N+ E D + + ET E + EE+ K KID+NFY KF SLQD+ R+P
Sbjct: 121 GLNIVSEFNLDNVVEQSQDTS-SLENFETSEENDEEI-KFKIDYNFYNKFLSLQDFMRSP 178
Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP---------SGDYFAKYLTNQKL 270
QCYNKV W++F + A VLAAF+++KLD Q P S +FAKYLTN KL
Sbjct: 179 NQCYNKVRWRVFVNSATDVLAAFQNFKLDADQYQDKPIHDHISEATSSTFFAKYLTNPKL 238
Query: 271 LDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQT 330
L+LQLSD +FRR +L+QFLI+ QY +S VK + ELK DQ+ W+K+ VY L+ +T
Sbjct: 239 LELQLSDPDFRRSILIQFLIILQYLSSPVKFKLGSYELKPDQQDWIKEQVTLVYQLLAET 298
Query: 331 PPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKT----- 385
P + F+Q VK IL EE WN WKN GCPE K P K + KK+
Sbjct: 299 PLNSHSFAQSVKHILAREELWNNWKNAGCPEFKMPGNVEEKNSTKTKVRGKKRNIGELIK 358
Query: 386 ------------PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQ 432
PELT+LWN + +NLEACK+ ER F P + YFEEAI QMDP AA+E +
Sbjct: 359 EATAENKYYMGHPELTRLWNLNSNNLEACKAKERKFLPDFDEYFEEAIMQMDPEAAIEPE 418
Query: 433 YKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGN 492
YKKV D N+ WRALRLL+++ FF++ V +++E ++K+ V+++P N N
Sbjct: 419 YKKVKDPNFGWRALRLLTQRTSCFFISTNNPVTGLPDYLELILKKIVQDRPQ-----NSN 473
Query: 493 G------VDQDPAEVEVDTKSE--EIQEEEKEED 518
G ++ + EVE KSE E+QE+ EE+
Sbjct: 474 GEIKQEVLNTEAEEVEDLLKSEEKEMQEDGGEEN 507
>gi|41151980|ref|NP_958481.1| THO complex subunit 1 [Danio rerio]
gi|32822787|gb|AAH54938.1| THO complex 1 [Danio rerio]
Length = 655
Score = 469 bits (1208), Expect = e-129, Method: Compositional matrix adjust.
Identities = 245/533 (45%), Positives = 335/533 (62%), Gaps = 57/533 (10%)
Query: 24 YKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILL 83
+ P + +KK+ L+Q++R L + I +++ + I + ++ + +C++T P +LL
Sbjct: 38 HLPGNETEKKATLDQALRGVLEEQIVNQKVNVDDFLSLIYISIDGVTEGICSATTPFLLL 97
Query: 84 SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
D D LD+C+++F +VE NV+ WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCG
Sbjct: 98 GDVLDCLPLDQCDKIFSFVEENVSTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCG 157
Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNET 188
RI LFLA+ FP SE+SGLN+ S+FN++NIT F +E+ MDV E
Sbjct: 158 RIQLFLARLFPLSEKSGLNLQSQFNLDNITVFNKNEQDSTLGQQHTEVKEEGMDVEEGEM 217
Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
+ D + ID+N Y+KFW+LQDYFRNPVQCY+K SW F Y++ LA FKS+KLD
Sbjct: 218 GDEDAPAPSSIPIDYNLYRKFWTLQDYFRNPVQCYDKFSWMTFIKYSDEALAVFKSFKLD 277
Query: 249 DVQSS-------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
D+Q+S SGD YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 278 DMQASKKKLEEMRTSSGDHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 337
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L DQ W++DTT+ VY L+K+ PPDG+ F +V+ IL EE+WN WKNEGC
Sbjct: 338 KFKSSSCVLNDDQSLWIEDTTKLVYQLLKEIPPDGDKFGSMVEHILNTEENWNSWKNEGC 397
Query: 360 PEL--KRPLTS--ITDEDKKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P +RP + I K+ P+ K P ELT+LWN + DN+EACKS
Sbjct: 398 PSFVKERPAETKPIRPSRKRQAPEDFLGKGPDRKILMGNDELTRLWNLNPDNMEACKSEN 457
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R+F PSLE +FEEAI+Q DPA VE++YK V +SNY WRALRLLSR+ PHFF +
Sbjct: 458 REFMPSLEDFFEEAIEQADPANMVEDEYKVVRNSNYGWRALRLLSRRSPHFFQPTNQQFK 517
Query: 466 KNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEED 518
++++ENMV + KE P D SEEI+ E+++D
Sbjct: 518 SLADYLENMVIKLAKELPK-------------------DIPSEEIKTGEEDDD 551
>gi|328721799|ref|XP_001947842.2| PREDICTED: THO complex subunit 1-like [Acyrthosiphon pisum]
Length = 635
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 233/470 (49%), Positives = 327/470 (69%), Gaps = 23/470 (4%)
Query: 31 DKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFD 88
+++ L+Q +R ++L ++ K P++ V + + S +EL +MC + +P+ +LSD F
Sbjct: 41 ERRPILDQVLRCHVLDIVSSKPPNL-YDVCKMWTSFTIELVRNEMCTAIMPVAILSDMFA 99
Query: 89 MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
++T++ CE++F +VE NVN+ K+ FFM+ KNNLLRMCND+LRRLSRSQNTVFCGRILLF
Sbjct: 100 VTTINVCEKIFDHVESNVNVLKEPRFFMASKNNLLRMCNDILRRLSRSQNTVFCGRILLF 159
Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKK 208
LA FFPFSERSGLNIISEFN+EN+TE+ + D + + E+ K+ ID N Y K
Sbjct: 160 LAIFFPFSERSGLNIISEFNLENLTEYLNENHFD-QTEYMPASIAEDEAKLDIDNNLYLK 218
Query: 209 FWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD----YFAKY 264
FWSLQDYFRNP QCY WK+F S+ + ++++F+S+K + V N S + YF KY
Sbjct: 219 FWSLQDYFRNPNQCYETEHWKLFISHTDFIISSFQSHKSEHVNKIDNKSAEQNTYYFPKY 278
Query: 265 LTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVY 324
L NQK+L LQLSD NFRR +L+QFL+LFQY + VK + + ++K +Q++W+K++ T+Y
Sbjct: 279 LPNQKILKLQLSDVNFRRNILIQFLLLFQYLDAPVKFKLDSFKIKPEQQEWIKNSITTIY 338
Query: 325 SLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS-----ITDEDKKDEPD 379
LI TPPDG FS+VVK IL+ EE WN+WKN+GCPE+ + L + +T +++ D
Sbjct: 339 ELINNTPPDGVRFSEVVKSILQREEQWNKWKNDGCPEVTKRLPTSDQSDMTAPKRRNLGD 398
Query: 380 AKKKKT---------PELTKLWNSK-DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAV 429
K T P+LTKLWN K DNLEACK ERDF PS +S+F+EA +Q+D +
Sbjct: 399 VIKNMTANNKVFLGDPDLTKLWNLKPDNLEACKRPERDFLPSFDSFFKEAFEQLDAELCI 458
Query: 430 EEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCV 479
E Q KKVND N+ WRALRLL+RK PHFF N+ +++K E++ENM+K+ +
Sbjct: 459 ESQNKKVNDINFGWRALRLLARKSPHFFANSGDSIQKVPEYLENMIKKTL 508
>gi|432105426|gb|ELK31641.1| THO complex subunit 1 [Myotis davidii]
Length = 657
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/498 (48%), Positives = 318/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHAEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRAVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSDT 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R++ P+LE +FEEAI+Q DP + VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 REYMPTLEEFFEEAIEQADPESMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534
>gi|354490073|ref|XP_003507184.1| PREDICTED: THO complex subunit 1 [Cricetulus griseus]
Length = 655
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 39 PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 94
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV+ WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 95 LGDVLDCLPLDQCDTIFTFVEKNVSTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 154
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+ENIT F +E+ MDV E
Sbjct: 155 GRIQLFLARLFPLSEKSGLNLQSQFNLENITVFNTNEQESTLGQKHAEDREEGMDVEEGE 214
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 215 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 274
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 275 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 334
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 335 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 394
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + S T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 395 PSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCPDNMEACKSET 454
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 455 REYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 514
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 515 SLPEYLENMVIKLAKELP 532
>gi|355724121|gb|AES08116.1| THO complex 1 [Mustela putorius furo]
Length = 656
Score = 465 bits (1197), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/502 (48%), Positives = 317/502 (63%), Gaps = 52/502 (10%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 47 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 102
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 103 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 162
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 163 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 222
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 223 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 282
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 283 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 342
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 343 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 402
Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
P + TS D K A+K+ P ELT+LWN DN+EAC
Sbjct: 403 PSFVKERTS----DTKPTRVARKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEAC 458
Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT 461
KS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 459 KSETREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTN 518
Query: 462 PNVEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 519 QQFKSLPEYLENMVIKLAKELP 540
>gi|296222353|ref|XP_002757154.1| PREDICTED: THO complex subunit 1 [Callithrix jacchus]
Length = 657
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPSKKILMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534
>gi|73961844|ref|XP_547651.2| PREDICTED: THO complex subunit 1 isoform 1 [Canis lupus familiaris]
Length = 657
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/502 (48%), Positives = 317/502 (63%), Gaps = 52/502 (10%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
P + TS D K A+K+ P ELT+LWN DN+EAC
Sbjct: 397 PSFVKERTS----DTKPTRVARKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEAC 452
Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT 461
KS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 453 KSETREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTN 512
Query: 462 PNVEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 513 QQFKSLPEYLENMVIKLAKELP 534
>gi|301774498|ref|XP_002922667.1| PREDICTED: THO complex subunit 1-like [Ailuropoda melanoleuca]
Length = 657
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/502 (48%), Positives = 317/502 (63%), Gaps = 52/502 (10%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
P + TS D K A+K+ P ELT+LWN DN+EAC
Sbjct: 397 PSFVKERTS----DTKPTRVARKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEAC 452
Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT 461
KS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 453 KSETREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTN 512
Query: 462 PNVEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 513 QQFKSLPEYLENMVIKLAKELP 534
>gi|397489339|ref|XP_003815687.1| PREDICTED: THO complex subunit 1 [Pan paniscus]
Length = 657
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPSKKILMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534
>gi|386781620|ref|NP_001247904.1| THO complex subunit 1 [Macaca mulatta]
gi|332225781|ref|XP_003262062.1| PREDICTED: THO complex subunit 1 [Nomascus leucogenys]
gi|402902763|ref|XP_003914266.1| PREDICTED: THO complex subunit 1 [Papio anubis]
gi|90075390|dbj|BAE87375.1| unnamed protein product [Macaca fascicularis]
gi|355701854|gb|EHH29207.1| THO complex subunit 1 [Macaca mulatta]
gi|355754925|gb|EHH58792.1| THO complex subunit 1 [Macaca fascicularis]
gi|380818558|gb|AFE81152.1| THO complex subunit 1 [Macaca mulatta]
gi|383409917|gb|AFH28172.1| THO complex subunit 1 [Macaca mulatta]
gi|384942554|gb|AFI34882.1| THO complex subunit 1 [Macaca mulatta]
Length = 657
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPSKKILMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534
>gi|114672500|ref|XP_512049.2| PREDICTED: THO complex subunit 1 isoform 3 [Pan troglodytes]
gi|410211798|gb|JAA03118.1| THO complex 1 [Pan troglodytes]
gi|410247646|gb|JAA11790.1| THO complex 1 [Pan troglodytes]
gi|410307156|gb|JAA32178.1| THO complex 1 [Pan troglodytes]
gi|410354343|gb|JAA43775.1| THO complex 1 [Pan troglodytes]
Length = 657
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPSKKILMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534
>gi|335310827|ref|XP_003362210.1| PREDICTED: THO complex subunit 1, partial [Sus scrofa]
Length = 654
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 38 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 93
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 94 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 153
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 154 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 213
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 214 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 273
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 274 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 333
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 334 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 393
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 394 PSFVKERTSDTKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSET 453
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 454 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 513
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 514 SLPEYLENMVIKLAKELP 531
>gi|149720837|ref|XP_001491940.1| PREDICTED: THO complex subunit 1 [Equus caballus]
Length = 657
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534
>gi|363730911|ref|XP_419149.3| PREDICTED: THO complex subunit 1 [Gallus gallus]
Length = 656
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 250/539 (46%), Positives = 335/539 (62%), Gaps = 53/539 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILL 83
P + +KK L+Q+ R + + I K P ++ I ISL + + +C ++ P +LL
Sbjct: 31 PGSENEKKCTLDQAFRVIVEEEIINKAPCENLLAI---ISLAINGVTEGICTASTPFVLL 87
Query: 84 SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
D D LD+C+++F +VE NV WK F+ + KN LLRMCNDLLRRLS+SQNTVFCG
Sbjct: 88 GDVLDCLPLDQCDKIFTFVEKNVATWKSNMFYSAGKNYLLRMCNDLLRRLSKSQNTVFCG 147
Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEFGG---------------DEEMDVSSNET 188
RI LFLA+ FP SE+SGLN+ S+FN+EN+T F DE MDV E
Sbjct: 148 RIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEHESTLGQKHSEERDEGMDVEEGEM 207
Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
+ + + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F Y+E VLA FKSYKLD
Sbjct: 208 GDDEAPTSCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKLD 267
Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
D Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY VK
Sbjct: 268 DTQASRKKLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVK 327
Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
+ L +Q W++DTT+ VY L+ + PPDGE FS++V+ IL EE+WN WKNEGCP
Sbjct: 328 FKSSNYVLTDEQSLWIEDTTKAVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCP 387
Query: 361 ----ELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNL 398
E +P + D K A+K+ P ELT+LWN DN+
Sbjct: 388 SFVKERSQPEFQLGPPDSKPMRPARKRPAPEDFLGKGPNKKILMGNEELTRLWNLCPDNM 447
Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFL 458
EACKS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 448 EACKSESREYMPTLEEFFEEAIEQADPENMVENKYKAVNNSNYGWRALRLLARRSPHFFQ 507
Query: 459 NATPNVEKNSEFIENMVKRCVKE--KPSSQI-SGNGNGVDQDPAEVEVDTKSEEIQEEE 514
+ E++ENMV + KE PS +I +G + + A ++ + +S E+Q ++
Sbjct: 508 PTNQQFKSLPEYLENMVIKLAKELPPPSEEIKTGEDEDEEDNDALLKENNESPEVQRDK 566
>gi|329664570|ref|NP_001192409.1| THO complex subunit 1 [Bos taurus]
gi|296473722|tpg|DAA15837.1| TPA: hypothetical protein BOS_22300 [Bos taurus]
Length = 657
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 242/502 (48%), Positives = 316/502 (62%), Gaps = 52/502 (10%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVERNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
P + TS D K +K+ P ELT+LWN DN+EAC
Sbjct: 397 PSFVKERTS----DSKPTRAVRKRAAPEDFLGKGPSKKILMGNDELTRLWNLCPDNMEAC 452
Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT 461
KS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 453 KSETREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTN 512
Query: 462 PNVEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 513 QQFKSLPEYLENMVIKLAKELP 534
>gi|417403684|gb|JAA48640.1| Putative nuclear matrix protein [Desmodus rotundus]
Length = 657
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 242/498 (48%), Positives = 318/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS + K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDSKPTRIVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSDT 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R++ P+LE +FEEAI+Q DP + VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 REYMPTLEEFFEEAIEQADPESMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534
>gi|426254001|ref|XP_004020677.1| PREDICTED: THO complex subunit 1 [Ovis aries]
Length = 647
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 242/502 (48%), Positives = 316/502 (62%), Gaps = 52/502 (10%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 31 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 86
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 87 LGDVLDCLPLDQCDTIFTFVERNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 146
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 147 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 206
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 207 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 266
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 267 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 326
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 327 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 386
Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
P + TS D K +K+ P ELT+LWN DN+EAC
Sbjct: 387 PSFVKERTS----DSKPTRAVRKRAAPEDFLGKGPSKKILMGNDELTRLWNLCPDNMEAC 442
Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT 461
KS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 443 KSETREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTN 502
Query: 462 PNVEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 503 QQFKSLPEYLENMVIKLAKELP 524
>gi|449268552|gb|EMC79415.1| THO complex subunit 1, partial [Columba livia]
Length = 654
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/497 (48%), Positives = 318/497 (63%), Gaps = 42/497 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILL 83
P + +KK L+Q+ R + + I K P ++ I ISL + + +C ++ P +LL
Sbjct: 37 PGSENEKKCTLDQAFRVIVEEEIINKAPCENLLAI---ISLAISGVTEGICTASTPFVLL 93
Query: 84 SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
D D LD+C+++F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCG
Sbjct: 94 GDVLDCLPLDQCDKIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCG 153
Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNET 188
RI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E MDV E
Sbjct: 154 RIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEHESTLGQKHAEEREEGMDVEEGEM 213
Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
+ + + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F Y+E VLA FKSYKLD
Sbjct: 214 GDDEAPTSCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKLD 273
Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
D Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY VK
Sbjct: 274 DTQASRKKLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVK 333
Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
+ L +Q W++DTT+ VY L+ + PPDGE FS++V+ IL EE+WN WKNEGCP
Sbjct: 334 FKSSNYVLTDEQSLWIEDTTKAVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCP 393
Query: 361 EL--KRPLTS--ITDEDKKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAER 406
+RP S + K+ P+ K P ELT+LWN DN+EACKS R
Sbjct: 394 SFVKERPPDSKPMRPMRKRPAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSESR 453
Query: 407 DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEK 466
++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 454 EYMPTLEEFFEEAIEQADPENMVENKYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKS 513
Query: 467 NSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 514 LPEYLENMVIKLAKELP 530
>gi|23956332|ref|NP_705780.1| THO complex subunit 1 [Mus musculus]
gi|48474499|sp|Q8R3N6.1|THOC1_MOUSE RecName: Full=THO complex subunit 1; Short=Tho1; AltName:
Full=Nuclear matrix protein p84
gi|19353206|gb|AAH24951.1| THO complex 1 [Mus musculus]
gi|26327609|dbj|BAC27548.1| unnamed protein product [Mus musculus]
gi|26328027|dbj|BAC27754.1| unnamed protein product [Mus musculus]
Length = 657
Score = 464 bits (1193), Expect = e-128, Method: Compositional matrix adjust.
Identities = 242/498 (48%), Positives = 316/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + S T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534
>gi|291394182|ref|XP_002713492.1| PREDICTED: THO complex 1 [Oryctolagus cuniculus]
Length = 657
Score = 463 bits (1192), Expect = e-128, Method: Compositional matrix adjust.
Identities = 242/502 (48%), Positives = 316/502 (62%), Gaps = 52/502 (10%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
P + TS D K +K+ P ELT+LWN DN+EAC
Sbjct: 397 PSFVKERTS----DAKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEAC 452
Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT 461
KS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 453 KSETREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTN 512
Query: 462 PNVEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 513 QQFKSLPEYLENMVIKLAKELP 534
>gi|154448890|ref|NP_005122.2| THO complex subunit 1 [Homo sapiens]
gi|37999906|sp|Q96FV9.1|THOC1_HUMAN RecName: Full=THO complex subunit 1; Short=Tho1; AltName:
Full=Nuclear matrix protein p84; Short=p84N5; AltName:
Full=hTREX84
gi|14714503|gb|AAH10381.1| THO complex 1 [Homo sapiens]
gi|119622136|gb|EAX01731.1| THO complex 1, isoform CRA_a [Homo sapiens]
gi|123984784|gb|ABM83694.1| THO complex 1 [synthetic construct]
gi|157928504|gb|ABW03548.1| THO complex 1 [synthetic construct]
Length = 657
Score = 463 bits (1192), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/498 (48%), Positives = 316/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R+ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 REHMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534
>gi|403265210|ref|XP_003924839.1| PREDICTED: THO complex subunit 1 [Saimiri boliviensis boliviensis]
Length = 657
Score = 463 bits (1192), Expect = e-128, Method: Compositional matrix adjust.
Identities = 242/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R + + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVVEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534
>gi|60654241|gb|AAX29813.1| THO complex 1 [synthetic construct]
Length = 658
Score = 463 bits (1192), Expect = e-128, Method: Compositional matrix adjust.
Identities = 243/498 (48%), Positives = 316/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R+ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 REHMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534
>gi|344269141|ref|XP_003406413.1| PREDICTED: THO complex subunit 1 [Loxodonta africana]
Length = 657
Score = 463 bits (1191), Expect = e-127, Method: Compositional matrix adjust.
Identities = 242/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLRYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS + K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDSKPPRIVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKVVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534
>gi|348576778|ref|XP_003474163.1| PREDICTED: THO complex subunit 1-like [Cavia porcellus]
Length = 657
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 242/501 (48%), Positives = 315/501 (62%), Gaps = 50/501 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTGGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSSTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHPEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PEL-------KRPLTSITDEDKKDEPDAKKKKTP---------ELTKLWN-SKDNLEACK 402
P +P+ + K+ P+ K P ELT+LWN DN+EACK
Sbjct: 397 PSFVKERASDSKPMRVVR---KRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACK 453
Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
S R + P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 454 SETRQYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTNQ 513
Query: 463 NVEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 514 QFKSLPEYLENMVIKLAKELP 534
>gi|189054520|dbj|BAG37293.1| unnamed protein product [Homo sapiens]
Length = 657
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 242/498 (48%), Positives = 316/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + K+ LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKSYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R+ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 REHMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534
>gi|281350925|gb|EFB26509.1| hypothetical protein PANDA_011654 [Ailuropoda melanoleuca]
Length = 640
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 243/503 (48%), Positives = 317/503 (63%), Gaps = 53/503 (10%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 23 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 78
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 79 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 138
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 139 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 198
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 199 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 258
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 259 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 318
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 319 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 378
Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
P + TS D K A+K+ P ELT+LWN DN+EAC
Sbjct: 379 PSFVKERTS----DTKPTRVARKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEAC 434
Query: 402 KSAER-DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNA 460
KS R ++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 435 KSETRQEYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPT 494
Query: 461 TPNVEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 495 NQQFKSLPEYLENMVIKLAKELP 517
>gi|449494000|ref|XP_004175274.1| PREDICTED: THO complex subunit 1 isoform 2 [Taeniopygia guttata]
Length = 664
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 242/505 (47%), Positives = 315/505 (62%), Gaps = 50/505 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILL 83
P + +KK L+Q R + + I K P ++ I ISL + + +C ++ P +LL
Sbjct: 39 PGSENEKKCTLDQVFRVVVEEEIINKAPCENLLAI---ISLAINGVTEGICTASTPFVLL 95
Query: 84 SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
D D LD+C+++F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCG
Sbjct: 96 GDVLDCLPLDQCDKIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCG 155
Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNET 188
RI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E MDV E
Sbjct: 156 RIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEHESTLGQKHTEEREEGMDVEEGEM 215
Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
+ + + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F Y+E VLA FKSYKLD
Sbjct: 216 GDDEAPTSCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKLD 275
Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
D Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY VK
Sbjct: 276 DTQASRKKLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVK 335
Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
+ L +Q W++DTT+ VY L+ + PPDGE FS++V+ IL EE+WN WKNEGCP
Sbjct: 336 FKSSNYVLTDEQSLWIEDTTKAVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCP 395
Query: 361 ----ELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNL 398
E P + D K +K+ P ELT+LWN DN+
Sbjct: 396 SFVKERCHPEFQLGPPDSKPMRPVRKRPAPEDFLGKGSNKKILMGNEELTRLWNLCPDNM 455
Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFL 458
EACKS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 456 EACKSESREYMPTLEEFFEEAIEQADPENMVENKYKAVNNSNYGWRALRLLARRSPHFFQ 515
Query: 459 NATPNVEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 516 PTNQQFKSLPEYLENMVIKLAKELP 540
>gi|224046136|ref|XP_002194707.1| PREDICTED: THO complex subunit 1 isoform 1 [Taeniopygia guttata]
Length = 656
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 242/503 (48%), Positives = 315/503 (62%), Gaps = 54/503 (10%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILL 83
P + +KK L+Q R + + I K P ++ I ISL + + +C ++ P +LL
Sbjct: 39 PGSENEKKCTLDQVFRVVVEEEIINKAPCENLLAI---ISLAINGVTEGICTASTPFVLL 95
Query: 84 SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
D D LD+C+++F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCG
Sbjct: 96 GDVLDCLPLDQCDKIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCG 155
Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNET 188
RI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E MDV E
Sbjct: 156 RIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEHESTLGQKHTEEREEGMDVEEGEM 215
Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
+ + + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F Y+E VLA FKSYKLD
Sbjct: 216 GDDEAPTSCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKLD 275
Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
D Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY VK
Sbjct: 276 DTQASRKKLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVK 335
Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
+ L +Q W++DTT+ VY L+ + PPDGE FS++V+ IL EE+WN WKNEGCP
Sbjct: 336 FKSSNYVLTDEQSLWIEDTTKAVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCP 395
Query: 361 EL--KRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEA 400
+RP D K +K+ P ELT+LWN DN+EA
Sbjct: 396 SFVKERP------PDSKPMRPVRKRPAPEDFLGKGSNKKILMGNEELTRLWNLCPDNMEA 449
Query: 401 CKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNA 460
CKS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 450 CKSESREYMPTLEEFFEEAIEQADPENMVENKYKAVNNSNYGWRALRLLARRSPHFFQPT 509
Query: 461 TPNVEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 510 NQQFKSLPEYLENMVIKLAKELP 532
>gi|149031712|gb|EDL86662.1| rCG41283, isoform CRA_b [Rattus norvegicus]
Length = 659
Score = 460 bits (1183), Expect = e-126, Method: Compositional matrix adjust.
Identities = 243/500 (48%), Positives = 316/500 (63%), Gaps = 46/500 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+ENIT F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENITVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + S T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCPDNMEACKSET 456
Query: 406 R--DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPN 463
R ++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 457 RQVEYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQ 516
Query: 464 VEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 517 FKSLPEYLENMVIKLAKELP 536
>gi|550058|gb|AAA53571.1| protein p84 [Homo sapiens]
Length = 657
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 240/498 (48%), Positives = 315/498 (63%), Gaps = 44/498 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYAAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILTGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R+ P+LE +FEEAI+Q DP E +YK +N+SNY WRAL+LL+R+ PHFF +
Sbjct: 457 REHMPTLEEFFEEAIEQADPENMAENEYKAMNNSNYGWRALKLLARRSPHFFQPTNQQFK 516
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 517 SLQEYLENMVIKLAKELP 534
>gi|440903496|gb|ELR54147.1| THO complex subunit 1, partial [Bos grunniens mutus]
Length = 661
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 242/503 (48%), Positives = 316/503 (62%), Gaps = 53/503 (10%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 44 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 99
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 100 LGDVLDCLPLDQCDTIFTFVERNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 159
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 160 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 219
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 220 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 279
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 280 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 339
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 340 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 399
Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
P + TS D K +K+ P ELT+LWN DN+EAC
Sbjct: 400 PSFVKERTS----DSKPTRAVRKRAAPEDFLGKGPSKKILMGNDELTRLWNLCPDNMEAC 455
Query: 402 KSAER-DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNA 460
KS R ++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 456 KSETRQEYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPT 515
Query: 461 TPNVEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 516 NQQFKSLPEYLENMVIKLAKELP 538
>gi|148691059|gb|EDL23006.1| THO complex 1, isoform CRA_d [Mus musculus]
Length = 659
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 242/500 (48%), Positives = 316/500 (63%), Gaps = 46/500 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + S T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCPDNMEACKSET 456
Query: 406 R--DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPN 463
R ++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 457 RQVEYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQ 516
Query: 464 VEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 517 FKSLPEYLENMVIKLAKELP 536
>gi|431896300|gb|ELK05716.1| THO complex subunit 1 [Pteropus alecto]
Length = 658
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 242/499 (48%), Positives = 316/499 (63%), Gaps = 45/499 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + S T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERASDTKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 R-DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNV 464
R ++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF
Sbjct: 457 RQEYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQF 516
Query: 465 EKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 517 KSLPEYLENMVIKLAKELP 535
>gi|410977440|ref|XP_003995113.1| PREDICTED: THO complex subunit 1 [Felis catus]
Length = 636
Score = 456 bits (1172), Expect = e-125, Method: Compositional matrix adjust.
Identities = 229/452 (50%), Positives = 295/452 (65%), Gaps = 45/452 (9%)
Query: 73 MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRR 132
+C ++ P +LL D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRR
Sbjct: 66 ICTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRR 125
Query: 133 LSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE------------ 180
LS+SQNTVFCGRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+
Sbjct: 126 LSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDR 185
Query: 181 ---MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
MDV E + + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E
Sbjct: 186 EEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 245
Query: 238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
VLA FKSYKLDD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct: 246 VLAVFKSYKLDDTQASRKKMEELKAGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 305
Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
ILFQY VK + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE
Sbjct: 306 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 365
Query: 350 HWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLW 392
+WN WKNEGCP + TS D K A+K+ P ELT+LW
Sbjct: 366 NWNSWKNEGCPSFVKERTS----DTKPTRVARKRTAPEDFLGKGPNRKILMGNEELTRLW 421
Query: 393 N-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
N DN+EACKS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R
Sbjct: 422 NLCPDNMEACKSETREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLAR 481
Query: 452 KCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
+ PHFF + E++ENMV + KE P
Sbjct: 482 RSPHFFQPTNQQFKSLPEYLENMVIKLAKELP 513
>gi|347971177|ref|XP_309625.4| AGAP004070-PA [Anopheles gambiae str. PEST]
gi|333466622|gb|EAA05359.4| AGAP004070-PA [Anopheles gambiae str. PEST]
Length = 709
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 254/557 (45%), Positives = 331/557 (59%), Gaps = 96/557 (17%)
Query: 3 DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYI 62
+S + F + + V+ Y + + D DKK+AL Q+ R LL T DI IE +I
Sbjct: 17 ESLERAFAANDIEVLKADYDAARAS-DNDKKTALHQAFRDMLL----TKTEDIPAIERFI 71
Query: 63 SLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL 122
S V C KDM +T+P++LL D FD TLD+ EQ+F YVE NV IWK++ FF +CK+NL
Sbjct: 72 SFAVNSCRKDMTVATMPVVLLGDIFDAVTLDRAEQIFTYVENNVAIWKEECFFTACKHNL 131
Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGD-EEM 181
LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFN+ENITE+G D EM
Sbjct: 132 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNLENITEYGMDGNEM 191
Query: 182 DV---SSNETEETDTE--EVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAE 236
V S E D E E +K+KID+N Y KFW+LQD+FRNP QCYNKV WK F ++A+
Sbjct: 192 AVDQLSGGADPEVDGEPAEENKLKIDYNLYCKFWALQDFFRNPNQCYNKVQWKTFETHAK 251
Query: 237 TVLAAFKSYKLDDVQSS--------------------------------LNPSGDYFAKY 264
+VL+AF S+KL++ ++S + SG +FAK+
Sbjct: 252 SVLSAFCSFKLEEHRASSTSSGPATKPSDVGGSATAAAAANNCPMEEDQIRESGHFFAKF 311
Query: 265 LTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVY 324
LTN KLL LQLSD+NFRR VL+QFLILFQY STVK + E L Q W+K+T VY
Sbjct: 312 LTNPKLLSLQLSDSNFRRSVLVQFLILFQYLNSTVKFKAETHVLTQAQSDWLKETETQVY 371
Query: 325 SLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDK---------- 374
LI+++PP+G+ F+ + +L EE WN WKNEGC E KRP ++ D
Sbjct: 372 QLIEESPPNGKKFANTARHMLLREELWNSWKNEGCKEFKRPEAALEDTSSSTAAVAGGAA 431
Query: 375 ---------KDEPDAKKKKTP-------------------ELTKLWN-SKDNLEACKSAE 405
P AK+ + P E+T+LWN DNL+ACK +
Sbjct: 432 AGGAATSAGSGRPPAKRPRKPLGDLIRDATKQGKFYMGNSEITRLWNLCPDNLQACKGTD 491
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R+F PSLE+Y E ++ DP+ + WRALRLL+R+ PHFF
Sbjct: 492 RNFLPSLETYLENPKEKQDPS--------------FEWRALRLLARQSPHFFTLFNSPSY 537
Query: 466 KNSEFIENMVKRCVKEK 482
K ++++E++ K+ K+K
Sbjct: 538 KVADYLESVRKKIQKDK 554
>gi|321464075|gb|EFX75086.1| hypothetical protein DAPPUDRAFT_226474 [Daphnia pulex]
Length = 689
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 249/562 (44%), Positives = 342/562 (60%), Gaps = 79/562 (14%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
DKK+ +Q+ + L + + I + E + L ++ + +C+ TLP+ILL+D FD+
Sbjct: 45 DKKAVWDQAFKDRFLS--DSSNFSIPLFEKLLQLSLDCAVALVCSPTLPVILLTDAFDIL 102
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
+L+KCE++FY VE+ V IWKQ+ FF CKN LLR+CNDLLRRLSRSQNTVFCGRILLFLA
Sbjct: 103 SLEKCEEIFYMVEMKVAIWKQELFFNQCKNTLLRLCNDLLRRLSRSQNTVFCGRILLFLA 162
Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEE---MDVSSNETEETDTEEVDK-VKIDFNFY 206
KFFPFSERSGLN++SEFN++N+T FG +E D+S+N + D EV++ + D+N Y
Sbjct: 163 KFFPFSERSGLNVVSEFNLDNLTTFGAKDESYAADISANSEKHKDAMEVEESLSSDYNLY 222
Query: 207 KKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-----------DV----- 250
KKFWSLQD+FRNP QCY K+ WK F + VL+AF + KLD DV
Sbjct: 223 KKFWSLQDFFRNPNQCYTKIPWKTFCLHTSDVLSAFSTIKLDERRNRSKWDEADVALNID 282
Query: 251 --QSSLNPSG--DYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGL 306
++S+N YFAKYLT+Q LL+LQL D+NFRRY+LLQFLI+FQY S V E
Sbjct: 283 MNETSMNGGNVQQYFAKYLTSQNLLELQLGDSNFRRYILLQFLIMFQYLDSPV---NETH 339
Query: 307 ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP- 365
+L +Q WV + VY L+K TPPDG F Q ++ ILK EE W+ WKN+GCPE RP
Sbjct: 340 KLNEEQTTWVSEAIVQVYRLLKDTPPDGASFVQSIRHILKREELWSNWKNDGCPEF-RPS 398
Query: 366 -LTSITDEDK-KDEPDAKKK-----------------------------KTPELTKLWN- 393
++S TD D KD + +KK E+ +LWN
Sbjct: 399 IISSTTDNDTIKDGTNGEKKTIVVGRPRRLKRSLGDQIREASKRNKCIIGNAEMNRLWNL 458
Query: 394 SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKC 453
DN+ AC+S +RDF P+LE+YFEEAI+Q+DP +VE QYK+V+++N+ WRALRLL+R+
Sbjct: 459 CPDNMAACRSKDRDFLPTLENYFEEAIEQLDP--SVESQYKRVSEANFGWRALRLLARRS 516
Query: 454 PHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEE 513
HFF + + + +++E V++ K D A + K E + +
Sbjct: 517 LHFFTHGNTPITRLPDYLETHVRKLAK--------------DMQSANALPNVKPEPVDDA 562
Query: 514 EKEEDWEAKADPEGDADEVMSV 535
+ E E +ADP + E+ V
Sbjct: 563 GEVEQMEPEADPVSNNSEIEGV 584
>gi|334325879|ref|XP_001367958.2| PREDICTED: THO complex subunit 1-like [Monodelphis domestica]
Length = 810
Score = 454 bits (1167), Expect = e-125, Method: Compositional matrix adjust.
Identities = 239/490 (48%), Positives = 312/490 (63%), Gaps = 38/490 (7%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+KK L+Q+ R L + I T + + ISL + + +C ++ P +LL D D
Sbjct: 198 EKKCTLDQAFRVILEEEIIT-RASCENVLAIISLAIGGVTEGICTASTPFVLLGDVLDCL 256
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI LFLA
Sbjct: 257 PLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLA 316
Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEETDTEEVD------------ 197
+ FP SE+SGLN+ S+FN+EN+T F +E E + E+ D
Sbjct: 317 RLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHIEDRDEGMEVEEGEMGDDEAPT 376
Query: 198 --KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS-- 253
+ ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKLDD Q+S
Sbjct: 377 SCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRK 436
Query: 254 ----LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLE 307
L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY VK +
Sbjct: 437 KLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYV 496
Query: 308 LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL--KRP 365
L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGCP +RP
Sbjct: 497 LTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERP 556
Query: 366 LTSITDE--DKKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAERDFTPSLE 413
S K+ P+ K P ELT+LWN DN+EACKS R++ P+LE
Sbjct: 557 SDSKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSESREYMPTLE 616
Query: 414 SYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIEN 473
+FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF + E++EN
Sbjct: 617 EFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLEN 676
Query: 474 MVKRCVKEKP 483
MV + KE P
Sbjct: 677 MVIKLAKELP 686
>gi|395511634|ref|XP_003760061.1| PREDICTED: THO complex subunit 1 [Sarcophilus harrisii]
Length = 658
Score = 453 bits (1166), Expect = e-125, Method: Compositional matrix adjust.
Identities = 239/490 (48%), Positives = 312/490 (63%), Gaps = 38/490 (7%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+KK L+Q+ R L + I T + + ISL + + +C ++ P +LL D D
Sbjct: 46 EKKCTLDQAFRVILEEEIIT-RASCENVLAIISLAIGGVTEGICTASTPFVLLGDVLDCL 104
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI LFLA
Sbjct: 105 PLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLA 164
Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEETDTEEVD------------ 197
+ FP SE+SGLN+ S+FN+EN+T F +E E + E+ D
Sbjct: 165 RLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHIEDRDEGMEVEEGEMGDDEAPT 224
Query: 198 --KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS-- 253
+ ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKLDD Q+S
Sbjct: 225 SCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRK 284
Query: 254 ----LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLE 307
L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY VK +
Sbjct: 285 KLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYV 344
Query: 308 LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL--KRP 365
L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGCP +RP
Sbjct: 345 LTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERP 404
Query: 366 LTSITDE--DKKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAERDFTPSLE 413
S K+ P+ K P ELT+LWN DN+EACKS R++ P+LE
Sbjct: 405 SDSKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSESREYMPTLE 464
Query: 414 SYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIEN 473
+FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF + E++EN
Sbjct: 465 EFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLEN 524
Query: 474 MVKRCVKEKP 483
MV + KE P
Sbjct: 525 MVIKLAKELP 534
>gi|327269845|ref|XP_003219703.1| PREDICTED: THO complex subunit 1-like [Anolis carolinensis]
Length = 674
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 241/534 (45%), Positives = 331/534 (61%), Gaps = 51/534 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILL 83
P + +KK L+Q+ R + + I K P ++ I ISL + + +C ++ P +LL
Sbjct: 57 PGSENEKKFTLDQAFRVAVEEEIINKAPCENVLAI---ISLVINGVTEGICTASTPFVLL 113
Query: 84 SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
D D LD+C+++F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCG
Sbjct: 114 GDVLDCLPLDQCDKIFTFVEKNVTTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCG 173
Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEFGG-DEEMDVSSNETEET----------- 191
RI LFLA+ FP SE+SGLN+ S+FN+ENIT F ++E + E+
Sbjct: 174 RIQLFLARLFPLSEKSGLNLQSQFNLENITVFNTIEQESTLGQKHVEDREEGMEVEEGEM 233
Query: 192 ---DTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
+ + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F Y+E VLA FKSYKLD
Sbjct: 234 GDDEAPTSSSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKLD 293
Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
D Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY VK
Sbjct: 294 DTQASRKKLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVK 353
Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
+ L +Q W++DTT+ VY L+ + PPDGE F+++V+ IL EE+WN WKNEGCP
Sbjct: 354 FKSSNYVLTDEQSLWIEDTTKLVYQLLSENPPDGERFAKMVEHILNTEENWNSWKNEGCP 413
Query: 361 ELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACK 402
+ E K P +K+ P ELT+LWN DN+EACK
Sbjct: 414 SF---VKERAGESKAIRPFVRKRPPPEDFLGKGPSKKILMGNEELTRLWNLCPDNMEACK 470
Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
S R++ P+LE +FEEAI+Q DP V++++K VN+SNY WRALRLL+R+ PHFF
Sbjct: 471 SESREYMPTLEEFFEEAIEQADPENMVDKEFKVVNNSNYGWRALRLLARRSPHFFQPTNQ 530
Query: 463 NVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEE 514
+ E++ENMV + KE PS +I + + A ++ + +S ++Q ++
Sbjct: 531 QFKSLPEYLENMVIKLAKELPPPSEEIKTGEEDEEDNDALLKENNESPDVQRDK 584
>gi|348500685|ref|XP_003437903.1| PREDICTED: THO complex subunit 1 [Oreochromis niloticus]
Length = 654
Score = 450 bits (1158), Expect = e-124, Method: Compositional matrix adjust.
Identities = 233/499 (46%), Positives = 318/499 (63%), Gaps = 44/499 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
P + +KK+ L+Q++R L LI + I L ++ + +C++T P +LL D
Sbjct: 39 PGNEAEKKTTLDQALRGVLGDLIVEQKLRCDDYLALIYLSIDGVTEGICSATTPFVLLGD 98
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D LD C+++F +VE NV+ WK +F+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 99 VLDCLPLDLCDKIFSFVEENVSTWKSNSFYTAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 158
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEET------------- 191
LFLA+ FP SE+SGLN+ S+FN++NIT F +E E + TEE
Sbjct: 159 QLFLARLFPLSEKSGLNLQSQFNLDNITVFNKNEQESTLGQKHTEEKEDGMEVEEGEMGE 218
Query: 192 -DTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
D + ID+N Y+KFW+LQDYFRNPVQCY+K SW F Y++ LA FKSYKLDD+
Sbjct: 219 DDGPAPSSIPIDYNLYRKFWTLQDYFRNPVQCYDKFSWMTFLKYSDETLAVFKSYKLDDM 278
Query: 251 QSSLN-------PSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
Q+S SG+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY VK
Sbjct: 279 QASKRKLEELRASSGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKF 338
Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
+ L DQ W+++TT+ VY L+++ PPDG+ F+ +V+ IL EE+WN WKNEGCP
Sbjct: 339 KSSSCVLNDDQAAWIEETTKLVYQLLREIPPDGDKFAAMVEHILNTEENWNAWKNEGCPS 398
Query: 362 LKRPLTSITDEDKKDEPDAKKK-------KTP---------ELTKLWN-SKDNLEACKSA 404
+ T +DK P K++ K P ELT+LWN + DN+EACKS
Sbjct: 399 FVKERTV---DDKPKRPTRKRQAPEDFLGKGPDRKIFMGNDELTRLWNLNHDNMEACKSD 455
Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNV 464
R+F PSL+ +F EAI+Q DPA VEE+YK V ++NY WRALRLLSR+ PHFF
Sbjct: 456 SREFMPSLDEFFAEAIEQADPANMVEEEYKVVRNANYGWRALRLLSRRSPHFFQPTNQKF 515
Query: 465 EKNSEFIENMVKRCVKEKP 483
+ ++++++MV + KE P
Sbjct: 516 KSLADYLDSMVSKLAKELP 534
>gi|410923899|ref|XP_003975419.1| PREDICTED: THO complex subunit 1-like [Takifugu rubripes]
Length = 654
Score = 449 bits (1156), Expect = e-123, Method: Compositional matrix adjust.
Identities = 230/500 (46%), Positives = 318/500 (63%), Gaps = 46/500 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
P + +KK+ L+Q++R L I + I L ++ + +C++T P +LL D
Sbjct: 39 PGSETEKKTTLDQALRGVLGDQIVEQKSSCDDYLSLIYLSIDSVTEGICSATTPFVLLGD 98
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D LD+C+++F +VE NV+ WK +F+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 99 VLDCLPLDQCDKIFSFVEENVSTWKSNSFYTAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 158
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEE-------------- 190
LFLA+ FP SE+SGLN+ S+FN++NIT F +E E + TEE
Sbjct: 159 QLFLARLFPLSEKSGLNLQSQFNLDNITVFNKNEQESTLGQKNTEEKEDGMEVEEGEMGE 218
Query: 191 TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
D + ID+N Y+KFW+LQDYFRNPVQCY+K SW F Y++ LA FKSYKLDD+
Sbjct: 219 EDAPAPCSIPIDYNLYRKFWTLQDYFRNPVQCYDKFSWMTFVKYSDETLAVFKSYKLDDM 278
Query: 251 QSS------LNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
Q+S L SG YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY VK
Sbjct: 279 QASKRKLEDLRASGGDHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKF 338
Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
+ G L DQ W+++T++ Y L+++ PPDG+ F+ +V+ IL EE+WN WKNEGCP
Sbjct: 339 KSSGSVLNDDQAAWIEETSKLAYQLLREIPPDGDKFATMVEHILGTEENWNSWKNEGCPS 398
Query: 362 LKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACKS 403
+ T +DK P A+K++ P ELT+LWN + DN++ACKS
Sbjct: 399 FVKERTV---DDKPKRP-ARKRQAPEDFLGKGPDRKIFMGNDELTRLWNLNHDNMDACKS 454
Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPN 463
R+F PSL+ +F EAI+Q DPA VE++YK V + NY WRALRLLSR+ PHFF
Sbjct: 455 ESREFVPSLDDFFSEAIEQADPANMVEDEYKVVRNPNYGWRALRLLSRRSPHFFQPTNQK 514
Query: 464 VEKNSEFIENMVKRCVKEKP 483
+ ++++++MV + KE P
Sbjct: 515 FKSLADYLDSMVSKLAKELP 534
>gi|157167759|ref|XP_001655616.1| nuclear matrix protein [Aedes aegypti]
gi|108882031|gb|EAT46256.1| AAEL002563-PA [Aedes aegypti]
Length = 685
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 253/582 (43%), Positives = 347/582 (59%), Gaps = 95/582 (16%)
Query: 3 DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYI 62
D+ + ++++ + +Y K D DKK+AL Q+ R LL T DI IE +I
Sbjct: 15 DTLNRAYKVNEIDQLKAEYDGQKGPTDNDKKAALHQAFRNLLL----TKTDDIPAIEGFI 70
Query: 63 SLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL 122
+ V C +M ++T+P++LL D FD TLDKCEQ+F YVE NV WK++ FF +CK+NL
Sbjct: 71 NFAVGACRLEMTSATIPVVLLGDIFDAVTLDKCEQIFTYVENNVATWKEELFFTACKHNL 130
Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGD-EEM 181
LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFN+EN+TE+G + EM
Sbjct: 131 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNLENVTEYGVEGSEM 190
Query: 182 DVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAA 241
+ EE + +++KID+N Y KFW+LQD+FRNP QCYNKV KMF ++A +VL+A
Sbjct: 191 GDQLDGFEENGS--ANQLKIDYNLYCKFWALQDFFRNPNQCYNKVQCKMFAAHAGSVLSA 248
Query: 242 FKSYKLDD-----------------------VQSSLNPSGDYFAKYLTNQKLLDLQLSDT 278
F S+KL++ + + SG +FAK+LTN KLL LQLSD+
Sbjct: 249 FSSFKLEEPRSASSSSSASANKYGTTPMDIPPEEQIRDSGHFFAKFLTNPKLLSLQLSDS 308
Query: 279 NFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFS 338
NFRR VL+Q LILFQY STVK + + L Q W+++T TVY LI+++PP+G+ FS
Sbjct: 309 NFRRSVLVQLLILFQYLNSTVKFKADNHLLTQVQIDWLRETETTVYKLIEESPPNGKKFS 368
Query: 339 QVVKLILKGEEHWNQWKNEGCPELKRP---------LTSITDEDKKDE---PDAKKKK-- 384
+ V+ +L EE WN WKNEGC E KRP T+++ ED P + +K
Sbjct: 369 ESVRHMLSREELWNSWKNEGCKEFKRPDAAAAAATTTTNVSGEDPSPPVRLPQKRPRKPL 428
Query: 385 ----------------TPELTKLWNS-KDNLEACKSAERDFTPSLESYFEEAIQQMDPAA 427
PELT+LWN+ DNL+ACK +R+F PSLE+Y + + ++ DP+
Sbjct: 429 GDLIRDSTKQGKFYMGNPELTRLWNTCPDNLQACKGEDRNFLPSLEAYLDGSKEKQDPS- 487
Query: 428 AVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQI 487
+ WRALRLL+R+ PHFF K ++++E++ K+ K+K
Sbjct: 488 -------------FEWRALRLLARQSPHFFTLFNSPSYKVADYLESVRKKIQKDK----- 529
Query: 488 SGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEGDA 529
VD K E IQE+ ++ + + EG+
Sbjct: 530 ---------------VDIKQEVIQEDVPSQNENEQNEAEGEG 556
>gi|52139086|gb|AAH82664.1| LOC398505 protein, partial [Xenopus laevis]
Length = 659
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 233/500 (46%), Positives = 310/500 (62%), Gaps = 49/500 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
P + +KK AL+Q R L + I + ISL + +C +T P +LL D
Sbjct: 44 PGSENEKKLALDQVFRVVLEEEI-INQVSCNNCLAIISLAINGVTDGICTATTPFVLLGD 102
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D L C+++F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 103 VLDCLPLAYCDKIFTFVEKNVGTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 162
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE--------------MDVSSNETEET 191
LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ M++ E +
Sbjct: 163 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNANEQESTLGQKVEEKDDGMEIEEGEMGDD 222
Query: 192 DTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ 251
D + ID+N Y+KFWS+QDYFRNPVQCY+K SWK F +++ VL FKSYKLDD Q
Sbjct: 223 DAPNSSSIPIDYNLYRKFWSIQDYFRNPVQCYDKNSWKTFLKFSDEVLTVFKSYKLDDTQ 282
Query: 252 SS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRG 303
+S L G+ YFAK+LT++KL+DLQLSD+NFRR++L+Q+LILFQY VK +
Sbjct: 283 ASRKKIEELKSGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILVQYLILFQYLKGQVKFKS 342
Query: 304 EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL- 362
L +Q W++DTT+ V+ L+ + PPDGE FS+ V+ IL EE+WN WKNEGCP
Sbjct: 343 SNYILLDEQSLWIEDTTKLVHQLLSEIPPDGEKFSKTVEHILNTEENWNAWKNEGCPSFV 402
Query: 363 -KRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACKS 403
+RP + K + A+K+ P ELT+LWN DN+EAC+S
Sbjct: 403 KERPT------ETKPQKMARKRPVPEDFLGKGSNKKMSLGNDELTRLWNLCPDNMEACRS 456
Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPN 463
R+F PSLE +FEEAI+Q DP VE +YK VN++NY WRALRLL+R+ PHFF
Sbjct: 457 ESREFMPSLEQFFEEAIEQADPENMVENEYKVVNNANYGWRALRLLARRSPHFFQPTNQQ 516
Query: 464 VEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 517 FKSLPEYLENMVIKLAKELP 536
>gi|195343819|ref|XP_002038488.1| GM10588 [Drosophila sechellia]
gi|194133509|gb|EDW55025.1| GM10588 [Drosophila sechellia]
Length = 701
Score = 447 bits (1150), Expect = e-123, Method: Compositional matrix adjust.
Identities = 243/573 (42%), Positives = 350/573 (61%), Gaps = 71/573 (12%)
Query: 4 SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
+ +L ++ ++ +Y + ++DK+ ++ + R L+K + D D+ I +
Sbjct: 28 ALELAITDGKVELLVQEYNRFPANTEHDKRLPMDHAFRVLLMKRL---DEDVSRIGELVR 84
Query: 64 LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
L VE ++ ++T+P++LL DTFD+ TLDKC+++F +VE V +WK++ FF SCKNN+L
Sbjct: 85 LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCQKIFQFVEEMVEVWKEEIFFASCKNNIL 144
Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
RMCNDLLRRLSR+QNTVFCGRI LFL+KFFPFSERSGLNI+SEFN++N TE+G D +
Sbjct: 145 RMCNDLLRRLSRTQNTVFCGRIQLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHD 204
Query: 184 SSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
S+ E DT E +KID++ Y KFWSLQD+FRNP QCYNK WKMF +AE +L +F
Sbjct: 205 ESDNKELEDTAEDIPIKIDYDLYCKFWSLQDFFRNPNQCYNKPQWKMFQMHAENILQSFS 264
Query: 244 SYKLDDVQSSL--NPSGD------------------------YFAKYLTNQKLLDLQLSD 277
S+KL+DV+ S N SG +FAK+LTN KLL LQLSD
Sbjct: 265 SFKLEDVRHSSNDNASGSDQAMDVDEEAVDATLVTSVIKENHFFAKFLTNPKLLALQLSD 324
Query: 278 TNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
NFRR VL+QFLILFQY +VK + + L +DQ ++K+T VY L+++TPP G+ F
Sbjct: 325 ANFRRAVLVQFLILFQYLQVSVKFKSDTQTLTADQADFIKETESRVYKLLEETPPYGKRF 384
Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRP-LTSITDEDKKDEPDAKKKK------------ 384
S+ V +L EE WN WKNEGC E K+P + ++ED K P+ + ++
Sbjct: 385 SRTVCHMLAREEMWNNWKNEGCKEFKKPEEPTPSEEDSKPTPNKRPRRPLGDALRDASRS 444
Query: 385 ------TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
LT+LWN S DNL+ACKS +R+F P LE+Y E +++DPA
Sbjct: 445 GKFYLGNDNLTRLWNYSPDNLQACKSEQRNFLPLLETYLETPHEKVDPA----------- 493
Query: 438 DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE---KPSSQISGNGNGV 494
+ WRALRLL+R+ PHFF + + K S+++E + KR +++ KP++ +S N +
Sbjct: 494 ---FEWRALRLLARQTPHFFTSLSQPSSKISDYLEQVRKRLIRDKEPKPAALLSNNSSEN 550
Query: 495 DQDPAEVEVDTKSEE-----IQEEEKEEDWEAK 522
+ V T+ E+ +Q+ E E+D E +
Sbjct: 551 SFGLSANNVHTEGEQDATVALQDGEPEQDLEVE 583
>gi|147905682|ref|NP_001082490.1| THO complex 1 protein [Xenopus laevis]
gi|72679742|gb|AAI00212.1| LOC398505 protein [Xenopus laevis]
Length = 654
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 233/500 (46%), Positives = 310/500 (62%), Gaps = 49/500 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
P + +KK AL+Q R L + I + ISL + +C +T P +LL D
Sbjct: 39 PGSENEKKLALDQVFRVVLEEEIIN-QVSCNNCLAIISLAINGVTDGICTATTPFVLLGD 97
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D L C+++F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 98 VLDCLPLAYCDKIFTFVEKNVGTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 157
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE--------------MDVSSNETEET 191
LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ M++ E +
Sbjct: 158 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNANEQESTLGQKVEEKDDGMEIEEGEMGDD 217
Query: 192 DTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ 251
D + ID+N Y+KFWS+QDYFRNPVQCY+K SWK F +++ VL FKSYKLDD Q
Sbjct: 218 DAPNSSSIPIDYNLYRKFWSIQDYFRNPVQCYDKNSWKTFLKFSDEVLTVFKSYKLDDTQ 277
Query: 252 SS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRG 303
+S L G+ YFAK+LT++KL+DLQLSD+NFRR++L+Q+LILFQY VK +
Sbjct: 278 ASRKKIEELKSGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILVQYLILFQYLKGQVKFKS 337
Query: 304 EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL- 362
L +Q W++DTT+ V+ L+ + PPDGE FS+ V+ IL EE+WN WKNEGCP
Sbjct: 338 SNYILLDEQSLWIEDTTKLVHQLLSEIPPDGEKFSKTVEHILNTEENWNAWKNEGCPSFV 397
Query: 363 -KRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACKS 403
+RP + K + A+K+ P ELT+LWN DN+EAC+S
Sbjct: 398 KERPT------ETKPQKMARKRPVPEDFLGKGSNKKMSLGNDELTRLWNLCPDNMEACRS 451
Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPN 463
R+F PSLE +FEEAI+Q DP VE +YK VN++NY WRALRLL+R+ PHFF
Sbjct: 452 ESREFMPSLEQFFEEAIEQADPENMVENEYKVVNNANYGWRALRLLARRSPHFFQPTNQQ 511
Query: 464 VEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 512 FKSLPEYLENMVIKLAKELP 531
>gi|28278733|gb|AAH44699.1| LOC398505 protein, partial [Xenopus laevis]
Length = 659
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 233/501 (46%), Positives = 310/501 (61%), Gaps = 50/501 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
P + +KK AL+Q R L + I + ISL + +C +T P +LL D
Sbjct: 43 PGSENEKKLALDQVFRVVLEEEI-INQVSCNNCLAIISLAINGVTDGICTATTPFVLLGD 101
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D L C+++F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 102 VLDCLPLAYCDKIFTFVEKNVGTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 161
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNETEE 190
LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ M++ E +
Sbjct: 162 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNANEQESTLGQKQVEEKDDGMEIEEGEMGD 221
Query: 191 TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
D + ID+N Y+KFWS+QDYFRNPVQCY+K SWK F +++ VL FKSYKLDD
Sbjct: 222 DDAPNSSSIPIDYNLYRKFWSIQDYFRNPVQCYDKNSWKTFLKFSDEVLTVFKSYKLDDT 281
Query: 251 QSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSR 302
Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++L+Q+LILFQY VK +
Sbjct: 282 QASRKKIEELKSGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILVQYLILFQYLKGQVKFK 341
Query: 303 GEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL 362
L +Q W++DTT+ V+ L+ + PPDGE FS+ V+ IL EE+WN WKNEGCP
Sbjct: 342 SSNYILLDEQSLWIEDTTKLVHQLLSEIPPDGEKFSKTVEHILNTEENWNAWKNEGCPSF 401
Query: 363 --KRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACK 402
+RP + K + A+K+ P ELT+LWN DN+EAC+
Sbjct: 402 VKERPT------ETKPQKMARKRPVPEDFLGKGSNKKMSLGNDELTRLWNLCPDNMEACR 455
Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
S R+F PSLE +FEEAI+Q DP VE +YK VN++NY WRALRLL+R+ PHFF
Sbjct: 456 SESREFMPSLEQFFEEAIEQADPENMVENEYKVVNNANYGWRALRLLARRSPHFFQPTNQ 515
Query: 463 NVEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 516 QFKSLPEYLENMVIKLAKELP 536
>gi|24644436|ref|NP_649594.1| Hpr1 [Drosophila melanogaster]
gi|7296666|gb|AAF51946.1| Hpr1 [Drosophila melanogaster]
gi|21483450|gb|AAM52700.1| LD43883p [Drosophila melanogaster]
gi|46800533|emb|CAD89220.1| Hpr1 protein [Drosophila melanogaster]
Length = 701
Score = 446 bits (1148), Expect = e-122, Method: Compositional matrix adjust.
Identities = 242/574 (42%), Positives = 350/574 (60%), Gaps = 73/574 (12%)
Query: 4 SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
+ +L ++ ++ +Y + ++DK+ ++ + R L+K + D D+ I +
Sbjct: 28 ALELAITDGKVELLVKEYNRFPANTEHDKRLPMDHAFRVLLMKRL---DEDVSRIGELVR 84
Query: 64 LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
L VE ++ ++T+P++LL DTFD+ TLDKC+++F +VE V +WK++ FF SCKNN+L
Sbjct: 85 LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCQKIFQFVEDMVEVWKEEIFFSSCKNNIL 144
Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
RMCNDLLRRLSR+QNTVFCGRI LFL+KFFPFSERSGLNI+SEFN++N TE+G D +
Sbjct: 145 RMCNDLLRRLSRTQNTVFCGRIQLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHD 204
Query: 184 SSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
S+ E DT E +KID++ Y KFWSLQD+FRNP QCYNK WKMF +A+ +L +F
Sbjct: 205 ESDNKELEDTAEDIPLKIDYDLYCKFWSLQDFFRNPNQCYNKPQWKMFQMHADNILQSFS 264
Query: 244 SYKLDDVQSSLN--------------------------PSGDYFAKYLTNQKLLDLQLSD 277
S+KL+DV+ S N + +FAK+LTN KLL LQLSD
Sbjct: 265 SFKLEDVRQSSNENASGTDQAMDVDEEAVDTTAVSSVIKANHFFAKFLTNPKLLALQLSD 324
Query: 278 TNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
NFRR VL+QFLILFQY +VK + + L +DQ ++K+T VY L+++TPP G+ F
Sbjct: 325 ANFRRAVLVQFLILFQYLQVSVKFKSDTQTLTADQADFIKETESRVYKLLEETPPYGKRF 384
Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRP-LTSITDEDKKDEPDAKKKK------------ 384
S+ V +L EE WN WKNEGC E K+P ++++ED K P+ + ++
Sbjct: 385 SRTVYHMLAREEMWNNWKNEGCKEFKKPEEPTLSEEDSKPTPNKRPRRPLGDALRDASRS 444
Query: 385 ------TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
LT+LWN S DNL+ACKS +R+F P LE+Y E +++DPA
Sbjct: 445 GKFYLGNDNLTRLWNYSPDNLQACKSEQRNFLPLLETYLETPHEKVDPA----------- 493
Query: 438 DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE---KPSSQISGN---- 490
+ WRALRLL+R+ PHFF + + K S+++E + KR +++ KP++ +S N
Sbjct: 494 ---FEWRALRLLARQTPHFFTSLSQPSSKISDYLEQVRKRLIRDKEPKPAALLSNNSSEN 550
Query: 491 --GNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAK 522
G V+ E E D + +Q+ E E+D E +
Sbjct: 551 NFGLSVNNVHTEGEQDA-TVALQDGEPEQDLEVE 583
>gi|195568581|ref|XP_002102292.1| GD19579 [Drosophila simulans]
gi|194198219|gb|EDX11795.1| GD19579 [Drosophila simulans]
Length = 701
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 243/573 (42%), Positives = 349/573 (60%), Gaps = 71/573 (12%)
Query: 4 SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
+ +L ++ ++ +Y + ++DK+ ++ + R L+K + D D+ I +
Sbjct: 28 ALELAITDGKVELLVQEYNRFPANTEHDKRLPMDHAFRVLLMKRL---DEDVSRIGELVR 84
Query: 64 LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
L VE ++ ++T+P++LL DTFD+ TLDKC+++F +VE V +WK++ FF SCKNN+L
Sbjct: 85 LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCQKIFQFVEEMVEVWKEEIFFASCKNNIL 144
Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
RMCNDLLRRLSR+QNTVFCGRI LFL+KFFPFSERSGLNI+SEFN++N TE+G D +
Sbjct: 145 RMCNDLLRRLSRTQNTVFCGRIQLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHD 204
Query: 184 SSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
S+ E DT E +KID++ Y KFWSLQD+FRNP QCYNK WKMF +AE +L +F
Sbjct: 205 ESDNKELEDTAEDIPLKIDYDLYCKFWSLQDFFRNPNQCYNKPQWKMFQMHAENILQSFS 264
Query: 244 SYKLDDVQSSL--NPSGD------------------------YFAKYLTNQKLLDLQLSD 277
S+KL+DV+ S N SG +FAK+LTN KLL LQLSD
Sbjct: 265 SFKLEDVRHSSNDNASGSDQAMDVDEEAVDAAAVTSVIKANHFFAKFLTNPKLLALQLSD 324
Query: 278 TNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
NFRR VL+QFLILFQY +VK + + L DQ ++K+T VY L+++TPP G+ F
Sbjct: 325 ANFRRAVLVQFLILFQYLQVSVKFKSDTQTLTGDQADFIKETESRVYKLLEETPPYGKRF 384
Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRP-LTSITDEDKKDEPDAKKKK------------ 384
S+ V +L EE WN WKNEGC E K+P + ++ED K P+ + ++
Sbjct: 385 SRTVCHMLAREEMWNNWKNEGCKEFKKPEEPTPSEEDSKPTPNKRPRRPLGDALRDASRS 444
Query: 385 ------TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
LT+LWN S DNL+ACKS +R+F P LE+Y E +++DPA
Sbjct: 445 GKFYLGNDNLTRLWNYSPDNLQACKSEQRNFLPLLETYLETPHEKVDPA----------- 493
Query: 438 DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE---KPSSQISGNGNGV 494
+ WRALRLL+R+ PHFF + + K S+++E + KR +++ KP++ +S N +
Sbjct: 494 ---FEWRALRLLARQTPHFFTSLSQPSSKISDYLEQVRKRLIRDKEPKPAALLSNNSSEN 550
Query: 495 DQDPAEVEVDTKSEE-----IQEEEKEEDWEAK 522
+ V T+ E+ +Q+ E E+D E +
Sbjct: 551 SFGLSANNVHTEGEQDATVALQDGEPEQDLEVE 583
>gi|170035468|ref|XP_001845591.1| THO complex subunit 1 [Culex quinquefasciatus]
gi|167877503|gb|EDS40886.1| THO complex subunit 1 [Culex quinquefasciatus]
Length = 687
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 248/543 (45%), Positives = 328/543 (60%), Gaps = 93/543 (17%)
Query: 14 LNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDM 73
+ ++ +Y + D DK+SAL Q+ R LL KT DI IE +IS V C +M
Sbjct: 16 IELLKKQYDELTSSTDIDKRSALHQAFRDLLLA--KTDDI--PAIETFISFAVAACRSEM 71
Query: 74 CNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRL 133
T+P++LL D FD TLDKCE +F YVE NV WK+ FF +CK+NLLRMCNDLLRRL
Sbjct: 72 TAPTIPVVLLGDIFDAVTLDKCEAIFTYVENNVTTWKEDFFFTACKHNLLRMCNDLLRRL 131
Query: 134 SRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG--GDEEMDVSSNETEET 191
SRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFN+ENITE+G G E D S E+
Sbjct: 132 SRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNLENITEYGVEGSEMGDQLSEGQEDG 191
Query: 192 D-------TEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKS 244
D T++ ++KID+N Y KFW+LQD+FRNP QCYNKV WKMF ++A +V+AAF S
Sbjct: 192 DGVQASAGTQQQPQLKIDYNLYCKFWALQDFFRNPNQCYNKVQWKMFAAHAGSVMAAFSS 251
Query: 245 YKLDDVQSS---------------------LNPSGDYFAKYLTNQKLLDLQLSDTNFRRY 283
+KL++ +S+ + + +FAK+LTN KLL LQLSD+NFRR
Sbjct: 252 FKLEEPRSNSKSASKNKSSPTPMDIPPEEEVRETAHFFAKFLTNPKLLSLQLSDSNFRRS 311
Query: 284 VLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKL 343
VL+Q LILFQY STVK + + L Q W+K+T TVY LI+++PP+G F++ V+
Sbjct: 312 VLVQLLILFQYLNSTVKFKSDSHCLTVAQVDWLKETETTVYKLIEESPPNGRRFAESVRH 371
Query: 344 ILKGEEHWNQWKNEGCPELKRP---------------LTSITDEDKKDEPDAKKKKT--- 385
+L EE WN WKNEGC E KRP + ED+ P A +
Sbjct: 372 MLSREELWNSWKNEGCKEFKRPDAAAVAATAGAAAAATSGTATEDQPATPAAATSRVASK 431
Query: 386 -----------------------PELTKLWNS-KDNLEACKSAERDFTPSLESYFEEAIQ 421
PELT+LWN+ DNL+ACK +R+F PSLE+Y + + +
Sbjct: 432 RPRRPLGDLIRESTKQGKFFMGNPELTRLWNTCPDNLQACKGDDRNFLPSLEAYLDGSKE 491
Query: 422 QMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF--LNATPNVEKNSEFIENMVKRCV 479
+ DP+ + WRALRLL+R+ PHFF N+T + K ++++E++ K+
Sbjct: 492 RQDPS--------------FEWRALRLLARQSPHFFTLFNSTSSY-KVADYLESVRKKIQ 536
Query: 480 KEK 482
K+K
Sbjct: 537 KDK 539
>gi|195145597|ref|XP_002013778.1| GL23216 [Drosophila persimilis]
gi|194102721|gb|EDW24764.1| GL23216 [Drosophila persimilis]
Length = 718
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 245/562 (43%), Positives = 342/562 (60%), Gaps = 74/562 (13%)
Query: 20 KYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLP 79
+Y Y ++DK+ ++ + R L+K +K D+K I + + L VE ++ ++T+P
Sbjct: 45 EYNSYTANSEHDKRLPMDHAFRVLLMKRLKD---DVKRIGDLVRLSVEATRAEIVSNTIP 101
Query: 80 IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNT 139
++LL DTFD+ TLD+C+++F +VE V++WK++ FF SCKNN+LRMCNDLLRRLSR+QNT
Sbjct: 102 VVLLIDTFDVVTLDRCQEIFKFVEELVDVWKEEIFFASCKNNILRMCNDLLRRLSRTQNT 161
Query: 140 VFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKV 199
VFCGRILLFL+KFFPFSERSGLNI+SEFN++N+TE+G D ++ E DT E +
Sbjct: 162 VFCGRILLFLSKFFPFSERSGLNIVSEFNLDNLTEYGLDSRDHDDTDNKELEDTAEDIPL 221
Query: 200 KIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ-------- 251
KID++ Y KFWSLQD+FRNP QCY K WKMF +A+ +L +F S+KL+DV+
Sbjct: 222 KIDYDLYCKFWSLQDFFRNPNQCYGKAQWKMFHMHADNILQSFASFKLEDVRPSNSENGS 281
Query: 252 ------------------SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQ 293
S+++ + +FAK+LTN KLL LQLSD+NFRR VL+QFLILFQ
Sbjct: 282 EADQAMDVDDDIVDAAVASTVSKANHFFAKFLTNPKLLALQLSDSNFRRAVLVQFLILFQ 341
Query: 294 YFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQ 353
Y +VK + + L ++Q ++KDT VY L+++TPP G F++ V +L EE WN
Sbjct: 342 YLQVSVKFKIDTYTLTTEQADFIKDTEPRVYKLLEETPPYGRRFARTVHHMLVREEMWNN 401
Query: 354 WKNEGCPELKRPLTSIT-DEDKKDEP-------------DAKKK-----KTPELTKLWN- 393
WKNEGC E K+P + DED K P DA + LT+LWN
Sbjct: 402 WKNEGCKEFKKPEDPPSNDEDNKPPPAKRPKRPLGDSLRDAARNGKFFLGNENLTRLWNY 461
Query: 394 SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKC 453
S DNL+ACKS +R+F P LE+Y E ++DPA + WRALRLL+R+
Sbjct: 462 SPDNLQACKSEQRNFLPLLETYLETPHDKIDPA--------------FEWRALRLLARQT 507
Query: 454 PHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQIS---------GNGNGVDQDPAEVEVD 504
PHFF + + K S+++E + KR V+++ Q + NG PAE E D
Sbjct: 508 PHFFTSPSQPSSKISDYLEQVRKRLVRDREPKQSAPVSSNASEQNNGLAAKSTPAENEQD 567
Query: 505 TKSEEIQEEEKEEDWEAK-ADP 525
+QE E E+D E + DP
Sbjct: 568 A-GMALQEAEPEQDLEVEDTDP 588
>gi|198451954|ref|XP_001358566.2| GA15194 [Drosophila pseudoobscura pseudoobscura]
gi|198131721|gb|EAL27707.2| GA15194 [Drosophila pseudoobscura pseudoobscura]
Length = 718
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 244/562 (43%), Positives = 342/562 (60%), Gaps = 74/562 (13%)
Query: 20 KYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLP 79
+Y Y ++DK+ ++ + R L+K +K D+K I + + L VE ++ ++T+P
Sbjct: 45 EYNSYTANSEHDKRLPMDHAFRVLLMKRLKD---DVKRIGDLVRLSVEATRAEIVSNTIP 101
Query: 80 IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNT 139
++LL DTFD+ TLD+C+++F +VE V++WK++ FF SCKNN+LRMCNDLLRRLSR+QNT
Sbjct: 102 VVLLIDTFDVVTLDRCQEIFKFVEELVDVWKEEIFFASCKNNILRMCNDLLRRLSRTQNT 161
Query: 140 VFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKV 199
VFCGRILLFL+KFFPFSERSGLNI+SEFN++N+TE+G D ++ E DT E +
Sbjct: 162 VFCGRILLFLSKFFPFSERSGLNIVSEFNLDNLTEYGLDSRDHDDTDNKELEDTAEDIPL 221
Query: 200 KIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ-------- 251
KID++ Y KFWSLQD+FRNP QCY K WKMF +A+ +L +F S+KL+DV+
Sbjct: 222 KIDYDLYCKFWSLQDFFRNPNQCYGKAQWKMFHMHADNILQSFASFKLEDVRPSNSENGS 281
Query: 252 ------------------SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQ 293
S+++ + +FAK+LTN KLL LQLSD+NFRR VL+QFLILFQ
Sbjct: 282 EADQAMDVDDDIVDAAVASTVSKANHFFAKFLTNPKLLALQLSDSNFRRAVLVQFLILFQ 341
Query: 294 YFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQ 353
Y +VK + + L ++Q ++KDT VY L+++TPP G F++ V +L EE WN
Sbjct: 342 YLQVSVKFKIDTYTLTTEQADFIKDTEPRVYKLLEETPPYGRRFARTVHHMLVREEMWNN 401
Query: 354 WKNEGCPELKRPLTSIT-DEDKKDEP-------------DAKKK-----KTPELTKLWN- 393
WKNEGC E K+P + DED K P DA + LT+LWN
Sbjct: 402 WKNEGCKEFKKPEDPPSNDEDNKPPPAKRPKRPLGDSLRDAARNGKFFLGNENLTRLWNY 461
Query: 394 SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKC 453
S DNL+ACKS +R+F P LE+Y E ++DPA + WRALRLL+R+
Sbjct: 462 SPDNLQACKSEQRNFLPLLETYLESPHDKIDPA--------------FEWRALRLLARQT 507
Query: 454 PHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQIS---------GNGNGVDQDPAEVEVD 504
PHFF + + K S+++E + KR ++++ Q + NG PAE E D
Sbjct: 508 PHFFTSPSQPSSKISDYLEQVRKRLIRDREPKQSAPVSSNASEQNNGLAAKSTPAENEQD 567
Query: 505 TKSEEIQEEEKEEDWEAK-ADP 525
+QE E E+D E + DP
Sbjct: 568 A-GMALQEAEPEQDLEVEDTDP 588
>gi|405964743|gb|EKC30192.1| THO complex subunit 1 [Crassostrea gigas]
Length = 692
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 245/552 (44%), Positives = 350/552 (63%), Gaps = 49/552 (8%)
Query: 23 LYKPTC--DYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPI 80
+++ C + +KK A++Q++R + ++ D D + + I +E+ +D+C+ +P
Sbjct: 67 IFQSVCGSEAEKKLAVDQALRDSMKDVVMRAD-DCEEYKKVILHSIEMAKEDLCSVIMPF 125
Query: 81 ILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTV 140
+LLSD FD+ TL +CE +FY VE + +WK F+ S KN LLRMCNDLLRRLS+SQNTV
Sbjct: 126 VLLSDIFDLITLSQCEIVFYLVEEKLPVWKTPLFYDSGKNYLLRMCNDLLRRLSKSQNTV 185
Query: 141 FCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE------MDVSSNETEETDTE 194
FCGRI LFL++ FP SE+S LN++S+FN+EN+T F E D+S + E + E
Sbjct: 186 FCGRIQLFLSRLFPLSEKSALNLMSQFNLENVTVFSTKSEEVKPKSHDISDDVMEVEEGE 245
Query: 195 EVD---KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ 251
D +D+N Y++FW+LQDYFR P QCY+KV WK F A+ VL AF S KLDD++
Sbjct: 246 MEDLTGSTPVDYNLYRRFWALQDYFRKPTQCYDKVPWKAFQQNADVVLNAFASAKLDDMK 305
Query: 252 SS-----LNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRG 303
SS L D +FAKYLT++KLLDLQL+D+NFRRYVL+QFLILFQY S VK +
Sbjct: 306 SSRRKLDLPRPADTRAFFAKYLTSEKLLDLQLNDSNFRRYVLVQFLILFQYLNSQVKFKS 365
Query: 304 EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELK 363
L +Q +W+K + VY LI++TPPDGE F ++V+ IL EEHWN+WKN+GCP+
Sbjct: 366 ASQNLSDEQNQWIKTIQDKVYQLIRETPPDGEQFCKIVQHILAREEHWNKWKNDGCPDFA 425
Query: 364 RPLTSITDEDKKDEPDAKKK----------------KTPELTKLWN-SKDNLEACKSAER 406
R + ED K + AK++ + ELT+LWN + DN+EACK+ +R
Sbjct: 426 RKPVA---EDTKIKSRAKRRWVGDDLQAHGGKIIKMGSAELTRLWNINPDNMEACKAEKR 482
Query: 407 DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEK 466
F PSLE +F EA++Q DP A VEEQYK VN+ + W++LRLL+R+ PHFF
Sbjct: 483 VFLPSLEDFFAEAMEQADPEAKVEEQYKMVNNPVFQWKSLRLLARQSPHFFGQHHTPAMP 542
Query: 467 NSEFIENMVKRCVKEKPSSQISGN----GNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAK 522
+++E M+ + +E PS+ +G+ GN ++ EV D E I+E ++ ED E K
Sbjct: 543 LPQYLELMLNKIAQELPSTGSNGDVAPEGNN-EEMKTEVGDD---EAIKESQEVEDEELK 598
Query: 523 ADPEGDADEVMS 534
A E +A++++S
Sbjct: 599 AQAE-EAEDILS 609
>gi|195449329|ref|XP_002072026.1| GK22628 [Drosophila willistoni]
gi|194168111|gb|EDW83012.1| GK22628 [Drosophila willistoni]
Length = 710
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 239/558 (42%), Positives = 344/558 (61%), Gaps = 73/558 (13%)
Query: 14 LNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDM 73
+ ++ + Y + ++DK+S ++Q+ R+ L+K + D++ I + L V+ ++
Sbjct: 41 VKILVETYNNFNGNSEHDKRSPMDQAFREQLMKNLNE---DVEKIGALVKLSVDATRAEL 97
Query: 74 CNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRL 133
+ST+P++LL DTFD+ TLDKC++LF +VE V +WK+ F SCKNN+LRMCNDLLRRL
Sbjct: 98 VSSTIPVVLLGDTFDVVTLDKCQELFQFVEQMVEVWKEDILFGSCKNNILRMCNDLLRRL 157
Query: 134 SRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDT 193
SR+QNTVFCGRILLFL+KFFPFSERSGLNI+SEFN++N TE+G D + D + E D
Sbjct: 158 SRTQNTVFCGRILLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDGK-DNEDDSKELEDI 216
Query: 194 EEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ-- 251
E +KID++ Y KFWSLQD+FRNP QCY+K WKMF +AET+L +F S+KL+D++
Sbjct: 217 AEDIPLKIDYDLYCKFWSLQDFFRNPNQCYSKAQWKMFQMHAETILQSFVSFKLEDIRPG 276
Query: 252 ----------------------SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
S++N + ++FAK+LTN KLL LQLSD+NFRR VL+QFL
Sbjct: 277 NDSGDGPQPMEVDEAAVDAAAASTVNRANNFFAKFLTNPKLLALQLSDSNFRRAVLVQFL 336
Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
ILFQY +VK + E L Q ++K+T V+ L+++TPP G F++ V+ +L EE
Sbjct: 337 ILFQYLQVSVKFKMESYILNPAQADFIKETEPRVFKLLEETPPYGRRFARTVQHMLVREE 396
Query: 350 HWNQWKNEGCPELKRPLT-SITDEDKKDEPDAKKKKT------------------PELTK 390
WN WKNEGC E K+P + +ED K P + K+ LT+
Sbjct: 397 MWNNWKNEGCKEFKKPENPPVIEEDIKPPPAKRPKRALGDALRDATRNGKFFLGNENLTR 456
Query: 391 LWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLL 449
LWN S DNL+ACKS +R+F P LE+Y E ++DP + WRALRLL
Sbjct: 457 LWNYSPDNLQACKSEQRNFLPQLETYLETPHGKVDPP--------------FEWRALRLL 502
Query: 450 SRKCPHFFLNATPNVEKNSEFIENMVKRCVKEK-PSSQISGNGNGVDQD--------PAE 500
+R+ PHFF + + K S+++E + KR ++K P ++G N + + PAE
Sbjct: 503 ARQTPHFFTSVSQPSSKISDYLEQVRKRLNRDKEPKPTLTGATNALGGNTAVVATTPPAE 562
Query: 501 VEVDTKSEEIQEEEKEED 518
+E +T + +Q+ E E+D
Sbjct: 563 IEQETVA--LQDPEPEQD 578
>gi|301609229|ref|XP_002934170.1| PREDICTED: THO complex subunit 1 [Xenopus (Silurana) tropicalis]
Length = 671
Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust.
Identities = 234/503 (46%), Positives = 312/503 (62%), Gaps = 55/503 (10%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIEN---YISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q R L + I I+ EN ISL + +C +T P +L
Sbjct: 54 PGSENEKKPTLDQVFRVVLEEEI----INQVSCENCLAIISLAINGVTDGICTATTPFVL 109
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D L C+++F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 110 LGDVLDCLPLAYCDKIFTFVEKNVGTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 169
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE--------------MDVSSNET 188
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ M++ E
Sbjct: 170 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNANEQESTLGQKVEEKDDGMEIEEGEM 229
Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
+ D + ID+N Y+KFW+LQD+FRNPVQCY+K SWK F +++ VL FKSYKLD
Sbjct: 230 GDDDAPNSSSIPIDYNLYRKFWTLQDFFRNPVQCYDKNSWKTFLKFSDEVLTVFKSYKLD 289
Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
D Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++L+Q+LILFQY VK
Sbjct: 290 DTQASRKKIEELKSGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILVQYLILFQYLKGQVK 349
Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
+ L +Q W++DTT+ V+ L+ +TPP GE FS+ V+ IL EE+WN WKNEGCP
Sbjct: 350 FKSSNYILLDEQSLWIEDTTKLVHQLLSETPPVGEKFSKTVEHILNTEENWNAWKNEGCP 409
Query: 361 EL--KRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEA 400
+RP + K + AKK+ P ELT+LWN DN+EA
Sbjct: 410 SFVKERPT------ETKPQKMAKKRPLPEEFLGKGSNKKMSMGNDELTRLWNLCPDNMEA 463
Query: 401 CKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNA 460
C+S R+F PSLE +FEEAI+Q DP V+ +YK VN++NY WRALRLL+R+ PHFF
Sbjct: 464 CRSESREFMPSLEQFFEEAIEQADPENMVDNEYKVVNNANYGWRALRLLARRSPHFFQPT 523
Query: 461 TPNVEKNSEFIENMVKRCVKEKP 483
+ E++ENMV + KE P
Sbjct: 524 NQQFKSLPEYLENMVIKLAKELP 546
>gi|194898899|ref|XP_001979000.1| GG10810 [Drosophila erecta]
gi|190650703|gb|EDV47958.1| GG10810 [Drosophila erecta]
Length = 713
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 239/573 (41%), Positives = 346/573 (60%), Gaps = 71/573 (12%)
Query: 4 SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
+ +L + ++ +Y + D+DK+ ++ + R L+K + D+ I +
Sbjct: 28 ALELAIADGNVELLVKEYNSFPGNTDHDKRLPMDHAFRVLLMKRLYD---DVNRIGELVR 84
Query: 64 LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
L VE ++ ++T+P++LL DTFD+ TLDKC ++F +VE V +WK++ FF SCKNN+L
Sbjct: 85 LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCLKIFQFVEEMVEVWKEEIFFASCKNNIL 144
Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
RMCNDLLRRLSR+QNTVFCGRI LFL+KFFPFSERSGLNI+SEFN++N TE+G D +
Sbjct: 145 RMCNDLLRRLSRTQNTVFCGRIQLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHD 204
Query: 184 SSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
S+ E DT E +KID++ Y KFWSLQD+FRNP QCYNK WKMF +AE +L +F
Sbjct: 205 ESDNKELEDTAEDIPLKIDYDLYCKFWSLQDFFRNPNQCYNKAQWKMFQMHAENILQSFS 264
Query: 244 SYKLDDVQ--------------------------SSLNPSGDYFAKYLTNQKLLDLQLSD 277
S+KL+DV+ +S+ + +FAK+LTN KLL LQLSD
Sbjct: 265 SFKLEDVRQSSSDNASESDNAMDVDEEAVDATAVTSVIKANHFFAKFLTNPKLLALQLSD 324
Query: 278 TNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
NFRR VL+QFLILFQY +VK + + L +DQ ++K+T VY L+++TPP G+ F
Sbjct: 325 ANFRRAVLVQFLILFQYLQVSVKFKSDTYTLTADQADFIKETESRVYKLLEETPPYGKRF 384
Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRP-LTSITDEDKKDEPDAKKKKT----------- 385
S+ V +L EE WN WKNEGC E K+P + +ED K P+ + ++
Sbjct: 385 SRTVHHMLAREEMWNNWKNEGCKEFKKPEEPTPNEEDLKPIPNKRPRRALGDALRDASRS 444
Query: 386 -------PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
LT+LWN S DNL+ACKS +R+F P LE+Y E +++DPA
Sbjct: 445 GKFYLGNDNLTRLWNYSPDNLQACKSEQRNFLPLLEAYLETPHEKVDPA----------- 493
Query: 438 DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE---KPSSQISGNGN-- 492
+ WRALRLL+R+ PHFF + + K S+++E + KR +++ KP++ + N +
Sbjct: 494 ---FEWRALRLLARQTPHFFTSLSQPSSKISDYLEQVRKRLIRDKEPKPAALLGNNSSEN 550
Query: 493 --GVDQDPAEVEVDTKSE-EIQEEEKEEDWEAK 522
G+ + A+ E + + Q+ E E+D E +
Sbjct: 551 TFGLSANNAQTEGEQDATVAFQDGEPEQDLEVE 583
>gi|194745454|ref|XP_001955203.1| GF18644 [Drosophila ananassae]
gi|190628240|gb|EDV43764.1| GF18644 [Drosophila ananassae]
Length = 713
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 245/587 (41%), Positives = 353/587 (60%), Gaps = 72/587 (12%)
Query: 4 SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
+ +L ++ + ++ Y + D+DK+ ++ + R L+K ++ D++ I + +
Sbjct: 28 TLELAIAQNSVELLVQAYNSFASNSDHDKRLPMDHAFRVLLMKRLED---DVQRIGDLVR 84
Query: 64 LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
L VE ++ ++T+P++LL DTFD+ TLDKC+++F +VE V +WK++ FF SCKNN+L
Sbjct: 85 LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCQEIFRFVEELVEVWKEEIFFASCKNNIL 144
Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
RMCNDLLRRLSR+QNTVFCGRILLFL+KFFPFSERSGLNI+SEFN++N TE+G D +
Sbjct: 145 RMCNDLLRRLSRTQNTVFCGRILLFLSKFFPFSERSGLNIVSEFNLDNYTEYGLDSKDHD 204
Query: 184 SSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
++ E DT E +KID++ Y KFWSLQ++FRNP QCY+K WKMF + E +L +F
Sbjct: 205 DNDNKELEDTAEDIPLKIDYDLYCKFWSLQEFFRNPNQCYSKAQWKMFQMHTENILQSFS 264
Query: 244 SYKLDDVQ--------------------------SSLNPSGDYFAKYLTNQKLLDLQLSD 277
S+KL+D++ S++N + +FAK+LTN KLL LQLSD
Sbjct: 265 SFKLEDLRQSINDNASESGHAMEIDDEAVDSAAISTVNKANHFFAKFLTNPKLLALQLSD 324
Query: 278 TNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
+NFRR +L+QFLILFQY +VK + + L +DQ +++KDT VY L+++TPP G F
Sbjct: 325 SNFRRAILVQFLILFQYLQVSVKFKNDSYTLTTDQTEFIKDTEPRVYKLLEETPPYGRRF 384
Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRPLTSIT-DEDKKDEP-------------DAKKK 383
++ V +L EE WN WKNEGC E K+P T DED K P DA ++
Sbjct: 385 ARTVNHMLVREEMWNNWKNEGCKEFKKPEEPATADEDLKPPPTKRPKRPLGDSLRDAARQ 444
Query: 384 -----KTPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
LT+LWN S DNL+ACKS +R+F P LE+Y E + DPA
Sbjct: 445 GKFFLGNDNLTRLWNYSPDNLQACKSEQRNFLPLLETYLETPHDKTDPA----------- 493
Query: 438 DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISG-------N 490
+ WRALRLL+R+ PHFF + + K S+++E + KR V++K +S N
Sbjct: 494 ---FEWRALRLLARQTPHFFTSLSLPSNKISDYLEQVRKRLVRDKEPKAVSAASNSEQNN 550
Query: 491 GNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAK-ADPEGDADEVMSVE 536
G + E E D + +QE E E+ E + +P + DE + E
Sbjct: 551 GLATNNSHTESEQDA-TVALQEVEPEQGLEVEDTEPVEETDEAPAHE 596
>gi|195502124|ref|XP_002098085.1| GE24134 [Drosophila yakuba]
gi|194184186|gb|EDW97797.1| GE24134 [Drosophila yakuba]
Length = 713
Score = 440 bits (1131), Expect = e-120, Method: Compositional matrix adjust.
Identities = 232/525 (44%), Positives = 326/525 (62%), Gaps = 63/525 (12%)
Query: 4 SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
+ +L + ++ +Y + ++DK+ ++ + R L+ + D D+ I +
Sbjct: 28 ALELAIKDGNVELLVKEYNSFPANTEHDKRLPMDHAFRVLLMNRL---DDDVNRIGELVR 84
Query: 64 LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
L VE ++ ++T+P++LL DTFD+ TLDKC+++F +VE V +WK++ FF SCKNN+L
Sbjct: 85 LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCQKIFQFVEEMVEVWKEEIFFASCKNNIL 144
Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
RMCNDLLRRLSR+QNTVFCGRI LFL+KFFPFSERSGLNI+SEFN++N TE+G D +
Sbjct: 145 RMCNDLLRRLSRTQNTVFCGRIQLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHD 204
Query: 184 SSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
S+ E DT E +KID++ Y KFWSLQD+FRNP QCYNK WKMF +AE +L +F
Sbjct: 205 ESDNKELEDTAEDIPLKIDYDLYCKFWSLQDFFRNPNQCYNKAQWKMFQMHAENILQSFS 264
Query: 244 SYKLDDVQ--SSLNPS------------------------GDYFAKYLTNQKLLDLQLSD 277
S+KL+DV+ SS N S +FAK+LTN KLL LQLSD
Sbjct: 265 SFKLEDVRQSSSDNASESDQAMDVDEEAVDTTAVTTVIKANHFFAKFLTNPKLLALQLSD 324
Query: 278 TNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
NFRR VL+QFLILFQY +VK + + L +DQ ++K+T VY L+++TPP G+ F
Sbjct: 325 ANFRRAVLVQFLILFQYLQVSVKFKSDTYTLTADQSDFIKETETRVYKLLEETPPYGKRF 384
Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRP-LTSITDEDKKDEPDAKKKKT----------- 385
S+ V +L EE WN WKNEGC E K+P + ++ED K P+ K ++
Sbjct: 385 SRTVHHMLAREEMWNNWKNEGCKEFKKPEEPTPSEEDLKPIPNKKPRRALGDALRDASRS 444
Query: 386 -------PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
LT+LWN S DNL+ACKS +R+F P LE+Y E +++DPA
Sbjct: 445 GKFFLGNDNLTRLWNYSPDNLQACKSEQRNFLPLLETYLETPHEKVDPA----------- 493
Query: 438 DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEK 482
+ WRALRLL+R+ PHFF + + K S+++E + KR +++K
Sbjct: 494 ---FEWRALRLLARQTPHFFTSLSQPSSKISDYLEQVRKRLIRDK 535
>gi|195112320|ref|XP_002000722.1| GI10383 [Drosophila mojavensis]
gi|193917316|gb|EDW16183.1| GI10383 [Drosophila mojavensis]
Length = 730
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 250/588 (42%), Positives = 350/588 (59%), Gaps = 90/588 (15%)
Query: 20 KYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLP 79
+Y ++ ++DK+S ++Q+ R+ L+K + D++ I + L V+ ++ ++T+P
Sbjct: 44 EYHSFRANSEHDKRSPMDQAFREVLMKRLSD---DVERIGALVRLSVDAARAEIVSNTIP 100
Query: 80 IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNT 139
++LL DTFD+ TL+KCE +F +VE V +WK++ FF SCKNNLLRMCNDLLRRLSR+QNT
Sbjct: 101 VVLLGDTFDVVTLNKCELIFKFVEDLVEVWKEEIFFASCKNNLLRMCNDLLRRLSRTQNT 160
Query: 140 VFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKV 199
VFCGRILLFL+KFFPFSERSGLNI+SEFN++N TE+G D + + E DT E +
Sbjct: 161 VFCGRILLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHDDLDNKELEDTAEDIPL 220
Query: 200 KIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSG- 258
KID++ Y KFWSLQD+FRNP QCY+K WKMF +AET+L +F S+KL+D++ + + +G
Sbjct: 221 KIDYDLYCKFWSLQDFFRNPNQCYSKAQWKMFQMHAETILQSFLSFKLEDLRQNTDSNGA 280
Query: 259 -------------------------------------DYFAKYLTNQKLLDLQLSDTNFR 281
++FAK+LTN KLL LQLSD+NFR
Sbjct: 281 ANGVNGSSGDSGAQMMDVDVDDAAVDAAASMAVRKETNFFAKFLTNPKLLALQLSDSNFR 340
Query: 282 RYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVV 341
R VL+QFLILFQY +VK + + L S Q ++K+T + VY L+++TPP+G F++ V
Sbjct: 341 RAVLVQFLILFQYLQVSVKFKIDTYTLTSAQSDFIKETEQRVYKLLEETPPNGRRFARTV 400
Query: 342 KLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKTP--------------- 386
+ +L+ EE WN WKN+GC E K+P ED+ K+ K P
Sbjct: 401 QHMLQREEMWNNWKNDGCKEFKKPEEPEPTEDESKPSPPKRAKRPLGDCLREAARNGKFY 460
Query: 387 ----ELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNY 441
LT+LWN S DNL+ACKS +R+F P LE+Y E + DPA +
Sbjct: 461 LGNDNLTRLWNYSPDNLQACKSEQRNFLPQLETYLETPYDKTDPA--------------F 506
Query: 442 AWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP-SSQIS---------GNG 491
WRALRLL+R+ PHFF T K +++++ + KR +EK + QI+ NG
Sbjct: 507 EWRALRLLARQTPHFFTFVTQPSCKIADYLDVVRKRLAREKELAKQINTAAANASEHNNG 566
Query: 492 NGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEG---DADEVMSVE 536
PAE E D S +QE E E D E + DPE +ADE + E
Sbjct: 567 LAASSAPAEQEHDA-SVALQETEPEPDLEVE-DPEPVVEEADETPAHE 612
>gi|195391418|ref|XP_002054357.1| GJ22844 [Drosophila virilis]
gi|194152443|gb|EDW67877.1| GJ22844 [Drosophila virilis]
Length = 720
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 247/575 (42%), Positives = 348/575 (60%), Gaps = 83/575 (14%)
Query: 20 KYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLP 79
+Y ++ ++DK+S ++Q+ R+ L+K + +++ I + L V+ ++ ++T+P
Sbjct: 44 EYHSFRANSEHDKRSPMDQAFREVLMKRLSD---NVERIGALVRLSVDAARAEIVSNTIP 100
Query: 80 IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNT 139
++LL DTFD+ TL+KCE +F +VE V +WK++ FF SCKNNLLRMCNDLLRRLSR+QNT
Sbjct: 101 VVLLGDTFDVVTLNKCELIFNFVEELVEVWKEEIFFASCKNNLLRMCNDLLRRLSRTQNT 160
Query: 140 VFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGD--EEMDVSSNETEETDTEEVD 197
VFCGRILLFL+KFFPFSERSGLNI+SEFN++N TE+G D + D+ + E E DT E
Sbjct: 161 VFCGRILLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHDDIENKELE--DTAEDI 218
Query: 198 KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPS 257
+KID++ Y KFWSLQD+FRNP QCY+K WKMF +AET+L +F S+KL+DV+ +++ +
Sbjct: 219 PLKIDYDLYCKFWSLQDFFRNPNQCYSKAQWKMFQMHAETILQSFSSFKLEDVRQNIDTN 278
Query: 258 G------------------------------DYFAKYLTNQKLLDLQLSDTNFRRYVLLQ 287
G ++FAK+LTN KLL LQLSD+NFRR VL+Q
Sbjct: 279 GHSESCLQLMDVDMDDATIDGAVSAAMHKETNFFAKFLTNPKLLALQLSDSNFRRAVLVQ 338
Query: 288 FLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKG 347
FLILFQY +VK + E L S Q ++K+T + VY L+++TPP+G F++ V+ +L+
Sbjct: 339 FLILFQYLQVSVKFKIESYTLTSAQSDFIKETEQRVYKLLEETPPNGRRFARTVQHMLQR 398
Query: 348 EEHWNQWKNEGCPELKRPLTSITDEDKKDEP--------------DAKKK-----KTPEL 388
EE WN WKN+GC E K+P ED P DA + L
Sbjct: 399 EEMWNNWKNDGCKEFKKPEEPEPSEDDAKPPPPKRSKRPLGDCLRDAARNGKFFLGNDNL 458
Query: 389 TKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
T+LWN S DNL+ACKS +R+F P LE+Y E + DPA + WRALR
Sbjct: 459 TRLWNYSPDNLQACKSEQRNFLPLLETYLETPHDKTDPA--------------FEWRALR 504
Query: 448 LLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEK----------PSSQISGNGNGVDQD 497
LL+R+ PHFF + K S++++ + KR +EK ++ NG V
Sbjct: 505 LLARQTPHFFTFVSQPSCKISDYLDVVRKRLTREKELVKQTSTTTANASEHNNGLAVSSA 564
Query: 498 PAEVEVDTKSEEIQEEEKEEDWEAKADPEGDADEV 532
P E E + S +QE E E+D E + D E +EV
Sbjct: 565 PLEQEHEA-SVALQETEPEQDLEVE-DSEPVVEEV 597
>gi|119622138|gb|EAX01733.1| THO complex 1, isoform CRA_c [Homo sapiens]
Length = 635
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 232/486 (47%), Positives = 305/486 (62%), Gaps = 42/486 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTPE--LTKLWNSKDNLEACKSAERDFTPSLESYFE 417
P + TS D K +K+ PE L K K + ++ P+LE +FE
Sbjct: 397 PSFVKERTS----DTKPTRIIRKRTAPEDFLGKGPTKKILM------GKEHMPTLEEFFE 446
Query: 418 EAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKR 477
EAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF + E++ENMV +
Sbjct: 447 EAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLENMVIK 506
Query: 478 CVKEKP 483
KE P
Sbjct: 507 LAKELP 512
>gi|432917195|ref|XP_004079463.1| PREDICTED: THO complex subunit 1-like [Oryzias latipes]
Length = 654
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 227/499 (45%), Positives = 312/499 (62%), Gaps = 44/499 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
P + +KK+ L+Q++R L I + I L ++ + +C++T P +LL D
Sbjct: 39 PGNETEKKTTLDQALRGVLGDQIVEQKASCDDYLSLIYLSIDAVTEGICSATTPFVLLGD 98
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D LD C+++F +VE NV+ WK +F+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 99 VLDCLPLDLCDKIFSFVEENVSTWKSNSFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 158
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETE--------------E 190
LFLA+ FP SE+SGLN+ S+FN+EN+T F +E E + T E
Sbjct: 159 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNKNELESTLGQKSTNEKEEGMEVEEGEMGE 218
Query: 191 TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
D + ID+N Y+KFW+LQDYFRNPVQCY+K W F +++ LA FKSYKLDD+
Sbjct: 219 EDAPAQSSIPIDYNLYRKFWTLQDYFRNPVQCYDKFLWMTFLKFSDETLAVFKSYKLDDM 278
Query: 251 QSS-------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
Q+S G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY VK
Sbjct: 279 QASKRKLDELRGSDGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKF 338
Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
+ L DQ W+++TT+ VY L+K+ PPDG+ F+ +V+ IL EE+WN WKNEGCP
Sbjct: 339 KSSSCVLNDDQTTWIEETTKLVYQLLKEIPPDGDKFASMVEHILDTEENWNSWKNEGCPS 398
Query: 362 LKRPLTSITDEDKKDEPDAKKK-------KTP---------ELTKLWN-SKDNLEACKSA 404
+ T +DK P K++ K P ELT+LWN + DN+EACKS
Sbjct: 399 FVKERTV---DDKPKRPTRKRQAPEDFLGKGPDRKIFMGNDELTRLWNLNSDNMEACKSD 455
Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNV 464
R+F PSL+ +F EAI+Q DP VE++YK V + NY WRALRLLSR+ PHFF
Sbjct: 456 SREFMPSLDEFFAEAIEQADPVNMVEDEYKVVRNPNYGWRALRLLSRRSPHFFQPTNQKF 515
Query: 465 EKNSEFIENMVKRCVKEKP 483
+ ++++++MV + KE P
Sbjct: 516 KSLADYLDSMVSKLAKELP 534
>gi|47220323|emb|CAF98422.1| unnamed protein product [Tetraodon nigroviridis]
Length = 568
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 220/452 (48%), Positives = 296/452 (65%), Gaps = 44/452 (9%)
Query: 73 MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRR 132
+C++T P +LL D D L +C+++F +VE NV+ WK +F+ + KN LLRMCNDLLRR
Sbjct: 2 ICSATTPFVLLGDVLDCLPLHQCDKIFSFVEENVSTWKSNSFYTAGKNYLLRMCNDLLRR 61
Query: 133 LSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEE- 190
LS+SQNTVFCGRI LFLA+ FP SE+SGLN+ S+FN++NIT F +E E + EE
Sbjct: 62 LSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQFNLDNITVFNKNEQESSLGQKHAEEK 121
Query: 191 -------------TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
D + ID+N Y+KFW+LQDYFRNPVQCY+KVSW F Y+E
Sbjct: 122 EDGMEVEEGEMGEEDAPAPCSIPIDYNLYRKFWTLQDYFRNPVQCYDKVSWITFVKYSEE 181
Query: 238 VLAAFKSYKLDDVQSS------LNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQF 288
LA FKSYKLDD+Q+S L SG YFAK+LT++KL+DLQLSD+NFRR++LLQ+
Sbjct: 182 TLAVFKSYKLDDMQASKRKLEDLRTSGGDHVYFAKFLTSEKLMDLQLSDSNFRRHILLQY 241
Query: 289 LILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
LILFQY VK + L DQ W+++T++ VY L+++ PPDG+ F+ +V+ IL E
Sbjct: 242 LILFQYLKGQVKFKSSSSVLNDDQGSWIEETSKLVYQLLREIPPDGDKFATMVEHILSTE 301
Query: 349 EHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKK-------KTP---------ELTKLW 392
E+WN WKNEGCP + T +DK P K++ K P ELT+LW
Sbjct: 302 ENWNSWKNEGCPSFVKERTV---DDKPKRPTRKRQAPEDFLGKGPDRKIFMGNDELTRLW 358
Query: 393 N-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
N + DN++ACKS R+F PSL+ +F EAI+Q DPA VE++YK V + NY WRALRLLSR
Sbjct: 359 NLNHDNMDACKSESREFIPSLDDFFSEAIEQADPANMVEDEYKVVRNPNYGWRALRLLSR 418
Query: 452 KCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
+ PHFF + ++++++MV + KE P
Sbjct: 419 RSPHFFQPTNQKFKSLADYLDSMVSKLAKELP 450
>gi|195056107|ref|XP_001994954.1| GH17514 [Drosophila grimshawi]
gi|193892717|gb|EDV91583.1| GH17514 [Drosophila grimshawi]
Length = 717
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 235/543 (43%), Positives = 325/543 (59%), Gaps = 77/543 (14%)
Query: 20 KYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLP 79
+Y + ++DK+S ++Q+ R+ L+K + DI+ I + L VE ++ ++T+P
Sbjct: 44 EYHGFGANSEHDKRSPMDQAFREVLMKRLSD---DIEHIGTLVRLSVEAARAEIVSNTIP 100
Query: 80 IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNT 139
++LL DTFD+ TL+KCE +F +VE V +WK++ FF SCKNNLLRMCNDLLRRLSR+QNT
Sbjct: 101 VVLLGDTFDVITLNKCELIFKFVEELVEVWKEEIFFASCKNNLLRMCNDLLRRLSRTQNT 160
Query: 140 VFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKV 199
VFCGRILLFL+KFFPFSERSGLNI+SEFN++N TE+G D + + E DT E +
Sbjct: 161 VFCGRILLFLSKFFPFSERSGLNIVSEFNLDNYTEYGLDSKDHDEIDNKELEDTAEDIPL 220
Query: 200 KIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD 259
KID++ Y KFWSLQD+FRNP QCY+K WKMF +AE +L +F S+KL+DV+ + + +G+
Sbjct: 221 KIDYDLYCKFWSLQDFFRNPNQCYSKAQWKMFQMHAENILQSFSSFKLEDVRPNNDFNGN 280
Query: 260 ------------------------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
+FAK+LTN KLL LQLSD+NFRR VL+QFL
Sbjct: 281 SDTGPQAMDVDIDAQAMDAVATLAVRKENHFFAKFLTNPKLLALQLSDSNFRRAVLVQFL 340
Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
ILFQY +VK + + L S Q ++K+T + VY L+++TPP G F++ V +L+ EE
Sbjct: 341 ILFQYLQVSVKFKIDTYTLTSAQSDFIKETEQRVYKLLEETPPYGRRFARTVLHMLQREE 400
Query: 350 HWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKTP-------------------ELTK 390
WN WKN+GC E K+P +D AK+ K P LT+
Sbjct: 401 MWNNWKNDGCKEFKKPEEPEPSQDDSKPTPAKRSKRPLGDCLRDAARNGKFYLGNDTLTR 460
Query: 391 LWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLL 449
LWN S DNL+ACKS +R+F P LE+Y E + DPA + WRALRLL
Sbjct: 461 LWNYSPDNLQACKSEQRNFLPLLETYLETTHDKSDPA--------------FEWRALRLL 506
Query: 450 SRKCPHFFLNATPNVEKNSEFIENMVKRCVKEK----------PSSQISGNGNGVDQDPA 499
+R+ HFF + K S++++ + KR +EK +S NG V P+
Sbjct: 507 ARQTSHFFTFISQPSCKISDYLDVVRKRLTREKELVKQTNTASANSSEHNNGLAVSSQPS 566
Query: 500 EVE 502
E E
Sbjct: 567 ENE 569
>gi|241166483|ref|XP_002409875.1| nuclear matrix protein, putative [Ixodes scapularis]
gi|215494663|gb|EEC04304.1| nuclear matrix protein, putative [Ixodes scapularis]
Length = 566
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 224/448 (50%), Positives = 288/448 (64%), Gaps = 37/448 (8%)
Query: 73 MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT--FFMSCKNNLLRMCNDLL 130
+C+ +LP +L D FD L++ E+ F VE V+ WK+ FF +CKN LLR CNDLL
Sbjct: 1 VCSGSLPGLLFCDMFDSLPLNESEEFFSLVEDKVDTWKEAGILFFKNCKNQLLRTCNDLL 60
Query: 131 RRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG----------GDEE 180
RRLSRSQNTVFCG+IL+FLAK FP SERSGLNIISEFN ENIT F G +
Sbjct: 61 RRLSRSQNTVFCGKILVFLAKLFPLSERSGLNIISEFNSENITLFSSEDYKGMPMKGSPD 120
Query: 181 MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
D+ ET+ D + +D+N Y+KFWSLQ+YFR P CYNKV W+ FTSY+ VL+
Sbjct: 121 DDLEEGETD--DISSTATLSVDYNLYRKFWSLQEYFRQPTLCYNKVHWRQFTSYSSDVLS 178
Query: 241 AFKSYKLDDVQSS----------LNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLI 290
F S+KLDD++S+ L+ YFAKYLT+QKLL+L+LSD+NFRRYVL+QFLI
Sbjct: 179 VFGSFKLDDIKSAQWTLPEDAAALSGKSVYFAKYLTSQKLLELELSDSNFRRYVLVQFLI 238
Query: 291 LFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
L QY STV+ + E L +Q WVK+T VY L+++TPPDG F+ +K IL+ EE
Sbjct: 239 LIQYLQSTVRFKQESQVLTEEQSTWVKNTMSLVYKLLEETPPDGVGFAANIKHILQREEF 298
Query: 351 WNQWKNEGCPELK-----------RPLTSITDEDKKDEPDAKK-KKTPELTKLWN-SKDN 397
WN WKNEGCP+ K RP + DE K + K E+ +LWN DN
Sbjct: 299 WNTWKNEGCPDFKEAKPVDIVKEGRPKRKLGDEIKAALANKKVILGNAEMNRLWNLCPDN 358
Query: 398 LEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
EAC S +RDF PSLE +FE A++Q DPA A+E +YK V N+ WR+LRLLS+K PHFF
Sbjct: 359 WEACTSPKRDFVPSLEKFFETAVEQADPAHAIENRYKNVYQGNFGWRSLRLLSQKSPHFF 418
Query: 458 LNATPNVEKNSEFIENMVKRCVKEKPSS 485
++ ++ ++E+MV R KE P S
Sbjct: 419 TSSNQPIKSLPLYLESMVLRLAKELPLS 446
>gi|346473721|gb|AEO36705.1| hypothetical protein [Amblyomma maculatum]
Length = 587
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 241/542 (44%), Positives = 327/542 (60%), Gaps = 36/542 (6%)
Query: 33 KSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMK-DMCNSTLPIILLSDTFDMST 91
+ A++ + ++ L +L++ ++ + I L + L + D+C+ +LP++LL D FD
Sbjct: 16 RQAVDLASKKVLSRLLRV-GLNKNTANDLIDLSITLARQADLCSGSLPVLLLCDVFDSLP 74
Query: 92 LDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAK 151
L++ E F VE V+IW++ FF CKN LLR CNDLLRRLSRSQNTVFCG+IL+FLAK
Sbjct: 75 LNESEDFFSLVEGKVDIWREDLFFKCCKNQLLRTCNDLLRRLSRSQNTVFCGKILVFLAK 134
Query: 152 FFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEV--DKVKIDFNFYKKF 209
FP SERSGLNI SEFN ENIT F + +S D+E+ V +D+N Y+KF
Sbjct: 135 LFPLSERSGLNIASEFNAENITLFSSEGYKGISDEVPFREDSEDAPSSAVPVDYNLYRKF 194
Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS----------LNPSGD 259
WSLQDYFR P+ CYNKV W+ FTSY+ VL+ F S+KLDDV+++ +
Sbjct: 195 WSLQDYFRQPILCYNKVHWRQFTSYSGDVLSVFGSFKLDDVKTAKWALPESSDSSSGKSV 254
Query: 260 YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDT 319
YFAKYLT+QKLL+L+LSD+NFRRYVL+QFLILFQY STV+ + E L +Q W+K+T
Sbjct: 255 YFAKYLTSQKLLELELSDSNFRRYVLVQFLILFQYLQSTVRFKLETHILTEEQSAWLKNT 314
Query: 320 TETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELK-----------RPLTS 368
VY L+++TPPDG F+ +K IL+ EE WN WKNEGCP+ K RP
Sbjct: 315 LTQVYKLLEETPPDGVGFAARIKHILQREESWNAWKNEGCPDFKEARPVEPPRETRPKRR 374
Query: 369 ITDEDKKDEPDAKKKKT---PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMD 424
+ DE K A KK E+ +LWN DN EAC S RDF PSLE +F+ + Q D
Sbjct: 375 LGDEVKAAM--ASKKVVLGDSEMNRLWNLCPDNWEACSSPRRDFVPSLEKFFDIGVDQAD 432
Query: 425 PAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPS 484
PAAA + + K + SN WR+LRLLS+K PHFF ++ ++ ++E+MV R KE P
Sbjct: 433 PAAAAKSRAKSMYQSNSGWRSLRLLSQKSPHFFTSSNQPIKSLPLYLESMVMRLAKELPL 492
Query: 485 S--QISGNGNGVDQDPAEVEVDTKSEEI---QEEEKEEDWEAKADPEGDADEVMSVEYCY 539
S + + + D+ + +E+ EE KE EA E A S +
Sbjct: 493 STQNTAASSHQSDEKTDMADYGDSEDELLKNHEETKEAAGEAAPSTESVAAGGASAKDTL 552
Query: 540 QR 541
+R
Sbjct: 553 KR 554
>gi|427788977|gb|JAA59940.1| Putative nuclear matrix protein [Rhipicephalus pulchellus]
Length = 672
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 235/522 (45%), Positives = 319/522 (61%), Gaps = 36/522 (6%)
Query: 40 IRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLF 99
I LL++ D +I+ I+L E D+C+ +LP++LL D FD L++ E+ F
Sbjct: 61 ILSVLLRIGLNKDTANHLIDLSITLARE---PDLCSGSLPVLLLCDVFDSLPLNESEEFF 117
Query: 100 YYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERS 159
VE V+IWK+ FF CKN LLR CNDLLRRLSRSQNTVFCG+IL+FLAK FP SERS
Sbjct: 118 SLVEDKVSIWKEDLFFKCCKNQLLRTCNDLLRRLSRSQNTVFCGKILVFLAKLFPLSERS 177
Query: 160 GLNIISEFNVENITEFGGDE--EMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFR 217
GLNI SEFN ENIT F ++ + S+ EE++ V +D+N Y+KFWSLQ+YFR
Sbjct: 178 GLNIASEFNAENITLFSSEDYKGLSDESSYNEESEEGSTSSVPVDYNLYRKFWSLQEYFR 237
Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS----------LNPSGDYFAKYLTN 267
P+ CY KV W+ FTSY+ VL+ F S+KLDD++++ + YFAKYLT+
Sbjct: 238 QPLLCYTKVHWRQFTSYSTDVLSVFGSFKLDDIKTAQWALPGSSSSPSGKSVYFAKYLTS 297
Query: 268 QKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLI 327
QKLL+L+LSD+NFRRYVL+QFLILFQY STV+ + + L +Q WVK+T VY L+
Sbjct: 298 QKLLELELSDSNFRRYVLVQFLILFQYLQSTVRFKLDTHILTEEQSSWVKNTLTQVYKLL 357
Query: 328 KQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELK-----------RPLTSITDEDKKD 376
++TPPDG F+ +K IL+ EE WN WKNEGCP+ K RP + DE K
Sbjct: 358 EETPPDGAGFATRIKHILQREESWNAWKNEGCPDFKEARPVEQTREVRPKRKLGDEVKA- 416
Query: 377 EPDAKKKKT---PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQ 432
A K+ E+ +LWN DN EAC S +RDF PSLE +F+ A+ Q DP A + +
Sbjct: 417 -ALASKRVVLGDTEMNRLWNLCPDNWEACSSPKRDFVPSLEKFFDMAMDQGDPVAGAKSR 475
Query: 433 YKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP-SSQISGNG 491
K + SN WRALRLLS+K PHFF ++ ++ ++E+MV R KE P ++Q +
Sbjct: 476 AKSIYQSNSGWRALRLLSQKSPHFFTSSNQPIKSLPLYLESMVMRLAKEFPINTQNAAPS 535
Query: 492 NGVDQDPAEVEVDTKSEEI---QEEEKEEDWEAKADPEGDAD 530
D+ + +E+ EE ++ + + G AD
Sbjct: 536 QAADEKTDMADYGDSEDELLKNHEESRDRETGTLSSTGGAAD 577
>gi|326917493|ref|XP_003205033.1| PREDICTED: THO complex subunit 1-like [Meleagris gallopavo]
Length = 540
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 221/448 (49%), Positives = 286/448 (63%), Gaps = 48/448 (10%)
Query: 114 FFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENIT 173
F+ + KN LLRMCNDLLRRLS+SQNTVFCGRI LFLA+ FP SE+SGLN+ S+FN+EN+T
Sbjct: 2 FYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQFNLENVT 61
Query: 174 EFGG---------------DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRN 218
F DE MDV E + + + ID+N Y+KFWSLQDYFRN
Sbjct: 62 VFNTNEHESTLGQKHSEERDEGMDVEEGEMGDDEAPTSCSIPIDYNLYRKFWSLQDYFRN 121
Query: 219 PVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKL 270
PVQCY KVSWK F Y+E VLA FKSYKLDD Q+S L G+ YFAK+LT++KL
Sbjct: 122 PVQCYEKVSWKTFLKYSEEVLAVFKSYKLDDTQASRKKLEELKTGGEHVYFAKFLTSEKL 181
Query: 271 LDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQT 330
+DLQLSD+NFRR++LLQ+LILFQY VK + L +Q W++DTT+ VY L+ +
Sbjct: 182 MDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKAVYQLLSEN 241
Query: 331 PPDGEHFSQVVKLILKGEEHWNQWKNEGCP----ELKRPLTSITDEDKKDEPDAKKKKTP 386
PPDGE FS++V+ IL EE+WN WKNEGCP E +P + D K A+K+ P
Sbjct: 242 PPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERSQPEFQLGPPDSKPMRPARKRPAP 301
Query: 387 -----------------ELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAA 428
ELT+LWN DN+EACKS R++ P+LE +FEEAI+Q DP
Sbjct: 302 EDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSESREYMPTLEEFFEEAIEQADPENM 361
Query: 429 VEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE--KPSSQ 486
VE +YK VN+SNY WRALRLL+R+ PHFF + E++ENMV + KE PS +
Sbjct: 362 VENKYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLENMVIKLAKELPPPSEE 421
Query: 487 I-SGNGNGVDQDPAEVEVDTKSEEIQEE 513
I +G + + A ++ + +S E+Q +
Sbjct: 422 IKTGEDEDEEDNDALLKENNESPEVQRD 449
>gi|395823221|ref|XP_003784889.1| PREDICTED: THO complex subunit 1 [Otolemur garnettii]
Length = 629
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 228/498 (45%), Positives = 296/498 (59%), Gaps = 72/498 (14%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRVVRKRTAPEDFLGKGPSKKLLMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R VN+SNY WRALRLL+R+ PHFF +
Sbjct: 457 R----------------------------AVNNSNYGWRALRLLARRSPHFFQPTNQQFK 488
Query: 466 KNSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 489 SLPEYLENMVIKLAKELP 506
>gi|395749750|ref|XP_002828149.2| PREDICTED: LOW QUALITY PROTEIN: THO complex subunit 1 [Pongo
abelii]
Length = 625
Score = 413 bits (1062), Expect = e-112, Method: Compositional matrix adjust.
Identities = 219/451 (48%), Positives = 286/451 (63%), Gaps = 44/451 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPSKKILMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKV 436
R++ P+LE +FEEAI+Q DP VE +YK V
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAV 487
>gi|260829829|ref|XP_002609864.1| hypothetical protein BRAFLDRAFT_126015 [Branchiostoma floridae]
gi|229295226|gb|EEN65874.1| hypothetical protein BRAFLDRAFT_126015 [Branchiostoma floridae]
Length = 561
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 213/444 (47%), Positives = 292/444 (65%), Gaps = 41/444 (9%)
Query: 78 LPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQ 137
+P ++L D F TLD CE++F +VE V WK++ ++ KN LLRMCNDLLRRLSRSQ
Sbjct: 2 IPFLMLDDVFSSVTLDVCERVFKFVEDGVPTWKKEPYYTGGKNYLLRMCNDLLRRLSRSQ 61
Query: 138 NTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG--GDEEMDVSSNETEETDTEE 195
NTVFCGRI LFLA+ FP E+SGLN+ S FN+EN+T + GD+ S+ +E+ D +E
Sbjct: 62 NTVFCGRIQLFLARLFPLEEKSGLNLQSHFNLENVTSYNLKGDD----STMRSEDRDRDE 117
Query: 196 VDK-------------VKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF 242
+ ID+N Y+KFWS+QD+FRNP QCY K WK F S ++ VL F
Sbjct: 118 SMEMEEGEMSDSHSSFTPIDYNLYRKFWSIQDFFRNPNQCYKKDLWKSFASNSDEVLKCF 177
Query: 243 KSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY 294
+S+KLDD+Q+S L G YFAKYLT++KLLDLQL+D+NFRRYVL+QFLILFQY
Sbjct: 178 ESHKLDDIQASKKKKEMLAGGGGNVYFAKYLTSEKLLDLQLADSNFRRYVLVQFLILFQY 237
Query: 295 FTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQW 354
+ VK + L + +Q W+K+ T+ VY L+++TPPDG+HF+++V+ L+ EE+W W
Sbjct: 238 LNAEVKFKAPNLVVSDEQALWIKNATDMVYKLLRETPPDGDHFAKMVQHTLEREENWITW 297
Query: 355 KNEGCP-----------ELKRPLTSITDEDKKDEPDAKK--KKTPELTKLWN-SKDNLEA 400
KN+GCP LK+ ED K+ D+K+ + ELT+LWN DN+EA
Sbjct: 298 KNKGCPTYDRERPKTETPLKKSRKRAMGEDIKEAEDSKRVNMGSAELTRLWNLCPDNMEA 357
Query: 401 CKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNA 460
C S ER+F PSLE +FEEAI+Q DP A VE++YK VN+ N+ +RAL+LL+ + PHFF
Sbjct: 358 CSSPEREFLPSLEEFFEEAIEQSDPEAMVEDEYKVVNNPNFGFRALKLLACRSPHFFQPT 417
Query: 461 TPNVEKNSEFIENMVKRCVKEKPS 484
+ ++EN++ + KE PS
Sbjct: 418 NNQFKTLPSYLENILSKLAKELPS 441
>gi|443734901|gb|ELU18757.1| hypothetical protein CAPTEDRAFT_184685 [Capitella teleta]
Length = 600
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 216/503 (42%), Positives = 310/503 (61%), Gaps = 49/503 (9%)
Query: 70 MKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDL 129
+ D+C ++ ++LSD FD TLD+C+ +F +VE V++WK TF+ + KN LLRMCNDL
Sbjct: 10 LSDICGTSTLFMMLSDLFDSITLDQCDSVFKFVEERVSLWKSDTFYNAGKNYLLRMCNDL 69
Query: 130 LRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG-----DEEMDVS 184
LRRLS+SQNTVFCGRI LFLA+ FP SERSGLN++S+FN+EN+T + DE S
Sbjct: 70 LRRLSKSQNTVFCGRIHLFLAQLFPLSERSGLNLMSQFNLENVTLYNTKAEDYDELRKSS 129
Query: 185 SNETEET-DTEEVD------KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
+E EE+ D EE + + ID+ Y+KFWSLQD+FR P+QCY+K SWK F +
Sbjct: 130 VDEKEESMDIEEGEMQDSSSAIPIDYQLYRKFWSLQDFFRKPIQCYDKNSWKAFEGHTAD 189
Query: 238 VLAAFKSYKLDDVQSS---------LNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQF 288
VL F S+KLD++Q S PS ++FAKYLT++KLLDLQLSD+NFRRYVL+QF
Sbjct: 190 VLNCFTSFKLDEMQISDKRSKKSGRAKPSQNFFAKYLTSEKLLDLQLSDSNFRRYVLIQF 249
Query: 289 LILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
LI+FQY + VK + L DQ ++KD E + ++++TPPDG F++ ++ IL E
Sbjct: 250 LIMFQYLHTQVKFKSPAQTLSEDQTTFIKDIRERILKMLQETPPDGPKFAKTIEHILARE 309
Query: 349 EHWNQWKNEGCPELKRPLT-----------------SITDEDKKDEPDAKKKKTPELTKL 391
E+WN+WKNEGC R L+ +I D+ K K ELT+L
Sbjct: 310 EYWNRWKNEGCQSYVRQLSEDDCKARSKFQTKHKRKNIGDDLKASGGKLMKLGNSELTRL 369
Query: 392 WN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLS 450
WN DNL+AC+ R+F P+L YFEE+I Q DP + +E YK +S + W++LR+L+
Sbjct: 370 WNLCPDNLDACRLESRNFMPTLREYFEESIDQADPESGIEAGYKLTENSTFGWKSLRVLA 429
Query: 451 RKCPHFFL-----NATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDT 505
PHFF +A +K +E++++MV++ + EK ++ G G + A+ +D
Sbjct: 430 HSSPHFFAQPTGGSANQGSKKMTEYLDSMVQKIIDEKAVAE----GKGKEDVDADKNLDN 485
Query: 506 KSEEIQEEEKEEDWEAK-ADPEG 527
++ E ++D + A P G
Sbjct: 486 TADMDDNLEDKDDASTRPASPAG 508
>gi|312375546|gb|EFR22901.1| hypothetical protein AND_14033 [Anopheles darlingi]
Length = 736
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 205/402 (50%), Positives = 267/402 (66%), Gaps = 40/402 (9%)
Query: 3 DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYI 62
+S + F + + ++ Y + T DKK+AL Q+ R LL L + DI IE +I
Sbjct: 37 ESLKRAFKDNDIELLKADYDAVRAT---DKKTALHQAFRDMLLTLTE----DIPAIEAFI 89
Query: 63 SLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL 122
+ V C KDM +T+P++LL D FD TLD+ EQ+F YVE NV WK++ FF +CK+NL
Sbjct: 90 NFAVASCRKDMTMATMPVVLLGDCFDAVTLDRAEQIFTYVENNVATWKEEFFFTACKHNL 149
Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG--GDE- 179
LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFN+ENITE+G G+E
Sbjct: 150 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNLENITEYGMEGNET 209
Query: 180 EMDVSSNETEETD-------TEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFT 232
+D ++ TD E +K+KID+N Y KFW+LQD+FRNP QCYNKV WK F
Sbjct: 210 TLDQLTSGNGATDDTEGTEKETEENKLKIDYNLYCKFWALQDFFRNPNQCYNKVQWKTFA 269
Query: 233 SYAETVLAAFKSYKLDDVQSS-----------------------LNPSGDYFAKYLTNQK 269
++A +VL+AF S+KL++ +SS + +G +FAK+LTN K
Sbjct: 270 THAGSVLSAFSSFKLEEHRSSTGSAKLTEGGTSSSDCPPMDVDQIREAGHFFAKFLTNPK 329
Query: 270 LLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQ 329
LL LQLSD+NFRR VL+QFLILFQY STVK + + L Q W+K+T VY L+++
Sbjct: 330 LLSLQLSDSNFRRSVLVQFLILFQYLNSTVKFKADAHVLTQAQSDWLKETETLVYRLLEE 389
Query: 330 TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD 371
+PP+G+ F++ V+ +L EE WN WKNEGC E KRP + D
Sbjct: 390 SPPNGKKFAETVRHMLTREELWNSWKNEGCKEFKRPEAVLDD 431
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 39/98 (39%), Positives = 59/98 (60%), Gaps = 15/98 (15%)
Query: 386 PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWR 444
PE+T+LWN DNL+ACK +R+F PSLE+Y E ++ DP+ + WR
Sbjct: 497 PEITRLWNVCPDNLQACKGTDRNFLPSLETYLENPKEKQDPS--------------FEWR 542
Query: 445 ALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEK 482
ALRLL+R+ PHFF K S++++++ K+ K+K
Sbjct: 543 ALRLLARQSPHFFTLFNAPSYKVSDYLDSVRKKIQKDK 580
>gi|391328907|ref|XP_003738924.1| PREDICTED: THO complex subunit 1-like [Metaseiulus occidentalis]
Length = 657
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 209/477 (43%), Positives = 295/477 (61%), Gaps = 30/477 (6%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
+ N + L V LC ++ + LP + +D FD T+ +CE++F+ +E V +WK+ FF
Sbjct: 73 LTNLVDLLVSLCEREFASPNLPATIFTDIFDTLTISRCEKVFHLMEDKVEVWKKPLFFSH 132
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
CKN +LRMCNDLLRRLS+SQNTVFCG+IL+ LAK FP SERSGLNI+SEFN ENI+
Sbjct: 133 CKNQMLRMCNDLLRRLSKSQNTVFCGQILIVLAKLFPLSERSGLNIVSEFNSENIS---- 188
Query: 178 DEEMDVSSNETEETDTEEVD-KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAE 236
+ + E + DT D +K+ N Y+KFWSLQD+FR P CY++ W+ T +
Sbjct: 189 -----IHNVEDGDADTSSDDGHLKVSHNLYRKFWSLQDFFRQPHLCYDENHWRQMTRCSS 243
Query: 237 TVLAAFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT 296
VL F S+KLD+ Q+ L+ S YFAKYLT+++L L+LSD+NFRR +L+QFLI+FQY
Sbjct: 244 EVLTVFSSFKLDE-QAVLSKSQPYFAKYLTSRRLFQLELSDSNFRRCILIQFLIMFQYLK 302
Query: 297 STVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN 356
+TV+ + E L +Q +WV++ TE +Y L+++TPPDG F++ ++ IL EEHWN WKN
Sbjct: 303 ATVRFKLEKHFLSENQAQWVREKTEEIYKLLQETPPDGLEFARAIENILVREEHWNAWKN 362
Query: 357 EGCPELK----------RPLTSITDEDKKD-EPDAKKKKTPELTKLWN-SKDNLEACKSA 404
+ CP K R + D+ ++ E PEL K++N D+ EAC+
Sbjct: 363 DNCPAFKKADVMGRKCGRRRKKVGDDVREHLEKGQIAMGGPELNKVFNICPDHWEACRVE 422
Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNV 464
R + P + +FEEAI Q DPAA VE+QYK V + N+ WRALR+LS+K HFF +
Sbjct: 423 RRCYVPGMTEFFEEAIAQADPAACVEDQYKLVANPNWGWRALRILSQKSLHFFTSTNQPA 482
Query: 465 EKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEA 521
+ +++ MV K+ ++ NGV VE D S Q++ KEED EA
Sbjct: 483 KSLPNYLDTMVGFLAKDLCINRDPKEPNGV----VRVEDDDNS---QDDFKEEDSEA 532
>gi|442751601|gb|JAA67960.1| Putative nuclear matrix protein [Ixodes ricinus]
Length = 566
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 209/447 (46%), Positives = 271/447 (60%), Gaps = 63/447 (14%)
Query: 70 MKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDL 129
+K++C+ +LP +L D FD L++ E+ F VE V+ WK++ FF +CKN LLR CNDL
Sbjct: 12 LKEVCSGSLPGLLFCDMFDSLPLNESEEFFSLVEDKVDTWKEELFFKNCKNQLLRTCNDL 71
Query: 130 LRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG----------GDE 179
LRRLSRSQNTVFCG+IL+FLAK FP SERSGLNIISEFN ENIT F G
Sbjct: 72 LRRLSRSQNTVFCGKILVFLAKLFPLSERSGLNIISEFNSENITLFSSEDYKGMPMKGSP 131
Query: 180 EMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVL 239
+ D+ ET+ D V +D+N Y+KFWSLQ+YFR P CYNKV W+ FTSY+ VL
Sbjct: 132 DDDLEEGETD--DMSSTAAVSVDYNLYRKFWSLQEYFRQPTLCYNKVHWRQFTSYSSDVL 189
Query: 240 AAFKSYKLDDVQS----------SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
+ F S+KLDD++S +L+ YFAKYLT+QKLL+L+LSD+NFRRYVL+QFL
Sbjct: 190 SVFGSFKLDDIKSAQWTLPEDAAALSGKSVYFAKYLTSQKLLELELSDSNFRRYVLVQFL 249
Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
IL QY STV+ + E L +Q WVK+T VY L+++TPPDG F+ +K IL+ EE
Sbjct: 250 ILIQYLQSTVRFKQESQVLTEEQSTWVKNTMSLVYKLLEETPPDGVGFAANIKHILQREE 309
Query: 350 HWNQWKNEGCPELK-----------RPLTSITDEDKKDEPDAKK-KKTPELTKLWN-SKD 396
WN WKNEGCP+ K RP + DE K + K E+ +LWN D
Sbjct: 310 FWNTWKNEGCPDFKEAKPVDVVKEGRPKRKLGDEIKAALANKKVILGNAEMNRLWNLCPD 369
Query: 397 NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHF 456
N EAC S +RDF PSLE +FE A++Q +PA HF
Sbjct: 370 NWEACTSPKRDFVPSLEKFFETAVEQAEPA----------------------------HF 401
Query: 457 FLNATPNVEKNSEFIENMVKRCVKEKP 483
F ++ ++ ++E+MV R KE P
Sbjct: 402 FTSSNQPIKSLPLYLESMVLRLAKELP 428
>gi|119622139|gb|EAX01734.1| THO complex 1, isoform CRA_d [Homo sapiens]
Length = 468
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 204/424 (48%), Positives = 265/424 (62%), Gaps = 44/424 (10%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + TS T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILMGNEELTRLWNLCPDNMEACKSET 456
Query: 406 RDFT 409
R T
Sbjct: 457 RQGT 460
>gi|148691058|gb|EDL23005.1| THO complex 1, isoform CRA_c [Mus musculus]
Length = 473
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 202/421 (47%), Positives = 263/421 (62%), Gaps = 44/421 (10%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 46 PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 101
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 102 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 161
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 162 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 221
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 222 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 281
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 282 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 341
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 342 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 401
Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
P + S T K+ P+ K P ELT+LWN DN+EACKS
Sbjct: 402 PSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCPDNMEACKSET 461
Query: 406 R 406
R
Sbjct: 462 R 462
>gi|196015135|ref|XP_002117425.1| hypothetical protein TRIADDRAFT_1612 [Trichoplax adhaerens]
gi|190579954|gb|EDV20041.1| hypothetical protein TRIADDRAFT_1612 [Trichoplax adhaerens]
Length = 517
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 205/489 (41%), Positives = 287/489 (58%), Gaps = 62/489 (12%)
Query: 31 DKKSALEQSIRQYLLKLI-----KTPDIDIKVIE-NYISLCVELCMKD-MCNSTLPIILL 83
+ KSA++Q+ R L +LI T D +K+ + ++ +C K + + +LP ILL
Sbjct: 2 ESKSAIDQAFRDLLKELIVQKQKGTDDNQVKIDDLTWLIKISTICAKSGLASVSLPFILL 61
Query: 84 SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
D FD TL C+++F YVE + WK F K +LRMCNDLLRRLSRSQ+T+FCG
Sbjct: 62 VDIFDCLTLRVCQRMFGYVEKEMQTWKDTAFDAMGKVYILRMCNDLLRRLSRSQDTIFCG 121
Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEF-----------GGDEEMDVSSNETEETD 192
+I LFL+++FP ERSGLN++S FNV+N+T + + MDV E E D
Sbjct: 122 QIRLFLSRYFPLDERSGLNVMSHFNVDNVTSYTKRDMVDSQTASAEGVMDVEDGEISEGD 181
Query: 193 TEEV----DKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL- 247
+E +K ID+N Y +FWSLQD+F NP CY+ W T+ + VL FKSYKL
Sbjct: 182 DDEKKAEKNKTPIDYNLYYRFWSLQDFFSNPNLCYSNSGWNALTTSSRVVLDVFKSYKLE 241
Query: 248 ------------------------DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRY 283
D ++++ S YFAK+LTN+KLLDLQL+D+ FRR
Sbjct: 242 ALTSNKSTKTDNDNPSDQDGMEISDSCDNAMDVSNRYFAKFLTNEKLLDLQLNDSQFRRT 301
Query: 284 VLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKL 343
+L+QFLILFQY TSTVK R + Q++W++ +T+ VY L+++TPP G+ F ++V+
Sbjct: 302 ILIQFLILFQYLTSTVKFRPTSCTISERQQQWIERSTKLVYQLLEETPPGGKQFVKLVRE 361
Query: 344 ILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKK--------------KTPELT 389
IL EE+W++WKN+GCP RP S D P K+K PELT
Sbjct: 362 ILSREEYWSKWKNDGCPSFVRPGVSTESSDSSSAPRKKRKISRHYSPTSHKIELGNPELT 421
Query: 390 KLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRL 448
+LWN S DNL ACK + FTP+++ +FEEAI+Q DP A VEE+YK VN+ + WR+ R+
Sbjct: 422 RLWNLSTDNLSACKMSGEKFTPTIKDFFEEAIEQCDPEAQVEEEYKLVNNLVFRWRSFRI 481
Query: 449 LSRKCPHFF 457
L+ + F
Sbjct: 482 LAVQSSIIF 490
>gi|340368679|ref|XP_003382878.1| PREDICTED: THO complex subunit 1-like [Amphimedon queenslandica]
Length = 535
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 212/500 (42%), Positives = 295/500 (59%), Gaps = 51/500 (10%)
Query: 29 DYDKKSALEQSIRQYLLKLI-KTPDIDIKVIENYISL-CVELCMKDMCNSTLPIILLSDT 86
D DKKS EQS R L LI K P + +S+ C E +C ++P +LLSD
Sbjct: 39 DNDKKSIFEQSFRDVLRDLIAKDPLVSSFSSLVSLSVLCAE---SQICYHSVPFLLLSDA 95
Query: 87 FDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRIL 146
FD TLD CE+LF VE +V +W +Q FF S + LLRMCNDLLRRLS S NTVFCGRI
Sbjct: 96 FDCITLDFCEKLFDTVESHVTVWTKQDFFSSGRILLLRMCNDLLRRLSSSLNTVFCGRIQ 155
Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEF----------------GGDEEMDVSSNETEE 190
LFLA+ FP SE+S LN++S FN+EN+T F GG+ D E +
Sbjct: 156 LFLARLFPLSEKSALNLMSHFNLENVTSFKTLQTPPSSSEPQKPPGGNTNEDGEDMEIDA 215
Query: 191 TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
D +E +D+N Y+K W+LQD+FR+PVQC++ WK FT + VL +F SYKL+D+
Sbjct: 216 LD-QETGMTPVDYNLYEKLWTLQDFFRHPVQCFHSDRWKKFTGTVQEVLQSFSSYKLEDI 274
Query: 251 QS-------------SLNP--SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYF 295
QS +L+P S YF KYLT++ LL LQLSD++FRR+VL+QFLILFQY
Sbjct: 275 QSKRRKLQSSDEGMDTLSPYTSSHYFTKYLTSENLLSLQLSDSHFRRHVLIQFLILFQYL 334
Query: 296 TSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWK 355
T VK + L WV +TT VY L+++TPPDG+ F++ VKL+++ EE W WK
Sbjct: 335 TGDVKFKNSSQVLSESHSSWVTETTSKVYQLLEETPPDGKSFAEYVKLVIQREETWIAWK 394
Query: 356 NEGCPELK----RPLTSITDEDKKDE--------PDAKKKKTPELTKLWN-SKDNLEACK 402
N GCP + +P D K+ + K + EL++LWN DNLE+C
Sbjct: 395 NGGCPSFEKTNSKPPNRGGGGDTKERLGERLFAMKEEKDMGSKELSRLWNLCPDNLESCS 454
Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT- 461
S ER F P LE + EE +++M P + ++ + +++ N+ W+ALRLLS + HFF A
Sbjct: 455 SPERIFVPPLEIFLEEPLEEMKPNCEIPKEDRWISNPNFRWQALRLLSCESTHFFPIANQ 514
Query: 462 PNVEKNSEFIENMVKRCVKE 481
++ E++++++ + KE
Sbjct: 515 SSMRPLEEYLDHVILQTSKE 534
>gi|198432183|ref|XP_002123801.1| PREDICTED: similar to THO complex 1 [Ciona intestinalis]
Length = 744
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 220/594 (37%), Positives = 323/594 (54%), Gaps = 96/594 (16%)
Query: 21 YALYKPTCDYDKKSALEQSIRQYLLK--LIKT-PDIDIKVIENYISLCVELCMKDMCNST 77
Y Y P + +KK+A +Q++R + + L KT +I+I + I++ V + + +C+ +
Sbjct: 40 YGKY-PGSEQEKKTATDQTLRDLIREEVLSKTHQEIEITRTLHVINVIVAVAKQGLCSPS 98
Query: 78 LPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQ 137
+P LL D D STLD C+ +F YVE V +WK F+ + KN LLRMCNDLLRRLS+SQ
Sbjct: 99 VPFTLLGDVLDCSTLDTCDVIFKYVEERVAVWKSPVFYTAGKNYLLRMCNDLLRRLSQSQ 158
Query: 138 NTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGD---EEMDVSSNETEE-TDT 193
NTVFCGRI LFLA+ FP SE+S LN++S+FN++N+T F D +DV+ +EE +T
Sbjct: 159 NTVFCGRIQLFLARLFPLSEKSALNLVSQFNLDNVTIFNTDPSNSTLDVNEEASEEKMET 218
Query: 194 EEVDKVK-----IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
+EV K K +D+ Y WS+QD FR+PVQCY K +W T A+ + F S+KLD
Sbjct: 219 DEVKKEKSESGLVDYKLYLNIWSIQDIFRSPVQCYQKETWDKLTENADNIFRVFDSFKLD 278
Query: 249 DVQSS-----LNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRG 303
D++ YF+K+LT++KL +LQLSD FRR +L QFLILF+Y T+TVK +G
Sbjct: 279 DIRGGNKTNQTTHKKSYFSKFLTSEKLTELQLSDGMFRRQILTQFLILFRYLTATVKFKG 338
Query: 304 EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELK 363
L L Q+ W+ T +Y L+++TPP+G+ F++ K +L+ EE W WKN GCP
Sbjct: 339 ANLVLTEAQKTWISTTETKIYKLLQETPPNGKKFAKYTKHLLEREEFWITWKNSGCPSFM 398
Query: 364 RPLTSITDE---DKKDEPDAKKKK--------------TPELTKLWNSKDN--------- 397
+ + DE KK + D KK +PELTKLWN N
Sbjct: 399 KMRSKSEDESPASKKRKIDPKKSSIAEDFISGKSLGMGSPELTKLWNLCPNNLAACRSNE 458
Query: 398 ------------------------------------------LEACKSAERDFTPSLESY 415
L AC+S ER+F PS++ Y
Sbjct: 459 REFLPSIQDYFAEPIEQEDPKNQIEDQYNPELTKLWNLFPDNLAACRSNEREFLPSIQDY 518
Query: 416 FEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNS--EFIEN 473
F E I+Q DP +E+QYK + +SN+ W+ALRLLS + FF + N++ + +++
Sbjct: 519 FAEPIEQEDPKNQIEDQYKLIKNSNFQWKALRLLSNRSQLFFTSQPANLQFKTVPDYLST 578
Query: 474 MVKRCVKEKPSSQIS------GNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEA 521
++ + KE P+ NG+ V +++ EI E++ ED +A
Sbjct: 579 VIGKLAKEFPTQSTEEIKVEVSNGDLV--GVGDLKTTMTVSEINNEDEGEDEDA 630
>gi|449663492|ref|XP_002156883.2| PREDICTED: THO complex subunit 1-like [Hydra magnipapillata]
Length = 567
Score = 368 bits (944), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 205/458 (44%), Positives = 275/458 (60%), Gaps = 42/458 (9%)
Query: 73 MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRR 132
C LPI+LLSD FD TL CE++F Y+E NV++W + FF + KN+LLRMCND+LRR
Sbjct: 107 FCTKNLPILLLSDIFDTMTLCDCEKMFKYIEKNVSVWNAEPFFSTGKNHLLRMCNDMLRR 166
Query: 133 LSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMD---VSSNETE 189
LS+SQNTVFCGRI LFLA+ FP E+S LN++S F++EN+T F E++D + N++
Sbjct: 167 LSKSQNTVFCGRIQLFLAQLFPVEEKSALNLMSNFHLENVTLFKT-EKLDFDEIQPNDSV 225
Query: 190 ETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD- 248
E D E V+ +D+N Y KFW+LQDYF NP +CY K W + + VL F S+KL+
Sbjct: 226 E-DRELVNN-PVDYNLYIKFWTLQDYFVNPSKCYTKEGWNLINTNISEVLKIFTSFKLEF 283
Query: 249 ------------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT 296
DV + +YFAKYLTN+KL + QL+D+NFRR L+Q LILFQY +
Sbjct: 284 NVRKRKSIDMPIDVDQIKQTANNYFAKYLTNEKLFNFQLNDSNFRRNFLVQVLILFQYLS 343
Query: 297 STVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN 356
T K + L Q W+ +TT+ VY+ I +TPPDG+ F + +K IL EE+W WKN
Sbjct: 344 GTSKFKAPNQVLNDTQTNWINETTDIVYNSICETPPDGKKFCETIKHILSREENWINWKN 403
Query: 357 EGCPEL-------------KRPLTSITDEDKKDEPDAKKKKTPELTKLWN-SKDNLEACK 402
EGCP K+ SI D+ ++ T ELT+LWN + DNL + K
Sbjct: 404 EGCPSFAKVKSSTETEVAPKKAKRSIGDDFVLNQSKKINMGTLELTRLWNLNPDNLASAK 463
Query: 403 SAER-DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF---- 457
R F P+L+ +FE+AI+Q DP A +E++YK V N+ WRALRLL+R+ HFF
Sbjct: 464 VESRLKFLPNLKDFFEKAIEQADPNAGIEDEYKLVKQPNFQWRALRLLARRSDHFFTPSQ 523
Query: 458 --LNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNG 493
A P E S IEN+ K K+KPS NG
Sbjct: 524 TPFKALP--EYLSSVIENISKDMTKDKPSCDSKSLENG 559
>gi|74217198|dbj|BAC31387.2| unnamed protein product [Mus musculus]
gi|74226598|dbj|BAE23950.1| unnamed protein product [Mus musculus]
Length = 397
Score = 359 bits (922), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 182/361 (50%), Positives = 238/361 (65%), Gaps = 30/361 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
K + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396
Query: 360 P 360
P
Sbjct: 397 P 397
>gi|226487124|emb|CAX75427.1| putative UDP-galactose-4-epimerase [Schistosoma japonicum]
Length = 632
Score = 341 bits (875), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 190/466 (40%), Positives = 266/466 (57%), Gaps = 44/466 (9%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
DKKS ++ ++R +L+L+ + +D ++ + + V + ++C++ P +L SD F+
Sbjct: 41 DKKSLIDVALRGRMLRLLASDYMDD--VKTLLKMGVAAALANLCSAPTPFLLFSDVFNTK 98
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
+ CE++F ++E + K F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL
Sbjct: 99 PISLCEEMFGFMEETIITLKDVPLFGSGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 158
Query: 151 KFFPFSERSGLNIISEFNVE---------NITEFGGDEEMDVSSNETEETD-TEEVDKVK 200
+ FP E+SGLN +S FN E + + F D +S++ EE + T+ ++
Sbjct: 159 RLFPLDEKSGLNFMSNFNSEKDILYNKNPDPSVFKHQISHDHTSDDLEEGEMTDSSTPLE 218
Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL-----DDVQSSLN 255
+D Y KFWSLQ++F++PV CY K W FTS +TVL F S KL D +S
Sbjct: 219 VDAGLYVKFWSLQEFFKSPVLCYEKAKWLKFTSSTDTVLDVFSSIKLITTEEGDFRS--Q 276
Query: 256 PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
S F+KYLT++KLLDLQL D +FRRY+L+Q LILFQY T+ VK + L DQ W
Sbjct: 277 SSSKTFSKYLTSEKLLDLQLMDPSFRRYILVQLLILFQYLTTAVKFKTIDQVLSEDQHSW 336
Query: 316 VKDTTETVYSLIKQ---TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
V E V L+ P F V+ IL+ E +WN+WKN+GCP I +
Sbjct: 337 VNQRHEVVLRLLSSNNTNSPSDNTFVSTVEHILERESYWNRWKNDGCPSF------IRNP 390
Query: 373 DKKDEPDAKKKKTPELT---------------KLWN-SKDNLEACKSAERDFTPSLESYF 416
+K K+ P +T KLWN DNL AC+ R F L SYF
Sbjct: 391 EKSKLSVRKRHINPLVTRAGQKVYRFGNRALDKLWNVCPDNLAACRDKRRVFDTDLHSYF 450
Query: 417 EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
++AI +MDPA VEE+YK +N ++WRALR LSRKCPHF++N P
Sbjct: 451 QDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYINWNP 496
>gi|226487128|emb|CAX75429.1| putative UDP-galactose-4-epimerase [Schistosoma japonicum]
Length = 639
Score = 341 bits (874), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 190/466 (40%), Positives = 266/466 (57%), Gaps = 44/466 (9%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
DKKS ++ ++R +L+L+ + +D ++ + + V + ++C++ P +L SD F+
Sbjct: 41 DKKSLIDVALRGRMLRLLASDYMDD--VKTLLKMGVAAALANLCSAPTPFLLFSDVFNTK 98
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
+ CE++F ++E + K F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL
Sbjct: 99 PISLCEEMFGFMEETIITLKDVPLFGSGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 158
Query: 151 KFFPFSERSGLNIISEFNVE---------NITEFGGDEEMDVSSNETEETD-TEEVDKVK 200
+ FP E+SGLN +S FN E + + F D +S++ EE + T+ ++
Sbjct: 159 RLFPLDEKSGLNFMSNFNSEKDILYNKNPDPSVFKHQISHDHTSDDLEEGEMTDSSTPLE 218
Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL-----DDVQSSLN 255
+D Y KFWSLQ++F++PV CY K W FTS +TVL F S KL D +S
Sbjct: 219 VDAGLYVKFWSLQEFFKSPVLCYEKAKWLKFTSSTDTVLDVFSSIKLITTEEGDFRS--Q 276
Query: 256 PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
S F+KYLT++KLLDLQL D +FRRY+L+Q LILFQY T+ VK + L DQ W
Sbjct: 277 SSSKTFSKYLTSEKLLDLQLMDPSFRRYILVQLLILFQYLTTAVKFKTIDQVLSEDQHSW 336
Query: 316 VKDTTETVYSLIKQ---TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
V E V L+ P F V+ IL+ E +WN+WKN+GCP I +
Sbjct: 337 VNQRHEVVLRLLSSNNTNSPSDNTFVSTVEHILERESYWNRWKNDGCPSF------IRNP 390
Query: 373 DKKDEPDAKKKKTPELT---------------KLWN-SKDNLEACKSAERDFTPSLESYF 416
+K K+ P +T KLWN DNL AC+ R F L SYF
Sbjct: 391 EKSKLSVRKRHINPLVTRAGQKVYRFGNRALDKLWNVCPDNLAACRDKRRVFDTDLHSYF 450
Query: 417 EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
++AI +MDPA VEE+YK +N ++WRALR LSRKCPHF++N P
Sbjct: 451 QDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYINWNP 496
>gi|226487130|emb|CAX75430.1| putative UDP-galactose-4-epimerase [Schistosoma japonicum]
Length = 616
Score = 341 bits (874), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 190/466 (40%), Positives = 266/466 (57%), Gaps = 44/466 (9%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
DKKS ++ ++R +L+L+ + +D ++ + + V + ++C++ P +L SD F+
Sbjct: 41 DKKSLIDVALRGRMLRLLASDYMDD--VKTLLKMGVAAALANLCSAPTPFLLFSDVFNTK 98
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
+ CE++F ++E + K F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL
Sbjct: 99 PISLCEEMFGFMEETIITLKDVPLFGSGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 158
Query: 151 KFFPFSERSGLNIISEFNVE---------NITEFGGDEEMDVSSNETEETD-TEEVDKVK 200
+ FP E+SGLN +S FN E + + F D +S++ EE + T+ ++
Sbjct: 159 RLFPLDEKSGLNFMSNFNSEKDILYNKNPDPSVFKHQISHDHTSDDLEEGEMTDSSTPLE 218
Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL-----DDVQSSLN 255
+D Y KFWSLQ++F++PV CY K W FTS +TVL F S KL D +S
Sbjct: 219 VDAGLYVKFWSLQEFFKSPVLCYEKAKWLKFTSSTDTVLDVFSSIKLITTEEGDFRS--Q 276
Query: 256 PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
S F+KYLT++KLLDLQL D +FRRY+L+Q LILFQY T+ VK + L DQ W
Sbjct: 277 SSSKTFSKYLTSEKLLDLQLMDPSFRRYILVQLLILFQYLTTAVKFKTIDQVLSEDQHSW 336
Query: 316 VKDTTETVYSLIKQ---TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
V E V L+ P F V+ IL+ E +WN+WKN+GCP I +
Sbjct: 337 VNQRHEVVLRLLSSNNTNSPSDNTFVSTVEHILERESYWNRWKNDGCPSF------IRNP 390
Query: 373 DKKDEPDAKKKKTPELT---------------KLWN-SKDNLEACKSAERDFTPSLESYF 416
+K K+ P +T KLWN DNL AC+ R F L SYF
Sbjct: 391 EKSKLSVRKRHINPLVTRAGQKVYRFGNRALDKLWNVCPDNLAACRDKRRVFDTDLHSYF 450
Query: 417 EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
++AI +MDPA VEE+YK +N ++WRALR LSRKCPHF++N P
Sbjct: 451 QDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYINWNP 496
>gi|226487126|emb|CAX75428.1| putative UDP-galactose-4-epimerase [Schistosoma japonicum]
Length = 639
Score = 341 bits (874), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 190/466 (40%), Positives = 266/466 (57%), Gaps = 44/466 (9%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
DKKS ++ ++R +L+L+ + +D ++ + + V + ++C++ P +L SD F+
Sbjct: 41 DKKSLIDVALRGRMLRLLASDYMDD--VKTLLKMGVAAALANLCSAPTPFLLFSDVFNTK 98
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
+ CE++F ++E + K F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL
Sbjct: 99 PISLCEEMFGFMEETIITLKDVPLFGSGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 158
Query: 151 KFFPFSERSGLNIISEFNVE---------NITEFGGDEEMDVSSNETEETD-TEEVDKVK 200
+ FP E+SGLN +S FN E + + F D +S++ EE + T+ ++
Sbjct: 159 RLFPLDEKSGLNFMSNFNSEKDILYNKNPDPSVFKHQISHDHTSDDLEEGEMTDSSTPLE 218
Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL-----DDVQSSLN 255
+D Y KFWSLQ++F++PV CY K W FTS +TVL F S KL D +S
Sbjct: 219 VDAGLYVKFWSLQEFFKSPVLCYEKAKWLKFTSSTDTVLDVFSSIKLITTEEGDFRS--Q 276
Query: 256 PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
S F+KYLT++KLLDLQL D +FRRY+L+Q LILFQY T+ VK + L DQ W
Sbjct: 277 SSSKTFSKYLTSEKLLDLQLMDPSFRRYILVQLLILFQYLTTAVKFKTIDQVLSEDQHSW 336
Query: 316 VKDTTETVYSLIKQ---TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
V E V L+ P F V+ IL+ E +WN+WKN+GCP I +
Sbjct: 337 VNQRHEVVLRLLSSNNTNSPSDNTFVSTVEHILERESYWNRWKNDGCPSF------IRNP 390
Query: 373 DKKDEPDAKKKKTPELT---------------KLWN-SKDNLEACKSAERDFTPSLESYF 416
+K K+ P +T KLWN DNL AC+ R F L SYF
Sbjct: 391 EKSKLSVRKRHINPLVTRAGQKVYRFGNRALDKLWNVCPDNLAACRDKRRVFDTDLHSYF 450
Query: 417 EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
++AI +MDPA VEE+YK +N ++WRALR LSRKCPHF++N P
Sbjct: 451 QDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYINWNP 496
>gi|89267023|emb|CAJ82056.1| THO complex 1 [Xenopus (Silurana) tropicalis]
Length = 421
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 176/367 (47%), Positives = 235/367 (64%), Gaps = 31/367 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIEN---YISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q R L + I I+ EN ISL + +C +T P +L
Sbjct: 54 PGSENEKKPTLDQVFRVVLEEEI----INQVSCENCLAIISLAINGVTDGICTATTPFVL 109
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D L C+++F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 110 LGDVLDCLPLAYCDKIFTFVEKNVGTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 169
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE--------------MDVSSNET 188
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ M++ E
Sbjct: 170 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNANEQESTLGQKVEEKDDGMEIEEGEM 229
Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
+ D + ID+N Y+KFW+LQD+FRNPVQCY+K SWK F +++ VL FKSYKLD
Sbjct: 230 GDDDAPNSSSIPIDYNLYRKFWTLQDFFRNPVQCYDKNSWKTFLKFSDEVLTVFKSYKLD 289
Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
D Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++L+Q+LILFQY VK
Sbjct: 290 DTQASRKKIEELKSGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILVQYLILFQYLKGQVK 349
Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
+ L +Q W++DTT+ V+ L+ +TPP GE FS+ V+ IL EE+WN WKNEGCP
Sbjct: 350 FKSSNYILLDEQSLWIEDTTKLVHQLLSETPPVGEKFSKTVEHILNTEENWNAWKNEGCP 409
Query: 361 EL--KRP 365
+RP
Sbjct: 410 SFVKERP 416
>gi|226487132|emb|CAX75431.1| putative UDP-galactose-4-epimerase [Schistosoma japonicum]
Length = 524
Score = 340 bits (871), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 190/466 (40%), Positives = 266/466 (57%), Gaps = 44/466 (9%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
DKKS ++ ++R +L+L+ + +D ++ + + V + ++C++ P +L SD F+
Sbjct: 41 DKKSLIDVALRGRMLRLLASDYMDD--VKTLLKMGVAAALANLCSAPTPFLLFSDVFNTK 98
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
+ CE++F ++E + K F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL
Sbjct: 99 PISLCEEMFGFMEETIITLKDVPLFGSGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 158
Query: 151 KFFPFSERSGLNIISEFNVE---------NITEFGGDEEMDVSSNETEETD-TEEVDKVK 200
+ FP E+SGLN +S FN E + + F D +S++ EE + T+ ++
Sbjct: 159 RLFPLDEKSGLNFMSNFNSEKDILYNKNPDPSVFKHQISHDHTSDDLEEGEMTDSSTPLE 218
Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL-----DDVQSSLN 255
+D Y KFWSLQ++F++PV CY K W FTS +TVL F S KL D +S
Sbjct: 219 VDAGLYVKFWSLQEFFKSPVLCYEKAKWLKFTSSTDTVLDVFSSIKLITTEEGDFRS--Q 276
Query: 256 PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
S F+KYLT++KLLDLQL D +FRRY+L+Q LILFQY T+ VK + L DQ W
Sbjct: 277 SSSKTFSKYLTSEKLLDLQLMDPSFRRYILVQLLILFQYLTTAVKFKTIDQVLSEDQHSW 336
Query: 316 VKDTTETVYSLIKQ---TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
V E V L+ P F V+ IL+ E +WN+WKN+GCP I +
Sbjct: 337 VNQRHEVVLRLLSSNNTNSPSDNTFVSTVEHILERESYWNRWKNDGCPSF------IRNP 390
Query: 373 DKKDEPDAKKKKTPELT---------------KLWN-SKDNLEACKSAERDFTPSLESYF 416
+K K+ P +T KLWN DNL AC+ R F L SYF
Sbjct: 391 EKSKLSVRKRHINPLVTRAGQKVYRFGNRALDKLWNVCPDNLAACRDKRRVFDTDLHSYF 450
Query: 417 EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
++AI +MDPA VEE+YK +N ++WRALR LSRKCPHF++N P
Sbjct: 451 QDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYINWNP 496
>gi|313220893|emb|CBY31729.1| unnamed protein product [Oikopleura dioica]
Length = 648
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 197/519 (37%), Positives = 289/519 (55%), Gaps = 66/519 (12%)
Query: 3 DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLK-LIKTPDIDIKVIENY 61
+ F +F + N KY+ + D DKK +Q ++ L + L+K DI KVI +
Sbjct: 19 EGFDTDFVSGKPNEFISKYSATHGS-DADKKQVADQCLKVKLQEVLVKYSDIS-KVI-SL 75
Query: 62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM-SCKN 120
++ K +C+ P+ L+SD + STLD CE++F +VE IWK F++ +CKN
Sbjct: 76 LTFATSCARKSLCSVLTPLALISDALECSTLDDCERIFDFVEKQTPIWKSTQFYVNACKN 135
Query: 121 NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE 180
LLR CNDLLRRLS+SQN VFCGRI +FL + FP SE+S LN+ S FN+EN E
Sbjct: 136 QLLRNCNDLLRRLSQSQNNVFCGRIHIFLTRIFPISEKSALNLNSNFNLEN--------E 187
Query: 181 MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
D + E EET+ ++ Y WS+QDYFRNP+Q Y V + F T+
Sbjct: 188 TDYTRIEDEETENRDL---------YNSLWSVQDYFRNPLQLYEPVKFAEFEKAVNTINT 238
Query: 241 AFKSYKLD--------------------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
AF ++KL+ DV+ N YF K+LT+ LL+LQL+D +F
Sbjct: 239 AFANFKLEVNDDEEEKRSKMKEIDYDSIDVEMKSNGQIVYFPKFLTSFNLLELQLNDPSF 298
Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQV 340
RR +L + L LFQY T VK R L +W+++T +T+Y L+++TPP+G+ +
Sbjct: 299 RRQILTEILFLFQYLTGYVKFRSATNVLPFASREWIRNTEKTIYELLQETPPNGKKYVLF 358
Query: 341 VKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKT--------------- 385
VK +++ E W +WKNEGCP R ++DK KK+KT
Sbjct: 359 VKHLIEREAFWVKWKNEGCPRFMR------EKDKDGASRPKKRKTIYDDFMSNKKLGLGS 412
Query: 386 PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWR 444
PEL+KLW S NLEACK+ R F P + ++FEEA++ DPA +E+QYK ++ + WR
Sbjct: 413 PELSKLWGRSLPNLEACKTENRQFIPKVHTFFEEAMEHADPANEIEDQYKCYKNAEFGWR 472
Query: 445 ALRLLSRKCPHFFLNATPNVEKNS--EFIENMVKRCVKE 481
+LRLL+ PHFF N+ + +++E+++K+ +KE
Sbjct: 473 SLRLLAHASPHFFSLQPANMHHLAVPKYLEHILKKLLKE 511
>gi|313226154|emb|CBY21297.1| unnamed protein product [Oikopleura dioica]
Length = 648
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 197/519 (37%), Positives = 289/519 (55%), Gaps = 66/519 (12%)
Query: 3 DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLK-LIKTPDIDIKVIENY 61
+ F +F + N KY+ + D DKK +Q ++ L + L+K DI KVI +
Sbjct: 19 EGFDTDFVSGKPNEFISKYSATHGS-DADKKQVADQCLKVKLQEVLVKYSDIS-KVI-SL 75
Query: 62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM-SCKN 120
++ K +C+ P+ L+SD + STLD CE++F +VE IWK F++ +CKN
Sbjct: 76 LTFATSCARKSLCSVLTPLALISDALECSTLDDCERIFDFVEKQTPIWKSTQFYVNACKN 135
Query: 121 NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE 180
LLR CNDLLRRLS+SQN VFCGRI +FL + FP SE+S LN+ S FN+EN E
Sbjct: 136 QLLRNCNDLLRRLSQSQNNVFCGRIHIFLTRIFPISEKSALNLNSNFNLEN--------E 187
Query: 181 MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
D + E EET+ ++ Y WS+QDYFRNP+Q Y V + F T+
Sbjct: 188 TDYTRIEDEETENRDL---------YNSLWSVQDYFRNPLQLYEPVKFAEFEKAVNTINT 238
Query: 241 AFKSYKLD--------------------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
AF ++KL+ DV+ N YF K+LT+ LL+LQL+D +F
Sbjct: 239 AFANFKLEVNDDEEEKRSKMKEIDYDSIDVEMKSNGQIVYFPKFLTSFNLLELQLNDPSF 298
Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQV 340
RR +L + L LFQY T VK R L +W+++T +T+Y L+++TPP+G+ +
Sbjct: 299 RRQILTEILFLFQYLTGYVKFRSATNVLPFASREWIRNTEKTIYELLQETPPNGKKYVLF 358
Query: 341 VKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKT--------------- 385
VK +++ E W +WKNEGCP R ++DK KK+KT
Sbjct: 359 VKHLIEREAFWVKWKNEGCPRFMR------EKDKDGASRPKKRKTIYDDFMSNKKLGLGS 412
Query: 386 PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWR 444
PEL+KLW S NLEACK+ R F P + ++FEEA++ DPA +E+QYK ++ + WR
Sbjct: 413 PELSKLWGRSLPNLEACKTENRQFIPKVHTFFEEAMEHADPANEIEDQYKCYKNAEFGWR 472
Query: 445 ALRLLSRKCPHFFLNATPNVEKNS--EFIENMVKRCVKE 481
+LRLL+ PHFF N+ + +++E+++K+ +KE
Sbjct: 473 SLRLLAHASPHFFSLQPANMHHLAVPKYLEHILKKLLKE 511
>gi|358342057|dbj|GAA49608.1| THO complex subunit 1 [Clonorchis sinensis]
Length = 1030
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 182/468 (38%), Positives = 270/468 (57%), Gaps = 38/468 (8%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
D K ++ + R +L+L+ + +D +++ + + V + ++C++ P +L SD F+
Sbjct: 505 DTKPVIDVAFRSRMLELLSSHSLD--EVKSLLKVGVAAALANICSAPTPFLLFSDVFNTK 562
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
+ C++LF ++E + K F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL+
Sbjct: 563 PISVCQELFGFMEETITTLKDVPLFASGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLS 622
Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEE---------MDVSSNETEETDTEEVDK-VK 200
+ FP E+SGLN++S FN++ T + + MD S+ + +E +T K ++
Sbjct: 623 RLFPLDEKSGLNLMSNFNLDKETPYNKTPDPSIFKHQLSMDSSTEDPDEGETAAHPKSLE 682
Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQS--SLNPSG 258
+D N Y KFWSLQD+F+ P CY+K W F S ETVL F S KL + N SG
Sbjct: 683 VDANLYVKFWSLQDFFKIPTSCYDKAKWVTFRSNIETVLEIFSSIKLSGSNEGDASNTSG 742
Query: 259 DY-FAKYLTNQK---------LLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLEL 308
F+KYLT++K LLDLQL D++FRRY+++Q L++FQY + K + L
Sbjct: 743 SSKFSKYLTSEKAIIPRCSFQLLDLQLMDSSFRRYIMVQILVVFQYLITPTKFKTPDQVL 802
Query: 309 KSDQEKWVKDTTETVYSLIKQT-PPDGEH---FSQVVKLILKGEEHWNQWKNEGCPELKR 364
+Q +W+ E + SL+ D +H F+ + IL+ E HWN WKNEGCP R
Sbjct: 803 SEEQTEWLNRKRELILSLLSGNRKTDSQHTDSFASTIGRILERETHWNSWKNEGCPSFVR 862
Query: 365 PLTSITDEDKK-------DEPDAK--KKKTPELTKLWN-SKDNLEACKSAERDFTPSLES 414
P E +K D AK + L KLWN DNL AC+ R F P+L+S
Sbjct: 863 PQEKSPVETRKRRTNPLIDRAGAKIYRFGNKNLDKLWNVCPDNLAACRDERRLFRPNLQS 922
Query: 415 YFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
YF++AI ++DPA V+E+YK +N + WR+LR L+R+ PHF++N P
Sbjct: 923 YFQDAILELDPAEKVDEEYKSINKEEWCWRSLRCLARRSPHFYINWNP 970
>gi|312066929|ref|XP_003136503.1| hypothetical protein LOAG_00915 [Loa loa]
Length = 667
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 171/474 (36%), Positives = 268/474 (56%), Gaps = 51/474 (10%)
Query: 57 VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
++ +++ + +D C + ++ D FD+S++ +CE+LF VE N+ WK F+
Sbjct: 56 LLRQLLAVTFQAAKEDFCAKGTTVTVVQDVFDVSSVKRCEELFVIVEDNLPQWKMSIFYE 115
Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
CKN +LRMCNDLL+RLSR+ +T+FCGRIL+ LA P E+SGLN++S FN+ N+T++
Sbjct: 116 PCKNMILRMCNDLLKRLSRTVDTIFCGRILVLLAHALPLCEKSGLNLVSHFNLGNVTKYD 175
Query: 177 -GDEEMDVSSNETEETDTEEVDKVK-----IDFNFYKKFWSLQDYFRNPVQCYNKVSWKM 230
D + V +N TE++D EV ++ +D+ Y KFW LQ +F +P C+ K W+
Sbjct: 176 MADSCLGVDANLTEDSDVMEVGEISEREIPVDYTLYAKFWQLQSFFSSPATCFEKSKWRT 235
Query: 231 FTSYAETVLAAFKSYKLDD------------------------VQSSLNPSGD---YFAK 263
F VL F S+KL++ + S+ GD YFAK
Sbjct: 236 FQQNTTEVLNIFSSHKLENCIALGEGSKRERLKQENSNDEIPVIPSTSKADGDEDNYFAK 295
Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETV 323
YLT+QKLL LQL+D+ FRRY L+Q LIL+QY S +K + L +Q +++ ++TE
Sbjct: 296 YLTSQKLLQLQLNDSQFRRYFLVQCLILYQYLISDIKFKDRSFTLNDEQLRFIIESTERC 355
Query: 324 YSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDK--------- 374
+ L+++T P G HF+ +K+IL E+ W++WKN+GC + T + D++K
Sbjct: 356 FRLLRETHPKGPHFADAMKVILHREKEWSEWKNKGCLD----YTQLADKEKPSVFKKRPR 411
Query: 375 -KDEPDAKKKKTPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQ 432
+ +P PELTKLWN + D L AC+ + R+F P L ++ E + ++DP VEEQ
Sbjct: 412 NRYDPSKLDLGNPELTKLWNINPDMLSACEDSRRNFMPKLATFLENPLIELDPEQLVEEQ 471
Query: 433 YKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSE---FIENMVKRCVKEKP 483
Y+ VN+ ++ WRA RLL + +F + E + F+E ++K P
Sbjct: 472 YRSVNNESFQWRASRLLMSQSTSYFQTPSKRNEVTTNLAPFLEEVIKHTATNFP 525
>gi|393911810|gb|EFO27566.2| hypothetical protein LOAG_00915 [Loa loa]
Length = 657
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 171/474 (36%), Positives = 268/474 (56%), Gaps = 51/474 (10%)
Query: 57 VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
++ +++ + +D C + ++ D FD+S++ +CE+LF VE N+ WK F+
Sbjct: 56 LLRQLLAVTFQAAKEDFCAKGTTVTVVQDVFDVSSVKRCEELFVIVEDNLPQWKMSIFYE 115
Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
CKN +LRMCNDLL+RLSR+ +T+FCGRIL+ LA P E+SGLN++S FN+ N+T++
Sbjct: 116 PCKNMILRMCNDLLKRLSRTVDTIFCGRILVLLAHALPLCEKSGLNLVSHFNLGNVTKYD 175
Query: 177 -GDEEMDVSSNETEETDTEEVDKVK-----IDFNFYKKFWSLQDYFRNPVQCYNKVSWKM 230
D + V +N TE++D EV ++ +D+ Y KFW LQ +F +P C+ K W+
Sbjct: 176 MADSCLGVDANLTEDSDVMEVGEISEREIPVDYTLYAKFWQLQSFFSSPATCFEKSKWRT 235
Query: 231 FTSYAETVLAAFKSYKLDD------------------------VQSSLNPSGD---YFAK 263
F VL F S+KL++ + S+ GD YFAK
Sbjct: 236 FQQNTTEVLNIFSSHKLENCIALGEGSKRERLKQENSNDEIPVIPSTSKADGDEDNYFAK 295
Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETV 323
YLT+QKLL LQL+D+ FRRY L+Q LIL+QY S +K + L +Q +++ ++TE
Sbjct: 296 YLTSQKLLQLQLNDSQFRRYFLVQCLILYQYLISDIKFKDRSFTLNDEQLRFIIESTERC 355
Query: 324 YSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDK--------- 374
+ L+++T P G HF+ +K+IL E+ W++WKN+GC + T + D++K
Sbjct: 356 FRLLRETHPKGPHFADAMKVILHREKEWSEWKNKGCLD----YTQLADKEKPSVFKKRPR 411
Query: 375 -KDEPDAKKKKTPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQ 432
+ +P PELTKLWN + D L AC+ + R+F P L ++ E + ++DP VEEQ
Sbjct: 412 NRYDPSKLDLGNPELTKLWNINPDMLSACEDSRRNFMPKLATFLENPLIELDPEQLVEEQ 471
Query: 433 YKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSE---FIENMVKRCVKEKP 483
Y+ VN+ ++ WRA RLL + +F + E + F+E ++K P
Sbjct: 472 YRSVNNESFQWRASRLLMSQSTSYFQTPSKRNEVTTNLAPFLEEVIKHTATNFP 525
>gi|402591093|gb|EJW85023.1| hypothetical protein WUBG_04065 [Wuchereria bancrofti]
Length = 657
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 174/496 (35%), Positives = 277/496 (55%), Gaps = 52/496 (10%)
Query: 36 LEQSIRQYLLKLIKTPDIDIK-VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDK 94
+E +I + ++ + DI+ + ++ +++ + +D C + ++ D FD+S +
Sbjct: 34 IENAIHRKAFEIAQCTDIEQESLLRQLLAVTFQAAKEDFCAKGTTVAVIQDVFDVSPVKH 93
Query: 95 CEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFP 154
CE+LF VE N+ WK F+ CKN +LRMCNDLL+RLSR+ +T FCGRIL+ LA P
Sbjct: 94 CEKLFVIVEDNLPQWKMGIFYEPCKNMILRMCNDLLKRLSRTVDTSFCGRILVLLAHALP 153
Query: 155 FSERSGLNIISEFNVENITEFG-GDEEMDVSSNETEETDTEEVDKVK-----IDFNFYKK 208
E+SGLN++S FN+ N+T++ D + +N TE++D EV ++ +D+ Y K
Sbjct: 154 LCEKSGLNLVSHFNLGNVTKYDLADSCFALDANLTEDSDVMEVGEISEREIPVDYTLYAK 213
Query: 209 FWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------- 248
FW LQ +F +P C+ K W+ F VL F S+KL+
Sbjct: 214 FWQLQSFFASPATCFEKSKWRTFQQNTTEVLNIFSSHKLEFSATLEEGSKRERSKQENSN 273
Query: 249 ----DVQSSLNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
+ S+ GD YFAKYLT+QKLL LQL+D+ FRRY L+Q LIL+QY S +K
Sbjct: 274 DEIAAIPSTSRADGDEDIYFAKYLTSQKLLQLQLNDSQFRRYFLVQCLILYQYLVSDIKF 333
Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
+ + L +Q +++ ++TE + L+++T P G HF+ +K+IL E+ W++WKN+GC +
Sbjct: 334 KDKSFTLNDEQLRFIIESTERCFRLLRETYPKGPHFADAMKVILHREKEWSEWKNKGCLD 393
Query: 362 LKRPLTSITDEDK----------KDEPDAKKKKTPELTKLWN-SKDNLEACKSAERDFTP 410
T + D++K + +P PELTKLWN + D L AC+ + R+F P
Sbjct: 394 ----YTQLADKEKPSVFKKRPRNRYDPSKLDLGNPELTKLWNINPDMLSACEDSRRNFIP 449
Query: 411 SLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSE- 469
L ++ E+ + ++DP VEEQY+ VN+ ++ WRA RLL + +F + E +
Sbjct: 450 MLVAFLEDPLDELDPEQLVEEQYRSVNNESFQWRASRLLMSQSTSYFQTPSKRSEVTTNL 509
Query: 470 --FIENMVKRCVKEKP 483
F+E ++K P
Sbjct: 510 APFLEEVIKHTAANFP 525
>gi|149031711|gb|EDL86661.1| rCG41283, isoform CRA_a [Rattus norvegicus]
Length = 370
Score = 307 bits (787), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 161/327 (49%), Positives = 211/327 (64%), Gaps = 30/327 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+ENIT F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENITVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSL 326
K + L +Q W++DTT++VY +
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQV 363
>gi|148691056|gb|EDL23003.1| THO complex 1, isoform CRA_a [Mus musculus]
Length = 377
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 160/327 (48%), Positives = 211/327 (64%), Gaps = 30/327 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSL 326
K + L +Q W++DTT++VY +
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQV 363
>gi|417410320|gb|JAA51635.1| Putative nuclear matrix protein, partial [Desmodus rotundus]
Length = 392
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 160/327 (48%), Positives = 211/327 (64%), Gaps = 30/327 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 44 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 99
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 100 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 159
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 160 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 219
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 220 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 279
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 280 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 339
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSL 326
K + L +Q W++DTT++VY +
Sbjct: 340 KFKSSNYVLTDEQSLWIEDTTKSVYQV 366
>gi|426385304|ref|XP_004059159.1| PREDICTED: THO complex subunit 1-like [Gorilla gorilla gorilla]
gi|50812415|gb|AAT81409.1| death domain-containing protein p84N5 short isoform [Homo sapiens]
gi|119622137|gb|EAX01732.1| THO complex 1, isoform CRA_b [Homo sapiens]
Length = 377
Score = 306 bits (785), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 160/327 (48%), Positives = 211/327 (64%), Gaps = 30/327 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336
Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSL 326
K + L +Q W++DTT++VY +
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQV 363
>gi|324502527|gb|ADY41112.1| THO complex subunit 1 [Ascaris suum]
Length = 669
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 168/481 (34%), Positives = 270/481 (56%), Gaps = 61/481 (12%)
Query: 33 KSALEQSIRQYLLKLIKT--PDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
K+ +E+++ + L L +T P++ + + ++L ++ +++C + ++ D FD
Sbjct: 31 KNNVEEALHRKALALSETSEPNLLEENVRQLLALVFQIAKEELCAKASTVSVIQDVFDTI 90
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
+++ CE+LF VE N+++WK F+ CKN +LRMCNDLL+RLSR+ +T FCGRIL+ LA
Sbjct: 91 SIEWCERLFVVVEDNLSLWKTPFFYEPCKNLVLRMCNDLLKRLSRTVDTSFCGRILVLLA 150
Query: 151 KFFPFSERSGLNIISEFNVENITEF-----GGDEEMDVSSNETEETDTEEVDKVKIDFNF 205
+ P E+SGLN++S FNV+N+T+F D E++ E E + +E + +DF
Sbjct: 151 RALPLCEKSGLNLVSHFNVDNVTKFDLVESTVDSEVNEDGEEMETGEIKEGRDIPVDFAL 210
Query: 206 YKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD---------------DV 250
Y KFW LQ +F +P CY++ W F VL S+KL D
Sbjct: 211 YAKFWQLQTFFSSPNNCYDRSKWNTFQQNTNDVLNVLTSHKLGGSTQADEQSRRKRQHDG 270
Query: 251 QSSLNPS----------------GD------YFAKYLTNQKLLDLQLSDTNFRRYVLLQF 288
S P+ GD +FAKY+T+QKLL LQL+D+ FRRY L+Q
Sbjct: 271 APSSAPADVTSSSASALLNSTRDGDTVEKNAFFAKYVTSQKLLQLQLNDSQFRRYFLVQC 330
Query: 289 LILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
L+L+QY S VK + + L +Q +++ ++TE Y L++ T P G HF+ VK IL+ E
Sbjct: 331 LVLYQYLVSDVKFKDKSYTLNDEQLRFIIESTEKCYRLLRDTHPKGSHFADSVKKILQRE 390
Query: 349 EHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK-----------TPELTKLWN-SKD 396
+ W++WKN+GC + T + D++K P A++ + +P LT LW D
Sbjct: 391 KEWSEWKNKGCAD----YTLLIDKEKMC-PIARRPRNTYKPTEIDLGSPVLTNLWKIDPD 445
Query: 397 NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHF 456
L+AC+ ++R F PS+ + + + ++DP VE+QY VN+ ++ WRA R+L + P +
Sbjct: 446 MLDACQDSKRKFAPSMVDFLQNPLLELDPEQMVEDQYMSVNNESFQWRASRMLMHESPAY 505
Query: 457 F 457
F
Sbjct: 506 F 506
>gi|67967745|dbj|BAE00355.1| unnamed protein product [Macaca fascicularis]
Length = 729
Score = 293 bits (751), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 150/296 (50%), Positives = 194/296 (65%), Gaps = 30/296 (10%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYF 295
DD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYL 332
>gi|256084105|ref|XP_002578273.1| nuclear matrix protein [Schistosoma mansoni]
Length = 622
Score = 293 bits (751), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 170/469 (36%), Positives = 247/469 (52%), Gaps = 64/469 (13%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
DKKS ++ ++R +L+L+ + +D ++ + + V + ++C++ P +L +D F+
Sbjct: 41 DKKSLIDVALRGRMLRLLASDSMDD--VKTLLKMGVAAALANLCSAPTPFLLFADVFNTK 98
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
+ CE++F ++E + K F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL
Sbjct: 99 PISLCEEMFGFMEETIITLKDVPLFASGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 158
Query: 151 KFFPFSERSGLNIISEFNVE---------NITEFGGDEEMDVSSNETEETD-TEEVDKVK 200
+ FP E+SGLN +S FN E + + F D +S++ EE + T+ ++
Sbjct: 159 RLFPLDEKSGLNFMSNFNSEKDILYNKNPDPSAFKHQVSHDHTSDDLEEGEMTDSSTPLE 218
Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS---LNPS 257
+D Y KFWSLQ++F++P CY K W FTS +TVL F S KL ++ S
Sbjct: 219 VDAGLYVKFWSLQEFFKSPALCYEKAKWLRFTSSTDTVLDVFSSIKLMAIEEGDICSQSS 278
Query: 258 GDYFAKYLTNQKLLD-----LQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQ 312
F+KYLT++K + L+L + R ++ Q L DQ
Sbjct: 279 SGTFSKYLTSEKTVGFFKEQLKLLQRSCRGVIIDQV-------------------LSEDQ 319
Query: 313 EKWVKDTTETVYSLI---KQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR----- 364
WV E V L+ G F V+ IL+ E +WN+WKN+GCP R
Sbjct: 320 RSWVNQRHEVVLRLLSSNSPNNSSGGTFVSTVERILERESYWNRWKNDGCPSFIRTPEKS 379
Query: 365 ----------PLTSITDEDKKDEPDAKKKKTPELTKLWN-SKDNLEACKSAERDFTPSLE 413
PL + T + + EL KLWN DNL AC+ R F P L
Sbjct: 380 RLSVRKRHINPLVTRTGQ------KVYRFGNRELDKLWNVCPDNLAACRDKRRVFNPDLH 433
Query: 414 SYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
SYF++AI +MDPA VEE+YK +N ++WRALR LSRKCPHF++N P
Sbjct: 434 SYFQDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYINWNP 482
>gi|351709766|gb|EHB12685.1| THO complex subunit 1 [Heterocephalus glaber]
Length = 530
Score = 286 bits (732), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 159/350 (45%), Positives = 208/350 (59%), Gaps = 52/350 (14%)
Query: 159 SGLNIISEFNVENITEFGGDEE---------------MDVSSNETEETDTEEVDKVKIDF 203
GLN+ S+FN+EN+T F +E+ MDV E + + + ID+
Sbjct: 85 GGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDDEAPTTCSIPIDY 144
Query: 204 NFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS------LNPS 257
N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKLDD Q+S L
Sbjct: 145 NLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTG 204
Query: 258 GD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY VK + L +Q W
Sbjct: 205 GEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLW 264
Query: 316 VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKK 375
++DTT++VY +H IL EE+WN WKNEGCP + TS D K
Sbjct: 265 IEDTTKSVY----------QH-------ILNTEENWNSWKNEGCPSFVKERTS----DSK 303
Query: 376 DEPDAKKKKTPE--LTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQY 433
+K+ PE L K N K + +++ P+LE +FEEAI+Q DP VE +Y
Sbjct: 304 PTRVVRKRTAPEDFLGKGPNKKILM------GKEYMPTLEEFFEEAIEQADPENMVESEY 357
Query: 434 KKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
K VN+SNY WRALRLL+R+ PHFF + E++ENMV + KE P
Sbjct: 358 KVVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLENMVIKLAKELP 407
>gi|444723139|gb|ELW63801.1| Collectin-12 [Tupaia chinensis]
Length = 1142
Score = 285 bits (729), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 158/349 (45%), Positives = 207/349 (59%), Gaps = 52/349 (14%)
Query: 160 GLNIISEFNVENITEFGGDEE---------------MDVSSNETEETDTEEVDKVKIDFN 204
GLN+ S+FN+EN+T F +E+ MDV E + + + ID+N
Sbjct: 698 GLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDDEAPTTCSIPIDYN 757
Query: 205 FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS------LNPSG 258
Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKLDD Q+S L G
Sbjct: 758 LYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTGG 817
Query: 259 D--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWV 316
+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY VK + L +Q W+
Sbjct: 818 EHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWI 877
Query: 317 KDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKD 376
+DTT++VY +H IL EE+WN WKNEGCP + S D K
Sbjct: 878 EDTTKSVY----------QH-------ILNTEENWNSWKNEGCPSFVKERAS----DTKP 916
Query: 377 EPDAKKKKTPE--LTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYK 434
+K+ PE L K N K + +++ P+LE +FEEAI+Q DP VE +YK
Sbjct: 917 ARVVRKRTAPEDFLGKGPNKKILM------GKEYMPTLEEFFEEAIEQADPENMVENEYK 970
Query: 435 KVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
VN+SNY WRALRLL+R+ PHFF + E++ENMV + KE P
Sbjct: 971 AVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLENMVIKLAKELP 1019
>gi|308464438|ref|XP_003094486.1| CRE-THOC-1 protein [Caenorhabditis remanei]
gi|308247715|gb|EFO91667.1| CRE-THOC-1 protein [Caenorhabditis remanei]
Length = 682
Score = 284 bits (726), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 166/437 (37%), Positives = 240/437 (54%), Gaps = 37/437 (8%)
Query: 69 CMK-DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCN 127
CMK C+ + PI+ L D +MS++++C+QLF VE N+N +KQ F + +N +LR+CN
Sbjct: 67 CMKLGYCSKSTPILTLQDLLEMSSIEECKQLFGLVEENMNEFKQPGFIETYQNTILRLCN 126
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE------- 180
DLLRRLSR+ T FCGRI+ FL++F P +E+SG+N + FN NIT + E
Sbjct: 127 DLLRRLSRTAETSFCGRIMFFLSRFLPLTEKSGVNFMGHFNTLNITNYEETETTAESLLT 186
Query: 181 ---------MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMF 231
+D ET E + ++ ++ + Y++FWSLQ + NP Y K + MF
Sbjct: 187 AASSRRPTPVDGEEMETGELEEDQAKEINVTSEMYRQFWSLQKFMSNPNLLYEKDKFAMF 246
Query: 232 TSYAETVLAAFKSYKLDDVQSSLNP----------SGD-YFAKYLTNQKLLDLQLSDTNF 280
VL+ + KL+ + S D +F KYLT+ KLL LQL+D++F
Sbjct: 247 KKDLSAVLSLMTANKLEKLNQEEESEYTENKQKKVSNDVFFTKYLTSPKLLALQLNDSSF 306
Query: 281 RRYVLLQFLILFQYFT--STVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFS 338
RRY L+Q +I+FQY S K + + L +Q K+V D + Y L+ T P G F
Sbjct: 307 RRYFLIQTIIIFQYLNAESRFKPPAKKMVLNEEQAKYVTDHEDKCYRLLSDTMPRGAVFV 366
Query: 339 QVVKLILKGEEHWNQWKNEGCPEL--KRPLTSITDEDKKDE----PDAKKKKTPELTKLW 392
+K I++ E+ WN WKN CP+L K ++ K+ P + TPELTKLW
Sbjct: 367 AGLKKIIQREQEWNTWKNSNCPDLSEKADKGAMQMYKKRQRIPFNPQSLDLGTPELTKLW 426
Query: 393 -NSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
N D L+ACKS +R F P+L ++ + + +MDP VEEQYK +ND ++ WRA RLL
Sbjct: 427 TNEPDVLQACKSDKRKFIPTLANFIRDPLDEMDPEQQVEEQYKSINDISFQWRAARLLMH 486
Query: 452 KCPHFFLNATPNVEKNS 468
K P + E NS
Sbjct: 487 KSPGYVSRTDIKSEPNS 503
>gi|449689006|ref|XP_002155851.2| PREDICTED: THO complex subunit 1-like, partial [Hydra
magnipapillata]
Length = 404
Score = 282 bits (722), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 146/301 (48%), Positives = 195/301 (64%), Gaps = 19/301 (6%)
Query: 73 MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRR 132
C LPI+LLSD FD TL CE++F Y+E NV+IW + FF + KN+LLRMCND+LRR
Sbjct: 107 FCTKNLPILLLSDIFDTMTLCDCEKMFKYIEKNVSIWNAEPFFSTGKNHLLRMCNDMLRR 166
Query: 133 LSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMD---VSSNETE 189
LS+SQNTVFCGRI LFLA+ FP E+S LN++S F++EN+T F E++D + N++
Sbjct: 167 LSKSQNTVFCGRIQLFLAQLFPVEEKSALNLMSNFHLENVTLFKT-EKLDFDEIQPNDSV 225
Query: 190 ETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD- 248
E D E V+ +D+N Y KFW+LQDYF NP +CY K W + + VL F S+KL+
Sbjct: 226 E-DRELVNN-PVDYNLYIKFWTLQDYFVNPSKCYTKEGWNLINTNISEVLKIFTSFKLEF 283
Query: 249 ------------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT 296
DV + +YFAKYLTN+KL + QL+D+NFRR L+Q LILFQY +
Sbjct: 284 NVRKRKSIDMPIDVDQIKQTANNYFAKYLTNEKLFNFQLNDSNFRRNFLVQVLILFQYLS 343
Query: 297 STVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN 356
T K + L Q W+ +TT+ VY+ I +TPPDG+ F + +K IL EE+W WKN
Sbjct: 344 GTSKFKAPNQVLNDTQTNWINETTDIVYNSICETPPDGKKFCETIKHILSREENWINWKN 403
Query: 357 E 357
E
Sbjct: 404 E 404
>gi|349604757|gb|AEQ00218.1| THO complex subunit 1-like protein, partial [Equus caballus]
Length = 268
Score = 282 bits (721), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 142/267 (53%), Positives = 180/267 (67%), Gaps = 23/267 (8%)
Query: 81 ILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTV 140
+LL D D LD+C+ +F E NV WK TF+ + KN LLRMC DLLRRLS+SQNTV
Sbjct: 1 VLLGDVLDCLPLDQCDTIFTLGEKNVATWKSNTFYSAGKNYLLRMCQDLLRRLSKSQNTV 60
Query: 141 FCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSS 185
FCGRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV
Sbjct: 61 FCGRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEE 120
Query: 186 NETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSY 245
E + + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSY
Sbjct: 121 GEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSY 180
Query: 246 KLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS 297
KLDD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY
Sbjct: 181 KLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKG 240
Query: 298 TVKSRGEGLELKSDQEKWVKDTTETVY 324
VK + L +Q W++DTT++VY
Sbjct: 241 QVKFKSSNYVLTDEQSLWIEDTTKSVY 267
>gi|341893668|gb|EGT49603.1| hypothetical protein CAEBREN_23001 [Caenorhabditis brenneri]
Length = 650
Score = 281 bits (720), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 157/424 (37%), Positives = 228/424 (53%), Gaps = 36/424 (8%)
Query: 69 CMK-DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCN 127
CMK C+ + PI+ L D + S+++ C++LF VE N+N +KQ F +C+NN+LR+CN
Sbjct: 67 CMKLGFCSRSTPILTLQDLMETSSIEACKELFGLVEENMNEFKQPGFIETCQNNILRLCN 126
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF------------ 175
DLLRRLSR+ T FCGRI+ FL++F P +E+SG+N + FN NIT +
Sbjct: 127 DLLRRLSRTAETSFCGRIMFFLSRFLPLTEKSGVNFMGHFNTLNITNYEETDAAAEALIA 186
Query: 176 ----GGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMF 231
+D E E + E ++ + Y++FWSLQ + NP Y K + F
Sbjct: 187 ATSSKAQTPVDGEEMEIGEIEEEGAKEIVVTPEMYRQFWSLQKFMSNPNLLYEKDKFLAF 246
Query: 232 TSYAETVLAAFKSYKLDDVQSSLNPSGD----------YFAKYLTNQKLLDLQLSDTNFR 281
VL+ + KL+ + + D +F KYLT+ KLL LQL+D++FR
Sbjct: 247 KRDLSAVLSLMTANKLEKLSAEEEEMADSRQKKVSNDVFFTKYLTSPKLLALQLNDSSFR 306
Query: 282 RYVLLQFLILFQYFT--STVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQ 339
RY L+Q +I+FQY S K + + L +Q K+V DT + Y L+ T P G F
Sbjct: 307 RYFLIQAIIIFQYLNAESRFKPPAKKMVLNEEQGKYVTDTEDKCYRLLSDTMPRGTAFVA 366
Query: 340 VVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDE------PDAKKKKTPELTKLW- 392
+K I+ E+ WN WKN CP+ + K P + TPELTKLW
Sbjct: 367 GLKKIMSREQEWNVWKNNNCPDFSEKADKGAMQMYKKRARVPFNPTSLDLGTPELTKLWT 426
Query: 393 NSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
N D L+AC++ +R F P+L + + + +MDP VEEQYK +ND+ + WRA RLL K
Sbjct: 427 NEPDVLQACRAEKRKFIPTLADFIRDPLDEMDPEQQVEEQYKSINDAAFQWRAARLLMHK 486
Query: 453 CPHF 456
P +
Sbjct: 487 SPGY 490
>gi|170590342|ref|XP_001899931.1| THO complex subunit 1 [Brugia malayi]
gi|158592563|gb|EDP31161.1| THO complex subunit 1, putative [Brugia malayi]
Length = 480
Score = 281 bits (720), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 162/467 (34%), Positives = 249/467 (53%), Gaps = 77/467 (16%)
Query: 36 LEQSIRQYLLKLIKTPDIDIK-VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDK 94
+E +I + ++ + DI+ + ++ +++ + +D C + ++ D FD+S +
Sbjct: 34 IENAIHRKAFEIAQCTDIEQESLLRQLLAVTFQAAKEDFCAKGTTVAVIQDVFDVSPVKH 93
Query: 95 CEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFP 154
CE+LF VE N+ WK F+ CKN +LRMCNDLL+RLSR+ +T FCGRIL+ LA P
Sbjct: 94 CEKLFAIVEDNLPQWKMGIFYEPCKNMILRMCNDLLKRLSRTVDTSFCGRILVLLAHALP 153
Query: 155 FSERSGLNIISEFNVENITEFG-GDEEMDVSSNETEETDTEEVDKVK-----IDFNFYKK 208
E+SGLN++S FN+ NIT++ D V +N TE++D EV ++ +D+ Y K
Sbjct: 154 LCEKSGLNLVSHFNLGNITKYDLADSCFAVDANLTEDSDVMEVGEISEREIPVDYTLYAK 213
Query: 209 FWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL--------------------- 247
FW LQ +F +P C+ K W+ F VL F S+KL
Sbjct: 214 FWQLQSFFSSPATCFEKSKWRTFQQNTTEVLNIFSSHKLEFSATLEEGSKRERSKQENSN 273
Query: 248 DDVQ---SSLNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
DD+ S+ GD YFAKYLT+QKLL LQL+D+ FRRY L+Q LIL+QY S +K
Sbjct: 274 DDIAAIPSTSRADGDEDIYFAKYLTSQKLLQLQLNDSQFRRYFLVQCLILYQYLVSDIKF 333
Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
+ + L +Q +++ ++TE + L+++T P G HF+ +K+IL E+ W++WKN+GC +
Sbjct: 334 KDKSFTLNDEQLRFIIESTERCFRLLRETYPKGPHFADAMKVILHREKEWSEWKNKGCLD 393
Query: 362 LKRPLTSITDEDK----------KDEPDAKKKKTPELTKLWN-SKDNLEACKSAERDFTP 410
T + D++K + +P PELTKLWN + D L AC+ + R
Sbjct: 394 ----YTQLADKEKPSVFKKRPRNRYDPSKLDLGNPELTKLWNINPDMLSACEDSRRS--- 446
Query: 411 SLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
VN+ ++ WRA RLL + +F
Sbjct: 447 -------------------------VNNESFQWRASRLLMSQSTSYF 468
>gi|71994361|ref|NP_493796.2| Protein THOC-1 [Caenorhabditis elegans]
gi|351064857|emb|CCD73573.1| Protein THOC-1 [Caenorhabditis elegans]
Length = 665
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 161/436 (36%), Positives = 240/436 (55%), Gaps = 42/436 (9%)
Query: 60 NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK 119
+++ C +L +C+ P+ L D +MS++++C+Q+F VE N+N +KQ F + +
Sbjct: 62 HFVMNCSKL---GLCSKNTPLSTLQDLLEMSSIEECKQIFSIVEENMNEFKQPGFIETAQ 118
Query: 120 NNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENIT-----E 174
NN+LR CNDLLRRLSR+ T FCGRI+ FL++F P +E+SG+N + FN N+T E
Sbjct: 119 NNILRFCNDLLRRLSRTAETSFCGRIMFFLSRFLPLTEKSGVNFMGHFNTLNVTNYDESE 178
Query: 175 FGGDEEMDVSSN-----------ETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCY 223
G+ + +S+ ET E + + ++++ Y++FWSLQ + NP Y
Sbjct: 179 TDGEALLAATSSAPTPTEGAEDMETGEIEEDNSKEIQVTPEMYRQFWSLQKFMSNPNSIY 238
Query: 224 NKVSWKMFTSYAETVLAAFKSYKLDDV--------------QSSLNPSGDYFAKYLTNQK 269
K + F + VL S KL+ + Q + + +F KYLT+ K
Sbjct: 239 EKEKFLTFKTDLTAVLTLMTSNKLEKLSSEEEEALENRQKKQQKGSSNDVFFTKYLTSPK 298
Query: 270 LLDLQLSDTNFRRYVLLQFLILFQYFT--STVKSRGEGLELKSDQEKWVKDTTETVYSLI 327
LL LQL+D++FRRY L+Q +I+FQY T S K + + L DQ K+V + + Y L+
Sbjct: 299 LLALQLNDSSFRRYFLMQAIIIFQYLTAESRFKPPAKKMVLNEDQAKYVSECEDKCYRLL 358
Query: 328 KQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL--KRPLTSITDEDKKDE----PDAK 381
T P G F +K I+ E+ WN WKN C + K ++ K+ P++
Sbjct: 359 ADTMPRGTAFVAGLKRIMLREQEWNTWKNANCADFSEKADKGAMQMYKKRQRIPFNPNSL 418
Query: 382 KKKTPELTKLW-NSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSN 440
TPELTKLW N D L+ACKS +R F P L + + I +MDP VEEQYK++NDS
Sbjct: 419 DLGTPELTKLWTNEPDVLKACKSDKRKFIPKLPDFIRDPIDEMDPEQQVEEQYKQINDSA 478
Query: 441 YAWRALRLLSRKCPHF 456
+ WRA RLL K P +
Sbjct: 479 FQWRAARLLMHKSPGY 494
>gi|345309431|ref|XP_001521474.2| PREDICTED: THO complex subunit 1-like, partial [Ornithorhynchus
anatinus]
Length = 361
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 137/279 (49%), Positives = 181/279 (64%), Gaps = 22/279 (7%)
Query: 227 SWKMFTSYAETVLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDT 278
S+K+ Y+E VLA F+SYKLDD Q+S L G+ YFAK+LT++KL+DLQLSD+
Sbjct: 59 SFKLMIRYSEEVLAVFRSYKLDDTQASRKKLEELKAGGEHVYFAKFLTSEKLMDLQLSDS 118
Query: 279 NFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFS 338
NFRR++LLQ+LILFQY VK + L +Q W++DTT++VY L+ + PPDGE FS
Sbjct: 119 NFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFS 178
Query: 339 QVVKLILKGEEHWNQWKNEGCPEL--KRPLTSITDE--DKKDEPDAKKKKTP-------- 386
++V+ IL EE+WN WKNEGCP +RP K+ P+ K P
Sbjct: 179 KMVEHILNTEENWNSWKNEGCPSFVKERPSDPKPPRLVRKRTAPEDFLGKGPNKKILMGN 238
Query: 387 -ELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWR 444
ELT+LWN DN+EACKS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WR
Sbjct: 239 EELTRLWNLCPDNMEACKSESREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWR 298
Query: 445 ALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
ALRLL+R+ PHFF + E++ENMV + KE P
Sbjct: 299 ALRLLARRSPHFFQPTNQQFKSLPEYLENMVIKLAKELP 337
>gi|390351008|ref|XP_793772.3| PREDICTED: THO complex subunit 1-like [Strongylocentrotus
purpuratus]
Length = 647
Score = 258 bits (658), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 143/329 (43%), Positives = 193/329 (58%), Gaps = 41/329 (12%)
Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP---- 256
+D+ Y++FW+LQD+FR P+QCY K WK F++ A VL AF S KL+DV S
Sbjct: 169 VDYKLYRQFWALQDFFRAPLQCYTKEKWKDFSNNAIAVLEAFHSMKLEDVASGKKKKKRR 228
Query: 257 ----------------SGDY--FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTST 298
SGD FAKYLT++KL DLQLSD FRRYV +QFLILFQY
Sbjct: 229 SHHRQQQKQGEEEEADSGDQVSFAKYLTSEKLFDLQLSDAMFRRYVYVQFLILFQYLNQQ 288
Query: 299 VKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEG 358
VK + L + ++K+T + V +L+K+TPP G+ F+ ++ IL EEHWN WKNEG
Sbjct: 289 VKFKQAHHVLTDEMSLFIKETKDKVVALLKETPPYGDQFTAGIEHILSREEHWNAWKNEG 348
Query: 359 CP---------ELKRPLTSITDEDKKDE---PDAKKK---KTPELTKLWNS-KDNLEACK 402
CP E RP + +E A KK +PELTKLWN NLEAC
Sbjct: 349 CPSYVKEKGKTEQARPKSRARKRTLGEELNLAGAAKKIDLGSPELTKLWNVYPSNLEACA 408
Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT- 461
+ +R F P +E +F++AI Q DP A +E +YK VN+SNYAW+A+RLL+++ PHFF T
Sbjct: 409 AEDRIFLPQVEEFFQDAIDQADPEAMIEPEYKVVNNSNYAWQAIRLLAKRSPHFFQTVTA 468
Query: 462 --PNVEKNSEFIENMVKRCVKEKPSSQIS 488
P ++ ++E MV + KE P ++ S
Sbjct: 469 TNPPLKTVPGYLEYMVTKIAKELPQNEES 497
Score = 109 bits (273), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 50/116 (43%), Positives = 78/116 (67%), Gaps = 10/116 (8%)
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-------E 180
+LLRRLS+SQ+ VFCGRI LFLA+FFP E+SGLN++S F+++N+T + +
Sbjct: 20 NLLRRLSKSQDMVFCGRIQLFLARFFPLDEKSGLNLMSNFHLDNVTSYNRTKLEGPLRTR 79
Query: 181 MDVSSNETEETDTEEVD---KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTS 233
+ + ++ + D EE + +D+ Y++FW+LQD+FR P+QCY K WK F++
Sbjct: 80 VPQAIDKQDSMDYEEGEMGSATPVDYKLYRQFWALQDFFRAPLQCYTKEKWKDFSN 135
>gi|296087837|emb|CBI35093.3| unnamed protein product [Vitis vinifera]
Length = 613
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 171/490 (34%), Positives = 249/490 (50%), Gaps = 47/490 (9%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + + LC K+ + LL D +MST+ C+ +F Y+E +I +Q F
Sbjct: 93 IPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDIFAYIESKQDILGKQELFAR 152
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI FN N T++
Sbjct: 153 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY-- 210
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
E D E + IDFNFYK FWSLQ++F NP W+ FTS
Sbjct: 211 ------------EKDAPE--GISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFTSNLM 256
Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
VL F++ L D + + N + + KYLT+ KL+ L+L D +FRR++L+Q LILF
Sbjct: 257 VVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCLILF 316
Query: 293 QYFTSTVKSRGEGLELKSDQEK-WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
Y + K+ +L SD K +K E V L++ TPP G+ F ++ IL+ E++W
Sbjct: 317 DYLKAPGKNDK---DLPSDSMKEEIKSCEERVKKLLETTPPKGKEFLHNIEHILEREKNW 373
Query: 352 NQWKNEGCPELKR-PLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEACK 402
WK +GCP +R P+ +KK D KK+ P EL++LW D N A
Sbjct: 374 VWWKRDGCPPFERQPI------EKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALT 427
Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
+R TP++ Y++ + MD +A +E +Y N+ Y W+ LR +R+ F T
Sbjct: 428 DPQRVRTPAVSEYWKPLAEDMDSSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTE 487
Query: 463 -NVEK--NSEFIENMVKRCVKEKPSSQISG----NGNGVDQDPAEVEVDTKSEEIQEEEK 515
+E E + + V+ + KPS + G Q E ++ T + EI E
Sbjct: 488 YGIEGVVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGT 547
Query: 516 EEDWEAKADP 525
D EA A P
Sbjct: 548 RVDLEASAAP 557
>gi|359487131|ref|XP_002264619.2| PREDICTED: THO complex subunit 1-like [Vitis vinifera]
Length = 601
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 171/490 (34%), Positives = 249/490 (50%), Gaps = 47/490 (9%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + + LC K+ + LL D +MST+ C+ +F Y+E +I +Q F
Sbjct: 81 IPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDIFAYIESKQDILGKQELFAR 140
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI FN N T++
Sbjct: 141 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY-- 198
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
E D E + IDFNFYK FWSLQ++F NP W+ FTS
Sbjct: 199 ------------EKDAPE--GISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFTSNLM 244
Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
VL F++ L D + + N + + KYLT+ KL+ L+L D +FRR++L+Q LILF
Sbjct: 245 VVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCLILF 304
Query: 293 QYFTSTVKSRGEGLELKSDQEK-WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
Y + K+ +L SD K +K E V L++ TPP G+ F ++ IL+ E++W
Sbjct: 305 DYLKAPGKNDK---DLPSDSMKEEIKSCEERVKKLLEMTPPKGKEFLHNIEHILEREKNW 361
Query: 352 NQWKNEGCPELKR-PLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEACK 402
WK +GCP +R P+ +KK D KK+ P EL++LW D N A
Sbjct: 362 VWWKRDGCPPFERQPI------EKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALT 415
Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
+R TP++ Y++ + MD +A +E +Y N+ Y W+ LR +R+ F T
Sbjct: 416 DPQRARTPAVSEYWKPLAEDMDLSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTE 475
Query: 463 -NVEK--NSEFIENMVKRCVKEKPSSQISG----NGNGVDQDPAEVEVDTKSEEIQEEEK 515
+E E + + V+ + KPS + G Q E ++ T + EI E
Sbjct: 476 YGIEGVVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGT 535
Query: 516 EEDWEAKADP 525
D EA A P
Sbjct: 536 RVDLEASAAP 545
>gi|225463994|ref|XP_002263874.1| PREDICTED: THO complex subunit 1-like [Vitis vinifera]
Length = 607
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 171/490 (34%), Positives = 249/490 (50%), Gaps = 47/490 (9%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + + LC K+ + LL D +MST+ C+ +F Y+E +I +Q F
Sbjct: 87 IPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDIFAYIESKQDILGKQELFAR 146
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI FN N T++
Sbjct: 147 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY-- 204
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
E D E + IDFNFYK FWSLQ++F NP W+ FTS
Sbjct: 205 ------------EKDAPE--GISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFTSNLM 250
Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
VL F++ L D + + N + + KYLT+ KL+ L+L D +FRR++L+Q LILF
Sbjct: 251 VVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCLILF 310
Query: 293 QYFTSTVKSRGEGLELKSDQEK-WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
Y + K+ +L SD K +K E V L++ TPP G+ F ++ IL+ E++W
Sbjct: 311 DYLKAPGKNDK---DLPSDSMKEEIKSCEERVKKLLETTPPKGKEFLHNIEHILEREKNW 367
Query: 352 NQWKNEGCPELKR-PLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEACK 402
WK +GCP +R P+ +KK D KK+ P EL++LW D N A
Sbjct: 368 VWWKRDGCPPFERQPI------EKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALT 421
Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
+R TP++ Y++ + MD +A +E +Y N+ Y W+ LR +R+ F T
Sbjct: 422 DPQRVRTPAVSEYWKPLAEDMDSSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTE 481
Query: 463 -NVEK--NSEFIENMVKRCVKEKPSSQISG----NGNGVDQDPAEVEVDTKSEEIQEEEK 515
+E E + + V+ + KPS + G Q E ++ T + EI E
Sbjct: 482 YGIEGVVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGT 541
Query: 516 EEDWEAKADP 525
D EA A P
Sbjct: 542 RVDLEASAAP 551
>gi|296087823|emb|CBI35079.3| unnamed protein product [Vitis vinifera]
Length = 613
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 171/490 (34%), Positives = 249/490 (50%), Gaps = 47/490 (9%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + + LC K+ + LL D +MST+ C+ +F Y+E +I +Q F
Sbjct: 93 IPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDIFAYIESKQDILGKQELFAR 152
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI FN N T++
Sbjct: 153 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY-- 210
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
E D E + IDFNFYK FWSLQ++F NP W+ FTS
Sbjct: 211 ------------EKDAPE--GISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFTSNLM 256
Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
VL F++ L D + + N + + KYLT+ KL+ L+L D +FRR++L+Q LILF
Sbjct: 257 VVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCLILF 316
Query: 293 QYFTSTVKSRGEGLELKSDQEK-WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
Y + K+ +L SD K +K E V L++ TPP G+ F ++ IL+ E++W
Sbjct: 317 DYLKAPGKNDK---DLPSDSMKEEIKSCEERVKKLLEMTPPKGKEFLHNIEHILEREKNW 373
Query: 352 NQWKNEGCPELKR-PLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEACK 402
WK +GCP +R P+ +KK D KK+ P EL++LW D N A
Sbjct: 374 VWWKRDGCPPFERQPI------EKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALT 427
Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
+R TP++ Y++ + MD +A +E +Y N+ Y W+ LR +R+ F T
Sbjct: 428 DPQRARTPAVSEYWKPLAEDMDLSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTE 487
Query: 463 -NVEK--NSEFIENMVKRCVKEKPSSQISG----NGNGVDQDPAEVEVDTKSEEIQEEEK 515
+E E + + V+ + KPS + G Q E ++ T + EI E
Sbjct: 488 YGIEGVVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGT 547
Query: 516 EEDWEAKADP 525
D EA A P
Sbjct: 548 RVDLEASAAP 557
>gi|357114422|ref|XP_003558999.1| PREDICTED: THO complex subunit 1-like [Brachypodium distachyon]
Length = 630
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 171/531 (32%), Positives = 258/531 (48%), Gaps = 78/531 (14%)
Query: 57 VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
+I + + + + LC + + LL D DMST+ C+ +F Y+E ++ +Q F
Sbjct: 122 LITHLLDIMLYLCERGHVEGGMVFQLLEDLTDMSTIKDCKDVFGYIESKQDVLGKQELFG 181
Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
K +LR CN LLRRLS+S + VFCGRI++FLA FFP SERS LNI FN N T++
Sbjct: 182 RGKLVMLRTCNQLLRRLSKSNDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNETKY- 240
Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYA 235
E D + + +DFNFY+ WSLQ++FRNP + N W+ F S
Sbjct: 241 ---EKDATGG------------ISVDFNFYQTLWSLQEHFRNPALTTTNPTKWQKFASNL 285
Query: 236 ETVLAAFKSYKLDDVQSSLN-----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLI 290
VL F++ L D N + KYLT+ KL+ L+L D +FRR++L+Q LI
Sbjct: 286 TVVLNTFEAQPLCDDDGKHNNLEQEEDAAFNIKYLTSSKLMGLELKDASFRRHILVQCLI 345
Query: 291 LFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
F Y + KS EG +E +K E V +L++ PP G+ F Q ++ IL+ E++
Sbjct: 346 FFDYLKAPGKSDKEGPSESMKEE--IKSCEERVKNLLEMIPPKGKEFLQSIEHILEREKN 403
Query: 351 WNQWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKKTP-------ELTKLWN-SKDNLEAC 401
W WK +GCP K+P + +P +K+ P EL +LW ++ N A
Sbjct: 404 WVWWKRDGCPAFEKQPF--------EKKPGGVRKRKPRWRLGNKELAQLWKWAELNPNAL 455
Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRL--------LSRKC 453
+R TPS+ Y++ MD +A +EE+Y N+ Y W+ LR SR C
Sbjct: 456 TDPDRVRTPSVTEYWKPLADDMDASAGIEEEYHHKNNRVYCWKGLRFSARQDLDGFSRFC 515
Query: 454 PHFFLNATPN--------VEKNSEFIENMVKRCVKE-------KPSSQ-------ISGNG 491
+ P + NS+ E KR +E +P Q G G
Sbjct: 516 EYGIEGVVPTELLPPEVRAKYNSKPGEK-AKRPKREDTKGASAQPKEQQVAATPETDGGG 574
Query: 492 NGVDQDPAEVEVDTKSEEIQEEEKEEDWE------AKADPEGDADEVMSVE 536
+G DQ+ V +D+ + I++ +K+ E + +PE D D+ + E
Sbjct: 575 SGADQEEGAVPMDSDNVAIEDGQKQSPGEVSGPESGQCEPEDDVDDNVKSE 625
>gi|334187564|ref|NP_001190269.1| THO complex subunit 1 [Arabidopsis thaliana]
gi|332004074|gb|AED91457.1| THO complex subunit 1 [Arabidopsis thaliana]
Length = 598
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 167/488 (34%), Positives = 247/488 (50%), Gaps = 59/488 (12%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + + + LC K+ + LL D +MST+ C+ +F Y+E +I +Q F
Sbjct: 84 IPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKDVFGYIESKQDILGKQELFAR 143
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI FN N T++
Sbjct: 144 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK 203
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVS-WKMFTSYAE 236
D +S +DFNFYK FWSLQ+YF NP + + W+ F+S
Sbjct: 204 DPPKGIS----------------VDFNFYKTFWSLQEYFCNPASLTSASTKWQKFSSSLA 247
Query: 237 TVLAAFKSYKLDDVQSSLNP----SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
VL F + L + + N + + KYLT+ KL+ L+L D++FRR++LLQ LI+F
Sbjct: 248 VVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSKLMGLELKDSSFRRHILLQCLIMF 307
Query: 293 QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWN 352
Y + K+ +L S+ + +K + V L++ TPP G+ F + V+ IL+ E++W
Sbjct: 308 DYLRAPGKNDK---DLPSETMEELKSCEDRVKKLLEITPPKGKEFLRAVEHILEREKNWV 364
Query: 353 QWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKK--------TPELTKLWNSKD-NLEACK 402
WK +GCP K+P+ DKK P+A +KK EL++LW D N A
Sbjct: 365 WWKRDGCPPFEKQPI------DKKS-PNAGQKKRRQRWRLGNKELSQLWRWADQNPNALT 417
Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT- 461
++R TP + Y++ + MDP+A +E++Y N+ Y W+ LR +R+ F T
Sbjct: 418 DSQRVRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFTARQDLEGFSRFTE 477
Query: 462 -------------PNVEKNSEFIEN-MVKRCVKEK---PSSQISGNGNGVDQDPAEVEVD 504
P V + N KR KE+ S + GN GV AE E
Sbjct: 478 MGIEGVVPVELLPPEVRSKYQAKPNEKAKRAKKEETKGGSHETEGNQIGVSNSEAEAEGG 537
Query: 505 TKSEEIQE 512
E E
Sbjct: 538 RGDAETME 545
>gi|242042539|ref|XP_002468664.1| hypothetical protein SORBIDRAFT_01g049910 [Sorghum bicolor]
gi|241922518|gb|EER95662.1| hypothetical protein SORBIDRAFT_01g049910 [Sorghum bicolor]
Length = 637
Score = 253 bits (645), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 162/528 (30%), Positives = 258/528 (48%), Gaps = 72/528 (13%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + + LC + + LL D +MST+ C+ +F Y+E ++ +Q F
Sbjct: 122 ITRLLDIVLYLCERGHVEGGMVFQLLEDLTEMSTIKDCKDIFGYIESQQDVLGKQELFGR 181
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRI++FLA FFP SERS LNI FN N+T++
Sbjct: 182 GKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNVTKY-- 239
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYAE 236
+ + +D + +DFNFYK WSLQ++F NP + N W+ F+S
Sbjct: 240 --------------EKDAMDGISVDFNFYKTLWSLQEHFSNPALTNTNPAKWQKFSSNLA 285
Query: 237 TVLAAFKSYKLDDVQSSLN-----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLIL 291
VL+ F++ L D LN + KYLT+ KL+ L+L D +FRR++L+Q LI
Sbjct: 286 VVLSTFEAQPLSDDDGKLNNLNEEEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLIF 345
Query: 292 FQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
F Y + K+ EG +E +K E V L++ PP G+ F + ++ IL+ E++W
Sbjct: 346 FDYLKAPGKNDKEGPTGSMKEE--IKSCEEHVKKLLEIIPPKGKEFLKSIEHILEREKNW 403
Query: 352 NQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK------TPELTKLWN-SKDNLEACKSA 404
WK +GC ++P E K + +K+K EL++LW ++ N
Sbjct: 404 VWWKRDGCLAFEKPPF----EKKPGQAGGRKRKPRWRLGNKELSQLWKWAEQNPNVLTDP 459
Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK------------ 452
+R PS+ Y++ + MDP+A +EE+Y ++ Y W+ LR +R+
Sbjct: 460 DRVRMPSITEYWKPLAEDMDPSAGIEEEYHHKSNRVYCWKGLRFSARQDLDGFARFSDYG 519
Query: 453 ----CPHFFLNATPNVEKNSEFIENMVKRCVKE-------KPSSQ-------ISGNGNGV 494
P L N +S+ E VKR +E +P Q G G+G
Sbjct: 520 IEGVVPSELLPPEVNARFSSKPAEK-VKRTRREDSKGMSAQPKEQQVAATPETDGGGSGG 578
Query: 495 DQDPAEVEVDTKSEEIQEEEKEEDWE------AKADPEGDADEVMSVE 536
D + +D+ + +++ +K E + +PE DAD+ + E
Sbjct: 579 DPEEGAAPMDSDNGGVEDSQKRSPGEVSGPESGQCEPEADADDNVKTE 626
>gi|297807009|ref|XP_002871388.1| hypothetical protein ARALYDRAFT_908935 [Arabidopsis lyrata subsp.
lyrata]
gi|297317225|gb|EFH47647.1| hypothetical protein ARALYDRAFT_908935 [Arabidopsis lyrata subsp.
lyrata]
Length = 597
Score = 251 bits (640), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 174/509 (34%), Positives = 259/509 (50%), Gaps = 60/509 (11%)
Query: 38 QSIRQYLLKLIKTPDIDIKV-IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCE 96
+ I QY +LI D +I I + + + + LC K+ + LL D +MST+ C+
Sbjct: 62 EQIMQYG-QLIDDDDDNIHGQIPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCK 120
Query: 97 QLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFS 156
+F Y+E +I +Q F K +LR CN LLRRLS++ + VFCGRIL+FLA FFP S
Sbjct: 121 DVFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLS 180
Query: 157 ERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYF 216
ERS +NI FN N T++ D +S +DFNFYK FWSLQ+YF
Sbjct: 181 ERSAVNIKGVFNTSNETKYEKDPPKGIS----------------VDFNFYKTFWSLQEYF 224
Query: 217 RNPVQCYNKVS-WKMFTSYAETVLAAFKSYKLDDVQSSLNP----SGDYFAKYLTNQKLL 271
NP + + W+ F+S VL F + L + + N + + KYLT+ KL+
Sbjct: 225 CNPASLISASTKWQKFSSSLAVVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSKLM 284
Query: 272 DLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTP 331
L+L D++FRR++LLQ LI+F Y + K+ + L ++ +E+ +K + V L++ TP
Sbjct: 285 GLELKDSSFRRHILLQCLIMFDYLRAPGKNDKD-LPSETMKEE-LKSCEDRVKKLLEITP 342
Query: 332 PDGEHFSQVVKLILKGEEHWNQWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKK------ 384
P G+ F + V+ IL+ E++W WK +GCP K+P+ DKK P+A +KK
Sbjct: 343 PKGKEFLRAVEHILEREKNWVWWKRDGCPPFEKQPI------DKKS-PNAGQKKRRQRWR 395
Query: 385 --TPELTKLWNSKD-NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNY 441
EL++LW D N A ++R TP + Y++ + MDP+A +E++Y N+ Y
Sbjct: 396 LGNKELSQLWRWADQNPNALTDSQRVRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVY 455
Query: 442 AWRALRLLSRKCPHFFLNAT--------------PNVEKNSEFIEN-MVKRCVKEKP--- 483
W+ LR +R+ F T P V + N KR KE+
Sbjct: 456 CWKGLRFTARQDLEGFSRFTEMGIEGVVPVELLPPEVRSKYQAKPNEKAKRAKKEEAKGG 515
Query: 484 SSQISGNGNGVDQDPAEVEVDTKSEEIQE 512
S + GN GV AE E E E
Sbjct: 516 SQETEGNQIGVSNSEAEAEGGRGDAETME 544
>gi|18416110|ref|NP_568219.1| THO complex subunit 1 [Arabidopsis thaliana]
gi|15983384|gb|AAL11560.1|AF424566_1 AT5g09860/MYH9_7 [Arabidopsis thaliana]
gi|16226756|gb|AAL16253.1|AF428323_1 AT5g09860/MYH9_7 [Arabidopsis thaliana]
gi|332004073|gb|AED91456.1| THO complex subunit 1 [Arabidopsis thaliana]
Length = 599
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 167/488 (34%), Positives = 249/488 (51%), Gaps = 58/488 (11%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + + + LC K+ + LL D +MST+ C+ +F Y+E +I +Q F
Sbjct: 84 IPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKDVFGYIESKQDILGKQELFAR 143
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI FN N T++
Sbjct: 144 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK 203
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVS-WKMFTSYAE 236
D +S +DFNFYK FWSLQ+YF NP + + W+ F+S
Sbjct: 204 DPPKGIS----------------VDFNFYKTFWSLQEYFCNPASLTSASTKWQKFSSSLA 247
Query: 237 TVLAAFKSYKLDDVQSSLNP----SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
VL F + L + + N + + KYLT+ KL+ L+L D++FRR++LLQ LI+F
Sbjct: 248 VVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSKLMGLELKDSSFRRHILLQCLIMF 307
Query: 293 QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWN 352
Y + K+ + L ++ +E+ +K + V L++ TPP G+ F + V+ IL+ E++W
Sbjct: 308 DYLRAPGKNDKD-LPSETMKEE-LKSCEDRVKKLLEITPPKGKEFLRAVEHILEREKNWV 365
Query: 353 QWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKK--------TPELTKLWNSKD-NLEACK 402
WK +GCP K+P+ DKK P+A +KK EL++LW D N A
Sbjct: 366 WWKRDGCPPFEKQPI------DKKS-PNAGQKKRRQRWRLGNKELSQLWRWADQNPNALT 418
Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT- 461
++R TP + Y++ + MDP+A +E++Y N+ Y W+ LR +R+ F T
Sbjct: 419 DSQRVRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFTARQDLEGFSRFTE 478
Query: 462 -------------PNVEKNSEFIEN-MVKRCVKEK---PSSQISGNGNGVDQDPAEVEVD 504
P V + N KR KE+ S + GN GV AE E
Sbjct: 479 MGIEGVVPVELLPPEVRSKYQAKPNEKAKRAKKEETKGGSHETEGNQIGVSNSEAEAEGG 538
Query: 505 TKSEEIQE 512
E E
Sbjct: 539 RGDAETME 546
>gi|356508296|ref|XP_003522894.1| PREDICTED: THO complex subunit 1-like [Glycine max]
Length = 605
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 152/411 (36%), Positives = 222/411 (54%), Gaps = 42/411 (10%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + + LC K+ + LL D +MST+ C+ +F Y+E +I +Q F
Sbjct: 81 IPRLLDIVLYLCEKEHIEGGMIFQLLEDLTEMSTMKNCKDIFGYIESKQDILGKQELFAR 140
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS LNI FN N T++
Sbjct: 141 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSALNIKGVFNTSNETKY-- 198
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
+ E ++ + IDFNFY+ FW LQ+YF NP + W+ FT
Sbjct: 199 --------------EKEPLEGICIDFNFYQTFWGLQEYFSNPTSISHAPAKWQKFTLSLS 244
Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
VL F++ L D + N + ++ KYLT+ KL+ L+L D +FRR+VL+Q LILF
Sbjct: 245 VVLNTFEAQPLSDEEGDANNLEEEAVNFSIKYLTSSKLMGLELKDPSFRRHVLVQCLILF 304
Query: 293 QYFTSTVKSRGEG-LELKSDQEK-WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
Y +K+ G+G +L S+ K + E V L++ TPP G F ++ IL+ E++
Sbjct: 305 DY----LKAPGKGDKDLPSENMKEEITSWEERVKKLLELTPPKGTEFLHKIEHILEREKN 360
Query: 351 WNQWKNEGC-PELKRPLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEAC 401
W WK +GC P K+ + +KK PD KK+ P EL++LW D N A
Sbjct: 361 WVWWKRDGCLPYEKQRI------EKKAVPDGPKKRRPRWRLGNKELSQLWKWADQNPNAL 414
Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
+R TPS+ Y++ + MDP+A +E Y N+ Y W+ LRL +R+
Sbjct: 415 TDPQRVQTPSIMEYWKPLAEDMDPSAGIEADYHHKNNRVYCWKGLRLSARQ 465
>gi|414864320|tpg|DAA42877.1| TPA: hypothetical protein ZEAMMB73_799316 [Zea mays]
Length = 604
Score = 248 bits (632), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 164/529 (31%), Positives = 260/529 (49%), Gaps = 72/529 (13%)
Query: 57 VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
+I + + + LC + + LL D +MST+ C+ +F Y+E ++ +Q F
Sbjct: 87 LITRLLDIVLYLCERGHVEGGMVFQLLEDLTEMSTIKDCKDIFGYIESQQDVLGKQELFG 146
Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
K +LR CN LLRRLS++ + VFCGRI++FLA FFP SERS LNI FN N+T++
Sbjct: 147 RGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNVTKY- 205
Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYA 235
+ + +D + +DFNFYK WSLQ++F NP + N W+ F+S
Sbjct: 206 ---------------EKDAMDGISVDFNFYKTLWSLQEHFSNPALTSTNPAKWQKFSSNL 250
Query: 236 ETVLAAFKSYKLDDVQSSLN-----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLI 290
VL F++ L D LN + KYLT+ KL+ L+L D +FRR++L+Q LI
Sbjct: 251 AVVLNTFEAQPLSDDDGKLNNLNEEEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLI 310
Query: 291 LFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
F Y + K+ EG +E +K E V L++ PP G+ F + ++ IL+ E++
Sbjct: 311 FFDYLKAPGKNDKEGPTGSMIEE--IKSCEEHVKKLLEIIPPKGKEFLKSIEHILEREKN 368
Query: 351 WNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK------TPELTKLWN-SKDNLEACKS 403
W WK +GC ++P E K + A+K+K + EL++LW ++ N
Sbjct: 369 WVWWKRDGCLAFEKP----PFEKKPGQAGARKRKPRWRLGSKELSQLWKWAEQNPNVLTD 424
Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK----------- 452
+R PS+ Y++ + MDP+A +EE+Y ++ Y W+ LR +R+
Sbjct: 425 PDRVRMPSITEYWKPLAEDMDPSAGIEEEYHHKSNRVYCWKGLRFSARQDLDGFARFSDY 484
Query: 453 -----CPHFFLNATPNVEKNSEFIENMVKRCVKEKPSS---------QISGNG---NGVD 495
P L N + +S+ E VKR KE S Q++ GVD
Sbjct: 485 GIEGVVPSELLPPEVNAKFSSKPAEK-VKRTRKEDSKSVSAHQPREHQVAATPETDGGVD 543
Query: 496 QDPAE--VEVDTKSEEIQEEEKEEDWE------AKADPEGDADEVMSVE 536
+ E +D+ + + + +K E + +PE DAD+ + E
Sbjct: 544 PEEGEGAAPMDSDNGAVGDSQKRSPGEVSGPESGQCEPEADADDNVRTE 592
>gi|259490220|ref|NP_001159168.1| hypothetical protein [Zea mays]
gi|223942431|gb|ACN25299.1| unknown [Zea mays]
gi|414864321|tpg|DAA42878.1| TPA: hypothetical protein ZEAMMB73_799316 [Zea mays]
Length = 638
Score = 248 bits (632), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 164/529 (31%), Positives = 260/529 (49%), Gaps = 72/529 (13%)
Query: 57 VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
+I + + + LC + + LL D +MST+ C+ +F Y+E ++ +Q F
Sbjct: 121 LITRLLDIVLYLCERGHVEGGMVFQLLEDLTEMSTIKDCKDIFGYIESQQDVLGKQELFG 180
Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
K +LR CN LLRRLS++ + VFCGRI++FLA FFP SERS LNI FN N+T++
Sbjct: 181 RGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNVTKY- 239
Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYA 235
+ + +D + +DFNFYK WSLQ++F NP + N W+ F+S
Sbjct: 240 ---------------EKDAMDGISVDFNFYKTLWSLQEHFSNPALTSTNPAKWQKFSSNL 284
Query: 236 ETVLAAFKSYKLDDVQSSLN-----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLI 290
VL F++ L D LN + KYLT+ KL+ L+L D +FRR++L+Q LI
Sbjct: 285 AVVLNTFEAQPLSDDDGKLNNLNEEEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLI 344
Query: 291 LFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
F Y + K+ EG +E +K E V L++ PP G+ F + ++ IL+ E++
Sbjct: 345 FFDYLKAPGKNDKEGPTGSMIEE--IKSCEEHVKKLLEIIPPKGKEFLKSIEHILEREKN 402
Query: 351 WNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK------TPELTKLWN-SKDNLEACKS 403
W WK +GC ++P E K + A+K+K + EL++LW ++ N
Sbjct: 403 WVWWKRDGCLAFEKP----PFEKKPGQAGARKRKPRWRLGSKELSQLWKWAEQNPNVLTD 458
Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK----------- 452
+R PS+ Y++ + MDP+A +EE+Y ++ Y W+ LR +R+
Sbjct: 459 PDRVRMPSITEYWKPLAEDMDPSAGIEEEYHHKSNRVYCWKGLRFSARQDLDGFARFSDY 518
Query: 453 -----CPHFFLNATPNVEKNSEFIENMVKRCVKEKPSS---------QISGNG---NGVD 495
P L N + +S+ E VKR KE S Q++ GVD
Sbjct: 519 GIEGVVPSELLPPEVNAKFSSKPAEK-VKRTRKEDSKSVSAHQPREHQVAATPETDGGVD 577
Query: 496 QDPAE--VEVDTKSEEIQEEEKEEDWE------AKADPEGDADEVMSVE 536
+ E +D+ + + + +K E + +PE DAD+ + E
Sbjct: 578 PEEGEGAAPMDSDNGAVGDSQKRSPGEVSGPESGQCEPEADADDNVRTE 626
>gi|23463069|gb|AAN33204.1| At5g09860/MYH9_7 [Arabidopsis thaliana]
Length = 599
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 166/488 (34%), Positives = 248/488 (50%), Gaps = 58/488 (11%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + + + LC K+ + LL D +MST+ C+ +F Y+E +I +Q F
Sbjct: 84 IPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKDVFGYIESKQDILGKQELFAR 143
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI FN N T++
Sbjct: 144 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK 203
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVS-WKMFTSYAE 236
D +S +DFNFYK FWSLQ+YF NP + + W+ F+S
Sbjct: 204 DPPKGIS----------------VDFNFYKTFWSLQEYFCNPASLTSASTKWQKFSSSLA 247
Query: 237 TVLAAFKSYKLDDVQSSLNP----SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
VL F + L + + N + + KYLT+ KL+ L+L D++FRR++LLQ LI+F
Sbjct: 248 VVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSKLMGLELKDSSFRRHILLQCLIMF 307
Query: 293 QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWN 352
Y + K+ + L ++ +E+ +K + V L++ TPP G+ F + V+ IL+ E++W
Sbjct: 308 DYLRAPGKNDKD-LPSETMKEE-LKSCEDRVKKLLEITPPKGKEFLRAVEHILEREKNWV 365
Query: 353 QWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKK--------TPELTKLWNSKD-NLEACK 402
WK +G P K+P+ DKK P+A +KK EL++LW D N A
Sbjct: 366 WWKRDGYPPFEKQPI------DKKS-PNAGQKKRRQRWRLGNKELSQLWRWADQNPNALT 418
Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT- 461
++R TP + Y++ + MDP+A +E++Y N+ Y W+ LR +R+ F T
Sbjct: 419 DSQRVRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFTARQDLEGFSRFTE 478
Query: 462 -------------PNVEKNSEFIEN-MVKRCVKEK---PSSQISGNGNGVDQDPAEVEVD 504
P V + N KR KE+ S + GN GV AE E
Sbjct: 479 MGIEGVVPVELLPPEVRSKYQAKPNEKAKRAKKEETKGGSHETEGNQIGVSNSEAEAEGG 538
Query: 505 TKSEEIQE 512
E E
Sbjct: 539 RGDAETME 546
>gi|224057238|ref|XP_002299188.1| predicted protein [Populus trichocarpa]
gi|222846446|gb|EEE83993.1| predicted protein [Populus trichocarpa]
Length = 608
Score = 245 bits (626), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 143/408 (35%), Positives = 221/408 (54%), Gaps = 37/408 (9%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + LC ++ + LL D +MST+ C+ +F Y+E +I +Q F
Sbjct: 81 IPRLLDAVLYLCEREHIEGGMIFQLLEDLTEMSTMRNCKDIFGYIESKQDILGKQELFAR 140
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI FN N T++
Sbjct: 141 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY-- 198
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
+ E + +DFNFYK WSLQ+YF +P + + W+ F+S
Sbjct: 199 --------------EKEPPAAISLDFNFYKTLWSLQEYFCDPSLTLSPIKWQKFSSSLMV 244
Query: 238 VLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQ 293
VL F++ L + + N + + KYLT+ KL+ L+L D +FRR+VL+Q LILF
Sbjct: 245 VLNTFEAQPLSEEEGDANNLEEEAAAFNIKYLTSSKLMGLELKDPSFRRHVLVQCLILFD 304
Query: 294 YFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQ 353
Y + K+ + L +S +E+ ++ E V L++ TPP G+ F +V+ IL+ E++W
Sbjct: 305 YLKAPGKNDKD-LTSESMKEE-IRSREEHVKKLLEMTPPKGKDFLHMVEHILEREKNWLW 362
Query: 354 WKNEGCPEL-KRPLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEACKSA 404
WK +GCP K+P+ + T + D KK+ P EL++LW D N A
Sbjct: 363 WKRDGCPPFEKQPIENKTVQ------DGGKKRRPRWRLGNKELSQLWKWADQNPNALTDP 416
Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
+R TP + Y++ + MDP+A ++ +Y N+ Y W+ LR +R+
Sbjct: 417 QRVRTPIITDYWKPLAEDMDPSAGIDAEYHHKNNRVYCWKGLRFSARQ 464
>gi|449445104|ref|XP_004140313.1| PREDICTED: THO complex subunit 1-like [Cucumis sativus]
Length = 607
Score = 245 bits (626), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 161/464 (34%), Positives = 234/464 (50%), Gaps = 49/464 (10%)
Query: 14 LNVVSDKYALYKPT-CDYDKKSALEQSIRQYLLKLIKTP----------DIDIKVIENYI 62
L +V D K T D+ LE +R+ L +L+ + + I E
Sbjct: 25 LQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQ 84
Query: 63 SLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL 122
+ + LC K+ + LL D +MSTL C+ +F Y+E +I +Q F K +
Sbjct: 85 DIVLYLCEKEHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVM 144
Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMD 182
LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI FN N T++
Sbjct: 145 LRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY------- 197
Query: 183 VSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAETVLAA 241
+ + D IDFNFYK FWSLQ++F NP W+ FTS VL
Sbjct: 198 ---------EKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNT 248
Query: 242 FKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS 297
F + L D + N S + KYLT+ KL+ L+L D +FRR+VL+Q LILF Y +
Sbjct: 249 FDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKA 308
Query: 298 TVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNE 357
K+ + +E +K E V L++ TPP G+ F Q ++ IL+ E +W WK +
Sbjct: 309 PGKNEKDIPSETMREE--IKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRD 366
Query: 358 GC-PELKRPLTSITDEDKKDEPDAKKKKTP-------ELTKLWN-SKDNLEACKSAERDF 408
GC P K+P+ +KK D KK+ P EL++LW S N A +R
Sbjct: 367 GCAPFEKQPI------EKKTINDVTKKRRPRWRLGNKELSQLWKWSDQNPNALTDPQRVR 420
Query: 409 TPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
+P++ Y++ + MD +A +E +Y N+ Y W+ LR +R+
Sbjct: 421 SPAISDYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQ 464
>gi|440793057|gb|ELR14255.1| hypothetical protein ACA1_327400 [Acanthamoeba castellanii str.
Neff]
Length = 549
Score = 243 bits (620), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 157/484 (32%), Positives = 253/484 (52%), Gaps = 44/484 (9%)
Query: 73 MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN-LLRMCNDLLR 131
M S P+ L+ D F T+++C+Q F +E + + S NN LLR +++ +
Sbjct: 1 MIESAAPLELVRDFFLSHTIEECQQFFALLEDRADQLSRAIIGDSRANNTLLRALSEVRK 60
Query: 132 RLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEET 191
RLSR+ + VF GRIL+F+A FP S++SG+N+ E NV N+T D+E + ET E
Sbjct: 61 RLSRTNDLVFAGRILMFMAYAFPLSDKSGVNLKGEANVANVTTM--DKEPSPALEETPED 118
Query: 192 DTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ 251
+E +D+ FY+ FW LQ YF NP++ ++ +W++ T + +VL FK Y + + +
Sbjct: 119 KSEP----PVDWTFYQMFWGLQSYFLNPIKAFD--AWQVVTGHLNSVLDLFKQYPISEEE 172
Query: 252 SSL------NPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSR 302
+S NP Y KYLT+ +LL LQL D FRR++++QFL+ FQ +S
Sbjct: 173 ASAASYSYANPPAASECYLTKYLTSSRLLRLQLRDPWFRRHIMVQFLVFFQSLRVVSESS 232
Query: 303 GEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL 362
+L Q+ + T+ + + +TPP+G HF+ + ILK E +W WK EGCP
Sbjct: 233 TIAQKLTEQQKNEIAGLTKKTKTSLWETPPNGMHFTNTIYAILKRENNWTDWKKEGCPPF 292
Query: 363 KRPLTSITDEDKKDEPDA-----------KKKKTPELTKLWNSKDNLEACKSAERDFTPS 411
++P + EP A KKK L + + +D + ER S
Sbjct: 293 EKPPVAPL------EPAAVRAAREVRFNKKKKGLAALLEGESVRDLQALLQKEERGRMVS 346
Query: 412 LESYFEEAIQQMDPAAAVEEQYKKVN-DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEF 470
LE + E I++MDP AA+E QYKK+N D Y WRA RL ++ +F + K
Sbjct: 347 LEEWLEPVIEEMDPEAAIESQYKKINTDKIYVWRAKRLTFKRSLEYFC----TLAKQDAT 402
Query: 471 IENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEGDAD 530
+E++ K+ + K +S +GV + E + D ++++++E EAK + + D
Sbjct: 403 LEDVAKQLLANKGTS----TEDGVVLEDREEKRDADVDDVEDKESVVKIEAKHSEDQEGD 458
Query: 531 EVMS 534
+++S
Sbjct: 459 QLVS 462
>gi|115450229|ref|NP_001048715.1| Os03g0110400 [Oryza sativa Japonica Group]
gi|108705792|gb|ABF93587.1| THO complex subunit 1, putative, expressed [Oryza sativa Japonica
Group]
gi|113547186|dbj|BAF10629.1| Os03g0110400 [Oryza sativa Japonica Group]
gi|218191944|gb|EEC74371.1| hypothetical protein OsI_09688 [Oryza sativa Indica Group]
Length = 638
Score = 243 bits (619), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 166/550 (30%), Positives = 257/550 (46%), Gaps = 80/550 (14%)
Query: 35 ALEQSIRQYLLKLIKTPDIDIKVIENY-------------ISLCVELCMKDMCNSTLPII 81
+LE ++R+ L +L + K I Y + + + LC + +
Sbjct: 89 SLENALRKLLQELASSAVQSGKRIMQYGDNEENNCPITRLLDIVLYLCERGHVEGGMVFQ 148
Query: 82 LLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVF 141
LL D +MST+ C+ +F Y+E ++ +Q F K +LR CN LLRRLS++ + VF
Sbjct: 149 LLEDLTEMSTIKDCKDVFGYIESKQDVLGKQELFGRGKLVMLRTCNQLLRRLSKANDVVF 208
Query: 142 CGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKI 201
CGRI++FLA FFP SERS LNI FN N T++ + + D + +
Sbjct: 209 CGRIIMFLAHFFPLSERSALNIKGVFNTSNETKY----------------EKDATDGISV 252
Query: 202 DFNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLN----- 255
DFNFY WSLQ++F NP + N W+ F S VL+ F++ L D LN
Sbjct: 253 DFNFYNTLWSLQEHFSNPALTAANLTRWQKFVSNLTVVLSTFEAQPLSDDDGKLNNLDQE 312
Query: 256 PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
+ KYLT+ KL+ L+L D +FRR++L+Q LI F + + K+ EG +E
Sbjct: 313 EDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLIFFDFLKAPGKTDKEGPTGSMKEE-- 370
Query: 316 VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL-KRPLTSITDEDK 374
+ E V L++ PP G+ F Q ++ IL+ E++W WK +GC K+P E K
Sbjct: 371 IDSCEERVKKLLEIIPPKGKDFLQSIEHILEREKNWVWWKRDGCLAFEKQPF-----EKK 425
Query: 375 KDEPDAKKKK------TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAA 427
+ +K+K EL +LW ++ N A +ER PS+ Y++ + MDP+A
Sbjct: 426 PGQAGVRKRKPRWRLGNKELAQLWKWAEQNPNALTDSERICMPSVTEYWKPLAEDMDPSA 485
Query: 428 AVEEQYKKVNDSNYAWRALRL--------LSRKCPHFFLNATPN----VEKNSEFIE--- 472
+E++Y N+ Y W+ LR SR C + P E S+F
Sbjct: 486 GIEDEYHHKNNRVYCWKGLRFSARQDLEGFSRFCDYGIEGVVPQELLPPEVRSKFYSKPG 545
Query: 473 NMVKRCVKEKP---------------SSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEE 517
+ KR +E P + + G G+G + + V +D+ + I E K+
Sbjct: 546 DKAKRPKREDPKGTSAQPKEQQVVSATPETDGGGSGAEPEEGAVPMDSDNAAIDEGRKQS 605
Query: 518 DWEAKADPEG 527
E G
Sbjct: 606 PEEVSGPESG 615
>gi|168000859|ref|XP_001753133.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695832|gb|EDQ82174.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 566
Score = 242 bits (617), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 151/450 (33%), Positives = 224/450 (49%), Gaps = 50/450 (11%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENY-----------------ISLCVELCMKDM 73
D+ LE +R L +L+ + K + Y + + + LC K
Sbjct: 55 DENQLLENGLRTLLEELVGAAVANGKSVMQYGRAEGPEDDGNGLLTRLLDIVLYLCEKGN 114
Query: 74 CNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRL 133
+ LL D ++ST+ C+++F Y+E + F K +LR CN LLRRL
Sbjct: 115 VEGGMIFQLLEDLTEVSTIKDCKEVFGYIEGKQETLGKSELFSRGKLVMLRTCNQLLRRL 174
Query: 134 SRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDT 193
S++ + VFCGRIL+FLA FFP SERS +NI FN +SNET+
Sbjct: 175 SKTNDVVFCGRILMFLAHFFPLSERSAVNIKGVFN---------------TSNETKYEKE 219
Query: 194 EEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS 253
D IDF FYK FWSLQ+YF NP + W+ F+S VL F+ + + S
Sbjct: 220 PPGDSPAIDFTFYKTFWSLQEYFSNPTTLTHPSKWQTFSSNLGIVLETFEMQPIGEEDES 279
Query: 254 LNPSGD----YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELK 309
L+ GD + KYLT+ L+ L+L D FRR+VL+Q LI+F Y + K+ +G
Sbjct: 280 LSLDGDDDSTFNIKYLTSSNLMSLELKDPGFRRHVLVQCLIIFDYMKTPGKAEKDGPREG 339
Query: 310 SDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSI 369
+E +K + V L++ TPP G F ++ IL+ +++W WK +GC +R +
Sbjct: 340 VREE--LKVHEDRVKKLLRTTPPKGREFLASIEHILERDKNWVWWKRDGCIPFERFV--- 394
Query: 370 TDEDKKDEPDAKKKK------TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQ 422
++KK D K++ EL++LW S DN A A+R P L YF+ +
Sbjct: 395 --QEKKIPSDPPKRRPRWRLGNRELSRLWKWSDDNPNAITHAKRVKMPPLAEYFKPLAED 452
Query: 423 MDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
MDP A +EE+Y N+ Y W+ LR +R+
Sbjct: 453 MDPEAGIEEEYHHKNNKVYCWKGLRFAARQ 482
>gi|255578237|ref|XP_002529986.1| nuclear matrix protein, putative [Ricinus communis]
gi|223530509|gb|EEF32391.1| nuclear matrix protein, putative [Ricinus communis]
Length = 608
Score = 241 bits (614), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 142/406 (34%), Positives = 216/406 (53%), Gaps = 32/406 (7%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + + LC ++ + LL D +MST+ C+ +F Y+E +I +Q F
Sbjct: 81 IPRLLDVVLHLCEREHVEGGMIFQLLEDLTEMSTMKNCQDIFGYIESKQDILGKQELFAR 140
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI FN N T++
Sbjct: 141 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK 200
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
D +S +DFNFYK WSLQ+ F NP W FTS
Sbjct: 201 DPPAGIS----------------VDFNFYKTLWSLQENFCNPAPLTLAPTKWHKFTSSLM 244
Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
VL F++ L + + N + + KYLT+ KL+ L+L D +FRR++L+Q LILF
Sbjct: 245 VVLNTFEAQPLSEEEGDANNLEEEAATFNIKYLTSSKLMGLELKDPSFRRHILVQCLILF 304
Query: 293 QYFTSTVKSRGEGL--ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
Y + K+ + +K D ++ E V L++ TPP G+ F Q ++ +L+ E++
Sbjct: 305 DYLKAPGKNDKDSTSESMKED----IRTCEERVKKLLEMTPPKGKDFLQKIEHVLEREKN 360
Query: 351 WNQWKNEGC-PELKRPL--TSITDEDKKDEPDAKKKKTPELTKLWNSKD-NLEACKSAER 406
W WK +GC P K+P+ +I + KK +P + EL++LW D N A +R
Sbjct: 361 WVCWKRDGCQPFEKQPIENKTIQEGSKKRKPRWRLGNK-ELSQLWKWADQNPNALTDPQR 419
Query: 407 DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
TP++ Y++ + MDP+A +E +Y N+ Y W+ LR +R+
Sbjct: 420 VRTPAITEYWKPLAEDMDPSAGIEAEYHHKNNRVYCWKGLRFSARQ 465
>gi|148691057|gb|EDL23004.1| THO complex 1, isoform CRA_b [Mus musculus]
Length = 239
Score = 239 bits (610), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 120/222 (54%), Positives = 153/222 (68%), Gaps = 23/222 (10%)
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE------- 180
DLLRRLS+SQNTVFCGRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+
Sbjct: 4 DLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQK 63
Query: 181 --------MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFT 232
MDV E + + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F
Sbjct: 64 HTEDREEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFL 123
Query: 233 SYAETVLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYV 284
Y+E VLA FKSYKLDD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++
Sbjct: 124 KYSEEVLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHI 183
Query: 285 LLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSL 326
LLQ+LILFQY VK + L +Q W++DTT++VY +
Sbjct: 184 LLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQV 225
>gi|67969270|dbj|BAE00988.1| unnamed protein product [Macaca fascicularis]
Length = 296
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 121/246 (49%), Positives = 156/246 (63%), Gaps = 22/246 (8%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L + I I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
+ + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276
Query: 248 DDVQSS 253
DD Q+S
Sbjct: 277 DDTQAS 282
>gi|302781468|ref|XP_002972508.1| hypothetical protein SELMODRAFT_96989 [Selaginella moellendorffii]
gi|300159975|gb|EFJ26594.1| hypothetical protein SELMODRAFT_96989 [Selaginella moellendorffii]
Length = 583
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 142/408 (34%), Positives = 207/408 (50%), Gaps = 36/408 (8%)
Query: 57 VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
+I + + + LC K + LL D ++ST+ C+++F Y+E + + F
Sbjct: 127 LITRLLDIVLYLCEKSHVEGGMVFQLLEDLTEVSTIKDCKEVFAYIEGKQEVLGKTELFG 186
Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
K +LR CN LLRRLS++ + VFCGRIL+FLA FP ERS +NI FN N T +
Sbjct: 187 RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHVFPLYERSAVNIKGVFNTSNETNYE 246
Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAE 236
D+ D V IDFNFYK FW LQ++F NP + W +F++
Sbjct: 247 KDQP----------------DDVPIDFNFYKTFWGLQEHFSNPAALIPR--WHIFSASLT 288
Query: 237 TVLAAFKSYKL--DDVQSSLNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLIL 291
VL+AF + L DD S GD + KYLT+ L+ L+L D +FRR+VL+Q LI+
Sbjct: 289 IVLSAFDAQPLGDDDSTSHQLDEGDDMTFNTKYLTSSNLMSLELKDPSFRRHVLVQCLIV 348
Query: 292 FQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
F Y + K+ E E VK + V L++ TPP G+ F + ++ +L+ E++W
Sbjct: 349 FDYLKTPGKTEKEAARESLKDE--VKVLEDRVKRLLESTPPKGKDFLKSIEHLLEREKNW 406
Query: 352 NQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK------TPELTKLWNSKD-NLEACKSA 404
WK +GCP P E K KK+K EL++LW D N A
Sbjct: 407 VWWKRDGCP----PFEKNVSERKSSIEGTKKRKQRWRLGNKELSQLWKWADQNPNALTDP 462
Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
+R TP + Y+ + MDP + +E +Y ND Y W+ LR +R+
Sbjct: 463 QRVKTPPVHEYWTALAEDMDPMSGIEAEYSHKNDKVYCWKGLRFSARQ 510
>gi|320166707|gb|EFW43606.1| hypothetical protein CAOG_01650 [Capsaspora owczarzaki ATCC 30864]
Length = 618
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 153/486 (31%), Positives = 219/486 (45%), Gaps = 126/486 (25%)
Query: 52 DIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVN--IW 109
D + I + +E + + S++P+ LL+D D TLD C++LF VE W
Sbjct: 37 DATVATIGLLLDTALECSRRSVLTSSIPLALLTDALDSLTLDVCDKLFDLVEHRQTDPSW 96
Query: 110 KQQTFFMSCKN---------------------NLLRMCNDLLRRLSRSQNTVFCGRILLF 148
F S +L++CN+LLRRLS++Q+TVFCGRIL F
Sbjct: 97 TSTVFSTSAAPPSATPGVPAPTVIRGTFATNLQILQLCNELLRRLSKTQDTVFCGRILTF 156
Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKK 208
L FP SE+SGLNI FN + +F +E D S+ TEE ++
Sbjct: 157 LCSVFPLSEKSGLNIAGAFNGSSALDFLDEESADEVSSTTEE--------------IHRS 202
Query: 209 FWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGDYFAKYLTNQ 268
FWSLQ +F NP C+ +W T T
Sbjct: 203 FWSLQPFFSNPNSCFTADAWLSVTKAVTT------------------------------- 231
Query: 269 KLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIK 328
+L LQL+D+ FRR++L+Q LI+FQY K+R + L LK DQ K + + + + ++
Sbjct: 232 RLFQLQLNDSTFRRHMLVQILIIFQYLAVPSKAR-KTLVLKDDQLKLLFELRDKTMAALR 290
Query: 329 QTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-----------PLTSITD------ 371
PP G F+ V +L+ E WN WKNE CP +R P+ S D
Sbjct: 291 AVPPHGAVFAAAVGQVLESELDWNTWKNESCPPFERAPDSTVIVGHLPIASAADAVSQPN 350
Query: 372 -----------EDKKDE----PDA----KKKKTP--------------------ELTKLW 392
D+ DE P A KK T +LT LW
Sbjct: 351 AGSLAQSRKRAHDQVDENSEDPTAARHIAKKLTSDGRVPSSSSSGSGPVRMGQKDLTALW 410
Query: 393 NSKD-NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
N+ D +L + + +R F P +++FEEAI+QMDP A +++ Y K+ D +AWR+LRLL+R
Sbjct: 411 NANDESLASLQGVDRRFIPDPDAFFEEAIEQMDPDAQIDKVYWKIGDELFAWRSLRLLAR 470
Query: 452 KCPHFF 457
H F
Sbjct: 471 TRLHLF 476
>gi|302821808|ref|XP_002992565.1| hypothetical protein SELMODRAFT_135550 [Selaginella moellendorffii]
gi|300139634|gb|EFJ06371.1| hypothetical protein SELMODRAFT_135550 [Selaginella moellendorffii]
Length = 583
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 142/408 (34%), Positives = 207/408 (50%), Gaps = 36/408 (8%)
Query: 57 VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
+I + + + LC K + LL D ++ST+ C+++F Y+E + + F
Sbjct: 127 LITRLLDIVLYLCEKTHVEGGMVFQLLEDLTEVSTIKDCKEVFAYIEGKQEVLGKTELFG 186
Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
K +LR CN LLRRLS++ + VFCGRIL+FLA FP ERS +NI FN N T +
Sbjct: 187 RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHVFPLYERSAVNIKGVFNTSNETNYE 246
Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAE 236
D+ D V IDFNFYK FW LQ++F NP + W +F++
Sbjct: 247 KDQP----------------DDVPIDFNFYKTFWGLQEHFSNPAALIPR--WHIFSASLT 288
Query: 237 TVLAAFKSYKL--DDVQSSLNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLIL 291
VL+AF + L DD S GD + KYLT+ L+ L+L D +FRR+VL+Q LI+
Sbjct: 289 IVLSAFDAQPLGDDDSTSHQLDEGDDMTFNTKYLTSCNLMSLELKDPSFRRHVLVQCLIV 348
Query: 292 FQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
F Y + K+ E E VK + V L++ TPP G+ F + ++ +L+ E++W
Sbjct: 349 FDYLKTPGKTEKEAARESLKDE--VKVLEDRVKRLLESTPPKGKDFLKSIEHLLEREKNW 406
Query: 352 NQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK------TPELTKLWNSKD-NLEACKSA 404
WK +GCP P E K KK+K EL++LW D N A
Sbjct: 407 VWWKRDGCP----PFEKNVSERKSSIEGTKKRKQRWRLGNKELSQLWKWADQNPNALTDP 462
Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
+R TP + Y+ + MDP + +E +Y ND Y W+ LR +R+
Sbjct: 463 QRVKTPPVHEYWTALAEDMDPMSGIEAEYSHKNDKVYCWKGLRFSARQ 510
>gi|360042969|emb|CCD78379.1| putative nuclear matrix protein [Schistosoma mansoni]
Length = 465
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 130/304 (42%), Positives = 174/304 (57%), Gaps = 29/304 (9%)
Query: 182 DVSSNETEETD-TEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
D +S++ EE + T+ +++D Y KFWSLQ++F++P CY K W FTS +TVL
Sbjct: 28 DHTSDDLEEGEMTDSSTPLEVDAGLYVKFWSLQEFFKSPALCYEKAKWLRFTSSTDTVLD 87
Query: 241 AFKSYKLDDVQSS---LNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS 297
F S KL ++ S F+KYLT++KLLDLQL D +FRRY+++Q LILFQY T+
Sbjct: 88 VFSSIKLMAIEEGDICSQSSSGTFSKYLTSEKLLDLQLMDPSFRRYIIVQLLILFQYLTT 147
Query: 298 TVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPD---GEHFSQVVKLILKGEEHWNQW 354
TVK + L DQ WV E V L+ P+ G F V+ IL+ E +WN+W
Sbjct: 148 TVKFKTIDQVLSEDQRSWVNQRHEVVLRLLSSNSPNNSSGGTFVSTVERILERESYWNRW 207
Query: 355 KNEGCPELKR---------------PLTSITDEDKKDEPDAKKKKTPELTKLWN-SKDNL 398
KN+GCP R PL + T + + EL KLWN DNL
Sbjct: 208 KNDGCPSFIRTPEKSRLSVRKRHINPLVTRTGQK------VYRFGNRELDKLWNVCPDNL 261
Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFL 458
AC+ R F P L SYF++AI +MDPA VEE+YK +N ++WRALR LSRKCPHF++
Sbjct: 262 AACRDKRRVFNPDLHSYFQDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYI 321
Query: 459 NATP 462
N P
Sbjct: 322 NWNP 325
>gi|449506062|ref|XP_004162641.1| PREDICTED: THO complex subunit 1-like [Cucumis sativus]
Length = 483
Score = 231 bits (588), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 144/395 (36%), Positives = 207/395 (52%), Gaps = 38/395 (9%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + + LC K+ + LL D +MSTL C+ +F Y+E +I +Q F
Sbjct: 91 IPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFAR 150
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI FN N T++
Sbjct: 151 GKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY-- 208
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
+ + D IDFNFYK FWSLQ++F NP W+ FTS
Sbjct: 209 --------------EKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLM 254
Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
VL F + L D + N S + KYLT+ KL+ L+L D +FRR+VL+Q LILF
Sbjct: 255 VVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILF 314
Query: 293 QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWN 352
Y + K+ + +E +K E V L++ TPP G+ F Q ++ IL+ E +W
Sbjct: 315 DYLKAPGKNEKDIPSETMREE--IKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWV 372
Query: 353 QWKNEGC-PELKRPLTSITDEDKKDEPDAKKKKTP-------ELTKLWN-SKDNLEACKS 403
WK +GC P K+P+ +KK D KK+ P EL++LW S N A
Sbjct: 373 WWKRDGCAPFEKQPI------EKKTINDVTKKRRPRWRLGNKELSQLWKWSDQNPNALTD 426
Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVND 438
+R +P++ Y++ + MD +A +E +Y N+
Sbjct: 427 PQRVRSPAISDYWKPLAEDMDESAGIEAEYHHRNN 461
>gi|326427178|gb|EGD72748.1| hypothetical protein PTSG_04478 [Salpingoeca sp. ATCC 50818]
Length = 605
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 160/497 (32%), Positives = 238/497 (47%), Gaps = 77/497 (15%)
Query: 22 ALYKPTCDYDKKSALEQS--------IRQYLLKLIKTPD---IDIKVIENYISLCVELCM 70
A+ + T D D ++L+Q+ R+ L L+ D +D+ + +C+E C
Sbjct: 12 AVVQGTADTDSLASLKQAKAHIVGLAFRRVLFPLLVQRDEKCLDV------LRICIE-CF 64
Query: 71 KD-----MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRM 125
K +T+ I L D FD+ + CE++F +E N +Q F + ++R+
Sbjct: 65 KREFIAVAATNTVLISLFEDAFDVLPVSMCERVFTLMEQNA---QQPAFVKANGPRVVRL 121
Query: 126 CNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG-GDEEMDVS 184
N L +RLSRSQ+ GR+L+F+A FP S++SG NI S NV N T + +EE+D S
Sbjct: 122 INGLKKRLSRSQDIELSGRLLMFIASIFPMSDKSGANISSAVNVANKTHYDETEEELDTS 181
Query: 185 SNETEETDTEEVDKVKIDFNF----YKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
++E + E++D + YK+FWSLQ +F P CY WK F E VL
Sbjct: 182 ASEAAKAQ-EDIDMPSLGRPLNKSDYKQFWSLQRFFCEPRLCYQPDEWKRFEKATECVLD 240
Query: 241 AFKSYKLDDVQSSLNPSG---------------------------------DYFAKYLTN 267
F+ + L+ + P+ FAKYLT+
Sbjct: 241 LFERHPLEVSPVTTQPTAASMMDTNDDNGDGDGDGDDGDDDGPSQKQRRADTTFAKYLTS 300
Query: 268 QKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLI 327
KL +LQL+D RR +LLQ LILF Y K + L S Q++W+ V L+
Sbjct: 301 PKLFELQLNDVTLRREILLQLLILFSYLPRDSKFKKPAETLTSAQKEWMDKRQRQVEKLM 360
Query: 328 KQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK--- 384
K DG F K +L+ E W QWK +GC +P +K
Sbjct: 361 KAAGSDGADFLTNAKRMLERELVWVQWKEDGC----KPFEKKESAKRKAPSPPAPGAAGV 416
Query: 385 ---TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSN 440
+ EL +LW S N+EACK A + PSL+ YF +A+++MDPA VEE+Y ++ND
Sbjct: 417 HMGSDELQRLWTLSSTNMEACKRA-KVRVPSLKEYFLDAVEEMDPAECVEEEYWQINDFR 475
Query: 441 YAWRALRLLSRKCPHFF 457
+ W+ LRL+SRK + F
Sbjct: 476 FNWKGLRLISRKKVNLF 492
>gi|9758964|dbj|BAB09407.1| unnamed protein product [Arabidopsis thaliana]
Length = 588
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 161/499 (32%), Positives = 239/499 (47%), Gaps = 91/499 (18%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + + + LC K+ + LL D +MST+ C+ +F Y+E +I +Q F
Sbjct: 84 IPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKDVFGYIESKQDILGKQELFAR 143
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERSG
Sbjct: 144 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSG----------------- 186
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVS-WKMFTSYAE 236
+ +DFNFYK FWSLQ+YF NP + + W+ F+S
Sbjct: 187 ---------------------ISVDFNFYKTFWSLQEYFCNPASLTSASTKWQKFSSSLA 225
Query: 237 TVLAAFKSYKLDDVQSSLNP----SGDYFAKYLTNQKLLDL-----------QLSDTNFR 281
VL F + L + + N + + KYLT+ KL+ L QL D++FR
Sbjct: 226 VVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSKLMGLEFTHFFFYLSPQLKDSSFR 285
Query: 282 RYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVV 341
R++LLQ LI+F Y + K+ + L ++ +E+ +K + V L++ TPP G+ F + V
Sbjct: 286 RHILLQCLIMFDYLRAPGKNDKD-LPSETMKEE-LKSCEDRVKKLLEITPPKGKEFLRAV 343
Query: 342 KLILKGEEHWNQWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKK--------TPELTKLW 392
+ IL+ E++W WK +GCP K+P+ DKK P+A +KK EL++LW
Sbjct: 344 EHILEREKNWVWWKRDGCPPFEKQPI------DKKS-PNAGQKKRRQRWRLGNKELSQLW 396
Query: 393 NSKD-NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
D N A ++R TP + Y++ + MDP+A +E++Y N+ Y W+ LR +R
Sbjct: 397 RWADQNPNALTDSQRVRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFTAR 456
Query: 452 KCPHFFLNAT--------------PNVEKNSEFIEN-MVKRCVKEK---PSSQISGNGNG 493
+ F T P V + N KR KE+ S + GN G
Sbjct: 457 QDLEGFSRFTEMGIEGVVPVELLPPEVRSKYQAKPNEKAKRAKKEETKGGSHETEGNQIG 516
Query: 494 VDQDPAEVEVDTKSEEIQE 512
V AE E E E
Sbjct: 517 VSNSEAEAEGGRGDAETME 535
>gi|326497285|dbj|BAK02227.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 470
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/349 (36%), Positives = 186/349 (53%), Gaps = 36/349 (10%)
Query: 57 VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
+I + + + LC + + LL D DMST+ C+ +F Y+E ++ +Q F
Sbjct: 121 LITRLLDIMLYLCERGHVEGGMVFQLLEDLTDMSTIKDCKDVFGYIESKQDVLGKQELFG 180
Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
K +LR CN LLRRLS+S + VFCGRI++FLA FFP SERS LNI FN N T++
Sbjct: 181 RGKLVMLRTCNQLLRRLSKSNDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNETKY- 239
Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYA 235
+ + D + +DFNFY+ WSLQ++FRNP + N W+ F+S
Sbjct: 240 ---------------EKDATDGISVDFNFYQTLWSLQEHFRNPALTTTNPTKWQKFSSNL 284
Query: 236 ETVLAAFKSYKLDDVQSSLN-----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLI 290
VL+ F++ L D N + KYLT+ KL+ L+L D +FRR++L+Q LI
Sbjct: 285 TVVLSTFEAQPLSDADGKHNNLEQEEDAAFNIKYLTSSKLMGLELKDASFRRHILVQCLI 344
Query: 291 LFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
F Y + KS +G +E +K E V +L++ PP G+ F Q ++ IL+ E++
Sbjct: 345 FFDYLKAPGKSDKDGPSESMKEE--IKSCEERVKNLLEMIPPKGKEFLQSIEHILEREKN 402
Query: 351 WNQWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKK------TPELTKLW 392
W WK +GCP K+P E K + A+K+K EL +LW
Sbjct: 403 WVWWKRDGCPAFEKQP-----SEKKPGQAGARKRKPRWRLGNKELAQLW 446
>gi|344235559|gb|EGV91662.1| THO complex subunit 1 [Cricetulus griseus]
Length = 350
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 111/227 (48%), Positives = 146/227 (64%), Gaps = 14/227 (6%)
Query: 271 LDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQT 330
+DLQLSD+NFRR++LLQ+LILFQY VK + L +Q W++DTT++VY L+ +
Sbjct: 1 MDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSEN 60
Query: 331 PPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDED----KKDEPDAKKKKTP 386
PPDGE FS++V+ IL EE+WN WKNEGCP + S T K+ P+ K P
Sbjct: 61 PPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERASDTKPTRVVRKRAAPEDFLGKGP 120
Query: 387 ---------ELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKV 436
ELT+LWN DN+EACKS R++ P+LE +FEEAI+Q DP VE +YK V
Sbjct: 121 NKKILIGNEELTRLWNLCPDNMEACKSETREYMPTLEEFFEEAIEQADPENMVESEYKAV 180
Query: 437 NDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
N+SNY WRALRLL+R+ PHFF + E++ENMV + KE P
Sbjct: 181 NNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLENMVIKLAKELP 227
>gi|222624056|gb|EEE58188.1| hypothetical protein OsJ_09127 [Oryza sativa Japonica Group]
Length = 643
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 161/551 (29%), Positives = 250/551 (45%), Gaps = 86/551 (15%)
Query: 35 ALEQSIRQYLLKLIKTPDIDIKVIENY-------------ISLCVELCMKDMCNSTLPII 81
+LE ++R+ L +L + K I Y + + + LC + +
Sbjct: 98 SLENALRKLLQELASSAVQSGKRIMQYGDNEENNCPITRLLDIVLYLCERGHVEGGMVFQ 157
Query: 82 LLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVF 141
LL D +MST+ C+ +F Y+E ++ +Q F K +LR CN LLRRLS++ + VF
Sbjct: 158 LLEDLTEMSTIKDCKDVFGYIESKQDVLGKQELFGRGKLVMLRTCNQLLRRLSKANDVVF 217
Query: 142 CGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKI 201
CGRI++FLA FFP SERS LNI FN N T++ +K
Sbjct: 218 CGRIIMFLAHFFPLSERSALNIKGVFNTSNETKY---------------------EKDAT 256
Query: 202 DFNFYKKFWSL-QDYFRNP-VQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLN---- 255
D N +K L ++F NP + N W+ F S VL+ F++ L D LN
Sbjct: 257 DVNCSRKRIELPNEHFSNPALTAANLTRWQKFVSNLTVVLSTFEAQPLSDDDGKLNNLDQ 316
Query: 256 -PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEK 314
+ KYLT+ KL+ L+L D +FRR++L+Q LI F + + K+ EG +E
Sbjct: 317 EEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLIFFDFLKAPGKTDKEGPTGSMKEE- 375
Query: 315 WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL-KRPLTSITDED 373
+ E V L++ PP G+ F Q ++ IL+ E++W WK +GC K+P E
Sbjct: 376 -IDSCEERVKKLLEIIPPKGKDFLQSIEHILEREKNWVWWKRDGCLAFEKQPF-----EK 429
Query: 374 KKDEPDAKKKK------TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPA 426
K + +K+K EL +LW ++ N A +ER PS+ Y++ + MDP+
Sbjct: 430 KPGQAGVRKRKPRWRLGNKELAQLWKWAEQNPNALTDSERICMPSVTEYWKPLAEDMDPS 489
Query: 427 AAVEEQYKKVNDSNYAWRALRL--------LSRKCPHFFLNATPN----VEKNSEFIE-- 472
A +E++Y N+ Y W+ LR SR C + P E S+F
Sbjct: 490 AGIEDEYHHKNNRVYCWKGLRFSARQDLEGFSRFCDYGIEGVVPQELLPPEVRSKFYSKP 549
Query: 473 -NMVKRCVKEKP---------------SSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKE 516
+ KR +E P + + G G+G + + V +D+ + I E K+
Sbjct: 550 GDKAKRPKREDPKGTSAQPKEQQVVSATPETDGGGSGAEPEEGAVPMDSDNAAIDEGRKQ 609
Query: 517 EDWEAKADPEG 527
E G
Sbjct: 610 SPEEVSGPESG 620
>gi|224072965|ref|XP_002303943.1| predicted protein [Populus trichocarpa]
gi|222841375|gb|EEE78922.1| predicted protein [Populus trichocarpa]
Length = 611
Score = 214 bits (546), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 159/518 (30%), Positives = 244/518 (47%), Gaps = 83/518 (16%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
I + + + LC +D + LL D +MST+ C+ +F Y+E +I +Q F
Sbjct: 78 IPRLLDVVLYLCERDFVEGGMIFQLLEDLTEMSTMRNCKDIFGYIESKQDILGKQELFAR 137
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI FN N
Sbjct: 138 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN------ 191
Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSL--QDYFRNPVQCYNKVSWKMFTSYA 235
ET E+ + K L Q+YF +P + + W+ F+
Sbjct: 192 ------------ETKYEKEPPAATCCMYSDKLVCLLFQEYFCDPSLTLSPIKWQKFSLSL 239
Query: 236 ETVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLIL 291
+L AF++ L + + S N + + KYLT+ KL+ L+L D +FRR+VL+Q LIL
Sbjct: 240 MVILNAFEAQPLSEEEGSANNLEEEAASFNIKYLTSSKLMGLELKDPSFRRHVLVQCLIL 299
Query: 292 FQYFTSTVKSRGEGLELKSDQ--------------------EKWVKDTTETVYSLIKQTP 331
F Y + K+ + L S+ ++ +K E V L++ TP
Sbjct: 300 FDYLKAPGKNDKD---LTSESMVSAVPLLILILSALNSCLCKEEIKSREEHVKKLLEMTP 356
Query: 332 PDGEHFSQVVKLILKGEEHWNQWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKKTP---- 386
P G+ F V+ IL+ E++W WK +GCP K+P+ + T + D KK+ P
Sbjct: 357 PKGKDFLHKVEHILEREKNWLWWKRDGCPPFEKQPIENKTVQ------DGGKKRRPRWRL 410
Query: 387 ---ELTKLWNSKD-NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYA 442
EL++LW D N A +R TP++ Y++ + MDP+A++E Y N+ Y
Sbjct: 411 GNKELSQLWKWADQNPNALTDPQRVRTPAITDYWKPLAEDMDPSASIEADYHHKNNRVYC 470
Query: 443 WRALRLLSRKCPHFFLNAT--------------PNVEKNSEFIEN-MVKRCVKEKPSSQI 487
W+ LR+ +R+ F T P+V + N KR K++P
Sbjct: 471 WKGLRVSARQDLDGFSRFTDHGIEGVVPLELLPPDVRSKHQAKPNDRSKRAKKDEP---- 526
Query: 488 SGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADP 525
G + V+ + +V + T + EI E D EA P
Sbjct: 527 KGASHQVEDN--QVSIATPASEIDGEGIRTDLEASVTP 562
>gi|156354930|ref|XP_001623433.1| predicted protein [Nematostella vectensis]
gi|156210130|gb|EDO31333.1| predicted protein [Nematostella vectensis]
Length = 433
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 106/212 (50%), Positives = 143/212 (67%), Gaps = 7/212 (3%)
Query: 29 DYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFD 88
D +KK+A +QS R L+K D D++ ++ I+L ++ C++TLP+ILLSD FD
Sbjct: 42 DGEKKNAADQSFRD-LVKESLLSDEDVETQKHLITLAIKAAAHGTCSTTLPVILLSDVFD 100
Query: 89 MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
TLD+C+ LF +VE V+ W FF S KN LLRMCNDLLRRLS SQNTVFCGRI LF
Sbjct: 101 TITLDRCDSLFTFVEEQVSTWTMPIFFSSGKNLLLRMCNDLLRRLSASQNTVFCGRIQLF 160
Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKV------KID 202
LA+ FP SE+SGLN++S+FN++N+T + E + S N+ E D E ++ +D
Sbjct: 161 LARLFPLSEKSGLNLMSQFNLDNVTTYATTTEEEDSVNDMETGDGTEAGELVQSNLTPVD 220
Query: 203 FNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSY 234
+N Y+KFW+LQDYFRNP QCY V WK F+ +
Sbjct: 221 YNLYRKFWALQDYFRNPTQCYTTVGWKTFSKH 252
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/164 (40%), Positives = 94/164 (57%), Gaps = 21/164 (12%)
Query: 342 KLILKGEEHWNQWKNEGCPEL------------------KRPLTSITDEDKKDEPDAKKK 383
K IL+ EE+W WKNEGC KR T E+ E +KK
Sbjct: 251 KHILEREENWISWKNEGCLSFIKEKEKEAHDVVKGPKAPKRKRTKSIGEEINAEWQSKKL 310
Query: 384 K--TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSN 440
+ + ELT+LWN DNLEAC + +R F PSLE +FEEAI+Q DP VE++YK + SN
Sbjct: 311 QMGSSELTRLWNLCSDNLEACSAEKRVFLPSLEEFFEEAIEQADPDGMVEDEYKLIKKSN 370
Query: 441 YAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPS 484
+ WRALRLL+R+ HFF ++ + +++E++V + +E P+
Sbjct: 371 FGWRALRLLARRSQHFFQPSSNPFKPLPDYLESVVSQLAQELPT 414
>gi|24431597|gb|AAN61477.1| Unknown protein [Oryza sativa Japonica Group]
Length = 653
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 160/568 (28%), Positives = 251/568 (44%), Gaps = 101/568 (17%)
Query: 35 ALEQSIRQYLLKLIKTPDIDIKVIENY-------------ISLCVELCMKDMCNSTLPII 81
+LE ++R+ L +L + K I Y + + + LC + +
Sbjct: 89 SLENALRKLLQELASSAVQSGKRIMQYGDNEENNCPITRLLDIVLYLCERGHVEGGMVFQ 148
Query: 82 LLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVF 141
LL D +MST+ C+ +F Y+E ++ +Q F K +LR CN LLRRLS++ + VF
Sbjct: 149 LLEDLTEMSTIKDCKDVFGYIESKQDVLGKQELFGRGKLVMLRTCNQLLRRLSKANDVVF 208
Query: 142 CGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKI 201
CGRI++FLA FFP SERS LNI FN N T++ +K
Sbjct: 209 CGRIIMFLAHFFPLSERSALNIKGVFNTSNETKY---------------------EKDAT 247
Query: 202 DFNFYKKFWSL-QDYFRNP-VQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLN---- 255
D N +K L ++F NP + N W+ F S VL+ F++ L D LN
Sbjct: 248 DVNCSRKRIELPNEHFSNPALTAANLTRWQKFVSNLTVVLSTFEAQPLSDDDGKLNNLDQ 307
Query: 256 -PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEG--------- 305
+ KYLT+ KL+ L+L D +FRR++L+Q LI F + + K+ EG
Sbjct: 308 EEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLIFFDFLKAPGKTDKEGPTGSMLNRL 367
Query: 306 --------LELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNE 357
+ ++ + E V L++ PP G+ F Q ++ IL+ E++W WK +
Sbjct: 368 LSLISILLTACATGHKEEIDSCEERVKKLLEIIPPKGKDFLQSIEHILEREKNWVWWKRD 427
Query: 358 GCPEL-KRPLTSITDEDKKDEPDAKKKK------TPELTKLWN-SKDNLEACKSAERDFT 409
GC K+P E K + +K+K EL +LW ++ N A +ER
Sbjct: 428 GCLAFEKQPF-----EKKPGQAGVRKRKPRWRLGNKELAQLWKWAEQNPNALTDSERICM 482
Query: 410 PSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRL--------LSRKCPHFFLNAT 461
PS+ Y++ + MDP+A +E++Y N+ Y W+ LR SR C +
Sbjct: 483 PSVTEYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFSARQDLEGFSRFCDYGIEGVV 542
Query: 462 PN----VEKNSEFIE---NMVKRCVKEKP---------------SSQISGNGNGVDQDPA 499
P E S+F + KR +E P + + G G+G + +
Sbjct: 543 PQELLPPEVRSKFYSKPGDKAKRPKREDPKGTSAQPKEQQVVSATPETDGGGSGAEPEEG 602
Query: 500 EVEVDTKSEEIQEEEKEEDWEAKADPEG 527
V +D+ + I E K+ E G
Sbjct: 603 AVPMDSDNAAIDEGRKQSPEEVSGPESG 630
>gi|384252374|gb|EIE25850.1| hypothetical protein COCSUDRAFT_64866 [Coccomyxa subellipsoidea
C-169]
Length = 842
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 142/481 (29%), Positives = 220/481 (45%), Gaps = 64/481 (13%)
Query: 33 KSALEQSIRQYLLKLIKTPDIDIKV--------------IENYISLCVELCMKDMCNSTL 78
++ LE S+RQ L+K+ ++ V I + L V L + ++ L
Sbjct: 55 RTILEHSVRQACASLVKS-AVEHGVNSTQYRHAGASEPSIPRLLDLTVYLGSQRQIDTVL 113
Query: 79 PIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSC----KNNLLRMCNDLLRRLS 134
++ D ++ST+++ E +F Y+E + +Q F S K ++LR+CN LLRRLS
Sbjct: 114 VFAIMEDVMELSTIEEAEGVFGYLESRIQKLRQVYNFGSATSHSKLSVLRICNQLLRRLS 173
Query: 135 RSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTE 194
R+ ++ CGRIL+FL + P +RSGLN+ +FN + T E V SN
Sbjct: 174 RASSSELCGRILMFLTRLLPLLDRSGLNVQGQFNTSHSTPVEDVPEGAVDSNGQ------ 227
Query: 195 EVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL------- 247
+D FY FW LQD F++P W + + VL F +
Sbjct: 228 -----PVDAAFYATFWGLQDVFQHPYTAMEPSKWAATITDIKRVLTEFTKQPVALASSSG 282
Query: 248 DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRG---- 303
+ + L + KYL++ KL+ LQL D RR+ LLQ LI + V G
Sbjct: 283 GEGEGGLLEAAALTVKYLSSPKLMRLQLQDATLRRHFLLQALIFLHACQNPVLKPGAAAA 342
Query: 304 ---EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
+ L+S Q ++ +Y+ +++TP G HF V+ IL E W +WK+EGC
Sbjct: 343 QQAKKEALRSKQVLEAEELEGLLYAELQRTPKHGTHFCAAVRTILTRETQWTKWKSEGCQ 402
Query: 361 ELKR----PLTSITDEDKK-----------DEPDAKKKKTPELTKLWN-SKDNLEACKS- 403
+R PL+ + +P+ + L+ LWN ++DN K
Sbjct: 403 GYERKEVAPLSPLAPSTALQAATAATAATVSDPNYVDTGSQALSTLWNITRDNTSCLKGD 462
Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKC-PHF--FLNA 460
+ TPSL+ + I +MDP +EEQYK D Y W++LR ++R P F L A
Sbjct: 463 TNQRSTPSLQEFLAPVILEMDPEEGIEEQYKTKKDEVYTWKSLRQIARASLPVFSKMLTA 522
Query: 461 T 461
T
Sbjct: 523 T 523
>gi|345324596|ref|XP_001508566.2| PREDICTED: THO complex subunit 1-like [Ornithorhynchus anatinus]
Length = 287
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 113/242 (46%), Positives = 147/242 (60%), Gaps = 23/242 (9%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
P + +KK L+Q+ R L LI I+ EN ISL + + +C ++ P +L
Sbjct: 41 PGSENEKKCTLDQAFRGVLEDLI----INQASCENVLAIISLAIGGVTEGICTASTPFVL 96
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
L D D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156
Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEETDT-------- 193
GRI LFLA+ FP SE+SGLN+ S+FN+ENIT F +E E + E+ +
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENITVFNTNEQESTLGQKHVEDKEEGMEVEEGE 216
Query: 194 ---EEVD---KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTS-YAETVLAAFKSYK 246
+EV + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F ++ LA Y+
Sbjct: 217 MGDDEVPTNCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKLFSSRGLAGPAVYE 276
Query: 247 LD 248
D
Sbjct: 277 HD 278
>gi|328875843|gb|EGG24207.1| putative THO1 protein [Dictyostelium fasciculatum]
Length = 653
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 148/439 (33%), Positives = 221/439 (50%), Gaps = 50/439 (11%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
D ++ +E SIRQ + ++ + I + L +E+ +S L L D FD
Sbjct: 65 DLRTNIELSIRQLFSDVFRSKSMKKDDINITLQLAIEIADLGYIDSYLIAQLAEDLFDSK 124
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMS-CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFL 149
++ + + F +E + + Q F +S KN LL++C DLL+RLS+S N CGRIL+FL
Sbjct: 125 SISESIEFFSLLESRADYFSQDQFMVSRTKNMLLKICIDLLKRLSKSTNPDSCGRILIFL 184
Query: 150 AKFFPFSERSGLN------IISEFNVENITEFGGDEEMDVSSNETEETDTE------EVD 197
A FP S+ SGLN + E ++ NI +F EET E +
Sbjct: 185 AFIFPLSDPSGLNPKSLHAVHGELDLTNIMDF-------------EETSPEGKDKDKDKY 231
Query: 198 KVKIDFNFYKKFWSLQDYFRNPVQCY-------------NKVSWKMFTSYAETVLAAFKS 244
K ID NFYK+FW LQ YF+ P Q + NK W F E VL F +
Sbjct: 232 KTTIDLNFYKQFWGLQSYFQKPFQLFNNANAVNTAALALNKDKWSHFMESIEVVLTTFAT 291
Query: 245 YKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT-STVKSRG 303
+ D ++++ S YF KYLT+ L+ LQL D+ FRR +L Q LI FQ+ +T+KS
Sbjct: 292 HINLDELATMSKSH-YFTKYLTSSNLIKLQLKDSTFRRNILTQLLITFQFIEFNTIKSPN 350
Query: 304 EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWK-NEGCPEL 362
L Q++ + T +I+QT P+GEHF + + LILK E +W WK + GC
Sbjct: 351 ---LLNDSQKQLLTTLTNKAKKIIEQTQPNGEHFMKSLALILKRETNWLTWKMDGGCKAF 407
Query: 363 KRPLT-SITDEDKKDEPDAKK--KKTPELTKLWN-SKDNLEACKS-AERDFTPSLESYFE 417
RP SI + +K + K T EL++LWN S DN E+ K+ + + + ++
Sbjct: 408 GRPAAQSIVVKKRKLRKTSTKVMMGTQELSRLWNLSSDNTESLKTEIKLNVMDYIAPMYD 467
Query: 418 EAIQQMDPAAAVEEQYKKV 436
E IQQ + A +E+ ++
Sbjct: 468 EMIQQEEAAKKEQERLARL 486
>gi|440793799|gb|ELR14970.1| THO1 protein (nuclear matrix protein p84), putative, partial
[Acanthamoeba castellanii str. Neff]
Length = 451
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 174/314 (55%), Gaps = 18/314 (5%)
Query: 62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN 121
I + + L + M S P+ L+ D F T+++C+Q F +E + + + NN
Sbjct: 116 IDIALWLGREKMIESAAPLELVRDFFLSHTIEECQQFFALLEDRADQLSRAIVGDTRANN 175
Query: 122 -LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE 180
LLR +++ +RLSR+ + VF GRIL+F+A FP S++SG+N+ E NV N+T D+E
Sbjct: 176 TLLRALSEVRKRLSRTNDLVFAGRILMFMAYAFPLSDKSGVNLKGEANVANVTTM--DKE 233
Query: 181 MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
+ ET E +E +D+ FY+ FW LQ YF NP++ + +W++ T + +VL
Sbjct: 234 PSPALEETPEDKSEP----PVDWTFYQMFWGLQSYFLNPIKAFE--AWQVVTGHLNSVLD 287
Query: 241 AFKSYKLDDVQSSL------NPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLIL 291
FK Y + + ++S NP Y KYLT+ +LL LQL D FRR++++QFL+
Sbjct: 288 LFKQYPISEEEASAASYSYANPPAASECYLTKYLTSSRLLRLQLRDPWFRRHIMVQFLVF 347
Query: 292 FQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
FQ +S +L Q+ + T+ + + +TPP+G HF+ + ILK E +W
Sbjct: 348 FQSLRVVSESSTVAQKLTEQQKNEIAGLTKKTKTSLWETPPNGMHFTNTIYAILKRENNW 407
Query: 352 NQWKNEGCPELKRP 365
WK EGCP ++P
Sbjct: 408 TDWKKEGCPPFEKP 421
>gi|268568954|ref|XP_002648141.1| C. briggsae CBR-THOC-1 protein [Caenorhabditis briggsae]
Length = 762
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 153/542 (28%), Positives = 240/542 (44%), Gaps = 117/542 (21%)
Query: 60 NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQ-------Q 112
N++ C +L C+ + PI+ L D ++S++++C+ LF VE N+N +KQ Q
Sbjct: 62 NFVMNCTKL---GYCSKSTPILTLQDILEVSSIEECKHLFGLVEENMNEFKQVKWLEMDQ 118
Query: 113 TFFMSCKNN-----------------LLRM---------CNDLLRRLSRSQNTVFC---- 142
F S K L R C +L R S ++ ++F
Sbjct: 119 NFPDSSKRTKTSFSASATTFSVDSAELRRRRFVEESCSSCPELENRFSCTKMSIFSIFRP 178
Query: 143 ---GRILLFLAK-----------------FFPFSERSGLNIISEFNVENITEFGGDEEMD 182
+I F K F P +E+SG+N + FN NIT F +E
Sbjct: 179 ENGPKIRKFEPKTQFADLITPQKMPKSRRFLPLTEKSGVNFMGHFNTLNITTFEENEPTA 238
Query: 183 VS-------------SNETEETDTEEVD----------KVKIDFNFYKKFWSLQDYFRNP 219
S S T + E+++ ++ + Y++FWSLQ + NP
Sbjct: 239 ESLLAATSSHHAATPSGATPGAEGEDMETGEIEEEAAKEIIVTPEMYRQFWSLQKFMSNP 298
Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLDD---------VQSSLNPSGD-YFAKYLTNQK 269
Y K + +F VL+ + KL+ +S + D +F KYLT+ K
Sbjct: 299 NLLYEKEKFVVFKKDLTAVLSLMTANKLEKFAAEDESEVAESRKKINNDVFFTKYLTSPK 358
Query: 270 LLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEG-----------------LELKSDQ 312
LL LQL+D++FRR L+Q +I+FQY + + + + L +Q
Sbjct: 359 LLALQLNDSSFRRNFLIQTIIIFQYLNAESRFKPPANLKIPQNSPQNPIFSRKMVLNEEQ 418
Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL--KRPLTSIT 370
K+V D + Y L+ T P G F +K I++ E WNQWKN CP+L K ++
Sbjct: 419 SKFVHDYEDKCYRLLSDTMPRGTAFVAGLKRIMQRETEWNQWKNSNCPDLSEKADKGAMQ 478
Query: 371 DEDKKDE----PDAKKKKTPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDP 425
K+ P+ TPELTKLW D L+ACKS +R F P+L + + + +MDP
Sbjct: 479 MYKKRQRVPFNPNNLDLGTPELTKLWTYEPDVLKACKSDKRKFIPTLADFIRDPLDEMDP 538
Query: 426 AAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSS 485
VEEQYK +N+ ++ WRA RLL K P + P + + + N R + + +S
Sbjct: 539 EQQVEEQYKSINEVSFQWRAARLLMHKAPGYVSKTEPKADPTTNMLWNRGTREKRRRETS 598
Query: 486 QI 487
++
Sbjct: 599 RV 600
>gi|390368636|ref|XP_786383.3| PREDICTED: THO complex subunit 1-like [Strongylocentrotus
purpuratus]
Length = 601
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 124/311 (39%), Positives = 163/311 (52%), Gaps = 59/311 (18%)
Query: 219 PVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP--------------------SG 258
P CY+ + A VL AF S KL+D+ S SG
Sbjct: 159 PFLCYHDQN-------AIAVLEAFHSMKLEDIASGKKKKKRRSHHRQQQKQGEEEEADSG 211
Query: 259 DY--FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWV 316
D FAKYLT++KL DLQLSD FRRYV +QFLILFQY VK + L + ++
Sbjct: 212 DQVSFAKYLTSEKLFDLQLSDAMFRRYVYVQFLILFQYLNQQVKFKQAHHVLTDEMSLFI 271
Query: 317 KDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP---------ELKRPLT 367
K+T + V +L+K+TPP G+ F+ ++ IL EEHWN WKNEGCP E RP +
Sbjct: 272 KETKDKVVALLKETPPYGDQFTAGIEHILSREEHWNAWKNEGCPSYVKEKGKTEQARPKS 331
Query: 368 SITDEDKKDE---PDAKKK---KTPELTKLWNS-KDNLEACKSAERDFTPSLESYFEEAI 420
+E A KK +PELTKLWN NLEAC +AE P
Sbjct: 332 RARKRTLGEELNLAGAAKKIDLGSPELTKLWNVYPSNLEAC-AAEDRVVPD--------- 381
Query: 421 QQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT---PNVEKNSEFIENMVKR 477
Q DP A +E +YK VN+SNYAW+A+RLL+++ PHFF T P ++ ++E MV +
Sbjct: 382 -QADPEAMIEPEYKVVNNSNYAWQAIRLLAKRSPHFFQTVTATNPPLKTVPGYLEYMVTK 440
Query: 478 CVKEKPSSQIS 488
KE P ++ S
Sbjct: 441 IAKELPQNEES 451
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 57/103 (55%), Gaps = 1/103 (0%)
Query: 29 DYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFD 88
D +KK+AL+Q+ R Y+ KL+ + + L E +C+ P +LL+D FD
Sbjct: 44 DAEKKTALDQAYRNYVKKLV-VGGSSCSDFTSLVHLSTEAVEAGLCSPATPFLLLTDIFD 102
Query: 89 MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLR 131
T+ C+ +F +VE V WKQ F+ S KN LLRMCN L+
Sbjct: 103 SVTISVCKDVFQFVEDRVATWKQSDFYSSGKNYLLRMCNGRLQ 145
>gi|147775061|emb|CAN77205.1| hypothetical protein VITISV_014783 [Vitis vinifera]
Length = 507
Score = 187 bits (476), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 131/412 (31%), Positives = 185/412 (44%), Gaps = 111/412 (26%)
Query: 89 MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
MST+ C+ +F Y+E +I +Q F K +LR CN LLRRLS++ + VFCGRIL+F
Sbjct: 1 MSTMRNCKDIFAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMF 60
Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKK 208
LA FFP SERS +NI FN N T++ E D E + IDFNFYK
Sbjct: 61 LAHFFPLSERSAVNIKGVFNTSNETKY--------------EKDAPE--GISIDFNFYKT 104
Query: 209 FWSLQ----------------------------------DYFRNPVQ-CYNKVSWKMFTS 233
FWSLQ ++F NP W+ FTS
Sbjct: 105 FWSLQXLDWGCSGEXKTYMLFGKFGPDGGYMFRDQSVNVEHFCNPASISLAPTKWQKFTS 164
Query: 234 YAETVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
VL F++ L D + + N + + KYLT+ KL+ L+L D +FRR++L+Q L
Sbjct: 165 NLMVVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL 224
Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
ILF Y + K+ +L SD W
Sbjct: 225 ILFDYLKAPGKNDK---DLPSDSMVW---------------------------------- 247
Query: 350 HWNQWKNEGCPELKR-PLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEA 400
WK +GCP +R P+ +KK D KK+ P EL++LW D N A
Sbjct: 248 ----WKRDGCPPFERQPI------EKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNA 297
Query: 401 CKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
+R TP++ Y++ + MD +A +E +Y N+ Y W+ LR +R+
Sbjct: 298 LTDPQRVRTPAVSEYWKPLAEDMDLSAGIEAEYHHKNNRVYCWKGLRFAARQ 349
>gi|114145453|ref|NP_001041315.1| THO complex 1 [Rattus norvegicus]
gi|37361826|gb|AAQ91026.1| LRRGT00070 [Rattus norvegicus]
Length = 499
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 101/247 (40%), Positives = 140/247 (56%), Gaps = 35/247 (14%)
Query: 62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN 121
ISL + + +C ++ P +LL D D LD+C+ +F +VE NV WK
Sbjct: 31 ISLAIGGVTESVCTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNVATWKSN--------- 81
Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
+ D RL L+ + ++ L +++E+ ++ +E M
Sbjct: 82 ---LSGDSYIRL---------------LSACTSWRQQYCLGLVAEYGLDPQHTEDREEGM 123
Query: 182 DVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAA 241
DV E + + + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA
Sbjct: 124 DVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAV 183
Query: 242 FKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQ 293
FKSYKLDD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQ
Sbjct: 184 FKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQ 243
Query: 294 YFTSTVK 300
Y VK
Sbjct: 244 YLKGQVK 250
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/87 (49%), Positives = 57/87 (65%)
Query: 397 NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHF 456
NL + R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHF
Sbjct: 290 NLTVYFTMTREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHF 349
Query: 457 FLNATPNVEKNSEFIENMVKRCVKEKP 483
F + E++ENMV + KE P
Sbjct: 350 FQPTNQQFKSLPEYLENMVIKLAKELP 376
>gi|348686117|gb|EGZ25932.1| hypothetical protein PHYSODRAFT_540491 [Phytophthora sojae]
Length = 569
Score = 185 bits (469), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 135/430 (31%), Positives = 206/430 (47%), Gaps = 66/430 (15%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
+ + LC+ + ++ P +L D + T+ CE ++ +E + + T +
Sbjct: 82 VPALLDLCIAGVTSNFLINSTPYKVLEDLMEGQTISTCEMVWELLESRKD---KLTTGRT 138
Query: 118 CKNNL--LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF 175
K +L LRMCN LLRRLS++ N+VFCG+IL+FL+ F SERS +N+ + NV NI
Sbjct: 139 TKASLCLLRMCNALLRRLSKTHNSVFCGKILVFLSFTFALSERSAVNLTGKANVTNID-- 196
Query: 176 GGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFR-NPVQCYNKVSWKMFTSY 234
D DV ID+N Y+ FW LQ +FR + + ++ +W+ F +
Sbjct: 197 -NDPSADVGP---------------IDYNLYRTFWDLQSFFREHQMATHSAENWEKFFTE 240
Query: 235 AETVLAAFK--SYKLDDVQ---------------------SSLNP----SGDYF--AKYL 265
VLAAF+ ++ DD++ SS NP S ++F KYL
Sbjct: 241 LNVVLAAFEGNAFSPDDLERSRDLSGGGPIHSADVIMHEDSSTNPAEQGSQEHFFQPKYL 300
Query: 266 TNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYS 325
TN +L LQL D R +L QFLILF G + + K+ D TE V +
Sbjct: 301 TNSRLFRLQLRDPILRECMLTQFLILFNDLARAKPPAGS----TTPKAKFA-DLTERVVA 355
Query: 326 LIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK- 384
L+KQTP DGE FS++V +L+ E +W +WK E C E ++ ++ E +P K+ +
Sbjct: 356 LLKQTPSDGEGFSEMVSYVLERERNWVKWKQEKCREYEKYPSAADKEGSAPKPVVKRARR 415
Query: 385 ---TPELTKLWNSKDN----LEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
+P L ++ + LE K ER L +Y E + DP +EE+Y
Sbjct: 416 QLTSPLLEQILSESSKPSQILEKIKGKERASEVPLSAYTERFKEAWDPENGIEEEYWPDK 475
Query: 438 DSNYAWRALR 447
D WR +R
Sbjct: 476 DEMVCWRTMR 485
>gi|428175960|gb|EKX44847.1| hypothetical protein GUITHDRAFT_139464 [Guillardia theta CCMP2712]
Length = 662
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 148/550 (26%), Positives = 246/550 (44%), Gaps = 98/550 (17%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEV------------- 104
+ + L + M +M LP++ + + F+ T ++C + F Y+E
Sbjct: 94 VVGLLDLSMHAAMSNMVPRQLPLMAMHELFEGQTAEQCSKAFSYLETVDHTRLKSSAELV 153
Query: 105 ----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSG 160
+ +Q + LLR CN L++RLS++ + + CGR+L+FLA P SE+SG
Sbjct: 154 QRVEKMRDLTEQGEHKYSQAALLRCCNTLMQRLSKTCDLLLCGRVLMFLAHVLPLSEKSG 213
Query: 161 LNIISEFNVENITEFGGD-EEMDVSSNETEETDTEEV----------DKVKIDFNFYKKF 209
+N+ EFN N+TEF + E++D + E E+ D ++ ++++ F+ YK F
Sbjct: 214 VNLKGEFNKANVTEFQENLEDIDYTKMEDEDKDESKIYLKSTDEKLDGQMEVSFSLYKTF 273
Query: 210 WSLQDYFRNPVQCYNKVSW--KMFTSYAETVLAAFKSYKLDDVQS-SLNPSGD---YFAK 263
W LQ Y +P ++ M VL F SY L D + ++ S D YF
Sbjct: 274 WGLQKYLHDPSALFSGPELVDSMIQGMT-LVLDCFSSYSLSDTEGIAVGHSADREFYFPA 332
Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYF-------TSTVKSRGEGL-ELKSDQEKW 315
+LT+ KL+ L+L D FRR VL+Q+LI+ QY T+ GL E Q++
Sbjct: 333 FLTSTKLVALELQDPIFRRQVLVQYLIVLQYLLDRAVHPAKTLDVLRTGLSEFTKKQKED 392
Query: 316 VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKK 375
+ +T + L+ PP G+ F + ++ IL+ E +W +WK + + + D
Sbjct: 393 LNKLMDTTFELLGNIPPSGKEFVETLRKILQREANWTRWKKSKVADGEDVAIKVGD---- 448
Query: 376 DEPDAKKKKTPELTKLW-------NSKDN-----------LEACKSAERDFTPSLE---- 413
K+K LW +KDN LE CK ER S E
Sbjct: 449 -----KRKSMENAGALWKRRFFTMKTKDNFVPASFENGSLLEICKPKERKSNSSAEDIAN 503
Query: 414 -------SYFEE--AIQQM--DPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
+ EE AI +M DP +E++++K N + W LR+L+ + F AT
Sbjct: 504 GLGNPNSTKLEERLAIVRMEADPENYIEQEFRKNNVPCWRWVTLRMLASYDINMFDKATA 563
Query: 463 NVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAK 522
++++E +V + + G G +D A + K E +E E++E + +
Sbjct: 564 ----ENKYLEGVV---------NHMDGKGKETHKDKAPAGDEAKEGEQKEGEQKEADQKE 610
Query: 523 ADPEGDADEV 532
+A EV
Sbjct: 611 GGEVKEAGEV 620
>gi|301111868|ref|XP_002905013.1| THO complex subunit, putative [Phytophthora infestans T30-4]
gi|262095343|gb|EEY53395.1| THO complex subunit, putative [Phytophthora infestans T30-4]
Length = 595
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 132/433 (30%), Positives = 203/433 (46%), Gaps = 69/433 (15%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
+ + LC+ ++ P +L D + T++ CE+++ +E + + T +
Sbjct: 82 VPALLDLCIAGVTSGFLVNSTPYKVLEDLMEGQTINTCEKVWDLLESRKD---KLTTGRT 138
Query: 118 CKNNL--LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF 175
K +L LRMCN LLRRLS++ N+VFCG+IL+FL+ F SERS +N+ + NV NI
Sbjct: 139 TKASLCLLRMCNALLRRLSKTHNSVFCGKILVFLSFTFALSERSAVNLTGKANVTNI--- 195
Query: 176 GGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFR-NPVQCYNKVSWKMFTSY 234
D DV ID+N Y+ FW LQ +FR + + + +W+ F +
Sbjct: 196 DSDPSADVGP---------------IDYNLYRTFWDLQGFFREHQMATQSAENWEKFFTE 240
Query: 235 AETVLAAFK--SYKLDDVQSSLNPS---------------GD--------------YFAK 263
VLAAF+ ++ DD++ S + + GD + K
Sbjct: 241 LNVVLAAFEGNAFSPDDLERSRDLTEGPIHSADIAMHESDGDSDKNATDHNTQEHFFQPK 300
Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETV 323
YLTN +L LQL D R +L QFLILF G + + + + TE V
Sbjct: 301 YLTNSRLFRLQLRDPILRECMLTQFLILFNDLARAKPPLG-----STTPKAKLAELTERV 355
Query: 324 YSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTSITDEDKKDEPDAKK 382
+L+KQTP DGE FS++V +L+ E +W +WK E CP +R P D +P K+
Sbjct: 356 VALLKQTPSDGEGFSEMVTYVLERERNWVKWKQEKCPGYERYPSAKEKDASTASKPVVKR 415
Query: 383 KK----TPELTKLWNSKDN----LEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYK 434
+ +P L ++ + LE K ER SL +Y + + DP +EE+Y
Sbjct: 416 ARRQLTSPLLEQILSESSKPSQILEKIKGKERATEVSLATYTDRFKEAWDPENGIEEEYW 475
Query: 435 KVNDSNYAWRALR 447
D WR +R
Sbjct: 476 PDKDEMVCWRTMR 488
>gi|255936127|ref|XP_002559090.1| Pc13g06570 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211583710|emb|CAP91726.1| Pc13g06570 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 706
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 163/598 (27%), Positives = 278/598 (46%), Gaps = 122/598 (20%)
Query: 32 KKSALEQSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFD 88
K +A+E + R+ +++ + ID I N + + + C L L+ + D
Sbjct: 63 KLAAVEIAFREKFYRVLASTSIDDPAFIQIWNLLDIVSIFSDNEQCEPGLIFWLIEELLD 122
Query: 89 MSTLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
T+ C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCG
Sbjct: 123 SQTIGGCRKVFDYLESRRERNTKKHFKQKSLV------ILRSCNELLRRLSRAEDTVFCG 176
Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEF-------GGDEEMDVSSNET-------- 188
R+ ++L + FP ++S +N+ EF+ EN+T F +E D + E
Sbjct: 177 RVFIYLFQSFPLGDKSSVNLRGEFHSENVTTFDEIVKESSAKQEQDTTMTEGAEPSAPGE 236
Query: 189 -----EETDTEEVDKV--------------KIDFN-FYKKFWSLQDYFRNPVQCYNKVSW 228
ET ++E KV ++D + Y FWSLQ YF P ++ +
Sbjct: 237 AAAGQAETTSQETSKVPKVVVSTGDEKKSDEVDLDALYPLFWSLQAYFSAPSNTFDAERF 296
Query: 229 KMFTSYAETVLAAFKSYKLD--------------------------DVQSSLNPSGDYFA 262
F + E LAAFK+ + +V +S NP
Sbjct: 297 TSFKTGMEATLAAFKNINTEMENESSARASEDARKSNKRKRIADGSEVATSFNP------ 350
Query: 263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKSRGE---------GLELKS 310
KYLT++ L DL++SDT FRR+VL+Q LIL + TS KSR G L
Sbjct: 351 KYLTSRDLFDLEISDTAFRRHVLVQALILLDFVLSLTSKSKSRLANTTNKSVLYGFTLNE 410
Query: 311 DQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS-- 368
+ KW +++ ++Q P G+ + ++V +L +++W +WK EGCP ++RP S
Sbjct: 411 EDTKWATSMRKSIEGYLQQG-PGGKFYYRMVDTVLSRDKNWVRWKAEGCPLIERPAISAA 469
Query: 369 --ITDEDKKDEPDAKKKKTP------ELTKLWNSK--DNLEACKSAERDFTPSLESYFEE 418
T + + A K+ P +L L +++ N+E K ++R PS +S+ +
Sbjct: 470 EYTTSREHATKAYANKRLRPSPMGSLDLKFLADTEALANIERLKESDRYSVPSADSFLKG 529
Query: 419 AI-QQMDPAAAVEEQYKKVNDSNYA---WRALRLLSRKCPHFFLNATPNVEKNSEFIENM 474
+ +MD A+ ++ K + + A WR LRL ++ F E N+ ++ +
Sbjct: 530 IMDDEMDIDMAMTDEDKAIAEGAKASKSWRILRLAAKGKLAAFDKIG---EGNN--LQAL 584
Query: 475 VKRCVKEKPSSQISGNGNGVD----QDPAE-VEVDTKSEEIQEEEKEEDWEAKADPEG 527
+ + ++++S +G+ D D A+ + V TK E +++EE+ E EA A+PEG
Sbjct: 585 FESATATEDNNELSPSGDTKDISQSLDGAQGLPVKTK-EPVEDEERAEVTEA-AEPEG 640
>gi|255087913|ref|XP_002505879.1| predicted protein [Micromonas sp. RCC299]
gi|226521150|gb|ACO67137.1| predicted protein [Micromonas sp. RCC299]
Length = 522
Score = 179 bits (453), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 132/396 (33%), Positives = 192/396 (48%), Gaps = 48/396 (12%)
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
T+ CE +F +VE + K +LR CN+L RRLS++ +TV GR+LL L+
Sbjct: 34 TIADCEDVFTWVEARRGRLSADALWRRGKLVMLRTCNELQRRLSKTADTVLRGRVLLLLS 93
Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFW 210
+P SERS LN+ +N N TE +E+D + E + + + + V +D FY+ FW
Sbjct: 94 SLYPLSERSALNLGGNYNAGNTTEL---DELDADTAEATDANA-KAEGVVVDRAFYETFW 149
Query: 211 SLQDYFRNPVQCYNKVS-WKMFTSYAETVLAAFKSYKLDDVQSSL--------NPSG--- 258
SLQ +F NP + WK F VL F++++LD + S PSG
Sbjct: 150 SLQKFFSNPPAALEPAAGWKTFCVSLRLVLDNFETHRLDKMSSGGVTPGVKPSVPSGATY 209
Query: 259 -----DYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQE 313
D KYLT LL LQL D FRR+ L+Q + + TS + +G++
Sbjct: 210 VEAGIDAGTKYLTAAPLLHLQLRDPAFRRHFLVQCAVFLGFCTSPLY---KGVDAAD--- 263
Query: 314 KWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE------------ 361
+D + V I+QTPP GE F+ V L L+ EE W WK + C +
Sbjct: 264 --ARDAKKRVMEAIRQTPPHGEEFAAAVALALRREEGWVLWKRDNCKDFEREPEAPRPPP 321
Query: 362 -----LKRPLTSITDEDKKDEPDAKKKKTPELTKLWN-SKDNLEACKSAERDFTPSLESY 415
++R L + PEL +LWN S+DN+ A +RD P+ E Y
Sbjct: 322 PAPPVMRRRLRPGAAAPAVPPEKRVRLGNPELDRLWNLSEDNVSALGD-KRDAAPTAEVY 380
Query: 416 FEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
+ + MDP A +EE YK ND Y W+ALRL+++
Sbjct: 381 LQNVVDDMDPEAQIEETYKSKNDKTYVWKALRLIAK 416
>gi|328770843|gb|EGF80884.1| hypothetical protein BATDEDRAFT_24368 [Batrachochytrium
dendrobatidis JAM81]
Length = 793
Score = 178 bits (451), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 130/475 (27%), Positives = 215/475 (45%), Gaps = 86/475 (18%)
Query: 57 VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
++++ + + + + + L+ D D T+D + LF Y++ V I T M
Sbjct: 147 IVQDLWDIAIIAGQQGFTDDVIAFRLIHDFLDFMTVDCTDSLFSYLDSRVEI---MTLGM 203
Query: 117 SCKNN----LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSG------------ 160
+ LLR CND LRR+S+S++ V CG++L ++ +P SERSG
Sbjct: 204 QSERGRGLFLLRFCNDFLRRMSKSKHNVVCGKLLTLVSNTYPLSERSGKLLIWIFCPFDR 263
Query: 161 LNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPV 220
+N+ E+NV N+T F E +SNE ++ +D +K FY FWSLQ+YF NP
Sbjct: 264 VNVRGEYNVGNVTVF----EPASNSNEMDDNKEPSLDSLKTKAEFYHCFWSLQEYFSNPT 319
Query: 221 QCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ--SSLNPSGD------------------- 259
++ + F + E V+ F DD+Q S N D
Sbjct: 320 LLFDISHFTKFQTGVEKVVKIF-----DDIQKESDQNTGSDRERRKVGEKKHRVPTIADA 374
Query: 260 ----YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGL--------- 306
+F K+LT+ L DL+L+D FRR +LLQ +I+FQ+ +S V E +
Sbjct: 375 RNDYFFPKFLTSTNLFDLELNDPFFRRQILLQIMIVFQFLSSLVSMEKEKIVKQFSEIAQ 434
Query: 307 ----------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN 356
L +DQE WV + + ++++ P+G+ FS+ + +L E +W +WK
Sbjct: 435 SINRAIQYAYVLTADQESWVSECRSRSFKVLERISPNGKKFSKTLLTVLMHERNWLKWKL 494
Query: 357 EGCPELKRPLTSI--TDEDKKDEPDAKKKKT----PELTKLWNSKDNLEAC--KSAERDF 408
C ++P+ + T + +K E A K+ E+T LW +L+ A R
Sbjct: 495 GSCSSYEKPIIELGTTKKRRKAEHSAAANKSWLGNQEMTNLWERGRDLKQVLRDKASRSI 554
Query: 409 TPSLESYFEEAIQQMDP-----AAAVEE-QYKKVNDSNYAWRALRLLSRKCPHFF 457
SL+ + +E Q++ A +E+ Y N+ Y W A R + + F
Sbjct: 555 VTSLDEHLKELDYQLEEDCETLAEGIEDPSYVHYNNMRYNWIAYRTAIKSHAYLF 609
>gi|345569656|gb|EGX52521.1| hypothetical protein AOL_s00043g15 [Arthrobotrys oligospora ATCC
24927]
Length = 743
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 143/484 (29%), Positives = 225/484 (46%), Gaps = 63/484 (13%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKV-----IENYISLCVELCMKDMCNSTLPIILLSD 85
D KS +E + R+ L ++ D++ + + + L + L ++ + LP L+ +
Sbjct: 207 DVKSCMECASRELLYDIVTDFDLEFSLEPFIQLYQFFDLILTLSELEISDPQLPFALIEE 266
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN-LLRMCNDLLRRLSRSQNTVFCGR 144
T D +D C+++F Y+E + K LLR+CN+LLRRLS+S++TVFCGR
Sbjct: 267 TLDTQPIDNCKRIFSYLEARIERLTVGVDGTKGKGIILLRLCNELLRRLSKSEDTVFCGR 326
Query: 145 ILLFLAKFFPFSERSGLNIISEFNVENITEF-------GGDEEMDVSSNETEETDTEEVD 197
I +FL K FP SERSG+N+ EF+VEN T F G++ S + + +E
Sbjct: 327 IFVFLTKSFPLSERSGVNLRGEFHVENKTTFDERWSQLAGNQPDHFSPSLRDNDSSEAAL 386
Query: 198 KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKS----------YKL 247
Y WS Q F P++ + K F +TV+ AF+S
Sbjct: 387 AADACNRLYAVLWSTQHEFAEPIRLFQKDILDNFKESLDTVIKAFRSSIDEYGHNAPVVA 446
Query: 248 DDVQSSLNPSGD----------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS 297
D + S D Y KYLT+++L DL++ D FRR++L+QFLI+ ++ S
Sbjct: 447 SDTRRSHKRKWDGEVKHDELDSYNPKYLTSRELFDLEVRDLVFRRHILVQFLIVIEFLLS 506
Query: 298 TVKSRGEGLE-------------LKSDQEKWVKDT-TETVYSLIKQTPPDGEHFSQVVKL 343
+ LE L + +KW + + V +L ++ DG FS+ V
Sbjct: 507 LSPKYKKYLEAEAKNRSVQFAYTLSEEDDKWAEAAKKDVVNALNYKSGMDGLLFSRTVDS 566
Query: 344 ILKGEEHWNQWKNEGCP--ELKRPLTSITDEDKKDEPDA--------KKKKTPELTKLW- 392
+L E++W +WK E C E++ T K +A K P LT++W
Sbjct: 567 VLTREKNWVKWKAENCVSFEMEPASTEHVAVSKSAGHEAATLINNLGKVVGAPALTRVWF 626
Query: 393 -NSKDNLEACKSAERDFTPSLESYFEEA-IQQMDPAAAVEEQYKKVND---SNYAWRALR 447
NS + +A++ P+ SY E + +D A EQ K+ D SN WR+LR
Sbjct: 627 DNSLEVTSIRSNAQKINLPAYRSYTESIEVYHLDKDFASTEQEKEDIDESISNVTWRSLR 686
Query: 448 LLSR 451
+ SR
Sbjct: 687 IASR 690
>gi|296412446|ref|XP_002835935.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295629732|emb|CAZ80092.1| unnamed protein product [Tuber melanosporum]
Length = 684
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 150/514 (29%), Positives = 234/514 (45%), Gaps = 117/514 (22%)
Query: 55 IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTF 114
+ V N++ L + L ++ + LP+ L+ + FD T+ CE LF Y+E ++
Sbjct: 90 LPVFCNFLDLSLVLAELEISGAGLPLDLIEELFDSQTIPSCECLFQYLESRIDRVTAGVE 149
Query: 115 FMSCKN-NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENIT 173
K LLR+CN+LL+RLS++++T+FCGRIL+FL+K FP ERSG+N+ +FNV+N+T
Sbjct: 150 GGKGKALILLRLCNELLKRLSKTEDTIFCGRILIFLSKSFPIGERSGVNLRGDFNVDNVT 209
Query: 174 -------------------------------EFG--GDEEMDVSSNETEET-------DT 193
E G G E D+ S + + DT
Sbjct: 210 VYDDILQEPPTVVDGMELDIKKEEMEVDQKKEGGSTGAGEADIGSARSPKRLKVEAGQDT 269
Query: 194 EEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQS 252
+ + +D N Y FWSLQ F +P + + K + + F +A FK + D +++
Sbjct: 270 SQKEPTVLDANTLYPIFWSLQHDFADPQRLFVKDNLEKFKVGLGATMAKFKKAEDDSIKT 329
Query: 253 SL--------------NPS------------------GDYF-AKYLTNQKLLDLQLSDTN 279
+ P G+ F KYLT+++L +L++SD
Sbjct: 330 TGKVDVANGPKKADEKKPGVSLGEKRKQIDGDDDDIKGEGFNPKYLTSRELFELEISDLT 389
Query: 280 FRRYVLLQFLILFQYFTS-TVKSRGEGLELKS-------------DQEKWVKDTTETVYS 325
FRR+VL+Q LIL + S T +++G+ +LK+ + EKW T E +
Sbjct: 390 FRRHVLVQALILIDFLLSLTPQAKGKWKDLKTPNRSVQYAFTLGLEDEKWAATTKEQI-- 447
Query: 326 LIKQTPPD--GEHFSQVVKLILKGEEHWNQWKNEGC-PELKRPLTSITDEDKKDE----- 377
++ PD G F++ +K +L E++W +WK E C P PL ED K +
Sbjct: 448 -LRSLEPDVQGRLFTRTIKTVLIREQNWVRWKAESCHPFDMPPLGDADIEDAKQKAVKAC 506
Query: 378 --PDAKK--KKTPELTKLWNSKDN---LEACKSAERDFTPSLESYFE-------EAIQQM 423
P++ K TP L LW + + LE R PS ES+ + E + M
Sbjct: 507 EIPESYKYTMGTPTLNLLWQATGDLTGLEGLTDPTRHNLPSPESFRQPVKDAELELLDAM 566
Query: 424 DPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
P Q KK ++ W+ALRL SR H F
Sbjct: 567 FPDDIELAQTKKFTNT---WKALRLASRDRFHLF 597
>gi|384497532|gb|EIE88023.1| hypothetical protein RO3G_12734 [Rhizopus delemar RA 99-880]
Length = 809
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 137/490 (27%), Positives = 223/490 (45%), Gaps = 78/490 (15%)
Query: 32 KKSALEQSIRQYLLKLIKTPDID--IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDM 89
+KSA E + R+ L++L+ + + K I + + + D+ + +P+ + FD+
Sbjct: 50 RKSATELAFRRALIRLVDGEETEERFKRIFDCCDIVLYCSEIDLLDQVIPLSFTEELFDL 109
Query: 90 STLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRLSRSQNTVFCGRILLF 148
T + CE++F YVE K LLRMCN+LLRRLS+ NTVFCGRIL+F
Sbjct: 110 HTTEGCEKIFDYVEQRKTRLTANMVPGKGKGLVLLRMCNELLRRLSKEINTVFCGRILMF 169
Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKK 208
LA FP ERSG+N+ +FN N F DEE+D T+E FYK
Sbjct: 170 LANSFPLGERSGVNLRGDFNT-NAIHFDTDEEVDADPTLTDEQKA-----------FYKL 217
Query: 209 FWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSY----------KLDDVQSSLNPSG 258
FWS + YF NP + +++ ++ F+S + +V +
Sbjct: 218 FWSTRIYFANPPSVFTDNNFEDLQKGTNGIVEKFQSIAQKEAEILGARNKEVTGTKRSRT 277
Query: 259 DYF------AKYLTNQ--------------KLLDLQLSDTNFRRYVLLQFLILFQYFT-- 296
+Y A+ + NQ +LLDL++ D FRR V++Q+LILFQY +
Sbjct: 278 EYMDEDDSAAEEILNQINNEYQFPRLLSSRRLLDLEMEDARFRRSVIVQYLILFQYLSGF 337
Query: 297 ---------STVKSRGEGLE--------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQ 339
+ + +RG + L +Q +W+ +T E + L++ P G ++
Sbjct: 338 SQEEKERTAALLAARGTTKQSLVQPNYVLSDEQVQWIDETREILVGLLRSIKPHGNLYTD 397
Query: 340 VVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKK--------DEPDAKKK---KTPEL 388
++ IL E HW WK GCP ++P +I D K + P K + T E+
Sbjct: 398 IILTILAHERHWIIWKASGCPSFEKPPMNINDLQKAWRTKRPRLEAPPTKYRYTFGTYEI 457
Query: 389 TKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQM-DPAAAVEEQYKKVNDSNYAWRALR 447
+ L+ ++ + R PS+ + A+ ++ D +E++ N + +A R
Sbjct: 458 SSLYGKQNMSLSELMINRAKLPSIVEVIDNAVTELQDSLDPPKERFNYANGA--LLQASR 515
Query: 448 LLSRKCPHFF 457
L+ K PH
Sbjct: 516 LIHDKYPHLI 525
>gi|358056555|dbj|GAA97524.1| hypothetical protein E5Q_04202 [Mixia osmundae IAM 14324]
Length = 637
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 144/491 (29%), Positives = 224/491 (45%), Gaps = 91/491 (18%)
Query: 55 IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTF 114
+ + + + + + L K C+ +L I L D T+ E LF ++E I +
Sbjct: 86 LSALYDMLDVVLYLSQKGACDDSLAISLFEMITDAQTITVAESLFEFLESRREIIIKGLL 145
Query: 115 FMSCKN-NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENIT 173
K LLR CN LL+R S+ +TVFCGRIL+FLA FP ERSG+N+ S FN +N+T
Sbjct: 146 PGRGKGLILLRTCNALLKRASKFVHTVFCGRILIFLAIVFPLCERSGVNVASSFNTDNVT 205
Query: 174 EFGGDEEMDVS--------------------SNETEETDTEEVDKVKIDF---NFYKKFW 210
+E DV S++++ DT++ + + + Y++FW
Sbjct: 206 IL---DESDVGQPDTTARLADASLTGDALALSDDSKSLDTKQNNAAAGEAKMKDLYERFW 262
Query: 211 SLQDYFRNPVQCYNKVSWK---------------MFTSYAETVL--AAFKSYK------- 246
LQ F +P + VS T++AE + A+F S K
Sbjct: 263 RLQQRFCDPRTLFTNVSSAGSKSNLDILKEDIELTLTTFAEAIAKEASFGSKKTRRDVRS 322
Query: 247 ---LDDVQSSLNPSGDY-FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS----- 297
+D +S +Y F KYLT + LL L+L+DT FRR +L+Q L++FQY S
Sbjct: 323 RQDMDQEESLAEVEANYFFPKYLTKRNLLPLELADTVFRRQILVQILVVFQYLLSFTVSE 382
Query: 298 --TVKSR---GEGLELKSD----QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
VK+R G+ L+L + +E W T V +++ T P G F + V LIL E
Sbjct: 383 RAKVKARAATGKPLKLPHNLNEAEEAWTVSTRIRVVEMLRSTGPSGVAFWRTVSLILTRE 442
Query: 349 EHWNQWKNEGCPELKRP----------LTSITDEDKKDEPDAKKKKTPELTKLWNSKDN- 397
E+W WK C + ++P + + + K P K L++LW SK
Sbjct: 443 ENWASWKLNNCVDFEQPPLADDAATVATSKLRERTKIARPYMHKVGNATLSRLWASKGAD 502
Query: 398 ----LEACKSAERD-FTP---SLESYFEEAIQ--QMDPAA-AVEEQYKKVNDSNYAWRAL 446
+ A K+ E D FT +++ EE + ++D AA A + K + ++WR L
Sbjct: 503 LANVVAADKAGELDRFTAEFGAMKRKAEETAENTELDAAAKAAQIDEAKEAQNAFSWRGL 562
Query: 447 RLLSRKCPHFF 457
RL++ F
Sbjct: 563 RLVASSNLSLF 573
>gi|378725794|gb|EHY52253.1| hypothetical protein HMPREF1120_00467 [Exophiala dermatitidis
NIH/UT8656]
Length = 709
Score = 171 bits (434), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 160/618 (25%), Positives = 271/618 (43%), Gaps = 127/618 (20%)
Query: 34 SALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+ALE R+ + ++ T I + + L D+C L L+ + D
Sbjct: 69 AALETVFREIFVHIVATVAISEPAFSQVWVLLDIITVLSDNDLCEPGLGFWLIEELLDSQ 128
Query: 91 TLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y++ + +K ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 129 TIDGCRKVFDYLDSRRERITAKNFKHKSLV------ILRCCNELLRRLSRAEDTVFCGRV 182
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE----------EMDV------------ 183
+FL + FP ++S +N+ EF+VEN+T F E+D+
Sbjct: 183 FIFLFQSFPLGDKSSVNLRGEFHVENVTTFDPAPRKSEDAIKPMELDMDARTPQATTSGA 242
Query: 184 ----SSNETEETDT--------------EEVDKVKIDFNF---YKKFWSLQDYFRNPVQC 222
SS++ +E D+ E D+ + + Y KFWSLQ F +P +
Sbjct: 243 RTPASSSQIQEQDSTGRGTPVAGKAIKSEVKDQTQPPPDLDTLYPKFWSLQTLFSSPTKL 302
Query: 223 YNKVSWKMFTSYAETVLAAFKSY-------------------KLDDVQSSLN---PSGDY 260
+ + M L FKS K D++S N P+ +
Sbjct: 303 FEPSNMAMLKEGLALTLLCFKSVSHGSAPSTAPSDVRRGLKRKHSDIESDSNASLPASTF 362
Query: 261 FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYF---TSTVKSRGEGLELKS------- 310
KYLTN+ L DL++ D +FRR++L+Q LI+ + T K++ EGL KS
Sbjct: 363 NPKYLTNRDLFDLEIHDIDFRRHILVQALIMIDFLLSMTPAAKAKFEGLTNKSVLYPFTL 422
Query: 311 --DQEKWVKDTTETVYSLIKQ-TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR--- 364
+ KW + T + S ++Q +G+ + ++V +L +++W +WK E CP + R
Sbjct: 423 SEEDTKWAQSTRAAIASYLQQGNGNEGKFYYRMVDTVLSRDKNWVRWKAENCPSISRDPV 482
Query: 365 ----------PLTSITDEDKKDEPDAKKKKTPELTKLWNSKDNLEACK-SAERDFTPSLE 413
LT +TD K P+ +LT L + + LEA K + R P++E
Sbjct: 483 PPHAYLEAKDTLTKLTDAHKAPVPNPPGAG--DLTFL-SKVEPLEALKRPSTRHKLPTVE 539
Query: 414 SYFEEA-IQQMDPAAAVEEQYKKVNDSNYA---WRALRLLSRKCP-----HFFLNATPNV 464
Y+ +D AV E K+ + A WRALR R+ + N V
Sbjct: 540 EYYRGIETDDLDMDFAVTEDEKREIEERKAGKLWRALRASDRRFALCERVQYGGNLKALV 599
Query: 465 EKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSE-------EIQEEEKEE 517
E+ +++ ++ +K + + G G + AE E D K + ++++EE+E+
Sbjct: 600 EEGPN--QDVEEKSGADKEARENENGGEGAEAVKAEAETDEKPDIEIEDQPQVKKEEQEQ 657
Query: 518 DWEAKADPEGDADEVMSV 535
A +G+ E+ V
Sbjct: 658 PSTTAATAQGEDAEMTDV 675
>gi|307102917|gb|EFN51183.1| hypothetical protein CHLNCDRAFT_55304 [Chlorella variabilis]
Length = 625
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 132/421 (31%), Positives = 195/421 (46%), Gaps = 55/421 (13%)
Query: 78 LPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS-CKNNLLRMCNDLLRRLSRS 136
LP LL + +++ CE +F +++ +K F + CKN LLR CN LL+RLS+S
Sbjct: 118 LPAQLLEQITEGASVQDCEGVFAFLDSRTATFKSPAFQEARCKNTLLRTCNMLLKRLSKS 177
Query: 137 QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEV 196
N + CGRILLFLAK P +ERSG+N+ FN +N T + +E T+
Sbjct: 178 ANAMLCGRILLFLAKLLPLTERSGVNLGGAFNTDNSTPV----------EDVQEGATDSE 227
Query: 197 DKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-----DVQ 251
K +D FY+ FW LQ YF NP W + VL F+ K+
Sbjct: 228 GK-PVDAAFYRTFWGLQSYFSNPPAALQVGKWGEVSRDIRRVLEKFRLEKVTVGESAAAG 286
Query: 252 SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSD 311
KYL++ +L+ LQL D FRR+ LLQ LIL Q G L+L++
Sbjct: 287 PGAGDGQGSSVKYLSSSRLMGLQLRDATFRRHFLLQCLILMQASLLLHMQLGS-LDLRAK 345
Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP--ELKR----P 365
VY+ ++ TP G F+ ++ +++ E+ W WK CP L+R P
Sbjct: 346 -----------VYAELEATPDRGPQFAAAIRHLVRWEDSWAAWKQGSCPPAPLERAAAQP 394
Query: 366 LTSITDEDKKDEPDAKKKK------------TPELTKLWN-SKDNLEACKSAERDFTPSL 412
D D P K++K T EL +LWN ++DNL A + +R ++
Sbjct: 395 PAGSEDADLT-APAPKRRKLGSDAVFGIRVGTDELDRLWNLAEDNLSALSADDRGGFKTV 453
Query: 413 ESYFEEAIQQMDPAAAVEEQ---YKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSE 469
E +++M AAA E+ K Y+W+ALR ++R F A V+K +
Sbjct: 454 RQLMEPVVEEMREAAAGEDDGLLMKASRSKLYSWKALRAVARTNLQAFAAA---VQKGGD 510
Query: 470 F 470
Sbjct: 511 L 511
>gi|169784203|ref|XP_001826563.1| THO complex subunit Tho1 [Aspergillus oryzae RIB40]
gi|83775308|dbj|BAE65430.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 669
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 141/526 (26%), Positives = 225/526 (42%), Gaps = 121/526 (23%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ L+ T ID I N + + + C L L+ + D
Sbjct: 66 AAVEIAFRERFYSLLATTSIDEPSFIQIWNLLDIISIFSDNEQCEPGLIFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVS-------------------SN 186
+FL + FP ++S +N+ E++ EN+T F E+D S
Sbjct: 180 FIFLFQSFPLGDKSAVNLRGEYHTENVTTFDEIPELDTSVLDEADVVMSDEQGPPTTTEG 239
Query: 187 ETEETDT--------------------------EEVDKVKIDFN-FYKKFWSLQDYFRNP 219
+ E DT E VD+ D N Y FW LQ YF P
Sbjct: 240 QQENNDTKAPSIADPQAPVQDQPAAPKVIISQEEGVDEKANDLNKLYPTFWGLQAYFSAP 299
Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DVQSS 253
+ ++ + F + E L+AFKS D ++ SS
Sbjct: 300 TRIFDPQHFATFRTGLEATLSAFKSVNTDLESSGTSKTSEELRKSNKRKRTSDGQEIASS 359
Query: 254 LNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE-------- 304
NP KYLT++ L DL+++DT FRR+VL+Q LIL + S T K++ +
Sbjct: 360 FNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFMLSLTPKAKAKLAELTNKS 413
Query: 305 ---GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
G L D KW + + + Q G+ + ++V +L +++W +WK EGCP
Sbjct: 414 VLYGFVLNEDDAKWAVKMRKAIEEYL-QEGVGGKFYYRMVDTVLSRDKNWVRWKAEGCPL 472
Query: 362 LKRPLTSITDEDKKDEPDA-----KKKKTPELTKL-------WNSKDNLEACKSAERDFT 409
++RP S++D E K+ ++ + L S LE K +R
Sbjct: 473 IERPPVSVSDYLGAREHATKVYANKRLRSSPMGSLNLNFLSEGESLAGLERLKEPQRFAV 532
Query: 410 PSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
PS +S+ + MD A E++ ++ WR LRL ++
Sbjct: 533 PSSDSFMMGIMDDELDMDTAQTKEDKENATRAKASKTWRVLRLSAK 578
>gi|391868537|gb|EIT77751.1| nuclear matrix protein [Aspergillus oryzae 3.042]
Length = 670
Score = 168 bits (426), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 141/526 (26%), Positives = 225/526 (42%), Gaps = 121/526 (23%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ L+ T ID I N + + + C L L+ + D
Sbjct: 66 AAVEIAFRERFYSLLATTSIDEPSFIQIWNLLDIISIFSDNEQCEPGLIFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVS-------------------SN 186
+FL + FP ++S +N+ E++ EN+T F E+D S
Sbjct: 180 FIFLFQSFPLGDKSAVNLRGEYHTENVTTFDEIPELDTSVLDEADVVMSDEQGPPTTTEG 239
Query: 187 ETEETDT--------------------------EEVDKVKIDFN-FYKKFWSLQDYFRNP 219
+ E DT E VD+ D N Y FW LQ YF P
Sbjct: 240 QQENNDTKAPSIADPQAPVQDQPAAPKVIISQEEGVDEKANDLNKLYPTFWGLQAYFSAP 299
Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DVQSS 253
+ ++ + F + E L+AFKS D ++ SS
Sbjct: 300 TRIFDPQHFATFRTGLEATLSAFKSVNTDLESSGTSKTSEELRKSNKRKRTSDGQEIASS 359
Query: 254 LNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE-------- 304
NP KYLT++ L DL+++DT FRR+VL+Q LIL + S T K++ +
Sbjct: 360 FNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFMLSLTPKAKAKLAELTNKS 413
Query: 305 ---GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
G L D KW + + + Q G+ + ++V +L +++W +WK EGCP
Sbjct: 414 VLYGFVLNEDDAKWAVKMRKAIEEYL-QEGVGGKFYYRMVDTVLSRDKNWVRWKAEGCPL 472
Query: 362 LKRPLTSITDEDKKDEPDA-----KKKKTPELTKL-------WNSKDNLEACKSAERDFT 409
++RP S++D E K+ ++ + L S LE K +R
Sbjct: 473 IERPPVSVSDYLGAREHATKVYANKRLRSSPMGSLNLNFLSEGESLAGLERLKEPQRFAV 532
Query: 410 PSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
PS +S+ + MD A E++ ++ WR LRL ++
Sbjct: 533 PSSDSFMMGIMDDELDMDTAQTKEDKENATRAKASKTWRVLRLSAK 578
>gi|425777812|gb|EKV15968.1| THO complex subunit Tho1, putative [Penicillium digitatum PHI26]
gi|425782580|gb|EKV20479.1| THO complex subunit Tho1, putative [Penicillium digitatum Pd1]
Length = 691
Score = 168 bits (426), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 145/541 (26%), Positives = 243/541 (44%), Gaps = 111/541 (20%)
Query: 6 QLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKV---IENYI 62
+ +F +S V S+ K K +A+E + R+ +++ + ID I N +
Sbjct: 38 ETHFGESIWQVPSEDEKATKHLSPQTKLAAVEIAFREKFYRVLASTFIDDPAFIQIWNLL 97
Query: 63 SLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNV-----NIWKQQTFFMS 117
+ + C L L+ + D T+D C ++F Y+E +KQ++
Sbjct: 98 DIVSIFSDNEQCEPGLIFWLIEELLDSQTIDGCRKVFDYLESRRERNTKKHFKQKSLV-- 155
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
+LR CN+LLRRLSR+++TVFCGR+ ++L + FP ++S +N+ EF+ EN+T F
Sbjct: 156 ----ILRSCNELLRRLSRAEDTVFCGRVFIYLFQSFPLGDKSSVNLRGEFHSENVTTF-- 209
Query: 178 DEEMDVSSNETEETDTEEVDKV--------------------------------KIDFN- 204
+E+ S E +E DT D ++D N
Sbjct: 210 -DEIAKDSAEKKEQDTTMTDGAEPTTPGVTAAGQAENSKVPKVVVSANGEKKSDEVDLNS 268
Query: 205 FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD---------------- 248
Y FWSLQ YF P + ++ + F + E L AFKS +
Sbjct: 269 LYPLFWSLQAYFSAPSKMFDAERFTSFKTGMEATLDAFKSINTEMENESSARASEDARKS 328
Query: 249 ----------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQ---FLILFQYF 295
+V +S NP KYLT++ L DL++SDT FRR+VL+Q L
Sbjct: 329 NKRKRIADGTEVATSFNP------KYLTSRDLFDLEISDTAFRRHVLVQALILLDFLLSL 382
Query: 296 TSTVKSRGE---------GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILK 346
TS KSR G L + +W +++ ++Q P G+ + ++V +L
Sbjct: 383 TSKSKSRLADTTNKSVLYGFTLNEEDTRWATSMRKSIEGYLQQG-PGGKFYYRMVDTVLS 441
Query: 347 GEEHWNQWKNEGCPELKRPLTS----ITDEDKKDEPDAKKKKTP------ELTKLWNSK- 395
+++W +WK EGCP ++RP S T + + A K+ P +L L +++
Sbjct: 442 RDKNWVRWKAEGCPLIERPAISAAEYTTSREHATKAYANKRLRPSPMGSLDLRFLADTEV 501
Query: 396 -DNLEACKSAERDFTPSLESYFEEAI-QQMDPAAAVEEQYKKVNDSNYA---WRALRLLS 450
N+E K ++R PS +S+ + + +MD A+ ++ K + + A WR LRL +
Sbjct: 502 LANIERLKESDRYSVPSADSFLKGIMDDEMDIDMAMTDEDKAIAEGAKASKSWRILRLAA 561
Query: 451 R 451
+
Sbjct: 562 K 562
>gi|320590242|gb|EFX02685.1| tho complex subunit [Grosmannia clavigera kw1407]
Length = 772
Score = 168 bits (425), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 135/474 (28%), Positives = 214/474 (45%), Gaps = 64/474 (13%)
Query: 32 KKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFD 88
K + +E + R LI ID + N + L C+ L L+ + D
Sbjct: 68 KNAIVETATRDVFGGLIAARSIDSPGFVRVWNLFDILSILADDGHCDPALLFWLIEELLD 127
Query: 89 MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
T+ C ++F Y+E +F S K +LR CNDLLRRLSR+++T FCGR+ +F
Sbjct: 128 SQTIAGCRKVFDYLESRRERITTNSF-NSKKLVILRSCNDLLRRLSRAEDTPFCGRVFIF 186
Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKV--------- 199
L + FP ++S +N+ E+++EN+T + D ++ ETE D +VD
Sbjct: 187 LFQVFPLGDKSSVNLRGEYHIENVTTYDVD-----AAKETEGGDKMDVDGAMDGSKPATD 241
Query: 200 --KIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK----------SYK 246
IDF Y FWSLQ F P + + F S E LAAF+ +
Sbjct: 242 PKNIDFEVLYPVFWSLQTSFSQPKTLFEPAQFAQFKSGLEASLAAFQKINNERRPHSTRH 301
Query: 247 LDDVQSSLNPSGD--------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTST 298
DDV+ D Y KYLT++ L L+ SD FRR+ LLQ LI+ ++ S
Sbjct: 302 TDDVRRGTKRKRDKEDELADTYNPKYLTSRDLFQLETSDLTFRRHFLLQALIIVEFLLSL 361
Query: 299 VKSRGEGLELKSDQEK---------------WVKDTTETVYSLIKQTPPDGEHFSQVVKL 343
S + L S K WV + I+ + DG +F ++V
Sbjct: 362 SPSAKKKLATASTPNKSVTYSDHVLSEEDTVWVTKMKHHITEHIRNSDDDGAYFFRMVDT 421
Query: 344 ILKGEEHWNQWKNEGCPELKRPLTS----ITDEDKKDEPDAKKKKTPELTKLWNSKDNLE 399
IL +++W +WK E C ++RP S ++ +D + A+ ++ ++ S D L
Sbjct: 422 ILARDKNWVRWKVESCQSIERPGVSAEEFVSAKDGAQKAFARHRRRNNMSMGSFSLDFLN 481
Query: 400 ACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKC 453
+ E D L+ A+ ++D + + E+ K ++ WRALR ++R+C
Sbjct: 482 DDED-EADALEKLKQPKRFALPEIDQSKMLAEERK----ASKTWRALR-IARQC 529
>gi|408397178|gb|EKJ76328.1| hypothetical protein FPSE_03583 [Fusarium pseudograminearum CS3096]
Length = 825
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 120/403 (29%), Positives = 192/403 (47%), Gaps = 77/403 (19%)
Query: 25 KPTCDYDKKSAL-EQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLP 79
KP D ++ A+ E + R LI T ID +KV N + + L C+ L
Sbjct: 69 KPANDKGQQFAIIETAARNIFSHLIATTTIDSPDYVKVW-NLLDILSILSDDGQCDPALL 127
Query: 80 IILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLS 134
L + D T+ C ++F ++E + N +KQ+ +LR CN+LLRRLS
Sbjct: 128 FWLAEELLDSQTIAGCRKIFDFLESRRERITANHFKQKQLI------ILRTCNELLRRLS 181
Query: 135 RSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------------GGDE- 179
R+++T FCGR+ +F+ + FP ++S +N+ E++VENIT + G DE
Sbjct: 182 RAEDTAFCGRVFIFMFQSFPLGDKSSVNLRGEYHVENITTYEQGTADDDSRMALDGPDEQ 241
Query: 180 --EMDVSSNETEETDTEEVDKVK--IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYA 235
E+ S + TD ++ DK K Y FWSLQ+YF P + + S F
Sbjct: 242 PKELSESKTTPKSTDAKKEDKEKPLPTDELYPLFWSLQEYFSQPKKLFETTSLSSFKESL 301
Query: 236 ETVLAAFKSYKLDDVQS---------------SLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
+ F++ D +S + NP KYLT++ L DL++SD +F
Sbjct: 302 AATMNVFQTVHNDSKRSLKRKRETGEEDESFNAFNP------KYLTSKDLFDLEISDLSF 355
Query: 281 RRYVLLQFLILFQYFTSTVK-SRGEGLE------------------LKSDQEKWVKDTTE 321
RR++L+Q LI+ + + K SR + E L + KW DT +
Sbjct: 356 RRHILVQALIIMDFLLALSKLSRDKLAETLPATANINKAVMYGDQILSEEDAKWASDTKK 415
Query: 322 TVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
T+ ++Q PDG +F ++V+ +L +++W +WK EGC +KR
Sbjct: 416 TIADYLRQG-PDGPYFYRMVETVLARDKNWVRWKIEGCQPIKR 457
>gi|330794778|ref|XP_003285454.1| hypothetical protein DICPUDRAFT_29296 [Dictyostelium purpureum]
gi|325084629|gb|EGC38053.1| hypothetical protein DICPUDRAFT_29296 [Dictyostelium purpureum]
Length = 666
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 129/407 (31%), Positives = 193/407 (47%), Gaps = 45/407 (11%)
Query: 26 PTCDYDK---KSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIIL 82
P+ D +K KS+++ +IR + LIK + + I+ I L +S LP+ L
Sbjct: 70 PSFDINKIEIKSSIDINIRLFFNILIKEKNTTYEDIDVTIQLAYIFIDIFGLDSILPLQL 129
Query: 83 LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK--NNLLRMCNDLLRRLSRSQNTV 140
D ++ T+++C +LF +E ++ Q + + N LL++C +LL+R SRS N
Sbjct: 130 SEDLYETKTINECLKLFGLLESRAELFSQDPEIIKGRKRNLLLKICVELLKRFSRSSNPD 189
Query: 141 FCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVK 200
CGRILLFLA FP S+ SGLN E NV +E +D ++ + + +
Sbjct: 190 SCGRILLFLAYVFPLSDPSGLNGKGEHNVH------PEESLDFQNDIMKNIEDGSSEDQT 243
Query: 201 IDFNFYKKFWSLQDYFRNPVQCY-----------NKVSWKMFTSYAETVLAAFKSY-KLD 248
ID NFY +FW LQ F+NP NK W F E V+ +F ++ L+
Sbjct: 244 IDKNFYTQFWGLQTIFQNPQTVLFNNITATSITLNKAKWDSFIQALELVIGSFSTHINLE 303
Query: 249 DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTST---------- 298
D+Q S YF KYLT+ L+ LQL D+ FRR +L Q LI FQ T
Sbjct: 304 DLQQSNPTKQHYFTKYLTSSNLMKLQLKDSTFRRNILTQILITFQNLDLTGQKYPTLFND 363
Query: 299 --------VKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
K+ + + ++D T Y ++ T P GE+FS + ILK E++
Sbjct: 364 LQVNIIIAFKTIYHYFLFINFIKNIIQDLTIKCYKILSNTQPGGEYFSNCLSSILKREKN 423
Query: 351 WNQWKNE-GCPELKRPLTSITDEDKKDEPDAKKKKT---PELTKLWN 393
W WK + GC +RP S + KK + EL++LWN
Sbjct: 424 WIIWKRDNGCKNFERPPCSPIVKKKKPVSKKSNRMVMGNVELSRLWN 470
>gi|429863476|gb|ELA37927.1| nuclear matrix protein [Colletotrichum gloeosporioides Nara gc5]
Length = 869
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 138/512 (26%), Positives = 235/512 (45%), Gaps = 101/512 (19%)
Query: 36 LEQSIRQYLLKLIKTPDI---DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTL 92
+E ++R LI T I D + N L + L ++ C+ L L+ + D T+
Sbjct: 84 IETAVRDMFSNLIATTPIESPDFVKVWNLFDLLIILSDEEKCDPALLFWLVEELLDSQTI 143
Query: 93 DKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
C ++F ++E + V +KQ+ +LR CN+LLRRLSR+++T FCGR+ +
Sbjct: 144 AGCRKIFDFLESRRERITVKHFKQKQLV------ILRSCNELLRRLSRAEDTAFCGRVFI 197
Query: 148 FLAKFFPFSERSGLNIISEFNVENIT---EFGGDE-----EMDV-SSNETEETDTEEVDK 198
F+ + FP ++S +N+ E++VEN+T + G + +MDV E + D + K
Sbjct: 198 FMFQSFPLGDKSSVNLRGEYHVENVTVYDDLGAKDDDAAGQMDVDPPKEVADVDAKSQTK 257
Query: 199 VKIDFN---------------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
+ F+ Y FWSLQ+ F P + + + F E + +F+
Sbjct: 258 A-VSFDAKKPVQADQPLDPDVLYPLFWSLQETFSQPKKLFEPDNLASFKRGLEATMTSFR 316
Query: 244 SYK----------LDDVQSSLN------PSGD----YFAKYLTNQKLLDLQLSDTNFRRY 283
+ + +D+ Q SL GD Y KYLT++ L +L++SD +FRR+
Sbjct: 317 AIQAEQGQRPPKAVDEAQRSLKRKREDAEEGDLANSYNPKYLTSRDLFELEMSDISFRRH 376
Query: 284 VLLQFLILFQY---FTSTVKSRGEG----------LELKSDQE-KWVKDTTETVYSLIKQ 329
+L+Q LI+ + +S VK + G ++ SD++ KW +T + +
Sbjct: 377 ILVQSLIVMDFLLSLSSKVKEKLAGNAQNKSVIYVGQVLSDEDTKWATETKRVIVEKYLK 436
Query: 330 TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS---ITDEDKKDEPDAKKKKTP 386
DG +FS+VV+ +L +++W +WK+E CP ++ P S + K + A KK+
Sbjct: 437 HGTDGPYFSRVVETVLARDKNWIRWKSENCPSIELPPVSPETFLEAKKGSQRTATKKRLR 496
Query: 387 ELT---------KLWNSKDNLEACKSAERDFTPSLESY----------FEEAIQQMDPAA 427
+ + N + ++ K+ ER P LE + E AA
Sbjct: 497 SIPMGSLPLGFLEEGNEETAMQKLKAEERYSLPELEGFKRKIADDDFEIEMPTNNQTKAA 556
Query: 428 AVEEQYKKVNDSNYAWRALRLLSRKCPHFFLN 459
AVE + K +WRALR+ S+ F N
Sbjct: 557 AVESKASK------SWRALRIASKTRLTLFDN 582
>gi|299756494|ref|XP_001829374.2| UDP-glucose epimerase [Coprinopsis cinerea okayama7#130]
gi|298411702|gb|EAU92334.2| UDP-glucose epimerase [Coprinopsis cinerea okayama7#130]
Length = 1127
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/390 (30%), Positives = 183/390 (46%), Gaps = 60/390 (15%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
+ + + + + +D C T P ++L D + T+ C Q+F ++E + +
Sbjct: 79 LRDKLDIVLIFTEQDACEQTFPFLVLQDLLETQTIASCSQIFSWIEDRASRLTEGMVPQK 138
Query: 118 CKN-NLLRMCNDLLRRLSR-SQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV--ENIT 173
K LLR N+LLRRLS+ T+FCGRIL FL+ FP ERSG+N+ E+ E +
Sbjct: 139 GKALILLRTLNELLRRLSKMGSTTIFCGRILTFLSGVFPLGERSGVNLRGEYGPTWEGVK 198
Query: 174 -----EFGGDEEMDVSSNETEETDTEEVDKVKIDF---------------NFYKKFWSLQ 213
+ E+ DV +E+ T++ DK+++D +FY FWSLQ
Sbjct: 199 YSDKEKVVSSEDEDVEMAPADESQTQD-DKMQVDTKESQPAPQTAAEKREDFYNTFWSLQ 257
Query: 214 DYFRNPVQCYNKVSWKMFTSYAETVLAAFK--SYKLDDVQSSLNPSGD------------ 259
YF P + NK + + F + V+ K + K + S N +G
Sbjct: 258 LYFSKPTEFANKGALEDFRASINKVIPVIKEATAKERAMMGSRNLTGPGAGLKRKREPEA 317
Query: 260 --------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEG------ 305
+FAK+LT+ +LLDL+++DT+FRR L Q LIL + + K EG
Sbjct: 318 EESNVTEYFFAKFLTSPELLDLEIADTHFRRQFLFQMLILLHHLLTFTKEAKEGWATLRN 377
Query: 306 ------LELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
L+ D +WV++ V ++QT P+G FS V IL E +W +WKNE C
Sbjct: 378 RSLQMDFTLEPDDARWVQEYINKVTEELRQTTPNGRQFSDTVTTILDRERNWVKWKNEIC 437
Query: 360 -PELKRPLTSITDEDKKDEPDAKKKKTPEL 388
P K P + D K D +A ++ EL
Sbjct: 438 VPFDKEPWSQEVDGKKVDMFEATRQFRDEL 467
>gi|310798402|gb|EFQ33295.1| guanylate kinase [Glomerella graminicola M1.001]
Length = 876
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 113/402 (28%), Positives = 192/402 (47%), Gaps = 75/402 (18%)
Query: 34 SALEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDM 89
+A+E ++R LI T I+ +KV N L L + C+ L L+ + D
Sbjct: 79 AAIETAVRNMFSNLIATTPIESPNFVKVW-NLFDLLTILSDDEKCDPALLFWLVEELLDS 137
Query: 90 STLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGR 144
T+ C ++F ++E + V +KQ+ +LR CN+LLRRLSR+++T FCGR
Sbjct: 138 QTIAGCRKIFDFLESRRERITVKHFKQKQLV------ILRSCNELLRRLSRAEDTAFCGR 191
Query: 145 ILLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDEEMDV-SSNETEETDTEE 195
+ +F+ + FP ++S +N+ E++VEN+T F E+MDV S NE +E D +
Sbjct: 192 VFIFMFQSFPLGDKSSVNLRGEYHVENVTVFDDVGAKTDATPEQMDVDSQNEVKEGDAKS 251
Query: 196 VDKVKIDFN---------------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
K + F+ Y +FWSLQ+ F P + ++ F E L
Sbjct: 252 QTKA-VSFDAKKPAQADQPLDPDALYPQFWSLQETFSQPKKLFDPAHLTSFKRGLEATLT 310
Query: 241 AFKSYK----------LDDVQSSL----------NPSGDYFAKYLTNQKLLDLQLSDTNF 280
+F+ + +D+ Q +L + + Y KYLT++ L +L++SD +F
Sbjct: 311 SFRVIQNEQGQRPPKTVDEAQRNLKRKREDVEESDLANSYNPKYLTSRDLFELEMSDISF 370
Query: 281 RRYVLLQFLILFQYFTSTVKSRGE--------------GLELKSDQEKWVKDTTETVYSL 326
RR++L+Q LI+ + S E G L + KW + T+
Sbjct: 371 RRHILVQALIIMDFLLSLSSKTKEKLAGNAQNKSVIYLGQVLSEEDTKWASEMKRTIVDK 430
Query: 327 IKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS 368
+ DG +FS++V+ +L +++W +WK+E CP ++ P S
Sbjct: 431 YLKHGVDGPYFSRMVETVLARDKNWIRWKSENCPSIELPAVS 472
>gi|46122093|ref|XP_385600.1| hypothetical protein FG05424.1 [Gibberella zeae PH-1]
Length = 826
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 121/404 (29%), Positives = 192/404 (47%), Gaps = 79/404 (19%)
Query: 25 KPTCDYDKKSAL-EQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLP 79
KP D ++ A+ E + R LI T ID +KV N + + L C+ L
Sbjct: 69 KPANDKGQQFAIIETAARNIFSHLIATTTIDSPDYVKVW-NLLDILSILSDDGQCDPALL 127
Query: 80 IILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLS 134
L + D T+ C ++F ++E + N +KQ+ +LR CN+LLRRLS
Sbjct: 128 FWLAEELLDSQTIAGCRKIFDFLESRRERITANHFKQKQLI------ILRTCNELLRRLS 181
Query: 135 RSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------------GGDE- 179
R+++T FCGR+ +F+ + FP ++S +N+ E++VENIT + G DE
Sbjct: 182 RAEDTAFCGRVFIFMFQSFPLGDKSSVNLRGEYHVENITTYEQGTADDDFRMALDGPDEQ 241
Query: 180 --EMDVSSNETEETDTEEVDKVK---IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSY 234
E+ S + TD ++ DK K D Y FWSLQ+YF P + + S F
Sbjct: 242 PKELSESKTTPKSTDAKKEDKEKPLPTDV-LYPLFWSLQEYFSQPKKLFETTSLSSFKES 300
Query: 235 AETVLAAFKSYKLDDVQS---------------SLNPSGDYFAKYLTNQKLLDLQLSDTN 279
+ F++ D +S + NP KYLT++ L DL++SD +
Sbjct: 301 LAATMNVFQTVHNDSKRSLKRKRETGEEDESFNTFNP------KYLTSKDLFDLEISDLS 354
Query: 280 FRRYVLLQFLILFQYFTSTVK-SRGEGLE------------------LKSDQEKWVKDTT 320
FRR++L+Q LI+ + + K SR + E L + KW DT
Sbjct: 355 FRRHILVQALIIMDFLLALSKLSRDKLTETLPATAIINKAVMYGDQVLIEEDAKWASDTK 414
Query: 321 ETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
T+ ++Q PDG +F ++V+ +L +++W +WK EGC +KR
Sbjct: 415 RTIADYLRQG-PDGPYFYRMVETVLARDKNWVRWKIEGCQPIKR 457
>gi|146323653|ref|XP_746618.2| THO complex subunit Tho1 [Aspergillus fumigatus Af293]
gi|129555326|gb|EAL84580.2| THO complex subunit Tho1, putative [Aspergillus fumigatus Af293]
gi|159122146|gb|EDP47268.1| THO complex subunit Tho1, putative [Aspergillus fumigatus A1163]
Length = 673
Score = 165 bits (417), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 150/586 (25%), Positives = 245/586 (41%), Gaps = 144/586 (24%)
Query: 34 SALEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDM 89
+A+E + R+ L+ ID I++ N + + + C L L+ + D
Sbjct: 66 AAVELAFREKFYNLLANTTIDEPSFIRMW-NLLDIISIFSDNEQCEPGLIFWLIEELLDS 124
Query: 90 STLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGR 144
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR
Sbjct: 125 QTIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGR 178
Query: 145 ILLFLAKFFPFSERSGLNIISEFNVENITEF------GGDE--EMDVSSNETEET----- 191
+ +FL + FP ++S +N+ E++ EN+T F G E +MDV + +ET
Sbjct: 179 VFIFLFQSFPLGDKSAVNLRGEYHTENVTTFDEITKSGAKESDDMDVEMPDAKETLTATE 238
Query: 192 ---------------------------------DTEEVDKVK-IDFN-FYKKFWSLQDYF 216
TE+ D K +D N Y FW LQ YF
Sbjct: 239 ALGHDHDGQPPPSASASQTLVQEQAKIPKLIVSKTEDKDSEKAVDLNALYPVFWGLQAYF 298
Query: 217 RNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DV 250
P + ++ + F + E L AFK+ + D ++
Sbjct: 299 SAPTKVFDSHHFSSFMTGLENTLIAFKNIRTDLENQSASKTAEEIRKSTKRKRSVDGQEI 358
Query: 251 QSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE----- 304
SS NP KYLT++ L DL+++DT FRR+VL+Q LIL + S T K++ +
Sbjct: 359 ASSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFVLSLTPKAKAKLADLT 412
Query: 305 ------GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEG 358
G L + KW + + + Q G+ + ++V +L +++W +WK EG
Sbjct: 413 NKSVLYGFVLNEEDAKWATKMRKAIEEYL-QEGAGGKFYYRMVDTVLSRDKNWVRWKAEG 471
Query: 359 CPELKRPLTSITDEDKKDEPDAK----KKKTPELTKLWN--------SKDNLEACKSAER 406
CP ++RP S+ + + E K K+ P N S ++E K ER
Sbjct: 472 CPLIERPAVSVAEYLRARENATKTYANKRLRPSPMGALNLNFLAEGESLSSIEKLKEPER 531
Query: 407 DFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR----------- 451
PS +S+ + +D A E++ + ++ WR LRL ++
Sbjct: 532 YSVPSADSFMMGIMDDELDIDTAHTKEDKDTAIRAKASKTWRILRLSAKGKLAAFDKIDD 591
Query: 452 --------KCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISG 489
+ P + TP E S+ E K P + SG
Sbjct: 592 GKNLKALFEAPQPLESTTPAHEPASQASEPAAKASGDAGPEANASG 637
>gi|226289601|gb|EEH45085.1| nuclear matrix protein [Paracoccidioides brasiliensis Pb18]
Length = 649
Score = 165 bits (417), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 151/595 (25%), Positives = 249/595 (41%), Gaps = 127/595 (21%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIIL 82
P Y + +E + RQ L+ T ID I N + + + C L L
Sbjct: 55 PQAQY---AVVETAFRQKFYALLATTSIDDPSFIQIWNLLDIVSIFSDNEQCEPGLFFWL 111
Query: 83 LSDTFDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQ 137
+ + D T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR++
Sbjct: 112 IEELLDSQTIDGCRKVFDYLESRRERNTTKHFKQKSLI------ILRSCNELLRRLSRAE 165
Query: 138 NTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF---------------------- 175
+TVFCGR+ +FL + FP +RS +N+ EF+ EN+T F
Sbjct: 166 DTVFCGRVFIFLFQSFPLGDRSSVNLRGEFHTENVTTFDKLPKAQEGSGISVDVDSLEGK 225
Query: 176 -GGDEEMDVSSNETEETDTE-----------------------EVDKVKIDFNFYKKFWS 211
D ++++S ++T+ E E V Y FWS
Sbjct: 226 SSADMKIELSEAAAQQTNGEATPSTADEHQQGVKSAIQKTTEPEAASVLTTDELYPIFWS 285
Query: 212 LQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------DVQS 252
LQ F +P ++ ++ F + E L+ F+ D +
Sbjct: 286 LQTNFSSPTTLFDPANFAAFKTGLEASLSTFQKVNTDMQVRITKGSEETRRGPKRRRIGD 345
Query: 253 SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLE---- 307
S + + KYLT++ L +L+++D FRR++L+Q LIL + S T K++ + +
Sbjct: 346 STEMTNSFNPKYLTSRDLFELEVNDVAFRRHILVQALILLDFVLSLTPKAKAKLADSTNK 405
Query: 308 -------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
L D KW ++ S ++Q +G+ + ++V +L +++W +WK EGCP
Sbjct: 406 AVLYNYVLSDDDAKWASQMKASIASYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEGCP 464
Query: 361 ELKRPLTSITD------EDKKDEPDAKKKKTP----ELTKLWNSK--DNLEACKSAERDF 408
++R SI D K + + TP +L L K D++ K +R
Sbjct: 465 PIERTPVSIQDYLDTQASAIKVTTSKRLRLTPFGSLDLNFLMEDKNLDSVARLKHPDRFT 524
Query: 409 TPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSRKCPHFF------- 457
TP ++SY +D A + EE+ + + +N WR LRL SR F
Sbjct: 525 TPDIDSYLRGVADDEFNIDMAQSREEKDEAIKAKANKIWRTLRLSSRSKLALFDKIEDGN 584
Query: 458 ----LNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSE 508
L TP +++ V + + + Q NGNG D A EV+ E
Sbjct: 585 NIKVLFETPPAPEDA-----AVPATSRPQLAVQNGVNGNGQDGPSATGEVNESKE 634
>gi|409083598|gb|EKM83955.1| hypothetical protein AGABI1DRAFT_117418, partial [Agaricus bisporus
var. burnettii JB137-S8]
Length = 1044
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 119/367 (32%), Positives = 178/367 (48%), Gaps = 60/367 (16%)
Query: 62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN- 120
+ L + +D C T P I+L D + T++ C +F ++E + + K
Sbjct: 83 LDLVLTFTEQDACEQTFPFIVLQDLLETQTINSCSHIFSWIEARSSRLTEGMVPQKGKAL 142
Query: 121 NLLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNIISEF------------ 167
LLR NDLLRRLS+S +T VFCGRIL FL+ FP ERSG+N+ E+
Sbjct: 143 ILLRTLNDLLRRLSKSGSTTVFCGRILTFLSGVFPLGERSGVNLRGEYGPTWEDVWYPKH 202
Query: 168 ----NVENITEFGGDEEMDVSSNETEETDTEEVDKVK----IDFNFYKKFWSLQDYFRNP 219
N E +T+F G+ S+E E + +VD K +FY FW LQ F P
Sbjct: 203 DEDTNTE-MTDFRGESS---RSDEVPEDNAMQVDSNKETKATPTDFYNTFWHLQYPFSKP 258
Query: 220 VQCYNKVSWKMFTSYAETVLAAFK--SYKLDDVQSSLNPSGD------------------ 259
NK ++ F + VL K + K + S +G
Sbjct: 259 SIFANKEAFPQFQEAIDKVLPVIKEATAKERAMMGSRVAAGVVLTKRKREPEAEESNVNE 318
Query: 260 -YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLE--- 307
+FAK+LT+ LLDL+++DT+FRR L Q +IL Q+ FT K+ R L+
Sbjct: 319 YFFAKFLTSPDLLDLEIADTHFRRQFLFQLMILLQHLLTFTKGAKAAWATQRNRSLQMDF 378
Query: 308 -LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC-PELKRP 365
L+++ +WV+DT +KQT P+G F++ + IL+ E++W +WKNE C P K P
Sbjct: 379 TLEAEDTQWVQDTLSKSMEELKQTTPNGRAFAETTQTILEREKNWVKWKNELCAPFDKEP 438
Query: 366 LTSITDE 372
++ D+
Sbjct: 439 WSTRVDD 445
>gi|225682229|gb|EEH20513.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 701
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 141/548 (25%), Positives = 231/548 (42%), Gaps = 121/548 (22%)
Query: 70 MKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLR 124
+ + C L L+ + D T+D C ++F Y+E +KQ++ +LR
Sbjct: 117 LTEQCEPGLFFWLIEELLDSQTIDGCRKVFDYLESRRERNTTKHFKQKSLI------ILR 170
Query: 125 MCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------G 176
CN+LLRRLSR+++TVFCGR+ +FL + FP +RS +N+ EF+ EN+T F G
Sbjct: 171 SCNELLRRLSRAEDTVFCGRVFIFLFQSFPLGDRSSVNLRGEFHTENVTTFDKLPKAQDG 230
Query: 177 GDEEMDVSSNE--------------------------------------TEETDTEEVDK 198
+DV S E ++T E
Sbjct: 231 SGISVDVDSLEGKSSADMKIELPEAAAQQTNGEATPSTADEHQQGVKSAIQKTTEPEAAS 290
Query: 199 VKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD---------- 248
V Y FWSLQ F +P ++ ++ F + E L+ F+ D
Sbjct: 291 VLTTDELYPIFWSLQTNFSSPTTLFDPANFAAFKTGLEASLSTFQKVNTDMQVRITKGSE 350
Query: 249 ---------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-T 298
+ S + + KYLT++ L +L+++D FRR++L+Q LIL + S T
Sbjct: 351 ETRRGPKRRRIGDSTEMTNSFNPKYLTSRDLFELEVNDVAFRRHILVQALILLDFVLSLT 410
Query: 299 VKSRGEGLE-----------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKG 347
K++ + + L D KW ++ S ++Q +G+ + ++V +L
Sbjct: 411 PKAKAKLADSTNKAVLYNYVLSDDDAKWASQMKASIASYLQQG-SEGKFYYRMVDTVLTR 469
Query: 348 EEHWNQWKNEGCPELKRPLTSITD--EDKKDEPDAKKKKTPELTKLWN----------SK 395
+++W +WK EGCP ++R SI D + + K LT L + +
Sbjct: 470 DKNWVRWKAEGCPPIERTPVSIQDYLDTQASAIKVTTSKRLRLTPLGSLDLNFLMEDKNL 529
Query: 396 DNLEACKSAERDFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
D++ K +R TP ++SY +D A + EE+ + + +N WR LRL SR
Sbjct: 530 DSVARLKHPDRFTTPDIDSYLRGVADDEFNIDMAQSREEKDEAIKAKANKIWRTLRLSSR 589
Query: 452 KCPHFF-----------LNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAE 500
F L TP +++ V + +P+ Q NGNG D A
Sbjct: 590 SKLALFDKIEDGNNIKVLFETPPAPEDA-----AVPATSRPQPAVQNGVNGNGQDGPSAT 644
Query: 501 VEVDTKSE 508
EV+ E
Sbjct: 645 GEVNESKE 652
>gi|426201362|gb|EKV51285.1| hypothetical protein AGABI2DRAFT_182243, partial [Agaricus bisporus
var. bisporus H97]
Length = 1043
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 115/363 (31%), Positives = 176/363 (48%), Gaps = 52/363 (14%)
Query: 62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN- 120
+ L + +D C P I+L D + T++ C +F ++E + + K
Sbjct: 83 LDLVLTFTEQDACEQIFPFIVLQDLLETQTINSCSHIFSWIEARSSRLTEGMVPQKGKAL 142
Query: 121 NLLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNIISEFNV--ENITEFGG 177
LLR NDLLRRLS+S +T VFCGRIL FL+ FP ERSG+N+ E+ E++
Sbjct: 143 ILLRTLNDLLRRLSKSGSTTVFCGRILTFLSGVFPLGERSGVNLRGEYGPTWEDVWYPKH 202
Query: 178 DEE-----MDVSSNETEETDTEEVDKVKIDFN---------FYKKFWSLQDYFRNPVQCY 223
DE+ MD + + E + +++D N FY FW LQ F P
Sbjct: 203 DEDTNAEMMDFRGESSRSDEVPEDNAMQVDSNKETKATPTDFYNTFWHLQYPFSKPSIFA 262
Query: 224 NKVSWKMFTSYAETVLAAFK--SYKLDDVQSSLNPSGD-------------------YFA 262
NK ++ F + VL K + K + S +G +FA
Sbjct: 263 NKEAFSQFQEAIDKVLPVIKEATAKERAMMGSRVAAGVVLNKRKREPEPEESNVNEYFFA 322
Query: 263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLE----LKS 310
K+LT+ LLDL+++DT+FRR L Q +IL Q+ FT K+ R L+ L++
Sbjct: 323 KFLTSPDLLDLEIADTHFRRQFLFQLMILLQHLLTFTKGTKAAWATQRNRSLQMDFTLEA 382
Query: 311 DQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC-PELKRPLTSI 369
+ +WV+DT +KQT P+G F++ + IL+ E++W +WKNE C P K P ++
Sbjct: 383 EDTQWVQDTLSKSMEELKQTTPNGRAFAETTQTILEREKNWVKWKNELCAPFDKEPWSTR 442
Query: 370 TDE 372
D+
Sbjct: 443 VDD 445
>gi|238508732|ref|XP_002385551.1| THO complex subunit Tho1, putative [Aspergillus flavus NRRL3357]
gi|220688443|gb|EED44796.1| THO complex subunit Tho1, putative [Aspergillus flavus NRRL3357]
Length = 685
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 133/485 (27%), Positives = 209/485 (43%), Gaps = 118/485 (24%)
Query: 72 DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMC 126
+ C L L+ + D T+D C ++F Y+E +KQ++ +LR C
Sbjct: 122 EQCEPGLIFWLIEELLDSQTIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSC 175
Query: 127 NDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVS-- 184
N+LLRRLSR+++TVFCGR+ +FL + FP ++S +N+ E++ EN+T F E+D S
Sbjct: 176 NELLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSAVNLRGEYHTENVTTFDEIPELDTSVS 235
Query: 185 ----------------------SNETEE---------------------TDTEEVDKVKI 201
+N+TE + E VD+
Sbjct: 236 DEADVVMSDEQGPPTTTEGQQENNDTEAPSIADPQAPVQDQPAAPKVIISQEEGVDEKAN 295
Query: 202 DFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD------------ 248
D N Y FW LQ YF P + ++ + F + E L+AFKS D
Sbjct: 296 DLNKLYPTFWGLQAYFSAPTRIFDPQHFATFRTGLEATLSAFKSVNTDLESSGTSKTSEE 355
Query: 249 --------------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY 294
++ SS NP KYLT++ L DL+++DT FRR+VL+Q LIL +
Sbjct: 356 LRKSNKRKRTSDGQEIASSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDF 409
Query: 295 FTS------------TVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVK 342
S T KS G L D KW + + + Q G+ + ++V
Sbjct: 410 MLSLTPKAKAKLAELTNKSVLYGFVLNEDDAKWAVKMRKAIEEYL-QEGVGGKFYYRMVD 468
Query: 343 LILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDA-----KKKKTPELTKL------ 391
+L +++W +WK EGCP ++RP S++D E K+ ++ + L
Sbjct: 469 TVLSRDKNWVRWKAEGCPLIERPPVSVSDYLGAREHATKVYANKRLRSSPMGSLNLNFLS 528
Query: 392 -WNSKDNLEACKSAERDFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRAL 446
S LE K +R PS +S+ + MD A E++ ++ WR L
Sbjct: 529 EGESLAGLERLKEPQRFAVPSSDSFMMGIMDDELDMDTAQTKEDKENATRAKASKTWRVL 588
Query: 447 RLLSR 451
RL ++
Sbjct: 589 RLSAK 593
>gi|325187941|emb|CCA22485.1| THO complex subunit putative [Albugo laibachii Nc14]
Length = 620
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 131/468 (27%), Positives = 202/468 (43%), Gaps = 78/468 (16%)
Query: 54 DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
D I + LC++ + ++ P +L D + + CE L+ +EV
Sbjct: 80 DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 139
Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
FF + LLRMCN +LRRLS++ NTVFCGRIL+FL+ F +ERS +N+ S+
Sbjct: 140 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAVNLHSKA 199
Query: 168 NVENITEFGGD-----EEMDVSSNETEETDT-------EEVDKVKIDFNFYKKFWSLQDY 215
NVEN+T F E D +N E D ++ +ID+ Y FW +Q +
Sbjct: 200 NVENLTIFEDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQRF 259
Query: 216 FRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNPS 257
FR+ + + + K F VL AF++ D V S+ S
Sbjct: 260 FRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKMS 319
Query: 258 GD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
D + KYLTN +L LQL D R VL Q LIL + +
Sbjct: 320 IDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHLVRAKST 379
Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
+ S Q+ + + + V L+K+TPPDG +S++V +L+ E +W +WK E CP
Sbjct: 380 VSQD----STQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCPS 435
Query: 362 L--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----LE 399
KR + T+ + + +KK E + ++ S + LE
Sbjct: 436 YEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLLE 495
Query: 400 ACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
K R +L + + DPA+ ++++Y D Y WR LR
Sbjct: 496 KIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 543
>gi|325187939|emb|CCA22483.1| THO complex subunit putative [Albugo laibachii Nc14]
Length = 588
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 132/468 (28%), Positives = 205/468 (43%), Gaps = 78/468 (16%)
Query: 54 DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
D I + LC++ + ++ P +L D + + CE L+ +EV
Sbjct: 48 DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 107
Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
FF + LLRMCN +LRRLS++ NTVFCGRIL+FL+ F +ERS +N+ S+
Sbjct: 108 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAVNLHSKA 167
Query: 168 NVENITEFGGD-----EEMDVSSNETEETDT-------EEVDKVKIDFNFYKKFWSLQDY 215
NVEN+T F E D +N E D ++ +ID+ Y FW +Q +
Sbjct: 168 NVENLTIFEDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQRF 227
Query: 216 FRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNPS 257
FR+ + + + K F VL AF++ D V S+ S
Sbjct: 228 FRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKMS 287
Query: 258 GD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
D + KYLTN +L LQL D R VL Q LIL + V++
Sbjct: 288 IDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHL---VRA 344
Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
+ + S Q+ + + + V L+K+TPPDG +S++V +L+ E +W +WK E CP
Sbjct: 345 KS-TVSQDSTQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCPS 403
Query: 362 L--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----LE 399
KR + T+ + + +KK E + ++ S + LE
Sbjct: 404 YEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLLE 463
Query: 400 ACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
K R +L + + DPA+ ++++Y D Y WR LR
Sbjct: 464 KIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 511
>gi|325187943|emb|CCA22487.1| THO complex subunit putative [Albugo laibachii Nc14]
Length = 583
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 131/469 (27%), Positives = 204/469 (43%), Gaps = 80/469 (17%)
Query: 54 DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
D I + LC++ + ++ P +L D + + CE L+ +EV
Sbjct: 80 DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 139
Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
FF + LLRMCN +LRRLS++ NTVFCGRIL+FL+ F +ERS +N+ S+
Sbjct: 140 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAVNLHSKA 199
Query: 168 NVENITEFGGDEEMDVSSNETEETDTEEVDKV-------------KIDFNFYKKFWSLQD 214
NVEN+T F D+ ++ E+T+ E D +ID+ Y FW +Q
Sbjct: 200 NVENLTIF-EDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQR 258
Query: 215 YFRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNP 256
+FR+ + + + K F VL AF++ D V S+
Sbjct: 259 FFRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKM 318
Query: 257 SGD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
S D + KYLTN +L LQL D R VL Q LIL +
Sbjct: 319 SIDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHLVRAKS 378
Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
+ + S Q+ + + + V L+K+TPPDG +S++V +L+ E +W +WK E CP
Sbjct: 379 TVSQD----STQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCP 434
Query: 361 EL--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----L 398
KR + T+ + + +KK E + ++ S + L
Sbjct: 435 SYEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLL 494
Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
E K R +L + + DPA+ ++++Y D Y WR LR
Sbjct: 495 EKIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 543
>gi|325187938|emb|CCA22482.1| THO complex subunit putative [Albugo laibachii Nc14]
Length = 682
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 131/469 (27%), Positives = 201/469 (42%), Gaps = 80/469 (17%)
Query: 54 DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
D I + LC++ + ++ P +L D + + CE L+ +EV
Sbjct: 48 DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 107
Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
FF + LLRMCN +LRRLS++ NTVFCGRIL+FL+ F +ERS +N+ S+
Sbjct: 108 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAVNLHSKA 167
Query: 168 NVENITEFGGDEEMDVSSNETEETDTEEVDKV-------------KIDFNFYKKFWSLQD 214
NVEN+T F D+ ++ E+T+ E D +ID+ Y FW +Q
Sbjct: 168 NVENLTIF-EDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQR 226
Query: 215 YFRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNP 256
+FR+ + + + K F VL AF++ D V S+
Sbjct: 227 FFRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKM 286
Query: 257 SGD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
S D + KYLTN +L LQL D R VL Q LIL +
Sbjct: 287 SIDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHLVRAKS 346
Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
+ + S Q+ + + + V L+K+TPPDG +S++V +L+ E +W +WK E CP
Sbjct: 347 TVSQD----STQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCP 402
Query: 361 EL--------------KRPLTSITDEDKKDEPDAKKKKTPELTKL--------WNSKDNL 398
KR + T+ + + +KK E + S L
Sbjct: 403 SYEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLL 462
Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
E K R +L + + DPA+ ++++Y D Y WR LR
Sbjct: 463 EKIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 511
>gi|325187944|emb|CCA22488.1| THO complex subunit putative [Albugo laibachii Nc14]
Length = 551
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 132/469 (28%), Positives = 207/469 (44%), Gaps = 80/469 (17%)
Query: 54 DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
D I + LC++ + ++ P +L D + + CE L+ +EV
Sbjct: 48 DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 107
Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
FF + LLRMCN +LRRLS++ NTVFCGRIL+FL+ F +ERS +N+ S+
Sbjct: 108 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAVNLHSKA 167
Query: 168 NVENITEFGGDEEMDVSSNETEETDTEEVDKV-------------KIDFNFYKKFWSLQD 214
NVEN+T F D+ ++ E+T+ E D +ID+ Y FW +Q
Sbjct: 168 NVENLTIF-EDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQR 226
Query: 215 YFRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNP 256
+FR+ + + + K F VL AF++ D V S+
Sbjct: 227 FFRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKM 286
Query: 257 SGD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
S D + KYLTN +L LQL D R VL Q LIL + V+
Sbjct: 287 SIDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHL---VR 343
Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
++ + S Q+ + + + V L+K+TPPDG +S++V +L+ E +W +WK E CP
Sbjct: 344 AKS-TVSQDSTQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCP 402
Query: 361 EL--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----L 398
KR + T+ + + +KK E + ++ S + L
Sbjct: 403 SYEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLL 462
Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
E K R +L + + DPA+ ++++Y D Y WR LR
Sbjct: 463 EKIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 511
>gi|400595094|gb|EJP62904.1| guanylate kinase [Beauveria bassiana ARSEF 2860]
Length = 817
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 130/475 (27%), Positives = 214/475 (45%), Gaps = 66/475 (13%)
Query: 36 LEQSIRQYLLKLIKTPDIDI-KVIE--NYISLCVELCMKDMCNSTLPIILLSDTFDMSTL 92
+E + R LI + ID K + N++ + L C+ L L+ + D T+
Sbjct: 82 VEGASRDLFGALIASIPIDSPKFVRMWNFLDILSILSDGGQCDPALVFWLVEELLDSQTI 141
Query: 93 DKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKF 152
C +F ++E + F K +LR CN+LLRRLSR+++T FCGR+ +F+ +
Sbjct: 142 AGCRHIFDFLESRRERMTAKNFSQK-KLVILRSCNELLRRLSRAEDTAFCGRVFIFMFQC 200
Query: 153 FPFSERSGLNIISEFNVENITEF-----GGDEEMDVSSNETEETDTEEVDKVK------- 200
FP +RS +N+ E++VEN+T + G +MDV + ET+ EE+ K
Sbjct: 201 FPLGDRSSVNLRGEYHVENVTAYETTANGDATKMDVDA----ETNGEEMADAKDGKPKEG 256
Query: 201 ----IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP 256
Y WSLQ+ F P ++ + F E + AF+S DD +S
Sbjct: 257 SSRLSSEELYPLLWSLQESFSQPTTLFDSAKFSHFKHSLEETMKAFESLHADDQRSLKRK 316
Query: 257 SGDYFA---------KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKSRGE 304
D A KYLT+Q L +L++SD +FRR+VL+Q LI+ + ++ K +
Sbjct: 317 RNDEGATTIAEAFNPKYLTSQDLFELEISDLSFRRHVLVQALIITDFLLSLSAEAKQKLT 376
Query: 305 GL-----------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQ 353
G+ +L + KW DT + ++Q DG +F ++V+ +L +++W
Sbjct: 377 GVPTANKAVVYSEQLSEEDTKWATDTKRRISEYLRQG-FDGPYFFRMVETVLARDKNWVF 435
Query: 354 WKNEGCPELKR-PLTS---ITDEDKKDEPDAKKKKTPELTKLWN--------SKDNLEAC 401
WK CP ++R P+T+ + + K+ P + S L+
Sbjct: 436 WKMASCPPIEREPVTAADFVAARSSAERMATSKRLRPTPIGAVSMAFLDRSASSATLDEL 495
Query: 402 KSAERDFTPSLESYFEEAIQQMD-----PAAAVEEQYKKVNDSNYAWRALRLLSR 451
+ R P L S F+ I D PA+ S+ +WRALR+ +R
Sbjct: 496 RPPARYQLPELAS-FQRPIADDDFEIGMPASDASRAAAVHGKSSKSWRALRIAAR 549
>gi|452822457|gb|EME29476.1| THO complex subunit 1 [Galdieria sulphuraria]
Length = 510
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 128/471 (27%), Positives = 209/471 (44%), Gaps = 65/471 (13%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENY------ISLCVELCMKDMCNSTLPIILLS 84
D S LE +R LL+L+ + E+Y + L + L + M P LL
Sbjct: 36 DLGSILELELRGILLELVSRSLSEEVSEEHYLKIFRLVDLSLLLSKESMVEQATPCQLLE 95
Query: 85 DTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGR 144
D D + ++C ++ Y E ++ ++ K LR C LLRR SR + CGR
Sbjct: 96 DLMDTTPTNQCWRVLDYFEKHLGNFRSHEILFKGKLIFLRACISLLRRYSRVLDAETCGR 155
Query: 145 ILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFN 204
I+L L+ FP SERSG+N+ F+ N T++ V + E E + D + +D+N
Sbjct: 156 IILLLSSAFPPSERSGVNLKGNFHTSNTTQY-------VRAEEEREYSSTSGDCL-LDWN 207
Query: 205 FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKS---YKLDDVQSSLNPSGDY- 260
Y+ FWSLQ + NP Y WK F + VL ++ +L D S+
Sbjct: 208 LYETFWSLQQFLSNPTVAYGPAEWKSFCNSLSKVLQVMETDPVRELADDPLSMTEVEQVS 267
Query: 261 ---------FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSD 311
F KYLT+ LL Q D+ FRRY L+Q+LI Q+ + R + SD
Sbjct: 268 VEQCEHQLDFVKYLTSPYLLSFQFRDSVFRRYHLVQYLIFLQHMATL---RENFAWIGSD 324
Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN-EGCPELKRPLTSIT 370
+ ++ ++ +++ T P GE F + +K +L E +W +WK + C +RP I
Sbjct: 325 E---MEGLVNRIFRVLQGTKPQGERFVEYIKRVLLHERYWLKWKTLDNCKAFERPPCEIP 381
Query: 371 -------------------------DEDKKDEPDAKKKKTPELTK---LWNSKDNLEAC- 401
D ++K + + K ++++ W +K+ E
Sbjct: 382 ENSAGENDNILWRSLEQRWQRLYNEDRERKTLSEPENKSILQMSRSYWSWKTKEQREGTL 441
Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
K A R P+++ +E + D +EE+ K N ++ W++LR+L K
Sbjct: 442 KDAGRIVVPNIDEIMDEL--KRDKEEGIEEELAKKNSYSFVWKSLRVLLGK 490
>gi|119486899|ref|XP_001262369.1| THO complex subunit Tho1, putative [Neosartorya fischeri NRRL 181]
gi|119410526|gb|EAW20472.1| THO complex subunit Tho1, putative [Neosartorya fischeri NRRL 181]
Length = 674
Score = 162 bits (411), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 138/529 (26%), Positives = 233/529 (44%), Gaps = 125/529 (23%)
Query: 34 SALEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDM 89
+A+E + R+ L+ + ID I++ N + + + C L L+ + D
Sbjct: 66 AAVELAFREKFYSLLASTTIDEPSFIRMW-NLLDIISIFSDNEQCEPGLIFWLIEELLDS 124
Query: 90 STLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGR 144
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR
Sbjct: 125 QTIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGR 178
Query: 145 ILLFLAKFFPFSERSGLNIISEFNVENITEF------GGDE--EMDVSSNETEET----- 191
+ +FL + FP ++S +N+ E++ EN+T F G E +MDV ++ +ET
Sbjct: 179 VFIFLFQSFPLGDKSAVNLRGEYHTENVTTFDEITKLGAKESDDMDVEMSDAKETLTATE 238
Query: 192 ---------------------------------DTEEVDKVK-IDFN-FYKKFWSLQDYF 216
TE+ D K +D + Y FW LQ YF
Sbjct: 239 AQGYDHDGQPPPSASASQTPVQGQAKVPKLLVSKTEDKDTEKAVDLDALYPVFWGLQAYF 298
Query: 217 RNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DV 250
P + ++ + F + E L AFK+ + D ++
Sbjct: 299 SAPTKVFDSQHFSSFMTGLENTLIAFKNIRTDLENQSASKTAEEIRKSTKRKRSVDGQEI 358
Query: 251 QSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE----- 304
SS NP KYLT++ L DL+++DT FRR+VL+Q LIL + S T K++ +
Sbjct: 359 ASSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFMLSLTPKAKAKLADLT 412
Query: 305 ------GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEG 358
G L + KW + + + Q G+ + ++V +L +++W +WK EG
Sbjct: 413 NKSVLYGFVLNEEDAKWATKMRKAIEEYL-QEGTGGKFYYRMVDTVLSRDKNWVRWKAEG 471
Query: 359 CPELKRPLTSITD----EDKKDEPDAKKKKTPELTKLWN--------SKDNLEACKSAER 406
CP ++RP S+ + + + A K+ P N S ++E K ER
Sbjct: 472 CPLIERPAVSVAEYLGARENATKTYANKRLRPSPMGALNLNFLAEGESLSSIEKLKEPER 531
Query: 407 DFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
PS +S+ + +D A E++ + ++ WR LRL ++
Sbjct: 532 YSVPSADSFMMGIMDDELDIDTAHTKEDKDTAIRAKASKTWRVLRLSAK 580
>gi|389639088|ref|XP_003717177.1| guanylate kinase [Magnaporthe oryzae 70-15]
gi|351642996|gb|EHA50858.1| guanylate kinase [Magnaporthe oryzae 70-15]
Length = 810
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 133/507 (26%), Positives = 231/507 (45%), Gaps = 103/507 (20%)
Query: 34 SALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKD---MCNSTLPIILLSDTFDMS 90
+ LE ++R +L+ ID +L LC+ + +C+ L L+ + D
Sbjct: 74 AVLETAVRDTFGRLLAESSIDAPDFVRVWNLLDLLCILEDLELCDPALLFWLIEELLDSQ 133
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
T+ C ++F Y+E + F S K +LR CN+LLRRLSR+++T FCGR+ +FL
Sbjct: 134 TISGCRKIFDYLESRRQKITSK-HFASKKLVILRTCNELLRRLSRARDTAFCGRVFIFLF 192
Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDK------------ 198
+ FP ++S +N+ EF+VEN+T + + D + TD E+D
Sbjct: 193 QSFPLGDKSSVNLRGEFHVENVTTY----DQDAAKRNGATTDQMEIDGQPATATPNEGKQ 248
Query: 199 --------------VKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF- 242
+D + Y FWSLQ F P +++ ++ F S ++ L+A
Sbjct: 249 ATLKHTSSDAKGQGPPVDMDALYPIFWSLQSSFSQPKNLFDRKNFAEFQSNLQSTLSALE 308
Query: 243 --------KSYKLDDVQSS-----LNPSGDYFA-----KYLTNQKLLDLQLSDTNFRRYV 284
+S + D + +P G+ A +YLT++ L +L+LSD FRRY+
Sbjct: 309 YAQGQASSRSKQYDGGSRAGQKRKRDPDGNELADSFNPRYLTSRDLFELELSDLTFRRYI 368
Query: 285 LLQFLILFQYFTSTVKSRGEGL--------------ELKSDQE-KWVKDTTETVYS-LIK 328
L+Q LI+ ++ S E L ++ SD+E KW + + + + L
Sbjct: 369 LVQALIIMEFLLSLSAEAKEKLAQVKVNNKSVTYSDQVLSDEEAKWAVNMKDKISAQLTS 428
Query: 329 QTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE-------------DKK 375
+ DG++F ++V +L +++W +WK EGCP+++ P S+ E +KK
Sbjct: 429 ISVGDGQYFHRIVATVLARDKNWVRWKIEGCPQIELP--SVPPEQFVEAKKTVHRQVNKK 486
Query: 376 DEPDAKKKKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMD----------- 424
+A + + ++ L++ K ++R P L F+ I + D
Sbjct: 487 HRSNAGSALSLDFLSTSSNSSVLDSLKVSDRYQLPEL-GMFQRKIAEDDLEIDMPMNDKS 545
Query: 425 PAAAVEEQYKKVNDSNYAWRALRLLSR 451
A A+E + K +WRALR+ S+
Sbjct: 546 KAEAIEGKASK------SWRALRIASK 566
>gi|346319558|gb|EGX89159.1| THO complex subunit Tho1, putative [Cordyceps militaris CM01]
Length = 805
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 121/452 (26%), Positives = 202/452 (44%), Gaps = 66/452 (14%)
Query: 60 NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK 119
N++ + L C++ L L+ + D T+ C LF ++E + F K
Sbjct: 116 NFLDILSILSDDGQCDAALVFWLVEELLDSQTIAGCRHLFDFLESRRERMTAKNFSQK-K 174
Query: 120 NNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF---- 175
+LR CN+LLRRLSR+++T FCGR+ +F+ + FP +RS +N+ E++VEN+T +
Sbjct: 175 LVILRSCNELLRRLSRAEDTAFCGRVFIFMFQCFPLGDRSSVNLRGEYHVENVTTYETTT 234
Query: 176 -GGDEEMDVSSNETEETDTEEVDKVKIDFN---------------FYKKFWSLQDYFRNP 219
G +MD+ + E E + K+D Y WSLQ+ F P
Sbjct: 235 NGSSTKMDI---DIEAPVAETNGEKKLDAKDSKAKEGGSRLSSEELYPLLWSLQESFSQP 291
Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGDYFA--------KYLTNQKLL 271
++ + F E L AF+S DD +SS D A KYLT+Q L
Sbjct: 292 TTLFDAAKFSHFKHSLEETLKAFESLHADDPRSSKRKRNDEGAAVAEAFNPKYLTSQDLF 351
Query: 272 DLQ-LSDTNFRRYVLLQFLILFQYFTSTVKSRGEGL--------------ELKSDQEKWV 316
L+ ++D +FRR+VL+Q LI+ ++ S + L +L + KW
Sbjct: 352 VLEVINDLSFRRHVLVQALIITEFLLSLSAEAKQKLSSLPAANKAVVYSEQLSEENTKWA 411
Query: 317 KDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD----- 371
+T + ++Q DG +F ++V+ +L +++W WK CP ++R + D
Sbjct: 412 TETKRRISEYLRQG-VDGPYFLRMVETVLARDKNWVFWKMTSCPPIERAPVAAADFVAAR 470
Query: 372 -------EDKKDEPDAKKKKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMD 424
K+ P + E S+ +++ + R P+L S FE + D
Sbjct: 471 TSAERMATSKRLRPTPMGAVSMEFLNRAASQTSMDELQPPARSALPALAS-FERPLADDD 529
Query: 425 -----PAAAVEEQYKKVNDSNYAWRALRLLSR 451
P + V + S+ +WRALR+ +R
Sbjct: 530 FEIAMPTSDVTKAAAVYGKSSKSWRALRIAAR 561
>gi|295662084|ref|XP_002791596.1| nuclear matrix protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226279722|gb|EEH35288.1| nuclear matrix protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 649
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 150/601 (24%), Positives = 246/601 (40%), Gaps = 139/601 (23%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIIL 82
P Y + +E + R+ L+ T ID I N + + + C L L
Sbjct: 55 PQAQY---AVVETAFREKFYALLATTSIDDPSFIQIWNLLDIVSIFSDNEQCEPGLLFWL 111
Query: 83 LSDTFDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQ 137
+ + D T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR++
Sbjct: 112 IEELLDSQTIDGCRKVFDYLESRRERNTTKHFKQKSLI------ILRSCNELLRRLSRAE 165
Query: 138 NTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF---------------------- 175
+TVFCGR+ +FL + FP +RS +N+ EF+ EN+T F
Sbjct: 166 DTVFCGRVFIFLFQSFPLGDRSSVNLRGEFHTENVTTFDKLPKAQEGSAISVEVDFLEGK 225
Query: 176 ------------------------GGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWS 211
DE + ++T E V Y FWS
Sbjct: 226 SSADMKIESPQAAAQQTNGEVTPSTADEHQQGAKPAIQKTTEPEAASVLTTDELYPIFWS 285
Query: 212 LQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD----------------------- 248
LQ F +P ++ ++ F + E L+ F+ D
Sbjct: 286 LQTNFSSPTTLFDPTNFAAFKTGLEASLSTFQKVNTDMQVRMTKGSEETRRGPKRRRIGD 345
Query: 249 --DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEG 305
++ +S NP KYLT++ L +L+++D FRR++L+Q LIL + S T K++ +
Sbjct: 346 GTEMTNSFNP------KYLTSRDLFELEVNDVAFRRHILVQALILLDFVLSLTPKAKAKL 399
Query: 306 LE-----------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQW 354
+ L D KW ++ S ++Q +G+ + ++V +L +++W +W
Sbjct: 400 ADSTNKAVLYNYVLSDDDAKWASQMKASIASYLQQGS-EGKFYYRMVDTVLTRDKNWVRW 458
Query: 355 KNEGCPELKRPLTSITD------EDKKDEPDAKKKKTP----ELTKLWNSK--DNLEACK 402
K EGCP ++R SI D K + + TP +L L K D++ K
Sbjct: 459 KAEGCPPIERTPVSIQDYLDTQASAVKVTTSKRLRSTPLGSLDLNFLMEDKNLDSVARLK 518
Query: 403 SAERDFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSRKCPHFF- 457
+R TP ++SY +D A + EE+ + + +N WR LRL SR F
Sbjct: 519 HPDRFTTPEIDSYLRGVADDEFNIDMAQSREEKDEAIKAKANKIWRTLRLSSRSKLALFD 578
Query: 458 ----------LNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKS 507
L TP +++ + + + + Q NGNG D A EV+
Sbjct: 579 KIEDGNNIKVLFETPPAPEDA-----AIAATSRPQLAVQNGVNGNGQDGPSATGEVNESK 633
Query: 508 E 508
E
Sbjct: 634 E 634
>gi|166240592|ref|XP_643594.2| hypothetical protein DDB_G0275717 [Dictyostelium discoideum AX4]
gi|165988675|gb|EAL69609.2| hypothetical protein DDB_G0275717 [Dictyostelium discoideum AX4]
Length = 726
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 139/446 (31%), Positives = 206/446 (46%), Gaps = 72/446 (16%)
Query: 6 QLNFDK-SQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISL 64
+L F+K S L + Y K + K++++ +IR + LIK DI + I+ I L
Sbjct: 66 KLLFNKDSLLKEIQKIYPNIKDPIQIEIKTSIDLNIRIFFNNLIKQIDITYENIDLAIKL 125
Query: 65 CVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSC--KNNL 122
+ +S LP+ L D F+ T+ KC LF +E I+ Q + +N L
Sbjct: 126 AYSFVELGILDSILPLQLSEDLFETKTISKCLDLFGLLESRAEIFSQDPEIIKGRKRNLL 185
Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV--ENITEFGGDEE 180
L++C +LL+ R N CGRILLFLA FP S+ SGLN E N+ E +F D
Sbjct: 186 LKICIELLK---RETNPDSCGRILLFLAYVFPLSDPSGLNTKGEHNIHPEEALDFQNDIM 242
Query: 181 MDVSSNETEETDTEE-------------------VDKVKIDFNFYKKFWSLQDYFRNPVQ 221
+V+ N + E + +D NFY++FW LQ F+NP Q
Sbjct: 243 NNVNGNNNNNVNNSEDTTTTSTAATATVITTTNGNNDTTVDRNFYRQFWGLQTVFQNPQQ 302
Query: 222 C----------------YNKVSWKMFTSYAETVLAAFKSY-KLDDV--QSSLNPS-GDYF 261
NK+ W+ F E V+ +F ++ LD++ SS NPS YF
Sbjct: 303 VLLNTTVTTGGTITNITLNKIKWESFIQSLELVIGSFSTHINLDELSQSSSNNPSKKHYF 362
Query: 262 AKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTE 321
KYLT+ L+ LQL D+ FR+ +L Q LI FQ T ++ Q+ +++ T
Sbjct: 363 TKYLTSSNLMKLQLKDSIFRKNILTQILITFQALDLT--NQKYPTIFNDLQKNIIQELTN 420
Query: 322 TVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEG-CPELKRPLTSITDEDKKDEPDA 380
+ ++ T P+GE+FS + ILK E++W WK + C +RP S P
Sbjct: 421 KCFKILSNTNPNGEYFSNCLSSILKREKNWIIWKRDNQCKPFERPPCS---------PIV 471
Query: 381 KKKK-------------TPELTKLWN 393
KKKK EL++LWN
Sbjct: 472 KKKKLFRKTALTKISLGNQELSRLWN 497
>gi|115388801|ref|XP_001211906.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114195990|gb|EAU37690.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 652
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 150/550 (27%), Positives = 232/550 (42%), Gaps = 118/550 (21%)
Query: 78 LPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL--LRMCNDLLRRLSR 135
L II + T+D C ++F Y+E ++ F + NL LR CN+LLRRLSR
Sbjct: 97 LDIISIFSDNGHQTIDGCRKVFDYLESRRERNTKKHF---KQKNLIILRSCNELLRRLSR 153
Query: 136 SQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEE 195
+++TVFCGR+ +FL + FP ++S +N+ E++ EN+T F D+ + E+ +TD E
Sbjct: 154 AEDTVFCGRVFIFLFQSFPLGDKSAVNLRGEYHTENVTTF--DDITKEPTKESADTDVEM 211
Query: 196 VD---------------------------------KV------------KIDFN-FYKKF 209
VD KV K+DF+ Y F
Sbjct: 212 VDEQSAPPAAEGQKDESEPQPAPTASQTPDSSKPPKVVISTSEDREEDKKVDFDTLYPMF 271
Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPS------------ 257
W LQ YF P + ++ + F S E L AFK D S+ N +
Sbjct: 272 WGLQAYFSAPTKVFDPQHFATFKSGLEATLDAFKKVNTDLENSNSNRASEELRKANKRKR 331
Query: 258 --------GDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE---- 304
G + KYLT++ L DL+++DT FRR+VL+Q LIL + S T K++ +
Sbjct: 332 GADGQEIGGSFNPKYLTSRDLFDLEINDTAFRRHVLVQALILLDFMLSLTPKAKAKLTDL 391
Query: 305 -------GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNE 357
G L + KW + + ++Q G+ + ++V +L +++W +WK E
Sbjct: 392 TNKSVLYGFVLNDEDAKWAIKVRKAIEEYLQQG-IGGKFYYRMVDTVLSRDKNWVRWKAE 450
Query: 358 GCPELKRPLTSIT------DEDKKDEPDAKKKKTP------ELTKLWNSKDNLEACKSAE 405
GCP ++RP S+T D K + + + +P + S +E K AE
Sbjct: 451 GCPLIERPPVSVTEYLGARDVATKVYANKRLRASPMGSLDLKFLSDGESLSGMERLKEAE 510
Query: 406 RDFTPSLESYF----------EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
R PS +S+ + A + D AAV + K+ WR LRL ++
Sbjct: 511 RFTVPSSDSFMMGIMDDDLDLDMAQTEEDKDAAVRSKDSKI------WRILRLSAKSK-- 562
Query: 456 FFLNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEK 515
L A +E E + G + A E T EE QE
Sbjct: 563 --LAAFDKIEDGKNLKILFEAPQPTEGTPQALEGTPQAAEPAKASTEETTAQEESQENGN 620
Query: 516 EEDWEAKADP 525
+ E+K DP
Sbjct: 621 VGNGESKEDP 630
>gi|121714347|ref|XP_001274784.1| nuclear matrix protein [Aspergillus clavatus NRRL 1]
gi|119402938|gb|EAW13358.1| nuclear matrix protein [Aspergillus clavatus NRRL 1]
Length = 669
Score = 161 bits (407), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 133/528 (25%), Positives = 227/528 (42%), Gaps = 123/528 (23%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ L+ ID I N + + + C L L+ + D
Sbjct: 66 AAVELAFREKFYSLLAITSIDEPSFIQIWNLLDIISIFSDNEQCEPGLIFWLIEEMLDSQ 125
Query: 91 TLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFG--------GDEEMDV-------------- 183
+FL + FP ++S +N+ E++ EN+T F G+++MDV
Sbjct: 180 FIFLFQSFPLGDKSAVNLRGEYHTENVTTFDETVQSNTQGNDDMDVEMSDGKPALAAPEG 239
Query: 184 -------------SSNETEETDTEEVDKVKIDFN-------------FYKKFWSLQDYFR 217
++T D ++ K+ + N Y FW LQ YF
Sbjct: 240 QGADRESQPPSSGRDSQTPAPDQAKIPKLVVSGNEDKSTDKDVDLSTLYPMFWGLQAYFS 299
Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DVQ 251
P + ++ + F E L FK+ + D ++
Sbjct: 300 APTKVFDPQHFASFKKGLENTLTVFKTIRTDLESQSASRTAEEIRKSNKRKRTTDGQEIA 359
Query: 252 SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE------ 304
SS NP KYLT++ L DL+++DT FRR+VL+Q LIL + S T K++ +
Sbjct: 360 SSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFMLSLTPKAKAKLADLTN 413
Query: 305 -----GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
G L + KW + + + Q G+ + ++V +L +++W +WK EGC
Sbjct: 414 KSVLYGFVLSEEDAKWASRMRKVIEEYL-QEGAGGKFYYRMVDTVLSRDKNWVRWKAEGC 472
Query: 360 PELKRPLTSITD----EDKKDEPDAKKKKTPELTKLWN--------SKDNLEACKSAERD 407
P +++P S+T+ + + A K+ P N S +LE K ER
Sbjct: 473 PLIEKPAVSVTEYLGARENATKTYANKRLRPSPMGSLNLNFLTEAESSVSLERLKEQERY 532
Query: 408 FTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
P+ +S+ + +D A E++ + ++ WR LRL ++
Sbjct: 533 TVPAADSFMMGLMDDELDIDTAQTKEDREMAMRAKASKTWRILRLSAK 580
>gi|449540150|gb|EMD31146.1| hypothetical protein CERSUDRAFT_120108 [Ceriporiopsis subvermispora
B]
Length = 757
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 109/356 (30%), Positives = 164/356 (46%), Gaps = 53/356 (14%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
+ + + L + D C +LP+ +LSD + T+ C +F ++E + +
Sbjct: 79 LRDRLDLILTFTEHDACEQSLPLQVLSDIMETQTISSCSHIFSWIEERADRLTKDMVPQK 138
Query: 118 CKN-NLLRMCNDLLRRLSRS-QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF 175
K LLR NDLLRRLS++ T+FCGRIL FL++ FP ERSG+N+ E+ +
Sbjct: 139 GKALVLLRTLNDLLRRLSKTGSTTMFCGRILTFLSRVFPLGERSGVNLRGEYG----PTW 194
Query: 176 GGDEEM-----------DVSSNETEETDTEEV--DKVKIDFNFYKKFWSLQDYFRNPVQC 222
G M D + E EE + +K K Y FWSLQ F P
Sbjct: 195 DGPGSMKPEKQEETLRVDEAKPEVEEDPVAKAAEEKRKRQEELYNTFWSLQLPFSRPSLF 254
Query: 223 YNKVSWKMFTSYAETVLAAFK--SYKLDDVQSSLNPSGD--------------------Y 260
+ + F VL K + K + S +G +
Sbjct: 255 VDPAVFPHFKEAVNKVLPVIKEATTKERALMGSKTHAGGGASLKRKREPEAEANAGKEYF 314
Query: 261 FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLEL---- 308
FAKYLT+ +LLDL+++DT+FRR L Q LI + FT K+ R L+L
Sbjct: 315 FAKYLTSPELLDLEIADTHFRRQFLFQLLISLNHLLAFTKATKATWVTPRNRSLQLDFTL 374
Query: 309 KSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
++D +WV +T ++QT P+G F++ +IL+ E++W +WKNE CP R
Sbjct: 375 EADDTQWVTETAAKALEELRQTAPNGRAFAETAVVILEREKNWVRWKNELCPAFDR 430
>gi|440475720|gb|ELQ44383.1| guanylate kinase [Magnaporthe oryzae Y34]
gi|440486031|gb|ELQ65934.1| guanylate kinase [Magnaporthe oryzae P131]
Length = 810
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 132/507 (26%), Positives = 230/507 (45%), Gaps = 103/507 (20%)
Query: 34 SALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKD---MCNSTLPIILLSDTFDMS 90
+ LE ++R +L+ ID +L LC+ + +C+ L L+ + D
Sbjct: 74 AVLETAVRDTFGRLLAESSIDAPDFVRVWNLLDLLCILEDLELCDPALLFWLIEELLDSQ 133
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
T+ C ++F Y+E + F S K +LR CN+LLRRLSR+++T FCGR+ +FL
Sbjct: 134 TISGCRKIFDYLESRRQKITSK-HFASKKLVILRTCNELLRRLSRARDTAFCGRVFIFLF 192
Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDK------------ 198
+ FP ++S +N+ EF+VEN+T + + D + TD E+D
Sbjct: 193 QSFPLGDKSSVNLRGEFHVENVTTY----DQDAAKRNGATTDQMEIDGQPATATPNEGKQ 248
Query: 199 --------------VKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF- 242
+D + Y FWSLQ F P +++ ++ F S ++ L+A
Sbjct: 249 ATLKHTSSDAKGQGPPVDMDALYPIFWSLQSSFSQPKNLFDRKNFAEFQSNLQSTLSALE 308
Query: 243 --------KSYKLDDVQSS-----LNPSGDYFA-----KYLTNQKLLDLQLSDTNFRRYV 284
+S + D + +P G+ A +YLT++ L +L+LSD FRRY+
Sbjct: 309 YAQGQASSRSKQYDGGSRAGQKRKRDPDGNELADSFNPRYLTSRDLFELELSDLTFRRYI 368
Query: 285 LLQFLILFQYFTSTVKSRGEGL--------------ELKSDQE-KWVKDTTETVYS-LIK 328
L+Q LI+ ++ S E L ++ SD+E KW + + + + L
Sbjct: 369 LVQALIIMEFLLSLSAEAKEKLAQVKVNNKSVTYSDQVLSDEEAKWAVNMKDKISAQLTS 428
Query: 329 QTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE-------------DKK 375
+ DG++F ++V +L +++W +WK GCP+++ P S+ E +KK
Sbjct: 429 ISVGDGQYFHRIVATVLARDKNWVRWKIGGCPQIELP--SVPPEQFVEAKKTVHRQVNKK 486
Query: 376 DEPDAKKKKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMD----------- 424
+A + + ++ L++ K ++R P L F+ I + D
Sbjct: 487 HRSNAGSALSLDFLSTSSNSSVLDSLKVSDRYQLPEL-GMFQRKIAEDDLEIDMPMNDKS 545
Query: 425 PAAAVEEQYKKVNDSNYAWRALRLLSR 451
A A+E + K +WRALR+ S+
Sbjct: 546 KAEAIEGKASK------SWRALRIASK 566
>gi|392597884|gb|EIW87206.1| UDP-glucose 4-epimerase [Coniophora puteana RWD-64-598 SS2]
Length = 1065
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 131/425 (30%), Positives = 203/425 (47%), Gaps = 68/425 (16%)
Query: 1 MLDSF-QLNFDKSQLN-VVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKV- 57
+LD F + F+++ L+ +VS++ YK + KS E ++Q + +L T +K
Sbjct: 13 LLDRFPKRPFEQAVLSKLVSEELRKYKGHTTDNVKSRWEFLLKQEVFRLAATEGTALKTD 72
Query: 58 -------IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWK 110
+ + + L + D C+ + P+ +L D + T+ C Q+F ++E
Sbjct: 73 ADAYYDSLRDRLDLVLTFTEHDACDVSFPLTVLQDVLEAQTIASCSQIFAWIETQQ---A 129
Query: 111 QQTFFMSCKNN----LLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNIIS 165
+ T M+ + LLR NDLLRRLS++ +T +FCGRIL FL+ FP ERSG+N+
Sbjct: 130 RLTTGMAPQKGKALVLLRTLNDLLRRLSKAGSTTLFCGRILTFLSGVFPLGERSGVNLRG 189
Query: 166 EFNV------ENITEFGGDEEM--DVS-SNETEETDTEEVD------KVKIDFNFYKKFW 210
E+ E DE DVS E D EVD K K FY FW
Sbjct: 190 EYGPIWDGPRETEAAKAADESRGEDVSVEGAKEGGDKMEVDESQNGEKAKKSDGFYNIFW 249
Query: 211 SLQDYFRNPVQCYNKVSWKMFTSYAETVL-----------------------AAFKSYKL 247
SLQ F P + + F + E VL A K +
Sbjct: 250 SLQLPFSRPPVFASTYKFTDFQAAVEKVLPVLREAAVKERALMGNRTNSGQPTALKRKRE 309
Query: 248 DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS------ 301
++ LN S +FAK+LT+ +LL+L+++DT+FRR +L Q LIL + K+
Sbjct: 310 PEMGEQLNNSDYFFAKFLTSPELLELEIADTHFRRQILFQLLILLHHLLQFTKAAKAHWA 369
Query: 302 --RGEGLE----LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWK 355
R L+ L+ D ++WV+DT + I+ T P G F+++V +IL+ E++W +WK
Sbjct: 370 TPRNRSLQMDFTLEPDDDQWVQDTMAKAVAEIRSTTPSGPMFAELVNVILEREKNWVKWK 429
Query: 356 NEGCP 360
NE CP
Sbjct: 430 NEMCP 434
>gi|346972039|gb|EGY15491.1| THO complex subunit 1 [Verticillium dahliae VdLs.17]
Length = 858
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 133/479 (27%), Positives = 220/479 (45%), Gaps = 103/479 (21%)
Query: 60 NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTF 114
N + L C+ L ++ + D T+ C ++F ++E + + +KQ+
Sbjct: 109 NLFDIVSILSDNGSCDPALLFWMVEELLDSQTIAGCRKIFDFLESRRERITSSHFKQKQL 168
Query: 115 FMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITE 174
+LR CN+LLRRLSR+++T FCGR+ +F+ + FP +RS +N+ E++VEN+T
Sbjct: 169 V------ILRTCNELLRRLSRAEDTAFCGRVFIFMFQSFPLGDRSSVNLRGEYHVENVTA 222
Query: 175 FGGDEEMDVSSNETE-ETDTEEVDK-----------------VKIDFNFYKKFWSLQDYF 216
+ D S+ T+ + D EVDK V D Y FWSLQ+ F
Sbjct: 223 Y------DQSTAPTDPDADKMEVDKDAAAVPDTNKQSSQDGAVNAD-ALYPVFWSLQESF 275
Query: 217 RNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP------------------SG 258
P + ++ + + F E + AFK + + Q S P G
Sbjct: 276 SQPKKLFDMTNLEGFKQGLEATVKAFKVVQKE--QGSRPPKAADSGTGGTKRKRNHDDQG 333
Query: 259 D----YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS----------TVKSRGE 304
D + KYLT+Q L +L++SD +FRR +L+Q I+ ++ S T+K++ +
Sbjct: 334 DQSNAFNPKYLTSQDLFELEISDLSFRRNILVQAYIILEFLLSLSAKAKEKLATIKTQNK 393
Query: 305 GL-----ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
+ L + KW D E V +KQ DG H+ ++V+ +L + +W +WK EGC
Sbjct: 394 SVMYMDQTLSEEDTKWASDMKEAVEDGLKQG-SDGPHYHRMVETVLVRDRNWIRWKMEGC 452
Query: 360 PELK-RPLTSITDEDKKDE---PDAKKKKTPELTKLW-------NSKDNLEACKSAERDF 408
P ++ PL+ T +DE K+ P + L + + L+ ++ +R
Sbjct: 453 PPIELPPLSPETWSKAQDELYSITRNKRMKPPMGSLPLDFMQDEDDEKALDRLRAPDRYE 512
Query: 409 TPSLESY--------FEEAIQQMD--PAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
P LES+ FE + D AAA+E + K +WRALR+ +R F
Sbjct: 513 LPELESFKRKIADDDFEIEMPTNDETKAAAIESKASK------SWRALRIAARSRLGLF 565
>gi|342873471|gb|EGU75639.1| hypothetical protein FOXB_13850 [Fusarium oxysporum Fo5176]
Length = 822
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 113/388 (29%), Positives = 193/388 (49%), Gaps = 70/388 (18%)
Query: 36 LEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMST 91
+E + R +LI T ID +KV N + + L C+ L L + D T
Sbjct: 81 IETAARNIFSQLIATTTIDSPDYVKVW-NLLDILSILSDDGQCDPALLFWLAEELLDSQT 139
Query: 92 LDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRIL 146
+ C ++F ++E + N +KQ+ +LR CN+LLRRLSR+++T FCGR+
Sbjct: 140 IAGCRKIFDFLESRRERITANHFKQKQLI------ILRTCNELLRRLSRAEDTAFCGRVF 193
Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEF---GGDEEMDVSSNETEE------------- 190
+F+ + FP ++S +N+ E++VEN+T + ++E + +E EE
Sbjct: 194 IFMFQSFPLGDKSSVNLRGEYHVENVTTYEATRAEDESKMVVDEPEEQLKEQAESKSTPK 253
Query: 191 -TDTEEVDKVK--IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
D ++ DK K Y FWSLQ+YF P + + + TS+ E + A K ++
Sbjct: 254 SADAKKADKEKPLSTDELYPLFWSLQEYFSQPKKLFETTN---LTSFKEGLAATMKVFQT 310
Query: 248 --DDVQSSLN----------PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYF 295
+D + SL S + KYLT++ L DL++SD +FRR++L+Q LI+ +
Sbjct: 311 VHNDSRRSLKRKRESGEEDESSNTFNPKYLTSKDLFDLEISDLSFRRHILVQALIIMDFL 370
Query: 296 TS-TVKSRGEGLE------------LKSDQ------EKWVKDTTETVYSLIKQTPPDGEH 336
S ++++R E + SDQ +W D +T+ ++Q DG +
Sbjct: 371 ISLSIQARERLTEALPVNASVNKAVMYSDQVLSDEDAEWASDMKKTIADYLRQG-ADGPY 429
Query: 337 FSQVVKLILKGEEHWNQWKNEGCPELKR 364
F ++V+ +L +++W +WK EGC +KR
Sbjct: 430 FYRMVETVLARDKNWVRWKIEGCHPIKR 457
>gi|406864828|gb|EKD17871.1| guanylate kinase [Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
Length = 862
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 137/508 (26%), Positives = 232/508 (45%), Gaps = 96/508 (18%)
Query: 32 KKSALEQSIRQYLLKLIK-TPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTF 87
KK A+E + R+ L+ T ID I N + + L D + + L+ + F
Sbjct: 59 KKHAIETAARESFNNLLACTTTIDSPSFVNIWNLLDIMSILSDDDKTEPAVMVWLVEELF 118
Query: 88 DMSTLDKCEQLFYYVEVNVN--IWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D T+ C ++F Y+E I K +F S K +LR CN+LLRRLSR+++T FCGR+
Sbjct: 119 DSQTVAGCRKIFDYLESRRERIIAK---YFDSKKLVILRSCNELLRRLSRAEDTAFCGRV 175
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDEEMDV--SSNETEETDTEE 195
+FL + FP ++S +N+ E++ EN+T F E+MDV SS T++
Sbjct: 176 FIFLFQSFPLGDKSSVNLRGEYHTENVTTFDVPPPKEVADPEKMDVDPSSESPSGGGTQK 235
Query: 196 VDK-----------------VKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETV 238
+ V + Y FWSLQ F P ++ ++ F + E
Sbjct: 236 ASQNSSSAPQKGVSFSRSENVMAAEDLYVAFWSLQQSFSQPKSLFDNTNFSSFKAGLEAT 295
Query: 239 LAAFKSYKLD--DVQSSLNPSGD-----------------YFAKYLTNQKLLDLQLSDTN 279
+A FKS + D Q+ N G Y KYLT++ L +L++SD +
Sbjct: 296 MAMFKSVQSDVSGRQTKSNEEGKRGTKRKHHQGDDDLASTYNPKYLTSRDLFELEISDLS 355
Query: 280 FRRYVLLQFLILFQYFTSTVKSRGEGLE-----------------LKSDQEKWVKDTTET 322
FRR++++Q LI+ ++ S E L L + KW + ++
Sbjct: 356 FRRHIIVQVLIVIEFLLSLSAKAKEKLSKALPDGTNKSVTYVDQVLSEEDTKWATEMKKS 415
Query: 323 VYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDED--------K 374
+ +KQ +G +F ++V+ +L +++W +WK E CP + +P SIT ++ +
Sbjct: 416 MAEYLKQG-YEGAYFHRMVETVLSRDKNWARWKIENCPSIAKP--SITPQEYASAKVSAR 472
Query: 375 KDEPDAKKKKTPELTKL-------WNSKDNLEACKSAERDFTPSLESYFEEAIQ--QMDP 425
K + + P + L +++ L+ K R PS++S F+ +I+ MD
Sbjct: 473 KATTTKRPRPNPPIGSLDLQFLSEGDNRSGLDRLKDPTRYQLPSVKS-FKSSIELDDMDI 531
Query: 426 AAAVEEQYKKVN---DSNYAWRALRLLS 450
A +E+ K ++ +WRA R+ S
Sbjct: 532 EMAQDEESKNAAMELKASKSWRAQRIAS 559
>gi|302904097|ref|XP_003049002.1| hypothetical protein NECHADRAFT_46674 [Nectria haematococca mpVI
77-13-4]
gi|256729936|gb|EEU43289.1| hypothetical protein NECHADRAFT_46674 [Nectria haematococca mpVI
77-13-4]
Length = 800
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 121/413 (29%), Positives = 194/413 (46%), Gaps = 67/413 (16%)
Query: 8 NFDKSQLNVVSDKYALYKPTCDYDKKSA-------LEQSIRQYLLKLIKTPDID----IK 56
+FD V S L + D D K A +E + R LI T ID +K
Sbjct: 45 DFDDLSGRVSSTAAQLPQLEGDKDAKEAKSRQYGIIETAARDIFSNLISTTTIDSPDFVK 104
Query: 57 VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQ 111
V N + + L C+ L L + D T+ C ++F ++E + +KQ
Sbjct: 105 VW-NLLDILSILSDDGQCDPALLFWLAEELLDSQTISGCRKIFDFLESRRERITAKHFKQ 163
Query: 112 QTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVEN 171
+ +LR CN+LLRRLSR+++T FCGR+ +F+ + FP ++S +N+ E++VEN
Sbjct: 164 KQLI------ILRSCNELLRRLSRAEDTAFCGRVFIFMFQSFPLGDKSSVNLRGEYHVEN 217
Query: 172 ITEF-----GGDEEMDV------SSNETEETDTEEVDKVK--IDFNFYKKFWSLQDYFRN 218
IT + D MDV S + + D ++ DK K Y FWSLQ+ F
Sbjct: 218 ITTYETTIAENDSNMDVDTPTGGSKSTSRSNDAKKGDKEKPLSTDALYPLFWSLQESFSQ 277
Query: 219 PVQCYNKVSWKMFTSYAETVLAAFKSYKL--DDVQSSLNPSGD----------YFAKYLT 266
P + + + S+ E++ A K ++ +D + SL D + KYLT
Sbjct: 278 PKRLFETAN---LASFKESLGATMKFFQTIHNDSRRSLKRKRDSGEEDENSNAFNPKYLT 334
Query: 267 NQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLE---------------LKSD 311
++ L DL++SD +FRR+VL+Q LI+ + S E L L +
Sbjct: 335 SKDLFDLEISDLSFRRHVLVQALIIMDFLLSLSSPAREKLSSSLATNKAVMYGDQTLSEE 394
Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
KW D +T+ ++Q DG +F ++V+ +L +++W +WK EGC +KR
Sbjct: 395 DTKWANDMKKTIADYLRQG-ADGPYFYRMVETVLARDKNWVRWKIEGCHAIKR 446
>gi|392570571|gb|EIW63743.1| UDP-glucose 4-epimerase [Trametes versicolor FP-101664 SS1]
Length = 1095
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 112/359 (31%), Positives = 167/359 (46%), Gaps = 55/359 (15%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
+ + + L + D C+ P L D + T+ C +F ++E + + T M
Sbjct: 79 LSDRLDLILTFTEHDACDQIFPFNTLQDLLETQTIASCSHIFSWIEQRAD---RLTAGMV 135
Query: 118 CKNN----LLRMCNDLLRRLSR-SQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV--E 170
+ LLR NDLLRRLS+ T FCGRIL FL++ FP ERSG+N+ E+ E
Sbjct: 136 PQKGKALVLLRTLNDLLRRLSKMGATTQFCGRILTFLSRVFPLGERSGVNLRGEYGPMWE 195
Query: 171 NITEFGG---DEE---MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYN 224
G DEE M+V + E+ E K K +F+ FWSLQ F P
Sbjct: 196 GPGAKGHERKDEEANQMEVDETKEEQASKEAERKRKEKDDFFYTFWSLQLPFSRPHLFSE 255
Query: 225 KVSWKMFTSYAETVLAAFKSYKLDD----VQSSLNPSGD--------------------- 259
++ F + + VL K + SL+ +
Sbjct: 256 TRTFSAFKNAVDKVLPVIKEATAKERALMGNKSLHATAGASSQLKRKRAPAPAAESPATS 315
Query: 260 --YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLEL- 308
YFAKYLT+ +LLDL++SDT+FRR +L Q LI+ + FT K+ R L+L
Sbjct: 316 EYYFAKYLTSPELLDLEISDTHFRRQLLFQLLIVLNHLLNFTKAAKATWYTPRNRSLQLD 375
Query: 309 ---KSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
+ +WV+DT ++QT P G F++ V+++L+ E++W +WKNE CP R
Sbjct: 376 FTLEPTDAQWVQDTIARAVEELRQTSPHGRAFAEAVQVMLEREKNWVRWKNELCPPFDR 434
>gi|340905435|gb|EGS17803.1| hypothetical protein CTHT_0071520 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 781
Score = 158 bits (400), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 122/491 (24%), Positives = 225/491 (45%), Gaps = 84/491 (17%)
Query: 34 SALEQSIRQYLLKLIKTPDIDIK---VIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+ +E ++R LI T I+ + N + + L ++C+ L ++ + D
Sbjct: 74 AIIETAVRDTFKSLIATTPIEAPEFVRVWNLLDILSFLSDSELCDPALLFWIVEELLDSQ 133
Query: 91 TLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+ C ++F ++E + KQ+ +LR CN+LLRRLSR+ + FCGR+
Sbjct: 134 TIAGCRKVFDFLESRRERITAKHLKQKQLV------ILRTCNELLRRLSRALDPAFCGRV 187
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGD------EEMDVSSNETEETDTEEVDKV 199
+F+ + FP ++S +N+ E+++EN+T + D ++MDV S + D + +D
Sbjct: 188 FIFMFQTFPLGDKSSVNLRGEYHLENVTTWDRDIPAQDGDKMDVDSEAKPQLDVKALDPD 247
Query: 200 KIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYK------------- 246
+ Y FW+LQ+ F P + + + F + E +A FKS K
Sbjct: 248 AL----YPIFWALQESFNQPKRLFEPSNLAAFKAGLEATMATFKSIKPEQPSRARERPEK 303
Query: 247 -LDDVQSSLNPSGD---------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT 296
++++ SL D + KYLT++ L L++SD FRR +L+Q LI+ ++
Sbjct: 304 PPEEIKHSLKKKRDDLDEELASGFNPKYLTSRDLFKLEISDLAFRRNILVQALIIMEFLL 363
Query: 297 STVKSRGEGL---------------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVV 341
S E L +L + KWV D E + +KQ P+G +F+++V
Sbjct: 364 SLSPKAKEKLTSVKQPNKSVMYLDYQLSEEDTKWVLDMKERIVDYLKQG-PEGPYFNRMV 422
Query: 342 KLILKGEEHWNQWKNEGCPELKRPLTS--ITDEDKKDEPDAKKKKTPELTKLW------- 392
+ +L +++W +WK E CP ++ P S I DE + K P L
Sbjct: 423 ETVLSRDKNWVRWKIENCPPIELPPLSPVIFDEARSTISKLATTKRPRSAPLGSLSLDFL 482
Query: 393 --NSKDNLEACKSAERDFTP---------SLESYFEEAIQQMDPAAAVEEQYKKV-NDSN 440
+ +D+ ++ ++ P +L+ EE ++D + E + + + ++
Sbjct: 483 NDDEEDDDSGARAMQKLKDPTRYRLPELSTLKRGIEEDDLEIDMPMSEESKARAIEGKAS 542
Query: 441 YAWRALRLLSR 451
WRALR+ ++
Sbjct: 543 KTWRALRIAAK 553
>gi|327298209|ref|XP_003233798.1| nuclear matrix protein [Trichophyton rubrum CBS 118892]
gi|326463976|gb|EGD89429.1| nuclear matrix protein [Trichophyton rubrum CBS 118892]
Length = 634
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 136/536 (25%), Positives = 235/536 (43%), Gaps = 114/536 (21%)
Query: 16 VVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKD 72
V++ Y + + +A+E + R+ L+ T ID I N + + +
Sbjct: 48 VLAGDYGVSVGVNQQTQYAAVETAFREIFYDLLATTSIDEPSFMQIWNLLDIVSIFSDNE 107
Query: 73 MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCN 127
C L L+ + D T+D C ++F Y+E +KQ++ +LR CN
Sbjct: 108 KCEPGLIFWLIEELLDSQTIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCN 161
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNE 187
+LLRRLSR+++TVFCGR+ +FL + FP ++S +N+ E++ EN+T F ++M S+E
Sbjct: 162 ELLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSSVNLRGEYHTENVTTFDVQQKMQAESDE 221
Query: 188 TEETDTEEVD--------------------------------------KVKIDFNFYKKF 209
+ D +E + ++++D Y F
Sbjct: 222 VMDIDIQENETPVSNAKSTEQEPTTAQTPLESTNDATTQRAKASSIQPELELD-TLYAAF 280
Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK--SYKLD------------------- 248
WSLQ+ F P + ++ + S E L++F+ S KL+
Sbjct: 281 WSLQESFSKPTRLFDTTHFASLKSGLEATLSSFRKVSSKLEVRSTTKSSDELRRGTKRKR 340
Query: 249 -----DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF----------- 292
D+ +S NP KYLT++ L +L+++D +FRR++L+Q LIL
Sbjct: 341 GEGGSDLPNSFNP------KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPRSK 394
Query: 293 -QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
+ ST KS G L + KW ++ S + Q DG+ + ++V +L +++W
Sbjct: 395 AKLADSTNKSVLYGYVLSDEDAKWATQMRSSIASYL-QKGADGKFYFRMVDTVLTRDKNW 453
Query: 352 NQWKNEGCPELKRPLTSITDE------DKKDEPDAKKKKTP------ELTKLWNSKDNLE 399
+WK EGCP ++RP + K + + + TP + ++ +L
Sbjct: 454 VRWKAEGCPPIERPPVPVQSHLDTQSSAAKITANRRLRSTPMGSLDLGFLSVESNLGDLN 513
Query: 400 ACKSAERDFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
K ER TPSL+ Y +D A EE+ + + ++ WR LR+ SR
Sbjct: 514 RLKEPERFSTPSLDFYMSGITDDDFNIDMAKDDEEKEQSIKAKASKTWRTLRIASR 569
>gi|320036048|gb|EFW17988.1| nuclear matrix protein [Coccidioides posadasii str. Silveira]
Length = 608
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 103/403 (25%), Positives = 185/403 (45%), Gaps = 78/403 (19%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ L+ T I+ + N + + + C L L+ + D
Sbjct: 66 AAVETAFREKFYHLLATTSINKPEFVHMWNLLDVVSIFSDNERCEPGLIFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF------------------------------ 175
+FL + FP ++S +N+ E++ EN+T F
Sbjct: 180 FIFLFQSFPLGDKSSVNLRGEYHTENVTTFDEPPKHIAANEDADDVEMKDAEAQKPSDNS 239
Query: 176 ----------GGDEEMDVSSNETEETDTEEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYN 224
GG ++ ++++ + + ID + Y FW LQ F P + ++
Sbjct: 240 SKTDAAASSQGGSVSPEIDGQRSKQSTDHQAEDTSIDMDALYPVFWGLQANFSAPTRLFD 299
Query: 225 KVSWKMFTSYAETVLAAFKSYKLDDV----QSSLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
+ F E+ +++F+ K + ++ NP KYLT++ L +L+++D F
Sbjct: 300 HEHFASFKKGLESTISSFQKRKRGETGPETANTFNP------KYLTSRDLFELEVNDVAF 353
Query: 281 RRYVLLQFLILFQYFTS------------TVKSRGEGLELKSDQEKWVKDTTETVYSLIK 328
RR++L+Q LIL + S T KS G L + KW ++ S ++
Sbjct: 354 RRHILVQSLILLDFLISLSPKAKAKLADATNKSVLYGYVLSDEDAKWANQMKSSIASYLQ 413
Query: 329 QTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD 371
Q DG+ + ++V +L +++W +WK EGCP +++P I D
Sbjct: 414 QGL-DGKFYYRMVDTVLTRDKNWARWKAEGCPPIEKPPIQIQD 455
>gi|302409216|ref|XP_003002442.1| guanylate kinase [Verticillium albo-atrum VaMs.102]
gi|261358475|gb|EEY20903.1| guanylate kinase [Verticillium albo-atrum VaMs.102]
Length = 854
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 126/474 (26%), Positives = 216/474 (45%), Gaps = 93/474 (19%)
Query: 60 NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTF 114
N + L C+ L ++ + D T+ C ++F ++E + + +KQ+
Sbjct: 109 NLFDIVSILSDNGSCDPALLFWMVEELLDSQTIAGCRKIFDFLESRRERITSSHFKQKQL 168
Query: 115 FMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITE 174
+LR CN+LLRRLSR+++T FCGR+ +F+ + FP +RS +N+ E++VEN+T
Sbjct: 169 V------ILRTCNELLRRLSRAEDTAFCGRVFIFMFQSFPLGDRSSVNLRGEYHVENVTA 222
Query: 175 FGGDEEMDVSSNETEETDTEEVDKVKIDFN-------------FYKKFWSLQDYFRNPVQ 221
+ D+ + ++ + E+ D N Y FWSLQ+ F P +
Sbjct: 223 Y--DQSAAPMDPDADKMEVEKDAAAGPDANKQSSQDGAVNADALYPVFWSLQESFSQPKK 280
Query: 222 CYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP------------------SGD---- 259
++ + + F E + AFK + + Q S P GD
Sbjct: 281 LFDMTNLEGFKQGLEATVKAFKVVQKE--QGSRPPKAADSAVGGTKRKRNHDDQGDQSNA 338
Query: 260 YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS----------TVKSRGEGL--- 306
+ KYLT+Q L +L++SD +FRR +L+Q I+ ++ S T+K++ + +
Sbjct: 339 FNPKYLTSQDLFELEISDLSFRRNILVQAYIILEFLLSLSAKAKEKLATIKTQNKSVMYM 398
Query: 307 --ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELK- 363
L + KW D E V +KQ DG H+ ++V+ +L + +W +WK EGCP ++
Sbjct: 399 DQTLSEEDAKWASDMKEAVEDGLKQG-SDGPHYHRMVETVLVRDRNWIRWKMEGCPPIEL 457
Query: 364 RPLTSITDEDKKDE---PDAKKKKTPELTKLW-------NSKDNLEACKSAERDFTPSLE 413
PL+ T +DE K+ P + L + + L+ ++ +R P L
Sbjct: 458 PPLSPETWSKAQDELYSITRNKRMKPPMGSLPLDFMQDEDDEKALDKLRAPDRYELPELG 517
Query: 414 SY--------FEEAIQQMD--PAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
S+ FE + D AAA+E + K +WRALR+ +R F
Sbjct: 518 SFRRKIADDDFEIEMPTNDETKAAAIESKASK------SWRALRIAARSRLGLF 565
>gi|322693773|gb|EFY85622.1| nuclear matrix protein [Metarhizium acridum CQMa 102]
Length = 864
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 135/488 (27%), Positives = 224/488 (45%), Gaps = 100/488 (20%)
Query: 48 IKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE---- 103
I++PD +K+ N + + L C+ L L+ + D T+ C +F Y+E
Sbjct: 103 IESPDF-VKMW-NLLDILSILSDDGQCDPALLFWLVEELLDSQTISGCRIVFDYLESRRE 160
Query: 104 -VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLN 162
+ +KQ+ +LR CN+LLRRLSR+++T FCGR+ +FL + FP +RS +N
Sbjct: 161 RITSKHFKQKHLV------ILRSCNELLRRLSRAEDTAFCGRVYIFLFQSFPLGDRSSVN 214
Query: 163 IISEFNVENITEFGGDEEMDVSSNETE------------ETDTEEVDKVKIDFN------ 204
+ E++VEN+T F G E D E++ TDT+ +K + F+
Sbjct: 215 LRGEYHVENVTAFEGSAEEDEQQRESKMEVDTPAETPKSSTDTKASNK-GVSFDSKNSEK 273
Query: 205 ------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQS-SL-NP 256
Y FWSLQ+ F P++ ++ + F S E + AF++ ++V S SL NP
Sbjct: 274 PYDPDRLYPLFWSLQESFSQPLKLFDSTHFNKFKSRLEVTIKAFQAIHQNEVPSKSLENP 333
Query: 257 SGD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTST-- 298
+ + KYLT++ L +L++SD +FRR++L+Q LI+ + S
Sbjct: 334 RRNLKRKRDDEGSEDAPEAFNPKYLTSKDLFELEISDLSFRRHILVQALIITDFLLSLSM 393
Query: 299 -VKSRGEGL-------------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLI 344
+ + GL +L + KW + T+ +K+ DG +F ++V+ +
Sbjct: 394 ETREKLAGLNTPPPNKSVVYNDQLSEENTKWATEMKLTISDYLKRG-IDGPYFYRMVETV 452
Query: 345 LKGEEHWNQWKNEGCPELKRPLTS------ITDEDKKDEPDAKKKKTPELT-KLWNSKDN 397
L +++W WK CP ++R S +K + + TP + L +DN
Sbjct: 453 LARDKNWVFWKMASCPPIQRDPVSPELFVEARTTAQKLATSKRLRPTPMGSVSLEFLRDN 512
Query: 398 LEAC----KSAERDFTPSLESY----------FEEAIQQMDPAAAVEEQYKKVNDSNYAW 443
E C K+ ER P LES+ E AAA+ + K +W
Sbjct: 513 DECCLDQLKTPERYQIPELESFQRKIADDDFEIEMPTNDQTKAAAIAGKASK------SW 566
Query: 444 RALRLLSR 451
RALR+ +R
Sbjct: 567 RALRIAAR 574
>gi|358383603|gb|EHK21267.1| hypothetical protein TRIVIDRAFT_113560, partial [Trichoderma virens
Gv29-8]
Length = 744
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 133/501 (26%), Positives = 222/501 (44%), Gaps = 101/501 (20%)
Query: 36 LEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMST 91
+E + R + KLI T ID +K+ N + + L C+ L L+ + D T
Sbjct: 82 VETAARDFFSKLIATTSIDSPEFVKMW-NLLDILSILSDDGQCDPALLFWLVEELLDSQT 140
Query: 92 LDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRIL 146
+ C ++F ++E + +KQ+ +LR CN+LLRRLSR+++T FCGR+
Sbjct: 141 ISGCRKIFDFLESRRERITAKHFKQKNLV------ILRSCNELLRRLSRAEDTAFCGRVF 194
Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDT------------- 193
+F+ + FP +RS +N+ E++VEN+T F EE D DT
Sbjct: 195 IFMFQSFPLGDRSSVNLRGEYHVENVTTFETTEEDDSKMEVDTHADTPKEQGDAKTPAKN 254
Query: 194 --EEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ 251
++ K Y FWSLQ+ F P++ ++ + F + E+ + AF++ D +
Sbjct: 255 VAQDSSKALDPDTLYPMFWSLQESFSQPLRLFDVGHFAKFKTSLESTITAFQAIHEDSIY 314
Query: 252 SS--------------------LNPSGDYFAKYLTNQKLLDLQ----LSDTNFRRYVLLQ 287
SS + P + KYLT++ L L+ +SD +FRR+VL+Q
Sbjct: 315 SSKSMENLKRTLKRKRDDDGTEMLPEA-FNPKYLTSKDLFQLEASITISDLSFRRHVLVQ 373
Query: 288 FLILFQYFTSTVKSRGEGL--------------ELKSDQEKWVKDTTETVYSLIKQTPPD 333
LI+ + S K E +L + KW D + + +KQ D
Sbjct: 374 ALIITDFLLSLSKEEREKRTNINVSNKSVIYPDQLNEENTKWATDMKKVISDYLKQG-SD 432
Query: 334 GEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTSIT-----------DEDKKDEPDAK 381
G +F ++V+ +L +++W WK CP ++R P+++ + +K+ P+
Sbjct: 433 GPYFFRMVETVLARDKNWVFWKMASCPPIQRDPVSAASFVESKEAAQKMATNKRLRPNPL 492
Query: 382 KKKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMD-----------PAAAVE 430
+ E + +E KS+ER P L+ F+ I D AAAV
Sbjct: 493 NAVSLEFLANEDEGSAMEKLKSSERYQLPELDE-FKNKIADDDFEIDMPTNNETKAAAVA 551
Query: 431 EQYKKVNDSNYAWRALRLLSR 451
+ K +WRALR+ SR
Sbjct: 552 GKASK------SWRALRIASR 566
>gi|325187940|emb|CCA22484.1| THO complex subunit putative [Albugo laibachii Nc14]
Length = 586
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 131/468 (27%), Positives = 204/468 (43%), Gaps = 80/468 (17%)
Query: 54 DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
D I + LC++ + ++ P +L D + + CE L+ +EV
Sbjct: 48 DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 107
Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
FF + LLRMCN +LRRLS++ NTVFCGRIL+FL+ F +ERS +N+ S+
Sbjct: 108 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAVNLHSK- 166
Query: 168 NVENITEFGGD-----EEMDVSSNETEETDT-------EEVDKVKIDFNFYKKFWSLQDY 215
VEN+T F E D +N E D ++ +ID+ Y FW +Q +
Sbjct: 167 -VENLTIFEDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQRF 225
Query: 216 FRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNPS 257
FR+ + + + K F VL AF++ D V S+ S
Sbjct: 226 FRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKMS 285
Query: 258 GD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
D + KYLTN +L LQL D R VL Q LIL + V++
Sbjct: 286 IDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHL---VRA 342
Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
+ + S Q+ + + + V L+K+TPPDG +S++V +L+ E +W +WK E CP
Sbjct: 343 KS-TVSQDSTQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCPS 401
Query: 362 L--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----LE 399
KR + T+ + + +KK E + ++ S + LE
Sbjct: 402 YEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLLE 461
Query: 400 ACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
K R +L + + DPA+ ++++Y D Y WR LR
Sbjct: 462 KIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 509
>gi|398405762|ref|XP_003854347.1| hypothetical protein MYCGRDRAFT_70063 [Zymoseptoria tritici IPO323]
gi|339474230|gb|EGP89323.1| hypothetical protein MYCGRDRAFT_70063 [Zymoseptoria tritici IPO323]
Length = 646
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 138/508 (27%), Positives = 221/508 (43%), Gaps = 98/508 (19%)
Query: 32 KKSALEQSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFD 88
K SA+E ++R+ LI + I + N + + + + C++ L +LL + D
Sbjct: 64 KFSAVETAVREVFYSLITSTSISDPAFVGVWNLLDIVLICGDRGRCDAALICLLLEELLD 123
Query: 89 MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL--LRMCNDLLRRLSRSQNTVFCGRIL 146
T D C +F Y++ + Q+ F K NL LR CN+LLRRLSR+++ +FCGR+
Sbjct: 124 SQTTDGCRIVFDYLDSRRDRLAQKDFH---KKNLIFLRSCNELLRRLSRAEDAIFCGRVF 180
Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEF---------GGDEEMDVSSNETEETDTEEVD 197
FL + FP ++S +N+ EF+VEN T+F G E M++ S EE E
Sbjct: 181 FFLFQTFPLGDKSSVNLRGEFHVENTTKFDDSEVVPAEGNGEHMEIDSQPKEEPIKEHTP 240
Query: 198 KVKIDFN-------------------------FYKKFWSLQDYFRNPVQCYNKVSWKMFT 232
+ Y FW LQ F +P + ++ + F
Sbjct: 241 APAVPTKPGSKAVPIKAPHKKIPEEEALPTSELYPIFWRLQHDFSDPTRLFDTAKFTAFK 300
Query: 233 SYAETVLAAFK------------------------SYKLDDVQSSLNPSGDYFAKYLTNQ 268
+ + FK + +DD SS + +Y KYLT++
Sbjct: 301 KGLASTIVKFKKTPTVVQTKAGEEEKRGTKRKIGENGTVDD--SSDHLMDNYNPKYLTSK 358
Query: 269 KLLDLQLSDTNFRRYVLLQFLILFQY--------------FTSTVKSRGEGLELKSDQEK 314
L DL+LSD F+R++L+Q LIL + T+T KS L +
Sbjct: 359 DLFDLELSDLAFQRHILVQALILIDFLLSLTDKAKKRLASLTTTNKSMLYSFALNEGDTQ 418
Query: 315 WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE-- 372
W T T+ S I T +G + ++V+ +L +++W +WK E CP + R S E
Sbjct: 419 WATTTRSTITSYISST-AEGRQYCRMVETVLARDKNWVRWKVESCPSIVRGPVSTEQEMA 477
Query: 373 DKKDEPDAKK-KKTPE-------LTKL-WNSKDNLEACKSAERDFTPSLESYFEEA-IQQ 422
+K A + +K PE L+ L S LEA K R P++E ++ +
Sbjct: 478 ARKGAAQATRPRKIPEKPMGAMNLSFLDEGSGGGLEALKDPARYTAPTIEQLVDKVKTDK 537
Query: 423 MDPAAAVEEQYKKVNDS---NYAWRALR 447
+D A +++ K ++ N WRALR
Sbjct: 538 LDADFAADDEEKASFENAIQNAKWRALR 565
>gi|340522223|gb|EGR52456.1| predicted protein [Trichoderma reesei QM6a]
Length = 807
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 137/497 (27%), Positives = 223/497 (44%), Gaps = 94/497 (18%)
Query: 36 LEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMST 91
+E + R + KLI T ID +K+ N + + L C+ L L+ + D T
Sbjct: 80 VETAARDFFSKLIATTSIDSPEFVKMW-NLLDILSILSDDGQCDPALLFWLVEELLDSQT 138
Query: 92 LDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRIL 146
+ C +F ++E + +KQ+ +LR CN+LLRRLSR+++T FCGR+
Sbjct: 139 VAGCRIIFDFLESRRERITAKHFKQKNLV------ILRSCNELLRRLSRAEDTAFCGRVF 192
Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEF---GGDEEMDVSSNETEETDT---------- 193
+F+ + FP +RS +N+ E++VENIT + D +M++ + +TDT
Sbjct: 193 IFMFQSFPLGDRSSVNLRGEYHVENITNYERLDDDSKMEIDT----QTDTPKEQGDAKAA 248
Query: 194 -----EEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
++ K Y FWSLQ+ F P++ ++ + F S E + AF++ D
Sbjct: 249 TKNGAQDTSKALDPDTLYPMFWSLQESFSQPLKLFDTGHFTKFKSSLEATITAFQAIHDD 308
Query: 249 DVQSS---------LNPSGD----------YFAKYLTNQKLLDLQLS---DTNFRRYVLL 286
+ SS L D + KYLT++ L L+ S D +FRR+VL+
Sbjct: 309 SISSSKSMENLKRTLKRKRDDDGTETLPEAFNPKYLTSKDLFQLEASIAIDLSFRRHVLV 368
Query: 287 QFLILFQYFTSTVKSRGEGL--------------ELKSDQEKWVKDTTETVYSLIKQTPP 332
Q LI+ + S K E +L + KW D + + +KQ
Sbjct: 369 QALIITDFLLSLSKEEREKRTNINISNKSVIYPDQLSEENTKWATDMKKVISDYLKQG-S 427
Query: 333 DGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTS---ITDEDKKDEPDAKKKKTPE- 387
DG +F ++V+ +L +++W WK CP +KR P+++ + ++ + K+ P
Sbjct: 428 DGPYFFRMVETVLARDKNWVFWKMASCPPIKRDPVSAESFVEAKEAAQKMATSKRLRPNP 487
Query: 388 -----LTKLWNSKDNLEA---CKSAERDFTPSLESYFEEAIQQMD-----PAAAVEEQYK 434
L L N D EA K ER P LE F+ I D P A +
Sbjct: 488 LNAVPLDFLTNGDDEGEAMEQLKDPERHQLPELE-VFKSKIADDDFEIDMPTNAQTKATA 546
Query: 435 KVNDSNYAWRALRLLSR 451
++ +WRALR+ SR
Sbjct: 547 VAAKASKSWRALRIASR 563
>gi|358370613|dbj|GAA87224.1| THO complex subunit Tho1 [Aspergillus kawachii IFO 4308]
Length = 671
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 153/615 (24%), Positives = 256/615 (41%), Gaps = 140/615 (22%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ L+ + I+ I N + + + C L L+ + D
Sbjct: 66 AAVEIAFREKFYSLLASTSIEDPSFIRIWNLLDIISIFSDNEQCEPGLIFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENIT---EFGGDEEMDVSSNETEETDTEEV------ 196
+FL + FP ++S +N+ E++ +N+T E DV+ + E TD E
Sbjct: 180 FIFLFQSFPLGDKSAVNLRGEYHTDNVTTYDELTKAAAKDVADPDVEMTDEPEAPTTTEG 239
Query: 197 --------------------------------------DK-VKIDFNFYKKFWSLQDYFR 217
DK V +D + Y FW LQ YF
Sbjct: 240 LKEDSEAQTTSASESHTPAQEPPKTPRVVVSSSKDESQDKGVDLD-DLYPIFWGLQAYFS 298
Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DVQ 251
P + ++ + F + E L+AF++ D ++
Sbjct: 299 APTKTFDPQHFAKFKTGLEATLSAFRNVDTDLENSSNSKMSEEIRKSTKRKRTTDGQEIA 358
Query: 252 SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE------ 304
SS NP KYLT++ L DL+++DT FRR+VL+Q LIL + S T K++ +
Sbjct: 359 SSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFMLSLTPKAKAKLADLTN 412
Query: 305 -----GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
G L D KW + + + Q G+ + ++V +L +++W +WK EGC
Sbjct: 413 KSVLYGFVLNDDDAKWAVKMRKAIEEYL-QEGIGGKFYYRMVDTVLSRDKNWVRWKAEGC 471
Query: 360 PELKRPLTSITDEDKKDEPDA-----KKKKTPELTKL-------WNSKDNLEACKSAERD 407
P ++RP S+++ E K+ +T + L S LE K ER
Sbjct: 472 PLIERPPVSVSEYLGAREHATKVYANKRLRTSPMGSLDLKFLSEGESLAGLERLKDPERF 531
Query: 408 FTPSLESYFEEAIQ---QMDPAAAVEE-QYKKVNDSNYAWRALRLLSR-KCPHF------ 456
P+ +S + +D A E+ +Y + S+ WR LRL ++ K F
Sbjct: 532 SVPAADSLMMGIMDDELDIDTAQTKEDREYAMRSKSSKTWRILRLSAKSKLAEFEKIEDG 591
Query: 457 ----FLNATP-NVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQ 511
L TP + + +E + K S+ +G+G Q+P + ++ I
Sbjct: 592 KNLKILFETPQQSDGTPQALEGTPQASEKPPAGSESTGHG----QEPGASSTENENSAIA 647
Query: 512 EEEKEEDWEAKADPE 526
E+ + A PE
Sbjct: 648 TEQTPANSVEAAAPE 662
>gi|242794188|ref|XP_002482321.1| THO complex subunit Tho1, putative [Talaromyces stipitatus ATCC
10500]
gi|218718909|gb|EED18329.1| THO complex subunit Tho1, putative [Talaromyces stipitatus ATCC
10500]
Length = 649
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 136/523 (26%), Positives = 235/523 (44%), Gaps = 114/523 (21%)
Query: 34 SALEQSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ +L+ + ID I N + + + C + L L+ + D
Sbjct: 66 AAVETAFREKFYELLASTSIDDLAFVNIWNLLDIVSIFSDNEQCEAGLIFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
T+D C ++F Y+E I F S K +LR CN+LLRRLSR+++TVFCGR+ +++
Sbjct: 126 TVDGCRKVFDYLESRREI-NTAKHFESKKLIILRSCNELLRRLSRAEDTVFCGRVFIYMF 184
Query: 151 KFFPFSERSGLNIISEFNVENITEF------GGDEEMDVSSNETEETDTEEVDKV----- 199
+ FP ++S +N+ EF+ EN+T F ++ M++ E +E +E+ +
Sbjct: 185 QSFPLGDKSSVNLRGEFHTENVTTFDDIALKSQEDTMEIEPTEQKEETSEKTSQSVQDEQ 244
Query: 200 -----------------------------KIDFN-FYKKFWSLQDYFRNPVQCYNKVSWK 229
ID + Y FW LQ +F P + ++ ++
Sbjct: 245 TKGETSGPKEGTPAVAETAHERKESEATKTIDLDTLYPIFWGLQGFFSAPTRLFDSQNFA 304
Query: 230 MFTSYAETVLAAFK--------------------------SYKLDDVQSSLNPSGDYFAK 263
F + E L+ FK + + DV SS NP K
Sbjct: 305 SFRTSLEKTLSTFKLMNKVTDTWSTAKAPDEVRRGLKRKRNGEGVDVASSFNP------K 358
Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLEL-------------- 308
YLT+++L DL+++D FRR+VL+Q LIL + S T ++R ++L
Sbjct: 359 YLTSRELFDLEINDIAFRRHVLVQALILLDFILSLTPQARQRLMKLVQPGKSLNKTVLYQ 418
Query: 309 ---KSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
+ KW T + V ++Q DG+ + ++V +L +++W +WK E CP +++P
Sbjct: 419 YTVSEEDAKWAAQTRDIVADYLQQG-IDGKFYYRMVDTVLSRDKNWVRWKVENCPPIEKP 477
Query: 366 LTSITD------EDKKDEPDAKKKKTP----ELTKL--WNSKDNLEACKSAERDFTPSLE 413
D K + + + +P +LT L N+ +LE K +R TP+ +
Sbjct: 478 AIQADDYFQARNNALKVFSNKRLRASPMGSLDLTFLADGNNLSSLEQLKDPDRYTTPAPD 537
Query: 414 SYFEEAIQ---QMDPAAAVEEQ--YKKVNDSNYAWRALRLLSR 451
S+ + +D A +E+ ++ +S Y WR LRL SR
Sbjct: 538 SFMMGIVDDTFNIDMAKDKDEKEAAERAKESKY-WRTLRLSSR 579
>gi|145248682|ref|XP_001400680.1| THO complex subunit Tho1 [Aspergillus niger CBS 513.88]
gi|134081347|emb|CAK41850.1| unnamed protein product [Aspergillus niger]
Length = 671
Score = 155 bits (392), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 148/591 (25%), Positives = 247/591 (41%), Gaps = 138/591 (23%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ L+ + I+ I N + + + C L L+ + D
Sbjct: 66 AAVEIAFREKFYSLLASTSIEDPSFICIWNLLDIISIFSDNEQCEPGLIFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF-----------------------------G 176
+FL + FP ++S +N+ E++ +N+T + G
Sbjct: 180 FIFLFQSFPLGDKSAVNLRGEYHTDNVTTYDELTKAAAKDVADADVEMSDEQEAPTATNG 239
Query: 177 GDEEMDVSSNETEETDT------------------EEVDK-VKIDFNFYKKFWSLQDYFR 217
E+ + + E+ T E DK V +D + Y FW LQ YF
Sbjct: 240 LKEDSEAQTTSASESHTPAQEPPKTPRVVVPSSKDESQDKGVDLD-DLYPMFWGLQAYFS 298
Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DVQ 251
P + ++ + F + E L+AFK+ D ++
Sbjct: 299 APTKTFDPQHFAKFKTGLEATLSAFKNVDTDLENSSNSKMSEEIRKSTKRKRTTDGQEIA 358
Query: 252 SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE------ 304
SS NP KYLT++ L DL+++DT FRR+VL+Q LIL + S T K++ +
Sbjct: 359 SSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFMLSLTPKAKAKLADLTN 412
Query: 305 -----GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
G L D KW + + + Q G+ + ++V +L +++W +WK EGC
Sbjct: 413 KSVLYGFVLHDDDAKWAVKMRKAIEEYL-QEGIGGKFYYRMVDTVLSRDKNWVRWKAEGC 471
Query: 360 PELKRPLTSITDEDKKDEPDA-----KKKKTPELTKL-------WNSKDNLEACKSAERD 407
P ++RP S+++ E K+ +T + L S LE K ER
Sbjct: 472 PLIERPPVSVSEYLGAREHATKVYANKRLRTSPMGSLDLKFLSEGESLAGLERLKEPERF 531
Query: 408 FTPSLESYFEEAIQ---QMDPAAAVEE-QYKKVNDSNYAWRALRLLSR-KCPHF------ 456
P+ +S + +D A E+ +Y + S+ WR LRL ++ K F
Sbjct: 532 SVPAADSLMMGIMDDELDIDTAQTKEDREYAMRSKSSKTWRILRLSAKSKLAEFEKIEDG 591
Query: 457 ----FLNATP-NVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVE 502
L TP + + +E + EKP + G+G + A E
Sbjct: 592 KNLKILFETPQQSDGTPQALEGTPQ--ASEKPPAGSESTGHGQEHGAASTE 640
>gi|358397904|gb|EHK47272.1| hypothetical protein TRIATDRAFT_291414 [Trichoderma atroviride IMI
206040]
Length = 800
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 130/493 (26%), Positives = 226/493 (45%), Gaps = 83/493 (16%)
Query: 34 SALEQSIRQYLLKLIKTPDIDI-KVIE--NYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+ +E + R KLI T ID K + N + + L C+ L L+ + D
Sbjct: 80 AVVETAARDSFSKLIATTSIDSPKFVRMWNLLDILSILSDDGQCDPALLFWLVEELLDSQ 139
Query: 91 TLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+ C +F ++E + +KQ+ +LR CN+LLRRLSR+++T FCGR+
Sbjct: 140 TIAGCRIIFDFLESRRERITAKHFKQKNLV------ILRSCNELLRRLSRAEDTAFCGRV 193
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF------GGDEEMDVSSNETEE--------- 190
+F+ + FP +RS +N+ E++VEN+T F G E+D + +E
Sbjct: 194 FIFMFQSFPLGDRSSVNLRGEYHVENVTSFETPEDDDGKMEVDAQAETPKESGAPKAGAK 253
Query: 191 TDTEEVDKVKI--DFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK----- 243
T + +K + Y FWSLQ+ F P++ ++ ++ F S E + AF+
Sbjct: 254 TGAHDSNKAQAPDSDTLYPLFWSLQESFSQPLKLFDVANFTKFKSSLELTIKAFQAIHDD 313
Query: 244 ----SYKLDDVQSSLNPSGD----------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
S +++++ +L D + KYLT++ L L++SD +FRR+VL+Q L
Sbjct: 314 SLYTSKSMENLKRTLKRKRDDDGTETLPEAFNPKYLTSKDLFQLEISDLSFRRHVLVQAL 373
Query: 290 ILFQYFTSTVKSRGEGL--------------ELKSDQEKWVKDTTETVYSLIKQTPPDGE 335
I+ + S K E +L + KW D + + +KQ DG
Sbjct: 374 IITDFLLSLSKEEREKRTNINISNKSVIYHGQLNEENTKWAADIKKVISDYLKQG-ADGP 432
Query: 336 HFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTS---ITDEDKKDEPDAKKKKTPE---- 387
+F ++V+ +L +++W WK CP ++R P+++ + ++ K+ P
Sbjct: 433 YFFRMVETVLARDKNWVFWKMASCPPIQRDPVSAQGFVEAKEAAQRMATSKRLRPNPLNA 492
Query: 388 --LTKLWNSKDN--LEACKSAERDFTPSLESYFEEAIQQMD-----PAAAVEEQYKKVND 438
L L N + L+ K +ER P L++ F+ I D P + +
Sbjct: 493 VPLDFLANEDETSALDKLKDSERCQLPELDA-FKSKIADDDFEIEMPTNSETKAAAVAGK 551
Query: 439 SNYAWRALRLLSR 451
++ +WRALR+ SR
Sbjct: 552 ASKSWRALRIASR 564
>gi|402079094|gb|EJT74359.1| guanylate kinase [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 798
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 135/509 (26%), Positives = 224/509 (44%), Gaps = 93/509 (18%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPII 81
P+ + +E ++R KL+ T I+ ++V N + L L C+ L
Sbjct: 66 PSVRTRQYGVIETAVRDIFGKLVATTSIESPEFVRVW-NLLDLVSILSDMRQCDPVLLFW 124
Query: 82 LLSDTFDMSTLDKCEQLFYYVEV-NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTV 140
L+ + D T C ++ ++E + I + M +LR CN+LLRRLSR+++T
Sbjct: 125 LVEELLDSQTTSGCRRVLDFLESRHQRITANPSVAMQL--TILRSCNELLRRLSRARDTT 182
Query: 141 FCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM--------------DVSSN 186
FCGR+ +F+ + FP +RS +N+ E++VEN+T F D + D +S
Sbjct: 183 FCGRVFIFMFQSFPLGDRSSVNLRGEYHVENVTTFDQDVDRNGTLDKMEIDGHTDDTTSP 242
Query: 187 ETEETDTEEV---DKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF 242
E + T T + ++ D Y FWSLQ F P + +++ ++ F S E+ +AAF
Sbjct: 243 EAKATATRTSAADESIRKDMEALYPVFWSLQASFSQPKKLFDRSHFERFKSGLESTMAAF 302
Query: 243 KSYKLDDVQSSLNPSGD-------------------YFAKYLTNQKLLDLQLSDTNFRRY 283
K + + S D + KYLT++ L L+LSD FRRY
Sbjct: 303 KDAQTKQISRSQRQEEDTKTGQKRKHEGEGADLADAFNPKYLTSRDLFRLELSDLTFRRY 362
Query: 284 VLLQFLILFQYFTSTVKSRGE---------------GLELKSDQEKWVKDTTETVYSLIK 328
+L+Q LI+ ++ S E +L S+ +W T + ++
Sbjct: 363 ILVQALIIMEFLLSLSAQEREIAAVASAPNKSVIYLDQQLSSEDTEWASKTRLAIIVHLE 422
Query: 329 QT---PPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS---ITDEDKKDEPDAKK 382
+ P +G +F ++VK IL +++W +WK C ++ P +D K A
Sbjct: 423 TSGAGPSEGAYFVRIVKTILARDKNWVRWKAYACSPIEMPSLPAQFFSDAKKSAYRSATS 482
Query: 383 KK---TP----ELTKLWNSKD--NLEACKSAERDFTPSLESYFEEAIQQ----------- 422
K+ TP + L +S+D L K+ ER P + F+ AI +
Sbjct: 483 KRLRATPMNALSIDFLSDSRDSSGLGKFKADERSRIPDV-CAFKRAIAEDELEIDMPTNN 541
Query: 423 MDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
AA+E + K +WRALR+ SR
Sbjct: 542 QTKVAAIEGKASK------SWRALRVASR 564
>gi|327356040|gb|EGE84897.1| nuclear matrix protein [Ajellomyces dermatitidis ATCC 18188]
Length = 656
Score = 155 bits (391), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 149/607 (24%), Positives = 252/607 (41%), Gaps = 131/607 (21%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+ +E + R+ L+ T ID I N + + + C L L+ + D
Sbjct: 68 AVVETAFRKKFYDLLATTSIDEPSFIHIWNLLDIVSIFSDNEQCEPGLLFWLIEELLDSQ 127
Query: 91 TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 128 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRTEDTVFCGRV 181
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDEEMDVSSNETE---ETDTE 194
+FL + FP +RS +N+ EF+ EN+T F GG MDV ++ E + TE
Sbjct: 182 FIFLFQSFPLGDRSSVNLRGEFHTENVTTFDKLPKAQDGGVTPMDVDASAKEVAGDAKTE 241
Query: 195 EVDKVKIDFN-----------------------------------FYKKFWSLQDYFRNP 219
+ + N Y FWSLQ+ F +P
Sbjct: 242 SAQASEQESNRQAAHPAADVGLQSSQPATKKPQEPEPSSTPSMDELYPIFWSLQNNFSSP 301
Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------DVQSSLNPSGDY 260
++ ++ F + E L+ F+ D V + + +
Sbjct: 302 TSLFDPANFAAFKTGLEASLSIFQKVNTDMEARMTKGSEDARRGLKRRRVGDGMEMTNSF 361
Query: 261 FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLE-----------L 308
KYLT++ L +L+++D FRR++L+Q LIL + S T K++ + + L
Sbjct: 362 NPKYLTSRDLFELEVNDVAFRRHILVQSLILLDFVLSLTPKAKAKLADSTNKSVLYSYVL 421
Query: 309 KSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS 368
+ KW ++ S ++Q +G+ + ++V +L +++W +WK E CP ++R S
Sbjct: 422 NDEDAKWASQMKTSIASYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEACPPIERTPVS 480
Query: 369 ITD------EDKKDEPDAKKKKTP----ELTKLWNSK--DNLEACKSAERDFTPSLESYF 416
I D K + + P +L L K +++ K +R TP+L+SY
Sbjct: 481 IQDYLDTQATATKVSTSKRLRSAPLGSLDLNFLTEDKNINSVGRLKHPDRFTTPALDSYM 540
Query: 417 EEAIQ---QMDPAAAVEEQYKKVND-SNYAWRALRLLSRKCPHFF-----------LNAT 461
+D A + EE+ + ++ WR LRL SR F L
Sbjct: 541 RGIADDEFNIDMAQSREEKDEATKARASKTWRTLRLSSRSKLALFDKIEDGNNLKVLFEP 600
Query: 462 PNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVD-QDPAEVEVDTKSEEIQEEEKEEDWE 520
P +++ N K V+ NGV+ QD D K E+ +E + +
Sbjct: 601 PPASEDASSASNKQKAAVE-----------NGVNRQDGPSKRGDVKGEKAKENQPPVSSD 649
Query: 521 AKADPEG 527
A +G
Sbjct: 650 GAAPQQG 656
>gi|396459819|ref|XP_003834522.1| similar to nuclear matrix protein [Leptosphaeria maculans JN3]
gi|312211071|emb|CBX91157.1| similar to nuclear matrix protein [Leptosphaeria maculans JN3]
Length = 696
Score = 155 bits (391), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 128/456 (28%), Positives = 215/456 (47%), Gaps = 81/456 (17%)
Query: 71 KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN---LLRMCN 127
+D+C+ L ++L+ + D ++ C +F ++E ++ ++ KN +LR+CN
Sbjct: 117 RDLCSPQLVLLLIEELLDSQSISGCRTVFNFLESR----REAIIAINSKNKDLVILRLCN 172
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG--------DE 179
+LLRRLSR++ VFCGR+ +F+ + FP ++S +N+ F+VEN+T F D+
Sbjct: 173 ELLRRLSRAEAPVFCGRVYIFMFQSFPLGDKSSVNLRGNFHVENVTTFEDFLKESNEVDD 232
Query: 180 EMDVSSNETEETDTEEV----------------DKVKIDFN-FYKKFWSLQDYFRNPVQC 222
M+V ++T T T E K +DF+ Y FWSLQ F NP +
Sbjct: 233 LMEVDHDKTVPTATGESASEHEVLSSETTKSTDAKSTLDFDALYPVFWSLQHSFSNPPRL 292
Query: 223 YNKVSWKMFTSYAETVLAAFK-----------------SYKLDDVQSSLNPSGDYFAKYL 265
+ + ++K F + + LA FK LDD Q +L + + KYL
Sbjct: 293 FEEDNFKQFRTSLDATLAKFKEVPKVIQSGDSERKKGQKLHLDDDQDALTST--FNPKYL 350
Query: 266 TNQKLLDLQLSDTNFRRYVLLQFLILFQ-------------YFTSTVKSRGEGLELKSDQ 312
T++ L L+LSD F+R++L+Q LIL ++ + K+ L+ D
Sbjct: 351 TSRDLFKLELSDLAFQRHILVQALILIDFLLTLTEDAKKQPFYQNAQKAMQYNFTLREDD 410
Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD- 371
KW + + + Q PDG+ + ++V +L +++W +WK E C R + D
Sbjct: 411 TKWALGIKSAIANYL-QEDPDGKFYYRMVDTVLSRDKNWVRWKMENCQPFTRDRVAAKDF 469
Query: 372 --EDKKDEPDAKKKKTPELTKLWNSKD---NLEACKS----AERDF--TPSLESYFEEA- 419
+ KK P+ + ++ D N EA K +RD P+ ESY +
Sbjct: 470 LQAKSGAQRAVVSKKAPKSLGVPHTLDFLNNTEAEKGLSQLRQRDRFNVPNAESYADRVR 529
Query: 420 -IQQMDPAAAVEEQYKKV--NDSNYAWRALRLLSRK 452
Q A EE+ ++V +S++ WR LRL S+K
Sbjct: 530 MTDQNLGMADTEEEKRQVAEKNSSHVWRGLRLASKK 565
>gi|154311261|ref|XP_001554960.1| hypothetical protein BC1G_06483 [Botryotinia fuckeliana B05.10]
Length = 661
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 133/485 (27%), Positives = 215/485 (44%), Gaps = 105/485 (21%)
Query: 60 NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK 119
N + + L + C L L+ + D T+ C ++F Y+E + F + K
Sbjct: 99 NLLDIISILSDDEQCEPALLFWLVEELLDSQTIAGCRRIFDYLESRREKITAKHF--NQK 156
Query: 120 N-NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--- 175
N +LR CN+LLRRLSR+++T FCGR+ +F+ + FP ++S +N+ EF+VEN+T F
Sbjct: 157 NLVILRSCNELLRRLSRAEDTAFCGRVFIFMFQSFPLGDKSSVNLRGEFHVENVTSFDTL 216
Query: 176 ----GGDEEMDVSSNE-TEETD----------------TEEVDKVKIDF----------- 203
G E+M+V S+ +ET +E+++ D
Sbjct: 217 PSNPGDVEKMEVDSDSNAKETSPASKPASNGRTSTARGSEDINGSGKDVASSKREPPMKP 276
Query: 204 -NFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL--------------- 247
Y FWSLQ F P + +N ++ F E +A FKS
Sbjct: 277 DELYPIFWSLQQSFSQPKKLFNSENFAEFKKGLEATIATFKSVHHETAGRPTARVAEDSK 336
Query: 248 -----------DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT 296
DD+ +S NP KYLT++ L +L++SD +FRR++L+Q LI+ +
Sbjct: 337 RGTKRKRSQGDDDLANSFNP------KYLTSRDLFELEISDLSFRRHILVQILIIMDFLL 390
Query: 297 STVKSRGEGLE-----------------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQ 339
S E L L + KW D +++ +K+ DG F +
Sbjct: 391 SLSAKAKEKLSKALPDNLNRSVTYADQTLSEEDTKWSLDMKKSIAEYLKKG-ADGAFFYR 449
Query: 340 VVKLILKGEEHWNQWKNEGCPELKRPLTS----ITDEDKKDEPDAKKKKTP------ELT 389
+V+ +L +++W +WK E CP + RP S IT + K+ P +L
Sbjct: 450 MVETVLSRDKNWVRWKVENCPSIARPAVSPHDYITAKASARRATTNKRLRPNPLGSLDLK 509
Query: 390 KLWNS--KDNLEACKSAERDFTPSLESY-FEEAIQQMDPAAAVEEQYKKV---NDSNYAW 443
L S + LE K ER PS++S+ + + MD A +E+ K + ++ W
Sbjct: 510 FLAESDLRSGLERLKKPERYQIPSIKSFKNKIDLDDMDIDMARDEESKNIAIEAKASKCW 569
Query: 444 RALRL 448
RALR+
Sbjct: 570 RALRI 574
>gi|347829111|emb|CCD44808.1| similar to THO complex subunit Tho1 [Botryotinia fuckeliana]
Length = 661
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 137/511 (26%), Positives = 225/511 (44%), Gaps = 107/511 (20%)
Query: 36 LEQSIRQYLLKLIKTPDIDIKVIE--NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLD 93
+E ++R L+ + I+ N + + L + C L L+ + D T+
Sbjct: 73 IETAVRDTFNNLLASISIETSSSRLWNLLDIISILSDDEQCEPALLFWLVEELLDSQTIA 132
Query: 94 KCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKF 152
C ++F Y+E + F + KN +LR CN+LLRRLSR+++T FCGR+ +F+ +
Sbjct: 133 GCRRIFDYLESRREKITAKHF--NQKNLVILRSCNELLRRLSRAEDTAFCGRVFIFMFQS 190
Query: 153 FPFSERSGLNIISEFNVENITEF-------GGDEEMDVSSNE-TEETD------------ 192
FP ++S +N+ EF+VEN+T F G E+M+V S+ +ET
Sbjct: 191 FPLGDKSSVNLRGEFHVENVTSFDTLPSNPGDVEKMEVDSDSNAKETSPASKPASNGRTS 250
Query: 193 ----TEEVDKVKIDF------------NFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAE 236
+E+++ D Y FWSLQ F P + +N ++ F E
Sbjct: 251 TARGSEDINGSGKDVASSKREPPMKPDELYPIFWSLQQSFSQPKKLFNSENFAEFKKGLE 310
Query: 237 TVLAAFKSYKL--------------------------DDVQSSLNPSGDYFAKYLTNQKL 270
+A FKS DD+ +S NP KYLT++ L
Sbjct: 311 ATIATFKSVHHETAGRPTARVAEDSKRGTKRKRSQGDDDLANSFNP------KYLTSRDL 364
Query: 271 LDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLE-----------------LKSDQE 313
+L++SD +FRR++L+Q LI+ + S E L L +
Sbjct: 365 FELEISDLSFRRHILVQILIIMDFLLSLSAKAKEKLSKALPDNLNRSVTYADQTLSEEDT 424
Query: 314 KWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS----I 369
KW D +++ +K+ DG F ++V+ +L +++W +WK E CP + RP S I
Sbjct: 425 KWSLDMKKSIAEYLKKG-ADGAFFYRMVETVLSRDKNWVRWKVENCPSIARPAVSPHDYI 483
Query: 370 TDEDKKDEPDAKKKKTP------ELTKLWNS--KDNLEACKSAERDFTPSLESY-FEEAI 420
T + K+ P +L L S + LE K ER PS++S+ + +
Sbjct: 484 TAKASARRATTNKRLRPNPLGSLDLKFLAESDLQSGLERLKKPERYQIPSIKSFKNKIDL 543
Query: 421 QQMDPAAAVEEQYKKV---NDSNYAWRALRL 448
MD A +E+ K + ++ WRALR+
Sbjct: 544 DDMDIDMARDEESKNIAIEAKASKCWRALRI 574
>gi|212535848|ref|XP_002148080.1| THO complex subunit Tho1, putative [Talaromyces marneffei ATCC
18224]
gi|210070479|gb|EEA24569.1| THO complex subunit Tho1, putative [Talaromyces marneffei ATCC
18224]
Length = 646
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 140/525 (26%), Positives = 233/525 (44%), Gaps = 117/525 (22%)
Query: 34 SALEQSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ +L+ + ID I N + + C + L L+ + D
Sbjct: 66 AAVETAFREKFYELLASTSIDDPAFVNIWNLLDIVSIFSDNAQCEAGLIFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
T+D C ++F Y+E I F S K +LR CN+LLRRLSR+++TVFCGR+ +++
Sbjct: 126 TVDGCRKVFDYLESRREI-NTAKHFESKKLIILRSCNELLRRLSRAEDTVFCGRVFIYMF 184
Query: 151 KFFPFSERSGLNIISEFNVENITEFG---------------------------------- 176
+ FP ++S +N+ EF+ ENIT F
Sbjct: 185 QSFPLGDKSSVNLRGEFHTENITTFDNIVKTSQEDAMETEPTEPTETTEKMAQVAQGEPI 244
Query: 177 GDEEMDVSSNETEETDTEEVDKVK-------IDFN-FYKKFWSLQDYFRNPVQCYNKVSW 228
+ E + E T E D+ K +D + Y FW LQ +F P + ++ S+
Sbjct: 245 NEGEATRAVKEGTPTVAEAADEKKENDPSKTVDLDALYPIFWGLQGFFSAPTRLFDSQSF 304
Query: 229 KMFTSYAETVLAAFK--------------------------SYKLDDVQSSLNPSGDYFA 262
F + E L+ FK + + D+ SS NP
Sbjct: 305 ASFRAGLEKTLSTFKLMNKVTDTWSTAKAPDELRRGLKRKRNGEGVDIASSFNP------ 358
Query: 263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSR---------GEGL------ 306
KYLT+++L DL+++D FRR+VL+Q LIL + S T ++R G+ L
Sbjct: 359 KYLTSRELFDLEINDIAFRRHVLVQALILLDFILSLTPQARQRLMKSVQPGKALNKTVLY 418
Query: 307 --ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
+ + KW T + V ++Q DG+ + ++V +L +++W +WK E CP +++
Sbjct: 419 QYAVSEEDAKWAAQTRDVVADYLQQG-IDGKFYYRMVDTVLSRDKNWVRWKVENCPPIEK 477
Query: 365 PLTSITDEDK-KDEPDAKK-------KKTP----ELTKL--WNSKDNLEACKSAERDFTP 410
P +I +D + +A K + P +LT L NS +LE K +R TP
Sbjct: 478 P--AIKGDDYFQARGNASKVFTTKRLRAAPMGSLDLTFLTDGNSISSLEQLKEPDRYSTP 535
Query: 411 SLESYFEEAI-QQMDPAAAVEEQYKKVNDS---NYAWRALRLLSR 451
+ +S+ + + D A ++ K+ + + WR LRL SR
Sbjct: 536 APDSFMMGIVDDEFDIDMAKDKDEKEAAERAKVSKLWRTLRLASR 580
>gi|326483229|gb|EGE07239.1| THO complex subunit 1 [Trichophyton equinum CBS 127.97]
Length = 635
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 138/536 (25%), Positives = 234/536 (43%), Gaps = 114/536 (21%)
Query: 16 VVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKD 72
V++ Y + + +A+E + R+ L+ T ID I N + + +
Sbjct: 48 VLAGDYGISVGANQQTQYAAVETAFREIFYDLLATTSIDEPSFMQIWNLLDIVSIFSDNE 107
Query: 73 MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCN 127
C L L+ + D T+D C ++F Y+E +KQ++ +LR CN
Sbjct: 108 KCEPGLIFWLIEELLDSQTIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCN 161
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG--------GDE 179
+LLRRLSR+++TVFCGR+ +FL + FP ++S +N+ E++ EN+T F DE
Sbjct: 162 ELLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSSVNLRGEYHTENVTTFDVQPQAQAESDE 221
Query: 180 EMDVSSNETE-------ETDTEEVD-----------------------KVKIDFNFYKKF 209
MD+ E E TD E ++++D Y F
Sbjct: 222 VMDIDIQEGETQVSNAKSTDQESTTAHAPLESTNDATPQRAKAPSIQPELELD-TLYAAF 280
Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK--SYKLD------------------- 248
WSLQ+ F P++ ++ + S E L++F+ S KL+
Sbjct: 281 WSLQESFSKPIRLFDTTHFASLKSGLEATLSSFRKVSSKLEVRSTTKGSDELRRGTKRKR 340
Query: 249 -----DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF----------- 292
D+ +S NP KYLT++ L +L+++D +FRR++L+Q LIL
Sbjct: 341 GEGGSDLPNSFNP------KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPKSK 394
Query: 293 -QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
+ ST KS L + KW ++ S + Q DG+ + ++V +L +++W
Sbjct: 395 AKLADSTNKSVLYSYVLSDEDAKWATQMRSSIASYL-QKGADGKFYFRMVDTVLTRDKNW 453
Query: 352 NQWKNEGCPELKRPLTSITDE------DKKDEPDAKKKKTP------ELTKLWNSKDNLE 399
+WK EGCP +++P + K + + + TP + ++ +L
Sbjct: 454 VRWKAEGCPPIEQPPVPVQSHLDTQSSAAKITANKRLRSTPMGSLDLGFLSVESNLGDLN 513
Query: 400 ACKSAERDFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
K ER TPSL+ Y +D A EE+ + + ++ WR LR+ SR
Sbjct: 514 RLKEPERFSTPSLDFYMSGITDDDFNIDMAKDDEEKEQSIKAKASKTWRTLRIASR 569
>gi|367051076|ref|XP_003655917.1| hypothetical protein THITE_2120207 [Thielavia terrestris NRRL 8126]
gi|347003181|gb|AEO69581.1| hypothetical protein THITE_2120207 [Thielavia terrestris NRRL 8126]
Length = 765
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 105/390 (26%), Positives = 183/390 (46%), Gaps = 64/390 (16%)
Query: 34 SALEQSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+ +E ++R LI T I+ + N + L ++C+ L L+ + D
Sbjct: 76 AVIETAVRDTFESLIATTPIESPAFVKVWNLFDILSLLSDCELCDPALLFWLVEELLDSQ 135
Query: 91 TLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+ C ++F ++E + +KQ+ +LR CN+LLRRLSR+ + FCGR+
Sbjct: 136 TIAGCRKVFDFLESRRERITAKHFKQKQLV------ILRTCNELLRRLSRALDPAFCGRV 189
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTE---EVDKVKID 202
+F+ + FP ++S +N+ E++VEN+T + D+ E + D+E + DK
Sbjct: 190 FIFMFQSFPLGDKSSVNLRGEYHVENVTTW--DQTTPAPDGEKMDVDSEPKPQKDKALDP 247
Query: 203 FNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSY----------------- 245
Y FWSLQ+ F P + ++ + F + E +A F+S
Sbjct: 248 DTLYPVFWSLQESFNQPKKLFDSGHFAAFKAGLEATMATFRSIKPEQPSRSKEKQDKPVE 307
Query: 246 --------KLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS 297
K DDV L + + KYLT++ L L++SD FRR +L+Q LI+ ++ +
Sbjct: 308 EMGPSLKRKRDDVDDEL--ASGFNPKYLTSRDLFKLEISDLAFRRNILVQALIVMEFLLA 365
Query: 298 TVKSRGEGL---------------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVK 342
E L +L + KW D E + +KQ +G +F+++V+
Sbjct: 366 LSPKAKEKLASVKAPNKSVTYSDQQLSEEDAKWAADMKEAIVEYLKQG-VEGPYFNRMVE 424
Query: 343 LILKGEEHWNQWKNEGCPELKRPLTSITDE 372
+L +++W +WK E CP ++ P S+T E
Sbjct: 425 TVLSRDKNWVRWKIENCPSIELP--SMTPE 452
>gi|156065307|ref|XP_001598575.1| hypothetical protein SS1G_00664 [Sclerotinia sclerotiorum 1980]
gi|154691523|gb|EDN91261.1| hypothetical protein SS1G_00664 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 658
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 129/491 (26%), Positives = 212/491 (43%), Gaps = 113/491 (23%)
Query: 60 NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTF 114
N + + L + C L L+ + D T+ C ++F Y+E + + Q+
Sbjct: 96 NLLDIISILSDDEQCEPALLFWLVEELLDSQTIAGCRKIFDYLESRREKITAKHFNQKNL 155
Query: 115 FMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITE 174
+LR CN+LLRRLSR+++T FCGR+ +F+ + FP ++S +N+ EF+VEN+T
Sbjct: 156 V------ILRSCNELLRRLSRAEDTAFCGRVFIFMFQSFPLGDKSSVNLRGEFHVENVTS 209
Query: 175 FGG-------DEEMDVSSN---ETEETDTEEVDKVKI-------DFN------------- 204
F E M+V S+ +T DT+ + D N
Sbjct: 210 FDALPRNPVDTENMEVDSDSNLKTTPPDTKATPNGRTSTARGSEDSNGSGKDVATSKHEP 269
Query: 205 ------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL----------- 247
Y FWSLQ F P + +N ++ F E +A FKS +
Sbjct: 270 PMKPDELYPIFWSLQQSFSQPKRLFNAENFAEFKKGLEATIATFKSVQHETVGRPAARAA 329
Query: 248 ---------------DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
DD+ +S NP KYLT++ L +L++SD +FRR++L+Q LI+
Sbjct: 330 EDNKRGTKRKRSQGDDDLANSFNP------KYLTSRDLFELEISDLSFRRHILVQILIIM 383
Query: 293 QYFTSTVKSRGEGLE-----------------LKSDQEKWVKDTTETVYSLIKQTPPDGE 335
+ S E L L + KW +T + + +K+ DG
Sbjct: 384 DFLLSLSTKAKEKLSKALPENINRSVTYADQTLSEEDTKWSLETKKIIAEYLKKG-ADGA 442
Query: 336 HFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD------------EDKKDEPDAKKK 383
F ++V+ +L +++W +WK E CP + RP S D +K+ P+
Sbjct: 443 FFYRMVETVLSRDKNWVRWKVENCPSIARPAVSPQDYIDAKASARKATTNKRLRPNPLGS 502
Query: 384 KTPELTKLWNSKDNLEACKSAERDFTPSLESYF-EEAIQQMDPAAAVEEQYKKV---NDS 439
+ + + LE K +ER PS++S+ + + MD A +E+ K V +
Sbjct: 503 LDLKFLAETDVQSGLERLKKSERYQVPSIKSFRNKIDLDDMDIDMARDEESKNVAIEAKA 562
Query: 440 NYAWRALRLLS 450
+ WRALR+ S
Sbjct: 563 SKCWRALRIAS 573
>gi|315041375|ref|XP_003170064.1| THO complex subunit 1 [Arthroderma gypseum CBS 118893]
gi|311345098|gb|EFR04301.1| THO complex subunit 1 [Arthroderma gypseum CBS 118893]
Length = 635
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 135/517 (26%), Positives = 233/517 (45%), Gaps = 112/517 (21%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ L+ T I+ I N + + + C L L+ + D
Sbjct: 66 AAVETAFREIFYDLLATTSINEPSFMRIWNLLDIVSIFSDNEKCEPGLIFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFG--------GDEEMDVS------------- 184
+FL + FP ++S +N+ E++ EN+T F G+E MD+
Sbjct: 180 FIFLFQSFPLGDKSSVNLRGEYHTENVTTFDIQPKIQAEGEEAMDIDIQGDVTPASNAKD 239
Query: 185 --------------SNETEETDTEEVD-KVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSW 228
+N+ T+E + ++++D + Y FW LQ+ F P + ++ +
Sbjct: 240 SQQEPTITQAPLEVTNDISSQRTKEPEIQLELDLDTLYAAFWGLQESFSKPTRLFDAAHF 299
Query: 229 KMFTSYAETVLAAFK--SYKLD------------------------DVQSSLNPSGDYFA 262
S E L++F+ S K++ D+ +S NP
Sbjct: 300 ASLKSGLEATLSSFRKVSSKVEVRNTTKGLDESRRGTKRKRAEGGSDLPNSFNP------ 353
Query: 263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILF------------QYFTSTVKSRGEGLELKS 310
KYLT++ L +L+++D +FRR++L+Q LIL + ST KS G L
Sbjct: 354 KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPKSKAKLADSTNKSVLYGYVLSD 413
Query: 311 DQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSIT 370
+ KW ++ S + Q DG+ + ++V +L +++W +WK EGCP ++RP +
Sbjct: 414 EDAKWATQMRSSIASYL-QKGADGKFYFRMVDTVLTRDKNWVRWKAEGCPPIERPPVPVQ 472
Query: 371 DE------DKKDEPDAKKKKTP----ELTKLWNSKD--NLEACKSAERDFTPSLESYFEE 418
K + + + TP +L L + +L+ K ER TPSL+ Y
Sbjct: 473 SHLDTQSSAVKITANRRLRSTPMGSLDLGFLAAESNLGDLDRLKEPERFATPSLDFYMSG 532
Query: 419 AIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
+D A +E+ + V ++ WR LR+ SR
Sbjct: 533 ITDDDFNIDMAKDDDEKEQSVKAKASKTWRTLRIASR 569
>gi|325187942|emb|CCA22486.1| THO complex subunit putative [Albugo laibachii Nc14]
Length = 582
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 130/468 (27%), Positives = 200/468 (42%), Gaps = 84/468 (17%)
Query: 54 DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
D I + LC++ + ++ P +L D + + CE L+ +EV
Sbjct: 48 DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 107
Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
FF + LLRMCN +LRRLS++ NTVFCGRIL+FL+ F +ERS
Sbjct: 108 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAA------ 161
Query: 168 NVENITEFGGD-----EEMDVSSNETEETDT-------EEVDKVKIDFNFYKKFWSLQDY 215
NVEN+T F E D +N E D ++ +ID+ Y FW +Q +
Sbjct: 162 NVENLTIFEDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQRF 221
Query: 216 FRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNPS 257
FR+ + + + K F VL AF++ D V S+ S
Sbjct: 222 FRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKMS 281
Query: 258 GD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
D + KYLTN +L LQL D R VL Q LIL + V++
Sbjct: 282 IDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHL---VRA 338
Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
+ + S Q+ + + + V L+K+TPPDG +S++V +L+ E +W +WK E CP
Sbjct: 339 KS-TVSQDSTQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCPS 397
Query: 362 L--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----LE 399
KR + T+ + + +KK E + ++ S + LE
Sbjct: 398 YEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLLE 457
Query: 400 ACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
K R +L + + DPA+ ++++Y D Y WR LR
Sbjct: 458 KIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 505
>gi|325187945|emb|CCA22489.1| THO complex subunit putative [Albugo laibachii Nc14]
Length = 545
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 130/469 (27%), Positives = 202/469 (43%), Gaps = 86/469 (18%)
Query: 54 DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
D I + LC++ + ++ P +L D + + CE L+ +EV
Sbjct: 48 DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 107
Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
FF + LLRMCN +LRRLS++ NTVFCGRIL+FL+ F +ERS
Sbjct: 108 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAA------ 161
Query: 168 NVENITEFGGDEEMDVSSNETEETDTEEVDKV-------------KIDFNFYKKFWSLQD 214
NVEN+T F D+ ++ E+T+ E D +ID+ Y FW +Q
Sbjct: 162 NVENLTIF-EDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQR 220
Query: 215 YFRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNP 256
+FR+ + + + K F VL AF++ D V S+
Sbjct: 221 FFRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKM 280
Query: 257 SGD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
S D + KYLTN +L LQL D R VL Q LIL + V+
Sbjct: 281 SIDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHL---VR 337
Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
++ + S Q+ + + + V L+K+TPPDG +S++V +L+ E +W +WK E CP
Sbjct: 338 AKS-TVSQDSTQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCP 396
Query: 361 EL--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----L 398
KR + T+ + + +KK E + ++ S + L
Sbjct: 397 SYEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLL 456
Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
E K R +L + + DPA+ ++++Y D Y WR LR
Sbjct: 457 EKIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 505
>gi|326475798|gb|EGD99807.1| nuclear matrix protein [Trichophyton tonsurans CBS 112818]
Length = 635
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 135/536 (25%), Positives = 236/536 (44%), Gaps = 114/536 (21%)
Query: 16 VVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKD 72
V++ Y + + +A+E + R+ L+ T ID I N + + +
Sbjct: 48 VLAGDYGISVGANQQTQYAAVETAFREIFYDLLATTSIDEPSFMQIWNLLDIVSIFSDNE 107
Query: 73 MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCN 127
C L L+ + D T+D C ++F Y+E +KQ++ +LR CN
Sbjct: 108 KCEPGLIFWLIEELLDSQTIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCN 161
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG--------GDE 179
+LLRRLSR+++TVFCGR+ +FL + FP ++S +N+ E++ EN+T F DE
Sbjct: 162 ELLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSSVNLRGEYHTENVTTFDVQPQAQAESDE 221
Query: 180 EMDVSSNETE-------ETDTEEVD-----------------------KVKIDFNFYKKF 209
MD+ E E TD E ++++D Y F
Sbjct: 222 VMDIDIQEGETQVSNAKSTDQESTTAHAPLESTNDATPQRAKAPSIQPELELD-TLYAAF 280
Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK--SYKLD------------------- 248
WSLQ+ F P++ ++ + S E L++F+ S KL+
Sbjct: 281 WSLQESFSKPIRLFDTTHFASLKSGLEATLSSFRKVSSKLEVRSTTKGSDELRRGTKRKR 340
Query: 249 -----DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQ-FLILFQYFTSTVKSR 302
D+ +S NP KYLT++ L +L+++D +FRR++L+Q ++L + T KS+
Sbjct: 341 GEGGSDLPNSFNP------KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPKSK 394
Query: 303 GEGLE-----------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
+ + L + KW ++ S + Q DG+ + ++V +L +++W
Sbjct: 395 AKLADSTNKPVLYSYVLSDEDAKWATQMRSSIASYL-QKGADGKFYFRMVDTVLTRDKNW 453
Query: 352 NQWKNEGCPELKRPLTSITDE------DKKDEPDAKKKKTP------ELTKLWNSKDNLE 399
+WK EGCP +++P + K + + + TP + ++ +L
Sbjct: 454 VRWKAEGCPPIEQPPVPVQSHLDTQSSAAKITANKRLRSTPMGSLDLGFLSVESNLGDLN 513
Query: 400 ACKSAERDFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
K ER TPSL+ Y +D A EE+ + + ++ WR LR+ SR
Sbjct: 514 RLKEPERFSTPSLDFYMSGITDDDFNIDMAKDDEEKEQSIKAKASKTWRTLRIASR 569
>gi|171687098|ref|XP_001908490.1| hypothetical protein [Podospora anserina S mat+]
gi|170943510|emb|CAP69163.1| unnamed protein product [Podospora anserina S mat+]
Length = 826
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 136/517 (26%), Positives = 222/517 (42%), Gaps = 130/517 (25%)
Query: 36 LEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMST 91
+E ++R L+ ID +KV N + L ++C+ L + L+ + D T
Sbjct: 76 IETAVRDTFKYLVSRASIDSPEFVKVW-NLFDILSILSDSELCDPALLLWLVEELLDSQT 134
Query: 92 LDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRIL 146
+ C ++F ++E + +KQ+ +LR CN+LLRRLSR+ + F GR+L
Sbjct: 135 VAGCRKVFDFLESRRERITAKHFKQKQLV------ILRTCNELLRRLSRALDPAFSGRVL 188
Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEF--------GGDEEMDVSSNETEETD------ 192
+F+ + FP ++S +N+ EF++EN+T F G +++MDV ++ D
Sbjct: 189 IFMFQSFPLGDKSSVNLRGEFHIENVTTFDQIPAKTDGDEDKMDVDTDVGTTGDGSRRGR 248
Query: 193 TEEVDKVKIDFN---FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL-- 247
+ +VD K + Y FW LQ++F P + + F S E +A F S K
Sbjct: 249 SNDVDTKKKALDPDALYPIFWGLQEFFNQPKLLFEASHFASFKSGLEATMATFLSMKFEQ 308
Query: 248 ---------------------------DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
D + S NP KYLT++ L L++SD F
Sbjct: 309 PPRVKERPDRPVEEPKHALKRKREESNDALSSGFNP------KYLTSRDLFKLEISDLTF 362
Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGLE---------------LKSDQEKWVKDTTETVYS 325
RR +L+Q LI+ ++ + E L L + KWV +T E++ S
Sbjct: 363 RRNILIQALIIMEFLLALSPKAKEKLATVKVPNKSVTYSEKTLSEEDIKWVVETKESIKS 422
Query: 326 LIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKT 385
+K P+G HF ++V+ +L +++W +WK E CP ++ P S PD +
Sbjct: 423 YLKLG-PEGPHFCRLVETVLSRDKNWVRWKVENCPPIELPQLS---------PDVFVEAR 472
Query: 386 PELTKLWNSK---------------------DNLEACKSAERDFTPSLESY--------F 416
KL +K L+ K R PSL+SY F
Sbjct: 473 TNAGKLATTKRLRPTPMGSLNLDFLEDDDEERALKKLKDCHRYKVPSLDSYKKGIADDDF 532
Query: 417 EEAIQQMDPA--AAVEEQYKKVNDSNYAWRALRLLSR 451
E + + + AA+E + K AWRALR+ S+
Sbjct: 533 EIGMPTNNDSKTAAIEGKASK------AWRALRIASK 563
>gi|119179900|ref|XP_001241471.1| hypothetical protein CIMG_08634 [Coccidioides immitis RS]
gi|392866650|gb|EAS30175.2| nuclear matrix protein [Coccidioides immitis RS]
Length = 629
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 101/418 (24%), Positives = 185/418 (44%), Gaps = 87/418 (20%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ L+ T I+ + N + + + C L L+ + D
Sbjct: 66 AAVETAFREKFYHLLATTSINKPEFVHMWNLLDVVSIFSDNERCEPGLIFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF------------------------------ 175
+FL + FP ++S +N+ E++ EN+T F
Sbjct: 180 FIFLFQSFPLGDKSSVNLRGEYHTENVTTFDEPPKHIAANEGADDVEMKDAEAQKPSDNS 239
Query: 176 --------GGDEEMDVSSNETEETDTEEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKV 226
GG ++ ++++ + + ID + Y FW LQ F P + ++
Sbjct: 240 SKTDAASQGGSVSAEIDGQRSQQSTDHQAEDTSIDMDALYPVFWGLQANFSAPTRLFDHE 299
Query: 227 SWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD---------------------YFAKYL 265
+ F E+ +++F+ +D + + + + KYL
Sbjct: 300 HFASFKKGLESTISSFQKVSIDLEKRGMTKGSEETSRRGLKRKRGETGPETANTFNPKYL 359
Query: 266 TNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS------------TVKSRGEGLELKSDQE 313
T++ L +L+++D FRR++L+Q LIL + S T KS G L +
Sbjct: 360 TSRDLFELEVNDVAFRRHILVQSLILLDFLISLSPKAKAKLADATNKSVLYGYVLSDEDA 419
Query: 314 KWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD 371
KW ++ S ++Q DG+ + ++V +L +++W +WK EGCP +++P I D
Sbjct: 420 KWANQMKSSIASYLQQGL-DGKFYYRMVDTVLTRDKNWVRWKAEGCPPIEKPPIQIQD 476
>gi|350639205|gb|EHA27559.1| hypothetical protein ASPNIDRAFT_41496 [Aspergillus niger ATCC 1015]
Length = 694
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 150/608 (24%), Positives = 247/608 (40%), Gaps = 149/608 (24%)
Query: 34 SALEQSIRQYLLKLIKTPDIDIKVIENYIS----LCVELC---MKDMCNSTLPIILLSDT 86
+A+E + R+ L+ P+ V E + + E C + C L L+ +
Sbjct: 66 AAVEIAFREKFYSLLPNPESYNNVAEIMVGFGGLIGYEWCPDIFAEQCEPGLIFWLIEEL 125
Query: 87 FDMSTLDKCEQLFYYVE------------------VNVNIWKQQTFFMSCKNNLLRMCND 128
D T+D C ++F Y+E + + Q F +LR CN+
Sbjct: 126 LDSQTIDGCRKVFDYLESRRERNTKVGLLVGPSSPTSHMLTPPQKHFKQKSLIILRSCNE 185
Query: 129 LLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF------------- 175
LLRRLSR+++TVFCGR+ +FL + FP ++S +N+ E++ +N+T +
Sbjct: 186 LLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSAVNLRGEYHTDNVTTYDELTKAAAKDVAD 245
Query: 176 ----------------GGDEEMDVSSNETEETDT------------------EEVDK-VK 200
G E+ + + E+ T E DK V
Sbjct: 246 ADVEMSDEQEAPTATNGLKEDSEAQTTSASESHTPAQEPPKTPRVVVPSSKDESQDKGVD 305
Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD------------ 248
+D + Y FW LQ YF P + ++ + F + E L+AFK+ D
Sbjct: 306 LD-DLYPMFWGLQAYFSAPTKTFDPQHFAKFKTGLEATLSAFKNVDTDLENSSNSKMSEE 364
Query: 249 --------------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY 294
++ SS NP KYLT++ L DL+++DT FRR+VL+Q LIL +
Sbjct: 365 IRKSTKRKRTTDGQEIASSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDF 418
Query: 295 FTS-TVKSRGE-----------GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVK 342
S T K++ + G L D KW + + + Q G+ + ++V
Sbjct: 419 MLSLTPKAKAKLADLTNKSVLYGFVLHDDDAKWAVKMRKAIEEYL-QEGIGGKFYYRMVD 477
Query: 343 LILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDA-----KKKKTPELTKL------ 391
+L +++W +WK EGCP ++RP S+++ E K+ +T + L
Sbjct: 478 TVLSRDKNWVRWKAEGCPLIERPPVSVSEYLGAREHATKVYANKRLRTSPMGSLDLKFLS 537
Query: 392 -WNSKDNLEACKSAERDFTPSLESYFEEAIQ---QMDPAAAVEE-QYKKVNDSNYAWRAL 446
S LE K ER P+ +S + +D A E+ +Y + S+ WR L
Sbjct: 538 EGESLAGLERLKEPERFSVPAADSLMMGIMDDELDIDTAQTKEDREYAMRSKSSKTWRIL 597
Query: 447 RLLSR-KCPHF----------FLNATP-NVEKNSEFIENMVKRCVKEKPSSQISGNGNGV 494
RL ++ K F L TP + + +E + EKP + G+G
Sbjct: 598 RLSAKSKLAEFEKIEDGKNLKILFETPQQSDGTPQALEGTPQ--ASEKPPAGSESTGHGQ 655
Query: 495 DQDPAEVE 502
+ A E
Sbjct: 656 EHGAASTE 663
>gi|239608430|gb|EEQ85417.1| nuclear matrix protein [Ajellomyces dermatitidis ER-3]
Length = 663
Score = 151 bits (382), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 133/528 (25%), Positives = 225/528 (42%), Gaps = 117/528 (22%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+ +E + R+ L+ T ID I N + + + C L L+ + D
Sbjct: 66 AVVETAFRKKFYDLLATTSIDEPSFIHIWNLLDIVSIFSDNEQCEPGLLFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRTEDTVFCGRV 179
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDEEMDVSSNETE---ETDTE 194
+FL + FP +RS +N+ EF+ EN+T F GG MDV ++ E + TE
Sbjct: 180 FIFLFQSFPLGDRSSVNLRGEFHTENVTTFDKLPKAQDGGVTPMDVDASAKEVAGDAKTE 239
Query: 195 EVDKVKIDFN-----------------------------------FYKKFWSLQDYFRNP 219
+ + N Y FWSLQ+ F +P
Sbjct: 240 SAQASEQESNRQAAHPAADVGLQSSQPATKKPQEPEPSSTPSMDELYPIFWSLQNNFSSP 299
Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------DVQSSLNPSGDY 260
++ ++ F + E L+ F+ D V + + +
Sbjct: 300 TSLFDPANFAAFKTGLEASLSIFQKVNTDMEARMTKGSEDARRGLKRRRVGDGMEMTNSF 359
Query: 261 FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE--------------- 304
KYLT++ L +L+++D FRR++L+Q LIL + S T K++ +
Sbjct: 360 NPKYLTSRDLFELEVNDVAFRRHILVQSLILLDFVLSLTPKAKAKLADSTNKSVLYSYVL 419
Query: 305 -----GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
L S + KW ++ S ++Q +G+ + ++V +L +++W +WK E C
Sbjct: 420 NDEDVNLCTASSKAKWASQMKTSIASYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEAC 478
Query: 360 PELKRPLTSITD------EDKKDEPDAKKKKTP----ELTKLWNSK--DNLEACKSAERD 407
P ++R SI D K + + P +L L K +++ K +R
Sbjct: 479 PPIERTPVSIQDYLDTQATATKVSTSKRLRSAPLGSLDLNFLTEDKNINSVGRLKHPDRF 538
Query: 408 FTPSLESYFEEAIQ---QMDPAAAVEEQYKKVND-SNYAWRALRLLSR 451
TP+L+SY +D A + EE+ + ++ WR LRL SR
Sbjct: 539 TTPALDSYMRGIADDEFNIDMAQSREEKDEATKARASKTWRTLRLSSR 586
>gi|303286871|ref|XP_003062725.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456242|gb|EEH53544.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 664
Score = 151 bits (382), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 123/442 (27%), Positives = 196/442 (44%), Gaps = 58/442 (13%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
+ + L + L +C+ + +L + T+ + +F ++E +
Sbjct: 104 LPRLLDLSLTLSEWGVCDHGVVFTVLEQLVECCTIADAQDVFTWIEGKRTRLNADVLWRR 163
Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
K +LR CNDLLRRLS++ + GR+LL L+ + SERS LN+ +N N+T
Sbjct: 164 GKLIMLRTCNDLLRRLSKTNDATLRGRVLLLLSSLYALSERSALNLAGNYNRANVTTL-- 221
Query: 178 DEEMDVSSNETEET-----------------DTEEVDKVK------IDFNFYKKFWSLQD 214
+E++V +N E+ D E D +D FYK FWSLQ
Sbjct: 222 -DELEVGTNGAEQGGGGGKGKGKGDDDNEDEDAMETDDAAKTNAPVVDVRFYKTFWSLQT 280
Query: 215 YFRNP-VQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGDYFA---KYLTNQKL 270
+F+NP V W F + + VL F++++LD + + + A KYLT L
Sbjct: 281 FFQNPNVALGVAGGWDAFYAAFQAVLDNFETHRLDASDETAATTAEESALGVKYLTAAPL 340
Query: 271 LDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQT 330
L LQL D FRR LLQ IL + ++K DQ K +
Sbjct: 341 LQLQLRDPAFRRSFLLQCAILLGHCEGPPALP----KVKRDQVAKAKARV-MAATAATPP 395
Query: 331 PPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL------------------KRPLTSITDE 372
G F++ V LK E +W WK + C + KRP + +
Sbjct: 396 NARGVEFAEFVAAALKREVYWVGWKKDNCKDFQRASEPRPPPKPPPPVMRKRP--RLGTK 453
Query: 373 DKKDEPDAKKK-KTPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVE 430
P+ + K PEL +LWN S DN + + + + T S+ ++ + + MDP A +E
Sbjct: 454 QPAVPPEKRVKLGNPELDRLWNLSVDNASSLREKKAEET-SMATFLQPVLDDMDPEAQIE 512
Query: 431 EQYKKVNDSNYAWRALRLLSRK 452
E YK+ ND + W+ALR+++++
Sbjct: 513 EAYKQKNDKVFVWKALRMIAKE 534
>gi|449296680|gb|EMC92699.1| hypothetical protein BAUCODRAFT_41548, partial [Baudoinia
compniacensis UAMH 10762]
Length = 591
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 134/501 (26%), Positives = 214/501 (42%), Gaps = 88/501 (17%)
Query: 32 KKSALEQSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFD 88
K + +E + R L+ + I+ I N + + + +C S L L+ + D
Sbjct: 64 KVNVVETAARGIFYGLLNSTPIEKPTFVEIWNLLDVLASCGDQGVCASELVCYLIEELLD 123
Query: 89 MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL--LRMCNDLLRRLSRSQNTVFCGRIL 146
T C +F Y+E + F K NL LR CN+LL+RLSR+++ +FCGR+
Sbjct: 124 SQTTAGCRIVFDYLESRRERLAAKDFH---KKNLIFLRSCNELLKRLSRAEDAIFCGRVF 180
Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEFG----GDEEMDVSSN---------ETEETDT 193
FL + FP ++S +N+ EF+ EN+T+F E+M+V ET + T
Sbjct: 181 FFLFQTFPLGDKSSVNLRGEFHNENVTKFAVPDEHGEKMEVDEKAGGAVTVKAETPQPPT 240
Query: 194 EEVDK--------------VKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVL 239
+ K V Y FW LQ F P + + V +K F L
Sbjct: 241 KPAGKGAPLKSPHKVPAEVVLSTGELYPIFWRLQQDFSEPTRLFAPVQFKTFKDGLSHTL 300
Query: 240 AAFKSYKL-----------DDVQSSLNPSG-----------DYFAKYLTNQKLLDLQLSD 277
FK V+ L G +Y KYLT++ L DL+LSD
Sbjct: 301 TKFKKTPTVVQTKATETDKRGVKRKLGEEGMDDEASKHFVDNYNPKYLTSRDLFDLELSD 360
Query: 278 TNFRRYVLLQFLILFQYFTSTV--------------KSRGEGLELKSDQEKWVKDTTETV 323
F+R++L+Q LIL + S KS + + KW +DT + +
Sbjct: 361 LAFQRHILVQALILIDFLISLTDKAKKKLAVPEVNNKSLLYSFTIVEEDAKWCRDTRQAI 420
Query: 324 YSLIKQTPP--DGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTS--ITDEDKKDEP 378
++ TP +G + ++V+ +L +++W +WK E CP + R P+++ I D +
Sbjct: 421 TDYLRATPTFEEGRFYHRMVETVLARDKNWVRWKLESCPSIVRDPVSTEQILDARRTARN 480
Query: 379 DAKKKKTPE-------LTKLWNSK-DNLEACKSAERDFTPSLESYFEEA-IQQMDPAAAV 429
A+ +K P+ L+ L LEA K R PS+E E Q+D A+
Sbjct: 481 VARPRKVPQRPMGAIDLSFLAEGHGGGLEALKDPSRYTAPSVEELIEGVKADQLDLEMAM 540
Query: 430 EEQYKKVND---SNYAWRALR 447
++ K+ +N WR LR
Sbjct: 541 DDAEKESIQNAITNKTWRVLR 561
>gi|440640377|gb|ELR10296.1| hypothetical protein GMDG_04680 [Geomyces destructans 20631-21]
Length = 843
Score = 151 bits (381), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 130/505 (25%), Positives = 225/505 (44%), Gaps = 93/505 (18%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+ +E ++R + L+ T I+ + N + + L C+ L L+ + D
Sbjct: 67 AVVEIAVRDAFIDLLATYSIEQPQFVQVWNLLDIVSTLSDDGQCDPALLFWLIEELLDSQ 126
Query: 91 TLDKCEQLFYYVEVNVN--IWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
T+ C ++ Y+E I K F K +LR CN+LLRRLSR+++T FCGR+ +F
Sbjct: 127 TIAGCRKILDYLESRRERIIAKD---FAGKKLAILRSCNELLRRLSRAEDTAFCGRVFIF 183
Query: 149 LAKFFPFSERSGLNIISEFNVENITEF----------GGDEEMDVSSNETEETDTEEV-- 196
+ + FP ++S +N+ E++VEN+T F G E+D T ++
Sbjct: 184 MFQSFPLGDKSSVNLRGEYHVENVTTFDQIPAKGETQGDQMEVDAGVEPTPTGPAADIPT 243
Query: 197 --DKVKIDFN---------------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVL 239
+ K+ FN Y FWSLQ F P ++ +K F +
Sbjct: 244 SEEAKKVTFNAASTAQSEQPLSSDDLYPIFWSLQQSFSQPKGLFDTQRFKEFKEGLNLTI 303
Query: 240 AAFKSYKLD-DVQSSLNPSGD-------------------YFAKYLTNQKLLDLQLSDTN 279
AF+S + + + + S P D + KYLT++ L +L++ D
Sbjct: 304 TAFRSVQNESNYRGSSKPVDDENRGTKRKRRQDDDDLAHIFNPKYLTSRDLFELEIKDLL 363
Query: 280 FRRYVLLQFLILFQYFTSTVKSRGEGL--------------ELKSDQE-KWVKDTTETVY 324
FRR++L+Q LI+ + S E ++ SD++ KW DT +++
Sbjct: 364 FRRHILVQALIIMDFLISLGSKAKENQATTTKQNLSVMYQDQILSDEDVKWAVDTKKSIA 423
Query: 325 SLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDED--------KKD 376
+KQ DG F ++V+ +L ++HW +WK E CP ++RP ++T E+ KK
Sbjct: 424 DYLKQG-HDGPFFYRMVETVLSRDKHWVRWKIESCPSIERP--AVTPEEYLAAKATAKKI 480
Query: 377 EPDAKKKKTP----ELTKLWNSKD--NLEACKSAERDFTPSLESYFEE-AIQQMDPAAAV 429
+ + + P EL L + +E K+ R P L S+ ++ A+ M+
Sbjct: 481 TTNKRLRPKPMGSLELGFLSEADGIIGMERLKNPSRYSLPPLASFKDKIALDDMEIDMPT 540
Query: 430 EEQYKKV---NDSNYAWRALRLLSR 451
+ K+ ++ WRALR+ S+
Sbjct: 541 NDDTKEAAIYGKASKTWRALRVASK 565
>gi|189205046|ref|XP_001938858.1| nuclear matrix protein [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187985957|gb|EDU51445.1| nuclear matrix protein [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 620
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 128/455 (28%), Positives = 218/455 (47%), Gaps = 79/455 (17%)
Query: 71 KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN---LLRMCN 127
+D+C+ L ++L+ + D ++ C +F ++E ++ ++ KN +LR+CN
Sbjct: 101 RDLCHPQLVLLLIEELLDSQSISGCRIVFNFLESR----RETIIAINSKNKDLVILRLCN 156
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDE 179
+LLRRLSR+++ VFCGR+ +F+ + FP ++S +N+ F+VEN+T F D+
Sbjct: 157 ELLRRLSRAEDPVFCGRVYIFMFQSFPLGDKSSVNLRGNFHVENVTTFEDFLKEPKTDDD 216
Query: 180 EMDV------------SSNETEETDTEEVDKVK------IDFN-FYKKFWSLQDYFRNPV 220
M V S +E + + EE DK K +D + Y FWSLQ F NP
Sbjct: 217 SMQVDADDAVITPKEESKSEEKAVENEEPDKDKDAKPDTLDTDTLYPVFWSLQHSFSNPP 276
Query: 221 QCYNKVSWKMFTSYAETVLAAFKSY-KLDDVQSSLNPSG---------DYFA-----KYL 265
+ + + +++ F E LA FK K+ S G D FA KYL
Sbjct: 277 RLFEEDNFQQFQKSLEATLAKFKEVPKVIQAGDSERKRGQQTRDGDDYDAFANSFNPKYL 336
Query: 266 TNQKLLDLQLSDTNFRRYVLLQFLILFQY-FTSTVKSRGE------------GLELKSDQ 312
T++ L L+LSD F+R++L+Q LIL + T T KS+ + L+ +
Sbjct: 337 TSRDLFKLELSDLAFQRHILVQALILIDFLLTLTEKSKSKPFYQNAQKAMQYNFTLREED 396
Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
+W + + + Q PDG+ + ++V +L +++W +WK E C R + TD
Sbjct: 397 TEWALGIKTAIVNYL-QEGPDGKFYYRMVDTVLSRDKNWVRWKMENCQPFTRDRVATTDF 455
Query: 373 DKKDEPDAKKKKTPELTK----------LWN--SKDNLEACKSAERDFTPSLESYFEEA- 419
+ K + + T+ L+N ++ L + R P ESY ++
Sbjct: 456 QQAKSGAQKAVVSKKATRPMGVPSTLDFLYNNEAEKGLSQLRQRNRFNVPKAESYAKKVK 515
Query: 420 IQQMD-PAAAVEEQYKKVND--SNYAWRALRLLSR 451
+ ++D A EE+ ++V + S+Y WR LRL S+
Sbjct: 516 MTELDLDMADAEEEKRQVGEKKSSYVWRGLRLASK 550
>gi|303321039|ref|XP_003070514.1| hypothetical protein CPC735_062420 [Coccidioides posadasii C735
delta SOWgp]
gi|240110210|gb|EER28369.1| hypothetical protein CPC735_062420 [Coccidioides posadasii C735
delta SOWgp]
Length = 631
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 101/420 (24%), Positives = 185/420 (44%), Gaps = 89/420 (21%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ L+ T I+ + N + + + C L L+ + D
Sbjct: 66 AAVETAFREKFYHLLATTSINKPEFVHMWNLLDVVSIFSDNERCEPGLIFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF------------------------------ 175
+FL + FP ++S +N+ E++ EN+T F
Sbjct: 180 FIFLFQSFPLGDKSSVNLRGEYHTENVTTFDEPPKHIAANEDADDVEMKDAEAQKPSDNS 239
Query: 176 ----------GGDEEMDVSSNETEETDTEEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYN 224
GG ++ ++++ + + ID + Y FW LQ F P + ++
Sbjct: 240 SKTDAAASSQGGSVSPEIDGQRSKQSTDHQAEDTSIDMDALYPVFWGLQANFSAPTRLFD 299
Query: 225 KVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD---------------------YFAK 263
+ F E+ +++F+ +D + + + + K
Sbjct: 300 HEHFASFKKGLESTISSFQKVSIDLEKRGMTKGSEETSRRGLKRKRGETGPETANTFNPK 359
Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS------------TVKSRGEGLELKSD 311
YLT++ L +L+++D FRR++L+Q LIL + S T KS G L +
Sbjct: 360 YLTSRDLFELEVNDVAFRRHILVQSLILLDFLISLSPKAKAKLADATNKSVLYGYVLSDE 419
Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD 371
KW ++ S ++Q DG+ + ++V +L +++W +WK EGCP +++P I D
Sbjct: 420 DAKWANQMKSSIASYLQQGL-DGKFYYRMVDTVLTRDKNWVRWKAEGCPPIEKPPIQIQD 478
>gi|322709414|gb|EFZ00990.1| nuclear matrix protein [Metarhizium anisopliae ARSEF 23]
Length = 873
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 128/489 (26%), Positives = 218/489 (44%), Gaps = 101/489 (20%)
Query: 48 IKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE---- 103
I++PD +K+ N + + L C+ L L+ + D T+ C +F Y+E
Sbjct: 103 IQSPDF-VKMW-NLLDILSILSDDGQCDPALLFWLVEELLDSQTISACRIVFDYLESRRE 160
Query: 104 -VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLN 162
+ +KQ+ +LR CN+LLRRLSR+++T FCGR+ +FL + FP +RS +N
Sbjct: 161 RIMSKHFKQKHLV------ILRSCNELLRRLSRAEDTAFCGRVYIFLFQSFPLGDRSSVN 214
Query: 163 IISEFNVENITEFGGDEEMDVSSNETE------------ETDTEEVDKVKIDFN------ 204
+ E++VEN+T F E D E++ TDT+ +K + F+
Sbjct: 215 LRGEYHVENVTTFEESAEEDEQQPESKMEVDTPAETPKSSTDTKASNK-GVGFDSKNSEK 273
Query: 205 ------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQS-SL-NP 256
Y FWSLQ+ F P++ ++ + F + + + AF++ ++ S SL NP
Sbjct: 274 PYDPDRLYPLFWSLQESFSQPLKLFDATHFNKFKTRLDLTIKAFQAIHQNEGPSKSLENP 333
Query: 257 SGD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
+ + KYLT++ L +L++SD +FRR++L+Q LI+ + S
Sbjct: 334 RRNLKRKREDEGPEDAPEAFNPKYLTSKDLFELEISDLSFRRHILVQALIITDFLLSLST 393
Query: 301 SRGEGL----------------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLI 344
E L +L + KW + T+ +K+ DG +F ++V+ +
Sbjct: 394 ETREKLAGLNTRSLNKSVLYNDQLSEENTKWATEMKFTISDYLKRG-IDGPYFYRMVETV 452
Query: 345 LKGEEHWNQWKNEGCPELKRPLTSI------------TDEDKKDEPDAKKKKTPELTKLW 392
L +++W WK CP ++R S K+ P + E +
Sbjct: 453 LARDKNWVFWKMTSCPSIQRDPVSPELFVEARTTAQKLATSKRLRPTPMGSVSLEFLRDM 512
Query: 393 NSKDNLEACKSAERDFTPSLESY----------FEEAIQQMDPAAAVEEQYKKVNDSNYA 442
N + +L+ K+ ER P LES+ E AAA+ + K +
Sbjct: 513 NDESSLDQLKTPERYQLPELESFQRKIADDDFEIEMPTNDQTKAAAIAGKASK------S 566
Query: 443 WRALRLLSR 451
WRALR+ +R
Sbjct: 567 WRALRIAAR 575
>gi|281201496|gb|EFA75705.1| putative protein serine/threonine kinase [Polysphondylium pallidum
PN500]
Length = 917
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 104/332 (31%), Positives = 152/332 (45%), Gaps = 53/332 (15%)
Query: 11 KSQLNVVSDKYALYKPTCDYDK----------KSALEQSIRQYLLKLIKTPDIDIKVIEN 60
+SQ + + A + P+ D K +S +E +RQ L L K I + IE
Sbjct: 381 ESQKPAIRESLARHLPSFDQSKVATDLQQAEWRSNIELPLRQGLNNLFKLESIKNEKIEQ 440
Query: 61 YISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS--C 118
I L + + +T + + D F+ ++ + +LF +E V + Q F S
Sbjct: 441 LIELAIFIYDLGYVEATYLLYIFEDLFESRSISESIELFSLMESRVEFFSQDAFVGSGRA 500
Query: 119 KNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--- 175
K LL++CND+LRRLS+S N CGRIL+F++ P S+ SG+N S N E+
Sbjct: 501 KKQLLKLCNDMLRRLSKSTNPEPCGRILIFMSYLIPMSDPSGINSKSINNDEHSNSIELL 560
Query: 176 --------GGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVS 227
+ D NE+ T +D NFY++FWSLQ +F NP Q N V
Sbjct: 561 LKQQQQDQNQLLDRDRDGNESGNT---------VDLNFYRQFWSLQSFFHNPYQLINNVP 611
Query: 228 ------------------WKMFTSYAETVLAAFKSY-KLDDVQSSLNPSGDYFAKYLTNQ 268
W FT E VL F ++ LD++++ P YF KYLT
Sbjct: 612 SSGFVVAANASVIINKDRWNHFTQAVELVLTTFATHINLDELEAL--PKTHYFTKYLTGS 669
Query: 269 KLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
L+ LQL D FRR +L Q L FQ+ T++
Sbjct: 670 NLMKLQLKDAIFRRNILTQLLFTFQFLEITIQ 701
>gi|296818073|ref|XP_002849373.1| nuclear matrix protein [Arthroderma otae CBS 113480]
gi|238839826|gb|EEQ29488.1| nuclear matrix protein [Arthroderma otae CBS 113480]
Length = 648
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 110/428 (25%), Positives = 187/428 (43%), Gaps = 109/428 (25%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ L+ T I+ I N + + + C L L+ + D
Sbjct: 66 AAVETAFREIFYDLLATTSINDPSFMQIWNLLDIVSIFSDNEKCEPGLIFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFG--------GDEEMDVSSNET--------- 188
+FL + FP ++S +N+ E++ EN+T F GDE MD+ E
Sbjct: 180 FIFLFQSFPLGDKSSVNLRGEYHTENVTTFDLQPRAQVEGDEVMDIDKQENTTLLSTAKG 239
Query: 189 ---EETDTEEVDKVKIDFN------------------FYKKFWSLQDYFRNPVQCYNKVS 227
E T + ID + Y FWSLQ+ F P++ ++
Sbjct: 240 SQPESTTVHTPQESGIDSSVQRAEESYDDKPELELDALYSAFWSLQESFSRPIRLFDATH 299
Query: 228 WKMFTSYAETVLAAFK--SYKLD------------------------DVQSSLNPSGDYF 261
+ S E L++F+ S KL+ D+ +S NP
Sbjct: 300 FASLKSGLEATLSSFRKVSSKLEVRGTTKGSDESRRGIKRKRGEGGSDLTNSFNP----- 354
Query: 262 AKYLTNQKLLDLQLSDTNFRRYVLLQFL------------------------ILFQYFTS 297
KYLT++ L +L+++D +FRR++L+Q L +L+ Y S
Sbjct: 355 -KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPKSKAKLADSTNKSVLYGYVLS 413
Query: 298 TVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNE 357
+ + + KW ++ S + Q DG+ + ++V +L +++W +WK E
Sbjct: 414 DEDLLSLAITKSAVKAKWATQMKSSIASYL-QKGADGKFYFRMVDTVLTRDKNWVRWKAE 472
Query: 358 GCPELKRP 365
GCP ++RP
Sbjct: 473 GCPPIERP 480
>gi|395335087|gb|EJF67463.1| hypothetical protein DICSQDRAFT_41994, partial [Dichomitus squalens
LYAD-421 SS1]
Length = 687
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 113/346 (32%), Positives = 158/346 (45%), Gaps = 53/346 (15%)
Query: 72 DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLL 130
D C+ P +L D + T+ C +F ++E + K LLR NDLL
Sbjct: 98 DACDQVFPFTVLQDLLEAQTISSCSHIFSWIEARAERLTEGMVPQKGKALVLLRTLNDLL 157
Query: 131 RRLSRS-QNTVFCGRILLFLAKFFPFSERSGLNIISEFN-VENITEFGGDEEMDVSSNET 188
RRLS + T+F GRIL FL++ FP ERSG+N+ E+ V + G E S+ E
Sbjct: 158 RRLSNTGATTIFTGRILTFLSRVFPLGERSGVNLRGEYGPVWDGPGTKGQEREKASATEK 217
Query: 189 EETDTEEV--DKVKIDFN----------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAE 236
+E + E DK K+D + FY FW LQ F P ++ F
Sbjct: 218 QEGENGEGEKDKDKMDVDRRKEKEGGSDFYYTFWGLQLPFSRPPLFAEPGTFAAFKEAVS 277
Query: 237 TVLAAFKSYKLDDVQSSLN--------PSGD------------------YFAKYLTNQKL 270
VL K D N PSG YFAKYLT+ L
Sbjct: 278 KVLPVIKEATAKDRALMGNKSNGIPTGPSGPLKRKRAAGVPESAAKSEYYFAKYLTSPDL 337
Query: 271 LDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLE----LKSDQEKWVKD 318
LDL+++DT+FRR L Q LIL + FT T K+ R L+ L + +WV++
Sbjct: 338 LDLEIADTHFRRQFLFQLLILLHHLLTFTKTAKATWASTRNRSLQIDFTLDTADAQWVQE 397
Query: 319 TTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
T ++QT P+G FS+ V++IL+ E++W +WKNE C R
Sbjct: 398 TITRATEELRQTSPNGRAFSETVQVILEREKNWVKWKNELCTPFDR 443
>gi|452839889|gb|EME41828.1| hypothetical protein DOTSEDRAFT_74028 [Dothistroma septosporum
NZE10]
Length = 639
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 132/506 (26%), Positives = 216/506 (42%), Gaps = 99/506 (19%)
Query: 35 ALEQSIRQYLLKLIKTPDID-IKVIENYISLCVELCMKDMCNST--LPIILLSDTFDMST 91
A+E + RQ LI T I +E + L + L D ST L + L+ + D T
Sbjct: 67 AVETAARQLFYSLITTCHIQQPGFVEVWNFLDIILLCGDQGKSTPELVVWLIEELLDSQT 126
Query: 92 LDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL--LRMCNDLLRRLSRSQNTVFCGRILLFL 149
C +F Y++ + Q+ F K NL LR CN+LLRRLSR+++ +FCGR+ FL
Sbjct: 127 THGCRTVFDYLDSRRDRLAQKDFH---KKNLVFLRSCNELLRRLSRAEDAIFCGRVFFFL 183
Query: 150 AKFFPFSERSGLNIISEFNVENITEF-------GGDEEMDVSSNETEETDTEEV------ 196
+ FP ++S +N+ EF+ EN T+F E MD S + E
Sbjct: 184 FQTFPLGDKSSVNLRGEFHTENTTKFDVVVEQSAAGERMDTDSKAGDLVAKEATPRGTPQ 243
Query: 197 -----------------------DKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTS 233
+++ Y FW LQ F +P + +N+ ++ +F
Sbjct: 244 PAGPEKPGSKAVPVKTPSKRTVEEEILTSSELYPIFWRLQHDFSDPTRLFNEGNFALFKK 303
Query: 234 YAETVLAAF----------------KSYKLDDVQSSLNPSG------DYFAKYLTNQKLL 271
+ +A F + K ++ +NP+ +Y KYLT++ L
Sbjct: 304 GLASSIAKFQKTPTVVQTRSGDNDKRGTKRKTDENGVNPNASEHLVDNYNPKYLTSRDLF 363
Query: 272 DLQLSDTNFRRYVLLQFLILFQYFTSTV--------------KSRGEGLELKSDQEKWVK 317
+L+LSD F+R++L+Q LIL + S KS L + KW K
Sbjct: 364 ELELSDLAFQRHILVQALILIDFLLSLTEQAKKRLVELPTPNKSMLYAFTLSEENTKWAK 423
Query: 318 DTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTSITDEDKKD 376
+ + ++ T DG ++++V+ +L +++W +WK E CP + R P+T T+++
Sbjct: 424 SSRSVIADYLRST-ADGAQYNRMVETVLARDKNWVRWKIESCPSIVRGPVT--TEQELAA 480
Query: 377 EPDAKKKKTP-----------ELTKL-WNSKDNLEACKSAERDFTPSLESYFEEA-IQQM 423
A+ P +L+ L S LE R PS+E E ++
Sbjct: 481 RKGARSAVRPRNVPDKPNGAMDLSFLDEGSGGGLEVLMDPSRYTAPSIEQLVEVVKTDKL 540
Query: 424 DPAAAVEEQYKKVNDS--NYAWRALR 447
D A E+ + ++ N WRALR
Sbjct: 541 DLDFADGEEKTSLENAMQNAKWRALR 566
>gi|240279266|gb|EER42771.1| nuclear matrix protein [Ajellomyces capsulatus H143]
gi|325089536|gb|EGC42846.1| nuclear matrix protein [Ajellomyces capsulatus H88]
Length = 654
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 133/527 (25%), Positives = 233/527 (44%), Gaps = 124/527 (23%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+ +E + R+ L+ T ID + N + + + C L L+ + D
Sbjct: 66 AVVETAFREKFYDLLATTSIDEPSFIQMWNLLDIISIFSDNEQCEPGLLFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE------------EMDVSSNE-TEETD 192
+FL + FP +RS +N+ EF+ EN+T F DE +MD+S E T +T
Sbjct: 180 FIFLFQSFPLGDRSSVNLRGEFHTENVTTF--DELPKAQESSETPMDMDISEKEITADTK 237
Query: 193 TEE-----------------VDKVKIDF------------------NFYKKFWSLQDYFR 217
TE V+ ++I + Y FWSLQ F
Sbjct: 238 TESAQVTKQEPAGQITHPAAVESLQISKTDAMKPVESEPNSILSMDDLYPIFWSLQTNFS 297
Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------------DVQS 252
+P ++ +++ F + E L+ F+ D ++ +
Sbjct: 298 SPTSLFDPANFEAFKAGLEASLSVFQKVNADMEVRMMKGSEEARRGPKRRRIGDGTEMTN 357
Query: 253 SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLE---- 307
S NP KYLT++ L +L+++D FRR++L+Q LIL + S T K++ + +
Sbjct: 358 SFNP------KYLTSRDLFELEVNDVAFRRHILVQSLILLDFVLSLTPKAKAKLADSTNK 411
Query: 308 -------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
L + KW ++ + ++Q +G+ + ++V +L +++W +WK E CP
Sbjct: 412 SVLYNYVLNDEDTKWASRMKTSIATYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEACP 470
Query: 361 ELKRPLTSITD------EDKKDEPDAKKKKTP----ELTKLWNSKDN--LEACKSAERDF 408
++ SI D K + + TP +L L K++ ++ K +R
Sbjct: 471 PIEHTPVSIQDYLDTQTSAIKVSTSKRLRSTPLGSLDLNFLMEDKNSSRVDRLKHPDRFS 530
Query: 409 TPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
TP+L+ Y +D + + EE+ + + ++ WR LRL SR
Sbjct: 531 TPALDFYLRGIADDEFNIDMSQSREEKDEAIKAKASKTWRTLRLSSR 577
>gi|403412199|emb|CCL98899.1| predicted protein [Fibroporia radiculosa]
Length = 757
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 110/368 (29%), Positives = 173/368 (47%), Gaps = 70/368 (19%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQ 112
+ + + L + +C+ + +L D + T+ C +F ++E + V++ Q+
Sbjct: 78 LRSRLDLVLTFTEHGICDQSFVFHVLYDLLETQTIASCSHIFSWIEERMDRLTVDMVPQK 137
Query: 113 TFFMSCKNNLLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNIISEFNV-- 169
+ LLR NDLLRRLS+ NT +FCGRIL FL++ FP ERSG+N+ +
Sbjct: 138 GKALI----LLRALNDLLRRLSKMGNTTMFCGRILTFLSRVFPLGERSGVNLRGGYGPVW 193
Query: 170 ---------------------ENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKK 208
E E G +++M+V + EE + E K + +FY
Sbjct: 194 DGPGVKGSEGDIKMQTREEGGEKSAEGGEEDKMEVDGVQVEEQENREAKKNQQKADFYDT 253
Query: 209 FWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD---------VQSSLNPS-- 257
FWSLQ F P ++ F +L K D V ++ PS
Sbjct: 254 FWSLQLPFSRPPLFSEPGTFVQFKEAVNKILPVIKEATTKDRALMGSKSLVGNAGGPSLK 313
Query: 258 ------------GD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVK 300
G +FAKYLT+ +LLDL+++DT+FRR L Q LIL + FT TVK
Sbjct: 314 RKRDSEGAEDNTGKEYFFAKYLTSPELLDLEIADTHFRRQFLFQLLILLNHLLTFTKTVK 373
Query: 301 S-----RGEGLE----LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
+ R L+ L+++ +WV+D ++QT P G F++ V++IL+ E++W
Sbjct: 374 ATWSTPRNRSLQIDFTLEAEDAQWVQDIVAKATEELRQTAPSGRAFAETVQIILEREKNW 433
Query: 352 NQWKNEGC 359
+WKNE C
Sbjct: 434 VKWKNELC 441
>gi|336469264|gb|EGO57426.1| hypothetical protein NEUTE1DRAFT_80974 [Neurospora tetrasperma FGSC
2508]
gi|350291103|gb|EGZ72317.1| hypothetical protein NEUTE2DRAFT_111847 [Neurospora tetrasperma
FGSC 2509]
Length = 853
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 105/391 (26%), Positives = 184/391 (47%), Gaps = 56/391 (14%)
Query: 32 KKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFD 88
+ + +E ++R LI T ID + N + L + C+ L + L+ + D
Sbjct: 75 RHAIIETAVRDTFSNLIATTSIDSPDFARVWNLFDILSILSDDEQCDPALLLWLVEELLD 134
Query: 89 MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
T+ C ++F ++E + F K +LR CN+LLRRLSR+ + F GR+ ++
Sbjct: 135 SQTIAGCRKVFDFLESRRERITAKHFGQK-KLVILRTCNELLRRLSRALDPAFSGRVFIY 193
Query: 149 LAKFFPFSERSGLNIISEFNVENI---------TEFGGDEEMDV-------SSNETEE-- 190
L + FP +RS +N+ E++VEN+ TE + MDV S+ +T E
Sbjct: 194 LFQSFPLGDRSSVNLRGEYHVENVTTWDQEPTKTEDASADRMDVDTETQGESAQKTGEQR 253
Query: 191 -------TDTEEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF 242
TD ++ +D Y FWSLQ+ F P + + + F S E +A F
Sbjct: 254 STPKPGATDAKKQQDKALDLEALYPIFWSLQESFNQPKKLFEPAQFASFKSGLEATMATF 313
Query: 243 KSYKLDDVQSSLNP----------SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
+S K D+ + +L + + KYLT++ L L++ D RR VL+Q LI+
Sbjct: 314 QSIKSDESKRALKRKRGEGEDDELANGFNPKYLTSRDLFKLEIGDLALRRNVLVQALIIM 373
Query: 293 QYFTS----------TVKSRGEGL-----ELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
++ S ++K+ + + +L + WV D + + +K P+G +F
Sbjct: 374 EFLLSLSPKGKEKIASIKAPNKSVSHSDQQLSEEDTHWVLDMKKKIADYLKLG-PEGPYF 432
Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRPLTS 368
++++ +L +++W WK EGCP ++ P S
Sbjct: 433 YRMIETVLSRDKNWVHWKVEGCPPIEMPAVS 463
>gi|225555030|gb|EEH03323.1| nuclear matrix protein [Ajellomyces capsulatus G186AR]
Length = 654
Score = 146 bits (368), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 132/527 (25%), Positives = 231/527 (43%), Gaps = 124/527 (23%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+ +E + R+ L+ T ID + N + + + C L L+ + D
Sbjct: 66 AVVETAFREKFYDLLATTSIDEPSFIQMWNLLDIVSIFSDNEQCEPGLLFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE------------EMDVSSNE-TEETD 192
+FL + FP +RS +N+ EF+ EN+T F DE +MD+ E T +T
Sbjct: 180 FIFLFQSFPLGDRSSVNLRGEFHTENVTTF--DELPKAQESSETPMDMDIPEKEITADTK 237
Query: 193 TEE-----------------VDKVKIDF------------------NFYKKFWSLQDYFR 217
TE V+ ++I + Y FWSLQ F
Sbjct: 238 TESAQVTKQEPAGQTTHPAAVESLQISKTDAMKPVESEPNSILSMDDLYPIFWSLQTNFS 297
Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------------DVQS 252
+P ++ ++ F + E L+ F+ D ++ +
Sbjct: 298 SPTSLFDPANFAAFKAGLEASLSVFQKVNADMEVRMMKGSEEARRGPKRRRIGDGTEITN 357
Query: 253 SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLE---- 307
S NP KYLT++ L +L+++D FRR++L+Q LIL + S T K++ + +
Sbjct: 358 SFNP------KYLTSRDLFELEVNDVAFRRHILVQSLILLDFVLSLTPKAKAKLADSTNK 411
Query: 308 -------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
L + KW ++ + ++Q +G+ + ++V +L +++W +WK E CP
Sbjct: 412 SVLYNYVLNDEDTKWASQMKTSIATYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEACP 470
Query: 361 ELKRPLTSITD------EDKKDEPDAKKKKTP----ELTKLWNSKDN--LEACKSAERDF 408
++ SI D K + + TP +L L K++ ++ K +R
Sbjct: 471 PIEHTPVSIQDYLDTQTSAIKVSTSKRLRSTPLGSLDLNFLMEDKNSSRVDRLKHPDRFS 530
Query: 409 TPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
TP+L+ Y +D + + EE+ + + ++ WR LRL SR
Sbjct: 531 TPALDFYLRGIADDEFNIDMSQSREEKDEAIKAKASKTWRTLRLSSR 577
>gi|330918120|ref|XP_003298095.1| hypothetical protein PTT_08697 [Pyrenophora teres f. teres 0-1]
gi|311328880|gb|EFQ93789.1| hypothetical protein PTT_08697 [Pyrenophora teres f. teres 0-1]
Length = 621
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 124/456 (27%), Positives = 216/456 (47%), Gaps = 80/456 (17%)
Query: 71 KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN---LLRMCN 127
+D+C+ L ++L+ + D ++ C +F ++E ++ ++ KN +LR+CN
Sbjct: 101 RDLCHPQLVLLLIEELLDSQSISGCRIVFNFLESR----RETIIAINSKNKDLVILRLCN 156
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDE 179
+LLRRLSR+++ VFCGR+ +F+ + FP ++S +N+ F+VEN+T F D+
Sbjct: 157 ELLRRLSRAEDPVFCGRVYIFMFQSFPLGDKSSVNLRGNFHVENVTTFEDFLKESKSDDD 216
Query: 180 EMDVSSNET------EETDTEE-------------VDKVKIDFN-FYKKFWSLQDYFRNP 219
M V +++ EE +EE +D + Y FWSLQ F NP
Sbjct: 217 SMQVDADDAVISTPKEEAKSEEKAVENDEPDKDQDAKPTTLDTDTLYPVFWSLQHSFSNP 276
Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSY-KLDDVQSSLNPSG---------DYFA-----KY 264
+ + + +++ F E LA FK K+ S G D FA KY
Sbjct: 277 PRLFEEDNFQQFQKSLEATLAKFKEVPKVIQAGDSERKRGQQTRDGDDYDAFANSFNPKY 336
Query: 265 LTNQKLLDLQLSDTNFRRYVLLQFLILFQY-FTSTVKSRGE------------GLELKSD 311
LT++ L L+LSD F+R++L+Q LIL + T T KS+ + L +
Sbjct: 337 LTSRDLFKLELSDLAFQRHILVQALILIDFLLTLTEKSKSKPFYQNAQKAMQYNFTLGEE 396
Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD 371
+ +W + + + Q PDG+ + ++V +L +++W +WK E C R + TD
Sbjct: 397 ETEWALGIKTAIVNYL-QEGPDGKFYYRMVDTVLSRDKNWVRWKMENCQPFTRDRVATTD 455
Query: 372 EDKKDEPDAKKKKTPELTK----------LWN--SKDNLEACKSAERDFTPSLESYFEEA 419
+ K + + T+ L+N ++ L + +R P ESY ++
Sbjct: 456 FQQAKSGAQKAVVSKKATRPMGVPSTLDFLYNNEAEKGLLQLRQRDRFNVPKAESYAKKV 515
Query: 420 -IQQMD-PAAAVEEQYKKVND--SNYAWRALRLLSR 451
+ +D A EE+ ++V + S+Y WR LR+ S+
Sbjct: 516 KMTDLDLDMADAEEEKRQVGEKKSSYVWRGLRVASK 551
>gi|145342784|ref|XP_001416269.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576494|gb|ABO94562.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 777
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 111/438 (25%), Positives = 199/438 (45%), Gaps = 73/438 (16%)
Query: 73 MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRR 132
M +T+ I+L + T+ + E + + + N ++ T + K LRM L +R
Sbjct: 46 MIFTTIEIVL-----EACTIAEAEDVLLWCDENKTRLRETTIWKKGKLPTLRMLIALAKR 100
Query: 133 -LSRSQNTVFC-GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDE----------- 179
+ RS + GR L+ ++ F S+RSG+N + N E F +E
Sbjct: 101 CVGRSSDEARVKGRALVLISWFMGLSDRSGMNKNGDSNPETTAIFEQEEWLREELEDGVG 160
Query: 180 -------EMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNK-VSWKMF 231
+M+V S E + ++ ++ +D F++ FWSLQ YF++P +K +W F
Sbjct: 161 VDADAKIKMEVDSGEGDVAADDDDNEFGVDDAFHRTFWSLQQYFQDPPTTMSKNGAWNSF 220
Query: 232 TSYAETVLAAFKSYKLDDVQSSLN--PSGD----YFAKYLTNQKLLDLQLSDTNFRRYVL 285
+ VL F+++ L + +++ P + ++LT ++LL LQ+ D FRR +L
Sbjct: 221 HAILCVVLDKFEAFPLGETAGTIDEEPESEKRKMIGHRFLTARRLLPLQMQDFVFRRQIL 280
Query: 286 LQFLILFQYFTSTVKSRGEGLELKSDQEKW--------VKDTTETVYSLIKQTPPDGEHF 337
+Q + QY D EK+ V + +E V ++K+T P+
Sbjct: 281 IQAVFFLQYM---------------DNEKFKEHVVDDEVTELSERVMKILKRTGPNAVAD 325
Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK------------- 384
+ +++ L+ E W +WK +GCP ++ E D D K
Sbjct: 326 AGIIENALEDELSWRKWKEQGCPSFEKQPIDFETEPMPDNWDKKYSNWPPDQYPPDDPNL 385
Query: 385 -----TPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDS 439
PEL +LWN ++ S + S+E + + +++MDP+ VE +Y++VND
Sbjct: 386 KHDFDHPELNRLWNLGEDDPNGTSVDALKEVSVEEFLKPCLEEMDPSQQVEAEYRRVNDP 445
Query: 440 NYAWRALRLLSRKCPHFF 457
+ W+A+R+LS+ H F
Sbjct: 446 TFRWKAMRMLSKNHLHLF 463
>gi|451996956|gb|EMD89422.1| hypothetical protein COCHEDRAFT_1196301 [Cochliobolus
heterostrophus C5]
Length = 620
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 127/462 (27%), Positives = 213/462 (46%), Gaps = 92/462 (19%)
Query: 71 KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN---LLRMCN 127
+D+C+ L ++L + D ++ C +F ++E ++ ++ KN +LR+CN
Sbjct: 101 RDLCSPQLVLLLTEELLDSQSISGCRVVFNFLESR----REAIIAINSKNKDLVILRLCN 156
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDE 179
+LLRRLSR+++ VFCGR+ +F+ + FP ++S +N+ F+VENIT F DE
Sbjct: 157 ELLRRLSRAEDPVFCGRVYIFMFQSFPLGDKSSVNLRGNFHVENITTFEDFLKDSQPQDE 216
Query: 180 EMDVSSNET-----EETD-------TEEVDKVK------IDFN-FYKKFWSLQDYFRNPV 220
M V +++ EE D T E+D+ K +D + Y FWSLQ F NP
Sbjct: 217 AMQVDGDDSVPKIREEPDSGETTTQTGEIDEAKDGKPKTLDVDTLYPVFWSLQHSFSNPP 276
Query: 221 QCYNKVSWKMFTSYAETVLAAFKSYK----------------------LDDVQSSLNPSG 258
+ + + +K F E LA FK D SS NP
Sbjct: 277 RLFEEDHFKQFQQGLEATLAKFKEVPKVIQAGDSERKKGQVSKDEGEDYDAFASSFNP-- 334
Query: 259 DYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY-FTSTVKSRGE------------G 305
KYLT++ L L+LSD F+R++L+Q LIL + T T KS+ +
Sbjct: 335 ----KYLTSRDLFRLELSDLAFQRHILVQALILIDFLLTLTEKSKSKPFYQNAQKAMQYN 390
Query: 306 LELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
L ++Q +W + + + Q PDG+ + ++V +L +++W +WK E C R
Sbjct: 391 FTLGAEQTEWALGIKNAIANYL-QEGPDGKFYYRMVDTVLSRDKNWVRWKMENCQPFTRD 449
Query: 366 LTSITDEDKK---------DEPDAKKKKTPELTK-LWN--SKDNLEACKSAERDFTPSLE 413
TD + + A+ P + L+N ++ L + ++R P E
Sbjct: 450 RVPTTDLQQAKTGAQKAVVSKKAARPMGVPSMFDFLYNNEAEKGLSQLRQSDRFKVPDAE 509
Query: 414 SYFEEAIQQ---MDPAAAVEEQYK-KVNDSNYAWRALRLLSR 451
Y + + MD A EE+ + + + ++ WR LR+ S+
Sbjct: 510 WYGKRVQMRDLDMDMADGDEEKRQLEESKASDVWRGLRVASK 551
>gi|393213320|gb|EJC98817.1| hypothetical protein FOMMEDRAFT_114065 [Fomitiporia mediterranea
MF3/22]
Length = 768
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 177/387 (45%), Gaps = 81/387 (20%)
Query: 61 YISLCVELCM------KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTF 114
Y SLC L + ++ C+ +L D + T+ C +F ++E +
Sbjct: 76 YDSLCDRLDVILTFTEREACDPMFMFTVLEDLLETQTVASCSHIFSWIERRSARLTEGMV 135
Query: 115 FMSCKN-NLLRMCNDLLRRLSRSQNTV-FCGRILLFLAKFFPFSERSGLNIISEFNVE-- 170
K LLRM NDLLRRLSR+ NT FCGRIL FL+ FP +RSG+N+ ++ +
Sbjct: 136 PQKGKALVLLRMLNDLLRRLSRAGNTTTFCGRILTFLSAVFPLGDRSGVNLRGDYGPQWE 195
Query: 171 --NITEFGGDEEMDV----SSNETEETDT------------EEVDKVKID---------- 202
+ +E+M+V S E + D + DK+ D
Sbjct: 196 PVSFKRAAVEEKMEVDEAASGREAGKVDAEVKAEVKEEEKGDASDKMDEDRPQTQAPAAE 255
Query: 203 -------FNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYAETVLAAFKSYKLDD----- 249
Y FWSLQ F P + ++ F + TV+ K + +
Sbjct: 256 KTEEEKKEELYNTFWSLQLPFSRPSIFATEPNAFAEFKTAVNTVMPVIKEATIKERAMMG 315
Query: 250 ---VQSSL------NPS-----GD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
V +L P+ GD +FAK+LT+ +LLDL+++DT FRR VL Q LIL
Sbjct: 316 SKVVAGALKRKRESTPTATMDDGDRKYFFAKFLTSPELLDLEIADTQFRRQVLFQLLILL 375
Query: 293 QY---FTSTVK-----SRGEGLE----LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQV 340
+ FT+ K R L+ L+ D KWV++T VY ++QT P+ + F++
Sbjct: 376 THVLQFTAAEKQNWATPRNRSLQMDFTLQPDDAKWVQETITRVYDEMRQTSPNAKAFTET 435
Query: 341 VKLILKGEEHWNQWKNEGC-PELKRPL 366
V+ L+ E++W +WKN+ C P K PL
Sbjct: 436 VQTTLEREKNWVRWKNDLCSPFDKAPL 462
>gi|19075592|ref|NP_588092.1| THO complex subunit (predicted) [Schizosaccharomyces pombe 972h-]
gi|31077065|sp|Q9URT2.1|YJU3_SCHPO RecName: Full=Uncharacterized protein P25A2.03
gi|6562903|emb|CAB62828.1| THO complex subunit (predicted) [Schizosaccharomyces pombe]
Length = 752
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 125/448 (27%), Positives = 197/448 (43%), Gaps = 72/448 (16%)
Query: 53 IDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQ 112
+D+ VI N I+ + C+ LP ++L + D+ T+++C +L+ Y E ++ K
Sbjct: 83 LDLLVIGNEIN-------ESFCDHLLPFLILEELMDIHTVNECAKLYEYFETRPSLMKGI 135
Query: 113 TFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENI 172
LLR+ N+LLRRLSR +N+ FCGRI + L+K FP ERSG N+ ++N +
Sbjct: 136 VSNRGRGPVLLRISNELLRRLSRQENSSFCGRIDILLSKAFPPEERSGANLRGDYNT--V 193
Query: 173 TEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFT 232
FG E S+ ++ TD K+ F Y W LQ NP + + F
Sbjct: 194 HSFGKVELSPPSTPISDRTDLSYHKKLNTLFTAY---WDLQCMCSNPPKLLASDTLPKFI 250
Query: 233 SYAETVLAAFKSY------------KLDDVQSSL------NPSGDYFAKYLTNQKLLDLQ 274
A + + AF+S +D SSL + +KY+ ++ L + Q
Sbjct: 251 DAAGSAIQAFESILQNTFFNGKSNPTIDPNSSSLLSEKYITLDKGFPSKYIYSRSLFEYQ 310
Query: 275 LSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDG 334
LSD +FR +LQ +I+F + K R E L EKW T + V ++ + D
Sbjct: 311 LSDEDFRLQAILQLIIIFDFLLDHSKERIERRTL----EKW---TNKAVIPIVILSDEDT 363
Query: 335 EHFSQV-------------------VKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKK 375
+++ +K I+ E +W WK GCP L++PL D+
Sbjct: 364 SKLNELSKEAYSFLHTARCGSVQRTIKEIIHIEGNWKLWKGLGCPSLEKPLVDKAAIDEA 423
Query: 376 DEPDAKKKKTP----------ELTKLWN--SKDNLEACKSAERDFTPSLESYFEEA-IQQ 422
E K TP L++LW ++ L+ K ER PS ES+ +
Sbjct: 424 VEGLKKLTNTPVKLRFAMGNAALSRLWEQAGENTLDDLKKEERYRIPSPESFLSGVKADK 483
Query: 423 MDPAAAVEEQYKKVNDSNYA---WRALR 447
+ AV + K ++ + A WRA R
Sbjct: 484 FEIEEAVRDDDKHFHEQSLATKTWRAFR 511
>gi|85109865|ref|XP_963126.1| hypothetical protein NCU07865 [Neurospora crassa OR74A]
gi|28924778|gb|EAA33890.1| predicted protein [Neurospora crassa OR74A]
Length = 868
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 107/404 (26%), Positives = 184/404 (45%), Gaps = 69/404 (17%)
Query: 32 KKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFD 88
+ + +E ++R KLI T ID + N + L + C+ L + L+ + D
Sbjct: 75 RHAIIETAVRDTFSKLIATTSIDSPDFARVWNLFDILSILSDDEQCDPALLLWLVEELLD 134
Query: 89 MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
T+ C ++F ++E + F K +LR CN+LLRRLSR+ + F GR+ ++
Sbjct: 135 SQTIAGCRKVFDFLESRRERITAKHFGQK-KLVILRTCNELLRRLSRALDPAFSGRVFIY 193
Query: 149 LAKFFPFSERSGLNIISEFNVENI---------TEFGGDEEMDV-------SSNETEE-- 190
L + FP +RS +N+ E++VEN+ TE + MDV S+ +T E
Sbjct: 194 LFQSFPLGDRSSVNLRGEYHVENVTTWDQEPTKTEDASADRMDVDTETQGESAQKTGEQR 253
Query: 191 -------TDTEEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF 242
TD ++ +D Y FWSLQ+ F P + + + F S E +A F
Sbjct: 254 STPKPGATDAKKQQDKALDLEALYPIFWSLQESFNQPKKLFEPAQFASFKSGLEATMATF 313
Query: 243 KSYKLDDVQSSLNP-----------------------SGDYFAKYLTNQKLLDLQLSDTN 279
+S K+D S + + + KYLT++ L L++ D
Sbjct: 314 QSIKVDQSARSRDSKDKSDESKRALKRKRGEGEDDELANGFNPKYLTSRDLFKLEIGDLA 373
Query: 280 FRRYVLLQFLILFQYFTS----------TVKSRGEGL-----ELKSDQEKWVKDTTETVY 324
RR VL+Q LI+ ++ S ++K+ + + +L + WV D + +
Sbjct: 374 LRRNVLVQALIIMEFLLSLSPKGKEKIASIKAPNKSVSHSDQQLSEEDTNWVLDMKKKIA 433
Query: 325 SLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS 368
+K P+G +F ++V+ +L +++W WK EGCP ++ P S
Sbjct: 434 DYLKLG-PEGPYFYRMVETVLSRDKNWVHWKVEGCPPIEMPAVS 476
>gi|154273190|ref|XP_001537447.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150415959|gb|EDN11303.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 649
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 129/526 (24%), Positives = 227/526 (43%), Gaps = 115/526 (21%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+ +E + R+ L+ T ID + N + + + C L L+ + D
Sbjct: 66 AVVETAFREKFYDLLATTSIDEPSFIQMWNLLDIVSIFSDNEQCEPGLLFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
T+D C ++F Y+E ++ +LR CN+LLRRLSR+++TVFCGR+ +FL
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKSLI------ILRSCNELLRRLSRAEDTVFCGRVFIFLF 179
Query: 151 KFFPFSERSGLNIISEFNVENITEF--------GGDEEMDVSSNE---TEETDTEE---- 195
+ FP +RS +N+ EF+ EN+T F + MD+ E T +T TE
Sbjct: 180 QSFPLGDRSSVNLRGEFHTENVTTFDELPKAQESSETPMDIDIPEKEITADTKTESAQVT 239
Query: 196 -------------VDKVKIDF------------------NFYKKFWSLQDYFRNPVQCYN 224
V+ ++I + Y FWSLQ F +P ++
Sbjct: 240 KQEPAGQITHPAAVESLQISKTDAMKPVESEPNSILSMDDLYPIFWSLQTNFSSPTSLFD 299
Query: 225 KVSWKMFTSYAETVLAAFKSYKLD-------------------------DVQSSLNPSGD 259
++ F + E L+ F+ D ++ +S NP
Sbjct: 300 PSNFAAFKAGLEASLSVFQKVNADMEVRMMKGSEEARRGPKRRRIGDGTEMTNSFNP--- 356
Query: 260 YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLE----------- 307
KYLT++ L +L+++D FRR++L+Q LIL + S T K++ + +
Sbjct: 357 ---KYLTSRDLFELEINDVAFRRHILVQSLILLDFVLSLTPKAKAKLADSTNKSVLYNYV 413
Query: 308 LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLT 367
L + KW ++ + ++Q +G+ + ++V +L +++W +WK E CP ++
Sbjct: 414 LNDEDTKWASQMKTSIATYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEACPPIEHTPV 472
Query: 368 SITD------EDKKDEPDAKKKKTP----ELTKLWNSKDN--LEACKSAERDFTPSLESY 415
SI D K + + TP +L L K++ ++ K +R TP+L+ Y
Sbjct: 473 SIQDYLDTQTSAIKVSTSKRLRSTPLGSLDLNFLMEDKNSSRVDRLKHPDRFSTPALDFY 532
Query: 416 FEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSRKCPHFF 457
+D + + EE+ + + ++ WR LRL SR F
Sbjct: 533 LRGIADDEFNIDMSQSREEKDEAIKAKASKTWRTLRLSSRSKLALF 578
>gi|451847862|gb|EMD61169.1| hypothetical protein COCSADRAFT_174485 [Cochliobolus sativus
ND90Pr]
Length = 620
Score = 142 bits (358), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 107/366 (29%), Positives = 174/366 (47%), Gaps = 76/366 (20%)
Query: 71 KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN---LLRMCN 127
+D+C+ L ++L + D ++ C +F ++E ++ ++ KN +LR+CN
Sbjct: 101 RDLCSPQLVLLLTEELLDSQSISGCRVVFNFLESR----REAIIAINSKNKDLVILRLCN 156
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDE 179
+LLRRLSR+++ VFCGR+ +F+ + FP ++S +N+ F+VENIT F DE
Sbjct: 157 ELLRRLSRAEDPVFCGRVYIFMFQSFPLGDKSSVNLRGNFHVENITTFEDFLKDSQPQDE 216
Query: 180 EMDVSSNET-----EETDTEE-------VDKVK------IDFN-FYKKFWSLQDYFRNPV 220
M V +++ EE ++EE D+ K +D + Y FWSLQ F NP
Sbjct: 217 SMQVDGDDSVSKIREEPESEERTAQTGDGDEAKDGKPKTLDVDTLYPVFWSLQHSFSNPP 276
Query: 221 QCYNKVSWKMFTSYAETVLAAFKSYK----------------------LDDVQSSLNPSG 258
+ + + +K F E LA FK D SS NP
Sbjct: 277 RLFEEDHFKQFQQGLEATLAKFKEVPKVIQAGDSERKKGQVSKDEGEDYDAFASSFNP-- 334
Query: 259 DYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY-FTSTVKSRGE------------G 305
KYLT++ L L+LSD F+R++L+Q LIL + T T KS+ +
Sbjct: 335 ----KYLTSRDLFRLELSDLAFQRHILVQALILIDFLLTLTEKSKSKPFYQNAQKAMQYN 390
Query: 306 LELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
L ++Q +W + + + Q PDG+ + ++V +L +++W +WK E C R
Sbjct: 391 FTLGAEQTEWALGIKNAIANYL-QEGPDGKFYYRMVDTVLSRDKNWVRWKMENCQPFTRD 449
Query: 366 LTSITD 371
TD
Sbjct: 450 RVPTTD 455
>gi|409052224|gb|EKM61700.1| hypothetical protein PHACADRAFT_248470 [Phanerochaete carnosa
HHB-10118-sp]
Length = 781
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 109/402 (27%), Positives = 179/402 (44%), Gaps = 69/402 (17%)
Query: 27 TCDYDKKSALEQSIRQYLLKLIKTPDIDIK--------VIENYISLCVELCMKDMCNSTL 78
T ++++ E ++R + L T + +K ++ + L + KD C+
Sbjct: 40 TSADNRRTQWEYALRHEVFTLAATEGLALKDPSTAYYDNLKLRLDLVLSFTEKDACDQIF 99
Query: 79 PIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRLSRS- 136
P LL + + T+ C +F ++E + + K LLR NDLLRRLS++
Sbjct: 100 PFNLLYELLETQTIPSCSHIFSWIESRADRLTENMVPQKGKALVLLRTLNDLLRRLSKTG 159
Query: 137 QNTVFCGRILLFLAKFFPFSERSGLNIISEFN-------------------------VEN 171
T+FCGRIL FL++ FP ERSG+N+ E+ +E+
Sbjct: 160 STTLFCGRILTFLSQVFPLGERSGVNLRGEYGPTWDGPGVKEETEDVKMDTVEEVKQLED 219
Query: 172 ITEFGGDEEMDVSSNET-EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKM 230
+ G+ D + E EE+ + K + FY+ FWSLQ F P ++
Sbjct: 220 KMQVDGEAPKDGAKEEANEESPNPDEKKQQQKDEFYRTFWSLQLPFSRPSVFTEPQTFDQ 279
Query: 231 FTSYAETVL---------------------AAFKSYKLDDVQSSLNPSGDYFAKYLTNQK 269
F + +L ++ K + +V G FAKYLT+ +
Sbjct: 280 FKTSMNKILPVIKEATAKERALMGSKANSGSSLKRRREPEVNIDTANQGYLFAKYLTSLE 339
Query: 270 LLDLQLSDTNFRRYVLLQ----------FLILFQYFTSTVKSRGEGLE--LKSDQEKWVK 317
LLDL+++DT+FRR L Q F + ST ++R ++ L D EKWV
Sbjct: 340 LLDLEIADTHFRRQFLFQLLILLNHLLTFTKATKITWSTPRNRSLQMDFTLGPDDEKWVH 399
Query: 318 DTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
+ + ++QT P+G F++ V +IL+ E++W +WKNE C
Sbjct: 400 EAIQKTNEELRQTTPNGRVFAETVAVILEREKNWIRWKNELC 441
>gi|367026702|ref|XP_003662635.1| hypothetical protein MYCTH_2303495 [Myceliophthora thermophila ATCC
42464]
gi|347009904|gb|AEO57390.1| hypothetical protein MYCTH_2303495 [Myceliophthora thermophila ATCC
42464]
Length = 712
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 119/448 (26%), Positives = 193/448 (43%), Gaps = 86/448 (19%)
Query: 72 DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMC 126
++C+ L L+ + D T+ C ++F ++E + +KQ+ +LR C
Sbjct: 117 ELCDPALLFWLVEELLDSQTIAGCRKVFDFLESRRERITAKHFKQKQLV------ILRTC 170
Query: 127 NDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSN 186
N+LLRRLSR+ + FCGR+ +F + FP ++S +N+ E++VEN+T + D+E +
Sbjct: 171 NELLRRLSRALDPAFCGRVFIFTFQSFPLGDKSSVNLRGEYHVENVTTY--DQEPQPTDG 228
Query: 187 ETEETDTEEV---DKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
E + DT+ DK Y FW+LQ+ F P + ++ + F + E + F
Sbjct: 229 ERMDVDTDATPRKDKPMDPDALYPIFWALQESFNQPKKLFDPSHFAAFKAGLEATVTTFC 288
Query: 244 SY----------KL----DDVQSSLNPSGD---------YFAKYLTNQKLLDLQLSDTNF 280
+ KL DD + SL D + KYLT++ L L++SD F
Sbjct: 289 TINPEKPSRAKEKLDKPDDDAKPSLKRKRDDVDDELANSFNPKYLTSRDLFKLEISDLAF 348
Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGL---------------ELKSDQEKWVKDTTETVYS 325
RR +L+Q LI + S E L +L + KW ++
Sbjct: 349 RRNILVQALITLDFLLSLSPKAKEKLASVKTPNKSVTYSDQQLSEEDTKWATQMRGSITR 408
Query: 326 LIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKT 385
+ Q DG F ++V +L +++W +WK E CP ++ P S + DE A +
Sbjct: 409 YL-QRGVDGPFFERMVGTVLVRDKNWVRWKIENCPSIELPPMS---PQQFDEARATATRM 464
Query: 386 PELTKLWNSKDNL------------EACKSAERDFTPSLESY----------FEEAIQQM 423
+L N +L E + ER P L S+ E A
Sbjct: 465 ARTRRLPNDSRSLEFLGDEDETSAMEKLQDPERYSLPELSSFRDAIDDADFDIERATDDE 524
Query: 424 DPAAAVEEQYKKVNDSNYAWRALRLLSR 451
A A+E + K AW+ALRL S+
Sbjct: 525 SKALAMESKASK------AWKALRLASK 546
>gi|336271337|ref|XP_003350427.1| hypothetical protein SMAC_02140 [Sordaria macrospora k-hell]
gi|380090949|emb|CCC11482.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 867
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 113/430 (26%), Positives = 189/430 (43%), Gaps = 78/430 (18%)
Query: 7 LNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDID---IKVIENYIS 63
L D SQ+ V+D + + +E ++R L+ T ID + N
Sbjct: 57 LGIDPSQVGEVNDAKRTQR-------HAIIETAVRDTFSSLVATTSIDSPDFARVWNLFD 109
Query: 64 LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
+ L + C+ L + L+ + D T+ C ++F ++E + F K +L
Sbjct: 110 ILSILSDDEQCDPALLLWLVEELLDSQTIAGCRKVFDFLESRRERITAKHFGQK-KLVIL 168
Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENIT---------E 174
R CN+LLRRLSR+ + F GR+ ++L + FP +RS +N+ E++VEN+T E
Sbjct: 169 RTCNELLRRLSRALDPAFSGRVFIYLFQSFPLGDRSSVNLRGEYHVENVTTWDQEPAKAE 228
Query: 175 FGGDEEMDVSSNETEETDTEEVDKVK----------------IDFN-FYKKFWSLQDYFR 217
+ MDV ET + D+ +D Y FWSLQ+ F
Sbjct: 229 DASADSMDVDMETQGETTQKTGDQRSTPKPGTADAKKHRDKALDLEALYPIFWSLQESFN 288
Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP--------------------- 256
P + + + F S E +A F+S K+D S +
Sbjct: 289 RPKKLFEPAQFASFKSGLEATMAIFQSIKVDQSARSRDSKDKSDESKRALKRKRGEGEDD 348
Query: 257 --SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELK----- 309
+ + KYLT++ L L++ D RR VL+Q LI+ ++ S V S+G+ ++
Sbjct: 349 ELANGFNPKYLTSRDLFKLEIGDLALRRNVLVQALIIMEFLLS-VSSKGKEKIVRIKAPN 407
Query: 310 -----SDQE------KWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEG 358
SDQ+ WV D + + +K P+G +F ++V+ +L +++W WK EG
Sbjct: 408 KSVSHSDQQLSEEDTNWVLDMKKKIADYLK-IGPEGPYFYRMVETVLSRDKNWVHWKVEG 466
Query: 359 CPELKRPLTS 368
CP ++ P S
Sbjct: 467 CPPIEMPAVS 476
>gi|302667857|ref|XP_003025507.1| hypothetical protein TRV_00269 [Trichophyton verrucosum HKI 0517]
gi|291189621|gb|EFE44896.1| hypothetical protein TRV_00269 [Trichophyton verrucosum HKI 0517]
Length = 669
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 116/434 (26%), Positives = 193/434 (44%), Gaps = 110/434 (25%)
Query: 16 VVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCN 75
V++ Y + + +A+E + R+ L+ T ID E + N
Sbjct: 100 VLAGDYGVSVGANQQTQYAAVETAFREIFYDLLATTSID------------EPSFMQIWN 147
Query: 76 STLPIILLSDT---FDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCN 127
+ + SD D T+D C +F Y+E +KQ++ +LR CN
Sbjct: 148 LLDIVSIFSDNEELLDSQTIDGCRIVFDYLESRRERNTAKHFKQKSLI------ILRSCN 201
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG--------DE 179
+LLRRLSR+++TVFCGR+ +FL + FP ++S +N+ E++ EN+T F DE
Sbjct: 202 ELLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSSVNLRGEYHTENVTTFDAQPKTQAESDE 261
Query: 180 EMDVSSNE----------------TEETDTEEVD--------------KVKIDFNFYKKF 209
MD+ E T +T E + ++++D Y F
Sbjct: 262 AMDIDIQEDETLVSNAKSTEQELTTAQTPLESTNDATPQRGNAPSIQPELELD-TLYAAF 320
Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK--SYKLD------------------- 248
WSLQ+ F P + ++ + S E L++F+ S KL+
Sbjct: 321 WSLQESFSKPTRLFDTAHFASLKSGLEATLSSFRKVSSKLEVRSTTKGSDELRRGTKRKR 380
Query: 249 -----DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF----------- 292
D+ +S NP KYLT++ L +L+++D +FRR++L+Q LIL
Sbjct: 381 GEGGADLPNSFNP------KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPKSK 434
Query: 293 -QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
+ ST KS G L + KW ++ S + Q DG+ + ++V +L +++W
Sbjct: 435 AKLADSTNKSVLYGYVLSDEDAKWATQMRSSIASYL-QKGADGKFYFRMVDTVLTRDKNW 493
Query: 352 NQWKNEGCPELKRP 365
+WK EGCP ++RP
Sbjct: 494 VRWKAEGCPPIERP 507
>gi|302496461|ref|XP_003010232.1| hypothetical protein ARB_03584 [Arthroderma benhamiae CBS 112371]
gi|291173773|gb|EFE29592.1| hypothetical protein ARB_03584 [Arthroderma benhamiae CBS 112371]
Length = 617
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 116/434 (26%), Positives = 193/434 (44%), Gaps = 110/434 (25%)
Query: 16 VVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCN 75
V++ Y + + +A+E + R+ L+ T ID E + N
Sbjct: 48 VLAGDYGVSVGANQQTQYAAVETAFREIFYDLLATTSID------------EPSFMQIWN 95
Query: 76 STLPIILLSDT---FDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCN 127
+ + SD D T+D C +F Y+E +KQ++ +LR CN
Sbjct: 96 LLDIVSIFSDNEELLDSQTIDGCRIVFDYLESRRERNTAKHFKQKSLI------ILRSCN 149
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG--------DE 179
+LLRRLSR+++TVFCGR+ +FL + FP ++S +N+ E++ EN+T F DE
Sbjct: 150 ELLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSSVNLRGEYHTENVTTFDAQPKTQAESDE 209
Query: 180 EMDVSSNE----------------TEETDTEEVDK--------------VKIDFNFYKKF 209
MD+ E T +T E ++ +++D Y F
Sbjct: 210 VMDIDIQEDETLVSNAKSTEQEPTTAQTPLESTNEATPQRGNASGIQPELELD-TLYAAF 268
Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK--SYKLD------------------- 248
WSLQ+ F P + ++ + S E L++F+ S KL+
Sbjct: 269 WSLQESFSKPTRLFDTAHFASLKSGLEATLSSFRKVSSKLEVRSTTKGSDELRRGTKRKR 328
Query: 249 -----DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF----------- 292
D+ +S NP KYLT++ L +L+++D +FRR++L+Q LIL
Sbjct: 329 GEGGADLPNSFNP------KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPKSK 382
Query: 293 -QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
+ ST KS G L + KW ++ S + Q DG+ + ++V +L +++W
Sbjct: 383 AKLADSTNKSVLYGYVLSDEDAKWATQMRSSIASYL-QKGADGKFYFRMVDTVLTRDKNW 441
Query: 352 NQWKNEGCPELKRP 365
+WK EGCP ++RP
Sbjct: 442 VRWKAEGCPPIERP 455
>gi|308800702|ref|XP_003075132.1| Nuclear matrix protein (ISS) [Ostreococcus tauri]
gi|116061686|emb|CAL52404.1| Nuclear matrix protein (ISS) [Ostreococcus tauri]
Length = 636
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 106/339 (31%), Positives = 173/339 (51%), Gaps = 46/339 (13%)
Query: 156 SERSGLNIISEFNVENITEFGGDEEM-DVSSNETEETDTE-EVDKVKIDFN-------FY 206
S+RSGLN+ N F DE + + + N+ E D + E +K +ID + F+
Sbjct: 5 SDRSGLNLQGASNPNVTPIFEQDEWVREDAENDDETMDVDGEYEKGEIDMSTFGADGAFH 64
Query: 207 KKFWSLQDYFRNPVQCY-NKVSWKMFTSYAETVLAAFKSYKLDDVQSSL-----NPS-GD 259
K FWSLQ Y++NP Q +W + + ++ F+ + L + +L NP D
Sbjct: 65 KTFWSLQTYYQNPTQAMCRDGAWNTYHAILREIMDKFEEHPLTETAKTLLDFHENPELRD 124
Query: 260 YFA-KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKD 318
++LT ++LL +QL D +FRR +L+Q + T T K + E D+E V +
Sbjct: 125 LIGHRFLTARRLLPMQLRDYSFRRQILIQTAFFLEAMT-TEKFK----EHVVDEE--VIE 177
Query: 319 TTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL--------KRPLT--- 367
E V ++K+T PD + + ++ +L+ E W +W+++GCP K P+
Sbjct: 178 ARERVMEILKRTGPDAQLAASFIEDVLEDEAGWRKWRDDGCPNFVREPIDFEKEPMPDNW 237
Query: 368 ----SITDEDKKDEPDAKKKKT---PELTKLWNSKDNLEACKSAERDF--TPSLESYFEE 418
S+ D+ EPD + K+ L +LWN D+ K+A +F S+E +F
Sbjct: 238 DIKYSMWPPDQFPEPDPRYKEDFGDEALNRLWNIDDD--DVKNARSEFLVAESIEDFFRP 295
Query: 419 AIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
+++MDP A VEEQY++VND + WRA+R+L+R H F
Sbjct: 296 CLEEMDPEAQVEEQYRRVNDEVFRWRAMRMLARSHLHLF 334
>gi|390603448|gb|EIN12840.1| hypothetical protein PUNSTDRAFT_97766 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 784
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 124/414 (29%), Positives = 187/414 (45%), Gaps = 94/414 (22%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
+ + + + + +D C+ T P L D + T+ C LF ++E N + T M
Sbjct: 79 LRDMLDVVLTFTEQDACDPTFPFTTLQDLIETHTIASCSHLFSWIESRAN---RLTAGMV 135
Query: 118 CKNN----LLRMCNDLLRRLSR-SQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV--E 170
+ LLRM NDLLRRLS+ TVFCGRIL FL+ FP ERSG+N+ E+ E
Sbjct: 136 PQKGKALVLLRMLNDLLRRLSKMGDATVFCGRILSFLSAVFPLGERSGVNLRGEYGPTWE 195
Query: 171 NITEF--------GG---DEEMDVSSNE-TEETDTEEVDKVKIDF-------------NF 205
+ E GG DE+M N E DT K+++D +F
Sbjct: 196 GVMEAVVKEDGSSGGMDVDEKMKPPENALGEPGDT----KMQVDEENPSPSKSSSGSDDF 251
Query: 206 YKKFWSLQDYFRNP----------------------VQCYNKVSWKMFTSYAETVLAAFK 243
Y FW LQ F P + N + S ++T AA K
Sbjct: 252 YSTFWFLQLPFSKPPVFAEPGMFARFKTAVNKVLPVIHEANAKERLLIGSSSKTHAAAVK 311
Query: 244 S-YKLDDVQSSLNPSGD---------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQ 293
S + +++ GD +FAK+LT+ +LLDL+++D++FRR L Q LIL Q
Sbjct: 312 SQVTPKRKRDAIDTDGDEEDVGTNDYFFAKFLTSPELLDLEIADSHFRRQFLFQLLILLQ 371
Query: 294 Y---FTSTVKS-----RGEGLEL-------------KSDQE-KWVKDTTETVYSLIKQTP 331
+ FT K+ R L++ KS + +WV +T ++QT
Sbjct: 372 HLQTFTRNAKAAWYTPRNRSLQMDFTLETADPNVKGKSASDLQWVSETMHKAIDELRQTA 431
Query: 332 PDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTSITDEDKKDEPDAKKKK 384
P+G FS V++IL+ E +W +WKNE C R P +E+ +D+ +K++
Sbjct: 432 PNGRGFSDTVRVILERERNWVRWKNELCAPFDREPWAQEIEEETEDDSKERKRR 485
>gi|336377042|gb|EGO05377.1| hypothetical protein SERLA73DRAFT_101182 [Serpula lacrymans var.
lacrymans S7.3]
Length = 1079
Score = 138 bits (348), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 121/385 (31%), Positives = 173/385 (44%), Gaps = 59/385 (15%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
+++ + L + D +T P +L D + T+ C +F ++E +
Sbjct: 79 LKDRLDLVLTFTEHDASEATFPFTVLQDLLETQTIASCSHVFSWIEARAARLTKGMVPQK 138
Query: 118 CKN-NLLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNIISEFN------- 168
K LLR NDLLRRLS+ NT +FCGRIL FL+ FP ERSG+N+ E+
Sbjct: 139 GKALILLRTLNDLLRRLSKMGNTTLFCGRILTFLSGVFPLGERSGVNLRGEYGPMWEGVK 198
Query: 169 -VENITEFG-----GDEEMDVS-SNETEETDTEEVDKVK-IDFNFYKKFWSLQDYFRNPV 220
VE E +++M+V S E E T VD VK FY FWSLQ F P
Sbjct: 199 EVETKVEIAEPPKTPEDKMEVDDSKEGGEDKTGAVDCVKDKSSKFYTTFWSLQLPFSRPP 258
Query: 221 QCYNKVSWKMFTSYAETVLAAFKSYKLDD---------------------VQSSLNPSGD 259
++ F VL K D V+ S +
Sbjct: 259 LFATPTTFPDFKEAVNKVLPVIKEATTKDRALMGSRSATGSSSSLKRKREVELGEEASSN 318
Query: 260 --YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVK-----SRGEGLE-- 307
+FAK+LT+ LLDL+++DT+FRR L Q LIL + FT K SR L+
Sbjct: 319 EYFFAKFLTSPDLLDLEVADTHFRRQFLFQLLILLNHLLSFTKAAKATWLTSRNRSLQME 378
Query: 308 --LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
L++ + +WV++T + ++ T P+G FS V +IL+ E++W +WKN+ C R
Sbjct: 379 FTLETAEVQWVQETIAKASAELRSTAPNGPVFSDTVNVILEREKNWVKWKNDLCAPFDR- 437
Query: 366 LTSITDEDKKDEPDAKKKKTPELTK 390
E E D KK E TK
Sbjct: 438 ------EPWGTEVDGKKVGLEEATK 456
>gi|336390083|gb|EGO31226.1| hypothetical protein SERLADRAFT_412639 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1229
Score = 138 bits (347), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 121/386 (31%), Positives = 173/386 (44%), Gaps = 60/386 (15%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
+++ + L + D +T P +L D + T+ C +F ++E +
Sbjct: 112 LKDRLDLVLTFTEHDASEATFPFTVLQDLLETQTIASCSHVFSWIEARAARLTKGMVPQK 171
Query: 118 CKN-NLLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNIISEFN------- 168
K LLR NDLLRRLS+ NT +FCGRIL FL+ FP ERSG+N+ E+
Sbjct: 172 GKALILLRTLNDLLRRLSKMGNTTLFCGRILTFLSGVFPLGERSGVNLRGEYGPMWEGVK 231
Query: 169 -VENITEFG-----GDEEMDVS-SNETEETDTEEVDKVK--IDFNFYKKFWSLQDYFRNP 219
VE E +++M+V S E E T VD VK FY FWSLQ F P
Sbjct: 232 EVETKVEIAEPPKTPEDKMEVDDSKEGGEDKTGAVDCVKDKSSSEFYTTFWSLQLPFSRP 291
Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLDD---------------------VQSSLNPSG 258
++ F VL K D V+ S
Sbjct: 292 PLFATPTTFPDFKEAVNKVLPVIKEATTKDRALMGSRSATGSSSSLKRKREVELGEEASS 351
Query: 259 D--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVK-----SRGEGLE- 307
+ +FAK+LT+ LLDL+++DT+FRR L Q LIL + FT K SR L+
Sbjct: 352 NEYFFAKFLTSPDLLDLEVADTHFRRQFLFQLLILLNHLLSFTKAAKATWLTSRNRSLQM 411
Query: 308 ---LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
L++ + +WV++T + ++ T P+G FS V +IL+ E++W +WKN+ C R
Sbjct: 412 EFTLETAEVQWVQETIAKASAELRSTAPNGPVFSDTVNVILEREKNWVKWKNDLCAPFDR 471
Query: 365 PLTSITDEDKKDEPDAKKKKTPELTK 390
E E D KK E TK
Sbjct: 472 -------EPWGTEVDGKKVGLEEATK 490
>gi|453083404|gb|EMF11450.1| nuclear matrix protein [Mycosphaerella populorum SO2202]
Length = 637
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 130/493 (26%), Positives = 212/493 (43%), Gaps = 88/493 (17%)
Query: 40 IRQYLLKLIKTPDIDIKVIENYISL--CVELCM-KDMCNSTLPIILLSDTFDMSTLDKCE 96
IR+ I + DI+ + +L V +C + C L LL + D T C
Sbjct: 72 IREIFYAFIHSTDINDPAFADVWNLLDIVLMCGDQGKCTPELVCWLLEELLDSQTTLGCR 131
Query: 97 QLFYYVEVNVNIWKQQTFFMSCKNNL--LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFP 154
+F Y++ Q+ F K NL LR CN+LLRRLSR+++ +FCGR+ FL + FP
Sbjct: 132 VVFDYLDSRRERLAQKDFH---KKNLVFLRSCNELLRRLSRAEDAIFCGRVFFFLFQTFP 188
Query: 155 FSERSGLNIISEFNVENITEF---GG--------DEEMDVSSNETEETDT---------- 193
++S +N+ EF+VEN T+F GG E V S E T
Sbjct: 189 LGDKSSVNLRGEFHVENTTKFDVEGGATEGIEPDAEPAAVPSTPKESTPPPAPTGKPGSK 248
Query: 194 --------EEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK- 243
+ ++V + + Y FW LQ F +P + + +++ F L FK
Sbjct: 249 AVPIKAPKKAAEEVPLSASELYPIFWRLQHDFSDPTKLFQPQNFEAFKKGLACTLTKFKK 308
Query: 244 -----------------SYKLDDVQS--SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYV 284
KL + S N +Y KYLT++ L DL+LSD F+R++
Sbjct: 309 TPTVVQTKAGEEDKRGTKRKLGEAAGSESSNLLDNYNPKYLTSRDLFDLELSDLAFQRHI 368
Query: 285 LLQFLILFQY--------------FTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQT 330
L+Q +IL + T T KS L + W T + + ++
Sbjct: 369 LVQAMILIDFLLSLTERAKKRLSSLTGTNKSLLYSFTLSEEDTHWATSTRAVIREGL-ES 427
Query: 331 PPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTS---------ITDEDKKDEPDA 380
DG + ++V+ +L +++W +WK E CP + R P+T+ + + +
Sbjct: 428 REDGRQYCRMVETVLTRDKNWVRWKVESCPSIVRGPVTTEEELLARKGVANFTRPRRLAE 487
Query: 381 KKKKTPELTKLWNS-KDNLEACKSAERDFTPSLESYFEE-AIQQMDPAAAVEEQYKKVND 438
K + +L L + LE K+ R TPS+ ++ + ++D A+ ++ K D
Sbjct: 488 KPQGMMDLGFLEEADSGGLEPLKNPARYTTPSIPQLVDQIKMDELDAEMAMTDEEKSNID 547
Query: 439 ---SNYAWRALRL 448
+N WRALR+
Sbjct: 548 TAMTNAKWRALRI 560
>gi|170084509|ref|XP_001873478.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164651030|gb|EDR15270.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 732
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 109/366 (29%), Positives = 172/366 (46%), Gaps = 64/366 (17%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
+ + + + + D C T P I+L D + T+ C +F ++E +
Sbjct: 79 LRDKLDVVLTFTEHDACEQTFPFIVLQDLLETQTIASCSHIFSWIEARAERLTEGMVPQK 138
Query: 118 CKN-NLLRMCNDLLRRLSR-SQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV--ENIT 173
K LLR NDLLRRLS+ T+FCGRIL FL+ FP ERSG+N+ E+ E ++
Sbjct: 139 GKALILLRTLNDLLRRLSKMGTTTIFCGRILTFLSGVFPLGERSGVNLRGEYGPTWEGVS 198
Query: 174 ---EFGGDE---EMDVSSN--ETEETDTE-EVDKVKIDFN-----------------FYK 207
+ D+ EM+V+ + E E + E + DK+++D FY
Sbjct: 199 LGEKTKADDKHVEMEVAKSLEEPESKEGEAQGDKMQVDSKEVKKPVPTEGLAEKKDAFYN 258
Query: 208 KFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK-----------SYKLDDVQSSLNP 256
FW+LQ F P +++ F V+ K S + +SL
Sbjct: 259 TFWALQLPFSKPPLFAADGTFEDFKERVNKVMPVIKEATAKERAMMGSRGVAGTGASLKR 318
Query: 257 SGD-----------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS- 301
D +FAK+LT+ LLDL+++DT+FRR L Q LIL + FT T K+
Sbjct: 319 KRDLDSEESNSNEYFFAKFLTSPDLLDLEIADTHFRRQFLFQLLILLNHLLTFTKTAKAA 378
Query: 302 ----RGEGLE----LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQ 353
R L+ L+ ++ +WV+++ V ++QT P+G F++ + IL E++W +
Sbjct: 379 WTSTRNRSLQMDFTLEDNEAQWVQESITKVTEELRQTTPNGRIFAETISAILDREKNWVR 438
Query: 354 WKNEGC 359
WKNE C
Sbjct: 439 WKNELC 444
>gi|389742049|gb|EIM83236.1| hypothetical protein STEHIDRAFT_123683 [Stereum hirsutum FP-91666
SS1]
Length = 849
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 105/385 (27%), Positives = 168/385 (43%), Gaps = 67/385 (17%)
Query: 58 IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
++N + L + L +D+C T P L D + T+ C +F ++E N + +
Sbjct: 81 LKNRLDLALTLTEQDVCEQTFPFNTLQDLLETQTISSCSHVFSWIESNADRLTEGMVPAK 140
Query: 118 CKN-NLLRMCNDLLRRLSR-SQNTVFCGRILLFLAKFFPFSERSGLNIISEF-------- 167
K +LRM NDLLRRLS+ N++FCGRIL FL+ FP ERSG+N+ E+
Sbjct: 141 GKTLIILRMLNDLLRRLSKMGSNSIFCGRILTFLSGVFPLGERSGVNLRGEYGPMWEAVE 200
Query: 168 ---------------NVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDF------NFY 206
NV+++ E G E+ +S E + + K D +FY
Sbjct: 201 IKPKVEEKPEETKVENVKDVKEEDGKEKEKDASMSMEVEEKKPASKTSSDLKQEQKDDFY 260
Query: 207 KKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD------- 259
FWSLQ F P + ++ F + VL + + +G
Sbjct: 261 NTFWSLQLPFSRPPLFASPETFTQFQTAVNKVLPIINEATAKERALMGSRTGSTSVSHSH 320
Query: 260 -----------------YFAKYLTNQKLLDLQLSDTNFRR----------YVLLQFLILF 292
+FAK+LT+ +LLDL+++DT+FRR LL F
Sbjct: 321 KRKREAEPGEETTTTEYFFAKFLTSPELLDLEIADTHFRRQFLFQLLILLNHLLLFTKTS 380
Query: 293 QYFTSTVKSRGEGLE--LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
+ +T ++R E L+ + +WV +T ++QT P G F+ V IL+ E++
Sbjct: 381 RAIWTTTRNRSLQFEFTLEPPETQWVTETINKAQEELRQTAPGGRAFADSVSTILEREKN 440
Query: 351 WNQWKNEGCPELKRPLTSITDEDKK 375
+WKN+ C R S+ + K+
Sbjct: 441 RVKWKNDLCTPFDREPYSVEVDGKR 465
>gi|67541699|ref|XP_664617.1| hypothetical protein AN7013.2 [Aspergillus nidulans FGSC A4]
gi|40742469|gb|EAA61659.1| hypothetical protein AN7013.2 [Aspergillus nidulans FGSC A4]
gi|259483676|tpe|CBF79260.1| TPA: THO complex subunit Tho1, putative (AFU_orthologue;
AFUA_4G04330) [Aspergillus nidulans FGSC A4]
Length = 637
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 103/418 (24%), Positives = 177/418 (42%), Gaps = 115/418 (27%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ L+ T ID I N + + + C L L+ + D
Sbjct: 66 AAVEIAFRERFYSLLATTTIDDPEFVRIWNLLDVISVFSDNEQCEPGLIFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
T+D C ++F Y+E RR +++TVFCGR+ +FL
Sbjct: 126 TIDGCRKVFDYLES--------------------------RRERNTKDTVFCGRVFIFLF 159
Query: 151 KFFPFSERSGLNIISEFNVENITEF--------GGDEEMDV------SSNETEET----- 191
+ FP ++S +N+ E++ EN+T F +MDV ++N ET
Sbjct: 160 QSFPLGDKSAVNLRGEYHTENVTTFDTIANDAISEAHDMDVDMPDVKATNSVAETQKQPV 219
Query: 192 ----------DTEEVDKVKIDFN-------------FYKKFWSLQDYFRNPVQCYNKVSW 228
D ++ KV + + Y FW LQ YF +P + ++ +
Sbjct: 220 AEGDAKSAEQDVTKIPKVTVSQSSDQTGERSLNLDTLYPIFWGLQAYFSSPTKIFDAQHF 279
Query: 229 KMFTSYAETVLAAFKSYKLD-------------------------DVQSSLNPSGDYFAK 263
F S E L AF++ + ++ SS NP K
Sbjct: 280 ATFKSGLEATLNAFRTVNTELETSNSKTQEELRKSTKRKRTADGPEIASSFNP------K 333
Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE-----------GLELKSD 311
YLT+++L DL+++DT FRR+VL+Q LIL + S T KS+ + G L +
Sbjct: 334 YLTSRELFDLEVNDTAFRRHVLVQALILLDFMLSLTSKSKAKLADLTNKSVLYGFVLNDE 393
Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSI 369
+W +++ + Q G+ + ++V +L +++W +WK EGCP +++P S+
Sbjct: 394 DAQWAVKMRKSIEGYL-QEGAGGKFYYRMVDTVLSRDKNWVRWKAEGCPPIEKPAVSV 450
>gi|407927426|gb|EKG20320.1| THO complex subunit THOC1 [Macrophomina phaseolina MS6]
Length = 658
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 131/526 (24%), Positives = 223/526 (42%), Gaps = 113/526 (21%)
Query: 37 EQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLD 93
E + + L L+ T ID + N + + + ++ C+ L L + D ++D
Sbjct: 64 EIAAKTLLHHLVATTTIDDPSFVQVWNLLDIVLTCSERNQCDGLLIFYLAEELLDSQSID 123
Query: 94 KCEQLFYYVEVNVNIWKQQTFFMSCKNN----LLRMCNDLLRRLSRSQNTVFCGRILLFL 149
C ++F Y+E F + + +LR CN+LLRRLSR+++ VFCGR+ +FL
Sbjct: 124 GCRRVFDYLESRRERLVAGDFLQNSGASKRIVVLRFCNELLRRLSRAEDAVFCGRVFIFL 183
Query: 150 AKFFPFSERSGLNIISEFNVENITEF------------------GGDEEMDVSSNETEET 191
+ FP ++S +N EF+VEN T F G D + +V TE+T
Sbjct: 184 FQTFPLGDKSSVNSRGEFHVENATVFEEPASSKEEKADDMEIDAGADGKTEVPIVTTEDT 243
Query: 192 DTEEVDKVKIDFN--------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
E K ++ + Y FW+LQ F NP + + F E + FK
Sbjct: 244 TAEADAKAELKKDNKALDTDTLYPIFWTLQQAFSNPPKLFTDEYLAEFRVGIEHTITKFK 303
Query: 244 -----------------SYKLDDVQSSLNPSGDY---------------------FA--- 262
D ++ G+Y FA
Sbjct: 304 DVPKVIPAKNKDKSDERDLSKDKGKAEAGAEGNYENASAEDGRGMKRKHDEGEEEFASPY 363
Query: 263 --KYLTNQKLLDLQLSDTNFRRYVLLQFLILF--------------QYFTSTVKSRGEGL 306
KYLT++ L +L+LSD F+R++L+Q L+L Q + STV + +
Sbjct: 364 NPKYLTSRDLFELELSDLAFQRHILVQALVLLDFLLSLTEKSKEKTQKWLSTVGNPNRAV 423
Query: 307 E----LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL 362
+ L + W T + + + Q PDG+ + ++V +L +++W +WK E CP +
Sbjct: 424 QYSYTLSAADADWASRTRTAITTYL-QLLPDGKFYYRMVDTVLSRDKNWVRWKMESCPPI 482
Query: 363 KRPLTSITD--EDKKDEPDA----KKKKTP----ELTKLWNS--KDNLEACKSAERDFTP 410
+P + ++ E K+ A + K TP L+ L ++ L+ K + R P
Sbjct: 483 AKPPVAPSEFQEAKQGAQRACANKRLKATPPGSLNLSFLSDAAMSGGLQGLKESSRYTIP 542
Query: 411 SLESYFEEAIQ----QMDPAAAVEEQYK-KVNDSNYAWRALRLLSR 451
+ ES IQ ++ A E+++K + ++ WRALR+ S+
Sbjct: 543 TPES-LHRGIQGDELDLEMAMTDEDKHKLQQAKASKTWRALRIASK 587
>gi|328724615|ref|XP_003248200.1| PREDICTED: THO complex subunit 1-like [Acyrthosiphon pisum]
Length = 251
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 71/178 (39%), Positives = 105/178 (58%), Gaps = 25/178 (14%)
Query: 275 LSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDG 334
LS++N+ ++ +F F + ++K +Q++W+K++ T+Y LI TPPDG
Sbjct: 66 LSESNYLYLIIYKFTFDF----------SDNFKVKPEQQEWIKNSITTIYELINNTPPDG 115
Query: 335 EHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDED-----KKDEPDAKKKKT---- 385
FS+VVK IL+ EE WN+WKN+GCPE+ + L + D +++ D K T
Sbjct: 116 VRFSEVVKNILQREEQWNKWKNDGCPEVIKRLPTSDQSDMAAPKRRNLGDVIKNMTANNK 175
Query: 386 -----PELTKLWNSK-DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
P+LTKLWN K DNLEACK ERDF PS +S+F+EA +Q+D +E Q K +
Sbjct: 176 VFLGDPDLTKLWNLKPDNLEACKGPERDFVPSFDSFFQEAFEQLDAKLCIESQNNKAD 233
>gi|224009341|ref|XP_002293629.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220971029|gb|EED89365.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 983
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 109/398 (27%), Positives = 170/398 (42%), Gaps = 79/398 (19%)
Query: 121 NLLRMCNDLLRRLSRS--QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGD 178
NLLR+CN LL+ S S ++ F G +++ LA+ FP SERS LN++ FNV ++ F G+
Sbjct: 218 NLLRICNHLLKLSSSSIGKDAEFAGSVMVTLARVFPLSERSALNVLGSFNVGSVVGFEGE 277
Query: 179 EEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPV-----------QCYNKVS 227
EE + + I ++FYK FW +Q F +P Q +
Sbjct: 278 EERNHHHHSL------------IGYDFYKTFWGVQKVFTDPSSTILASRGGLKQSEALQA 325
Query: 228 WKMFTSYAETVLAAFKSYKL-----DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRR 282
+ F + +T+L A +S + + +S KYLT+ +LL LQL D+ FR
Sbjct: 326 YDGFVADVKTILGALESTPVAAPPSESTSNSSVVVVVNHHKYLTSPQLLHLQLKDSTFRI 385
Query: 283 YVLLQFLILFQYFTSTVKSRGEG----------------LELKSDQEKWVKDTTETVYSL 326
+ L Q +I+ Y TST S ++K Q K + D + V L
Sbjct: 386 HFLTQLIIIINYLTSTPTSVTLPTPPITPTPGADTTKLQAQIKQTQLKQLNDLNKRVEQL 445
Query: 327 IKQTP-----------PDGEHFSQVVKLILKGEE-HWNQWKNEGCP----ELKRPLTSIT 370
++ + P GE+ + + +L+ E W WK C + L +
Sbjct: 446 LRSSSSAGAATAAAVQPYGENQWKGLTWLLRDRETMWKAWKRGKCKSNMDSVLNDLGGVR 505
Query: 371 DEDKKD--EPDAKKKKTPEL----TKLWNSKD-----------NLEACKSAERDFTPSLE 413
D + +K+ E+ T S D NL S PSL
Sbjct: 506 GGDVRAILGSSGRKRTLEEVASSGTATGGSGDVADNAVIGIKTNLPTISSQMAQSQPSLT 565
Query: 414 SYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
+ + I+ +DP + +E +Y ND Y+WRALRLL+R
Sbjct: 566 DFLDPYIEALDPESGIEAEYHPRNDKVYSWRALRLLAR 603
>gi|169610283|ref|XP_001798560.1| hypothetical protein SNOG_08238 [Phaeosphaeria nodorum SN15]
gi|160702023|gb|EAT84514.2| hypothetical protein SNOG_08238 [Phaeosphaeria nodorum SN15]
Length = 480
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 111/434 (25%), Positives = 196/434 (45%), Gaps = 88/434 (20%)
Query: 12 SQLNVVSDKYALYKPTCDYDKKSALEQSIRQ--YLLKLIKTPDIDIKVIENYISLCVELC 69
S+LN + ++ KP+ D + IR+ +L I+ D L VE
Sbjct: 14 SRLNYLLERARSVKPSTSIDPPLQVSDLIREDEFLFDPIEGSD-------EARRLSVETA 66
Query: 70 MKDMCNSTLP-IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN---LLRM 125
K + + L ++L+ + D +++ C +F ++E ++ ++ KN +LR+
Sbjct: 67 AKSLFYANLATLLLIEELLDSQSIEGCRVVFNFLESR----REAIIAINSKNKDLVILRL 122
Query: 126 CNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------GG 177
CN+LLRRLSR+++ VFCGR+ +F+ + FP ++S +N+ F+VEN+T F
Sbjct: 123 CNELLRRLSRAEDPVFCGRVYIFMFQSFPLGDKSSVNLRGNFHVENVTTFEDFLKDSSTE 182
Query: 178 DEEMDVSSNETE-----ETDTEEVD---------------KVKIDFNFYKKFWSLQDYFR 217
D++M + E E E+ +EE + + ID + Y FWSLQ F
Sbjct: 183 DDQMQIDDMEAEPVLKGESKSEEHNVKLGNSNVQKDELSSTMDID-SLYPVFWSLQHSFS 241
Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFK-------SYKLDDVQSSLNPSG---DYFAKYLTN 267
NP + + + +K F E LA FK + + + ++ G D F K L
Sbjct: 242 NPPRLFEEDHFKHFQKSLEATLAKFKEVPKVIQAGDSERTKEQMDQDGDDYDAFGKRLQP 301
Query: 268 Q-----------------KLLDLQLSDTNFRRYVLLQFLILFQY-FTSTVKSRGEGL--- 306
Q KL++ LSD F+R++L+Q LIL + T T K++ + +
Sbjct: 302 QISHKPRSVQAWRYAIRDKLVE-ALSDLAFQRHILVQALILVDFLLTLTEKAKSKSIYQK 360
Query: 307 ---------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNE 357
L+ + +W + + + Q PDG+ + ++V +L +++W +WK E
Sbjct: 361 AQKAMQYNFTLREEDTEWALGIKTAIANYL-QEGPDGKFYYRMVDTVLSRDKNWVRWKME 419
Query: 358 GCPELKRPLTSITD 371
C R + D
Sbjct: 420 NCQPFTRDRVATKD 433
>gi|388582940|gb|EIM23243.1| hypothetical protein WALSEDRAFT_62616 [Wallemia sebi CBS 633.66]
Length = 689
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 117/400 (29%), Positives = 181/400 (45%), Gaps = 88/400 (22%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDI-----DIKVIENY-------ISLCVELCMKDMC--NS 76
DKK+ +E +IR+ + T I D KV + Y + + V+L +M NS
Sbjct: 34 DKKALVEATIREEIQSYSTTRSIAPQYSDDKVTDPYHTGLLSKLGILVDLLKSEMITPNS 93
Query: 77 TLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQ----QTFFMSCKNNLLRMCNDLLRR 132
+PII SDT D+ L +F ++ N Q +S +NNLL+ N+LLRR
Sbjct: 94 VVPII--SDTLDIHPLSSATVIFSFL---TNFMSSDDSNQVKTISTRNNLLKTSNELLRR 148
Query: 133 LSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETD 192
SR++N+ CG ILLFL P ++RS N F GD SN+T D
Sbjct: 149 ASRTRNSELCGEILLFLTSIMPLTDRSATN------------FRGDIR---ESNKTFIND 193
Query: 193 TEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWK--------------MFTSYAETV 238
+D D + YK FW LQ Y +P+ YN K +F++ + +
Sbjct: 194 L--IDSPDAD-DSYKDFWRLQSYLTDPMSIYNGGLVKNDDKEVTHIESFKQLFSNILDLL 250
Query: 239 L-AAFKSYKLD---------------DVQSSLNPSGD--YFAKYLTNQKLLDLQLSDTNF 280
+ A+ K K+ + +S + S D +F +YL KL LQ++D +F
Sbjct: 251 VQASDKEIKMSGSKLKKDTKRKADELEAKSDNHESNDAYFFPEYLAQPKLFPLQVADPSF 310
Query: 281 RRYVLLQFLILFQYFTST-VKSRGEGLELKSDQEKWVKDTTET---VYS--------LIK 328
R V +Q I+ Q+ S K + E+K+ QEK +K ++ +Y L+K
Sbjct: 311 RCCVFIQMFIIVQFIVSVYPKELLKAPEIKNPQEKIIKSGIQSDKPLYDWASNKRAILLK 370
Query: 329 QTP---PDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
P P+ +Q + ++K E+HW WK E CP ++ P
Sbjct: 371 SLPKVSPNNLDITQPIVQLMKREKHWVYWKMENCPMIELP 410
>gi|261187831|ref|XP_002620333.1| nuclear matrix protein [Ajellomyces dermatitidis SLH14081]
gi|239593450|gb|EEQ76031.1| nuclear matrix protein [Ajellomyces dermatitidis SLH14081]
Length = 651
Score = 125 bits (313), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 143/616 (23%), Positives = 242/616 (39%), Gaps = 152/616 (24%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+ +E + R+ L+ T ID I N + + + C L L+ + D
Sbjct: 66 AVVETAFRKKFYDLLATTSIDEPSFIHIWNLLDIVSIFSDNEQCEPGLLFWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+D C ++F Y+E +KQ++ +LR CN+LLRRLSR+++T F
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRTEDTSF---- 175
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDEEMDVSSNETE---ETDTE 194
P +RS +N+ EF+ EN+T F GG MDV ++ E + TE
Sbjct: 176 --------PLGDRSSVNLRGEFHTENVTTFDKLPKAQDGGVTPMDVDASAKEVAGDAKTE 227
Query: 195 EVDKVKIDFN-----------------------------------FYKKFWSLQDYFRNP 219
+ + N Y FWSLQ+ F +P
Sbjct: 228 SAQASEQESNRQAAHPAADVGLQSSQPATKKPQEPEPSSTPSMDELYPIFWSLQNNFSSP 287
Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------DVQSSLNPSGDY 260
++ ++ F + E L+ F+ D V + + +
Sbjct: 288 TSLFDPANFAAFKTGLEASLSIFQKVNTDMEARMTKGSEDARRGLKRRRVGDGMEMTNSF 347
Query: 261 FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE--------------- 304
KYLT++ L +L+++D FRR++L+Q LIL + S T K++ +
Sbjct: 348 NPKYLTSRDLFELEVNDVAFRRHILVQSLILLDFVLSLTPKAKAKLADSTNKSVLYSYVL 407
Query: 305 -----GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
L S + KW ++ S ++Q +G+ + ++V +L +++W +WK E C
Sbjct: 408 NDEDVNLCTASSKAKWASQMKTSIASYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEAC 466
Query: 360 PELKRPLTSITD------EDKKDEPDAKKKKTP----ELTKLWNSK--DNLEACKSAERD 407
P ++R SI D K + + P +L L K +++ K +R
Sbjct: 467 PPIERTPVSIQDYLDTQATATKVSTSKRLRSAPLGSLDLNFLTEDKNINSVGRLKHPDRF 526
Query: 408 FTPSLESYFEEAIQ---QMDPAAAVEEQYKKVND-SNYAWRALRLLSRKCPHFF------ 457
TP+L+SY +D A + EE+ + ++ WR LRL SR F
Sbjct: 527 TTPALDSYMRGIADDEFNIDMAQSREEKDEATKARASKTWRTLRLSSRSKLALFDKIEDG 586
Query: 458 -----LNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVD-QDPAEVEVDTKSEEIQ 511
L P +++ N K V+ NGV+ QD D K E+ +
Sbjct: 587 NNLKVLFEPPPASEDASSASNKQKAAVE-----------NGVNRQDGPSKRGDVKGEKAK 635
Query: 512 EEEKEEDWEAKADPEG 527
E + + A +G
Sbjct: 636 ENQPPVSSDGAAPQQG 651
>gi|339259264|ref|XP_003368993.1| hypothetical protein Tsp_12746 [Trichinella spiralis]
gi|316964321|gb|EFV49483.1| hypothetical protein Tsp_12746 [Trichinella spiralis]
Length = 442
Score = 124 bits (312), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 100/379 (26%), Positives = 166/379 (43%), Gaps = 36/379 (9%)
Query: 129 LLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISE----FNVENITEFGGDEEMDVS 184
+L+RLSR N CGR+ FL K PF ++SGLN E + E I+ DEE+
Sbjct: 1 MLKRLSRIINAELCGRLSAFLTKCLPFLDKSGLNATGECEPLLDAELISFPDEDEELVEQ 60
Query: 185 SNETEE-------TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCY-NKVSWKMFTSYAE 236
E+ T E+ ++ K + + Y F + P C + W F +
Sbjct: 61 FAESAYYLRDEFITLPEKTEEKKFNRSLYSSFCLTKKLLAEPTNCCETEDDWYQFKQAVD 120
Query: 237 TVLA-----------AFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVL 285
VL + S V L+ G + YL K+ LQ++D +F L
Sbjct: 121 KVLTHMSHPPLPFDRTYSSLMNQAVARELSLPGGF---YLPEPKMFMLQINDVSFVSTFL 177
Query: 286 LQFLILFQYFT---STVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVK 342
++ LI+ Y + + LE S + +W+ T Y L+K P DGE + +
Sbjct: 178 MEQLIVLDYLIEQNRCLNTTVHDLESSSGKHRWLLSTYNNCYQLLKAMP-DGEAATFHLN 236
Query: 343 LILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK-----TPELTKLWN-SKD 396
L L E+ W QWK + P+++ T+I ++ E K P LTKLWN +
Sbjct: 237 LQLDNEKLWGQWKQDNYPDVRVKPTAIVQSLRRGEHRPYVKGFFDLGNPTLTKLWNLCPN 296
Query: 397 NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHF 456
N+++ + P ++++ EA+ Q P + + KKV+D N+ W R ++ K F
Sbjct: 297 NVDSWRGIGCQAVPEVKAFISEAVNQASPEVQMPMELKKVSDENFKWIGYRFMAEKSRSF 356
Query: 457 FLNATPNVEKNSEFIENMV 475
+ + S+++E+M+
Sbjct: 357 GMFTEKQILNTSKYLESML 375
>gi|452977651|gb|EME77417.1| hypothetical protein MYCFIDRAFT_42306 [Pseudocercospora fijiensis
CIRAD86]
Length = 621
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 126/501 (25%), Positives = 208/501 (41%), Gaps = 116/501 (23%)
Query: 34 SALEQSIRQYLLKLIKTPDI-DIKVIENYISLCVELCMKD--MCNSTLPIILLSDTFDMS 90
SA+E + R+ LI + +I D E + L V L D C+ L L+ + D
Sbjct: 66 SAVETAFREIFYSLIHSTEIHDPAFGEVWNLLDVVLICGDEGRCDPGLVCWLIEELLDSQ 125
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
T + C +F Y++ RR SR+++ +FCGR+ FL
Sbjct: 126 TTEGCRTVFDYLDS--------------------------RRESRAEDAIFCGRVFFFLF 159
Query: 151 KFFPFSERSGLNIISEFNVENITEF-------GGDEEMDVSSNETEETDTEEV------- 196
+ FP ++S +N+ EF+ EN T+F G ++D + ET +DT+E
Sbjct: 160 QTFPLGDKSSVNLRGEFHTENTTKFEVSAPVEGEQMDVDFKTGETP-SDTKESTPQPAAP 218
Query: 197 -------------------DKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
++V Y FW LQ F +P + ++ ++ +F
Sbjct: 219 TKPGSKAIPIKAAPKKVAEEEVLSTEQLYPIFWRLQHDFSDPTRLFDNDNFTVFKKGLAC 278
Query: 238 VLAAFK-SYKLDDVQSSLNP--------SGD-------------YFAKYLTNQKLLDLQL 275
+A FK + + +SS+ GD Y KYLT++ L L+L
Sbjct: 279 TIAKFKKTPTVVQTKSSVEGRRGTKRKMHGDGGEEESDNTLVDNYNPKYLTSRDLFGLEL 338
Query: 276 SDTNFRRYVLLQFLILFQYF-------------TSTVKSRGEGLELKSDQEKWVKDTTET 322
SD F+R++L+Q LIL + ST ++ L + KW T T
Sbjct: 339 SDLAFQRHILVQALILIDFLLSLTEKAKKRLNAVSTNRALIHAFVLNEEDAKWATSTRST 398
Query: 323 VYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKK 382
+ S + + +G H+ ++V+ +L +++W +WK E CP + R S T+E+ A K
Sbjct: 399 ITSHLNMS-NEGRHYCRMVETVLARDKNWVRWKVESCPSIVRGPVS-TEEELLARTGAAK 456
Query: 383 KKTPELT--KLWN----------SKDNLEACKSAERDFTPSLESYFEEA-IQQMDPAAAV 429
P K N + L+A K R PS+E E+ ++D A+
Sbjct: 457 FTRPRTIPDKPQNVMNLSFLDEVTGGGLDALKHPSRYTAPSIEQLVEQVKTDKLDAEMAM 516
Query: 430 EEQYKKVNDS---NYAWRALR 447
+E+ K ++ N WRALR
Sbjct: 517 DEEEKSSYENAIQNAKWRALR 537
>gi|116191693|ref|XP_001221659.1| hypothetical protein CHGG_05564 [Chaetomium globosum CBS 148.51]
gi|88181477|gb|EAQ88945.1| hypothetical protein CHGG_05564 [Chaetomium globosum CBS 148.51]
Length = 718
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 108/409 (26%), Positives = 181/409 (44%), Gaps = 56/409 (13%)
Query: 72 DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMC 126
++C+ L L+ + D T+ C ++F ++E + +KQ+ +LR C
Sbjct: 116 ELCDPALLFWLVEELLDSQTIAGCRKVFDFLESRRERITSKHFKQKQLV------ILRTC 169
Query: 127 NDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSN 186
N+LLRRLSR+ + FCGR+ +F+ + FP ++S +N+ E++VEN+T F D+E ++
Sbjct: 170 NELLRRLSRALDPAFCGRVFIFMFQSFPLGDKSSVNLRGEYHVENVTTF--DQETPIADG 227
Query: 187 ETEETDTE---EVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
E + DT+ + DK Y FWSLQ+ F P + + + F + E +A F
Sbjct: 228 EKMDVDTDAKPQKDKPLDPDTLYPIFWSLQESFNQPKKLFESSHFSAFKAGLEATMATF- 286
Query: 244 SYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF--QYFTSTVKS 301
++NP A+ ++ + S R V + F +Y TS
Sbjct: 287 --------WTINPEKPTRAREKQDKPDEEANPSLKRKRNDVDDELASGFNPKYLTSRDLF 338
Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
+ LEL + KW D V + Q DG F ++V +L +++W +WK E CP
Sbjct: 339 K---LELSEEDTKWAIDMRAGVVKYL-QKGFDGPFFDRMVGTVLVRDKNWVRWKIENCPS 394
Query: 362 LKRPLTS---ITDEDKKDEPDAKKKKTP------ELTKLWNSKDNLEACKSAERDFTPSL 412
++ P S T+ A+ ++ P E + K +E K +R P L
Sbjct: 395 IELPTLSPGTFTEARAAATRLARTRRLPNESRSLEFLGEEDEKSAMEKLKDPQRYSLPEL 454
Query: 413 ESY----------FEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
S+ E A +A+E + K AWR LR+ S+
Sbjct: 455 GSFKDGIEDDDFDIERATNNESKTSAIESKASK------AWRTLRIASK 497
>gi|50549211|ref|XP_502076.1| YALI0C21131p [Yarrowia lipolytica]
gi|49647943|emb|CAG82396.1| YALI0C21131p [Yarrowia lipolytica CLIB122]
Length = 724
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 112/426 (26%), Positives = 163/426 (38%), Gaps = 124/426 (29%)
Query: 71 KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVN-----VNIWKQQTFFMSC------- 118
K + STL + L+ D FD+ T+D C + + V+ +N+ + Q S
Sbjct: 92 KPLLESTLTLALVEDIFDIHTVDWCRHFWTHYLVSREEQLMNVSRNQASSNSVTRLSGRK 151
Query: 119 --KNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
L+R+ N LLRR+S+S++ VF G IL+FL++ FP SERSGLN EFNVENIT F
Sbjct: 152 APGTTLIRLANALLRRVSKSRDAVFAGEILIFLSRAFPLSERSGLNQKGEFNVENITVFE 211
Query: 177 GDE----------------------------------EMDVSSNETEETDTEEVDKVKID 202
G E ++D+ +E +E E+
Sbjct: 212 GYEGAKEHKEVDEDEDEDDDEDDNDDDDEDENENEDKDVDMGVDEMKEKVKEKTATPATS 271
Query: 203 FNF--------YKKFWSLQDYFRNPVQCYN---------------KVSWKMFTSY---AE 236
Y FW LQ YF +P N K + K S+ E
Sbjct: 272 KEHTQEPSDSPYALFWGLQKYFADPTILLNSKEPEVEMNKLQTCLKATVKTLRSHDGSTE 331
Query: 237 TVLAAFKSYKLDDVQSSLNPSGDYF-----------------------------AKYLTN 267
+ D + N G+ AK+LTN
Sbjct: 332 IAQGGGTRWAKRDSREGGNRGGNTSAAGGGSGGSRDFGNSRGHSANNHSSRTLNAKWLTN 391
Query: 268 QKLLDLQLSDTNFRRYVLLQFLILFQYF-------------TSTVKSRGEGLELKSDQEK 314
L LQLSDTNFRR + Q LI ++ T+T +S L + E
Sbjct: 392 PDLFTLQLSDTNFRRAIWTQILIFSEFMLSLTEKNKAKIPATATNRSVQYQYTLSAADEI 451
Query: 315 WVKDTTETVYSLIKQTPP--DGEH------FSQVVKLILKGEEHWNQWKNEGCPELKRPL 366
+ + S K PP G S+++ + K +E W +WK E CP ++P+
Sbjct: 452 YFESVVAHFESSGKNKPPYKPGHSPDFNAVLSRIIPHVFKFDEKWVEWKLENCPAYEKPV 511
Query: 367 TSITDE 372
+ DE
Sbjct: 512 MNNIDE 517
>gi|388857770|emb|CCF48664.1| related to nuclear matrix protein p84 [Ustilago hordei]
Length = 865
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 76/216 (35%), Positives = 117/216 (54%), Gaps = 15/216 (6%)
Query: 68 LCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN----LL 123
L + D+ + TLP+ L+ + + + C QLF Y+E V K T + LL
Sbjct: 132 LSLYDLIDPTLPLSLVEELIEGQPISICSQLFGYIESRV---KPLTIGLHSTRGKGLVLL 188
Query: 124 RMCNDLLRRLSRS--QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
R CN+LLRRLS+ Q+TVF GRIL LAK FP ERSG+N+ +FNVEN T +E +
Sbjct: 189 RTCNELLRRLSKPSQQHTVFAGRILSLLAKVFPLGERSGVNLRGDFNVENKTHI--EENV 246
Query: 182 DVSSNETEET-DTEEVDKV-KIDFNFYKKFWSLQDYFRN-PVQCYNKVSWKMFTSYAETV 238
D ++ E E+ D +E D++ K+D+ FY+ FW +Q +F N P+ + + T A
Sbjct: 247 DDATEEVEDAEDNDERDQIFKVDYEFYELFWKVQRFFSNSPLLMGAHETSRPSTRSATPA 306
Query: 239 LAAFKSYKLDDVQSSLNPSGDYFAKY-LTNQKLLDL 273
A + D + + + A++ ++ QK+L L
Sbjct: 307 DAGGGEHTNADEDPAPDGAKGAMAEFRISTQKILKL 342
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 134/297 (45%), Gaps = 77/297 (25%)
Query: 257 SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLE- 307
SG +F YLT +KL + ++ DT+FR++VL+Q+L+LFQY FT T K + + L+
Sbjct: 452 SGHFFPTYLTGRKLFEYEIRDTSFRKHVLVQYLVLFQYLLSFTPTAKESWKDWKNKTLQA 511
Query: 308 ---LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
L+ E+W++ T + LI++ PDG F + V +LK E +W +WK E CP +++
Sbjct: 512 LFTLEEADERWIRSTWREIQGLIREIQPDGRLFLESVLEVLKREANWIRWKTESCPSIEK 571
Query: 365 PLTSITDED----------KKDEPDAKKKKTPELTKLW--NSKDNLEACKSAERDFT--- 409
P S +K EP K T L++LW K + + E +
Sbjct: 572 PPLSPEQISAFASARALLLRKPEPYPHKLGTAALSELWLDGLKPPVAGTRRVENEIGEYV 631
Query: 410 ----------------PSLESYFEEAIQQ-------------MDPAAAVEEQYKKVND-- 438
PSLE ++ ++IQ+ +DPA E K+ D
Sbjct: 632 RIATDGLEDLEFLPPIPSLE-HYNKSIQREKMKLDMRKEQLGIDPATKEAELSKEEKDKL 690
Query: 439 -------------SNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEK 482
+ AWRALR+ SR H F P+ +N IE ++K +E+
Sbjct: 691 DKDERIQATIETMQSLAWRALRVASRDNLHLF--GKPDRPEN---IEGLLKAMEEER 742
>gi|443900289|dbj|GAC77615.1| nuclear matrix protein [Pseudozyma antarctica T-34]
Length = 840
Score = 119 bits (298), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 69/198 (34%), Positives = 104/198 (52%), Gaps = 9/198 (4%)
Query: 68 LCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMC 126
L + D+ + TLP+ L+ + + + C QLF Y E V K LLR C
Sbjct: 129 LSLHDLIDPTLPLSLVEELLEGLPISTCSQLFAYTESRVKPLTVDLHPTRGKGLVLLRTC 188
Query: 127 NDLLRRLSRS--QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVS 184
N+LLRRLS+ +TVF GR+L LA FP ERSG+N+ +FNVEN T EE D
Sbjct: 189 NELLRRLSKPSQHHTVFAGRVLSLLAAVFPLGERSGVNLRGDFNVENKTHI---EENDSQ 245
Query: 185 SNETE--ETDTEEVDKV-KIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAA 241
+ E + D +E D++ K+D++FY+ FW +Q +F NP ++ + T
Sbjct: 246 AAEEASIDADRDERDEIFKVDYDFYELFWKMQRFFSNPPLLMGASESSRPSTRSATPALH 305
Query: 242 FKSYKLDDVQSSLNPSGD 259
++ D+VQ+ +P+ D
Sbjct: 306 TEAADSDNVQADDDPAPD 323
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 88/350 (25%), Positives = 144/350 (41%), Gaps = 82/350 (23%)
Query: 262 AKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLE----LK 309
AK L L + ++ DT+FR+++L+Q+L+LFQY FT T K + + L+ L+
Sbjct: 415 AKRLRPDVLFEYEIRDTSFRKHILVQYLVLFQYLLSFTPTAKESWKDWKNKTLQALFTLE 474
Query: 310 SDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSI 369
E+W++ T + LI+ P G F + V +LK E +W +WK E CP +++P S
Sbjct: 475 DVDERWIRSTWREILILIRDIQPHGRLFLESVLEVLKREANWIRWKTESCPSIEKPPLSP 534
Query: 370 TDED----------KKDEPDAKKKKTPELTKLWNSKDNLEACKSAER------------- 406
+K EP K T L++LW D L + R
Sbjct: 535 EQISAFASARALLLRKPEPYPHKLGTAALSELW--LDGLRPPVAGTRRVENELGEYISVP 592
Query: 407 -------DFTPSLES--YFEEAIQQ-------------MDP----AAAVEEQYKKVND-- 438
+F P + S ++ ++IQ+ +DP A +E +K++
Sbjct: 593 TDGLEELEFLPPIPSVEHYNKSIQREKMKLDLRKKQLGIDPTISEAQLSKEDREKLDKDE 652
Query: 439 ---------SNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE---KPSSQ 486
+ AWRALR+ SR H F + E IE +++ E KP S
Sbjct: 653 RIQATTETMQSLAWRALRVASRDHLHLF-----GKPERPEHIEGLLEAIETERNPKPKSV 707
Query: 487 ISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEGDADEVMSVE 536
++ + + +E+ E+ E AD DA E + E
Sbjct: 708 APVETQATEEGAVQAQGPQDAEKAAEQSAELAQAGTADGPADAGETRTNE 757
>gi|353236953|emb|CCA68937.1| related to nuclear matrix protein p84 [Piriformospora indica DSM
11827]
Length = 673
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 96/296 (32%), Positives = 141/296 (47%), Gaps = 61/296 (20%)
Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV------------ 169
+LR N+LLR LS+S T FC RILLF++ FP SERSG+N+ E+
Sbjct: 97 VLRTLNELLRSLSKSTETEFCARILLFMSAVFPLSERSGVNLRGEYGPPWEGPPPYQEGK 156
Query: 170 ENITEFGGDEEM-------DVSSNETEET-DTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ 221
E I E EE D + E ++T D E D+ + YK FW LQ F P
Sbjct: 157 ELILEEAKKEETRQGARHDDAGATEMDDTVDPSEEDRKSL---LYKTFWFLQYPFSRPPA 213
Query: 222 CYNKVSWKMF-------------TSYAETVLAAFKSY-----KLD---DVQ----SSLNP 256
+K F + E +LA K+ K D DVQ + +
Sbjct: 214 FAEADMFKAFQHAVNKILPILSEATKKERMLAGSKTLAGSKRKRDVEMDVQKVSYTVQDE 273
Query: 257 SGDY-FAKYLTNQKLLDLQLSDTNFRR---YVLLQFLILFQYFTSTVKSRGEGLELKS-- 310
DY FAKYL++ +LL+L+++DTNFRR + LL L + FT K++ ++++
Sbjct: 274 HADYAFAKYLSSPELLELEIADTNFRRQFLFQLLILLQYLRSFTEQEKAKHPEVKMRPFN 333
Query: 311 -------DQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
+E WVK+ V ++ T PDG F +++++IL+ E +W WKN C
Sbjct: 334 IDFTLPESEENWVKEVNSRVMLELRTTTPDGRAFEEIIRIILERESNWVNWKNSQC 389
>gi|76154476|gb|AAX25952.2| SJCHGC07441 protein [Schistosoma japonicum]
Length = 170
Score = 117 bits (292), Expect = 2e-23, Method: Composition-based stats.
Identities = 55/134 (41%), Positives = 87/134 (64%), Gaps = 2/134 (1%)
Query: 31 DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
DKKS ++ ++R +L+L+ + +D ++ + + V + ++C++ P +L SD F+
Sbjct: 39 DKKSLIDVALRGRMLRLLASDYMDD--VKTLLKMGVAAALANLCSAPTPFLLFSDVFNTK 96
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
+ CE++F ++E + K F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL
Sbjct: 97 PISLCEEMFGFMEETIITLKDVPLFGSGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 156
Query: 151 KFFPFSERSGLNII 164
+ FP E+SGLN +
Sbjct: 157 RLFPLDEKSGLNFM 170
>gi|164656609|ref|XP_001729432.1| hypothetical protein MGL_3467 [Malassezia globosa CBS 7966]
gi|159103323|gb|EDP42218.1| hypothetical protein MGL_3467 [Malassezia globosa CBS 7966]
Length = 543
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 106/404 (26%), Positives = 176/404 (43%), Gaps = 78/404 (19%)
Query: 122 LLRMCNDLLRRLSRS--QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDE 179
LLR+CN+LLRRLS++ Q+T+ GRI L L+ F SERSG+N+ EF+ ++
Sbjct: 18 LLRLCNELLRRLSKTHPQHTMLAGRIFLLLSSTFAISERSGVNLKGEFDT--------NQ 69
Query: 180 EMDVSS-NETEETDTEEVDKVKIDF-NFYKKFWSLQDYFRNPVQCYNKV---------SW 228
+D+ + NE E +E ++ I FY FWSLQ YF NP +++ +
Sbjct: 70 TIDIEAVNEHCEPPSETKPEILIHHPQFYVLFWSLQRYFTNPTLLFSESDAPGAKAAEAL 129
Query: 229 KMFTSYAETVLAAFKSYK--LDDVQSSLNPSGD-----------------YFAKYLTNQK 269
+ A T + F++ + DV LN D + +Y Q
Sbjct: 130 GLQERQASTPMHTFQTGIQCVLDVMRWLNKKSDSPPKTKRARFETSASSPTYPRYRCTQT 189
Query: 270 LLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLE------------LKSDQEKWVK 317
L +L + F++ L+Q LI FQY + E + LK ++E+W++
Sbjct: 190 LFGYELHNLAFQQQYLVQCLITFQYLLGQTSATHEQSKEWKNKLLVPMHTLKQEEEQWLR 249
Query: 318 DTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR----PLTSITDED 373
T + ++++++ +G F V ++L+ E W +WK P + P T D
Sbjct: 250 KTWRQIQTVLRESSQEGRVFLDAVLMLLRRESSWIRWKAASAPSFHKDALNPDTLQAWSD 309
Query: 374 KKDEPDAKKKK-------TPELTKLWNSKDNLEACKSAE--------RDFTPSLESYFEE 418
K A+++ TPEL++LW +D + +E D T + + E
Sbjct: 310 KIRTTFAQQEPWFPHALGTPELSRLW--EDGFHVAEPSEMHVNDEEGHDKTIRTDGWEEL 367
Query: 419 AIQQMDPA-----AAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
+ P+ A+E + +WRALR S + H F
Sbjct: 368 KFPRAPPSLRSLCRAIEHCTDPDQAQSLSWRALRCASLEHLHVF 411
>gi|343425449|emb|CBQ68984.1| related to nuclear matrix protein p84 [Sporisorium reilianum SRZ2]
Length = 865
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 63/156 (40%), Positives = 89/156 (57%), Gaps = 7/156 (4%)
Query: 68 LCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMC 126
L + D+ + +LP+ L+ + + + C +LF Y E V K LLR C
Sbjct: 133 LSLHDLVDPSLPLSLIEELIEGQPISTCSRLFAYTESRVQPLTIDLHPTRGKGLVLLRTC 192
Query: 127 NDLLRRLSRS--QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVS 184
N+LLRRLS+ Q+TVF GRIL LAK FP ERSG+N+ EFNVEN T E+
Sbjct: 193 NELLRRLSKPSQQHTVFAGRILSLLAKVFPLGERSGVNLRGEFNVENKTHI---EDRAQD 249
Query: 185 SNETEETDTEEVDKV-KIDFNFYKKFWSLQDYFRNP 219
++E ++E D+V K+D+ FY+ FW +Q +F NP
Sbjct: 250 ADEDAADGSDEKDEVFKVDYEFYELFWKVQRFFSNP 285
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 77/274 (28%), Positives = 122/274 (44%), Gaps = 78/274 (28%)
Query: 258 GDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLE-- 307
G +F YLT +KL + ++ DT+FR+++L+Q+ +LFQY FT K + + L+
Sbjct: 451 GHFFPTYLTGRKLFEHEIRDTSFRKHILVQYFVLFQYLLSFTPLAKESWKDWKNKTLQAL 510
Query: 308 --LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
L+ E+W++ T + LI++ PDG F + V +LK E +W +WK E CP +++P
Sbjct: 511 FTLEDADERWIRSTWREIQGLIREIQPDGRLFLESVLEVLKREANWIRWKTESCPSIEKP 570
Query: 366 LTSITDED------------KKDEPDAKKKKTPELTKLWNSKDNLEACKSAER------- 406
++ E +K EP K T L++LW +D L R
Sbjct: 571 --PLSPEQIAQFASARALLLRKPEPYPHKLGTAALSELW--QDGLRPPVPGTRRVENEMG 626
Query: 407 -------------DFTPSLES--YFEEAIQ-----------QMDPAAAV-------EEQY 433
+F P + S ++ ++IQ Q+ AAV EE+
Sbjct: 627 EYVTIATDGLEELEFLPPIPSLEHYNKSIQREKMKLDLRKKQLGIDAAVGEAQLSKEERE 686
Query: 434 KKVND----------SNYAWRALRLLSRKCPHFF 457
K D + AWRALR+ SR H F
Sbjct: 687 KLDKDERVQATIESMQSLAWRALRVASRDNLHLF 720
>gi|392578305|gb|EIW71433.1| hypothetical protein TREMEDRAFT_28261 [Tremella mesenterica DSM
1558]
Length = 670
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 109/457 (23%), Positives = 190/457 (41%), Gaps = 86/457 (18%)
Query: 80 IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRLSRS-Q 137
I LL + ++ ++ L+ YVE + + K LLR+ N LR L R+ +
Sbjct: 145 IPLLEELVELIPVETWRDLWAYVETRAKRFTKDMPASRGKALPLLRIINSFLRFLPRTPE 204
Query: 138 NTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEE------- 190
+ +F GR+ F + +++S +N+ ++N T D + N + E
Sbjct: 205 DLIFRGRVHQFASSVISVADKSAINMRGDYNDIKTTWDESDTSQSSTLNASAEKEEDGDV 264
Query: 191 ---TDTEEVDKVKIDFNFYKKFWSLQDYFRNP-------VQC----YNKVSWKMFTSYAE 236
+D +E K + FY WSLQ YF P VQ + ++ F + E
Sbjct: 265 KMQSDVKEEVKASTETEFYSTLWSLQHYFAYPPSLDGPAVQSNPGEPARTPFETFKAKTE 324
Query: 237 TVLAAF-------------------KSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSD 277
VL + K + QS + G + ++LT ++L D +L+D
Sbjct: 325 FVLPKLFEQTQKEKEMMGKDAEVISRKRKRSEGQSDASARGFFHPRFLTGKRLFDHELTD 384
Query: 278 TNFRRYVLLQFLILFQYF------TSTVKSRGEGLE----LKSDQEKWVKDTTETVYSLI 327
++FRR +L+Q+ ILFQ+ T+ ++ G+ + + E W+K + +
Sbjct: 385 SSFRRQILVQYFILFQFLLNLTPSTAAKQAFTGGMPKTFIVDQENEDWIKSKVAEIKEEL 444
Query: 328 KQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE-------LKRPLTSITDED--KKDEP 378
++ PDG F + V I+ E ++ QWKN+ CPE L + + E K+ P
Sbjct: 445 RKMIPDGPMFEETVLSIITRERYYAQWKNDSCPEGVFEVPPLDKAVAKKAAEQWRKRSAP 504
Query: 379 DAK---KKKTPELTKLWNSK----DNLEACKSAERDFTPSLESYFEEAIQQMDP------ 425
++ K + L++LWN D L+ +S P+ +E I++++
Sbjct: 505 PSQYPYKLGSRSLSRLWNRGFTGIDQLKGWRS------PATAVSLDEEIKRIEADDEDDK 558
Query: 426 -----AAAVEEQYKKVNDSNYAWRALRLLSRK-CPHF 456
+A E K + WR LR SR HF
Sbjct: 559 AMGKEPSAEEAALNKDRKTRLTWRGLRQASRTDLRHF 595
>gi|71020959|ref|XP_760710.1| hypothetical protein UM04563.1 [Ustilago maydis 521]
gi|46100304|gb|EAK85537.1| hypothetical protein UM04563.1 [Ustilago maydis 521]
Length = 879
Score = 111 bits (278), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 66/164 (40%), Positives = 93/164 (56%), Gaps = 5/164 (3%)
Query: 60 NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK 119
+ + + + L ++D+ + TLP+ L+ + + + C +LF Y E V K
Sbjct: 96 DRLDVILVLSLQDLVDPTLPLSLVEELIESQPISTCSRLFGYTESRVKPLTVDLHPTRGK 155
Query: 120 N-NLLRMCNDLLRRLSRS--QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
LLR CN+LLRRLS+ ++TVF GRIL LAK FP ERSG+N+ EFNVEN T
Sbjct: 156 GLVLLRTCNELLRRLSKPSQEHTVFAGRILSLLAKVFPLGERSGVNLRGEFNVENKTSIE 215
Query: 177 GDEEMDVSSNETEETDTEEVDKV-KIDFNFYKKFWSLQDYFRNP 219
E D +E D +E DKV K+D+ FY+ FW +Q +F NP
Sbjct: 216 EKVE-DAKPEADDEADRDERDKVFKVDYEFYEVFWKVQRFFSNP 258
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 88/345 (25%), Positives = 148/345 (42%), Gaps = 84/345 (24%)
Query: 258 GDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEG------------ 305
G +F KYLT +KL + ++ +T+FR+++L+Q+ +LFQY S + E
Sbjct: 427 GHFFPKYLTGRKLFEYEIRNTSFRKHILVQYFVLFQYLLSYTPAAKESWKDWKNKTLQAL 486
Query: 306 LELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
L+ E+W++ T + +L++ PDG F V +L+ E +W +WK E CP +++P
Sbjct: 487 FTLEEADERWIRSTWREIQALMRDIQPDGRLFLDSVLEVLRREANWIRWKTESCPSIEKP 546
Query: 366 LTSITDED------------KKDEPDAKKKKTPELTKLWNS--KDNLEACKSAERDF--- 408
++ E +K EP K T L++LW + + + AE +
Sbjct: 547 --PLSPEQLAQFASARALLLRKAEPYPHKLGTAALSELWEDGLRPPVAGTRRAENEMGEY 604
Query: 409 ----------------TPSLESYFEEAIQQ-------------MDPAAA-----VEEQYK 434
PSLE ++ ++IQ+ +DPA + E+Q K
Sbjct: 605 VTVATDGLEELEFAPAAPSLE-HYNKSIQREKMKLDLRKKQLGIDPAVSETNLSKEDQSK 663
Query: 435 KVND----------SNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPS 484
D + AWRALR+ SR H F P+ +N E + ++ KP
Sbjct: 664 LDKDDRVQATIETMQSLAWRALRVASRDNLHLF--GKPDRPENIERLLQAMEEERNPKP- 720
Query: 485 SQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEGDA 529
N P E DT+S + + + EA+A+ + DA
Sbjct: 721 -----NAVTPSAQPTSDESDTRSGQEANGDTSKGKEAEAEAKTDA 760
>gi|406696339|gb|EKC99630.1| hypothetical protein A1Q2_06049 [Trichosporon asahii var. asahii
CBS 8904]
Length = 722
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 103/367 (28%), Positives = 155/367 (42%), Gaps = 66/367 (17%)
Query: 80 IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRLSRSQN 138
I LL + ++ +++ F Y E + + K LLR N LR LS +
Sbjct: 140 IPLLEEFVELLSVETWHSFFSYFETREKRFTRDMPASKGKALPLLRTINTFLRTLSSTPA 199
Query: 139 TV-FCGRILLFLAKFFPFSERSGLNIISEFN----------VENITEFGGDEEMDVSSNE 187
+ GR+ LF ++ S++S +N+ E++ E GD EMD + +
Sbjct: 200 DLQLRGRVQLFASRAISVSDKSAINMRGEYSGIRTTWEETPEETAAGADGDVEMDGGAAK 259
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
E+ + E DK FY WSLQ YF +P + + A+T F+S
Sbjct: 260 GEKGEKSEDDK------FYNTLWSLQQYFAHPPSLDGPATGEP----AQTPFERFRSAS- 308
Query: 248 DDVQSSL---------NPSGDY-----------FAKYLTNQKLLDLQLSDTNFRRYVLLQ 287
D+V L NP+ F +YLT + LL +L+DT FRR VLLQ
Sbjct: 309 DEVLERLFAETARERANPTPKVDRGPDAETEVVFPRYLTARALLRHELADTGFRRQVLLQ 368
Query: 288 FLILFQY---FTSTVKSR--GEGLELKS-----DQEKWVKDTTETVYSLIKQT-PPDGEH 336
+ ILFQ+ T V S+ G KS D +W+K T + +K+ DG
Sbjct: 369 YFILFQFLLNLTPGVASKQQATGGMPKSFIIDDDAARWIKSTVSRIRDELKRIDSTDGLR 428
Query: 337 FSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD------------EDKKDEPDAKKKK 384
F + V ++ E + QWKN+GCPE + + + K P K+
Sbjct: 429 FEETVVELMNRERRYAQWKNDGCPEADWAIAPVPPTEGKATARKWERQSKPPRPWPYKQG 488
Query: 385 TPELTKL 391
TP LT+L
Sbjct: 489 TPTLTRL 495
>gi|401887333|gb|EJT51323.1| hypothetical protein A1Q1_07504 [Trichosporon asahii var. asahii
CBS 2479]
Length = 722
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 96/365 (26%), Positives = 153/365 (41%), Gaps = 62/365 (16%)
Query: 80 IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRLSRSQN 138
I LL + ++ +++ F Y E + + K LLR N LR LS +
Sbjct: 140 IPLLEEFVELLSVETWHSFFSYFETREKRFTRDMPASKGKALPLLRTINTFLRTLSSTPA 199
Query: 139 TV-FCGRILLFLAKFFPFSERSGLNIISEFN----------VENITEFGGDEEMDVSSNE 187
+ GR+ LF ++ S++S +N+ E++ E GD EMD + +
Sbjct: 200 DLQLRGRVQLFASRAISVSDKSAINMRGEYSGIRTTWEETPEETAAGADGDVEMDGGAAK 259
Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVS-------WKMFTSYAETVLA 240
E+ + E DK FY WSLQ YF +P + ++ F S ++ VL
Sbjct: 260 GEKGEKSEDDK------FYNTLWSLQQYFAHPPSLDGPATGEPAQTPFERFRSASDEVLE 313
Query: 241 AFKSYKLDDVQSSLNPSGDY-----------FAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
+ + + NP+ F +YLT + LL +L+D FRR VLLQ+
Sbjct: 314 RLFA---ETARERANPTPKVDRGPDAETEVVFPRYLTARALLRHELADIGFRRQVLLQYF 370
Query: 290 ILFQY---FTSTVKSRGEGLE-------LKSDQEKWVKDTTETVYSLIKQT-PPDGEHFS 338
ILFQ+ T V S+ + + D +W+K T + +K+ DG F
Sbjct: 371 ILFQFLLNLTPGVASKQQATGGMPKSFIIDDDAARWIKSTVSRIRDELKRIDSTDGLRFE 430
Query: 339 QVVKLILKGEEHWNQWKNEGCPELKRPLTSITD------------EDKKDEPDAKKKKTP 386
+ V ++ E + QWKN+GCPE + + + K P K+ TP
Sbjct: 431 ETVVELMNRERRYAQWKNDGCPEADWAIAPVPPTEGKATARKWERQSKPPRPWPYKQGTP 490
Query: 387 ELTKL 391
LT+L
Sbjct: 491 TLTRL 495
>gi|213408931|ref|XP_002175236.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
gi|212003283|gb|EEB08943.1| conserved hypothetical protein [Schizosaccharomyces japonicus
yFS275]
Length = 690
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 105/419 (25%), Positives = 175/419 (41%), Gaps = 62/419 (14%)
Query: 82 LLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVF 141
L+ + D+ ++ CE+L+ Y + + LLR+ N+LLRRLSR + F
Sbjct: 106 LMEELVDIHSVQDCEKLYEYFTTRPLVTVGLSGTRGKGPVLLRISNNLLRRLSRVDHPAF 165
Query: 142 CGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEE----TDTEEVD 197
CGRI +FL+ F ERSG N+ EF++ N E ++V E T E
Sbjct: 166 CGRIHMFLSSVFGPHERSGANLRGEFDIAN-------EPVNVEDTLPENIRALTTAETRI 218
Query: 198 KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPS 257
K + + ++ WSLQ+Y NP + Y+ +MFT+ + VL F V + + S
Sbjct: 219 KQSVLDDVFEAIWSLQNYMSNPTKLYSSQEIEMFTTNTQKVLECFNGVHKLSVFQNRDDS 278
Query: 258 GD-------------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---F 295
D +Y+ + +LL QL D F L+Q IL + F
Sbjct: 279 VDLHKRFQILLRRAFSEKHLLLLNRYIRSHELLAYQLLDEQFSVEYLVQISILLNHLLDF 338
Query: 296 TSTVKSRGEGLE-----------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLI 344
T + + + L+ L S+ EK + + L+ Q P + ++ +
Sbjct: 339 TQENRKKQDSLQFTNKALISSFVLDSEDEKKLLKVKQATEKLLDQVPESV--LTNTLRSL 396
Query: 345 LKGEEHWNQWKNEGCPELKRPLTSITDEDKKDE-------PDAKKK---KTPELTKLWNS 394
E +W WK C +++PL + + E P + T L+K+W+S
Sbjct: 397 FTIENNWKIWKARNCLTIEKPLLDSSFLEAAQEGLNKLTVPATALRYCMGTASLSKIWSS 456
Query: 395 --KDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAV----EEQYKKVNDSNYAWRALR 447
++NL+ + E P+ + ++ + AA E QY ++ WRALR
Sbjct: 457 SVEENLQDLQRKEATQVPAATDFLNGVEKEENEAAEAVREDERQYHLQAKASKTWRALR 515
>gi|380492396|emb|CCF34633.1| guanylate kinase, partial [Colletotrichum higginsianum]
Length = 364
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 137/296 (46%), Gaps = 65/296 (21%)
Query: 34 SALEQSIRQYLLKLIKTPDI---DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+ +E ++R LI T I D + N L L + C+ L L+ + D
Sbjct: 79 ATIETAVRNMFSNLIATTSIESPDFVKVWNLFDLLTILSDDEKCDPALLFWLVEELLDSQ 138
Query: 91 TLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
T+ C ++F ++E + V +KQ+ +LR CN+LLRRLSR+++T FCGR+
Sbjct: 139 TIAGCRKIFDFLESRRERITVKHFKQKQLV------ILRSCNELLRRLSRAEDTAFCGRV 192
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNE-TEETDTEEVDKVK---- 200
+F+ + FP ++S +N+ E++VEN+T F ++ V ++E TE+ D + + VK
Sbjct: 193 FIFMFQSFPLGDKSSVNLRGEYHVENVTVF---DDTGVKTDESTEQMDIDSQNGVKDGEV 249
Query: 201 ------IDFN---------------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVL 239
+ F+ Y +FWSLQ+ F P + ++ F E +
Sbjct: 250 KSQAKAVSFDAKQPTQPDQPLNPDALYPQFWSLQETFSQPKKLFDPSHLANFKRGLEATM 309
Query: 240 AAFKSY---------------------KLDDVQSSLNPSGDYFAKYLTNQKLLDLQ 274
+F++ K DDV+ S + + Y KYLT++ L +L+
Sbjct: 310 TSFRAIQNEQGQRPPKAVDEAQRGLKRKRDDVEGS-DLANSYNPKYLTSRDLFELE 364
>gi|219123310|ref|XP_002181970.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217406571|gb|EEC46510.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 790
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 101/414 (24%), Positives = 173/414 (41%), Gaps = 57/414 (13%)
Query: 71 KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVN--------IWK------QQTFFM 116
KDM +P +LL D D+ + Q F+ V + +W +
Sbjct: 109 KDMSVRKVPFVLLEDILDVLSSPLALQ-FWSSRVRPSYDFLFAPTLWSPIRGDSAASPSH 167
Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
C L++ LRRL IL+ L+ +P SE+S + E+ TE+
Sbjct: 168 PCWLPFLKISQKFLRRLVPEA----AAPILVQLSTVYPLSEKSATKVWGSHG-ESTTEYD 222
Query: 177 GDEEMDVSSNE-TEETDTEEVDKVKI-DFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSY 234
E+ T +T T + D++FY+ FW LQ+ NP N + F S
Sbjct: 223 SLEDFHKEEQSITLDTTTPNGSSSSVYDYSFYESFWRLQEDLSNP----NSIKVAGFLSR 278
Query: 235 AETVLAAFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY 294
+++ AF++ D S +P + + YLT+ ++L +QL D +F+ +VL QFLI+ ++
Sbjct: 279 VRSMMTAFENQTCDTNTSQDSPI-NLYGHYLTSSRVLAIQLLDASFQIHVLTQFLIVAKH 337
Query: 295 FTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH-WNQ 353
+ V + + T L + G H +++ + +G E W
Sbjct: 338 LMAQVPVL---------ESQLADHVTRAKNRLQLELGDAGRHQLELLHHLWQGSESLWRD 388
Query: 354 WKNEGCPE-----------------LKRPLTSITDEDKKDEPDAKKKKTPELTKLWNSKD 396
WK + CP KR L ++ + + DA ++ T L D
Sbjct: 389 WKRKKCPADIDAPKLALSAAGGSPPRKRLLGALGSGNGESN-DADERNTD--YSLAQVHD 445
Query: 397 NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLS 450
L A + P L ++ ++ ++ +DP A +E +Y N++ + WRALRLLS
Sbjct: 446 ELPALSKRMKRLAPDLYTHLQDYVEALDPDAGIEAEYHPKNNALFGWRALRLLS 499
>gi|342319244|gb|EGU11194.1| Hypothetical Protein RTG_02997 [Rhodotorula glutinis ATCC 204091]
Length = 727
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 152/363 (41%), Gaps = 65/363 (17%)
Query: 77 TLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRL--- 133
TLP+ +L+ ++ + CE + Y+E V + + + +L + L R
Sbjct: 116 TLPLSVLAALMEVRPISACEPIVAYIESRVGPLTKGMEYQRGRGPILLRLLNDLLRRLPR 175
Query: 134 SRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------------GGDE 179
S+SQ+ + GRIL+ L+ +P E+SG+N+ FN+ T F E
Sbjct: 176 SQSQSVILSGRILMLLSSAYPLGEKSGVNLRGNFNIGKGTVFEQEAEKEMEEAKEDAKKE 235
Query: 180 EMDVSSNETEETDTEEVDKVKIDFN---FYKKFWSLQDYFRNPVQCYNKV---------- 226
++ + E EE + + + + N FY FWSLQ F NP +
Sbjct: 236 DVKMEVEEGEEEELPKDAENPAESNPSQFYTTFWSLQRIFNNPPLLFAAPPVASTSADAD 295
Query: 227 SWKMFTSYAETVLAAF-----KSYKLDDVQSSLNPSGD---------------YFAKYLT 266
++ + LAAF K +L SG +F K+LT
Sbjct: 296 PFRTLKNGLRQTLAAFAAATKKEKELAGSAKEGAVSGKAKESEVDAEEGLEEYFFPKFLT 355
Query: 267 NQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKSRGEGLE-----------LKSDQ 312
++ LL+L+L+ NFRR +L+Q LILFQY T ++R + L+++
Sbjct: 356 SRNLLELELAAVNFRRQILVQMLILFQYLLSLTPAERTRVQSFHPLNAAALSPFVLQTEN 415
Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
E W+++ + G F + V+L+L+ E++W WK C +P E
Sbjct: 416 ETWIREMRSRTLDELDSM-DGGRRFRKAVQLVLQREQNWMDWKLRSCAPFTKPSLDAAAE 474
Query: 373 DKK 375
+K
Sbjct: 475 SEK 477
>gi|134113815|ref|XP_774492.1| hypothetical protein CNBG1380 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257130|gb|EAL19845.1| hypothetical protein CNBG1380 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 761
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 110/446 (24%), Positives = 174/446 (39%), Gaps = 117/446 (26%)
Query: 122 LLRMCNDLLRRLSRS-QNTVFCGRILLFLAKFFPFSERSGLNIISEFN------------ 168
LLR N LR L R+ ++ VF GRI F + F +++S +N+ +++
Sbjct: 175 LLRTINAFLRFLPRTPEDLVFRGRIHQFASSVFSVADKSAINMRGDYSEVKTVWEEEPEM 234
Query: 169 ---------------VENITEFGGDEEMDVSSNETEETD--TEEVDKVKIDFN------- 204
++ E GG ++ DV + E+ D TE D +
Sbjct: 235 EKIKVEEKEENKEGKMDQRAEEGGKKDGDVEMEDAEKKDEHTEHSSNQPADQSMEQPTED 294
Query: 205 -------------FYKKFWSLQDYFRNP-------VQCYNKVSWKMFTSYAETVLA---- 240
FY WSLQ YF +P V K ++ F ++ VL
Sbjct: 295 EPPLTPATLQEPEFYSTLWSLQQYFSHPPSLDGPAVGSPPKTPFQTFREKSDFVLPQLFE 354
Query: 241 -------------AFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQ 287
+ V + G + ++LT ++L + +L+D++FRR +L+Q
Sbjct: 355 QTRKEKAMAGTDDGLGKKRKRQVMEGQDTGGFFHPRFLTGKRLFEYELADSSFRRQILVQ 414
Query: 288 FLILFQYFTSTVKSRG------EGLE----LKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
+ ILFQ+ + + G+ L+ E+WVK + +K G F
Sbjct: 415 YFILFQFLLNLTPAHAGKQAFTGGMPRTFMLEQADEQWVKSKIGGIKEELKNI-VGGASF 473
Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPD---AKKKKTP-------- 386
V I++ E H+ QWKNEGCPE + ++ D D EP AK+ P
Sbjct: 474 EDTVFSIIRQEAHYVQWKNEGCPEGSFEIPAL-DPDSASEPAQAWAKRLNPPAPYNFKVG 532
Query: 387 --ELTKLWNSK-DNLEACKSAERDFTPSLESYFEE------------AIQQMDPAAAVEE 431
L+ LWN+ N+ K E+ T +E+ EE A+ Q P
Sbjct: 533 SRPLSMLWNNGFTNINQLKRREKATT--VEALGEEIKRIEEDEEDDKAMGQGTPEKLAAN 590
Query: 432 QYKKVNDSNYAWRALRLLSRKCPHFF 457
+ +K + WRALRL S FF
Sbjct: 591 KERKTTST---WRALRLASHTSLKFF 613
>gi|58269822|ref|XP_572067.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|57228303|gb|AAW44760.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 761
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 110/446 (24%), Positives = 174/446 (39%), Gaps = 117/446 (26%)
Query: 122 LLRMCNDLLRRLSRS-QNTVFCGRILLFLAKFFPFSERSGLNIISEFN------------ 168
LLR N LR L R+ ++ VF GRI F + F +++S +N+ +++
Sbjct: 175 LLRTINAFLRFLPRTPEDLVFRGRIHQFASSVFSVADKSAINMRGDYSEVKTVWEEEPEM 234
Query: 169 ---------------VENITEFGGDEEMDVSSNETEETD--TEEVDKVKID--------- 202
++ E GG ++ DV + E+ D TE D
Sbjct: 235 EKIKVEEKEENKEGKMDERAEEGGKKDGDVEMEDAEKKDEHTEHSSNQPADQSMEQPTED 294
Query: 203 -----------FNFYKKFWSLQDYFRNP-------VQCYNKVSWKMFTSYAETVLA---- 240
+FY WSLQ YF +P V K ++ F ++ VL
Sbjct: 295 EPPLTPAALQEPDFYSTLWSLQQYFSHPPSLDGPAVGSPPKTPFQTFREKSDFVLPQLFE 354
Query: 241 -------------AFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQ 287
+ V + G + ++LT ++L + +L+D++FRR +L+Q
Sbjct: 355 QTRKEKAMAGTDDGLGKKRKRQVMEGQDTGGFFHPRFLTGKRLFEYELADSSFRRQILVQ 414
Query: 288 FLILFQYFTSTVKSRG------EGLE----LKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
+ ILFQ+ + + G+ L+ E+WVK + +K G F
Sbjct: 415 YFILFQFLLNLTPAHAGKQAFTGGMPRTFMLEQADEQWVKSKIGGIKEELKNI-VGGASF 473
Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPD---AKKKKTP-------- 386
V I++ E H+ QWKNEGCPE + ++ D D EP AK+ P
Sbjct: 474 EDTVFSIIRQEAHYVQWKNEGCPEGSFEIPAL-DPDSASEPAQAWAKRLNPPAPYNFKVG 532
Query: 387 --ELTKLWNSK-DNLEACKSAERDFTPSLESYFEE------------AIQQMDPAAAVEE 431
L+ LWN+ N+ K E+ T +E+ EE A+ Q P
Sbjct: 533 SRPLSMLWNNGFTNINQLKRREKATT--VEALDEEIKRIEEDEEDDKAMGQGTPEKLAAN 590
Query: 432 QYKKVNDSNYAWRALRLLSRKCPHFF 457
+ +K + WRALRL S FF
Sbjct: 591 KERKTTST---WRALRLASHTSLKFF 613
>gi|405121693|gb|AFR96461.1| hypothetical protein CNAG_03235 [Cryptococcus neoformans var.
grubii H99]
Length = 777
Score = 102 bits (254), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 111/455 (24%), Positives = 182/455 (40%), Gaps = 126/455 (27%)
Query: 122 LLRMCNDLLRRLSRS-QNTVFCGRILLFLAKFFPFSERSGLNIISEFN-VENITE----- 174
LLR N LR L R+ ++ VF GRI F + F +++S +N+ +++ V+ + E
Sbjct: 175 LLRTINAFLRFLPRTPEDLVFRGRIHQFASSVFSVADKSAINMRGDYSEVKTVWEKESEV 234
Query: 175 --------------------FGGDEEMDVSSNETEETDT---EEVDKVKI---------- 201
GG ++ DV + E+ D ++ D+
Sbjct: 235 EQIKEEKKKKKEGKMEEKAEEGGKKDGDVEMEDAEKKDEPAEQQADRPTGRSTYQAADQS 294
Query: 202 ------------------DFNFYKKFWSLQDYFRNP-------VQCYNKVSWKMFTSYAE 236
D +FY WSLQ YF +P V K ++ F ++
Sbjct: 295 MEQMPTEDEAPSTPSTLKDPDFYSTLWSLQQYFSHPPSLDGPAVGSPPKTPFQTFRDKSD 354
Query: 237 TVL------------------AAFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDT 278
+L K K D ++ + G + ++LT ++L + +L+D+
Sbjct: 355 FILFQLFEQTKKEKAMTGTEDGMGKKRKRDVMEGGQDTGGFFHPRFLTGKRLFEYELADS 414
Query: 279 NFRRYVLLQFLILFQYFTSTVKSRG------EGLE----LKSDQEKWVKDTTETVYSLIK 328
+FRR +L+Q+ ILFQ+ + + G+ L+ E+WVK + +K
Sbjct: 415 SFRRQILVQYFILFQFLLNLTPAHAGKQAFTGGMPRTFMLEQTDEQWVKSKIGGIKDELK 474
Query: 329 QTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPD---AKKKKT 385
+ G F V I++ E H+ QWKNEGCPE + ++ D D EP AK+
Sbjct: 475 RI-VGGVSFEDTVFSIIRQEAHYVQWKNEGCPEASFEIPAL-DPDSASEPAQAWAKRLNP 532
Query: 386 P----------ELTKLWNSK-DNLEACKSAERDFTPSLESYFEE------------AIQQ 422
P L+ LWN+ N++ K E+ T +E+ EE A+ Q
Sbjct: 533 PAPYNFKVGSRPLSMLWNNGFTNIDQLKGREKATT--VEALDEEIKRIEEDEEDDKAMGQ 590
Query: 423 MDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
P + +K + WRALRL S FF
Sbjct: 591 DTPEKLAANKERKTTST---WRALRLASHTSLKFF 622
>gi|393246855|gb|EJD54363.1| hypothetical protein AURDEDRAFT_156143 [Auricularia delicata
TFB-10046 SS5]
Length = 619
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 74/267 (27%), Positives = 119/267 (44%), Gaps = 36/267 (13%)
Query: 49 KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNI 108
+ PD + + N + L + + C + LL D + T+D C +F ++E
Sbjct: 75 RDPDAYYQALRNMLDLVLCVAEHGACEELITWTLLEDLLKLQTVDTCWHIFLWIESRAE- 133
Query: 109 WKQQTFFMSCKNN---LLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNII 164
+ + KN LLR N+LLRRLS++ +T FCGRIL+FLA F ERSG+N+
Sbjct: 134 -RVTKNLIPGKNKALALLRTLNELLRRLSKAGSTSTFCGRILIFLANVFTLGERSGVNLR 192
Query: 165 SEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYN 224
++ E T + + E K + F+++FW+LQ P
Sbjct: 193 GDYGPE-WTPVPKPPRATPLDDLPADVRAAEERKEGMSPLFFREFWALQSPMARPQTFSM 251
Query: 225 KVSWKMFTSYAETVL-----------AAFKSYKLDDV----------QSSLNPSGD---- 259
++ MF + +TVL A S L+ + + +G+
Sbjct: 252 PETFPMFQAAVDTVLPVLAEATKRERALMGSKALNGTAAAVAAGVKRKREAHDTGEDGGK 311
Query: 260 ----YFAKYLTNQKLLDLQLSDTNFRR 282
+FAKYLT+ +LLDL+++D FRR
Sbjct: 312 KKAYFFAKYLTSPELLDLEVADPQFRR 338
>gi|321260989|ref|XP_003195214.1| hypothetical protein CGB_G2020C [Cryptococcus gattii WM276]
gi|317461687|gb|ADV23427.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 761
Score = 100 bits (250), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 109/439 (24%), Positives = 174/439 (39%), Gaps = 105/439 (23%)
Query: 122 LLRMCNDLLRRLSRS-QNTVFCGRILLFLAKFFPFSERSGLNIISEFN-VENI------- 172
LLR N LR L R+ ++ VF GRI F + F +++S +N+ +++ V+ I
Sbjct: 175 LLRTINAFLRFLPRTPEDLVFRGRIHQFASSVFSVADKSAINMRGDYSEVKTIWEEEPKA 234
Query: 173 ---------------------------------------TEFGGDEEMDVSSNETEETDT 193
TE D+ D S + E +T
Sbjct: 235 EESKVDEGKEEGKTEEKTEEEGKKDGDIEMEDAERKDEPTERPVDQPADQSMEQPTEDET 294
Query: 194 EEVDKVKIDFNFYKKFWSLQDYFRNP-------VQCYNKVSWKMFTSYAETVLAA-FKSY 245
D +FY WSLQ YF +P V K ++ F ++ VL F+
Sbjct: 295 PLTPATLQDPDFYSTLWSLQQYFSHPPSLDGPAVGSPPKTPFQTFRDKSDFVLPQLFEQT 354
Query: 246 KLDDVQSSL----------------NPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
K + + + G + ++LT ++L + +L+D++FRR +L+Q+
Sbjct: 355 KKEKAMTGTEDGMGKKRKREAMEGEDTGGFFHPRFLTGKRLFEYELADSSFRRQILVQYF 414
Query: 290 ILFQYFTSTVKSRG----------EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQ 339
ILFQ+ + + L+ E+WVK + +K+ G F
Sbjct: 415 ILFQFLLNLTPAHAGKQAFTGGMPRTFTLEQADEQWVKSKIGGIKEELKRI-IGGTSFED 473
Query: 340 VVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPD---AKKKKTP---------- 386
V I++ E H+ QWKNEGCPE + ++ + D EP AK+ P
Sbjct: 474 TVFSIIRQEAHYVQWKNEGCPEGSFEIPAL-NPDSASEPAHAWAKRLNPPAPYSFKVGSR 532
Query: 387 ELTKLWNSK-DNLEACKSAERDFT-PSLESYF------EEAIQQMDPAAAVEEQYKKVND 438
L+ LWN+ N+ K E+ T +L+ EE + M A A + K
Sbjct: 533 PLSMLWNNGFTNINQLKGREKATTVEALDEEIKRIEEDEEDDKAMGQATAEKLAANKERK 592
Query: 439 SNYAWRALRLLSRKCPHFF 457
+ WRALRL S+ FF
Sbjct: 593 TTSTWRALRLASQTSLKFF 611
>gi|380495035|emb|CCF32700.1| guanylate kinase [Colletotrichum higginsianum]
Length = 684
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 71/277 (25%), Positives = 126/277 (45%), Gaps = 65/277 (23%)
Query: 153 FPFSERSGLNIISEFNVENITEFGGDEEMDVSSNE-TEETDTEEVDKVK----------I 201
FP ++S +N+ E++VEN+T F ++ V ++E TE+ D + + VK +
Sbjct: 5 FPLGDKSSVNLRGEYHVENVTVF---DDTGVKTDESTEQMDIDSQNGVKDGEVKSQAKAV 61
Query: 202 DFN---------------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSY- 245
F+ Y +FWSLQ+ F P + ++ F E + +F++
Sbjct: 62 SFDAKQPTQPDQPLNPDALYPQFWSLQETFSQPKKLFDPSHLANFKRGLEATMTSFRAIQ 121
Query: 246 --------------------KLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVL 285
K DDV+ S + + Y KYLT++ L +L++SD +FRR++L
Sbjct: 122 NEQGQRPPKAVDEAQRGLKRKRDDVEGS-DLANSYNPKYLTSRDLFELEMSDISFRRHIL 180
Query: 286 LQFLILFQYFTSTVKSRGE--------------GLELKSDQEKWVKDTTETVYSLIKQTP 331
+Q LI+ + S E G L + KW + T+ +
Sbjct: 181 VQALIIMDFLLSLSNKTKEKLAGNAQNKSVIYLGQVLSEEDTKWASEMKRTIVDKYLKHG 240
Query: 332 PDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS 368
DG +FS++V+ +L +++W +WK+E CP ++ P S
Sbjct: 241 VDGPYFSRMVETVLARDKNWIRWKSENCPSIELPAVS 277
>gi|48474722|sp|P59924.1|THOC1_RAT RecName: Full=THO complex subunit 1; Short=Tho1; AltName:
Full=Liver regeneration-related protein LRRG175;
AltName: Full=Nuclear matrix protein p84
gi|33086686|gb|AAP92655.1| Da2-19 [Rattus norvegicus]
gi|149042358|gb|EDL96065.1| rCG63287 [Rattus norvegicus]
Length = 343
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 47/98 (47%), Positives = 63/98 (64%), Gaps = 1/98 (1%)
Query: 387 ELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRA 445
ELT+LWN DN+EACK R++ P LE +FEEAI+Q D VE +YK +N+SNY W
Sbjct: 115 ELTRLWNLCPDNMEACKLETREYMPILEEFFEEAIEQADAENMVESEYKAINNSNYGWST 174
Query: 446 LRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
LR L+ + PHFF + +E++ENMV + KE P
Sbjct: 175 LRFLAWRSPHFFQPTNQQFKNMTEYLENMVIKLAKELP 212
>gi|302834964|ref|XP_002949044.1| hypothetical protein VOLCADRAFT_104130 [Volvox carteri f.
nagariensis]
gi|300265789|gb|EFJ49979.1| hypothetical protein VOLCADRAFT_104130 [Volvox carteri f.
nagariensis]
Length = 743
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 112/460 (24%), Positives = 167/460 (36%), Gaps = 141/460 (30%)
Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
K LLRM N L RRLS++ + CGR+L+F+++ + +++SGLN+ N
Sbjct: 232 GAKLALLRMSNMLFRRLSKAYDVRLCGRLLIFISRCYELNDKSGLNVAGALNT------- 284
Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFR--NPVQCYNKVSWKM---- 230
+ + +E E + + + +ID Y FW LQ F+ PV Y KV M
Sbjct: 285 ---GLSLPVDEVEPGAVDSIGE-QIDRRLYACFWGLQAIFQVLQPV-VYGKVMADMSLVL 339
Query: 231 --FTSYAETV------LAAFKSYK------------LDDVQSSLNPS-------GDYFAK 263
F + V +AA S+ +V+ P+ G +
Sbjct: 340 DEFAAIKTPVAATGPWVAARHSHHNGQHRGAGDGTDAMEVEEGGGPATGGDEGGGGVGTR 399
Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYF-------TSTVKSRGEGLELKSDQEKWV 316
YLT+ KL LQ+ D FRR+ LLQ L L QY S + R EGLE
Sbjct: 400 YLTSSKLFGLQVRDCTFRRHFLLQCLALLQYIEQPSKADKSAIGQRPEGLEAMR-----T 454
Query: 317 KDTTETVYSLIKQTPP--DGEHFSQVVKLILKGEEHWNQWKN------------------ 356
K + T L +T +H S +LK E HW WK+
Sbjct: 455 KASERTCGKLSMRTLGWRRCQHHS-----VLKHEAHWVAWKHGTQPPASQGPAAAAAASP 509
Query: 357 ------------------EGCPELKRP-------LTSITDEDKKDEPDAKKKKTPELT-- 389
E C + RP +TS + D A K+K
Sbjct: 510 SAPPAAAATVAAVSAAATELCRDFTRPSAPPTREITSTSHRDTASGAQAPKRKRGTSGGR 569
Query: 390 ------------------KLWNSKDNLEACKSAER--------------DFTPSLESYFE 417
+ D L K +ER P+
Sbjct: 570 AGGGAAAATGGGRFAVSRGVMGLADPLAGLKVSERRTRARNLVGEEEIVPMLPTPPELLS 629
Query: 418 EAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
++ +DP VE QY++V ++ +AW+ +R L+R FF
Sbjct: 630 PILEDLDPDNDVESQYRRVREAVFAWKRMRTLARNYLEFF 669
>gi|344249117|gb|EGW05221.1| THO complex subunit 1 [Cricetulus griseus]
Length = 211
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 74/118 (62%), Gaps = 4/118 (3%)
Query: 398 LEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
+E CKS R++ P+LE +FEEAI+Q DP VE ++K VN+SNY WRALRLL+R+ PHFF
Sbjct: 1 METCKSETREYMPTLEEFFEEAIEQADPENMVESEHKAVNNSNYGWRALRLLARRSPHFF 60
Query: 458 LNATPNVEKNSEFIENMVKRCVKE--KPSSQI--SGNGNGVDQDPAEVEVDTKSEEIQ 511
+ +++ENMV + KE PS +I + N D D E ++KS +++
Sbjct: 61 QPTNQQFKTLPDYLENMVIKLAKELPPPSEEIKTGEDENEEDNDALLKENESKSPDVR 118
>gi|328851043|gb|EGG00202.1| hypothetical protein MELLADRAFT_118113 [Melampsora larici-populina
98AG31]
Length = 975
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 91/154 (59%), Gaps = 3/154 (1%)
Query: 56 KVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFF 115
+++ + + + + L +D + L + + D + +T+++ E++F Y+E V+
Sbjct: 108 QLLHDALDISLVLIEQDRAPADLTLSCIEDILETATVEEAERIFCYLESRVDRLTVGMLP 167
Query: 116 MSCKNNL-LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITE 174
K + LRM N+LLRRLS+S +TVFCGRIL FL+ FP +E+SG+N+ FN N+T
Sbjct: 168 DRGKGLIFLRMQNNLLRRLSKSLHTVFCGRILSFLSSVFPIAEKSGVNLRGAFNTGNVTH 227
Query: 175 FGGD-EEMDVSSNETEETDTEEVDKVKIDFNFYK 207
F D E+ V + +++ ++T + D V ++ N K
Sbjct: 228 FDQDIEKKGVEAVDSQGSETIKSDPV-VETNLLK 260
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 70/234 (29%), Positives = 113/234 (48%), Gaps = 47/234 (20%)
Query: 260 YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLEL---------- 308
+F K+LT LLDL+++D+ FRR VL QFLI+ +Y +++ R + L
Sbjct: 482 FFPKFLTKSNLLDLEVADSYFRRQVLTQFLIILKYIRGFSLEERNKRAALPKPNRVSEVP 541
Query: 309 ---KSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
++ + WV+ V+ + TPP+GE F+ +K IL E++W WK CP +RP
Sbjct: 542 YVYQARDDGWVRKLNGEVWKTLYATPPNGEVFALTIKQILTREQNWIMWKAGSCPSFERP 601
Query: 366 LTSITDE-DKKDEPDAKK-----------KKTPELTKLWNSKDNLEACKSAE-RDFTPSL 412
S+ DE K E + K+ TP L+KLW++K E+ +S + + P L
Sbjct: 602 --SLGDEMIKAAEANYKRIIAPPMRFPHPVGTPALSKLWSNKVTPESIESMDFSEGVPDL 659
Query: 413 ESYFEEAIQQMDPAAAVEE----QYKKVND--------------SNYAWRALRL 448
+ + + Q +A++E Q K D S+ +WRALR+
Sbjct: 660 NALHGKIMAQEKQLSALKESLLAQGKSDTDVSNDPQVQAIQERKSSLSWRALRV 713
>gi|37779809|gb|AAP23263.1| THO complex 1 [Pan troglodytes]
Length = 200
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 40/77 (51%), Positives = 53/77 (68%)
Query: 407 DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEK 466
++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 1 EYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKS 60
Query: 467 NSEFIENMVKRCVKEKP 483
E++ENMV + KE P
Sbjct: 61 LPEYLENMVIKLAKELP 77
>gi|403166806|ref|XP_003326670.2| hypothetical protein PGTG_07648 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375166722|gb|EFP82251.2| hypothetical protein PGTG_07648 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 962
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 93/205 (45%), Gaps = 38/205 (18%)
Query: 60 NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK 119
+ + L + + L + + + + ST+ + + +F Y+E V K
Sbjct: 97 HLLDLVLAFIDHGLAAPELALGCIEEILEASTVQQAQFIFGYLESRVQRLTVDMLPDRGK 156
Query: 120 NNL-LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--- 175
+ LRM N+LLRRLS+S +TVFCGRIL FL+ FP +E+SG+N+ FN NIT+F
Sbjct: 157 GLIFLRMQNNLLRRLSKSLHTVFCGRILCFLSSVFPIAEKSGVNLRGAFNHGNITKFDQF 216
Query: 176 --------------------GGDEEM------DVSSNETEETDTEEVDKVKIDFN----- 204
G +E+ D + EE ++ +VK +
Sbjct: 217 DSPPSTMLISNNNSTTDEIEAGSKEIEKITPQDSTPRPNEEIKPDDSARVKASKSATDKR 276
Query: 205 -FYKKFWSLQDYFRNPVQCYNKVSW 228
FY +FWSLQ NP + + SW
Sbjct: 277 EFYTRFWSLQALLSNPPELFK--SW 299
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 66/241 (27%), Positives = 111/241 (46%), Gaps = 47/241 (19%)
Query: 260 YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKSRGEGLE--------- 307
+F K+LT LLDL+++D FRR VL QFLI+ QY F+ +++ L
Sbjct: 386 FFPKFLTKSNLLDLEIADPYFRRQVLTQFLIILQYIRGFSPEERAKRASLPKSNRISEIP 445
Query: 308 --LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
++ E WV + V+ + TPP GE F+ ++ IL E W WK C ++
Sbjct: 446 YVYQAKDEAWVLKLSGEVWKTLYATPPHGEIFAMTIQQILIRESSWIMWKAGSCASFEK- 504
Query: 366 LTSITDEDKKDEPDAKK-----------KKTPELTKLWNSKDNLEACKSAERDF-TPSLE 413
S+ D D P K+ TP L+KLW +K L++ ++ + P L+
Sbjct: 505 -ASLDDLDPA--PGYKRITSNPVKFPHPVGTPALSKLWENKVTLDSIENIDPTAGVPDLK 561
Query: 414 S------YFEEAIQQM--------DPAA-AVEEQYKKVND--SNYAWRALRLLSRKCPHF 456
+ + + ++Q+ DP A ++ Q +++ + + +WRALR+ F
Sbjct: 562 TLHLKIQHLDLQLKQLQENLPPDTDPKAETIKSQVQQLQERKTGLSWRALRIAQSSHLQF 621
Query: 457 F 457
F
Sbjct: 622 F 622
>gi|426385300|ref|XP_004059157.1| PREDICTED: THO complex subunit 1-like [Gorilla gorilla gorilla]
Length = 198
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 40/74 (54%), Positives = 51/74 (68%)
Query: 410 PSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSE 469
P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF + E
Sbjct: 2 PTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPE 61
Query: 470 FIENMVKRCVKEKP 483
++ENMV + KE P
Sbjct: 62 YLENMVIKLAKELP 75
>gi|323448287|gb|EGB04188.1| hypothetical protein AURANDRAFT_72585 [Aureococcus anophagefferens]
Length = 601
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/323 (25%), Positives = 133/323 (41%), Gaps = 58/323 (17%)
Query: 181 MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMF--------- 231
M +S + TE E V + D+ Y FW++Q + +P V +
Sbjct: 1 MALSMDCTETATNENV--LAHDYATYSAFWAVQKWLSDPNLALRGVEERRALLANMDACC 58
Query: 232 ------------------TSYAETVLAAFKSYKLDDVQSSLNPSGDY-------FAKYLT 266
T AE V +A K+ +QS+ + SG AKYLT
Sbjct: 59 AALENTAPSEDSVKRARDTWLAERVRSAKKNACGSSIQSA-DISGTMQLGIVTMGAKYLT 117
Query: 267 NQKLLDLQLSDTNFRRYVLLQFLILFQYF---TSTVKSRGEGLELKSDQEKWVKDTTETV 323
N +LL +QL D R ++ Q ++F Y T+ G E S + W V
Sbjct: 118 NSRLLRIQLRDPVLRMQLVTQLYLVFDYLRRETAIGNKPSPGAEFLSKPD-WALPVLLRV 176
Query: 324 YSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE--DKKDEPDAK 381
LI TPP+G + ++ +++ E +W+ WK GC +RP S + +K+
Sbjct: 177 RKLIIATPPNGPERLKSLEEVIEREVNWSTWKRLGCAPYERPSQSKEQQYAEKRKRALQS 236
Query: 382 KKKTPELTKLWNSKDNLEA---------------CKSAERDFTPSLESYFEEAIQQMDPA 426
++ +P + + + L+A C + D PS+E++ E DP
Sbjct: 237 ERYSPPVPVILFKRTKLKAGAKLDARENEGTMKDCCARLADAVPSIEAFLEAYEIAEDPE 296
Query: 427 AAVEEQYKKVNDSNYAWRALRLL 449
++EE+Y +D Y WRALRL+
Sbjct: 297 NSIEEEYHPKHDKVYCWRALRLV 319
>gi|258577873|ref|XP_002543118.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237903384|gb|EEP77785.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 756
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 117/559 (20%), Positives = 211/559 (37%), Gaps = 153/559 (27%)
Query: 34 SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
+A+E + R+ +L+ T I+ I N + + + C L L+ + D
Sbjct: 64 AAVETAFREKFYRLLATTSINEPNFVDIWNLLDIVSIFSDNERCEPGLIFWLVEELLDSQ 123
Query: 91 TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
T+D C ++F Y+E R +NT
Sbjct: 124 TIDGCRKVFDYLESR-----------------------------RERNTAS--------- 145
Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTE---------------- 194
FP ++S +N+ E++ ENIT F DE +S E + D E
Sbjct: 146 --FPLGDKSSVNLRGEYHTENITTF--DELPTSASAEEDAADVEMRDAEPPKALEDSTKL 201
Query: 195 -------------------------EVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSW 228
+ + ID + Y FW LQ F +P + ++ +
Sbjct: 202 ETATNLQTSAEPDGQRRKPSTATPSQTEDASIDMDALYPIFWGLQASFSSPTRLFDPEHF 261
Query: 229 KMFTSYAETVLAAFKSYKLD---------------------DVQSSLNPSGDYFAKYLTN 267
F E+ ++ F+ +D ++ + + KYLT+
Sbjct: 262 ASFKKGLESTISNFQKVSIDLEKRGMTKGSEETSRRGIKRKRTENGSEAANTFNPKYLTS 321
Query: 268 QKLLDLQLSDTNFRRYVLLQFLILFQYFTS------------TVKSRGEGLELKSDQEKW 315
+ L +L+ +D FRR++L+Q LI+ + S T KS L + KW
Sbjct: 322 RDLFELEANDVAFRRHILVQSLIILDFLISLTPRAKAKLADATNKSVLYSYVLSDEDAKW 381
Query: 316 VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD---- 371
++ ++Q DG+ + ++V +L +++W +WK E CP +++ + +
Sbjct: 382 ATQMKSSIAGYLQQG-LDGKFYYRMVDTVLTRDKNWVRWKAEACPAIEKAPIQVQEYLDT 440
Query: 372 --EDKKDEPDAKKKKTP--ELTKLWNSKD----NLEACKSAERDFTPSLESYFE------ 417
K + + + TP L + S+D NL+ + A+R PS +SY
Sbjct: 441 QSGVMKLTTNKRLRATPLGSLDLKFLSEDANLGNLDRLREADRFSNPSPDSYMRGIADDE 500
Query: 418 ---EAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENM 474
E Q D E++ ++ AWR LRL SR LN ++ + + ++ +
Sbjct: 501 FNIEMAQNDD-----EKEEAARGKASKAWRVLRLSSRSK----LNKFDKID-DGKNLKIL 550
Query: 475 VKRCVKEKPSSQISGNGNG 493
+ EKP S++ G G G
Sbjct: 551 FESPPPEKPGSEVEG-GEG 568
>gi|440802190|gb|ELR23123.1| hypothetical protein ACA1_249770 [Acanthamoeba castellanii str.
Neff]
Length = 466
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 74/130 (56%), Gaps = 3/130 (2%)
Query: 62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN 121
I + + L + + S P+ L+ D F T+++C+Q F +E + + S NN
Sbjct: 117 IDIALWLGREKVIESAAPLELVRDFFLSHTIEECQQFFALLEDRADQLSRAIIGDSRANN 176
Query: 122 -LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE 180
LLR +++ +RLSR+ + VF GRIL+F+A FP S++SG+N+ E NV N+T D+E
Sbjct: 177 TLLRALSEVRKRLSRTNDLVFAGRILMFMAYAFPLSDKSGVNLKGEANVANVTTM--DKE 234
Query: 181 MDVSSNETEE 190
+ ET E
Sbjct: 235 PSPALEETPE 244
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 87/170 (51%), Gaps = 16/170 (9%)
Query: 366 LTSITDEDKKDEPDAKKKKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMDP 425
+ ++T DK EP ++TPE + +D + ER SLE + E I++MDP
Sbjct: 225 VANVTTMDK--EPSPALEETPE-----DIRDLQALLQKEERGRMVSLEEWLEPVIEEMDP 277
Query: 426 AAAVEEQYKKVN-DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPS 484
AA+E QYKK+N D Y WRA RL ++ +F + K +E++ K+ + K +
Sbjct: 278 EAAIESQYKKINTDKIYVWRAKRLTFKRSLEYFC----TLAKQDATLEDVAKQLLANKGT 333
Query: 485 SQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEGDADEVMS 534
S +G+ + E + D ++++++E EAK + + D+++S
Sbjct: 334 S----TEDGMVLEDREEKRDADVDDVEDKESVVKIEAKHSEDQEGDQLVS 379
>gi|300122524|emb|CBK23094.2| unnamed protein product [Blastocystis hominis]
Length = 555
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 96/442 (21%), Positives = 176/442 (39%), Gaps = 72/442 (16%)
Query: 69 CMKDMCNSTLPIILLSDTFDMSTLDKCEQLF-----YYVEVN--VNIWKQQTFFMSCKNN 121
C +C+ T P++++ + F+ +++CE ++ Y +++ ++I +++ S
Sbjct: 85 CKNGLCSPTTPLLIMDELFESVPINECEAIWNTFKTYQKDISELLHIDEKRCSLTSLA-- 142
Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
+L++ N LLRR + ++ F G +++FL+ F ERSGLN+ ++FN + +
Sbjct: 143 VLKLVNSLLRRTCQIRDCAFRGSLMMFLSNMFSLEERSGLNMANQFNRHASLRLDPPDVV 202
Query: 182 DVSSNETEETDTEEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVS-WK---MFTSYAE 236
+ + K +++ FY KFW LQ+ N + S W+ Y
Sbjct: 203 KKILCALFDCLMCRDNGSKGNYSEFYGKFWKLQEILCNRDYVTKEASRWEELHQLIHYVT 262
Query: 237 TVLAAFK-----------SYKLDDVQSSLNP-----SGDYFAKYLTNQKLLDLQLSDTNF 280
++ K S +LD + + P S Y + + Q+ D
Sbjct: 263 KRMSEIKELTRAGELEKESLELDKIYLNFQPQEFDDSDKILPLYSSRASVFLTQMRDVKT 322
Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQV 340
RR + +Q + +K +S+ ++W KD + V L++ T P G+ F
Sbjct: 323 RRSIGVQMYCYLMHVRPVLKR-------ESEIKEW-KDVKKDVLRLLQST-PHGDAFCNA 373
Query: 341 VKLILKGEEHWNQWKNEGCPEL--KRPLTS-------------------ITDEDKKD--- 376
+ + E W WK P + K P+ + D+ KD
Sbjct: 374 IVKHVDREVVWEDWKESKFPPMVPKEPIPAHETVEVSLASRRPGDVRNLCRDDTLKDFVL 433
Query: 377 ---EPDAKKKKTPELT------KLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAA 427
E D + + E T + + E C + TP E + ++ MDP
Sbjct: 434 MKREADIQIQSCREETVRREQLTYPSHPEVPENCTTEAMGATPFAEVMARDLVEAMDPEN 493
Query: 428 AVEEQYKKVNDSNYAWRALRLL 449
EE+Y + Y WR LRLL
Sbjct: 494 GFEEEYMPYRNPIYNWRLLRLL 515
>gi|76156452|gb|ABA40374.1| SJCHGC08626 protein [Schistosoma japonicum]
Length = 105
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 39/91 (42%), Positives = 50/91 (54%), Gaps = 3/91 (3%)
Query: 277 DTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQ---TPPD 333
D +FRRY+L+Q LILFQY T+ VK + L DQ WV E V L+ P
Sbjct: 2 DPSFRRYILVQLLILFQYLTTAVKFKTIDQVLSEDQHSWVNQRHEVVLRLLSSNNTNSPS 61
Query: 334 GEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
F V+ IL+ E +WN+WKN+GCP R
Sbjct: 62 DNTFVSTVEHILERESYWNRWKNDGCPSFIR 92
>gi|145506230|ref|XP_001439081.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406254|emb|CAK71684.1| unnamed protein product [Paramecium tetraurelia]
Length = 554
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 88/365 (24%), Positives = 154/365 (42%), Gaps = 73/365 (20%)
Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
LLR+CN +L+RLS + +T G++ +FL + F S ++ NI + G DEE
Sbjct: 140 LLRICNSMLKRLSTTHDTFLRGQVQIFLTQIF-----STIHPSLRKKPINIRDRGFDEEA 194
Query: 182 DVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDY-------FRNPV----------QCYN 224
V N + + ++FYK FW+LQ Y FRN V Q
Sbjct: 195 -VKVNAS------------LPYSFYKNFWTLQKYLNNSELIFRNKVEFDDIGEEAGQVSQ 241
Query: 225 KVS----WKMFTSYAETVLAAFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
VS K + + ++ A + +K + ++ + + F K+LT L QL+D F
Sbjct: 242 NVSEPQRTKNLQAVIKVIMRALQYFKENPIEPEI-ITVKRFPKFLTKYSLFKNQLNDPYF 300
Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQV 340
R+ L Q L+ F + EL++ +++++++ V KQ G+ +
Sbjct: 301 RKLFLNQVLLFI--FVA---------ELENQEKQFIQNLKIEVQ---KQLDELGDKLGEK 346
Query: 341 VKLILKGEEHWNQWKNEG-CPELKRPLTSITDEDKKDEPDAKKKKTPELTKLWNS----- 394
V+ ++ E W W E C + SI +K AKKK+ ++++
Sbjct: 347 VQHLIDTERTWRLWVTEKQCFDDGYEKLSIKASEK-----AKKKRQEIQKRIYDDLKYKQ 401
Query: 395 --------KDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRAL 446
+D K P L Y E Q+D + + + N +YAW+++
Sbjct: 402 TDAFQYILQDQGRPYKIETNPIEPGLNYYLNEVFYQLDQNRKIADNERCKNMKDYAWKSV 461
Query: 447 RLLSR 451
R++S+
Sbjct: 462 RVISK 466
>gi|291001891|ref|XP_002683512.1| hypothetical protein NAEGRDRAFT_61424 [Naegleria gruberi]
gi|284097141|gb|EFC50768.1| hypothetical protein NAEGRDRAFT_61424 [Naegleria gruberi]
Length = 619
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 120/260 (46%), Gaps = 54/260 (20%)
Query: 131 RRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEE 190
+++S+ +++V CGRILLF+A P++ + +N FN E+ + E+ ++ N +
Sbjct: 206 KKVSKIEDSVTCGRILLFVASVLPYNHDACINKGGHFNKEHHSLL---EQQILNDNNEDS 262
Query: 191 TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQC-------YNKV--------SWKMFTSYA 235
++T++ DF Y W LQ + ++ YN SW +F+
Sbjct: 263 SNTQD------DFEIY---WGLQVHIIQILELQSTLASFYNSYPRGNDECPSWSLFSREV 313
Query: 236 ETVLAAFKSYKLDDVQS---------SLNPSG----DYFA--KYLTNQKLLDLQLSDTNF 280
+ +L + ++Q+ S N D F+ K L N+ L DLQL D F
Sbjct: 314 DRILKCLDTRFTTNIQTMKQLKKNNQSTNHESALVQDLFSPIKLLKNRTLFDLQLDDVRF 373
Query: 281 RRYVLLQFLI-LFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQ 339
RR VL+Q L+ +++ + ELK+ +KD T V +++ P+
Sbjct: 374 RRQVLIQLLVTIYRLLEKNSEPSFYLSELKT-----LKDKTLNV---VRKYDPES---VT 422
Query: 340 VVKLILKGEEHWNQWKNEGC 359
+++ IL E HW+ WK+ GC
Sbjct: 423 LLETILDRERHWSNWKSNGC 442
>gi|145515651|ref|XP_001443725.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411114|emb|CAK76328.1| unnamed protein product [Paramecium tetraurelia]
Length = 557
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 85/362 (23%), Positives = 152/362 (41%), Gaps = 67/362 (18%)
Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
LLR+CN +L+RLS + +T G++ +FL + F S ++ NI + G DEE
Sbjct: 140 LLRICNSMLKRLSTTHDTFLRGQVQIFLTQIF-----STIHPSLRKKPINIRDRGFDEE- 193
Query: 182 DVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCY-NKVSW------------ 228
V N + + ++FYK FW+LQ Y N + NKV +
Sbjct: 194 SVKVNAS------------LPYSFYKNFWTLQKYLNNSELIFKNKVEFDDIGEEAGQVTQ 241
Query: 229 --------KMFTSYAETVLAAFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
K + + ++ A + +K + ++ + + F K+LT L QL+D F
Sbjct: 242 GVSEPQRTKNLQAVIKVIMRALQYFKENPLEPEI-ITVKRFPKFLTKYSLFKNQLNDPYF 300
Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQV 340
R+ L Q L+ F + EL++ +++++++ V KQ G+ +
Sbjct: 301 RKLFLNQVLLFI--FVA---------ELENHEKQFIQNLKIEVQ---KQLDELGDKLGEK 346
Query: 341 VKLILKGEEHWNQWKNEG-CPELKRPLTSITDEDKKDEP----------DAKKKKTPELT 389
V+ ++ E W W E C + SI +K + D K K+T
Sbjct: 347 VQHLIDTERTWRLWVTEKQCFDDGYEKLSIKASEKAKKKRQEIQKRIYDDLKYKQTDTFQ 406
Query: 390 KLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLL 449
++ +D K P L Y E Q+D + + + N +YAW+++R++
Sbjct: 407 QIL--QDQGRPYKIETNPIEPGLNYYLNEVFYQLDQNRKIADNERCKNLKDYAWKSVRVI 464
Query: 450 SR 451
S+
Sbjct: 465 SK 466
>gi|403377322|gb|EJY88653.1| THO complex subunit 1 [Oxytricha trifallax]
Length = 766
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 89/210 (42%), Gaps = 35/210 (16%)
Query: 49 KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE----- 103
K +DI+ I + + L +D+ LL D FD ++ D E LF +E
Sbjct: 120 KKMQVDIQPITKFFEFVILLERQDLITKYEIFQLLVDLFDHTSEDSFEPLFGLMEHLLFQ 179
Query: 104 ---VNVNIWKQQTFFMSCKNNLL------------RMCNDLLRRLSRSQNTVFCGRILLF 148
+ + +Q N++ R+C ++R+LS + +T F G++ +
Sbjct: 180 ENLIKPRVQQQAEQGQKSDQNVIDRQQDINLNLISRICKSVIRKLSVTHDTGFRGKVQIL 239
Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEM----DVSSNETE----------ETDTE 194
+A FP SGLN + +FN +N T F EE D+ N + + + +
Sbjct: 240 IANVFPLMHGSGLNRLGKFNTQNETVFETLEETKKSGDIGQNSGKLSLQQTFSFGQANIQ 299
Query: 195 EVDKVKI-DFNFYKKFWSLQDYFRNPVQCY 223
+I D+ YK FW+LQ Y NP +
Sbjct: 300 STKPQQIADYKTYKNFWNLQKYLTNPFNIF 329
>gi|298712962|emb|CBJ26864.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 896
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 64/225 (28%), Positives = 95/225 (42%), Gaps = 35/225 (15%)
Query: 260 YFA-KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKD 318
YF KYLT+ +LL LQL D R VL Q+LIL + V + + D +K
Sbjct: 482 YFGLKYLTSSRLLRLQLRDPTLRLQVLTQWLILAASLKNKVANAKDSPNSVED----LKP 537
Query: 319 TTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEP 378
E L+K TPP GE + +++++L E W QWK+ + K +T+ +E+ + E
Sbjct: 538 RIELAKRLVKATPPRGEEYLSMLQVVLNRETFWTQWKD--VNKCKLAMTAAKEEETQREK 595
Query: 379 D------------AKKKKTPELTKLWNSKDN---------LEACKSAERDFTPSLESYFE 417
D K+ P + + LE S R P+ + + E
Sbjct: 596 DKAAGVVARAPSYPNKRPRPSASVAAGERAKYSPLGAGFPLERMVSKLRAGVPTSQEHLE 655
Query: 418 EAIQQM---DPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLN 459
+ Q+M DP E Y WRALRLL R+ P +
Sbjct: 656 D--QRMCAEDPDTPASELPSA--SPLYTWRALRLLGRENPANLVG 696
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/199 (23%), Positives = 81/199 (40%), Gaps = 58/199 (29%)
Query: 101 YVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSG 160
+VE N + + + L+R+CN L +R+S++ ++ CG++ +FL K F ERS
Sbjct: 198 WVEPNSERLSRLVPLNNGRYPLIRLCNKLAQRMSKTHDSSLCGQMHMFLGKALDFFERSA 257
Query: 161 LNIISEFNVENITEF-----------GGDEEMDVSSNETEETDTE------------EVD 197
+N+ N++N+T F G D E + SS EVD
Sbjct: 258 VNVSGSINLDNVTRFEDEEQFRRAMMGFDGEDEASSGTGAAVAAAPAAKGGGDAMEVEVD 317
Query: 198 ----------------KVK---------IDFNFYKKFWSLQDYFRNPVQC-------YNK 225
K+K + + Y+ FW LQ+Y P+Q K
Sbjct: 318 APAGAAAAAAAGGGATKLKEGANPQEFAVHYPLYRSFWRLQEY---PLQQDKVVKGEETK 374
Query: 226 VSWKMFTSYAETVLAAFKS 244
+W+ + + +L AF++
Sbjct: 375 ATWEALLADVDKILGAFEA 393
>gi|238565549|ref|XP_002385882.1| hypothetical protein MPER_16096 [Moniliophthora perniciosa FA553]
gi|215436172|gb|EEB86812.1| hypothetical protein MPER_16096 [Moniliophthora perniciosa FA553]
Length = 161
Score = 67.8 bits (164), Expect = 1e-08, Method: Composition-based stats.
Identities = 43/119 (36%), Positives = 59/119 (49%), Gaps = 6/119 (5%)
Query: 75 NSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRL 133
+ + P +L D + T+ C +F ++E + Q K LLR NDLLRRL
Sbjct: 26 DPSFPFQILQDLLETQTISSCSHIFTWIESRKDRLTQGMIPQKGKALILLRTLNDLLRRL 85
Query: 134 SRSQN--TVFCGRILLFLAKFFPFSERSGLNIISEFNV---ENITEFGGDEEMDVSSNE 187
S+ T FCGRIL FL+ FP ERSG+N+ E+ E+ G EEM V +
Sbjct: 86 SKMGMGMTGFCGRILCFLSGVFPLGERSGVNLRGEYGPVWEGAHIEYEGGEEMQVDEEK 144
>gi|323448292|gb|EGB04192.1| hypothetical protein AURANDRAFT_67353 [Aureococcus anophagefferens]
Length = 631
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/252 (23%), Positives = 95/252 (37%), Gaps = 65/252 (25%)
Query: 262 AKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLELKS-DQEKW---- 315
AKYLTN +LL +QL D R ++ Q ++F Y ++ L KS + W
Sbjct: 295 AKYLTNSRLLRIQLRDPVLRMQLVTQLYLVFYYLLRESIIGDQPSLGTKSTSKPDWASSA 354
Query: 316 ---VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD- 371
V V L+ TPP+G + + ++K E +W+ WK GC L+ P++ + +
Sbjct: 355 MSRVNHVLARVRKLVVSTPPNGPERLESLDELIKREANWSTWKKNGC-ALQSPISRLQNF 413
Query: 372 ------------------EDKKDEPDAK-----------KKKTPELTKLWNSK------- 395
+ P A + TP++TK +
Sbjct: 414 VFIYLYLCLCLCLYLYLYLYRAASPGAANFIFHSRQEDYRTDTPKMTKRSGERYTPPTYA 473
Query: 396 ------------------DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
+E C D PS+E + DP ++E++Y N
Sbjct: 474 VLVKRAKLKADADSIKNEGTIENCCVKLADAVPSIEEFLRAYEVAQDPENSIEDEYHPKN 533
Query: 438 DSNYAWRALRLL 449
D Y WRALRL+
Sbjct: 534 DKVYCWRALRLV 545
>gi|340500147|gb|EGR27045.1| Tho complex protein, putative [Ichthyophthirius multifiliis]
Length = 411
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 57/263 (21%), Positives = 116/263 (44%), Gaps = 41/263 (15%)
Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMD 182
L++CN + ++LS++Q+ G++ +FL+K P SGL S+ E T+ ++E
Sbjct: 139 LKICNGVFKKLSKTQDIQLRGKVHIFLSKILPLCHASGLK--SKICTEISTQIENEKEAL 196
Query: 183 VSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKM------------ 230
+ + + ++ + ++++FY KFWSLQ Y +NP+ + +M
Sbjct: 197 LQMENNMDIEEQKGNFPFLNYSFYSKFWSLQKYLQNPLNLSKSNNGEMEIEGINDEDVEE 256
Query: 231 ---------------------FTSYAETVLAAFKSYKLDDVQSSLNPSGDYFAKYLTNQK 269
+ E V ++ L+ ++S+ P + KYLT
Sbjct: 257 GEEKPDQEGIKTLESKKIKGLCAAICEVVEIFNQNPILEKGETSIKP----YIKYLTAYS 312
Query: 270 LLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRG-EGLELKSDQEKWVKDTTETVYSLIK 328
L+ +Q +D FR+ L+Q L+ F + K + +E + + K +D + + L+
Sbjct: 313 LMKIQFNDQFFRKIWLIQALLFFYTLQNLNKEYDIKNIENEKNTIKNAEDKVKKAFKLLH 372
Query: 329 QTPPDGEHFSQVVKLILKGEEHW 351
Q + + Q + + E++W
Sbjct: 373 QNLKEDISYEQFDQFFM-DEKNW 394
>gi|118362386|ref|XP_001014420.1| hypothetical protein TTHERM_00522420 [Tetrahymena thermophila]
gi|89296187|gb|EAR94175.1| hypothetical protein TTHERM_00522420 [Tetrahymena thermophila
SB210]
Length = 1224
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 59/101 (58%), Gaps = 4/101 (3%)
Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLN--IISEF--NVENITEFGGD 178
L++CN + ++L+R+ +T G + +FL+K P SGL I +E +EN ++ +
Sbjct: 146 LKICNVIFKKLTRTYDTRLRGELHIFLSKILPICHASGLKSKICTEITAQLENESDIAKN 205
Query: 179 EEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP 219
+ + + E EE ++ + + +FYKKFW+LQ YF+NP
Sbjct: 206 QNIQGNQMEIEEDKSQPDNITYSNKDFYKKFWTLQKYFQNP 246
>gi|328908893|gb|AEB61114.1| THO complex subunit 1-like protein, partial [Equus caballus]
Length = 78
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/52 (50%), Positives = 37/52 (71%)
Query: 275 LSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSL 326
LSD+NFRR++LLQ+LILFQY VK + L +Q W++DTT++VY +
Sbjct: 1 LSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQV 52
>gi|443925429|gb|ELU44267.1| efThoc1 domain-containing protein [Rhizoctonia solani AG-1 IA]
Length = 854
Score = 55.8 bits (133), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 51/209 (24%), Positives = 78/209 (37%), Gaps = 46/209 (22%)
Query: 62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-VNVNIWKQQTFFMSCKN 120
+ + +E + CN + +L + D T+D C +F +VE + + K
Sbjct: 86 LDVALEYTELEACNWQFVLEVLEEVLDGHTIDSCVTIFGWVEDRSSRLLKGMVPSKGKAL 145
Query: 121 NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV----------- 169
+LR N L R +R T F GRIL F + F +ERSG+N+ E
Sbjct: 146 VVLRTLNALKARAAR---TGFAGRILAFQSGIFDITERSGVNLQGELGSTWNGVAIPPPV 202
Query: 170 ---ENITEFG--------------------GDEEMDVSSNETEETDTE--------EVDK 198
TE G GDE+ + D++ +VD
Sbjct: 203 GEKAEATESGVKLEGSDAMEGVESTDQSAAGDEDKKSVAENKGSADSKPASRAADAKVDP 262
Query: 199 VKIDFNFYKKFWSLQDYFRNPVQCYNKVS 227
+ N Y FW LQ +F NP+ +S
Sbjct: 263 AEQRRNLYNTFWQLQLFFANPLVFAEGIS 291
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 19/41 (46%), Positives = 32/41 (78%)
Query: 260 YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
+FAK+LT+ LLDL+L+DTNFRR +L Q +I+ ++ + ++
Sbjct: 396 FFAKFLTSPDLLDLELADTNFRRQILFQLMIMLKHLSLAIQ 436
>gi|387198729|gb|AFJ68866.1| THO complex subunit 1, partial [Nannochloropsis gaditana CCMP526]
Length = 244
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 67/124 (54%), Gaps = 15/124 (12%)
Query: 82 LLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT-----FFMSCKNNL--LRMCNDLLRRLS 134
LL D F++ +++ + + VE K+++ F + K+NL L++CN LLR+LS
Sbjct: 124 LLEDIFELLGVEQIQNFWPEVE------KRESALFGLFKTARKSNLTMLKLCNSLLRKLS 177
Query: 135 RSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTE 194
+ NT CGRI+++L++ + S+ S +N N+ N T F +E D + + E +
Sbjct: 178 KGHNTAICGRIMMYLSRNWELSDPSAINKGGMANLSNKTAF--EEPEDFARQQAREAEKT 235
Query: 195 EVDK 198
D+
Sbjct: 236 RRDE 239
>gi|412985942|emb|CCO17142.1| predicted protein [Bathycoccus prasinos]
Length = 1123
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 68/157 (43%), Gaps = 23/157 (14%)
Query: 316 VKDTTETVYSLIKQT---PPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
K+ E V ++++T P G+ + + EH++ W + GCP +R + E
Sbjct: 740 AKECFEKVLDVVEKTDASPEQGKKLRIFLVATMYSTEHYDTWVDMGCPSFEREEIDVVAE 799
Query: 373 D----------------KKDEPDAKKKKTPELTKLWNSKDNLEACKSAERDFTPSLESYF 416
K E + + PE+ ++WN + EA + S +
Sbjct: 800 QKEYERKEKEKEDAENAKYPEGYVRFENNPEMERIWNLYPSYEAAMTDPERNRYSKPEFM 859
Query: 417 EEAIQ----QMDPAAAVEEQYKKVNDSNYAWRALRLL 449
E +Q +MDP A +EE+YK ND+ Y W+A RL+
Sbjct: 860 NEFLQVVYDEMDPEAQIEEEYKTKNDTMYRWKAFRLI 896
>gi|422294621|gb|EKU21921.1| THO complex subunit 1, partial [Nannochloropsis gaditana CCMP526]
Length = 230
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/101 (32%), Positives = 58/101 (57%), Gaps = 13/101 (12%)
Query: 82 LLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT-----FFMSCKNNL--LRMCNDLLRRLS 134
LL D F++ +++ + + VE K+++ F + K+NL L++CN LLR+LS
Sbjct: 124 LLEDIFELLGVEQIQNFWPEVE------KRESALFGLFKTARKSNLTMLKLCNSLLRKLS 177
Query: 135 RSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF 175
+ NT CGRI+++L++ + S+ S +N N+ N T F
Sbjct: 178 KGHNTAICGRIMMYLSRNWELSDPSAINKGGMANLSNKTAF 218
>gi|426385316|ref|XP_004059165.1| PREDICTED: THO complex subunit 1-like, partial [Gorilla gorilla
gorilla]
Length = 80
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 42/78 (53%), Gaps = 14/78 (17%)
Query: 340 VVKLILKGEEHWNQWKNEGCPELKRPLTSITDED----KKDEPDAKKKKTP--------- 386
+V+ IL EE+WN WKNEGCP + TS T K+ P+ K P
Sbjct: 1 MVEHILNTEENWNSWKNEGCPSFVKERTSDTKPTRIIRKRTAPEDFLGKGPSKKILMGNE 60
Query: 387 ELTKLWN-SKDNLEACKS 403
ELT+LWN DN+EACKS
Sbjct: 61 ELTRLWNLCPDNMEACKS 78
>gi|159482536|ref|XP_001699325.1| hypothetical protein CHLREDRAFT_178080 [Chlamydomonas reinhardtii]
gi|158272961|gb|EDO98755.1| predicted protein [Chlamydomonas reinhardtii]
Length = 1302
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/48 (41%), Positives = 34/48 (70%)
Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNII 164
K LLRM N L RRLS++ + CGR+L+F+++ + +++SGLN++
Sbjct: 165 GAKLALLRMSNMLFRRLSKAYDVRLCGRLLIFISRCYELNDKSGLNVL 212
>gi|341883433|gb|EGT39368.1| hypothetical protein CAEBREN_04262 [Caenorhabditis brenneri]
Length = 90
Score = 48.5 bits (114), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 46/81 (56%), Gaps = 7/81 (8%)
Query: 110 KQQTFFMSCKNNLLRMCNDLLRRLSRSQNTV---FCGRILLFLAKFFPF-SERSGLNIIS 165
K+ +F +C+NN+LR+CNDL RR S ++ F I ++ F F E+SG+N +
Sbjct: 9 KKPSFIETCQNNILRLCNDLFRRFSSKCSSCPGDFHKEIYEKASELFRFLVEKSGVNFMG 68
Query: 166 EFNVENITEFGGDEEMDVSSN 186
FN NIT + EE D + N
Sbjct: 69 HFNTRNITNY---EETDPALN 86
>gi|422294622|gb|EKU21922.1| tho complex subunit, partial [Nannochloropsis gaditana CCMP526]
Length = 160
Score = 45.1 bits (105), Expect = 0.11, Method: Composition-based stats.
Identities = 33/112 (29%), Positives = 49/112 (43%), Gaps = 9/112 (8%)
Query: 263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY-------FTSTVKSRGEGLELKSDQEKW 315
KYLTN +L LQL D R VL Q LI+ Y ++ + S +K
Sbjct: 15 KYLTNSRLFRLQLHDPLIRTQVLTQLLIVLNYASAATALAVASPPASSSPAPNPSPPKKG 74
Query: 316 VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN-EGC-PELKRP 365
D + LI+ T G + V+ +L E+ W WK + C P +++P
Sbjct: 75 ANDLVKHARRLIRNTSAQGAVYLGVLDTVLHREQAWLSWKRLQRCEPRIEKP 126
>gi|387219973|gb|AFJ69695.1| tho complex subunit, partial [Nannochloropsis gaditana CCMP526]
Length = 153
Score = 45.1 bits (105), Expect = 0.11, Method: Composition-based stats.
Identities = 33/112 (29%), Positives = 49/112 (43%), Gaps = 9/112 (8%)
Query: 263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY-------FTSTVKSRGEGLELKSDQEKW 315
KYLTN +L LQL D R VL Q LI+ Y ++ + S +K
Sbjct: 8 KYLTNSRLFRLQLHDPLIRTQVLTQLLIVLNYASAATALAVASPPASSSPAPNPSPPKKG 67
Query: 316 VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN-EGC-PELKRP 365
D + LI+ T G + V+ +L E+ W WK + C P +++P
Sbjct: 68 ANDLVKHARRLIRNTSAQGAVYLGVLDTVLHREQAWLSWKRLQRCEPRIEKP 119
>gi|167518301|ref|XP_001743491.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778590|gb|EDQ92205.1| predicted protein [Monosiga brevicollis MX1]
Length = 847
Score = 40.0 bits (92), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 23/74 (31%), Positives = 32/74 (43%)
Query: 158 RSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFR 217
+SG+N+ S FNV N T + +E + E + +D YK+FW LQ F
Sbjct: 8 KSGVNLNSSFNVGNATSYDESDEPIPHGASEAALASAEGGEQALDKKLYKQFWQLQAAFC 67
Query: 218 NPVQCYNKVSWKMF 231
NP W F
Sbjct: 68 NPAILGAGSDWLQF 81
>gi|332291219|ref|YP_004429828.1| pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase
[Krokinobacter sp. 4H-3-7-5]
gi|332169305|gb|AEE18560.1| pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase
[Krokinobacter sp. 4H-3-7-5]
Length = 562
Score = 38.9 bits (89), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 50/113 (44%), Gaps = 8/113 (7%)
Query: 364 RPLTSITDEDKKDEPDAKK---KKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAI 420
+P T+ T P AKK +K L+++ S +N KS +FTPS +
Sbjct: 269 KPATNTTGGRIFVSPLAKKIADEKGINLSQVKGSGENGRIVKSDVENFTPSASQSSGAGV 328
Query: 421 QQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK---CPHFFLNATPNVEKNSEF 470
QQ A EE ++++ +S R L + PH++LN N+E F
Sbjct: 329 QQF--VATGEESFEEIENSQMRKAIARGLGKSKFTAPHYYLNVEFNMENMMSF 379
>gi|441501414|ref|ZP_20983527.1| Dihydrolipoamide acetyltransferase component of pyruvate
dehydrogenase complex [Fulvivirga imtechensis AK7]
gi|441434811|gb|ELR68242.1| Dihydrolipoamide acetyltransferase component of pyruvate
dehydrogenase complex [Fulvivirga imtechensis AK7]
Length = 552
Score = 38.5 bits (88), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 71/301 (23%), Positives = 118/301 (39%), Gaps = 59/301 (19%)
Query: 190 ETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD 249
E DTE VD ++ + + KM + E +A++ K D
Sbjct: 116 EEDTESVDASDVNASLI-------------------LMPKMSDTMTEGTIASWLKKKGDK 156
Query: 250 VQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQ--FLILFQYFTSTVKSRGEGLE 307
VQ SGD A+ T++ ++L+ + YV ++ + + + +G E
Sbjct: 157 VQ-----SGDILAEVETDKATMELEAYEDGTLLYVGVEEGASVPVDGVIAIIGEKGADYE 211
Query: 308 --LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
LK+ Q K T E K T P K E+ Q P +
Sbjct: 212 KLLKAHQGKKKAATGEDKKKEDKTTAPQ------------KAEKQEEQ------PAASQT 253
Query: 366 LTSITDEDK-KDEPDAKK---KKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQ 421
S+TD + K P AKK K +++K+ + DN K ++TP+ ES E+A +
Sbjct: 254 APSVTDGGRVKASPLAKKMAEDKGYDISKIRGTGDNGRIIKRDIEEYTPAAES-VEKAAE 312
Query: 422 QMD-----PAAAVEEQYKKVNDSNYAWRALRLLSRK---CPHFFLNATPNVEKNSEFIEN 473
+ P EE Y++V+ S + LS PHF++ N++K E ++
Sbjct: 313 EKGTTFHVPQVVGEESYEEVSVSQMRKTIGKRLSESKFTSPHFYITMEINMDKAIEARKS 372
Query: 474 M 474
M
Sbjct: 373 M 373
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.132 0.390
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,926,752,692
Number of Sequences: 23463169
Number of extensions: 386626281
Number of successful extensions: 1708918
Number of sequences better than 100.0: 752
Number of HSP's better than 100.0 without gapping: 353
Number of HSP's successfully gapped in prelim test: 399
Number of HSP's that attempted gapping in prelim test: 1702755
Number of HSP's gapped (non-prelim): 3970
length of query: 555
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 407
effective length of database: 8,886,646,355
effective search space: 3616865066485
effective search space used: 3616865066485
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 80 (35.4 bits)