BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy10508
         (555 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|322794008|gb|EFZ17246.1| hypothetical protein SINV_10936 [Solenopsis invicta]
          Length = 653

 Score =  560 bits (1442), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 276/546 (50%), Positives = 379/546 (69%), Gaps = 32/546 (5%)

Query: 3   DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLK-LIKTPDIDIKVIENY 61
           D  Q +F  S +++   K  + +   + D+K+A++Q++R  LL  L++    +++ +E Y
Sbjct: 13  DYLQQHFKLSDVSLFQKK-CIERYANESDRKAAVDQALRDTLLVVLMENASGNVRSLETY 71

Query: 62  ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN 121
           I+ C+ELC KD+  +++P++LL D FD  TLD+CEQLF +VE NVN+WK++ FF +CKNN
Sbjct: 72  ITFCIELCRKDLATASMPVMLLGDIFDSMTLDRCEQLFTFVENNVNVWKEELFFTACKNN 131

Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
           LLRMCNDLLRRLSRSQ TVFCGRILLFLAKFFPFSERSGLNI+SEFN+EN TEFG ++  
Sbjct: 132 LLRMCNDLLRRLSRSQQTVFCGRILLFLAKFFPFSERSGLNIVSEFNLENHTEFGVEKAE 191

Query: 182 DVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAA 241
           +    +  + D +   K+ ID+N Y+KFW+LQD+FRNP QCYNK+ WK+F+++A  VL+A
Sbjct: 192 EDDLEQITKDDDKLESKIPIDYNLYRKFWALQDFFRNPNQCYNKMHWKVFSAHASNVLSA 251

Query: 242 FKSYKLDDVQS-------------SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQF 288
           F S+KL++ +S               +    YFAKYLTNQKLL+LQLSD+NFRRYVLLQF
Sbjct: 252 FASFKLEEQRSYPTTCMKMDTSTEGSHKETHYFAKYLTNQKLLELQLSDSNFRRYVLLQF 311

Query: 289 LILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
           LILFQY  STVK + E  ELK DQ +WVK TTE VY+L+ +TPPDG  F++ VK ILK E
Sbjct: 312 LILFQYLNSTVKFKAETHELKPDQVEWVKTTTEQVYTLLTETPPDGPMFAETVKNILKRE 371

Query: 349 EHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKK----------------KTPELTKLW 392
           EHWN WKNEGCP  KRP + +  ++ +     K++                  PELTKLW
Sbjct: 372 EHWNAWKNEGCPPFKRPASEVGGDEPRKIRKPKRRIGDMIKDAQAIGKYHMGNPELTKLW 431

Query: 393 N-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
           N   +NLEACKS +RDF PSLE+YFE+AI ++DPAA V+++YKKVND N+ WRALRLL+R
Sbjct: 432 NLCPNNLEACKSKDRDFLPSLETYFEDAIMELDPAAMVDDKYKKVNDGNFGWRALRLLAR 491

Query: 452 KCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQ 511
           + PHFF++    + K  E++E M+K+  K++P +Q        +  P+E      +E++ 
Sbjct: 492 RSPHFFVHGNNPINKLPEYLEAMIKKIAKDRPQTQSDTKLETEETPPSESTDQEFNEDVL 551

Query: 512 EEEKEE 517
           ++E E+
Sbjct: 552 KQESEQ 557


>gi|332017814|gb|EGI58475.1| THO complex subunit 1 [Acromyrmex echinatior]
          Length = 653

 Score =  558 bits (1439), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 281/557 (50%), Positives = 383/557 (68%), Gaps = 37/557 (6%)

Query: 29  DYDKKSALEQSIRQYLLK-LIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTF 87
           D D+K+A++Q++R  LL  L +    +++ +E+YI+ C+ELC KD+  +++P++LL D F
Sbjct: 38  DSDRKAAIDQALRDTLLVVLTENASGNVRSLESYITFCIELCRKDLATASMPVMLLGDIF 97

Query: 88  DMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
           D  TLD+CEQLF +VE NV +WK++ FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRILL
Sbjct: 98  DSMTLDRCEQLFTFVENNVTVWKEELFFTACKNNLLRMCNDLLRRLSRSQQTVFCGRILL 157

Query: 148 FLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYK 207
           FLAKFFPFSERSGLNI+SEFN+EN TEFG ++  +    +  + D +   K+ ID+N Y+
Sbjct: 158 FLAKFFPFSERSGLNIVSEFNLENQTEFGLEKAEEDDLEQITKDDDKSESKIPIDYNLYR 217

Query: 208 KFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQS----------SLNPS 257
           KFW+LQD+FRNP QCYNK+ WK+F+++A  VL+AF S+KL++ +S          S+  S
Sbjct: 218 KFWALQDFFRNPNQCYNKMHWKVFSAHASNVLSAFASFKLEEQRSYPSTCIKMDTSMEDS 277

Query: 258 G---DYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEK 314
                YFAKYLTNQKLL+LQLSD+NFRRYVLLQFLILFQY  STVK + E  ELK DQ +
Sbjct: 278 HKETHYFAKYLTNQKLLELQLSDSNFRRYVLLQFLILFQYLNSTVKFKAETHELKPDQVE 337

Query: 315 WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDK 374
           WVK  TE VY+L+ +TPPDG  F++ VK ILK EEHWN WKNEGCP  KRP + ++DE+ 
Sbjct: 338 WVKTITEQVYTLLTETPPDGPTFAETVKNILKREEHWNTWKNEGCPPFKRPASDVSDEEP 397

Query: 375 --------------KDEPDAKK--KKTPELTKLWN-SKDNLEACKSAERDFTPSLESYFE 417
                         +D     K     PELTKLWN   +NLEACKS +RDF PSLE+YFE
Sbjct: 398 RKPRRPKRRIGDVIRDAQTVGKYHMGNPELTKLWNLCPNNLEACKSKDRDFLPSLETYFE 457

Query: 418 EAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKR 477
           +AI ++DPAA V+++YKKVND N+ WRALRLL+R+ PHFF++    + K  E++E M+K+
Sbjct: 458 DAIMELDPAAMVDDKYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLPEYLETMIKK 517

Query: 478 CVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEE------DWEAKADPEGDADE 531
             K++P +Q        +  P+E      +E++ ++E E+      D +A+   +   + 
Sbjct: 518 IAKDRPQTQSDTKLETEETPPSESTDQEFNEDVLKQESEQVDAESADIKARKLNKVTPEM 577

Query: 532 VMSVEYCYQRNWHKQAV 548
           V  +    + +W K A+
Sbjct: 578 VAKLSESLKNDWKKLAI 594


>gi|383852527|ref|XP_003701778.1| PREDICTED: THO complex subunit 1-like [Megachile rotundata]
          Length = 654

 Score =  558 bits (1437), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 279/538 (51%), Positives = 372/538 (69%), Gaps = 42/538 (7%)

Query: 27  TCDYDKKSALEQSIRQYLLKLI-KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
           T D D+K+A++Q++R  LL ++ +    +++++E YI+ C+ELC +D+  +++P++LL D
Sbjct: 36  TNDGDRKAAIDQALRDTLLVILSENAANNVQLLETYITFCIELCKRDLATASMPVMLLGD 95

Query: 86  TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
            FD  TLD+CEQLF +VE NV +WK+  FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRI
Sbjct: 96  IFDSMTLDRCEQLFTFVENNVVVWKEDLFFSACKNNLLRMCNDLLRRLSRSQQTVFCGRI 155

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNF 205
           LLFLAKFFPFSERSGLNI+SEFN+EN TEFG ++       +  + + +   KV ID+N 
Sbjct: 156 LLFLAKFFPFSERSGLNIVSEFNLENFTEFGSEKSEGNDLEQVTKDEDKSETKVPIDYNL 215

Query: 206 YKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSLN 255
           Y+KFW+LQD+FRNP QCY+K+ WK+F+++A +VL+AF S+KL++          + SS+ 
Sbjct: 216 YRKFWALQDFFRNPNQCYSKMYWKVFSAHASSVLSAFSSFKLEEQRTYPSTSIKLDSSME 275

Query: 256 PSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQ 312
            S     YFAKYLTNQKLL+LQLSD+NFRRYVLLQFLILFQY  STVK + E  ELK DQ
Sbjct: 276 GSHKETPYFAKYLTNQKLLELQLSDSNFRRYVLLQFLILFQYLNSTVKFKAETHELKPDQ 335

Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
             WVK TTE VY+L+ +TPPDG  F++ VK ILK EEHWN WKNEGCP  K+P     D+
Sbjct: 336 VDWVKATTEQVYALLTETPPDGPAFAETVKNILKREEHWNAWKNEGCPPFKKP-----DQ 390

Query: 373 DKKDEPDAKKKK----------------------TPELTKLWN-SKDNLEACKSAERDFT 409
           D  DE + +K K                       PELTKLWN   +NLEACKS +RDF 
Sbjct: 391 DSIDEEEVRKPKRPRHRIGDVIREAQAVGKYHMGNPELTKLWNLCPNNLEACKSKDRDFL 450

Query: 410 PSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSE 469
           PSLE+YFEEAI Q+DP+A V+++YKKVND N+ WRALRLL+R+ PHFF++    + K  E
Sbjct: 451 PSLETYFEEAIMQLDPSAMVDDEYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLPE 510

Query: 470 FIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEG 527
           ++E M+K+  K++P +Q        +  P E      SE++ ++E E+     AD + 
Sbjct: 511 YLETMIKKIAKDRPQTQSDTKLETEETPPPEANDQEFSEDVLKQESEQVESENADAKA 568


>gi|350398011|ref|XP_003485060.1| PREDICTED: THO complex subunit 1-like [Bombus impatiens]
          Length = 655

 Score =  556 bits (1434), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 294/586 (50%), Positives = 391/586 (66%), Gaps = 42/586 (7%)

Query: 3   DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLI-KTPDIDIKVIENY 61
           D  Q  F  S + +   K  + + T D D+K+A++Q++R  LL ++ +    ++ ++E Y
Sbjct: 13  DYLQQCFKLSDIQIFQKK-CVERCTNDSDRKAAIDQALRDTLLVILSENLSSNVPLLETY 71

Query: 62  ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN 121
           I+ C+ELC KD+  +++P++LL D FD  TLD+CEQLF +VE NV +WK+  FF +CKNN
Sbjct: 72  ITFCIELCRKDLATASMPVMLLGDIFDSMTLDRCEQLFTFVENNVVVWKEDLFFSACKNN 131

Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-E 180
           LLRMCNDLLRRLSRSQ TVFCGRILLFLAKFFPFSERSGLNI+SEFN+EN TEFG ++ E
Sbjct: 132 LLRMCNDLLRRLSRSQQTVFCGRILLFLAKFFPFSERSGLNIVSEFNLENHTEFGSEKSE 191

Query: 181 MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
            D   + T++ D  E  KV ID+N Y+KFW+LQD+FRNP Q Y K+ WK+F+++A +VL+
Sbjct: 192 GDDLEHITKDDDKSET-KVPIDYNLYRKFWALQDFFRNPNQLYLKMHWKIFSTHASSVLS 250

Query: 241 AFKSYKLDD----------VQSSLNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQ 287
           AF S+KL++          V+SS+  S     YFAKYLTNQKLL+LQLSD+NFRRYVLLQ
Sbjct: 251 AFSSFKLEEQRNYPSTCVKVESSMEGSHKETPYFAKYLTNQKLLELQLSDSNFRRYVLLQ 310

Query: 288 FLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKG 347
           FLILFQY  STVK + E  ELK DQ +WVK TTE VY+L+ +TPPDG  F++ VK ILK 
Sbjct: 311 FLILFQYLNSTVKFKAETHELKPDQVEWVKGTTEQVYALLTETPPDGPTFAETVKNILKR 370

Query: 348 EEHWNQWKNEGCPELKRPLT-SITDEDKKDEPDAKKKK-----------------TPELT 389
           EEHWN WKNEGCP  KRP   S  D ++  +P   +++                  PELT
Sbjct: 371 EEHWNAWKNEGCPPFKRPTPDSSADIEESKKPKRPRRRIGDVIRDAQAVGKYHMGNPELT 430

Query: 390 KLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRL 448
           KLWN   +NLEACKS ERDF PSLE+YFEEAI Q+DP A VE++YKKVND N+ WRALRL
Sbjct: 431 KLWNLCPNNLEACKSKERDFLPSLETYFEEAIMQLDPNAMVEDEYKKVNDGNFGWRALRL 490

Query: 449 LSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSE 508
           L+R+ PHFF++    + K  E++E M+K+  K++P +Q        +  P E      +E
Sbjct: 491 LARRSPHFFVHGNYPINKLPEYLETMIKKIAKDRPQTQSDIKLETEETPPPEANDQEFNE 550

Query: 509 EIQEEEKEEDWEAKADPEG------DADEVMSVEYCYQRNWHKQAV 548
           ++ ++E E+      D +G        D V  +    + +W K A 
Sbjct: 551 DVLKQESEQVEAENTDTKGRKLTKVTPDMVAKLSEILKNDWKKLAT 596


>gi|307174438|gb|EFN64939.1| THO complex subunit 1 [Camponotus floridanus]
          Length = 654

 Score =  555 bits (1430), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 283/527 (53%), Positives = 370/527 (70%), Gaps = 35/527 (6%)

Query: 29  DYDKKSALEQSIRQYLLKLIKTPDID-IKVIENYISLCVELCMKDMCNSTLPIILLSDTF 87
           D D+K+A++Q++R  LL ++     D +  +E YI+ C+ELC KD+  +++P++LL D F
Sbjct: 38  DGDRKAAIDQALRDTLLVILTENASDNVHSLEMYITFCIELCRKDLATASMPVMLLGDIF 97

Query: 88  DMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
           D  TLDKCEQLF +VE NV IWK+  FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRILL
Sbjct: 98  DSMTLDKCEQLFTFVENNVTIWKEDLFFTACKNNLLRMCNDLLRRLSRSQQTVFCGRILL 157

Query: 148 FLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEETDTEEVDKVKIDFNFY 206
           FLAKFFPFSERSGLNI+SEFN+EN TEFG ++ E D     T++ D  E +K+ ID+N Y
Sbjct: 158 FLAKFFPFSERSGLNIVSEFNLENHTEFGSEKVEGDDLEQITKDDDKSE-NKIPIDYNLY 216

Query: 207 KKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSLNP 256
           +KFW+LQD+FRNP QCY KV WK+F+++A  VL+AF S+KL+D          + SS+  
Sbjct: 217 RKFWALQDFFRNPNQCYVKVHWKVFSAHASNVLSAFASFKLEDQRSYPTTCINMDSSMEG 276

Query: 257 S---GDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQE 313
           S     YFAKYLTNQKLL+LQLSD+NFRRYVLLQFLI+FQY  STVK + E  ELK DQ 
Sbjct: 277 SYKETHYFAKYLTNQKLLELQLSDSNFRRYVLLQFLIVFQYLNSTVKFKAETHELKPDQV 336

Query: 314 KWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDED 373
           +WVK TTE VY+L+ +TPPDG  F++ VK ILK EEHWN WKNEGCP  KRP + I D D
Sbjct: 337 EWVKTTTEQVYALLTETPPDGPAFAETVKNILKREEHWNAWKNEGCPPFKRPASDIIDGD 396

Query: 374 K----------------KDEPDAKKKKTP--ELTKLWN-SKDNLEACKSAERDFTPSLES 414
           +                +D     K      ELTKLWN   +NLEACKS +RDF PSLE+
Sbjct: 397 EEPRKPKRPKRRIGDVIRDAQAVGKYHMGNLELTKLWNLCPNNLEACKSKDRDFLPSLET 456

Query: 415 YFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENM 474
           YFE+AI ++DPAA V+++YKKVND N+ WRALRLL+R+ PHFF++    + K  E++E M
Sbjct: 457 YFEDAIMELDPAAMVDDKYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLPEYLETM 516

Query: 475 VKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEA 521
           +K+  K++P +Q        +  P+E      +E++ +++ E+D E+
Sbjct: 517 IKKIAKDRPQTQSDTKLETEETPPSESNDQEFNEDVLKQDSEQDSES 563


>gi|328784514|ref|XP_393145.3| PREDICTED: THO complex subunit 1-like isoform 1 [Apis mellifera]
          Length = 654

 Score =  554 bits (1428), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 278/492 (56%), Positives = 357/492 (72%), Gaps = 35/492 (7%)

Query: 27  TCDYDKKSALEQSIRQYLLKLIKTPDID-IKVIENYISLCVELCMKDMCNSTLPIILLSD 85
           T D D+K+A++Q++R  LL ++    ++ ++++E YI+ C+ELC KD+  +++P++LL D
Sbjct: 36  TNDSDRKAAIDQALRDTLLVILSENALNNVQLLETYITFCIELCRKDLATASMPVMLLGD 95

Query: 86  TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
            FD  TLD+CEQLF +VE NV +WK+  FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRI
Sbjct: 96  IFDSMTLDRCEQLFTFVENNVVVWKEDLFFSACKNNLLRMCNDLLRRLSRSQQTVFCGRI 155

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEETDTEEVDKVKIDFN 204
           LLFLAKFFPFSERSGLNI+SEFN+EN TEFG ++ E D     T++ D  E  KV ID+N
Sbjct: 156 LLFLAKFFPFSERSGLNIVSEFNLENHTEFGSEKSEGDDLEQITKDDDKSET-KVPIDYN 214

Query: 205 FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSL 254
            Y+KFW+LQD+FRNP QCY K+ WK+F+++A +VL+AF S+KL++          V SS+
Sbjct: 215 LYRKFWALQDFFRNPNQCYQKMHWKVFSAHASSVLSAFSSFKLEEQRNYPSTCIKVDSSI 274

Query: 255 NPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSD 311
             S     YFAKYLTNQKLL+LQLSD+NFRRYVLLQFLILFQY  STVK + E  ELK D
Sbjct: 275 EGSHKETPYFAKYLTNQKLLELQLSDSNFRRYVLLQFLILFQYLNSTVKFKAETHELKPD 334

Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLT-SIT 370
           Q  WVK TTE VY+L+ +TPPDG  F++ VK ILK EEHWN WKNEGCP  K+P   S T
Sbjct: 335 QVDWVKATTEQVYALLTETPPDGPTFAETVKNILKREEHWNAWKNEGCPAFKKPAPDSST 394

Query: 371 DEDKKDEPDAKKKK-----------------TPELTKLWN-SKDNLEACKSAERDFTPSL 412
           D +   +P   K++                  PELTKLWN   +NLEACKS ERDF PSL
Sbjct: 395 DIEDLRKPKRPKRRIGDVIRDAQTVGKYHMGNPELTKLWNLCPNNLEACKSKERDFLPSL 454

Query: 413 ESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIE 472
           E+YFEEAI Q+DP A VE++YKKVND N+ WRALRLL+R+ PHFF++    + K  E++E
Sbjct: 455 ETYFEEAIMQLDPNAMVEDEYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLPEYLE 514

Query: 473 NMVKRCVKEKPS 484
            M+K+  K++P+
Sbjct: 515 TMIKKIAKDRPT 526


>gi|380029639|ref|XP_003698475.1| PREDICTED: LOW QUALITY PROTEIN: THO complex subunit 1-like [Apis
           florea]
          Length = 658

 Score =  554 bits (1427), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 274/491 (55%), Positives = 354/491 (72%), Gaps = 33/491 (6%)

Query: 27  TCDYDKKSALEQSIRQYLLKLIKTPDI-DIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
           T D D+K+A++Q++R  LL ++    + +++++E YI+ C+ELC KD+  +++P++LL D
Sbjct: 36  TNDSDRKAAIDQALRDTLLVILSENALSNVQLLETYITFCIELCRKDLATASMPVMLLGD 95

Query: 86  TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
            FD  TLD+CEQLF +VE NV +WK+  FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRI
Sbjct: 96  IFDSMTLDRCEQLFTFVENNVVVWKEDLFFSACKNNLLRMCNDLLRRLSRSQQTVFCGRI 155

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNF 205
           LLFLAKFFPFSERSGLNI+SEFN+EN TEFG ++       +  + D +   KV ID+N 
Sbjct: 156 LLFLAKFFPFSERSGLNIVSEFNLENHTEFGSEKSEGDDLEQINKDDDKSEIKVPIDYNL 215

Query: 206 YKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSLN 255
           Y+KFW+LQD+FRNP QCY K+ WK+F+++A +VL+AF S+KL++          V SS+ 
Sbjct: 216 YRKFWALQDFFRNPNQCYQKMHWKVFSAHASSVLSAFSSFKLEEQRNYPSTCIKVDSSIE 275

Query: 256 PSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQ 312
            S     YFAKYLTNQKLL+LQLSD+NFRRYVLLQFLILFQY  STVK + E  ELK DQ
Sbjct: 276 GSHKETPYFAKYLTNQKLLELQLSDSNFRRYVLLQFLILFQYLNSTVKFKAETHELKPDQ 335

Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLT-SITD 371
             WVK TTE VY+L+ +TPPDG  F++ VK ILK EEHWN WKNEGCP  K+P   S TD
Sbjct: 336 VDWVKATTEQVYALLTETPPDGPTFAETVKNILKREEHWNAWKNEGCPAFKKPTPDSSTD 395

Query: 372 EDKKDEPDAKKKK-----------------TPELTKLWN-SKDNLEACKSAERDFTPSLE 413
            +   +P   K++                  PELTKLWN   +NLEACKS ERDF PSLE
Sbjct: 396 IEDLRKPKRPKRRIGDVIRDAQTVGKYHMGNPELTKLWNLCPNNLEACKSKERDFLPSLE 455

Query: 414 SYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIEN 473
           +YFEEAI Q+DP A VE++YKKVND N+ WRALRLL+R+ PHFF++    + K  E++E 
Sbjct: 456 TYFEEAIMQLDPNAMVEDEYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLPEYLET 515

Query: 474 MVKRCVKEKPS 484
           M+K+  K++P+
Sbjct: 516 MIKKIAKDRPT 526


>gi|307213955|gb|EFN89189.1| THO complex subunit 1 [Harpegnathos saltator]
          Length = 653

 Score =  553 bits (1424), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 270/488 (55%), Positives = 353/488 (72%), Gaps = 32/488 (6%)

Query: 29  DYDKKSALEQSIRQYLL-KLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTF 87
           D D+K+A++Q++R  LL  L +    ++  +E YI+ C+ELC KD+  +++P++LL D F
Sbjct: 38  DGDRKAAVDQALRDTLLIVLTENSSGNVHSLELYITFCIELCRKDLATASMPVMLLGDIF 97

Query: 88  DMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
           D  TLD+CEQLF +VE NV++WK+  FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRILL
Sbjct: 98  DSVTLDRCEQLFTFVEDNVSVWKEDLFFSACKNNLLRMCNDLLRRLSRSQQTVFCGRILL 157

Query: 148 FLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYK 207
           FLAKFFPFSERSGLNI+SEFN+EN TEFG ++       +  + D +  +K+ ID+N Y+
Sbjct: 158 FLAKFFPFSERSGLNIVSEFNLENYTEFGSEKTEGDDLEQINKDDDKSENKIPIDYNLYR 217

Query: 208 KFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSLNPS 257
           KFW+LQDYFRNP QCYNK+ WK+F+++A  VL+AF S+KL++          + SS+  S
Sbjct: 218 KFWALQDYFRNPNQCYNKMHWKVFSAHASNVLSAFSSFKLEEQRSYPTTCIKMDSSMEGS 277

Query: 258 G---DYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEK 314
                YFAKYLTNQKLL+LQLSD+NFRRYVLLQFLILFQY  STVK + E  ELK DQ +
Sbjct: 278 HKETHYFAKYLTNQKLLELQLSDSNFRRYVLLQFLILFQYLNSTVKFKAETHELKPDQVE 337

Query: 315 WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDK 374
           WVK TTE VY+L+ +TPPDG  F++ VK ILK EEHWN WKNEGCP  KRP + I  +++
Sbjct: 338 WVKTTTEQVYALLTETPPDGPAFAETVKNILKREEHWNAWKNEGCPPFKRPTSDIAADEE 397

Query: 375 ---------------KDEPDAKK--KKTPELTKLWN-SKDNLEACKSAERDFTPSLESYF 416
                          +D     K     PELTKLWN   DNLEACKS +RDF PSLE+YF
Sbjct: 398 PRKPKRPRRRIGDVIRDAQTVGKYHMGNPELTKLWNLCPDNLEACKSKDRDFLPSLETYF 457

Query: 417 EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVK 476
           E+AI ++DPAA ++++YKKVND N+ WRALRLL+R+ PHFF++    + K  E++E M+K
Sbjct: 458 EDAIVELDPAAMIDDKYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLPEYLETMIK 517

Query: 477 RCVKEKPS 484
           +  K++P+
Sbjct: 518 KIAKDRPT 525


>gi|340720907|ref|XP_003398870.1| PREDICTED: THO complex subunit 1-like [Bombus terrestris]
          Length = 654

 Score =  552 bits (1423), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 291/585 (49%), Positives = 387/585 (66%), Gaps = 41/585 (7%)

Query: 3   DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLI-KTPDIDIKVIENY 61
           D  Q  F  S + +   K  + + T D D+K+A++Q++R  LL ++ +    ++ ++E Y
Sbjct: 13  DYLQQCFKLSDIQIFQKK-CVERCTNDSDRKAAIDQALRDTLLVILSENVSSNVPLLETY 71

Query: 62  ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN 121
           I+ C+ELC KD+  +++P++LL D FD  TLD+CEQLF +VE NV +WK+  FF +CKNN
Sbjct: 72  ITFCIELCRKDLATASMPVMLLGDIFDSMTLDRCEQLFTFVENNVVVWKEDLFFSACKNN 131

Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
           LLRMCNDLLRRLSRSQ TVFCGRILLFLAKFFPFSERSGLNI+SEFN+EN TEFG ++  
Sbjct: 132 LLRMCNDLLRRLSRSQQTVFCGRILLFLAKFFPFSERSGLNIVSEFNLENHTEFGSEKSE 191

Query: 182 DVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAA 241
                +  + D +   KV ID+N Y+KFW+LQD+FRNP Q Y K+ WK+F+++A +VL+A
Sbjct: 192 GDDLEQITKDDDKSDTKVPIDYNLYRKFWALQDFFRNPNQLYLKMHWKIFSTHASSVLSA 251

Query: 242 FKSYKLDD----------VQSSLNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQF 288
           F S+KL++          V+SS+  S     YFAKYLTNQKLL+LQLSD+NFRRYVLLQF
Sbjct: 252 FSSFKLEEQRNYPSTCVKVESSMEGSHKETPYFAKYLTNQKLLELQLSDSNFRRYVLLQF 311

Query: 289 LILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
           LILFQY  STVK + E  ELK DQ +WVK TTE VY+L+ +TPPDG  F++ VK ILK E
Sbjct: 312 LILFQYLNSTVKFKAETHELKPDQVEWVKGTTEQVYALLTETPPDGPTFAETVKNILKRE 371

Query: 349 EHWNQWKNEGCPELKRPLT-SITDEDKKDEPDAKKKK-----------------TPELTK 390
           EHWN WKNEGCP  KRP   S  D ++  +P   ++K                  PELTK
Sbjct: 372 EHWNAWKNEGCPPFKRPTPDSSADIEESKKPKRPRRKIGDVIRDAQAVGKYHMGNPELTK 431

Query: 391 LWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLL 449
           LWN   +NLEACKS ERDF PSLE+YFEEAI Q+DP A VE++YKKVND N+ WRALRLL
Sbjct: 432 LWNLCPNNLEACKSKERDFLPSLETYFEEAIMQLDPNAMVEDEYKKVNDGNFGWRALRLL 491

Query: 450 SRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEE 509
           +R+ PHFF++    + K  E++E M+K+  K++P +Q        +  P E      +E+
Sbjct: 492 ARRSPHFFVHGNYPINKLPEYLETMIKKIAKDRP-TQSDIKLETEETPPPEANDQEFNED 550

Query: 510 IQEEEKEEDWEAKADPEG------DADEVMSVEYCYQRNWHKQAV 548
           + ++E E+      D +G        D V  +    + +W K A 
Sbjct: 551 VLKQESEQVEAENTDTKGRKLTKVTPDMVAKLSEILKNDWKKLAT 595


>gi|345480955|ref|XP_001607027.2| PREDICTED: THO complex subunit 1 isoform 1 [Nasonia vitripennis]
          Length = 656

 Score =  546 bits (1407), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 273/519 (52%), Positives = 361/519 (69%), Gaps = 34/519 (6%)

Query: 29  DYDKKSALEQSIRQYLLKLIKTPDID-IKVIENYISLCVELCMKDMCNSTLPIILLSDTF 87
           D D+K A++Q++R  L+ ++     +  + +E+YI  C+ELC KD+  +++P++LLSD F
Sbjct: 38  DKDRKPAIDQALRDTLVIILSNNATNNTRTLESYIKFCIELCRKDLATTSMPVMLLSDMF 97

Query: 88  DMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
           D  TL+ CE LF YVE NV+ WK+  FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRILL
Sbjct: 98  DALTLNLCEDLFTYVENNVSTWKEDLFFGACKNNLLRMCNDLLRRLSRSQQTVFCGRILL 157

Query: 148 FLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYK 207
           FLAKFFPFSERSGLNI+SEFN+ENITEFG ++  +    + E    +   K+ ID+N Y+
Sbjct: 158 FLAKFFPFSERSGLNIVSEFNLENITEFGSEKSEEEVIQQMEVDGEKSETKISIDYNLYR 217

Query: 208 KFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSLNPS 257
           KFW+LQD+FRNP QCYNK+ WK F+++A  VL+ F S+KL+D          ++S L+  
Sbjct: 218 KFWALQDFFRNPNQCYNKMHWKCFSTHATNVLSTFASFKLEDQKDCIIDHLKIESLLDGR 277

Query: 258 ---GDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEK 314
                YFAKYLTN+KLL+LQLSD+NFRR+VLLQFLILFQY  STVK + E  ELK DQ +
Sbjct: 278 YKDNHYFAKYLTNEKLLELQLSDSNFRRFVLLQFLILFQYLNSTVKFKSETHELKPDQVE 337

Query: 315 WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS-ITDED 373
           WV  TTE VY L+ +TPPDG  F++ VK ILK EE+WN WKNEGCP  KRP    +T+ D
Sbjct: 338 WVNTTTEQVYLLLTETPPDGAAFAEAVKHILKREEYWNSWKNEGCPAFKRPAPEPMTEAD 397

Query: 374 KKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACKSAERDFTPSLESY 415
           +  +P   ++K                   ELTKLWN   +NLEACKS +RDF PSLE+Y
Sbjct: 398 EARKPKRLRRKIGDLIRDAQAVGKYYMGNQELTKLWNLCPNNLEACKSKDRDFLPSLETY 457

Query: 416 FEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMV 475
           FEEAI+Q+DPAAA+E++YKKVND N+ WRALRLL+R+ PHFF++    + K  E++E M+
Sbjct: 458 FEEAIEQLDPAAAIEDEYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLQEYLEIMI 517

Query: 476 KRCVKEKPSSQISGNGNGVDQDPAE-VEVDTKSEEIQEE 513
           K+  K++P  Q        +  P+E +E D   E +++E
Sbjct: 518 KKIAKDRPQVQTDIKQEAGETPPSEQIEQDFNEETLKQE 556


>gi|345480957|ref|XP_003424254.1| PREDICTED: THO complex subunit 1 isoform 2 [Nasonia vitripennis]
          Length = 660

 Score =  542 bits (1397), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 274/522 (52%), Positives = 363/522 (69%), Gaps = 37/522 (7%)

Query: 29  DYDKKSALEQSIRQYLLKLIKTPDID-IKVIENYISLCVELCMKDMCNSTLPIILLSDTF 87
           D D+K A++Q++R  L+ ++     +  + +E+YI  C+ELC KD+  +++P++LLSD F
Sbjct: 38  DKDRKPAIDQALRDTLVIILSNNATNNTRTLESYIKFCIELCRKDLATTSMPVMLLSDMF 97

Query: 88  DMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
           D  TL+ CE LF YVE NV+ WK+  FF +CKNNLLRMCNDLLRRLSRSQ TVFCGRILL
Sbjct: 98  DALTLNLCEDLFTYVENNVSTWKEDLFFGACKNNLLRMCNDLLRRLSRSQQTVFCGRILL 157

Query: 148 FLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYK 207
           FLAKFFPFSERSGLNI+SEFN+ENITEFG ++  +    + E    +   K+ ID+N Y+
Sbjct: 158 FLAKFFPFSERSGLNIVSEFNLENITEFGSEKSEEEVIQQMEVDGEKSETKISIDYNLYR 217

Query: 208 KFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD----------VQSSLNPS 257
           KFW+LQD+FRNP QCYNK+ WK F+++A  VL+ F S+KL+D          ++S L+  
Sbjct: 218 KFWALQDFFRNPNQCYNKMHWKCFSTHATNVLSTFASFKLEDQKDCIIDHLKIESLLDGR 277

Query: 258 ---GDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEK 314
                YFAKYLTN+KLL+LQLSD+NFRR+VLLQFLILFQY  STVK + E  ELK DQ +
Sbjct: 278 YKDNHYFAKYLTNEKLLELQLSDSNFRRFVLLQFLILFQYLNSTVKFKSETHELKPDQVE 337

Query: 315 WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS-ITDED 373
           WV  TTE VY L+ +TPPDG  F++ VK ILK EE+WN WKNEGCP  KRP    +T+ D
Sbjct: 338 WVNTTTEQVYLLLTETPPDGAAFAEAVKHILKREEYWNSWKNEGCPAFKRPAPEPMTEAD 397

Query: 374 KKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACKSAERDFTPSLESY 415
           +  +P   ++K                   ELTKLWN   +NLEACKS +RDF PSLE+Y
Sbjct: 398 EARKPKRLRRKIGDLIRDAQAVGKYYMGNQELTKLWNLCPNNLEACKSKDRDFLPSLETY 457

Query: 416 FEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMV 475
           FEEAI+Q+DPAAA+E++YKKVND N+ WRALRLL+R+ PHFF++    + K  E++E M+
Sbjct: 458 FEEAIEQLDPAAAIEDEYKKVNDGNFGWRALRLLARRSPHFFVHGNYPINKLQEYLEIMI 517

Query: 476 KRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEE 517
           K+  K++P   I+     + Q+  E      SE+I+++  EE
Sbjct: 518 KKIAKDRPVC-ITNVQTDIKQEAGET---PPSEQIEQDFNEE 555


>gi|328714151|ref|XP_001950767.2| PREDICTED: THO complex subunit 1-like [Acyrthosiphon pisum]
          Length = 656

 Score =  504 bits (1297), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 256/495 (51%), Positives = 342/495 (69%), Gaps = 39/495 (7%)

Query: 29  DYDKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDT 86
           D D++S L+Q +R Y+L ++  K P++   V + + S  +EL   +MC + +P+++LSD 
Sbjct: 40  DADRRSILDQVLRCYVLDIVLSKPPNL-YDVCKMWTSFTIELVRNEMCTAIMPVVILSDM 98

Query: 87  FDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRIL 146
           FD++T+D CE++F +VE NVN+ K+  FFM+CKNNLLRMCNDLLRRLSRS+NTVFCGRIL
Sbjct: 99  FDVTTMDVCEKMFDHVESNVNVLKEPQFFMACKNNLLRMCNDLLRRLSRSRNTVFCGRIL 158

Query: 147 LFLAKFFPFSERSGLNIISEFNVENITE------FGGDEEMDVSSNETEETDTEEVDKVK 200
           LFLAKFFPFSERSGLNI+SEFN+EN+TE      F   E M  S  E E      ++KV 
Sbjct: 159 LFLAKFFPFSERSGLNIVSEFNLENVTEYLNENHFDQTECMPDSIPEDEAKSDIVIEKVN 218

Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD- 259
           +D N Y KFWSLQDYFRNP QCY    WK+F S+ + + + F+S+KL+ V    N S + 
Sbjct: 219 VDKNLYLKFWSLQDYFRNPNQCYKTEHWKLFISHTDFIFSTFQSHKLEHVNRMDNESAEQ 278

Query: 260 ---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWV 316
              YF KYLTNQKLL+LQLSD NFRRY+L+QFL+LFQY  + VK + +  ++K +Q++W+
Sbjct: 279 NMHYFPKYLTNQKLLELQLSDVNFRRYILIQFLLLFQYLDAPVKFKLDNFKIKPEQQEWI 338

Query: 317 KDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKD 376
           K++  T+Y LI  TPPDG  FS+VVK IL+ EE WN+WKN+GCPE+ + L +    D+ D
Sbjct: 339 KNSITTIYELINNTPPDGVRFSEVVKNILQREEQWNKWKNDGCPEVTKRLPT---SDQSD 395

Query: 377 EPDAKKK--------------------KTPELTKLWNSK-DNLEACKSAERDFTPSLESY 415
            P  K++                      P+LTKLWN K DNLEACK  ERDF PS +S+
Sbjct: 396 MPSPKRRLRRNLGDVIKNMTANNKVFLGDPDLTKLWNLKPDNLEACKGPERDFLPSFDSF 455

Query: 416 FEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMV 475
           F+EA +Q+DP   +E QYKKVND N+ WRALRLL+RK PHFF  +   + K SE++ENM+
Sbjct: 456 FQEAFEQLDPELCIESQYKKVNDGNFGWRALRLLARKSPHFFAQSNNPIHKLSEYLENMI 515

Query: 476 KRCV-KEK-PSSQIS 488
           K+ + KEK P +Q S
Sbjct: 516 KKTMSKEKIPINQGS 530


>gi|91088827|ref|XP_970265.1| PREDICTED: similar to Hpr1 CG2031-PA [Tribolium castaneum]
 gi|270012332|gb|EFA08780.1| hypothetical protein TcasGA2_TC006470 [Tribolium castaneum]
          Length = 649

 Score =  492 bits (1267), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 263/548 (47%), Positives = 359/548 (65%), Gaps = 38/548 (6%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           D K+ L+Q+ R  LL+L++  ++++ V+   +   V  C K++ N T+P++LLSD FD  
Sbjct: 41  DLKAPLDQAFRDVLLELLEK-EVNVSVLRELVDFSVLTCRKELSNPTMPVMLLSDIFDAL 99

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
           TLD C+++F YVE NV +WK+  FF  CKNNLLR+CNDLLRRLSRS  TVFCGRILLFLA
Sbjct: 100 TLDACQEMFLYVENNVEVWKENLFFSCCKNNLLRLCNDLLRRLSRSTATVFCGRILLFLA 159

Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFW 210
           KFFPFSERSGLNI+SEFN+EN TE+G D     S  +  + D +E   V IDF FY KFW
Sbjct: 160 KFFPFSERSGLNIVSEFNLENTTEYGTDS----SQEKGGDVDGDEKKNV-IDFTFYCKFW 214

Query: 211 SLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD-----YFAKYL 265
           SLQD+FRNP Q Y KV WK+F +YA +V   F+  KL+ V+++ +   +     YF+KYL
Sbjct: 215 SLQDFFRNPNQLYTKVQWKLFCTYATSVFNTFQGLKLEHVENNGSYCDESKNKMYFSKYL 274

Query: 266 TNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYS 325
           T+QKLLDLQL D NFRR VLLQFLILFQY +S+VK + E  ELKSDQ++W++ +TE VYS
Sbjct: 275 TSQKLLDLQLYDVNFRRAVLLQFLILFQYLSSSVKFKSESFELKSDQKEWIQSSTEKVYS 334

Query: 326 LIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKK---DEP---D 379
           L+++TPPDG+ F+++   IL  EE+WN WKN+GCPELK+ +      DK+   + P   D
Sbjct: 335 LLRETPPDGKRFAEIASNILSREEYWNAWKNDGCPELKKSVVPTESPDKRRTEERPLLGD 394

Query: 380 AKKKKTP---------ELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAV 429
             K+ +          ELTKLWN   DNLEACK  ERDF P+LE YF +AI Q++   AV
Sbjct: 395 IIKQASSEGKYYMGSTELTKLWNLCPDNLEACKGRERDFLPTLEDYFADAIAQIESPGAV 454

Query: 430 EEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISG 489
           +++   + D NY WRALRLL+++ PHFF  +T  +   S ++E MVK+   +KP  +++ 
Sbjct: 455 KDEDNLLKDGNYGWRALRLLAKRSPHFFTYSTVIMNPLSSYLEMMVKKIAHDKP-GRVTE 513

Query: 490 NGNGVDQDPAEVE-------VDTKSEEIQEEEKEE---DWEAKADPEGDADEVMSVEYCY 539
           N     Q+ AE E       VD  +E+I+  E EE   D  A+ + + + +++       
Sbjct: 514 NSQESQQNDAEEENLFKDTQVDENTEDIKPSETEEFMDDKNARVEHKLNPNQLQLFSEKV 573

Query: 540 QRNWHKQA 547
             NW K A
Sbjct: 574 APNWKKLA 581


>gi|357606797|gb|EHJ65225.1| hypothetical protein KGM_07180 [Danaus plexippus]
          Length = 565

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 248/510 (48%), Positives = 334/510 (65%), Gaps = 56/510 (10%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           D+KSA++Q+ R    KL+     +   +E+Y++ C++ C + M  +T+P++LL D FD  
Sbjct: 41  DRKSAMDQAFRD---KLLDLLLEEPNTLESYVNFCIDSCRRQMVTATMPVVLLGDIFDAL 97

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
           TL+KCE++F YVE  VNIW+++ FF++CKN+LLRMCNDLLRRLSRSQNTVFCGRILLFLA
Sbjct: 98  TLNKCEKMFMYVENGVNIWREELFFVACKNHLLRMCNDLLRRLSRSQNTVFCGRILLFLA 157

Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTE-EVDKVKIDFNFYKKF 209
           KFFPFSERSGLNI+SEFN+ENITEFGGD    +     EE   E + +K+ ID+N Y +F
Sbjct: 158 KFFPFSERSGLNIVSEFNLENITEFGGDNTSTLKDVLDEEMVVEDDKNKLVIDYNLYCRF 217

Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-----------------DVQS 252
           WSLQD+FRNP  CYNK+ WK F +++ +VL+AF SYKL+                 DV+ 
Sbjct: 218 WSLQDFFRNPNTCYNKLQWKTFVAHSGSVLSAFSSYKLEAVELQKSKLNILKSVNSDVEM 277

Query: 253 SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQ 312
             N    YFAK+LTNQKLL+LQLSD+NFRR VL+Q+LILFQY  STVK + E  ELKSDQ
Sbjct: 278 QENKEQHYFAKFLTNQKLLELQLSDSNFRRCVLIQYLILFQYLMSTVKFKMESQELKSDQ 337

Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
             WVKDTT  VY L+ +TPPDG+ F++ VK ILK EEHWN WKN+GCPE ++P   +  E
Sbjct: 338 IDWVKDTTALVYKLLGETPPDGKQFAECVKRILKREEHWNSWKNDGCPEFQKPKPPVQAE 397

Query: 373 DKKDEPDAKKKK------------------TPELTKLWN-SKDNLEACKSAERDFTPSLE 413
           +++ +   K+++                    +LTKLWN   DNL AC++ ERDF PSLE
Sbjct: 398 NEEVKRSRKRRRPVGDIIKEYSGTDKFFMGNNDLTKLWNLCPDNLAACRTKERDFMPSLE 457

Query: 414 SYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIEN 473
           SY                           WRALRLL+R+ PHFF++    + +  +++++
Sbjct: 458 SYMLSGDGGEGAGGW-------------GWRALRLLARRSPHFFVHTNNPIGRLPDYLDD 504

Query: 474 MVKRCVKEKPSSQISGNGNG---VDQDPAE 500
           MVK+  +E  ++ +S N NG   V+ D A+
Sbjct: 505 MVKKITREVAANNVSNNTNGDSNVNNDKAK 534


>gi|242019012|ref|XP_002429961.1| THO complex subunit, putative [Pediculus humanus corporis]
 gi|212515012|gb|EEB17223.1| THO complex subunit, putative [Pediculus humanus corporis]
          Length = 597

 Score =  479 bits (1232), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 256/514 (49%), Positives = 337/514 (65%), Gaps = 42/514 (8%)

Query: 40  IRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLF 99
           +R  LL L    ++++K +E ++S  VEL  +++C+ ++PI+LLSDTFD+ TLD+CE+LF
Sbjct: 1   MRNCLLNLASNSEVELKSLEEFVSFAVELSRREICSPSIPIVLLSDTFDIFTLDQCEKLF 60

Query: 100 YYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERS 159
            YVE NVNIWK+  FF  CKNNLLRMCNDLLRRLSRSQNTVF GRILLFLAKFFPFSERS
Sbjct: 61  AYVENNVNIWKEDMFFSGCKNNLLRMCNDLLRRLSRSQNTVFRGRILLFLAKFFPFSERS 120

Query: 160 GLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP 219
           GLNI+SEFN++N+ E   D    + + ET E + EE+ K KID+NFY KF SLQD+ R+P
Sbjct: 121 GLNIVSEFNLDNVVEQSQDTS-SLENFETSEENDEEI-KFKIDYNFYNKFLSLQDFMRSP 178

Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP---------SGDYFAKYLTNQKL 270
            QCYNKV W++F + A  VLAAF+++KLD  Q    P         S  +FAKYLTN KL
Sbjct: 179 NQCYNKVRWRVFVNSATDVLAAFQNFKLDADQYQDKPIHDHISEATSSTFFAKYLTNPKL 238

Query: 271 LDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQT 330
           L+LQLSD +FRR +L+QFLI+ QY +S VK +    ELK DQ+ W+K+    VY L+ +T
Sbjct: 239 LELQLSDPDFRRSILIQFLIILQYLSSPVKFKLGSYELKPDQQDWIKEQVTLVYQLLAET 298

Query: 331 PPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKT----- 385
           P +   F+Q VK IL  EE WN WKN GCPE K P         K +   KK+       
Sbjct: 299 PLNSHSFAQSVKHILAREELWNNWKNAGCPEFKMPGNVEEKNSTKTKVRGKKRNIGELIK 358

Query: 386 ------------PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQ 432
                       PELT+LWN + +NLEACK+ ER F P  + YFEEAI QMDP AA+E +
Sbjct: 359 EATAENKYYMGHPELTRLWNLNSNNLEACKAKERKFLPDFDEYFEEAIMQMDPEAAIEPE 418

Query: 433 YKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGN 492
           YKKV D N+ WRALRLL+++   FF++    V    +++E ++K+ V+++P      N N
Sbjct: 419 YKKVKDPNFGWRALRLLTQRTSCFFISTNNPVTGLPDYLELILKKIVQDRPQ-----NSN 473

Query: 493 G------VDQDPAEVEVDTKSE--EIQEEEKEED 518
           G      ++ +  EVE   KSE  E+QE+  EE+
Sbjct: 474 GEIKQEVLNTEAEEVEDLLKSEEKEMQEDGGEEN 507


>gi|41151980|ref|NP_958481.1| THO complex subunit 1 [Danio rerio]
 gi|32822787|gb|AAH54938.1| THO complex 1 [Danio rerio]
          Length = 655

 Score =  469 bits (1208), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 245/533 (45%), Positives = 335/533 (62%), Gaps = 57/533 (10%)

Query: 24  YKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILL 83
           + P  + +KK+ L+Q++R  L + I    +++    + I + ++   + +C++T P +LL
Sbjct: 38  HLPGNETEKKATLDQALRGVLEEQIVNQKVNVDDFLSLIYISIDGVTEGICSATTPFLLL 97

Query: 84  SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
            D  D   LD+C+++F +VE NV+ WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCG
Sbjct: 98  GDVLDCLPLDQCDKIFSFVEENVSTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCG 157

Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNET 188
           RI LFLA+ FP SE+SGLN+ S+FN++NIT F  +E+               MDV   E 
Sbjct: 158 RIQLFLARLFPLSEKSGLNLQSQFNLDNITVFNKNEQDSTLGQQHTEVKEEGMDVEEGEM 217

Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
            + D      + ID+N Y+KFW+LQDYFRNPVQCY+K SW  F  Y++  LA FKS+KLD
Sbjct: 218 GDEDAPAPSSIPIDYNLYRKFWTLQDYFRNPVQCYDKFSWMTFIKYSDEALAVFKSFKLD 277

Query: 249 DVQSS-------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           D+Q+S          SGD  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 278 DMQASKKKLEEMRTSSGDHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 337

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  DQ  W++DTT+ VY L+K+ PPDG+ F  +V+ IL  EE+WN WKNEGC
Sbjct: 338 KFKSSSCVLNDDQSLWIEDTTKLVYQLLKEIPPDGDKFGSMVEHILNTEENWNSWKNEGC 397

Query: 360 PEL--KRPLTS--ITDEDKKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P    +RP  +  I    K+  P+    K P         ELT+LWN + DN+EACKS  
Sbjct: 398 PSFVKERPAETKPIRPSRKRQAPEDFLGKGPDRKILMGNDELTRLWNLNPDNMEACKSEN 457

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R+F PSLE +FEEAI+Q DPA  VE++YK V +SNY WRALRLLSR+ PHFF       +
Sbjct: 458 REFMPSLEDFFEEAIEQADPANMVEDEYKVVRNSNYGWRALRLLSRRSPHFFQPTNQQFK 517

Query: 466 KNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEED 518
             ++++ENMV +  KE P                    D  SEEI+  E+++D
Sbjct: 518 SLADYLENMVIKLAKELPK-------------------DIPSEEIKTGEEDDD 551


>gi|328721799|ref|XP_001947842.2| PREDICTED: THO complex subunit 1-like [Acyrthosiphon pisum]
          Length = 635

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 233/470 (49%), Positives = 327/470 (69%), Gaps = 23/470 (4%)

Query: 31  DKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFD 88
           +++  L+Q +R ++L ++  K P++   V + + S  +EL   +MC + +P+ +LSD F 
Sbjct: 41  ERRPILDQVLRCHVLDIVSSKPPNL-YDVCKMWTSFTIELVRNEMCTAIMPVAILSDMFA 99

Query: 89  MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
           ++T++ CE++F +VE NVN+ K+  FFM+ KNNLLRMCND+LRRLSRSQNTVFCGRILLF
Sbjct: 100 VTTINVCEKIFDHVESNVNVLKEPRFFMASKNNLLRMCNDILRRLSRSQNTVFCGRILLF 159

Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKK 208
           LA FFPFSERSGLNIISEFN+EN+TE+  +   D  +     +  E+  K+ ID N Y K
Sbjct: 160 LAIFFPFSERSGLNIISEFNLENLTEYLNENHFD-QTEYMPASIAEDEAKLDIDNNLYLK 218

Query: 209 FWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD----YFAKY 264
           FWSLQDYFRNP QCY    WK+F S+ + ++++F+S+K + V    N S +    YF KY
Sbjct: 219 FWSLQDYFRNPNQCYETEHWKLFISHTDFIISSFQSHKSEHVNKIDNKSAEQNTYYFPKY 278

Query: 265 LTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVY 324
           L NQK+L LQLSD NFRR +L+QFL+LFQY  + VK + +  ++K +Q++W+K++  T+Y
Sbjct: 279 LPNQKILKLQLSDVNFRRNILIQFLLLFQYLDAPVKFKLDSFKIKPEQQEWIKNSITTIY 338

Query: 325 SLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS-----ITDEDKKDEPD 379
            LI  TPPDG  FS+VVK IL+ EE WN+WKN+GCPE+ + L +     +T   +++  D
Sbjct: 339 ELINNTPPDGVRFSEVVKSILQREEQWNKWKNDGCPEVTKRLPTSDQSDMTAPKRRNLGD 398

Query: 380 AKKKKT---------PELTKLWNSK-DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAV 429
             K  T         P+LTKLWN K DNLEACK  ERDF PS +S+F+EA +Q+D    +
Sbjct: 399 VIKNMTANNKVFLGDPDLTKLWNLKPDNLEACKRPERDFLPSFDSFFKEAFEQLDAELCI 458

Query: 430 EEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCV 479
           E Q KKVND N+ WRALRLL+RK PHFF N+  +++K  E++ENM+K+ +
Sbjct: 459 ESQNKKVNDINFGWRALRLLARKSPHFFANSGDSIQKVPEYLENMIKKTL 508


>gi|432105426|gb|ELK31641.1| THO complex subunit 1 [Myotis davidii]
          Length = 657

 Score =  466 bits (1198), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/498 (48%), Positives = 318/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHAEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRAVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSDT 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R++ P+LE +FEEAI+Q DP + VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 REYMPTLEEFFEEAIEQADPESMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534


>gi|354490073|ref|XP_003507184.1| PREDICTED: THO complex subunit 1 [Cricetulus griseus]
          Length = 655

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 39  PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 94

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV+ WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 95  LGDVLDCLPLDQCDTIFTFVEKNVSTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 154

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+ENIT F  +E+               MDV   E
Sbjct: 155 GRIQLFLARLFPLSEKSGLNLQSQFNLENITVFNTNEQESTLGQKHAEDREEGMDVEEGE 214

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 215 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 274

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 275 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 334

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 335 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 394

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +   S T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 395 PSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCPDNMEACKSET 454

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 455 REYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 514

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 515 SLPEYLENMVIKLAKELP 532


>gi|355724121|gb|AES08116.1| THO complex 1 [Mustela putorius furo]
          Length = 656

 Score =  465 bits (1197), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/502 (48%), Positives = 317/502 (63%), Gaps = 52/502 (10%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 47  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 102

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 103 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 162

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 163 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 222

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 223 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 282

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 283 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 342

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 343 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 402

Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
           P   +  TS    D K    A+K+  P                 ELT+LWN   DN+EAC
Sbjct: 403 PSFVKERTS----DTKPTRVARKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEAC 458

Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT 461
           KS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF    
Sbjct: 459 KSETREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTN 518

Query: 462 PNVEKNSEFIENMVKRCVKEKP 483
              +   E++ENMV +  KE P
Sbjct: 519 QQFKSLPEYLENMVIKLAKELP 540


>gi|296222353|ref|XP_002757154.1| PREDICTED: THO complex subunit 1 [Callithrix jacchus]
          Length = 657

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPSKKILMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534


>gi|73961844|ref|XP_547651.2| PREDICTED: THO complex subunit 1 isoform 1 [Canis lupus familiaris]
          Length = 657

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/502 (48%), Positives = 317/502 (63%), Gaps = 52/502 (10%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
           P   +  TS    D K    A+K+  P                 ELT+LWN   DN+EAC
Sbjct: 397 PSFVKERTS----DTKPTRVARKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEAC 452

Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT 461
           KS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF    
Sbjct: 453 KSETREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTN 512

Query: 462 PNVEKNSEFIENMVKRCVKEKP 483
              +   E++ENMV +  KE P
Sbjct: 513 QQFKSLPEYLENMVIKLAKELP 534


>gi|301774498|ref|XP_002922667.1| PREDICTED: THO complex subunit 1-like [Ailuropoda melanoleuca]
          Length = 657

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/502 (48%), Positives = 317/502 (63%), Gaps = 52/502 (10%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
           P   +  TS    D K    A+K+  P                 ELT+LWN   DN+EAC
Sbjct: 397 PSFVKERTS----DTKPTRVARKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEAC 452

Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT 461
           KS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF    
Sbjct: 453 KSETREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTN 512

Query: 462 PNVEKNSEFIENMVKRCVKEKP 483
              +   E++ENMV +  KE P
Sbjct: 513 QQFKSLPEYLENMVIKLAKELP 534


>gi|397489339|ref|XP_003815687.1| PREDICTED: THO complex subunit 1 [Pan paniscus]
          Length = 657

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPSKKILMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534


>gi|386781620|ref|NP_001247904.1| THO complex subunit 1 [Macaca mulatta]
 gi|332225781|ref|XP_003262062.1| PREDICTED: THO complex subunit 1 [Nomascus leucogenys]
 gi|402902763|ref|XP_003914266.1| PREDICTED: THO complex subunit 1 [Papio anubis]
 gi|90075390|dbj|BAE87375.1| unnamed protein product [Macaca fascicularis]
 gi|355701854|gb|EHH29207.1| THO complex subunit 1 [Macaca mulatta]
 gi|355754925|gb|EHH58792.1| THO complex subunit 1 [Macaca fascicularis]
 gi|380818558|gb|AFE81152.1| THO complex subunit 1 [Macaca mulatta]
 gi|383409917|gb|AFH28172.1| THO complex subunit 1 [Macaca mulatta]
 gi|384942554|gb|AFI34882.1| THO complex subunit 1 [Macaca mulatta]
          Length = 657

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPSKKILMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534


>gi|114672500|ref|XP_512049.2| PREDICTED: THO complex subunit 1 isoform 3 [Pan troglodytes]
 gi|410211798|gb|JAA03118.1| THO complex 1 [Pan troglodytes]
 gi|410247646|gb|JAA11790.1| THO complex 1 [Pan troglodytes]
 gi|410307156|gb|JAA32178.1| THO complex 1 [Pan troglodytes]
 gi|410354343|gb|JAA43775.1| THO complex 1 [Pan troglodytes]
          Length = 657

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPSKKILMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534


>gi|335310827|ref|XP_003362210.1| PREDICTED: THO complex subunit 1, partial [Sus scrofa]
          Length = 654

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 38  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 93

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 94  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 153

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 154 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 213

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 214 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 273

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 274 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 333

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 334 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 393

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 394 PSFVKERTSDTKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSET 453

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 454 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 513

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 514 SLPEYLENMVIKLAKELP 531


>gi|149720837|ref|XP_001491940.1| PREDICTED: THO complex subunit 1 [Equus caballus]
          Length = 657

 Score =  464 bits (1195), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534


>gi|363730911|ref|XP_419149.3| PREDICTED: THO complex subunit 1 [Gallus gallus]
          Length = 656

 Score =  464 bits (1195), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 250/539 (46%), Positives = 335/539 (62%), Gaps = 53/539 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILL 83
           P  + +KK  L+Q+ R  + + I  K P  ++  I   ISL +    + +C ++ P +LL
Sbjct: 31  PGSENEKKCTLDQAFRVIVEEEIINKAPCENLLAI---ISLAINGVTEGICTASTPFVLL 87

Query: 84  SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
            D  D   LD+C+++F +VE NV  WK   F+ + KN LLRMCNDLLRRLS+SQNTVFCG
Sbjct: 88  GDVLDCLPLDQCDKIFTFVEKNVATWKSNMFYSAGKNYLLRMCNDLLRRLSKSQNTVFCG 147

Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEFGG---------------DEEMDVSSNET 188
           RI LFLA+ FP SE+SGLN+ S+FN+EN+T F                 DE MDV   E 
Sbjct: 148 RIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEHESTLGQKHSEERDEGMDVEEGEM 207

Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
            + +      + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F  Y+E VLA FKSYKLD
Sbjct: 208 GDDEAPTSCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKLD 267

Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
           D Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    VK
Sbjct: 268 DTQASRKKLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVK 327

Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
            +     L  +Q  W++DTT+ VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGCP
Sbjct: 328 FKSSNYVLTDEQSLWIEDTTKAVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCP 387

Query: 361 ----ELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNL 398
               E  +P   +   D K    A+K+  P                 ELT+LWN   DN+
Sbjct: 388 SFVKERSQPEFQLGPPDSKPMRPARKRPAPEDFLGKGPNKKILMGNEELTRLWNLCPDNM 447

Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFL 458
           EACKS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF 
Sbjct: 448 EACKSESREYMPTLEEFFEEAIEQADPENMVENKYKAVNNSNYGWRALRLLARRSPHFFQ 507

Query: 459 NATPNVEKNSEFIENMVKRCVKE--KPSSQI-SGNGNGVDQDPAEVEVDTKSEEIQEEE 514
                 +   E++ENMV +  KE   PS +I +G     + + A ++ + +S E+Q ++
Sbjct: 508 PTNQQFKSLPEYLENMVIKLAKELPPPSEEIKTGEDEDEEDNDALLKENNESPEVQRDK 566


>gi|329664570|ref|NP_001192409.1| THO complex subunit 1 [Bos taurus]
 gi|296473722|tpg|DAA15837.1| TPA: hypothetical protein BOS_22300 [Bos taurus]
          Length = 657

 Score =  464 bits (1195), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 242/502 (48%), Positives = 316/502 (62%), Gaps = 52/502 (10%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVERNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
           P   +  TS    D K     +K+  P                 ELT+LWN   DN+EAC
Sbjct: 397 PSFVKERTS----DSKPTRAVRKRAAPEDFLGKGPSKKILMGNDELTRLWNLCPDNMEAC 452

Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT 461
           KS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF    
Sbjct: 453 KSETREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTN 512

Query: 462 PNVEKNSEFIENMVKRCVKEKP 483
              +   E++ENMV +  KE P
Sbjct: 513 QQFKSLPEYLENMVIKLAKELP 534


>gi|417403684|gb|JAA48640.1| Putative nuclear matrix protein [Desmodus rotundus]
          Length = 657

 Score =  464 bits (1195), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 242/498 (48%), Positives = 318/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS +       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDSKPTRIVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSDT 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R++ P+LE +FEEAI+Q DP + VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 REYMPTLEEFFEEAIEQADPESMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534


>gi|426254001|ref|XP_004020677.1| PREDICTED: THO complex subunit 1 [Ovis aries]
          Length = 647

 Score =  464 bits (1195), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 242/502 (48%), Positives = 316/502 (62%), Gaps = 52/502 (10%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 31  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 86

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 87  LGDVLDCLPLDQCDTIFTFVERNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 146

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 147 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 206

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 207 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 266

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 267 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 326

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 327 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 386

Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
           P   +  TS    D K     +K+  P                 ELT+LWN   DN+EAC
Sbjct: 387 PSFVKERTS----DSKPTRAVRKRAAPEDFLGKGPSKKILMGNDELTRLWNLCPDNMEAC 442

Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT 461
           KS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF    
Sbjct: 443 KSETREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTN 502

Query: 462 PNVEKNSEFIENMVKRCVKEKP 483
              +   E++ENMV +  KE P
Sbjct: 503 QQFKSLPEYLENMVIKLAKELP 524


>gi|449268552|gb|EMC79415.1| THO complex subunit 1, partial [Columba livia]
          Length = 654

 Score =  464 bits (1194), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/497 (48%), Positives = 318/497 (63%), Gaps = 42/497 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILL 83
           P  + +KK  L+Q+ R  + + I  K P  ++  I   ISL +    + +C ++ P +LL
Sbjct: 37  PGSENEKKCTLDQAFRVIVEEEIINKAPCENLLAI---ISLAISGVTEGICTASTPFVLL 93

Query: 84  SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
            D  D   LD+C+++F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCG
Sbjct: 94  GDVLDCLPLDQCDKIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCG 153

Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNET 188
           RI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E                MDV   E 
Sbjct: 154 RIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEHESTLGQKHAEEREEGMDVEEGEM 213

Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
            + +      + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F  Y+E VLA FKSYKLD
Sbjct: 214 GDDEAPTSCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKLD 273

Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
           D Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    VK
Sbjct: 274 DTQASRKKLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVK 333

Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
            +     L  +Q  W++DTT+ VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGCP
Sbjct: 334 FKSSNYVLTDEQSLWIEDTTKAVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCP 393

Query: 361 EL--KRPLTS--ITDEDKKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAER 406
               +RP  S  +    K+  P+    K P         ELT+LWN   DN+EACKS  R
Sbjct: 394 SFVKERPPDSKPMRPMRKRPAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSESR 453

Query: 407 DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEK 466
           ++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       + 
Sbjct: 454 EYMPTLEEFFEEAIEQADPENMVENKYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKS 513

Query: 467 NSEFIENMVKRCVKEKP 483
             E++ENMV +  KE P
Sbjct: 514 LPEYLENMVIKLAKELP 530


>gi|23956332|ref|NP_705780.1| THO complex subunit 1 [Mus musculus]
 gi|48474499|sp|Q8R3N6.1|THOC1_MOUSE RecName: Full=THO complex subunit 1; Short=Tho1; AltName:
           Full=Nuclear matrix protein p84
 gi|19353206|gb|AAH24951.1| THO complex 1 [Mus musculus]
 gi|26327609|dbj|BAC27548.1| unnamed protein product [Mus musculus]
 gi|26328027|dbj|BAC27754.1| unnamed protein product [Mus musculus]
          Length = 657

 Score =  464 bits (1193), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 242/498 (48%), Positives = 316/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +   S T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534


>gi|291394182|ref|XP_002713492.1| PREDICTED: THO complex 1 [Oryctolagus cuniculus]
          Length = 657

 Score =  463 bits (1192), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 242/502 (48%), Positives = 316/502 (62%), Gaps = 52/502 (10%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
           P   +  TS    D K     +K+  P                 ELT+LWN   DN+EAC
Sbjct: 397 PSFVKERTS----DAKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEAC 452

Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT 461
           KS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF    
Sbjct: 453 KSETREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTN 512

Query: 462 PNVEKNSEFIENMVKRCVKEKP 483
              +   E++ENMV +  KE P
Sbjct: 513 QQFKSLPEYLENMVIKLAKELP 534


>gi|154448890|ref|NP_005122.2| THO complex subunit 1 [Homo sapiens]
 gi|37999906|sp|Q96FV9.1|THOC1_HUMAN RecName: Full=THO complex subunit 1; Short=Tho1; AltName:
           Full=Nuclear matrix protein p84; Short=p84N5; AltName:
           Full=hTREX84
 gi|14714503|gb|AAH10381.1| THO complex 1 [Homo sapiens]
 gi|119622136|gb|EAX01731.1| THO complex 1, isoform CRA_a [Homo sapiens]
 gi|123984784|gb|ABM83694.1| THO complex 1 [synthetic construct]
 gi|157928504|gb|ABW03548.1| THO complex 1 [synthetic construct]
          Length = 657

 Score =  463 bits (1192), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/498 (48%), Positives = 316/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R+  P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 REHMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534


>gi|403265210|ref|XP_003924839.1| PREDICTED: THO complex subunit 1 [Saimiri boliviensis boliviensis]
          Length = 657

 Score =  463 bits (1192), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 242/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  + + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVVEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534


>gi|60654241|gb|AAX29813.1| THO complex 1 [synthetic construct]
          Length = 658

 Score =  463 bits (1192), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 243/498 (48%), Positives = 316/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R+  P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 REHMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534


>gi|344269141|ref|XP_003406413.1| PREDICTED: THO complex subunit 1 [Loxodonta africana]
          Length = 657

 Score =  463 bits (1191), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 242/498 (48%), Positives = 317/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLRYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS +       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDSKPPRIVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKVVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534


>gi|348576778|ref|XP_003474163.1| PREDICTED: THO complex subunit 1-like [Cavia porcellus]
          Length = 657

 Score =  462 bits (1188), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 242/501 (48%), Positives = 315/501 (62%), Gaps = 50/501 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +      +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTGGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSSTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHPEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PEL-------KRPLTSITDEDKKDEPDAKKKKTP---------ELTKLWN-SKDNLEACK 402
           P          +P+  +    K+  P+    K P         ELT+LWN   DN+EACK
Sbjct: 397 PSFVKERASDSKPMRVVR---KRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACK 453

Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
           S  R + P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF     
Sbjct: 454 SETRQYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTNQ 513

Query: 463 NVEKNSEFIENMVKRCVKEKP 483
             +   E++ENMV +  KE P
Sbjct: 514 QFKSLPEYLENMVIKLAKELP 534


>gi|189054520|dbj|BAG37293.1| unnamed protein product [Homo sapiens]
          Length = 657

 Score =  461 bits (1187), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 242/498 (48%), Positives = 316/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + K+ LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKSYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R+  P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 REHMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLPEYLENMVIKLAKELP 534


>gi|281350925|gb|EFB26509.1| hypothetical protein PANDA_011654 [Ailuropoda melanoleuca]
          Length = 640

 Score =  461 bits (1185), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 243/503 (48%), Positives = 317/503 (63%), Gaps = 53/503 (10%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 23  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 78

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 79  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 138

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 139 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 198

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 199 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 258

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 259 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 318

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 319 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 378

Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
           P   +  TS    D K    A+K+  P                 ELT+LWN   DN+EAC
Sbjct: 379 PSFVKERTS----DTKPTRVARKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEAC 434

Query: 402 KSAER-DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNA 460
           KS  R ++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF   
Sbjct: 435 KSETRQEYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPT 494

Query: 461 TPNVEKNSEFIENMVKRCVKEKP 483
               +   E++ENMV +  KE P
Sbjct: 495 NQQFKSLPEYLENMVIKLAKELP 517


>gi|449494000|ref|XP_004175274.1| PREDICTED: THO complex subunit 1 isoform 2 [Taeniopygia guttata]
          Length = 664

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 242/505 (47%), Positives = 315/505 (62%), Gaps = 50/505 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILL 83
           P  + +KK  L+Q  R  + + I  K P  ++  I   ISL +    + +C ++ P +LL
Sbjct: 39  PGSENEKKCTLDQVFRVVVEEEIINKAPCENLLAI---ISLAINGVTEGICTASTPFVLL 95

Query: 84  SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
            D  D   LD+C+++F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCG
Sbjct: 96  GDVLDCLPLDQCDKIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCG 155

Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNET 188
           RI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E                MDV   E 
Sbjct: 156 RIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEHESTLGQKHTEEREEGMDVEEGEM 215

Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
            + +      + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F  Y+E VLA FKSYKLD
Sbjct: 216 GDDEAPTSCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKLD 275

Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
           D Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    VK
Sbjct: 276 DTQASRKKLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVK 335

Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
            +     L  +Q  W++DTT+ VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGCP
Sbjct: 336 FKSSNYVLTDEQSLWIEDTTKAVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCP 395

Query: 361 ----ELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNL 398
               E   P   +   D K     +K+  P                 ELT+LWN   DN+
Sbjct: 396 SFVKERCHPEFQLGPPDSKPMRPVRKRPAPEDFLGKGSNKKILMGNEELTRLWNLCPDNM 455

Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFL 458
           EACKS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF 
Sbjct: 456 EACKSESREYMPTLEEFFEEAIEQADPENMVENKYKAVNNSNYGWRALRLLARRSPHFFQ 515

Query: 459 NATPNVEKNSEFIENMVKRCVKEKP 483
                 +   E++ENMV +  KE P
Sbjct: 516 PTNQQFKSLPEYLENMVIKLAKELP 540


>gi|224046136|ref|XP_002194707.1| PREDICTED: THO complex subunit 1 isoform 1 [Taeniopygia guttata]
          Length = 656

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 242/503 (48%), Positives = 315/503 (62%), Gaps = 54/503 (10%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILL 83
           P  + +KK  L+Q  R  + + I  K P  ++  I   ISL +    + +C ++ P +LL
Sbjct: 39  PGSENEKKCTLDQVFRVVVEEEIINKAPCENLLAI---ISLAINGVTEGICTASTPFVLL 95

Query: 84  SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
            D  D   LD+C+++F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCG
Sbjct: 96  GDVLDCLPLDQCDKIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCG 155

Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNET 188
           RI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E                MDV   E 
Sbjct: 156 RIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEHESTLGQKHTEEREEGMDVEEGEM 215

Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
            + +      + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F  Y+E VLA FKSYKLD
Sbjct: 216 GDDEAPTSCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKLD 275

Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
           D Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    VK
Sbjct: 276 DTQASRKKLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVK 335

Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
            +     L  +Q  W++DTT+ VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGCP
Sbjct: 336 FKSSNYVLTDEQSLWIEDTTKAVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCP 395

Query: 361 EL--KRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEA 400
               +RP       D K     +K+  P                 ELT+LWN   DN+EA
Sbjct: 396 SFVKERP------PDSKPMRPVRKRPAPEDFLGKGSNKKILMGNEELTRLWNLCPDNMEA 449

Query: 401 CKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNA 460
           CKS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF   
Sbjct: 450 CKSESREYMPTLEEFFEEAIEQADPENMVENKYKAVNNSNYGWRALRLLARRSPHFFQPT 509

Query: 461 TPNVEKNSEFIENMVKRCVKEKP 483
               +   E++ENMV +  KE P
Sbjct: 510 NQQFKSLPEYLENMVIKLAKELP 532


>gi|149031712|gb|EDL86662.1| rCG41283, isoform CRA_b [Rattus norvegicus]
          Length = 659

 Score =  460 bits (1183), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 243/500 (48%), Positives = 316/500 (63%), Gaps = 46/500 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+ENIT F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENITVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +   S T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCPDNMEACKSET 456

Query: 406 R--DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPN 463
           R  ++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF      
Sbjct: 457 RQVEYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQ 516

Query: 464 VEKNSEFIENMVKRCVKEKP 483
            +   E++ENMV +  KE P
Sbjct: 517 FKSLPEYLENMVIKLAKELP 536


>gi|550058|gb|AAA53571.1| protein p84 [Homo sapiens]
          Length = 657

 Score =  459 bits (1182), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 240/498 (48%), Positives = 315/498 (63%), Gaps = 44/498 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYAAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILTGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R+  P+LE +FEEAI+Q DP    E +YK +N+SNY WRAL+LL+R+ PHFF       +
Sbjct: 457 REHMPTLEEFFEEAIEQADPENMAENEYKAMNNSNYGWRALKLLARRSPHFFQPTNQQFK 516

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 517 SLQEYLENMVIKLAKELP 534


>gi|440903496|gb|ELR54147.1| THO complex subunit 1, partial [Bos grunniens mutus]
          Length = 661

 Score =  459 bits (1182), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 242/503 (48%), Positives = 316/503 (62%), Gaps = 53/503 (10%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 44  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 99

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 100 LGDVLDCLPLDQCDTIFTFVERNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 159

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 160 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 219

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 220 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 279

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 280 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 339

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 340 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 399

Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEAC 401
           P   +  TS    D K     +K+  P                 ELT+LWN   DN+EAC
Sbjct: 400 PSFVKERTS----DSKPTRAVRKRAAPEDFLGKGPSKKILMGNDELTRLWNLCPDNMEAC 455

Query: 402 KSAER-DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNA 460
           KS  R ++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF   
Sbjct: 456 KSETRQEYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPT 515

Query: 461 TPNVEKNSEFIENMVKRCVKEKP 483
               +   E++ENMV +  KE P
Sbjct: 516 NQQFKSLPEYLENMVIKLAKELP 538


>gi|148691059|gb|EDL23006.1| THO complex 1, isoform CRA_d [Mus musculus]
          Length = 659

 Score =  459 bits (1181), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 242/500 (48%), Positives = 316/500 (63%), Gaps = 46/500 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +   S T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCPDNMEACKSET 456

Query: 406 R--DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPN 463
           R  ++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF      
Sbjct: 457 RQVEYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQ 516

Query: 464 VEKNSEFIENMVKRCVKEKP 483
            +   E++ENMV +  KE P
Sbjct: 517 FKSLPEYLENMVIKLAKELP 536


>gi|431896300|gb|ELK05716.1| THO complex subunit 1 [Pteropus alecto]
          Length = 658

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 242/499 (48%), Positives = 316/499 (63%), Gaps = 45/499 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +   S T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERASDTKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 R-DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNV 464
           R ++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       
Sbjct: 457 RQEYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQF 516

Query: 465 EKNSEFIENMVKRCVKEKP 483
           +   E++ENMV +  KE P
Sbjct: 517 KSLPEYLENMVIKLAKELP 535


>gi|410977440|ref|XP_003995113.1| PREDICTED: THO complex subunit 1 [Felis catus]
          Length = 636

 Score =  456 bits (1172), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 229/452 (50%), Positives = 295/452 (65%), Gaps = 45/452 (9%)

Query: 73  MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRR 132
           +C ++ P +LL D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRR
Sbjct: 66  ICTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRR 125

Query: 133 LSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE------------ 180
           LS+SQNTVFCGRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+            
Sbjct: 126 LSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDR 185

Query: 181 ---MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
              MDV   E  + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E 
Sbjct: 186 EEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 245

Query: 238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
           VLA FKSYKLDD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct: 246 VLAVFKSYKLDDTQASRKKMEELKAGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 305

Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
           ILFQY    VK +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE
Sbjct: 306 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 365

Query: 350 HWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLW 392
           +WN WKNEGCP   +  TS    D K    A+K+  P                 ELT+LW
Sbjct: 366 NWNSWKNEGCPSFVKERTS----DTKPTRVARKRTAPEDFLGKGPNRKILMGNEELTRLW 421

Query: 393 N-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
           N   DN+EACKS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R
Sbjct: 422 NLCPDNMEACKSETREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLAR 481

Query: 452 KCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
           + PHFF       +   E++ENMV +  KE P
Sbjct: 482 RSPHFFQPTNQQFKSLPEYLENMVIKLAKELP 513


>gi|347971177|ref|XP_309625.4| AGAP004070-PA [Anopheles gambiae str. PEST]
 gi|333466622|gb|EAA05359.4| AGAP004070-PA [Anopheles gambiae str. PEST]
          Length = 709

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 254/557 (45%), Positives = 331/557 (59%), Gaps = 96/557 (17%)

Query: 3   DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYI 62
           +S +  F  + + V+   Y   + + D DKK+AL Q+ R  LL    T   DI  IE +I
Sbjct: 17  ESLERAFAANDIEVLKADYDAARAS-DNDKKTALHQAFRDMLL----TKTEDIPAIERFI 71

Query: 63  SLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL 122
           S  V  C KDM  +T+P++LL D FD  TLD+ EQ+F YVE NV IWK++ FF +CK+NL
Sbjct: 72  SFAVNSCRKDMTVATMPVVLLGDIFDAVTLDRAEQIFTYVENNVAIWKEECFFTACKHNL 131

Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGD-EEM 181
           LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFN+ENITE+G D  EM
Sbjct: 132 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNLENITEYGMDGNEM 191

Query: 182 DV---SSNETEETDTE--EVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAE 236
            V   S     E D E  E +K+KID+N Y KFW+LQD+FRNP QCYNKV WK F ++A+
Sbjct: 192 AVDQLSGGADPEVDGEPAEENKLKIDYNLYCKFWALQDFFRNPNQCYNKVQWKTFETHAK 251

Query: 237 TVLAAFKSYKLDDVQSS--------------------------------LNPSGDYFAKY 264
           +VL+AF S+KL++ ++S                                +  SG +FAK+
Sbjct: 252 SVLSAFCSFKLEEHRASSTSSGPATKPSDVGGSATAAAAANNCPMEEDQIRESGHFFAKF 311

Query: 265 LTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVY 324
           LTN KLL LQLSD+NFRR VL+QFLILFQY  STVK + E   L   Q  W+K+T   VY
Sbjct: 312 LTNPKLLSLQLSDSNFRRSVLVQFLILFQYLNSTVKFKAETHVLTQAQSDWLKETETQVY 371

Query: 325 SLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDK---------- 374
            LI+++PP+G+ F+   + +L  EE WN WKNEGC E KRP  ++ D             
Sbjct: 372 QLIEESPPNGKKFANTARHMLLREELWNSWKNEGCKEFKRPEAALEDTSSSTAAVAGGAA 431

Query: 375 ---------KDEPDAKKKKTP-------------------ELTKLWN-SKDNLEACKSAE 405
                       P AK+ + P                   E+T+LWN   DNL+ACK  +
Sbjct: 432 AGGAATSAGSGRPPAKRPRKPLGDLIRDATKQGKFYMGNSEITRLWNLCPDNLQACKGTD 491

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R+F PSLE+Y E   ++ DP+              + WRALRLL+R+ PHFF        
Sbjct: 492 RNFLPSLETYLENPKEKQDPS--------------FEWRALRLLARQSPHFFTLFNSPSY 537

Query: 466 KNSEFIENMVKRCVKEK 482
           K ++++E++ K+  K+K
Sbjct: 538 KVADYLESVRKKIQKDK 554


>gi|321464075|gb|EFX75086.1| hypothetical protein DAPPUDRAFT_226474 [Daphnia pulex]
          Length = 689

 Score =  455 bits (1170), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 249/562 (44%), Positives = 342/562 (60%), Gaps = 79/562 (14%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           DKK+  +Q+ +   L    + +  I + E  + L ++  +  +C+ TLP+ILL+D FD+ 
Sbjct: 45  DKKAVWDQAFKDRFLS--DSSNFSIPLFEKLLQLSLDCAVALVCSPTLPVILLTDAFDIL 102

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
           +L+KCE++FY VE+ V IWKQ+ FF  CKN LLR+CNDLLRRLSRSQNTVFCGRILLFLA
Sbjct: 103 SLEKCEEIFYMVEMKVAIWKQELFFNQCKNTLLRLCNDLLRRLSRSQNTVFCGRILLFLA 162

Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEE---MDVSSNETEETDTEEVDK-VKIDFNFY 206
           KFFPFSERSGLN++SEFN++N+T FG  +E    D+S+N  +  D  EV++ +  D+N Y
Sbjct: 163 KFFPFSERSGLNVVSEFNLDNLTTFGAKDESYAADISANSEKHKDAMEVEESLSSDYNLY 222

Query: 207 KKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-----------DV----- 250
           KKFWSLQD+FRNP QCY K+ WK F  +   VL+AF + KLD           DV     
Sbjct: 223 KKFWSLQDFFRNPNQCYTKIPWKTFCLHTSDVLSAFSTIKLDERRNRSKWDEADVALNID 282

Query: 251 --QSSLNPSG--DYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGL 306
             ++S+N      YFAKYLT+Q LL+LQL D+NFRRY+LLQFLI+FQY  S V    E  
Sbjct: 283 MNETSMNGGNVQQYFAKYLTSQNLLELQLGDSNFRRYILLQFLIMFQYLDSPV---NETH 339

Query: 307 ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP- 365
           +L  +Q  WV +    VY L+K TPPDG  F Q ++ ILK EE W+ WKN+GCPE  RP 
Sbjct: 340 KLNEEQTTWVSEAIVQVYRLLKDTPPDGASFVQSIRHILKREELWSNWKNDGCPEF-RPS 398

Query: 366 -LTSITDEDK-KDEPDAKKK-----------------------------KTPELTKLWN- 393
            ++S TD D  KD  + +KK                                E+ +LWN 
Sbjct: 399 IISSTTDNDTIKDGTNGEKKTIVVGRPRRLKRSLGDQIREASKRNKCIIGNAEMNRLWNL 458

Query: 394 SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKC 453
             DN+ AC+S +RDF P+LE+YFEEAI+Q+DP  +VE QYK+V+++N+ WRALRLL+R+ 
Sbjct: 459 CPDNMAACRSKDRDFLPTLENYFEEAIEQLDP--SVESQYKRVSEANFGWRALRLLARRS 516

Query: 454 PHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEE 513
            HFF +    + +  +++E  V++  K              D   A    + K E + + 
Sbjct: 517 LHFFTHGNTPITRLPDYLETHVRKLAK--------------DMQSANALPNVKPEPVDDA 562

Query: 514 EKEEDWEAKADPEGDADEVMSV 535
            + E  E +ADP  +  E+  V
Sbjct: 563 GEVEQMEPEADPVSNNSEIEGV 584


>gi|334325879|ref|XP_001367958.2| PREDICTED: THO complex subunit 1-like [Monodelphis domestica]
          Length = 810

 Score =  454 bits (1167), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 239/490 (48%), Positives = 312/490 (63%), Gaps = 38/490 (7%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +KK  L+Q+ R  L + I T     + +   ISL +    + +C ++ P +LL D  D  
Sbjct: 198 EKKCTLDQAFRVILEEEIIT-RASCENVLAIISLAIGGVTEGICTASTPFVLLGDVLDCL 256

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
            LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI LFLA
Sbjct: 257 PLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLA 316

Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEETDTEEVD------------ 197
           + FP SE+SGLN+ S+FN+EN+T F  +E E  +     E+ D                 
Sbjct: 317 RLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHIEDRDEGMEVEEGEMGDDEAPT 376

Query: 198 --KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS-- 253
              + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKLDD Q+S  
Sbjct: 377 SCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRK 436

Query: 254 ----LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLE 307
               L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    VK +     
Sbjct: 437 KLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYV 496

Query: 308 LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL--KRP 365
           L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGCP    +RP
Sbjct: 497 LTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERP 556

Query: 366 LTSITDE--DKKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAERDFTPSLE 413
             S       K+  P+    K P         ELT+LWN   DN+EACKS  R++ P+LE
Sbjct: 557 SDSKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSESREYMPTLE 616

Query: 414 SYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIEN 473
            +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +   E++EN
Sbjct: 617 EFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLEN 676

Query: 474 MVKRCVKEKP 483
           MV +  KE P
Sbjct: 677 MVIKLAKELP 686


>gi|395511634|ref|XP_003760061.1| PREDICTED: THO complex subunit 1 [Sarcophilus harrisii]
          Length = 658

 Score =  453 bits (1166), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 239/490 (48%), Positives = 312/490 (63%), Gaps = 38/490 (7%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +KK  L+Q+ R  L + I T     + +   ISL +    + +C ++ P +LL D  D  
Sbjct: 46  EKKCTLDQAFRVILEEEIIT-RASCENVLAIISLAIGGVTEGICTASTPFVLLGDVLDCL 104

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
            LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI LFLA
Sbjct: 105 PLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLA 164

Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEETDTEEVD------------ 197
           + FP SE+SGLN+ S+FN+EN+T F  +E E  +     E+ D                 
Sbjct: 165 RLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHIEDRDEGMEVEEGEMGDDEAPT 224

Query: 198 --KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS-- 253
              + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKLDD Q+S  
Sbjct: 225 SCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRK 284

Query: 254 ----LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLE 307
               L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    VK +     
Sbjct: 285 KLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYV 344

Query: 308 LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL--KRP 365
           L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGCP    +RP
Sbjct: 345 LTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERP 404

Query: 366 LTSITDE--DKKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAERDFTPSLE 413
             S       K+  P+    K P         ELT+LWN   DN+EACKS  R++ P+LE
Sbjct: 405 SDSKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSESREYMPTLE 464

Query: 414 SYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIEN 473
            +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +   E++EN
Sbjct: 465 EFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLEN 524

Query: 474 MVKRCVKEKP 483
           MV +  KE P
Sbjct: 525 MVIKLAKELP 534


>gi|327269845|ref|XP_003219703.1| PREDICTED: THO complex subunit 1-like [Anolis carolinensis]
          Length = 674

 Score =  451 bits (1160), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 241/534 (45%), Positives = 331/534 (61%), Gaps = 51/534 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLI--KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILL 83
           P  + +KK  L+Q+ R  + + I  K P  ++  I   ISL +    + +C ++ P +LL
Sbjct: 57  PGSENEKKFTLDQAFRVAVEEEIINKAPCENVLAI---ISLVINGVTEGICTASTPFVLL 113

Query: 84  SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
            D  D   LD+C+++F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCG
Sbjct: 114 GDVLDCLPLDQCDKIFTFVEKNVTTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCG 173

Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEFGG-DEEMDVSSNETEET----------- 191
           RI LFLA+ FP SE+SGLN+ S+FN+ENIT F   ++E  +     E+            
Sbjct: 174 RIQLFLARLFPLSEKSGLNLQSQFNLENITVFNTIEQESTLGQKHVEDREEGMEVEEGEM 233

Query: 192 ---DTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
              +      + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F  Y+E VLA FKSYKLD
Sbjct: 234 GDDEAPTSSSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKLD 293

Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
           D Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    VK
Sbjct: 294 DTQASRKKLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVK 353

Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
            +     L  +Q  W++DTT+ VY L+ + PPDGE F+++V+ IL  EE+WN WKNEGCP
Sbjct: 354 FKSSNYVLTDEQSLWIEDTTKLVYQLLSENPPDGERFAKMVEHILNTEENWNSWKNEGCP 413

Query: 361 ELKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACK 402
                +     E K   P  +K+  P                 ELT+LWN   DN+EACK
Sbjct: 414 SF---VKERAGESKAIRPFVRKRPPPEDFLGKGPSKKILMGNEELTRLWNLCPDNMEACK 470

Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
           S  R++ P+LE +FEEAI+Q DP   V++++K VN+SNY WRALRLL+R+ PHFF     
Sbjct: 471 SESREYMPTLEEFFEEAIEQADPENMVDKEFKVVNNSNYGWRALRLLARRSPHFFQPTNQ 530

Query: 463 NVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEE 514
             +   E++ENMV +  KE   PS +I       + + A ++ + +S ++Q ++
Sbjct: 531 QFKSLPEYLENMVIKLAKELPPPSEEIKTGEEDEEDNDALLKENNESPDVQRDK 584


>gi|348500685|ref|XP_003437903.1| PREDICTED: THO complex subunit 1 [Oreochromis niloticus]
          Length = 654

 Score =  450 bits (1158), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 233/499 (46%), Positives = 318/499 (63%), Gaps = 44/499 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
           P  + +KK+ L+Q++R  L  LI    +        I L ++   + +C++T P +LL D
Sbjct: 39  PGNEAEKKTTLDQALRGVLGDLIVEQKLRCDDYLALIYLSIDGVTEGICSATTPFVLLGD 98

Query: 86  TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
             D   LD C+++F +VE NV+ WK  +F+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 99  VLDCLPLDLCDKIFSFVEENVSTWKSNSFYTAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 158

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEET------------- 191
            LFLA+ FP SE+SGLN+ S+FN++NIT F  +E E  +    TEE              
Sbjct: 159 QLFLARLFPLSEKSGLNLQSQFNLDNITVFNKNEQESTLGQKHTEEKEDGMEVEEGEMGE 218

Query: 192 -DTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
            D      + ID+N Y+KFW+LQDYFRNPVQCY+K SW  F  Y++  LA FKSYKLDD+
Sbjct: 219 DDGPAPSSIPIDYNLYRKFWTLQDYFRNPVQCYDKFSWMTFLKYSDETLAVFKSYKLDDM 278

Query: 251 QSSLN-------PSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
           Q+S          SG+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    VK 
Sbjct: 279 QASKRKLEELRASSGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKF 338

Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
           +     L  DQ  W+++TT+ VY L+++ PPDG+ F+ +V+ IL  EE+WN WKNEGCP 
Sbjct: 339 KSSSCVLNDDQAAWIEETTKLVYQLLREIPPDGDKFAAMVEHILNTEENWNAWKNEGCPS 398

Query: 362 LKRPLTSITDEDKKDEPDAKKK-------KTP---------ELTKLWN-SKDNLEACKSA 404
             +  T    +DK   P  K++       K P         ELT+LWN + DN+EACKS 
Sbjct: 399 FVKERTV---DDKPKRPTRKRQAPEDFLGKGPDRKIFMGNDELTRLWNLNHDNMEACKSD 455

Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNV 464
            R+F PSL+ +F EAI+Q DPA  VEE+YK V ++NY WRALRLLSR+ PHFF       
Sbjct: 456 SREFMPSLDEFFAEAIEQADPANMVEEEYKVVRNANYGWRALRLLSRRSPHFFQPTNQKF 515

Query: 465 EKNSEFIENMVKRCVKEKP 483
           +  ++++++MV +  KE P
Sbjct: 516 KSLADYLDSMVSKLAKELP 534


>gi|410923899|ref|XP_003975419.1| PREDICTED: THO complex subunit 1-like [Takifugu rubripes]
          Length = 654

 Score =  449 bits (1156), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 230/500 (46%), Positives = 318/500 (63%), Gaps = 46/500 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
           P  + +KK+ L+Q++R  L   I           + I L ++   + +C++T P +LL D
Sbjct: 39  PGSETEKKTTLDQALRGVLGDQIVEQKSSCDDYLSLIYLSIDSVTEGICSATTPFVLLGD 98

Query: 86  TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
             D   LD+C+++F +VE NV+ WK  +F+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 99  VLDCLPLDQCDKIFSFVEENVSTWKSNSFYTAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 158

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEE-------------- 190
            LFLA+ FP SE+SGLN+ S+FN++NIT F  +E E  +    TEE              
Sbjct: 159 QLFLARLFPLSEKSGLNLQSQFNLDNITVFNKNEQESTLGQKNTEEKEDGMEVEEGEMGE 218

Query: 191 TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
            D      + ID+N Y+KFW+LQDYFRNPVQCY+K SW  F  Y++  LA FKSYKLDD+
Sbjct: 219 EDAPAPCSIPIDYNLYRKFWTLQDYFRNPVQCYDKFSWMTFVKYSDETLAVFKSYKLDDM 278

Query: 251 QSS------LNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
           Q+S      L  SG    YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    VK 
Sbjct: 279 QASKRKLEDLRASGGDHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKF 338

Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
           +  G  L  DQ  W+++T++  Y L+++ PPDG+ F+ +V+ IL  EE+WN WKNEGCP 
Sbjct: 339 KSSGSVLNDDQAAWIEETSKLAYQLLREIPPDGDKFATMVEHILGTEENWNSWKNEGCPS 398

Query: 362 LKRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACKS 403
             +  T    +DK   P A+K++ P                 ELT+LWN + DN++ACKS
Sbjct: 399 FVKERTV---DDKPKRP-ARKRQAPEDFLGKGPDRKIFMGNDELTRLWNLNHDNMDACKS 454

Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPN 463
             R+F PSL+ +F EAI+Q DPA  VE++YK V + NY WRALRLLSR+ PHFF      
Sbjct: 455 ESREFVPSLDDFFSEAIEQADPANMVEDEYKVVRNPNYGWRALRLLSRRSPHFFQPTNQK 514

Query: 464 VEKNSEFIENMVKRCVKEKP 483
            +  ++++++MV +  KE P
Sbjct: 515 FKSLADYLDSMVSKLAKELP 534


>gi|157167759|ref|XP_001655616.1| nuclear matrix protein [Aedes aegypti]
 gi|108882031|gb|EAT46256.1| AAEL002563-PA [Aedes aegypti]
          Length = 685

 Score =  449 bits (1154), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 253/582 (43%), Positives = 347/582 (59%), Gaps = 95/582 (16%)

Query: 3   DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYI 62
           D+    +  ++++ +  +Y   K   D DKK+AL Q+ R  LL    T   DI  IE +I
Sbjct: 15  DTLNRAYKVNEIDQLKAEYDGQKGPTDNDKKAALHQAFRNLLL----TKTDDIPAIEGFI 70

Query: 63  SLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL 122
           +  V  C  +M ++T+P++LL D FD  TLDKCEQ+F YVE NV  WK++ FF +CK+NL
Sbjct: 71  NFAVGACRLEMTSATIPVVLLGDIFDAVTLDKCEQIFTYVENNVATWKEELFFTACKHNL 130

Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGD-EEM 181
           LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFN+EN+TE+G +  EM
Sbjct: 131 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNLENVTEYGVEGSEM 190

Query: 182 DVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAA 241
               +  EE  +   +++KID+N Y KFW+LQD+FRNP QCYNKV  KMF ++A +VL+A
Sbjct: 191 GDQLDGFEENGS--ANQLKIDYNLYCKFWALQDFFRNPNQCYNKVQCKMFAAHAGSVLSA 248

Query: 242 FKSYKLDD-----------------------VQSSLNPSGDYFAKYLTNQKLLDLQLSDT 278
           F S+KL++                        +  +  SG +FAK+LTN KLL LQLSD+
Sbjct: 249 FSSFKLEEPRSASSSSSASANKYGTTPMDIPPEEQIRDSGHFFAKFLTNPKLLSLQLSDS 308

Query: 279 NFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFS 338
           NFRR VL+Q LILFQY  STVK + +   L   Q  W+++T  TVY LI+++PP+G+ FS
Sbjct: 309 NFRRSVLVQLLILFQYLNSTVKFKADNHLLTQVQIDWLRETETTVYKLIEESPPNGKKFS 368

Query: 339 QVVKLILKGEEHWNQWKNEGCPELKRP---------LTSITDEDKKDE---PDAKKKK-- 384
           + V+ +L  EE WN WKNEGC E KRP          T+++ ED       P  + +K  
Sbjct: 369 ESVRHMLSREELWNSWKNEGCKEFKRPDAAAAAATTTTNVSGEDPSPPVRLPQKRPRKPL 428

Query: 385 ----------------TPELTKLWNS-KDNLEACKSAERDFTPSLESYFEEAIQQMDPAA 427
                            PELT+LWN+  DNL+ACK  +R+F PSLE+Y + + ++ DP+ 
Sbjct: 429 GDLIRDSTKQGKFYMGNPELTRLWNTCPDNLQACKGEDRNFLPSLEAYLDGSKEKQDPS- 487

Query: 428 AVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQI 487
                        + WRALRLL+R+ PHFF        K ++++E++ K+  K+K     
Sbjct: 488 -------------FEWRALRLLARQSPHFFTLFNSPSYKVADYLESVRKKIQKDK----- 529

Query: 488 SGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEGDA 529
                          VD K E IQE+   ++   + + EG+ 
Sbjct: 530 ---------------VDIKQEVIQEDVPSQNENEQNEAEGEG 556


>gi|52139086|gb|AAH82664.1| LOC398505 protein, partial [Xenopus laevis]
          Length = 659

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 233/500 (46%), Positives = 310/500 (62%), Gaps = 49/500 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
           P  + +KK AL+Q  R  L + I    +        ISL +      +C +T P +LL D
Sbjct: 44  PGSENEKKLALDQVFRVVLEEEI-INQVSCNNCLAIISLAINGVTDGICTATTPFVLLGD 102

Query: 86  TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
             D   L  C+++F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 103 VLDCLPLAYCDKIFTFVEKNVGTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 162

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE--------------MDVSSNETEET 191
            LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+              M++   E  + 
Sbjct: 163 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNANEQESTLGQKVEEKDDGMEIEEGEMGDD 222

Query: 192 DTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ 251
           D      + ID+N Y+KFWS+QDYFRNPVQCY+K SWK F  +++ VL  FKSYKLDD Q
Sbjct: 223 DAPNSSSIPIDYNLYRKFWSIQDYFRNPVQCYDKNSWKTFLKFSDEVLTVFKSYKLDDTQ 282

Query: 252 SS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRG 303
           +S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++L+Q+LILFQY    VK + 
Sbjct: 283 ASRKKIEELKSGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILVQYLILFQYLKGQVKFKS 342

Query: 304 EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL- 362
               L  +Q  W++DTT+ V+ L+ + PPDGE FS+ V+ IL  EE+WN WKNEGCP   
Sbjct: 343 SNYILLDEQSLWIEDTTKLVHQLLSEIPPDGEKFSKTVEHILNTEENWNAWKNEGCPSFV 402

Query: 363 -KRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACKS 403
            +RP       + K +  A+K+  P                 ELT+LWN   DN+EAC+S
Sbjct: 403 KERPT------ETKPQKMARKRPVPEDFLGKGSNKKMSLGNDELTRLWNLCPDNMEACRS 456

Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPN 463
             R+F PSLE +FEEAI+Q DP   VE +YK VN++NY WRALRLL+R+ PHFF      
Sbjct: 457 ESREFMPSLEQFFEEAIEQADPENMVENEYKVVNNANYGWRALRLLARRSPHFFQPTNQQ 516

Query: 464 VEKNSEFIENMVKRCVKEKP 483
            +   E++ENMV +  KE P
Sbjct: 517 FKSLPEYLENMVIKLAKELP 536


>gi|195343819|ref|XP_002038488.1| GM10588 [Drosophila sechellia]
 gi|194133509|gb|EDW55025.1| GM10588 [Drosophila sechellia]
          Length = 701

 Score =  447 bits (1150), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 243/573 (42%), Positives = 350/573 (61%), Gaps = 71/573 (12%)

Query: 4   SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
           + +L     ++ ++  +Y  +    ++DK+  ++ + R  L+K +   D D+  I   + 
Sbjct: 28  ALELAITDGKVELLVQEYNRFPANTEHDKRLPMDHAFRVLLMKRL---DEDVSRIGELVR 84

Query: 64  LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
           L VE    ++ ++T+P++LL DTFD+ TLDKC+++F +VE  V +WK++ FF SCKNN+L
Sbjct: 85  LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCQKIFQFVEEMVEVWKEEIFFASCKNNIL 144

Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
           RMCNDLLRRLSR+QNTVFCGRI LFL+KFFPFSERSGLNI+SEFN++N TE+G D +   
Sbjct: 145 RMCNDLLRRLSRTQNTVFCGRIQLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHD 204

Query: 184 SSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
            S+  E  DT E   +KID++ Y KFWSLQD+FRNP QCYNK  WKMF  +AE +L +F 
Sbjct: 205 ESDNKELEDTAEDIPIKIDYDLYCKFWSLQDFFRNPNQCYNKPQWKMFQMHAENILQSFS 264

Query: 244 SYKLDDVQSSL--NPSGD------------------------YFAKYLTNQKLLDLQLSD 277
           S+KL+DV+ S   N SG                         +FAK+LTN KLL LQLSD
Sbjct: 265 SFKLEDVRHSSNDNASGSDQAMDVDEEAVDATLVTSVIKENHFFAKFLTNPKLLALQLSD 324

Query: 278 TNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
            NFRR VL+QFLILFQY   +VK + +   L +DQ  ++K+T   VY L+++TPP G+ F
Sbjct: 325 ANFRRAVLVQFLILFQYLQVSVKFKSDTQTLTADQADFIKETESRVYKLLEETPPYGKRF 384

Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRP-LTSITDEDKKDEPDAKKKK------------ 384
           S+ V  +L  EE WN WKNEGC E K+P   + ++ED K  P+ + ++            
Sbjct: 385 SRTVCHMLAREEMWNNWKNEGCKEFKKPEEPTPSEEDSKPTPNKRPRRPLGDALRDASRS 444

Query: 385 ------TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
                    LT+LWN S DNL+ACKS +R+F P LE+Y E   +++DPA           
Sbjct: 445 GKFYLGNDNLTRLWNYSPDNLQACKSEQRNFLPLLETYLETPHEKVDPA----------- 493

Query: 438 DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE---KPSSQISGNGNGV 494
              + WRALRLL+R+ PHFF + +    K S+++E + KR +++   KP++ +S N +  
Sbjct: 494 ---FEWRALRLLARQTPHFFTSLSQPSSKISDYLEQVRKRLIRDKEPKPAALLSNNSSEN 550

Query: 495 DQDPAEVEVDTKSEE-----IQEEEKEEDWEAK 522
               +   V T+ E+     +Q+ E E+D E +
Sbjct: 551 SFGLSANNVHTEGEQDATVALQDGEPEQDLEVE 583


>gi|147905682|ref|NP_001082490.1| THO complex 1 protein [Xenopus laevis]
 gi|72679742|gb|AAI00212.1| LOC398505 protein [Xenopus laevis]
          Length = 654

 Score =  447 bits (1149), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 233/500 (46%), Positives = 310/500 (62%), Gaps = 49/500 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
           P  + +KK AL+Q  R  L + I    +        ISL +      +C +T P +LL D
Sbjct: 39  PGSENEKKLALDQVFRVVLEEEIIN-QVSCNNCLAIISLAINGVTDGICTATTPFVLLGD 97

Query: 86  TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
             D   L  C+++F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 98  VLDCLPLAYCDKIFTFVEKNVGTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 157

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE--------------MDVSSNETEET 191
            LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+              M++   E  + 
Sbjct: 158 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNANEQESTLGQKVEEKDDGMEIEEGEMGDD 217

Query: 192 DTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ 251
           D      + ID+N Y+KFWS+QDYFRNPVQCY+K SWK F  +++ VL  FKSYKLDD Q
Sbjct: 218 DAPNSSSIPIDYNLYRKFWSIQDYFRNPVQCYDKNSWKTFLKFSDEVLTVFKSYKLDDTQ 277

Query: 252 SS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRG 303
           +S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++L+Q+LILFQY    VK + 
Sbjct: 278 ASRKKIEELKSGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILVQYLILFQYLKGQVKFKS 337

Query: 304 EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL- 362
               L  +Q  W++DTT+ V+ L+ + PPDGE FS+ V+ IL  EE+WN WKNEGCP   
Sbjct: 338 SNYILLDEQSLWIEDTTKLVHQLLSEIPPDGEKFSKTVEHILNTEENWNAWKNEGCPSFV 397

Query: 363 -KRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACKS 403
            +RP       + K +  A+K+  P                 ELT+LWN   DN+EAC+S
Sbjct: 398 KERPT------ETKPQKMARKRPVPEDFLGKGSNKKMSLGNDELTRLWNLCPDNMEACRS 451

Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPN 463
             R+F PSLE +FEEAI+Q DP   VE +YK VN++NY WRALRLL+R+ PHFF      
Sbjct: 452 ESREFMPSLEQFFEEAIEQADPENMVENEYKVVNNANYGWRALRLLARRSPHFFQPTNQQ 511

Query: 464 VEKNSEFIENMVKRCVKEKP 483
            +   E++ENMV +  KE P
Sbjct: 512 FKSLPEYLENMVIKLAKELP 531


>gi|28278733|gb|AAH44699.1| LOC398505 protein, partial [Xenopus laevis]
          Length = 659

 Score =  447 bits (1149), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 233/501 (46%), Positives = 310/501 (61%), Gaps = 50/501 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
           P  + +KK AL+Q  R  L + I    +        ISL +      +C +T P +LL D
Sbjct: 43  PGSENEKKLALDQVFRVVLEEEI-INQVSCNNCLAIISLAINGVTDGICTATTPFVLLGD 101

Query: 86  TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
             D   L  C+++F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 102 VLDCLPLAYCDKIFTFVEKNVGTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 161

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNETEE 190
            LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               M++   E  +
Sbjct: 162 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNANEQESTLGQKQVEEKDDGMEIEEGEMGD 221

Query: 191 TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
            D      + ID+N Y+KFWS+QDYFRNPVQCY+K SWK F  +++ VL  FKSYKLDD 
Sbjct: 222 DDAPNSSSIPIDYNLYRKFWSIQDYFRNPVQCYDKNSWKTFLKFSDEVLTVFKSYKLDDT 281

Query: 251 QSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSR 302
           Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++L+Q+LILFQY    VK +
Sbjct: 282 QASRKKIEELKSGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILVQYLILFQYLKGQVKFK 341

Query: 303 GEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL 362
                L  +Q  W++DTT+ V+ L+ + PPDGE FS+ V+ IL  EE+WN WKNEGCP  
Sbjct: 342 SSNYILLDEQSLWIEDTTKLVHQLLSEIPPDGEKFSKTVEHILNTEENWNAWKNEGCPSF 401

Query: 363 --KRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEACK 402
             +RP       + K +  A+K+  P                 ELT+LWN   DN+EAC+
Sbjct: 402 VKERPT------ETKPQKMARKRPVPEDFLGKGSNKKMSLGNDELTRLWNLCPDNMEACR 455

Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
           S  R+F PSLE +FEEAI+Q DP   VE +YK VN++NY WRALRLL+R+ PHFF     
Sbjct: 456 SESREFMPSLEQFFEEAIEQADPENMVENEYKVVNNANYGWRALRLLARRSPHFFQPTNQ 515

Query: 463 NVEKNSEFIENMVKRCVKEKP 483
             +   E++ENMV +  KE P
Sbjct: 516 QFKSLPEYLENMVIKLAKELP 536


>gi|24644436|ref|NP_649594.1| Hpr1 [Drosophila melanogaster]
 gi|7296666|gb|AAF51946.1| Hpr1 [Drosophila melanogaster]
 gi|21483450|gb|AAM52700.1| LD43883p [Drosophila melanogaster]
 gi|46800533|emb|CAD89220.1| Hpr1 protein [Drosophila melanogaster]
          Length = 701

 Score =  446 bits (1148), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 242/574 (42%), Positives = 350/574 (60%), Gaps = 73/574 (12%)

Query: 4   SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
           + +L     ++ ++  +Y  +    ++DK+  ++ + R  L+K +   D D+  I   + 
Sbjct: 28  ALELAITDGKVELLVKEYNRFPANTEHDKRLPMDHAFRVLLMKRL---DEDVSRIGELVR 84

Query: 64  LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
           L VE    ++ ++T+P++LL DTFD+ TLDKC+++F +VE  V +WK++ FF SCKNN+L
Sbjct: 85  LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCQKIFQFVEDMVEVWKEEIFFSSCKNNIL 144

Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
           RMCNDLLRRLSR+QNTVFCGRI LFL+KFFPFSERSGLNI+SEFN++N TE+G D +   
Sbjct: 145 RMCNDLLRRLSRTQNTVFCGRIQLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHD 204

Query: 184 SSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
            S+  E  DT E   +KID++ Y KFWSLQD+FRNP QCYNK  WKMF  +A+ +L +F 
Sbjct: 205 ESDNKELEDTAEDIPLKIDYDLYCKFWSLQDFFRNPNQCYNKPQWKMFQMHADNILQSFS 264

Query: 244 SYKLDDVQSSLN--------------------------PSGDYFAKYLTNQKLLDLQLSD 277
           S+KL+DV+ S N                           +  +FAK+LTN KLL LQLSD
Sbjct: 265 SFKLEDVRQSSNENASGTDQAMDVDEEAVDTTAVSSVIKANHFFAKFLTNPKLLALQLSD 324

Query: 278 TNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
            NFRR VL+QFLILFQY   +VK + +   L +DQ  ++K+T   VY L+++TPP G+ F
Sbjct: 325 ANFRRAVLVQFLILFQYLQVSVKFKSDTQTLTADQADFIKETESRVYKLLEETPPYGKRF 384

Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRP-LTSITDEDKKDEPDAKKKK------------ 384
           S+ V  +L  EE WN WKNEGC E K+P   ++++ED K  P+ + ++            
Sbjct: 385 SRTVYHMLAREEMWNNWKNEGCKEFKKPEEPTLSEEDSKPTPNKRPRRPLGDALRDASRS 444

Query: 385 ------TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
                    LT+LWN S DNL+ACKS +R+F P LE+Y E   +++DPA           
Sbjct: 445 GKFYLGNDNLTRLWNYSPDNLQACKSEQRNFLPLLETYLETPHEKVDPA----------- 493

Query: 438 DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE---KPSSQISGN---- 490
              + WRALRLL+R+ PHFF + +    K S+++E + KR +++   KP++ +S N    
Sbjct: 494 ---FEWRALRLLARQTPHFFTSLSQPSSKISDYLEQVRKRLIRDKEPKPAALLSNNSSEN 550

Query: 491 --GNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAK 522
             G  V+    E E D  +  +Q+ E E+D E +
Sbjct: 551 NFGLSVNNVHTEGEQDA-TVALQDGEPEQDLEVE 583


>gi|195568581|ref|XP_002102292.1| GD19579 [Drosophila simulans]
 gi|194198219|gb|EDX11795.1| GD19579 [Drosophila simulans]
          Length = 701

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 243/573 (42%), Positives = 349/573 (60%), Gaps = 71/573 (12%)

Query: 4   SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
           + +L     ++ ++  +Y  +    ++DK+  ++ + R  L+K +   D D+  I   + 
Sbjct: 28  ALELAITDGKVELLVQEYNRFPANTEHDKRLPMDHAFRVLLMKRL---DEDVSRIGELVR 84

Query: 64  LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
           L VE    ++ ++T+P++LL DTFD+ TLDKC+++F +VE  V +WK++ FF SCKNN+L
Sbjct: 85  LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCQKIFQFVEEMVEVWKEEIFFASCKNNIL 144

Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
           RMCNDLLRRLSR+QNTVFCGRI LFL+KFFPFSERSGLNI+SEFN++N TE+G D +   
Sbjct: 145 RMCNDLLRRLSRTQNTVFCGRIQLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHD 204

Query: 184 SSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
            S+  E  DT E   +KID++ Y KFWSLQD+FRNP QCYNK  WKMF  +AE +L +F 
Sbjct: 205 ESDNKELEDTAEDIPLKIDYDLYCKFWSLQDFFRNPNQCYNKPQWKMFQMHAENILQSFS 264

Query: 244 SYKLDDVQSSL--NPSGD------------------------YFAKYLTNQKLLDLQLSD 277
           S+KL+DV+ S   N SG                         +FAK+LTN KLL LQLSD
Sbjct: 265 SFKLEDVRHSSNDNASGSDQAMDVDEEAVDAAAVTSVIKANHFFAKFLTNPKLLALQLSD 324

Query: 278 TNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
            NFRR VL+QFLILFQY   +VK + +   L  DQ  ++K+T   VY L+++TPP G+ F
Sbjct: 325 ANFRRAVLVQFLILFQYLQVSVKFKSDTQTLTGDQADFIKETESRVYKLLEETPPYGKRF 384

Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRP-LTSITDEDKKDEPDAKKKK------------ 384
           S+ V  +L  EE WN WKNEGC E K+P   + ++ED K  P+ + ++            
Sbjct: 385 SRTVCHMLAREEMWNNWKNEGCKEFKKPEEPTPSEEDSKPTPNKRPRRPLGDALRDASRS 444

Query: 385 ------TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
                    LT+LWN S DNL+ACKS +R+F P LE+Y E   +++DPA           
Sbjct: 445 GKFYLGNDNLTRLWNYSPDNLQACKSEQRNFLPLLETYLETPHEKVDPA----------- 493

Query: 438 DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE---KPSSQISGNGNGV 494
              + WRALRLL+R+ PHFF + +    K S+++E + KR +++   KP++ +S N +  
Sbjct: 494 ---FEWRALRLLARQTPHFFTSLSQPSSKISDYLEQVRKRLIRDKEPKPAALLSNNSSEN 550

Query: 495 DQDPAEVEVDTKSEE-----IQEEEKEEDWEAK 522
               +   V T+ E+     +Q+ E E+D E +
Sbjct: 551 SFGLSANNVHTEGEQDATVALQDGEPEQDLEVE 583


>gi|170035468|ref|XP_001845591.1| THO complex subunit 1 [Culex quinquefasciatus]
 gi|167877503|gb|EDS40886.1| THO complex subunit 1 [Culex quinquefasciatus]
          Length = 687

 Score =  446 bits (1146), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 248/543 (45%), Positives = 328/543 (60%), Gaps = 93/543 (17%)

Query: 14  LNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDM 73
           + ++  +Y     + D DK+SAL Q+ R  LL   KT DI    IE +IS  V  C  +M
Sbjct: 16  IELLKKQYDELTSSTDIDKRSALHQAFRDLLLA--KTDDI--PAIETFISFAVAACRSEM 71

Query: 74  CNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRL 133
              T+P++LL D FD  TLDKCE +F YVE NV  WK+  FF +CK+NLLRMCNDLLRRL
Sbjct: 72  TAPTIPVVLLGDIFDAVTLDKCEAIFTYVENNVTTWKEDFFFTACKHNLLRMCNDLLRRL 131

Query: 134 SRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG--GDEEMDVSSNETEET 191
           SRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFN+ENITE+G  G E  D  S   E+ 
Sbjct: 132 SRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNLENITEYGVEGSEMGDQLSEGQEDG 191

Query: 192 D-------TEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKS 244
           D       T++  ++KID+N Y KFW+LQD+FRNP QCYNKV WKMF ++A +V+AAF S
Sbjct: 192 DGVQASAGTQQQPQLKIDYNLYCKFWALQDFFRNPNQCYNKVQWKMFAAHAGSVMAAFSS 251

Query: 245 YKLDDVQSS---------------------LNPSGDYFAKYLTNQKLLDLQLSDTNFRRY 283
           +KL++ +S+                     +  +  +FAK+LTN KLL LQLSD+NFRR 
Sbjct: 252 FKLEEPRSNSKSASKNKSSPTPMDIPPEEEVRETAHFFAKFLTNPKLLSLQLSDSNFRRS 311

Query: 284 VLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKL 343
           VL+Q LILFQY  STVK + +   L   Q  W+K+T  TVY LI+++PP+G  F++ V+ 
Sbjct: 312 VLVQLLILFQYLNSTVKFKSDSHCLTVAQVDWLKETETTVYKLIEESPPNGRRFAESVRH 371

Query: 344 ILKGEEHWNQWKNEGCPELKRP---------------LTSITDEDKKDEPDAKKKKT--- 385
           +L  EE WN WKNEGC E KRP                +    ED+   P A   +    
Sbjct: 372 MLSREELWNSWKNEGCKEFKRPDAAAVAATAGAAAAATSGTATEDQPATPAAATSRVASK 431

Query: 386 -----------------------PELTKLWNS-KDNLEACKSAERDFTPSLESYFEEAIQ 421
                                  PELT+LWN+  DNL+ACK  +R+F PSLE+Y + + +
Sbjct: 432 RPRRPLGDLIRESTKQGKFFMGNPELTRLWNTCPDNLQACKGDDRNFLPSLEAYLDGSKE 491

Query: 422 QMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF--LNATPNVEKNSEFIENMVKRCV 479
           + DP+              + WRALRLL+R+ PHFF   N+T +  K ++++E++ K+  
Sbjct: 492 RQDPS--------------FEWRALRLLARQSPHFFTLFNSTSSY-KVADYLESVRKKIQ 536

Query: 480 KEK 482
           K+K
Sbjct: 537 KDK 539


>gi|195145597|ref|XP_002013778.1| GL23216 [Drosophila persimilis]
 gi|194102721|gb|EDW24764.1| GL23216 [Drosophila persimilis]
          Length = 718

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 245/562 (43%), Positives = 342/562 (60%), Gaps = 74/562 (13%)

Query: 20  KYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLP 79
           +Y  Y    ++DK+  ++ + R  L+K +K    D+K I + + L VE    ++ ++T+P
Sbjct: 45  EYNSYTANSEHDKRLPMDHAFRVLLMKRLKD---DVKRIGDLVRLSVEATRAEIVSNTIP 101

Query: 80  IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNT 139
           ++LL DTFD+ TLD+C+++F +VE  V++WK++ FF SCKNN+LRMCNDLLRRLSR+QNT
Sbjct: 102 VVLLIDTFDVVTLDRCQEIFKFVEELVDVWKEEIFFASCKNNILRMCNDLLRRLSRTQNT 161

Query: 140 VFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKV 199
           VFCGRILLFL+KFFPFSERSGLNI+SEFN++N+TE+G D      ++  E  DT E   +
Sbjct: 162 VFCGRILLFLSKFFPFSERSGLNIVSEFNLDNLTEYGLDSRDHDDTDNKELEDTAEDIPL 221

Query: 200 KIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ-------- 251
           KID++ Y KFWSLQD+FRNP QCY K  WKMF  +A+ +L +F S+KL+DV+        
Sbjct: 222 KIDYDLYCKFWSLQDFFRNPNQCYGKAQWKMFHMHADNILQSFASFKLEDVRPSNSENGS 281

Query: 252 ------------------SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQ 293
                             S+++ +  +FAK+LTN KLL LQLSD+NFRR VL+QFLILFQ
Sbjct: 282 EADQAMDVDDDIVDAAVASTVSKANHFFAKFLTNPKLLALQLSDSNFRRAVLVQFLILFQ 341

Query: 294 YFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQ 353
           Y   +VK + +   L ++Q  ++KDT   VY L+++TPP G  F++ V  +L  EE WN 
Sbjct: 342 YLQVSVKFKIDTYTLTTEQADFIKDTEPRVYKLLEETPPYGRRFARTVHHMLVREEMWNN 401

Query: 354 WKNEGCPELKRPLTSIT-DEDKKDEP-------------DAKKK-----KTPELTKLWN- 393
           WKNEGC E K+P    + DED K  P             DA +          LT+LWN 
Sbjct: 402 WKNEGCKEFKKPEDPPSNDEDNKPPPAKRPKRPLGDSLRDAARNGKFFLGNENLTRLWNY 461

Query: 394 SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKC 453
           S DNL+ACKS +R+F P LE+Y E    ++DPA              + WRALRLL+R+ 
Sbjct: 462 SPDNLQACKSEQRNFLPLLETYLETPHDKIDPA--------------FEWRALRLLARQT 507

Query: 454 PHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQIS---------GNGNGVDQDPAEVEVD 504
           PHFF + +    K S+++E + KR V+++   Q +          NG      PAE E D
Sbjct: 508 PHFFTSPSQPSSKISDYLEQVRKRLVRDREPKQSAPVSSNASEQNNGLAAKSTPAENEQD 567

Query: 505 TKSEEIQEEEKEEDWEAK-ADP 525
                +QE E E+D E +  DP
Sbjct: 568 A-GMALQEAEPEQDLEVEDTDP 588


>gi|198451954|ref|XP_001358566.2| GA15194 [Drosophila pseudoobscura pseudoobscura]
 gi|198131721|gb|EAL27707.2| GA15194 [Drosophila pseudoobscura pseudoobscura]
          Length = 718

 Score =  444 bits (1142), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 244/562 (43%), Positives = 342/562 (60%), Gaps = 74/562 (13%)

Query: 20  KYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLP 79
           +Y  Y    ++DK+  ++ + R  L+K +K    D+K I + + L VE    ++ ++T+P
Sbjct: 45  EYNSYTANSEHDKRLPMDHAFRVLLMKRLKD---DVKRIGDLVRLSVEATRAEIVSNTIP 101

Query: 80  IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNT 139
           ++LL DTFD+ TLD+C+++F +VE  V++WK++ FF SCKNN+LRMCNDLLRRLSR+QNT
Sbjct: 102 VVLLIDTFDVVTLDRCQEIFKFVEELVDVWKEEIFFASCKNNILRMCNDLLRRLSRTQNT 161

Query: 140 VFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKV 199
           VFCGRILLFL+KFFPFSERSGLNI+SEFN++N+TE+G D      ++  E  DT E   +
Sbjct: 162 VFCGRILLFLSKFFPFSERSGLNIVSEFNLDNLTEYGLDSRDHDDTDNKELEDTAEDIPL 221

Query: 200 KIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ-------- 251
           KID++ Y KFWSLQD+FRNP QCY K  WKMF  +A+ +L +F S+KL+DV+        
Sbjct: 222 KIDYDLYCKFWSLQDFFRNPNQCYGKAQWKMFHMHADNILQSFASFKLEDVRPSNSENGS 281

Query: 252 ------------------SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQ 293
                             S+++ +  +FAK+LTN KLL LQLSD+NFRR VL+QFLILFQ
Sbjct: 282 EADQAMDVDDDIVDAAVASTVSKANHFFAKFLTNPKLLALQLSDSNFRRAVLVQFLILFQ 341

Query: 294 YFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQ 353
           Y   +VK + +   L ++Q  ++KDT   VY L+++TPP G  F++ V  +L  EE WN 
Sbjct: 342 YLQVSVKFKIDTYTLTTEQADFIKDTEPRVYKLLEETPPYGRRFARTVHHMLVREEMWNN 401

Query: 354 WKNEGCPELKRPLTSIT-DEDKKDEP-------------DAKKK-----KTPELTKLWN- 393
           WKNEGC E K+P    + DED K  P             DA +          LT+LWN 
Sbjct: 402 WKNEGCKEFKKPEDPPSNDEDNKPPPAKRPKRPLGDSLRDAARNGKFFLGNENLTRLWNY 461

Query: 394 SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKC 453
           S DNL+ACKS +R+F P LE+Y E    ++DPA              + WRALRLL+R+ 
Sbjct: 462 SPDNLQACKSEQRNFLPLLETYLESPHDKIDPA--------------FEWRALRLLARQT 507

Query: 454 PHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQIS---------GNGNGVDQDPAEVEVD 504
           PHFF + +    K S+++E + KR ++++   Q +          NG      PAE E D
Sbjct: 508 PHFFTSPSQPSSKISDYLEQVRKRLIRDREPKQSAPVSSNASEQNNGLAAKSTPAENEQD 567

Query: 505 TKSEEIQEEEKEEDWEAK-ADP 525
                +QE E E+D E +  DP
Sbjct: 568 A-GMALQEAEPEQDLEVEDTDP 588


>gi|405964743|gb|EKC30192.1| THO complex subunit 1 [Crassostrea gigas]
          Length = 692

 Score =  444 bits (1142), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 245/552 (44%), Positives = 350/552 (63%), Gaps = 49/552 (8%)

Query: 23  LYKPTC--DYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPI 80
           +++  C  + +KK A++Q++R  +  ++   D D +  +  I   +E+  +D+C+  +P 
Sbjct: 67  IFQSVCGSEAEKKLAVDQALRDSMKDVVMRAD-DCEEYKKVILHSIEMAKEDLCSVIMPF 125

Query: 81  ILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTV 140
           +LLSD FD+ TL +CE +FY VE  + +WK   F+ S KN LLRMCNDLLRRLS+SQNTV
Sbjct: 126 VLLSDIFDLITLSQCEIVFYLVEEKLPVWKTPLFYDSGKNYLLRMCNDLLRRLSKSQNTV 185

Query: 141 FCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE------MDVSSNETEETDTE 194
           FCGRI LFL++ FP SE+S LN++S+FN+EN+T F    E       D+S +  E  + E
Sbjct: 186 FCGRIQLFLSRLFPLSEKSALNLMSQFNLENVTVFSTKSEEVKPKSHDISDDVMEVEEGE 245

Query: 195 EVD---KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ 251
             D      +D+N Y++FW+LQDYFR P QCY+KV WK F   A+ VL AF S KLDD++
Sbjct: 246 MEDLTGSTPVDYNLYRRFWALQDYFRKPTQCYDKVPWKAFQQNADVVLNAFASAKLDDMK 305

Query: 252 SS-----LNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRG 303
           SS     L    D   +FAKYLT++KLLDLQL+D+NFRRYVL+QFLILFQY  S VK + 
Sbjct: 306 SSRRKLDLPRPADTRAFFAKYLTSEKLLDLQLNDSNFRRYVLVQFLILFQYLNSQVKFKS 365

Query: 304 EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELK 363
               L  +Q +W+K   + VY LI++TPPDGE F ++V+ IL  EEHWN+WKN+GCP+  
Sbjct: 366 ASQNLSDEQNQWIKTIQDKVYQLIRETPPDGEQFCKIVQHILAREEHWNKWKNDGCPDFA 425

Query: 364 RPLTSITDEDKKDEPDAKKK----------------KTPELTKLWN-SKDNLEACKSAER 406
           R   +   ED K +  AK++                 + ELT+LWN + DN+EACK+ +R
Sbjct: 426 RKPVA---EDTKIKSRAKRRWVGDDLQAHGGKIIKMGSAELTRLWNINPDNMEACKAEKR 482

Query: 407 DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEK 466
            F PSLE +F EA++Q DP A VEEQYK VN+  + W++LRLL+R+ PHFF         
Sbjct: 483 VFLPSLEDFFAEAMEQADPEAKVEEQYKMVNNPVFQWKSLRLLARQSPHFFGQHHTPAMP 542

Query: 467 NSEFIENMVKRCVKEKPSSQISGN----GNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAK 522
             +++E M+ +  +E PS+  +G+    GN  ++   EV  D   E I+E ++ ED E K
Sbjct: 543 LPQYLELMLNKIAQELPSTGSNGDVAPEGNN-EEMKTEVGDD---EAIKESQEVEDEELK 598

Query: 523 ADPEGDADEVMS 534
           A  E +A++++S
Sbjct: 599 AQAE-EAEDILS 609


>gi|195449329|ref|XP_002072026.1| GK22628 [Drosophila willistoni]
 gi|194168111|gb|EDW83012.1| GK22628 [Drosophila willistoni]
          Length = 710

 Score =  444 bits (1142), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 239/558 (42%), Positives = 344/558 (61%), Gaps = 73/558 (13%)

Query: 14  LNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDM 73
           + ++ + Y  +    ++DK+S ++Q+ R+ L+K +     D++ I   + L V+    ++
Sbjct: 41  VKILVETYNNFNGNSEHDKRSPMDQAFREQLMKNLNE---DVEKIGALVKLSVDATRAEL 97

Query: 74  CNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRL 133
            +ST+P++LL DTFD+ TLDKC++LF +VE  V +WK+   F SCKNN+LRMCNDLLRRL
Sbjct: 98  VSSTIPVVLLGDTFDVVTLDKCQELFQFVEQMVEVWKEDILFGSCKNNILRMCNDLLRRL 157

Query: 134 SRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDT 193
           SR+QNTVFCGRILLFL+KFFPFSERSGLNI+SEFN++N TE+G D + D   +  E  D 
Sbjct: 158 SRTQNTVFCGRILLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDGK-DNEDDSKELEDI 216

Query: 194 EEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ-- 251
            E   +KID++ Y KFWSLQD+FRNP QCY+K  WKMF  +AET+L +F S+KL+D++  
Sbjct: 217 AEDIPLKIDYDLYCKFWSLQDFFRNPNQCYSKAQWKMFQMHAETILQSFVSFKLEDIRPG 276

Query: 252 ----------------------SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
                                 S++N + ++FAK+LTN KLL LQLSD+NFRR VL+QFL
Sbjct: 277 NDSGDGPQPMEVDEAAVDAAAASTVNRANNFFAKFLTNPKLLALQLSDSNFRRAVLVQFL 336

Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
           ILFQY   +VK + E   L   Q  ++K+T   V+ L+++TPP G  F++ V+ +L  EE
Sbjct: 337 ILFQYLQVSVKFKMESYILNPAQADFIKETEPRVFKLLEETPPYGRRFARTVQHMLVREE 396

Query: 350 HWNQWKNEGCPELKRPLT-SITDEDKKDEPDAKKKKT------------------PELTK 390
            WN WKNEGC E K+P    + +ED K  P  + K+                     LT+
Sbjct: 397 MWNNWKNEGCKEFKKPENPPVIEEDIKPPPAKRPKRALGDALRDATRNGKFFLGNENLTR 456

Query: 391 LWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLL 449
           LWN S DNL+ACKS +R+F P LE+Y E    ++DP               + WRALRLL
Sbjct: 457 LWNYSPDNLQACKSEQRNFLPQLETYLETPHGKVDPP--------------FEWRALRLL 502

Query: 450 SRKCPHFFLNATPNVEKNSEFIENMVKRCVKEK-PSSQISGNGNGVDQD--------PAE 500
           +R+ PHFF + +    K S+++E + KR  ++K P   ++G  N +  +        PAE
Sbjct: 503 ARQTPHFFTSVSQPSSKISDYLEQVRKRLNRDKEPKPTLTGATNALGGNTAVVATTPPAE 562

Query: 501 VEVDTKSEEIQEEEKEED 518
           +E +T +  +Q+ E E+D
Sbjct: 563 IEQETVA--LQDPEPEQD 578


>gi|301609229|ref|XP_002934170.1| PREDICTED: THO complex subunit 1 [Xenopus (Silurana) tropicalis]
          Length = 671

 Score =  443 bits (1140), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 234/503 (46%), Positives = 312/503 (62%), Gaps = 55/503 (10%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIEN---YISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q  R  L + I    I+    EN    ISL +      +C +T P +L
Sbjct: 54  PGSENEKKPTLDQVFRVVLEEEI----INQVSCENCLAIISLAINGVTDGICTATTPFVL 109

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   L  C+++F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 110 LGDVLDCLPLAYCDKIFTFVEKNVGTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 169

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE--------------MDVSSNET 188
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+              M++   E 
Sbjct: 170 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNANEQESTLGQKVEEKDDGMEIEEGEM 229

Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
            + D      + ID+N Y+KFW+LQD+FRNPVQCY+K SWK F  +++ VL  FKSYKLD
Sbjct: 230 GDDDAPNSSSIPIDYNLYRKFWTLQDFFRNPVQCYDKNSWKTFLKFSDEVLTVFKSYKLD 289

Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
           D Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++L+Q+LILFQY    VK
Sbjct: 290 DTQASRKKIEELKSGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILVQYLILFQYLKGQVK 349

Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
            +     L  +Q  W++DTT+ V+ L+ +TPP GE FS+ V+ IL  EE+WN WKNEGCP
Sbjct: 350 FKSSNYILLDEQSLWIEDTTKLVHQLLSETPPVGEKFSKTVEHILNTEENWNAWKNEGCP 409

Query: 361 EL--KRPLTSITDEDKKDEPDAKKKKTP-----------------ELTKLWN-SKDNLEA 400
               +RP       + K +  AKK+  P                 ELT+LWN   DN+EA
Sbjct: 410 SFVKERPT------ETKPQKMAKKRPLPEEFLGKGSNKKMSMGNDELTRLWNLCPDNMEA 463

Query: 401 CKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNA 460
           C+S  R+F PSLE +FEEAI+Q DP   V+ +YK VN++NY WRALRLL+R+ PHFF   
Sbjct: 464 CRSESREFMPSLEQFFEEAIEQADPENMVDNEYKVVNNANYGWRALRLLARRSPHFFQPT 523

Query: 461 TPNVEKNSEFIENMVKRCVKEKP 483
               +   E++ENMV +  KE P
Sbjct: 524 NQQFKSLPEYLENMVIKLAKELP 546


>gi|194898899|ref|XP_001979000.1| GG10810 [Drosophila erecta]
 gi|190650703|gb|EDV47958.1| GG10810 [Drosophila erecta]
          Length = 713

 Score =  442 bits (1136), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 239/573 (41%), Positives = 346/573 (60%), Gaps = 71/573 (12%)

Query: 4   SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
           + +L      + ++  +Y  +    D+DK+  ++ + R  L+K +     D+  I   + 
Sbjct: 28  ALELAIADGNVELLVKEYNSFPGNTDHDKRLPMDHAFRVLLMKRLYD---DVNRIGELVR 84

Query: 64  LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
           L VE    ++ ++T+P++LL DTFD+ TLDKC ++F +VE  V +WK++ FF SCKNN+L
Sbjct: 85  LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCLKIFQFVEEMVEVWKEEIFFASCKNNIL 144

Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
           RMCNDLLRRLSR+QNTVFCGRI LFL+KFFPFSERSGLNI+SEFN++N TE+G D +   
Sbjct: 145 RMCNDLLRRLSRTQNTVFCGRIQLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHD 204

Query: 184 SSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
            S+  E  DT E   +KID++ Y KFWSLQD+FRNP QCYNK  WKMF  +AE +L +F 
Sbjct: 205 ESDNKELEDTAEDIPLKIDYDLYCKFWSLQDFFRNPNQCYNKAQWKMFQMHAENILQSFS 264

Query: 244 SYKLDDVQ--------------------------SSLNPSGDYFAKYLTNQKLLDLQLSD 277
           S+KL+DV+                          +S+  +  +FAK+LTN KLL LQLSD
Sbjct: 265 SFKLEDVRQSSSDNASESDNAMDVDEEAVDATAVTSVIKANHFFAKFLTNPKLLALQLSD 324

Query: 278 TNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
            NFRR VL+QFLILFQY   +VK + +   L +DQ  ++K+T   VY L+++TPP G+ F
Sbjct: 325 ANFRRAVLVQFLILFQYLQVSVKFKSDTYTLTADQADFIKETESRVYKLLEETPPYGKRF 384

Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRP-LTSITDEDKKDEPDAKKKKT----------- 385
           S+ V  +L  EE WN WKNEGC E K+P   +  +ED K  P+ + ++            
Sbjct: 385 SRTVHHMLAREEMWNNWKNEGCKEFKKPEEPTPNEEDLKPIPNKRPRRALGDALRDASRS 444

Query: 386 -------PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
                    LT+LWN S DNL+ACKS +R+F P LE+Y E   +++DPA           
Sbjct: 445 GKFYLGNDNLTRLWNYSPDNLQACKSEQRNFLPLLEAYLETPHEKVDPA----------- 493

Query: 438 DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE---KPSSQISGNGN-- 492
              + WRALRLL+R+ PHFF + +    K S+++E + KR +++   KP++ +  N +  
Sbjct: 494 ---FEWRALRLLARQTPHFFTSLSQPSSKISDYLEQVRKRLIRDKEPKPAALLGNNSSEN 550

Query: 493 --GVDQDPAEVEVDTKSE-EIQEEEKEEDWEAK 522
             G+  + A+ E +  +    Q+ E E+D E +
Sbjct: 551 TFGLSANNAQTEGEQDATVAFQDGEPEQDLEVE 583


>gi|194745454|ref|XP_001955203.1| GF18644 [Drosophila ananassae]
 gi|190628240|gb|EDV43764.1| GF18644 [Drosophila ananassae]
          Length = 713

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 245/587 (41%), Positives = 353/587 (60%), Gaps = 72/587 (12%)

Query: 4   SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
           + +L   ++ + ++   Y  +    D+DK+  ++ + R  L+K ++    D++ I + + 
Sbjct: 28  TLELAIAQNSVELLVQAYNSFASNSDHDKRLPMDHAFRVLLMKRLED---DVQRIGDLVR 84

Query: 64  LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
           L VE    ++ ++T+P++LL DTFD+ TLDKC+++F +VE  V +WK++ FF SCKNN+L
Sbjct: 85  LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCQEIFRFVEELVEVWKEEIFFASCKNNIL 144

Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
           RMCNDLLRRLSR+QNTVFCGRILLFL+KFFPFSERSGLNI+SEFN++N TE+G D +   
Sbjct: 145 RMCNDLLRRLSRTQNTVFCGRILLFLSKFFPFSERSGLNIVSEFNLDNYTEYGLDSKDHD 204

Query: 184 SSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
            ++  E  DT E   +KID++ Y KFWSLQ++FRNP QCY+K  WKMF  + E +L +F 
Sbjct: 205 DNDNKELEDTAEDIPLKIDYDLYCKFWSLQEFFRNPNQCYSKAQWKMFQMHTENILQSFS 264

Query: 244 SYKLDDVQ--------------------------SSLNPSGDYFAKYLTNQKLLDLQLSD 277
           S+KL+D++                          S++N +  +FAK+LTN KLL LQLSD
Sbjct: 265 SFKLEDLRQSINDNASESGHAMEIDDEAVDSAAISTVNKANHFFAKFLTNPKLLALQLSD 324

Query: 278 TNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
           +NFRR +L+QFLILFQY   +VK + +   L +DQ +++KDT   VY L+++TPP G  F
Sbjct: 325 SNFRRAILVQFLILFQYLQVSVKFKNDSYTLTTDQTEFIKDTEPRVYKLLEETPPYGRRF 384

Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRPLTSIT-DEDKKDEP-------------DAKKK 383
           ++ V  +L  EE WN WKNEGC E K+P    T DED K  P             DA ++
Sbjct: 385 ARTVNHMLVREEMWNNWKNEGCKEFKKPEEPATADEDLKPPPTKRPKRPLGDSLRDAARQ 444

Query: 384 -----KTPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
                    LT+LWN S DNL+ACKS +R+F P LE+Y E    + DPA           
Sbjct: 445 GKFFLGNDNLTRLWNYSPDNLQACKSEQRNFLPLLETYLETPHDKTDPA----------- 493

Query: 438 DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISG-------N 490
              + WRALRLL+R+ PHFF + +    K S+++E + KR V++K    +S        N
Sbjct: 494 ---FEWRALRLLARQTPHFFTSLSLPSNKISDYLEQVRKRLVRDKEPKAVSAASNSEQNN 550

Query: 491 GNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAK-ADPEGDADEVMSVE 536
           G   +    E E D  +  +QE E E+  E +  +P  + DE  + E
Sbjct: 551 GLATNNSHTESEQDA-TVALQEVEPEQGLEVEDTEPVEETDEAPAHE 596


>gi|195502124|ref|XP_002098085.1| GE24134 [Drosophila yakuba]
 gi|194184186|gb|EDW97797.1| GE24134 [Drosophila yakuba]
          Length = 713

 Score =  440 bits (1131), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 232/525 (44%), Positives = 326/525 (62%), Gaps = 63/525 (12%)

Query: 4   SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
           + +L      + ++  +Y  +    ++DK+  ++ + R  L+  +   D D+  I   + 
Sbjct: 28  ALELAIKDGNVELLVKEYNSFPANTEHDKRLPMDHAFRVLLMNRL---DDDVNRIGELVR 84

Query: 64  LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
           L VE    ++ ++T+P++LL DTFD+ TLDKC+++F +VE  V +WK++ FF SCKNN+L
Sbjct: 85  LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCQKIFQFVEEMVEVWKEEIFFASCKNNIL 144

Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
           RMCNDLLRRLSR+QNTVFCGRI LFL+KFFPFSERSGLNI+SEFN++N TE+G D +   
Sbjct: 145 RMCNDLLRRLSRTQNTVFCGRIQLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHD 204

Query: 184 SSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
            S+  E  DT E   +KID++ Y KFWSLQD+FRNP QCYNK  WKMF  +AE +L +F 
Sbjct: 205 ESDNKELEDTAEDIPLKIDYDLYCKFWSLQDFFRNPNQCYNKAQWKMFQMHAENILQSFS 264

Query: 244 SYKLDDVQ--SSLNPS------------------------GDYFAKYLTNQKLLDLQLSD 277
           S+KL+DV+  SS N S                          +FAK+LTN KLL LQLSD
Sbjct: 265 SFKLEDVRQSSSDNASESDQAMDVDEEAVDTTAVTTVIKANHFFAKFLTNPKLLALQLSD 324

Query: 278 TNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
            NFRR VL+QFLILFQY   +VK + +   L +DQ  ++K+T   VY L+++TPP G+ F
Sbjct: 325 ANFRRAVLVQFLILFQYLQVSVKFKSDTYTLTADQSDFIKETETRVYKLLEETPPYGKRF 384

Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRP-LTSITDEDKKDEPDAKKKKT----------- 385
           S+ V  +L  EE WN WKNEGC E K+P   + ++ED K  P+ K ++            
Sbjct: 385 SRTVHHMLAREEMWNNWKNEGCKEFKKPEEPTPSEEDLKPIPNKKPRRALGDALRDASRS 444

Query: 386 -------PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
                    LT+LWN S DNL+ACKS +R+F P LE+Y E   +++DPA           
Sbjct: 445 GKFFLGNDNLTRLWNYSPDNLQACKSEQRNFLPLLETYLETPHEKVDPA----------- 493

Query: 438 DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEK 482
              + WRALRLL+R+ PHFF + +    K S+++E + KR +++K
Sbjct: 494 ---FEWRALRLLARQTPHFFTSLSQPSSKISDYLEQVRKRLIRDK 535


>gi|195112320|ref|XP_002000722.1| GI10383 [Drosophila mojavensis]
 gi|193917316|gb|EDW16183.1| GI10383 [Drosophila mojavensis]
          Length = 730

 Score =  439 bits (1130), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 250/588 (42%), Positives = 350/588 (59%), Gaps = 90/588 (15%)

Query: 20  KYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLP 79
           +Y  ++   ++DK+S ++Q+ R+ L+K +     D++ I   + L V+    ++ ++T+P
Sbjct: 44  EYHSFRANSEHDKRSPMDQAFREVLMKRLSD---DVERIGALVRLSVDAARAEIVSNTIP 100

Query: 80  IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNT 139
           ++LL DTFD+ TL+KCE +F +VE  V +WK++ FF SCKNNLLRMCNDLLRRLSR+QNT
Sbjct: 101 VVLLGDTFDVVTLNKCELIFKFVEDLVEVWKEEIFFASCKNNLLRMCNDLLRRLSRTQNT 160

Query: 140 VFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKV 199
           VFCGRILLFL+KFFPFSERSGLNI+SEFN++N TE+G D +     +  E  DT E   +
Sbjct: 161 VFCGRILLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHDDLDNKELEDTAEDIPL 220

Query: 200 KIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSG- 258
           KID++ Y KFWSLQD+FRNP QCY+K  WKMF  +AET+L +F S+KL+D++ + + +G 
Sbjct: 221 KIDYDLYCKFWSLQDFFRNPNQCYSKAQWKMFQMHAETILQSFLSFKLEDLRQNTDSNGA 280

Query: 259 -------------------------------------DYFAKYLTNQKLLDLQLSDTNFR 281
                                                ++FAK+LTN KLL LQLSD+NFR
Sbjct: 281 ANGVNGSSGDSGAQMMDVDVDDAAVDAAASMAVRKETNFFAKFLTNPKLLALQLSDSNFR 340

Query: 282 RYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVV 341
           R VL+QFLILFQY   +VK + +   L S Q  ++K+T + VY L+++TPP+G  F++ V
Sbjct: 341 RAVLVQFLILFQYLQVSVKFKIDTYTLTSAQSDFIKETEQRVYKLLEETPPNGRRFARTV 400

Query: 342 KLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKTP--------------- 386
           + +L+ EE WN WKN+GC E K+P      ED+      K+ K P               
Sbjct: 401 QHMLQREEMWNNWKNDGCKEFKKPEEPEPTEDESKPSPPKRAKRPLGDCLREAARNGKFY 460

Query: 387 ----ELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNY 441
                LT+LWN S DNL+ACKS +R+F P LE+Y E    + DPA              +
Sbjct: 461 LGNDNLTRLWNYSPDNLQACKSEQRNFLPQLETYLETPYDKTDPA--------------F 506

Query: 442 AWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP-SSQIS---------GNG 491
            WRALRLL+R+ PHFF   T    K +++++ + KR  +EK  + QI+          NG
Sbjct: 507 EWRALRLLARQTPHFFTFVTQPSCKIADYLDVVRKRLAREKELAKQINTAAANASEHNNG 566

Query: 492 NGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEG---DADEVMSVE 536
                 PAE E D  S  +QE E E D E + DPE    +ADE  + E
Sbjct: 567 LAASSAPAEQEHDA-SVALQETEPEPDLEVE-DPEPVVEEADETPAHE 612


>gi|195391418|ref|XP_002054357.1| GJ22844 [Drosophila virilis]
 gi|194152443|gb|EDW67877.1| GJ22844 [Drosophila virilis]
          Length = 720

 Score =  439 bits (1129), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 247/575 (42%), Positives = 348/575 (60%), Gaps = 83/575 (14%)

Query: 20  KYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLP 79
           +Y  ++   ++DK+S ++Q+ R+ L+K +     +++ I   + L V+    ++ ++T+P
Sbjct: 44  EYHSFRANSEHDKRSPMDQAFREVLMKRLSD---NVERIGALVRLSVDAARAEIVSNTIP 100

Query: 80  IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNT 139
           ++LL DTFD+ TL+KCE +F +VE  V +WK++ FF SCKNNLLRMCNDLLRRLSR+QNT
Sbjct: 101 VVLLGDTFDVVTLNKCELIFNFVEELVEVWKEEIFFASCKNNLLRMCNDLLRRLSRTQNT 160

Query: 140 VFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGD--EEMDVSSNETEETDTEEVD 197
           VFCGRILLFL+KFFPFSERSGLNI+SEFN++N TE+G D  +  D+ + E E  DT E  
Sbjct: 161 VFCGRILLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHDDIENKELE--DTAEDI 218

Query: 198 KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPS 257
            +KID++ Y KFWSLQD+FRNP QCY+K  WKMF  +AET+L +F S+KL+DV+ +++ +
Sbjct: 219 PLKIDYDLYCKFWSLQDFFRNPNQCYSKAQWKMFQMHAETILQSFSSFKLEDVRQNIDTN 278

Query: 258 G------------------------------DYFAKYLTNQKLLDLQLSDTNFRRYVLLQ 287
           G                              ++FAK+LTN KLL LQLSD+NFRR VL+Q
Sbjct: 279 GHSESCLQLMDVDMDDATIDGAVSAAMHKETNFFAKFLTNPKLLALQLSDSNFRRAVLVQ 338

Query: 288 FLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKG 347
           FLILFQY   +VK + E   L S Q  ++K+T + VY L+++TPP+G  F++ V+ +L+ 
Sbjct: 339 FLILFQYLQVSVKFKIESYTLTSAQSDFIKETEQRVYKLLEETPPNGRRFARTVQHMLQR 398

Query: 348 EEHWNQWKNEGCPELKRPLTSITDEDKKDEP--------------DAKKK-----KTPEL 388
           EE WN WKN+GC E K+P      ED    P              DA +          L
Sbjct: 399 EEMWNNWKNDGCKEFKKPEEPEPSEDDAKPPPPKRSKRPLGDCLRDAARNGKFFLGNDNL 458

Query: 389 TKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
           T+LWN S DNL+ACKS +R+F P LE+Y E    + DPA              + WRALR
Sbjct: 459 TRLWNYSPDNLQACKSEQRNFLPLLETYLETPHDKTDPA--------------FEWRALR 504

Query: 448 LLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEK----------PSSQISGNGNGVDQD 497
           LL+R+ PHFF   +    K S++++ + KR  +EK           ++    NG  V   
Sbjct: 505 LLARQTPHFFTFVSQPSCKISDYLDVVRKRLTREKELVKQTSTTTANASEHNNGLAVSSA 564

Query: 498 PAEVEVDTKSEEIQEEEKEEDWEAKADPEGDADEV 532
           P E E +  S  +QE E E+D E + D E   +EV
Sbjct: 565 PLEQEHEA-SVALQETEPEQDLEVE-DSEPVVEEV 597


>gi|119622138|gb|EAX01733.1| THO complex 1, isoform CRA_c [Homo sapiens]
          Length = 635

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 232/486 (47%), Positives = 305/486 (62%), Gaps = 42/486 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDEDKKDEPDAKKKKTPE--LTKLWNSKDNLEACKSAERDFTPSLESYFE 417
           P   +  TS    D K     +K+  PE  L K    K  +       ++  P+LE +FE
Sbjct: 397 PSFVKERTS----DTKPTRIIRKRTAPEDFLGKGPTKKILM------GKEHMPTLEEFFE 446

Query: 418 EAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKR 477
           EAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +   E++ENMV +
Sbjct: 447 EAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLENMVIK 506

Query: 478 CVKEKP 483
             KE P
Sbjct: 507 LAKELP 512


>gi|432917195|ref|XP_004079463.1| PREDICTED: THO complex subunit 1-like [Oryzias latipes]
          Length = 654

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 227/499 (45%), Positives = 312/499 (62%), Gaps = 44/499 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
           P  + +KK+ L+Q++R  L   I           + I L ++   + +C++T P +LL D
Sbjct: 39  PGNETEKKTTLDQALRGVLGDQIVEQKASCDDYLSLIYLSIDAVTEGICSATTPFVLLGD 98

Query: 86  TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
             D   LD C+++F +VE NV+ WK  +F+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 99  VLDCLPLDLCDKIFSFVEENVSTWKSNSFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 158

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETE--------------E 190
            LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E E  +    T               E
Sbjct: 159 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNKNELESTLGQKSTNEKEEGMEVEEGEMGE 218

Query: 191 TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
            D      + ID+N Y+KFW+LQDYFRNPVQCY+K  W  F  +++  LA FKSYKLDD+
Sbjct: 219 EDAPAQSSIPIDYNLYRKFWTLQDYFRNPVQCYDKFLWMTFLKFSDETLAVFKSYKLDDM 278

Query: 251 QSS-------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
           Q+S           G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    VK 
Sbjct: 279 QASKRKLDELRGSDGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKF 338

Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
           +     L  DQ  W+++TT+ VY L+K+ PPDG+ F+ +V+ IL  EE+WN WKNEGCP 
Sbjct: 339 KSSSCVLNDDQTTWIEETTKLVYQLLKEIPPDGDKFASMVEHILDTEENWNSWKNEGCPS 398

Query: 362 LKRPLTSITDEDKKDEPDAKKK-------KTP---------ELTKLWN-SKDNLEACKSA 404
             +  T    +DK   P  K++       K P         ELT+LWN + DN+EACKS 
Sbjct: 399 FVKERTV---DDKPKRPTRKRQAPEDFLGKGPDRKIFMGNDELTRLWNLNSDNMEACKSD 455

Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNV 464
            R+F PSL+ +F EAI+Q DP   VE++YK V + NY WRALRLLSR+ PHFF       
Sbjct: 456 SREFMPSLDEFFAEAIEQADPVNMVEDEYKVVRNPNYGWRALRLLSRRSPHFFQPTNQKF 515

Query: 465 EKNSEFIENMVKRCVKEKP 483
           +  ++++++MV +  KE P
Sbjct: 516 KSLADYLDSMVSKLAKELP 534


>gi|47220323|emb|CAF98422.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 568

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 220/452 (48%), Positives = 296/452 (65%), Gaps = 44/452 (9%)

Query: 73  MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRR 132
           +C++T P +LL D  D   L +C+++F +VE NV+ WK  +F+ + KN LLRMCNDLLRR
Sbjct: 2   ICSATTPFVLLGDVLDCLPLHQCDKIFSFVEENVSTWKSNSFYTAGKNYLLRMCNDLLRR 61

Query: 133 LSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEE- 190
           LS+SQNTVFCGRI LFLA+ FP SE+SGLN+ S+FN++NIT F  +E E  +     EE 
Sbjct: 62  LSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQFNLDNITVFNKNEQESSLGQKHAEEK 121

Query: 191 -------------TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
                         D      + ID+N Y+KFW+LQDYFRNPVQCY+KVSW  F  Y+E 
Sbjct: 122 EDGMEVEEGEMGEEDAPAPCSIPIDYNLYRKFWTLQDYFRNPVQCYDKVSWITFVKYSEE 181

Query: 238 VLAAFKSYKLDDVQSS------LNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQF 288
            LA FKSYKLDD+Q+S      L  SG    YFAK+LT++KL+DLQLSD+NFRR++LLQ+
Sbjct: 182 TLAVFKSYKLDDMQASKRKLEDLRTSGGDHVYFAKFLTSEKLMDLQLSDSNFRRHILLQY 241

Query: 289 LILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
           LILFQY    VK +     L  DQ  W+++T++ VY L+++ PPDG+ F+ +V+ IL  E
Sbjct: 242 LILFQYLKGQVKFKSSSSVLNDDQGSWIEETSKLVYQLLREIPPDGDKFATMVEHILSTE 301

Query: 349 EHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKK-------KTP---------ELTKLW 392
           E+WN WKNEGCP   +  T    +DK   P  K++       K P         ELT+LW
Sbjct: 302 ENWNSWKNEGCPSFVKERTV---DDKPKRPTRKRQAPEDFLGKGPDRKIFMGNDELTRLW 358

Query: 393 N-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
           N + DN++ACKS  R+F PSL+ +F EAI+Q DPA  VE++YK V + NY WRALRLLSR
Sbjct: 359 NLNHDNMDACKSESREFIPSLDDFFSEAIEQADPANMVEDEYKVVRNPNYGWRALRLLSR 418

Query: 452 KCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
           + PHFF       +  ++++++MV +  KE P
Sbjct: 419 RSPHFFQPTNQKFKSLADYLDSMVSKLAKELP 450


>gi|195056107|ref|XP_001994954.1| GH17514 [Drosophila grimshawi]
 gi|193892717|gb|EDV91583.1| GH17514 [Drosophila grimshawi]
          Length = 717

 Score =  429 bits (1104), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 235/543 (43%), Positives = 325/543 (59%), Gaps = 77/543 (14%)

Query: 20  KYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLP 79
           +Y  +    ++DK+S ++Q+ R+ L+K +     DI+ I   + L VE    ++ ++T+P
Sbjct: 44  EYHGFGANSEHDKRSPMDQAFREVLMKRLSD---DIEHIGTLVRLSVEAARAEIVSNTIP 100

Query: 80  IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNT 139
           ++LL DTFD+ TL+KCE +F +VE  V +WK++ FF SCKNNLLRMCNDLLRRLSR+QNT
Sbjct: 101 VVLLGDTFDVITLNKCELIFKFVEELVEVWKEEIFFASCKNNLLRMCNDLLRRLSRTQNT 160

Query: 140 VFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKV 199
           VFCGRILLFL+KFFPFSERSGLNI+SEFN++N TE+G D +     +  E  DT E   +
Sbjct: 161 VFCGRILLFLSKFFPFSERSGLNIVSEFNLDNYTEYGLDSKDHDEIDNKELEDTAEDIPL 220

Query: 200 KIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD 259
           KID++ Y KFWSLQD+FRNP QCY+K  WKMF  +AE +L +F S+KL+DV+ + + +G+
Sbjct: 221 KIDYDLYCKFWSLQDFFRNPNQCYSKAQWKMFQMHAENILQSFSSFKLEDVRPNNDFNGN 280

Query: 260 ------------------------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
                                         +FAK+LTN KLL LQLSD+NFRR VL+QFL
Sbjct: 281 SDTGPQAMDVDIDAQAMDAVATLAVRKENHFFAKFLTNPKLLALQLSDSNFRRAVLVQFL 340

Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
           ILFQY   +VK + +   L S Q  ++K+T + VY L+++TPP G  F++ V  +L+ EE
Sbjct: 341 ILFQYLQVSVKFKIDTYTLTSAQSDFIKETEQRVYKLLEETPPYGRRFARTVLHMLQREE 400

Query: 350 HWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKTP-------------------ELTK 390
            WN WKN+GC E K+P      +D      AK+ K P                    LT+
Sbjct: 401 MWNNWKNDGCKEFKKPEEPEPSQDDSKPTPAKRSKRPLGDCLRDAARNGKFYLGNDTLTR 460

Query: 391 LWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLL 449
           LWN S DNL+ACKS +R+F P LE+Y E    + DPA              + WRALRLL
Sbjct: 461 LWNYSPDNLQACKSEQRNFLPLLETYLETTHDKSDPA--------------FEWRALRLL 506

Query: 450 SRKCPHFFLNATPNVEKNSEFIENMVKRCVKEK----------PSSQISGNGNGVDQDPA 499
           +R+  HFF   +    K S++++ + KR  +EK           +S    NG  V   P+
Sbjct: 507 ARQTSHFFTFISQPSCKISDYLDVVRKRLTREKELVKQTNTASANSSEHNNGLAVSSQPS 566

Query: 500 EVE 502
           E E
Sbjct: 567 ENE 569


>gi|241166483|ref|XP_002409875.1| nuclear matrix protein, putative [Ixodes scapularis]
 gi|215494663|gb|EEC04304.1| nuclear matrix protein, putative [Ixodes scapularis]
          Length = 566

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 224/448 (50%), Positives = 288/448 (64%), Gaps = 37/448 (8%)

Query: 73  MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT--FFMSCKNNLLRMCNDLL 130
           +C+ +LP +L  D FD   L++ E+ F  VE  V+ WK+    FF +CKN LLR CNDLL
Sbjct: 1   VCSGSLPGLLFCDMFDSLPLNESEEFFSLVEDKVDTWKEAGILFFKNCKNQLLRTCNDLL 60

Query: 131 RRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG----------GDEE 180
           RRLSRSQNTVFCG+IL+FLAK FP SERSGLNIISEFN ENIT F           G  +
Sbjct: 61  RRLSRSQNTVFCGKILVFLAKLFPLSERSGLNIISEFNSENITLFSSEDYKGMPMKGSPD 120

Query: 181 MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
            D+   ET+  D      + +D+N Y+KFWSLQ+YFR P  CYNKV W+ FTSY+  VL+
Sbjct: 121 DDLEEGETD--DISSTATLSVDYNLYRKFWSLQEYFRQPTLCYNKVHWRQFTSYSSDVLS 178

Query: 241 AFKSYKLDDVQSS----------LNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLI 290
            F S+KLDD++S+          L+    YFAKYLT+QKLL+L+LSD+NFRRYVL+QFLI
Sbjct: 179 VFGSFKLDDIKSAQWTLPEDAAALSGKSVYFAKYLTSQKLLELELSDSNFRRYVLVQFLI 238

Query: 291 LFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
           L QY  STV+ + E   L  +Q  WVK+T   VY L+++TPPDG  F+  +K IL+ EE 
Sbjct: 239 LIQYLQSTVRFKQESQVLTEEQSTWVKNTMSLVYKLLEETPPDGVGFAANIKHILQREEF 298

Query: 351 WNQWKNEGCPELK-----------RPLTSITDEDKKDEPDAKK-KKTPELTKLWN-SKDN 397
           WN WKNEGCP+ K           RP   + DE K    + K      E+ +LWN   DN
Sbjct: 299 WNTWKNEGCPDFKEAKPVDIVKEGRPKRKLGDEIKAALANKKVILGNAEMNRLWNLCPDN 358

Query: 398 LEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
            EAC S +RDF PSLE +FE A++Q DPA A+E +YK V   N+ WR+LRLLS+K PHFF
Sbjct: 359 WEACTSPKRDFVPSLEKFFETAVEQADPAHAIENRYKNVYQGNFGWRSLRLLSQKSPHFF 418

Query: 458 LNATPNVEKNSEFIENMVKRCVKEKPSS 485
            ++   ++    ++E+MV R  KE P S
Sbjct: 419 TSSNQPIKSLPLYLESMVLRLAKELPLS 446


>gi|346473721|gb|AEO36705.1| hypothetical protein [Amblyomma maculatum]
          Length = 587

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 241/542 (44%), Positives = 327/542 (60%), Gaps = 36/542 (6%)

Query: 33  KSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMK-DMCNSTLPIILLSDTFDMST 91
           + A++ + ++ L +L++   ++     + I L + L  + D+C+ +LP++LL D FD   
Sbjct: 16  RQAVDLASKKVLSRLLRV-GLNKNTANDLIDLSITLARQADLCSGSLPVLLLCDVFDSLP 74

Query: 92  LDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAK 151
           L++ E  F  VE  V+IW++  FF  CKN LLR CNDLLRRLSRSQNTVFCG+IL+FLAK
Sbjct: 75  LNESEDFFSLVEGKVDIWREDLFFKCCKNQLLRTCNDLLRRLSRSQNTVFCGKILVFLAK 134

Query: 152 FFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEV--DKVKIDFNFYKKF 209
            FP SERSGLNI SEFN ENIT F  +    +S       D+E+     V +D+N Y+KF
Sbjct: 135 LFPLSERSGLNIASEFNAENITLFSSEGYKGISDEVPFREDSEDAPSSAVPVDYNLYRKF 194

Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS----------LNPSGD 259
           WSLQDYFR P+ CYNKV W+ FTSY+  VL+ F S+KLDDV+++           +    
Sbjct: 195 WSLQDYFRQPILCYNKVHWRQFTSYSGDVLSVFGSFKLDDVKTAKWALPESSDSSSGKSV 254

Query: 260 YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDT 319
           YFAKYLT+QKLL+L+LSD+NFRRYVL+QFLILFQY  STV+ + E   L  +Q  W+K+T
Sbjct: 255 YFAKYLTSQKLLELELSDSNFRRYVLVQFLILFQYLQSTVRFKLETHILTEEQSAWLKNT 314

Query: 320 TETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELK-----------RPLTS 368
              VY L+++TPPDG  F+  +K IL+ EE WN WKNEGCP+ K           RP   
Sbjct: 315 LTQVYKLLEETPPDGVGFAARIKHILQREESWNAWKNEGCPDFKEARPVEPPRETRPKRR 374

Query: 369 ITDEDKKDEPDAKKKKT---PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMD 424
           + DE K     A KK      E+ +LWN   DN EAC S  RDF PSLE +F+  + Q D
Sbjct: 375 LGDEVKAAM--ASKKVVLGDSEMNRLWNLCPDNWEACSSPRRDFVPSLEKFFDIGVDQAD 432

Query: 425 PAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPS 484
           PAAA + + K +  SN  WR+LRLLS+K PHFF ++   ++    ++E+MV R  KE P 
Sbjct: 433 PAAAAKSRAKSMYQSNSGWRSLRLLSQKSPHFFTSSNQPIKSLPLYLESMVMRLAKELPL 492

Query: 485 S--QISGNGNGVDQDPAEVEVDTKSEEI---QEEEKEEDWEAKADPEGDADEVMSVEYCY 539
           S    + + +  D+     +     +E+    EE KE   EA    E  A    S +   
Sbjct: 493 STQNTAASSHQSDEKTDMADYGDSEDELLKNHEETKEAAGEAAPSTESVAAGGASAKDTL 552

Query: 540 QR 541
           +R
Sbjct: 553 KR 554


>gi|427788977|gb|JAA59940.1| Putative nuclear matrix protein [Rhipicephalus pulchellus]
          Length = 672

 Score =  424 bits (1090), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 235/522 (45%), Positives = 319/522 (61%), Gaps = 36/522 (6%)

Query: 40  IRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLF 99
           I   LL++    D    +I+  I+L  E    D+C+ +LP++LL D FD   L++ E+ F
Sbjct: 61  ILSVLLRIGLNKDTANHLIDLSITLARE---PDLCSGSLPVLLLCDVFDSLPLNESEEFF 117

Query: 100 YYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERS 159
             VE  V+IWK+  FF  CKN LLR CNDLLRRLSRSQNTVFCG+IL+FLAK FP SERS
Sbjct: 118 SLVEDKVSIWKEDLFFKCCKNQLLRTCNDLLRRLSRSQNTVFCGKILVFLAKLFPLSERS 177

Query: 160 GLNIISEFNVENITEFGGDE--EMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFR 217
           GLNI SEFN ENIT F  ++   +   S+  EE++      V +D+N Y+KFWSLQ+YFR
Sbjct: 178 GLNIASEFNAENITLFSSEDYKGLSDESSYNEESEEGSTSSVPVDYNLYRKFWSLQEYFR 237

Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS----------LNPSGDYFAKYLTN 267
            P+ CY KV W+ FTSY+  VL+ F S+KLDD++++           +    YFAKYLT+
Sbjct: 238 QPLLCYTKVHWRQFTSYSTDVLSVFGSFKLDDIKTAQWALPGSSSSPSGKSVYFAKYLTS 297

Query: 268 QKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLI 327
           QKLL+L+LSD+NFRRYVL+QFLILFQY  STV+ + +   L  +Q  WVK+T   VY L+
Sbjct: 298 QKLLELELSDSNFRRYVLVQFLILFQYLQSTVRFKLDTHILTEEQSSWVKNTLTQVYKLL 357

Query: 328 KQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELK-----------RPLTSITDEDKKD 376
           ++TPPDG  F+  +K IL+ EE WN WKNEGCP+ K           RP   + DE K  
Sbjct: 358 EETPPDGAGFATRIKHILQREESWNAWKNEGCPDFKEARPVEQTREVRPKRKLGDEVKA- 416

Query: 377 EPDAKKKKT---PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQ 432
              A K+      E+ +LWN   DN EAC S +RDF PSLE +F+ A+ Q DP A  + +
Sbjct: 417 -ALASKRVVLGDTEMNRLWNLCPDNWEACSSPKRDFVPSLEKFFDMAMDQGDPVAGAKSR 475

Query: 433 YKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP-SSQISGNG 491
            K +  SN  WRALRLLS+K PHFF ++   ++    ++E+MV R  KE P ++Q +   
Sbjct: 476 AKSIYQSNSGWRALRLLSQKSPHFFTSSNQPIKSLPLYLESMVMRLAKEFPINTQNAAPS 535

Query: 492 NGVDQDPAEVEVDTKSEEI---QEEEKEEDWEAKADPEGDAD 530
              D+     +     +E+    EE ++ +    +   G AD
Sbjct: 536 QAADEKTDMADYGDSEDELLKNHEESRDRETGTLSSTGGAAD 577


>gi|326917493|ref|XP_003205033.1| PREDICTED: THO complex subunit 1-like [Meleagris gallopavo]
          Length = 540

 Score =  417 bits (1071), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 221/448 (49%), Positives = 286/448 (63%), Gaps = 48/448 (10%)

Query: 114 FFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENIT 173
           F+ + KN LLRMCNDLLRRLS+SQNTVFCGRI LFLA+ FP SE+SGLN+ S+FN+EN+T
Sbjct: 2   FYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQFNLENVT 61

Query: 174 EFGG---------------DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRN 218
            F                 DE MDV   E  + +      + ID+N Y+KFWSLQDYFRN
Sbjct: 62  VFNTNEHESTLGQKHSEERDEGMDVEEGEMGDDEAPTSCSIPIDYNLYRKFWSLQDYFRN 121

Query: 219 PVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKL 270
           PVQCY KVSWK F  Y+E VLA FKSYKLDD Q+S      L   G+  YFAK+LT++KL
Sbjct: 122 PVQCYEKVSWKTFLKYSEEVLAVFKSYKLDDTQASRKKLEELKTGGEHVYFAKFLTSEKL 181

Query: 271 LDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQT 330
           +DLQLSD+NFRR++LLQ+LILFQY    VK +     L  +Q  W++DTT+ VY L+ + 
Sbjct: 182 MDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKAVYQLLSEN 241

Query: 331 PPDGEHFSQVVKLILKGEEHWNQWKNEGCP----ELKRPLTSITDEDKKDEPDAKKKKTP 386
           PPDGE FS++V+ IL  EE+WN WKNEGCP    E  +P   +   D K    A+K+  P
Sbjct: 242 PPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERSQPEFQLGPPDSKPMRPARKRPAP 301

Query: 387 -----------------ELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAA 428
                            ELT+LWN   DN+EACKS  R++ P+LE +FEEAI+Q DP   
Sbjct: 302 EDFLGKGPNKKILMGNEELTRLWNLCPDNMEACKSESREYMPTLEEFFEEAIEQADPENM 361

Query: 429 VEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE--KPSSQ 486
           VE +YK VN+SNY WRALRLL+R+ PHFF       +   E++ENMV +  KE   PS +
Sbjct: 362 VENKYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLENMVIKLAKELPPPSEE 421

Query: 487 I-SGNGNGVDQDPAEVEVDTKSEEIQEE 513
           I +G     + + A ++ + +S E+Q +
Sbjct: 422 IKTGEDEDEEDNDALLKENNESPEVQRD 449


>gi|395823221|ref|XP_003784889.1| PREDICTED: THO complex subunit 1 [Otolemur garnettii]
          Length = 629

 Score =  417 bits (1071), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 228/498 (45%), Positives = 296/498 (59%), Gaps = 72/498 (14%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRVVRKRTAPEDFLGKGPSKKLLMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
           R                             VN+SNY WRALRLL+R+ PHFF       +
Sbjct: 457 R----------------------------AVNNSNYGWRALRLLARRSPHFFQPTNQQFK 488

Query: 466 KNSEFIENMVKRCVKEKP 483
              E++ENMV +  KE P
Sbjct: 489 SLPEYLENMVIKLAKELP 506


>gi|395749750|ref|XP_002828149.2| PREDICTED: LOW QUALITY PROTEIN: THO complex subunit 1 [Pongo
           abelii]
          Length = 625

 Score =  413 bits (1062), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 219/451 (48%), Positives = 286/451 (63%), Gaps = 44/451 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPSKKILMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKV 436
           R++ P+LE +FEEAI+Q DP   VE +YK V
Sbjct: 457 REYMPTLEEFFEEAIEQADPENMVENEYKAV 487


>gi|260829829|ref|XP_002609864.1| hypothetical protein BRAFLDRAFT_126015 [Branchiostoma floridae]
 gi|229295226|gb|EEN65874.1| hypothetical protein BRAFLDRAFT_126015 [Branchiostoma floridae]
          Length = 561

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 213/444 (47%), Positives = 292/444 (65%), Gaps = 41/444 (9%)

Query: 78  LPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQ 137
           +P ++L D F   TLD CE++F +VE  V  WK++ ++   KN LLRMCNDLLRRLSRSQ
Sbjct: 2   IPFLMLDDVFSSVTLDVCERVFKFVEDGVPTWKKEPYYTGGKNYLLRMCNDLLRRLSRSQ 61

Query: 138 NTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG--GDEEMDVSSNETEETDTEE 195
           NTVFCGRI LFLA+ FP  E+SGLN+ S FN+EN+T +   GD+    S+  +E+ D +E
Sbjct: 62  NTVFCGRIQLFLARLFPLEEKSGLNLQSHFNLENVTSYNLKGDD----STMRSEDRDRDE 117

Query: 196 VDK-------------VKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF 242
             +               ID+N Y+KFWS+QD+FRNP QCY K  WK F S ++ VL  F
Sbjct: 118 SMEMEEGEMSDSHSSFTPIDYNLYRKFWSIQDFFRNPNQCYKKDLWKSFASNSDEVLKCF 177

Query: 243 KSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY 294
           +S+KLDD+Q+S      L   G   YFAKYLT++KLLDLQL+D+NFRRYVL+QFLILFQY
Sbjct: 178 ESHKLDDIQASKKKKEMLAGGGGNVYFAKYLTSEKLLDLQLADSNFRRYVLVQFLILFQY 237

Query: 295 FTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQW 354
             + VK +   L +  +Q  W+K+ T+ VY L+++TPPDG+HF+++V+  L+ EE+W  W
Sbjct: 238 LNAEVKFKAPNLVVSDEQALWIKNATDMVYKLLRETPPDGDHFAKMVQHTLEREENWITW 297

Query: 355 KNEGCP-----------ELKRPLTSITDEDKKDEPDAKK--KKTPELTKLWN-SKDNLEA 400
           KN+GCP            LK+       ED K+  D+K+    + ELT+LWN   DN+EA
Sbjct: 298 KNKGCPTYDRERPKTETPLKKSRKRAMGEDIKEAEDSKRVNMGSAELTRLWNLCPDNMEA 357

Query: 401 CKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNA 460
           C S ER+F PSLE +FEEAI+Q DP A VE++YK VN+ N+ +RAL+LL+ + PHFF   
Sbjct: 358 CSSPEREFLPSLEEFFEEAIEQSDPEAMVEDEYKVVNNPNFGFRALKLLACRSPHFFQPT 417

Query: 461 TPNVEKNSEFIENMVKRCVKEKPS 484
               +    ++EN++ +  KE PS
Sbjct: 418 NNQFKTLPSYLENILSKLAKELPS 441


>gi|443734901|gb|ELU18757.1| hypothetical protein CAPTEDRAFT_184685 [Capitella teleta]
          Length = 600

 Score =  408 bits (1048), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 216/503 (42%), Positives = 310/503 (61%), Gaps = 49/503 (9%)

Query: 70  MKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDL 129
           + D+C ++   ++LSD FD  TLD+C+ +F +VE  V++WK  TF+ + KN LLRMCNDL
Sbjct: 10  LSDICGTSTLFMMLSDLFDSITLDQCDSVFKFVEERVSLWKSDTFYNAGKNYLLRMCNDL 69

Query: 130 LRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG-----DEEMDVS 184
           LRRLS+SQNTVFCGRI LFLA+ FP SERSGLN++S+FN+EN+T +       DE    S
Sbjct: 70  LRRLSKSQNTVFCGRIHLFLAQLFPLSERSGLNLMSQFNLENVTLYNTKAEDYDELRKSS 129

Query: 185 SNETEET-DTEEVD------KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
            +E EE+ D EE +       + ID+  Y+KFWSLQD+FR P+QCY+K SWK F  +   
Sbjct: 130 VDEKEESMDIEEGEMQDSSSAIPIDYQLYRKFWSLQDFFRKPIQCYDKNSWKAFEGHTAD 189

Query: 238 VLAAFKSYKLDDVQSS---------LNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQF 288
           VL  F S+KLD++Q S           PS ++FAKYLT++KLLDLQLSD+NFRRYVL+QF
Sbjct: 190 VLNCFTSFKLDEMQISDKRSKKSGRAKPSQNFFAKYLTSEKLLDLQLSDSNFRRYVLIQF 249

Query: 289 LILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
           LI+FQY  + VK +     L  DQ  ++KD  E +  ++++TPPDG  F++ ++ IL  E
Sbjct: 250 LIMFQYLHTQVKFKSPAQTLSEDQTTFIKDIRERILKMLQETPPDGPKFAKTIEHILARE 309

Query: 349 EHWNQWKNEGCPELKRPLT-----------------SITDEDKKDEPDAKKKKTPELTKL 391
           E+WN+WKNEGC    R L+                 +I D+ K       K    ELT+L
Sbjct: 310 EYWNRWKNEGCQSYVRQLSEDDCKARSKFQTKHKRKNIGDDLKASGGKLMKLGNSELTRL 369

Query: 392 WN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLS 450
           WN   DNL+AC+   R+F P+L  YFEE+I Q DP + +E  YK   +S + W++LR+L+
Sbjct: 370 WNLCPDNLDACRLESRNFMPTLREYFEESIDQADPESGIEAGYKLTENSTFGWKSLRVLA 429

Query: 451 RKCPHFFL-----NATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDT 505
              PHFF      +A    +K +E++++MV++ + EK  ++    G G +   A+  +D 
Sbjct: 430 HSSPHFFAQPTGGSANQGSKKMTEYLDSMVQKIIDEKAVAE----GKGKEDVDADKNLDN 485

Query: 506 KSEEIQEEEKEEDWEAK-ADPEG 527
            ++     E ++D   + A P G
Sbjct: 486 TADMDDNLEDKDDASTRPASPAG 508


>gi|312375546|gb|EFR22901.1| hypothetical protein AND_14033 [Anopheles darlingi]
          Length = 736

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 205/402 (50%), Positives = 267/402 (66%), Gaps = 40/402 (9%)

Query: 3   DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYI 62
           +S +  F  + + ++   Y   + T   DKK+AL Q+ R  LL L +    DI  IE +I
Sbjct: 37  ESLKRAFKDNDIELLKADYDAVRAT---DKKTALHQAFRDMLLTLTE----DIPAIEAFI 89

Query: 63  SLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL 122
           +  V  C KDM  +T+P++LL D FD  TLD+ EQ+F YVE NV  WK++ FF +CK+NL
Sbjct: 90  NFAVASCRKDMTMATMPVVLLGDCFDAVTLDRAEQIFTYVENNVATWKEEFFFTACKHNL 149

Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG--GDE- 179
           LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFN+ENITE+G  G+E 
Sbjct: 150 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNLENITEYGMEGNET 209

Query: 180 EMDVSSNETEETD-------TEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFT 232
            +D  ++    TD         E +K+KID+N Y KFW+LQD+FRNP QCYNKV WK F 
Sbjct: 210 TLDQLTSGNGATDDTEGTEKETEENKLKIDYNLYCKFWALQDFFRNPNQCYNKVQWKTFA 269

Query: 233 SYAETVLAAFKSYKLDDVQSS-----------------------LNPSGDYFAKYLTNQK 269
           ++A +VL+AF S+KL++ +SS                       +  +G +FAK+LTN K
Sbjct: 270 THAGSVLSAFSSFKLEEHRSSTGSAKLTEGGTSSSDCPPMDVDQIREAGHFFAKFLTNPK 329

Query: 270 LLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQ 329
           LL LQLSD+NFRR VL+QFLILFQY  STVK + +   L   Q  W+K+T   VY L+++
Sbjct: 330 LLSLQLSDSNFRRSVLVQFLILFQYLNSTVKFKADAHVLTQAQSDWLKETETLVYRLLEE 389

Query: 330 TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD 371
           +PP+G+ F++ V+ +L  EE WN WKNEGC E KRP   + D
Sbjct: 390 SPPNGKKFAETVRHMLTREELWNSWKNEGCKEFKRPEAVLDD 431



 Score = 76.3 bits (186), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 39/98 (39%), Positives = 59/98 (60%), Gaps = 15/98 (15%)

Query: 386 PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWR 444
           PE+T+LWN   DNL+ACK  +R+F PSLE+Y E   ++ DP+              + WR
Sbjct: 497 PEITRLWNVCPDNLQACKGTDRNFLPSLETYLENPKEKQDPS--------------FEWR 542

Query: 445 ALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEK 482
           ALRLL+R+ PHFF        K S++++++ K+  K+K
Sbjct: 543 ALRLLARQSPHFFTLFNAPSYKVSDYLDSVRKKIQKDK 580


>gi|391328907|ref|XP_003738924.1| PREDICTED: THO complex subunit 1-like [Metaseiulus occidentalis]
          Length = 657

 Score =  394 bits (1011), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 209/477 (43%), Positives = 295/477 (61%), Gaps = 30/477 (6%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           + N + L V LC ++  +  LP  + +D FD  T+ +CE++F+ +E  V +WK+  FF  
Sbjct: 73  LTNLVDLLVSLCEREFASPNLPATIFTDIFDTLTISRCEKVFHLMEDKVEVWKKPLFFSH 132

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
           CKN +LRMCNDLLRRLS+SQNTVFCG+IL+ LAK FP SERSGLNI+SEFN ENI+    
Sbjct: 133 CKNQMLRMCNDLLRRLSKSQNTVFCGQILIVLAKLFPLSERSGLNIVSEFNSENIS---- 188

Query: 178 DEEMDVSSNETEETDTEEVD-KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAE 236
                + + E  + DT   D  +K+  N Y+KFWSLQD+FR P  CY++  W+  T  + 
Sbjct: 189 -----IHNVEDGDADTSSDDGHLKVSHNLYRKFWSLQDFFRQPHLCYDENHWRQMTRCSS 243

Query: 237 TVLAAFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT 296
            VL  F S+KLD+ Q+ L+ S  YFAKYLT+++L  L+LSD+NFRR +L+QFLI+FQY  
Sbjct: 244 EVLTVFSSFKLDE-QAVLSKSQPYFAKYLTSRRLFQLELSDSNFRRCILIQFLIMFQYLK 302

Query: 297 STVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN 356
           +TV+ + E   L  +Q +WV++ TE +Y L+++TPPDG  F++ ++ IL  EEHWN WKN
Sbjct: 303 ATVRFKLEKHFLSENQAQWVREKTEEIYKLLQETPPDGLEFARAIENILVREEHWNAWKN 362

Query: 357 EGCPELK----------RPLTSITDEDKKD-EPDAKKKKTPELTKLWN-SKDNLEACKSA 404
           + CP  K          R    + D+ ++  E        PEL K++N   D+ EAC+  
Sbjct: 363 DNCPAFKKADVMGRKCGRRRKKVGDDVREHLEKGQIAMGGPELNKVFNICPDHWEACRVE 422

Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNV 464
            R + P +  +FEEAI Q DPAA VE+QYK V + N+ WRALR+LS+K  HFF +     
Sbjct: 423 RRCYVPGMTEFFEEAIAQADPAACVEDQYKLVANPNWGWRALRILSQKSLHFFTSTNQPA 482

Query: 465 EKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEA 521
           +    +++ MV    K+   ++     NGV      VE D  S   Q++ KEED EA
Sbjct: 483 KSLPNYLDTMVGFLAKDLCINRDPKEPNGV----VRVEDDDNS---QDDFKEEDSEA 532


>gi|442751601|gb|JAA67960.1| Putative nuclear matrix protein [Ixodes ricinus]
          Length = 566

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 209/447 (46%), Positives = 271/447 (60%), Gaps = 63/447 (14%)

Query: 70  MKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDL 129
           +K++C+ +LP +L  D FD   L++ E+ F  VE  V+ WK++ FF +CKN LLR CNDL
Sbjct: 12  LKEVCSGSLPGLLFCDMFDSLPLNESEEFFSLVEDKVDTWKEELFFKNCKNQLLRTCNDL 71

Query: 130 LRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG----------GDE 179
           LRRLSRSQNTVFCG+IL+FLAK FP SERSGLNIISEFN ENIT F           G  
Sbjct: 72  LRRLSRSQNTVFCGKILVFLAKLFPLSERSGLNIISEFNSENITLFSSEDYKGMPMKGSP 131

Query: 180 EMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVL 239
           + D+   ET+  D      V +D+N Y+KFWSLQ+YFR P  CYNKV W+ FTSY+  VL
Sbjct: 132 DDDLEEGETD--DMSSTAAVSVDYNLYRKFWSLQEYFRQPTLCYNKVHWRQFTSYSSDVL 189

Query: 240 AAFKSYKLDDVQS----------SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
           + F S+KLDD++S          +L+    YFAKYLT+QKLL+L+LSD+NFRRYVL+QFL
Sbjct: 190 SVFGSFKLDDIKSAQWTLPEDAAALSGKSVYFAKYLTSQKLLELELSDSNFRRYVLVQFL 249

Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
           IL QY  STV+ + E   L  +Q  WVK+T   VY L+++TPPDG  F+  +K IL+ EE
Sbjct: 250 ILIQYLQSTVRFKQESQVLTEEQSTWVKNTMSLVYKLLEETPPDGVGFAANIKHILQREE 309

Query: 350 HWNQWKNEGCPELK-----------RPLTSITDEDKKDEPDAKK-KKTPELTKLWN-SKD 396
            WN WKNEGCP+ K           RP   + DE K    + K      E+ +LWN   D
Sbjct: 310 FWNTWKNEGCPDFKEAKPVDVVKEGRPKRKLGDEIKAALANKKVILGNAEMNRLWNLCPD 369

Query: 397 NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHF 456
           N EAC S +RDF PSLE +FE A++Q +PA                            HF
Sbjct: 370 NWEACTSPKRDFVPSLEKFFETAVEQAEPA----------------------------HF 401

Query: 457 FLNATPNVEKNSEFIENMVKRCVKEKP 483
           F ++   ++    ++E+MV R  KE P
Sbjct: 402 FTSSNQPIKSLPLYLESMVLRLAKELP 428


>gi|119622139|gb|EAX01734.1| THO complex 1, isoform CRA_d [Homo sapiens]
          Length = 468

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 204/424 (48%), Positives = 265/424 (62%), Gaps = 44/424 (10%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +  TS T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 397 PSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILMGNEELTRLWNLCPDNMEACKSET 456

Query: 406 RDFT 409
           R  T
Sbjct: 457 RQGT 460


>gi|148691058|gb|EDL23005.1| THO complex 1, isoform CRA_c [Mus musculus]
          Length = 473

 Score =  377 bits (968), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 202/421 (47%), Positives = 263/421 (62%), Gaps = 44/421 (10%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 46  PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 101

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 102 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 161

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 162 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 221

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 222 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 281

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 282 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 341

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 342 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 401

Query: 360 PELKRPLTSITDED----KKDEPDAKKKKTP---------ELTKLWN-SKDNLEACKSAE 405
           P   +   S T       K+  P+    K P         ELT+LWN   DN+EACKS  
Sbjct: 402 PSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCPDNMEACKSET 461

Query: 406 R 406
           R
Sbjct: 462 R 462


>gi|196015135|ref|XP_002117425.1| hypothetical protein TRIADDRAFT_1612 [Trichoplax adhaerens]
 gi|190579954|gb|EDV20041.1| hypothetical protein TRIADDRAFT_1612 [Trichoplax adhaerens]
          Length = 517

 Score =  374 bits (959), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 205/489 (41%), Positives = 287/489 (58%), Gaps = 62/489 (12%)

Query: 31  DKKSALEQSIRQYLLKLI-----KTPDIDIKVIE-NYISLCVELCMKD-MCNSTLPIILL 83
           + KSA++Q+ R  L +LI      T D  +K+ +  ++     +C K  + + +LP ILL
Sbjct: 2   ESKSAIDQAFRDLLKELIVQKQKGTDDNQVKIDDLTWLIKISTICAKSGLASVSLPFILL 61

Query: 84  SDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
            D FD  TL  C+++F YVE  +  WK   F    K  +LRMCNDLLRRLSRSQ+T+FCG
Sbjct: 62  VDIFDCLTLRVCQRMFGYVEKEMQTWKDTAFDAMGKVYILRMCNDLLRRLSRSQDTIFCG 121

Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEF-----------GGDEEMDVSSNETEETD 192
           +I LFL+++FP  ERSGLN++S FNV+N+T +             +  MDV   E  E D
Sbjct: 122 QIRLFLSRYFPLDERSGLNVMSHFNVDNVTSYTKRDMVDSQTASAEGVMDVEDGEISEGD 181

Query: 193 TEEV----DKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL- 247
            +E     +K  ID+N Y +FWSLQD+F NP  CY+   W   T+ +  VL  FKSYKL 
Sbjct: 182 DDEKKAEKNKTPIDYNLYYRFWSLQDFFSNPNLCYSNSGWNALTTSSRVVLDVFKSYKLE 241

Query: 248 ------------------------DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRY 283
                                   D   ++++ S  YFAK+LTN+KLLDLQL+D+ FRR 
Sbjct: 242 ALTSNKSTKTDNDNPSDQDGMEISDSCDNAMDVSNRYFAKFLTNEKLLDLQLNDSQFRRT 301

Query: 284 VLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKL 343
           +L+QFLILFQY TSTVK R     +   Q++W++ +T+ VY L+++TPP G+ F ++V+ 
Sbjct: 302 ILIQFLILFQYLTSTVKFRPTSCTISERQQQWIERSTKLVYQLLEETPPGGKQFVKLVRE 361

Query: 344 ILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKK--------------KTPELT 389
           IL  EE+W++WKN+GCP   RP  S    D    P  K+K                PELT
Sbjct: 362 ILSREEYWSKWKNDGCPSFVRPGVSTESSDSSSAPRKKRKISRHYSPTSHKIELGNPELT 421

Query: 390 KLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRL 448
           +LWN S DNL ACK +   FTP+++ +FEEAI+Q DP A VEE+YK VN+  + WR+ R+
Sbjct: 422 RLWNLSTDNLSACKMSGEKFTPTIKDFFEEAIEQCDPEAQVEEEYKLVNNLVFRWRSFRI 481

Query: 449 LSRKCPHFF 457
           L+ +    F
Sbjct: 482 LAVQSSIIF 490


>gi|340368679|ref|XP_003382878.1| PREDICTED: THO complex subunit 1-like [Amphimedon queenslandica]
          Length = 535

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 212/500 (42%), Positives = 295/500 (59%), Gaps = 51/500 (10%)

Query: 29  DYDKKSALEQSIRQYLLKLI-KTPDIDIKVIENYISL-CVELCMKDMCNSTLPIILLSDT 86
           D DKKS  EQS R  L  LI K P +        +S+ C E     +C  ++P +LLSD 
Sbjct: 39  DNDKKSIFEQSFRDVLRDLIAKDPLVSSFSSLVSLSVLCAE---SQICYHSVPFLLLSDA 95

Query: 87  FDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRIL 146
           FD  TLD CE+LF  VE +V +W +Q FF S +  LLRMCNDLLRRLS S NTVFCGRI 
Sbjct: 96  FDCITLDFCEKLFDTVESHVTVWTKQDFFSSGRILLLRMCNDLLRRLSSSLNTVFCGRIQ 155

Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEF----------------GGDEEMDVSSNETEE 190
           LFLA+ FP SE+S LN++S FN+EN+T F                GG+   D    E + 
Sbjct: 156 LFLARLFPLSEKSALNLMSHFNLENVTSFKTLQTPPSSSEPQKPPGGNTNEDGEDMEIDA 215

Query: 191 TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
            D +E     +D+N Y+K W+LQD+FR+PVQC++   WK FT   + VL +F SYKL+D+
Sbjct: 216 LD-QETGMTPVDYNLYEKLWTLQDFFRHPVQCFHSDRWKKFTGTVQEVLQSFSSYKLEDI 274

Query: 251 QS-------------SLNP--SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYF 295
           QS             +L+P  S  YF KYLT++ LL LQLSD++FRR+VL+QFLILFQY 
Sbjct: 275 QSKRRKLQSSDEGMDTLSPYTSSHYFTKYLTSENLLSLQLSDSHFRRHVLIQFLILFQYL 334

Query: 296 TSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWK 355
           T  VK +     L      WV +TT  VY L+++TPPDG+ F++ VKL+++ EE W  WK
Sbjct: 335 TGDVKFKNSSQVLSESHSSWVTETTSKVYQLLEETPPDGKSFAEYVKLVIQREETWIAWK 394

Query: 356 NEGCPELK----RPLTSITDEDKKDE--------PDAKKKKTPELTKLWN-SKDNLEACK 402
           N GCP  +    +P       D K+          + K   + EL++LWN   DNLE+C 
Sbjct: 395 NGGCPSFEKTNSKPPNRGGGGDTKERLGERLFAMKEEKDMGSKELSRLWNLCPDNLESCS 454

Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT- 461
           S ER F P LE + EE +++M P   + ++ + +++ N+ W+ALRLLS +  HFF  A  
Sbjct: 455 SPERIFVPPLEIFLEEPLEEMKPNCEIPKEDRWISNPNFRWQALRLLSCESTHFFPIANQ 514

Query: 462 PNVEKNSEFIENMVKRCVKE 481
            ++    E++++++ +  KE
Sbjct: 515 SSMRPLEEYLDHVILQTSKE 534


>gi|198432183|ref|XP_002123801.1| PREDICTED: similar to THO complex 1 [Ciona intestinalis]
          Length = 744

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 220/594 (37%), Positives = 323/594 (54%), Gaps = 96/594 (16%)

Query: 21  YALYKPTCDYDKKSALEQSIRQYLLK--LIKT-PDIDIKVIENYISLCVELCMKDMCNST 77
           Y  Y P  + +KK+A +Q++R  + +  L KT  +I+I    + I++ V +  + +C+ +
Sbjct: 40  YGKY-PGSEQEKKTATDQTLRDLIREEVLSKTHQEIEITRTLHVINVIVAVAKQGLCSPS 98

Query: 78  LPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQ 137
           +P  LL D  D STLD C+ +F YVE  V +WK   F+ + KN LLRMCNDLLRRLS+SQ
Sbjct: 99  VPFTLLGDVLDCSTLDTCDVIFKYVEERVAVWKSPVFYTAGKNYLLRMCNDLLRRLSQSQ 158

Query: 138 NTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGD---EEMDVSSNETEE-TDT 193
           NTVFCGRI LFLA+ FP SE+S LN++S+FN++N+T F  D     +DV+   +EE  +T
Sbjct: 159 NTVFCGRIQLFLARLFPLSEKSALNLVSQFNLDNVTIFNTDPSNSTLDVNEEASEEKMET 218

Query: 194 EEVDKVK-----IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
           +EV K K     +D+  Y   WS+QD FR+PVQCY K +W   T  A+ +   F S+KLD
Sbjct: 219 DEVKKEKSESGLVDYKLYLNIWSIQDIFRSPVQCYQKETWDKLTENADNIFRVFDSFKLD 278

Query: 249 DVQSS-----LNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRG 303
           D++             YF+K+LT++KL +LQLSD  FRR +L QFLILF+Y T+TVK +G
Sbjct: 279 DIRGGNKTNQTTHKKSYFSKFLTSEKLTELQLSDGMFRRQILTQFLILFRYLTATVKFKG 338

Query: 304 EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELK 363
             L L   Q+ W+  T   +Y L+++TPP+G+ F++  K +L+ EE W  WKN GCP   
Sbjct: 339 ANLVLTEAQKTWISTTETKIYKLLQETPPNGKKFAKYTKHLLEREEFWITWKNSGCPSFM 398

Query: 364 RPLTSITDE---DKKDEPDAKKKK--------------TPELTKLWNSKDN--------- 397
           +  +   DE    KK + D KK                +PELTKLWN   N         
Sbjct: 399 KMRSKSEDESPASKKRKIDPKKSSIAEDFISGKSLGMGSPELTKLWNLCPNNLAACRSNE 458

Query: 398 ------------------------------------------LEACKSAERDFTPSLESY 415
                                                     L AC+S ER+F PS++ Y
Sbjct: 459 REFLPSIQDYFAEPIEQEDPKNQIEDQYNPELTKLWNLFPDNLAACRSNEREFLPSIQDY 518

Query: 416 FEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNS--EFIEN 473
           F E I+Q DP   +E+QYK + +SN+ W+ALRLLS +   FF +   N++  +  +++  
Sbjct: 519 FAEPIEQEDPKNQIEDQYKLIKNSNFQWKALRLLSNRSQLFFTSQPANLQFKTVPDYLST 578

Query: 474 MVKRCVKEKPSSQIS------GNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEA 521
           ++ +  KE P+           NG+ V     +++      EI  E++ ED +A
Sbjct: 579 VIGKLAKEFPTQSTEEIKVEVSNGDLV--GVGDLKTTMTVSEINNEDEGEDEDA 630


>gi|449663492|ref|XP_002156883.2| PREDICTED: THO complex subunit 1-like [Hydra magnipapillata]
          Length = 567

 Score =  368 bits (944), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 205/458 (44%), Positives = 275/458 (60%), Gaps = 42/458 (9%)

Query: 73  MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRR 132
            C   LPI+LLSD FD  TL  CE++F Y+E NV++W  + FF + KN+LLRMCND+LRR
Sbjct: 107 FCTKNLPILLLSDIFDTMTLCDCEKMFKYIEKNVSVWNAEPFFSTGKNHLLRMCNDMLRR 166

Query: 133 LSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMD---VSSNETE 189
           LS+SQNTVFCGRI LFLA+ FP  E+S LN++S F++EN+T F   E++D   +  N++ 
Sbjct: 167 LSKSQNTVFCGRIQLFLAQLFPVEEKSALNLMSNFHLENVTLFKT-EKLDFDEIQPNDSV 225

Query: 190 ETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD- 248
           E D E V+   +D+N Y KFW+LQDYF NP +CY K  W +  +    VL  F S+KL+ 
Sbjct: 226 E-DRELVNN-PVDYNLYIKFWTLQDYFVNPSKCYTKEGWNLINTNISEVLKIFTSFKLEF 283

Query: 249 ------------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT 296
                       DV      + +YFAKYLTN+KL + QL+D+NFRR  L+Q LILFQY +
Sbjct: 284 NVRKRKSIDMPIDVDQIKQTANNYFAKYLTNEKLFNFQLNDSNFRRNFLVQVLILFQYLS 343

Query: 297 STVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN 356
            T K +     L   Q  W+ +TT+ VY+ I +TPPDG+ F + +K IL  EE+W  WKN
Sbjct: 344 GTSKFKAPNQVLNDTQTNWINETTDIVYNSICETPPDGKKFCETIKHILSREENWINWKN 403

Query: 357 EGCPEL-------------KRPLTSITDEDKKDEPDAKKKKTPELTKLWN-SKDNLEACK 402
           EGCP               K+   SI D+   ++       T ELT+LWN + DNL + K
Sbjct: 404 EGCPSFAKVKSSTETEVAPKKAKRSIGDDFVLNQSKKINMGTLELTRLWNLNPDNLASAK 463

Query: 403 SAER-DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF---- 457
              R  F P+L+ +FE+AI+Q DP A +E++YK V   N+ WRALRLL+R+  HFF    
Sbjct: 464 VESRLKFLPNLKDFFEKAIEQADPNAGIEDEYKLVKQPNFQWRALRLLARRSDHFFTPSQ 523

Query: 458 --LNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNG 493
               A P  E  S  IEN+ K   K+KPS       NG
Sbjct: 524 TPFKALP--EYLSSVIENISKDMTKDKPSCDSKSLENG 559


>gi|74217198|dbj|BAC31387.2| unnamed protein product [Mus musculus]
 gi|74226598|dbj|BAE23950.1| unnamed protein product [Mus musculus]
          Length = 397

 Score =  359 bits (922), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 182/361 (50%), Positives = 238/361 (65%), Gaps = 30/361 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           K +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE+WN WKNEGC
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEENWNSWKNEGC 396

Query: 360 P 360
           P
Sbjct: 397 P 397


>gi|226487124|emb|CAX75427.1| putative UDP-galactose-4-epimerase [Schistosoma japonicum]
          Length = 632

 Score =  341 bits (875), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 190/466 (40%), Positives = 266/466 (57%), Gaps = 44/466 (9%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           DKKS ++ ++R  +L+L+ +  +D   ++  + + V   + ++C++  P +L SD F+  
Sbjct: 41  DKKSLIDVALRGRMLRLLASDYMDD--VKTLLKMGVAAALANLCSAPTPFLLFSDVFNTK 98

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
            +  CE++F ++E  +   K    F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL 
Sbjct: 99  PISLCEEMFGFMEETIITLKDVPLFGSGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 158

Query: 151 KFFPFSERSGLNIISEFNVE---------NITEFGGDEEMDVSSNETEETD-TEEVDKVK 200
           + FP  E+SGLN +S FN E         + + F      D +S++ EE + T+    ++
Sbjct: 159 RLFPLDEKSGLNFMSNFNSEKDILYNKNPDPSVFKHQISHDHTSDDLEEGEMTDSSTPLE 218

Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL-----DDVQSSLN 255
           +D   Y KFWSLQ++F++PV CY K  W  FTS  +TVL  F S KL      D +S   
Sbjct: 219 VDAGLYVKFWSLQEFFKSPVLCYEKAKWLKFTSSTDTVLDVFSSIKLITTEEGDFRS--Q 276

Query: 256 PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
            S   F+KYLT++KLLDLQL D +FRRY+L+Q LILFQY T+ VK +     L  DQ  W
Sbjct: 277 SSSKTFSKYLTSEKLLDLQLMDPSFRRYILVQLLILFQYLTTAVKFKTIDQVLSEDQHSW 336

Query: 316 VKDTTETVYSLIKQ---TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
           V    E V  L+       P    F   V+ IL+ E +WN+WKN+GCP        I + 
Sbjct: 337 VNQRHEVVLRLLSSNNTNSPSDNTFVSTVEHILERESYWNRWKNDGCPSF------IRNP 390

Query: 373 DKKDEPDAKKKKTPELT---------------KLWN-SKDNLEACKSAERDFTPSLESYF 416
           +K      K+   P +T               KLWN   DNL AC+   R F   L SYF
Sbjct: 391 EKSKLSVRKRHINPLVTRAGQKVYRFGNRALDKLWNVCPDNLAACRDKRRVFDTDLHSYF 450

Query: 417 EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
           ++AI +MDPA  VEE+YK +N   ++WRALR LSRKCPHF++N  P
Sbjct: 451 QDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYINWNP 496


>gi|226487128|emb|CAX75429.1| putative UDP-galactose-4-epimerase [Schistosoma japonicum]
          Length = 639

 Score =  341 bits (874), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 190/466 (40%), Positives = 266/466 (57%), Gaps = 44/466 (9%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           DKKS ++ ++R  +L+L+ +  +D   ++  + + V   + ++C++  P +L SD F+  
Sbjct: 41  DKKSLIDVALRGRMLRLLASDYMDD--VKTLLKMGVAAALANLCSAPTPFLLFSDVFNTK 98

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
            +  CE++F ++E  +   K    F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL 
Sbjct: 99  PISLCEEMFGFMEETIITLKDVPLFGSGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 158

Query: 151 KFFPFSERSGLNIISEFNVE---------NITEFGGDEEMDVSSNETEETD-TEEVDKVK 200
           + FP  E+SGLN +S FN E         + + F      D +S++ EE + T+    ++
Sbjct: 159 RLFPLDEKSGLNFMSNFNSEKDILYNKNPDPSVFKHQISHDHTSDDLEEGEMTDSSTPLE 218

Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL-----DDVQSSLN 255
           +D   Y KFWSLQ++F++PV CY K  W  FTS  +TVL  F S KL      D +S   
Sbjct: 219 VDAGLYVKFWSLQEFFKSPVLCYEKAKWLKFTSSTDTVLDVFSSIKLITTEEGDFRS--Q 276

Query: 256 PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
            S   F+KYLT++KLLDLQL D +FRRY+L+Q LILFQY T+ VK +     L  DQ  W
Sbjct: 277 SSSKTFSKYLTSEKLLDLQLMDPSFRRYILVQLLILFQYLTTAVKFKTIDQVLSEDQHSW 336

Query: 316 VKDTTETVYSLIKQ---TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
           V    E V  L+       P    F   V+ IL+ E +WN+WKN+GCP        I + 
Sbjct: 337 VNQRHEVVLRLLSSNNTNSPSDNTFVSTVEHILERESYWNRWKNDGCPSF------IRNP 390

Query: 373 DKKDEPDAKKKKTPELT---------------KLWN-SKDNLEACKSAERDFTPSLESYF 416
           +K      K+   P +T               KLWN   DNL AC+   R F   L SYF
Sbjct: 391 EKSKLSVRKRHINPLVTRAGQKVYRFGNRALDKLWNVCPDNLAACRDKRRVFDTDLHSYF 450

Query: 417 EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
           ++AI +MDPA  VEE+YK +N   ++WRALR LSRKCPHF++N  P
Sbjct: 451 QDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYINWNP 496


>gi|226487130|emb|CAX75430.1| putative UDP-galactose-4-epimerase [Schistosoma japonicum]
          Length = 616

 Score =  341 bits (874), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 190/466 (40%), Positives = 266/466 (57%), Gaps = 44/466 (9%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           DKKS ++ ++R  +L+L+ +  +D   ++  + + V   + ++C++  P +L SD F+  
Sbjct: 41  DKKSLIDVALRGRMLRLLASDYMDD--VKTLLKMGVAAALANLCSAPTPFLLFSDVFNTK 98

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
            +  CE++F ++E  +   K    F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL 
Sbjct: 99  PISLCEEMFGFMEETIITLKDVPLFGSGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 158

Query: 151 KFFPFSERSGLNIISEFNVE---------NITEFGGDEEMDVSSNETEETD-TEEVDKVK 200
           + FP  E+SGLN +S FN E         + + F      D +S++ EE + T+    ++
Sbjct: 159 RLFPLDEKSGLNFMSNFNSEKDILYNKNPDPSVFKHQISHDHTSDDLEEGEMTDSSTPLE 218

Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL-----DDVQSSLN 255
           +D   Y KFWSLQ++F++PV CY K  W  FTS  +TVL  F S KL      D +S   
Sbjct: 219 VDAGLYVKFWSLQEFFKSPVLCYEKAKWLKFTSSTDTVLDVFSSIKLITTEEGDFRS--Q 276

Query: 256 PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
            S   F+KYLT++KLLDLQL D +FRRY+L+Q LILFQY T+ VK +     L  DQ  W
Sbjct: 277 SSSKTFSKYLTSEKLLDLQLMDPSFRRYILVQLLILFQYLTTAVKFKTIDQVLSEDQHSW 336

Query: 316 VKDTTETVYSLIKQ---TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
           V    E V  L+       P    F   V+ IL+ E +WN+WKN+GCP        I + 
Sbjct: 337 VNQRHEVVLRLLSSNNTNSPSDNTFVSTVEHILERESYWNRWKNDGCPSF------IRNP 390

Query: 373 DKKDEPDAKKKKTPELT---------------KLWN-SKDNLEACKSAERDFTPSLESYF 416
           +K      K+   P +T               KLWN   DNL AC+   R F   L SYF
Sbjct: 391 EKSKLSVRKRHINPLVTRAGQKVYRFGNRALDKLWNVCPDNLAACRDKRRVFDTDLHSYF 450

Query: 417 EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
           ++AI +MDPA  VEE+YK +N   ++WRALR LSRKCPHF++N  P
Sbjct: 451 QDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYINWNP 496


>gi|226487126|emb|CAX75428.1| putative UDP-galactose-4-epimerase [Schistosoma japonicum]
          Length = 639

 Score =  341 bits (874), Expect = 7e-91,   Method: Compositional matrix adjust.
 Identities = 190/466 (40%), Positives = 266/466 (57%), Gaps = 44/466 (9%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           DKKS ++ ++R  +L+L+ +  +D   ++  + + V   + ++C++  P +L SD F+  
Sbjct: 41  DKKSLIDVALRGRMLRLLASDYMDD--VKTLLKMGVAAALANLCSAPTPFLLFSDVFNTK 98

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
            +  CE++F ++E  +   K    F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL 
Sbjct: 99  PISLCEEMFGFMEETIITLKDVPLFGSGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 158

Query: 151 KFFPFSERSGLNIISEFNVE---------NITEFGGDEEMDVSSNETEETD-TEEVDKVK 200
           + FP  E+SGLN +S FN E         + + F      D +S++ EE + T+    ++
Sbjct: 159 RLFPLDEKSGLNFMSNFNSEKDILYNKNPDPSVFKHQISHDHTSDDLEEGEMTDSSTPLE 218

Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL-----DDVQSSLN 255
           +D   Y KFWSLQ++F++PV CY K  W  FTS  +TVL  F S KL      D +S   
Sbjct: 219 VDAGLYVKFWSLQEFFKSPVLCYEKAKWLKFTSSTDTVLDVFSSIKLITTEEGDFRS--Q 276

Query: 256 PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
            S   F+KYLT++KLLDLQL D +FRRY+L+Q LILFQY T+ VK +     L  DQ  W
Sbjct: 277 SSSKTFSKYLTSEKLLDLQLMDPSFRRYILVQLLILFQYLTTAVKFKTIDQVLSEDQHSW 336

Query: 316 VKDTTETVYSLIKQ---TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
           V    E V  L+       P    F   V+ IL+ E +WN+WKN+GCP        I + 
Sbjct: 337 VNQRHEVVLRLLSSNNTNSPSDNTFVSTVEHILERESYWNRWKNDGCPSF------IRNP 390

Query: 373 DKKDEPDAKKKKTPELT---------------KLWN-SKDNLEACKSAERDFTPSLESYF 416
           +K      K+   P +T               KLWN   DNL AC+   R F   L SYF
Sbjct: 391 EKSKLSVRKRHINPLVTRAGQKVYRFGNRALDKLWNVCPDNLAACRDKRRVFDTDLHSYF 450

Query: 417 EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
           ++AI +MDPA  VEE+YK +N   ++WRALR LSRKCPHF++N  P
Sbjct: 451 QDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYINWNP 496


>gi|89267023|emb|CAJ82056.1| THO complex 1 [Xenopus (Silurana) tropicalis]
          Length = 421

 Score =  340 bits (872), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 176/367 (47%), Positives = 235/367 (64%), Gaps = 31/367 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIEN---YISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q  R  L + I    I+    EN    ISL +      +C +T P +L
Sbjct: 54  PGSENEKKPTLDQVFRVVLEEEI----INQVSCENCLAIISLAINGVTDGICTATTPFVL 109

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   L  C+++F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 110 LGDVLDCLPLAYCDKIFTFVEKNVGTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 169

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE--------------MDVSSNET 188
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+              M++   E 
Sbjct: 170 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNANEQESTLGQKVEEKDDGMEIEEGEM 229

Query: 189 EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
            + D      + ID+N Y+KFW+LQD+FRNPVQCY+K SWK F  +++ VL  FKSYKLD
Sbjct: 230 GDDDAPNSSSIPIDYNLYRKFWTLQDFFRNPVQCYDKNSWKTFLKFSDEVLTVFKSYKLD 289

Query: 249 DVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
           D Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++L+Q+LILFQY    VK
Sbjct: 290 DTQASRKKIEELKSGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILVQYLILFQYLKGQVK 349

Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
            +     L  +Q  W++DTT+ V+ L+ +TPP GE FS+ V+ IL  EE+WN WKNEGCP
Sbjct: 350 FKSSNYILLDEQSLWIEDTTKLVHQLLSETPPVGEKFSKTVEHILNTEENWNAWKNEGCP 409

Query: 361 EL--KRP 365
               +RP
Sbjct: 410 SFVKERP 416


>gi|226487132|emb|CAX75431.1| putative UDP-galactose-4-epimerase [Schistosoma japonicum]
          Length = 524

 Score =  340 bits (871), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 190/466 (40%), Positives = 266/466 (57%), Gaps = 44/466 (9%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           DKKS ++ ++R  +L+L+ +  +D   ++  + + V   + ++C++  P +L SD F+  
Sbjct: 41  DKKSLIDVALRGRMLRLLASDYMDD--VKTLLKMGVAAALANLCSAPTPFLLFSDVFNTK 98

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
            +  CE++F ++E  +   K    F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL 
Sbjct: 99  PISLCEEMFGFMEETIITLKDVPLFGSGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 158

Query: 151 KFFPFSERSGLNIISEFNVE---------NITEFGGDEEMDVSSNETEETD-TEEVDKVK 200
           + FP  E+SGLN +S FN E         + + F      D +S++ EE + T+    ++
Sbjct: 159 RLFPLDEKSGLNFMSNFNSEKDILYNKNPDPSVFKHQISHDHTSDDLEEGEMTDSSTPLE 218

Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL-----DDVQSSLN 255
           +D   Y KFWSLQ++F++PV CY K  W  FTS  +TVL  F S KL      D +S   
Sbjct: 219 VDAGLYVKFWSLQEFFKSPVLCYEKAKWLKFTSSTDTVLDVFSSIKLITTEEGDFRS--Q 276

Query: 256 PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
            S   F+KYLT++KLLDLQL D +FRRY+L+Q LILFQY T+ VK +     L  DQ  W
Sbjct: 277 SSSKTFSKYLTSEKLLDLQLMDPSFRRYILVQLLILFQYLTTAVKFKTIDQVLSEDQHSW 336

Query: 316 VKDTTETVYSLIKQ---TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
           V    E V  L+       P    F   V+ IL+ E +WN+WKN+GCP        I + 
Sbjct: 337 VNQRHEVVLRLLSSNNTNSPSDNTFVSTVEHILERESYWNRWKNDGCPSF------IRNP 390

Query: 373 DKKDEPDAKKKKTPELT---------------KLWN-SKDNLEACKSAERDFTPSLESYF 416
           +K      K+   P +T               KLWN   DNL AC+   R F   L SYF
Sbjct: 391 EKSKLSVRKRHINPLVTRAGQKVYRFGNRALDKLWNVCPDNLAACRDKRRVFDTDLHSYF 450

Query: 417 EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
           ++AI +MDPA  VEE+YK +N   ++WRALR LSRKCPHF++N  P
Sbjct: 451 QDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYINWNP 496


>gi|313220893|emb|CBY31729.1| unnamed protein product [Oikopleura dioica]
          Length = 648

 Score =  335 bits (860), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 197/519 (37%), Positives = 289/519 (55%), Gaps = 66/519 (12%)

Query: 3   DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLK-LIKTPDIDIKVIENY 61
           + F  +F   + N    KY+    + D DKK   +Q ++  L + L+K  DI  KVI + 
Sbjct: 19  EGFDTDFVSGKPNEFISKYSATHGS-DADKKQVADQCLKVKLQEVLVKYSDIS-KVI-SL 75

Query: 62  ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM-SCKN 120
           ++       K +C+   P+ L+SD  + STLD CE++F +VE    IWK   F++ +CKN
Sbjct: 76  LTFATSCARKSLCSVLTPLALISDALECSTLDDCERIFDFVEKQTPIWKSTQFYVNACKN 135

Query: 121 NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE 180
            LLR CNDLLRRLS+SQN VFCGRI +FL + FP SE+S LN+ S FN+EN        E
Sbjct: 136 QLLRNCNDLLRRLSQSQNNVFCGRIHIFLTRIFPISEKSALNLNSNFNLEN--------E 187

Query: 181 MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
            D +  E EET+  ++         Y   WS+QDYFRNP+Q Y  V +  F     T+  
Sbjct: 188 TDYTRIEDEETENRDL---------YNSLWSVQDYFRNPLQLYEPVKFAEFEKAVNTINT 238

Query: 241 AFKSYKLD--------------------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
           AF ++KL+                    DV+   N    YF K+LT+  LL+LQL+D +F
Sbjct: 239 AFANFKLEVNDDEEEKRSKMKEIDYDSIDVEMKSNGQIVYFPKFLTSFNLLELQLNDPSF 298

Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQV 340
           RR +L + L LFQY T  VK R     L     +W+++T +T+Y L+++TPP+G+ +   
Sbjct: 299 RRQILTEILFLFQYLTGYVKFRSATNVLPFASREWIRNTEKTIYELLQETPPNGKKYVLF 358

Query: 341 VKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKT--------------- 385
           VK +++ E  W +WKNEGCP   R      ++DK      KK+KT               
Sbjct: 359 VKHLIEREAFWVKWKNEGCPRFMR------EKDKDGASRPKKRKTIYDDFMSNKKLGLGS 412

Query: 386 PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWR 444
           PEL+KLW  S  NLEACK+  R F P + ++FEEA++  DPA  +E+QYK   ++ + WR
Sbjct: 413 PELSKLWGRSLPNLEACKTENRQFIPKVHTFFEEAMEHADPANEIEDQYKCYKNAEFGWR 472

Query: 445 ALRLLSRKCPHFFLNATPNVEKNS--EFIENMVKRCVKE 481
           +LRLL+   PHFF     N+   +  +++E+++K+ +KE
Sbjct: 473 SLRLLAHASPHFFSLQPANMHHLAVPKYLEHILKKLLKE 511


>gi|313226154|emb|CBY21297.1| unnamed protein product [Oikopleura dioica]
          Length = 648

 Score =  335 bits (860), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 197/519 (37%), Positives = 289/519 (55%), Gaps = 66/519 (12%)

Query: 3   DSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLK-LIKTPDIDIKVIENY 61
           + F  +F   + N    KY+    + D DKK   +Q ++  L + L+K  DI  KVI + 
Sbjct: 19  EGFDTDFVSGKPNEFISKYSATHGS-DADKKQVADQCLKVKLQEVLVKYSDIS-KVI-SL 75

Query: 62  ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM-SCKN 120
           ++       K +C+   P+ L+SD  + STLD CE++F +VE    IWK   F++ +CKN
Sbjct: 76  LTFATSCARKSLCSVLTPLALISDALECSTLDDCERIFDFVEKQTPIWKSTQFYVNACKN 135

Query: 121 NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE 180
            LLR CNDLLRRLS+SQN VFCGRI +FL + FP SE+S LN+ S FN+EN        E
Sbjct: 136 QLLRNCNDLLRRLSQSQNNVFCGRIHIFLTRIFPISEKSALNLNSNFNLEN--------E 187

Query: 181 MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
            D +  E EET+  ++         Y   WS+QDYFRNP+Q Y  V +  F     T+  
Sbjct: 188 TDYTRIEDEETENRDL---------YNSLWSVQDYFRNPLQLYEPVKFAEFEKAVNTINT 238

Query: 241 AFKSYKLD--------------------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
           AF ++KL+                    DV+   N    YF K+LT+  LL+LQL+D +F
Sbjct: 239 AFANFKLEVNDDEEEKRSKMKEIDYDSIDVEMKSNGQIVYFPKFLTSFNLLELQLNDPSF 298

Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQV 340
           RR +L + L LFQY T  VK R     L     +W+++T +T+Y L+++TPP+G+ +   
Sbjct: 299 RRQILTEILFLFQYLTGYVKFRSATNVLPFASREWIRNTEKTIYELLQETPPNGKKYVLF 358

Query: 341 VKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKT--------------- 385
           VK +++ E  W +WKNEGCP   R      ++DK      KK+KT               
Sbjct: 359 VKHLIEREAFWVKWKNEGCPRFMR------EKDKDGASRPKKRKTIYDDFMSNKKLGLGS 412

Query: 386 PELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWR 444
           PEL+KLW  S  NLEACK+  R F P + ++FEEA++  DPA  +E+QYK   ++ + WR
Sbjct: 413 PELSKLWGRSLPNLEACKTENRQFIPKVHTFFEEAMEHADPANEIEDQYKCYKNAEFGWR 472

Query: 445 ALRLLSRKCPHFFLNATPNVEKNS--EFIENMVKRCVKE 481
           +LRLL+   PHFF     N+   +  +++E+++K+ +KE
Sbjct: 473 SLRLLAHASPHFFSLQPANMHHLAVPKYLEHILKKLLKE 511


>gi|358342057|dbj|GAA49608.1| THO complex subunit 1 [Clonorchis sinensis]
          Length = 1030

 Score =  330 bits (847), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 182/468 (38%), Positives = 270/468 (57%), Gaps = 38/468 (8%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           D K  ++ + R  +L+L+ +  +D   +++ + + V   + ++C++  P +L SD F+  
Sbjct: 505 DTKPVIDVAFRSRMLELLSSHSLD--EVKSLLKVGVAAALANICSAPTPFLLFSDVFNTK 562

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
            +  C++LF ++E  +   K    F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL+
Sbjct: 563 PISVCQELFGFMEETITTLKDVPLFASGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLS 622

Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEE---------MDVSSNETEETDTEEVDK-VK 200
           + FP  E+SGLN++S FN++  T +    +         MD S+ + +E +T    K ++
Sbjct: 623 RLFPLDEKSGLNLMSNFNLDKETPYNKTPDPSIFKHQLSMDSSTEDPDEGETAAHPKSLE 682

Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQS--SLNPSG 258
           +D N Y KFWSLQD+F+ P  CY+K  W  F S  ETVL  F S KL       + N SG
Sbjct: 683 VDANLYVKFWSLQDFFKIPTSCYDKAKWVTFRSNIETVLEIFSSIKLSGSNEGDASNTSG 742

Query: 259 DY-FAKYLTNQK---------LLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLEL 308
              F+KYLT++K         LLDLQL D++FRRY+++Q L++FQY  +  K +     L
Sbjct: 743 SSKFSKYLTSEKAIIPRCSFQLLDLQLMDSSFRRYIMVQILVVFQYLITPTKFKTPDQVL 802

Query: 309 KSDQEKWVKDTTETVYSLIKQT-PPDGEH---FSQVVKLILKGEEHWNQWKNEGCPELKR 364
             +Q +W+    E + SL+      D +H   F+  +  IL+ E HWN WKNEGCP   R
Sbjct: 803 SEEQTEWLNRKRELILSLLSGNRKTDSQHTDSFASTIGRILERETHWNSWKNEGCPSFVR 862

Query: 365 PLTSITDEDKK-------DEPDAK--KKKTPELTKLWN-SKDNLEACKSAERDFTPSLES 414
           P      E +K       D   AK  +     L KLWN   DNL AC+   R F P+L+S
Sbjct: 863 PQEKSPVETRKRRTNPLIDRAGAKIYRFGNKNLDKLWNVCPDNLAACRDERRLFRPNLQS 922

Query: 415 YFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
           YF++AI ++DPA  V+E+YK +N   + WR+LR L+R+ PHF++N  P
Sbjct: 923 YFQDAILELDPAEKVDEEYKSINKEEWCWRSLRCLARRSPHFYINWNP 970


>gi|312066929|ref|XP_003136503.1| hypothetical protein LOAG_00915 [Loa loa]
          Length = 667

 Score =  327 bits (838), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 171/474 (36%), Positives = 268/474 (56%), Gaps = 51/474 (10%)

Query: 57  VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
           ++   +++  +   +D C     + ++ D FD+S++ +CE+LF  VE N+  WK   F+ 
Sbjct: 56  LLRQLLAVTFQAAKEDFCAKGTTVTVVQDVFDVSSVKRCEELFVIVEDNLPQWKMSIFYE 115

Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
            CKN +LRMCNDLL+RLSR+ +T+FCGRIL+ LA   P  E+SGLN++S FN+ N+T++ 
Sbjct: 116 PCKNMILRMCNDLLKRLSRTVDTIFCGRILVLLAHALPLCEKSGLNLVSHFNLGNVTKYD 175

Query: 177 -GDEEMDVSSNETEETDTEEVDKVK-----IDFNFYKKFWSLQDYFRNPVQCYNKVSWKM 230
             D  + V +N TE++D  EV ++      +D+  Y KFW LQ +F +P  C+ K  W+ 
Sbjct: 176 MADSCLGVDANLTEDSDVMEVGEISEREIPVDYTLYAKFWQLQSFFSSPATCFEKSKWRT 235

Query: 231 FTSYAETVLAAFKSYKLDD------------------------VQSSLNPSGD---YFAK 263
           F      VL  F S+KL++                        + S+    GD   YFAK
Sbjct: 236 FQQNTTEVLNIFSSHKLENCIALGEGSKRERLKQENSNDEIPVIPSTSKADGDEDNYFAK 295

Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETV 323
           YLT+QKLL LQL+D+ FRRY L+Q LIL+QY  S +K +     L  +Q +++ ++TE  
Sbjct: 296 YLTSQKLLQLQLNDSQFRRYFLVQCLILYQYLISDIKFKDRSFTLNDEQLRFIIESTERC 355

Query: 324 YSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDK--------- 374
           + L+++T P G HF+  +K+IL  E+ W++WKN+GC +     T + D++K         
Sbjct: 356 FRLLRETHPKGPHFADAMKVILHREKEWSEWKNKGCLD----YTQLADKEKPSVFKKRPR 411

Query: 375 -KDEPDAKKKKTPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQ 432
            + +P       PELTKLWN + D L AC+ + R+F P L ++ E  + ++DP   VEEQ
Sbjct: 412 NRYDPSKLDLGNPELTKLWNINPDMLSACEDSRRNFMPKLATFLENPLIELDPEQLVEEQ 471

Query: 433 YKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSE---FIENMVKRCVKEKP 483
           Y+ VN+ ++ WRA RLL  +   +F   +   E  +    F+E ++K      P
Sbjct: 472 YRSVNNESFQWRASRLLMSQSTSYFQTPSKRNEVTTNLAPFLEEVIKHTATNFP 525


>gi|393911810|gb|EFO27566.2| hypothetical protein LOAG_00915 [Loa loa]
          Length = 657

 Score =  327 bits (838), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 171/474 (36%), Positives = 268/474 (56%), Gaps = 51/474 (10%)

Query: 57  VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
           ++   +++  +   +D C     + ++ D FD+S++ +CE+LF  VE N+  WK   F+ 
Sbjct: 56  LLRQLLAVTFQAAKEDFCAKGTTVTVVQDVFDVSSVKRCEELFVIVEDNLPQWKMSIFYE 115

Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
            CKN +LRMCNDLL+RLSR+ +T+FCGRIL+ LA   P  E+SGLN++S FN+ N+T++ 
Sbjct: 116 PCKNMILRMCNDLLKRLSRTVDTIFCGRILVLLAHALPLCEKSGLNLVSHFNLGNVTKYD 175

Query: 177 -GDEEMDVSSNETEETDTEEVDKVK-----IDFNFYKKFWSLQDYFRNPVQCYNKVSWKM 230
             D  + V +N TE++D  EV ++      +D+  Y KFW LQ +F +P  C+ K  W+ 
Sbjct: 176 MADSCLGVDANLTEDSDVMEVGEISEREIPVDYTLYAKFWQLQSFFSSPATCFEKSKWRT 235

Query: 231 FTSYAETVLAAFKSYKLDD------------------------VQSSLNPSGD---YFAK 263
           F      VL  F S+KL++                        + S+    GD   YFAK
Sbjct: 236 FQQNTTEVLNIFSSHKLENCIALGEGSKRERLKQENSNDEIPVIPSTSKADGDEDNYFAK 295

Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETV 323
           YLT+QKLL LQL+D+ FRRY L+Q LIL+QY  S +K +     L  +Q +++ ++TE  
Sbjct: 296 YLTSQKLLQLQLNDSQFRRYFLVQCLILYQYLISDIKFKDRSFTLNDEQLRFIIESTERC 355

Query: 324 YSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDK--------- 374
           + L+++T P G HF+  +K+IL  E+ W++WKN+GC +     T + D++K         
Sbjct: 356 FRLLRETHPKGPHFADAMKVILHREKEWSEWKNKGCLD----YTQLADKEKPSVFKKRPR 411

Query: 375 -KDEPDAKKKKTPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQ 432
            + +P       PELTKLWN + D L AC+ + R+F P L ++ E  + ++DP   VEEQ
Sbjct: 412 NRYDPSKLDLGNPELTKLWNINPDMLSACEDSRRNFMPKLATFLENPLIELDPEQLVEEQ 471

Query: 433 YKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSE---FIENMVKRCVKEKP 483
           Y+ VN+ ++ WRA RLL  +   +F   +   E  +    F+E ++K      P
Sbjct: 472 YRSVNNESFQWRASRLLMSQSTSYFQTPSKRNEVTTNLAPFLEEVIKHTATNFP 525


>gi|402591093|gb|EJW85023.1| hypothetical protein WUBG_04065 [Wuchereria bancrofti]
          Length = 657

 Score =  323 bits (828), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 174/496 (35%), Positives = 277/496 (55%), Gaps = 52/496 (10%)

Query: 36  LEQSIRQYLLKLIKTPDIDIK-VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDK 94
           +E +I +   ++ +  DI+ + ++   +++  +   +D C     + ++ D FD+S +  
Sbjct: 34  IENAIHRKAFEIAQCTDIEQESLLRQLLAVTFQAAKEDFCAKGTTVAVIQDVFDVSPVKH 93

Query: 95  CEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFP 154
           CE+LF  VE N+  WK   F+  CKN +LRMCNDLL+RLSR+ +T FCGRIL+ LA   P
Sbjct: 94  CEKLFVIVEDNLPQWKMGIFYEPCKNMILRMCNDLLKRLSRTVDTSFCGRILVLLAHALP 153

Query: 155 FSERSGLNIISEFNVENITEFG-GDEEMDVSSNETEETDTEEVDKVK-----IDFNFYKK 208
             E+SGLN++S FN+ N+T++   D    + +N TE++D  EV ++      +D+  Y K
Sbjct: 154 LCEKSGLNLVSHFNLGNVTKYDLADSCFALDANLTEDSDVMEVGEISEREIPVDYTLYAK 213

Query: 209 FWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------- 248
           FW LQ +F +P  C+ K  W+ F      VL  F S+KL+                    
Sbjct: 214 FWQLQSFFASPATCFEKSKWRTFQQNTTEVLNIFSSHKLEFSATLEEGSKRERSKQENSN 273

Query: 249 ----DVQSSLNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
                + S+    GD   YFAKYLT+QKLL LQL+D+ FRRY L+Q LIL+QY  S +K 
Sbjct: 274 DEIAAIPSTSRADGDEDIYFAKYLTSQKLLQLQLNDSQFRRYFLVQCLILYQYLVSDIKF 333

Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
           + +   L  +Q +++ ++TE  + L+++T P G HF+  +K+IL  E+ W++WKN+GC +
Sbjct: 334 KDKSFTLNDEQLRFIIESTERCFRLLRETYPKGPHFADAMKVILHREKEWSEWKNKGCLD 393

Query: 362 LKRPLTSITDEDK----------KDEPDAKKKKTPELTKLWN-SKDNLEACKSAERDFTP 410
                T + D++K          + +P       PELTKLWN + D L AC+ + R+F P
Sbjct: 394 ----YTQLADKEKPSVFKKRPRNRYDPSKLDLGNPELTKLWNINPDMLSACEDSRRNFIP 449

Query: 411 SLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSE- 469
            L ++ E+ + ++DP   VEEQY+ VN+ ++ WRA RLL  +   +F   +   E  +  
Sbjct: 450 MLVAFLEDPLDELDPEQLVEEQYRSVNNESFQWRASRLLMSQSTSYFQTPSKRSEVTTNL 509

Query: 470 --FIENMVKRCVKEKP 483
             F+E ++K      P
Sbjct: 510 APFLEEVIKHTAANFP 525


>gi|149031711|gb|EDL86661.1| rCG41283, isoform CRA_a [Rattus norvegicus]
          Length = 370

 Score =  307 bits (787), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 161/327 (49%), Positives = 211/327 (64%), Gaps = 30/327 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+ENIT F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENITVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSL 326
           K +     L  +Q  W++DTT++VY +
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQV 363


>gi|148691056|gb|EDL23003.1| THO complex 1, isoform CRA_a [Mus musculus]
          Length = 377

 Score =  307 bits (786), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 160/327 (48%), Positives = 211/327 (64%), Gaps = 30/327 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSACENVLAIISLAIGGVTESVCTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSL 326
           K +     L  +Q  W++DTT++VY +
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQV 363


>gi|417410320|gb|JAA51635.1| Putative nuclear matrix protein, partial [Desmodus rotundus]
          Length = 392

 Score =  307 bits (786), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 160/327 (48%), Positives = 211/327 (64%), Gaps = 30/327 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 44  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 99

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 100 LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 159

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 160 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 219

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 220 MGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 279

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 280 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 339

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSL 326
           K +     L  +Q  W++DTT++VY +
Sbjct: 340 KFKSSNYVLTDEQSLWIEDTTKSVYQV 366


>gi|426385304|ref|XP_004059159.1| PREDICTED: THO complex subunit 1-like [Gorilla gorilla gorilla]
 gi|50812415|gb|AAT81409.1| death domain-containing protein p84N5 short isoform [Homo sapiens]
 gi|119622137|gb|EAX01732.1| THO complex 1, isoform CRA_b [Homo sapiens]
          Length = 377

 Score =  306 bits (785), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 160/327 (48%), Positives = 211/327 (64%), Gaps = 30/327 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGILEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTV 299
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    V
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQV 336

Query: 300 KSRGEGLELKSDQEKWVKDTTETVYSL 326
           K +     L  +Q  W++DTT++VY +
Sbjct: 337 KFKSSNYVLTDEQSLWIEDTTKSVYQV 363


>gi|324502527|gb|ADY41112.1| THO complex subunit 1 [Ascaris suum]
          Length = 669

 Score =  306 bits (784), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 168/481 (34%), Positives = 270/481 (56%), Gaps = 61/481 (12%)

Query: 33  KSALEQSIRQYLLKLIKT--PDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           K+ +E+++ +  L L +T  P++  + +   ++L  ++  +++C     + ++ D FD  
Sbjct: 31  KNNVEEALHRKALALSETSEPNLLEENVRQLLALVFQIAKEELCAKASTVSVIQDVFDTI 90

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
           +++ CE+LF  VE N+++WK   F+  CKN +LRMCNDLL+RLSR+ +T FCGRIL+ LA
Sbjct: 91  SIEWCERLFVVVEDNLSLWKTPFFYEPCKNLVLRMCNDLLKRLSRTVDTSFCGRILVLLA 150

Query: 151 KFFPFSERSGLNIISEFNVENITEF-----GGDEEMDVSSNETEETDTEEVDKVKIDFNF 205
           +  P  E+SGLN++S FNV+N+T+F       D E++    E E  + +E   + +DF  
Sbjct: 151 RALPLCEKSGLNLVSHFNVDNVTKFDLVESTVDSEVNEDGEEMETGEIKEGRDIPVDFAL 210

Query: 206 YKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD---------------DV 250
           Y KFW LQ +F +P  CY++  W  F      VL    S+KL                D 
Sbjct: 211 YAKFWQLQTFFSSPNNCYDRSKWNTFQQNTNDVLNVLTSHKLGGSTQADEQSRRKRQHDG 270

Query: 251 QSSLNPS----------------GD------YFAKYLTNQKLLDLQLSDTNFRRYVLLQF 288
             S  P+                GD      +FAKY+T+QKLL LQL+D+ FRRY L+Q 
Sbjct: 271 APSSAPADVTSSSASALLNSTRDGDTVEKNAFFAKYVTSQKLLQLQLNDSQFRRYFLVQC 330

Query: 289 LILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
           L+L+QY  S VK + +   L  +Q +++ ++TE  Y L++ T P G HF+  VK IL+ E
Sbjct: 331 LVLYQYLVSDVKFKDKSYTLNDEQLRFIIESTEKCYRLLRDTHPKGSHFADSVKKILQRE 390

Query: 349 EHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK-----------TPELTKLWN-SKD 396
           + W++WKN+GC +     T + D++K   P A++ +           +P LT LW    D
Sbjct: 391 KEWSEWKNKGCAD----YTLLIDKEKMC-PIARRPRNTYKPTEIDLGSPVLTNLWKIDPD 445

Query: 397 NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHF 456
            L+AC+ ++R F PS+  + +  + ++DP   VE+QY  VN+ ++ WRA R+L  + P +
Sbjct: 446 MLDACQDSKRKFAPSMVDFLQNPLLELDPEQMVEDQYMSVNNESFQWRASRMLMHESPAY 505

Query: 457 F 457
           F
Sbjct: 506 F 506


>gi|67967745|dbj|BAE00355.1| unnamed protein product [Macaca fascicularis]
          Length = 729

 Score =  293 bits (751), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 150/296 (50%), Positives = 194/296 (65%), Gaps = 30/296 (10%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYF 295
           DD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY 
Sbjct: 277 DDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYL 332


>gi|256084105|ref|XP_002578273.1| nuclear matrix protein [Schistosoma mansoni]
          Length = 622

 Score =  293 bits (751), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 170/469 (36%), Positives = 247/469 (52%), Gaps = 64/469 (13%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           DKKS ++ ++R  +L+L+ +  +D   ++  + + V   + ++C++  P +L +D F+  
Sbjct: 41  DKKSLIDVALRGRMLRLLASDSMDD--VKTLLKMGVAAALANLCSAPTPFLLFADVFNTK 98

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
            +  CE++F ++E  +   K    F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL 
Sbjct: 99  PISLCEEMFGFMEETIITLKDVPLFASGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 158

Query: 151 KFFPFSERSGLNIISEFNVE---------NITEFGGDEEMDVSSNETEETD-TEEVDKVK 200
           + FP  E+SGLN +S FN E         + + F      D +S++ EE + T+    ++
Sbjct: 159 RLFPLDEKSGLNFMSNFNSEKDILYNKNPDPSAFKHQVSHDHTSDDLEEGEMTDSSTPLE 218

Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS---LNPS 257
           +D   Y KFWSLQ++F++P  CY K  W  FTS  +TVL  F S KL  ++        S
Sbjct: 219 VDAGLYVKFWSLQEFFKSPALCYEKAKWLRFTSSTDTVLDVFSSIKLMAIEEGDICSQSS 278

Query: 258 GDYFAKYLTNQKLLD-----LQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQ 312
              F+KYLT++K +      L+L   + R  ++ Q                    L  DQ
Sbjct: 279 SGTFSKYLTSEKTVGFFKEQLKLLQRSCRGVIIDQV-------------------LSEDQ 319

Query: 313 EKWVKDTTETVYSLI---KQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR----- 364
             WV    E V  L+         G  F   V+ IL+ E +WN+WKN+GCP   R     
Sbjct: 320 RSWVNQRHEVVLRLLSSNSPNNSSGGTFVSTVERILERESYWNRWKNDGCPSFIRTPEKS 379

Query: 365 ----------PLTSITDEDKKDEPDAKKKKTPELTKLWN-SKDNLEACKSAERDFTPSLE 413
                     PL + T +         +    EL KLWN   DNL AC+   R F P L 
Sbjct: 380 RLSVRKRHINPLVTRTGQ------KVYRFGNRELDKLWNVCPDNLAACRDKRRVFNPDLH 433

Query: 414 SYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
           SYF++AI +MDPA  VEE+YK +N   ++WRALR LSRKCPHF++N  P
Sbjct: 434 SYFQDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYINWNP 482


>gi|351709766|gb|EHB12685.1| THO complex subunit 1 [Heterocephalus glaber]
          Length = 530

 Score =  286 bits (732), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 159/350 (45%), Positives = 208/350 (59%), Gaps = 52/350 (14%)

Query: 159 SGLNIISEFNVENITEFGGDEE---------------MDVSSNETEETDTEEVDKVKIDF 203
            GLN+ S+FN+EN+T F  +E+               MDV   E  + +      + ID+
Sbjct: 85  GGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDDEAPTTCSIPIDY 144

Query: 204 NFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS------LNPS 257
           N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKLDD Q+S      L   
Sbjct: 145 NLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTG 204

Query: 258 GD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
           G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    VK +     L  +Q  W
Sbjct: 205 GEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLW 264

Query: 316 VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKK 375
           ++DTT++VY          +H       IL  EE+WN WKNEGCP   +  TS    D K
Sbjct: 265 IEDTTKSVY----------QH-------ILNTEENWNSWKNEGCPSFVKERTS----DSK 303

Query: 376 DEPDAKKKKTPE--LTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQY 433
                +K+  PE  L K  N K  +       +++ P+LE +FEEAI+Q DP   VE +Y
Sbjct: 304 PTRVVRKRTAPEDFLGKGPNKKILM------GKEYMPTLEEFFEEAIEQADPENMVESEY 357

Query: 434 KKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
           K VN+SNY WRALRLL+R+ PHFF       +   E++ENMV +  KE P
Sbjct: 358 KVVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLENMVIKLAKELP 407


>gi|444723139|gb|ELW63801.1| Collectin-12 [Tupaia chinensis]
          Length = 1142

 Score =  285 bits (729), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 158/349 (45%), Positives = 207/349 (59%), Gaps = 52/349 (14%)

Query: 160  GLNIISEFNVENITEFGGDEE---------------MDVSSNETEETDTEEVDKVKIDFN 204
            GLN+ S+FN+EN+T F  +E+               MDV   E  + +      + ID+N
Sbjct: 698  GLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGDDEAPTTCSIPIDYN 757

Query: 205  FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS------LNPSG 258
             Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKLDD Q+S      L   G
Sbjct: 758  LYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTGG 817

Query: 259  D--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWV 316
            +  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    VK +     L  +Q  W+
Sbjct: 818  EHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWI 877

Query: 317  KDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKD 376
            +DTT++VY          +H       IL  EE+WN WKNEGCP   +   S    D K 
Sbjct: 878  EDTTKSVY----------QH-------ILNTEENWNSWKNEGCPSFVKERAS----DTKP 916

Query: 377  EPDAKKKKTPE--LTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYK 434
                +K+  PE  L K  N K  +       +++ P+LE +FEEAI+Q DP   VE +YK
Sbjct: 917  ARVVRKRTAPEDFLGKGPNKKILM------GKEYMPTLEEFFEEAIEQADPENMVENEYK 970

Query: 435  KVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
             VN+SNY WRALRLL+R+ PHFF       +   E++ENMV +  KE P
Sbjct: 971  AVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLENMVIKLAKELP 1019


>gi|308464438|ref|XP_003094486.1| CRE-THOC-1 protein [Caenorhabditis remanei]
 gi|308247715|gb|EFO91667.1| CRE-THOC-1 protein [Caenorhabditis remanei]
          Length = 682

 Score =  284 bits (726), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 166/437 (37%), Positives = 240/437 (54%), Gaps = 37/437 (8%)

Query: 69  CMK-DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCN 127
           CMK   C+ + PI+ L D  +MS++++C+QLF  VE N+N +KQ  F  + +N +LR+CN
Sbjct: 67  CMKLGYCSKSTPILTLQDLLEMSSIEECKQLFGLVEENMNEFKQPGFIETYQNTILRLCN 126

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE------- 180
           DLLRRLSR+  T FCGRI+ FL++F P +E+SG+N +  FN  NIT +   E        
Sbjct: 127 DLLRRLSRTAETSFCGRIMFFLSRFLPLTEKSGVNFMGHFNTLNITNYEETETTAESLLT 186

Query: 181 ---------MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMF 231
                    +D    ET E + ++  ++ +    Y++FWSLQ +  NP   Y K  + MF
Sbjct: 187 AASSRRPTPVDGEEMETGELEEDQAKEINVTSEMYRQFWSLQKFMSNPNLLYEKDKFAMF 246

Query: 232 TSYAETVLAAFKSYKLDDVQSSLNP----------SGD-YFAKYLTNQKLLDLQLSDTNF 280
                 VL+   + KL+ +                S D +F KYLT+ KLL LQL+D++F
Sbjct: 247 KKDLSAVLSLMTANKLEKLNQEEESEYTENKQKKVSNDVFFTKYLTSPKLLALQLNDSSF 306

Query: 281 RRYVLLQFLILFQYFT--STVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFS 338
           RRY L+Q +I+FQY    S  K   + + L  +Q K+V D  +  Y L+  T P G  F 
Sbjct: 307 RRYFLIQTIIIFQYLNAESRFKPPAKKMVLNEEQAKYVTDHEDKCYRLLSDTMPRGAVFV 366

Query: 339 QVVKLILKGEEHWNQWKNEGCPEL--KRPLTSITDEDKKDE----PDAKKKKTPELTKLW 392
             +K I++ E+ WN WKN  CP+L  K    ++    K+      P +    TPELTKLW
Sbjct: 367 AGLKKIIQREQEWNTWKNSNCPDLSEKADKGAMQMYKKRQRIPFNPQSLDLGTPELTKLW 426

Query: 393 -NSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
            N  D L+ACKS +R F P+L ++  + + +MDP   VEEQYK +ND ++ WRA RLL  
Sbjct: 427 TNEPDVLQACKSDKRKFIPTLANFIRDPLDEMDPEQQVEEQYKSINDISFQWRAARLLMH 486

Query: 452 KCPHFFLNATPNVEKNS 468
           K P +        E NS
Sbjct: 487 KSPGYVSRTDIKSEPNS 503


>gi|449689006|ref|XP_002155851.2| PREDICTED: THO complex subunit 1-like, partial [Hydra
           magnipapillata]
          Length = 404

 Score =  282 bits (722), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 146/301 (48%), Positives = 195/301 (64%), Gaps = 19/301 (6%)

Query: 73  MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRR 132
            C   LPI+LLSD FD  TL  CE++F Y+E NV+IW  + FF + KN+LLRMCND+LRR
Sbjct: 107 FCTKNLPILLLSDIFDTMTLCDCEKMFKYIEKNVSIWNAEPFFSTGKNHLLRMCNDMLRR 166

Query: 133 LSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMD---VSSNETE 189
           LS+SQNTVFCGRI LFLA+ FP  E+S LN++S F++EN+T F   E++D   +  N++ 
Sbjct: 167 LSKSQNTVFCGRIQLFLAQLFPVEEKSALNLMSNFHLENVTLFKT-EKLDFDEIQPNDSV 225

Query: 190 ETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD- 248
           E D E V+   +D+N Y KFW+LQDYF NP +CY K  W +  +    VL  F S+KL+ 
Sbjct: 226 E-DRELVNN-PVDYNLYIKFWTLQDYFVNPSKCYTKEGWNLINTNISEVLKIFTSFKLEF 283

Query: 249 ------------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT 296
                       DV      + +YFAKYLTN+KL + QL+D+NFRR  L+Q LILFQY +
Sbjct: 284 NVRKRKSIDMPIDVDQIKQTANNYFAKYLTNEKLFNFQLNDSNFRRNFLVQVLILFQYLS 343

Query: 297 STVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN 356
            T K +     L   Q  W+ +TT+ VY+ I +TPPDG+ F + +K IL  EE+W  WKN
Sbjct: 344 GTSKFKAPNQVLNDTQTNWINETTDIVYNSICETPPDGKKFCETIKHILSREENWINWKN 403

Query: 357 E 357
           E
Sbjct: 404 E 404


>gi|349604757|gb|AEQ00218.1| THO complex subunit 1-like protein, partial [Equus caballus]
          Length = 268

 Score =  282 bits (721), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 142/267 (53%), Positives = 180/267 (67%), Gaps = 23/267 (8%)

Query: 81  ILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTV 140
           +LL D  D   LD+C+ +F   E NV  WK  TF+ + KN LLRMC DLLRRLS+SQNTV
Sbjct: 1   VLLGDVLDCLPLDQCDTIFTLGEKNVATWKSNTFYSAGKNYLLRMCQDLLRRLSKSQNTV 60

Query: 141 FCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSS 185
           FCGRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV  
Sbjct: 61  FCGRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEE 120

Query: 186 NETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSY 245
            E  + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSY
Sbjct: 121 GEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSY 180

Query: 246 KLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS 297
           KLDD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQY   
Sbjct: 181 KLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKG 240

Query: 298 TVKSRGEGLELKSDQEKWVKDTTETVY 324
            VK +     L  +Q  W++DTT++VY
Sbjct: 241 QVKFKSSNYVLTDEQSLWIEDTTKSVY 267


>gi|341893668|gb|EGT49603.1| hypothetical protein CAEBREN_23001 [Caenorhabditis brenneri]
          Length = 650

 Score =  281 bits (720), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 157/424 (37%), Positives = 228/424 (53%), Gaps = 36/424 (8%)

Query: 69  CMK-DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCN 127
           CMK   C+ + PI+ L D  + S+++ C++LF  VE N+N +KQ  F  +C+NN+LR+CN
Sbjct: 67  CMKLGFCSRSTPILTLQDLMETSSIEACKELFGLVEENMNEFKQPGFIETCQNNILRLCN 126

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF------------ 175
           DLLRRLSR+  T FCGRI+ FL++F P +E+SG+N +  FN  NIT +            
Sbjct: 127 DLLRRLSRTAETSFCGRIMFFLSRFLPLTEKSGVNFMGHFNTLNITNYEETDAAAEALIA 186

Query: 176 ----GGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMF 231
                    +D    E  E + E   ++ +    Y++FWSLQ +  NP   Y K  +  F
Sbjct: 187 ATSSKAQTPVDGEEMEIGEIEEEGAKEIVVTPEMYRQFWSLQKFMSNPNLLYEKDKFLAF 246

Query: 232 TSYAETVLAAFKSYKLDDVQSSLNPSGD----------YFAKYLTNQKLLDLQLSDTNFR 281
                 VL+   + KL+ + +      D          +F KYLT+ KLL LQL+D++FR
Sbjct: 247 KRDLSAVLSLMTANKLEKLSAEEEEMADSRQKKVSNDVFFTKYLTSPKLLALQLNDSSFR 306

Query: 282 RYVLLQFLILFQYFT--STVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQ 339
           RY L+Q +I+FQY    S  K   + + L  +Q K+V DT +  Y L+  T P G  F  
Sbjct: 307 RYFLIQAIIIFQYLNAESRFKPPAKKMVLNEEQGKYVTDTEDKCYRLLSDTMPRGTAFVA 366

Query: 340 VVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDE------PDAKKKKTPELTKLW- 392
            +K I+  E+ WN WKN  CP+          +  K        P +    TPELTKLW 
Sbjct: 367 GLKKIMSREQEWNVWKNNNCPDFSEKADKGAMQMYKKRARVPFNPTSLDLGTPELTKLWT 426

Query: 393 NSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
           N  D L+AC++ +R F P+L  +  + + +MDP   VEEQYK +ND+ + WRA RLL  K
Sbjct: 427 NEPDVLQACRAEKRKFIPTLADFIRDPLDEMDPEQQVEEQYKSINDAAFQWRAARLLMHK 486

Query: 453 CPHF 456
            P +
Sbjct: 487 SPGY 490


>gi|170590342|ref|XP_001899931.1| THO complex subunit 1 [Brugia malayi]
 gi|158592563|gb|EDP31161.1| THO complex subunit 1, putative [Brugia malayi]
          Length = 480

 Score =  281 bits (720), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 162/467 (34%), Positives = 249/467 (53%), Gaps = 77/467 (16%)

Query: 36  LEQSIRQYLLKLIKTPDIDIK-VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDK 94
           +E +I +   ++ +  DI+ + ++   +++  +   +D C     + ++ D FD+S +  
Sbjct: 34  IENAIHRKAFEIAQCTDIEQESLLRQLLAVTFQAAKEDFCAKGTTVAVIQDVFDVSPVKH 93

Query: 95  CEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFP 154
           CE+LF  VE N+  WK   F+  CKN +LRMCNDLL+RLSR+ +T FCGRIL+ LA   P
Sbjct: 94  CEKLFAIVEDNLPQWKMGIFYEPCKNMILRMCNDLLKRLSRTVDTSFCGRILVLLAHALP 153

Query: 155 FSERSGLNIISEFNVENITEFG-GDEEMDVSSNETEETDTEEVDKVK-----IDFNFYKK 208
             E+SGLN++S FN+ NIT++   D    V +N TE++D  EV ++      +D+  Y K
Sbjct: 154 LCEKSGLNLVSHFNLGNITKYDLADSCFAVDANLTEDSDVMEVGEISEREIPVDYTLYAK 213

Query: 209 FWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL--------------------- 247
           FW LQ +F +P  C+ K  W+ F      VL  F S+KL                     
Sbjct: 214 FWQLQSFFSSPATCFEKSKWRTFQQNTTEVLNIFSSHKLEFSATLEEGSKRERSKQENSN 273

Query: 248 DDVQ---SSLNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
           DD+    S+    GD   YFAKYLT+QKLL LQL+D+ FRRY L+Q LIL+QY  S +K 
Sbjct: 274 DDIAAIPSTSRADGDEDIYFAKYLTSQKLLQLQLNDSQFRRYFLVQCLILYQYLVSDIKF 333

Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
           + +   L  +Q +++ ++TE  + L+++T P G HF+  +K+IL  E+ W++WKN+GC +
Sbjct: 334 KDKSFTLNDEQLRFIIESTERCFRLLRETYPKGPHFADAMKVILHREKEWSEWKNKGCLD 393

Query: 362 LKRPLTSITDEDK----------KDEPDAKKKKTPELTKLWN-SKDNLEACKSAERDFTP 410
                T + D++K          + +P       PELTKLWN + D L AC+ + R    
Sbjct: 394 ----YTQLADKEKPSVFKKRPRNRYDPSKLDLGNPELTKLWNINPDMLSACEDSRRS--- 446

Query: 411 SLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
                                    VN+ ++ WRA RLL  +   +F
Sbjct: 447 -------------------------VNNESFQWRASRLLMSQSTSYF 468


>gi|71994361|ref|NP_493796.2| Protein THOC-1 [Caenorhabditis elegans]
 gi|351064857|emb|CCD73573.1| Protein THOC-1 [Caenorhabditis elegans]
          Length = 665

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 161/436 (36%), Positives = 240/436 (55%), Gaps = 42/436 (9%)

Query: 60  NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK 119
           +++  C +L    +C+   P+  L D  +MS++++C+Q+F  VE N+N +KQ  F  + +
Sbjct: 62  HFVMNCSKL---GLCSKNTPLSTLQDLLEMSSIEECKQIFSIVEENMNEFKQPGFIETAQ 118

Query: 120 NNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENIT-----E 174
           NN+LR CNDLLRRLSR+  T FCGRI+ FL++F P +E+SG+N +  FN  N+T     E
Sbjct: 119 NNILRFCNDLLRRLSRTAETSFCGRIMFFLSRFLPLTEKSGVNFMGHFNTLNVTNYDESE 178

Query: 175 FGGDEEMDVSSN-----------ETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCY 223
             G+  +  +S+           ET E + +   ++++    Y++FWSLQ +  NP   Y
Sbjct: 179 TDGEALLAATSSAPTPTEGAEDMETGEIEEDNSKEIQVTPEMYRQFWSLQKFMSNPNSIY 238

Query: 224 NKVSWKMFTSYAETVLAAFKSYKLDDV--------------QSSLNPSGDYFAKYLTNQK 269
            K  +  F +    VL    S KL+ +              Q   + +  +F KYLT+ K
Sbjct: 239 EKEKFLTFKTDLTAVLTLMTSNKLEKLSSEEEEALENRQKKQQKGSSNDVFFTKYLTSPK 298

Query: 270 LLDLQLSDTNFRRYVLLQFLILFQYFT--STVKSRGEGLELKSDQEKWVKDTTETVYSLI 327
           LL LQL+D++FRRY L+Q +I+FQY T  S  K   + + L  DQ K+V +  +  Y L+
Sbjct: 299 LLALQLNDSSFRRYFLMQAIIIFQYLTAESRFKPPAKKMVLNEDQAKYVSECEDKCYRLL 358

Query: 328 KQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL--KRPLTSITDEDKKDE----PDAK 381
             T P G  F   +K I+  E+ WN WKN  C +   K    ++    K+      P++ 
Sbjct: 359 ADTMPRGTAFVAGLKRIMLREQEWNTWKNANCADFSEKADKGAMQMYKKRQRIPFNPNSL 418

Query: 382 KKKTPELTKLW-NSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSN 440
              TPELTKLW N  D L+ACKS +R F P L  +  + I +MDP   VEEQYK++NDS 
Sbjct: 419 DLGTPELTKLWTNEPDVLKACKSDKRKFIPKLPDFIRDPIDEMDPEQQVEEQYKQINDSA 478

Query: 441 YAWRALRLLSRKCPHF 456
           + WRA RLL  K P +
Sbjct: 479 FQWRAARLLMHKSPGY 494


>gi|345309431|ref|XP_001521474.2| PREDICTED: THO complex subunit 1-like, partial [Ornithorhynchus
           anatinus]
          Length = 361

 Score =  260 bits (665), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 137/279 (49%), Positives = 181/279 (64%), Gaps = 22/279 (7%)

Query: 227 SWKMFTSYAETVLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDT 278
           S+K+   Y+E VLA F+SYKLDD Q+S      L   G+  YFAK+LT++KL+DLQLSD+
Sbjct: 59  SFKLMIRYSEEVLAVFRSYKLDDTQASRKKLEELKAGGEHVYFAKFLTSEKLMDLQLSDS 118

Query: 279 NFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFS 338
           NFRR++LLQ+LILFQY    VK +     L  +Q  W++DTT++VY L+ + PPDGE FS
Sbjct: 119 NFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFS 178

Query: 339 QVVKLILKGEEHWNQWKNEGCPEL--KRPLTSITDE--DKKDEPDAKKKKTP-------- 386
           ++V+ IL  EE+WN WKNEGCP    +RP          K+  P+    K P        
Sbjct: 179 KMVEHILNTEENWNSWKNEGCPSFVKERPSDPKPPRLVRKRTAPEDFLGKGPNKKILMGN 238

Query: 387 -ELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWR 444
            ELT+LWN   DN+EACKS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WR
Sbjct: 239 EELTRLWNLCPDNMEACKSESREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWR 298

Query: 445 ALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
           ALRLL+R+ PHFF       +   E++ENMV +  KE P
Sbjct: 299 ALRLLARRSPHFFQPTNQQFKSLPEYLENMVIKLAKELP 337


>gi|390351008|ref|XP_793772.3| PREDICTED: THO complex subunit 1-like [Strongylocentrotus
           purpuratus]
          Length = 647

 Score =  258 bits (658), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 143/329 (43%), Positives = 193/329 (58%), Gaps = 41/329 (12%)

Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP---- 256
           +D+  Y++FW+LQD+FR P+QCY K  WK F++ A  VL AF S KL+DV S        
Sbjct: 169 VDYKLYRQFWALQDFFRAPLQCYTKEKWKDFSNNAIAVLEAFHSMKLEDVASGKKKKKRR 228

Query: 257 ----------------SGDY--FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTST 298
                           SGD   FAKYLT++KL DLQLSD  FRRYV +QFLILFQY    
Sbjct: 229 SHHRQQQKQGEEEEADSGDQVSFAKYLTSEKLFDLQLSDAMFRRYVYVQFLILFQYLNQQ 288

Query: 299 VKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEG 358
           VK +     L  +   ++K+T + V +L+K+TPP G+ F+  ++ IL  EEHWN WKNEG
Sbjct: 289 VKFKQAHHVLTDEMSLFIKETKDKVVALLKETPPYGDQFTAGIEHILSREEHWNAWKNEG 348

Query: 359 CP---------ELKRPLTSITDEDKKDE---PDAKKK---KTPELTKLWNS-KDNLEACK 402
           CP         E  RP +        +E     A KK    +PELTKLWN    NLEAC 
Sbjct: 349 CPSYVKEKGKTEQARPKSRARKRTLGEELNLAGAAKKIDLGSPELTKLWNVYPSNLEACA 408

Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT- 461
           + +R F P +E +F++AI Q DP A +E +YK VN+SNYAW+A+RLL+++ PHFF   T 
Sbjct: 409 AEDRIFLPQVEEFFQDAIDQADPEAMIEPEYKVVNNSNYAWQAIRLLAKRSPHFFQTVTA 468

Query: 462 --PNVEKNSEFIENMVKRCVKEKPSSQIS 488
             P ++    ++E MV +  KE P ++ S
Sbjct: 469 TNPPLKTVPGYLEYMVTKIAKELPQNEES 497



 Score =  109 bits (273), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 50/116 (43%), Positives = 78/116 (67%), Gaps = 10/116 (8%)

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-------E 180
           +LLRRLS+SQ+ VFCGRI LFLA+FFP  E+SGLN++S F+++N+T +   +        
Sbjct: 20  NLLRRLSKSQDMVFCGRIQLFLARFFPLDEKSGLNLMSNFHLDNVTSYNRTKLEGPLRTR 79

Query: 181 MDVSSNETEETDTEEVD---KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTS 233
           +  + ++ +  D EE +      +D+  Y++FW+LQD+FR P+QCY K  WK F++
Sbjct: 80  VPQAIDKQDSMDYEEGEMGSATPVDYKLYRQFWALQDFFRAPLQCYTKEKWKDFSN 135


>gi|296087837|emb|CBI35093.3| unnamed protein product [Vitis vinifera]
          Length = 613

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 171/490 (34%), Positives = 249/490 (50%), Gaps = 47/490 (9%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I   + + + LC K+     +   LL D  +MST+  C+ +F Y+E   +I  +Q  F  
Sbjct: 93  IPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDIFAYIESKQDILGKQELFAR 152

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI   FN  N T++  
Sbjct: 153 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY-- 210

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
                       E D  E   + IDFNFYK FWSLQ++F NP         W+ FTS   
Sbjct: 211 ------------EKDAPE--GISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFTSNLM 256

Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
            VL  F++  L D + + N     +  +  KYLT+ KL+ L+L D +FRR++L+Q LILF
Sbjct: 257 VVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCLILF 316

Query: 293 QYFTSTVKSRGEGLELKSDQEK-WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
            Y  +  K+     +L SD  K  +K   E V  L++ TPP G+ F   ++ IL+ E++W
Sbjct: 317 DYLKAPGKNDK---DLPSDSMKEEIKSCEERVKKLLETTPPKGKEFLHNIEHILEREKNW 373

Query: 352 NQWKNEGCPELKR-PLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEACK 402
             WK +GCP  +R P+      +KK   D  KK+ P       EL++LW   D N  A  
Sbjct: 374 VWWKRDGCPPFERQPI------EKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALT 427

Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
             +R  TP++  Y++   + MD +A +E +Y   N+  Y W+ LR  +R+    F   T 
Sbjct: 428 DPQRVRTPAVSEYWKPLAEDMDSSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTE 487

Query: 463 -NVEK--NSEFIENMVKRCVKEKPSSQISG----NGNGVDQDPAEVEVDTKSEEIQEEEK 515
             +E     E + + V+   + KPS +          G  Q   E ++ T + EI  E  
Sbjct: 488 YGIEGVVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGT 547

Query: 516 EEDWEAKADP 525
             D EA A P
Sbjct: 548 RVDLEASAAP 557


>gi|359487131|ref|XP_002264619.2| PREDICTED: THO complex subunit 1-like [Vitis vinifera]
          Length = 601

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 171/490 (34%), Positives = 249/490 (50%), Gaps = 47/490 (9%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I   + + + LC K+     +   LL D  +MST+  C+ +F Y+E   +I  +Q  F  
Sbjct: 81  IPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDIFAYIESKQDILGKQELFAR 140

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI   FN  N T++  
Sbjct: 141 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY-- 198

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
                       E D  E   + IDFNFYK FWSLQ++F NP         W+ FTS   
Sbjct: 199 ------------EKDAPE--GISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFTSNLM 244

Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
            VL  F++  L D + + N     +  +  KYLT+ KL+ L+L D +FRR++L+Q LILF
Sbjct: 245 VVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCLILF 304

Query: 293 QYFTSTVKSRGEGLELKSDQEK-WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
            Y  +  K+     +L SD  K  +K   E V  L++ TPP G+ F   ++ IL+ E++W
Sbjct: 305 DYLKAPGKNDK---DLPSDSMKEEIKSCEERVKKLLEMTPPKGKEFLHNIEHILEREKNW 361

Query: 352 NQWKNEGCPELKR-PLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEACK 402
             WK +GCP  +R P+      +KK   D  KK+ P       EL++LW   D N  A  
Sbjct: 362 VWWKRDGCPPFERQPI------EKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALT 415

Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
             +R  TP++  Y++   + MD +A +E +Y   N+  Y W+ LR  +R+    F   T 
Sbjct: 416 DPQRARTPAVSEYWKPLAEDMDLSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTE 475

Query: 463 -NVEK--NSEFIENMVKRCVKEKPSSQISG----NGNGVDQDPAEVEVDTKSEEIQEEEK 515
             +E     E + + V+   + KPS +          G  Q   E ++ T + EI  E  
Sbjct: 476 YGIEGVVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGT 535

Query: 516 EEDWEAKADP 525
             D EA A P
Sbjct: 536 RVDLEASAAP 545


>gi|225463994|ref|XP_002263874.1| PREDICTED: THO complex subunit 1-like [Vitis vinifera]
          Length = 607

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 171/490 (34%), Positives = 249/490 (50%), Gaps = 47/490 (9%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I   + + + LC K+     +   LL D  +MST+  C+ +F Y+E   +I  +Q  F  
Sbjct: 87  IPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDIFAYIESKQDILGKQELFAR 146

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI   FN  N T++  
Sbjct: 147 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY-- 204

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
                       E D  E   + IDFNFYK FWSLQ++F NP         W+ FTS   
Sbjct: 205 ------------EKDAPE--GISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFTSNLM 250

Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
            VL  F++  L D + + N     +  +  KYLT+ KL+ L+L D +FRR++L+Q LILF
Sbjct: 251 VVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCLILF 310

Query: 293 QYFTSTVKSRGEGLELKSDQEK-WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
            Y  +  K+     +L SD  K  +K   E V  L++ TPP G+ F   ++ IL+ E++W
Sbjct: 311 DYLKAPGKNDK---DLPSDSMKEEIKSCEERVKKLLETTPPKGKEFLHNIEHILEREKNW 367

Query: 352 NQWKNEGCPELKR-PLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEACK 402
             WK +GCP  +R P+      +KK   D  KK+ P       EL++LW   D N  A  
Sbjct: 368 VWWKRDGCPPFERQPI------EKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALT 421

Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
             +R  TP++  Y++   + MD +A +E +Y   N+  Y W+ LR  +R+    F   T 
Sbjct: 422 DPQRVRTPAVSEYWKPLAEDMDSSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTE 481

Query: 463 -NVEK--NSEFIENMVKRCVKEKPSSQISG----NGNGVDQDPAEVEVDTKSEEIQEEEK 515
             +E     E + + V+   + KPS +          G  Q   E ++ T + EI  E  
Sbjct: 482 YGIEGVVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGT 541

Query: 516 EEDWEAKADP 525
             D EA A P
Sbjct: 542 RVDLEASAAP 551


>gi|296087823|emb|CBI35079.3| unnamed protein product [Vitis vinifera]
          Length = 613

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 171/490 (34%), Positives = 249/490 (50%), Gaps = 47/490 (9%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I   + + + LC K+     +   LL D  +MST+  C+ +F Y+E   +I  +Q  F  
Sbjct: 93  IPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTMRNCKDIFAYIESKQDILGKQELFAR 152

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI   FN  N T++  
Sbjct: 153 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY-- 210

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
                       E D  E   + IDFNFYK FWSLQ++F NP         W+ FTS   
Sbjct: 211 ------------EKDAPE--GISIDFNFYKTFWSLQEHFCNPASISLAPTKWQKFTSNLM 256

Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
            VL  F++  L D + + N     +  +  KYLT+ KL+ L+L D +FRR++L+Q LILF
Sbjct: 257 VVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCLILF 316

Query: 293 QYFTSTVKSRGEGLELKSDQEK-WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
            Y  +  K+     +L SD  K  +K   E V  L++ TPP G+ F   ++ IL+ E++W
Sbjct: 317 DYLKAPGKNDK---DLPSDSMKEEIKSCEERVKKLLEMTPPKGKEFLHNIEHILEREKNW 373

Query: 352 NQWKNEGCPELKR-PLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEACK 402
             WK +GCP  +R P+      +KK   D  KK+ P       EL++LW   D N  A  
Sbjct: 374 VWWKRDGCPPFERQPI------EKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNALT 427

Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
             +R  TP++  Y++   + MD +A +E +Y   N+  Y W+ LR  +R+    F   T 
Sbjct: 428 DPQRARTPAVSEYWKPLAEDMDLSAGIEAEYHHKNNRVYCWKGLRFAARQDLDGFSRFTE 487

Query: 463 -NVEK--NSEFIENMVKRCVKEKPSSQISG----NGNGVDQDPAEVEVDTKSEEIQEEEK 515
             +E     E + + V+   + KPS +          G  Q   E ++ T + EI  E  
Sbjct: 488 YGIEGVVPMELLPSDVRSKYQAKPSDRSKRAKKEETKGAAQQAEENQIATPASEIDGEGT 547

Query: 516 EEDWEAKADP 525
             D EA A P
Sbjct: 548 RVDLEASAAP 557


>gi|357114422|ref|XP_003558999.1| PREDICTED: THO complex subunit 1-like [Brachypodium distachyon]
          Length = 630

 Score =  253 bits (647), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 171/531 (32%), Positives = 258/531 (48%), Gaps = 78/531 (14%)

Query: 57  VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
           +I + + + + LC +      +   LL D  DMST+  C+ +F Y+E   ++  +Q  F 
Sbjct: 122 LITHLLDIMLYLCERGHVEGGMVFQLLEDLTDMSTIKDCKDVFGYIESKQDVLGKQELFG 181

Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
             K  +LR CN LLRRLS+S + VFCGRI++FLA FFP SERS LNI   FN  N T++ 
Sbjct: 182 RGKLVMLRTCNQLLRRLSKSNDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNETKY- 240

Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYA 235
              E D +              + +DFNFY+  WSLQ++FRNP +   N   W+ F S  
Sbjct: 241 ---EKDATGG------------ISVDFNFYQTLWSLQEHFRNPALTTTNPTKWQKFASNL 285

Query: 236 ETVLAAFKSYKLDDVQSSLN-----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLI 290
             VL  F++  L D     N         +  KYLT+ KL+ L+L D +FRR++L+Q LI
Sbjct: 286 TVVLNTFEAQPLCDDDGKHNNLEQEEDAAFNIKYLTSSKLMGLELKDASFRRHILVQCLI 345

Query: 291 LFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
            F Y  +  KS  EG      +E  +K   E V +L++  PP G+ F Q ++ IL+ E++
Sbjct: 346 FFDYLKAPGKSDKEGPSESMKEE--IKSCEERVKNLLEMIPPKGKEFLQSIEHILEREKN 403

Query: 351 WNQWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKKTP-------ELTKLWN-SKDNLEAC 401
           W  WK +GCP   K+P         + +P   +K+ P       EL +LW  ++ N  A 
Sbjct: 404 WVWWKRDGCPAFEKQPF--------EKKPGGVRKRKPRWRLGNKELAQLWKWAELNPNAL 455

Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRL--------LSRKC 453
              +R  TPS+  Y++     MD +A +EE+Y   N+  Y W+ LR          SR C
Sbjct: 456 TDPDRVRTPSVTEYWKPLADDMDASAGIEEEYHHKNNRVYCWKGLRFSARQDLDGFSRFC 515

Query: 454 PHFFLNATPN--------VEKNSEFIENMVKRCVKE-------KPSSQ-------ISGNG 491
            +      P          + NS+  E   KR  +E       +P  Q         G G
Sbjct: 516 EYGIEGVVPTELLPPEVRAKYNSKPGEK-AKRPKREDTKGASAQPKEQQVAATPETDGGG 574

Query: 492 NGVDQDPAEVEVDTKSEEIQEEEKEEDWE------AKADPEGDADEVMSVE 536
           +G DQ+   V +D+ +  I++ +K+   E       + +PE D D+ +  E
Sbjct: 575 SGADQEEGAVPMDSDNVAIEDGQKQSPGEVSGPESGQCEPEDDVDDNVKSE 625


>gi|334187564|ref|NP_001190269.1| THO complex subunit 1 [Arabidopsis thaliana]
 gi|332004074|gb|AED91457.1| THO complex subunit 1 [Arabidopsis thaliana]
          Length = 598

 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 167/488 (34%), Positives = 247/488 (50%), Gaps = 59/488 (12%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I + + + + LC K+     +   LL D  +MST+  C+ +F Y+E   +I  +Q  F  
Sbjct: 84  IPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKDVFGYIESKQDILGKQELFAR 143

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI   FN  N T++  
Sbjct: 144 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK 203

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVS-WKMFTSYAE 236
           D    +S                +DFNFYK FWSLQ+YF NP    +  + W+ F+S   
Sbjct: 204 DPPKGIS----------------VDFNFYKTFWSLQEYFCNPASLTSASTKWQKFSSSLA 247

Query: 237 TVLAAFKSYKLDDVQSSLNP----SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
            VL  F +  L + +   N     +  +  KYLT+ KL+ L+L D++FRR++LLQ LI+F
Sbjct: 248 VVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSKLMGLELKDSSFRRHILLQCLIMF 307

Query: 293 QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWN 352
            Y  +  K+     +L S+  + +K   + V  L++ TPP G+ F + V+ IL+ E++W 
Sbjct: 308 DYLRAPGKNDK---DLPSETMEELKSCEDRVKKLLEITPPKGKEFLRAVEHILEREKNWV 364

Query: 353 QWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKK--------TPELTKLWNSKD-NLEACK 402
            WK +GCP   K+P+      DKK  P+A +KK          EL++LW   D N  A  
Sbjct: 365 WWKRDGCPPFEKQPI------DKKS-PNAGQKKRRQRWRLGNKELSQLWRWADQNPNALT 417

Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT- 461
            ++R  TP +  Y++   + MDP+A +E++Y   N+  Y W+ LR  +R+    F   T 
Sbjct: 418 DSQRVRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFTARQDLEGFSRFTE 477

Query: 462 -------------PNVEKNSEFIEN-MVKRCVKEK---PSSQISGNGNGVDQDPAEVEVD 504
                        P V    +   N   KR  KE+    S +  GN  GV    AE E  
Sbjct: 478 MGIEGVVPVELLPPEVRSKYQAKPNEKAKRAKKEETKGGSHETEGNQIGVSNSEAEAEGG 537

Query: 505 TKSEEIQE 512
               E  E
Sbjct: 538 RGDAETME 545


>gi|242042539|ref|XP_002468664.1| hypothetical protein SORBIDRAFT_01g049910 [Sorghum bicolor]
 gi|241922518|gb|EER95662.1| hypothetical protein SORBIDRAFT_01g049910 [Sorghum bicolor]
          Length = 637

 Score =  253 bits (645), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 162/528 (30%), Positives = 258/528 (48%), Gaps = 72/528 (13%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I   + + + LC +      +   LL D  +MST+  C+ +F Y+E   ++  +Q  F  
Sbjct: 122 ITRLLDIVLYLCERGHVEGGMVFQLLEDLTEMSTIKDCKDIFGYIESQQDVLGKQELFGR 181

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRI++FLA FFP SERS LNI   FN  N+T++  
Sbjct: 182 GKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNVTKY-- 239

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYAE 236
                         + + +D + +DFNFYK  WSLQ++F NP +   N   W+ F+S   
Sbjct: 240 --------------EKDAMDGISVDFNFYKTLWSLQEHFSNPALTNTNPAKWQKFSSNLA 285

Query: 237 TVLAAFKSYKLDDVQSSLN-----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLIL 291
            VL+ F++  L D    LN         +  KYLT+ KL+ L+L D +FRR++L+Q LI 
Sbjct: 286 VVLSTFEAQPLSDDDGKLNNLNEEEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLIF 345

Query: 292 FQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
           F Y  +  K+  EG      +E  +K   E V  L++  PP G+ F + ++ IL+ E++W
Sbjct: 346 FDYLKAPGKNDKEGPTGSMKEE--IKSCEEHVKKLLEIIPPKGKEFLKSIEHILEREKNW 403

Query: 352 NQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK------TPELTKLWN-SKDNLEACKSA 404
             WK +GC   ++P      E K  +   +K+K        EL++LW  ++ N       
Sbjct: 404 VWWKRDGCLAFEKPPF----EKKPGQAGGRKRKPRWRLGNKELSQLWKWAEQNPNVLTDP 459

Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK------------ 452
           +R   PS+  Y++   + MDP+A +EE+Y   ++  Y W+ LR  +R+            
Sbjct: 460 DRVRMPSITEYWKPLAEDMDPSAGIEEEYHHKSNRVYCWKGLRFSARQDLDGFARFSDYG 519

Query: 453 ----CPHFFLNATPNVEKNSEFIENMVKRCVKE-------KPSSQ-------ISGNGNGV 494
                P   L    N   +S+  E  VKR  +E       +P  Q         G G+G 
Sbjct: 520 IEGVVPSELLPPEVNARFSSKPAEK-VKRTRREDSKGMSAQPKEQQVAATPETDGGGSGG 578

Query: 495 DQDPAEVEVDTKSEEIQEEEKEEDWE------AKADPEGDADEVMSVE 536
           D +     +D+ +  +++ +K    E       + +PE DAD+ +  E
Sbjct: 579 DPEEGAAPMDSDNGGVEDSQKRSPGEVSGPESGQCEPEADADDNVKTE 626


>gi|297807009|ref|XP_002871388.1| hypothetical protein ARALYDRAFT_908935 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317225|gb|EFH47647.1| hypothetical protein ARALYDRAFT_908935 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 597

 Score =  251 bits (640), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 174/509 (34%), Positives = 259/509 (50%), Gaps = 60/509 (11%)

Query: 38  QSIRQYLLKLIKTPDIDIKV-IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCE 96
           + I QY  +LI   D +I   I + + + + LC K+     +   LL D  +MST+  C+
Sbjct: 62  EQIMQYG-QLIDDDDDNIHGQIPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCK 120

Query: 97  QLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFS 156
            +F Y+E   +I  +Q  F   K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP S
Sbjct: 121 DVFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLS 180

Query: 157 ERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYF 216
           ERS +NI   FN  N T++  D    +S                +DFNFYK FWSLQ+YF
Sbjct: 181 ERSAVNIKGVFNTSNETKYEKDPPKGIS----------------VDFNFYKTFWSLQEYF 224

Query: 217 RNPVQCYNKVS-WKMFTSYAETVLAAFKSYKLDDVQSSLNP----SGDYFAKYLTNQKLL 271
            NP    +  + W+ F+S    VL  F +  L + +   N     +  +  KYLT+ KL+
Sbjct: 225 CNPASLISASTKWQKFSSSLAVVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSKLM 284

Query: 272 DLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTP 331
            L+L D++FRR++LLQ LI+F Y  +  K+  + L  ++ +E+ +K   + V  L++ TP
Sbjct: 285 GLELKDSSFRRHILLQCLIMFDYLRAPGKNDKD-LPSETMKEE-LKSCEDRVKKLLEITP 342

Query: 332 PDGEHFSQVVKLILKGEEHWNQWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKK------ 384
           P G+ F + V+ IL+ E++W  WK +GCP   K+P+      DKK  P+A +KK      
Sbjct: 343 PKGKEFLRAVEHILEREKNWVWWKRDGCPPFEKQPI------DKKS-PNAGQKKRRQRWR 395

Query: 385 --TPELTKLWNSKD-NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNY 441
               EL++LW   D N  A   ++R  TP +  Y++   + MDP+A +E++Y   N+  Y
Sbjct: 396 LGNKELSQLWRWADQNPNALTDSQRVRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVY 455

Query: 442 AWRALRLLSRKCPHFFLNAT--------------PNVEKNSEFIEN-MVKRCVKEKP--- 483
            W+ LR  +R+    F   T              P V    +   N   KR  KE+    
Sbjct: 456 CWKGLRFTARQDLEGFSRFTEMGIEGVVPVELLPPEVRSKYQAKPNEKAKRAKKEEAKGG 515

Query: 484 SSQISGNGNGVDQDPAEVEVDTKSEEIQE 512
           S +  GN  GV    AE E      E  E
Sbjct: 516 SQETEGNQIGVSNSEAEAEGGRGDAETME 544


>gi|18416110|ref|NP_568219.1| THO complex subunit 1 [Arabidopsis thaliana]
 gi|15983384|gb|AAL11560.1|AF424566_1 AT5g09860/MYH9_7 [Arabidopsis thaliana]
 gi|16226756|gb|AAL16253.1|AF428323_1 AT5g09860/MYH9_7 [Arabidopsis thaliana]
 gi|332004073|gb|AED91456.1| THO complex subunit 1 [Arabidopsis thaliana]
          Length = 599

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 167/488 (34%), Positives = 249/488 (51%), Gaps = 58/488 (11%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I + + + + LC K+     +   LL D  +MST+  C+ +F Y+E   +I  +Q  F  
Sbjct: 84  IPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKDVFGYIESKQDILGKQELFAR 143

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI   FN  N T++  
Sbjct: 144 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK 203

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVS-WKMFTSYAE 236
           D    +S                +DFNFYK FWSLQ+YF NP    +  + W+ F+S   
Sbjct: 204 DPPKGIS----------------VDFNFYKTFWSLQEYFCNPASLTSASTKWQKFSSSLA 247

Query: 237 TVLAAFKSYKLDDVQSSLNP----SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
            VL  F +  L + +   N     +  +  KYLT+ KL+ L+L D++FRR++LLQ LI+F
Sbjct: 248 VVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSKLMGLELKDSSFRRHILLQCLIMF 307

Query: 293 QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWN 352
            Y  +  K+  + L  ++ +E+ +K   + V  L++ TPP G+ F + V+ IL+ E++W 
Sbjct: 308 DYLRAPGKNDKD-LPSETMKEE-LKSCEDRVKKLLEITPPKGKEFLRAVEHILEREKNWV 365

Query: 353 QWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKK--------TPELTKLWNSKD-NLEACK 402
            WK +GCP   K+P+      DKK  P+A +KK          EL++LW   D N  A  
Sbjct: 366 WWKRDGCPPFEKQPI------DKKS-PNAGQKKRRQRWRLGNKELSQLWRWADQNPNALT 418

Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT- 461
            ++R  TP +  Y++   + MDP+A +E++Y   N+  Y W+ LR  +R+    F   T 
Sbjct: 419 DSQRVRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFTARQDLEGFSRFTE 478

Query: 462 -------------PNVEKNSEFIEN-MVKRCVKEK---PSSQISGNGNGVDQDPAEVEVD 504
                        P V    +   N   KR  KE+    S +  GN  GV    AE E  
Sbjct: 479 MGIEGVVPVELLPPEVRSKYQAKPNEKAKRAKKEETKGGSHETEGNQIGVSNSEAEAEGG 538

Query: 505 TKSEEIQE 512
               E  E
Sbjct: 539 RGDAETME 546


>gi|356508296|ref|XP_003522894.1| PREDICTED: THO complex subunit 1-like [Glycine max]
          Length = 605

 Score =  250 bits (638), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 152/411 (36%), Positives = 222/411 (54%), Gaps = 42/411 (10%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I   + + + LC K+     +   LL D  +MST+  C+ +F Y+E   +I  +Q  F  
Sbjct: 81  IPRLLDIVLYLCEKEHIEGGMIFQLLEDLTEMSTMKNCKDIFGYIESKQDILGKQELFAR 140

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS LNI   FN  N T++  
Sbjct: 141 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSALNIKGVFNTSNETKY-- 198

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
                         + E ++ + IDFNFY+ FW LQ+YF NP    +    W+ FT    
Sbjct: 199 --------------EKEPLEGICIDFNFYQTFWGLQEYFSNPTSISHAPAKWQKFTLSLS 244

Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
            VL  F++  L D +   N     + ++  KYLT+ KL+ L+L D +FRR+VL+Q LILF
Sbjct: 245 VVLNTFEAQPLSDEEGDANNLEEEAVNFSIKYLTSSKLMGLELKDPSFRRHVLVQCLILF 304

Query: 293 QYFTSTVKSRGEG-LELKSDQEK-WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
            Y    +K+ G+G  +L S+  K  +    E V  L++ TPP G  F   ++ IL+ E++
Sbjct: 305 DY----LKAPGKGDKDLPSENMKEEITSWEERVKKLLELTPPKGTEFLHKIEHILEREKN 360

Query: 351 WNQWKNEGC-PELKRPLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEAC 401
           W  WK +GC P  K+ +      +KK  PD  KK+ P       EL++LW   D N  A 
Sbjct: 361 WVWWKRDGCLPYEKQRI------EKKAVPDGPKKRRPRWRLGNKELSQLWKWADQNPNAL 414

Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
              +R  TPS+  Y++   + MDP+A +E  Y   N+  Y W+ LRL +R+
Sbjct: 415 TDPQRVQTPSIMEYWKPLAEDMDPSAGIEADYHHKNNRVYCWKGLRLSARQ 465


>gi|414864320|tpg|DAA42877.1| TPA: hypothetical protein ZEAMMB73_799316 [Zea mays]
          Length = 604

 Score =  248 bits (632), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 164/529 (31%), Positives = 260/529 (49%), Gaps = 72/529 (13%)

Query: 57  VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
           +I   + + + LC +      +   LL D  +MST+  C+ +F Y+E   ++  +Q  F 
Sbjct: 87  LITRLLDIVLYLCERGHVEGGMVFQLLEDLTEMSTIKDCKDIFGYIESQQDVLGKQELFG 146

Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
             K  +LR CN LLRRLS++ + VFCGRI++FLA FFP SERS LNI   FN  N+T++ 
Sbjct: 147 RGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNVTKY- 205

Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYA 235
                          + + +D + +DFNFYK  WSLQ++F NP +   N   W+ F+S  
Sbjct: 206 ---------------EKDAMDGISVDFNFYKTLWSLQEHFSNPALTSTNPAKWQKFSSNL 250

Query: 236 ETVLAAFKSYKLDDVQSSLN-----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLI 290
             VL  F++  L D    LN         +  KYLT+ KL+ L+L D +FRR++L+Q LI
Sbjct: 251 AVVLNTFEAQPLSDDDGKLNNLNEEEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLI 310

Query: 291 LFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
            F Y  +  K+  EG      +E  +K   E V  L++  PP G+ F + ++ IL+ E++
Sbjct: 311 FFDYLKAPGKNDKEGPTGSMIEE--IKSCEEHVKKLLEIIPPKGKEFLKSIEHILEREKN 368

Query: 351 WNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK------TPELTKLWN-SKDNLEACKS 403
           W  WK +GC   ++P      E K  +  A+K+K      + EL++LW  ++ N      
Sbjct: 369 WVWWKRDGCLAFEKP----PFEKKPGQAGARKRKPRWRLGSKELSQLWKWAEQNPNVLTD 424

Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK----------- 452
            +R   PS+  Y++   + MDP+A +EE+Y   ++  Y W+ LR  +R+           
Sbjct: 425 PDRVRMPSITEYWKPLAEDMDPSAGIEEEYHHKSNRVYCWKGLRFSARQDLDGFARFSDY 484

Query: 453 -----CPHFFLNATPNVEKNSEFIENMVKRCVKEKPSS---------QISGNG---NGVD 495
                 P   L    N + +S+  E  VKR  KE   S         Q++       GVD
Sbjct: 485 GIEGVVPSELLPPEVNAKFSSKPAEK-VKRTRKEDSKSVSAHQPREHQVAATPETDGGVD 543

Query: 496 QDPAE--VEVDTKSEEIQEEEKEEDWE------AKADPEGDADEVMSVE 536
            +  E    +D+ +  + + +K    E       + +PE DAD+ +  E
Sbjct: 544 PEEGEGAAPMDSDNGAVGDSQKRSPGEVSGPESGQCEPEADADDNVRTE 592


>gi|259490220|ref|NP_001159168.1| hypothetical protein [Zea mays]
 gi|223942431|gb|ACN25299.1| unknown [Zea mays]
 gi|414864321|tpg|DAA42878.1| TPA: hypothetical protein ZEAMMB73_799316 [Zea mays]
          Length = 638

 Score =  248 bits (632), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 164/529 (31%), Positives = 260/529 (49%), Gaps = 72/529 (13%)

Query: 57  VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
           +I   + + + LC +      +   LL D  +MST+  C+ +F Y+E   ++  +Q  F 
Sbjct: 121 LITRLLDIVLYLCERGHVEGGMVFQLLEDLTEMSTIKDCKDIFGYIESQQDVLGKQELFG 180

Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
             K  +LR CN LLRRLS++ + VFCGRI++FLA FFP SERS LNI   FN  N+T++ 
Sbjct: 181 RGKLVMLRTCNQLLRRLSKANDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNVTKY- 239

Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYA 235
                          + + +D + +DFNFYK  WSLQ++F NP +   N   W+ F+S  
Sbjct: 240 ---------------EKDAMDGISVDFNFYKTLWSLQEHFSNPALTSTNPAKWQKFSSNL 284

Query: 236 ETVLAAFKSYKLDDVQSSLN-----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLI 290
             VL  F++  L D    LN         +  KYLT+ KL+ L+L D +FRR++L+Q LI
Sbjct: 285 AVVLNTFEAQPLSDDDGKLNNLNEEEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLI 344

Query: 291 LFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
            F Y  +  K+  EG      +E  +K   E V  L++  PP G+ F + ++ IL+ E++
Sbjct: 345 FFDYLKAPGKNDKEGPTGSMIEE--IKSCEEHVKKLLEIIPPKGKEFLKSIEHILEREKN 402

Query: 351 WNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK------TPELTKLWN-SKDNLEACKS 403
           W  WK +GC   ++P      E K  +  A+K+K      + EL++LW  ++ N      
Sbjct: 403 WVWWKRDGCLAFEKP----PFEKKPGQAGARKRKPRWRLGSKELSQLWKWAEQNPNVLTD 458

Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK----------- 452
            +R   PS+  Y++   + MDP+A +EE+Y   ++  Y W+ LR  +R+           
Sbjct: 459 PDRVRMPSITEYWKPLAEDMDPSAGIEEEYHHKSNRVYCWKGLRFSARQDLDGFARFSDY 518

Query: 453 -----CPHFFLNATPNVEKNSEFIENMVKRCVKEKPSS---------QISGNG---NGVD 495
                 P   L    N + +S+  E  VKR  KE   S         Q++       GVD
Sbjct: 519 GIEGVVPSELLPPEVNAKFSSKPAEK-VKRTRKEDSKSVSAHQPREHQVAATPETDGGVD 577

Query: 496 QDPAE--VEVDTKSEEIQEEEKEEDWE------AKADPEGDADEVMSVE 536
            +  E    +D+ +  + + +K    E       + +PE DAD+ +  E
Sbjct: 578 PEEGEGAAPMDSDNGAVGDSQKRSPGEVSGPESGQCEPEADADDNVRTE 626


>gi|23463069|gb|AAN33204.1| At5g09860/MYH9_7 [Arabidopsis thaliana]
          Length = 599

 Score =  246 bits (627), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 166/488 (34%), Positives = 248/488 (50%), Gaps = 58/488 (11%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I + + + + LC K+     +   LL D  +MST+  C+ +F Y+E   +I  +Q  F  
Sbjct: 84  IPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKDVFGYIESKQDILGKQELFAR 143

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI   FN  N T++  
Sbjct: 144 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK 203

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVS-WKMFTSYAE 236
           D    +S                +DFNFYK FWSLQ+YF NP    +  + W+ F+S   
Sbjct: 204 DPPKGIS----------------VDFNFYKTFWSLQEYFCNPASLTSASTKWQKFSSSLA 247

Query: 237 TVLAAFKSYKLDDVQSSLNP----SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
            VL  F +  L + +   N     +  +  KYLT+ KL+ L+L D++FRR++LLQ LI+F
Sbjct: 248 VVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSKLMGLELKDSSFRRHILLQCLIMF 307

Query: 293 QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWN 352
            Y  +  K+  + L  ++ +E+ +K   + V  L++ TPP G+ F + V+ IL+ E++W 
Sbjct: 308 DYLRAPGKNDKD-LPSETMKEE-LKSCEDRVKKLLEITPPKGKEFLRAVEHILEREKNWV 365

Query: 353 QWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKK--------TPELTKLWNSKD-NLEACK 402
            WK +G P   K+P+      DKK  P+A +KK          EL++LW   D N  A  
Sbjct: 366 WWKRDGYPPFEKQPI------DKKS-PNAGQKKRRQRWRLGNKELSQLWRWADQNPNALT 418

Query: 403 SAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT- 461
            ++R  TP +  Y++   + MDP+A +E++Y   N+  Y W+ LR  +R+    F   T 
Sbjct: 419 DSQRVRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFTARQDLEGFSRFTE 478

Query: 462 -------------PNVEKNSEFIEN-MVKRCVKEK---PSSQISGNGNGVDQDPAEVEVD 504
                        P V    +   N   KR  KE+    S +  GN  GV    AE E  
Sbjct: 479 MGIEGVVPVELLPPEVRSKYQAKPNEKAKRAKKEETKGGSHETEGNQIGVSNSEAEAEGG 538

Query: 505 TKSEEIQE 512
               E  E
Sbjct: 539 RGDAETME 546


>gi|224057238|ref|XP_002299188.1| predicted protein [Populus trichocarpa]
 gi|222846446|gb|EEE83993.1| predicted protein [Populus trichocarpa]
          Length = 608

 Score =  245 bits (626), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 143/408 (35%), Positives = 221/408 (54%), Gaps = 37/408 (9%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I   +   + LC ++     +   LL D  +MST+  C+ +F Y+E   +I  +Q  F  
Sbjct: 81  IPRLLDAVLYLCEREHIEGGMIFQLLEDLTEMSTMRNCKDIFGYIESKQDILGKQELFAR 140

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI   FN  N T++  
Sbjct: 141 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY-- 198

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
                         + E    + +DFNFYK  WSLQ+YF +P    + + W+ F+S    
Sbjct: 199 --------------EKEPPAAISLDFNFYKTLWSLQEYFCDPSLTLSPIKWQKFSSSLMV 244

Query: 238 VLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQ 293
           VL  F++  L + +   N     +  +  KYLT+ KL+ L+L D +FRR+VL+Q LILF 
Sbjct: 245 VLNTFEAQPLSEEEGDANNLEEEAAAFNIKYLTSSKLMGLELKDPSFRRHVLVQCLILFD 304

Query: 294 YFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQ 353
           Y  +  K+  + L  +S +E+ ++   E V  L++ TPP G+ F  +V+ IL+ E++W  
Sbjct: 305 YLKAPGKNDKD-LTSESMKEE-IRSREEHVKKLLEMTPPKGKDFLHMVEHILEREKNWLW 362

Query: 354 WKNEGCPEL-KRPLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEACKSA 404
           WK +GCP   K+P+ + T +      D  KK+ P       EL++LW   D N  A    
Sbjct: 363 WKRDGCPPFEKQPIENKTVQ------DGGKKRRPRWRLGNKELSQLWKWADQNPNALTDP 416

Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
           +R  TP +  Y++   + MDP+A ++ +Y   N+  Y W+ LR  +R+
Sbjct: 417 QRVRTPIITDYWKPLAEDMDPSAGIDAEYHHKNNRVYCWKGLRFSARQ 464


>gi|449445104|ref|XP_004140313.1| PREDICTED: THO complex subunit 1-like [Cucumis sativus]
          Length = 607

 Score =  245 bits (626), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 161/464 (34%), Positives = 234/464 (50%), Gaps = 49/464 (10%)

Query: 14  LNVVSDKYALYKPT-CDYDKKSALEQSIRQYLLKLIKTP----------DIDIKVIENYI 62
           L +V D     K T    D+   LE  +R+ L +L+ +            + I   E   
Sbjct: 25  LQIVQDVIRPQKHTKLAQDENQLLENILRRLLQELVSSAVQSTEPVMQYGMSIDEKETSQ 84

Query: 63  SLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL 122
            + + LC K+     +   LL D  +MSTL  C+ +F Y+E   +I  +Q  F   K  +
Sbjct: 85  DIVLYLCEKEHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFARGKLVM 144

Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMD 182
           LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI   FN  N T++       
Sbjct: 145 LRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY------- 197

Query: 183 VSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAETVLAA 241
                    + +  D   IDFNFYK FWSLQ++F NP         W+ FTS    VL  
Sbjct: 198 ---------EKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLMVVLNT 248

Query: 242 FKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS 297
           F +  L D +   N     S  +  KYLT+ KL+ L+L D +FRR+VL+Q LILF Y  +
Sbjct: 249 FDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILFDYLKA 308

Query: 298 TVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNE 357
             K+  +       +E  +K   E V  L++ TPP G+ F Q ++ IL+ E +W  WK +
Sbjct: 309 PGKNEKDIPSETMREE--IKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWVWWKRD 366

Query: 358 GC-PELKRPLTSITDEDKKDEPDAKKKKTP-------ELTKLWN-SKDNLEACKSAERDF 408
           GC P  K+P+      +KK   D  KK+ P       EL++LW  S  N  A    +R  
Sbjct: 367 GCAPFEKQPI------EKKTINDVTKKRRPRWRLGNKELSQLWKWSDQNPNALTDPQRVR 420

Query: 409 TPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
           +P++  Y++   + MD +A +E +Y   N+  Y W+ LR  +R+
Sbjct: 421 SPAISDYWKPLAEDMDESAGIEAEYHHRNNRVYCWKGLRFSARQ 464


>gi|440793057|gb|ELR14255.1| hypothetical protein ACA1_327400 [Acanthamoeba castellanii str.
           Neff]
          Length = 549

 Score =  243 bits (620), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 157/484 (32%), Positives = 253/484 (52%), Gaps = 44/484 (9%)

Query: 73  MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN-LLRMCNDLLR 131
           M  S  P+ L+ D F   T+++C+Q F  +E   +   +     S  NN LLR  +++ +
Sbjct: 1   MIESAAPLELVRDFFLSHTIEECQQFFALLEDRADQLSRAIIGDSRANNTLLRALSEVRK 60

Query: 132 RLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEET 191
           RLSR+ + VF GRIL+F+A  FP S++SG+N+  E NV N+T    D+E   +  ET E 
Sbjct: 61  RLSRTNDLVFAGRILMFMAYAFPLSDKSGVNLKGEANVANVTTM--DKEPSPALEETPED 118

Query: 192 DTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ 251
            +E      +D+ FY+ FW LQ YF NP++ ++  +W++ T +  +VL  FK Y + + +
Sbjct: 119 KSEP----PVDWTFYQMFWGLQSYFLNPIKAFD--AWQVVTGHLNSVLDLFKQYPISEEE 172

Query: 252 SSL------NPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSR 302
           +S       NP      Y  KYLT+ +LL LQL D  FRR++++QFL+ FQ      +S 
Sbjct: 173 ASAASYSYANPPAASECYLTKYLTSSRLLRLQLRDPWFRRHIMVQFLVFFQSLRVVSESS 232

Query: 303 GEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL 362
               +L   Q+  +   T+   + + +TPP+G HF+  +  ILK E +W  WK EGCP  
Sbjct: 233 TIAQKLTEQQKNEIAGLTKKTKTSLWETPPNGMHFTNTIYAILKRENNWTDWKKEGCPPF 292

Query: 363 KRPLTSITDEDKKDEPDA-----------KKKKTPELTKLWNSKDNLEACKSAERDFTPS 411
           ++P  +        EP A           KKK    L +  + +D     +  ER    S
Sbjct: 293 EKPPVAPL------EPAAVRAAREVRFNKKKKGLAALLEGESVRDLQALLQKEERGRMVS 346

Query: 412 LESYFEEAIQQMDPAAAVEEQYKKVN-DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEF 470
           LE + E  I++MDP AA+E QYKK+N D  Y WRA RL  ++   +F      + K    
Sbjct: 347 LEEWLEPVIEEMDPEAAIESQYKKINTDKIYVWRAKRLTFKRSLEYFC----TLAKQDAT 402

Query: 471 IENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEGDAD 530
           +E++ K+ +  K +S      +GV  +  E + D   ++++++E     EAK   + + D
Sbjct: 403 LEDVAKQLLANKGTS----TEDGVVLEDREEKRDADVDDVEDKESVVKIEAKHSEDQEGD 458

Query: 531 EVMS 534
           +++S
Sbjct: 459 QLVS 462


>gi|115450229|ref|NP_001048715.1| Os03g0110400 [Oryza sativa Japonica Group]
 gi|108705792|gb|ABF93587.1| THO complex subunit 1, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113547186|dbj|BAF10629.1| Os03g0110400 [Oryza sativa Japonica Group]
 gi|218191944|gb|EEC74371.1| hypothetical protein OsI_09688 [Oryza sativa Indica Group]
          Length = 638

 Score =  243 bits (619), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 166/550 (30%), Positives = 257/550 (46%), Gaps = 80/550 (14%)

Query: 35  ALEQSIRQYLLKLIKTPDIDIKVIENY-------------ISLCVELCMKDMCNSTLPII 81
           +LE ++R+ L +L  +     K I  Y             + + + LC +      +   
Sbjct: 89  SLENALRKLLQELASSAVQSGKRIMQYGDNEENNCPITRLLDIVLYLCERGHVEGGMVFQ 148

Query: 82  LLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVF 141
           LL D  +MST+  C+ +F Y+E   ++  +Q  F   K  +LR CN LLRRLS++ + VF
Sbjct: 149 LLEDLTEMSTIKDCKDVFGYIESKQDVLGKQELFGRGKLVMLRTCNQLLRRLSKANDVVF 208

Query: 142 CGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKI 201
           CGRI++FLA FFP SERS LNI   FN  N T++                + +  D + +
Sbjct: 209 CGRIIMFLAHFFPLSERSALNIKGVFNTSNETKY----------------EKDATDGISV 252

Query: 202 DFNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLN----- 255
           DFNFY   WSLQ++F NP +   N   W+ F S    VL+ F++  L D    LN     
Sbjct: 253 DFNFYNTLWSLQEHFSNPALTAANLTRWQKFVSNLTVVLSTFEAQPLSDDDGKLNNLDQE 312

Query: 256 PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKW 315
               +  KYLT+ KL+ L+L D +FRR++L+Q LI F +  +  K+  EG      +E  
Sbjct: 313 EDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLIFFDFLKAPGKTDKEGPTGSMKEE-- 370

Query: 316 VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL-KRPLTSITDEDK 374
           +    E V  L++  PP G+ F Q ++ IL+ E++W  WK +GC    K+P      E K
Sbjct: 371 IDSCEERVKKLLEIIPPKGKDFLQSIEHILEREKNWVWWKRDGCLAFEKQPF-----EKK 425

Query: 375 KDEPDAKKKK------TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAA 427
             +   +K+K        EL +LW  ++ N  A   +ER   PS+  Y++   + MDP+A
Sbjct: 426 PGQAGVRKRKPRWRLGNKELAQLWKWAEQNPNALTDSERICMPSVTEYWKPLAEDMDPSA 485

Query: 428 AVEEQYKKVNDSNYAWRALRL--------LSRKCPHFFLNATPN----VEKNSEFIE--- 472
            +E++Y   N+  Y W+ LR          SR C +      P      E  S+F     
Sbjct: 486 GIEDEYHHKNNRVYCWKGLRFSARQDLEGFSRFCDYGIEGVVPQELLPPEVRSKFYSKPG 545

Query: 473 NMVKRCVKEKP---------------SSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEE 517
           +  KR  +E P               + +  G G+G + +   V +D+ +  I E  K+ 
Sbjct: 546 DKAKRPKREDPKGTSAQPKEQQVVSATPETDGGGSGAEPEEGAVPMDSDNAAIDEGRKQS 605

Query: 518 DWEAKADPEG 527
             E      G
Sbjct: 606 PEEVSGPESG 615


>gi|168000859|ref|XP_001753133.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695832|gb|EDQ82174.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 566

 Score =  242 bits (617), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 151/450 (33%), Positives = 224/450 (49%), Gaps = 50/450 (11%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENY-----------------ISLCVELCMKDM 73
           D+   LE  +R  L +L+     + K +  Y                 + + + LC K  
Sbjct: 55  DENQLLENGLRTLLEELVGAAVANGKSVMQYGRAEGPEDDGNGLLTRLLDIVLYLCEKGN 114

Query: 74  CNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRL 133
               +   LL D  ++ST+  C+++F Y+E       +   F   K  +LR CN LLRRL
Sbjct: 115 VEGGMIFQLLEDLTEVSTIKDCKEVFGYIEGKQETLGKSELFSRGKLVMLRTCNQLLRRL 174

Query: 134 SRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDT 193
           S++ + VFCGRIL+FLA FFP SERS +NI   FN               +SNET+    
Sbjct: 175 SKTNDVVFCGRILMFLAHFFPLSERSAVNIKGVFN---------------TSNETKYEKE 219

Query: 194 EEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS 253
              D   IDF FYK FWSLQ+YF NP    +   W+ F+S    VL  F+   + +   S
Sbjct: 220 PPGDSPAIDFTFYKTFWSLQEYFSNPTTLTHPSKWQTFSSNLGIVLETFEMQPIGEEDES 279

Query: 254 LNPSGD----YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELK 309
           L+  GD    +  KYLT+  L+ L+L D  FRR+VL+Q LI+F Y  +  K+  +G    
Sbjct: 280 LSLDGDDDSTFNIKYLTSSNLMSLELKDPGFRRHVLVQCLIIFDYMKTPGKAEKDGPREG 339

Query: 310 SDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSI 369
             +E  +K   + V  L++ TPP G  F   ++ IL+ +++W  WK +GC   +R +   
Sbjct: 340 VREE--LKVHEDRVKKLLRTTPPKGREFLASIEHILERDKNWVWWKRDGCIPFERFV--- 394

Query: 370 TDEDKKDEPDAKKKK------TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQ 422
             ++KK   D  K++        EL++LW  S DN  A   A+R   P L  YF+   + 
Sbjct: 395 --QEKKIPSDPPKRRPRWRLGNRELSRLWKWSDDNPNAITHAKRVKMPPLAEYFKPLAED 452

Query: 423 MDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
           MDP A +EE+Y   N+  Y W+ LR  +R+
Sbjct: 453 MDPEAGIEEEYHHKNNKVYCWKGLRFAARQ 482


>gi|255578237|ref|XP_002529986.1| nuclear matrix protein, putative [Ricinus communis]
 gi|223530509|gb|EEF32391.1| nuclear matrix protein, putative [Ricinus communis]
          Length = 608

 Score =  241 bits (614), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 142/406 (34%), Positives = 216/406 (53%), Gaps = 32/406 (7%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I   + + + LC ++     +   LL D  +MST+  C+ +F Y+E   +I  +Q  F  
Sbjct: 81  IPRLLDVVLHLCEREHVEGGMIFQLLEDLTEMSTMKNCQDIFGYIESKQDILGKQELFAR 140

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI   FN  N T++  
Sbjct: 141 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKYEK 200

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
           D    +S                +DFNFYK  WSLQ+ F NP         W  FTS   
Sbjct: 201 DPPAGIS----------------VDFNFYKTLWSLQENFCNPAPLTLAPTKWHKFTSSLM 244

Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
            VL  F++  L + +   N     +  +  KYLT+ KL+ L+L D +FRR++L+Q LILF
Sbjct: 245 VVLNTFEAQPLSEEEGDANNLEEEAATFNIKYLTSSKLMGLELKDPSFRRHILVQCLILF 304

Query: 293 QYFTSTVKSRGEGL--ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
            Y  +  K+  +     +K D    ++   E V  L++ TPP G+ F Q ++ +L+ E++
Sbjct: 305 DYLKAPGKNDKDSTSESMKED----IRTCEERVKKLLEMTPPKGKDFLQKIEHVLEREKN 360

Query: 351 WNQWKNEGC-PELKRPL--TSITDEDKKDEPDAKKKKTPELTKLWNSKD-NLEACKSAER 406
           W  WK +GC P  K+P+   +I +  KK +P  +     EL++LW   D N  A    +R
Sbjct: 361 WVCWKRDGCQPFEKQPIENKTIQEGSKKRKPRWRLGNK-ELSQLWKWADQNPNALTDPQR 419

Query: 407 DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
             TP++  Y++   + MDP+A +E +Y   N+  Y W+ LR  +R+
Sbjct: 420 VRTPAITEYWKPLAEDMDPSAGIEAEYHHKNNRVYCWKGLRFSARQ 465


>gi|148691057|gb|EDL23004.1| THO complex 1, isoform CRA_b [Mus musculus]
          Length = 239

 Score =  239 bits (610), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 120/222 (54%), Positives = 153/222 (68%), Gaps = 23/222 (10%)

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE------- 180
           DLLRRLS+SQNTVFCGRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+       
Sbjct: 4   DLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQK 63

Query: 181 --------MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFT 232
                   MDV   E  + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F 
Sbjct: 64  HTEDREEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFL 123

Query: 233 SYAETVLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYV 284
            Y+E VLA FKSYKLDD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++
Sbjct: 124 KYSEEVLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHI 183

Query: 285 LLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSL 326
           LLQ+LILFQY    VK +     L  +Q  W++DTT++VY +
Sbjct: 184 LLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQV 225


>gi|67969270|dbj|BAE00988.1| unnamed protein product [Macaca fascicularis]
          Length = 296

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 121/246 (49%), Positives = 156/246 (63%), Gaps = 22/246 (8%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L + I    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEEEI----INHSSCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNE 187
           GRI LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV   E
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGE 216

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKL
Sbjct: 217 MGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKL 276

Query: 248 DDVQSS 253
           DD Q+S
Sbjct: 277 DDTQAS 282


>gi|302781468|ref|XP_002972508.1| hypothetical protein SELMODRAFT_96989 [Selaginella moellendorffii]
 gi|300159975|gb|EFJ26594.1| hypothetical protein SELMODRAFT_96989 [Selaginella moellendorffii]
          Length = 583

 Score =  234 bits (598), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 142/408 (34%), Positives = 207/408 (50%), Gaps = 36/408 (8%)

Query: 57  VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
           +I   + + + LC K      +   LL D  ++ST+  C+++F Y+E    +  +   F 
Sbjct: 127 LITRLLDIVLYLCEKSHVEGGMVFQLLEDLTEVSTIKDCKEVFAYIEGKQEVLGKTELFG 186

Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
             K  +LR CN LLRRLS++ + VFCGRIL+FLA  FP  ERS +NI   FN  N T + 
Sbjct: 187 RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHVFPLYERSAVNIKGVFNTSNETNYE 246

Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAE 236
            D+                 D V IDFNFYK FW LQ++F NP     +  W +F++   
Sbjct: 247 KDQP----------------DDVPIDFNFYKTFWGLQEHFSNPAALIPR--WHIFSASLT 288

Query: 237 TVLAAFKSYKL--DDVQSSLNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLIL 291
            VL+AF +  L  DD  S     GD   +  KYLT+  L+ L+L D +FRR+VL+Q LI+
Sbjct: 289 IVLSAFDAQPLGDDDSTSHQLDEGDDMTFNTKYLTSSNLMSLELKDPSFRRHVLVQCLIV 348

Query: 292 FQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
           F Y  +  K+  E        E  VK   + V  L++ TPP G+ F + ++ +L+ E++W
Sbjct: 349 FDYLKTPGKTEKEAARESLKDE--VKVLEDRVKRLLESTPPKGKDFLKSIEHLLEREKNW 406

Query: 352 NQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK------TPELTKLWNSKD-NLEACKSA 404
             WK +GCP    P      E K      KK+K        EL++LW   D N  A    
Sbjct: 407 VWWKRDGCP----PFEKNVSERKSSIEGTKKRKQRWRLGNKELSQLWKWADQNPNALTDP 462

Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
           +R  TP +  Y+    + MDP + +E +Y   ND  Y W+ LR  +R+
Sbjct: 463 QRVKTPPVHEYWTALAEDMDPMSGIEAEYSHKNDKVYCWKGLRFSARQ 510


>gi|320166707|gb|EFW43606.1| hypothetical protein CAOG_01650 [Capsaspora owczarzaki ATCC 30864]
          Length = 618

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 153/486 (31%), Positives = 219/486 (45%), Gaps = 126/486 (25%)

Query: 52  DIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVN--IW 109
           D  +  I   +   +E   + +  S++P+ LL+D  D  TLD C++LF  VE       W
Sbjct: 37  DATVATIGLLLDTALECSRRSVLTSSIPLALLTDALDSLTLDVCDKLFDLVEHRQTDPSW 96

Query: 110 KQQTFFMSCKN---------------------NLLRMCNDLLRRLSRSQNTVFCGRILLF 148
               F  S                         +L++CN+LLRRLS++Q+TVFCGRIL F
Sbjct: 97  TSTVFSTSAAPPSATPGVPAPTVIRGTFATNLQILQLCNELLRRLSKTQDTVFCGRILTF 156

Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKK 208
           L   FP SE+SGLNI   FN  +  +F  +E  D  S+ TEE               ++ 
Sbjct: 157 LCSVFPLSEKSGLNIAGAFNGSSALDFLDEESADEVSSTTEE--------------IHRS 202

Query: 209 FWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGDYFAKYLTNQ 268
           FWSLQ +F NP  C+   +W   T    T                               
Sbjct: 203 FWSLQPFFSNPNSCFTADAWLSVTKAVTT------------------------------- 231

Query: 269 KLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIK 328
           +L  LQL+D+ FRR++L+Q LI+FQY     K+R + L LK DQ K + +  +   + ++
Sbjct: 232 RLFQLQLNDSTFRRHMLVQILIIFQYLAVPSKAR-KTLVLKDDQLKLLFELRDKTMAALR 290

Query: 329 QTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-----------PLTSITD------ 371
             PP G  F+  V  +L+ E  WN WKNE CP  +R           P+ S  D      
Sbjct: 291 AVPPHGAVFAAAVGQVLESELDWNTWKNESCPPFERAPDSTVIVGHLPIASAADAVSQPN 350

Query: 372 -----------EDKKDE----PDA----KKKKTP--------------------ELTKLW 392
                       D+ DE    P A     KK T                     +LT LW
Sbjct: 351 AGSLAQSRKRAHDQVDENSEDPTAARHIAKKLTSDGRVPSSSSSGSGPVRMGQKDLTALW 410

Query: 393 NSKD-NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
           N+ D +L + +  +R F P  +++FEEAI+QMDP A +++ Y K+ D  +AWR+LRLL+R
Sbjct: 411 NANDESLASLQGVDRRFIPDPDAFFEEAIEQMDPDAQIDKVYWKIGDELFAWRSLRLLAR 470

Query: 452 KCPHFF 457
              H F
Sbjct: 471 TRLHLF 476


>gi|302821808|ref|XP_002992565.1| hypothetical protein SELMODRAFT_135550 [Selaginella moellendorffii]
 gi|300139634|gb|EFJ06371.1| hypothetical protein SELMODRAFT_135550 [Selaginella moellendorffii]
          Length = 583

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 142/408 (34%), Positives = 207/408 (50%), Gaps = 36/408 (8%)

Query: 57  VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
           +I   + + + LC K      +   LL D  ++ST+  C+++F Y+E    +  +   F 
Sbjct: 127 LITRLLDIVLYLCEKTHVEGGMVFQLLEDLTEVSTIKDCKEVFAYIEGKQEVLGKTELFG 186

Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
             K  +LR CN LLRRLS++ + VFCGRIL+FLA  FP  ERS +NI   FN  N T + 
Sbjct: 187 RGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHVFPLYERSAVNIKGVFNTSNETNYE 246

Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAE 236
            D+                 D V IDFNFYK FW LQ++F NP     +  W +F++   
Sbjct: 247 KDQP----------------DDVPIDFNFYKTFWGLQEHFSNPAALIPR--WHIFSASLT 288

Query: 237 TVLAAFKSYKL--DDVQSSLNPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLIL 291
            VL+AF +  L  DD  S     GD   +  KYLT+  L+ L+L D +FRR+VL+Q LI+
Sbjct: 289 IVLSAFDAQPLGDDDSTSHQLDEGDDMTFNTKYLTSCNLMSLELKDPSFRRHVLVQCLIV 348

Query: 292 FQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
           F Y  +  K+  E        E  VK   + V  L++ TPP G+ F + ++ +L+ E++W
Sbjct: 349 FDYLKTPGKTEKEAARESLKDE--VKVLEDRVKRLLESTPPKGKDFLKSIEHLLEREKNW 406

Query: 352 NQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK------TPELTKLWNSKD-NLEACKSA 404
             WK +GCP    P      E K      KK+K        EL++LW   D N  A    
Sbjct: 407 VWWKRDGCP----PFEKNVSERKSSIEGTKKRKQRWRLGNKELSQLWKWADQNPNALTDP 462

Query: 405 ERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
           +R  TP +  Y+    + MDP + +E +Y   ND  Y W+ LR  +R+
Sbjct: 463 QRVKTPPVHEYWTALAEDMDPMSGIEAEYSHKNDKVYCWKGLRFSARQ 510


>gi|360042969|emb|CCD78379.1| putative nuclear matrix protein [Schistosoma mansoni]
          Length = 465

 Score =  232 bits (592), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 130/304 (42%), Positives = 174/304 (57%), Gaps = 29/304 (9%)

Query: 182 DVSSNETEETD-TEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
           D +S++ EE + T+    +++D   Y KFWSLQ++F++P  CY K  W  FTS  +TVL 
Sbjct: 28  DHTSDDLEEGEMTDSSTPLEVDAGLYVKFWSLQEFFKSPALCYEKAKWLRFTSSTDTVLD 87

Query: 241 AFKSYKLDDVQSS---LNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS 297
            F S KL  ++        S   F+KYLT++KLLDLQL D +FRRY+++Q LILFQY T+
Sbjct: 88  VFSSIKLMAIEEGDICSQSSSGTFSKYLTSEKLLDLQLMDPSFRRYIIVQLLILFQYLTT 147

Query: 298 TVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPD---GEHFSQVVKLILKGEEHWNQW 354
           TVK +     L  DQ  WV    E V  L+    P+   G  F   V+ IL+ E +WN+W
Sbjct: 148 TVKFKTIDQVLSEDQRSWVNQRHEVVLRLLSSNSPNNSSGGTFVSTVERILERESYWNRW 207

Query: 355 KNEGCPELKR---------------PLTSITDEDKKDEPDAKKKKTPELTKLWN-SKDNL 398
           KN+GCP   R               PL + T +         +    EL KLWN   DNL
Sbjct: 208 KNDGCPSFIRTPEKSRLSVRKRHINPLVTRTGQK------VYRFGNRELDKLWNVCPDNL 261

Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFL 458
            AC+   R F P L SYF++AI +MDPA  VEE+YK +N   ++WRALR LSRKCPHF++
Sbjct: 262 AACRDKRRVFNPDLHSYFQDAIMEMDPAEKVEEEYKSINKDEWSWRALRFLSRKCPHFYI 321

Query: 459 NATP 462
           N  P
Sbjct: 322 NWNP 325


>gi|449506062|ref|XP_004162641.1| PREDICTED: THO complex subunit 1-like [Cucumis sativus]
          Length = 483

 Score =  231 bits (588), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 144/395 (36%), Positives = 207/395 (52%), Gaps = 38/395 (9%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I   + + + LC K+     +   LL D  +MSTL  C+ +F Y+E   +I  +Q  F  
Sbjct: 91  IPRLLDIVLYLCEKEHVEGGMIFQLLEDLTEMSTLRNCKDIFGYIESKQDILGKQELFAR 150

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI   FN  N T++  
Sbjct: 151 GKLVMLRTCNQLLRRLSKASDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSNETKY-- 208

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ-CYNKVSWKMFTSYAE 236
                         + +  D   IDFNFYK FWSLQ++F NP         W+ FTS   
Sbjct: 209 --------------EKQPPDGFSIDFNFYKTFWSLQEFFCNPASLALASTKWQKFTSSLM 254

Query: 237 TVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
            VL  F +  L D +   N     S  +  KYLT+ KL+ L+L D +FRR+VL+Q LILF
Sbjct: 255 VVLNTFDAQPLSDEEGDANILEEESATFSIKYLTSSKLMGLELKDPSFRRHVLMQCLILF 314

Query: 293 QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWN 352
            Y  +  K+  +       +E  +K   E V  L++ TPP G+ F Q ++ IL+ E +W 
Sbjct: 315 DYLKAPGKNEKDIPSETMREE--IKSCEERVKKLLEVTPPRGKDFLQKIEHILQRENNWV 372

Query: 353 QWKNEGC-PELKRPLTSITDEDKKDEPDAKKKKTP-------ELTKLWN-SKDNLEACKS 403
            WK +GC P  K+P+      +KK   D  KK+ P       EL++LW  S  N  A   
Sbjct: 373 WWKRDGCAPFEKQPI------EKKTINDVTKKRRPRWRLGNKELSQLWKWSDQNPNALTD 426

Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVND 438
            +R  +P++  Y++   + MD +A +E +Y   N+
Sbjct: 427 PQRVRSPAISDYWKPLAEDMDESAGIEAEYHHRNN 461


>gi|326427178|gb|EGD72748.1| hypothetical protein PTSG_04478 [Salpingoeca sp. ATCC 50818]
          Length = 605

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 160/497 (32%), Positives = 238/497 (47%), Gaps = 77/497 (15%)

Query: 22  ALYKPTCDYDKKSALEQS--------IRQYLLKLIKTPD---IDIKVIENYISLCVELCM 70
           A+ + T D D  ++L+Q+         R+ L  L+   D   +D+      + +C+E C 
Sbjct: 12  AVVQGTADTDSLASLKQAKAHIVGLAFRRVLFPLLVQRDEKCLDV------LRICIE-CF 64

Query: 71  KD-----MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRM 125
           K         +T+ I L  D FD+  +  CE++F  +E N    +Q  F  +    ++R+
Sbjct: 65  KREFIAVAATNTVLISLFEDAFDVLPVSMCERVFTLMEQNA---QQPAFVKANGPRVVRL 121

Query: 126 CNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG-GDEEMDVS 184
            N L +RLSRSQ+    GR+L+F+A  FP S++SG NI S  NV N T +   +EE+D S
Sbjct: 122 INGLKKRLSRSQDIELSGRLLMFIASIFPMSDKSGANISSAVNVANKTHYDETEEELDTS 181

Query: 185 SNETEETDTEEVDKVKIDFNF----YKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
           ++E  +   E++D   +        YK+FWSLQ +F  P  CY    WK F    E VL 
Sbjct: 182 ASEAAKAQ-EDIDMPSLGRPLNKSDYKQFWSLQRFFCEPRLCYQPDEWKRFEKATECVLD 240

Query: 241 AFKSYKLDDVQSSLNPSG---------------------------------DYFAKYLTN 267
            F+ + L+    +  P+                                    FAKYLT+
Sbjct: 241 LFERHPLEVSPVTTQPTAASMMDTNDDNGDGDGDGDDGDDDGPSQKQRRADTTFAKYLTS 300

Query: 268 QKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLI 327
            KL +LQL+D   RR +LLQ LILF Y     K +     L S Q++W+      V  L+
Sbjct: 301 PKLFELQLNDVTLRREILLQLLILFSYLPRDSKFKKPAETLTSAQKEWMDKRQRQVEKLM 360

Query: 328 KQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK--- 384
           K    DG  F    K +L+ E  W QWK +GC    +P        +K            
Sbjct: 361 KAAGSDGADFLTNAKRMLERELVWVQWKEDGC----KPFEKKESAKRKAPSPPAPGAAGV 416

Query: 385 ---TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSN 440
              + EL +LW  S  N+EACK A +   PSL+ YF +A+++MDPA  VEE+Y ++ND  
Sbjct: 417 HMGSDELQRLWTLSSTNMEACKRA-KVRVPSLKEYFLDAVEEMDPAECVEEEYWQINDFR 475

Query: 441 YAWRALRLLSRKCPHFF 457
           + W+ LRL+SRK  + F
Sbjct: 476 FNWKGLRLISRKKVNLF 492


>gi|9758964|dbj|BAB09407.1| unnamed protein product [Arabidopsis thaliana]
          Length = 588

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 161/499 (32%), Positives = 239/499 (47%), Gaps = 91/499 (18%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I + + + + LC K+     +   LL D  +MST+  C+ +F Y+E   +I  +Q  F  
Sbjct: 84  IPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKNCKDVFGYIESKQDILGKQELFAR 143

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERSG                 
Sbjct: 144 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSG----------------- 186

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVS-WKMFTSYAE 236
                                + +DFNFYK FWSLQ+YF NP    +  + W+ F+S   
Sbjct: 187 ---------------------ISVDFNFYKTFWSLQEYFCNPASLTSASTKWQKFSSSLA 225

Query: 237 TVLAAFKSYKLDDVQSSLNP----SGDYFAKYLTNQKLLDL-----------QLSDTNFR 281
            VL  F +  L + +   N     +  +  KYLT+ KL+ L           QL D++FR
Sbjct: 226 VVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSKLMGLEFTHFFFYLSPQLKDSSFR 285

Query: 282 RYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVV 341
           R++LLQ LI+F Y  +  K+  + L  ++ +E+ +K   + V  L++ TPP G+ F + V
Sbjct: 286 RHILLQCLIMFDYLRAPGKNDKD-LPSETMKEE-LKSCEDRVKKLLEITPPKGKEFLRAV 343

Query: 342 KLILKGEEHWNQWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKK--------TPELTKLW 392
           + IL+ E++W  WK +GCP   K+P+      DKK  P+A +KK          EL++LW
Sbjct: 344 EHILEREKNWVWWKRDGCPPFEKQPI------DKKS-PNAGQKKRRQRWRLGNKELSQLW 396

Query: 393 NSKD-NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
              D N  A   ++R  TP +  Y++   + MDP+A +E++Y   N+  Y W+ LR  +R
Sbjct: 397 RWADQNPNALTDSQRVRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFTAR 456

Query: 452 KCPHFFLNAT--------------PNVEKNSEFIEN-MVKRCVKEK---PSSQISGNGNG 493
           +    F   T              P V    +   N   KR  KE+    S +  GN  G
Sbjct: 457 QDLEGFSRFTEMGIEGVVPVELLPPEVRSKYQAKPNEKAKRAKKEETKGGSHETEGNQIG 516

Query: 494 VDQDPAEVEVDTKSEEIQE 512
           V    AE E      E  E
Sbjct: 517 VSNSEAEAEGGRGDAETME 535


>gi|326497285|dbj|BAK02227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 470

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 126/349 (36%), Positives = 186/349 (53%), Gaps = 36/349 (10%)

Query: 57  VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
           +I   + + + LC +      +   LL D  DMST+  C+ +F Y+E   ++  +Q  F 
Sbjct: 121 LITRLLDIMLYLCERGHVEGGMVFQLLEDLTDMSTIKDCKDVFGYIESKQDVLGKQELFG 180

Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
             K  +LR CN LLRRLS+S + VFCGRI++FLA FFP SERS LNI   FN  N T++ 
Sbjct: 181 RGKLVMLRTCNQLLRRLSKSNDVVFCGRIIMFLAHFFPLSERSALNIKGVFNTSNETKY- 239

Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYA 235
                          + +  D + +DFNFY+  WSLQ++FRNP +   N   W+ F+S  
Sbjct: 240 ---------------EKDATDGISVDFNFYQTLWSLQEHFRNPALTTTNPTKWQKFSSNL 284

Query: 236 ETVLAAFKSYKLDDVQSSLN-----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLI 290
             VL+ F++  L D     N         +  KYLT+ KL+ L+L D +FRR++L+Q LI
Sbjct: 285 TVVLSTFEAQPLSDADGKHNNLEQEEDAAFNIKYLTSSKLMGLELKDASFRRHILVQCLI 344

Query: 291 LFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
            F Y  +  KS  +G      +E  +K   E V +L++  PP G+ F Q ++ IL+ E++
Sbjct: 345 FFDYLKAPGKSDKDGPSESMKEE--IKSCEERVKNLLEMIPPKGKEFLQSIEHILEREKN 402

Query: 351 WNQWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKK------TPELTKLW 392
           W  WK +GCP   K+P      E K  +  A+K+K        EL +LW
Sbjct: 403 WVWWKRDGCPAFEKQP-----SEKKPGQAGARKRKPRWRLGNKELAQLW 446


>gi|344235559|gb|EGV91662.1| THO complex subunit 1 [Cricetulus griseus]
          Length = 350

 Score =  219 bits (559), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 111/227 (48%), Positives = 146/227 (64%), Gaps = 14/227 (6%)

Query: 271 LDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQT 330
           +DLQLSD+NFRR++LLQ+LILFQY    VK +     L  +Q  W++DTT++VY L+ + 
Sbjct: 1   MDLQLSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSEN 60

Query: 331 PPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDED----KKDEPDAKKKKTP 386
           PPDGE FS++V+ IL  EE+WN WKNEGCP   +   S T       K+  P+    K P
Sbjct: 61  PPDGERFSKMVEHILNTEENWNSWKNEGCPSFVKERASDTKPTRVVRKRAAPEDFLGKGP 120

Query: 387 ---------ELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKV 436
                    ELT+LWN   DN+EACKS  R++ P+LE +FEEAI+Q DP   VE +YK V
Sbjct: 121 NKKILIGNEELTRLWNLCPDNMEACKSETREYMPTLEEFFEEAIEQADPENMVESEYKAV 180

Query: 437 NDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
           N+SNY WRALRLL+R+ PHFF       +   E++ENMV +  KE P
Sbjct: 181 NNSNYGWRALRLLARRSPHFFQPTNQQFKSLPEYLENMVIKLAKELP 227


>gi|222624056|gb|EEE58188.1| hypothetical protein OsJ_09127 [Oryza sativa Japonica Group]
          Length = 643

 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 161/551 (29%), Positives = 250/551 (45%), Gaps = 86/551 (15%)

Query: 35  ALEQSIRQYLLKLIKTPDIDIKVIENY-------------ISLCVELCMKDMCNSTLPII 81
           +LE ++R+ L +L  +     K I  Y             + + + LC +      +   
Sbjct: 98  SLENALRKLLQELASSAVQSGKRIMQYGDNEENNCPITRLLDIVLYLCERGHVEGGMVFQ 157

Query: 82  LLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVF 141
           LL D  +MST+  C+ +F Y+E   ++  +Q  F   K  +LR CN LLRRLS++ + VF
Sbjct: 158 LLEDLTEMSTIKDCKDVFGYIESKQDVLGKQELFGRGKLVMLRTCNQLLRRLSKANDVVF 217

Query: 142 CGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKI 201
           CGRI++FLA FFP SERS LNI   FN  N T++                     +K   
Sbjct: 218 CGRIIMFLAHFFPLSERSALNIKGVFNTSNETKY---------------------EKDAT 256

Query: 202 DFNFYKKFWSL-QDYFRNP-VQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLN---- 255
           D N  +K   L  ++F NP +   N   W+ F S    VL+ F++  L D    LN    
Sbjct: 257 DVNCSRKRIELPNEHFSNPALTAANLTRWQKFVSNLTVVLSTFEAQPLSDDDGKLNNLDQ 316

Query: 256 -PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEK 314
                +  KYLT+ KL+ L+L D +FRR++L+Q LI F +  +  K+  EG      +E 
Sbjct: 317 EEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLIFFDFLKAPGKTDKEGPTGSMKEE- 375

Query: 315 WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL-KRPLTSITDED 373
            +    E V  L++  PP G+ F Q ++ IL+ E++W  WK +GC    K+P      E 
Sbjct: 376 -IDSCEERVKKLLEIIPPKGKDFLQSIEHILEREKNWVWWKRDGCLAFEKQPF-----EK 429

Query: 374 KKDEPDAKKKK------TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPA 426
           K  +   +K+K        EL +LW  ++ N  A   +ER   PS+  Y++   + MDP+
Sbjct: 430 KPGQAGVRKRKPRWRLGNKELAQLWKWAEQNPNALTDSERICMPSVTEYWKPLAEDMDPS 489

Query: 427 AAVEEQYKKVNDSNYAWRALRL--------LSRKCPHFFLNATPN----VEKNSEFIE-- 472
           A +E++Y   N+  Y W+ LR          SR C +      P      E  S+F    
Sbjct: 490 AGIEDEYHHKNNRVYCWKGLRFSARQDLEGFSRFCDYGIEGVVPQELLPPEVRSKFYSKP 549

Query: 473 -NMVKRCVKEKP---------------SSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKE 516
            +  KR  +E P               + +  G G+G + +   V +D+ +  I E  K+
Sbjct: 550 GDKAKRPKREDPKGTSAQPKEQQVVSATPETDGGGSGAEPEEGAVPMDSDNAAIDEGRKQ 609

Query: 517 EDWEAKADPEG 527
              E      G
Sbjct: 610 SPEEVSGPESG 620


>gi|224072965|ref|XP_002303943.1| predicted protein [Populus trichocarpa]
 gi|222841375|gb|EEE78922.1| predicted protein [Populus trichocarpa]
          Length = 611

 Score =  214 bits (546), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 159/518 (30%), Positives = 244/518 (47%), Gaps = 83/518 (16%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           I   + + + LC +D     +   LL D  +MST+  C+ +F Y+E   +I  +Q  F  
Sbjct: 78  IPRLLDVVLYLCERDFVEGGMIFQLLEDLTEMSTMRNCKDIFGYIESKQDILGKQELFAR 137

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP SERS +NI   FN  N      
Sbjct: 138 GKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFPLSERSAVNIKGVFNTSN------ 191

Query: 178 DEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSL--QDYFRNPVQCYNKVSWKMFTSYA 235
                       ET  E+         +  K   L  Q+YF +P    + + W+ F+   
Sbjct: 192 ------------ETKYEKEPPAATCCMYSDKLVCLLFQEYFCDPSLTLSPIKWQKFSLSL 239

Query: 236 ETVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLIL 291
             +L AF++  L + + S N     +  +  KYLT+ KL+ L+L D +FRR+VL+Q LIL
Sbjct: 240 MVILNAFEAQPLSEEEGSANNLEEEAASFNIKYLTSSKLMGLELKDPSFRRHVLVQCLIL 299

Query: 292 FQYFTSTVKSRGEGLELKSDQ--------------------EKWVKDTTETVYSLIKQTP 331
           F Y  +  K+  +   L S+                     ++ +K   E V  L++ TP
Sbjct: 300 FDYLKAPGKNDKD---LTSESMVSAVPLLILILSALNSCLCKEEIKSREEHVKKLLEMTP 356

Query: 332 PDGEHFSQVVKLILKGEEHWNQWKNEGCPEL-KRPLTSITDEDKKDEPDAKKKKTP---- 386
           P G+ F   V+ IL+ E++W  WK +GCP   K+P+ + T +      D  KK+ P    
Sbjct: 357 PKGKDFLHKVEHILEREKNWLWWKRDGCPPFEKQPIENKTVQ------DGGKKRRPRWRL 410

Query: 387 ---ELTKLWNSKD-NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYA 442
              EL++LW   D N  A    +R  TP++  Y++   + MDP+A++E  Y   N+  Y 
Sbjct: 411 GNKELSQLWKWADQNPNALTDPQRVRTPAITDYWKPLAEDMDPSASIEADYHHKNNRVYC 470

Query: 443 WRALRLLSRKCPHFFLNAT--------------PNVEKNSEFIEN-MVKRCVKEKPSSQI 487
           W+ LR+ +R+    F   T              P+V    +   N   KR  K++P    
Sbjct: 471 WKGLRVSARQDLDGFSRFTDHGIEGVVPLELLPPDVRSKHQAKPNDRSKRAKKDEP---- 526

Query: 488 SGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADP 525
            G  + V+ +  +V + T + EI  E    D EA   P
Sbjct: 527 KGASHQVEDN--QVSIATPASEIDGEGIRTDLEASVTP 562


>gi|156354930|ref|XP_001623433.1| predicted protein [Nematostella vectensis]
 gi|156210130|gb|EDO31333.1| predicted protein [Nematostella vectensis]
          Length = 433

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 106/212 (50%), Positives = 143/212 (67%), Gaps = 7/212 (3%)

Query: 29  DYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFD 88
           D +KK+A +QS R  L+K     D D++  ++ I+L ++      C++TLP+ILLSD FD
Sbjct: 42  DGEKKNAADQSFRD-LVKESLLSDEDVETQKHLITLAIKAAAHGTCSTTLPVILLSDVFD 100

Query: 89  MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
             TLD+C+ LF +VE  V+ W    FF S KN LLRMCNDLLRRLS SQNTVFCGRI LF
Sbjct: 101 TITLDRCDSLFTFVEEQVSTWTMPIFFSSGKNLLLRMCNDLLRRLSASQNTVFCGRIQLF 160

Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKV------KID 202
           LA+ FP SE+SGLN++S+FN++N+T +    E + S N+ E  D  E  ++       +D
Sbjct: 161 LARLFPLSEKSGLNLMSQFNLDNVTTYATTTEEEDSVNDMETGDGTEAGELVQSNLTPVD 220

Query: 203 FNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSY 234
           +N Y+KFW+LQDYFRNP QCY  V WK F+ +
Sbjct: 221 YNLYRKFWALQDYFRNPTQCYTTVGWKTFSKH 252



 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 66/164 (40%), Positives = 94/164 (57%), Gaps = 21/164 (12%)

Query: 342 KLILKGEEHWNQWKNEGCPEL------------------KRPLTSITDEDKKDEPDAKKK 383
           K IL+ EE+W  WKNEGC                     KR  T    E+   E  +KK 
Sbjct: 251 KHILEREENWISWKNEGCLSFIKEKEKEAHDVVKGPKAPKRKRTKSIGEEINAEWQSKKL 310

Query: 384 K--TPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSN 440
           +  + ELT+LWN   DNLEAC + +R F PSLE +FEEAI+Q DP   VE++YK +  SN
Sbjct: 311 QMGSSELTRLWNLCSDNLEACSAEKRVFLPSLEEFFEEAIEQADPDGMVEDEYKLIKKSN 370

Query: 441 YAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPS 484
           + WRALRLL+R+  HFF  ++   +   +++E++V +  +E P+
Sbjct: 371 FGWRALRLLARRSQHFFQPSSNPFKPLPDYLESVVSQLAQELPT 414


>gi|24431597|gb|AAN61477.1| Unknown protein [Oryza sativa Japonica Group]
          Length = 653

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 160/568 (28%), Positives = 251/568 (44%), Gaps = 101/568 (17%)

Query: 35  ALEQSIRQYLLKLIKTPDIDIKVIENY-------------ISLCVELCMKDMCNSTLPII 81
           +LE ++R+ L +L  +     K I  Y             + + + LC +      +   
Sbjct: 89  SLENALRKLLQELASSAVQSGKRIMQYGDNEENNCPITRLLDIVLYLCERGHVEGGMVFQ 148

Query: 82  LLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVF 141
           LL D  +MST+  C+ +F Y+E   ++  +Q  F   K  +LR CN LLRRLS++ + VF
Sbjct: 149 LLEDLTEMSTIKDCKDVFGYIESKQDVLGKQELFGRGKLVMLRTCNQLLRRLSKANDVVF 208

Query: 142 CGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKI 201
           CGRI++FLA FFP SERS LNI   FN  N T++                     +K   
Sbjct: 209 CGRIIMFLAHFFPLSERSALNIKGVFNTSNETKY---------------------EKDAT 247

Query: 202 DFNFYKKFWSL-QDYFRNP-VQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLN---- 255
           D N  +K   L  ++F NP +   N   W+ F S    VL+ F++  L D    LN    
Sbjct: 248 DVNCSRKRIELPNEHFSNPALTAANLTRWQKFVSNLTVVLSTFEAQPLSDDDGKLNNLDQ 307

Query: 256 -PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEG--------- 305
                +  KYLT+ KL+ L+L D +FRR++L+Q LI F +  +  K+  EG         
Sbjct: 308 EEDAAFNIKYLTSSKLMGLELKDPSFRRHILVQCLIFFDFLKAPGKTDKEGPTGSMLNRL 367

Query: 306 --------LELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNE 357
                       +  ++ +    E V  L++  PP G+ F Q ++ IL+ E++W  WK +
Sbjct: 368 LSLISILLTACATGHKEEIDSCEERVKKLLEIIPPKGKDFLQSIEHILEREKNWVWWKRD 427

Query: 358 GCPEL-KRPLTSITDEDKKDEPDAKKKK------TPELTKLWN-SKDNLEACKSAERDFT 409
           GC    K+P      E K  +   +K+K        EL +LW  ++ N  A   +ER   
Sbjct: 428 GCLAFEKQPF-----EKKPGQAGVRKRKPRWRLGNKELAQLWKWAEQNPNALTDSERICM 482

Query: 410 PSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRL--------LSRKCPHFFLNAT 461
           PS+  Y++   + MDP+A +E++Y   N+  Y W+ LR          SR C +      
Sbjct: 483 PSVTEYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGLRFSARQDLEGFSRFCDYGIEGVV 542

Query: 462 PN----VEKNSEFIE---NMVKRCVKEKP---------------SSQISGNGNGVDQDPA 499
           P      E  S+F     +  KR  +E P               + +  G G+G + +  
Sbjct: 543 PQELLPPEVRSKFYSKPGDKAKRPKREDPKGTSAQPKEQQVVSATPETDGGGSGAEPEEG 602

Query: 500 EVEVDTKSEEIQEEEKEEDWEAKADPEG 527
            V +D+ +  I E  K+   E      G
Sbjct: 603 AVPMDSDNAAIDEGRKQSPEEVSGPESG 630


>gi|384252374|gb|EIE25850.1| hypothetical protein COCSUDRAFT_64866 [Coccomyxa subellipsoidea
           C-169]
          Length = 842

 Score =  202 bits (515), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 142/481 (29%), Positives = 220/481 (45%), Gaps = 64/481 (13%)

Query: 33  KSALEQSIRQYLLKLIKTPDIDIKV--------------IENYISLCVELCMKDMCNSTL 78
           ++ LE S+RQ    L+K+  ++  V              I   + L V L  +   ++ L
Sbjct: 55  RTILEHSVRQACASLVKS-AVEHGVNSTQYRHAGASEPSIPRLLDLTVYLGSQRQIDTVL 113

Query: 79  PIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSC----KNNLLRMCNDLLRRLS 134
              ++ D  ++ST+++ E +F Y+E  +   +Q   F S     K ++LR+CN LLRRLS
Sbjct: 114 VFAIMEDVMELSTIEEAEGVFGYLESRIQKLRQVYNFGSATSHSKLSVLRICNQLLRRLS 173

Query: 135 RSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTE 194
           R+ ++  CGRIL+FL +  P  +RSGLN+  +FN  + T      E  V SN        
Sbjct: 174 RASSSELCGRILMFLTRLLPLLDRSGLNVQGQFNTSHSTPVEDVPEGAVDSNGQ------ 227

Query: 195 EVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL------- 247
                 +D  FY  FW LQD F++P        W    +  + VL  F    +       
Sbjct: 228 -----PVDAAFYATFWGLQDVFQHPYTAMEPSKWAATITDIKRVLTEFTKQPVALASSSG 282

Query: 248 DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRG---- 303
            + +  L  +     KYL++ KL+ LQL D   RR+ LLQ LI      + V   G    
Sbjct: 283 GEGEGGLLEAAALTVKYLSSPKLMRLQLQDATLRRHFLLQALIFLHACQNPVLKPGAAAA 342

Query: 304 ---EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
              +   L+S Q    ++    +Y+ +++TP  G HF   V+ IL  E  W +WK+EGC 
Sbjct: 343 QQAKKEALRSKQVLEAEELEGLLYAELQRTPKHGTHFCAAVRTILTRETQWTKWKSEGCQ 402

Query: 361 ELKR----PLTSITDEDKK-----------DEPDAKKKKTPELTKLWN-SKDNLEACKS- 403
             +R    PL+ +                  +P+     +  L+ LWN ++DN    K  
Sbjct: 403 GYERKEVAPLSPLAPSTALQAATAATAATVSDPNYVDTGSQALSTLWNITRDNTSCLKGD 462

Query: 404 AERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKC-PHF--FLNA 460
             +  TPSL+ +    I +MDP   +EEQYK   D  Y W++LR ++R   P F   L A
Sbjct: 463 TNQRSTPSLQEFLAPVILEMDPEEGIEEQYKTKKDEVYTWKSLRQIARASLPVFSKMLTA 522

Query: 461 T 461
           T
Sbjct: 523 T 523


>gi|345324596|ref|XP_001508566.2| PREDICTED: THO complex subunit 1-like [Ornithorhynchus anatinus]
          Length = 287

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 113/242 (46%), Positives = 147/242 (60%), Gaps = 23/242 (9%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIIL 82
           P  + +KK  L+Q+ R  L  LI    I+    EN    ISL +    + +C ++ P +L
Sbjct: 41  PGSENEKKCTLDQAFRGVLEDLI----INQASCENVLAIISLAIGGVTEGICTASTPFVL 96

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFC 142
           L D  D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFC
Sbjct: 97  LGDVLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFC 156

Query: 143 GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDE-EMDVSSNETEETDT-------- 193
           GRI LFLA+ FP SE+SGLN+ S+FN+ENIT F  +E E  +     E+ +         
Sbjct: 157 GRIQLFLARLFPLSEKSGLNLQSQFNLENITVFNTNEQESTLGQKHVEDKEEGMEVEEGE 216

Query: 194 ---EEVD---KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTS-YAETVLAAFKSYK 246
              +EV     + ID+N Y+KFWSLQDYFRNPVQCY KVSWK F   ++   LA    Y+
Sbjct: 217 MGDDEVPTNCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKLFSSRGLAGPAVYE 276

Query: 247 LD 248
            D
Sbjct: 277 HD 278


>gi|328875843|gb|EGG24207.1| putative THO1 protein [Dictyostelium fasciculatum]
          Length = 653

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 148/439 (33%), Positives = 221/439 (50%), Gaps = 50/439 (11%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           D ++ +E SIRQ    + ++  +    I   + L +E+      +S L   L  D FD  
Sbjct: 65  DLRTNIELSIRQLFSDVFRSKSMKKDDINITLQLAIEIADLGYIDSYLIAQLAEDLFDSK 124

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMS-CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFL 149
           ++ +  + F  +E   + + Q  F +S  KN LL++C DLL+RLS+S N   CGRIL+FL
Sbjct: 125 SISESIEFFSLLESRADYFSQDQFMVSRTKNMLLKICIDLLKRLSKSTNPDSCGRILIFL 184

Query: 150 AKFFPFSERSGLN------IISEFNVENITEFGGDEEMDVSSNETEETDTE------EVD 197
           A  FP S+ SGLN      +  E ++ NI +F             EET  E      +  
Sbjct: 185 AFIFPLSDPSGLNPKSLHAVHGELDLTNIMDF-------------EETSPEGKDKDKDKY 231

Query: 198 KVKIDFNFYKKFWSLQDYFRNPVQCY-------------NKVSWKMFTSYAETVLAAFKS 244
           K  ID NFYK+FW LQ YF+ P Q +             NK  W  F    E VL  F +
Sbjct: 232 KTTIDLNFYKQFWGLQSYFQKPFQLFNNANAVNTAALALNKDKWSHFMESIEVVLTTFAT 291

Query: 245 YKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT-STVKSRG 303
           +   D  ++++ S  YF KYLT+  L+ LQL D+ FRR +L Q LI FQ+   +T+KS  
Sbjct: 292 HINLDELATMSKSH-YFTKYLTSSNLIKLQLKDSTFRRNILTQLLITFQFIEFNTIKSPN 350

Query: 304 EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWK-NEGCPEL 362
               L   Q++ +   T     +I+QT P+GEHF + + LILK E +W  WK + GC   
Sbjct: 351 ---LLNDSQKQLLTTLTNKAKKIIEQTQPNGEHFMKSLALILKRETNWLTWKMDGGCKAF 407

Query: 363 KRPLT-SITDEDKKDEPDAKK--KKTPELTKLWN-SKDNLEACKS-AERDFTPSLESYFE 417
            RP   SI  + +K    + K    T EL++LWN S DN E+ K+  + +    +   ++
Sbjct: 408 GRPAAQSIVVKKRKLRKTSTKVMMGTQELSRLWNLSSDNTESLKTEIKLNVMDYIAPMYD 467

Query: 418 EAIQQMDPAAAVEEQYKKV 436
           E IQQ + A   +E+  ++
Sbjct: 468 EMIQQEEAAKKEQERLARL 486


>gi|440793799|gb|ELR14970.1| THO1 protein (nuclear matrix protein p84), putative, partial
           [Acanthamoeba castellanii str. Neff]
          Length = 451

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 109/314 (34%), Positives = 174/314 (55%), Gaps = 18/314 (5%)

Query: 62  ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN 121
           I + + L  + M  S  P+ L+ D F   T+++C+Q F  +E   +   +     +  NN
Sbjct: 116 IDIALWLGREKMIESAAPLELVRDFFLSHTIEECQQFFALLEDRADQLSRAIVGDTRANN 175

Query: 122 -LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE 180
            LLR  +++ +RLSR+ + VF GRIL+F+A  FP S++SG+N+  E NV N+T    D+E
Sbjct: 176 TLLRALSEVRKRLSRTNDLVFAGRILMFMAYAFPLSDKSGVNLKGEANVANVTTM--DKE 233

Query: 181 MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
              +  ET E  +E      +D+ FY+ FW LQ YF NP++ +   +W++ T +  +VL 
Sbjct: 234 PSPALEETPEDKSEP----PVDWTFYQMFWGLQSYFLNPIKAFE--AWQVVTGHLNSVLD 287

Query: 241 AFKSYKLDDVQSSL------NPSGD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLIL 291
            FK Y + + ++S       NP      Y  KYLT+ +LL LQL D  FRR++++QFL+ 
Sbjct: 288 LFKQYPISEEEASAASYSYANPPAASECYLTKYLTSSRLLRLQLRDPWFRRHIMVQFLVF 347

Query: 292 FQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
           FQ      +S     +L   Q+  +   T+   + + +TPP+G HF+  +  ILK E +W
Sbjct: 348 FQSLRVVSESSTVAQKLTEQQKNEIAGLTKKTKTSLWETPPNGMHFTNTIYAILKRENNW 407

Query: 352 NQWKNEGCPELKRP 365
             WK EGCP  ++P
Sbjct: 408 TDWKKEGCPPFEKP 421


>gi|268568954|ref|XP_002648141.1| C. briggsae CBR-THOC-1 protein [Caenorhabditis briggsae]
          Length = 762

 Score =  192 bits (487), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 153/542 (28%), Positives = 240/542 (44%), Gaps = 117/542 (21%)

Query: 60  NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQ-------Q 112
           N++  C +L     C+ + PI+ L D  ++S++++C+ LF  VE N+N +KQ       Q
Sbjct: 62  NFVMNCTKL---GYCSKSTPILTLQDILEVSSIEECKHLFGLVEENMNEFKQVKWLEMDQ 118

Query: 113 TFFMSCKNN-----------------LLRM---------CNDLLRRLSRSQNTVFC---- 142
            F  S K                   L R          C +L  R S ++ ++F     
Sbjct: 119 NFPDSSKRTKTSFSASATTFSVDSAELRRRRFVEESCSSCPELENRFSCTKMSIFSIFRP 178

Query: 143 ---GRILLFLAK-----------------FFPFSERSGLNIISEFNVENITEFGGDEEMD 182
               +I  F  K                 F P +E+SG+N +  FN  NIT F  +E   
Sbjct: 179 ENGPKIRKFEPKTQFADLITPQKMPKSRRFLPLTEKSGVNFMGHFNTLNITTFEENEPTA 238

Query: 183 VS-------------SNETEETDTEEVD----------KVKIDFNFYKKFWSLQDYFRNP 219
            S             S  T   + E+++          ++ +    Y++FWSLQ +  NP
Sbjct: 239 ESLLAATSSHHAATPSGATPGAEGEDMETGEIEEEAAKEIIVTPEMYRQFWSLQKFMSNP 298

Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLDD---------VQSSLNPSGD-YFAKYLTNQK 269
              Y K  + +F      VL+   + KL+           +S    + D +F KYLT+ K
Sbjct: 299 NLLYEKEKFVVFKKDLTAVLSLMTANKLEKFAAEDESEVAESRKKINNDVFFTKYLTSPK 358

Query: 270 LLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEG-----------------LELKSDQ 312
           LL LQL+D++FRR  L+Q +I+FQY  +  + +                    + L  +Q
Sbjct: 359 LLALQLNDSSFRRNFLIQTIIIFQYLNAESRFKPPANLKIPQNSPQNPIFSRKMVLNEEQ 418

Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL--KRPLTSIT 370
            K+V D  +  Y L+  T P G  F   +K I++ E  WNQWKN  CP+L  K    ++ 
Sbjct: 419 SKFVHDYEDKCYRLLSDTMPRGTAFVAGLKRIMQRETEWNQWKNSNCPDLSEKADKGAMQ 478

Query: 371 DEDKKDE----PDAKKKKTPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDP 425
              K+      P+     TPELTKLW    D L+ACKS +R F P+L  +  + + +MDP
Sbjct: 479 MYKKRQRVPFNPNNLDLGTPELTKLWTYEPDVLKACKSDKRKFIPTLADFIRDPLDEMDP 538

Query: 426 AAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSS 485
              VEEQYK +N+ ++ WRA RLL  K P +     P  +  +  + N   R  + + +S
Sbjct: 539 EQQVEEQYKSINEVSFQWRAARLLMHKAPGYVSKTEPKADPTTNMLWNRGTREKRRRETS 598

Query: 486 QI 487
           ++
Sbjct: 599 RV 600


>gi|390368636|ref|XP_786383.3| PREDICTED: THO complex subunit 1-like [Strongylocentrotus
           purpuratus]
          Length = 601

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 124/311 (39%), Positives = 163/311 (52%), Gaps = 59/311 (18%)

Query: 219 PVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP--------------------SG 258
           P  CY+  +       A  VL AF S KL+D+ S                        SG
Sbjct: 159 PFLCYHDQN-------AIAVLEAFHSMKLEDIASGKKKKKRRSHHRQQQKQGEEEEADSG 211

Query: 259 DY--FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWV 316
           D   FAKYLT++KL DLQLSD  FRRYV +QFLILFQY    VK +     L  +   ++
Sbjct: 212 DQVSFAKYLTSEKLFDLQLSDAMFRRYVYVQFLILFQYLNQQVKFKQAHHVLTDEMSLFI 271

Query: 317 KDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP---------ELKRPLT 367
           K+T + V +L+K+TPP G+ F+  ++ IL  EEHWN WKNEGCP         E  RP +
Sbjct: 272 KETKDKVVALLKETPPYGDQFTAGIEHILSREEHWNAWKNEGCPSYVKEKGKTEQARPKS 331

Query: 368 SITDEDKKDE---PDAKKK---KTPELTKLWNS-KDNLEACKSAERDFTPSLESYFEEAI 420
                   +E     A KK    +PELTKLWN    NLEAC +AE    P          
Sbjct: 332 RARKRTLGEELNLAGAAKKIDLGSPELTKLWNVYPSNLEAC-AAEDRVVPD--------- 381

Query: 421 QQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNAT---PNVEKNSEFIENMVKR 477
            Q DP A +E +YK VN+SNYAW+A+RLL+++ PHFF   T   P ++    ++E MV +
Sbjct: 382 -QADPEAMIEPEYKVVNNSNYAWQAIRLLAKRSPHFFQTVTATNPPLKTVPGYLEYMVTK 440

Query: 478 CVKEKPSSQIS 488
             KE P ++ S
Sbjct: 441 IAKELPQNEES 451



 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 39/103 (37%), Positives = 57/103 (55%), Gaps = 1/103 (0%)

Query: 29  DYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFD 88
           D +KK+AL+Q+ R Y+ KL+           + + L  E     +C+   P +LL+D FD
Sbjct: 44  DAEKKTALDQAYRNYVKKLV-VGGSSCSDFTSLVHLSTEAVEAGLCSPATPFLLLTDIFD 102

Query: 89  MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLR 131
             T+  C+ +F +VE  V  WKQ  F+ S KN LLRMCN  L+
Sbjct: 103 SVTISVCKDVFQFVEDRVATWKQSDFYSSGKNYLLRMCNGRLQ 145


>gi|147775061|emb|CAN77205.1| hypothetical protein VITISV_014783 [Vitis vinifera]
          Length = 507

 Score =  187 bits (476), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 131/412 (31%), Positives = 185/412 (44%), Gaps = 111/412 (26%)

Query: 89  MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
           MST+  C+ +F Y+E   +I  +Q  F   K  +LR CN LLRRLS++ + VFCGRIL+F
Sbjct: 1   MSTMRNCKDIFAYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMF 60

Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKK 208
           LA FFP SERS +NI   FN  N T++              E D  E   + IDFNFYK 
Sbjct: 61  LAHFFPLSERSAVNIKGVFNTSNETKY--------------EKDAPE--GISIDFNFYKT 104

Query: 209 FWSLQ----------------------------------DYFRNPVQ-CYNKVSWKMFTS 233
           FWSLQ                                  ++F NP         W+ FTS
Sbjct: 105 FWSLQXLDWGCSGEXKTYMLFGKFGPDGGYMFRDQSVNVEHFCNPASISLAPTKWQKFTS 164

Query: 234 YAETVLAAFKSYKLDDVQSSLN----PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
               VL  F++  L D + + N     +  +  KYLT+ KL+ L+L D +FRR++L+Q L
Sbjct: 165 NLMVVLNTFEAQPLSDEEGNANNLEEEAATFSIKYLTSSKLMGLELKDPSFRRHILVQCL 224

Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
           ILF Y  +  K+     +L SD   W                                  
Sbjct: 225 ILFDYLKAPGKNDK---DLPSDSMVW---------------------------------- 247

Query: 350 HWNQWKNEGCPELKR-PLTSITDEDKKDEPDAKKKKTP-------ELTKLWNSKD-NLEA 400
               WK +GCP  +R P+      +KK   D  KK+ P       EL++LW   D N  A
Sbjct: 248 ----WKRDGCPPFERQPI------EKKAVQDGAKKRRPRWRMGNKELSQLWKWADQNPNA 297

Query: 401 CKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
               +R  TP++  Y++   + MD +A +E +Y   N+  Y W+ LR  +R+
Sbjct: 298 LTDPQRVRTPAVSEYWKPLAEDMDLSAGIEAEYHHKNNRVYCWKGLRFAARQ 349


>gi|114145453|ref|NP_001041315.1| THO complex 1 [Rattus norvegicus]
 gi|37361826|gb|AAQ91026.1| LRRGT00070 [Rattus norvegicus]
          Length = 499

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 101/247 (40%), Positives = 140/247 (56%), Gaps = 35/247 (14%)

Query: 62  ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN 121
           ISL +    + +C ++ P +LL D  D   LD+C+ +F +VE NV  WK           
Sbjct: 31  ISLAIGGVTESVCTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNVATWKSN--------- 81

Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
              +  D   RL               L+    + ++  L +++E+ ++       +E M
Sbjct: 82  ---LSGDSYIRL---------------LSACTSWRQQYCLGLVAEYGLDPQHTEDREEGM 123

Query: 182 DVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAA 241
           DV   E  + +      + ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA 
Sbjct: 124 DVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAV 183

Query: 242 FKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQ 293
           FKSYKLDD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+LILFQ
Sbjct: 184 FKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYLILFQ 243

Query: 294 YFTSTVK 300
           Y    VK
Sbjct: 244 YLKGQVK 250



 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 43/87 (49%), Positives = 57/87 (65%)

Query: 397 NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHF 456
           NL    +  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHF
Sbjct: 290 NLTVYFTMTREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHF 349

Query: 457 FLNATPNVEKNSEFIENMVKRCVKEKP 483
           F       +   E++ENMV +  KE P
Sbjct: 350 FQPTNQQFKSLPEYLENMVIKLAKELP 376


>gi|348686117|gb|EGZ25932.1| hypothetical protein PHYSODRAFT_540491 [Phytophthora sojae]
          Length = 569

 Score =  185 bits (469), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 135/430 (31%), Positives = 206/430 (47%), Gaps = 66/430 (15%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           +   + LC+     +   ++ P  +L D  +  T+  CE ++  +E   +   + T   +
Sbjct: 82  VPALLDLCIAGVTSNFLINSTPYKVLEDLMEGQTISTCEMVWELLESRKD---KLTTGRT 138

Query: 118 CKNNL--LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF 175
            K +L  LRMCN LLRRLS++ N+VFCG+IL+FL+  F  SERS +N+  + NV NI   
Sbjct: 139 TKASLCLLRMCNALLRRLSKTHNSVFCGKILVFLSFTFALSERSAVNLTGKANVTNID-- 196

Query: 176 GGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFR-NPVQCYNKVSWKMFTSY 234
             D   DV                 ID+N Y+ FW LQ +FR + +  ++  +W+ F + 
Sbjct: 197 -NDPSADVGP---------------IDYNLYRTFWDLQSFFREHQMATHSAENWEKFFTE 240

Query: 235 AETVLAAFK--SYKLDDVQ---------------------SSLNP----SGDYF--AKYL 265
              VLAAF+  ++  DD++                     SS NP    S ++F   KYL
Sbjct: 241 LNVVLAAFEGNAFSPDDLERSRDLSGGGPIHSADVIMHEDSSTNPAEQGSQEHFFQPKYL 300

Query: 266 TNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYS 325
           TN +L  LQL D   R  +L QFLILF          G      + + K+  D TE V +
Sbjct: 301 TNSRLFRLQLRDPILRECMLTQFLILFNDLARAKPPAGS----TTPKAKFA-DLTERVVA 355

Query: 326 LIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK- 384
           L+KQTP DGE FS++V  +L+ E +W +WK E C E ++  ++   E    +P  K+ + 
Sbjct: 356 LLKQTPSDGEGFSEMVSYVLERERNWVKWKQEKCREYEKYPSAADKEGSAPKPVVKRARR 415

Query: 385 ---TPELTKLWNSKDN----LEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
              +P L ++ +        LE  K  ER     L +Y E   +  DP   +EE+Y    
Sbjct: 416 QLTSPLLEQILSESSKPSQILEKIKGKERASEVPLSAYTERFKEAWDPENGIEEEYWPDK 475

Query: 438 DSNYAWRALR 447
           D    WR +R
Sbjct: 476 DEMVCWRTMR 485


>gi|428175960|gb|EKX44847.1| hypothetical protein GUITHDRAFT_139464 [Guillardia theta CCMP2712]
          Length = 662

 Score =  181 bits (459), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 148/550 (26%), Positives = 246/550 (44%), Gaps = 98/550 (17%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEV------------- 104
           +   + L +   M +M    LP++ + + F+  T ++C + F Y+E              
Sbjct: 94  VVGLLDLSMHAAMSNMVPRQLPLMAMHELFEGQTAEQCSKAFSYLETVDHTRLKSSAELV 153

Query: 105 ----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSG 160
                +    +Q      +  LLR CN L++RLS++ + + CGR+L+FLA   P SE+SG
Sbjct: 154 QRVEKMRDLTEQGEHKYSQAALLRCCNTLMQRLSKTCDLLLCGRVLMFLAHVLPLSEKSG 213

Query: 161 LNIISEFNVENITEFGGD-EEMDVSSNETEETDTEEV----------DKVKIDFNFYKKF 209
           +N+  EFN  N+TEF  + E++D +  E E+ D  ++           ++++ F+ YK F
Sbjct: 214 VNLKGEFNKANVTEFQENLEDIDYTKMEDEDKDESKIYLKSTDEKLDGQMEVSFSLYKTF 273

Query: 210 WSLQDYFRNPVQCYNKVSW--KMFTSYAETVLAAFKSYKLDDVQS-SLNPSGD---YFAK 263
           W LQ Y  +P   ++       M       VL  F SY L D +  ++  S D   YF  
Sbjct: 274 WGLQKYLHDPSALFSGPELVDSMIQGMT-LVLDCFSSYSLSDTEGIAVGHSADREFYFPA 332

Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYF-------TSTVKSRGEGL-ELKSDQEKW 315
           +LT+ KL+ L+L D  FRR VL+Q+LI+ QY          T+     GL E    Q++ 
Sbjct: 333 FLTSTKLVALELQDPIFRRQVLVQYLIVLQYLLDRAVHPAKTLDVLRTGLSEFTKKQKED 392

Query: 316 VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKK 375
           +    +T + L+   PP G+ F + ++ IL+ E +W +WK     + +     + D    
Sbjct: 393 LNKLMDTTFELLGNIPPSGKEFVETLRKILQREANWTRWKKSKVADGEDVAIKVGD---- 448

Query: 376 DEPDAKKKKTPELTKLW-------NSKDN-----------LEACKSAERDFTPSLE---- 413
                K+K       LW        +KDN           LE CK  ER    S E    
Sbjct: 449 -----KRKSMENAGALWKRRFFTMKTKDNFVPASFENGSLLEICKPKERKSNSSAEDIAN 503

Query: 414 -------SYFEE--AIQQM--DPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATP 462
                  +  EE  AI +M  DP   +E++++K N   + W  LR+L+    + F  AT 
Sbjct: 504 GLGNPNSTKLEERLAIVRMEADPENYIEQEFRKNNVPCWRWVTLRMLASYDINMFDKATA 563

Query: 463 NVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAK 522
                ++++E +V         + + G G    +D A    + K  E +E E++E  + +
Sbjct: 564 ----ENKYLEGVV---------NHMDGKGKETHKDKAPAGDEAKEGEQKEGEQKEADQKE 610

Query: 523 ADPEGDADEV 532
                +A EV
Sbjct: 611 GGEVKEAGEV 620


>gi|301111868|ref|XP_002905013.1| THO complex subunit, putative [Phytophthora infestans T30-4]
 gi|262095343|gb|EEY53395.1| THO complex subunit, putative [Phytophthora infestans T30-4]
          Length = 595

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 132/433 (30%), Positives = 203/433 (46%), Gaps = 69/433 (15%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           +   + LC+         ++ P  +L D  +  T++ CE+++  +E   +   + T   +
Sbjct: 82  VPALLDLCIAGVTSGFLVNSTPYKVLEDLMEGQTINTCEKVWDLLESRKD---KLTTGRT 138

Query: 118 CKNNL--LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF 175
            K +L  LRMCN LLRRLS++ N+VFCG+IL+FL+  F  SERS +N+  + NV NI   
Sbjct: 139 TKASLCLLRMCNALLRRLSKTHNSVFCGKILVFLSFTFALSERSAVNLTGKANVTNI--- 195

Query: 176 GGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFR-NPVQCYNKVSWKMFTSY 234
             D   DV                 ID+N Y+ FW LQ +FR + +   +  +W+ F + 
Sbjct: 196 DSDPSADVGP---------------IDYNLYRTFWDLQGFFREHQMATQSAENWEKFFTE 240

Query: 235 AETVLAAFK--SYKLDDVQSSLNPS---------------GD--------------YFAK 263
              VLAAF+  ++  DD++ S + +               GD              +  K
Sbjct: 241 LNVVLAAFEGNAFSPDDLERSRDLTEGPIHSADIAMHESDGDSDKNATDHNTQEHFFQPK 300

Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETV 323
           YLTN +L  LQL D   R  +L QFLILF          G      +  +  + + TE V
Sbjct: 301 YLTNSRLFRLQLRDPILRECMLTQFLILFNDLARAKPPLG-----STTPKAKLAELTERV 355

Query: 324 YSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTSITDEDKKDEPDAKK 382
            +L+KQTP DGE FS++V  +L+ E +W +WK E CP  +R P     D     +P  K+
Sbjct: 356 VALLKQTPSDGEGFSEMVTYVLERERNWVKWKQEKCPGYERYPSAKEKDASTASKPVVKR 415

Query: 383 KK----TPELTKLWNSKDN----LEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYK 434
            +    +P L ++ +        LE  K  ER    SL +Y +   +  DP   +EE+Y 
Sbjct: 416 ARRQLTSPLLEQILSESSKPSQILEKIKGKERATEVSLATYTDRFKEAWDPENGIEEEYW 475

Query: 435 KVNDSNYAWRALR 447
              D    WR +R
Sbjct: 476 PDKDEMVCWRTMR 488


>gi|255936127|ref|XP_002559090.1| Pc13g06570 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211583710|emb|CAP91726.1| Pc13g06570 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 706

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 163/598 (27%), Positives = 278/598 (46%), Gaps = 122/598 (20%)

Query: 32  KKSALEQSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFD 88
           K +A+E + R+   +++ +  ID      I N + +       + C   L   L+ +  D
Sbjct: 63  KLAAVEIAFREKFYRVLASTSIDDPAFIQIWNLLDIVSIFSDNEQCEPGLIFWLIEELLD 122

Query: 89  MSTLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCG 143
             T+  C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCG
Sbjct: 123 SQTIGGCRKVFDYLESRRERNTKKHFKQKSLV------ILRSCNELLRRLSRAEDTVFCG 176

Query: 144 RILLFLAKFFPFSERSGLNIISEFNVENITEF-------GGDEEMDVSSNET-------- 188
           R+ ++L + FP  ++S +N+  EF+ EN+T F          +E D +  E         
Sbjct: 177 RVFIYLFQSFPLGDKSSVNLRGEFHSENVTTFDEIVKESSAKQEQDTTMTEGAEPSAPGE 236

Query: 189 -----EETDTEEVDKV--------------KIDFN-FYKKFWSLQDYFRNPVQCYNKVSW 228
                 ET ++E  KV              ++D +  Y  FWSLQ YF  P   ++   +
Sbjct: 237 AAAGQAETTSQETSKVPKVVVSTGDEKKSDEVDLDALYPLFWSLQAYFSAPSNTFDAERF 296

Query: 229 KMFTSYAETVLAAFKSYKLD--------------------------DVQSSLNPSGDYFA 262
             F +  E  LAAFK+   +                          +V +S NP      
Sbjct: 297 TSFKTGMEATLAAFKNINTEMENESSARASEDARKSNKRKRIADGSEVATSFNP------ 350

Query: 263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKSRGE---------GLELKS 310
           KYLT++ L DL++SDT FRR+VL+Q LIL  +    TS  KSR           G  L  
Sbjct: 351 KYLTSRDLFDLEISDTAFRRHVLVQALILLDFVLSLTSKSKSRLANTTNKSVLYGFTLNE 410

Query: 311 DQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS-- 368
           +  KW     +++   ++Q  P G+ + ++V  +L  +++W +WK EGCP ++RP  S  
Sbjct: 411 EDTKWATSMRKSIEGYLQQG-PGGKFYYRMVDTVLSRDKNWVRWKAEGCPLIERPAISAA 469

Query: 369 --ITDEDKKDEPDAKKKKTP------ELTKLWNSK--DNLEACKSAERDFTPSLESYFEE 418
              T  +   +  A K+  P      +L  L +++   N+E  K ++R   PS +S+ + 
Sbjct: 470 EYTTSREHATKAYANKRLRPSPMGSLDLKFLADTEALANIERLKESDRYSVPSADSFLKG 529

Query: 419 AI-QQMDPAAAVEEQYKKVNDSNYA---WRALRLLSRKCPHFFLNATPNVEKNSEFIENM 474
            +  +MD   A+ ++ K + +   A   WR LRL ++     F       E N+  ++ +
Sbjct: 530 IMDDEMDIDMAMTDEDKAIAEGAKASKSWRILRLAAKGKLAAFDKIG---EGNN--LQAL 584

Query: 475 VKRCVKEKPSSQISGNGNGVD----QDPAE-VEVDTKSEEIQEEEKEEDWEAKADPEG 527
            +     + ++++S +G+  D     D A+ + V TK E +++EE+ E  EA A+PEG
Sbjct: 585 FESATATEDNNELSPSGDTKDISQSLDGAQGLPVKTK-EPVEDEERAEVTEA-AEPEG 640


>gi|255087913|ref|XP_002505879.1| predicted protein [Micromonas sp. RCC299]
 gi|226521150|gb|ACO67137.1| predicted protein [Micromonas sp. RCC299]
          Length = 522

 Score =  179 bits (453), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 132/396 (33%), Positives = 192/396 (48%), Gaps = 48/396 (12%)

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
           T+  CE +F +VE           +   K  +LR CN+L RRLS++ +TV  GR+LL L+
Sbjct: 34  TIADCEDVFTWVEARRGRLSADALWRRGKLVMLRTCNELQRRLSKTADTVLRGRVLLLLS 93

Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFW 210
             +P SERS LN+   +N  N TE    +E+D  + E  + +  + + V +D  FY+ FW
Sbjct: 94  SLYPLSERSALNLGGNYNAGNTTEL---DELDADTAEATDANA-KAEGVVVDRAFYETFW 149

Query: 211 SLQDYFRNPVQCYNKVS-WKMFTSYAETVLAAFKSYKLDDVQSSL--------NPSG--- 258
           SLQ +F NP       + WK F      VL  F++++LD + S           PSG   
Sbjct: 150 SLQKFFSNPPAALEPAAGWKTFCVSLRLVLDNFETHRLDKMSSGGVTPGVKPSVPSGATY 209

Query: 259 -----DYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQE 313
                D   KYLT   LL LQL D  FRR+ L+Q  +   + TS +    +G++      
Sbjct: 210 VEAGIDAGTKYLTAAPLLHLQLRDPAFRRHFLVQCAVFLGFCTSPLY---KGVDAAD--- 263

Query: 314 KWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE------------ 361
              +D  + V   I+QTPP GE F+  V L L+ EE W  WK + C +            
Sbjct: 264 --ARDAKKRVMEAIRQTPPHGEEFAAAVALALRREEGWVLWKRDNCKDFEREPEAPRPPP 321

Query: 362 -----LKRPLTSITDEDKKDEPDAKKKKTPELTKLWN-SKDNLEACKSAERDFTPSLESY 415
                ++R L               +   PEL +LWN S+DN+ A    +RD  P+ E Y
Sbjct: 322 PAPPVMRRRLRPGAAAPAVPPEKRVRLGNPELDRLWNLSEDNVSALGD-KRDAAPTAEVY 380

Query: 416 FEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
            +  +  MDP A +EE YK  ND  Y W+ALRL+++
Sbjct: 381 LQNVVDDMDPEAQIEETYKSKNDKTYVWKALRLIAK 416


>gi|328770843|gb|EGF80884.1| hypothetical protein BATDEDRAFT_24368 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 793

 Score =  178 bits (451), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 130/475 (27%), Positives = 215/475 (45%), Gaps = 86/475 (18%)

Query: 57  VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFM 116
           ++++   + +    +   +  +   L+ D  D  T+D  + LF Y++  V I    T  M
Sbjct: 147 IVQDLWDIAIIAGQQGFTDDVIAFRLIHDFLDFMTVDCTDSLFSYLDSRVEI---MTLGM 203

Query: 117 SCKNN----LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSG------------ 160
             +      LLR CND LRR+S+S++ V CG++L  ++  +P SERSG            
Sbjct: 204 QSERGRGLFLLRFCNDFLRRMSKSKHNVVCGKLLTLVSNTYPLSERSGKLLIWIFCPFDR 263

Query: 161 LNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPV 220
           +N+  E+NV N+T F    E   +SNE ++     +D +K    FY  FWSLQ+YF NP 
Sbjct: 264 VNVRGEYNVGNVTVF----EPASNSNEMDDNKEPSLDSLKTKAEFYHCFWSLQEYFSNPT 319

Query: 221 QCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ--SSLNPSGD------------------- 259
             ++   +  F +  E V+  F     DD+Q  S  N   D                   
Sbjct: 320 LLFDISHFTKFQTGVEKVVKIF-----DDIQKESDQNTGSDRERRKVGEKKHRVPTIADA 374

Query: 260 ----YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGL--------- 306
               +F K+LT+  L DL+L+D  FRR +LLQ +I+FQ+ +S V    E +         
Sbjct: 375 RNDYFFPKFLTSTNLFDLELNDPFFRRQILLQIMIVFQFLSSLVSMEKEKIVKQFSEIAQ 434

Query: 307 ----------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN 356
                      L +DQE WV +     + ++++  P+G+ FS+ +  +L  E +W +WK 
Sbjct: 435 SINRAIQYAYVLTADQESWVSECRSRSFKVLERISPNGKKFSKTLLTVLMHERNWLKWKL 494

Query: 357 EGCPELKRPLTSI--TDEDKKDEPDAKKKKT----PELTKLWNSKDNLEAC--KSAERDF 408
             C   ++P+  +  T + +K E  A   K+     E+T LW    +L+      A R  
Sbjct: 495 GSCSSYEKPIIELGTTKKRRKAEHSAAANKSWLGNQEMTNLWERGRDLKQVLRDKASRSI 554

Query: 409 TPSLESYFEEAIQQMDP-----AAAVEE-QYKKVNDSNYAWRALRLLSRKCPHFF 457
             SL+ + +E   Q++      A  +E+  Y   N+  Y W A R   +   + F
Sbjct: 555 VTSLDEHLKELDYQLEEDCETLAEGIEDPSYVHYNNMRYNWIAYRTAIKSHAYLF 609


>gi|345569656|gb|EGX52521.1| hypothetical protein AOL_s00043g15 [Arthrobotrys oligospora ATCC
           24927]
          Length = 743

 Score =  177 bits (448), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 143/484 (29%), Positives = 225/484 (46%), Gaps = 63/484 (13%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKV-----IENYISLCVELCMKDMCNSTLPIILLSD 85
           D KS +E + R+ L  ++   D++  +     +  +  L + L   ++ +  LP  L+ +
Sbjct: 207 DVKSCMECASRELLYDIVTDFDLEFSLEPFIQLYQFFDLILTLSELEISDPQLPFALIEE 266

Query: 86  TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN-LLRMCNDLLRRLSRSQNTVFCGR 144
           T D   +D C+++F Y+E  +            K   LLR+CN+LLRRLS+S++TVFCGR
Sbjct: 267 TLDTQPIDNCKRIFSYLEARIERLTVGVDGTKGKGIILLRLCNELLRRLSKSEDTVFCGR 326

Query: 145 ILLFLAKFFPFSERSGLNIISEFNVENITEF-------GGDEEMDVSSNETEETDTEEVD 197
           I +FL K FP SERSG+N+  EF+VEN T F        G++    S +  +   +E   
Sbjct: 327 IFVFLTKSFPLSERSGVNLRGEFHVENKTTFDERWSQLAGNQPDHFSPSLRDNDSSEAAL 386

Query: 198 KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKS----------YKL 247
                   Y   WS Q  F  P++ + K     F    +TV+ AF+S             
Sbjct: 387 AADACNRLYAVLWSTQHEFAEPIRLFQKDILDNFKESLDTVIKAFRSSIDEYGHNAPVVA 446

Query: 248 DDVQSSLNPSGD----------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS 297
            D + S     D          Y  KYLT+++L DL++ D  FRR++L+QFLI+ ++  S
Sbjct: 447 SDTRRSHKRKWDGEVKHDELDSYNPKYLTSRELFDLEVRDLVFRRHILVQFLIVIEFLLS 506

Query: 298 TVKSRGEGLE-------------LKSDQEKWVKDT-TETVYSLIKQTPPDGEHFSQVVKL 343
                 + LE             L  + +KW +    + V +L  ++  DG  FS+ V  
Sbjct: 507 LSPKYKKYLEAEAKNRSVQFAYTLSEEDDKWAEAAKKDVVNALNYKSGMDGLLFSRTVDS 566

Query: 344 ILKGEEHWNQWKNEGCP--ELKRPLTSITDEDKKDEPDA--------KKKKTPELTKLW- 392
           +L  E++W +WK E C   E++   T      K    +A        K    P LT++W 
Sbjct: 567 VLTREKNWVKWKAENCVSFEMEPASTEHVAVSKSAGHEAATLINNLGKVVGAPALTRVWF 626

Query: 393 -NSKDNLEACKSAERDFTPSLESYFEEA-IQQMDPAAAVEEQYKKVND---SNYAWRALR 447
            NS +      +A++   P+  SY E   +  +D   A  EQ K+  D   SN  WR+LR
Sbjct: 627 DNSLEVTSIRSNAQKINLPAYRSYTESIEVYHLDKDFASTEQEKEDIDESISNVTWRSLR 686

Query: 448 LLSR 451
           + SR
Sbjct: 687 IASR 690


>gi|296412446|ref|XP_002835935.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295629732|emb|CAZ80092.1| unnamed protein product [Tuber melanosporum]
          Length = 684

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 150/514 (29%), Positives = 234/514 (45%), Gaps = 117/514 (22%)

Query: 55  IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTF 114
           + V  N++ L + L   ++  + LP+ L+ + FD  T+  CE LF Y+E  ++       
Sbjct: 90  LPVFCNFLDLSLVLAELEISGAGLPLDLIEELFDSQTIPSCECLFQYLESRIDRVTAGVE 149

Query: 115 FMSCKN-NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENIT 173
               K   LLR+CN+LL+RLS++++T+FCGRIL+FL+K FP  ERSG+N+  +FNV+N+T
Sbjct: 150 GGKGKALILLRLCNELLKRLSKTEDTIFCGRILIFLSKSFPIGERSGVNLRGDFNVDNVT 209

Query: 174 -------------------------------EFG--GDEEMDVSSNETEET-------DT 193
                                          E G  G  E D+ S  + +        DT
Sbjct: 210 VYDDILQEPPTVVDGMELDIKKEEMEVDQKKEGGSTGAGEADIGSARSPKRLKVEAGQDT 269

Query: 194 EEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQS 252
            + +   +D N  Y  FWSLQ  F +P + + K + + F       +A FK  + D +++
Sbjct: 270 SQKEPTVLDANTLYPIFWSLQHDFADPQRLFVKDNLEKFKVGLGATMAKFKKAEDDSIKT 329

Query: 253 SL--------------NPS------------------GDYF-AKYLTNQKLLDLQLSDTN 279
           +                P                   G+ F  KYLT+++L +L++SD  
Sbjct: 330 TGKVDVANGPKKADEKKPGVSLGEKRKQIDGDDDDIKGEGFNPKYLTSRELFELEISDLT 389

Query: 280 FRRYVLLQFLILFQYFTS-TVKSRGEGLELKS-------------DQEKWVKDTTETVYS 325
           FRR+VL+Q LIL  +  S T +++G+  +LK+             + EKW   T E +  
Sbjct: 390 FRRHVLVQALILIDFLLSLTPQAKGKWKDLKTPNRSVQYAFTLGLEDEKWAATTKEQI-- 447

Query: 326 LIKQTPPD--GEHFSQVVKLILKGEEHWNQWKNEGC-PELKRPLTSITDEDKKDE----- 377
            ++   PD  G  F++ +K +L  E++W +WK E C P    PL     ED K +     
Sbjct: 448 -LRSLEPDVQGRLFTRTIKTVLIREQNWVRWKAESCHPFDMPPLGDADIEDAKQKAVKAC 506

Query: 378 --PDAKK--KKTPELTKLWNSKDN---LEACKSAERDFTPSLESYFE-------EAIQQM 423
             P++ K    TP L  LW +  +   LE      R   PS ES+ +       E +  M
Sbjct: 507 EIPESYKYTMGTPTLNLLWQATGDLTGLEGLTDPTRHNLPSPESFRQPVKDAELELLDAM 566

Query: 424 DPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
            P      Q KK  ++   W+ALRL SR   H F
Sbjct: 567 FPDDIELAQTKKFTNT---WKALRLASRDRFHLF 597


>gi|384497532|gb|EIE88023.1| hypothetical protein RO3G_12734 [Rhizopus delemar RA 99-880]
          Length = 809

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 137/490 (27%), Positives = 223/490 (45%), Gaps = 78/490 (15%)

Query: 32  KKSALEQSIRQYLLKLIKTPDID--IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDM 89
           +KSA E + R+ L++L+   + +   K I +   + +     D+ +  +P+    + FD+
Sbjct: 50  RKSATELAFRRALIRLVDGEETEERFKRIFDCCDIVLYCSEIDLLDQVIPLSFTEELFDL 109

Query: 90  STLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRLSRSQNTVFCGRILLF 148
            T + CE++F YVE               K   LLRMCN+LLRRLS+  NTVFCGRIL+F
Sbjct: 110 HTTEGCEKIFDYVEQRKTRLTANMVPGKGKGLVLLRMCNELLRRLSKEINTVFCGRILMF 169

Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKK 208
           LA  FP  ERSG+N+  +FN  N   F  DEE+D     T+E              FYK 
Sbjct: 170 LANSFPLGERSGVNLRGDFNT-NAIHFDTDEEVDADPTLTDEQKA-----------FYKL 217

Query: 209 FWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSY----------KLDDVQSSLNPSG 258
           FWS + YF NP   +   +++        ++  F+S           +  +V  +     
Sbjct: 218 FWSTRIYFANPPSVFTDNNFEDLQKGTNGIVEKFQSIAQKEAEILGARNKEVTGTKRSRT 277

Query: 259 DYF------AKYLTNQ--------------KLLDLQLSDTNFRRYVLLQFLILFQYFT-- 296
           +Y       A+ + NQ              +LLDL++ D  FRR V++Q+LILFQY +  
Sbjct: 278 EYMDEDDSAAEEILNQINNEYQFPRLLSSRRLLDLEMEDARFRRSVIVQYLILFQYLSGF 337

Query: 297 ---------STVKSRGEGLE--------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQ 339
                    + + +RG   +        L  +Q +W+ +T E +  L++   P G  ++ 
Sbjct: 338 SQEEKERTAALLAARGTTKQSLVQPNYVLSDEQVQWIDETREILVGLLRSIKPHGNLYTD 397

Query: 340 VVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKK--------DEPDAKKK---KTPEL 388
           ++  IL  E HW  WK  GCP  ++P  +I D  K         + P  K +    T E+
Sbjct: 398 IILTILAHERHWIIWKASGCPSFEKPPMNINDLQKAWRTKRPRLEAPPTKYRYTFGTYEI 457

Query: 389 TKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQM-DPAAAVEEQYKKVNDSNYAWRALR 447
           + L+  ++   +     R   PS+    + A+ ++ D     +E++   N +    +A R
Sbjct: 458 SSLYGKQNMSLSELMINRAKLPSIVEVIDNAVTELQDSLDPPKERFNYANGA--LLQASR 515

Query: 448 LLSRKCPHFF 457
           L+  K PH  
Sbjct: 516 LIHDKYPHLI 525


>gi|358056555|dbj|GAA97524.1| hypothetical protein E5Q_04202 [Mixia osmundae IAM 14324]
          Length = 637

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 144/491 (29%), Positives = 224/491 (45%), Gaps = 91/491 (18%)

Query: 55  IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTF 114
           +  + + + + + L  K  C+ +L I L     D  T+   E LF ++E    I  +   
Sbjct: 86  LSALYDMLDVVLYLSQKGACDDSLAISLFEMITDAQTITVAESLFEFLESRREIIIKGLL 145

Query: 115 FMSCKN-NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENIT 173
               K   LLR CN LL+R S+  +TVFCGRIL+FLA  FP  ERSG+N+ S FN +N+T
Sbjct: 146 PGRGKGLILLRTCNALLKRASKFVHTVFCGRILIFLAIVFPLCERSGVNVASSFNTDNVT 205

Query: 174 EFGGDEEMDVS--------------------SNETEETDTEEVDKVKIDF---NFYKKFW 210
                +E DV                     S++++  DT++ +    +    + Y++FW
Sbjct: 206 IL---DESDVGQPDTTARLADASLTGDALALSDDSKSLDTKQNNAAAGEAKMKDLYERFW 262

Query: 211 SLQDYFRNPVQCYNKVSWK---------------MFTSYAETVL--AAFKSYK------- 246
            LQ  F +P   +  VS                   T++AE +   A+F S K       
Sbjct: 263 RLQQRFCDPRTLFTNVSSAGSKSNLDILKEDIELTLTTFAEAIAKEASFGSKKTRRDVRS 322

Query: 247 ---LDDVQSSLNPSGDY-FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS----- 297
              +D  +S      +Y F KYLT + LL L+L+DT FRR +L+Q L++FQY  S     
Sbjct: 323 RQDMDQEESLAEVEANYFFPKYLTKRNLLPLELADTVFRRQILVQILVVFQYLLSFTVSE 382

Query: 298 --TVKSR---GEGLELKSD----QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
              VK+R   G+ L+L  +    +E W   T   V  +++ T P G  F + V LIL  E
Sbjct: 383 RAKVKARAATGKPLKLPHNLNEAEEAWTVSTRIRVVEMLRSTGPSGVAFWRTVSLILTRE 442

Query: 349 EHWNQWKNEGCPELKRP----------LTSITDEDKKDEPDAKKKKTPELTKLWNSKDN- 397
           E+W  WK   C + ++P           + + +  K   P   K     L++LW SK   
Sbjct: 443 ENWASWKLNNCVDFEQPPLADDAATVATSKLRERTKIARPYMHKVGNATLSRLWASKGAD 502

Query: 398 ----LEACKSAERD-FTP---SLESYFEEAIQ--QMDPAA-AVEEQYKKVNDSNYAWRAL 446
               + A K+ E D FT    +++   EE  +  ++D AA A +    K   + ++WR L
Sbjct: 503 LANVVAADKAGELDRFTAEFGAMKRKAEETAENTELDAAAKAAQIDEAKEAQNAFSWRGL 562

Query: 447 RLLSRKCPHFF 457
           RL++      F
Sbjct: 563 RLVASSNLSLF 573


>gi|378725794|gb|EHY52253.1| hypothetical protein HMPREF1120_00467 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 709

 Score =  171 bits (434), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 160/618 (25%), Positives = 271/618 (43%), Gaps = 127/618 (20%)

Query: 34  SALEQSIRQYLLKLIKTPDIDIKVIENY---ISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +ALE   R+  + ++ T  I           + +   L   D+C   L   L+ +  D  
Sbjct: 69  AALETVFREIFVHIVATVAISEPAFSQVWVLLDIITVLSDNDLCEPGLGFWLIEELLDSQ 128

Query: 91  TLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y++     +    +K ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 129 TIDGCRKVFDYLDSRRERITAKNFKHKSLV------ILRCCNELLRRLSRAEDTVFCGRV 182

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE----------EMDV------------ 183
            +FL + FP  ++S +N+  EF+VEN+T F              E+D+            
Sbjct: 183 FIFLFQSFPLGDKSSVNLRGEFHVENVTTFDPAPRKSEDAIKPMELDMDARTPQATTSGA 242

Query: 184 ----SSNETEETDT--------------EEVDKVKIDFNF---YKKFWSLQDYFRNPVQC 222
               SS++ +E D+              E  D+ +   +    Y KFWSLQ  F +P + 
Sbjct: 243 RTPASSSQIQEQDSTGRGTPVAGKAIKSEVKDQTQPPPDLDTLYPKFWSLQTLFSSPTKL 302

Query: 223 YNKVSWKMFTSYAETVLAAFKSY-------------------KLDDVQSSLN---PSGDY 260
           +   +  M        L  FKS                    K  D++S  N   P+  +
Sbjct: 303 FEPSNMAMLKEGLALTLLCFKSVSHGSAPSTAPSDVRRGLKRKHSDIESDSNASLPASTF 362

Query: 261 FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYF---TSTVKSRGEGLELKS------- 310
             KYLTN+ L DL++ D +FRR++L+Q LI+  +    T   K++ EGL  KS       
Sbjct: 363 NPKYLTNRDLFDLEIHDIDFRRHILVQALIMIDFLLSMTPAAKAKFEGLTNKSVLYPFTL 422

Query: 311 --DQEKWVKDTTETVYSLIKQ-TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR--- 364
             +  KW + T   + S ++Q    +G+ + ++V  +L  +++W +WK E CP + R   
Sbjct: 423 SEEDTKWAQSTRAAIASYLQQGNGNEGKFYYRMVDTVLSRDKNWVRWKAENCPSISRDPV 482

Query: 365 ----------PLTSITDEDKKDEPDAKKKKTPELTKLWNSKDNLEACK-SAERDFTPSLE 413
                      LT +TD  K   P+       +LT L +  + LEA K  + R   P++E
Sbjct: 483 PPHAYLEAKDTLTKLTDAHKAPVPNPPGAG--DLTFL-SKVEPLEALKRPSTRHKLPTVE 539

Query: 414 SYFEEA-IQQMDPAAAVEEQYKKVNDSNYA---WRALRLLSRKCP-----HFFLNATPNV 464
            Y+       +D   AV E  K+  +   A   WRALR   R+        +  N    V
Sbjct: 540 EYYRGIETDDLDMDFAVTEDEKREIEERKAGKLWRALRASDRRFALCERVQYGGNLKALV 599

Query: 465 EKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSE-------EIQEEEKEE 517
           E+     +++ ++   +K + +    G G +   AE E D K +       ++++EE+E+
Sbjct: 600 EEGPN--QDVEEKSGADKEARENENGGEGAEAVKAEAETDEKPDIEIEDQPQVKKEEQEQ 657

Query: 518 DWEAKADPEGDADEVMSV 535
                A  +G+  E+  V
Sbjct: 658 PSTTAATAQGEDAEMTDV 675


>gi|307102917|gb|EFN51183.1| hypothetical protein CHLNCDRAFT_55304 [Chlorella variabilis]
          Length = 625

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 132/421 (31%), Positives = 195/421 (46%), Gaps = 55/421 (13%)

Query: 78  LPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS-CKNNLLRMCNDLLRRLSRS 136
           LP  LL    + +++  CE +F +++     +K   F  + CKN LLR CN LL+RLS+S
Sbjct: 118 LPAQLLEQITEGASVQDCEGVFAFLDSRTATFKSPAFQEARCKNTLLRTCNMLLKRLSKS 177

Query: 137 QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEV 196
            N + CGRILLFLAK  P +ERSG+N+   FN +N T             + +E  T+  
Sbjct: 178 ANAMLCGRILLFLAKLLPLTERSGVNLGGAFNTDNSTPV----------EDVQEGATDSE 227

Query: 197 DKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-----DVQ 251
            K  +D  FY+ FW LQ YF NP        W   +     VL  F+  K+         
Sbjct: 228 GK-PVDAAFYRTFWGLQSYFSNPPAALQVGKWGEVSRDIRRVLEKFRLEKVTVGESAAAG 286

Query: 252 SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSD 311
                      KYL++ +L+ LQL D  FRR+ LLQ LIL Q         G  L+L++ 
Sbjct: 287 PGAGDGQGSSVKYLSSSRLMGLQLRDATFRRHFLLQCLILMQASLLLHMQLGS-LDLRAK 345

Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP--ELKR----P 365
                      VY+ ++ TP  G  F+  ++ +++ E+ W  WK   CP   L+R    P
Sbjct: 346 -----------VYAELEATPDRGPQFAAAIRHLVRWEDSWAAWKQGSCPPAPLERAAAQP 394

Query: 366 LTSITDEDKKDEPDAKKKK------------TPELTKLWN-SKDNLEACKSAERDFTPSL 412
                D D    P  K++K            T EL +LWN ++DNL A  + +R    ++
Sbjct: 395 PAGSEDADLT-APAPKRRKLGSDAVFGIRVGTDELDRLWNLAEDNLSALSADDRGGFKTV 453

Query: 413 ESYFEEAIQQMDPAAAVEEQ---YKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSE 469
               E  +++M  AAA E+     K      Y+W+ALR ++R     F  A   V+K  +
Sbjct: 454 RQLMEPVVEEMREAAAGEDDGLLMKASRSKLYSWKALRAVARTNLQAFAAA---VQKGGD 510

Query: 470 F 470
            
Sbjct: 511 L 511


>gi|169784203|ref|XP_001826563.1| THO complex subunit Tho1 [Aspergillus oryzae RIB40]
 gi|83775308|dbj|BAE65430.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 669

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 141/526 (26%), Positives = 225/526 (42%), Gaps = 121/526 (23%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+    L+ T  ID      I N + +       + C   L   L+ +  D  
Sbjct: 66  AAVEIAFRERFYSLLATTSIDEPSFIQIWNLLDIISIFSDNEQCEPGLIFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVS-------------------SN 186
            +FL + FP  ++S +N+  E++ EN+T F    E+D S                     
Sbjct: 180 FIFLFQSFPLGDKSAVNLRGEYHTENVTTFDEIPELDTSVLDEADVVMSDEQGPPTTTEG 239

Query: 187 ETEETDT--------------------------EEVDKVKIDFN-FYKKFWSLQDYFRNP 219
           + E  DT                          E VD+   D N  Y  FW LQ YF  P
Sbjct: 240 QQENNDTKAPSIADPQAPVQDQPAAPKVIISQEEGVDEKANDLNKLYPTFWGLQAYFSAP 299

Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DVQSS 253
            + ++   +  F +  E  L+AFKS   D                          ++ SS
Sbjct: 300 TRIFDPQHFATFRTGLEATLSAFKSVNTDLESSGTSKTSEELRKSNKRKRTSDGQEIASS 359

Query: 254 LNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE-------- 304
            NP      KYLT++ L DL+++DT FRR+VL+Q LIL  +  S T K++ +        
Sbjct: 360 FNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFMLSLTPKAKAKLAELTNKS 413

Query: 305 ---GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
              G  L  D  KW     + +   + Q    G+ + ++V  +L  +++W +WK EGCP 
Sbjct: 414 VLYGFVLNEDDAKWAVKMRKAIEEYL-QEGVGGKFYYRMVDTVLSRDKNWVRWKAEGCPL 472

Query: 362 LKRPLTSITDEDKKDEPDA-----KKKKTPELTKL-------WNSKDNLEACKSAERDFT 409
           ++RP  S++D     E        K+ ++  +  L         S   LE  K  +R   
Sbjct: 473 IERPPVSVSDYLGAREHATKVYANKRLRSSPMGSLNLNFLSEGESLAGLERLKEPQRFAV 532

Query: 410 PSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
           PS +S+    +     MD A   E++       ++  WR LRL ++
Sbjct: 533 PSSDSFMMGIMDDELDMDTAQTKEDKENATRAKASKTWRVLRLSAK 578


>gi|391868537|gb|EIT77751.1| nuclear matrix protein [Aspergillus oryzae 3.042]
          Length = 670

 Score =  168 bits (426), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 141/526 (26%), Positives = 225/526 (42%), Gaps = 121/526 (23%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+    L+ T  ID      I N + +       + C   L   L+ +  D  
Sbjct: 66  AAVEIAFRERFYSLLATTSIDEPSFIQIWNLLDIISIFSDNEQCEPGLIFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVS-------------------SN 186
            +FL + FP  ++S +N+  E++ EN+T F    E+D S                     
Sbjct: 180 FIFLFQSFPLGDKSAVNLRGEYHTENVTTFDEIPELDTSVLDEADVVMSDEQGPPTTTEG 239

Query: 187 ETEETDT--------------------------EEVDKVKIDFN-FYKKFWSLQDYFRNP 219
           + E  DT                          E VD+   D N  Y  FW LQ YF  P
Sbjct: 240 QQENNDTKAPSIADPQAPVQDQPAAPKVIISQEEGVDEKANDLNKLYPTFWGLQAYFSAP 299

Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DVQSS 253
            + ++   +  F +  E  L+AFKS   D                          ++ SS
Sbjct: 300 TRIFDPQHFATFRTGLEATLSAFKSVNTDLESSGTSKTSEELRKSNKRKRTSDGQEIASS 359

Query: 254 LNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE-------- 304
            NP      KYLT++ L DL+++DT FRR+VL+Q LIL  +  S T K++ +        
Sbjct: 360 FNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFMLSLTPKAKAKLAELTNKS 413

Query: 305 ---GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
              G  L  D  KW     + +   + Q    G+ + ++V  +L  +++W +WK EGCP 
Sbjct: 414 VLYGFVLNEDDAKWAVKMRKAIEEYL-QEGVGGKFYYRMVDTVLSRDKNWVRWKAEGCPL 472

Query: 362 LKRPLTSITDEDKKDEPDA-----KKKKTPELTKL-------WNSKDNLEACKSAERDFT 409
           ++RP  S++D     E        K+ ++  +  L         S   LE  K  +R   
Sbjct: 473 IERPPVSVSDYLGAREHATKVYANKRLRSSPMGSLNLNFLSEGESLAGLERLKEPQRFAV 532

Query: 410 PSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
           PS +S+    +     MD A   E++       ++  WR LRL ++
Sbjct: 533 PSSDSFMMGIMDDELDMDTAQTKEDKENATRAKASKTWRVLRLSAK 578


>gi|425777812|gb|EKV15968.1| THO complex subunit Tho1, putative [Penicillium digitatum PHI26]
 gi|425782580|gb|EKV20479.1| THO complex subunit Tho1, putative [Penicillium digitatum Pd1]
          Length = 691

 Score =  168 bits (426), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 145/541 (26%), Positives = 243/541 (44%), Gaps = 111/541 (20%)

Query: 6   QLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKV---IENYI 62
           + +F +S   V S+     K      K +A+E + R+   +++ +  ID      I N +
Sbjct: 38  ETHFGESIWQVPSEDEKATKHLSPQTKLAAVEIAFREKFYRVLASTFIDDPAFIQIWNLL 97

Query: 63  SLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNV-----NIWKQQTFFMS 117
            +       + C   L   L+ +  D  T+D C ++F Y+E          +KQ++    
Sbjct: 98  DIVSIFSDNEQCEPGLIFWLIEELLDSQTIDGCRKVFDYLESRRERNTKKHFKQKSLV-- 155

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
               +LR CN+LLRRLSR+++TVFCGR+ ++L + FP  ++S +N+  EF+ EN+T F  
Sbjct: 156 ----ILRSCNELLRRLSRAEDTVFCGRVFIYLFQSFPLGDKSSVNLRGEFHSENVTTF-- 209

Query: 178 DEEMDVSSNETEETDTEEVDKV--------------------------------KIDFN- 204
            +E+   S E +E DT   D                                  ++D N 
Sbjct: 210 -DEIAKDSAEKKEQDTTMTDGAEPTTPGVTAAGQAENSKVPKVVVSANGEKKSDEVDLNS 268

Query: 205 FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD---------------- 248
            Y  FWSLQ YF  P + ++   +  F +  E  L AFKS   +                
Sbjct: 269 LYPLFWSLQAYFSAPSKMFDAERFTSFKTGMEATLDAFKSINTEMENESSARASEDARKS 328

Query: 249 ----------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQ---FLILFQYF 295
                     +V +S NP      KYLT++ L DL++SDT FRR+VL+Q    L      
Sbjct: 329 NKRKRIADGTEVATSFNP------KYLTSRDLFDLEISDTAFRRHVLVQALILLDFLLSL 382

Query: 296 TSTVKSRGE---------GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILK 346
           TS  KSR           G  L  +  +W     +++   ++Q  P G+ + ++V  +L 
Sbjct: 383 TSKSKSRLADTTNKSVLYGFTLNEEDTRWATSMRKSIEGYLQQG-PGGKFYYRMVDTVLS 441

Query: 347 GEEHWNQWKNEGCPELKRPLTS----ITDEDKKDEPDAKKKKTP------ELTKLWNSK- 395
            +++W +WK EGCP ++RP  S     T  +   +  A K+  P      +L  L +++ 
Sbjct: 442 RDKNWVRWKAEGCPLIERPAISAAEYTTSREHATKAYANKRLRPSPMGSLDLRFLADTEV 501

Query: 396 -DNLEACKSAERDFTPSLESYFEEAI-QQMDPAAAVEEQYKKVNDSNYA---WRALRLLS 450
             N+E  K ++R   PS +S+ +  +  +MD   A+ ++ K + +   A   WR LRL +
Sbjct: 502 LANIERLKESDRYSVPSADSFLKGIMDDEMDIDMAMTDEDKAIAEGAKASKSWRILRLAA 561

Query: 451 R 451
           +
Sbjct: 562 K 562


>gi|320590242|gb|EFX02685.1| tho complex subunit [Grosmannia clavigera kw1407]
          Length = 772

 Score =  168 bits (425), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 135/474 (28%), Positives = 214/474 (45%), Gaps = 64/474 (13%)

Query: 32  KKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFD 88
           K + +E + R     LI    ID      + N   +   L     C+  L   L+ +  D
Sbjct: 68  KNAIVETATRDVFGGLIAARSIDSPGFVRVWNLFDILSILADDGHCDPALLFWLIEELLD 127

Query: 89  MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
             T+  C ++F Y+E         +F  S K  +LR CNDLLRRLSR+++T FCGR+ +F
Sbjct: 128 SQTIAGCRKVFDYLESRRERITTNSF-NSKKLVILRSCNDLLRRLSRAEDTPFCGRVFIF 186

Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKV--------- 199
           L + FP  ++S +N+  E+++EN+T +  D     ++ ETE  D  +VD           
Sbjct: 187 LFQVFPLGDKSSVNLRGEYHIENVTTYDVD-----AAKETEGGDKMDVDGAMDGSKPATD 241

Query: 200 --KIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK----------SYK 246
              IDF   Y  FWSLQ  F  P   +    +  F S  E  LAAF+          +  
Sbjct: 242 PKNIDFEVLYPVFWSLQTSFSQPKTLFEPAQFAQFKSGLEASLAAFQKINNERRPHSTRH 301

Query: 247 LDDVQSSLNPSGD--------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTST 298
            DDV+       D        Y  KYLT++ L  L+ SD  FRR+ LLQ LI+ ++  S 
Sbjct: 302 TDDVRRGTKRKRDKEDELADTYNPKYLTSRDLFQLETSDLTFRRHFLLQALIIVEFLLSL 361

Query: 299 VKSRGEGLELKSDQEK---------------WVKDTTETVYSLIKQTPPDGEHFSQVVKL 343
             S  + L   S   K               WV      +   I+ +  DG +F ++V  
Sbjct: 362 SPSAKKKLATASTPNKSVTYSDHVLSEEDTVWVTKMKHHITEHIRNSDDDGAYFFRMVDT 421

Query: 344 ILKGEEHWNQWKNEGCPELKRPLTS----ITDEDKKDEPDAKKKKTPELTKLWNSKDNLE 399
           IL  +++W +WK E C  ++RP  S    ++ +D   +  A+ ++   ++    S D L 
Sbjct: 422 ILARDKNWVRWKVESCQSIERPGVSAEEFVSAKDGAQKAFARHRRRNNMSMGSFSLDFLN 481

Query: 400 ACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKC 453
             +  E D    L+     A+ ++D +  + E+ K    ++  WRALR ++R+C
Sbjct: 482 DDED-EADALEKLKQPKRFALPEIDQSKMLAEERK----ASKTWRALR-IARQC 529


>gi|408397178|gb|EKJ76328.1| hypothetical protein FPSE_03583 [Fusarium pseudograminearum CS3096]
          Length = 825

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 120/403 (29%), Positives = 192/403 (47%), Gaps = 77/403 (19%)

Query: 25  KPTCDYDKKSAL-EQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLP 79
           KP  D  ++ A+ E + R     LI T  ID    +KV  N + +   L     C+  L 
Sbjct: 69  KPANDKGQQFAIIETAARNIFSHLIATTTIDSPDYVKVW-NLLDILSILSDDGQCDPALL 127

Query: 80  IILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLS 134
             L  +  D  T+  C ++F ++E     +  N +KQ+         +LR CN+LLRRLS
Sbjct: 128 FWLAEELLDSQTIAGCRKIFDFLESRRERITANHFKQKQLI------ILRTCNELLRRLS 181

Query: 135 RSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------------GGDE- 179
           R+++T FCGR+ +F+ + FP  ++S +N+  E++VENIT +              G DE 
Sbjct: 182 RAEDTAFCGRVFIFMFQSFPLGDKSSVNLRGEYHVENITTYEQGTADDDSRMALDGPDEQ 241

Query: 180 --EMDVSSNETEETDTEEVDKVK--IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYA 235
             E+  S    + TD ++ DK K       Y  FWSLQ+YF  P + +   S   F    
Sbjct: 242 PKELSESKTTPKSTDAKKEDKEKPLPTDELYPLFWSLQEYFSQPKKLFETTSLSSFKESL 301

Query: 236 ETVLAAFKSYKLDDVQS---------------SLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
              +  F++   D  +S               + NP      KYLT++ L DL++SD +F
Sbjct: 302 AATMNVFQTVHNDSKRSLKRKRETGEEDESFNAFNP------KYLTSKDLFDLEISDLSF 355

Query: 281 RRYVLLQFLILFQYFTSTVK-SRGEGLE------------------LKSDQEKWVKDTTE 321
           RR++L+Q LI+  +  +  K SR +  E                  L  +  KW  DT +
Sbjct: 356 RRHILVQALIIMDFLLALSKLSRDKLAETLPATANINKAVMYGDQILSEEDAKWASDTKK 415

Query: 322 TVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
           T+   ++Q  PDG +F ++V+ +L  +++W +WK EGC  +KR
Sbjct: 416 TIADYLRQG-PDGPYFYRMVETVLARDKNWVRWKIEGCQPIKR 457


>gi|330794778|ref|XP_003285454.1| hypothetical protein DICPUDRAFT_29296 [Dictyostelium purpureum]
 gi|325084629|gb|EGC38053.1| hypothetical protein DICPUDRAFT_29296 [Dictyostelium purpureum]
          Length = 666

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 129/407 (31%), Positives = 193/407 (47%), Gaps = 45/407 (11%)

Query: 26  PTCDYDK---KSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIIL 82
           P+ D +K   KS+++ +IR +   LIK  +   + I+  I L          +S LP+ L
Sbjct: 70  PSFDINKIEIKSSIDINIRLFFNILIKEKNTTYEDIDVTIQLAYIFIDIFGLDSILPLQL 129

Query: 83  LSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK--NNLLRMCNDLLRRLSRSQNTV 140
             D ++  T+++C +LF  +E    ++ Q    +  +  N LL++C +LL+R SRS N  
Sbjct: 130 SEDLYETKTINECLKLFGLLESRAELFSQDPEIIKGRKRNLLLKICVELLKRFSRSSNPD 189

Query: 141 FCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVK 200
            CGRILLFLA  FP S+ SGLN   E NV        +E +D  ++  +  +    +   
Sbjct: 190 SCGRILLFLAYVFPLSDPSGLNGKGEHNVH------PEESLDFQNDIMKNIEDGSSEDQT 243

Query: 201 IDFNFYKKFWSLQDYFRNPVQCY-----------NKVSWKMFTSYAETVLAAFKSY-KLD 248
           ID NFY +FW LQ  F+NP               NK  W  F    E V+ +F ++  L+
Sbjct: 244 IDKNFYTQFWGLQTIFQNPQTVLFNNITATSITLNKAKWDSFIQALELVIGSFSTHINLE 303

Query: 249 DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTST---------- 298
           D+Q S      YF KYLT+  L+ LQL D+ FRR +L Q LI FQ    T          
Sbjct: 304 DLQQSNPTKQHYFTKYLTSSNLMKLQLKDSTFRRNILTQILITFQNLDLTGQKYPTLFND 363

Query: 299 --------VKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
                    K+        +  +  ++D T   Y ++  T P GE+FS  +  ILK E++
Sbjct: 364 LQVNIIIAFKTIYHYFLFINFIKNIIQDLTIKCYKILSNTQPGGEYFSNCLSSILKREKN 423

Query: 351 WNQWKNE-GCPELKRPLTSITDEDKKDEPDAKKKKT---PELTKLWN 393
           W  WK + GC   +RP  S   + KK       +      EL++LWN
Sbjct: 424 WIIWKRDNGCKNFERPPCSPIVKKKKPVSKKSNRMVMGNVELSRLWN 470


>gi|429863476|gb|ELA37927.1| nuclear matrix protein [Colletotrichum gloeosporioides Nara gc5]
          Length = 869

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 138/512 (26%), Positives = 235/512 (45%), Gaps = 101/512 (19%)

Query: 36  LEQSIRQYLLKLIKTPDI---DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTL 92
           +E ++R     LI T  I   D   + N   L + L  ++ C+  L   L+ +  D  T+
Sbjct: 84  IETAVRDMFSNLIATTPIESPDFVKVWNLFDLLIILSDEEKCDPALLFWLVEELLDSQTI 143

Query: 93  DKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
             C ++F ++E     + V  +KQ+         +LR CN+LLRRLSR+++T FCGR+ +
Sbjct: 144 AGCRKIFDFLESRRERITVKHFKQKQLV------ILRSCNELLRRLSRAEDTAFCGRVFI 197

Query: 148 FLAKFFPFSERSGLNIISEFNVENIT---EFGGDE-----EMDV-SSNETEETDTEEVDK 198
           F+ + FP  ++S +N+  E++VEN+T   + G  +     +MDV    E  + D +   K
Sbjct: 198 FMFQSFPLGDKSSVNLRGEYHVENVTVYDDLGAKDDDAAGQMDVDPPKEVADVDAKSQTK 257

Query: 199 VKIDFN---------------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
             + F+                Y  FWSLQ+ F  P + +   +   F    E  + +F+
Sbjct: 258 A-VSFDAKKPVQADQPLDPDVLYPLFWSLQETFSQPKKLFEPDNLASFKRGLEATMTSFR 316

Query: 244 SYK----------LDDVQSSLN------PSGD----YFAKYLTNQKLLDLQLSDTNFRRY 283
           + +          +D+ Q SL         GD    Y  KYLT++ L +L++SD +FRR+
Sbjct: 317 AIQAEQGQRPPKAVDEAQRSLKRKREDAEEGDLANSYNPKYLTSRDLFELEMSDISFRRH 376

Query: 284 VLLQFLILFQY---FTSTVKSRGEG----------LELKSDQE-KWVKDTTETVYSLIKQ 329
           +L+Q LI+  +    +S VK +  G           ++ SD++ KW  +T   +     +
Sbjct: 377 ILVQSLIVMDFLLSLSSKVKEKLAGNAQNKSVIYVGQVLSDEDTKWATETKRVIVEKYLK 436

Query: 330 TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS---ITDEDKKDEPDAKKKKTP 386
              DG +FS+VV+ +L  +++W +WK+E CP ++ P  S     +  K  +  A KK+  
Sbjct: 437 HGTDGPYFSRVVETVLARDKNWIRWKSENCPSIELPPVSPETFLEAKKGSQRTATKKRLR 496

Query: 387 ELT---------KLWNSKDNLEACKSAERDFTPSLESY----------FEEAIQQMDPAA 427
            +          +  N +  ++  K+ ER   P LE +           E        AA
Sbjct: 497 SIPMGSLPLGFLEEGNEETAMQKLKAEERYSLPELEGFKRKIADDDFEIEMPTNNQTKAA 556

Query: 428 AVEEQYKKVNDSNYAWRALRLLSRKCPHFFLN 459
           AVE +  K      +WRALR+ S+     F N
Sbjct: 557 AVESKASK------SWRALRIASKTRLTLFDN 582


>gi|299756494|ref|XP_001829374.2| UDP-glucose epimerase [Coprinopsis cinerea okayama7#130]
 gi|298411702|gb|EAU92334.2| UDP-glucose epimerase [Coprinopsis cinerea okayama7#130]
          Length = 1127

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 119/390 (30%), Positives = 183/390 (46%), Gaps = 60/390 (15%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           + + + + +    +D C  T P ++L D  +  T+  C Q+F ++E   +   +      
Sbjct: 79  LRDKLDIVLIFTEQDACEQTFPFLVLQDLLETQTIASCSQIFSWIEDRASRLTEGMVPQK 138

Query: 118 CKN-NLLRMCNDLLRRLSR-SQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV--ENIT 173
            K   LLR  N+LLRRLS+    T+FCGRIL FL+  FP  ERSG+N+  E+    E + 
Sbjct: 139 GKALILLRTLNELLRRLSKMGSTTIFCGRILTFLSGVFPLGERSGVNLRGEYGPTWEGVK 198

Query: 174 -----EFGGDEEMDVSSNETEETDTEEVDKVKIDF---------------NFYKKFWSLQ 213
                +    E+ DV     +E+ T++ DK+++D                +FY  FWSLQ
Sbjct: 199 YSDKEKVVSSEDEDVEMAPADESQTQD-DKMQVDTKESQPAPQTAAEKREDFYNTFWSLQ 257

Query: 214 DYFRNPVQCYNKVSWKMFTSYAETVLAAFK--SYKLDDVQSSLNPSGD------------ 259
            YF  P +  NK + + F +    V+   K  + K   +  S N +G             
Sbjct: 258 LYFSKPTEFANKGALEDFRASINKVIPVIKEATAKERAMMGSRNLTGPGAGLKRKREPEA 317

Query: 260 --------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEG------ 305
                   +FAK+LT+ +LLDL+++DT+FRR  L Q LIL  +  +  K   EG      
Sbjct: 318 EESNVTEYFFAKFLTSPELLDLEIADTHFRRQFLFQMLILLHHLLTFTKEAKEGWATLRN 377

Query: 306 ------LELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
                   L+ D  +WV++    V   ++QT P+G  FS  V  IL  E +W +WKNE C
Sbjct: 378 RSLQMDFTLEPDDARWVQEYINKVTEELRQTTPNGRQFSDTVTTILDRERNWVKWKNEIC 437

Query: 360 -PELKRPLTSITDEDKKDEPDAKKKKTPEL 388
            P  K P +   D  K D  +A ++   EL
Sbjct: 438 VPFDKEPWSQEVDGKKVDMFEATRQFRDEL 467


>gi|310798402|gb|EFQ33295.1| guanylate kinase [Glomerella graminicola M1.001]
          Length = 876

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 113/402 (28%), Positives = 192/402 (47%), Gaps = 75/402 (18%)

Query: 34  SALEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDM 89
           +A+E ++R     LI T  I+    +KV  N   L   L   + C+  L   L+ +  D 
Sbjct: 79  AAIETAVRNMFSNLIATTPIESPNFVKVW-NLFDLLTILSDDEKCDPALLFWLVEELLDS 137

Query: 90  STLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGR 144
            T+  C ++F ++E     + V  +KQ+         +LR CN+LLRRLSR+++T FCGR
Sbjct: 138 QTIAGCRKIFDFLESRRERITVKHFKQKQLV------ILRSCNELLRRLSRAEDTAFCGR 191

Query: 145 ILLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDEEMDV-SSNETEETDTEE 195
           + +F+ + FP  ++S +N+  E++VEN+T F           E+MDV S NE +E D + 
Sbjct: 192 VFIFMFQSFPLGDKSSVNLRGEYHVENVTVFDDVGAKTDATPEQMDVDSQNEVKEGDAKS 251

Query: 196 VDKVKIDFN---------------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA 240
             K  + F+                Y +FWSLQ+ F  P + ++      F    E  L 
Sbjct: 252 QTKA-VSFDAKKPAQADQPLDPDALYPQFWSLQETFSQPKKLFDPAHLTSFKRGLEATLT 310

Query: 241 AFKSYK----------LDDVQSSL----------NPSGDYFAKYLTNQKLLDLQLSDTNF 280
           +F+  +          +D+ Q +L          + +  Y  KYLT++ L +L++SD +F
Sbjct: 311 SFRVIQNEQGQRPPKTVDEAQRNLKRKREDVEESDLANSYNPKYLTSRDLFELEMSDISF 370

Query: 281 RRYVLLQFLILFQYFTSTVKSRGE--------------GLELKSDQEKWVKDTTETVYSL 326
           RR++L+Q LI+  +  S      E              G  L  +  KW  +   T+   
Sbjct: 371 RRHILVQALIIMDFLLSLSSKTKEKLAGNAQNKSVIYLGQVLSEEDTKWASEMKRTIVDK 430

Query: 327 IKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS 368
             +   DG +FS++V+ +L  +++W +WK+E CP ++ P  S
Sbjct: 431 YLKHGVDGPYFSRMVETVLARDKNWIRWKSENCPSIELPAVS 472


>gi|46122093|ref|XP_385600.1| hypothetical protein FG05424.1 [Gibberella zeae PH-1]
          Length = 826

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 121/404 (29%), Positives = 192/404 (47%), Gaps = 79/404 (19%)

Query: 25  KPTCDYDKKSAL-EQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLP 79
           KP  D  ++ A+ E + R     LI T  ID    +KV  N + +   L     C+  L 
Sbjct: 69  KPANDKGQQFAIIETAARNIFSHLIATTTIDSPDYVKVW-NLLDILSILSDDGQCDPALL 127

Query: 80  IILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLS 134
             L  +  D  T+  C ++F ++E     +  N +KQ+         +LR CN+LLRRLS
Sbjct: 128 FWLAEELLDSQTIAGCRKIFDFLESRRERITANHFKQKQLI------ILRTCNELLRRLS 181

Query: 135 RSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------------GGDE- 179
           R+++T FCGR+ +F+ + FP  ++S +N+  E++VENIT +              G DE 
Sbjct: 182 RAEDTAFCGRVFIFMFQSFPLGDKSSVNLRGEYHVENITTYEQGTADDDFRMALDGPDEQ 241

Query: 180 --EMDVSSNETEETDTEEVDKVK---IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSY 234
             E+  S    + TD ++ DK K    D   Y  FWSLQ+YF  P + +   S   F   
Sbjct: 242 PKELSESKTTPKSTDAKKEDKEKPLPTDV-LYPLFWSLQEYFSQPKKLFETTSLSSFKES 300

Query: 235 AETVLAAFKSYKLDDVQS---------------SLNPSGDYFAKYLTNQKLLDLQLSDTN 279
               +  F++   D  +S               + NP      KYLT++ L DL++SD +
Sbjct: 301 LAATMNVFQTVHNDSKRSLKRKRETGEEDESFNTFNP------KYLTSKDLFDLEISDLS 354

Query: 280 FRRYVLLQFLILFQYFTSTVK-SRGEGLE------------------LKSDQEKWVKDTT 320
           FRR++L+Q LI+  +  +  K SR +  E                  L  +  KW  DT 
Sbjct: 355 FRRHILVQALIIMDFLLALSKLSRDKLTETLPATAIINKAVMYGDQVLIEEDAKWASDTK 414

Query: 321 ETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
            T+   ++Q  PDG +F ++V+ +L  +++W +WK EGC  +KR
Sbjct: 415 RTIADYLRQG-PDGPYFYRMVETVLARDKNWVRWKIEGCQPIKR 457


>gi|146323653|ref|XP_746618.2| THO complex subunit Tho1 [Aspergillus fumigatus Af293]
 gi|129555326|gb|EAL84580.2| THO complex subunit Tho1, putative [Aspergillus fumigatus Af293]
 gi|159122146|gb|EDP47268.1| THO complex subunit Tho1, putative [Aspergillus fumigatus A1163]
          Length = 673

 Score =  165 bits (417), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 150/586 (25%), Positives = 245/586 (41%), Gaps = 144/586 (24%)

Query: 34  SALEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDM 89
           +A+E + R+    L+    ID    I++  N + +       + C   L   L+ +  D 
Sbjct: 66  AAVELAFREKFYNLLANTTIDEPSFIRMW-NLLDIISIFSDNEQCEPGLIFWLIEELLDS 124

Query: 90  STLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGR 144
            T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR
Sbjct: 125 QTIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGR 178

Query: 145 ILLFLAKFFPFSERSGLNIISEFNVENITEF------GGDE--EMDVSSNETEET----- 191
           + +FL + FP  ++S +N+  E++ EN+T F      G  E  +MDV   + +ET     
Sbjct: 179 VFIFLFQSFPLGDKSAVNLRGEYHTENVTTFDEITKSGAKESDDMDVEMPDAKETLTATE 238

Query: 192 ---------------------------------DTEEVDKVK-IDFN-FYKKFWSLQDYF 216
                                             TE+ D  K +D N  Y  FW LQ YF
Sbjct: 239 ALGHDHDGQPPPSASASQTLVQEQAKIPKLIVSKTEDKDSEKAVDLNALYPVFWGLQAYF 298

Query: 217 RNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DV 250
             P + ++   +  F +  E  L AFK+ + D                          ++
Sbjct: 299 SAPTKVFDSHHFSSFMTGLENTLIAFKNIRTDLENQSASKTAEEIRKSTKRKRSVDGQEI 358

Query: 251 QSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE----- 304
            SS NP      KYLT++ L DL+++DT FRR+VL+Q LIL  +  S T K++ +     
Sbjct: 359 ASSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFVLSLTPKAKAKLADLT 412

Query: 305 ------GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEG 358
                 G  L  +  KW     + +   + Q    G+ + ++V  +L  +++W +WK EG
Sbjct: 413 NKSVLYGFVLNEEDAKWATKMRKAIEEYL-QEGAGGKFYYRMVDTVLSRDKNWVRWKAEG 471

Query: 359 CPELKRPLTSITDEDKKDEPDAK----KKKTPELTKLWN--------SKDNLEACKSAER 406
           CP ++RP  S+ +  +  E   K    K+  P      N        S  ++E  K  ER
Sbjct: 472 CPLIERPAVSVAEYLRARENATKTYANKRLRPSPMGALNLNFLAEGESLSSIEKLKEPER 531

Query: 407 DFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR----------- 451
              PS +S+    +     +D A   E++   +   ++  WR LRL ++           
Sbjct: 532 YSVPSADSFMMGIMDDELDIDTAHTKEDKDTAIRAKASKTWRILRLSAKGKLAAFDKIDD 591

Query: 452 --------KCPHFFLNATPNVEKNSEFIENMVKRCVKEKPSSQISG 489
                   + P    + TP  E  S+  E   K      P +  SG
Sbjct: 592 GKNLKALFEAPQPLESTTPAHEPASQASEPAAKASGDAGPEANASG 637


>gi|226289601|gb|EEH45085.1| nuclear matrix protein [Paracoccidioides brasiliensis Pb18]
          Length = 649

 Score =  165 bits (417), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 151/595 (25%), Positives = 249/595 (41%), Gaps = 127/595 (21%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIIL 82
           P   Y   + +E + RQ    L+ T  ID      I N + +       + C   L   L
Sbjct: 55  PQAQY---AVVETAFRQKFYALLATTSIDDPSFIQIWNLLDIVSIFSDNEQCEPGLFFWL 111

Query: 83  LSDTFDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQ 137
           + +  D  T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR++
Sbjct: 112 IEELLDSQTIDGCRKVFDYLESRRERNTTKHFKQKSLI------ILRSCNELLRRLSRAE 165

Query: 138 NTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF---------------------- 175
           +TVFCGR+ +FL + FP  +RS +N+  EF+ EN+T F                      
Sbjct: 166 DTVFCGRVFIFLFQSFPLGDRSSVNLRGEFHTENVTTFDKLPKAQEGSGISVDVDSLEGK 225

Query: 176 -GGDEEMDVSSNETEETDTE-----------------------EVDKVKIDFNFYKKFWS 211
              D ++++S    ++T+ E                       E   V      Y  FWS
Sbjct: 226 SSADMKIELSEAAAQQTNGEATPSTADEHQQGVKSAIQKTTEPEAASVLTTDELYPIFWS 285

Query: 212 LQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------DVQS 252
           LQ  F +P   ++  ++  F +  E  L+ F+    D                    +  
Sbjct: 286 LQTNFSSPTTLFDPANFAAFKTGLEASLSTFQKVNTDMQVRITKGSEETRRGPKRRRIGD 345

Query: 253 SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLE---- 307
           S   +  +  KYLT++ L +L+++D  FRR++L+Q LIL  +  S T K++ +  +    
Sbjct: 346 STEMTNSFNPKYLTSRDLFELEVNDVAFRRHILVQALILLDFVLSLTPKAKAKLADSTNK 405

Query: 308 -------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
                  L  D  KW      ++ S ++Q   +G+ + ++V  +L  +++W +WK EGCP
Sbjct: 406 AVLYNYVLSDDDAKWASQMKASIASYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEGCP 464

Query: 361 ELKRPLTSITD------EDKKDEPDAKKKKTP----ELTKLWNSK--DNLEACKSAERDF 408
            ++R   SI D         K     + + TP    +L  L   K  D++   K  +R  
Sbjct: 465 PIERTPVSIQDYLDTQASAIKVTTSKRLRLTPFGSLDLNFLMEDKNLDSVARLKHPDRFT 524

Query: 409 TPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSRKCPHFF------- 457
           TP ++SY          +D A + EE+ + +   +N  WR LRL SR     F       
Sbjct: 525 TPDIDSYLRGVADDEFNIDMAQSREEKDEAIKAKANKIWRTLRLSSRSKLALFDKIEDGN 584

Query: 458 ----LNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSE 508
               L  TP   +++      V    + + + Q   NGNG D   A  EV+   E
Sbjct: 585 NIKVLFETPPAPEDA-----AVPATSRPQLAVQNGVNGNGQDGPSATGEVNESKE 634


>gi|409083598|gb|EKM83955.1| hypothetical protein AGABI1DRAFT_117418, partial [Agaricus bisporus
           var. burnettii JB137-S8]
          Length = 1044

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 119/367 (32%), Positives = 178/367 (48%), Gaps = 60/367 (16%)

Query: 62  ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN- 120
           + L +    +D C  T P I+L D  +  T++ C  +F ++E   +   +       K  
Sbjct: 83  LDLVLTFTEQDACEQTFPFIVLQDLLETQTINSCSHIFSWIEARSSRLTEGMVPQKGKAL 142

Query: 121 NLLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNIISEF------------ 167
            LLR  NDLLRRLS+S +T VFCGRIL FL+  FP  ERSG+N+  E+            
Sbjct: 143 ILLRTLNDLLRRLSKSGSTTVFCGRILTFLSGVFPLGERSGVNLRGEYGPTWEDVWYPKH 202

Query: 168 ----NVENITEFGGDEEMDVSSNETEETDTEEVDKVK----IDFNFYKKFWSLQDYFRNP 219
               N E +T+F G+      S+E  E +  +VD  K       +FY  FW LQ  F  P
Sbjct: 203 DEDTNTE-MTDFRGESS---RSDEVPEDNAMQVDSNKETKATPTDFYNTFWHLQYPFSKP 258

Query: 220 VQCYNKVSWKMFTSYAETVLAAFK--SYKLDDVQSSLNPSGD------------------ 259
               NK ++  F    + VL   K  + K   +  S   +G                   
Sbjct: 259 SIFANKEAFPQFQEAIDKVLPVIKEATAKERAMMGSRVAAGVVLTKRKREPEAEESNVNE 318

Query: 260 -YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLE--- 307
            +FAK+LT+  LLDL+++DT+FRR  L Q +IL Q+   FT   K+     R   L+   
Sbjct: 319 YFFAKFLTSPDLLDLEIADTHFRRQFLFQLMILLQHLLTFTKGAKAAWATQRNRSLQMDF 378

Query: 308 -LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC-PELKRP 365
            L+++  +WV+DT       +KQT P+G  F++  + IL+ E++W +WKNE C P  K P
Sbjct: 379 TLEAEDTQWVQDTLSKSMEELKQTTPNGRAFAETTQTILEREKNWVKWKNELCAPFDKEP 438

Query: 366 LTSITDE 372
            ++  D+
Sbjct: 439 WSTRVDD 445


>gi|225682229|gb|EEH20513.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 701

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 141/548 (25%), Positives = 231/548 (42%), Gaps = 121/548 (22%)

Query: 70  MKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLR 124
           + + C   L   L+ +  D  T+D C ++F Y+E          +KQ++        +LR
Sbjct: 117 LTEQCEPGLFFWLIEELLDSQTIDGCRKVFDYLESRRERNTTKHFKQKSLI------ILR 170

Query: 125 MCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------G 176
            CN+LLRRLSR+++TVFCGR+ +FL + FP  +RS +N+  EF+ EN+T F        G
Sbjct: 171 SCNELLRRLSRAEDTVFCGRVFIFLFQSFPLGDRSSVNLRGEFHTENVTTFDKLPKAQDG 230

Query: 177 GDEEMDVSSNE--------------------------------------TEETDTEEVDK 198
               +DV S E                                       ++T   E   
Sbjct: 231 SGISVDVDSLEGKSSADMKIELPEAAAQQTNGEATPSTADEHQQGVKSAIQKTTEPEAAS 290

Query: 199 VKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD---------- 248
           V      Y  FWSLQ  F +P   ++  ++  F +  E  L+ F+    D          
Sbjct: 291 VLTTDELYPIFWSLQTNFSSPTTLFDPANFAAFKTGLEASLSTFQKVNTDMQVRITKGSE 350

Query: 249 ---------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-T 298
                     +  S   +  +  KYLT++ L +L+++D  FRR++L+Q LIL  +  S T
Sbjct: 351 ETRRGPKRRRIGDSTEMTNSFNPKYLTSRDLFELEVNDVAFRRHILVQALILLDFVLSLT 410

Query: 299 VKSRGEGLE-----------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKG 347
            K++ +  +           L  D  KW      ++ S ++Q   +G+ + ++V  +L  
Sbjct: 411 PKAKAKLADSTNKAVLYNYVLSDDDAKWASQMKASIASYLQQG-SEGKFYYRMVDTVLTR 469

Query: 348 EEHWNQWKNEGCPELKRPLTSITD--EDKKDEPDAKKKKTPELTKLWN----------SK 395
           +++W +WK EGCP ++R   SI D  + +         K   LT L +          + 
Sbjct: 470 DKNWVRWKAEGCPPIERTPVSIQDYLDTQASAIKVTTSKRLRLTPLGSLDLNFLMEDKNL 529

Query: 396 DNLEACKSAERDFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
           D++   K  +R  TP ++SY          +D A + EE+ + +   +N  WR LRL SR
Sbjct: 530 DSVARLKHPDRFTTPDIDSYLRGVADDEFNIDMAQSREEKDEAIKAKANKIWRTLRLSSR 589

Query: 452 KCPHFF-----------LNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAE 500
                F           L  TP   +++      V    + +P+ Q   NGNG D   A 
Sbjct: 590 SKLALFDKIEDGNNIKVLFETPPAPEDA-----AVPATSRPQPAVQNGVNGNGQDGPSAT 644

Query: 501 VEVDTKSE 508
            EV+   E
Sbjct: 645 GEVNESKE 652


>gi|426201362|gb|EKV51285.1| hypothetical protein AGABI2DRAFT_182243, partial [Agaricus bisporus
           var. bisporus H97]
          Length = 1043

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 115/363 (31%), Positives = 176/363 (48%), Gaps = 52/363 (14%)

Query: 62  ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN- 120
           + L +    +D C    P I+L D  +  T++ C  +F ++E   +   +       K  
Sbjct: 83  LDLVLTFTEQDACEQIFPFIVLQDLLETQTINSCSHIFSWIEARSSRLTEGMVPQKGKAL 142

Query: 121 NLLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNIISEFNV--ENITEFGG 177
            LLR  NDLLRRLS+S +T VFCGRIL FL+  FP  ERSG+N+  E+    E++     
Sbjct: 143 ILLRTLNDLLRRLSKSGSTTVFCGRILTFLSGVFPLGERSGVNLRGEYGPTWEDVWYPKH 202

Query: 178 DEE-----MDVSSNETEETDTEEVDKVKIDFN---------FYKKFWSLQDYFRNPVQCY 223
           DE+     MD     +   +  E + +++D N         FY  FW LQ  F  P    
Sbjct: 203 DEDTNAEMMDFRGESSRSDEVPEDNAMQVDSNKETKATPTDFYNTFWHLQYPFSKPSIFA 262

Query: 224 NKVSWKMFTSYAETVLAAFK--SYKLDDVQSSLNPSGD-------------------YFA 262
           NK ++  F    + VL   K  + K   +  S   +G                    +FA
Sbjct: 263 NKEAFSQFQEAIDKVLPVIKEATAKERAMMGSRVAAGVVLNKRKREPEPEESNVNEYFFA 322

Query: 263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLE----LKS 310
           K+LT+  LLDL+++DT+FRR  L Q +IL Q+   FT   K+     R   L+    L++
Sbjct: 323 KFLTSPDLLDLEIADTHFRRQFLFQLMILLQHLLTFTKGTKAAWATQRNRSLQMDFTLEA 382

Query: 311 DQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC-PELKRPLTSI 369
           +  +WV+DT       +KQT P+G  F++  + IL+ E++W +WKNE C P  K P ++ 
Sbjct: 383 EDTQWVQDTLSKSMEELKQTTPNGRAFAETTQTILEREKNWVKWKNELCAPFDKEPWSTR 442

Query: 370 TDE 372
            D+
Sbjct: 443 VDD 445


>gi|238508732|ref|XP_002385551.1| THO complex subunit Tho1, putative [Aspergillus flavus NRRL3357]
 gi|220688443|gb|EED44796.1| THO complex subunit Tho1, putative [Aspergillus flavus NRRL3357]
          Length = 685

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 133/485 (27%), Positives = 209/485 (43%), Gaps = 118/485 (24%)

Query: 72  DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMC 126
           + C   L   L+ +  D  T+D C ++F Y+E          +KQ++        +LR C
Sbjct: 122 EQCEPGLIFWLIEELLDSQTIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSC 175

Query: 127 NDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVS-- 184
           N+LLRRLSR+++TVFCGR+ +FL + FP  ++S +N+  E++ EN+T F    E+D S  
Sbjct: 176 NELLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSAVNLRGEYHTENVTTFDEIPELDTSVS 235

Query: 185 ----------------------SNETEE---------------------TDTEEVDKVKI 201
                                 +N+TE                      +  E VD+   
Sbjct: 236 DEADVVMSDEQGPPTTTEGQQENNDTEAPSIADPQAPVQDQPAAPKVIISQEEGVDEKAN 295

Query: 202 DFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD------------ 248
           D N  Y  FW LQ YF  P + ++   +  F +  E  L+AFKS   D            
Sbjct: 296 DLNKLYPTFWGLQAYFSAPTRIFDPQHFATFRTGLEATLSAFKSVNTDLESSGTSKTSEE 355

Query: 249 --------------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY 294
                         ++ SS NP      KYLT++ L DL+++DT FRR+VL+Q LIL  +
Sbjct: 356 LRKSNKRKRTSDGQEIASSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDF 409

Query: 295 FTS------------TVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVK 342
             S            T KS   G  L  D  KW     + +   + Q    G+ + ++V 
Sbjct: 410 MLSLTPKAKAKLAELTNKSVLYGFVLNEDDAKWAVKMRKAIEEYL-QEGVGGKFYYRMVD 468

Query: 343 LILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDA-----KKKKTPELTKL------ 391
            +L  +++W +WK EGCP ++RP  S++D     E        K+ ++  +  L      
Sbjct: 469 TVLSRDKNWVRWKAEGCPLIERPPVSVSDYLGAREHATKVYANKRLRSSPMGSLNLNFLS 528

Query: 392 -WNSKDNLEACKSAERDFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRAL 446
              S   LE  K  +R   PS +S+    +     MD A   E++       ++  WR L
Sbjct: 529 EGESLAGLERLKEPQRFAVPSSDSFMMGIMDDELDMDTAQTKEDKENATRAKASKTWRVL 588

Query: 447 RLLSR 451
           RL ++
Sbjct: 589 RLSAK 593


>gi|325187941|emb|CCA22485.1| THO complex subunit putative [Albugo laibachii Nc14]
          Length = 620

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 131/468 (27%), Positives = 202/468 (43%), Gaps = 78/468 (16%)

Query: 54  DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
           D   I   + LC++  + ++     P  +L D  +   +  CE L+  +EV         
Sbjct: 80  DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 139

Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
           FF +          LLRMCN +LRRLS++ NTVFCGRIL+FL+  F  +ERS +N+ S+ 
Sbjct: 140 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAVNLHSKA 199

Query: 168 NVENITEFGGD-----EEMDVSSNETEETDT-------EEVDKVKIDFNFYKKFWSLQDY 215
           NVEN+T F         E D  +N  E  D        ++    +ID+  Y  FW +Q +
Sbjct: 200 NVENLTIFEDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQRF 259

Query: 216 FRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNPS 257
           FR+  +   + +  K F      VL AF++    D                 V S+   S
Sbjct: 260 FRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKMS 319

Query: 258 GD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
            D                +  KYLTN +L  LQL D   R  VL Q LIL  +      +
Sbjct: 320 IDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHLVRAKST 379

Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
             +     S Q+  + +  + V  L+K+TPPDG  +S++V  +L+ E +W +WK E CP 
Sbjct: 380 VSQD----STQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCPS 435

Query: 362 L--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----LE 399
                          KR   + T+   + +   +KK   E   + ++  S  +     LE
Sbjct: 436 YEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLLE 495

Query: 400 ACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
             K   R    +L    +   +  DPA+ ++++Y    D  Y WR LR
Sbjct: 496 KIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 543


>gi|325187939|emb|CCA22483.1| THO complex subunit putative [Albugo laibachii Nc14]
          Length = 588

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 132/468 (28%), Positives = 205/468 (43%), Gaps = 78/468 (16%)

Query: 54  DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
           D   I   + LC++  + ++     P  +L D  +   +  CE L+  +EV         
Sbjct: 48  DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 107

Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
           FF +          LLRMCN +LRRLS++ NTVFCGRIL+FL+  F  +ERS +N+ S+ 
Sbjct: 108 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAVNLHSKA 167

Query: 168 NVENITEFGGD-----EEMDVSSNETEETDT-------EEVDKVKIDFNFYKKFWSLQDY 215
           NVEN+T F         E D  +N  E  D        ++    +ID+  Y  FW +Q +
Sbjct: 168 NVENLTIFEDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQRF 227

Query: 216 FRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNPS 257
           FR+  +   + +  K F      VL AF++    D                 V S+   S
Sbjct: 228 FRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKMS 287

Query: 258 GD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
            D                +  KYLTN +L  LQL D   R  VL Q LIL  +    V++
Sbjct: 288 IDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHL---VRA 344

Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
           +   +   S Q+  + +  + V  L+K+TPPDG  +S++V  +L+ E +W +WK E CP 
Sbjct: 345 KS-TVSQDSTQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCPS 403

Query: 362 L--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----LE 399
                          KR   + T+   + +   +KK   E   + ++  S  +     LE
Sbjct: 404 YEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLLE 463

Query: 400 ACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
             K   R    +L    +   +  DPA+ ++++Y    D  Y WR LR
Sbjct: 464 KIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 511


>gi|325187943|emb|CCA22487.1| THO complex subunit putative [Albugo laibachii Nc14]
          Length = 583

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 131/469 (27%), Positives = 204/469 (43%), Gaps = 80/469 (17%)

Query: 54  DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
           D   I   + LC++  + ++     P  +L D  +   +  CE L+  +EV         
Sbjct: 80  DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 139

Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
           FF +          LLRMCN +LRRLS++ NTVFCGRIL+FL+  F  +ERS +N+ S+ 
Sbjct: 140 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAVNLHSKA 199

Query: 168 NVENITEFGGDEEMDVSSNETEETDTEEVDKV-------------KIDFNFYKKFWSLQD 214
           NVEN+T F  D+    ++   E+T+  E D               +ID+  Y  FW +Q 
Sbjct: 200 NVENLTIF-EDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQR 258

Query: 215 YFRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNP 256
           +FR+  +   + +  K F      VL AF++    D                 V S+   
Sbjct: 259 FFRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKM 318

Query: 257 SGD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
           S D                +  KYLTN +L  LQL D   R  VL Q LIL  +      
Sbjct: 319 SIDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHLVRAKS 378

Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
           +  +     S Q+  + +  + V  L+K+TPPDG  +S++V  +L+ E +W +WK E CP
Sbjct: 379 TVSQD----STQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCP 434

Query: 361 EL--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----L 398
                           KR   + T+   + +   +KK   E   + ++  S  +     L
Sbjct: 435 SYEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLL 494

Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
           E  K   R    +L    +   +  DPA+ ++++Y    D  Y WR LR
Sbjct: 495 EKIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 543


>gi|325187938|emb|CCA22482.1| THO complex subunit putative [Albugo laibachii Nc14]
          Length = 682

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 131/469 (27%), Positives = 201/469 (42%), Gaps = 80/469 (17%)

Query: 54  DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
           D   I   + LC++  + ++     P  +L D  +   +  CE L+  +EV         
Sbjct: 48  DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 107

Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
           FF +          LLRMCN +LRRLS++ NTVFCGRIL+FL+  F  +ERS +N+ S+ 
Sbjct: 108 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAVNLHSKA 167

Query: 168 NVENITEFGGDEEMDVSSNETEETDTEEVDKV-------------KIDFNFYKKFWSLQD 214
           NVEN+T F  D+    ++   E+T+  E D               +ID+  Y  FW +Q 
Sbjct: 168 NVENLTIF-EDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQR 226

Query: 215 YFRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNP 256
           +FR+  +   + +  K F      VL AF++    D                 V S+   
Sbjct: 227 FFRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKM 286

Query: 257 SGD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
           S D                +  KYLTN +L  LQL D   R  VL Q LIL  +      
Sbjct: 287 SIDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHLVRAKS 346

Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
           +  +     S Q+  + +  + V  L+K+TPPDG  +S++V  +L+ E +W +WK E CP
Sbjct: 347 TVSQD----STQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCP 402

Query: 361 EL--------------KRPLTSITDEDKKDEPDAKKKKTPELTKL--------WNSKDNL 398
                           KR   + T+   + +   +KK   E   +          S   L
Sbjct: 403 SYEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLL 462

Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
           E  K   R    +L    +   +  DPA+ ++++Y    D  Y WR LR
Sbjct: 463 EKIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 511


>gi|325187944|emb|CCA22488.1| THO complex subunit putative [Albugo laibachii Nc14]
          Length = 551

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 132/469 (28%), Positives = 207/469 (44%), Gaps = 80/469 (17%)

Query: 54  DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
           D   I   + LC++  + ++     P  +L D  +   +  CE L+  +EV         
Sbjct: 48  DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 107

Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
           FF +          LLRMCN +LRRLS++ NTVFCGRIL+FL+  F  +ERS +N+ S+ 
Sbjct: 108 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAVNLHSKA 167

Query: 168 NVENITEFGGDEEMDVSSNETEETDTEEVDKV-------------KIDFNFYKKFWSLQD 214
           NVEN+T F  D+    ++   E+T+  E D               +ID+  Y  FW +Q 
Sbjct: 168 NVENLTIF-EDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQR 226

Query: 215 YFRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNP 256
           +FR+  +   + +  K F      VL AF++    D                 V S+   
Sbjct: 227 FFRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKM 286

Query: 257 SGD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
           S D                +  KYLTN +L  LQL D   R  VL Q LIL  +    V+
Sbjct: 287 SIDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHL---VR 343

Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
           ++   +   S Q+  + +  + V  L+K+TPPDG  +S++V  +L+ E +W +WK E CP
Sbjct: 344 AKS-TVSQDSTQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCP 402

Query: 361 EL--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----L 398
                           KR   + T+   + +   +KK   E   + ++  S  +     L
Sbjct: 403 SYEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLL 462

Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
           E  K   R    +L    +   +  DPA+ ++++Y    D  Y WR LR
Sbjct: 463 EKIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 511


>gi|400595094|gb|EJP62904.1| guanylate kinase [Beauveria bassiana ARSEF 2860]
          Length = 817

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 130/475 (27%), Positives = 214/475 (45%), Gaps = 66/475 (13%)

Query: 36  LEQSIRQYLLKLIKTPDIDI-KVIE--NYISLCVELCMKDMCNSTLPIILLSDTFDMSTL 92
           +E + R     LI +  ID  K +   N++ +   L     C+  L   L+ +  D  T+
Sbjct: 82  VEGASRDLFGALIASIPIDSPKFVRMWNFLDILSILSDGGQCDPALVFWLVEELLDSQTI 141

Query: 93  DKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKF 152
             C  +F ++E        + F    K  +LR CN+LLRRLSR+++T FCGR+ +F+ + 
Sbjct: 142 AGCRHIFDFLESRRERMTAKNFSQK-KLVILRSCNELLRRLSRAEDTAFCGRVFIFMFQC 200

Query: 153 FPFSERSGLNIISEFNVENITEF-----GGDEEMDVSSNETEETDTEEVDKVK------- 200
           FP  +RS +N+  E++VEN+T +     G   +MDV +    ET+ EE+   K       
Sbjct: 201 FPLGDRSSVNLRGEYHVENVTAYETTANGDATKMDVDA----ETNGEEMADAKDGKPKEG 256

Query: 201 ----IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP 256
                    Y   WSLQ+ F  P   ++   +  F    E  + AF+S   DD +S    
Sbjct: 257 SSRLSSEELYPLLWSLQESFSQPTTLFDSAKFSHFKHSLEETMKAFESLHADDQRSLKRK 316

Query: 257 SGDYFA---------KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKSRGE 304
             D  A         KYLT+Q L +L++SD +FRR+VL+Q LI+  +    ++  K +  
Sbjct: 317 RNDEGATTIAEAFNPKYLTSQDLFELEISDLSFRRHVLVQALIITDFLLSLSAEAKQKLT 376

Query: 305 GL-----------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQ 353
           G+           +L  +  KW  DT   +   ++Q   DG +F ++V+ +L  +++W  
Sbjct: 377 GVPTANKAVVYSEQLSEEDTKWATDTKRRISEYLRQG-FDGPYFFRMVETVLARDKNWVF 435

Query: 354 WKNEGCPELKR-PLTS---ITDEDKKDEPDAKKKKTPELTKLWN--------SKDNLEAC 401
           WK   CP ++R P+T+   +      +     K+  P      +        S   L+  
Sbjct: 436 WKMASCPPIEREPVTAADFVAARSSAERMATSKRLRPTPIGAVSMAFLDRSASSATLDEL 495

Query: 402 KSAERDFTPSLESYFEEAIQQMD-----PAAAVEEQYKKVNDSNYAWRALRLLSR 451
           +   R   P L S F+  I   D     PA+           S+ +WRALR+ +R
Sbjct: 496 RPPARYQLPELAS-FQRPIADDDFEIGMPASDASRAAAVHGKSSKSWRALRIAAR 549


>gi|452822457|gb|EME29476.1| THO complex subunit 1 [Galdieria sulphuraria]
          Length = 510

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 128/471 (27%), Positives = 209/471 (44%), Gaps = 65/471 (13%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENY------ISLCVELCMKDMCNSTLPIILLS 84
           D  S LE  +R  LL+L+     +    E+Y      + L + L  + M     P  LL 
Sbjct: 36  DLGSILELELRGILLELVSRSLSEEVSEEHYLKIFRLVDLSLLLSKESMVEQATPCQLLE 95

Query: 85  DTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGR 144
           D  D +  ++C ++  Y E ++  ++        K   LR C  LLRR SR  +   CGR
Sbjct: 96  DLMDTTPTNQCWRVLDYFEKHLGNFRSHEILFKGKLIFLRACISLLRRYSRVLDAETCGR 155

Query: 145 ILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFN 204
           I+L L+  FP SERSG+N+   F+  N T++       V + E  E  +   D + +D+N
Sbjct: 156 IILLLSSAFPPSERSGVNLKGNFHTSNTTQY-------VRAEEEREYSSTSGDCL-LDWN 207

Query: 205 FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKS---YKLDDVQSSLNPSGDY- 260
            Y+ FWSLQ +  NP   Y    WK F +    VL   ++    +L D   S+       
Sbjct: 208 LYETFWSLQQFLSNPTVAYGPAEWKSFCNSLSKVLQVMETDPVRELADDPLSMTEVEQVS 267

Query: 261 ---------FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSD 311
                    F KYLT+  LL  Q  D+ FRRY L+Q+LI  Q+  +    R     + SD
Sbjct: 268 VEQCEHQLDFVKYLTSPYLLSFQFRDSVFRRYHLVQYLIFLQHMATL---RENFAWIGSD 324

Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN-EGCPELKRPLTSIT 370
           +   ++     ++ +++ T P GE F + +K +L  E +W +WK  + C   +RP   I 
Sbjct: 325 E---MEGLVNRIFRVLQGTKPQGERFVEYIKRVLLHERYWLKWKTLDNCKAFERPPCEIP 381

Query: 371 -------------------------DEDKKDEPDAKKKKTPELTK---LWNSKDNLEAC- 401
                                    D ++K   + + K   ++++    W +K+  E   
Sbjct: 382 ENSAGENDNILWRSLEQRWQRLYNEDRERKTLSEPENKSILQMSRSYWSWKTKEQREGTL 441

Query: 402 KSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK 452
           K A R   P+++   +E   + D    +EE+  K N  ++ W++LR+L  K
Sbjct: 442 KDAGRIVVPNIDEIMDEL--KRDKEEGIEEELAKKNSYSFVWKSLRVLLGK 490


>gi|119486899|ref|XP_001262369.1| THO complex subunit Tho1, putative [Neosartorya fischeri NRRL 181]
 gi|119410526|gb|EAW20472.1| THO complex subunit Tho1, putative [Neosartorya fischeri NRRL 181]
          Length = 674

 Score =  162 bits (411), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 138/529 (26%), Positives = 233/529 (44%), Gaps = 125/529 (23%)

Query: 34  SALEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDM 89
           +A+E + R+    L+ +  ID    I++  N + +       + C   L   L+ +  D 
Sbjct: 66  AAVELAFREKFYSLLASTTIDEPSFIRMW-NLLDIISIFSDNEQCEPGLIFWLIEELLDS 124

Query: 90  STLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGR 144
            T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR
Sbjct: 125 QTIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGR 178

Query: 145 ILLFLAKFFPFSERSGLNIISEFNVENITEF------GGDE--EMDVSSNETEET----- 191
           + +FL + FP  ++S +N+  E++ EN+T F      G  E  +MDV  ++ +ET     
Sbjct: 179 VFIFLFQSFPLGDKSAVNLRGEYHTENVTTFDEITKLGAKESDDMDVEMSDAKETLTATE 238

Query: 192 ---------------------------------DTEEVDKVK-IDFN-FYKKFWSLQDYF 216
                                             TE+ D  K +D +  Y  FW LQ YF
Sbjct: 239 AQGYDHDGQPPPSASASQTPVQGQAKVPKLLVSKTEDKDTEKAVDLDALYPVFWGLQAYF 298

Query: 217 RNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DV 250
             P + ++   +  F +  E  L AFK+ + D                          ++
Sbjct: 299 SAPTKVFDSQHFSSFMTGLENTLIAFKNIRTDLENQSASKTAEEIRKSTKRKRSVDGQEI 358

Query: 251 QSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE----- 304
            SS NP      KYLT++ L DL+++DT FRR+VL+Q LIL  +  S T K++ +     
Sbjct: 359 ASSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFMLSLTPKAKAKLADLT 412

Query: 305 ------GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEG 358
                 G  L  +  KW     + +   + Q    G+ + ++V  +L  +++W +WK EG
Sbjct: 413 NKSVLYGFVLNEEDAKWATKMRKAIEEYL-QEGTGGKFYYRMVDTVLSRDKNWVRWKAEG 471

Query: 359 CPELKRPLTSITD----EDKKDEPDAKKKKTPELTKLWN--------SKDNLEACKSAER 406
           CP ++RP  S+ +     +   +  A K+  P      N        S  ++E  K  ER
Sbjct: 472 CPLIERPAVSVAEYLGARENATKTYANKRLRPSPMGALNLNFLAEGESLSSIEKLKEPER 531

Query: 407 DFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
              PS +S+    +     +D A   E++   +   ++  WR LRL ++
Sbjct: 532 YSVPSADSFMMGIMDDELDIDTAHTKEDKDTAIRAKASKTWRVLRLSAK 580


>gi|389639088|ref|XP_003717177.1| guanylate kinase [Magnaporthe oryzae 70-15]
 gi|351642996|gb|EHA50858.1| guanylate kinase [Magnaporthe oryzae 70-15]
          Length = 810

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 133/507 (26%), Positives = 231/507 (45%), Gaps = 103/507 (20%)

Query: 34  SALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKD---MCNSTLPIILLSDTFDMS 90
           + LE ++R    +L+    ID        +L   LC+ +   +C+  L   L+ +  D  
Sbjct: 74  AVLETAVRDTFGRLLAESSIDAPDFVRVWNLLDLLCILEDLELCDPALLFWLIEELLDSQ 133

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
           T+  C ++F Y+E        +  F S K  +LR CN+LLRRLSR+++T FCGR+ +FL 
Sbjct: 134 TISGCRKIFDYLESRRQKITSK-HFASKKLVILRTCNELLRRLSRARDTAFCGRVFIFLF 192

Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDK------------ 198
           + FP  ++S +N+  EF+VEN+T +    + D +      TD  E+D             
Sbjct: 193 QSFPLGDKSSVNLRGEFHVENVTTY----DQDAAKRNGATTDQMEIDGQPATATPNEGKQ 248

Query: 199 --------------VKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF- 242
                           +D +  Y  FWSLQ  F  P   +++ ++  F S  ++ L+A  
Sbjct: 249 ATLKHTSSDAKGQGPPVDMDALYPIFWSLQSSFSQPKNLFDRKNFAEFQSNLQSTLSALE 308

Query: 243 --------KSYKLDDVQSS-----LNPSGDYFA-----KYLTNQKLLDLQLSDTNFRRYV 284
                   +S + D    +      +P G+  A     +YLT++ L +L+LSD  FRRY+
Sbjct: 309 YAQGQASSRSKQYDGGSRAGQKRKRDPDGNELADSFNPRYLTSRDLFELELSDLTFRRYI 368

Query: 285 LLQFLILFQYFTSTVKSRGEGL--------------ELKSDQE-KWVKDTTETVYS-LIK 328
           L+Q LI+ ++  S      E L              ++ SD+E KW  +  + + + L  
Sbjct: 369 LVQALIIMEFLLSLSAEAKEKLAQVKVNNKSVTYSDQVLSDEEAKWAVNMKDKISAQLTS 428

Query: 329 QTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE-------------DKK 375
            +  DG++F ++V  +L  +++W +WK EGCP+++ P  S+  E             +KK
Sbjct: 429 ISVGDGQYFHRIVATVLARDKNWVRWKIEGCPQIELP--SVPPEQFVEAKKTVHRQVNKK 486

Query: 376 DEPDAKKKKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMD----------- 424
              +A    + +     ++   L++ K ++R   P L   F+  I + D           
Sbjct: 487 HRSNAGSALSLDFLSTSSNSSVLDSLKVSDRYQLPEL-GMFQRKIAEDDLEIDMPMNDKS 545

Query: 425 PAAAVEEQYKKVNDSNYAWRALRLLSR 451
            A A+E +  K      +WRALR+ S+
Sbjct: 546 KAEAIEGKASK------SWRALRIASK 566


>gi|346319558|gb|EGX89159.1| THO complex subunit Tho1, putative [Cordyceps militaris CM01]
          Length = 805

 Score =  162 bits (409), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 121/452 (26%), Positives = 202/452 (44%), Gaps = 66/452 (14%)

Query: 60  NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK 119
           N++ +   L     C++ L   L+ +  D  T+  C  LF ++E        + F    K
Sbjct: 116 NFLDILSILSDDGQCDAALVFWLVEELLDSQTIAGCRHLFDFLESRRERMTAKNFSQK-K 174

Query: 120 NNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF---- 175
             +LR CN+LLRRLSR+++T FCGR+ +F+ + FP  +RS +N+  E++VEN+T +    
Sbjct: 175 LVILRSCNELLRRLSRAEDTAFCGRVFIFMFQCFPLGDRSSVNLRGEYHVENVTTYETTT 234

Query: 176 -GGDEEMDVSSNETEETDTEEVDKVKIDFN---------------FYKKFWSLQDYFRNP 219
            G   +MD+   + E    E   + K+D                  Y   WSLQ+ F  P
Sbjct: 235 NGSSTKMDI---DIEAPVAETNGEKKLDAKDSKAKEGGSRLSSEELYPLLWSLQESFSQP 291

Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGDYFA--------KYLTNQKLL 271
              ++   +  F    E  L AF+S   DD +SS     D  A        KYLT+Q L 
Sbjct: 292 TTLFDAAKFSHFKHSLEETLKAFESLHADDPRSSKRKRNDEGAAVAEAFNPKYLTSQDLF 351

Query: 272 DLQ-LSDTNFRRYVLLQFLILFQYFTSTVKSRGEGL--------------ELKSDQEKWV 316
            L+ ++D +FRR+VL+Q LI+ ++  S      + L              +L  +  KW 
Sbjct: 352 VLEVINDLSFRRHVLVQALIITEFLLSLSAEAKQKLSSLPAANKAVVYSEQLSEENTKWA 411

Query: 317 KDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD----- 371
            +T   +   ++Q   DG +F ++V+ +L  +++W  WK   CP ++R   +  D     
Sbjct: 412 TETKRRISEYLRQG-VDGPYFLRMVETVLARDKNWVFWKMTSCPPIERAPVAAADFVAAR 470

Query: 372 -------EDKKDEPDAKKKKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMD 424
                    K+  P      + E      S+ +++  +   R   P+L S FE  +   D
Sbjct: 471 TSAERMATSKRLRPTPMGAVSMEFLNRAASQTSMDELQPPARSALPALAS-FERPLADDD 529

Query: 425 -----PAAAVEEQYKKVNDSNYAWRALRLLSR 451
                P + V +       S+ +WRALR+ +R
Sbjct: 530 FEIAMPTSDVTKAAAVYGKSSKSWRALRIAAR 561


>gi|295662084|ref|XP_002791596.1| nuclear matrix protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226279722|gb|EEH35288.1| nuclear matrix protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 649

 Score =  161 bits (408), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 150/601 (24%), Positives = 246/601 (40%), Gaps = 139/601 (23%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIIL 82
           P   Y   + +E + R+    L+ T  ID      I N + +       + C   L   L
Sbjct: 55  PQAQY---AVVETAFREKFYALLATTSIDDPSFIQIWNLLDIVSIFSDNEQCEPGLLFWL 111

Query: 83  LSDTFDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQ 137
           + +  D  T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR++
Sbjct: 112 IEELLDSQTIDGCRKVFDYLESRRERNTTKHFKQKSLI------ILRSCNELLRRLSRAE 165

Query: 138 NTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF---------------------- 175
           +TVFCGR+ +FL + FP  +RS +N+  EF+ EN+T F                      
Sbjct: 166 DTVFCGRVFIFLFQSFPLGDRSSVNLRGEFHTENVTTFDKLPKAQEGSAISVEVDFLEGK 225

Query: 176 ------------------------GGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWS 211
                                     DE    +    ++T   E   V      Y  FWS
Sbjct: 226 SSADMKIESPQAAAQQTNGEVTPSTADEHQQGAKPAIQKTTEPEAASVLTTDELYPIFWS 285

Query: 212 LQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD----------------------- 248
           LQ  F +P   ++  ++  F +  E  L+ F+    D                       
Sbjct: 286 LQTNFSSPTTLFDPTNFAAFKTGLEASLSTFQKVNTDMQVRMTKGSEETRRGPKRRRIGD 345

Query: 249 --DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEG 305
             ++ +S NP      KYLT++ L +L+++D  FRR++L+Q LIL  +  S T K++ + 
Sbjct: 346 GTEMTNSFNP------KYLTSRDLFELEVNDVAFRRHILVQALILLDFVLSLTPKAKAKL 399

Query: 306 LE-----------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQW 354
            +           L  D  KW      ++ S ++Q   +G+ + ++V  +L  +++W +W
Sbjct: 400 ADSTNKAVLYNYVLSDDDAKWASQMKASIASYLQQGS-EGKFYYRMVDTVLTRDKNWVRW 458

Query: 355 KNEGCPELKRPLTSITD------EDKKDEPDAKKKKTP----ELTKLWNSK--DNLEACK 402
           K EGCP ++R   SI D         K     + + TP    +L  L   K  D++   K
Sbjct: 459 KAEGCPPIERTPVSIQDYLDTQASAVKVTTSKRLRSTPLGSLDLNFLMEDKNLDSVARLK 518

Query: 403 SAERDFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSRKCPHFF- 457
             +R  TP ++SY          +D A + EE+ + +   +N  WR LRL SR     F 
Sbjct: 519 HPDRFTTPEIDSYLRGVADDEFNIDMAQSREEKDEAIKAKANKIWRTLRLSSRSKLALFD 578

Query: 458 ----------LNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKS 507
                     L  TP   +++      +    + + + Q   NGNG D   A  EV+   
Sbjct: 579 KIEDGNNIKVLFETPPAPEDA-----AIAATSRPQLAVQNGVNGNGQDGPSATGEVNESK 633

Query: 508 E 508
           E
Sbjct: 634 E 634


>gi|166240592|ref|XP_643594.2| hypothetical protein DDB_G0275717 [Dictyostelium discoideum AX4]
 gi|165988675|gb|EAL69609.2| hypothetical protein DDB_G0275717 [Dictyostelium discoideum AX4]
          Length = 726

 Score =  161 bits (408), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 139/446 (31%), Positives = 206/446 (46%), Gaps = 72/446 (16%)

Query: 6   QLNFDK-SQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISL 64
           +L F+K S L  +   Y   K     + K++++ +IR +   LIK  DI  + I+  I L
Sbjct: 66  KLLFNKDSLLKEIQKIYPNIKDPIQIEIKTSIDLNIRIFFNNLIKQIDITYENIDLAIKL 125

Query: 65  CVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSC--KNNL 122
                   + +S LP+ L  D F+  T+ KC  LF  +E    I+ Q    +    +N L
Sbjct: 126 AYSFVELGILDSILPLQLSEDLFETKTISKCLDLFGLLESRAEIFSQDPEIIKGRKRNLL 185

Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV--ENITEFGGDEE 180
           L++C +LL+   R  N   CGRILLFLA  FP S+ SGLN   E N+  E   +F  D  
Sbjct: 186 LKICIELLK---RETNPDSCGRILLFLAYVFPLSDPSGLNTKGEHNIHPEEALDFQNDIM 242

Query: 181 MDVSSNETEETDTEE-------------------VDKVKIDFNFYKKFWSLQDYFRNPVQ 221
            +V+ N     +  E                    +   +D NFY++FW LQ  F+NP Q
Sbjct: 243 NNVNGNNNNNVNNSEDTTTTSTAATATVITTTNGNNDTTVDRNFYRQFWGLQTVFQNPQQ 302

Query: 222 C----------------YNKVSWKMFTSYAETVLAAFKSY-KLDDV--QSSLNPS-GDYF 261
                             NK+ W+ F    E V+ +F ++  LD++   SS NPS   YF
Sbjct: 303 VLLNTTVTTGGTITNITLNKIKWESFIQSLELVIGSFSTHINLDELSQSSSNNPSKKHYF 362

Query: 262 AKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTE 321
            KYLT+  L+ LQL D+ FR+ +L Q LI FQ    T  ++         Q+  +++ T 
Sbjct: 363 TKYLTSSNLMKLQLKDSIFRKNILTQILITFQALDLT--NQKYPTIFNDLQKNIIQELTN 420

Query: 322 TVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEG-CPELKRPLTSITDEDKKDEPDA 380
             + ++  T P+GE+FS  +  ILK E++W  WK +  C   +RP  S         P  
Sbjct: 421 KCFKILSNTNPNGEYFSNCLSSILKREKNWIIWKRDNQCKPFERPPCS---------PIV 471

Query: 381 KKKK-------------TPELTKLWN 393
           KKKK               EL++LWN
Sbjct: 472 KKKKLFRKTALTKISLGNQELSRLWN 497


>gi|115388801|ref|XP_001211906.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114195990|gb|EAU37690.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 652

 Score =  161 bits (408), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 150/550 (27%), Positives = 232/550 (42%), Gaps = 118/550 (21%)

Query: 78  LPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL--LRMCNDLLRRLSR 135
           L II +       T+D C ++F Y+E       ++ F    + NL  LR CN+LLRRLSR
Sbjct: 97  LDIISIFSDNGHQTIDGCRKVFDYLESRRERNTKKHF---KQKNLIILRSCNELLRRLSR 153

Query: 136 SQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEE 195
           +++TVFCGR+ +FL + FP  ++S +N+  E++ EN+T F  D+     + E+ +TD E 
Sbjct: 154 AEDTVFCGRVFIFLFQSFPLGDKSAVNLRGEYHTENVTTF--DDITKEPTKESADTDVEM 211

Query: 196 VD---------------------------------KV------------KIDFN-FYKKF 209
           VD                                 KV            K+DF+  Y  F
Sbjct: 212 VDEQSAPPAAEGQKDESEPQPAPTASQTPDSSKPPKVVISTSEDREEDKKVDFDTLYPMF 271

Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPS------------ 257
           W LQ YF  P + ++   +  F S  E  L AFK    D   S+ N +            
Sbjct: 272 WGLQAYFSAPTKVFDPQHFATFKSGLEATLDAFKKVNTDLENSNSNRASEELRKANKRKR 331

Query: 258 --------GDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE---- 304
                   G +  KYLT++ L DL+++DT FRR+VL+Q LIL  +  S T K++ +    
Sbjct: 332 GADGQEIGGSFNPKYLTSRDLFDLEINDTAFRRHVLVQALILLDFMLSLTPKAKAKLTDL 391

Query: 305 -------GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNE 357
                  G  L  +  KW     + +   ++Q    G+ + ++V  +L  +++W +WK E
Sbjct: 392 TNKSVLYGFVLNDEDAKWAIKVRKAIEEYLQQG-IGGKFYYRMVDTVLSRDKNWVRWKAE 450

Query: 358 GCPELKRPLTSIT------DEDKKDEPDAKKKKTP------ELTKLWNSKDNLEACKSAE 405
           GCP ++RP  S+T      D   K   + + + +P      +      S   +E  K AE
Sbjct: 451 GCPLIERPPVSVTEYLGARDVATKVYANKRLRASPMGSLDLKFLSDGESLSGMERLKEAE 510

Query: 406 RDFTPSLESYF----------EEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
           R   PS +S+           + A  + D  AAV  +  K+      WR LRL ++    
Sbjct: 511 RFTVPSSDSFMMGIMDDDLDLDMAQTEEDKDAAVRSKDSKI------WRILRLSAKSK-- 562

Query: 456 FFLNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEK 515
             L A   +E               E     + G     +   A  E  T  EE QE   
Sbjct: 563 --LAAFDKIEDGKNLKILFEAPQPTEGTPQALEGTPQAAEPAKASTEETTAQEESQENGN 620

Query: 516 EEDWEAKADP 525
             + E+K DP
Sbjct: 621 VGNGESKEDP 630


>gi|121714347|ref|XP_001274784.1| nuclear matrix protein [Aspergillus clavatus NRRL 1]
 gi|119402938|gb|EAW13358.1| nuclear matrix protein [Aspergillus clavatus NRRL 1]
          Length = 669

 Score =  161 bits (407), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 133/528 (25%), Positives = 227/528 (42%), Gaps = 123/528 (23%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+    L+    ID      I N + +       + C   L   L+ +  D  
Sbjct: 66  AAVELAFREKFYSLLAITSIDEPSFIQIWNLLDIISIFSDNEQCEPGLIFWLIEEMLDSQ 125

Query: 91  TLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFG--------GDEEMDV-------------- 183
            +FL + FP  ++S +N+  E++ EN+T F         G+++MDV              
Sbjct: 180 FIFLFQSFPLGDKSAVNLRGEYHTENVTTFDETVQSNTQGNDDMDVEMSDGKPALAAPEG 239

Query: 184 -------------SSNETEETDTEEVDKVKIDFN-------------FYKKFWSLQDYFR 217
                          ++T   D  ++ K+ +  N              Y  FW LQ YF 
Sbjct: 240 QGADRESQPPSSGRDSQTPAPDQAKIPKLVVSGNEDKSTDKDVDLSTLYPMFWGLQAYFS 299

Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DVQ 251
            P + ++   +  F    E  L  FK+ + D                          ++ 
Sbjct: 300 APTKVFDPQHFASFKKGLENTLTVFKTIRTDLESQSASRTAEEIRKSNKRKRTTDGQEIA 359

Query: 252 SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE------ 304
           SS NP      KYLT++ L DL+++DT FRR+VL+Q LIL  +  S T K++ +      
Sbjct: 360 SSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFMLSLTPKAKAKLADLTN 413

Query: 305 -----GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
                G  L  +  KW     + +   + Q    G+ + ++V  +L  +++W +WK EGC
Sbjct: 414 KSVLYGFVLSEEDAKWASRMRKVIEEYL-QEGAGGKFYYRMVDTVLSRDKNWVRWKAEGC 472

Query: 360 PELKRPLTSITD----EDKKDEPDAKKKKTPELTKLWN--------SKDNLEACKSAERD 407
           P +++P  S+T+     +   +  A K+  P      N        S  +LE  K  ER 
Sbjct: 473 PLIEKPAVSVTEYLGARENATKTYANKRLRPSPMGSLNLNFLTEAESSVSLERLKEQERY 532

Query: 408 FTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
             P+ +S+    +     +D A   E++   +   ++  WR LRL ++
Sbjct: 533 TVPAADSFMMGLMDDELDIDTAQTKEDREMAMRAKASKTWRILRLSAK 580


>gi|449540150|gb|EMD31146.1| hypothetical protein CERSUDRAFT_120108 [Ceriporiopsis subvermispora
           B]
          Length = 757

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 164/356 (46%), Gaps = 53/356 (14%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           + + + L +     D C  +LP+ +LSD  +  T+  C  +F ++E   +   +      
Sbjct: 79  LRDRLDLILTFTEHDACEQSLPLQVLSDIMETQTISSCSHIFSWIEERADRLTKDMVPQK 138

Query: 118 CKN-NLLRMCNDLLRRLSRS-QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF 175
            K   LLR  NDLLRRLS++   T+FCGRIL FL++ FP  ERSG+N+  E+       +
Sbjct: 139 GKALVLLRTLNDLLRRLSKTGSTTMFCGRILTFLSRVFPLGERSGVNLRGEYG----PTW 194

Query: 176 GGDEEM-----------DVSSNETEETDTEEV--DKVKIDFNFYKKFWSLQDYFRNPVQC 222
            G   M           D +  E EE    +   +K K     Y  FWSLQ  F  P   
Sbjct: 195 DGPGSMKPEKQEETLRVDEAKPEVEEDPVAKAAEEKRKRQEELYNTFWSLQLPFSRPSLF 254

Query: 223 YNKVSWKMFTSYAETVLAAFK--SYKLDDVQSSLNPSGD--------------------Y 260
            +   +  F      VL   K  + K   +  S   +G                     +
Sbjct: 255 VDPAVFPHFKEAVNKVLPVIKEATTKERALMGSKTHAGGGASLKRKREPEAEANAGKEYF 314

Query: 261 FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLEL---- 308
           FAKYLT+ +LLDL+++DT+FRR  L Q LI   +   FT   K+     R   L+L    
Sbjct: 315 FAKYLTSPELLDLEIADTHFRRQFLFQLLISLNHLLAFTKATKATWVTPRNRSLQLDFTL 374

Query: 309 KSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
           ++D  +WV +T       ++QT P+G  F++   +IL+ E++W +WKNE CP   R
Sbjct: 375 EADDTQWVTETAAKALEELRQTAPNGRAFAETAVVILEREKNWVRWKNELCPAFDR 430


>gi|440475720|gb|ELQ44383.1| guanylate kinase [Magnaporthe oryzae Y34]
 gi|440486031|gb|ELQ65934.1| guanylate kinase [Magnaporthe oryzae P131]
          Length = 810

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 132/507 (26%), Positives = 230/507 (45%), Gaps = 103/507 (20%)

Query: 34  SALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKD---MCNSTLPIILLSDTFDMS 90
           + LE ++R    +L+    ID        +L   LC+ +   +C+  L   L+ +  D  
Sbjct: 74  AVLETAVRDTFGRLLAESSIDAPDFVRVWNLLDLLCILEDLELCDPALLFWLIEELLDSQ 133

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
           T+  C ++F Y+E        +  F S K  +LR CN+LLRRLSR+++T FCGR+ +FL 
Sbjct: 134 TISGCRKIFDYLESRRQKITSK-HFASKKLVILRTCNELLRRLSRARDTAFCGRVFIFLF 192

Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDK------------ 198
           + FP  ++S +N+  EF+VEN+T +    + D +      TD  E+D             
Sbjct: 193 QSFPLGDKSSVNLRGEFHVENVTTY----DQDAAKRNGATTDQMEIDGQPATATPNEGKQ 248

Query: 199 --------------VKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF- 242
                           +D +  Y  FWSLQ  F  P   +++ ++  F S  ++ L+A  
Sbjct: 249 ATLKHTSSDAKGQGPPVDMDALYPIFWSLQSSFSQPKNLFDRKNFAEFQSNLQSTLSALE 308

Query: 243 --------KSYKLDDVQSS-----LNPSGDYFA-----KYLTNQKLLDLQLSDTNFRRYV 284
                   +S + D    +      +P G+  A     +YLT++ L +L+LSD  FRRY+
Sbjct: 309 YAQGQASSRSKQYDGGSRAGQKRKRDPDGNELADSFNPRYLTSRDLFELELSDLTFRRYI 368

Query: 285 LLQFLILFQYFTSTVKSRGEGL--------------ELKSDQE-KWVKDTTETVYS-LIK 328
           L+Q LI+ ++  S      E L              ++ SD+E KW  +  + + + L  
Sbjct: 369 LVQALIIMEFLLSLSAEAKEKLAQVKVNNKSVTYSDQVLSDEEAKWAVNMKDKISAQLTS 428

Query: 329 QTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE-------------DKK 375
            +  DG++F ++V  +L  +++W +WK  GCP+++ P  S+  E             +KK
Sbjct: 429 ISVGDGQYFHRIVATVLARDKNWVRWKIGGCPQIELP--SVPPEQFVEAKKTVHRQVNKK 486

Query: 376 DEPDAKKKKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMD----------- 424
              +A    + +     ++   L++ K ++R   P L   F+  I + D           
Sbjct: 487 HRSNAGSALSLDFLSTSSNSSVLDSLKVSDRYQLPEL-GMFQRKIAEDDLEIDMPMNDKS 545

Query: 425 PAAAVEEQYKKVNDSNYAWRALRLLSR 451
            A A+E +  K      +WRALR+ S+
Sbjct: 546 KAEAIEGKASK------SWRALRIASK 566


>gi|392597884|gb|EIW87206.1| UDP-glucose 4-epimerase [Coniophora puteana RWD-64-598 SS2]
          Length = 1065

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 131/425 (30%), Positives = 203/425 (47%), Gaps = 68/425 (16%)

Query: 1   MLDSF-QLNFDKSQLN-VVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKV- 57
           +LD F +  F+++ L+ +VS++   YK     + KS  E  ++Q + +L  T    +K  
Sbjct: 13  LLDRFPKRPFEQAVLSKLVSEELRKYKGHTTDNVKSRWEFLLKQEVFRLAATEGTALKTD 72

Query: 58  -------IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWK 110
                  + + + L +     D C+ + P+ +L D  +  T+  C Q+F ++E       
Sbjct: 73  ADAYYDSLRDRLDLVLTFTEHDACDVSFPLTVLQDVLEAQTIASCSQIFAWIETQQ---A 129

Query: 111 QQTFFMSCKNN----LLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNIIS 165
           + T  M+ +      LLR  NDLLRRLS++ +T +FCGRIL FL+  FP  ERSG+N+  
Sbjct: 130 RLTTGMAPQKGKALVLLRTLNDLLRRLSKAGSTTLFCGRILTFLSGVFPLGERSGVNLRG 189

Query: 166 EFNV------ENITEFGGDEEM--DVS-SNETEETDTEEVD------KVKIDFNFYKKFW 210
           E+        E       DE    DVS     E  D  EVD      K K    FY  FW
Sbjct: 190 EYGPIWDGPRETEAAKAADESRGEDVSVEGAKEGGDKMEVDESQNGEKAKKSDGFYNIFW 249

Query: 211 SLQDYFRNPVQCYNKVSWKMFTSYAETVL-----------------------AAFKSYKL 247
           SLQ  F  P    +   +  F +  E VL                        A K  + 
Sbjct: 250 SLQLPFSRPPVFASTYKFTDFQAAVEKVLPVLREAAVKERALMGNRTNSGQPTALKRKRE 309

Query: 248 DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS------ 301
            ++   LN S  +FAK+LT+ +LL+L+++DT+FRR +L Q LIL  +     K+      
Sbjct: 310 PEMGEQLNNSDYFFAKFLTSPELLELEIADTHFRRQILFQLLILLHHLLQFTKAAKAHWA 369

Query: 302 --RGEGLE----LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWK 355
             R   L+    L+ D ++WV+DT     + I+ T P G  F+++V +IL+ E++W +WK
Sbjct: 370 TPRNRSLQMDFTLEPDDDQWVQDTMAKAVAEIRSTTPSGPMFAELVNVILEREKNWVKWK 429

Query: 356 NEGCP 360
           NE CP
Sbjct: 430 NEMCP 434


>gi|346972039|gb|EGY15491.1| THO complex subunit 1 [Verticillium dahliae VdLs.17]
          Length = 858

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 133/479 (27%), Positives = 220/479 (45%), Gaps = 103/479 (21%)

Query: 60  NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTF 114
           N   +   L     C+  L   ++ +  D  T+  C ++F ++E     +  + +KQ+  
Sbjct: 109 NLFDIVSILSDNGSCDPALLFWMVEELLDSQTIAGCRKIFDFLESRRERITSSHFKQKQL 168

Query: 115 FMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITE 174
                  +LR CN+LLRRLSR+++T FCGR+ +F+ + FP  +RS +N+  E++VEN+T 
Sbjct: 169 V------ILRTCNELLRRLSRAEDTAFCGRVFIFMFQSFPLGDRSSVNLRGEYHVENVTA 222

Query: 175 FGGDEEMDVSSNETE-ETDTEEVDK-----------------VKIDFNFYKKFWSLQDYF 216
           +      D S+  T+ + D  EVDK                 V  D   Y  FWSLQ+ F
Sbjct: 223 Y------DQSTAPTDPDADKMEVDKDAAAVPDTNKQSSQDGAVNAD-ALYPVFWSLQESF 275

Query: 217 RNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP------------------SG 258
             P + ++  + + F    E  + AFK  + +  Q S  P                   G
Sbjct: 276 SQPKKLFDMTNLEGFKQGLEATVKAFKVVQKE--QGSRPPKAADSGTGGTKRKRNHDDQG 333

Query: 259 D----YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS----------TVKSRGE 304
           D    +  KYLT+Q L +L++SD +FRR +L+Q  I+ ++  S          T+K++ +
Sbjct: 334 DQSNAFNPKYLTSQDLFELEISDLSFRRNILVQAYIILEFLLSLSAKAKEKLATIKTQNK 393

Query: 305 GL-----ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
            +      L  +  KW  D  E V   +KQ   DG H+ ++V+ +L  + +W +WK EGC
Sbjct: 394 SVMYMDQTLSEEDTKWASDMKEAVEDGLKQG-SDGPHYHRMVETVLVRDRNWIRWKMEGC 452

Query: 360 PELK-RPLTSITDEDKKDE---PDAKKKKTPELTKLW-------NSKDNLEACKSAERDF 408
           P ++  PL+  T    +DE       K+  P +  L        + +  L+  ++ +R  
Sbjct: 453 PPIELPPLSPETWSKAQDELYSITRNKRMKPPMGSLPLDFMQDEDDEKALDRLRAPDRYE 512

Query: 409 TPSLESY--------FEEAIQQMD--PAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
            P LES+        FE  +   D   AAA+E +  K      +WRALR+ +R     F
Sbjct: 513 LPELESFKRKIADDDFEIEMPTNDETKAAAIESKASK------SWRALRIAARSRLGLF 565


>gi|342873471|gb|EGU75639.1| hypothetical protein FOXB_13850 [Fusarium oxysporum Fo5176]
          Length = 822

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 113/388 (29%), Positives = 193/388 (49%), Gaps = 70/388 (18%)

Query: 36  LEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMST 91
           +E + R    +LI T  ID    +KV  N + +   L     C+  L   L  +  D  T
Sbjct: 81  IETAARNIFSQLIATTTIDSPDYVKVW-NLLDILSILSDDGQCDPALLFWLAEELLDSQT 139

Query: 92  LDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRIL 146
           +  C ++F ++E     +  N +KQ+         +LR CN+LLRRLSR+++T FCGR+ 
Sbjct: 140 IAGCRKIFDFLESRRERITANHFKQKQLI------ILRTCNELLRRLSRAEDTAFCGRVF 193

Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEF---GGDEEMDVSSNETEE------------- 190
           +F+ + FP  ++S +N+  E++VEN+T +     ++E  +  +E EE             
Sbjct: 194 IFMFQSFPLGDKSSVNLRGEYHVENVTTYEATRAEDESKMVVDEPEEQLKEQAESKSTPK 253

Query: 191 -TDTEEVDKVK--IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
             D ++ DK K       Y  FWSLQ+YF  P + +   +    TS+ E + A  K ++ 
Sbjct: 254 SADAKKADKEKPLSTDELYPLFWSLQEYFSQPKKLFETTN---LTSFKEGLAATMKVFQT 310

Query: 248 --DDVQSSLN----------PSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYF 295
             +D + SL            S  +  KYLT++ L DL++SD +FRR++L+Q LI+  + 
Sbjct: 311 VHNDSRRSLKRKRESGEEDESSNTFNPKYLTSKDLFDLEISDLSFRRHILVQALIIMDFL 370

Query: 296 TS-TVKSRGEGLE------------LKSDQ------EKWVKDTTETVYSLIKQTPPDGEH 336
            S ++++R    E            + SDQ       +W  D  +T+   ++Q   DG +
Sbjct: 371 ISLSIQARERLTEALPVNASVNKAVMYSDQVLSDEDAEWASDMKKTIADYLRQG-ADGPY 429

Query: 337 FSQVVKLILKGEEHWNQWKNEGCPELKR 364
           F ++V+ +L  +++W +WK EGC  +KR
Sbjct: 430 FYRMVETVLARDKNWVRWKIEGCHPIKR 457


>gi|406864828|gb|EKD17871.1| guanylate kinase [Marssonina brunnea f. sp. 'multigermtubi' MB_m1]
          Length = 862

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 137/508 (26%), Positives = 232/508 (45%), Gaps = 96/508 (18%)

Query: 32  KKSALEQSIRQYLLKLIK-TPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTF 87
           KK A+E + R+    L+  T  ID      I N + +   L   D     + + L+ + F
Sbjct: 59  KKHAIETAARESFNNLLACTTTIDSPSFVNIWNLLDIMSILSDDDKTEPAVMVWLVEELF 118

Query: 88  DMSTLDKCEQLFYYVEVNVN--IWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           D  T+  C ++F Y+E      I K   +F S K  +LR CN+LLRRLSR+++T FCGR+
Sbjct: 119 DSQTVAGCRKIFDYLESRRERIIAK---YFDSKKLVILRSCNELLRRLSRAEDTAFCGRV 175

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDEEMDV--SSNETEETDTEE 195
            +FL + FP  ++S +N+  E++ EN+T F           E+MDV  SS       T++
Sbjct: 176 FIFLFQSFPLGDKSSVNLRGEYHTENVTTFDVPPPKEVADPEKMDVDPSSESPSGGGTQK 235

Query: 196 VDK-----------------VKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETV 238
             +                 V    + Y  FWSLQ  F  P   ++  ++  F +  E  
Sbjct: 236 ASQNSSSAPQKGVSFSRSENVMAAEDLYVAFWSLQQSFSQPKSLFDNTNFSSFKAGLEAT 295

Query: 239 LAAFKSYKLD--DVQSSLNPSGD-----------------YFAKYLTNQKLLDLQLSDTN 279
           +A FKS + D    Q+  N  G                  Y  KYLT++ L +L++SD +
Sbjct: 296 MAMFKSVQSDVSGRQTKSNEEGKRGTKRKHHQGDDDLASTYNPKYLTSRDLFELEISDLS 355

Query: 280 FRRYVLLQFLILFQYFTSTVKSRGEGLE-----------------LKSDQEKWVKDTTET 322
           FRR++++Q LI+ ++  S      E L                  L  +  KW  +  ++
Sbjct: 356 FRRHIIVQVLIVIEFLLSLSAKAKEKLSKALPDGTNKSVTYVDQVLSEEDTKWATEMKKS 415

Query: 323 VYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDED--------K 374
           +   +KQ   +G +F ++V+ +L  +++W +WK E CP + +P  SIT ++        +
Sbjct: 416 MAEYLKQG-YEGAYFHRMVETVLSRDKNWARWKIENCPSIAKP--SITPQEYASAKVSAR 472

Query: 375 KDEPDAKKKKTPELTKL-------WNSKDNLEACKSAERDFTPSLESYFEEAIQ--QMDP 425
           K     + +  P +  L        +++  L+  K   R   PS++S F+ +I+   MD 
Sbjct: 473 KATTTKRPRPNPPIGSLDLQFLSEGDNRSGLDRLKDPTRYQLPSVKS-FKSSIELDDMDI 531

Query: 426 AAAVEEQYKKVN---DSNYAWRALRLLS 450
             A +E+ K       ++ +WRA R+ S
Sbjct: 532 EMAQDEESKNAAMELKASKSWRAQRIAS 559


>gi|302904097|ref|XP_003049002.1| hypothetical protein NECHADRAFT_46674 [Nectria haematococca mpVI
           77-13-4]
 gi|256729936|gb|EEU43289.1| hypothetical protein NECHADRAFT_46674 [Nectria haematococca mpVI
           77-13-4]
          Length = 800

 Score =  158 bits (400), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 121/413 (29%), Positives = 194/413 (46%), Gaps = 67/413 (16%)

Query: 8   NFDKSQLNVVSDKYALYKPTCDYDKKSA-------LEQSIRQYLLKLIKTPDID----IK 56
           +FD     V S    L +   D D K A       +E + R     LI T  ID    +K
Sbjct: 45  DFDDLSGRVSSTAAQLPQLEGDKDAKEAKSRQYGIIETAARDIFSNLISTTTIDSPDFVK 104

Query: 57  VIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQ 111
           V  N + +   L     C+  L   L  +  D  T+  C ++F ++E     +    +KQ
Sbjct: 105 VW-NLLDILSILSDDGQCDPALLFWLAEELLDSQTISGCRKIFDFLESRRERITAKHFKQ 163

Query: 112 QTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVEN 171
           +         +LR CN+LLRRLSR+++T FCGR+ +F+ + FP  ++S +N+  E++VEN
Sbjct: 164 KQLI------ILRSCNELLRRLSRAEDTAFCGRVFIFMFQSFPLGDKSSVNLRGEYHVEN 217

Query: 172 ITEF-----GGDEEMDV------SSNETEETDTEEVDKVK--IDFNFYKKFWSLQDYFRN 218
           IT +       D  MDV      S + +   D ++ DK K       Y  FWSLQ+ F  
Sbjct: 218 ITTYETTIAENDSNMDVDTPTGGSKSTSRSNDAKKGDKEKPLSTDALYPLFWSLQESFSQ 277

Query: 219 PVQCYNKVSWKMFTSYAETVLAAFKSYKL--DDVQSSLNPSGD----------YFAKYLT 266
           P + +   +     S+ E++ A  K ++   +D + SL    D          +  KYLT
Sbjct: 278 PKRLFETAN---LASFKESLGATMKFFQTIHNDSRRSLKRKRDSGEEDENSNAFNPKYLT 334

Query: 267 NQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLE---------------LKSD 311
           ++ L DL++SD +FRR+VL+Q LI+  +  S      E L                L  +
Sbjct: 335 SKDLFDLEISDLSFRRHVLVQALIIMDFLLSLSSPAREKLSSSLATNKAVMYGDQTLSEE 394

Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
             KW  D  +T+   ++Q   DG +F ++V+ +L  +++W +WK EGC  +KR
Sbjct: 395 DTKWANDMKKTIADYLRQG-ADGPYFYRMVETVLARDKNWVRWKIEGCHAIKR 446


>gi|392570571|gb|EIW63743.1| UDP-glucose 4-epimerase [Trametes versicolor FP-101664 SS1]
          Length = 1095

 Score =  158 bits (400), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 112/359 (31%), Positives = 167/359 (46%), Gaps = 55/359 (15%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           + + + L +     D C+   P   L D  +  T+  C  +F ++E   +   + T  M 
Sbjct: 79  LSDRLDLILTFTEHDACDQIFPFNTLQDLLETQTIASCSHIFSWIEQRAD---RLTAGMV 135

Query: 118 CKNN----LLRMCNDLLRRLSR-SQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV--E 170
            +      LLR  NDLLRRLS+    T FCGRIL FL++ FP  ERSG+N+  E+    E
Sbjct: 136 PQKGKALVLLRTLNDLLRRLSKMGATTQFCGRILTFLSRVFPLGERSGVNLRGEYGPMWE 195

Query: 171 NITEFGG---DEE---MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYN 224
                G    DEE   M+V   + E+   E   K K   +F+  FWSLQ  F  P     
Sbjct: 196 GPGAKGHERKDEEANQMEVDETKEEQASKEAERKRKEKDDFFYTFWSLQLPFSRPHLFSE 255

Query: 225 KVSWKMFTSYAETVLAAFKSYKLDD----VQSSLNPSGD--------------------- 259
             ++  F +  + VL   K     +       SL+ +                       
Sbjct: 256 TRTFSAFKNAVDKVLPVIKEATAKERALMGNKSLHATAGASSQLKRKRAPAPAAESPATS 315

Query: 260 --YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLEL- 308
             YFAKYLT+ +LLDL++SDT+FRR +L Q LI+  +   FT   K+     R   L+L 
Sbjct: 316 EYYFAKYLTSPELLDLEISDTHFRRQLLFQLLIVLNHLLNFTKAAKATWYTPRNRSLQLD 375

Query: 309 ---KSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
              +    +WV+DT       ++QT P G  F++ V+++L+ E++W +WKNE CP   R
Sbjct: 376 FTLEPTDAQWVQDTIARAVEELRQTSPHGRAFAEAVQVMLEREKNWVRWKNELCPPFDR 434


>gi|340905435|gb|EGS17803.1| hypothetical protein CTHT_0071520 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 781

 Score =  158 bits (400), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 122/491 (24%), Positives = 225/491 (45%), Gaps = 84/491 (17%)

Query: 34  SALEQSIRQYLLKLIKTPDIDIK---VIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           + +E ++R     LI T  I+      + N + +   L   ++C+  L   ++ +  D  
Sbjct: 74  AIIETAVRDTFKSLIATTPIEAPEFVRVWNLLDILSFLSDSELCDPALLFWIVEELLDSQ 133

Query: 91  TLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+  C ++F ++E     +     KQ+         +LR CN+LLRRLSR+ +  FCGR+
Sbjct: 134 TIAGCRKVFDFLESRRERITAKHLKQKQLV------ILRTCNELLRRLSRALDPAFCGRV 187

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGD------EEMDVSSNETEETDTEEVDKV 199
            +F+ + FP  ++S +N+  E+++EN+T +  D      ++MDV S    + D + +D  
Sbjct: 188 FIFMFQTFPLGDKSSVNLRGEYHLENVTTWDRDIPAQDGDKMDVDSEAKPQLDVKALDPD 247

Query: 200 KIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYK------------- 246
            +    Y  FW+LQ+ F  P + +   +   F +  E  +A FKS K             
Sbjct: 248 AL----YPIFWALQESFNQPKRLFEPSNLAAFKAGLEATMATFKSIKPEQPSRARERPEK 303

Query: 247 -LDDVQSSLNPSGD---------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT 296
             ++++ SL    D         +  KYLT++ L  L++SD  FRR +L+Q LI+ ++  
Sbjct: 304 PPEEIKHSLKKKRDDLDEELASGFNPKYLTSRDLFKLEISDLAFRRNILVQALIIMEFLL 363

Query: 297 STVKSRGEGL---------------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVV 341
           S      E L               +L  +  KWV D  E +   +KQ  P+G +F+++V
Sbjct: 364 SLSPKAKEKLTSVKQPNKSVMYLDYQLSEEDTKWVLDMKERIVDYLKQG-PEGPYFNRMV 422

Query: 342 KLILKGEEHWNQWKNEGCPELKRPLTS--ITDEDKKDEPDAKKKKTPELTKLW------- 392
           + +L  +++W +WK E CP ++ P  S  I DE +         K P    L        
Sbjct: 423 ETVLSRDKNWVRWKIENCPPIELPPLSPVIFDEARSTISKLATTKRPRSAPLGSLSLDFL 482

Query: 393 --NSKDNLEACKSAERDFTP---------SLESYFEEAIQQMDPAAAVEEQYKKV-NDSN 440
             + +D+    ++ ++   P         +L+   EE   ++D   + E + + +   ++
Sbjct: 483 NDDEEDDDSGARAMQKLKDPTRYRLPELSTLKRGIEEDDLEIDMPMSEESKARAIEGKAS 542

Query: 441 YAWRALRLLSR 451
             WRALR+ ++
Sbjct: 543 KTWRALRIAAK 553


>gi|327298209|ref|XP_003233798.1| nuclear matrix protein [Trichophyton rubrum CBS 118892]
 gi|326463976|gb|EGD89429.1| nuclear matrix protein [Trichophyton rubrum CBS 118892]
          Length = 634

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 136/536 (25%), Positives = 235/536 (43%), Gaps = 114/536 (21%)

Query: 16  VVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKD 72
           V++  Y +        + +A+E + R+    L+ T  ID      I N + +       +
Sbjct: 48  VLAGDYGVSVGVNQQTQYAAVETAFREIFYDLLATTSIDEPSFMQIWNLLDIVSIFSDNE 107

Query: 73  MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCN 127
            C   L   L+ +  D  T+D C ++F Y+E          +KQ++        +LR CN
Sbjct: 108 KCEPGLIFWLIEELLDSQTIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCN 161

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNE 187
           +LLRRLSR+++TVFCGR+ +FL + FP  ++S +N+  E++ EN+T F   ++M   S+E
Sbjct: 162 ELLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSSVNLRGEYHTENVTTFDVQQKMQAESDE 221

Query: 188 TEETDTEEVD--------------------------------------KVKIDFNFYKKF 209
             + D +E +                                      ++++D   Y  F
Sbjct: 222 VMDIDIQENETPVSNAKSTEQEPTTAQTPLESTNDATTQRAKASSIQPELELD-TLYAAF 280

Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK--SYKLD------------------- 248
           WSLQ+ F  P + ++   +    S  E  L++F+  S KL+                   
Sbjct: 281 WSLQESFSKPTRLFDTTHFASLKSGLEATLSSFRKVSSKLEVRSTTKSSDELRRGTKRKR 340

Query: 249 -----DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF----------- 292
                D+ +S NP      KYLT++ L +L+++D +FRR++L+Q LIL            
Sbjct: 341 GEGGSDLPNSFNP------KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPRSK 394

Query: 293 -QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
            +   ST KS   G  L  +  KW      ++ S + Q   DG+ + ++V  +L  +++W
Sbjct: 395 AKLADSTNKSVLYGYVLSDEDAKWATQMRSSIASYL-QKGADGKFYFRMVDTVLTRDKNW 453

Query: 352 NQWKNEGCPELKRPLTSITDE------DKKDEPDAKKKKTP------ELTKLWNSKDNLE 399
            +WK EGCP ++RP   +           K   + + + TP          + ++  +L 
Sbjct: 454 VRWKAEGCPPIERPPVPVQSHLDTQSSAAKITANRRLRSTPMGSLDLGFLSVESNLGDLN 513

Query: 400 ACKSAERDFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
             K  ER  TPSL+ Y          +D A   EE+ + +   ++  WR LR+ SR
Sbjct: 514 RLKEPERFSTPSLDFYMSGITDDDFNIDMAKDDEEKEQSIKAKASKTWRTLRIASR 569


>gi|320036048|gb|EFW17988.1| nuclear matrix protein [Coccidioides posadasii str. Silveira]
          Length = 608

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 103/403 (25%), Positives = 185/403 (45%), Gaps = 78/403 (19%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+    L+ T  I+      + N + +       + C   L   L+ +  D  
Sbjct: 66  AAVETAFREKFYHLLATTSINKPEFVHMWNLLDVVSIFSDNERCEPGLIFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF------------------------------ 175
            +FL + FP  ++S +N+  E++ EN+T F                              
Sbjct: 180 FIFLFQSFPLGDKSSVNLRGEYHTENVTTFDEPPKHIAANEDADDVEMKDAEAQKPSDNS 239

Query: 176 ----------GGDEEMDVSSNETEETDTEEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYN 224
                     GG    ++    ++++   + +   ID +  Y  FW LQ  F  P + ++
Sbjct: 240 SKTDAAASSQGGSVSPEIDGQRSKQSTDHQAEDTSIDMDALYPVFWGLQANFSAPTRLFD 299

Query: 225 KVSWKMFTSYAETVLAAFKSYKLDDV----QSSLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
              +  F    E+ +++F+  K  +      ++ NP      KYLT++ L +L+++D  F
Sbjct: 300 HEHFASFKKGLESTISSFQKRKRGETGPETANTFNP------KYLTSRDLFELEVNDVAF 353

Query: 281 RRYVLLQFLILFQYFTS------------TVKSRGEGLELKSDQEKWVKDTTETVYSLIK 328
           RR++L+Q LIL  +  S            T KS   G  L  +  KW      ++ S ++
Sbjct: 354 RRHILVQSLILLDFLISLSPKAKAKLADATNKSVLYGYVLSDEDAKWANQMKSSIASYLQ 413

Query: 329 QTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD 371
           Q   DG+ + ++V  +L  +++W +WK EGCP +++P   I D
Sbjct: 414 QGL-DGKFYYRMVDTVLTRDKNWARWKAEGCPPIEKPPIQIQD 455


>gi|302409216|ref|XP_003002442.1| guanylate kinase [Verticillium albo-atrum VaMs.102]
 gi|261358475|gb|EEY20903.1| guanylate kinase [Verticillium albo-atrum VaMs.102]
          Length = 854

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 126/474 (26%), Positives = 216/474 (45%), Gaps = 93/474 (19%)

Query: 60  NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTF 114
           N   +   L     C+  L   ++ +  D  T+  C ++F ++E     +  + +KQ+  
Sbjct: 109 NLFDIVSILSDNGSCDPALLFWMVEELLDSQTIAGCRKIFDFLESRRERITSSHFKQKQL 168

Query: 115 FMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITE 174
                  +LR CN+LLRRLSR+++T FCGR+ +F+ + FP  +RS +N+  E++VEN+T 
Sbjct: 169 V------ILRTCNELLRRLSRAEDTAFCGRVFIFMFQSFPLGDRSSVNLRGEYHVENVTA 222

Query: 175 FGGDEEMDVSSNETEETDTEEVDKVKIDFN-------------FYKKFWSLQDYFRNPVQ 221
           +  D+       + ++ + E+      D N              Y  FWSLQ+ F  P +
Sbjct: 223 Y--DQSAAPMDPDADKMEVEKDAAAGPDANKQSSQDGAVNADALYPVFWSLQESFSQPKK 280

Query: 222 CYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP------------------SGD---- 259
            ++  + + F    E  + AFK  + +  Q S  P                   GD    
Sbjct: 281 LFDMTNLEGFKQGLEATVKAFKVVQKE--QGSRPPKAADSAVGGTKRKRNHDDQGDQSNA 338

Query: 260 YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS----------TVKSRGEGL--- 306
           +  KYLT+Q L +L++SD +FRR +L+Q  I+ ++  S          T+K++ + +   
Sbjct: 339 FNPKYLTSQDLFELEISDLSFRRNILVQAYIILEFLLSLSAKAKEKLATIKTQNKSVMYM 398

Query: 307 --ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELK- 363
              L  +  KW  D  E V   +KQ   DG H+ ++V+ +L  + +W +WK EGCP ++ 
Sbjct: 399 DQTLSEEDAKWASDMKEAVEDGLKQG-SDGPHYHRMVETVLVRDRNWIRWKMEGCPPIEL 457

Query: 364 RPLTSITDEDKKDE---PDAKKKKTPELTKLW-------NSKDNLEACKSAERDFTPSLE 413
            PL+  T    +DE       K+  P +  L        + +  L+  ++ +R   P L 
Sbjct: 458 PPLSPETWSKAQDELYSITRNKRMKPPMGSLPLDFMQDEDDEKALDKLRAPDRYELPELG 517

Query: 414 SY--------FEEAIQQMD--PAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
           S+        FE  +   D   AAA+E +  K      +WRALR+ +R     F
Sbjct: 518 SFRRKIADDDFEIEMPTNDETKAAAIESKASK------SWRALRIAARSRLGLF 565


>gi|322693773|gb|EFY85622.1| nuclear matrix protein [Metarhizium acridum CQMa 102]
          Length = 864

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 135/488 (27%), Positives = 224/488 (45%), Gaps = 100/488 (20%)

Query: 48  IKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE---- 103
           I++PD  +K+  N + +   L     C+  L   L+ +  D  T+  C  +F Y+E    
Sbjct: 103 IESPDF-VKMW-NLLDILSILSDDGQCDPALLFWLVEELLDSQTISGCRIVFDYLESRRE 160

Query: 104 -VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLN 162
            +    +KQ+         +LR CN+LLRRLSR+++T FCGR+ +FL + FP  +RS +N
Sbjct: 161 RITSKHFKQKHLV------ILRSCNELLRRLSRAEDTAFCGRVYIFLFQSFPLGDRSSVN 214

Query: 163 IISEFNVENITEFGGDEEMDVSSNETE------------ETDTEEVDKVKIDFN------ 204
           +  E++VEN+T F G  E D    E++             TDT+  +K  + F+      
Sbjct: 215 LRGEYHVENVTAFEGSAEEDEQQRESKMEVDTPAETPKSSTDTKASNK-GVSFDSKNSEK 273

Query: 205 ------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQS-SL-NP 256
                  Y  FWSLQ+ F  P++ ++   +  F S  E  + AF++   ++V S SL NP
Sbjct: 274 PYDPDRLYPLFWSLQESFSQPLKLFDSTHFNKFKSRLEVTIKAFQAIHQNEVPSKSLENP 333

Query: 257 SGD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTST-- 298
             +                +  KYLT++ L +L++SD +FRR++L+Q LI+  +  S   
Sbjct: 334 RRNLKRKRDDEGSEDAPEAFNPKYLTSKDLFELEISDLSFRRHILVQALIITDFLLSLSM 393

Query: 299 -VKSRGEGL-------------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLI 344
             + +  GL             +L  +  KW  +   T+   +K+   DG +F ++V+ +
Sbjct: 394 ETREKLAGLNTPPPNKSVVYNDQLSEENTKWATEMKLTISDYLKRG-IDGPYFYRMVETV 452

Query: 345 LKGEEHWNQWKNEGCPELKRPLTS------ITDEDKKDEPDAKKKKTPELT-KLWNSKDN 397
           L  +++W  WK   CP ++R   S           +K     + + TP  +  L   +DN
Sbjct: 453 LARDKNWVFWKMASCPPIQRDPVSPELFVEARTTAQKLATSKRLRPTPMGSVSLEFLRDN 512

Query: 398 LEAC----KSAERDFTPSLESY----------FEEAIQQMDPAAAVEEQYKKVNDSNYAW 443
            E C    K+ ER   P LES+           E        AAA+  +  K      +W
Sbjct: 513 DECCLDQLKTPERYQIPELESFQRKIADDDFEIEMPTNDQTKAAAIAGKASK------SW 566

Query: 444 RALRLLSR 451
           RALR+ +R
Sbjct: 567 RALRIAAR 574


>gi|358383603|gb|EHK21267.1| hypothetical protein TRIVIDRAFT_113560, partial [Trichoderma virens
           Gv29-8]
          Length = 744

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 133/501 (26%), Positives = 222/501 (44%), Gaps = 101/501 (20%)

Query: 36  LEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMST 91
           +E + R +  KLI T  ID    +K+  N + +   L     C+  L   L+ +  D  T
Sbjct: 82  VETAARDFFSKLIATTSIDSPEFVKMW-NLLDILSILSDDGQCDPALLFWLVEELLDSQT 140

Query: 92  LDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRIL 146
           +  C ++F ++E     +    +KQ+         +LR CN+LLRRLSR+++T FCGR+ 
Sbjct: 141 ISGCRKIFDFLESRRERITAKHFKQKNLV------ILRSCNELLRRLSRAEDTAFCGRVF 194

Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDT------------- 193
           +F+ + FP  +RS +N+  E++VEN+T F   EE D         DT             
Sbjct: 195 IFMFQSFPLGDRSSVNLRGEYHVENVTTFETTEEDDSKMEVDTHADTPKEQGDAKTPAKN 254

Query: 194 --EEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQ 251
             ++  K       Y  FWSLQ+ F  P++ ++   +  F +  E+ + AF++   D + 
Sbjct: 255 VAQDSSKALDPDTLYPMFWSLQESFSQPLRLFDVGHFAKFKTSLESTITAFQAIHEDSIY 314

Query: 252 SS--------------------LNPSGDYFAKYLTNQKLLDLQ----LSDTNFRRYVLLQ 287
           SS                    + P   +  KYLT++ L  L+    +SD +FRR+VL+Q
Sbjct: 315 SSKSMENLKRTLKRKRDDDGTEMLPEA-FNPKYLTSKDLFQLEASITISDLSFRRHVLVQ 373

Query: 288 FLILFQYFTSTVKSRGEGL--------------ELKSDQEKWVKDTTETVYSLIKQTPPD 333
            LI+  +  S  K   E                +L  +  KW  D  + +   +KQ   D
Sbjct: 374 ALIITDFLLSLSKEEREKRTNINVSNKSVIYPDQLNEENTKWATDMKKVISDYLKQG-SD 432

Query: 334 GEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTSIT-----------DEDKKDEPDAK 381
           G +F ++V+ +L  +++W  WK   CP ++R P+++ +             +K+  P+  
Sbjct: 433 GPYFFRMVETVLARDKNWVFWKMASCPPIQRDPVSAASFVESKEAAQKMATNKRLRPNPL 492

Query: 382 KKKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMD-----------PAAAVE 430
              + E     +    +E  KS+ER   P L+  F+  I   D            AAAV 
Sbjct: 493 NAVSLEFLANEDEGSAMEKLKSSERYQLPELDE-FKNKIADDDFEIDMPTNNETKAAAVA 551

Query: 431 EQYKKVNDSNYAWRALRLLSR 451
            +  K      +WRALR+ SR
Sbjct: 552 GKASK------SWRALRIASR 566


>gi|325187940|emb|CCA22484.1| THO complex subunit putative [Albugo laibachii Nc14]
          Length = 586

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 131/468 (27%), Positives = 204/468 (43%), Gaps = 80/468 (17%)

Query: 54  DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
           D   I   + LC++  + ++     P  +L D  +   +  CE L+  +EV         
Sbjct: 48  DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 107

Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
           FF +          LLRMCN +LRRLS++ NTVFCGRIL+FL+  F  +ERS +N+ S+ 
Sbjct: 108 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAVNLHSK- 166

Query: 168 NVENITEFGGD-----EEMDVSSNETEETDT-------EEVDKVKIDFNFYKKFWSLQDY 215
            VEN+T F         E D  +N  E  D        ++    +ID+  Y  FW +Q +
Sbjct: 167 -VENLTIFEDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQRF 225

Query: 216 FRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNPS 257
           FR+  +   + +  K F      VL AF++    D                 V S+   S
Sbjct: 226 FRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKMS 285

Query: 258 GD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
            D                +  KYLTN +L  LQL D   R  VL Q LIL  +    V++
Sbjct: 286 IDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHL---VRA 342

Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
           +   +   S Q+  + +  + V  L+K+TPPDG  +S++V  +L+ E +W +WK E CP 
Sbjct: 343 KS-TVSQDSTQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCPS 401

Query: 362 L--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----LE 399
                          KR   + T+   + +   +KK   E   + ++  S  +     LE
Sbjct: 402 YEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLLE 461

Query: 400 ACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
             K   R    +L    +   +  DPA+ ++++Y    D  Y WR LR
Sbjct: 462 KIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 509


>gi|398405762|ref|XP_003854347.1| hypothetical protein MYCGRDRAFT_70063 [Zymoseptoria tritici IPO323]
 gi|339474230|gb|EGP89323.1| hypothetical protein MYCGRDRAFT_70063 [Zymoseptoria tritici IPO323]
          Length = 646

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 138/508 (27%), Positives = 221/508 (43%), Gaps = 98/508 (19%)

Query: 32  KKSALEQSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFD 88
           K SA+E ++R+    LI +  I       + N + + +    +  C++ L  +LL +  D
Sbjct: 64  KFSAVETAVREVFYSLITSTSISDPAFVGVWNLLDIVLICGDRGRCDAALICLLLEELLD 123

Query: 89  MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL--LRMCNDLLRRLSRSQNTVFCGRIL 146
             T D C  +F Y++   +   Q+ F    K NL  LR CN+LLRRLSR+++ +FCGR+ 
Sbjct: 124 SQTTDGCRIVFDYLDSRRDRLAQKDFH---KKNLIFLRSCNELLRRLSRAEDAIFCGRVF 180

Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEF---------GGDEEMDVSSNETEETDTEEVD 197
            FL + FP  ++S +N+  EF+VEN T+F         G  E M++ S   EE   E   
Sbjct: 181 FFLFQTFPLGDKSSVNLRGEFHVENTTKFDDSEVVPAEGNGEHMEIDSQPKEEPIKEHTP 240

Query: 198 KVKIDFN-------------------------FYKKFWSLQDYFRNPVQCYNKVSWKMFT 232
              +                             Y  FW LQ  F +P + ++   +  F 
Sbjct: 241 APAVPTKPGSKAVPIKAPHKKIPEEEALPTSELYPIFWRLQHDFSDPTRLFDTAKFTAFK 300

Query: 233 SYAETVLAAFK------------------------SYKLDDVQSSLNPSGDYFAKYLTNQ 268
               + +  FK                        +  +DD  SS +   +Y  KYLT++
Sbjct: 301 KGLASTIVKFKKTPTVVQTKAGEEEKRGTKRKIGENGTVDD--SSDHLMDNYNPKYLTSK 358

Query: 269 KLLDLQLSDTNFRRYVLLQFLILFQY--------------FTSTVKSRGEGLELKSDQEK 314
            L DL+LSD  F+R++L+Q LIL  +               T+T KS      L     +
Sbjct: 359 DLFDLELSDLAFQRHILVQALILIDFLLSLTDKAKKRLASLTTTNKSMLYSFALNEGDTQ 418

Query: 315 WVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE-- 372
           W   T  T+ S I  T  +G  + ++V+ +L  +++W +WK E CP + R   S   E  
Sbjct: 419 WATTTRSTITSYISST-AEGRQYCRMVETVLARDKNWVRWKVESCPSIVRGPVSTEQEMA 477

Query: 373 DKKDEPDAKK-KKTPE-------LTKL-WNSKDNLEACKSAERDFTPSLESYFEEA-IQQ 422
            +K    A + +K PE       L+ L   S   LEA K   R   P++E   ++    +
Sbjct: 478 ARKGAAQATRPRKIPEKPMGAMNLSFLDEGSGGGLEALKDPARYTAPTIEQLVDKVKTDK 537

Query: 423 MDPAAAVEEQYKKVNDS---NYAWRALR 447
           +D   A +++ K   ++   N  WRALR
Sbjct: 538 LDADFAADDEEKASFENAIQNAKWRALR 565


>gi|340522223|gb|EGR52456.1| predicted protein [Trichoderma reesei QM6a]
          Length = 807

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 137/497 (27%), Positives = 223/497 (44%), Gaps = 94/497 (18%)

Query: 36  LEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMST 91
           +E + R +  KLI T  ID    +K+  N + +   L     C+  L   L+ +  D  T
Sbjct: 80  VETAARDFFSKLIATTSIDSPEFVKMW-NLLDILSILSDDGQCDPALLFWLVEELLDSQT 138

Query: 92  LDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRIL 146
           +  C  +F ++E     +    +KQ+         +LR CN+LLRRLSR+++T FCGR+ 
Sbjct: 139 VAGCRIIFDFLESRRERITAKHFKQKNLV------ILRSCNELLRRLSRAEDTAFCGRVF 192

Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEF---GGDEEMDVSSNETEETDT---------- 193
           +F+ + FP  +RS +N+  E++VENIT +     D +M++ +    +TDT          
Sbjct: 193 IFMFQSFPLGDRSSVNLRGEYHVENITNYERLDDDSKMEIDT----QTDTPKEQGDAKAA 248

Query: 194 -----EEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD 248
                ++  K       Y  FWSLQ+ F  P++ ++   +  F S  E  + AF++   D
Sbjct: 249 TKNGAQDTSKALDPDTLYPMFWSLQESFSQPLKLFDTGHFTKFKSSLEATITAFQAIHDD 308

Query: 249 DVQSS---------LNPSGD----------YFAKYLTNQKLLDLQLS---DTNFRRYVLL 286
            + SS         L    D          +  KYLT++ L  L+ S   D +FRR+VL+
Sbjct: 309 SISSSKSMENLKRTLKRKRDDDGTETLPEAFNPKYLTSKDLFQLEASIAIDLSFRRHVLV 368

Query: 287 QFLILFQYFTSTVKSRGEGL--------------ELKSDQEKWVKDTTETVYSLIKQTPP 332
           Q LI+  +  S  K   E                +L  +  KW  D  + +   +KQ   
Sbjct: 369 QALIITDFLLSLSKEEREKRTNINISNKSVIYPDQLSEENTKWATDMKKVISDYLKQG-S 427

Query: 333 DGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTS---ITDEDKKDEPDAKKKKTPE- 387
           DG +F ++V+ +L  +++W  WK   CP +KR P+++   +  ++   +    K+  P  
Sbjct: 428 DGPYFFRMVETVLARDKNWVFWKMASCPPIKRDPVSAESFVEAKEAAQKMATSKRLRPNP 487

Query: 388 -----LTKLWNSKDNLEA---CKSAERDFTPSLESYFEEAIQQMD-----PAAAVEEQYK 434
                L  L N  D  EA    K  ER   P LE  F+  I   D     P  A  +   
Sbjct: 488 LNAVPLDFLTNGDDEGEAMEQLKDPERHQLPELE-VFKSKIADDDFEIDMPTNAQTKATA 546

Query: 435 KVNDSNYAWRALRLLSR 451
               ++ +WRALR+ SR
Sbjct: 547 VAAKASKSWRALRIASR 563


>gi|358370613|dbj|GAA87224.1| THO complex subunit Tho1 [Aspergillus kawachii IFO 4308]
          Length = 671

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 153/615 (24%), Positives = 256/615 (41%), Gaps = 140/615 (22%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+    L+ +  I+      I N + +       + C   L   L+ +  D  
Sbjct: 66  AAVEIAFREKFYSLLASTSIEDPSFIRIWNLLDIISIFSDNEQCEPGLIFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENIT---EFGGDEEMDVSSNETEETDTEEV------ 196
            +FL + FP  ++S +N+  E++ +N+T   E       DV+  + E TD  E       
Sbjct: 180 FIFLFQSFPLGDKSAVNLRGEYHTDNVTTYDELTKAAAKDVADPDVEMTDEPEAPTTTEG 239

Query: 197 --------------------------------------DK-VKIDFNFYKKFWSLQDYFR 217
                                                 DK V +D + Y  FW LQ YF 
Sbjct: 240 LKEDSEAQTTSASESHTPAQEPPKTPRVVVSSSKDESQDKGVDLD-DLYPIFWGLQAYFS 298

Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DVQ 251
            P + ++   +  F +  E  L+AF++   D                          ++ 
Sbjct: 299 APTKTFDPQHFAKFKTGLEATLSAFRNVDTDLENSSNSKMSEEIRKSTKRKRTTDGQEIA 358

Query: 252 SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE------ 304
           SS NP      KYLT++ L DL+++DT FRR+VL+Q LIL  +  S T K++ +      
Sbjct: 359 SSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFMLSLTPKAKAKLADLTN 412

Query: 305 -----GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
                G  L  D  KW     + +   + Q    G+ + ++V  +L  +++W +WK EGC
Sbjct: 413 KSVLYGFVLNDDDAKWAVKMRKAIEEYL-QEGIGGKFYYRMVDTVLSRDKNWVRWKAEGC 471

Query: 360 PELKRPLTSITDEDKKDEPDA-----KKKKTPELTKL-------WNSKDNLEACKSAERD 407
           P ++RP  S+++     E        K+ +T  +  L         S   LE  K  ER 
Sbjct: 472 PLIERPPVSVSEYLGAREHATKVYANKRLRTSPMGSLDLKFLSEGESLAGLERLKDPERF 531

Query: 408 FTPSLESYFEEAIQ---QMDPAAAVEE-QYKKVNDSNYAWRALRLLSR-KCPHF------ 456
             P+ +S     +     +D A   E+ +Y   + S+  WR LRL ++ K   F      
Sbjct: 532 SVPAADSLMMGIMDDELDIDTAQTKEDREYAMRSKSSKTWRILRLSAKSKLAEFEKIEDG 591

Query: 457 ----FLNATP-NVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVEVDTKSEEIQ 511
                L  TP   +   + +E   +   K    S+ +G+G    Q+P     + ++  I 
Sbjct: 592 KNLKILFETPQQSDGTPQALEGTPQASEKPPAGSESTGHG----QEPGASSTENENSAIA 647

Query: 512 EEEKEEDWEAKADPE 526
            E+   +    A PE
Sbjct: 648 TEQTPANSVEAAAPE 662


>gi|242794188|ref|XP_002482321.1| THO complex subunit Tho1, putative [Talaromyces stipitatus ATCC
           10500]
 gi|218718909|gb|EED18329.1| THO complex subunit Tho1, putative [Talaromyces stipitatus ATCC
           10500]
          Length = 649

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 136/523 (26%), Positives = 235/523 (44%), Gaps = 114/523 (21%)

Query: 34  SALEQSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+   +L+ +  ID      I N + +       + C + L   L+ +  D  
Sbjct: 66  AAVETAFREKFYELLASTSIDDLAFVNIWNLLDIVSIFSDNEQCEAGLIFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
           T+D C ++F Y+E    I      F S K  +LR CN+LLRRLSR+++TVFCGR+ +++ 
Sbjct: 126 TVDGCRKVFDYLESRREI-NTAKHFESKKLIILRSCNELLRRLSRAEDTVFCGRVFIYMF 184

Query: 151 KFFPFSERSGLNIISEFNVENITEF------GGDEEMDVSSNETEETDTEEVDKV----- 199
           + FP  ++S +N+  EF+ EN+T F        ++ M++   E +E  +E+  +      
Sbjct: 185 QSFPLGDKSSVNLRGEFHTENVTTFDDIALKSQEDTMEIEPTEQKEETSEKTSQSVQDEQ 244

Query: 200 -----------------------------KIDFN-FYKKFWSLQDYFRNPVQCYNKVSWK 229
                                         ID +  Y  FW LQ +F  P + ++  ++ 
Sbjct: 245 TKGETSGPKEGTPAVAETAHERKESEATKTIDLDTLYPIFWGLQGFFSAPTRLFDSQNFA 304

Query: 230 MFTSYAETVLAAFK--------------------------SYKLDDVQSSLNPSGDYFAK 263
            F +  E  L+ FK                          + +  DV SS NP      K
Sbjct: 305 SFRTSLEKTLSTFKLMNKVTDTWSTAKAPDEVRRGLKRKRNGEGVDVASSFNP------K 358

Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLEL-------------- 308
           YLT+++L DL+++D  FRR+VL+Q LIL  +  S T ++R   ++L              
Sbjct: 359 YLTSRELFDLEINDIAFRRHVLVQALILLDFILSLTPQARQRLMKLVQPGKSLNKTVLYQ 418

Query: 309 ---KSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
                +  KW   T + V   ++Q   DG+ + ++V  +L  +++W +WK E CP +++P
Sbjct: 419 YTVSEEDAKWAAQTRDIVADYLQQG-IDGKFYYRMVDTVLSRDKNWVRWKVENCPPIEKP 477

Query: 366 LTSITD------EDKKDEPDAKKKKTP----ELTKL--WNSKDNLEACKSAERDFTPSLE 413
                D         K   + + + +P    +LT L   N+  +LE  K  +R  TP+ +
Sbjct: 478 AIQADDYFQARNNALKVFSNKRLRASPMGSLDLTFLADGNNLSSLEQLKDPDRYTTPAPD 537

Query: 414 SYFEEAIQ---QMDPAAAVEEQ--YKKVNDSNYAWRALRLLSR 451
           S+    +     +D A   +E+   ++  +S Y WR LRL SR
Sbjct: 538 SFMMGIVDDTFNIDMAKDKDEKEAAERAKESKY-WRTLRLSSR 579


>gi|145248682|ref|XP_001400680.1| THO complex subunit Tho1 [Aspergillus niger CBS 513.88]
 gi|134081347|emb|CAK41850.1| unnamed protein product [Aspergillus niger]
          Length = 671

 Score =  155 bits (392), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 148/591 (25%), Positives = 247/591 (41%), Gaps = 138/591 (23%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+    L+ +  I+      I N + +       + C   L   L+ +  D  
Sbjct: 66  AAVEIAFREKFYSLLASTSIEDPSFICIWNLLDIISIFSDNEQCEPGLIFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEVNV-----NIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTKKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF-----------------------------G 176
            +FL + FP  ++S +N+  E++ +N+T +                             G
Sbjct: 180 FIFLFQSFPLGDKSAVNLRGEYHTDNVTTYDELTKAAAKDVADADVEMSDEQEAPTATNG 239

Query: 177 GDEEMDVSSNETEETDT------------------EEVDK-VKIDFNFYKKFWSLQDYFR 217
             E+ +  +    E+ T                  E  DK V +D + Y  FW LQ YF 
Sbjct: 240 LKEDSEAQTTSASESHTPAQEPPKTPRVVVPSSKDESQDKGVDLD-DLYPMFWGLQAYFS 298

Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLD--------------------------DVQ 251
            P + ++   +  F +  E  L+AFK+   D                          ++ 
Sbjct: 299 APTKTFDPQHFAKFKTGLEATLSAFKNVDTDLENSSNSKMSEEIRKSTKRKRTTDGQEIA 358

Query: 252 SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE------ 304
           SS NP      KYLT++ L DL+++DT FRR+VL+Q LIL  +  S T K++ +      
Sbjct: 359 SSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDFMLSLTPKAKAKLADLTN 412

Query: 305 -----GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
                G  L  D  KW     + +   + Q    G+ + ++V  +L  +++W +WK EGC
Sbjct: 413 KSVLYGFVLHDDDAKWAVKMRKAIEEYL-QEGIGGKFYYRMVDTVLSRDKNWVRWKAEGC 471

Query: 360 PELKRPLTSITDEDKKDEPDA-----KKKKTPELTKL-------WNSKDNLEACKSAERD 407
           P ++RP  S+++     E        K+ +T  +  L         S   LE  K  ER 
Sbjct: 472 PLIERPPVSVSEYLGAREHATKVYANKRLRTSPMGSLDLKFLSEGESLAGLERLKEPERF 531

Query: 408 FTPSLESYFEEAIQ---QMDPAAAVEE-QYKKVNDSNYAWRALRLLSR-KCPHF------ 456
             P+ +S     +     +D A   E+ +Y   + S+  WR LRL ++ K   F      
Sbjct: 532 SVPAADSLMMGIMDDELDIDTAQTKEDREYAMRSKSSKTWRILRLSAKSKLAEFEKIEDG 591

Query: 457 ----FLNATP-NVEKNSEFIENMVKRCVKEKPSSQISGNGNGVDQDPAEVE 502
                L  TP   +   + +E   +    EKP +     G+G +   A  E
Sbjct: 592 KNLKILFETPQQSDGTPQALEGTPQ--ASEKPPAGSESTGHGQEHGAASTE 640


>gi|358397904|gb|EHK47272.1| hypothetical protein TRIATDRAFT_291414 [Trichoderma atroviride IMI
           206040]
          Length = 800

 Score =  155 bits (392), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 130/493 (26%), Positives = 226/493 (45%), Gaps = 83/493 (16%)

Query: 34  SALEQSIRQYLLKLIKTPDIDI-KVIE--NYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           + +E + R    KLI T  ID  K +   N + +   L     C+  L   L+ +  D  
Sbjct: 80  AVVETAARDSFSKLIATTSIDSPKFVRMWNLLDILSILSDDGQCDPALLFWLVEELLDSQ 139

Query: 91  TLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+  C  +F ++E     +    +KQ+         +LR CN+LLRRLSR+++T FCGR+
Sbjct: 140 TIAGCRIIFDFLESRRERITAKHFKQKNLV------ILRSCNELLRRLSRAEDTAFCGRV 193

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF------GGDEEMDVSSNETEE--------- 190
            +F+ + FP  +RS +N+  E++VEN+T F       G  E+D  +   +E         
Sbjct: 194 FIFMFQSFPLGDRSSVNLRGEYHVENVTSFETPEDDDGKMEVDAQAETPKESGAPKAGAK 253

Query: 191 TDTEEVDKVKI--DFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK----- 243
           T   + +K +       Y  FWSLQ+ F  P++ ++  ++  F S  E  + AF+     
Sbjct: 254 TGAHDSNKAQAPDSDTLYPLFWSLQESFSQPLKLFDVANFTKFKSSLELTIKAFQAIHDD 313

Query: 244 ----SYKLDDVQSSLNPSGD----------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
               S  +++++ +L    D          +  KYLT++ L  L++SD +FRR+VL+Q L
Sbjct: 314 SLYTSKSMENLKRTLKRKRDDDGTETLPEAFNPKYLTSKDLFQLEISDLSFRRHVLVQAL 373

Query: 290 ILFQYFTSTVKSRGEGL--------------ELKSDQEKWVKDTTETVYSLIKQTPPDGE 335
           I+  +  S  K   E                +L  +  KW  D  + +   +KQ   DG 
Sbjct: 374 IITDFLLSLSKEEREKRTNINISNKSVIYHGQLNEENTKWAADIKKVISDYLKQG-ADGP 432

Query: 336 HFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTS---ITDEDKKDEPDAKKKKTPE---- 387
           +F ++V+ +L  +++W  WK   CP ++R P+++   +  ++        K+  P     
Sbjct: 433 YFFRMVETVLARDKNWVFWKMASCPPIQRDPVSAQGFVEAKEAAQRMATSKRLRPNPLNA 492

Query: 388 --LTKLWNSKDN--LEACKSAERDFTPSLESYFEEAIQQMD-----PAAAVEEQYKKVND 438
             L  L N  +   L+  K +ER   P L++ F+  I   D     P  +  +       
Sbjct: 493 VPLDFLANEDETSALDKLKDSERCQLPELDA-FKSKIADDDFEIEMPTNSETKAAAVAGK 551

Query: 439 SNYAWRALRLLSR 451
           ++ +WRALR+ SR
Sbjct: 552 ASKSWRALRIASR 564


>gi|402079094|gb|EJT74359.1| guanylate kinase [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 798

 Score =  155 bits (391), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 135/509 (26%), Positives = 224/509 (44%), Gaps = 93/509 (18%)

Query: 26  PTCDYDKKSALEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPII 81
           P+    +   +E ++R    KL+ T  I+    ++V  N + L   L     C+  L   
Sbjct: 66  PSVRTRQYGVIETAVRDIFGKLVATTSIESPEFVRVW-NLLDLVSILSDMRQCDPVLLFW 124

Query: 82  LLSDTFDMSTLDKCEQLFYYVEV-NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTV 140
           L+ +  D  T   C ++  ++E  +  I    +  M     +LR CN+LLRRLSR+++T 
Sbjct: 125 LVEELLDSQTTSGCRRVLDFLESRHQRITANPSVAMQL--TILRSCNELLRRLSRARDTT 182

Query: 141 FCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM--------------DVSSN 186
           FCGR+ +F+ + FP  +RS +N+  E++VEN+T F  D +               D +S 
Sbjct: 183 FCGRVFIFMFQSFPLGDRSSVNLRGEYHVENVTTFDQDVDRNGTLDKMEIDGHTDDTTSP 242

Query: 187 ETEETDTEEV---DKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF 242
           E + T T      + ++ D    Y  FWSLQ  F  P + +++  ++ F S  E+ +AAF
Sbjct: 243 EAKATATRTSAADESIRKDMEALYPVFWSLQASFSQPKKLFDRSHFERFKSGLESTMAAF 302

Query: 243 KSYKLDDVQSSLNPSGD-------------------YFAKYLTNQKLLDLQLSDTNFRRY 283
           K  +   +  S     D                   +  KYLT++ L  L+LSD  FRRY
Sbjct: 303 KDAQTKQISRSQRQEEDTKTGQKRKHEGEGADLADAFNPKYLTSRDLFRLELSDLTFRRY 362

Query: 284 VLLQFLILFQYFTSTVKSRGE---------------GLELKSDQEKWVKDTTETVYSLIK 328
           +L+Q LI+ ++  S      E                 +L S+  +W   T   +   ++
Sbjct: 363 ILVQALIIMEFLLSLSAQEREIAAVASAPNKSVIYLDQQLSSEDTEWASKTRLAIIVHLE 422

Query: 329 QT---PPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS---ITDEDKKDEPDAKK 382
            +   P +G +F ++VK IL  +++W +WK   C  ++ P       +D  K     A  
Sbjct: 423 TSGAGPSEGAYFVRIVKTILARDKNWVRWKAYACSPIEMPSLPAQFFSDAKKSAYRSATS 482

Query: 383 KK---TP----ELTKLWNSKD--NLEACKSAERDFTPSLESYFEEAIQQ----------- 422
           K+   TP     +  L +S+D   L   K+ ER   P +   F+ AI +           
Sbjct: 483 KRLRATPMNALSIDFLSDSRDSSGLGKFKADERSRIPDV-CAFKRAIAEDELEIDMPTNN 541

Query: 423 MDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
               AA+E +  K      +WRALR+ SR
Sbjct: 542 QTKVAAIEGKASK------SWRALRVASR 564


>gi|327356040|gb|EGE84897.1| nuclear matrix protein [Ajellomyces dermatitidis ATCC 18188]
          Length = 656

 Score =  155 bits (391), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 149/607 (24%), Positives = 252/607 (41%), Gaps = 131/607 (21%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           + +E + R+    L+ T  ID      I N + +       + C   L   L+ +  D  
Sbjct: 68  AVVETAFRKKFYDLLATTSIDEPSFIHIWNLLDIVSIFSDNEQCEPGLLFWLIEELLDSQ 127

Query: 91  TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 128 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRTEDTVFCGRV 181

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDEEMDVSSNETE---ETDTE 194
            +FL + FP  +RS +N+  EF+ EN+T F        GG   MDV ++  E   +  TE
Sbjct: 182 FIFLFQSFPLGDRSSVNLRGEFHTENVTTFDKLPKAQDGGVTPMDVDASAKEVAGDAKTE 241

Query: 195 EVDKVKIDFN-----------------------------------FYKKFWSLQDYFRNP 219
                + + N                                    Y  FWSLQ+ F +P
Sbjct: 242 SAQASEQESNRQAAHPAADVGLQSSQPATKKPQEPEPSSTPSMDELYPIFWSLQNNFSSP 301

Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------DVQSSLNPSGDY 260
              ++  ++  F +  E  L+ F+    D                    V   +  +  +
Sbjct: 302 TSLFDPANFAAFKTGLEASLSIFQKVNTDMEARMTKGSEDARRGLKRRRVGDGMEMTNSF 361

Query: 261 FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLE-----------L 308
             KYLT++ L +L+++D  FRR++L+Q LIL  +  S T K++ +  +           L
Sbjct: 362 NPKYLTSRDLFELEVNDVAFRRHILVQSLILLDFVLSLTPKAKAKLADSTNKSVLYSYVL 421

Query: 309 KSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS 368
             +  KW      ++ S ++Q   +G+ + ++V  +L  +++W +WK E CP ++R   S
Sbjct: 422 NDEDAKWASQMKTSIASYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEACPPIERTPVS 480

Query: 369 ITD------EDKKDEPDAKKKKTP----ELTKLWNSK--DNLEACKSAERDFTPSLESYF 416
           I D         K     + +  P    +L  L   K  +++   K  +R  TP+L+SY 
Sbjct: 481 IQDYLDTQATATKVSTSKRLRSAPLGSLDLNFLTEDKNINSVGRLKHPDRFTTPALDSYM 540

Query: 417 EEAIQ---QMDPAAAVEEQYKKVND-SNYAWRALRLLSRKCPHFF-----------LNAT 461
                    +D A + EE+ +     ++  WR LRL SR     F           L   
Sbjct: 541 RGIADDEFNIDMAQSREEKDEATKARASKTWRTLRLSSRSKLALFDKIEDGNNLKVLFEP 600

Query: 462 PNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVD-QDPAEVEVDTKSEEIQEEEKEEDWE 520
           P   +++    N  K  V+           NGV+ QD      D K E+ +E +     +
Sbjct: 601 PPASEDASSASNKQKAAVE-----------NGVNRQDGPSKRGDVKGEKAKENQPPVSSD 649

Query: 521 AKADPEG 527
             A  +G
Sbjct: 650 GAAPQQG 656


>gi|396459819|ref|XP_003834522.1| similar to nuclear matrix protein [Leptosphaeria maculans JN3]
 gi|312211071|emb|CBX91157.1| similar to nuclear matrix protein [Leptosphaeria maculans JN3]
          Length = 696

 Score =  155 bits (391), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 128/456 (28%), Positives = 215/456 (47%), Gaps = 81/456 (17%)

Query: 71  KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN---LLRMCN 127
           +D+C+  L ++L+ +  D  ++  C  +F ++E      ++    ++ KN    +LR+CN
Sbjct: 117 RDLCSPQLVLLLIEELLDSQSISGCRTVFNFLESR----REAIIAINSKNKDLVILRLCN 172

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG--------DE 179
           +LLRRLSR++  VFCGR+ +F+ + FP  ++S +N+   F+VEN+T F          D+
Sbjct: 173 ELLRRLSRAEAPVFCGRVYIFMFQSFPLGDKSSVNLRGNFHVENVTTFEDFLKESNEVDD 232

Query: 180 EMDVSSNETEETDTEEV----------------DKVKIDFN-FYKKFWSLQDYFRNPVQC 222
            M+V  ++T  T T E                  K  +DF+  Y  FWSLQ  F NP + 
Sbjct: 233 LMEVDHDKTVPTATGESASEHEVLSSETTKSTDAKSTLDFDALYPVFWSLQHSFSNPPRL 292

Query: 223 YNKVSWKMFTSYAETVLAAFK-----------------SYKLDDVQSSLNPSGDYFAKYL 265
           + + ++K F +  +  LA FK                    LDD Q +L  +  +  KYL
Sbjct: 293 FEEDNFKQFRTSLDATLAKFKEVPKVIQSGDSERKKGQKLHLDDDQDALTST--FNPKYL 350

Query: 266 TNQKLLDLQLSDTNFRRYVLLQFLILFQ-------------YFTSTVKSRGEGLELKSDQ 312
           T++ L  L+LSD  F+R++L+Q LIL               ++ +  K+      L+ D 
Sbjct: 351 TSRDLFKLELSDLAFQRHILVQALILIDFLLTLTEDAKKQPFYQNAQKAMQYNFTLREDD 410

Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD- 371
            KW       + + + Q  PDG+ + ++V  +L  +++W +WK E C    R   +  D 
Sbjct: 411 TKWALGIKSAIANYL-QEDPDGKFYYRMVDTVLSRDKNWVRWKMENCQPFTRDRVAAKDF 469

Query: 372 --EDKKDEPDAKKKKTPELTKLWNSKD---NLEACKS----AERDF--TPSLESYFEEA- 419
                  +     KK P+   + ++ D   N EA K      +RD    P+ ESY +   
Sbjct: 470 LQAKSGAQRAVVSKKAPKSLGVPHTLDFLNNTEAEKGLSQLRQRDRFNVPNAESYADRVR 529

Query: 420 -IQQMDPAAAVEEQYKKV--NDSNYAWRALRLLSRK 452
              Q    A  EE+ ++V   +S++ WR LRL S+K
Sbjct: 530 MTDQNLGMADTEEEKRQVAEKNSSHVWRGLRLASKK 565


>gi|154311261|ref|XP_001554960.1| hypothetical protein BC1G_06483 [Botryotinia fuckeliana B05.10]
          Length = 661

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 133/485 (27%), Positives = 215/485 (44%), Gaps = 105/485 (21%)

Query: 60  NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK 119
           N + +   L   + C   L   L+ +  D  T+  C ++F Y+E        + F  + K
Sbjct: 99  NLLDIISILSDDEQCEPALLFWLVEELLDSQTIAGCRRIFDYLESRREKITAKHF--NQK 156

Query: 120 N-NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--- 175
           N  +LR CN+LLRRLSR+++T FCGR+ +F+ + FP  ++S +N+  EF+VEN+T F   
Sbjct: 157 NLVILRSCNELLRRLSRAEDTAFCGRVFIFMFQSFPLGDKSSVNLRGEFHVENVTSFDTL 216

Query: 176 ----GGDEEMDVSSNE-TEETD----------------TEEVDKVKIDF----------- 203
               G  E+M+V S+   +ET                 +E+++    D            
Sbjct: 217 PSNPGDVEKMEVDSDSNAKETSPASKPASNGRTSTARGSEDINGSGKDVASSKREPPMKP 276

Query: 204 -NFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL--------------- 247
              Y  FWSLQ  F  P + +N  ++  F    E  +A FKS                  
Sbjct: 277 DELYPIFWSLQQSFSQPKKLFNSENFAEFKKGLEATIATFKSVHHETAGRPTARVAEDSK 336

Query: 248 -----------DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT 296
                      DD+ +S NP      KYLT++ L +L++SD +FRR++L+Q LI+  +  
Sbjct: 337 RGTKRKRSQGDDDLANSFNP------KYLTSRDLFELEISDLSFRRHILVQILIIMDFLL 390

Query: 297 STVKSRGEGLE-----------------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQ 339
           S      E L                  L  +  KW  D  +++   +K+   DG  F +
Sbjct: 391 SLSAKAKEKLSKALPDNLNRSVTYADQTLSEEDTKWSLDMKKSIAEYLKKG-ADGAFFYR 449

Query: 340 VVKLILKGEEHWNQWKNEGCPELKRPLTS----ITDEDKKDEPDAKKKKTP------ELT 389
           +V+ +L  +++W +WK E CP + RP  S    IT +         K+  P      +L 
Sbjct: 450 MVETVLSRDKNWVRWKVENCPSIARPAVSPHDYITAKASARRATTNKRLRPNPLGSLDLK 509

Query: 390 KLWNS--KDNLEACKSAERDFTPSLESY-FEEAIQQMDPAAAVEEQYKKV---NDSNYAW 443
            L  S  +  LE  K  ER   PS++S+  +  +  MD   A +E+ K +     ++  W
Sbjct: 510 FLAESDLRSGLERLKKPERYQIPSIKSFKNKIDLDDMDIDMARDEESKNIAIEAKASKCW 569

Query: 444 RALRL 448
           RALR+
Sbjct: 570 RALRI 574


>gi|347829111|emb|CCD44808.1| similar to THO complex subunit Tho1 [Botryotinia fuckeliana]
          Length = 661

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 137/511 (26%), Positives = 225/511 (44%), Gaps = 107/511 (20%)

Query: 36  LEQSIRQYLLKLIKTPDIDIKVIE--NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLD 93
           +E ++R     L+ +  I+       N + +   L   + C   L   L+ +  D  T+ 
Sbjct: 73  IETAVRDTFNNLLASISIETSSSRLWNLLDIISILSDDEQCEPALLFWLVEELLDSQTIA 132

Query: 94  KCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKF 152
            C ++F Y+E        + F  + KN  +LR CN+LLRRLSR+++T FCGR+ +F+ + 
Sbjct: 133 GCRRIFDYLESRREKITAKHF--NQKNLVILRSCNELLRRLSRAEDTAFCGRVFIFMFQS 190

Query: 153 FPFSERSGLNIISEFNVENITEF-------GGDEEMDVSSNE-TEETD------------ 192
           FP  ++S +N+  EF+VEN+T F       G  E+M+V S+   +ET             
Sbjct: 191 FPLGDKSSVNLRGEFHVENVTSFDTLPSNPGDVEKMEVDSDSNAKETSPASKPASNGRTS 250

Query: 193 ----TEEVDKVKIDF------------NFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAE 236
               +E+++    D               Y  FWSLQ  F  P + +N  ++  F    E
Sbjct: 251 TARGSEDINGSGKDVASSKREPPMKPDELYPIFWSLQQSFSQPKKLFNSENFAEFKKGLE 310

Query: 237 TVLAAFKSYKL--------------------------DDVQSSLNPSGDYFAKYLTNQKL 270
             +A FKS                             DD+ +S NP      KYLT++ L
Sbjct: 311 ATIATFKSVHHETAGRPTARVAEDSKRGTKRKRSQGDDDLANSFNP------KYLTSRDL 364

Query: 271 LDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLE-----------------LKSDQE 313
            +L++SD +FRR++L+Q LI+  +  S      E L                  L  +  
Sbjct: 365 FELEISDLSFRRHILVQILIIMDFLLSLSAKAKEKLSKALPDNLNRSVTYADQTLSEEDT 424

Query: 314 KWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS----I 369
           KW  D  +++   +K+   DG  F ++V+ +L  +++W +WK E CP + RP  S    I
Sbjct: 425 KWSLDMKKSIAEYLKKG-ADGAFFYRMVETVLSRDKNWVRWKVENCPSIARPAVSPHDYI 483

Query: 370 TDEDKKDEPDAKKKKTP------ELTKLWNS--KDNLEACKSAERDFTPSLESY-FEEAI 420
           T +         K+  P      +L  L  S  +  LE  K  ER   PS++S+  +  +
Sbjct: 484 TAKASARRATTNKRLRPNPLGSLDLKFLAESDLQSGLERLKKPERYQIPSIKSFKNKIDL 543

Query: 421 QQMDPAAAVEEQYKKV---NDSNYAWRALRL 448
             MD   A +E+ K +     ++  WRALR+
Sbjct: 544 DDMDIDMARDEESKNIAIEAKASKCWRALRI 574


>gi|212535848|ref|XP_002148080.1| THO complex subunit Tho1, putative [Talaromyces marneffei ATCC
           18224]
 gi|210070479|gb|EEA24569.1| THO complex subunit Tho1, putative [Talaromyces marneffei ATCC
           18224]
          Length = 646

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 140/525 (26%), Positives = 233/525 (44%), Gaps = 117/525 (22%)

Query: 34  SALEQSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+   +L+ +  ID      I N + +         C + L   L+ +  D  
Sbjct: 66  AAVETAFREKFYELLASTSIDDPAFVNIWNLLDIVSIFSDNAQCEAGLIFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
           T+D C ++F Y+E    I      F S K  +LR CN+LLRRLSR+++TVFCGR+ +++ 
Sbjct: 126 TVDGCRKVFDYLESRREI-NTAKHFESKKLIILRSCNELLRRLSRAEDTVFCGRVFIYMF 184

Query: 151 KFFPFSERSGLNIISEFNVENITEFG---------------------------------- 176
           + FP  ++S +N+  EF+ ENIT F                                   
Sbjct: 185 QSFPLGDKSSVNLRGEFHTENITTFDNIVKTSQEDAMETEPTEPTETTEKMAQVAQGEPI 244

Query: 177 GDEEMDVSSNETEETDTEEVDKVK-------IDFN-FYKKFWSLQDYFRNPVQCYNKVSW 228
            + E   +  E   T  E  D+ K       +D +  Y  FW LQ +F  P + ++  S+
Sbjct: 245 NEGEATRAVKEGTPTVAEAADEKKENDPSKTVDLDALYPIFWGLQGFFSAPTRLFDSQSF 304

Query: 229 KMFTSYAETVLAAFK--------------------------SYKLDDVQSSLNPSGDYFA 262
             F +  E  L+ FK                          + +  D+ SS NP      
Sbjct: 305 ASFRAGLEKTLSTFKLMNKVTDTWSTAKAPDELRRGLKRKRNGEGVDIASSFNP------ 358

Query: 263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSR---------GEGL------ 306
           KYLT+++L DL+++D  FRR+VL+Q LIL  +  S T ++R         G+ L      
Sbjct: 359 KYLTSRELFDLEINDIAFRRHVLVQALILLDFILSLTPQARQRLMKSVQPGKALNKTVLY 418

Query: 307 --ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
              +  +  KW   T + V   ++Q   DG+ + ++V  +L  +++W +WK E CP +++
Sbjct: 419 QYAVSEEDAKWAAQTRDVVADYLQQG-IDGKFYYRMVDTVLSRDKNWVRWKVENCPPIEK 477

Query: 365 PLTSITDEDK-KDEPDAKK-------KKTP----ELTKL--WNSKDNLEACKSAERDFTP 410
           P  +I  +D  +   +A K       +  P    +LT L   NS  +LE  K  +R  TP
Sbjct: 478 P--AIKGDDYFQARGNASKVFTTKRLRAAPMGSLDLTFLTDGNSISSLEQLKEPDRYSTP 535

Query: 411 SLESYFEEAI-QQMDPAAAVEEQYKKVNDS---NYAWRALRLLSR 451
           + +S+    +  + D   A ++  K+  +    +  WR LRL SR
Sbjct: 536 APDSFMMGIVDDEFDIDMAKDKDEKEAAERAKVSKLWRTLRLASR 580


>gi|326483229|gb|EGE07239.1| THO complex subunit 1 [Trichophyton equinum CBS 127.97]
          Length = 635

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 138/536 (25%), Positives = 234/536 (43%), Gaps = 114/536 (21%)

Query: 16  VVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKD 72
           V++  Y +        + +A+E + R+    L+ T  ID      I N + +       +
Sbjct: 48  VLAGDYGISVGANQQTQYAAVETAFREIFYDLLATTSIDEPSFMQIWNLLDIVSIFSDNE 107

Query: 73  MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCN 127
            C   L   L+ +  D  T+D C ++F Y+E          +KQ++        +LR CN
Sbjct: 108 KCEPGLIFWLIEELLDSQTIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCN 161

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG--------GDE 179
           +LLRRLSR+++TVFCGR+ +FL + FP  ++S +N+  E++ EN+T F          DE
Sbjct: 162 ELLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSSVNLRGEYHTENVTTFDVQPQAQAESDE 221

Query: 180 EMDVSSNETE-------ETDTEEVD-----------------------KVKIDFNFYKKF 209
            MD+   E E        TD E                          ++++D   Y  F
Sbjct: 222 VMDIDIQEGETQVSNAKSTDQESTTAHAPLESTNDATPQRAKAPSIQPELELD-TLYAAF 280

Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK--SYKLD------------------- 248
           WSLQ+ F  P++ ++   +    S  E  L++F+  S KL+                   
Sbjct: 281 WSLQESFSKPIRLFDTTHFASLKSGLEATLSSFRKVSSKLEVRSTTKGSDELRRGTKRKR 340

Query: 249 -----DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF----------- 292
                D+ +S NP      KYLT++ L +L+++D +FRR++L+Q LIL            
Sbjct: 341 GEGGSDLPNSFNP------KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPKSK 394

Query: 293 -QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
            +   ST KS      L  +  KW      ++ S + Q   DG+ + ++V  +L  +++W
Sbjct: 395 AKLADSTNKSVLYSYVLSDEDAKWATQMRSSIASYL-QKGADGKFYFRMVDTVLTRDKNW 453

Query: 352 NQWKNEGCPELKRPLTSITDE------DKKDEPDAKKKKTP------ELTKLWNSKDNLE 399
            +WK EGCP +++P   +           K   + + + TP          + ++  +L 
Sbjct: 454 VRWKAEGCPPIEQPPVPVQSHLDTQSSAAKITANKRLRSTPMGSLDLGFLSVESNLGDLN 513

Query: 400 ACKSAERDFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
             K  ER  TPSL+ Y          +D A   EE+ + +   ++  WR LR+ SR
Sbjct: 514 RLKEPERFSTPSLDFYMSGITDDDFNIDMAKDDEEKEQSIKAKASKTWRTLRIASR 569


>gi|367051076|ref|XP_003655917.1| hypothetical protein THITE_2120207 [Thielavia terrestris NRRL 8126]
 gi|347003181|gb|AEO69581.1| hypothetical protein THITE_2120207 [Thielavia terrestris NRRL 8126]
          Length = 765

 Score =  153 bits (387), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 105/390 (26%), Positives = 183/390 (46%), Gaps = 64/390 (16%)

Query: 34  SALEQSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           + +E ++R     LI T  I+      + N   +   L   ++C+  L   L+ +  D  
Sbjct: 76  AVIETAVRDTFESLIATTPIESPAFVKVWNLFDILSLLSDCELCDPALLFWLVEELLDSQ 135

Query: 91  TLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+  C ++F ++E     +    +KQ+         +LR CN+LLRRLSR+ +  FCGR+
Sbjct: 136 TIAGCRKVFDFLESRRERITAKHFKQKQLV------ILRTCNELLRRLSRALDPAFCGRV 189

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTE---EVDKVKID 202
            +F+ + FP  ++S +N+  E++VEN+T +  D+       E  + D+E   + DK    
Sbjct: 190 FIFMFQSFPLGDKSSVNLRGEYHVENVTTW--DQTTPAPDGEKMDVDSEPKPQKDKALDP 247

Query: 203 FNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSY----------------- 245
              Y  FWSLQ+ F  P + ++   +  F +  E  +A F+S                  
Sbjct: 248 DTLYPVFWSLQESFNQPKKLFDSGHFAAFKAGLEATMATFRSIKPEQPSRSKEKQDKPVE 307

Query: 246 --------KLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS 297
                   K DDV   L  +  +  KYLT++ L  L++SD  FRR +L+Q LI+ ++  +
Sbjct: 308 EMGPSLKRKRDDVDDEL--ASGFNPKYLTSRDLFKLEISDLAFRRNILVQALIVMEFLLA 365

Query: 298 TVKSRGEGL---------------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVK 342
                 E L               +L  +  KW  D  E +   +KQ   +G +F+++V+
Sbjct: 366 LSPKAKEKLASVKAPNKSVTYSDQQLSEEDAKWAADMKEAIVEYLKQG-VEGPYFNRMVE 424

Query: 343 LILKGEEHWNQWKNEGCPELKRPLTSITDE 372
            +L  +++W +WK E CP ++ P  S+T E
Sbjct: 425 TVLSRDKNWVRWKIENCPSIELP--SMTPE 452


>gi|156065307|ref|XP_001598575.1| hypothetical protein SS1G_00664 [Sclerotinia sclerotiorum 1980]
 gi|154691523|gb|EDN91261.1| hypothetical protein SS1G_00664 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 658

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 129/491 (26%), Positives = 212/491 (43%), Gaps = 113/491 (23%)

Query: 60  NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTF 114
           N + +   L   + C   L   L+ +  D  T+  C ++F Y+E     +    + Q+  
Sbjct: 96  NLLDIISILSDDEQCEPALLFWLVEELLDSQTIAGCRKIFDYLESRREKITAKHFNQKNL 155

Query: 115 FMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITE 174
                  +LR CN+LLRRLSR+++T FCGR+ +F+ + FP  ++S +N+  EF+VEN+T 
Sbjct: 156 V------ILRSCNELLRRLSRAEDTAFCGRVFIFMFQSFPLGDKSSVNLRGEFHVENVTS 209

Query: 175 FGG-------DEEMDVSSN---ETEETDTEEVDKVKI-------DFN------------- 204
           F          E M+V S+   +T   DT+     +        D N             
Sbjct: 210 FDALPRNPVDTENMEVDSDSNLKTTPPDTKATPNGRTSTARGSEDSNGSGKDVATSKHEP 269

Query: 205 ------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL----------- 247
                  Y  FWSLQ  F  P + +N  ++  F    E  +A FKS +            
Sbjct: 270 PMKPDELYPIFWSLQQSFSQPKRLFNAENFAEFKKGLEATIATFKSVQHETVGRPAARAA 329

Query: 248 ---------------DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
                          DD+ +S NP      KYLT++ L +L++SD +FRR++L+Q LI+ 
Sbjct: 330 EDNKRGTKRKRSQGDDDLANSFNP------KYLTSRDLFELEISDLSFRRHILVQILIIM 383

Query: 293 QYFTSTVKSRGEGLE-----------------LKSDQEKWVKDTTETVYSLIKQTPPDGE 335
            +  S      E L                  L  +  KW  +T + +   +K+   DG 
Sbjct: 384 DFLLSLSTKAKEKLSKALPENINRSVTYADQTLSEEDTKWSLETKKIIAEYLKKG-ADGA 442

Query: 336 HFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD------------EDKKDEPDAKKK 383
            F ++V+ +L  +++W +WK E CP + RP  S  D             +K+  P+    
Sbjct: 443 FFYRMVETVLSRDKNWVRWKVENCPSIARPAVSPQDYIDAKASARKATTNKRLRPNPLGS 502

Query: 384 KTPELTKLWNSKDNLEACKSAERDFTPSLESYF-EEAIQQMDPAAAVEEQYKKV---NDS 439
              +     + +  LE  K +ER   PS++S+  +  +  MD   A +E+ K V     +
Sbjct: 503 LDLKFLAETDVQSGLERLKKSERYQVPSIKSFRNKIDLDDMDIDMARDEESKNVAIEAKA 562

Query: 440 NYAWRALRLLS 450
           +  WRALR+ S
Sbjct: 563 SKCWRALRIAS 573


>gi|315041375|ref|XP_003170064.1| THO complex subunit 1 [Arthroderma gypseum CBS 118893]
 gi|311345098|gb|EFR04301.1| THO complex subunit 1 [Arthroderma gypseum CBS 118893]
          Length = 635

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 135/517 (26%), Positives = 233/517 (45%), Gaps = 112/517 (21%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+    L+ T  I+      I N + +       + C   L   L+ +  D  
Sbjct: 66  AAVETAFREIFYDLLATTSINEPSFMRIWNLLDIVSIFSDNEKCEPGLIFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFG--------GDEEMDVS------------- 184
            +FL + FP  ++S +N+  E++ EN+T F         G+E MD+              
Sbjct: 180 FIFLFQSFPLGDKSSVNLRGEYHTENVTTFDIQPKIQAEGEEAMDIDIQGDVTPASNAKD 239

Query: 185 --------------SNETEETDTEEVD-KVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSW 228
                         +N+     T+E + ++++D +  Y  FW LQ+ F  P + ++   +
Sbjct: 240 SQQEPTITQAPLEVTNDISSQRTKEPEIQLELDLDTLYAAFWGLQESFSKPTRLFDAAHF 299

Query: 229 KMFTSYAETVLAAFK--SYKLD------------------------DVQSSLNPSGDYFA 262
               S  E  L++F+  S K++                        D+ +S NP      
Sbjct: 300 ASLKSGLEATLSSFRKVSSKVEVRNTTKGLDESRRGTKRKRAEGGSDLPNSFNP------ 353

Query: 263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILF------------QYFTSTVKSRGEGLELKS 310
           KYLT++ L +L+++D +FRR++L+Q LIL             +   ST KS   G  L  
Sbjct: 354 KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPKSKAKLADSTNKSVLYGYVLSD 413

Query: 311 DQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSIT 370
           +  KW      ++ S + Q   DG+ + ++V  +L  +++W +WK EGCP ++RP   + 
Sbjct: 414 EDAKWATQMRSSIASYL-QKGADGKFYFRMVDTVLTRDKNWVRWKAEGCPPIERPPVPVQ 472

Query: 371 DE------DKKDEPDAKKKKTP----ELTKLWNSKD--NLEACKSAERDFTPSLESYFEE 418
                     K   + + + TP    +L  L    +  +L+  K  ER  TPSL+ Y   
Sbjct: 473 SHLDTQSSAVKITANRRLRSTPMGSLDLGFLAAESNLGDLDRLKEPERFATPSLDFYMSG 532

Query: 419 AIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
                  +D A   +E+ + V   ++  WR LR+ SR
Sbjct: 533 ITDDDFNIDMAKDDDEKEQSVKAKASKTWRTLRIASR 569


>gi|325187942|emb|CCA22486.1| THO complex subunit putative [Albugo laibachii Nc14]
          Length = 582

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 130/468 (27%), Positives = 200/468 (42%), Gaps = 84/468 (17%)

Query: 54  DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
           D   I   + LC++  + ++     P  +L D  +   +  CE L+  +EV         
Sbjct: 48  DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 107

Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
           FF +          LLRMCN +LRRLS++ NTVFCGRIL+FL+  F  +ERS        
Sbjct: 108 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAA------ 161

Query: 168 NVENITEFGGD-----EEMDVSSNETEETDT-------EEVDKVKIDFNFYKKFWSLQDY 215
           NVEN+T F         E D  +N  E  D        ++    +ID+  Y  FW +Q +
Sbjct: 162 NVENLTIFEDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQRF 221

Query: 216 FRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNPS 257
           FR+  +   + +  K F      VL AF++    D                 V S+   S
Sbjct: 222 FRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKMS 281

Query: 258 GD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKS 301
            D                +  KYLTN +L  LQL D   R  VL Q LIL  +    V++
Sbjct: 282 IDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHL---VRA 338

Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
           +   +   S Q+  + +  + V  L+K+TPPDG  +S++V  +L+ E +W +WK E CP 
Sbjct: 339 KS-TVSQDSTQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCPS 397

Query: 362 L--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----LE 399
                          KR   + T+   + +   +KK   E   + ++  S  +     LE
Sbjct: 398 YEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLLE 457

Query: 400 ACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
             K   R    +L    +   +  DPA+ ++++Y    D  Y WR LR
Sbjct: 458 KIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 505


>gi|325187945|emb|CCA22489.1| THO complex subunit putative [Albugo laibachii Nc14]
          Length = 545

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 130/469 (27%), Positives = 202/469 (43%), Gaps = 86/469 (18%)

Query: 54  DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT 113
           D   I   + LC++  + ++     P  +L D  +   +  CE L+  +EV         
Sbjct: 48  DTTGIPALLDLCIDGAIANILFQFAPYKVLEDVMESQPIQTCEALWDILEVRKTRLTASP 107

Query: 114 FFMSCKNN------LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEF 167
           FF +          LLRMCN +LRRLS++ NTVFCGRIL+FL+  F  +ERS        
Sbjct: 108 FFDTSGRTTKSSLTLLRMCNAMLRRLSKTHNTVFCGRILVFLSFTFTLNERSAA------ 161

Query: 168 NVENITEFGGDEEMDVSSNETEETDTEEVDKV-------------KIDFNFYKKFWSLQD 214
           NVEN+T F  D+    ++   E+T+  E D               +ID+  Y  FW +Q 
Sbjct: 162 NVENLTIF-EDQNTFEAAEADEKTNRNENDDALSKEDCQDDGMSEQIDYKLYHIFWDVQR 220

Query: 215 YFRN-PVQCYNKVSWKMFTSYAETVLAAFKSYKLDD-----------------VQSSLNP 256
           +FR+  +   + +  K F      VL AF++    D                 V S+   
Sbjct: 221 FFRDYSLAASSVMDKKTFLEELNMVLTAFEANAFSDEDTTRSSDLNRTVSKSTVSSASKM 280

Query: 257 SGD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
           S D                +  KYLTN +L  LQL D   R  VL Q LIL  +    V+
Sbjct: 281 SIDKIMHDDNSTDKLQEHFFQPKYLTNSRLFRLQLRDPTLRECVLTQALILLSHL---VR 337

Query: 301 SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
           ++   +   S Q+  + +  + V  L+K+TPPDG  +S++V  +L+ E +W +WK E CP
Sbjct: 338 AKS-TVSQDSTQKSEMVEAYDRVMKLLKRTPPDGAGYSEMVASVLERERNWTKWKEERCP 396

Query: 361 EL--------------KRPLTSITDEDKKDEPDAKKKKTPE---LTKLWNSKDN-----L 398
                           KR   + T+   + +   +KK   E   + ++  S  +     L
Sbjct: 397 SYEKYPDSSEKLLGSSKRSNEAATEHSGEKKRVMRKKNALESGLMDQILESDASRSSTLL 456

Query: 399 EACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALR 447
           E  K   R    +L    +   +  DPA+ ++++Y    D  Y WR LR
Sbjct: 457 EKIKGESRATNIALPKLMDRFTEAWDPASGIDKEYWPDQDEIYCWRTLR 505


>gi|326475798|gb|EGD99807.1| nuclear matrix protein [Trichophyton tonsurans CBS 112818]
          Length = 635

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 135/536 (25%), Positives = 236/536 (44%), Gaps = 114/536 (21%)

Query: 16  VVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKD 72
           V++  Y +        + +A+E + R+    L+ T  ID      I N + +       +
Sbjct: 48  VLAGDYGISVGANQQTQYAAVETAFREIFYDLLATTSIDEPSFMQIWNLLDIVSIFSDNE 107

Query: 73  MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCN 127
            C   L   L+ +  D  T+D C ++F Y+E          +KQ++        +LR CN
Sbjct: 108 KCEPGLIFWLIEELLDSQTIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCN 161

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG--------GDE 179
           +LLRRLSR+++TVFCGR+ +FL + FP  ++S +N+  E++ EN+T F          DE
Sbjct: 162 ELLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSSVNLRGEYHTENVTTFDVQPQAQAESDE 221

Query: 180 EMDVSSNETE-------ETDTEEVD-----------------------KVKIDFNFYKKF 209
            MD+   E E        TD E                          ++++D   Y  F
Sbjct: 222 VMDIDIQEGETQVSNAKSTDQESTTAHAPLESTNDATPQRAKAPSIQPELELD-TLYAAF 280

Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK--SYKLD------------------- 248
           WSLQ+ F  P++ ++   +    S  E  L++F+  S KL+                   
Sbjct: 281 WSLQESFSKPIRLFDTTHFASLKSGLEATLSSFRKVSSKLEVRSTTKGSDELRRGTKRKR 340

Query: 249 -----DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQ-FLILFQYFTSTVKSR 302
                D+ +S NP      KYLT++ L +L+++D +FRR++L+Q  ++L    + T KS+
Sbjct: 341 GEGGSDLPNSFNP------KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPKSK 394

Query: 303 GEGLE-----------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
            +  +           L  +  KW      ++ S + Q   DG+ + ++V  +L  +++W
Sbjct: 395 AKLADSTNKPVLYSYVLSDEDAKWATQMRSSIASYL-QKGADGKFYFRMVDTVLTRDKNW 453

Query: 352 NQWKNEGCPELKRPLTSITDE------DKKDEPDAKKKKTP------ELTKLWNSKDNLE 399
            +WK EGCP +++P   +           K   + + + TP          + ++  +L 
Sbjct: 454 VRWKAEGCPPIEQPPVPVQSHLDTQSSAAKITANKRLRSTPMGSLDLGFLSVESNLGDLN 513

Query: 400 ACKSAERDFTPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
             K  ER  TPSL+ Y          +D A   EE+ + +   ++  WR LR+ SR
Sbjct: 514 RLKEPERFSTPSLDFYMSGITDDDFNIDMAKDDEEKEQSIKAKASKTWRTLRIASR 569


>gi|171687098|ref|XP_001908490.1| hypothetical protein [Podospora anserina S mat+]
 gi|170943510|emb|CAP69163.1| unnamed protein product [Podospora anserina S mat+]
          Length = 826

 Score =  152 bits (384), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 136/517 (26%), Positives = 222/517 (42%), Gaps = 130/517 (25%)

Query: 36  LEQSIRQYLLKLIKTPDID----IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMST 91
           +E ++R     L+    ID    +KV  N   +   L   ++C+  L + L+ +  D  T
Sbjct: 76  IETAVRDTFKYLVSRASIDSPEFVKVW-NLFDILSILSDSELCDPALLLWLVEELLDSQT 134

Query: 92  LDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRIL 146
           +  C ++F ++E     +    +KQ+         +LR CN+LLRRLSR+ +  F GR+L
Sbjct: 135 VAGCRKVFDFLESRRERITAKHFKQKQLV------ILRTCNELLRRLSRALDPAFSGRVL 188

Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEF--------GGDEEMDVSSNETEETD------ 192
           +F+ + FP  ++S +N+  EF++EN+T F        G +++MDV ++     D      
Sbjct: 189 IFMFQSFPLGDKSSVNLRGEFHIENVTTFDQIPAKTDGDEDKMDVDTDVGTTGDGSRRGR 248

Query: 193 TEEVDKVKIDFN---FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL-- 247
           + +VD  K   +    Y  FW LQ++F  P   +    +  F S  E  +A F S K   
Sbjct: 249 SNDVDTKKKALDPDALYPIFWGLQEFFNQPKLLFEASHFASFKSGLEATMATFLSMKFEQ 308

Query: 248 ---------------------------DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
                                      D + S  NP      KYLT++ L  L++SD  F
Sbjct: 309 PPRVKERPDRPVEEPKHALKRKREESNDALSSGFNP------KYLTSRDLFKLEISDLTF 362

Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGLE---------------LKSDQEKWVKDTTETVYS 325
           RR +L+Q LI+ ++  +      E L                L  +  KWV +T E++ S
Sbjct: 363 RRNILIQALIIMEFLLALSPKAKEKLATVKVPNKSVTYSEKTLSEEDIKWVVETKESIKS 422

Query: 326 LIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKT 385
            +K   P+G HF ++V+ +L  +++W +WK E CP ++ P  S         PD   +  
Sbjct: 423 YLKLG-PEGPHFCRLVETVLSRDKNWVRWKVENCPPIELPQLS---------PDVFVEAR 472

Query: 386 PELTKLWNSK---------------------DNLEACKSAERDFTPSLESY--------F 416
               KL  +K                       L+  K   R   PSL+SY        F
Sbjct: 473 TNAGKLATTKRLRPTPMGSLNLDFLEDDDEERALKKLKDCHRYKVPSLDSYKKGIADDDF 532

Query: 417 EEAIQQMDPA--AAVEEQYKKVNDSNYAWRALRLLSR 451
           E  +   + +  AA+E +  K      AWRALR+ S+
Sbjct: 533 EIGMPTNNDSKTAAIEGKASK------AWRALRIASK 563


>gi|119179900|ref|XP_001241471.1| hypothetical protein CIMG_08634 [Coccidioides immitis RS]
 gi|392866650|gb|EAS30175.2| nuclear matrix protein [Coccidioides immitis RS]
          Length = 629

 Score =  152 bits (383), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 101/418 (24%), Positives = 185/418 (44%), Gaps = 87/418 (20%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+    L+ T  I+      + N + +       + C   L   L+ +  D  
Sbjct: 66  AAVETAFREKFYHLLATTSINKPEFVHMWNLLDVVSIFSDNERCEPGLIFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF------------------------------ 175
            +FL + FP  ++S +N+  E++ EN+T F                              
Sbjct: 180 FIFLFQSFPLGDKSSVNLRGEYHTENVTTFDEPPKHIAANEGADDVEMKDAEAQKPSDNS 239

Query: 176 --------GGDEEMDVSSNETEETDTEEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKV 226
                   GG    ++    ++++   + +   ID +  Y  FW LQ  F  P + ++  
Sbjct: 240 SKTDAASQGGSVSAEIDGQRSQQSTDHQAEDTSIDMDALYPVFWGLQANFSAPTRLFDHE 299

Query: 227 SWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD---------------------YFAKYL 265
            +  F    E+ +++F+   +D  +  +    +                     +  KYL
Sbjct: 300 HFASFKKGLESTISSFQKVSIDLEKRGMTKGSEETSRRGLKRKRGETGPETANTFNPKYL 359

Query: 266 TNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS------------TVKSRGEGLELKSDQE 313
           T++ L +L+++D  FRR++L+Q LIL  +  S            T KS   G  L  +  
Sbjct: 360 TSRDLFELEVNDVAFRRHILVQSLILLDFLISLSPKAKAKLADATNKSVLYGYVLSDEDA 419

Query: 314 KWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD 371
           KW      ++ S ++Q   DG+ + ++V  +L  +++W +WK EGCP +++P   I D
Sbjct: 420 KWANQMKSSIASYLQQGL-DGKFYYRMVDTVLTRDKNWVRWKAEGCPPIEKPPIQIQD 476


>gi|350639205|gb|EHA27559.1| hypothetical protein ASPNIDRAFT_41496 [Aspergillus niger ATCC 1015]
          Length = 694

 Score =  152 bits (383), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 150/608 (24%), Positives = 247/608 (40%), Gaps = 149/608 (24%)

Query: 34  SALEQSIRQYLLKLIKTPDIDIKVIENYIS----LCVELC---MKDMCNSTLPIILLSDT 86
           +A+E + R+    L+  P+    V E  +     +  E C     + C   L   L+ + 
Sbjct: 66  AAVEIAFREKFYSLLPNPESYNNVAEIMVGFGGLIGYEWCPDIFAEQCEPGLIFWLIEEL 125

Query: 87  FDMSTLDKCEQLFYYVE------------------VNVNIWKQQTFFMSCKNNLLRMCND 128
            D  T+D C ++F Y+E                   +  +   Q  F      +LR CN+
Sbjct: 126 LDSQTIDGCRKVFDYLESRRERNTKVGLLVGPSSPTSHMLTPPQKHFKQKSLIILRSCNE 185

Query: 129 LLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF------------- 175
           LLRRLSR+++TVFCGR+ +FL + FP  ++S +N+  E++ +N+T +             
Sbjct: 186 LLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSAVNLRGEYHTDNVTTYDELTKAAAKDVAD 245

Query: 176 ----------------GGDEEMDVSSNETEETDT------------------EEVDK-VK 200
                           G  E+ +  +    E+ T                  E  DK V 
Sbjct: 246 ADVEMSDEQEAPTATNGLKEDSEAQTTSASESHTPAQEPPKTPRVVVPSSKDESQDKGVD 305

Query: 201 IDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLD------------ 248
           +D + Y  FW LQ YF  P + ++   +  F +  E  L+AFK+   D            
Sbjct: 306 LD-DLYPMFWGLQAYFSAPTKTFDPQHFAKFKTGLEATLSAFKNVDTDLENSSNSKMSEE 364

Query: 249 --------------DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY 294
                         ++ SS NP      KYLT++ L DL+++DT FRR+VL+Q LIL  +
Sbjct: 365 IRKSTKRKRTTDGQEIASSFNP------KYLTSRDLFDLEVNDTAFRRHVLVQALILLDF 418

Query: 295 FTS-TVKSRGE-----------GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVK 342
             S T K++ +           G  L  D  KW     + +   + Q    G+ + ++V 
Sbjct: 419 MLSLTPKAKAKLADLTNKSVLYGFVLHDDDAKWAVKMRKAIEEYL-QEGIGGKFYYRMVD 477

Query: 343 LILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDA-----KKKKTPELTKL------ 391
            +L  +++W +WK EGCP ++RP  S+++     E        K+ +T  +  L      
Sbjct: 478 TVLSRDKNWVRWKAEGCPLIERPPVSVSEYLGAREHATKVYANKRLRTSPMGSLDLKFLS 537

Query: 392 -WNSKDNLEACKSAERDFTPSLESYFEEAIQ---QMDPAAAVEE-QYKKVNDSNYAWRAL 446
              S   LE  K  ER   P+ +S     +     +D A   E+ +Y   + S+  WR L
Sbjct: 538 EGESLAGLERLKEPERFSVPAADSLMMGIMDDELDIDTAQTKEDREYAMRSKSSKTWRIL 597

Query: 447 RLLSR-KCPHF----------FLNATP-NVEKNSEFIENMVKRCVKEKPSSQISGNGNGV 494
           RL ++ K   F           L  TP   +   + +E   +    EKP +     G+G 
Sbjct: 598 RLSAKSKLAEFEKIEDGKNLKILFETPQQSDGTPQALEGTPQ--ASEKPPAGSESTGHGQ 655

Query: 495 DQDPAEVE 502
           +   A  E
Sbjct: 656 EHGAASTE 663


>gi|239608430|gb|EEQ85417.1| nuclear matrix protein [Ajellomyces dermatitidis ER-3]
          Length = 663

 Score =  151 bits (382), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 133/528 (25%), Positives = 225/528 (42%), Gaps = 117/528 (22%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           + +E + R+    L+ T  ID      I N + +       + C   L   L+ +  D  
Sbjct: 66  AVVETAFRKKFYDLLATTSIDEPSFIHIWNLLDIVSIFSDNEQCEPGLLFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRTEDTVFCGRV 179

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDEEMDVSSNETE---ETDTE 194
            +FL + FP  +RS +N+  EF+ EN+T F        GG   MDV ++  E   +  TE
Sbjct: 180 FIFLFQSFPLGDRSSVNLRGEFHTENVTTFDKLPKAQDGGVTPMDVDASAKEVAGDAKTE 239

Query: 195 EVDKVKIDFN-----------------------------------FYKKFWSLQDYFRNP 219
                + + N                                    Y  FWSLQ+ F +P
Sbjct: 240 SAQASEQESNRQAAHPAADVGLQSSQPATKKPQEPEPSSTPSMDELYPIFWSLQNNFSSP 299

Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------DVQSSLNPSGDY 260
              ++  ++  F +  E  L+ F+    D                    V   +  +  +
Sbjct: 300 TSLFDPANFAAFKTGLEASLSIFQKVNTDMEARMTKGSEDARRGLKRRRVGDGMEMTNSF 359

Query: 261 FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE--------------- 304
             KYLT++ L +L+++D  FRR++L+Q LIL  +  S T K++ +               
Sbjct: 360 NPKYLTSRDLFELEVNDVAFRRHILVQSLILLDFVLSLTPKAKAKLADSTNKSVLYSYVL 419

Query: 305 -----GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
                 L   S + KW      ++ S ++Q   +G+ + ++V  +L  +++W +WK E C
Sbjct: 420 NDEDVNLCTASSKAKWASQMKTSIASYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEAC 478

Query: 360 PELKRPLTSITD------EDKKDEPDAKKKKTP----ELTKLWNSK--DNLEACKSAERD 407
           P ++R   SI D         K     + +  P    +L  L   K  +++   K  +R 
Sbjct: 479 PPIERTPVSIQDYLDTQATATKVSTSKRLRSAPLGSLDLNFLTEDKNINSVGRLKHPDRF 538

Query: 408 FTPSLESYFEEAIQ---QMDPAAAVEEQYKKVND-SNYAWRALRLLSR 451
            TP+L+SY          +D A + EE+ +     ++  WR LRL SR
Sbjct: 539 TTPALDSYMRGIADDEFNIDMAQSREEKDEATKARASKTWRTLRLSSR 586


>gi|303286871|ref|XP_003062725.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456242|gb|EEH53544.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 664

 Score =  151 bits (382), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 123/442 (27%), Positives = 196/442 (44%), Gaps = 58/442 (13%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           +   + L + L    +C+  +   +L    +  T+   + +F ++E           +  
Sbjct: 104 LPRLLDLSLTLSEWGVCDHGVVFTVLEQLVECCTIADAQDVFTWIEGKRTRLNADVLWRR 163

Query: 118 CKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG 177
            K  +LR CNDLLRRLS++ +    GR+LL L+  +  SERS LN+   +N  N+T    
Sbjct: 164 GKLIMLRTCNDLLRRLSKTNDATLRGRVLLLLSSLYALSERSALNLAGNYNRANVTTL-- 221

Query: 178 DEEMDVSSNETEET-----------------DTEEVDKVK------IDFNFYKKFWSLQD 214
            +E++V +N  E+                  D  E D         +D  FYK FWSLQ 
Sbjct: 222 -DELEVGTNGAEQGGGGGKGKGKGDDDNEDEDAMETDDAAKTNAPVVDVRFYKTFWSLQT 280

Query: 215 YFRNP-VQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGDYFA---KYLTNQKL 270
           +F+NP V       W  F +  + VL  F++++LD    +   + +  A   KYLT   L
Sbjct: 281 FFQNPNVALGVAGGWDAFYAAFQAVLDNFETHRLDASDETAATTAEESALGVKYLTAAPL 340

Query: 271 LDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQT 330
           L LQL D  FRR  LLQ  IL  +            ++K DQ    K       +     
Sbjct: 341 LQLQLRDPAFRRSFLLQCAILLGHCEGPPALP----KVKRDQVAKAKARV-MAATAATPP 395

Query: 331 PPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL------------------KRPLTSITDE 372
              G  F++ V   LK E +W  WK + C +                   KRP   +  +
Sbjct: 396 NARGVEFAEFVAAALKREVYWVGWKKDNCKDFQRASEPRPPPKPPPPVMRKRP--RLGTK 453

Query: 373 DKKDEPDAKKK-KTPELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVE 430
                P+ + K   PEL +LWN S DN  + +  + + T S+ ++ +  +  MDP A +E
Sbjct: 454 QPAVPPEKRVKLGNPELDRLWNLSVDNASSLREKKAEET-SMATFLQPVLDDMDPEAQIE 512

Query: 431 EQYKKVNDSNYAWRALRLLSRK 452
           E YK+ ND  + W+ALR+++++
Sbjct: 513 EAYKQKNDKVFVWKALRMIAKE 534


>gi|449296680|gb|EMC92699.1| hypothetical protein BAUCODRAFT_41548, partial [Baudoinia
           compniacensis UAMH 10762]
          Length = 591

 Score =  151 bits (382), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 134/501 (26%), Positives = 214/501 (42%), Gaps = 88/501 (17%)

Query: 32  KKSALEQSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFD 88
           K + +E + R     L+ +  I+      I N + +      + +C S L   L+ +  D
Sbjct: 64  KVNVVETAARGIFYGLLNSTPIEKPTFVEIWNLLDVLASCGDQGVCASELVCYLIEELLD 123

Query: 89  MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL--LRMCNDLLRRLSRSQNTVFCGRIL 146
             T   C  +F Y+E        + F    K NL  LR CN+LL+RLSR+++ +FCGR+ 
Sbjct: 124 SQTTAGCRIVFDYLESRRERLAAKDFH---KKNLIFLRSCNELLKRLSRAEDAIFCGRVF 180

Query: 147 LFLAKFFPFSERSGLNIISEFNVENITEFG----GDEEMDVSSN---------ETEETDT 193
            FL + FP  ++S +N+  EF+ EN+T+F       E+M+V            ET +  T
Sbjct: 181 FFLFQTFPLGDKSSVNLRGEFHNENVTKFAVPDEHGEKMEVDEKAGGAVTVKAETPQPPT 240

Query: 194 EEVDK--------------VKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVL 239
           +   K              V      Y  FW LQ  F  P + +  V +K F       L
Sbjct: 241 KPAGKGAPLKSPHKVPAEVVLSTGELYPIFWRLQQDFSEPTRLFAPVQFKTFKDGLSHTL 300

Query: 240 AAFKSYKL-----------DDVQSSLNPSG-----------DYFAKYLTNQKLLDLQLSD 277
             FK                 V+  L   G           +Y  KYLT++ L DL+LSD
Sbjct: 301 TKFKKTPTVVQTKATETDKRGVKRKLGEEGMDDEASKHFVDNYNPKYLTSRDLFDLELSD 360

Query: 278 TNFRRYVLLQFLILFQYFTSTV--------------KSRGEGLELKSDQEKWVKDTTETV 323
             F+R++L+Q LIL  +  S                KS      +  +  KW +DT + +
Sbjct: 361 LAFQRHILVQALILIDFLISLTDKAKKKLAVPEVNNKSLLYSFTIVEEDAKWCRDTRQAI 420

Query: 324 YSLIKQTPP--DGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTS--ITDEDKKDEP 378
              ++ TP   +G  + ++V+ +L  +++W +WK E CP + R P+++  I D  +    
Sbjct: 421 TDYLRATPTFEEGRFYHRMVETVLARDKNWVRWKLESCPSIVRDPVSTEQILDARRTARN 480

Query: 379 DAKKKKTPE-------LTKLWNSK-DNLEACKSAERDFTPSLESYFEEA-IQQMDPAAAV 429
            A+ +K P+       L+ L       LEA K   R   PS+E   E     Q+D   A+
Sbjct: 481 VARPRKVPQRPMGAIDLSFLAEGHGGGLEALKDPSRYTAPSVEELIEGVKADQLDLEMAM 540

Query: 430 EEQYKKVND---SNYAWRALR 447
           ++  K+      +N  WR LR
Sbjct: 541 DDAEKESIQNAITNKTWRVLR 561


>gi|440640377|gb|ELR10296.1| hypothetical protein GMDG_04680 [Geomyces destructans 20631-21]
          Length = 843

 Score =  151 bits (381), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 130/505 (25%), Positives = 225/505 (44%), Gaps = 93/505 (18%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           + +E ++R   + L+ T  I+      + N + +   L     C+  L   L+ +  D  
Sbjct: 67  AVVEIAVRDAFIDLLATYSIEQPQFVQVWNLLDIVSTLSDDGQCDPALLFWLIEELLDSQ 126

Query: 91  TLDKCEQLFYYVEVNVN--IWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
           T+  C ++  Y+E      I K    F   K  +LR CN+LLRRLSR+++T FCGR+ +F
Sbjct: 127 TIAGCRKILDYLESRRERIIAKD---FAGKKLAILRSCNELLRRLSRAEDTAFCGRVFIF 183

Query: 149 LAKFFPFSERSGLNIISEFNVENITEF----------GGDEEMDVSSNETEETDTEEV-- 196
           + + FP  ++S +N+  E++VEN+T F          G   E+D     T      ++  
Sbjct: 184 MFQSFPLGDKSSVNLRGEYHVENVTTFDQIPAKGETQGDQMEVDAGVEPTPTGPAADIPT 243

Query: 197 --DKVKIDFN---------------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVL 239
             +  K+ FN                Y  FWSLQ  F  P   ++   +K F       +
Sbjct: 244 SEEAKKVTFNAASTAQSEQPLSSDDLYPIFWSLQQSFSQPKGLFDTQRFKEFKEGLNLTI 303

Query: 240 AAFKSYKLD-DVQSSLNPSGD-------------------YFAKYLTNQKLLDLQLSDTN 279
            AF+S + + + + S  P  D                   +  KYLT++ L +L++ D  
Sbjct: 304 TAFRSVQNESNYRGSSKPVDDENRGTKRKRRQDDDDLAHIFNPKYLTSRDLFELEIKDLL 363

Query: 280 FRRYVLLQFLILFQYFTSTVKSRGEGL--------------ELKSDQE-KWVKDTTETVY 324
           FRR++L+Q LI+  +  S      E                ++ SD++ KW  DT +++ 
Sbjct: 364 FRRHILVQALIIMDFLISLGSKAKENQATTTKQNLSVMYQDQILSDEDVKWAVDTKKSIA 423

Query: 325 SLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDED--------KKD 376
             +KQ   DG  F ++V+ +L  ++HW +WK E CP ++RP  ++T E+        KK 
Sbjct: 424 DYLKQG-HDGPFFYRMVETVLSRDKHWVRWKIESCPSIERP--AVTPEEYLAAKATAKKI 480

Query: 377 EPDAKKKKTP----ELTKLWNSKD--NLEACKSAERDFTPSLESYFEE-AIQQMDPAAAV 429
             + + +  P    EL  L  +     +E  K+  R   P L S+ ++ A+  M+     
Sbjct: 481 TTNKRLRPKPMGSLELGFLSEADGIIGMERLKNPSRYSLPPLASFKDKIALDDMEIDMPT 540

Query: 430 EEQYKKV---NDSNYAWRALRLLSR 451
            +  K+      ++  WRALR+ S+
Sbjct: 541 NDDTKEAAIYGKASKTWRALRVASK 565


>gi|189205046|ref|XP_001938858.1| nuclear matrix protein [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187985957|gb|EDU51445.1| nuclear matrix protein [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 620

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 128/455 (28%), Positives = 218/455 (47%), Gaps = 79/455 (17%)

Query: 71  KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN---LLRMCN 127
           +D+C+  L ++L+ +  D  ++  C  +F ++E      ++    ++ KN    +LR+CN
Sbjct: 101 RDLCHPQLVLLLIEELLDSQSISGCRIVFNFLESR----RETIIAINSKNKDLVILRLCN 156

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDE 179
           +LLRRLSR+++ VFCGR+ +F+ + FP  ++S +N+   F+VEN+T F          D+
Sbjct: 157 ELLRRLSRAEDPVFCGRVYIFMFQSFPLGDKSSVNLRGNFHVENVTTFEDFLKEPKTDDD 216

Query: 180 EMDV------------SSNETEETDTEEVDKVK------IDFN-FYKKFWSLQDYFRNPV 220
            M V            S +E +  + EE DK K      +D +  Y  FWSLQ  F NP 
Sbjct: 217 SMQVDADDAVITPKEESKSEEKAVENEEPDKDKDAKPDTLDTDTLYPVFWSLQHSFSNPP 276

Query: 221 QCYNKVSWKMFTSYAETVLAAFKSY-KLDDVQSSLNPSG---------DYFA-----KYL 265
           + + + +++ F    E  LA FK   K+     S    G         D FA     KYL
Sbjct: 277 RLFEEDNFQQFQKSLEATLAKFKEVPKVIQAGDSERKRGQQTRDGDDYDAFANSFNPKYL 336

Query: 266 TNQKLLDLQLSDTNFRRYVLLQFLILFQY-FTSTVKSRGE------------GLELKSDQ 312
           T++ L  L+LSD  F+R++L+Q LIL  +  T T KS+ +               L+ + 
Sbjct: 337 TSRDLFKLELSDLAFQRHILVQALILIDFLLTLTEKSKSKPFYQNAQKAMQYNFTLREED 396

Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
            +W       + + + Q  PDG+ + ++V  +L  +++W +WK E C    R   + TD 
Sbjct: 397 TEWALGIKTAIVNYL-QEGPDGKFYYRMVDTVLSRDKNWVRWKMENCQPFTRDRVATTDF 455

Query: 373 DKKDEPDAKKKKTPELTK----------LWN--SKDNLEACKSAERDFTPSLESYFEEA- 419
            +      K   + + T+          L+N  ++  L   +   R   P  ESY ++  
Sbjct: 456 QQAKSGAQKAVVSKKATRPMGVPSTLDFLYNNEAEKGLSQLRQRNRFNVPKAESYAKKVK 515

Query: 420 IQQMD-PAAAVEEQYKKVND--SNYAWRALRLLSR 451
           + ++D   A  EE+ ++V +  S+Y WR LRL S+
Sbjct: 516 MTELDLDMADAEEEKRQVGEKKSSYVWRGLRLASK 550


>gi|303321039|ref|XP_003070514.1| hypothetical protein CPC735_062420 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240110210|gb|EER28369.1| hypothetical protein CPC735_062420 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 631

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 101/420 (24%), Positives = 185/420 (44%), Gaps = 89/420 (21%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+    L+ T  I+      + N + +       + C   L   L+ +  D  
Sbjct: 66  AAVETAFREKFYHLLATTSINKPEFVHMWNLLDVVSIFSDNERCEPGLIFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF------------------------------ 175
            +FL + FP  ++S +N+  E++ EN+T F                              
Sbjct: 180 FIFLFQSFPLGDKSSVNLRGEYHTENVTTFDEPPKHIAANEDADDVEMKDAEAQKPSDNS 239

Query: 176 ----------GGDEEMDVSSNETEETDTEEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYN 224
                     GG    ++    ++++   + +   ID +  Y  FW LQ  F  P + ++
Sbjct: 240 SKTDAAASSQGGSVSPEIDGQRSKQSTDHQAEDTSIDMDALYPVFWGLQANFSAPTRLFD 299

Query: 225 KVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD---------------------YFAK 263
              +  F    E+ +++F+   +D  +  +    +                     +  K
Sbjct: 300 HEHFASFKKGLESTISSFQKVSIDLEKRGMTKGSEETSRRGLKRKRGETGPETANTFNPK 359

Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS------------TVKSRGEGLELKSD 311
           YLT++ L +L+++D  FRR++L+Q LIL  +  S            T KS   G  L  +
Sbjct: 360 YLTSRDLFELEVNDVAFRRHILVQSLILLDFLISLSPKAKAKLADATNKSVLYGYVLSDE 419

Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD 371
             KW      ++ S ++Q   DG+ + ++V  +L  +++W +WK EGCP +++P   I D
Sbjct: 420 DAKWANQMKSSIASYLQQGL-DGKFYYRMVDTVLTRDKNWVRWKAEGCPPIEKPPIQIQD 478


>gi|322709414|gb|EFZ00990.1| nuclear matrix protein [Metarhizium anisopliae ARSEF 23]
          Length = 873

 Score =  148 bits (374), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 128/489 (26%), Positives = 218/489 (44%), Gaps = 101/489 (20%)

Query: 48  IKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE---- 103
           I++PD  +K+  N + +   L     C+  L   L+ +  D  T+  C  +F Y+E    
Sbjct: 103 IQSPDF-VKMW-NLLDILSILSDDGQCDPALLFWLVEELLDSQTISACRIVFDYLESRRE 160

Query: 104 -VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLN 162
            +    +KQ+         +LR CN+LLRRLSR+++T FCGR+ +FL + FP  +RS +N
Sbjct: 161 RIMSKHFKQKHLV------ILRSCNELLRRLSRAEDTAFCGRVYIFLFQSFPLGDRSSVN 214

Query: 163 IISEFNVENITEFGGDEEMDVSSNETE------------ETDTEEVDKVKIDFN------ 204
           +  E++VEN+T F    E D    E++             TDT+  +K  + F+      
Sbjct: 215 LRGEYHVENVTTFEESAEEDEQQPESKMEVDTPAETPKSSTDTKASNK-GVGFDSKNSEK 273

Query: 205 ------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQS-SL-NP 256
                  Y  FWSLQ+ F  P++ ++   +  F +  +  + AF++   ++  S SL NP
Sbjct: 274 PYDPDRLYPLFWSLQESFSQPLKLFDATHFNKFKTRLDLTIKAFQAIHQNEGPSKSLENP 333

Query: 257 SGD----------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
             +                +  KYLT++ L +L++SD +FRR++L+Q LI+  +  S   
Sbjct: 334 RRNLKRKREDEGPEDAPEAFNPKYLTSKDLFELEISDLSFRRHILVQALIITDFLLSLST 393

Query: 301 SRGEGL----------------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLI 344
              E L                +L  +  KW  +   T+   +K+   DG +F ++V+ +
Sbjct: 394 ETREKLAGLNTRSLNKSVLYNDQLSEENTKWATEMKFTISDYLKRG-IDGPYFYRMVETV 452

Query: 345 LKGEEHWNQWKNEGCPELKRPLTSI------------TDEDKKDEPDAKKKKTPELTKLW 392
           L  +++W  WK   CP ++R   S                 K+  P      + E  +  
Sbjct: 453 LARDKNWVFWKMTSCPSIQRDPVSPELFVEARTTAQKLATSKRLRPTPMGSVSLEFLRDM 512

Query: 393 NSKDNLEACKSAERDFTPSLESY----------FEEAIQQMDPAAAVEEQYKKVNDSNYA 442
           N + +L+  K+ ER   P LES+           E        AAA+  +  K      +
Sbjct: 513 NDESSLDQLKTPERYQLPELESFQRKIADDDFEIEMPTNDQTKAAAIAGKASK------S 566

Query: 443 WRALRLLSR 451
           WRALR+ +R
Sbjct: 567 WRALRIAAR 575


>gi|281201496|gb|EFA75705.1| putative protein serine/threonine kinase [Polysphondylium pallidum
           PN500]
          Length = 917

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 104/332 (31%), Positives = 152/332 (45%), Gaps = 53/332 (15%)

Query: 11  KSQLNVVSDKYALYKPTCDYDK----------KSALEQSIRQYLLKLIKTPDIDIKVIEN 60
           +SQ   + +  A + P+ D  K          +S +E  +RQ L  L K   I  + IE 
Sbjct: 381 ESQKPAIRESLARHLPSFDQSKVATDLQQAEWRSNIELPLRQGLNNLFKLESIKNEKIEQ 440

Query: 61  YISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS--C 118
            I L + +       +T  + +  D F+  ++ +  +LF  +E  V  + Q  F  S   
Sbjct: 441 LIELAIFIYDLGYVEATYLLYIFEDLFESRSISESIELFSLMESRVEFFSQDAFVGSGRA 500

Query: 119 KNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--- 175
           K  LL++CND+LRRLS+S N   CGRIL+F++   P S+ SG+N  S  N E+       
Sbjct: 501 KKQLLKLCNDMLRRLSKSTNPEPCGRILIFMSYLIPMSDPSGINSKSINNDEHSNSIELL 560

Query: 176 --------GGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVS 227
                       + D   NE+  T         +D NFY++FWSLQ +F NP Q  N V 
Sbjct: 561 LKQQQQDQNQLLDRDRDGNESGNT---------VDLNFYRQFWSLQSFFHNPYQLINNVP 611

Query: 228 ------------------WKMFTSYAETVLAAFKSY-KLDDVQSSLNPSGDYFAKYLTNQ 268
                             W  FT   E VL  F ++  LD++++   P   YF KYLT  
Sbjct: 612 SSGFVVAANASVIINKDRWNHFTQAVELVLTTFATHINLDELEAL--PKTHYFTKYLTGS 669

Query: 269 KLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
            L+ LQL D  FRR +L Q L  FQ+   T++
Sbjct: 670 NLMKLQLKDAIFRRNILTQLLFTFQFLEITIQ 701


>gi|296818073|ref|XP_002849373.1| nuclear matrix protein [Arthroderma otae CBS 113480]
 gi|238839826|gb|EEQ29488.1| nuclear matrix protein [Arthroderma otae CBS 113480]
          Length = 648

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 110/428 (25%), Positives = 187/428 (43%), Gaps = 109/428 (25%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+    L+ T  I+      I N + +       + C   L   L+ +  D  
Sbjct: 66  AAVETAFREIFYDLLATTSINDPSFMQIWNLLDIVSIFSDNEKCEPGLIFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFG--------GDEEMDVSSNET--------- 188
            +FL + FP  ++S +N+  E++ EN+T F         GDE MD+   E          
Sbjct: 180 FIFLFQSFPLGDKSSVNLRGEYHTENVTTFDLQPRAQVEGDEVMDIDKQENTTLLSTAKG 239

Query: 189 ---EETDTEEVDKVKIDFN------------------FYKKFWSLQDYFRNPVQCYNKVS 227
              E T      +  ID +                   Y  FWSLQ+ F  P++ ++   
Sbjct: 240 SQPESTTVHTPQESGIDSSVQRAEESYDDKPELELDALYSAFWSLQESFSRPIRLFDATH 299

Query: 228 WKMFTSYAETVLAAFK--SYKLD------------------------DVQSSLNPSGDYF 261
           +    S  E  L++F+  S KL+                        D+ +S NP     
Sbjct: 300 FASLKSGLEATLSSFRKVSSKLEVRGTTKGSDESRRGIKRKRGEGGSDLTNSFNP----- 354

Query: 262 AKYLTNQKLLDLQLSDTNFRRYVLLQFL------------------------ILFQYFTS 297
            KYLT++ L +L+++D +FRR++L+Q L                        +L+ Y  S
Sbjct: 355 -KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPKSKAKLADSTNKSVLYGYVLS 413

Query: 298 TVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNE 357
                   +   + + KW      ++ S + Q   DG+ + ++V  +L  +++W +WK E
Sbjct: 414 DEDLLSLAITKSAVKAKWATQMKSSIASYL-QKGADGKFYFRMVDTVLTRDKNWVRWKAE 472

Query: 358 GCPELKRP 365
           GCP ++RP
Sbjct: 473 GCPPIERP 480


>gi|395335087|gb|EJF67463.1| hypothetical protein DICSQDRAFT_41994, partial [Dichomitus squalens
           LYAD-421 SS1]
          Length = 687

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 113/346 (32%), Positives = 158/346 (45%), Gaps = 53/346 (15%)

Query: 72  DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLL 130
           D C+   P  +L D  +  T+  C  +F ++E       +       K   LLR  NDLL
Sbjct: 98  DACDQVFPFTVLQDLLEAQTISSCSHIFSWIEARAERLTEGMVPQKGKALVLLRTLNDLL 157

Query: 131 RRLSRS-QNTVFCGRILLFLAKFFPFSERSGLNIISEFN-VENITEFGGDEEMDVSSNET 188
           RRLS +   T+F GRIL FL++ FP  ERSG+N+  E+  V +     G E    S+ E 
Sbjct: 158 RRLSNTGATTIFTGRILTFLSRVFPLGERSGVNLRGEYGPVWDGPGTKGQEREKASATEK 217

Query: 189 EETDTEEV--DKVKIDFN----------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAE 236
           +E +  E   DK K+D +          FY  FW LQ  F  P       ++  F     
Sbjct: 218 QEGENGEGEKDKDKMDVDRRKEKEGGSDFYYTFWGLQLPFSRPPLFAEPGTFAAFKEAVS 277

Query: 237 TVLAAFKSYKLDDVQSSLN--------PSGD------------------YFAKYLTNQKL 270
            VL   K     D     N        PSG                   YFAKYLT+  L
Sbjct: 278 KVLPVIKEATAKDRALMGNKSNGIPTGPSGPLKRKRAAGVPESAAKSEYYFAKYLTSPDL 337

Query: 271 LDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLE----LKSDQEKWVKD 318
           LDL+++DT+FRR  L Q LIL  +   FT T K+     R   L+    L +   +WV++
Sbjct: 338 LDLEIADTHFRRQFLFQLLILLHHLLTFTKTAKATWASTRNRSLQIDFTLDTADAQWVQE 397

Query: 319 TTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
           T       ++QT P+G  FS+ V++IL+ E++W +WKNE C    R
Sbjct: 398 TITRATEELRQTSPNGRAFSETVQVILEREKNWVKWKNELCTPFDR 443


>gi|452839889|gb|EME41828.1| hypothetical protein DOTSEDRAFT_74028 [Dothistroma septosporum
           NZE10]
          Length = 639

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 132/506 (26%), Positives = 216/506 (42%), Gaps = 99/506 (19%)

Query: 35  ALEQSIRQYLLKLIKTPDID-IKVIENYISLCVELCMKDMCNST--LPIILLSDTFDMST 91
           A+E + RQ    LI T  I     +E +  L + L   D   ST  L + L+ +  D  T
Sbjct: 67  AVETAARQLFYSLITTCHIQQPGFVEVWNFLDIILLCGDQGKSTPELVVWLIEELLDSQT 126

Query: 92  LDKCEQLFYYVEVNVNIWKQQTFFMSCKNNL--LRMCNDLLRRLSRSQNTVFCGRILLFL 149
              C  +F Y++   +   Q+ F    K NL  LR CN+LLRRLSR+++ +FCGR+  FL
Sbjct: 127 THGCRTVFDYLDSRRDRLAQKDFH---KKNLVFLRSCNELLRRLSRAEDAIFCGRVFFFL 183

Query: 150 AKFFPFSERSGLNIISEFNVENITEF-------GGDEEMDVSSNETEETDTEEV------ 196
            + FP  ++S +N+  EF+ EN T+F          E MD  S   +    E        
Sbjct: 184 FQTFPLGDKSSVNLRGEFHTENTTKFDVVVEQSAAGERMDTDSKAGDLVAKEATPRGTPQ 243

Query: 197 -----------------------DKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTS 233
                                  +++      Y  FW LQ  F +P + +N+ ++ +F  
Sbjct: 244 PAGPEKPGSKAVPVKTPSKRTVEEEILTSSELYPIFWRLQHDFSDPTRLFNEGNFALFKK 303

Query: 234 YAETVLAAF----------------KSYKLDDVQSSLNPSG------DYFAKYLTNQKLL 271
              + +A F                +  K    ++ +NP+       +Y  KYLT++ L 
Sbjct: 304 GLASSIAKFQKTPTVVQTRSGDNDKRGTKRKTDENGVNPNASEHLVDNYNPKYLTSRDLF 363

Query: 272 DLQLSDTNFRRYVLLQFLILFQYFTSTV--------------KSRGEGLELKSDQEKWVK 317
           +L+LSD  F+R++L+Q LIL  +  S                KS      L  +  KW K
Sbjct: 364 ELELSDLAFQRHILVQALILIDFLLSLTEQAKKRLVELPTPNKSMLYAFTLSEENTKWAK 423

Query: 318 DTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTSITDEDKKD 376
            +   +   ++ T  DG  ++++V+ +L  +++W +WK E CP + R P+T  T+++   
Sbjct: 424 SSRSVIADYLRST-ADGAQYNRMVETVLARDKNWVRWKIESCPSIVRGPVT--TEQELAA 480

Query: 377 EPDAKKKKTP-----------ELTKL-WNSKDNLEACKSAERDFTPSLESYFEEA-IQQM 423
              A+    P           +L+ L   S   LE      R   PS+E   E     ++
Sbjct: 481 RKGARSAVRPRNVPDKPNGAMDLSFLDEGSGGGLEVLMDPSRYTAPSIEQLVEVVKTDKL 540

Query: 424 DPAAAVEEQYKKVNDS--NYAWRALR 447
           D   A  E+   + ++  N  WRALR
Sbjct: 541 DLDFADGEEKTSLENAMQNAKWRALR 566


>gi|240279266|gb|EER42771.1| nuclear matrix protein [Ajellomyces capsulatus H143]
 gi|325089536|gb|EGC42846.1| nuclear matrix protein [Ajellomyces capsulatus H88]
          Length = 654

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 133/527 (25%), Positives = 233/527 (44%), Gaps = 124/527 (23%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           + +E + R+    L+ T  ID      + N + +       + C   L   L+ +  D  
Sbjct: 66  AVVETAFREKFYDLLATTSIDEPSFIQMWNLLDIISIFSDNEQCEPGLLFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE------------EMDVSSNE-TEETD 192
            +FL + FP  +RS +N+  EF+ EN+T F  DE            +MD+S  E T +T 
Sbjct: 180 FIFLFQSFPLGDRSSVNLRGEFHTENVTTF--DELPKAQESSETPMDMDISEKEITADTK 237

Query: 193 TEE-----------------VDKVKIDF------------------NFYKKFWSLQDYFR 217
           TE                  V+ ++I                    + Y  FWSLQ  F 
Sbjct: 238 TESAQVTKQEPAGQITHPAAVESLQISKTDAMKPVESEPNSILSMDDLYPIFWSLQTNFS 297

Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------------DVQS 252
           +P   ++  +++ F +  E  L+ F+    D                         ++ +
Sbjct: 298 SPTSLFDPANFEAFKAGLEASLSVFQKVNADMEVRMMKGSEEARRGPKRRRIGDGTEMTN 357

Query: 253 SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLE---- 307
           S NP      KYLT++ L +L+++D  FRR++L+Q LIL  +  S T K++ +  +    
Sbjct: 358 SFNP------KYLTSRDLFELEVNDVAFRRHILVQSLILLDFVLSLTPKAKAKLADSTNK 411

Query: 308 -------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
                  L  +  KW      ++ + ++Q   +G+ + ++V  +L  +++W +WK E CP
Sbjct: 412 SVLYNYVLNDEDTKWASRMKTSIATYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEACP 470

Query: 361 ELKRPLTSITD------EDKKDEPDAKKKKTP----ELTKLWNSKDN--LEACKSAERDF 408
            ++    SI D         K     + + TP    +L  L   K++  ++  K  +R  
Sbjct: 471 PIEHTPVSIQDYLDTQTSAIKVSTSKRLRSTPLGSLDLNFLMEDKNSSRVDRLKHPDRFS 530

Query: 409 TPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
           TP+L+ Y          +D + + EE+ + +   ++  WR LRL SR
Sbjct: 531 TPALDFYLRGIADDEFNIDMSQSREEKDEAIKAKASKTWRTLRLSSR 577


>gi|403412199|emb|CCL98899.1| predicted protein [Fibroporia radiculosa]
          Length = 757

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 110/368 (29%), Positives = 173/368 (47%), Gaps = 70/368 (19%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQ 112
           + + + L +      +C+ +    +L D  +  T+  C  +F ++E     + V++  Q+
Sbjct: 78  LRSRLDLVLTFTEHGICDQSFVFHVLYDLLETQTIASCSHIFSWIEERMDRLTVDMVPQK 137

Query: 113 TFFMSCKNNLLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNIISEFNV-- 169
              +     LLR  NDLLRRLS+  NT +FCGRIL FL++ FP  ERSG+N+   +    
Sbjct: 138 GKALI----LLRALNDLLRRLSKMGNTTMFCGRILTFLSRVFPLGERSGVNLRGGYGPVW 193

Query: 170 ---------------------ENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKK 208
                                E   E G +++M+V   + EE +  E  K +   +FY  
Sbjct: 194 DGPGVKGSEGDIKMQTREEGGEKSAEGGEEDKMEVDGVQVEEQENREAKKNQQKADFYDT 253

Query: 209 FWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD---------VQSSLNPS-- 257
           FWSLQ  F  P       ++  F      +L   K     D         V ++  PS  
Sbjct: 254 FWSLQLPFSRPPLFSEPGTFVQFKEAVNKILPVIKEATTKDRALMGSKSLVGNAGGPSLK 313

Query: 258 ------------GD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVK 300
                       G   +FAKYLT+ +LLDL+++DT+FRR  L Q LIL  +   FT TVK
Sbjct: 314 RKRDSEGAEDNTGKEYFFAKYLTSPELLDLEIADTHFRRQFLFQLLILLNHLLTFTKTVK 373

Query: 301 S-----RGEGLE----LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
           +     R   L+    L+++  +WV+D        ++QT P G  F++ V++IL+ E++W
Sbjct: 374 ATWSTPRNRSLQIDFTLEAEDAQWVQDIVAKATEELRQTAPSGRAFAETVQIILEREKNW 433

Query: 352 NQWKNEGC 359
            +WKNE C
Sbjct: 434 VKWKNELC 441


>gi|336469264|gb|EGO57426.1| hypothetical protein NEUTE1DRAFT_80974 [Neurospora tetrasperma FGSC
           2508]
 gi|350291103|gb|EGZ72317.1| hypothetical protein NEUTE2DRAFT_111847 [Neurospora tetrasperma
           FGSC 2509]
          Length = 853

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 105/391 (26%), Positives = 184/391 (47%), Gaps = 56/391 (14%)

Query: 32  KKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFD 88
           + + +E ++R     LI T  ID      + N   +   L   + C+  L + L+ +  D
Sbjct: 75  RHAIIETAVRDTFSNLIATTSIDSPDFARVWNLFDILSILSDDEQCDPALLLWLVEELLD 134

Query: 89  MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
             T+  C ++F ++E        + F    K  +LR CN+LLRRLSR+ +  F GR+ ++
Sbjct: 135 SQTIAGCRKVFDFLESRRERITAKHFGQK-KLVILRTCNELLRRLSRALDPAFSGRVFIY 193

Query: 149 LAKFFPFSERSGLNIISEFNVENI---------TEFGGDEEMDV-------SSNETEE-- 190
           L + FP  +RS +N+  E++VEN+         TE    + MDV       S+ +T E  
Sbjct: 194 LFQSFPLGDRSSVNLRGEYHVENVTTWDQEPTKTEDASADRMDVDTETQGESAQKTGEQR 253

Query: 191 -------TDTEEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF 242
                  TD ++     +D    Y  FWSLQ+ F  P + +    +  F S  E  +A F
Sbjct: 254 STPKPGATDAKKQQDKALDLEALYPIFWSLQESFNQPKKLFEPAQFASFKSGLEATMATF 313

Query: 243 KSYKLDDVQSSLNP----------SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
           +S K D+ + +L            +  +  KYLT++ L  L++ D   RR VL+Q LI+ 
Sbjct: 314 QSIKSDESKRALKRKRGEGEDDELANGFNPKYLTSRDLFKLEIGDLALRRNVLVQALIIM 373

Query: 293 QYFTS----------TVKSRGEGL-----ELKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
           ++  S          ++K+  + +     +L  +   WV D  + +   +K   P+G +F
Sbjct: 374 EFLLSLSPKGKEKIASIKAPNKSVSHSDQQLSEEDTHWVLDMKKKIADYLKLG-PEGPYF 432

Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRPLTS 368
            ++++ +L  +++W  WK EGCP ++ P  S
Sbjct: 433 YRMIETVLSRDKNWVHWKVEGCPPIEMPAVS 463


>gi|225555030|gb|EEH03323.1| nuclear matrix protein [Ajellomyces capsulatus G186AR]
          Length = 654

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 132/527 (25%), Positives = 231/527 (43%), Gaps = 124/527 (23%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           + +E + R+    L+ T  ID      + N + +       + C   L   L+ +  D  
Sbjct: 66  AVVETAFREKFYDLLATTSIDEPSFIQMWNLLDIVSIFSDNEQCEPGLLFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++TVFCGR+
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRAEDTVFCGRV 179

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDE------------EMDVSSNE-TEETD 192
            +FL + FP  +RS +N+  EF+ EN+T F  DE            +MD+   E T +T 
Sbjct: 180 FIFLFQSFPLGDRSSVNLRGEFHTENVTTF--DELPKAQESSETPMDMDIPEKEITADTK 237

Query: 193 TEE-----------------VDKVKIDF------------------NFYKKFWSLQDYFR 217
           TE                  V+ ++I                    + Y  FWSLQ  F 
Sbjct: 238 TESAQVTKQEPAGQTTHPAAVESLQISKTDAMKPVESEPNSILSMDDLYPIFWSLQTNFS 297

Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------------DVQS 252
           +P   ++  ++  F +  E  L+ F+    D                         ++ +
Sbjct: 298 SPTSLFDPANFAAFKAGLEASLSVFQKVNADMEVRMMKGSEEARRGPKRRRIGDGTEITN 357

Query: 253 SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLE---- 307
           S NP      KYLT++ L +L+++D  FRR++L+Q LIL  +  S T K++ +  +    
Sbjct: 358 SFNP------KYLTSRDLFELEVNDVAFRRHILVQSLILLDFVLSLTPKAKAKLADSTNK 411

Query: 308 -------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP 360
                  L  +  KW      ++ + ++Q   +G+ + ++V  +L  +++W +WK E CP
Sbjct: 412 SVLYNYVLNDEDTKWASQMKTSIATYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEACP 470

Query: 361 ELKRPLTSITD------EDKKDEPDAKKKKTP----ELTKLWNSKDN--LEACKSAERDF 408
            ++    SI D         K     + + TP    +L  L   K++  ++  K  +R  
Sbjct: 471 PIEHTPVSIQDYLDTQTSAIKVSTSKRLRSTPLGSLDLNFLMEDKNSSRVDRLKHPDRFS 530

Query: 409 TPSLESYFEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSR 451
           TP+L+ Y          +D + + EE+ + +   ++  WR LRL SR
Sbjct: 531 TPALDFYLRGIADDEFNIDMSQSREEKDEAIKAKASKTWRTLRLSSR 577


>gi|330918120|ref|XP_003298095.1| hypothetical protein PTT_08697 [Pyrenophora teres f. teres 0-1]
 gi|311328880|gb|EFQ93789.1| hypothetical protein PTT_08697 [Pyrenophora teres f. teres 0-1]
          Length = 621

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 124/456 (27%), Positives = 216/456 (47%), Gaps = 80/456 (17%)

Query: 71  KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN---LLRMCN 127
           +D+C+  L ++L+ +  D  ++  C  +F ++E      ++    ++ KN    +LR+CN
Sbjct: 101 RDLCHPQLVLLLIEELLDSQSISGCRIVFNFLESR----RETIIAINSKNKDLVILRLCN 156

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDE 179
           +LLRRLSR+++ VFCGR+ +F+ + FP  ++S +N+   F+VEN+T F          D+
Sbjct: 157 ELLRRLSRAEDPVFCGRVYIFMFQSFPLGDKSSVNLRGNFHVENVTTFEDFLKESKSDDD 216

Query: 180 EMDVSSNET------EETDTEE-------------VDKVKIDFN-FYKKFWSLQDYFRNP 219
            M V +++       EE  +EE                  +D +  Y  FWSLQ  F NP
Sbjct: 217 SMQVDADDAVISTPKEEAKSEEKAVENDEPDKDQDAKPTTLDTDTLYPVFWSLQHSFSNP 276

Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSY-KLDDVQSSLNPSG---------DYFA-----KY 264
            + + + +++ F    E  LA FK   K+     S    G         D FA     KY
Sbjct: 277 PRLFEEDNFQQFQKSLEATLAKFKEVPKVIQAGDSERKRGQQTRDGDDYDAFANSFNPKY 336

Query: 265 LTNQKLLDLQLSDTNFRRYVLLQFLILFQY-FTSTVKSRGE------------GLELKSD 311
           LT++ L  L+LSD  F+R++L+Q LIL  +  T T KS+ +               L  +
Sbjct: 337 LTSRDLFKLELSDLAFQRHILVQALILIDFLLTLTEKSKSKPFYQNAQKAMQYNFTLGEE 396

Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD 371
           + +W       + + + Q  PDG+ + ++V  +L  +++W +WK E C    R   + TD
Sbjct: 397 ETEWALGIKTAIVNYL-QEGPDGKFYYRMVDTVLSRDKNWVRWKMENCQPFTRDRVATTD 455

Query: 372 EDKKDEPDAKKKKTPELTK----------LWN--SKDNLEACKSAERDFTPSLESYFEEA 419
             +      K   + + T+          L+N  ++  L   +  +R   P  ESY ++ 
Sbjct: 456 FQQAKSGAQKAVVSKKATRPMGVPSTLDFLYNNEAEKGLLQLRQRDRFNVPKAESYAKKV 515

Query: 420 -IQQMD-PAAAVEEQYKKVND--SNYAWRALRLLSR 451
            +  +D   A  EE+ ++V +  S+Y WR LR+ S+
Sbjct: 516 KMTDLDLDMADAEEEKRQVGEKKSSYVWRGLRVASK 551


>gi|145342784|ref|XP_001416269.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576494|gb|ABO94562.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 777

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 111/438 (25%), Positives = 199/438 (45%), Gaps = 73/438 (16%)

Query: 73  MCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRR 132
           M  +T+ I+L     +  T+ + E +  + + N    ++ T +   K   LRM   L +R
Sbjct: 46  MIFTTIEIVL-----EACTIAEAEDVLLWCDENKTRLRETTIWKKGKLPTLRMLIALAKR 100

Query: 133 -LSRSQNTVFC-GRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDE----------- 179
            + RS +     GR L+ ++ F   S+RSG+N   + N E    F  +E           
Sbjct: 101 CVGRSSDEARVKGRALVLISWFMGLSDRSGMNKNGDSNPETTAIFEQEEWLREELEDGVG 160

Query: 180 -------EMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNK-VSWKMF 231
                  +M+V S E +    ++ ++  +D  F++ FWSLQ YF++P    +K  +W  F
Sbjct: 161 VDADAKIKMEVDSGEGDVAADDDDNEFGVDDAFHRTFWSLQQYFQDPPTTMSKNGAWNSF 220

Query: 232 TSYAETVLAAFKSYKLDDVQSSLN--PSGD----YFAKYLTNQKLLDLQLSDTNFRRYVL 285
            +    VL  F+++ L +   +++  P  +       ++LT ++LL LQ+ D  FRR +L
Sbjct: 221 HAILCVVLDKFEAFPLGETAGTIDEEPESEKRKMIGHRFLTARRLLPLQMQDFVFRRQIL 280

Query: 286 LQFLILFQYFTSTVKSRGEGLELKSDQEKW--------VKDTTETVYSLIKQTPPDGEHF 337
           +Q +   QY                D EK+        V + +E V  ++K+T P+    
Sbjct: 281 IQAVFFLQYM---------------DNEKFKEHVVDDEVTELSERVMKILKRTGPNAVAD 325

Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK------------- 384
           + +++  L+ E  W +WK +GCP  ++       E   D  D K                
Sbjct: 326 AGIIENALEDELSWRKWKEQGCPSFEKQPIDFETEPMPDNWDKKYSNWPPDQYPPDDPNL 385

Query: 385 -----TPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDS 439
                 PEL +LWN  ++     S +     S+E + +  +++MDP+  VE +Y++VND 
Sbjct: 386 KHDFDHPELNRLWNLGEDDPNGTSVDALKEVSVEEFLKPCLEEMDPSQQVEAEYRRVNDP 445

Query: 440 NYAWRALRLLSRKCPHFF 457
            + W+A+R+LS+   H F
Sbjct: 446 TFRWKAMRMLSKNHLHLF 463


>gi|451996956|gb|EMD89422.1| hypothetical protein COCHEDRAFT_1196301 [Cochliobolus
           heterostrophus C5]
          Length = 620

 Score =  145 bits (366), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 127/462 (27%), Positives = 213/462 (46%), Gaps = 92/462 (19%)

Query: 71  KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN---LLRMCN 127
           +D+C+  L ++L  +  D  ++  C  +F ++E      ++    ++ KN    +LR+CN
Sbjct: 101 RDLCSPQLVLLLTEELLDSQSISGCRVVFNFLESR----REAIIAINSKNKDLVILRLCN 156

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDE 179
           +LLRRLSR+++ VFCGR+ +F+ + FP  ++S +N+   F+VENIT F          DE
Sbjct: 157 ELLRRLSRAEDPVFCGRVYIFMFQSFPLGDKSSVNLRGNFHVENITTFEDFLKDSQPQDE 216

Query: 180 EMDVSSNET-----EETD-------TEEVDKVK------IDFN-FYKKFWSLQDYFRNPV 220
            M V  +++     EE D       T E+D+ K      +D +  Y  FWSLQ  F NP 
Sbjct: 217 AMQVDGDDSVPKIREEPDSGETTTQTGEIDEAKDGKPKTLDVDTLYPVFWSLQHSFSNPP 276

Query: 221 QCYNKVSWKMFTSYAETVLAAFKSYK----------------------LDDVQSSLNPSG 258
           + + +  +K F    E  LA FK                          D   SS NP  
Sbjct: 277 RLFEEDHFKQFQQGLEATLAKFKEVPKVIQAGDSERKKGQVSKDEGEDYDAFASSFNP-- 334

Query: 259 DYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY-FTSTVKSRGE------------G 305
               KYLT++ L  L+LSD  F+R++L+Q LIL  +  T T KS+ +             
Sbjct: 335 ----KYLTSRDLFRLELSDLAFQRHILVQALILIDFLLTLTEKSKSKPFYQNAQKAMQYN 390

Query: 306 LELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
             L ++Q +W       + + + Q  PDG+ + ++V  +L  +++W +WK E C    R 
Sbjct: 391 FTLGAEQTEWALGIKNAIANYL-QEGPDGKFYYRMVDTVLSRDKNWVRWKMENCQPFTRD 449

Query: 366 LTSITDEDKK---------DEPDAKKKKTPELTK-LWN--SKDNLEACKSAERDFTPSLE 413
               TD  +           +  A+    P +   L+N  ++  L   + ++R   P  E
Sbjct: 450 RVPTTDLQQAKTGAQKAVVSKKAARPMGVPSMFDFLYNNEAEKGLSQLRQSDRFKVPDAE 509

Query: 414 SYFEEAIQQ---MDPAAAVEEQYK-KVNDSNYAWRALRLLSR 451
            Y +    +   MD A   EE+ + + + ++  WR LR+ S+
Sbjct: 510 WYGKRVQMRDLDMDMADGDEEKRQLEESKASDVWRGLRVASK 551


>gi|393213320|gb|EJC98817.1| hypothetical protein FOMMEDRAFT_114065 [Fomitiporia mediterranea
           MF3/22]
          Length = 768

 Score =  144 bits (364), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 177/387 (45%), Gaps = 81/387 (20%)

Query: 61  YISLCVELCM------KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTF 114
           Y SLC  L +      ++ C+      +L D  +  T+  C  +F ++E       +   
Sbjct: 76  YDSLCDRLDVILTFTEREACDPMFMFTVLEDLLETQTVASCSHIFSWIERRSARLTEGMV 135

Query: 115 FMSCKN-NLLRMCNDLLRRLSRSQNTV-FCGRILLFLAKFFPFSERSGLNIISEFNVE-- 170
               K   LLRM NDLLRRLSR+ NT  FCGRIL FL+  FP  +RSG+N+  ++  +  
Sbjct: 136 PQKGKALVLLRMLNDLLRRLSRAGNTTTFCGRILTFLSAVFPLGDRSGVNLRGDYGPQWE 195

Query: 171 --NITEFGGDEEMDV----SSNETEETDT------------EEVDKVKID---------- 202
             +      +E+M+V    S  E  + D             +  DK+  D          
Sbjct: 196 PVSFKRAAVEEKMEVDEAASGREAGKVDAEVKAEVKEEEKGDASDKMDEDRPQTQAPAAE 255

Query: 203 -------FNFYKKFWSLQDYFRNP-VQCYNKVSWKMFTSYAETVLAAFKSYKLDD----- 249
                     Y  FWSLQ  F  P +      ++  F +   TV+   K   + +     
Sbjct: 256 KTEEEKKEELYNTFWSLQLPFSRPSIFATEPNAFAEFKTAVNTVMPVIKEATIKERAMMG 315

Query: 250 ---VQSSL------NPS-----GD---YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF 292
              V  +L       P+     GD   +FAK+LT+ +LLDL+++DT FRR VL Q LIL 
Sbjct: 316 SKVVAGALKRKRESTPTATMDDGDRKYFFAKFLTSPELLDLEIADTQFRRQVLFQLLILL 375

Query: 293 QY---FTSTVK-----SRGEGLE----LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQV 340
            +   FT+  K      R   L+    L+ D  KWV++T   VY  ++QT P+ + F++ 
Sbjct: 376 THVLQFTAAEKQNWATPRNRSLQMDFTLQPDDAKWVQETITRVYDEMRQTSPNAKAFTET 435

Query: 341 VKLILKGEEHWNQWKNEGC-PELKRPL 366
           V+  L+ E++W +WKN+ C P  K PL
Sbjct: 436 VQTTLEREKNWVRWKNDLCSPFDKAPL 462


>gi|19075592|ref|NP_588092.1| THO complex subunit (predicted) [Schizosaccharomyces pombe 972h-]
 gi|31077065|sp|Q9URT2.1|YJU3_SCHPO RecName: Full=Uncharacterized protein P25A2.03
 gi|6562903|emb|CAB62828.1| THO complex subunit (predicted) [Schizosaccharomyces pombe]
          Length = 752

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 125/448 (27%), Positives = 197/448 (43%), Gaps = 72/448 (16%)

Query: 53  IDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQ 112
           +D+ VI N I+       +  C+  LP ++L +  D+ T+++C +L+ Y E   ++ K  
Sbjct: 83  LDLLVIGNEIN-------ESFCDHLLPFLILEELMDIHTVNECAKLYEYFETRPSLMKGI 135

Query: 113 TFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENI 172
                    LLR+ N+LLRRLSR +N+ FCGRI + L+K FP  ERSG N+  ++N   +
Sbjct: 136 VSNRGRGPVLLRISNELLRRLSRQENSSFCGRIDILLSKAFPPEERSGANLRGDYNT--V 193

Query: 173 TEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFT 232
             FG  E    S+  ++ TD     K+   F  Y   W LQ    NP +     +   F 
Sbjct: 194 HSFGKVELSPPSTPISDRTDLSYHKKLNTLFTAY---WDLQCMCSNPPKLLASDTLPKFI 250

Query: 233 SYAETVLAAFKSY------------KLDDVQSSL------NPSGDYFAKYLTNQKLLDLQ 274
             A + + AF+S              +D   SSL           + +KY+ ++ L + Q
Sbjct: 251 DAAGSAIQAFESILQNTFFNGKSNPTIDPNSSSLLSEKYITLDKGFPSKYIYSRSLFEYQ 310

Query: 275 LSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDG 334
           LSD +FR   +LQ +I+F +     K R E   L    EKW   T + V  ++  +  D 
Sbjct: 311 LSDEDFRLQAILQLIIIFDFLLDHSKERIERRTL----EKW---TNKAVIPIVILSDEDT 363

Query: 335 EHFSQV-------------------VKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKK 375
              +++                   +K I+  E +W  WK  GCP L++PL      D+ 
Sbjct: 364 SKLNELSKEAYSFLHTARCGSVQRTIKEIIHIEGNWKLWKGLGCPSLEKPLVDKAAIDEA 423

Query: 376 DEPDAKKKKTP----------ELTKLWN--SKDNLEACKSAERDFTPSLESYFEEA-IQQ 422
            E   K   TP           L++LW    ++ L+  K  ER   PS ES+       +
Sbjct: 424 VEGLKKLTNTPVKLRFAMGNAALSRLWEQAGENTLDDLKKEERYRIPSPESFLSGVKADK 483

Query: 423 MDPAAAVEEQYKKVNDSNYA---WRALR 447
            +   AV +  K  ++ + A   WRA R
Sbjct: 484 FEIEEAVRDDDKHFHEQSLATKTWRAFR 511


>gi|85109865|ref|XP_963126.1| hypothetical protein NCU07865 [Neurospora crassa OR74A]
 gi|28924778|gb|EAA33890.1| predicted protein [Neurospora crassa OR74A]
          Length = 868

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 107/404 (26%), Positives = 184/404 (45%), Gaps = 69/404 (17%)

Query: 32  KKSALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFD 88
           + + +E ++R    KLI T  ID      + N   +   L   + C+  L + L+ +  D
Sbjct: 75  RHAIIETAVRDTFSKLIATTSIDSPDFARVWNLFDILSILSDDEQCDPALLLWLVEELLD 134

Query: 89  MSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLF 148
             T+  C ++F ++E        + F    K  +LR CN+LLRRLSR+ +  F GR+ ++
Sbjct: 135 SQTIAGCRKVFDFLESRRERITAKHFGQK-KLVILRTCNELLRRLSRALDPAFSGRVFIY 193

Query: 149 LAKFFPFSERSGLNIISEFNVENI---------TEFGGDEEMDV-------SSNETEE-- 190
           L + FP  +RS +N+  E++VEN+         TE    + MDV       S+ +T E  
Sbjct: 194 LFQSFPLGDRSSVNLRGEYHVENVTTWDQEPTKTEDASADRMDVDTETQGESAQKTGEQR 253

Query: 191 -------TDTEEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF 242
                  TD ++     +D    Y  FWSLQ+ F  P + +    +  F S  E  +A F
Sbjct: 254 STPKPGATDAKKQQDKALDLEALYPIFWSLQESFNQPKKLFEPAQFASFKSGLEATMATF 313

Query: 243 KSYKLDDVQSSLNP-----------------------SGDYFAKYLTNQKLLDLQLSDTN 279
           +S K+D    S +                        +  +  KYLT++ L  L++ D  
Sbjct: 314 QSIKVDQSARSRDSKDKSDESKRALKRKRGEGEDDELANGFNPKYLTSRDLFKLEIGDLA 373

Query: 280 FRRYVLLQFLILFQYFTS----------TVKSRGEGL-----ELKSDQEKWVKDTTETVY 324
            RR VL+Q LI+ ++  S          ++K+  + +     +L  +   WV D  + + 
Sbjct: 374 LRRNVLVQALIIMEFLLSLSPKGKEKIASIKAPNKSVSHSDQQLSEEDTNWVLDMKKKIA 433

Query: 325 SLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS 368
             +K   P+G +F ++V+ +L  +++W  WK EGCP ++ P  S
Sbjct: 434 DYLKLG-PEGPYFYRMVETVLSRDKNWVHWKVEGCPPIEMPAVS 476


>gi|154273190|ref|XP_001537447.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150415959|gb|EDN11303.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 649

 Score =  142 bits (359), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 129/526 (24%), Positives = 227/526 (43%), Gaps = 115/526 (21%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           + +E + R+    L+ T  ID      + N + +       + C   L   L+ +  D  
Sbjct: 66  AVVETAFREKFYDLLATTSIDEPSFIQMWNLLDIVSIFSDNEQCEPGLLFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
           T+D C ++F Y+E        ++        +LR CN+LLRRLSR+++TVFCGR+ +FL 
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKSLI------ILRSCNELLRRLSRAEDTVFCGRVFIFLF 179

Query: 151 KFFPFSERSGLNIISEFNVENITEF--------GGDEEMDVSSNE---TEETDTEE---- 195
           + FP  +RS +N+  EF+ EN+T F          +  MD+   E   T +T TE     
Sbjct: 180 QSFPLGDRSSVNLRGEFHTENVTTFDELPKAQESSETPMDIDIPEKEITADTKTESAQVT 239

Query: 196 -------------VDKVKIDF------------------NFYKKFWSLQDYFRNPVQCYN 224
                        V+ ++I                    + Y  FWSLQ  F +P   ++
Sbjct: 240 KQEPAGQITHPAAVESLQISKTDAMKPVESEPNSILSMDDLYPIFWSLQTNFSSPTSLFD 299

Query: 225 KVSWKMFTSYAETVLAAFKSYKLD-------------------------DVQSSLNPSGD 259
             ++  F +  E  L+ F+    D                         ++ +S NP   
Sbjct: 300 PSNFAAFKAGLEASLSVFQKVNADMEVRMMKGSEEARRGPKRRRIGDGTEMTNSFNP--- 356

Query: 260 YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLE----------- 307
              KYLT++ L +L+++D  FRR++L+Q LIL  +  S T K++ +  +           
Sbjct: 357 ---KYLTSRDLFELEINDVAFRRHILVQSLILLDFVLSLTPKAKAKLADSTNKSVLYNYV 413

Query: 308 LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLT 367
           L  +  KW      ++ + ++Q   +G+ + ++V  +L  +++W +WK E CP ++    
Sbjct: 414 LNDEDTKWASQMKTSIATYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEACPPIEHTPV 472

Query: 368 SITD------EDKKDEPDAKKKKTP----ELTKLWNSKDN--LEACKSAERDFTPSLESY 415
           SI D         K     + + TP    +L  L   K++  ++  K  +R  TP+L+ Y
Sbjct: 473 SIQDYLDTQTSAIKVSTSKRLRSTPLGSLDLNFLMEDKNSSRVDRLKHPDRFSTPALDFY 532

Query: 416 FEEAIQ---QMDPAAAVEEQYKKVN-DSNYAWRALRLLSRKCPHFF 457
                     +D + + EE+ + +   ++  WR LRL SR     F
Sbjct: 533 LRGIADDEFNIDMSQSREEKDEAIKAKASKTWRTLRLSSRSKLALF 578


>gi|451847862|gb|EMD61169.1| hypothetical protein COCSADRAFT_174485 [Cochliobolus sativus
           ND90Pr]
          Length = 620

 Score =  142 bits (358), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 107/366 (29%), Positives = 174/366 (47%), Gaps = 76/366 (20%)

Query: 71  KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN---LLRMCN 127
           +D+C+  L ++L  +  D  ++  C  +F ++E      ++    ++ KN    +LR+CN
Sbjct: 101 RDLCSPQLVLLLTEELLDSQSISGCRVVFNFLESR----REAIIAINSKNKDLVILRLCN 156

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDE 179
           +LLRRLSR+++ VFCGR+ +F+ + FP  ++S +N+   F+VENIT F          DE
Sbjct: 157 ELLRRLSRAEDPVFCGRVYIFMFQSFPLGDKSSVNLRGNFHVENITTFEDFLKDSQPQDE 216

Query: 180 EMDVSSNET-----EETDTEE-------VDKVK------IDFN-FYKKFWSLQDYFRNPV 220
            M V  +++     EE ++EE        D+ K      +D +  Y  FWSLQ  F NP 
Sbjct: 217 SMQVDGDDSVSKIREEPESEERTAQTGDGDEAKDGKPKTLDVDTLYPVFWSLQHSFSNPP 276

Query: 221 QCYNKVSWKMFTSYAETVLAAFKSYK----------------------LDDVQSSLNPSG 258
           + + +  +K F    E  LA FK                          D   SS NP  
Sbjct: 277 RLFEEDHFKQFQQGLEATLAKFKEVPKVIQAGDSERKKGQVSKDEGEDYDAFASSFNP-- 334

Query: 259 DYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY-FTSTVKSRGE------------G 305
               KYLT++ L  L+LSD  F+R++L+Q LIL  +  T T KS+ +             
Sbjct: 335 ----KYLTSRDLFRLELSDLAFQRHILVQALILIDFLLTLTEKSKSKPFYQNAQKAMQYN 390

Query: 306 LELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
             L ++Q +W       + + + Q  PDG+ + ++V  +L  +++W +WK E C    R 
Sbjct: 391 FTLGAEQTEWALGIKNAIANYL-QEGPDGKFYYRMVDTVLSRDKNWVRWKMENCQPFTRD 449

Query: 366 LTSITD 371
               TD
Sbjct: 450 RVPTTD 455


>gi|409052224|gb|EKM61700.1| hypothetical protein PHACADRAFT_248470 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 781

 Score =  142 bits (357), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 109/402 (27%), Positives = 179/402 (44%), Gaps = 69/402 (17%)

Query: 27  TCDYDKKSALEQSIRQYLLKLIKTPDIDIK--------VIENYISLCVELCMKDMCNSTL 78
           T   ++++  E ++R  +  L  T  + +K         ++  + L +    KD C+   
Sbjct: 40  TSADNRRTQWEYALRHEVFTLAATEGLALKDPSTAYYDNLKLRLDLVLSFTEKDACDQIF 99

Query: 79  PIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRLSRS- 136
           P  LL +  +  T+  C  +F ++E   +   +       K   LLR  NDLLRRLS++ 
Sbjct: 100 PFNLLYELLETQTIPSCSHIFSWIESRADRLTENMVPQKGKALVLLRTLNDLLRRLSKTG 159

Query: 137 QNTVFCGRILLFLAKFFPFSERSGLNIISEFN-------------------------VEN 171
             T+FCGRIL FL++ FP  ERSG+N+  E+                          +E+
Sbjct: 160 STTLFCGRILTFLSQVFPLGERSGVNLRGEYGPTWDGPGVKEETEDVKMDTVEEVKQLED 219

Query: 172 ITEFGGDEEMDVSSNET-EETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKM 230
             +  G+   D +  E  EE+   +  K +    FY+ FWSLQ  F  P       ++  
Sbjct: 220 KMQVDGEAPKDGAKEEANEESPNPDEKKQQQKDEFYRTFWSLQLPFSRPSVFTEPQTFDQ 279

Query: 231 FTSYAETVL---------------------AAFKSYKLDDVQSSLNPSGDYFAKYLTNQK 269
           F +    +L                     ++ K  +  +V       G  FAKYLT+ +
Sbjct: 280 FKTSMNKILPVIKEATAKERALMGSKANSGSSLKRRREPEVNIDTANQGYLFAKYLTSLE 339

Query: 270 LLDLQLSDTNFRRYVLLQ----------FLILFQYFTSTVKSRGEGLE--LKSDQEKWVK 317
           LLDL+++DT+FRR  L Q          F    +   ST ++R   ++  L  D EKWV 
Sbjct: 340 LLDLEIADTHFRRQFLFQLLILLNHLLTFTKATKITWSTPRNRSLQMDFTLGPDDEKWVH 399

Query: 318 DTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
           +  +     ++QT P+G  F++ V +IL+ E++W +WKNE C
Sbjct: 400 EAIQKTNEELRQTTPNGRVFAETVAVILEREKNWIRWKNELC 441


>gi|367026702|ref|XP_003662635.1| hypothetical protein MYCTH_2303495 [Myceliophthora thermophila ATCC
           42464]
 gi|347009904|gb|AEO57390.1| hypothetical protein MYCTH_2303495 [Myceliophthora thermophila ATCC
           42464]
          Length = 712

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 119/448 (26%), Positives = 193/448 (43%), Gaps = 86/448 (19%)

Query: 72  DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMC 126
           ++C+  L   L+ +  D  T+  C ++F ++E     +    +KQ+         +LR C
Sbjct: 117 ELCDPALLFWLVEELLDSQTIAGCRKVFDFLESRRERITAKHFKQKQLV------ILRTC 170

Query: 127 NDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSN 186
           N+LLRRLSR+ +  FCGR+ +F  + FP  ++S +N+  E++VEN+T +  D+E   +  
Sbjct: 171 NELLRRLSRALDPAFCGRVFIFTFQSFPLGDKSSVNLRGEYHVENVTTY--DQEPQPTDG 228

Query: 187 ETEETDTEEV---DKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
           E  + DT+     DK       Y  FW+LQ+ F  P + ++   +  F +  E  +  F 
Sbjct: 229 ERMDVDTDATPRKDKPMDPDALYPIFWALQESFNQPKKLFDPSHFAAFKAGLEATVTTFC 288

Query: 244 SY----------KL----DDVQSSLNPSGD---------YFAKYLTNQKLLDLQLSDTNF 280
           +           KL    DD + SL    D         +  KYLT++ L  L++SD  F
Sbjct: 289 TINPEKPSRAKEKLDKPDDDAKPSLKRKRDDVDDELANSFNPKYLTSRDLFKLEISDLAF 348

Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGL---------------ELKSDQEKWVKDTTETVYS 325
           RR +L+Q LI   +  S      E L               +L  +  KW      ++  
Sbjct: 349 RRNILVQALITLDFLLSLSPKAKEKLASVKTPNKSVTYSDQQLSEEDTKWATQMRGSITR 408

Query: 326 LIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKKT 385
            + Q   DG  F ++V  +L  +++W +WK E CP ++ P  S     + DE  A   + 
Sbjct: 409 YL-QRGVDGPFFERMVGTVLVRDKNWVRWKIENCPSIELPPMS---PQQFDEARATATRM 464

Query: 386 PELTKLWNSKDNL------------EACKSAERDFTPSLESY----------FEEAIQQM 423
               +L N   +L            E  +  ER   P L S+           E A    
Sbjct: 465 ARTRRLPNDSRSLEFLGDEDETSAMEKLQDPERYSLPELSSFRDAIDDADFDIERATDDE 524

Query: 424 DPAAAVEEQYKKVNDSNYAWRALRLLSR 451
             A A+E +  K      AW+ALRL S+
Sbjct: 525 SKALAMESKASK------AWKALRLASK 546


>gi|336271337|ref|XP_003350427.1| hypothetical protein SMAC_02140 [Sordaria macrospora k-hell]
 gi|380090949|emb|CCC11482.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 867

 Score =  141 bits (355), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 113/430 (26%), Positives = 189/430 (43%), Gaps = 78/430 (18%)

Query: 7   LNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDID---IKVIENYIS 63
           L  D SQ+  V+D     +        + +E ++R     L+ T  ID      + N   
Sbjct: 57  LGIDPSQVGEVNDAKRTQR-------HAIIETAVRDTFSSLVATTSIDSPDFARVWNLFD 109

Query: 64  LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
           +   L   + C+  L + L+ +  D  T+  C ++F ++E        + F    K  +L
Sbjct: 110 ILSILSDDEQCDPALLLWLVEELLDSQTIAGCRKVFDFLESRRERITAKHFGQK-KLVIL 168

Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENIT---------E 174
           R CN+LLRRLSR+ +  F GR+ ++L + FP  +RS +N+  E++VEN+T         E
Sbjct: 169 RTCNELLRRLSRALDPAFSGRVFIYLFQSFPLGDRSSVNLRGEYHVENVTTWDQEPAKAE 228

Query: 175 FGGDEEMDVSSNETEETDTEEVDKVK----------------IDFN-FYKKFWSLQDYFR 217
               + MDV      ET  +  D+                  +D    Y  FWSLQ+ F 
Sbjct: 229 DASADSMDVDMETQGETTQKTGDQRSTPKPGTADAKKHRDKALDLEALYPIFWSLQESFN 288

Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP--------------------- 256
            P + +    +  F S  E  +A F+S K+D    S +                      
Sbjct: 289 RPKKLFEPAQFASFKSGLEATMAIFQSIKVDQSARSRDSKDKSDESKRALKRKRGEGEDD 348

Query: 257 --SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELK----- 309
             +  +  KYLT++ L  L++ D   RR VL+Q LI+ ++  S V S+G+   ++     
Sbjct: 349 ELANGFNPKYLTSRDLFKLEIGDLALRRNVLVQALIIMEFLLS-VSSKGKEKIVRIKAPN 407

Query: 310 -----SDQE------KWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEG 358
                SDQ+       WV D  + +   +K   P+G +F ++V+ +L  +++W  WK EG
Sbjct: 408 KSVSHSDQQLSEEDTNWVLDMKKKIADYLK-IGPEGPYFYRMVETVLSRDKNWVHWKVEG 466

Query: 359 CPELKRPLTS 368
           CP ++ P  S
Sbjct: 467 CPPIEMPAVS 476


>gi|302667857|ref|XP_003025507.1| hypothetical protein TRV_00269 [Trichophyton verrucosum HKI 0517]
 gi|291189621|gb|EFE44896.1| hypothetical protein TRV_00269 [Trichophyton verrucosum HKI 0517]
          Length = 669

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 116/434 (26%), Positives = 193/434 (44%), Gaps = 110/434 (25%)

Query: 16  VVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCN 75
           V++  Y +        + +A+E + R+    L+ T  ID            E     + N
Sbjct: 100 VLAGDYGVSVGANQQTQYAAVETAFREIFYDLLATTSID------------EPSFMQIWN 147

Query: 76  STLPIILLSDT---FDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCN 127
               + + SD     D  T+D C  +F Y+E          +KQ++        +LR CN
Sbjct: 148 LLDIVSIFSDNEELLDSQTIDGCRIVFDYLESRRERNTAKHFKQKSLI------ILRSCN 201

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG--------DE 179
           +LLRRLSR+++TVFCGR+ +FL + FP  ++S +N+  E++ EN+T F          DE
Sbjct: 202 ELLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSSVNLRGEYHTENVTTFDAQPKTQAESDE 261

Query: 180 EMDVSSNE----------------TEETDTEEVD--------------KVKIDFNFYKKF 209
            MD+   E                T +T  E  +              ++++D   Y  F
Sbjct: 262 AMDIDIQEDETLVSNAKSTEQELTTAQTPLESTNDATPQRGNAPSIQPELELD-TLYAAF 320

Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK--SYKLD------------------- 248
           WSLQ+ F  P + ++   +    S  E  L++F+  S KL+                   
Sbjct: 321 WSLQESFSKPTRLFDTAHFASLKSGLEATLSSFRKVSSKLEVRSTTKGSDELRRGTKRKR 380

Query: 249 -----DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF----------- 292
                D+ +S NP      KYLT++ L +L+++D +FRR++L+Q LIL            
Sbjct: 381 GEGGADLPNSFNP------KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPKSK 434

Query: 293 -QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
            +   ST KS   G  L  +  KW      ++ S + Q   DG+ + ++V  +L  +++W
Sbjct: 435 AKLADSTNKSVLYGYVLSDEDAKWATQMRSSIASYL-QKGADGKFYFRMVDTVLTRDKNW 493

Query: 352 NQWKNEGCPELKRP 365
            +WK EGCP ++RP
Sbjct: 494 VRWKAEGCPPIERP 507


>gi|302496461|ref|XP_003010232.1| hypothetical protein ARB_03584 [Arthroderma benhamiae CBS 112371]
 gi|291173773|gb|EFE29592.1| hypothetical protein ARB_03584 [Arthroderma benhamiae CBS 112371]
          Length = 617

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 116/434 (26%), Positives = 193/434 (44%), Gaps = 110/434 (25%)

Query: 16  VVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCN 75
           V++  Y +        + +A+E + R+    L+ T  ID            E     + N
Sbjct: 48  VLAGDYGVSVGANQQTQYAAVETAFREIFYDLLATTSID------------EPSFMQIWN 95

Query: 76  STLPIILLSDT---FDMSTLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCN 127
               + + SD     D  T+D C  +F Y+E          +KQ++        +LR CN
Sbjct: 96  LLDIVSIFSDNEELLDSQTIDGCRIVFDYLESRRERNTAKHFKQKSLI------ILRSCN 149

Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGG--------DE 179
           +LLRRLSR+++TVFCGR+ +FL + FP  ++S +N+  E++ EN+T F          DE
Sbjct: 150 ELLRRLSRAEDTVFCGRVFIFLFQSFPLGDKSSVNLRGEYHTENVTTFDAQPKTQAESDE 209

Query: 180 EMDVSSNE----------------TEETDTEEVDK--------------VKIDFNFYKKF 209
            MD+   E                T +T  E  ++              +++D   Y  F
Sbjct: 210 VMDIDIQEDETLVSNAKSTEQEPTTAQTPLESTNEATPQRGNASGIQPELELD-TLYAAF 268

Query: 210 WSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK--SYKLD------------------- 248
           WSLQ+ F  P + ++   +    S  E  L++F+  S KL+                   
Sbjct: 269 WSLQESFSKPTRLFDTAHFASLKSGLEATLSSFRKVSSKLEVRSTTKGSDELRRGTKRKR 328

Query: 249 -----DVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF----------- 292
                D+ +S NP      KYLT++ L +L+++D +FRR++L+Q LIL            
Sbjct: 329 GEGGADLPNSFNP------KYLTSRDLFELEINDVSFRRHILVQALILLDFLLSLTPKSK 382

Query: 293 -QYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHW 351
            +   ST KS   G  L  +  KW      ++ S + Q   DG+ + ++V  +L  +++W
Sbjct: 383 AKLADSTNKSVLYGYVLSDEDAKWATQMRSSIASYL-QKGADGKFYFRMVDTVLTRDKNW 441

Query: 352 NQWKNEGCPELKRP 365
            +WK EGCP ++RP
Sbjct: 442 VRWKAEGCPPIERP 455


>gi|308800702|ref|XP_003075132.1| Nuclear matrix protein (ISS) [Ostreococcus tauri]
 gi|116061686|emb|CAL52404.1| Nuclear matrix protein (ISS) [Ostreococcus tauri]
          Length = 636

 Score =  139 bits (350), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 106/339 (31%), Positives = 173/339 (51%), Gaps = 46/339 (13%)

Query: 156 SERSGLNIISEFNVENITEFGGDEEM-DVSSNETEETDTE-EVDKVKIDFN-------FY 206
           S+RSGLN+    N      F  DE + + + N+ E  D + E +K +ID +       F+
Sbjct: 5   SDRSGLNLQGASNPNVTPIFEQDEWVREDAENDDETMDVDGEYEKGEIDMSTFGADGAFH 64

Query: 207 KKFWSLQDYFRNPVQCY-NKVSWKMFTSYAETVLAAFKSYKLDDVQSSL-----NPS-GD 259
           K FWSLQ Y++NP Q      +W  + +    ++  F+ + L +   +L     NP   D
Sbjct: 65  KTFWSLQTYYQNPTQAMCRDGAWNTYHAILREIMDKFEEHPLTETAKTLLDFHENPELRD 124

Query: 260 YFA-KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKD 318
               ++LT ++LL +QL D +FRR +L+Q     +  T T K +    E   D+E  V +
Sbjct: 125 LIGHRFLTARRLLPMQLRDYSFRRQILIQTAFFLEAMT-TEKFK----EHVVDEE--VIE 177

Query: 319 TTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL--------KRPLT--- 367
             E V  ++K+T PD +  +  ++ +L+ E  W +W+++GCP          K P+    
Sbjct: 178 ARERVMEILKRTGPDAQLAASFIEDVLEDEAGWRKWRDDGCPNFVREPIDFEKEPMPDNW 237

Query: 368 ----SITDEDKKDEPDAKKKKT---PELTKLWNSKDNLEACKSAERDF--TPSLESYFEE 418
               S+   D+  EPD + K+      L +LWN  D+    K+A  +F    S+E +F  
Sbjct: 238 DIKYSMWPPDQFPEPDPRYKEDFGDEALNRLWNIDDD--DVKNARSEFLVAESIEDFFRP 295

Query: 419 AIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
            +++MDP A VEEQY++VND  + WRA+R+L+R   H F
Sbjct: 296 CLEEMDPEAQVEEQYRRVNDEVFRWRAMRMLARSHLHLF 334


>gi|390603448|gb|EIN12840.1| hypothetical protein PUNSTDRAFT_97766 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 784

 Score =  138 bits (348), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 124/414 (29%), Positives = 187/414 (45%), Gaps = 94/414 (22%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           + + + + +    +D C+ T P   L D  +  T+  C  LF ++E   N   + T  M 
Sbjct: 79  LRDMLDVVLTFTEQDACDPTFPFTTLQDLIETHTIASCSHLFSWIESRAN---RLTAGMV 135

Query: 118 CKNN----LLRMCNDLLRRLSR-SQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV--E 170
            +      LLRM NDLLRRLS+    TVFCGRIL FL+  FP  ERSG+N+  E+    E
Sbjct: 136 PQKGKALVLLRMLNDLLRRLSKMGDATVFCGRILSFLSAVFPLGERSGVNLRGEYGPTWE 195

Query: 171 NITEF--------GG---DEEMDVSSNE-TEETDTEEVDKVKIDF-------------NF 205
            + E         GG   DE+M    N   E  DT    K+++D              +F
Sbjct: 196 GVMEAVVKEDGSSGGMDVDEKMKPPENALGEPGDT----KMQVDEENPSPSKSSSGSDDF 251

Query: 206 YKKFWSLQDYFRNP----------------------VQCYNKVSWKMFTSYAETVLAAFK 243
           Y  FW LQ  F  P                      +   N     +  S ++T  AA K
Sbjct: 252 YSTFWFLQLPFSKPPVFAEPGMFARFKTAVNKVLPVIHEANAKERLLIGSSSKTHAAAVK 311

Query: 244 S-YKLDDVQSSLNPSGD---------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQ 293
           S       + +++  GD         +FAK+LT+ +LLDL+++D++FRR  L Q LIL Q
Sbjct: 312 SQVTPKRKRDAIDTDGDEEDVGTNDYFFAKFLTSPELLDLEIADSHFRRQFLFQLLILLQ 371

Query: 294 Y---FTSTVKS-----RGEGLEL-------------KSDQE-KWVKDTTETVYSLIKQTP 331
           +   FT   K+     R   L++             KS  + +WV +T       ++QT 
Sbjct: 372 HLQTFTRNAKAAWYTPRNRSLQMDFTLETADPNVKGKSASDLQWVSETMHKAIDELRQTA 431

Query: 332 PDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTSITDEDKKDEPDAKKKK 384
           P+G  FS  V++IL+ E +W +WKNE C    R P     +E+ +D+   +K++
Sbjct: 432 PNGRGFSDTVRVILERERNWVRWKNELCAPFDREPWAQEIEEETEDDSKERKRR 485


>gi|336377042|gb|EGO05377.1| hypothetical protein SERLA73DRAFT_101182 [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 1079

 Score =  138 bits (348), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 121/385 (31%), Positives = 173/385 (44%), Gaps = 59/385 (15%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           +++ + L +     D   +T P  +L D  +  T+  C  +F ++E       +      
Sbjct: 79  LKDRLDLVLTFTEHDASEATFPFTVLQDLLETQTIASCSHVFSWIEARAARLTKGMVPQK 138

Query: 118 CKN-NLLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNIISEFN------- 168
            K   LLR  NDLLRRLS+  NT +FCGRIL FL+  FP  ERSG+N+  E+        
Sbjct: 139 GKALILLRTLNDLLRRLSKMGNTTLFCGRILTFLSGVFPLGERSGVNLRGEYGPMWEGVK 198

Query: 169 -VENITEFG-----GDEEMDVS-SNETEETDTEEVDKVK-IDFNFYKKFWSLQDYFRNPV 220
            VE   E        +++M+V  S E  E  T  VD VK     FY  FWSLQ  F  P 
Sbjct: 199 EVETKVEIAEPPKTPEDKMEVDDSKEGGEDKTGAVDCVKDKSSKFYTTFWSLQLPFSRPP 258

Query: 221 QCYNKVSWKMFTSYAETVLAAFKSYKLDD---------------------VQSSLNPSGD 259
                 ++  F      VL   K     D                     V+     S +
Sbjct: 259 LFATPTTFPDFKEAVNKVLPVIKEATTKDRALMGSRSATGSSSSLKRKREVELGEEASSN 318

Query: 260 --YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVK-----SRGEGLE-- 307
             +FAK+LT+  LLDL+++DT+FRR  L Q LIL  +   FT   K     SR   L+  
Sbjct: 319 EYFFAKFLTSPDLLDLEVADTHFRRQFLFQLLILLNHLLSFTKAAKATWLTSRNRSLQME 378

Query: 308 --LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
             L++ + +WV++T     + ++ T P+G  FS  V +IL+ E++W +WKN+ C    R 
Sbjct: 379 FTLETAEVQWVQETIAKASAELRSTAPNGPVFSDTVNVILEREKNWVKWKNDLCAPFDR- 437

Query: 366 LTSITDEDKKDEPDAKKKKTPELTK 390
                 E    E D KK    E TK
Sbjct: 438 ------EPWGTEVDGKKVGLEEATK 456


>gi|336390083|gb|EGO31226.1| hypothetical protein SERLADRAFT_412639 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 1229

 Score =  138 bits (347), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 121/386 (31%), Positives = 173/386 (44%), Gaps = 60/386 (15%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           +++ + L +     D   +T P  +L D  +  T+  C  +F ++E       +      
Sbjct: 112 LKDRLDLVLTFTEHDASEATFPFTVLQDLLETQTIASCSHVFSWIEARAARLTKGMVPQK 171

Query: 118 CKN-NLLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNIISEFN------- 168
            K   LLR  NDLLRRLS+  NT +FCGRIL FL+  FP  ERSG+N+  E+        
Sbjct: 172 GKALILLRTLNDLLRRLSKMGNTTLFCGRILTFLSGVFPLGERSGVNLRGEYGPMWEGVK 231

Query: 169 -VENITEFG-----GDEEMDVS-SNETEETDTEEVDKVK--IDFNFYKKFWSLQDYFRNP 219
            VE   E        +++M+V  S E  E  T  VD VK      FY  FWSLQ  F  P
Sbjct: 232 EVETKVEIAEPPKTPEDKMEVDDSKEGGEDKTGAVDCVKDKSSSEFYTTFWSLQLPFSRP 291

Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLDD---------------------VQSSLNPSG 258
                  ++  F      VL   K     D                     V+     S 
Sbjct: 292 PLFATPTTFPDFKEAVNKVLPVIKEATTKDRALMGSRSATGSSSSLKRKREVELGEEASS 351

Query: 259 D--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVK-----SRGEGLE- 307
           +  +FAK+LT+  LLDL+++DT+FRR  L Q LIL  +   FT   K     SR   L+ 
Sbjct: 352 NEYFFAKFLTSPDLLDLEVADTHFRRQFLFQLLILLNHLLSFTKAAKATWLTSRNRSLQM 411

Query: 308 ---LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
              L++ + +WV++T     + ++ T P+G  FS  V +IL+ E++W +WKN+ C    R
Sbjct: 412 EFTLETAEVQWVQETIAKASAELRSTAPNGPVFSDTVNVILEREKNWVKWKNDLCAPFDR 471

Query: 365 PLTSITDEDKKDEPDAKKKKTPELTK 390
                  E    E D KK    E TK
Sbjct: 472 -------EPWGTEVDGKKVGLEEATK 490


>gi|453083404|gb|EMF11450.1| nuclear matrix protein [Mycosphaerella populorum SO2202]
          Length = 637

 Score =  137 bits (346), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 130/493 (26%), Positives = 212/493 (43%), Gaps = 88/493 (17%)

Query: 40  IRQYLLKLIKTPDIDIKVIENYISL--CVELCM-KDMCNSTLPIILLSDTFDMSTLDKCE 96
           IR+     I + DI+     +  +L   V +C  +  C   L   LL +  D  T   C 
Sbjct: 72  IREIFYAFIHSTDINDPAFADVWNLLDIVLMCGDQGKCTPELVCWLLEELLDSQTTLGCR 131

Query: 97  QLFYYVEVNVNIWKQQTFFMSCKNNL--LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFP 154
            +F Y++       Q+ F    K NL  LR CN+LLRRLSR+++ +FCGR+  FL + FP
Sbjct: 132 VVFDYLDSRRERLAQKDFH---KKNLVFLRSCNELLRRLSRAEDAIFCGRVFFFLFQTFP 188

Query: 155 FSERSGLNIISEFNVENITEF---GG--------DEEMDVSSNETEETDT---------- 193
             ++S +N+  EF+VEN T+F   GG         E   V S   E T            
Sbjct: 189 LGDKSSVNLRGEFHVENTTKFDVEGGATEGIEPDAEPAAVPSTPKESTPPPAPTGKPGSK 248

Query: 194 --------EEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK- 243
                   +  ++V +  +  Y  FW LQ  F +P + +   +++ F       L  FK 
Sbjct: 249 AVPIKAPKKAAEEVPLSASELYPIFWRLQHDFSDPTKLFQPQNFEAFKKGLACTLTKFKK 308

Query: 244 -----------------SYKLDDVQS--SLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYV 284
                              KL +     S N   +Y  KYLT++ L DL+LSD  F+R++
Sbjct: 309 TPTVVQTKAGEEDKRGTKRKLGEAAGSESSNLLDNYNPKYLTSRDLFDLELSDLAFQRHI 368

Query: 285 LLQFLILFQY--------------FTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQT 330
           L+Q +IL  +               T T KS      L  +   W   T   +   + ++
Sbjct: 369 LVQAMILIDFLLSLTERAKKRLSSLTGTNKSLLYSFTLSEEDTHWATSTRAVIREGL-ES 427

Query: 331 PPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR-PLTS---------ITDEDKKDEPDA 380
             DG  + ++V+ +L  +++W +WK E CP + R P+T+         + +  +      
Sbjct: 428 REDGRQYCRMVETVLTRDKNWVRWKVESCPSIVRGPVTTEEELLARKGVANFTRPRRLAE 487

Query: 381 KKKKTPELTKLWNS-KDNLEACKSAERDFTPSLESYFEE-AIQQMDPAAAVEEQYKKVND 438
           K +   +L  L  +    LE  K+  R  TPS+    ++  + ++D   A+ ++ K   D
Sbjct: 488 KPQGMMDLGFLEEADSGGLEPLKNPARYTTPSIPQLVDQIKMDELDAEMAMTDEEKSNID 547

Query: 439 ---SNYAWRALRL 448
              +N  WRALR+
Sbjct: 548 TAMTNAKWRALRI 560


>gi|170084509|ref|XP_001873478.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164651030|gb|EDR15270.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 732

 Score =  137 bits (344), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 172/366 (46%), Gaps = 64/366 (17%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           + + + + +     D C  T P I+L D  +  T+  C  +F ++E       +      
Sbjct: 79  LRDKLDVVLTFTEHDACEQTFPFIVLQDLLETQTIASCSHIFSWIEARAERLTEGMVPQK 138

Query: 118 CKN-NLLRMCNDLLRRLSR-SQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV--ENIT 173
            K   LLR  NDLLRRLS+    T+FCGRIL FL+  FP  ERSG+N+  E+    E ++
Sbjct: 139 GKALILLRTLNDLLRRLSKMGTTTIFCGRILTFLSGVFPLGERSGVNLRGEYGPTWEGVS 198

Query: 174 ---EFGGDE---EMDVSSN--ETEETDTE-EVDKVKIDFN-----------------FYK 207
              +   D+   EM+V+ +  E E  + E + DK+++D                   FY 
Sbjct: 199 LGEKTKADDKHVEMEVAKSLEEPESKEGEAQGDKMQVDSKEVKKPVPTEGLAEKKDAFYN 258

Query: 208 KFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK-----------SYKLDDVQSSLNP 256
            FW+LQ  F  P       +++ F      V+   K           S  +    +SL  
Sbjct: 259 TFWALQLPFSKPPLFAADGTFEDFKERVNKVMPVIKEATAKERAMMGSRGVAGTGASLKR 318

Query: 257 SGD-----------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS- 301
             D           +FAK+LT+  LLDL+++DT+FRR  L Q LIL  +   FT T K+ 
Sbjct: 319 KRDLDSEESNSNEYFFAKFLTSPDLLDLEIADTHFRRQFLFQLLILLNHLLTFTKTAKAA 378

Query: 302 ----RGEGLE----LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQ 353
               R   L+    L+ ++ +WV+++   V   ++QT P+G  F++ +  IL  E++W +
Sbjct: 379 WTSTRNRSLQMDFTLEDNEAQWVQESITKVTEELRQTTPNGRIFAETISAILDREKNWVR 438

Query: 354 WKNEGC 359
           WKNE C
Sbjct: 439 WKNELC 444


>gi|389742049|gb|EIM83236.1| hypothetical protein STEHIDRAFT_123683 [Stereum hirsutum FP-91666
           SS1]
          Length = 849

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 105/385 (27%), Positives = 168/385 (43%), Gaps = 67/385 (17%)

Query: 58  IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMS 117
           ++N + L + L  +D+C  T P   L D  +  T+  C  +F ++E N +   +      
Sbjct: 81  LKNRLDLALTLTEQDVCEQTFPFNTLQDLLETQTISSCSHVFSWIESNADRLTEGMVPAK 140

Query: 118 CKN-NLLRMCNDLLRRLSR-SQNTVFCGRILLFLAKFFPFSERSGLNIISEF-------- 167
            K   +LRM NDLLRRLS+   N++FCGRIL FL+  FP  ERSG+N+  E+        
Sbjct: 141 GKTLIILRMLNDLLRRLSKMGSNSIFCGRILTFLSGVFPLGERSGVNLRGEYGPMWEAVE 200

Query: 168 ---------------NVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDF------NFY 206
                          NV+++ E  G E+   +S   E  + +   K   D       +FY
Sbjct: 201 IKPKVEEKPEETKVENVKDVKEEDGKEKEKDASMSMEVEEKKPASKTSSDLKQEQKDDFY 260

Query: 207 KKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPSGD------- 259
             FWSLQ  F  P    +  ++  F +    VL         +     + +G        
Sbjct: 261 NTFWSLQLPFSRPPLFASPETFTQFQTAVNKVLPIINEATAKERALMGSRTGSTSVSHSH 320

Query: 260 -----------------YFAKYLTNQKLLDLQLSDTNFRR----------YVLLQFLILF 292
                            +FAK+LT+ +LLDL+++DT+FRR            LL F    
Sbjct: 321 KRKREAEPGEETTTTEYFFAKFLTSPELLDLEIADTHFRRQFLFQLLILLNHLLLFTKTS 380

Query: 293 QYFTSTVKSRGEGLE--LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH 350
           +   +T ++R    E  L+  + +WV +T       ++QT P G  F+  V  IL+ E++
Sbjct: 381 RAIWTTTRNRSLQFEFTLEPPETQWVTETINKAQEELRQTAPGGRAFADSVSTILEREKN 440

Query: 351 WNQWKNEGCPELKRPLTSITDEDKK 375
             +WKN+ C    R   S+  + K+
Sbjct: 441 RVKWKNDLCTPFDREPYSVEVDGKR 465


>gi|67541699|ref|XP_664617.1| hypothetical protein AN7013.2 [Aspergillus nidulans FGSC A4]
 gi|40742469|gb|EAA61659.1| hypothetical protein AN7013.2 [Aspergillus nidulans FGSC A4]
 gi|259483676|tpe|CBF79260.1| TPA: THO complex subunit Tho1, putative (AFU_orthologue;
           AFUA_4G04330) [Aspergillus nidulans FGSC A4]
          Length = 637

 Score =  133 bits (334), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 103/418 (24%), Positives = 177/418 (42%), Gaps = 115/418 (27%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+    L+ T  ID      I N + +       + C   L   L+ +  D  
Sbjct: 66  AAVEIAFRERFYSLLATTTIDDPEFVRIWNLLDVISVFSDNEQCEPGLIFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
           T+D C ++F Y+E                           RR   +++TVFCGR+ +FL 
Sbjct: 126 TIDGCRKVFDYLES--------------------------RRERNTKDTVFCGRVFIFLF 159

Query: 151 KFFPFSERSGLNIISEFNVENITEF--------GGDEEMDV------SSNETEET----- 191
           + FP  ++S +N+  E++ EN+T F            +MDV      ++N   ET     
Sbjct: 160 QSFPLGDKSAVNLRGEYHTENVTTFDTIANDAISEAHDMDVDMPDVKATNSVAETQKQPV 219

Query: 192 ----------DTEEVDKVKIDFN-------------FYKKFWSLQDYFRNPVQCYNKVSW 228
                     D  ++ KV +  +              Y  FW LQ YF +P + ++   +
Sbjct: 220 AEGDAKSAEQDVTKIPKVTVSQSSDQTGERSLNLDTLYPIFWGLQAYFSSPTKIFDAQHF 279

Query: 229 KMFTSYAETVLAAFKSYKLD-------------------------DVQSSLNPSGDYFAK 263
             F S  E  L AF++   +                         ++ SS NP      K
Sbjct: 280 ATFKSGLEATLNAFRTVNTELETSNSKTQEELRKSTKRKRTADGPEIASSFNP------K 333

Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE-----------GLELKSD 311
           YLT+++L DL+++DT FRR+VL+Q LIL  +  S T KS+ +           G  L  +
Sbjct: 334 YLTSRELFDLEVNDTAFRRHVLVQALILLDFMLSLTSKSKAKLADLTNKSVLYGFVLNDE 393

Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSI 369
             +W     +++   + Q    G+ + ++V  +L  +++W +WK EGCP +++P  S+
Sbjct: 394 DAQWAVKMRKSIEGYL-QEGAGGKFYYRMVDTVLSRDKNWVRWKAEGCPPIEKPAVSV 450


>gi|407927426|gb|EKG20320.1| THO complex subunit THOC1 [Macrophomina phaseolina MS6]
          Length = 658

 Score =  132 bits (332), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 131/526 (24%), Positives = 223/526 (42%), Gaps = 113/526 (21%)

Query: 37  EQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLD 93
           E + +  L  L+ T  ID      + N + + +    ++ C+  L   L  +  D  ++D
Sbjct: 64  EIAAKTLLHHLVATTTIDDPSFVQVWNLLDIVLTCSERNQCDGLLIFYLAEELLDSQSID 123

Query: 94  KCEQLFYYVEVNVNIWKQQTFFMSCKNN----LLRMCNDLLRRLSRSQNTVFCGRILLFL 149
            C ++F Y+E          F  +   +    +LR CN+LLRRLSR+++ VFCGR+ +FL
Sbjct: 124 GCRRVFDYLESRRERLVAGDFLQNSGASKRIVVLRFCNELLRRLSRAEDAVFCGRVFIFL 183

Query: 150 AKFFPFSERSGLNIISEFNVENITEF------------------GGDEEMDVSSNETEET 191
            + FP  ++S +N   EF+VEN T F                  G D + +V    TE+T
Sbjct: 184 FQTFPLGDKSSVNSRGEFHVENATVFEEPASSKEEKADDMEIDAGADGKTEVPIVTTEDT 243

Query: 192 DTEEVDKVKIDFN--------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
             E   K ++  +         Y  FW+LQ  F NP + +       F    E  +  FK
Sbjct: 244 TAEADAKAELKKDNKALDTDTLYPIFWTLQQAFSNPPKLFTDEYLAEFRVGIEHTITKFK 303

Query: 244 -----------------SYKLDDVQSSLNPSGDY---------------------FA--- 262
                                D  ++     G+Y                     FA   
Sbjct: 304 DVPKVIPAKNKDKSDERDLSKDKGKAEAGAEGNYENASAEDGRGMKRKHDEGEEEFASPY 363

Query: 263 --KYLTNQKLLDLQLSDTNFRRYVLLQFLILF--------------QYFTSTVKSRGEGL 306
             KYLT++ L +L+LSD  F+R++L+Q L+L               Q + STV +    +
Sbjct: 364 NPKYLTSRDLFELELSDLAFQRHILVQALVLLDFLLSLTEKSKEKTQKWLSTVGNPNRAV 423

Query: 307 E----LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL 362
           +    L +    W   T   + + + Q  PDG+ + ++V  +L  +++W +WK E CP +
Sbjct: 424 QYSYTLSAADADWASRTRTAITTYL-QLLPDGKFYYRMVDTVLSRDKNWVRWKMESCPPI 482

Query: 363 KRPLTSITD--EDKKDEPDA----KKKKTP----ELTKLWNS--KDNLEACKSAERDFTP 410
            +P  + ++  E K+    A    + K TP     L+ L ++     L+  K + R   P
Sbjct: 483 AKPPVAPSEFQEAKQGAQRACANKRLKATPPGSLNLSFLSDAAMSGGLQGLKESSRYTIP 542

Query: 411 SLESYFEEAIQ----QMDPAAAVEEQYK-KVNDSNYAWRALRLLSR 451
           + ES     IQ     ++ A   E+++K +   ++  WRALR+ S+
Sbjct: 543 TPES-LHRGIQGDELDLEMAMTDEDKHKLQQAKASKTWRALRIASK 587


>gi|328724615|ref|XP_003248200.1| PREDICTED: THO complex subunit 1-like [Acyrthosiphon pisum]
          Length = 251

 Score =  132 bits (332), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 71/178 (39%), Positives = 105/178 (58%), Gaps = 25/178 (14%)

Query: 275 LSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDG 334
           LS++N+   ++ +F   F           +  ++K +Q++W+K++  T+Y LI  TPPDG
Sbjct: 66  LSESNYLYLIIYKFTFDF----------SDNFKVKPEQQEWIKNSITTIYELINNTPPDG 115

Query: 335 EHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDED-----KKDEPDAKKKKT---- 385
             FS+VVK IL+ EE WN+WKN+GCPE+ + L +    D     +++  D  K  T    
Sbjct: 116 VRFSEVVKNILQREEQWNKWKNDGCPEVIKRLPTSDQSDMAAPKRRNLGDVIKNMTANNK 175

Query: 386 -----PELTKLWNSK-DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
                P+LTKLWN K DNLEACK  ERDF PS +S+F+EA +Q+D    +E Q  K +
Sbjct: 176 VFLGDPDLTKLWNLKPDNLEACKGPERDFVPSFDSFFQEAFEQLDAKLCIESQNNKAD 233


>gi|224009341|ref|XP_002293629.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971029|gb|EED89365.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 983

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 109/398 (27%), Positives = 170/398 (42%), Gaps = 79/398 (19%)

Query: 121 NLLRMCNDLLRRLSRS--QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGD 178
           NLLR+CN LL+  S S  ++  F G +++ LA+ FP SERS LN++  FNV ++  F G+
Sbjct: 218 NLLRICNHLLKLSSSSIGKDAEFAGSVMVTLARVFPLSERSALNVLGSFNVGSVVGFEGE 277

Query: 179 EEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPV-----------QCYNKVS 227
           EE +   +              I ++FYK FW +Q  F +P            Q     +
Sbjct: 278 EERNHHHHSL------------IGYDFYKTFWGVQKVFTDPSSTILASRGGLKQSEALQA 325

Query: 228 WKMFTSYAETVLAAFKSYKL-----DDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRR 282
           +  F +  +T+L A +S  +     +   +S         KYLT+ +LL LQL D+ FR 
Sbjct: 326 YDGFVADVKTILGALESTPVAAPPSESTSNSSVVVVVNHHKYLTSPQLLHLQLKDSTFRI 385

Query: 283 YVLLQFLILFQYFTSTVKSRGEG----------------LELKSDQEKWVKDTTETVYSL 326
           + L Q +I+  Y TST  S                     ++K  Q K + D  + V  L
Sbjct: 386 HFLTQLIIIINYLTSTPTSVTLPTPPITPTPGADTTKLQAQIKQTQLKQLNDLNKRVEQL 445

Query: 327 IKQTP-----------PDGEHFSQVVKLILKGEE-HWNQWKNEGCP----ELKRPLTSIT 370
           ++ +            P GE+  + +  +L+  E  W  WK   C      +   L  + 
Sbjct: 446 LRSSSSAGAATAAAVQPYGENQWKGLTWLLRDRETMWKAWKRGKCKSNMDSVLNDLGGVR 505

Query: 371 DEDKKD--EPDAKKKKTPEL----TKLWNSKD-----------NLEACKSAERDFTPSLE 413
             D +       +K+   E+    T    S D           NL    S      PSL 
Sbjct: 506 GGDVRAILGSSGRKRTLEEVASSGTATGGSGDVADNAVIGIKTNLPTISSQMAQSQPSLT 565

Query: 414 SYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
            + +  I+ +DP + +E +Y   ND  Y+WRALRLL+R
Sbjct: 566 DFLDPYIEALDPESGIEAEYHPRNDKVYSWRALRLLAR 603


>gi|169610283|ref|XP_001798560.1| hypothetical protein SNOG_08238 [Phaeosphaeria nodorum SN15]
 gi|160702023|gb|EAT84514.2| hypothetical protein SNOG_08238 [Phaeosphaeria nodorum SN15]
          Length = 480

 Score =  127 bits (319), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 111/434 (25%), Positives = 196/434 (45%), Gaps = 88/434 (20%)

Query: 12  SQLNVVSDKYALYKPTCDYDKKSALEQSIRQ--YLLKLIKTPDIDIKVIENYISLCVELC 69
           S+LN + ++    KP+   D    +   IR+  +L   I+  D           L VE  
Sbjct: 14  SRLNYLLERARSVKPSTSIDPPLQVSDLIREDEFLFDPIEGSD-------EARRLSVETA 66

Query: 70  MKDMCNSTLP-IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN---LLRM 125
            K +  + L  ++L+ +  D  +++ C  +F ++E      ++    ++ KN    +LR+
Sbjct: 67  AKSLFYANLATLLLIEELLDSQSIEGCRVVFNFLESR----REAIIAINSKNKDLVILRL 122

Query: 126 CNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------GG 177
           CN+LLRRLSR+++ VFCGR+ +F+ + FP  ++S +N+   F+VEN+T F          
Sbjct: 123 CNELLRRLSRAEDPVFCGRVYIFMFQSFPLGDKSSVNLRGNFHVENVTTFEDFLKDSSTE 182

Query: 178 DEEMDVSSNETE-----ETDTEEVD---------------KVKIDFNFYKKFWSLQDYFR 217
           D++M +   E E     E+ +EE +                + ID + Y  FWSLQ  F 
Sbjct: 183 DDQMQIDDMEAEPVLKGESKSEEHNVKLGNSNVQKDELSSTMDID-SLYPVFWSLQHSFS 241

Query: 218 NPVQCYNKVSWKMFTSYAETVLAAFK-------SYKLDDVQSSLNPSG---DYFAKYLTN 267
           NP + + +  +K F    E  LA FK       +   +  +  ++  G   D F K L  
Sbjct: 242 NPPRLFEEDHFKHFQKSLEATLAKFKEVPKVIQAGDSERTKEQMDQDGDDYDAFGKRLQP 301

Query: 268 Q-----------------KLLDLQLSDTNFRRYVLLQFLILFQY-FTSTVKSRGEGL--- 306
           Q                 KL++  LSD  F+R++L+Q LIL  +  T T K++ + +   
Sbjct: 302 QISHKPRSVQAWRYAIRDKLVE-ALSDLAFQRHILVQALILVDFLLTLTEKAKSKSIYQK 360

Query: 307 ---------ELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNE 357
                     L+ +  +W       + + + Q  PDG+ + ++V  +L  +++W +WK E
Sbjct: 361 AQKAMQYNFTLREEDTEWALGIKTAIANYL-QEGPDGKFYYRMVDTVLSRDKNWVRWKME 419

Query: 358 GCPELKRPLTSITD 371
            C    R   +  D
Sbjct: 420 NCQPFTRDRVATKD 433


>gi|388582940|gb|EIM23243.1| hypothetical protein WALSEDRAFT_62616 [Wallemia sebi CBS 633.66]
          Length = 689

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 117/400 (29%), Positives = 181/400 (45%), Gaps = 88/400 (22%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDI-----DIKVIENY-------ISLCVELCMKDMC--NS 76
           DKK+ +E +IR+ +     T  I     D KV + Y       + + V+L   +M   NS
Sbjct: 34  DKKALVEATIREEIQSYSTTRSIAPQYSDDKVTDPYHTGLLSKLGILVDLLKSEMITPNS 93

Query: 77  TLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQ----QTFFMSCKNNLLRMCNDLLRR 132
            +PII  SDT D+  L     +F ++    N        Q   +S +NNLL+  N+LLRR
Sbjct: 94  VVPII--SDTLDIHPLSSATVIFSFL---TNFMSSDDSNQVKTISTRNNLLKTSNELLRR 148

Query: 133 LSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETD 192
            SR++N+  CG ILLFL    P ++RS  N            F GD      SN+T   D
Sbjct: 149 ASRTRNSELCGEILLFLTSIMPLTDRSATN------------FRGDIR---ESNKTFIND 193

Query: 193 TEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWK--------------MFTSYAETV 238
              +D    D + YK FW LQ Y  +P+  YN    K              +F++  + +
Sbjct: 194 L--IDSPDAD-DSYKDFWRLQSYLTDPMSIYNGGLVKNDDKEVTHIESFKQLFSNILDLL 250

Query: 239 L-AAFKSYKLD---------------DVQSSLNPSGD--YFAKYLTNQKLLDLQLSDTNF 280
           + A+ K  K+                + +S  + S D  +F +YL   KL  LQ++D +F
Sbjct: 251 VQASDKEIKMSGSKLKKDTKRKADELEAKSDNHESNDAYFFPEYLAQPKLFPLQVADPSF 310

Query: 281 RRYVLLQFLILFQYFTST-VKSRGEGLELKSDQEKWVKDTTET---VYS--------LIK 328
           R  V +Q  I+ Q+  S   K   +  E+K+ QEK +K   ++   +Y         L+K
Sbjct: 311 RCCVFIQMFIIVQFIVSVYPKELLKAPEIKNPQEKIIKSGIQSDKPLYDWASNKRAILLK 370

Query: 329 QTP---PDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
             P   P+    +Q +  ++K E+HW  WK E CP ++ P
Sbjct: 371 SLPKVSPNNLDITQPIVQLMKREKHWVYWKMENCPMIELP 410


>gi|261187831|ref|XP_002620333.1| nuclear matrix protein [Ajellomyces dermatitidis SLH14081]
 gi|239593450|gb|EEQ76031.1| nuclear matrix protein [Ajellomyces dermatitidis SLH14081]
          Length = 651

 Score =  125 bits (313), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 143/616 (23%), Positives = 242/616 (39%), Gaps = 152/616 (24%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           + +E + R+    L+ T  ID      I N + +       + C   L   L+ +  D  
Sbjct: 66  AVVETAFRKKFYDLLATTSIDEPSFIHIWNLLDIVSIFSDNEQCEPGLLFWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEV-----NVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+D C ++F Y+E          +KQ++        +LR CN+LLRRLSR+++T F    
Sbjct: 126 TIDGCRKVFDYLESRRERNTAKHFKQKSLI------ILRSCNELLRRLSRTEDTSF---- 175

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEF--------GGDEEMDVSSNETE---ETDTE 194
                   P  +RS +N+  EF+ EN+T F        GG   MDV ++  E   +  TE
Sbjct: 176 --------PLGDRSSVNLRGEFHTENVTTFDKLPKAQDGGVTPMDVDASAKEVAGDAKTE 227

Query: 195 EVDKVKIDFN-----------------------------------FYKKFWSLQDYFRNP 219
                + + N                                    Y  FWSLQ+ F +P
Sbjct: 228 SAQASEQESNRQAAHPAADVGLQSSQPATKKPQEPEPSSTPSMDELYPIFWSLQNNFSSP 287

Query: 220 VQCYNKVSWKMFTSYAETVLAAFKSYKLD-------------------DVQSSLNPSGDY 260
              ++  ++  F +  E  L+ F+    D                    V   +  +  +
Sbjct: 288 TSLFDPANFAAFKTGLEASLSIFQKVNTDMEARMTKGSEDARRGLKRRRVGDGMEMTNSF 347

Query: 261 FAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGE--------------- 304
             KYLT++ L +L+++D  FRR++L+Q LIL  +  S T K++ +               
Sbjct: 348 NPKYLTSRDLFELEVNDVAFRRHILVQSLILLDFVLSLTPKAKAKLADSTNKSVLYSYVL 407

Query: 305 -----GLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
                 L   S + KW      ++ S ++Q   +G+ + ++V  +L  +++W +WK E C
Sbjct: 408 NDEDVNLCTASSKAKWASQMKTSIASYLQQG-SEGKFYYRMVDTVLTRDKNWVRWKAEAC 466

Query: 360 PELKRPLTSITD------EDKKDEPDAKKKKTP----ELTKLWNSK--DNLEACKSAERD 407
           P ++R   SI D         K     + +  P    +L  L   K  +++   K  +R 
Sbjct: 467 PPIERTPVSIQDYLDTQATATKVSTSKRLRSAPLGSLDLNFLTEDKNINSVGRLKHPDRF 526

Query: 408 FTPSLESYFEEAIQ---QMDPAAAVEEQYKKVND-SNYAWRALRLLSRKCPHFF------ 457
            TP+L+SY          +D A + EE+ +     ++  WR LRL SR     F      
Sbjct: 527 TTPALDSYMRGIADDEFNIDMAQSREEKDEATKARASKTWRTLRLSSRSKLALFDKIEDG 586

Query: 458 -----LNATPNVEKNSEFIENMVKRCVKEKPSSQISGNGNGVD-QDPAEVEVDTKSEEIQ 511
                L   P   +++    N  K  V+           NGV+ QD      D K E+ +
Sbjct: 587 NNLKVLFEPPPASEDASSASNKQKAAVE-----------NGVNRQDGPSKRGDVKGEKAK 635

Query: 512 EEEKEEDWEAKADPEG 527
           E +     +  A  +G
Sbjct: 636 ENQPPVSSDGAAPQQG 651


>gi|339259264|ref|XP_003368993.1| hypothetical protein Tsp_12746 [Trichinella spiralis]
 gi|316964321|gb|EFV49483.1| hypothetical protein Tsp_12746 [Trichinella spiralis]
          Length = 442

 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 100/379 (26%), Positives = 166/379 (43%), Gaps = 36/379 (9%)

Query: 129 LLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISE----FNVENITEFGGDEEMDVS 184
           +L+RLSR  N   CGR+  FL K  PF ++SGLN   E     + E I+    DEE+   
Sbjct: 1   MLKRLSRIINAELCGRLSAFLTKCLPFLDKSGLNATGECEPLLDAELISFPDEDEELVEQ 60

Query: 185 SNETEE-------TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCY-NKVSWKMFTSYAE 236
             E+         T  E+ ++ K + + Y  F   +     P  C   +  W  F    +
Sbjct: 61  FAESAYYLRDEFITLPEKTEEKKFNRSLYSSFCLTKKLLAEPTNCCETEDDWYQFKQAVD 120

Query: 237 TVLA-----------AFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVL 285
            VL             + S     V   L+  G +   YL   K+  LQ++D +F    L
Sbjct: 121 KVLTHMSHPPLPFDRTYSSLMNQAVARELSLPGGF---YLPEPKMFMLQINDVSFVSTFL 177

Query: 286 LQFLILFQYFT---STVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVK 342
           ++ LI+  Y       + +    LE  S + +W+  T    Y L+K  P DGE  +  + 
Sbjct: 178 MEQLIVLDYLIEQNRCLNTTVHDLESSSGKHRWLLSTYNNCYQLLKAMP-DGEAATFHLN 236

Query: 343 LILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKKKK-----TPELTKLWN-SKD 396
           L L  E+ W QWK +  P+++   T+I    ++ E     K       P LTKLWN   +
Sbjct: 237 LQLDNEKLWGQWKQDNYPDVRVKPTAIVQSLRRGEHRPYVKGFFDLGNPTLTKLWNLCPN 296

Query: 397 NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHF 456
           N+++ +       P ++++  EA+ Q  P   +  + KKV+D N+ W   R ++ K   F
Sbjct: 297 NVDSWRGIGCQAVPEVKAFISEAVNQASPEVQMPMELKKVSDENFKWIGYRFMAEKSRSF 356

Query: 457 FLNATPNVEKNSEFIENMV 475
            +     +   S+++E+M+
Sbjct: 357 GMFTEKQILNTSKYLESML 375


>gi|452977651|gb|EME77417.1| hypothetical protein MYCFIDRAFT_42306 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 621

 Score =  124 bits (310), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 126/501 (25%), Positives = 208/501 (41%), Gaps = 116/501 (23%)

Query: 34  SALEQSIRQYLLKLIKTPDI-DIKVIENYISLCVELCMKD--MCNSTLPIILLSDTFDMS 90
           SA+E + R+    LI + +I D    E +  L V L   D   C+  L   L+ +  D  
Sbjct: 66  SAVETAFREIFYSLIHSTEIHDPAFGEVWNLLDVVLICGDEGRCDPGLVCWLIEELLDSQ 125

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
           T + C  +F Y++                           RR SR+++ +FCGR+  FL 
Sbjct: 126 TTEGCRTVFDYLDS--------------------------RRESRAEDAIFCGRVFFFLF 159

Query: 151 KFFPFSERSGLNIISEFNVENITEF-------GGDEEMDVSSNETEETDTEEV------- 196
           + FP  ++S +N+  EF+ EN T+F       G   ++D  + ET  +DT+E        
Sbjct: 160 QTFPLGDKSSVNLRGEFHTENTTKFEVSAPVEGEQMDVDFKTGETP-SDTKESTPQPAAP 218

Query: 197 -------------------DKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
                              ++V      Y  FW LQ  F +P + ++  ++ +F      
Sbjct: 219 TKPGSKAIPIKAAPKKVAEEEVLSTEQLYPIFWRLQHDFSDPTRLFDNDNFTVFKKGLAC 278

Query: 238 VLAAFK-SYKLDDVQSSLNP--------SGD-------------YFAKYLTNQKLLDLQL 275
            +A FK +  +   +SS+           GD             Y  KYLT++ L  L+L
Sbjct: 279 TIAKFKKTPTVVQTKSSVEGRRGTKRKMHGDGGEEESDNTLVDNYNPKYLTSRDLFGLEL 338

Query: 276 SDTNFRRYVLLQFLILFQYF-------------TSTVKSRGEGLELKSDQEKWVKDTTET 322
           SD  F+R++L+Q LIL  +               ST ++      L  +  KW   T  T
Sbjct: 339 SDLAFQRHILVQALILIDFLLSLTEKAKKRLNAVSTNRALIHAFVLNEEDAKWATSTRST 398

Query: 323 VYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPDAKK 382
           + S +  +  +G H+ ++V+ +L  +++W +WK E CP + R   S T+E+      A K
Sbjct: 399 ITSHLNMS-NEGRHYCRMVETVLARDKNWVRWKVESCPSIVRGPVS-TEEELLARTGAAK 456

Query: 383 KKTPELT--KLWN----------SKDNLEACKSAERDFTPSLESYFEEA-IQQMDPAAAV 429
              P     K  N          +   L+A K   R   PS+E   E+    ++D   A+
Sbjct: 457 FTRPRTIPDKPQNVMNLSFLDEVTGGGLDALKHPSRYTAPSIEQLVEQVKTDKLDAEMAM 516

Query: 430 EEQYKKVNDS---NYAWRALR 447
           +E+ K   ++   N  WRALR
Sbjct: 517 DEEEKSSYENAIQNAKWRALR 537


>gi|116191693|ref|XP_001221659.1| hypothetical protein CHGG_05564 [Chaetomium globosum CBS 148.51]
 gi|88181477|gb|EAQ88945.1| hypothetical protein CHGG_05564 [Chaetomium globosum CBS 148.51]
          Length = 718

 Score =  122 bits (307), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 108/409 (26%), Positives = 181/409 (44%), Gaps = 56/409 (13%)

Query: 72  DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMC 126
           ++C+  L   L+ +  D  T+  C ++F ++E     +    +KQ+         +LR C
Sbjct: 116 ELCDPALLFWLVEELLDSQTIAGCRKVFDFLESRRERITSKHFKQKQLV------ILRTC 169

Query: 127 NDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSN 186
           N+LLRRLSR+ +  FCGR+ +F+ + FP  ++S +N+  E++VEN+T F  D+E  ++  
Sbjct: 170 NELLRRLSRALDPAFCGRVFIFMFQSFPLGDKSSVNLRGEYHVENVTTF--DQETPIADG 227

Query: 187 ETEETDTE---EVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
           E  + DT+   + DK       Y  FWSLQ+ F  P + +    +  F +  E  +A F 
Sbjct: 228 EKMDVDTDAKPQKDKPLDPDTLYPIFWSLQESFNQPKKLFESSHFSAFKAGLEATMATF- 286

Query: 244 SYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILF--QYFTSTVKS 301
                    ++NP     A+   ++   +   S    R  V  +    F  +Y TS    
Sbjct: 287 --------WTINPEKPTRAREKQDKPDEEANPSLKRKRNDVDDELASGFNPKYLTSRDLF 338

Query: 302 RGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE 361
           +   LEL  +  KW  D    V   + Q   DG  F ++V  +L  +++W +WK E CP 
Sbjct: 339 K---LELSEEDTKWAIDMRAGVVKYL-QKGFDGPFFDRMVGTVLVRDKNWVRWKIENCPS 394

Query: 362 LKRPLTS---ITDEDKKDEPDAKKKKTP------ELTKLWNSKDNLEACKSAERDFTPSL 412
           ++ P  S    T+        A+ ++ P      E     + K  +E  K  +R   P L
Sbjct: 395 IELPTLSPGTFTEARAAATRLARTRRLPNESRSLEFLGEEDEKSAMEKLKDPQRYSLPEL 454

Query: 413 ESY----------FEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSR 451
            S+           E A       +A+E +  K      AWR LR+ S+
Sbjct: 455 GSFKDGIEDDDFDIERATNNESKTSAIESKASK------AWRTLRIASK 497


>gi|50549211|ref|XP_502076.1| YALI0C21131p [Yarrowia lipolytica]
 gi|49647943|emb|CAG82396.1| YALI0C21131p [Yarrowia lipolytica CLIB122]
          Length = 724

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 112/426 (26%), Positives = 163/426 (38%), Gaps = 124/426 (29%)

Query: 71  KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVN-----VNIWKQQTFFMSC------- 118
           K +  STL + L+ D FD+ T+D C   + +  V+     +N+ + Q    S        
Sbjct: 92  KPLLESTLTLALVEDIFDIHTVDWCRHFWTHYLVSREEQLMNVSRNQASSNSVTRLSGRK 151

Query: 119 --KNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
                L+R+ N LLRR+S+S++ VF G IL+FL++ FP SERSGLN   EFNVENIT F 
Sbjct: 152 APGTTLIRLANALLRRVSKSRDAVFAGEILIFLSRAFPLSERSGLNQKGEFNVENITVFE 211

Query: 177 GDE----------------------------------EMDVSSNETEETDTEEVDKVKID 202
           G E                                  ++D+  +E +E   E+       
Sbjct: 212 GYEGAKEHKEVDEDEDEDDDEDDNDDDDEDENENEDKDVDMGVDEMKEKVKEKTATPATS 271

Query: 203 FNF--------YKKFWSLQDYFRNPVQCYN---------------KVSWKMFTSY---AE 236
                      Y  FW LQ YF +P    N               K + K   S+    E
Sbjct: 272 KEHTQEPSDSPYALFWGLQKYFADPTILLNSKEPEVEMNKLQTCLKATVKTLRSHDGSTE 331

Query: 237 TVLAAFKSYKLDDVQSSLNPSGDYF-----------------------------AKYLTN 267
                   +   D +   N  G+                               AK+LTN
Sbjct: 332 IAQGGGTRWAKRDSREGGNRGGNTSAAGGGSGGSRDFGNSRGHSANNHSSRTLNAKWLTN 391

Query: 268 QKLLDLQLSDTNFRRYVLLQFLILFQYF-------------TSTVKSRGEGLELKSDQEK 314
             L  LQLSDTNFRR +  Q LI  ++              T+T +S      L +  E 
Sbjct: 392 PDLFTLQLSDTNFRRAIWTQILIFSEFMLSLTEKNKAKIPATATNRSVQYQYTLSAADEI 451

Query: 315 WVKDTTETVYSLIKQTPP--DGEH------FSQVVKLILKGEEHWNQWKNEGCPELKRPL 366
           + +       S  K  PP   G         S+++  + K +E W +WK E CP  ++P+
Sbjct: 452 YFESVVAHFESSGKNKPPYKPGHSPDFNAVLSRIIPHVFKFDEKWVEWKLENCPAYEKPV 511

Query: 367 TSITDE 372
            +  DE
Sbjct: 512 MNNIDE 517


>gi|388857770|emb|CCF48664.1| related to nuclear matrix protein p84 [Ustilago hordei]
          Length = 865

 Score =  120 bits (301), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 76/216 (35%), Positives = 117/216 (54%), Gaps = 15/216 (6%)

Query: 68  LCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN----LL 123
           L + D+ + TLP+ L+ +  +   +  C QLF Y+E  V   K  T  +         LL
Sbjct: 132 LSLYDLIDPTLPLSLVEELIEGQPISICSQLFGYIESRV---KPLTIGLHSTRGKGLVLL 188

Query: 124 RMCNDLLRRLSRS--QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
           R CN+LLRRLS+   Q+TVF GRIL  LAK FP  ERSG+N+  +FNVEN T    +E +
Sbjct: 189 RTCNELLRRLSKPSQQHTVFAGRILSLLAKVFPLGERSGVNLRGDFNVENKTHI--EENV 246

Query: 182 DVSSNETEET-DTEEVDKV-KIDFNFYKKFWSLQDYFRN-PVQCYNKVSWKMFTSYAETV 238
           D ++ E E+  D +E D++ K+D+ FY+ FW +Q +F N P+      + +  T  A   
Sbjct: 247 DDATEEVEDAEDNDERDQIFKVDYEFYELFWKVQRFFSNSPLLMGAHETSRPSTRSATPA 306

Query: 239 LAAFKSYKLDDVQSSLNPSGDYFAKY-LTNQKLLDL 273
            A    +   D   + + +    A++ ++ QK+L L
Sbjct: 307 DAGGGEHTNADEDPAPDGAKGAMAEFRISTQKILKL 342



 Score =  102 bits (254), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 85/297 (28%), Positives = 134/297 (45%), Gaps = 77/297 (25%)

Query: 257 SGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLE- 307
           SG +F  YLT +KL + ++ DT+FR++VL+Q+L+LFQY   FT T K      + + L+ 
Sbjct: 452 SGHFFPTYLTGRKLFEYEIRDTSFRKHVLVQYLVLFQYLLSFTPTAKESWKDWKNKTLQA 511

Query: 308 ---LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
              L+   E+W++ T   +  LI++  PDG  F + V  +LK E +W +WK E CP +++
Sbjct: 512 LFTLEEADERWIRSTWREIQGLIREIQPDGRLFLESVLEVLKREANWIRWKTESCPSIEK 571

Query: 365 PLTSITDED----------KKDEPDAKKKKTPELTKLW--NSKDNLEACKSAERDFT--- 409
           P  S               +K EP   K  T  L++LW    K  +   +  E +     
Sbjct: 572 PPLSPEQISAFASARALLLRKPEPYPHKLGTAALSELWLDGLKPPVAGTRRVENEIGEYV 631

Query: 410 ----------------PSLESYFEEAIQQ-------------MDPAAAVEEQYKKVND-- 438
                           PSLE ++ ++IQ+             +DPA    E  K+  D  
Sbjct: 632 RIATDGLEDLEFLPPIPSLE-HYNKSIQREKMKLDMRKEQLGIDPATKEAELSKEEKDKL 690

Query: 439 -------------SNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEK 482
                         + AWRALR+ SR   H F    P+  +N   IE ++K   +E+
Sbjct: 691 DKDERIQATIETMQSLAWRALRVASRDNLHLF--GKPDRPEN---IEGLLKAMEEER 742


>gi|443900289|dbj|GAC77615.1| nuclear matrix protein [Pseudozyma antarctica T-34]
          Length = 840

 Score =  119 bits (298), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 69/198 (34%), Positives = 104/198 (52%), Gaps = 9/198 (4%)

Query: 68  LCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMC 126
           L + D+ + TLP+ L+ +  +   +  C QLF Y E  V            K   LLR C
Sbjct: 129 LSLHDLIDPTLPLSLVEELLEGLPISTCSQLFAYTESRVKPLTVDLHPTRGKGLVLLRTC 188

Query: 127 NDLLRRLSRS--QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVS 184
           N+LLRRLS+    +TVF GR+L  LA  FP  ERSG+N+  +FNVEN T     EE D  
Sbjct: 189 NELLRRLSKPSQHHTVFAGRVLSLLAAVFPLGERSGVNLRGDFNVENKTHI---EENDSQ 245

Query: 185 SNETE--ETDTEEVDKV-KIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAA 241
           + E    + D +E D++ K+D++FY+ FW +Q +F NP            ++ + T    
Sbjct: 246 AAEEASIDADRDERDEIFKVDYDFYELFWKMQRFFSNPPLLMGASESSRPSTRSATPALH 305

Query: 242 FKSYKLDDVQSSLNPSGD 259
            ++   D+VQ+  +P+ D
Sbjct: 306 TEAADSDNVQADDDPAPD 323



 Score = 82.0 bits (201), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 88/350 (25%), Positives = 144/350 (41%), Gaps = 82/350 (23%)

Query: 262 AKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLE----LK 309
           AK L    L + ++ DT+FR+++L+Q+L+LFQY   FT T K      + + L+    L+
Sbjct: 415 AKRLRPDVLFEYEIRDTSFRKHILVQYLVLFQYLLSFTPTAKESWKDWKNKTLQALFTLE 474

Query: 310 SDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSI 369
              E+W++ T   +  LI+   P G  F + V  +LK E +W +WK E CP +++P  S 
Sbjct: 475 DVDERWIRSTWREILILIRDIQPHGRLFLESVLEVLKREANWIRWKTESCPSIEKPPLSP 534

Query: 370 TDED----------KKDEPDAKKKKTPELTKLWNSKDNLEACKSAER------------- 406
                         +K EP   K  T  L++LW   D L    +  R             
Sbjct: 535 EQISAFASARALLLRKPEPYPHKLGTAALSELW--LDGLRPPVAGTRRVENELGEYISVP 592

Query: 407 -------DFTPSLES--YFEEAIQQ-------------MDP----AAAVEEQYKKVND-- 438
                  +F P + S  ++ ++IQ+             +DP    A   +E  +K++   
Sbjct: 593 TDGLEELEFLPPIPSVEHYNKSIQREKMKLDLRKKQLGIDPTISEAQLSKEDREKLDKDE 652

Query: 439 ---------SNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE---KPSSQ 486
                     + AWRALR+ SR   H F        +  E IE +++    E   KP S 
Sbjct: 653 RIQATTETMQSLAWRALRVASRDHLHLF-----GKPERPEHIEGLLEAIETERNPKPKSV 707

Query: 487 ISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEGDADEVMSVE 536
                   ++   + +    +E+  E+  E      AD   DA E  + E
Sbjct: 708 APVETQATEEGAVQAQGPQDAEKAAEQSAELAQAGTADGPADAGETRTNE 757


>gi|353236953|emb|CCA68937.1| related to nuclear matrix protein p84 [Piriformospora indica DSM
           11827]
          Length = 673

 Score =  117 bits (294), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 96/296 (32%), Positives = 141/296 (47%), Gaps = 61/296 (20%)

Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV------------ 169
           +LR  N+LLR LS+S  T FC RILLF++  FP SERSG+N+  E+              
Sbjct: 97  VLRTLNELLRSLSKSTETEFCARILLFMSAVFPLSERSGVNLRGEYGPPWEGPPPYQEGK 156

Query: 170 ENITEFGGDEEM-------DVSSNETEET-DTEEVDKVKIDFNFYKKFWSLQDYFRNPVQ 221
           E I E    EE        D  + E ++T D  E D+  +    YK FW LQ  F  P  
Sbjct: 157 ELILEEAKKEETRQGARHDDAGATEMDDTVDPSEEDRKSL---LYKTFWFLQYPFSRPPA 213

Query: 222 CYNKVSWKMF-------------TSYAETVLAAFKSY-----KLD---DVQ----SSLNP 256
                 +K F              +  E +LA  K+      K D   DVQ    +  + 
Sbjct: 214 FAEADMFKAFQHAVNKILPILSEATKKERMLAGSKTLAGSKRKRDVEMDVQKVSYTVQDE 273

Query: 257 SGDY-FAKYLTNQKLLDLQLSDTNFRR---YVLLQFLILFQYFTSTVKSRGEGLELKS-- 310
             DY FAKYL++ +LL+L+++DTNFRR   + LL  L   + FT   K++   ++++   
Sbjct: 274 HADYAFAKYLSSPELLELEIADTNFRRQFLFQLLILLQYLRSFTEQEKAKHPEVKMRPFN 333

Query: 311 -------DQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGC 359
                   +E WVK+    V   ++ T PDG  F +++++IL+ E +W  WKN  C
Sbjct: 334 IDFTLPESEENWVKEVNSRVMLELRTTTPDGRAFEEIIRIILERESNWVNWKNSQC 389


>gi|76154476|gb|AAX25952.2| SJCHGC07441 protein [Schistosoma japonicum]
          Length = 170

 Score =  117 bits (292), Expect = 2e-23,   Method: Composition-based stats.
 Identities = 55/134 (41%), Positives = 87/134 (64%), Gaps = 2/134 (1%)

Query: 31  DKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           DKKS ++ ++R  +L+L+ +  +D   ++  + + V   + ++C++  P +L SD F+  
Sbjct: 39  DKKSLIDVALRGRMLRLLASDYMDD--VKTLLKMGVAAALANLCSAPTPFLLFSDVFNTK 96

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
            +  CE++F ++E  +   K    F S +N LLRMCNDLLRRLS+SQNTVFCG+I LFL 
Sbjct: 97  PISLCEEMFGFMEETIITLKDVPLFGSGRNTLLRMCNDLLRRLSKSQNTVFCGQIQLFLT 156

Query: 151 KFFPFSERSGLNII 164
           + FP  E+SGLN +
Sbjct: 157 RLFPLDEKSGLNFM 170


>gi|164656609|ref|XP_001729432.1| hypothetical protein MGL_3467 [Malassezia globosa CBS 7966]
 gi|159103323|gb|EDP42218.1| hypothetical protein MGL_3467 [Malassezia globosa CBS 7966]
          Length = 543

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 106/404 (26%), Positives = 176/404 (43%), Gaps = 78/404 (19%)

Query: 122 LLRMCNDLLRRLSRS--QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDE 179
           LLR+CN+LLRRLS++  Q+T+  GRI L L+  F  SERSG+N+  EF+         ++
Sbjct: 18  LLRLCNELLRRLSKTHPQHTMLAGRIFLLLSSTFAISERSGVNLKGEFDT--------NQ 69

Query: 180 EMDVSS-NETEETDTEEVDKVKIDF-NFYKKFWSLQDYFRNPVQCYNKV---------SW 228
            +D+ + NE  E  +E   ++ I    FY  FWSLQ YF NP   +++          + 
Sbjct: 70  TIDIEAVNEHCEPPSETKPEILIHHPQFYVLFWSLQRYFTNPTLLFSESDAPGAKAAEAL 129

Query: 229 KMFTSYAETVLAAFKSYK--LDDVQSSLNPSGD-----------------YFAKYLTNQK 269
            +    A T +  F++    + DV   LN   D                  + +Y   Q 
Sbjct: 130 GLQERQASTPMHTFQTGIQCVLDVMRWLNKKSDSPPKTKRARFETSASSPTYPRYRCTQT 189

Query: 270 LLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLE------------LKSDQEKWVK 317
           L   +L +  F++  L+Q LI FQY      +  E  +            LK ++E+W++
Sbjct: 190 LFGYELHNLAFQQQYLVQCLITFQYLLGQTSATHEQSKEWKNKLLVPMHTLKQEEEQWLR 249

Query: 318 DTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKR----PLTSITDED 373
            T   + ++++++  +G  F   V ++L+ E  W +WK    P   +    P T     D
Sbjct: 250 KTWRQIQTVLRESSQEGRVFLDAVLMLLRRESSWIRWKAASAPSFHKDALNPDTLQAWSD 309

Query: 374 KKDEPDAKKKK-------TPELTKLWNSKDNLEACKSAE--------RDFTPSLESYFEE 418
           K     A+++        TPEL++LW  +D     + +E         D T   + + E 
Sbjct: 310 KIRTTFAQQEPWFPHALGTPELSRLW--EDGFHVAEPSEMHVNDEEGHDKTIRTDGWEEL 367

Query: 419 AIQQMDPA-----AAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
              +  P+      A+E         + +WRALR  S +  H F
Sbjct: 368 KFPRAPPSLRSLCRAIEHCTDPDQAQSLSWRALRCASLEHLHVF 411


>gi|343425449|emb|CBQ68984.1| related to nuclear matrix protein p84 [Sporisorium reilianum SRZ2]
          Length = 865

 Score =  113 bits (282), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 63/156 (40%), Positives = 89/156 (57%), Gaps = 7/156 (4%)

Query: 68  LCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMC 126
           L + D+ + +LP+ L+ +  +   +  C +LF Y E  V            K   LLR C
Sbjct: 133 LSLHDLVDPSLPLSLIEELIEGQPISTCSRLFAYTESRVQPLTIDLHPTRGKGLVLLRTC 192

Query: 127 NDLLRRLSRS--QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVS 184
           N+LLRRLS+   Q+TVF GRIL  LAK FP  ERSG+N+  EFNVEN T     E+    
Sbjct: 193 NELLRRLSKPSQQHTVFAGRILSLLAKVFPLGERSGVNLRGEFNVENKTHI---EDRAQD 249

Query: 185 SNETEETDTEEVDKV-KIDFNFYKKFWSLQDYFRNP 219
           ++E     ++E D+V K+D+ FY+ FW +Q +F NP
Sbjct: 250 ADEDAADGSDEKDEVFKVDYEFYELFWKVQRFFSNP 285



 Score = 91.3 bits (225), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 77/274 (28%), Positives = 122/274 (44%), Gaps = 78/274 (28%)

Query: 258 GDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKS-----RGEGLE-- 307
           G +F  YLT +KL + ++ DT+FR+++L+Q+ +LFQY   FT   K      + + L+  
Sbjct: 451 GHFFPTYLTGRKLFEHEIRDTSFRKHILVQYFVLFQYLLSFTPLAKESWKDWKNKTLQAL 510

Query: 308 --LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
             L+   E+W++ T   +  LI++  PDG  F + V  +LK E +W +WK E CP +++P
Sbjct: 511 FTLEDADERWIRSTWREIQGLIREIQPDGRLFLESVLEVLKREANWIRWKTESCPSIEKP 570

Query: 366 LTSITDED------------KKDEPDAKKKKTPELTKLWNSKDNLEACKSAER------- 406
              ++ E             +K EP   K  T  L++LW  +D L       R       
Sbjct: 571 --PLSPEQIAQFASARALLLRKPEPYPHKLGTAALSELW--QDGLRPPVPGTRRVENEMG 626

Query: 407 -------------DFTPSLES--YFEEAIQ-----------QMDPAAAV-------EEQY 433
                        +F P + S  ++ ++IQ           Q+   AAV       EE+ 
Sbjct: 627 EYVTIATDGLEELEFLPPIPSLEHYNKSIQREKMKLDLRKKQLGIDAAVGEAQLSKEERE 686

Query: 434 KKVND----------SNYAWRALRLLSRKCPHFF 457
           K   D           + AWRALR+ SR   H F
Sbjct: 687 KLDKDERVQATIESMQSLAWRALRVASRDNLHLF 720


>gi|392578305|gb|EIW71433.1| hypothetical protein TREMEDRAFT_28261 [Tremella mesenterica DSM
           1558]
          Length = 670

 Score =  111 bits (278), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 109/457 (23%), Positives = 190/457 (41%), Gaps = 86/457 (18%)

Query: 80  IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRLSRS-Q 137
           I LL +  ++  ++    L+ YVE     + +       K   LLR+ N  LR L R+ +
Sbjct: 145 IPLLEELVELIPVETWRDLWAYVETRAKRFTKDMPASRGKALPLLRIINSFLRFLPRTPE 204

Query: 138 NTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEE------- 190
           + +F GR+  F +     +++S +N+  ++N    T    D     + N + E       
Sbjct: 205 DLIFRGRVHQFASSVISVADKSAINMRGDYNDIKTTWDESDTSQSSTLNASAEKEEDGDV 264

Query: 191 ---TDTEEVDKVKIDFNFYKKFWSLQDYFRNP-------VQC----YNKVSWKMFTSYAE 236
              +D +E  K   +  FY   WSLQ YF  P       VQ       +  ++ F +  E
Sbjct: 265 KMQSDVKEEVKASTETEFYSTLWSLQHYFAYPPSLDGPAVQSNPGEPARTPFETFKAKTE 324

Query: 237 TVLAAF-------------------KSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSD 277
            VL                      +  K  + QS  +  G +  ++LT ++L D +L+D
Sbjct: 325 FVLPKLFEQTQKEKEMMGKDAEVISRKRKRSEGQSDASARGFFHPRFLTGKRLFDHELTD 384

Query: 278 TNFRRYVLLQFLILFQYF------TSTVKSRGEGLE----LKSDQEKWVKDTTETVYSLI 327
           ++FRR +L+Q+ ILFQ+       T+  ++   G+     +  + E W+K     +   +
Sbjct: 385 SSFRRQILVQYFILFQFLLNLTPSTAAKQAFTGGMPKTFIVDQENEDWIKSKVAEIKEEL 444

Query: 328 KQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPE-------LKRPLTSITDED--KKDEP 378
           ++  PDG  F + V  I+  E ++ QWKN+ CPE       L + +     E   K+  P
Sbjct: 445 RKMIPDGPMFEETVLSIITRERYYAQWKNDSCPEGVFEVPPLDKAVAKKAAEQWRKRSAP 504

Query: 379 DAK---KKKTPELTKLWNSK----DNLEACKSAERDFTPSLESYFEEAIQQMDP------ 425
            ++   K  +  L++LWN      D L+  +S      P+     +E I++++       
Sbjct: 505 PSQYPYKLGSRSLSRLWNRGFTGIDQLKGWRS------PATAVSLDEEIKRIEADDEDDK 558

Query: 426 -----AAAVEEQYKKVNDSNYAWRALRLLSRK-CPHF 456
                 +A E    K   +   WR LR  SR    HF
Sbjct: 559 AMGKEPSAEEAALNKDRKTRLTWRGLRQASRTDLRHF 595


>gi|71020959|ref|XP_760710.1| hypothetical protein UM04563.1 [Ustilago maydis 521]
 gi|46100304|gb|EAK85537.1| hypothetical protein UM04563.1 [Ustilago maydis 521]
          Length = 879

 Score =  111 bits (278), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 66/164 (40%), Positives = 93/164 (56%), Gaps = 5/164 (3%)

Query: 60  NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK 119
           + + + + L ++D+ + TLP+ L+ +  +   +  C +LF Y E  V            K
Sbjct: 96  DRLDVILVLSLQDLVDPTLPLSLVEELIESQPISTCSRLFGYTESRVKPLTVDLHPTRGK 155

Query: 120 N-NLLRMCNDLLRRLSRS--QNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
              LLR CN+LLRRLS+   ++TVF GRIL  LAK FP  ERSG+N+  EFNVEN T   
Sbjct: 156 GLVLLRTCNELLRRLSKPSQEHTVFAGRILSLLAKVFPLGERSGVNLRGEFNVENKTSIE 215

Query: 177 GDEEMDVSSNETEETDTEEVDKV-KIDFNFYKKFWSLQDYFRNP 219
              E D      +E D +E DKV K+D+ FY+ FW +Q +F NP
Sbjct: 216 EKVE-DAKPEADDEADRDERDKVFKVDYEFYEVFWKVQRFFSNP 258



 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 88/345 (25%), Positives = 148/345 (42%), Gaps = 84/345 (24%)

Query: 258 GDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEG------------ 305
           G +F KYLT +KL + ++ +T+FR+++L+Q+ +LFQY  S   +  E             
Sbjct: 427 GHFFPKYLTGRKLFEYEIRNTSFRKHILVQYFVLFQYLLSYTPAAKESWKDWKNKTLQAL 486

Query: 306 LELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
             L+   E+W++ T   + +L++   PDG  F   V  +L+ E +W +WK E CP +++P
Sbjct: 487 FTLEEADERWIRSTWREIQALMRDIQPDGRLFLDSVLEVLRREANWIRWKTESCPSIEKP 546

Query: 366 LTSITDED------------KKDEPDAKKKKTPELTKLWNS--KDNLEACKSAERDF--- 408
              ++ E             +K EP   K  T  L++LW    +  +   + AE +    
Sbjct: 547 --PLSPEQLAQFASARALLLRKAEPYPHKLGTAALSELWEDGLRPPVAGTRRAENEMGEY 604

Query: 409 ----------------TPSLESYFEEAIQQ-------------MDPAAA-----VEEQYK 434
                            PSLE ++ ++IQ+             +DPA +      E+Q K
Sbjct: 605 VTVATDGLEELEFAPAAPSLE-HYNKSIQREKMKLDLRKKQLGIDPAVSETNLSKEDQSK 663

Query: 435 KVND----------SNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPS 484
              D           + AWRALR+ SR   H F    P+  +N E +   ++     KP 
Sbjct: 664 LDKDDRVQATIETMQSLAWRALRVASRDNLHLF--GKPDRPENIERLLQAMEEERNPKP- 720

Query: 485 SQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEGDA 529
                N       P   E DT+S +    +  +  EA+A+ + DA
Sbjct: 721 -----NAVTPSAQPTSDESDTRSGQEANGDTSKGKEAEAEAKTDA 760


>gi|406696339|gb|EKC99630.1| hypothetical protein A1Q2_06049 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 722

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 103/367 (28%), Positives = 155/367 (42%), Gaps = 66/367 (17%)

Query: 80  IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRLSRSQN 138
           I LL +  ++ +++     F Y E     + +       K   LLR  N  LR LS +  
Sbjct: 140 IPLLEEFVELLSVETWHSFFSYFETREKRFTRDMPASKGKALPLLRTINTFLRTLSSTPA 199

Query: 139 TV-FCGRILLFLAKFFPFSERSGLNIISEFN----------VENITEFGGDEEMDVSSNE 187
            +   GR+ LF ++    S++S +N+  E++           E      GD EMD  + +
Sbjct: 200 DLQLRGRVQLFASRAISVSDKSAINMRGEYSGIRTTWEETPEETAAGADGDVEMDGGAAK 259

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKL 247
            E+ +  E DK      FY   WSLQ YF +P       + +     A+T    F+S   
Sbjct: 260 GEKGEKSEDDK------FYNTLWSLQQYFAHPPSLDGPATGEP----AQTPFERFRSAS- 308

Query: 248 DDVQSSL---------NPSGDY-----------FAKYLTNQKLLDLQLSDTNFRRYVLLQ 287
           D+V   L         NP+              F +YLT + LL  +L+DT FRR VLLQ
Sbjct: 309 DEVLERLFAETARERANPTPKVDRGPDAETEVVFPRYLTARALLRHELADTGFRRQVLLQ 368

Query: 288 FLILFQY---FTSTVKSR--GEGLELKS-----DQEKWVKDTTETVYSLIKQT-PPDGEH 336
           + ILFQ+    T  V S+    G   KS     D  +W+K T   +   +K+    DG  
Sbjct: 369 YFILFQFLLNLTPGVASKQQATGGMPKSFIIDDDAARWIKSTVSRIRDELKRIDSTDGLR 428

Query: 337 FSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD------------EDKKDEPDAKKKK 384
           F + V  ++  E  + QWKN+GCPE    +  +              + K   P   K+ 
Sbjct: 429 FEETVVELMNRERRYAQWKNDGCPEADWAIAPVPPTEGKATARKWERQSKPPRPWPYKQG 488

Query: 385 TPELTKL 391
           TP LT+L
Sbjct: 489 TPTLTRL 495


>gi|401887333|gb|EJT51323.1| hypothetical protein A1Q1_07504 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 722

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 96/365 (26%), Positives = 153/365 (41%), Gaps = 62/365 (16%)

Query: 80  IILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRLSRSQN 138
           I LL +  ++ +++     F Y E     + +       K   LLR  N  LR LS +  
Sbjct: 140 IPLLEEFVELLSVETWHSFFSYFETREKRFTRDMPASKGKALPLLRTINTFLRTLSSTPA 199

Query: 139 TV-FCGRILLFLAKFFPFSERSGLNIISEFN----------VENITEFGGDEEMDVSSNE 187
            +   GR+ LF ++    S++S +N+  E++           E      GD EMD  + +
Sbjct: 200 DLQLRGRVQLFASRAISVSDKSAINMRGEYSGIRTTWEETPEETAAGADGDVEMDGGAAK 259

Query: 188 TEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVS-------WKMFTSYAETVLA 240
            E+ +  E DK      FY   WSLQ YF +P       +       ++ F S ++ VL 
Sbjct: 260 GEKGEKSEDDK------FYNTLWSLQQYFAHPPSLDGPATGEPAQTPFERFRSASDEVLE 313

Query: 241 AFKSYKLDDVQSSLNPSGDY-----------FAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
              +   +  +   NP+              F +YLT + LL  +L+D  FRR VLLQ+ 
Sbjct: 314 RLFA---ETARERANPTPKVDRGPDAETEVVFPRYLTARALLRHELADIGFRRQVLLQYF 370

Query: 290 ILFQY---FTSTVKSRGEGLE-------LKSDQEKWVKDTTETVYSLIKQT-PPDGEHFS 338
           ILFQ+    T  V S+ +          +  D  +W+K T   +   +K+    DG  F 
Sbjct: 371 ILFQFLLNLTPGVASKQQATGGMPKSFIIDDDAARWIKSTVSRIRDELKRIDSTDGLRFE 430

Query: 339 QVVKLILKGEEHWNQWKNEGCPELKRPLTSITD------------EDKKDEPDAKKKKTP 386
           + V  ++  E  + QWKN+GCPE    +  +              + K   P   K+ TP
Sbjct: 431 ETVVELMNRERRYAQWKNDGCPEADWAIAPVPPTEGKATARKWERQSKPPRPWPYKQGTP 490

Query: 387 ELTKL 391
            LT+L
Sbjct: 491 TLTRL 495


>gi|213408931|ref|XP_002175236.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
 gi|212003283|gb|EEB08943.1| conserved hypothetical protein [Schizosaccharomyces japonicus
           yFS275]
          Length = 690

 Score =  107 bits (268), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 105/419 (25%), Positives = 175/419 (41%), Gaps = 62/419 (14%)

Query: 82  LLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVF 141
           L+ +  D+ ++  CE+L+ Y      +    +        LLR+ N+LLRRLSR  +  F
Sbjct: 106 LMEELVDIHSVQDCEKLYEYFTTRPLVTVGLSGTRGKGPVLLRISNNLLRRLSRVDHPAF 165

Query: 142 CGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEE----TDTEEVD 197
           CGRI +FL+  F   ERSG N+  EF++ N       E ++V     E     T  E   
Sbjct: 166 CGRIHMFLSSVFGPHERSGANLRGEFDIAN-------EPVNVEDTLPENIRALTTAETRI 218

Query: 198 KVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSSLNPS 257
           K  +  + ++  WSLQ+Y  NP + Y+    +MFT+  + VL  F       V  + + S
Sbjct: 219 KQSVLDDVFEAIWSLQNYMSNPTKLYSSQEIEMFTTNTQKVLECFNGVHKLSVFQNRDDS 278

Query: 258 GD-------------------YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---F 295
            D                      +Y+ + +LL  QL D  F    L+Q  IL  +   F
Sbjct: 279 VDLHKRFQILLRRAFSEKHLLLLNRYIRSHELLAYQLLDEQFSVEYLVQISILLNHLLDF 338

Query: 296 TSTVKSRGEGLE-----------LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLI 344
           T   + + + L+           L S+ EK +    +    L+ Q P      +  ++ +
Sbjct: 339 TQENRKKQDSLQFTNKALISSFVLDSEDEKKLLKVKQATEKLLDQVPESV--LTNTLRSL 396

Query: 345 LKGEEHWNQWKNEGCPELKRPLTSITDEDKKDE-------PDAKKK---KTPELTKLWNS 394
              E +W  WK   C  +++PL   +  +   E       P    +    T  L+K+W+S
Sbjct: 397 FTIENNWKIWKARNCLTIEKPLLDSSFLEAAQEGLNKLTVPATALRYCMGTASLSKIWSS 456

Query: 395 --KDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAV----EEQYKKVNDSNYAWRALR 447
             ++NL+  +  E    P+   +     ++ + AA      E QY     ++  WRALR
Sbjct: 457 SVEENLQDLQRKEATQVPAATDFLNGVEKEENEAAEAVREDERQYHLQAKASKTWRALR 515


>gi|380492396|emb|CCF34633.1| guanylate kinase, partial [Colletotrichum higginsianum]
          Length = 364

 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 79/296 (26%), Positives = 137/296 (46%), Gaps = 65/296 (21%)

Query: 34  SALEQSIRQYLLKLIKTPDI---DIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           + +E ++R     LI T  I   D   + N   L   L   + C+  L   L+ +  D  
Sbjct: 79  ATIETAVRNMFSNLIATTSIESPDFVKVWNLFDLLTILSDDEKCDPALLFWLVEELLDSQ 138

Query: 91  TLDKCEQLFYYVE-----VNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
           T+  C ++F ++E     + V  +KQ+         +LR CN+LLRRLSR+++T FCGR+
Sbjct: 139 TIAGCRKIFDFLESRRERITVKHFKQKQLV------ILRSCNELLRRLSRAEDTAFCGRV 192

Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNE-TEETDTEEVDKVK---- 200
            +F+ + FP  ++S +N+  E++VEN+T F   ++  V ++E TE+ D +  + VK    
Sbjct: 193 FIFMFQSFPLGDKSSVNLRGEYHVENVTVF---DDTGVKTDESTEQMDIDSQNGVKDGEV 249

Query: 201 ------IDFN---------------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVL 239
                 + F+                Y +FWSLQ+ F  P + ++      F    E  +
Sbjct: 250 KSQAKAVSFDAKQPTQPDQPLNPDALYPQFWSLQETFSQPKKLFDPSHLANFKRGLEATM 309

Query: 240 AAFKSY---------------------KLDDVQSSLNPSGDYFAKYLTNQKLLDLQ 274
            +F++                      K DDV+ S + +  Y  KYLT++ L +L+
Sbjct: 310 TSFRAIQNEQGQRPPKAVDEAQRGLKRKRDDVEGS-DLANSYNPKYLTSRDLFELE 364


>gi|219123310|ref|XP_002181970.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406571|gb|EEC46510.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 790

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 101/414 (24%), Positives = 173/414 (41%), Gaps = 57/414 (13%)

Query: 71  KDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVN--------IWK------QQTFFM 116
           KDM    +P +LL D  D+ +     Q F+   V  +        +W         +   
Sbjct: 109 KDMSVRKVPFVLLEDILDVLSSPLALQ-FWSSRVRPSYDFLFAPTLWSPIRGDSAASPSH 167

Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
            C    L++    LRRL           IL+ L+  +P SE+S   +      E+ TE+ 
Sbjct: 168 PCWLPFLKISQKFLRRLVPEA----AAPILVQLSTVYPLSEKSATKVWGSHG-ESTTEYD 222

Query: 177 GDEEMDVSSNE-TEETDTEEVDKVKI-DFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSY 234
             E+        T +T T       + D++FY+ FW LQ+   NP    N +    F S 
Sbjct: 223 SLEDFHKEEQSITLDTTTPNGSSSSVYDYSFYESFWRLQEDLSNP----NSIKVAGFLSR 278

Query: 235 AETVLAAFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY 294
             +++ AF++   D   S  +P  + +  YLT+ ++L +QL D +F+ +VL QFLI+ ++
Sbjct: 279 VRSMMTAFENQTCDTNTSQDSPI-NLYGHYLTSSRVLAIQLLDASFQIHVLTQFLIVAKH 337

Query: 295 FTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEH-WNQ 353
             + V            + +     T     L  +    G H  +++  + +G E  W  
Sbjct: 338 LMAQVPVL---------ESQLADHVTRAKNRLQLELGDAGRHQLELLHHLWQGSESLWRD 388

Query: 354 WKNEGCPE-----------------LKRPLTSITDEDKKDEPDAKKKKTPELTKLWNSKD 396
           WK + CP                   KR L ++   + +   DA ++ T     L    D
Sbjct: 389 WKRKKCPADIDAPKLALSAAGGSPPRKRLLGALGSGNGESN-DADERNTD--YSLAQVHD 445

Query: 397 NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLS 450
            L A     +   P L ++ ++ ++ +DP A +E +Y   N++ + WRALRLLS
Sbjct: 446 ELPALSKRMKRLAPDLYTHLQDYVEALDPDAGIEAEYHPKNNALFGWRALRLLS 499


>gi|342319244|gb|EGU11194.1| Hypothetical Protein RTG_02997 [Rhodotorula glutinis ATCC 204091]
          Length = 727

 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 90/363 (24%), Positives = 152/363 (41%), Gaps = 65/363 (17%)

Query: 77  TLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRL--- 133
           TLP+ +L+   ++  +  CE +  Y+E  V    +   +   +  +L    + L R    
Sbjct: 116 TLPLSVLAALMEVRPISACEPIVAYIESRVGPLTKGMEYQRGRGPILLRLLNDLLRRLPR 175

Query: 134 SRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--------------GGDE 179
           S+SQ+ +  GRIL+ L+  +P  E+SG+N+   FN+   T F                 E
Sbjct: 176 SQSQSVILSGRILMLLSSAYPLGEKSGVNLRGNFNIGKGTVFEQEAEKEMEEAKEDAKKE 235

Query: 180 EMDVSSNETEETDTEEVDKVKIDFN---FYKKFWSLQDYFRNPVQCYNKV---------- 226
           ++ +   E EE +  +  +   + N   FY  FWSLQ  F NP   +             
Sbjct: 236 DVKMEVEEGEEEELPKDAENPAESNPSQFYTTFWSLQRIFNNPPLLFAAPPVASTSADAD 295

Query: 227 SWKMFTSYAETVLAAF-----KSYKLDDVQSSLNPSGD---------------YFAKYLT 266
            ++   +     LAAF     K  +L         SG                +F K+LT
Sbjct: 296 PFRTLKNGLRQTLAAFAAATKKEKELAGSAKEGAVSGKAKESEVDAEEGLEEYFFPKFLT 355

Query: 267 NQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKSRGEGLE-----------LKSDQ 312
           ++ LL+L+L+  NFRR +L+Q LILFQY    T   ++R +              L+++ 
Sbjct: 356 SRNLLELELAAVNFRRQILVQMLILFQYLLSLTPAERTRVQSFHPLNAAALSPFVLQTEN 415

Query: 313 EKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
           E W+++        +      G  F + V+L+L+ E++W  WK   C    +P      E
Sbjct: 416 ETWIREMRSRTLDELDSM-DGGRRFRKAVQLVLQREQNWMDWKLRSCAPFTKPSLDAAAE 474

Query: 373 DKK 375
            +K
Sbjct: 475 SEK 477


>gi|134113815|ref|XP_774492.1| hypothetical protein CNBG1380 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50257130|gb|EAL19845.1| hypothetical protein CNBG1380 [Cryptococcus neoformans var.
           neoformans B-3501A]
          Length = 761

 Score =  103 bits (256), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 110/446 (24%), Positives = 174/446 (39%), Gaps = 117/446 (26%)

Query: 122 LLRMCNDLLRRLSRS-QNTVFCGRILLFLAKFFPFSERSGLNIISEFN------------ 168
           LLR  N  LR L R+ ++ VF GRI  F +  F  +++S +N+  +++            
Sbjct: 175 LLRTINAFLRFLPRTPEDLVFRGRIHQFASSVFSVADKSAINMRGDYSEVKTVWEEEPEM 234

Query: 169 ---------------VENITEFGGDEEMDVSSNETEETD--TEEVDKVKIDFN------- 204
                          ++   E GG ++ DV   + E+ D  TE       D +       
Sbjct: 235 EKIKVEEKEENKEGKMDQRAEEGGKKDGDVEMEDAEKKDEHTEHSSNQPADQSMEQPTED 294

Query: 205 -------------FYKKFWSLQDYFRNP-------VQCYNKVSWKMFTSYAETVLA---- 240
                        FY   WSLQ YF +P       V    K  ++ F   ++ VL     
Sbjct: 295 EPPLTPATLQEPEFYSTLWSLQQYFSHPPSLDGPAVGSPPKTPFQTFREKSDFVLPQLFE 354

Query: 241 -------------AFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQ 287
                             +   V    +  G +  ++LT ++L + +L+D++FRR +L+Q
Sbjct: 355 QTRKEKAMAGTDDGLGKKRKRQVMEGQDTGGFFHPRFLTGKRLFEYELADSSFRRQILVQ 414

Query: 288 FLILFQYFTSTVKSRG------EGLE----LKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
           + ILFQ+  +   +         G+     L+   E+WVK     +   +K     G  F
Sbjct: 415 YFILFQFLLNLTPAHAGKQAFTGGMPRTFMLEQADEQWVKSKIGGIKEELKNI-VGGASF 473

Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPD---AKKKKTP-------- 386
              V  I++ E H+ QWKNEGCPE    + ++ D D   EP    AK+   P        
Sbjct: 474 EDTVFSIIRQEAHYVQWKNEGCPEGSFEIPAL-DPDSASEPAQAWAKRLNPPAPYNFKVG 532

Query: 387 --ELTKLWNSK-DNLEACKSAERDFTPSLESYFEE------------AIQQMDPAAAVEE 431
              L+ LWN+   N+   K  E+  T  +E+  EE            A+ Q  P      
Sbjct: 533 SRPLSMLWNNGFTNINQLKRREKATT--VEALGEEIKRIEEDEEDDKAMGQGTPEKLAAN 590

Query: 432 QYKKVNDSNYAWRALRLLSRKCPHFF 457
           + +K   +   WRALRL S     FF
Sbjct: 591 KERKTTST---WRALRLASHTSLKFF 613


>gi|58269822|ref|XP_572067.1| hypothetical protein [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|57228303|gb|AAW44760.1| conserved hypothetical protein [Cryptococcus neoformans var.
           neoformans JEC21]
          Length = 761

 Score =  102 bits (255), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 110/446 (24%), Positives = 174/446 (39%), Gaps = 117/446 (26%)

Query: 122 LLRMCNDLLRRLSRS-QNTVFCGRILLFLAKFFPFSERSGLNIISEFN------------ 168
           LLR  N  LR L R+ ++ VF GRI  F +  F  +++S +N+  +++            
Sbjct: 175 LLRTINAFLRFLPRTPEDLVFRGRIHQFASSVFSVADKSAINMRGDYSEVKTVWEEEPEM 234

Query: 169 ---------------VENITEFGGDEEMDVSSNETEETD--TEEVDKVKID--------- 202
                          ++   E GG ++ DV   + E+ D  TE       D         
Sbjct: 235 EKIKVEEKEENKEGKMDERAEEGGKKDGDVEMEDAEKKDEHTEHSSNQPADQSMEQPTED 294

Query: 203 -----------FNFYKKFWSLQDYFRNP-------VQCYNKVSWKMFTSYAETVLA---- 240
                       +FY   WSLQ YF +P       V    K  ++ F   ++ VL     
Sbjct: 295 EPPLTPAALQEPDFYSTLWSLQQYFSHPPSLDGPAVGSPPKTPFQTFREKSDFVLPQLFE 354

Query: 241 -------------AFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQ 287
                             +   V    +  G +  ++LT ++L + +L+D++FRR +L+Q
Sbjct: 355 QTRKEKAMAGTDDGLGKKRKRQVMEGQDTGGFFHPRFLTGKRLFEYELADSSFRRQILVQ 414

Query: 288 FLILFQYFTSTVKSRG------EGLE----LKSDQEKWVKDTTETVYSLIKQTPPDGEHF 337
           + ILFQ+  +   +         G+     L+   E+WVK     +   +K     G  F
Sbjct: 415 YFILFQFLLNLTPAHAGKQAFTGGMPRTFMLEQADEQWVKSKIGGIKEELKNI-VGGASF 473

Query: 338 SQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPD---AKKKKTP-------- 386
              V  I++ E H+ QWKNEGCPE    + ++ D D   EP    AK+   P        
Sbjct: 474 EDTVFSIIRQEAHYVQWKNEGCPEGSFEIPAL-DPDSASEPAQAWAKRLNPPAPYNFKVG 532

Query: 387 --ELTKLWNSK-DNLEACKSAERDFTPSLESYFEE------------AIQQMDPAAAVEE 431
              L+ LWN+   N+   K  E+  T  +E+  EE            A+ Q  P      
Sbjct: 533 SRPLSMLWNNGFTNINQLKRREKATT--VEALDEEIKRIEEDEEDDKAMGQGTPEKLAAN 590

Query: 432 QYKKVNDSNYAWRALRLLSRKCPHFF 457
           + +K   +   WRALRL S     FF
Sbjct: 591 KERKTTST---WRALRLASHTSLKFF 613


>gi|405121693|gb|AFR96461.1| hypothetical protein CNAG_03235 [Cryptococcus neoformans var.
           grubii H99]
          Length = 777

 Score =  102 bits (254), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 111/455 (24%), Positives = 182/455 (40%), Gaps = 126/455 (27%)

Query: 122 LLRMCNDLLRRLSRS-QNTVFCGRILLFLAKFFPFSERSGLNIISEFN-VENITE----- 174
           LLR  N  LR L R+ ++ VF GRI  F +  F  +++S +N+  +++ V+ + E     
Sbjct: 175 LLRTINAFLRFLPRTPEDLVFRGRIHQFASSVFSVADKSAINMRGDYSEVKTVWEKESEV 234

Query: 175 --------------------FGGDEEMDVSSNETEETDT---EEVDKVKI---------- 201
                                GG ++ DV   + E+ D    ++ D+             
Sbjct: 235 EQIKEEKKKKKEGKMEEKAEEGGKKDGDVEMEDAEKKDEPAEQQADRPTGRSTYQAADQS 294

Query: 202 ------------------DFNFYKKFWSLQDYFRNP-------VQCYNKVSWKMFTSYAE 236
                             D +FY   WSLQ YF +P       V    K  ++ F   ++
Sbjct: 295 MEQMPTEDEAPSTPSTLKDPDFYSTLWSLQQYFSHPPSLDGPAVGSPPKTPFQTFRDKSD 354

Query: 237 TVL------------------AAFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDT 278
            +L                     K  K D ++   +  G +  ++LT ++L + +L+D+
Sbjct: 355 FILFQLFEQTKKEKAMTGTEDGMGKKRKRDVMEGGQDTGGFFHPRFLTGKRLFEYELADS 414

Query: 279 NFRRYVLLQFLILFQYFTSTVKSRG------EGLE----LKSDQEKWVKDTTETVYSLIK 328
           +FRR +L+Q+ ILFQ+  +   +         G+     L+   E+WVK     +   +K
Sbjct: 415 SFRRQILVQYFILFQFLLNLTPAHAGKQAFTGGMPRTFMLEQTDEQWVKSKIGGIKDELK 474

Query: 329 QTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPD---AKKKKT 385
           +    G  F   V  I++ E H+ QWKNEGCPE    + ++ D D   EP    AK+   
Sbjct: 475 RI-VGGVSFEDTVFSIIRQEAHYVQWKNEGCPEASFEIPAL-DPDSASEPAQAWAKRLNP 532

Query: 386 P----------ELTKLWNSK-DNLEACKSAERDFTPSLESYFEE------------AIQQ 422
           P           L+ LWN+   N++  K  E+  T  +E+  EE            A+ Q
Sbjct: 533 PAPYNFKVGSRPLSMLWNNGFTNIDQLKGREKATT--VEALDEEIKRIEEDEEDDKAMGQ 590

Query: 423 MDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
             P      + +K   +   WRALRL S     FF
Sbjct: 591 DTPEKLAANKERKTTST---WRALRLASHTSLKFF 622


>gi|393246855|gb|EJD54363.1| hypothetical protein AURDEDRAFT_156143 [Auricularia delicata
           TFB-10046 SS5]
          Length = 619

 Score =  102 bits (253), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 74/267 (27%), Positives = 119/267 (44%), Gaps = 36/267 (13%)

Query: 49  KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNI 108
           + PD   + + N + L + +     C   +   LL D   + T+D C  +F ++E     
Sbjct: 75  RDPDAYYQALRNMLDLVLCVAEHGACEELITWTLLEDLLKLQTVDTCWHIFLWIESRAE- 133

Query: 109 WKQQTFFMSCKNN---LLRMCNDLLRRLSRSQNT-VFCGRILLFLAKFFPFSERSGLNII 164
            +     +  KN    LLR  N+LLRRLS++ +T  FCGRIL+FLA  F   ERSG+N+ 
Sbjct: 134 -RVTKNLIPGKNKALALLRTLNELLRRLSKAGSTSTFCGRILIFLANVFTLGERSGVNLR 192

Query: 165 SEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYN 224
            ++  E  T            +   +    E  K  +   F+++FW+LQ     P     
Sbjct: 193 GDYGPE-WTPVPKPPRATPLDDLPADVRAAEERKEGMSPLFFREFWALQSPMARPQTFSM 251

Query: 225 KVSWKMFTSYAETVL-----------AAFKSYKLDDV----------QSSLNPSGD---- 259
             ++ MF +  +TVL           A   S  L+            +   + +G+    
Sbjct: 252 PETFPMFQAAVDTVLPVLAEATKRERALMGSKALNGTAAAVAAGVKRKREAHDTGEDGGK 311

Query: 260 ----YFAKYLTNQKLLDLQLSDTNFRR 282
               +FAKYLT+ +LLDL+++D  FRR
Sbjct: 312 KKAYFFAKYLTSPELLDLEVADPQFRR 338


>gi|321260989|ref|XP_003195214.1| hypothetical protein CGB_G2020C [Cryptococcus gattii WM276]
 gi|317461687|gb|ADV23427.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
          Length = 761

 Score =  100 bits (250), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 109/439 (24%), Positives = 174/439 (39%), Gaps = 105/439 (23%)

Query: 122 LLRMCNDLLRRLSRS-QNTVFCGRILLFLAKFFPFSERSGLNIISEFN-VENI------- 172
           LLR  N  LR L R+ ++ VF GRI  F +  F  +++S +N+  +++ V+ I       
Sbjct: 175 LLRTINAFLRFLPRTPEDLVFRGRIHQFASSVFSVADKSAINMRGDYSEVKTIWEEEPKA 234

Query: 173 ---------------------------------------TEFGGDEEMDVSSNETEETDT 193
                                                  TE   D+  D S  +  E +T
Sbjct: 235 EESKVDEGKEEGKTEEKTEEEGKKDGDIEMEDAERKDEPTERPVDQPADQSMEQPTEDET 294

Query: 194 EEVDKVKIDFNFYKKFWSLQDYFRNP-------VQCYNKVSWKMFTSYAETVLAA-FKSY 245
                   D +FY   WSLQ YF +P       V    K  ++ F   ++ VL   F+  
Sbjct: 295 PLTPATLQDPDFYSTLWSLQQYFSHPPSLDGPAVGSPPKTPFQTFRDKSDFVLPQLFEQT 354

Query: 246 KLDDVQSSL----------------NPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
           K +   +                  +  G +  ++LT ++L + +L+D++FRR +L+Q+ 
Sbjct: 355 KKEKAMTGTEDGMGKKRKREAMEGEDTGGFFHPRFLTGKRLFEYELADSSFRRQILVQYF 414

Query: 290 ILFQYFTSTVKSRG----------EGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQ 339
           ILFQ+  +   +                L+   E+WVK     +   +K+    G  F  
Sbjct: 415 ILFQFLLNLTPAHAGKQAFTGGMPRTFTLEQADEQWVKSKIGGIKEELKRI-IGGTSFED 473

Query: 340 VVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEPD---AKKKKTP---------- 386
            V  I++ E H+ QWKNEGCPE    + ++ + D   EP    AK+   P          
Sbjct: 474 TVFSIIRQEAHYVQWKNEGCPEGSFEIPAL-NPDSASEPAHAWAKRLNPPAPYSFKVGSR 532

Query: 387 ELTKLWNSK-DNLEACKSAERDFT-PSLESYF------EEAIQQMDPAAAVEEQYKKVND 438
            L+ LWN+   N+   K  E+  T  +L+         EE  + M  A A +    K   
Sbjct: 533 PLSMLWNNGFTNINQLKGREKATTVEALDEEIKRIEEDEEDDKAMGQATAEKLAANKERK 592

Query: 439 SNYAWRALRLLSRKCPHFF 457
           +   WRALRL S+    FF
Sbjct: 593 TTSTWRALRLASQTSLKFF 611


>gi|380495035|emb|CCF32700.1| guanylate kinase [Colletotrichum higginsianum]
          Length = 684

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 71/277 (25%), Positives = 126/277 (45%), Gaps = 65/277 (23%)

Query: 153 FPFSERSGLNIISEFNVENITEFGGDEEMDVSSNE-TEETDTEEVDKVK----------I 201
           FP  ++S +N+  E++VEN+T F   ++  V ++E TE+ D +  + VK          +
Sbjct: 5   FPLGDKSSVNLRGEYHVENVTVF---DDTGVKTDESTEQMDIDSQNGVKDGEVKSQAKAV 61

Query: 202 DFN---------------FYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSY- 245
            F+                Y +FWSLQ+ F  P + ++      F    E  + +F++  
Sbjct: 62  SFDAKQPTQPDQPLNPDALYPQFWSLQETFSQPKKLFDPSHLANFKRGLEATMTSFRAIQ 121

Query: 246 --------------------KLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVL 285
                               K DDV+ S + +  Y  KYLT++ L +L++SD +FRR++L
Sbjct: 122 NEQGQRPPKAVDEAQRGLKRKRDDVEGS-DLANSYNPKYLTSRDLFELEMSDISFRRHIL 180

Query: 286 LQFLILFQYFTSTVKSRGE--------------GLELKSDQEKWVKDTTETVYSLIKQTP 331
           +Q LI+  +  S      E              G  L  +  KW  +   T+     +  
Sbjct: 181 VQALIIMDFLLSLSNKTKEKLAGNAQNKSVIYLGQVLSEEDTKWASEMKRTIVDKYLKHG 240

Query: 332 PDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTS 368
            DG +FS++V+ +L  +++W +WK+E CP ++ P  S
Sbjct: 241 VDGPYFSRMVETVLARDKNWIRWKSENCPSIELPAVS 277


>gi|48474722|sp|P59924.1|THOC1_RAT RecName: Full=THO complex subunit 1; Short=Tho1; AltName:
           Full=Liver regeneration-related protein LRRG175;
           AltName: Full=Nuclear matrix protein p84
 gi|33086686|gb|AAP92655.1| Da2-19 [Rattus norvegicus]
 gi|149042358|gb|EDL96065.1| rCG63287 [Rattus norvegicus]
          Length = 343

 Score = 99.8 bits (247), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 47/98 (47%), Positives = 63/98 (64%), Gaps = 1/98 (1%)

Query: 387 ELTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRA 445
           ELT+LWN   DN+EACK   R++ P LE +FEEAI+Q D    VE +YK +N+SNY W  
Sbjct: 115 ELTRLWNLCPDNMEACKLETREYMPILEEFFEEAIEQADAENMVESEYKAINNSNYGWST 174

Query: 446 LRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
           LR L+ + PHFF       +  +E++ENMV +  KE P
Sbjct: 175 LRFLAWRSPHFFQPTNQQFKNMTEYLENMVIKLAKELP 212


>gi|302834964|ref|XP_002949044.1| hypothetical protein VOLCADRAFT_104130 [Volvox carteri f.
           nagariensis]
 gi|300265789|gb|EFJ49979.1| hypothetical protein VOLCADRAFT_104130 [Volvox carteri f.
           nagariensis]
          Length = 743

 Score = 99.0 bits (245), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 112/460 (24%), Positives = 167/460 (36%), Gaps = 141/460 (30%)

Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFG 176
             K  LLRM N L RRLS++ +   CGR+L+F+++ +  +++SGLN+    N        
Sbjct: 232 GAKLALLRMSNMLFRRLSKAYDVRLCGRLLIFISRCYELNDKSGLNVAGALNT------- 284

Query: 177 GDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFR--NPVQCYNKVSWKM---- 230
               + +  +E E    + + + +ID   Y  FW LQ  F+   PV  Y KV   M    
Sbjct: 285 ---GLSLPVDEVEPGAVDSIGE-QIDRRLYACFWGLQAIFQVLQPV-VYGKVMADMSLVL 339

Query: 231 --FTSYAETV------LAAFKSYK------------LDDVQSSLNPS-------GDYFAK 263
             F +    V      +AA  S+               +V+    P+       G    +
Sbjct: 340 DEFAAIKTPVAATGPWVAARHSHHNGQHRGAGDGTDAMEVEEGGGPATGGDEGGGGVGTR 399

Query: 264 YLTNQKLLDLQLSDTNFRRYVLLQFLILFQYF-------TSTVKSRGEGLELKSDQEKWV 316
           YLT+ KL  LQ+ D  FRR+ LLQ L L QY         S +  R EGLE         
Sbjct: 400 YLTSSKLFGLQVRDCTFRRHFLLQCLALLQYIEQPSKADKSAIGQRPEGLEAMR-----T 454

Query: 317 KDTTETVYSLIKQTPP--DGEHFSQVVKLILKGEEHWNQWKN------------------ 356
           K +  T   L  +T      +H S     +LK E HW  WK+                  
Sbjct: 455 KASERTCGKLSMRTLGWRRCQHHS-----VLKHEAHWVAWKHGTQPPASQGPAAAAAASP 509

Query: 357 ------------------EGCPELKRP-------LTSITDEDKKDEPDAKKKKTPELT-- 389
                             E C +  RP       +TS +  D      A K+K       
Sbjct: 510 SAPPAAAATVAAVSAAATELCRDFTRPSAPPTREITSTSHRDTASGAQAPKRKRGTSGGR 569

Query: 390 ------------------KLWNSKDNLEACKSAER--------------DFTPSLESYFE 417
                              +    D L   K +ER                 P+      
Sbjct: 570 AGGGAAAATGGGRFAVSRGVMGLADPLAGLKVSERRTRARNLVGEEEIVPMLPTPPELLS 629

Query: 418 EAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
             ++ +DP   VE QY++V ++ +AW+ +R L+R    FF
Sbjct: 630 PILEDLDPDNDVESQYRRVREAVFAWKRMRTLARNYLEFF 669


>gi|344249117|gb|EGW05221.1| THO complex subunit 1 [Cricetulus griseus]
          Length = 211

 Score = 97.4 bits (241), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 51/118 (43%), Positives = 74/118 (62%), Gaps = 4/118 (3%)

Query: 398 LEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFF 457
           +E CKS  R++ P+LE +FEEAI+Q DP   VE ++K VN+SNY WRALRLL+R+ PHFF
Sbjct: 1   METCKSETREYMPTLEEFFEEAIEQADPENMVESEHKAVNNSNYGWRALRLLARRSPHFF 60

Query: 458 LNATPNVEKNSEFIENMVKRCVKE--KPSSQI--SGNGNGVDQDPAEVEVDTKSEEIQ 511
                  +   +++ENMV +  KE   PS +I    + N  D D    E ++KS +++
Sbjct: 61  QPTNQQFKTLPDYLENMVIKLAKELPPPSEEIKTGEDENEEDNDALLKENESKSPDVR 118


>gi|328851043|gb|EGG00202.1| hypothetical protein MELLADRAFT_118113 [Melampsora larici-populina
           98AG31]
          Length = 975

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 51/154 (33%), Positives = 91/154 (59%), Gaps = 3/154 (1%)

Query: 56  KVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFF 115
           +++ + + + + L  +D   + L +  + D  + +T+++ E++F Y+E  V+        
Sbjct: 108 QLLHDALDISLVLIEQDRAPADLTLSCIEDILETATVEEAERIFCYLESRVDRLTVGMLP 167

Query: 116 MSCKNNL-LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITE 174
              K  + LRM N+LLRRLS+S +TVFCGRIL FL+  FP +E+SG+N+   FN  N+T 
Sbjct: 168 DRGKGLIFLRMQNNLLRRLSKSLHTVFCGRILSFLSSVFPIAEKSGVNLRGAFNTGNVTH 227

Query: 175 FGGD-EEMDVSSNETEETDTEEVDKVKIDFNFYK 207
           F  D E+  V + +++ ++T + D V ++ N  K
Sbjct: 228 FDQDIEKKGVEAVDSQGSETIKSDPV-VETNLLK 260



 Score = 90.5 bits (223), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 70/234 (29%), Positives = 113/234 (48%), Gaps = 47/234 (20%)

Query: 260 YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLEL---------- 308
           +F K+LT   LLDL+++D+ FRR VL QFLI+ +Y    +++ R +   L          
Sbjct: 482 FFPKFLTKSNLLDLEVADSYFRRQVLTQFLIILKYIRGFSLEERNKRAALPKPNRVSEVP 541

Query: 309 ---KSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
              ++  + WV+     V+  +  TPP+GE F+  +K IL  E++W  WK   CP  +RP
Sbjct: 542 YVYQARDDGWVRKLNGEVWKTLYATPPNGEVFALTIKQILTREQNWIMWKAGSCPSFERP 601

Query: 366 LTSITDE-DKKDEPDAKK-----------KKTPELTKLWNSKDNLEACKSAE-RDFTPSL 412
             S+ DE  K  E + K+             TP L+KLW++K   E+ +S +  +  P L
Sbjct: 602 --SLGDEMIKAAEANYKRIIAPPMRFPHPVGTPALSKLWSNKVTPESIESMDFSEGVPDL 659

Query: 413 ESYFEEAIQQMDPAAAVEE----QYKKVND--------------SNYAWRALRL 448
            +   + + Q    +A++E    Q K   D              S+ +WRALR+
Sbjct: 660 NALHGKIMAQEKQLSALKESLLAQGKSDTDVSNDPQVQAIQERKSSLSWRALRV 713


>gi|37779809|gb|AAP23263.1| THO complex 1 [Pan troglodytes]
          Length = 200

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 40/77 (51%), Positives = 53/77 (68%)

Query: 407 DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEK 466
           ++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       + 
Sbjct: 1   EYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKS 60

Query: 467 NSEFIENMVKRCVKEKP 483
             E++ENMV +  KE P
Sbjct: 61  LPEYLENMVIKLAKELP 77


>gi|403166806|ref|XP_003326670.2| hypothetical protein PGTG_07648 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375166722|gb|EFP82251.2| hypothetical protein PGTG_07648 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 962

 Score = 89.0 bits (219), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 93/205 (45%), Gaps = 38/205 (18%)

Query: 60  NYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK 119
           + + L +      +    L +  + +  + ST+ + + +F Y+E  V            K
Sbjct: 97  HLLDLVLAFIDHGLAAPELALGCIEEILEASTVQQAQFIFGYLESRVQRLTVDMLPDRGK 156

Query: 120 NNL-LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF--- 175
             + LRM N+LLRRLS+S +TVFCGRIL FL+  FP +E+SG+N+   FN  NIT+F   
Sbjct: 157 GLIFLRMQNNLLRRLSKSLHTVFCGRILCFLSSVFPIAEKSGVNLRGAFNHGNITKFDQF 216

Query: 176 --------------------GGDEEM------DVSSNETEETDTEEVDKVKIDFN----- 204
                                G +E+      D +    EE   ++  +VK   +     
Sbjct: 217 DSPPSTMLISNNNSTTDEIEAGSKEIEKITPQDSTPRPNEEIKPDDSARVKASKSATDKR 276

Query: 205 -FYKKFWSLQDYFRNPVQCYNKVSW 228
            FY +FWSLQ    NP + +   SW
Sbjct: 277 EFYTRFWSLQALLSNPPELFK--SW 299



 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 66/241 (27%), Positives = 111/241 (46%), Gaps = 47/241 (19%)

Query: 260 YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY---FTSTVKSRGEGLE--------- 307
           +F K+LT   LLDL+++D  FRR VL QFLI+ QY   F+   +++   L          
Sbjct: 386 FFPKFLTKSNLLDLEIADPYFRRQVLTQFLIILQYIRGFSPEERAKRASLPKSNRISEIP 445

Query: 308 --LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
              ++  E WV   +  V+  +  TPP GE F+  ++ IL  E  W  WK   C   ++ 
Sbjct: 446 YVYQAKDEAWVLKLSGEVWKTLYATPPHGEIFAMTIQQILIRESSWIMWKAGSCASFEK- 504

Query: 366 LTSITDEDKKDEPDAKK-----------KKTPELTKLWNSKDNLEACKSAERDF-TPSLE 413
             S+ D D    P  K+             TP L+KLW +K  L++ ++ +     P L+
Sbjct: 505 -ASLDDLDPA--PGYKRITSNPVKFPHPVGTPALSKLWENKVTLDSIENIDPTAGVPDLK 561

Query: 414 S------YFEEAIQQM--------DPAA-AVEEQYKKVND--SNYAWRALRLLSRKCPHF 456
           +      + +  ++Q+        DP A  ++ Q +++ +  +  +WRALR+       F
Sbjct: 562 TLHLKIQHLDLQLKQLQENLPPDTDPKAETIKSQVQQLQERKTGLSWRALRIAQSSHLQF 621

Query: 457 F 457
           F
Sbjct: 622 F 622


>gi|426385300|ref|XP_004059157.1| PREDICTED: THO complex subunit 1-like [Gorilla gorilla gorilla]
          Length = 198

 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 40/74 (54%), Positives = 51/74 (68%)

Query: 410 PSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSE 469
           P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +   E
Sbjct: 2   PTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFKSLPE 61

Query: 470 FIENMVKRCVKEKP 483
           ++ENMV +  KE P
Sbjct: 62  YLENMVIKLAKELP 75


>gi|323448287|gb|EGB04188.1| hypothetical protein AURANDRAFT_72585 [Aureococcus anophagefferens]
          Length = 601

 Score = 87.4 bits (215), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 82/323 (25%), Positives = 133/323 (41%), Gaps = 58/323 (17%)

Query: 181 MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMF--------- 231
           M +S + TE    E V  +  D+  Y  FW++Q +  +P      V  +           
Sbjct: 1   MALSMDCTETATNENV--LAHDYATYSAFWAVQKWLSDPNLALRGVEERRALLANMDACC 58

Query: 232 ------------------TSYAETVLAAFKSYKLDDVQSSLNPSGDY-------FAKYLT 266
                             T  AE V +A K+     +QS+ + SG          AKYLT
Sbjct: 59  AALENTAPSEDSVKRARDTWLAERVRSAKKNACGSSIQSA-DISGTMQLGIVTMGAKYLT 117

Query: 267 NQKLLDLQLSDTNFRRYVLLQFLILFQYF---TSTVKSRGEGLELKSDQEKWVKDTTETV 323
           N +LL +QL D   R  ++ Q  ++F Y    T+       G E  S  + W       V
Sbjct: 118 NSRLLRIQLRDPVLRMQLVTQLYLVFDYLRRETAIGNKPSPGAEFLSKPD-WALPVLLRV 176

Query: 324 YSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE--DKKDEPDAK 381
             LI  TPP+G    + ++ +++ E +W+ WK  GC   +RP  S   +  +K+      
Sbjct: 177 RKLIIATPPNGPERLKSLEEVIEREVNWSTWKRLGCAPYERPSQSKEQQYAEKRKRALQS 236

Query: 382 KKKTPELTKLWNSKDNLEA---------------CKSAERDFTPSLESYFEEAIQQMDPA 426
           ++ +P +  +   +  L+A               C +   D  PS+E++ E      DP 
Sbjct: 237 ERYSPPVPVILFKRTKLKAGAKLDARENEGTMKDCCARLADAVPSIEAFLEAYEIAEDPE 296

Query: 427 AAVEEQYKKVNDSNYAWRALRLL 449
            ++EE+Y   +D  Y WRALRL+
Sbjct: 297 NSIEEEYHPKHDKVYCWRALRLV 319


>gi|258577873|ref|XP_002543118.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237903384|gb|EEP77785.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 756

 Score = 86.7 bits (213), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 117/559 (20%), Positives = 211/559 (37%), Gaps = 153/559 (27%)

Query: 34  SALEQSIRQYLLKLIKTPDID---IKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMS 90
           +A+E + R+   +L+ T  I+      I N + +       + C   L   L+ +  D  
Sbjct: 64  AAVETAFREKFYRLLATTSINEPNFVDIWNLLDIVSIFSDNERCEPGLIFWLVEELLDSQ 123

Query: 91  TLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLA 150
           T+D C ++F Y+E                               R +NT           
Sbjct: 124 TIDGCRKVFDYLESR-----------------------------RERNTAS--------- 145

Query: 151 KFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTE---------------- 194
             FP  ++S +N+  E++ ENIT F  DE    +S E +  D E                
Sbjct: 146 --FPLGDKSSVNLRGEYHTENITTF--DELPTSASAEEDAADVEMRDAEPPKALEDSTKL 201

Query: 195 -------------------------EVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVSW 228
                                    + +   ID +  Y  FW LQ  F +P + ++   +
Sbjct: 202 ETATNLQTSAEPDGQRRKPSTATPSQTEDASIDMDALYPIFWGLQASFSSPTRLFDPEHF 261

Query: 229 KMFTSYAETVLAAFKSYKLD---------------------DVQSSLNPSGDYFAKYLTN 267
             F    E+ ++ F+   +D                       ++    +  +  KYLT+
Sbjct: 262 ASFKKGLESTISNFQKVSIDLEKRGMTKGSEETSRRGIKRKRTENGSEAANTFNPKYLTS 321

Query: 268 QKLLDLQLSDTNFRRYVLLQFLILFQYFTS------------TVKSRGEGLELKSDQEKW 315
           + L +L+ +D  FRR++L+Q LI+  +  S            T KS      L  +  KW
Sbjct: 322 RDLFELEANDVAFRRHILVQSLIILDFLISLTPRAKAKLADATNKSVLYSYVLSDEDAKW 381

Query: 316 VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD---- 371
                 ++   ++Q   DG+ + ++V  +L  +++W +WK E CP +++    + +    
Sbjct: 382 ATQMKSSIAGYLQQG-LDGKFYYRMVDTVLTRDKNWVRWKAEACPAIEKAPIQVQEYLDT 440

Query: 372 --EDKKDEPDAKKKKTP--ELTKLWNSKD----NLEACKSAERDFTPSLESYFE------ 417
                K   + + + TP   L   + S+D    NL+  + A+R   PS +SY        
Sbjct: 441 QSGVMKLTTNKRLRATPLGSLDLKFLSEDANLGNLDRLREADRFSNPSPDSYMRGIADDE 500

Query: 418 ---EAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENM 474
              E  Q  D     E++      ++ AWR LRL SR      LN    ++ + + ++ +
Sbjct: 501 FNIEMAQNDD-----EKEEAARGKASKAWRVLRLSSRSK----LNKFDKID-DGKNLKIL 550

Query: 475 VKRCVKEKPSSQISGNGNG 493
            +    EKP S++ G G G
Sbjct: 551 FESPPPEKPGSEVEG-GEG 568


>gi|440802190|gb|ELR23123.1| hypothetical protein ACA1_249770 [Acanthamoeba castellanii str.
           Neff]
          Length = 466

 Score = 82.4 bits (202), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 46/130 (35%), Positives = 74/130 (56%), Gaps = 3/130 (2%)

Query: 62  ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNN 121
           I + + L  + +  S  P+ L+ D F   T+++C+Q F  +E   +   +     S  NN
Sbjct: 117 IDIALWLGREKVIESAAPLELVRDFFLSHTIEECQQFFALLEDRADQLSRAIIGDSRANN 176

Query: 122 -LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE 180
            LLR  +++ +RLSR+ + VF GRIL+F+A  FP S++SG+N+  E NV N+T    D+E
Sbjct: 177 TLLRALSEVRKRLSRTNDLVFAGRILMFMAYAFPLSDKSGVNLKGEANVANVTTM--DKE 234

Query: 181 MDVSSNETEE 190
              +  ET E
Sbjct: 235 PSPALEETPE 244



 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 49/170 (28%), Positives = 87/170 (51%), Gaps = 16/170 (9%)

Query: 366 LTSITDEDKKDEPDAKKKKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMDP 425
           + ++T  DK  EP    ++TPE     + +D     +  ER    SLE + E  I++MDP
Sbjct: 225 VANVTTMDK--EPSPALEETPE-----DIRDLQALLQKEERGRMVSLEEWLEPVIEEMDP 277

Query: 426 AAAVEEQYKKVN-DSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKPS 484
            AA+E QYKK+N D  Y WRA RL  ++   +F      + K    +E++ K+ +  K +
Sbjct: 278 EAAIESQYKKINTDKIYVWRAKRLTFKRSLEYFC----TLAKQDATLEDVAKQLLANKGT 333

Query: 485 SQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEGDADEVMS 534
           S      +G+  +  E + D   ++++++E     EAK   + + D+++S
Sbjct: 334 S----TEDGMVLEDREEKRDADVDDVEDKESVVKIEAKHSEDQEGDQLVS 379


>gi|300122524|emb|CBK23094.2| unnamed protein product [Blastocystis hominis]
          Length = 555

 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 96/442 (21%), Positives = 176/442 (39%), Gaps = 72/442 (16%)

Query: 69  CMKDMCNSTLPIILLSDTFDMSTLDKCEQLF-----YYVEVN--VNIWKQQTFFMSCKNN 121
           C   +C+ T P++++ + F+   +++CE ++     Y  +++  ++I +++    S    
Sbjct: 85  CKNGLCSPTTPLLIMDELFESVPINECEAIWNTFKTYQKDISELLHIDEKRCSLTSLA-- 142

Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
           +L++ N LLRR  + ++  F G +++FL+  F   ERSGLN+ ++FN          + +
Sbjct: 143 VLKLVNSLLRRTCQIRDCAFRGSLMMFLSNMFSLEERSGLNMANQFNRHASLRLDPPDVV 202

Query: 182 DVSSNETEETDTEEVDKVKIDFN-FYKKFWSLQDYFRNPVQCYNKVS-WK---MFTSYAE 236
                   +      +  K +++ FY KFW LQ+   N      + S W+       Y  
Sbjct: 203 KKILCALFDCLMCRDNGSKGNYSEFYGKFWKLQEILCNRDYVTKEASRWEELHQLIHYVT 262

Query: 237 TVLAAFK-----------SYKLDDVQSSLNP-----SGDYFAKYLTNQKLLDLQLSDTNF 280
             ++  K           S +LD +  +  P     S      Y +   +   Q+ D   
Sbjct: 263 KRMSEIKELTRAGELEKESLELDKIYLNFQPQEFDDSDKILPLYSSRASVFLTQMRDVKT 322

Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQV 340
           RR + +Q      +    +K        +S+ ++W KD  + V  L++ T P G+ F   
Sbjct: 323 RRSIGVQMYCYLMHVRPVLKR-------ESEIKEW-KDVKKDVLRLLQST-PHGDAFCNA 373

Query: 341 VKLILKGEEHWNQWKNEGCPEL--KRPLTS-------------------ITDEDKKD--- 376
           +   +  E  W  WK    P +  K P+ +                     D+  KD   
Sbjct: 374 IVKHVDREVVWEDWKESKFPPMVPKEPIPAHETVEVSLASRRPGDVRNLCRDDTLKDFVL 433

Query: 377 ---EPDAKKKKTPELT------KLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAA 427
              E D + +   E T         +  +  E C +     TP  E    + ++ MDP  
Sbjct: 434 MKREADIQIQSCREETVRREQLTYPSHPEVPENCTTEAMGATPFAEVMARDLVEAMDPEN 493

Query: 428 AVEEQYKKVNDSNYAWRALRLL 449
             EE+Y    +  Y WR LRLL
Sbjct: 494 GFEEEYMPYRNPIYNWRLLRLL 515


>gi|76156452|gb|ABA40374.1| SJCHGC08626 protein [Schistosoma japonicum]
          Length = 105

 Score = 78.6 bits (192), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 39/91 (42%), Positives = 50/91 (54%), Gaps = 3/91 (3%)

Query: 277 DTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQ---TPPD 333
           D +FRRY+L+Q LILFQY T+ VK +     L  DQ  WV    E V  L+       P 
Sbjct: 2   DPSFRRYILVQLLILFQYLTTAVKFKTIDQVLSEDQHSWVNQRHEVVLRLLSSNNTNSPS 61

Query: 334 GEHFSQVVKLILKGEEHWNQWKNEGCPELKR 364
              F   V+ IL+ E +WN+WKN+GCP   R
Sbjct: 62  DNTFVSTVEHILERESYWNRWKNDGCPSFIR 92


>gi|145506230|ref|XP_001439081.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124406254|emb|CAK71684.1| unnamed protein product [Paramecium tetraurelia]
          Length = 554

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 88/365 (24%), Positives = 154/365 (42%), Gaps = 73/365 (20%)

Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
           LLR+CN +L+RLS + +T   G++ +FL + F     S ++        NI + G DEE 
Sbjct: 140 LLRICNSMLKRLSTTHDTFLRGQVQIFLTQIF-----STIHPSLRKKPINIRDRGFDEEA 194

Query: 182 DVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDY-------FRNPV----------QCYN 224
            V  N +            + ++FYK FW+LQ Y       FRN V          Q   
Sbjct: 195 -VKVNAS------------LPYSFYKNFWTLQKYLNNSELIFRNKVEFDDIGEEAGQVSQ 241

Query: 225 KVS----WKMFTSYAETVLAAFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
            VS     K   +  + ++ A + +K + ++  +  +   F K+LT   L   QL+D  F
Sbjct: 242 NVSEPQRTKNLQAVIKVIMRALQYFKENPIEPEI-ITVKRFPKFLTKYSLFKNQLNDPYF 300

Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQV 340
           R+  L Q L+    F +         EL++ +++++++    V    KQ    G+   + 
Sbjct: 301 RKLFLNQVLLFI--FVA---------ELENQEKQFIQNLKIEVQ---KQLDELGDKLGEK 346

Query: 341 VKLILKGEEHWNQWKNEG-CPELKRPLTSITDEDKKDEPDAKKKKTPELTKLWNS----- 394
           V+ ++  E  W  W  E  C +      SI   +K     AKKK+     ++++      
Sbjct: 347 VQHLIDTERTWRLWVTEKQCFDDGYEKLSIKASEK-----AKKKRQEIQKRIYDDLKYKQ 401

Query: 395 --------KDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRAL 446
                   +D     K       P L  Y  E   Q+D    + +  +  N  +YAW+++
Sbjct: 402 TDAFQYILQDQGRPYKIETNPIEPGLNYYLNEVFYQLDQNRKIADNERCKNMKDYAWKSV 461

Query: 447 RLLSR 451
           R++S+
Sbjct: 462 RVISK 466


>gi|291001891|ref|XP_002683512.1| hypothetical protein NAEGRDRAFT_61424 [Naegleria gruberi]
 gi|284097141|gb|EFC50768.1| hypothetical protein NAEGRDRAFT_61424 [Naegleria gruberi]
          Length = 619

 Score = 74.3 bits (181), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 68/260 (26%), Positives = 120/260 (46%), Gaps = 54/260 (20%)

Query: 131 RRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEE 190
           +++S+ +++V CGRILLF+A   P++  + +N    FN E+ +     E+  ++ N  + 
Sbjct: 206 KKVSKIEDSVTCGRILLFVASVLPYNHDACINKGGHFNKEHHSLL---EQQILNDNNEDS 262

Query: 191 TDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQC-------YNKV--------SWKMFTSYA 235
           ++T++      DF  Y   W LQ +    ++        YN          SW +F+   
Sbjct: 263 SNTQD------DFEIY---WGLQVHIIQILELQSTLASFYNSYPRGNDECPSWSLFSREV 313

Query: 236 ETVLAAFKSYKLDDVQS---------SLNPSG----DYFA--KYLTNQKLLDLQLSDTNF 280
           + +L    +    ++Q+         S N       D F+  K L N+ L DLQL D  F
Sbjct: 314 DRILKCLDTRFTTNIQTMKQLKKNNQSTNHESALVQDLFSPIKLLKNRTLFDLQLDDVRF 373

Query: 281 RRYVLLQFLI-LFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQ 339
           RR VL+Q L+ +++      +      ELK+     +KD T  V   +++  P+      
Sbjct: 374 RRQVLIQLLVTIYRLLEKNSEPSFYLSELKT-----LKDKTLNV---VRKYDPES---VT 422

Query: 340 VVKLILKGEEHWNQWKNEGC 359
           +++ IL  E HW+ WK+ GC
Sbjct: 423 LLETILDRERHWSNWKSNGC 442


>gi|145515651|ref|XP_001443725.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411114|emb|CAK76328.1| unnamed protein product [Paramecium tetraurelia]
          Length = 557

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 85/362 (23%), Positives = 152/362 (41%), Gaps = 67/362 (18%)

Query: 122 LLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEM 181
           LLR+CN +L+RLS + +T   G++ +FL + F     S ++        NI + G DEE 
Sbjct: 140 LLRICNSMLKRLSTTHDTFLRGQVQIFLTQIF-----STIHPSLRKKPINIRDRGFDEE- 193

Query: 182 DVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCY-NKVSW------------ 228
            V  N +            + ++FYK FW+LQ Y  N    + NKV +            
Sbjct: 194 SVKVNAS------------LPYSFYKNFWTLQKYLNNSELIFKNKVEFDDIGEEAGQVTQ 241

Query: 229 --------KMFTSYAETVLAAFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNF 280
                   K   +  + ++ A + +K + ++  +  +   F K+LT   L   QL+D  F
Sbjct: 242 GVSEPQRTKNLQAVIKVIMRALQYFKENPLEPEI-ITVKRFPKFLTKYSLFKNQLNDPYF 300

Query: 281 RRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQV 340
           R+  L Q L+    F +         EL++ +++++++    V    KQ    G+   + 
Sbjct: 301 RKLFLNQVLLFI--FVA---------ELENHEKQFIQNLKIEVQ---KQLDELGDKLGEK 346

Query: 341 VKLILKGEEHWNQWKNEG-CPELKRPLTSITDEDKKDEP----------DAKKKKTPELT 389
           V+ ++  E  W  W  E  C +      SI   +K  +           D K K+T    
Sbjct: 347 VQHLIDTERTWRLWVTEKQCFDDGYEKLSIKASEKAKKKRQEIQKRIYDDLKYKQTDTFQ 406

Query: 390 KLWNSKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLL 449
           ++   +D     K       P L  Y  E   Q+D    + +  +  N  +YAW+++R++
Sbjct: 407 QIL--QDQGRPYKIETNPIEPGLNYYLNEVFYQLDQNRKIADNERCKNLKDYAWKSVRVI 464

Query: 450 SR 451
           S+
Sbjct: 465 SK 466


>gi|403377322|gb|EJY88653.1| THO complex subunit 1 [Oxytricha trifallax]
          Length = 766

 Score = 70.9 bits (172), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 53/210 (25%), Positives = 89/210 (42%), Gaps = 35/210 (16%)

Query: 49  KTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE----- 103
           K   +DI+ I  +    + L  +D+        LL D FD ++ D  E LF  +E     
Sbjct: 120 KKMQVDIQPITKFFEFVILLERQDLITKYEIFQLLVDLFDHTSEDSFEPLFGLMEHLLFQ 179

Query: 104 ---VNVNIWKQQTFFMSCKNNLL------------RMCNDLLRRLSRSQNTVFCGRILLF 148
              +   + +Q         N++            R+C  ++R+LS + +T F G++ + 
Sbjct: 180 ENLIKPRVQQQAEQGQKSDQNVIDRQQDINLNLISRICKSVIRKLSVTHDTGFRGKVQIL 239

Query: 149 LAKFFPFSERSGLNIISEFNVENITEFGGDEEM----DVSSNETE----------ETDTE 194
           +A  FP    SGLN + +FN +N T F   EE     D+  N  +          + + +
Sbjct: 240 IANVFPLMHGSGLNRLGKFNTQNETVFETLEETKKSGDIGQNSGKLSLQQTFSFGQANIQ 299

Query: 195 EVDKVKI-DFNFYKKFWSLQDYFRNPVQCY 223
                +I D+  YK FW+LQ Y  NP   +
Sbjct: 300 STKPQQIADYKTYKNFWNLQKYLTNPFNIF 329


>gi|298712962|emb|CBJ26864.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 896

 Score = 68.6 bits (166), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 64/225 (28%), Positives = 95/225 (42%), Gaps = 35/225 (15%)

Query: 260 YFA-KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKD 318
           YF  KYLT+ +LL LQL D   R  VL Q+LIL     + V +  +      D    +K 
Sbjct: 482 YFGLKYLTSSRLLRLQLRDPTLRLQVLTQWLILAASLKNKVANAKDSPNSVED----LKP 537

Query: 319 TTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDEDKKDEP 378
             E    L+K TPP GE +  +++++L  E  W QWK+    + K  +T+  +E+ + E 
Sbjct: 538 RIELAKRLVKATPPRGEEYLSMLQVVLNRETFWTQWKD--VNKCKLAMTAAKEEETQREK 595

Query: 379 D------------AKKKKTPELTKLWNSKDN---------LEACKSAERDFTPSLESYFE 417
           D              K+  P  +     +           LE   S  R   P+ + + E
Sbjct: 596 DKAAGVVARAPSYPNKRPRPSASVAAGERAKYSPLGAGFPLERMVSKLRAGVPTSQEHLE 655

Query: 418 EAIQQM---DPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLN 459
           +  Q+M   DP     E         Y WRALRLL R+ P   + 
Sbjct: 656 D--QRMCAEDPDTPASELPSA--SPLYTWRALRLLGRENPANLVG 696



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/199 (23%), Positives = 81/199 (40%), Gaps = 58/199 (29%)

Query: 101 YVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSG 160
           +VE N     +     + +  L+R+CN L +R+S++ ++  CG++ +FL K   F ERS 
Sbjct: 198 WVEPNSERLSRLVPLNNGRYPLIRLCNKLAQRMSKTHDSSLCGQMHMFLGKALDFFERSA 257

Query: 161 LNIISEFNVENITEF-----------GGDEEMDVSSNETEETDTE------------EVD 197
           +N+    N++N+T F           G D E + SS                     EVD
Sbjct: 258 VNVSGSINLDNVTRFEDEEQFRRAMMGFDGEDEASSGTGAAVAAAPAAKGGGDAMEVEVD 317

Query: 198 ----------------KVK---------IDFNFYKKFWSLQDYFRNPVQC-------YNK 225
                           K+K         + +  Y+ FW LQ+Y   P+Q          K
Sbjct: 318 APAGAAAAAAAGGGATKLKEGANPQEFAVHYPLYRSFWRLQEY---PLQQDKVVKGEETK 374

Query: 226 VSWKMFTSYAETVLAAFKS 244
            +W+   +  + +L AF++
Sbjct: 375 ATWEALLADVDKILGAFEA 393


>gi|238565549|ref|XP_002385882.1| hypothetical protein MPER_16096 [Moniliophthora perniciosa FA553]
 gi|215436172|gb|EEB86812.1| hypothetical protein MPER_16096 [Moniliophthora perniciosa FA553]
          Length = 161

 Score = 67.8 bits (164), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 43/119 (36%), Positives = 59/119 (49%), Gaps = 6/119 (5%)

Query: 75  NSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN-NLLRMCNDLLRRL 133
           + + P  +L D  +  T+  C  +F ++E   +   Q       K   LLR  NDLLRRL
Sbjct: 26  DPSFPFQILQDLLETQTISSCSHIFTWIESRKDRLTQGMIPQKGKALILLRTLNDLLRRL 85

Query: 134 SRSQN--TVFCGRILLFLAKFFPFSERSGLNIISEFNV---ENITEFGGDEEMDVSSNE 187
           S+     T FCGRIL FL+  FP  ERSG+N+  E+         E+ G EEM V   +
Sbjct: 86  SKMGMGMTGFCGRILCFLSGVFPLGERSGVNLRGEYGPVWEGAHIEYEGGEEMQVDEEK 144


>gi|323448292|gb|EGB04192.1| hypothetical protein AURANDRAFT_67353 [Aureococcus anophagefferens]
          Length = 631

 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 60/252 (23%), Positives = 95/252 (37%), Gaps = 65/252 (25%)

Query: 262 AKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTS-TVKSRGEGLELKS-DQEKW---- 315
           AKYLTN +LL +QL D   R  ++ Q  ++F Y    ++      L  KS  +  W    
Sbjct: 295 AKYLTNSRLLRIQLRDPVLRMQLVTQLYLVFYYLLRESIIGDQPSLGTKSTSKPDWASSA 354

Query: 316 ---VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITD- 371
              V      V  L+  TPP+G    + +  ++K E +W+ WK  GC  L+ P++ + + 
Sbjct: 355 MSRVNHVLARVRKLVVSTPPNGPERLESLDELIKREANWSTWKKNGC-ALQSPISRLQNF 413

Query: 372 ------------------EDKKDEPDAK-----------KKKTPELTKLWNSK------- 395
                               +   P A            +  TP++TK    +       
Sbjct: 414 VFIYLYLCLCLCLYLYLYLYRAASPGAANFIFHSRQEDYRTDTPKMTKRSGERYTPPTYA 473

Query: 396 ------------------DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVN 437
                               +E C     D  PS+E +        DP  ++E++Y   N
Sbjct: 474 VLVKRAKLKADADSIKNEGTIENCCVKLADAVPSIEEFLRAYEVAQDPENSIEDEYHPKN 533

Query: 438 DSNYAWRALRLL 449
           D  Y WRALRL+
Sbjct: 534 DKVYCWRALRLV 545


>gi|340500147|gb|EGR27045.1| Tho complex protein, putative [Ichthyophthirius multifiliis]
          Length = 411

 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 57/263 (21%), Positives = 116/263 (44%), Gaps = 41/263 (15%)

Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMD 182
           L++CN + ++LS++Q+    G++ +FL+K  P    SGL   S+   E  T+   ++E  
Sbjct: 139 LKICNGVFKKLSKTQDIQLRGKVHIFLSKILPLCHASGLK--SKICTEISTQIENEKEAL 196

Query: 183 VSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKM------------ 230
           +      + + ++ +   ++++FY KFWSLQ Y +NP+      + +M            
Sbjct: 197 LQMENNMDIEEQKGNFPFLNYSFYSKFWSLQKYLQNPLNLSKSNNGEMEIEGINDEDVEE 256

Query: 231 ---------------------FTSYAETVLAAFKSYKLDDVQSSLNPSGDYFAKYLTNQK 269
                                  +  E V    ++  L+  ++S+ P    + KYLT   
Sbjct: 257 GEEKPDQEGIKTLESKKIKGLCAAICEVVEIFNQNPILEKGETSIKP----YIKYLTAYS 312

Query: 270 LLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRG-EGLELKSDQEKWVKDTTETVYSLIK 328
           L+ +Q +D  FR+  L+Q L+ F    +  K    + +E + +  K  +D  +  + L+ 
Sbjct: 313 LMKIQFNDQFFRKIWLIQALLFFYTLQNLNKEYDIKNIENEKNTIKNAEDKVKKAFKLLH 372

Query: 329 QTPPDGEHFSQVVKLILKGEEHW 351
           Q   +   + Q  +  +  E++W
Sbjct: 373 QNLKEDISYEQFDQFFM-DEKNW 394


>gi|118362386|ref|XP_001014420.1| hypothetical protein TTHERM_00522420 [Tetrahymena thermophila]
 gi|89296187|gb|EAR94175.1| hypothetical protein TTHERM_00522420 [Tetrahymena thermophila
           SB210]
          Length = 1224

 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 59/101 (58%), Gaps = 4/101 (3%)

Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLN--IISEF--NVENITEFGGD 178
           L++CN + ++L+R+ +T   G + +FL+K  P    SGL   I +E    +EN ++   +
Sbjct: 146 LKICNVIFKKLTRTYDTRLRGELHIFLSKILPICHASGLKSKICTEITAQLENESDIAKN 205

Query: 179 EEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNP 219
           + +  +  E EE  ++  +    + +FYKKFW+LQ YF+NP
Sbjct: 206 QNIQGNQMEIEEDKSQPDNITYSNKDFYKKFWTLQKYFQNP 246


>gi|328908893|gb|AEB61114.1| THO complex subunit 1-like protein, partial [Equus caballus]
          Length = 78

 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/52 (50%), Positives = 37/52 (71%)

Query: 275 LSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSL 326
           LSD+NFRR++LLQ+LILFQY    VK +     L  +Q  W++DTT++VY +
Sbjct: 1   LSDSNFRRHILLQYLILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQV 52


>gi|443925429|gb|ELU44267.1| efThoc1 domain-containing protein [Rhizoctonia solani AG-1 IA]
          Length = 854

 Score = 55.8 bits (133), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 51/209 (24%), Positives = 78/209 (37%), Gaps = 46/209 (22%)

Query: 62  ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVE-VNVNIWKQQTFFMSCKN 120
           + + +E    + CN    + +L +  D  T+D C  +F +VE  +  + K          
Sbjct: 86  LDVALEYTELEACNWQFVLEVLEEVLDGHTIDSCVTIFGWVEDRSSRLLKGMVPSKGKAL 145

Query: 121 NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV----------- 169
            +LR  N L  R +R   T F GRIL F +  F  +ERSG+N+  E              
Sbjct: 146 VVLRTLNALKARAAR---TGFAGRILAFQSGIFDITERSGVNLQGELGSTWNGVAIPPPV 202

Query: 170 ---ENITEFG--------------------GDEEMDVSSNETEETDTE--------EVDK 198
                 TE G                    GDE+    +      D++        +VD 
Sbjct: 203 GEKAEATESGVKLEGSDAMEGVESTDQSAAGDEDKKSVAENKGSADSKPASRAADAKVDP 262

Query: 199 VKIDFNFYKKFWSLQDYFRNPVQCYNKVS 227
            +   N Y  FW LQ +F NP+     +S
Sbjct: 263 AEQRRNLYNTFWQLQLFFANPLVFAEGIS 291



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 19/41 (46%), Positives = 32/41 (78%)

Query: 260 YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
           +FAK+LT+  LLDL+L+DTNFRR +L Q +I+ ++ +  ++
Sbjct: 396 FFAKFLTSPDLLDLELADTNFRRQILFQLMIMLKHLSLAIQ 436


>gi|387198729|gb|AFJ68866.1| THO complex subunit 1, partial [Nannochloropsis gaditana CCMP526]
          Length = 244

 Score = 55.5 bits (132), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 67/124 (54%), Gaps = 15/124 (12%)

Query: 82  LLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT-----FFMSCKNNL--LRMCNDLLRRLS 134
           LL D F++  +++ +  +  VE      K+++     F  + K+NL  L++CN LLR+LS
Sbjct: 124 LLEDIFELLGVEQIQNFWPEVE------KRESALFGLFKTARKSNLTMLKLCNSLLRKLS 177

Query: 135 RSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTE 194
           +  NT  CGRI+++L++ +  S+ S +N     N+ N T F  +E  D +  +  E +  
Sbjct: 178 KGHNTAICGRIMMYLSRNWELSDPSAINKGGMANLSNKTAF--EEPEDFARQQAREAEKT 235

Query: 195 EVDK 198
             D+
Sbjct: 236 RRDE 239


>gi|412985942|emb|CCO17142.1| predicted protein [Bathycoccus prasinos]
          Length = 1123

 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 68/157 (43%), Gaps = 23/157 (14%)

Query: 316 VKDTTETVYSLIKQT---PPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSITDE 372
            K+  E V  ++++T   P  G+     +   +   EH++ W + GCP  +R    +  E
Sbjct: 740 AKECFEKVLDVVEKTDASPEQGKKLRIFLVATMYSTEHYDTWVDMGCPSFEREEIDVVAE 799

Query: 373 D----------------KKDEPDAKKKKTPELTKLWNSKDNLEACKSAERDFTPSLESYF 416
                            K  E   + +  PE+ ++WN   + EA  +       S   + 
Sbjct: 800 QKEYERKEKEKEDAENAKYPEGYVRFENNPEMERIWNLYPSYEAAMTDPERNRYSKPEFM 859

Query: 417 EEAIQ----QMDPAAAVEEQYKKVNDSNYAWRALRLL 449
            E +Q    +MDP A +EE+YK  ND+ Y W+A RL+
Sbjct: 860 NEFLQVVYDEMDPEAQIEEEYKTKNDTMYRWKAFRLI 896


>gi|422294621|gb|EKU21921.1| THO complex subunit 1, partial [Nannochloropsis gaditana CCMP526]
          Length = 230

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 58/101 (57%), Gaps = 13/101 (12%)

Query: 82  LLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQT-----FFMSCKNNL--LRMCNDLLRRLS 134
           LL D F++  +++ +  +  VE      K+++     F  + K+NL  L++CN LLR+LS
Sbjct: 124 LLEDIFELLGVEQIQNFWPEVE------KRESALFGLFKTARKSNLTMLKLCNSLLRKLS 177

Query: 135 RSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEF 175
           +  NT  CGRI+++L++ +  S+ S +N     N+ N T F
Sbjct: 178 KGHNTAICGRIMMYLSRNWELSDPSAINKGGMANLSNKTAF 218


>gi|426385316|ref|XP_004059165.1| PREDICTED: THO complex subunit 1-like, partial [Gorilla gorilla
           gorilla]
          Length = 80

 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 42/78 (53%), Gaps = 14/78 (17%)

Query: 340 VVKLILKGEEHWNQWKNEGCPELKRPLTSITDED----KKDEPDAKKKKTP--------- 386
           +V+ IL  EE+WN WKNEGCP   +  TS T       K+  P+    K P         
Sbjct: 1   MVEHILNTEENWNSWKNEGCPSFVKERTSDTKPTRIIRKRTAPEDFLGKGPSKKILMGNE 60

Query: 387 ELTKLWN-SKDNLEACKS 403
           ELT+LWN   DN+EACKS
Sbjct: 61  ELTRLWNLCPDNMEACKS 78


>gi|159482536|ref|XP_001699325.1| hypothetical protein CHLREDRAFT_178080 [Chlamydomonas reinhardtii]
 gi|158272961|gb|EDO98755.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 1302

 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/48 (41%), Positives = 34/48 (70%)

Query: 117 SCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNII 164
             K  LLRM N L RRLS++ +   CGR+L+F+++ +  +++SGLN++
Sbjct: 165 GAKLALLRMSNMLFRRLSKAYDVRLCGRLLIFISRCYELNDKSGLNVL 212


>gi|341883433|gb|EGT39368.1| hypothetical protein CAEBREN_04262 [Caenorhabditis brenneri]
          Length = 90

 Score = 48.5 bits (114), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 46/81 (56%), Gaps = 7/81 (8%)

Query: 110 KQQTFFMSCKNNLLRMCNDLLRRLSRSQNTV---FCGRILLFLAKFFPF-SERSGLNIIS 165
           K+ +F  +C+NN+LR+CNDL RR S   ++    F   I    ++ F F  E+SG+N + 
Sbjct: 9   KKPSFIETCQNNILRLCNDLFRRFSSKCSSCPGDFHKEIYEKASELFRFLVEKSGVNFMG 68

Query: 166 EFNVENITEFGGDEEMDVSSN 186
            FN  NIT +   EE D + N
Sbjct: 69  HFNTRNITNY---EETDPALN 86


>gi|422294622|gb|EKU21922.1| tho complex subunit, partial [Nannochloropsis gaditana CCMP526]
          Length = 160

 Score = 45.1 bits (105), Expect = 0.11,   Method: Composition-based stats.
 Identities = 33/112 (29%), Positives = 49/112 (43%), Gaps = 9/112 (8%)

Query: 263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY-------FTSTVKSRGEGLELKSDQEKW 315
           KYLTN +L  LQL D   R  VL Q LI+  Y         ++  +        S  +K 
Sbjct: 15  KYLTNSRLFRLQLHDPLIRTQVLTQLLIVLNYASAATALAVASPPASSSPAPNPSPPKKG 74

Query: 316 VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN-EGC-PELKRP 365
             D  +    LI+ T   G  +  V+  +L  E+ W  WK  + C P +++P
Sbjct: 75  ANDLVKHARRLIRNTSAQGAVYLGVLDTVLHREQAWLSWKRLQRCEPRIEKP 126


>gi|387219973|gb|AFJ69695.1| tho complex subunit, partial [Nannochloropsis gaditana CCMP526]
          Length = 153

 Score = 45.1 bits (105), Expect = 0.11,   Method: Composition-based stats.
 Identities = 33/112 (29%), Positives = 49/112 (43%), Gaps = 9/112 (8%)

Query: 263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQY-------FTSTVKSRGEGLELKSDQEKW 315
           KYLTN +L  LQL D   R  VL Q LI+  Y         ++  +        S  +K 
Sbjct: 8   KYLTNSRLFRLQLHDPLIRTQVLTQLLIVLNYASAATALAVASPPASSSPAPNPSPPKKG 67

Query: 316 VKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKN-EGC-PELKRP 365
             D  +    LI+ T   G  +  V+  +L  E+ W  WK  + C P +++P
Sbjct: 68  ANDLVKHARRLIRNTSAQGAVYLGVLDTVLHREQAWLSWKRLQRCEPRIEKP 119


>gi|167518301|ref|XP_001743491.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778590|gb|EDQ92205.1| predicted protein [Monosiga brevicollis MX1]
          Length = 847

 Score = 40.0 bits (92), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 32/74 (43%)

Query: 158 RSGLNIISEFNVENITEFGGDEEMDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFR 217
           +SG+N+ S FNV N T +   +E            + E  +  +D   YK+FW LQ  F 
Sbjct: 8   KSGVNLNSSFNVGNATSYDESDEPIPHGASEAALASAEGGEQALDKKLYKQFWQLQAAFC 67

Query: 218 NPVQCYNKVSWKMF 231
           NP        W  F
Sbjct: 68  NPAILGAGSDWLQF 81


>gi|332291219|ref|YP_004429828.1| pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase
           [Krokinobacter sp. 4H-3-7-5]
 gi|332169305|gb|AEE18560.1| pyruvate dehydrogenase complex dihydrolipoamide acetyltransferase
           [Krokinobacter sp. 4H-3-7-5]
          Length = 562

 Score = 38.9 bits (89), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 50/113 (44%), Gaps = 8/113 (7%)

Query: 364 RPLTSITDEDKKDEPDAKK---KKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAI 420
           +P T+ T       P AKK   +K   L+++  S +N    KS   +FTPS        +
Sbjct: 269 KPATNTTGGRIFVSPLAKKIADEKGINLSQVKGSGENGRIVKSDVENFTPSASQSSGAGV 328

Query: 421 QQMDPAAAVEEQYKKVNDSNYAWRALRLLSRK---CPHFFLNATPNVEKNSEF 470
           QQ    A  EE ++++ +S       R L +     PH++LN   N+E    F
Sbjct: 329 QQF--VATGEESFEEIENSQMRKAIARGLGKSKFTAPHYYLNVEFNMENMMSF 379


>gi|441501414|ref|ZP_20983527.1| Dihydrolipoamide acetyltransferase component of pyruvate
           dehydrogenase complex [Fulvivirga imtechensis AK7]
 gi|441434811|gb|ELR68242.1| Dihydrolipoamide acetyltransferase component of pyruvate
           dehydrogenase complex [Fulvivirga imtechensis AK7]
          Length = 552

 Score = 38.5 bits (88), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 71/301 (23%), Positives = 118/301 (39%), Gaps = 59/301 (19%)

Query: 190 ETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDD 249
           E DTE VD   ++ +                     +  KM  +  E  +A++   K D 
Sbjct: 116 EEDTESVDASDVNASLI-------------------LMPKMSDTMTEGTIASWLKKKGDK 156

Query: 250 VQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQ--FLILFQYFTSTVKSRGEGLE 307
           VQ     SGD  A+  T++  ++L+  +     YV ++    +      + +  +G   E
Sbjct: 157 VQ-----SGDILAEVETDKATMELEAYEDGTLLYVGVEEGASVPVDGVIAIIGEKGADYE 211

Query: 308 --LKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP 365
             LK+ Q K    T E      K T P             K E+   Q      P   + 
Sbjct: 212 KLLKAHQGKKKAATGEDKKKEDKTTAPQ------------KAEKQEEQ------PAASQT 253

Query: 366 LTSITDEDK-KDEPDAKK---KKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAIQ 421
             S+TD  + K  P AKK    K  +++K+  + DN    K    ++TP+ ES  E+A +
Sbjct: 254 APSVTDGGRVKASPLAKKMAEDKGYDISKIRGTGDNGRIIKRDIEEYTPAAES-VEKAAE 312

Query: 422 QMD-----PAAAVEEQYKKVNDSNYAWRALRLLSRK---CPHFFLNATPNVEKNSEFIEN 473
           +       P    EE Y++V+ S       + LS      PHF++    N++K  E  ++
Sbjct: 313 EKGTTFHVPQVVGEESYEEVSVSQMRKTIGKRLSESKFTSPHFYITMEINMDKAIEARKS 372

Query: 474 M 474
           M
Sbjct: 373 M 373


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.316    0.132    0.390 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,926,752,692
Number of Sequences: 23463169
Number of extensions: 386626281
Number of successful extensions: 1708918
Number of sequences better than 100.0: 752
Number of HSP's better than 100.0 without gapping: 353
Number of HSP's successfully gapped in prelim test: 399
Number of HSP's that attempted gapping in prelim test: 1702755
Number of HSP's gapped (non-prelim): 3970
length of query: 555
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 407
effective length of database: 8,886,646,355
effective search space: 3616865066485
effective search space used: 3616865066485
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 80 (35.4 bits)