BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 013275
         (446 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255556003|ref|XP_002519036.1| expressed protein, putative [Ricinus communis]
 gi|223541699|gb|EEF43247.1| expressed protein, putative [Ricinus communis]
          Length = 434

 Score =  697 bits (1799), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/447 (76%), Positives = 384/447 (85%), Gaps = 14/447 (3%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           MS+TPGTHSLAFRVMRLCRPS HV+  L VDP+DL +GEDIFDDP+AAS LPPLI S +T
Sbjct: 1   MSTTPGTHSLAFRVMRLCRPSFHVDAQLLVDPSDLIVGEDIFDDPVAASRLPPLIDSHIT 60

Query: 61  T-NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
               +SDL+YR+RFL    +DS GL+GLLVLPQAFGAIYLGETFCSYISINNSS  EVRD
Sbjct: 61  KLTDTSDLSYRTRFLHQHPSDSFGLTGLLVLPQAFGAIYLGETFCSYISINNSSNFEVRD 120

Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
           V+IKAEIQT++QRILLLDTSK+PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG+G
Sbjct: 121 VIIKAEIQTERQRILLLDTSKNPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGDG 180

Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
           ERKYLPQFFKFIV+NPLSVRTKVRVVK       E T+LEACIENHTK+NLYMDQVEFEP
Sbjct: 181 ERKYLPQFFKFIVANPLSVRTKVRVVK-------ETTYLEACIENHTKTNLYMDQVEFEP 233

Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
           +Q+WSA ++K D   S+ ++ +REIFKPPVLIRSGGGIHNYLYQL++ +HG++       
Sbjct: 234 AQHWSAKIIKDDEKQSEKDSLTREIFKPPVLIRSGGGIHNYLYQLRLSAHGAAQ------ 287

Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
           SNVLGKLQITWRTNLGEPGRLQTQQILGT IT KEIEL + +VP+V+ +DKPF + LKLT
Sbjct: 288 SNVLGKLQITWRTNLGEPGRLQTQQILGTPITRKEIELCIAKVPAVINLDKPFSVHLKLT 347

Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
           N TDKE GPFE+WLSQ+ S EEK V INGL+ M L+ +EAFG+TDFHLNLIATKLGVQRI
Sbjct: 348 NHTDKELGPFEVWLSQDGSVEEKAVTINGLQTMELSQLEAFGTTDFHLNLIATKLGVQRI 407

Query: 420 TGITVFDKLEKITYDSLPDLEIFVDQD 446
           TGITVFDK EK TYD LPDLEIFV  D
Sbjct: 408 TGITVFDKSEKKTYDPLPDLEIFVAID 434


>gi|225470348|ref|XP_002269604.1| PREDICTED: UPF0533 protein C5orf44 [Vitis vinifera]
 gi|296090651|emb|CBI41051.3| unnamed protein product [Vitis vinifera]
          Length = 438

 Score =  692 bits (1787), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/444 (76%), Positives = 385/444 (86%), Gaps = 8/444 (1%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           MSS   +HSLAFRVMRLCRPS HV+ PLR+DP DL  GEDIFDDP+AAS+LP L+ +   
Sbjct: 1   MSSGQTSHSLAFRVMRLCRPSFHVDNPLRLDPADLLAGEDIFDDPLAASDLPRLLHNHTL 60

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
            +  SDLTYR+RFLL+D +D++GLSGLLVLPQ+FGAIYLGETFCSYISINNSS  EVRDV
Sbjct: 61  KSNDSDLTYRTRFLLNDPSDAMGLSGLLVLPQSFGAIYLGETFCSYISINNSSNFEVRDV 120

Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
           VIKAEIQT+KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVC+ALY+DG+GE
Sbjct: 121 VIKAEIQTEKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCSALYNDGDGE 180

Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
           RKYLPQFFKF+V+NPLSV+TKVR+VK       + TFLEACIENHTKSNLYMDQVEFEPS
Sbjct: 181 RKYLPQFFKFVVANPLSVKTKVRIVK-------DNTFLEACIENHTKSNLYMDQVEFEPS 233

Query: 241 QNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS 300
           Q+W+AT+LKA    SD ++ +REIFK P+LIRSGGGI NYLYQLK+ S GS+  +KV GS
Sbjct: 234 QHWTATVLKAGEGLSDNDSPTREIFKQPILIRSGGGIQNYLYQLKLSSQGSAQ-MKVDGS 292

Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
           NVLGKLQITWRTNLGEPGRLQTQQILG+ IT KEIEL V+EVPSV  +++PFL+ L LTN
Sbjct: 293 NVLGKLQITWRTNLGEPGRLQTQQILGSPITRKEIELQVMEVPSVTILERPFLVHLNLTN 352

Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT 420
           QTD+  GPFE+WLSQ+DS EE+VVM+NGLR MAL  VEAF STDF LNLIATKLGVQ+IT
Sbjct: 353 QTDRTMGPFEVWLSQSDSREEQVVMVNGLRAMALPQVEAFCSTDFRLNLIATKLGVQKIT 412

Query: 421 GITVFDKLEKITYDSLPDLEIFVD 444
           GITVFD  EK TY+ LPDLEIFVD
Sbjct: 413 GITVFDIREKRTYEPLPDLEIFVD 436


>gi|356548745|ref|XP_003542760.1| PREDICTED: UPF0533 protein C5orf44 homolog [Glycine max]
          Length = 440

 Score =  670 bits (1729), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/440 (74%), Positives = 375/440 (85%), Gaps = 11/440 (2%)

Query: 7   THSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
           +HSLAFRVMRLCRPS +VEPPLR+DPTDLF+GED+FDDP A    P   SS    +  SD
Sbjct: 12  SHSLAFRVMRLCRPSFNVEPPLRLDPTDLFVGEDLFDDPAAK---PHSFSSAAAHDDDSD 68

Query: 67  LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEI 126
             YR+RFLL   +D++GLSGLLVLPQ+FGAIYLGETFCSYISINNSS  EVR+V+IKAEI
Sbjct: 69  PNYRNRFLLRHFSDAMGLSGLLVLPQSFGAIYLGETFCSYISINNSSNFEVREVLIKAEI 128

Query: 127 QTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQ 186
           QT++QRILLLDTSKSPVE+IRAGGRYDFIVEHDVKELG HTLVCTALY+DG+GERKYLPQ
Sbjct: 129 QTERQRILLLDTSKSPVETIRAGGRYDFIVEHDVKELGPHTLVCTALYNDGDGERKYLPQ 188

Query: 187 FFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
           FFKFIV+NPLSVRTKVRV+K       E TFLEACIENHTKSNL+MDQV+FEP+Q +SAT
Sbjct: 189 FFKFIVANPLSVRTKVRVIK-------ETTFLEACIENHTKSNLFMDQVDFEPAQYYSAT 241

Query: 247 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 306
           +LK DG HS+ ++ +REIFKPP+LIRSGGGI+NYLYQLK LS GS    KV+GSNVLGKL
Sbjct: 242 ILKGDGHHSEKDSPTREIFKPPILIRSGGGIYNYLYQLKTLSDGSPQ-TKVEGSNVLGKL 300

Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 366
           QITWRTNLGEPGRLQTQQILGT  T KEIEL VVEVPS++ + KPF+LKL LTNQTD+E 
Sbjct: 301 QITWRTNLGEPGRLQTQQILGTPATKKEIELQVVEVPSIINLQKPFMLKLNLTNQTDREL 360

Query: 367 GPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFD 426
           GPFE+ LSQN S  E+VVMINGL+ M L+ V+A GST+FHLNLIATK G+QRITGITVFD
Sbjct: 361 GPFEVGLSQNVSYGERVVMINGLQSMVLSEVQALGSTNFHLNLIATKPGIQRITGITVFD 420

Query: 427 KLEKITYDSLPDLEIFVDQD 446
             E  +Y+ LPDLEIFVD D
Sbjct: 421 TREMKSYEPLPDLEIFVDMD 440


>gi|449457717|ref|XP_004146594.1| PREDICTED: UPF0533 protein C5orf44-like [Cucumis sativus]
          Length = 440

 Score =  663 bits (1710), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 317/445 (71%), Positives = 384/445 (86%), Gaps = 8/445 (1%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           MS+  G+HSLAFRVMRLCRPS  V+PPLR+DP DL +GEDI DDP+AA+ LP L++  ++
Sbjct: 1   MSNAQGSHSLAFRVMRLCRPSFQVDPPLRLDPVDLLVGEDILDDPVAANQLPRLLAPQLS 60

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
            +  SDL+Y SRFLLHDS+D++GL+GLLVLPQAFGAIYLGETFCSYIS+NNSS  EVRDV
Sbjct: 61  DDSDSDLSYSSRFLLHDSSDAMGLNGLLVLPQAFGAIYLGETFCSYISVNNSSNFEVRDV 120

Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
           +IKAEIQT++QRILLLD+SKSPVE+IRAGGRYDFIVEHDVKELGAHTLVCTALY+DG+GE
Sbjct: 121 IIKAEIQTERQRILLLDSSKSPVETIRAGGRYDFIVEHDVKELGAHTLVCTALYNDGDGE 180

Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
           RKYLPQFFKF+V+NPLSVRTKVRVVK       + TFLEACIENHTKSNL+MDQV+FEPS
Sbjct: 181 RKYLPQFFKFMVANPLSVRTKVRVVK-------DSTFLEACIENHTKSNLFMDQVDFEPS 233

Query: 241 QNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS 300
            NW+A ++ AD  HS++ + +RE+FKPPVL+RSGGGIHN+LYQLK  ++G SSP+KV+GS
Sbjct: 234 PNWNAVIINADEHHSEHKSTTREVFKPPVLVRSGGGIHNFLYQLKCSTNGPSSPLKVEGS 293

Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
           N+LGKLQITWRTN+GEPGRLQTQQILG+ IT KE+ELNVVE+P V+ +++PF L ++LT 
Sbjct: 294 NILGKLQITWRTNMGEPGRLQTQQILGSPITRKELELNVVEMPDVIRLERPFTLHMRLTT 353

Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT 420
           Q ++E GPFE+W+S N SDE+KVVM+NGL+ + +  VE +GSTDFHLNLIATK GVQRI 
Sbjct: 354 QIERELGPFEVWMSLNSSDEDKVVMVNGLQKVVIPRVEPYGSTDFHLNLIATKPGVQRIA 413

Query: 421 GITVFDKLEKITYDS-LPDLEIFVD 444
           GI VFD  EK  Y+   PDLEI+VD
Sbjct: 414 GIKVFDTREKKAYEHPSPDLEIYVD 438


>gi|224079249|ref|XP_002305809.1| predicted protein [Populus trichocarpa]
 gi|222848773|gb|EEE86320.1| predicted protein [Populus trichocarpa]
          Length = 450

 Score =  659 bits (1700), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/442 (73%), Positives = 374/442 (84%), Gaps = 12/442 (2%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           MS+ P T SLAFRVMRLCRPS HV+ PL +DP+DL +GEDIFDDP+AA++LPPLI + +T
Sbjct: 1   MSTPPATQSLAFRVMRLCRPSFHVDTPLLLDPSDLILGEDIFDDPLAATHLPPLIDTHLT 60

Query: 61  TN-KSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
               SSDL+YRSRFLL + +DS GLSGLLVLPQ+FGAIYLGETFCSY+SINNSS  EVRD
Sbjct: 61  NPIDSSDLSYRSRFLLQNPSDSFGLSGLLVLPQSFGAIYLGETFCSYVSINNSSNFEVRD 120

Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
           +VIKAE+QT++QRILLLDTSK+PVESIRA GRYDFIVEHDVKELGAHTLVCTALY+DG+G
Sbjct: 121 IVIKAEMQTERQRILLLDTSKTPVESIRASGRYDFIVEHDVKELGAHTLVCTALYTDGDG 180

Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
           ERKYLPQFFKFIV+NPLSVRTKV ++ V     QE T+LEACIENHTK+NLYMDQVEFEP
Sbjct: 181 ERKYLPQFFKFIVANPLSVRTKVLLLLVS----QETTYLEACIENHTKTNLYMDQVEFEP 236

Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
           + NWSA +LKAD   S  N+ SR     P L++SGGGI NYLYQL + SHGS+       
Sbjct: 237 APNWSAKILKADEHKSKDNSPSR-CGNIPFLVKSGGGIRNYLYQLSLSSHGSAE------ 289

Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
           SNVLGKLQITWRTNLGEPGRLQTQQILGT IT KEIEL+V EVPS + +D+PFL+ L LT
Sbjct: 290 SNVLGKLQITWRTNLGEPGRLQTQQILGTPITPKEIELHVAEVPSAINLDRPFLVHLNLT 349

Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
           NQTD+E GPFE+WLSQ+D+ +EK VMINGL+ M L+ +EAFGSTDF+LNLIATKLGVQ+I
Sbjct: 350 NQTDRELGPFEVWLSQDDTLDEKTVMINGLQTMELSQLEAFGSTDFYLNLIATKLGVQKI 409

Query: 420 TGITVFDKLEKITYDSLPDLEI 441
           TGITVFDK EK TY  LPDLE+
Sbjct: 410 TGITVFDKSEKKTYAPLPDLEV 431


>gi|356521339|ref|XP_003529314.1| PREDICTED: UPF0533 protein C5orf44-like [Glycine max]
          Length = 435

 Score =  649 bits (1673), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 322/440 (73%), Positives = 369/440 (83%), Gaps = 15/440 (3%)

Query: 7   THSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
           +HSLAFRVMRLCRPS +VEPPLR+DP DLF GED+FDDP A    PP  SS   ++ +  
Sbjct: 11  SHSLAFRVMRLCRPSFNVEPPLRLDPADLFAGEDLFDDPAAN---PPSFSSSDDSDSN-- 65

Query: 67  LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEI 126
             YR+RFLL   +D++GLSGLLVLPQ+FGAIYLGETFCSYISINNSS  EVRDV+IKAEI
Sbjct: 66  --YRNRFLLRHFSDAMGLSGLLVLPQSFGAIYLGETFCSYISINNSSNFEVRDVIIKAEI 123

Query: 127 QTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQ 186
           QT++ RILLLDTSKSPVE+IRAGGRYDFIVEHDVKELG HTLVCTALY+DG+GERKYLPQ
Sbjct: 124 QTERLRILLLDTSKSPVETIRAGGRYDFIVEHDVKELGPHTLVCTALYNDGDGERKYLPQ 183

Query: 187 FFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
           FFKFIV+NPLSVRTKVRV+K       E TFLEACIENHTKSNL+MDQV+FEP+Q +SA+
Sbjct: 184 FFKFIVANPLSVRTKVRVIK-------ETTFLEACIENHTKSNLFMDQVDFEPAQYYSAS 236

Query: 247 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 306
           +LK DG HS+ ++ +RE FKPP+LIRSGGGI+NYLYQLK  S G     KV+GSNVLGKL
Sbjct: 237 ILKGDGHHSEKDSPTRETFKPPILIRSGGGIYNYLYQLKTSSDGLPQ-TKVEGSNVLGKL 295

Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 366
           QITWRTNLGEPGRLQTQQILGTT T KEIEL VVEVPS++ +  PF+LKL LTNQTD+E 
Sbjct: 296 QITWRTNLGEPGRLQTQQILGTTATKKEIELQVVEVPSIINLQNPFMLKLNLTNQTDREL 355

Query: 367 GPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFD 426
           GPFE+ LSQN S  E+ VMINGL+ M L+ V+A GST+FHLNLIATK G+QRITGITVFD
Sbjct: 356 GPFEVSLSQNVSYGERAVMINGLQSMVLSEVQALGSTNFHLNLIATKPGIQRITGITVFD 415

Query: 427 KLEKITYDSLPDLEIFVDQD 446
             E  +Y+ LPDLEIFVD D
Sbjct: 416 TREMKSYEPLPDLEIFVDMD 435


>gi|388496064|gb|AFK36098.1| unknown [Medicago truncatula]
          Length = 437

 Score =  638 bits (1645), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 314/439 (71%), Positives = 366/439 (83%), Gaps = 15/439 (3%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS+AFRVMRLCRPS +V+PPLR+DP DLF+GED FDDP A S      SSD+     SD 
Sbjct: 14  HSVAFRVMRLCRPSFNVDPPLRIDPDDLFVGEDHFDDPSAPS------SSDLIA-PDSDP 66

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
            YR+RFLL   +DS+GLSGLLVLPQ+FGAIYLGETFCSYISINNSS  EVR+V+IKAEIQ
Sbjct: 67  NYRNRFLLQHFSDSMGLSGLLVLPQSFGAIYLGETFCSYISINNSSNFEVREVIIKAEIQ 126

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T++QRILLLDTSKSPVE+IRAGGRYDFIVEHDVKELG HTLVCTALY+DG+GERKYLPQF
Sbjct: 127 TERQRILLLDTSKSPVETIRAGGRYDFIVEHDVKELGPHTLVCTALYNDGDGERKYLPQF 186

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKFIV+NPLSVRTKVRV+K       E TFLEACIENHTKSNL+MDQV+FEP+Q++SAT+
Sbjct: 187 FKFIVANPLSVRTKVRVIK-------ETTFLEACIENHTKSNLFMDQVDFEPAQHYSATI 239

Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
           L+ DGPH++ +  +RE FKPP+LIRSGGGI+NYLYQLK  S   S+  KV+G+NVLGKLQ
Sbjct: 240 LRGDGPHTEKDNTARETFKPPILIRSGGGIYNYLYQLKS-SLDDSAQTKVEGNNVLGKLQ 298

Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
           ITWRTNLGEPGRLQTQQILGT  T KEIEL VVEVPS++ + +PF LKL LTN T++E G
Sbjct: 299 ITWRTNLGEPGRLQTQQILGTPTTKKEIELQVVEVPSIINLQRPFTLKLNLTNLTERELG 358

Query: 368 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 427
           PF++ +SQN S  E  VMINGL+ M L+ +EA GST+ HLNLIATK G+Q+ITGITVFD 
Sbjct: 359 PFKVSVSQNGSSGETAVMINGLQSMVLSQIEALGSTNIHLNLIATKPGIQKITGITVFDT 418

Query: 428 LEKITYDSLPDLEIFVDQD 446
               +Y+ LPDLEIFVD D
Sbjct: 419 RGMKSYEPLPDLEIFVDID 437


>gi|358346667|ref|XP_003637387.1| hypothetical protein MTR_084s0010 [Medicago truncatula]
 gi|355503322|gb|AES84525.1| hypothetical protein MTR_084s0010 [Medicago truncatula]
          Length = 446

 Score =  629 bits (1623), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 314/448 (70%), Positives = 366/448 (81%), Gaps = 24/448 (5%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS+AFRVMRLCRPS +V+PPLR+DP DLF+GED FDDP A S      SSD+     SD 
Sbjct: 14  HSVAFRVMRLCRPSFNVDPPLRIDPDDLFVGEDHFDDPSAPS------SSDLIA-PDSDP 66

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
            YR+RFLL   +DS+GLSGLLVLPQ+FGAIYLGETFCSYISINNSS  EVR+V+IKAEIQ
Sbjct: 67  NYRNRFLLQHFSDSMGLSGLLVLPQSFGAIYLGETFCSYISINNSSNFEVREVIIKAEIQ 126

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T++QRILLLDTSKSPVE+IRAGGRYDFIVEHDVKELG HTLVCTALY+DG+GERKYLPQF
Sbjct: 127 TERQRILLLDTSKSPVETIRAGGRYDFIVEHDVKELGPHTLVCTALYNDGDGERKYLPQF 186

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKFIV+NPLSVRTKVRV+K       E TFLEACIENHTKSNL+MDQV+FEP+Q++SAT+
Sbjct: 187 FKFIVANPLSVRTKVRVIK-------ETTFLEACIENHTKSNLFMDQVDFEPAQHYSATI 239

Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
           L+ DGPH++ +  +RE FKPP+LIRSGGGI+NYLYQLK  S   S+  KV+G+NVLGKLQ
Sbjct: 240 LRGDGPHTEKDNTARETFKPPILIRSGGGIYNYLYQLKS-SLDDSAQTKVEGNNVLGKLQ 298

Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
           ITWRTNLGEPGRLQTQQILGT  T KEIEL VVEVPS++ + +PF LKL LTN T++E G
Sbjct: 299 ITWRTNLGEPGRLQTQQILGTPTTKKEIELQVVEVPSIINLQRPFTLKLNLTNLTERELG 358

Query: 368 PFEIWLSQNDSDEEKVVMINGLRIM---------ALAPVEAFGSTDFHLNLIATKLGVQR 418
           PF++ +SQN S  E  VMINGL+ M          L+ +EA GST+ HLNLIATK G+Q+
Sbjct: 359 PFKVSVSQNGSSGETAVMINGLQSMVMHSLWIISVLSQIEALGSTNIHLNLIATKPGIQK 418

Query: 419 ITGITVFDKLEKITYDSLPDLEIFVDQD 446
           ITGITVFD     +Y+ LPDLEIFVD D
Sbjct: 419 ITGITVFDTRGMKSYEPLPDLEIFVDID 446


>gi|18407493|ref|NP_566117.1| uncharacterized protein [Arabidopsis thaliana]
 gi|16226796|gb|AAL16264.1|AF428334_1 At2g47960/T9J23.10 [Arabidopsis thaliana]
 gi|18377797|gb|AAL67048.1| unknown protein [Arabidopsis thaliana]
 gi|20197311|gb|AAC63650.2| expressed protein [Arabidopsis thaliana]
 gi|20197565|gb|AAM15133.1| expressed protein [Arabidopsis thaliana]
 gi|21281259|gb|AAM45021.1| unknown protein [Arabidopsis thaliana]
 gi|330255823|gb|AEC10917.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 442

 Score =  598 bits (1543), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 299/447 (66%), Positives = 353/447 (78%), Gaps = 12/447 (2%)

Query: 2   SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTT 61
           + T G HSLAFRVMRLC+PS HV+PPLR+DP DL  GED  DDP +AS     +SS    
Sbjct: 6   TQTHGPHSLAFRVMRLCKPSFHVDPPLRIDPFDLLAGEDFSDDPSSASLFRRHVSSADAV 65

Query: 62  NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
           +  SDL+YR+RFLL+   D IGLSGLL+LPQ+FGAIYLGETFCSYIS+NNSST EVRDV 
Sbjct: 66  D--SDLSYRNRFLLNHPTDPIGLSGLLLLPQSFGAIYLGETFCSYISVNNSSTSEVRDVT 123

Query: 122 IKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
           IKAEIQT++QRILLLDTSKSPVESIR GGRYDFIVEHDVKELGAHTLVC+ALY+D +GER
Sbjct: 124 IKAEIQTERQRILLLDTSKSPVESIRTGGRYDFIVEHDVKELGAHTLVCSALYNDADGER 183

Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
           KYLPQFFKF+V+NPLSVRTKVRVVK       E TFLEACIENHTK+NL+MDQV+FEP++
Sbjct: 184 KYLPQFFKFVVANPLSVRTKVRVVK-------ETTFLEACIENHTKANLFMDQVDFEPAK 236

Query: 242 NWSATMLKADGPHSD--YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
            WSA  L+ +    D   +  S  I KPPV+IRSGGGIHNYLY+L   S   S   K QG
Sbjct: 237 QWSAVRLQNEDSTEDPPTSGLSGLIPKPPVIIRSGGGIHNYLYKLNP-SADVSGQTKFQG 295

Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
           SN+LGK QITWRTNLGEPGRLQTQQILG  ++ KEI + VVEVP+V+ +++PF   L LT
Sbjct: 296 SNILGKFQITWRTNLGEPGRLQTQQILGAPVSRKEINMRVVEVPAVIHLNRPFRAYLNLT 355

Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
           NQTD++ GPFE+ LSQ+++  EK V INGL+ + L  +EAFGS DF LNLIA+KLGVQ+I
Sbjct: 356 NQTDRQLGPFEVSLSQDETQLEKPVGINGLQTLMLPRIEAFGSNDFQLNLIASKLGVQKI 415

Query: 420 TGITVFDKLEKITYDSLPDLEIFVDQD 446
            GIT  D  EK TY+ +PD+EIFV+ D
Sbjct: 416 AGITALDTREKKTYELVPDMEIFVETD 442


>gi|297824907|ref|XP_002880336.1| hypothetical protein ARALYDRAFT_483987 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326175|gb|EFH56595.1| hypothetical protein ARALYDRAFT_483987 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 443

 Score =  594 bits (1531), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 297/445 (66%), Positives = 352/445 (79%), Gaps = 12/445 (2%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           MS+T G HSLAFRVMRLC+PS HV+PPLR+DP DL  GED  DDP +AS     +SS   
Sbjct: 1   MSATHGPHSLAFRVMRLCKPSFHVDPPLRIDPFDLLAGEDFSDDPSSASLFRRHVSSADA 60

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
            +  SDL+YR+RFLL+   D IGLSGLL+LPQ+FGAIYLGETFCSYIS+NNSST EVRDV
Sbjct: 61  VD--SDLSYRNRFLLNHPTDPIGLSGLLLLPQSFGAIYLGETFCSYISVNNSSTSEVRDV 118

Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
            IKAEIQT++QRILLLDTSKSPVESIR GGRYDFIVEHDVKELGAHTLVC+ALY+D +GE
Sbjct: 119 TIKAEIQTERQRILLLDTSKSPVESIRTGGRYDFIVEHDVKELGAHTLVCSALYNDADGE 178

Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
           RKYLPQFFKF+V+NPLSVRTKVRVVK       E TFLEACIENHTK+NL+MDQV+FEP+
Sbjct: 179 RKYLPQFFKFVVANPLSVRTKVRVVK-------ETTFLEACIENHTKANLFMDQVDFEPA 231

Query: 241 QNWSATMLKADGPHSD--YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQ 298
           + WSA  L+ +    D   +  S  I KPPV+IRSGGGIHNYLY+L   S   S   K Q
Sbjct: 232 KQWSAVRLQNEDSTEDPPTSGLSGLIPKPPVIIRSGGGIHNYLYKLNP-SADVSGQTKFQ 290

Query: 299 GSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKL 358
           GSN+LGK QITWRTNLGEPGRLQTQQILG  ++ KEI + V EVP+V+ +++PF   L L
Sbjct: 291 GSNILGKFQITWRTNLGEPGRLQTQQILGAPVSRKEINMRVAEVPAVIHLNRPFPAYLNL 350

Query: 359 TNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQR 418
           TNQTD++ GPFE+ LSQ++S  EK V INGL+ + L  +EAFGS DF LNLIA+KLGVQ+
Sbjct: 351 TNQTDRQLGPFEVSLSQDESQMEKPVGINGLQTLMLPRIEAFGSNDFQLNLIASKLGVQK 410

Query: 419 ITGITVFDKLEKITYDSLPDLEIFV 443
           I+GIT  D  EK TY+ +P++E+ V
Sbjct: 411 ISGITALDTREKKTYELVPEMEVSV 435


>gi|357146845|ref|XP_003574132.1| PREDICTED: UPF0533 protein C5orf44-like [Brachypodium distachyon]
          Length = 458

 Score =  560 bits (1442), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 284/454 (62%), Positives = 350/454 (77%), Gaps = 19/454 (4%)

Query: 3   STPGTHSLAFRVMRLCRPSLHVEPP--LRVDPTDLFIGEDIFD--DPIAASNL------P 52
           +T   HSLAFRVMRL RPSL  +P   LR DP D+F+ ED     DP AA+ L      P
Sbjct: 14  ATQQNHSLAFRVMRLSRPSLRPDPAALLRFDPRDVFLPEDALTSPDPSAAAELLHGLLHP 73

Query: 53  PLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS 112
           P  S+  TT    D T+R RFLL D AD++ L GLLVLPQAFGAIYLGETFCSYISINNS
Sbjct: 74  P-DSAVSTTAVPGDFTFRDRFLLRDPADALALPGLLVLPQAFGAIYLGETFCSYISINNS 132

Query: 113 STLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTA 172
           S LE R+V+IKAEIQT++QRILLLDTSKSPVESIR+GGRYDFIVEHDVKELGAHTLVCTA
Sbjct: 133 SGLEAREVIIKAEIQTERQRILLLDTSKSPVESIRSGGRYDFIVEHDVKELGAHTLVCTA 192

Query: 173 LYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYM 232
           LY+DG+ ERKYLPQFFKF VSNPLSVRTKVR +K       + T+LEACIENHTKSNLYM
Sbjct: 193 LYNDGDAERKYLPQFFKFTVSNPLSVRTKVRTIK-------DTTYLEACIENHTKSNLYM 245

Query: 233 DQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSS 292
           DQV+FEP++ WSAT+L+AD   S   +  R++ K P+LIR+GGGI+NYLYQL+  S   S
Sbjct: 246 DQVDFEPAEQWSATILEADEHPSVVKSTIRDLCKQPILIRAGGGIYNYLYQLRP-SSDES 304

Query: 293 SPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPF 352
           S +K +GS+VLGK QITWRTNLGEPGRLQTQ I  T   SK+++L  V+VP V+ +++PF
Sbjct: 305 SQIKAEGSSVLGKFQITWRTNLGEPGRLQTQNINSTPTPSKDVDLRAVKVPPVIFLERPF 364

Query: 353 LLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIAT 412
           ++ L +TNQT K  GPFE++L+ N S E+K V++NGL+ + L  VEAF S +F L+++AT
Sbjct: 365 MVNLCVTNQTGKTVGPFEVFLASNISGEQKAVLVNGLQKLVLPLVEAFESINFDLSMVAT 424

Query: 413 KLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD 446
           +LGVQ+I+GIT++   E+  Y+ LPD+EIFVD +
Sbjct: 425 QLGVQKISGITMYAVQERKYYEPLPDIEIFVDAE 458


>gi|326514588|dbj|BAJ96281.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  553 bits (1424), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 276/449 (61%), Positives = 345/449 (76%), Gaps = 20/449 (4%)

Query: 8   HSLAFRVMRLCRPSLHVEPP--LRVDPTDLFIGEDIFD--DPIAASNL------PPLISS 57
           HSLAFRVMRL RPSL  +P   LR DP D+F+ ED     DP AA++       PP    
Sbjct: 25  HSLAFRVMRLSRPSLRPDPAALLRFDPRDVFLPEDALTSPDPSAAADFLQGLLHPP--DP 82

Query: 58  DVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEV 117
              T  + D T+R RFLLHD+AD++   GLLVLPQAFGAIYLGETFCSYISINNSS LE 
Sbjct: 83  GAATTVAGDFTFRDRFLLHDTADALAPPGLLVLPQAFGAIYLGETFCSYISINNSSGLEA 142

Query: 118 RDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG 177
           R+V+IKAEIQT++QRILLLDTSKSPVESIR+GGRYDFIVEHDVKELGAHTLVCTALY+DG
Sbjct: 143 REVIIKAEIQTERQRILLLDTSKSPVESIRSGGRYDFIVEHDVKELGAHTLVCTALYNDG 202

Query: 178 EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
           + ERKYLPQFFKF VSNPLSVRTKVR +K       + T+LEACIENHTKSNLYMDQV+F
Sbjct: 203 DAERKYLPQFFKFTVSNPLSVRTKVRTIK-------DTTYLEACIENHTKSNLYMDQVDF 255

Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
           EP+Q WSAT+L+AD   S   +  R++ K P+LIR+ GGI+NYLYQL+  S      +K 
Sbjct: 256 EPAQQWSATILEADEHPSVVKSTIRDLCKQPILIRAAGGIYNYLYQLRP-SSDEPGQIKT 314

Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
           +GS++LGK QITWRTNLGEPGRLQTQ I  T   SK+++L  V++P V+ +++PF++ L 
Sbjct: 315 EGSSILGKFQITWRTNLGEPGRLQTQNIHSTPTPSKDVDLRAVKIPPVIFLERPFMVNLC 374

Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
           LTNQT+K  GPFE++L+ + S E+K V++NGL+ + L  VEAF S +F L+++AT+LGVQ
Sbjct: 375 LTNQTEKTVGPFEVFLAPSVSGEQKTVLVNGLQKLVLPLVEAFESINFDLSMVATQLGVQ 434

Query: 418 RITGITVFDKLEKITYDSLPDLEIFVDQD 446
           +I+GIT++   E+  Y+ LPD+EIFVD +
Sbjct: 435 KISGITLYAVQEREHYEPLPDIEIFVDAE 463


>gi|242039209|ref|XP_002466999.1| hypothetical protein SORBIDRAFT_01g018120 [Sorghum bicolor]
 gi|241920853|gb|EER93997.1| hypothetical protein SORBIDRAFT_01g018120 [Sorghum bicolor]
          Length = 461

 Score =  530 bits (1364), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 278/445 (62%), Positives = 342/445 (76%), Gaps = 14/445 (3%)

Query: 8   HSLAFRVMRLCRPSLH--VEPPLRVDPTDLFIGEDIF--DDPIAASNLPP--LISSDVTT 61
           HSLAFRVMRL RPSL   +   LR DP D+F+ ED     DP AA+N     L  SD  T
Sbjct: 25  HSLAFRVMRLSRPSLQPDLAALLRFDPRDVFLPEDALTGSDPSAAANFLDGLLHPSDSAT 84

Query: 62  NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
               D T+R RFLL D AD++ L GLLVLPQ+FGAIYLGETFCSYISINNSS+ E RDVV
Sbjct: 85  AVPGDFTFRDRFLLRDPADALALPGLLVLPQSFGAIYLGETFCSYISINNSSSFEARDVV 144

Query: 122 IKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
           IKAEIQT++QRILLLDTSKSPVESIR+GGRYDFIVEHDVKELGAHTLVCTALY+DG+GER
Sbjct: 145 IKAEIQTERQRILLLDTSKSPVESIRSGGRYDFIVEHDVKELGAHTLVCTALYNDGDGER 204

Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
           KYLPQFFKF VSNPLSVRTKVR +K       +IT+LEACIENHTKSNLYMDQV+FEP+Q
Sbjct: 205 KYLPQFFKFSVSNPLSVRTKVRTIK-------DITYLEACIENHTKSNLYMDQVDFEPAQ 257

Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
            WSAT L+AD   S   +   ++ K P+LIR+GGGI+NYLYQL+  S   +   K +GS+
Sbjct: 258 QWSATRLEADEHPSAVKSAIGDLCKQPILIRAGGGIYNYLYQLRS-SSDEAGQTKSEGSS 316

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           +LGK QITWRTNLGEPGRLQTQ I  T   SK+++L  V+VP ++ +++ F++ L LTNQ
Sbjct: 317 ILGKFQITWRTNLGEPGRLQTQNIHSTPTASKDVDLRAVKVPPIIYVERAFMVNLCLTNQ 376

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           TDK  GPFE++L+ + S E++ V++NG + + L  VEAF S  F+L+++AT+LGVQ+I+G
Sbjct: 377 TDKTVGPFEVFLAPSMSGEDRAVLVNGPQKLILPLVEAFESMKFNLSMVATQLGVQKISG 436

Query: 422 ITVFDKLEKITYDSLPDLEIFVDQD 446
           IT++   EK  Y+ LPD+EIFVD +
Sbjct: 437 ITMYAVQEKKYYEPLPDIEIFVDAE 461


>gi|22165060|gb|AAM93677.1| unknown protein [Oryza sativa Japonica Group]
 gi|31432882|gb|AAP54458.1| expressed protein [Oryza sativa Japonica Group]
 gi|218184826|gb|EEC67253.1| hypothetical protein OsI_34196 [Oryza sativa Indica Group]
          Length = 473

 Score =  516 bits (1329), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 275/453 (60%), Positives = 340/453 (75%), Gaps = 24/453 (5%)

Query: 8   HSLAFRVMRLCRPSLHVE--PPLRVDPTDLFIGEDIFDDPIAASN------------LPP 53
           HSLAFRVMRL RPSL  +    LR DP D+F+ ED    P  +++            L P
Sbjct: 31  HSLAFRVMRLSRPSLQPDQAAALRFDPRDVFLPEDALTGPDPSASSAADAAAFLQGLLHP 90

Query: 54  LISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSS 113
           L S   T     D T+R RFLL D  D++ L GLLVLPQ+FGAIYLGETFCSYISINNSS
Sbjct: 91  LDSPATTV--PGDFTFRDRFLLRDPVDALALPGLLVLPQSFGAIYLGETFCSYISINNSS 148

Query: 114 TLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTAL 173
           + E RDV IKAEIQT++QRILLLDTSK+PVESIR+GGRYDFIVEHDVKELGAHTLVCTAL
Sbjct: 149 SFEARDVAIKAEIQTERQRILLLDTSKAPVESIRSGGRYDFIVEHDVKELGAHTLVCTAL 208

Query: 174 YSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMD 233
           Y+DG+GERKYLPQFFKF VSNPLSVRTKVR +K       + T+LEACIENHTKSNLYMD
Sbjct: 209 YNDGDGERKYLPQFFKFTVSNPLSVRTKVRTIK-------DTTYLEACIENHTKSNLYMD 261

Query: 234 QVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSS 293
           QV+FEPSQ W+AT L+AD   S   +   ++ K P+LIR+GGGI+NYLYQL+  S G S 
Sbjct: 262 QVDFEPSQQWAATRLEADEHPSTVKSIIGDLCKQPILIRAGGGIYNYLYQLRP-SSGESG 320

Query: 294 PVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFL 353
             K +GS++LGK QITWRTNLGEPGRLQTQ I  T   SK+++L  V+VP V+ +++PF+
Sbjct: 321 QTKAEGSSILGKFQITWRTNLGEPGRLQTQNIHSTPTASKDVDLRAVKVPPVIFLERPFM 380

Query: 354 LKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATK 413
           + L LTNQ+DK  GPFE++L+ +  DEEK V++NGL+ + L  VEAF S +F L+++AT+
Sbjct: 381 VNLCLTNQSDKTVGPFEVFLAPSVLDEEKYVLVNGLQKLVLPLVEAFESINFDLSMVATQ 440

Query: 414 LGVQRITGITVFDKLEKITYDSLPDLEIFVDQD 446
           +GVQ+I+GIT++   EK  Y+ L D+EIFVD +
Sbjct: 441 VGVQKISGITLYAVQEKKLYEPLSDIEIFVDAE 473


>gi|302757339|ref|XP_002962093.1| hypothetical protein SELMODRAFT_76214 [Selaginella moellendorffii]
 gi|300170752|gb|EFJ37353.1| hypothetical protein SELMODRAFT_76214 [Selaginella moellendorffii]
          Length = 439

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 215/444 (48%), Positives = 296/444 (66%), Gaps = 21/444 (4%)

Query: 1   MSSTPGT--HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSD 58
           M+S  G   HSLAFRVMRLCRPS  V+ PL VDP+D+  GED       + N   L+   
Sbjct: 1   MTSGAGAAGHSLAFRVMRLCRPSCQVDHPLLVDPSDVCNGED-------SVNFKELLPGL 53

Query: 59  VTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVR 118
           V  N   D  +  RF L +  D++GLSG LVLPQ FG+IYLGETFCSYIS+ N +  +VR
Sbjct: 54  VNGN---DPGFWKRFELQEPMDAMGLSGQLVLPQTFGSIYLGETFCSYISVGNHTNHDVR 110

Query: 119 DVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE 178
           DV+IKAE+QT++QRI+L D SKSP+ESIRA GR+DFI+EHD+KELG HTLVC A+Y+D +
Sbjct: 111 DVIIKAELQTERQRIILSDNSKSPIESIRATGRFDFIIEHDIKELGGHTLVCMAVYTDPD 170

Query: 179 GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE 238
           G+RKYLPQ+FKF  SNP+SVRTKV           + TFLEACIEN TKS+L+MDQV FE
Sbjct: 171 GDRKYLPQYFKFTTSNPVSVRTKV-------FDLYDTTFLEACIENQTKSHLFMDQVRFE 223

Query: 239 PSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQ 298
           P+  WS T L+ +   S+ +       K   LI   GG  +YL+QLK      SS VK++
Sbjct: 224 PAPPWSVTTLENEEEASESDGPISGYIKSLKLINGNGGARHYLFQLKRPPL-ESSDVKLE 282

Query: 299 GSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKL 358
           G+N LGKL+I WRT LGE GRLQTQQI G+    K +++ +  +P  + I++PFL+++++
Sbjct: 283 GANALGKLEILWRTTLGETGRLQTQQINGSPTPKKPLDVKMTNLPQRILIERPFLVRMEV 342

Query: 359 TNQTDKEQGPFEIWLSQ-NDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
           TN++++  GP  + +S+ +D+   + V++NGL  + + P+    ST+  +NL+A   GVQ
Sbjct: 343 TNRSEQFTGPLRVVMSETDDNGTPRTVLMNGLLSLMVPPLAPLASTELEVNLVAVAAGVQ 402

Query: 418 RITGITVFDKLEKITYDSLPDLEI 441
           R+ GI + D  +    + +P  E+
Sbjct: 403 RVAGICLVDARDGRQVEFVPPTEV 426


>gi|302775158|ref|XP_002970996.1| hypothetical protein SELMODRAFT_95233 [Selaginella moellendorffii]
 gi|300160978|gb|EFJ27594.1| hypothetical protein SELMODRAFT_95233 [Selaginella moellendorffii]
          Length = 439

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 220/451 (48%), Positives = 297/451 (65%), Gaps = 35/451 (7%)

Query: 1   MSSTPGT--HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSD 58
           M+S  G   HSLAFRVMRLCRPS  V+ PL VDP+D+  GED       + N   L+   
Sbjct: 1   MTSGAGAAGHSLAFRVMRLCRPSCQVDHPLLVDPSDVCNGED-------SVNFKELLPGL 53

Query: 59  VTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVR 118
           V  N   D  +  RF L +  D++GLSG LVLPQ FG+IYLGETFCSYIS+ N +  +VR
Sbjct: 54  VNGN---DPGFWKRFELQEPMDAMGLSGQLVLPQTFGSIYLGETFCSYISVGNHTNHDVR 110

Query: 119 DVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE 178
           DV+IKAE+QT++QRI+L D SKSP+ESIRA GR+DFI+EHD+KELG HTLVC A+Y+D +
Sbjct: 111 DVIIKAELQTERQRIILSDNSKSPIESIRATGRFDFIIEHDIKELGGHTLVCMAVYTDPD 170

Query: 179 GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE 238
           G+RKYLPQ+FKF  SNP+SVRTKVR VK       + TFLEACIEN TKS+L+MDQV FE
Sbjct: 171 GDRKYLPQYFKFTTSNPVSVRTKVRTVK-------DTTFLEACIENQTKSHLFMDQVRFE 223

Query: 239 PSQNWSATML-------KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGS 291
           P+  WS T L       ++DGP S Y        K   LI   GG  +YL+QLK      
Sbjct: 224 PAPPWSVTTLENEEEASESDGPISGY-------IKSLKLINGNGGARHYLFQLKRPPL-E 275

Query: 292 SSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKP 351
           SS VK++G+N LGKL+I WRT LGE GRLQTQQI G+    K +++ +  +P  + I++P
Sbjct: 276 SSDVKLEGANALGKLEILWRTTLGETGRLQTQQINGSPTPKKPLDVKMTNLPQRILIERP 335

Query: 352 FLLKLKLTNQTDKEQGPFEIWLSQ-NDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLI 410
           FL+++++TN++++  GP  + +S+ +D+   + V++NGL  +  + +    +     NL+
Sbjct: 336 FLVRMEVTNRSEQFTGPLRVVMSETDDNGTPRTVLMNGLLSLVSSRIHEDLTGTLSQNLV 395

Query: 411 ATKLGVQRITGITVFDKLEKITYDSLPDLEI 441
           A   GVQRI GI + D  +    + +P  E+
Sbjct: 396 AVAAGVQRIAGICLVDARDGRQVEFVPPTEV 426


>gi|168006879|ref|XP_001756136.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692646|gb|EDQ79002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 518

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 218/476 (45%), Positives = 304/476 (63%), Gaps = 47/476 (9%)

Query: 1   MSSTPGT--HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSD 58
           MSS PG   HSLAFRVMRLCRP+L V+  LR DP DL  GED+ D    +  L   I S 
Sbjct: 60  MSSGPGGTGHSLAFRVMRLCRPALQVDLGLRFDPMDLVQGEDLHD----SEELQASIES- 114

Query: 59  VTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVR 118
               +  +  Y  R  L    D++GL GLLVLPQ FG+IYLGE+FCSYIS+ N S  +VR
Sbjct: 115 ----RDKEGPYWRRSELEKPIDALGLPGLLVLPQTFGSIYLGESFCSYISVGNHSNHDVR 170

Query: 119 DVVIKA--------------------------EIQTDKQRILLLDTSKSPVESIRAGGRY 152
           DV IKA                          E+QT++QR+ L D +K+P++ I AGGR+
Sbjct: 171 DVGIKASFLPGSYIAWTDNGVSRCKYGQLCGAELQTERQRVTLYDNTKAPMDFICAGGRH 230

Query: 153 DFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHF 212
           DFI+EHD+KELG HTLVC A+Y+D + ERKYLPQ+FKF+ SNPLSVRTKVR+VK      
Sbjct: 231 DFIIEHDIKELGPHTLVCMAVYTDADAERKYLPQYFKFMASNPLSVRTKVRIVK------ 284

Query: 213 QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS-REIFKPPVLI 271
            + T+LEACIEN TKS L++D V F+P    + ++L+ +   +D +      + K   +I
Sbjct: 285 -DTTYLEACIENSTKSLLFLDHVRFDPQPPMTVSVLEVESNENDESEGPLSGLLKQIKVI 343

Query: 272 RSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTIT 331
           ++ GG  ++LYQ    + G     K  GSN LGKL+I WRT LGEPGRLQTQQILG    
Sbjct: 344 KANGGTRHFLYQFHKPA-GVPVSTKADGSNTLGKLEIMWRTTLGEPGRLQTQQILGNPSP 402

Query: 332 SKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDE-EKVVMINGLR 390
            KE+ L +VE+PS + +++PFL+++ ++N TD+  GP +I +SQ+D+    + +++NGL 
Sbjct: 403 RKEVSLRIVEIPSRILLERPFLVRMSVSNHTDRTVGPLQISMSQDDAQGVPRAIVVNGLW 462

Query: 391 IMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD 446
            M +  ++   STD +L+L+AT +GVQ+ITG+ + D+ +   YD+L   E+FV+ +
Sbjct: 463 SMTVPQLDPLASTDVNLSLVATAVGVQKITGVGLTDRRDGKPYDALTATEVFVESE 518


>gi|449530845|ref|XP_004172402.1| PREDICTED: UPF0533 protein C5orf44-like, partial [Cucumis sativus]
          Length = 239

 Score =  337 bits (864), Expect = 8e-90,   Method: Compositional matrix adjust.
 Identities = 160/244 (65%), Positives = 200/244 (81%), Gaps = 8/244 (3%)

Query: 202 VRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS 261
           VRVVK       + TFLEACIENHTKSNL+MDQV+FEPS NW+A ++ AD  HS++ + +
Sbjct: 1   VRVVK-------DSTFLEACIENHTKSNLFMDQVDFEPSPNWNAVIINADEHHSEHKSTT 53

Query: 262 REIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQ 321
           RE+FKPPVL+RSGGGIHN+LYQLK  ++G SSP+KV+GSN+LGKLQITWRTN+GEPGRLQ
Sbjct: 54  REVFKPPVLVRSGGGIHNFLYQLKCSTNGPSSPLKVEGSNILGKLQITWRTNMGEPGRLQ 113

Query: 322 TQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEE 381
           TQQILG+ IT KE+ELNVVE+P V+ +++PF L ++LT Q ++E GPFE+W+S N SDE+
Sbjct: 114 TQQILGSPITRKELELNVVEMPDVIRLERPFTLHMRLTTQIERELGPFEVWMSLNSSDED 173

Query: 382 KVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDS-LPDLE 440
           KVVM+NGL+ + +  VE +GSTDFHLNLIATK GVQRI GI VFD  EK  Y+   PDLE
Sbjct: 174 KVVMVNGLQKVVIPRVEPYGSTDFHLNLIATKPGVQRIAGIKVFDTREKKAYEHPSPDLE 233

Query: 441 IFVD 444
           I+VD
Sbjct: 234 IYVD 237


>gi|222613087|gb|EEE51219.1| hypothetical protein OsJ_32047 [Oryza sativa Japonica Group]
          Length = 402

 Score =  335 bits (860), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 164/279 (58%), Positives = 214/279 (76%), Gaps = 8/279 (2%)

Query: 168 LVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTK 227
           LVCTALY+DG+GERKYLPQFFKF VSNPLSVRTKVR +K       + T+LEACIENHTK
Sbjct: 132 LVCTALYNDGDGERKYLPQFFKFTVSNPLSVRTKVRTIK-------DTTYLEACIENHTK 184

Query: 228 SNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKML 287
           SNLYMDQV+FEPSQ W+AT L+AD   S   +   ++ K P+LIR+GGGI+NYLYQL+  
Sbjct: 185 SNLYMDQVDFEPSQQWAATRLEADEHPSTVKSIIGDLCKQPILIRAGGGIYNYLYQLRP- 243

Query: 288 SHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVG 347
           S G S   K +GS++LGK QITWRTNLGEPGRLQTQ I  T   SK+++L  V+VP V+ 
Sbjct: 244 SSGESGQTKAEGSSILGKFQITWRTNLGEPGRLQTQNIHSTPTASKDVDLRAVKVPPVIF 303

Query: 348 IDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHL 407
           +++PF++ L LTNQ+DK  GPFE++L+ +  DEEK V++NGL+ + L  VEAF S +F L
Sbjct: 304 LERPFMVNLCLTNQSDKTVGPFEVFLAPSVLDEEKYVLVNGLQKLVLPLVEAFESINFDL 363

Query: 408 NLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD 446
           +++AT++GVQ+I+GIT++   EK  Y+ L D+EIFVD +
Sbjct: 364 SMVATQVGVQKISGITLYAVQEKKLYEPLSDIEIFVDAE 402



 Score = 46.2 bits (108), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 26/46 (56%), Positives = 30/46 (65%), Gaps = 4/46 (8%)

Query: 8  HSLAFRVMRLCRPSLHVE--PPLRVDPTDLFIGEDIF--DDPIAAS 49
          HSLAFRVMRL RPSL  +    LR DP D+F+ ED     DP A+S
Sbjct: 31 HSLAFRVMRLSRPSLQPDQAAALRFDPRDVFLPEDALTGPDPSASS 76


>gi|449526317|ref|XP_004170160.1| PREDICTED: UPF0533 protein C5orf44-like, partial [Cucumis sativus]
          Length = 278

 Score =  329 bits (844), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 157/201 (78%), Positives = 184/201 (91%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           MS+  G+HSLAFRVMRLCRPS  V+PPLR+DP DL +GEDI DDP+AA+ LP L++  ++
Sbjct: 78  MSNAQGSHSLAFRVMRLCRPSFQVDPPLRLDPVDLLVGEDILDDPVAANQLPRLLAPQLS 137

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
            +  SDL+Y SRFLLHDS+D++GL+GLLVLPQAFGAIYLGETFCSYIS+NNSS  EVRDV
Sbjct: 138 DDSDSDLSYSSRFLLHDSSDAMGLNGLLVLPQAFGAIYLGETFCSYISVNNSSNFEVRDV 197

Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
           +IKAEIQT++QRILLLD+SKSPVE+IRAGGRYDFIVEHDVKELGAHTLVCTALY+DG+GE
Sbjct: 198 IIKAEIQTERQRILLLDSSKSPVETIRAGGRYDFIVEHDVKELGAHTLVCTALYNDGDGE 257

Query: 181 RKYLPQFFKFIVSNPLSVRTK 201
           RKYLPQFFKF+V+NPLSVRTK
Sbjct: 258 RKYLPQFFKFMVANPLSVRTK 278


>gi|302757333|ref|XP_002962090.1| hypothetical protein SELMODRAFT_77366 [Selaginella moellendorffii]
 gi|300170749|gb|EFJ37350.1| hypothetical protein SELMODRAFT_77366 [Selaginella moellendorffii]
          Length = 318

 Score =  318 bits (816), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 160/322 (49%), Positives = 226/322 (70%), Gaps = 14/322 (4%)

Query: 74  LLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRI 133
           L  +  D++GLS  LVLPQ FG+IYLGETFCSYIS+ N +  +VRDV+IKAE+QT++QRI
Sbjct: 2   LPQEPMDAMGLSRQLVLPQTFGSIYLGETFCSYISVGNHTNHDVRDVIIKAELQTERQRI 61

Query: 134 LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVS 193
           +L + SKSP+ESIRA G++DFI+EHD+KELG HTLVC A+Y+D +G+RKYLPQ+FKF  S
Sbjct: 62  ILSNNSKSPIESIRATGQFDFIIEHDIKELGGHTLVCMAVYTDPDGDRKYLPQYFKFTTS 121

Query: 194 NPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGP 253
           NP+SVRTKV           + TFLEACIEN TKS+L+MDQV F+ +  WS T L+    
Sbjct: 122 NPVSVRTKV-------FDLYDTTFLEACIENQTKSHLFMDQVRFDTAPPWSVTTLENVVN 174

Query: 254 HSDYNAQSREIFKPPV-----LIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQI 308
               + +  E++   +     LI   GG  +YL+QLK      SS VK++G+N LGKL+I
Sbjct: 175 QMVPSGKKMELYYQQLCLSLKLINGNGGARHYLFQLKR-PPLESSDVKLEGANALGKLEI 233

Query: 309 TWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGP 368
            WRT LGE GRLQTQQI G+    K +++ +  +P  + I++PFL+++++TN++++  GP
Sbjct: 234 LWRTTLGETGRLQTQQINGSPTPKKPLDVKMTNLPQRILIERPFLVRMEVTNRSEQFTGP 293

Query: 369 FEIWLSQNDSD-EEKVVMINGL 389
             + +S+ D +   + V++NGL
Sbjct: 294 LRVVMSETDDNGTPRTVLMNGL 315


>gi|414870887|tpg|DAA49444.1| TPA: hypothetical protein ZEAMMB73_593757 [Zea mays]
          Length = 239

 Score =  274 bits (700), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 151/207 (72%), Positives = 167/207 (80%), Gaps = 6/207 (2%)

Query: 8   HSLAFRVMRLCRPSLH--VEPPLRVDPTDLFIGEDIF--DDPIAASNL--PPLISSDVTT 61
           HSLAFRVMRL RPSL   +   LR DP D+F+ ED     DP AA+      L  +D  T
Sbjct: 25  HSLAFRVMRLSRPSLQPDLAALLRFDPRDVFLPEDALTGSDPSAAAKFLHGLLHPADSAT 84

Query: 62  NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
               D T+R RFLL D AD++ L GLLVLPQ+FGAIYLGETFCSYISINNSS+ E RDVV
Sbjct: 85  AVPGDFTFRDRFLLRDPADALALPGLLVLPQSFGAIYLGETFCSYISINNSSSFEARDVV 144

Query: 122 IKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
           IKAEIQT++QRILLLDTSKSPVESIR+GGRYDFIVEHDVKELGAHTLVCTALY+DG+GER
Sbjct: 145 IKAEIQTERQRILLLDTSKSPVESIRSGGRYDFIVEHDVKELGAHTLVCTALYNDGDGER 204

Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVG 208
           KYLPQFFKF VSNPLSVRTKVR +KVG
Sbjct: 205 KYLPQFFKFSVSNPLSVRTKVRTIKVG 231


>gi|384248215|gb|EIE21700.1| DUF974-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 417

 Score =  250 bits (639), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 152/447 (34%), Positives = 246/447 (55%), Gaps = 40/447 (8%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H+LAFRVMRLCRP +  E      P  L + +D   D +A            + +   DL
Sbjct: 2   HALAFRVMRLCRPDIPAE-----FPKGLGLRQDFLPDDLALE----------SNSGEEDL 46

Query: 68  T--YRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
           T  +  R  + +  D++G+ G+L LPQ FG I+LGE F SYIS+ N S   V +VVIKAE
Sbjct: 47  TGPFAHRANIENPIDALGIDGVLELPQNFGTIHLGEAFSSYISVGNYSNATVEEVVIKAE 106

Query: 126 IQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP 185
           +Q+ +Q++ L +T+ +P+  +  G R+DF+++HD+KE+ A+TL+C+  Y D +GE  Y P
Sbjct: 107 LQSARQKMTLYETA-TPLPKLDPGERHDFLIKHDIKEISAYTLICSTSYID-KGETAYQP 164

Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQN--- 242
           Q+FKF+  NPLSVRTK+R            TFLEAC+EN T   L +  +  + + +   
Sbjct: 165 QYFKFVAQNPLSVRTKIR-------SLTRQTFLEACVENLTSRPLVLAYIRLDAAPSVVA 217

Query: 243 WSATMLKADG-PHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS- 300
             A+   +DG P  D  + S   +   + I   GG  N+LY L    H S +     GS 
Sbjct: 218 VPASSAWSDGEPSKDAESSSLGSYADSLQIVDAGGSSNFLYAL----HSSKASPAEAGSA 273

Query: 301 --NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKL 358
               LGK++I WR NLG+ GRLQTQQI+   + SK++EL +  +P  V ++ PF  K+ +
Sbjct: 274 LTGALGKMEIRWRGNLGKLGRLQTQQIMANAVNSKDVELLLTSLPQAVHLEIPFAAKVTV 333

Query: 359 TNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQR 418
            +  D+      + + +  +  E  +++  L    ++ ++A+GS+     L+  K G+Q+
Sbjct: 334 RSNVDRTLENLALRVPEQPA--EGGLVVEDLSSTVVSRLDAYGSSSVVCTLLPMKEGLQK 391

Query: 419 ITGITVFDKLEKITYDSLPDLEIFVDQ 445
           +  + +  + +    D + D++ FV++
Sbjct: 392 LQAVELISQQDGRILDVM-DIDCFVNR 417


>gi|303270983|ref|XP_003054853.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462827|gb|EEH60105.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 500

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 157/515 (30%), Positives = 239/515 (46%), Gaps = 102/515 (19%)

Query: 2   SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTT 61
           ++ P   ++ FRVMR C P+L ++ P R      F  +D+   P A S           T
Sbjct: 16  AAAPLPQAIQFRVMRTCAPTLKIDTPSR------FALDDLGHPPCAPS-----------T 58

Query: 62  NKSSDLTYRSRFLLH-DSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
           + SSD+ + SR  L   ++ + G++G L LPQAFG +YLGETF +Y+S  NSS   VRDV
Sbjct: 59  STSSDVAFESRVDLGLRASRASGVTGTLCLPQAFGNVYLGETFAAYVSAINSSDRVVRDV 118

Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
             KAE+QT+++R+ L D +     ++  G  +DF   HD+KELGAHTLVC  +Y+D +GE
Sbjct: 119 SFKAELQTERRRVALFDNAAEAAPTMPPGATFDFTATHDLKELGAHTLVCGVVYTDADGE 178

Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
           RKY PQ+FKF  +NPL+VRTKVR  + G         LEACIEN T + L + +  FEP 
Sbjct: 179 RKYAPQYFKFNAANPLAVRTKVRPGRDGR------ALLEACIENATPAPLLLSRATFEPC 232

Query: 241 QNW------------SATMLKADGPHSDYNAQSREIF----------------------- 265
            +             +  ++    PH                                  
Sbjct: 233 AHLECDEIVPACVSGAGVVIPEGDPHRGEEGGGGGGGGGGGARDAAAAGGSGLGEGLPSL 292

Query: 266 --KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQ 323
             +P  ++   GG  ++L++L+        P     S+ LGKL+I W  + GE GRLQTQ
Sbjct: 293 ANRPLRVLSPQGGSTHFLFELRQ------RPDITVTSDTLGKLEIRWTGHNGEAGRLQTQ 346

Query: 324 QILGTT-ITSKEIELNVVE--VPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSD- 379
           QI+G+  I  K++E+       P    +  P  L   +TN+T       E+ ++Q DSD 
Sbjct: 347 QIVGSPRIGGKDVEVAFAHGAPPKTARVHAPLTLSCVVTNKTASATRALEV-IAQPDSDV 405

Query: 380 ------------------------EEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLG 415
                                       ++++G + +A+  +   G     L  + T  G
Sbjct: 406 VGGGATGGGGGATGATGGATGGGGGVAGILVDGPQRIAIGALPPGGERRVELTCVPTLPG 465

Query: 416 VQRITGITVF------DKLEKITYDSLPDLEIFVD 444
            +R+  ++V       D      +D L   E+ V+
Sbjct: 466 TRRLPIVSVAEARGDGDARGGRVFDQLARFEVLVE 500


>gi|347582612|ref|NP_001231572.1| UPF0533 protein C5orf44 homolog isoform 1 [Danio rerio]
          Length = 418

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 141/433 (32%), Positives = 223/433 (51%), Gaps = 48/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF+                L+  D +T K
Sbjct: 10  HLLALKVMRLTKPTLFTNMPVTCEDRDLPGDLFLR---------------LMKDDPSTVK 54

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                          A+++ L  +L LPQ FG I+LGETF SYIS++N S+  V+D+++K
Sbjct: 55  G--------------AETLILGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S S V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNSAVSELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKLY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMY 219

Query: 244 SATMLK--ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L   A G  S  +   +  +  P+  R       YLY LK     +     ++G  
Sbjct: 220 NVTELNNVASGDESSESTFGKMSYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVT 273

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 274 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEFIPDTVDLEEPFDITCKITNC 333

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++     ++ L   ++       ++G ++  L+P     S    L L+++  G+Q I+G
Sbjct: 334 SERT---MDLLLEMCNTRSVHWCGVSGRQLGKLSPS---ASLSIPLKLLSSVQGLQSISG 387

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 388 LRLTDTFLKRTYE 400


>gi|197100367|ref|NP_001125291.1| UPF0533 protein C5orf44 homolog [Pongo abelii]
 gi|75042171|sp|Q5RCG0.1|CE044_PONAB RecName: Full=UPF0533 protein C5orf44 homolog
 gi|55727584|emb|CAH90547.1| hypothetical protein [Pongo abelii]
          Length = 417

 Score =  207 bits (528), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 138/433 (31%), Positives = 217/433 (50%), Gaps = 49/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLK--ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L   +    S     SR   +P            YLY LK     +     ++G  
Sbjct: 220 NVTELNSVSQAGESVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 386

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 387 LRLTDTFLKRTYE 399


>gi|148277000|ref|NP_079217.2| UPF0533 protein C5orf44 isoform 2 [Homo sapiens]
 gi|206558220|sp|A5PLN9.2|CE044_HUMAN RecName: Full=UPF0533 protein C5orf44
 gi|119571728|gb|EAW51343.1| hypothetical protein FLJ13611, isoform CRA_a [Homo sapiens]
 gi|410217874|gb|JAA06156.1| chromosome 5 open reading frame 44 [Pan troglodytes]
 gi|410217876|gb|JAA06157.1| chromosome 5 open reading frame 44 [Pan troglodytes]
 gi|410249602|gb|JAA12768.1| chromosome 5 open reading frame 44 [Pan troglodytes]
 gi|410249604|gb|JAA12769.1| chromosome 5 open reading frame 44 [Pan troglodytes]
 gi|410249606|gb|JAA12770.1| chromosome 5 open reading frame 44 [Pan troglodytes]
 gi|410292066|gb|JAA24633.1| chromosome 5 open reading frame 44 [Pan troglodytes]
 gi|410292068|gb|JAA24634.1| chromosome 5 open reading frame 44 [Pan troglodytes]
 gi|410292070|gb|JAA24635.1| chromosome 5 open reading frame 44 [Pan troglodytes]
 gi|410339455|gb|JAA38674.1| chromosome 5 open reading frame 44 [Pan troglodytes]
          Length = 417

 Score =  207 bits (527), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 137/433 (31%), Positives = 219/433 (50%), Gaps = 49/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L +     +  +   SR   +P            YLY LK  +  +     ++G  
Sbjct: 220 NVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKNEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 386

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 387 LRLTDTFLKRTYE 399


>gi|348524306|ref|XP_003449664.1| PREDICTED: UPF0533 protein C5orf44 homolog [Oreochromis niloticus]
          Length = 417

 Score =  207 bits (527), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 145/431 (33%), Positives = 226/431 (52%), Gaps = 45/431 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P   +  DL    D+F           L+  D +T K    
Sbjct: 10  HLLALKVMRLTKPTLFTNLPATCEDRDL--PGDLFGQ---------LMRQDPSTIKG--- 55

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                      A+++ L  +L LPQ FG I+LGETF SYIS++N S+  V+D+++KA++Q
Sbjct: 56  -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S S V  ++     D ++ H+VKE+G H LVC   Y+  +GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNSAVAELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQQGEKLYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++ T 
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMYNVTE 223

Query: 248 L----KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
           L    +AD   S +   S   +  P+  R       YLY LK     +     ++G  V+
Sbjct: 224 LNMVTQADKGESTFGKMS---YLQPMDTR------QYLYCLKPKPEYAEKAGIIKGVTVI 274

Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
           GKL I W+TNLGE GRLQT Q+        +I L++  +P  V +++PF +  K+TN ++
Sbjct: 275 GKLDIVWKTNLGERGRLQTSQLQRMAPGYGDIRLSLDLIPDTVNLEEPFDIICKITNCSE 334

Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
           +     ++ L   ++       I+G ++  L+P  AF S    L ++++  G+Q I+G+ 
Sbjct: 335 RT---MDLVLEMCNTSSIHWCGISGRQLGKLSP-GAFLS--LPLTVLSSVQGLQSISGLR 388

Query: 424 VFDKLEKITYD 434
           + D   K TY+
Sbjct: 389 LTDTFLKRTYE 399


>gi|332233704|ref|XP_003266043.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Nomascus
           leucogenys]
          Length = 418

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 138/433 (31%), Positives = 217/433 (50%), Gaps = 48/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           S T L +     +  +   SR   +P            YLY LK     +     ++G  
Sbjct: 220 SVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 333 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 387

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 388 LRLTDTFLKRTYE 400


>gi|318102158|ref|NP_001187397.1| upf0533 protein c5orf44-like protein [Ictalurus punctatus]
 gi|308322905|gb|ADO28590.1| upf0533 protein c5orf44-like protein [Ictalurus punctatus]
          Length = 417

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 139/431 (32%), Positives = 216/431 (50%), Gaps = 45/431 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA + MRL +P+L    P+  +    P DLF G  + +DP       PL+        
Sbjct: 10  HLLALKAMRLTKPTLFTNMPVTCEDRDLPGDLF-GRLMREDPSTIKGAEPLM-------- 60

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                               L  +L LPQ FG I+LGETF SYIS++N ST  V+D+++K
Sbjct: 61  --------------------LGEMLTLPQNFGNIFLGETFSSYISVHNDSTQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S S V  ++     D ++ H+VKE+G H LVC   Y+   G++ Y
Sbjct: 101 ADLQTSSQR-LNLSASNSAVAELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQTGDKLY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMY 219

Query: 244 SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
           + T L     ++  + + RE     +          YLY LK     +     ++G  V+
Sbjct: 220 NVTEL-----NTVCSGEERESTFGKMSYLQPMDTRQYLYCLKPKPEFAEKAGVIKGVTVI 274

Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
           GKL I W+TNLGE GRLQT Q+        ++ L++  VP  V I++PF +  K+TN ++
Sbjct: 275 GKLDIVWKTNLGEKGRLQTSQLQRMAPGYGDVRLSLELVPDTVNIEEPFDITCKITNCSE 334

Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
           +     ++ L   ++       ++G ++  L P     S    L L+++  G+Q I+G+ 
Sbjct: 335 RT---MDLLLEMCNTRSVHWCGVSGRQLGKLGPS---ASLSIPLQLLSSVQGLQSISGLR 388

Query: 424 VFDKLEKITYD 434
           + D   K TY+
Sbjct: 389 LTDTFLKRTYE 399


>gi|148277002|ref|NP_001087224.1| UPF0533 protein C5orf44 isoform 1 [Homo sapiens]
 gi|114600020|ref|XP_517735.2| PREDICTED: UPF0533 protein C5orf44 homolog isoform 4 [Pan
           troglodytes]
 gi|397514419|ref|XP_003827485.1| PREDICTED: UPF0533 protein C5orf44 homolog [Pan paniscus]
 gi|119571733|gb|EAW51348.1| hypothetical protein FLJ13611, isoform CRA_f [Homo sapiens]
          Length = 418

 Score =  206 bits (525), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 137/433 (31%), Positives = 218/433 (50%), Gaps = 48/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L +     +  +   SR   +P            YLY LK  +  +     ++G  
Sbjct: 220 NVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKNEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 333 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 387

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 388 LRLTDTFLKRTYE 400


>gi|347582610|ref|NP_955832.2| UPF0533 protein C5orf44 homolog isoform 2 [Danio rerio]
 gi|190360173|sp|Q6PBY7.2|CE044_DANRE RecName: Full=UPF0533 protein C5orf44 homolog
          Length = 412

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 141/433 (32%), Positives = 221/433 (51%), Gaps = 54/433 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF+                L+  D +T K
Sbjct: 10  HLLALKVMRLTKPTLFTNMPVTCEDRDLPGDLFLR---------------LMKDDPSTVK 54

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                          A+++ L  +L LPQ FG I+LGETF SYIS++N S+  V+D+++K
Sbjct: 55  G--------------AETLILGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S S V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNSAVSELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKLY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPSMMY 213

Query: 244 SATMLK--ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L   A G  S  +   +  +  P+  R       YLY LK     +     ++G  
Sbjct: 214 NVTELNNVASGDESSESTFGKMSYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVT 267

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 268 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEFIPDTVDLEEPFDITCKITNC 327

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++     ++ L   ++       ++G ++  L+P     S    L L+++  G+Q I+G
Sbjct: 328 SERT---MDLLLEMCNTRSVHWCGVSGRQLGKLSPS---ASLSIPLKLLSSVQGLQSISG 381

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 382 LRLTDTFLKRTYE 394


>gi|403267437|ref|XP_003925839.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 418

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 137/433 (31%), Positives = 218/433 (50%), Gaps = 48/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLKADGPHSDYNA--QSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L +     +  +  +SR   +P            YLY LK     +     ++G  
Sbjct: 220 NVTELNSVSQAGECVSTFRSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +     ++ L   +++      I+G ++  L P  +       L LI++  G+Q ++G
Sbjct: 333 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLISSVQGLQSVSG 387

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 388 LRLTDTFLKRTYE 400


>gi|47228413|emb|CAG05233.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 410

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 142/431 (32%), Positives = 218/431 (50%), Gaps = 45/431 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F           L+  D +T K    
Sbjct: 10  HLLALKVMRLTKPTLFTNLPVTCEDRDL--PGDLFSQ---------LMREDPSTIKG--- 55

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                      A+++ L  +L LPQ FG I+LGETF SYIS++N S+  V+D+++KA++Q
Sbjct: 56  -----------AENLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S S V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNSAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTSQYGEKLYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++ T 
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMYNVTE 223

Query: 248 LKA----DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
           L      D     +   S   +  P+  R       YLY LK     +     ++G  V+
Sbjct: 224 LNMGTSRDTEECTFGKMS---YLQPMDTR------QYLYCLKPKPEYAEKAGVIKGVTVI 274

Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
           GKL I W+TNLGE GRLQT Q+        +I L++  +P  V +++PF L  K+TN ++
Sbjct: 275 GKLDIVWKTNLGERGRLQTSQLQRMAPGYGDIRLSLEVIPDTVNLEEPFDLICKITNCSE 334

Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
           +     ++ L   ++        +G ++  L P     S    L L ++  G+Q I+G+ 
Sbjct: 335 R---TMDLVLEMCNTASIHWCGTSGRKLGKLGPA---ASLSLPLTLFSSVQGLQSISGLR 388

Query: 424 VFDKLEKITYD 434
           + D   K TY+
Sbjct: 389 LKDTFLKRTYE 399


>gi|432884725|ref|XP_004074559.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Oryzias
           latipes]
          Length = 417

 Score =  205 bits (521), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 143/438 (32%), Positives = 227/438 (51%), Gaps = 45/438 (10%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           ++ T   H LA +VMRL +P+L    P+  +  DL    D+F           L+  D +
Sbjct: 3   VNQTKQEHLLALKVMRLTKPTLFTNLPVTCEERDL--PGDLFGQ---------LMRQDPS 51

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
           T K               A+++ L  +L LPQ FG I+LGETF SYIS++N ST  V+++
Sbjct: 52  TIKG--------------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSTQIVKEI 97

Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
           ++KA++QT  QR L L TS S V  ++     D ++ H+VKE+G H LVC   Y+   GE
Sbjct: 98  LVKADLQTSSQR-LNLSTSNSAVAELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQLGE 156

Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
           + Y  +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EP+
Sbjct: 157 KLYFRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPT 216

Query: 241 QNWSATMLK----ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
             ++ T L      D   S +   S   +  P+  R       YLY LK  +  +     
Sbjct: 217 IMYNVTELNTVASGDDGESTFGKMS---YLQPMDTR------QYLYCLKPKAEYAEKAGV 267

Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
           ++G  ++GKL I WRTNLGE GRLQT Q+        +I L++  +P  V +++PF +  
Sbjct: 268 IKGVTMIGKLDIVWRTNLGEKGRLQTSQLQRMAPGYGDIRLSLEIIPDTVNLEEPFDIVC 327

Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
           K+TN +++     ++ +   ++       I+G ++  L+P    GS    L + ++  G+
Sbjct: 328 KITNCSER---TMDLVVEMCNTRSIHWCGISGRQLGKLSP---GGSLLVPLTIFSSVQGL 381

Query: 417 QRITGITVFDKLEKITYD 434
           Q I+G+ + D   K TY+
Sbjct: 382 QSISGLRLTDTFLKRTYE 399


>gi|344272589|ref|XP_003408114.1| PREDICTED: UPF0533 protein C5orf44 homolog [Loxodonta africana]
          Length = 418

 Score =  204 bits (519), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 136/429 (31%), Positives = 219/429 (51%), Gaps = 40/429 (9%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S+  V+D+++KA++Q
Sbjct: 53  --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASSAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYATQSGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++   
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITNSPMFMEKVSLEPSIMYNVAE 223

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L A     +  +   SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 224 LNAVNQAGECISTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 336

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G+ + 
Sbjct: 337 T--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 391

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 392 DTFLKRTYE 400


>gi|224090703|ref|XP_002190150.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Taeniopygia
           guttata]
          Length = 417

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 136/429 (31%), Positives = 220/429 (51%), Gaps = 40/429 (9%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    ++F+          L+  D +T K    
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDL--PGNLFNQ---------LMKDDPSTVKG--- 55

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                      A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 56  -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASSAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++   
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMYNVAE 223

Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
           L       D   +S   F     ++       YLY LK     +     ++G  V+GKL 
Sbjct: 224 LNT----VDTAGESESTFGTRTYLQP-MDTRQYLYCLKPKQEFAEKAGVIKGVTVIGKLD 278

Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
           I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + +   
Sbjct: 279 IVWKTNLGEHGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFDITCKITNCSSER-- 336

Query: 368 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFH--LNLIATKLGVQRITGITVF 425
             ++ L   +++      ++G ++  L P     S+  H  L L+++  G+Q ++G+ + 
Sbjct: 337 TMDLVLEMCNTNSIHWCGVSGRQLGKLYP-----SSSLHLALTLLSSVQGLQSVSGLRLT 391

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 392 DTFLKRTYE 400


>gi|303304982|ref|NP_001181925.1| uncharacterized protein LOC427165 isoform 1 [Gallus gallus]
          Length = 418

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 136/429 (31%), Positives = 218/429 (50%), Gaps = 40/429 (9%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    ++F+          L+  D +T K    
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDL--PGNLFNQ---------LMKDDPSTVKG--- 55

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                      A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 56  -----------AEALMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASTAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++   
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVAE 223

Query: 248 LKA--DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L        S+    SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 224 LNTVDSAGESESTFGSRTYLQP-------MDTRQYLYCLKPKQEFAEKAGVIKGVTVIGK 276

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFDITCKITNCSSER 336

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   +++      ++G ++  L P     S    L L+++  G+Q ++G+ + 
Sbjct: 337 --TMDLVLEMCNTNSIHWCGVSGRQLGKLHPS---SSLRLALTLLSSVQGLQSVSGLRLT 391

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 392 DTFLKRTYE 400


>gi|334325202|ref|XP_001381439.2| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Monodelphis
           domestica]
          Length = 418

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 137/435 (31%), Positives = 220/435 (50%), Gaps = 52/435 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEDRDLPGDLF-NQLMKDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQIVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  +++    D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNATVAELKSDCCIDDVIHHEVKEIGTHILVCAVSYTTQSGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMY 219

Query: 244 SA----TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
           +     T+ +A    S +   SR   +P            YLY LK     +     ++G
Sbjct: 220 NVVELNTVKQAGEGMSTFG--SRTYLQPM-------DTRQYLYCLKPKQEFAEKAGIIKG 270

Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
             V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+T
Sbjct: 271 VTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKIT 330

Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
           N + +     ++ L   +++      ++G ++  L P     S    L L+++  G+Q +
Sbjct: 331 NCSSER--TMDLVLEMCNTNSIHWCGVSGRQLGKLNPS---SSLYLALTLLSSVQGLQSV 385

Query: 420 TGITVFDKLEKITYD 434
           +G+ + D   K TY+
Sbjct: 386 SGLRLTDTFLKRTYE 400


>gi|207079887|ref|NP_001128904.1| DKFZP459P083 protein [Pongo abelii]
 gi|55733284|emb|CAH93324.1| hypothetical protein [Pongo abelii]
          Length = 411

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 138/433 (31%), Positives = 215/433 (49%), Gaps = 55/433 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMY 213

Query: 244 SATMLKA--DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L +      S     SR   +P            YLY LK     +     ++G  
Sbjct: 214 NVTELNSVSQAGESVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 327 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 380

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 381 LRLTDTFLKRTYE 393


>gi|156120529|ref|NP_001095410.1| UPF0533 protein C5orf44 homolog [Bos taurus]
 gi|189042269|sp|A7MB76.1|CE044_BOVIN RecName: Full=UPF0533 protein C5orf44 homolog
 gi|154425662|gb|AAI51377.1| LOC511108 protein [Bos taurus]
 gi|296475854|tpg|DAA17969.1| TPA: hypothetical protein LOC511108 [Bos taurus]
          Length = 417

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 135/433 (31%), Positives = 216/433 (49%), Gaps = 49/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQSGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           +   L +     +      SR   +P            YLY LK     +     ++G  
Sbjct: 220 NVAELNSVNQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++     ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 386

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 387 LRLTDTFLKRTYE 399


>gi|441658593|ref|XP_003266042.2| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Nomascus
           leucogenys]
          Length = 412

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 138/433 (31%), Positives = 215/433 (49%), Gaps = 54/433 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMY 213

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           S T L +     +  +   SR   +P            YLY LK     +     ++G  
Sbjct: 214 SVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 327 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 381

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 382 LRLTDTFLKRTYE 394


>gi|148277004|ref|NP_001087225.1| UPF0533 protein C5orf44 isoform 3 [Homo sapiens]
 gi|119571729|gb|EAW51344.1| hypothetical protein FLJ13611, isoform CRA_b [Homo sapiens]
 gi|410217878|gb|JAA06158.1| chromosome 5 open reading frame 44 [Pan troglodytes]
          Length = 411

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 137/433 (31%), Positives = 217/433 (50%), Gaps = 55/433 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMY 213

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L +     +  +   SR   +P            YLY LK  +  +     ++G  
Sbjct: 214 NVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKNEFAEKAGIIKGVT 266

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 327 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 380

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 381 LRLTDTFLKRTYE 393


>gi|355734989|gb|AES11515.1| hypothetical protein [Mustela putorius furo]
          Length = 416

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 135/433 (31%), Positives = 216/433 (49%), Gaps = 49/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLKADGPHSDY--NAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           +   L +     +      SR   +P            YLY LK     +     ++G  
Sbjct: 220 NVAELNSVSQAGECLTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNX 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++     ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 386

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 387 LRLTDTFLKRTYE 399


>gi|417400575|gb|JAA47218.1| Hypothetical protein [Desmodus rotundus]
          Length = 417

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 135/433 (31%), Positives = 217/433 (50%), Gaps = 49/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S+  V+D+++K
Sbjct: 54  -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           +   L +     +      SR   +P            YLY LK     +     ++G  
Sbjct: 220 NVAELNSVNQAGECVTTFGSRTYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++     ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 386

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 387 LRLTDTFLKRTYE 399


>gi|148276985|ref|NP_001087228.1| UPF0533 protein C5orf44 homolog isoform 2 [Mus musculus]
 gi|123793268|sp|Q3TIR1.1|CE044_MOUSE RecName: Full=UPF0533 protein C5orf44 homolog
 gi|74198618|dbj|BAE39785.1| unnamed protein product [Mus musculus]
          Length = 417

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 136/429 (31%), Positives = 220/429 (51%), Gaps = 41/429 (9%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 53  --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++ T 
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 223

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L +     +  +   SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 224 LNSVTQAGECISTFGSRGYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN +++ 
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSERM 336

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G+ + 
Sbjct: 337 ---MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 390

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 391 DTFLKRTYE 399


>gi|395510370|ref|XP_003759450.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Sarcophilus
           harrisii]
          Length = 418

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 135/433 (31%), Positives = 218/433 (50%), Gaps = 48/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDLPGDLF-NQLMKDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQIVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  +++    D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNATVAELKSDCCIDDVIHHEVKEIGTHILVCAVSYTTQSGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMY 219

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           +   L       +  +   SR   +P            YLY LK  +  +     ++G  
Sbjct: 220 NVVELNTVKQVGEGVSTFGSRTYLQPM-------DTRQYLYCLKPKAEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +     ++ L   +++      ++G ++  L P     S    L L+++  G+Q ++G
Sbjct: 333 SSER--TMDLVLEMCNTNSIHWCGVSGRQLGKLNPS---SSLYLALTLLSSVQGLQSVSG 387

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 388 LRLTDTFLKRTYE 400


>gi|426246393|ref|XP_004016979.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Ovis aries]
          Length = 417

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 135/433 (31%), Positives = 216/433 (49%), Gaps = 49/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           +   L +     +      SR   +P            YLY LK     +     ++G  
Sbjct: 220 NVAELNSVNQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++     ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 386

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 387 LRLTDTFLKRTYE 399


>gi|148276983|ref|NP_080155.3| UPF0533 protein C5orf44 homolog isoform 1 [Mus musculus]
 gi|112180396|gb|AAH21756.3| 2410002O22Rik protein [Mus musculus]
 gi|148686556|gb|EDL18503.1| RIKEN cDNA 2410002O22, isoform CRA_b [Mus musculus]
          Length = 418

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 136/429 (31%), Positives = 219/429 (51%), Gaps = 40/429 (9%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 53  --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++ T 
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 223

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L +     +  +   SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 224 LNSVTQAGECISTFGSRGYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 336

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G+ + 
Sbjct: 337 M--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 391

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 392 DTFLKRTYE 400


>gi|344179108|ref|NP_001230666.1| UPF0533 protein C5orf44 isoform 4 [Homo sapiens]
 gi|397514417|ref|XP_003827484.1| PREDICTED: UPF0533 protein C5orf44 homolog [Pan paniscus]
 gi|410039323|ref|XP_001163636.3| PREDICTED: UPF0533 protein C5orf44 homolog isoform 3 [Pan
           troglodytes]
 gi|119571730|gb|EAW51345.1| hypothetical protein FLJ13611, isoform CRA_c [Homo sapiens]
          Length = 412

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 137/433 (31%), Positives = 216/433 (49%), Gaps = 54/433 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMY 213

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L +     +  +   SR   +P            YLY LK  +  +     ++G  
Sbjct: 214 NVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKNEFAEKAGIIKGVT 266

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 327 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 381

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 382 LRLTDTFLKRTYE 394


>gi|148745378|gb|AAI42995.1| C5orf44 protein [Homo sapiens]
          Length = 412

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 137/433 (31%), Positives = 216/433 (49%), Gaps = 54/433 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMY 213

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L +     +  +   SR   +P            YLY LK  +  +     ++G  
Sbjct: 214 NVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKNEFAEKAGIIKGVT 266

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 327 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 381

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 382 LRLTDTFLKRTYE 394


>gi|403267435|ref|XP_003925838.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Saimiri
           boliviensis boliviensis]
          Length = 412

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 137/433 (31%), Positives = 216/433 (49%), Gaps = 54/433 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMY 213

Query: 244 SATMLKADGPHSDYNA--QSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L +     +  +  +SR   +P            YLY LK     +     ++G  
Sbjct: 214 NVTELNSVSQAGECVSTFRSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +     ++ L   +++      I+G ++  L P  +       L LI++  G+Q ++G
Sbjct: 327 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLISSVQGLQSVSG 381

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 382 LRLTDTFLKRTYE 394


>gi|296194459|ref|XP_002744954.1| PREDICTED: UPF0533 protein C5orf44 isoform 2 [Callithrix jacchus]
          Length = 412

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 137/433 (31%), Positives = 216/433 (49%), Gaps = 54/433 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMY 213

Query: 244 SATMLKADGPHSDYNA--QSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L +     +  +  +SR   +P            YLY LK     +     ++G  
Sbjct: 214 NVTELNSVSQAGECVSTFRSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 327 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 381

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 382 LRLTDTFLKRTYE 394


>gi|348041260|ref|NP_001013930.2| UPF0533 protein C5orf44 homolog [Rattus norvegicus]
 gi|190360171|sp|Q5M887.2|CE044_RAT RecName: Full=UPF0533 protein C5orf44 homolog
 gi|149059250|gb|EDM10257.1| similar to RIKEN cDNA 2410002O22 gene, isoform CRA_a [Rattus
           norvegicus]
          Length = 418

 Score =  201 bits (512), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 136/429 (31%), Positives = 218/429 (50%), Gaps = 40/429 (9%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 53  --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++ T 
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 223

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L +     +  +   SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 224 LNSVNQAGECVSTFGSRGYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 336

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   ++       I+G ++  L P  +       L L+++  G+Q ++G+ + 
Sbjct: 337 T--MDLVLEMCNTTSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 391

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 392 DTFLKRTYE 400


>gi|449278704|gb|EMC86495.1| UPF0533 protein C5orf44 like protein [Columba livia]
          Length = 410

 Score =  201 bits (512), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 137/431 (31%), Positives = 218/431 (50%), Gaps = 48/431 (11%)

Query: 10  LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
           L F VMRL +P+L    P+  +  DL             +    L+  D +T K      
Sbjct: 5   LIFAVMRLTKPTLFTNIPVTCEERDL-----------PGNLFTQLMKDDPSTVKG----- 48

Query: 70  RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
                    A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT 
Sbjct: 49  ---------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTS 99

Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFK 189
            QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFK
Sbjct: 100 SQR-LNLSASSAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKFFK 158

Query: 190 FIVSNPLSVRTKVRVVKV--GATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           F V  PL V+TK    +V     +  E+ FLEA I+N T S ++M++V  EPS  ++   
Sbjct: 159 FQVLKPLDVKTKFYNAEVSESCVYLDEV-FLEAQIQNITTSPMFMEKVSLEPSMMYNVAE 217

Query: 248 LKA--DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L        S+    SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 218 LNTVDTAGESESTFGSRTYLQPM-------DTRQYLYCLKPKQEFAEKAGVIKGVTVIGK 270

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN +++ 
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFDITCKITNCSER- 329

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFH--LNLIATKLGVQRITGIT 423
               ++ L   +++      ++G ++  L P     S+  H  L L+++  G+Q ++G+ 
Sbjct: 330 --TMDLVLEMCNTNSIHWCGVSGRQLGKLHP-----SSSLHLALTLLSSVQGLQSVSGLR 382

Query: 424 VFDKLEKITYD 434
           + D   K TY+
Sbjct: 383 LTDTFLKRTYE 393


>gi|345794146|ref|XP_535257.3| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Canis lupus
           familiaris]
          Length = 418

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 135/433 (31%), Positives = 215/433 (49%), Gaps = 48/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           +   L +     +      SR   +P            YLY LK     +     ++G  
Sbjct: 220 NVAELNSVSQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +     ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G
Sbjct: 333 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 387

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 388 LRLTDTFLKRTYE 400


>gi|109706942|gb|AAI17129.1| C5orf44 protein [Homo sapiens]
 gi|219520363|gb|AAI43694.1| C5orf44 protein [Homo sapiens]
          Length = 400

 Score =  200 bits (509), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 136/431 (31%), Positives = 216/431 (50%), Gaps = 55/431 (12%)

Query: 10  LAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
           LA +VMRL +P+L    P+  +    P DLF  + + DDP                    
Sbjct: 1   LALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV----------------- 42

Query: 66  DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
                      + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA+
Sbjct: 43  -----------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKAD 91

Query: 126 IQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP 185
           +QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  
Sbjct: 92  LQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFR 150

Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
           +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++ 
Sbjct: 151 KFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMYNV 204

Query: 246 TMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
           T L +     +  +   SR   +P            YLY LK  +  +     ++G  V+
Sbjct: 205 TELNSVSQAGECVSTFGSRAYLQPM-------DTRQYLYCLKPKNEFAEKAGIIKGVTVI 257

Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
           GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN ++
Sbjct: 258 GKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSE 317

Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
           +     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G+ 
Sbjct: 318 R---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLR 371

Query: 424 VFDKLEKITYD 434
           + D   K TY+
Sbjct: 372 LTDTFLKRTYE 382


>gi|347922196|ref|NP_001231675.1| uncharacterized protein LOC100513053 [Sus scrofa]
          Length = 417

 Score =  200 bits (509), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 137/434 (31%), Positives = 216/434 (49%), Gaps = 51/434 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLKA---DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS 300
           +   L +   DG        SR   +P            YLY LK     +     ++G 
Sbjct: 220 NVAELNSVNQDG-ECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGV 271

Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
            V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN
Sbjct: 272 TVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFHITCKITN 331

Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT 420
            +++     ++ L   +++      I+G ++  L P  +       L L+++  G Q ++
Sbjct: 332 CSER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGPQSVS 385

Query: 421 GITVFDKLEKITYD 434
           G+ + D   K TY+
Sbjct: 386 GLRLTDTFLKRTYE 399


>gi|303304975|ref|NP_001006577.2| uncharacterized protein LOC427165 isoform 2 [Gallus gallus]
          Length = 411

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 136/429 (31%), Positives = 217/429 (50%), Gaps = 47/429 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    ++F+          L+  D +T K    
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDL--PGNLFNQ---------LMKDDPSTVKG--- 55

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                      A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 56  -----------AEALMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASTAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++   
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVAE 217

Query: 248 LKA--DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L        S+    SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 218 LNTVDSAGESESTFGSRTYLQP-------MDTRQYLYCLKPKQEFAEKAGVIKGVTVIGK 270

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN +++ 
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFDITCKITNCSER- 329

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   +++      ++G ++  L P     S    L L+++  G+Q ++G+ + 
Sbjct: 330 --TMDLVLEMCNTNSIHWCGVSGRQLGKLHPS---SSLRLALTLLSSVQGLQSVSGLRLT 384

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 385 DTFLKRTYE 393


>gi|432884723|ref|XP_004074558.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Oryzias
           latipes]
          Length = 411

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 143/438 (32%), Positives = 225/438 (51%), Gaps = 51/438 (11%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           ++ T   H LA +VMRL +P+L    P+  +  DL    D+F           L+  D +
Sbjct: 3   VNQTKQEHLLALKVMRLTKPTLFTNLPVTCEERDL--PGDLFGQ---------LMRQDPS 51

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
           T K               A+++ L  +L LPQ FG I+LGETF SYIS++N ST  V+++
Sbjct: 52  TIKG--------------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSTQIVKEI 97

Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
           ++KA++QT  QR L L TS S V  ++     D ++ H+VKE+G H LVC   Y+   GE
Sbjct: 98  LVKADLQTSSQR-LNLSTSNSAVAELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQLGE 156

Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
           + Y  +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EP+
Sbjct: 157 KLYFRKFFKFQVLKPLDVKTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPT 210

Query: 241 QNWSATMLK----ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
             ++ T L      D   S +   S   +  P+  R       YLY LK  +  +     
Sbjct: 211 IMYNVTELNTVASGDDGESTFGKMS---YLQPMDTR------QYLYCLKPKAEYAEKAGV 261

Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
           ++G  ++GKL I WRTNLGE GRLQT Q+        +I L++  +P  V +++PF +  
Sbjct: 262 IKGVTMIGKLDIVWRTNLGEKGRLQTSQLQRMAPGYGDIRLSLEIIPDTVNLEEPFDIVC 321

Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
           K+TN +++     ++ +   ++       I+G ++  L+P    GS    L + ++  G+
Sbjct: 322 KITNCSER---TMDLVVEMCNTRSIHWCGISGRQLGKLSP---GGSLLVPLTIFSSVQGL 375

Query: 417 QRITGITVFDKLEKITYD 434
           Q I+G+ + D   K TY+
Sbjct: 376 QSISGLRLTDTFLKRTYE 393


>gi|198423527|ref|XP_002129801.1| PREDICTED: similar to UPF0533 protein isoform 2 [Ciona
           intestinalis]
          Length = 396

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 137/429 (31%), Positives = 219/429 (51%), Gaps = 52/429 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA RVMRL +PS+    P+  D +D+                            S +L
Sbjct: 7   HPLALRVMRLTKPSIITSVPVLNDKSDVL---------------------------SLNL 39

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
            Y S+ L      S G    L+LP +FG I+LGETF SY+S+NN S  +V +V + A++Q
Sbjct: 40  GYSSKNL-----TSYGTGETLILPHSFGNIFLGETFVSYLSVNNESGTDVLNVSLMADLQ 94

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QRI L  ++K+P ES++ G   D ++ H+VKELG H LVCT  YS  +GE K   +F
Sbjct: 95  TGSQRITL--SNKTPKESLKPGNSLDEVINHEVKELGTHILVCTVSYSRRDGEPKNFRKF 152

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQ-EITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
           FKF V  PL V+TK   ++      Q +  +LE  I+N T + + M++V  +P+  ++A 
Sbjct: 153 FKFQVLKPLDVKTKFYNIESYLLTLQCDQVYLETQIQNITPNPICMEKVNLDPAALYTAQ 212

Query: 247 MLKA-DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
            L      H +++ QS    KP         +  YLY LK L    +     + + V+GK
Sbjct: 213 SLNTISSNHGEFSCQS--YMKP-------SEVRQYLYWLK-LKPSCAKKAFTEAAGVIGK 262

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+++LGE GRLQT Q+    +  ++I + V +VP  + + +PF +  K+TN ++  
Sbjct: 263 LDIVWKSSLGERGRLQTSQLQRAILGQRDILVQVNQVPENLKVLQPFEISCKVTNYSEHA 322

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
           +     + ++ +      ++   +    L  + A  S    ++L+ T +G+Q ++G+ V 
Sbjct: 323 KQLMVQYENRTN------LLWQNVSGYTLNKLPAKESCFITMSLLPTSVGIQSVSGMKVI 376

Query: 426 DKLEKITYD 434
           D     TYD
Sbjct: 377 DMELNRTYD 385


>gi|338718819|ref|XP_003363894.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Equus
           caballus]
          Length = 418

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 135/433 (31%), Positives = 213/433 (49%), Gaps = 48/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +       +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLNSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           +   L +     +      SR   +P            YLY LK     +     ++G  
Sbjct: 220 NVAELNSVNQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +     ++ L   ++       I+G ++  L P  +       L L+++  G+Q ++G
Sbjct: 333 SSERT--MDLVLEMCNTSSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 387

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 388 LRLTDTFLKRTYE 400


>gi|449514345|ref|XP_002190091.2| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Taeniopygia
           guttata]
          Length = 411

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 136/429 (31%), Positives = 218/429 (50%), Gaps = 46/429 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    ++F+          L+  D +T K    
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDL--PGNLFNQ---------LMKDDPSTVKG--- 55

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                      A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 56  -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASSAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++   
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPSMMYNVAE 217

Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
           L       D   +S   F     ++       YLY LK     +     ++G  V+GKL 
Sbjct: 218 LNT----VDTAGESESTFGTRTYLQP-MDTRQYLYCLKPKQEFAEKAGVIKGVTVIGKLD 272

Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
           I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + +   
Sbjct: 273 IVWKTNLGEHGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFDITCKITNCSSER-- 330

Query: 368 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFH--LNLIATKLGVQRITGITVF 425
             ++ L   +++      ++G ++  L P     S+  H  L L+++  G+Q ++G+ + 
Sbjct: 331 TMDLVLEMCNTNSIHWCGVSGRQLGKLYP-----SSSLHLALTLLSSVQGLQSVSGLRLT 385

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 386 DTFLKRTYE 394


>gi|395825392|ref|XP_003785919.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Otolemur
           garnettii]
          Length = 412

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 137/433 (31%), Positives = 215/433 (49%), Gaps = 54/433 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMY 213

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L +     +  +   SR   +P            YLY LK     +     ++G  
Sbjct: 214 NVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 327 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLNPSSSLC---LALTLLSSVQGLQSISG 381

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 382 LRLTDTFLKRTYE 394


>gi|219517954|gb|AAI43692.1| C5orf44 protein [Homo sapiens]
          Length = 401

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 136/431 (31%), Positives = 215/431 (49%), Gaps = 54/431 (12%)

Query: 10  LAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
           LA +VMRL +P+L    P+  +    P DLF  + + DDP                    
Sbjct: 1   LALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV----------------- 42

Query: 66  DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
                      + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA+
Sbjct: 43  -----------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKAD 91

Query: 126 IQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP 185
           +QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  
Sbjct: 92  LQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFR 150

Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
           +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++ 
Sbjct: 151 KFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMYNV 204

Query: 246 TMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
           T L +     +  +   SR   +P            YLY LK  +  +     ++G  V+
Sbjct: 205 TELNSVSQAGECVSTFGSRAYLQPM-------DTRQYLYCLKPKNEFAEKAGIIKGVTVI 257

Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
           GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + 
Sbjct: 258 GKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSS 317

Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
           +     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G+ 
Sbjct: 318 ER--TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLR 372

Query: 424 VFDKLEKITYD 434
           + D   K TY+
Sbjct: 373 LTDTFLKRTYE 383


>gi|334325204|ref|XP_003340619.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Monodelphis
           domestica]
          Length = 412

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 137/435 (31%), Positives = 218/435 (50%), Gaps = 58/435 (13%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEDRDLPGDLF-NQLMKDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQIVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  +++    D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNATVAELKSDCCIDDVIHHEVKEIGTHILVCAVSYTTQSGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPSMMY 213

Query: 244 SA----TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
           +     T+ +A    S +   SR   +P            YLY LK     +     ++G
Sbjct: 214 NVVELNTVKQAGEGMSTFG--SRTYLQP-------MDTRQYLYCLKPKQEFAEKAGIIKG 264

Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
             V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+T
Sbjct: 265 VTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKIT 324

Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
           N + +     ++ L   +++      ++G ++  L P     S    L L+++  G+Q +
Sbjct: 325 NCSSER--TMDLVLEMCNTNSIHWCGVSGRQLGKLNPS---SSLYLALTLLSSVQGLQSV 379

Query: 420 TGITVFDKLEKITYD 434
           +G+ + D   K TY+
Sbjct: 380 SGLRLTDTFLKRTYE 394


>gi|328771369|gb|EGF81409.1| hypothetical protein BATDEDRAFT_34721 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 484

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 158/498 (31%), Positives = 229/498 (45%), Gaps = 103/498 (20%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL  PS     PL  D TDL +         AA  +  L  SD +     D+
Sbjct: 8   HLLALKVMRLSHPSYAQTHPLYTD-TDLALP--------AAEVVQSLKHSDSSMQVDDDM 58

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                      A   GL  LL LP AFG IYLGETF SY+ +NN S   V ++  KAE+Q
Sbjct: 59  Y----------AGIAGLGSLLTLPPAFGNIYLGETFSSYLCVNNESLTPVLNLTFKAELQ 108

Query: 128 TDKQRILLLDT--------------------------------------SKSPVESIRAG 149
           T  QRI L DT                                       +S   S+  G
Sbjct: 109 TSTQRITLADTLLSSASSSASSSTGVDRLALGSISGSYSTLHGSGPAENRQSLASSLLPG 168

Query: 150 GRYDFIVEHDVKELGAHTLVCTALY----------SDGEGERKYLPQFFKFIVSNPLSVR 199
              +F++ HD+KELG H LVC+  Y          S  + ERK+  +F+KF V NPLSV+
Sbjct: 169 QSAEFVIHHDIKELGIHILVCSVHYTPAPVIGSSASSMDRERKFFRKFYKFQVLNPLSVK 228

Query: 200 TKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS-----------QNWSATML 248
           TKV  ++ G        FLEA ++N + S +Y++ + FEP+           ++ S ++ 
Sbjct: 229 TKVNTLQDGRI------FLEAQVQNVSSSFMYLEYMNFEPNDPFLVQDLNLFRDSSVSLT 282

Query: 249 KADG------PHSDYNAQSRE------IFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
                       ++ + QS +      +FK   L+        YLY   ML+  S + V 
Sbjct: 283 SGQNDIVSTKSETETDVQSSQTSKGLSVFKERDLL-GQQDTRQYLY---MLTPKSINDVA 338

Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
            +    LGKL I+WRT LG+ GRLQT Q+    ++    E+ VVE P ++ +++PF++K+
Sbjct: 339 TRMLPGLGKLDISWRTVLGQSGRLQTSQLSRKILSVNPFEVFVVEQPRIIRVEQPFVVKI 398

Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
           ++TN    E+    I   +N       V++ G   + L  +E   S D  L   A  +G+
Sbjct: 399 RITNHVPSERLKLSIHGYKNKMTN---VLLRGPNNIELNELEGASSVDVDLEFFALAIGL 455

Query: 417 QRITGITVFDKLEKITYD 434
           Q+ITGI V DK+   T D
Sbjct: 456 QKITGIQVSDKVSGTTRD 473


>gi|198423525|ref|XP_002129762.1| PREDICTED: similar to UPF0533 protein isoform 1 [Ciona
           intestinalis]
          Length = 389

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 136/428 (31%), Positives = 218/428 (50%), Gaps = 57/428 (13%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA RVMRL +PS+    P+  D +D+                            S +L
Sbjct: 7   HPLALRVMRLTKPSIITSVPVLNDKSDVL---------------------------SLNL 39

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
            Y S+ L      S G    L+LP +FG I+LGETF SY+S+NN S  +V +V + A++Q
Sbjct: 40  GYSSKNL-----TSYGTGETLILPHSFGNIFLGETFVSYLSVNNESGTDVLNVSLMADLQ 94

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QRI L  ++K+P ES++ G   D ++ H+VKELG H LVCT  YS  +GE K   +F
Sbjct: 95  TGSQRITL--SNKTPKESLKPGNSLDEVINHEVKELGTHILVCTVSYSRRDGEPKNFRKF 152

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK   ++       +  +LE  I+N T + + M++V  +P+  ++A  
Sbjct: 153 FKFQVLKPLDVKTKFYNIEC------DQVYLETQIQNITPNPICMEKVNLDPAALYTAQS 206

Query: 248 LKA-DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 306
           L      H +++ QS    KP         +  YLY LK L    +     + + V+GKL
Sbjct: 207 LNTISSNHGEFSCQS--YMKP-------SEVRQYLYWLK-LKPSCAKKAFTEAAGVIGKL 256

Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 366
            I W+++LGE GRLQT Q+    +  ++I + V +VP  + + +PF +  K+TN ++  +
Sbjct: 257 DIVWKSSLGERGRLQTSQLQRAILGQRDILVQVNQVPENLKVLQPFEISCKVTNYSEHAK 316

Query: 367 GPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFD 426
                + ++ +      ++   +    L  + A  S    ++L+ T +G+Q ++G+ V D
Sbjct: 317 QLMVQYENRTN------LLWQNVSGYTLNKLPAKESCFITMSLLPTSVGIQSVSGMKVID 370

Query: 427 KLEKITYD 434
                TYD
Sbjct: 371 MELNRTYD 378


>gi|387019765|gb|AFJ52000.1| UPF0533 protein C5orf44-like protein [Crotalus adamanteus]
          Length = 413

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 137/429 (31%), Positives = 217/429 (50%), Gaps = 40/429 (9%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T K    
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDL--PGDLFNQ---------LMKDDPSTVKG--- 55

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                      A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 56  -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSQQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASSAAVAELKQDCCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++   
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITSSPMFMEKVSLEPSIMYNVAE 223

Query: 248 LKA--DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L     G  S     +R   +P            YLY LK     S     ++G  V+GK
Sbjct: 224 LNTINQGRDSVSTFGTRTYLQPM-------DTRQYLYCLKPKQEFSEKVGVIKGVTVIGK 276

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLETIPDTVSLEEPFNITCKITNCSSER 336

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   +++      ++G ++  L P  +   T   L+ +    G+Q ++G+ + 
Sbjct: 337 --TMDLVLEMCNTNSIHWCGVSGRQLGKLHPTSSLYLTLTLLSSVQ---GLQSVSGLRLT 391

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 392 DTFLKRTYE 400


>gi|395510368|ref|XP_003759449.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Sarcophilus
           harrisii]
          Length = 412

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 135/429 (31%), Positives = 218/429 (50%), Gaps = 46/429 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDL--PGDLFNQ---------LMKDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 53  --------VNGAETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQIVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  +++    D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNATVAELKSDCCIDDVIHHEVKEIGTHILVCAVSYTTQSGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++   
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPSMMYNVVE 217

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L       +  +   SR   +P            YLY LK  +  +     ++G  V+GK
Sbjct: 218 LNTVKQVGEGVSTFGSRTYLQPM-------DTRQYLYCLKPKAEFAEKAGIIKGVTVIGK 270

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 330

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   +++      ++G ++  L P     S    L L+++  G+Q ++G+ + 
Sbjct: 331 --TMDLVLEMCNTNSIHWCGVSGRQLGKLNPS---SSLYLALTLLSSVQGLQSVSGLRLT 385

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 386 DTFLKRTYE 394


>gi|148686557|gb|EDL18504.1| RIKEN cDNA 2410002O22, isoform CRA_c [Mus musculus]
          Length = 426

 Score =  198 bits (503), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 136/429 (31%), Positives = 217/429 (50%), Gaps = 46/429 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 24  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPST------ 66

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 67  --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 118

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 119 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 177

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++ T 
Sbjct: 178 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 231

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L +     +  +   SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 232 LNSVTQAGECISTFGSRGYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 284

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 285 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 344

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G+ + 
Sbjct: 345 M--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 399

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 400 DTFLKRTYE 408


>gi|74207988|dbj|BAE29111.1| unnamed protein product [Mus musculus]
          Length = 412

 Score =  197 bits (502), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 136/429 (31%), Positives = 217/429 (50%), Gaps = 46/429 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEKKDL--PGDLFNQ---------LMKDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 53  --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++ T 
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 217

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L +     +  +   SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 218 LNSVTQAGECISTFGSRGYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 270

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 330

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G+ + 
Sbjct: 331 M--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 385

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 386 DTFLKRTYE 394


>gi|148276987|ref|NP_001087229.1| UPF0533 protein C5orf44 homolog isoform 3 [Mus musculus]
 gi|74194542|dbj|BAE37309.1| unnamed protein product [Mus musculus]
          Length = 412

 Score =  197 bits (502), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 136/429 (31%), Positives = 217/429 (50%), Gaps = 46/429 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 53  --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++ T 
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 217

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L +     +  +   SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 218 LNSVTQAGECISTFGSRGYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 270

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 330

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G+ + 
Sbjct: 331 M--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 385

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 386 DTFLKRTYE 394


>gi|260792744|ref|XP_002591374.1| hypothetical protein BRAFLDRAFT_282065 [Branchiostoma floridae]
 gi|229276579|gb|EEN47385.1| hypothetical protein BRAFLDRAFT_282065 [Branchiostoma floridae]
          Length = 410

 Score =  197 bits (501), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 136/440 (30%), Positives = 217/440 (49%), Gaps = 46/440 (10%)

Query: 10  LAFRVMRLCRPS-LHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLT 68
           LA +VMRL RP+ LHV P +  D  DL             S    ++ SD+ ++      
Sbjct: 11  LALKVMRLTRPTFLHVTP-ITCDDRDL-----------PGSTFSQVVRSDMASSAG---- 54

Query: 69  YRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQT 128
                      +   +  LL LPQ FG I+LGETF  Y+ ++N ST  V+D+++KA++QT
Sbjct: 55  ----------LEEFAMGELLTLPQNFGNIFLGETFSCYVCVHNDSTQLVKDIMVKADLQT 104

Query: 129 DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFF 188
             QR+ L   S  P+  +   G  D ++ H+VKELG H LVC   Y+    E+ Y  +FF
Sbjct: 105 SSQRLTLSGGSSPPIPELGPEGSIDEVIHHEVKELGTHILVCAVSYTTQSSEKMYFRKFF 164

Query: 189 KFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
           KF V  PL V+TK    +       +  +LEA ++N T + + M++V  EPS ++S + L
Sbjct: 165 KFQVLKPLDVKTKFYNAE------SDEVYLEAQVQNITAAPMVMEKVSLEPSASYSVSEL 218

Query: 249 KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQI 308
             +            IF   V +     I  YLY LK  +   +    ++G   +GKL I
Sbjct: 219 NTE------EKAGMSIFGTSVYLNP-KDIRQYLYCLKPKAEVGAPRGVLKGVTNIGKLDI 271

Query: 309 TWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGP 368
            W+TN+GE GRLQT  +        +I L V ++P  V ++KPF  K ++TN  ++    
Sbjct: 272 IWKTNMGEKGRLQTSPLQRMAPGYGDIRLTVEQIPDGVPMEKPFNFKCRVTNCCERTMD- 330

Query: 369 FEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKL 428
             + L  + +       ++G ++  L P       + +L L+A+  G+Q I+G+ + D  
Sbjct: 331 LLLLLQNSGTSGLYWCGVSGKQLGKLGPNTHM---ELNLTLLASVPGLQSISGLRLTDTY 387

Query: 429 EKITY--DSLPDLEIFVDQD 446
            K TY  D +  + ++ DQ+
Sbjct: 388 LKRTYEHDDIAQVFVYSDQE 407


>gi|349732100|ref|NP_001016427.2| UPF0533 protein C5orf44 homolog isoform 2 [Xenopus (Silurana)
           tropicalis]
          Length = 411

 Score =  197 bits (501), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 137/430 (31%), Positives = 220/430 (51%), Gaps = 49/430 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F           L+  D +T K    
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDL--PGDLFS---------TLMKDDPSTVKG--- 55

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                      A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+ +KA++Q
Sbjct: 56  -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDIQVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASTAVVSELKPDSCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++ + 
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVSE 217

Query: 248 LKADGPHSD-YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 306
           L     + D  +    + +  P+  R       YLY LK     +     ++G  V+GKL
Sbjct: 218 LNTVITNGDGCSTFGTKTYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIGKL 271

Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 366
            I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN +++  
Sbjct: 272 DIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSIETIPDTVRLEEPFDITCKITNCSER-- 329

Query: 367 GPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLN--LIATKLGVQRITGITV 424
              ++ L   +++      ++G ++  L P     S+  HL   L+++  G+Q ++G+ +
Sbjct: 330 -TMDLVLEMCNTNAIHWCGVSGRQLGKLHP-----SSSLHLTLALLSSVQGLQSVSGLRL 383

Query: 425 FDKLEKITYD 434
            D   K TY+
Sbjct: 384 TDTFLKRTYE 393


>gi|431907788|gb|ELK11395.1| hypothetical protein PAL_GLEAN10024843 [Pteropus alecto]
          Length = 411

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 135/433 (31%), Positives = 214/433 (49%), Gaps = 55/433 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMY 213

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           +   L +     +      +R   +P            YLY LK     +     ++G  
Sbjct: 214 NVAELNSVNQAGECVTTFGTRTYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++     ++ L   +++      I+G ++  L P  +       L L+++  G+Q I+G
Sbjct: 327 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 380

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 381 LRLTDTFLKRTYE 393


>gi|426246395|ref|XP_004016980.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Ovis aries]
          Length = 412

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 135/429 (31%), Positives = 215/429 (50%), Gaps = 46/429 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 53  --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++   
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVAE 217

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L +     +      SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 218 LNSVNQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 270

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 330

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G+ + 
Sbjct: 331 T--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 385

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 386 DTFLKRTYE 394


>gi|355691351|gb|EHH26536.1| hypothetical protein EGK_16539 [Macaca mulatta]
 gi|355749957|gb|EHH54295.1| hypothetical protein EGM_15103 [Macaca fascicularis]
          Length = 418

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 137/429 (31%), Positives = 215/429 (50%), Gaps = 40/429 (9%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 53  --------VNGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +V      +  FLEA I+N T S ++M++V  EPS  ++ T 
Sbjct: 164 FKFQVLKPLDVKTKFYNAEVSVECLTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMYNVTE 223

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L +     +  +   SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 224 LNSVTQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 336

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
                + +   +S     +   G+    L  +    S    L L+++  G+Q I+G+ + 
Sbjct: 337 TMDLVLEMCNTNS-----IHWCGISGRQLGKLHPSSSLSLALTLLSSVQGLQSISGLRLT 391

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 392 DTFLKRTYE 400


>gi|359319029|ref|XP_003638975.1| PREDICTED: UPF0533 protein C5orf44 homolog [Canis lupus familiaris]
 gi|410948699|ref|XP_003981068.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Felis catus]
          Length = 412

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 135/429 (31%), Positives = 215/429 (50%), Gaps = 46/429 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 53  --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++   
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVAE 217

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L +     +      SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 218 LNSVSQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 270

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 330

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G+ + 
Sbjct: 331 T--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 385

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 386 DTFLKRTYE 394


>gi|443711431|gb|ELU05219.1| hypothetical protein CAPTEDRAFT_211630 [Capitella teleta]
          Length = 423

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 138/443 (31%), Positives = 219/443 (49%), Gaps = 39/443 (8%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           M S    H L  +VMRL +P+L +  PL   PT   +  D    P+  +           
Sbjct: 1   MESKEKEHLLVLKVMRLTKPALMISKPLSCIPTHRTV--DDHGQPVKVA----------- 47

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
               +DL       + +  +   LS LL LPQ FG I+LGETF SYIS++N+S+   RD+
Sbjct: 48  ----TDLA------IAEGLEHFALSQLLTLPQNFGNIFLGETFSSYISVHNNSSHVCRDI 97

Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
            IKA++QT  QR+ L  +  +PV+ +      D +++H+VKELG H LVC   Y    GE
Sbjct: 98  QIKADLQTSSQRLTLSSSHANPVQQLTPSESIDDVIQHEVKELGTHILVCAVTYVSNTGE 157

Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
           + Y  +FFKF V  PL V+TK    +       +  +LEA I+N T   +++++V  +PS
Sbjct: 158 KMYFRKFFKFQVLKPLDVKTKFYNAE------SDEVYLEAQIQNITPGPIFLEKVLLDPS 211

Query: 241 QNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS 300
            ++S   L     H+  +  +R +F   V   S   +  YLY L       + P  ++G 
Sbjct: 212 SHYSGIQL-----HTQEDPVNRPVFG-KVNCVSPLDVRQYLYCLTPKPEVLADPKFMKGV 265

Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
             +GKL I W+TN+ E GRLQT  +        +I L V ++   V ++  F +++++TN
Sbjct: 266 TNIGKLDIVWKTNMAEKGRLQTSALQRVLPGYGDIRLMVEKISESVPVETKFNIEIRVTN 325

Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT 420
            +++      + L  N          +G++I  L    +  ST   L LI T  G+Q I+
Sbjct: 326 CSERTMD-LSVHLDNNIQIGLLWSCCSGIQIGRLT---SGSSTLLKLALIPTACGLQTIS 381

Query: 421 GITVFDKLEKITYDSLPDLEIFV 443
           G+ + D   K TY+     +++V
Sbjct: 382 GLRLTDTFLKRTYEHDEVAQVYV 404


>gi|349732102|ref|NP_001231833.1| UPF0533 protein C5orf44 homolog isoform 1 [Xenopus (Silurana)
           tropicalis]
 gi|123912021|sp|Q0VFT9.1|CE044_XENTR RecName: Full=UPF0533 protein C5orf44 homolog
 gi|110645327|gb|AAI18703.1| LOC549181 protein [Xenopus (Silurana) tropicalis]
          Length = 412

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 137/430 (31%), Positives = 219/430 (50%), Gaps = 48/430 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F           L+  D +T K    
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDL--PGDLFS---------TLMKDDPSTVKG--- 55

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                      A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+ +KA++Q
Sbjct: 56  -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDIQVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASTAVVSELKPDSCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++ + 
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVSE 217

Query: 248 LKADGPHSD-YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 306
           L     + D  +    + +  P+  R       YLY LK     +     ++G  V+GKL
Sbjct: 218 LNTVITNGDGCSTFGTKTYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIGKL 271

Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 366
            I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + +  
Sbjct: 272 DIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSIETIPDTVRLEEPFDITCKITNCSSERT 331

Query: 367 GPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLN--LIATKLGVQRITGITV 424
              ++ L   +++      ++G ++  L P     S+  HL   L+++  G+Q ++G+ +
Sbjct: 332 --MDLVLEMCNTNAIHWCGVSGRQLGKLHP-----SSSLHLTLALLSSVQGLQSVSGLRL 384

Query: 425 FDKLEKITYD 434
            D   K TY+
Sbjct: 385 TDTFLKRTYE 394


>gi|194223840|ref|XP_001492631.2| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Equus
           caballus]
          Length = 412

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 135/433 (31%), Positives = 212/433 (48%), Gaps = 54/433 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMY 213

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           +   L +     +      SR   +P            YLY LK     +     ++G  
Sbjct: 214 NVAELNSVNQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +     ++ L   ++       I+G ++  L P  +       L L+++  G+Q ++G
Sbjct: 327 SSERT--MDLVLEMCNTSSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 381

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 382 LRLTDTFLKRTYE 394


>gi|37589695|gb|AAH59537.1| Zgc:73187 [Danio rerio]
 gi|47937881|gb|AAH71349.1| Zgc:73187 [Danio rerio]
          Length = 385

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 123/358 (34%), Positives = 195/358 (54%), Gaps = 21/358 (5%)

Query: 79  ADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDT 138
           A+++ L  +L LPQ FG I+LGETF SYIS++N S+  V+D+++KA++QT  QR L L  
Sbjct: 29  AETLILGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVKADLQTSSQR-LNLSA 87

Query: 139 SKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSV 198
           S S V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V
Sbjct: 88  SNSAVSELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKLYFRKFFKFQVLKPLDV 147

Query: 199 RTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK--ADGPHSD 256
           +TK    +          FLEA I+N T S ++M++V  EPS  ++ T L   A G  S 
Sbjct: 148 KTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPSMMYNVTELNNVASGDESS 201

Query: 257 YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGE 316
            +   +  +  P+  R       YLY LK     +     ++G  V+GKL I W+TNLGE
Sbjct: 202 ESTFGKMSYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIGKLDIVWKTNLGE 255

Query: 317 PGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQN 376
            GRLQT Q+        ++ L++  +P  V +++PF +  K+TN +++     ++ L   
Sbjct: 256 RGRLQTSQLQRMAPGYGDVRLSLEFIPDTVDLEEPFDITCKITNCSERT---MDLLLEMC 312

Query: 377 DSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
           ++       ++G ++  L+P     S    L L+++  G+Q I+G+ + D   K TY+
Sbjct: 313 NTRSVHWCGVSGRQLGKLSPS---ASLSIPLKLLSSVQGLQSISGLRLTDTFLKRTYE 367


>gi|327263135|ref|XP_003216376.1| PREDICTED: UPF0533 protein C5orf44 homolog [Anolis carolinensis]
          Length = 417

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 135/429 (31%), Positives = 217/429 (50%), Gaps = 40/429 (9%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T K    
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDL--PGDLFNQ---------LMKDDPSTVKG--- 55

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                      A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 56  -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSHQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASSAAVAELKQDCCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++   
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVVE 223

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L       D  +   +R   +P            YLY LK     +     ++G  V+GK
Sbjct: 224 LNTVSHTEDSISTFGTRTYLQP-------MDTRQYLYCLKPKQEFAEKAGVIKGVTVIGK 276

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFDITCKITNCSSER 336

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ L   +++      ++G ++  L P  +   T   L+ +    G+Q ++G+ + 
Sbjct: 337 --TMDLVLEMCNTNSIHWCGVSGRQLGKLHPTSSLHLTLTLLSSVQ---GLQSVSGLRLT 391

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 392 DTFLKRTYE 400


>gi|320168756|gb|EFW45655.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 439

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 137/428 (32%), Positives = 219/428 (51%), Gaps = 49/428 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H L  +VMRL +P+L +  P+  +P+D            A S L  + ++DV+T    +L
Sbjct: 9   HYLVLKVMRLSKPTLVIGQPIVSEPSDF-----------AGSVLQEVQTADVSTAGQPEL 57

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                           LS  L+LPQ FG I+LGETF SYIS++N S + +RDV +KAE+Q
Sbjct: 58  --------------FSLSSFLMLPQNFGNIFLGETFSSYISVHNDSNMRIRDVAVKAELQ 103

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR+ L D + S  E +  G   D +V H+VKELG H LVC+  Y   + ERK   +F
Sbjct: 104 TTSQRVPLSDLAPSDKE-LSPGASVDVVVHHEVKELGVHILVCSVSYMTADDERKIFRKF 162

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE--PSQNWSA 245
           FKF V +PL+V+TKV  V       ++  FLEA ++N T + +Y++ V+FE  P  ++  
Sbjct: 163 FKFNVLHPLAVKTKVYNV-------EDDIFLEAQVQNITPAPMYIEAVKFEAMPQFDFQD 215

Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIH-------NYLYQLKMLSHGSSSPVKVQ 298
             + +    +  ++ ++   K       G   H        YLY+L     G  +    +
Sbjct: 216 LNVLSSAASASSSSTNQAGLKASPATTFGLAYHVNPQDIRQYLYRLSPKVKGDKT---AR 272

Query: 299 GSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKL 358
            ++ +GK+ I W+TN+GE GRLQT Q+        E+ + VVEVP  V ++ PF ++ ++
Sbjct: 273 AADKIGKMDILWKTNMGEVGRLQTSQLPRKLPALTELAVTVVEVPDNVVLEVPFTVQCRI 332

Query: 359 TNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQR 418
           TN ++ +     ++  ++         ++G  +  L P EA  S    L       G+QR
Sbjct: 333 TNYSEHKMS-LRLFAVKSRMTGVLAAGVSGQSLGELFP-EA--SKIIPLEFFPAVPGLQR 388

Query: 419 ITGITVFD 426
           ++G+ + D
Sbjct: 389 VSGLRLMD 396


>gi|291395448|ref|XP_002714113.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
          Length = 402

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 132/426 (30%), Positives = 212/426 (49%), Gaps = 48/426 (11%)

Query: 15  MRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYR 70
           MRL +P+L    P+  +    P DLF  + + DDP                         
Sbjct: 1   MRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV---------------------- 37

Query: 71  SRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDK 130
                 + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  
Sbjct: 38  ------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSS 91

Query: 131 QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKF 190
           QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF
Sbjct: 92  QR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKF 150

Query: 191 IVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKA 250
            V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++ T L +
Sbjct: 151 QVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNS 210

Query: 251 DGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQI 308
                +  +   SR   +P            YLY LK     +     ++G  V+GKL I
Sbjct: 211 VSQAGECLSTFGSRAYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDI 263

Query: 309 TWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGP 368
            W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + +    
Sbjct: 264 VWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--T 321

Query: 369 FEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKL 428
            ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G+ + D  
Sbjct: 322 MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTF 378

Query: 429 EKITYD 434
            K TY+
Sbjct: 379 LKRTYE 384


>gi|441658598|ref|XP_004091270.1| PREDICTED: UPF0533 protein C5orf44 homolog [Nomascus leucogenys]
          Length = 355

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 119/350 (34%), Positives = 188/350 (53%), Gaps = 15/350 (4%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  QR L L  S + V  +
Sbjct: 1   MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
           +     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V+TK    +
Sbjct: 60  KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119

Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
              +   +  FLEA I+N T S ++M++V  EPS  +S T L +     +  +   SR  
Sbjct: 120 SDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMYSVTELNSVSQAGECVSTFGSRAY 179

Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
            +P            YLY LK     +     ++G  V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232

Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
           +        ++ L++  +P  V +++PF +  K+TN + +     ++ L   +++     
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWC 290

Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            I+G ++  L P  +       L L+++  G+Q I+G+ + D   K TY+
Sbjct: 291 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 337


>gi|351699840|gb|EHB02759.1| hypothetical protein GW7_09268, partial [Heterocephalus glaber]
          Length = 396

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 134/427 (31%), Positives = 212/427 (49%), Gaps = 55/427 (12%)

Query: 14  VMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
           VMRL +P+L    P+  +    P DLF  + + DDP                        
Sbjct: 1   VMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------------- 38

Query: 70  RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
                  + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT 
Sbjct: 39  -------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTS 91

Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFK 189
            QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFK
Sbjct: 92  SQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFK 150

Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
           F V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  +S T L 
Sbjct: 151 FQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYSVTELN 204

Query: 250 ADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
           +     +  +   SR   +P            YLY LK     +     ++G  V+GKL 
Sbjct: 205 SVNQAGECVSTFGSRAYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLD 257

Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
           I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN +++   
Sbjct: 258 IVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSER--- 314

Query: 368 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 427
             ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G+ + D 
Sbjct: 315 TMDLVLEMYNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDT 371

Query: 428 LEKITYD 434
             K TY+
Sbjct: 372 FLKRTYE 378


>gi|10435667|dbj|BAB14633.1| unnamed protein product [Homo sapiens]
          Length = 354

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 118/350 (33%), Positives = 190/350 (54%), Gaps = 16/350 (4%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  QR L L  S + V  +
Sbjct: 1   MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
           +     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V+TK    +
Sbjct: 60  KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119

Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
              +   +  FLEA I+N T S ++M++V  EPS  ++ T L +     +  +   SR  
Sbjct: 120 SDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFGSRAY 179

Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
            +P            YLY LK  +  +     ++G  V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232

Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
           +        ++ L++  +P  V +++PF +  K+TN +++     ++ L   +++     
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSER---TMDLVLEMCNTNSIHWC 289

Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            I+G ++  L P  +       L L+++  G+Q I+G+ + D   K TY+
Sbjct: 290 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 336


>gi|388453625|ref|NP_001253285.1| trafficking protein particle complex 13 [Macaca mulatta]
 gi|383412261|gb|AFH29344.1| hypothetical protein LOC80006 isoform 2 [Macaca mulatta]
 gi|384941112|gb|AFI34161.1| hypothetical protein LOC80006 isoform 2 [Macaca mulatta]
          Length = 417

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 122/363 (33%), Positives = 186/363 (51%), Gaps = 43/363 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L +     +  +   SR   +P            YLY LK     +     ++G  
Sbjct: 220 NVTELNSVTQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDK 364
           +++
Sbjct: 333 SER 335


>gi|402871695|ref|XP_003899789.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Papio anubis]
 gi|380816682|gb|AFE80215.1| hypothetical protein LOC80006 isoform 1 [Macaca mulatta]
          Length = 418

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 136/433 (31%), Positives = 213/433 (49%), Gaps = 48/433 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +P+L    P+  +    P DLF  + + DDP                  
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                        + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++K
Sbjct: 54  -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219

Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           + T L +     +  +   SR   +P            YLY LK     +     ++G  
Sbjct: 220 NVTELNSVTQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
           V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN 
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           + +      + +   +S     +   G+    L  +    S    L L+++  G+Q I+G
Sbjct: 333 SSERTMDLVLEMCNTNS-----IHWCGISGRQLGKLHPSSSLSLALTLLSSVQGLQSISG 387

Query: 422 ITVFDKLEKITYD 434
           + + D   K TY+
Sbjct: 388 LRLTDTFLKRTYE 400


>gi|301767850|ref|XP_002919348.1| PREDICTED: UPF0533 protein C5orf44 homolog [Ailuropoda melanoleuca]
          Length = 401

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 131/426 (30%), Positives = 211/426 (49%), Gaps = 49/426 (11%)

Query: 15  MRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYR 70
           MRL +P+L    P+  +    P DLF  + + DDP                         
Sbjct: 1   MRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV---------------------- 37

Query: 71  SRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDK 130
                 + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  
Sbjct: 38  ------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSS 91

Query: 131 QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKF 190
           QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF
Sbjct: 92  QR-LNLSASSAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKF 150

Query: 191 IVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKA 250
            V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++   L +
Sbjct: 151 QVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVAELNS 210

Query: 251 DGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQI 308
                +      SR   +P            YLY LK     +     ++G  V+GKL I
Sbjct: 211 VSQAGECVTTFGSRAYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDI 263

Query: 309 TWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGP 368
            W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN +++    
Sbjct: 264 VWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSER---T 320

Query: 369 FEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKL 428
            ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G+ + D  
Sbjct: 321 MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTF 377

Query: 429 EKITYD 434
            K TY+
Sbjct: 378 LKRTYE 383


>gi|410039326|ref|XP_003950597.1| PREDICTED: UPF0533 protein C5orf44 homolog [Pan troglodytes]
          Length = 355

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 118/350 (33%), Positives = 189/350 (54%), Gaps = 15/350 (4%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  QR L L  S + V  +
Sbjct: 1   MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
           +     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V+TK    +
Sbjct: 60  KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119

Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
              +   +  FLEA I+N T S ++M++V  EPS  ++ T L +     +  +   SR  
Sbjct: 120 SDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFGSRAY 179

Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
            +P            YLY LK  +  +     ++G  V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232

Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
           +        ++ L++  +P  V +++PF +  K+TN + +     ++ L   +++     
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWC 290

Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            I+G ++  L P  +       L L+++  G+Q I+G+ + D   K TY+
Sbjct: 291 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 337


>gi|390459897|ref|XP_002744953.2| PREDICTED: UPF0533 protein C5orf44 isoform 1 [Callithrix jacchus]
          Length = 355

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 118/350 (33%), Positives = 189/350 (54%), Gaps = 15/350 (4%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  QR L L  S + V  +
Sbjct: 1   MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
           +     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V+TK    +
Sbjct: 60  KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119

Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNA--QSREI 264
              +   +  FLEA I+N T S ++M++V  EPS  ++ T L +     +  +  +SR  
Sbjct: 120 SDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFRSRAY 179

Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
            +P            YLY LK     +     ++G  V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232

Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
           +        ++ L++  +P  V +++PF +  K+TN + +     ++ L   +++     
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWC 290

Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            I+G ++  L P  +       L L+++  G+Q I+G+ + D   K TY+
Sbjct: 291 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 337


>gi|395825394|ref|XP_003785920.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Otolemur
           garnettii]
          Length = 355

 Score =  192 bits (487), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 118/350 (33%), Positives = 188/350 (53%), Gaps = 15/350 (4%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  QR L L  S + V  +
Sbjct: 1   MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
           +     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V+TK    +
Sbjct: 60  KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119

Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
              +   +  FLEA I+N T S ++M++V  EPS  ++ T L +     +  +   SR  
Sbjct: 120 SDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFGSRAY 179

Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
            +P            YLY LK     +     ++G  V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232

Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
           +        ++ L++  +P  V +++PF +  K+TN + +     ++ L   +++     
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWC 290

Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            I+G ++  L P  +       L L+++  G+Q I+G+ + D   K TY+
Sbjct: 291 GISGRQLGKLNPSSSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 337


>gi|156392281|ref|XP_001635977.1| predicted protein [Nematostella vectensis]
 gi|156223076|gb|EDO43914.1| predicted protein [Nematostella vectensis]
          Length = 394

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 143/430 (33%), Positives = 212/430 (49%), Gaps = 58/430 (13%)

Query: 15  MRLCRPSLHVEPPLRVDPTDL--FIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSR 72
           MRL +PS++   P++ +  DL   I +D  D  IA+  +P +                  
Sbjct: 1   MRLTKPSMYTSIPVQCESQDLPGSIFKDCHDADIAS--VPGMYD---------------- 42

Query: 73  FLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQR 132
                      L  LLVLPQ FG I+LGETF SY+S++N S   V+D+VIK ++QT  QR
Sbjct: 43  ---------FALGDLLVLPQTFGNIFLGETFASYVSVHNDSNQSVKDIVIKTDLQTSSQR 93

Query: 133 ILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIV 192
           + L   +  PV  +     YD ++ H+VKELG H LVC   YS   GE+ Y  +FFKF V
Sbjct: 94  LTLSGAANMPVAKLDPQKSYDQVIHHEVKELGTHILVCAVSYSSLAGEKMYFRKFFKFQV 153

Query: 193 SNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADG 252
             PL V+TK    +       +  FLEA ++N T S + M+ V  +PS  ++ T L    
Sbjct: 154 LKPLDVKTKFYNAE------DDSVFLEAQVQNITSSPMVMESVRLDPSALYTVTDLNI-A 206

Query: 253 PHSDYNAQSRE---IFKPPVLIRSGGGIH-----NYLYQLKMLSHGSSSPVKVQGSNVLG 304
           P SD N   R+   I++  V    G  +H      YLY+LK  S    +P     S+ +G
Sbjct: 207 P-SDPNKTKRQNAMIYELDV----GSFLHPNDTRQYLYKLKAKSPIDRNPKVRPYSHPVG 261

Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
           KL I WRT+ GE GRLQT Q+        +++L V ++   V +++PF + LKL N  D+
Sbjct: 262 KLDIVWRTSFGERGRLQTSQLSRVIPAIADLKLTVSQMADAVPVERPFPVSLKLKNTCDR 321

Query: 365 EQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITV 424
           +     + ++++   ++  +M  G      + V     TD   N        Q I+G+ V
Sbjct: 322 KMD-LRLLMTKS---KDGAMMWCGTSGKVCSNVGKL--TD---NSSIFLFFTQNISGLRV 372

Query: 425 FDKLEKITYD 434
            DKL   TY+
Sbjct: 373 IDKLSGRTYE 382


>gi|349732103|ref|NP_001085628.2| UPF0533 protein C5orf44 homolog [Xenopus laevis]
 gi|190360172|sp|Q6GPR5.2|CE044_XENLA RecName: Full=UPF0533 protein C5orf44 homolog
          Length = 414

 Score =  191 bits (484), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 136/430 (31%), Positives = 217/430 (50%), Gaps = 46/430 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F           L+  D +T K +++
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFS---------TLMKDDPSTVKGAEI 58

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                         + L  +L LPQ FG I+LGETF SYIS++N S   V+DV +KA++Q
Sbjct: 59  --------------LMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDVQVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASSAVVADLKPDSCIDDVIHHEVKEIGTHILVCAVSYTIQSGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++ + 
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVSE 217

Query: 248 LKADGPHSDYNAQSR---EIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
           L     + D+   S    + +  P+  R       YLY LK     +     ++G  V+G
Sbjct: 218 LNTVITNGDWKGSSTFGTKTYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIG 271

Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
           KL I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + +
Sbjct: 272 KLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSIETIPDTVRLEEPFDITCKITNCSSE 331

Query: 365 EQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITV 424
                ++ L   +++      ++G ++  L P  +   T   L+ +    G+Q ++G+ +
Sbjct: 332 RT--MDLVLEMCNTNAIHWSGVSGRQLGKLHPSSSLHLTLTLLSSVQ---GLQSVSGLRL 386

Query: 425 FDKLEKITYD 434
            D   K TY+
Sbjct: 387 TDTFLKRTYE 396


>gi|383412259|gb|AFH29343.1| hypothetical protein LOC80006 isoform 3 [Macaca mulatta]
 gi|384941114|gb|AFI34162.1| hypothetical protein LOC80006 isoform 3 [Macaca mulatta]
          Length = 411

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 122/359 (33%), Positives = 186/359 (51%), Gaps = 41/359 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 53  --------VNGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++ T 
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMYNVTE 217

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L +     +  +   SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 218 LNSVTQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 270

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN +++
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSER 329


>gi|261260081|sp|A8WX89.2|U533_CAEBR RecName: Full=UPF0533 protein CBG04321
          Length = 401

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 138/450 (30%), Positives = 225/450 (50%), Gaps = 61/450 (13%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           +S++     LA RVMRL RP        +  P D F       DP+  +    L++  V 
Sbjct: 5   ISNSSTQQLLALRVMRLARP--------KFAPLDGFS-----HDPVDPTGFGELLAGKV- 50

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
               ++++  SR   HD    + +   L+ PQ F  IYLGETF  Y+++ N S   V +V
Sbjct: 51  ----AEISKESR---HD----LPIGEYLIAPQMFENIYLGETFTFYVNVVNESESNVVNV 99

Query: 121 VIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
            +K E+QT  QR++L        +ES +  G+   ++ H+VKE+G H L+C+  Y    G
Sbjct: 100 SLKCELQTSTQRVVLPCSVQDVTIESTKCDGQ---VISHEVKEIGQHILICSVNYKTLSG 156

Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
           E  Y  +FFKF VS P+ V+TK    +  A   Q++ +LEA IEN + SN+++++VE +P
Sbjct: 157 ENMYFRKFFKFPVSKPIDVKTKFYSAEDNAN--QDV-YLEAQIENTSNSNMFLERVELDP 213

Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
           SQ++  T +     H D   +  ++ KP         I  +L+ L        SPV V  
Sbjct: 214 SQHYKVTSIS----HEDEFPEVGKLLKP-------KDIRQFLFCL--------SPVDVNN 254

Query: 300 S------NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFL 353
           +        +GKL ++WRT++GE GRLQT  +        ++ L+V + P+ V + KPF 
Sbjct: 255 TLGYKDLTSIGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVDVQKPFE 314

Query: 354 LKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATK 413
           +  +L N +++     ++ L Q  + +  +   +G+ +  L P       DF LN+    
Sbjct: 315 VACRLYNCSERALD-LQLRLEQPSNRQLVICSPSGVSLGQLPPSRY---VDFALNVFPVA 370

Query: 414 LGVQRITGITVFDKLEKITYDSLPDLEIFV 443
           +G+Q I+GI + D   K  Y+     +IFV
Sbjct: 371 VGIQSISGIRITDTFTKRHYEHDDIAQIFV 400


>gi|440908494|gb|ELR58504.1| hypothetical protein M91_16814, partial [Bos grunniens mutus]
          Length = 399

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 132/427 (30%), Positives = 209/427 (48%), Gaps = 54/427 (12%)

Query: 14  VMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
           VMRL +P+L    P+  +    P DLF  + + DDP                        
Sbjct: 3   VMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------------- 40

Query: 70  RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
                  + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT 
Sbjct: 41  -------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTS 93

Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFK 189
            QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFK
Sbjct: 94  SQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFK 152

Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
           F V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++   L 
Sbjct: 153 FQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVAELN 206

Query: 250 ADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
           +     +      SR   +P            YLY LK     +     ++G  V+GKL 
Sbjct: 207 SVNQAGECVTTFGSRAYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLD 259

Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
           I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + +   
Sbjct: 260 IVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER-- 317

Query: 368 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 427
             ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G+ + D 
Sbjct: 318 TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDT 374

Query: 428 LEKITYD 434
             K TY+
Sbjct: 375 FLKRTYE 381


>gi|26351063|dbj|BAC39168.1| unnamed protein product [Mus musculus]
          Length = 354

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 117/350 (33%), Positives = 189/350 (54%), Gaps = 16/350 (4%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  QR L L  S + V  +
Sbjct: 1   MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
           +     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V+TK    +
Sbjct: 60  KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119

Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
              +   +  FLEA I+N T S ++M++V  EPS  ++ T L +     +  +   SR  
Sbjct: 120 SDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVTQAGECISTFGSRGY 179

Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
            +P            YLY LK     +     ++G  V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232

Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
           +        ++ L++  +P  V +++PF +  K+TN +++     ++ L   +++     
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSER---MMDLVLEMCNTNSIHWC 289

Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            I+G ++  L P  +       L L+++  G+Q ++G+ + D   K TY+
Sbjct: 290 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 336


>gi|402871693|ref|XP_003899788.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Papio anubis]
 gi|380816684|gb|AFE80216.1| hypothetical protein LOC80006 isoform 3 [Macaca mulatta]
 gi|380816686|gb|AFE80217.1| hypothetical protein LOC80006 isoform 3 [Macaca mulatta]
 gi|380816688|gb|AFE80218.1| hypothetical protein LOC80006 isoform 3 [Macaca mulatta]
 gi|380816690|gb|AFE80219.1| hypothetical protein LOC80006 isoform 3 [Macaca mulatta]
          Length = 412

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 136/429 (31%), Positives = 213/429 (49%), Gaps = 46/429 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 53  --------VNGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++ T 
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMYNVTE 217

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L +     +  +   SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 218 LNSVTQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 270

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + + 
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 330

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
                + +   +S     +   G+    L  +    S    L L+++  G+Q I+G+ + 
Sbjct: 331 TMDLVLEMCNTNS-----IHWCGISGRQLGKLHPSSSLSLALTLLSSVQGLQSISGLRLT 385

Query: 426 DKLEKITYD 434
           D   K TY+
Sbjct: 386 DTFLKRTYE 394


>gi|56789267|gb|AAH88172.1| Similar to RIKEN cDNA 2410002O22 gene [Rattus norvegicus]
          Length = 359

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 118/353 (33%), Positives = 188/353 (53%), Gaps = 15/353 (4%)

Query: 84  LSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPV 143
           L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  QR L L  S + V
Sbjct: 2   LGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAV 60

Query: 144 ESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVR 203
             ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V+TK  
Sbjct: 61  AELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFY 120

Query: 204 VVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--S 261
             +   +   +  FLEA I+N T S ++M++V  EPS  ++ T L +     +  +   S
Sbjct: 121 NAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVNQAGECVSTFGS 180

Query: 262 REIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQ 321
           R   +P            YLY LK     +     ++G  V+GKL I W+TNLGE GRLQ
Sbjct: 181 RGYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQ 233

Query: 322 TQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEE 381
           T Q+        ++ L++  +P  V +++PF +  K+TN + +     ++ L   ++   
Sbjct: 234 TSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTTSI 291

Query: 382 KVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
               I+G ++  L P  +       L L+++  G+Q ++G+ + D   K TY+
Sbjct: 292 HWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 341


>gi|68270943|gb|AAY88966.1| hypothetical protein FLJ13611 [Homo sapiens]
          Length = 355

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 116/350 (33%), Positives = 188/350 (53%), Gaps = 15/350 (4%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  QR L L  S + V  +
Sbjct: 1   MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
           +     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V+T+    +
Sbjct: 60  KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKFFKFQVLKPLDVKTRFYNAE 119

Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
              +   +  FLEA I+N T S ++M++V  EPS  ++ T L +     +  +   SR  
Sbjct: 120 SDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFGSRAY 179

Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
            +P            YLY  K  +  +     ++G  V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCPKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232

Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
           +        ++ L++  +P  V +++PF +  K+TN + +     ++ L   +++     
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWC 290

Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            I+G ++  L P  +       L L+++  G+Q I+G+ + D   K TY+
Sbjct: 291 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 337


>gi|25149716|ref|NP_741009.1| Protein C56C10.7, isoform a [Caenorhabditis elegans]
 gi|75019616|sp|Q95QQ2.1|U533_CAEEL RecName: Full=UPF0533 protein C56C10.7
 gi|351060501|emb|CCD68177.1| Protein C56C10.7, isoform a [Caenorhabditis elegans]
          Length = 401

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 135/453 (29%), Positives = 221/453 (48%), Gaps = 63/453 (13%)

Query: 1   MSSTPGTHS----LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLIS 56
           M+  P + S    LA RVMRL RP        +  P D F       DP+  +    L++
Sbjct: 1   MADNPASSSSQQLLALRVMRLARP--------KFAPVDGFS-----HDPVDPTGFGELLA 47

Query: 57  SDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLE 116
             V     S+++  SR         + +   L+ PQ F  IYLGETF  Y+++ N S   
Sbjct: 48  GKV-----SEISKESR-------QDLPIGEYLIAPQMFENIYLGETFTFYVNVVNESEKT 95

Query: 117 VRDVVIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS 175
           V  V +K E+QT  QR++L      + +ES +  G+   ++ H+VKE+G H L+C+  Y 
Sbjct: 96  VSSVSLKCELQTSTQRVVLPCSVQDATIESSKCEGQ---VISHEVKEIGQHILICSVNYK 152

Query: 176 DGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQV 235
              GE  Y  +FFKF VS P+ V+TK    +  A   Q++ +LEA IEN + +N+++++V
Sbjct: 153 TSNGENMYFRKFFKFPVSKPIDVKTKFYSAEDNAN--QDV-YLEAQIENTSNANMFLEKV 209

Query: 236 EFEPSQNWSATMLKADGPHSDYNAQSREIFKPP-----VLIRSGGGIHNYLYQLKMLSHG 290
           E +PSQ+++ T +     H D      ++ KP      +   +   +HN L    + S  
Sbjct: 210 ELDPSQHYNVTSIA----HEDEFGDVGKLLKPKDIRQFLFCLTPADVHNTLGYKDLTS-- 263

Query: 291 SSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDK 350
                       +GKL ++WRT++GE GRLQT  +        ++ L+V + P+ V + K
Sbjct: 264 ------------IGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVDVQK 311

Query: 351 PFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLI 410
           PF +  +L N +++     ++ L Q  +        +G+ +  L P +     DF LN+ 
Sbjct: 312 PFEVSCRLYNCSERALD-LQLRLEQPSNRHLVFCSPSGVSLGQLPPSQ---HVDFSLNVF 367

Query: 411 ATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
              +G+Q I+GI + D   K  Y+     +IFV
Sbjct: 368 PVTVGIQSISGIRITDTFTKRIYEHDDIAQIFV 400


>gi|410948701|ref|XP_003981069.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Felis catus]
          Length = 355

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 116/350 (33%), Positives = 186/350 (53%), Gaps = 15/350 (4%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  QR L L  S + V  +
Sbjct: 1   MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
           +     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V+TK    +
Sbjct: 60  KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119

Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
              +   +  FLEA I+N T S ++M++V  EPS  ++   L +     +      SR  
Sbjct: 120 SDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVAELNSVSQAGECVTTFGSRAY 179

Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
            +P            YLY LK     +     ++G  V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232

Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
           +        ++ L++  +P  V +++PF +  K+TN + +     ++ L   +++     
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWC 290

Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            I+G ++  L P  +       L L+++  G+Q ++G+ + D   K TY+
Sbjct: 291 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 337


>gi|145352717|ref|XP_001420684.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580919|gb|ABO98977.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 478

 Score =  188 bits (478), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 132/393 (33%), Positives = 200/393 (50%), Gaps = 52/393 (13%)

Query: 85  SGLLVLPQAFGAIYLGETFCSYISINN------SSTLEVRDVVIKAEIQTDKQRILLLDT 138
           SG L LPQ+FGA+ LGE F S+++  N       ++   R++ IK E+QT+ +R  L D 
Sbjct: 63  SGELTLPQSFGAVALGERFSSFVTFGNFSEPTSGASGTAREIGIKVELQTETRRTTLRDG 122

Query: 139 SKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSV 198
           +K+P+E++R G + D IV  D+KELGAHTLVC+A Y D  GERKY PQ+FKF V+NPLSV
Sbjct: 123 TKTPIETLRPGEKVDLIVTKDLKELGAHTLVCSATYYDAAGERKYSPQYFKFNVANPLSV 182

Query: 199 RTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE-----------PSQNWSATM 247
           RTKVR    G        FLE CIEN T+  L +D   F+           P    +A  
Sbjct: 183 RTKVRAAPRGR------AFLEVCIENTTRYALLLDSARFDTVDGILAKDMTPEFGGAAAT 236

Query: 248 LKA--DGPHSDYNA-QSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
           L    D P +   +   R +++   L  S G  H YL+++   ++ S  P+  Q    LG
Sbjct: 237 LHGVDDSPDAGLPSLGKRAVYR---LDPSTGAAH-YLFEITR-ANASEEPLTPQ--TQLG 289

Query: 305 KLQITWRTNLGEPGRLQTQQI----LGTTITS---KEIELNVVEVP--------SVVGID 349
           KL++ WR  +G+PGRLQTQ I     G+T  S    ++  +++  P        S V  +
Sbjct: 290 KLELRWRGAMGDPGRLQTQVITAGSAGSTAPSPVAAKMRQSIIVHPRPPDAEDVSTVYAE 349

Query: 350 KPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNL 409
            PF+L+  +      +     + +     D    V I+G R + +  +    + +  +  
Sbjct: 350 TPFILRAAVEALAPIKADACVVRV----KDVVSGVYIDGPRAVRVGALSPGQTVNVDIPC 405

Query: 410 IATKLGVQRITGITVFDKLEKITYDSLPDLEIF 442
           +A  LGVQ    + + D ++     +   LE+F
Sbjct: 406 VALGLGVQTCPSLVLCDAVDDAARAAPAPLEVF 438


>gi|26379545|dbj|BAB29083.2| unnamed protein product [Mus musculus]
          Length = 355

 Score =  188 bits (477), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 116/350 (33%), Positives = 188/350 (53%), Gaps = 15/350 (4%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  QR L L  S + V  +
Sbjct: 1   MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
           +     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V+TK    +
Sbjct: 60  KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119

Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
              +   +  FLEA I+N T S ++M++V  EPS  ++ T L +     +  +   SR  
Sbjct: 120 SDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVTQAGECISTFGSRGY 179

Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
            +P            YLY LK     +     ++G  V+GKL I W+TNLGE GR+QT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRVQTNQ 232

Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
           +        ++ L++  +P  V +++PF +  K+TN + +     ++ L   +++     
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSERM--MDLVLEMCNTNSIHWC 290

Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            I+G ++  L P  +       L L+++  G+Q ++G+ + D   K TY+
Sbjct: 291 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 337


>gi|7452545|pir||T15846 hypothetical protein C56C10.7 - Caenorhabditis elegans
          Length = 398

 Score =  187 bits (476), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 133/453 (29%), Positives = 218/453 (48%), Gaps = 66/453 (14%)

Query: 1   MSSTPGTHS----LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLIS 56
           M+  P + S    LA RVMRL RP        +  P D F       DP+  +    L++
Sbjct: 1   MADNPASSSSQQLLALRVMRLARP--------KFAPVDGFS-----HDPVDPTGFGELLA 47

Query: 57  SDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLE 116
             V     S+++  SR         + +   L+ PQ F  IYLGETF  Y+++ N S   
Sbjct: 48  GKV-----SEISKESR-------QDLPIGEYLIAPQMFENIYLGETFTFYVNVVNESEKT 95

Query: 117 VRDVVIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS 175
           V  V +K E+QT  QR++L      + +ES +  G+   ++ H+VKE+G H L+C+  Y 
Sbjct: 96  VSSVSLKCELQTSTQRVVLPCSVQDATIESSKCEGQ---VISHEVKEIGQHILICSVNYK 152

Query: 176 DGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQV 235
              GE  Y  +FFKF VS P+ V+TK    +       +  +LEA IEN + +N+++++V
Sbjct: 153 TSNGENMYFRKFFKFPVSKPIDVKTKFYSAE------NQDVYLEAQIENTSNANMFLEKV 206

Query: 236 EFEPSQNWSATMLKADGPHSDYNAQSREIFKPP-----VLIRSGGGIHNYLYQLKMLSHG 290
           E +PSQ+++ T +     H D      ++ KP      +   +   +HN L    + S  
Sbjct: 207 ELDPSQHYNVTSIA----HEDEFGDVGKLLKPKDIRQFLFCLTPADVHNTLGYKDLTS-- 260

Query: 291 SSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDK 350
                       +GKL ++WRT++GE GRLQT  +        ++ L+V + P+ V + K
Sbjct: 261 ------------IGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVDVQK 308

Query: 351 PFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLI 410
           PF +  +L N +++     ++ L Q  +        +G+ +  L P +     DF LN+ 
Sbjct: 309 PFEVSCRLYNCSERALD-LQLRLEQPSNRHLVFCSPSGVSLGQLPPSQ---HVDFSLNVF 364

Query: 411 ATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
              +G+Q I+GI + D   K  Y+     +IFV
Sbjct: 365 PVTVGIQSISGIRITDTFTKRIYEHDDIAQIFV 397


>gi|432104588|gb|ELK31200.1| hypothetical protein MDA_GLEAN10025801 [Myotis davidii]
          Length = 396

 Score =  187 bits (475), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 131/426 (30%), Positives = 208/426 (48%), Gaps = 54/426 (12%)

Query: 15  MRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYR 70
           MRL +P+L    P+  +    P DLF  + + DDP                         
Sbjct: 1   MRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV---------------------- 37

Query: 71  SRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDK 130
                 + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  
Sbjct: 38  ------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSS 91

Query: 131 QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKF 190
           QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF
Sbjct: 92  QR-LNLSASNAAVSELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKF 150

Query: 191 IVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKA 250
            V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++   L +
Sbjct: 151 QVLKPLDVKTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPSIMYNVAELNS 204

Query: 251 DGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQI 308
                +      SR   +P            YLY LK     +     ++G  V+GKL I
Sbjct: 205 VNQAGECVTTFGSRTYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDI 257

Query: 309 TWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGP 368
            W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + +    
Sbjct: 258 VWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVILEEPFHITCKITNCSSER--T 315

Query: 369 FEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKL 428
            ++ L   +++      I+G ++  L P  +       L L+++  G+Q ++G+ + D  
Sbjct: 316 MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTF 372

Query: 429 EKITYD 434
            K TY+
Sbjct: 373 LKRTYE 378


>gi|449682850|ref|XP_002166018.2| PREDICTED: UPF0533 protein C5orf44 homolog [Hydra magnipapillata]
          Length = 409

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 134/424 (31%), Positives = 206/424 (48%), Gaps = 54/424 (12%)

Query: 8   HSLAFRVMRLCRPS----LHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H L  +VMRL +PS    LHV       P DLF  E               + +D++  K
Sbjct: 10  HLLVLKVMRLTKPSIKSPLHVTAEEHDFPGDLFYNE---------------MMNDISALK 54

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                          A+ + +  +L LPQAFG+IYLGETF  YISI N S    +D+ +K
Sbjct: 55  G--------------AEEMAVGEILSLPQAFGSIYLGETFSCYISILNDSNQCCKDISVK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
            ++QT  QR  L  T+  P + +      D ++ ++VKELG H L+C   YS   GE+ Y
Sbjct: 101 TDMQTATQRFQL--TAFKPKDMLSPDQSVDDVISYEVKELGTHILICAVTYSSQSGEKLY 158

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
           + +F+KF V  PL V+TK    +       ++ FLEA ++N T SN+ M+QV  EPSQ +
Sbjct: 159 MRRFYKFQVLKPLEVKTKFYNGQ------NDLVFLEAQVQNITTSNMCMEQVTLEPSQFY 212

Query: 244 SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
               L      +  +      +  P+  R       YL++L +     S  ++ +    +
Sbjct: 213 HVQSLNFLPKDNKLDGVYGCSYMNPMDTR------QYLFKL-LPKCDDSKEMRTKPPLSI 265

Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT- 362
           GKL I WRTN GE GRLQT Q+   T + ++++L ++E P VV ++K F +K +L N + 
Sbjct: 266 GKLDIVWRTNFGETGRLQTSQLQRMTPSERDVKLVLIEAPDVVSLEKQFQIKCRLENSSP 325

Query: 363 DKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGI 422
            K +    +    N+S     ++  G+    L P+      D  L L+A + G   I G+
Sbjct: 326 AKIEAKLFLTNPHNNS-----MLWCGISGKILGPLPQGSHLDITLLLLAIRPGFHSIGGV 380

Query: 423 TVFD 426
            + D
Sbjct: 381 RIQD 384


>gi|308502446|ref|XP_003113407.1| hypothetical protein CRE_26256 [Caenorhabditis remanei]
 gi|308263366|gb|EFP07319.1| hypothetical protein CRE_26256 [Caenorhabditis remanei]
          Length = 398

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 132/450 (29%), Positives = 215/450 (47%), Gaps = 64/450 (14%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           +S++     LA RVMRL RP          DP D                  P    ++ 
Sbjct: 5   LSNSSTQQMLALRVMRLARPKFAPVGGFSHDPVD------------------PTGFGELL 46

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
             K S+L+  SR       + + +   L+ PQ F  IYLGETF  Y+++ N S   V +V
Sbjct: 47  AGKVSELSKESR-------NDLPIGDYLIAPQMFENIYLGETFTFYVNVVNESETSVVNV 99

Query: 121 VIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
            +K E+QT  QR++L      + +ES +  G+   ++ H+VKE+G H L+C+  Y    G
Sbjct: 100 SLKCELQTSTQRVVLPCSVQDTTIESSKCDGQ---VISHEVKEIGQHILICSVNYKTLSG 156

Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
           E  Y  +FFKF VS P+ V+TK    +       +  +LEA IEN + S++++++VE +P
Sbjct: 157 ENMYFRKFFKFPVSKPIDVKTKFYSAE------NQDVYLEAQIENTSNSSMFLERVELDP 210

Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
           SQ++  T +     H D   +  ++ KP         I  +L+ L        SP+ V  
Sbjct: 211 SQHYKVTSVS----HEDEFPEVGKLLKP-------KDIRQFLFCL--------SPIDVNN 251

Query: 300 S------NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFL 353
           +        +GKL ++WRT++GE GRLQT  +        ++ L+V   P+ V + KPF 
Sbjct: 252 TLGYKDLTSIGKLDMSWRTSMGEKGRLQTSALQRIAPGFGDVRLSVENTPACVDVQKPFE 311

Query: 354 LKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATK 413
           +  +L N +++     ++ L Q  +        +G+ +  L P +     DF LN+    
Sbjct: 312 VACRLYNCSERALD-LQLRLEQPSNRHLVFCSPSGVSLGQLPPSQY---VDFTLNVFPVA 367

Query: 414 LGVQRITGITVFDKLEKITYDSLPDLEIFV 443
           +G+Q I+GI + D   K  Y+     +IFV
Sbjct: 368 VGIQSISGIRITDTFTKRIYEHDDIAQIFV 397


>gi|26368656|dbj|BAB26869.2| unnamed protein product [Mus musculus]
          Length = 349

 Score =  185 bits (470), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 117/350 (33%), Positives = 186/350 (53%), Gaps = 21/350 (6%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  QR L L  S + V  +
Sbjct: 1   MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
           +     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V+TK    +
Sbjct: 60  KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119

Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
                     FLEA I+N T S ++M++V  EPS  ++ T L +     +  +   SR  
Sbjct: 120 TDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVTQAGECISTFGSRGY 173

Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
            +P            YLY LK     +     ++G  V+GKL I W+TNLGE GRLQT Q
Sbjct: 174 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 226

Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
           +        ++ L++  +P  V +++PF +  K+TN + +     ++ L   +++     
Sbjct: 227 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSERM--MDLVLEMCNTNSIHWC 284

Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            I+G ++  L P  +       L L+++  G+Q ++G+ + D   K TY+
Sbjct: 285 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 331


>gi|410929303|ref|XP_003978039.1| PREDICTED: UPF0533 protein C5orf44 homolog [Takifugu rubripes]
          Length = 426

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 137/432 (31%), Positives = 214/432 (49%), Gaps = 38/432 (8%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL            +  LP  I   +        
Sbjct: 10  HLLALKVMRLTKPTLFTNLPVTCEERDL-------PGVTVSECLPSYIGPAIN------- 55

Query: 68  TYRSRFL-LHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEI 126
            +RS  L L   A  +G S     P+    I+LGETF SYIS++N S+  V+D+++KA++
Sbjct: 56  -WRSITLPLAQLAAGMG-SSAPSDPRTVN-IFLGETFSSYISVHNDSSQVVKDILVKADL 112

Query: 127 QTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQ 186
           QT  QR L L  S S V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +
Sbjct: 113 QTSSQR-LNLSASNSAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTSQYGEKLYFRK 171

Query: 187 FFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
           FFKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++ T
Sbjct: 172 FFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMYNVT 231

Query: 247 MLKA----DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
            L      D     +   S   +  P+  R       YLY LK     +     ++G  V
Sbjct: 232 ELNTITSRDTEECTFGKMS---YLQPMDTR------QYLYCLKPKPEYAEKAGVIKGVTV 282

Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT 362
           +GKL I W+TNLGE GRLQT Q+        +I L++  +P  V +++PF +  K+TN +
Sbjct: 283 IGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDIRLSLEMIPDTVNLEEPFDIICKITNCS 342

Query: 363 DKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGI 422
           ++     ++ L   ++        +G ++  L+P     S    L L ++  G+Q ++G+
Sbjct: 343 ERT---MDLVLEMCNTASTHWCGTSGRKLGKLSPA---ASLSLPLTLFSSVQGLQSVSGL 396

Query: 423 TVFDKLEKITYD 434
            + D   K TY+
Sbjct: 397 RLKDTFLKRTYE 408


>gi|405970753|gb|EKC35629.1| UPF0533 protein C5orf44-like protein [Crassostrea gigas]
          Length = 395

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 124/366 (33%), Positives = 182/366 (49%), Gaps = 17/366 (4%)

Query: 83  GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSP 142
           GL  LL LPQ FG I+LGETF SYIS++N ST + RD+ +K ++QT  QR++L       
Sbjct: 44  GLGDLLTLPQNFGNIFLGETFSSYISVHNDSTQQCRDITLKIDLQTTSQRLMLSGADVPA 103

Query: 143 VESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKV 202
            + +      D ++ H+VKELG H LVC   Y+    E+    +FFKF V  PL V+TK 
Sbjct: 104 TDELGPDQSIDDVIHHEVKELGTHILVCAVSYTTNNYEKMAFRKFFKFQVLKPLDVKTKF 163

Query: 203 RVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSR 262
              +       +  +LEA I+N T   +YMD V  EPS  +  T L     ++      +
Sbjct: 164 YNAE------SDEVYLEAQIQNITPGPIYMDHVSLEPSSQYLCTPL-----NNTEGKDQK 212

Query: 263 EIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
           E+    V   +   I  YLY L            ++G   +GK+ I W+TNLGE GRLQT
Sbjct: 213 EMVFGKVNYLNPMDIRQYLYCLVPKPEVIKQNKVMKGVTDIGKIDIVWKTNLGERGRLQT 272

Query: 323 QQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEK 382
            Q+        +I++ + E P  V ++  F +  ++TN  ++      + L  N      
Sbjct: 273 SQLQRVAPGYGDIKVTLEETPDSVVLESSFNIICRITNCCERTMD-LTLTLQNNQPSGLL 331

Query: 383 VVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITY--DSLPDLE 440
              I+G ++  LAP E     D  L LIAT  G+Q I+G+ + D   K TY  D L  + 
Sbjct: 332 WTGISGRQLGKLAPKENL---DLRLTLIATIPGLQTISGLRITDNFLKRTYEHDELASVF 388

Query: 441 IFVDQD 446
           I+ D +
Sbjct: 389 IYNDSN 394


>gi|158294379|ref|XP_315565.3| AGAP005561-PA [Anopheles gambiae str. PEST]
 gi|157015536|gb|EAA11831.3| AGAP005561-PA [Anopheles gambiae str. PEST]
          Length = 429

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 128/444 (28%), Positives = 215/444 (48%), Gaps = 45/444 (10%)

Query: 5   PGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
           P  H LA +VMRL RP+L     L  +P D           +   +   ++ SD T+   
Sbjct: 4   PTEHLLALKVMRLTRPTLISPQILTAEPKD-----------VPQYSFQKILHSDATSVAG 52

Query: 65  SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
            +     +F+L              LPQ+FG IYLGETF SY+ ++N     V +V +KA
Sbjct: 53  CETITAGQFML--------------LPQSFGNIYLGETFSSYVCVHNCRAHPVTNVSVKA 98

Query: 125 EIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           ++Q++  R+ L +   K+   ++      D ++ H+VKE+G H LVC   Y    G    
Sbjct: 99  DLQSNNSRVSLPIHADKTGPVTLNPEETLDDVIHHEVKEIGTHILVCEVSYMTPAGLETS 158

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +       +  +LEA I+N T   + +++VE E S+ +
Sbjct: 159 FRKFFKFQVVKPLDVKTKFYNAET------DDVYLEAQIQNITVGPICLEKVELESSEQY 212

Query: 244 SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
           +   L    P  +    S+ + +P            +LY ++ +   +  P  ++ +N +
Sbjct: 213 TVVSLNT-LPSGESVFSSKTMLQP-------QNSCQFLYCIRPIPEIARDPSALKAANNI 264

Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
           GKL I WR+NLGE GRLQT Q+    +   ++ LNV+E  S V I + F  + ++TN ++
Sbjct: 265 GKLDIVWRSNLGERGRLQTSQLQRCALEYSDLRLNVIEANSTVRIGEGFDFRCRVTNTSE 324

Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
           +     ++ +S N +  +      G+   AL P+E     +F L +   +LG+  I+ + 
Sbjct: 325 RS---MDLLMSLN-TKAKPGCGYTGVTEFALGPLEPGQMKEFPLTVCPVRLGLIVISALQ 380

Query: 424 VFDKLEKITYDSLPDLEIF-VDQD 446
           + D   K  Y+    L++F VD+D
Sbjct: 381 LTDVFTKRKYEFDNFLQVFVVDED 404


>gi|49115693|gb|AAH73045.1| MGC82662 protein [Xenopus laevis]
          Length = 369

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 119/359 (33%), Positives = 190/359 (52%), Gaps = 21/359 (5%)

Query: 79  ADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDT 138
           A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+DV +KA++QT  QR L L  
Sbjct: 11  AEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDVQVKADLQTSSQR-LNLSA 69

Query: 139 SKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSV 198
           S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V
Sbjct: 70  SSAVVADLKPDSCIDDVIHHEVKEIGTHILVCAVSYTIQSGEKMYFRKFFKFQVLKPLDV 129

Query: 199 RTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYN 258
           +TK    +          FLEA I+N T S ++M++V  EPS  ++ + L     + D+ 
Sbjct: 130 KTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVSELNTVITNGDWK 183

Query: 259 AQSR---EIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLG 315
             S    + +  P+  R       YLY LK     +     ++G  V+GKL I W+TNLG
Sbjct: 184 GSSTFGTKTYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIGKLDIVWKTNLG 237

Query: 316 EPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQ 375
           E GRLQT Q+        ++ L++  +P  V +++PF +  K+TN + +     ++ L  
Sbjct: 238 ERGRLQTSQLQRMAPGYGDVRLSIETIPDTVRLEEPFDITCKITNCSSER--TMDLVLEM 295

Query: 376 NDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            +++      ++G ++  L P  +   T   L+ +    G+Q ++G+ + D   K TY+
Sbjct: 296 CNTNAIHWSGVSGRQLGKLHPSSSLHLTLTLLSSVQ---GLQSVSGLRLTDTFLKRTYE 351


>gi|346470407|gb|AEO35048.1| hypothetical protein [Amblyomma maculatum]
          Length = 416

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 130/434 (29%), Positives = 206/434 (47%), Gaps = 52/434 (11%)

Query: 10  LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
           LA +VMRL RPSL    P+  D  D           I  S     +  D+      +L  
Sbjct: 8   LALKVMRLTRPSLFTTVPVVCDSRD-----------IPGSMWMQELKQDLGAPLGLEL-- 54

Query: 70  RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
                        G+   L+LPQ+FG IYLGETF  Y+S++N S   VRDV ++AE+QTD
Sbjct: 55  ------------FGMGSFLMLPQSFGNIYLGETFSCYMSVHNDSQTTVRDVSVRAELQTD 102

Query: 130 KQRILLLDTSKSP--VESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
            Q++ L   +  P  V  +      D ++ H+VK++  H LVCT  YS   G++ +  +F
Sbjct: 103 SQKVFLTGRTDGPAVVAELAPNCSIDEVIHHEVKDINTHILVCTVNYSTQAGDKMHFRKF 162

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +       +  +LEA ++N T + + +++V  EPS +++   
Sbjct: 163 FKFQVYKPLDVKTKFYNAE------SDEVYLEAQLQNITSTPICLEKVALEPSSHFNVCQ 216

Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK-MLSHGSSSPVKVQ------GS 300
           L   G        S+ +F   V   +      YL+ L   L     S + VQ      G 
Sbjct: 217 LNTCG-------DSQSVFG-SVNFLNPHDTRQYLFSLSPRLPPSEPSSLAVQPDRRRSGI 268

Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
             +GKL I WR+ +GE GRLQT Q+       ++I+L +   PS V +++PF +   +TN
Sbjct: 269 TSIGKLDIIWRSAMGERGRLQTSQLERIAPGYEDIKLTIESAPSTVNLEEPFEIACSVTN 328

Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT 420
                Q   ++ L+  ++     ++  G    +L  +E   + +  L  +  + G+Q ++
Sbjct: 329 TC---QRVMDLVLALENAPSSG-LLWQGTSGQSLGKLEPQATVNLKLEAVPFRTGLQGVS 384

Query: 421 GITVFDKLEKITYD 434
           GI + D   K TYD
Sbjct: 385 GIKLSDTYLKQTYD 398


>gi|195473563|ref|XP_002089062.1| GE18914 [Drosophila yakuba]
 gi|194175163|gb|EDW88774.1| GE18914 [Drosophila yakuba]
          Length = 438

 Score =  177 bits (450), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 132/430 (30%), Positives = 217/430 (50%), Gaps = 52/430 (12%)

Query: 5   PGTHSLAFRVMRLCRPSL-HVEPPLRVDPTDLF--IGEDIFDDPIAASNLPPLISSDVTT 61
           P  H +A +VMRL RP+L  + P +  +PTDL    G     D IA +            
Sbjct: 6   PDAHLVALKVMRLMRPTLVGLGPVVTCEPTDLVQRFGNSQESDGIAGA------------ 53

Query: 62  NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
                            A+++    +L+LPQ+FG+IYLGETF SYI ++N++   V  V 
Sbjct: 54  ----------------CAETLAAGQVLLLPQSFGSIYLGETFASYICVHNTTPHPVECVT 97

Query: 122 IKAEIQTDKQRILLL--DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
           +KA++Q++  RI L   + +KSPV  +  GG  D ++ ++VKE+G H LVC   YS   G
Sbjct: 98  VKADLQSNTTRINLSMHENAKSPV-VLPPGGTIDDVIRYEVKEIGTHILVCEVNYSTPAG 156

Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
             + L +FFKF V  PL V+TK    ++      EI +LEA I+N T S   +++VE + 
Sbjct: 157 YAQSLRKFFKFQVLKPLDVKTKFYNAEI-----DEI-YLEAQIQNVTTSPFCLEKVELDG 210

Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
           S+++S T L    P+ +     + + +P            +LY +K     + +   ++ 
Sbjct: 211 SEDYSVTPLNT-LPNGESVFTVKHMLQP-------NNSCQFLYCIKPKGDIAKNVDTLRQ 262

Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
            N +GKL I WR+NLGE GRLQT Q+       K + L V++  + + I   F    ++T
Sbjct: 263 FNNVGKLDIVWRSNLGEKGRLQTSQLQRLPFECKTLRLEVLDAKNTIKIGTIFTFNCRVT 322

Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
           N T +      + L+   S + +     G     L P+++  S +F L++  +KLG+ +I
Sbjct: 323 N-TSEHTMKLNVRLAAKFSADSQYT---GCADFMLNPLQSGESAEFPLSVCPSKLGLVKI 378

Query: 420 TGITVFDKLE 429
           + + + + L+
Sbjct: 379 SPLVLTNTLQ 388


>gi|354491687|ref|XP_003507986.1| PREDICTED: UPF0533 protein C5orf44 homolog, partial [Cricetulus
           griseus]
          Length = 299

 Score =  177 bits (449), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 114/320 (35%), Positives = 167/320 (52%), Gaps = 35/320 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 53  --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  ++ T 
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 223

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L +     +  +   SR   +P            YLY LK     +     ++G  V+GK
Sbjct: 224 LNSVTQAGECVSTFGSRGYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276

Query: 306 LQITWRTNLGEPGRLQTQQI 325
           L I W+TNLGE GRLQT Q+
Sbjct: 277 LDIVWKTNLGERGRLQTSQL 296


>gi|34365494|emb|CAE46070.1| hypothetical protein [Homo sapiens]
 gi|119571731|gb|EAW51346.1| hypothetical protein FLJ13611, isoform CRA_d [Homo sapiens]
          Length = 309

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 114/320 (35%), Positives = 166/320 (51%), Gaps = 41/320 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 53  --------VNGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +          FLEA I+N T S ++M++V  EPS  ++ T 
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDE------VFLEAQIQNMTTSPMFMEKVSLEPSIMYNVTE 217

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L +     +  +   SR   +P            YLY LK  +  +     ++G  V+GK
Sbjct: 218 LNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKNEFAEKAGIIKGVTVIGK 270

Query: 306 LQITWRTNLGEPGRLQTQQI 325
           L I W+TNLGE GRLQT Q+
Sbjct: 271 LDIVWKTNLGERGRLQTSQL 290


>gi|348551658|ref|XP_003461647.1| PREDICTED: UPF0533 protein C5orf44 homolog [Cavia porcellus]
          Length = 479

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 129/425 (30%), Positives = 211/425 (49%), Gaps = 42/425 (9%)

Query: 14  VMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRF 73
           VMRL +P+L    P+  +  DL    D+F+          L+  D +T            
Sbjct: 75  VMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------------ 111

Query: 74  LLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRI 133
              + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  QR+
Sbjct: 112 --VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQRL 169

Query: 134 LLLDTSKSPVESIRAGGRYDFIVEHDVK-ELGAHT-LVCTALYSDGEGERKYLPQFFKFI 191
            L  ++ +  E        +F     V  E+ ++  LVC   Y+   GE+ Y  +FFKF 
Sbjct: 170 NLSASNAAVAELKPDSVMSNFCYLQTVCLEICSYIGLVCAVSYTTQGGEKMYFRKFFKFQ 229

Query: 192 VSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKAD 251
           V  PL V+TK    +   +   +  FLEA I+N T S ++M++V  EPS  +S T L + 
Sbjct: 230 VLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYSVTELNSV 289

Query: 252 GPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQIT 309
               +  +   SR   +P            YLY LK     +     ++G  V+GKL I 
Sbjct: 290 SQAGERVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIV 342

Query: 310 WRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPF 369
           W+TNLGE GRLQT Q+        ++ L++  +P  V +++PF +  K+TN +++     
Sbjct: 343 WKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSERT---M 399

Query: 370 EIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLE 429
           ++ L   D+       ++G ++  L P  + G     L L+++  G+Q ++G+ + D   
Sbjct: 400 DLVLEMCDTSSVHWCGVSGRQLGKLLPSASLG---LALTLLSSVQGLQSVSGLRLTDTFL 456

Query: 430 KITYD 434
           K TY+
Sbjct: 457 KRTYE 461


>gi|330801295|ref|XP_003288664.1| hypothetical protein DICPUDRAFT_34383 [Dictyostelium purpureum]
 gi|325081286|gb|EGC34807.1| hypothetical protein DICPUDRAFT_34383 [Dictyostelium purpureum]
          Length = 509

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 127/448 (28%), Positives = 216/448 (48%), Gaps = 40/448 (8%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           M      H L  +VMRL +P++    P+  +  DL            +S       +   
Sbjct: 1   MEKEKENHLLNLKVMRLSKPNIPTINPILCEKDDLAYESMGLGSNSGSSGNNSGSGTSSP 60

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQ---AFGAIYLGETFCSYISINNSSTLEV 117
           ++  S    +    +  +  + G+ GL + P      G IYLGE FC YIS+NN S  +V
Sbjct: 61  SSPGSAAVEQQLINVSSNTGTNGIEGLGLTPMLQLQSGVIYLGEVFCCYISLNNHSPYQV 120

Query: 118 RDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG 177
            DV +K E+QT  QRI LLD+ K+PV S   G   DF+V+ +VKE G + LVC   YS  
Sbjct: 121 TDVYLKVELQTTSQRICLLDSEKNPVPSFSPGFSSDFVVQREVKESGINILVCAVNYSSP 180

Query: 178 EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
           EGE+K   ++FKF V NPL ++T++        +   I FLEAC+EN T+ +L+++ + F
Sbjct: 181 EGEQKKFRKYFKFQVMNPLVLKTRIH-------NLPNIIFLEACLENATQGSLFIESIVF 233

Query: 238 EPSQNWSATMLKADG--------------------PHSDYNAQSREI-FKPPVLIRSGGG 276
           +P   ++   +  +                      + D N+   +I     ++    G 
Sbjct: 234 DPIDLFTCKDISFEKNLIENNNSDIDNSNSNNVDNSNIDNNSLLSKIKISNDIVFLKQGS 293

Query: 277 IHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIE 336
              YL+Q+      ++   + + S  LG+L ITWR+  GE G+L+T  I    + +++IE
Sbjct: 294 SRQYLFQIIPKDPNNN---ETKTSATLGRLDITWRSYFGEIGKLKTAGI-QRKLGNEDIE 349

Query: 337 LNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAP 396
             +  +P ++ ++KPF +  KL N++++   P +  L +N  D  K+       +  + P
Sbjct: 350 AVLSNIPQLIKLEKPFNITAKLINKSNRTLYP-QFVLIRNKMDGIKI----NSHLPKIEP 404

Query: 397 VEAFGSTDFHLNLIATKLGVQRITGITV 424
           +        ++ +   K G+Q+ITG+ +
Sbjct: 405 ISPNSQVSINVEMFPLKPGMQQITGLAI 432


>gi|225709234|gb|ACO10463.1| UPF0533 protein [Caligus rogercresseyi]
          Length = 425

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 129/445 (28%), Positives = 211/445 (47%), Gaps = 49/445 (11%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLF---IGEDIFDDPIAASNLPPLISSDVTTNKS 64
           H L+ +VMRL RP    +  +  D  D+    + E+   DP +  ++P            
Sbjct: 14  HPLSLKVMRLSRPRFSSKVMITDDSDDILSRTLMEEHLKDPSSCRDVP------------ 61

Query: 65  SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
                              L  LL+LPQ+FG IYLGETF  YIS++N ST     + +K 
Sbjct: 62  ----------------EAALGRLLILPQSFGMIYLGETFSCYISLHNDSTDPCFSISMKC 105

Query: 125 EIQTDKQRILLLDTSKSP--VESIRAGGRYDFIVEHDVKELGAHTLVCTALY-SDGEGER 181
           ++QT   RI L   +K P   + +  G   D ++ H+VK+LG H LVC   Y S    E+
Sbjct: 106 DLQTMVHRITLYPQNKEPPLQDQLLPGDSIDRVLNHEVKDLGTHILVCEVFYTSPKTQEK 165

Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
               + FKF V  PL V+T            +   F+EA I+N T   LY+++V FEPS 
Sbjct: 166 SSFRKLFKFEVKKPLDVKTNFH------NSDENEVFVEATIQNATTGCLYLEKVAFEPST 219

Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           +++ T L +    ++ N+    +F P   +++      YL+ L    +       ++   
Sbjct: 220 HFNVTSLNSIVGLNEDNS----VFGPVNCLQTNDS-RQYLFCLSPKPNFKLDQKLLRSVI 274

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
            +GK+ + WRTNLGE GR++T Q+L T     +I+  +   PSVV + + F +  K+ N 
Sbjct: 275 AIGKIDVIWRTNLGERGRIKTSQLLRTPPVLNDIQFLIESCPSVVMLHQVFNISAKIFNN 334

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++      + + +N S     +M +G     L  ++  G  +F L+++    G+Q I+G
Sbjct: 335 SERTLELEALCVDKNKSR----LMWSGSTAQKLGLLQPDGCLEFTLSVVPLDTGLQVISG 390

Query: 422 ITVFDKLEKITYDSLPDLEIFVDQD 446
           I + D L K  Y+     ++FV  D
Sbjct: 391 IRILDNLLKRAYEFDDSNQVFVTSD 415


>gi|195339717|ref|XP_002036463.1| GM18092 [Drosophila sechellia]
 gi|194130343|gb|EDW52386.1| GM18092 [Drosophila sechellia]
          Length = 438

 Score =  175 bits (444), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 135/440 (30%), Positives = 225/440 (51%), Gaps = 50/440 (11%)

Query: 5   PGTHSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           P +H +A +VMRL RP+L  + P +  +PTDL                        + ++
Sbjct: 6   PDSHLVALKVMRLMRPTLVGLGPVVTCEPTDLV--------------------QRFSNSQ 45

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
            SD       +    A+++    +L+LPQ+FG+IYLGETF SYI ++N++   V  V +K
Sbjct: 46  ESD------GMSGACAETLAAGQVLLLPQSFGSIYLGETFASYICVHNTTPNPVECVTVK 99

Query: 124 AEIQTDKQRILLL--DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
           A++Q++  RI L   + SKSPV  +  GG  D ++ ++VKE+G H LVC   YS   G  
Sbjct: 100 ADLQSNTSRINLSMHENSKSPV-VLPPGGTIDDVIRYEVKEIGTHILVCEVNYSTPAGYA 158

Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
           + L +FFKF V  PL V+TK    ++      EI +LEA I+N T S   +++VE + S+
Sbjct: 159 QSLRKFFKFQVLKPLDVKTKFYNAEI-----DEI-YLEAQIQNVTTSPFCLEKVELDGSE 212

Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           ++S T L    P+ +     + + +P            +LY +K     + +   ++  N
Sbjct: 213 DYSVTPLNT-LPNGESVFTVKHMLQP-------NNSCQFLYCIKPKGDIAKNVDTLRQFN 264

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
            +GKL I WR+NLGE GRLQT Q+       K + L V++  + + I   F    +LTN 
Sbjct: 265 NVGKLDIVWRSNLGEKGRLQTSQLQRLPFECKTLRLEVLDAKNTIKIGTIFTFNCRLTN- 323

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           T +      + L+   S + +     G     L  +++  S +F L++  +KLG+ +IT 
Sbjct: 324 TSEHPMKLNVRLAAKFSPDSQYT---GCADFMLNLLQSGESAEFPLSVCPSKLGLVKITP 380

Query: 422 ITVFDKL--EKITYDSLPDL 439
           + + + L  E+ T +++ D+
Sbjct: 381 LVLTNTLQNEQFTIENVVDV 400


>gi|341892426|gb|EGT48361.1| hypothetical protein CAEBREN_24983, partial [Caenorhabditis
           brenneri]
          Length = 374

 Score =  175 bits (443), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 199/387 (51%), Gaps = 34/387 (8%)

Query: 58  DVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEV 117
           ++   K S+L+  +R   HD    + +   L+ PQ F  IYLGETF  Y+++ N S   V
Sbjct: 20  EILAGKVSELSKETR---HD----LPIGEYLIAPQMFENIYLGETFTFYVNVVNESESNV 72

Query: 118 RDVVIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
            +V +K E+QT  QR+ L      + +E+ +  G+   ++ H+VKE+G H L+C+  Y  
Sbjct: 73  VNVCLKCELQTSTQRVALPCSVQDTIIEASKCDGQ---VISHEVKEIGQHILICSVNYKT 129

Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVE 236
             GE  Y  +FFKF VS P+ V+TK    +       +  +LEA IEN + +N+++++VE
Sbjct: 130 LSGENMYFRKFFKFPVSKPIDVKTKFYSAE------NQDVYLEAQIENTSSANMFLERVE 183

Query: 237 FEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
            +PSQ++  T +     H D   +  ++ KP         I  +L+ L  +   ++   K
Sbjct: 184 LDPSQHYKVTSIS----HQDEFPEIGKLLKP-------RDIRQFLFCLSPMDANNTLGYK 232

Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
              S  +GKL ++WRT++GE GRLQT  +        ++ L+V + P+ V + KPF +  
Sbjct: 233 DLTS--IGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVEVQKPFEILC 290

Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
           +L N +++     ++ L Q  +        +G+ +  L P +     DF LN+    +G+
Sbjct: 291 RLYNCSERALD-LQLRLEQPTNRNLVFCTPSGVSLGQLPPSQY---VDFVLNVFPVAVGI 346

Query: 417 QRITGITVFDKLEKITYDSLPDLEIFV 443
           Q I+GI + D   K  Y+     +IFV
Sbjct: 347 QSISGIRITDTFTKRVYEHDDIAQIFV 373


>gi|194859696|ref|XP_001969431.1| GG10100 [Drosophila erecta]
 gi|190661298|gb|EDV58490.1| GG10100 [Drosophila erecta]
          Length = 438

 Score =  174 bits (442), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 132/442 (29%), Positives = 223/442 (50%), Gaps = 54/442 (12%)

Query: 5   PGTHSLAFRVMRLCRPSL-HVEPPLRVDPTDLF--IGEDIFDDPIAASNLPPLISSDVTT 61
           P  H +A +VMRL RP+L  + P +  +PTDL    G     D IA +            
Sbjct: 6   PDAHLVALKVMRLMRPTLVGLGPVVTCEPTDLVQRFGNSQASDGIAGA------------ 53

Query: 62  NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
                            A+++    +L+LPQ+FG+IYLGETF SYI ++N++   V  V 
Sbjct: 54  ----------------CAETLAAGQVLLLPQSFGSIYLGETFASYICVHNTTPNPVECVT 97

Query: 122 IKAEIQTDKQRILLL--DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
           +KA++Q++  RI L   + +KSPV  +  GG  D ++ ++VKE+G H LVC   YS   G
Sbjct: 98  VKADLQSNTTRINLSMHENAKSPV-VLPPGGTIDDVIRYEVKEIGTHILVCEVNYSTSAG 156

Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
             + L +FFKF V  PL V+TK    ++      EI +LEA I+N T S   +++VE + 
Sbjct: 157 YAQSLRKFFKFQVLKPLDVKTKFYNAEI-----DEI-YLEAQIQNVTTSPFCLEKVELDG 210

Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
           S+++S T L    P+ +     + + +P            +LY +K     + +   ++ 
Sbjct: 211 SEDYSVTPLNT-LPNGESVFTVKHMLQP-------NNSCQFLYCIKPKGDIAKNVDTLRQ 262

Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
            N +GKL I WR+NLGE GRLQT Q+       K + L V++  + + I   F    ++T
Sbjct: 263 FNNVGKLDIVWRSNLGEKGRLQTSQLQRLPFECKTLRLEVLDAKNTIKIGTIFTFNCRVT 322

Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
           N ++         +++  +D +      G     L  +++  S +F L++  +KLG+ +I
Sbjct: 323 NTSEHPMKLNVRLVAKFSADSQ----YTGCADFMLNLLQSGESAEFPLSVCPSKLGLVKI 378

Query: 420 TGITVFDKL--EKITYDSLPDL 439
           + + + + L  E+ T +++ D+
Sbjct: 379 SPLVLTNTLQNEQFTIENVVDV 400


>gi|194761714|ref|XP_001963073.1| GF15760 [Drosophila ananassae]
 gi|190616770|gb|EDV32294.1| GF15760 [Drosophila ananassae]
          Length = 438

 Score =  174 bits (441), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 131/440 (29%), Positives = 225/440 (51%), Gaps = 50/440 (11%)

Query: 5   PGTHSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           P  H +A +VMRL RP+L  + P +  +PTDL                           +
Sbjct: 6   PDAHLVALKVMRLMRPTLVGLGPMVTCEPTDLV--------------------------Q 39

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
             + T  S  +    A+++    +L+LPQ+FG+IYLGETF SYI ++N++T  V  V +K
Sbjct: 40  RFNYTQESDGITGAGAETLAAGQVLLLPQSFGSIYLGETFSSYICVHNTTTHPVECVTVK 99

Query: 124 AEIQTDKQRI--LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
           A++Q++  RI   L +  KSPV  +  GG  D ++ ++VKE+G H LVC   Y+   G  
Sbjct: 100 ADLQSNTSRINLSLHEHVKSPV-VLAPGGTIDDVIRYEVKEIGTHILVCEVNYTTPAGFA 158

Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
           + L +FFKF V  PL V+TK    ++      EI +LEA I+N T S   +++VE + S+
Sbjct: 159 QSLRKFFKFQVLKPLDVKTKFYNAEI-----DEI-YLEAQIQNVTTSPFCLEKVELDSSE 212

Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           ++S T L    P+ +     + + +P            +LY +K  +  +     ++  N
Sbjct: 213 DYSVTPLNT-LPNGESVFTVKHMLQP-------NNSCQFLYCIKPKADIAKDIDTLRQFN 264

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
            +GKL I WR+NLGE GRLQT Q+       K + L V++  + + I   F  K ++TN 
Sbjct: 265 NVGKLDIVWRSNLGEKGRLQTSQLQRLPFECKTLRLEVMDAKNTIKIGTVFTFKCRVTNT 324

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           +++        +S+   D +     +G     L  +++  S +F L++  +KLG+ +++ 
Sbjct: 325 SEQPMKLNVRMVSKFSPDSQ----YSGCADFMLDLLKSGESAEFPLSVCPSKLGLIKVSP 380

Query: 422 ITVFDKL--EKITYDSLPDL 439
           + + + L  E+ T +++ D+
Sbjct: 381 LILTNTLQNEQFTIENVVDV 400


>gi|341880489|gb|EGT36424.1| hypothetical protein CAEBREN_15251 [Caenorhabditis brenneri]
          Length = 380

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 199/387 (51%), Gaps = 34/387 (8%)

Query: 58  DVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEV 117
           ++   K S+L+  +R   HD    + +   L+ PQ F  IYLGETF  Y+++ N S   V
Sbjct: 26  EILAGKVSELSKETR---HD----LPIGEYLIAPQMFENIYLGETFTFYVNVVNESESNV 78

Query: 118 RDVVIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
            +V +K E+QT  QR+ L      + +E+ +  G+   ++ H+VKE+G H L+C+  Y  
Sbjct: 79  VNVCLKCELQTSTQRVALPCSVQDTIIEASKCDGQ---VISHEVKEIGQHILICSVNYKT 135

Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVE 236
             GE  Y  +FFKF VS P+ V+TK    +       +  +LEA IEN + +N+++++VE
Sbjct: 136 LSGENMYFRKFFKFPVSKPIDVKTKFYSAE------NQDVYLEAQIENTSSANMFLERVE 189

Query: 237 FEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
            +PSQ++  T +     H D   +  ++ KP         I  +L+ L  +   ++   K
Sbjct: 190 LDPSQHYKVTSIS----HQDEFPEIGKLLKP-------RDIRQFLFCLSPMDANNTLGYK 238

Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
              S  +GKL ++WRT++GE GRLQT  +        ++ L+V + P+ V + KPF +  
Sbjct: 239 DLTS--IGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVEVQKPFEILC 296

Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
           +L N +++     ++ L Q  +        +G+ +  L P +     DF LN+    +G+
Sbjct: 297 RLYNCSERALD-LQLRLEQPTNRHLVFCSPSGVSLGQLPPSQY---VDFVLNVFPVAVGI 352

Query: 417 QRITGITVFDKLEKITYDSLPDLEIFV 443
           Q I+GI + D   K  Y+     +IFV
Sbjct: 353 QSISGIRITDTFTKRVYEHDDIAQIFV 379


>gi|427789685|gb|JAA60294.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 416

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 129/434 (29%), Positives = 201/434 (46%), Gaps = 52/434 (11%)

Query: 10  LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
           LA +VMRL RPSL    P+  D  D           I  S     +  D+      +L  
Sbjct: 8   LALKVMRLTRPSLFTTLPVVCDSRD-----------IPGSMWMQELKQDLGAPLGLEL-- 54

Query: 70  RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
                        G    L+LPQ+FG IYLGETF  Y+S++N S   VRDV ++AE+QTD
Sbjct: 55  ------------FGAGSFLMLPQSFGNIYLGETFSCYMSVHNDSQTTVRDVSVRAELQTD 102

Query: 130 KQRILLLDTSKS--PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
            Q++LL   +     V  +      D ++ H+VK++  H LVCT  Y+   GE+ +  +F
Sbjct: 103 SQKVLLAGRADGAVAVAELAPNSSIDEVIHHEVKDINTHILVCTVNYTTQAGEKLHFRKF 162

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +       +  +LEA ++N T S + +++V  EPS  ++   
Sbjct: 163 FKFQVYKPLDVKTKFYNAE------SDEVYLEAQLQNITSSPICLEKVALEPSPYFNVCQ 216

Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV-------QGS 300
           L   G        S+ +F  PV   +      YL+ L      S +   V        G 
Sbjct: 217 LNTCG-------DSQSVFG-PVNFLNPHDTRQYLFSLSPRVPSSETGETVAQPEKRRSGV 268

Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
             +GKL I WR+ +GE GRLQT Q+       ++I+L +   PS V +++PF +   + N
Sbjct: 269 TSIGKLDIVWRSAMGERGRLQTSQLERIAPGYEDIKLTIESAPSTVNLEEPFEIACSVMN 328

Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT 420
              +     ++ L+  +      ++  G+   +L  +E   +    L  +  + G+Q I+
Sbjct: 329 TCHRT---MDLVLALENLPSSG-LLWQGMSGQSLGKLEPQATVRITLEAVPFRTGLQSIS 384

Query: 421 GITVFDKLEKITYD 434
           GI + D   K TYD
Sbjct: 385 GIKLSDTYLKQTYD 398


>gi|28574117|ref|NP_609365.3| CG4953 [Drosophila melanogaster]
 gi|74866482|sp|Q95TN1.1|U533_DROME RecName: Full=UPF0533 protein CG4953
 gi|16198171|gb|AAL13894.1| LD37668p [Drosophila melanogaster]
 gi|28380339|gb|AAF52893.3| CG4953 [Drosophila melanogaster]
 gi|220946234|gb|ACL85660.1| CG4953-PA [synthetic construct]
 gi|220955926|gb|ACL90506.1| CG4953-PA [synthetic construct]
          Length = 438

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 133/440 (30%), Positives = 225/440 (51%), Gaps = 50/440 (11%)

Query: 5   PGTHSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           P  H +A +VMRL RP+L  + P +  +PTDL                        ++++
Sbjct: 6   PDAHLVALKVMRLMRPTLVGLGPVVTCEPTDLV--------------------QRFSSSQ 45

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
            SD       +    A+++    +L+LPQ+FG+IYLGETF SYI ++N++   V  V +K
Sbjct: 46  ESD------GMSGACAETLAAGQVLLLPQSFGSIYLGETFASYICVHNTTPNPVECVTVK 99

Query: 124 AEIQTDKQRILLL--DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
           A++Q++  RI L   + +KSPV  +  GG  D ++ ++VKE+G H LVC   YS   G  
Sbjct: 100 ADLQSNTSRINLSMHENAKSPV-VLPPGGTIDDVIRYEVKEIGTHILVCEVNYSTPAGYA 158

Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
           + L +FFKF V  PL V+TK    ++      EI +LEA I+N T S   +++VE + S+
Sbjct: 159 QSLRKFFKFQVLKPLDVKTKFYNAEI-----DEI-YLEAQIQNVTTSPFCLEKVELDGSE 212

Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           ++S T L    P+ +     + + +P            +LY +K     + +   ++  N
Sbjct: 213 DYSVTPLNT-LPNGESVFTVKHMLQP-------NNSCQFLYCIKPKGDIAKNVDTLRQFN 264

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
            +GKL I WR+NLGE GRLQT Q+       K + L V++  + + I   F    ++TN 
Sbjct: 265 NVGKLDIVWRSNLGEKGRLQTSQLQRLPFECKTLRLEVLDAKNTIKIGTIFTFNCRVTN- 323

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           T +      + L+   S + +     G     L  +++  S +F L++  +KLG+ +IT 
Sbjct: 324 TSEHPMKLNVRLAAKFSPDSQYT---GCADFMLNLLQSGESAEFPLSVCPSKLGLVKITP 380

Query: 422 ITVFDKL--EKITYDSLPDL 439
           + + + L  E+ T +++ D+
Sbjct: 381 LVLTNTLQNEQFTIENVVDV 400


>gi|268638273|ref|XP_646894.2| DUF974 family protein [Dictyostelium discoideum AX4]
 gi|187608844|sp|Q55EX6.2|U533_DICDI RecName: Full=UPF0533 protein
 gi|256013093|gb|EAL73120.2| DUF974 family protein [Dictyostelium discoideum AX4]
          Length = 511

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 133/457 (29%), Positives = 227/457 (49%), Gaps = 65/457 (14%)

Query: 7   THSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
            H L  +VMRL +P++    P+  +  DL    +     I +++L       V ++ S+D
Sbjct: 4   NHLLNLKVMRLSKPNIPTINPILCEKQDL--PYETMSTSIDSTSLS---MGSVNSSGSND 58

Query: 67  LTYRSRFLLHDSADSIGLSGLLV---LPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
               +  L+ ++ + I + GL V   L    G IYLGE FC YIS+NN S  +VR+V +K
Sbjct: 59  ----NNQLIGNNGNPINMEGLGVTSMLQLQSGVIYLGEMFCCYISLNNHSPYQVRNVFLK 114

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
            E+QT   RI LLD+ +  V +   G   DF+V+ +VKE G + LVC   Y+  EGE+K 
Sbjct: 115 VELQTTSSRIPLLDSEQQSVPTFNPGFSSDFVVQREVKESGVNILVCAVNYTTPEGEQKK 174

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             ++FKF V NPL ++T++        +   + FLEAC+EN T+ +L+++ + FEP +++
Sbjct: 175 FRKYFKFQVLNPLVLKTRIH-------NLPNVVFLEACLENATQGSLFIESILFEPIEHF 227

Query: 244 SATMLKADGP-------------HSDYNAQSREIFKPPVLIRSGGGIHN---YLYQLKM- 286
           ++  +  +                 + N  +   FK    +   G I N    L  +K+ 
Sbjct: 228 NSKDISFENSLDDNNNLDNNNNNLENDNNLNNLEFK----LNEKGLIENTDELLENIKLT 283

Query: 287 -------LSHGSS-------SP-----VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILG 327
                  L  G S       +P     V+ + S  LG+L ITWR+  GE GRL+T  I  
Sbjct: 284 TSDNIVFLKQGCSRQYLFQITPKDIENVESKNSLPLGRLDITWRSYFGEIGRLKTAAI-Q 342

Query: 328 TTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMIN 387
             +  ++IE +++ +P  + ++KPF +  KL+N++++   P +  L +N  D  K+    
Sbjct: 343 RKLNQEDIECSLINIPDKIKLEKPFSVIAKLSNKSNRILYP-QFMLVRNKMDGIKI---- 397

Query: 388 GLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITV 424
              +  L P++        + +   K G+Q+I G+ +
Sbjct: 398 NSHLPKLDPIQPNSIIQVEIEMFPLKPGMQQIIGLAI 434


>gi|281341772|gb|EFB17356.1| hypothetical protein PANDA_007966 [Ailuropoda melanoleuca]
          Length = 339

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 110/340 (32%), Positives = 178/340 (52%), Gaps = 22/340 (6%)

Query: 97  IYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIV 156
           I+LGETF SYIS++N S   V+D+++KA++QT  QR L L  S + V  ++     D ++
Sbjct: 2   IFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASSAAVAELKPDCCIDDVI 60

Query: 157 EHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEIT 216
            H+VKE+G H LVC   Y+   GE+ Y  +FFKF V  PL V+TK    +          
Sbjct: 61  HHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFYNAETDEV------ 114

Query: 217 FLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREIFKPPVLIRSG 274
           FLEA I+N T S ++M++V  EPS  ++   L +     +      SR   +P       
Sbjct: 115 FLEAQIQNITTSPMFMEKVSLEPSIMYNVAELNSVSQAGECVTTFGSRAYLQPM------ 168

Query: 275 GGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKE 334
                YLY LK     +     ++G  V+GKL I W+TNLGE GRLQT Q+        +
Sbjct: 169 -DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGD 227

Query: 335 IELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMAL 394
           + L++  +P  V +++PF +  K+TN +++     ++ L   +++      I+G ++  L
Sbjct: 228 VRLSLEAIPDTVNLEEPFHITCKITNCSER---TMDLVLEMCNTNSIHWCGISGRQLGKL 284

Query: 395 APVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            P  +       L L+++  G+Q ++G+ + D   K TY+
Sbjct: 285 HPSSSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 321


>gi|157104758|ref|XP_001648554.1| hypothetical protein AaeL_AAEL004198 [Aedes aegypti]
 gi|157104963|ref|XP_001648651.1| hypothetical protein AaeL_AAEL000579 [Aedes aegypti]
 gi|108880202|gb|EAT44427.1| AAEL004198-PA [Aedes aegypti]
 gi|108884143|gb|EAT48368.1| AAEL000579-PA [Aedes aegypti]
          Length = 424

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 134/452 (29%), Positives = 212/452 (46%), Gaps = 61/452 (13%)

Query: 5   PGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
           P  H LA +VMRL RP+                                LISS + T ++
Sbjct: 4   PSEHLLALKVMRLTRPT--------------------------------LISSQIITAEA 31

Query: 65  SDLTYRS-RFLLHDSA------DSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEV 117
            DL   +   +L  SA      +++     + LPQ+FG IYLGETF SY+ ++N     V
Sbjct: 32  KDLPQNTFAGILKSSATTVQDCETLAAGQFMQLPQSFGNIYLGETFSSYVCVHNCRAHPV 91

Query: 118 RDVVIKAEIQTDKQRI-LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
            +V +KA++Q++  RI L +   K     +      D ++ H+VKE+G H LVC   Y  
Sbjct: 92  GNVSVKADLQSNNTRINLPIHVDKQGPVVLHPDETLDDVIHHEVKEIGTHILVCEVSYMT 151

Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVE 236
             G      +FFKF V  PL V+TK    +          +LEA I+N T   + +++VE
Sbjct: 152 PAGLESSFRKFFKFQVVKPLDVKTKFYNAETDE------VYLEAQIQNITVGPICLEKVE 205

Query: 237 FEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
            E S+ ++   L  + P  +     R + +P            +LY +K L    + P+ 
Sbjct: 206 LESSEQYTVVSLN-NLPSGESVFSQRTMLQP-------MNSCQFLYCIKPLPAILNDPMA 257

Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
           ++ +N +GKL I WR+NLGE GRLQT Q+  + I   ++ L V+E  S V I + F  K 
Sbjct: 258 LKAANNIGKLDIVWRSNLGERGRLQTSQLQRSPIEYGDLRLTVIEANSTVKIGEGFDFKC 317

Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKV-VMINGLRIMALAPVEAFGSTDFHLNLIATKLG 415
           ++TN +++        L  N +   KV     G   ++L P+E     +F L +   +LG
Sbjct: 318 RVTNTSERSMD-----LLMNLNTNAKVGCGYTGQTEISLGPLEPGKYKEFSLTVCPVRLG 372

Query: 416 VQRITGITVFDKLEKITYDSLPDLEIF-VDQD 446
           +  IT + + D   K  Y+    +++F VD+D
Sbjct: 373 LITITNLQLTDVFMKRKYEFDDFVQVFVVDED 404


>gi|241702186|ref|XP_002413194.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215507008|gb|EEC16502.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 417

 Score =  171 bits (433), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 135/441 (30%), Positives = 206/441 (46%), Gaps = 50/441 (11%)

Query: 10  LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
           LA +VMRL RPSL    P+  D  D           I  S     +  D+      +L  
Sbjct: 11  LALKVMRLTRPSLFSTLPVVCDSRD-----------IPGSMWLQDLKQDLGAPLGLEL-- 57

Query: 70  RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
                        G    L+LPQ+FG IYLGETF  Y+S++N S   VRDV +KAE+QTD
Sbjct: 58  ------------FGTGSFLMLPQSFGNIYLGETFSCYMSVHNDSEHTVRDVSVKAELQTD 105

Query: 130 KQRILLLDTSK-SPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFF 188
            Q++ L   S+ + V  +      D ++ H+VK++  H LVCT  YS   GE+ +  +FF
Sbjct: 106 SQKVFLTGKSEGTAVPELPPKSSIDEVIHHEVKDINTHILVCTVNYSSHTGEKLHFRKFF 165

Query: 189 KFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
           KF V  PL V+TK    +       +  +LEA ++N T S + +++V  EPSQ+++   L
Sbjct: 166 KFQVYKPLDVKTKFYNAE------SDEVYLEAQLQNITSSPISLEKVALEPSQHFNVCQL 219

Query: 249 KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK------MLSHGSSSPVKVQGSNV 302
            +        A  + IF   V   +      YL+ L        ++  +S      G   
Sbjct: 220 NS-------CADGQSIFG-QVNFLNPHDTRQYLFSLSPRVADAAVAPAASDKRSRSGITS 271

Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT 362
           +GKL I WR+ +GE GRLQT Q+       ++I L V   PS V +++PF +   +TN  
Sbjct: 272 IGKLDIVWRSVMGERGRLQTSQLERIAPGYEDIRLTVDSAPSSVNLEEPFEITCLVTNTC 331

Query: 363 DKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGI 422
              Q   ++ L  ++S     ++  G    +L  +E   S    L  +  + G+Q ++GI
Sbjct: 332 ---QRTMDLVLMLDNSATSG-LLWQGTSGQSLGKLEPQTSLRIKLEAVPFRTGLQGVSGI 387

Query: 423 TVFDKLEKITYDSLPDLEIFV 443
            + D   K  YD      +FV
Sbjct: 388 KLNDTFLKQVYDYDDITSVFV 408


>gi|291234053|ref|XP_002736964.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 409

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 116/357 (32%), Positives = 177/357 (49%), Gaps = 50/357 (14%)

Query: 8   HSLAFRVMRLCRPSLHVEPPL----RVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           H LA +VMRL +PS     P+    R  P +LF+                 + +D+++NK
Sbjct: 9   HLLALKVMRLTKPSFMTTIPVLSEDRDLPGNLFLQA---------------LQTDLSSNK 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                           ++  +  LL LPQ FG I+LGETF  YIS++N S+  V D+++K
Sbjct: 54  G--------------IENFAMGELLTLPQNFGNIFLGETFSCYISVHNDSSQSVSDILVK 99

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
            ++QT  QR+ L   + SP  ++      D ++ H+VKELG H LVC   YS   GE+ Y
Sbjct: 100 TDLQTSSQRLTLSGGNVSPSPNLSPENCIDEVIHHEVKELGTHILVCAVSYSISSGEKMY 159

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             +FFKF V  PL V+TK    +          +LEA I+N T S + M++V  EPS  +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESNE------VYLEAQIQNITNSPMVMERVTLEPSILY 213

Query: 244 SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
           ++  L     +S  + ++ E     +   +      YLY L   S  +      +G   +
Sbjct: 214 NSQEL-----NSILSKENSETTFGNLSYLNAMDTRQYLYCLTPKSSDN------KGVTNI 262

Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
           GKL I W+T+LGE GRLQT Q+        +I L + ++P  V ++KPF +  K+ N
Sbjct: 263 GKLDIVWKTHLGEKGRLQTSQLQRMAPGYGDIRLTIEQIPDGVQLEKPFTVICKVIN 319


>gi|195578101|ref|XP_002078904.1| GD23672 [Drosophila simulans]
 gi|194190913|gb|EDX04489.1| GD23672 [Drosophila simulans]
          Length = 417

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 128/428 (29%), Positives = 218/428 (50%), Gaps = 48/428 (11%)

Query: 5   PGTHSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           P +H +A +VMRL RP+L  + P +  +PTDL                        + ++
Sbjct: 6   PDSHLVALKVMRLMRPTLVGLGPVVTCEPTDLV--------------------QRFSNSQ 45

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
            SD       +    A+++    +L+LPQ+FG+IYLGETF SYI ++N++   V  V +K
Sbjct: 46  ESD------GMSGACAETLAAGQVLLLPQSFGSIYLGETFASYICVHNTTPNPVECVTVK 99

Query: 124 AEIQTDKQRILLL--DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
           A++Q++  RI L   + +KSPV  +  GG  D ++ ++VKE+G H LVC   YS   G  
Sbjct: 100 ADLQSNTSRINLSMHENAKSPV-VLPPGGTIDDVIRYEVKEIGTHILVCEVNYSTPAGYA 158

Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
           + L +FFKF V  PL V+TK    ++      EI +LEA I+N T S   +++VE + S+
Sbjct: 159 QSLRKFFKFQVLKPLDVKTKFYNAEI-----DEI-YLEAQIQNVTTSPFCLEKVELDGSE 212

Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           ++S T L    P+ +     + + +P            +LY +K     + +   ++  N
Sbjct: 213 DYSVTPLNT-LPNGESVFTVKHMLQP-------NNSCQFLYCIKPKGDIAKNVDTLRQFN 264

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
            +GKL I WR+NLGE GRLQT Q+       K + L V++  + + I   F    ++TN 
Sbjct: 265 NVGKLDIVWRSNLGEKGRLQTSQLQRLPFECKTLRLEVLDAKNTIKIGTIFTFNCRVTN- 323

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           T +      + L+   S + +     G     +  +++  S +F L++  +KLG+ +IT 
Sbjct: 324 TSEHPMKVNVRLAAKFSPDSQYT---GCADFMMNFLQSGESAEFPLSVCPSKLGLVKITP 380

Query: 422 ITVFDKLE 429
           + + + ++
Sbjct: 381 LVLTNTIQ 388


>gi|281202555|gb|EFA76757.1| DUF974 family protein [Polysphondylium pallidum PN500]
          Length = 494

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 146/448 (32%), Positives = 219/448 (48%), Gaps = 77/448 (17%)

Query: 10  LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
           L  +VMRL +P + V   +  +  D  I  DI          PPLI         ++ TY
Sbjct: 9   LNLKVMRLSKPHIPVNNSILCERDD--IASDIL--------FPPLIQF------GNNDTY 52

Query: 70  RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
                     +++G+S +L L    G IYLGE F SYIS+NN ST +V +V +K E+QT 
Sbjct: 53  GG------GIEALGISPMLQLQS--GTIYLGEIFTSYISLNNHSTHDVTNVFLKVELQTS 104

Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFK 189
            QRILLLD+ +SP+     G   DF+V+ +VKE G + L C   Y   EGE K   +FFK
Sbjct: 105 TQRILLLDSEQSPIAKFGPGFNSDFVVQREVKESGVNILCCAVNYVTPEGEIKKFKKFFK 164

Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
           F V NPL ++TK+        H     FLEAC+EN T+ +L+++ + FEPS+ ++   L 
Sbjct: 165 FQVMNPLIIKTKIH-------HIPNQIFLEACLENATQGSLFLESILFEPSELFNFVNL- 216

Query: 250 ADGPHS------------------------------DYNAQSREIFKPP-VLIRSGGGIH 278
           ++  H+                              D N+   EI     V+     G  
Sbjct: 217 SENSHNVNATPISSPPLTSPSTTSSPTSNVNFKSSVDSNSILSEIKSTSNVVFLKESGSR 276

Query: 279 NYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELN 338
            YL++   ++    +    + S  LGKL ITWR+ LGE GRL+T  I    I   E+E  
Sbjct: 277 QYLFK---ITPKDPNDFDTKNSASLGKLDITWRSYLGEIGRLKTAYI-QRKINIDEVECI 332

Query: 339 VVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGL--RIMALAP 396
           +  +P  V ++KPF++  KL N+T++   P  + L +N  D    +++NG   +I AL P
Sbjct: 333 LTHIPK-VELEKPFVVTAKLVNKTNRILYPLFV-LVRNKMDG---ILVNGHLPKIGALPP 387

Query: 397 VEAFGSTDFHLNLIATKLGVQRITGITV 424
                S D  + +   K G+Q+I G+ +
Sbjct: 388 N---NSLDIDIEMFPIKPGMQQIVGLAI 412


>gi|321467962|gb|EFX78950.1| hypothetical protein DAPPUDRAFT_320008 [Daphnia pulex]
          Length = 414

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 133/442 (30%), Positives = 214/442 (48%), Gaps = 43/442 (9%)

Query: 4   TPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           T     L+ +VMRL RP          +P DL          I +     +++ D   ++
Sbjct: 3   TKADQILSIKVMRLSRPVFTQPGLFHPEPWDLV-------STILSQEENNVLTEDA--DQ 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
           + D T+ S+F            GLL LPQ+FG IYLGETF SY+ + N  +  V ++ IK
Sbjct: 54  TLDKTFSSQF------------GLL-LPQSFGTIYLGETFQSYLRVQNVGSCLVSNISIK 100

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  QR+ L   +K  +  +      D I+ H++ E+G H LVC   Y  GEGE+  
Sbjct: 101 ADLQTAAQRLPLTKRNKVSINQLEPQQSTDDILSHEITEIGTHILVCEVSYQIGEGEQMT 160

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSN-LYMDQVEFEPSQN 242
             +++KF V  PL V+TK    +       +  +LEA I+N T    L +D+V  EPS  
Sbjct: 161 SSRYYKFQVLKPLDVKTKFYNAE------SDDVYLEAQIQNTTVDRPLCLDKVTMEPSTL 214

Query: 243 WSATMLK-----ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
           +  + L         P S+      ++F   V +   G I  YL+ LK   +   +   +
Sbjct: 215 FEVSSLNEISATTGTPWSNMP----QLFGKCVNVVQPGEIRQYLHCLKPKQNVRDNHRML 270

Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
           +G + +GKL + WRT +G+ GRLQT Q+        ++ L + E+P+ V + +P     K
Sbjct: 271 RGESNIGKLDLIWRTAIGDRGRLQTSQLQRMVPNYGDVRLTIQELPNPVKLHRPINFVCK 330

Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
           +TN +++   P E+ L   +   +  V+  G+    L  ++   ST+  L L+    G+Q
Sbjct: 331 ITNTSER---PVELSLVL-EIRSKPTVLWTGISNRPLKKIDPNHSTEVSLKLVPVMPGLQ 386

Query: 418 RITGITVFDKLEKITYDSLPDL 439
            I+G+ + D   K TYD  PD+
Sbjct: 387 SISGLKLIDLFLKRTYD-YPDI 407


>gi|170036870|ref|XP_001846284.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167879819|gb|EDS43202.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 424

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 125/438 (28%), Positives = 211/438 (48%), Gaps = 46/438 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL RP+L     +  +  DL   ++ FD          ++    TT +    
Sbjct: 7   HLLALKVMRLTRPTLVSSQIVTAEAKDL--PQNTFDK---------ILRGTATTVQG--- 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                      A+++    +++LPQ+FG IYLGETF SY+ ++N     V  V +KA++Q
Sbjct: 53  -----------AETLTAGQMMLLPQSFGNIYLGETFSSYVCVHNCRAHPVSSVTVKADLQ 101

Query: 128 TDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQ 186
           ++  RI L +   K   +++      D ++ H+VKE+G H LVC   Y    G      +
Sbjct: 102 SNNTRISLPIHVDKEGPQTLNPDETMDDVIHHEVKEIGTHILVCEVSYMTPAGLETSFRK 161

Query: 187 FFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
           FFKF V  PL V+TK    +          +LEA I+N T   + +++VE E S+ ++  
Sbjct: 162 FFKFQVVKPLDVKTKFYNAETDEV------YLEAQIQNITVGPICLEKVELESSEQYTVV 215

Query: 247 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 306
            L  + P  +     R + +P            +LY +K ++   + P  ++ +N +GKL
Sbjct: 216 PLN-NLPTGESVFSQRTMLQP-------QNSCQFLYCIKPIAEILNDPKALKAANNIGKL 267

Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 366
            I WR+NLGE GRLQT Q+  + I   ++ L V E  S V I   F  + ++TN +++  
Sbjct: 268 DIVWRSNLGERGRLQTSQLQRSPIEYGDLRLAVTEANSTVKIGDAFDFRCRVTNTSER-- 325

Query: 367 GPFEIWLSQNDSDEEKV-VMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               + L  + + + K+     G   ++L P+E     DF L +   +LG+  I+ + + 
Sbjct: 326 ---SMDLVMHLNTKTKIGCGYTGQTEISLGPLEPGKFKDFGLTVCPVRLGLITISNLQLT 382

Query: 426 DKLEKITYDSLPDLEIFV 443
           D   K  Y+    +++FV
Sbjct: 383 DVFMKRKYEFDDFVQVFV 400


>gi|307171192|gb|EFN63179.1| UPF0533 protein [Camponotus floridanus]
          Length = 402

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 132/442 (29%), Positives = 205/442 (46%), Gaps = 46/442 (10%)

Query: 4   TPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
           T   H LA +VMRL RP+L     +  D TDL             + L   + SD T  +
Sbjct: 5   TKSDHLLALKVMRLTRPTLASPMVVTCDSTDL-----------PGNTLNNELKSDCTALQ 53

Query: 64  SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
                           +++ +   +VLPQ+FG IYLGE F SY+ ++N S   V++V +K
Sbjct: 54  --------------GMEALAIGQFMVLPQSFGNIYLGEIFSSYLCVHNGSNQVVKNVTVK 99

Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
           A++QT  Q I L   S    E +      D ++ H+VKE+G H LVC   Y++  G    
Sbjct: 100 ADLQTSTQTISLSGNSLEGKE-LAPDSTVDEVIHHEVKEIGTHILVCEVSYTNQIGPPLS 158

Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
             ++FKF V  PL V+TK    +       +  +LEA I+N T   + +++V  E S  +
Sbjct: 159 FRKYFKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLEKVALESSHLF 212

Query: 244 SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
           S T L       + N +   I+    L+ +      YLY LK        P  +Q +  +
Sbjct: 213 SVTTL-------NINDEGESIYGSVNLLDTNCS-RQYLYCLKPQLSLMKDPKMMQNATNI 264

Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
           GKL I WR+NLGE GRLQT Q+        ++ + + ++P  V +++P      + N ++
Sbjct: 265 GKLDIVWRSNLGERGRLQTSQLQRMAPEYGDLRVIMKDIPLKVNLEEPVNCTCHIINTSE 324

Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
           +     E+ LS   ++      I+   I +L P     S D  L LI    G+  I+G+ 
Sbjct: 325 RS---MELLLSLESNESIAWCGISNTMIGSLKP---GISMDIPLCLIMLNTGIITISGLK 378

Query: 424 VFDKLEKITYDSLPDLEIFVDQ 445
           + D   K  YD     +IFV+Q
Sbjct: 379 LTDTFLKRVYDYDDLAQIFVNQ 400


>gi|344247412|gb|EGW03516.1| UPF0533 protein C5orf44-like [Cricetulus griseus]
          Length = 294

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 111/314 (35%), Positives = 162/314 (51%), Gaps = 41/314 (13%)

Query: 14  VMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRF 73
           VMRL +P+L    P+  +  DL    D+F+          L+  D +T            
Sbjct: 1   VMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------------ 37

Query: 74  LLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRI 133
              + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++QT  QR 
Sbjct: 38  --VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR- 94

Query: 134 LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVS 193
           L L  S + V  ++     D ++ H+VKE+G H LVC   Y+   GE+ Y  +FFKF V 
Sbjct: 95  LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVL 154

Query: 194 NPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGP 253
            PL V+TK    +       +  FLEA I+N T S ++M++V  EPS  ++ T L +   
Sbjct: 155 KPLDVKTKFYNAET------DEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVTQ 208

Query: 254 HSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWR 311
             +  +   SR   +P            YLY LK     +     ++G  V+GKL I W+
Sbjct: 209 AGECVSTFGSRGYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWK 261

Query: 312 TNLGEPGRLQTQQI 325
           TNLGE GRLQT Q+
Sbjct: 262 TNLGERGRLQTSQL 275


>gi|195051148|ref|XP_001993042.1| GH13306 [Drosophila grimshawi]
 gi|193900101|gb|EDV98967.1| GH13306 [Drosophila grimshawi]
          Length = 438

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 128/452 (28%), Positives = 211/452 (46%), Gaps = 72/452 (15%)

Query: 7   THSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
           +H LA +VMRL RP+L  + P +  +P DL              +   L+  D       
Sbjct: 8   SHLLALKVMRLTRPTLVGLGPIVTCEPKDL------------PQSFNRLVEYD------- 48

Query: 66  DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
                   +   SA+++G    ++LPQ+FG IYLGETF SYI ++N +T  V  V +K +
Sbjct: 49  -------GIARTSAEALGAGQTMLLPQSFGNIYLGETFSSYICVHNCTTHPVEGVSVKVD 101

Query: 126 IQTDKQRI-LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
           +Q++  RI LL+  +K     + A    D ++ ++VKE+G H LVC   Y+   G  + L
Sbjct: 102 LQSNNTRINLLMHENKKSSVVLTADETLDDVIRYEVKEIGTHILVCEVNYTSPAGFAQSL 161

Query: 185 PQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS 244
            +FFKF V  PL V+TK            EI +LEA I+N T     +++VE + S+ ++
Sbjct: 162 RKFFKFQVLKPLDVKTKFY-----NAEMDEI-YLEAQIQNVTTGPFCLEKVELDSSEQYT 215

Query: 245 ATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
            T L    P+ +    S+ + +P            +LY +K     +     ++ +N +G
Sbjct: 216 VTSLNT-LPNGESVFTSKNMLQP-------NNSCQFLYCIKPKPEIAKDIKTLRQANNVG 267

Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
           KL I WR+N GE GRLQT Q+       K++ L V++  ++V I      + ++TN    
Sbjct: 268 KLDIVWRSNFGEKGRLQTSQLQRLPFEYKDLRLEVIDAENIVKIGTILTFQCRVTN---- 323

Query: 365 EQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIA------------- 411
                        + E  + +   L   A A     GS DF L+++              
Sbjct: 324 -------------TAEHSMKLHVTLETKAFADCPYTGSADFELDVLQPGEMAEFPLTICP 370

Query: 412 TKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
           +KLG+ +I+ + + D L+   +     +E+FV
Sbjct: 371 SKLGLIKISPLLIVDTLKNEQFLMTKVVEVFV 402


>gi|156546906|ref|XP_001599918.1| PREDICTED: UPF0533 protein C5orf44 homolog [Nasonia vitripennis]
          Length = 404

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 131/449 (29%), Positives = 208/449 (46%), Gaps = 51/449 (11%)

Query: 1   MSSTPGT-HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDV 59
           M S P + H LA +VMRL RP+L     +  D TDL             + L   + +D 
Sbjct: 1   MESKPKSEHLLALKVMRLTRPTLASPVVVTCDSTDL-----------PGNTLNVELKNDC 49

Query: 60  TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
           T  +                +++ +   ++LPQ+FG IYLGE F SY+ ++N S   V+D
Sbjct: 50  TALQ--------------GMETVAIGQFMILPQSFGNIYLGEIFSSYLCVHNGSHQAVKD 95

Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD--- 176
           V +KA +QT  Q I L   +    E +      D ++ H+VKE G H LVC   Y+    
Sbjct: 96  VTVKANLQTSTQTIPLSGQNSQATE-LAPNHTIDEVIHHEVKETGTHILVCEVTYTPLLL 154

Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVE 236
           G     +  +FFKF V  PL V+TK    +       +  ++EA I+N T   + +++V 
Sbjct: 155 GSQPLSF-RKFFKFQVVKPLDVKTKFYNAE------NDEVYIEAQIQNLTAGPICLEKVA 207

Query: 237 FEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
            E S  ++ + L A       N +   I+    L+ SG     YLY LK     +  P  
Sbjct: 208 LESSHLFTVSTLSA-------NEKQESIYGKLNLLDSGHS-RQYLYCLKPTPSLAKDPKM 259

Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
           +  +  +GKL I WR+NLGE GRLQT Q+        ++ ++  ++PS + I++P   K+
Sbjct: 260 MHNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPDYGDLRVSAKDIPSKIYIEEPVNFKI 319

Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
            + N T + Q    + L  N S     V  +G+    +  ++   S    L LI  + G+
Sbjct: 320 HIIN-TSERQMDLLLGLQSNTS-----VAWSGISDKMIGTLKPGESVHLPLCLIPLESGL 373

Query: 417 QRITGITVFDKLEKITYDSLPDLEIFVDQ 445
             ++G+ + D   K  YD     +IFV+ 
Sbjct: 374 VAVSGLKLTDTFLKRVYDYDDLAQIFVNH 402


>gi|308810202|ref|XP_003082410.1| unnamed protein product [Ostreococcus tauri]
 gi|116060878|emb|CAL57356.1| unnamed protein product [Ostreococcus tauri]
          Length = 463

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 114/358 (31%), Positives = 175/358 (48%), Gaps = 45/358 (12%)

Query: 117 VRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
            R+V IK E+QT+ +R  L D ++ P+  +R G + D +V  DVKELGAHTLVC+A Y D
Sbjct: 86  AREVGIKIELQTETRRTTLHDATREPIAVLRPGEKRDVVVSKDVKELGAHTLVCSAAYCD 145

Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVE 236
             GER+Y PQ+FKF VSNPLSVRTK R    G        FLE C+EN T++ L ++   
Sbjct: 146 ENGERRYSPQYFKFKVSNPLSVRTKTRAAPRGR------IFLEVCVENATRNALLLEGAR 199

Query: 237 FE-----------PSQNWSATMLKADGPHSDYNAQSREIFKPPV--LIRSGGGIHNYLYQ 283
           F+           P     AT  + D   +D       I K  V  L  +GG  H +LY+
Sbjct: 200 FDAVDGIMSRDMTPENAGQAT--RVDVGENDRGPGLPSIGKRAVYRLDPTGGSAH-FLYE 256

Query: 284 LKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKE-------IE 336
           +      +++      +  LGKL++ WR  +G+ GRLQTQ I   +  S +       I 
Sbjct: 257 IT----SANASTTFAPTTPLGKLELRWRGAMGDLGRLQTQVINAGSAGSSDPVPEIAKIH 312

Query: 337 LNVVEVP--------SVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMING 388
             ++  P        S V +++PF L+ ++      E G F + +     D    V ++G
Sbjct: 313 QTIIVDPKPANAEEESTVYVERPFTLRARIEALAPIEAGAFALRV----RDVVTGVYVDG 368

Query: 389 LRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD 446
            R   +  ++   + D  ++ +A  LGVQ    + +   ++     +   LE+FV +D
Sbjct: 369 PRAFRIDSLDRGQTVDVDVSCVALGLGVQTCPTLALCGAVDDALLHAPTPLEVFVVRD 426


>gi|195118796|ref|XP_002003922.1| GI18169 [Drosophila mojavensis]
 gi|193914497|gb|EDW13364.1| GI18169 [Drosophila mojavensis]
          Length = 438

 Score =  162 bits (409), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 124/438 (28%), Positives = 212/438 (48%), Gaps = 46/438 (10%)

Query: 8   HSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
           H LA +VMRL RP+L  + P +  +P DL              +   L+  D        
Sbjct: 9   HLLALKVMRLTRPTLVGLGPIVTCEPKDL------------PQSFNRLVEFD-------- 48

Query: 67  LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEI 126
                  +   SA+++G    ++LPQ+FG IYLGETF SYI ++N +T  V  V +K ++
Sbjct: 49  ------GIARTSAEALGAGQTMLLPQSFGNIYLGETFSSYICVHNCTTHPVEGVSVKVDL 102

Query: 127 QTDKQRI-LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP 185
           Q++  +I LL+  +K     + A    D ++ ++VKE+G H LVC   Y+   G  + L 
Sbjct: 103 QSNSSQINLLMHENKKSSVVLTADETLDDVIRYEVKEIGTHILVCEVNYTSPAGFAQSLR 162

Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
           +FFKF V  PL V+TK            EI +LEA I+N T     +++VE + S+ ++ 
Sbjct: 163 KFFKFQVLKPLDVKTKFY-----NAEMDEI-YLEAQIQNVTTGPFCLEKVELDSSEQYTV 216

Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           T L    P+ +    S+ + +P            +LY +K  +  +     ++ +N +GK
Sbjct: 217 TSLNT-LPNGESVFTSKNMLQP-------NNSCQFLYCIKPKAEIAKDIKTLREANNVGK 268

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I WR+N GE GRLQT Q+       K++ L V +  ++V I   F  + ++TN  +  
Sbjct: 269 LDIVWRSNFGEKGRLQTSQLQRLPFEYKDLRLEVTDAENIVKIGTIFTFQCRITNTAEH- 327

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
             P ++ + + D+         G     L  ++     +F L +  +KLG+ +++ + + 
Sbjct: 328 --PMKLHV-KLDTKVFPGCPYTGSADFELDTLQPGQLAEFPLTICPSKLGLIKVSPLVIV 384

Query: 426 DKLEKITYDSLPDLEIFV 443
           D L+   +     +E+FV
Sbjct: 385 DTLKNEQFIMTKVVEVFV 402


>gi|195384916|ref|XP_002051158.1| GJ14608 [Drosophila virilis]
 gi|194147615|gb|EDW63313.1| GJ14608 [Drosophila virilis]
          Length = 438

 Score =  161 bits (408), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 125/439 (28%), Positives = 209/439 (47%), Gaps = 46/439 (10%)

Query: 7   THSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
           TH LA +VMRL RP+L  + P +  +P DL              +   L+  D       
Sbjct: 8   THLLALKVMRLTRPTLVGLGPIVTCEPKDL------------PQSFNRLVEFDG------ 49

Query: 66  DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
                   +    A+++G    ++LPQ+FG IYLGETF SYI ++N ++  V  V +K +
Sbjct: 50  --------IARTCAEALGAGQTMLLPQSFGNIYLGETFSSYICVHNCTSHPVEGVSVKVD 101

Query: 126 IQTDKQRI-LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
           +Q++  RI LL+  +K     + A    D ++ ++VKE+G H LVC   Y+   G  + L
Sbjct: 102 LQSNTSRINLLMHENKKSSVVLTADETLDDVIRYEVKEIGTHILVCEVNYTSPAGFAQSL 161

Query: 185 PQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS 244
            +FFKF V  PL V+TK            EI +LEA I+N T     +++VE + S+ ++
Sbjct: 162 RKFFKFQVLKPLDVKTKFY-----NAEMDEI-YLEAQIQNVTTGPFCLEKVELDSSEQYT 215

Query: 245 ATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
            T L    P+ +    S+ + +P            +LY +K     +     ++ +N +G
Sbjct: 216 VTSLNT-LPNGESVFTSKNMLQP-------NNSCQFLYCIKPKPEVAKHIKTLREANNVG 267

Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
           KL I WR+N GE GRLQT Q+       K++ L V++  ++V I   F  + ++TN T +
Sbjct: 268 KLDIVWRSNFGEKGRLQTSQLQRLPFEYKDLRLEVIDAENIVKIGTIFTFQCRVTN-TAE 326

Query: 365 EQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITV 424
                 I L      +          +  L P +     +F L +  +KLG+ +++ + +
Sbjct: 327 HAMKLHITLETKAFADCPYTGSANFVLDVLQPGQF---AEFPLTICPSKLGLIKVSPLLI 383

Query: 425 FDKLEKITYDSLPDLEIFV 443
            D L+   +     +E+FV
Sbjct: 384 VDTLKNEQFLMTKVVEVFV 402


>gi|332018225|gb|EGI58830.1| UPF0533 protein C5orf44-like protein [Acromyrmex echinatior]
          Length = 402

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 123/438 (28%), Positives = 204/438 (46%), Gaps = 46/438 (10%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H L  +VMRL RP+L     +  D TDL             + L   + +D TT +    
Sbjct: 9   HLLTLKVMRLTRPTLASPMVVTCDSTDL-----------PGNTLNNELKNDCTTLQG--- 54

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                       +++ +   +VLPQ+FG IYLGE F SY+ ++N S   V++V +KA++Q
Sbjct: 55  -----------MEALAIGQFMVLPQSFGNIYLGEIFSSYLCVHNGSNQVVKNVTVKADLQ 103

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  Q ++ L ++    + +      D ++ H+VKE+G H LVC   Y++  G      ++
Sbjct: 104 TSTQ-VIPLSSNNLEGKELAPDSTVDEVIHHEVKEIGTHILVCEVSYTNQIGPSLSFRKY 162

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           FKF V  PL V+TK    +       +  +LEA I+N T   + +++V  E S  +S T 
Sbjct: 163 FKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLEKVALESSHLFSVTT 216

Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
           L         N +   I+    L+ +G     YLY LK        P  +Q +  +GKL 
Sbjct: 217 LNT-------NDEGDSIYGSVNLLDAGCS-RQYLYCLKPQLSLLKDPKMMQNATNIGKLD 268

Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
           I WR+NLGE GRLQT Q+        ++ + + ++P    +++P      + N +++   
Sbjct: 269 IVWRSNLGERGRLQTSQLQRMAPEYGDLRVLIKDIPLKAYLEEPVNCTCHIINTSERS-- 326

Query: 368 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 427
             E+ LS   ++    +   G+    +  ++   S D  L  I    G+  I+G+ + D 
Sbjct: 327 -MELLLSLESNNS---IAWCGMSDTIIGTLKPGVSMDIPLCFITLDTGIITISGLKLTDT 382

Query: 428 LEKITYDSLPDLEIFVDQ 445
             K  YD     +IFV+Q
Sbjct: 383 FLKRVYDYDDLAQIFVNQ 400


>gi|332373924|gb|AEE62103.1| unknown [Dendroctonus ponderosae]
          Length = 402

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 122/429 (28%), Positives = 195/429 (45%), Gaps = 53/429 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDL---FIGEDIFDDPIAASNLPPLISSDVTTNKS 64
           H LA +VMRL RP+L    P+  D  DL    +   +  DP A                 
Sbjct: 6   HLLALKVMRLTRPTLASPLPVTCDSKDLPGNLLNNVLQQDPTAVP--------------- 50

Query: 65  SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
                         +++I +   L+LPQ    IYLGETF SYI + + +T  V ++ +K 
Sbjct: 51  -------------GSETIAIGQFLLLPQNPVNIYLGETFSSYICVYSETTQIVYNITVKV 97

Query: 125 EIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
           ++QT  Q++ L + S +    + +    + ++ H+VKE+G H LVC   Y +  G     
Sbjct: 98  DLQTTSQKLSLANNSST--TKLNSDETVNTVIHHEVKEIGPHILVCEVAYQNSAGVLMSF 155

Query: 185 PQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS 244
            +FFK  V  PL V+TK    +       +  +LEA ++N T   + +++V  + S  ++
Sbjct: 156 RKFFKIQVLKPLDVKTKFYNAE------NDDVYLEAQVQNITNGPICLEKVSLDASHLFN 209

Query: 245 ATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
            T L         N  + E     + +     I  YLY L      SS    + G+  +G
Sbjct: 210 VTCLN--------NTPTGESIFGNITLLQPQSISQYLYCLTPTDKLSSDLKSLSGATNIG 261

Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
           KL I WR+NLGE GRLQT Q+   +    EI+L++ E+P+ V I++ F  K KL N  ++
Sbjct: 262 KLDIVWRSNLGEKGRLQTSQLQRMSPDFGEIKLSITELPNFVVIEELFTFKCKLANNGER 321

Query: 365 EQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITV 424
                E  L   ++       I+G ++ AL P     S       I    G++ ++G+ +
Sbjct: 322 T---VEFILYLENTRNIAWCGISGRKLEALPP---HSSKILEFKCIPLVPGLRTLSGVKL 375

Query: 425 FDKLEKITY 433
            D   K TY
Sbjct: 376 VDTFTKRTY 384


>gi|307105123|gb|EFN53374.1| hypothetical protein CHLNCDRAFT_137142 [Chlorella variabilis]
          Length = 467

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 121/445 (27%), Positives = 191/445 (42%), Gaps = 112/445 (25%)

Query: 89  VLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLL-DTSKSPVESIR 147
            +P      + G +F + I+  N S   +  V  KAE+ T++ R+ LL D++ SP+  + 
Sbjct: 44  AMPALAAGGFAGRSFAAIIAACNYSDAPITLVGFKAELSTERSRLALLHDSAASPLPRLA 103

Query: 148 AGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKV 207
           AG R+D +V+HD+K+LG HTL C+A ++ GEGER+   Q F F   NPL VRTK R  +V
Sbjct: 104 AGQRHDLLVKHDIKDLGVHTLTCSASFTCGEGERRLQAQAFTFSSLNPLVVRTKQR--QV 161

Query: 208 GATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML------------------- 248
           G     E   LEA +EN TK+ + +D + F P+  ++A  +                   
Sbjct: 162 G-----EAVLLEATLENATKAPMLLDAISFFPAPPFAAQRVGGGGASSPPPPPAAGRAGD 216

Query: 249 KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKML--------------SHGSSSP 294
           +  GP S Y      I   P++    GG   +L+ L  L              +   +SP
Sbjct: 217 EPAGPLSSY------IQSLPLIPE--GGASAFLFHLTRLPAAAAGSPGGAMPGASPGTSP 268

Query: 295 VK------------VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEV 342
            +             + S  LGK++I WR  +GE  RLQTQQI       +E+ L +  +
Sbjct: 269 SRAAAAAAAAAAAAAEASGALGKMEIRWRGPMGEMARLQTQQISLPQPAQREVSLALARL 328

Query: 343 PSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDS------------------------ 378
           P  V +  PF   L++ +  D+  GP +I  +   S                        
Sbjct: 329 PGRVAVGAPFTATLRVQSHVDRPVGPLKIAAADAPSPAGSPSRSSSLRASSSGSPSRDGS 388

Query: 379 -------------------DEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
                              D  + V+++      LAP +A    +  L ++A   G Q +
Sbjct: 389 LQGGAVAAAAAAAAAAVCLDGAQSVLVD-----ELAPRQA---VEVQLRMLALAAGQQAL 440

Query: 420 TGITVFDKLEKITYDSLPDLEIFVD 444
             + V  + +   Y +LP  E+FVD
Sbjct: 441 PAMCVVSERDGKQYGALPPAELFVD 465


>gi|268530512|ref|XP_002630382.1| Hypothetical protein CBG04321 [Caenorhabditis briggsae]
          Length = 414

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 130/456 (28%), Positives = 211/456 (46%), Gaps = 60/456 (13%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           +S++     LA RVMRL RP        +  P D F       DP+  +    L++  V 
Sbjct: 5   ISNSSTQQLLALRVMRLARP--------KFAPLDGFS-----HDPVDPTGFGELLAGKV- 50

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
               ++++  SR   HD    + +   L+ PQ F  IYLGETF  Y+++ N S   V +V
Sbjct: 51  ----AEISKESR---HD----LPIGEYLIAPQMFENIYLGETFTFYVNVVNESESNVVNV 99

Query: 121 VIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
            +K E+QT  QR++L        +ES +  G+   ++ H+VKE+G H L+C+  Y    G
Sbjct: 100 SLKCELQTSTQRVVLPCSVQDVTIESTKCDGQ---VISHEVKEIGQHILICSVNYKTLSG 156

Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVK---VGATHFQEITFLEACIENHTKSNLYMDQVE 236
           E  Y  +FFKF VS P+ V+TK    +   V     ++ +FL        K  L   ++ 
Sbjct: 157 ENMYFRKFFKFPVSKPIDVKTKFYSAEDNAVSKRFLEKSSFLSRIRMFILKRKLRTPRIR 216

Query: 237 FEPSQ--NWSATMLKADG-PHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSS 293
               +  NW    +K     H D   +  ++ KP         I  +L+ L        S
Sbjct: 217 TCSWREWNWIRVSIKVTSISHEDEFPEVGKLLKP-------KDIRQFLFCL--------S 261

Query: 294 PVKVQGS------NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVG 347
           PV V  +        +GKL ++WRT++GE GRLQT  +        ++ L+V + P+ V 
Sbjct: 262 PVDVNNTLGYKDLTSIGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVD 321

Query: 348 IDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHL 407
           + KPF +  +L N +++     ++ L Q  + +  +   +G+ +  L P       DF L
Sbjct: 322 VQKPFEVACRLYNCSERALD-LQLRLEQPSNRQLVICSPSGVSLGQLPPSRY---VDFAL 377

Query: 408 NLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
           N+    +G+Q I+GI + D   K  Y+     +IFV
Sbjct: 378 NVFPVAVGIQSISGIRITDTFTKRHYEHDDIAQIFV 413


>gi|328865155|gb|EGG13541.1| DUF974 family protein [Dictyostelium fasciculatum]
          Length = 493

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 139/447 (31%), Positives = 218/447 (48%), Gaps = 78/447 (17%)

Query: 10  LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
           L  +VMRL +P L    P+  +           DD I+   LPP I      N  +    
Sbjct: 9   LNLKVMRLSKPLLQANNPVLCE----------RDDVISDMILPPTIQPG---NNDT---- 51

Query: 70  RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
                +    + +G++ +L L    G IYLGE F SYIS+NN S  EV++V    E+QT 
Sbjct: 52  -----MGGGIEGLGMTSMLQLQS--GLIYLGEIFTSYISLNNHSPHEVKNV----ELQTT 100

Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFK 189
            QRILLLD+   P+     G   DF+V+ +VKE G + L C   Y   EGE K   +FFK
Sbjct: 101 TQRILLLDSEPKPIPVFGPGFNSDFVVQREVKEFGVNILCCAVTYVTLEGEVKKFKKFFK 160

Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT--- 246
           F VSNPL +++K+             TF+E C+EN T+  L +D V FE +  ++ +   
Sbjct: 161 FQVSNPLGIKSKI-------ISIPNTTFVEVCLENTTQGALLIDTVTFEAADLFTQSNMS 213

Query: 247 --------------MLK-------ADGP----HSDYNAQS--REIFKPP--VLIRSGGGI 277
                         ML+       ++G      +D   QS   EI   P  V +R G   
Sbjct: 214 EVKHSQQPSPQQPPMLQLANSLGSSNGSGWKKSTDSTIQSLMSEIRASPDIVFLREGNS- 272

Query: 278 HNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIEL 337
             YL+++        +  + + +  LGKL I WR+ +GE GRL+T QI    +  +E+E 
Sbjct: 273 RQYLFKVM---PKDPNDFETKNAATLGKLDIVWRSYMGETGRLKTAQI-QRKVCLEEVEC 328

Query: 338 NVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPV 397
           N+V +P+ V ++KPF +  K+ N+T++   P  + L +N  D    ++ING  +  +  +
Sbjct: 329 NLVSIPT-VELEKPFTVTAKIINKTNRILHPLFV-LVRNKMDG---ILING-HLPKIGAL 382

Query: 398 EAFGSTDFHLNLIATKLGVQRITGITV 424
           +A  S +  + +   K G+Q+I+G+ +
Sbjct: 383 QANSSINLDIEMFPLKPGMQQISGLAI 409


>gi|91094103|ref|XP_967297.1| PREDICTED: similar to CG4953 CG4953-PA [Tribolium castaneum]
 gi|270010876|gb|EFA07324.1| hypothetical protein TcasGA2_TC015920 [Tribolium castaneum]
          Length = 404

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 124/430 (28%), Positives = 196/430 (45%), Gaps = 47/430 (10%)

Query: 5   PGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
           P  H LA +VMRL RP+L    P+  D  DL             + L   +  D  + K 
Sbjct: 3   PEEHLLALKVMRLTRPTLATPLPVTCDSKDL-----------PGNLLNVALQQDAASVKG 51

Query: 65  SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
           ++     +FLL              LPQ+   IYLGETF SYI + N +   V +V +K 
Sbjct: 52  TETLSIGQFLL--------------LPQSPVNIYLGETFSSYICVYNETQHIVSNVSVKV 97

Query: 125 EIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
           ++QT  QR+ L  +S  P   +      + ++ H+VKE+G H LVC   Y +  G  K  
Sbjct: 98  DLQTTSQRLPL--SSNPPTPQLTPDDTVNIVIHHEVKEIGNHILVCEVSYQNAVGILKSF 155

Query: 185 PQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS 244
            +FFK  V  PL V+TK    +       +  +LEA ++N T   + +++V  + S  + 
Sbjct: 156 RKFFKIQVLKPLDVKTKFYNAE------NDDVYLEAQVQNITTGPICLEKVALDASHLFK 209

Query: 245 ATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
            T L       +       IF    L+     +  +LY L      SS    + G+  +G
Sbjct: 210 VTSL-------NVTPTGESIFGKTTLLNPQA-VCQFLYCLSPNEKLSSDLKSLSGATNIG 261

Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
           KL I WR+NLGE GRLQT Q+        +I L++ E+P+ V +++ F  K +L N  ++
Sbjct: 262 KLDIVWRSNLGERGRLQTSQLQRMGPDYGDIRLSITELPNFVVLEELFAFKCRLVNNCER 321

Query: 365 EQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITV 424
                E+ +  ++SD      I+G ++  L P     +       I    G++ ++GI +
Sbjct: 322 S---VELMMYLDNSDGLAWCGISGRKLEVLPP---HSTRVLEFKAIPLIPGLRTLSGIKL 375

Query: 425 FDKLEKITYD 434
            D   K TY+
Sbjct: 376 VDTFLKRTYN 385


>gi|196010439|ref|XP_002115084.1| hypothetical protein TRIADDRAFT_58871 [Trichoplax adhaerens]
 gi|190582467|gb|EDV22540.1| hypothetical protein TRIADDRAFT_58871 [Trichoplax adhaerens]
          Length = 427

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 126/432 (29%), Positives = 213/432 (49%), Gaps = 39/432 (9%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H L  +VMRL +P+L    P+  +  DL                PPL+      N   D+
Sbjct: 9   HLLTLKVMRLTKPALQFHTPITCEDHDL------------PGFCPPLLYG---INDQKDI 53

Query: 68  TYRSRFLLH--DSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
             +S   L   D  ++  L  +L LPQ+FG I+LGETF SYI++ N ST+  +D+ IK  
Sbjct: 54  FRQSFNALGVVDGLEAFSLGEMLTLPQSFGNIFLGETFTSYINVQNDSTVAAKDIQIKLH 113

Query: 126 IQTDKQR----ILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
           IQT+ QR    +  +D + S +  ++     + IV +DVKELG H L C+  Y+   GE+
Sbjct: 114 IQTEAQRHPLPLNCMDENASLL--LQPSENVNEIVSYDVKELGIHVLGCSVGYTSPSGEK 171

Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
            +  +FFKF V  PL V+TK  V +       +  ++EA +EN T + +Y+D V+ +PS 
Sbjct: 172 LHFKKFFKFQVLKPLEVKTKFFVTE------DDEVYIEAQVENITPNPMYLDSVKLDPSP 225

Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
           ++    +    P S  ++  +  +  P+ +R       YLY+L  +S       K   + 
Sbjct: 226 SYYLDDINKLLPESGPSSNGKISYLRPMDVR------QYLYRLTPVSPIIEKSDK--SAC 277

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
            +GKL I W T+ GE GRLQT Q+        ++ +N +E+   V ++K F +KL + N 
Sbjct: 278 DVGKLDIQWLTSFGEKGRLQTSQLQRMPRDLNDLRINCIEIADAVPVEKLFTVKLSVINL 337

Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
           T        + L  +++  + ++ +     +AL  ++   S +  +N++    G+  I+G
Sbjct: 338 TSDRIMNLRLML--DNTKVQPLLWVGRSGQVALGELKPGQSIEVSVNILPVYPGLHVISG 395

Query: 422 ITVFDKLEKITY 433
           + + D  +   Y
Sbjct: 396 LQLLDTFKSKVY 407


>gi|383850626|ref|XP_003700896.1| PREDICTED: UPF0533 protein C5orf44 homolog [Megachile rotundata]
          Length = 404

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 128/449 (28%), Positives = 208/449 (46%), Gaps = 51/449 (11%)

Query: 1   MSSTPGT-HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDV 59
           M + P + H L  +VMRL RP+L     +  D TDL             + L   + +D 
Sbjct: 1   METKPKSEHLLELKVMRLTRPTLASPVVVTCDSTDL-----------PGNTLNNELKNDC 49

Query: 60  TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
           T  +                +++ +   +VLPQ+FG IYLGE F SY+ ++N S+  V++
Sbjct: 50  TALQ--------------GMETLAVGQFMVLPQSFGNIYLGEIFSSYLCVHNGSSQLVKN 95

Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE- 178
           V ++A++QT  Q I+ L  S   ++ +      D ++ H+VKE+G H LVC   Y+    
Sbjct: 96  VTVRADLQTSTQ-IISLCGSSGEMKDLAPDNTVDEVIHHEVKEIGTHILVCEVTYTSTNL 154

Query: 179 -GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
            G  +   ++FKF V  PL V+TK    +       +  +LEA I+N T   + +++V  
Sbjct: 155 GGTSQSFRKYFKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLEKVAL 208

Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
           E S  +S + L         N +   I+    L+ +      YLY LK        P  +
Sbjct: 209 ESSHLFSVSTLNT-------NEKGESIYGLVNLLDTDCS-RQYLYCLKPQLSLLKDPKMM 260

Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
             +  +GKL I WR+NLGE GRLQT Q+        +I + + ++P  V +++       
Sbjct: 261 HNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPEYGDIRVTMKDIPLTVYLEQSVNFNCH 320

Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFG-STDFHLNLIATKLGV 416
           + N +++     ++ LS   ++      I+   I  L P    G S D  L LIA + G+
Sbjct: 321 IINTSERS---MDLMLSLESNNSIAWCGISNTTIGTLKP----GISIDIPLCLIALRSGI 373

Query: 417 QRITGITVFDKLEKITYDSLPDLEIFVDQ 445
             I+G+ + D   K  YD     +IFV Q
Sbjct: 374 ITISGLKLVDTFLKRVYDYDNLAQIFVSQ 402


>gi|324516077|gb|ADY46413.1| Unknown [Ascaris suum]
          Length = 366

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 108/358 (30%), Positives = 181/358 (50%), Gaps = 20/358 (5%)

Query: 88  LVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIR 147
           L+ PQ F  IYLGETF  Y+ + N S+    ++ IK ++QT  QR+ L    +    +++
Sbjct: 26  LMAPQIFDNIYLGETFTFYVCVQNDSSQCATEICIKTDLQTTNQRVALHSKLQDSNATLQ 85

Query: 148 AGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKV 207
            G     I+ H++KE+G H LVC   Y     E+ Y  +FFKF V+ P+ VRTK    + 
Sbjct: 86  PGQILGDIISHEIKEVGQHILVCAVTYKTPADEKMYFRKFFKFPVTKPIDVRTKFYNAE- 144

Query: 208 GATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKP 267
              +     +LEA I+N + + + +++V  EPS  +++T +    P    N  S++ F  
Sbjct: 145 --DNMNNDVYLEAQIQNTSATPMILEKVVLEPSDFYTSTEIP---PPLLLNENSKKQF-- 197

Query: 268 PVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILG 327
                +   I  YLY L+  +    S    +G   +GKL + WRTN+GE GRLQT  +  
Sbjct: 198 ---YLNPKDIRQYLYCLRPKT-ADYSLNYYRGGTSIGKLDMVWRTNMGERGRLQTSALQR 253

Query: 328 TTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMI- 386
                 ++ L V ++P+   I + F +  +L N +++     ++ L+ + S +  +V   
Sbjct: 254 MAPGYGDLRLTVEKIPATAKIRQTFEVVCRLHNCSERS---LDLVLTLDGSLQPALVFCT 310

Query: 387 -NGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
            +G+++  L P     + DF L L+    G+Q I+GI V D   K TY+     ++FV
Sbjct: 311 ASGVQLGQLPPN---NTVDFTLELLPITPGLQPISGIRVSDTFLKRTYEHDDIAQVFV 365


>gi|25149719|ref|NP_741010.1| Protein C56C10.7, isoform b [Caenorhabditis elegans]
 gi|351060502|emb|CCD68178.1| Protein C56C10.7, isoform b [Caenorhabditis elegans]
          Length = 417

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 127/465 (27%), Positives = 210/465 (45%), Gaps = 71/465 (15%)

Query: 1   MSSTPGTHS----LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLIS 56
           M+  P + S    LA RVMRL RP        +  P D F       DP+  +    L++
Sbjct: 1   MADNPASSSSQQLLALRVMRLARP--------KFAPVDGFS-----HDPVDPTGFGELLA 47

Query: 57  SDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLE 116
             V     S+++  SR         + +   L+ PQ F  IYLGETF  Y+++ N S   
Sbjct: 48  GKV-----SEISKESR-------QDLPIGEYLIAPQMFENIYLGETFTFYVNVVNESEKT 95

Query: 117 VRDVVIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS 175
           V  V +K E+QT  QR++L      + +ES +  G+   ++ H+VKE+G H L+C+  Y 
Sbjct: 96  VSSVSLKCELQTSTQRVVLPCSVQDATIESSKCEGQ---VISHEVKEIGQHILICSVNYK 152

Query: 176 DGEGERKYLPQFFKFIVSNPLSVRTKV--------RVVKVGATHFQEITFLEACIENHTK 227
              GE  Y  +FFKF VS P+ V+TK         RV+ +    F+ +    +   +  K
Sbjct: 153 TSNGENMYFRKFFKFPVSKPIDVKTKFYSAEVSSNRVLCINVVFFRTMRIKMS--TSKPK 210

Query: 228 SNLYMDQVEFEPSQNW---SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQL 284
             ++  ++      +W   +  ML      +D      ++ KP         I  +L+ L
Sbjct: 211 LKIHQMRICSWKKSSWIQVNIIMLLVSLMSTDEFGDVGKLLKP-------KDIRQFLFCL 263

Query: 285 KMLSHGSSSPVKVQGS------NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELN 338
                   +P  V  +        +GKL ++WRT++GE GRLQT  +        ++ L+
Sbjct: 264 --------TPADVHNTLGYKDLTSIGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLS 315

Query: 339 VVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVE 398
           V + P+ V + KPF +  +L N +++     ++ L Q  +        +G+ +  L P +
Sbjct: 316 VEKTPACVDVQKPFEVSCRLYNCSERALD-LQLRLEQPSNRHLVFCSPSGVSLGQLPPSQ 374

Query: 399 AFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
                DF LN+    +G+Q I+GI + D   K  Y+     +IFV
Sbjct: 375 ---HVDFSLNVFPVTVGIQSISGIRITDTFTKRIYEHDDIAQIFV 416


>gi|307198435|gb|EFN79377.1| UPF0533 protein [Harpegnathos saltator]
          Length = 389

 Score =  154 bits (390), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 120/403 (29%), Positives = 195/403 (48%), Gaps = 30/403 (7%)

Query: 52  PPLISSDVTTNKSSDL--TYRSRFLLHDSADSIGLSGL-----LVLPQAFGAIYLGETFC 104
           P L S  V T  S+DL     +  L +D     G+  L     +VLPQ+FG IYLGE F 
Sbjct: 6   PTLASPVVVTCDSTDLPGNTLNNELKNDCTALQGMEALAIGQFMVLPQSFGNIYLGEIFS 65

Query: 105 SYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELG 164
           SY+ ++N S   V++V++KA++QT  Q I+ L  +    + +      D ++ H+VKE+G
Sbjct: 66  SYLCVHNGSNQVVKNVIVKADLQTSTQ-IISLSGNNLEGKELAPDSTVDEVIHHEVKEIG 124

Query: 165 AHTLVCTALY--SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACI 222
            H LVC   Y  ++  G      ++FKF V  PL V+TK    +       +  +LEA I
Sbjct: 125 THILVCEVSYICANQVGPPLSFRKYFKFQVVKPLDVKTKFYNAE------SDEVYLEAQI 178

Query: 223 ENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLY 282
           +N T   + +++V  E S  +S T L         N + + I+    L+ +      YLY
Sbjct: 179 QNLTAGPICLEKVALESSHLFSVTTLNT-------NDEEKSIYGSVNLLDTSCS-RQYLY 230

Query: 283 QLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEV 342
            LK        P  +Q +  +GKL I WR+NLGE GRLQT Q+        ++ + + ++
Sbjct: 231 CLKPQPSLLKDPKMMQNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPEYGDLRVTLKDI 290

Query: 343 PSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGS 402
           P  V +++P   K  + N +++      + L  N+S     +   G+  M +  ++   S
Sbjct: 291 PLKVYLEEPVNCKCHIINTSERSMDLL-LSLESNNS-----IAWCGMSDMTIGTLKPGAS 344

Query: 403 TDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQ 445
            D  L LI    G+  ++G+ + D   K  Y+     +IFV+Q
Sbjct: 345 IDIPLCLITLDTGIITVSGLKLTDTFLKRVYEYDDLAQIFVNQ 387


>gi|340709998|ref|XP_003393586.1| PREDICTED: UPF0533 protein C5orf44 homolog [Bombus terrestris]
          Length = 404

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 130/449 (28%), Positives = 202/449 (44%), Gaps = 51/449 (11%)

Query: 1   MSSTPGT-HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDV 59
           M + P + H L  +VMRL RP L     +  D TDL             + L   + +D 
Sbjct: 1   METKPKSEHLLELKVMRLTRPMLASPVVITCDSTDL-----------PGNTLNNELKNDC 49

Query: 60  TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
           T  +                +++ +   +VLPQ+FG IYLGE F SY+ ++N S   V++
Sbjct: 50  TALQG--------------METLAVGQFMVLPQSFGNIYLGEIFSSYLCVHNGSNQIVKN 95

Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE- 178
           V +KA++QT  Q I L   S   ++ +      D ++ H+VKE+G H LVC   Y+ G  
Sbjct: 96  VTVKADLQTSTQNISLCGNS-GEMKDLAPDSTVDEVIHHEVKEIGTHILVCEVTYTPGNL 154

Query: 179 -GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
               +   ++FKF V  PL V+TK    +       +  +LEA I+N T   + +++V  
Sbjct: 155 GSTAQSFRKYFKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLEKVSL 208

Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
           E S  +S + L         N +   I+   V I        YLY LK        P  +
Sbjct: 209 ESSHLFSVSTLNT-------NEKGESIYG-LVNILDTDCSRQYLYCLKPQLSLLKDPKMM 260

Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
             +  +GKL I WR+NLGE GRLQT Q+        +I + +  +P  V +++       
Sbjct: 261 HNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPEFGDIRVTMKNIPLTVYLEQSVNFNCH 320

Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFG-STDFHLNLIATKLGV 416
           + N +++     ++ LS   S+      I+   I  L P    G S D  L LI  + G+
Sbjct: 321 IINTSERS---MDLMLSLESSNSIAWCGISNTMIGTLKP----GISIDIPLCLIPLRSGI 373

Query: 417 QRITGITVFDKLEKITYDSLPDLEIFVDQ 445
             I+G+ + D   K  YD     +IFV Q
Sbjct: 374 ITISGLKLTDTFLKRVYDYDDLAQIFVSQ 402


>gi|357609833|gb|EHJ66705.1| hypothetical protein KGM_03665 [Danaus plexippus]
          Length = 402

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 119/401 (29%), Positives = 185/401 (46%), Gaps = 47/401 (11%)

Query: 52  PPLISSDVTTNKSSDL--TYRSRFLLHDSADSIGLSGL-----LVLPQAFGAIYLGETFC 104
           P LIS  + T    DL     + FL  D+   + +  L     L+LPQ+FG IYLGETF 
Sbjct: 21  PALISPKIVTCDFKDLPGNILNNFLKDDATSVVQMETLAAGQFLLLPQSFGNIYLGETFS 80

Query: 105 SYISINNSSTLEVRDVVIKAEIQTDKQRI-LLLDTSKSPVESIRAGGRYDFIVEHDVKEL 163
            Y+ ++N +   V+ V IKA++QT  QRI L    ++SP+  +        ++ H+VK+L
Sbjct: 81  CYVCVHNETNQPVQSVSIKADLQTSSQRIPLTTQQNQSPI-MLDVDETLSDVIHHEVKDL 139

Query: 164 GAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIE 223
           G H LVC   Y           +FFKF V  PL V+TK    +       +  F+EA ++
Sbjct: 140 GTHILVCEVTYMSNYSTLASFRKFFKFEVLKPLDVKTKFYNAE------SDDVFVEAQVQ 193

Query: 224 NHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIH----- 278
           N T   + ++ V  E S  ++   L  D            +F    L++           
Sbjct: 194 NITSGPIILETVALESSHQFTVKSLNEDD-------NGVSVFGDVTLLQPQESCQYSYCL 246

Query: 279 ----NYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKE 334
               N L  +K+L+   +          +GKL I WR+NLGE GRLQT Q+        +
Sbjct: 247 TPKENILKDIKLLAAAKN----------IGKLDIVWRSNLGEKGRLQTSQLQRMIPDYGD 296

Query: 335 IELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG-PFEIWLSQNDSDEEKVVMINGLRIMA 393
           I +    VPS V ID+PF    K+ N +++      ++   QN S     ++  G+    
Sbjct: 297 IRVTYENVPSRVPIDEPFKFNCKIVNASERTLDLILKLRSLQNSS-----LLWCGISNRK 351

Query: 394 LAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
           L P+E   +T  +L ++    G+  +TG+++ D   K TYD
Sbjct: 352 LGPLEPGNTTIVNLTVLPINSGLHTVTGVSLVDLFLKRTYD 392


>gi|350398663|ref|XP_003485265.1| PREDICTED: UPF0533 protein C5orf44 homolog [Bombus impatiens]
          Length = 404

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 128/448 (28%), Positives = 201/448 (44%), Gaps = 49/448 (10%)

Query: 1   MSSTPGT-HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDV 59
           M + P + H L  +VMRL RP+L     +  D TDL             + L   + +D 
Sbjct: 1   METKPKSEHLLELKVMRLTRPTLASPVVITCDSTDL-----------PGNTLNNELKNDC 49

Query: 60  TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
           T  +                +++ +   +VLPQ+FG IYLGE F SY+ ++N S    ++
Sbjct: 50  TALQ--------------GMETLAVGQFMVLPQSFGNIYLGEIFSSYLCVHNGSNQIAKN 95

Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE- 178
           V +KA++QT  Q I L   S   ++ +      D ++ H+VKE+G H LVC   Y+ G  
Sbjct: 96  VTVKADLQTSTQNISLCGNS-GEMKDLAPDSTVDEVIHHEVKEIGTHILVCEVTYTPGNL 154

Query: 179 -GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
               +   ++FKF V  PL V+TK    +       +  +LEA I+N T   + +++V  
Sbjct: 155 SSTAQSFRKYFKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLEKVSL 208

Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
           E S  +S + L         N +   I+   V I        YLY LK        P  +
Sbjct: 209 ESSHLFSVSTLNT-------NEKGESIYG-LVNILDTDCSRQYLYCLKPQLSLLKDPKMM 260

Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
             +  +GKL I WR+NLGE GRLQT Q+        +I + +  +P  V +++       
Sbjct: 261 HNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPEFGDIRVTMKNIPLTVYLEQSVNFNCH 320

Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
           + N +++     ++ LS   S+      I+   I  L P     S D  L LI  + G+ 
Sbjct: 321 IINTSERS---MDLMLSLESSNSIAWCGISNTIIGTLKP---GVSIDIPLCLIPLRSGII 374

Query: 418 RITGITVFDKLEKITYDSLPDLEIFVDQ 445
            I+G+ + D   K  YD     +IFV Q
Sbjct: 375 TISGLKLTDTFLKRVYDYDDLAQIFVSQ 402


>gi|195450486|ref|XP_002072516.1| GK12482 [Drosophila willistoni]
 gi|194168601|gb|EDW83502.1| GK12482 [Drosophila willistoni]
          Length = 437

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 129/432 (29%), Positives = 205/432 (47%), Gaps = 55/432 (12%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL RP+L    P+        +  D+ D     SN+          +K S++
Sbjct: 9   HLLALKVMRLTRPALVAPGPI--------VNCDLRDLLQPFSNVQ-------KKDKKSEV 53

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                         +    +L+LPQ+FG IYLGETF  YI ++N +   V  V +KA++Q
Sbjct: 54  V----------GKPLTAGYILLLPQSFGNIYLGETFSCYICVHNCTAHSVESVTVKADLQ 103

Query: 128 TDKQRILLL--DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP 185
           ++  RI L   +  KS V  +      D ++ ++VKE+G H LVC   Y+   G  + L 
Sbjct: 104 SNTSRINLPINENCKSSV-MLAPDETLDDVIRYEVKEIGTHILVCEVNYTSPAGFSQSLR 162

Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
           +FFKF V  PL V+TK            EI +LEA I+N T     +++VE + S++++ 
Sbjct: 163 KFFKFQVLKPLDVKTKFY-----NAEMDEI-YLEAQIQNVTTGPFCLEKVELDISEHYTV 216

Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSH-GSSSPVKVQGSNVLG 304
           T L    P+ +    S+ + +P            +LY +K  S     S V  Q +NV G
Sbjct: 217 TSLNT-LPNGESVLTSKHMLQP-------NNSCQFLYCIKPKSTIARCSKVLRQFTNV-G 267

Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD- 363
           KL I WR+NLGE GRLQT Q+       K++ L V++  +++ I   F    ++TN ++ 
Sbjct: 268 KLDIVWRSNLGEKGRLQTSQLQRLPFDYKDLCLEVLDAKNIIKIGSTFSFLCRVTNSSEH 327

Query: 364 --KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
             K     +  LS N           G     L  ++    T+F L++  + LG+ R++ 
Sbjct: 328 PMKLHIRLDTKLSTNS--------YTGSADFLLETIQPAERTEFSLSICPSNLGLIRVSP 379

Query: 422 ITVFDKLEKITY 433
           + + D L+   Y
Sbjct: 380 LLLVDTLQNRRY 391


>gi|195146730|ref|XP_002014337.1| GL19004 [Drosophila persimilis]
 gi|194106290|gb|EDW28333.1| GL19004 [Drosophila persimilis]
          Length = 438

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 126/446 (28%), Positives = 203/446 (45%), Gaps = 56/446 (12%)

Query: 5   PGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
           P  H LA +VMRL RP+L VE                         L P++S +      
Sbjct: 6   PDAHLLALKVMRLMRPTL-VE-------------------------LGPVVSCE-----H 34

Query: 65  SDLTYRSRFLLHDS------ADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVR 118
            DL  R     H        A+++    +L+LPQ+FG IYLGETF SYI ++N S   V 
Sbjct: 35  KDLMQRFSSKPHSDVFSGIIAETLSAGQVLLLPQSFGNIYLGETFSSYICVHNCSPQPVE 94

Query: 119 DVVIKAEIQTDKQRI-LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG 177
            + +K ++Q++  RI L L  +      +  G   D ++ ++VKE+G H LVC   Y+  
Sbjct: 95  CINVKTDLQSNTTRINLSLQKNNKSAIILAPGETIDDVIRYEVKEIGTHILVCEVNYTSP 154

Query: 178 EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
            G  + L +FFKF V  PL V+TK    ++      E  +LEA I+N T S   +++VE 
Sbjct: 155 AGYAQSLRKFFKFQVLKPLDVKTKFYNAEI------EEIYLEAQIQNVTTSPFCLEKVEL 208

Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
           + S+ ++   L    P+ +    ++ + +P            +LY +K     ++    +
Sbjct: 209 DSSEEFTVIPLNT-LPNGESVFNTKNMLQP-------NNSCQFLYCIKPKVQKATDIHAL 260

Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
           +  + +GKL I WR+NLGE GRLQT Q+       K++   V+   + V I   F    +
Sbjct: 261 RQLSNVGKLDIVWRSNLGEKGRLQTSQLQRLPYECKDLRFEVINALNTVKIGTIFTFNCR 320

Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
           +TN T +      + L    S E       G     L  +    + +F L++  +KLG+ 
Sbjct: 321 VTN-TSEHTMKLHVRLVTKLSPE---CQYTGCADFKLDELNTGENAEFPLSVSPSKLGLI 376

Query: 418 RITGITVFDKLEKITYDSLPDLEIFV 443
           +I  + + D      Y     +E+FV
Sbjct: 377 KIADLLLVDTENNEHYSIEKVVEVFV 402


>gi|170590974|ref|XP_001900246.1| Conserved hypothetical protein [Brugia malayi]
 gi|158592396|gb|EDP30996.1| Conserved hypothetical protein, putative [Brugia malayi]
          Length = 399

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 119/436 (27%), Positives = 201/436 (46%), Gaps = 49/436 (11%)

Query: 10  LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
           L  +VMR  RP  +    + +DP D                   LI S +          
Sbjct: 10  LTLKVMRFARPKFYENICMPIDPVD---------------TTSQLIGSAL---------- 44

Query: 70  RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
             R    ++AD I +   L+ PQ F  IYLGETF  Y+ + N S     D+ IK ++QT 
Sbjct: 45  -CRLTGQETAD-IPIGKYLMAPQKFENIYLGETFTFYVCVQNISDKFATDICIKTDLQTT 102

Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFK 189
            QR  L    +     ++ G     ++ H++KE+G H LVC   Y   + E  Y  +FFK
Sbjct: 103 SQRNALSSQLQEANAVLKPGECLGEVITHEIKEIGQHILVCAVSYKTPKNE-MYFRKFFK 161

Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
           F V+ P+ VRTK    +    +     +LEA I+N ++  + +++V  EPS  + ++ + 
Sbjct: 162 FPVTKPIDVRTKFYNAE---DNLNNDVYLEAQIQNTSELPMVLEKVILEPSDFYLSSEIS 218

Query: 250 ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQIT 309
              P ++     +   KP         I  YL+ LK  +   S     +G+++ GKL + 
Sbjct: 219 P--PETENGTMDQSYLKP-------SDIRQYLFCLKPKTTDYSLNYFRKGTSI-GKLDMV 268

Query: 310 WRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPF 369
           WRT +GE GRLQT  +        ++ L + ++P+ V   + F +  +L N +++     
Sbjct: 269 WRTGMGERGRLQTSALQRMAPGYGDLRLTIEKIPATVKXLQSFRMVCRLRNCSERS---L 325

Query: 370 EIWLSQNDSDEEKVVM--INGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 427
           ++ L+ +   +  +    I+G+ +  LAP     +TDF + L+    G+Q I+GI V D 
Sbjct: 326 DLVLTLDGKLQPNMAFCSISGIELGQLAPN---STTDFSIELLPLTPGLQSISGIRVTDT 382

Query: 428 LEKITYDSLPDLEIFV 443
             + TY+     ++FV
Sbjct: 383 FLRRTYEHDDIAQVFV 398


>gi|380014781|ref|XP_003691396.1| PREDICTED: UPF0533 protein C5orf44 homolog [Apis florea]
          Length = 404

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 126/448 (28%), Positives = 201/448 (44%), Gaps = 49/448 (10%)

Query: 1   MSSTPGT-HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDV 59
           M + P + H L  +VMRL RP L     +  D TDL             + L   + +D 
Sbjct: 1   METKPKSEHLLELKVMRLTRPMLASPVVVTCDSTDL-----------PGNTLNNELKNDC 49

Query: 60  TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
           T  +                +++ +   +VLPQ+FG IYLGE F SY+ ++N S   V++
Sbjct: 50  TALQ--------------GMETLAVGQFMVLPQSFGNIYLGEIFSSYLCVHNGSNQLVKN 95

Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS--DG 177
           V++KA++QT  Q I L   S   ++ +      D ++ H+VKE+G H LVC   Y+  + 
Sbjct: 96  VIVKADLQTSTQIISLCGNS-GEMKDLAPDNTVDEVIHHEVKEIGTHILVCEVSYTPVNL 154

Query: 178 EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
               +   ++FKF V  PL V+TK    +       +  +LEA I+N T   + +++V  
Sbjct: 155 SNTAQSFRKYFKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLEKVSL 208

Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
           E S  +S + L         N +   I+   V I        YLY LK        P  +
Sbjct: 209 ESSHLFSVSTLNT-------NERGESIYG-SVNILDTDCSRQYLYCLKPQISLLKDPKMM 260

Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
             +  +GKL I WR+NLGE GRLQT Q+        +I + +  +P  V +++       
Sbjct: 261 HNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPEYGDIRVTMKNIPLTVYLEQMMNFNCH 320

Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
           + N +++      I L  N S     +   G+    +  ++   S D  L LIA + G+ 
Sbjct: 321 IINTSERSMDLMLI-LESNSS-----IAWCGISNTMIGTLKPGVSIDIPLCLIALRSGII 374

Query: 418 RITGITVFDKLEKITYDSLPDLEIFVDQ 445
            I+G+ + D      YD     +IFV Q
Sbjct: 375 TISGLKLKDTFLNRVYDYDDLTQIFVSQ 402


>gi|110750830|ref|XP_624799.2| PREDICTED: UPF0533 protein C5orf44 homolog [Apis mellifera]
          Length = 404

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 127/452 (28%), Positives = 199/452 (44%), Gaps = 57/452 (12%)

Query: 1   MSSTPGT-HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDV 59
           M + P + H L  +VMRL RP L     +  D TDL             + L   + +D 
Sbjct: 1   METKPKSEHLLELKVMRLTRPMLASPVVVTCDSTDL-----------PGNTLNNELKNDC 49

Query: 60  TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
           T  +                +++ +   +VLPQ+FG IYLGE F SY+ ++N S   V++
Sbjct: 50  TALQ--------------GMETLAIGQFMVLPQSFGNIYLGEIFSSYLCVHNGSNQLVKN 95

Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS---- 175
           V++KA++QT  Q I L   S   ++ +      D ++ H+VKE+G H LVC   Y+    
Sbjct: 96  VIVKADLQTSTQIISLCGNS-GEMKDLAPDNTVDEVIHHEVKEIGTHILVCEVSYTPVNL 154

Query: 176 --DGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMD 233
               +  RKY    FKF V  PL V+TK    +       +  +LEA I+N T   + ++
Sbjct: 155 GNTAQSFRKY----FKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLE 204

Query: 234 QVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSS 293
           +V  E S  +S + L         N +   I+   V I        Y Y LK        
Sbjct: 205 KVSLESSHLFSVSTLNT-------NEKGESIYG-SVNILDTDCSRQYFYCLKPQISLLKD 256

Query: 294 PVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFL 353
           P  +  +  +GKL I WR+NLGE GRLQT Q+        +I + +  +P  V +++   
Sbjct: 257 PKMMHNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPEYGDIRVTMKNIPLTVYLEQMMN 316

Query: 354 LKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATK 413
               + N +++      I  S N       +   G+    +  ++   S D  L LIA +
Sbjct: 317 FNCHIINTSERSMDLMLILESNNS------IAWCGISNTMIGTLKPGVSIDIPLCLIALR 370

Query: 414 LGVQRITGITVFDKLEKITYDSLPDLEIFVDQ 445
            G+  I+G+ + D      YD     +IFV Q
Sbjct: 371 SGIITISGLKLKDTFLNRIYDYDDLTQIFVSQ 402


>gi|393909700|gb|EJD75555.1| hypothetical protein LOAG_17321 [Loa loa]
          Length = 399

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 122/445 (27%), Positives = 205/445 (46%), Gaps = 49/445 (11%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           M+       L  +VMRL RP  +    + +D               +A +   LI S + 
Sbjct: 1   MAEAMKEQLLTLKVMRLARPKFYENMCIPID---------------SADSTSQLIGSAL- 44

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
                      R    ++AD I +   L+ PQ F  IYLGETF  ++ + N S     D+
Sbjct: 45  ----------CRLTGQEAAD-IPIGKYLMAPQKFENIYLGETFSFFVCVQNISDKVAMDI 93

Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
            IK ++QT  QR  L    +     +  G     I+ H++KE+G H LVC   Y   + E
Sbjct: 94  CIKTDLQTTSQRNALPSQLQEANAVLEPGKCLGEIITHEIKEIGQHILVCAVSYKTSKNE 153

Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
             Y  +FFKF V+ P+ VRTK    +    +     +LEA I+N ++  + +++V  EPS
Sbjct: 154 M-YFRKFFKFPVTKPIDVRTKFYNAE---DNLNNDVYLEAQIQNTSELPMVLEKVILEPS 209

Query: 241 QNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS 300
             + ++ +    P    N    + +  P  IR       YL+ LK  +   S     +G 
Sbjct: 210 DFYISSEI---SPPEIENENMEQSYLNPSDIR------QYLFCLKPKTTDYSLNYFRKGI 260

Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
             +GKL + WRT++GE GRLQT  +        ++ L + ++P+ V + +PF +  +L N
Sbjct: 261 -AIGKLDMVWRTSMGERGRLQTSALQRMAPGYGDLRLTIEKIPATVKVLQPFHIVCRLHN 319

Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMI--NGLRIMALAPVEAFGSTDFHLNLIATKLGVQR 418
            +++   P ++ L+ +D  +  +     +G+ +  L P     +TDF L L+    G+Q 
Sbjct: 320 CSER---PLDLVLTLDDKLQPNIAFCSTSGVELGQLPPN---STTDFSLELLPLTPGLQS 373

Query: 419 ITGITVFDKLEKITYDSLPDLEIFV 443
           ++GI V D   + TY+     ++FV
Sbjct: 374 VSGIRVTDTFLRRTYEHDDIAQVFV 398


>gi|391345954|ref|XP_003747246.1| PREDICTED: UPF0533 protein C5orf44 homolog [Metaseiulus
           occidentalis]
          Length = 388

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 106/353 (30%), Positives = 174/353 (49%), Gaps = 25/353 (7%)

Query: 85  SGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVE 144
           S +L LPQAFG IYLGETF SY++++N S+L+V+ V +KAE+Q   Q++ L        +
Sbjct: 46  SDMLCLPQAFGNIYLGETFSSYMTVHNGSSLDVQGVQLKAELQNGTQKVALTPVVVRGSD 105

Query: 145 SIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE-GERKYLPQFFKFIVSNPLSVRTKVR 203
            ++     D I++H+VKE+G H L CT  Y++   GE     ++FKF V  PL V+TK  
Sbjct: 106 VLKPNESLDQIIQHEVKEIGTHLLQCTVDYTNASTGEPMQFCKYFKFQVYKPLDVKTKSY 165

Query: 204 VVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSRE 263
             +       +   LEA ++N T + + + +V  EPS ++  T L       + N     
Sbjct: 166 NAE------NDEVLLEAQLQNITANPVTLAKVSLEPSPHFQVTAL-------NQNDNGES 212

Query: 264 IFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN---VLGKLQITWRTNLGEPGRL 320
           IF    L+        YL+ L +  +      KV+G+     +GKL I W++ +GE GRL
Sbjct: 213 IFGQVNLLNPQDS-RQYLFSL-IPKNRLPQESKVKGTRPPFAIGKLDIIWKSAIGEKGRL 270

Query: 321 QTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDE 380
           QT Q+        +I L +   PS + ++ PF +   + N  ++      + L+ +  ++
Sbjct: 271 QTSQLERVATVYSDIRLVIENYPSKIELETPFTISCTIFNTCER-----ALDLTVSLENQ 325

Query: 381 EKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITY 433
           E ++ +       L  ++A         LI T+ G+Q I GI   +   K  Y
Sbjct: 326 EGLMWLESTG-YELGQIQAHSKMTKDFALIMTRCGLQTIGGIKFTESFLKRVY 377


>gi|358058981|dbj|GAA95379.1| hypothetical protein E5Q_02033 [Mixia osmundae IAM 14324]
          Length = 613

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 118/349 (33%), Positives = 164/349 (46%), Gaps = 75/349 (21%)

Query: 2   SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTT 61
           SS    H L+ RV+RL RPS   E         ++I +D  D                  
Sbjct: 4   SSMTEAHPLSVRVLRLLRPSAAKE-------DTIYIDKDAVDL----------------- 39

Query: 62  NKSSDLTYRSRFLLHDSADSIGLSG--LLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
                L  R+  L  D A     S   LL L   FG IYLGETF  Y++++N     +  
Sbjct: 40  -----LGARNSLLRQDVAQFCDFSAAPLLALSSVFGQIYLGETFNGYLAVHNDQDSPITG 94

Query: 120 VVIKAEIQTDKQRILLLDTSKS---PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
           V +K E+QT + R  L +T      P ES+      + +V H++KE+G H+LVCT  Y+ 
Sbjct: 95  VNLKVEMQTAQNRWTLAETRSGLLKPRESL------ETVVRHELKEIGVHSLVCTVSYTV 148

Query: 177 GEG-----------ERKYLPQFFKFIVSNPLSVRTKVRVVK-VGA---THFQEITFLEAC 221
            EG            ++ L + FKF +SNPLSV+TK+ + K V A    + +E  +LE  
Sbjct: 149 AEGSQQGFAPELGASQRVLKKSFKFSMSNPLSVKTKIHMAKSVTALLDKNQRETAYLELQ 208

Query: 222 IENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYL 281
           I+N T + L  +Q+ FEPSQ    T + A+            IF     + S G I  YL
Sbjct: 209 IQNMTSAPLVFEQMRFEPSQGL--TFVDANS----------SIFDNEAALLSPGDIRQYL 256

Query: 282 YQLKMLSHG-SSSPV----KVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
           Y   ++S   + SPV    KV G   LG+L I WRT  GE G+LQT Q+
Sbjct: 257 Y---IVSPAVTPSPVFESGKVNGQMNLGRLNIVWRTPNGEGGKLQTSQL 302


>gi|389741307|gb|EIM82496.1| DUF974-domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 704

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 90/277 (32%), Positives = 141/277 (50%), Gaps = 27/277 (9%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           LL LP +FGAI LGETF   ++INN + + V  V +K E+QT   ++LL +    P +S+
Sbjct: 67  LLTLPSSFGAIQLGETFSGVLAINNETVVAVDGVNLKIEMQTATNKVLLAELG-GPTQSL 125

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALY---------------SDGEGERKYLPQFFKFI 191
            AG   + IV H++KELG H L CT  Y                +G+ + +   +F+KF 
Sbjct: 126 VAGDTLETIVNHEIKELGQHVLACTVTYQLPPGARPPQPPFDGQNGDPDVQTFRKFYKFA 185

Query: 192 VSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           V+NPLSV+TKV   +  +       +E  FLE  I+N T+  ++ +++ FEP+Q W    
Sbjct: 186 VTNPLSVKTKVHTPRSPSALLSRSEREKVFLEVHIQNLTQEPMWFERMLFEPAQGWQVEE 245

Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LGKL 306
                P      +   +F     +     I  Y+Y L  +   + +     GS + LG+L
Sbjct: 246 GNVLPPSDPDATEPESLFTGSQTLMQPQDIRQYMYILAAVKLPTFAIQHTPGSIIPLGRL 305

Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVP 343
            I+WR++ GEPGRL       T++ S+ I +  V+ P
Sbjct: 306 DISWRSSFGEPGRLL------TSMLSRRIPVPSVQSP 336


>gi|170094860|ref|XP_001878651.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164647105|gb|EDR11350.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 644

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 96/304 (31%), Positives = 151/304 (49%), Gaps = 37/304 (12%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           LL LP +FG+I LGETF S + +NN + +E+    +K E+QT   +I+L +T+  P   +
Sbjct: 70  LLTLPSSFGSIQLGETFSSCLCVNNDAQIEIEVTQMKVEMQTASTKIILSETAD-PGHHL 128

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY--------------LPQFFKFIV 192
            AG     +V H++KELG H L CT  Y      RK                 +F+KF V
Sbjct: 129 AAGKTLQSVVHHEIKELGQHVLACTVTYRSPPNVRKVPGAAEDAGDPTLQTFRKFYKFAV 188

Query: 193 SNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSA--- 245
           +NPLSV+TKV   +  +       +E  FLE  I+N T+  +  +++ FE +  W +   
Sbjct: 189 TNPLSVKTKVHAARCPSALLSGEEREKIFLEVHIQNLTQQPMCFERMRFECADGWESEHG 248

Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LG 304
            +L+++G         + IF  P+ +     I  Y+Y L   +   +  V + G+ + LG
Sbjct: 249 NLLRSEG-----VDNPKGIFSGPLALMQPQDIRQYVYILTTKTPTVAPTVHLPGNVIPLG 303

Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
           +L I+W +  GEPGRL       T++ S+ I L  V+ P  V    P+ LK     +T +
Sbjct: 304 RLDISWTSAFGEPGRLL------TSMLSRRIPLPSVQQP--VSALPPY-LKRSTGQETSR 354

Query: 365 EQGP 368
            Q P
Sbjct: 355 PQSP 358


>gi|426200343|gb|EKV50267.1| hypothetical protein AGABI2DRAFT_64546, partial [Agaricus bisporus
           var. bisporus H97]
          Length = 651

 Score =  139 bits (351), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 86/262 (32%), Positives = 134/262 (51%), Gaps = 28/262 (10%)

Query: 85  SGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVE 144
           S LL LP +FG I LG+TF   + +NN +T  V  + ++ E+QT   + LL  T +    
Sbjct: 23  SDLLTLPPSFGTIQLGQTFSGCLCVNNEATFSVDSIRVRIEMQTVTSKTLLFLTQEPQGR 82

Query: 145 SIRAGGRYDFIVEHDVKELGAHTLVCTALYS--------DGEGERKYLP------QFFKF 190
           ++ +G   + IV +++KELG H L CT  Y          G  E    P      +F+KF
Sbjct: 83  TLSSGDTLELIVSNEIKELGQHVLACTVTYRLPPNVRPIAGASEDPKDPALATFRKFYKF 142

Query: 191 IVSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
           IV+NPL+V+TKV  V+          +E  FLE  I+N T+  ++ +++ FEP++ W   
Sbjct: 143 IVTNPLAVKTKVHPVRSPTALLSPEEREKIFLEIHIQNVTQDTMHFERLSFEPTEEW--- 199

Query: 247 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN---VL 303
             +   P+   N QS  IF  P+ + +   +  Y++ L   S  +  P+ V        L
Sbjct: 200 --QVQDPNFTSNGQS--IFSGPIALVNPQDVRQYIFILSPTSTAALRPLAVHPPGSIFPL 255

Query: 304 GKLQITWRTNLGEPGRLQTQQI 325
           G+L I WR++ GEPGRL T  +
Sbjct: 256 GRLNIVWRSSYGEPGRLLTSML 277


>gi|384493079|gb|EIE83570.1| hypothetical protein RO3G_08275 [Rhizopus delemar RA 99-880]
          Length = 934

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 123/421 (29%), Positives = 191/421 (45%), Gaps = 83/421 (19%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTN---KS 64
           H L+ +VMRL RP      P+  + T+          P+    L  L  SD+T     + 
Sbjct: 24  HLLSLKVMRLSRPQFATTLPVFYESTEA--------SPLV-DGLDSLNISDLTACHPIQP 74

Query: 65  SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
           SD+  R            GLS +L LP AFG IYLGETF + +SINN S + V  V  K 
Sbjct: 75  SDIQIRD----------FGLSQMLKLPSAFGNIYLGETFSTLVSINNESPIPVHQVTTKI 124

Query: 125 EIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
           E+QT  QR LL D  + P+  +  G   D  V H++KELG H LVC+  Y   +G     
Sbjct: 125 ELQTSSQRFLLAD--QPPLNDLSPGANSDITVSHEIKELGVHILVCSVQYIGDDGR---- 178

Query: 185 PQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS 244
                                           FLEA ++N +   +++++++FEPS+++ 
Sbjct: 179 -------------------------------VFLEAQLQNVSAGPMFLERMKFEPSEHFG 207

Query: 245 ATML--KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
              L  + D   + +  Q    F  P  +R       YLY   MLS   +  +  + +N 
Sbjct: 208 FESLNGRMDSEKTVFEDQ----FIHPQDVR------QYLY---MLSPHHADRIS-RTTNA 253

Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPS----VVGIDKPFLLKLKL 358
           LGKL I WR+ +G+ GRLQT Q+       ++IE+    V       V ++ PF L +++
Sbjct: 254 LGKLDIVWRSAMGDMGRLQTSQLTRKAPLLEDIEIQPFWVQQDAEVKVVLETPFRLGIRV 313

Query: 359 TNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQR 418
           TN +++     ++ LS   + +   V+++GL    L  +    ST+  L       G+QR
Sbjct: 314 TNHSNEN---MKLVLSAIKT-KMGSVLLSGLGSRQLGELGPGQSTETELEFFPLTPGLQR 369

Query: 419 I 419
           +
Sbjct: 370 V 370


>gi|390598322|gb|EIN07720.1| DUF974-domain-containing protein [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 662

 Score =  134 bits (337), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 90/264 (34%), Positives = 131/264 (49%), Gaps = 41/264 (15%)

Query: 85  SGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVE 144
           + LL LP AFG+I LGETF S + INN + ++V+ V +K E+QT   +  L D    P  
Sbjct: 65  TNLLTLPAAFGSIQLGETFTSCLCINNEAAVDVQAVSMKVEMQTATTKTTLADIG-GPDF 123

Query: 145 SIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP---------------QFFK 189
           ++  GG  + +V H++KELG H L CT  Y      R + P               +F+K
Sbjct: 124 TLAPGGVSENVVSHEIKELGQHVLACTVSYRLPSSVR-HAPAGSVDPANPHLATFRKFYK 182

Query: 190 FIVSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
           F V+NPLSV+TKV V +  +       +E  FLE  I+N T+  ++ ++++FEPS  W  
Sbjct: 183 FAVTNPLSVKTKVHVPRSPSALLSRTEREKVFLEVHIQNLTQDAMWFERIQFEPSDGWQ- 241

Query: 246 TMLKADGPHSDYNAQ---SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
                   H   +A    S  + +P            +LY L  LS          GS +
Sbjct: 242 --------HDSSSATPVVSESLMQP-------QDTRQFLYVLSPLSIPDFPVTHAPGSIL 286

Query: 303 -LGKLQITWRTNLGEPGRLQTQQI 325
            LG+L I+WR+  GEPGRL T  +
Sbjct: 287 PLGRLDISWRSGFGEPGRLITSTL 310


>gi|409046259|gb|EKM55739.1| hypothetical protein PHACADRAFT_121565 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 724

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 90/271 (33%), Positives = 132/271 (48%), Gaps = 35/271 (12%)

Query: 80  DSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTS 139
           D   +S +L LP AFGAI LGETF S + +NN ++ E+  V ++ E+QT   + +L +  
Sbjct: 60  DLTHISEMLTLPSAFGAIQLGETFSSCLVVNNETSGEIETVTLRVEMQTATTKQVLAEYG 119

Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP-------------Q 186
             P   +  G   + +V H++KELG H L CT  Y    G +   P             +
Sbjct: 120 -GPDYRLAPGDAMENVVHHEIKELGQHVLACTVSYHLPPGHKPVHPAGEGHDPGIQSFRK 178

Query: 187 FFKFIVSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQN 242
           F+KF V+NPLSV+TKV V +  +       +E  FLE   +N T   +++ ++ FE  + 
Sbjct: 179 FYKFAVTNPLSVKTKVHVPRAPSALLSSTEREKVFLEVHTQNLTPDAMWLQRMRFEAVEG 238

Query: 243 WSA----TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQ 298
           W+     T+L    PH   N     IF   + +        YLY   +LS    SP  V 
Sbjct: 239 WNVQDVNTLL---APH---NKDGETIFSDSMALMQPQDTRQYLY---ILSPKELSPFPVN 289

Query: 299 GSN----VLGKLQITWRTNLGEPGRLQTQQI 325
            S      LG+L I+WR+  GEPGRL T  +
Sbjct: 290 HSPGSIIPLGRLDISWRSAFGEPGRLLTSML 320


>gi|440796425|gb|ELR17534.1| hypothetical protein ACA1_062880 [Acanthamoeba castellanii str.
           Neff]
          Length = 408

 Score =  131 bits (330), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 131/453 (28%), Positives = 214/453 (47%), Gaps = 71/453 (15%)

Query: 15  MRLCRPSLHVEPPLRVDPTDL-----FIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
           MRL +P+L  +PP+ V+  D        GED           P + SS+V          
Sbjct: 1   MRLSKPTLQFQPPVLVEADDAPYPLSKTGED----------QPTMTSSNVQ--------- 41

Query: 70  RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
                     ++  LS  L LP+AFG IY+GETFCSYIS+ N +  ++  V ++AE+ T 
Sbjct: 42  ----------NAFSLSPGLNLPRAFGNIYVGETFCSYISLYNHTQSDLHLVGLRAELNTK 91

Query: 130 KQRILLLD-TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFF 188
             + LL+D T+   ++ + AG R+DFIV + V E   H LVCT  Y+ G GE+K   +FF
Sbjct: 92  VLKNLLIDQTTAGSIQRLAAGERHDFIVRYRVVEPTMHILVCTISYAKG-GEKKSFRKFF 150

Query: 189 KFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT-- 246
           KF V +  S   K R+      H ++ T LE  + N  ++ ++++ V++ P+ N      
Sbjct: 151 KFTVVD--SFEWKQRIF-----HIKDDTLLEVQLRNVARNAVFLNNVKYGPAFNPGTARS 203

Query: 247 ---MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN-- 301
               L+     +D    ++ +        S G   +     +     +S  ++++ +   
Sbjct: 204 YLFQLRPRRGAADATMYTKRLRNRVSDADSAGANEDD----EETDSSTSDEMQIELARIK 259

Query: 302 ------VLGKLQITWRTNLGEPGRLQTQQIL--GTTITSKEIELNVVEVPSVVGIDKPFL 353
                 VLGKL ++W T+ GE G   T++IL       S E+E+++  + S + ++ PF 
Sbjct: 260 LEADEMVLGKLLLSWHTSFGETG---TRKILVKHKPSPSPEVEISITSIASAITLETPFP 316

Query: 354 LKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRI-MALAPVEAFGSTDFHLNLIAT 412
             + +TN+  +   P   W+ Q   D    V+  GL     L  + + GS    +  +  
Sbjct: 317 ATVTVTNKLPR---PILPWV-QLAQDHTANVVAAGLSAGFKLEEIPSGGSKSAEVAFLPL 372

Query: 413 KLGVQRITGITVFDKLEKITYDSLPDLEIFVDQ 445
           + G+Q ITGI+V DK     Y + PD EI V Q
Sbjct: 373 QAGIQTITGISVLDKKTGRVY-ACPDHEILVLQ 404


>gi|392596039|gb|EIW85362.1| DUF974-domain-containing protein [Coniophora puteana RWD-64-598
           SS2]
          Length = 660

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 83/258 (32%), Positives = 128/258 (49%), Gaps = 23/258 (8%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
            L LP +FGAI LGETF S +S+NN   +++  V ++ EIQT   + L+ +    P   +
Sbjct: 60  FLTLPSSFGAIQLGETFSSCLSVNNEVNIDIEAVTVRVEIQTMNTKTLVAELG-GPDFKL 118

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER--------------KYLPQFFKFIV 192
             G   + +V+H+VKELG H L C   Y      R              + L +F+KF V
Sbjct: 119 TPGQSLEHVVQHEVKELGQHVLACAVSYRMPSHTRPSAVPAAPGADPNLQTLRKFYKFAV 178

Query: 193 SNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
           +NPLSV+TKV V K          +E  FLE  ++N T+  L+ +++ FE +++W A   
Sbjct: 179 TNPLSVKTKVHVPKSPTASLLEAEREKVFLEVHVQNLTQEPLWFEKIRFECAESWKAIDT 238

Query: 249 KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LGKLQ 307
               P   Y+    E+F   + +     +  Y+Y L      +   V   G+ + LG+L 
Sbjct: 239 AGTEPSKSYD---EELFTDDMSLMQPQDVRQYIYTLVPAVLSTFPLVHPPGTVIALGRLD 295

Query: 308 ITWRTNLGEPGRLQTQQI 325
           I+WR+  GE GRL T  +
Sbjct: 296 ISWRSQFGELGRLLTSML 313


>gi|449547690|gb|EMD38658.1| hypothetical protein CERSUDRAFT_123212 [Ceriporiopsis subvermispora
           B]
          Length = 721

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 103/346 (29%), Positives = 163/346 (47%), Gaps = 58/346 (16%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H L+ +VMR+ RPSL                     +P  +S+ P    S  +T   + L
Sbjct: 6   HLLSLKVMRVSRPSLAST-----------------WEPYYSSSQP---FSQRSTASITSL 45

Query: 68  TYRSRFLLHDSA--DSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
             ++    H +   D    S +L+LP +FG I +GE F S +S+NN +  E+  V ++ E
Sbjct: 46  QGKAPLPGHPNTLRDLAHASEMLMLPSSFGTIQIGEVFTSCLSVNNETNAEIDGVHVRVE 105

Query: 126 IQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER---- 181
           +QT   + +LL+    P   +  G   + +V H++KELG H L CT  Y    G R    
Sbjct: 106 MQTATSKTVLLEMG-GPNSQLAVGASLEKVVSHEIKELGQHVLGCTVSYRLPPGYRPVPG 164

Query: 182 ----------KYLPQFFKFIVSNPLSVRTKVRVVKVGAT----HFQEITFLEACIENHTK 227
                     +   +F+KF V+NPLSV+TKV V +  +     + +E  FLE  I+N T+
Sbjct: 165 TSSEAVDPGVQTFRKFYKFAVTNPLSVKTKVHVPRAPSALLSRNEREKVFLEVHIQNLTQ 224

Query: 228 SNLYMDQVEFEPSQNWSAT------MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYL 281
             +++++V FE S  W A       +  ADG  S +   S  + +P         +  Y+
Sbjct: 225 DGMWLERVRFECSDGWQAQDANRLGLGDADGGESIFTG-SMALLQP-------QDMRQYI 276

Query: 282 YQLKMLSHGSSSPVKVQGSNV--LGKLQITWRTNLGEPGRLQTQQI 325
           Y L   +     P+  Q  ++  LG+L I+WR+  GEPGRL T  +
Sbjct: 277 YILSP-TVPPPFPITHQPGSILPLGRLDISWRSPFGEPGRLLTSML 321


>gi|348690154|gb|EGZ29968.1| hypothetical protein PHYSODRAFT_323413 [Phytophthora sojae]
          Length = 456

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 94/313 (30%), Positives = 151/313 (48%), Gaps = 33/313 (10%)

Query: 78  SADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLD 137
           S     LS +L+LP +FG I+LG TF SYIS+ N  + E+RDV + A IQ    R+ L D
Sbjct: 75  SQHEFALSSMLILPDSFGEIFLGNTFSSYISVINPYSCELRDVGLSANIQCANDRVELHD 134

Query: 138 -----TSK----SPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG-EGERKYLPQF 187
                T K    +PV  + AG   D +V++ + ++G H L     Y D   GE K L +F
Sbjct: 135 NRYARTGKLPPPNPVAVLPAGSSLDMVVDYPLNQVGNHVLRVGVAYVDPITGESKSLRKF 194

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           ++F V NPL +  K       A   + I  +EA I N +K  L++D ++F P   +++  
Sbjct: 195 YRFAVQNPLVITFKQNSATGQALKGEAI--VEAQIRNVSKLPLFVDSIKFLPLPPFTSEE 252

Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLY---QLKMLSHGSSSPV-------KV 297
           +  D        +   I     L+         +Y   +L+ +   S  P          
Sbjct: 253 MGVDPVGKKAEGEQASIQD---LLSVNSSPQTLVYPQEELQRVFRVSYDPASDPTLLSSA 309

Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTT--------ITSKEIELNVVEVPSVVGID 349
           QGS  LG+L + W+T++GE G +Q+Q ++  T            E+ + V E+P  V + 
Sbjct: 310 QGSQNLGRLHVGWKTSMGEAGSVQSQPVMRKTPGAAGHGGAGHSEVAVAVEELPKEVMVG 369

Query: 350 KPFLLKLKLTNQT 362
           +PFL+ + +TN++
Sbjct: 370 QPFLVAVSVTNKS 382


>gi|403417125|emb|CCM03825.1| predicted protein [Fibroporia radiculosa]
          Length = 1166

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 84/260 (32%), Positives = 136/260 (52%), Gaps = 28/260 (10%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           +L+LP +FGAI LGETF S +S+NN ++++V  V +  E+QT   +  + +    P   +
Sbjct: 523 VLMLPSSFGAIQLGETFTSCLSVNNEASVDVESVTLTVEVQTASTKATVAEFG-GPDFRL 581

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP--------------QFFKFIV 192
             G   + +V H++KELG H L CT  Y    G R  +               +F+KF V
Sbjct: 582 AVGESLEKVVGHEIKELGQHALACTISYRLPSGIRAPVAPAADSNDPNLYVFRKFYKFAV 641

Query: 193 SNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
           +NPLSV+TKV V +  +  F    +E  FLE  ++N T+  ++++++  E +  W     
Sbjct: 642 TNPLSVKTKVHVPRAPSATFSRVEREKVFLEIHVQNLTQDAMWLERMRLECADGW----- 696

Query: 249 KADGPH--SDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LGK 305
           KAD  +  +D +A S  +F   + +     +  Y+Y L  ++          GS V LG+
Sbjct: 697 KADDANLMNDEDA-SESVFSGSMGLMQPHDMRQYIYILSPVNLALFPTAHQPGSVVPLGR 755

Query: 306 LQITWRTNLGEPGRLQTQQI 325
           L ITW+++ GEPGRL T  +
Sbjct: 756 LDITWKSSFGEPGRLLTSML 775


>gi|395330058|gb|EJF62442.1| hypothetical protein DICSQDRAFT_160869 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 718

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 84/260 (32%), Positives = 136/260 (52%), Gaps = 26/260 (10%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           LL LP +FGAI LGETF S +S+NN + ++V  V++  E+QT   + LL +    P + +
Sbjct: 67  LLTLPSSFGAIQLGETFSSCLSVNNEANVDVEGVIVHVEMQTASTKTLLAEFG-GPEQRL 125

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER--------------KYLPQFFKFIV 192
             G   + IV H++KELG H L CT  Y    G R              +   +F+KF V
Sbjct: 126 GVGQSLEKIVSHEIKELGQHVLGCTVSYRMPPGVRPPPGQSADLQDPSVESFRKFYKFAV 185

Query: 193 SNPLSVRTKVRVVKVG----ATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
           +NPLSV+TKV + +      ++  +E   LE  I+N T+  +++++++F+    W A   
Sbjct: 186 TNPLSVKTKVHLPRSPTALLSSEEREKVLLEVHIQNLTQDAMWLERMQFDCVDGWQAQ-- 243

Query: 249 KADGPH-SDYNAQSRE-IFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LGK 305
             D  +  D  A S+E +F     +     +  Y+Y L+ ++          G+ + LG+
Sbjct: 244 --DANYLEDAAAGSKESLFTGSTALMQPQDVRQYIYILQPINLPPFPITHAPGAILALGR 301

Query: 306 LQITWRTNLGEPGRLQTQQI 325
           L I+WR++ GEPGRL T  +
Sbjct: 302 LDISWRSSFGEPGRLLTSTL 321


>gi|302690716|ref|XP_003035037.1| hypothetical protein SCHCODRAFT_232409 [Schizophyllum commune H4-8]
 gi|300108733|gb|EFJ00135.1| hypothetical protein SCHCODRAFT_232409 [Schizophyllum commune H4-8]
          Length = 617

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 86/260 (33%), Positives = 127/260 (48%), Gaps = 31/260 (11%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           LL+LP +FG+I LGETF S +  NN + ++V  V +K E+QT   ++ L +    P  ++
Sbjct: 53  LLMLPASFGSIQLGETFSSCLCANNDTQVDVDSVTVKVEMQTATTKVTLGEFG-GPQYTL 111

Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP-------------QFFKFIVS 193
            AG   + +V H+VKELG H L  T  Y      R  +P             +F+KF+V+
Sbjct: 112 AAGDTLECLVTHEVKELGQHVLSATVSYRLPPNARPPVPAEDPDDPQMQHFRKFYKFVVT 171

Query: 194 NPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
           NPLSV+TKV   K  +       ++  FLE  I+N T+  L+ +++  EP   W      
Sbjct: 172 NPLSVKTKVHTPKSPSAQLSTSERDKIFLEVHIQNLTQEPLWFERMLLEPVDGWDV---- 227

Query: 250 ADGPHSDYNAQSRE---IFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LGK 305
                 D N  S E   IF     +     +  Y+Y +   S      V   GS + LG+
Sbjct: 228 -----EDTNLGSTEEDGIFTGTTALMGPQDMRQYIYIMSSQSPPRIPVVHSPGSIIPLGR 282

Query: 306 LQITWRTNLGEPGRLQTQQI 325
           L I WR++ GEPGRL T  +
Sbjct: 283 LDIAWRSSFGEPGRLLTSML 302


>gi|301119703|ref|XP_002907579.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262106091|gb|EEY64143.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 358

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 98/307 (31%), Positives = 155/307 (50%), Gaps = 32/307 (10%)

Query: 82  IGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLD---- 137
             LS +L+LP +FG I+LG TF SYIS+ N  T E+RDV + A IQ    R+ L D    
Sbjct: 33  FALSSMLILPDSFGEIFLGNTFSSYISVINPYTCELRDVGLSANIQCANDRVELHDNRYA 92

Query: 138 -TSK----SPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG-EGERKYLPQFFKFI 191
            T K    +PV  + AG   D +V++ +  +G H L     Y D   GE K L +F++F 
Sbjct: 93  RTGKLPPPNPVAMLPAGSSLDMVVDYPLNLVGNHVLRVGVAYVDPVTGENKSLRKFYRFA 152

Query: 192 VSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKAD 251
           V NPL +  K +       H + I  +EA I N +K  L++D ++F P   +++  +  +
Sbjct: 153 VQNPLVITFK-QNSPASQQHGEAI--VEAQIRNVSKLPLFVDSIKFLPLAPFTSEEMVVN 209

Query: 252 GPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSH-------GSSSP--VKVQGSNV 302
              S  N   R   K   L+    G    +Y  + L          +S P  +  QGS  
Sbjct: 210 ---SGGNRGERPSIKE--LLSLNNGPQTLVYPQEELQRVFRVWYDPASDPSLLTTQGSQN 264

Query: 303 LGKLQITWRTNLGEPGRLQTQQIL----GTTITS-KEIELNVVEVPSVVGIDKPFLLKLK 357
           LG+L + W+T++GE G +Q+Q ++    GT+     E+ + + E+P+ V + +PFL  + 
Sbjct: 265 LGRLHVGWKTSMGEAGSVQSQPVVRKVPGTSGGGHSEVLVAMQELPTEVVVGQPFLAAIS 324

Query: 358 LTNQTDK 364
           +TN T +
Sbjct: 325 VTNNTTR 331


>gi|237831303|ref|XP_002364949.1| hypothetical protein TGME49_057020 [Toxoplasma gondii ME49]
 gi|211962613|gb|EEA97808.1| hypothetical protein TGME49_057020 [Toxoplasma gondii ME49]
 gi|221487204|gb|EEE25450.1| conserved hypothetical protein [Toxoplasma gondii GT1]
 gi|221506886|gb|EEE32503.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 395

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 108/430 (25%), Positives = 190/430 (44%), Gaps = 57/430 (13%)

Query: 10  LAFRVMRLCRPSLHVEP--PLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           L  +VMRL +PS++ EP   LR+D                      + S D +  K  + 
Sbjct: 9   LTLKVMRLSQPSIYAEPWPLLRIDE---------------------VTSEDQSVKKKLE- 46

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
             R R  +  + +S   +  L+LP + G I+ GETF +YI+I+NSS  +  +V+I+ E+ 
Sbjct: 47  --RERVCVERALES---THALLLPASQGRIFSGETFSAYINISNSSNAQAVNVIIQVELS 101

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCT-ALYSDGEGERKYLPQ 186
             ++R LL D S+ P+ S+  G  +D  + H++ E G +TLVC  + Y    GE+K   +
Sbjct: 102 IGQKRDLLFDNSQDPIRSLTPGNSFDCTIVHELTESGTYTLVCAVSHYLSAVGEQKSFKK 161

Query: 187 FFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
            FKF    P  V  +V +++  A       F+E  +EN ++  +Y+        ++    
Sbjct: 162 SFKFAAHPPFGVGHRVVLLQGRA-------FVECSVENVSQEAVYLSDASIFCVEDIEGV 214

Query: 247 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK-MLSHGSSSPVKVQGSNVLGK 305
            L +  P    N      FKP          +N ++ L    +     P  ++   VLG+
Sbjct: 215 RLDSGPPSDGRNHNGLHYFKP-------HDRYNLVFSLTPTATKLGEDPSFIRRLPVLGQ 267

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L + WRT+ G  G +    +  +   S +        P  + +++PF ++++++   ++ 
Sbjct: 268 LALEWRTSTGGAGCMHEYTLTNSLAESSK--------PLSLRVERPFQVEIEVSAHVEQV 319

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
             P  I      SD E  V I G     L  ++ F    + L  +    G   + GI V+
Sbjct: 320 FCPVLIL---RPSDLEPFV-IQGSTTRPLGIIDMFTPRRYILEAVCLSPGFHSVKGIMVY 375

Query: 426 DKLEKITYDS 435
           D   + T D+
Sbjct: 376 DPDTQQTADA 385


>gi|299753765|ref|XP_001833471.2| hypothetical protein CC1G_05171 [Coprinopsis cinerea okayama7#130]
 gi|298410453|gb|EAU88405.2| hypothetical protein CC1G_05171 [Coprinopsis cinerea okayama7#130]
          Length = 633

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 85/261 (32%), Positives = 130/261 (49%), Gaps = 32/261 (12%)

Query: 85  SGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKS-PV 143
           S LL LP +FG+I LGETF S + +NN +T  V    IK E+QT   ++ L +  ++ P 
Sbjct: 48  SELLTLPASFGSIQLGETFSSCLCVNNEATSAVEVKQIKVEMQTVTTKVTLSELDETGPT 107

Query: 144 ESIRAGGRYDFIVEHDVKELGAHTLVCTALYS--------DGEGERKYLP------QFFK 189
           + + AG   + IV H++KELG H L CT  Y          G  E    P      +F+K
Sbjct: 108 KMLEAGDSLETIVHHEIKELGQHVLACTVTYRLPPSARPVPGAAEDASDPSLLTFRKFYK 167

Query: 190 FIVSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
           F V+NPLSV+TKV   K  +       ++  FLE  I+N T ++++ +++ FE ++ +  
Sbjct: 168 FAVTNPLSVKTKVHTSKSPSASLSLDERDKLFLEVHIQNLTPASMFFEKMRFECAEGF-- 225

Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LG 304
                     D +  +  +F              Y+Y L   S   + P    GS + LG
Sbjct: 226 ----------DVDDINGPVFSGSFATMQPQDTRQYVYILTPKSTTVAPPALPPGSIIPLG 275

Query: 305 KLQITWRTNLGEPGRLQTQQI 325
           +L I+WR++ GEPGRL T  +
Sbjct: 276 RLDISWRSSYGEPGRLLTSML 296


>gi|392567447|gb|EIW60622.1| DUF974-domain-containing protein [Trametes versicolor FP-101664
           SS1]
          Length = 716

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 86/261 (32%), Positives = 134/261 (51%), Gaps = 29/261 (11%)

Query: 84  LSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPV 143
           ++ LL LP AFGAI LGETF S +SINN + ++V  V+I+ E+QT   + LL +   S  
Sbjct: 66  ITDLLTLPAAFGAIQLGETFSSCLSINNDANIDVDGVIIRVEMQTASSKALLAEFGGS-N 124

Query: 144 ESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP-------------QFFKF 190
           + +  G   + +V H++KELG H L C+  Y    G R   P             +F+KF
Sbjct: 125 QRLGVGETLEKVVSHEIKELGQHVLGCSVSYRVPPGVRNLPPAADAQDPSIQTFRKFYKF 184

Query: 191 IVSNPLSVRTKVRVVKVG----ATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS-- 244
            V+NPLSV+TKV + +      +   +E  FLE  I+N T+  +++++++FE    W   
Sbjct: 185 AVTNPLSVKTKVHLPRSPTALLSAQEREKVFLEVHIQNLTQDAMWLERMQFECIDGWQVQ 244

Query: 245 -ATMLKADGPHSDYNAQSRE-IFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
            A +L+      +    S+E +F     +     +  Y+Y L            + G  +
Sbjct: 245 DANILE------NTATGSKEYLFSGTTALMQPQDLRQYIYILSPKVLPPFPIAHIPGHIL 298

Query: 303 -LGKLQITWRTNLGEPGRLQT 322
            LG+L I+WR+  GEPGRL T
Sbjct: 299 PLGRLDISWRSCYGEPGRLLT 319


>gi|242004692|ref|XP_002423213.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212506184|gb|EEB10475.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 377

 Score =  124 bits (311), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 118/423 (27%), Positives = 188/423 (44%), Gaps = 91/423 (21%)

Query: 8   HSLAFR---VMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
           H+L  +   +MRL +P+L    PL V      + EDI ++ +          +D+TT   
Sbjct: 11  HTLTLKGLLIMRLTKPAL--SSPLIVTNESKDLPEDILNNDL---------KNDITTVNE 59

Query: 65  SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
           ++     +FLL              +PQ+FG I+LGE+F  YI I+N S    ++V +KA
Sbjct: 60  TETLAVGQFLL--------------IPQSFGTIHLGESFLGYILIHNDSNQIAKNVHVKA 105

Query: 125 EIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
           ++QT  Q+I LL                    EH + EL  H               K +
Sbjct: 106 DLQTVTQKIPLL--------------------EHKLSELSPH---------------KTI 130

Query: 185 PQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS 244
            QFFKF V  PL ++TK    +       +  FLEA ++N T   +++++V FE S  + 
Sbjct: 131 DQFFKFEVKTPLDLKTKFYNAE------SDEVFLEAQVQNITAGPIHLEKVSFESSDLFK 184

Query: 245 -ATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
            +++ K D   SD +      F+             Y+Y L  +     S   + G+  +
Sbjct: 185 VSSLYKTDEIKSDDSLLQPNEFR------------QYVYCLTPIYDSDGS--HLFGATNI 230

Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
           G+L I WR NLGE GRLQT Q+        EI L+V  +P++V I++PF    K++N   
Sbjct: 231 GRLDIAWRYNLGEKGRLQTSQLQKMAPDFGEIRLSVHNLPNIVKIEEPFKFLCKISNLR- 289

Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
                 ++ LS   S  + V +  G     +  ++  GS    L L+    G+  I+GI 
Sbjct: 290 ----AMDLVLSLEKSHPDLVWI--GTSGQHIGKLDIGGSKVIELTLVPLSAGLHNISGIR 343

Query: 424 VFD 426
           + D
Sbjct: 344 LKD 346


>gi|393216624|gb|EJD02114.1| DUF974-domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 807

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 110/375 (29%), Positives = 165/375 (44%), Gaps = 69/375 (18%)

Query: 6   GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
           G H L+ +VMR+ RPSL          +  F        P++     PL     T     
Sbjct: 8   GQHPLSLKVMRVSRPSLASHWQPFFSSSPSFSAHSTAH-PLSLQGAEPLPGHPKTLR--- 63

Query: 66  DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
           DLT+               S LL LP AFGAI LGETF   +S+NN   L V  V  + E
Sbjct: 64  DLTH--------------ASNLLTLPAAFGAIQLGETFACVLSVNNEVGLPVDSVRARVE 109

Query: 126 IQTDKQRILLLDTSKSPVESIR----------------AGGRYDFIVEHDVKELGAHTLV 169
           +QT   ++LL + +    +S R                 G   +  V  ++KELG H L 
Sbjct: 110 MQTATSKVLLAEVNAG--DSDRDVKMEETSGSGTGTLGTGDSLELCVATEIKELGQHVLA 167

Query: 170 CTALYSDGEGER--------------KYLPQFFKFIVSNPLSVRTKVRVVKVGATHF--- 212
           CT  Y    G R              +   +F+KF+V+NPLSV++KV V K         
Sbjct: 168 CTVTYRTPPGMRPATSGAYNAEDPFMQTFRKFYKFMVTNPLSVKSKVHVPKSPTALLSRS 227

Query: 213 -QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS-----REIFK 266
            ++  FLE  I+N T++ ++ +++  E  + W      A  P  D ++ +     + IF 
Sbjct: 228 ERDKVFLEVHIQNLTQAPMWFEKIRLEAVEGWDVVDANAISPPFDLSSTADAENEKSIFS 287

Query: 267 PPVLIRSGGGIHNYLYQL--KMLSHGSSSPV-KVQGSNV-LGKLQITWRTNLGEPGRLQT 322
             + +     +  Y+Y L  K     +S P   V G+ + LG+L I+WR+++GEPGRL  
Sbjct: 288 GSMALMPPHDMRQYVYILTPKFTPRNTSVPAPPVPGTVIPLGRLDISWRSSMGEPGRLL- 346

Query: 323 QQILGTTITSKEIEL 337
                T+I S+ I L
Sbjct: 347 -----TSILSRRIPL 356


>gi|393245725|gb|EJD53235.1| DUF974-domain-containing protein [Auricularia delicata TFB-10046
           SS5]
          Length = 657

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/260 (31%), Positives = 128/260 (49%), Gaps = 21/260 (8%)

Query: 80  DSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTS 139
           D   +S +L+LP +FGAI LGETF S + INN +  +V  V +K E+QT   ++LL    
Sbjct: 48  DLTAISDVLMLPASFGAIQLGETFSSCLCINNDTDGDVHAVALKVEMQTATTKVLLAHLG 107

Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY-------SDGEGERKYLPQFFKFIV 192
              +         + +V H++KELG H L CT  Y       ++ E     + +++KF V
Sbjct: 108 GPDLTLTAEKNFVETVVHHEIKELGQHVLSCTITYRLPGAPPANDEDGLSTIRKYYKFAV 167

Query: 193 SNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
           +NPLSV+TKV   +  +       +E  FLE  ++N T   L+ +Q++FE +  W    L
Sbjct: 168 TNPLSVKTKVHTPRAPSALLSRTEREKVFLEVHVQNLTAEPLWFEQMKFECADGW----L 223

Query: 249 KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSS-PVKVQGSNV--LGK 305
             D   ++  +    IF     +     +  Y+Y L        S PV      V  LG+
Sbjct: 224 VDD---ANLTSHKTSIFSGAAALIQPQDLRQYVYVLTPTPESVPSFPVVHAPGTVISLGR 280

Query: 306 LQITWRTNLGEPGRLQTQQI 325
           L I+WR++ G PGRL T  +
Sbjct: 281 LDISWRSSFGGPGRLLTSML 300


>gi|53136444|emb|CAG32551.1| hypothetical protein RCJMB04_29c21 [Gallus gallus]
          Length = 207

 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 78/218 (35%), Positives = 113/218 (51%), Gaps = 32/218 (14%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    ++F+          L+  D +T K    
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEERDL--PGNLFNQ---------LMKDDPSTVKG--- 55

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                      A+++ L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 56  -----------AEALMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
           T  QR L L  S + V  ++     D    H+VKE+G H LVC   Y+   GE+ Y  +F
Sbjct: 105 TSSQR-LNLSASTAAVAELKPDCCIDDGSPHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENH 225
           FKF V  PL V+TK    +          FLEA I+ +
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDE------VFLEAQIQKY 195


>gi|353240747|emb|CCA72601.1| hypothetical protein PIIN_06538 [Piriformospora indica DSM 11827]
          Length = 650

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 99/336 (29%), Positives = 156/336 (46%), Gaps = 46/336 (13%)

Query: 7   THSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
           +H LA +VMR+ RPSL       +     F       D   AS++   I   +   +   
Sbjct: 5   SHLLALKVMRVSRPSL-------LGQWQPFAEASTHFDAHNASSIT-SIQPHIPNKQHVP 56

Query: 67  LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEI 126
            T R         D   LS  L LP +FG+I LGETF S   + N +  ++  V I+ E+
Sbjct: 57  TTIR---------DLSALSQNLSLPSSFGSISLGETFSSCFCVANMTNYDIEGVHIRVEM 107

Query: 127 QTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP- 185
           Q+   + LLL+    P   +   G  + +V+ ++KELG HTL C   Y    G R   P 
Sbjct: 108 QSASAKSLLLELG-GPEHRLGPLGTLEGVVQSEIKELGQHTLSCIVHYRVPPGLRPPAPS 166

Query: 186 ------------QFFKFIVSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSN 229
                       + ++F VSNP SV+TKV   K  +       +E  FL+  ++N T+ +
Sbjct: 167 DDPSDPRAQLFRKHYRFPVSNPFSVKTKVHTPKSPSALMSRVEREKLFLQIDVQNLTQES 226

Query: 230 LYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQL--KML 287
           ++ +++EF+P   W+ T    D   ++ + ++R+ F  P  +        Y+Y L   ++
Sbjct: 227 MWFERLEFKPVDGWTFT----DA--NESSIEARQAFTGPKTLVQPQDTFQYIYTLIPAVI 280

Query: 288 SHGSSSPVKVQGSNV-LGKLQITWRTNLGEPGRLQT 322
                +P    G+ + LG+L I WRT  GEPGRL T
Sbjct: 281 PRFLINPAP--GAVIPLGRLDIAWRTTFGEPGRLLT 314


>gi|242220364|ref|XP_002475949.1| predicted protein [Postia placenta Mad-698-R]
 gi|220724816|gb|EED78834.1| predicted protein [Postia placenta Mad-698-R]
          Length = 705

 Score =  118 bits (295), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 81/253 (32%), Positives = 129/253 (50%), Gaps = 20/253 (7%)

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
           +L+LP +FGAI LGETF S IS+NN + ++V  VV+  E+QT   + +L      P + +
Sbjct: 67  VLMLPSSFGAIQLGETFTSCISVNNEANMDVESVVLTVEMQTATTKAVLAQFG-GPEQRL 125

Query: 147 RAGGRYDFIVEHDVKEL-------GAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVR 199
             G   + IV H++KEL       G H  +      +  G   +  +F+KF V+NPLSV+
Sbjct: 126 ALGESLERIVSHEIKELVSYRLPPGDHATIPPVTDPNDPGLHVFR-KFYKFAVTNPLSVK 184

Query: 200 TKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSA--TMLKADGP 253
           TKV V +  +       +E  FLE  I+N T+  ++++++  E + +W      L  DG 
Sbjct: 185 TKVHVPRAPSALLSRPEREKVFLEIHIQNLTEDAMWLERMHLECADSWKVHDVNLADDG- 243

Query: 254 HSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LGKLQITWRT 312
                 +   IF   + +     +  Y+Y L  +   +       GS V LG+L I+WR+
Sbjct: 244 ---SEMEKEGIFSGSMALMQPQDMRQYVYVLSPVILTAFPVAHAPGSIVPLGRLDISWRS 300

Query: 313 NLGEPGRLQTQQI 325
           + GEPGRL T  +
Sbjct: 301 SFGEPGRLLTSML 313


>gi|390342034|ref|XP_795991.3| PREDICTED: UPF0533 protein C5orf44 homolog [Strongylocentrotus
           purpuratus]
          Length = 230

 Score =  117 bits (292), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 70/209 (33%), Positives = 110/209 (52%), Gaps = 11/209 (5%)

Query: 118 RDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG 177
           +D+ +K ++QT  QR+ L   S  P  ++  G   D ++ H+VKELG H LVC   Y+  
Sbjct: 28  QDIHVKTDLQTSSQRLTLSGGSTPPSPNLAPGACIDQVIHHEVKELGTHILVCAVSYTSP 87

Query: 178 EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
            GE     +F+KF V  PL V+TK    +       +  +LEA I+N T+S + M++V  
Sbjct: 88  SGETLSFRKFYKFQVLKPLDVKTKFYNAE------SDEVYLEAQIQNITQSPMCMEKVAL 141

Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSH-GSSSPVK 296
           EP+ ++    L +    +   A S+++        +      YLY LK  +  G+  P  
Sbjct: 142 EPTADYMVEELNS----TQTEATSKKLIFGDFTYLNPMDTRQYLYCLKAKTQAGADRPSL 197

Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
           ++G + +GKL I W+T LGE GRLQT Q+
Sbjct: 198 IKGVSSIGKLDIVWKTTLGEKGRLQTSQL 226


>gi|326436192|gb|EGD81762.1| hypothetical protein PTSG_02475 [Salpingoeca sp. ATCC 50818]
          Length = 355

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 101/374 (27%), Positives = 159/374 (42%), Gaps = 64/374 (17%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           M +TP  H L  RVM+L +P      P+  D   L +  ++    + A N      ++VT
Sbjct: 1   MDATPRAHPLTLRVMQLAKPGFARHDPVGYDEEGLALTRNV----LHAENPRHYAPANVT 56

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
                                      L LP + G +YLGE+F ++I+I N     V +V
Sbjct: 57  E-------------------------ALQLPSSQGKVYLGESFSAFINICNDGHDVVTNV 91

Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYS 175
            +K E+QT  QR     TS +  ES RA            + H+++ LG H L+C   Y+
Sbjct: 92  SLKVEMQTASQR----HTSLADPESCRASKLERTQTLQTTIRHEIRSLGTHALLCAVSYT 147

Query: 176 DGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQV 235
              GER+   + F F V+ PL V           T  Q    LE  ++N     ++   +
Sbjct: 148 LLNGERRTFRKSFNFEVNQPLDVIPH-------CTTIQNTIVLEVQVKNQMPHPIHFQSI 200

Query: 236 EFEPS-----QNWSATMLKADGPHSDYNA-QSREIFKPPVLIRSGGGIHNYLYQLKMLSH 289
           +F P      Q+ +AT+ +     S ++  QS E    P   RS      YLY+L   + 
Sbjct: 201 KFTPQSAFAVQDCNATLCQDGKTRSVFHGFQSVE----PKESRS------YLYKL---TP 247

Query: 290 GSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGID 349
                 + +    +GKL + WR+++GE G LQT Q+        ++EL+    PS V + 
Sbjct: 248 AEGQYFEFRRRKAIGKLDVMWRSSMGEFGHLQTSQLERPVPPVHDLELHATNAPSAVTVG 307

Query: 350 KPFLLKLKLTNQTD 363
            PF ++  + N  D
Sbjct: 308 APFEVECDVINFRD 321


>gi|256073664|ref|XP_002573149.1| hypothetical protein [Schistosoma mansoni]
          Length = 509

 Score =  114 bits (285), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 128/529 (24%), Positives = 207/529 (39%), Gaps = 162/529 (30%)

Query: 15  MRLCRPSLHVEPPLRVDPTDLFIGEDI------FDDPI------------AASNLPPLIS 56
           MRL RP+  ++   R +PT+L++ +DI      +D  I            +A N+PP  +
Sbjct: 1   MRLNRPTFVIQ---RCEPTELYL-DDIAGSLTAYDASIRGDLDGISLNLLSAGNIPPSSN 56

Query: 57  SDVTTNKSSDLTYRSRFLLHDSADSI-----GLSGLLVLPQAFGAIYLGETFCSYISINN 111
           S  + N   +L   S+    D+ + I     G S LL L  +FG IYLGETF ++I+++N
Sbjct: 57  SHESPNYDHELNNDSK----DNYNYIQPKVGGYSELLSLTHSFGTIYLGETFSAHINLHN 112

Query: 112 SSTLEVRDVVIKAEIQTDKQRILLL----------------------------------- 136
            S     +V +K  +    + I L                                    
Sbjct: 113 ESNQICYNVELKVALHNRIESITLPIFTSLNGQSNSTVVLRNSSTNSESSNTHTSPSLGS 172

Query: 137 ----DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY------------------ 174
                 +K  V  ++ G   + I+ H++KELG H L CT  Y                  
Sbjct: 173 NAGGTNTKDSVFDLQPGQSLNAIISHELKELGVHNLRCTVSYFQTSSHGKSESSSHVVAY 232

Query: 175 -----------SDGEGERK--YLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEAC 221
                       D   +R+     + +KF+V+ PL VR K  +V +  +       +E  
Sbjct: 233 ESPRLTSGLSSRDTTSKREPITFQRLYKFMVNKPLDVRKKFSIVDIDNS-----VLMETQ 287

Query: 222 IENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYL 281
           I+N T + + +++V FE +  +S   L      ++     +  F  P        +  +L
Sbjct: 288 IQNLTVTPIILERVLFESNPQFSVIDL------NNLQFGKKSHFNTPTYYLQPNDVQQFL 341

Query: 282 YQLKMLSHGS------------------------SSPVKVQGSNVLGKLQITWRTNLGEP 317
           Y+L   +  S                        SS    Q S   G+L ITWR+ +GE 
Sbjct: 342 YRLIPTTTNSLPLLNSSSTNSSIPASAVPDPIPVSSTTTRQVSISAGRLDITWRSLMGER 401

Query: 318 GRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQND 377
           GRLQT  +     T  +I+L V+ +PS V  ++PF LK +LTN +   Q           
Sbjct: 402 GRLQTSSLKYELPTFGDIQLRVLTIPSTVTTEQPFTLKFELTNCSKTRQ----------- 450

Query: 378 SDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFD 426
                       ++  L P +      F LNL+AT  G+  I+G+ + D
Sbjct: 451 ------------KLGKLLPGQCI---PFELNLMATLPGLHMISGLCIHD 484


>gi|193617950|ref|XP_001949728.1| PREDICTED: UPF0533 protein C5orf44 homolog [Acyrthosiphon pisum]
          Length = 404

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 113/448 (25%), Positives = 199/448 (44%), Gaps = 66/448 (14%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H +  RVMRL +P +     +  D  DL         P AA N    +  DVTT      
Sbjct: 12  HPIKLRVMRLGKPVMFNSKIVTCDSKDL---------PGAALNAH--LKKDVTT------ 54

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                  L D A+++     L++P     +YLGETF  YI + N S+  V D+++KAEI 
Sbjct: 55  -------LAD-AETLAAGSFLMVPNVLENLYLGETFLCYIYLKNESSQTVYDIILKAEID 106

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGA-HTLVCTALYSDGEGERKY-LP 185
           T    I +L         +      D IV+H+VKE G+ + L+C   Y     +RK+   
Sbjct: 107 TATSHIPIL--GPKAFSKLDPYASIDVIVKHEVKEHGSVNKLICQVEY-----DRKHSFE 159

Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEIT---FLEACIENHTKSNLYMDQVEFEPSQN 242
             F + V  PL ++TK          +  +T   +LE  ++N   + + +++   E S  
Sbjct: 160 TIFSYRVPKPLDLKTKF---------YNTVTDEVYLEVQVQNIMSTPISLEKFILESSIG 210

Query: 243 WSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
           +    +     H   +++ + IF   + I        Y+Y+L +      +P +   +N 
Sbjct: 211 YDVNSMN----HLLESSEDKSIFG-DMDILDVKETRQYMYRLSLDKTAEKNPTR---TNN 262

Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT 362
           LGKL I WR+N+G  G++Q+  ++       +I  ++  +P +V  ++ F     + N  
Sbjct: 263 LGKLDILWRSNMGTKGQIQSSPLVRQIPELDDITFSITYLPDMVFCEEQFDFTCSIKNNR 322

Query: 363 DKEQGPFEIWLSQNDSDEEKV----VMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQR 418
           ++     ++ L    SDEE       MI+G+++  L P   + +     +++A   G+Q 
Sbjct: 323 NR-----DMQLVVEVSDEEDSNLAWTMISGIQLRLLPP---YATIKTVFSMVALNHGLQV 374

Query: 419 ITGITVFDKLEKITYDSLPDLEIFVDQD 446
           I+GI + + +   TY       +FV Q+
Sbjct: 375 ISGIKLKELILNRTYSYNNFGHVFVTQN 402


>gi|358335977|dbj|GAA34217.2| UPF0533 protein C5orf44 homolog [Clonorchis sinensis]
          Length = 539

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 131/553 (23%), Positives = 214/553 (38%), Gaps = 144/553 (26%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           M++   T  L+ RVMRL RP    +   + +P +L++      D IA++    L ++D  
Sbjct: 1   MTAPQDTDVLSLRVMRLNRPQFVRQ---QCEPAELYL------DDIASA----LTTADAG 47

Query: 61  TNKSSDLTYRSRFLLHDSADS---------------------------------IGLSG- 86
                D     R  + D A +                                 IG  G 
Sbjct: 48  VRADLDGVALHRLSISDCAQNDVTEGLTMEDQGDQEKAETDQIEEAQNHLVRVKIGGPGE 107

Query: 87  LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILL-----LDTSKS 141
           LL LPQ+FG+ YLGETF ++++++N S     +V +K  +    + + L     L  +  
Sbjct: 108 LLGLPQSFGSTYLGETFSAHVNLHNESNQICYNVELKVSLHNRIEWVTLSTSGTLTGASL 167

Query: 142 PVES-----------------IRAGGRYDFIVEHDVKELGAHTLVCTALY---------- 174
           P +S                 +  G   + I+ H++KELG HTL C A Y          
Sbjct: 168 PAQSPSSPEMSNQRSCSGGVDLHPGQSLNAIIHHELKELGIHTLRCVASYCLSSAASTVG 227

Query: 175 ------------SDGEGERKYLPQF-----FKFIVSNPLSVRTKVRVVKVGATHFQEITF 217
                       +   G+   L  F     +KF VS PL V+ K   V           F
Sbjct: 228 QSALSPLTPKSPNQWTGDPSALESFTFQRLYKFPVSKPLDVKKKFSAVDSNG-----CVF 282

Query: 218 LEACIENHTKSNLYMDQVEFEPSQNWSATMLKA--DGPHSDYNAQSREIFKPPVLIRSGG 275
           +EA ++N T   +Y+++V FEPS N     L    DG  S          +         
Sbjct: 283 MEAEVQNLTSVPIYLERVVFEPSPNMRVVDLNTIDDGKSSVPTCGDLRCLR-------AH 335

Query: 276 GIHNYLYQL-------------------------KMLSHGSSSPVKVQGSNV-LGKLQIT 309
            I  +LY+L                         + L  GS +  ++Q   +  G+L IT
Sbjct: 336 DIQQFLYKLIPDSGLLAKSPGQRMSVRSTQGQVRQPLPSGSVTASQLQQQPLSAGRLDIT 395

Query: 310 WRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPF 369
           WR+ +GE GRLQT  +        +++L  + +P+ V I++PF + L+LTN++ +     
Sbjct: 396 WRSTMGERGRLQTSSLKYELPHLGDLQLKALNLPATVQIEQPFQITLELTNRSTQHMDLM 455

Query: 370 EIWLSQNDSDEEKVVMIN--------GLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
                + ++D                GL    L  +    S    L L+AT  G+Q I+G
Sbjct: 456 LDLRGKPETDNSDDCSFRSLPPLAWVGLTTCRLGMLPPGRSMPLSLGLMATVPGLQPISG 515

Query: 422 ITVFDKLEKITYD 434
           + + +   +  Y+
Sbjct: 516 VLIHENTTERDYE 528


>gi|324506540|gb|ADY42790.1| Unknown [Ascaris suum]
          Length = 295

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 84/285 (29%), Positives = 132/285 (46%), Gaps = 38/285 (13%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
           M+ T     L  +VMRL RP L+    + +DP           DP++      LI S V 
Sbjct: 1   MAETSRDQLLVLKVMRLARPKLYDTVCIPIDP----------GDPMSE-----LIGSAV- 44

Query: 61  TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
                      R     +AD   +   L+ PQ F  IYLGETF  Y+ + N S+    ++
Sbjct: 45  ----------CRLTGQKAADE-PVGEYLMAPQIFDNIYLGETFTFYVCVQNDSSQCATEI 93

Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
            IK ++QT  QR+ L    +    +++ G     I+ H++KE+G H LVC   Y     E
Sbjct: 94  CIKTDLQTTNQRVALHSKLQDSNATLQPGQILGDIISHEIKEVGQHILVCAVTYKTPADE 153

Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
           + Y  +FFKF V+ P+ VRTK    +    +     +LEA I+N + + + +++V  EPS
Sbjct: 154 KMYFRKFFKFPVTKPIDVRTKFYNAE---DNMNNDVYLEAQIQNTSATPMILEKVVLEPS 210

Query: 241 QNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK 285
             +++T +    P    N  S++ F       +   I  YLY L+
Sbjct: 211 DFYTSTEIP---PPLLLNENSKKQF-----YLNPKDIRQYLYCLR 247


>gi|443925337|gb|ELU44194.1| hypothetical protein AG1IA_01781 [Rhizoctonia solani AG-1 IA]
          Length = 616

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 97/332 (29%), Positives = 148/332 (44%), Gaps = 54/332 (16%)

Query: 8   HSLAFRVMRLCRPSLHVEP-PLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
           H LA +VMR+ RPSL   P P   D T L                           ++S 
Sbjct: 4   HLLALKVMRVSRPSLSAHPLPFFSDSTAL-----------------------AAHARASP 40

Query: 67  LTYRSRFL--LHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
           L+  S+ L  +  +   +  S +L+LP+AFG+I LGETF S + INN S   V    +  
Sbjct: 41  LSLESQPLDGIPSTLRDLAQSQVLLLPEAFGSISLGETFTSALCINNESAHTVLGSHLLV 100

Query: 125 EIQTDKQRILLLDTSKSPVES-IRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERK- 182
           EIQT   + +L       ++S +  G  +  +V H++KELG H LVCT  Y      R  
Sbjct: 101 EIQTASTKTVLGQVGG--IDSRLEPGQMFSLVVSHEMKELGQHVLVCTVGYHVPPALRNN 158

Query: 183 -YLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
              P+    I  +P ++  +    KV         FLE  ++N T   LY ++++FE ++
Sbjct: 159 SIPPEDPIHIPRSPSALLNRNERNKV---------FLEVHVQNLTTKPLYFEKIQFECAE 209

Query: 242 NW-----SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSS-PV 295
            W     +   +   G  SD  +++ E    P   R       YLY L      + S P+
Sbjct: 210 GWVLADANPKSVSNSGSESDSGSKTNETSLRPQDTR------QYLYILVATPAATPSFPI 263

Query: 296 KVQGSNV--LGKLQITWRTNLGEPGRLQTQQI 325
                 +  LG+L ++WR++ GEPGRL T  +
Sbjct: 264 PYPPGTIIALGRLDMSWRSSFGEPGRLLTSML 295


>gi|212645333|ref|NP_001129809.1| Protein C56C10.7, isoform c [Caenorhabditis elegans]
 gi|351060510|emb|CCD68186.1| Protein C56C10.7, isoform c [Caenorhabditis elegans]
          Length = 243

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 77/267 (28%), Positives = 133/267 (49%), Gaps = 32/267 (11%)

Query: 183 YLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQN 242
           Y  +FFKF VS P+ V+TK    +  A   Q++ +LEA IEN + +N+++++VE +PSQ+
Sbjct: 2   YFRKFFKFPVSKPIDVKTKFYSAEDNAN--QDV-YLEAQIENTSNANMFLEKVELDPSQH 58

Query: 243 WSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS-- 300
           ++ T +     H D      ++ KP         I  +L+ L        +P  V  +  
Sbjct: 59  YNVTSIA----HEDEFGDVGKLLKP-------KDIRQFLFCL--------TPADVHNTLG 99

Query: 301 ----NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
                 +GKL ++WRT++GE GRLQT  +        ++ L+V + P+ V + KPF +  
Sbjct: 100 YKDLTSIGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVDVQKPFEVSC 159

Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
           +L N +++     ++ L Q  +        +G+ +  L P +     DF LN+    +G+
Sbjct: 160 RLYNCSERALD-LQLRLEQPSNRHLVFCSPSGVSLGQLPPSQ---HVDFSLNVFPVTVGI 215

Query: 417 QRITGITVFDKLEKITYDSLPDLEIFV 443
           Q I+GI + D   K  Y+     +IFV
Sbjct: 216 QSISGIRITDTFTKRIYEHDDIAQIFV 242


>gi|290982829|ref|XP_002674132.1| hypothetical protein NAEGRDRAFT_80726 [Naegleria gruberi]
 gi|284087720|gb|EFC41388.1| hypothetical protein NAEGRDRAFT_80726 [Naegleria gruberi]
          Length = 483

 Score =  109 bits (272), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 122/477 (25%), Positives = 215/477 (45%), Gaps = 71/477 (14%)

Query: 1   MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIF-DDPIAASNLPPLISSDV 59
           M  TP  H ++ ++MRL +P   +  P+  + TD      +F   P   S++  +  +++
Sbjct: 16  MVETP--HPISIKLMRLKKPDFSLTVPILPEKTDALGDYKLFYKTPNYVSDVKSIYGNEM 73

Query: 60  TTNKSSDLTYRSRFL-----LHDSA----------DSIGLSGLLVLPQAFGAIYLGETFC 104
               S     +   L     L D+           DS+G +    LP A GAIY+GE   
Sbjct: 74  PLRASQQQQQKEDTLIEIPGLEDNGKSLLDRCIIFDSLGYNDGWCLPSAPGAIYVGEHLK 133

Query: 105 SYISINNSSTLEVRDVVIKAEIQTDKQRIL---LLDTSKSPVESIRAGGRYDFIVEH--- 158
            YIS++N S   ++++ + AE+ T K +     LLD S +P++ + +    DFI+EH   
Sbjct: 134 CYISLHNESYKVIQNISVTAELVTGKGKTTKQTLLDISSTPLDQLGSKTNKDFIIEHPLT 193

Query: 159 ---DVKELGAHT-LVCTALYSDGE-GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQ 213
              D+++    T L C   Y D E G  +   + F F V +PL ++ KV         F 
Sbjct: 194 SSDDIQDDEDKTVLTCLVSYYDPEEGRVRSFRKHFPFKVYDPLGMKVKVNT-------FG 246

Query: 214 EITFLEACIENHTKS-NLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIR 272
              F++  ++N T++ +LY++ V+FEP  N+   ++      S +N  S   F+ P+L  
Sbjct: 247 NHVFVQLDLQNLTQTPSLYIESVKFEP--NFGYELMD----QSVHNT-SENYFEHPLL-- 297

Query: 273 SGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITS 332
             G    +L++L   S   +  V  Q S  LGK+ + W+  +GE G L T  I    I  
Sbjct: 298 -RGESKRFLFELVPNSKNRAMNV-TQNSVFLGKISLQWKNTMGECGMLLTNPIPHKLIPK 355

Query: 333 KEIELNVV----EVP---SVVGIDK------------PFLLKLKLTNQTDKEQGPFEIWL 373
           +++E +++     +P   +++G +             PF    ++TN + K+     I L
Sbjct: 356 QDLEASIIGFTSSIPDEFTILGSNNNNNTQESFTLYTPFYAVCEITNYS-KDVMDLSIHL 414

Query: 374 SQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEK 430
              DSD+   + ING  + A+  ++   S    + L   + G   + G  +  K +K
Sbjct: 415 ---DSDKMYPLAINGSSLQAVGELQPLKSRHVFIPLFPLQRGAHLVAGKGILVKDKK 468


>gi|443896779|dbj|GAC74122.1| uncharacterized conserved protein [Pseudozyma antarctica T-34]
          Length = 615

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 107/392 (27%), Positives = 177/392 (45%), Gaps = 90/392 (22%)

Query: 8   HSLAFRVMRLCRPSLHV-EPPLRVDPTDLF------IGEDIFDDPIAASNLPPLISSDVT 60
           H L+ +VMR   PSL V E P   D +         +GE I             +S D+ 
Sbjct: 37  HLLSLKVMRASAPSLAVSEKPYFDDASSTSSSLLAAVGEGIDAG----------LSHDLL 86

Query: 61  TNK---SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEV 117
           +N+   SS  T  + +    +A++  +S +LVLP +FG ++LGETF +Y+ + N S   V
Sbjct: 87  SNRWEGSSSTTTAAAY--RSAAENFPISSVLVLPNSFGTLFLGETFRTYVCVRNESGAAV 144

Query: 118 RDVVIKAEIQTDKQ----------------RILL------------LDTSKSPVESIRAG 149
           R+  ++ E+Q                     I++             D+   PV  + AG
Sbjct: 145 REPSLRVEMQVGASDASQPHAESGRWHQLAHIIMPSPSRYTPDPADTDSQGRPVWELAAG 204

Query: 150 GRYDFIVEHDVKELGAHTLVCTALYS------DGEG---ERKYLPQFFKFIVS-NPLSVR 199
              +  + +D+K+LG H LVCT  Y       DG+    ER +  +FFKF V  +P+SVR
Sbjct: 205 RALETSLGYDIKDLGPHVLVCTVGYKARVVMHDGQEAWIERSFR-KFFKFAVERSPISVR 263

Query: 200 TKVRVVKVGATHF------QEITFLEACIEN--HTKSNLYMDQVEFEPSQNWSATMLKAD 251
           TKV   +     +      +E   LE  ++N     S+L +D+++ + +  W+ + +  D
Sbjct: 264 TKVHQPREACAVYHPDPAVRERVHLEVQVQNVASNGSSLVLDRLDLKTAPGWTWSSI--D 321

Query: 252 GPHSDYNAQSREIF-----KPPVLIRSGGGIHNYLYQL-------------KMLSHGSSS 293
            P    + +  +++     K  +L+ + G +  YL+ L               +  GS+ 
Sbjct: 322 RPSLSCDDKDGDMWMRVGGKSKMLL-ADGDVRQYLFALVPSEEVAFWEARESGMDMGSTQ 380

Query: 294 PVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
                  + LG L I+WR +LGEPGRLQT Q+
Sbjct: 381 EGWAIRGDALGHLDISWRMSLGEPGRLQTSQL 412


>gi|353233427|emb|CCD80782.1| hypothetical protein Smp_016810 [Schistosoma mansoni]
          Length = 567

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 115/463 (24%), Positives = 185/463 (39%), Gaps = 136/463 (29%)

Query: 15  MRLCRPSLHVEPPLRVDPTDLFIGEDI------FDDPI------------AASNLPPLIS 56
           MRL RP+  ++   R +PT+L++ +DI      +D  I            +A N+PP  +
Sbjct: 1   MRLNRPTFVIQ---RCEPTELYL-DDIAGSLTAYDASIRGDLDGISLNLLSAGNIPPSSN 56

Query: 57  SDVTTNKSSDLTYRSRFLLHDSADSI-----GLSGLLVLPQAFGAIYLGETFCSYISINN 111
           S  + N   +L   S+    D+ + I     G S LL L  +FG IYLGETF ++I+++N
Sbjct: 57  SHESPNYDHELNNDSK----DNYNYIQPKVGGYSELLSLTHSFGTIYLGETFSAHINLHN 112

Query: 112 SSTLEVRDVVIKAEIQTDKQRILLL----------------------------------- 136
            S     +V +K  +    + I L                                    
Sbjct: 113 ESNQICYNVELKVALHNRIESITLPIFTSLNGQSNSTVVLRNSSTNSESSNTHTSPSLGS 172

Query: 137 ----DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY------------------ 174
                 +K  V  ++ G   + I+ H++KELG H L CT  Y                  
Sbjct: 173 NAGGTNTKDSVFDLQPGQSLNAIISHELKELGVHNLRCTVSYFQTSSHGKSESSSHVVAY 232

Query: 175 -----------SDGEGERK--YLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEAC 221
                       D   +R+     + +KF+V+ PL VR K  +V +  +       +E  
Sbjct: 233 ESPRLTSGLSSRDTTSKREPITFQRLYKFMVNKPLDVRKKFSIVDIDNS-----VLMETQ 287

Query: 222 IENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYL 281
           I+N T + + +++V FE +  +S   L      ++     +  F  P        +  +L
Sbjct: 288 IQNLTVTPIILERVLFESNPQFSVIDL------NNLQFGKKSHFNTPTYYLQPNDVQQFL 341

Query: 282 YQLKMLSHGS------------------------SSPVKVQGSNVLGKLQITWRTNLGEP 317
           Y+L   +  S                        SS    Q S   G+L ITWR+ +GE 
Sbjct: 342 YRLIPTTTNSLPLLNSSSTNSSIPASAVPDPIPVSSTTTRQVSISAGRLDITWRSLMGER 401

Query: 318 GRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
           GRLQT  +     T  +I+L V+ +PS V  ++PF LK +LTN
Sbjct: 402 GRLQTSSLKYELPTFGDIQLRVLTIPSTVTTEQPFTLKFELTN 444


>gi|388855808|emb|CCF50592.1| uncharacterized protein [Ustilago hordei]
          Length = 809

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 109/378 (28%), Positives = 172/378 (45%), Gaps = 72/378 (19%)

Query: 2   SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLP---PLISS- 57
           S   G H L+ +VMR   PSL V                  + P   S+LP   PLI++ 
Sbjct: 40  SQNAGPHLLSLKVMRASAPSLAVS-----------------EKPYYDSHLPSSSPLIAAV 82

Query: 58  --DVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSS-T 114
              ++ + SSD    S       + +  +S LL LP +FG +YLGETF +Y+ + N S T
Sbjct: 83  GKGISESLSSDPL--SNHYPDAPSSNFPISNLLTLPSSFGTLYLGETFRTYLCVRNESPT 140

Query: 115 LEVRDVVIKAEIQTDKQR----------ILL---LDTSKS--PVESIRAGGRYDFIVEHD 159
             VR+  ++AE+Q               I+L     TSKS  PV  +      +  + +D
Sbjct: 141 SPVREPSLRAEMQVGSSETEGRWHQLAHIILPSPTSTSKSGEPVWELPPSAPLETSLGYD 200

Query: 160 VKELGAHTLVCT----ALYSDGEGERKYLPQFFKFIV-SNPLSVRTKVRVVKVGATHF-- 212
           +K+LG H LVCT    AL ++G    +   +F+KF V  +P+SVRTKV   +  A+ +  
Sbjct: 201 IKDLGPHVLVCTVGYKALSAEGGWVERSFRKFYKFSVDRSPISVRTKVHQPRNVASLYHA 260

Query: 213 ----QEITFLEACIENHTKSNLYM--DQVEFEPSQNWS-----ATMLKADGPHSDYNAQS 261
               ++   LE  ++N + + + +  + +   P+  W         L  +    +   ++
Sbjct: 261 DEGVRKRVELEVQVQNASANGMRLVFEGLSLRPADGWRWDSVDRPSLTPNSTKGESVEEA 320

Query: 262 REIFKPPV----LIRSGGGIHNYLYQLK-----MLSHGSSSPVKVQG----SNVLGKLQI 308
           R+++  P        + G I  YL+ L       L  G      V+G     + LG L I
Sbjct: 321 RDMWLKPNNGGHEALADGDIRQYLFTLHPKPGVKLGGGVDLGKSVEGYLIRGDALGNLDI 380

Query: 309 TWRTNLGEPGRLQTQQIL 326
            WR +LGEPGRLQT Q++
Sbjct: 381 GWRMSLGEPGRLQTSQLV 398


>gi|71019495|ref|XP_759978.1| hypothetical protein UM03831.1 [Ustilago maydis 521]
 gi|46099484|gb|EAK84717.1| hypothetical protein UM03831.1 [Ustilago maydis 521]
          Length = 833

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 104/389 (26%), Positives = 167/389 (42%), Gaps = 73/389 (18%)

Query: 6   GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
           G H ++ +VMR   PSL V      D    +      D+ I A      ++  +    S 
Sbjct: 40  GPHLVSLKVMRTSAPSLAVSEKPYCDRHSTY-----HDELITA------VAQGIDDAASH 88

Query: 66  DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
           DL           AD   +S LLVLP +FG +YLGETF +Y+ + N S+  VR+  ++ E
Sbjct: 89  DLLSNRWDTSPSPADQFPISELLVLPNSFGTLYLGETFRTYLCVRNESSTAVREPSLRVE 148

Query: 126 IQTDKQR---------------ILLLDTSKS---------PVESIRAGGRYDFIVEHDVK 161
           +Q                    IL   T  S         PV  +R     +  + +D+K
Sbjct: 149 MQVGASDPHTQEGGRWVQLAHVILPTPTRYSPEPDQDKGRPVWELRTAQALETSLAYDIK 208

Query: 162 ELGAHTLVCTALY-----SDGE---GERKYLPQFFKFIVS-NPLSVRTKVRVVKVGATHF 212
           +LG H LVCT  Y      DG+    ER +  +F+KF V  +P+SVRTKV   +  ++ F
Sbjct: 209 DLGPHVLVCTVGYKSPLQQDGDVAWVERSFR-KFYKFSVDRSPISVRTKVHQPRHASSLF 267

Query: 213 QEITFLEACIE-----NHTKSN---LYMDQVEFEPSQNWSATMLKADGPH---SDYNAQS 261
                +   +E      +T  N   L ++++  +P+  W    +  D P    +D   + 
Sbjct: 268 HPDAAVRKRVELEVQVQNTAGNGAALVLNELTLKPAPGWK--WVSVDRPSLNDADRGDED 325

Query: 262 REIFKPPVLIRSGGGIHNYLYQL-----------KMLSHGSSSPVKVQG----SNVLGKL 306
             I +    + + G +  YL+ L           +++  G    V  +G     + LG L
Sbjct: 326 MWILRGTDQVLADGDVRQYLFVLTPENKDQTLAEEVMQGGIDLGVTKEGLALRGDALGHL 385

Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEI 335
            I+WR  LGE GRLQT Q++   + ++ +
Sbjct: 386 DISWRMALGEAGRLQTSQLVRRRVVTQPV 414


>gi|343424905|emb|CBQ68443.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 759

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 101/391 (25%), Positives = 166/391 (42%), Gaps = 77/391 (19%)

Query: 6   GTHSLAFRVMRLCRPSLHV-EPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
           G H L+ +VMR   P L V E P            +   +P +A  L   +   +    +
Sbjct: 42  GPHLLSLKVMRASAPLLAVSEKPYY----------EHHAEPTSADTLLSAVGQGIEQGLA 91

Query: 65  SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
            DL          SA +  +S LLVLP +FG +YLGETF +Y+ + N +   VR+  ++ 
Sbjct: 92  HDLLSNRWDGAGGSASNFPVSDLLVLPSSFGTLYLGETFRTYLCVRNEAATAVREPSLRV 151

Query: 125 EIQTDKQRILLLDTSK-------------------------SPVESIRAGGRYDFIVEHD 159
           E+Q     +   D  +                          PV  +  G   +  + +D
Sbjct: 152 EMQVGASDVQQSDAGRWHQLAHVILPTPTRLSPDPDGGEEGRPVWELAPGQPLETALGYD 211

Query: 160 VKELGAHTLVCTALYSDG--EG------ERKYLPQFFKFIVS-NPLSVRTKVRVVKVGAT 210
           +K+LGAH LVCT  Y     +G      ER +  +++KF V  +P+SVRTKV   +  ++
Sbjct: 212 IKDLGAHVLVCTVGYKAAVQQGSEVAWVERSFR-KYYKFSVERSPISVRTKVHQPRHASS 270

Query: 211 ------HFQEITFLEACIEN--HTKSNLYMDQVEFEPSQNWSATMLKADGPH----SDYN 258
                   ++   LE  ++N     S L  + +  +P+  W       D P      + +
Sbjct: 271 LHHPDAKVRQRVELEVQVQNVAGNGSALVFEGLALKPAPGWG--WASVDRPSLNGGGEED 328

Query: 259 AQSREIFKPPVLIRSGGGIHNYLYQL-----KMLSH---------GSSSPVKVQGSNVLG 304
             +R++      + + G +  YL+ L       L+H         G+S+       + LG
Sbjct: 329 MWARKVG---TEVLADGDVRQYLFTLTPSTAATLAHETLKAGLDLGTSADGHAIRGDALG 385

Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEI 335
            L I+WR +LGEPGRLQT Q++   + +  I
Sbjct: 386 HLDISWRMSLGEPGRLQTSQLVRRRVVTPPI 416


>gi|325189573|emb|CCA24059.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 450

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 94/372 (25%), Positives = 163/372 (43%), Gaps = 40/372 (10%)

Query: 84  LSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKS-- 141
           +S +L LP +FG I+LG TF SYIS+ N    ++ +V + A IQ    R+ L D  +S  
Sbjct: 65  ISNMLCLPDSFGQIFLGNTFSSYISVINPYNCDIEEVGLTANIQCGNDRVELQDNRQSRT 124

Query: 142 -------PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG-EGERKYLPQFFKFIVS 193
                  P   + A    D +V+  + ++G H L     Y D    E K L +F++F V 
Sbjct: 125 GKLPPPNPTPVLSANSSLDMVVDFPLSQVGNHVLRVGVSYLDPITKESKSLRKFYRFGVQ 184

Query: 194 NPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK---- 249
           NPL +       K      QEI  +EA I N +   L++D + FE + +++    K    
Sbjct: 185 NPLILN-----FKQSRAPSQEI-LIEAQIRNVSSLPLFIDSIRFEATSSFTLMTTKRSSE 238

Query: 250 ---ADGPH-----SDY-------NAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSP 294
              AD        SDY       + +       P L++    +   +++L          
Sbjct: 239 SSPADCTQPQPEDSDYTIDTIWPSLKQHLARGSPTLLQPQEELQR-MFRLFEYERKKIVD 297

Query: 295 VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLL 354
              Q S  LG+L + W+T++GE G +Q+Q I+    T +++ + +   P  + ++K F++
Sbjct: 298 PGFQSSQTLGRLHVGWKTSVGEAGSVQSQPIVRKYDTMRDVSIRLHSFPERLVVEKVFVV 357

Query: 355 KLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKL 414
           +  + N + +    F+I L       + +V    L    +  + +  S    L L+  + 
Sbjct: 358 ECTIENHSTRN---FDIQLQFRKESLDGIVCY-CLTHQHVGSLVSEASITLPLKLLPLEC 413

Query: 415 GVQRITGITVFD 426
           G+Q I  I   D
Sbjct: 414 GLQEIRDIVCVD 425


>gi|296410908|ref|XP_002835177.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295627952|emb|CAZ79298.1| unnamed protein product [Tuber melanosporum]
          Length = 319

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 98/346 (28%), Positives = 147/346 (42%), Gaps = 72/346 (20%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  +                        +LP    S    N+S +L
Sbjct: 14  HSISLKVLRLSRPSLSEQ-----------------------HSLPKATPS----NQSPEL 46

Query: 68  TYRSR----FLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
              SR    +  H + D   LS LL LP AFG  Y+GETF   +S NN +T     V I 
Sbjct: 47  DELSRQSHAYPSHSTDDPFILSPLLTLPPAFGNAYIGETFSCCLSANNETTSITTSVRIS 106

Query: 124 AEIQT-----------DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTA 172
           AE+QT           D+++   LD    PV S++       IV++D+KE G H L  T 
Sbjct: 107 AEMQTPSLTLNLELGGDERQTADLD----PVMSLQK------IVKYDLKEEGNHILAVTV 156

Query: 173 LYSD-------GEGER------KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLE 219
            Y++       GEGE+      +   + ++FI    L+VRTK+  +  G         LE
Sbjct: 157 TYTEAPKRVDYGEGEKGAPGRVRTFRKLYQFIAQQCLTVRTKIGSLSGGR------AILE 210

Query: 220 ACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHN 279
           A +EN     + ++ V    ++ W+AT L   G     + Q      P +  R    +  
Sbjct: 211 AQLENMGDGPISLEMVHMGTTKGWTATSLNWQGSTGRGDGQRNPKDTPMLGSRDVMQVAF 270

Query: 280 YLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
            LY  +         V      +LG+L I WR+  G+ G L T ++
Sbjct: 271 LLYPEETEEGWEED-VAANDKKILGQLSIEWRSACGDRGYLSTGRL 315


>gi|167517297|ref|XP_001742989.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778088|gb|EDQ91703.1| predicted protein [Monosiga brevicollis MX1]
          Length = 415

 Score = 97.8 bits (242), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 81/328 (24%), Positives = 151/328 (46%), Gaps = 34/328 (10%)

Query: 67  LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEI 126
           +T +++  L  S ++ G+S +L LP A G +YLG+T    IS++N  +  V  +V K E+
Sbjct: 20  ITQQNQADLRSSYENFGVSEVLKLPAAVGNVYLGQTLSCLISVHNEGSESVSSIVTKVEL 79

Query: 127 QTDKQRILLLDT--------SKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE 178
           QT  +R  L  T           P+  +  G   D IVE+ +++   H +VC   Y+  +
Sbjct: 80  QTGSKRTSLKPTLTGERKGQEVGPIGKLAPGQAIDQIVEYQLQDPAVHIMVCILAYTSQD 139

Query: 179 GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE 238
           G+RK L + FKF V+ PL +    + +K       +   ++  ++N  K  L ++ V   
Sbjct: 140 GDRKQLRKHFKFEVTQPLEIVPLCKTLK-------DDVMVQVNVQNIAKEPLILEYVRMT 192

Query: 239 PSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQ 298
           P++ +  T  + D P S    Q   + K            N ++ LK     +      +
Sbjct: 193 PTKVY--TCEETDEPPSP--DQQLPVSK----------TRNRIFVLK--PQPTVDARTFK 236

Query: 299 GSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKL 358
            S  +G++ ++WR   G  G      I     T  ++ L+V++ P  V +     L++++
Sbjct: 237 QSAKVGQVMVSWRAMRGGRGYTSIATIQRRVPTLNDVHLDVLDPPDSVQVGTLCTLRVRI 296

Query: 359 TNQTDKEQGPFEIWLSQNDSDEEKVVMI 386
            N TD++   + + LS N     ++V++
Sbjct: 297 INFTDRQ---YTLGLSYNPEQVTELVVM 321


>gi|302916379|ref|XP_003052000.1| hypothetical protein NECHADRAFT_37787 [Nectria haematococca mpVI
           77-13-4]
 gi|256732939|gb|EEU46287.1| hypothetical protein NECHADRAFT_37787 [Nectria haematococca mpVI
           77-13-4]
          Length = 822

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 100/343 (29%), Positives = 153/343 (44%), Gaps = 57/343 (16%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + P  +DP    IG  I   P  AS                 L
Sbjct: 517 HSISLKVLRLSRPSLVTQYP--IDPPS-SIGATIKPAPAPAS-----------------L 556

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEV----RDVVIK 123
            YRS    + S     LS ++ LP +FG+ Y+GETF   +  NN    +V    RDV I 
Sbjct: 557 AYRSETTSNPSP--FLLSPIVNLPVSFGSAYVGETFSCTLCANNDLLPDVPKNIRDVRID 614

Query: 124 AEIQTDK----QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SD 176
           AE++T      QR+ L   +  P   + +GG    +V  D+KE G H L  T  Y   ++
Sbjct: 615 AEMKTPGLGAVQRLELGPPTDKPEADLDSGGTLQRVVSFDLKEEGNHVLAVTVSYYEATE 674

Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITF-LEACIENHTKSNLYMDQV 235
             G  +   + ++FI    L VRTKV  +K  A   Q   + LEA +EN ++  + +++V
Sbjct: 675 TSGRTRTFRKLYQFICKASLIVRTKVGPLKAAAGDGQPRRWALEAQLENCSEDVVQLEKV 734

Query: 236 --EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSS 293
             + EP   +     +A G        ++ +  P       G +    + ++  S G+ +
Sbjct: 735 VLDTEPGLRYRDCNWEASG-------STKPVLHP-------GEVEQVCFVVED-SSGTGT 779

Query: 294 P-----VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTIT 331
           P     V   G  + G L I WR  +G  G L T + LGT + 
Sbjct: 780 PGGDVEVTPDGRIIFGSLGIGWRGEMGNRGFLSTGK-LGTRVA 821


>gi|428162256|gb|EKX31425.1| hypothetical protein GUITHDRAFT_149310, partial [Guillardia theta
           CCMP2712]
          Length = 211

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 65/183 (35%), Positives = 89/183 (48%), Gaps = 27/183 (14%)

Query: 9   SLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLT 68
           +LAF+VMRL RPS H               +  F   + A       +SD     +  L 
Sbjct: 54  ALAFKVMRLNRPSFH---------------QAGFTAGLQALRE---TASDQAEQATGHLP 95

Query: 69  YRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQT 128
                  H  A+      LL LP  FG IYLGETF +YIS  N+S   +  + I+AEIQT
Sbjct: 96  -------HSDAEGCPSENLL-LPTGFGNIYLGETFTAYISACNTSGSRLMRLEIRAEIQT 147

Query: 129 DKQRILLLDTSKSPV-ESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
             +R+ LLD     V     +  + D+IV H++KE G H ++C+  Y D  GE K + Q+
Sbjct: 148 GTKRVPLLDGKPETVLAQFESNQQVDYIVSHELKEAGVHIMICSGSYLDASGEEKKVRQY 207

Query: 188 FKF 190
           FKF
Sbjct: 208 FKF 210


>gi|451846695|gb|EMD60004.1| hypothetical protein COCSADRAFT_100123 [Cochliobolus sativus
           ND90Pr]
          Length = 319

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 94/341 (27%), Positives = 152/341 (44%), Gaps = 71/341 (20%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RP L  + PL   P                       S D+  +  + L
Sbjct: 17  HSVSLKVLRLSRPMLATQHPL---PN----------------------SKDLGISPQASL 51

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
            Y S+    ++ D+  LS +L LP+AFG+ Y+GETF   +  NN      ST  +  V I
Sbjct: 52  AYPSQ---RNTNDAFILSPVLNLPEAFGSAYVGETFSCTLCANNELDPLDSTKAISGVRI 108

Query: 123 KAEIQTDKQRI-LLLDTSKSPVESIRA----GGRYDFIVEHDVKELGAHTLVCTALYSD- 176
           + ++QT        LD + +P E +      G     I+  ++KE G H L  T  Y++ 
Sbjct: 109 QGDMQTPSNPTGSPLDLTGTPDEDVNTSPGPGESLQRILRFELKEEGNHVLAVTVTYTET 168

Query: 177 --GEGER-----KYLPQFFKFIVSNPLSVRTKVRVV--KVGATHFQEITFLEACIENHTK 227
             GEG+      +   + ++F+    LSVRTK   +  K G+  +     LEA +EN  +
Sbjct: 169 ALGEGKAASGRVRTFRKLYQFVAQQLLSVRTKAGEMSPKNGSRRY----LLEAQLENMGE 224

Query: 228 SNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKML 287
           + + ++ V+  P     +T L  D   S +NA        P+L        + +    +L
Sbjct: 225 AAVCLEAVDVNPKLPLKSTSLNWDMQASGFNA--------PML-----SPRDVVQVAFLL 271

Query: 288 SHGSSSPVKVQGSN------VLGKLQITWRTNLGEPGRLQT 322
           ++      +V+GS       VLG+L I WR+ LG+ G L T
Sbjct: 272 TYKPGEDEEVEGSKTEDDKRVLGQLAIQWRSALGDRGSLST 312


>gi|312378535|gb|EFR25084.1| hypothetical protein AND_09887 [Anopheles darlingi]
          Length = 275

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 68/261 (26%), Positives = 127/261 (48%), Gaps = 18/261 (6%)

Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
           +FFKF V  PL V+TK    +       +  +LEA I+N T   + +++VE E S+ ++ 
Sbjct: 12  KFFKFQVVKPLDVKTKFYNAET------DDVYLEAQIQNITVGPICLEKVELESSEQYTV 65

Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           T L          A    +F    +++       +LY ++ +   +  P  ++ +N +GK
Sbjct: 66  TSLNT-------LATGESVFSSKTMLQPQNSCQ-FLYCIRPIPEIARDPNALKAANNIGK 117

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
           L I WR+NLGE GRLQT Q+    +   ++ L V++  S V I + F  + ++TN +++ 
Sbjct: 118 LDIVWRSNLGERGRLQTSQLQRCPLEYSDLRLLVIDAKSTVRIGEGFSFRCRVTNTSERS 177

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               ++ +  N +  +      G+   AL  +E     +F L +   +LG+  I+ + + 
Sbjct: 178 ---MDLLMGLN-TKAKPGCGYTGVTEFALGALEPGQMKEFPLTVCPVRLGLIVISNLQLT 233

Query: 426 DKLEKITYDSLPDLEIFVDQD 446
           D   K  Y+    L++FV ++
Sbjct: 234 DLFTKRKYEFDNFLQVFVVEE 254


>gi|452005201|gb|EMD97657.1| hypothetical protein COCHEDRAFT_1125394 [Cochliobolus
           heterostrophus C5]
          Length = 319

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 94/341 (27%), Positives = 150/341 (43%), Gaps = 71/341 (20%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RP L  + PL   P                       S D+  +  + L
Sbjct: 17  HSVSLKVLRLSRPMLATQHPL---PN----------------------SKDLGISPQASL 51

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
            Y S+    ++ D+  LS +L LP+AFG+ Y+GETF   +  NN      ST  +  V I
Sbjct: 52  AYPSQ---RNTNDAFILSPVLNLPEAFGSAYVGETFSCTLCANNELDPSDSTKTISGVRI 108

Query: 123 KAEIQTDKQRI-LLLDTSKSPVESIRA----GGRYDFIVEHDVKELGAHTLVCTALYSD- 176
           + ++QT        LD + +P E +      G     I+  ++KE G H L  T  Y++ 
Sbjct: 109 QGDMQTPSNPTGSPLDLTGTPNEEVNTSPGPGESLQRILRFELKEEGNHVLAVTVTYTET 168

Query: 177 --GEGER-----KYLPQFFKFIVSNPLSVRTKVRVV--KVGATHFQEITFLEACIENHTK 227
             GEG+      +   + ++F+    LSVRTK   +  K G   +     LEA +EN  +
Sbjct: 169 ALGEGKAASGKVRTFRKLYQFVAQQLLSVRTKAGEMSPKNGLRRY----LLEAQLENMGE 224

Query: 228 SNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKML 287
           + + ++ V+  P     +T L  D   S  NA        P+L        + +    +L
Sbjct: 225 AAVCLEAVDVSPKPPLKSTSLNWDMQASGLNA--------PML-----SPRDVVQVAFLL 271

Query: 288 SHGSSSPVKVQGSN------VLGKLQITWRTNLGEPGRLQT 322
           ++      +V+GS       VLG+L I WR+ LG+ G L T
Sbjct: 272 TYKPGEDEEVEGSKTEDDKRVLGQLAIQWRSALGDRGSLST 312


>gi|408399762|gb|EKJ78855.1| hypothetical protein FPSE_00998 [Fusarium pseudograminearum CS3096]
          Length = 317

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 91/339 (26%), Positives = 149/339 (43%), Gaps = 50/339 (14%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + P+   P+   +G  +   PI AS       S VT+N +  L
Sbjct: 16  HSISLKVLRLSRPSLVTQYPID-SPSS--VGASLKPAPIPASLA---YHSQVTSNPTPFL 69

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN----SSTLEVRDVVIK 123
                           LS ++ LP +FG+ Y+GETF   +  NN     +   +RDV I+
Sbjct: 70  ----------------LSPVVNLPVSFGSAYVGETFSCTLCANNDLPPDAVKNIRDVRIE 113

Query: 124 AEIQTDK----QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SD 176
           AE++T      QR+ L   +      +++G     +V  D+KE G H L  T  Y   ++
Sbjct: 114 AEMKTPGMGAVQRLELGPPNGQSEADLQSGDTMQRVVSFDLKEEGNHVLAVTVSYYEATE 173

Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQV- 235
             G  +   + ++FI    L VRTKV  +K   T       LEA +EN ++  + +++V 
Sbjct: 174 TSGRTRTFRKLYQFICKASLIVRTKVGSLKAEDTQGHGRWVLEAQLENCSEDVVQLEKVV 233

Query: 236 -EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSP 294
            + EP   +     +A G        ++ +  P       G +    + +      +   
Sbjct: 234 LDTEPGLRYRDCNWEASG-------SAKPMLHP-------GEVEQVCFVVAEDGAETGVE 279

Query: 295 VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSK 333
           V   G  + G L I WR  +G  G L T + LGT   ++
Sbjct: 280 VTPDGRIIFGSLGIGWRGEMGNRGFLATGK-LGTRRAAR 317


>gi|149059253|gb|EDM10260.1| similar to RIKEN cDNA 2410002O22 gene, isoform CRA_c [Rattus
           norvegicus]
          Length = 143

 Score = 89.7 bits (221), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 56/159 (35%), Positives = 85/159 (53%), Gaps = 26/159 (16%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H LA +VMRL +P+L    P+  +  DL    D+F+          L+  D +T      
Sbjct: 10  HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPSTV----- 53

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
                    + A+ + L  +L LPQ FG I+LGETF SYIS++N S   V+D+++KA++Q
Sbjct: 54  ---------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104

Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAH 166
           T  QR L L  S + V  ++     D ++ H+VKE+G H
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTH 142


>gi|46123811|ref|XP_386459.1| hypothetical protein FG06283.1 [Gibberella zeae PH-1]
          Length = 828

 Score = 89.4 bits (220), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 91/339 (26%), Positives = 148/339 (43%), Gaps = 50/339 (14%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + P+    +   +G  I   PI AS       S V +N +   
Sbjct: 527 HSISLKVLRLSRPSLVTQYPIDSPSS---VGASIKSAPIPASLA---YHSQVASNPTP-- 578

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN----SSTLEVRDVVIK 123
                FLL         S ++ LP +FG+ Y+GETF   +  NN     +   +RDV I+
Sbjct: 579 -----FLL---------SPVVNLPVSFGSAYVGETFSCTLCANNDLPPDAAKNIRDVRIE 624

Query: 124 AEIQTDK----QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SD 176
           AE++T      QR+ L   +      +++G     +V  D+KE G H L  T  Y   ++
Sbjct: 625 AEMKTPGMGAVQRLELGPPNSQSEADLQSGDTMQKVVSFDLKEEGNHVLAVTVSYYEATE 684

Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQV- 235
             G  +   + ++FI    L VRTKV  +K   T       LEA +EN ++  + +++V 
Sbjct: 685 TSGRTRTFRKLYQFICKASLIVRTKVGSLKAEDTQGHGRWVLEAQLENCSEDVVQLEKVV 744

Query: 236 -EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSP 294
            + EP   +     +A G        ++ +  P       G +    + +      +   
Sbjct: 745 LDTEPGLRYRDCNWEASG-------SAKPMLHP-------GEVEQVCFVVAEDGAETGVE 790

Query: 295 VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSK 333
           V   G  + G L I WR  +G  G L T + LGT   ++
Sbjct: 791 VTPDGRIIFGSLGIGWRGEMGNRGFLATGK-LGTRRAAR 828


>gi|396461873|ref|XP_003835548.1| hypothetical protein LEMA_P048890.1 [Leptosphaeria maculans JN3]
 gi|312212099|emb|CBX92183.1| hypothetical protein LEMA_P048890.1 [Leptosphaeria maculans JN3]
          Length = 323

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 99/344 (28%), Positives = 153/344 (44%), Gaps = 73/344 (21%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RP+L  + PL  D  DL I       P A+   PP    D T +K    
Sbjct: 17  HSVSLKVLRLSRPTLATQHPL-PDSHDLGI------SPKASLAYPP---QDNTNDK---- 62

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
                F+         LS +L LP+AFG+ Y+GETF   +  NN      +T  V  V I
Sbjct: 63  -----FI---------LSPVLNLPEAFGSAYVGETFACTLCANNEIDPSDTTKAVSGVRI 108

Query: 123 KAEIQTDKQ-RILLLDTSKSPVE----SIRAGGRYDFIVEHDVKELGAHTLVCTALYSD- 176
           + ++QT        LD + SP +    S+        I+  ++KE G H L  T  Y++ 
Sbjct: 109 QGDMQTPTNPSGSPLDLTGSPDDSEGLSLGPSESLQRILRFELKEEGNHVLAVTVTYTET 168

Query: 177 --GEGER-----KYLPQFFKFIVSNPLSVRTKVRVV--KVGATHFQEITFLEACIENHTK 227
             GEG+      +   + ++F+    LSVRTK   +  K+G + +     LEA +EN  +
Sbjct: 169 ALGEGKAASGRVRTFRKLYQFVAQQLLSVRTKAGEMSQKMGLSRY----LLEAQLENMGE 224

Query: 228 SNLYMDQVEFEP-------SQNWSATMLKADGPHSDYNAQSREIFKPPVLI--RSGGGIH 278
           + + ++ V   P       S NW    L A G H+      R++ +   L+  + GG   
Sbjct: 225 AAVCLEAVNVHPKPPLRSISLNWDMHPLGA-GQHNAPILGPRDVVQVAFLLEQQPGGDGD 283

Query: 279 NYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
           N           S +    +G   +G+L I WR+ LG+ G L T
Sbjct: 284 N-----------SKTDGPTEGRTPIGQLAIQWRSALGDQGSLST 316


>gi|452842472|gb|EME44408.1| hypothetical protein DOTSEDRAFT_172587 [Dothistroma septosporum
           NZE10]
          Length = 321

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 89/340 (26%), Positives = 148/340 (43%), Gaps = 65/340 (19%)

Query: 6   GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
           G HS++ +V+RL RPSL  + PL   PT+   G D+  DP A+             + SS
Sbjct: 16  GPHSVSLKVLRLSRPSLATQTPL--PPTNFGNGLDL--DPKAS-----------LAHSSS 60

Query: 66  DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVRDV 120
           D      F L         + LL LP AFGA Y+GETF   +  NN     S +  V  V
Sbjct: 61  DEAQHGAFPL---------TPLLTLPAAFGAAYVGETFICTLCANNELPSDSESKIVSAV 111

Query: 121 VIKAEIQTDKQR---ILLLDTSKSPVES-----IRAGGRYDFIVEHDVKELGAHTLVCTA 172
            I AE+QT        L L+ +    +      ++ GG     + HD+K+ G H L  T 
Sbjct: 112 KIVAELQTPSHSEGIALQLEKAGKAADGDDTGDVKPGGTLQRTLRHDLKDEGPHVLAVTI 171

Query: 173 LYSD--------GEGERKYLPQFFKFIVSNPLSVRTKV--RVVKVGATHFQEITFLEACI 222
            Y++          G  +   + ++F+    ++VR+K+  R  +  A+  +E   LEA +
Sbjct: 172 TYTETLHGNGAASGGRVRTFRKLYQFVSQQLVAVRSKITERKRRDKASGPREW-ILEAQL 230

Query: 223 ENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLY 282
           EN  ++++ +++V  +  +  S+  +  +        +   + KP         +   ++
Sbjct: 231 ENVGETSVVLEKVLLKEKEGISSRRMAGE-------EKEATVLKPQ-------DVEQIMF 276

Query: 283 QLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
              +L        +  G   LG+L I WR+ +GE G L T
Sbjct: 277 ---LLQEEGERKEEQTGRVPLGQLDIDWRSAMGERGSLTT 313


>gi|328861257|gb|EGG10361.1| hypothetical protein MELLADRAFT_94429 [Melampsora larici-populina
           98AG31]
          Length = 592

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 92/363 (25%), Positives = 145/363 (39%), Gaps = 86/363 (23%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H L+ +V+R  RP+   +PPL                        P I+    +N  S +
Sbjct: 19  HLLSLKVLRAARPTFK-QPPLH-----------------------PTINPINPSNSISTI 54

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISIN---NSSTLEVRDVVIKA 124
           T+          +S   S  L LP +FG IYLG+TF   +S+    N     V +V +K 
Sbjct: 55  TF----------ESAPKSSTLTLPDSFGVIYLGQTFHGLLSVQYEGNQLDSIVENVALKV 104

Query: 125 EIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER--- 181
           E+ T   +  L +     +   + G   +  V+H++KELG HTLVCT  Y   +      
Sbjct: 105 ELHTASHKAFLDEIKTHQIGFGQNG--LELSVKHEIKELGLHTLVCTVFYDQIQSVNSQD 162

Query: 182 ---------------KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQ------------- 213
                          +   + +KF V NPLSV+TKV V       FQ             
Sbjct: 163 LDPTNPSPDPTVRVPRSFRKVYKFQVLNPLSVKTKVLVPSSAQPSFQTSPLPSTINAIFS 222

Query: 214 ----EITFLEACIENHTKSNLYMDQVEFEPSQ---NWSATMLKADGPHSDYNAQSR-EIF 265
               E  +LE  I+N +   +    V+  P Q   N      +    + D N  S+  + 
Sbjct: 223 PTIREQLYLEVQIQNQSTQPIIFQHVKLIPPQAETNPEEEAEEDKLEYLDLNLDSKTNLL 282

Query: 266 KPPVLIRSGGGIHNYLYQLKMLSHGSSS---PVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
              +   S    + +L+ +   S   SS   P++     +LG+L+I+W + +GE GRL T
Sbjct: 283 SNSLTHLSTNDSNQFLFLIISQSVNPSSLKKPIQ-----ILGRLEISWNSMMGESGRLMT 337

Query: 323 QQI 325
             +
Sbjct: 338 NPL 340


>gi|164659806|ref|XP_001731027.1| hypothetical protein MGL_2026 [Malassezia globosa CBS 7966]
 gi|159104925|gb|EDP43813.1| hypothetical protein MGL_2026 [Malassezia globosa CBS 7966]
          Length = 462

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 91/353 (25%), Positives = 156/353 (44%), Gaps = 46/353 (13%)

Query: 5   PGTHSLAFRVMRLCRPSLHVEPPLRVDPT-DLFIGEDIFDDPIAASNLP----------P 53
           P T  L+ +VMR+  PSL      RV P  +  +   + D+P   +N P          P
Sbjct: 7   PYTPPLSVKVMRIATPSLAS----RVVPMFETCMESGVVDEPSDHNNTPHRQECVEYLDP 62

Query: 54  LISSDV--TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN 111
            I   +  T  + SD  + +  +   +A  +  +  L+LP +FG++ +GETF + I ++N
Sbjct: 63  HIWDVIKSTYARGSDEIFTNAPI---TARDVSYTDQLLLPASFGSVSVGETFQAVICVSN 119

Query: 112 SSTLEVRDVVIKAEIQTDKQRIL------LLDTSKSPVESIRAGGRYDFIVEHDVKELGA 165
           +S + ++ + IK E+ TDK          L D S   + S+  G +   +  H + +L  
Sbjct: 120 TSMMPIQGMRIKVEMHTDKTDSFPPSSHSLNDVS---LPSLAPGAQMTALARHSIDKLAM 176

Query: 166 HTLVCTALYSDGEGERKYLPQFF----KFIVS-NPLSVRTKVRVVKVGATH----FQEIT 216
           H LVC  ++SD    +   P  F    +F V   P  +R++V      + +     +E T
Sbjct: 177 HALVC-RIWSDRHTSQGIYPHSFSKQYRFKVHPPPFLMRSEVHTNDTLSFYHDRSIREQT 235

Query: 217 FLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGG 276
            +   + N +   L +D +  +P Q+WSA+  K D  H     +    F   +  R    
Sbjct: 236 LVLVSVHNTSSRPLRLDMLSIDPDQSWSASAPKLD--HMPLMPKDVRNFVFTLSPRETMS 293

Query: 277 IHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTT 329
             ++  +L+   H      +V  +  LG ++I WR   GE GRL+   I  TT
Sbjct: 294 PLHFREKLQSAEH-----TRVACTVPLGHIRIAWRVPGGEMGRLRIGTIQRTT 341


>gi|397619517|gb|EJK65296.1| hypothetical protein THAOC_13857 [Thalassiosira oceanica]
          Length = 460

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 71/255 (27%), Positives = 127/255 (49%), Gaps = 30/255 (11%)

Query: 84  LSGLLVLPQAFGAIYLGETFCSYISI-NNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSP 142
           LS  L+LP +FG I++GETF +Y+ + N ++ + VR + + A++QT  +RI+L       
Sbjct: 51  LSSNLMLPDSFGVIHVGETFAAYLGVLNAAADVSVRGLTVSAQLQTPSRRIVLPSRLDGT 110

Query: 143 VESIRAGGRYDFIVEHDVKELGAHTLVCTALY-SDGEGERKYLPQFFKFIVSNPLSVRTK 201
              I   G  D IV   ++E+G H L     Y S+G+   K L +F++F V+NPLS+   
Sbjct: 111 PADIEPSGGVDAIVARTLEEVGPHILRVEVGYVSNGQ---KSLRKFYRFNVTNPLSITES 167

Query: 202 VRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA--TMLKADGPHSD--- 256
             VV+ G         ++  +E  TK  + +  V F+PS   ++    L  +G  S    
Sbjct: 168 --VVRGGDAKCLVTIRVQNTMEKPTKGAVTISDVRFQPSTGMASEQIALSEEGQGSVSAL 225

Query: 257 --YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG---SNVLGKLQITWR 311
             Y++  R   +P       G  + YL+ ++  S  +    K++G    + LG+  +T+ 
Sbjct: 226 DLYDSCGR--LQP-------GESYQYLFSVRAESEAA----KLRGISYGDDLGQAVLTYH 272

Query: 312 TNLGEPGRLQTQQIL 326
             +GE G +++  ++
Sbjct: 273 KAMGETGVIKSSLVV 287


>gi|402583817|gb|EJW77760.1| hypothetical protein WUBG_11331, partial [Wuchereria bancrofti]
          Length = 164

 Score = 85.9 bits (211), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 58/187 (31%), Positives = 84/187 (44%), Gaps = 28/187 (14%)

Query: 15  MRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFL 74
           MRL RP  +    + +DP D                   LI S +            R  
Sbjct: 1   MRLARPKFYENICIPIDPAD---------------TTSQLIGSAL-----------CRLT 34

Query: 75  LHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRIL 134
             ++AD I +   L+ PQ F +IYLGETF  Y+ + N S     D+ +K ++QT  QR  
Sbjct: 35  GQEAAD-IPIGKYLMAPQKFESIYLGETFTFYVCVQNISDKLATDICVKTDLQTTSQRNA 93

Query: 135 LLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSN 194
           L    +     +  G     ++ H++KE+G H LVC   Y   + E  Y  +FFKF V+ 
Sbjct: 94  LSSQLQEANAVLEPGECLGEVITHEIKEIGQHILVCAVSYRTPKNEM-YFRKFFKFPVTK 152

Query: 195 PLSVRTK 201
           P+ VRTK
Sbjct: 153 PIDVRTK 159


>gi|317146315|ref|XP_001821432.2| hypothetical protein AOR_1_1658144 [Aspergillus oryzae RIB40]
 gi|391869103|gb|EIT78308.1| hypothetical protein Ao3042_05468 [Aspergillus oryzae 3.042]
          Length = 336

 Score = 85.5 bits (210), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 99/355 (27%), Positives = 146/355 (41%), Gaps = 76/355 (21%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + P                 P A + +         +NK+S L
Sbjct: 17  HSVSLKVLRLSRPSLSYQYPF----------------PEANTKI---------SNKAS-L 50

Query: 68  TYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVRDVV 121
           +Y S     DS D+   L+  L LP AFG+ Y+GETF   +S NN      ++  V  V 
Sbjct: 51  SYPS-----DSVDNQFILAPNLTLPPAFGSAYVGETFACTLSANNELAEDETSRVVTSVR 105

Query: 122 IKAEIQTDKQ--RILLLDTSKSPV-ESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD-- 176
           I AE+QT  Q   + L     +P  + ++ G     IV  D+KE G H L  +  Y++  
Sbjct: 106 IVAEMQTPSQVASLELEPADDAPARDGLQKGQSLQKIVRFDLKEEGNHILAVSVSYTETL 165

Query: 177 -------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK-----------VGATHFQEITFL 218
                    G  +   + ++F+    LSVRTK   +             G T       L
Sbjct: 166 IGSDSQAASGRVRTFRKLYQFVAQPCLSVRTKSSELSPLEVENKSLGPYGKTRLLRFA-L 224

Query: 219 EACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS------REIFKPPVLIR 272
           EA +EN     + + Q +  P   + AT L  D    D +         R++ +   L+ 
Sbjct: 225 EAQLENVGDEAVVVKQTKLNPKPPFKATSLNWDLARPDQSDSQPPTLNPRDVLQVAFLVE 284

Query: 273 SGGGIHNYLYQL-KMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
              G    L  L K L H         G  VLG+L I WR  +G+ G L T  +L
Sbjct: 285 QEEGQQEGLDALQKDLKH--------DGRAVLGQLSIEWRGTMGDKGFLTTGNLL 331


>gi|299116795|emb|CBN74908.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 535

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 60/179 (33%), Positives = 91/179 (50%), Gaps = 21/179 (11%)

Query: 84  LSGLLVLPQAFGAIYLGETFCSYISI-NNSSTLEVRDVVIKAEIQTDKQRILLLD----- 137
           LS  L LP +FG IYLGETF +YIS+ N+ ST  + +  + A++Q+   R+ L D     
Sbjct: 55  LSSALKLPDSFGNIYLGETFTAYISVLNHMSTTVLVNASLSAKLQSPTGRVDLEDRRTAR 114

Query: 138 ----TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SDGEG-ERKYLPQFFK 189
               +  +P   +      D IVEH ++ELG HTL  T  Y    D EG E + + +F++
Sbjct: 115 GASVSRPNPAPLLSPSENLDMIVEHTLEELGTHTLRVTVKYHVAGDPEGSEPRSMRKFYR 174

Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
           F V NP+SV      V+          F+E  + N T+ +L ++   F P     A++L
Sbjct: 175 FSVMNPVSVNPVCTAVRGSP-------FVEVQLVNTTQMDLLLESCHFIPEGGVEASLL 226



 Score = 39.3 bits (90), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 30/127 (23%), Positives = 58/127 (45%), Gaps = 4/127 (3%)

Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
           S+ LG++++ WRT  GE G ++   ++       E+E+ V  +P V+ + +       + 
Sbjct: 381 SHTLGRVEVCWRTTTGESGSIRGGPVVFEAPDRPEVEVTVDGLPDVLKLGRVAECVATVR 440

Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
           N++++   P  + L Q  +D    V ++G     L  +         L L+A   G+  +
Sbjct: 441 NRSNR---PMTLQL-QFRTDGMVGVYVHGQSFRNLGELLPGTFVRCPLQLLALVAGLHEL 496

Query: 420 TGITVFD 426
            G TV D
Sbjct: 497 RGCTVAD 503


>gi|330936778|ref|XP_003305510.1| hypothetical protein PTT_18371 [Pyrenophora teres f. teres 0-1]
 gi|311317446|gb|EFQ86402.1| hypothetical protein PTT_18371 [Pyrenophora teres f. teres 0-1]
          Length = 319

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 88/343 (25%), Positives = 144/343 (41%), Gaps = 73/343 (21%)

Query: 7   THSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
            HS++ +V+RL RPSL  + PL   P    +G                       +  + 
Sbjct: 16  AHSVSLKVLRLSRPSLATQYPL---PNSKSLG----------------------ISPKAS 50

Query: 67  LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVV 121
           L Y S+   +D+ D   LS  L LP+AFG+ Y+GETF   +  NN      +T  +  V 
Sbjct: 51  LAYPSQ---NDAKDQFILSPALKLPEAFGSAYVGETFSCTLCANNELDSSDNTKAISGVR 107

Query: 122 IKAEIQTDKQRILLLDTSKSPVE-----------SIRAGGRYDFIVEHDVKELGAHTLVC 170
           I+ ++QT        + + SP+E           S   G     I++ ++KE G H L  
Sbjct: 108 IQGDMQTPS------NPTGSPLELCGLSGEDEGISPGPGESLQRILKFELKEDGNHVLAV 161

Query: 171 TALYSD---GEGER-----KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACI 222
           T  Y++   GEG+      +   + ++F+    LSVRTK    ++G  +      LEA +
Sbjct: 162 TVTYTETALGEGKAASGRVRTFRKLYQFVAQQLLSVRTKAG--EMGHRNGSSRYLLEAQL 219

Query: 223 ENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNA---QSREIFKPPVLIRSGGGIHN 279
           EN  ++ + ++ V   P     +  L  D   +  NA     R++ +   L+    G  +
Sbjct: 220 ENMGEAAVCLEAVNVNPKPPLRSRSLNWDMQPAGLNAPILSPRDVVQVAFLLEHQAGDDD 279

Query: 280 YLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
            +             +      VLG+L I WR+ LG+ G L T
Sbjct: 280 DM----------PDSITEDNKRVLGQLAIQWRSALGDRGSLST 312


>gi|425781566|gb|EKV19524.1| hypothetical protein PDIG_02530 [Penicillium digitatum PHI26]
 gi|425782814|gb|EKV20700.1| hypothetical protein PDIP_13810 [Penicillium digitatum Pd1]
          Length = 336

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 94/358 (26%), Positives = 143/358 (39%), Gaps = 82/358 (22%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H+++ +V+RL RPSL                   +  P+ ASN   +ISS  +      L
Sbjct: 17  HAVSLKVLRLARPSLS------------------YQHPLPASNT--IISSKAS------L 50

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD-------- 119
           +Y S     DS D   L+ LL LP +FG++Y+GETF   +S NN    E+ D        
Sbjct: 51  SYPS----GDSDDQFILTPLLTLPPSFGSVYVGETFGCTLSANN----EIHDNDNERILT 102

Query: 120 -VVIKAEIQTDKQRILL---LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS 175
            V I AE+QT      L        +  + +R G     IV  D+KE G H L  +  Y+
Sbjct: 103 SVRILAEMQTPSSVAALELQPPNDSASTDGLRIGESLQKIVRFDLKEEGNHILAVSVSYT 162

Query: 176 D---------GEGERKYLPQFFKFIVSNPLSVRTKVRVV-----------KVGATHFQEI 215
           +           G  +   + ++F+    LSVRTK   +             G T     
Sbjct: 163 ETKIGSDSQAASGRVRTFRKLYQFVSQPCLSVRTKASELPPLEVDNKSLGPYGKTRLLRF 222

Query: 216 TFLEACIENHTKSNLYMDQVEFEP-------SQNWSATMLKADGPHSDYNAQSREIFKPP 268
             LEA +EN  +  + + Q +  P       S NW  TM     P +      R++ +  
Sbjct: 223 A-LEAQLENVGEGAVVVKQTKLNPKPPFRSKSLNWD-TMNPNMSPAALPTLNPRDVLQVA 280

Query: 269 VLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
            L+    G       L+         ++  G   LG+L I WR  +G+ G L T  ++
Sbjct: 281 FLVEQEEGQSEGFETLQ-------KDLRRDGRATLGQLSIEWRGAMGDKGFLTTGNLM 331


>gi|255949754|ref|XP_002565644.1| Pc22g17310 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211592661|emb|CAP99019.1| Pc22g17310 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 345

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 87/347 (25%), Positives = 142/347 (40%), Gaps = 72/347 (20%)

Query: 14  VMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRF 73
           V+RL RPSL  + PL                        P   + ++T  S  L+Y S  
Sbjct: 32  VLRLARPSLSYQHPL------------------------PTSKTKISTKAS--LSYPS-- 63

Query: 74  LLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD-----VVIKAEIQT 128
              DS D   L+ LL LP +FG++Y+GETF   +S NN   ++  D     V I AE+QT
Sbjct: 64  --SDSDDQFILTPLLTLPPSFGSVYVGETFGCTLSANNEINVDDDDRLLTSVRIVAEMQT 121

Query: 129 DKQRILLL---DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD--------- 176
                 L     +  +  + ++ G     IV  D+KE G H L  +  Y++         
Sbjct: 122 PSSVAALELEPPSDSASTDGLKIGESLQKIVRFDLKEEGNHILAVSVSYTETKIGSDSQA 181

Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVV-----------KVGATHFQEITFLEACIENH 225
             G  +   + ++F+    LSVRTK   +             G T       LEA +EN 
Sbjct: 182 ASGRVRTFRKLYQFVAQPCLSVRTKASELPPLEVDNKSLGPYGKTRLLRFA-LEAQLENV 240

Query: 226 TKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS------REIFKPPVLIRSGGGIHN 279
            +  + + Q +  P   + +  L  D  ++D + ++      R++ +   L+    G + 
Sbjct: 241 GEGAVVVKQTKLNPKPPFQSKSLNWDMMNTDMSTRALPTLNPRDVLQVAFLVEQEEGQNE 300

Query: 280 YLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
            L  L+         ++  G   LG+L I WR  +G+ G L T  ++
Sbjct: 301 GLEALQ-------KDLRRDGRATLGQLSIEWRGAMGDKGFLTTGNLM 340


>gi|119571732|gb|EAW51347.1| hypothetical protein FLJ13611, isoform CRA_e [Homo sapiens]
          Length = 217

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 58/211 (27%), Positives = 105/211 (49%), Gaps = 15/211 (7%)

Query: 226 TKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQ 283
           T S ++M++V  EPS  ++ T L +     +  +   SR   +P            YLY 
Sbjct: 2   TTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFGSRAYLQPM-------DTRQYLYC 54

Query: 284 LKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVP 343
           LK  +  +     ++G  V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P
Sbjct: 55  LKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIP 114

Query: 344 SVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGST 403
             V +++PF +  K+TN +++     ++ L   +++      I+G ++  L P  +    
Sbjct: 115 DTVNLEEPFHITCKITNCSER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC-- 169

Query: 404 DFHLNLIATKLGVQRITGITVFDKLEKITYD 434
              L L+++  G+Q I+G+ + D   K TY+
Sbjct: 170 -LALTLLSSVQGLQSISGLRLTDTFLKRTYE 199


>gi|358386843|gb|EHK24438.1| hypothetical protein TRIVIDRAFT_219893 [Trichoderma virens Gv29-8]
          Length = 319

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 88/334 (26%), Positives = 141/334 (42%), Gaps = 53/334 (15%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL +PSL  + P  +DP         F  P   S   P           + L
Sbjct: 16  HSVSVKVLRLSQPSLVTQYP--IDPP--------FSPPNTKSQPAP-----------ASL 54

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN----SSTLEVRDVVIK 123
            Y      + + D   LS +L LP +FG+ Y+GETF   +  NN     +   +RDV I+
Sbjct: 55  AYSGS---NTNPDPFLLSPVLNLPVSFGSAYVGETFSCTLCANNDLPPDAAKRIRDVRIE 111

Query: 124 AEIQT----DKQRILLLDTSKSPVES-----IRAGGRYDFIVEHDVKELGAHTLVCTALY 174
           AE++T      Q++ L   +     +     +  GG    IV  D+KE G H L  T  Y
Sbjct: 112 AEMKTPGLGGTQKLELGPANMHGAAAAGGVDLEPGGTLQKIVGFDLKEEGNHVLAVTVSY 171

Query: 175 SDG---EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLY 231
           S+     G  +   + ++FI    L VRTKV  +   A+   +   LEA +EN ++  + 
Sbjct: 172 SEATETSGRTRTFRKLYQFICKASLIVRTKVSSLNTDASSIGKW-ILEAQLENCSEDVIQ 230

Query: 232 MDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGS 291
           +++V  +  +            + D N  S    KP   +   G I    + ++     S
Sbjct: 231 LEKVVLDAEEGLG---------YHDCNWSSDGDKKP---VLHPGEIEQVCFLVQEKGADS 278

Query: 292 SSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
              +   G  + G L I WR  +G  G L T ++
Sbjct: 279 GLRLTADGRMIFGVLGIGWRGEMGCRGFLSTGKL 312


>gi|402084162|gb|EJT79180.1| hypothetical protein GGTG_04268 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 335

 Score = 83.2 bits (204), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 95/359 (26%), Positives = 155/359 (43%), Gaps = 80/359 (22%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + P++       +G     +  A ++L    SS+  TN     
Sbjct: 15  HSISLKVLRLSRPSLVPQYPVKSP-----LGAQTAGEASAPASL--AYSSEDGTN----- 62

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTL-----------E 116
                      +D   LS +L LP +FG+ Y+GETF   +  N+ + +           +
Sbjct: 63  -----------SDPFILSPILNLPPSFGSAYVGETFSCTLCANHDAPVAPPGAPPARAKQ 111

Query: 117 VRDVVIKAEIQTDKQ-RILLLD-----------TSKSPVESIRAGGRYDFIVEHDVKELG 164
           VRDV I+AE++T     +  LD           T  +    +  GG    +V  D+K+ G
Sbjct: 112 VRDVRIEAEMKTPASANVTKLDLGPDHAGGRTGTGGAGGVDLEPGGTLQKVVSFDLKDEG 171

Query: 165 AHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHF---QEIT-- 216
            H L  T  Y   +D  G  +   + ++F+    L VRTKV  +  GA      +E+T  
Sbjct: 172 NHVLAVTVSYYEATDTSGRTRTFRKLYQFVCKPSLIVRTKVSALPTGAVAAATEKELTTP 231

Query: 217 ----FLEACIENHTKSNLYMDQ--VEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVL 270
                LEA +EN  +  + +++  ++ EP   ++    +A G               PVL
Sbjct: 232 ARRWVLEAQLENCGEDPIQLERAVLDLEPGLTYTDCNWEAAGGQK------------PVL 279

Query: 271 IRSGGGIHNYLYQLKMLSHGSSSPVK-VQGSNVLGKLQITWRTNLGEPGRLQTQQILGT 328
             S       + Q+  + HG+ +P   V G  + G L + WR  +G  G L T + LGT
Sbjct: 280 HPS------EIEQICFVVHGTPTPASLVDGKVIFGILGVGWRGEMGNRGFLSTGK-LGT 331


>gi|225560447|gb|EEH08728.1| DUF974 domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 348

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 93/361 (25%), Positives = 146/361 (40%), Gaps = 75/361 (20%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + PL                P    ++PPL +S    + SSD 
Sbjct: 17  HSVSLKVLRLSRPSLSYQYPL----------------PSENESVPPLKASLSYPSDSSD- 59

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK---- 123
              S+F+L  +         + LP AFG+ Y+GETF   +  NN   L++ + V+     
Sbjct: 60  ---SQFILSPN---------VTLPPAFGSAYVGETFSCSLCANNELPLDIENRVVSSVRI 107

Query: 124 -AEIQTDKQRILLLDTSKSPVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYSD- 176
            AE+QT  Q I+ L+ S  P E   +GG         IV  D+KE G H L  +  Y++ 
Sbjct: 108 VAEMQTPSQ-IVSLELSP-PGEDSGSGGLAKSQSLQKIVRFDLKEEGNHVLAVSVSYTET 165

Query: 177 --------------------GEGERKYLPQFFKFIVSNPLSVRTKVRVV----------- 205
                                 G  +   + ++FI    LSVRTK   +           
Sbjct: 166 TLAPQGQETSPGSGVGAVQAASGRVRTFRKLYQFIAQPCLSVRTKATELTPLEVDNRALG 225

Query: 206 KVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIF 265
             G         LEA +EN     + +      P   + +  L  D   SD    +  + 
Sbjct: 226 PYGKARLLRYA-LEAQLENVGDGAISLGSTTLNPKPPFKSRSLNWDFERSDSLKTAPPML 284

Query: 266 KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
           KP  +++    +     Q + L  G    +   G  +LG+L I WR ++G+ G L T  +
Sbjct: 285 KPRDVLQVAFLVEQEHGQQEGL-EGLQKDMNRDGRTILGQLSIEWRGSMGDRGFLTTGNL 343

Query: 326 L 326
           +
Sbjct: 344 M 344


>gi|449301586|gb|EMC97597.1| hypothetical protein BAUCODRAFT_67883 [Baudoinia compniacensis UAMH
           10762]
          Length = 321

 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 86/340 (25%), Positives = 137/340 (40%), Gaps = 63/340 (18%)

Query: 6   GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
           G H+++ +V+RL RPSL  + PL   PT+   G DI          PP  S     + + 
Sbjct: 14  GPHAVSLKVLRLSRPSLASQTPL--PPTNFGHGIDI----------PPEASVAYPGSSTK 61

Query: 66  DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS----STLEVRDVV 121
           +              +  L  LL LP AFGA Y+GETF   + +NN         V  V 
Sbjct: 62  E------------PSTFPLVPLLTLPSAFGAAYVGETFACTLCVNNEIQHIEKRSVSGVR 109

Query: 122 IKAEIQTDKQ------RILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS 175
           + AE+QT          +   D ++     +         + H++KE G+H L  T  Y+
Sbjct: 110 VTAELQTPNDPSGTHLELTKADNAEEGDGELPLATTLQRTLAHELKEEGSHVLAVTVSYT 169

Query: 176 ------DG---EGERKYLPQFFKFIVSNPLSVRTKV----RVVKVGATHFQEITFLEACI 222
                 DG    G  +   + ++F+  + ++VR+K     R  K G   +     LEA +
Sbjct: 170 ETLRGDDGGASGGRARSFRKLYQFVAQHLIAVRSKATERKRREKAGGRQW----VLEAQL 225

Query: 223 ENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLY 282
           EN  +    +++V  +  +  ++  +           + R+I +   L+   GG+     
Sbjct: 226 ENVGEMAAVLEKVWLDGKEGIASRAVNGGEEMEAVVLKPRDIEQVMFLLEEDGGV----- 280

Query: 283 QLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
                  G      V G   L KL I WRT +GE G L T
Sbjct: 281 -------GKVEDGTVAGRLPLAKLNIEWRTGMGERGSLTT 313


>gi|426384568|ref|XP_004058833.1| PREDICTED: UPF0533 protein C5orf44 homolog [Gorilla gorilla
           gorilla]
 gi|426384570|ref|XP_004058834.1| PREDICTED: UPF0533 protein C5orf44 homolog [Gorilla gorilla
           gorilla]
          Length = 218

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 58/211 (27%), Positives = 104/211 (49%), Gaps = 14/211 (6%)

Query: 226 TKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQ 283
           T S ++M++V  EPS  ++ T L +     +  +   SR   +P            YLY 
Sbjct: 2   TTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFGSRAYLQPM-------DTRQYLYC 54

Query: 284 LKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVP 343
           LK  +  +     ++G  V+GKL I W+TNLGE GRLQT Q+        ++ L++  +P
Sbjct: 55  LKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIP 114

Query: 344 SVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGST 403
             V +++PF +  K+TN + +     ++ L   +++      I+G ++  L P  +    
Sbjct: 115 DTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC-- 170

Query: 404 DFHLNLIATKLGVQRITGITVFDKLEKITYD 434
              L L+++  G+Q I+G+ + D   K TY+
Sbjct: 171 -LALTLLSSVQGLQSISGLRLTDTFLKRTYE 200


>gi|359497048|ref|XP_003635408.1| PREDICTED: uncharacterized protein LOC100853279, partial [Vitis
           vinifera]
          Length = 54

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 42/52 (80%), Positives = 44/52 (84%)

Query: 393 ALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVD 444
           AL  VEAF STDF LNLIATKLGVQ+ITGITVFD  EK TY+ LPDLEIFVD
Sbjct: 1   ALPQVEAFCSTDFRLNLIATKLGVQKITGITVFDIREKRTYEPLPDLEIFVD 52


>gi|452984074|gb|EME83831.1| hypothetical protein MYCFIDRAFT_162727, partial [Pseudocercospora
           fijiensis CIRAD86]
          Length = 266

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 74/252 (29%), Positives = 113/252 (44%), Gaps = 47/252 (18%)

Query: 6   GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
           G HSL+ +V+RL RPSL  + PL    T+   G DI      AS   P   +D TT    
Sbjct: 12  GPHSLSLKVLRLSRPSLATQTPL--PQTNFGDGLDIHP---TASLAHPKGENDSTT---- 62

Query: 66  DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN------SSTLEVRD 119
                             L+ LL LP AFGA Y+GETF   + +NN      +    V  
Sbjct: 63  ----------------FPLTPLLTLPSAFGAAYVGETFTCTLCVNNELSPDSNQRKSVSG 106

Query: 120 VVIKAEIQT-DKQRILLLDTSKSP-----VESIRAGGRYDFIVEHDVKELGAHTLVCTAL 173
           V I AE+QT  +Q  + L+   +       E+++ G      + H++K+ G H L  T  
Sbjct: 107 VKITAELQTPSRQEGISLNLENAAEADQDEENLKPGATLQRTLRHELKDEGPHVLAVTVS 166

Query: 174 Y------SDGE----GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIE 223
           Y      SDG     G  +   + ++F+    L+VR+KV   K+   +      LEA +E
Sbjct: 167 YTETLIGSDGSAASAGRARTFRKLYQFVSQQLLAVRSKVTERKIREKNSPRQWVLEAQLE 226

Query: 224 NHTKSNLYMDQV 235
           N   +++ +++V
Sbjct: 227 NVGDASVVLERV 238


>gi|378734173|gb|EHY60632.1| hypothetical protein HMPREF1120_08585 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 363

 Score = 82.0 bits (201), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 95/376 (25%), Positives = 148/376 (39%), Gaps = 91/376 (24%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL ++ PL                       P    ++      S L
Sbjct: 16  HSVSLKVLRLSRPSLALQHPL-----------------------PHESETETKIPHISSL 52

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS--------------- 112
            Y S+ +  +      +S  L LP +FG+ ++GETF   +  NN                
Sbjct: 53  AYPSKLVDQE----FIISNNLALPPSFGSAHVGETFSCVLCANNELLPPGPTGTGTTTTT 108

Query: 113 -STLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI------RAGGRYDFIVEHDVKELGA 165
             T  V    I AE+QT  Q I L     SP E +      R G     I   D+KE G 
Sbjct: 109 TPTKTVSGTKILAEMQTPSQSIPLDLHIASPTERVDGHDDGRPGSALQTIARFDLKEEGN 168

Query: 166 HTLVCTALYSD---GEGERKYLP---------QFFKFIVSNPLSVRTKVRVVKVG----A 209
           H L     Y++   G+G + + P         + ++F+    LSVRTK   +        
Sbjct: 169 HVLAVNVTYTETISGDGGQTHAPTSGRVRSFRKLYQFLAQPCLSVRTKATELPPKEVPDK 228

Query: 210 TH--FQEITFL----EACIENHTKSNLYMDQVEFEPSQNWSATMLK----ADGPHSDYNA 259
           TH  +   T L    EA +EN +   + +++ + +    + +T L        P  D   
Sbjct: 229 THGPYGRTTLLRYALEAQLENVSDITIVLEEAKLQSKPPFKSTSLNYWDAHAAPEKDEKN 288

Query: 260 QS---------REIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITW 310
           Q          R+I +   L+    G+   +  LK       + +K  G  VLG+L I W
Sbjct: 289 QGHPQKPIINPRDIIQIAFLVEQMEGVQEGIEDLK-------TSLKRDGRAVLGQLAIQW 341

Query: 311 RTNLGEPGRLQTQQIL 326
           R+++GE G L T  +L
Sbjct: 342 RSSMGERGSLSTGNLL 357


>gi|242765997|ref|XP_002341086.1| DUF974 domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218724282|gb|EED23699.1| DUF974 domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 345

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 93/364 (25%), Positives = 143/364 (39%), Gaps = 84/364 (23%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + PL  +                          D   +  + L
Sbjct: 17  HSVSLKVLRLSRPSLSYQYPLPRE--------------------------DTRISSKASL 50

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
            Y S    +D      LS  + LP AFG+ Y+GETF   +  NN      ST +V  V I
Sbjct: 51  AYPS----NDFDPHFILSPNVTLPPAFGSAYVGETFACSLCANNELPETDSTKKVTSVRI 106

Query: 123 KAEIQTDKQRILLLD-----------TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCT 171
            AE+QT  Q +  LD           T   P + +  G     IV+ D+KE G H L  +
Sbjct: 107 LAEMQTPSQ-VFPLDLKPGEDEHQDETLPKPGKGLDYGQSLQKIVQFDLKEEGNHILAVS 165

Query: 172 ALYSD-----------GEGERKYLPQFFKFIVSNPLSVRTKVRVV-----------KVGA 209
             Y++             G  +   + ++FI    LSVRTK   +             G 
Sbjct: 166 VSYTETLLADANATTASSGRVRTFRKLYQFIAQPCLSVRTKASELVPAEVENKSLGPYGK 225

Query: 210 THFQEITFLEACIENHTKSNLYMDQ--VEFEP-----SQNWSATMLKADGPHSDYNAQSR 262
           T       LEA +EN    ++ +++  +  +P     S NW      +           R
Sbjct: 226 TRLLRFA-LEAQLENVGDGSVVIEKTILNAKPPFKSQSLNWDIHHFPSSSTSEQPTMNPR 284

Query: 263 EIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
           +I +   L+    G H+ L  L+         +K  G  +LG+L I WR+ +G+ G L T
Sbjct: 285 DILQVAFLVEQEVGQHDGLENLQ-------KELKRDGRAILGQLSIEWRSAMGDRGFLTT 337

Query: 323 QQIL 326
             ++
Sbjct: 338 GNLM 341


>gi|212528588|ref|XP_002144451.1| DUF974 domain protein [Talaromyces marneffei ATCC 18224]
 gi|210073849|gb|EEA27936.1| DUF974 domain protein [Talaromyces marneffei ATCC 18224]
          Length = 345

 Score = 81.6 bits (200), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 97/364 (26%), Positives = 145/364 (39%), Gaps = 84/364 (23%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + PL          ED    P A+   P        TN     
Sbjct: 17  HSVSLKVLRLSRPSLSYQYPL--------AREDTRISPKASLAYP--------TND---- 56

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
            +   F+L         S  + LP AFG+ Y+GETF   +  NN      S  +V  V I
Sbjct: 57  -FDPHFIL---------SPNVTLPPAFGSAYVGETFACSLCANNELPTTDSAKKVASVRI 106

Query: 123 KAEIQTDKQRILLLD------------TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVC 170
            AE+QT  Q +  LD             S++P E +  G     IV+ D+KE G H L  
Sbjct: 107 LAEMQTPSQ-VFPLDLRPADDDNHDGTLSRTPGEGLDYGQSLQKIVQFDLKEEGNHILAV 165

Query: 171 TALYSD-------------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK----------- 206
           +  Y++               G  +   + ++FI    LSVRTK   +            
Sbjct: 166 SVSYTETLLTDTLASTQAASGGRVRTFRKLYQFIAQPCLSVRTKASELTPAEVDNKSLGP 225

Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDY----NAQSR 262
            G T       LEA +EN    ++ +++    P   + AT L  D   ++     +   R
Sbjct: 226 YGKTRLLRFA-LEAQLENVGDGSVVIEKTILSPKPPFKATSLNWDVQAAENVERPSMNPR 284

Query: 263 EIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
           +I +   L+    G  + L  L          +K  G   LG+L I WR+ +G+ G L T
Sbjct: 285 DILQVAFLVEQEVGQQDGLDTLL-------KDLKRDGRATLGQLSIEWRSTMGDRGFLTT 337

Query: 323 QQIL 326
             +L
Sbjct: 338 GNLL 341


>gi|358365955|dbj|GAA82576.1| DUF974 domain protein [Aspergillus kawachii IFO 4308]
          Length = 336

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 92/356 (25%), Positives = 142/356 (39%), Gaps = 78/356 (21%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H+++ +V+RL RPSL  + PL                           ++D   +  + L
Sbjct: 17  HAVSLKVLRLSRPSLSYQYPL--------------------------PAADTKISSKASL 50

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVRDVVI 122
           +Y +     +  D   L+  L LP AFG+ Y+GETF   +S NN      ++  V  V I
Sbjct: 51  SYPA----DNVDDQFILTPNLTLPPAFGSAYVGETFACTLSANNELPDEETSRVVTSVRI 106

Query: 123 KAEIQTDKQRILLLDTSKSPVE------SIRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
            AE+QT  Q +  LD    P E       ++ G     IV  D+KE G H L  +  Y++
Sbjct: 107 VAEMQTPSQ-VAALDL--EPAEDTASKDGVQKGHSLQKIVRFDLKEEGNHILAVSVSYTE 163

Query: 177 ---------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK-----------VGATHFQEIT 216
                      G  +   + ++F+    LSVRTK   +             G T      
Sbjct: 164 TLIGSDAQAASGRVRTFRKLYQFVAQPCLSVRTKSSELAPLEVENKSLGPYGKTRLLRFA 223

Query: 217 FLEACIENHTKSNLYMDQVEFEPSQNWSATMLKAD--GP-HSDYNAQS---REIFKPPVL 270
            LEA +EN     + + Q    P   + A  L  D  GP  +D    +   R++ +   L
Sbjct: 224 -LEAQLENVGDGPVVVKQTRLNPKPPFKAVSLNWDLQGPDQADPRPPTLHPRDVLQVAFL 282

Query: 271 IRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
           +    G    L  L+         +K  G  VLG+L I WR  +G+ G L T  ++
Sbjct: 283 VEQEEGQQEGLETLQ-------KDMKRDGRAVLGQLSIEWRGAMGDKGFLTTGNLM 331


>gi|323509275|dbj|BAJ77530.1| cgd8_3650 [Cryptosporidium parvum]
          Length = 394

 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 89/350 (25%), Positives = 161/350 (46%), Gaps = 20/350 (5%)

Query: 88  LVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIR 147
           L+LP     +Y GE+F ++ISI NSS ++   VV+K E+   K+R +L +   +    I 
Sbjct: 51  LLLPTTQCRLYCGESFHAFISITNSSIIKANGVVLKVELVGTKKRHILYNNEDN-YSDID 109

Query: 148 AGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKV 207
            G   D +V+  V E+G ++L C   ++  E  R    + +KF V +P ++  ++  +  
Sbjct: 110 IGDSLDIVVKERVDEVGLYSLTCQLFFTSNEA-RLTQKKSYKFAVLSPFNISHRLYNLD- 167

Query: 208 GATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKP 267
             T  ++  F+E  +EN +  ++ +  ++ EP        L  +    D N +++     
Sbjct: 168 EDTMDKKTIFVEVSLENVSHQSITLSSMKLEPINIKKLPELIFE--LEDVNLKNKH--NE 223

Query: 268 PVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILG 327
           P+ I+     +N +++    S   ++  K     +  KL+I W +     G L + +I G
Sbjct: 224 PLYIQPRCK-YNKIFKFTSCSREYNNLGKSSREVLELKLRIGWVSVSYGDGWLDSYKI-G 281

Query: 328 TTITSKEIELN--------VVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSD 379
             I   + +LN          E+PSV    + F + L +TN    +Q    I L   D D
Sbjct: 282 LPILCDQNKLNKEKNAIILKAELPSVNNRQEEFKVFLYVTNNLSIDQKGMSIRL---DFD 338

Query: 380 EEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLE 429
           +   ++I G   + L  ++A  +    L+  A   GV  + GI VFD+LE
Sbjct: 339 QLLPIIILGNDRLYLEELKAGETVTLELDCQALVSGVYNLNGIYVFDELE 388


>gi|317037990|ref|XP_001401447.2| hypothetical protein ANI_1_228184 [Aspergillus niger CBS 513.88]
          Length = 336

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 90/354 (25%), Positives = 142/354 (40%), Gaps = 74/354 (20%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H+++ +V+RL RPSL  + PL                           ++D   +  + L
Sbjct: 17  HAVSLKVLRLSRPSLSYQYPL--------------------------PAADTKISSKASL 50

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVRDVVI 122
           +Y +     +  D   L+  L LP AFG+ Y+GETF   +S NN      ++  V  V I
Sbjct: 51  SYPA----DNVDDQFILTPNLTLPPAFGSAYVGETFACTLSANNELPDEETSRVVTSVRI 106

Query: 123 KAEIQTDKQRILLLD----TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD-- 176
            AE+QT  Q +  LD       +  + ++ G     IV  D+KE G H L  +  Y++  
Sbjct: 107 VAEMQTPSQ-VAALDLEPAEDTASKDGVQKGHSLQKIVRFDLKEEGNHILAVSVSYTETL 165

Query: 177 -------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK-----------VGATHFQEITFL 218
                    G  +   + ++F+    LSVRTK   +             G T       L
Sbjct: 166 IGSDAQAASGRVRTFRKLYQFVAQPCLSVRTKSSELAPLEVENKTLGPYGKTRLLRFA-L 224

Query: 219 EACIENHTKSNLYMDQVEFEPSQNWSATMLKAD--GP-HSDYNAQS---REIFKPPVLIR 272
           EA +EN     + + Q    P   + A  L  D  GP  +D    +   R++ +   L+ 
Sbjct: 225 EAQLENVGDGPVVVKQTRLNPKPPFKAVSLNWDLQGPDQADPRPPTLHPRDVLQVAFLVE 284

Query: 273 SGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
              G    L  L+         +K  G  VLG+L I WR  +G+ G L T  ++
Sbjct: 285 QEEGQQEGLETLQ-------KDMKRDGRAVLGQLSIEWRGAMGDKGFLTTGNLM 331


>gi|240280000|gb|EER43504.1| DUF974 domain-containing protein [Ajellomyces capsulatus H143]
 gi|325088719|gb|EGC42029.1| DUF974 domain-containing protein [Ajellomyces capsulatus H88]
          Length = 348

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 93/361 (25%), Positives = 148/361 (40%), Gaps = 75/361 (20%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + PL        + E+         ++PPL +S    + SSD 
Sbjct: 17  HSVSLKVLRLSRPSLSYQYPL--------LSEN--------ESVPPLKASLSYPSDSSD- 59

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK---- 123
              S+F+L  +         + LP AFG+ Y+GETF   +  NN   L++ + V+     
Sbjct: 60  ---SQFILSPN---------VTLPPAFGSAYVGETFSCSLCANNELPLDIENRVVSSVRI 107

Query: 124 -AEIQTDKQRILLLDTSKSPVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYSD- 176
            AE+QT  Q I+ L+ S  P E   +GG         IV  D+KE G H L  +  Y++ 
Sbjct: 108 VAEMQTPSQ-IVSLELSP-PGEDSGSGGLAKSQSLQKIVRFDLKEEGNHVLAVSVSYTET 165

Query: 177 --------------------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK---------- 206
                                 G  +   + ++FI    LSVRTK   +           
Sbjct: 166 TLAPQGQETSPGSGVGAVQAASGRVRTFRKLYQFIAQPCLSVRTKATELTPLEVDNRALG 225

Query: 207 -VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIF 265
             G         LEA +EN     + +      P   + +  L  D   SD    +  + 
Sbjct: 226 PYGKARLLRYA-LEAQLENVGDGAISLGSTTLNPKPPFKSRSLNWDFERSDSLKTAPPML 284

Query: 266 KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
           KP  +++    +     Q + L  G    +   G  +LG+L I WR ++G+ G L T  +
Sbjct: 285 KPRDVLQVAFLVEQEHGQQEGL-EGLQKDMNRDGRTILGQLSIEWRGSMGDRGFLTTGNL 343

Query: 326 L 326
           +
Sbjct: 344 M 344


>gi|121706562|ref|XP_001271543.1| DUF974 domain protein [Aspergillus clavatus NRRL 1]
 gi|119399691|gb|EAW10117.1| DUF974 domain protein [Aspergillus clavatus NRRL 1]
          Length = 337

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 89/315 (28%), Positives = 129/315 (40%), Gaps = 50/315 (15%)

Query: 49  SNLPPLISSDVTTNKSSDLTYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYI 107
           SN  PL  ++   +  + L+Y S     D AD    LS  L LP AFG+ Y+GETF   +
Sbjct: 31  SNQYPLPVANTKISSKASLSYPS-----DGADGQFILSPNLTLPPAFGSAYVGETFACTL 85

Query: 108 SINNSSTLE-----VRDVVIKAEIQTDKQRILL-LDTSKSPV--ESIRAGGRYDFIVEHD 159
           S NN  T +     V  V I AE+QT  Q   L L+ +  P   E ++ G     IV  D
Sbjct: 86  SANNELTEDEASRVVTSVRIVAEMQTPSQVASLELEPATDPAQTEGLQKGESLQKIVRFD 145

Query: 160 VKELGAHTLVCTALYSD---------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK---- 206
           +KE G H L  +  Y++           G  +   + ++F+    LSVRTK   +     
Sbjct: 146 LKEEGNHILAVSISYTETLIGSDAQAASGRVRTFRKLYQFVAQPCLSVRTKSSELAPLEV 205

Query: 207 -------VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNA 259
                   G T       LEA +EN     + + Q +  P   + A  L  D    D  A
Sbjct: 206 ENKSLGPYGKTRLLRFA-LEAQLENVGDGAVVVKQTKLNPRPPFQAASLNWDLDRPDEVA 264

Query: 260 --------QSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWR 311
                     R++ +   L+    G    L  L+         ++  G  VLG+L I WR
Sbjct: 265 SPLPPPTLNPRDVLQVAFLVEQEEGQQEGLDALQ-------KDLRRDGRAVLGQLSIEWR 317

Query: 312 TNLGEPGRLQTQQIL 326
             +G+ G L T  +L
Sbjct: 318 GAMGDKGFLTTGNLL 332


>gi|223993247|ref|XP_002286307.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977622|gb|EED95948.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 573

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 74/268 (27%), Positives = 132/268 (49%), Gaps = 32/268 (11%)

Query: 84  LSGLLVLPQAFGAIYLGETFCSYISI-NNSSTLEVRDVVIKAEIQTDKQRILL---LDTS 139
           LS  L+LP +FG I++GETF +Y+ + N SS L VR + +  ++QT  +RI+L   LD +
Sbjct: 133 LSSNLLLPDSFGVIHVGETFSAYLGVLNPSSDLPVRGLTVTVQLQTPSRRIILPSRLDGT 192

Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY-SDGEGERKYLPQFFKFIVSNPLSV 198
            + ++ I+ GG  D IV   ++E+G H L     Y ++G    K L +F++F V+ PL++
Sbjct: 193 DASLKDIQPGGGVDSIVSRRLEEVGQHILRVEVGYMANGA---KTLRKFYRFNVTVPLNI 249

Query: 199 RTKVRVVKVGATHFQEITFLEACIENHTKSN--LYMDQVEFEPSQNWSATMLKAD----- 251
            T+  V K  A+    IT +E  +E  +     + +  V FEP     A  +  +     
Sbjct: 250 -TETVVRKGDASCLVSIT-VENVMEKQSSGGGAVTISSVGFEPHSGLVAEQINIEEDSQG 307

Query: 252 ---------GPHSDYNAQSR----EIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQ 298
                       SD +A  R    E++     +   G I+ YL+ +   S  +++   + 
Sbjct: 308 ETTETDDIMTARSDLSASPRKSTVELYDSCGRLEP-GEINRYLFSVTAGSE-AAALRGIA 365

Query: 299 GSNVLGKLQITWRTNLGEPGRLQTQQIL 326
             + LG+  + +   +GE G+L +  ++
Sbjct: 366 FGDELGRAYLIYYKAMGESGKLFSSMVV 393


>gi|358399703|gb|EHK49040.1| hypothetical protein TRIATDRAFT_82516 [Trichoderma atroviride IMI
           206040]
          Length = 796

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 95/345 (27%), Positives = 149/345 (43%), Gaps = 55/345 (15%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL +PSL  + P  VDP         F  P   S   P           + L
Sbjct: 488 HSVSVKVLRLSQPSLVTQYP--VDPP--------FSPPNTKSQPAP-----------ASL 526

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN----SSTLEVRDVVIK 123
            Y+S    + + D   LS +L LP +FG+ Y+GETF   +  NN     +   +RDV I+
Sbjct: 527 AYKS--ASNTNPDPFLLSPILNLPVSFGSAYVGETFSCTLCANNDLPPDAAKRIRDVRIE 584

Query: 124 AEIQT----DKQRILLLDTS---KSPVESI--RAGGRYDFIVEHDVKELGAHTLVCTALY 174
           AE++T      Q++ L   +    +P   +    GG    IV  D+KE G H L  T  Y
Sbjct: 585 AEMKTPGVGGTQKLELGPANIHGATPAGGVDLEPGGTLQRIVGFDLKEEGNHVLAVTVSY 644

Query: 175 SDG---EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITF-LEACIENHTKSNL 230
           S+     G  +   + ++FI    L VRTKV  ++  A +     + LEA +EN ++  +
Sbjct: 645 SEATETSGRTRTFRKLYQFICKASLIVRTKVSALEASANNSNYRKWVLEAQLENCSEDII 704

Query: 231 YMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHN--YLYQLKMLS 288
            +++V  +  +            + D N  S    KP V     G I    +L   +   
Sbjct: 705 QLEKVVLDVEEGLG---------YQDCNWLSEGDKKPVVHP---GEIEQVCFLVHEEGTD 752

Query: 289 HGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSK 333
            G    +   G  + G L I WR  +G  G L T + LG  + ++
Sbjct: 753 AGGGLRLTSDGRLIFGVLGIGWRGEMGCRGFLSTGK-LGARVAAR 796


>gi|406860784|gb|EKD13841.1| hypothetical protein MBM_08042 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 361

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 93/355 (26%), Positives = 143/355 (40%), Gaps = 78/355 (21%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           H+++ +V+RL RPSL V+ PL   PT L           A S               + L
Sbjct: 37  HAVSLKVLRLSRPSLSVQHPL---PTPLPSSNSSHLSSPAPS---------------ASL 78

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN-------SSTLEVRDV 120
            Y S        D   LS LL LP AFG+ Y+GETF   +  NN       S+   + +V
Sbjct: 79  AYPS-----SKPDPFILSPLLTLPPAFGSAYVGETFSCTLCANNEILAGSSSAGKVITNV 133

Query: 121 VIKAEI-----------------------------QTDKQRILLLDTSKSPVESIRAGGR 151
            I+AE+                             + D +++L  D   S +E    G  
Sbjct: 134 RIEAEMKIPSSSVPIPLVLGPEASSKLETDEVEEGERDPEKVLEKDHQGSDLE---PGKS 190

Query: 152 YDFIVEHDVKELGAHTLVCTALYSD---GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVG 208
              IV  D+KE G+H L  T  YS+     G  +   + ++F+  + + VRTK  V+  G
Sbjct: 191 LQKIVGFDLKEEGSHVLAVTVTYSETTPTSGRIRTFRKLYQFVCKSCMVVRTKTGVLPSG 250

Query: 209 ATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPP 268
               ++   LEA +EN  +  + +D V  E  + +    L         N +  E  + P
Sbjct: 251 EKEGRKWA-LEAQLENCGEETITLDVVILETKEGFKGQGL---------NWEVGEEMERP 300

Query: 269 VLIRSGGGIHNYLYQL-KMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
           VL+   G +    + + ++L  G      V G  + G L + WR  +G  G L T
Sbjct: 301 VLMP--GDVQQVCFLVEEVLGVGGEVVEPVDGKLIFGILSLGWRGTMGNRGFLST 353


>gi|219113485|ref|XP_002186326.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209583176|gb|ACI65796.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 457

 Score = 79.7 bits (195), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 71/262 (27%), Positives = 123/262 (46%), Gaps = 20/262 (7%)

Query: 75  LHDSA-----DSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLE-VRDVVIKAEIQT 128
           LH+ A     +   L   L LP++ G +Y+GETF +Y+ + N+ST + +R + + A++QT
Sbjct: 33  LHNPAAGSLDNQAALHNSLCLPESLG-VYVGETFTAYLGVLNTSTRQSIRRLTVLAQLQT 91

Query: 129 DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFF 188
              R  L    +  V+   A G  D IV H ++E G H L     Y   +G  +   +F+
Sbjct: 92  PSNRWQLPSLLEKGVDVNPANG-VDAIVAHAIEEPGQHILRVEVGYRTNDGGLQTFRKFY 150

Query: 189 KFIVSNPLSV-RTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
           +F V NPL++ +T  R+          +T+ +          L +    F P     A +
Sbjct: 151 RFQVVNPLTIQQTTTRMGDSQCLVSLSVTYNKTA---DATGPLVIANAAFRPVDGLVARL 207

Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSG----GGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
           L  DG H   +    ++    +L +SG    G I  YL+Q++  S   +    +   ++L
Sbjct: 208 L--DG-HVSESTPDAKMSALQLLDKSGLLQPGSIVRYLFQIEATSR-EAVLKGIAAGDLL 263

Query: 304 GKLQITWRTNLGEPGRLQTQQI 325
           G+  +TWR  +GE G++ +  I
Sbjct: 264 GQAVLTWRKAMGETGQIYSASI 285


>gi|398389012|ref|XP_003847967.1| hypothetical protein MYCGRDRAFT_77482 [Zymoseptoria tritici IPO323]
 gi|339467841|gb|EGP82943.1| hypothetical protein MYCGRDRAFT_77482 [Zymoseptoria tritici IPO323]
          Length = 311

 Score = 79.3 bits (194), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 87/337 (25%), Positives = 142/337 (42%), Gaps = 67/337 (19%)

Query: 6   GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
           G HS++ +V+RL RP+L V+ PL    T    G DI   P  AS                
Sbjct: 14  GPHSISLKVLRLSRPTLAVQTPLL--STAFNNGLDI---PAKAS---------------- 52

Query: 66  DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLE-----VRDV 120
            L Y S     D   +  L+ LL LP +FGA Y+GE F   + +NN    E     V  +
Sbjct: 53  -LAYPS----ADQNSTFPLTPLLTLPASFGAAYVGERFTCTLCVNNELLAEDKAKSVSGL 107

Query: 121 VIKAEIQT----DKQRILLLDTSKSPVES-IRAGGRYDFIVEHDVKELGAHTLVCTALYS 175
            + AE+QT    D    L L ++ +  E  +  G    + + H++KE G H L  T  Y+
Sbjct: 108 KVSAELQTPTFSDAGVALELKSALTKKEEDLSPGDTLQYTLSHELKEEGPHVLAVTVSYT 167

Query: 176 DGE---------GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHT 226
           +           G  +   + ++F+    L+VR+K+   +           LEA +EN  
Sbjct: 168 ETSHTAEGGASGGRARTFRKLYQFVAQPLLAVRSKITERQRREKDALRQWILEAQLENVG 227

Query: 227 KSNLYMDQVEFEPSQNWSATMLKADG-PHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK 285
           + ++ +++V       W   + + DG    D N +   + KP         +   ++ ++
Sbjct: 228 EVSVVLERV-------W---LKEEDGMKGQDVNDKEAVVLKP-------SDVEQVMFLVE 270

Query: 286 MLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
                S    +V     LG+L + WR+ +GE G L T
Sbjct: 271 EEERLSELSARVP----LGELNVDWRSAMGERGGLTT 303


>gi|342874081|gb|EGU76154.1| hypothetical protein FOXB_13326 [Fusarium oxysporum Fo5176]
          Length = 1061

 Score = 79.0 bits (193), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 86/319 (26%), Positives = 132/319 (41%), Gaps = 49/319 (15%)

Query: 11  AFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYR 70
           A   +RL RPSL  + P  +DP    +G  I   PI AS       S+  +N S      
Sbjct: 639 ASSTLRLSRPSLVTQYP--IDPPS-SVGASIKSAPIPASLA---YHSEAASNPSP----- 687

Query: 71  SRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS----STLEVRDVVIKAEI 126
             FLL         S  + LP +FG+ Y+GETF   +  NN     +   +RDV I+AE+
Sbjct: 688 --FLL---------SPAVNLPVSFGSAYVGETFSCTLCANNELPIDAAKNIRDVRIEAEM 736

Query: 127 QTDK----QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SDGEG 179
           +T      QR+ L  ++  P   + +G     +V  D+KE G H L  T  Y   ++  G
Sbjct: 737 KTPGMGAVQRLELGPSNGQPEVDLESGDTLQKVVSFDLKEEGNHVLAVTVSYYEATETSG 796

Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQV--EF 237
             +   + ++FI    L VRTKV  +    T  +    LEA +EN ++  + +++V  + 
Sbjct: 797 RTRTFRKLYQFICKASLIVRTKVGPLNSNNTQERGRWVLEAQLENCSEDVVQLEKVVLDT 856

Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
           EP   +     +A G                 L+   G +    + +      S   V  
Sbjct: 857 EPGLRYRDCNWEASGSEK--------------LVLHPGEVEQVCFVVAEDGTESGVEVTP 902

Query: 298 QGSNVLGKLQITWRTNLGE 316
            G  + G L I WR    E
Sbjct: 903 DGRIIFGSLGIGWRGPRAE 921


>gi|453080254|gb|EMF08305.1| hypothetical protein SEPMUDRAFT_166779 [Mycosphaerella populorum
           SO2202]
          Length = 365

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 98/378 (25%), Positives = 149/378 (39%), Gaps = 91/378 (24%)

Query: 2   SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTT 61
           S+  G HSL+ +V+RL RP+L  + PL   PT    G DI   P A+       S+  + 
Sbjct: 14  STFSGPHSLSLKVLRLSRPALATQAPL--PPTAFGNGLDIA--PNASLAYSTADSTATSQ 69

Query: 62  NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISIN----------- 110
           ++  D +  S F L  +         L LP AFGA Y+GETF   + +N           
Sbjct: 70  DEKRDTSAPSSFPLTQA---------LTLPAAFGAAYVGETFVCTLCVNNELPPSPSSDE 120

Query: 111 --------NSSTLEVRDVVIKAEIQT-------DKQRILLLDTSKSPVE----------- 144
                   N +   V  V I AE+QT       D    L L+ + S  E           
Sbjct: 121 GGGGSGEGNQTITVVSGVKIVAELQTPTRNQAGDGGIALPLEGAASTHEDEGEGGEGGGV 180

Query: 145 SIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQ---------------FFK 189
            I+ G      + H++K+ G + L  T  Y+    E   LPQ                ++
Sbjct: 181 KIKPGETLQRTLRHELKDEGQYVLAVTVSYT----EETLLPQHGGTVVGSRTRSFRKLYQ 236

Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIEN--HTKSNLYMDQV---EFEPSQNWS 244
           FI    ++VR+KV   K   T       LEA +EN     + + +++V   E E  +  +
Sbjct: 237 FISQQLVAVRSKVTERKKKDTTAAREWVLEAQLENVADGGAGIVLEKVWLKESEEDRVVA 296

Query: 245 ATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
             M+   G           + KP       G I   ++ +K     ++  V +     LG
Sbjct: 297 KAMMDVGG----------TVLKP-------GDIEQIMFLVKEDKKENAEDVDLSMKVRLG 339

Query: 305 KLQITWRTNLGEPGRLQT 322
           +L I WR+ +GE G L T
Sbjct: 340 QLNIDWRSAMGEKGSLTT 357


>gi|19584414|emb|CAD28498.1| hypothetical protein [Homo sapiens]
          Length = 207

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 46/157 (29%), Positives = 83/157 (52%), Gaps = 6/157 (3%)

Query: 278 HNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIEL 337
             YLY LK  +  +     ++G  V+GKL I W+TNLGE GRLQT Q+        ++ L
Sbjct: 39  RQYLYCLKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRL 98

Query: 338 NVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPV 397
           ++  +P  V +++PF +  K+TN +++     ++ L   +++      I+G ++  L P 
Sbjct: 99  SLEAIPDTVNLEEPFHITCKITNCSER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPS 155

Query: 398 EAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            +       L L+++  G+Q I+G+ + D   K TY+
Sbjct: 156 SSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 189


>gi|312077829|ref|XP_003141474.1| hypothetical protein LOAG_05889 [Loa loa]
          Length = 218

 Score = 75.9 bits (185), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 59/229 (25%), Positives = 112/229 (48%), Gaps = 18/229 (7%)

Query: 217 FLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGG 276
           +LEA I+N ++  + +++V  EPS  + ++ +    P  +     +    P         
Sbjct: 5   YLEAQIQNTSELPMVLEKVILEPSDFYISSEISP--PEIENENMEQSYLNP-------SD 55

Query: 277 IHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIE 336
           I  YL+ LK  +   S     +G   +GKL + WRT++GE GRLQT  +        ++ 
Sbjct: 56  IRQYLFCLKPKTTDYSLNYFRKGI-AIGKLDMVWRTSMGERGRLQTSALQRMAPGYGDLR 114

Query: 337 LNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMI--NGLRIMAL 394
           L + ++P+ V + +PF +  +L N +++   P ++ L+ +D  +  +     +G+ +  L
Sbjct: 115 LTIEKIPATVKVLQPFHIVCRLHNCSER---PLDLVLTLDDKLQPNIAFCSTSGVELGQL 171

Query: 395 APVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
            P     +TDF L L+    G+Q ++GI V D   + TY+     ++FV
Sbjct: 172 PPN---STTDFSLELLPLTPGLQSVSGIRVTDTFLRRTYEHDDIAQVFV 217


>gi|322695604|gb|EFY87409.1| DUF974 domain-containing protein [Metarhizium acridum CQMa 102]
          Length = 353

 Score = 75.1 bits (183), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 90/339 (26%), Positives = 149/339 (43%), Gaps = 61/339 (17%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RP+L  + P                 P+ A+     + S ++   SS  
Sbjct: 59  HSVSVKVLRLSRPALVPQYP---------------SSPLPATK-EAFLPSSLSYKTSS-- 100

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN---------SSTLEVR 118
           T  + FLL         S +L LP +FG+ Y+GETF   +  NN         S    +R
Sbjct: 101 TNPAPFLL---------SPILNLPVSFGSAYVGETFSCTLCANNDLVTASSSSSPGKRIR 151

Query: 119 DVVIKAEIQT----DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY 174
           DV I AE++T       ++ L   S +P + + AG     +V  D+KE G H L  T  Y
Sbjct: 152 DVRIDAEMKTPGPGPAHKLPL--ASGAPAD-LAAGETLQRVVSFDLKEEGNHVLAVTVSY 208

Query: 175 ---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLY 231
              S+  G  +   + ++FI    L VRTKV +  +G    ++   LEA +EN ++  + 
Sbjct: 209 YEASETSGRTRTFRKLYQFICKASLIVRTKVGL--LGDEGGRKRWVLEAQLENCSQDVMQ 266

Query: 232 MDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGS 291
           +D+V  E  +      L+ +G   ++    + +  P  + +    +     + +  + G 
Sbjct: 267 LDKVGMEAERG-----LRCEG--CNWAEGEKPVLHPGEVEQVCFVVEEEEREEESRADGD 319

Query: 292 SSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTI 330
           +      G  V G L I WR  +G  G L T + LGT +
Sbjct: 320 A-----DGRVVFGVLGIGWRGEMGNRGFLSTGK-LGTRV 352


>gi|349605672|gb|AEQ00830.1| UPF0533 protein C5orf44-like protein-like protein, partial [Equus
           caballus]
          Length = 170

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 45/157 (28%), Positives = 81/157 (51%), Gaps = 6/157 (3%)

Query: 278 HNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIEL 337
             YLY LK     +     ++G  V+GKL I W+TNLGE GRLQT Q+        ++ L
Sbjct: 2   RQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRL 61

Query: 338 NVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPV 397
           ++  +P  V +++PF +  K+TN +++     ++ L   ++       I+G ++  L P 
Sbjct: 62  SLEAIPDTVNLEEPFHITCKITNCSER---TMDLVLEMCNTSSIHWCGISGRQLGKLHPS 118

Query: 398 EAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
            +       L L+++  G+Q ++G+ + D   K TY+
Sbjct: 119 SSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 152


>gi|354489776|ref|XP_003507037.1| PREDICTED: UPF0533 protein C5orf44 homolog [Cricetulus griseus]
          Length = 282

 Score = 74.7 bits (182), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 62/251 (24%), Positives = 113/251 (45%), Gaps = 17/251 (6%)

Query: 183 YLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQN 242
           +L +   F  S PL V+TK           ++  FLE  IEN + S +++ +V  +  + 
Sbjct: 2   FLSKICLFYPSEPLDVKTKF------YNSDKDDLFLEVQIENISHSTVFIREVSLKLPEM 55

Query: 243 WSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
           ++   L       +   +    F     +++  G H YLY L+           + G   
Sbjct: 56  YTEEALNT----LNLEGEDECTFGTRTFLQATEGRH-YLYHLQFKEEYLEKARTLSGLME 110

Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT 362
           +GKL+I W+  LGE   L T  +     +  E++L++ ++P  V  ++PF +  K+TN T
Sbjct: 111 MGKLEIVWKRELGEMPMLHTVPLRREAPSCGELKLSLEKIPDTVAREEPFQITCKITNCT 170

Query: 363 DKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGI 422
           DK+    ++ L   D+   +    +G +   L   +   S  F + L+  +LG++ I+GI
Sbjct: 171 DKK---MKLLLKMFDTTSVRWCGCSGRK---LGRFKTGSSLSFTVTLLCLQLGLRSISGI 224

Query: 423 TVFDKLEKITY 433
            + D   K  Y
Sbjct: 225 RIIDATLKTKY 235


>gi|380488796|emb|CCF37134.1| hypothetical protein CH063_08544 [Colletotrichum higginsianum]
          Length = 342

 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 90/363 (24%), Positives = 148/363 (40%), Gaps = 82/363 (22%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL ++ P+R               P+  +NLP   +       ++  
Sbjct: 16  HSVSLKVLRLSRPSLVIQHPVR--------------PPLTPANLPADPTPASLAYDTTAS 61

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN---------------- 111
           T  + FLL         S +L LP +FG+ Y+GE F   +  N+                
Sbjct: 62  TNPAPFLL---------SPILNLPLSFGSAYVGEVFSCTLCANHDVPDPAMAPLGPGGLP 112

Query: 112 ------SSTLEVRDVVIKAEIQT-DKQRILLLDTSK-SPVESIRA-----GGRYDFIVEH 158
                      +RDV I+AE++T     I  L+ S  +P +  +      G     IV  
Sbjct: 113 LAGAAPPKRKSIRDVRIEAEMKTPGANSIQKLELSPPNPSDDTKGTDLDPGDTLQRIVNF 172

Query: 159 DVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEI 215
           D+KE G H L  T  Y   ++  G+ +   + ++FI  + L VRTK+  +   A H    
Sbjct: 173 DLKEEGNHVLAVTVSYYEATETSGKTRTFRKLYQFICKSSLIVRTKIGPLAPAARHGGRR 232

Query: 216 TFLEACIENHTKSNLYMDQVEFEPSQ-------NWSATMLKADGPHSDYNAQSREIFKPP 268
             LEA +EN ++  + +++V  + +        NW A            +  +R +  P 
Sbjct: 233 WALEAQLENCSEDVIQLEKVVLDLADGLGYTDCNWVAAGGGG------SDGDARPVLHP- 285

Query: 269 VLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN---VLGKLQITWRTNLGEPGRLQTQQI 325
                 G +    +   ++     SP   QG +   + G L I WR  +G  G L T + 
Sbjct: 286 ------GEVEQVCF---VVEEAEGSPRAQQGEDGRIMFGILGIGWRGEMGNRGFLSTGK- 335

Query: 326 LGT 328
           LGT
Sbjct: 336 LGT 338


>gi|339254156|ref|XP_003372301.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316967316|gb|EFV51754.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 384

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 86/370 (23%), Positives = 151/370 (40%), Gaps = 77/370 (20%)

Query: 95  GAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDF 154
           G +YLGE F  YISI N +     + V + +IQT+  R+LL    +    ++ AG     
Sbjct: 69  GNVYLGEVFSCYISILNGTG----ETVTEVDIQTNATRVLLPFKYQDTSLTLNAGQSVGD 124

Query: 155 IVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQE 214
            + H+                              F V  PL V TK+       +   +
Sbjct: 125 SISHE------------------------------FPVLKPLDVCTKL------CSAEND 148

Query: 215 ITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHS------------DYNAQSR 262
             +LEA ++N T +++ M++V  EP  + +  ++ +D   S            + N QS+
Sbjct: 149 TVYLEAQVQNTTDADMIMERVALEPVPDLAPILVPSDFNDSYICTVLYRIIIIERNFQSK 208

Query: 263 E-------IF--KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTN 313
                   +F  K   LI+ G  +  +LY +  +    S          + KL + WRT 
Sbjct: 209 TFPRILMLLFREKNCCLIKPGA-VRQFLYGISCIKQDVSWIA-------VAKLNMVWRTT 260

Query: 314 LGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWL 373
            G  GR+QT  +  T     +++L V+  PS V I  PF       + +   +   ++ L
Sbjct: 261 NGRRGRVQTCPLQKTVSGCGDLKLKVISGPSAVKIRLPF-------HVSSFSERALQLTL 313

Query: 374 SQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITY 433
           + +D+  +K ++ N L  +   P+    + +  L L A   G+Q  +G+  +D   K  Y
Sbjct: 314 TLDDT-LQKGLLWNSLSEVQFEPLLPAKTMNVTLTLFAECAGLQFASGMKFYDCNAKRRY 372

Query: 434 DSLPDLEIFV 443
           +      +FV
Sbjct: 373 EYNDVFHVFV 382


>gi|189196338|ref|XP_001934507.1| hypothetical protein PTRG_04174 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187980386|gb|EDU47012.1| hypothetical protein PTRG_04174 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 334

 Score = 73.9 bits (180), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 87/343 (25%), Positives = 142/343 (41%), Gaps = 58/343 (16%)

Query: 7   THSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
            HS++ +V+R       V   L+   TD   G      P  A+  P   S  +  +  + 
Sbjct: 16  AHSVSLKVLR-------VSQILKFAITD---GVPRLSRPSLATQYPLPNSKSLGISPRAS 65

Query: 67  LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVV 121
           L Y S+   +D+ D   LS  L LP+AFG+ Y+GETF   +  NN      +   +  V 
Sbjct: 66  LAYPSQ---NDANDQFILSPALNLPEAFGSAYVGETFSCTLCANNELDPSDNAKAISGVR 122

Query: 122 IKAEIQTDKQRILLLDTSKSPVE-----------SIRAGGRYDFIVEHDVKELGAHTLVC 170
           I+ ++QT        + + SP++           S   G     I+  ++KE G H L  
Sbjct: 123 IQGDMQTPS------NPTGSPLDLSGLSGEDDGVSPGPGESLQRILRFELKEDGNHVLAV 176

Query: 171 TALYSD---GEGER-----KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACI 222
           T  Y +   GEG+      +   + ++F+    LSVRTK    ++G  +      LEA +
Sbjct: 177 TVTYMETALGEGKAASGRVRTFRKLYQFVAQQLLSVRTKAG--EMGHRNGSSRYLLEAQL 234

Query: 223 ENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNA---QSREIFKPPVLIRSGGGIHN 279
           EN  ++ + ++ V   P     +  L  D   +  NA     R++ +   L+    G  +
Sbjct: 235 ENMGEAAVCLETVNVNPKPPLRSRSLNWDMQSAGLNAPILSPRDVVQVAFLLEHQAGDDD 294

Query: 280 YLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
            +             V      VLG+L I WR+ LG+ G L T
Sbjct: 295 DM----------PDSVTEDNKRVLGQLAIQWRSALGDRGSLST 327


>gi|424513630|emb|CCO66252.1| predicted protein [Bathycoccus prasinos]
          Length = 542

 Score = 73.2 bits (178), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 69/220 (31%), Positives = 94/220 (42%), Gaps = 60/220 (27%)

Query: 156 VEHDVKELGAHTLVCTALYSD---------------GE------GERKYLPQFFKFIVSN 194
           V    K LG HTL CTA Y D               GE      GERK   ++F F V+N
Sbjct: 151 VHFSAKHLGEHTLKCTAEYVDCPYDERSAVAIMNVAGENTVYDVGERKRAVRYFSFDVTN 210

Query: 195 PLSVRTKVRVV----------KVGATHFQEITFLEACIENH--------TKSNLYMDQVE 236
           PL VRTK R V              +  +E  FLEA IEN         TK +L +D+  
Sbjct: 211 PLHVRTKTRRVFTRSRSEDSDNNSTSSSKEKVFLEATIENVDKAAARLITKVHLIVDE-- 268

Query: 237 FEPSQNWSATMLKADGPHSDYNAQSREIF-----KPPVLIRSGGGIHNYLYQLKMLSH-- 289
               +  ++T L  +       A    +F     K  + ++ GGG  ++L+++       
Sbjct: 269 ----RRHASTALFPE------IADEETLFDVGNNKNQIYLQKGGGAAHFLFEITETDEWG 318

Query: 290 --GSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILG 327
              S +     G + LG L+I W  + GEPGRLQTQ IL 
Sbjct: 319 VSSSMTTTSTSGKDELGTLEICWLGSTGEPGRLQTQPILA 358


>gi|346319202|gb|EGX88804.1| DUF974 domain-containing protein [Cordyceps militaris CM01]
          Length = 363

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 82/303 (27%), Positives = 127/303 (41%), Gaps = 73/303 (24%)

Query: 84  LSGLLVLPQAFGAIYLGETFCSYISINNSST----------LEVRDVVIKAEIQT---DK 130
           LS +L LP +FG+ Y+GETF   +  NN  T           ++RDV I+AE++T     
Sbjct: 76  LSPVLNLPVSFGSAYVGETFRCTLCANNDLTHDDGGDTPAVKKIRDVRIEAEMKTPGLGH 135

Query: 131 QRILLLDTSKS-PVESIRAGGRYDF--------IVEHDVKELGAHTLVCTALYSDG---E 178
           Q    L+     P +   +G   D         +V  D+KE G H L  T  YS+     
Sbjct: 136 QAAQQLELGPPLPADEGASGAGADLAPGATLQRVVSFDLKEEGNHVLAVTVSYSESTETS 195

Query: 179 GERKYLPQFFKFIVSNPLSVRTKVRVV------KVGATHFQEITFLEACIENHTKSNLYM 232
           G  +   + ++FI    L VRTKV V+      K G    +    LEA +EN +   + +
Sbjct: 196 GRTRTFRKLYQFICKPSLIVRTKVGVLPCPSASKQGRRPPRRRWVLEAQLENCSDDTMQL 255

Query: 233 DQVEFEPSQ-------NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK 285
           ++V  EP+        NW+A    ADGP +      + + +P       G +    + ++
Sbjct: 256 ERVVVEPAPGLAYRDCNWTA----ADGPTA-----VKPVLRP-------GEVEQVCFVVE 299

Query: 286 MLSHGSS---------------SPVKVQGSN---VLGKLQITWRTNLGEPGRLQTQQILG 327
            LS  +                +  +  G +   V G L I WR  +G  G L T + LG
Sbjct: 300 ALSRAAQVARGGVEADEAVDVVAEAEAGGPDARIVFGVLGIGWRGEMGSRGFLSTGK-LG 358

Query: 328 TTI 330
           T +
Sbjct: 359 TRL 361


>gi|340522585|gb|EGR52818.1| predicted protein [Trichoderma reesei QM6a]
          Length = 824

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 92/349 (26%), Positives = 147/349 (42%), Gaps = 68/349 (19%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL +PSL  + P  +DP   F   +    P  AS    L  +  +TN     
Sbjct: 517 HSVSVKVLRLSQPSLVTQHP--IDPP--FSPPNTKSQPAPAS----LAYAPSSTN----- 563

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN----SSTLEVRDVVIK 123
                       D   LS +L LP +FG+ Y+GETF   +  NN     +   +RDV I+
Sbjct: 564 -----------PDPFLLSPILNLPVSFGSAYVGETFSCTLCANNDLPPDAAKRIRDVRIE 612

Query: 124 AEIQT----DKQRILLLDTSKSPVES-------IRAGGRYDFIVEHDVKELGAHTLVCTA 172
           AE++T      Q++ L   +     +       +  GG    IV  D+KE G H L  T 
Sbjct: 613 AEMKTPGLGGTQKLELGPANTHEGAAAGGGGVDLEPGGTLQRIVGFDLKEEGNHVLAVTV 672

Query: 173 LY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITF-LEACIENHTKS 228
            Y   ++  G  +   + ++FI    L VRTKV  +    +      + LEA +EN ++ 
Sbjct: 673 SYYEATETSGRTRTFRKLYQFICKASLIVRTKVSGLDANTSSSGTRKWILEAQLENCSED 732

Query: 229 NLYMDQV--EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKM 286
            + +++V  + E    +      +DG         + +  P             + Q+  
Sbjct: 733 VMQLEKVVLDVEDGLGYHDCNWASDG-------DQKPVLHP-----------GEIEQVCF 774

Query: 287 LSH--GSSSPVKV--QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTIT 331
           L H  G+ S V++   G  + G L I WR  +G  G L T + LG  I 
Sbjct: 775 LVHEKGADSGVRMTPDGRIIFGVLGIGWRGEMGCRGYLSTGK-LGARIA 822


>gi|320037981|gb|EFW19917.1| hypothetical protein CPSG_03092 [Coccidioides posadasii str.
           Silveira]
          Length = 342

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 88/361 (24%), Positives = 143/361 (39%), Gaps = 81/361 (22%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + PL          ED  + P+  S   P  ++D         
Sbjct: 17  HSVSLKVLRLSRPSLSYQHPL---------PEDFANVPVQPSLSYPSSTAD--------- 58

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
               +F+L  +         L+LP AFG+ Y+GETF   +S NN      ++  V  + I
Sbjct: 59  ---KQFILSPN---------LMLPPAFGSAYVGETFSCSLSANNEFLRGDASRVVTSIRI 106

Query: 123 KAEIQTDKQRILLLDTSKSPVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYSD- 176
            AE+QT  Q + L     S     ++GG         IV  D+KE G H L     Y++ 
Sbjct: 107 LAEMQTPSQVVPLELYPSSDDNDTKSGGIAQVESMQRIVRFDLKEEGNHVLAVGVSYTET 166

Query: 177 --------------GEGERKYLPQFFKFIVSNPLSVRTKV-----------RVVKVGATH 211
                           G  +   + ++F+    L+VRTK             +   G T 
Sbjct: 167 MITQSSDAHGSVQASGGRVRTFRKLYQFVAQPCLNVRTKATELPPQEVDNRSLGPYGKTK 226

Query: 212 FQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS------REIF 265
                 LEA +EN     + +  V   P   + +  L  D   S  + +S      R++ 
Sbjct: 227 LYRFA-LEAQLENVGDGIITLGAVTLNPKPPFKSRSLNWDF-ESSADKESIPTLSPRDVL 284

Query: 266 KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
           +   ++    G  + L  L+         +  +G   LG+L + WR+ LG+ G L T  +
Sbjct: 285 QIAFIVEQEHGQQDGLETLQ-------KDMNREGRATLGQLSLEWRSALGDRGFLTTGNL 337

Query: 326 L 326
           +
Sbjct: 338 M 338


>gi|407928991|gb|EKG21830.1| hypothetical protein MPH_00750 [Macrophomina phaseolina MS6]
          Length = 327

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 85/340 (25%), Positives = 142/340 (41%), Gaps = 61/340 (17%)

Query: 6   GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
           G HS++ +V+RL RPSL    PL                        P    + T +  +
Sbjct: 15  GPHSVSLKVLRLSRPSLAHSFPLPQ----------------------PAQPDEFTISPKA 52

Query: 66  DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDV 120
            L Y +     D  D   +S LL LP+AFG+ Y+GE F   +  NN       +  +  V
Sbjct: 53  SLAYPT----ADPKDLFLVSPLLKLPEAFGSAYVGEAFSCTLCANNELLPGDESKTISGV 108

Query: 121 VIKAEIQTDK--QRILLLDTSKSPVESIRA----GGRYDFIVEHDVKELGAHTLVCTALY 174
            I A++QT      I L    K   E+++     G     I+  D+KE G+HTL  T  Y
Sbjct: 109 KIAADMQTPSAPSGIPLELEPKDGPETVQGTVGPGQSVQKILTFDLKEEGSHTLAVTVTY 168

Query: 175 SD----GEGER-----KYLPQFFKFIVSNPLSVRTKVR--VVKVGATHFQEITFLEACIE 223
           ++    GEG+      +   + ++F+    +SV+TK      K G + F     LEA +E
Sbjct: 169 TETQMAGEGKAAGGRVRTFRKLYQFVAQQLISVKTKTSELTTKGGPSKF----VLEAQLE 224

Query: 224 NHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQ 283
           N  + +L ++ V        +    KA+  ++  +A   E    PVL    G +    + 
Sbjct: 225 NLGEGSLSLEPVIVN-----AEAPFKANSLNTPLSASPEEPPHLPVL--GPGDVSQVAFI 277

Query: 284 LKMLSHGSSSPVKVQGSN--VLGKLQITWRTNLGEPGRLQ 321
           L+     ++   ++      ++  L + WR+ +G  G L+
Sbjct: 278 LEQQEGATAGETRLSAGRRMLVRNLWVQWRSPMGGRGSLK 317


>gi|119188243|ref|XP_001244728.1| hypothetical protein CIMG_04169 [Coccidioides immitis RS]
 gi|392871443|gb|EAS33358.2| hypothetical protein CIMG_04169 [Coccidioides immitis RS]
          Length = 342

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 88/361 (24%), Positives = 143/361 (39%), Gaps = 81/361 (22%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + PL          ED  + P+  S   P  ++D         
Sbjct: 17  HSVSLKVLRLSRPSLSYQHPL---------PEDFANVPVQPSLSYPSSTAD--------- 58

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
               +F+L  +         L+LP AFG+ Y+GETF   +S NN      ++  V  + I
Sbjct: 59  ---KQFILSPN---------LMLPPAFGSAYVGETFSCSLSANNEFLRGDASRVVTSIRI 106

Query: 123 KAEIQTDKQRILLLDTSKSPVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYSD- 176
            AE+QT  Q + L     S     ++GG         IV  D+KE G H L     Y++ 
Sbjct: 107 LAEMQTPSQVVPLELYPSSDDNDTKSGGIAQVESMQKIVRFDLKEEGNHVLAVGVSYTET 166

Query: 177 --------------GEGERKYLPQFFKFIVSNPLSVRTKV-----------RVVKVGATH 211
                           G  +   + ++F+    L+VRTK             +   G T 
Sbjct: 167 MITPSSDAHGSVQASGGRVRTFRKLYQFVAQPCLNVRTKATELPPQEVDNRSLGPYGKTK 226

Query: 212 FQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS------REIF 265
                 LEA +EN     + +  V   P   + +  L  D   S  + +S      R++ 
Sbjct: 227 LYRFA-LEAQLENVGDGIITLGAVTLNPKPPFKSRSLNWDF-ESSADKESIPTLSPRDVL 284

Query: 266 KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
           +   ++    G  + L  L+         +  +G   LG+L + WR+ LG+ G L T  +
Sbjct: 285 QIAFIVEQEHGQQDGLETLQ-------KDMNREGRATLGQLSLEWRSALGDRGFLTTGNL 337

Query: 326 L 326
           +
Sbjct: 338 M 338


>gi|345314305|ref|XP_001518717.2| PREDICTED: UPF0533 protein C5orf44 homolog, partial
           [Ornithorhynchus anatinus]
          Length = 129

 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 43/123 (34%), Positives = 60/123 (48%), Gaps = 32/123 (26%)

Query: 79  ADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDT 138
           A+ + L  +L LPQ FG I+LGETF SYIS++N S+  V+D+++K    + +QR      
Sbjct: 17  AEILTLGEMLTLPQNFGNIFLGETFSSYISVHNDSSQMVKDILVKV---SGRQR------ 67

Query: 139 SKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSV 198
                                  E     LVC   Y+   GE+ Y  +FFKF V  PL V
Sbjct: 68  -----------------------EAAPGRLVCAVSYTTQSGEKMYFRKFFKFQVLKPLDV 104

Query: 199 RTK 201
           +TK
Sbjct: 105 KTK 107


>gi|320593998|gb|EFX06401.1| duf974 domain containing protein [Grosmannia clavigera kw1407]
          Length = 1072

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 85/354 (24%), Positives = 143/354 (40%), Gaps = 69/354 (19%)

Query: 8    HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
            H ++ +V+RL  PSL  + P+                P++ +  PP + + +        
Sbjct: 751  HPISLKVLRLSHPSLATQYPVAA--------------PLSTALPPPTVPASIAYGGGGPD 796

Query: 68   TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN---------------- 111
            +  +      + D   LS +L LP +FG+ Y+GETF   +  N+                
Sbjct: 797  SAAT------NTDPFLLSPVLNLPPSFGSAYVGETFACTLCANHDAADVEDGGWSKEKAA 850

Query: 112  SSTLEVRDVVIKAEIQTDK-----QRILLLDTSKS--------PVESIRAGGRYDFIVEH 158
            S+   +RDV I+AE++T       + +L  +T               + +G     +V  
Sbjct: 851  SAVASIRDVQIEAEMKTPSAAEPVKLVLGPETDDGDGAGLGLHAGTDLASGQTLQKVVRF 910

Query: 159  DVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVG-ATHFQE 214
            D+KE G H L  T  Y   ++  G  +   + ++FI    L VRTK      G A   + 
Sbjct: 911  DLKEEGNHVLAVTVSYYEATETSGRTRTFRKLYQFICKASLIVRTKAGPYAAGRAGDMRR 970

Query: 215  ITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSG 274
               LEA +EN  +  + +++VE E  ++ +            Y+    E  + PVL    
Sbjct: 971  RWALEAQLENCGEDVIQLERVELELERSLT------------YDKYDWEDGQKPVL--HP 1016

Query: 275  GGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGT 328
            G +    + L+    G   P +  G  + G L I WR+ +G  G L T   LGT
Sbjct: 1017 GEVEQVCFLLEETGPG-LVPEQPNGRLLFGVLGIGWRSEMGNRGFL-TTGTLGT 1068


>gi|119501216|ref|XP_001267365.1| hypothetical protein NFIA_109620 [Neosartorya fischeri NRRL 181]
 gi|119415530|gb|EAW25468.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 352

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 84/329 (25%), Positives = 129/329 (39%), Gaps = 63/329 (19%)

Query: 49  SNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYIS 108
           SN  PL +++   ++ + L+Y S      + D   LS  L LP +FG+ Y+GETF   +S
Sbjct: 31  SNQYPLPAANTKISRKASLSYPS----DSTDDKFILSPNLTLPPSFGSAYVGETFACTLS 86

Query: 109 INN-----SSTLEVRDVVIKAEIQTDKQRILL-LDTSKSPV--ESIRAGGRYDFIVEHDV 160
            NN      ++  V  V I AE+QT  Q   L L+ +  P   E ++ G     IV  D+
Sbjct: 87  ANNELPEDETSRVVTSVRIVAEMQTPSQVASLDLEPANDPAQTEGLQRGQSLQKIVRFDL 146

Query: 161 KELGAHTLVCTALYSD---------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK----- 206
           KE G H L  +  Y++           G  +   + ++F+    LSVRTK   +      
Sbjct: 147 KEEGNHILAVSISYTETLIGSDAQAASGRVRTFRKLYQFVAQPCLSVRTKSSELAPLEVE 206

Query: 207 ------VGATHFQEITFLEACIENHTKSNLYM-----------------DQVEFEPSQNW 243
                  G T       LEA +EN     + +                  Q +  P   +
Sbjct: 207 NKALGPYGKTRLLRFA-LEAQLENVGDGTVVVKVCGWGILLKISFLTARQQTKLNPKPPF 265

Query: 244 SATMLKADGPHSDY------NAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
            A  L  D    D           R++ +   L+    G    L  L+         ++ 
Sbjct: 266 RAVSLNWDLERPDKVDSQPPTLNPRDVLQVAFLVEQEEGQQEGLEALQ-------KDLRR 318

Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
            G  VLG+L I WR  +G+ G L T  +L
Sbjct: 319 DGRAVLGQLSIEWRGAVGDKGFLTTGNLL 347


>gi|367055168|ref|XP_003657962.1| hypothetical protein THITE_75670 [Thielavia terrestris NRRL 8126]
 gi|347005228|gb|AEO71626.1| hypothetical protein THITE_75670 [Thielavia terrestris NRRL 8126]
          Length = 351

 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 97/365 (26%), Positives = 151/365 (41%), Gaps = 69/365 (18%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + PL          +     P   S+ PPL +S   +N + + 
Sbjct: 16  HSVSLKVLRLSRPSLVAQYPL----------QPPLSSPT--SHPPPLPASLAYSNGAGNA 63

Query: 68  T-YRSRFLLHDSADSIG---LSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVR 118
           +   +   L     +     LS +L LP +FG+ Y+GETF   +  N+     +    +R
Sbjct: 64  SGANADNPLQPPPTNPAPFVLSPILNLPPSFGSAYVGETFSCTLCANHDIPEGAPPKTIR 123

Query: 119 DVVIKAEIQTDKQ----RILLLDTSKS------PVESIRAGGRYDFIVEH---------- 158
           DV I+AE++T       ++ LL  + S      P  +    G  D    H          
Sbjct: 124 DVRIEAEMKTPSSPAPIKLALLPYTSSDANNDAPTTTTTTAG-VDLTPPHATTLQRILAF 182

Query: 159 DVKELGAHTLVCTALYSDGE---GERKYLPQFFKFIVSNPLSVRTKVRVVKV---GATHF 212
           D+KE G H L  T  Y +     G  +   + ++F     L VRTK   +     GA  +
Sbjct: 183 DLKEEGNHVLAVTVSYYEASALAGRTRTFRKLYQFACKASLIVRTKPGALPARPGGARRW 242

Query: 213 QEITFLEACIENHTKSNLYMDQV--EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVL 270
                LEA +EN ++  + +++V  E EP        +  +G         R   K PVL
Sbjct: 243 ----VLEAQLENCSEEGMLLERVGLELEP----GLACVDCNG------GMGRPRRKRPVL 288

Query: 271 IRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTI 330
               G      + ++    G     +V G  V G LQI WR+ +G  G L T + LGT  
Sbjct: 289 --QPGETEQVCFVIEEEEKGRVE--EVDGRVVFGVLQIGWRSEMGNRGFLSTGK-LGTRF 343

Query: 331 TSKEI 335
              +I
Sbjct: 344 VKPKI 348


>gi|327294773|ref|XP_003232082.1| hypothetical protein TERG_07700 [Trichophyton rubrum CBS 118892]
 gi|326466027|gb|EGD91480.1| hypothetical protein TERG_07700 [Trichophyton rubrum CBS 118892]
          Length = 343

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 84/352 (23%), Positives = 135/352 (38%), Gaps = 63/352 (17%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL ++ P+ V                          SD   ++ + L
Sbjct: 17  HSISLKVLRLSRPSLSLQHPIPV--------------------------SDAQFSRITSL 50

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSS----TLEVRDVVIK 123
           +Y S      S     LS  L LP +FG+ Y+GETF   +S NN +    +  V  V I+
Sbjct: 51  SYPS----ATSDSQFILSPNLTLPPSFGSAYVGETFACSLSANNEALGGNSRVVTSVRIQ 106

Query: 124 AEIQTDKQRIL--LLDTSKSPVESIRAG--GRYDFIVEHDVKELGAHTLVCTALYSD--- 176
           A++QT  Q I   LL   + P +S           I+  D+KE G H L  +  Y++   
Sbjct: 107 ADMQTPSQTIPLELLPADEEPKKSTGTSTTASVQKIIHFDLKEEGNHVLAVSVNYTETTM 166

Query: 177 ------------GEGERKYLPQFFKFIVSNPLSVRTKV------RVVKVGATHFQEITF- 217
                         G  +   + ++F+    LSVRTK        +    A  F +    
Sbjct: 167 AANKDAPGGFQASGGRARTFRKLYQFVAQPCLSVRTKATELAPREIEDRSAGPFGKTRLL 226

Query: 218 ---LEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSG 274
              LEA +EN     + +          + +T L  D    D + +       P  +   
Sbjct: 227 RFALEAQLENVGDGMIVLGVPTLNSKPPFKSTSLNWDFYEKDGDQKKIAPTLAPRDVVQI 286

Query: 275 GGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
             +       +     +   +   G   LG+L I WR+ +GE G L T  ++
Sbjct: 287 AFLVEQEEGEQEGLEATQKDISRDGRTALGQLSIQWRSAMGEKGYLTTGNLM 338


>gi|326469947|gb|EGD93956.1| hypothetical protein TESG_01485 [Trichophyton tonsurans CBS 112818]
          Length = 350

 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 90/362 (24%), Positives = 137/362 (37%), Gaps = 80/362 (22%)

Query: 5   PGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
           P T +L   V RL RPSL ++ P+ V                          SD   ++ 
Sbjct: 24  PATDAL---VHRLSRPSLSLQHPIPV--------------------------SDAQFSRI 54

Query: 65  SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSS----TLEVRDV 120
           + L+Y S      S     LS  L LP +FG  Y+GETF   +S NN +    +  V  V
Sbjct: 55  ASLSYPS----ATSDSQFILSPNLTLPPSFGTAYVGETFACSLSANNEALGGNSRVVTSV 110

Query: 121 VIKAEIQTDKQRIL--LLDTSKSPVES--IRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
            I+A++QT  Q I   LL T + P +S    A      I+  D+KE G H L  +  Y++
Sbjct: 111 RIQADMQTPSQTIPLELLPTGEEPAKSAGTSATASIQKIIHFDLKEEGNHVLAVSVNYTE 170

Query: 177 ---------------GEGERKYLPQFFKFIVSNPLSVRTKV------RVVKVGATHFQEI 215
                            G  +   + ++F+    LSVRTK        +    A  F + 
Sbjct: 171 TMMAPNKDAASGFQASGGRARTFRKLYQFVAQPCLSVRTKATELAPREIEDRSAGPFGKT 230

Query: 216 TF----LEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS-------REI 264
                 LEA +EN     + +          + +T L  D    D   +        R++
Sbjct: 231 RLLRFALEAQLENVGDGMIVLGIPTLNSKPPFKSTSLNWDFFEKDGGEKKIAPTLAPRDV 290

Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
            +   L+    G    L         +   +   G   LG+L I WR+ +GE G L T  
Sbjct: 291 VQIAFLVEQEEGQQEGL-------EATQKDISRDGRTALGQLSIQWRSAMGEKGYLMTGN 343

Query: 325 IL 326
           ++
Sbjct: 344 LM 345


>gi|389640393|ref|XP_003717829.1| hypothetical protein MGG_01105 [Magnaporthe oryzae 70-15]
 gi|16565967|gb|AAL26319.1| hypothetical protein [Magnaporthe grisea]
 gi|351640382|gb|EHA48245.1| hypothetical protein MGG_01105 [Magnaporthe oryzae 70-15]
 gi|440466337|gb|ELQ35609.1| DUF974 domain-containing protein [Magnaporthe oryzae Y34]
 gi|440487884|gb|ELQ67649.1| DUF974 domain-containing protein [Magnaporthe oryzae P131]
          Length = 339

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 90/362 (24%), Positives = 149/362 (41%), Gaps = 82/362 (22%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + P++  P            P A +   P           + L
Sbjct: 15  HSISLKVLRLSRPSLVAQYPVK-SPEG--------SQPSAGAGSHP-----------ASL 54

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVRDVVI 122
            Y S      + D   LS +L LP +FG+ Y+GETF   +  N+     ++  +VRDV I
Sbjct: 55  AYGSPD--GTNPDPFILSPILNLPPSFGSAYVGETFSCTLCANHDVPDGAAARQVRDVRI 112

Query: 123 KAEIQTDKQRILLL-----------DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCT 171
           +AE++T      ++                    +R G     IV  D+KE G H L  T
Sbjct: 113 EAEMKTPGSAAGVVTKLDLGPNGGGGGEGDGGVDLREGETLQRIVRFDLKEEGNHVLAVT 172

Query: 172 ALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEIT------------ 216
             Y   ++  G  +   + ++FI  + L VRTK   +  G+    E +            
Sbjct: 173 VSYYEATETSGRTRTFRKLYQFICKSSLIVRTKASQLPGGSGAMTETSSAGGKEEQQQSQ 232

Query: 217 -------FLEACIENHTKSNLYMDQV--EFEPSQNWSATMLKADGPHSDYNAQSREIFKP 267
                   LEA +EN ++  + +++V  + EP   ++           +++A  R+    
Sbjct: 233 LRRRRQWVLEAQLENCSEDAIQLERVVLDLEPGLVYT---------DCNWDADERQ---K 280

Query: 268 PVLIRSGGGIHNYLYQLKMLSHGSSSPVKV-QGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
           PVL  S       + Q+  +   + +  +V  G  V G L + WR  +G  G L T + L
Sbjct: 281 PVLHPS------EVEQVCFVVQEAGAECEVMDGKVVFGVLGVGWRGEMGSRGFLSTGK-L 333

Query: 327 GT 328
           GT
Sbjct: 334 GT 335


>gi|414870886|tpg|DAA49443.1| TPA: hypothetical protein ZEAMMB73_957859 [Zea mays]
          Length = 70

 Score = 69.3 bits (168), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 33/69 (47%), Positives = 50/69 (72%)

Query: 378 SDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLP 437
           S E++ V++NG + + L  VEAF S  F L+++ T+LGVQ+I+GIT++   EK  Y+ LP
Sbjct: 2   SGEDRAVLVNGPQKLILPLVEAFESIKFDLSMVTTQLGVQKISGITMYAVQEKKYYEPLP 61

Query: 438 DLEIFVDQD 446
           D+EIFVD +
Sbjct: 62  DIEIFVDAE 70


>gi|303316452|ref|XP_003068228.1| hypothetical protein CPC735_002510 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240107909|gb|EER26083.1| hypothetical protein CPC735_002510 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 342

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 86/361 (23%), Positives = 142/361 (39%), Gaps = 81/361 (22%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + PL          ED  + P+  S   P  ++D         
Sbjct: 17  HSVSLKVLRLSRPSLSYQHPL---------PEDFANVPVQPSLSYPSSTAD--------- 58

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
               +F+L  +         L+LP AFG+ Y+GETF   +S NN      ++  V  + I
Sbjct: 59  ---KQFILSPN---------LMLPPAFGSAYVGETFSCSLSANNEFLRGDASRVVTSIRI 106

Query: 123 KAEIQTDKQRILLLDTSKSPVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYSD- 176
            A++QT  Q + L           ++GG         IV  D+KE G H L     Y++ 
Sbjct: 107 LADMQTPSQVVPLELYPSGDDNDTKSGGIAQVESMQRIVRFDLKEEGNHVLAVGVSYTET 166

Query: 177 --------------GEGERKYLPQFFKFIVSNPLSVRTKV-----------RVVKVGATH 211
                           G  +   + ++F+    L+VRTK             +   G T 
Sbjct: 167 MITQSSDAHGSVQASGGRVRTFRKLYQFVAQPCLNVRTKATELPPQEVDNRSLGPYGKTK 226

Query: 212 FQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS------REIF 265
                 LEA +EN     + +  V   P   + +  L  D   S  + +S      R++ 
Sbjct: 227 LYRFA-LEAQLENVGDGIITLGAVTLNPKPPFKSRSLNWDF-ESSADKESIPTLSPRDVL 284

Query: 266 KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
           +   ++    G  + L  L+         +  +G   LG+L + WR+ LG+ G L T  +
Sbjct: 285 QIAFIVEQEHGQQDGLETLQ-------KDMNREGRATLGQLSLEWRSALGDRGFLTTGNL 337

Query: 326 L 326
           +
Sbjct: 338 M 338


>gi|322705248|gb|EFY96835.1| DUF974 domain-containing protein [Metarhizium anisopliae ARSEF 23]
          Length = 368

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 91/356 (25%), Positives = 147/356 (41%), Gaps = 78/356 (21%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RP+L  + P                 P+ A+    L SS      S++ 
Sbjct: 57  HSVSVKVLRLSRPALVPQYP---------------SSPLPATKEAFLPSSLSYKTPSTN- 100

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTL------------ 115
              + FL         LS +L LP +FG+ Y+GETF   +  NN  T             
Sbjct: 101 --PAPFL---------LSPILNLPVSFGSAYVGETFSCTLCANNDLTTTSSSSSSPSPSP 149

Query: 116 ----EVRDVVIKAEIQT----DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHT 167
                +RDV I AE++T       R+ L   S +P + + AG     +V  D+KE G H 
Sbjct: 150 PPAKHIRDVRIDAEMKTPGPGPAHRLPL--ASGAPAD-LAAGETLQRVVSFDLKEEGNHV 206

Query: 168 LVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEIT----FLEA 220
           L  T  Y   S+  G  +   + ++F+    L VRTKV ++  G++     +     LEA
Sbjct: 207 LAVTVSYYEASETSGRTRTFRKLYQFMCKAGLVVRTKVGLLGGGSSSSSRSSRKRWVLEA 266

Query: 221 CIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNY 280
            +EN ++  + +++V  E  +      L+ +G   ++    R +  P       G +   
Sbjct: 267 QLENCSQDVMQLEEVGMEAERG-----LRCEG--CNWAEGERPVLHP-------GEVEQV 312

Query: 281 LY------QLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTI 330
            +      +       S +     G  V G L I WR  +G  G L T + LGT +
Sbjct: 313 CFVVVEEDEEDEDEEESGADGDADGRVVFGVLGIGWRGEMGNRGFLSTGK-LGTRV 367


>gi|336468302|gb|EGO56465.1| hypothetical protein NEUTE1DRAFT_65043 [Neurospora tetrasperma FGSC
           2508]
          Length = 341

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 70/280 (25%), Positives = 120/280 (42%), Gaps = 57/280 (20%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + PL         GED  +   A               ++ D 
Sbjct: 15  HSVSLKVLRLSRPSLVPQFPLHPP-----HGEDAHEAESAGGE------------RTRDG 57

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETF-CSYISINNSSTL---------EV 117
            Y +   +        LS ++ LP +FG+ Y+GETF C+  + +N+  +          +
Sbjct: 58  YYNTEPFI--------LSPIVNLPPSFGSAYVGETFSCTLCANHNAPPIGEGGTSVKKTI 109

Query: 118 RDVVIKAEIQT---DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY 174
           RDV I+AE+Q       +++L DT+    ++  +G     I+   +KE G H L  T  Y
Sbjct: 110 RDVKIEAEMQAPSGQTTKLVLGDTAGD--DNAGSGTTLQKILNFGLKEEGTHVLGVTVSY 167

Query: 175 ---SDGEGERKYLPQFFKFIVSNPLSVRTK------VRVVKVGATHFQEITFLEACIENH 225
              ++  G  +   + ++FI    L VRTK      +  VK G    +    LEA +EN 
Sbjct: 168 YEATETSGRTRAFRKMYQFICKPSLIVRTKAGPLPSLPPVKAGNGKRRRRWVLEAQLENC 227

Query: 226 TKSNLYMDQVEFEPSQ--------NWSATMLKADGPHSDY 257
           ++  + +++ E    Q        NW+   +    P   +
Sbjct: 228 SEDAILLEKAELAEVQRGLKWRDCNWAGIGVGVGPPRRPF 267


>gi|395754144|ref|XP_003779717.1| PREDICTED: LOW QUALITY PROTEIN: UPF0533 protein C5orf44 homolog
           [Pongo abelii]
          Length = 354

 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 147/329 (44%), Gaps = 44/329 (13%)

Query: 106 YISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGA 165
           Y+SI+  S    + ++  A+IQT+   + +L  S + V  + +  R D ++ HD+K    
Sbjct: 52  YMSISKDSNXVAKIILXNADIQTNTXPLHVL-VSMAIVAELVSHCRIDDVI-HDMK---- 105

Query: 166 HTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENH 225
              +C                 F F+  + L  +TK    +      +   FL+  I+N 
Sbjct: 106 ---LC----------------LFSFL--SQLDDKTKFYNSE------KNDLFLKVKIQNT 138

Query: 226 TKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK 285
           + S +++  + F  S   +   L       + + ++   F     ++S  G   YL  ++
Sbjct: 139 SSSTVFIQSISFVSSDMHTGKELNT----VNQDGENECTFGTTTFLQSMEG-RQYLDHVQ 193

Query: 286 MLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSV 345
           +    S     ++G   +GKL I  + NLGE   LQT Q+L  +   + + L++  +P  
Sbjct: 194 LKQKCSVEAGIIKGLREMGKLDIVSKRNLGEMAMLQTIQLLRXSPGHENMRLSLEMIPDS 253

Query: 346 VGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDF 405
           V +++PF +  K TN +D++    ++ L+  D+D    +   G     L  + +  S  F
Sbjct: 254 VXLEEPFHITCKTTNCSDRK---MKLILNMCDTDS---IHWYGSSGRYLGKLLSCSSLCF 307

Query: 406 HLNLIATKLGVQRITGITVFDKLEKITYD 434
              L+  KLG+Q ++GI + DK  + TYD
Sbjct: 308 TXTLLFLKLGLQSVSGIQLTDKSLQKTYD 336


>gi|115482756|ref|NP_001064971.1| Os10g0498800 [Oryza sativa Japonica Group]
 gi|113639580|dbj|BAF26885.1| Os10g0498800, partial [Oryza sativa Japonica Group]
          Length = 64

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 31/63 (49%), Positives = 48/63 (76%)

Query: 384 VMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
           V++NGL+ + L  VEAF S +F L+++AT++GVQ+I+GIT++   EK  Y+ L D+EIFV
Sbjct: 2   VLVNGLQKLVLPLVEAFESINFDLSMVATQVGVQKISGITLYAVQEKKLYEPLSDIEIFV 61

Query: 444 DQD 446
           D +
Sbjct: 62  DAE 64


>gi|452824517|gb|EME31519.1| hypothetical protein Gasu_11950 [Galdieria sulphuraria]
          Length = 461

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 101/464 (21%), Positives = 192/464 (41%), Gaps = 64/464 (13%)

Query: 2   SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTT 61
           +S  GT  L FR+++  RP      P+       FI    ++     S+       +VTT
Sbjct: 13  TSLSGTPKLLFRIIKTERPKPTFHAPIP------FIRPLFYEQVDRKSSYEK--DFEVTT 64

Query: 62  NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
            +SS  T        DS    G++  +     F  IY GE+    + + N+S+ ++  V 
Sbjct: 65  RESSPRT------AEDSC--FGITSNVSHTSNFN-IYRGESVHLTLVLLNASSSDLGFVS 115

Query: 122 IKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
           +   +QT +    LLDT  SP          ++ ++   K +G + L C A Y+D +G+ 
Sbjct: 116 VLVRLQTSEGSYCLLDTQSSPNNIFTTQASLEYNLQFVAKVVGNYALQCFAFYTDVDGQE 175

Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVK-------VGATHFQEITFLEACIENHTKSNLYMDQ 234
             + Q ++F V   L+    +R+V+         + H   +  ++  I N  +  +Y+ +
Sbjct: 176 HTISQSYRFTVHLCLNFIYDIRLVEEETDWEFFASLHPSSVYIVDCFIYNVCQLPVYLHE 235

Query: 235 VEFEPSQNWSATMLKADGPHSDYNAQ--SREIFKPPV---------LIRSGGGIHNYLYQ 283
           V F  S N         G   D N     +++  P V         LI + G    + Y 
Sbjct: 236 VHFLLSDNIGC----ERGSKEDQNPSIIVKDLNIPSVGGEERTNESLILNPGDCQTFTY- 290

Query: 284 LKMLSHGSSSPVKVQGS----NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKE----- 334
             ++      P++ + S    NVLG +  ++    G+      + +L   +T +E     
Sbjct: 291 --LVYSAIEDPLRRKSSSRAKNVLGSIYASFTRFGGD------RVVLDPALTVEEPKMSQ 342

Query: 335 ---IELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRI 391
              + + VV VPS + ++ PF+  +K+ N+T + +  F   + ++       + ++G  +
Sbjct: 343 VSMVTIEVVGVPSKIVVECPFVATMKVVNRTSQSKK-FYFQVRRDKVGSIVPIGVSGRLL 401

Query: 392 MALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDS 435
             L P +   S    + LIA + G   ++G  V D   +  Y++
Sbjct: 402 ETLQPNQ---SCKLDMQLIALEPGAHFLSGFRVVDVESREYYEA 442


>gi|116204863|ref|XP_001228242.1| hypothetical protein CHGG_10315 [Chaetomium globosum CBS 148.51]
 gi|88176443|gb|EAQ83911.1| hypothetical protein CHGG_10315 [Chaetomium globosum CBS 148.51]
          Length = 813

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 96/370 (25%), Positives = 149/370 (40%), Gaps = 83/370 (22%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + P +      F     FD PI  S+ PP+ +S         L
Sbjct: 472 HSVSLKVLRLSRPSLVAQYPFQPP----F--SSPFDGPI--SHQPPIPAS---------L 514

Query: 68  TYRSRFL--LHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISIN------NSSTLE--- 116
            Y S  L  +  +     LS +L LP +FG+ Y+GETF   +  N      N + L    
Sbjct: 515 AYSSNGLNDVPTNPTPFVLSPILNLPPSFGSAYVGETFSCTLCANHDIPDDNPAALAAKT 574

Query: 117 VRDVVIKAEIQTDKQRILLLDTSK---------------------------SPVESIRAG 149
           +RDV I+AE++T      L                                SP ++++  
Sbjct: 575 IRDVRIEAEMKTPSSATALTLPLTPPSPPTPTTTPGDTTTATTETGPGTDLSPHQTLQK- 633

Query: 150 GRYDFIVEHDVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVV- 205
                I+  D+KE G H L  T  Y   S+  G  +   + ++F+    L VRTK   + 
Sbjct: 634 -----ILSFDLKEEGNHVLAVTVSYYEASELSGRTRTFRKLYQFVCKPSLIVRTKPGALP 688

Query: 206 KVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ-------NWSATMLKADGPHSDYN 258
                  +    LEA +EN  K  L +++V  E  +       NW +      G  +   
Sbjct: 689 PADPASGRRRWVLEAQLENCGKEGLMLEKVGLELERGLGYEDCNWESGGGGGTG-GNGGV 747

Query: 259 AQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPG 318
            + R +  P       G      + ++  + G+    +V G    G LQI WR+ +G  G
Sbjct: 748 GRMRPVLLP-------GETEQVCFVIEEDAAGAVE--EVDGRVAFGILQIGWRSEMGNRG 798

Query: 319 RLQTQQILGT 328
            L T + LGT
Sbjct: 799 FLSTGK-LGT 807


>gi|400601500|gb|EJP69143.1| DUF974 domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 408

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 65/206 (31%), Positives = 95/206 (46%), Gaps = 45/206 (21%)

Query: 84  LSGLLVLPQAFGAIYLGETFCSYISINN-----SSTL----EVRDVVIKAEIQT---DKQ 131
           LS +L LP +FG+ Y+GETF   +  NN     SST     ++RDV ++AE++T    K 
Sbjct: 112 LSPILNLPVSFGSAYVGETFSCTLCANNDLDDSSSTATTKRQIRDVRVEAEMKTPGQTKA 171

Query: 132 RILLLDTSKSPVES------------IRAGGRYDFIVEHDVKELGAHTLVCTALY---SD 176
           + L L  + S  ES            +  GG    IV  D+KE G H L  T  Y   ++
Sbjct: 172 QSLELGPAPSSQESAAVGAAAAAATDLAPGGTLQKIVSFDLKEEGNHVLAVTVSYYEAAE 231

Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEIT-----------FLEACIENH 225
             G  +   + ++FI    L VRTKV V+K  A   ++              LEA +EN 
Sbjct: 232 TSGRTRTFRKLYQFICKPSLIVRTKVGVLKAPAPKKKKQQQQQQQPPLRRWVLEAQLENC 291

Query: 226 TKSNLYMDQV--EFEPS-----QNWS 244
           +   + +D+V  E EP       NW+
Sbjct: 292 SDDTMQLDRVVMELEPGLTCRDCNWT 317


>gi|354489772|ref|XP_003507035.1| PREDICTED: UPF0533 protein C5orf44 homolog [Cricetulus griseus]
          Length = 287

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 66/262 (25%), Positives = 119/262 (45%), Gaps = 19/262 (7%)

Query: 183 YLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQN 242
           +L +   F  S PL V+TK           ++  FLE  IEN + S +++ +V  +  + 
Sbjct: 35  FLSKICLFYPSEPLDVKTKF------YNSDKDDLFLEVQIENISHSTVFIREVSLKLPEM 88

Query: 243 WSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
           ++   L       +   +    F     +++  G H YLY L+           + G   
Sbjct: 89  YTEEALNT----LNLEGEDECTFGTRTFLQATEGRH-YLYHLQFKEEYLEKARTLSGLME 143

Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT 362
           +GKL+I W+  LGE   L T  +     +  E++L++ ++P  V  ++PF +  K+TN T
Sbjct: 144 MGKLEIVWKRELGEMPMLHTVPLRREAPSCGELKLSLEKIPDTVAREEPFQITCKITNCT 203

Query: 363 DKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGI 422
           DK+    ++ L   D+   +    +G +   L P     S  F L L+  +LG++ I+GI
Sbjct: 204 DKK---MKLLLKMFDTTSVRWCGCSGRKPGRLKP---GSSLSFTLTLLCLQLGLRSISGI 257

Query: 423 TVFDK--LEKITYDSLPDLEIF 442
            V D   + K  YD + ++ + 
Sbjct: 258 RVIDTTLMTKYRYDDVANVCVL 279


>gi|238491960|ref|XP_002377217.1| DUF974 domain protein [Aspergillus flavus NRRL3357]
 gi|220697630|gb|EED53971.1| DUF974 domain protein [Aspergillus flavus NRRL3357]
          Length = 257

 Score = 65.5 bits (158), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 96/212 (45%), Gaps = 49/212 (23%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + P                 P A + +         +NK+S L
Sbjct: 17  HSVSLKVLRLSRPSLSYQYPF----------------PEANTKI---------SNKAS-L 50

Query: 68  TYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVRDVV 121
           +Y S     DS D+   L+  L LP AFG+ Y+GETF   +S NN      ++  V  V 
Sbjct: 51  SYPS-----DSVDNQFILAPNLTLPPAFGSAYVGETFACTLSANNELAEDETSRVVTSVR 105

Query: 122 IKAEIQTDKQ--RILLLDTSKSPV-ESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD-- 176
           I AE+QT  Q   + L     +P  + ++ G     IV  D+KE G H L  +  Y++  
Sbjct: 106 IVAEMQTPSQVASLELEPADDAPARDGLQKGQSLQKIVRFDLKEEGNHILAVSVSYTETL 165

Query: 177 -------GEGERKYLPQFFKFIVSNPLSVRTK 201
                    G  +   + ++F+    LSVRTK
Sbjct: 166 IGSDSQAASGRVRTFRKLYQFVAQPCLSVRTK 197


>gi|171689020|ref|XP_001909450.1| hypothetical protein [Podospora anserina S mat+]
 gi|170944472|emb|CAP70583.1| unnamed protein product [Podospora anserina S mat+]
          Length = 208

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 49/157 (31%), Positives = 72/157 (45%), Gaps = 17/157 (10%)

Query: 84  LSGLLVLPQAFGAIYLGETFCSYISINNS-----------STLEVRDVVIKAEIQTDKQR 132
           LS +L LP +FG+ Y+G TF   +  N+            S   +RDV I+AE++T    
Sbjct: 44  LSPILALPPSFGSAYVGTTFSCTLCANHDIPPPIDGGPPLSVKTIRDVKIEAEMKTPSSP 103

Query: 133 IL--LLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG---EGERKYLPQF 187
            L  LL         +  GG    IV  D++E GAHTLV    Y +     G  +   + 
Sbjct: 104 TLIPLLPPGNDEGTDLSPGGTLQKIVSFDLREEGAHTLVVQVSYYEATSTSGRARMFRKL 163

Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIEN 224
           ++F+    L VRTK   + +G    +    LEA +EN
Sbjct: 164 YQFVCKGLLVVRTKTSALGLGKQGNRRWV-LEAQVEN 199


>gi|349803503|gb|AEQ17224.1| hypothetical protein [Pipa carvalhoi]
          Length = 122

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 50/87 (57%)

Query: 278 HNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIEL 337
             YLY LK     +     ++G  V+GKL I W+TNLGE GRLQT Q+        ++ L
Sbjct: 5   RQYLYCLKPKPEFAEKAGVIKGVTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRL 64

Query: 338 NVVEVPSVVGIDKPFLLKLKLTNQTDK 364
           ++  +P  V +++PF +  K+TN +++
Sbjct: 65  SIETIPDTVSLEEPFDITCKITNCSER 91


>gi|261197155|ref|XP_002624980.1| DUF974 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239595610|gb|EEQ78191.1| DUF974 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
          Length = 457

 Score = 65.1 bits (157), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 83/327 (25%), Positives = 128/327 (39%), Gaps = 51/327 (15%)

Query: 53  PLISSDVTTNKSSDLTYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYISINN 111
           PL S +      + L+Y S     DS+DS   L   + LP AFG+ Y+GETF   +  NN
Sbjct: 55  PLPSENEKVPLKASLSYPS-----DSSDSQFVLCPNVTLPPAFGSAYVGETFSCSLCANN 109

Query: 112 SSTLE-----VRDVVIKAEIQTDKQRILLLDTSKSPVESIRAG----GRYDFIVEHDVKE 162
              L      V  V I AE+QT  Q ++ L+ S +  +S  +G         IV  D+KE
Sbjct: 110 ELPLYTENRVVSSVRIIAEMQTPSQ-VVSLELSPTGEDSQSSGLAIAQSLQKIVRFDLKE 168

Query: 163 LGAHTLVCTALYSD----------------------GEGERKYLPQFFKFIVSNPLSVRT 200
            G H L  +  Y++                        G  +   + ++FI    LSVRT
Sbjct: 169 EGNHVLAVSVSYTETTLAQRDQEMAPGIGGAGATQAASGRVRTFRKLYQFIAQPCLSVRT 228

Query: 201 KVRVVK-----------VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
           K   +             G T       LEA +EN     + +      P   + +  L 
Sbjct: 229 KATELSPLEVDNRALGPYGKTRLLRYA-LEAQLENVGDGAISLGSTTLNPKPPFKSRSLN 287

Query: 250 ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQIT 309
            D   SD  +       P  +++    +     Q + L  G    +   G  +LG+L I 
Sbjct: 288 WDFERSDSPSVGPPTLNPRDVLQVAFLVEQEHGQQEGL-EGLQKDISRDGRTILGQLSIE 346

Query: 310 WRTNLGEPGRLQTQQILGTTITSKEIE 336
           WR ++G+ G L T  ++     + E+E
Sbjct: 347 WRGSMGDRGFLTTGNLMTKRRLTLELE 373


>gi|346976493|gb|EGY19945.1| hypothetical protein VDAG_01961 [Verticillium dahliae VdLs.17]
          Length = 416

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 86/358 (24%), Positives = 140/358 (39%), Gaps = 77/358 (21%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + P +  P       D    PI AS                 L
Sbjct: 16  HSISLKVLRLSRPSLVTQHPTK--PPQAPAAHDAA--PIPAS-----------------L 54

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----------STLE 116
            Y        + D   L+ +L LP +FG+ Y+GE F   +  N+             T  
Sbjct: 55  AYAPDAAASTNPDPFLLAPILNLPLSFGSAYVGEHFSCTLCANHEPPVSADVAAALPTKR 114

Query: 117 VRDVVIKAEIQTDK-----QRILLLD---------------TSKSPVESIRAGGRYDFIV 156
           +RDV I+AE++T       Q++ L                  +      +  G     IV
Sbjct: 115 IRDVRIEAEMKTPGAQGSVQKLQLTGRASDSSSSSSDPADPAAAKATADLAPGETLQRIV 174

Query: 157 EHDVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVV---KVGA- 209
             D+K+ G H L  T  Y   ++  G  +   + ++FI  + L VRTKV  +     GA 
Sbjct: 175 GFDLKDEGNHVLAVTVSYYEATETSGRTRTFRKLYQFICKSSLIVRTKVGSLPGTPGGAD 234

Query: 210 THFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYN--AQSREIFKP 267
              +    LEA +EN  +  + +++VE +         L+A   ++D N  +  + +  P
Sbjct: 235 GRARRRWVLEAQLENCAEDVVQLERVELD---------LEAGLAYTDCNWGSAGKPVLHP 285

Query: 268 PVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
                  G +    + ++  + G        G  V G L I WR  +G  G L T ++
Sbjct: 286 -------GEVEQVCFVVEETAEGGGLEPGDDGRIVFGVLGIGWRGEMGNRGYLSTGKL 336


>gi|341901898|gb|EGT57833.1| hypothetical protein CAEBREN_19830 [Caenorhabditis brenneri]
          Length = 126

 Score = 64.7 bits (156), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 47/153 (30%), Positives = 74/153 (48%), Gaps = 29/153 (18%)

Query: 15  MRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFL 74
           MRL RP        +  P D F       DP+  +        ++   K S+L+  +R  
Sbjct: 1   MRLARP--------KYAPLDGF-----SHDPVDPTGF-----GEILAGKVSELSKETR-- 40

Query: 75  LHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRIL 134
            HD    + +   L+ PQ F  IYLGETF  Y+++ N S   V +V +K E+QT  QR+ 
Sbjct: 41  -HD----LPIGEYLIAPQMFENIYLGETFTFYVNVVNESESNVVNVCLKCELQTSTQRVA 95

Query: 135 L-LDTSKSPVESIRAGGRYDFIVEHDVKELGAH 166
           L      + +E+ +  G+   ++ H+VKE+G H
Sbjct: 96  LPCSVQDTIIEASKCDGQ---VISHEVKEIGQH 125


>gi|239606593|gb|EEQ83580.1| DUF974 domain-containing protein [Ajellomyces dermatitidis ER-3]
          Length = 367

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 82/323 (25%), Positives = 125/323 (38%), Gaps = 63/323 (19%)

Query: 53  PLISSDVTTNKSSDLTYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYISINN 111
           PL S +      + L+Y S     DS+DS   L   + LP AFG+ Y+GETF   +  NN
Sbjct: 55  PLPSENEKVPLKASLSYPS-----DSSDSQFVLCPNVTLPPAFGSAYVGETFSCSLCANN 109

Query: 112 SSTLE-----VRDVVIKAEIQTDKQRILLLDTSKSPVESIRAG----GRYDFIVEHDVKE 162
              L      V  V I AE+QT  Q ++ L+ S +  +S  +G         IV  D+KE
Sbjct: 110 ELPLYTENRVVSSVRIIAEMQTPSQ-VVSLELSPTGEDSQSSGLAIAQSLQKIVRFDLKE 168

Query: 163 LGAHTLVCTALYSD----------------------GEGERKYLPQFFKFIVSNPLSVRT 200
            G H L  +  Y++                        G  +   + ++FI    LSVRT
Sbjct: 169 EGNHVLAVSVSYTETTLAQRDQEMAPGIGGAGATQAASGRVRTFRKLYQFIAQPCLSVRT 228

Query: 201 KVRVVK-----------VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
           K   +             G T       LEA +EN     + +      P   + +  L 
Sbjct: 229 KATELSPLEVDNRALGPYGKTRLLRYA-LEAQLENVGDGAISLGSTTLNPKPPFKSRSLN 287

Query: 250 ADGPHSDYNA------QSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
            D   SD  +        R++ +   L+    G    L  L+         +   G  +L
Sbjct: 288 WDFERSDSPSVGPPTLNPRDVLQVAFLVEQEHGQQEGLEDLQ-------KDISRDGRTIL 340

Query: 304 GKLQITWRTNLGEPGRLQTQQIL 326
           G+L I WR ++G+ G L T  ++
Sbjct: 341 GQLSIEWRGSMGDRGFLTTGNLM 363


>gi|324530182|gb|ADY49073.1| Unknown [Ascaris suum]
          Length = 194

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/147 (29%), Positives = 76/147 (51%), Gaps = 8/147 (5%)

Query: 299 GSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKL 358
           G   +GKL + WRTN+GE GRLQT  +        ++ L V ++P+   I + F +  +L
Sbjct: 53  GGTSIGKLDMVWRTNMGERGRLQTSALQRMAPGYGDLRLTVEKIPATAKIRQTFEVVCRL 112

Query: 359 TNQTDKEQGPFEIWLSQNDSDEEKVVMI--NGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
            N +++     ++ L+ + S +  +V    +G+++  L P     + DF L L+    G+
Sbjct: 113 HNCSERS---LDLVLTLDGSLQPALVFCTASGVQLGQLPPNN---TVDFTLELLPITPGL 166

Query: 417 QRITGITVFDKLEKITYDSLPDLEIFV 443
           Q I+GI V D   K TY+     ++FV
Sbjct: 167 QPISGIRVSDTFLKRTYEHDDIAQVFV 193


>gi|327357840|gb|EGE86697.1| DUF974 domain-containing protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 367

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 81/317 (25%), Positives = 124/317 (39%), Gaps = 51/317 (16%)

Query: 53  PLISSDVTTNKSSDLTYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYISINN 111
           PL S +      + L+Y S     DS+DS   L   + LP AFG+ Y+GETF   +  NN
Sbjct: 55  PLPSENEKVPLKASLSYPS-----DSSDSQFVLCPNVTLPPAFGSAYVGETFSCSLCANN 109

Query: 112 SSTLE-----VRDVVIKAEIQTDKQRILLLDTSKSPVESIRAG----GRYDFIVEHDVKE 162
              L      V  V I AE+QT  Q ++ L+ S +  +S  +G         IV  D+KE
Sbjct: 110 ELPLYTENRVVSSVRIIAEMQTPSQ-VVSLELSPTGEDSQSSGLAKAQSLQKIVRFDLKE 168

Query: 163 LGAHTLVCTALYSD----------------------GEGERKYLPQFFKFIVSNPLSVRT 200
            G H L  +  Y++                        G  +   + ++FI    LSVRT
Sbjct: 169 EGNHVLAVSVSYTETTLAQRDQEMPPSIGGASATQAASGRVRTFRKLYQFIAQPCLSVRT 228

Query: 201 KVRVVK-----------VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
           K   +             G T       LEA +EN     + +      P   + +  L 
Sbjct: 229 KATELSPLEVDNRALGPYGKTRLLRYA-LEAQLENVGDGAISLGSTTLNPKPPFKSRSLN 287

Query: 250 ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQIT 309
            D   SD  +       P  +++    +     Q + L  G    +   G  +LG+L I 
Sbjct: 288 WDFERSDSPSVGPPTLNPRDVLQVAFLVEQEHGQQEGL-EGLQKDISRDGRTILGQLSIE 346

Query: 310 WRTNLGEPGRLQTQQIL 326
           WR ++G+ G L T  ++
Sbjct: 347 WRGSMGDRGFLTTGNLM 363


>gi|402590101|gb|EJW84032.1| hypothetical protein WUBG_05056 [Wuchereria bancrofti]
          Length = 207

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 57/220 (25%), Positives = 107/220 (48%), Gaps = 20/220 (9%)

Query: 230 LYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSH 289
           + +++V  EPS  + ++ +   G  ++   QS     P         I  YL+ LK  + 
Sbjct: 1   MVLEKVILEPSDFYLSSEISPPGTENETMDQS--YLNP-------SDIRQYLFCLKPKTT 51

Query: 290 GSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGID 349
             S     +G+++ GKL + WRT++GE GRLQT  +        ++ L + ++P+ V   
Sbjct: 52  DYSLNYFRKGTSI-GKLDMVWRTSMGERGRLQTSALQRMAPGYGDLRLTIEKIPATVKAL 110

Query: 350 KPF----LLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVM--INGLRIMALAPVEAFGST 403
           + F     L+L++ N +  E+   ++ L+ +   +  +    I+G+ +  LAP     +T
Sbjct: 111 QSFRMVCRLRLEVMNYSFSERS-LDLVLTLDGKLQPNIAFCSISGVELGQLAPN---STT 166

Query: 404 DFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
           DF + L+    G+Q I+GI V D   + TY+     ++FV
Sbjct: 167 DFSIELLPLTPGLQSISGIRVTDTFLRRTYEHDDIAQVFV 206


>gi|310794613|gb|EFQ30074.1| hypothetical protein GLRG_05218 [Glomerella graminicola M1.001]
          Length = 343

 Score = 63.2 bits (152), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 79/359 (22%), Positives = 141/359 (39%), Gaps = 78/359 (21%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + P+R               P+  S +P   +       ++  
Sbjct: 16  HSVSLKVLRLSRPSLVTQHPIRA--------------PLTPSTVPVDATPASLAYDTTGA 61

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETF----CSYISINNSSTL-------- 115
           T  + F+         LS +L LP +FG+ Y+GE F    C+   + + + L        
Sbjct: 62  TNPAPFI---------LSPILNLPLSFGSAYVGEVFSCTLCANHDVPDPAPLVGPGGQPL 112

Query: 116 -----------EVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGG-----------RYD 153
                       +RDV I+AE++T     +       P  +   GG              
Sbjct: 113 PGGGGGAPKRKSIRDVRIEAEMKTPGANSVQKLELSPPDHAAANGGDAKGTDLGPGDTLQ 172

Query: 154 FIVEHDVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGAT 210
            IV+ D+KE G H L  T  Y   ++  G+ +   + ++FI  + L VRTK+    +GA+
Sbjct: 173 RIVDFDLKEEGNHVLAVTVSYYEATETSGKTRTFRKLYQFICKSSLIVRTKIG--PLGAS 230

Query: 211 HFQEITF----LEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFK 266
             +        +EA +EN ++  + +++V  +     S T    +         +R +  
Sbjct: 231 GGRHGGRRRWAMEAQLENCSEDVIQLEKVVLDLVDGLSYTDCNWEA-----GGGARPVLH 285

Query: 267 PPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
           P       G +    + ++       +     G  + G L I WR  +G  G L T ++
Sbjct: 286 P-------GEVEQVCFVVEEAEGSPRAQPGEDGRIIFGVLGIGWRGEMGNRGFLSTGKL 337


>gi|315056791|ref|XP_003177770.1| DUF974 domain-containing protein [Arthroderma gypseum CBS 118893]
 gi|311339616|gb|EFQ98818.1| DUF974 domain-containing protein [Arthroderma gypseum CBS 118893]
          Length = 347

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 80/315 (25%), Positives = 124/315 (39%), Gaps = 53/315 (16%)

Query: 53  PLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS 112
           PL  SD   +K + L+Y S      S     LS  L LP AFG+ Y+GETF   +S NN 
Sbjct: 40  PLPDSDARVSKLASLSYPS----GTSDPQFILSPNLTLPPAFGSAYVGETFACSLSANNE 95

Query: 113 S----TLEVRDVVIKAEIQTDKQRIL--LLDTSKSPVES--IRAGGRYDFIVEHDVKELG 164
           +    +  V  + ++A++QT  Q I   LL   + P +S    A      I+  D+KE G
Sbjct: 96  ALSGNSRVVTSIRMQADMQTPSQTIPLDLLPEDEEPGKSAGTSAAASVQKIIRFDLKEEG 155

Query: 165 AHTLVCTALYSD---------------GEGERKYLPQFFKFIVSNPLSVRTKV-----RV 204
            H L  +  Y++                 G  +   + ++F+    LSVRTK      R 
Sbjct: 156 NHVLAVSVNYTETTMAPNKDAPNGFQASGGRVRTFRKLYQFVAQPCLSVRTKATELPPRE 215

Query: 205 VK------VGATHFQEITFLEACIENHTKS--NLYMDQVEFEP-----SQNWSATMLKAD 251
           ++       G T       LEA +EN       L +  +  +P     S NW       +
Sbjct: 216 IENRSLGPYGKTRLLRFA-LEAQLENVGDEIIVLGVPTLNSKPPFKSTSLNWDVYEQDGE 274

Query: 252 GPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWR 311
              +      R++ +   L+    G    L         +   +   G   LG+L I W+
Sbjct: 275 QKKASPTLAPRDVIQLAFLVEQEEGQQEGL-------EVTQKDISRDGRTALGQLSIQWQ 327

Query: 312 TNLGEPGRLQTQQIL 326
             +GE G L T  ++
Sbjct: 328 GAMGEKGYLTTGNLM 342


>gi|291407886|ref|XP_002720266.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
          Length = 362

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 69/131 (52%), Gaps = 6/131 (4%)

Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT 362
           LGKL + W+ NL E    QT Q+       + I ++V  +P  V +++PF +  K+TN +
Sbjct: 219 LGKLNVFWKKNLHETAIQQTIQLERDVPHYRSISVSVESMPDKVIVEEPFYMTCKITNFS 278

Query: 363 DKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGI 422
           D++    +++L+  ++D     +  G  +  L P  +       LNL+  K G+QRI+GI
Sbjct: 279 DQK---MKLFLNLCNTDAVHWHLRGGKYLGKLPPRTSLC---LPLNLLFVKQGLQRISGI 332

Query: 423 TVFDKLEKITY 433
            + DK  K TY
Sbjct: 333 QLTDKYTKKTY 343


>gi|296827564|ref|XP_002851189.1| DUF974 domain-containing protein [Arthroderma otae CBS 113480]
 gi|238838743|gb|EEQ28405.1| DUF974 domain-containing protein [Arthroderma otae CBS 113480]
          Length = 342

 Score = 62.0 bits (149), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 69/280 (24%), Positives = 107/280 (38%), Gaps = 49/280 (17%)

Query: 88  LVLPQAFGAIYLGETFCSYISINNSS----TLEVRDVVIKAEIQTDKQRILLL----DTS 139
           L LP AFG+ Y+GETF   +S NN +    +  V  + ++A++QT  Q I L     D  
Sbjct: 66  LTLPPAFGSAYVGETFACSLSANNEALNGNSRVVASIRMQADMQTPSQTIPLELLPPDEE 125

Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD---------------GEGERKYL 184
            S V    A      I+  D+KE G H L  +  Y++                 G  +  
Sbjct: 126 SSQVAGASAANSVQKIIRFDLKEEGNHVLAVSVNYTEILMVPNKDAQSGYQASGGRVRTF 185

Query: 185 PQFFKFIVSNPLSVRTKVRVVK-----------VGATHFQEITFLEACIENHTKSNLYMD 233
            + ++FI    LSVRTK   +             G T       LEA +EN     + + 
Sbjct: 186 RKLYQFIAQPCLSVRTKATELAPREIENRSLGPYGKTRLLRFA-LEAQLENVGDGVIVLG 244

Query: 234 --QVEFEP-----SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKM 286
              +  +P     S NW       +          R++ +   L+    G    L  ++M
Sbjct: 245 VPTLNSKPPFKSTSLNWDFYQRNGERKKDAPTLAPRDVLQIAFLVEQEEGQQEGLEVMQM 304

Query: 287 LSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
                   +   G   LG+L I W+  +GE G L T  ++
Sbjct: 305 -------DISRDGRTSLGQLSIQWQGAMGEKGYLTTGSLM 337


>gi|422293915|gb|EKU21215.1| hypothetical protein NGA_2027510, partial [Nannochloropsis gaditana
           CCMP526]
 gi|422294871|gb|EKU22171.1| hypothetical protein NGA_2027520, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 322

 Score = 61.6 bits (148), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 77/297 (25%), Positives = 118/297 (39%), Gaps = 81/297 (27%)

Query: 98  YLGETFCSYISINNSSTLEV----RDVVIKA----EIQ-----TDKQRILLLDT------ 138
           YLGETFC+Y+SI N+    +        +KA    E+Q       +Q  L+ D       
Sbjct: 1   YLGETFCAYVSIVNTLPFSILLFEAHASLKASRGNEVQLQNTVATRQADLVGDAPPPVPD 60

Query: 139 --------SKSPVESIRAGGRYDFIVEHDVKELGAHTLVC---------TALYSDGEGER 181
                      P+E +R G   D +VEH ++EL  H L           T   + GE  R
Sbjct: 61  QWGGLGVRRDRPLE-LRPGENLDVVVEHVLQELDWHYLAINLELAPTSNTGTRTGGEAPR 119

Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
             + + FKF VSNP+++ T  RV+  G    Q    ++   E HT  NL+++ V F  + 
Sbjct: 120 VMM-KRFKFKVSNPVALTTTQRVLPSGQVLVQ--AQIKNITERHT--NLFLEDVTFLAAD 174

Query: 242 NW--SATMLKADG--------------PHSDYNAQSRE--------IFKPPVLIRSGGGI 277
                A  L  +G              P +   ++ RE         F   V ++    +
Sbjct: 175 RLHSEAVGLAPNGRSALGAMEQWGDRSPEATLPSEERESDPLDCVAAFDRHVYLQPED-V 233

Query: 278 HNYLYQLKMLSHGSSSPVKVQGSNV--------------LGKLQITWRTNLGEPGRL 320
             +LY+L   +  +  P    G                 LG+L+++WRT LGE G L
Sbjct: 234 AQFLYRLSYRAEDTRGPPDQDGMQASSPVARTTLSTGTPLGQLRVSWRTTLGESGTL 290


>gi|326484145|gb|EGE08155.1| DUF974 domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 337

 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 63/221 (28%), Positives = 91/221 (41%), Gaps = 56/221 (25%)

Query: 5   PGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
           P T +L   V RL RPSL ++ P+ V                          SD   ++ 
Sbjct: 24  PATDAL---VHRLSRPSLSLQHPIPV--------------------------SDAQFSRI 54

Query: 65  SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSS----TLEVRDV 120
           + L+Y S      S     LS  L LP +FG  Y+GETF   +S NN +    +  V  V
Sbjct: 55  ASLSYPS----ATSDSQFILSPNLTLPPSFGTAYVGETFACSLSANNEALGGNSRVVTSV 110

Query: 121 VIKAEIQTDKQRIL--LLDTSKSPVES--IRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
            I+A++QT  Q I   LL T + P +S    A      I+  D+KE G H L  +  Y++
Sbjct: 111 RIQADMQTPSQTIPLELLPTGEEPAKSAGTSATASIQKIIHFDLKEEGNHVLAVSVNYTE 170

Query: 177 ---------------GEGERKYLPQFFKFIVSNPLSVRTKV 202
                            G  +   + ++F+    LSVRTK 
Sbjct: 171 TMMAPNKDAASGFQASGGRARTFRKLYQFVAQPCLSVRTKA 211


>gi|401881502|gb|EJT45801.1| hypothetical protein A1Q1_05714 [Trichosporon asahii var. asahii
           CBS 2479]
          Length = 885

 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/190 (22%), Positives = 86/190 (45%), Gaps = 23/190 (12%)

Query: 94  FGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILL------------LDTSKS 141
           +G   LGE   + + ++N+S   V  V +  EIQ+   R+ L            +D S++
Sbjct: 350 YGQASLGEKLKASVRLHNTSNAPVYGVKMMMEIQSPSGRVRLGEVVHGGERPEGMDPSQA 409

Query: 142 PVES------IRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNP 195
              +      +  G   +   EH++ ELG H L+C+  + + EG R+   +F KF  + P
Sbjct: 410 ETRAWNELPQLAPGEGVELKGEHELAELGLHILICSVAW-ETEGGRRTFQRFLKFTAALP 468

Query: 196 LSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKAD 251
           L+++T+V       T      +   +LE  ++N +   + +   + +     +A  + + 
Sbjct: 469 LAIKTRVITPSAPNTALDADKRGDVYLEVLMQNTSPVAMRLQSADLDAVTGMTARSISSP 528

Query: 252 GPHSDYNAQS 261
            P ++ +A+S
Sbjct: 529 DPDTEVDARS 538


>gi|154315960|ref|XP_001557302.1| hypothetical protein BC1G_04552 [Botryotinia fuckeliana B05.10]
 gi|347842101|emb|CCD56673.1| similar to DUF974 domain-containing protein [Botryotinia
           fuckeliana]
          Length = 376

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 93/389 (23%), Positives = 149/389 (38%), Gaps = 99/389 (25%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL ++                   P    +LP           S+ L
Sbjct: 17  HSVSLKVLRLSRPSLSIQ----------HPLPTPSPSPPLNLSLP---------APSASL 57

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN---------------- 111
           +Y S      +  +  LS LL LP AFG+ Y+GETF   +  NN                
Sbjct: 58  SYPS-----PTPSNFILSPLLTLPPAFGSAYVGETFSCTLCANNELPSPISQPAQTHTSP 112

Query: 112 ------SSTLEVRDVVIKAEIQ---TDKQRILLLDTSKSPVE------------SIRAGG 150
                 +S   + ++ + AE++   T    +L L   +SP +             I +  
Sbjct: 113 DIATSANSNKIISNITLTAEMKIPSTPTPILLPLSGPESPPQVSTTSDEETPEAQITSQT 172

Query: 151 RYDFIVEHDVKELGAHTLVCTALYSDGEGER----KYLPQFFKFIVSNPLSVRTKVRVVK 206
               ++  D+KE G+H L  T  Y++         +   + ++FI    L VRT     K
Sbjct: 173 SLQKVLHFDLKEEGSHVLAVTVTYTESSPSSPPRTRTFRKLYQFICKGCLVVRT-----K 227

Query: 207 VGATHFQEITF----------LEACIENHTKSN-LYMDQVEFEPSQNWSATMLKADGPHS 255
           +G   FQ+ T           LEA +EN T+ N + +  V    ++ + AT L  +   S
Sbjct: 228 IGPLPFQKSTLSNVSSSKKYALEAQLENITEDNPITLTLVHLATTKGFKATSLNWEIVVS 287

Query: 256 DY---NAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK-----------VQGSN 301
           D    N    E+ +P   + + G I    + ++    G    V            + G  
Sbjct: 288 DSEKENGGDVELERP---VLAPGDIRQVCFLVEEKVPGDDGEVADSVEGGKESEIIDGRL 344

Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTI 330
           + G L I WR  +G  G L T   LGT +
Sbjct: 345 IFGVLSIGWRGAMGNKGFLSTGN-LGTRV 372


>gi|83769293|dbj|BAE59430.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 291

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 75/303 (24%), Positives = 119/303 (39%), Gaps = 66/303 (21%)

Query: 53  PLISSDVTTNKSSDLTYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYISINN 111
           P   ++   +  + L+Y S     DS D+   L+  L LP AFG+ Y+GETF   +S NN
Sbjct: 21  PFPEANTKISNKASLSYPS-----DSVDNQFILAPNLTLPPAFGSAYVGETFACTLSANN 75

Query: 112 -----SSTLEVRDVVIKAEIQTDKQ--RILLLDTSKSPV-ESIRAGGRYDFIVEHDVKEL 163
                 ++  V  V I AE+QT  Q   + L     +P  + ++ G     IV  D+KE 
Sbjct: 76  ELAEDETSRVVTSVRIVAEMQTPSQVASLELEPADDAPARDGLQKGQSLQKIVRFDLKEE 135

Query: 164 GAHTLVCTALYSD---------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK-------- 206
           G H L  +  Y++           G  +   + ++F+    LSVRTK   +         
Sbjct: 136 GNHILAVSVSYTETLIGSDSQAASGRVRTFRKLYQFVAQPCLSVRTKSSELSPLEVENKS 195

Query: 207 ---VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSRE 263
               G T       LEA +EN   S +    +    ++    T ++ +G     +A  ++
Sbjct: 196 LGPYGKTRLLRFA-LEAQLENVDFSLILGTLMLSIANETEPQTPVQEEGQQEGLDALQKD 254

Query: 264 IFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQ 323
           +                               K  G  VLG+L I WR  +G+ G L T 
Sbjct: 255 L-------------------------------KHDGRAVLGQLSIEWRGTMGDKGFLTTG 283

Query: 324 QIL 326
            +L
Sbjct: 284 NLL 286


>gi|429863211|gb|ELA37718.1| duf974 domain-containing protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 387

 Score = 59.3 bits (142), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 76/312 (24%), Positives = 126/312 (40%), Gaps = 36/312 (11%)

Query: 32  PTDL-------FIGEDIFDDP--IAASNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSI 82
           P+DL       +   D   +P  ++   LPP     VTT   S L Y +    + +    
Sbjct: 93  PSDLVNMSHQRYPSHDPLKEPHSVSLKALPP-----VTTPAPSSLAYDTPAATNPAP--F 145

Query: 83  GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLL---DTS 139
            LS +L LP +FG+ Y+GE F   +  N+  TLE      +      K  +      D +
Sbjct: 146 LLSPILNLPLSFGSAYVGEVFSCTLCANH-DTLEPPPGPKRKGGAVQKLELTPADPDDAA 204

Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPL 196
           +     +  G     IV  D+KE G H L  T  Y   ++  G+ +   + ++FI  + L
Sbjct: 205 EGKGTDLEPGETLQRIVNFDLKEEGNHVLAVTVSYYEATETSGKTRTFRKLYQFICKSSL 264

Query: 197 SVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSD 256
            VRTK+  +  G         LEA +EN ++  + +++V  +  +    T    D    +
Sbjct: 265 IVRTKIGPLASGKNGGARKWVLEAQLENCSEDVIQLEKVLIDLEEGLGYT----DCNWEE 320

Query: 257 YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGE 316
               +R +  P       G +    + +       + P +  G  + G L I WR  +G 
Sbjct: 321 GGGVARPVLHP-------GEVEQVCFVVTEADGAHAEPGE-DGRIMFGVLGIGWRGEMGN 372

Query: 317 PGRLQTQQILGT 328
            G L T + LGT
Sbjct: 373 RGFLSTGK-LGT 383


>gi|312071429|ref|XP_003138604.1| hypothetical protein LOAG_03019 [Loa loa]
          Length = 145

 Score = 58.5 bits (140), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 45/160 (28%), Positives = 67/160 (41%), Gaps = 27/160 (16%)

Query: 10  LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
           L  +VMRL RP  +    + +D  D               +   LI S +          
Sbjct: 10  LTLKVMRLARPKFYENMCIPIDSAD---------------STSQLIGSALC--------- 45

Query: 70  RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
             R    ++AD I +   L+ PQ F  IYLGETF  ++ + N S     D+ IK ++QT 
Sbjct: 46  --RLTGQEAAD-IPIGKYLMAPQKFENIYLGETFSFFVCVQNISDKVAMDICIKTDLQTT 102

Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLV 169
            QR  L    +     +  G     I+ H++KE+G H  V
Sbjct: 103 SQRNALPSQLQEANAVLEPGKCLGEIITHEIKEIGQHMYV 142


>gi|406696508|gb|EKC99793.1| hypothetical protein A1Q2_05872 [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 885

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/190 (21%), Positives = 86/190 (45%), Gaps = 23/190 (12%)

Query: 94  FGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILL------------LDTSKS 141
           +G   LGE   + + ++++S   V  V +  E+Q+   R+ L            +D S++
Sbjct: 350 YGQASLGEKLKASVRLHDTSNAPVYGVKMMMEVQSPSGRVRLGEVVHGGERPEGMDPSQA 409

Query: 142 PVES------IRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNP 195
              +      +  G   +   EH++ ELG H L+C+  + + EG R+   +F KF  + P
Sbjct: 410 ETRAWNELPQLAPGEGVELKGEHELAELGLHILICSVAW-ETEGGRRTFQRFLKFTAALP 468

Query: 196 LSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKAD 251
           L+++T+V       T      +   +LE  ++N +   + +   + +     +A  + + 
Sbjct: 469 LAIKTRVITPSAPNTALDADKRGDVYLEVLMQNTSPVAMRLRSADLDAVTGMTARSISSP 528

Query: 252 GPHSDYNAQS 261
            P ++ +A+S
Sbjct: 529 DPDTEVDARS 538


>gi|67609511|ref|XP_667022.1| hypothetical protein [Cryptosporidium hominis TU502]
 gi|54658115|gb|EAL36797.1| hypothetical protein Chro.80422 [Cryptosporidium hominis]
          Length = 299

 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 73/304 (24%), Positives = 133/304 (43%), Gaps = 20/304 (6%)

Query: 135 LLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSN 194
           +L  ++     I  G   D +V+  V E+G ++L C   ++  E  R    + +KF V +
Sbjct: 1   MLYNNEDNYSDIDIGDSLDIVVKERVDEVGLYSLTCQLFFTSNEA-RLTQKKSYKFAVLS 59

Query: 195 PLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPH 254
           P ++  ++  +  GA   + I F+E  +EN +  ++ +  ++ EP        L  +   
Sbjct: 60  PFNISHRLYNLDEGAMDKKTI-FVEVSLENISHQSITLSSMKLEPINIKKLPELIFE--L 116

Query: 255 SDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG-KLQITWRTN 313
            D N +++     P+ I+     +N +++    S G  + +      VL  KL+I W + 
Sbjct: 117 EDVNLKNKH--NEPLYIQPRCK-YNKIFKFTFRSRGEYNNLGTSSREVLELKLRIGWISV 173

Query: 314 LGEPGRLQTQQILGTTITSKEIELN--------VVEVPSVVGIDKPFLLKLKLTNQTDKE 365
               G L + +I    I   + +LN          E+PSV    + F + L +TN    +
Sbjct: 174 SYGDGWLDSYKI-DLPILCDQNKLNKEKNAIILKAELPSVNNRQEEFKVFLYVTNNLSID 232

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
           Q    I L   D D+   ++I G   + L  ++A  +    L+  A   GV  + GI VF
Sbjct: 233 QKGVSIRL---DFDQLLPIIILGNDRLYLEELKAGETVTLELDCQALVSGVYNLNGIYVF 289

Query: 426 DKLE 429
           D+LE
Sbjct: 290 DELE 293


>gi|169604758|ref|XP_001795800.1| hypothetical protein SNOG_05395 [Phaeosphaeria nodorum SN15]
 gi|160706634|gb|EAT87786.2| hypothetical protein SNOG_05395 [Phaeosphaeria nodorum SN15]
          Length = 294

 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 95/211 (45%), Gaps = 40/211 (18%)

Query: 56  SSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS--- 112
           S D+  +  + L Y S+    DS     LS +L LP+AFG+ Y+GETF   +  NN    
Sbjct: 45  SQDLGISPKASLAYPSQ---DDSNSRFLLSPVLNLPEAFGSAYVGETFSCTLCANNELDA 101

Query: 113 --STLEVRDVVIKAEIQTDKQRILLLDTSKSPVE------------SIRAGGRYDFIVEH 158
             +T  V  V I+ ++QT        + + SP++            S   G     I+  
Sbjct: 102 ADTTRAVSGVRIQGDMQTPS------NPAGSPLDLTGSLEDGEDAVSPGPGESLQRILRF 155

Query: 159 DVKELGAHTLVCTALYSD---GEGER-----KYLPQFFKFIVSNPLSVRTKVRVVKV--G 208
           ++KE G H L  T  Y++   GEG+      +   + ++F+    LSVRTK   +    G
Sbjct: 156 ELKEDGNHVLAVTVTYTETALGEGKAASGRVRTFRKLYQFVAQQLLSVRTKAGELTQPNG 215

Query: 209 ATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
            + +     LEA +EN  ++ + ++  +  P
Sbjct: 216 PSKY----LLEAQLENMGEAAVCLEVRDLFP 242


>gi|295665813|ref|XP_002793457.1| DUF974 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
 gi|226277751|gb|EEH33317.1| DUF974 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
          Length = 343

 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 72/284 (25%), Positives = 110/284 (38%), Gaps = 56/284 (19%)

Query: 88  LVLPQAFGAIYLGETFCSYISIN-----NSSTLEVRDVVIKAEIQTDKQRILL-LDTSKS 141
           + LP AFG+ Y+GETF   +  N     +S    V  V I AE+QT  Q ++L L  S  
Sbjct: 67  VTLPPAFGSAYVGETFSCSLCANSELLPDSENRIVSSVRIIAEMQTPSQNVVLELFPSG- 125

Query: 142 PVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYSDG-EGERKYLP---------- 185
             E   +GG         IV  D+KE G H L  +  Y++    + + +P          
Sbjct: 126 --EDSNSGGLTKSQSLQKIVRFDLKEEGNHVLAVSVSYTETIMAQAREMPSSGDTQAASW 183

Query: 186 ------QFFKFIVSNPLSVRTKVRVVK-----------VGATHFQEITFLEACIENHTKS 228
                 + ++FI    L+VRTKV  +             G T       LEA +EN    
Sbjct: 184 RVRTFRKLYQFIAQPCLNVRTKVTELAPLEADNRAFDPYGKTRLLRY-VLEAQLENIGDG 242

Query: 229 NLYMDQVEFEPSQNWSATMLKAD--GPHS----DYNAQSREIFKPPVLIRSGGGIHNYLY 282
            + +      P   + +  L  D   P+S          R++ +   L+    G    L 
Sbjct: 243 AISLGSTTLNPKPPFQSRSLNWDLEQPNSLEMRPLTLSPRDVLQVAFLVEREPGQQEGL- 301

Query: 283 QLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
                  G    +   G   LG+L I WR ++G+ G L T  ++
Sbjct: 302 ------EGLQKDMSRDGRTTLGQLSIEWRGSMGDRGFLTTGNLM 339


>gi|380094878|emb|CCC07380.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 425

 Score = 53.1 bits (126), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 99/415 (23%), Positives = 159/415 (38%), Gaps = 104/415 (25%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLR--VDPTDLFIGEDIFDDPIAAS---------NLPPLIS 56
           HS++ +V+RL RPSL  + PL+  V P  L         P+A           +LPPL +
Sbjct: 15  HSVSLKVLRLSRPSLVPQFPLQPPVIPQSL-------TSPVAGPAPAVLLQPRHLPPLPA 67

Query: 57  S-------------DVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETF 103
           S             + +    S    R+R   +++   I LS ++ LP +FG+ Y+GETF
Sbjct: 68  SLAYSPLSPIKKYEEGSQGAESGGGERTRDGYYNTEPFI-LSPIVNLPPSFGSAYVGETF 126

Query: 104 -CSYI----------SINNSSTLEVRDVVIKAEIQT---DKQRILLLDTS---------- 139
            C+            S+ N     +RDV I+AE+QT      +++L+DT+          
Sbjct: 127 SCTLCANHNAPPIGESVTNGVKKTIRDVKIEAEMQTPSGQSTKLVLVDTAGDDNAGSSNM 186

Query: 140 -KSPVESIRAGGRYDF---------------------------IVEHDVKELGAHTLVCT 171
               V    AG   +                            I+   +KE G H L  T
Sbjct: 187 DNDNVAISNAGNEDNNNTTETTPTETETVATLDLLPSYTTLQKILNFGLKEEGTHVLGVT 246

Query: 172 ALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKV--GATHFQEITFLEACIENHT 226
             Y   ++  G  +   + ++FI    L VRTK   +    G T  +    LEA +EN +
Sbjct: 247 VSYYEATETSGRTRAFRKMYQFICKPSLIVRTKAGPLPSLPGKTKRRRW-VLEAQLENCS 305

Query: 227 KSNLYMDQVEFEPSQ--------NWSATMLKADGPHSDY--NAQSREIFKPPVLIRSGGG 276
           +  + +++V+    Q        NW+       G   +        +   PP       G
Sbjct: 306 EDAILLEKVKLAEVQRGLKWRDCNWAGIGATTTGEEGNRISQQGQGQGQGPPRRPFLHPG 365

Query: 277 IHNYLYQLKMLSHGSSSPVKVQ---GSNVLGKLQITWRTNLGEPGRLQTQQILGT 328
               L  +    +G     +V+   G    G + + WRT +G  G L T + LGT
Sbjct: 366 ESEQLCFIIEEKNGEEDAAEVEEKDGRIEFGVMALAWRTEMGNRGSLLTLK-LGT 419


>gi|405117419|gb|AFR92194.1| hypothetical protein CNAG_00056 [Cryptococcus neoformans var.
           grubii H99]
          Length = 674

 Score = 52.0 bits (123), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 43/172 (25%), Positives = 79/172 (45%), Gaps = 24/172 (13%)

Query: 91  PQAFGAIYLGETFCSYISINN--SSTLEVRDVVIKAEIQTDKQRILL--------LDTS- 139
           P  FG+I LG      +S+ N       V  V +  E+Q+   R  L         DTS 
Sbjct: 53  PSPFGSIPLGSKLDLRVSLENVHRQRYGVHGVRMMVEVQSASGRARLGEAIHGQISDTSS 112

Query: 140 --------KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFI 191
                   +S +  ++ G   +  VE ++K+LG   ++ +  +   +G RK   +FFKF 
Sbjct: 113 EQPLQEGQESQLPELKFGEMVELGVESEMKDLGLGVVIVSVAWETLDG-RKTFQRFFKFN 171

Query: 192 VSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEP 239
           +  PL ++T+V++     + F    +E T+LE  ++N +  ++ +  +  EP
Sbjct: 172 IITPLGIKTRVQIPSHPNSTFSLSLRERTYLEVFMQNTSLESMLISGISLEP 223


>gi|321250597|ref|XP_003191861.1| hypothetical protein CGB_B0480W [Cryptococcus gattii WM276]
 gi|317458329|gb|ADV20074.1| Hypothetical Protein CGB_B0480W [Cryptococcus gattii WM276]
          Length = 671

 Score = 52.0 bits (123), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 78/172 (45%), Gaps = 24/172 (13%)

Query: 91  PQAFGAIYLGETFCSYISINNSSTLE--VRDVVIKAEIQTDKQRILL--------LDTSK 140
           P  FG+I LG      I + N       +  V +  E+Q+   R+ L         DT+ 
Sbjct: 53  PPPFGSIPLGSKLDFRIGLENVHRQRHGMHGVRMMVEVQSGSGRVRLGEAIHGQMSDTTG 112

Query: 141 SP---------VESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFI 191
            P         +  ++ G   +  VE ++K+LG   ++ +  +   +G RK L +FFKF 
Sbjct: 113 EPPLQGGQESQLPELKFGEMVELEVESEMKDLGLGVVIVSVAWETLDG-RKTLQRFFKFN 171

Query: 192 VSNPLSVRTKVRVV----KVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
           +  PL ++T+V++        +   +E T+LE  ++N +  ++ +  +  EP
Sbjct: 172 IITPLGIKTRVQIPSHPNSTLSLSLREQTYLEVFMQNASLESMLISGISLEP 223


>gi|67528320|ref|XP_661962.1| hypothetical protein AN4358.2 [Aspergillus nidulans FGSC A4]
 gi|40741329|gb|EAA60519.1| hypothetical protein AN4358.2 [Aspergillus nidulans FGSC A4]
 gi|259482832|tpe|CBF77688.1| TPA: DUF974 domain protein (AFU_orthologue; AFUA_4G06560)
           [Aspergillus nidulans FGSC A4]
          Length = 267

 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/246 (24%), Positives = 96/246 (39%), Gaps = 38/246 (15%)

Query: 110 NNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPV---ESIRAGGRYDFIVEHDVKELGAH 166
           ++ +T  +  V I AE+QT  Q +  LD   S     + ++ G     IV  D+KE G H
Sbjct: 26  SDDTTRVITSVRIVAEMQTPSQ-VSSLDLEPSDTNANDGLQKGQSLQKIVRFDLKEEGNH 84

Query: 167 TLVCTALYSD---------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK----------- 206
            L  +  Y++           G  +   + ++F+    LSVRTK   +            
Sbjct: 85  ILAVSVSYTETMIGNDFQAASGRVRTFRKLYQFVAQPCLSVRTKSSELAPLEVDNKSLGP 144

Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNA------Q 260
            G T       LEA +EN     + + Q    P   + A  L  D    D          
Sbjct: 145 YGKTRLLRFA-LEAQLENVGDGAVVIKQTCLNPKAPFKAISLNWDLERPDQAETPPPILN 203

Query: 261 SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRL 320
            R++ +   L+    G    L  L+         ++  G  VLG+L I WR+++G+ G L
Sbjct: 204 PRDVLQVAFLVEQEEGQQEGLEALQ-------KDLRRDGRAVLGQLSIEWRSSMGDKGFL 256

Query: 321 QTQQIL 326
            T  +L
Sbjct: 257 TTGNLL 262


>gi|392572585|gb|EIW65730.1| hypothetical protein TREMEDRAFT_74899 [Tremella mesenterica DSM
           1558]
          Length = 753

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/177 (26%), Positives = 82/177 (46%), Gaps = 27/177 (15%)

Query: 95  GAIYLGETFCSYISINNSSTL--EVRDVVIKAEIQTDKQRILL---LDTSKSPV------ 143
           G + LG      + + NS     +V  V +  EIQ+   +  L   +  + SPV      
Sbjct: 60  GVVSLGSPLSLGLQLRNSHVQKHDVLGVRMMVEIQSPSIKTRLGEVIHRTSSPVDKSDLE 119

Query: 144 ---ESIRAGG----RYDFIVEHD----VKELGAHTLVCTALYSDGEGERKYLPQFFKFIV 192
              ES  + G    +YD  V  D    +KELG H ++C+  +   +G RK   +F++F V
Sbjct: 120 NVTESEESTGFSVLKYDEAVNLDSVCEMKELGNHMIICSVAWETLDG-RKTFQRFYRFTV 178

Query: 193 SNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
           + PL+++T+V+  +          +E  +LE  ++N +K  +  D+V  E  Q  +A
Sbjct: 179 NPPLAMKTRVKPPQSSNLLLNPLRREDVYLEILMQNVSKEGILFDKVLLEAVQGLTA 235


>gi|66360596|ref|XP_627257.1| DM-LD37668p  [Cryptosporidium parvum Iowa II]
 gi|46228846|gb|EAK89716.1| predicted DM-LD37668p [Cryptosporidium parvum Iowa II]
          Length = 308

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 73/312 (23%), Positives = 136/312 (43%), Gaps = 21/312 (6%)

Query: 126 IQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP 185
           + T K+ IL    ++     I  G   D +V+  V E+G ++L C   ++  E  R    
Sbjct: 4   VGTKKRHILY--NNEDNYSDIDIGDSLDIVVKERVDEVGLYSLTCQLFFTSNEA-RLTQK 60

Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
           + +KF V +P ++  ++  +    T  ++  F+E  +EN +  ++ +  ++ EP      
Sbjct: 61  KSYKFAVLSPFNISHRLYNLD-EDTMDKKTIFVEVSLENVSHQSITLSSMKLEPINIKKL 119

Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
             L  +    D N +++     P+ I+     +N +++    S   ++  K     +  K
Sbjct: 120 PELIFE--LEDVNLKNKH--NEPLYIQPRCK-YNKIFKFTSCSREYNNLGKSSREVLELK 174

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELN--------VVEVPSVVGIDKPFLLKLK 357
           L+I W +     G L + +I G  I   + +LN          E+PSV    + F + L 
Sbjct: 175 LRIGWVSVSYGDGWLDSYKI-GLPILCDQNKLNKEKNAIILKAELPSVNNRQEEFKVFLY 233

Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
           +TN    +Q    I L   D D+   ++I G   + L  ++A  +    L+  A   GV 
Sbjct: 234 VTNNLSIDQKGMSIRL---DFDQLLPIIILGNDRLYLEELKAGETVTLELDCQALVSGVY 290

Query: 418 RITGITVFDKLE 429
            + GI VFD+LE
Sbjct: 291 NLNGIYVFDELE 302


>gi|225683676|gb|EEH21960.1| UDP-glucoronosyl and UDP-glucosyl transferase family protein
           [Paracoccidioides brasiliensis Pb03]
          Length = 945

 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 63/213 (29%), Positives = 93/213 (43%), Gaps = 57/213 (26%)

Query: 16  RLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFLL 75
           RL RPSL  + PL   P++    E+I   P+ AS   P  SSD            ++F+L
Sbjct: 38  RLSRPSLSFQYPL---PSE---NENI---PVKASLSFPSDSSD------------NQFIL 76

Query: 76  HDSADSIGLSGLLVLPQAFGAIYLGETFCSYISIN-----NSSTLEVRDVVIKAEIQTDK 130
             +         + LP AFG+ Y+GETF   +  N     +S    V  V I AE+QT  
Sbjct: 77  SPN---------VTLPPAFGSAYVGETFSCSLCANSELLPDSDNRVVSSVRIIAEMQTPS 127

Query: 131 QRILLLDTSKSPVESIRAG----GRYDFIVEHDVKELGAHTLVCTALYSDG-EGERKYLP 185
           Q + +L+ S S  +S   G         IV  D+KE G H L  +  Y++    + + +P
Sbjct: 128 QNV-VLELSPSGEDSHSGGLTKSQSLQKIVRFDLKEEGNHVLAVSVSYTETIMAQAREMP 186

Query: 186 ----------------QFFKFIVSNPLSVRTKV 202
                           + ++FI    L+VRTKV
Sbjct: 187 SSGDTQAASWRVRTFRKLYQFIAQPCLNVRTKV 219


>gi|58258123|ref|XP_566474.1| hypothetical protein [Cryptococcus neoformans var. neoformans
           JEC21]
 gi|134106063|ref|XP_778042.1| hypothetical protein CNBA0450 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50260745|gb|EAL23395.1| hypothetical protein CNBA0450 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57222611|gb|AAW40655.1| expressed protein [Cryptococcus neoformans var. neoformans JEC21]
          Length = 674

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 78/172 (45%), Gaps = 24/172 (13%)

Query: 91  PQAFGAIYLGETFCSYISINN--SSTLEVRDVVIKAEIQTDKQRILL--------LDTS- 139
           P  FG+I LG      + + N       V  V +  E+Q+   R+ L         DTS 
Sbjct: 53  PPPFGSIPLGSKLDLRVGLENVHRQRYGVHGVRMMVEVQSASGRVRLGEAIHGQISDTSS 112

Query: 140 --------KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFI 191
                   +S +  ++ G   +  VE ++K+LG   ++ +  +   +G RK   +FFKF 
Sbjct: 113 EQPLQEGQESQLPELKFGEMVELGVESEMKDLGLGVVIVSVAWETLDG-RKTFQRFFKFN 171

Query: 192 VSNPLSVRTKVRVV----KVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
           +  PL ++T+V++        +   +E T+LE  ++N +  ++ +  +  EP
Sbjct: 172 IITPLGIKTRVQIPSHPNSTLSLSLREQTYLEVFMQNTSLESMLISGISLEP 223


>gi|261331369|emb|CBH14363.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 541

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 58/132 (43%), Gaps = 2/132 (1%)

Query: 53  PLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS 112
           PL    VT  +S D     R          G+S +L LP   G  ++G+ F + +S +N+
Sbjct: 72  PLHHPLVTVKQSGDPLVSQRRSEAARLAMQGVSSVLSLPSVVGKHFVGQPFRAILSFHNA 131

Query: 113 STLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTA 172
           +   +   VI+  I T   R + L   + P  +I A G   F VEH +   G +TL   A
Sbjct: 132 AAYPLTTAVIRINIVTPSVRHVTLVNHECP--AIEARGNVSFTVEHLLSSPGQYTLSVVA 189

Query: 173 LYSDGEGERKYL 184
              D   E+K L
Sbjct: 190 TCVDVVKEQKRL 201


>gi|71745036|ref|XP_827148.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70831313|gb|EAN76818.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 541

 Score = 49.3 bits (116), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 58/132 (43%), Gaps = 2/132 (1%)

Query: 53  PLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS 112
           PL    VT  +S D     R          G+S +L LP   G  ++G+ F + +S +N+
Sbjct: 72  PLHHPLVTVKQSGDPLVSQRRSEAARLAMQGVSSVLSLPSVVGKHFVGQPFRAILSFHNA 131

Query: 113 STLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTA 172
           +   +   VI+  I T   R + L   + P  +I A G   F VEH +   G +TL   A
Sbjct: 132 AAYPLTTAVIRINIVTPSVRHVTLVNHECP--AIEARGNVSFTVEHLLSSPGQYTLSVVA 189

Query: 173 LYSDGEGERKYL 184
              D   E+K L
Sbjct: 190 TCVDVVKEQKRL 201


>gi|350632010|gb|EHA20378.1| hypothetical protein ASPNIDRAFT_44305 [Aspergillus niger ATCC 1015]
          Length = 258

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 61/240 (25%), Positives = 94/240 (39%), Gaps = 39/240 (16%)

Query: 117 VRDVVIKAEIQTDKQRILLLD----TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTA 172
           V  V I AE+QT  Q +  LD       +  + ++ G     IV  D+KE G H L  + 
Sbjct: 23  VTSVRIVAEMQTPSQ-VAALDLEPAEDTASKDGVQKGHSLQKIVRFDLKEEGNHILAVSV 81

Query: 173 LYSD---------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK-----------VGATHF 212
            Y++           G  +   + ++F+    LSVRTK   +             G T  
Sbjct: 82  SYTETLIGSDAQAASGRVRTFRKLYQFVAQPCLSVRTKSSELAPLEVENKTLGPYGKTRL 141

Query: 213 QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKAD--GP-HSDYNAQS---REIFK 266
                LEA +EN     + + Q    P   + A  L  D  GP  +D    +   R++ +
Sbjct: 142 LRFA-LEAQLENVGDGPVVVKQTRLNPKPPFKAVSLNWDLQGPDQADPRPPTLHPRDVLQ 200

Query: 267 PPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
              L+    G    L  L+         +K  G  VLG+L I WR  +G+ G L T  ++
Sbjct: 201 VAFLVEQEEGQQEGLETLQ-------KDMKRDGRAVLGQLSIEWRGAMGDKGFLTTGNLM 253


>gi|70994786|ref|XP_752170.1| DUF974 domain protein [Aspergillus fumigatus Af293]
 gi|66849804|gb|EAL90132.1| DUF974 domain protein [Aspergillus fumigatus Af293]
 gi|159124916|gb|EDP50033.1| DUF974 domain protein [Aspergillus fumigatus A1163]
          Length = 227

 Score = 48.9 bits (115), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 52/212 (24%), Positives = 81/212 (38%), Gaps = 38/212 (17%)

Query: 143 VESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD---------GEGERKYLPQFFKFIVS 193
            E ++ G     IV  D+KE G H L  +  Y++           G  +   + ++F+  
Sbjct: 21  TEGLQRGQSLQKIVRFDLKEEGNHILAVSISYTETLIGSDAQAASGRVRTFRKLYQFVAQ 80

Query: 194 NPLSVRTKVRVVK-----------VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQN 242
             LSVRTK   +             G T       LEA +EN     + + Q +  P   
Sbjct: 81  PCLSVRTKSSELAPLEVENKSLGPYGKTRLLRFA-LEAQLENVGDGTVVVKQTKLNPKPP 139

Query: 243 WSATML--------KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSP 294
           + A  L        KAD      N   R++ +   L+    G    L  L+         
Sbjct: 140 FKALSLNWDLERPDKADSQPPTLNP--RDVLQVAFLVEQEEGQQEGLEALQ-------KD 190

Query: 295 VKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
           ++  G  VLG+L I WR+ +G+ G L T  +L
Sbjct: 191 LRRDGRAVLGQLSIEWRSAMGDKGFLTTGNLL 222


>gi|403372611|gb|EJY86205.1| DUF974 domain containing protein [Oxytricha trifallax]
          Length = 482

 Score = 48.5 bits (114), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 59/253 (23%), Positives = 114/253 (45%), Gaps = 22/253 (8%)

Query: 188 FKFIVSNPLSVRT-----KVRVVKVGATHF--QEITFLEACIENHTKSNLYMDQVEFEPS 240
           +KF  + P  VR       V++ ++   HF  Q    L+  I+N + + +++D+V F   
Sbjct: 220 YKFEANLPFEVRKSISLKNVKLQQLFTKHFCIQNEFILQIKIKNLSVNKIFLDKVIFHCI 279

Query: 241 QNWSATMLKADGPHSD---YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGS-SSPVK 296
                 +L  +  H+    ++     +F   V+  + G I  YL+   ++ H   +  + 
Sbjct: 280 NANQMKVLDIN-THTQSLGFDESQVSVFGESVVF-NPGEIRQYLF---IIQHKDPAYKIN 334

Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTT--ITSKEIELNVVEVPSVVGIDKPFLL 354
               + LG+L++ W   LG+PG L+           T  EI+L+VV    ++ +++P  +
Sbjct: 335 KFEMHQLGQLELRWVNYLGDPGLLKIGPFKSNVEQKTKFEIDLDVVSQDQILKLEQPKSI 394

Query: 355 KLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKL 414
             +L N ++      +I LS  +  E   ++I G+    L  +E   S DF L+L     
Sbjct: 395 MFRLYNLSN---SVMKIQLSVKEK-EVGDLLICGISKYNLGRLEPQASVDFSLDLFPKSC 450

Query: 415 GVQRITGITVFDK 427
           GV  + G+ + D+
Sbjct: 451 GVHPVCGLLIKDQ 463


>gi|340056165|emb|CCC50494.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 544

 Score = 47.8 bits (112), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 43/182 (23%), Positives = 73/182 (40%), Gaps = 14/182 (7%)

Query: 10  LAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLT 68
           L+ RV  L +P L     P  V+  D+    D+  +P+       L S +    K  D  
Sbjct: 31  LSVRVAVLRKPELAQALAPELVEEGDILF--DVLANPVYHPTTKALESDEPHVVKGWDC- 87

Query: 69  YRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQT 128
              R  +H      G+   L LP + G  Y+G+ F ++++ +N ++  +  +     +  
Sbjct: 88  --GRLKMH------GIGSALSLPSSIGKHYVGQMFRAFLNFSNHASYPLNSLAFYVSMAD 139

Query: 129 DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFF 188
            ++R+  L         I   G   F VEH +   G +TL     Y+D   E+K L    
Sbjct: 140 PEERVTQLINHN--CAQIEGAGNVSFTVEHKLLRPGKYTLKVVVAYTDIAREQKRLKWLS 197

Query: 189 KF 190
            F
Sbjct: 198 SF 199


>gi|401407578|ref|XP_003883238.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325117654|emb|CBZ53206.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 320

 Score = 47.8 bits (112), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 48/230 (20%), Positives = 93/230 (40%), Gaps = 21/230 (9%)

Query: 212 FQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLI 271
            Q   F+E  ++N ++  +Y+        +      L +  P    N +    FKP    
Sbjct: 96  IQGRAFVECSLDNVSQQPVYLSDASIFCVEGIEGVRLDSGPPCDSMNHKGLHYFKP---- 151

Query: 272 RSGGGIHNYLYQLKMLSHGSSSPVKVQGS-----NVLGKLQITWRTNLGEPGRLQTQQIL 326
                 +N ++ L      +++ + V  S      VLG+L + WRT+ G  G +    + 
Sbjct: 152 ---QDRYNLVFSLT----PTATRLGVDASFIRRLPVLGQLALEWRTSTGGAGCMHDYTLT 204

Query: 327 GTTI-TSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVM 385
            +   ++K + L VV  P+ V ++ PF ++++++   ++   P  I      SD +  V 
Sbjct: 205 NSLAGSAKPLSLRVVSCPASVQVESPFQVEIEVSAHIEQVFCPVLIL---RPSDLQPFV- 260

Query: 386 INGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDS 435
           I G     L  ++      + L  +    G   + GI V+D     T D+
Sbjct: 261 IQGSTTRPLGIIDMLTPRRYTLEAVCLSPGFHSVKGIMVYDPDTHQTADA 310



 Score = 45.1 bits (105), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 48/153 (31%), Positives = 72/153 (47%), Gaps = 37/153 (24%)

Query: 10  LAFRVMRLCRPSLHVEP-PL-RVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           L  +VMRL +PS++ EP PL R+D                      + S D +  K  + 
Sbjct: 9   LTLKVMRLSQPSINAEPWPLLRIDE---------------------VTSEDQSIEKKVE- 46

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA--- 124
             R++  +  + DS   +  L+LP   G I+ GETF +YI+I+NSS  +  +V+I+    
Sbjct: 47  --RAKDCVERALDS---THALLLPATQGRIFSGETFSAYINISNSSNAQAVNVIIQGRAF 101

Query: 125 -EIQTD---KQRILLLDTSKSPVESIRAGGRYD 153
            E   D   +Q + L D S   VE I  G R D
Sbjct: 102 VECSLDNVSQQPVYLSDASIFCVEGIE-GVRLD 133


>gi|342183401|emb|CCC92881.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
          Length = 543

 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 48/102 (47%), Gaps = 2/102 (1%)

Query: 83  GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSP 142
           G+   LVLP A G  ++G+ F + +S +N+++  +  VV +  I T   + + L   +  
Sbjct: 101 GVGSALVLPSAVGKHFVGQPFRAILSFHNAASYPLTAVVFRINIVTPSVKHVALVNQEG- 159

Query: 143 VESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
             +I   G   F VEH +   G +TL     Y D   E K L
Sbjct: 160 -RTINGKGNTSFTVEHILSSPGQYTLSAVVTYIDVTKESKRL 200


>gi|403171573|ref|XP_003330778.2| hypothetical protein PGTG_12315 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375169240|gb|EFP86359.2| hypothetical protein PGTG_12315 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 405

 Score = 46.6 bits (109), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 39/171 (22%), Positives = 68/171 (39%), Gaps = 44/171 (25%)

Query: 88  LVLPQAFGAIYLGETFCSYISIN----NSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPV 143
           L LP +FG IY GE F   +S+      S+ +   +  +  E+Q+ +         KS +
Sbjct: 37  LSLPNSFGTIYQGEAFNGLLSLRPEQPRSNLIAALNPKLIVELQSSQ------SLHKSLI 90

Query: 144 ESIRAGG--------RYDFIVEHDVKELGAHTLVCTALYS-------------------- 175
            SI A            + ++ H + +LG H+L+CT  Y                     
Sbjct: 91  GSIHAHQLGPASEHEALELLINHQITQLGLHSLICTVTYQEPPPTEPTEEEEDQELTPAE 150

Query: 176 ------DGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEA 220
                 + E + +   + +KF V NPL ++TK       ++  +E   LE+
Sbjct: 151 SHQITPESEPQTRSFRKLYKFQVLNPLGIKTKTYRSPSSSSVLEETRVLES 201


>gi|403339766|gb|EJY69144.1| DUF974 domain containing protein [Oxytricha trifallax]
          Length = 429

 Score = 45.4 bits (106), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 41/156 (26%), Positives = 74/156 (47%), Gaps = 10/156 (6%)

Query: 275 GGIHNYLYQLKMLSHGSSS-PVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTT--IT 331
           G I  YL+   ++ H  S+  +     + LG+L++ W   LG+PG L+           T
Sbjct: 262 GEIRQYLF---IIQHKDSAYKINKFEMHQLGQLELRWVNYLGDPGLLKIGPFKSNVEQKT 318

Query: 332 SKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRI 391
             EI+L+VV    ++ +++P  +  +L N ++      +I LS  +  E   ++I G+  
Sbjct: 319 KFEIDLDVVSQDQILKLEQPKSIMFRLYNLSN---SVMKIQLSVKEK-EVGDLLICGISK 374

Query: 392 MALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 427
             L  +E   S DF L+L     GV  + G+ + D+
Sbjct: 375 YNLGRLEPQASVDFSLDLFPKSCGVHPVCGLLIKDQ 410


>gi|302419145|ref|XP_003007403.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
 gi|261353054|gb|EEY15482.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
          Length = 335

 Score = 45.4 bits (106), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 61/270 (22%), Positives = 103/270 (38%), Gaps = 53/270 (19%)

Query: 95  GAIYLGETFCSYISINNSS-----------TLEVRDVVIKAEIQTDK-----QRILLLD- 137
           G+ Y+GE F   +  N+             T  +RDV I AE++T       Q++ L   
Sbjct: 73  GSAYVGEHFSCTLCANHEPPVSTDVAAALPTKRIRDVRIDAEMKTPGAQGSVQKLQLTGR 132

Query: 138 ---------------TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SDGEG 179
                          T+ +    +  G     IV  D+K+ G H L  T  Y   ++  G
Sbjct: 133 ASDSSSSSSSDAAATTTATATADLAPGETLQRIVGFDLKDEGNHVLAVTVSYYEATETSG 192

Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVV---KVGA-THFQEITFLEACIENHTKSNLYMDQV 235
             +   + ++FI  + L VRTKV  +     GA    +    LEA +EN  +  + +++V
Sbjct: 193 RTRTFRKLYQFICKSSLIVRTKVGSLPGTPGGADGRVRRKWVLEAQLENCAEDVVQLERV 252

Query: 236 EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPV 295
           E     N    +   D    ++    + +  P       G +    + ++  + G     
Sbjct: 253 EL----NLEGGLAYTD---CNWGPAGKPVLHP-------GEVEQVCFVVEETAEGGGLEP 298

Query: 296 KVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
              G  V G L I WR  +G  G L T ++
Sbjct: 299 GDDGRIVFGVLGIGWRGEMGNRGYLSTGKL 328


>gi|115398331|ref|XP_001214757.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114192948|gb|EAU34648.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 227

 Score = 44.7 bits (104), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 49/209 (23%), Positives = 76/209 (36%), Gaps = 34/209 (16%)

Query: 144 ESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD---------GEGERKYLPQFFKFIVSN 194
           + ++ G     IV  D+KE G H L  +  Y++           G  +   + ++F+   
Sbjct: 22  DGLQKGQSLQKIVRFDLKEEGNHILAVSVSYTETLIGLDAQAASGRVRTFRKLYQFVAQP 81

Query: 195 PLSVRTKVRVVK-----------VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
            LSVRTK   +             G T       LEA +EN     + + Q    P   +
Sbjct: 82  CLSVRTKSSELTPLEVENKSLGPYGKTRLLRFA-LEAQLENVGDGAVVVQQTRLNPKPPF 140

Query: 244 SATMLKAD------GPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
            A  L  D                R++ +   L+    G    L  L+         +K 
Sbjct: 141 KAISLNWDLEAPDGPDPPPPTLNPRDVLQVAFLVEQEEGQQEGLEALQ-------KDMKR 193

Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
            G  VLG+L I WR  +G+ G L T  +L
Sbjct: 194 DGRAVLGQLSIEWRGPMGDKGYLTTGNLL 222


>gi|357448105|ref|XP_003594328.1| hypothetical protein MTR_2g027310 [Medicago truncatula]
 gi|355483376|gb|AES64579.1| hypothetical protein MTR_2g027310 [Medicago truncatula]
          Length = 55

 Score = 44.7 bits (104), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 21/34 (61%), Positives = 26/34 (76%)

Query: 408 NLIATKLGVQRITGITVFDKLEKITYDSLPDLEI 441
           NLIATK G+Q+ITGITVF      +Y+ LPDLE+
Sbjct: 3   NLIATKPGIQKITGITVFATRGMKSYEPLPDLEV 36


>gi|156094286|ref|XP_001613180.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148802054|gb|EDL43453.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 381

 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 42/164 (25%), Positives = 77/164 (46%), Gaps = 12/164 (7%)

Query: 77  DSADSIGLSG--LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRIL 134
           +S + + LS    L LP     IYLG+   S I+I+N+   E++   I  ++ T +Q   
Sbjct: 42  ESKEDLSLSNEFSLSLPTNSRKIYLGQNLKSQINISNNLKNEIQISSISVDVMT-RQTTF 100

Query: 135 LLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSN 194
            +  S   V ++++   ++F+    V      T+ C   Y  G  E+K L + F FI  N
Sbjct: 101 NIYRSVEHV-TVQSNCFFNFLTSFLVTFADMFTVHCAVEYLQG-SEKKKLRKDFNFICKN 158

Query: 195 PLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE 238
           P  V+T +          ++  ++EA + N  + N+ ++ V F+
Sbjct: 159 PFHVKTLI-------LQKEDKIYIEAVVRNIEEDNIMLNGVTFK 195


>gi|124505961|ref|XP_001351578.1| conserved Plasmodium protein, unknown function [Plasmodium
           falciparum 3D7]
 gi|23504505|emb|CAD51385.1| conserved Plasmodium protein, unknown function [Plasmodium
           falciparum 3D7]
          Length = 381

 Score = 43.9 bits (102), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 74/378 (19%), Positives = 153/378 (40%), Gaps = 49/378 (12%)

Query: 77  DSADSIGLSG--LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRIL 134
           D  D+I LS    L LP     +Y+G+ F S I+I+++    ++  +I  +I T      
Sbjct: 41  DINDNISLSNEISLSLPINSRKVYIGQNFKSQINISSNLKNNIQVNLINVDIWTRDNNFN 100

Query: 135 LLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSN 194
           +    +S   +I     + F+    V      T+ CTA Y  G  E+K L + F FI  +
Sbjct: 101 IYKNEESV--NISPNTFFSFVTCFPVYFFDVFTIRCTAEYKIG-SEKKKLKKDFNFISRD 157

Query: 195 PLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPH 254
           P ++R           H  +  +++  ++N  + N+ ++ +  +  +     ++K +G +
Sbjct: 158 PFNIR-------YSLVHKNDKLYMQIIMKNTEEDNIMLNDIILKDIK---CELIKNEGCN 207

Query: 255 SDYNAQSREIFKPPVLIRSGGGIHNYL----YQLKMLSHGSSSPVKVQGSNV----LGKL 306
             +N                 GIH +     Y +        S   +  + +    +  +
Sbjct: 208 KVHN-----------------GIHYFKQHDEYSMIFCIDDEKSKRYILNNTLDNDNITNM 250

Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSV-VGIDKPFLLKLKLTNQTDKE 365
           +I + TN G  G +     L    ++   ++ + E  ++   I+K +  ++   N TD E
Sbjct: 251 EIIYFTNNGGKG-IHNLHYLKKNTSTDNFKIYLKENNNIYYTINKIYNFEIIFENNTD-E 308

Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
               EI++  N +    + ++N      +   +   S  F+   I    G+     IT++
Sbjct: 309 DMFLEIFVHNNSN----IHIVNNFVKEHIIKSKTKKSHFFYTLFINQ--GIHFFNNITIY 362

Query: 426 DKLEKITYDSLPDLEIFV 443
           +K  K T + +   ++FV
Sbjct: 363 NKKNKTTKEYIKLFKLFV 380


>gi|353248314|emb|CCA77337.1| hypothetical protein PIIN_11314 [Piriformospora indica DSM 11827]
          Length = 147

 Score = 43.5 bits (101), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 32/117 (27%), Positives = 58/117 (49%), Gaps = 9/117 (7%)

Query: 213 QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIR 272
           +E  FL+  ++N T+ +++ +++EF+P   W+ T    D   ++ + ++R+ F  P  + 
Sbjct: 6   REKLFLQIDVQNLTQESMWFERLEFKPVDGWTFT----DA--NESSIEARQAFTGPKTLV 59

Query: 273 SGGGIHNYLYQLKMLSHGSSSPVKVQGSNV--LGKLQITWRTNLGEPGRLQTQQILG 327
                  Y+Y L + +      +K     V  LG+L +  RT  GEPGRL T    G
Sbjct: 60  QPQDTFQYIYTL-IPAVVPRFLIKTAPGVVIPLGRLDLACRTTFGEPGRLLTSCYPG 115


>gi|407405130|gb|EKF30284.1| hypothetical protein MOQ_005907 [Trypanosoma cruzi marinkellei]
          Length = 549

 Score = 43.1 bits (100), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 35/134 (26%), Positives = 61/134 (45%), Gaps = 5/134 (3%)

Query: 83  GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK-AEIQTDKQRILLLDTSKS 141
           G+  +L LP + G  ++G+ F +++S +N++T  +  +V   A +     R  +++   S
Sbjct: 98  GIGSVLSLPTSLGKFFVGQFFRAFLSFHNTATYPLASMVFSIACLHPSLHRSRIVNYECS 157

Query: 142 PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNP-LSVRT 200
            +E     G   F VE  +KE G +TL     Y D   E K L   F   V    + V  
Sbjct: 158 HLE---GKGNASFTVEFLLKEAGQYTLDVLVTYMDIAREAKRLTWSFSIQVERAIIEVSR 214

Query: 201 KVRVVKVGATHFQE 214
            + VV +   H ++
Sbjct: 215 TLHVVPIITRHSKD 228


>gi|367035632|ref|XP_003667098.1| hypothetical protein MYCTH_2141069 [Myceliophthora thermophila ATCC
           42464]
 gi|347014371|gb|AEO61853.1| hypothetical protein MYCTH_2141069 [Myceliophthora thermophila ATCC
           42464]
          Length = 932

 Score = 43.1 bits (100), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 96/417 (23%), Positives = 148/417 (35%), Gaps = 124/417 (29%)

Query: 8   HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
           HS++ +V+RL RPSL  + PL   P+           PI AS    L  S          
Sbjct: 538 HSVSLKVLRLSRPSLVAQYPLLPPPSSSPDDPLSHQPPIPAS----LAYSHHGAGGVIPP 593

Query: 68  TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETF----CSYISINNSST-----LEVR 118
           T  + F+L         S +L LP +FG+ Y+GETF    C+   +    T       +R
Sbjct: 594 TNPAPFVL---------SPILNLPPSFGSAYVGETFSCTLCANYDVPEDGTGAGPKKSIR 644

Query: 119 DVVIKAEIQTDKQ----------------RILLLDTSKS--------------------- 141
           DV I+AE++T                   ++ L   S S                     
Sbjct: 645 DVRIEAEMKTPSSSSSSSSSAAAGAFPAIKLPLYPPSASHAGDEHGGSGGGGGGGGGGGG 704

Query: 142 -PVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLS 197
             V+    G     I+  D+KE G H L  T  Y   S+  G  +   + ++F+    L 
Sbjct: 705 GGVDLPSPGTSLQKILSFDLKEEGNHVLAVTVSYYEASELSGRTRTFRKLYQFVCKASLI 764

Query: 198 VRTKVR-VVKVGATHFQEIT-------------------------------------FLE 219
           VRTK   +  VG    Q                                         LE
Sbjct: 765 VRTKASPLPAVGPGEEQGEGEEEEEEEEEEEEEEEEEGEKDEGEKGGRGRPRLRRRWVLE 824

Query: 220 ACIENHTKSNLYMDQV--------EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLI 271
           A +EN ++  + ++ V         +E   +W      ADG      ++ + + +P    
Sbjct: 825 AQLENCSEEGILLESVGLELESGLRYEDCNDWQG---HADG--GAVGSRMKPVLQP---- 875

Query: 272 RSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGT 328
              G      + ++    G +   +V+G  V G LQI WR+ +G  G L T + LGT
Sbjct: 876 ---GETEQVCFVIE--EEGDAVVQEVEGRVVFGVLQIGWRSEMGNRGFLSTGK-LGT 926


>gi|156059820|ref|XP_001595833.1| hypothetical protein SS1G_03923 [Sclerotinia sclerotiorum 1980]
 gi|154701709|gb|EDO01448.1| hypothetical protein SS1G_03923 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 385

 Score = 42.7 bits (99), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 40/156 (25%), Positives = 61/156 (39%), Gaps = 39/156 (25%)

Query: 85  SGLLVLPQAFGAIYLGETFCSYISINN---------------------SSTLEVRDVVIK 123
           S LL LP AFG+ Y+GETF   +  NN                     ++T  + ++ + 
Sbjct: 70  SPLLTLPPAFGSAYVGETFSCTLCANNELPPLSQLSQTHTSPDIVASPNTTKVISNITLS 129

Query: 124 AE--IQTDKQRILLLDTSKSPVESIRAGGR------------YDFIVEHDVKELGAHTLV 169
           AE  I +    I L  +  SP  +    G                ++  D+KE GAH L 
Sbjct: 130 AEMKIPSTPNPISLPLSGPSPFPAASTTGEETPETQIISQASLQKVLHFDLKEEGAHVLA 189

Query: 170 CTALYSD----GEGERKYLPQFFKFIVSNPLSVRTK 201
            T  Y++         +   + ++FI    L VRTK
Sbjct: 190 VTVTYTESSPSSSPRTRTFRKLYQFICKGCLVVRTK 225


>gi|221057331|ref|XP_002259803.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|193809875|emb|CAQ40579.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 382

 Score = 42.4 bits (98), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 53/113 (46%), Gaps = 2/113 (1%)

Query: 88  LVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIR 147
           L LP     IY+G+     I+I+N+   +++   I  ++ T KQ    +  S   V ++R
Sbjct: 55  LSLPINSRKIYIGQNLKCQINISNNLKNDIQICTISVDVMT-KQTTFNIYRSAEHVITVR 113

Query: 148 AGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRT 200
           +   ++F+    V      T+ C   Y  G  E+K L + F FI  NP  ++T
Sbjct: 114 SNSFFNFLATFLVTFADMFTVHCAVEYLQG-SEKKKLRKDFNFISKNPFHLKT 165


>gi|209881173|ref|XP_002142025.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209557631|gb|EEA07676.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 380

 Score = 42.4 bits (98), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 60/307 (19%), Positives = 126/307 (41%), Gaps = 25/307 (8%)

Query: 135 LLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSN 194
           +L +++  +  I  G   + I++  V E+G   L C  +Y    G +    + +KF V  
Sbjct: 66  ILYSNEDNLRDIEIGNSINTIIKERVDEVGLFNLTC-QIYFIVNGSKLTQKRSYKFAVIA 124

Query: 195 PLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP-------SQNWSATM 247
           P ++  ++           ++ F+E  +EN T  ++ +++++ +         QN   + 
Sbjct: 125 PFNISHRLFYHNDNLKK-SKLCFIEVSLENITHQSISLEKLDIQNWIDEKGNKQNIQVSQ 183

Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
           L     + D N +  S+ ++   V++      +N ++ +    +  S  +      + G+
Sbjct: 184 LSTTQFY-DENCKNTSQLLYNSGVIVLRPRSRYNQIFCISQSLYKES--INNIDKYITGQ 240

Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVE----VPSVVGIDKPFLLKLKLTNQ 361
           L I+W++       + +  I           LN V     VPS + I   F +++ + N 
Sbjct: 241 LSISWKSKTYGDAFMNSYSITCQVSNEDIYNLNGVAIDVIVPSTIEIQTIFTIEVIIIND 300

Query: 362 TDKEQGPFEIWLSQNDSDEEKVV--MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
           TDK     E+ +     D E ++   I G+ I+ +  +E        L  I+   GV  I
Sbjct: 301 TDKRLHDIELSI-----DNEALLPFCILGMDILQIKFMEPNQKITIPLQCISFTSGVHPI 355

Query: 420 TGITVFD 426
            GI + +
Sbjct: 356 NGIKLIN 362


>gi|389584327|dbj|GAB67060.1| hypothetical protein PCYB_104100 [Plasmodium cynomolgi strain B]
          Length = 381

 Score = 41.6 bits (96), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 41/166 (24%), Positives = 78/166 (46%), Gaps = 16/166 (9%)

Query: 77  DSADSIGLSG--LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRIL 134
           +S + + LS    L LP     IY+G+   S I+I+N+   E++   I  ++ T   R  
Sbjct: 42  ESKEDLSLSNEFSLSLPINSRKIYIGQNLKSQINISNNLKNEIQICTISVDVMT---RHT 98

Query: 135 LLDTSKSPVE--SIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIV 192
             +  +S VE  ++++   ++F+    V      T+ C   Y  G  E+K L + F FI 
Sbjct: 99  TFNIYRS-VEHVTVQSNSFFNFLTTFLVTFADMFTVHCAVEYLQG-NEKKKLRKDFNFIC 156

Query: 193 SNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE 238
            NP  ++T +          ++  ++EA + N  + N+ ++ V F+
Sbjct: 157 KNPFHLKTLI-------LQKEDKIYIEAVVRNIEEDNIMLNDVVFK 195


>gi|354482026|ref|XP_003503201.1| PREDICTED: peroxisomal proliferator-activated receptor A-interacting
            complex 285 kDa protein-like [Cricetulus griseus]
 gi|344254975|gb|EGW11079.1| Peroxisomal proliferator-activated receptor A-interacting complex 285
            kDa protein [Cricetulus griseus]
          Length = 2914

 Score = 41.2 bits (95), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 44/79 (55%), Gaps = 4/79 (5%)

Query: 193  SNPLSVRTKVRVVKVGATHFQEITFLEACIENHT--KSNLYMDQVE--FEPSQNWSATML 248
            S  ++V   V +   GA      +F+  CIE+H+    +L ++Q+E      Q+WS+ ML
Sbjct: 1153 SQLVAVGDAVALCSSGACRKLWKSFIRECIEHHSVFPEDLSLEQIEQGVAQRQHWSSLML 1212

Query: 249  KADGPHSDYNAQSREIFKP 267
            +A GP + + A ++++ +P
Sbjct: 1213 RAGGPDAKHTAVAQDMQRP 1231


>gi|71419122|ref|XP_811074.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70875696|gb|EAN89223.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 571

 Score = 40.4 bits (93), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 58/131 (44%), Gaps = 5/131 (3%)

Query: 83  GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK-AEIQTDKQRILLLDTSKS 141
           G+  +L LP + G  ++G+ F +++S +N++T  +  +V     +     R  +++   S
Sbjct: 120 GIGTVLSLPTSLGKFFVGQPFRAFLSFHNAATYPLATMVFSIVCLHPTLHRSKIVNYECS 179

Query: 142 PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNP-LSVRT 200
            +E     G   F VE  +KE G +TL     Y D   E K L   F   V    + V  
Sbjct: 180 HLE---GKGNASFTVECLLKEPGQYTLDVLVTYMDIAKEAKRLTWSFSIQVERAIIEVSR 236

Query: 201 KVRVVKVGATH 211
            + VV +   H
Sbjct: 237 TIHVVPIITRH 247


>gi|71422967|ref|XP_812298.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70877064|gb|EAN90447.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 549

 Score = 40.0 bits (92), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 32/133 (24%), Positives = 58/133 (43%), Gaps = 9/133 (6%)

Query: 83  GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV---VIKAEIQTDKQRILLLDTS 139
           G+  +L LP + G  ++G+ F +++S +N++T  +  +   ++       + +I+  + S
Sbjct: 98  GIGSVLSLPTSLGKFFVGQPFRAFLSFHNAATYPLATMAFSIVCLHPTLHRSKIVNYECS 157

Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNP-LSV 198
                 +   G   F VE  +KE G +TL     Y D   E K L   F   V    + V
Sbjct: 158 H-----LEGKGNASFTVECLLKEPGQYTLDVLVTYMDIAKEAKRLTWSFSIQVERAIIEV 212

Query: 199 RTKVRVVKVGATH 211
              + VV +   H
Sbjct: 213 SRTIHVVPIITRH 225


>gi|353248956|emb|CCA77414.1| hypothetical protein PIIN_11391 [Piriformospora indica DSM 11827]
          Length = 147

 Score = 39.3 bits (90), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 28/96 (29%), Positives = 48/96 (50%), Gaps = 11/96 (11%)

Query: 230 LYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQL--KML 287
           ++ +++EF+P   W+ T    D   ++ + ++R+ F  P  +        Y+Y L   ++
Sbjct: 1   MWFERLEFKPVDGWTFT----DA--NESSIEARQAFTGPKTLVQPQDTFQYIYTLIPAVI 54

Query: 288 SHGSSSPVKVQGSNV-LGKLQITWRTNLGEPGRLQT 322
                 P    G+ + LG+L I WRT  GEPGRL T
Sbjct: 55  PRFLIKPAP--GAVIPLGRLDIAWRTTFGEPGRLLT 88


>gi|254284359|ref|ZP_04959327.1| glyoxalase family protein [gamma proteobacterium NOR51-B]
 gi|219680562|gb|EED36911.1| glyoxalase family protein [gamma proteobacterium NOR51-B]
          Length = 454

 Score = 38.9 bits (89), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 31/108 (28%), Positives = 55/108 (50%), Gaps = 10/108 (9%)

Query: 320 LQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLK-LKLTNQTDKEQGPFEIWLSQNDS 378
           L   Q+ G  ++    E + +EV   +G+D+P+ +K   LT+Q+D  +     WL   ++
Sbjct: 305 LAWYQMFGYEVSGSLHETDSLEVAEAMGLDRPYRIKGAMLTHQSDGSEIKLVQWLEPYNA 364

Query: 379 DEEKVVMIN--GLRIMALAPVEAFGSTDFHLNLIATKL-GVQRITGIT 423
           +    + +N  G+  MALA      STD   ++ A K  GV+ ++ IT
Sbjct: 365 EAPYPLPVNHLGIHRMALA------STDIESDVAALKAQGVEFVSPIT 406


>gi|119619024|gb|EAW98618.1| hCG1992287, isoform CRA_a [Homo sapiens]
 gi|119619025|gb|EAW98619.1| hCG1992287, isoform CRA_a [Homo sapiens]
          Length = 115

 Score = 38.9 bits (89), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 36/67 (53%)

Query: 280 YLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNV 339
           YL  +++    S     ++G   +GKL I W+ NLGE   LQT Q+LG +   + + L++
Sbjct: 34  YLDHVQLKQKYSEEAGIIKGLREMGKLDIVWKRNLGEMAMLQTIQLLGESPGYENMRLSL 93

Query: 340 VEVPSVV 346
             +P  V
Sbjct: 94  EIIPDSV 100


>gi|407844145|gb|EKG01819.1| hypothetical protein TCSYLVIO_007171 [Trypanosoma cruzi]
          Length = 549

 Score = 38.5 bits (88), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 27/113 (23%), Positives = 51/113 (45%), Gaps = 8/113 (7%)

Query: 83  GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV---VIKAEIQTDKQRILLLDTS 139
           G+  +L LP + G  ++G+ F +++S +N++   +  +   ++    +  + +I+  + S
Sbjct: 98  GIGSVLSLPTSLGKFFVGQPFRAFLSFHNAANYPLATMAFSIVCLHPKLHRSKIVNYECS 157

Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIV 192
                 +   G   F VE  +KE G +TL     Y D   E K L   F   V
Sbjct: 158 H-----LEGKGNASFTVEFLLKEPGQYTLDVLVTYMDIAKEAKRLTWSFSIQV 205


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.136    0.390 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,923,887,851
Number of Sequences: 23463169
Number of extensions: 287174940
Number of successful extensions: 601558
Number of sequences better than 100.0: 344
Number of HSP's better than 100.0 without gapping: 244
Number of HSP's successfully gapped in prelim test: 100
Number of HSP's that attempted gapping in prelim test: 600180
Number of HSP's gapped (non-prelim): 451
length of query: 446
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 300
effective length of database: 8,933,572,693
effective search space: 2680071807900
effective search space used: 2680071807900
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)