BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 013275
(446 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255556003|ref|XP_002519036.1| expressed protein, putative [Ricinus communis]
gi|223541699|gb|EEF43247.1| expressed protein, putative [Ricinus communis]
Length = 434
Score = 697 bits (1799), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 340/447 (76%), Positives = 384/447 (85%), Gaps = 14/447 (3%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
MS+TPGTHSLAFRVMRLCRPS HV+ L VDP+DL +GEDIFDDP+AAS LPPLI S +T
Sbjct: 1 MSTTPGTHSLAFRVMRLCRPSFHVDAQLLVDPSDLIVGEDIFDDPVAASRLPPLIDSHIT 60
Query: 61 T-NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
+SDL+YR+RFL +DS GL+GLLVLPQAFGAIYLGETFCSYISINNSS EVRD
Sbjct: 61 KLTDTSDLSYRTRFLHQHPSDSFGLTGLLVLPQAFGAIYLGETFCSYISINNSSNFEVRD 120
Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
V+IKAEIQT++QRILLLDTSK+PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG+G
Sbjct: 121 VIIKAEIQTERQRILLLDTSKNPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGDG 180
Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
ERKYLPQFFKFIV+NPLSVRTKVRVVK E T+LEACIENHTK+NLYMDQVEFEP
Sbjct: 181 ERKYLPQFFKFIVANPLSVRTKVRVVK-------ETTYLEACIENHTKTNLYMDQVEFEP 233
Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
+Q+WSA ++K D S+ ++ +REIFKPPVLIRSGGGIHNYLYQL++ +HG++
Sbjct: 234 AQHWSAKIIKDDEKQSEKDSLTREIFKPPVLIRSGGGIHNYLYQLRLSAHGAAQ------ 287
Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
SNVLGKLQITWRTNLGEPGRLQTQQILGT IT KEIEL + +VP+V+ +DKPF + LKLT
Sbjct: 288 SNVLGKLQITWRTNLGEPGRLQTQQILGTPITRKEIELCIAKVPAVINLDKPFSVHLKLT 347
Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
N TDKE GPFE+WLSQ+ S EEK V INGL+ M L+ +EAFG+TDFHLNLIATKLGVQRI
Sbjct: 348 NHTDKELGPFEVWLSQDGSVEEKAVTINGLQTMELSQLEAFGTTDFHLNLIATKLGVQRI 407
Query: 420 TGITVFDKLEKITYDSLPDLEIFVDQD 446
TGITVFDK EK TYD LPDLEIFV D
Sbjct: 408 TGITVFDKSEKKTYDPLPDLEIFVAID 434
>gi|225470348|ref|XP_002269604.1| PREDICTED: UPF0533 protein C5orf44 [Vitis vinifera]
gi|296090651|emb|CBI41051.3| unnamed protein product [Vitis vinifera]
Length = 438
Score = 692 bits (1787), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/444 (76%), Positives = 385/444 (86%), Gaps = 8/444 (1%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
MSS +HSLAFRVMRLCRPS HV+ PLR+DP DL GEDIFDDP+AAS+LP L+ +
Sbjct: 1 MSSGQTSHSLAFRVMRLCRPSFHVDNPLRLDPADLLAGEDIFDDPLAASDLPRLLHNHTL 60
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
+ SDLTYR+RFLL+D +D++GLSGLLVLPQ+FGAIYLGETFCSYISINNSS EVRDV
Sbjct: 61 KSNDSDLTYRTRFLLNDPSDAMGLSGLLVLPQSFGAIYLGETFCSYISINNSSNFEVRDV 120
Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
VIKAEIQT+KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVC+ALY+DG+GE
Sbjct: 121 VIKAEIQTEKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCSALYNDGDGE 180
Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
RKYLPQFFKF+V+NPLSV+TKVR+VK + TFLEACIENHTKSNLYMDQVEFEPS
Sbjct: 181 RKYLPQFFKFVVANPLSVKTKVRIVK-------DNTFLEACIENHTKSNLYMDQVEFEPS 233
Query: 241 QNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS 300
Q+W+AT+LKA SD ++ +REIFK P+LIRSGGGI NYLYQLK+ S GS+ +KV GS
Sbjct: 234 QHWTATVLKAGEGLSDNDSPTREIFKQPILIRSGGGIQNYLYQLKLSSQGSAQ-MKVDGS 292
Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
NVLGKLQITWRTNLGEPGRLQTQQILG+ IT KEIEL V+EVPSV +++PFL+ L LTN
Sbjct: 293 NVLGKLQITWRTNLGEPGRLQTQQILGSPITRKEIELQVMEVPSVTILERPFLVHLNLTN 352
Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT 420
QTD+ GPFE+WLSQ+DS EE+VVM+NGLR MAL VEAF STDF LNLIATKLGVQ+IT
Sbjct: 353 QTDRTMGPFEVWLSQSDSREEQVVMVNGLRAMALPQVEAFCSTDFRLNLIATKLGVQKIT 412
Query: 421 GITVFDKLEKITYDSLPDLEIFVD 444
GITVFD EK TY+ LPDLEIFVD
Sbjct: 413 GITVFDIREKRTYEPLPDLEIFVD 436
>gi|356548745|ref|XP_003542760.1| PREDICTED: UPF0533 protein C5orf44 homolog [Glycine max]
Length = 440
Score = 670 bits (1729), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/440 (74%), Positives = 375/440 (85%), Gaps = 11/440 (2%)
Query: 7 THSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
+HSLAFRVMRLCRPS +VEPPLR+DPTDLF+GED+FDDP A P SS + SD
Sbjct: 12 SHSLAFRVMRLCRPSFNVEPPLRLDPTDLFVGEDLFDDPAAK---PHSFSSAAAHDDDSD 68
Query: 67 LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEI 126
YR+RFLL +D++GLSGLLVLPQ+FGAIYLGETFCSYISINNSS EVR+V+IKAEI
Sbjct: 69 PNYRNRFLLRHFSDAMGLSGLLVLPQSFGAIYLGETFCSYISINNSSNFEVREVLIKAEI 128
Query: 127 QTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQ 186
QT++QRILLLDTSKSPVE+IRAGGRYDFIVEHDVKELG HTLVCTALY+DG+GERKYLPQ
Sbjct: 129 QTERQRILLLDTSKSPVETIRAGGRYDFIVEHDVKELGPHTLVCTALYNDGDGERKYLPQ 188
Query: 187 FFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
FFKFIV+NPLSVRTKVRV+K E TFLEACIENHTKSNL+MDQV+FEP+Q +SAT
Sbjct: 189 FFKFIVANPLSVRTKVRVIK-------ETTFLEACIENHTKSNLFMDQVDFEPAQYYSAT 241
Query: 247 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 306
+LK DG HS+ ++ +REIFKPP+LIRSGGGI+NYLYQLK LS GS KV+GSNVLGKL
Sbjct: 242 ILKGDGHHSEKDSPTREIFKPPILIRSGGGIYNYLYQLKTLSDGSPQ-TKVEGSNVLGKL 300
Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 366
QITWRTNLGEPGRLQTQQILGT T KEIEL VVEVPS++ + KPF+LKL LTNQTD+E
Sbjct: 301 QITWRTNLGEPGRLQTQQILGTPATKKEIELQVVEVPSIINLQKPFMLKLNLTNQTDREL 360
Query: 367 GPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFD 426
GPFE+ LSQN S E+VVMINGL+ M L+ V+A GST+FHLNLIATK G+QRITGITVFD
Sbjct: 361 GPFEVGLSQNVSYGERVVMINGLQSMVLSEVQALGSTNFHLNLIATKPGIQRITGITVFD 420
Query: 427 KLEKITYDSLPDLEIFVDQD 446
E +Y+ LPDLEIFVD D
Sbjct: 421 TREMKSYEPLPDLEIFVDMD 440
>gi|449457717|ref|XP_004146594.1| PREDICTED: UPF0533 protein C5orf44-like [Cucumis sativus]
Length = 440
Score = 663 bits (1710), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 317/445 (71%), Positives = 384/445 (86%), Gaps = 8/445 (1%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
MS+ G+HSLAFRVMRLCRPS V+PPLR+DP DL +GEDI DDP+AA+ LP L++ ++
Sbjct: 1 MSNAQGSHSLAFRVMRLCRPSFQVDPPLRLDPVDLLVGEDILDDPVAANQLPRLLAPQLS 60
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
+ SDL+Y SRFLLHDS+D++GL+GLLVLPQAFGAIYLGETFCSYIS+NNSS EVRDV
Sbjct: 61 DDSDSDLSYSSRFLLHDSSDAMGLNGLLVLPQAFGAIYLGETFCSYISVNNSSNFEVRDV 120
Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
+IKAEIQT++QRILLLD+SKSPVE+IRAGGRYDFIVEHDVKELGAHTLVCTALY+DG+GE
Sbjct: 121 IIKAEIQTERQRILLLDSSKSPVETIRAGGRYDFIVEHDVKELGAHTLVCTALYNDGDGE 180
Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
RKYLPQFFKF+V+NPLSVRTKVRVVK + TFLEACIENHTKSNL+MDQV+FEPS
Sbjct: 181 RKYLPQFFKFMVANPLSVRTKVRVVK-------DSTFLEACIENHTKSNLFMDQVDFEPS 233
Query: 241 QNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS 300
NW+A ++ AD HS++ + +RE+FKPPVL+RSGGGIHN+LYQLK ++G SSP+KV+GS
Sbjct: 234 PNWNAVIINADEHHSEHKSTTREVFKPPVLVRSGGGIHNFLYQLKCSTNGPSSPLKVEGS 293
Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
N+LGKLQITWRTN+GEPGRLQTQQILG+ IT KE+ELNVVE+P V+ +++PF L ++LT
Sbjct: 294 NILGKLQITWRTNMGEPGRLQTQQILGSPITRKELELNVVEMPDVIRLERPFTLHMRLTT 353
Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT 420
Q ++E GPFE+W+S N SDE+KVVM+NGL+ + + VE +GSTDFHLNLIATK GVQRI
Sbjct: 354 QIERELGPFEVWMSLNSSDEDKVVMVNGLQKVVIPRVEPYGSTDFHLNLIATKPGVQRIA 413
Query: 421 GITVFDKLEKITYDS-LPDLEIFVD 444
GI VFD EK Y+ PDLEI+VD
Sbjct: 414 GIKVFDTREKKAYEHPSPDLEIYVD 438
>gi|224079249|ref|XP_002305809.1| predicted protein [Populus trichocarpa]
gi|222848773|gb|EEE86320.1| predicted protein [Populus trichocarpa]
Length = 450
Score = 659 bits (1700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/442 (73%), Positives = 374/442 (84%), Gaps = 12/442 (2%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
MS+ P T SLAFRVMRLCRPS HV+ PL +DP+DL +GEDIFDDP+AA++LPPLI + +T
Sbjct: 1 MSTPPATQSLAFRVMRLCRPSFHVDTPLLLDPSDLILGEDIFDDPLAATHLPPLIDTHLT 60
Query: 61 TN-KSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
SSDL+YRSRFLL + +DS GLSGLLVLPQ+FGAIYLGETFCSY+SINNSS EVRD
Sbjct: 61 NPIDSSDLSYRSRFLLQNPSDSFGLSGLLVLPQSFGAIYLGETFCSYVSINNSSNFEVRD 120
Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
+VIKAE+QT++QRILLLDTSK+PVESIRA GRYDFIVEHDVKELGAHTLVCTALY+DG+G
Sbjct: 121 IVIKAEMQTERQRILLLDTSKTPVESIRASGRYDFIVEHDVKELGAHTLVCTALYTDGDG 180
Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
ERKYLPQFFKFIV+NPLSVRTKV ++ V QE T+LEACIENHTK+NLYMDQVEFEP
Sbjct: 181 ERKYLPQFFKFIVANPLSVRTKVLLLLVS----QETTYLEACIENHTKTNLYMDQVEFEP 236
Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
+ NWSA +LKAD S N+ SR P L++SGGGI NYLYQL + SHGS+
Sbjct: 237 APNWSAKILKADEHKSKDNSPSR-CGNIPFLVKSGGGIRNYLYQLSLSSHGSAE------ 289
Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
SNVLGKLQITWRTNLGEPGRLQTQQILGT IT KEIEL+V EVPS + +D+PFL+ L LT
Sbjct: 290 SNVLGKLQITWRTNLGEPGRLQTQQILGTPITPKEIELHVAEVPSAINLDRPFLVHLNLT 349
Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
NQTD+E GPFE+WLSQ+D+ +EK VMINGL+ M L+ +EAFGSTDF+LNLIATKLGVQ+I
Sbjct: 350 NQTDRELGPFEVWLSQDDTLDEKTVMINGLQTMELSQLEAFGSTDFYLNLIATKLGVQKI 409
Query: 420 TGITVFDKLEKITYDSLPDLEI 441
TGITVFDK EK TY LPDLE+
Sbjct: 410 TGITVFDKSEKKTYAPLPDLEV 431
>gi|356521339|ref|XP_003529314.1| PREDICTED: UPF0533 protein C5orf44-like [Glycine max]
Length = 435
Score = 649 bits (1673), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 322/440 (73%), Positives = 369/440 (83%), Gaps = 15/440 (3%)
Query: 7 THSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
+HSLAFRVMRLCRPS +VEPPLR+DP DLF GED+FDDP A PP SS ++ +
Sbjct: 11 SHSLAFRVMRLCRPSFNVEPPLRLDPADLFAGEDLFDDPAAN---PPSFSSSDDSDSN-- 65
Query: 67 LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEI 126
YR+RFLL +D++GLSGLLVLPQ+FGAIYLGETFCSYISINNSS EVRDV+IKAEI
Sbjct: 66 --YRNRFLLRHFSDAMGLSGLLVLPQSFGAIYLGETFCSYISINNSSNFEVRDVIIKAEI 123
Query: 127 QTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQ 186
QT++ RILLLDTSKSPVE+IRAGGRYDFIVEHDVKELG HTLVCTALY+DG+GERKYLPQ
Sbjct: 124 QTERLRILLLDTSKSPVETIRAGGRYDFIVEHDVKELGPHTLVCTALYNDGDGERKYLPQ 183
Query: 187 FFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
FFKFIV+NPLSVRTKVRV+K E TFLEACIENHTKSNL+MDQV+FEP+Q +SA+
Sbjct: 184 FFKFIVANPLSVRTKVRVIK-------ETTFLEACIENHTKSNLFMDQVDFEPAQYYSAS 236
Query: 247 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 306
+LK DG HS+ ++ +RE FKPP+LIRSGGGI+NYLYQLK S G KV+GSNVLGKL
Sbjct: 237 ILKGDGHHSEKDSPTRETFKPPILIRSGGGIYNYLYQLKTSSDGLPQ-TKVEGSNVLGKL 295
Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 366
QITWRTNLGEPGRLQTQQILGTT T KEIEL VVEVPS++ + PF+LKL LTNQTD+E
Sbjct: 296 QITWRTNLGEPGRLQTQQILGTTATKKEIELQVVEVPSIINLQNPFMLKLNLTNQTDREL 355
Query: 367 GPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFD 426
GPFE+ LSQN S E+ VMINGL+ M L+ V+A GST+FHLNLIATK G+QRITGITVFD
Sbjct: 356 GPFEVSLSQNVSYGERAVMINGLQSMVLSEVQALGSTNFHLNLIATKPGIQRITGITVFD 415
Query: 427 KLEKITYDSLPDLEIFVDQD 446
E +Y+ LPDLEIFVD D
Sbjct: 416 TREMKSYEPLPDLEIFVDMD 435
>gi|388496064|gb|AFK36098.1| unknown [Medicago truncatula]
Length = 437
Score = 638 bits (1645), Expect = e-180, Method: Compositional matrix adjust.
Identities = 314/439 (71%), Positives = 366/439 (83%), Gaps = 15/439 (3%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS+AFRVMRLCRPS +V+PPLR+DP DLF+GED FDDP A S SSD+ SD
Sbjct: 14 HSVAFRVMRLCRPSFNVDPPLRIDPDDLFVGEDHFDDPSAPS------SSDLIA-PDSDP 66
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
YR+RFLL +DS+GLSGLLVLPQ+FGAIYLGETFCSYISINNSS EVR+V+IKAEIQ
Sbjct: 67 NYRNRFLLQHFSDSMGLSGLLVLPQSFGAIYLGETFCSYISINNSSNFEVREVIIKAEIQ 126
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T++QRILLLDTSKSPVE+IRAGGRYDFIVEHDVKELG HTLVCTALY+DG+GERKYLPQF
Sbjct: 127 TERQRILLLDTSKSPVETIRAGGRYDFIVEHDVKELGPHTLVCTALYNDGDGERKYLPQF 186
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKFIV+NPLSVRTKVRV+K E TFLEACIENHTKSNL+MDQV+FEP+Q++SAT+
Sbjct: 187 FKFIVANPLSVRTKVRVIK-------ETTFLEACIENHTKSNLFMDQVDFEPAQHYSATI 239
Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
L+ DGPH++ + +RE FKPP+LIRSGGGI+NYLYQLK S S+ KV+G+NVLGKLQ
Sbjct: 240 LRGDGPHTEKDNTARETFKPPILIRSGGGIYNYLYQLKS-SLDDSAQTKVEGNNVLGKLQ 298
Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
ITWRTNLGEPGRLQTQQILGT T KEIEL VVEVPS++ + +PF LKL LTN T++E G
Sbjct: 299 ITWRTNLGEPGRLQTQQILGTPTTKKEIELQVVEVPSIINLQRPFTLKLNLTNLTERELG 358
Query: 368 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 427
PF++ +SQN S E VMINGL+ M L+ +EA GST+ HLNLIATK G+Q+ITGITVFD
Sbjct: 359 PFKVSVSQNGSSGETAVMINGLQSMVLSQIEALGSTNIHLNLIATKPGIQKITGITVFDT 418
Query: 428 LEKITYDSLPDLEIFVDQD 446
+Y+ LPDLEIFVD D
Sbjct: 419 RGMKSYEPLPDLEIFVDID 437
>gi|358346667|ref|XP_003637387.1| hypothetical protein MTR_084s0010 [Medicago truncatula]
gi|355503322|gb|AES84525.1| hypothetical protein MTR_084s0010 [Medicago truncatula]
Length = 446
Score = 629 bits (1623), Expect = e-178, Method: Compositional matrix adjust.
Identities = 314/448 (70%), Positives = 366/448 (81%), Gaps = 24/448 (5%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS+AFRVMRLCRPS +V+PPLR+DP DLF+GED FDDP A S SSD+ SD
Sbjct: 14 HSVAFRVMRLCRPSFNVDPPLRIDPDDLFVGEDHFDDPSAPS------SSDLIA-PDSDP 66
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
YR+RFLL +DS+GLSGLLVLPQ+FGAIYLGETFCSYISINNSS EVR+V+IKAEIQ
Sbjct: 67 NYRNRFLLQHFSDSMGLSGLLVLPQSFGAIYLGETFCSYISINNSSNFEVREVIIKAEIQ 126
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T++QRILLLDTSKSPVE+IRAGGRYDFIVEHDVKELG HTLVCTALY+DG+GERKYLPQF
Sbjct: 127 TERQRILLLDTSKSPVETIRAGGRYDFIVEHDVKELGPHTLVCTALYNDGDGERKYLPQF 186
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKFIV+NPLSVRTKVRV+K E TFLEACIENHTKSNL+MDQV+FEP+Q++SAT+
Sbjct: 187 FKFIVANPLSVRTKVRVIK-------ETTFLEACIENHTKSNLFMDQVDFEPAQHYSATI 239
Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
L+ DGPH++ + +RE FKPP+LIRSGGGI+NYLYQLK S S+ KV+G+NVLGKLQ
Sbjct: 240 LRGDGPHTEKDNTARETFKPPILIRSGGGIYNYLYQLKS-SLDDSAQTKVEGNNVLGKLQ 298
Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
ITWRTNLGEPGRLQTQQILGT T KEIEL VVEVPS++ + +PF LKL LTN T++E G
Sbjct: 299 ITWRTNLGEPGRLQTQQILGTPTTKKEIELQVVEVPSIINLQRPFTLKLNLTNLTERELG 358
Query: 368 PFEIWLSQNDSDEEKVVMINGLRIM---------ALAPVEAFGSTDFHLNLIATKLGVQR 418
PF++ +SQN S E VMINGL+ M L+ +EA GST+ HLNLIATK G+Q+
Sbjct: 359 PFKVSVSQNGSSGETAVMINGLQSMVMHSLWIISVLSQIEALGSTNIHLNLIATKPGIQK 418
Query: 419 ITGITVFDKLEKITYDSLPDLEIFVDQD 446
ITGITVFD +Y+ LPDLEIFVD D
Sbjct: 419 ITGITVFDTRGMKSYEPLPDLEIFVDID 446
>gi|18407493|ref|NP_566117.1| uncharacterized protein [Arabidopsis thaliana]
gi|16226796|gb|AAL16264.1|AF428334_1 At2g47960/T9J23.10 [Arabidopsis thaliana]
gi|18377797|gb|AAL67048.1| unknown protein [Arabidopsis thaliana]
gi|20197311|gb|AAC63650.2| expressed protein [Arabidopsis thaliana]
gi|20197565|gb|AAM15133.1| expressed protein [Arabidopsis thaliana]
gi|21281259|gb|AAM45021.1| unknown protein [Arabidopsis thaliana]
gi|330255823|gb|AEC10917.1| uncharacterized protein [Arabidopsis thaliana]
Length = 442
Score = 598 bits (1543), Expect = e-168, Method: Compositional matrix adjust.
Identities = 299/447 (66%), Positives = 353/447 (78%), Gaps = 12/447 (2%)
Query: 2 SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTT 61
+ T G HSLAFRVMRLC+PS HV+PPLR+DP DL GED DDP +AS +SS
Sbjct: 6 TQTHGPHSLAFRVMRLCKPSFHVDPPLRIDPFDLLAGEDFSDDPSSASLFRRHVSSADAV 65
Query: 62 NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
+ SDL+YR+RFLL+ D IGLSGLL+LPQ+FGAIYLGETFCSYIS+NNSST EVRDV
Sbjct: 66 D--SDLSYRNRFLLNHPTDPIGLSGLLLLPQSFGAIYLGETFCSYISVNNSSTSEVRDVT 123
Query: 122 IKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
IKAEIQT++QRILLLDTSKSPVESIR GGRYDFIVEHDVKELGAHTLVC+ALY+D +GER
Sbjct: 124 IKAEIQTERQRILLLDTSKSPVESIRTGGRYDFIVEHDVKELGAHTLVCSALYNDADGER 183
Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
KYLPQFFKF+V+NPLSVRTKVRVVK E TFLEACIENHTK+NL+MDQV+FEP++
Sbjct: 184 KYLPQFFKFVVANPLSVRTKVRVVK-------ETTFLEACIENHTKANLFMDQVDFEPAK 236
Query: 242 NWSATMLKADGPHSD--YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
WSA L+ + D + S I KPPV+IRSGGGIHNYLY+L S S K QG
Sbjct: 237 QWSAVRLQNEDSTEDPPTSGLSGLIPKPPVIIRSGGGIHNYLYKLNP-SADVSGQTKFQG 295
Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
SN+LGK QITWRTNLGEPGRLQTQQILG ++ KEI + VVEVP+V+ +++PF L LT
Sbjct: 296 SNILGKFQITWRTNLGEPGRLQTQQILGAPVSRKEINMRVVEVPAVIHLNRPFRAYLNLT 355
Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
NQTD++ GPFE+ LSQ+++ EK V INGL+ + L +EAFGS DF LNLIA+KLGVQ+I
Sbjct: 356 NQTDRQLGPFEVSLSQDETQLEKPVGINGLQTLMLPRIEAFGSNDFQLNLIASKLGVQKI 415
Query: 420 TGITVFDKLEKITYDSLPDLEIFVDQD 446
GIT D EK TY+ +PD+EIFV+ D
Sbjct: 416 AGITALDTREKKTYELVPDMEIFVETD 442
>gi|297824907|ref|XP_002880336.1| hypothetical protein ARALYDRAFT_483987 [Arabidopsis lyrata subsp.
lyrata]
gi|297326175|gb|EFH56595.1| hypothetical protein ARALYDRAFT_483987 [Arabidopsis lyrata subsp.
lyrata]
Length = 443
Score = 594 bits (1531), Expect = e-167, Method: Compositional matrix adjust.
Identities = 297/445 (66%), Positives = 352/445 (79%), Gaps = 12/445 (2%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
MS+T G HSLAFRVMRLC+PS HV+PPLR+DP DL GED DDP +AS +SS
Sbjct: 1 MSATHGPHSLAFRVMRLCKPSFHVDPPLRIDPFDLLAGEDFSDDPSSASLFRRHVSSADA 60
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
+ SDL+YR+RFLL+ D IGLSGLL+LPQ+FGAIYLGETFCSYIS+NNSST EVRDV
Sbjct: 61 VD--SDLSYRNRFLLNHPTDPIGLSGLLLLPQSFGAIYLGETFCSYISVNNSSTSEVRDV 118
Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
IKAEIQT++QRILLLDTSKSPVESIR GGRYDFIVEHDVKELGAHTLVC+ALY+D +GE
Sbjct: 119 TIKAEIQTERQRILLLDTSKSPVESIRTGGRYDFIVEHDVKELGAHTLVCSALYNDADGE 178
Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
RKYLPQFFKF+V+NPLSVRTKVRVVK E TFLEACIENHTK+NL+MDQV+FEP+
Sbjct: 179 RKYLPQFFKFVVANPLSVRTKVRVVK-------ETTFLEACIENHTKANLFMDQVDFEPA 231
Query: 241 QNWSATMLKADGPHSD--YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQ 298
+ WSA L+ + D + S I KPPV+IRSGGGIHNYLY+L S S K Q
Sbjct: 232 KQWSAVRLQNEDSTEDPPTSGLSGLIPKPPVIIRSGGGIHNYLYKLNP-SADVSGQTKFQ 290
Query: 299 GSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKL 358
GSN+LGK QITWRTNLGEPGRLQTQQILG ++ KEI + V EVP+V+ +++PF L L
Sbjct: 291 GSNILGKFQITWRTNLGEPGRLQTQQILGAPVSRKEINMRVAEVPAVIHLNRPFPAYLNL 350
Query: 359 TNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQR 418
TNQTD++ GPFE+ LSQ++S EK V INGL+ + L +EAFGS DF LNLIA+KLGVQ+
Sbjct: 351 TNQTDRQLGPFEVSLSQDESQMEKPVGINGLQTLMLPRIEAFGSNDFQLNLIASKLGVQK 410
Query: 419 ITGITVFDKLEKITYDSLPDLEIFV 443
I+GIT D EK TY+ +P++E+ V
Sbjct: 411 ISGITALDTREKKTYELVPEMEVSV 435
>gi|357146845|ref|XP_003574132.1| PREDICTED: UPF0533 protein C5orf44-like [Brachypodium distachyon]
Length = 458
Score = 560 bits (1442), Expect = e-157, Method: Compositional matrix adjust.
Identities = 284/454 (62%), Positives = 350/454 (77%), Gaps = 19/454 (4%)
Query: 3 STPGTHSLAFRVMRLCRPSLHVEPP--LRVDPTDLFIGEDIFD--DPIAASNL------P 52
+T HSLAFRVMRL RPSL +P LR DP D+F+ ED DP AA+ L P
Sbjct: 14 ATQQNHSLAFRVMRLSRPSLRPDPAALLRFDPRDVFLPEDALTSPDPSAAAELLHGLLHP 73
Query: 53 PLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS 112
P S+ TT D T+R RFLL D AD++ L GLLVLPQAFGAIYLGETFCSYISINNS
Sbjct: 74 P-DSAVSTTAVPGDFTFRDRFLLRDPADALALPGLLVLPQAFGAIYLGETFCSYISINNS 132
Query: 113 STLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTA 172
S LE R+V+IKAEIQT++QRILLLDTSKSPVESIR+GGRYDFIVEHDVKELGAHTLVCTA
Sbjct: 133 SGLEAREVIIKAEIQTERQRILLLDTSKSPVESIRSGGRYDFIVEHDVKELGAHTLVCTA 192
Query: 173 LYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYM 232
LY+DG+ ERKYLPQFFKF VSNPLSVRTKVR +K + T+LEACIENHTKSNLYM
Sbjct: 193 LYNDGDAERKYLPQFFKFTVSNPLSVRTKVRTIK-------DTTYLEACIENHTKSNLYM 245
Query: 233 DQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSS 292
DQV+FEP++ WSAT+L+AD S + R++ K P+LIR+GGGI+NYLYQL+ S S
Sbjct: 246 DQVDFEPAEQWSATILEADEHPSVVKSTIRDLCKQPILIRAGGGIYNYLYQLRP-SSDES 304
Query: 293 SPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPF 352
S +K +GS+VLGK QITWRTNLGEPGRLQTQ I T SK+++L V+VP V+ +++PF
Sbjct: 305 SQIKAEGSSVLGKFQITWRTNLGEPGRLQTQNINSTPTPSKDVDLRAVKVPPVIFLERPF 364
Query: 353 LLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIAT 412
++ L +TNQT K GPFE++L+ N S E+K V++NGL+ + L VEAF S +F L+++AT
Sbjct: 365 MVNLCVTNQTGKTVGPFEVFLASNISGEQKAVLVNGLQKLVLPLVEAFESINFDLSMVAT 424
Query: 413 KLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD 446
+LGVQ+I+GIT++ E+ Y+ LPD+EIFVD +
Sbjct: 425 QLGVQKISGITMYAVQERKYYEPLPDIEIFVDAE 458
>gi|326514588|dbj|BAJ96281.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 553 bits (1424), Expect = e-155, Method: Compositional matrix adjust.
Identities = 276/449 (61%), Positives = 345/449 (76%), Gaps = 20/449 (4%)
Query: 8 HSLAFRVMRLCRPSLHVEPP--LRVDPTDLFIGEDIFD--DPIAASNL------PPLISS 57
HSLAFRVMRL RPSL +P LR DP D+F+ ED DP AA++ PP
Sbjct: 25 HSLAFRVMRLSRPSLRPDPAALLRFDPRDVFLPEDALTSPDPSAAADFLQGLLHPP--DP 82
Query: 58 DVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEV 117
T + D T+R RFLLHD+AD++ GLLVLPQAFGAIYLGETFCSYISINNSS LE
Sbjct: 83 GAATTVAGDFTFRDRFLLHDTADALAPPGLLVLPQAFGAIYLGETFCSYISINNSSGLEA 142
Query: 118 RDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG 177
R+V+IKAEIQT++QRILLLDTSKSPVESIR+GGRYDFIVEHDVKELGAHTLVCTALY+DG
Sbjct: 143 REVIIKAEIQTERQRILLLDTSKSPVESIRSGGRYDFIVEHDVKELGAHTLVCTALYNDG 202
Query: 178 EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
+ ERKYLPQFFKF VSNPLSVRTKVR +K + T+LEACIENHTKSNLYMDQV+F
Sbjct: 203 DAERKYLPQFFKFTVSNPLSVRTKVRTIK-------DTTYLEACIENHTKSNLYMDQVDF 255
Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
EP+Q WSAT+L+AD S + R++ K P+LIR+ GGI+NYLYQL+ S +K
Sbjct: 256 EPAQQWSATILEADEHPSVVKSTIRDLCKQPILIRAAGGIYNYLYQLRP-SSDEPGQIKT 314
Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
+GS++LGK QITWRTNLGEPGRLQTQ I T SK+++L V++P V+ +++PF++ L
Sbjct: 315 EGSSILGKFQITWRTNLGEPGRLQTQNIHSTPTPSKDVDLRAVKIPPVIFLERPFMVNLC 374
Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
LTNQT+K GPFE++L+ + S E+K V++NGL+ + L VEAF S +F L+++AT+LGVQ
Sbjct: 375 LTNQTEKTVGPFEVFLAPSVSGEQKTVLVNGLQKLVLPLVEAFESINFDLSMVATQLGVQ 434
Query: 418 RITGITVFDKLEKITYDSLPDLEIFVDQD 446
+I+GIT++ E+ Y+ LPD+EIFVD +
Sbjct: 435 KISGITLYAVQEREHYEPLPDIEIFVDAE 463
>gi|242039209|ref|XP_002466999.1| hypothetical protein SORBIDRAFT_01g018120 [Sorghum bicolor]
gi|241920853|gb|EER93997.1| hypothetical protein SORBIDRAFT_01g018120 [Sorghum bicolor]
Length = 461
Score = 530 bits (1364), Expect = e-148, Method: Compositional matrix adjust.
Identities = 278/445 (62%), Positives = 342/445 (76%), Gaps = 14/445 (3%)
Query: 8 HSLAFRVMRLCRPSLH--VEPPLRVDPTDLFIGEDIF--DDPIAASNLPP--LISSDVTT 61
HSLAFRVMRL RPSL + LR DP D+F+ ED DP AA+N L SD T
Sbjct: 25 HSLAFRVMRLSRPSLQPDLAALLRFDPRDVFLPEDALTGSDPSAAANFLDGLLHPSDSAT 84
Query: 62 NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
D T+R RFLL D AD++ L GLLVLPQ+FGAIYLGETFCSYISINNSS+ E RDVV
Sbjct: 85 AVPGDFTFRDRFLLRDPADALALPGLLVLPQSFGAIYLGETFCSYISINNSSSFEARDVV 144
Query: 122 IKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
IKAEIQT++QRILLLDTSKSPVESIR+GGRYDFIVEHDVKELGAHTLVCTALY+DG+GER
Sbjct: 145 IKAEIQTERQRILLLDTSKSPVESIRSGGRYDFIVEHDVKELGAHTLVCTALYNDGDGER 204
Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
KYLPQFFKF VSNPLSVRTKVR +K +IT+LEACIENHTKSNLYMDQV+FEP+Q
Sbjct: 205 KYLPQFFKFSVSNPLSVRTKVRTIK-------DITYLEACIENHTKSNLYMDQVDFEPAQ 257
Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
WSAT L+AD S + ++ K P+LIR+GGGI+NYLYQL+ S + K +GS+
Sbjct: 258 QWSATRLEADEHPSAVKSAIGDLCKQPILIRAGGGIYNYLYQLRS-SSDEAGQTKSEGSS 316
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
+LGK QITWRTNLGEPGRLQTQ I T SK+++L V+VP ++ +++ F++ L LTNQ
Sbjct: 317 ILGKFQITWRTNLGEPGRLQTQNIHSTPTASKDVDLRAVKVPPIIYVERAFMVNLCLTNQ 376
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
TDK GPFE++L+ + S E++ V++NG + + L VEAF S F+L+++AT+LGVQ+I+G
Sbjct: 377 TDKTVGPFEVFLAPSMSGEDRAVLVNGPQKLILPLVEAFESMKFNLSMVATQLGVQKISG 436
Query: 422 ITVFDKLEKITYDSLPDLEIFVDQD 446
IT++ EK Y+ LPD+EIFVD +
Sbjct: 437 ITMYAVQEKKYYEPLPDIEIFVDAE 461
>gi|22165060|gb|AAM93677.1| unknown protein [Oryza sativa Japonica Group]
gi|31432882|gb|AAP54458.1| expressed protein [Oryza sativa Japonica Group]
gi|218184826|gb|EEC67253.1| hypothetical protein OsI_34196 [Oryza sativa Indica Group]
Length = 473
Score = 516 bits (1329), Expect = e-144, Method: Compositional matrix adjust.
Identities = 275/453 (60%), Positives = 340/453 (75%), Gaps = 24/453 (5%)
Query: 8 HSLAFRVMRLCRPSLHVE--PPLRVDPTDLFIGEDIFDDPIAASN------------LPP 53
HSLAFRVMRL RPSL + LR DP D+F+ ED P +++ L P
Sbjct: 31 HSLAFRVMRLSRPSLQPDQAAALRFDPRDVFLPEDALTGPDPSASSAADAAAFLQGLLHP 90
Query: 54 LISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSS 113
L S T D T+R RFLL D D++ L GLLVLPQ+FGAIYLGETFCSYISINNSS
Sbjct: 91 LDSPATTV--PGDFTFRDRFLLRDPVDALALPGLLVLPQSFGAIYLGETFCSYISINNSS 148
Query: 114 TLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTAL 173
+ E RDV IKAEIQT++QRILLLDTSK+PVESIR+GGRYDFIVEHDVKELGAHTLVCTAL
Sbjct: 149 SFEARDVAIKAEIQTERQRILLLDTSKAPVESIRSGGRYDFIVEHDVKELGAHTLVCTAL 208
Query: 174 YSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMD 233
Y+DG+GERKYLPQFFKF VSNPLSVRTKVR +K + T+LEACIENHTKSNLYMD
Sbjct: 209 YNDGDGERKYLPQFFKFTVSNPLSVRTKVRTIK-------DTTYLEACIENHTKSNLYMD 261
Query: 234 QVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSS 293
QV+FEPSQ W+AT L+AD S + ++ K P+LIR+GGGI+NYLYQL+ S G S
Sbjct: 262 QVDFEPSQQWAATRLEADEHPSTVKSIIGDLCKQPILIRAGGGIYNYLYQLRP-SSGESG 320
Query: 294 PVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFL 353
K +GS++LGK QITWRTNLGEPGRLQTQ I T SK+++L V+VP V+ +++PF+
Sbjct: 321 QTKAEGSSILGKFQITWRTNLGEPGRLQTQNIHSTPTASKDVDLRAVKVPPVIFLERPFM 380
Query: 354 LKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATK 413
+ L LTNQ+DK GPFE++L+ + DEEK V++NGL+ + L VEAF S +F L+++AT+
Sbjct: 381 VNLCLTNQSDKTVGPFEVFLAPSVLDEEKYVLVNGLQKLVLPLVEAFESINFDLSMVATQ 440
Query: 414 LGVQRITGITVFDKLEKITYDSLPDLEIFVDQD 446
+GVQ+I+GIT++ EK Y+ L D+EIFVD +
Sbjct: 441 VGVQKISGITLYAVQEKKLYEPLSDIEIFVDAE 473
>gi|302757339|ref|XP_002962093.1| hypothetical protein SELMODRAFT_76214 [Selaginella moellendorffii]
gi|300170752|gb|EFJ37353.1| hypothetical protein SELMODRAFT_76214 [Selaginella moellendorffii]
Length = 439
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 215/444 (48%), Positives = 296/444 (66%), Gaps = 21/444 (4%)
Query: 1 MSSTPGT--HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSD 58
M+S G HSLAFRVMRLCRPS V+ PL VDP+D+ GED + N L+
Sbjct: 1 MTSGAGAAGHSLAFRVMRLCRPSCQVDHPLLVDPSDVCNGED-------SVNFKELLPGL 53
Query: 59 VTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVR 118
V N D + RF L + D++GLSG LVLPQ FG+IYLGETFCSYIS+ N + +VR
Sbjct: 54 VNGN---DPGFWKRFELQEPMDAMGLSGQLVLPQTFGSIYLGETFCSYISVGNHTNHDVR 110
Query: 119 DVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE 178
DV+IKAE+QT++QRI+L D SKSP+ESIRA GR+DFI+EHD+KELG HTLVC A+Y+D +
Sbjct: 111 DVIIKAELQTERQRIILSDNSKSPIESIRATGRFDFIIEHDIKELGGHTLVCMAVYTDPD 170
Query: 179 GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE 238
G+RKYLPQ+FKF SNP+SVRTKV + TFLEACIEN TKS+L+MDQV FE
Sbjct: 171 GDRKYLPQYFKFTTSNPVSVRTKV-------FDLYDTTFLEACIENQTKSHLFMDQVRFE 223
Query: 239 PSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQ 298
P+ WS T L+ + S+ + K LI GG +YL+QLK SS VK++
Sbjct: 224 PAPPWSVTTLENEEEASESDGPISGYIKSLKLINGNGGARHYLFQLKRPPL-ESSDVKLE 282
Query: 299 GSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKL 358
G+N LGKL+I WRT LGE GRLQTQQI G+ K +++ + +P + I++PFL+++++
Sbjct: 283 GANALGKLEILWRTTLGETGRLQTQQINGSPTPKKPLDVKMTNLPQRILIERPFLVRMEV 342
Query: 359 TNQTDKEQGPFEIWLSQ-NDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
TN++++ GP + +S+ +D+ + V++NGL + + P+ ST+ +NL+A GVQ
Sbjct: 343 TNRSEQFTGPLRVVMSETDDNGTPRTVLMNGLLSLMVPPLAPLASTELEVNLVAVAAGVQ 402
Query: 418 RITGITVFDKLEKITYDSLPDLEI 441
R+ GI + D + + +P E+
Sbjct: 403 RVAGICLVDARDGRQVEFVPPTEV 426
>gi|302775158|ref|XP_002970996.1| hypothetical protein SELMODRAFT_95233 [Selaginella moellendorffii]
gi|300160978|gb|EFJ27594.1| hypothetical protein SELMODRAFT_95233 [Selaginella moellendorffii]
Length = 439
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 220/451 (48%), Positives = 297/451 (65%), Gaps = 35/451 (7%)
Query: 1 MSSTPGT--HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSD 58
M+S G HSLAFRVMRLCRPS V+ PL VDP+D+ GED + N L+
Sbjct: 1 MTSGAGAAGHSLAFRVMRLCRPSCQVDHPLLVDPSDVCNGED-------SVNFKELLPGL 53
Query: 59 VTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVR 118
V N D + RF L + D++GLSG LVLPQ FG+IYLGETFCSYIS+ N + +VR
Sbjct: 54 VNGN---DPGFWKRFELQEPMDAMGLSGQLVLPQTFGSIYLGETFCSYISVGNHTNHDVR 110
Query: 119 DVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE 178
DV+IKAE+QT++QRI+L D SKSP+ESIRA GR+DFI+EHD+KELG HTLVC A+Y+D +
Sbjct: 111 DVIIKAELQTERQRIILSDNSKSPIESIRATGRFDFIIEHDIKELGGHTLVCMAVYTDPD 170
Query: 179 GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE 238
G+RKYLPQ+FKF SNP+SVRTKVR VK + TFLEACIEN TKS+L+MDQV FE
Sbjct: 171 GDRKYLPQYFKFTTSNPVSVRTKVRTVK-------DTTFLEACIENQTKSHLFMDQVRFE 223
Query: 239 PSQNWSATML-------KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGS 291
P+ WS T L ++DGP S Y K LI GG +YL+QLK
Sbjct: 224 PAPPWSVTTLENEEEASESDGPISGY-------IKSLKLINGNGGARHYLFQLKRPPL-E 275
Query: 292 SSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKP 351
SS VK++G+N LGKL+I WRT LGE GRLQTQQI G+ K +++ + +P + I++P
Sbjct: 276 SSDVKLEGANALGKLEILWRTTLGETGRLQTQQINGSPTPKKPLDVKMTNLPQRILIERP 335
Query: 352 FLLKLKLTNQTDKEQGPFEIWLSQ-NDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLI 410
FL+++++TN++++ GP + +S+ +D+ + V++NGL + + + + NL+
Sbjct: 336 FLVRMEVTNRSEQFTGPLRVVMSETDDNGTPRTVLMNGLLSLVSSRIHEDLTGTLSQNLV 395
Query: 411 ATKLGVQRITGITVFDKLEKITYDSLPDLEI 441
A GVQRI GI + D + + +P E+
Sbjct: 396 AVAAGVQRIAGICLVDARDGRQVEFVPPTEV 426
>gi|168006879|ref|XP_001756136.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162692646|gb|EDQ79002.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 518
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 218/476 (45%), Positives = 304/476 (63%), Gaps = 47/476 (9%)
Query: 1 MSSTPGT--HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSD 58
MSS PG HSLAFRVMRLCRP+L V+ LR DP DL GED+ D + L I S
Sbjct: 60 MSSGPGGTGHSLAFRVMRLCRPALQVDLGLRFDPMDLVQGEDLHD----SEELQASIES- 114
Query: 59 VTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVR 118
+ + Y R L D++GL GLLVLPQ FG+IYLGE+FCSYIS+ N S +VR
Sbjct: 115 ----RDKEGPYWRRSELEKPIDALGLPGLLVLPQTFGSIYLGESFCSYISVGNHSNHDVR 170
Query: 119 DVVIKA--------------------------EIQTDKQRILLLDTSKSPVESIRAGGRY 152
DV IKA E+QT++QR+ L D +K+P++ I AGGR+
Sbjct: 171 DVGIKASFLPGSYIAWTDNGVSRCKYGQLCGAELQTERQRVTLYDNTKAPMDFICAGGRH 230
Query: 153 DFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHF 212
DFI+EHD+KELG HTLVC A+Y+D + ERKYLPQ+FKF+ SNPLSVRTKVR+VK
Sbjct: 231 DFIIEHDIKELGPHTLVCMAVYTDADAERKYLPQYFKFMASNPLSVRTKVRIVK------ 284
Query: 213 QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS-REIFKPPVLI 271
+ T+LEACIEN TKS L++D V F+P + ++L+ + +D + + K +I
Sbjct: 285 -DTTYLEACIENSTKSLLFLDHVRFDPQPPMTVSVLEVESNENDESEGPLSGLLKQIKVI 343
Query: 272 RSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTIT 331
++ GG ++LYQ + G K GSN LGKL+I WRT LGEPGRLQTQQILG
Sbjct: 344 KANGGTRHFLYQFHKPA-GVPVSTKADGSNTLGKLEIMWRTTLGEPGRLQTQQILGNPSP 402
Query: 332 SKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDE-EKVVMINGLR 390
KE+ L +VE+PS + +++PFL+++ ++N TD+ GP +I +SQ+D+ + +++NGL
Sbjct: 403 RKEVSLRIVEIPSRILLERPFLVRMSVSNHTDRTVGPLQISMSQDDAQGVPRAIVVNGLW 462
Query: 391 IMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD 446
M + ++ STD +L+L+AT +GVQ+ITG+ + D+ + YD+L E+FV+ +
Sbjct: 463 SMTVPQLDPLASTDVNLSLVATAVGVQKITGVGLTDRRDGKPYDALTATEVFVESE 518
>gi|449530845|ref|XP_004172402.1| PREDICTED: UPF0533 protein C5orf44-like, partial [Cucumis sativus]
Length = 239
Score = 337 bits (864), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 160/244 (65%), Positives = 200/244 (81%), Gaps = 8/244 (3%)
Query: 202 VRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS 261
VRVVK + TFLEACIENHTKSNL+MDQV+FEPS NW+A ++ AD HS++ + +
Sbjct: 1 VRVVK-------DSTFLEACIENHTKSNLFMDQVDFEPSPNWNAVIINADEHHSEHKSTT 53
Query: 262 REIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQ 321
RE+FKPPVL+RSGGGIHN+LYQLK ++G SSP+KV+GSN+LGKLQITWRTN+GEPGRLQ
Sbjct: 54 REVFKPPVLVRSGGGIHNFLYQLKCSTNGPSSPLKVEGSNILGKLQITWRTNMGEPGRLQ 113
Query: 322 TQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEE 381
TQQILG+ IT KE+ELNVVE+P V+ +++PF L ++LT Q ++E GPFE+W+S N SDE+
Sbjct: 114 TQQILGSPITRKELELNVVEMPDVIRLERPFTLHMRLTTQIERELGPFEVWMSLNSSDED 173
Query: 382 KVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDS-LPDLE 440
KVVM+NGL+ + + VE +GSTDFHLNLIATK GVQRI GI VFD EK Y+ PDLE
Sbjct: 174 KVVMVNGLQKVVIPRVEPYGSTDFHLNLIATKPGVQRIAGIKVFDTREKKAYEHPSPDLE 233
Query: 441 IFVD 444
I+VD
Sbjct: 234 IYVD 237
>gi|222613087|gb|EEE51219.1| hypothetical protein OsJ_32047 [Oryza sativa Japonica Group]
Length = 402
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 164/279 (58%), Positives = 214/279 (76%), Gaps = 8/279 (2%)
Query: 168 LVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTK 227
LVCTALY+DG+GERKYLPQFFKF VSNPLSVRTKVR +K + T+LEACIENHTK
Sbjct: 132 LVCTALYNDGDGERKYLPQFFKFTVSNPLSVRTKVRTIK-------DTTYLEACIENHTK 184
Query: 228 SNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKML 287
SNLYMDQV+FEPSQ W+AT L+AD S + ++ K P+LIR+GGGI+NYLYQL+
Sbjct: 185 SNLYMDQVDFEPSQQWAATRLEADEHPSTVKSIIGDLCKQPILIRAGGGIYNYLYQLRP- 243
Query: 288 SHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVG 347
S G S K +GS++LGK QITWRTNLGEPGRLQTQ I T SK+++L V+VP V+
Sbjct: 244 SSGESGQTKAEGSSILGKFQITWRTNLGEPGRLQTQNIHSTPTASKDVDLRAVKVPPVIF 303
Query: 348 IDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHL 407
+++PF++ L LTNQ+DK GPFE++L+ + DEEK V++NGL+ + L VEAF S +F L
Sbjct: 304 LERPFMVNLCLTNQSDKTVGPFEVFLAPSVLDEEKYVLVNGLQKLVLPLVEAFESINFDL 363
Query: 408 NLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD 446
+++AT++GVQ+I+GIT++ EK Y+ L D+EIFVD +
Sbjct: 364 SMVATQVGVQKISGITLYAVQEKKLYEPLSDIEIFVDAE 402
Score = 46.2 bits (108), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 26/46 (56%), Positives = 30/46 (65%), Gaps = 4/46 (8%)
Query: 8 HSLAFRVMRLCRPSLHVE--PPLRVDPTDLFIGEDIF--DDPIAAS 49
HSLAFRVMRL RPSL + LR DP D+F+ ED DP A+S
Sbjct: 31 HSLAFRVMRLSRPSLQPDQAAALRFDPRDVFLPEDALTGPDPSASS 76
>gi|449526317|ref|XP_004170160.1| PREDICTED: UPF0533 protein C5orf44-like, partial [Cucumis sativus]
Length = 278
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 157/201 (78%), Positives = 184/201 (91%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
MS+ G+HSLAFRVMRLCRPS V+PPLR+DP DL +GEDI DDP+AA+ LP L++ ++
Sbjct: 78 MSNAQGSHSLAFRVMRLCRPSFQVDPPLRLDPVDLLVGEDILDDPVAANQLPRLLAPQLS 137
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
+ SDL+Y SRFLLHDS+D++GL+GLLVLPQAFGAIYLGETFCSYIS+NNSS EVRDV
Sbjct: 138 DDSDSDLSYSSRFLLHDSSDAMGLNGLLVLPQAFGAIYLGETFCSYISVNNSSNFEVRDV 197
Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
+IKAEIQT++QRILLLD+SKSPVE+IRAGGRYDFIVEHDVKELGAHTLVCTALY+DG+GE
Sbjct: 198 IIKAEIQTERQRILLLDSSKSPVETIRAGGRYDFIVEHDVKELGAHTLVCTALYNDGDGE 257
Query: 181 RKYLPQFFKFIVSNPLSVRTK 201
RKYLPQFFKF+V+NPLSVRTK
Sbjct: 258 RKYLPQFFKFMVANPLSVRTK 278
>gi|302757333|ref|XP_002962090.1| hypothetical protein SELMODRAFT_77366 [Selaginella moellendorffii]
gi|300170749|gb|EFJ37350.1| hypothetical protein SELMODRAFT_77366 [Selaginella moellendorffii]
Length = 318
Score = 318 bits (816), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 160/322 (49%), Positives = 226/322 (70%), Gaps = 14/322 (4%)
Query: 74 LLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRI 133
L + D++GLS LVLPQ FG+IYLGETFCSYIS+ N + +VRDV+IKAE+QT++QRI
Sbjct: 2 LPQEPMDAMGLSRQLVLPQTFGSIYLGETFCSYISVGNHTNHDVRDVIIKAELQTERQRI 61
Query: 134 LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVS 193
+L + SKSP+ESIRA G++DFI+EHD+KELG HTLVC A+Y+D +G+RKYLPQ+FKF S
Sbjct: 62 ILSNNSKSPIESIRATGQFDFIIEHDIKELGGHTLVCMAVYTDPDGDRKYLPQYFKFTTS 121
Query: 194 NPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGP 253
NP+SVRTKV + TFLEACIEN TKS+L+MDQV F+ + WS T L+
Sbjct: 122 NPVSVRTKV-------FDLYDTTFLEACIENQTKSHLFMDQVRFDTAPPWSVTTLENVVN 174
Query: 254 HSDYNAQSREIFKPPV-----LIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQI 308
+ + E++ + LI GG +YL+QLK SS VK++G+N LGKL+I
Sbjct: 175 QMVPSGKKMELYYQQLCLSLKLINGNGGARHYLFQLKR-PPLESSDVKLEGANALGKLEI 233
Query: 309 TWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGP 368
WRT LGE GRLQTQQI G+ K +++ + +P + I++PFL+++++TN++++ GP
Sbjct: 234 LWRTTLGETGRLQTQQINGSPTPKKPLDVKMTNLPQRILIERPFLVRMEVTNRSEQFTGP 293
Query: 369 FEIWLSQNDSD-EEKVVMINGL 389
+ +S+ D + + V++NGL
Sbjct: 294 LRVVMSETDDNGTPRTVLMNGL 315
>gi|414870887|tpg|DAA49444.1| TPA: hypothetical protein ZEAMMB73_593757 [Zea mays]
Length = 239
Score = 274 bits (700), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 151/207 (72%), Positives = 167/207 (80%), Gaps = 6/207 (2%)
Query: 8 HSLAFRVMRLCRPSLH--VEPPLRVDPTDLFIGEDIF--DDPIAASNL--PPLISSDVTT 61
HSLAFRVMRL RPSL + LR DP D+F+ ED DP AA+ L +D T
Sbjct: 25 HSLAFRVMRLSRPSLQPDLAALLRFDPRDVFLPEDALTGSDPSAAAKFLHGLLHPADSAT 84
Query: 62 NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
D T+R RFLL D AD++ L GLLVLPQ+FGAIYLGETFCSYISINNSS+ E RDVV
Sbjct: 85 AVPGDFTFRDRFLLRDPADALALPGLLVLPQSFGAIYLGETFCSYISINNSSSFEARDVV 144
Query: 122 IKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
IKAEIQT++QRILLLDTSKSPVESIR+GGRYDFIVEHDVKELGAHTLVCTALY+DG+GER
Sbjct: 145 IKAEIQTERQRILLLDTSKSPVESIRSGGRYDFIVEHDVKELGAHTLVCTALYNDGDGER 204
Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVG 208
KYLPQFFKF VSNPLSVRTKVR +KVG
Sbjct: 205 KYLPQFFKFSVSNPLSVRTKVRTIKVG 231
>gi|384248215|gb|EIE21700.1| DUF974-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 417
Score = 250 bits (639), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 152/447 (34%), Positives = 246/447 (55%), Gaps = 40/447 (8%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H+LAFRVMRLCRP + E P L + +D D +A + + DL
Sbjct: 2 HALAFRVMRLCRPDIPAE-----FPKGLGLRQDFLPDDLALE----------SNSGEEDL 46
Query: 68 T--YRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
T + R + + D++G+ G+L LPQ FG I+LGE F SYIS+ N S V +VVIKAE
Sbjct: 47 TGPFAHRANIENPIDALGIDGVLELPQNFGTIHLGEAFSSYISVGNYSNATVEEVVIKAE 106
Query: 126 IQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP 185
+Q+ +Q++ L +T+ +P+ + G R+DF+++HD+KE+ A+TL+C+ Y D +GE Y P
Sbjct: 107 LQSARQKMTLYETA-TPLPKLDPGERHDFLIKHDIKEISAYTLICSTSYID-KGETAYQP 164
Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQN--- 242
Q+FKF+ NPLSVRTK+R TFLEAC+EN T L + + + + +
Sbjct: 165 QYFKFVAQNPLSVRTKIR-------SLTRQTFLEACVENLTSRPLVLAYIRLDAAPSVVA 217
Query: 243 WSATMLKADG-PHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS- 300
A+ +DG P D + S + + I GG N+LY L H S + GS
Sbjct: 218 VPASSAWSDGEPSKDAESSSLGSYADSLQIVDAGGSSNFLYAL----HSSKASPAEAGSA 273
Query: 301 --NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKL 358
LGK++I WR NLG+ GRLQTQQI+ + SK++EL + +P V ++ PF K+ +
Sbjct: 274 LTGALGKMEIRWRGNLGKLGRLQTQQIMANAVNSKDVELLLTSLPQAVHLEIPFAAKVTV 333
Query: 359 TNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQR 418
+ D+ + + + + E +++ L ++ ++A+GS+ L+ K G+Q+
Sbjct: 334 RSNVDRTLENLALRVPEQPA--EGGLVVEDLSSTVVSRLDAYGSSSVVCTLLPMKEGLQK 391
Query: 419 ITGITVFDKLEKITYDSLPDLEIFVDQ 445
+ + + + + D + D++ FV++
Sbjct: 392 LQAVELISQQDGRILDVM-DIDCFVNR 417
>gi|303270983|ref|XP_003054853.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462827|gb|EEH60105.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 500
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 157/515 (30%), Positives = 239/515 (46%), Gaps = 102/515 (19%)
Query: 2 SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTT 61
++ P ++ FRVMR C P+L ++ P R F +D+ P A S T
Sbjct: 16 AAAPLPQAIQFRVMRTCAPTLKIDTPSR------FALDDLGHPPCAPS-----------T 58
Query: 62 NKSSDLTYRSRFLLH-DSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
+ SSD+ + SR L ++ + G++G L LPQAFG +YLGETF +Y+S NSS VRDV
Sbjct: 59 STSSDVAFESRVDLGLRASRASGVTGTLCLPQAFGNVYLGETFAAYVSAINSSDRVVRDV 118
Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
KAE+QT+++R+ L D + ++ G +DF HD+KELGAHTLVC +Y+D +GE
Sbjct: 119 SFKAELQTERRRVALFDNAAEAAPTMPPGATFDFTATHDLKELGAHTLVCGVVYTDADGE 178
Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
RKY PQ+FKF +NPL+VRTKVR + G LEACIEN T + L + + FEP
Sbjct: 179 RKYAPQYFKFNAANPLAVRTKVRPGRDGR------ALLEACIENATPAPLLLSRATFEPC 232
Query: 241 QNW------------SATMLKADGPHSDYNAQSREIF----------------------- 265
+ + ++ PH
Sbjct: 233 AHLECDEIVPACVSGAGVVIPEGDPHRGEEGGGGGGGGGGGARDAAAAGGSGLGEGLPSL 292
Query: 266 --KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQ 323
+P ++ GG ++L++L+ P S+ LGKL+I W + GE GRLQTQ
Sbjct: 293 ANRPLRVLSPQGGSTHFLFELRQ------RPDITVTSDTLGKLEIRWTGHNGEAGRLQTQ 346
Query: 324 QILGTT-ITSKEIELNVVE--VPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSD- 379
QI+G+ I K++E+ P + P L +TN+T E+ ++Q DSD
Sbjct: 347 QIVGSPRIGGKDVEVAFAHGAPPKTARVHAPLTLSCVVTNKTASATRALEV-IAQPDSDV 405
Query: 380 ------------------------EEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLG 415
++++G + +A+ + G L + T G
Sbjct: 406 VGGGATGGGGGATGATGGATGGGGGVAGILVDGPQRIAIGALPPGGERRVELTCVPTLPG 465
Query: 416 VQRITGITVF------DKLEKITYDSLPDLEIFVD 444
+R+ ++V D +D L E+ V+
Sbjct: 466 TRRLPIVSVAEARGDGDARGGRVFDQLARFEVLVE 500
>gi|347582612|ref|NP_001231572.1| UPF0533 protein C5orf44 homolog isoform 1 [Danio rerio]
Length = 418
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 141/433 (32%), Positives = 223/433 (51%), Gaps = 48/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF+ L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNMPVTCEDRDLPGDLFLR---------------LMKDDPSTVK 54
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
A+++ L +L LPQ FG I+LGETF SYIS++N S+ V+D+++K
Sbjct: 55 G--------------AETLILGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S S V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNSAVSELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKLY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMY 219
Query: 244 SATMLK--ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L A G S + + + P+ R YLY LK + ++G
Sbjct: 220 NVTELNNVASGDESSESTFGKMSYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVT 273
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 274 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEFIPDTVDLEEPFDITCKITNC 333
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+++ ++ L ++ ++G ++ L+P S L L+++ G+Q I+G
Sbjct: 334 SERT---MDLLLEMCNTRSVHWCGVSGRQLGKLSPS---ASLSIPLKLLSSVQGLQSISG 387
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 388 LRLTDTFLKRTYE 400
>gi|197100367|ref|NP_001125291.1| UPF0533 protein C5orf44 homolog [Pongo abelii]
gi|75042171|sp|Q5RCG0.1|CE044_PONAB RecName: Full=UPF0533 protein C5orf44 homolog
gi|55727584|emb|CAH90547.1| hypothetical protein [Pongo abelii]
Length = 417
Score = 207 bits (528), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 138/433 (31%), Positives = 217/433 (50%), Gaps = 49/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLK--ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L + S SR +P YLY LK + ++G
Sbjct: 220 NVTELNSVSQAGESVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+++ ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 386
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 387 LRLTDTFLKRTYE 399
>gi|148277000|ref|NP_079217.2| UPF0533 protein C5orf44 isoform 2 [Homo sapiens]
gi|206558220|sp|A5PLN9.2|CE044_HUMAN RecName: Full=UPF0533 protein C5orf44
gi|119571728|gb|EAW51343.1| hypothetical protein FLJ13611, isoform CRA_a [Homo sapiens]
gi|410217874|gb|JAA06156.1| chromosome 5 open reading frame 44 [Pan troglodytes]
gi|410217876|gb|JAA06157.1| chromosome 5 open reading frame 44 [Pan troglodytes]
gi|410249602|gb|JAA12768.1| chromosome 5 open reading frame 44 [Pan troglodytes]
gi|410249604|gb|JAA12769.1| chromosome 5 open reading frame 44 [Pan troglodytes]
gi|410249606|gb|JAA12770.1| chromosome 5 open reading frame 44 [Pan troglodytes]
gi|410292066|gb|JAA24633.1| chromosome 5 open reading frame 44 [Pan troglodytes]
gi|410292068|gb|JAA24634.1| chromosome 5 open reading frame 44 [Pan troglodytes]
gi|410292070|gb|JAA24635.1| chromosome 5 open reading frame 44 [Pan troglodytes]
gi|410339455|gb|JAA38674.1| chromosome 5 open reading frame 44 [Pan troglodytes]
Length = 417
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 137/433 (31%), Positives = 219/433 (50%), Gaps = 49/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L + + + SR +P YLY LK + + ++G
Sbjct: 220 NVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKNEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+++ ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 386
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 387 LRLTDTFLKRTYE 399
>gi|348524306|ref|XP_003449664.1| PREDICTED: UPF0533 protein C5orf44 homolog [Oreochromis niloticus]
Length = 417
Score = 207 bits (527), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 145/431 (33%), Positives = 226/431 (52%), Gaps = 45/431 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P + DL D+F L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNLPATCEDRDL--PGDLFGQ---------LMRQDPSTIKG--- 55
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A+++ L +L LPQ FG I+LGETF SYIS++N S+ V+D+++KA++Q
Sbjct: 56 -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S S V ++ D ++ H+VKE+G H LVC Y+ +GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNSAVAELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQQGEKLYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMYNVTE 223
Query: 248 L----KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
L +AD S + S + P+ R YLY LK + ++G V+
Sbjct: 224 LNMVTQADKGESTFGKMS---YLQPMDTR------QYLYCLKPKPEYAEKAGIIKGVTVI 274
Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
GKL I W+TNLGE GRLQT Q+ +I L++ +P V +++PF + K+TN ++
Sbjct: 275 GKLDIVWKTNLGERGRLQTSQLQRMAPGYGDIRLSLDLIPDTVNLEEPFDIICKITNCSE 334
Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
+ ++ L ++ I+G ++ L+P AF S L ++++ G+Q I+G+
Sbjct: 335 RT---MDLVLEMCNTSSIHWCGISGRQLGKLSP-GAFLS--LPLTVLSSVQGLQSISGLR 388
Query: 424 VFDKLEKITYD 434
+ D K TY+
Sbjct: 389 LTDTFLKRTYE 399
>gi|332233704|ref|XP_003266043.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Nomascus
leucogenys]
Length = 418
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 138/433 (31%), Positives = 217/433 (50%), Gaps = 48/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
S T L + + + SR +P YLY LK + ++G
Sbjct: 220 SVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 333 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 387
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 388 LRLTDTFLKRTYE 400
>gi|318102158|ref|NP_001187397.1| upf0533 protein c5orf44-like protein [Ictalurus punctatus]
gi|308322905|gb|ADO28590.1| upf0533 protein c5orf44-like protein [Ictalurus punctatus]
Length = 417
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 139/431 (32%), Positives = 216/431 (50%), Gaps = 45/431 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA + MRL +P+L P+ + P DLF G + +DP PL+
Sbjct: 10 HLLALKAMRLTKPTLFTNMPVTCEDRDLPGDLF-GRLMREDPSTIKGAEPLM-------- 60
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
L +L LPQ FG I+LGETF SYIS++N ST V+D+++K
Sbjct: 61 --------------------LGEMLTLPQNFGNIFLGETFSSYISVHNDSTQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S S V ++ D ++ H+VKE+G H LVC Y+ G++ Y
Sbjct: 101 ADLQTSSQR-LNLSASNSAVAELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQTGDKLY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMY 219
Query: 244 SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
+ T L ++ + + RE + YLY LK + ++G V+
Sbjct: 220 NVTEL-----NTVCSGEERESTFGKMSYLQPMDTRQYLYCLKPKPEFAEKAGVIKGVTVI 274
Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
GKL I W+TNLGE GRLQT Q+ ++ L++ VP V I++PF + K+TN ++
Sbjct: 275 GKLDIVWKTNLGEKGRLQTSQLQRMAPGYGDVRLSLELVPDTVNIEEPFDITCKITNCSE 334
Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
+ ++ L ++ ++G ++ L P S L L+++ G+Q I+G+
Sbjct: 335 RT---MDLLLEMCNTRSVHWCGVSGRQLGKLGPS---ASLSIPLQLLSSVQGLQSISGLR 388
Query: 424 VFDKLEKITYD 434
+ D K TY+
Sbjct: 389 LTDTFLKRTYE 399
>gi|148277002|ref|NP_001087224.1| UPF0533 protein C5orf44 isoform 1 [Homo sapiens]
gi|114600020|ref|XP_517735.2| PREDICTED: UPF0533 protein C5orf44 homolog isoform 4 [Pan
troglodytes]
gi|397514419|ref|XP_003827485.1| PREDICTED: UPF0533 protein C5orf44 homolog [Pan paniscus]
gi|119571733|gb|EAW51348.1| hypothetical protein FLJ13611, isoform CRA_f [Homo sapiens]
Length = 418
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 137/433 (31%), Positives = 218/433 (50%), Gaps = 48/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L + + + SR +P YLY LK + + ++G
Sbjct: 220 NVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKNEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 333 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 387
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 388 LRLTDTFLKRTYE 400
>gi|347582610|ref|NP_955832.2| UPF0533 protein C5orf44 homolog isoform 2 [Danio rerio]
gi|190360173|sp|Q6PBY7.2|CE044_DANRE RecName: Full=UPF0533 protein C5orf44 homolog
Length = 412
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 141/433 (32%), Positives = 221/433 (51%), Gaps = 54/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF+ L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNMPVTCEDRDLPGDLFLR---------------LMKDDPSTVK 54
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
A+++ L +L LPQ FG I+LGETF SYIS++N S+ V+D+++K
Sbjct: 55 G--------------AETLILGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S S V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNSAVSELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKLY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPSMMY 213
Query: 244 SATMLK--ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L A G S + + + P+ R YLY LK + ++G
Sbjct: 214 NVTELNNVASGDESSESTFGKMSYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVT 267
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 268 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEFIPDTVDLEEPFDITCKITNC 327
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+++ ++ L ++ ++G ++ L+P S L L+++ G+Q I+G
Sbjct: 328 SERT---MDLLLEMCNTRSVHWCGVSGRQLGKLSPS---ASLSIPLKLLSSVQGLQSISG 381
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 382 LRLTDTFLKRTYE 394
>gi|403267437|ref|XP_003925839.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Saimiri
boliviensis boliviensis]
Length = 418
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 137/433 (31%), Positives = 218/433 (50%), Gaps = 48/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLKADGPHSDYNA--QSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L + + + +SR +P YLY LK + ++G
Sbjct: 220 NVTELNSVSQAGECVSTFRSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + ++ L +++ I+G ++ L P + L LI++ G+Q ++G
Sbjct: 333 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLISSVQGLQSVSG 387
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 388 LRLTDTFLKRTYE 400
>gi|47228413|emb|CAG05233.1| unnamed protein product [Tetraodon nigroviridis]
Length = 410
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 142/431 (32%), Positives = 218/431 (50%), Gaps = 45/431 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNLPVTCEDRDL--PGDLFSQ---------LMREDPSTIKG--- 55
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A+++ L +L LPQ FG I+LGETF SYIS++N S+ V+D+++KA++Q
Sbjct: 56 -----------AENLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S S V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNSAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTSQYGEKLYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMYNVTE 223
Query: 248 LKA----DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
L D + S + P+ R YLY LK + ++G V+
Sbjct: 224 LNMGTSRDTEECTFGKMS---YLQPMDTR------QYLYCLKPKPEYAEKAGVIKGVTVI 274
Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
GKL I W+TNLGE GRLQT Q+ +I L++ +P V +++PF L K+TN ++
Sbjct: 275 GKLDIVWKTNLGERGRLQTSQLQRMAPGYGDIRLSLEVIPDTVNLEEPFDLICKITNCSE 334
Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
+ ++ L ++ +G ++ L P S L L ++ G+Q I+G+
Sbjct: 335 R---TMDLVLEMCNTASIHWCGTSGRKLGKLGPA---ASLSLPLTLFSSVQGLQSISGLR 388
Query: 424 VFDKLEKITYD 434
+ D K TY+
Sbjct: 389 LKDTFLKRTYE 399
>gi|432884725|ref|XP_004074559.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Oryzias
latipes]
Length = 417
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 143/438 (32%), Positives = 227/438 (51%), Gaps = 45/438 (10%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
++ T H LA +VMRL +P+L P+ + DL D+F L+ D +
Sbjct: 3 VNQTKQEHLLALKVMRLTKPTLFTNLPVTCEERDL--PGDLFGQ---------LMRQDPS 51
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
T K A+++ L +L LPQ FG I+LGETF SYIS++N ST V+++
Sbjct: 52 TIKG--------------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSTQIVKEI 97
Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
++KA++QT QR L L TS S V ++ D ++ H+VKE+G H LVC Y+ GE
Sbjct: 98 LVKADLQTSSQR-LNLSTSNSAVAELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQLGE 156
Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
+ Y +FFKF V PL V+TK + + + FLEA I+N T S ++M++V EP+
Sbjct: 157 KLYFRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPT 216
Query: 241 QNWSATMLK----ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
++ T L D S + S + P+ R YLY LK + +
Sbjct: 217 IMYNVTELNTVASGDDGESTFGKMS---YLQPMDTR------QYLYCLKPKAEYAEKAGV 267
Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
++G ++GKL I WRTNLGE GRLQT Q+ +I L++ +P V +++PF +
Sbjct: 268 IKGVTMIGKLDIVWRTNLGEKGRLQTSQLQRMAPGYGDIRLSLEIIPDTVNLEEPFDIVC 327
Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
K+TN +++ ++ + ++ I+G ++ L+P GS L + ++ G+
Sbjct: 328 KITNCSER---TMDLVVEMCNTRSIHWCGISGRQLGKLSP---GGSLLVPLTIFSSVQGL 381
Query: 417 QRITGITVFDKLEKITYD 434
Q I+G+ + D K TY+
Sbjct: 382 QSISGLRLTDTFLKRTYE 399
>gi|344272589|ref|XP_003408114.1| PREDICTED: UPF0533 protein C5orf44 homolog [Loxodonta africana]
Length = 418
Score = 204 bits (519), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 136/429 (31%), Positives = 219/429 (51%), Gaps = 40/429 (9%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S+ V+D+++KA++Q
Sbjct: 53 --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASSAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYATQSGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + + FLEA I+N T S ++M++V EPS ++
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITNSPMFMEKVSLEPSIMYNVAE 223
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L A + + SR +P YLY LK + ++G V+GK
Sbjct: 224 LNAVNQAGECISTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 336
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ +
Sbjct: 337 T--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 391
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 392 DTFLKRTYE 400
>gi|224090703|ref|XP_002190150.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Taeniopygia
guttata]
Length = 417
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 136/429 (31%), Positives = 220/429 (51%), Gaps = 40/429 (9%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL ++F+ L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDL--PGNLFNQ---------LMKDDPSTVKG--- 55
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 56 -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASSAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + + FLEA I+N T S ++M++V EPS ++
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMYNVAE 223
Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
L D +S F ++ YLY LK + ++G V+GKL
Sbjct: 224 LNT----VDTAGESESTFGTRTYLQP-MDTRQYLYCLKPKQEFAEKAGVIKGVTVIGKLD 278
Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 279 IVWKTNLGEHGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFDITCKITNCSSER-- 336
Query: 368 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFH--LNLIATKLGVQRITGITVF 425
++ L +++ ++G ++ L P S+ H L L+++ G+Q ++G+ +
Sbjct: 337 TMDLVLEMCNTNSIHWCGVSGRQLGKLYP-----SSSLHLALTLLSSVQGLQSVSGLRLT 391
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 392 DTFLKRTYE 400
>gi|303304982|ref|NP_001181925.1| uncharacterized protein LOC427165 isoform 1 [Gallus gallus]
Length = 418
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 136/429 (31%), Positives = 218/429 (50%), Gaps = 40/429 (9%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL ++F+ L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDL--PGNLFNQ---------LMKDDPSTVKG--- 55
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 56 -----------AEALMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASTAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + + FLEA I+N T S ++M++V EPS ++
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVAE 223
Query: 248 LKA--DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L S+ SR +P YLY LK + ++G V+GK
Sbjct: 224 LNTVDSAGESESTFGSRTYLQP-------MDTRQYLYCLKPKQEFAEKAGVIKGVTVIGK 276
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFDITCKITNCSSER 336
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L +++ ++G ++ L P S L L+++ G+Q ++G+ +
Sbjct: 337 --TMDLVLEMCNTNSIHWCGVSGRQLGKLHPS---SSLRLALTLLSSVQGLQSVSGLRLT 391
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 392 DTFLKRTYE 400
>gi|334325202|ref|XP_001381439.2| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Monodelphis
domestica]
Length = 418
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 137/435 (31%), Positives = 220/435 (50%), Gaps = 52/435 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEDRDLPGDLF-NQLMKDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQIVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V +++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNATVAELKSDCCIDDVIHHEVKEIGTHILVCAVSYTTQSGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMY 219
Query: 244 SA----TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
+ T+ +A S + SR +P YLY LK + ++G
Sbjct: 220 NVVELNTVKQAGEGMSTFG--SRTYLQPM-------DTRQYLYCLKPKQEFAEKAGIIKG 270
Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+T
Sbjct: 271 VTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKIT 330
Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
N + + ++ L +++ ++G ++ L P S L L+++ G+Q +
Sbjct: 331 NCSSER--TMDLVLEMCNTNSIHWCGVSGRQLGKLNPS---SSLYLALTLLSSVQGLQSV 385
Query: 420 TGITVFDKLEKITYD 434
+G+ + D K TY+
Sbjct: 386 SGLRLTDTFLKRTYE 400
>gi|207079887|ref|NP_001128904.1| DKFZP459P083 protein [Pongo abelii]
gi|55733284|emb|CAH93324.1| hypothetical protein [Pongo abelii]
Length = 411
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 138/433 (31%), Positives = 215/433 (49%), Gaps = 55/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMY 213
Query: 244 SATMLKA--DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L + S SR +P YLY LK + ++G
Sbjct: 214 NVTELNSVSQAGESVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+++ ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 327 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 380
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 381 LRLTDTFLKRTYE 393
>gi|156120529|ref|NP_001095410.1| UPF0533 protein C5orf44 homolog [Bos taurus]
gi|189042269|sp|A7MB76.1|CE044_BOVIN RecName: Full=UPF0533 protein C5orf44 homolog
gi|154425662|gb|AAI51377.1| LOC511108 protein [Bos taurus]
gi|296475854|tpg|DAA17969.1| TPA: hypothetical protein LOC511108 [Bos taurus]
Length = 417
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 135/433 (31%), Positives = 216/433 (49%), Gaps = 49/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQSGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ L + + SR +P YLY LK + ++G
Sbjct: 220 NVAELNSVNQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+++ ++ L +++ I+G ++ L P + L L+++ G+Q ++G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 386
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 387 LRLTDTFLKRTYE 399
>gi|441658593|ref|XP_003266042.2| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Nomascus
leucogenys]
Length = 412
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 138/433 (31%), Positives = 215/433 (49%), Gaps = 54/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMY 213
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
S T L + + + SR +P YLY LK + ++G
Sbjct: 214 SVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 327 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 381
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 382 LRLTDTFLKRTYE 394
>gi|148277004|ref|NP_001087225.1| UPF0533 protein C5orf44 isoform 3 [Homo sapiens]
gi|119571729|gb|EAW51344.1| hypothetical protein FLJ13611, isoform CRA_b [Homo sapiens]
gi|410217878|gb|JAA06158.1| chromosome 5 open reading frame 44 [Pan troglodytes]
Length = 411
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 137/433 (31%), Positives = 217/433 (50%), Gaps = 55/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMY 213
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L + + + SR +P YLY LK + + ++G
Sbjct: 214 NVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKNEFAEKAGIIKGVT 266
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+++ ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 327 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 380
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 381 LRLTDTFLKRTYE 393
>gi|355734989|gb|AES11515.1| hypothetical protein [Mustela putorius furo]
Length = 416
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 135/433 (31%), Positives = 216/433 (49%), Gaps = 49/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLKADGPHSDY--NAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ L + + SR +P YLY LK + ++G
Sbjct: 220 NVAELNSVSQAGECLTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNX 332
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+++ ++ L +++ I+G ++ L P + L L+++ G+Q ++G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 386
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 387 LRLTDTFLKRTYE 399
>gi|417400575|gb|JAA47218.1| Hypothetical protein [Desmodus rotundus]
Length = 417
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 135/433 (31%), Positives = 217/433 (50%), Gaps = 49/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S+ V+D+++K
Sbjct: 54 -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ L + + SR +P YLY LK + ++G
Sbjct: 220 NVAELNSVNQAGECVTTFGSRTYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+++ ++ L +++ I+G ++ L P + L L+++ G+Q ++G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 386
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 387 LRLTDTFLKRTYE 399
>gi|148276985|ref|NP_001087228.1| UPF0533 protein C5orf44 homolog isoform 2 [Mus musculus]
gi|123793268|sp|Q3TIR1.1|CE044_MOUSE RecName: Full=UPF0533 protein C5orf44 homolog
gi|74198618|dbj|BAE39785.1| unnamed protein product [Mus musculus]
Length = 417
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 136/429 (31%), Positives = 220/429 (51%), Gaps = 41/429 (9%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 223
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + + SR +P YLY LK + ++G V+GK
Sbjct: 224 LNSVTQAGECISTFGSRGYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN +++
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSERM 336
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ +
Sbjct: 337 ---MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 390
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 391 DTFLKRTYE 399
>gi|395510370|ref|XP_003759450.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Sarcophilus
harrisii]
Length = 418
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 135/433 (31%), Positives = 218/433 (50%), Gaps = 48/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDLPGDLF-NQLMKDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQIVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V +++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNATVAELKSDCCIDDVIHHEVKEIGTHILVCAVSYTTQSGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMY 219
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ L + + SR +P YLY LK + + ++G
Sbjct: 220 NVVELNTVKQVGEGVSTFGSRTYLQPM-------DTRQYLYCLKPKAEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + ++ L +++ ++G ++ L P S L L+++ G+Q ++G
Sbjct: 333 SSER--TMDLVLEMCNTNSIHWCGVSGRQLGKLNPS---SSLYLALTLLSSVQGLQSVSG 387
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 388 LRLTDTFLKRTYE 400
>gi|426246393|ref|XP_004016979.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Ovis aries]
Length = 417
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 135/433 (31%), Positives = 216/433 (49%), Gaps = 49/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ L + + SR +P YLY LK + ++G
Sbjct: 220 NVAELNSVNQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+++ ++ L +++ I+G ++ L P + L L+++ G+Q ++G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 386
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 387 LRLTDTFLKRTYE 399
>gi|148276983|ref|NP_080155.3| UPF0533 protein C5orf44 homolog isoform 1 [Mus musculus]
gi|112180396|gb|AAH21756.3| 2410002O22Rik protein [Mus musculus]
gi|148686556|gb|EDL18503.1| RIKEN cDNA 2410002O22, isoform CRA_b [Mus musculus]
Length = 418
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 136/429 (31%), Positives = 219/429 (51%), Gaps = 40/429 (9%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 223
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + + SR +P YLY LK + ++G V+GK
Sbjct: 224 LNSVTQAGECISTFGSRGYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 336
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ +
Sbjct: 337 M--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 391
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 392 DTFLKRTYE 400
>gi|344179108|ref|NP_001230666.1| UPF0533 protein C5orf44 isoform 4 [Homo sapiens]
gi|397514417|ref|XP_003827484.1| PREDICTED: UPF0533 protein C5orf44 homolog [Pan paniscus]
gi|410039323|ref|XP_001163636.3| PREDICTED: UPF0533 protein C5orf44 homolog isoform 3 [Pan
troglodytes]
gi|119571730|gb|EAW51345.1| hypothetical protein FLJ13611, isoform CRA_c [Homo sapiens]
Length = 412
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 137/433 (31%), Positives = 216/433 (49%), Gaps = 54/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMY 213
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L + + + SR +P YLY LK + + ++G
Sbjct: 214 NVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKNEFAEKAGIIKGVT 266
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 327 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 381
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 382 LRLTDTFLKRTYE 394
>gi|148745378|gb|AAI42995.1| C5orf44 protein [Homo sapiens]
Length = 412
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 137/433 (31%), Positives = 216/433 (49%), Gaps = 54/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMY 213
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L + + + SR +P YLY LK + + ++G
Sbjct: 214 NVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKNEFAEKAGIIKGVT 266
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 327 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 381
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 382 LRLTDTFLKRTYE 394
>gi|403267435|ref|XP_003925838.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Saimiri
boliviensis boliviensis]
Length = 412
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 137/433 (31%), Positives = 216/433 (49%), Gaps = 54/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMY 213
Query: 244 SATMLKADGPHSDYNA--QSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L + + + +SR +P YLY LK + ++G
Sbjct: 214 NVTELNSVSQAGECVSTFRSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + ++ L +++ I+G ++ L P + L LI++ G+Q ++G
Sbjct: 327 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLISSVQGLQSVSG 381
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 382 LRLTDTFLKRTYE 394
>gi|296194459|ref|XP_002744954.1| PREDICTED: UPF0533 protein C5orf44 isoform 2 [Callithrix jacchus]
Length = 412
Score = 202 bits (513), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 137/433 (31%), Positives = 216/433 (49%), Gaps = 54/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMY 213
Query: 244 SATMLKADGPHSDYNA--QSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L + + + +SR +P YLY LK + ++G
Sbjct: 214 NVTELNSVSQAGECVSTFRSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 327 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 381
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 382 LRLTDTFLKRTYE 394
>gi|348041260|ref|NP_001013930.2| UPF0533 protein C5orf44 homolog [Rattus norvegicus]
gi|190360171|sp|Q5M887.2|CE044_RAT RecName: Full=UPF0533 protein C5orf44 homolog
gi|149059250|gb|EDM10257.1| similar to RIKEN cDNA 2410002O22 gene, isoform CRA_a [Rattus
norvegicus]
Length = 418
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 136/429 (31%), Positives = 218/429 (50%), Gaps = 40/429 (9%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 223
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + + SR +P YLY LK + ++G V+GK
Sbjct: 224 LNSVNQAGECVSTFGSRGYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 336
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L ++ I+G ++ L P + L L+++ G+Q ++G+ +
Sbjct: 337 T--MDLVLEMCNTTSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 391
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 392 DTFLKRTYE 400
>gi|449278704|gb|EMC86495.1| UPF0533 protein C5orf44 like protein [Columba livia]
Length = 410
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 137/431 (31%), Positives = 218/431 (50%), Gaps = 48/431 (11%)
Query: 10 LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
L F VMRL +P+L P+ + DL + L+ D +T K
Sbjct: 5 LIFAVMRLTKPTLFTNIPVTCEERDL-----------PGNLFTQLMKDDPSTVKG----- 48
Query: 70 RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT
Sbjct: 49 ---------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTS 99
Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFK 189
QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +FFK
Sbjct: 100 SQR-LNLSASSAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKFFK 158
Query: 190 FIVSNPLSVRTKVRVVKV--GATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
F V PL V+TK +V + E+ FLEA I+N T S ++M++V EPS ++
Sbjct: 159 FQVLKPLDVKTKFYNAEVSESCVYLDEV-FLEAQIQNITTSPMFMEKVSLEPSMMYNVAE 217
Query: 248 LKA--DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L S+ SR +P YLY LK + ++G V+GK
Sbjct: 218 LNTVDTAGESESTFGSRTYLQPM-------DTRQYLYCLKPKQEFAEKAGVIKGVTVIGK 270
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN +++
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFDITCKITNCSER- 329
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFH--LNLIATKLGVQRITGIT 423
++ L +++ ++G ++ L P S+ H L L+++ G+Q ++G+
Sbjct: 330 --TMDLVLEMCNTNSIHWCGVSGRQLGKLHP-----SSSLHLALTLLSSVQGLQSVSGLR 382
Query: 424 VFDKLEKITYD 434
+ D K TY+
Sbjct: 383 LTDTFLKRTYE 393
>gi|345794146|ref|XP_535257.3| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Canis lupus
familiaris]
Length = 418
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 135/433 (31%), Positives = 215/433 (49%), Gaps = 48/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ L + + SR +P YLY LK + ++G
Sbjct: 220 NVAELNSVSQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + ++ L +++ I+G ++ L P + L L+++ G+Q ++G
Sbjct: 333 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 387
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 388 LRLTDTFLKRTYE 400
>gi|109706942|gb|AAI17129.1| C5orf44 protein [Homo sapiens]
gi|219520363|gb|AAI43694.1| C5orf44 protein [Homo sapiens]
Length = 400
Score = 200 bits (509), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 136/431 (31%), Positives = 216/431 (50%), Gaps = 55/431 (12%)
Query: 10 LAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 1 LALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV----------------- 42
Query: 66 DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA+
Sbjct: 43 -----------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKAD 91
Query: 126 IQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP 185
+QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 92 LQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFR 150
Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS ++
Sbjct: 151 KFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMYNV 204
Query: 246 TMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
T L + + + SR +P YLY LK + + ++G V+
Sbjct: 205 TELNSVSQAGECVSTFGSRAYLQPM-------DTRQYLYCLKPKNEFAEKAGIIKGVTVI 257
Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN ++
Sbjct: 258 GKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSE 317
Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
+ ++ L +++ I+G ++ L P + L L+++ G+Q I+G+
Sbjct: 318 R---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLR 371
Query: 424 VFDKLEKITYD 434
+ D K TY+
Sbjct: 372 LTDTFLKRTYE 382
>gi|347922196|ref|NP_001231675.1| uncharacterized protein LOC100513053 [Sus scrofa]
Length = 417
Score = 200 bits (509), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 137/434 (31%), Positives = 216/434 (49%), Gaps = 51/434 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLKA---DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS 300
+ L + DG SR +P YLY LK + ++G
Sbjct: 220 NVAELNSVNQDG-ECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGV 271
Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 272 TVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFHITCKITN 331
Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT 420
+++ ++ L +++ I+G ++ L P + L L+++ G Q ++
Sbjct: 332 CSER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGPQSVS 385
Query: 421 GITVFDKLEKITYD 434
G+ + D K TY+
Sbjct: 386 GLRLTDTFLKRTYE 399
>gi|303304975|ref|NP_001006577.2| uncharacterized protein LOC427165 isoform 2 [Gallus gallus]
Length = 411
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 136/429 (31%), Positives = 217/429 (50%), Gaps = 47/429 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL ++F+ L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDL--PGNLFNQ---------LMKDDPSTVKG--- 55
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 56 -----------AEALMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASTAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVAE 217
Query: 248 LKA--DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L S+ SR +P YLY LK + ++G V+GK
Sbjct: 218 LNTVDSAGESESTFGSRTYLQP-------MDTRQYLYCLKPKQEFAEKAGVIKGVTVIGK 270
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN +++
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFDITCKITNCSER- 329
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L +++ ++G ++ L P S L L+++ G+Q ++G+ +
Sbjct: 330 --TMDLVLEMCNTNSIHWCGVSGRQLGKLHPS---SSLRLALTLLSSVQGLQSVSGLRLT 384
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 385 DTFLKRTYE 393
>gi|432884723|ref|XP_004074558.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Oryzias
latipes]
Length = 411
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 143/438 (32%), Positives = 225/438 (51%), Gaps = 51/438 (11%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
++ T H LA +VMRL +P+L P+ + DL D+F L+ D +
Sbjct: 3 VNQTKQEHLLALKVMRLTKPTLFTNLPVTCEERDL--PGDLFGQ---------LMRQDPS 51
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
T K A+++ L +L LPQ FG I+LGETF SYIS++N ST V+++
Sbjct: 52 TIKG--------------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSTQIVKEI 97
Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
++KA++QT QR L L TS S V ++ D ++ H+VKE+G H LVC Y+ GE
Sbjct: 98 LVKADLQTSSQR-LNLSTSNSAVAELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQLGE 156
Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
+ Y +FFKF V PL V+TK + FLEA I+N T S ++M++V EP+
Sbjct: 157 KLYFRKFFKFQVLKPLDVKTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPT 210
Query: 241 QNWSATMLK----ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
++ T L D S + S + P+ R YLY LK + +
Sbjct: 211 IMYNVTELNTVASGDDGESTFGKMS---YLQPMDTR------QYLYCLKPKAEYAEKAGV 261
Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
++G ++GKL I WRTNLGE GRLQT Q+ +I L++ +P V +++PF +
Sbjct: 262 IKGVTMIGKLDIVWRTNLGEKGRLQTSQLQRMAPGYGDIRLSLEIIPDTVNLEEPFDIVC 321
Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
K+TN +++ ++ + ++ I+G ++ L+P GS L + ++ G+
Sbjct: 322 KITNCSER---TMDLVVEMCNTRSIHWCGISGRQLGKLSP---GGSLLVPLTIFSSVQGL 375
Query: 417 QRITGITVFDKLEKITYD 434
Q I+G+ + D K TY+
Sbjct: 376 QSISGLRLTDTFLKRTYE 393
>gi|198423527|ref|XP_002129801.1| PREDICTED: similar to UPF0533 protein isoform 2 [Ciona
intestinalis]
Length = 396
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 137/429 (31%), Positives = 219/429 (51%), Gaps = 52/429 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA RVMRL +PS+ P+ D +D+ S +L
Sbjct: 7 HPLALRVMRLTKPSIITSVPVLNDKSDVL---------------------------SLNL 39
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
Y S+ L S G L+LP +FG I+LGETF SY+S+NN S +V +V + A++Q
Sbjct: 40 GYSSKNL-----TSYGTGETLILPHSFGNIFLGETFVSYLSVNNESGTDVLNVSLMADLQ 94
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QRI L ++K+P ES++ G D ++ H+VKELG H LVCT YS +GE K +F
Sbjct: 95 TGSQRITL--SNKTPKESLKPGNSLDEVINHEVKELGTHILVCTVSYSRRDGEPKNFRKF 152
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQ-EITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
FKF V PL V+TK ++ Q + +LE I+N T + + M++V +P+ ++A
Sbjct: 153 FKFQVLKPLDVKTKFYNIESYLLTLQCDQVYLETQIQNITPNPICMEKVNLDPAALYTAQ 212
Query: 247 MLKA-DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L H +++ QS KP + YLY LK L + + + V+GK
Sbjct: 213 SLNTISSNHGEFSCQS--YMKP-------SEVRQYLYWLK-LKPSCAKKAFTEAAGVIGK 262
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+++LGE GRLQT Q+ + ++I + V +VP + + +PF + K+TN ++
Sbjct: 263 LDIVWKSSLGERGRLQTSQLQRAILGQRDILVQVNQVPENLKVLQPFEISCKVTNYSEHA 322
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
+ + ++ + ++ + L + A S ++L+ T +G+Q ++G+ V
Sbjct: 323 KQLMVQYENRTN------LLWQNVSGYTLNKLPAKESCFITMSLLPTSVGIQSVSGMKVI 376
Query: 426 DKLEKITYD 434
D TYD
Sbjct: 377 DMELNRTYD 385
>gi|338718819|ref|XP_003363894.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Equus
caballus]
Length = 418
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 135/433 (31%), Positives = 213/433 (49%), Gaps = 48/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLNSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ L + + SR +P YLY LK + ++G
Sbjct: 220 NVAELNSVNQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + ++ L ++ I+G ++ L P + L L+++ G+Q ++G
Sbjct: 333 SSERT--MDLVLEMCNTSSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 387
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 388 LRLTDTFLKRTYE 400
>gi|449514345|ref|XP_002190091.2| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Taeniopygia
guttata]
Length = 411
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 136/429 (31%), Positives = 218/429 (50%), Gaps = 46/429 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL ++F+ L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDL--PGNLFNQ---------LMKDDPSTVKG--- 55
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 56 -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASSAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPSMMYNVAE 217
Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
L D +S F ++ YLY LK + ++G V+GKL
Sbjct: 218 LNT----VDTAGESESTFGTRTYLQP-MDTRQYLYCLKPKQEFAEKAGVIKGVTVIGKLD 272
Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 273 IVWKTNLGEHGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFDITCKITNCSSER-- 330
Query: 368 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFH--LNLIATKLGVQRITGITVF 425
++ L +++ ++G ++ L P S+ H L L+++ G+Q ++G+ +
Sbjct: 331 TMDLVLEMCNTNSIHWCGVSGRQLGKLYP-----SSSLHLALTLLSSVQGLQSVSGLRLT 385
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 386 DTFLKRTYE 394
>gi|395825392|ref|XP_003785919.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Otolemur
garnettii]
Length = 412
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 137/433 (31%), Positives = 215/433 (49%), Gaps = 54/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMY 213
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L + + + SR +P YLY LK + ++G
Sbjct: 214 NVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 327 SSERT--MDLVLEMCNTNSIHWCGISGRQLGKLNPSSSLC---LALTLLSSVQGLQSISG 381
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 382 LRLTDTFLKRTYE 394
>gi|219517954|gb|AAI43692.1| C5orf44 protein [Homo sapiens]
Length = 401
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 136/431 (31%), Positives = 215/431 (49%), Gaps = 54/431 (12%)
Query: 10 LAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 1 LALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV----------------- 42
Query: 66 DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA+
Sbjct: 43 -----------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKAD 91
Query: 126 IQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP 185
+QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 92 LQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFR 150
Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS ++
Sbjct: 151 KFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMYNV 204
Query: 246 TMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
T L + + + SR +P YLY LK + + ++G V+
Sbjct: 205 TELNSVSQAGECVSTFGSRAYLQPM-------DTRQYLYCLKPKNEFAEKAGIIKGVTVI 257
Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN +
Sbjct: 258 GKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSS 317
Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
+ ++ L +++ I+G ++ L P + L L+++ G+Q I+G+
Sbjct: 318 ER--TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLR 372
Query: 424 VFDKLEKITYD 434
+ D K TY+
Sbjct: 373 LTDTFLKRTYE 383
>gi|334325204|ref|XP_003340619.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Monodelphis
domestica]
Length = 412
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 137/435 (31%), Positives = 218/435 (50%), Gaps = 58/435 (13%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEDRDLPGDLF-NQLMKDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQIVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V +++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNATVAELKSDCCIDDVIHHEVKEIGTHILVCAVSYTTQSGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPSMMY 213
Query: 244 SA----TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
+ T+ +A S + SR +P YLY LK + ++G
Sbjct: 214 NVVELNTVKQAGEGMSTFG--SRTYLQP-------MDTRQYLYCLKPKQEFAEKAGIIKG 264
Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+T
Sbjct: 265 VTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKIT 324
Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
N + + ++ L +++ ++G ++ L P S L L+++ G+Q +
Sbjct: 325 NCSSER--TMDLVLEMCNTNSIHWCGVSGRQLGKLNPS---SSLYLALTLLSSVQGLQSV 379
Query: 420 TGITVFDKLEKITYD 434
+G+ + D K TY+
Sbjct: 380 SGLRLTDTFLKRTYE 394
>gi|328771369|gb|EGF81409.1| hypothetical protein BATDEDRAFT_34721 [Batrachochytrium
dendrobatidis JAM81]
Length = 484
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 158/498 (31%), Positives = 229/498 (45%), Gaps = 103/498 (20%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL PS PL D TDL + AA + L SD + D+
Sbjct: 8 HLLALKVMRLSHPSYAQTHPLYTD-TDLALP--------AAEVVQSLKHSDSSMQVDDDM 58
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A GL LL LP AFG IYLGETF SY+ +NN S V ++ KAE+Q
Sbjct: 59 Y----------AGIAGLGSLLTLPPAFGNIYLGETFSSYLCVNNESLTPVLNLTFKAELQ 108
Query: 128 TDKQRILLLDT--------------------------------------SKSPVESIRAG 149
T QRI L DT +S S+ G
Sbjct: 109 TSTQRITLADTLLSSASSSASSSTGVDRLALGSISGSYSTLHGSGPAENRQSLASSLLPG 168
Query: 150 GRYDFIVEHDVKELGAHTLVCTALY----------SDGEGERKYLPQFFKFIVSNPLSVR 199
+F++ HD+KELG H LVC+ Y S + ERK+ +F+KF V NPLSV+
Sbjct: 169 QSAEFVIHHDIKELGIHILVCSVHYTPAPVIGSSASSMDRERKFFRKFYKFQVLNPLSVK 228
Query: 200 TKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS-----------QNWSATML 248
TKV ++ G FLEA ++N + S +Y++ + FEP+ ++ S ++
Sbjct: 229 TKVNTLQDGRI------FLEAQVQNVSSSFMYLEYMNFEPNDPFLVQDLNLFRDSSVSLT 282
Query: 249 KADG------PHSDYNAQSRE------IFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
++ + QS + +FK L+ YLY ML+ S + V
Sbjct: 283 SGQNDIVSTKSETETDVQSSQTSKGLSVFKERDLL-GQQDTRQYLY---MLTPKSINDVA 338
Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
+ LGKL I+WRT LG+ GRLQT Q+ ++ E+ VVE P ++ +++PF++K+
Sbjct: 339 TRMLPGLGKLDISWRTVLGQSGRLQTSQLSRKILSVNPFEVFVVEQPRIIRVEQPFVVKI 398
Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
++TN E+ I +N V++ G + L +E S D L A +G+
Sbjct: 399 RITNHVPSERLKLSIHGYKNKMTN---VLLRGPNNIELNELEGASSVDVDLEFFALAIGL 455
Query: 417 QRITGITVFDKLEKITYD 434
Q+ITGI V DK+ T D
Sbjct: 456 QKITGIQVSDKVSGTTRD 473
>gi|198423525|ref|XP_002129762.1| PREDICTED: similar to UPF0533 protein isoform 1 [Ciona
intestinalis]
Length = 389
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 136/428 (31%), Positives = 218/428 (50%), Gaps = 57/428 (13%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA RVMRL +PS+ P+ D +D+ S +L
Sbjct: 7 HPLALRVMRLTKPSIITSVPVLNDKSDVL---------------------------SLNL 39
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
Y S+ L S G L+LP +FG I+LGETF SY+S+NN S +V +V + A++Q
Sbjct: 40 GYSSKNL-----TSYGTGETLILPHSFGNIFLGETFVSYLSVNNESGTDVLNVSLMADLQ 94
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QRI L ++K+P ES++ G D ++ H+VKELG H LVCT YS +GE K +F
Sbjct: 95 TGSQRITL--SNKTPKESLKPGNSLDEVINHEVKELGTHILVCTVSYSRRDGEPKNFRKF 152
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK ++ + +LE I+N T + + M++V +P+ ++A
Sbjct: 153 FKFQVLKPLDVKTKFYNIEC------DQVYLETQIQNITPNPICMEKVNLDPAALYTAQS 206
Query: 248 LKA-DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 306
L H +++ QS KP + YLY LK L + + + V+GKL
Sbjct: 207 LNTISSNHGEFSCQS--YMKP-------SEVRQYLYWLK-LKPSCAKKAFTEAAGVIGKL 256
Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 366
I W+++LGE GRLQT Q+ + ++I + V +VP + + +PF + K+TN ++ +
Sbjct: 257 DIVWKSSLGERGRLQTSQLQRAILGQRDILVQVNQVPENLKVLQPFEISCKVTNYSEHAK 316
Query: 367 GPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFD 426
+ ++ + ++ + L + A S ++L+ T +G+Q ++G+ V D
Sbjct: 317 QLMVQYENRTN------LLWQNVSGYTLNKLPAKESCFITMSLLPTSVGIQSVSGMKVID 370
Query: 427 KLEKITYD 434
TYD
Sbjct: 371 MELNRTYD 378
>gi|387019765|gb|AFJ52000.1| UPF0533 protein C5orf44-like protein [Crotalus adamanteus]
Length = 413
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 137/429 (31%), Positives = 217/429 (50%), Gaps = 40/429 (9%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDL--PGDLFNQ---------LMKDDPSTVKG--- 55
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 56 -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSQQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASSAAVAELKQDCCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + + FLEA I+N T S ++M++V EPS ++
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITSSPMFMEKVSLEPSIMYNVAE 223
Query: 248 LKA--DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L G S +R +P YLY LK S ++G V+GK
Sbjct: 224 LNTINQGRDSVSTFGTRTYLQPM-------DTRQYLYCLKPKQEFSEKVGVIKGVTVIGK 276
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLETIPDTVSLEEPFNITCKITNCSSER 336
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L +++ ++G ++ L P + T L+ + G+Q ++G+ +
Sbjct: 337 --TMDLVLEMCNTNSIHWCGVSGRQLGKLHPTSSLYLTLTLLSSVQ---GLQSVSGLRLT 391
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 392 DTFLKRTYE 400
>gi|395510368|ref|XP_003759449.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Sarcophilus
harrisii]
Length = 412
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 135/429 (31%), Positives = 218/429 (50%), Gaps = 46/429 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDL--PGDLFNQ---------LMKDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQIVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V +++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNATVAELKSDCCIDDVIHHEVKEIGTHILVCAVSYTTQSGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPSMMYNVVE 217
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + SR +P YLY LK + + ++G V+GK
Sbjct: 218 LNTVKQVGEGVSTFGSRTYLQPM-------DTRQYLYCLKPKAEFAEKAGIIKGVTVIGK 270
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 330
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L +++ ++G ++ L P S L L+++ G+Q ++G+ +
Sbjct: 331 --TMDLVLEMCNTNSIHWCGVSGRQLGKLNPS---SSLYLALTLLSSVQGLQSVSGLRLT 385
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 386 DTFLKRTYE 394
>gi|148686557|gb|EDL18504.1| RIKEN cDNA 2410002O22, isoform CRA_c [Mus musculus]
Length = 426
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 136/429 (31%), Positives = 217/429 (50%), Gaps = 46/429 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 24 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPST------ 66
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 67 --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 118
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 119 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 177
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 178 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 231
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + + SR +P YLY LK + ++G V+GK
Sbjct: 232 LNSVTQAGECISTFGSRGYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 284
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 285 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 344
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ +
Sbjct: 345 M--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 399
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 400 DTFLKRTYE 408
>gi|74207988|dbj|BAE29111.1| unnamed protein product [Mus musculus]
Length = 412
Score = 197 bits (502), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 136/429 (31%), Positives = 217/429 (50%), Gaps = 46/429 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEKKDL--PGDLFNQ---------LMKDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 217
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + + SR +P YLY LK + ++G V+GK
Sbjct: 218 LNSVTQAGECISTFGSRGYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 270
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 330
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ +
Sbjct: 331 M--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 385
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 386 DTFLKRTYE 394
>gi|148276987|ref|NP_001087229.1| UPF0533 protein C5orf44 homolog isoform 3 [Mus musculus]
gi|74194542|dbj|BAE37309.1| unnamed protein product [Mus musculus]
Length = 412
Score = 197 bits (502), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 136/429 (31%), Positives = 217/429 (50%), Gaps = 46/429 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 217
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + + SR +P YLY LK + ++G V+GK
Sbjct: 218 LNSVTQAGECISTFGSRGYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 270
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 330
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ +
Sbjct: 331 M--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 385
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 386 DTFLKRTYE 394
>gi|260792744|ref|XP_002591374.1| hypothetical protein BRAFLDRAFT_282065 [Branchiostoma floridae]
gi|229276579|gb|EEN47385.1| hypothetical protein BRAFLDRAFT_282065 [Branchiostoma floridae]
Length = 410
Score = 197 bits (501), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 136/440 (30%), Positives = 217/440 (49%), Gaps = 46/440 (10%)
Query: 10 LAFRVMRLCRPS-LHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLT 68
LA +VMRL RP+ LHV P + D DL S ++ SD+ ++
Sbjct: 11 LALKVMRLTRPTFLHVTP-ITCDDRDL-----------PGSTFSQVVRSDMASSAG---- 54
Query: 69 YRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQT 128
+ + LL LPQ FG I+LGETF Y+ ++N ST V+D+++KA++QT
Sbjct: 55 ----------LEEFAMGELLTLPQNFGNIFLGETFSCYVCVHNDSTQLVKDIMVKADLQT 104
Query: 129 DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFF 188
QR+ L S P+ + G D ++ H+VKELG H LVC Y+ E+ Y +FF
Sbjct: 105 SSQRLTLSGGSSPPIPELGPEGSIDEVIHHEVKELGTHILVCAVSYTTQSSEKMYFRKFF 164
Query: 189 KFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
KF V PL V+TK + + +LEA ++N T + + M++V EPS ++S + L
Sbjct: 165 KFQVLKPLDVKTKFYNAE------SDEVYLEAQVQNITAAPMVMEKVSLEPSASYSVSEL 218
Query: 249 KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQI 308
+ IF V + I YLY LK + + ++G +GKL I
Sbjct: 219 NTE------EKAGMSIFGTSVYLNP-KDIRQYLYCLKPKAEVGAPRGVLKGVTNIGKLDI 271
Query: 309 TWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGP 368
W+TN+GE GRLQT + +I L V ++P V ++KPF K ++TN ++
Sbjct: 272 IWKTNMGEKGRLQTSPLQRMAPGYGDIRLTVEQIPDGVPMEKPFNFKCRVTNCCERTMD- 330
Query: 369 FEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKL 428
+ L + + ++G ++ L P + +L L+A+ G+Q I+G+ + D
Sbjct: 331 LLLLLQNSGTSGLYWCGVSGKQLGKLGPNTHM---ELNLTLLASVPGLQSISGLRLTDTY 387
Query: 429 EKITY--DSLPDLEIFVDQD 446
K TY D + + ++ DQ+
Sbjct: 388 LKRTYEHDDIAQVFVYSDQE 407
>gi|349732100|ref|NP_001016427.2| UPF0533 protein C5orf44 homolog isoform 2 [Xenopus (Silurana)
tropicalis]
Length = 411
Score = 197 bits (501), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 137/430 (31%), Positives = 220/430 (51%), Gaps = 49/430 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDL--PGDLFS---------TLMKDDPSTVKG--- 55
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+ +KA++Q
Sbjct: 56 -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDIQVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASTAVVSELKPDSCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++ +
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVSE 217
Query: 248 LKADGPHSD-YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 306
L + D + + + P+ R YLY LK + ++G V+GKL
Sbjct: 218 LNTVITNGDGCSTFGTKTYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIGKL 271
Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 366
I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN +++
Sbjct: 272 DIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSIETIPDTVRLEEPFDITCKITNCSER-- 329
Query: 367 GPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLN--LIATKLGVQRITGITV 424
++ L +++ ++G ++ L P S+ HL L+++ G+Q ++G+ +
Sbjct: 330 -TMDLVLEMCNTNAIHWCGVSGRQLGKLHP-----SSSLHLTLALLSSVQGLQSVSGLRL 383
Query: 425 FDKLEKITYD 434
D K TY+
Sbjct: 384 TDTFLKRTYE 393
>gi|431907788|gb|ELK11395.1| hypothetical protein PAL_GLEAN10024843 [Pteropus alecto]
Length = 411
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 135/433 (31%), Positives = 214/433 (49%), Gaps = 55/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMY 213
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ L + + +R +P YLY LK + ++G
Sbjct: 214 NVAELNSVNQAGECVTTFGTRTYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+++ ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 327 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 380
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 381 LRLTDTFLKRTYE 393
>gi|426246395|ref|XP_004016980.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Ovis aries]
Length = 412
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 135/429 (31%), Positives = 215/429 (50%), Gaps = 46/429 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVAE 217
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + SR +P YLY LK + ++G V+GK
Sbjct: 218 LNSVNQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 270
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 330
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ +
Sbjct: 331 T--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 385
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 386 DTFLKRTYE 394
>gi|355691351|gb|EHH26536.1| hypothetical protein EGK_16539 [Macaca mulatta]
gi|355749957|gb|EHH54295.1| hypothetical protein EGM_15103 [Macaca fascicularis]
Length = 418
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 137/429 (31%), Positives = 215/429 (50%), Gaps = 40/429 (9%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK +V + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAEVSVECLTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMYNVTE 223
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + + SR +P YLY LK + ++G V+GK
Sbjct: 224 LNSVTQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 336
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
+ + +S + G+ L + S L L+++ G+Q I+G+ +
Sbjct: 337 TMDLVLEMCNTNS-----IHWCGISGRQLGKLHPSSSLSLALTLLSSVQGLQSISGLRLT 391
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 392 DTFLKRTYE 400
>gi|359319029|ref|XP_003638975.1| PREDICTED: UPF0533 protein C5orf44 homolog [Canis lupus familiaris]
gi|410948699|ref|XP_003981068.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Felis catus]
Length = 412
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 135/429 (31%), Positives = 215/429 (50%), Gaps = 46/429 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVAE 217
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + SR +P YLY LK + ++G V+GK
Sbjct: 218 LNSVSQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 270
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 330
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ +
Sbjct: 331 T--MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 385
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 386 DTFLKRTYE 394
>gi|443711431|gb|ELU05219.1| hypothetical protein CAPTEDRAFT_211630 [Capitella teleta]
Length = 423
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 138/443 (31%), Positives = 219/443 (49%), Gaps = 39/443 (8%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
M S H L +VMRL +P+L + PL PT + D P+ +
Sbjct: 1 MESKEKEHLLVLKVMRLTKPALMISKPLSCIPTHRTV--DDHGQPVKVA----------- 47
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
+DL + + + LS LL LPQ FG I+LGETF SYIS++N+S+ RD+
Sbjct: 48 ----TDLA------IAEGLEHFALSQLLTLPQNFGNIFLGETFSSYISVHNNSSHVCRDI 97
Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
IKA++QT QR+ L + +PV+ + D +++H+VKELG H LVC Y GE
Sbjct: 98 QIKADLQTSSQRLTLSSSHANPVQQLTPSESIDDVIQHEVKELGTHILVCAVTYVSNTGE 157
Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
+ Y +FFKF V PL V+TK + + +LEA I+N T +++++V +PS
Sbjct: 158 KMYFRKFFKFQVLKPLDVKTKFYNAE------SDEVYLEAQIQNITPGPIFLEKVLLDPS 211
Query: 241 QNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS 300
++S L H+ + +R +F V S + YLY L + P ++G
Sbjct: 212 SHYSGIQL-----HTQEDPVNRPVFG-KVNCVSPLDVRQYLYCLTPKPEVLADPKFMKGV 265
Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
+GKL I W+TN+ E GRLQT + +I L V ++ V ++ F +++++TN
Sbjct: 266 TNIGKLDIVWKTNMAEKGRLQTSALQRVLPGYGDIRLMVEKISESVPVETKFNIEIRVTN 325
Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT 420
+++ + L N +G++I L + ST L LI T G+Q I+
Sbjct: 326 CSERTMD-LSVHLDNNIQIGLLWSCCSGIQIGRLT---SGSSTLLKLALIPTACGLQTIS 381
Query: 421 GITVFDKLEKITYDSLPDLEIFV 443
G+ + D K TY+ +++V
Sbjct: 382 GLRLTDTFLKRTYEHDEVAQVYV 404
>gi|349732102|ref|NP_001231833.1| UPF0533 protein C5orf44 homolog isoform 1 [Xenopus (Silurana)
tropicalis]
gi|123912021|sp|Q0VFT9.1|CE044_XENTR RecName: Full=UPF0533 protein C5orf44 homolog
gi|110645327|gb|AAI18703.1| LOC549181 protein [Xenopus (Silurana) tropicalis]
Length = 412
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 137/430 (31%), Positives = 219/430 (50%), Gaps = 48/430 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDL--PGDLFS---------TLMKDDPSTVKG--- 55
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+ +KA++Q
Sbjct: 56 -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDIQVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASTAVVSELKPDSCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++ +
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVSE 217
Query: 248 LKADGPHSD-YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 306
L + D + + + P+ R YLY LK + ++G V+GKL
Sbjct: 218 LNTVITNGDGCSTFGTKTYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIGKL 271
Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 366
I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 272 DIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSIETIPDTVRLEEPFDITCKITNCSSERT 331
Query: 367 GPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLN--LIATKLGVQRITGITV 424
++ L +++ ++G ++ L P S+ HL L+++ G+Q ++G+ +
Sbjct: 332 --MDLVLEMCNTNAIHWCGVSGRQLGKLHP-----SSSLHLTLALLSSVQGLQSVSGLRL 384
Query: 425 FDKLEKITYD 434
D K TY+
Sbjct: 385 TDTFLKRTYE 394
>gi|194223840|ref|XP_001492631.2| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Equus
caballus]
Length = 412
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 135/433 (31%), Positives = 212/433 (48%), Gaps = 54/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMY 213
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ L + + SR +P YLY LK + ++G
Sbjct: 214 NVAELNSVNQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 266
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 267 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 326
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + ++ L ++ I+G ++ L P + L L+++ G+Q ++G
Sbjct: 327 SSERT--MDLVLEMCNTSSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 381
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 382 LRLTDTFLKRTYE 394
>gi|37589695|gb|AAH59537.1| Zgc:73187 [Danio rerio]
gi|47937881|gb|AAH71349.1| Zgc:73187 [Danio rerio]
Length = 385
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 123/358 (34%), Positives = 195/358 (54%), Gaps = 21/358 (5%)
Query: 79 ADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDT 138
A+++ L +L LPQ FG I+LGETF SYIS++N S+ V+D+++KA++QT QR L L
Sbjct: 29 AETLILGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVKADLQTSSQR-LNLSA 87
Query: 139 SKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSV 198
S S V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V
Sbjct: 88 SNSAVSELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKLYFRKFFKFQVLKPLDV 147
Query: 199 RTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK--ADGPHSD 256
+TK + FLEA I+N T S ++M++V EPS ++ T L A G S
Sbjct: 148 KTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPSMMYNVTELNNVASGDESS 201
Query: 257 YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGE 316
+ + + P+ R YLY LK + ++G V+GKL I W+TNLGE
Sbjct: 202 ESTFGKMSYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIGKLDIVWKTNLGE 255
Query: 317 PGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQN 376
GRLQT Q+ ++ L++ +P V +++PF + K+TN +++ ++ L
Sbjct: 256 RGRLQTSQLQRMAPGYGDVRLSLEFIPDTVDLEEPFDITCKITNCSERT---MDLLLEMC 312
Query: 377 DSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
++ ++G ++ L+P S L L+++ G+Q I+G+ + D K TY+
Sbjct: 313 NTRSVHWCGVSGRQLGKLSPS---ASLSIPLKLLSSVQGLQSISGLRLTDTFLKRTYE 367
>gi|327263135|ref|XP_003216376.1| PREDICTED: UPF0533 protein C5orf44 homolog [Anolis carolinensis]
Length = 417
Score = 196 bits (497), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 135/429 (31%), Positives = 217/429 (50%), Gaps = 40/429 (9%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDL--PGDLFNQ---------LMKDDPSTVKG--- 55
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 56 -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSHQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASSAAVAELKQDCCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + + FLEA I+N T S ++M++V EPS ++
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVVE 223
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L D + +R +P YLY LK + ++G V+GK
Sbjct: 224 LNTVSHTEDSISTFGTRTYLQP-------MDTRQYLYCLKPKQEFAEKAGVIKGVTVIGK 276
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLETIPDTVNLEEPFDITCKITNCSSER 336
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ L +++ ++G ++ L P + T L+ + G+Q ++G+ +
Sbjct: 337 --TMDLVLEMCNTNSIHWCGVSGRQLGKLHPTSSLHLTLTLLSSVQ---GLQSVSGLRLT 391
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 392 DTFLKRTYE 400
>gi|320168756|gb|EFW45655.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 439
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 137/428 (32%), Positives = 219/428 (51%), Gaps = 49/428 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H L +VMRL +P+L + P+ +P+D A S L + ++DV+T +L
Sbjct: 9 HYLVLKVMRLSKPTLVIGQPIVSEPSDF-----------AGSVLQEVQTADVSTAGQPEL 57
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
LS L+LPQ FG I+LGETF SYIS++N S + +RDV +KAE+Q
Sbjct: 58 --------------FSLSSFLMLPQNFGNIFLGETFSSYISVHNDSNMRIRDVAVKAELQ 103
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR+ L D + S E + G D +V H+VKELG H LVC+ Y + ERK +F
Sbjct: 104 TTSQRVPLSDLAPSDKE-LSPGASVDVVVHHEVKELGVHILVCSVSYMTADDERKIFRKF 162
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE--PSQNWSA 245
FKF V +PL+V+TKV V ++ FLEA ++N T + +Y++ V+FE P ++
Sbjct: 163 FKFNVLHPLAVKTKVYNV-------EDDIFLEAQVQNITPAPMYIEAVKFEAMPQFDFQD 215
Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIH-------NYLYQLKMLSHGSSSPVKVQ 298
+ + + ++ ++ K G H YLY+L G + +
Sbjct: 216 LNVLSSAASASSSSTNQAGLKASPATTFGLAYHVNPQDIRQYLYRLSPKVKGDKT---AR 272
Query: 299 GSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKL 358
++ +GK+ I W+TN+GE GRLQT Q+ E+ + VVEVP V ++ PF ++ ++
Sbjct: 273 AADKIGKMDILWKTNMGEVGRLQTSQLPRKLPALTELAVTVVEVPDNVVLEVPFTVQCRI 332
Query: 359 TNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQR 418
TN ++ + ++ ++ ++G + L P EA S L G+QR
Sbjct: 333 TNYSEHKMS-LRLFAVKSRMTGVLAAGVSGQSLGELFP-EA--SKIIPLEFFPAVPGLQR 388
Query: 419 ITGITVFD 426
++G+ + D
Sbjct: 389 VSGLRLMD 396
>gi|291395448|ref|XP_002714113.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
Length = 402
Score = 195 bits (495), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 132/426 (30%), Positives = 212/426 (49%), Gaps = 48/426 (11%)
Query: 15 MRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYR 70
MRL +P+L P+ + P DLF + + DDP
Sbjct: 1 MRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV---------------------- 37
Query: 71 SRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDK 130
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT
Sbjct: 38 ------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSS 91
Query: 131 QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKF 190
QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF
Sbjct: 92 QR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKF 150
Query: 191 IVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKA 250
V PL V+TK + + + FLEA I+N T S ++M++V EPS ++ T L +
Sbjct: 151 QVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNS 210
Query: 251 DGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQI 308
+ + SR +P YLY LK + ++G V+GKL I
Sbjct: 211 VSQAGECLSTFGSRAYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDI 263
Query: 309 TWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGP 368
W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 264 VWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--T 321
Query: 369 FEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKL 428
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ + D
Sbjct: 322 MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTF 378
Query: 429 EKITYD 434
K TY+
Sbjct: 379 LKRTYE 384
>gi|441658598|ref|XP_004091270.1| PREDICTED: UPF0533 protein C5orf44 homolog [Nomascus leucogenys]
Length = 355
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 119/350 (34%), Positives = 188/350 (53%), Gaps = 15/350 (4%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
+L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT QR L L S + V +
Sbjct: 1 MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
+ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V+TK +
Sbjct: 60 KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119
Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
+ + FLEA I+N T S ++M++V EPS +S T L + + + SR
Sbjct: 120 SDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMYSVTELNSVSQAGECVSTFGSRAY 179
Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
+P YLY LK + ++G V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232
Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
+ ++ L++ +P V +++PF + K+TN + + ++ L +++
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWC 290
Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
I+G ++ L P + L L+++ G+Q I+G+ + D K TY+
Sbjct: 291 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 337
>gi|351699840|gb|EHB02759.1| hypothetical protein GW7_09268, partial [Heterocephalus glaber]
Length = 396
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 134/427 (31%), Positives = 212/427 (49%), Gaps = 55/427 (12%)
Query: 14 VMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
VMRL +P+L P+ + P DLF + + DDP
Sbjct: 1 VMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------------- 38
Query: 70 RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT
Sbjct: 39 -------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTS 91
Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFK 189
QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +FFK
Sbjct: 92 SQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFK 150
Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
F V PL V+TK + FLEA I+N T S ++M++V EPS +S T L
Sbjct: 151 FQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYSVTELN 204
Query: 250 ADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
+ + + SR +P YLY LK + ++G V+GKL
Sbjct: 205 SVNQAGECVSTFGSRAYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLD 257
Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN +++
Sbjct: 258 IVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSER--- 314
Query: 368 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 427
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ + D
Sbjct: 315 TMDLVLEMYNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDT 371
Query: 428 LEKITYD 434
K TY+
Sbjct: 372 FLKRTYE 378
>gi|10435667|dbj|BAB14633.1| unnamed protein product [Homo sapiens]
Length = 354
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 118/350 (33%), Positives = 190/350 (54%), Gaps = 16/350 (4%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
+L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT QR L L S + V +
Sbjct: 1 MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
+ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V+TK +
Sbjct: 60 KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119
Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
+ + FLEA I+N T S ++M++V EPS ++ T L + + + SR
Sbjct: 120 SDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFGSRAY 179
Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
+P YLY LK + + ++G V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232
Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
+ ++ L++ +P V +++PF + K+TN +++ ++ L +++
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSER---TMDLVLEMCNTNSIHWC 289
Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
I+G ++ L P + L L+++ G+Q I+G+ + D K TY+
Sbjct: 290 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 336
>gi|388453625|ref|NP_001253285.1| trafficking protein particle complex 13 [Macaca mulatta]
gi|383412261|gb|AFH29344.1| hypothetical protein LOC80006 isoform 2 [Macaca mulatta]
gi|384941112|gb|AFI34161.1| hypothetical protein LOC80006 isoform 2 [Macaca mulatta]
Length = 417
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 122/363 (33%), Positives = 186/363 (51%), Gaps = 43/363 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L + + + SR +P YLY LK + ++G
Sbjct: 220 NVTELNSVTQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 362 TDK 364
+++
Sbjct: 333 SER 335
>gi|402871695|ref|XP_003899789.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Papio anubis]
gi|380816682|gb|AFE80215.1| hypothetical protein LOC80006 isoform 1 [Macaca mulatta]
Length = 418
Score = 194 bits (493), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 136/433 (31%), Positives = 213/433 (49%), Gaps = 48/433 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219
Query: 244 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+ T L + + + SR +P YLY LK + ++G
Sbjct: 220 NVTELNSVTQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ + + + +S + G+ L + S L L+++ G+Q I+G
Sbjct: 333 SSERTMDLVLEMCNTNS-----IHWCGISGRQLGKLHPSSSLSLALTLLSSVQGLQSISG 387
Query: 422 ITVFDKLEKITYD 434
+ + D K TY+
Sbjct: 388 LRLTDTFLKRTYE 400
>gi|301767850|ref|XP_002919348.1| PREDICTED: UPF0533 protein C5orf44 homolog [Ailuropoda melanoleuca]
Length = 401
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 131/426 (30%), Positives = 211/426 (49%), Gaps = 49/426 (11%)
Query: 15 MRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYR 70
MRL +P+L P+ + P DLF + + DDP
Sbjct: 1 MRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV---------------------- 37
Query: 71 SRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDK 130
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT
Sbjct: 38 ------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSS 91
Query: 131 QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKF 190
QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF
Sbjct: 92 QR-LNLSASSAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKF 150
Query: 191 IVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKA 250
V PL V+TK + + + FLEA I+N T S ++M++V EPS ++ L +
Sbjct: 151 QVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVAELNS 210
Query: 251 DGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQI 308
+ SR +P YLY LK + ++G V+GKL I
Sbjct: 211 VSQAGECVTTFGSRAYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDI 263
Query: 309 TWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGP 368
W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN +++
Sbjct: 264 VWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSER---T 320
Query: 369 FEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKL 428
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ + D
Sbjct: 321 MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTF 377
Query: 429 EKITYD 434
K TY+
Sbjct: 378 LKRTYE 383
>gi|410039326|ref|XP_003950597.1| PREDICTED: UPF0533 protein C5orf44 homolog [Pan troglodytes]
Length = 355
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 118/350 (33%), Positives = 189/350 (54%), Gaps = 15/350 (4%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
+L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT QR L L S + V +
Sbjct: 1 MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
+ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V+TK +
Sbjct: 60 KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119
Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
+ + FLEA I+N T S ++M++V EPS ++ T L + + + SR
Sbjct: 120 SDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFGSRAY 179
Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
+P YLY LK + + ++G V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232
Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
+ ++ L++ +P V +++PF + K+TN + + ++ L +++
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWC 290
Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
I+G ++ L P + L L+++ G+Q I+G+ + D K TY+
Sbjct: 291 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 337
>gi|390459897|ref|XP_002744953.2| PREDICTED: UPF0533 protein C5orf44 isoform 1 [Callithrix jacchus]
Length = 355
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 118/350 (33%), Positives = 189/350 (54%), Gaps = 15/350 (4%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
+L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT QR L L S + V +
Sbjct: 1 MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
+ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V+TK +
Sbjct: 60 KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119
Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNA--QSREI 264
+ + FLEA I+N T S ++M++V EPS ++ T L + + + +SR
Sbjct: 120 SDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFRSRAY 179
Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
+P YLY LK + ++G V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232
Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
+ ++ L++ +P V +++PF + K+TN + + ++ L +++
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWC 290
Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
I+G ++ L P + L L+++ G+Q I+G+ + D K TY+
Sbjct: 291 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 337
>gi|395825394|ref|XP_003785920.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Otolemur
garnettii]
Length = 355
Score = 192 bits (487), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 118/350 (33%), Positives = 188/350 (53%), Gaps = 15/350 (4%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
+L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT QR L L S + V +
Sbjct: 1 MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
+ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V+TK +
Sbjct: 60 KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119
Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
+ + FLEA I+N T S ++M++V EPS ++ T L + + + SR
Sbjct: 120 SDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFGSRAY 179
Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
+P YLY LK + ++G V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232
Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
+ ++ L++ +P V +++PF + K+TN + + ++ L +++
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWC 290
Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
I+G ++ L P + L L+++ G+Q I+G+ + D K TY+
Sbjct: 291 GISGRQLGKLNPSSSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 337
>gi|156392281|ref|XP_001635977.1| predicted protein [Nematostella vectensis]
gi|156223076|gb|EDO43914.1| predicted protein [Nematostella vectensis]
Length = 394
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 143/430 (33%), Positives = 212/430 (49%), Gaps = 58/430 (13%)
Query: 15 MRLCRPSLHVEPPLRVDPTDL--FIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSR 72
MRL +PS++ P++ + DL I +D D IA+ +P +
Sbjct: 1 MRLTKPSMYTSIPVQCESQDLPGSIFKDCHDADIAS--VPGMYD---------------- 42
Query: 73 FLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQR 132
L LLVLPQ FG I+LGETF SY+S++N S V+D+VIK ++QT QR
Sbjct: 43 ---------FALGDLLVLPQTFGNIFLGETFASYVSVHNDSNQSVKDIVIKTDLQTSSQR 93
Query: 133 ILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIV 192
+ L + PV + YD ++ H+VKELG H LVC YS GE+ Y +FFKF V
Sbjct: 94 LTLSGAANMPVAKLDPQKSYDQVIHHEVKELGTHILVCAVSYSSLAGEKMYFRKFFKFQV 153
Query: 193 SNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADG 252
PL V+TK + + FLEA ++N T S + M+ V +PS ++ T L
Sbjct: 154 LKPLDVKTKFYNAE------DDSVFLEAQVQNITSSPMVMESVRLDPSALYTVTDLNI-A 206
Query: 253 PHSDYNAQSRE---IFKPPVLIRSGGGIH-----NYLYQLKMLSHGSSSPVKVQGSNVLG 304
P SD N R+ I++ V G +H YLY+LK S +P S+ +G
Sbjct: 207 P-SDPNKTKRQNAMIYELDV----GSFLHPNDTRQYLYKLKAKSPIDRNPKVRPYSHPVG 261
Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
KL I WRT+ GE GRLQT Q+ +++L V ++ V +++PF + LKL N D+
Sbjct: 262 KLDIVWRTSFGERGRLQTSQLSRVIPAIADLKLTVSQMADAVPVERPFPVSLKLKNTCDR 321
Query: 365 EQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITV 424
+ + ++++ ++ +M G + V TD N Q I+G+ V
Sbjct: 322 KMD-LRLLMTKS---KDGAMMWCGTSGKVCSNVGKL--TD---NSSIFLFFTQNISGLRV 372
Query: 425 FDKLEKITYD 434
DKL TY+
Sbjct: 373 IDKLSGRTYE 382
>gi|349732103|ref|NP_001085628.2| UPF0533 protein C5orf44 homolog [Xenopus laevis]
gi|190360172|sp|Q6GPR5.2|CE044_XENLA RecName: Full=UPF0533 protein C5orf44 homolog
Length = 414
Score = 191 bits (484), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 136/430 (31%), Positives = 217/430 (50%), Gaps = 46/430 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F L+ D +T K +++
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFS---------TLMKDDPSTVKGAEI 58
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ L +L LPQ FG I+LGETF SYIS++N S V+DV +KA++Q
Sbjct: 59 --------------LMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDVQVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASSAVVADLKPDSCIDDVIHHEVKEIGTHILVCAVSYTIQSGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++ +
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVSE 217
Query: 248 LKADGPHSDYNAQSR---EIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
L + D+ S + + P+ R YLY LK + ++G V+G
Sbjct: 218 LNTVITNGDWKGSSTFGTKTYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIG 271
Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
KL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 272 KLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSIETIPDTVRLEEPFDITCKITNCSSE 331
Query: 365 EQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITV 424
++ L +++ ++G ++ L P + T L+ + G+Q ++G+ +
Sbjct: 332 RT--MDLVLEMCNTNAIHWSGVSGRQLGKLHPSSSLHLTLTLLSSVQ---GLQSVSGLRL 386
Query: 425 FDKLEKITYD 434
D K TY+
Sbjct: 387 TDTFLKRTYE 396
>gi|383412259|gb|AFH29343.1| hypothetical protein LOC80006 isoform 3 [Macaca mulatta]
gi|384941114|gb|AFI34162.1| hypothetical protein LOC80006 isoform 3 [Macaca mulatta]
Length = 411
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 122/359 (33%), Positives = 186/359 (51%), Gaps = 41/359 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMYNVTE 217
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + + SR +P YLY LK + ++G V+GK
Sbjct: 218 LNSVTQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 270
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN +++
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSER 329
>gi|261260081|sp|A8WX89.2|U533_CAEBR RecName: Full=UPF0533 protein CBG04321
Length = 401
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 138/450 (30%), Positives = 225/450 (50%), Gaps = 61/450 (13%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
+S++ LA RVMRL RP + P D F DP+ + L++ V
Sbjct: 5 ISNSSTQQLLALRVMRLARP--------KFAPLDGFS-----HDPVDPTGFGELLAGKV- 50
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
++++ SR HD + + L+ PQ F IYLGETF Y+++ N S V +V
Sbjct: 51 ----AEISKESR---HD----LPIGEYLIAPQMFENIYLGETFTFYVNVVNESESNVVNV 99
Query: 121 VIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
+K E+QT QR++L +ES + G+ ++ H+VKE+G H L+C+ Y G
Sbjct: 100 SLKCELQTSTQRVVLPCSVQDVTIESTKCDGQ---VISHEVKEIGQHILICSVNYKTLSG 156
Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
E Y +FFKF VS P+ V+TK + A Q++ +LEA IEN + SN+++++VE +P
Sbjct: 157 ENMYFRKFFKFPVSKPIDVKTKFYSAEDNAN--QDV-YLEAQIENTSNSNMFLERVELDP 213
Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
SQ++ T + H D + ++ KP I +L+ L SPV V
Sbjct: 214 SQHYKVTSIS----HEDEFPEVGKLLKP-------KDIRQFLFCL--------SPVDVNN 254
Query: 300 S------NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFL 353
+ +GKL ++WRT++GE GRLQT + ++ L+V + P+ V + KPF
Sbjct: 255 TLGYKDLTSIGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVDVQKPFE 314
Query: 354 LKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATK 413
+ +L N +++ ++ L Q + + + +G+ + L P DF LN+
Sbjct: 315 VACRLYNCSERALD-LQLRLEQPSNRQLVICSPSGVSLGQLPPSRY---VDFALNVFPVA 370
Query: 414 LGVQRITGITVFDKLEKITYDSLPDLEIFV 443
+G+Q I+GI + D K Y+ +IFV
Sbjct: 371 VGIQSISGIRITDTFTKRHYEHDDIAQIFV 400
>gi|440908494|gb|ELR58504.1| hypothetical protein M91_16814, partial [Bos grunniens mutus]
Length = 399
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 132/427 (30%), Positives = 209/427 (48%), Gaps = 54/427 (12%)
Query: 14 VMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
VMRL +P+L P+ + P DLF + + DDP
Sbjct: 3 VMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------------- 40
Query: 70 RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT
Sbjct: 41 -------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTS 93
Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFK 189
QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +FFK
Sbjct: 94 SQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFK 152
Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
F V PL V+TK + FLEA I+N T S ++M++V EPS ++ L
Sbjct: 153 FQVLKPLDVKTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVAELN 206
Query: 250 ADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
+ + SR +P YLY LK + ++G V+GKL
Sbjct: 207 SVNQAGECVTTFGSRAYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLD 259
Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 260 IVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER-- 317
Query: 368 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 427
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ + D
Sbjct: 318 TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDT 374
Query: 428 LEKITYD 434
K TY+
Sbjct: 375 FLKRTYE 381
>gi|26351063|dbj|BAC39168.1| unnamed protein product [Mus musculus]
Length = 354
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 117/350 (33%), Positives = 189/350 (54%), Gaps = 16/350 (4%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
+L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT QR L L S + V +
Sbjct: 1 MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
+ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V+TK +
Sbjct: 60 KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119
Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
+ + FLEA I+N T S ++M++V EPS ++ T L + + + SR
Sbjct: 120 SDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVTQAGECISTFGSRGY 179
Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
+P YLY LK + ++G V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232
Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
+ ++ L++ +P V +++PF + K+TN +++ ++ L +++
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSER---MMDLVLEMCNTNSIHWC 289
Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
I+G ++ L P + L L+++ G+Q ++G+ + D K TY+
Sbjct: 290 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 336
>gi|402871693|ref|XP_003899788.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 1 [Papio anubis]
gi|380816684|gb|AFE80216.1| hypothetical protein LOC80006 isoform 3 [Macaca mulatta]
gi|380816686|gb|AFE80217.1| hypothetical protein LOC80006 isoform 3 [Macaca mulatta]
gi|380816688|gb|AFE80218.1| hypothetical protein LOC80006 isoform 3 [Macaca mulatta]
gi|380816690|gb|AFE80219.1| hypothetical protein LOC80006 isoform 3 [Macaca mulatta]
Length = 412
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 136/429 (31%), Positives = 213/429 (49%), Gaps = 46/429 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEV------FLEAQIQNMTTSPMFMEKVSLEPSIMYNVTE 217
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + + SR +P YLY LK + ++G V+GK
Sbjct: 218 LNSVTQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 270
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 271 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 330
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
+ + +S + G+ L + S L L+++ G+Q I+G+ +
Sbjct: 331 TMDLVLEMCNTNS-----IHWCGISGRQLGKLHPSSSLSLALTLLSSVQGLQSISGLRLT 385
Query: 426 DKLEKITYD 434
D K TY+
Sbjct: 386 DTFLKRTYE 394
>gi|56789267|gb|AAH88172.1| Similar to RIKEN cDNA 2410002O22 gene [Rattus norvegicus]
Length = 359
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 188/353 (53%), Gaps = 15/353 (4%)
Query: 84 LSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPV 143
L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT QR L L S + V
Sbjct: 2 LGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAV 60
Query: 144 ESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVR 203
++ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V+TK
Sbjct: 61 AELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFY 120
Query: 204 VVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--S 261
+ + + FLEA I+N T S ++M++V EPS ++ T L + + + S
Sbjct: 121 NAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVNQAGECVSTFGS 180
Query: 262 REIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQ 321
R +P YLY LK + ++G V+GKL I W+TNLGE GRLQ
Sbjct: 181 RGYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQ 233
Query: 322 TQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEE 381
T Q+ ++ L++ +P V +++PF + K+TN + + ++ L ++
Sbjct: 234 TSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTTSI 291
Query: 382 KVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
I+G ++ L P + L L+++ G+Q ++G+ + D K TY+
Sbjct: 292 HWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 341
>gi|68270943|gb|AAY88966.1| hypothetical protein FLJ13611 [Homo sapiens]
Length = 355
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 116/350 (33%), Positives = 188/350 (53%), Gaps = 15/350 (4%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
+L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT QR L L S + V +
Sbjct: 1 MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
+ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V+T+ +
Sbjct: 60 KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKFFKFQVLKPLDVKTRFYNAE 119
Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
+ + FLEA I+N T S ++M++V EPS ++ T L + + + SR
Sbjct: 120 SDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFGSRAY 179
Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
+P YLY K + + ++G V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCPKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232
Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
+ ++ L++ +P V +++PF + K+TN + + ++ L +++
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWC 290
Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
I+G ++ L P + L L+++ G+Q I+G+ + D K TY+
Sbjct: 291 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 337
>gi|25149716|ref|NP_741009.1| Protein C56C10.7, isoform a [Caenorhabditis elegans]
gi|75019616|sp|Q95QQ2.1|U533_CAEEL RecName: Full=UPF0533 protein C56C10.7
gi|351060501|emb|CCD68177.1| Protein C56C10.7, isoform a [Caenorhabditis elegans]
Length = 401
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 135/453 (29%), Positives = 221/453 (48%), Gaps = 63/453 (13%)
Query: 1 MSSTPGTHS----LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLIS 56
M+ P + S LA RVMRL RP + P D F DP+ + L++
Sbjct: 1 MADNPASSSSQQLLALRVMRLARP--------KFAPVDGFS-----HDPVDPTGFGELLA 47
Query: 57 SDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLE 116
V S+++ SR + + L+ PQ F IYLGETF Y+++ N S
Sbjct: 48 GKV-----SEISKESR-------QDLPIGEYLIAPQMFENIYLGETFTFYVNVVNESEKT 95
Query: 117 VRDVVIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS 175
V V +K E+QT QR++L + +ES + G+ ++ H+VKE+G H L+C+ Y
Sbjct: 96 VSSVSLKCELQTSTQRVVLPCSVQDATIESSKCEGQ---VISHEVKEIGQHILICSVNYK 152
Query: 176 DGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQV 235
GE Y +FFKF VS P+ V+TK + A Q++ +LEA IEN + +N+++++V
Sbjct: 153 TSNGENMYFRKFFKFPVSKPIDVKTKFYSAEDNAN--QDV-YLEAQIENTSNANMFLEKV 209
Query: 236 EFEPSQNWSATMLKADGPHSDYNAQSREIFKPP-----VLIRSGGGIHNYLYQLKMLSHG 290
E +PSQ+++ T + H D ++ KP + + +HN L + S
Sbjct: 210 ELDPSQHYNVTSIA----HEDEFGDVGKLLKPKDIRQFLFCLTPADVHNTLGYKDLTS-- 263
Query: 291 SSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDK 350
+GKL ++WRT++GE GRLQT + ++ L+V + P+ V + K
Sbjct: 264 ------------IGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVDVQK 311
Query: 351 PFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLI 410
PF + +L N +++ ++ L Q + +G+ + L P + DF LN+
Sbjct: 312 PFEVSCRLYNCSERALD-LQLRLEQPSNRHLVFCSPSGVSLGQLPPSQ---HVDFSLNVF 367
Query: 411 ATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
+G+Q I+GI + D K Y+ +IFV
Sbjct: 368 PVTVGIQSISGIRITDTFTKRIYEHDDIAQIFV 400
>gi|410948701|ref|XP_003981069.1| PREDICTED: UPF0533 protein C5orf44 homolog isoform 2 [Felis catus]
Length = 355
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 116/350 (33%), Positives = 186/350 (53%), Gaps = 15/350 (4%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
+L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT QR L L S + V +
Sbjct: 1 MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
+ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V+TK +
Sbjct: 60 KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119
Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
+ + FLEA I+N T S ++M++V EPS ++ L + + SR
Sbjct: 120 SDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVAELNSVSQAGECVTTFGSRAY 179
Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
+P YLY LK + ++G V+GKL I W+TNLGE GRLQT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 232
Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
+ ++ L++ +P V +++PF + K+TN + + ++ L +++
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWC 290
Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
I+G ++ L P + L L+++ G+Q ++G+ + D K TY+
Sbjct: 291 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 337
>gi|145352717|ref|XP_001420684.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580919|gb|ABO98977.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 478
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 132/393 (33%), Positives = 200/393 (50%), Gaps = 52/393 (13%)
Query: 85 SGLLVLPQAFGAIYLGETFCSYISINN------SSTLEVRDVVIKAEIQTDKQRILLLDT 138
SG L LPQ+FGA+ LGE F S+++ N ++ R++ IK E+QT+ +R L D
Sbjct: 63 SGELTLPQSFGAVALGERFSSFVTFGNFSEPTSGASGTAREIGIKVELQTETRRTTLRDG 122
Query: 139 SKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSV 198
+K+P+E++R G + D IV D+KELGAHTLVC+A Y D GERKY PQ+FKF V+NPLSV
Sbjct: 123 TKTPIETLRPGEKVDLIVTKDLKELGAHTLVCSATYYDAAGERKYSPQYFKFNVANPLSV 182
Query: 199 RTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE-----------PSQNWSATM 247
RTKVR G FLE CIEN T+ L +D F+ P +A
Sbjct: 183 RTKVRAAPRGR------AFLEVCIENTTRYALLLDSARFDTVDGILAKDMTPEFGGAAAT 236
Query: 248 LKA--DGPHSDYNA-QSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
L D P + + R +++ L S G H YL+++ ++ S P+ Q LG
Sbjct: 237 LHGVDDSPDAGLPSLGKRAVYR---LDPSTGAAH-YLFEITR-ANASEEPLTPQ--TQLG 289
Query: 305 KLQITWRTNLGEPGRLQTQQI----LGTTITS---KEIELNVVEVP--------SVVGID 349
KL++ WR +G+PGRLQTQ I G+T S ++ +++ P S V +
Sbjct: 290 KLELRWRGAMGDPGRLQTQVITAGSAGSTAPSPVAAKMRQSIIVHPRPPDAEDVSTVYAE 349
Query: 350 KPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNL 409
PF+L+ + + + + D V I+G R + + + + + +
Sbjct: 350 TPFILRAAVEALAPIKADACVVRV----KDVVSGVYIDGPRAVRVGALSPGQTVNVDIPC 405
Query: 410 IATKLGVQRITGITVFDKLEKITYDSLPDLEIF 442
+A LGVQ + + D ++ + LE+F
Sbjct: 406 VALGLGVQTCPSLVLCDAVDDAARAAPAPLEVF 438
>gi|26379545|dbj|BAB29083.2| unnamed protein product [Mus musculus]
Length = 355
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 116/350 (33%), Positives = 188/350 (53%), Gaps = 15/350 (4%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
+L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT QR L L S + V +
Sbjct: 1 MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
+ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V+TK +
Sbjct: 60 KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119
Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
+ + FLEA I+N T S ++M++V EPS ++ T L + + + SR
Sbjct: 120 SDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVTQAGECISTFGSRGY 179
Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
+P YLY LK + ++G V+GKL I W+TNLGE GR+QT Q
Sbjct: 180 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRVQTNQ 232
Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
+ ++ L++ +P V +++PF + K+TN + + ++ L +++
Sbjct: 233 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSERM--MDLVLEMCNTNSIHWC 290
Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
I+G ++ L P + L L+++ G+Q ++G+ + D K TY+
Sbjct: 291 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 337
>gi|7452545|pir||T15846 hypothetical protein C56C10.7 - Caenorhabditis elegans
Length = 398
Score = 187 bits (476), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 133/453 (29%), Positives = 218/453 (48%), Gaps = 66/453 (14%)
Query: 1 MSSTPGTHS----LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLIS 56
M+ P + S LA RVMRL RP + P D F DP+ + L++
Sbjct: 1 MADNPASSSSQQLLALRVMRLARP--------KFAPVDGFS-----HDPVDPTGFGELLA 47
Query: 57 SDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLE 116
V S+++ SR + + L+ PQ F IYLGETF Y+++ N S
Sbjct: 48 GKV-----SEISKESR-------QDLPIGEYLIAPQMFENIYLGETFTFYVNVVNESEKT 95
Query: 117 VRDVVIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS 175
V V +K E+QT QR++L + +ES + G+ ++ H+VKE+G H L+C+ Y
Sbjct: 96 VSSVSLKCELQTSTQRVVLPCSVQDATIESSKCEGQ---VISHEVKEIGQHILICSVNYK 152
Query: 176 DGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQV 235
GE Y +FFKF VS P+ V+TK + + +LEA IEN + +N+++++V
Sbjct: 153 TSNGENMYFRKFFKFPVSKPIDVKTKFYSAE------NQDVYLEAQIENTSNANMFLEKV 206
Query: 236 EFEPSQNWSATMLKADGPHSDYNAQSREIFKPP-----VLIRSGGGIHNYLYQLKMLSHG 290
E +PSQ+++ T + H D ++ KP + + +HN L + S
Sbjct: 207 ELDPSQHYNVTSIA----HEDEFGDVGKLLKPKDIRQFLFCLTPADVHNTLGYKDLTS-- 260
Query: 291 SSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDK 350
+GKL ++WRT++GE GRLQT + ++ L+V + P+ V + K
Sbjct: 261 ------------IGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVDVQK 308
Query: 351 PFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLI 410
PF + +L N +++ ++ L Q + +G+ + L P + DF LN+
Sbjct: 309 PFEVSCRLYNCSERALD-LQLRLEQPSNRHLVFCSPSGVSLGQLPPSQ---HVDFSLNVF 364
Query: 411 ATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
+G+Q I+GI + D K Y+ +IFV
Sbjct: 365 PVTVGIQSISGIRITDTFTKRIYEHDDIAQIFV 397
>gi|432104588|gb|ELK31200.1| hypothetical protein MDA_GLEAN10025801 [Myotis davidii]
Length = 396
Score = 187 bits (475), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 131/426 (30%), Positives = 208/426 (48%), Gaps = 54/426 (12%)
Query: 15 MRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYR 70
MRL +P+L P+ + P DLF + + DDP
Sbjct: 1 MRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV---------------------- 37
Query: 71 SRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDK 130
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT
Sbjct: 38 ------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSS 91
Query: 131 QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKF 190
QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF
Sbjct: 92 QR-LNLSASNAAVSELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKF 150
Query: 191 IVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKA 250
V PL V+TK + FLEA I+N T S ++M++V EPS ++ L +
Sbjct: 151 QVLKPLDVKTKFYNAETDE------VFLEAQIQNITTSPMFMEKVSLEPSIMYNVAELNS 204
Query: 251 DGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQI 308
+ SR +P YLY LK + ++G V+GKL I
Sbjct: 205 VNQAGECVTTFGSRTYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDI 257
Query: 309 TWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGP 368
W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 258 VWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVILEEPFHITCKITNCSSER--T 315
Query: 369 FEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKL 428
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ + D
Sbjct: 316 MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTF 372
Query: 429 EKITYD 434
K TY+
Sbjct: 373 LKRTYE 378
>gi|449682850|ref|XP_002166018.2| PREDICTED: UPF0533 protein C5orf44 homolog [Hydra magnipapillata]
Length = 409
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 134/424 (31%), Positives = 206/424 (48%), Gaps = 54/424 (12%)
Query: 8 HSLAFRVMRLCRPS----LHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H L +VMRL +PS LHV P DLF E + +D++ K
Sbjct: 10 HLLVLKVMRLTKPSIKSPLHVTAEEHDFPGDLFYNE---------------MMNDISALK 54
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
A+ + + +L LPQAFG+IYLGETF YISI N S +D+ +K
Sbjct: 55 G--------------AEEMAVGEILSLPQAFGSIYLGETFSCYISILNDSNQCCKDISVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
++QT QR L T+ P + + D ++ ++VKELG H L+C YS GE+ Y
Sbjct: 101 TDMQTATQRFQL--TAFKPKDMLSPDQSVDDVISYEVKELGTHILICAVTYSSQSGEKLY 158
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+ +F+KF V PL V+TK + ++ FLEA ++N T SN+ M+QV EPSQ +
Sbjct: 159 MRRFYKFQVLKPLEVKTKFYNGQ------NDLVFLEAQVQNITTSNMCMEQVTLEPSQFY 212
Query: 244 SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
L + + + P+ R YL++L + S ++ + +
Sbjct: 213 HVQSLNFLPKDNKLDGVYGCSYMNPMDTR------QYLFKL-LPKCDDSKEMRTKPPLSI 265
Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT- 362
GKL I WRTN GE GRLQT Q+ T + ++++L ++E P VV ++K F +K +L N +
Sbjct: 266 GKLDIVWRTNFGETGRLQTSQLQRMTPSERDVKLVLIEAPDVVSLEKQFQIKCRLENSSP 325
Query: 363 DKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGI 422
K + + N+S ++ G+ L P+ D L L+A + G I G+
Sbjct: 326 AKIEAKLFLTNPHNNS-----MLWCGISGKILGPLPQGSHLDITLLLLAIRPGFHSIGGV 380
Query: 423 TVFD 426
+ D
Sbjct: 381 RIQD 384
>gi|308502446|ref|XP_003113407.1| hypothetical protein CRE_26256 [Caenorhabditis remanei]
gi|308263366|gb|EFP07319.1| hypothetical protein CRE_26256 [Caenorhabditis remanei]
Length = 398
Score = 186 bits (471), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 132/450 (29%), Positives = 215/450 (47%), Gaps = 64/450 (14%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
+S++ LA RVMRL RP DP D P ++
Sbjct: 5 LSNSSTQQMLALRVMRLARPKFAPVGGFSHDPVD------------------PTGFGELL 46
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
K S+L+ SR + + + L+ PQ F IYLGETF Y+++ N S V +V
Sbjct: 47 AGKVSELSKESR-------NDLPIGDYLIAPQMFENIYLGETFTFYVNVVNESETSVVNV 99
Query: 121 VIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
+K E+QT QR++L + +ES + G+ ++ H+VKE+G H L+C+ Y G
Sbjct: 100 SLKCELQTSTQRVVLPCSVQDTTIESSKCDGQ---VISHEVKEIGQHILICSVNYKTLSG 156
Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
E Y +FFKF VS P+ V+TK + + +LEA IEN + S++++++VE +P
Sbjct: 157 ENMYFRKFFKFPVSKPIDVKTKFYSAE------NQDVYLEAQIENTSNSSMFLERVELDP 210
Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
SQ++ T + H D + ++ KP I +L+ L SP+ V
Sbjct: 211 SQHYKVTSVS----HEDEFPEVGKLLKP-------KDIRQFLFCL--------SPIDVNN 251
Query: 300 S------NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFL 353
+ +GKL ++WRT++GE GRLQT + ++ L+V P+ V + KPF
Sbjct: 252 TLGYKDLTSIGKLDMSWRTSMGEKGRLQTSALQRIAPGFGDVRLSVENTPACVDVQKPFE 311
Query: 354 LKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATK 413
+ +L N +++ ++ L Q + +G+ + L P + DF LN+
Sbjct: 312 VACRLYNCSERALD-LQLRLEQPSNRHLVFCSPSGVSLGQLPPSQY---VDFTLNVFPVA 367
Query: 414 LGVQRITGITVFDKLEKITYDSLPDLEIFV 443
+G+Q I+GI + D K Y+ +IFV
Sbjct: 368 VGIQSISGIRITDTFTKRIYEHDDIAQIFV 397
>gi|26368656|dbj|BAB26869.2| unnamed protein product [Mus musculus]
Length = 349
Score = 185 bits (470), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 117/350 (33%), Positives = 186/350 (53%), Gaps = 21/350 (6%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
+L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT QR L L S + V +
Sbjct: 1 MLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASNAAVAEL 59
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVK 206
+ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V+TK +
Sbjct: 60 KPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFYNAE 119
Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREI 264
FLEA I+N T S ++M++V EPS ++ T L + + + SR
Sbjct: 120 TDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVTQAGECISTFGSRGY 173
Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
+P YLY LK + ++G V+GKL I W+TNLGE GRLQT Q
Sbjct: 174 LQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQ 226
Query: 325 ILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVV 384
+ ++ L++ +P V +++PF + K+TN + + ++ L +++
Sbjct: 227 LQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSERM--MDLVLEMCNTNSIHWC 284
Query: 385 MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
I+G ++ L P + L L+++ G+Q ++G+ + D K TY+
Sbjct: 285 GISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 331
>gi|410929303|ref|XP_003978039.1| PREDICTED: UPF0533 protein C5orf44 homolog [Takifugu rubripes]
Length = 426
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 137/432 (31%), Positives = 214/432 (49%), Gaps = 38/432 (8%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL + LP I +
Sbjct: 10 HLLALKVMRLTKPTLFTNLPVTCEERDL-------PGVTVSECLPSYIGPAIN------- 55
Query: 68 TYRSRFL-LHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEI 126
+RS L L A +G S P+ I+LGETF SYIS++N S+ V+D+++KA++
Sbjct: 56 -WRSITLPLAQLAAGMG-SSAPSDPRTVN-IFLGETFSSYISVHNDSSQVVKDILVKADL 112
Query: 127 QTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQ 186
QT QR L L S S V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +
Sbjct: 113 QTSSQR-LNLSASNSAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTSQYGEKLYFRK 171
Query: 187 FFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
FFKF V PL V+TK + + + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 172 FFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSMMYNVT 231
Query: 247 MLKA----DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
L D + S + P+ R YLY LK + ++G V
Sbjct: 232 ELNTITSRDTEECTFGKMS---YLQPMDTR------QYLYCLKPKPEYAEKAGVIKGVTV 282
Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT 362
+GKL I W+TNLGE GRLQT Q+ +I L++ +P V +++PF + K+TN +
Sbjct: 283 IGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDIRLSLEMIPDTVNLEEPFDIICKITNCS 342
Query: 363 DKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGI 422
++ ++ L ++ +G ++ L+P S L L ++ G+Q ++G+
Sbjct: 343 ERT---MDLVLEMCNTASTHWCGTSGRKLGKLSPA---ASLSLPLTLFSSVQGLQSVSGL 396
Query: 423 TVFDKLEKITYD 434
+ D K TY+
Sbjct: 397 RLKDTFLKRTYE 408
>gi|405970753|gb|EKC35629.1| UPF0533 protein C5orf44-like protein [Crassostrea gigas]
Length = 395
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 124/366 (33%), Positives = 182/366 (49%), Gaps = 17/366 (4%)
Query: 83 GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSP 142
GL LL LPQ FG I+LGETF SYIS++N ST + RD+ +K ++QT QR++L
Sbjct: 44 GLGDLLTLPQNFGNIFLGETFSSYISVHNDSTQQCRDITLKIDLQTTSQRLMLSGADVPA 103
Query: 143 VESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKV 202
+ + D ++ H+VKELG H LVC Y+ E+ +FFKF V PL V+TK
Sbjct: 104 TDELGPDQSIDDVIHHEVKELGTHILVCAVSYTTNNYEKMAFRKFFKFQVLKPLDVKTKF 163
Query: 203 RVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSR 262
+ + +LEA I+N T +YMD V EPS + T L ++ +
Sbjct: 164 YNAE------SDEVYLEAQIQNITPGPIYMDHVSLEPSSQYLCTPL-----NNTEGKDQK 212
Query: 263 EIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
E+ V + I YLY L ++G +GK+ I W+TNLGE GRLQT
Sbjct: 213 EMVFGKVNYLNPMDIRQYLYCLVPKPEVIKQNKVMKGVTDIGKIDIVWKTNLGERGRLQT 272
Query: 323 QQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEK 382
Q+ +I++ + E P V ++ F + ++TN ++ + L N
Sbjct: 273 SQLQRVAPGYGDIKVTLEETPDSVVLESSFNIICRITNCCERTMD-LTLTLQNNQPSGLL 331
Query: 383 VVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITY--DSLPDLE 440
I+G ++ LAP E D L LIAT G+Q I+G+ + D K TY D L +
Sbjct: 332 WTGISGRQLGKLAPKENL---DLRLTLIATIPGLQTISGLRITDNFLKRTYEHDELASVF 388
Query: 441 IFVDQD 446
I+ D +
Sbjct: 389 IYNDSN 394
>gi|158294379|ref|XP_315565.3| AGAP005561-PA [Anopheles gambiae str. PEST]
gi|157015536|gb|EAA11831.3| AGAP005561-PA [Anopheles gambiae str. PEST]
Length = 429
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 128/444 (28%), Positives = 215/444 (48%), Gaps = 45/444 (10%)
Query: 5 PGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
P H LA +VMRL RP+L L +P D + + ++ SD T+
Sbjct: 4 PTEHLLALKVMRLTRPTLISPQILTAEPKD-----------VPQYSFQKILHSDATSVAG 52
Query: 65 SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
+ +F+L LPQ+FG IYLGETF SY+ ++N V +V +KA
Sbjct: 53 CETITAGQFML--------------LPQSFGNIYLGETFSSYVCVHNCRAHPVTNVSVKA 98
Query: 125 EIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
++Q++ R+ L + K+ ++ D ++ H+VKE+G H LVC Y G
Sbjct: 99 DLQSNNSRVSLPIHADKTGPVTLNPEETLDDVIHHEVKEIGTHILVCEVSYMTPAGLETS 158
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + + +LEA I+N T + +++VE E S+ +
Sbjct: 159 FRKFFKFQVVKPLDVKTKFYNAET------DDVYLEAQIQNITVGPICLEKVELESSEQY 212
Query: 244 SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
+ L P + S+ + +P +LY ++ + + P ++ +N +
Sbjct: 213 TVVSLNT-LPSGESVFSSKTMLQP-------QNSCQFLYCIRPIPEIARDPSALKAANNI 264
Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
GKL I WR+NLGE GRLQT Q+ + ++ LNV+E S V I + F + ++TN ++
Sbjct: 265 GKLDIVWRSNLGERGRLQTSQLQRCALEYSDLRLNVIEANSTVRIGEGFDFRCRVTNTSE 324
Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
+ ++ +S N + + G+ AL P+E +F L + +LG+ I+ +
Sbjct: 325 RS---MDLLMSLN-TKAKPGCGYTGVTEFALGPLEPGQMKEFPLTVCPVRLGLIVISALQ 380
Query: 424 VFDKLEKITYDSLPDLEIF-VDQD 446
+ D K Y+ L++F VD+D
Sbjct: 381 LTDVFTKRKYEFDNFLQVFVVDED 404
>gi|49115693|gb|AAH73045.1| MGC82662 protein [Xenopus laevis]
Length = 369
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 119/359 (33%), Positives = 190/359 (52%), Gaps = 21/359 (5%)
Query: 79 ADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDT 138
A+ + L +L LPQ FG I+LGETF SYIS++N S V+DV +KA++QT QR L L
Sbjct: 11 AEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDVQVKADLQTSSQR-LNLSA 69
Query: 139 SKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSV 198
S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V
Sbjct: 70 SSAVVADLKPDSCIDDVIHHEVKEIGTHILVCAVSYTIQSGEKMYFRKFFKFQVLKPLDV 129
Query: 199 RTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYN 258
+TK + FLEA I+N T S ++M++V EPS ++ + L + D+
Sbjct: 130 KTKFYNAETDEV------FLEAQIQNITTSPMFMEKVSLEPSIMYNVSELNTVITNGDWK 183
Query: 259 AQSR---EIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLG 315
S + + P+ R YLY LK + ++G V+GKL I W+TNLG
Sbjct: 184 GSSTFGTKTYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIGKLDIVWKTNLG 237
Query: 316 EPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQ 375
E GRLQT Q+ ++ L++ +P V +++PF + K+TN + + ++ L
Sbjct: 238 ERGRLQTSQLQRMAPGYGDVRLSIETIPDTVRLEEPFDITCKITNCSSER--TMDLVLEM 295
Query: 376 NDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
+++ ++G ++ L P + T L+ + G+Q ++G+ + D K TY+
Sbjct: 296 CNTNAIHWSGVSGRQLGKLHPSSSLHLTLTLLSSVQ---GLQSVSGLRLTDTFLKRTYE 351
>gi|346470407|gb|AEO35048.1| hypothetical protein [Amblyomma maculatum]
Length = 416
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 130/434 (29%), Positives = 206/434 (47%), Gaps = 52/434 (11%)
Query: 10 LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
LA +VMRL RPSL P+ D D I S + D+ +L
Sbjct: 8 LALKVMRLTRPSLFTTVPVVCDSRD-----------IPGSMWMQELKQDLGAPLGLEL-- 54
Query: 70 RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
G+ L+LPQ+FG IYLGETF Y+S++N S VRDV ++AE+QTD
Sbjct: 55 ------------FGMGSFLMLPQSFGNIYLGETFSCYMSVHNDSQTTVRDVSVRAELQTD 102
Query: 130 KQRILLLDTSKSP--VESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
Q++ L + P V + D ++ H+VK++ H LVCT YS G++ + +F
Sbjct: 103 SQKVFLTGRTDGPAVVAELAPNCSIDEVIHHEVKDINTHILVCTVNYSTQAGDKMHFRKF 162
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + +LEA ++N T + + +++V EPS +++
Sbjct: 163 FKFQVYKPLDVKTKFYNAE------SDEVYLEAQLQNITSTPICLEKVALEPSSHFNVCQ 216
Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK-MLSHGSSSPVKVQ------GS 300
L G S+ +F V + YL+ L L S + VQ G
Sbjct: 217 LNTCG-------DSQSVFG-SVNFLNPHDTRQYLFSLSPRLPPSEPSSLAVQPDRRRSGI 268
Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
+GKL I WR+ +GE GRLQT Q+ ++I+L + PS V +++PF + +TN
Sbjct: 269 TSIGKLDIIWRSAMGERGRLQTSQLERIAPGYEDIKLTIESAPSTVNLEEPFEIACSVTN 328
Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT 420
Q ++ L+ ++ ++ G +L +E + + L + + G+Q ++
Sbjct: 329 TC---QRVMDLVLALENAPSSG-LLWQGTSGQSLGKLEPQATVNLKLEAVPFRTGLQGVS 384
Query: 421 GITVFDKLEKITYD 434
GI + D K TYD
Sbjct: 385 GIKLSDTYLKQTYD 398
>gi|195473563|ref|XP_002089062.1| GE18914 [Drosophila yakuba]
gi|194175163|gb|EDW88774.1| GE18914 [Drosophila yakuba]
Length = 438
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 132/430 (30%), Positives = 217/430 (50%), Gaps = 52/430 (12%)
Query: 5 PGTHSLAFRVMRLCRPSL-HVEPPLRVDPTDLF--IGEDIFDDPIAASNLPPLISSDVTT 61
P H +A +VMRL RP+L + P + +PTDL G D IA +
Sbjct: 6 PDAHLVALKVMRLMRPTLVGLGPVVTCEPTDLVQRFGNSQESDGIAGA------------ 53
Query: 62 NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
A+++ +L+LPQ+FG+IYLGETF SYI ++N++ V V
Sbjct: 54 ----------------CAETLAAGQVLLLPQSFGSIYLGETFASYICVHNTTPHPVECVT 97
Query: 122 IKAEIQTDKQRILLL--DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
+KA++Q++ RI L + +KSPV + GG D ++ ++VKE+G H LVC YS G
Sbjct: 98 VKADLQSNTTRINLSMHENAKSPV-VLPPGGTIDDVIRYEVKEIGTHILVCEVNYSTPAG 156
Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
+ L +FFKF V PL V+TK ++ EI +LEA I+N T S +++VE +
Sbjct: 157 YAQSLRKFFKFQVLKPLDVKTKFYNAEI-----DEI-YLEAQIQNVTTSPFCLEKVELDG 210
Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
S+++S T L P+ + + + +P +LY +K + + ++
Sbjct: 211 SEDYSVTPLNT-LPNGESVFTVKHMLQP-------NNSCQFLYCIKPKGDIAKNVDTLRQ 262
Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
N +GKL I WR+NLGE GRLQT Q+ K + L V++ + + I F ++T
Sbjct: 263 FNNVGKLDIVWRSNLGEKGRLQTSQLQRLPFECKTLRLEVLDAKNTIKIGTIFTFNCRVT 322
Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
N T + + L+ S + + G L P+++ S +F L++ +KLG+ +I
Sbjct: 323 N-TSEHTMKLNVRLAAKFSADSQYT---GCADFMLNPLQSGESAEFPLSVCPSKLGLVKI 378
Query: 420 TGITVFDKLE 429
+ + + + L+
Sbjct: 379 SPLVLTNTLQ 388
>gi|354491687|ref|XP_003507986.1| PREDICTED: UPF0533 protein C5orf44 homolog, partial [Cricetulus
griseus]
Length = 299
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 114/320 (35%), Positives = 167/320 (52%), Gaps = 35/320 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 223
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + + SR +P YLY LK + ++G V+GK
Sbjct: 224 LNSVTQAGECVSTFGSRGYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276
Query: 306 LQITWRTNLGEPGRLQTQQI 325
L I W+TNLGE GRLQT Q+
Sbjct: 277 LDIVWKTNLGERGRLQTSQL 296
>gi|34365494|emb|CAE46070.1| hypothetical protein [Homo sapiens]
gi|119571731|gb|EAW51346.1| hypothetical protein FLJ13611, isoform CRA_d [Homo sapiens]
Length = 309
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 114/320 (35%), Positives = 166/320 (51%), Gaps = 41/320 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDE------VFLEAQIQNMTTSPMFMEKVSLEPSIMYNVTE 217
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + + + SR +P YLY LK + + ++G V+GK
Sbjct: 218 LNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKNEFAEKAGIIKGVTVIGK 270
Query: 306 LQITWRTNLGEPGRLQTQQI 325
L I W+TNLGE GRLQT Q+
Sbjct: 271 LDIVWKTNLGERGRLQTSQL 290
>gi|348551658|ref|XP_003461647.1| PREDICTED: UPF0533 protein C5orf44 homolog [Cavia porcellus]
Length = 479
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 129/425 (30%), Positives = 211/425 (49%), Gaps = 42/425 (9%)
Query: 14 VMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRF 73
VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 75 VMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------------ 111
Query: 74 LLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRI 133
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT QR+
Sbjct: 112 --VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQRL 169
Query: 134 LLLDTSKSPVESIRAGGRYDFIVEHDVK-ELGAHT-LVCTALYSDGEGERKYLPQFFKFI 191
L ++ + E +F V E+ ++ LVC Y+ GE+ Y +FFKF
Sbjct: 170 NLSASNAAVAELKPDSVMSNFCYLQTVCLEICSYIGLVCAVSYTTQGGEKMYFRKFFKFQ 229
Query: 192 VSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKAD 251
V PL V+TK + + + FLEA I+N T S ++M++V EPS +S T L +
Sbjct: 230 VLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYSVTELNSV 289
Query: 252 GPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQIT 309
+ + SR +P YLY LK + ++G V+GKL I
Sbjct: 290 SQAGERVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIV 342
Query: 310 WRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPF 369
W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN +++
Sbjct: 343 WKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSERT---M 399
Query: 370 EIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLE 429
++ L D+ ++G ++ L P + G L L+++ G+Q ++G+ + D
Sbjct: 400 DLVLEMCDTSSVHWCGVSGRQLGKLLPSASLG---LALTLLSSVQGLQSVSGLRLTDTFL 456
Query: 430 KITYD 434
K TY+
Sbjct: 457 KRTYE 461
>gi|330801295|ref|XP_003288664.1| hypothetical protein DICPUDRAFT_34383 [Dictyostelium purpureum]
gi|325081286|gb|EGC34807.1| hypothetical protein DICPUDRAFT_34383 [Dictyostelium purpureum]
Length = 509
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 127/448 (28%), Positives = 216/448 (48%), Gaps = 40/448 (8%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
M H L +VMRL +P++ P+ + DL +S +
Sbjct: 1 MEKEKENHLLNLKVMRLSKPNIPTINPILCEKDDLAYESMGLGSNSGSSGNNSGSGTSSP 60
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQ---AFGAIYLGETFCSYISINNSSTLEV 117
++ S + + + + G+ GL + P G IYLGE FC YIS+NN S +V
Sbjct: 61 SSPGSAAVEQQLINVSSNTGTNGIEGLGLTPMLQLQSGVIYLGEVFCCYISLNNHSPYQV 120
Query: 118 RDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG 177
DV +K E+QT QRI LLD+ K+PV S G DF+V+ +VKE G + LVC YS
Sbjct: 121 TDVYLKVELQTTSQRICLLDSEKNPVPSFSPGFSSDFVVQREVKESGINILVCAVNYSSP 180
Query: 178 EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
EGE+K ++FKF V NPL ++T++ + I FLEAC+EN T+ +L+++ + F
Sbjct: 181 EGEQKKFRKYFKFQVMNPLVLKTRIH-------NLPNIIFLEACLENATQGSLFIESIVF 233
Query: 238 EPSQNWSATMLKADG--------------------PHSDYNAQSREI-FKPPVLIRSGGG 276
+P ++ + + + D N+ +I ++ G
Sbjct: 234 DPIDLFTCKDISFEKNLIENNNSDIDNSNSNNVDNSNIDNNSLLSKIKISNDIVFLKQGS 293
Query: 277 IHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIE 336
YL+Q+ ++ + + S LG+L ITWR+ GE G+L+T I + +++IE
Sbjct: 294 SRQYLFQIIPKDPNNN---ETKTSATLGRLDITWRSYFGEIGKLKTAGI-QRKLGNEDIE 349
Query: 337 LNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAP 396
+ +P ++ ++KPF + KL N++++ P + L +N D K+ + + P
Sbjct: 350 AVLSNIPQLIKLEKPFNITAKLINKSNRTLYP-QFVLIRNKMDGIKI----NSHLPKIEP 404
Query: 397 VEAFGSTDFHLNLIATKLGVQRITGITV 424
+ ++ + K G+Q+ITG+ +
Sbjct: 405 ISPNSQVSINVEMFPLKPGMQQITGLAI 432
>gi|225709234|gb|ACO10463.1| UPF0533 protein [Caligus rogercresseyi]
Length = 425
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 129/445 (28%), Positives = 211/445 (47%), Gaps = 49/445 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLF---IGEDIFDDPIAASNLPPLISSDVTTNKS 64
H L+ +VMRL RP + + D D+ + E+ DP + ++P
Sbjct: 14 HPLSLKVMRLSRPRFSSKVMITDDSDDILSRTLMEEHLKDPSSCRDVP------------ 61
Query: 65 SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
L LL+LPQ+FG IYLGETF YIS++N ST + +K
Sbjct: 62 ----------------EAALGRLLILPQSFGMIYLGETFSCYISLHNDSTDPCFSISMKC 105
Query: 125 EIQTDKQRILLLDTSKSP--VESIRAGGRYDFIVEHDVKELGAHTLVCTALY-SDGEGER 181
++QT RI L +K P + + G D ++ H+VK+LG H LVC Y S E+
Sbjct: 106 DLQTMVHRITLYPQNKEPPLQDQLLPGDSIDRVLNHEVKDLGTHILVCEVFYTSPKTQEK 165
Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
+ FKF V PL V+T + F+EA I+N T LY+++V FEPS
Sbjct: 166 SSFRKLFKFEVKKPLDVKTNFH------NSDENEVFVEATIQNATTGCLYLEKVAFEPST 219
Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
+++ T L + ++ N+ +F P +++ YL+ L + ++
Sbjct: 220 HFNVTSLNSIVGLNEDNS----VFGPVNCLQTNDS-RQYLFCLSPKPNFKLDQKLLRSVI 274
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
+GK+ + WRTNLGE GR++T Q+L T +I+ + PSVV + + F + K+ N
Sbjct: 275 AIGKIDVIWRTNLGERGRIKTSQLLRTPPVLNDIQFLIESCPSVVMLHQVFNISAKIFNN 334
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+++ + + +N S +M +G L ++ G +F L+++ G+Q I+G
Sbjct: 335 SERTLELEALCVDKNKSR----LMWSGSTAQKLGLLQPDGCLEFTLSVVPLDTGLQVISG 390
Query: 422 ITVFDKLEKITYDSLPDLEIFVDQD 446
I + D L K Y+ ++FV D
Sbjct: 391 IRILDNLLKRAYEFDDSNQVFVTSD 415
>gi|195339717|ref|XP_002036463.1| GM18092 [Drosophila sechellia]
gi|194130343|gb|EDW52386.1| GM18092 [Drosophila sechellia]
Length = 438
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 135/440 (30%), Positives = 225/440 (51%), Gaps = 50/440 (11%)
Query: 5 PGTHSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
P +H +A +VMRL RP+L + P + +PTDL + ++
Sbjct: 6 PDSHLVALKVMRLMRPTLVGLGPVVTCEPTDLV--------------------QRFSNSQ 45
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
SD + A+++ +L+LPQ+FG+IYLGETF SYI ++N++ V V +K
Sbjct: 46 ESD------GMSGACAETLAAGQVLLLPQSFGSIYLGETFASYICVHNTTPNPVECVTVK 99
Query: 124 AEIQTDKQRILLL--DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
A++Q++ RI L + SKSPV + GG D ++ ++VKE+G H LVC YS G
Sbjct: 100 ADLQSNTSRINLSMHENSKSPV-VLPPGGTIDDVIRYEVKEIGTHILVCEVNYSTPAGYA 158
Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
+ L +FFKF V PL V+TK ++ EI +LEA I+N T S +++VE + S+
Sbjct: 159 QSLRKFFKFQVLKPLDVKTKFYNAEI-----DEI-YLEAQIQNVTTSPFCLEKVELDGSE 212
Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
++S T L P+ + + + +P +LY +K + + ++ N
Sbjct: 213 DYSVTPLNT-LPNGESVFTVKHMLQP-------NNSCQFLYCIKPKGDIAKNVDTLRQFN 264
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
+GKL I WR+NLGE GRLQT Q+ K + L V++ + + I F +LTN
Sbjct: 265 NVGKLDIVWRSNLGEKGRLQTSQLQRLPFECKTLRLEVLDAKNTIKIGTIFTFNCRLTN- 323
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
T + + L+ S + + G L +++ S +F L++ +KLG+ +IT
Sbjct: 324 TSEHPMKLNVRLAAKFSPDSQYT---GCADFMLNLLQSGESAEFPLSVCPSKLGLVKITP 380
Query: 422 ITVFDKL--EKITYDSLPDL 439
+ + + L E+ T +++ D+
Sbjct: 381 LVLTNTLQNEQFTIENVVDV 400
>gi|341892426|gb|EGT48361.1| hypothetical protein CAEBREN_24983, partial [Caenorhabditis
brenneri]
Length = 374
Score = 175 bits (443), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 199/387 (51%), Gaps = 34/387 (8%)
Query: 58 DVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEV 117
++ K S+L+ +R HD + + L+ PQ F IYLGETF Y+++ N S V
Sbjct: 20 EILAGKVSELSKETR---HD----LPIGEYLIAPQMFENIYLGETFTFYVNVVNESESNV 72
Query: 118 RDVVIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
+V +K E+QT QR+ L + +E+ + G+ ++ H+VKE+G H L+C+ Y
Sbjct: 73 VNVCLKCELQTSTQRVALPCSVQDTIIEASKCDGQ---VISHEVKEIGQHILICSVNYKT 129
Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVE 236
GE Y +FFKF VS P+ V+TK + + +LEA IEN + +N+++++VE
Sbjct: 130 LSGENMYFRKFFKFPVSKPIDVKTKFYSAE------NQDVYLEAQIENTSSANMFLERVE 183
Query: 237 FEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
+PSQ++ T + H D + ++ KP I +L+ L + ++ K
Sbjct: 184 LDPSQHYKVTSIS----HQDEFPEIGKLLKP-------RDIRQFLFCLSPMDANNTLGYK 232
Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
S +GKL ++WRT++GE GRLQT + ++ L+V + P+ V + KPF +
Sbjct: 233 DLTS--IGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVEVQKPFEILC 290
Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
+L N +++ ++ L Q + +G+ + L P + DF LN+ +G+
Sbjct: 291 RLYNCSERALD-LQLRLEQPTNRNLVFCTPSGVSLGQLPPSQY---VDFVLNVFPVAVGI 346
Query: 417 QRITGITVFDKLEKITYDSLPDLEIFV 443
Q I+GI + D K Y+ +IFV
Sbjct: 347 QSISGIRITDTFTKRVYEHDDIAQIFV 373
>gi|194859696|ref|XP_001969431.1| GG10100 [Drosophila erecta]
gi|190661298|gb|EDV58490.1| GG10100 [Drosophila erecta]
Length = 438
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 132/442 (29%), Positives = 223/442 (50%), Gaps = 54/442 (12%)
Query: 5 PGTHSLAFRVMRLCRPSL-HVEPPLRVDPTDLF--IGEDIFDDPIAASNLPPLISSDVTT 61
P H +A +VMRL RP+L + P + +PTDL G D IA +
Sbjct: 6 PDAHLVALKVMRLMRPTLVGLGPVVTCEPTDLVQRFGNSQASDGIAGA------------ 53
Query: 62 NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
A+++ +L+LPQ+FG+IYLGETF SYI ++N++ V V
Sbjct: 54 ----------------CAETLAAGQVLLLPQSFGSIYLGETFASYICVHNTTPNPVECVT 97
Query: 122 IKAEIQTDKQRILLL--DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
+KA++Q++ RI L + +KSPV + GG D ++ ++VKE+G H LVC YS G
Sbjct: 98 VKADLQSNTTRINLSMHENAKSPV-VLPPGGTIDDVIRYEVKEIGTHILVCEVNYSTSAG 156
Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
+ L +FFKF V PL V+TK ++ EI +LEA I+N T S +++VE +
Sbjct: 157 YAQSLRKFFKFQVLKPLDVKTKFYNAEI-----DEI-YLEAQIQNVTTSPFCLEKVELDG 210
Query: 240 SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG 299
S+++S T L P+ + + + +P +LY +K + + ++
Sbjct: 211 SEDYSVTPLNT-LPNGESVFTVKHMLQP-------NNSCQFLYCIKPKGDIAKNVDTLRQ 262
Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
N +GKL I WR+NLGE GRLQT Q+ K + L V++ + + I F ++T
Sbjct: 263 FNNVGKLDIVWRSNLGEKGRLQTSQLQRLPFECKTLRLEVLDAKNTIKIGTIFTFNCRVT 322
Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
N ++ +++ +D + G L +++ S +F L++ +KLG+ +I
Sbjct: 323 NTSEHPMKLNVRLVAKFSADSQ----YTGCADFMLNLLQSGESAEFPLSVCPSKLGLVKI 378
Query: 420 TGITVFDKL--EKITYDSLPDL 439
+ + + + L E+ T +++ D+
Sbjct: 379 SPLVLTNTLQNEQFTIENVVDV 400
>gi|194761714|ref|XP_001963073.1| GF15760 [Drosophila ananassae]
gi|190616770|gb|EDV32294.1| GF15760 [Drosophila ananassae]
Length = 438
Score = 174 bits (441), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 131/440 (29%), Positives = 225/440 (51%), Gaps = 50/440 (11%)
Query: 5 PGTHSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
P H +A +VMRL RP+L + P + +PTDL +
Sbjct: 6 PDAHLVALKVMRLMRPTLVGLGPMVTCEPTDLV--------------------------Q 39
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ T S + A+++ +L+LPQ+FG+IYLGETF SYI ++N++T V V +K
Sbjct: 40 RFNYTQESDGITGAGAETLAAGQVLLLPQSFGSIYLGETFSSYICVHNTTTHPVECVTVK 99
Query: 124 AEIQTDKQRI--LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
A++Q++ RI L + KSPV + GG D ++ ++VKE+G H LVC Y+ G
Sbjct: 100 ADLQSNTSRINLSLHEHVKSPV-VLAPGGTIDDVIRYEVKEIGTHILVCEVNYTTPAGFA 158
Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
+ L +FFKF V PL V+TK ++ EI +LEA I+N T S +++VE + S+
Sbjct: 159 QSLRKFFKFQVLKPLDVKTKFYNAEI-----DEI-YLEAQIQNVTTSPFCLEKVELDSSE 212
Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
++S T L P+ + + + +P +LY +K + + ++ N
Sbjct: 213 DYSVTPLNT-LPNGESVFTVKHMLQP-------NNSCQFLYCIKPKADIAKDIDTLRQFN 264
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
+GKL I WR+NLGE GRLQT Q+ K + L V++ + + I F K ++TN
Sbjct: 265 NVGKLDIVWRSNLGEKGRLQTSQLQRLPFECKTLRLEVMDAKNTIKIGTVFTFKCRVTNT 324
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+++ +S+ D + +G L +++ S +F L++ +KLG+ +++
Sbjct: 325 SEQPMKLNVRMVSKFSPDSQ----YSGCADFMLDLLKSGESAEFPLSVCPSKLGLIKVSP 380
Query: 422 ITVFDKL--EKITYDSLPDL 439
+ + + L E+ T +++ D+
Sbjct: 381 LILTNTLQNEQFTIENVVDV 400
>gi|341880489|gb|EGT36424.1| hypothetical protein CAEBREN_15251 [Caenorhabditis brenneri]
Length = 380
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 199/387 (51%), Gaps = 34/387 (8%)
Query: 58 DVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEV 117
++ K S+L+ +R HD + + L+ PQ F IYLGETF Y+++ N S V
Sbjct: 26 EILAGKVSELSKETR---HD----LPIGEYLIAPQMFENIYLGETFTFYVNVVNESESNV 78
Query: 118 RDVVIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
+V +K E+QT QR+ L + +E+ + G+ ++ H+VKE+G H L+C+ Y
Sbjct: 79 VNVCLKCELQTSTQRVALPCSVQDTIIEASKCDGQ---VISHEVKEIGQHILICSVNYKT 135
Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVE 236
GE Y +FFKF VS P+ V+TK + + +LEA IEN + +N+++++VE
Sbjct: 136 LSGENMYFRKFFKFPVSKPIDVKTKFYSAE------NQDVYLEAQIENTSSANMFLERVE 189
Query: 237 FEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
+PSQ++ T + H D + ++ KP I +L+ L + ++ K
Sbjct: 190 LDPSQHYKVTSIS----HQDEFPEIGKLLKP-------RDIRQFLFCLSPMDANNTLGYK 238
Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
S +GKL ++WRT++GE GRLQT + ++ L+V + P+ V + KPF +
Sbjct: 239 DLTS--IGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVEVQKPFEILC 296
Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
+L N +++ ++ L Q + +G+ + L P + DF LN+ +G+
Sbjct: 297 RLYNCSERALD-LQLRLEQPTNRHLVFCSPSGVSLGQLPPSQY---VDFVLNVFPVAVGI 352
Query: 417 QRITGITVFDKLEKITYDSLPDLEIFV 443
Q I+GI + D K Y+ +IFV
Sbjct: 353 QSISGIRITDTFTKRVYEHDDIAQIFV 379
>gi|427789685|gb|JAA60294.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 416
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 129/434 (29%), Positives = 201/434 (46%), Gaps = 52/434 (11%)
Query: 10 LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
LA +VMRL RPSL P+ D D I S + D+ +L
Sbjct: 8 LALKVMRLTRPSLFTTLPVVCDSRD-----------IPGSMWMQELKQDLGAPLGLEL-- 54
Query: 70 RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
G L+LPQ+FG IYLGETF Y+S++N S VRDV ++AE+QTD
Sbjct: 55 ------------FGAGSFLMLPQSFGNIYLGETFSCYMSVHNDSQTTVRDVSVRAELQTD 102
Query: 130 KQRILLLDTSKS--PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
Q++LL + V + D ++ H+VK++ H LVCT Y+ GE+ + +F
Sbjct: 103 SQKVLLAGRADGAVAVAELAPNSSIDEVIHHEVKDINTHILVCTVNYTTQAGEKLHFRKF 162
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + +LEA ++N T S + +++V EPS ++
Sbjct: 163 FKFQVYKPLDVKTKFYNAE------SDEVYLEAQLQNITSSPICLEKVALEPSPYFNVCQ 216
Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV-------QGS 300
L G S+ +F PV + YL+ L S + V G
Sbjct: 217 LNTCG-------DSQSVFG-PVNFLNPHDTRQYLFSLSPRVPSSETGETVAQPEKRRSGV 268
Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
+GKL I WR+ +GE GRLQT Q+ ++I+L + PS V +++PF + + N
Sbjct: 269 TSIGKLDIVWRSAMGERGRLQTSQLERIAPGYEDIKLTIESAPSTVNLEEPFEIACSVMN 328
Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRIT 420
+ ++ L+ + ++ G+ +L +E + L + + G+Q I+
Sbjct: 329 TCHRT---MDLVLALENLPSSG-LLWQGMSGQSLGKLEPQATVRITLEAVPFRTGLQSIS 384
Query: 421 GITVFDKLEKITYD 434
GI + D K TYD
Sbjct: 385 GIKLSDTYLKQTYD 398
>gi|28574117|ref|NP_609365.3| CG4953 [Drosophila melanogaster]
gi|74866482|sp|Q95TN1.1|U533_DROME RecName: Full=UPF0533 protein CG4953
gi|16198171|gb|AAL13894.1| LD37668p [Drosophila melanogaster]
gi|28380339|gb|AAF52893.3| CG4953 [Drosophila melanogaster]
gi|220946234|gb|ACL85660.1| CG4953-PA [synthetic construct]
gi|220955926|gb|ACL90506.1| CG4953-PA [synthetic construct]
Length = 438
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 133/440 (30%), Positives = 225/440 (51%), Gaps = 50/440 (11%)
Query: 5 PGTHSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
P H +A +VMRL RP+L + P + +PTDL ++++
Sbjct: 6 PDAHLVALKVMRLMRPTLVGLGPVVTCEPTDLV--------------------QRFSSSQ 45
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
SD + A+++ +L+LPQ+FG+IYLGETF SYI ++N++ V V +K
Sbjct: 46 ESD------GMSGACAETLAAGQVLLLPQSFGSIYLGETFASYICVHNTTPNPVECVTVK 99
Query: 124 AEIQTDKQRILLL--DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
A++Q++ RI L + +KSPV + GG D ++ ++VKE+G H LVC YS G
Sbjct: 100 ADLQSNTSRINLSMHENAKSPV-VLPPGGTIDDVIRYEVKEIGTHILVCEVNYSTPAGYA 158
Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
+ L +FFKF V PL V+TK ++ EI +LEA I+N T S +++VE + S+
Sbjct: 159 QSLRKFFKFQVLKPLDVKTKFYNAEI-----DEI-YLEAQIQNVTTSPFCLEKVELDGSE 212
Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
++S T L P+ + + + +P +LY +K + + ++ N
Sbjct: 213 DYSVTPLNT-LPNGESVFTVKHMLQP-------NNSCQFLYCIKPKGDIAKNVDTLRQFN 264
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
+GKL I WR+NLGE GRLQT Q+ K + L V++ + + I F ++TN
Sbjct: 265 NVGKLDIVWRSNLGEKGRLQTSQLQRLPFECKTLRLEVLDAKNTIKIGTIFTFNCRVTN- 323
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
T + + L+ S + + G L +++ S +F L++ +KLG+ +IT
Sbjct: 324 TSEHPMKLNVRLAAKFSPDSQYT---GCADFMLNLLQSGESAEFPLSVCPSKLGLVKITP 380
Query: 422 ITVFDKL--EKITYDSLPDL 439
+ + + L E+ T +++ D+
Sbjct: 381 LVLTNTLQNEQFTIENVVDV 400
>gi|268638273|ref|XP_646894.2| DUF974 family protein [Dictyostelium discoideum AX4]
gi|187608844|sp|Q55EX6.2|U533_DICDI RecName: Full=UPF0533 protein
gi|256013093|gb|EAL73120.2| DUF974 family protein [Dictyostelium discoideum AX4]
Length = 511
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 133/457 (29%), Positives = 227/457 (49%), Gaps = 65/457 (14%)
Query: 7 THSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
H L +VMRL +P++ P+ + DL + I +++L V ++ S+D
Sbjct: 4 NHLLNLKVMRLSKPNIPTINPILCEKQDL--PYETMSTSIDSTSLS---MGSVNSSGSND 58
Query: 67 LTYRSRFLLHDSADSIGLSGLLV---LPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ L+ ++ + I + GL V L G IYLGE FC YIS+NN S +VR+V +K
Sbjct: 59 ----NNQLIGNNGNPINMEGLGVTSMLQLQSGVIYLGEMFCCYISLNNHSPYQVRNVFLK 114
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
E+QT RI LLD+ + V + G DF+V+ +VKE G + LVC Y+ EGE+K
Sbjct: 115 VELQTTSSRIPLLDSEQQSVPTFNPGFSSDFVVQREVKESGVNILVCAVNYTTPEGEQKK 174
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
++FKF V NPL ++T++ + + FLEAC+EN T+ +L+++ + FEP +++
Sbjct: 175 FRKYFKFQVLNPLVLKTRIH-------NLPNVVFLEACLENATQGSLFIESILFEPIEHF 227
Query: 244 SATMLKADGP-------------HSDYNAQSREIFKPPVLIRSGGGIHN---YLYQLKM- 286
++ + + + N + FK + G I N L +K+
Sbjct: 228 NSKDISFENSLDDNNNLDNNNNNLENDNNLNNLEFK----LNEKGLIENTDELLENIKLT 283
Query: 287 -------LSHGSS-------SP-----VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILG 327
L G S +P V+ + S LG+L ITWR+ GE GRL+T I
Sbjct: 284 TSDNIVFLKQGCSRQYLFQITPKDIENVESKNSLPLGRLDITWRSYFGEIGRLKTAAI-Q 342
Query: 328 TTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMIN 387
+ ++IE +++ +P + ++KPF + KL+N++++ P + L +N D K+
Sbjct: 343 RKLNQEDIECSLINIPDKIKLEKPFSVIAKLSNKSNRILYP-QFMLVRNKMDGIKI---- 397
Query: 388 GLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITV 424
+ L P++ + + K G+Q+I G+ +
Sbjct: 398 NSHLPKLDPIQPNSIIQVEIEMFPLKPGMQQIIGLAI 434
>gi|281341772|gb|EFB17356.1| hypothetical protein PANDA_007966 [Ailuropoda melanoleuca]
Length = 339
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 110/340 (32%), Positives = 178/340 (52%), Gaps = 22/340 (6%)
Query: 97 IYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIV 156
I+LGETF SYIS++N S V+D+++KA++QT QR L L S + V ++ D ++
Sbjct: 2 IFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR-LNLSASSAAVAELKPDCCIDDVI 60
Query: 157 EHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEIT 216
H+VKE+G H LVC Y+ GE+ Y +FFKF V PL V+TK +
Sbjct: 61 HHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVLKPLDVKTKFYNAETDEV------ 114
Query: 217 FLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREIFKPPVLIRSG 274
FLEA I+N T S ++M++V EPS ++ L + + SR +P
Sbjct: 115 FLEAQIQNITTSPMFMEKVSLEPSIMYNVAELNSVSQAGECVTTFGSRAYLQPM------ 168
Query: 275 GGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKE 334
YLY LK + ++G V+GKL I W+TNLGE GRLQT Q+ +
Sbjct: 169 -DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGD 227
Query: 335 IELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMAL 394
+ L++ +P V +++PF + K+TN +++ ++ L +++ I+G ++ L
Sbjct: 228 VRLSLEAIPDTVNLEEPFHITCKITNCSER---TMDLVLEMCNTNSIHWCGISGRQLGKL 284
Query: 395 APVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
P + L L+++ G+Q ++G+ + D K TY+
Sbjct: 285 HPSSSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 321
>gi|157104758|ref|XP_001648554.1| hypothetical protein AaeL_AAEL004198 [Aedes aegypti]
gi|157104963|ref|XP_001648651.1| hypothetical protein AaeL_AAEL000579 [Aedes aegypti]
gi|108880202|gb|EAT44427.1| AAEL004198-PA [Aedes aegypti]
gi|108884143|gb|EAT48368.1| AAEL000579-PA [Aedes aegypti]
Length = 424
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 134/452 (29%), Positives = 212/452 (46%), Gaps = 61/452 (13%)
Query: 5 PGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
P H LA +VMRL RP+ LISS + T ++
Sbjct: 4 PSEHLLALKVMRLTRPT--------------------------------LISSQIITAEA 31
Query: 65 SDLTYRS-RFLLHDSA------DSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEV 117
DL + +L SA +++ + LPQ+FG IYLGETF SY+ ++N V
Sbjct: 32 KDLPQNTFAGILKSSATTVQDCETLAAGQFMQLPQSFGNIYLGETFSSYVCVHNCRAHPV 91
Query: 118 RDVVIKAEIQTDKQRI-LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
+V +KA++Q++ RI L + K + D ++ H+VKE+G H LVC Y
Sbjct: 92 GNVSVKADLQSNNTRINLPIHVDKQGPVVLHPDETLDDVIHHEVKEIGTHILVCEVSYMT 151
Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVE 236
G +FFKF V PL V+TK + +LEA I+N T + +++VE
Sbjct: 152 PAGLESSFRKFFKFQVVKPLDVKTKFYNAETDE------VYLEAQIQNITVGPICLEKVE 205
Query: 237 FEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
E S+ ++ L + P + R + +P +LY +K L + P+
Sbjct: 206 LESSEQYTVVSLN-NLPSGESVFSQRTMLQP-------MNSCQFLYCIKPLPAILNDPMA 257
Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
++ +N +GKL I WR+NLGE GRLQT Q+ + I ++ L V+E S V I + F K
Sbjct: 258 LKAANNIGKLDIVWRSNLGERGRLQTSQLQRSPIEYGDLRLTVIEANSTVKIGEGFDFKC 317
Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKV-VMINGLRIMALAPVEAFGSTDFHLNLIATKLG 415
++TN +++ L N + KV G ++L P+E +F L + +LG
Sbjct: 318 RVTNTSERSMD-----LLMNLNTNAKVGCGYTGQTEISLGPLEPGKYKEFSLTVCPVRLG 372
Query: 416 VQRITGITVFDKLEKITYDSLPDLEIF-VDQD 446
+ IT + + D K Y+ +++F VD+D
Sbjct: 373 LITITNLQLTDVFMKRKYEFDDFVQVFVVDED 404
>gi|241702186|ref|XP_002413194.1| conserved hypothetical protein [Ixodes scapularis]
gi|215507008|gb|EEC16502.1| conserved hypothetical protein [Ixodes scapularis]
Length = 417
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 135/441 (30%), Positives = 206/441 (46%), Gaps = 50/441 (11%)
Query: 10 LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
LA +VMRL RPSL P+ D D I S + D+ +L
Sbjct: 11 LALKVMRLTRPSLFSTLPVVCDSRD-----------IPGSMWLQDLKQDLGAPLGLEL-- 57
Query: 70 RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
G L+LPQ+FG IYLGETF Y+S++N S VRDV +KAE+QTD
Sbjct: 58 ------------FGTGSFLMLPQSFGNIYLGETFSCYMSVHNDSEHTVRDVSVKAELQTD 105
Query: 130 KQRILLLDTSK-SPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFF 188
Q++ L S+ + V + D ++ H+VK++ H LVCT YS GE+ + +FF
Sbjct: 106 SQKVFLTGKSEGTAVPELPPKSSIDEVIHHEVKDINTHILVCTVNYSSHTGEKLHFRKFF 165
Query: 189 KFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
KF V PL V+TK + + +LEA ++N T S + +++V EPSQ+++ L
Sbjct: 166 KFQVYKPLDVKTKFYNAE------SDEVYLEAQLQNITSSPISLEKVALEPSQHFNVCQL 219
Query: 249 KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK------MLSHGSSSPVKVQGSNV 302
+ A + IF V + YL+ L ++ +S G
Sbjct: 220 NS-------CADGQSIFG-QVNFLNPHDTRQYLFSLSPRVADAAVAPAASDKRSRSGITS 271
Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT 362
+GKL I WR+ +GE GRLQT Q+ ++I L V PS V +++PF + +TN
Sbjct: 272 IGKLDIVWRSVMGERGRLQTSQLERIAPGYEDIRLTVDSAPSSVNLEEPFEITCLVTNTC 331
Query: 363 DKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGI 422
Q ++ L ++S ++ G +L +E S L + + G+Q ++GI
Sbjct: 332 ---QRTMDLVLMLDNSATSG-LLWQGTSGQSLGKLEPQTSLRIKLEAVPFRTGLQGVSGI 387
Query: 423 TVFDKLEKITYDSLPDLEIFV 443
+ D K YD +FV
Sbjct: 388 KLNDTFLKQVYDYDDITSVFV 408
>gi|291234053|ref|XP_002736964.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 409
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 116/357 (32%), Positives = 177/357 (49%), Gaps = 50/357 (14%)
Query: 8 HSLAFRVMRLCRPSLHVEPPL----RVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +PS P+ R P +LF+ + +D+++NK
Sbjct: 9 HLLALKVMRLTKPSFMTTIPVLSEDRDLPGNLFLQA---------------LQTDLSSNK 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
++ + LL LPQ FG I+LGETF YIS++N S+ V D+++K
Sbjct: 54 G--------------IENFAMGELLTLPQNFGNIFLGETFSCYISVHNDSSQSVSDILVK 99
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
++QT QR+ L + SP ++ D ++ H+VKELG H LVC YS GE+ Y
Sbjct: 100 TDLQTSSQRLTLSGGNVSPSPNLSPENCIDEVIHHEVKELGTHILVCAVSYSISSGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
+FFKF V PL V+TK + +LEA I+N T S + M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESNE------VYLEAQIQNITNSPMVMERVTLEPSILY 213
Query: 244 SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
++ L +S + ++ E + + YLY L S + +G +
Sbjct: 214 NSQEL-----NSILSKENSETTFGNLSYLNAMDTRQYLYCLTPKSSDN------KGVTNI 262
Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
GKL I W+T+LGE GRLQT Q+ +I L + ++P V ++KPF + K+ N
Sbjct: 263 GKLDIVWKTHLGEKGRLQTSQLQRMAPGYGDIRLTIEQIPDGVQLEKPFTVICKVIN 319
>gi|195578101|ref|XP_002078904.1| GD23672 [Drosophila simulans]
gi|194190913|gb|EDX04489.1| GD23672 [Drosophila simulans]
Length = 417
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 128/428 (29%), Positives = 218/428 (50%), Gaps = 48/428 (11%)
Query: 5 PGTHSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
P +H +A +VMRL RP+L + P + +PTDL + ++
Sbjct: 6 PDSHLVALKVMRLMRPTLVGLGPVVTCEPTDLV--------------------QRFSNSQ 45
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
SD + A+++ +L+LPQ+FG+IYLGETF SYI ++N++ V V +K
Sbjct: 46 ESD------GMSGACAETLAAGQVLLLPQSFGSIYLGETFASYICVHNTTPNPVECVTVK 99
Query: 124 AEIQTDKQRILLL--DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
A++Q++ RI L + +KSPV + GG D ++ ++VKE+G H LVC YS G
Sbjct: 100 ADLQSNTSRINLSMHENAKSPV-VLPPGGTIDDVIRYEVKEIGTHILVCEVNYSTPAGYA 158
Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
+ L +FFKF V PL V+TK ++ EI +LEA I+N T S +++VE + S+
Sbjct: 159 QSLRKFFKFQVLKPLDVKTKFYNAEI-----DEI-YLEAQIQNVTTSPFCLEKVELDGSE 212
Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
++S T L P+ + + + +P +LY +K + + ++ N
Sbjct: 213 DYSVTPLNT-LPNGESVFTVKHMLQP-------NNSCQFLYCIKPKGDIAKNVDTLRQFN 264
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
+GKL I WR+NLGE GRLQT Q+ K + L V++ + + I F ++TN
Sbjct: 265 NVGKLDIVWRSNLGEKGRLQTSQLQRLPFECKTLRLEVLDAKNTIKIGTIFTFNCRVTN- 323
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
T + + L+ S + + G + +++ S +F L++ +KLG+ +IT
Sbjct: 324 TSEHPMKVNVRLAAKFSPDSQYT---GCADFMMNFLQSGESAEFPLSVCPSKLGLVKITP 380
Query: 422 ITVFDKLE 429
+ + + ++
Sbjct: 381 LVLTNTIQ 388
>gi|281202555|gb|EFA76757.1| DUF974 family protein [Polysphondylium pallidum PN500]
Length = 494
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 146/448 (32%), Positives = 219/448 (48%), Gaps = 77/448 (17%)
Query: 10 LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
L +VMRL +P + V + + D I DI PPLI ++ TY
Sbjct: 9 LNLKVMRLSKPHIPVNNSILCERDD--IASDIL--------FPPLIQF------GNNDTY 52
Query: 70 RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
+++G+S +L L G IYLGE F SYIS+NN ST +V +V +K E+QT
Sbjct: 53 GG------GIEALGISPMLQLQS--GTIYLGEIFTSYISLNNHSTHDVTNVFLKVELQTS 104
Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFK 189
QRILLLD+ +SP+ G DF+V+ +VKE G + L C Y EGE K +FFK
Sbjct: 105 TQRILLLDSEQSPIAKFGPGFNSDFVVQREVKESGVNILCCAVNYVTPEGEIKKFKKFFK 164
Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
F V NPL ++TK+ H FLEAC+EN T+ +L+++ + FEPS+ ++ L
Sbjct: 165 FQVMNPLIIKTKIH-------HIPNQIFLEACLENATQGSLFLESILFEPSELFNFVNL- 216
Query: 250 ADGPHS------------------------------DYNAQSREIFKPP-VLIRSGGGIH 278
++ H+ D N+ EI V+ G
Sbjct: 217 SENSHNVNATPISSPPLTSPSTTSSPTSNVNFKSSVDSNSILSEIKSTSNVVFLKESGSR 276
Query: 279 NYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELN 338
YL++ ++ + + S LGKL ITWR+ LGE GRL+T I I E+E
Sbjct: 277 QYLFK---ITPKDPNDFDTKNSASLGKLDITWRSYLGEIGRLKTAYI-QRKINIDEVECI 332
Query: 339 VVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGL--RIMALAP 396
+ +P V ++KPF++ KL N+T++ P + L +N D +++NG +I AL P
Sbjct: 333 LTHIPK-VELEKPFVVTAKLVNKTNRILYPLFV-LVRNKMDG---ILVNGHLPKIGALPP 387
Query: 397 VEAFGSTDFHLNLIATKLGVQRITGITV 424
S D + + K G+Q+I G+ +
Sbjct: 388 N---NSLDIDIEMFPIKPGMQQIVGLAI 412
>gi|321467962|gb|EFX78950.1| hypothetical protein DAPPUDRAFT_320008 [Daphnia pulex]
Length = 414
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 133/442 (30%), Positives = 214/442 (48%), Gaps = 43/442 (9%)
Query: 4 TPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
T L+ +VMRL RP +P DL I + +++ D ++
Sbjct: 3 TKADQILSIKVMRLSRPVFTQPGLFHPEPWDLV-------STILSQEENNVLTEDA--DQ 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ D T+ S+F GLL LPQ+FG IYLGETF SY+ + N + V ++ IK
Sbjct: 54 TLDKTFSSQF------------GLL-LPQSFGTIYLGETFQSYLRVQNVGSCLVSNISIK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR+ L +K + + D I+ H++ E+G H LVC Y GEGE+
Sbjct: 101 ADLQTAAQRLPLTKRNKVSINQLEPQQSTDDILSHEITEIGTHILVCEVSYQIGEGEQMT 160
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSN-LYMDQVEFEPSQN 242
+++KF V PL V+TK + + +LEA I+N T L +D+V EPS
Sbjct: 161 SSRYYKFQVLKPLDVKTKFYNAE------SDDVYLEAQIQNTTVDRPLCLDKVTMEPSTL 214
Query: 243 WSATMLK-----ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
+ + L P S+ ++F V + G I YL+ LK + + +
Sbjct: 215 FEVSSLNEISATTGTPWSNMP----QLFGKCVNVVQPGEIRQYLHCLKPKQNVRDNHRML 270
Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
+G + +GKL + WRT +G+ GRLQT Q+ ++ L + E+P+ V + +P K
Sbjct: 271 RGESNIGKLDLIWRTAIGDRGRLQTSQLQRMVPNYGDVRLTIQELPNPVKLHRPINFVCK 330
Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
+TN +++ P E+ L + + V+ G+ L ++ ST+ L L+ G+Q
Sbjct: 331 ITNTSER---PVELSLVL-EIRSKPTVLWTGISNRPLKKIDPNHSTEVSLKLVPVMPGLQ 386
Query: 418 RITGITVFDKLEKITYDSLPDL 439
I+G+ + D K TYD PD+
Sbjct: 387 SISGLKLIDLFLKRTYD-YPDI 407
>gi|170036870|ref|XP_001846284.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167879819|gb|EDS43202.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 424
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 125/438 (28%), Positives = 211/438 (48%), Gaps = 46/438 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL RP+L + + DL ++ FD ++ TT +
Sbjct: 7 HLLALKVMRLTRPTLVSSQIVTAEAKDL--PQNTFDK---------ILRGTATTVQG--- 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A+++ +++LPQ+FG IYLGETF SY+ ++N V V +KA++Q
Sbjct: 53 -----------AETLTAGQMMLLPQSFGNIYLGETFSSYVCVHNCRAHPVSSVTVKADLQ 101
Query: 128 TDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQ 186
++ RI L + K +++ D ++ H+VKE+G H LVC Y G +
Sbjct: 102 SNNTRISLPIHVDKEGPQTLNPDETMDDVIHHEVKEIGTHILVCEVSYMTPAGLETSFRK 161
Query: 187 FFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
FFKF V PL V+TK + +LEA I+N T + +++VE E S+ ++
Sbjct: 162 FFKFQVVKPLDVKTKFYNAETDEV------YLEAQIQNITVGPICLEKVELESSEQYTVV 215
Query: 247 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 306
L + P + R + +P +LY +K ++ + P ++ +N +GKL
Sbjct: 216 PLN-NLPTGESVFSQRTMLQP-------QNSCQFLYCIKPIAEILNDPKALKAANNIGKL 267
Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 366
I WR+NLGE GRLQT Q+ + I ++ L V E S V I F + ++TN +++
Sbjct: 268 DIVWRSNLGERGRLQTSQLQRSPIEYGDLRLAVTEANSTVKIGDAFDFRCRVTNTSER-- 325
Query: 367 GPFEIWLSQNDSDEEKV-VMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
+ L + + + K+ G ++L P+E DF L + +LG+ I+ + +
Sbjct: 326 ---SMDLVMHLNTKTKIGCGYTGQTEISLGPLEPGKFKDFGLTVCPVRLGLITISNLQLT 382
Query: 426 DKLEKITYDSLPDLEIFV 443
D K Y+ +++FV
Sbjct: 383 DVFMKRKYEFDDFVQVFV 400
>gi|307171192|gb|EFN63179.1| UPF0533 protein [Camponotus floridanus]
Length = 402
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 132/442 (29%), Positives = 205/442 (46%), Gaps = 46/442 (10%)
Query: 4 TPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
T H LA +VMRL RP+L + D TDL + L + SD T +
Sbjct: 5 TKSDHLLALKVMRLTRPTLASPMVVTCDSTDL-----------PGNTLNNELKSDCTALQ 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+++ + +VLPQ+FG IYLGE F SY+ ++N S V++V +K
Sbjct: 54 --------------GMEALAIGQFMVLPQSFGNIYLGEIFSSYLCVHNGSNQVVKNVTVK 99
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT Q I L S E + D ++ H+VKE+G H LVC Y++ G
Sbjct: 100 ADLQTSTQTISLSGNSLEGKE-LAPDSTVDEVIHHEVKEIGTHILVCEVSYTNQIGPPLS 158
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
++FKF V PL V+TK + + +LEA I+N T + +++V E S +
Sbjct: 159 FRKYFKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLEKVALESSHLF 212
Query: 244 SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
S T L + N + I+ L+ + YLY LK P +Q + +
Sbjct: 213 SVTTL-------NINDEGESIYGSVNLLDTNCS-RQYLYCLKPQLSLMKDPKMMQNATNI 264
Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
GKL I WR+NLGE GRLQT Q+ ++ + + ++P V +++P + N ++
Sbjct: 265 GKLDIVWRSNLGERGRLQTSQLQRMAPEYGDLRVIMKDIPLKVNLEEPVNCTCHIINTSE 324
Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
+ E+ LS ++ I+ I +L P S D L LI G+ I+G+
Sbjct: 325 RS---MELLLSLESNESIAWCGISNTMIGSLKP---GISMDIPLCLIMLNTGIITISGLK 378
Query: 424 VFDKLEKITYDSLPDLEIFVDQ 445
+ D K YD +IFV+Q
Sbjct: 379 LTDTFLKRVYDYDDLAQIFVNQ 400
>gi|344247412|gb|EGW03516.1| UPF0533 protein C5orf44-like [Cricetulus griseus]
Length = 294
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 111/314 (35%), Positives = 162/314 (51%), Gaps = 41/314 (13%)
Query: 14 VMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRF 73
VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 1 VMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMRDDPST------------ 37
Query: 74 LLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRI 133
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++QT QR
Sbjct: 38 --VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQTSSQR- 94
Query: 134 LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVS 193
L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +FFKF V
Sbjct: 95 LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKFFKFQVL 154
Query: 194 NPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGP 253
PL V+TK + + FLEA I+N T S ++M++V EPS ++ T L +
Sbjct: 155 KPLDVKTKFYNAET------DEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTELNSVTQ 208
Query: 254 HSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWR 311
+ + SR +P YLY LK + ++G V+GKL I W+
Sbjct: 209 AGECVSTFGSRGYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWK 261
Query: 312 TNLGEPGRLQTQQI 325
TNLGE GRLQT Q+
Sbjct: 262 TNLGERGRLQTSQL 275
>gi|195051148|ref|XP_001993042.1| GH13306 [Drosophila grimshawi]
gi|193900101|gb|EDV98967.1| GH13306 [Drosophila grimshawi]
Length = 438
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 128/452 (28%), Positives = 211/452 (46%), Gaps = 72/452 (15%)
Query: 7 THSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
+H LA +VMRL RP+L + P + +P DL + L+ D
Sbjct: 8 SHLLALKVMRLTRPTLVGLGPIVTCEPKDL------------PQSFNRLVEYD------- 48
Query: 66 DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
+ SA+++G ++LPQ+FG IYLGETF SYI ++N +T V V +K +
Sbjct: 49 -------GIARTSAEALGAGQTMLLPQSFGNIYLGETFSSYICVHNCTTHPVEGVSVKVD 101
Query: 126 IQTDKQRI-LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
+Q++ RI LL+ +K + A D ++ ++VKE+G H LVC Y+ G + L
Sbjct: 102 LQSNNTRINLLMHENKKSSVVLTADETLDDVIRYEVKEIGTHILVCEVNYTSPAGFAQSL 161
Query: 185 PQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS 244
+FFKF V PL V+TK EI +LEA I+N T +++VE + S+ ++
Sbjct: 162 RKFFKFQVLKPLDVKTKFY-----NAEMDEI-YLEAQIQNVTTGPFCLEKVELDSSEQYT 215
Query: 245 ATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
T L P+ + S+ + +P +LY +K + ++ +N +G
Sbjct: 216 VTSLNT-LPNGESVFTSKNMLQP-------NNSCQFLYCIKPKPEIAKDIKTLRQANNVG 267
Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
KL I WR+N GE GRLQT Q+ K++ L V++ ++V I + ++TN
Sbjct: 268 KLDIVWRSNFGEKGRLQTSQLQRLPFEYKDLRLEVIDAENIVKIGTILTFQCRVTN---- 323
Query: 365 EQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIA------------- 411
+ E + + L A A GS DF L+++
Sbjct: 324 -------------TAEHSMKLHVTLETKAFADCPYTGSADFELDVLQPGEMAEFPLTICP 370
Query: 412 TKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
+KLG+ +I+ + + D L+ + +E+FV
Sbjct: 371 SKLGLIKISPLLIVDTLKNEQFLMTKVVEVFV 402
>gi|156546906|ref|XP_001599918.1| PREDICTED: UPF0533 protein C5orf44 homolog [Nasonia vitripennis]
Length = 404
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 131/449 (29%), Positives = 208/449 (46%), Gaps = 51/449 (11%)
Query: 1 MSSTPGT-HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDV 59
M S P + H LA +VMRL RP+L + D TDL + L + +D
Sbjct: 1 MESKPKSEHLLALKVMRLTRPTLASPVVVTCDSTDL-----------PGNTLNVELKNDC 49
Query: 60 TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
T + +++ + ++LPQ+FG IYLGE F SY+ ++N S V+D
Sbjct: 50 TALQ--------------GMETVAIGQFMILPQSFGNIYLGEIFSSYLCVHNGSHQAVKD 95
Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD--- 176
V +KA +QT Q I L + E + D ++ H+VKE G H LVC Y+
Sbjct: 96 VTVKANLQTSTQTIPLSGQNSQATE-LAPNHTIDEVIHHEVKETGTHILVCEVTYTPLLL 154
Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVE 236
G + +FFKF V PL V+TK + + ++EA I+N T + +++V
Sbjct: 155 GSQPLSF-RKFFKFQVVKPLDVKTKFYNAE------NDEVYIEAQIQNLTAGPICLEKVA 207
Query: 237 FEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK 296
E S ++ + L A N + I+ L+ SG YLY LK + P
Sbjct: 208 LESSHLFTVSTLSA-------NEKQESIYGKLNLLDSGHS-RQYLYCLKPTPSLAKDPKM 259
Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
+ + +GKL I WR+NLGE GRLQT Q+ ++ ++ ++PS + I++P K+
Sbjct: 260 MHNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPDYGDLRVSAKDIPSKIYIEEPVNFKI 319
Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
+ N T + Q + L N S V +G+ + ++ S L LI + G+
Sbjct: 320 HIIN-TSERQMDLLLGLQSNTS-----VAWSGISDKMIGTLKPGESVHLPLCLIPLESGL 373
Query: 417 QRITGITVFDKLEKITYDSLPDLEIFVDQ 445
++G+ + D K YD +IFV+
Sbjct: 374 VAVSGLKLTDTFLKRVYDYDDLAQIFVNH 402
>gi|308810202|ref|XP_003082410.1| unnamed protein product [Ostreococcus tauri]
gi|116060878|emb|CAL57356.1| unnamed protein product [Ostreococcus tauri]
Length = 463
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 114/358 (31%), Positives = 175/358 (48%), Gaps = 45/358 (12%)
Query: 117 VRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
R+V IK E+QT+ +R L D ++ P+ +R G + D +V DVKELGAHTLVC+A Y D
Sbjct: 86 AREVGIKIELQTETRRTTLHDATREPIAVLRPGEKRDVVVSKDVKELGAHTLVCSAAYCD 145
Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVE 236
GER+Y PQ+FKF VSNPLSVRTK R G FLE C+EN T++ L ++
Sbjct: 146 ENGERRYSPQYFKFKVSNPLSVRTKTRAAPRGR------IFLEVCVENATRNALLLEGAR 199
Query: 237 FE-----------PSQNWSATMLKADGPHSDYNAQSREIFKPPV--LIRSGGGIHNYLYQ 283
F+ P AT + D +D I K V L +GG H +LY+
Sbjct: 200 FDAVDGIMSRDMTPENAGQAT--RVDVGENDRGPGLPSIGKRAVYRLDPTGGSAH-FLYE 256
Query: 284 LKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKE-------IE 336
+ +++ + LGKL++ WR +G+ GRLQTQ I + S + I
Sbjct: 257 IT----SANASTTFAPTTPLGKLELRWRGAMGDLGRLQTQVINAGSAGSSDPVPEIAKIH 312
Query: 337 LNVVEVP--------SVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMING 388
++ P S V +++PF L+ ++ E G F + + D V ++G
Sbjct: 313 QTIIVDPKPANAEEESTVYVERPFTLRARIEALAPIEAGAFALRV----RDVVTGVYVDG 368
Query: 389 LRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQD 446
R + ++ + D ++ +A LGVQ + + ++ + LE+FV +D
Sbjct: 369 PRAFRIDSLDRGQTVDVDVSCVALGLGVQTCPTLALCGAVDDALLHAPTPLEVFVVRD 426
>gi|195118796|ref|XP_002003922.1| GI18169 [Drosophila mojavensis]
gi|193914497|gb|EDW13364.1| GI18169 [Drosophila mojavensis]
Length = 438
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 124/438 (28%), Positives = 212/438 (48%), Gaps = 46/438 (10%)
Query: 8 HSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
H LA +VMRL RP+L + P + +P DL + L+ D
Sbjct: 9 HLLALKVMRLTRPTLVGLGPIVTCEPKDL------------PQSFNRLVEFD-------- 48
Query: 67 LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEI 126
+ SA+++G ++LPQ+FG IYLGETF SYI ++N +T V V +K ++
Sbjct: 49 ------GIARTSAEALGAGQTMLLPQSFGNIYLGETFSSYICVHNCTTHPVEGVSVKVDL 102
Query: 127 QTDKQRI-LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP 185
Q++ +I LL+ +K + A D ++ ++VKE+G H LVC Y+ G + L
Sbjct: 103 QSNSSQINLLMHENKKSSVVLTADETLDDVIRYEVKEIGTHILVCEVNYTSPAGFAQSLR 162
Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
+FFKF V PL V+TK EI +LEA I+N T +++VE + S+ ++
Sbjct: 163 KFFKFQVLKPLDVKTKFY-----NAEMDEI-YLEAQIQNVTTGPFCLEKVELDSSEQYTV 216
Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
T L P+ + S+ + +P +LY +K + + ++ +N +GK
Sbjct: 217 TSLNT-LPNGESVFTSKNMLQP-------NNSCQFLYCIKPKAEIAKDIKTLREANNVGK 268
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I WR+N GE GRLQT Q+ K++ L V + ++V I F + ++TN +
Sbjct: 269 LDIVWRSNFGEKGRLQTSQLQRLPFEYKDLRLEVTDAENIVKIGTIFTFQCRITNTAEH- 327
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
P ++ + + D+ G L ++ +F L + +KLG+ +++ + +
Sbjct: 328 --PMKLHV-KLDTKVFPGCPYTGSADFELDTLQPGQLAEFPLTICPSKLGLIKVSPLVIV 384
Query: 426 DKLEKITYDSLPDLEIFV 443
D L+ + +E+FV
Sbjct: 385 DTLKNEQFIMTKVVEVFV 402
>gi|195384916|ref|XP_002051158.1| GJ14608 [Drosophila virilis]
gi|194147615|gb|EDW63313.1| GJ14608 [Drosophila virilis]
Length = 438
Score = 161 bits (408), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 125/439 (28%), Positives = 209/439 (47%), Gaps = 46/439 (10%)
Query: 7 THSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
TH LA +VMRL RP+L + P + +P DL + L+ D
Sbjct: 8 THLLALKVMRLTRPTLVGLGPIVTCEPKDL------------PQSFNRLVEFDG------ 49
Query: 66 DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
+ A+++G ++LPQ+FG IYLGETF SYI ++N ++ V V +K +
Sbjct: 50 --------IARTCAEALGAGQTMLLPQSFGNIYLGETFSSYICVHNCTSHPVEGVSVKVD 101
Query: 126 IQTDKQRI-LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
+Q++ RI LL+ +K + A D ++ ++VKE+G H LVC Y+ G + L
Sbjct: 102 LQSNTSRINLLMHENKKSSVVLTADETLDDVIRYEVKEIGTHILVCEVNYTSPAGFAQSL 161
Query: 185 PQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS 244
+FFKF V PL V+TK EI +LEA I+N T +++VE + S+ ++
Sbjct: 162 RKFFKFQVLKPLDVKTKFY-----NAEMDEI-YLEAQIQNVTTGPFCLEKVELDSSEQYT 215
Query: 245 ATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
T L P+ + S+ + +P +LY +K + ++ +N +G
Sbjct: 216 VTSLNT-LPNGESVFTSKNMLQP-------NNSCQFLYCIKPKPEVAKHIKTLREANNVG 267
Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
KL I WR+N GE GRLQT Q+ K++ L V++ ++V I F + ++TN T +
Sbjct: 268 KLDIVWRSNFGEKGRLQTSQLQRLPFEYKDLRLEVIDAENIVKIGTIFTFQCRVTN-TAE 326
Query: 365 EQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITV 424
I L + + L P + +F L + +KLG+ +++ + +
Sbjct: 327 HAMKLHITLETKAFADCPYTGSANFVLDVLQPGQF---AEFPLTICPSKLGLIKVSPLLI 383
Query: 425 FDKLEKITYDSLPDLEIFV 443
D L+ + +E+FV
Sbjct: 384 VDTLKNEQFLMTKVVEVFV 402
>gi|332018225|gb|EGI58830.1| UPF0533 protein C5orf44-like protein [Acromyrmex echinatior]
Length = 402
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/438 (28%), Positives = 204/438 (46%), Gaps = 46/438 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H L +VMRL RP+L + D TDL + L + +D TT +
Sbjct: 9 HLLTLKVMRLTRPTLASPMVVTCDSTDL-----------PGNTLNNELKNDCTTLQG--- 54
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+++ + +VLPQ+FG IYLGE F SY+ ++N S V++V +KA++Q
Sbjct: 55 -----------MEALAIGQFMVLPQSFGNIYLGEIFSSYLCVHNGSNQVVKNVTVKADLQ 103
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T Q ++ L ++ + + D ++ H+VKE+G H LVC Y++ G ++
Sbjct: 104 TSTQ-VIPLSSNNLEGKELAPDSTVDEVIHHEVKEIGTHILVCEVSYTNQIGPSLSFRKY 162
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
FKF V PL V+TK + + +LEA I+N T + +++V E S +S T
Sbjct: 163 FKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLEKVALESSHLFSVTT 216
Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 307
L N + I+ L+ +G YLY LK P +Q + +GKL
Sbjct: 217 LNT-------NDEGDSIYGSVNLLDAGCS-RQYLYCLKPQLSLLKDPKMMQNATNIGKLD 268
Query: 308 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 367
I WR+NLGE GRLQT Q+ ++ + + ++P +++P + N +++
Sbjct: 269 IVWRSNLGERGRLQTSQLQRMAPEYGDLRVLIKDIPLKAYLEEPVNCTCHIINTSERS-- 326
Query: 368 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 427
E+ LS ++ + G+ + ++ S D L I G+ I+G+ + D
Sbjct: 327 -MELLLSLESNNS---IAWCGMSDTIIGTLKPGVSMDIPLCFITLDTGIITISGLKLTDT 382
Query: 428 LEKITYDSLPDLEIFVDQ 445
K YD +IFV+Q
Sbjct: 383 FLKRVYDYDDLAQIFVNQ 400
>gi|332373924|gb|AEE62103.1| unknown [Dendroctonus ponderosae]
Length = 402
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 122/429 (28%), Positives = 195/429 (45%), Gaps = 53/429 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDL---FIGEDIFDDPIAASNLPPLISSDVTTNKS 64
H LA +VMRL RP+L P+ D DL + + DP A
Sbjct: 6 HLLALKVMRLTRPTLASPLPVTCDSKDLPGNLLNNVLQQDPTAVP--------------- 50
Query: 65 SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
+++I + L+LPQ IYLGETF SYI + + +T V ++ +K
Sbjct: 51 -------------GSETIAIGQFLLLPQNPVNIYLGETFSSYICVYSETTQIVYNITVKV 97
Query: 125 EIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
++QT Q++ L + S + + + + ++ H+VKE+G H LVC Y + G
Sbjct: 98 DLQTTSQKLSLANNSST--TKLNSDETVNTVIHHEVKEIGPHILVCEVAYQNSAGVLMSF 155
Query: 185 PQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS 244
+FFK V PL V+TK + + +LEA ++N T + +++V + S ++
Sbjct: 156 RKFFKIQVLKPLDVKTKFYNAE------NDDVYLEAQVQNITNGPICLEKVSLDASHLFN 209
Query: 245 ATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
T L N + E + + I YLY L SS + G+ +G
Sbjct: 210 VTCLN--------NTPTGESIFGNITLLQPQSISQYLYCLTPTDKLSSDLKSLSGATNIG 261
Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
KL I WR+NLGE GRLQT Q+ + EI+L++ E+P+ V I++ F K KL N ++
Sbjct: 262 KLDIVWRSNLGEKGRLQTSQLQRMSPDFGEIKLSITELPNFVVIEELFTFKCKLANNGER 321
Query: 365 EQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITV 424
E L ++ I+G ++ AL P S I G++ ++G+ +
Sbjct: 322 T---VEFILYLENTRNIAWCGISGRKLEALPP---HSSKILEFKCIPLVPGLRTLSGVKL 375
Query: 425 FDKLEKITY 433
D K TY
Sbjct: 376 VDTFTKRTY 384
>gi|307105123|gb|EFN53374.1| hypothetical protein CHLNCDRAFT_137142 [Chlorella variabilis]
Length = 467
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 121/445 (27%), Positives = 191/445 (42%), Gaps = 112/445 (25%)
Query: 89 VLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLL-DTSKSPVESIR 147
+P + G +F + I+ N S + V KAE+ T++ R+ LL D++ SP+ +
Sbjct: 44 AMPALAAGGFAGRSFAAIIAACNYSDAPITLVGFKAELSTERSRLALLHDSAASPLPRLA 103
Query: 148 AGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKV 207
AG R+D +V+HD+K+LG HTL C+A ++ GEGER+ Q F F NPL VRTK R +V
Sbjct: 104 AGQRHDLLVKHDIKDLGVHTLTCSASFTCGEGERRLQAQAFTFSSLNPLVVRTKQR--QV 161
Query: 208 GATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML------------------- 248
G E LEA +EN TK+ + +D + F P+ ++A +
Sbjct: 162 G-----EAVLLEATLENATKAPMLLDAISFFPAPPFAAQRVGGGGASSPPPPPAAGRAGD 216
Query: 249 KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKML--------------SHGSSSP 294
+ GP S Y I P++ GG +L+ L L + +SP
Sbjct: 217 EPAGPLSSY------IQSLPLIPE--GGASAFLFHLTRLPAAAAGSPGGAMPGASPGTSP 268
Query: 295 VK------------VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEV 342
+ + S LGK++I WR +GE RLQTQQI +E+ L + +
Sbjct: 269 SRAAAAAAAAAAAAAEASGALGKMEIRWRGPMGEMARLQTQQISLPQPAQREVSLALARL 328
Query: 343 PSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDS------------------------ 378
P V + PF L++ + D+ GP +I + S
Sbjct: 329 PGRVAVGAPFTATLRVQSHVDRPVGPLKIAAADAPSPAGSPSRSSSLRASSSGSPSRDGS 388
Query: 379 -------------------DEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
D + V+++ LAP +A + L ++A G Q +
Sbjct: 389 LQGGAVAAAAAAAAAAVCLDGAQSVLVD-----ELAPRQA---VEVQLRMLALAAGQQAL 440
Query: 420 TGITVFDKLEKITYDSLPDLEIFVD 444
+ V + + Y +LP E+FVD
Sbjct: 441 PAMCVVSERDGKQYGALPPAELFVD 465
>gi|268530512|ref|XP_002630382.1| Hypothetical protein CBG04321 [Caenorhabditis briggsae]
Length = 414
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 130/456 (28%), Positives = 211/456 (46%), Gaps = 60/456 (13%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
+S++ LA RVMRL RP + P D F DP+ + L++ V
Sbjct: 5 ISNSSTQQLLALRVMRLARP--------KFAPLDGFS-----HDPVDPTGFGELLAGKV- 50
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
++++ SR HD + + L+ PQ F IYLGETF Y+++ N S V +V
Sbjct: 51 ----AEISKESR---HD----LPIGEYLIAPQMFENIYLGETFTFYVNVVNESESNVVNV 99
Query: 121 VIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
+K E+QT QR++L +ES + G+ ++ H+VKE+G H L+C+ Y G
Sbjct: 100 SLKCELQTSTQRVVLPCSVQDVTIESTKCDGQ---VISHEVKEIGQHILICSVNYKTLSG 156
Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVK---VGATHFQEITFLEACIENHTKSNLYMDQVE 236
E Y +FFKF VS P+ V+TK + V ++ +FL K L ++
Sbjct: 157 ENMYFRKFFKFPVSKPIDVKTKFYSAEDNAVSKRFLEKSSFLSRIRMFILKRKLRTPRIR 216
Query: 237 FEPSQ--NWSATMLKADG-PHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSS 293
+ NW +K H D + ++ KP I +L+ L S
Sbjct: 217 TCSWREWNWIRVSIKVTSISHEDEFPEVGKLLKP-------KDIRQFLFCL--------S 261
Query: 294 PVKVQGS------NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVG 347
PV V + +GKL ++WRT++GE GRLQT + ++ L+V + P+ V
Sbjct: 262 PVDVNNTLGYKDLTSIGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVD 321
Query: 348 IDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHL 407
+ KPF + +L N +++ ++ L Q + + + +G+ + L P DF L
Sbjct: 322 VQKPFEVACRLYNCSERALD-LQLRLEQPSNRQLVICSPSGVSLGQLPPSRY---VDFAL 377
Query: 408 NLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
N+ +G+Q I+GI + D K Y+ +IFV
Sbjct: 378 NVFPVAVGIQSISGIRITDTFTKRHYEHDDIAQIFV 413
>gi|328865155|gb|EGG13541.1| DUF974 family protein [Dictyostelium fasciculatum]
Length = 493
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 139/447 (31%), Positives = 218/447 (48%), Gaps = 78/447 (17%)
Query: 10 LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
L +VMRL +P L P+ + DD I+ LPP I N +
Sbjct: 9 LNLKVMRLSKPLLQANNPVLCE----------RDDVISDMILPPTIQPG---NNDT---- 51
Query: 70 RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
+ + +G++ +L L G IYLGE F SYIS+NN S EV++V E+QT
Sbjct: 52 -----MGGGIEGLGMTSMLQLQS--GLIYLGEIFTSYISLNNHSPHEVKNV----ELQTT 100
Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFK 189
QRILLLD+ P+ G DF+V+ +VKE G + L C Y EGE K +FFK
Sbjct: 101 TQRILLLDSEPKPIPVFGPGFNSDFVVQREVKEFGVNILCCAVTYVTLEGEVKKFKKFFK 160
Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT--- 246
F VSNPL +++K+ TF+E C+EN T+ L +D V FE + ++ +
Sbjct: 161 FQVSNPLGIKSKI-------ISIPNTTFVEVCLENTTQGALLIDTVTFEAADLFTQSNMS 213
Query: 247 --------------MLK-------ADGP----HSDYNAQS--REIFKPP--VLIRSGGGI 277
ML+ ++G +D QS EI P V +R G
Sbjct: 214 EVKHSQQPSPQQPPMLQLANSLGSSNGSGWKKSTDSTIQSLMSEIRASPDIVFLREGNS- 272
Query: 278 HNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIEL 337
YL+++ + + + + LGKL I WR+ +GE GRL+T QI + +E+E
Sbjct: 273 RQYLFKVM---PKDPNDFETKNAATLGKLDIVWRSYMGETGRLKTAQI-QRKVCLEEVEC 328
Query: 338 NVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPV 397
N+V +P+ V ++KPF + K+ N+T++ P + L +N D ++ING + + +
Sbjct: 329 NLVSIPT-VELEKPFTVTAKIINKTNRILHPLFV-LVRNKMDG---ILING-HLPKIGAL 382
Query: 398 EAFGSTDFHLNLIATKLGVQRITGITV 424
+A S + + + K G+Q+I+G+ +
Sbjct: 383 QANSSINLDIEMFPLKPGMQQISGLAI 409
>gi|91094103|ref|XP_967297.1| PREDICTED: similar to CG4953 CG4953-PA [Tribolium castaneum]
gi|270010876|gb|EFA07324.1| hypothetical protein TcasGA2_TC015920 [Tribolium castaneum]
Length = 404
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 124/430 (28%), Positives = 196/430 (45%), Gaps = 47/430 (10%)
Query: 5 PGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
P H LA +VMRL RP+L P+ D DL + L + D + K
Sbjct: 3 PEEHLLALKVMRLTRPTLATPLPVTCDSKDL-----------PGNLLNVALQQDAASVKG 51
Query: 65 SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
++ +FLL LPQ+ IYLGETF SYI + N + V +V +K
Sbjct: 52 TETLSIGQFLL--------------LPQSPVNIYLGETFSSYICVYNETQHIVSNVSVKV 97
Query: 125 EIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
++QT QR+ L +S P + + ++ H+VKE+G H LVC Y + G K
Sbjct: 98 DLQTTSQRLPL--SSNPPTPQLTPDDTVNIVIHHEVKEIGNHILVCEVSYQNAVGILKSF 155
Query: 185 PQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS 244
+FFK V PL V+TK + + +LEA ++N T + +++V + S +
Sbjct: 156 RKFFKIQVLKPLDVKTKFYNAE------NDDVYLEAQVQNITTGPICLEKVALDASHLFK 209
Query: 245 ATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
T L + IF L+ + +LY L SS + G+ +G
Sbjct: 210 VTSL-------NVTPTGESIFGKTTLLNPQA-VCQFLYCLSPNEKLSSDLKSLSGATNIG 261
Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
KL I WR+NLGE GRLQT Q+ +I L++ E+P+ V +++ F K +L N ++
Sbjct: 262 KLDIVWRSNLGERGRLQTSQLQRMGPDYGDIRLSITELPNFVVLEELFAFKCRLVNNCER 321
Query: 365 EQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITV 424
E+ + ++SD I+G ++ L P + I G++ ++GI +
Sbjct: 322 S---VELMMYLDNSDGLAWCGISGRKLEVLPP---HSTRVLEFKAIPLIPGLRTLSGIKL 375
Query: 425 FDKLEKITYD 434
D K TY+
Sbjct: 376 VDTFLKRTYN 385
>gi|196010439|ref|XP_002115084.1| hypothetical protein TRIADDRAFT_58871 [Trichoplax adhaerens]
gi|190582467|gb|EDV22540.1| hypothetical protein TRIADDRAFT_58871 [Trichoplax adhaerens]
Length = 427
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 126/432 (29%), Positives = 213/432 (49%), Gaps = 39/432 (9%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H L +VMRL +P+L P+ + DL PPL+ N D+
Sbjct: 9 HLLTLKVMRLTKPALQFHTPITCEDHDL------------PGFCPPLLYG---INDQKDI 53
Query: 68 TYRSRFLLH--DSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
+S L D ++ L +L LPQ+FG I+LGETF SYI++ N ST+ +D+ IK
Sbjct: 54 FRQSFNALGVVDGLEAFSLGEMLTLPQSFGNIFLGETFTSYINVQNDSTVAAKDIQIKLH 113
Query: 126 IQTDKQR----ILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
IQT+ QR + +D + S + ++ + IV +DVKELG H L C+ Y+ GE+
Sbjct: 114 IQTEAQRHPLPLNCMDENASLL--LQPSENVNEIVSYDVKELGIHVLGCSVGYTSPSGEK 171
Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
+ +FFKF V PL V+TK V + + ++EA +EN T + +Y+D V+ +PS
Sbjct: 172 LHFKKFFKFQVLKPLEVKTKFFVTE------DDEVYIEAQVENITPNPMYLDSVKLDPSP 225
Query: 242 NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 301
++ + P S ++ + + P+ +R YLY+L +S K +
Sbjct: 226 SYYLDDINKLLPESGPSSNGKISYLRPMDVR------QYLYRLTPVSPIIEKSDK--SAC 277
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 361
+GKL I W T+ GE GRLQT Q+ ++ +N +E+ V ++K F +KL + N
Sbjct: 278 DVGKLDIQWLTSFGEKGRLQTSQLQRMPRDLNDLRINCIEIADAVPVEKLFTVKLSVINL 337
Query: 362 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
T + L +++ + ++ + +AL ++ S + +N++ G+ I+G
Sbjct: 338 TSDRIMNLRLML--DNTKVQPLLWVGRSGQVALGELKPGQSIEVSVNILPVYPGLHVISG 395
Query: 422 ITVFDKLEKITY 433
+ + D + Y
Sbjct: 396 LQLLDTFKSKVY 407
>gi|383850626|ref|XP_003700896.1| PREDICTED: UPF0533 protein C5orf44 homolog [Megachile rotundata]
Length = 404
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 128/449 (28%), Positives = 208/449 (46%), Gaps = 51/449 (11%)
Query: 1 MSSTPGT-HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDV 59
M + P + H L +VMRL RP+L + D TDL + L + +D
Sbjct: 1 METKPKSEHLLELKVMRLTRPTLASPVVVTCDSTDL-----------PGNTLNNELKNDC 49
Query: 60 TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
T + +++ + +VLPQ+FG IYLGE F SY+ ++N S+ V++
Sbjct: 50 TALQ--------------GMETLAVGQFMVLPQSFGNIYLGEIFSSYLCVHNGSSQLVKN 95
Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE- 178
V ++A++QT Q I+ L S ++ + D ++ H+VKE+G H LVC Y+
Sbjct: 96 VTVRADLQTSTQ-IISLCGSSGEMKDLAPDNTVDEVIHHEVKEIGTHILVCEVTYTSTNL 154
Query: 179 -GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
G + ++FKF V PL V+TK + + +LEA I+N T + +++V
Sbjct: 155 GGTSQSFRKYFKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLEKVAL 208
Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
E S +S + L N + I+ L+ + YLY LK P +
Sbjct: 209 ESSHLFSVSTLNT-------NEKGESIYGLVNLLDTDCS-RQYLYCLKPQLSLLKDPKMM 260
Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
+ +GKL I WR+NLGE GRLQT Q+ +I + + ++P V +++
Sbjct: 261 HNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPEYGDIRVTMKDIPLTVYLEQSVNFNCH 320
Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFG-STDFHLNLIATKLGV 416
+ N +++ ++ LS ++ I+ I L P G S D L LIA + G+
Sbjct: 321 IINTSERS---MDLMLSLESNNSIAWCGISNTTIGTLKP----GISIDIPLCLIALRSGI 373
Query: 417 QRITGITVFDKLEKITYDSLPDLEIFVDQ 445
I+G+ + D K YD +IFV Q
Sbjct: 374 ITISGLKLVDTFLKRVYDYDNLAQIFVSQ 402
>gi|324516077|gb|ADY46413.1| Unknown [Ascaris suum]
Length = 366
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 108/358 (30%), Positives = 181/358 (50%), Gaps = 20/358 (5%)
Query: 88 LVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIR 147
L+ PQ F IYLGETF Y+ + N S+ ++ IK ++QT QR+ L + +++
Sbjct: 26 LMAPQIFDNIYLGETFTFYVCVQNDSSQCATEICIKTDLQTTNQRVALHSKLQDSNATLQ 85
Query: 148 AGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKV 207
G I+ H++KE+G H LVC Y E+ Y +FFKF V+ P+ VRTK +
Sbjct: 86 PGQILGDIISHEIKEVGQHILVCAVTYKTPADEKMYFRKFFKFPVTKPIDVRTKFYNAE- 144
Query: 208 GATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKP 267
+ +LEA I+N + + + +++V EPS +++T + P N S++ F
Sbjct: 145 --DNMNNDVYLEAQIQNTSATPMILEKVVLEPSDFYTSTEIP---PPLLLNENSKKQF-- 197
Query: 268 PVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILG 327
+ I YLY L+ + S +G +GKL + WRTN+GE GRLQT +
Sbjct: 198 ---YLNPKDIRQYLYCLRPKT-ADYSLNYYRGGTSIGKLDMVWRTNMGERGRLQTSALQR 253
Query: 328 TTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMI- 386
++ L V ++P+ I + F + +L N +++ ++ L+ + S + +V
Sbjct: 254 MAPGYGDLRLTVEKIPATAKIRQTFEVVCRLHNCSERS---LDLVLTLDGSLQPALVFCT 310
Query: 387 -NGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
+G+++ L P + DF L L+ G+Q I+GI V D K TY+ ++FV
Sbjct: 311 ASGVQLGQLPPN---NTVDFTLELLPITPGLQPISGIRVSDTFLKRTYEHDDIAQVFV 365
>gi|25149719|ref|NP_741010.1| Protein C56C10.7, isoform b [Caenorhabditis elegans]
gi|351060502|emb|CCD68178.1| Protein C56C10.7, isoform b [Caenorhabditis elegans]
Length = 417
Score = 155 bits (392), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 127/465 (27%), Positives = 210/465 (45%), Gaps = 71/465 (15%)
Query: 1 MSSTPGTHS----LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLIS 56
M+ P + S LA RVMRL RP + P D F DP+ + L++
Sbjct: 1 MADNPASSSSQQLLALRVMRLARP--------KFAPVDGFS-----HDPVDPTGFGELLA 47
Query: 57 SDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLE 116
V S+++ SR + + L+ PQ F IYLGETF Y+++ N S
Sbjct: 48 GKV-----SEISKESR-------QDLPIGEYLIAPQMFENIYLGETFTFYVNVVNESEKT 95
Query: 117 VRDVVIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS 175
V V +K E+QT QR++L + +ES + G+ ++ H+VKE+G H L+C+ Y
Sbjct: 96 VSSVSLKCELQTSTQRVVLPCSVQDATIESSKCEGQ---VISHEVKEIGQHILICSVNYK 152
Query: 176 DGEGERKYLPQFFKFIVSNPLSVRTKV--------RVVKVGATHFQEITFLEACIENHTK 227
GE Y +FFKF VS P+ V+TK RV+ + F+ + + + K
Sbjct: 153 TSNGENMYFRKFFKFPVSKPIDVKTKFYSAEVSSNRVLCINVVFFRTMRIKMS--TSKPK 210
Query: 228 SNLYMDQVEFEPSQNW---SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQL 284
++ ++ +W + ML +D ++ KP I +L+ L
Sbjct: 211 LKIHQMRICSWKKSSWIQVNIIMLLVSLMSTDEFGDVGKLLKP-------KDIRQFLFCL 263
Query: 285 KMLSHGSSSPVKVQGS------NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELN 338
+P V + +GKL ++WRT++GE GRLQT + ++ L+
Sbjct: 264 --------TPADVHNTLGYKDLTSIGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLS 315
Query: 339 VVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVE 398
V + P+ V + KPF + +L N +++ ++ L Q + +G+ + L P +
Sbjct: 316 VEKTPACVDVQKPFEVSCRLYNCSERALD-LQLRLEQPSNRHLVFCSPSGVSLGQLPPSQ 374
Query: 399 AFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
DF LN+ +G+Q I+GI + D K Y+ +IFV
Sbjct: 375 ---HVDFSLNVFPVTVGIQSISGIRITDTFTKRIYEHDDIAQIFV 416
>gi|307198435|gb|EFN79377.1| UPF0533 protein [Harpegnathos saltator]
Length = 389
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 120/403 (29%), Positives = 195/403 (48%), Gaps = 30/403 (7%)
Query: 52 PPLISSDVTTNKSSDL--TYRSRFLLHDSADSIGLSGL-----LVLPQAFGAIYLGETFC 104
P L S V T S+DL + L +D G+ L +VLPQ+FG IYLGE F
Sbjct: 6 PTLASPVVVTCDSTDLPGNTLNNELKNDCTALQGMEALAIGQFMVLPQSFGNIYLGEIFS 65
Query: 105 SYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELG 164
SY+ ++N S V++V++KA++QT Q I+ L + + + D ++ H+VKE+G
Sbjct: 66 SYLCVHNGSNQVVKNVIVKADLQTSTQ-IISLSGNNLEGKELAPDSTVDEVIHHEVKEIG 124
Query: 165 AHTLVCTALY--SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACI 222
H LVC Y ++ G ++FKF V PL V+TK + + +LEA I
Sbjct: 125 THILVCEVSYICANQVGPPLSFRKYFKFQVVKPLDVKTKFYNAE------SDEVYLEAQI 178
Query: 223 ENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLY 282
+N T + +++V E S +S T L N + + I+ L+ + YLY
Sbjct: 179 QNLTAGPICLEKVALESSHLFSVTTLNT-------NDEEKSIYGSVNLLDTSCS-RQYLY 230
Query: 283 QLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEV 342
LK P +Q + +GKL I WR+NLGE GRLQT Q+ ++ + + ++
Sbjct: 231 CLKPQPSLLKDPKMMQNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPEYGDLRVTLKDI 290
Query: 343 PSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGS 402
P V +++P K + N +++ + L N+S + G+ M + ++ S
Sbjct: 291 PLKVYLEEPVNCKCHIINTSERSMDLL-LSLESNNS-----IAWCGMSDMTIGTLKPGAS 344
Query: 403 TDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVDQ 445
D L LI G+ ++G+ + D K Y+ +IFV+Q
Sbjct: 345 IDIPLCLITLDTGIITVSGLKLTDTFLKRVYEYDDLAQIFVNQ 387
>gi|340709998|ref|XP_003393586.1| PREDICTED: UPF0533 protein C5orf44 homolog [Bombus terrestris]
Length = 404
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 130/449 (28%), Positives = 202/449 (44%), Gaps = 51/449 (11%)
Query: 1 MSSTPGT-HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDV 59
M + P + H L +VMRL RP L + D TDL + L + +D
Sbjct: 1 METKPKSEHLLELKVMRLTRPMLASPVVITCDSTDL-----------PGNTLNNELKNDC 49
Query: 60 TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
T + +++ + +VLPQ+FG IYLGE F SY+ ++N S V++
Sbjct: 50 TALQG--------------METLAVGQFMVLPQSFGNIYLGEIFSSYLCVHNGSNQIVKN 95
Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE- 178
V +KA++QT Q I L S ++ + D ++ H+VKE+G H LVC Y+ G
Sbjct: 96 VTVKADLQTSTQNISLCGNS-GEMKDLAPDSTVDEVIHHEVKEIGTHILVCEVTYTPGNL 154
Query: 179 -GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
+ ++FKF V PL V+TK + + +LEA I+N T + +++V
Sbjct: 155 GSTAQSFRKYFKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLEKVSL 208
Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
E S +S + L N + I+ V I YLY LK P +
Sbjct: 209 ESSHLFSVSTLNT-------NEKGESIYG-LVNILDTDCSRQYLYCLKPQLSLLKDPKMM 260
Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
+ +GKL I WR+NLGE GRLQT Q+ +I + + +P V +++
Sbjct: 261 HNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPEFGDIRVTMKNIPLTVYLEQSVNFNCH 320
Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFG-STDFHLNLIATKLGV 416
+ N +++ ++ LS S+ I+ I L P G S D L LI + G+
Sbjct: 321 IINTSERS---MDLMLSLESSNSIAWCGISNTMIGTLKP----GISIDIPLCLIPLRSGI 373
Query: 417 QRITGITVFDKLEKITYDSLPDLEIFVDQ 445
I+G+ + D K YD +IFV Q
Sbjct: 374 ITISGLKLTDTFLKRVYDYDDLAQIFVSQ 402
>gi|357609833|gb|EHJ66705.1| hypothetical protein KGM_03665 [Danaus plexippus]
Length = 402
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 119/401 (29%), Positives = 185/401 (46%), Gaps = 47/401 (11%)
Query: 52 PPLISSDVTTNKSSDL--TYRSRFLLHDSADSIGLSGL-----LVLPQAFGAIYLGETFC 104
P LIS + T DL + FL D+ + + L L+LPQ+FG IYLGETF
Sbjct: 21 PALISPKIVTCDFKDLPGNILNNFLKDDATSVVQMETLAAGQFLLLPQSFGNIYLGETFS 80
Query: 105 SYISINNSSTLEVRDVVIKAEIQTDKQRI-LLLDTSKSPVESIRAGGRYDFIVEHDVKEL 163
Y+ ++N + V+ V IKA++QT QRI L ++SP+ + ++ H+VK+L
Sbjct: 81 CYVCVHNETNQPVQSVSIKADLQTSSQRIPLTTQQNQSPI-MLDVDETLSDVIHHEVKDL 139
Query: 164 GAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIE 223
G H LVC Y +FFKF V PL V+TK + + F+EA ++
Sbjct: 140 GTHILVCEVTYMSNYSTLASFRKFFKFEVLKPLDVKTKFYNAE------SDDVFVEAQVQ 193
Query: 224 NHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIH----- 278
N T + ++ V E S ++ L D +F L++
Sbjct: 194 NITSGPIILETVALESSHQFTVKSLNEDD-------NGVSVFGDVTLLQPQESCQYSYCL 246
Query: 279 ----NYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKE 334
N L +K+L+ + +GKL I WR+NLGE GRLQT Q+ +
Sbjct: 247 TPKENILKDIKLLAAAKN----------IGKLDIVWRSNLGEKGRLQTSQLQRMIPDYGD 296
Query: 335 IELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG-PFEIWLSQNDSDEEKVVMINGLRIMA 393
I + VPS V ID+PF K+ N +++ ++ QN S ++ G+
Sbjct: 297 IRVTYENVPSRVPIDEPFKFNCKIVNASERTLDLILKLRSLQNSS-----LLWCGISNRK 351
Query: 394 LAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
L P+E +T +L ++ G+ +TG+++ D K TYD
Sbjct: 352 LGPLEPGNTTIVNLTVLPINSGLHTVTGVSLVDLFLKRTYD 392
>gi|350398663|ref|XP_003485265.1| PREDICTED: UPF0533 protein C5orf44 homolog [Bombus impatiens]
Length = 404
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 128/448 (28%), Positives = 201/448 (44%), Gaps = 49/448 (10%)
Query: 1 MSSTPGT-HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDV 59
M + P + H L +VMRL RP+L + D TDL + L + +D
Sbjct: 1 METKPKSEHLLELKVMRLTRPTLASPVVITCDSTDL-----------PGNTLNNELKNDC 49
Query: 60 TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
T + +++ + +VLPQ+FG IYLGE F SY+ ++N S ++
Sbjct: 50 TALQ--------------GMETLAVGQFMVLPQSFGNIYLGEIFSSYLCVHNGSNQIAKN 95
Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE- 178
V +KA++QT Q I L S ++ + D ++ H+VKE+G H LVC Y+ G
Sbjct: 96 VTVKADLQTSTQNISLCGNS-GEMKDLAPDSTVDEVIHHEVKEIGTHILVCEVTYTPGNL 154
Query: 179 -GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
+ ++FKF V PL V+TK + + +LEA I+N T + +++V
Sbjct: 155 SSTAQSFRKYFKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLEKVSL 208
Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
E S +S + L N + I+ V I YLY LK P +
Sbjct: 209 ESSHLFSVSTLNT-------NEKGESIYG-LVNILDTDCSRQYLYCLKPQLSLLKDPKMM 260
Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
+ +GKL I WR+NLGE GRLQT Q+ +I + + +P V +++
Sbjct: 261 HNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPEFGDIRVTMKNIPLTVYLEQSVNFNCH 320
Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
+ N +++ ++ LS S+ I+ I L P S D L LI + G+
Sbjct: 321 IINTSERS---MDLMLSLESSNSIAWCGISNTIIGTLKP---GVSIDIPLCLIPLRSGII 374
Query: 418 RITGITVFDKLEKITYDSLPDLEIFVDQ 445
I+G+ + D K YD +IFV Q
Sbjct: 375 TISGLKLTDTFLKRVYDYDDLAQIFVSQ 402
>gi|195450486|ref|XP_002072516.1| GK12482 [Drosophila willistoni]
gi|194168601|gb|EDW83502.1| GK12482 [Drosophila willistoni]
Length = 437
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 129/432 (29%), Positives = 205/432 (47%), Gaps = 55/432 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL RP+L P+ + D+ D SN+ +K S++
Sbjct: 9 HLLALKVMRLTRPALVAPGPI--------VNCDLRDLLQPFSNVQ-------KKDKKSEV 53
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ +L+LPQ+FG IYLGETF YI ++N + V V +KA++Q
Sbjct: 54 V----------GKPLTAGYILLLPQSFGNIYLGETFSCYICVHNCTAHSVESVTVKADLQ 103
Query: 128 TDKQRILLL--DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP 185
++ RI L + KS V + D ++ ++VKE+G H LVC Y+ G + L
Sbjct: 104 SNTSRINLPINENCKSSV-MLAPDETLDDVIRYEVKEIGTHILVCEVNYTSPAGFSQSLR 162
Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
+FFKF V PL V+TK EI +LEA I+N T +++VE + S++++
Sbjct: 163 KFFKFQVLKPLDVKTKFY-----NAEMDEI-YLEAQIQNVTTGPFCLEKVELDISEHYTV 216
Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSH-GSSSPVKVQGSNVLG 304
T L P+ + S+ + +P +LY +K S S V Q +NV G
Sbjct: 217 TSLNT-LPNGESVLTSKHMLQP-------NNSCQFLYCIKPKSTIARCSKVLRQFTNV-G 267
Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD- 363
KL I WR+NLGE GRLQT Q+ K++ L V++ +++ I F ++TN ++
Sbjct: 268 KLDIVWRSNLGEKGRLQTSQLQRLPFDYKDLCLEVLDAKNIIKIGSTFSFLCRVTNSSEH 327
Query: 364 --KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
K + LS N G L ++ T+F L++ + LG+ R++
Sbjct: 328 PMKLHIRLDTKLSTNS--------YTGSADFLLETIQPAERTEFSLSICPSNLGLIRVSP 379
Query: 422 ITVFDKLEKITY 433
+ + D L+ Y
Sbjct: 380 LLLVDTLQNRRY 391
>gi|195146730|ref|XP_002014337.1| GL19004 [Drosophila persimilis]
gi|194106290|gb|EDW28333.1| GL19004 [Drosophila persimilis]
Length = 438
Score = 151 bits (382), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 126/446 (28%), Positives = 203/446 (45%), Gaps = 56/446 (12%)
Query: 5 PGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
P H LA +VMRL RP+L VE L P++S +
Sbjct: 6 PDAHLLALKVMRLMRPTL-VE-------------------------LGPVVSCE-----H 34
Query: 65 SDLTYRSRFLLHDS------ADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVR 118
DL R H A+++ +L+LPQ+FG IYLGETF SYI ++N S V
Sbjct: 35 KDLMQRFSSKPHSDVFSGIIAETLSAGQVLLLPQSFGNIYLGETFSSYICVHNCSPQPVE 94
Query: 119 DVVIKAEIQTDKQRI-LLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG 177
+ +K ++Q++ RI L L + + G D ++ ++VKE+G H LVC Y+
Sbjct: 95 CINVKTDLQSNTTRINLSLQKNNKSAIILAPGETIDDVIRYEVKEIGTHILVCEVNYTSP 154
Query: 178 EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
G + L +FFKF V PL V+TK ++ E +LEA I+N T S +++VE
Sbjct: 155 AGYAQSLRKFFKFQVLKPLDVKTKFYNAEI------EEIYLEAQIQNVTTSPFCLEKVEL 208
Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
+ S+ ++ L P+ + ++ + +P +LY +K ++ +
Sbjct: 209 DSSEEFTVIPLNT-LPNGESVFNTKNMLQP-------NNSCQFLYCIKPKVQKATDIHAL 260
Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
+ + +GKL I WR+NLGE GRLQT Q+ K++ V+ + V I F +
Sbjct: 261 RQLSNVGKLDIVWRSNLGEKGRLQTSQLQRLPYECKDLRFEVINALNTVKIGTIFTFNCR 320
Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
+TN T + + L S E G L + + +F L++ +KLG+
Sbjct: 321 VTN-TSEHTMKLHVRLVTKLSPE---CQYTGCADFKLDELNTGENAEFPLSVSPSKLGLI 376
Query: 418 RITGITVFDKLEKITYDSLPDLEIFV 443
+I + + D Y +E+FV
Sbjct: 377 KIADLLLVDTENNEHYSIEKVVEVFV 402
>gi|170590974|ref|XP_001900246.1| Conserved hypothetical protein [Brugia malayi]
gi|158592396|gb|EDP30996.1| Conserved hypothetical protein, putative [Brugia malayi]
Length = 399
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 119/436 (27%), Positives = 201/436 (46%), Gaps = 49/436 (11%)
Query: 10 LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
L +VMR RP + + +DP D LI S +
Sbjct: 10 LTLKVMRFARPKFYENICMPIDPVD---------------TTSQLIGSAL---------- 44
Query: 70 RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
R ++AD I + L+ PQ F IYLGETF Y+ + N S D+ IK ++QT
Sbjct: 45 -CRLTGQETAD-IPIGKYLMAPQKFENIYLGETFTFYVCVQNISDKFATDICIKTDLQTT 102
Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFK 189
QR L + ++ G ++ H++KE+G H LVC Y + E Y +FFK
Sbjct: 103 SQRNALSSQLQEANAVLKPGECLGEVITHEIKEIGQHILVCAVSYKTPKNE-MYFRKFFK 161
Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
F V+ P+ VRTK + + +LEA I+N ++ + +++V EPS + ++ +
Sbjct: 162 FPVTKPIDVRTKFYNAE---DNLNNDVYLEAQIQNTSELPMVLEKVILEPSDFYLSSEIS 218
Query: 250 ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQIT 309
P ++ + KP I YL+ LK + S +G+++ GKL +
Sbjct: 219 P--PETENGTMDQSYLKP-------SDIRQYLFCLKPKTTDYSLNYFRKGTSI-GKLDMV 268
Query: 310 WRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPF 369
WRT +GE GRLQT + ++ L + ++P+ V + F + +L N +++
Sbjct: 269 WRTGMGERGRLQTSALQRMAPGYGDLRLTIEKIPATVKXLQSFRMVCRLRNCSERS---L 325
Query: 370 EIWLSQNDSDEEKVVM--INGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 427
++ L+ + + + I+G+ + LAP +TDF + L+ G+Q I+GI V D
Sbjct: 326 DLVLTLDGKLQPNMAFCSISGIELGQLAPN---STTDFSIELLPLTPGLQSISGIRVTDT 382
Query: 428 LEKITYDSLPDLEIFV 443
+ TY+ ++FV
Sbjct: 383 FLRRTYEHDDIAQVFV 398
>gi|380014781|ref|XP_003691396.1| PREDICTED: UPF0533 protein C5orf44 homolog [Apis florea]
Length = 404
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 126/448 (28%), Positives = 201/448 (44%), Gaps = 49/448 (10%)
Query: 1 MSSTPGT-HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDV 59
M + P + H L +VMRL RP L + D TDL + L + +D
Sbjct: 1 METKPKSEHLLELKVMRLTRPMLASPVVVTCDSTDL-----------PGNTLNNELKNDC 49
Query: 60 TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
T + +++ + +VLPQ+FG IYLGE F SY+ ++N S V++
Sbjct: 50 TALQ--------------GMETLAVGQFMVLPQSFGNIYLGEIFSSYLCVHNGSNQLVKN 95
Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS--DG 177
V++KA++QT Q I L S ++ + D ++ H+VKE+G H LVC Y+ +
Sbjct: 96 VIVKADLQTSTQIISLCGNS-GEMKDLAPDNTVDEVIHHEVKEIGTHILVCEVSYTPVNL 154
Query: 178 EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
+ ++FKF V PL V+TK + + +LEA I+N T + +++V
Sbjct: 155 SNTAQSFRKYFKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLEKVSL 208
Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
E S +S + L N + I+ V I YLY LK P +
Sbjct: 209 ESSHLFSVSTLNT-------NERGESIYG-SVNILDTDCSRQYLYCLKPQISLLKDPKMM 260
Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLK 357
+ +GKL I WR+NLGE GRLQT Q+ +I + + +P V +++
Sbjct: 261 HNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPEYGDIRVTMKNIPLTVYLEQMMNFNCH 320
Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
+ N +++ I L N S + G+ + ++ S D L LIA + G+
Sbjct: 321 IINTSERSMDLMLI-LESNSS-----IAWCGISNTMIGTLKPGVSIDIPLCLIALRSGII 374
Query: 418 RITGITVFDKLEKITYDSLPDLEIFVDQ 445
I+G+ + D YD +IFV Q
Sbjct: 375 TISGLKLKDTFLNRVYDYDDLTQIFVSQ 402
>gi|110750830|ref|XP_624799.2| PREDICTED: UPF0533 protein C5orf44 homolog [Apis mellifera]
Length = 404
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 127/452 (28%), Positives = 199/452 (44%), Gaps = 57/452 (12%)
Query: 1 MSSTPGT-HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDV 59
M + P + H L +VMRL RP L + D TDL + L + +D
Sbjct: 1 METKPKSEHLLELKVMRLTRPMLASPVVVTCDSTDL-----------PGNTLNNELKNDC 49
Query: 60 TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
T + +++ + +VLPQ+FG IYLGE F SY+ ++N S V++
Sbjct: 50 TALQ--------------GMETLAIGQFMVLPQSFGNIYLGEIFSSYLCVHNGSNQLVKN 95
Query: 120 VVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS---- 175
V++KA++QT Q I L S ++ + D ++ H+VKE+G H LVC Y+
Sbjct: 96 VIVKADLQTSTQIISLCGNS-GEMKDLAPDNTVDEVIHHEVKEIGTHILVCEVSYTPVNL 154
Query: 176 --DGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMD 233
+ RKY FKF V PL V+TK + + +LEA I+N T + ++
Sbjct: 155 GNTAQSFRKY----FKFQVVKPLDVKTKFYNAE------SDEVYLEAQIQNLTAGPICLE 204
Query: 234 QVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSS 293
+V E S +S + L N + I+ V I Y Y LK
Sbjct: 205 KVSLESSHLFSVSTLNT-------NEKGESIYG-SVNILDTDCSRQYFYCLKPQISLLKD 256
Query: 294 PVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFL 353
P + + +GKL I WR+NLGE GRLQT Q+ +I + + +P V +++
Sbjct: 257 PKMMHNATNIGKLDIVWRSNLGERGRLQTSQLQRMAPEYGDIRVTMKNIPLTVYLEQMMN 316
Query: 354 LKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATK 413
+ N +++ I S N + G+ + ++ S D L LIA +
Sbjct: 317 FNCHIINTSERSMDLMLILESNNS------IAWCGISNTMIGTLKPGVSIDIPLCLIALR 370
Query: 414 LGVQRITGITVFDKLEKITYDSLPDLEIFVDQ 445
G+ I+G+ + D YD +IFV Q
Sbjct: 371 SGIITISGLKLKDTFLNRIYDYDDLTQIFVSQ 402
>gi|393909700|gb|EJD75555.1| hypothetical protein LOAG_17321 [Loa loa]
Length = 399
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 122/445 (27%), Positives = 205/445 (46%), Gaps = 49/445 (11%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
M+ L +VMRL RP + + +D +A + LI S +
Sbjct: 1 MAEAMKEQLLTLKVMRLARPKFYENMCIPID---------------SADSTSQLIGSAL- 44
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
R ++AD I + L+ PQ F IYLGETF ++ + N S D+
Sbjct: 45 ----------CRLTGQEAAD-IPIGKYLMAPQKFENIYLGETFSFFVCVQNISDKVAMDI 93
Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
IK ++QT QR L + + G I+ H++KE+G H LVC Y + E
Sbjct: 94 CIKTDLQTTSQRNALPSQLQEANAVLEPGKCLGEIITHEIKEIGQHILVCAVSYKTSKNE 153
Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
Y +FFKF V+ P+ VRTK + + +LEA I+N ++ + +++V EPS
Sbjct: 154 M-YFRKFFKFPVTKPIDVRTKFYNAE---DNLNNDVYLEAQIQNTSELPMVLEKVILEPS 209
Query: 241 QNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS 300
+ ++ + P N + + P IR YL+ LK + S +G
Sbjct: 210 DFYISSEI---SPPEIENENMEQSYLNPSDIR------QYLFCLKPKTTDYSLNYFRKGI 260
Query: 301 NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
+GKL + WRT++GE GRLQT + ++ L + ++P+ V + +PF + +L N
Sbjct: 261 -AIGKLDMVWRTSMGERGRLQTSALQRMAPGYGDLRLTIEKIPATVKVLQPFHIVCRLHN 319
Query: 361 QTDKEQGPFEIWLSQNDSDEEKVVMI--NGLRIMALAPVEAFGSTDFHLNLIATKLGVQR 418
+++ P ++ L+ +D + + +G+ + L P +TDF L L+ G+Q
Sbjct: 320 CSER---PLDLVLTLDDKLQPNIAFCSTSGVELGQLPPN---STTDFSLELLPLTPGLQS 373
Query: 419 ITGITVFDKLEKITYDSLPDLEIFV 443
++GI V D + TY+ ++FV
Sbjct: 374 VSGIRVTDTFLRRTYEHDDIAQVFV 398
>gi|391345954|ref|XP_003747246.1| PREDICTED: UPF0533 protein C5orf44 homolog [Metaseiulus
occidentalis]
Length = 388
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 106/353 (30%), Positives = 174/353 (49%), Gaps = 25/353 (7%)
Query: 85 SGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVE 144
S +L LPQAFG IYLGETF SY++++N S+L+V+ V +KAE+Q Q++ L +
Sbjct: 46 SDMLCLPQAFGNIYLGETFSSYMTVHNGSSLDVQGVQLKAELQNGTQKVALTPVVVRGSD 105
Query: 145 SIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE-GERKYLPQFFKFIVSNPLSVRTKVR 203
++ D I++H+VKE+G H L CT Y++ GE ++FKF V PL V+TK
Sbjct: 106 VLKPNESLDQIIQHEVKEIGTHLLQCTVDYTNASTGEPMQFCKYFKFQVYKPLDVKTKSY 165
Query: 204 VVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSRE 263
+ + LEA ++N T + + + +V EPS ++ T L + N
Sbjct: 166 NAE------NDEVLLEAQLQNITANPVTLAKVSLEPSPHFQVTAL-------NQNDNGES 212
Query: 264 IFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN---VLGKLQITWRTNLGEPGRL 320
IF L+ YL+ L + + KV+G+ +GKL I W++ +GE GRL
Sbjct: 213 IFGQVNLLNPQDS-RQYLFSL-IPKNRLPQESKVKGTRPPFAIGKLDIIWKSAIGEKGRL 270
Query: 321 QTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDE 380
QT Q+ +I L + PS + ++ PF + + N ++ + L+ + ++
Sbjct: 271 QTSQLERVATVYSDIRLVIENYPSKIELETPFTISCTIFNTCER-----ALDLTVSLENQ 325
Query: 381 EKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITY 433
E ++ + L ++A LI T+ G+Q I GI + K Y
Sbjct: 326 EGLMWLESTG-YELGQIQAHSKMTKDFALIMTRCGLQTIGGIKFTESFLKRVY 377
>gi|358058981|dbj|GAA95379.1| hypothetical protein E5Q_02033 [Mixia osmundae IAM 14324]
Length = 613
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 118/349 (33%), Positives = 164/349 (46%), Gaps = 75/349 (21%)
Query: 2 SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTT 61
SS H L+ RV+RL RPS E ++I +D D
Sbjct: 4 SSMTEAHPLSVRVLRLLRPSAAKE-------DTIYIDKDAVDL----------------- 39
Query: 62 NKSSDLTYRSRFLLHDSADSIGLSG--LLVLPQAFGAIYLGETFCSYISINNSSTLEVRD 119
L R+ L D A S LL L FG IYLGETF Y++++N +
Sbjct: 40 -----LGARNSLLRQDVAQFCDFSAAPLLALSSVFGQIYLGETFNGYLAVHNDQDSPITG 94
Query: 120 VVIKAEIQTDKQRILLLDTSKS---PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
V +K E+QT + R L +T P ES+ + +V H++KE+G H+LVCT Y+
Sbjct: 95 VNLKVEMQTAQNRWTLAETRSGLLKPRESL------ETVVRHELKEIGVHSLVCTVSYTV 148
Query: 177 GEG-----------ERKYLPQFFKFIVSNPLSVRTKVRVVK-VGA---THFQEITFLEAC 221
EG ++ L + FKF +SNPLSV+TK+ + K V A + +E +LE
Sbjct: 149 AEGSQQGFAPELGASQRVLKKSFKFSMSNPLSVKTKIHMAKSVTALLDKNQRETAYLELQ 208
Query: 222 IENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYL 281
I+N T + L +Q+ FEPSQ T + A+ IF + S G I YL
Sbjct: 209 IQNMTSAPLVFEQMRFEPSQGL--TFVDANS----------SIFDNEAALLSPGDIRQYL 256
Query: 282 YQLKMLSHG-SSSPV----KVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
Y ++S + SPV KV G LG+L I WRT GE G+LQT Q+
Sbjct: 257 Y---IVSPAVTPSPVFESGKVNGQMNLGRLNIVWRTPNGEGGKLQTSQL 302
>gi|389741307|gb|EIM82496.1| DUF974-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 704
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 90/277 (32%), Positives = 141/277 (50%), Gaps = 27/277 (9%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
LL LP +FGAI LGETF ++INN + + V V +K E+QT ++LL + P +S+
Sbjct: 67 LLTLPSSFGAIQLGETFSGVLAINNETVVAVDGVNLKIEMQTATNKVLLAELG-GPTQSL 125
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALY---------------SDGEGERKYLPQFFKFI 191
AG + IV H++KELG H L CT Y +G+ + + +F+KF
Sbjct: 126 VAGDTLETIVNHEIKELGQHVLACTVTYQLPPGARPPQPPFDGQNGDPDVQTFRKFYKFA 185
Query: 192 VSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
V+NPLSV+TKV + + +E FLE I+N T+ ++ +++ FEP+Q W
Sbjct: 186 VTNPLSVKTKVHTPRSPSALLSRSEREKVFLEVHIQNLTQEPMWFERMLFEPAQGWQVEE 245
Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LGKL 306
P + +F + I Y+Y L + + + GS + LG+L
Sbjct: 246 GNVLPPSDPDATEPESLFTGSQTLMQPQDIRQYMYILAAVKLPTFAIQHTPGSIIPLGRL 305
Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVP 343
I+WR++ GEPGRL T++ S+ I + V+ P
Sbjct: 306 DISWRSSFGEPGRLL------TSMLSRRIPVPSVQSP 336
>gi|170094860|ref|XP_001878651.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164647105|gb|EDR11350.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 644
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 96/304 (31%), Positives = 151/304 (49%), Gaps = 37/304 (12%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
LL LP +FG+I LGETF S + +NN + +E+ +K E+QT +I+L +T+ P +
Sbjct: 70 LLTLPSSFGSIQLGETFSSCLCVNNDAQIEIEVTQMKVEMQTASTKIILSETAD-PGHHL 128
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY--------------LPQFFKFIV 192
AG +V H++KELG H L CT Y RK +F+KF V
Sbjct: 129 AAGKTLQSVVHHEIKELGQHVLACTVTYRSPPNVRKVPGAAEDAGDPTLQTFRKFYKFAV 188
Query: 193 SNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSA--- 245
+NPLSV+TKV + + +E FLE I+N T+ + +++ FE + W +
Sbjct: 189 TNPLSVKTKVHAARCPSALLSGEEREKIFLEVHIQNLTQQPMCFERMRFECADGWESEHG 248
Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LG 304
+L+++G + IF P+ + I Y+Y L + + V + G+ + LG
Sbjct: 249 NLLRSEG-----VDNPKGIFSGPLALMQPQDIRQYVYILTTKTPTVAPTVHLPGNVIPLG 303
Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDK 364
+L I+W + GEPGRL T++ S+ I L V+ P V P+ LK +T +
Sbjct: 304 RLDISWTSAFGEPGRLL------TSMLSRRIPLPSVQQP--VSALPPY-LKRSTGQETSR 354
Query: 365 EQGP 368
Q P
Sbjct: 355 PQSP 358
>gi|426200343|gb|EKV50267.1| hypothetical protein AGABI2DRAFT_64546, partial [Agaricus bisporus
var. bisporus H97]
Length = 651
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 86/262 (32%), Positives = 134/262 (51%), Gaps = 28/262 (10%)
Query: 85 SGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVE 144
S LL LP +FG I LG+TF + +NN +T V + ++ E+QT + LL T +
Sbjct: 23 SDLLTLPPSFGTIQLGQTFSGCLCVNNEATFSVDSIRVRIEMQTVTSKTLLFLTQEPQGR 82
Query: 145 SIRAGGRYDFIVEHDVKELGAHTLVCTALYS--------DGEGERKYLP------QFFKF 190
++ +G + IV +++KELG H L CT Y G E P +F+KF
Sbjct: 83 TLSSGDTLELIVSNEIKELGQHVLACTVTYRLPPNVRPIAGASEDPKDPALATFRKFYKF 142
Query: 191 IVSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
IV+NPL+V+TKV V+ +E FLE I+N T+ ++ +++ FEP++ W
Sbjct: 143 IVTNPLAVKTKVHPVRSPTALLSPEEREKIFLEIHIQNVTQDTMHFERLSFEPTEEW--- 199
Query: 247 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN---VL 303
+ P+ N QS IF P+ + + + Y++ L S + P+ V L
Sbjct: 200 --QVQDPNFTSNGQS--IFSGPIALVNPQDVRQYIFILSPTSTAALRPLAVHPPGSIFPL 255
Query: 304 GKLQITWRTNLGEPGRLQTQQI 325
G+L I WR++ GEPGRL T +
Sbjct: 256 GRLNIVWRSSYGEPGRLLTSML 277
>gi|384493079|gb|EIE83570.1| hypothetical protein RO3G_08275 [Rhizopus delemar RA 99-880]
Length = 934
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 123/421 (29%), Positives = 191/421 (45%), Gaps = 83/421 (19%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTN---KS 64
H L+ +VMRL RP P+ + T+ P+ L L SD+T +
Sbjct: 24 HLLSLKVMRLSRPQFATTLPVFYESTEA--------SPLV-DGLDSLNISDLTACHPIQP 74
Query: 65 SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
SD+ R GLS +L LP AFG IYLGETF + +SINN S + V V K
Sbjct: 75 SDIQIRD----------FGLSQMLKLPSAFGNIYLGETFSTLVSINNESPIPVHQVTTKI 124
Query: 125 EIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
E+QT QR LL D + P+ + G D V H++KELG H LVC+ Y +G
Sbjct: 125 ELQTSSQRFLLAD--QPPLNDLSPGANSDITVSHEIKELGVHILVCSVQYIGDDGR---- 178
Query: 185 PQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS 244
FLEA ++N + +++++++FEPS+++
Sbjct: 179 -------------------------------VFLEAQLQNVSAGPMFLERMKFEPSEHFG 207
Query: 245 ATML--KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
L + D + + Q F P +R YLY MLS + + + +N
Sbjct: 208 FESLNGRMDSEKTVFEDQ----FIHPQDVR------QYLY---MLSPHHADRIS-RTTNA 253
Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPS----VVGIDKPFLLKLKL 358
LGKL I WR+ +G+ GRLQT Q+ ++IE+ V V ++ PF L +++
Sbjct: 254 LGKLDIVWRSAMGDMGRLQTSQLTRKAPLLEDIEIQPFWVQQDAEVKVVLETPFRLGIRV 313
Query: 359 TNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQR 418
TN +++ ++ LS + + V+++GL L + ST+ L G+QR
Sbjct: 314 TNHSNEN---MKLVLSAIKT-KMGSVLLSGLGSRQLGELGPGQSTETELEFFPLTPGLQR 369
Query: 419 I 419
+
Sbjct: 370 V 370
>gi|390598322|gb|EIN07720.1| DUF974-domain-containing protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 662
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 90/264 (34%), Positives = 131/264 (49%), Gaps = 41/264 (15%)
Query: 85 SGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVE 144
+ LL LP AFG+I LGETF S + INN + ++V+ V +K E+QT + L D P
Sbjct: 65 TNLLTLPAAFGSIQLGETFTSCLCINNEAAVDVQAVSMKVEMQTATTKTTLADIG-GPDF 123
Query: 145 SIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP---------------QFFK 189
++ GG + +V H++KELG H L CT Y R + P +F+K
Sbjct: 124 TLAPGGVSENVVSHEIKELGQHVLACTVSYRLPSSVR-HAPAGSVDPANPHLATFRKFYK 182
Query: 190 FIVSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
F V+NPLSV+TKV V + + +E FLE I+N T+ ++ ++++FEPS W
Sbjct: 183 FAVTNPLSVKTKVHVPRSPSALLSRTEREKVFLEVHIQNLTQDAMWFERIQFEPSDGWQ- 241
Query: 246 TMLKADGPHSDYNAQ---SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
H +A S + +P +LY L LS GS +
Sbjct: 242 --------HDSSSATPVVSESLMQP-------QDTRQFLYVLSPLSIPDFPVTHAPGSIL 286
Query: 303 -LGKLQITWRTNLGEPGRLQTQQI 325
LG+L I+WR+ GEPGRL T +
Sbjct: 287 PLGRLDISWRSGFGEPGRLITSTL 310
>gi|409046259|gb|EKM55739.1| hypothetical protein PHACADRAFT_121565 [Phanerochaete carnosa
HHB-10118-sp]
Length = 724
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 90/271 (33%), Positives = 132/271 (48%), Gaps = 35/271 (12%)
Query: 80 DSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTS 139
D +S +L LP AFGAI LGETF S + +NN ++ E+ V ++ E+QT + +L +
Sbjct: 60 DLTHISEMLTLPSAFGAIQLGETFSSCLVVNNETSGEIETVTLRVEMQTATTKQVLAEYG 119
Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP-------------Q 186
P + G + +V H++KELG H L CT Y G + P +
Sbjct: 120 -GPDYRLAPGDAMENVVHHEIKELGQHVLACTVSYHLPPGHKPVHPAGEGHDPGIQSFRK 178
Query: 187 FFKFIVSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQN 242
F+KF V+NPLSV+TKV V + + +E FLE +N T +++ ++ FE +
Sbjct: 179 FYKFAVTNPLSVKTKVHVPRAPSALLSSTEREKVFLEVHTQNLTPDAMWLQRMRFEAVEG 238
Query: 243 WSA----TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQ 298
W+ T+L PH N IF + + YLY +LS SP V
Sbjct: 239 WNVQDVNTLL---APH---NKDGETIFSDSMALMQPQDTRQYLY---ILSPKELSPFPVN 289
Query: 299 GSN----VLGKLQITWRTNLGEPGRLQTQQI 325
S LG+L I+WR+ GEPGRL T +
Sbjct: 290 HSPGSIIPLGRLDISWRSAFGEPGRLLTSML 320
>gi|440796425|gb|ELR17534.1| hypothetical protein ACA1_062880 [Acanthamoeba castellanii str.
Neff]
Length = 408
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 131/453 (28%), Positives = 214/453 (47%), Gaps = 71/453 (15%)
Query: 15 MRLCRPSLHVEPPLRVDPTDL-----FIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
MRL +P+L +PP+ V+ D GED P + SS+V
Sbjct: 1 MRLSKPTLQFQPPVLVEADDAPYPLSKTGED----------QPTMTSSNVQ--------- 41
Query: 70 RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
++ LS L LP+AFG IY+GETFCSYIS+ N + ++ V ++AE+ T
Sbjct: 42 ----------NAFSLSPGLNLPRAFGNIYVGETFCSYISLYNHTQSDLHLVGLRAELNTK 91
Query: 130 KQRILLLD-TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFF 188
+ LL+D T+ ++ + AG R+DFIV + V E H LVCT Y+ G GE+K +FF
Sbjct: 92 VLKNLLIDQTTAGSIQRLAAGERHDFIVRYRVVEPTMHILVCTISYAKG-GEKKSFRKFF 150
Query: 189 KFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT-- 246
KF V + S K R+ H ++ T LE + N ++ ++++ V++ P+ N
Sbjct: 151 KFTVVD--SFEWKQRIF-----HIKDDTLLEVQLRNVARNAVFLNNVKYGPAFNPGTARS 203
Query: 247 ---MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN-- 301
L+ +D ++ + S G + + +S ++++ +
Sbjct: 204 YLFQLRPRRGAADATMYTKRLRNRVSDADSAGANEDD----EETDSSTSDEMQIELARIK 259
Query: 302 ------VLGKLQITWRTNLGEPGRLQTQQIL--GTTITSKEIELNVVEVPSVVGIDKPFL 353
VLGKL ++W T+ GE G T++IL S E+E+++ + S + ++ PF
Sbjct: 260 LEADEMVLGKLLLSWHTSFGETG---TRKILVKHKPSPSPEVEISITSIASAITLETPFP 316
Query: 354 LKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRI-MALAPVEAFGSTDFHLNLIAT 412
+ +TN+ + P W+ Q D V+ GL L + + GS + +
Sbjct: 317 ATVTVTNKLPR---PILPWV-QLAQDHTANVVAAGLSAGFKLEEIPSGGSKSAEVAFLPL 372
Query: 413 KLGVQRITGITVFDKLEKITYDSLPDLEIFVDQ 445
+ G+Q ITGI+V DK Y + PD EI V Q
Sbjct: 373 QAGIQTITGISVLDKKTGRVY-ACPDHEILVLQ 404
>gi|392596039|gb|EIW85362.1| DUF974-domain-containing protein [Coniophora puteana RWD-64-598
SS2]
Length = 660
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 83/258 (32%), Positives = 128/258 (49%), Gaps = 23/258 (8%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
L LP +FGAI LGETF S +S+NN +++ V ++ EIQT + L+ + P +
Sbjct: 60 FLTLPSSFGAIQLGETFSSCLSVNNEVNIDIEAVTVRVEIQTMNTKTLVAELG-GPDFKL 118
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER--------------KYLPQFFKFIV 192
G + +V+H+VKELG H L C Y R + L +F+KF V
Sbjct: 119 TPGQSLEHVVQHEVKELGQHVLACAVSYRMPSHTRPSAVPAAPGADPNLQTLRKFYKFAV 178
Query: 193 SNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
+NPLSV+TKV V K +E FLE ++N T+ L+ +++ FE +++W A
Sbjct: 179 TNPLSVKTKVHVPKSPTASLLEAEREKVFLEVHVQNLTQEPLWFEKIRFECAESWKAIDT 238
Query: 249 KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LGKLQ 307
P Y+ E+F + + + Y+Y L + V G+ + LG+L
Sbjct: 239 AGTEPSKSYD---EELFTDDMSLMQPQDVRQYIYTLVPAVLSTFPLVHPPGTVIALGRLD 295
Query: 308 ITWRTNLGEPGRLQTQQI 325
I+WR+ GE GRL T +
Sbjct: 296 ISWRSQFGELGRLLTSML 313
>gi|449547690|gb|EMD38658.1| hypothetical protein CERSUDRAFT_123212 [Ceriporiopsis subvermispora
B]
Length = 721
Score = 131 bits (329), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 103/346 (29%), Positives = 163/346 (47%), Gaps = 58/346 (16%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H L+ +VMR+ RPSL +P +S+ P S +T + L
Sbjct: 6 HLLSLKVMRVSRPSLAST-----------------WEPYYSSSQP---FSQRSTASITSL 45
Query: 68 TYRSRFLLHDSA--DSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
++ H + D S +L+LP +FG I +GE F S +S+NN + E+ V ++ E
Sbjct: 46 QGKAPLPGHPNTLRDLAHASEMLMLPSSFGTIQIGEVFTSCLSVNNETNAEIDGVHVRVE 105
Query: 126 IQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER---- 181
+QT + +LL+ P + G + +V H++KELG H L CT Y G R
Sbjct: 106 MQTATSKTVLLEMG-GPNSQLAVGASLEKVVSHEIKELGQHVLGCTVSYRLPPGYRPVPG 164
Query: 182 ----------KYLPQFFKFIVSNPLSVRTKVRVVKVGAT----HFQEITFLEACIENHTK 227
+ +F+KF V+NPLSV+TKV V + + + +E FLE I+N T+
Sbjct: 165 TSSEAVDPGVQTFRKFYKFAVTNPLSVKTKVHVPRAPSALLSRNEREKVFLEVHIQNLTQ 224
Query: 228 SNLYMDQVEFEPSQNWSAT------MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYL 281
+++++V FE S W A + ADG S + S + +P + Y+
Sbjct: 225 DGMWLERVRFECSDGWQAQDANRLGLGDADGGESIFTG-SMALLQP-------QDMRQYI 276
Query: 282 YQLKMLSHGSSSPVKVQGSNV--LGKLQITWRTNLGEPGRLQTQQI 325
Y L + P+ Q ++ LG+L I+WR+ GEPGRL T +
Sbjct: 277 YILSP-TVPPPFPITHQPGSILPLGRLDISWRSPFGEPGRLLTSML 321
>gi|348690154|gb|EGZ29968.1| hypothetical protein PHYSODRAFT_323413 [Phytophthora sojae]
Length = 456
Score = 128 bits (321), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 94/313 (30%), Positives = 151/313 (48%), Gaps = 33/313 (10%)
Query: 78 SADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLD 137
S LS +L+LP +FG I+LG TF SYIS+ N + E+RDV + A IQ R+ L D
Sbjct: 75 SQHEFALSSMLILPDSFGEIFLGNTFSSYISVINPYSCELRDVGLSANIQCANDRVELHD 134
Query: 138 -----TSK----SPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG-EGERKYLPQF 187
T K +PV + AG D +V++ + ++G H L Y D GE K L +F
Sbjct: 135 NRYARTGKLPPPNPVAVLPAGSSLDMVVDYPLNQVGNHVLRVGVAYVDPITGESKSLRKF 194
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
++F V NPL + K A + I +EA I N +K L++D ++F P +++
Sbjct: 195 YRFAVQNPLVITFKQNSATGQALKGEAI--VEAQIRNVSKLPLFVDSIKFLPLPPFTSEE 252
Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLY---QLKMLSHGSSSPV-------KV 297
+ D + I L+ +Y +L+ + S P
Sbjct: 253 MGVDPVGKKAEGEQASIQD---LLSVNSSPQTLVYPQEELQRVFRVSYDPASDPTLLSSA 309
Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQILGTT--------ITSKEIELNVVEVPSVVGID 349
QGS LG+L + W+T++GE G +Q+Q ++ T E+ + V E+P V +
Sbjct: 310 QGSQNLGRLHVGWKTSMGEAGSVQSQPVMRKTPGAAGHGGAGHSEVAVAVEELPKEVMVG 369
Query: 350 KPFLLKLKLTNQT 362
+PFL+ + +TN++
Sbjct: 370 QPFLVAVSVTNKS 382
>gi|403417125|emb|CCM03825.1| predicted protein [Fibroporia radiculosa]
Length = 1166
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 84/260 (32%), Positives = 136/260 (52%), Gaps = 28/260 (10%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
+L+LP +FGAI LGETF S +S+NN ++++V V + E+QT + + + P +
Sbjct: 523 VLMLPSSFGAIQLGETFTSCLSVNNEASVDVESVTLTVEVQTASTKATVAEFG-GPDFRL 581
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP--------------QFFKFIV 192
G + +V H++KELG H L CT Y G R + +F+KF V
Sbjct: 582 AVGESLEKVVGHEIKELGQHALACTISYRLPSGIRAPVAPAADSNDPNLYVFRKFYKFAV 641
Query: 193 SNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
+NPLSV+TKV V + + F +E FLE ++N T+ ++++++ E + W
Sbjct: 642 TNPLSVKTKVHVPRAPSATFSRVEREKVFLEIHVQNLTQDAMWLERMRLECADGW----- 696
Query: 249 KADGPH--SDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LGK 305
KAD + +D +A S +F + + + Y+Y L ++ GS V LG+
Sbjct: 697 KADDANLMNDEDA-SESVFSGSMGLMQPHDMRQYIYILSPVNLALFPTAHQPGSVVPLGR 755
Query: 306 LQITWRTNLGEPGRLQTQQI 325
L ITW+++ GEPGRL T +
Sbjct: 756 LDITWKSSFGEPGRLLTSML 775
>gi|395330058|gb|EJF62442.1| hypothetical protein DICSQDRAFT_160869 [Dichomitus squalens
LYAD-421 SS1]
Length = 718
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 84/260 (32%), Positives = 136/260 (52%), Gaps = 26/260 (10%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
LL LP +FGAI LGETF S +S+NN + ++V V++ E+QT + LL + P + +
Sbjct: 67 LLTLPSSFGAIQLGETFSSCLSVNNEANVDVEGVIVHVEMQTASTKTLLAEFG-GPEQRL 125
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER--------------KYLPQFFKFIV 192
G + IV H++KELG H L CT Y G R + +F+KF V
Sbjct: 126 GVGQSLEKIVSHEIKELGQHVLGCTVSYRMPPGVRPPPGQSADLQDPSVESFRKFYKFAV 185
Query: 193 SNPLSVRTKVRVVKVG----ATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
+NPLSV+TKV + + ++ +E LE I+N T+ +++++++F+ W A
Sbjct: 186 TNPLSVKTKVHLPRSPTALLSSEEREKVLLEVHIQNLTQDAMWLERMQFDCVDGWQAQ-- 243
Query: 249 KADGPH-SDYNAQSRE-IFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LGK 305
D + D A S+E +F + + Y+Y L+ ++ G+ + LG+
Sbjct: 244 --DANYLEDAAAGSKESLFTGSTALMQPQDVRQYIYILQPINLPPFPITHAPGAILALGR 301
Query: 306 LQITWRTNLGEPGRLQTQQI 325
L I+WR++ GEPGRL T +
Sbjct: 302 LDISWRSSFGEPGRLLTSTL 321
>gi|302690716|ref|XP_003035037.1| hypothetical protein SCHCODRAFT_232409 [Schizophyllum commune H4-8]
gi|300108733|gb|EFJ00135.1| hypothetical protein SCHCODRAFT_232409 [Schizophyllum commune H4-8]
Length = 617
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 86/260 (33%), Positives = 127/260 (48%), Gaps = 31/260 (11%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
LL+LP +FG+I LGETF S + NN + ++V V +K E+QT ++ L + P ++
Sbjct: 53 LLMLPASFGSIQLGETFSSCLCANNDTQVDVDSVTVKVEMQTATTKVTLGEFG-GPQYTL 111
Query: 147 RAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP-------------QFFKFIVS 193
AG + +V H+VKELG H L T Y R +P +F+KF+V+
Sbjct: 112 AAGDTLECLVTHEVKELGQHVLSATVSYRLPPNARPPVPAEDPDDPQMQHFRKFYKFVVT 171
Query: 194 NPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
NPLSV+TKV K + ++ FLE I+N T+ L+ +++ EP W
Sbjct: 172 NPLSVKTKVHTPKSPSAQLSTSERDKIFLEVHIQNLTQEPLWFERMLLEPVDGWDV---- 227
Query: 250 ADGPHSDYNAQSRE---IFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LGK 305
D N S E IF + + Y+Y + S V GS + LG+
Sbjct: 228 -----EDTNLGSTEEDGIFTGTTALMGPQDMRQYIYIMSSQSPPRIPVVHSPGSIIPLGR 282
Query: 306 LQITWRTNLGEPGRLQTQQI 325
L I WR++ GEPGRL T +
Sbjct: 283 LDIAWRSSFGEPGRLLTSML 302
>gi|301119703|ref|XP_002907579.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262106091|gb|EEY64143.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 358
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 98/307 (31%), Positives = 155/307 (50%), Gaps = 32/307 (10%)
Query: 82 IGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLD---- 137
LS +L+LP +FG I+LG TF SYIS+ N T E+RDV + A IQ R+ L D
Sbjct: 33 FALSSMLILPDSFGEIFLGNTFSSYISVINPYTCELRDVGLSANIQCANDRVELHDNRYA 92
Query: 138 -TSK----SPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG-EGERKYLPQFFKFI 191
T K +PV + AG D +V++ + +G H L Y D GE K L +F++F
Sbjct: 93 RTGKLPPPNPVAMLPAGSSLDMVVDYPLNLVGNHVLRVGVAYVDPVTGENKSLRKFYRFA 152
Query: 192 VSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKAD 251
V NPL + K + H + I +EA I N +K L++D ++F P +++ + +
Sbjct: 153 VQNPLVITFK-QNSPASQQHGEAI--VEAQIRNVSKLPLFVDSIKFLPLAPFTSEEMVVN 209
Query: 252 GPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSH-------GSSSP--VKVQGSNV 302
S N R K L+ G +Y + L +S P + QGS
Sbjct: 210 ---SGGNRGERPSIKE--LLSLNNGPQTLVYPQEELQRVFRVWYDPASDPSLLTTQGSQN 264
Query: 303 LGKLQITWRTNLGEPGRLQTQQIL----GTTITS-KEIELNVVEVPSVVGIDKPFLLKLK 357
LG+L + W+T++GE G +Q+Q ++ GT+ E+ + + E+P+ V + +PFL +
Sbjct: 265 LGRLHVGWKTSMGEAGSVQSQPVVRKVPGTSGGGHSEVLVAMQELPTEVVVGQPFLAAIS 324
Query: 358 LTNQTDK 364
+TN T +
Sbjct: 325 VTNNTTR 331
>gi|237831303|ref|XP_002364949.1| hypothetical protein TGME49_057020 [Toxoplasma gondii ME49]
gi|211962613|gb|EEA97808.1| hypothetical protein TGME49_057020 [Toxoplasma gondii ME49]
gi|221487204|gb|EEE25450.1| conserved hypothetical protein [Toxoplasma gondii GT1]
gi|221506886|gb|EEE32503.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 395
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 108/430 (25%), Positives = 190/430 (44%), Gaps = 57/430 (13%)
Query: 10 LAFRVMRLCRPSLHVEP--PLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
L +VMRL +PS++ EP LR+D + S D + K +
Sbjct: 9 LTLKVMRLSQPSIYAEPWPLLRIDE---------------------VTSEDQSVKKKLE- 46
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
R R + + +S + L+LP + G I+ GETF +YI+I+NSS + +V+I+ E+
Sbjct: 47 --RERVCVERALES---THALLLPASQGRIFSGETFSAYINISNSSNAQAVNVIIQVELS 101
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCT-ALYSDGEGERKYLPQ 186
++R LL D S+ P+ S+ G +D + H++ E G +TLVC + Y GE+K +
Sbjct: 102 IGQKRDLLFDNSQDPIRSLTPGNSFDCTIVHELTESGTYTLVCAVSHYLSAVGEQKSFKK 161
Query: 187 FFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 246
FKF P V +V +++ A F+E +EN ++ +Y+ ++
Sbjct: 162 SFKFAAHPPFGVGHRVVLLQGRA-------FVECSVENVSQEAVYLSDASIFCVEDIEGV 214
Query: 247 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK-MLSHGSSSPVKVQGSNVLGK 305
L + P N FKP +N ++ L + P ++ VLG+
Sbjct: 215 RLDSGPPSDGRNHNGLHYFKP-------HDRYNLVFSLTPTATKLGEDPSFIRRLPVLGQ 267
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L + WRT+ G G + + + S + P + +++PF ++++++ ++
Sbjct: 268 LALEWRTSTGGAGCMHEYTLTNSLAESSK--------PLSLRVERPFQVEIEVSAHVEQV 319
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
P I SD E V I G L ++ F + L + G + GI V+
Sbjct: 320 FCPVLIL---RPSDLEPFV-IQGSTTRPLGIIDMFTPRRYILEAVCLSPGFHSVKGIMVY 375
Query: 426 DKLEKITYDS 435
D + T D+
Sbjct: 376 DPDTQQTADA 385
>gi|299753765|ref|XP_001833471.2| hypothetical protein CC1G_05171 [Coprinopsis cinerea okayama7#130]
gi|298410453|gb|EAU88405.2| hypothetical protein CC1G_05171 [Coprinopsis cinerea okayama7#130]
Length = 633
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 85/261 (32%), Positives = 130/261 (49%), Gaps = 32/261 (12%)
Query: 85 SGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKS-PV 143
S LL LP +FG+I LGETF S + +NN +T V IK E+QT ++ L + ++ P
Sbjct: 48 SELLTLPASFGSIQLGETFSSCLCVNNEATSAVEVKQIKVEMQTVTTKVTLSELDETGPT 107
Query: 144 ESIRAGGRYDFIVEHDVKELGAHTLVCTALYS--------DGEGERKYLP------QFFK 189
+ + AG + IV H++KELG H L CT Y G E P +F+K
Sbjct: 108 KMLEAGDSLETIVHHEIKELGQHVLACTVTYRLPPSARPVPGAAEDASDPSLLTFRKFYK 167
Query: 190 FIVSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
F V+NPLSV+TKV K + ++ FLE I+N T ++++ +++ FE ++ +
Sbjct: 168 FAVTNPLSVKTKVHTSKSPSASLSLDERDKLFLEVHIQNLTPASMFFEKMRFECAEGF-- 225
Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LG 304
D + + +F Y+Y L S + P GS + LG
Sbjct: 226 ----------DVDDINGPVFSGSFATMQPQDTRQYVYILTPKSTTVAPPALPPGSIIPLG 275
Query: 305 KLQITWRTNLGEPGRLQTQQI 325
+L I+WR++ GEPGRL T +
Sbjct: 276 RLDISWRSSYGEPGRLLTSML 296
>gi|392567447|gb|EIW60622.1| DUF974-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 716
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 86/261 (32%), Positives = 134/261 (51%), Gaps = 29/261 (11%)
Query: 84 LSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPV 143
++ LL LP AFGAI LGETF S +SINN + ++V V+I+ E+QT + LL + S
Sbjct: 66 ITDLLTLPAAFGAIQLGETFSSCLSINNDANIDVDGVIIRVEMQTASSKALLAEFGGS-N 124
Query: 144 ESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP-------------QFFKF 190
+ + G + +V H++KELG H L C+ Y G R P +F+KF
Sbjct: 125 QRLGVGETLEKVVSHEIKELGQHVLGCSVSYRVPPGVRNLPPAADAQDPSIQTFRKFYKF 184
Query: 191 IVSNPLSVRTKVRVVKVG----ATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS-- 244
V+NPLSV+TKV + + + +E FLE I+N T+ +++++++FE W
Sbjct: 185 AVTNPLSVKTKVHLPRSPTALLSAQEREKVFLEVHIQNLTQDAMWLERMQFECIDGWQVQ 244
Query: 245 -ATMLKADGPHSDYNAQSRE-IFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
A +L+ + S+E +F + + Y+Y L + G +
Sbjct: 245 DANILE------NTATGSKEYLFSGTTALMQPQDLRQYIYILSPKVLPPFPIAHIPGHIL 298
Query: 303 -LGKLQITWRTNLGEPGRLQT 322
LG+L I+WR+ GEPGRL T
Sbjct: 299 PLGRLDISWRSCYGEPGRLLT 319
>gi|242004692|ref|XP_002423213.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212506184|gb|EEB10475.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 377
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 118/423 (27%), Positives = 188/423 (44%), Gaps = 91/423 (21%)
Query: 8 HSLAFR---VMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
H+L + +MRL +P+L PL V + EDI ++ + +D+TT
Sbjct: 11 HTLTLKGLLIMRLTKPAL--SSPLIVTNESKDLPEDILNNDL---------KNDITTVNE 59
Query: 65 SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
++ +FLL +PQ+FG I+LGE+F YI I+N S ++V +KA
Sbjct: 60 TETLAVGQFLL--------------IPQSFGTIHLGESFLGYILIHNDSNQIAKNVHVKA 105
Query: 125 EIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
++QT Q+I LL EH + EL H K +
Sbjct: 106 DLQTVTQKIPLL--------------------EHKLSELSPH---------------KTI 130
Query: 185 PQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWS 244
QFFKF V PL ++TK + + FLEA ++N T +++++V FE S +
Sbjct: 131 DQFFKFEVKTPLDLKTKFYNAE------SDEVFLEAQVQNITAGPIHLEKVSFESSDLFK 184
Query: 245 -ATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
+++ K D SD + F+ Y+Y L + S + G+ +
Sbjct: 185 VSSLYKTDEIKSDDSLLQPNEFR------------QYVYCLTPIYDSDGS--HLFGATNI 230
Query: 304 GKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTD 363
G+L I WR NLGE GRLQT Q+ EI L+V +P++V I++PF K++N
Sbjct: 231 GRLDIAWRYNLGEKGRLQTSQLQKMAPDFGEIRLSVHNLPNIVKIEEPFKFLCKISNLR- 289
Query: 364 KEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGIT 423
++ LS S + V + G + ++ GS L L+ G+ I+GI
Sbjct: 290 ----AMDLVLSLEKSHPDLVWI--GTSGQHIGKLDIGGSKVIELTLVPLSAGLHNISGIR 343
Query: 424 VFD 426
+ D
Sbjct: 344 LKD 346
>gi|393216624|gb|EJD02114.1| DUF974-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 807
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 110/375 (29%), Positives = 165/375 (44%), Gaps = 69/375 (18%)
Query: 6 GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
G H L+ +VMR+ RPSL + F P++ PL T
Sbjct: 8 GQHPLSLKVMRVSRPSLASHWQPFFSSSPSFSAHSTAH-PLSLQGAEPLPGHPKTLR--- 63
Query: 66 DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
DLT+ S LL LP AFGAI LGETF +S+NN L V V + E
Sbjct: 64 DLTH--------------ASNLLTLPAAFGAIQLGETFACVLSVNNEVGLPVDSVRARVE 109
Query: 126 IQTDKQRILLLDTSKSPVESIR----------------AGGRYDFIVEHDVKELGAHTLV 169
+QT ++LL + + +S R G + V ++KELG H L
Sbjct: 110 MQTATSKVLLAEVNAG--DSDRDVKMEETSGSGTGTLGTGDSLELCVATEIKELGQHVLA 167
Query: 170 CTALYSDGEGER--------------KYLPQFFKFIVSNPLSVRTKVRVVKVGATHF--- 212
CT Y G R + +F+KF+V+NPLSV++KV V K
Sbjct: 168 CTVTYRTPPGMRPATSGAYNAEDPFMQTFRKFYKFMVTNPLSVKSKVHVPKSPTALLSRS 227
Query: 213 -QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS-----REIFK 266
++ FLE I+N T++ ++ +++ E + W A P D ++ + + IF
Sbjct: 228 ERDKVFLEVHIQNLTQAPMWFEKIRLEAVEGWDVVDANAISPPFDLSSTADAENEKSIFS 287
Query: 267 PPVLIRSGGGIHNYLYQL--KMLSHGSSSPV-KVQGSNV-LGKLQITWRTNLGEPGRLQT 322
+ + + Y+Y L K +S P V G+ + LG+L I+WR+++GEPGRL
Sbjct: 288 GSMALMPPHDMRQYVYILTPKFTPRNTSVPAPPVPGTVIPLGRLDISWRSSMGEPGRLL- 346
Query: 323 QQILGTTITSKEIEL 337
T+I S+ I L
Sbjct: 347 -----TSILSRRIPL 356
>gi|393245725|gb|EJD53235.1| DUF974-domain-containing protein [Auricularia delicata TFB-10046
SS5]
Length = 657
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/260 (31%), Positives = 128/260 (49%), Gaps = 21/260 (8%)
Query: 80 DSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTS 139
D +S +L+LP +FGAI LGETF S + INN + +V V +K E+QT ++LL
Sbjct: 48 DLTAISDVLMLPASFGAIQLGETFSSCLCINNDTDGDVHAVALKVEMQTATTKVLLAHLG 107
Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY-------SDGEGERKYLPQFFKFIV 192
+ + +V H++KELG H L CT Y ++ E + +++KF V
Sbjct: 108 GPDLTLTAEKNFVETVVHHEIKELGQHVLSCTITYRLPGAPPANDEDGLSTIRKYYKFAV 167
Query: 193 SNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
+NPLSV+TKV + + +E FLE ++N T L+ +Q++FE + W L
Sbjct: 168 TNPLSVKTKVHTPRAPSALLSRTEREKVFLEVHVQNLTAEPLWFEQMKFECADGW----L 223
Query: 249 KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSS-PVKVQGSNV--LGK 305
D ++ + IF + + Y+Y L S PV V LG+
Sbjct: 224 VDD---ANLTSHKTSIFSGAAALIQPQDLRQYVYVLTPTPESVPSFPVVHAPGTVISLGR 280
Query: 306 LQITWRTNLGEPGRLQTQQI 325
L I+WR++ G PGRL T +
Sbjct: 281 LDISWRSSFGGPGRLLTSML 300
>gi|53136444|emb|CAG32551.1| hypothetical protein RCJMB04_29c21 [Gallus gallus]
Length = 207
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 78/218 (35%), Positives = 113/218 (51%), Gaps = 32/218 (14%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL ++F+ L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDL--PGNLFNQ---------LMKDDPSTVKG--- 55
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 56 -----------AEALMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASTAAVAELKPDCCIDDGSPHEVKEIGTHILVCAVSYTTQAGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENH 225
FKF V PL V+TK + FLEA I+ +
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDE------VFLEAQIQKY 195
>gi|353240747|emb|CCA72601.1| hypothetical protein PIIN_06538 [Piriformospora indica DSM 11827]
Length = 650
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 99/336 (29%), Positives = 156/336 (46%), Gaps = 46/336 (13%)
Query: 7 THSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
+H LA +VMR+ RPSL + F D AS++ I + +
Sbjct: 5 SHLLALKVMRVSRPSL-------LGQWQPFAEASTHFDAHNASSIT-SIQPHIPNKQHVP 56
Query: 67 LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEI 126
T R D LS L LP +FG+I LGETF S + N + ++ V I+ E+
Sbjct: 57 TTIR---------DLSALSQNLSLPSSFGSISLGETFSSCFCVANMTNYDIEGVHIRVEM 107
Query: 127 QTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP- 185
Q+ + LLL+ P + G + +V+ ++KELG HTL C Y G R P
Sbjct: 108 QSASAKSLLLELG-GPEHRLGPLGTLEGVVQSEIKELGQHTLSCIVHYRVPPGLRPPAPS 166
Query: 186 ------------QFFKFIVSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSN 229
+ ++F VSNP SV+TKV K + +E FL+ ++N T+ +
Sbjct: 167 DDPSDPRAQLFRKHYRFPVSNPFSVKTKVHTPKSPSALMSRVEREKLFLQIDVQNLTQES 226
Query: 230 LYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQL--KML 287
++ +++EF+P W+ T D ++ + ++R+ F P + Y+Y L ++
Sbjct: 227 MWFERLEFKPVDGWTFT----DA--NESSIEARQAFTGPKTLVQPQDTFQYIYTLIPAVI 280
Query: 288 SHGSSSPVKVQGSNV-LGKLQITWRTNLGEPGRLQT 322
+P G+ + LG+L I WRT GEPGRL T
Sbjct: 281 PRFLINPAP--GAVIPLGRLDIAWRTTFGEPGRLLT 314
>gi|242220364|ref|XP_002475949.1| predicted protein [Postia placenta Mad-698-R]
gi|220724816|gb|EED78834.1| predicted protein [Postia placenta Mad-698-R]
Length = 705
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 81/253 (32%), Positives = 129/253 (50%), Gaps = 20/253 (7%)
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI 146
+L+LP +FGAI LGETF S IS+NN + ++V VV+ E+QT + +L P + +
Sbjct: 67 VLMLPSSFGAIQLGETFTSCISVNNEANMDVESVVLTVEMQTATTKAVLAQFG-GPEQRL 125
Query: 147 RAGGRYDFIVEHDVKEL-------GAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVR 199
G + IV H++KEL G H + + G + +F+KF V+NPLSV+
Sbjct: 126 ALGESLERIVSHEIKELVSYRLPPGDHATIPPVTDPNDPGLHVFR-KFYKFAVTNPLSVK 184
Query: 200 TKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSA--TMLKADGP 253
TKV V + + +E FLE I+N T+ ++++++ E + +W L DG
Sbjct: 185 TKVHVPRAPSALLSRPEREKVFLEIHIQNLTEDAMWLERMHLECADSWKVHDVNLADDG- 243
Query: 254 HSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV-LGKLQITWRT 312
+ IF + + + Y+Y L + + GS V LG+L I+WR+
Sbjct: 244 ---SEMEKEGIFSGSMALMQPQDMRQYVYVLSPVILTAFPVAHAPGSIVPLGRLDISWRS 300
Query: 313 NLGEPGRLQTQQI 325
+ GEPGRL T +
Sbjct: 301 SFGEPGRLLTSML 313
>gi|390342034|ref|XP_795991.3| PREDICTED: UPF0533 protein C5orf44 homolog [Strongylocentrotus
purpuratus]
Length = 230
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 110/209 (52%), Gaps = 11/209 (5%)
Query: 118 RDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG 177
+D+ +K ++QT QR+ L S P ++ G D ++ H+VKELG H LVC Y+
Sbjct: 28 QDIHVKTDLQTSSQRLTLSGGSTPPSPNLAPGACIDQVIHHEVKELGTHILVCAVSYTSP 87
Query: 178 EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEF 237
GE +F+KF V PL V+TK + + +LEA I+N T+S + M++V
Sbjct: 88 SGETLSFRKFYKFQVLKPLDVKTKFYNAE------SDEVYLEAQIQNITQSPMCMEKVAL 141
Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSH-GSSSPVK 296
EP+ ++ L + + A S+++ + YLY LK + G+ P
Sbjct: 142 EPTADYMVEELNS----TQTEATSKKLIFGDFTYLNPMDTRQYLYCLKAKTQAGADRPSL 197
Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
++G + +GKL I W+T LGE GRLQT Q+
Sbjct: 198 IKGVSSIGKLDIVWKTTLGEKGRLQTSQL 226
>gi|326436192|gb|EGD81762.1| hypothetical protein PTSG_02475 [Salpingoeca sp. ATCC 50818]
Length = 355
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 159/374 (42%), Gaps = 64/374 (17%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
M +TP H L RVM+L +P P+ D L + ++ + A N ++VT
Sbjct: 1 MDATPRAHPLTLRVMQLAKPGFARHDPVGYDEEGLALTRNV----LHAENPRHYAPANVT 56
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
L LP + G +YLGE+F ++I+I N V +V
Sbjct: 57 E-------------------------ALQLPSSQGKVYLGESFSAFINICNDGHDVVTNV 91
Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYS 175
+K E+QT QR TS + ES RA + H+++ LG H L+C Y+
Sbjct: 92 SLKVEMQTASQR----HTSLADPESCRASKLERTQTLQTTIRHEIRSLGTHALLCAVSYT 147
Query: 176 DGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQV 235
GER+ + F F V+ PL V T Q LE ++N ++ +
Sbjct: 148 LLNGERRTFRKSFNFEVNQPLDVIPH-------CTTIQNTIVLEVQVKNQMPHPIHFQSI 200
Query: 236 EFEPS-----QNWSATMLKADGPHSDYNA-QSREIFKPPVLIRSGGGIHNYLYQLKMLSH 289
+F P Q+ +AT+ + S ++ QS E P RS YLY+L +
Sbjct: 201 KFTPQSAFAVQDCNATLCQDGKTRSVFHGFQSVE----PKESRS------YLYKL---TP 247
Query: 290 GSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGID 349
+ + +GKL + WR+++GE G LQT Q+ ++EL+ PS V +
Sbjct: 248 AEGQYFEFRRRKAIGKLDVMWRSSMGEFGHLQTSQLERPVPPVHDLELHATNAPSAVTVG 307
Query: 350 KPFLLKLKLTNQTD 363
PF ++ + N D
Sbjct: 308 APFEVECDVINFRD 321
>gi|256073664|ref|XP_002573149.1| hypothetical protein [Schistosoma mansoni]
Length = 509
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 128/529 (24%), Positives = 207/529 (39%), Gaps = 162/529 (30%)
Query: 15 MRLCRPSLHVEPPLRVDPTDLFIGEDI------FDDPI------------AASNLPPLIS 56
MRL RP+ ++ R +PT+L++ +DI +D I +A N+PP +
Sbjct: 1 MRLNRPTFVIQ---RCEPTELYL-DDIAGSLTAYDASIRGDLDGISLNLLSAGNIPPSSN 56
Query: 57 SDVTTNKSSDLTYRSRFLLHDSADSI-----GLSGLLVLPQAFGAIYLGETFCSYISINN 111
S + N +L S+ D+ + I G S LL L +FG IYLGETF ++I+++N
Sbjct: 57 SHESPNYDHELNNDSK----DNYNYIQPKVGGYSELLSLTHSFGTIYLGETFSAHINLHN 112
Query: 112 SSTLEVRDVVIKAEIQTDKQRILLL----------------------------------- 136
S +V +K + + I L
Sbjct: 113 ESNQICYNVELKVALHNRIESITLPIFTSLNGQSNSTVVLRNSSTNSESSNTHTSPSLGS 172
Query: 137 ----DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY------------------ 174
+K V ++ G + I+ H++KELG H L CT Y
Sbjct: 173 NAGGTNTKDSVFDLQPGQSLNAIISHELKELGVHNLRCTVSYFQTSSHGKSESSSHVVAY 232
Query: 175 -----------SDGEGERK--YLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEAC 221
D +R+ + +KF+V+ PL VR K +V + + +E
Sbjct: 233 ESPRLTSGLSSRDTTSKREPITFQRLYKFMVNKPLDVRKKFSIVDIDNS-----VLMETQ 287
Query: 222 IENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYL 281
I+N T + + +++V FE + +S L ++ + F P + +L
Sbjct: 288 IQNLTVTPIILERVLFESNPQFSVIDL------NNLQFGKKSHFNTPTYYLQPNDVQQFL 341
Query: 282 YQLKMLSHGS------------------------SSPVKVQGSNVLGKLQITWRTNLGEP 317
Y+L + S SS Q S G+L ITWR+ +GE
Sbjct: 342 YRLIPTTTNSLPLLNSSSTNSSIPASAVPDPIPVSSTTTRQVSISAGRLDITWRSLMGER 401
Query: 318 GRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQND 377
GRLQT + T +I+L V+ +PS V ++PF LK +LTN + Q
Sbjct: 402 GRLQTSSLKYELPTFGDIQLRVLTIPSTVTTEQPFTLKFELTNCSKTRQ----------- 450
Query: 378 SDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFD 426
++ L P + F LNL+AT G+ I+G+ + D
Sbjct: 451 ------------KLGKLLPGQCI---PFELNLMATLPGLHMISGLCIHD 484
>gi|193617950|ref|XP_001949728.1| PREDICTED: UPF0533 protein C5orf44 homolog [Acyrthosiphon pisum]
Length = 404
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 113/448 (25%), Positives = 199/448 (44%), Gaps = 66/448 (14%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H + RVMRL +P + + D DL P AA N + DVTT
Sbjct: 12 HPIKLRVMRLGKPVMFNSKIVTCDSKDL---------PGAALNAH--LKKDVTT------ 54
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
L D A+++ L++P +YLGETF YI + N S+ V D+++KAEI
Sbjct: 55 -------LAD-AETLAAGSFLMVPNVLENLYLGETFLCYIYLKNESSQTVYDIILKAEID 106
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGA-HTLVCTALYSDGEGERKY-LP 185
T I +L + D IV+H+VKE G+ + L+C Y +RK+
Sbjct: 107 TATSHIPIL--GPKAFSKLDPYASIDVIVKHEVKEHGSVNKLICQVEY-----DRKHSFE 159
Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEIT---FLEACIENHTKSNLYMDQVEFEPSQN 242
F + V PL ++TK + +T +LE ++N + + +++ E S
Sbjct: 160 TIFSYRVPKPLDLKTKF---------YNTVTDEVYLEVQVQNIMSTPISLEKFILESSIG 210
Query: 243 WSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
+ + H +++ + IF + I Y+Y+L + +P + +N
Sbjct: 211 YDVNSMN----HLLESSEDKSIFG-DMDILDVKETRQYMYRLSLDKTAEKNPTR---TNN 262
Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT 362
LGKL I WR+N+G G++Q+ ++ +I ++ +P +V ++ F + N
Sbjct: 263 LGKLDILWRSNMGTKGQIQSSPLVRQIPELDDITFSITYLPDMVFCEEQFDFTCSIKNNR 322
Query: 363 DKEQGPFEIWLSQNDSDEEKV----VMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQR 418
++ ++ L SDEE MI+G+++ L P + + +++A G+Q
Sbjct: 323 NR-----DMQLVVEVSDEEDSNLAWTMISGIQLRLLPP---YATIKTVFSMVALNHGLQV 374
Query: 419 ITGITVFDKLEKITYDSLPDLEIFVDQD 446
I+GI + + + TY +FV Q+
Sbjct: 375 ISGIKLKELILNRTYSYNNFGHVFVTQN 402
>gi|358335977|dbj|GAA34217.2| UPF0533 protein C5orf44 homolog [Clonorchis sinensis]
Length = 539
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 131/553 (23%), Positives = 214/553 (38%), Gaps = 144/553 (26%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
M++ T L+ RVMRL RP + + +P +L++ D IA++ L ++D
Sbjct: 1 MTAPQDTDVLSLRVMRLNRPQFVRQ---QCEPAELYL------DDIASA----LTTADAG 47
Query: 61 TNKSSDLTYRSRFLLHDSADS---------------------------------IGLSG- 86
D R + D A + IG G
Sbjct: 48 VRADLDGVALHRLSISDCAQNDVTEGLTMEDQGDQEKAETDQIEEAQNHLVRVKIGGPGE 107
Query: 87 LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILL-----LDTSKS 141
LL LPQ+FG+ YLGETF ++++++N S +V +K + + + L L +
Sbjct: 108 LLGLPQSFGSTYLGETFSAHVNLHNESNQICYNVELKVSLHNRIEWVTLSTSGTLTGASL 167
Query: 142 PVES-----------------IRAGGRYDFIVEHDVKELGAHTLVCTALY---------- 174
P +S + G + I+ H++KELG HTL C A Y
Sbjct: 168 PAQSPSSPEMSNQRSCSGGVDLHPGQSLNAIIHHELKELGIHTLRCVASYCLSSAASTVG 227
Query: 175 ------------SDGEGERKYLPQF-----FKFIVSNPLSVRTKVRVVKVGATHFQEITF 217
+ G+ L F +KF VS PL V+ K V F
Sbjct: 228 QSALSPLTPKSPNQWTGDPSALESFTFQRLYKFPVSKPLDVKKKFSAVDSNG-----CVF 282
Query: 218 LEACIENHTKSNLYMDQVEFEPSQNWSATMLKA--DGPHSDYNAQSREIFKPPVLIRSGG 275
+EA ++N T +Y+++V FEPS N L DG S +
Sbjct: 283 MEAEVQNLTSVPIYLERVVFEPSPNMRVVDLNTIDDGKSSVPTCGDLRCLR-------AH 335
Query: 276 GIHNYLYQL-------------------------KMLSHGSSSPVKVQGSNV-LGKLQIT 309
I +LY+L + L GS + ++Q + G+L IT
Sbjct: 336 DIQQFLYKLIPDSGLLAKSPGQRMSVRSTQGQVRQPLPSGSVTASQLQQQPLSAGRLDIT 395
Query: 310 WRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPF 369
WR+ +GE GRLQT + +++L + +P+ V I++PF + L+LTN++ +
Sbjct: 396 WRSTMGERGRLQTSSLKYELPHLGDLQLKALNLPATVQIEQPFQITLELTNRSTQHMDLM 455
Query: 370 EIWLSQNDSDEEKVVMIN--------GLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 421
+ ++D GL L + S L L+AT G+Q I+G
Sbjct: 456 LDLRGKPETDNSDDCSFRSLPPLAWVGLTTCRLGMLPPGRSMPLSLGLMATVPGLQPISG 515
Query: 422 ITVFDKLEKITYD 434
+ + + + Y+
Sbjct: 516 VLIHENTTERDYE 528
>gi|324506540|gb|ADY42790.1| Unknown [Ascaris suum]
Length = 295
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 132/285 (46%), Gaps = 38/285 (13%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
M+ T L +VMRL RP L+ + +DP DP++ LI S V
Sbjct: 1 MAETSRDQLLVLKVMRLARPKLYDTVCIPIDP----------GDPMSE-----LIGSAV- 44
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
R +AD + L+ PQ F IYLGETF Y+ + N S+ ++
Sbjct: 45 ----------CRLTGQKAADE-PVGEYLMAPQIFDNIYLGETFTFYVCVQNDSSQCATEI 93
Query: 121 VIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGE 180
IK ++QT QR+ L + +++ G I+ H++KE+G H LVC Y E
Sbjct: 94 CIKTDLQTTNQRVALHSKLQDSNATLQPGQILGDIISHEIKEVGQHILVCAVTYKTPADE 153
Query: 181 RKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPS 240
+ Y +FFKF V+ P+ VRTK + + +LEA I+N + + + +++V EPS
Sbjct: 154 KMYFRKFFKFPVTKPIDVRTKFYNAE---DNMNNDVYLEAQIQNTSATPMILEKVVLEPS 210
Query: 241 QNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK 285
+++T + P N S++ F + I YLY L+
Sbjct: 211 DFYTSTEIP---PPLLLNENSKKQF-----YLNPKDIRQYLYCLR 247
>gi|443925337|gb|ELU44194.1| hypothetical protein AG1IA_01781 [Rhizoctonia solani AG-1 IA]
Length = 616
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 97/332 (29%), Positives = 148/332 (44%), Gaps = 54/332 (16%)
Query: 8 HSLAFRVMRLCRPSLHVEP-PLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
H LA +VMR+ RPSL P P D T L ++S
Sbjct: 4 HLLALKVMRVSRPSLSAHPLPFFSDSTAL-----------------------AAHARASP 40
Query: 67 LTYRSRFL--LHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
L+ S+ L + + + S +L+LP+AFG+I LGETF S + INN S V +
Sbjct: 41 LSLESQPLDGIPSTLRDLAQSQVLLLPEAFGSISLGETFTSALCINNESAHTVLGSHLLV 100
Query: 125 EIQTDKQRILLLDTSKSPVES-IRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERK- 182
EIQT + +L ++S + G + +V H++KELG H LVCT Y R
Sbjct: 101 EIQTASTKTVLGQVGG--IDSRLEPGQMFSLVVSHEMKELGQHVLVCTVGYHVPPALRNN 158
Query: 183 -YLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
P+ I +P ++ + KV FLE ++N T LY ++++FE ++
Sbjct: 159 SIPPEDPIHIPRSPSALLNRNERNKV---------FLEVHVQNLTTKPLYFEKIQFECAE 209
Query: 242 NW-----SATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSS-PV 295
W + + G SD +++ E P R YLY L + S P+
Sbjct: 210 GWVLADANPKSVSNSGSESDSGSKTNETSLRPQDTR------QYLYILVATPAATPSFPI 263
Query: 296 KVQGSNV--LGKLQITWRTNLGEPGRLQTQQI 325
+ LG+L ++WR++ GEPGRL T +
Sbjct: 264 PYPPGTIIALGRLDMSWRSSFGEPGRLLTSML 295
>gi|212645333|ref|NP_001129809.1| Protein C56C10.7, isoform c [Caenorhabditis elegans]
gi|351060510|emb|CCD68186.1| Protein C56C10.7, isoform c [Caenorhabditis elegans]
Length = 243
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 133/267 (49%), Gaps = 32/267 (11%)
Query: 183 YLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQN 242
Y +FFKF VS P+ V+TK + A Q++ +LEA IEN + +N+++++VE +PSQ+
Sbjct: 2 YFRKFFKFPVSKPIDVKTKFYSAEDNAN--QDV-YLEAQIENTSNANMFLEKVELDPSQH 58
Query: 243 WSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS-- 300
++ T + H D ++ KP I +L+ L +P V +
Sbjct: 59 YNVTSIA----HEDEFGDVGKLLKP-------KDIRQFLFCL--------TPADVHNTLG 99
Query: 301 ----NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 356
+GKL ++WRT++GE GRLQT + ++ L+V + P+ V + KPF +
Sbjct: 100 YKDLTSIGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVDVQKPFEVSC 159
Query: 357 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
+L N +++ ++ L Q + +G+ + L P + DF LN+ +G+
Sbjct: 160 RLYNCSERALD-LQLRLEQPSNRHLVFCSPSGVSLGQLPPSQ---HVDFSLNVFPVTVGI 215
Query: 417 QRITGITVFDKLEKITYDSLPDLEIFV 443
Q I+GI + D K Y+ +IFV
Sbjct: 216 QSISGIRITDTFTKRIYEHDDIAQIFV 242
>gi|290982829|ref|XP_002674132.1| hypothetical protein NAEGRDRAFT_80726 [Naegleria gruberi]
gi|284087720|gb|EFC41388.1| hypothetical protein NAEGRDRAFT_80726 [Naegleria gruberi]
Length = 483
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 122/477 (25%), Positives = 215/477 (45%), Gaps = 71/477 (14%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIF-DDPIAASNLPPLISSDV 59
M TP H ++ ++MRL +P + P+ + TD +F P S++ + +++
Sbjct: 16 MVETP--HPISIKLMRLKKPDFSLTVPILPEKTDALGDYKLFYKTPNYVSDVKSIYGNEM 73
Query: 60 TTNKSSDLTYRSRFL-----LHDSA----------DSIGLSGLLVLPQAFGAIYLGETFC 104
S + L L D+ DS+G + LP A GAIY+GE
Sbjct: 74 PLRASQQQQQKEDTLIEIPGLEDNGKSLLDRCIIFDSLGYNDGWCLPSAPGAIYVGEHLK 133
Query: 105 SYISINNSSTLEVRDVVIKAEIQTDKQRIL---LLDTSKSPVESIRAGGRYDFIVEH--- 158
YIS++N S ++++ + AE+ T K + LLD S +P++ + + DFI+EH
Sbjct: 134 CYISLHNESYKVIQNISVTAELVTGKGKTTKQTLLDISSTPLDQLGSKTNKDFIIEHPLT 193
Query: 159 ---DVKELGAHT-LVCTALYSDGE-GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQ 213
D+++ T L C Y D E G + + F F V +PL ++ KV F
Sbjct: 194 SSDDIQDDEDKTVLTCLVSYYDPEEGRVRSFRKHFPFKVYDPLGMKVKVNT-------FG 246
Query: 214 EITFLEACIENHTKS-NLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIR 272
F++ ++N T++ +LY++ V+FEP N+ ++ S +N S F+ P+L
Sbjct: 247 NHVFVQLDLQNLTQTPSLYIESVKFEP--NFGYELMD----QSVHNT-SENYFEHPLL-- 297
Query: 273 SGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITS 332
G +L++L S + V Q S LGK+ + W+ +GE G L T I I
Sbjct: 298 -RGESKRFLFELVPNSKNRAMNV-TQNSVFLGKISLQWKNTMGECGMLLTNPIPHKLIPK 355
Query: 333 KEIELNVV----EVP---SVVGIDK------------PFLLKLKLTNQTDKEQGPFEIWL 373
+++E +++ +P +++G + PF ++TN + K+ I L
Sbjct: 356 QDLEASIIGFTSSIPDEFTILGSNNNNNTQESFTLYTPFYAVCEITNYS-KDVMDLSIHL 414
Query: 374 SQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEK 430
DSD+ + ING + A+ ++ S + L + G + G + K +K
Sbjct: 415 ---DSDKMYPLAINGSSLQAVGELQPLKSRHVFIPLFPLQRGAHLVAGKGILVKDKK 468
>gi|443896779|dbj|GAC74122.1| uncharacterized conserved protein [Pseudozyma antarctica T-34]
Length = 615
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 107/392 (27%), Positives = 177/392 (45%), Gaps = 90/392 (22%)
Query: 8 HSLAFRVMRLCRPSLHV-EPPLRVDPTDLF------IGEDIFDDPIAASNLPPLISSDVT 60
H L+ +VMR PSL V E P D + +GE I +S D+
Sbjct: 37 HLLSLKVMRASAPSLAVSEKPYFDDASSTSSSLLAAVGEGIDAG----------LSHDLL 86
Query: 61 TNK---SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEV 117
+N+ SS T + + +A++ +S +LVLP +FG ++LGETF +Y+ + N S V
Sbjct: 87 SNRWEGSSSTTTAAAY--RSAAENFPISSVLVLPNSFGTLFLGETFRTYVCVRNESGAAV 144
Query: 118 RDVVIKAEIQTDKQ----------------RILL------------LDTSKSPVESIRAG 149
R+ ++ E+Q I++ D+ PV + AG
Sbjct: 145 REPSLRVEMQVGASDASQPHAESGRWHQLAHIIMPSPSRYTPDPADTDSQGRPVWELAAG 204
Query: 150 GRYDFIVEHDVKELGAHTLVCTALYS------DGEG---ERKYLPQFFKFIVS-NPLSVR 199
+ + +D+K+LG H LVCT Y DG+ ER + +FFKF V +P+SVR
Sbjct: 205 RALETSLGYDIKDLGPHVLVCTVGYKARVVMHDGQEAWIERSFR-KFFKFAVERSPISVR 263
Query: 200 TKVRVVKVGATHF------QEITFLEACIEN--HTKSNLYMDQVEFEPSQNWSATMLKAD 251
TKV + + +E LE ++N S+L +D+++ + + W+ + + D
Sbjct: 264 TKVHQPREACAVYHPDPAVRERVHLEVQVQNVASNGSSLVLDRLDLKTAPGWTWSSI--D 321
Query: 252 GPHSDYNAQSREIF-----KPPVLIRSGGGIHNYLYQL-------------KMLSHGSSS 293
P + + +++ K +L+ + G + YL+ L + GS+
Sbjct: 322 RPSLSCDDKDGDMWMRVGGKSKMLL-ADGDVRQYLFALVPSEEVAFWEARESGMDMGSTQ 380
Query: 294 PVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
+ LG L I+WR +LGEPGRLQT Q+
Sbjct: 381 EGWAIRGDALGHLDISWRMSLGEPGRLQTSQL 412
>gi|353233427|emb|CCD80782.1| hypothetical protein Smp_016810 [Schistosoma mansoni]
Length = 567
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 115/463 (24%), Positives = 185/463 (39%), Gaps = 136/463 (29%)
Query: 15 MRLCRPSLHVEPPLRVDPTDLFIGEDI------FDDPI------------AASNLPPLIS 56
MRL RP+ ++ R +PT+L++ +DI +D I +A N+PP +
Sbjct: 1 MRLNRPTFVIQ---RCEPTELYL-DDIAGSLTAYDASIRGDLDGISLNLLSAGNIPPSSN 56
Query: 57 SDVTTNKSSDLTYRSRFLLHDSADSI-----GLSGLLVLPQAFGAIYLGETFCSYISINN 111
S + N +L S+ D+ + I G S LL L +FG IYLGETF ++I+++N
Sbjct: 57 SHESPNYDHELNNDSK----DNYNYIQPKVGGYSELLSLTHSFGTIYLGETFSAHINLHN 112
Query: 112 SSTLEVRDVVIKAEIQTDKQRILLL----------------------------------- 136
S +V +K + + I L
Sbjct: 113 ESNQICYNVELKVALHNRIESITLPIFTSLNGQSNSTVVLRNSSTNSESSNTHTSPSLGS 172
Query: 137 ----DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY------------------ 174
+K V ++ G + I+ H++KELG H L CT Y
Sbjct: 173 NAGGTNTKDSVFDLQPGQSLNAIISHELKELGVHNLRCTVSYFQTSSHGKSESSSHVVAY 232
Query: 175 -----------SDGEGERK--YLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEAC 221
D +R+ + +KF+V+ PL VR K +V + + +E
Sbjct: 233 ESPRLTSGLSSRDTTSKREPITFQRLYKFMVNKPLDVRKKFSIVDIDNS-----VLMETQ 287
Query: 222 IENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYL 281
I+N T + + +++V FE + +S L ++ + F P + +L
Sbjct: 288 IQNLTVTPIILERVLFESNPQFSVIDL------NNLQFGKKSHFNTPTYYLQPNDVQQFL 341
Query: 282 YQLKMLSHGS------------------------SSPVKVQGSNVLGKLQITWRTNLGEP 317
Y+L + S SS Q S G+L ITWR+ +GE
Sbjct: 342 YRLIPTTTNSLPLLNSSSTNSSIPASAVPDPIPVSSTTTRQVSISAGRLDITWRSLMGER 401
Query: 318 GRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTN 360
GRLQT + T +I+L V+ +PS V ++PF LK +LTN
Sbjct: 402 GRLQTSSLKYELPTFGDIQLRVLTIPSTVTTEQPFTLKFELTN 444
>gi|388855808|emb|CCF50592.1| uncharacterized protein [Ustilago hordei]
Length = 809
Score = 108 bits (269), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 109/378 (28%), Positives = 172/378 (45%), Gaps = 72/378 (19%)
Query: 2 SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLP---PLISS- 57
S G H L+ +VMR PSL V + P S+LP PLI++
Sbjct: 40 SQNAGPHLLSLKVMRASAPSLAVS-----------------EKPYYDSHLPSSSPLIAAV 82
Query: 58 --DVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSS-T 114
++ + SSD S + + +S LL LP +FG +YLGETF +Y+ + N S T
Sbjct: 83 GKGISESLSSDPL--SNHYPDAPSSNFPISNLLTLPSSFGTLYLGETFRTYLCVRNESPT 140
Query: 115 LEVRDVVIKAEIQTDKQR----------ILL---LDTSKS--PVESIRAGGRYDFIVEHD 159
VR+ ++AE+Q I+L TSKS PV + + + +D
Sbjct: 141 SPVREPSLRAEMQVGSSETEGRWHQLAHIILPSPTSTSKSGEPVWELPPSAPLETSLGYD 200
Query: 160 VKELGAHTLVCT----ALYSDGEGERKYLPQFFKFIV-SNPLSVRTKVRVVKVGATHF-- 212
+K+LG H LVCT AL ++G + +F+KF V +P+SVRTKV + A+ +
Sbjct: 201 IKDLGPHVLVCTVGYKALSAEGGWVERSFRKFYKFSVDRSPISVRTKVHQPRNVASLYHA 260
Query: 213 ----QEITFLEACIENHTKSNLYM--DQVEFEPSQNWS-----ATMLKADGPHSDYNAQS 261
++ LE ++N + + + + + + P+ W L + + ++
Sbjct: 261 DEGVRKRVELEVQVQNASANGMRLVFEGLSLRPADGWRWDSVDRPSLTPNSTKGESVEEA 320
Query: 262 REIFKPPV----LIRSGGGIHNYLYQLK-----MLSHGSSSPVKVQG----SNVLGKLQI 308
R+++ P + G I YL+ L L G V+G + LG L I
Sbjct: 321 RDMWLKPNNGGHEALADGDIRQYLFTLHPKPGVKLGGGVDLGKSVEGYLIRGDALGNLDI 380
Query: 309 TWRTNLGEPGRLQTQQIL 326
WR +LGEPGRLQT Q++
Sbjct: 381 GWRMSLGEPGRLQTSQLV 398
>gi|71019495|ref|XP_759978.1| hypothetical protein UM03831.1 [Ustilago maydis 521]
gi|46099484|gb|EAK84717.1| hypothetical protein UM03831.1 [Ustilago maydis 521]
Length = 833
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/389 (26%), Positives = 167/389 (42%), Gaps = 73/389 (18%)
Query: 6 GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
G H ++ +VMR PSL V D + D+ I A ++ + S
Sbjct: 40 GPHLVSLKVMRTSAPSLAVSEKPYCDRHSTY-----HDELITA------VAQGIDDAASH 88
Query: 66 DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAE 125
DL AD +S LLVLP +FG +YLGETF +Y+ + N S+ VR+ ++ E
Sbjct: 89 DLLSNRWDTSPSPADQFPISELLVLPNSFGTLYLGETFRTYLCVRNESSTAVREPSLRVE 148
Query: 126 IQTDKQR---------------ILLLDTSKS---------PVESIRAGGRYDFIVEHDVK 161
+Q IL T S PV +R + + +D+K
Sbjct: 149 MQVGASDPHTQEGGRWVQLAHVILPTPTRYSPEPDQDKGRPVWELRTAQALETSLAYDIK 208
Query: 162 ELGAHTLVCTALY-----SDGE---GERKYLPQFFKFIVS-NPLSVRTKVRVVKVGATHF 212
+LG H LVCT Y DG+ ER + +F+KF V +P+SVRTKV + ++ F
Sbjct: 209 DLGPHVLVCTVGYKSPLQQDGDVAWVERSFR-KFYKFSVDRSPISVRTKVHQPRHASSLF 267
Query: 213 QEITFLEACIE-----NHTKSN---LYMDQVEFEPSQNWSATMLKADGPH---SDYNAQS 261
+ +E +T N L ++++ +P+ W + D P +D +
Sbjct: 268 HPDAAVRKRVELEVQVQNTAGNGAALVLNELTLKPAPGWK--WVSVDRPSLNDADRGDED 325
Query: 262 REIFKPPVLIRSGGGIHNYLYQL-----------KMLSHGSSSPVKVQG----SNVLGKL 306
I + + + G + YL+ L +++ G V +G + LG L
Sbjct: 326 MWILRGTDQVLADGDVRQYLFVLTPENKDQTLAEEVMQGGIDLGVTKEGLALRGDALGHL 385
Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEI 335
I+WR LGE GRLQT Q++ + ++ +
Sbjct: 386 DISWRMALGEAGRLQTSQLVRRRVVTQPV 414
>gi|343424905|emb|CBQ68443.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 759
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 101/391 (25%), Positives = 166/391 (42%), Gaps = 77/391 (19%)
Query: 6 GTHSLAFRVMRLCRPSLHV-EPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
G H L+ +VMR P L V E P + +P +A L + + +
Sbjct: 42 GPHLLSLKVMRASAPLLAVSEKPYY----------EHHAEPTSADTLLSAVGQGIEQGLA 91
Query: 65 SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA 124
DL SA + +S LLVLP +FG +YLGETF +Y+ + N + VR+ ++
Sbjct: 92 HDLLSNRWDGAGGSASNFPVSDLLVLPSSFGTLYLGETFRTYLCVRNEAATAVREPSLRV 151
Query: 125 EIQTDKQRILLLDTSK-------------------------SPVESIRAGGRYDFIVEHD 159
E+Q + D + PV + G + + +D
Sbjct: 152 EMQVGASDVQQSDAGRWHQLAHVILPTPTRLSPDPDGGEEGRPVWELAPGQPLETALGYD 211
Query: 160 VKELGAHTLVCTALYSDG--EG------ERKYLPQFFKFIVS-NPLSVRTKVRVVKVGAT 210
+K+LGAH LVCT Y +G ER + +++KF V +P+SVRTKV + ++
Sbjct: 212 IKDLGAHVLVCTVGYKAAVQQGSEVAWVERSFR-KYYKFSVERSPISVRTKVHQPRHASS 270
Query: 211 ------HFQEITFLEACIEN--HTKSNLYMDQVEFEPSQNWSATMLKADGPH----SDYN 258
++ LE ++N S L + + +P+ W D P + +
Sbjct: 271 LHHPDAKVRQRVELEVQVQNVAGNGSALVFEGLALKPAPGWG--WASVDRPSLNGGGEED 328
Query: 259 AQSREIFKPPVLIRSGGGIHNYLYQL-----KMLSH---------GSSSPVKVQGSNVLG 304
+R++ + + G + YL+ L L+H G+S+ + LG
Sbjct: 329 MWARKVG---TEVLADGDVRQYLFTLTPSTAATLAHETLKAGLDLGTSADGHAIRGDALG 385
Query: 305 KLQITWRTNLGEPGRLQTQQILGTTITSKEI 335
L I+WR +LGEPGRLQT Q++ + + I
Sbjct: 386 HLDISWRMSLGEPGRLQTSQLVRRRVVTPPI 416
>gi|325189573|emb|CCA24059.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 450
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/372 (25%), Positives = 163/372 (43%), Gaps = 40/372 (10%)
Query: 84 LSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKS-- 141
+S +L LP +FG I+LG TF SYIS+ N ++ +V + A IQ R+ L D +S
Sbjct: 65 ISNMLCLPDSFGQIFLGNTFSSYISVINPYNCDIEEVGLTANIQCGNDRVELQDNRQSRT 124
Query: 142 -------PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG-EGERKYLPQFFKFIVS 193
P + A D +V+ + ++G H L Y D E K L +F++F V
Sbjct: 125 GKLPPPNPTPVLSANSSLDMVVDFPLSQVGNHVLRVGVSYLDPITKESKSLRKFYRFGVQ 184
Query: 194 NPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK---- 249
NPL + K QEI +EA I N + L++D + FE + +++ K
Sbjct: 185 NPLILN-----FKQSRAPSQEI-LIEAQIRNVSSLPLFIDSIRFEATSSFTLMTTKRSSE 238
Query: 250 ---ADGPH-----SDY-------NAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSP 294
AD SDY + + P L++ + +++L
Sbjct: 239 SSPADCTQPQPEDSDYTIDTIWPSLKQHLARGSPTLLQPQEELQR-MFRLFEYERKKIVD 297
Query: 295 VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLL 354
Q S LG+L + W+T++GE G +Q+Q I+ T +++ + + P + ++K F++
Sbjct: 298 PGFQSSQTLGRLHVGWKTSVGEAGSVQSQPIVRKYDTMRDVSIRLHSFPERLVVEKVFVV 357
Query: 355 KLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKL 414
+ + N + + F+I L + +V L + + + S L L+ +
Sbjct: 358 ECTIENHSTRN---FDIQLQFRKESLDGIVCY-CLTHQHVGSLVSEASITLPLKLLPLEC 413
Query: 415 GVQRITGITVFD 426
G+Q I I D
Sbjct: 414 GLQEIRDIVCVD 425
>gi|296410908|ref|XP_002835177.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295627952|emb|CAZ79298.1| unnamed protein product [Tuber melanosporum]
Length = 319
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 98/346 (28%), Positives = 147/346 (42%), Gaps = 72/346 (20%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + +LP S N+S +L
Sbjct: 14 HSISLKVLRLSRPSLSEQ-----------------------HSLPKATPS----NQSPEL 46
Query: 68 TYRSR----FLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
SR + H + D LS LL LP AFG Y+GETF +S NN +T V I
Sbjct: 47 DELSRQSHAYPSHSTDDPFILSPLLTLPPAFGNAYIGETFSCCLSANNETTSITTSVRIS 106
Query: 124 AEIQT-----------DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTA 172
AE+QT D+++ LD PV S++ IV++D+KE G H L T
Sbjct: 107 AEMQTPSLTLNLELGGDERQTADLD----PVMSLQK------IVKYDLKEEGNHILAVTV 156
Query: 173 LYSD-------GEGER------KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLE 219
Y++ GEGE+ + + ++FI L+VRTK+ + G LE
Sbjct: 157 TYTEAPKRVDYGEGEKGAPGRVRTFRKLYQFIAQQCLTVRTKIGSLSGGR------AILE 210
Query: 220 ACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHN 279
A +EN + ++ V ++ W+AT L G + Q P + R +
Sbjct: 211 AQLENMGDGPISLEMVHMGTTKGWTATSLNWQGSTGRGDGQRNPKDTPMLGSRDVMQVAF 270
Query: 280 YLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
LY + V +LG+L I WR+ G+ G L T ++
Sbjct: 271 LLYPEETEEGWEED-VAANDKKILGQLSIEWRSACGDRGYLSTGRL 315
>gi|167517297|ref|XP_001742989.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778088|gb|EDQ91703.1| predicted protein [Monosiga brevicollis MX1]
Length = 415
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 81/328 (24%), Positives = 151/328 (46%), Gaps = 34/328 (10%)
Query: 67 LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEI 126
+T +++ L S ++ G+S +L LP A G +YLG+T IS++N + V +V K E+
Sbjct: 20 ITQQNQADLRSSYENFGVSEVLKLPAAVGNVYLGQTLSCLISVHNEGSESVSSIVTKVEL 79
Query: 127 QTDKQRILLLDT--------SKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGE 178
QT +R L T P+ + G D IVE+ +++ H +VC Y+ +
Sbjct: 80 QTGSKRTSLKPTLTGERKGQEVGPIGKLAPGQAIDQIVEYQLQDPAVHIMVCILAYTSQD 139
Query: 179 GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE 238
G+RK L + FKF V+ PL + + +K + ++ ++N K L ++ V
Sbjct: 140 GDRKQLRKHFKFEVTQPLEIVPLCKTLK-------DDVMVQVNVQNIAKEPLILEYVRMT 192
Query: 239 PSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQ 298
P++ + T + D P S Q + K N ++ LK + +
Sbjct: 193 PTKVY--TCEETDEPPSP--DQQLPVSK----------TRNRIFVLK--PQPTVDARTFK 236
Query: 299 GSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKL 358
S +G++ ++WR G G I T ++ L+V++ P V + L++++
Sbjct: 237 QSAKVGQVMVSWRAMRGGRGYTSIATIQRRVPTLNDVHLDVLDPPDSVQVGTLCTLRVRI 296
Query: 359 TNQTDKEQGPFEIWLSQNDSDEEKVVMI 386
N TD++ + + LS N ++V++
Sbjct: 297 INFTDRQ---YTLGLSYNPEQVTELVVM 321
>gi|302916379|ref|XP_003052000.1| hypothetical protein NECHADRAFT_37787 [Nectria haematococca mpVI
77-13-4]
gi|256732939|gb|EEU46287.1| hypothetical protein NECHADRAFT_37787 [Nectria haematococca mpVI
77-13-4]
Length = 822
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 100/343 (29%), Positives = 153/343 (44%), Gaps = 57/343 (16%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + P +DP IG I P AS L
Sbjct: 517 HSISLKVLRLSRPSLVTQYP--IDPPS-SIGATIKPAPAPAS-----------------L 556
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEV----RDVVIK 123
YRS + S LS ++ LP +FG+ Y+GETF + NN +V RDV I
Sbjct: 557 AYRSETTSNPSP--FLLSPIVNLPVSFGSAYVGETFSCTLCANNDLLPDVPKNIRDVRID 614
Query: 124 AEIQTDK----QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SD 176
AE++T QR+ L + P + +GG +V D+KE G H L T Y ++
Sbjct: 615 AEMKTPGLGAVQRLELGPPTDKPEADLDSGGTLQRVVSFDLKEEGNHVLAVTVSYYEATE 674
Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITF-LEACIENHTKSNLYMDQV 235
G + + ++FI L VRTKV +K A Q + LEA +EN ++ + +++V
Sbjct: 675 TSGRTRTFRKLYQFICKASLIVRTKVGPLKAAAGDGQPRRWALEAQLENCSEDVVQLEKV 734
Query: 236 --EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSS 293
+ EP + +A G ++ + P G + + ++ S G+ +
Sbjct: 735 VLDTEPGLRYRDCNWEASG-------STKPVLHP-------GEVEQVCFVVED-SSGTGT 779
Query: 294 P-----VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTIT 331
P V G + G L I WR +G G L T + LGT +
Sbjct: 780 PGGDVEVTPDGRIIFGSLGIGWRGEMGNRGFLSTGK-LGTRVA 821
>gi|428162256|gb|EKX31425.1| hypothetical protein GUITHDRAFT_149310, partial [Guillardia theta
CCMP2712]
Length = 211
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 89/183 (48%), Gaps = 27/183 (14%)
Query: 9 SLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLT 68
+LAF+VMRL RPS H + F + A +SD + L
Sbjct: 54 ALAFKVMRLNRPSFH---------------QAGFTAGLQALRE---TASDQAEQATGHLP 95
Query: 69 YRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQT 128
H A+ LL LP FG IYLGETF +YIS N+S + + I+AEIQT
Sbjct: 96 -------HSDAEGCPSENLL-LPTGFGNIYLGETFTAYISACNTSGSRLMRLEIRAEIQT 147
Query: 129 DKQRILLLDTSKSPV-ESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
+R+ LLD V + + D+IV H++KE G H ++C+ Y D GE K + Q+
Sbjct: 148 GTKRVPLLDGKPETVLAQFESNQQVDYIVSHELKEAGVHIMICSGSYLDASGEEKKVRQY 207
Query: 188 FKF 190
FKF
Sbjct: 208 FKF 210
>gi|451846695|gb|EMD60004.1| hypothetical protein COCSADRAFT_100123 [Cochliobolus sativus
ND90Pr]
Length = 319
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 94/341 (27%), Positives = 152/341 (44%), Gaps = 71/341 (20%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RP L + PL P S D+ + + L
Sbjct: 17 HSVSLKVLRLSRPMLATQHPL---PN----------------------SKDLGISPQASL 51
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
Y S+ ++ D+ LS +L LP+AFG+ Y+GETF + NN ST + V I
Sbjct: 52 AYPSQ---RNTNDAFILSPVLNLPEAFGSAYVGETFSCTLCANNELDPLDSTKAISGVRI 108
Query: 123 KAEIQTDKQRI-LLLDTSKSPVESIRA----GGRYDFIVEHDVKELGAHTLVCTALYSD- 176
+ ++QT LD + +P E + G I+ ++KE G H L T Y++
Sbjct: 109 QGDMQTPSNPTGSPLDLTGTPDEDVNTSPGPGESLQRILRFELKEEGNHVLAVTVTYTET 168
Query: 177 --GEGER-----KYLPQFFKFIVSNPLSVRTKVRVV--KVGATHFQEITFLEACIENHTK 227
GEG+ + + ++F+ LSVRTK + K G+ + LEA +EN +
Sbjct: 169 ALGEGKAASGRVRTFRKLYQFVAQQLLSVRTKAGEMSPKNGSRRY----LLEAQLENMGE 224
Query: 228 SNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKML 287
+ + ++ V+ P +T L D S +NA P+L + + +L
Sbjct: 225 AAVCLEAVDVNPKLPLKSTSLNWDMQASGFNA--------PML-----SPRDVVQVAFLL 271
Query: 288 SHGSSSPVKVQGSN------VLGKLQITWRTNLGEPGRLQT 322
++ +V+GS VLG+L I WR+ LG+ G L T
Sbjct: 272 TYKPGEDEEVEGSKTEDDKRVLGQLAIQWRSALGDRGSLST 312
>gi|312378535|gb|EFR25084.1| hypothetical protein AND_09887 [Anopheles darlingi]
Length = 275
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 68/261 (26%), Positives = 127/261 (48%), Gaps = 18/261 (6%)
Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
+FFKF V PL V+TK + + +LEA I+N T + +++VE E S+ ++
Sbjct: 12 KFFKFQVVKPLDVKTKFYNAET------DDVYLEAQIQNITVGPICLEKVELESSEQYTV 65
Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
T L A +F +++ +LY ++ + + P ++ +N +GK
Sbjct: 66 TSLNT-------LATGESVFSSKTMLQPQNSCQ-FLYCIRPIPEIARDPNALKAANNIGK 117
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 365
L I WR+NLGE GRLQT Q+ + ++ L V++ S V I + F + ++TN +++
Sbjct: 118 LDIVWRSNLGERGRLQTSQLQRCPLEYSDLRLLVIDAKSTVRIGEGFSFRCRVTNTSERS 177
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
++ + N + + G+ AL +E +F L + +LG+ I+ + +
Sbjct: 178 ---MDLLMGLN-TKAKPGCGYTGVTEFALGALEPGQMKEFPLTVCPVRLGLIVISNLQLT 233
Query: 426 DKLEKITYDSLPDLEIFVDQD 446
D K Y+ L++FV ++
Sbjct: 234 DLFTKRKYEFDNFLQVFVVEE 254
>gi|452005201|gb|EMD97657.1| hypothetical protein COCHEDRAFT_1125394 [Cochliobolus
heterostrophus C5]
Length = 319
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/341 (27%), Positives = 150/341 (43%), Gaps = 71/341 (20%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RP L + PL P S D+ + + L
Sbjct: 17 HSVSLKVLRLSRPMLATQHPL---PN----------------------SKDLGISPQASL 51
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
Y S+ ++ D+ LS +L LP+AFG+ Y+GETF + NN ST + V I
Sbjct: 52 AYPSQ---RNTNDAFILSPVLNLPEAFGSAYVGETFSCTLCANNELDPSDSTKTISGVRI 108
Query: 123 KAEIQTDKQRI-LLLDTSKSPVESIRA----GGRYDFIVEHDVKELGAHTLVCTALYSD- 176
+ ++QT LD + +P E + G I+ ++KE G H L T Y++
Sbjct: 109 QGDMQTPSNPTGSPLDLTGTPNEEVNTSPGPGESLQRILRFELKEEGNHVLAVTVTYTET 168
Query: 177 --GEGER-----KYLPQFFKFIVSNPLSVRTKVRVV--KVGATHFQEITFLEACIENHTK 227
GEG+ + + ++F+ LSVRTK + K G + LEA +EN +
Sbjct: 169 ALGEGKAASGKVRTFRKLYQFVAQQLLSVRTKAGEMSPKNGLRRY----LLEAQLENMGE 224
Query: 228 SNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKML 287
+ + ++ V+ P +T L D S NA P+L + + +L
Sbjct: 225 AAVCLEAVDVSPKPPLKSTSLNWDMQASGLNA--------PML-----SPRDVVQVAFLL 271
Query: 288 SHGSSSPVKVQGSN------VLGKLQITWRTNLGEPGRLQT 322
++ +V+GS VLG+L I WR+ LG+ G L T
Sbjct: 272 TYKPGEDEEVEGSKTEDDKRVLGQLAIQWRSALGDRGSLST 312
>gi|408399762|gb|EKJ78855.1| hypothetical protein FPSE_00998 [Fusarium pseudograminearum CS3096]
Length = 317
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 91/339 (26%), Positives = 149/339 (43%), Gaps = 50/339 (14%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + P+ P+ +G + PI AS S VT+N + L
Sbjct: 16 HSISLKVLRLSRPSLVTQYPID-SPSS--VGASLKPAPIPASLA---YHSQVTSNPTPFL 69
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN----SSTLEVRDVVIK 123
LS ++ LP +FG+ Y+GETF + NN + +RDV I+
Sbjct: 70 ----------------LSPVVNLPVSFGSAYVGETFSCTLCANNDLPPDAVKNIRDVRIE 113
Query: 124 AEIQTDK----QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SD 176
AE++T QR+ L + +++G +V D+KE G H L T Y ++
Sbjct: 114 AEMKTPGMGAVQRLELGPPNGQSEADLQSGDTMQRVVSFDLKEEGNHVLAVTVSYYEATE 173
Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQV- 235
G + + ++FI L VRTKV +K T LEA +EN ++ + +++V
Sbjct: 174 TSGRTRTFRKLYQFICKASLIVRTKVGSLKAEDTQGHGRWVLEAQLENCSEDVVQLEKVV 233
Query: 236 -EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSP 294
+ EP + +A G ++ + P G + + + +
Sbjct: 234 LDTEPGLRYRDCNWEASG-------SAKPMLHP-------GEVEQVCFVVAEDGAETGVE 279
Query: 295 VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSK 333
V G + G L I WR +G G L T + LGT ++
Sbjct: 280 VTPDGRIIFGSLGIGWRGEMGNRGFLATGK-LGTRRAAR 317
>gi|149059253|gb|EDM10260.1| similar to RIKEN cDNA 2410002O22 gene, isoform CRA_c [Rattus
norvegicus]
Length = 143
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 85/159 (53%), Gaps = 26/159 (16%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPSTV----- 53
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 54 ---------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAH 166
T QR L L S + V ++ D ++ H+VKE+G H
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTH 142
>gi|46123811|ref|XP_386459.1| hypothetical protein FG06283.1 [Gibberella zeae PH-1]
Length = 828
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 91/339 (26%), Positives = 148/339 (43%), Gaps = 50/339 (14%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + P+ + +G I PI AS S V +N +
Sbjct: 527 HSISLKVLRLSRPSLVTQYPIDSPSS---VGASIKSAPIPASLA---YHSQVASNPTP-- 578
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN----SSTLEVRDVVIK 123
FLL S ++ LP +FG+ Y+GETF + NN + +RDV I+
Sbjct: 579 -----FLL---------SPVVNLPVSFGSAYVGETFSCTLCANNDLPPDAAKNIRDVRIE 624
Query: 124 AEIQTDK----QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SD 176
AE++T QR+ L + +++G +V D+KE G H L T Y ++
Sbjct: 625 AEMKTPGMGAVQRLELGPPNSQSEADLQSGDTMQKVVSFDLKEEGNHVLAVTVSYYEATE 684
Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQV- 235
G + + ++FI L VRTKV +K T LEA +EN ++ + +++V
Sbjct: 685 TSGRTRTFRKLYQFICKASLIVRTKVGSLKAEDTQGHGRWVLEAQLENCSEDVVQLEKVV 744
Query: 236 -EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSP 294
+ EP + +A G ++ + P G + + + +
Sbjct: 745 LDTEPGLRYRDCNWEASG-------SAKPMLHP-------GEVEQVCFVVAEDGAETGVE 790
Query: 295 VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSK 333
V G + G L I WR +G G L T + LGT ++
Sbjct: 791 VTPDGRIIFGSLGIGWRGEMGNRGFLATGK-LGTRRAAR 828
>gi|396461873|ref|XP_003835548.1| hypothetical protein LEMA_P048890.1 [Leptosphaeria maculans JN3]
gi|312212099|emb|CBX92183.1| hypothetical protein LEMA_P048890.1 [Leptosphaeria maculans JN3]
Length = 323
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 99/344 (28%), Positives = 153/344 (44%), Gaps = 73/344 (21%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RP+L + PL D DL I P A+ PP D T +K
Sbjct: 17 HSVSLKVLRLSRPTLATQHPL-PDSHDLGI------SPKASLAYPP---QDNTNDK---- 62
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
F+ LS +L LP+AFG+ Y+GETF + NN +T V V I
Sbjct: 63 -----FI---------LSPVLNLPEAFGSAYVGETFACTLCANNEIDPSDTTKAVSGVRI 108
Query: 123 KAEIQTDKQ-RILLLDTSKSPVE----SIRAGGRYDFIVEHDVKELGAHTLVCTALYSD- 176
+ ++QT LD + SP + S+ I+ ++KE G H L T Y++
Sbjct: 109 QGDMQTPTNPSGSPLDLTGSPDDSEGLSLGPSESLQRILRFELKEEGNHVLAVTVTYTET 168
Query: 177 --GEGER-----KYLPQFFKFIVSNPLSVRTKVRVV--KVGATHFQEITFLEACIENHTK 227
GEG+ + + ++F+ LSVRTK + K+G + + LEA +EN +
Sbjct: 169 ALGEGKAASGRVRTFRKLYQFVAQQLLSVRTKAGEMSQKMGLSRY----LLEAQLENMGE 224
Query: 228 SNLYMDQVEFEP-------SQNWSATMLKADGPHSDYNAQSREIFKPPVLI--RSGGGIH 278
+ + ++ V P S NW L A G H+ R++ + L+ + GG
Sbjct: 225 AAVCLEAVNVHPKPPLRSISLNWDMHPLGA-GQHNAPILGPRDVVQVAFLLEQQPGGDGD 283
Query: 279 NYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
N S + +G +G+L I WR+ LG+ G L T
Sbjct: 284 N-----------SKTDGPTEGRTPIGQLAIQWRSALGDQGSLST 316
>gi|452842472|gb|EME44408.1| hypothetical protein DOTSEDRAFT_172587 [Dothistroma septosporum
NZE10]
Length = 321
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 89/340 (26%), Positives = 148/340 (43%), Gaps = 65/340 (19%)
Query: 6 GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
G HS++ +V+RL RPSL + PL PT+ G D+ DP A+ + SS
Sbjct: 16 GPHSVSLKVLRLSRPSLATQTPL--PPTNFGNGLDL--DPKAS-----------LAHSSS 60
Query: 66 DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVRDV 120
D F L + LL LP AFGA Y+GETF + NN S + V V
Sbjct: 61 DEAQHGAFPL---------TPLLTLPAAFGAAYVGETFICTLCANNELPSDSESKIVSAV 111
Query: 121 VIKAEIQTDKQR---ILLLDTSKSPVES-----IRAGGRYDFIVEHDVKELGAHTLVCTA 172
I AE+QT L L+ + + ++ GG + HD+K+ G H L T
Sbjct: 112 KIVAELQTPSHSEGIALQLEKAGKAADGDDTGDVKPGGTLQRTLRHDLKDEGPHVLAVTI 171
Query: 173 LYSD--------GEGERKYLPQFFKFIVSNPLSVRTKV--RVVKVGATHFQEITFLEACI 222
Y++ G + + ++F+ ++VR+K+ R + A+ +E LEA +
Sbjct: 172 TYTETLHGNGAASGGRVRTFRKLYQFVSQQLVAVRSKITERKRRDKASGPREW-ILEAQL 230
Query: 223 ENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLY 282
EN ++++ +++V + + S+ + + + + KP + ++
Sbjct: 231 ENVGETSVVLEKVLLKEKEGISSRRMAGE-------EKEATVLKPQ-------DVEQIMF 276
Query: 283 QLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
+L + G LG+L I WR+ +GE G L T
Sbjct: 277 ---LLQEEGERKEEQTGRVPLGQLDIDWRSAMGERGSLTT 313
>gi|328861257|gb|EGG10361.1| hypothetical protein MELLADRAFT_94429 [Melampsora larici-populina
98AG31]
Length = 592
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/363 (25%), Positives = 145/363 (39%), Gaps = 86/363 (23%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H L+ +V+R RP+ +PPL P I+ +N S +
Sbjct: 19 HLLSLKVLRAARPTFK-QPPLH-----------------------PTINPINPSNSISTI 54
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISIN---NSSTLEVRDVVIKA 124
T+ +S S L LP +FG IYLG+TF +S+ N V +V +K
Sbjct: 55 TF----------ESAPKSSTLTLPDSFGVIYLGQTFHGLLSVQYEGNQLDSIVENVALKV 104
Query: 125 EIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER--- 181
E+ T + L + + + G + V+H++KELG HTLVCT Y +
Sbjct: 105 ELHTASHKAFLDEIKTHQIGFGQNG--LELSVKHEIKELGLHTLVCTVFYDQIQSVNSQD 162
Query: 182 ---------------KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQ------------- 213
+ + +KF V NPLSV+TKV V FQ
Sbjct: 163 LDPTNPSPDPTVRVPRSFRKVYKFQVLNPLSVKTKVLVPSSAQPSFQTSPLPSTINAIFS 222
Query: 214 ----EITFLEACIENHTKSNLYMDQVEFEPSQ---NWSATMLKADGPHSDYNAQSR-EIF 265
E +LE I+N + + V+ P Q N + + D N S+ +
Sbjct: 223 PTIREQLYLEVQIQNQSTQPIIFQHVKLIPPQAETNPEEEAEEDKLEYLDLNLDSKTNLL 282
Query: 266 KPPVLIRSGGGIHNYLYQLKMLSHGSSS---PVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
+ S + +L+ + S SS P++ +LG+L+I+W + +GE GRL T
Sbjct: 283 SNSLTHLSTNDSNQFLFLIISQSVNPSSLKKPIQ-----ILGRLEISWNSMMGESGRLMT 337
Query: 323 QQI 325
+
Sbjct: 338 NPL 340
>gi|164659806|ref|XP_001731027.1| hypothetical protein MGL_2026 [Malassezia globosa CBS 7966]
gi|159104925|gb|EDP43813.1| hypothetical protein MGL_2026 [Malassezia globosa CBS 7966]
Length = 462
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/353 (25%), Positives = 156/353 (44%), Gaps = 46/353 (13%)
Query: 5 PGTHSLAFRVMRLCRPSLHVEPPLRVDPT-DLFIGEDIFDDPIAASNLP----------P 53
P T L+ +VMR+ PSL RV P + + + D+P +N P P
Sbjct: 7 PYTPPLSVKVMRIATPSLAS----RVVPMFETCMESGVVDEPSDHNNTPHRQECVEYLDP 62
Query: 54 LISSDV--TTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN 111
I + T + SD + + + +A + + L+LP +FG++ +GETF + I ++N
Sbjct: 63 HIWDVIKSTYARGSDEIFTNAPI---TARDVSYTDQLLLPASFGSVSVGETFQAVICVSN 119
Query: 112 SSTLEVRDVVIKAEIQTDKQRIL------LLDTSKSPVESIRAGGRYDFIVEHDVKELGA 165
+S + ++ + IK E+ TDK L D S + S+ G + + H + +L
Sbjct: 120 TSMMPIQGMRIKVEMHTDKTDSFPPSSHSLNDVS---LPSLAPGAQMTALARHSIDKLAM 176
Query: 166 HTLVCTALYSDGEGERKYLPQFF----KFIVS-NPLSVRTKVRVVKVGATH----FQEIT 216
H LVC ++SD + P F +F V P +R++V + + +E T
Sbjct: 177 HALVC-RIWSDRHTSQGIYPHSFSKQYRFKVHPPPFLMRSEVHTNDTLSFYHDRSIREQT 235
Query: 217 FLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGG 276
+ + N + L +D + +P Q+WSA+ K D H + F + R
Sbjct: 236 LVLVSVHNTSSRPLRLDMLSIDPDQSWSASAPKLD--HMPLMPKDVRNFVFTLSPRETMS 293
Query: 277 IHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTT 329
++ +L+ H +V + LG ++I WR GE GRL+ I TT
Sbjct: 294 PLHFREKLQSAEH-----TRVACTVPLGHIRIAWRVPGGEMGRLRIGTIQRTT 341
>gi|397619517|gb|EJK65296.1| hypothetical protein THAOC_13857 [Thalassiosira oceanica]
Length = 460
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 71/255 (27%), Positives = 127/255 (49%), Gaps = 30/255 (11%)
Query: 84 LSGLLVLPQAFGAIYLGETFCSYISI-NNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSP 142
LS L+LP +FG I++GETF +Y+ + N ++ + VR + + A++QT +RI+L
Sbjct: 51 LSSNLMLPDSFGVIHVGETFAAYLGVLNAAADVSVRGLTVSAQLQTPSRRIVLPSRLDGT 110
Query: 143 VESIRAGGRYDFIVEHDVKELGAHTLVCTALY-SDGEGERKYLPQFFKFIVSNPLSVRTK 201
I G D IV ++E+G H L Y S+G+ K L +F++F V+NPLS+
Sbjct: 111 PADIEPSGGVDAIVARTLEEVGPHILRVEVGYVSNGQ---KSLRKFYRFNVTNPLSITES 167
Query: 202 VRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA--TMLKADGPHSD--- 256
VV+ G ++ +E TK + + V F+PS ++ L +G S
Sbjct: 168 --VVRGGDAKCLVTIRVQNTMEKPTKGAVTISDVRFQPSTGMASEQIALSEEGQGSVSAL 225
Query: 257 --YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQG---SNVLGKLQITWR 311
Y++ R +P G + YL+ ++ S + K++G + LG+ +T+
Sbjct: 226 DLYDSCGR--LQP-------GESYQYLFSVRAESEAA----KLRGISYGDDLGQAVLTYH 272
Query: 312 TNLGEPGRLQTQQIL 326
+GE G +++ ++
Sbjct: 273 KAMGETGVIKSSLVV 287
>gi|402583817|gb|EJW77760.1| hypothetical protein WUBG_11331, partial [Wuchereria bancrofti]
Length = 164
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 58/187 (31%), Positives = 84/187 (44%), Gaps = 28/187 (14%)
Query: 15 MRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFL 74
MRL RP + + +DP D LI S + R
Sbjct: 1 MRLARPKFYENICIPIDPAD---------------TTSQLIGSAL-----------CRLT 34
Query: 75 LHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRIL 134
++AD I + L+ PQ F +IYLGETF Y+ + N S D+ +K ++QT QR
Sbjct: 35 GQEAAD-IPIGKYLMAPQKFESIYLGETFTFYVCVQNISDKLATDICVKTDLQTTSQRNA 93
Query: 135 LLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSN 194
L + + G ++ H++KE+G H LVC Y + E Y +FFKF V+
Sbjct: 94 LSSQLQEANAVLEPGECLGEVITHEIKEIGQHILVCAVSYRTPKNEM-YFRKFFKFPVTK 152
Query: 195 PLSVRTK 201
P+ VRTK
Sbjct: 153 PIDVRTK 159
>gi|317146315|ref|XP_001821432.2| hypothetical protein AOR_1_1658144 [Aspergillus oryzae RIB40]
gi|391869103|gb|EIT78308.1| hypothetical protein Ao3042_05468 [Aspergillus oryzae 3.042]
Length = 336
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 146/355 (41%), Gaps = 76/355 (21%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + P P A + + +NK+S L
Sbjct: 17 HSVSLKVLRLSRPSLSYQYPF----------------PEANTKI---------SNKAS-L 50
Query: 68 TYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVRDVV 121
+Y S DS D+ L+ L LP AFG+ Y+GETF +S NN ++ V V
Sbjct: 51 SYPS-----DSVDNQFILAPNLTLPPAFGSAYVGETFACTLSANNELAEDETSRVVTSVR 105
Query: 122 IKAEIQTDKQ--RILLLDTSKSPV-ESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD-- 176
I AE+QT Q + L +P + ++ G IV D+KE G H L + Y++
Sbjct: 106 IVAEMQTPSQVASLELEPADDAPARDGLQKGQSLQKIVRFDLKEEGNHILAVSVSYTETL 165
Query: 177 -------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK-----------VGATHFQEITFL 218
G + + ++F+ LSVRTK + G T L
Sbjct: 166 IGSDSQAASGRVRTFRKLYQFVAQPCLSVRTKSSELSPLEVENKSLGPYGKTRLLRFA-L 224
Query: 219 EACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS------REIFKPPVLIR 272
EA +EN + + Q + P + AT L D D + R++ + L+
Sbjct: 225 EAQLENVGDEAVVVKQTKLNPKPPFKATSLNWDLARPDQSDSQPPTLNPRDVLQVAFLVE 284
Query: 273 SGGGIHNYLYQL-KMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
G L L K L H G VLG+L I WR +G+ G L T +L
Sbjct: 285 QEEGQQEGLDALQKDLKH--------DGRAVLGQLSIEWRGTMGDKGFLTTGNLL 331
>gi|299116795|emb|CBN74908.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 535
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 91/179 (50%), Gaps = 21/179 (11%)
Query: 84 LSGLLVLPQAFGAIYLGETFCSYISI-NNSSTLEVRDVVIKAEIQTDKQRILLLD----- 137
LS L LP +FG IYLGETF +YIS+ N+ ST + + + A++Q+ R+ L D
Sbjct: 55 LSSALKLPDSFGNIYLGETFTAYISVLNHMSTTVLVNASLSAKLQSPTGRVDLEDRRTAR 114
Query: 138 ----TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SDGEG-ERKYLPQFFK 189
+ +P + D IVEH ++ELG HTL T Y D EG E + + +F++
Sbjct: 115 GASVSRPNPAPLLSPSENLDMIVEHTLEELGTHTLRVTVKYHVAGDPEGSEPRSMRKFYR 174
Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATML 248
F V NP+SV V+ F+E + N T+ +L ++ F P A++L
Sbjct: 175 FSVMNPVSVNPVCTAVRGSP-------FVEVQLVNTTQMDLLLESCHFIPEGGVEASLL 226
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 30/127 (23%), Positives = 58/127 (45%), Gaps = 4/127 (3%)
Query: 300 SNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLT 359
S+ LG++++ WRT GE G ++ ++ E+E+ V +P V+ + + +
Sbjct: 381 SHTLGRVEVCWRTTTGESGSIRGGPVVFEAPDRPEVEVTVDGLPDVLKLGRVAECVATVR 440
Query: 360 NQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
N++++ P + L Q +D V ++G L + L L+A G+ +
Sbjct: 441 NRSNR---PMTLQL-QFRTDGMVGVYVHGQSFRNLGELLPGTFVRCPLQLLALVAGLHEL 496
Query: 420 TGITVFD 426
G TV D
Sbjct: 497 RGCTVAD 503
>gi|330936778|ref|XP_003305510.1| hypothetical protein PTT_18371 [Pyrenophora teres f. teres 0-1]
gi|311317446|gb|EFQ86402.1| hypothetical protein PTT_18371 [Pyrenophora teres f. teres 0-1]
Length = 319
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 88/343 (25%), Positives = 144/343 (41%), Gaps = 73/343 (21%)
Query: 7 THSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
HS++ +V+RL RPSL + PL P +G + +
Sbjct: 16 AHSVSLKVLRLSRPSLATQYPL---PNSKSLG----------------------ISPKAS 50
Query: 67 LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVV 121
L Y S+ +D+ D LS L LP+AFG+ Y+GETF + NN +T + V
Sbjct: 51 LAYPSQ---NDAKDQFILSPALKLPEAFGSAYVGETFSCTLCANNELDSSDNTKAISGVR 107
Query: 122 IKAEIQTDKQRILLLDTSKSPVE-----------SIRAGGRYDFIVEHDVKELGAHTLVC 170
I+ ++QT + + SP+E S G I++ ++KE G H L
Sbjct: 108 IQGDMQTPS------NPTGSPLELCGLSGEDEGISPGPGESLQRILKFELKEDGNHVLAV 161
Query: 171 TALYSD---GEGER-----KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACI 222
T Y++ GEG+ + + ++F+ LSVRTK ++G + LEA +
Sbjct: 162 TVTYTETALGEGKAASGRVRTFRKLYQFVAQQLLSVRTKAG--EMGHRNGSSRYLLEAQL 219
Query: 223 ENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNA---QSREIFKPPVLIRSGGGIHN 279
EN ++ + ++ V P + L D + NA R++ + L+ G +
Sbjct: 220 ENMGEAAVCLEAVNVNPKPPLRSRSLNWDMQPAGLNAPILSPRDVVQVAFLLEHQAGDDD 279
Query: 280 YLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
+ + VLG+L I WR+ LG+ G L T
Sbjct: 280 DM----------PDSITEDNKRVLGQLAIQWRSALGDRGSLST 312
>gi|425781566|gb|EKV19524.1| hypothetical protein PDIG_02530 [Penicillium digitatum PHI26]
gi|425782814|gb|EKV20700.1| hypothetical protein PDIP_13810 [Penicillium digitatum Pd1]
Length = 336
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 94/358 (26%), Positives = 143/358 (39%), Gaps = 82/358 (22%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H+++ +V+RL RPSL + P+ ASN +ISS + L
Sbjct: 17 HAVSLKVLRLARPSLS------------------YQHPLPASNT--IISSKAS------L 50
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD-------- 119
+Y S DS D L+ LL LP +FG++Y+GETF +S NN E+ D
Sbjct: 51 SYPS----GDSDDQFILTPLLTLPPSFGSVYVGETFGCTLSANN----EIHDNDNERILT 102
Query: 120 -VVIKAEIQTDKQRILL---LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS 175
V I AE+QT L + + +R G IV D+KE G H L + Y+
Sbjct: 103 SVRILAEMQTPSSVAALELQPPNDSASTDGLRIGESLQKIVRFDLKEEGNHILAVSVSYT 162
Query: 176 D---------GEGERKYLPQFFKFIVSNPLSVRTKVRVV-----------KVGATHFQEI 215
+ G + + ++F+ LSVRTK + G T
Sbjct: 163 ETKIGSDSQAASGRVRTFRKLYQFVSQPCLSVRTKASELPPLEVDNKSLGPYGKTRLLRF 222
Query: 216 TFLEACIENHTKSNLYMDQVEFEP-------SQNWSATMLKADGPHSDYNAQSREIFKPP 268
LEA +EN + + + Q + P S NW TM P + R++ +
Sbjct: 223 A-LEAQLENVGEGAVVVKQTKLNPKPPFRSKSLNWD-TMNPNMSPAALPTLNPRDVLQVA 280
Query: 269 VLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
L+ G L+ ++ G LG+L I WR +G+ G L T ++
Sbjct: 281 FLVEQEEGQSEGFETLQ-------KDLRRDGRATLGQLSIEWRGAMGDKGFLTTGNLM 331
>gi|255949754|ref|XP_002565644.1| Pc22g17310 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592661|emb|CAP99019.1| Pc22g17310 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 345
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 87/347 (25%), Positives = 142/347 (40%), Gaps = 72/347 (20%)
Query: 14 VMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRF 73
V+RL RPSL + PL P + ++T S L+Y S
Sbjct: 32 VLRLARPSLSYQHPL------------------------PTSKTKISTKAS--LSYPS-- 63
Query: 74 LLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRD-----VVIKAEIQT 128
DS D L+ LL LP +FG++Y+GETF +S NN ++ D V I AE+QT
Sbjct: 64 --SDSDDQFILTPLLTLPPSFGSVYVGETFGCTLSANNEINVDDDDRLLTSVRIVAEMQT 121
Query: 129 DKQRILLL---DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD--------- 176
L + + + ++ G IV D+KE G H L + Y++
Sbjct: 122 PSSVAALELEPPSDSASTDGLKIGESLQKIVRFDLKEEGNHILAVSVSYTETKIGSDSQA 181
Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVV-----------KVGATHFQEITFLEACIENH 225
G + + ++F+ LSVRTK + G T LEA +EN
Sbjct: 182 ASGRVRTFRKLYQFVAQPCLSVRTKASELPPLEVDNKSLGPYGKTRLLRFA-LEAQLENV 240
Query: 226 TKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS------REIFKPPVLIRSGGGIHN 279
+ + + Q + P + + L D ++D + ++ R++ + L+ G +
Sbjct: 241 GEGAVVVKQTKLNPKPPFQSKSLNWDMMNTDMSTRALPTLNPRDVLQVAFLVEQEEGQNE 300
Query: 280 YLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
L L+ ++ G LG+L I WR +G+ G L T ++
Sbjct: 301 GLEALQ-------KDLRRDGRATLGQLSIEWRGAMGDKGFLTTGNLM 340
>gi|119571732|gb|EAW51347.1| hypothetical protein FLJ13611, isoform CRA_e [Homo sapiens]
Length = 217
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 105/211 (49%), Gaps = 15/211 (7%)
Query: 226 TKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQ 283
T S ++M++V EPS ++ T L + + + SR +P YLY
Sbjct: 2 TTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFGSRAYLQPM-------DTRQYLYC 54
Query: 284 LKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVP 343
LK + + ++G V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P
Sbjct: 55 LKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIP 114
Query: 344 SVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGST 403
V +++PF + K+TN +++ ++ L +++ I+G ++ L P +
Sbjct: 115 DTVNLEEPFHITCKITNCSER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC-- 169
Query: 404 DFHLNLIATKLGVQRITGITVFDKLEKITYD 434
L L+++ G+Q I+G+ + D K TY+
Sbjct: 170 -LALTLLSSVQGLQSISGLRLTDTFLKRTYE 199
>gi|358386843|gb|EHK24438.1| hypothetical protein TRIVIDRAFT_219893 [Trichoderma virens Gv29-8]
Length = 319
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 88/334 (26%), Positives = 141/334 (42%), Gaps = 53/334 (15%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL +PSL + P +DP F P S P + L
Sbjct: 16 HSVSVKVLRLSQPSLVTQYP--IDPP--------FSPPNTKSQPAP-----------ASL 54
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN----SSTLEVRDVVIK 123
Y + + D LS +L LP +FG+ Y+GETF + NN + +RDV I+
Sbjct: 55 AYSGS---NTNPDPFLLSPVLNLPVSFGSAYVGETFSCTLCANNDLPPDAAKRIRDVRIE 111
Query: 124 AEIQT----DKQRILLLDTSKSPVES-----IRAGGRYDFIVEHDVKELGAHTLVCTALY 174
AE++T Q++ L + + + GG IV D+KE G H L T Y
Sbjct: 112 AEMKTPGLGGTQKLELGPANMHGAAAAGGVDLEPGGTLQKIVGFDLKEEGNHVLAVTVSY 171
Query: 175 SDG---EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLY 231
S+ G + + ++FI L VRTKV + A+ + LEA +EN ++ +
Sbjct: 172 SEATETSGRTRTFRKLYQFICKASLIVRTKVSSLNTDASSIGKW-ILEAQLENCSEDVIQ 230
Query: 232 MDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGS 291
+++V + + + D N S KP + G I + ++ S
Sbjct: 231 LEKVVLDAEEGLG---------YHDCNWSSDGDKKP---VLHPGEIEQVCFLVQEKGADS 278
Query: 292 SSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
+ G + G L I WR +G G L T ++
Sbjct: 279 GLRLTADGRMIFGVLGIGWRGEMGCRGFLSTGKL 312
>gi|402084162|gb|EJT79180.1| hypothetical protein GGTG_04268 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 335
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 95/359 (26%), Positives = 155/359 (43%), Gaps = 80/359 (22%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + P++ +G + A ++L SS+ TN
Sbjct: 15 HSISLKVLRLSRPSLVPQYPVKSP-----LGAQTAGEASAPASL--AYSSEDGTN----- 62
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTL-----------E 116
+D LS +L LP +FG+ Y+GETF + N+ + + +
Sbjct: 63 -----------SDPFILSPILNLPPSFGSAYVGETFSCTLCANHDAPVAPPGAPPARAKQ 111
Query: 117 VRDVVIKAEIQTDKQ-RILLLD-----------TSKSPVESIRAGGRYDFIVEHDVKELG 164
VRDV I+AE++T + LD T + + GG +V D+K+ G
Sbjct: 112 VRDVRIEAEMKTPASANVTKLDLGPDHAGGRTGTGGAGGVDLEPGGTLQKVVSFDLKDEG 171
Query: 165 AHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHF---QEIT-- 216
H L T Y +D G + + ++F+ L VRTKV + GA +E+T
Sbjct: 172 NHVLAVTVSYYEATDTSGRTRTFRKLYQFVCKPSLIVRTKVSALPTGAVAAATEKELTTP 231
Query: 217 ----FLEACIENHTKSNLYMDQ--VEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVL 270
LEA +EN + + +++ ++ EP ++ +A G PVL
Sbjct: 232 ARRWVLEAQLENCGEDPIQLERAVLDLEPGLTYTDCNWEAAGGQK------------PVL 279
Query: 271 IRSGGGIHNYLYQLKMLSHGSSSPVK-VQGSNVLGKLQITWRTNLGEPGRLQTQQILGT 328
S + Q+ + HG+ +P V G + G L + WR +G G L T + LGT
Sbjct: 280 HPS------EIEQICFVVHGTPTPASLVDGKVIFGILGVGWRGEMGNRGFLSTGK-LGT 331
>gi|225560447|gb|EEH08728.1| DUF974 domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 348
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 93/361 (25%), Positives = 146/361 (40%), Gaps = 75/361 (20%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + PL P ++PPL +S + SSD
Sbjct: 17 HSVSLKVLRLSRPSLSYQYPL----------------PSENESVPPLKASLSYPSDSSD- 59
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK---- 123
S+F+L + + LP AFG+ Y+GETF + NN L++ + V+
Sbjct: 60 ---SQFILSPN---------VTLPPAFGSAYVGETFSCSLCANNELPLDIENRVVSSVRI 107
Query: 124 -AEIQTDKQRILLLDTSKSPVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYSD- 176
AE+QT Q I+ L+ S P E +GG IV D+KE G H L + Y++
Sbjct: 108 VAEMQTPSQ-IVSLELSP-PGEDSGSGGLAKSQSLQKIVRFDLKEEGNHVLAVSVSYTET 165
Query: 177 --------------------GEGERKYLPQFFKFIVSNPLSVRTKVRVV----------- 205
G + + ++FI LSVRTK +
Sbjct: 166 TLAPQGQETSPGSGVGAVQAASGRVRTFRKLYQFIAQPCLSVRTKATELTPLEVDNRALG 225
Query: 206 KVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIF 265
G LEA +EN + + P + + L D SD + +
Sbjct: 226 PYGKARLLRYA-LEAQLENVGDGAISLGSTTLNPKPPFKSRSLNWDFERSDSLKTAPPML 284
Query: 266 KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
KP +++ + Q + L G + G +LG+L I WR ++G+ G L T +
Sbjct: 285 KPRDVLQVAFLVEQEHGQQEGL-EGLQKDMNRDGRTILGQLSIEWRGSMGDRGFLTTGNL 343
Query: 326 L 326
+
Sbjct: 344 M 344
>gi|449301586|gb|EMC97597.1| hypothetical protein BAUCODRAFT_67883 [Baudoinia compniacensis UAMH
10762]
Length = 321
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 86/340 (25%), Positives = 137/340 (40%), Gaps = 63/340 (18%)
Query: 6 GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
G H+++ +V+RL RPSL + PL PT+ G DI PP S + +
Sbjct: 14 GPHAVSLKVLRLSRPSLASQTPL--PPTNFGHGIDI----------PPEASVAYPGSSTK 61
Query: 66 DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS----STLEVRDVV 121
+ + L LL LP AFGA Y+GETF + +NN V V
Sbjct: 62 E------------PSTFPLVPLLTLPSAFGAAYVGETFACTLCVNNEIQHIEKRSVSGVR 109
Query: 122 IKAEIQTDKQ------RILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS 175
+ AE+QT + D ++ + + H++KE G+H L T Y+
Sbjct: 110 VTAELQTPNDPSGTHLELTKADNAEEGDGELPLATTLQRTLAHELKEEGSHVLAVTVSYT 169
Query: 176 ------DG---EGERKYLPQFFKFIVSNPLSVRTKV----RVVKVGATHFQEITFLEACI 222
DG G + + ++F+ + ++VR+K R K G + LEA +
Sbjct: 170 ETLRGDDGGASGGRARSFRKLYQFVAQHLIAVRSKATERKRREKAGGRQW----VLEAQL 225
Query: 223 ENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLY 282
EN + +++V + + ++ + + R+I + L+ GG+
Sbjct: 226 ENVGEMAAVLEKVWLDGKEGIASRAVNGGEEMEAVVLKPRDIEQVMFLLEEDGGV----- 280
Query: 283 QLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
G V G L KL I WRT +GE G L T
Sbjct: 281 -------GKVEDGTVAGRLPLAKLNIEWRTGMGERGSLTT 313
>gi|426384568|ref|XP_004058833.1| PREDICTED: UPF0533 protein C5orf44 homolog [Gorilla gorilla
gorilla]
gi|426384570|ref|XP_004058834.1| PREDICTED: UPF0533 protein C5orf44 homolog [Gorilla gorilla
gorilla]
Length = 218
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 104/211 (49%), Gaps = 14/211 (6%)
Query: 226 TKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQ 283
T S ++M++V EPS ++ T L + + + SR +P YLY
Sbjct: 2 TTSPMFMEKVSLEPSIMYNVTELNSVSQAGECVSTFGSRAYLQPM-------DTRQYLYC 54
Query: 284 LKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVP 343
LK + + ++G V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P
Sbjct: 55 LKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIP 114
Query: 344 SVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGST 403
V +++PF + K+TN + + ++ L +++ I+G ++ L P +
Sbjct: 115 DTVNLEEPFHITCKITNCSSER--TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC-- 170
Query: 404 DFHLNLIATKLGVQRITGITVFDKLEKITYD 434
L L+++ G+Q I+G+ + D K TY+
Sbjct: 171 -LALTLLSSVQGLQSISGLRLTDTFLKRTYE 200
>gi|359497048|ref|XP_003635408.1| PREDICTED: uncharacterized protein LOC100853279, partial [Vitis
vinifera]
Length = 54
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 42/52 (80%), Positives = 44/52 (84%)
Query: 393 ALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFVD 444
AL VEAF STDF LNLIATKLGVQ+ITGITVFD EK TY+ LPDLEIFVD
Sbjct: 1 ALPQVEAFCSTDFRLNLIATKLGVQKITGITVFDIREKRTYEPLPDLEIFVD 52
>gi|452984074|gb|EME83831.1| hypothetical protein MYCFIDRAFT_162727, partial [Pseudocercospora
fijiensis CIRAD86]
Length = 266
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 74/252 (29%), Positives = 113/252 (44%), Gaps = 47/252 (18%)
Query: 6 GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
G HSL+ +V+RL RPSL + PL T+ G DI AS P +D TT
Sbjct: 12 GPHSLSLKVLRLSRPSLATQTPL--PQTNFGDGLDIHP---TASLAHPKGENDSTT---- 62
Query: 66 DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN------SSTLEVRD 119
L+ LL LP AFGA Y+GETF + +NN + V
Sbjct: 63 ----------------FPLTPLLTLPSAFGAAYVGETFTCTLCVNNELSPDSNQRKSVSG 106
Query: 120 VVIKAEIQT-DKQRILLLDTSKSP-----VESIRAGGRYDFIVEHDVKELGAHTLVCTAL 173
V I AE+QT +Q + L+ + E+++ G + H++K+ G H L T
Sbjct: 107 VKITAELQTPSRQEGISLNLENAAEADQDEENLKPGATLQRTLRHELKDEGPHVLAVTVS 166
Query: 174 Y------SDGE----GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIE 223
Y SDG G + + ++F+ L+VR+KV K+ + LEA +E
Sbjct: 167 YTETLIGSDGSAASAGRARTFRKLYQFVSQQLLAVRSKVTERKIREKNSPRQWVLEAQLE 226
Query: 224 NHTKSNLYMDQV 235
N +++ +++V
Sbjct: 227 NVGDASVVLERV 238
>gi|378734173|gb|EHY60632.1| hypothetical protein HMPREF1120_08585 [Exophiala dermatitidis
NIH/UT8656]
Length = 363
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 148/376 (39%), Gaps = 91/376 (24%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL ++ PL P ++ S L
Sbjct: 16 HSVSLKVLRLSRPSLALQHPL-----------------------PHESETETKIPHISSL 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS--------------- 112
Y S+ + + +S L LP +FG+ ++GETF + NN
Sbjct: 53 AYPSKLVDQE----FIISNNLALPPSFGSAHVGETFSCVLCANNELLPPGPTGTGTTTTT 108
Query: 113 -STLEVRDVVIKAEIQTDKQRILLLDTSKSPVESI------RAGGRYDFIVEHDVKELGA 165
T V I AE+QT Q I L SP E + R G I D+KE G
Sbjct: 109 TPTKTVSGTKILAEMQTPSQSIPLDLHIASPTERVDGHDDGRPGSALQTIARFDLKEEGN 168
Query: 166 HTLVCTALYSD---GEGERKYLP---------QFFKFIVSNPLSVRTKVRVVKVG----A 209
H L Y++ G+G + + P + ++F+ LSVRTK +
Sbjct: 169 HVLAVNVTYTETISGDGGQTHAPTSGRVRSFRKLYQFLAQPCLSVRTKATELPPKEVPDK 228
Query: 210 TH--FQEITFL----EACIENHTKSNLYMDQVEFEPSQNWSATMLK----ADGPHSDYNA 259
TH + T L EA +EN + + +++ + + + +T L P D
Sbjct: 229 THGPYGRTTLLRYALEAQLENVSDITIVLEEAKLQSKPPFKSTSLNYWDAHAAPEKDEKN 288
Query: 260 QS---------REIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITW 310
Q R+I + L+ G+ + LK + +K G VLG+L I W
Sbjct: 289 QGHPQKPIINPRDIIQIAFLVEQMEGVQEGIEDLK-------TSLKRDGRAVLGQLAIQW 341
Query: 311 RTNLGEPGRLQTQQIL 326
R+++GE G L T +L
Sbjct: 342 RSSMGERGSLSTGNLL 357
>gi|242765997|ref|XP_002341086.1| DUF974 domain protein [Talaromyces stipitatus ATCC 10500]
gi|218724282|gb|EED23699.1| DUF974 domain protein [Talaromyces stipitatus ATCC 10500]
Length = 345
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 93/364 (25%), Positives = 143/364 (39%), Gaps = 84/364 (23%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + PL + D + + L
Sbjct: 17 HSVSLKVLRLSRPSLSYQYPLPRE--------------------------DTRISSKASL 50
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
Y S +D LS + LP AFG+ Y+GETF + NN ST +V V I
Sbjct: 51 AYPS----NDFDPHFILSPNVTLPPAFGSAYVGETFACSLCANNELPETDSTKKVTSVRI 106
Query: 123 KAEIQTDKQRILLLD-----------TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCT 171
AE+QT Q + LD T P + + G IV+ D+KE G H L +
Sbjct: 107 LAEMQTPSQ-VFPLDLKPGEDEHQDETLPKPGKGLDYGQSLQKIVQFDLKEEGNHILAVS 165
Query: 172 ALYSD-----------GEGERKYLPQFFKFIVSNPLSVRTKVRVV-----------KVGA 209
Y++ G + + ++FI LSVRTK + G
Sbjct: 166 VSYTETLLADANATTASSGRVRTFRKLYQFIAQPCLSVRTKASELVPAEVENKSLGPYGK 225
Query: 210 THFQEITFLEACIENHTKSNLYMDQ--VEFEP-----SQNWSATMLKADGPHSDYNAQSR 262
T LEA +EN ++ +++ + +P S NW + R
Sbjct: 226 TRLLRFA-LEAQLENVGDGSVVIEKTILNAKPPFKSQSLNWDIHHFPSSSTSEQPTMNPR 284
Query: 263 EIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
+I + L+ G H+ L L+ +K G +LG+L I WR+ +G+ G L T
Sbjct: 285 DILQVAFLVEQEVGQHDGLENLQ-------KELKRDGRAILGQLSIEWRSAMGDRGFLTT 337
Query: 323 QQIL 326
++
Sbjct: 338 GNLM 341
>gi|212528588|ref|XP_002144451.1| DUF974 domain protein [Talaromyces marneffei ATCC 18224]
gi|210073849|gb|EEA27936.1| DUF974 domain protein [Talaromyces marneffei ATCC 18224]
Length = 345
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 97/364 (26%), Positives = 145/364 (39%), Gaps = 84/364 (23%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + PL ED P A+ P TN
Sbjct: 17 HSVSLKVLRLSRPSLSYQYPL--------AREDTRISPKASLAYP--------TND---- 56
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
+ F+L S + LP AFG+ Y+GETF + NN S +V V I
Sbjct: 57 -FDPHFIL---------SPNVTLPPAFGSAYVGETFACSLCANNELPTTDSAKKVASVRI 106
Query: 123 KAEIQTDKQRILLLD------------TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVC 170
AE+QT Q + LD S++P E + G IV+ D+KE G H L
Sbjct: 107 LAEMQTPSQ-VFPLDLRPADDDNHDGTLSRTPGEGLDYGQSLQKIVQFDLKEEGNHILAV 165
Query: 171 TALYSD-------------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK----------- 206
+ Y++ G + + ++FI LSVRTK +
Sbjct: 166 SVSYTETLLTDTLASTQAASGGRVRTFRKLYQFIAQPCLSVRTKASELTPAEVDNKSLGP 225
Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDY----NAQSR 262
G T LEA +EN ++ +++ P + AT L D ++ + R
Sbjct: 226 YGKTRLLRFA-LEAQLENVGDGSVVIEKTILSPKPPFKATSLNWDVQAAENVERPSMNPR 284
Query: 263 EIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
+I + L+ G + L L +K G LG+L I WR+ +G+ G L T
Sbjct: 285 DILQVAFLVEQEVGQQDGLDTLL-------KDLKRDGRATLGQLSIEWRSTMGDRGFLTT 337
Query: 323 QQIL 326
+L
Sbjct: 338 GNLL 341
>gi|358365955|dbj|GAA82576.1| DUF974 domain protein [Aspergillus kawachii IFO 4308]
Length = 336
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 142/356 (39%), Gaps = 78/356 (21%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H+++ +V+RL RPSL + PL ++D + + L
Sbjct: 17 HAVSLKVLRLSRPSLSYQYPL--------------------------PAADTKISSKASL 50
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVRDVVI 122
+Y + + D L+ L LP AFG+ Y+GETF +S NN ++ V V I
Sbjct: 51 SYPA----DNVDDQFILTPNLTLPPAFGSAYVGETFACTLSANNELPDEETSRVVTSVRI 106
Query: 123 KAEIQTDKQRILLLDTSKSPVE------SIRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
AE+QT Q + LD P E ++ G IV D+KE G H L + Y++
Sbjct: 107 VAEMQTPSQ-VAALDL--EPAEDTASKDGVQKGHSLQKIVRFDLKEEGNHILAVSVSYTE 163
Query: 177 ---------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK-----------VGATHFQEIT 216
G + + ++F+ LSVRTK + G T
Sbjct: 164 TLIGSDAQAASGRVRTFRKLYQFVAQPCLSVRTKSSELAPLEVENKSLGPYGKTRLLRFA 223
Query: 217 FLEACIENHTKSNLYMDQVEFEPSQNWSATMLKAD--GP-HSDYNAQS---REIFKPPVL 270
LEA +EN + + Q P + A L D GP +D + R++ + L
Sbjct: 224 -LEAQLENVGDGPVVVKQTRLNPKPPFKAVSLNWDLQGPDQADPRPPTLHPRDVLQVAFL 282
Query: 271 IRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
+ G L L+ +K G VLG+L I WR +G+ G L T ++
Sbjct: 283 VEQEEGQQEGLETLQ-------KDMKRDGRAVLGQLSIEWRGAMGDKGFLTTGNLM 331
>gi|323509275|dbj|BAJ77530.1| cgd8_3650 [Cryptosporidium parvum]
Length = 394
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/350 (25%), Positives = 161/350 (46%), Gaps = 20/350 (5%)
Query: 88 LVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIR 147
L+LP +Y GE+F ++ISI NSS ++ VV+K E+ K+R +L + + I
Sbjct: 51 LLLPTTQCRLYCGESFHAFISITNSSIIKANGVVLKVELVGTKKRHILYNNEDN-YSDID 109
Query: 148 AGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKV 207
G D +V+ V E+G ++L C ++ E R + +KF V +P ++ ++ +
Sbjct: 110 IGDSLDIVVKERVDEVGLYSLTCQLFFTSNEA-RLTQKKSYKFAVLSPFNISHRLYNLD- 167
Query: 208 GATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKP 267
T ++ F+E +EN + ++ + ++ EP L + D N +++
Sbjct: 168 EDTMDKKTIFVEVSLENVSHQSITLSSMKLEPINIKKLPELIFE--LEDVNLKNKH--NE 223
Query: 268 PVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILG 327
P+ I+ +N +++ S ++ K + KL+I W + G L + +I G
Sbjct: 224 PLYIQPRCK-YNKIFKFTSCSREYNNLGKSSREVLELKLRIGWVSVSYGDGWLDSYKI-G 281
Query: 328 TTITSKEIELN--------VVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSD 379
I + +LN E+PSV + F + L +TN +Q I L D D
Sbjct: 282 LPILCDQNKLNKEKNAIILKAELPSVNNRQEEFKVFLYVTNNLSIDQKGMSIRL---DFD 338
Query: 380 EEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLE 429
+ ++I G + L ++A + L+ A GV + GI VFD+LE
Sbjct: 339 QLLPIIILGNDRLYLEELKAGETVTLELDCQALVSGVYNLNGIYVFDELE 388
>gi|317037990|ref|XP_001401447.2| hypothetical protein ANI_1_228184 [Aspergillus niger CBS 513.88]
Length = 336
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 90/354 (25%), Positives = 142/354 (40%), Gaps = 74/354 (20%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H+++ +V+RL RPSL + PL ++D + + L
Sbjct: 17 HAVSLKVLRLSRPSLSYQYPL--------------------------PAADTKISSKASL 50
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVRDVVI 122
+Y + + D L+ L LP AFG+ Y+GETF +S NN ++ V V I
Sbjct: 51 SYPA----DNVDDQFILTPNLTLPPAFGSAYVGETFACTLSANNELPDEETSRVVTSVRI 106
Query: 123 KAEIQTDKQRILLLD----TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD-- 176
AE+QT Q + LD + + ++ G IV D+KE G H L + Y++
Sbjct: 107 VAEMQTPSQ-VAALDLEPAEDTASKDGVQKGHSLQKIVRFDLKEEGNHILAVSVSYTETL 165
Query: 177 -------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK-----------VGATHFQEITFL 218
G + + ++F+ LSVRTK + G T L
Sbjct: 166 IGSDAQAASGRVRTFRKLYQFVAQPCLSVRTKSSELAPLEVENKTLGPYGKTRLLRFA-L 224
Query: 219 EACIENHTKSNLYMDQVEFEPSQNWSATMLKAD--GP-HSDYNAQS---REIFKPPVLIR 272
EA +EN + + Q P + A L D GP +D + R++ + L+
Sbjct: 225 EAQLENVGDGPVVVKQTRLNPKPPFKAVSLNWDLQGPDQADPRPPTLHPRDVLQVAFLVE 284
Query: 273 SGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
G L L+ +K G VLG+L I WR +G+ G L T ++
Sbjct: 285 QEEGQQEGLETLQ-------KDMKRDGRAVLGQLSIEWRGAMGDKGFLTTGNLM 331
>gi|240280000|gb|EER43504.1| DUF974 domain-containing protein [Ajellomyces capsulatus H143]
gi|325088719|gb|EGC42029.1| DUF974 domain-containing protein [Ajellomyces capsulatus H88]
Length = 348
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 93/361 (25%), Positives = 148/361 (40%), Gaps = 75/361 (20%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + PL + E+ ++PPL +S + SSD
Sbjct: 17 HSVSLKVLRLSRPSLSYQYPL--------LSEN--------ESVPPLKASLSYPSDSSD- 59
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK---- 123
S+F+L + + LP AFG+ Y+GETF + NN L++ + V+
Sbjct: 60 ---SQFILSPN---------VTLPPAFGSAYVGETFSCSLCANNELPLDIENRVVSSVRI 107
Query: 124 -AEIQTDKQRILLLDTSKSPVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYSD- 176
AE+QT Q I+ L+ S P E +GG IV D+KE G H L + Y++
Sbjct: 108 VAEMQTPSQ-IVSLELSP-PGEDSGSGGLAKSQSLQKIVRFDLKEEGNHVLAVSVSYTET 165
Query: 177 --------------------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK---------- 206
G + + ++FI LSVRTK +
Sbjct: 166 TLAPQGQETSPGSGVGAVQAASGRVRTFRKLYQFIAQPCLSVRTKATELTPLEVDNRALG 225
Query: 207 -VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIF 265
G LEA +EN + + P + + L D SD + +
Sbjct: 226 PYGKARLLRYA-LEAQLENVGDGAISLGSTTLNPKPPFKSRSLNWDFERSDSLKTAPPML 284
Query: 266 KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
KP +++ + Q + L G + G +LG+L I WR ++G+ G L T +
Sbjct: 285 KPRDVLQVAFLVEQEHGQQEGL-EGLQKDMNRDGRTILGQLSIEWRGSMGDRGFLTTGNL 343
Query: 326 L 326
+
Sbjct: 344 M 344
>gi|121706562|ref|XP_001271543.1| DUF974 domain protein [Aspergillus clavatus NRRL 1]
gi|119399691|gb|EAW10117.1| DUF974 domain protein [Aspergillus clavatus NRRL 1]
Length = 337
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 89/315 (28%), Positives = 129/315 (40%), Gaps = 50/315 (15%)
Query: 49 SNLPPLISSDVTTNKSSDLTYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYI 107
SN PL ++ + + L+Y S D AD LS L LP AFG+ Y+GETF +
Sbjct: 31 SNQYPLPVANTKISSKASLSYPS-----DGADGQFILSPNLTLPPAFGSAYVGETFACTL 85
Query: 108 SINNSSTLE-----VRDVVIKAEIQTDKQRILL-LDTSKSPV--ESIRAGGRYDFIVEHD 159
S NN T + V V I AE+QT Q L L+ + P E ++ G IV D
Sbjct: 86 SANNELTEDEASRVVTSVRIVAEMQTPSQVASLELEPATDPAQTEGLQKGESLQKIVRFD 145
Query: 160 VKELGAHTLVCTALYSD---------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK---- 206
+KE G H L + Y++ G + + ++F+ LSVRTK +
Sbjct: 146 LKEEGNHILAVSISYTETLIGSDAQAASGRVRTFRKLYQFVAQPCLSVRTKSSELAPLEV 205
Query: 207 -------VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNA 259
G T LEA +EN + + Q + P + A L D D A
Sbjct: 206 ENKSLGPYGKTRLLRFA-LEAQLENVGDGAVVVKQTKLNPRPPFQAASLNWDLDRPDEVA 264
Query: 260 --------QSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWR 311
R++ + L+ G L L+ ++ G VLG+L I WR
Sbjct: 265 SPLPPPTLNPRDVLQVAFLVEQEEGQQEGLDALQ-------KDLRRDGRAVLGQLSIEWR 317
Query: 312 TNLGEPGRLQTQQIL 326
+G+ G L T +L
Sbjct: 318 GAMGDKGFLTTGNLL 332
>gi|223993247|ref|XP_002286307.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977622|gb|EED95948.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 573
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 74/268 (27%), Positives = 132/268 (49%), Gaps = 32/268 (11%)
Query: 84 LSGLLVLPQAFGAIYLGETFCSYISI-NNSSTLEVRDVVIKAEIQTDKQRILL---LDTS 139
LS L+LP +FG I++GETF +Y+ + N SS L VR + + ++QT +RI+L LD +
Sbjct: 133 LSSNLLLPDSFGVIHVGETFSAYLGVLNPSSDLPVRGLTVTVQLQTPSRRIILPSRLDGT 192
Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY-SDGEGERKYLPQFFKFIVSNPLSV 198
+ ++ I+ GG D IV ++E+G H L Y ++G K L +F++F V+ PL++
Sbjct: 193 DASLKDIQPGGGVDSIVSRRLEEVGQHILRVEVGYMANGA---KTLRKFYRFNVTVPLNI 249
Query: 199 RTKVRVVKVGATHFQEITFLEACIENHTKSN--LYMDQVEFEPSQNWSATMLKAD----- 251
T+ V K A+ IT +E +E + + + V FEP A + +
Sbjct: 250 -TETVVRKGDASCLVSIT-VENVMEKQSSGGGAVTISSVGFEPHSGLVAEQINIEEDSQG 307
Query: 252 ---------GPHSDYNAQSR----EIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQ 298
SD +A R E++ + G I+ YL+ + S +++ +
Sbjct: 308 ETTETDDIMTARSDLSASPRKSTVELYDSCGRLEP-GEINRYLFSVTAGSE-AAALRGIA 365
Query: 299 GSNVLGKLQITWRTNLGEPGRLQTQQIL 326
+ LG+ + + +GE G+L + ++
Sbjct: 366 FGDELGRAYLIYYKAMGESGKLFSSMVV 393
>gi|358399703|gb|EHK49040.1| hypothetical protein TRIATDRAFT_82516 [Trichoderma atroviride IMI
206040]
Length = 796
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 95/345 (27%), Positives = 149/345 (43%), Gaps = 55/345 (15%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL +PSL + P VDP F P S P + L
Sbjct: 488 HSVSVKVLRLSQPSLVTQYP--VDPP--------FSPPNTKSQPAP-----------ASL 526
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN----SSTLEVRDVVIK 123
Y+S + + D LS +L LP +FG+ Y+GETF + NN + +RDV I+
Sbjct: 527 AYKS--ASNTNPDPFLLSPILNLPVSFGSAYVGETFSCTLCANNDLPPDAAKRIRDVRIE 584
Query: 124 AEIQT----DKQRILLLDTS---KSPVESI--RAGGRYDFIVEHDVKELGAHTLVCTALY 174
AE++T Q++ L + +P + GG IV D+KE G H L T Y
Sbjct: 585 AEMKTPGVGGTQKLELGPANIHGATPAGGVDLEPGGTLQRIVGFDLKEEGNHVLAVTVSY 644
Query: 175 SDG---EGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITF-LEACIENHTKSNL 230
S+ G + + ++FI L VRTKV ++ A + + LEA +EN ++ +
Sbjct: 645 SEATETSGRTRTFRKLYQFICKASLIVRTKVSALEASANNSNYRKWVLEAQLENCSEDII 704
Query: 231 YMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHN--YLYQLKMLS 288
+++V + + + D N S KP V G I +L +
Sbjct: 705 QLEKVVLDVEEGLG---------YQDCNWLSEGDKKPVVHP---GEIEQVCFLVHEEGTD 752
Query: 289 HGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSK 333
G + G + G L I WR +G G L T + LG + ++
Sbjct: 753 AGGGLRLTSDGRLIFGVLGIGWRGEMGCRGFLSTGK-LGARVAAR 796
>gi|406860784|gb|EKD13841.1| hypothetical protein MBM_08042 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 361
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 93/355 (26%), Positives = 143/355 (40%), Gaps = 78/355 (21%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H+++ +V+RL RPSL V+ PL PT L A S + L
Sbjct: 37 HAVSLKVLRLSRPSLSVQHPL---PTPLPSSNSSHLSSPAPS---------------ASL 78
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN-------SSTLEVRDV 120
Y S D LS LL LP AFG+ Y+GETF + NN S+ + +V
Sbjct: 79 AYPS-----SKPDPFILSPLLTLPPAFGSAYVGETFSCTLCANNEILAGSSSAGKVITNV 133
Query: 121 VIKAEI-----------------------------QTDKQRILLLDTSKSPVESIRAGGR 151
I+AE+ + D +++L D S +E G
Sbjct: 134 RIEAEMKIPSSSVPIPLVLGPEASSKLETDEVEEGERDPEKVLEKDHQGSDLE---PGKS 190
Query: 152 YDFIVEHDVKELGAHTLVCTALYSD---GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVG 208
IV D+KE G+H L T YS+ G + + ++F+ + + VRTK V+ G
Sbjct: 191 LQKIVGFDLKEEGSHVLAVTVTYSETTPTSGRIRTFRKLYQFVCKSCMVVRTKTGVLPSG 250
Query: 209 ATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPP 268
++ LEA +EN + + +D V E + + L N + E + P
Sbjct: 251 EKEGRKWA-LEAQLENCGEETITLDVVILETKEGFKGQGL---------NWEVGEEMERP 300
Query: 269 VLIRSGGGIHNYLYQL-KMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
VL+ G + + + ++L G V G + G L + WR +G G L T
Sbjct: 301 VLMP--GDVQQVCFLVEEVLGVGGEVVEPVDGKLIFGILSLGWRGTMGNRGFLST 353
>gi|219113485|ref|XP_002186326.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209583176|gb|ACI65796.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 457
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 71/262 (27%), Positives = 123/262 (46%), Gaps = 20/262 (7%)
Query: 75 LHDSA-----DSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLE-VRDVVIKAEIQT 128
LH+ A + L L LP++ G +Y+GETF +Y+ + N+ST + +R + + A++QT
Sbjct: 33 LHNPAAGSLDNQAALHNSLCLPESLG-VYVGETFTAYLGVLNTSTRQSIRRLTVLAQLQT 91
Query: 129 DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFF 188
R L + V+ A G D IV H ++E G H L Y +G + +F+
Sbjct: 92 PSNRWQLPSLLEKGVDVNPANG-VDAIVAHAIEEPGQHILRVEVGYRTNDGGLQTFRKFY 150
Query: 189 KFIVSNPLSV-RTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 247
+F V NPL++ +T R+ +T+ + L + F P A +
Sbjct: 151 RFQVVNPLTIQQTTTRMGDSQCLVSLSVTYNKTA---DATGPLVIANAAFRPVDGLVARL 207
Query: 248 LKADGPHSDYNAQSREIFKPPVLIRSG----GGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
L DG H + ++ +L +SG G I YL+Q++ S + + ++L
Sbjct: 208 L--DG-HVSESTPDAKMSALQLLDKSGLLQPGSIVRYLFQIEATSR-EAVLKGIAAGDLL 263
Query: 304 GKLQITWRTNLGEPGRLQTQQI 325
G+ +TWR +GE G++ + I
Sbjct: 264 GQAVLTWRKAMGETGQIYSASI 285
>gi|398389012|ref|XP_003847967.1| hypothetical protein MYCGRDRAFT_77482 [Zymoseptoria tritici IPO323]
gi|339467841|gb|EGP82943.1| hypothetical protein MYCGRDRAFT_77482 [Zymoseptoria tritici IPO323]
Length = 311
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 87/337 (25%), Positives = 142/337 (42%), Gaps = 67/337 (19%)
Query: 6 GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
G HS++ +V+RL RP+L V+ PL T G DI P AS
Sbjct: 14 GPHSISLKVLRLSRPTLAVQTPLL--STAFNNGLDI---PAKAS---------------- 52
Query: 66 DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLE-----VRDV 120
L Y S D + L+ LL LP +FGA Y+GE F + +NN E V +
Sbjct: 53 -LAYPS----ADQNSTFPLTPLLTLPASFGAAYVGERFTCTLCVNNELLAEDKAKSVSGL 107
Query: 121 VIKAEIQT----DKQRILLLDTSKSPVES-IRAGGRYDFIVEHDVKELGAHTLVCTALYS 175
+ AE+QT D L L ++ + E + G + + H++KE G H L T Y+
Sbjct: 108 KVSAELQTPTFSDAGVALELKSALTKKEEDLSPGDTLQYTLSHELKEEGPHVLAVTVSYT 167
Query: 176 DGE---------GERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHT 226
+ G + + ++F+ L+VR+K+ + LEA +EN
Sbjct: 168 ETSHTAEGGASGGRARTFRKLYQFVAQPLLAVRSKITERQRREKDALRQWILEAQLENVG 227
Query: 227 KSNLYMDQVEFEPSQNWSATMLKADG-PHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK 285
+ ++ +++V W + + DG D N + + KP + ++ ++
Sbjct: 228 EVSVVLERV-------W---LKEEDGMKGQDVNDKEAVVLKP-------SDVEQVMFLVE 270
Query: 286 MLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
S +V LG+L + WR+ +GE G L T
Sbjct: 271 EEERLSELSARVP----LGELNVDWRSAMGERGGLTT 303
>gi|342874081|gb|EGU76154.1| hypothetical protein FOXB_13326 [Fusarium oxysporum Fo5176]
Length = 1061
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 86/319 (26%), Positives = 132/319 (41%), Gaps = 49/319 (15%)
Query: 11 AFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYR 70
A +RL RPSL + P +DP +G I PI AS S+ +N S
Sbjct: 639 ASSTLRLSRPSLVTQYP--IDPPS-SVGASIKSAPIPASLA---YHSEAASNPSP----- 687
Query: 71 SRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS----STLEVRDVVIKAEI 126
FLL S + LP +FG+ Y+GETF + NN + +RDV I+AE+
Sbjct: 688 --FLL---------SPAVNLPVSFGSAYVGETFSCTLCANNELPIDAAKNIRDVRIEAEM 736
Query: 127 QTDK----QRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SDGEG 179
+T QR+ L ++ P + +G +V D+KE G H L T Y ++ G
Sbjct: 737 KTPGMGAVQRLELGPSNGQPEVDLESGDTLQKVVSFDLKEEGNHVLAVTVSYYEATETSG 796
Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQV--EF 237
+ + ++FI L VRTKV + T + LEA +EN ++ + +++V +
Sbjct: 797 RTRTFRKLYQFICKASLIVRTKVGPLNSNNTQERGRWVLEAQLENCSEDVVQLEKVVLDT 856
Query: 238 EPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
EP + +A G L+ G + + + S V
Sbjct: 857 EPGLRYRDCNWEASGSEK--------------LVLHPGEVEQVCFVVAEDGTESGVEVTP 902
Query: 298 QGSNVLGKLQITWRTNLGE 316
G + G L I WR E
Sbjct: 903 DGRIIFGSLGIGWRGPRAE 921
>gi|453080254|gb|EMF08305.1| hypothetical protein SEPMUDRAFT_166779 [Mycosphaerella populorum
SO2202]
Length = 365
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 98/378 (25%), Positives = 149/378 (39%), Gaps = 91/378 (24%)
Query: 2 SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTT 61
S+ G HSL+ +V+RL RP+L + PL PT G DI P A+ S+ +
Sbjct: 14 STFSGPHSLSLKVLRLSRPALATQAPL--PPTAFGNGLDIA--PNASLAYSTADSTATSQ 69
Query: 62 NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISIN----------- 110
++ D + S F L + L LP AFGA Y+GETF + +N
Sbjct: 70 DEKRDTSAPSSFPLTQA---------LTLPAAFGAAYVGETFVCTLCVNNELPPSPSSDE 120
Query: 111 --------NSSTLEVRDVVIKAEIQT-------DKQRILLLDTSKSPVE----------- 144
N + V V I AE+QT D L L+ + S E
Sbjct: 121 GGGGSGEGNQTITVVSGVKIVAELQTPTRNQAGDGGIALPLEGAASTHEDEGEGGEGGGV 180
Query: 145 SIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQ---------------FFK 189
I+ G + H++K+ G + L T Y+ E LPQ ++
Sbjct: 181 KIKPGETLQRTLRHELKDEGQYVLAVTVSYT----EETLLPQHGGTVVGSRTRSFRKLYQ 236
Query: 190 FIVSNPLSVRTKVRVVKVGATHFQEITFLEACIEN--HTKSNLYMDQV---EFEPSQNWS 244
FI ++VR+KV K T LEA +EN + + +++V E E + +
Sbjct: 237 FISQQLVAVRSKVTERKKKDTTAAREWVLEAQLENVADGGAGIVLEKVWLKESEEDRVVA 296
Query: 245 ATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 304
M+ G + KP G I ++ +K ++ V + LG
Sbjct: 297 KAMMDVGG----------TVLKP-------GDIEQIMFLVKEDKKENAEDVDLSMKVRLG 339
Query: 305 KLQITWRTNLGEPGRLQT 322
+L I WR+ +GE G L T
Sbjct: 340 QLNIDWRSAMGEKGSLTT 357
>gi|19584414|emb|CAD28498.1| hypothetical protein [Homo sapiens]
Length = 207
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 83/157 (52%), Gaps = 6/157 (3%)
Query: 278 HNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIEL 337
YLY LK + + ++G V+GKL I W+TNLGE GRLQT Q+ ++ L
Sbjct: 39 RQYLYCLKPKNEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRL 98
Query: 338 NVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPV 397
++ +P V +++PF + K+TN +++ ++ L +++ I+G ++ L P
Sbjct: 99 SLEAIPDTVNLEEPFHITCKITNCSER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPS 155
Query: 398 EAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
+ L L+++ G+Q I+G+ + D K TY+
Sbjct: 156 SSLC---LALTLLSSVQGLQSISGLRLTDTFLKRTYE 189
>gi|312077829|ref|XP_003141474.1| hypothetical protein LOAG_05889 [Loa loa]
Length = 218
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 112/229 (48%), Gaps = 18/229 (7%)
Query: 217 FLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGG 276
+LEA I+N ++ + +++V EPS + ++ + P + + P
Sbjct: 5 YLEAQIQNTSELPMVLEKVILEPSDFYISSEISP--PEIENENMEQSYLNP-------SD 55
Query: 277 IHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIE 336
I YL+ LK + S +G +GKL + WRT++GE GRLQT + ++
Sbjct: 56 IRQYLFCLKPKTTDYSLNYFRKGI-AIGKLDMVWRTSMGERGRLQTSALQRMAPGYGDLR 114
Query: 337 LNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMI--NGLRIMAL 394
L + ++P+ V + +PF + +L N +++ P ++ L+ +D + + +G+ + L
Sbjct: 115 LTIEKIPATVKVLQPFHIVCRLHNCSER---PLDLVLTLDDKLQPNIAFCSTSGVELGQL 171
Query: 395 APVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
P +TDF L L+ G+Q ++GI V D + TY+ ++FV
Sbjct: 172 PPN---STTDFSLELLPLTPGLQSVSGIRVTDTFLRRTYEHDDIAQVFV 217
>gi|322695604|gb|EFY87409.1| DUF974 domain-containing protein [Metarhizium acridum CQMa 102]
Length = 353
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 90/339 (26%), Positives = 149/339 (43%), Gaps = 61/339 (17%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RP+L + P P+ A+ + S ++ SS
Sbjct: 59 HSVSVKVLRLSRPALVPQYP---------------SSPLPATK-EAFLPSSLSYKTSS-- 100
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN---------SSTLEVR 118
T + FLL S +L LP +FG+ Y+GETF + NN S +R
Sbjct: 101 TNPAPFLL---------SPILNLPVSFGSAYVGETFSCTLCANNDLVTASSSSSPGKRIR 151
Query: 119 DVVIKAEIQT----DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY 174
DV I AE++T ++ L S +P + + AG +V D+KE G H L T Y
Sbjct: 152 DVRIDAEMKTPGPGPAHKLPL--ASGAPAD-LAAGETLQRVVSFDLKEEGNHVLAVTVSY 208
Query: 175 ---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLY 231
S+ G + + ++FI L VRTKV + +G ++ LEA +EN ++ +
Sbjct: 209 YEASETSGRTRTFRKLYQFICKASLIVRTKVGL--LGDEGGRKRWVLEAQLENCSQDVMQ 266
Query: 232 MDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGS 291
+D+V E + L+ +G ++ + + P + + + + + + G
Sbjct: 267 LDKVGMEAERG-----LRCEG--CNWAEGEKPVLHPGEVEQVCFVVEEEEREEESRADGD 319
Query: 292 SSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTI 330
+ G V G L I WR +G G L T + LGT +
Sbjct: 320 A-----DGRVVFGVLGIGWRGEMGNRGFLSTGK-LGTRV 352
>gi|349605672|gb|AEQ00830.1| UPF0533 protein C5orf44-like protein-like protein, partial [Equus
caballus]
Length = 170
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 45/157 (28%), Positives = 81/157 (51%), Gaps = 6/157 (3%)
Query: 278 HNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIEL 337
YLY LK + ++G V+GKL I W+TNLGE GRLQT Q+ ++ L
Sbjct: 2 RQYLYCLKPKKEFAEKAGIIKGVTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRL 61
Query: 338 NVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPV 397
++ +P V +++PF + K+TN +++ ++ L ++ I+G ++ L P
Sbjct: 62 SLEAIPDTVNLEEPFHITCKITNCSER---TMDLVLEMCNTSSIHWCGISGRQLGKLHPS 118
Query: 398 EAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYD 434
+ L L+++ G+Q ++G+ + D K TY+
Sbjct: 119 SSLC---LALTLLSSVQGLQSVSGLRLTDTFLKRTYE 152
>gi|354489776|ref|XP_003507037.1| PREDICTED: UPF0533 protein C5orf44 homolog [Cricetulus griseus]
Length = 282
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 62/251 (24%), Positives = 113/251 (45%), Gaps = 17/251 (6%)
Query: 183 YLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQN 242
+L + F S PL V+TK ++ FLE IEN + S +++ +V + +
Sbjct: 2 FLSKICLFYPSEPLDVKTKF------YNSDKDDLFLEVQIENISHSTVFIREVSLKLPEM 55
Query: 243 WSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
++ L + + F +++ G H YLY L+ + G
Sbjct: 56 YTEEALNT----LNLEGEDECTFGTRTFLQATEGRH-YLYHLQFKEEYLEKARTLSGLME 110
Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT 362
+GKL+I W+ LGE L T + + E++L++ ++P V ++PF + K+TN T
Sbjct: 111 MGKLEIVWKRELGEMPMLHTVPLRREAPSCGELKLSLEKIPDTVAREEPFQITCKITNCT 170
Query: 363 DKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGI 422
DK+ ++ L D+ + +G + L + S F + L+ +LG++ I+GI
Sbjct: 171 DKK---MKLLLKMFDTTSVRWCGCSGRK---LGRFKTGSSLSFTVTLLCLQLGLRSISGI 224
Query: 423 TVFDKLEKITY 433
+ D K Y
Sbjct: 225 RIIDATLKTKY 235
>gi|380488796|emb|CCF37134.1| hypothetical protein CH063_08544 [Colletotrichum higginsianum]
Length = 342
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 90/363 (24%), Positives = 148/363 (40%), Gaps = 82/363 (22%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL ++ P+R P+ +NLP + ++
Sbjct: 16 HSVSLKVLRLSRPSLVIQHPVR--------------PPLTPANLPADPTPASLAYDTTAS 61
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN---------------- 111
T + FLL S +L LP +FG+ Y+GE F + N+
Sbjct: 62 TNPAPFLL---------SPILNLPLSFGSAYVGEVFSCTLCANHDVPDPAMAPLGPGGLP 112
Query: 112 ------SSTLEVRDVVIKAEIQT-DKQRILLLDTSK-SPVESIRA-----GGRYDFIVEH 158
+RDV I+AE++T I L+ S +P + + G IV
Sbjct: 113 LAGAAPPKRKSIRDVRIEAEMKTPGANSIQKLELSPPNPSDDTKGTDLDPGDTLQRIVNF 172
Query: 159 DVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEI 215
D+KE G H L T Y ++ G+ + + ++FI + L VRTK+ + A H
Sbjct: 173 DLKEEGNHVLAVTVSYYEATETSGKTRTFRKLYQFICKSSLIVRTKIGPLAPAARHGGRR 232
Query: 216 TFLEACIENHTKSNLYMDQVEFEPSQ-------NWSATMLKADGPHSDYNAQSREIFKPP 268
LEA +EN ++ + +++V + + NW A + +R + P
Sbjct: 233 WALEAQLENCSEDVIQLEKVVLDLADGLGYTDCNWVAAGGGG------SDGDARPVLHP- 285
Query: 269 VLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN---VLGKLQITWRTNLGEPGRLQTQQI 325
G + + ++ SP QG + + G L I WR +G G L T +
Sbjct: 286 ------GEVEQVCF---VVEEAEGSPRAQQGEDGRIMFGILGIGWRGEMGNRGFLSTGK- 335
Query: 326 LGT 328
LGT
Sbjct: 336 LGT 338
>gi|339254156|ref|XP_003372301.1| conserved hypothetical protein [Trichinella spiralis]
gi|316967316|gb|EFV51754.1| conserved hypothetical protein [Trichinella spiralis]
Length = 384
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/370 (23%), Positives = 151/370 (40%), Gaps = 77/370 (20%)
Query: 95 GAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDF 154
G +YLGE F YISI N + + V + +IQT+ R+LL + ++ AG
Sbjct: 69 GNVYLGEVFSCYISILNGTG----ETVTEVDIQTNATRVLLPFKYQDTSLTLNAGQSVGD 124
Query: 155 IVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQE 214
+ H+ F V PL V TK+ + +
Sbjct: 125 SISHE------------------------------FPVLKPLDVCTKL------CSAEND 148
Query: 215 ITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHS------------DYNAQSR 262
+LEA ++N T +++ M++V EP + + ++ +D S + N QS+
Sbjct: 149 TVYLEAQVQNTTDADMIMERVALEPVPDLAPILVPSDFNDSYICTVLYRIIIIERNFQSK 208
Query: 263 E-------IF--KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTN 313
+F K LI+ G + +LY + + S + KL + WRT
Sbjct: 209 TFPRILMLLFREKNCCLIKPGA-VRQFLYGISCIKQDVSWIA-------VAKLNMVWRTT 260
Query: 314 LGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWL 373
G GR+QT + T +++L V+ PS V I PF + + + ++ L
Sbjct: 261 NGRRGRVQTCPLQKTVSGCGDLKLKVISGPSAVKIRLPF-------HVSSFSERALQLTL 313
Query: 374 SQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITY 433
+ +D+ +K ++ N L + P+ + + L L A G+Q +G+ +D K Y
Sbjct: 314 TLDDT-LQKGLLWNSLSEVQFEPLLPAKTMNVTLTLFAECAGLQFASGMKFYDCNAKRRY 372
Query: 434 DSLPDLEIFV 443
+ +FV
Sbjct: 373 EYNDVFHVFV 382
>gi|189196338|ref|XP_001934507.1| hypothetical protein PTRG_04174 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187980386|gb|EDU47012.1| hypothetical protein PTRG_04174 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 334
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 87/343 (25%), Positives = 142/343 (41%), Gaps = 58/343 (16%)
Query: 7 THSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
HS++ +V+R V L+ TD G P A+ P S + + +
Sbjct: 16 AHSVSLKVLR-------VSQILKFAITD---GVPRLSRPSLATQYPLPNSKSLGISPRAS 65
Query: 67 LTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVV 121
L Y S+ +D+ D LS L LP+AFG+ Y+GETF + NN + + V
Sbjct: 66 LAYPSQ---NDANDQFILSPALNLPEAFGSAYVGETFSCTLCANNELDPSDNAKAISGVR 122
Query: 122 IKAEIQTDKQRILLLDTSKSPVE-----------SIRAGGRYDFIVEHDVKELGAHTLVC 170
I+ ++QT + + SP++ S G I+ ++KE G H L
Sbjct: 123 IQGDMQTPS------NPTGSPLDLSGLSGEDDGVSPGPGESLQRILRFELKEDGNHVLAV 176
Query: 171 TALYSD---GEGER-----KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACI 222
T Y + GEG+ + + ++F+ LSVRTK ++G + LEA +
Sbjct: 177 TVTYMETALGEGKAASGRVRTFRKLYQFVAQQLLSVRTKAG--EMGHRNGSSRYLLEAQL 234
Query: 223 ENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNA---QSREIFKPPVLIRSGGGIHN 279
EN ++ + ++ V P + L D + NA R++ + L+ G +
Sbjct: 235 ENMGEAAVCLETVNVNPKPPLRSRSLNWDMQSAGLNAPILSPRDVVQVAFLLEHQAGDDD 294
Query: 280 YLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQT 322
+ V VLG+L I WR+ LG+ G L T
Sbjct: 295 DM----------PDSVTEDNKRVLGQLAIQWRSALGDRGSLST 327
>gi|424513630|emb|CCO66252.1| predicted protein [Bathycoccus prasinos]
Length = 542
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 69/220 (31%), Positives = 94/220 (42%), Gaps = 60/220 (27%)
Query: 156 VEHDVKELGAHTLVCTALYSD---------------GE------GERKYLPQFFKFIVSN 194
V K LG HTL CTA Y D GE GERK ++F F V+N
Sbjct: 151 VHFSAKHLGEHTLKCTAEYVDCPYDERSAVAIMNVAGENTVYDVGERKRAVRYFSFDVTN 210
Query: 195 PLSVRTKVRVV----------KVGATHFQEITFLEACIENH--------TKSNLYMDQVE 236
PL VRTK R V + +E FLEA IEN TK +L +D+
Sbjct: 211 PLHVRTKTRRVFTRSRSEDSDNNSTSSSKEKVFLEATIENVDKAAARLITKVHLIVDE-- 268
Query: 237 FEPSQNWSATMLKADGPHSDYNAQSREIF-----KPPVLIRSGGGIHNYLYQLKMLSH-- 289
+ ++T L + A +F K + ++ GGG ++L+++
Sbjct: 269 ----RRHASTALFPE------IADEETLFDVGNNKNQIYLQKGGGAAHFLFEITETDEWG 318
Query: 290 --GSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILG 327
S + G + LG L+I W + GEPGRLQTQ IL
Sbjct: 319 VSSSMTTTSTSGKDELGTLEICWLGSTGEPGRLQTQPILA 358
>gi|346319202|gb|EGX88804.1| DUF974 domain-containing protein [Cordyceps militaris CM01]
Length = 363
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 82/303 (27%), Positives = 127/303 (41%), Gaps = 73/303 (24%)
Query: 84 LSGLLVLPQAFGAIYLGETFCSYISINNSST----------LEVRDVVIKAEIQT---DK 130
LS +L LP +FG+ Y+GETF + NN T ++RDV I+AE++T
Sbjct: 76 LSPVLNLPVSFGSAYVGETFRCTLCANNDLTHDDGGDTPAVKKIRDVRIEAEMKTPGLGH 135
Query: 131 QRILLLDTSKS-PVESIRAGGRYDF--------IVEHDVKELGAHTLVCTALYSDG---E 178
Q L+ P + +G D +V D+KE G H L T YS+
Sbjct: 136 QAAQQLELGPPLPADEGASGAGADLAPGATLQRVVSFDLKEEGNHVLAVTVSYSESTETS 195
Query: 179 GERKYLPQFFKFIVSNPLSVRTKVRVV------KVGATHFQEITFLEACIENHTKSNLYM 232
G + + ++FI L VRTKV V+ K G + LEA +EN + + +
Sbjct: 196 GRTRTFRKLYQFICKPSLIVRTKVGVLPCPSASKQGRRPPRRRWVLEAQLENCSDDTMQL 255
Query: 233 DQVEFEPSQ-------NWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK 285
++V EP+ NW+A ADGP + + + +P G + + ++
Sbjct: 256 ERVVVEPAPGLAYRDCNWTA----ADGPTA-----VKPVLRP-------GEVEQVCFVVE 299
Query: 286 MLSHGSS---------------SPVKVQGSN---VLGKLQITWRTNLGEPGRLQTQQILG 327
LS + + + G + V G L I WR +G G L T + LG
Sbjct: 300 ALSRAAQVARGGVEADEAVDVVAEAEAGGPDARIVFGVLGIGWRGEMGSRGFLSTGK-LG 358
Query: 328 TTI 330
T +
Sbjct: 359 TRL 361
>gi|340522585|gb|EGR52818.1| predicted protein [Trichoderma reesei QM6a]
Length = 824
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 92/349 (26%), Positives = 147/349 (42%), Gaps = 68/349 (19%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL +PSL + P +DP F + P AS L + +TN
Sbjct: 517 HSVSVKVLRLSQPSLVTQHP--IDPP--FSPPNTKSQPAPAS----LAYAPSSTN----- 563
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN----SSTLEVRDVVIK 123
D LS +L LP +FG+ Y+GETF + NN + +RDV I+
Sbjct: 564 -----------PDPFLLSPILNLPVSFGSAYVGETFSCTLCANNDLPPDAAKRIRDVRIE 612
Query: 124 AEIQT----DKQRILLLDTSKSPVES-------IRAGGRYDFIVEHDVKELGAHTLVCTA 172
AE++T Q++ L + + + GG IV D+KE G H L T
Sbjct: 613 AEMKTPGLGGTQKLELGPANTHEGAAAGGGGVDLEPGGTLQRIVGFDLKEEGNHVLAVTV 672
Query: 173 LY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITF-LEACIENHTKS 228
Y ++ G + + ++FI L VRTKV + + + LEA +EN ++
Sbjct: 673 SYYEATETSGRTRTFRKLYQFICKASLIVRTKVSGLDANTSSSGTRKWILEAQLENCSED 732
Query: 229 NLYMDQV--EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKM 286
+ +++V + E + +DG + + P + Q+
Sbjct: 733 VMQLEKVVLDVEDGLGYHDCNWASDG-------DQKPVLHP-----------GEIEQVCF 774
Query: 287 LSH--GSSSPVKV--QGSNVLGKLQITWRTNLGEPGRLQTQQILGTTIT 331
L H G+ S V++ G + G L I WR +G G L T + LG I
Sbjct: 775 LVHEKGADSGVRMTPDGRIIFGVLGIGWRGEMGCRGYLSTGK-LGARIA 822
>gi|320037981|gb|EFW19917.1| hypothetical protein CPSG_03092 [Coccidioides posadasii str.
Silveira]
Length = 342
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 143/361 (39%), Gaps = 81/361 (22%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + PL ED + P+ S P ++D
Sbjct: 17 HSVSLKVLRLSRPSLSYQHPL---------PEDFANVPVQPSLSYPSSTAD--------- 58
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
+F+L + L+LP AFG+ Y+GETF +S NN ++ V + I
Sbjct: 59 ---KQFILSPN---------LMLPPAFGSAYVGETFSCSLSANNEFLRGDASRVVTSIRI 106
Query: 123 KAEIQTDKQRILLLDTSKSPVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYSD- 176
AE+QT Q + L S ++GG IV D+KE G H L Y++
Sbjct: 107 LAEMQTPSQVVPLELYPSSDDNDTKSGGIAQVESMQRIVRFDLKEEGNHVLAVGVSYTET 166
Query: 177 --------------GEGERKYLPQFFKFIVSNPLSVRTKV-----------RVVKVGATH 211
G + + ++F+ L+VRTK + G T
Sbjct: 167 MITQSSDAHGSVQASGGRVRTFRKLYQFVAQPCLNVRTKATELPPQEVDNRSLGPYGKTK 226
Query: 212 FQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS------REIF 265
LEA +EN + + V P + + L D S + +S R++
Sbjct: 227 LYRFA-LEAQLENVGDGIITLGAVTLNPKPPFKSRSLNWDF-ESSADKESIPTLSPRDVL 284
Query: 266 KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
+ ++ G + L L+ + +G LG+L + WR+ LG+ G L T +
Sbjct: 285 QIAFIVEQEHGQQDGLETLQ-------KDMNREGRATLGQLSLEWRSALGDRGFLTTGNL 337
Query: 326 L 326
+
Sbjct: 338 M 338
>gi|407928991|gb|EKG21830.1| hypothetical protein MPH_00750 [Macrophomina phaseolina MS6]
Length = 327
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 85/340 (25%), Positives = 142/340 (41%), Gaps = 61/340 (17%)
Query: 6 GTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSS 65
G HS++ +V+RL RPSL PL P + T + +
Sbjct: 15 GPHSVSLKVLRLSRPSLAHSFPLPQ----------------------PAQPDEFTISPKA 52
Query: 66 DLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDV 120
L Y + D D +S LL LP+AFG+ Y+GE F + NN + + V
Sbjct: 53 SLAYPT----ADPKDLFLVSPLLKLPEAFGSAYVGEAFSCTLCANNELLPGDESKTISGV 108
Query: 121 VIKAEIQTDK--QRILLLDTSKSPVESIRA----GGRYDFIVEHDVKELGAHTLVCTALY 174
I A++QT I L K E+++ G I+ D+KE G+HTL T Y
Sbjct: 109 KIAADMQTPSAPSGIPLELEPKDGPETVQGTVGPGQSVQKILTFDLKEEGSHTLAVTVTY 168
Query: 175 SD----GEGER-----KYLPQFFKFIVSNPLSVRTKVR--VVKVGATHFQEITFLEACIE 223
++ GEG+ + + ++F+ +SV+TK K G + F LEA +E
Sbjct: 169 TETQMAGEGKAAGGRVRTFRKLYQFVAQQLISVKTKTSELTTKGGPSKF----VLEAQLE 224
Query: 224 NHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQ 283
N + +L ++ V + KA+ ++ +A E PVL G + +
Sbjct: 225 NLGEGSLSLEPVIVN-----AEAPFKANSLNTPLSASPEEPPHLPVL--GPGDVSQVAFI 277
Query: 284 LKMLSHGSSSPVKVQGSN--VLGKLQITWRTNLGEPGRLQ 321
L+ ++ ++ ++ L + WR+ +G G L+
Sbjct: 278 LEQQEGATAGETRLSAGRRMLVRNLWVQWRSPMGGRGSLK 317
>gi|119188243|ref|XP_001244728.1| hypothetical protein CIMG_04169 [Coccidioides immitis RS]
gi|392871443|gb|EAS33358.2| hypothetical protein CIMG_04169 [Coccidioides immitis RS]
Length = 342
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 88/361 (24%), Positives = 143/361 (39%), Gaps = 81/361 (22%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + PL ED + P+ S P ++D
Sbjct: 17 HSVSLKVLRLSRPSLSYQHPL---------PEDFANVPVQPSLSYPSSTAD--------- 58
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
+F+L + L+LP AFG+ Y+GETF +S NN ++ V + I
Sbjct: 59 ---KQFILSPN---------LMLPPAFGSAYVGETFSCSLSANNEFLRGDASRVVTSIRI 106
Query: 123 KAEIQTDKQRILLLDTSKSPVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYSD- 176
AE+QT Q + L S ++GG IV D+KE G H L Y++
Sbjct: 107 LAEMQTPSQVVPLELYPSSDDNDTKSGGIAQVESMQKIVRFDLKEEGNHVLAVGVSYTET 166
Query: 177 --------------GEGERKYLPQFFKFIVSNPLSVRTKV-----------RVVKVGATH 211
G + + ++F+ L+VRTK + G T
Sbjct: 167 MITPSSDAHGSVQASGGRVRTFRKLYQFVAQPCLNVRTKATELPPQEVDNRSLGPYGKTK 226
Query: 212 FQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS------REIF 265
LEA +EN + + V P + + L D S + +S R++
Sbjct: 227 LYRFA-LEAQLENVGDGIITLGAVTLNPKPPFKSRSLNWDF-ESSADKESIPTLSPRDVL 284
Query: 266 KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
+ ++ G + L L+ + +G LG+L + WR+ LG+ G L T +
Sbjct: 285 QIAFIVEQEHGQQDGLETLQ-------KDMNREGRATLGQLSLEWRSALGDRGFLTTGNL 337
Query: 326 L 326
+
Sbjct: 338 M 338
>gi|345314305|ref|XP_001518717.2| PREDICTED: UPF0533 protein C5orf44 homolog, partial
[Ornithorhynchus anatinus]
Length = 129
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/123 (34%), Positives = 60/123 (48%), Gaps = 32/123 (26%)
Query: 79 ADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDT 138
A+ + L +L LPQ FG I+LGETF SYIS++N S+ V+D+++K + +QR
Sbjct: 17 AEILTLGEMLTLPQNFGNIFLGETFSSYISVHNDSSQMVKDILVKV---SGRQR------ 67
Query: 139 SKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSV 198
E LVC Y+ GE+ Y +FFKF V PL V
Sbjct: 68 -----------------------EAAPGRLVCAVSYTTQSGEKMYFRKFFKFQVLKPLDV 104
Query: 199 RTK 201
+TK
Sbjct: 105 KTK 107
>gi|320593998|gb|EFX06401.1| duf974 domain containing protein [Grosmannia clavigera kw1407]
Length = 1072
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 85/354 (24%), Positives = 143/354 (40%), Gaps = 69/354 (19%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H ++ +V+RL PSL + P+ P++ + PP + + +
Sbjct: 751 HPISLKVLRLSHPSLATQYPVAA--------------PLSTALPPPTVPASIAYGGGGPD 796
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN---------------- 111
+ + + D LS +L LP +FG+ Y+GETF + N+
Sbjct: 797 SAAT------NTDPFLLSPVLNLPPSFGSAYVGETFACTLCANHDAADVEDGGWSKEKAA 850
Query: 112 SSTLEVRDVVIKAEIQTDK-----QRILLLDTSKS--------PVESIRAGGRYDFIVEH 158
S+ +RDV I+AE++T + +L +T + +G +V
Sbjct: 851 SAVASIRDVQIEAEMKTPSAAEPVKLVLGPETDDGDGAGLGLHAGTDLASGQTLQKVVRF 910
Query: 159 DVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVG-ATHFQE 214
D+KE G H L T Y ++ G + + ++FI L VRTK G A +
Sbjct: 911 DLKEEGNHVLAVTVSYYEATETSGRTRTFRKLYQFICKASLIVRTKAGPYAAGRAGDMRR 970
Query: 215 ITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSG 274
LEA +EN + + +++VE E ++ + Y+ E + PVL
Sbjct: 971 RWALEAQLENCGEDVIQLERVELELERSLT------------YDKYDWEDGQKPVL--HP 1016
Query: 275 GGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGT 328
G + + L+ G P + G + G L I WR+ +G G L T LGT
Sbjct: 1017 GEVEQVCFLLEETGPG-LVPEQPNGRLLFGVLGIGWRSEMGNRGFL-TTGTLGT 1068
>gi|119501216|ref|XP_001267365.1| hypothetical protein NFIA_109620 [Neosartorya fischeri NRRL 181]
gi|119415530|gb|EAW25468.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 352
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 84/329 (25%), Positives = 129/329 (39%), Gaps = 63/329 (19%)
Query: 49 SNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYIS 108
SN PL +++ ++ + L+Y S + D LS L LP +FG+ Y+GETF +S
Sbjct: 31 SNQYPLPAANTKISRKASLSYPS----DSTDDKFILSPNLTLPPSFGSAYVGETFACTLS 86
Query: 109 INN-----SSTLEVRDVVIKAEIQTDKQRILL-LDTSKSPV--ESIRAGGRYDFIVEHDV 160
NN ++ V V I AE+QT Q L L+ + P E ++ G IV D+
Sbjct: 87 ANNELPEDETSRVVTSVRIVAEMQTPSQVASLDLEPANDPAQTEGLQRGQSLQKIVRFDL 146
Query: 161 KELGAHTLVCTALYSD---------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK----- 206
KE G H L + Y++ G + + ++F+ LSVRTK +
Sbjct: 147 KEEGNHILAVSISYTETLIGSDAQAASGRVRTFRKLYQFVAQPCLSVRTKSSELAPLEVE 206
Query: 207 ------VGATHFQEITFLEACIENHTKSNLYM-----------------DQVEFEPSQNW 243
G T LEA +EN + + Q + P +
Sbjct: 207 NKALGPYGKTRLLRFA-LEAQLENVGDGTVVVKVCGWGILLKISFLTARQQTKLNPKPPF 265
Query: 244 SATMLKADGPHSDY------NAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
A L D D R++ + L+ G L L+ ++
Sbjct: 266 RAVSLNWDLERPDKVDSQPPTLNPRDVLQVAFLVEQEEGQQEGLEALQ-------KDLRR 318
Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
G VLG+L I WR +G+ G L T +L
Sbjct: 319 DGRAVLGQLSIEWRGAVGDKGFLTTGNLL 347
>gi|367055168|ref|XP_003657962.1| hypothetical protein THITE_75670 [Thielavia terrestris NRRL 8126]
gi|347005228|gb|AEO71626.1| hypothetical protein THITE_75670 [Thielavia terrestris NRRL 8126]
Length = 351
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 97/365 (26%), Positives = 151/365 (41%), Gaps = 69/365 (18%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + PL + P S+ PPL +S +N + +
Sbjct: 16 HSVSLKVLRLSRPSLVAQYPL----------QPPLSSPT--SHPPPLPASLAYSNGAGNA 63
Query: 68 T-YRSRFLLHDSADSIG---LSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVR 118
+ + L + LS +L LP +FG+ Y+GETF + N+ + +R
Sbjct: 64 SGANADNPLQPPPTNPAPFVLSPILNLPPSFGSAYVGETFSCTLCANHDIPEGAPPKTIR 123
Query: 119 DVVIKAEIQTDKQ----RILLLDTSKS------PVESIRAGGRYDFIVEH---------- 158
DV I+AE++T ++ LL + S P + G D H
Sbjct: 124 DVRIEAEMKTPSSPAPIKLALLPYTSSDANNDAPTTTTTTAG-VDLTPPHATTLQRILAF 182
Query: 159 DVKELGAHTLVCTALYSDGE---GERKYLPQFFKFIVSNPLSVRTKVRVVKV---GATHF 212
D+KE G H L T Y + G + + ++F L VRTK + GA +
Sbjct: 183 DLKEEGNHVLAVTVSYYEASALAGRTRTFRKLYQFACKASLIVRTKPGALPARPGGARRW 242
Query: 213 QEITFLEACIENHTKSNLYMDQV--EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVL 270
LEA +EN ++ + +++V E EP + +G R K PVL
Sbjct: 243 ----VLEAQLENCSEEGMLLERVGLELEP----GLACVDCNG------GMGRPRRKRPVL 288
Query: 271 IRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTI 330
G + ++ G +V G V G LQI WR+ +G G L T + LGT
Sbjct: 289 --QPGETEQVCFVIEEEEKGRVE--EVDGRVVFGVLQIGWRSEMGNRGFLSTGK-LGTRF 343
Query: 331 TSKEI 335
+I
Sbjct: 344 VKPKI 348
>gi|327294773|ref|XP_003232082.1| hypothetical protein TERG_07700 [Trichophyton rubrum CBS 118892]
gi|326466027|gb|EGD91480.1| hypothetical protein TERG_07700 [Trichophyton rubrum CBS 118892]
Length = 343
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 84/352 (23%), Positives = 135/352 (38%), Gaps = 63/352 (17%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL ++ P+ V SD ++ + L
Sbjct: 17 HSISLKVLRLSRPSLSLQHPIPV--------------------------SDAQFSRITSL 50
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSS----TLEVRDVVIK 123
+Y S S LS L LP +FG+ Y+GETF +S NN + + V V I+
Sbjct: 51 SYPS----ATSDSQFILSPNLTLPPSFGSAYVGETFACSLSANNEALGGNSRVVTSVRIQ 106
Query: 124 AEIQTDKQRIL--LLDTSKSPVESIRAG--GRYDFIVEHDVKELGAHTLVCTALYSD--- 176
A++QT Q I LL + P +S I+ D+KE G H L + Y++
Sbjct: 107 ADMQTPSQTIPLELLPADEEPKKSTGTSTTASVQKIIHFDLKEEGNHVLAVSVNYTETTM 166
Query: 177 ------------GEGERKYLPQFFKFIVSNPLSVRTKV------RVVKVGATHFQEITF- 217
G + + ++F+ LSVRTK + A F +
Sbjct: 167 AANKDAPGGFQASGGRARTFRKLYQFVAQPCLSVRTKATELAPREIEDRSAGPFGKTRLL 226
Query: 218 ---LEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSG 274
LEA +EN + + + +T L D D + + P +
Sbjct: 227 RFALEAQLENVGDGMIVLGVPTLNSKPPFKSTSLNWDFYEKDGDQKKIAPTLAPRDVVQI 286
Query: 275 GGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
+ + + + G LG+L I WR+ +GE G L T ++
Sbjct: 287 AFLVEQEEGEQEGLEATQKDISRDGRTALGQLSIQWRSAMGEKGYLTTGNLM 338
>gi|326469947|gb|EGD93956.1| hypothetical protein TESG_01485 [Trichophyton tonsurans CBS 112818]
Length = 350
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 137/362 (37%), Gaps = 80/362 (22%)
Query: 5 PGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
P T +L V RL RPSL ++ P+ V SD ++
Sbjct: 24 PATDAL---VHRLSRPSLSLQHPIPV--------------------------SDAQFSRI 54
Query: 65 SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSS----TLEVRDV 120
+ L+Y S S LS L LP +FG Y+GETF +S NN + + V V
Sbjct: 55 ASLSYPS----ATSDSQFILSPNLTLPPSFGTAYVGETFACSLSANNEALGGNSRVVTSV 110
Query: 121 VIKAEIQTDKQRIL--LLDTSKSPVES--IRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
I+A++QT Q I LL T + P +S A I+ D+KE G H L + Y++
Sbjct: 111 RIQADMQTPSQTIPLELLPTGEEPAKSAGTSATASIQKIIHFDLKEEGNHVLAVSVNYTE 170
Query: 177 ---------------GEGERKYLPQFFKFIVSNPLSVRTKV------RVVKVGATHFQEI 215
G + + ++F+ LSVRTK + A F +
Sbjct: 171 TMMAPNKDAASGFQASGGRARTFRKLYQFVAQPCLSVRTKATELAPREIEDRSAGPFGKT 230
Query: 216 TF----LEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS-------REI 264
LEA +EN + + + +T L D D + R++
Sbjct: 231 RLLRFALEAQLENVGDGMIVLGIPTLNSKPPFKSTSLNWDFFEKDGGEKKIAPTLAPRDV 290
Query: 265 FKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQ 324
+ L+ G L + + G LG+L I WR+ +GE G L T
Sbjct: 291 VQIAFLVEQEEGQQEGL-------EATQKDISRDGRTALGQLSIQWRSAMGEKGYLMTGN 343
Query: 325 IL 326
++
Sbjct: 344 LM 345
>gi|389640393|ref|XP_003717829.1| hypothetical protein MGG_01105 [Magnaporthe oryzae 70-15]
gi|16565967|gb|AAL26319.1| hypothetical protein [Magnaporthe grisea]
gi|351640382|gb|EHA48245.1| hypothetical protein MGG_01105 [Magnaporthe oryzae 70-15]
gi|440466337|gb|ELQ35609.1| DUF974 domain-containing protein [Magnaporthe oryzae Y34]
gi|440487884|gb|ELQ67649.1| DUF974 domain-containing protein [Magnaporthe oryzae P131]
Length = 339
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 149/362 (41%), Gaps = 82/362 (22%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + P++ P P A + P + L
Sbjct: 15 HSISLKVLRLSRPSLVAQYPVK-SPEG--------SQPSAGAGSHP-----------ASL 54
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVRDVVI 122
Y S + D LS +L LP +FG+ Y+GETF + N+ ++ +VRDV I
Sbjct: 55 AYGSPD--GTNPDPFILSPILNLPPSFGSAYVGETFSCTLCANHDVPDGAAARQVRDVRI 112
Query: 123 KAEIQTDKQRILLL-----------DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCT 171
+AE++T ++ +R G IV D+KE G H L T
Sbjct: 113 EAEMKTPGSAAGVVTKLDLGPNGGGGGEGDGGVDLREGETLQRIVRFDLKEEGNHVLAVT 172
Query: 172 ALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEIT------------ 216
Y ++ G + + ++FI + L VRTK + G+ E +
Sbjct: 173 VSYYEATETSGRTRTFRKLYQFICKSSLIVRTKASQLPGGSGAMTETSSAGGKEEQQQSQ 232
Query: 217 -------FLEACIENHTKSNLYMDQV--EFEPSQNWSATMLKADGPHSDYNAQSREIFKP 267
LEA +EN ++ + +++V + EP ++ +++A R+
Sbjct: 233 LRRRRQWVLEAQLENCSEDAIQLERVVLDLEPGLVYT---------DCNWDADERQ---K 280
Query: 268 PVLIRSGGGIHNYLYQLKMLSHGSSSPVKV-QGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
PVL S + Q+ + + + +V G V G L + WR +G G L T + L
Sbjct: 281 PVLHPS------EVEQVCFVVQEAGAECEVMDGKVVFGVLGVGWRGEMGSRGFLSTGK-L 333
Query: 327 GT 328
GT
Sbjct: 334 GT 335
>gi|414870886|tpg|DAA49443.1| TPA: hypothetical protein ZEAMMB73_957859 [Zea mays]
Length = 70
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 33/69 (47%), Positives = 50/69 (72%)
Query: 378 SDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLP 437
S E++ V++NG + + L VEAF S F L+++ T+LGVQ+I+GIT++ EK Y+ LP
Sbjct: 2 SGEDRAVLVNGPQKLILPLVEAFESIKFDLSMVTTQLGVQKISGITMYAVQEKKYYEPLP 61
Query: 438 DLEIFVDQD 446
D+EIFVD +
Sbjct: 62 DIEIFVDAE 70
>gi|303316452|ref|XP_003068228.1| hypothetical protein CPC735_002510 [Coccidioides posadasii C735
delta SOWgp]
gi|240107909|gb|EER26083.1| hypothetical protein CPC735_002510 [Coccidioides posadasii C735
delta SOWgp]
Length = 342
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 86/361 (23%), Positives = 142/361 (39%), Gaps = 81/361 (22%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + PL ED + P+ S P ++D
Sbjct: 17 HSVSLKVLRLSRPSLSYQHPL---------PEDFANVPVQPSLSYPSSTAD--------- 58
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----STLEVRDVVI 122
+F+L + L+LP AFG+ Y+GETF +S NN ++ V + I
Sbjct: 59 ---KQFILSPN---------LMLPPAFGSAYVGETFSCSLSANNEFLRGDASRVVTSIRI 106
Query: 123 KAEIQTDKQRILLLDTSKSPVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYSD- 176
A++QT Q + L ++GG IV D+KE G H L Y++
Sbjct: 107 LADMQTPSQVVPLELYPSGDDNDTKSGGIAQVESMQRIVRFDLKEEGNHVLAVGVSYTET 166
Query: 177 --------------GEGERKYLPQFFKFIVSNPLSVRTKV-----------RVVKVGATH 211
G + + ++F+ L+VRTK + G T
Sbjct: 167 MITQSSDAHGSVQASGGRVRTFRKLYQFVAQPCLNVRTKATELPPQEVDNRSLGPYGKTK 226
Query: 212 FQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQS------REIF 265
LEA +EN + + V P + + L D S + +S R++
Sbjct: 227 LYRFA-LEAQLENVGDGIITLGAVTLNPKPPFKSRSLNWDF-ESSADKESIPTLSPRDVL 284
Query: 266 KPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
+ ++ G + L L+ + +G LG+L + WR+ LG+ G L T +
Sbjct: 285 QIAFIVEQEHGQQDGLETLQ-------KDMNREGRATLGQLSLEWRSALGDRGFLTTGNL 337
Query: 326 L 326
+
Sbjct: 338 M 338
>gi|322705248|gb|EFY96835.1| DUF974 domain-containing protein [Metarhizium anisopliae ARSEF 23]
Length = 368
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 91/356 (25%), Positives = 147/356 (41%), Gaps = 78/356 (21%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RP+L + P P+ A+ L SS S++
Sbjct: 57 HSVSVKVLRLSRPALVPQYP---------------SSPLPATKEAFLPSSLSYKTPSTN- 100
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTL------------ 115
+ FL LS +L LP +FG+ Y+GETF + NN T
Sbjct: 101 --PAPFL---------LSPILNLPVSFGSAYVGETFSCTLCANNDLTTTSSSSSSPSPSP 149
Query: 116 ----EVRDVVIKAEIQT----DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHT 167
+RDV I AE++T R+ L S +P + + AG +V D+KE G H
Sbjct: 150 PPAKHIRDVRIDAEMKTPGPGPAHRLPL--ASGAPAD-LAAGETLQRVVSFDLKEEGNHV 206
Query: 168 LVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEIT----FLEA 220
L T Y S+ G + + ++F+ L VRTKV ++ G++ + LEA
Sbjct: 207 LAVTVSYYEASETSGRTRTFRKLYQFMCKAGLVVRTKVGLLGGGSSSSSRSSRKRWVLEA 266
Query: 221 CIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNY 280
+EN ++ + +++V E + L+ +G ++ R + P G +
Sbjct: 267 QLENCSQDVMQLEEVGMEAERG-----LRCEG--CNWAEGERPVLHP-------GEVEQV 312
Query: 281 LY------QLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTI 330
+ + S + G V G L I WR +G G L T + LGT +
Sbjct: 313 CFVVVEEDEEDEDEEESGADGDADGRVVFGVLGIGWRGEMGNRGFLSTGK-LGTRV 367
>gi|336468302|gb|EGO56465.1| hypothetical protein NEUTE1DRAFT_65043 [Neurospora tetrasperma FGSC
2508]
Length = 341
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 70/280 (25%), Positives = 120/280 (42%), Gaps = 57/280 (20%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + PL GED + A ++ D
Sbjct: 15 HSVSLKVLRLSRPSLVPQFPLHPP-----HGEDAHEAESAGGE------------RTRDG 57
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETF-CSYISINNSSTL---------EV 117
Y + + LS ++ LP +FG+ Y+GETF C+ + +N+ + +
Sbjct: 58 YYNTEPFI--------LSPIVNLPPSFGSAYVGETFSCTLCANHNAPPIGEGGTSVKKTI 109
Query: 118 RDVVIKAEIQT---DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY 174
RDV I+AE+Q +++L DT+ ++ +G I+ +KE G H L T Y
Sbjct: 110 RDVKIEAEMQAPSGQTTKLVLGDTAGD--DNAGSGTTLQKILNFGLKEEGTHVLGVTVSY 167
Query: 175 ---SDGEGERKYLPQFFKFIVSNPLSVRTK------VRVVKVGATHFQEITFLEACIENH 225
++ G + + ++FI L VRTK + VK G + LEA +EN
Sbjct: 168 YEATETSGRTRAFRKMYQFICKPSLIVRTKAGPLPSLPPVKAGNGKRRRRWVLEAQLENC 227
Query: 226 TKSNLYMDQVEFEPSQ--------NWSATMLKADGPHSDY 257
++ + +++ E Q NW+ + P +
Sbjct: 228 SEDAILLEKAELAEVQRGLKWRDCNWAGIGVGVGPPRRPF 267
>gi|395754144|ref|XP_003779717.1| PREDICTED: LOW QUALITY PROTEIN: UPF0533 protein C5orf44 homolog
[Pongo abelii]
Length = 354
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 147/329 (44%), Gaps = 44/329 (13%)
Query: 106 YISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGA 165
Y+SI+ S + ++ A+IQT+ + +L S + V + + R D ++ HD+K
Sbjct: 52 YMSISKDSNXVAKIILXNADIQTNTXPLHVL-VSMAIVAELVSHCRIDDVI-HDMK---- 105
Query: 166 HTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENH 225
+C F F+ + L +TK + + FL+ I+N
Sbjct: 106 ---LC----------------LFSFL--SQLDDKTKFYNSE------KNDLFLKVKIQNT 138
Query: 226 TKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLK 285
+ S +++ + F S + L + + ++ F ++S G YL ++
Sbjct: 139 SSSTVFIQSISFVSSDMHTGKELNT----VNQDGENECTFGTTTFLQSMEG-RQYLDHVQ 193
Query: 286 MLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSV 345
+ S ++G +GKL I + NLGE LQT Q+L + + + L++ +P
Sbjct: 194 LKQKCSVEAGIIKGLREMGKLDIVSKRNLGEMAMLQTIQLLRXSPGHENMRLSLEMIPDS 253
Query: 346 VGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDF 405
V +++PF + K TN +D++ ++ L+ D+D + G L + + S F
Sbjct: 254 VXLEEPFHITCKTTNCSDRK---MKLILNMCDTDS---IHWYGSSGRYLGKLLSCSSLCF 307
Query: 406 HLNLIATKLGVQRITGITVFDKLEKITYD 434
L+ KLG+Q ++GI + DK + TYD
Sbjct: 308 TXTLLFLKLGLQSVSGIQLTDKSLQKTYD 336
>gi|115482756|ref|NP_001064971.1| Os10g0498800 [Oryza sativa Japonica Group]
gi|113639580|dbj|BAF26885.1| Os10g0498800, partial [Oryza sativa Japonica Group]
Length = 64
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/63 (49%), Positives = 48/63 (76%)
Query: 384 VMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
V++NGL+ + L VEAF S +F L+++AT++GVQ+I+GIT++ EK Y+ L D+EIFV
Sbjct: 2 VLVNGLQKLVLPLVEAFESINFDLSMVATQVGVQKISGITLYAVQEKKLYEPLSDIEIFV 61
Query: 444 DQD 446
D +
Sbjct: 62 DAE 64
>gi|452824517|gb|EME31519.1| hypothetical protein Gasu_11950 [Galdieria sulphuraria]
Length = 461
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 101/464 (21%), Positives = 192/464 (41%), Gaps = 64/464 (13%)
Query: 2 SSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTT 61
+S GT L FR+++ RP P+ FI ++ S+ +VTT
Sbjct: 13 TSLSGTPKLLFRIIKTERPKPTFHAPIP------FIRPLFYEQVDRKSSYEK--DFEVTT 64
Query: 62 NKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVV 121
+SS T DS G++ + F IY GE+ + + N+S+ ++ V
Sbjct: 65 RESSPRT------AEDSC--FGITSNVSHTSNFN-IYRGESVHLTLVLLNASSSDLGFVS 115
Query: 122 IKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
+ +QT + LLDT SP ++ ++ K +G + L C A Y+D +G+
Sbjct: 116 VLVRLQTSEGSYCLLDTQSSPNNIFTTQASLEYNLQFVAKVVGNYALQCFAFYTDVDGQE 175
Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVK-------VGATHFQEITFLEACIENHTKSNLYMDQ 234
+ Q ++F V L+ +R+V+ + H + ++ I N + +Y+ +
Sbjct: 176 HTISQSYRFTVHLCLNFIYDIRLVEEETDWEFFASLHPSSVYIVDCFIYNVCQLPVYLHE 235
Query: 235 VEFEPSQNWSATMLKADGPHSDYNAQ--SREIFKPPV---------LIRSGGGIHNYLYQ 283
V F S N G D N +++ P V LI + G + Y
Sbjct: 236 VHFLLSDNIGC----ERGSKEDQNPSIIVKDLNIPSVGGEERTNESLILNPGDCQTFTY- 290
Query: 284 LKMLSHGSSSPVKVQGS----NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKE----- 334
++ P++ + S NVLG + ++ G+ + +L +T +E
Sbjct: 291 --LVYSAIEDPLRRKSSSRAKNVLGSIYASFTRFGGD------RVVLDPALTVEEPKMSQ 342
Query: 335 ---IELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRI 391
+ + VV VPS + ++ PF+ +K+ N+T + + F + ++ + ++G +
Sbjct: 343 VSMVTIEVVGVPSKIVVECPFVATMKVVNRTSQSKK-FYFQVRRDKVGSIVPIGVSGRLL 401
Query: 392 MALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDS 435
L P + S + LIA + G ++G V D + Y++
Sbjct: 402 ETLQPNQ---SCKLDMQLIALEPGAHFLSGFRVVDVESREYYEA 442
>gi|116204863|ref|XP_001228242.1| hypothetical protein CHGG_10315 [Chaetomium globosum CBS 148.51]
gi|88176443|gb|EAQ83911.1| hypothetical protein CHGG_10315 [Chaetomium globosum CBS 148.51]
Length = 813
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 149/370 (40%), Gaps = 83/370 (22%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + P + F FD PI S+ PP+ +S L
Sbjct: 472 HSVSLKVLRLSRPSLVAQYPFQPP----F--SSPFDGPI--SHQPPIPAS---------L 514
Query: 68 TYRSRFL--LHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISIN------NSSTLE--- 116
Y S L + + LS +L LP +FG+ Y+GETF + N N + L
Sbjct: 515 AYSSNGLNDVPTNPTPFVLSPILNLPPSFGSAYVGETFSCTLCANHDIPDDNPAALAAKT 574
Query: 117 VRDVVIKAEIQTDKQRILLLDTSK---------------------------SPVESIRAG 149
+RDV I+AE++T L SP ++++
Sbjct: 575 IRDVRIEAEMKTPSSATALTLPLTPPSPPTPTTTPGDTTTATTETGPGTDLSPHQTLQK- 633
Query: 150 GRYDFIVEHDVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVV- 205
I+ D+KE G H L T Y S+ G + + ++F+ L VRTK +
Sbjct: 634 -----ILSFDLKEEGNHVLAVTVSYYEASELSGRTRTFRKLYQFVCKPSLIVRTKPGALP 688
Query: 206 KVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ-------NWSATMLKADGPHSDYN 258
+ LEA +EN K L +++V E + NW + G +
Sbjct: 689 PADPASGRRRWVLEAQLENCGKEGLMLEKVGLELERGLGYEDCNWESGGGGGTG-GNGGV 747
Query: 259 AQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPG 318
+ R + P G + ++ + G+ +V G G LQI WR+ +G G
Sbjct: 748 GRMRPVLLP-------GETEQVCFVIEEDAAGAVE--EVDGRVAFGILQIGWRSEMGNRG 798
Query: 319 RLQTQQILGT 328
L T + LGT
Sbjct: 799 FLSTGK-LGT 807
>gi|400601500|gb|EJP69143.1| DUF974 domain-containing protein [Beauveria bassiana ARSEF 2860]
Length = 408
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 95/206 (46%), Gaps = 45/206 (21%)
Query: 84 LSGLLVLPQAFGAIYLGETFCSYISINN-----SSTL----EVRDVVIKAEIQT---DKQ 131
LS +L LP +FG+ Y+GETF + NN SST ++RDV ++AE++T K
Sbjct: 112 LSPILNLPVSFGSAYVGETFSCTLCANNDLDDSSSTATTKRQIRDVRVEAEMKTPGQTKA 171
Query: 132 RILLLDTSKSPVES------------IRAGGRYDFIVEHDVKELGAHTLVCTALY---SD 176
+ L L + S ES + GG IV D+KE G H L T Y ++
Sbjct: 172 QSLELGPAPSSQESAAVGAAAAAATDLAPGGTLQKIVSFDLKEEGNHVLAVTVSYYEAAE 231
Query: 177 GEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEIT-----------FLEACIENH 225
G + + ++FI L VRTKV V+K A ++ LEA +EN
Sbjct: 232 TSGRTRTFRKLYQFICKPSLIVRTKVGVLKAPAPKKKKQQQQQQQPPLRRWVLEAQLENC 291
Query: 226 TKSNLYMDQV--EFEPS-----QNWS 244
+ + +D+V E EP NW+
Sbjct: 292 SDDTMQLDRVVMELEPGLTCRDCNWT 317
>gi|354489772|ref|XP_003507035.1| PREDICTED: UPF0533 protein C5orf44 homolog [Cricetulus griseus]
Length = 287
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 66/262 (25%), Positives = 119/262 (45%), Gaps = 19/262 (7%)
Query: 183 YLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQN 242
+L + F S PL V+TK ++ FLE IEN + S +++ +V + +
Sbjct: 35 FLSKICLFYPSEPLDVKTKF------YNSDKDDLFLEVQIENISHSTVFIREVSLKLPEM 88
Query: 243 WSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNV 302
++ L + + F +++ G H YLY L+ + G
Sbjct: 89 YTEEALNT----LNLEGEDECTFGTRTFLQATEGRH-YLYHLQFKEEYLEKARTLSGLME 143
Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT 362
+GKL+I W+ LGE L T + + E++L++ ++P V ++PF + K+TN T
Sbjct: 144 MGKLEIVWKRELGEMPMLHTVPLRREAPSCGELKLSLEKIPDTVAREEPFQITCKITNCT 203
Query: 363 DKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGI 422
DK+ ++ L D+ + +G + L P S F L L+ +LG++ I+GI
Sbjct: 204 DKK---MKLLLKMFDTTSVRWCGCSGRKPGRLKP---GSSLSFTLTLLCLQLGLRSISGI 257
Query: 423 TVFDK--LEKITYDSLPDLEIF 442
V D + K YD + ++ +
Sbjct: 258 RVIDTTLMTKYRYDDVANVCVL 279
>gi|238491960|ref|XP_002377217.1| DUF974 domain protein [Aspergillus flavus NRRL3357]
gi|220697630|gb|EED53971.1| DUF974 domain protein [Aspergillus flavus NRRL3357]
Length = 257
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 96/212 (45%), Gaps = 49/212 (23%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + P P A + + +NK+S L
Sbjct: 17 HSVSLKVLRLSRPSLSYQYPF----------------PEANTKI---------SNKAS-L 50
Query: 68 TYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYISINN-----SSTLEVRDVV 121
+Y S DS D+ L+ L LP AFG+ Y+GETF +S NN ++ V V
Sbjct: 51 SYPS-----DSVDNQFILAPNLTLPPAFGSAYVGETFACTLSANNELAEDETSRVVTSVR 105
Query: 122 IKAEIQTDKQ--RILLLDTSKSPV-ESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD-- 176
I AE+QT Q + L +P + ++ G IV D+KE G H L + Y++
Sbjct: 106 IVAEMQTPSQVASLELEPADDAPARDGLQKGQSLQKIVRFDLKEEGNHILAVSVSYTETL 165
Query: 177 -------GEGERKYLPQFFKFIVSNPLSVRTK 201
G + + ++F+ LSVRTK
Sbjct: 166 IGSDSQAASGRVRTFRKLYQFVAQPCLSVRTK 197
>gi|171689020|ref|XP_001909450.1| hypothetical protein [Podospora anserina S mat+]
gi|170944472|emb|CAP70583.1| unnamed protein product [Podospora anserina S mat+]
Length = 208
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 72/157 (45%), Gaps = 17/157 (10%)
Query: 84 LSGLLVLPQAFGAIYLGETFCSYISINNS-----------STLEVRDVVIKAEIQTDKQR 132
LS +L LP +FG+ Y+G TF + N+ S +RDV I+AE++T
Sbjct: 44 LSPILALPPSFGSAYVGTTFSCTLCANHDIPPPIDGGPPLSVKTIRDVKIEAEMKTPSSP 103
Query: 133 IL--LLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDG---EGERKYLPQF 187
L LL + GG IV D++E GAHTLV Y + G + +
Sbjct: 104 TLIPLLPPGNDEGTDLSPGGTLQKIVSFDLREEGAHTLVVQVSYYEATSTSGRARMFRKL 163
Query: 188 FKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIEN 224
++F+ L VRTK + +G + LEA +EN
Sbjct: 164 YQFVCKGLLVVRTKTSALGLGKQGNRRWV-LEAQVEN 199
>gi|349803503|gb|AEQ17224.1| hypothetical protein [Pipa carvalhoi]
Length = 122
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 50/87 (57%)
Query: 278 HNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIEL 337
YLY LK + ++G V+GKL I W+TNLGE GRLQT Q+ ++ L
Sbjct: 5 RQYLYCLKPKPEFAEKAGVIKGVTVIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRL 64
Query: 338 NVVEVPSVVGIDKPFLLKLKLTNQTDK 364
++ +P V +++PF + K+TN +++
Sbjct: 65 SIETIPDTVSLEEPFDITCKITNCSER 91
>gi|261197155|ref|XP_002624980.1| DUF974 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239595610|gb|EEQ78191.1| DUF974 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
Length = 457
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 83/327 (25%), Positives = 128/327 (39%), Gaps = 51/327 (15%)
Query: 53 PLISSDVTTNKSSDLTYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYISINN 111
PL S + + L+Y S DS+DS L + LP AFG+ Y+GETF + NN
Sbjct: 55 PLPSENEKVPLKASLSYPS-----DSSDSQFVLCPNVTLPPAFGSAYVGETFSCSLCANN 109
Query: 112 SSTLE-----VRDVVIKAEIQTDKQRILLLDTSKSPVESIRAG----GRYDFIVEHDVKE 162
L V V I AE+QT Q ++ L+ S + +S +G IV D+KE
Sbjct: 110 ELPLYTENRVVSSVRIIAEMQTPSQ-VVSLELSPTGEDSQSSGLAIAQSLQKIVRFDLKE 168
Query: 163 LGAHTLVCTALYSD----------------------GEGERKYLPQFFKFIVSNPLSVRT 200
G H L + Y++ G + + ++FI LSVRT
Sbjct: 169 EGNHVLAVSVSYTETTLAQRDQEMAPGIGGAGATQAASGRVRTFRKLYQFIAQPCLSVRT 228
Query: 201 KVRVVK-----------VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
K + G T LEA +EN + + P + + L
Sbjct: 229 KATELSPLEVDNRALGPYGKTRLLRYA-LEAQLENVGDGAISLGSTTLNPKPPFKSRSLN 287
Query: 250 ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQIT 309
D SD + P +++ + Q + L G + G +LG+L I
Sbjct: 288 WDFERSDSPSVGPPTLNPRDVLQVAFLVEQEHGQQEGL-EGLQKDISRDGRTILGQLSIE 346
Query: 310 WRTNLGEPGRLQTQQILGTTITSKEIE 336
WR ++G+ G L T ++ + E+E
Sbjct: 347 WRGSMGDRGFLTTGNLMTKRRLTLELE 373
>gi|346976493|gb|EGY19945.1| hypothetical protein VDAG_01961 [Verticillium dahliae VdLs.17]
Length = 416
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 86/358 (24%), Positives = 140/358 (39%), Gaps = 77/358 (21%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + P + P D PI AS L
Sbjct: 16 HSISLKVLRLSRPSLVTQHPTK--PPQAPAAHDAA--PIPAS-----------------L 54
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS-----------STLE 116
Y + D L+ +L LP +FG+ Y+GE F + N+ T
Sbjct: 55 AYAPDAAASTNPDPFLLAPILNLPLSFGSAYVGEHFSCTLCANHEPPVSADVAAALPTKR 114
Query: 117 VRDVVIKAEIQTDK-----QRILLLD---------------TSKSPVESIRAGGRYDFIV 156
+RDV I+AE++T Q++ L + + G IV
Sbjct: 115 IRDVRIEAEMKTPGAQGSVQKLQLTGRASDSSSSSSDPADPAAAKATADLAPGETLQRIV 174
Query: 157 EHDVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVV---KVGA- 209
D+K+ G H L T Y ++ G + + ++FI + L VRTKV + GA
Sbjct: 175 GFDLKDEGNHVLAVTVSYYEATETSGRTRTFRKLYQFICKSSLIVRTKVGSLPGTPGGAD 234
Query: 210 THFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYN--AQSREIFKP 267
+ LEA +EN + + +++VE + L+A ++D N + + + P
Sbjct: 235 GRARRRWVLEAQLENCAEDVVQLERVELD---------LEAGLAYTDCNWGSAGKPVLHP 285
Query: 268 PVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
G + + ++ + G G V G L I WR +G G L T ++
Sbjct: 286 -------GEVEQVCFVVEETAEGGGLEPGDDGRIVFGVLGIGWRGEMGNRGYLSTGKL 336
>gi|341901898|gb|EGT57833.1| hypothetical protein CAEBREN_19830 [Caenorhabditis brenneri]
Length = 126
Score = 64.7 bits (156), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 74/153 (48%), Gaps = 29/153 (18%)
Query: 15 MRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFL 74
MRL RP + P D F DP+ + ++ K S+L+ +R
Sbjct: 1 MRLARP--------KYAPLDGF-----SHDPVDPTGF-----GEILAGKVSELSKETR-- 40
Query: 75 LHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRIL 134
HD + + L+ PQ F IYLGETF Y+++ N S V +V +K E+QT QR+
Sbjct: 41 -HD----LPIGEYLIAPQMFENIYLGETFTFYVNVVNESESNVVNVCLKCELQTSTQRVA 95
Query: 135 L-LDTSKSPVESIRAGGRYDFIVEHDVKELGAH 166
L + +E+ + G+ ++ H+VKE+G H
Sbjct: 96 LPCSVQDTIIEASKCDGQ---VISHEVKEIGQH 125
>gi|239606593|gb|EEQ83580.1| DUF974 domain-containing protein [Ajellomyces dermatitidis ER-3]
Length = 367
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 82/323 (25%), Positives = 125/323 (38%), Gaps = 63/323 (19%)
Query: 53 PLISSDVTTNKSSDLTYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYISINN 111
PL S + + L+Y S DS+DS L + LP AFG+ Y+GETF + NN
Sbjct: 55 PLPSENEKVPLKASLSYPS-----DSSDSQFVLCPNVTLPPAFGSAYVGETFSCSLCANN 109
Query: 112 SSTLE-----VRDVVIKAEIQTDKQRILLLDTSKSPVESIRAG----GRYDFIVEHDVKE 162
L V V I AE+QT Q ++ L+ S + +S +G IV D+KE
Sbjct: 110 ELPLYTENRVVSSVRIIAEMQTPSQ-VVSLELSPTGEDSQSSGLAIAQSLQKIVRFDLKE 168
Query: 163 LGAHTLVCTALYSD----------------------GEGERKYLPQFFKFIVSNPLSVRT 200
G H L + Y++ G + + ++FI LSVRT
Sbjct: 169 EGNHVLAVSVSYTETTLAQRDQEMAPGIGGAGATQAASGRVRTFRKLYQFIAQPCLSVRT 228
Query: 201 KVRVVK-----------VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
K + G T LEA +EN + + P + + L
Sbjct: 229 KATELSPLEVDNRALGPYGKTRLLRYA-LEAQLENVGDGAISLGSTTLNPKPPFKSRSLN 287
Query: 250 ADGPHSDYNA------QSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVL 303
D SD + R++ + L+ G L L+ + G +L
Sbjct: 288 WDFERSDSPSVGPPTLNPRDVLQVAFLVEQEHGQQEGLEDLQ-------KDISRDGRTIL 340
Query: 304 GKLQITWRTNLGEPGRLQTQQIL 326
G+L I WR ++G+ G L T ++
Sbjct: 341 GQLSIEWRGSMGDRGFLTTGNLM 363
>gi|324530182|gb|ADY49073.1| Unknown [Ascaris suum]
Length = 194
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 76/147 (51%), Gaps = 8/147 (5%)
Query: 299 GSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKL 358
G +GKL + WRTN+GE GRLQT + ++ L V ++P+ I + F + +L
Sbjct: 53 GGTSIGKLDMVWRTNMGERGRLQTSALQRMAPGYGDLRLTVEKIPATAKIRQTFEVVCRL 112
Query: 359 TNQTDKEQGPFEIWLSQNDSDEEKVVMI--NGLRIMALAPVEAFGSTDFHLNLIATKLGV 416
N +++ ++ L+ + S + +V +G+++ L P + DF L L+ G+
Sbjct: 113 HNCSERS---LDLVLTLDGSLQPALVFCTASGVQLGQLPPNN---TVDFTLELLPITPGL 166
Query: 417 QRITGITVFDKLEKITYDSLPDLEIFV 443
Q I+GI V D K TY+ ++FV
Sbjct: 167 QPISGIRVSDTFLKRTYEHDDIAQVFV 193
>gi|327357840|gb|EGE86697.1| DUF974 domain-containing protein [Ajellomyces dermatitidis ATCC
18188]
Length = 367
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/317 (25%), Positives = 124/317 (39%), Gaps = 51/317 (16%)
Query: 53 PLISSDVTTNKSSDLTYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYISINN 111
PL S + + L+Y S DS+DS L + LP AFG+ Y+GETF + NN
Sbjct: 55 PLPSENEKVPLKASLSYPS-----DSSDSQFVLCPNVTLPPAFGSAYVGETFSCSLCANN 109
Query: 112 SSTLE-----VRDVVIKAEIQTDKQRILLLDTSKSPVESIRAG----GRYDFIVEHDVKE 162
L V V I AE+QT Q ++ L+ S + +S +G IV D+KE
Sbjct: 110 ELPLYTENRVVSSVRIIAEMQTPSQ-VVSLELSPTGEDSQSSGLAKAQSLQKIVRFDLKE 168
Query: 163 LGAHTLVCTALYSD----------------------GEGERKYLPQFFKFIVSNPLSVRT 200
G H L + Y++ G + + ++FI LSVRT
Sbjct: 169 EGNHVLAVSVSYTETTLAQRDQEMPPSIGGASATQAASGRVRTFRKLYQFIAQPCLSVRT 228
Query: 201 KVRVVK-----------VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 249
K + G T LEA +EN + + P + + L
Sbjct: 229 KATELSPLEVDNRALGPYGKTRLLRYA-LEAQLENVGDGAISLGSTTLNPKPPFKSRSLN 287
Query: 250 ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQIT 309
D SD + P +++ + Q + L G + G +LG+L I
Sbjct: 288 WDFERSDSPSVGPPTLNPRDVLQVAFLVEQEHGQQEGL-EGLQKDISRDGRTILGQLSIE 346
Query: 310 WRTNLGEPGRLQTQQIL 326
WR ++G+ G L T ++
Sbjct: 347 WRGSMGDRGFLTTGNLM 363
>gi|402590101|gb|EJW84032.1| hypothetical protein WUBG_05056 [Wuchereria bancrofti]
Length = 207
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 57/220 (25%), Positives = 107/220 (48%), Gaps = 20/220 (9%)
Query: 230 LYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSH 289
+ +++V EPS + ++ + G ++ QS P I YL+ LK +
Sbjct: 1 MVLEKVILEPSDFYLSSEISPPGTENETMDQS--YLNP-------SDIRQYLFCLKPKTT 51
Query: 290 GSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGID 349
S +G+++ GKL + WRT++GE GRLQT + ++ L + ++P+ V
Sbjct: 52 DYSLNYFRKGTSI-GKLDMVWRTSMGERGRLQTSALQRMAPGYGDLRLTIEKIPATVKAL 110
Query: 350 KPF----LLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVM--INGLRIMALAPVEAFGST 403
+ F L+L++ N + E+ ++ L+ + + + I+G+ + LAP +T
Sbjct: 111 QSFRMVCRLRLEVMNYSFSERS-LDLVLTLDGKLQPNIAFCSISGVELGQLAPN---STT 166
Query: 404 DFHLNLIATKLGVQRITGITVFDKLEKITYDSLPDLEIFV 443
DF + L+ G+Q I+GI V D + TY+ ++FV
Sbjct: 167 DFSIELLPLTPGLQSISGIRVTDTFLRRTYEHDDIAQVFV 206
>gi|310794613|gb|EFQ30074.1| hypothetical protein GLRG_05218 [Glomerella graminicola M1.001]
Length = 343
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 79/359 (22%), Positives = 141/359 (39%), Gaps = 78/359 (21%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + P+R P+ S +P + ++
Sbjct: 16 HSVSLKVLRLSRPSLVTQHPIRA--------------PLTPSTVPVDATPASLAYDTTGA 61
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETF----CSYISINNSSTL-------- 115
T + F+ LS +L LP +FG+ Y+GE F C+ + + + L
Sbjct: 62 TNPAPFI---------LSPILNLPLSFGSAYVGEVFSCTLCANHDVPDPAPLVGPGGQPL 112
Query: 116 -----------EVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGG-----------RYD 153
+RDV I+AE++T + P + GG
Sbjct: 113 PGGGGGAPKRKSIRDVRIEAEMKTPGANSVQKLELSPPDHAAANGGDAKGTDLGPGDTLQ 172
Query: 154 FIVEHDVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGAT 210
IV+ D+KE G H L T Y ++ G+ + + ++FI + L VRTK+ +GA+
Sbjct: 173 RIVDFDLKEEGNHVLAVTVSYYEATETSGKTRTFRKLYQFICKSSLIVRTKIG--PLGAS 230
Query: 211 HFQEITF----LEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFK 266
+ +EA +EN ++ + +++V + S T + +R +
Sbjct: 231 GGRHGGRRRWAMEAQLENCSEDVIQLEKVVLDLVDGLSYTDCNWEA-----GGGARPVLH 285
Query: 267 PPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
P G + + ++ + G + G L I WR +G G L T ++
Sbjct: 286 P-------GEVEQVCFVVEEAEGSPRAQPGEDGRIIFGVLGIGWRGEMGNRGFLSTGKL 337
>gi|315056791|ref|XP_003177770.1| DUF974 domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311339616|gb|EFQ98818.1| DUF974 domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 347
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 80/315 (25%), Positives = 124/315 (39%), Gaps = 53/315 (16%)
Query: 53 PLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS 112
PL SD +K + L+Y S S LS L LP AFG+ Y+GETF +S NN
Sbjct: 40 PLPDSDARVSKLASLSYPS----GTSDPQFILSPNLTLPPAFGSAYVGETFACSLSANNE 95
Query: 113 S----TLEVRDVVIKAEIQTDKQRIL--LLDTSKSPVES--IRAGGRYDFIVEHDVKELG 164
+ + V + ++A++QT Q I LL + P +S A I+ D+KE G
Sbjct: 96 ALSGNSRVVTSIRMQADMQTPSQTIPLDLLPEDEEPGKSAGTSAAASVQKIIRFDLKEEG 155
Query: 165 AHTLVCTALYSD---------------GEGERKYLPQFFKFIVSNPLSVRTKV-----RV 204
H L + Y++ G + + ++F+ LSVRTK R
Sbjct: 156 NHVLAVSVNYTETTMAPNKDAPNGFQASGGRVRTFRKLYQFVAQPCLSVRTKATELPPRE 215
Query: 205 VK------VGATHFQEITFLEACIENHTKS--NLYMDQVEFEP-----SQNWSATMLKAD 251
++ G T LEA +EN L + + +P S NW +
Sbjct: 216 IENRSLGPYGKTRLLRFA-LEAQLENVGDEIIVLGVPTLNSKPPFKSTSLNWDVYEQDGE 274
Query: 252 GPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWR 311
+ R++ + L+ G L + + G LG+L I W+
Sbjct: 275 QKKASPTLAPRDVIQLAFLVEQEEGQQEGL-------EVTQKDISRDGRTALGQLSIQWQ 327
Query: 312 TNLGEPGRLQTQQIL 326
+GE G L T ++
Sbjct: 328 GAMGEKGYLTTGNLM 342
>gi|291407886|ref|XP_002720266.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
Length = 362
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 69/131 (52%), Gaps = 6/131 (4%)
Query: 303 LGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQT 362
LGKL + W+ NL E QT Q+ + I ++V +P V +++PF + K+TN +
Sbjct: 219 LGKLNVFWKKNLHETAIQQTIQLERDVPHYRSISVSVESMPDKVIVEEPFYMTCKITNFS 278
Query: 363 DKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGI 422
D++ +++L+ ++D + G + L P + LNL+ K G+QRI+GI
Sbjct: 279 DQK---MKLFLNLCNTDAVHWHLRGGKYLGKLPPRTSLC---LPLNLLFVKQGLQRISGI 332
Query: 423 TVFDKLEKITY 433
+ DK K TY
Sbjct: 333 QLTDKYTKKTY 343
>gi|296827564|ref|XP_002851189.1| DUF974 domain-containing protein [Arthroderma otae CBS 113480]
gi|238838743|gb|EEQ28405.1| DUF974 domain-containing protein [Arthroderma otae CBS 113480]
Length = 342
Score = 62.0 bits (149), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 69/280 (24%), Positives = 107/280 (38%), Gaps = 49/280 (17%)
Query: 88 LVLPQAFGAIYLGETFCSYISINNSS----TLEVRDVVIKAEIQTDKQRILLL----DTS 139
L LP AFG+ Y+GETF +S NN + + V + ++A++QT Q I L D
Sbjct: 66 LTLPPAFGSAYVGETFACSLSANNEALNGNSRVVASIRMQADMQTPSQTIPLELLPPDEE 125
Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD---------------GEGERKYL 184
S V A I+ D+KE G H L + Y++ G +
Sbjct: 126 SSQVAGASAANSVQKIIRFDLKEEGNHVLAVSVNYTEILMVPNKDAQSGYQASGGRVRTF 185
Query: 185 PQFFKFIVSNPLSVRTKVRVVK-----------VGATHFQEITFLEACIENHTKSNLYMD 233
+ ++FI LSVRTK + G T LEA +EN + +
Sbjct: 186 RKLYQFIAQPCLSVRTKATELAPREIENRSLGPYGKTRLLRFA-LEAQLENVGDGVIVLG 244
Query: 234 --QVEFEP-----SQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKM 286
+ +P S NW + R++ + L+ G L ++M
Sbjct: 245 VPTLNSKPPFKSTSLNWDFYQRNGERKKDAPTLAPRDVLQIAFLVEQEEGQQEGLEVMQM 304
Query: 287 LSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
+ G LG+L I W+ +GE G L T ++
Sbjct: 305 -------DISRDGRTSLGQLSIQWQGAMGEKGYLTTGSLM 337
>gi|422293915|gb|EKU21215.1| hypothetical protein NGA_2027510, partial [Nannochloropsis gaditana
CCMP526]
gi|422294871|gb|EKU22171.1| hypothetical protein NGA_2027520, partial [Nannochloropsis gaditana
CCMP526]
Length = 322
Score = 61.6 bits (148), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 77/297 (25%), Positives = 118/297 (39%), Gaps = 81/297 (27%)
Query: 98 YLGETFCSYISINNSSTLEV----RDVVIKA----EIQ-----TDKQRILLLDT------ 138
YLGETFC+Y+SI N+ + +KA E+Q +Q L+ D
Sbjct: 1 YLGETFCAYVSIVNTLPFSILLFEAHASLKASRGNEVQLQNTVATRQADLVGDAPPPVPD 60
Query: 139 --------SKSPVESIRAGGRYDFIVEHDVKELGAHTLVC---------TALYSDGEGER 181
P+E +R G D +VEH ++EL H L T + GE R
Sbjct: 61 QWGGLGVRRDRPLE-LRPGENLDVVVEHVLQELDWHYLAINLELAPTSNTGTRTGGEAPR 119
Query: 182 KYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQ 241
+ + FKF VSNP+++ T RV+ G Q ++ E HT NL+++ V F +
Sbjct: 120 VMM-KRFKFKVSNPVALTTTQRVLPSGQVLVQ--AQIKNITERHT--NLFLEDVTFLAAD 174
Query: 242 NW--SATMLKADG--------------PHSDYNAQSRE--------IFKPPVLIRSGGGI 277
A L +G P + ++ RE F V ++ +
Sbjct: 175 RLHSEAVGLAPNGRSALGAMEQWGDRSPEATLPSEERESDPLDCVAAFDRHVYLQPED-V 233
Query: 278 HNYLYQLKMLSHGSSSPVKVQGSNV--------------LGKLQITWRTNLGEPGRL 320
+LY+L + + P G LG+L+++WRT LGE G L
Sbjct: 234 AQFLYRLSYRAEDTRGPPDQDGMQASSPVARTTLSTGTPLGQLRVSWRTTLGESGTL 290
>gi|326484145|gb|EGE08155.1| DUF974 domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 337
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 63/221 (28%), Positives = 91/221 (41%), Gaps = 56/221 (25%)
Query: 5 PGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKS 64
P T +L V RL RPSL ++ P+ V SD ++
Sbjct: 24 PATDAL---VHRLSRPSLSLQHPIPV--------------------------SDAQFSRI 54
Query: 65 SDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSS----TLEVRDV 120
+ L+Y S S LS L LP +FG Y+GETF +S NN + + V V
Sbjct: 55 ASLSYPS----ATSDSQFILSPNLTLPPSFGTAYVGETFACSLSANNEALGGNSRVVTSV 110
Query: 121 VIKAEIQTDKQRIL--LLDTSKSPVES--IRAGGRYDFIVEHDVKELGAHTLVCTALYSD 176
I+A++QT Q I LL T + P +S A I+ D+KE G H L + Y++
Sbjct: 111 RIQADMQTPSQTIPLELLPTGEEPAKSAGTSATASIQKIIHFDLKEEGNHVLAVSVNYTE 170
Query: 177 ---------------GEGERKYLPQFFKFIVSNPLSVRTKV 202
G + + ++F+ LSVRTK
Sbjct: 171 TMMAPNKDAASGFQASGGRARTFRKLYQFVAQPCLSVRTKA 211
>gi|401881502|gb|EJT45801.1| hypothetical protein A1Q1_05714 [Trichosporon asahii var. asahii
CBS 2479]
Length = 885
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 42/190 (22%), Positives = 86/190 (45%), Gaps = 23/190 (12%)
Query: 94 FGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILL------------LDTSKS 141
+G LGE + + ++N+S V V + EIQ+ R+ L +D S++
Sbjct: 350 YGQASLGEKLKASVRLHNTSNAPVYGVKMMMEIQSPSGRVRLGEVVHGGERPEGMDPSQA 409
Query: 142 PVES------IRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNP 195
+ + G + EH++ ELG H L+C+ + + EG R+ +F KF + P
Sbjct: 410 ETRAWNELPQLAPGEGVELKGEHELAELGLHILICSVAW-ETEGGRRTFQRFLKFTAALP 468
Query: 196 LSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKAD 251
L+++T+V T + +LE ++N + + + + + +A + +
Sbjct: 469 LAIKTRVITPSAPNTALDADKRGDVYLEVLMQNTSPVAMRLQSADLDAVTGMTARSISSP 528
Query: 252 GPHSDYNAQS 261
P ++ +A+S
Sbjct: 529 DPDTEVDARS 538
>gi|154315960|ref|XP_001557302.1| hypothetical protein BC1G_04552 [Botryotinia fuckeliana B05.10]
gi|347842101|emb|CCD56673.1| similar to DUF974 domain-containing protein [Botryotinia
fuckeliana]
Length = 376
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 93/389 (23%), Positives = 149/389 (38%), Gaps = 99/389 (25%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL ++ P +LP S+ L
Sbjct: 17 HSVSLKVLRLSRPSLSIQ----------HPLPTPSPSPPLNLSLP---------APSASL 57
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINN---------------- 111
+Y S + + LS LL LP AFG+ Y+GETF + NN
Sbjct: 58 SYPS-----PTPSNFILSPLLTLPPAFGSAYVGETFSCTLCANNELPSPISQPAQTHTSP 112
Query: 112 ------SSTLEVRDVVIKAEIQ---TDKQRILLLDTSKSPVE------------SIRAGG 150
+S + ++ + AE++ T +L L +SP + I +
Sbjct: 113 DIATSANSNKIISNITLTAEMKIPSTPTPILLPLSGPESPPQVSTTSDEETPEAQITSQT 172
Query: 151 RYDFIVEHDVKELGAHTLVCTALYSDGEGER----KYLPQFFKFIVSNPLSVRTKVRVVK 206
++ D+KE G+H L T Y++ + + ++FI L VRT K
Sbjct: 173 SLQKVLHFDLKEEGSHVLAVTVTYTESSPSSPPRTRTFRKLYQFICKGCLVVRT-----K 227
Query: 207 VGATHFQEITF----------LEACIENHTKSN-LYMDQVEFEPSQNWSATMLKADGPHS 255
+G FQ+ T LEA +EN T+ N + + V ++ + AT L + S
Sbjct: 228 IGPLPFQKSTLSNVSSSKKYALEAQLENITEDNPITLTLVHLATTKGFKATSLNWEIVVS 287
Query: 256 DY---NAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVK-----------VQGSN 301
D N E+ +P + + G I + ++ G V + G
Sbjct: 288 DSEKENGGDVELERP---VLAPGDIRQVCFLVEEKVPGDDGEVADSVEGGKESEIIDGRL 344
Query: 302 VLGKLQITWRTNLGEPGRLQTQQILGTTI 330
+ G L I WR +G G L T LGT +
Sbjct: 345 IFGVLSIGWRGAMGNKGFLSTGN-LGTRV 372
>gi|83769293|dbj|BAE59430.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 291
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 75/303 (24%), Positives = 119/303 (39%), Gaps = 66/303 (21%)
Query: 53 PLISSDVTTNKSSDLTYRSRFLLHDSADS-IGLSGLLVLPQAFGAIYLGETFCSYISINN 111
P ++ + + L+Y S DS D+ L+ L LP AFG+ Y+GETF +S NN
Sbjct: 21 PFPEANTKISNKASLSYPS-----DSVDNQFILAPNLTLPPAFGSAYVGETFACTLSANN 75
Query: 112 -----SSTLEVRDVVIKAEIQTDKQ--RILLLDTSKSPV-ESIRAGGRYDFIVEHDVKEL 163
++ V V I AE+QT Q + L +P + ++ G IV D+KE
Sbjct: 76 ELAEDETSRVVTSVRIVAEMQTPSQVASLELEPADDAPARDGLQKGQSLQKIVRFDLKEE 135
Query: 164 GAHTLVCTALYSD---------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK-------- 206
G H L + Y++ G + + ++F+ LSVRTK +
Sbjct: 136 GNHILAVSVSYTETLIGSDSQAASGRVRTFRKLYQFVAQPCLSVRTKSSELSPLEVENKS 195
Query: 207 ---VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSRE 263
G T LEA +EN S + + ++ T ++ +G +A ++
Sbjct: 196 LGPYGKTRLLRFA-LEAQLENVDFSLILGTLMLSIANETEPQTPVQEEGQQEGLDALQKD 254
Query: 264 IFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQ 323
+ K G VLG+L I WR +G+ G L T
Sbjct: 255 L-------------------------------KHDGRAVLGQLSIEWRGTMGDKGFLTTG 283
Query: 324 QIL 326
+L
Sbjct: 284 NLL 286
>gi|429863211|gb|ELA37718.1| duf974 domain-containing protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 387
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 126/312 (40%), Gaps = 36/312 (11%)
Query: 32 PTDL-------FIGEDIFDDP--IAASNLPPLISSDVTTNKSSDLTYRSRFLLHDSADSI 82
P+DL + D +P ++ LPP VTT S L Y + + +
Sbjct: 93 PSDLVNMSHQRYPSHDPLKEPHSVSLKALPP-----VTTPAPSSLAYDTPAATNPAP--F 145
Query: 83 GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLL---DTS 139
LS +L LP +FG+ Y+GE F + N+ TLE + K + D +
Sbjct: 146 LLSPILNLPLSFGSAYVGEVFSCTLCANH-DTLEPPPGPKRKGGAVQKLELTPADPDDAA 204
Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPL 196
+ + G IV D+KE G H L T Y ++ G+ + + ++FI + L
Sbjct: 205 EGKGTDLEPGETLQRIVNFDLKEEGNHVLAVTVSYYEATETSGKTRTFRKLYQFICKSSL 264
Query: 197 SVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSD 256
VRTK+ + G LEA +EN ++ + +++V + + T D +
Sbjct: 265 IVRTKIGPLASGKNGGARKWVLEAQLENCSEDVIQLEKVLIDLEEGLGYT----DCNWEE 320
Query: 257 YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGE 316
+R + P G + + + + P + G + G L I WR +G
Sbjct: 321 GGGVARPVLHP-------GEVEQVCFVVTEADGAHAEPGE-DGRIMFGVLGIGWRGEMGN 372
Query: 317 PGRLQTQQILGT 328
G L T + LGT
Sbjct: 373 RGFLSTGK-LGT 383
>gi|312071429|ref|XP_003138604.1| hypothetical protein LOAG_03019 [Loa loa]
Length = 145
Score = 58.5 bits (140), Expect = 6e-06, Method: Composition-based stats.
Identities = 45/160 (28%), Positives = 67/160 (41%), Gaps = 27/160 (16%)
Query: 10 LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTY 69
L +VMRL RP + + +D D + LI S +
Sbjct: 10 LTLKVMRLARPKFYENMCIPIDSAD---------------STSQLIGSALC--------- 45
Query: 70 RSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTD 129
R ++AD I + L+ PQ F IYLGETF ++ + N S D+ IK ++QT
Sbjct: 46 --RLTGQEAAD-IPIGKYLMAPQKFENIYLGETFSFFVCVQNISDKVAMDICIKTDLQTT 102
Query: 130 KQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLV 169
QR L + + G I+ H++KE+G H V
Sbjct: 103 SQRNALPSQLQEANAVLEPGKCLGEIITHEIKEIGQHMYV 142
>gi|406696508|gb|EKC99793.1| hypothetical protein A1Q2_05872 [Trichosporon asahii var. asahii
CBS 8904]
Length = 885
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/190 (21%), Positives = 86/190 (45%), Gaps = 23/190 (12%)
Query: 94 FGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILL------------LDTSKS 141
+G LGE + + ++++S V V + E+Q+ R+ L +D S++
Sbjct: 350 YGQASLGEKLKASVRLHDTSNAPVYGVKMMMEVQSPSGRVRLGEVVHGGERPEGMDPSQA 409
Query: 142 PVES------IRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNP 195
+ + G + EH++ ELG H L+C+ + + EG R+ +F KF + P
Sbjct: 410 ETRAWNELPQLAPGEGVELKGEHELAELGLHILICSVAW-ETEGGRRTFQRFLKFTAALP 468
Query: 196 LSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKAD 251
L+++T+V T + +LE ++N + + + + + +A + +
Sbjct: 469 LAIKTRVITPSAPNTALDADKRGDVYLEVLMQNTSPVAMRLRSADLDAVTGMTARSISSP 528
Query: 252 GPHSDYNAQS 261
P ++ +A+S
Sbjct: 529 DPDTEVDARS 538
>gi|67609511|ref|XP_667022.1| hypothetical protein [Cryptosporidium hominis TU502]
gi|54658115|gb|EAL36797.1| hypothetical protein Chro.80422 [Cryptosporidium hominis]
Length = 299
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 73/304 (24%), Positives = 133/304 (43%), Gaps = 20/304 (6%)
Query: 135 LLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSN 194
+L ++ I G D +V+ V E+G ++L C ++ E R + +KF V +
Sbjct: 1 MLYNNEDNYSDIDIGDSLDIVVKERVDEVGLYSLTCQLFFTSNEA-RLTQKKSYKFAVLS 59
Query: 195 PLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPH 254
P ++ ++ + GA + I F+E +EN + ++ + ++ EP L +
Sbjct: 60 PFNISHRLYNLDEGAMDKKTI-FVEVSLENISHQSITLSSMKLEPINIKKLPELIFE--L 116
Query: 255 SDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG-KLQITWRTN 313
D N +++ P+ I+ +N +++ S G + + VL KL+I W +
Sbjct: 117 EDVNLKNKH--NEPLYIQPRCK-YNKIFKFTFRSRGEYNNLGTSSREVLELKLRIGWISV 173
Query: 314 LGEPGRLQTQQILGTTITSKEIELN--------VVEVPSVVGIDKPFLLKLKLTNQTDKE 365
G L + +I I + +LN E+PSV + F + L +TN +
Sbjct: 174 SYGDGWLDSYKI-DLPILCDQNKLNKEKNAIILKAELPSVNNRQEEFKVFLYVTNNLSID 232
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
Q I L D D+ ++I G + L ++A + L+ A GV + GI VF
Sbjct: 233 QKGVSIRL---DFDQLLPIIILGNDRLYLEELKAGETVTLELDCQALVSGVYNLNGIYVF 289
Query: 426 DKLE 429
D+LE
Sbjct: 290 DELE 293
>gi|169604758|ref|XP_001795800.1| hypothetical protein SNOG_05395 [Phaeosphaeria nodorum SN15]
gi|160706634|gb|EAT87786.2| hypothetical protein SNOG_05395 [Phaeosphaeria nodorum SN15]
Length = 294
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 95/211 (45%), Gaps = 40/211 (18%)
Query: 56 SSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS--- 112
S D+ + + L Y S+ DS LS +L LP+AFG+ Y+GETF + NN
Sbjct: 45 SQDLGISPKASLAYPSQ---DDSNSRFLLSPVLNLPEAFGSAYVGETFSCTLCANNELDA 101
Query: 113 --STLEVRDVVIKAEIQTDKQRILLLDTSKSPVE------------SIRAGGRYDFIVEH 158
+T V V I+ ++QT + + SP++ S G I+
Sbjct: 102 ADTTRAVSGVRIQGDMQTPS------NPAGSPLDLTGSLEDGEDAVSPGPGESLQRILRF 155
Query: 159 DVKELGAHTLVCTALYSD---GEGER-----KYLPQFFKFIVSNPLSVRTKVRVVKV--G 208
++KE G H L T Y++ GEG+ + + ++F+ LSVRTK + G
Sbjct: 156 ELKEDGNHVLAVTVTYTETALGEGKAASGRVRTFRKLYQFVAQQLLSVRTKAGELTQPNG 215
Query: 209 ATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
+ + LEA +EN ++ + ++ + P
Sbjct: 216 PSKY----LLEAQLENMGEAAVCLEVRDLFP 242
>gi|295665813|ref|XP_002793457.1| DUF974 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226277751|gb|EEH33317.1| DUF974 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 343
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 72/284 (25%), Positives = 110/284 (38%), Gaps = 56/284 (19%)
Query: 88 LVLPQAFGAIYLGETFCSYISIN-----NSSTLEVRDVVIKAEIQTDKQRILL-LDTSKS 141
+ LP AFG+ Y+GETF + N +S V V I AE+QT Q ++L L S
Sbjct: 67 VTLPPAFGSAYVGETFSCSLCANSELLPDSENRIVSSVRIIAEMQTPSQNVVLELFPSG- 125
Query: 142 PVESIRAGG-----RYDFIVEHDVKELGAHTLVCTALYSDG-EGERKYLP---------- 185
E +GG IV D+KE G H L + Y++ + + +P
Sbjct: 126 --EDSNSGGLTKSQSLQKIVRFDLKEEGNHVLAVSVSYTETIMAQAREMPSSGDTQAASW 183
Query: 186 ------QFFKFIVSNPLSVRTKVRVVK-----------VGATHFQEITFLEACIENHTKS 228
+ ++FI L+VRTKV + G T LEA +EN
Sbjct: 184 RVRTFRKLYQFIAQPCLNVRTKVTELAPLEADNRAFDPYGKTRLLRY-VLEAQLENIGDG 242
Query: 229 NLYMDQVEFEPSQNWSATMLKAD--GPHS----DYNAQSREIFKPPVLIRSGGGIHNYLY 282
+ + P + + L D P+S R++ + L+ G L
Sbjct: 243 AISLGSTTLNPKPPFQSRSLNWDLEQPNSLEMRPLTLSPRDVLQVAFLVEREPGQQEGL- 301
Query: 283 QLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
G + G LG+L I WR ++G+ G L T ++
Sbjct: 302 ------EGLQKDMSRDGRTTLGQLSIEWRGSMGDRGFLTTGNLM 339
>gi|380094878|emb|CCC07380.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 425
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 99/415 (23%), Positives = 159/415 (38%), Gaps = 104/415 (25%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLR--VDPTDLFIGEDIFDDPIAAS---------NLPPLIS 56
HS++ +V+RL RPSL + PL+ V P L P+A +LPPL +
Sbjct: 15 HSVSLKVLRLSRPSLVPQFPLQPPVIPQSL-------TSPVAGPAPAVLLQPRHLPPLPA 67
Query: 57 S-------------DVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETF 103
S + + S R+R +++ I LS ++ LP +FG+ Y+GETF
Sbjct: 68 SLAYSPLSPIKKYEEGSQGAESGGGERTRDGYYNTEPFI-LSPIVNLPPSFGSAYVGETF 126
Query: 104 -CSYI----------SINNSSTLEVRDVVIKAEIQT---DKQRILLLDTS---------- 139
C+ S+ N +RDV I+AE+QT +++L+DT+
Sbjct: 127 SCTLCANHNAPPIGESVTNGVKKTIRDVKIEAEMQTPSGQSTKLVLVDTAGDDNAGSSNM 186
Query: 140 -KSPVESIRAGGRYDF---------------------------IVEHDVKELGAHTLVCT 171
V AG + I+ +KE G H L T
Sbjct: 187 DNDNVAISNAGNEDNNNTTETTPTETETVATLDLLPSYTTLQKILNFGLKEEGTHVLGVT 246
Query: 172 ALY---SDGEGERKYLPQFFKFIVSNPLSVRTKVRVVKV--GATHFQEITFLEACIENHT 226
Y ++ G + + ++FI L VRTK + G T + LEA +EN +
Sbjct: 247 VSYYEATETSGRTRAFRKMYQFICKPSLIVRTKAGPLPSLPGKTKRRRW-VLEAQLENCS 305
Query: 227 KSNLYMDQVEFEPSQ--------NWSATMLKADGPHSDY--NAQSREIFKPPVLIRSGGG 276
+ + +++V+ Q NW+ G + + PP G
Sbjct: 306 EDAILLEKVKLAEVQRGLKWRDCNWAGIGATTTGEEGNRISQQGQGQGQGPPRRPFLHPG 365
Query: 277 IHNYLYQLKMLSHGSSSPVKVQ---GSNVLGKLQITWRTNLGEPGRLQTQQILGT 328
L + +G +V+ G G + + WRT +G G L T + LGT
Sbjct: 366 ESEQLCFIIEEKNGEEDAAEVEEKDGRIEFGVMALAWRTEMGNRGSLLTLK-LGT 419
>gi|405117419|gb|AFR92194.1| hypothetical protein CNAG_00056 [Cryptococcus neoformans var.
grubii H99]
Length = 674
Score = 52.0 bits (123), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 79/172 (45%), Gaps = 24/172 (13%)
Query: 91 PQAFGAIYLGETFCSYISINN--SSTLEVRDVVIKAEIQTDKQRILL--------LDTS- 139
P FG+I LG +S+ N V V + E+Q+ R L DTS
Sbjct: 53 PSPFGSIPLGSKLDLRVSLENVHRQRYGVHGVRMMVEVQSASGRARLGEAIHGQISDTSS 112
Query: 140 --------KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFI 191
+S + ++ G + VE ++K+LG ++ + + +G RK +FFKF
Sbjct: 113 EQPLQEGQESQLPELKFGEMVELGVESEMKDLGLGVVIVSVAWETLDG-RKTFQRFFKFN 171
Query: 192 VSNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEP 239
+ PL ++T+V++ + F +E T+LE ++N + ++ + + EP
Sbjct: 172 IITPLGIKTRVQIPSHPNSTFSLSLRERTYLEVFMQNTSLESMLISGISLEP 223
>gi|321250597|ref|XP_003191861.1| hypothetical protein CGB_B0480W [Cryptococcus gattii WM276]
gi|317458329|gb|ADV20074.1| Hypothetical Protein CGB_B0480W [Cryptococcus gattii WM276]
Length = 671
Score = 52.0 bits (123), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 78/172 (45%), Gaps = 24/172 (13%)
Query: 91 PQAFGAIYLGETFCSYISINNSSTLE--VRDVVIKAEIQTDKQRILL--------LDTSK 140
P FG+I LG I + N + V + E+Q+ R+ L DT+
Sbjct: 53 PPPFGSIPLGSKLDFRIGLENVHRQRHGMHGVRMMVEVQSGSGRVRLGEAIHGQMSDTTG 112
Query: 141 SP---------VESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFI 191
P + ++ G + VE ++K+LG ++ + + +G RK L +FFKF
Sbjct: 113 EPPLQGGQESQLPELKFGEMVELEVESEMKDLGLGVVIVSVAWETLDG-RKTLQRFFKFN 171
Query: 192 VSNPLSVRTKVRVV----KVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
+ PL ++T+V++ + +E T+LE ++N + ++ + + EP
Sbjct: 172 IITPLGIKTRVQIPSHPNSTLSLSLREQTYLEVFMQNASLESMLISGISLEP 223
>gi|67528320|ref|XP_661962.1| hypothetical protein AN4358.2 [Aspergillus nidulans FGSC A4]
gi|40741329|gb|EAA60519.1| hypothetical protein AN4358.2 [Aspergillus nidulans FGSC A4]
gi|259482832|tpe|CBF77688.1| TPA: DUF974 domain protein (AFU_orthologue; AFUA_4G06560)
[Aspergillus nidulans FGSC A4]
Length = 267
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 96/246 (39%), Gaps = 38/246 (15%)
Query: 110 NNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPV---ESIRAGGRYDFIVEHDVKELGAH 166
++ +T + V I AE+QT Q + LD S + ++ G IV D+KE G H
Sbjct: 26 SDDTTRVITSVRIVAEMQTPSQ-VSSLDLEPSDTNANDGLQKGQSLQKIVRFDLKEEGNH 84
Query: 167 TLVCTALYSD---------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK----------- 206
L + Y++ G + + ++F+ LSVRTK +
Sbjct: 85 ILAVSVSYTETMIGNDFQAASGRVRTFRKLYQFVAQPCLSVRTKSSELAPLEVDNKSLGP 144
Query: 207 VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNA------Q 260
G T LEA +EN + + Q P + A L D D
Sbjct: 145 YGKTRLLRFA-LEAQLENVGDGAVVIKQTCLNPKAPFKAISLNWDLERPDQAETPPPILN 203
Query: 261 SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRL 320
R++ + L+ G L L+ ++ G VLG+L I WR+++G+ G L
Sbjct: 204 PRDVLQVAFLVEQEEGQQEGLEALQ-------KDLRRDGRAVLGQLSIEWRSSMGDKGFL 256
Query: 321 QTQQIL 326
T +L
Sbjct: 257 TTGNLL 262
>gi|392572585|gb|EIW65730.1| hypothetical protein TREMEDRAFT_74899 [Tremella mesenterica DSM
1558]
Length = 753
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/177 (26%), Positives = 82/177 (46%), Gaps = 27/177 (15%)
Query: 95 GAIYLGETFCSYISINNSSTL--EVRDVVIKAEIQTDKQRILL---LDTSKSPV------ 143
G + LG + + NS +V V + EIQ+ + L + + SPV
Sbjct: 60 GVVSLGSPLSLGLQLRNSHVQKHDVLGVRMMVEIQSPSIKTRLGEVIHRTSSPVDKSDLE 119
Query: 144 ---ESIRAGG----RYDFIVEHD----VKELGAHTLVCTALYSDGEGERKYLPQFFKFIV 192
ES + G +YD V D +KELG H ++C+ + +G RK +F++F V
Sbjct: 120 NVTESEESTGFSVLKYDEAVNLDSVCEMKELGNHMIICSVAWETLDG-RKTFQRFYRFTV 178
Query: 193 SNPLSVRTKVRVVKVGATHF----QEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
+ PL+++T+V+ + +E +LE ++N +K + D+V E Q +A
Sbjct: 179 NPPLAMKTRVKPPQSSNLLLNPLRREDVYLEILMQNVSKEGILFDKVLLEAVQGLTA 235
>gi|66360596|ref|XP_627257.1| DM-LD37668p [Cryptosporidium parvum Iowa II]
gi|46228846|gb|EAK89716.1| predicted DM-LD37668p [Cryptosporidium parvum Iowa II]
Length = 308
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 73/312 (23%), Positives = 136/312 (43%), Gaps = 21/312 (6%)
Query: 126 IQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLP 185
+ T K+ IL ++ I G D +V+ V E+G ++L C ++ E R
Sbjct: 4 VGTKKRHILY--NNEDNYSDIDIGDSLDIVVKERVDEVGLYSLTCQLFFTSNEA-RLTQK 60
Query: 186 QFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSA 245
+ +KF V +P ++ ++ + T ++ F+E +EN + ++ + ++ EP
Sbjct: 61 KSYKFAVLSPFNISHRLYNLD-EDTMDKKTIFVEVSLENVSHQSITLSSMKLEPINIKKL 119
Query: 246 TMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + D N +++ P+ I+ +N +++ S ++ K + K
Sbjct: 120 PELIFE--LEDVNLKNKH--NEPLYIQPRCK-YNKIFKFTSCSREYNNLGKSSREVLELK 174
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELN--------VVEVPSVVGIDKPFLLKLK 357
L+I W + G L + +I G I + +LN E+PSV + F + L
Sbjct: 175 LRIGWVSVSYGDGWLDSYKI-GLPILCDQNKLNKEKNAIILKAELPSVNNRQEEFKVFLY 233
Query: 358 LTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQ 417
+TN +Q I L D D+ ++I G + L ++A + L+ A GV
Sbjct: 234 VTNNLSIDQKGMSIRL---DFDQLLPIIILGNDRLYLEELKAGETVTLELDCQALVSGVY 290
Query: 418 RITGITVFDKLE 429
+ GI VFD+LE
Sbjct: 291 NLNGIYVFDELE 302
>gi|225683676|gb|EEH21960.1| UDP-glucoronosyl and UDP-glucosyl transferase family protein
[Paracoccidioides brasiliensis Pb03]
Length = 945
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 93/213 (43%), Gaps = 57/213 (26%)
Query: 16 RLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLTYRSRFLL 75
RL RPSL + PL P++ E+I P+ AS P SSD ++F+L
Sbjct: 38 RLSRPSLSFQYPL---PSE---NENI---PVKASLSFPSDSSD------------NQFIL 76
Query: 76 HDSADSIGLSGLLVLPQAFGAIYLGETFCSYISIN-----NSSTLEVRDVVIKAEIQTDK 130
+ + LP AFG+ Y+GETF + N +S V V I AE+QT
Sbjct: 77 SPN---------VTLPPAFGSAYVGETFSCSLCANSELLPDSDNRVVSSVRIIAEMQTPS 127
Query: 131 QRILLLDTSKSPVESIRAG----GRYDFIVEHDVKELGAHTLVCTALYSDG-EGERKYLP 185
Q + +L+ S S +S G IV D+KE G H L + Y++ + + +P
Sbjct: 128 QNV-VLELSPSGEDSHSGGLTKSQSLQKIVRFDLKEEGNHVLAVSVSYTETIMAQAREMP 186
Query: 186 ----------------QFFKFIVSNPLSVRTKV 202
+ ++FI L+VRTKV
Sbjct: 187 SSGDTQAASWRVRTFRKLYQFIAQPCLNVRTKV 219
>gi|58258123|ref|XP_566474.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|134106063|ref|XP_778042.1| hypothetical protein CNBA0450 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260745|gb|EAL23395.1| hypothetical protein CNBA0450 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57222611|gb|AAW40655.1| expressed protein [Cryptococcus neoformans var. neoformans JEC21]
Length = 674
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 78/172 (45%), Gaps = 24/172 (13%)
Query: 91 PQAFGAIYLGETFCSYISINN--SSTLEVRDVVIKAEIQTDKQRILL--------LDTS- 139
P FG+I LG + + N V V + E+Q+ R+ L DTS
Sbjct: 53 PPPFGSIPLGSKLDLRVGLENVHRQRYGVHGVRMMVEVQSASGRVRLGEAIHGQISDTSS 112
Query: 140 --------KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFI 191
+S + ++ G + VE ++K+LG ++ + + +G RK +FFKF
Sbjct: 113 EQPLQEGQESQLPELKFGEMVELGVESEMKDLGLGVVIVSVAWETLDG-RKTFQRFFKFN 171
Query: 192 VSNPLSVRTKVRVV----KVGATHFQEITFLEACIENHTKSNLYMDQVEFEP 239
+ PL ++T+V++ + +E T+LE ++N + ++ + + EP
Sbjct: 172 IITPLGIKTRVQIPSHPNSTLSLSLREQTYLEVFMQNTSLESMLISGISLEP 223
>gi|261331369|emb|CBH14363.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 541
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 58/132 (43%), Gaps = 2/132 (1%)
Query: 53 PLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS 112
PL VT +S D R G+S +L LP G ++G+ F + +S +N+
Sbjct: 72 PLHHPLVTVKQSGDPLVSQRRSEAARLAMQGVSSVLSLPSVVGKHFVGQPFRAILSFHNA 131
Query: 113 STLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTA 172
+ + VI+ I T R + L + P +I A G F VEH + G +TL A
Sbjct: 132 AAYPLTTAVIRINIVTPSVRHVTLVNHECP--AIEARGNVSFTVEHLLSSPGQYTLSVVA 189
Query: 173 LYSDGEGERKYL 184
D E+K L
Sbjct: 190 TCVDVVKEQKRL 201
>gi|71745036|ref|XP_827148.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70831313|gb|EAN76818.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 541
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/132 (29%), Positives = 58/132 (43%), Gaps = 2/132 (1%)
Query: 53 PLISSDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNS 112
PL VT +S D R G+S +L LP G ++G+ F + +S +N+
Sbjct: 72 PLHHPLVTVKQSGDPLVSQRRSEAARLAMQGVSSVLSLPSVVGKHFVGQPFRAILSFHNA 131
Query: 113 STLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTA 172
+ + VI+ I T R + L + P +I A G F VEH + G +TL A
Sbjct: 132 AAYPLTTAVIRINIVTPSVRHVTLVNHECP--AIEARGNVSFTVEHLLSSPGQYTLSVVA 189
Query: 173 LYSDGEGERKYL 184
D E+K L
Sbjct: 190 TCVDVVKEQKRL 201
>gi|350632010|gb|EHA20378.1| hypothetical protein ASPNIDRAFT_44305 [Aspergillus niger ATCC 1015]
Length = 258
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 61/240 (25%), Positives = 94/240 (39%), Gaps = 39/240 (16%)
Query: 117 VRDVVIKAEIQTDKQRILLLD----TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTA 172
V V I AE+QT Q + LD + + ++ G IV D+KE G H L +
Sbjct: 23 VTSVRIVAEMQTPSQ-VAALDLEPAEDTASKDGVQKGHSLQKIVRFDLKEEGNHILAVSV 81
Query: 173 LYSD---------GEGERKYLPQFFKFIVSNPLSVRTKVRVVK-----------VGATHF 212
Y++ G + + ++F+ LSVRTK + G T
Sbjct: 82 SYTETLIGSDAQAASGRVRTFRKLYQFVAQPCLSVRTKSSELAPLEVENKTLGPYGKTRL 141
Query: 213 QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKAD--GP-HSDYNAQS---REIFK 266
LEA +EN + + Q P + A L D GP +D + R++ +
Sbjct: 142 LRFA-LEAQLENVGDGPVVVKQTRLNPKPPFKAVSLNWDLQGPDQADPRPPTLHPRDVLQ 200
Query: 267 PPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
L+ G L L+ +K G VLG+L I WR +G+ G L T ++
Sbjct: 201 VAFLVEQEEGQQEGLETLQ-------KDMKRDGRAVLGQLSIEWRGAMGDKGFLTTGNLM 253
>gi|70994786|ref|XP_752170.1| DUF974 domain protein [Aspergillus fumigatus Af293]
gi|66849804|gb|EAL90132.1| DUF974 domain protein [Aspergillus fumigatus Af293]
gi|159124916|gb|EDP50033.1| DUF974 domain protein [Aspergillus fumigatus A1163]
Length = 227
Score = 48.9 bits (115), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 52/212 (24%), Positives = 81/212 (38%), Gaps = 38/212 (17%)
Query: 143 VESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD---------GEGERKYLPQFFKFIVS 193
E ++ G IV D+KE G H L + Y++ G + + ++F+
Sbjct: 21 TEGLQRGQSLQKIVRFDLKEEGNHILAVSISYTETLIGSDAQAASGRVRTFRKLYQFVAQ 80
Query: 194 NPLSVRTKVRVVK-----------VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQN 242
LSVRTK + G T LEA +EN + + Q + P
Sbjct: 81 PCLSVRTKSSELAPLEVENKSLGPYGKTRLLRFA-LEAQLENVGDGTVVVKQTKLNPKPP 139
Query: 243 WSATML--------KADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSP 294
+ A L KAD N R++ + L+ G L L+
Sbjct: 140 FKALSLNWDLERPDKADSQPPTLNP--RDVLQVAFLVEQEEGQQEGLEALQ-------KD 190
Query: 295 VKVQGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
++ G VLG+L I WR+ +G+ G L T +L
Sbjct: 191 LRRDGRAVLGQLSIEWRSAMGDKGFLTTGNLL 222
>gi|403372611|gb|EJY86205.1| DUF974 domain containing protein [Oxytricha trifallax]
Length = 482
Score = 48.5 bits (114), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 59/253 (23%), Positives = 114/253 (45%), Gaps = 22/253 (8%)
Query: 188 FKFIVSNPLSVRT-----KVRVVKVGATHF--QEITFLEACIENHTKSNLYMDQVEFEPS 240
+KF + P VR V++ ++ HF Q L+ I+N + + +++D+V F
Sbjct: 220 YKFEANLPFEVRKSISLKNVKLQQLFTKHFCIQNEFILQIKIKNLSVNKIFLDKVIFHCI 279
Query: 241 QNWSATMLKADGPHSD---YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGS-SSPVK 296
+L + H+ ++ +F V+ + G I YL+ ++ H + +
Sbjct: 280 NANQMKVLDIN-THTQSLGFDESQVSVFGESVVF-NPGEIRQYLF---IIQHKDPAYKIN 334
Query: 297 VQGSNVLGKLQITWRTNLGEPGRLQTQQILGTT--ITSKEIELNVVEVPSVVGIDKPFLL 354
+ LG+L++ W LG+PG L+ T EI+L+VV ++ +++P +
Sbjct: 335 KFEMHQLGQLELRWVNYLGDPGLLKIGPFKSNVEQKTKFEIDLDVVSQDQILKLEQPKSI 394
Query: 355 KLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKL 414
+L N ++ +I LS + E ++I G+ L +E S DF L+L
Sbjct: 395 MFRLYNLSN---SVMKIQLSVKEK-EVGDLLICGISKYNLGRLEPQASVDFSLDLFPKSC 450
Query: 415 GVQRITGITVFDK 427
GV + G+ + D+
Sbjct: 451 GVHPVCGLLIKDQ 463
>gi|340056165|emb|CCC50494.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 544
Score = 47.8 bits (112), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 43/182 (23%), Positives = 73/182 (40%), Gaps = 14/182 (7%)
Query: 10 LAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDLT 68
L+ RV L +P L P V+ D+ D+ +P+ L S + K D
Sbjct: 31 LSVRVAVLRKPELAQALAPELVEEGDILF--DVLANPVYHPTTKALESDEPHVVKGWDC- 87
Query: 69 YRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQT 128
R +H G+ L LP + G Y+G+ F ++++ +N ++ + + +
Sbjct: 88 --GRLKMH------GIGSALSLPSSIGKHYVGQMFRAFLNFSNHASYPLNSLAFYVSMAD 139
Query: 129 DKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFF 188
++R+ L I G F VEH + G +TL Y+D E+K L
Sbjct: 140 PEERVTQLINHN--CAQIEGAGNVSFTVEHKLLRPGKYTLKVVVAYTDIAREQKRLKWLS 197
Query: 189 KF 190
F
Sbjct: 198 SF 199
>gi|401407578|ref|XP_003883238.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325117654|emb|CBZ53206.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 320
Score = 47.8 bits (112), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 48/230 (20%), Positives = 93/230 (40%), Gaps = 21/230 (9%)
Query: 212 FQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLI 271
Q F+E ++N ++ +Y+ + L + P N + FKP
Sbjct: 96 IQGRAFVECSLDNVSQQPVYLSDASIFCVEGIEGVRLDSGPPCDSMNHKGLHYFKP---- 151
Query: 272 RSGGGIHNYLYQLKMLSHGSSSPVKVQGS-----NVLGKLQITWRTNLGEPGRLQTQQIL 326
+N ++ L +++ + V S VLG+L + WRT+ G G + +
Sbjct: 152 ---QDRYNLVFSLT----PTATRLGVDASFIRRLPVLGQLALEWRTSTGGAGCMHDYTLT 204
Query: 327 GTTI-TSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVM 385
+ ++K + L VV P+ V ++ PF ++++++ ++ P I SD + V
Sbjct: 205 NSLAGSAKPLSLRVVSCPASVQVESPFQVEIEVSAHIEQVFCPVLIL---RPSDLQPFV- 260
Query: 386 INGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEKITYDS 435
I G L ++ + L + G + GI V+D T D+
Sbjct: 261 IQGSTTRPLGIIDMLTPRRYTLEAVCLSPGFHSVKGIMVYDPDTHQTADA 310
Score = 45.1 bits (105), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 72/153 (47%), Gaps = 37/153 (24%)
Query: 10 LAFRVMRLCRPSLHVEP-PL-RVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
L +VMRL +PS++ EP PL R+D + S D + K +
Sbjct: 9 LTLKVMRLSQPSINAEPWPLLRIDE---------------------VTSEDQSIEKKVE- 46
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKA--- 124
R++ + + DS + L+LP G I+ GETF +YI+I+NSS + +V+I+
Sbjct: 47 --RAKDCVERALDS---THALLLPATQGRIFSGETFSAYINISNSSNAQAVNVIIQGRAF 101
Query: 125 -EIQTD---KQRILLLDTSKSPVESIRAGGRYD 153
E D +Q + L D S VE I G R D
Sbjct: 102 VECSLDNVSQQPVYLSDASIFCVEGIE-GVRLD 133
>gi|342183401|emb|CCC92881.1| conserved hypothetical protein [Trypanosoma congolense IL3000]
Length = 543
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 48/102 (47%), Gaps = 2/102 (1%)
Query: 83 GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSP 142
G+ LVLP A G ++G+ F + +S +N+++ + VV + I T + + L +
Sbjct: 101 GVGSALVLPSAVGKHFVGQPFRAILSFHNAASYPLTAVVFRINIVTPSVKHVALVNQEG- 159
Query: 143 VESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYL 184
+I G F VEH + G +TL Y D E K L
Sbjct: 160 -RTINGKGNTSFTVEHILSSPGQYTLSAVVTYIDVTKESKRL 200
>gi|403171573|ref|XP_003330778.2| hypothetical protein PGTG_12315 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375169240|gb|EFP86359.2| hypothetical protein PGTG_12315 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 405
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 39/171 (22%), Positives = 68/171 (39%), Gaps = 44/171 (25%)
Query: 88 LVLPQAFGAIYLGETFCSYISIN----NSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPV 143
L LP +FG IY GE F +S+ S+ + + + E+Q+ + KS +
Sbjct: 37 LSLPNSFGTIYQGEAFNGLLSLRPEQPRSNLIAALNPKLIVELQSSQ------SLHKSLI 90
Query: 144 ESIRAGG--------RYDFIVEHDVKELGAHTLVCTALYS-------------------- 175
SI A + ++ H + +LG H+L+CT Y
Sbjct: 91 GSIHAHQLGPASEHEALELLINHQITQLGLHSLICTVTYQEPPPTEPTEEEEDQELTPAE 150
Query: 176 ------DGEGERKYLPQFFKFIVSNPLSVRTKVRVVKVGATHFQEITFLEA 220
+ E + + + +KF V NPL ++TK ++ +E LE+
Sbjct: 151 SHQITPESEPQTRSFRKLYKFQVLNPLGIKTKTYRSPSSSSVLEETRVLES 201
>gi|403339766|gb|EJY69144.1| DUF974 domain containing protein [Oxytricha trifallax]
Length = 429
Score = 45.4 bits (106), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 41/156 (26%), Positives = 74/156 (47%), Gaps = 10/156 (6%)
Query: 275 GGIHNYLYQLKMLSHGSSS-PVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTT--IT 331
G I YL+ ++ H S+ + + LG+L++ W LG+PG L+ T
Sbjct: 262 GEIRQYLF---IIQHKDSAYKINKFEMHQLGQLELRWVNYLGDPGLLKIGPFKSNVEQKT 318
Query: 332 SKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRI 391
EI+L+VV ++ +++P + +L N ++ +I LS + E ++I G+
Sbjct: 319 KFEIDLDVVSQDQILKLEQPKSIMFRLYNLSN---SVMKIQLSVKEK-EVGDLLICGISK 374
Query: 392 MALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 427
L +E S DF L+L GV + G+ + D+
Sbjct: 375 YNLGRLEPQASVDFSLDLFPKSCGVHPVCGLLIKDQ 410
>gi|302419145|ref|XP_003007403.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261353054|gb|EEY15482.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 335
Score = 45.4 bits (106), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 61/270 (22%), Positives = 103/270 (38%), Gaps = 53/270 (19%)
Query: 95 GAIYLGETFCSYISINNSS-----------TLEVRDVVIKAEIQTDK-----QRILLLD- 137
G+ Y+GE F + N+ T +RDV I AE++T Q++ L
Sbjct: 73 GSAYVGEHFSCTLCANHEPPVSTDVAAALPTKRIRDVRIDAEMKTPGAQGSVQKLQLTGR 132
Query: 138 ---------------TSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SDGEG 179
T+ + + G IV D+K+ G H L T Y ++ G
Sbjct: 133 ASDSSSSSSSDAAATTTATATADLAPGETLQRIVGFDLKDEGNHVLAVTVSYYEATETSG 192
Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVV---KVGA-THFQEITFLEACIENHTKSNLYMDQV 235
+ + ++FI + L VRTKV + GA + LEA +EN + + +++V
Sbjct: 193 RTRTFRKLYQFICKSSLIVRTKVGSLPGTPGGADGRVRRKWVLEAQLENCAEDVVQLERV 252
Query: 236 EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPV 295
E N + D ++ + + P G + + ++ + G
Sbjct: 253 EL----NLEGGLAYTD---CNWGPAGKPVLHP-------GEVEQVCFVVEETAEGGGLEP 298
Query: 296 KVQGSNVLGKLQITWRTNLGEPGRLQTQQI 325
G V G L I WR +G G L T ++
Sbjct: 299 GDDGRIVFGVLGIGWRGEMGNRGYLSTGKL 328
>gi|115398331|ref|XP_001214757.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114192948|gb|EAU34648.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 227
Score = 44.7 bits (104), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 49/209 (23%), Positives = 76/209 (36%), Gaps = 34/209 (16%)
Query: 144 ESIRAGGRYDFIVEHDVKELGAHTLVCTALYSD---------GEGERKYLPQFFKFIVSN 194
+ ++ G IV D+KE G H L + Y++ G + + ++F+
Sbjct: 22 DGLQKGQSLQKIVRFDLKEEGNHILAVSVSYTETLIGLDAQAASGRVRTFRKLYQFVAQP 81
Query: 195 PLSVRTKVRVVK-----------VGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNW 243
LSVRTK + G T LEA +EN + + Q P +
Sbjct: 82 CLSVRTKSSELTPLEVENKSLGPYGKTRLLRFA-LEAQLENVGDGAVVVQQTRLNPKPPF 140
Query: 244 SATMLKAD------GPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKV 297
A L D R++ + L+ G L L+ +K
Sbjct: 141 KAISLNWDLEAPDGPDPPPPTLNPRDVLQVAFLVEQEEGQQEGLEALQ-------KDMKR 193
Query: 298 QGSNVLGKLQITWRTNLGEPGRLQTQQIL 326
G VLG+L I WR +G+ G L T +L
Sbjct: 194 DGRAVLGQLSIEWRGPMGDKGYLTTGNLL 222
>gi|357448105|ref|XP_003594328.1| hypothetical protein MTR_2g027310 [Medicago truncatula]
gi|355483376|gb|AES64579.1| hypothetical protein MTR_2g027310 [Medicago truncatula]
Length = 55
Score = 44.7 bits (104), Expect = 0.096, Method: Compositional matrix adjust.
Identities = 21/34 (61%), Positives = 26/34 (76%)
Query: 408 NLIATKLGVQRITGITVFDKLEKITYDSLPDLEI 441
NLIATK G+Q+ITGITVF +Y+ LPDLE+
Sbjct: 3 NLIATKPGIQKITGITVFATRGMKSYEPLPDLEV 36
>gi|156094286|ref|XP_001613180.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148802054|gb|EDL43453.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 381
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 42/164 (25%), Positives = 77/164 (46%), Gaps = 12/164 (7%)
Query: 77 DSADSIGLSG--LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRIL 134
+S + + LS L LP IYLG+ S I+I+N+ E++ I ++ T +Q
Sbjct: 42 ESKEDLSLSNEFSLSLPTNSRKIYLGQNLKSQINISNNLKNEIQISSISVDVMT-RQTTF 100
Query: 135 LLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSN 194
+ S V ++++ ++F+ V T+ C Y G E+K L + F FI N
Sbjct: 101 NIYRSVEHV-TVQSNCFFNFLTSFLVTFADMFTVHCAVEYLQG-SEKKKLRKDFNFICKN 158
Query: 195 PLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE 238
P V+T + ++ ++EA + N + N+ ++ V F+
Sbjct: 159 PFHVKTLI-------LQKEDKIYIEAVVRNIEEDNIMLNGVTFK 195
>gi|124505961|ref|XP_001351578.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
gi|23504505|emb|CAD51385.1| conserved Plasmodium protein, unknown function [Plasmodium
falciparum 3D7]
Length = 381
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 74/378 (19%), Positives = 153/378 (40%), Gaps = 49/378 (12%)
Query: 77 DSADSIGLSG--LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRIL 134
D D+I LS L LP +Y+G+ F S I+I+++ ++ +I +I T
Sbjct: 41 DINDNISLSNEISLSLPINSRKVYIGQNFKSQINISSNLKNNIQVNLINVDIWTRDNNFN 100
Query: 135 LLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSN 194
+ +S +I + F+ V T+ CTA Y G E+K L + F FI +
Sbjct: 101 IYKNEESV--NISPNTFFSFVTCFPVYFFDVFTIRCTAEYKIG-SEKKKLKKDFNFISRD 157
Query: 195 PLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPH 254
P ++R H + +++ ++N + N+ ++ + + + ++K +G +
Sbjct: 158 PFNIR-------YSLVHKNDKLYMQIIMKNTEEDNIMLNDIILKDIK---CELIKNEGCN 207
Query: 255 SDYNAQSREIFKPPVLIRSGGGIHNYL----YQLKMLSHGSSSPVKVQGSNV----LGKL 306
+N GIH + Y + S + + + + +
Sbjct: 208 KVHN-----------------GIHYFKQHDEYSMIFCIDDEKSKRYILNNTLDNDNITNM 250
Query: 307 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSV-VGIDKPFLLKLKLTNQTDKE 365
+I + TN G G + L ++ ++ + E ++ I+K + ++ N TD E
Sbjct: 251 EIIYFTNNGGKG-IHNLHYLKKNTSTDNFKIYLKENNNIYYTINKIYNFEIIFENNTD-E 308
Query: 366 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 425
EI++ N + + ++N + + S F+ I G+ IT++
Sbjct: 309 DMFLEIFVHNNSN----IHIVNNFVKEHIIKSKTKKSHFFYTLFINQ--GIHFFNNITIY 362
Query: 426 DKLEKITYDSLPDLEIFV 443
+K K T + + ++FV
Sbjct: 363 NKKNKTTKEYIKLFKLFV 380
>gi|353248314|emb|CCA77337.1| hypothetical protein PIIN_11314 [Piriformospora indica DSM 11827]
Length = 147
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 32/117 (27%), Positives = 58/117 (49%), Gaps = 9/117 (7%)
Query: 213 QEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIR 272
+E FL+ ++N T+ +++ +++EF+P W+ T D ++ + ++R+ F P +
Sbjct: 6 REKLFLQIDVQNLTQESMWFERLEFKPVDGWTFT----DA--NESSIEARQAFTGPKTLV 59
Query: 273 SGGGIHNYLYQLKMLSHGSSSPVKVQGSNV--LGKLQITWRTNLGEPGRLQTQQILG 327
Y+Y L + + +K V LG+L + RT GEPGRL T G
Sbjct: 60 QPQDTFQYIYTL-IPAVVPRFLIKTAPGVVIPLGRLDLACRTTFGEPGRLLTSCYPG 115
>gi|407405130|gb|EKF30284.1| hypothetical protein MOQ_005907 [Trypanosoma cruzi marinkellei]
Length = 549
Score = 43.1 bits (100), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 35/134 (26%), Positives = 61/134 (45%), Gaps = 5/134 (3%)
Query: 83 GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK-AEIQTDKQRILLLDTSKS 141
G+ +L LP + G ++G+ F +++S +N++T + +V A + R +++ S
Sbjct: 98 GIGSVLSLPTSLGKFFVGQFFRAFLSFHNTATYPLASMVFSIACLHPSLHRSRIVNYECS 157
Query: 142 PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNP-LSVRT 200
+E G F VE +KE G +TL Y D E K L F V + V
Sbjct: 158 HLE---GKGNASFTVEFLLKEAGQYTLDVLVTYMDIAREAKRLTWSFSIQVERAIIEVSR 214
Query: 201 KVRVVKVGATHFQE 214
+ VV + H ++
Sbjct: 215 TLHVVPIITRHSKD 228
>gi|367035632|ref|XP_003667098.1| hypothetical protein MYCTH_2141069 [Myceliophthora thermophila ATCC
42464]
gi|347014371|gb|AEO61853.1| hypothetical protein MYCTH_2141069 [Myceliophthora thermophila ATCC
42464]
Length = 932
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 96/417 (23%), Positives = 148/417 (35%), Gaps = 124/417 (29%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
HS++ +V+RL RPSL + PL P+ PI AS L S
Sbjct: 538 HSVSLKVLRLSRPSLVAQYPLLPPPSSSPDDPLSHQPPIPAS----LAYSHHGAGGVIPP 593
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETF----CSYISINNSST-----LEVR 118
T + F+L S +L LP +FG+ Y+GETF C+ + T +R
Sbjct: 594 TNPAPFVL---------SPILNLPPSFGSAYVGETFSCTLCANYDVPEDGTGAGPKKSIR 644
Query: 119 DVVIKAEIQTDKQ----------------RILLLDTSKS--------------------- 141
DV I+AE++T ++ L S S
Sbjct: 645 DVRIEAEMKTPSSSSSSSSSAAAGAFPAIKLPLYPPSASHAGDEHGGSGGGGGGGGGGGG 704
Query: 142 -PVESIRAGGRYDFIVEHDVKELGAHTLVCTALY---SDGEGERKYLPQFFKFIVSNPLS 197
V+ G I+ D+KE G H L T Y S+ G + + ++F+ L
Sbjct: 705 GGVDLPSPGTSLQKILSFDLKEEGNHVLAVTVSYYEASELSGRTRTFRKLYQFVCKASLI 764
Query: 198 VRTKVR-VVKVGATHFQEIT-------------------------------------FLE 219
VRTK + VG Q LE
Sbjct: 765 VRTKASPLPAVGPGEEQGEGEEEEEEEEEEEEEEEEEGEKDEGEKGGRGRPRLRRRWVLE 824
Query: 220 ACIENHTKSNLYMDQV--------EFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLI 271
A +EN ++ + ++ V +E +W ADG ++ + + +P
Sbjct: 825 AQLENCSEEGILLESVGLELESGLRYEDCNDWQG---HADG--GAVGSRMKPVLQP---- 875
Query: 272 RSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGT 328
G + ++ G + +V+G V G LQI WR+ +G G L T + LGT
Sbjct: 876 ---GETEQVCFVIE--EEGDAVVQEVEGRVVFGVLQIGWRSEMGNRGFLSTGK-LGT 926
>gi|156059820|ref|XP_001595833.1| hypothetical protein SS1G_03923 [Sclerotinia sclerotiorum 1980]
gi|154701709|gb|EDO01448.1| hypothetical protein SS1G_03923 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 385
Score = 42.7 bits (99), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 61/156 (39%), Gaps = 39/156 (25%)
Query: 85 SGLLVLPQAFGAIYLGETFCSYISINN---------------------SSTLEVRDVVIK 123
S LL LP AFG+ Y+GETF + NN ++T + ++ +
Sbjct: 70 SPLLTLPPAFGSAYVGETFSCTLCANNELPPLSQLSQTHTSPDIVASPNTTKVISNITLS 129
Query: 124 AE--IQTDKQRILLLDTSKSPVESIRAGGR------------YDFIVEHDVKELGAHTLV 169
AE I + I L + SP + G ++ D+KE GAH L
Sbjct: 130 AEMKIPSTPNPISLPLSGPSPFPAASTTGEETPETQIISQASLQKVLHFDLKEEGAHVLA 189
Query: 170 CTALYSD----GEGERKYLPQFFKFIVSNPLSVRTK 201
T Y++ + + ++FI L VRTK
Sbjct: 190 VTVTYTESSPSSSPRTRTFRKLYQFICKGCLVVRTK 225
>gi|221057331|ref|XP_002259803.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
gi|193809875|emb|CAQ40579.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
knowlesi strain H]
Length = 382
Score = 42.4 bits (98), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 31/113 (27%), Positives = 53/113 (46%), Gaps = 2/113 (1%)
Query: 88 LVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVESIR 147
L LP IY+G+ I+I+N+ +++ I ++ T KQ + S V ++R
Sbjct: 55 LSLPINSRKIYIGQNLKCQINISNNLKNDIQICTISVDVMT-KQTTFNIYRSAEHVITVR 113
Query: 148 AGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNPLSVRT 200
+ ++F+ V T+ C Y G E+K L + F FI NP ++T
Sbjct: 114 SNSFFNFLATFLVTFADMFTVHCAVEYLQG-SEKKKLRKDFNFISKNPFHLKT 165
>gi|209881173|ref|XP_002142025.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209557631|gb|EEA07676.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 380
Score = 42.4 bits (98), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 60/307 (19%), Positives = 126/307 (41%), Gaps = 25/307 (8%)
Query: 135 LLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSN 194
+L +++ + I G + I++ V E+G L C +Y G + + +KF V
Sbjct: 66 ILYSNEDNLRDIEIGNSINTIIKERVDEVGLFNLTC-QIYFIVNGSKLTQKRSYKFAVIA 124
Query: 195 PLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFEP-------SQNWSATM 247
P ++ ++ ++ F+E +EN T ++ +++++ + QN +
Sbjct: 125 PFNISHRLFYHNDNLKK-SKLCFIEVSLENITHQSISLEKLDIQNWIDEKGNKQNIQVSQ 183
Query: 248 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 305
L + D N + S+ ++ V++ +N ++ + + S + + G+
Sbjct: 184 LSTTQFY-DENCKNTSQLLYNSGVIVLRPRSRYNQIFCISQSLYKES--INNIDKYITGQ 240
Query: 306 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVE----VPSVVGIDKPFLLKLKLTNQ 361
L I+W++ + + I LN V VPS + I F +++ + N
Sbjct: 241 LSISWKSKTYGDAFMNSYSITCQVSNEDIYNLNGVAIDVIVPSTIEIQTIFTIEVIIIND 300
Query: 362 TDKEQGPFEIWLSQNDSDEEKVV--MINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRI 419
TDK E+ + D E ++ I G+ I+ + +E L I+ GV I
Sbjct: 301 TDKRLHDIELSI-----DNEALLPFCILGMDILQIKFMEPNQKITIPLQCISFTSGVHPI 355
Query: 420 TGITVFD 426
GI + +
Sbjct: 356 NGIKLIN 362
>gi|389584327|dbj|GAB67060.1| hypothetical protein PCYB_104100 [Plasmodium cynomolgi strain B]
Length = 381
Score = 41.6 bits (96), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 41/166 (24%), Positives = 78/166 (46%), Gaps = 16/166 (9%)
Query: 77 DSADSIGLSG--LLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRIL 134
+S + + LS L LP IY+G+ S I+I+N+ E++ I ++ T R
Sbjct: 42 ESKEDLSLSNEFSLSLPINSRKIYIGQNLKSQINISNNLKNEIQICTISVDVMT---RHT 98
Query: 135 LLDTSKSPVE--SIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIV 192
+ +S VE ++++ ++F+ V T+ C Y G E+K L + F FI
Sbjct: 99 TFNIYRS-VEHVTVQSNSFFNFLTTFLVTFADMFTVHCAVEYLQG-NEKKKLRKDFNFIC 156
Query: 193 SNPLSVRTKVRVVKVGATHFQEITFLEACIENHTKSNLYMDQVEFE 238
NP ++T + ++ ++EA + N + N+ ++ V F+
Sbjct: 157 KNPFHLKTLI-------LQKEDKIYIEAVVRNIEEDNIMLNDVVFK 195
>gi|354482026|ref|XP_003503201.1| PREDICTED: peroxisomal proliferator-activated receptor A-interacting
complex 285 kDa protein-like [Cricetulus griseus]
gi|344254975|gb|EGW11079.1| Peroxisomal proliferator-activated receptor A-interacting complex 285
kDa protein [Cricetulus griseus]
Length = 2914
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 44/79 (55%), Gaps = 4/79 (5%)
Query: 193 SNPLSVRTKVRVVKVGATHFQEITFLEACIENHT--KSNLYMDQVE--FEPSQNWSATML 248
S ++V V + GA +F+ CIE+H+ +L ++Q+E Q+WS+ ML
Sbjct: 1153 SQLVAVGDAVALCSSGACRKLWKSFIRECIEHHSVFPEDLSLEQIEQGVAQRQHWSSLML 1212
Query: 249 KADGPHSDYNAQSREIFKP 267
+A GP + + A ++++ +P
Sbjct: 1213 RAGGPDAKHTAVAQDMQRP 1231
>gi|71419122|ref|XP_811074.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70875696|gb|EAN89223.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 571
Score = 40.4 bits (93), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 34/131 (25%), Positives = 58/131 (44%), Gaps = 5/131 (3%)
Query: 83 GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK-AEIQTDKQRILLLDTSKS 141
G+ +L LP + G ++G+ F +++S +N++T + +V + R +++ S
Sbjct: 120 GIGTVLSLPTSLGKFFVGQPFRAFLSFHNAATYPLATMVFSIVCLHPTLHRSKIVNYECS 179
Query: 142 PVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNP-LSVRT 200
+E G F VE +KE G +TL Y D E K L F V + V
Sbjct: 180 HLE---GKGNASFTVECLLKEPGQYTLDVLVTYMDIAKEAKRLTWSFSIQVERAIIEVSR 236
Query: 201 KVRVVKVGATH 211
+ VV + H
Sbjct: 237 TIHVVPIITRH 247
>gi|71422967|ref|XP_812298.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70877064|gb|EAN90447.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 549
Score = 40.0 bits (92), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 32/133 (24%), Positives = 58/133 (43%), Gaps = 9/133 (6%)
Query: 83 GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV---VIKAEIQTDKQRILLLDTS 139
G+ +L LP + G ++G+ F +++S +N++T + + ++ + +I+ + S
Sbjct: 98 GIGSVLSLPTSLGKFFVGQPFRAFLSFHNAATYPLATMAFSIVCLHPTLHRSKIVNYECS 157
Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIVSNP-LSV 198
+ G F VE +KE G +TL Y D E K L F V + V
Sbjct: 158 H-----LEGKGNASFTVECLLKEPGQYTLDVLVTYMDIAKEAKRLTWSFSIQVERAIIEV 212
Query: 199 RTKVRVVKVGATH 211
+ VV + H
Sbjct: 213 SRTIHVVPIITRH 225
>gi|353248956|emb|CCA77414.1| hypothetical protein PIIN_11391 [Piriformospora indica DSM 11827]
Length = 147
Score = 39.3 bits (90), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 48/96 (50%), Gaps = 11/96 (11%)
Query: 230 LYMDQVEFEPSQNWSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQL--KML 287
++ +++EF+P W+ T D ++ + ++R+ F P + Y+Y L ++
Sbjct: 1 MWFERLEFKPVDGWTFT----DA--NESSIEARQAFTGPKTLVQPQDTFQYIYTLIPAVI 54
Query: 288 SHGSSSPVKVQGSNV-LGKLQITWRTNLGEPGRLQT 322
P G+ + LG+L I WRT GEPGRL T
Sbjct: 55 PRFLIKPAP--GAVIPLGRLDIAWRTTFGEPGRLLT 88
>gi|254284359|ref|ZP_04959327.1| glyoxalase family protein [gamma proteobacterium NOR51-B]
gi|219680562|gb|EED36911.1| glyoxalase family protein [gamma proteobacterium NOR51-B]
Length = 454
Score = 38.9 bits (89), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 55/108 (50%), Gaps = 10/108 (9%)
Query: 320 LQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLK-LKLTNQTDKEQGPFEIWLSQNDS 378
L Q+ G ++ E + +EV +G+D+P+ +K LT+Q+D + WL ++
Sbjct: 305 LAWYQMFGYEVSGSLHETDSLEVAEAMGLDRPYRIKGAMLTHQSDGSEIKLVQWLEPYNA 364
Query: 379 DEEKVVMIN--GLRIMALAPVEAFGSTDFHLNLIATKL-GVQRITGIT 423
+ + +N G+ MALA STD ++ A K GV+ ++ IT
Sbjct: 365 EAPYPLPVNHLGIHRMALA------STDIESDVAALKAQGVEFVSPIT 406
>gi|119619024|gb|EAW98618.1| hCG1992287, isoform CRA_a [Homo sapiens]
gi|119619025|gb|EAW98619.1| hCG1992287, isoform CRA_a [Homo sapiens]
Length = 115
Score = 38.9 bits (89), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 36/67 (53%)
Query: 280 YLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNV 339
YL +++ S ++G +GKL I W+ NLGE LQT Q+LG + + + L++
Sbjct: 34 YLDHVQLKQKYSEEAGIIKGLREMGKLDIVWKRNLGEMAMLQTIQLLGESPGYENMRLSL 93
Query: 340 VEVPSVV 346
+P V
Sbjct: 94 EIIPDSV 100
>gi|407844145|gb|EKG01819.1| hypothetical protein TCSYLVIO_007171 [Trypanosoma cruzi]
Length = 549
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 27/113 (23%), Positives = 51/113 (45%), Gaps = 8/113 (7%)
Query: 83 GLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV---VIKAEIQTDKQRILLLDTS 139
G+ +L LP + G ++G+ F +++S +N++ + + ++ + + +I+ + S
Sbjct: 98 GIGSVLSLPTSLGKFFVGQPFRAFLSFHNAANYPLATMAFSIVCLHPKLHRSKIVNYECS 157
Query: 140 KSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQFFKFIV 192
+ G F VE +KE G +TL Y D E K L F V
Sbjct: 158 H-----LEGKGNASFTVEFLLKEPGQYTLDVLVTYMDIAKEAKRLTWSFSIQV 205
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.136 0.390
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,923,887,851
Number of Sequences: 23463169
Number of extensions: 287174940
Number of successful extensions: 601558
Number of sequences better than 100.0: 344
Number of HSP's better than 100.0 without gapping: 244
Number of HSP's successfully gapped in prelim test: 100
Number of HSP's that attempted gapping in prelim test: 600180
Number of HSP's gapped (non-prelim): 451
length of query: 446
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 300
effective length of database: 8,933,572,693
effective search space: 2680071807900
effective search space used: 2680071807900
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)