BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 013597
(439 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q6PBY7|CE044_DANRE UPF0533 protein C5orf44 homolog OS=Danio rerio GN=zgc:73187 PE=2
SV=2
Length = 412
Score = 207 bits (528), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 141/427 (33%), Positives = 222/427 (51%), Gaps = 49/427 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF+ L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNMPVTCEDRDLPGDLFLR---------------LMKDDPSTVK 54
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
A+++ L +L LPQ FG I+LGETF SYIS++N S+ V+D+++K
Sbjct: 55 G--------------AETLILGEMLTLPQNFGNIFLGETFSSYISVHNDSSQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S S V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNSAVSELKPECCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKLY 159
Query: 184 LPQFFKFIVSNPLSVRTKVRVVK-EITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLK 242
+FFKF V PL V+TK + + FLEA I+N T S ++M++V EPS ++ T L
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAETDEVFLEAQIQNITTSPMFMEKVSLEPSMMYNVTELN 219
Query: 243 --ADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQ 300
A G S + + + P+ R YLY LK + ++G V+GKL
Sbjct: 220 NVASGDESSESTFGKMSYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIGKLD 273
Query: 301 ITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQG 360
I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN +++
Sbjct: 274 IVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEFIPDTVDLEEPFDITCKITNCSERT-- 331
Query: 361 PFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDK 420
++ L ++ ++G ++ L+P S L L+++ G+Q I+G+ + D
Sbjct: 332 -MDLLLEMCNTRSVHWCGVSGRQLGKLSPS---ASLSIPLKLLSSVQGLQSISGLRLTDT 387
Query: 421 LEKITYD 427
K TY+
Sbjct: 388 FLKRTYE 394
>sp|Q5RCG0|CE044_PONAB UPF0533 protein C5orf44 homolog OS=Pongo abelii PE=2 SV=1
Length = 417
Score = 205 bits (522), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 139/433 (32%), Positives = 217/433 (50%), Gaps = 56/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTK-------VRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNW 236
+FFKF V PL V+TK + V + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219
Query: 237 SATMLKA--DGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 294
+ T L + S SR +P YLY LK + ++G
Sbjct: 220 NVTELNSVSQAGESVSTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272
Query: 295 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 354
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 355 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 414
+++ ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 386
Query: 415 ITVFDKLEKITYD 427
+ + D K TY+
Sbjct: 387 LRLTDTFLKRTYE 399
>sp|A5PLN9|CE044_HUMAN UPF0533 protein C5orf44 OS=Homo sapiens GN=C5orf44 PE=2 SV=2
Length = 417
Score = 204 bits (520), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 138/433 (31%), Positives = 219/433 (50%), Gaps = 56/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEVLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQAGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTK-------VRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNW 236
+FFKF V PL V+TK + V + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNMTTSPMFMEKVSLEPSIMY 219
Query: 237 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 294
+ T L + + + SR +P YLY LK + + ++G
Sbjct: 220 NVTELNSVSQAGECVSTFGSRAYLQP-------MDTRQYLYCLKPKNEFAEKAGIIKGVT 272
Query: 295 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 354
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 355 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 414
+++ ++ L +++ I+G ++ L P + L L+++ G+Q I+G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSISG 386
Query: 415 ITVFDKLEKITYD 427
+ + D K TY+
Sbjct: 387 LRLTDTFLKRTYE 399
>sp|Q3TIR1|CE044_MOUSE UPF0533 protein C5orf44 homolog OS=Mus musculus PE=2 SV=1
Length = 417
Score = 200 bits (509), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 137/429 (31%), Positives = 220/429 (51%), Gaps = 48/429 (11%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTK-------VRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 240
FKF V PL V+TK + V + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 223
Query: 241 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 298
L + + + SR +P YLY LK + ++G V+GK
Sbjct: 224 LNSVTQAGECISTFGSRGYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276
Query: 299 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 358
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN +++
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSERM 336
Query: 359 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 418
++ L +++ I+G ++ L P + L L+++ G+Q ++G+ +
Sbjct: 337 ---MDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 390
Query: 419 DKLEKITYD 427
D K TY+
Sbjct: 391 DTFLKRTYE 399
>sp|A7MB76|CE044_BOVIN UPF0533 protein C5orf44 homolog OS=Bos taurus PE=2 SV=1
Length = 417
Score = 200 bits (509), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 136/433 (31%), Positives = 216/433 (49%), Gaps = 56/433 (12%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVD----PTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
H LA +VMRL +P+L P+ + P DLF + + DDP
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDLPGDLF-NQLMRDDPSTV--------------- 53
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++K
Sbjct: 54 -------------NGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVK 100
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
A++QT QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y
Sbjct: 101 ADLQTSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQSGEKMY 159
Query: 184 LPQFFKFIVSNPLSVRTK-------VRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNW 236
+FFKF V PL V+TK + V + FLEA I+N T S ++M++V EPS +
Sbjct: 160 FRKFFKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMY 219
Query: 237 SATMLKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSN 294
+ L + + SR +P YLY LK + ++G
Sbjct: 220 NVAELNSVNQAGECVTTFGSRAYLQP-------MDTRQYLYCLKPKKEFAEKAGIIKGVT 272
Query: 295 VLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQ 354
V+GKL I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN
Sbjct: 273 VIGKLDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNC 332
Query: 355 TDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITG 414
+++ ++ L +++ I+G ++ L P + L L+++ G+Q ++G
Sbjct: 333 SER---TMDLVLEMCNTNSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSG 386
Query: 415 ITVFDKLEKITYD 427
+ + D K TY+
Sbjct: 387 LRLTDTFLKRTYE 399
>sp|Q5M887|CE044_RAT UPF0533 protein C5orf44 homolog OS=Rattus norvegicus PE=2 SV=2
Length = 418
Score = 199 bits (507), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 137/429 (31%), Positives = 218/429 (50%), Gaps = 47/429 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F+ L+ D +T
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFNQ---------LMKDDPST------ 52
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ A+ + L +L LPQ FG I+LGETF SYIS++N S V+D+++KA++Q
Sbjct: 53 --------VNGAEILMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDILVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASNAAVAELKPDCCIDDVIHHEVKEIGTHILVCAVSYTTQGGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTK-------VRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNWSATM 240
FKF V PL V+TK + V + FLEA I+N T S ++M++V EPS ++ T
Sbjct: 164 FKFQVLKPLDVKTKFYNAESDLSSVTDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVTE 223
Query: 241 LKADGPHSDYNAQ--SREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGK 298
L + + + SR +P YLY LK + ++G V+GK
Sbjct: 224 LNSVNQAGECVSTFGSRGYLQPM-------DTRQYLYCLKPKKEFAEKAGIIKGVTVIGK 276
Query: 299 LQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKE 358
L I W+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + +
Sbjct: 277 LDIVWKTNLGERGRLQTSQLQRMAPGYGDVRLSLEAIPDTVNLEEPFHITCKITNCSSER 336
Query: 359 QGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVF 418
++ L ++ I+G ++ L P + L L+++ G+Q ++G+ +
Sbjct: 337 T--MDLVLEMCNTTSIHWCGISGRQLGKLHPSSSLC---LALTLLSSVQGLQSVSGLRLT 391
Query: 419 DKLEKITYD 427
D K TY+
Sbjct: 392 DTFLKRTYE 400
>sp|Q0VFT9|CE044_XENTR UPF0533 protein C5orf44 homolog OS=Xenopus tropicalis PE=2 SV=1
Length = 412
Score = 198 bits (503), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 137/424 (32%), Positives = 220/424 (51%), Gaps = 43/424 (10%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F L+ D +T K
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEERDL--PGDLFS---------TLMKDDPSTVKG--- 55
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
A+++ L +L LPQ FG I+LGETF SYIS++N S V+D+ +KA++Q
Sbjct: 56 -----------AETLMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDIQVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASTAVVSELKPDSCIDDVIHHEVKEIGTHILVCAVSYTTQTGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVK-EITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGP 246
FKF V PL V+TK + + FLEA I+N T S ++M++V EPS ++ + L
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVSELNTVIT 223
Query: 247 HSD-YNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITWRT 305
+ D + + + P+ R YLY LK + ++G V+GKL I W+T
Sbjct: 224 NGDGCSTFGTKTYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIGKLDIVWKT 277
Query: 306 NLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIW 365
NLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + + ++
Sbjct: 278 NLGERGRLQTSQLQRMAPGYGDVRLSIETIPDTVRLEEPFDITCKITNCSSERT--MDLV 335
Query: 366 LSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLN--LIATKLGVQRITGITVFDKLEK 423
L +++ ++G ++ L P S+ HL L+++ G+Q ++G+ + D K
Sbjct: 336 LEMCNTNAIHWCGVSGRQLGKLHP-----SSSLHLTLALLSSVQGLQSVSGLRLTDTFLK 390
Query: 424 ITYD 427
TY+
Sbjct: 391 RTYE 394
>sp|Q6GPR5|CE044_XENLA UPF0533 protein C5orf44 homolog OS=Xenopus laevis PE=2 SV=2
Length = 414
Score = 192 bits (489), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 136/424 (32%), Positives = 218/424 (51%), Gaps = 41/424 (9%)
Query: 8 HSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSDL 67
H LA +VMRL +P+L P+ + DL D+F L+ D +T K +++
Sbjct: 10 HLLALKVMRLTKPTLFTNIPVTCEEKDL--PGDLFS---------TLMKDDPSTVKGAEI 58
Query: 68 TYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIKAEIQ 127
+ L +L LPQ FG I+LGETF SYIS++N S V+DV +KA++Q
Sbjct: 59 --------------LMLGEMLTLPQNFGNIFLGETFSSYISVHNDSNQVVKDVQVKADLQ 104
Query: 128 TDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQF 187
T QR L L S + V ++ D ++ H+VKE+G H LVC Y+ GE+ Y +F
Sbjct: 105 TSSQR-LNLSASSAVVADLKPDSCIDDVIHHEVKEIGTHILVCAVSYTIQSGEKMYFRKF 163
Query: 188 FKFIVSNPLSVRTKVRVVK-EITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKADGP 246
FKF V PL V+TK + + FLEA I+N T S ++M++V EPS ++ + L
Sbjct: 164 FKFQVLKPLDVKTKFYNAETDEVFLEAQIQNITTSPMFMEKVSLEPSIMYNVSELNTVIT 223
Query: 247 HSDYNAQSR---EIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKLQITW 303
+ D+ S + + P+ R YLY LK + ++G V+GKL I W
Sbjct: 224 NGDWKGSSTFGTKTYLQPLDTR------QYLYCLKPKPEFAEKAGVIKGVTVIGKLDIVW 277
Query: 304 RTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFE 363
+TNLGE GRLQT Q+ ++ L++ +P V +++PF + K+TN + + +
Sbjct: 278 KTNLGERGRLQTSQLQRMAPGYGDVRLSIETIPDTVRLEEPFDITCKITNCSSERT--MD 335
Query: 364 IWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFDKLEK 423
+ L +++ ++G ++ L P + T L+ + G+Q ++G+ + D K
Sbjct: 336 LVLEMCNTNAIHWSGVSGRQLGKLHPSSSLHLTLTLLSSVQ---GLQSVSGLRLTDTFLK 392
Query: 424 ITYD 427
TY+
Sbjct: 393 RTYE 396
>sp|A8WX89|U533_CAEBR UPF0533 protein CBG04321 OS=Caenorhabditis briggsae GN=CBG04321
PE=3 SV=2
Length = 401
Score = 189 bits (481), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 136/447 (30%), Positives = 222/447 (49%), Gaps = 62/447 (13%)
Query: 1 MSSTPGTHSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVT 60
+S++ LA RVMRL RP + P D F DP+ + L++ V
Sbjct: 5 ISNSSTQQLLALRVMRLARP--------KFAPLDGFS-----HDPVDPTGFGELLAGKV- 50
Query: 61 TNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDV 120
++++ SR HD + + L+ PQ F IYLGETF Y+++ N S V +V
Sbjct: 51 ----AEISKESR---HD----LPIGEYLIAPQMFENIYLGETFTFYVNVVNESESNVVNV 99
Query: 121 VIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEG 179
+K E+QT QR++L +ES + G+ ++ H+VKE+G H L+C+ Y G
Sbjct: 100 SLKCELQTSTQRVVLPCSVQDVTIESTKCDGQ---VISHEVKEIGQHILICSVNYKTLSG 156
Query: 180 ERKYLPQFFKFIVSNPLSVRTKVRVVK----EITFLEACIENHTKSNLYMDQVEFEPSQN 235
E Y +FFKF VS P+ V+TK + + +LEA IEN + SN+++++VE +PSQ+
Sbjct: 157 ENMYFRKFFKFPVSKPIDVKTKFYSAEDNANQDVYLEAQIENTSNSNMFLERVELDPSQH 216
Query: 236 WSATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGS-- 293
+ T + H D + ++ KP I +L+ L SPV V +
Sbjct: 217 YKVTSIS----HEDEFPEVGKLLKP-------KDIRQFLFCL--------SPVDVNNTLG 257
Query: 294 ----NVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKL 349
+GKL ++WRT++GE GRLQT + ++ L+V + P+ V + KPF +
Sbjct: 258 YKDLTSIGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVDVQKPFEVAC 317
Query: 350 KLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGV 409
+L N +++ ++ L Q + + + +G+ + L P DF LN+ +G+
Sbjct: 318 RLYNCSERALD-LQLRLEQPSNRQLVICSPSGVSLGQLPPSRY---VDFALNVFPVAVGI 373
Query: 410 QRITGITVFDKLEKITYDSLPDLEIFV 436
Q I+GI + D K Y+ +IFV
Sbjct: 374 QSISGIRITDTFTKRHYEHDDIAQIFV 400
>sp|Q95QQ2|U533_CAEEL UPF0533 protein C56C10.7 OS=Caenorhabditis elegans GN=C56C10.7 PE=1
SV=1
Length = 401
Score = 188 bits (478), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 133/450 (29%), Positives = 218/450 (48%), Gaps = 64/450 (14%)
Query: 1 MSSTPGTHS----LAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLIS 56
M+ P + S LA RVMRL RP + P D F DP+ + L++
Sbjct: 1 MADNPASSSSQQLLALRVMRLARP--------KFAPVDGFS-----HDPVDPTGFGELLA 47
Query: 57 SDVTTNKSSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLE 116
V S+++ SR + + L+ PQ F IYLGETF Y+++ N S
Sbjct: 48 GKV-----SEISKESR-------QDLPIGEYLIAPQMFENIYLGETFTFYVNVVNESEKT 95
Query: 117 VRDVVIKAEIQTDKQRILL-LDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYS 175
V V +K E+QT QR++L + +ES + G+ ++ H+VKE+G H L+C+ Y
Sbjct: 96 VSSVSLKCELQTSTQRVVLPCSVQDATIESSKCEGQ---VISHEVKEIGQHILICSVNYK 152
Query: 176 DGEGERKYLPQFFKFIVSNPLSVRTKVRVVK----EITFLEACIENHTKSNLYMDQVEFE 231
GE Y +FFKF VS P+ V+TK + + +LEA IEN + +N+++++VE +
Sbjct: 153 TSNGENMYFRKFFKFPVSKPIDVKTKFYSAEDNANQDVYLEAQIENTSNANMFLEKVELD 212
Query: 232 PSQNWSATMLKADGPHSDYNAQSREIFKPP-----VLIRSGGGIHNYLYQLKMLSHGSSS 286
PSQ+++ T + H D ++ KP + + +HN L + S
Sbjct: 213 PSQHYNVTSIA----HEDEFGDVGKLLKPKDIRQFLFCLTPADVHNTLGYKDLTS----- 263
Query: 287 PVKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFL 346
+GKL ++WRT++GE GRLQT + ++ L+V + P+ V + KPF
Sbjct: 264 ---------IGKLDMSWRTSMGEKGRLQTSALQRIAPGYGDVRLSVEKTPACVDVQKPFE 314
Query: 347 LKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATK 406
+ +L N +++ ++ L Q + +G+ + L P + DF LN+
Sbjct: 315 VSCRLYNCSERALD-LQLRLEQPSNRHLVFCSPSGVSLGQLPPSQ---HVDFSLNVFPVT 370
Query: 407 LGVQRITGITVFDKLEKITYDSLPDLEIFV 436
+G+Q I+GI + D K Y+ +IFV
Sbjct: 371 VGIQSISGIRITDTFTKRIYEHDDIAQIFV 400
>sp|Q55EX6|U533_DICDI UPF0533 protein OS=Dictyostelium discoideum GN=DDB_G0269062 PE=3
SV=2
Length = 511
Score = 179 bits (453), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 133/450 (29%), Positives = 227/450 (50%), Gaps = 58/450 (12%)
Query: 7 THSLAFRVMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNKSSD 66
H L +VMRL +P++ P+ + DL + I +++L V ++ S+D
Sbjct: 4 NHLLNLKVMRLSKPNIPTINPILCEKQDL--PYETMSTSIDSTSLS---MGSVNSSGSND 58
Query: 67 LTYRSRFLLHDSADSIGLSGLLV---LPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
+ L+ ++ + I + GL V L G IYLGE FC YIS+NN S +VR+V +K
Sbjct: 59 ----NNQLIGNNGNPINMEGLGVTSMLQLQSGVIYLGEMFCCYISLNNHSPYQVRNVFLK 114
Query: 124 AEIQTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKY 183
E+QT RI LLD+ + V + G DF+V+ +VKE G + LVC Y+ EGE+K
Sbjct: 115 VELQTTSSRIPLLDSEQQSVPTFNPGFSSDFVVQREVKESGVNILVCAVNYTTPEGEQKK 174
Query: 184 LPQFFKFIVSNPLSVRTKVRVVKEITFLEACIENHTKSNLYMDQVEFEPSQNWSATMLKA 243
++FKF V NPL ++T++ + + FLEAC+EN T+ +L+++ + FEP +++++ +
Sbjct: 175 FRKYFKFQVLNPLVLKTRIHNLPNVVFLEACLENATQGSLFIESILFEPIEHFNSKDISF 234
Query: 244 DGP-------------HSDYNAQSREIFKPPVLIRSGGGIHN---YLYQLKM-------- 279
+ + N + FK + G I N L +K+
Sbjct: 235 ENSLDDNNNLDNNNNNLENDNNLNNLEFK----LNEKGLIENTDELLENIKLTTSDNIVF 290
Query: 280 LSHGSS-------SP-----VKVQGSNVLGKLQITWRTNLGEPGRLQTQQILGTTITSKE 327
L G S +P V+ + S LG+L ITWR+ GE GRL+T I + ++
Sbjct: 291 LKQGCSRQYLFQITPKDIENVESKNSLPLGRLDITWRSYFGEIGRLKTAAI-QRKLNQED 349
Query: 328 IELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQGPFEIWLSQNDSDEEKVVMINGLRIMAL 387
IE +++ +P + ++KPF + KL+N++++ P + L +N D K+ + L
Sbjct: 350 IECSLINIPDKIKLEKPFSVIAKLSNKSNRILYP-QFMLVRNKMDGIKI----NSHLPKL 404
Query: 388 APVEAFGSTDFHLNLIATKLGVQRITGITV 417
P++ + + K G+Q+I G+ +
Sbjct: 405 DPIQPNSIIQVEIEMFPLKPGMQQIIGLAI 434
>sp|Q95TN1|U533_DROME UPF0533 protein CG4953 OS=Drosophila melanogaster GN=CG4953 PE=2
SV=1
Length = 438
Score = 174 bits (441), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 133/435 (30%), Positives = 224/435 (51%), Gaps = 47/435 (10%)
Query: 5 PGTHSLAFRVMRLCRPSL-HVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISSDVTTNK 63
P H +A +VMRL RP+L + P + +PTDL ++++
Sbjct: 6 PDAHLVALKVMRLMRPTLVGLGPVVTCEPTDLV--------------------QRFSSSQ 45
Query: 64 SSDLTYRSRFLLHDSADSIGLSGLLVLPQAFGAIYLGETFCSYISINNSSTLEVRDVVIK 123
SD + A+++ +L+LPQ+FG+IYLGETF SYI ++N++ V V +K
Sbjct: 46 ESD------GMSGACAETLAAGQVLLLPQSFGSIYLGETFASYICVHNTTPNPVECVTVK 99
Query: 124 AEIQTDKQRILLL--DTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGER 181
A++Q++ RI L + +KSPV + GG D ++ ++VKE+G H LVC YS G
Sbjct: 100 ADLQSNTSRINLSMHENAKSPV-VLPPGGTIDDVIRYEVKEIGTHILVCEVNYSTPAGYA 158
Query: 182 KYLPQFFKFIVSNPLSVRTKV--RVVKEITFLEACIENHTKSNLYMDQVEFEPSQNWSAT 239
+ L +FFKF V PL V+TK + EI +LEA I+N T S +++VE + S+++S T
Sbjct: 159 QSLRKFFKFQVLKPLDVKTKFYNAEIDEI-YLEAQIQNVTTSPFCLEKVELDGSEDYSVT 217
Query: 240 MLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLGKL 299
L P+ + + + +P +LY +K + + ++ N +GKL
Sbjct: 218 PLNT-LPNGESVFTVKHMLQP-------NNSCQFLYCIKPKGDIAKNVDTLRQFNNVGKL 269
Query: 300 QITWRTNLGEPGRLQTQQILGTTITSKEIELNVVEVPSVVGIDKPFLLKLKLTNQTDKEQ 359
I WR+NLGE GRLQT Q+ K + L V++ + + I F ++TN T +
Sbjct: 270 DIVWRSNLGEKGRLQTSQLQRLPFECKTLRLEVLDAKNTIKIGTIFTFNCRVTN-TSEHP 328
Query: 360 GPFEIWLSQNDSDEEKVVMINGLRIMALAPVEAFGSTDFHLNLIATKLGVQRITGITVFD 419
+ L+ S + + G L +++ S +F L++ +KLG+ +IT + + +
Sbjct: 329 MKLNVRLAAKFSPDSQYT---GCADFMLNLLQSGESAEFPLSVCPSKLGLVKITPLVLTN 385
Query: 420 KL--EKITYDSLPDL 432
L E+ T +++ D+
Sbjct: 386 TLQNEQFTIENVVDV 400
>sp|Q8YA32|INLI_LISMO Internalin-I OS=Listeria monocytogenes serovar 1/2a (strain ATCC
BAA-679 / EGD-e) GN=inlI PE=4 SV=1
Length = 1778
Score = 35.0 bits (79), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 22/81 (27%), Positives = 40/81 (49%), Gaps = 5/81 (6%)
Query: 127 QTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQ 186
QTD++R L S++ ++ + DF + D+ ++G +T+ TA DG+ P
Sbjct: 1448 QTDERRFLY--DSEAKIDEANVEIKTDFAEKVDINKVGTYTVTLTATNEDGQAAN---PV 1502
Query: 187 FFKFIVSNPLSVRTKVRVVKE 207
IVS+ + + V+ V E
Sbjct: 1503 EVSVIVSDAAAEKVNVKYVDE 1523
>sp|Q2NJ77|UVRB_AYWBP UvrABC system protein B OS=Aster yellows witches'-broom phytoplasma
(strain AYWB) GN=uvrB PE=3 SV=1
Length = 667
Score = 34.7 bits (78), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 20/71 (28%), Positives = 38/71 (53%), Gaps = 3/71 (4%)
Query: 1 MSSTPGTHSLAFR---VMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISS 57
+S+TPG + L + V ++ RP+ ++P + V PT+ + + F+ N ++ +
Sbjct: 394 LSATPGDYELTKKIPIVEQIIRPTFVLDPEIEVRPTNNQMDDLYFEIKHQTKNNQRILIT 453
Query: 58 DVTTNKSSDLT 68
+T N S DLT
Sbjct: 454 TLTINMSEDLT 464
>sp|Q42966|NRL4B_TOBAC Bifunctional nitrilase/nitrile hydratase NIT4B OS=Nicotiana tabacum
GN=NIT4B PE=2 SV=1
Length = 348
Score = 34.3 bits (77), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 49/113 (43%), Gaps = 17/113 (15%)
Query: 87 LLVLPQAF-GAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVES 145
L+V P+AF G G TF +SI N + K + + K +D V+
Sbjct: 64 LVVFPEAFIGGYPRGSTFG--VSIGNRTA--------KGKEEFRKYHASAIDVPGPEVDR 113
Query: 146 IRA-GGRYDFIVEHDVKELGAHTLVCTALYSDGEG-----ERKYLPQFFKFIV 192
+ A G+Y + V E +TL CT L+ D +G RK +P + I+
Sbjct: 114 LAAMAGKYKVYLVMGVIERDGYTLYCTVLFFDSQGHYLGKHRKIMPTALERII 166
>sp|Q42965|NRL4A_TOBAC Bifunctional nitrilase/nitrile hydratase NIT4A OS=Nicotiana tabacum
GN=NIT4A PE=2 SV=1
Length = 349
Score = 34.3 bits (77), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 49/113 (43%), Gaps = 17/113 (15%)
Query: 87 LLVLPQAF-GAIYLGETFCSYISINNSSTLEVRDVVIKAEIQTDKQRILLLDTSKSPVES 145
L+V P+AF G G TF +SI N + K + + K +D V+
Sbjct: 64 LVVFPEAFIGGYPRGSTFG--VSIGNRTA--------KGKEEFRKYHASAIDVPGPEVDR 113
Query: 146 IRA-GGRYDFIVEHDVKELGAHTLVCTALYSDGEG-----ERKYLPQFFKFIV 192
+ A G+Y + V E +TL CT L+ D +G RK +P + I+
Sbjct: 114 LAAMAGKYKVYLVMGVIERDGYTLYCTVLFFDSQGHFLGKHRKIMPTALERII 166
>sp|Q723X5|INLI_LISMF Internalin-I OS=Listeria monocytogenes serotype 4b (strain F2365)
GN=inlI PE=3 SV=1
Length = 1775
Score = 34.3 bits (77), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 22/81 (27%), Positives = 39/81 (48%), Gaps = 5/81 (6%)
Query: 127 QTDKQRILLLDTSKSPVESIRAGGRYDFIVEHDVKELGAHTLVCTALYSDGEGERKYLPQ 186
QTD++R L S + ++ + DF + D+ ++G +T+ TA DG+ P
Sbjct: 1445 QTDERRFLY--DSDAKIDEANVEIKTDFAEKVDINKVGTYTVTLTATNEDGQAAN---PV 1499
Query: 187 FFKFIVSNPLSVRTKVRVVKE 207
IVS+ + + V+ V E
Sbjct: 1500 EVSVIVSDAAAEKVNVKYVDE 1520
>sp|Q6YQE1|UVRB_ONYPE UvrABC system protein B OS=Onion yellows phytoplasma (strain OY-M)
GN=uvrB PE=3 SV=2
Length = 670
Score = 33.9 bits (76), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 19/71 (26%), Positives = 37/71 (52%), Gaps = 3/71 (4%)
Query: 1 MSSTPGTHSLAFR---VMRLCRPSLHVEPPLRVDPTDLFIGEDIFDDPIAASNLPPLISS 57
+S+TPG + L + V ++ RP+ ++P + + PT + + F+ N ++ +
Sbjct: 394 LSATPGNYELTKKIPIVEQIIRPTFVLDPEIEIRPTHNQMDDLYFEIKHQTKNNQRILIT 453
Query: 58 DVTTNKSSDLT 68
+T N S DLT
Sbjct: 454 TLTINMSEDLT 464
>sp|O53508|MPTA_MYCTU Alpha-(1->6)-mannopyranosyltransferase A OS=Mycobacterium
tuberculosis GN=mptA PE=3 SV=1
Length = 516
Score = 33.5 bits (75), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 21/65 (32%), Positives = 34/65 (52%), Gaps = 6/65 (9%)
Query: 238 ATMLKADGPHSDYNAQSREIFKPPVLIRSGGGIHNYLYQLKMLSHGSSSPVKVQGSNVLG 297
A+ L GP + + + P VLI GG+HN + + +++ G + + VQG NV G
Sbjct: 224 ASHLGTHGPTALWIC----VLNPLVLIHLMGGVHNEMLMVGLMTAGIA--LTVQGRNVAG 277
Query: 298 KLQIT 302
+ IT
Sbjct: 278 IILIT 282
>sp|P26191|VP2_ROTPC Inner capsid protein VP2 OS=Rotavirus C (strain Pig/United
States/Cowden/1980) PE=3 SV=1
Length = 872
Score = 32.0 bits (71), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 27/83 (32%), Positives = 38/83 (45%), Gaps = 9/83 (10%)
Query: 178 EGERKYLPQFF--KFIVSNPLSVRTKVRVVKEITFLEACIENHTKSNLYMDQVEFEPSQN 235
E +KY+P + K IV N + R K + ++ L H KS LY E + S
Sbjct: 93 EVVKKYIPSYSTDKLIVKNYRNSRIKCQTYNKLFRLL-----HVKSYLYDVNGEKKLSTR 147
Query: 236 WSATMLKADGPHSDYNAQSREIF 258
W +LK D P DY+ R+ F
Sbjct: 148 WYWKLLKDDLPAGDYSV--RQFF 168
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.136 0.390
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 161,805,441
Number of Sequences: 539616
Number of extensions: 6753751
Number of successful extensions: 15235
Number of sequences better than 100.0: 21
Number of HSP's better than 100.0 without gapping: 13
Number of HSP's successfully gapped in prelim test: 8
Number of HSP's that attempted gapping in prelim test: 15163
Number of HSP's gapped (non-prelim): 25
length of query: 439
length of database: 191,569,459
effective HSP length: 121
effective length of query: 318
effective length of database: 126,275,923
effective search space: 40155743514
effective search space used: 40155743514
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)