BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 010656
(505 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225440787|ref|XP_002281618.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Vitis vinifera]
Length = 507
Score = 774 bits (1998), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/506 (73%), Positives = 429/506 (84%), Gaps = 6/506 (1%)
Query: 1 MATRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQ 60
MAT FI L S+ S+AKI P FPSS + PE+LS +SS + LY+ KY TQ
Sbjct: 4 MATTFILPCIFLFSISSSGFSSAKITPRFPSSIVRPEQLS--VSSQTE---LYEAKYFTQ 58
Query: 61 ILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAP 120
+LDHFNY PQSY+TFQQRYLIND +WGG+ APIFVYTGNEGDIEWFAQNTGFM+D AP
Sbjct: 59 LLDHFNYQPQSYRTFQQRYLINDKYWGGADKLAPIFVYTGNEGDIEWFAQNTGFMFDTAP 118
Query: 121 KFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTAT 180
F+ALLVFIEHR+YGKSIP+GG+ +AY NAST GYLSSTQALADYA+LIIDLKKNL+AT
Sbjct: 119 HFQALLVFIEHRFYGKSIPFGGDTNVAYSNASTLGYLSSTQALADYATLIIDLKKNLSAT 178
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
+SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF+NI SPY+F+NIITQDFRS
Sbjct: 179 NSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFENITSPYTFNNIITQDFRSE 238
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSE-KNLAIESWLSTAFVYTAMTDY 299
SENCYKVIKGSW+QIEETA K GGLE L+K+FRICK+ AIE+WLSTA+VYTAMTDY
Sbjct: 239 SENCYKVIKGSWEQIEETAMKNGGLEVLRKSFRICKNYISGGAIENWLSTAYVYTAMTDY 298
Query: 300 PTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDP 359
PTPSNFL+P+PA+PVK+MCKAIDDP GND FAKLYGAA+VYYNY+GTA CFDL DSDP
Sbjct: 299 PTPSNFLSPMPAYPVKQMCKAIDDPAQGNDTFAKLYGAANVYYNYTGTAACFDLADDSDP 358
Query: 360 HGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGG 419
HGL EW WQACTEMIM +N++SIF S +Y RA +CK A+ ++PRPNWITTEFGG
Sbjct: 359 HGLGEWTWQACTEMIMPVNANNEESIFPVSTWNYSNRAAFCKFAFDIEPRPNWITTEFGG 418
Query: 420 HKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQ 479
H I VLKRF SNIIFFNGLRDPWSGGGVL++ISK+++A+V K+GAHHVDLRF+T EDP+
Sbjct: 419 HDIKRVLKRFGSNIIFFNGLRDPWSGGGVLESISKSIVAIVAKDGAHHVDLRFATSEDPE 478
Query: 480 WLKDVRRREVEIIGKWISQYFQDLAQ 505
WL+DVR+REV+II KW S+Y+ DLA
Sbjct: 479 WLQDVRKREVKIITKWFSEYYHDLAH 504
>gi|297740163|emb|CBI30345.3| unnamed protein product [Vitis vinifera]
Length = 485
Score = 763 bits (1971), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/478 (75%), Positives = 415/478 (86%), Gaps = 6/478 (1%)
Query: 29 FPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGG 88
FPSS + PE+LS +SS + LY+ KY TQ+LDHFNY PQSY+TFQQRYLIND +WGG
Sbjct: 10 FPSSIVRPEQLS--VSSQTE---LYEAKYFTQLLDHFNYQPQSYRTFQQRYLINDKYWGG 64
Query: 89 SKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAY 148
+ APIFVYTGNEGDIEWFAQNTGFM+D AP F+ALLVFIEHR+YGKSIP+GG+ +AY
Sbjct: 65 ADKLAPIFVYTGNEGDIEWFAQNTGFMFDTAPHFQALLVFIEHRFYGKSIPFGGDTNVAY 124
Query: 149 KNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
NAST GYLSSTQALADYA+LIIDLKKNL+AT+SPVVVFGGSYGGMLAAWFRLKYPHVAI
Sbjct: 125 SNASTLGYLSSTQALADYATLIIDLKKNLSATNSPVVVFGGSYGGMLAAWFRLKYPHVAI 184
Query: 209 GALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKL 268
GALASSAPILNF+NI SPY+F+NIITQDFRS SENCYKVIKGSW+QIEETA K GGLE L
Sbjct: 185 GALASSAPILNFENITSPYTFNNIITQDFRSESENCYKVIKGSWEQIEETAMKNGGLEVL 244
Query: 269 QKAFRICKSE-KNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG 327
+K+FRICK+ AIE+WLSTA+VYTAMTDYPTPSNFL+P+PA+PVK+MCKAIDDP G
Sbjct: 245 RKSFRICKNYISGGAIENWLSTAYVYTAMTDYPTPSNFLSPMPAYPVKQMCKAIDDPAQG 304
Query: 328 NDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFE 387
ND FAKLYGAA+VYYNY+GTA CFDL DSDPHGL EW WQACTEMIM +N++SIF
Sbjct: 305 NDTFAKLYGAANVYYNYTGTAACFDLADDSDPHGLGEWTWQACTEMIMPVNANNEESIFP 364
Query: 388 ESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
S +Y RA +CK A+ ++PRPNWITTEFGGH I VLKRF SNIIFFNGLRDPWSGGG
Sbjct: 365 VSTWNYSNRAAFCKFAFDIEPRPNWITTEFGGHDIKRVLKRFGSNIIFFNGLRDPWSGGG 424
Query: 448 VLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
VL++ISK+++A+V K+GAHHVDLRF+T EDP+WL+DVR+REV+II KW S+Y+ DLA
Sbjct: 425 VLESISKSIVAIVAKDGAHHVDLRFATSEDPEWLQDVRKREVKIITKWFSEYYHDLAH 482
>gi|255579351|ref|XP_002530520.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
gi|223529924|gb|EEF31852.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
Length = 508
Score = 758 bits (1958), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 359/488 (73%), Positives = 414/488 (84%), Gaps = 2/488 (0%)
Query: 20 ISNAKIF-PTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQR 78
IS+A F P F SS T S S LYK K+ TQ LDHFN+NP+SYQTFQQR
Sbjct: 18 ISSADYFLPRFSSSINTQPADQKKTSLSTTPNKLYKEKFFTQTLDHFNFNPKSYQTFQQR 77
Query: 79 YLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSI 138
YLINDT+W G KNNAPIF+YTGNEG+IEWFAQNTGFM+D APKF ALLVF+EHR+YGKSI
Sbjct: 78 YLINDTYWAGPKNNAPIFMYTGNEGEIEWFAQNTGFMFDNAPKFNALLVFVEHRFYGKSI 137
Query: 139 PYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAW 198
P+GGNKE+AY NAST GYL+STQ+LADYA+LI DLK NL+ATDSPVVVFGGSYGGMLAAW
Sbjct: 138 PFGGNKEVAYSNASTLGYLTSTQSLADYATLITDLKNNLSATDSPVVVFGGSYGGMLAAW 197
Query: 199 FRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEET 258
FRLKYPHV IGALASSAPIL F NI SPYSF+NIIT DFRS SENCYKVIKGSW+QIE+T
Sbjct: 198 FRLKYPHVTIGALASSAPILGFVNITSPYSFNNIITHDFRSESENCYKVIKGSWQQIEDT 257
Query: 259 AKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEM 317
A + GGLEKL+K+F+ICK+ + ++E+WLSTA+VY+AMTDYPTPSNFLNPLPAFPVK+M
Sbjct: 258 ANQHGGLEKLRKSFKICKNYISAGSLENWLSTAWVYSAMTDYPTPSNFLNPLPAFPVKQM 317
Query: 318 CKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLT 377
CKAIDDP GND FAK + AASVYYNYSGTA CFDL+ DSDPHGL W WQACTEMI+ T
Sbjct: 318 CKAIDDPTAGNDTFAKFHAAASVYYNYSGTATCFDLDDDSDPHGLGGWDWQACTEMILPT 377
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFN 437
GG +SIF SE DY+ R YCK + +DPRPNWITTEFGGH I + LKRFASNIIFFN
Sbjct: 378 GGSTAESIFPASEWDYNDRVTYCKLRFDIDPRPNWITTEFGGHNIKMALKRFASNIIFFN 437
Query: 438 GLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
GLRDPWSGGGVL++ISK+++ALVE++GAHHVDLRF+T EDP+WL++VR++EV+II KW+S
Sbjct: 438 GLRDPWSGGGVLEDISKSLVALVEEKGAHHVDLRFATSEDPKWLQEVRQKEVKIIAKWLS 497
Query: 498 QYFQDLAQ 505
Y+QDLA
Sbjct: 498 DYYQDLAH 505
>gi|356503833|ref|XP_003520707.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 508
Score = 744 bits (1920), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 360/503 (71%), Positives = 414/503 (82%), Gaps = 7/503 (1%)
Query: 4 RFIFLSFCLLFSS-TLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQ-GLYKTKYHTQI 61
+FI ++ LFS+ +LT + I P FPSS ++ E L S SQ GLY+TK+ TQI
Sbjct: 6 QFIIITLFSLFSAPSLTFAFVPILPRFPSSAVSAE----LKQRSHSSQNGLYRTKFFTQI 61
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
LDHFN+NPQS TFQQRYLINDT WGG+KNNAPIFVYTGNEG+IEWF QNTGFM++ AP
Sbjct: 62 LDHFNFNPQSNHTFQQRYLINDTFWGGAKNNAPIFVYTGNEGNIEWFTQNTGFMFENAPS 121
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATD 181
F+ALLVFIEHR+YGKSIP+GGNK +AY N ST GYLSSTQALADYA+LIIDLKKNL+ATD
Sbjct: 122 FQALLVFIEHRFYGKSIPFGGNKTVAYANTSTLGYLSSTQALADYATLIIDLKKNLSATD 181
Query: 182 SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS 241
SPVVVFGGSYGGMLAAWFR+KYPHVAIGALASSAPIL+F +VSP F+NIITQDFRS S
Sbjct: 182 SPVVVFGGSYGGMLAAWFRMKYPHVAIGALASSAPILHFMGLVSPDIFNNIITQDFRSES 241
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAFVYTAMTDYP 300
ENCYKVIKGSW I++TA KPGG+E L+K FRIC + ++E WL A++YTAMTDYP
Sbjct: 242 ENCYKVIKGSWDLIDDTANKPGGMELLRKTFRICNDDFGPGSLEGWLRAAWIYTAMTDYP 301
Query: 301 TPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPH 360
TPSNFLNPLPA+PVK+MC+AID TGN+ AKL+ AASVYYNY+G A+CFDL+ +SDPH
Sbjct: 302 TPSNFLNPLPAYPVKKMCEAIDSSVTGNNRLAKLHAAASVYYNYTGKARCFDLDDNSDPH 361
Query: 361 GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGH 420
L W WQACTEMIM GG NK+SIF E E Y+ARA +C Y V PRP+WITTEFGGH
Sbjct: 362 DLGGWQWQACTEMIMPVGGSNKESIFPEYEWSYEARASWCDFFYNVQPRPHWITTEFGGH 421
Query: 421 KIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQW 480
+ VLKR ASNIIFFNGLRDPWSGGGVLK ISKT++A+V K+GAHHVDLRFS+KEDPQW
Sbjct: 422 AVERVLKRSASNIIFFNGLRDPWSGGGVLKTISKTLVAIVAKKGAHHVDLRFSSKEDPQW 481
Query: 481 LKDVRRREVEIIGKWISQYFQDL 503
LKDVR+ EV II WISQY QDL
Sbjct: 482 LKDVRKLEVNIIASWISQYHQDL 504
>gi|356570875|ref|XP_003553609.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 507
Score = 740 bits (1910), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/502 (71%), Positives = 409/502 (81%), Gaps = 6/502 (1%)
Query: 4 RFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQ-GLYKTKYHTQIL 62
+FI F L +LT + A I P FPSS ++ E L S SQ GLY+TK+ TQIL
Sbjct: 6 QFILTLFSLFSVPSLTFAFAPILPRFPSSAVSAE----LKQRSHSSQNGLYRTKFFTQIL 61
Query: 63 DHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKF 122
DHFN+NPQS TFQQRYLINDT WGG+KNNAPIFVYTGNEG+IEWF QNTGFM++ AP F
Sbjct: 62 DHFNFNPQSNHTFQQRYLINDTFWGGAKNNAPIFVYTGNEGNIEWFTQNTGFMFENAPSF 121
Query: 123 KALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDS 182
+ALLVFIEHR+YGKSIP+GGNK +AY N ST GYLSSTQALADYA+LIIDLKKNL+ATDS
Sbjct: 122 QALLVFIEHRFYGKSIPFGGNKTVAYANTSTLGYLSSTQALADYAALIIDLKKNLSATDS 181
Query: 183 PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSE 242
PVVVFGGSYGGMLAAWFR+KYPHVAIGALASSAPIL+F +VSP F +IITQDFRS SE
Sbjct: 182 PVVVFGGSYGGMLAAWFRMKYPHVAIGALASSAPILHFMGMVSPDIFISIITQDFRSESE 241
Query: 243 NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAFVYTAMTDYPT 301
NCYKVIKGSW I++TA KPGG+E L+K FRIC + ++E WL A++YTAMTDYPT
Sbjct: 242 NCYKVIKGSWDLIDDTANKPGGMELLRKTFRICNDDFGPDSLEGWLRAAWIYTAMTDYPT 301
Query: 302 PSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHG 361
PSNFLNPLPA+PVK+MC+AID TGN+ AKLY AA+VYYNY+G A CFDL+ +SDPH
Sbjct: 302 PSNFLNPLPAYPVKKMCEAIDSSVTGNNRLAKLYAAANVYYNYTGKATCFDLDDNSDPHD 361
Query: 362 LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHK 421
L W WQACTEMIM GG NK+SIF E E Y+ARA +C Y V PRP+WITTEFGGH
Sbjct: 362 LGGWQWQACTEMIMPVGGSNKESIFPEYEWSYEARASWCDFFYNVQPRPHWITTEFGGHA 421
Query: 422 IGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWL 481
I VLKR ASNIIFFNGLRDPWS GGVLK ISKT++A+V K+GAHHVDLR+S+KEDPQWL
Sbjct: 422 IERVLKRSASNIIFFNGLRDPWSAGGVLKTISKTIVAIVAKKGAHHVDLRYSSKEDPQWL 481
Query: 482 KDVRRREVEIIGKWISQYFQDL 503
KDVR++EV II WISQY QDL
Sbjct: 482 KDVRKQEVNIIASWISQYHQDL 503
>gi|147788546|emb|CAN61013.1| hypothetical protein VITISV_036738 [Vitis vinifera]
Length = 554
Score = 719 bits (1855), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 347/485 (71%), Positives = 400/485 (82%), Gaps = 17/485 (3%)
Query: 1 MATRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQ 60
MAT FI L S+ S AKI P FPSS + PE+LS +SS + LY+ KY TQ
Sbjct: 4 MATTFILPCIFLFSISSSGFSAAKITPRFPSSIVRPEQLS--VSSQTE---LYEAKYFTQ 58
Query: 61 ILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAP 120
ILDHFNY PQSY+TFQQRYLIND +WGG+ APIFVYTGNEGDIEWFAQNTGFM+D AP
Sbjct: 59 ILDHFNYQPQSYRTFQQRYLINDKYWGGADKLAPIFVYTGNEGDIEWFAQNTGFMFDTAP 118
Query: 121 KFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTAT 180
F+ALLVFIEHR+YGKSIP+GG+ +AY NAST GYLSSTQALADYA+LIIDLKKNL+AT
Sbjct: 119 HFQALLVFIEHRFYGKSIPFGGDTNVAYSNASTLGYLSSTQALADYATLIIDLKKNLSAT 178
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
+SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF+NI SPY+F+NIITQDF
Sbjct: 179 NSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFENITSPYTFNNIITQDF--- 235
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSE-KNLAIESWLSTAFVYTAMTDY 299
+GSW+QIEETA K GGLE L+K+FRICK+ AIE+WLSTA+VYTAMTDY
Sbjct: 236 --------QGSWEQIEETAMKNGGLEVLRKSFRICKNYISGGAIENWLSTAYVYTAMTDY 287
Query: 300 PTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDP 359
PTPSNFL+P+PA+PVK+MCKAIDDP GND FAKLYGAA+VYYNY+GTA CFDL DSDP
Sbjct: 288 PTPSNFLSPMPAYPVKQMCKAIDDPAXGNDTFAKLYGAANVYYNYTGTAACFDLADDSDP 347
Query: 360 HGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGG 419
HGL EW WQACTEMIM +N++SIF S +Y RA +CK A+ ++PRPNWITTEFGG
Sbjct: 348 HGLGEWTWQACTEMIMPVNANNEESIFPVSTWNYSNRAAFCKFAFDIEPRPNWITTEFGG 407
Query: 420 HKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQ 479
H I VLKRF SNIIFFNGLRDPWSGGGVL++ISK+++A+V K+GAHHVDLRF+T EDP+
Sbjct: 408 HDIKRVLKRFGSNIIFFNGLRDPWSGGGVLESISKSIVAIVAKDGAHHVDLRFATSEDPE 467
Query: 480 WLKDV 484
WL+B
Sbjct: 468 WLQBA 472
>gi|224088466|ref|XP_002308456.1| predicted protein [Populus trichocarpa]
gi|222854432|gb|EEE91979.1| predicted protein [Populus trichocarpa]
Length = 461
Score = 699 bits (1803), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 337/466 (72%), Positives = 396/466 (84%), Gaps = 7/466 (1%)
Query: 1 MATRFIFLSFCLLFSSTLTISN-AKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHT 59
MAT FI L F LL IS+ AK P+FPSS I EK+S +S+ + LY K+ T
Sbjct: 1 MATGFISLLFSLLSLLFFGISSFAKPIPSFPSSIIQAEKIS--LSTPNE---LYHEKFFT 55
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q+LDH+ + PQSY+TFQQRYLIND +WGG++ NAPIF+YTGNEGDIEWFAQNTGF++D+A
Sbjct: 56 QVLDHYTFRPQSYKTFQQRYLINDKYWGGAEKNAPIFLYTGNEGDIEWFAQNTGFIFDIA 115
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
P FK LLVFIEHR+YGKS+P+GGNKE+AY N+ST GYL+STQALADYA+LIIDLKKNL+A
Sbjct: 116 PHFKPLLVFIEHRFYGKSMPFGGNKEVAYSNSSTLGYLTSTQALADYATLIIDLKKNLSA 175
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRS 239
TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS+PILNF+NI SPYSF+NIITQDFR
Sbjct: 176 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSSPILNFENITSPYSFNNIITQDFRG 235
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAFVYTAMTD 298
SENCYKVIK SW++IE+TA +PGGLE L+ +FRIC++ + +++SWL TA VYTAMTD
Sbjct: 236 ESENCYKVIKRSWQEIEDTASQPGGLEILRSSFRICRNSISASSLQSWLYTALVYTAMTD 295
Query: 299 YPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSD 358
YPTPSNFLNP+PA+PVKEMCKAIDDPKTGN+ FAKLYGAASVYYNYSG A CF+L+ DSD
Sbjct: 296 YPTPSNFLNPMPAYPVKEMCKAIDDPKTGNNTFAKLYGAASVYYNYSGNATCFNLDDDSD 355
Query: 359 PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFG 418
PHGL W WQACTEMI+ T G+NKDSIF SE +YD RA +CK +GV+PRPNWIT EFG
Sbjct: 356 PHGLGGWSWQACTEMILPTSGNNKDSIFPASEWNYDDRASFCKAYFGVEPRPNWITAEFG 415
Query: 419 GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
GH I VLKRF SNIIFFNGLRDPWSGGGVL+NIS +++A++ K+G
Sbjct: 416 GHDIKRVLKRFGSNIIFFNGLRDPWSGGGVLENISSSIVAIIAKQG 461
>gi|449437430|ref|XP_004136495.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 502
Score = 686 bits (1769), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/500 (66%), Positives = 403/500 (80%), Gaps = 5/500 (1%)
Query: 6 IFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHF 65
I LS L S T S +KI +FPSS + + SS D Y+T + TQILDHF
Sbjct: 7 ISLSIFLFLSLHFTSSFSKIPLSFPSSLLLRPQ-----SSPIDPLLPYQTSFFTQILDHF 61
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
N+NPQSYQ+FQQRYLINDT+WGG+ +N+PIFVYTGNEG+IEWFAQNTGF+ AP F+AL
Sbjct: 62 NFNPQSYQSFQQRYLINDTYWGGAAHNSPIFVYTGNEGNIEWFAQNTGFLLQYAPHFRAL 121
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVV 185
+VFIEHR+YGKSIP+GG++++A N+S GYLSSTQALADYA+LI DLKKNL+A DSPV+
Sbjct: 122 VVFIEHRFYGKSIPFGGDEDVANSNSSMLGYLSSTQALADYATLITDLKKNLSAVDSPVL 181
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCY 245
VFGGSYGGMLAAWFRLKYPH+A+GALASSAPIL +NI SPY+F+NI+TQDF+S S+NCY
Sbjct: 182 VFGGSYGGMLAAWFRLKYPHIALGALASSAPILQLENITSPYAFNNIVTQDFKSESQNCY 241
Query: 246 KVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNF 305
VIK SW I+ T+ P G + L+K+F+ CK + +I++WLSTA +YTAMTDYPTPSNF
Sbjct: 242 SVIKESWHLIDITSTHPQGPQLLRKSFKFCKEAEAESIKNWLSTAIIYTAMTDYPTPSNF 301
Query: 306 LNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEW 365
LNPLPA+PVK+MCKAIDDP++GND F KLYGAA++YYN++GT CFDL+ DSDPH L +W
Sbjct: 302 LNPLPAYPVKQMCKAIDDPRSGNDSFTKLYGAANIYYNFTGTVTCFDLDDDSDPHDLGDW 361
Query: 366 GWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLV 425
WQACTEMI+ TGG+ K+SIF S + R ++CK + V+PR WI T FGGH I V
Sbjct: 362 SWQACTEMILPTGGNTKESIFPASTWHFADRFQFCKTFFDVEPRRIWIPTHFGGHNIERV 421
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
LKRF SNIIFFNGLRDPWSGGGVLKNIS T++A+V KEGAHHVDLRFS +DP+WLKDVR
Sbjct: 422 LKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPDDPKWLKDVR 481
Query: 486 RREVEIIGKWISQYFQDLAQ 505
++EV II W+SQY+ DLAQ
Sbjct: 482 KQEVNIIEDWLSQYYLDLAQ 501
>gi|356505400|ref|XP_003521479.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 504
Score = 685 bits (1767), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 339/506 (66%), Positives = 403/506 (79%), Gaps = 8/506 (1%)
Query: 1 MATRFIF--LSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYH 58
MAT F F + FCL TI+ A + P FPSS + P +L+S+ GLY K+
Sbjct: 1 MATSFQFTIIGFCLFSLLCFTITFAYVIPRFPSSMLHPALDVNLLSAQN---GLYTAKFF 57
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
TQ LDHFNYNPQSYQTFQQRYLINDT+WGG+KNNAPIFVY GNEGDIEWFAQNTGFM++
Sbjct: 58 TQTLDHFNYNPQSYQTFQQRYLINDTYWGGAKNNAPIFVYMGNEGDIEWFAQNTGFMFET 117
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
AP FKALLVFIEHRYYGKS P+GGN+E+A N +T GY+SSTQALADYA+LIIDLK NL+
Sbjct: 118 APYFKALLVFIEHRYYGKSFPFGGNEEVADANTTTVGYMSSTQALADYATLIIDLKNNLS 177
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
ATDSPVVV GGSYGGMLAAWFR+KYPHVAIGALASSAPIL F ++VSPY++++IITQD++
Sbjct: 178 ATDSPVVVVGGSYGGMLAAWFRMKYPHVAIGALASSAPILQFLDLVSPYTYTDIITQDYK 237
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAFVYTAMT 297
S SENCYKVIKGSWKQIE+TA+KPGGLE+L+K+FRICK + A+ WL A AMT
Sbjct: 238 SESENCYKVIKGSWKQIEDTAQKPGGLEQLRKSFRICKHYISAGALVYWLQMALGSAAMT 297
Query: 298 DYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDS 357
DYPTPS FL PLPA+PV++MC+AID+ N+ F KLY AA+++YNY+GTA CF L+ +
Sbjct: 298 DYPTPSVFLAPLPAYPVRKMCEAIDNLSAVNETFTKLYAAANIFYNYTGTATCFFLDNTT 357
Query: 358 DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEF 417
P L W WQACTE+IM G +N+ S+F + YCK Y + PRP+WITTEF
Sbjct: 358 AP--LGGWDWQACTELIMPLGANNEGSMFPPYKWKLRDVEFYCKRVYHIQPRPHWITTEF 415
Query: 418 GGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKED 477
GGH I VLKR SNIIFFNGLRDPWSGGGVLKNIS+T++A+V KEGAHHVDLRFST ED
Sbjct: 416 GGHDIKRVLKRSGSNIIFFNGLRDPWSGGGVLKNISETIVAIVAKEGAHHVDLRFSTTED 475
Query: 478 PQWLKDVRRREVEIIGKWISQYFQDL 503
P+WLKD+R+RE++II WISQY+QDL
Sbjct: 476 PEWLKDIRKREIKIIANWISQYYQDL 501
>gi|449521545|ref|XP_004167790.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 501
Score = 681 bits (1758), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 331/500 (66%), Positives = 404/500 (80%), Gaps = 6/500 (1%)
Query: 6 IFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHF 65
I LS L S T S +KI +FPSS + + SS D Y+T + TQILDHF
Sbjct: 7 ISLSIFLFLSLHFTSSFSKIPLSFPSSLLLRPQ-----SSPIDPLLPYQTSFFTQILDHF 61
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
N+NPQSYQ+FQQRYLINDT+WGG+ +N+PIFVYTGNEG+IEWFAQNTGF+ AP F+AL
Sbjct: 62 NFNPQSYQSFQQRYLINDTYWGGAAHNSPIFVYTGNEGNIEWFAQNTGFLLQYAPHFRAL 121
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVV 185
+VFIEHR+YGKSIP+GG++++A N+S GYLSSTQALADYA+LI DLKKNL+A DSPV+
Sbjct: 122 VVFIEHRFYGKSIPFGGDEDVANSNSSMLGYLSSTQALADYATLITDLKKNLSAVDSPVL 181
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCY 245
VFGGSYGGMLAAWFRLKYPH+A+GALASSAPIL +NI SPY+F+NI+TQDF+S S+NCY
Sbjct: 182 VFGGSYGGMLAAWFRLKYPHIALGALASSAPILQLENITSPYAFNNIVTQDFKSESQNCY 241
Query: 246 KVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNF 305
VIK SW I+ T+ P G + L+K+F+I ++E +I++WLSTA +YTAMTDYPTPSNF
Sbjct: 242 SVIKESWHLIDITSTHPQGPQLLRKSFQILEAEAE-SIKNWLSTAIIYTAMTDYPTPSNF 300
Query: 306 LNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEW 365
LNPLPA+PVK+MCKAIDDP++GND F KLYGAA++YYN++GT CFDL+ DSDPH L +W
Sbjct: 301 LNPLPAYPVKQMCKAIDDPRSGNDSFTKLYGAANIYYNFTGTVTCFDLDDDSDPHDLGDW 360
Query: 366 GWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLV 425
WQACTEMI+ TGG+ K+SIF S + R ++CK + V+PR WI T FGGH I V
Sbjct: 361 SWQACTEMILPTGGNTKESIFPASTWHFADRFQFCKTFFDVEPRRIWIPTHFGGHNIERV 420
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
LKRF SNIIFFNGLRDPWSGGGVLKNIS T++A+V KEGAHHVDLRFS +DP+WLKDVR
Sbjct: 421 LKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVAKEGAHHVDLRFSNPDDPKWLKDVR 480
Query: 486 RREVEIIGKWISQYFQDLAQ 505
++EV II W+SQY+ DLAQ
Sbjct: 481 KQEVNIIEDWLSQYYLDLAQ 500
>gi|297821719|ref|XP_002878742.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297324581|gb|EFH55001.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 495
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 270/463 (58%), Positives = 342/463 (73%), Gaps = 9/463 (1%)
Query: 45 SSKDSQGL-YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
+SK Q L ++T+Y Q LDHF + P+SY F Q+YLIN W + PIFVYTGNEG
Sbjct: 37 ASKSKQELPFETRYFPQNLDHFGFTPESYTVFHQKYLINSRFW---RKGGPIFVYTGNEG 93
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
DI+WFA NTGFM D+APKF+ALLVFIEHR+YG+S P+G ++K+A T GYLSS QAL
Sbjct: 94 DIDWFASNTGFMSDIAPKFQALLVFIEHRFYGESTPFGKK---SHKSAETLGYLSSQQAL 150
Query: 164 ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
ADYA LI LK+NL++ SPVVVFGGSYGGMLAAWFRLKYPH+ IGALASSAPIL+FDNI
Sbjct: 151 ADYAILIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNI 210
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLA 282
V SF + I+QDF+ S NC+ VIK SW+++E + GL++L K FR CK + +
Sbjct: 211 VPLTSFYDAISQDFKDASINCFTVIKRSWEELEAVSTMKNGLQELSKKFRTCKGLQSKYS 270
Query: 283 IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY 342
WLS AFVYTAM +YPT +NF+ PLP +PV++MCK ID G+ + + AAS+YY
Sbjct: 271 ARDWLSGAFVYTAMVNYPTAANFMAPLPGYPVEQMCKIIDGFPRGSSNLDRAFAAASLYY 330
Query: 343 NYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKE 402
NYSG+ KCF++ +D HGL+ W +QACTEM+M N+ S+ + DY+A C
Sbjct: 331 NYSGSEKCFEMEQQTDDHGLNGWQYQACTEMVMPMSCSNQ-SMLPPYDNDYEAFQEQCMS 389
Query: 403 AYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
YGV PRP+WITTEFGG +I VLKRF SNIIF NG++DPWS GGVLKNIS +++ALV K
Sbjct: 390 TYGVKPRPHWITTEFGGKRIETVLKRFGSNIIFSNGMQDPWSRGGVLKNISSSIVALVTK 449
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+GAHH DLR +TK+DP+WLK+ RR+EV II KWIS+Y++DL +
Sbjct: 450 KGAHHADLRAATKDDPEWLKEQRRQEVSIIEKWISEYYRDLRE 492
>gi|30682358|ref|NP_850050.1| alpha/beta-hydrolase domain-containing protein [Arabidopsis
thaliana]
gi|16648801|gb|AAL25591.1| At2g24280/F27D4.19 [Arabidopsis thaliana]
gi|22655366|gb|AAM98275.1| At2g24280/F27D4.19 [Arabidopsis thaliana]
gi|330252462|gb|AEC07556.1| alpha/beta-hydrolase domain-containing protein [Arabidopsis
thaliana]
Length = 494
Score = 562 bits (1449), Expect = e-157, Method: Compositional matrix adjust.
Identities = 275/497 (55%), Positives = 354/497 (71%), Gaps = 10/497 (2%)
Query: 10 FCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNP 69
CL+F ++ A P ++ +L +S SK + ++T+Y Q LDHF++ P
Sbjct: 5 LCLVFLFFSIVAEATYSPG-GFHHLSSLRLKKKVSKSKH-ELPFETRYFPQNLDHFSFTP 62
Query: 70 QSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFI 129
SY+ F Q+YLIN+ W + PIFVYTGNEGDI+WFA NTGFM D+APKF+ALLVFI
Sbjct: 63 DSYKVFHQKYLINNRFW---RKGGPIFVYTGNEGDIDWFASNTGFMLDIAPKFRALLVFI 119
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGG 189
EHR+YG+S P+G ++K+A T GYL+S QALADYA LI LK+NL++ SPVVVFGG
Sbjct: 120 EHRFYGESTPFGKK---SHKSAETLGYLNSQQALADYAILIRSLKQNLSSEASPVVVFGG 176
Query: 190 SYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIK 249
SYGGMLAAWFRLKYPH+ IGALASSAPIL+FDNIV SF + I+QDF+ S NC+KVIK
Sbjct: 177 SYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPLTSFYDAISQDFKDASINCFKVIK 236
Query: 250 GSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPTPSNFLNP 308
SW+++E + GL++L K FR CK + WLS AFVYTAM +YPT +NF+ P
Sbjct: 237 RSWEELEAVSTMKNGLQELSKKFRTCKGLHSQYSARDWLSGAFVYTAMVNYPTAANFMAP 296
Query: 309 LPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQ 368
LP +PV++MCK ID G+ + + AAS+YYNYSG+ KCF++ +D HGL W +Q
Sbjct: 297 LPGYPVEQMCKIIDGFPRGSSNLDRAFAAASLYYNYSGSEKCFEMEQQTDDHGLDGWQYQ 356
Query: 369 ACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKR 428
ACTEM+M N+ S+ E D +A C YGV PRP+WITTEFGG +I VLKR
Sbjct: 357 ACTEMVMPMSCSNQ-SMLPPYENDSEAFQEQCMTRYGVKPRPHWITTEFGGMRIETVLKR 415
Query: 429 FASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRRE 488
F SNIIF NG++DPWS GGVLKNIS +++ALV K+GAHH DLR +TK+DP+WLK+ RR+E
Sbjct: 416 FGSNIIFSNGMQDPWSRGGVLKNISSSIVALVTKKGAHHADLRAATKDDPEWLKEQRRQE 475
Query: 489 VEIIGKWISQYFQDLAQ 505
V II KWIS+Y++DL +
Sbjct: 476 VAIIEKWISEYYRDLRE 492
>gi|326513984|dbj|BAJ92142.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 535
Score = 562 bits (1448), Expect = e-157, Method: Compositional matrix adjust.
Identities = 267/454 (58%), Positives = 331/454 (72%), Gaps = 5/454 (1%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ Y Q LDHF + P + F Q+YL+NDT W P+FVYTGNEGDIEWFA NT
Sbjct: 85 FTAHYFQQELDHFTFTPNASNLFSQKYLLNDTFWRRKPAAGPLFVYTGNEGDIEWFATNT 144
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF++D+AP F ALLVFIEHR+YG+S P+G + +YK+A T GYL+STQALAD+A LI
Sbjct: 145 GFLFDIAPDFGALLVFIEHRFYGESKPFGND---SYKSADTLGYLTSTQALADFAVLITS 201
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK NL+ D+PVVVFGGSYGGMLA+WFRLKYPHVA+GA+ASSAPIL FD+I SF +
Sbjct: 202 LKHNLSTVDAPVVVFGGSYGGMLASWFRLKYPHVAMGAVASSAPILQFDDITPWSSFYDT 261
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAF 291
++QDF+S S NC+ VIK +W +++ GL +L K FR CK+ K+ ++ WLSTAF
Sbjct: 262 VSQDFKSESLNCFSVIKAAWDVLDDRGSNHTGLLELSKTFRACKTVKSADSLGDWLSTAF 321
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
YTAM DYPTP+NF+ LPA+PVKEMCK ID TG D+ K + AAS+YYNY+G KCF
Sbjct: 322 TYTAMVDYPTPANFMMNLPAYPVKEMCKIIDSFPTGADIIDKAFAAASLYYNYTGDQKCF 381
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
+ GD DPHGL W WQACTEM+M N +S+F S Y+ + C YGV PR N
Sbjct: 382 QVEGDDDPHGLDGWDWQACTEMVMPMIVSN-ESMFPPSSFSYENNSDACLADYGVRPRMN 440
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
WITTE+GGHKI VLKRF SNIIF NG+RDPWS GGVLKNIS +++ALV ++GAHH+D R
Sbjct: 441 WITTEYGGHKIDKVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTEKGAHHLDFR 500
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
TK+DP W+ + RR+EVEII WI QY +D+AQ
Sbjct: 501 SETKDDPDWVVEQRRQEVEIIHGWIDQYNKDIAQ 534
>gi|357123237|ref|XP_003563318.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Brachypodium
distachyon]
Length = 536
Score = 561 bits (1446), Expect = e-157, Method: Compositional matrix adjust.
Identities = 271/468 (57%), Positives = 341/468 (72%), Gaps = 13/468 (2%)
Query: 47 KDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW--------GGSKNNAPIFVY 98
K ++ ++ Y Q LDHF + P + + F Q+YL+NDT W G + P+FVY
Sbjct: 70 KKTKNPFRAHYFPQELDHFTFTPNASRIFYQKYLVNDTFWRKPTGKGRGAAAAPGPVFVY 129
Query: 99 TGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLS 158
TGNEGDIEWFA N+GF++D+APKF ALLVFIEHR+YG+S P+G +Y +A+T GYL+
Sbjct: 130 TGNEGDIEWFATNSGFLFDIAPKFSALLVFIEHRFYGESKPFGNE---SYGSAATLGYLT 186
Query: 159 STQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL 218
STQALAD+A LI LK NL+A +PVVVFGGSYGGMLA+WFRLKYPHVA+GALASSAPIL
Sbjct: 187 STQALADFAVLITSLKHNLSAPVAPVVVFGGSYGGMLASWFRLKYPHVAMGALASSAPIL 246
Query: 219 NFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSE 278
FD+I SF + +QDF+S S+NC+ VIK W ++E GL +L K FR CK+
Sbjct: 247 QFDDITPWSSFYDAASQDFKSESKNCFSVIKAVWDVLDERGSNDKGLLQLSKTFRACKTV 306
Query: 279 KNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA 337
K + ++ +WL TAFVYTAM DYPTP+NFL LPA+PVKEMCK ID G D+ K + A
Sbjct: 307 KYVDSLSNWLWTAFVYTAMVDYPTPANFLMNLPAYPVKEMCKIIDAFPPGADIVDKAFSA 366
Query: 338 ASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARA 397
AS+YYNY+G KCFD+ G DPHGLS WGWQACTEM+M NK S+F S Y+ ++
Sbjct: 367 ASLYYNYTGDQKCFDVEGGDDPHGLSGWGWQACTEMVMPMTVSNK-SMFPPSSFSYEEKS 425
Query: 398 RYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL 457
C +YGV PR +WITTE+GGHKI VLKRF SNIIF NG+RDPWS GGVLKNIS +++
Sbjct: 426 EGCLASYGVRPRMHWITTEYGGHKIDKVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSII 485
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
ALV ++GAHH+D R +TK+DP W+ + RR+EVEII WI QY +D+A
Sbjct: 486 ALVTEKGAHHLDFRTATKDDPDWVIEQRRQEVEIIQGWIDQYNKDIAH 533
>gi|326510399|dbj|BAJ87416.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 524
Score = 560 bits (1444), Expect = e-157, Method: Compositional matrix adjust.
Identities = 267/454 (58%), Positives = 331/454 (72%), Gaps = 5/454 (1%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ Y Q LDHF + P + F Q+YL+NDT W P+FVYTGNEGDIEWFA NT
Sbjct: 74 FTAHYFQQELDHFTFTPNASNLFSQKYLLNDTFWRRKPAAGPLFVYTGNEGDIEWFATNT 133
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF++D+AP F ALLVFIEHR+YG+S P+G + +YK+A T GYL+STQALAD+A LI
Sbjct: 134 GFLFDIAPDFGALLVFIEHRFYGESKPFGND---SYKSADTLGYLTSTQALADFAVLITS 190
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK NL+ D+PVVVFGGSYGGMLA+WFRLKYPHVA+GA+ASSAPIL FD+I SF +
Sbjct: 191 LKHNLSTVDAPVVVFGGSYGGMLASWFRLKYPHVAMGAVASSAPILQFDDITPWSSFYDT 250
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAF 291
++QDF+S S NC+ VIK +W +++ GL +L K FR CK+ K+ ++ WLSTAF
Sbjct: 251 VSQDFKSESLNCFSVIKAAWDVLDDRGSNHTGLLELSKTFRACKTVKSADSLGDWLSTAF 310
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
YTAM DYPTP+NF+ LPA+PVKEMCK ID TG D+ K + AAS+YYNY+G KCF
Sbjct: 311 TYTAMVDYPTPANFMMNLPAYPVKEMCKIIDSFPTGADIIDKAFAAASLYYNYTGDQKCF 370
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
+ GD DPHGL W WQACTEM+M N +S+F S Y+ + C YGV PR N
Sbjct: 371 QVEGDDDPHGLDGWDWQACTEMVMPMIVSN-ESMFPPSSFSYENNSDACLADYGVRPRMN 429
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
WITTE+GGHKI VLKRF SNIIF NG+RDPWS GGVLKNIS +++ALV ++GAHH+D R
Sbjct: 430 WITTEYGGHKIDKVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTEKGAHHLDFR 489
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
TK+DP W+ + RR+EVEII WI QY +D+AQ
Sbjct: 490 SETKDDPDWVVEQRRQEVEIIHGWIDQYNKDIAQ 523
>gi|326491047|dbj|BAK05623.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 511
Score = 560 bits (1443), Expect = e-157, Method: Compositional matrix adjust.
Identities = 259/453 (57%), Positives = 338/453 (74%), Gaps = 4/453 (0%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+T+Y+TQ LDHFN P SY TF QRYL+N T+WGG AP+FVY GNEG IE F NT
Sbjct: 59 YETRYYTQRLDHFNAAPVSYSTFPQRYLVNGTYWGG--KTAPVFVYAGNEGSIELFTNNT 116
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+++AP+F+A+LVFIEHRYYG+S+P+G ++E A+KNAST GYLS+TQA+AD+A+L+
Sbjct: 117 GFMWELAPRFRAMLVFIEHRYYGRSVPFG-SEEAAFKNASTMGYLSTTQAVADFATLVQS 175
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK NL+A +PVVVFGGSYGGMLAAW R+KYPHV IGA+ASSAPIL F + PY+F +I
Sbjct: 176 LKANLSAPAAPVVVFGGSYGGMLAAWMRMKYPHVVIGAVASSAPILGFYGMADPYAFYDI 235
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
I+ DF+S S+NC+ V+ SWK+++ G +L + F++C+ AI L TA V
Sbjct: 236 ISNDFKSESKNCHDVLMNSWKELDNALSNDAGRAQLNRTFKMCRGSTVEAIPDMLDTAIV 295
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFD 352
Y+AMTDYPT S FL LPA+PVKE+C+AID PK+G D F+++ A +VYYNY+G A CF
Sbjct: 296 YSAMTDYPTESGFLTHLPAYPVKEICRAIDHPKSGKDTFSRIKDALTVYYNYTGNAHCFG 355
Query: 353 LNGDSDPHGL-SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
+ DP+G+ + W WQACTEMI+++ G S+ ++ C+ + G+ PRP
Sbjct: 356 DASEDDPYGMFNGWDWQACTEMILMSYGVRNRSVLPPEPFNFTKLLDGCRASTGLPPRPY 415
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
WI TEFGG I VLKR ASNIIFFNGLRDPWS GGVLK+IS+++LALVE +G+HHVDLR
Sbjct: 416 WIPTEFGGFDIANVLKRSASNIIFFNGLRDPWSSGGVLKDISRSILALVEPKGSHHVDLR 475
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQDLA 504
FS+K+DP WLK VR +E II W++QY++D A
Sbjct: 476 FSSKDDPHWLKQVREKETRIIAHWLNQYYKDEA 508
>gi|148909163|gb|ABR17682.1| unknown [Picea sitchensis]
Length = 501
Score = 559 bits (1440), Expect = e-156, Method: Compositional matrix adjust.
Identities = 274/504 (54%), Positives = 355/504 (70%), Gaps = 29/504 (5%)
Query: 10 FCLLFSSTLTISNAKIFPTFPS-SRITPEK-----LSSLISSSKDSQGLYKTKYHTQILD 63
C++ ++I ++ P S R +P + +++++S+ S Y +Y+TQILD
Sbjct: 11 LCVVILILVSIGTSESVPLKSSLIRFSPTRKWNRGAATVLASASVSAAKYDVRYYTQILD 70
Query: 64 HFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFK 123
HF++ P+SYQTFQQ+YLIN HWGG+ +PIFVYTGNEG IEWF +NTGFM+D+AP+FK
Sbjct: 71 HFSFVPESYQTFQQKYLINSDHWGGASAKSPIFVYTGNEGFIEWFTENTGFMFDIAPQFK 130
Query: 124 ALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSP 183
A+LVFIEHR+YG S+P+G K AY N+ST G+LSS QALAD+A+LI DLKKNL+A DSP
Sbjct: 131 AMLVFIEHRFYGHSMPFGSQKA-AYSNSSTLGFLSSAQALADFATLITDLKKNLSAEDSP 189
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN 243
VVVF + G ++SSAPIL FDNI SF + +++DFRS SEN
Sbjct: 190 VVVF-----------------GGSYGGISSSAPILYFDNITPIGSFDDTVSEDFRSESEN 232
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES----WLSTAFVYTAMTDY 299
C+KVIKGSW I+E P GL+ L+KA RICKS + I WL A+ AMTDY
Sbjct: 233 CFKVIKGSWNVIDEMTSTPEGLKSLRKALRICKSNSDNYIAGNLAGWLYDAYYTAAMTDY 292
Query: 300 PTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDP 359
P +NF+ PLPA+PVK+MCKAID+P D+ ++LYG A+VYYNY+G + CFD+ SDP
Sbjct: 293 PVAANFVQPLPAYPVKQMCKAIDNPSGSTDLLSQLYGVANVYYNYTGRSSCFDIR-PSDP 351
Query: 360 HGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGG 419
HG W +QACTEM+M D K S+F S DY R C+ AYGV PR +WITT++GG
Sbjct: 352 HGEDGWQFQACTEMVMPMADDPKKSMFPNSTFDYQERVDSCESAYGVQPRRHWITTQYGG 411
Query: 420 HKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQ 479
H I VLK FASNIIFFNGLRDPWSGGGVL++I+++V+A+V KEGAHHVD RF+TK+DPQ
Sbjct: 412 HHIKRVLKNFASNIIFFNGLRDPWSGGGVLEDINESVVAIVAKEGAHHVDFRFATKDDPQ 471
Query: 480 WLKDVRRREVEIIGKWISQYFQDL 503
WLKD R +E+ II W+ QY+ +L
Sbjct: 472 WLKDARTKEISIIKSWLQQYYMEL 495
>gi|255565027|ref|XP_002523506.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537213|gb|EEF38845.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
Length = 501
Score = 558 bits (1438), Expect = e-156, Method: Compositional matrix adjust.
Identities = 264/452 (58%), Positives = 338/452 (74%), Gaps = 8/452 (1%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
YKT+Y Q+LDHF + P Y+ F Q+YLI+ +W APIFVYTGNEGDIEWFA NT
Sbjct: 50 YKTRYFPQLLDHFTFQPNGYKIFYQKYLISSQYW---HKEAPIFVYTGNEGDIEWFAANT 106
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ D+APKF+ALLVFIEHR+YG+S+P+G + +YK+A T GYL+S QALAD+A LI
Sbjct: 107 GFLLDIAPKFRALLVFIEHRFYGESMPFGKD---SYKSAETLGYLNSQQALADFAVLIRS 163
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK+NL++ SPVVVFGGSYGGMLA WFRLKYPH+AIGALASSAPIL FD+I SF +
Sbjct: 164 LKQNLSSEASPVVVFGGSYGGMLATWFRLKYPHIAIGALASSAPILQFDDITPWSSFYDA 223
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAF 291
++QDFR S NCY+VIKGSW ++E + + GL +L + FR CK +L ++ WL +A+
Sbjct: 224 VSQDFREASLNCYEVIKGSWAELETLSSQKEGLIELSRTFRTCKDLHSLDSVWDWLWSAY 283
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
VY+AM +YPT +NF+ PLPA+PVKEMCK ID G ++++ AAS+YYNYS KCF
Sbjct: 284 VYSAMVNYPTEANFMKPLPAYPVKEMCKIIDGFPAGASKVSRVFAAASLYYNYSRGEKCF 343
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
L D HGL W WQACTEM+M +K+S+F S DY A CK+ +GV PR +
Sbjct: 344 QLENVPDAHGLHGWNWQACTEMVMPMTC-SKESMFPPSGYDYKEFAEECKKKFGVMPRQH 402
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
WITTEFGG +I VLKRF SNIIF NG+ DPWS GGVLKNIS +++A+V K+GAHHVD R
Sbjct: 403 WITTEFGGKRIDKVLKRFGSNIIFSNGMEDPWSRGGVLKNISSSIIAIVTKKGAHHVDFR 462
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
+TK+DP WLK++R++E++IIG+WI +Y+ DL
Sbjct: 463 SATKDDPNWLKEMRKQEIQIIGRWIDEYYADL 494
>gi|312282209|dbj|BAJ33970.1| unnamed protein product [Thellungiella halophila]
Length = 494
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 277/501 (55%), Positives = 352/501 (70%), Gaps = 18/501 (3%)
Query: 10 FCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGL----YKTKYHTQILDHF 65
CL+F ++ + A T+P SSL + K S+ ++T Y Q LDHF
Sbjct: 5 LCLVFLFFISFAEA----TYPPGGF--HHFSSLRQNKKASKSKSELPFETLYFPQNLDHF 58
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
++ P+SY+ F Q+YLIN W + PIFVYTGNEGDI+WFA NTGFM D+APKF AL
Sbjct: 59 SFRPESYKVFHQKYLINSRFW---RKGGPIFVYTGNEGDIDWFASNTGFMSDIAPKFGAL 115
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVV 185
LVFIEHR+YG+S P+G ++K+A T GYL+S QALADYA LI LK+NL++ SPVV
Sbjct: 116 LVFIEHRFYGESTPFGKK---SHKSAETLGYLNSQQALADYAILIRSLKQNLSSEASPVV 172
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCY 245
VFGGSYGGMLAAWFRLKYPH+AIGALASSAPIL+FDNIV SF + I+QDF+ S NC+
Sbjct: 173 VFGGSYGGMLAAWFRLKYPHIAIGALASSAPILHFDNIVPLTSFYDAISQDFKDASVNCF 232
Query: 246 KVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPTPSN 304
+VIK SW+++E + GL +L K FR CK A WL +AF+YTAM +Y T +N
Sbjct: 233 EVIKRSWQELEAVSNMKHGLPELSKKFRTCKGLHSQYAARDWLMSAFIYTAMVNYATAAN 292
Query: 305 FLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE 364
F+ PLP +PV++MCK ID G+ + + AAS+YYNYSG+ KCF+L +D HGL
Sbjct: 293 FMAPLPGYPVEQMCKIIDGFPRGSSNLDRAFAAASLYYNYSGSEKCFELEQPTDDHGLDG 352
Query: 365 WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGL 424
WGWQACTEM+M N+ S+F + DY+A C YGV PRP+WITTEFGG +I
Sbjct: 353 WGWQACTEMVMPMSCSNQ-SMFPPYDNDYEAFKEQCMSRYGVKPRPHWITTEFGGKRIET 411
Query: 425 VLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDV 484
VLKRF SNIIF NG++DPWS GGVLKNIS +++ALV K+GAHH DLR ++K DP+WLK+
Sbjct: 412 VLKRFGSNIIFSNGMQDPWSRGGVLKNISSSIIALVTKKGAHHADLRAASKGDPEWLKEQ 471
Query: 485 RRREVEIIGKWISQYFQDLAQ 505
RR+EV II KWIS+Y + L +
Sbjct: 472 RRQEVAIIEKWISEYHRALRE 492
>gi|297802936|ref|XP_002869352.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297315188|gb|EFH45611.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 497
Score = 555 bits (1430), Expect = e-155, Method: Compositional matrix adjust.
Identities = 275/498 (55%), Positives = 349/498 (70%), Gaps = 15/498 (3%)
Query: 10 FCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGL-YKTKYHTQILDHFNYN 68
CL+F ++ AK +P S + + +S S SK ++T+Y Q LDHF++
Sbjct: 8 LCLVFLFLTVVAEAK-YPPGGSYHLFSLRQNSKTSKSKAELPFHFQTRYFPQNLDHFSFQ 66
Query: 69 PQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVF 128
P+SY+ F Q+YLI+ W + PIFVYTGNEGDIEWFA NTGFM D+APKF+ALLVF
Sbjct: 67 PESYRIFHQKYLISSHFW---RKGGPIFVYTGNEGDIEWFASNTGFMLDIAPKFQALLVF 123
Query: 129 IEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFG 188
IEHR+YG+S P+ A T GYL+S QALADYA LI LK+NL++ SPVVVFG
Sbjct: 124 IEHRFYGESKPHNL--------AKTLGYLNSQQALADYAILIRSLKQNLSSEASPVVVFG 175
Query: 189 GSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVI 248
GSYGGMLAAWFRLKYPH+ IGALASSAPIL FD IV SF N+++QDF+ S NC++VI
Sbjct: 176 GSYGGMLAAWFRLKYPHITIGALASSAPILQFDKIVPSSSFYNVVSQDFKDASLNCFEVI 235
Query: 249 KGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES-WLSTAFVYTAMTDYPTPSNFLN 307
K SW+++E + GL++L K FR CK + + S WL TAF TAM +YPTP+NF+
Sbjct: 236 KKSWRELEVFSTMKDGLQELSKKFRTCKDLHAVYLASRWLETAFTDTAMVNYPTPANFMA 295
Query: 308 PLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGW 367
PLPA+PV+EMCK ID + + AAS+YYNYSG+ CFD+ +DPHGL+ W W
Sbjct: 296 PLPAYPVEEMCKIIDWFPLEASNLDRAFAAASLYYNYSGSENCFDIENQTDPHGLNGWYW 355
Query: 368 QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLK 427
QACTEM+M N+ S+F+ E D C + YGV PRP+WITTEFGGH+I +VLK
Sbjct: 356 QACTEMVMPISCSNQ-SMFQPFEYDEKVDQEDCLKEYGVKPRPHWITTEFGGHRIEMVLK 414
Query: 428 RFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRR 487
RF SNIIF NG++DPWS GVLKNIS +++A V K+GAHH DLR +TK+DP+WLK+ RR+
Sbjct: 415 RFGSNIIFSNGMQDPWSREGVLKNISSSIIAFVTKKGAHHTDLRAATKDDPEWLKEQRRQ 474
Query: 488 EVEIIGKWISQYFQDLAQ 505
EV I KWIS+Y+ DL Q
Sbjct: 475 EVAEIEKWISEYYSDLRQ 492
>gi|359484787|ref|XP_003633162.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Vitis vinifera]
gi|297735899|emb|CBI18675.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 269/452 (59%), Positives = 334/452 (73%), Gaps = 8/452 (1%)
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
T+Y Q+LDHF + P+S F Q+YLIN +W + APIFVYTGNEGDI+WFA NTGF
Sbjct: 58 TQYFPQLLDHFTFTPKSSTIFYQKYLINTQYW---THGAPIFVYTGNEGDIDWFASNTGF 114
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+ D+AP F+A+LVFIEHR+YG+S+P+G KE +YK+ T GYL+S QALAD+A LI LK
Sbjct: 115 LLDIAPSFRAMLVFIEHRFYGESMPFG--KE-SYKSPETLGYLNSQQALADFAVLIRSLK 171
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
+NL++ SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL FD+I SF + ++
Sbjct: 172 QNLSSEASPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILQFDDITPWSSFYDAVS 231
Query: 235 QDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAFVY 293
QDF+ S NCY+VIKGSW +++ + K GGL ++ + FR CK ++ + WL +AFVY
Sbjct: 232 QDFKEASLNCYEVIKGSWAELDAMSAKEGGLAEVSRTFRTCKDINSVYSARDWLWSAFVY 291
Query: 294 TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL 353
TAM +YPT +NF+ PLPA+PV+EMCK ID G ++ + AAS+YYNYSGT KCFDL
Sbjct: 292 TAMVNYPTKANFMMPLPAYPVEEMCKIIDRFPHGATNVSRAFAAASLYYNYSGTEKCFDL 351
Query: 354 NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWI 413
D HGL W WQACTEM+M N +S+F S +Y A C YGV PRP+WI
Sbjct: 352 ENGKDAHGLHGWDWQACTEMVMPLTCSN-ESMFPPSSFEYKEFADECTRKYGVMPRPHWI 410
Query: 414 TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFS 473
TTEFGG +I VLKR ASNIIF NG++DPWS G VLKNIS +++ALV K+GAHHVD RF+
Sbjct: 411 TTEFGGSRIEQVLKRSASNIIFSNGMQDPWSRGSVLKNISASIVALVTKKGAHHVDFRFA 470
Query: 474 TKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
TKEDP WL + RR+EVEI+ KWI Y DL Q
Sbjct: 471 TKEDPDWLIEQRRQEVEILQKWIHDYNADLKQ 502
>gi|242096520|ref|XP_002438750.1| hypothetical protein SORBIDRAFT_10g025520 [Sorghum bicolor]
gi|241916973|gb|EER90117.1| hypothetical protein SORBIDRAFT_10g025520 [Sorghum bicolor]
Length = 558
Score = 555 bits (1429), Expect = e-155, Method: Compositional matrix adjust.
Identities = 271/459 (59%), Positives = 332/459 (72%), Gaps = 10/459 (2%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW-----GGSKNNAPIFVYTGNEGDIEW 107
+ Y Q LDHF + P + F+Q+YL+NDT W G P+FVYTGNEGDIEW
Sbjct: 101 FTVHYFAQELDHFTFTPNASMVFRQKYLLNDTFWRRPSAGDGDGAGPLFVYTGNEGDIEW 160
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
FA NTGFM+D+APKF ALLVFIEHR+YG+S P+G + +YK+A T GYL+STQALAD+A
Sbjct: 161 FATNTGFMFDIAPKFGALLVFIEHRFYGESKPFGND---SYKSAETLGYLTSTQALADFA 217
Query: 168 SLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
LI LKKNL+A +PVVVFGGSYGGMLA+WFRLKYPHV IGA+ASSAPIL FD I
Sbjct: 218 ILIRSLKKNLSAEAAPVVVFGGSYGGMLASWFRLKYPHVTIGAVASSAPILQFDYITPWS 277
Query: 228 SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK-NLAIESW 286
SF + ++QDF+S S NC+ VIKG+W ++E GL L K FR CK+ K +I +W
Sbjct: 278 SFYDGVSQDFKSESLNCFSVIKGTWDVLDERGSTDKGLLDLSKLFRACKTVKYAYSIRNW 337
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSG 346
L TAF YTAM DYPTP+NFL LPA+PVKEMCK ID TG D+ K + AAS+YYNY+G
Sbjct: 338 LWTAFSYTAMVDYPTPANFLENLPAYPVKEMCKIIDGFPTGADILEKAFAAASLYYNYTG 397
Query: 347 TAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
C + DPHGL W WQACTEMIM N +S+F S YD R+ C +++GV
Sbjct: 398 DQTCNKIEDGDDPHGLDGWQWQACTEMIMPMTVSN-ESMFPPSSFSYDERSDECFQSWGV 456
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PRP+WITTE+GG+KI VLKRF SNIIF NG+RDPWS GGVLKNIS +++ALV ++GAH
Sbjct: 457 RPRPHWITTEYGGYKIDKVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTEKGAH 516
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
H+D R STK DP W+ + RR+EV+II WI QY QD+A+
Sbjct: 517 HLDFRSSTKGDPDWVIEQRRQEVDIIQGWIDQYHQDMAE 555
>gi|326499536|dbj|BAJ86079.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 526
Score = 554 bits (1428), Expect = e-155, Method: Compositional matrix adjust.
Identities = 267/454 (58%), Positives = 329/454 (72%), Gaps = 5/454 (1%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ Y Q LDHF + P + F Q+YL+NDT W P+FVYTGNEGDIEWFA NT
Sbjct: 76 FTAHYFQQELDHFTFTPNASNLFSQKYLLNDTFWRRKPAAGPLFVYTGNEGDIEWFATNT 135
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+D+AP F ALLVFIEHR+YG+S P+G + +YK+A T GYL+STQALAD+A LI
Sbjct: 136 GFMFDIAPDFGALLVFIEHRFYGESKPFGND---SYKSADTLGYLTSTQALADFAVLITS 192
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK+NL+A D+PVVVFGGSYGGMLA+WFRLKYPHVAIGALASSAPIL F++I SF
Sbjct: 193 LKQNLSAVDAPVVVFGGSYGGMLASWFRLKYPHVAIGALASSAPILQFEDITPWSSFYEA 252
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSE-KNLAIESWLSTAF 291
+++DF+S S NC+ VIK W + GL +L K FR CK+ ++ WLSTAF
Sbjct: 253 VSEDFKSESLNCFSVIKAVWDVLTVRGSNDTGLLELSKTFRACKTVLLPNSLLDWLSTAF 312
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
YTAM DYPTP+NF+ LPA+PVKEMCK ID G DV K + AAS+YYNY+G KCF
Sbjct: 313 TYTAMVDYPTPANFMQNLPAYPVKEMCKIIDSFPAGADVVEKAFAAASLYYNYTGDQKCF 372
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
++ G DPHGLS WGWQACTEM+M N +S+F S Y+ ++ C AY V PR +
Sbjct: 373 EVEGGDDPHGLSGWGWQACTEMVMPMTVSN-ESMFPPSGFSYEEKSEGCIAAYDVRPRMH 431
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
WITTE+GGHKI VLKRF SNIIF N +RDPWS GGVLKNIS +++ALV ++GAHH+D R
Sbjct: 432 WITTEYGGHKIDKVLKRFGSNIIFSNEMRDPWSRGGVLKNISSSIIALVTEKGAHHLDFR 491
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+TK+DP W+ + RR+EVEII WI QY +D+AQ
Sbjct: 492 SATKDDPDWVVEQRRQEVEIIHGWIDQYNKDIAQ 525
>gi|118487801|gb|ABK95724.1| unknown [Populus trichocarpa]
Length = 500
Score = 553 bits (1424), Expect = e-154, Method: Compositional matrix adjust.
Identities = 271/497 (54%), Positives = 349/497 (70%), Gaps = 10/497 (2%)
Query: 12 LLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGL--YKTKYHTQILDHFNYNP 69
LL ++ + + P+ P SL K S+ YKT Y Q+LDHF + P
Sbjct: 7 LLLPVFAILATLPVIQSVPTFFPRPSYDQSLAKQPKASKPKIPYKTHYFPQVLDHFTFQP 66
Query: 70 QSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFI 129
+S + F Q+YL+N +W APIFVYTGNEGDIEWFA NTGF+ D+APKF+ALLVFI
Sbjct: 67 KSSKIFYQKYLVNSHYW---HRGAPIFVYTGNEGDIEWFAANTGFLLDIAPKFRALLVFI 123
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGG 189
EHR+YG+S+P+G +YK+A T GYL+S QALAD+A LI LK NL++ SPVVVFGG
Sbjct: 124 EHRFYGESMPFGNK---SYKSAETLGYLNSQQALADFALLIRSLKHNLSSEASPVVVFGG 180
Query: 190 SYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIK 249
SYGGMLAAWFRLKYPH+AIGALASSAPIL FD+I SF + ++QDF+ S NCY+VIK
Sbjct: 181 SYGGMLAAWFRLKYPHIAIGALASSAPILQFDDITPWSSFYDAVSQDFKEASLNCYEVIK 240
Query: 250 GSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNP 308
GSW ++E + + GL +L + FR C+ +L ++ WL +AFVYTAM +YPT +NF+ P
Sbjct: 241 GSWAELEALSAQNEGLAELSRTFRACQDLHSLDSVWEWLWSAFVYTAMVNYPTEANFMMP 300
Query: 309 LPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQ 368
LPA+PV+ MCK ID +G +++ AAS+YYNYS KCF L D HGL W WQ
Sbjct: 301 LPAYPVQAMCKIIDGFPSGASKITRVFAAASLYYNYSRAEKCFKLEHGPDAHGLHGWNWQ 360
Query: 369 ACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKR 428
ACTEM+M +++S+F S Y A C + +GV PRP+WITTEFGG +I LVLKR
Sbjct: 361 ACTEMVMPMTC-SEESMFPTSSFSYKEFAEDCMKTFGVKPRPHWITTEFGGKRIDLVLKR 419
Query: 429 FASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRRE 488
NIIF NG++DPWS GGVLKNIS +++ALV ++GAHHVD R +TK+DP+WLK++RR+E
Sbjct: 420 SGGNIIFSNGMQDPWSRGGVLKNISSSIIALVTEKGAHHVDFRSATKDDPEWLKELRRQE 479
Query: 489 VEIIGKWISQYFQDLAQ 505
VEII WI QY+ DL +
Sbjct: 480 VEIIQGWIDQYYPDLKR 496
>gi|224142419|ref|XP_002324555.1| predicted protein [Populus trichocarpa]
gi|222865989|gb|EEF03120.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 263/452 (58%), Positives = 334/452 (73%), Gaps = 8/452 (1%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
YKT Y Q+LDHF + P+S + F Q+YL+N +W APIFVYTGNEGDIEWFA NT
Sbjct: 14 YKTHYFPQVLDHFTFQPKSSKIFYQKYLVNSHYW---HRGAPIFVYTGNEGDIEWFAANT 70
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ D+APKF+ALLVFIEHR+YG+S+P+G +YK+A T GYL+S QALAD+A LI
Sbjct: 71 GFLLDIAPKFRALLVFIEHRFYGESMPFGNK---SYKSAETLGYLNSQQALADFALLIRS 127
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK NL++ SPVVVFGGSYGGMLAAWFRLKYPH+AIGALASSAPIL FD+I SF +
Sbjct: 128 LKHNLSSEASPVVVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFDDITPWSSFYDA 187
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAF 291
++QDF+ S NCY+VIKGSW ++E + + GL +L + FR C+ +L ++ WL +AF
Sbjct: 188 VSQDFKEASLNCYEVIKGSWAELEALSAQNEGLAELSRTFRACQDLHSLDSVWEWLWSAF 247
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
VYTAM +YPT +NF+ PLPA+PV+ MCK ID +G +++ AAS+YYNYS KCF
Sbjct: 248 VYTAMVNYPTEANFMMPLPAYPVQAMCKIIDGFPSGASKITRVFAAASLYYNYSRAEKCF 307
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
L D HGL W WQACTEM+M +++S+F S Y A C + +GV PRP+
Sbjct: 308 KLEHGPDAHGLHGWNWQACTEMVMPMTC-SEESMFPTSSFSYKEFAEDCMKTFGVKPRPH 366
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
WITTEFGG +I LVLKR NIIF NG++DPWS GGVLKNIS +++ALV ++GAHHVD R
Sbjct: 367 WITTEFGGKRIDLVLKRSGGNIIFSNGMQDPWSRGGVLKNISSSIIALVTEKGAHHVDFR 426
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
+TK+DP+WLK++RR+EVEII WI QY+ DL
Sbjct: 427 SATKDDPEWLKELRRQEVEIIQGWIDQYYPDL 458
>gi|449443023|ref|XP_004139280.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
gi|449493651|ref|XP_004159394.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 499
Score = 551 bits (1419), Expect = e-154, Method: Compositional matrix adjust.
Identities = 261/454 (57%), Positives = 345/454 (75%), Gaps = 9/454 (1%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++T+++ Q+LDHF + P+S + F Q+YLIN+ +W +N APIFVYTGNEGDIEWFA NT
Sbjct: 49 FETRFYPQLLDHFTFTPKSSKIFYQKYLINEEYW---RNGAPIFVYTGNEGDIEWFAANT 105
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ D+AP+F ALLVFIEHR+YG+S P+G + +Y +A T GYL+S QALADYA LI
Sbjct: 106 GFLPDIAPEFHALLVFIEHRFYGESTPFGND---SYNSAETLGYLTSQQALADYAVLIRS 162
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK+NL++ SPVVVFGGSYGGMLAAWFRLKYPH+ IGALASSAPIL+FDNIV SF +
Sbjct: 163 LKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPWSSFYDA 222
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAF 291
++QDF+ S NC++VIKGSW ++++ + G L +L K FR CK+ +++ ++ WL +AF
Sbjct: 223 VSQDFKDASLNCFEVIKGSWTELQQEFSEEG-LAELSKTFRTCKNLHSVSSVQDWLWSAF 281
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
VYT+M +YPT +NF+ PLPA+PV+EMCK ID K + AAS+YYNYS KCF
Sbjct: 282 VYTSMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLNKAFAAASLYYNYSHGEKCF 341
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
++ D HGLS W WQACTEM+M N+ S+F S+ DY+ A CK+ YGV PRP+
Sbjct: 342 NVENGPDLHGLSGWNWQACTEMVMPMTCSNQ-SMFPPSKFDYEEFATDCKKKYGVSPRPH 400
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
WITTE+GG +I VLKRF SNIIF NG++DPWS GGVL+NIS +++ALV ++GAHHVD R
Sbjct: 401 WITTEYGGERIEEVLKRFGSNIIFSNGMQDPWSRGGVLRNISTSIVALVTEKGAHHVDFR 460
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+TK+DP WL + RR+EVEII +WI++++ D+ Q
Sbjct: 461 SATKDDPDWLVEQRRQEVEIIHQWINEHYADMKQ 494
>gi|148909204|gb|ABR17702.1| unknown [Picea sitchensis]
Length = 509
Score = 548 bits (1412), Expect = e-153, Method: Compositional matrix adjust.
Identities = 256/457 (56%), Positives = 338/457 (73%), Gaps = 4/457 (0%)
Query: 47 KDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE 106
+ SQ Y+TKY TQ LDHF++ TFQQRYLIND +W G++ PIF Y GNEG I+
Sbjct: 48 QSSQYEYETKYFTQRLDHFSFKNHKNSTFQQRYLINDKYWLGAERMGPIFYYCGNEGYID 107
Query: 107 WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADY 166
WFA NTGFM+D+AP+F ALLVF EHRYYG+S+PYG ++ +AYK+ + YL++ QALAD+
Sbjct: 108 WFAVNTGFMWDIAPQFGALLVFPEHRYYGESMPYG-SQSMAYKDGDSLSYLTAEQALADF 166
Query: 167 ASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSP 226
A+LI+DLKKNL+A PVV+FGGSYGGMLAAW RLKYPH+AIGALASSAPIL F++IV
Sbjct: 167 ATLIVDLKKNLSAEACPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPS 226
Query: 227 YSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIES 285
+F N+++ DF+ SENC+KVI+ SWK +E ++ GL+ L K FR+C+ + IE
Sbjct: 227 DTFYNLVSNDFKRESENCFKVIQQSWKALETYGERDEGLQNLSKKFRMCRDLNSTDEIED 286
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYS 345
WL++A+ AM DYP P++FL PLPA+P+KE+CK ID G DV +++ SVYYNY+
Sbjct: 287 WLNSAYSNLAMVDYPYPASFLMPLPAYPIKEVCKVIDSFSNGTDVLDRIFAGVSVYYNYT 346
Query: 346 GTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
G KCFD+N DPHG + W WQACTEM+M + + SIF + D ++ + C +G
Sbjct: 347 GEEKCFDVN--DDPHGENGWNWQACTEMVMPMSSNPESSIFPQFTFDIESYTKNCLNMFG 404
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
V+PRP+WITTEFGG I VLK + SNIIF NGL DPWSGGGVL+NIS +++ALV + GA
Sbjct: 405 VEPRPHWITTEFGGQDIKRVLKNYGSNIIFSNGLLDPWSGGGVLQNISNSIVALVTELGA 464
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
HH+DLR +T+ DP WL + R E++II KW+++Y+QD
Sbjct: 465 HHLDLRAATENDPLWLVEQRNAEMKIINKWMNEYYQD 501
>gi|226503900|ref|NP_001142279.1| uncharacterized protein LOC100274448 precursor [Zea mays]
gi|194689380|gb|ACF78774.1| unknown [Zea mays]
gi|194707984|gb|ACF88076.1| unknown [Zea mays]
gi|413943525|gb|AFW76174.1| putative serine peptidase S28 family protein [Zea mays]
Length = 542
Score = 546 bits (1406), Expect = e-152, Method: Compositional matrix adjust.
Identities = 277/500 (55%), Positives = 344/500 (68%), Gaps = 19/500 (3%)
Query: 21 SNAKIFPTFPSSRITPEKLSSLISSSKD--------SQGLYKTKYHTQILDHFNYNPQSY 72
SN+ P F + +L SSS + S + Y Q+LDHF + P +
Sbjct: 44 SNSNPRPAFTPPLLKRHQLRRPSSSSAELVAGPANASTKPFTAHYFPQLLDHFAFTPNAS 103
Query: 73 QTFQQRYLINDTHW------GGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALL 126
F+ +YL+NDT W G P+FVYTGNEGDIEWFA NTGFM+D+AP F ALL
Sbjct: 104 TVFRHKYLLNDTFWRRPGAGAGDDGPGPLFVYTGNEGDIEWFATNTGFMFDIAPTFGALL 163
Query: 127 VFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVV 186
VFIEHR+YG+S P+G + +Y++A T GYL+STQALAD+A +I LK++L A +PVVV
Sbjct: 164 VFIEHRFYGESKPFGND---SYRSAETLGYLTSTQALADFAVVIRGLKRDLGAEAAPVVV 220
Query: 187 FGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYK 246
FGGSYGGMLA+WFRLKYPHVAIGALASSAPIL FD+I SF + ++QDF+S S NC+
Sbjct: 221 FGGSYGGMLASWFRLKYPHVAIGALASSAPILQFDHITPWSSFYDAVSQDFKSESSNCFG 280
Query: 247 VIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK-NLAIESWLSTAFVYTAMTDYPTPSNF 305
VI+ +W ++E GL L K FR CK+ K +I +WL TAF YTAM DYPTP+NF
Sbjct: 281 VIRAAWDVLDERGATDKGLLDLSKLFRACKTVKYAYSIRNWLWTAFTYTAMVDYPTPANF 340
Query: 306 LNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEW 365
L LPA+PVKEMCK ID G DV K + AAS+YYNY+G C + DPHGL W
Sbjct: 341 LENLPAYPVKEMCKTIDAFPAGADVLEKAFAAASLYYNYTGDQACNKIEDGDDPHGLDGW 400
Query: 366 GWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLV 425
WQACTEMIM N +S+F S YD R+ C +++GV PRP+WITTE+GG+KI V
Sbjct: 401 QWQACTEMIMPMTISN-ESMFPPSAFSYDDRSDECFQSWGVRPRPHWITTEYGGYKIDKV 459
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
LKRF SNIIF NG+RDPWS GGVLKNIS +++ALV ++GAHH+DLR STK DP WL + R
Sbjct: 460 LKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTEKGAHHLDLRSSTKGDPDWLIEQR 519
Query: 486 RREVEIIGKWISQYFQDLAQ 505
R+EVEII WI QY QD+A+
Sbjct: 520 RQEVEIIQGWIDQYHQDMAE 539
>gi|357152979|ref|XP_003576298.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Brachypodium
distachyon]
Length = 517
Score = 545 bits (1405), Expect = e-152, Method: Compositional matrix adjust.
Identities = 257/497 (51%), Positives = 351/497 (70%), Gaps = 13/497 (2%)
Query: 16 STLTISNAKIFPTFPSSRITP-----EKLSSLISSSKD----SQGLYKTKYHTQILDHFN 66
S T+S A PT+ R T + +++++ +Q Y+TKY+ Q LDHF+
Sbjct: 18 SFFTVSGATAPPTYRRRRHTSPLTHIHRRRPIVAAAAGGGVPTQVRYETKYYEQRLDHFD 77
Query: 67 YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALL 126
P SY+TF QRYL+N T+WGG +P+F+Y GNEG++E F NTGFM+++AP+F+ALL
Sbjct: 78 ALPASYRTFPQRYLVNGTYWGG--KTSPVFLYAGNEGNVELFTNNTGFMWELAPRFRALL 135
Query: 127 VFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVV 186
+F+EHRYYGKS P+G ++E A++N ST GYL++TQA+AD A+L+ LK NL+A +PV+V
Sbjct: 136 LFVEHRYYGKSFPFG-SEEAAFRNTSTVGYLTTTQAVADLATLVQSLKSNLSAHAAPVIV 194
Query: 187 FGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYK 246
FGGSYGGMLAAW R+KYPHV +GA+ASSAPIL F + PY+F ++I+ DF+S S+NC+
Sbjct: 195 FGGSYGGMLAAWVRMKYPHVVMGAVASSAPILGFYGLADPYAFYDVISNDFKSESKNCHD 254
Query: 247 VIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL 306
V+ SW ++++ G L F++C++ AI L TA Y+AMTDYPT S FL
Sbjct: 255 VLMKSWGELDKALSNDAGRADLNSTFKMCRASTVDAIPDLLDTALTYSAMTDYPTSSGFL 314
Query: 307 NPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE-W 365
PLP +PVKEMC+AID PK+GND FA++ GA VYYN++G C +SDP+G+ + W
Sbjct: 315 TPLPPYPVKEMCRAIDHPKSGNDTFARIKGALDVYYNHTGAEPCLGDATESDPYGMFDGW 374
Query: 366 GWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLV 425
WQACTEMI++T G ++F ++ C+ + G+ PRP WI TEFGG I V
Sbjct: 375 DWQACTEMILMTYGVRNGTVFPPEPFNFTDLLAGCRASTGLPPRPYWIPTEFGGFDIKHV 434
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
L+R ASNIIFFNGLRDPWS GGVLK+IS +++ALVE +G+HHVDLRFSTKEDP+WLK VR
Sbjct: 435 LRRSASNIIFFNGLRDPWSSGGVLKSISNSIIALVEPKGSHHVDLRFSTKEDPEWLKQVR 494
Query: 486 RREVEIIGKWISQYFQD 502
+E II W+ QY+++
Sbjct: 495 IKETRIIAHWLRQYYKE 511
>gi|218198653|gb|EEC81080.1| hypothetical protein OsI_23902 [Oryza sativa Indica Group]
Length = 539
Score = 545 bits (1405), Expect = e-152, Method: Compositional matrix adjust.
Identities = 264/459 (57%), Positives = 327/459 (71%), Gaps = 10/459 (2%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW-----GGSKNNAPIFVYTGNEGDIEW 107
+ Y Q LDHF + P + F Q+YL+NDT W G PIFVYTGNEGDIEW
Sbjct: 82 FTAHYFPQELDHFTFTPNASALFYQKYLVNDTFWRRSAAAGETPAGPIFVYTGNEGDIEW 141
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
FA NTGFM+ +AP F ALLVFIEHR+YG+S P+G + GYL+STQALAD+A
Sbjct: 142 FATNTGFMFHIAPSFGALLVFIEHRFYGESKPFGNESN---SSPEKLGYLTSTQALADFA 198
Query: 168 SLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
LI LK NL+A SPVVVFGGSYGGMLA+WFRLKYPHV IGA+ASSAPIL FD I
Sbjct: 199 VLITSLKHNLSAVSSPVVVFGGSYGGMLASWFRLKYPHVTIGAVASSAPILQFDYITPWS 258
Query: 228 SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESW 286
SF + ++QD++S S NC+ VIK +W I+E GL +L K FR CK+ K++ + +W
Sbjct: 259 SFYDAVSQDYKSESFNCFSVIKAAWDLIDEKGSTDAGLLQLSKTFRACKTVKSVYSFRNW 318
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSG 346
L TAFVYTAM DYPTP+NFL LPA+P+KEMCK ID G D+ K + AAS+YYNY+G
Sbjct: 319 LWTAFVYTAMVDYPTPANFLMNLPAYPIKEMCKIIDGFPAGADIVDKAFAAASLYYNYTG 378
Query: 347 TAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
CF L DPHGLS WGWQACTEM+M N +S+F Y+ ++ C ++YGV
Sbjct: 379 DQTCFQLEDGEDPHGLSGWGWQACTEMVMPMTISN-ESMFPPFTFTYEGKSDDCFQSYGV 437
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PRP+WITTE+GG++I LVLKRF SNIIF NG+RDPWS GGVLKNIS +++ALV ++GAH
Sbjct: 438 RPRPHWITTEYGGNRIDLVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTEKGAH 497
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
H+D R +TK+DP W+ + RR+EV+II WI QY +DLAQ
Sbjct: 498 HLDFRSATKDDPDWVVEQRRQEVKIIQGWIDQYNEDLAQ 536
>gi|115469214|ref|NP_001058206.1| Os06g0647400 [Oryza sativa Japonica Group]
gi|51535425|dbj|BAD37324.1| putative prolylcarboxypeptidase isoform 1 [Oryza sativa Japonica
Group]
gi|113596246|dbj|BAF20120.1| Os06g0647400 [Oryza sativa Japonica Group]
gi|215737061|dbj|BAG95990.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 539
Score = 545 bits (1405), Expect = e-152, Method: Compositional matrix adjust.
Identities = 264/459 (57%), Positives = 326/459 (71%), Gaps = 10/459 (2%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW-----GGSKNNAPIFVYTGNEGDIEW 107
+ Y Q LDHF + P + F Q+YL+NDT W G PIFVYTGNEGDIEW
Sbjct: 82 FTAHYFPQELDHFTFTPNASAVFYQKYLVNDTFWRRSAAAGETPAGPIFVYTGNEGDIEW 141
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
FA NTGFM+D+AP F ALLVFIEHR+YG+S P+G + GYL+STQALAD+A
Sbjct: 142 FATNTGFMFDIAPSFGALLVFIEHRFYGESKPFGNESN---SSPEKLGYLTSTQALADFA 198
Query: 168 SLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
LI LK NL+A SPVVVFGGSYGGMLA+WFRLKYPHV IGA+ASSAPIL FD I
Sbjct: 199 VLITSLKHNLSAVSSPVVVFGGSYGGMLASWFRLKYPHVTIGAVASSAPILQFDYITPWS 258
Query: 228 SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESW 286
SF ++QD++S S NC+ VIK +W I+E GL +L K FR CK+ K++ + +W
Sbjct: 259 SFYEAVSQDYKSESFNCFSVIKAAWDLIDERGSTDAGLLQLSKTFRACKTVKSVYSFRNW 318
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSG 346
L TAFVYTAM DYPTP+NFL LPA+P+KEMCK I G D+ K + AAS+YYNY+G
Sbjct: 319 LWTAFVYTAMVDYPTPANFLMNLPAYPIKEMCKIIHGFPAGADIVDKAFAAASLYYNYTG 378
Query: 347 TAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
CF L DPHGLS WGWQACTEM+M N +S+F Y+ ++ C ++YGV
Sbjct: 379 DQTCFQLEDGEDPHGLSGWGWQACTEMVMPMTISN-ESMFPPFTFTYEGKSDDCFQSYGV 437
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PRP+WITTE+GG++I LVLKRF SNIIF NG+RDPWS GGVLKNIS +++ALV ++GAH
Sbjct: 438 RPRPHWITTEYGGNRIDLVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTEKGAH 497
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
H+D R +TK+DP W+ + RR+EV+II WI QY +DLAQ
Sbjct: 498 HLDFRSATKDDPDWVVEQRRQEVKIIQGWIDQYNEDLAQ 536
>gi|195643982|gb|ACG41459.1| lysosomal Pro-X carboxypeptidase precursor [Zea mays]
Length = 542
Score = 544 bits (1401), Expect = e-152, Method: Compositional matrix adjust.
Identities = 276/500 (55%), Positives = 344/500 (68%), Gaps = 19/500 (3%)
Query: 21 SNAKIFPTFPSSRITPEKLSSLISSSKD--------SQGLYKTKYHTQILDHFNYNPQSY 72
SN+ P F + +L SSS + S + Y Q+LDHF + P +
Sbjct: 44 SNSNPRPAFTPPLLKRHQLRRPSSSSAELVAGPANASTKPFTAHYFPQLLDHFAFTPNAS 103
Query: 73 QTFQQRYLINDTHW------GGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALL 126
F+ +YL+NDT W G P+FVYTGNEGDIEWFA NTGFM+D+AP F ALL
Sbjct: 104 TVFRHKYLLNDTFWRRPGAGAGDDGPGPLFVYTGNEGDIEWFATNTGFMFDIAPTFGALL 163
Query: 127 VFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVV 186
VFIEH++YG+S P+G + +Y++A T GYL+STQALAD+A +I LK++L A +PVVV
Sbjct: 164 VFIEHQFYGESKPFGND---SYRSAETLGYLTSTQALADFAVVIRGLKRDLGAEAAPVVV 220
Query: 187 FGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYK 246
FGGSYGGMLA+WFRLKYPHVAIGALASSAPIL FD+I SF + ++QDF+S S NC+
Sbjct: 221 FGGSYGGMLASWFRLKYPHVAIGALASSAPILQFDHITPWSSFYDAVSQDFKSESSNCFG 280
Query: 247 VIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK-NLAIESWLSTAFVYTAMTDYPTPSNF 305
VI+ +W ++E GL L K FR CK+ K +I +WL TAF YTAM DYPTP+NF
Sbjct: 281 VIRAAWDVLDERGATDKGLLDLSKLFRACKTVKYAYSIRNWLWTAFTYTAMVDYPTPANF 340
Query: 306 LNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEW 365
L LPA+PVKEMCK ID G DV K + AAS+YYNY+G C + DPHGL W
Sbjct: 341 LENLPAYPVKEMCKTIDAFPAGADVLEKAFAAASLYYNYTGDQACNKIEDGDDPHGLDGW 400
Query: 366 GWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLV 425
WQACTEMIM N +S+F S YD R+ C +++GV PRP+WITTE+GG+KI V
Sbjct: 401 QWQACTEMIMPMTISN-ESMFPPSAFSYDDRSDECFQSWGVRPRPHWITTEYGGYKIDKV 459
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
LKRF SNIIF NG+RDPWS GGVLKNIS +++ALV ++GAHH+DLR STK DP WL + R
Sbjct: 460 LKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTEKGAHHLDLRSSTKGDPDWLIEQR 519
Query: 486 RREVEIIGKWISQYFQDLAQ 505
R+EVEII WI QY QD+A+
Sbjct: 520 RQEVEIIQGWIDQYHQDMAE 539
>gi|413924933|gb|AFW64865.1| putative serine peptidase S28 family protein [Zea mays]
Length = 534
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 250/469 (53%), Positives = 329/469 (70%), Gaps = 21/469 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+T+++TQ LDHFN P SY TFQQRYL+NDT WGG APIF+Y GNEGD++ F NT
Sbjct: 63 YETRWYTQRLDHFNSAPASYATFQQRYLVNDTFWGGP--TAPIFLYAGNEGDVDLFTNNT 120
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM++ AP+F+ALLVF+EHRYYG+S+P+GG + A+++A T GYL+ TQALADYAS ++
Sbjct: 121 GFMWESAPRFRALLVFVEHRYYGESMPFGGTRAAAFRDARTKGYLTVTQALADYASFVLS 180
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK NL+A +PVVVFGGSYGGMLAAW RLKYPH+ +GA+ASSAPIL+F IV PY+F +
Sbjct: 181 LKANLSAPAAPVVVFGGSYGGMLAAWMRLKYPHIVMGAVASSAPILSFYGIVDPYAFYDR 240
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
IT DF+S S++CY V++ SW +++ G +L++ F +C I S L +A V
Sbjct: 241 ITDDFKSESKHCYDVLRKSWDVLDDALATKEGQAQLRRTFTMCNGSSVQDIPSLLESAVV 300
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK------------------TGNDVFAKL 334
Y AMTDYPTPS FL PLPA+PV+ MC+AID + T +++
Sbjct: 301 YAAMTDYPTPSGFLTPLPAYPVRAMCRAIDASRAESAEASSGAANDGNSNSTAQLTLSQV 360
Query: 335 YGAASVYYNYSGTAKCFDLNGDSDPHGLSE-WGWQACTEMIMLTGGDNKDSIFEESEEDY 393
A VYYN++G A CF D DPHG+ + W WQACTE++++ G + + S ++
Sbjct: 361 RDAMDVYYNHTGAAACFRAEEDDDPHGMYDGWNWQACTEVMVMAYGIRDGGVLQPSPFNF 420
Query: 394 DARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS 453
C+ G+ PRP WI TEFGG IG VLK+ ASNI+FFNGLRDPWS GGVLK+IS
Sbjct: 421 TDVVDSCRNYTGLPPRPFWIETEFGGFDIGNVLKKSASNIVFFNGLRDPWSTGGVLKSIS 480
Query: 454 KTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
+++ALVE +G+HHVDLRFS+KEDP+WLK VR +E II +W+ QY+ D
Sbjct: 481 DSIIALVEPKGSHHVDLRFSSKEDPEWLKQVRVKETRIIARWLKQYYSD 529
>gi|242070053|ref|XP_002450303.1| hypothetical protein SORBIDRAFT_05g003460 [Sorghum bicolor]
gi|241936146|gb|EES09291.1| hypothetical protein SORBIDRAFT_05g003460 [Sorghum bicolor]
Length = 553
Score = 531 bits (1369), Expect = e-148, Method: Compositional matrix adjust.
Identities = 251/475 (52%), Positives = 329/475 (69%), Gaps = 25/475 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+T+ +TQ LDHFN P SY TFQQRYLINDT WGG APIF+Y GNEGDI+ F NT
Sbjct: 74 YETRRYTQRLDHFNSLPSSYATFQQRYLINDTFWGGRSRTAPIFLYAGNEGDIDLFTNNT 133
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM++ AP+F+A+LVF+EHRYYG+S+P+GG +E A+++A+T GYL+ TQALADYAS ++
Sbjct: 134 GFMWEAAPRFRAMLVFVEHRYYGESMPFGGTREAAFRDAATKGYLTVTQALADYASFVLS 193
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK NL+ +PVVVFGGSYGGMLAAW RLKYPHV +GA+ASSAPIL+F IV PY+F +
Sbjct: 194 LKANLSVPAAPVVVFGGSYGGMLAAWMRLKYPHVVMGAVASSAPILSFYGIVDPYAFYDR 253
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
I D++S S+NCY V++ SW +++ G +L++ F +C I S L A V
Sbjct: 254 INDDYKSESKNCYDVLRKSWDVLDDALATKEGQAQLRRTFNMCNGSSVWDIPSLLENAMV 313
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAID-----------DPKTGND------------ 329
AMTDYPT S FL PLPA+PV+ MC+AID +GND
Sbjct: 314 EAAMTDYPTTSGFLTPLPAYPVRAMCRAIDARHQQAAASTSTSGSGNDDGNNSSSSTALL 373
Query: 330 -VFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE-WGWQACTEMIMLTGGDNKDSIFE 387
+ ++ A +VYYN++G A CF D DP+GL + W WQACTE++++ G ++ +
Sbjct: 374 LLSEQVRDAMNVYYNHTGGAACFRAEEDDDPYGLYDGWNWQACTEVMVMAYGVRDGTVLQ 433
Query: 388 ESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
+ ++ C+ G+ PRP WI TEFGG+ I VLK+ ASNIIFFNGLRDPWS GG
Sbjct: 434 PAPFNFTEVVDDCRNVTGLPPRPFWIETEFGGYDIANVLKKSASNIIFFNGLRDPWSTGG 493
Query: 448 VLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
VLK+IS +++ALVE +GAHHVDLRFS+KEDP+WLK VR +E II +W+ QY+ D
Sbjct: 494 VLKSISDSIIALVEPKGAHHVDLRFSSKEDPEWLKKVRVKETRIIARWLKQYYSD 548
>gi|224095007|ref|XP_002310325.1| predicted protein [Populus trichocarpa]
gi|222853228|gb|EEE90775.1| predicted protein [Populus trichocarpa]
Length = 515
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 256/491 (52%), Positives = 329/491 (67%), Gaps = 10/491 (2%)
Query: 13 LFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSY 72
L S L ++K P F S P K + Q Y++KY Q LDHF++ +
Sbjct: 27 LASQPLNHLSSKRAPRFLSKHSYPIK----TQLQEQQQYRYESKYFYQQLDHFSF--LNL 80
Query: 73 QTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHR 132
F QRYLIN HW G + PIF+Y GNEGDIEWFA NTGF++++AP F A+++F EHR
Sbjct: 81 PKFPQRYLINTDHWAGPERRGPIFLYCGNEGDIEWFAVNTGFVWEIAPLFGAMVLFPEHR 140
Query: 133 YYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYG 192
YYG+S+PYG N+E AYKNAST YL++ QALAD+A LI DLK+NL+A PVV+FGGSYG
Sbjct: 141 YYGESMPYG-NREEAYKNASTLSYLTAEQALADFAVLITDLKRNLSAQACPVVLFGGSYG 199
Query: 193 GMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSW 252
GMLAAW RLKYPHVAIGALASSAPIL F++IV P +F NI++ DF+ S +C+ IK SW
Sbjct: 200 GMLAAWMRLKYPHVAIGALASSAPILQFEDIVPPETFYNIVSNDFKRESTSCFNTIKESW 259
Query: 253 KQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPA 311
+ K GL +L K F +C+ K+ + +WL +A+ Y AM DYP PS+F+ PLP
Sbjct: 260 DALLSEGLKKNGLVQLTKTFHLCRELKSTEDLANWLDSAYSYLAMVDYPYPSSFMMPLPG 319
Query: 312 FPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACT 371
+P+ E+CK ID G + +++ S+YYNY+G CF+L D DPHGL W WQACT
Sbjct: 320 YPIGEVCKRIDGCPDGTSILERIFEGISIYYNYTGELHCFEL--DDDPHGLDGWNWQACT 377
Query: 372 EMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFAS 431
EM+M + S+F + +Y + C E +GV PRP WITTEFGG I L+ F S
Sbjct: 378 EMVMPMSSSHNASMFPTYDFNYSSYQEGCWEEFGVIPRPRWITTEFGGQDIKTALETFGS 437
Query: 432 NIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEI 491
NIIF NGL DPWSGG VL+NIS+TV+ALV +EGAHH+DLR ST EDP WL + R EV++
Sbjct: 438 NIIFSNGLLDPWSGGSVLQNISETVVALVTEEGAHHIDLRPSTPEDPDWLVEQRETEVKL 497
Query: 492 IGKWISQYFQD 502
I WI Y ++
Sbjct: 498 IKGWIDGYLKE 508
>gi|115484263|ref|NP_001065793.1| Os11g0156200 [Oryza sativa Japonica Group]
gi|62701876|gb|AAX92949.1| At2g24280/F27D4.19 [Oryza sativa Japonica Group]
gi|77548743|gb|ABA91540.1| Serine carboxypeptidase S28 family protein, expressed [Oryza sativa
Japonica Group]
gi|113644497|dbj|BAF27638.1| Os11g0156200 [Oryza sativa Japonica Group]
Length = 511
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 244/456 (53%), Positives = 326/456 (71%), Gaps = 7/456 (1%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+T+Y TQ LDHFN P S TF+QRYL+N T WGG+ AP+FVY GNEGD+ FA NT
Sbjct: 53 YETRYFTQRLDHFNELPASNGTFRQRYLVNGTFWGGAA--APVFVYAGNEGDVALFASNT 110
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM++ AP+F+A+LVF+EHRYYG+S+P+GG + A+ +AS GYL++ QALAD+A LI+
Sbjct: 111 GFMWEAAPRFRAMLVFVEHRYYGESLPFGGTRAAAFADASAAGYLTTAQALADFAELILS 170
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK NLTA +PVV+FGGSYGGMLAAW R+KYPH+ +GA+ASSAPIL + + PYSF N+
Sbjct: 171 LKSNLTACKAPVVIFGGSYGGMLAAWMRMKYPHIVMGAVASSAPILGLNGLSDPYSFYNV 230
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
++ DF+S S++CY V++ SW ++ + G +L + F +CK + I + A +
Sbjct: 231 VSNDFKSESKHCYDVLRNSWSEMYKALATDAGRARLNQTFNMCKGNVD-DIPGLVEKALI 289
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFD 352
Y +M DYPTPSNFL LPA+PV+E+C+AID P +GND +++ A ++YYN +G CF
Sbjct: 290 YGSMMDYPTPSNFLTSLPAYPVREICRAIDKPTSGNDTVSRIKDAMTIYYNSTGGLACFP 349
Query: 353 LNG--DSDPHGL-SEWGWQACTEMIM-LTGGDNKDSIFEESEEDYDARARYCKEAYGVDP 408
G D DP+G+ W WQACTE+IM ++ G ++F + A C GV P
Sbjct: 350 GAGAEDDDPYGMFPGWTWQACTEVIMTMSYGIGNATVFPPDPFNLTAYLAGCLATTGVPP 409
Query: 409 RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHV 468
RP+WI + FGG+ I VLKR SNIIFFNGLRDPWS GG+LK+IS +++ALVE +G HHV
Sbjct: 410 RPHWIQSYFGGYDIRNVLKRSGSNIIFFNGLRDPWSAGGILKSISNSIIALVEPKGGHHV 469
Query: 469 DLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLA 504
DLRFSTKEDP+WLK VRR+E+ II W+ QY+ D A
Sbjct: 470 DLRFSTKEDPEWLKKVRRQEMRIIADWLKQYYSDEA 505
>gi|125533461|gb|EAY80009.1| hypothetical protein OsI_35177 [Oryza sativa Indica Group]
Length = 511
Score = 517 bits (1332), Expect = e-144, Method: Compositional matrix adjust.
Identities = 244/456 (53%), Positives = 325/456 (71%), Gaps = 7/456 (1%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+T+Y TQ LDHFN P S TF+QRYL+N T WGG+ AP+FVY GNEGD+ FA NT
Sbjct: 53 YETRYFTQRLDHFNELPASNGTFRQRYLVNGTFWGGAA--APVFVYAGNEGDVALFASNT 110
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM++ AP+F+A+LVF+EHRYYG+S+P+GG + A+ +AS GYL+ QALAD+A LI+
Sbjct: 111 GFMWEAAPRFRAMLVFVEHRYYGESLPFGGTRAAAFADASAAGYLTPAQALADFAELILS 170
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK NLTA +PVV+FGGSYGGMLAAW R+KYPH+ +GA+ASSAPIL + + PYSF N+
Sbjct: 171 LKSNLTACKAPVVIFGGSYGGMLAAWMRMKYPHIVMGAVASSAPILGLNGLSDPYSFYNV 230
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
++ DF+S S++CY V++ SW ++ + G +L + F +CK + I + A +
Sbjct: 231 VSNDFKSESKHCYDVLRNSWSEMYKALATDAGRARLNQTFNMCKGNVD-DIPGLVEKALI 289
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFD 352
Y +M DYPTPSNFL LPA+PV+E+C+AID P +GND +++ A ++YYN +G CF
Sbjct: 290 YGSMMDYPTPSNFLTSLPAYPVREICRAIDKPTSGNDTVSRIKDAMTIYYNSTGGLACFP 349
Query: 353 LNG--DSDPHGL-SEWGWQACTEMIM-LTGGDNKDSIFEESEEDYDARARYCKEAYGVDP 408
G D DP+G+ W WQACTE+IM ++ G ++F + A C GV P
Sbjct: 350 GAGAEDDDPYGMFPGWTWQACTEVIMTMSYGIGNATVFPPDPFNLTAYLADCLATTGVPP 409
Query: 409 RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHV 468
RP+WI + FGG+ I VLKR SNIIFFNGLRDPWS GG+LK+IS +++ALVE +G HHV
Sbjct: 410 RPHWIQSYFGGYDIRNVLKRSGSNIIFFNGLRDPWSAGGILKSISNSIIALVEPKGGHHV 469
Query: 469 DLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLA 504
DLRFSTKEDP+WLK VRR+E+ II W+ QY+ D A
Sbjct: 470 DLRFSTKEDPEWLKKVRRQEMRIIADWLKQYYSDEA 505
>gi|22328106|ref|NP_201377.2| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|95147306|gb|ABF57288.1| At5g65760 [Arabidopsis thaliana]
gi|110736177|dbj|BAF00060.1| lysosomal Pro-X carboxypeptidase [Arabidopsis thaliana]
gi|332010719|gb|AED98102.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 515
Score = 515 bits (1327), Expect = e-143, Method: Compositional matrix adjust.
Identities = 258/507 (50%), Positives = 337/507 (66%), Gaps = 15/507 (2%)
Query: 1 MATRF----IFLSFCLLFSST-LTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYK- 54
MA+ F IF F L+F S ++S++K+ P FP R T + + I + + Y+
Sbjct: 1 MASHFCLLLIFTFFTLVFPSNGSSLSSSKLLPRFP--RYTFQNREARIQQFRGDRNEYRY 58
Query: 55 -TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
TK+ +Q LDHF++ F QRYLIN HW G+ PIF+Y GNEGDIEWFA N+G
Sbjct: 59 ETKFFSQQLDHFSF--ADLPKFSQRYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSG 116
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
F++D+APKF ALLVF EHRYYG+S+PYG +E AYKNA+T YL++ QALAD+A + DL
Sbjct: 117 FIWDIAPKFGALLVFPEHRYYGESMPYGSREE-AYKNATTLSYLTTEQALADFAVFVTDL 175
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K+NL+A PVV+FGGSYGGMLAAW RLKYPH+AIGALASSAPIL F+++V P +F +I
Sbjct: 176 KRNLSAEACPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDVVPPETFYDIA 235
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAFV 292
+ DF+ S +C+ IK SW I +K GL +L K F C+ + + WL +A+
Sbjct: 236 SNDFKRESSSCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYS 295
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFD 352
Y AM DYP P++F+ PLP P++E+C+ ID + + ++Y SVYYNY+G CF
Sbjct: 296 YLAMVDYPYPADFMMPLPGHPIREVCRKIDGAGSNASILDRIYAGISVYYNYTGNVDCFK 355
Query: 353 LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNW 412
L D DPHGL W WQACTEM+M + ++S+F +Y + C + V+PRP W
Sbjct: 356 L--DDDPHGLDGWNWQACTEMVMPMSSNQENSMFPGYGFNYSSYKEECWNTFRVNPRPKW 413
Query: 413 ITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRF 472
+TTEFGGH I LK F SNIIF NGL DPWSGG VLKN+S T++ALV KEGAHH+DLR
Sbjct: 414 VTTEFGGHDIATTLKSFGSNIIFSNGLLDPWSGGSVLKNLSDTIVALVTKEGAHHLDLRP 473
Query: 473 STKEDPQWLKDVRRREVEIIGKWISQY 499
ST EDP+WL D R E+ +I WI Y
Sbjct: 474 STPEDPKWLVDQREAEIRLIQGWIETY 500
>gi|168047270|ref|XP_001776094.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672604|gb|EDQ59139.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 505
Score = 513 bits (1322), Expect = e-143, Method: Compositional matrix adjust.
Identities = 243/453 (53%), Positives = 328/453 (72%), Gaps = 6/453 (1%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y TQ++DHF++ ++ +FQQRYLI +W G+ + PIF+Y GNEGD+EWFA+NT
Sbjct: 39 YAVDYFTQVIDHFSFRREA--SFQQRYLIEKRYWKGAADRGPIFMYCGNEGDVEWFAKNT 96
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF++++AP F AL++F EHRYYGKS+PYG E +YK+A + L+S QALAD+A+L+ID
Sbjct: 97 GFLWEIAPSFGALILFPEHRYYGKSMPYG-TMEASYKDADSLSTLTSEQALADFATLVID 155
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LKKNLTA SPVV+FGGSYGGMLA+W RLKYPH+AIGA+A+SAPIL F++IV +F I
Sbjct: 156 LKKNLTAAASPVVLFGGSYGGMLASWMRLKYPHIAIGAVAASAPILQFEDIVPSDTFYKI 215
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAF 291
++ DF+ S +C+ I+ SW I++ A K GGL L F +C+ + +E+WLS+A+
Sbjct: 216 VSADFKRESASCFNYIRESWGVIDKIASKNGGLHDLSTQFHMCRDLNASWELENWLSSAY 275
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
Y AM DYP P+NF+ PLPA+PV+E+C+ ID G+D+ ++++ ASVYYNYSG A+CF
Sbjct: 276 SYVAMVDYPIPTNFITPLPAYPVREICRVIDSLPEGSDILSRIFAGASVYYNYSGQAECF 335
Query: 352 D-LNGDSDPHGLSEWGWQ-ACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR 409
+ +D G++ W WQ ACTEMIM ++ +S+F+ + D + +YC + YGV PR
Sbjct: 336 QPSDPGNDDLGVTGWDWQQACTEMIMPMSSNSSNSMFQPYDWDLEGNIQYCMKTYGVRPR 395
Query: 410 PNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVD 469
PNWITT +GG I VLK F SNI+F NGL DPWSGGGVL+NIS +++ALV EGAHH+D
Sbjct: 396 PNWITTNYGGKDIKAVLKDFGSNIVFSNGLLDPWSGGGVLENISSSIIALVAPEGAHHLD 455
Query: 470 LRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
LR STK DP WL + R EV I KWI+ Y D
Sbjct: 456 LRASTKIDPDWLVEQRATEVAYITKWIAAYNSD 488
>gi|413954869|gb|AFW87518.1| putative serine peptidase S28 family protein [Zea mays]
Length = 1052
Score = 513 bits (1320), Expect = e-142, Method: Compositional matrix adjust.
Identities = 253/488 (51%), Positives = 324/488 (66%), Gaps = 40/488 (8%)
Query: 51 GLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW---GGSKNNAPIFVYTGNEGDIEW 107
G + Y Q LDHF++ P + F Q+YL+NDT W GG P+ VY G E DIE
Sbjct: 77 GPFTVHYFQQELDHFSFTPNASTVFYQKYLVNDTFWRRPGGGGTAGPLLVYVGGEADIEC 136
Query: 108 FAQNTGFMYDVAPKFKALLVFIEH---------------------------RYYGKSIPY 140
A N GFM+D+AP F ALLVF+EH R+YG+S+P+
Sbjct: 137 IAHNVGFMFDIAPTFGALLVFVEHIEYIFGDLNIGPQKDMARVVWWSKRKHRFYGESLPF 196
Query: 141 GGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFR 200
G N +A GYL+STQALAD A LI DLK+NL+A SPVV+FGGSYGGMLA+WFR
Sbjct: 197 GNN------SAQALGYLTSTQALADLAILITDLKRNLSAETSPVVIFGGSYGGMLASWFR 250
Query: 201 LKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAK 260
LKYPHV IGALASSAPIL FD I SF ++++QD++S S NC+ VIK +W +EE
Sbjct: 251 LKYPHVTIGALASSAPILQFDYITPWSSFYDVVSQDYKSESLNCFSVIKAAWDVLEERGS 310
Query: 261 KPGGLEKLQKAFRICKSEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK 319
GL +L K FR CK+ K +I WL TAFV AM DYPTP++FL LPA+PVKEMCK
Sbjct: 311 NGNGLVELSKLFRACKTVKYADSIRRWLRTAFVSIAMMDYPTPASFLENLPAYPVKEMCK 370
Query: 320 AIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPH--GLSEWGWQACTEMIMLT 377
+D D+ K++ AAS+YYNY+G C + + +P L+ WGWQACTE++M
Sbjct: 371 IVDGFPADADILEKVFAAASLYYNYTGDQTCNQIEDEGNPRCLNLNYWGWQACTELMMPM 430
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFN 437
N DS+F Y+ +A YC + +GV PRP+WITTE+GG++I VLKRF SNIIF N
Sbjct: 431 SSAN-DSMFPPHTFSYEDKANYCLQTFGVRPRPHWITTEYGGYRIDEVLKRFGSNIIFSN 489
Query: 438 GLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G+RDPWS GGVLKNIS +++ALV ++GAHH+DLR +TK+DP W+ + RR+EVEII WI
Sbjct: 490 GMRDPWSRGGVLKNISSSIVALVTEKGAHHLDLRSATKDDPDWVTEQRRQEVEIIHGWID 549
Query: 498 QYFQDLAQ 505
QY++D+AQ
Sbjct: 550 QYYRDMAQ 557
>gi|356563482|ref|XP_003549991.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 513
Score = 512 bits (1319), Expect = e-142, Method: Compositional matrix adjust.
Identities = 242/459 (52%), Positives = 317/459 (69%), Gaps = 6/459 (1%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
+S Q Y+ +Y Q LDHF+++ TF QRYLI+ HW G PIF Y GNEG
Sbjct: 52 NSEPPPQFHYEKRYFQQRLDHFSFS--ELPTFPQRYLISTEHWVGPHRLGPIFFYCGNEG 109
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
DIEWFAQNTGF++++AP+F A++VF EHRYYG+S+PYG +E AYKNA+T YL++ QAL
Sbjct: 110 DIEWFAQNTGFVWEIAPRFGAMVVFPEHRYYGESVPYGSAEE-AYKNATTLSYLTAEQAL 168
Query: 164 ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
AD++ LI LK N +A D PVV+FGGSYGGMLAAW RLKYPH+A+GALASSAPIL F++I
Sbjct: 169 ADFSVLITYLKHNYSAKDCPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDI 228
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-KSEKNLA 282
V P +F ++++ F+ S C+ IK SW +I T + GLE L K F +C K ++
Sbjct: 229 VPPETFYDLVSNAFKRESFTCFNYIKQSWNEIASTGQTNNGLELLTKTFNLCQKLKRTKD 288
Query: 283 IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY 342
+ W A+ Y AM +YP P+ F+ LP P++E+C+ ID G + ++Y +VYY
Sbjct: 289 LYDWAEAAYSYLAMVNYPYPAEFMMTLPEHPIREVCRRIDGGPAGTSILERIYEGVNVYY 348
Query: 343 NYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKE 402
NY+G AKCF+L D DPHG+S W WQACTEM+M + S+F E +Y + C +
Sbjct: 349 NYTGEAKCFEL--DDDPHGMSGWEWQACTEMVMPMSSSQESSMFPPYEYNYTSIQAECLK 406
Query: 403 AYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
+GV PRP WITTEFGGH I LK+F SNIIF NGL DPWSGGGVL+NIS++V++LV +
Sbjct: 407 KFGVKPRPRWITTEFGGHDIHATLKKFGSNIIFSNGLLDPWSGGGVLQNISESVVSLVTE 466
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
EGAHH+DLR STK DP WL + R E+++I WIS Y Q
Sbjct: 467 EGAHHIDLRSSTKNDPDWLVEQRETEIKLIEGWISDYHQ 505
>gi|413954870|gb|AFW87519.1| putative serine peptidase S28 family protein [Zea mays]
Length = 552
Score = 511 bits (1315), Expect = e-142, Method: Compositional matrix adjust.
Identities = 270/499 (54%), Positives = 342/499 (68%), Gaps = 37/499 (7%)
Query: 40 SSLISSSKD----SQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAP- 94
+ L+S++ D + + T Y Q LDHF + P + F+Q+YL+NDT W +
Sbjct: 56 AKLVSAAADGTNSTAQPFTTHYFPQELDHFTFTPNASMLFRQKYLVNDTFWRRPRRGGGG 115
Query: 95 ----IFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKN 150
+FVYTGNEGDIEWFA NTGFM+D+APKF ALLVFIEHR+YG+S+P+G + +Y +
Sbjct: 116 GAGPLFVYTGNEGDIEWFATNTGFMFDIAPKFGALLVFIEHRFYGESLPFGDD---SYSS 172
Query: 151 ASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGA 210
A T GYL+STQALAD+A LI LK+NL+A +PVVVFGGSYGGMLA+WFRLKYPHVAIGA
Sbjct: 173 AETEGYLTSTQALADFAILITGLKRNLSAETAPVVVFGGSYGGMLASWFRLKYPHVAIGA 232
Query: 211 LASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQK 270
LASSAPIL FD+I SFS+ ++QD++S S NC+ VIK +W ++E GL +L K
Sbjct: 233 LASSAPILQFDHITPWNSFSDAVSQDYKSESLNCFSVIKAAWDVLDERGSTDTGLLELSK 292
Query: 271 AFRICKSEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKE---MCKAIDDPKT 326
FR CKS K+ +I WL TAF YTAM DYPTP+NFL LPA+PVKE MCK ID
Sbjct: 293 LFRACKSVKHADSIAGWLQTAFTYTAMVDYPTPANFLMDLPAYPVKEPIQMCKIIDGFPA 352
Query: 327 GNDVFAKLYGAASVYYNYSGTAKCFDLN----GDSDPHGLSEWGWQACTEMIMLTGGDNK 382
G D+ K++ AA++YYNY+G C + S GLS WGWQACTEMIM N
Sbjct: 353 GADILEKVFAAANLYYNYTGDQACNQIESDSDDSSSSLGLSGWGWQACTEMIMPMSTSNA 412
Query: 383 DSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDP 442
S+F S Y+ + C ++ GV PRP+WITTE+GG++I VLKRF SNIIF NG+RDP
Sbjct: 413 -SMFPPSSFSYEDTSNACFQSTGVRPRPHWITTEYGGYRIDKVLKRFGSNIIFSNGMRDP 471
Query: 443 WSGGGVLKNISKTVLALVEKEG----------------AHHVDLRFSTKEDPQWLKDVRR 486
WS GGVLKNIS +++ALV ++G AHH+DLRF+T EDP W+ + RR
Sbjct: 472 WSRGGVLKNISSSIVALVTEKGEHGNLRRKKKKKKKKWAHHLDLRFATDEDPDWVTEQRR 531
Query: 487 REVEIIGKWISQYFQDLAQ 505
+EVEII WI QY +D+AQ
Sbjct: 532 QEVEIIEGWIDQYHRDMAQ 550
>gi|20197495|gb|AAM15096.1| putative prolylcarboxypeptidase [Arabidopsis thaliana]
Length = 476
Score = 508 bits (1308), Expect = e-141, Method: Compositional matrix adjust.
Identities = 259/497 (52%), Positives = 338/497 (68%), Gaps = 28/497 (5%)
Query: 10 FCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNP 69
CL+F ++ A P ++ +L +S SK + ++T+Y Q LDHF++ P
Sbjct: 5 LCLVFLFFSIVAEATYSPG-GFHHLSSLRLKKKVSKSKH-ELPFETRYFPQNLDHFSFTP 62
Query: 70 QSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFI 129
SY+ F Q+YLIN+ W + PIFVYTGNEGDI+WFA NTGFM D+APKF+ALLVFI
Sbjct: 63 DSYKVFHQKYLINNRFW---RKGGPIFVYTGNEGDIDWFASNTGFMLDIAPKFRALLVFI 119
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGG 189
EHR+YG+S P+G ++K+A T GYL+S QALADYA LI LK+NL++ SPVVVFGG
Sbjct: 120 EHRFYGESTPFGKK---SHKSAETLGYLNSQQALADYAILIRSLKQNLSSEASPVVVFGG 176
Query: 190 SYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIK 249
SYGGMLAAWFRLKYPH+ IGALASSAPIL+FDNIV SF + I+QDF+ S NC+KVIK
Sbjct: 177 SYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPLTSFYDAISQDFKDASINCFKVIK 236
Query: 250 GSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPTPSNFLNP 308
SW+++E + GL++L K FR CK + WLS AFVYTAM +YPT +NF+ P
Sbjct: 237 RSWEELEAVSTMKNGLQELSKKFRTCKGLHSQYSARDWLSGAFVYTAMVNYPTAANFMAP 296
Query: 309 LPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQ 368
LP +PV++MCK ID G+ + + AAS+YYNYSG+ KCF++ +D HGL W +Q
Sbjct: 297 LPGYPVEQMCKIIDGFPRGSSNLDRAFAAASLYYNYSGSEKCFEMEQQTDDHGLDGWQYQ 356
Query: 369 ACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKR 428
ACTEM+M N+ S+ E D +A C YGV PRP+WITTEFGG ++
Sbjct: 357 ACTEMVMPMSCSNQ-SMLPPYENDSEAFQEQCMTRYGVKPRPHWITTEFGG-----MMDY 410
Query: 429 FASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRRE 488
F + IF VLKNIS +++ALV K+GAHH DLR +TK+DP+WLK+ RR+E
Sbjct: 411 F--HQIF-----------RVLKNISSSIVALVTKKGAHHADLRAATKDDPEWLKEQRRQE 457
Query: 489 VEIIGKWISQYFQDLAQ 505
V II KWIS+Y++DL +
Sbjct: 458 VAIIEKWISEYYRDLRE 474
>gi|359497044|ref|XP_002263389.2| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
gi|296085719|emb|CBI29519.3| unnamed protein product [Vitis vinifera]
Length = 510
Score = 506 bits (1302), Expect = e-140, Method: Compositional matrix adjust.
Identities = 245/493 (49%), Positives = 331/493 (67%), Gaps = 8/493 (1%)
Query: 10 FCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNP 69
F +L S+ TI++ + S+ P L ++ Y+T+Y Q LDHF+
Sbjct: 17 FIVLTSAPPTIASEAATKGY--SKSIPRFLGKFAYPNRGKPFQYETRYFEQRLDHFSI-- 72
Query: 70 QSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFI 129
F+QRYLI+ HW G PIF+Y GNEGDIEWFA NTGF++D+AP+F A+++F
Sbjct: 73 ADLPKFRQRYLISTRHWTGPDRMGPIFLYCGNEGDIEWFAANTGFVWDMAPRFGAMVLFP 132
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGG 189
EHRYYG+S+PYG +++ AY NA++ YL++ QALAD+A L+ +LK+NL+A PVV+FGG
Sbjct: 133 EHRYYGESMPYG-SRDKAYANAASLSYLTAEQALADFAVLVTNLKRNLSAEGCPVVLFGG 191
Query: 190 SYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIK 249
SYGGMLAAW RLKYPH+AIGALASSAPIL F++IV P +F +I++ +F+ S +C+ IK
Sbjct: 192 SYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYDIVSNNFKRESISCFDTIK 251
Query: 250 GSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNP 308
SW + +K GL++L KAFR+C+ K + WL +A+ + AM +YP PS+FL P
Sbjct: 252 KSWDVLISEGQKNDGLKQLTKAFRLCRDLKRTEDLYDWLDSAYSFLAMVNYPYPSDFLMP 311
Query: 309 LPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQ 368
LP P+KE+C+ +D G V +++ SVYYNY+G +CF L D DPHG+ W WQ
Sbjct: 312 LPGHPIKEVCRKMDSCPEGTSVLERIFEGVSVYYNYTGKVECFQL--DDDPHGMDGWNWQ 369
Query: 369 ACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKR 428
ACTEM+M + S+F + +Y + C + + V PRP WITTEFGGH+ LK
Sbjct: 370 ACTEMVMPMASSRESSMFPTYDYNYSSFQEECWKDFSVKPRPTWITTEFGGHEFKTTLKV 429
Query: 429 FASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRRE 488
F SNIIF NGL DPWSGG VL+NIS+TV+ALV +EGAHH+DLR ST EDP WL + R E
Sbjct: 430 FGSNIIFSNGLLDPWSGGSVLQNISETVVALVTEEGAHHIDLRSSTAEDPDWLVEQRAFE 489
Query: 489 VEIIGKWISQYFQ 501
V++I WI Y Q
Sbjct: 490 VKLIKGWIEDYHQ 502
>gi|10177334|dbj|BAB10683.1| lysosomal Pro-X carboxypeptidase [Arabidopsis thaliana]
Length = 529
Score = 504 bits (1298), Expect = e-140, Method: Compositional matrix adjust.
Identities = 257/521 (49%), Positives = 337/521 (64%), Gaps = 29/521 (5%)
Query: 1 MATRF----IFLSFCLLFSST-LTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYK- 54
MA+ F IF F L+F S ++S++K+ P FP R T + + I + + Y+
Sbjct: 1 MASHFCLLLIFTFFTLVFPSNGSSLSSSKLLPRFP--RYTFQNREARIQQFRGDRNEYRY 58
Query: 55 -TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
TK+ +Q LDHF++ F QRYLIN HW G+ PIF+Y GNEGDIEWFA N+G
Sbjct: 59 ETKFFSQQLDHFSF--ADLPKFSQRYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSG 116
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
F++D+APKF ALLVF EHRYYG+S+PYG +E AYKNA+T YL++ QALAD+A + DL
Sbjct: 117 FIWDIAPKFGALLVFPEHRYYGESMPYGSREE-AYKNATTLSYLTTEQALADFAVFVTDL 175
Query: 174 KKNLTATDSPVVVFGGSYGG--------------MLAAWFRLKYPHVAIGALASSAPILN 219
K+NL+A PVV+FGGSYGG +LAAW RLKYPH+AIGALASSAPIL
Sbjct: 176 KRNLSAEACPVVLFGGSYGGSNNCVFVFVVIDATVLAAWMRLKYPHIAIGALASSAPILQ 235
Query: 220 FDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK 279
F+++V P +F +I + DF+ S +C+ IK SW I +K GL +L K F C+
Sbjct: 236 FEDVVPPETFYDIASNDFKRESSSCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLN 295
Query: 280 NLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAA 338
+ + WL +A+ Y AM DYP P++F+ PLP P++E+C+ ID + + ++Y
Sbjct: 296 STDDLSDWLDSAYSYLAMVDYPYPADFMMPLPGHPIREVCRKIDGAGSNASILDRIYAGI 355
Query: 339 SVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
SVYYNY+G CF L D DPHGL W WQACTEM+M + ++S+F +Y +
Sbjct: 356 SVYYNYTGNVDCFKL--DDDPHGLDGWNWQACTEMVMPMSSNQENSMFPGYGFNYSSYKE 413
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
C + V+PRP W+TTEFGGH I LK F SNIIF NGL DPWSGG VLKN+S T++A
Sbjct: 414 ECWNTFRVNPRPKWVTTEFGGHDIATTLKSFGSNIIFSNGLLDPWSGGSVLKNLSDTIVA 473
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
LV KEGAHH+DLR ST EDP+WL D R E+ +I WI Y
Sbjct: 474 LVTKEGAHHLDLRPSTPEDPKWLVDQREAEIRLIQGWIETY 514
>gi|302758120|ref|XP_002962483.1| hypothetical protein SELMODRAFT_78660 [Selaginella moellendorffii]
gi|300169344|gb|EFJ35946.1| hypothetical protein SELMODRAFT_78660 [Selaginella moellendorffii]
Length = 474
Score = 503 bits (1294), Expect = e-139, Method: Compositional matrix adjust.
Identities = 243/464 (52%), Positives = 320/464 (68%), Gaps = 25/464 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q LDHFN + F QRYL+++ W G + APIFVY GNEGDI WFA+NT
Sbjct: 20 YTEHYFQQTLDHFNVG--NITLFPQRYLLHNASWSGGASGAPIFVYCGNEGDIVWFAENT 77
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGN---KEIAYKNASTTGYLSSTQALADYASL 169
GFM+D+AP F ALLVF EHR+YGKS P+GG KE+A+ S+ QALAD+A+L
Sbjct: 78 GFMFDIAPLFGALLVFPEHRFYGKSQPFGGQNGPKELAF--------CSAEQALADFATL 129
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
I+DLK+NL+A SPVVVFGGSYGGMLAAWFRLKYPH+AIGALASSAPIL F+NIV +F
Sbjct: 130 ILDLKRNLSAQASPVVVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENIVPYTTF 189
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLS 288
+I++ F+ E C+++I+ SW I E A++ GL L + F +C KN + +WL
Sbjct: 190 YDIVSNAFKREGEKCFEIIRNSWTAITEAAEQQNGLRNLSQDFHMCSDFKNADELINWLE 249
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
+A+ Y AM +YP +NF PLPA PV+++C+A+ + + + ++Y +VYYN++G A
Sbjct: 250 SAYSYLAMANYPYAANFTMPLPAHPVRKVCQAMVNSPVASSILQRIYAGVNVYYNFTGAA 309
Query: 349 KCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDP 408
KCFDL D DPHGLS W WQ+CTEM+M ++ S++ E D +A +R+C E YG P
Sbjct: 310 KCFDL--DDDPHGLSGWNWQSCTEMVMPMSSNSNTSMYPPFEWDGEAWSRFCWENYGAIP 367
Query: 409 RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG---------VLKNISKTVLAL 459
RP+W+TTEFGGH I VL+ F SNI+F NGL DPWSGG VL++IS T+LA
Sbjct: 368 RPSWVTTEFGGHDIKSVLRNFGSNIVFSNGLLDPWSGGRQAMNDSFRFVLESISSTILAF 427
Query: 460 VEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
V KEGAHH+D R+S K+DPQWL + R EV I +W+S+Y Q++
Sbjct: 428 VTKEGAHHLDFRWSRKDDPQWLIEQRESEVREIKRWLSEYHQNV 471
>gi|326489169|dbj|BAK01568.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 514
Score = 501 bits (1290), Expect = e-139, Method: Compositional matrix adjust.
Identities = 238/450 (52%), Positives = 319/450 (70%), Gaps = 6/450 (1%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++T+Y Q LDHF+++ + + FQQRYL+ G + PIF Y GNEGDI WFA N+
Sbjct: 52 FETRYFRQRLDHFSFSGEE-EFFQQRYLVGRAG-GWAGPGGPIFFYCGNEGDIAWFAANS 109
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
G +++ AP+F AL+VF EHRYYG+S+P+G +KE AY ++ + YL++ QALADYA L+ D
Sbjct: 110 GLVWEAAPRFAALVVFAEHRYYGESMPFG-SKEKAYNSSRSMAYLTAEQALADYAVLLTD 168
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK+NL++ SPVV+FGGSYGGMLAAW RLKYPH+A+GALASSAPIL F++IV F ++
Sbjct: 169 LKRNLSSESSPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPDTIFYDL 228
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAF 291
++ DF+ S +C++ IK SWK+++E GL KL K F +C++ A+ WLS+A+
Sbjct: 229 VSNDFKRESLSCFQTIKDSWKELDEQGNGQDGLLKLSKTFHLCQTLNTTGALSDWLSSAY 288
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
Y AM DYP PS FL PLPA P+KE+C+ ID + + ++Y ++YYNY+GT CF
Sbjct: 289 SYLAMVDYPMPSEFLMPLPANPIKEVCRNIDKQPERSSILERIYAGVNIYYNYTGTVHCF 348
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
DL D DPHG+ W WQACTEM+M S+F E DY A C + +GV PRP
Sbjct: 349 DL--DDDPHGMGGWDWQACTEMVMPMSSSEGLSMFPPDEFDYALYADDCVKNFGVRPRPR 406
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
WI+TEFGGH I VL++F+SNIIFFNGL DPWSGGGVLKNIS +V+A+V GAHH+DLR
Sbjct: 407 WISTEFGGHNISSVLEKFSSNIIFFNGLLDPWSGGGVLKNISDSVVAIVAPLGAHHIDLR 466
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
+TKEDP WL +R E+EII W+S +++
Sbjct: 467 PATKEDPDWLVSLRESELEIISGWLSDHYR 496
>gi|115440161|ref|NP_001044360.1| Os01g0767100 [Oryza sativa Japonica Group]
gi|53792444|dbj|BAD53352.1| putative prolylcarboxypeptidase, isoform 1 [Oryza sativa Japonica
Group]
gi|53793554|dbj|BAD53324.1| putative prolylcarboxypeptidase, isoform 1 [Oryza sativa Japonica
Group]
gi|113533891|dbj|BAF06274.1| Os01g0767100 [Oryza sativa Japonica Group]
Length = 517
Score = 498 bits (1282), Expect = e-138, Method: Compositional matrix adjust.
Identities = 240/478 (50%), Positives = 330/478 (69%), Gaps = 13/478 (2%)
Query: 29 FPSSRITPEKLSSLISSSKDSQGL-YKTKYHTQILDHFNYNPQSYQT----FQQRYLIND 83
FP P + ++ + + +G Y+T+Y Q LDHF++ + + FQQRYL+
Sbjct: 25 FPVPHARPRR--GVVGAEEAVRGYDYETRYFRQRLDHFSFLEEEGEEGDGFFQQRYLVGR 82
Query: 84 THWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGN 143
G + PIF Y GNEGDI WFA N+G +++ A +F AL+VF EHRYYG+S+P+G +
Sbjct: 83 GG-GWAGAGGPIFFYCGNEGDIAWFAANSGLVWEAATRFAALVVFAEHRYYGESMPFG-S 140
Query: 144 KEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKY 203
K+ AY N+ + YL++ QALADYA L+ DLKKNL++ SPVV+FGGSYGGMLAAW RLKY
Sbjct: 141 KDKAYNNSKSLAYLTAEQALADYAVLLTDLKKNLSSEGSPVVLFGGSYGGMLAAWMRLKY 200
Query: 204 PHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPG 263
PH+A+GALASSAPIL F+++V F ++++ DF+ S +C++ IK SWK ++
Sbjct: 201 PHIAVGALASSAPILQFEDVVPSTIFYDLVSNDFKRESLSCFQTIKDSWKALDAQGNGQD 260
Query: 264 GLEKLQKAFRICKSEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID 322
GL KL K F +CK+ KN + WLS+A+ Y AM DYP P++F+ PLP P+KE+C ID
Sbjct: 261 GLLKLSKTFHLCKTIKNTGELSDWLSSAYSYLAMVDYPMPADFMMPLPGNPIKELCTKID 320
Query: 323 DPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNK 382
+ G + ++Y +VYYNY+GT CFDLN DPHG+ W WQACTEM+M ++
Sbjct: 321 NQPDGTSILERIYAGVNVYYNYTGTVDCFDLN--DDPHGMDGWDWQACTEMVMPMSY-SE 377
Query: 383 DSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDP 442
DS+F + +Y + + C ++GV+PRP WITTEFGGH I LVL+RF SNIIFFNGL DP
Sbjct: 378 DSMFPADKFNYTSYEKDCINSFGVEPRPQWITTEFGGHNISLVLERFGSNIIFFNGLLDP 437
Query: 443 WSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
WSGGGVLKNIS++V+A++ GAHH+DLR ++K+DP WL +R E+ II W+S Y+
Sbjct: 438 WSGGGVLKNISESVVAIIAPLGAHHIDLRPASKDDPDWLVRLRESELGIISGWLSDYY 495
>gi|242058825|ref|XP_002458558.1| hypothetical protein SORBIDRAFT_03g035680 [Sorghum bicolor]
gi|241930533|gb|EES03678.1| hypothetical protein SORBIDRAFT_03g035680 [Sorghum bicolor]
Length = 518
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 237/454 (52%), Positives = 317/454 (69%), Gaps = 10/454 (2%)
Query: 53 YKTKYHTQILDHFNY-----NPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW 107
Y+T+Y Q LDHF++ ++ FQQRYL+ G + PIF Y GNEGDI W
Sbjct: 51 YETRYFRQRLDHFSFPGVADEDEAAAFFQQRYLVGRGG-GWAGPGGPIFFYCGNEGDIAW 109
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
FA N+G +++ AP+F AL+VF EHRYYG+S+P+G +K AY ++ + YL++ QALAD+A
Sbjct: 110 FASNSGLVWEAAPRFAALVVFAEHRYYGESMPFG-SKAKAYNDSKSLAYLTAEQALADFA 168
Query: 168 SLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
L+ DLK+NL+A SPVV+FGGSYGGMLAAW RLKYPH+++GAL+SSAPIL F++IV
Sbjct: 169 VLLTDLKRNLSAEGSPVVLFGGSYGGMLAAWMRLKYPHISVGALSSSAPILQFEDIVPST 228
Query: 228 SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESW 286
F ++++ DFR S +C+ IK SWK++++ + GGL KL K F +C++ K + W
Sbjct: 229 IFYDLVSDDFRRESLSCFLTIKDSWKELDDQGNEQGGLLKLSKTFHLCQTLKTSGDLSDW 288
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSG 346
LS+A+ Y AM DYP PS FL PLPA P+KE+C+ ID G ++Y +VYYNY+G
Sbjct: 289 LSSAYSYLAMVDYPLPSEFLMPLPANPIKEVCRNIDSQPEGTSTLERIYAGVNVYYNYTG 348
Query: 347 TAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
T CFDLN DPHG+ W WQACTEM+M S++ + DY + A C ++YGV
Sbjct: 349 TVGCFDLN--DDPHGMGGWDWQACTEMVMPMSYSEGRSMYPPYKFDYPSYADGCIKSYGV 406
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PRP WITTEFGGH I VL++F SNIIFFNGL DPWSGGGVLKNIS++V+A+V GAH
Sbjct: 407 RPRPRWITTEFGGHNITTVLEKFGSNIIFFNGLLDPWSGGGVLKNISESVIAIVAPLGAH 466
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
H+DLR +T +DP WL +R E++II W+S Y+
Sbjct: 467 HIDLRPATPDDPDWLVALRESELKIISGWLSDYY 500
>gi|413952315|gb|AFW84964.1| putative serine peptidase S28 family protein [Zea mays]
Length = 517
Score = 497 bits (1279), Expect = e-138, Method: Compositional matrix adjust.
Identities = 240/456 (52%), Positives = 316/456 (69%), Gaps = 12/456 (2%)
Query: 53 YKTKYHTQILDHFNY-------NPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
Y+T+Y Q LDHF++ ++ FQQRYL+ G + PIF Y GNEGDI
Sbjct: 48 YETRYFRQRLDHFSFPGVGDEDEDEAAAFFQQRYLVGRGG-GWAGPGGPIFFYCGNEGDI 106
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
WFA N+G +++ AP+F AL+VF EHRYYG+S+P+G +K AY ++ + YL++ QALAD
Sbjct: 107 AWFAANSGLVWEAAPRFAALVVFAEHRYYGESMPFG-SKAKAYSDSKSLAYLTAEQALAD 165
Query: 166 YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
+A L+ DLK+NL+A SPVV+FGGSYGGMLAAW RLKYPH+AIGALASSAPIL F++IV
Sbjct: 166 FAVLLTDLKRNLSAEGSPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP 225
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IE 284
F ++++ DFR S +C+ IK SWK++++ A + GL KL K F +C++ K +
Sbjct: 226 STIFYDLVSDDFRRESLSCFLTIKDSWKELDDQANEQDGLLKLSKTFHLCQTLKTSGDLS 285
Query: 285 SWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNY 344
WLS+A+ Y AM DYP PS FL PLPA P+KE+C+ ID G ++Y +VYYNY
Sbjct: 286 DWLSSAYSYLAMVDYPLPSEFLMPLPANPIKEVCRNIDSQPKGTGTLERIYAGVNVYYNY 345
Query: 345 SGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
+GT CFDLN DPHG+ W WQACTEM+M + S++ + DY + A C ++Y
Sbjct: 346 TGTVDCFDLN--DDPHGMGGWDWQACTEMVMPMSYSEQRSMYPPYKFDYASYAEDCIKSY 403
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
GV PRP WITTEFGGH I VL+ F SNIIFFNGL DPWSGGGVLKNIS++V+A+V G
Sbjct: 404 GVRPRPKWITTEFGGHNITKVLENFGSNIIFFNGLLDPWSGGGVLKNISESVVAIVAPLG 463
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
AHH+DLR +T +DP WL +R E+EII W+ Y+
Sbjct: 464 AHHIDLRPATPDDPDWLVALRESELEIISGWLWDYY 499
>gi|297794189|ref|XP_002864979.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297310814|gb|EFH41238.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 514
Score = 494 bits (1271), Expect = e-137, Method: Compositional matrix adjust.
Identities = 252/506 (49%), Positives = 330/506 (65%), Gaps = 14/506 (2%)
Query: 1 MATRF----IFLSFCLLFSST-LTISNAKIFPTFPSSRITPEKLSSLISSSKD-SQGLYK 54
MA F IF F LLF S ++S++K+ P FP R T + D ++ Y+
Sbjct: 1 MANHFCLLLIFTFFTLLFPSNGSSLSSSKLLPRFP--RYTSRNRGRIQQFRGDRNEYRYE 58
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
TK+ +Q LDHF++ F QRYLIN +W G+ PIF+Y GNEGDIEWFA N+GF
Sbjct: 59 TKFFSQQLDHFSF--ADLPKFPQRYLINSDYWLGASALGPIFLYCGNEGDIEWFATNSGF 116
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
++D+APKF ALLVF E R +PYG +E AYKNA+T YL++ QALAD+A + DLK
Sbjct: 117 IWDIAPKFGALLVFPEVRSCLFCMPYGSMEE-AYKNATTLSYLTTEQALADFAVFVTDLK 175
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
+NL+A PVV+FGGSYGGMLAAW RLKYPH+AIGALASSAPIL F++IV P +F +I +
Sbjct: 176 RNLSAEACPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYDIAS 235
Query: 235 QDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAFVY 293
DF+ S +C+ IK SW I +K GL +L K F C+ + + WL +A+ Y
Sbjct: 236 NDFKRESSSCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSY 295
Query: 294 TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL 353
AM DYP P++F+ PLP P++E+C+ ID + + +++ SVYYNY+G CF L
Sbjct: 296 LAMVDYPYPADFMMPLPGHPIREVCRKIDGAHSDASILDRIFAGISVYYNYTGNVDCFKL 355
Query: 354 NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWI 413
D DPHGL W WQACTEM+M + + S+F + +Y + C + V+PRP W+
Sbjct: 356 --DDDPHGLDGWNWQACTEMVMPMSSNQEKSMFPAYDFNYSSYKEECWNTFRVNPRPKWV 413
Query: 414 TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFS 473
TTEFGGH I LK F SNIIF NG+ DPWSGG VLKN+S T++ALV KEGAHH+DLR S
Sbjct: 414 TTEFGGHDIETTLKLFGSNIIFSNGMLDPWSGGSVLKNLSNTIVALVTKEGAHHLDLRPS 473
Query: 474 TKEDPQWLKDVRRREVEIIGKWISQY 499
T EDP+WL D R E+++I WI Y
Sbjct: 474 TPEDPKWLVDQREAEIQLIQGWIETY 499
>gi|357136615|ref|XP_003569899.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Brachypodium
distachyon]
Length = 512
Score = 493 bits (1268), Expect = e-136, Method: Compositional matrix adjust.
Identities = 238/457 (52%), Positives = 319/457 (69%), Gaps = 8/457 (1%)
Query: 47 KDSQGLYK--TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD 104
+D G Y+ T+Y Q LDHF+++ ++FQQRYL+ G + PIF Y GNEGD
Sbjct: 41 RDGVGGYEFDTRYFRQRLDHFSFS-GGEESFQQRYLVGRAG-GWAGPGGPIFFYCGNEGD 98
Query: 105 IEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALA 164
I WFA N+G +++ AP+F AL+VF EHRYYG+S+P+G +KE AY N+ + YL++ QA+A
Sbjct: 99 IAWFAANSGLVWEAAPRFAALVVFAEHRYYGESMPFG-SKEKAYNNSRSLAYLTAEQAIA 157
Query: 165 DYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIV 224
DYA L+ DLK+NL++ SPVV+FGGSYGGMLAAW RLKYPH+A+GALASSAPIL F++IV
Sbjct: 158 DYAVLLTDLKRNLSSESSPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIV 217
Query: 225 SPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-I 283
F +I++ DF+ S +C++ IK SWK+++E GL KL K F +C++ K +
Sbjct: 218 PQTIFYDIVSNDFKRESLSCFQTIKDSWKELDEQGNGQDGLLKLSKTFHLCQTLKTTGGL 277
Query: 284 ESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYN 343
WL++A+ + AM DYP S+FL PLP+ P+KE+C+ ID G ++Y ++YYN
Sbjct: 278 SDWLNSAYGFLAMVDYPMSSDFLMPLPSNPIKEVCRNIDKQLDGTSTLERIYAGVNIYYN 337
Query: 344 YSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEA 403
Y+GT CFDL D DPHG+ W WQACTEM+M S+F E DY + A C ++
Sbjct: 338 YTGTVDCFDL--DDDPHGMGGWQWQACTEMVMPMSSSESLSMFPPYEFDYASYADDCVKS 395
Query: 404 YGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKE 463
+GV P P WITTEFGGH I VL++F SNIIFFNGL DPWSGGGVLKNIS +V+A+V
Sbjct: 396 FGVRPSPRWITTEFGGHNISAVLEKFGSNIIFFNGLLDPWSGGGVLKNISGSVVAIVAPL 455
Query: 464 GAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
GAHH+DLR +TKEDP WL +R E+ II W+S ++
Sbjct: 456 GAHHIDLRPATKEDPDWLVSLRESELGIISGWLSDHY 492
>gi|413952316|gb|AFW84965.1| putative serine peptidase S28 family protein [Zea mays]
Length = 515
Score = 486 bits (1251), Expect = e-134, Method: Compositional matrix adjust.
Identities = 237/456 (51%), Positives = 313/456 (68%), Gaps = 14/456 (3%)
Query: 53 YKTKYHTQILDHFNY-------NPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
Y+T+Y Q LDHF++ ++ FQQRYL+ G + PIF Y GNEGDI
Sbjct: 48 YETRYFRQRLDHFSFPGVGDEDEDEAAAFFQQRYLVGRGG-GWAGPGGPIFFYCGNEGDI 106
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
WFA N+G +++ AP+F AL+VF EHRYYG+S+P+G +K AY ++ + YL++ QALAD
Sbjct: 107 AWFAANSGLVWEAAPRFAALVVFAEHRYYGESMPFG-SKAKAYSDSKSLAYLTAEQALAD 165
Query: 166 YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
+A L+ DLK+NL+A SPVV+FGGSYGG AW RLKYPH+AIGALASSAPIL F++IV
Sbjct: 166 FAVLLTDLKRNLSAEGSPVVLFGGSYGG--TAWMRLKYPHIAIGALASSAPILQFEDIVP 223
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IE 284
F ++++ DFR S +C+ IK SWK++++ A + GL KL K F +C++ K +
Sbjct: 224 STIFYDLVSDDFRRESLSCFLTIKDSWKELDDQANEQDGLLKLSKTFHLCQTLKTSGDLS 283
Query: 285 SWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNY 344
WLS+A+ Y AM DYP PS FL PLPA P+KE+C+ ID G ++Y +VYYNY
Sbjct: 284 DWLSSAYSYLAMVDYPLPSEFLMPLPANPIKEVCRNIDSQPKGTGTLERIYAGVNVYYNY 343
Query: 345 SGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
+GT CFDLN DPHG+ W WQACTEM+M + S++ + DY + A C ++Y
Sbjct: 344 TGTVDCFDLN--DDPHGMGGWDWQACTEMVMPMSYSEQRSMYPPYKFDYASYAEDCIKSY 401
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
GV PRP WITTEFGGH I VL+ F SNIIFFNGL DPWSGGGVLKNIS++V+A+V G
Sbjct: 402 GVRPRPKWITTEFGGHNITKVLENFGSNIIFFNGLLDPWSGGGVLKNISESVVAIVAPLG 461
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
AHH+DLR +T +DP WL +R E+EII W+ Y+
Sbjct: 462 AHHIDLRPATPDDPDWLVALRESELEIISGWLWDYY 497
>gi|302758752|ref|XP_002962799.1| hypothetical protein SELMODRAFT_78251 [Selaginella moellendorffii]
gi|300169660|gb|EFJ36262.1| hypothetical protein SELMODRAFT_78251 [Selaginella moellendorffii]
Length = 470
Score = 485 bits (1249), Expect = e-134, Method: Compositional matrix adjust.
Identities = 233/452 (51%), Positives = 312/452 (69%), Gaps = 17/452 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q LDHFN + F QRYL+++ W G + APIFVY GNEGDI WFA+NT
Sbjct: 20 YTEHYFQQTLDHFNVG--NITLFPQRYLLHNASWSGGASGAPIFVYCGNEGDIVWFAENT 77
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGN---KEIAYKNASTTGYLSSTQALADYASL 169
GFM+D+AP F ALLVF EHR+YGKS P+GG KE+A+ S+ QALAD+A+L
Sbjct: 78 GFMFDIAPLFGALLVFPEHRFYGKSQPFGGQNGPKELAF--------CSAEQALADFATL 129
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
I+DLK+NL+A SPVVVFGGSYGGMLAAWFRLKYPH+AIGALASSAPIL F+NIV ++
Sbjct: 130 ILDLKRNLSAQASPVVVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENIVPYTTY 189
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLS 288
+I++ F+ E C+++I+ SW I E A++ GL L + F +C KN + +WL
Sbjct: 190 YDIVSNAFKREGEECFEIIRNSWTAITEAAEQQNGLRNLSQDFHMCSDFKNADELINWLE 249
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
+A+ Y AM +YP +NF PLPA+PV+++C+A+ + + + ++Y +VYYN++G A
Sbjct: 250 SAYSYLAMANYPYAANFTMPLPAYPVRKVCQAMVNSPVASSILQRIYAGVNVYYNFTGAA 309
Query: 349 KCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDP 408
KCFDL D DPHGLS W WQ+CTEM+M ++ S++ E D +A +R+C E YG P
Sbjct: 310 KCFDL--DDDPHGLSGWNWQSCTEMVMPMSSNSSTSMYPPFEWDGEAWSRFCWEKYGAIP 367
Query: 409 RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS-KTVLALVEKEGAHH 467
RP+W+TTEFGGH I L+ F SNI+F NGL DPWSGG N S + +L++ +GAHH
Sbjct: 368 RPSWVTTEFGGHDIKSALRNFGSNIVFSNGLLDPWSGGRQAMNDSFRLRSSLIDFDGAHH 427
Query: 468 VDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+D R+S K+DPQWL + R EV I +W+S+Y
Sbjct: 428 LDFRWSRKDDPQWLIEQRELEVREIKRWLSEY 459
>gi|125572151|gb|EAZ13666.1| hypothetical protein OsJ_03583 [Oryza sativa Japonica Group]
Length = 549
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 239/510 (46%), Positives = 331/510 (64%), Gaps = 45/510 (8%)
Query: 29 FPSSRITPEKLSSLISSSKDSQGL-YKTKYHTQILDHFNYNPQSYQT----FQQRYLIND 83
FP P + ++ + + +G Y+T+Y Q LDHF++ + + FQQRYL+
Sbjct: 25 FPVPHARPRR--GVVGAEEAVRGYDYETRYFRQRLDHFSFLEEEGEEGDGFFQQRYLVGR 82
Query: 84 THWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVF--------------- 128
G + PIF Y GNEGDI WFA N+G +++ A +F AL+VF
Sbjct: 83 GG-GWAGAGGPIFFYCGNEGDIAWFAANSGLVWEAATRFAALVVFAEANLSSHLAPVSVF 141
Query: 129 -----------------IEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
++HRYYG+S+P+G +K+ AY N+ + YL++ QALADYA L+
Sbjct: 142 FLGCALIVRDPIPFTCSLQHRYYGESMPFG-SKDKAYNNSKSLAYLTAEQALADYAVLLT 200
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
DLKKNL++ SPVV+FGGSYGGMLAAW RLKYPH+A+GALASSAPIL F+++V F +
Sbjct: 201 DLKKNLSSEGSPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDVVPSTIFYD 260
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTA 290
+++ DF+ S +C++ IK SWK ++ GL KL K F +CK+ KN + WLS+A
Sbjct: 261 LVSNDFKRESLSCFQTIKDSWKALDAQGNGQDGLLKLSKTFHLCKTIKNTGELSDWLSSA 320
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKC 350
+ Y AM DYP P++F+ PLP P+KE+C ID+ G + ++Y +VYYNY+GT C
Sbjct: 321 YSYLAMVDYPMPADFMMPLPGNPIKELCTKIDNQPDGTSILERIYAGVNVYYNYTGTVDC 380
Query: 351 FDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
FDLN DPHG+ W WQACTEM+M ++DS+F + +Y + + C ++GV+PRP
Sbjct: 381 FDLN--DDPHGMDGWDWQACTEMVMPMSY-SEDSMFPADKFNYTSYEKDCINSFGVEPRP 437
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
WITTEFGGH I LVL+RF SNIIFFNGL DPWSGGGVLKNIS++V+A++ GAHH+DL
Sbjct: 438 QWITTEFGGHNISLVLERFGSNIIFFNGLLDPWSGGGVLKNISESVVAIIAPLGAHHIDL 497
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
R ++K+DP WL +R E+ II W+S Y+
Sbjct: 498 RPASKDDPDWLVRLRESELGIISGWLSDYY 527
>gi|125527844|gb|EAY75958.1| hypothetical protein OsI_03875 [Oryza sativa Indica Group]
Length = 549
Score = 481 bits (1237), Expect = e-133, Method: Compositional matrix adjust.
Identities = 239/510 (46%), Positives = 330/510 (64%), Gaps = 45/510 (8%)
Query: 29 FPSSRITPEKLSSLISSSKDSQGL-YKTKYHTQILDHFNYNPQSYQT----FQQRYLIND 83
FP P + ++ + + +G Y+T+Y Q LDHF++ + + FQQRYL+
Sbjct: 25 FPVPHARPRR--GVVGAEEAVRGYDYETRYFRQRLDHFSFLEEEGEEGDGFFQQRYLVGR 82
Query: 84 THWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVF--------------- 128
G + PIF Y GNEGDI WFA N+G +++ A +F AL+VF
Sbjct: 83 GG-GWAGAGGPIFFYCGNEGDIAWFAANSGLVWEAATRFAALVVFAEANLSSHLAPVSVF 141
Query: 129 -----------------IEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
++HRYYG+S+P+G +K+ AY N+ + YL++ QALADYA L+
Sbjct: 142 FLGCALIVRDPIPFTCSLQHRYYGESMPFG-SKDKAYNNSKSLAYLTAEQALADYAVLLT 200
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
DLKKNL++ SPVV+FGGSYGGMLAAW RLKYPH+A+GALASSAPIL F+++V F +
Sbjct: 201 DLKKNLSSEGSPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDVVPSTIFYD 260
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTA 290
+++ DF+ S C++ IK SWK ++ GL KL K F +CK+ KN + WLS+A
Sbjct: 261 LVSNDFKRESLICFQTIKDSWKALDAQGNGQDGLLKLSKTFHLCKTIKNTGELSDWLSSA 320
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKC 350
+ Y AM DYP P++F+ PLP P+KE+C ID+ G + ++Y +VYYNY+GT C
Sbjct: 321 YSYLAMVDYPMPADFMMPLPGNPIKELCTKIDNQPDGTSILERIYAGVNVYYNYTGTVDC 380
Query: 351 FDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
FDLN DPHG+ W WQACTEM+M ++DS+F + +Y + + C ++GV+PRP
Sbjct: 381 FDLN--DDPHGMDGWDWQACTEMVMPMSY-SEDSMFPADKFNYTSYEKDCINSFGVEPRP 437
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
WITTEFGGH I LVL+RF SNIIFFNGL DPWSGGGVLKNIS++V+A++ GAHH+DL
Sbjct: 438 QWITTEFGGHNISLVLERFGSNIIFFNGLLDPWSGGGVLKNISESVVAIIAPLGAHHIDL 497
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
R ++K+DP WL +R E+ II W+S Y+
Sbjct: 498 RPASKDDPDWLVRLRESELGIISGWLSDYY 527
>gi|356514372|ref|XP_003525880.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Glycine max]
Length = 597
Score = 480 bits (1235), Expect = e-133, Method: Compositional matrix adjust.
Identities = 228/453 (50%), Positives = 306/453 (67%), Gaps = 11/453 (2%)
Query: 50 QGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA 109
Q Y+T+ Q LDHF+++ TF QRYLI+ HW G + P+F Y+GNE DIEWFA
Sbjct: 126 QFHYETRCIQQSLDHFSFS--ELPTFPQRYLISTEHWVGPRRLGPVFFYSGNEDDIEWFA 183
Query: 110 QNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL 169
QNTG ++++AP+F A++VF EH+YYG+S+PYG +E AYKN +T YL+S QAL D++ +
Sbjct: 184 QNTGVVWEIAPRFGAMVVFPEHQYYGESVPYGSAEE-AYKNVTTLSYLTSEQALVDFSVV 242
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
I DLK N + D PV +FGGSYGGMLAAW RLKYPHVA+GALASSAPIL F++IV P +F
Sbjct: 243 IADLKHNFSTKDCPVFLFGGSYGGMLAAWMRLKYPHVAVGALASSAPILQFEDIVPPETF 302
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-KSEKNLAIESWLS 288
++++ F+ S C+ IK SW ++ + GLE L K F +C K + + W+
Sbjct: 303 YDLVSNAFKRESFICFNYIKQSWNEMASAGQTNNGLELLTKTFNLCQKLNRTKDLYDWVE 362
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
A+ Y AM +YP P+ F+ LP P++E+ + + + ++Y +VYYNY+G A
Sbjct: 363 AAYSYLAMVNYPYPAEFMMTLPEHPIREVSMV-----SNSYILERIYEGVNVYYNYTGEA 417
Query: 349 KCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDP 408
KCF+L D DPHG+S W WQACTEMIM + S+F E Y + C + +GV P
Sbjct: 418 KCFEL--DDDPHGMSGWDWQACTEMIMPMSSSQESSMFLPYEYXYTSIQEECLKKFGVKP 475
Query: 409 RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHV 468
RP WITTEFGGH I LK+F SNIIF NGL DPWSGG +L+NIS++V++LV +EGAHH+
Sbjct: 476 RPKWITTEFGGHDIHATLKKFGSNIIFSNGLLDPWSGGSILQNISESVVSLVTEEGAHHI 535
Query: 469 DLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
DLR STK DP WL + R E+++I WIS Y Q
Sbjct: 536 DLRSSTKNDPDWLVEQRETEIKLIEGWISDYHQ 568
>gi|242096526|ref|XP_002438753.1| hypothetical protein SORBIDRAFT_10g025550 [Sorghum bicolor]
gi|241916976|gb|EER90120.1| hypothetical protein SORBIDRAFT_10g025550 [Sorghum bicolor]
Length = 569
Score = 479 bits (1234), Expect = e-132, Method: Compositional matrix adjust.
Identities = 260/493 (52%), Positives = 322/493 (65%), Gaps = 41/493 (8%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW------GGSKNNAPIFVYTGNEGDIE 106
+ T Y Q LDHF + P + F+Q+YL+NDT W G+ P+FVYTGNEGDIE
Sbjct: 76 FTTHYFPQELDHFTFTPNASMVFRQKYLVNDTFWRRPSGGNGTAGAGPLFVYTGNEGDIE 135
Query: 107 WFAQNTGFMYDVAPKFKALLVFIEH-----------------RYYGKSIPYGGNKEI--- 146
WFA NTGFM+D+APKF ALLVFIE + G P G +
Sbjct: 136 WFATNTGFMFDIAPKFGALLVFIECLNAENMIQSKIFSFCFLKSRGPRCPEPGPRPPLAL 195
Query: 147 -------AYKNASTTGYLSSTQA---LADYASLIIDLKKNLTATDSPVVVFGGSYGGMLA 196
A A G ++ +A LAD+A LI LK+NL+A +PVVVFGGSYGGMLA
Sbjct: 196 PLPLARQARSAAGLRGLPTAGRAAVALADFAILITSLKQNLSAKTAPVVVFGGSYGGMLA 255
Query: 197 AWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIE 256
+WFRLKYPHVAIGALASSAPIL FD I SFS+ I+QD++S S NC+ VIK +W ++
Sbjct: 256 SWFRLKYPHVAIGALASSAPILQFDYITPWSSFSDAISQDYKSESLNCFSVIKATWDVLD 315
Query: 257 ETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVK 315
E GL +L K FR CK+ K +I WL TAFVYTAM DYPTP+NFL LPA+PVK
Sbjct: 316 ERGANDRGLLELSKLFRACKTLKYADSIAGWLQTAFVYTAMVDYPTPANFLKNLPAYPVK 375
Query: 316 EMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLN---GDSDPHGLSEWGWQACTE 372
EMCK ID D+ K++ AAS+YYN++G C + S GLS WGWQACTE
Sbjct: 376 EMCKIIDGFPANADILEKVFAAASLYYNFTGDQTCNQIEYDDDSSSSLGLSGWGWQACTE 435
Query: 373 MIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASN 432
MIM D+ +S+F Y+ + C + YGV PRP+WITTE+GG+KI VLKRF SN
Sbjct: 436 MIMPMS-DSNESMFPPDTFSYEDTSNTCFQLYGVRPRPHWITTEYGGYKIDKVLKRFGSN 494
Query: 433 IIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEII 492
IIF NGLRDPWS GGVLK+IS +++ALV ++GAHH+DLR +TK+DP W+ + RR+EVEII
Sbjct: 495 IIFSNGLRDPWSRGGVLKDISSSIVALVTEKGAHHLDLRSATKDDPDWVIEQRRQEVEII 554
Query: 493 GKWISQYFQDLAQ 505
WI QY QD+AQ
Sbjct: 555 HGWIDQYHQDMAQ 567
>gi|18700101|gb|AAL77662.1| AT5g65760/MPA24_11 [Arabidopsis thaliana]
Length = 491
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 238/472 (50%), Positives = 313/472 (66%), Gaps = 15/472 (3%)
Query: 1 MATRF----IFLSFCLLFSST-LTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYK- 54
MA+ F IF F L+F S ++S++K+ P FP R T + + I + + Y+
Sbjct: 1 MASHFCLLLIFTFFTLVFPSNGSSLSSSKLLPRFP--RYTFQNREARIQQFRGDRNEYRY 58
Query: 55 -TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
TK+ +Q LDHF++ F QRYLIN HW G+ PIF+Y GNEGDIEWFA N+G
Sbjct: 59 ETKFFSQQLDHFSF--ADLPKFSQRYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSG 116
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
F++D+APKF ALLVF EHRYYG+S+PYG +E AYKNA+T YL++ QALAD+A + DL
Sbjct: 117 FIWDIAPKFGALLVFPEHRYYGESMPYGSREE-AYKNATTLSYLTTEQALADFAVFVTDL 175
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K+NL+A PVV+FGGSYGGMLAAW RLKYPH+AIGALASSAPIL F+++V P +F +I
Sbjct: 176 KRNLSAEACPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDVVPPETFYDIA 235
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAFV 292
+ DF+ S +C+ IK SW I +K GL +L K F C+ + + WL +A+
Sbjct: 236 SNDFKRESSSCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYS 295
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFD 352
Y AM DYP P++F+ PLP P++E+C+ ID + + ++Y SVYYNY+G CF
Sbjct: 296 YLAMVDYPYPADFMMPLPGHPIREVCRKIDGAGSNASILDRIYAGISVYYNYTGNVDCFK 355
Query: 353 LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNW 412
L D DPHGL W WQACTEM+M + ++S+F +Y + C + V+PRP W
Sbjct: 356 L--DDDPHGLDGWNWQACTEMVMPMSSNQENSMFPGYGFNYSSYKEECWNTFRVNPRPKW 413
Query: 413 ITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
+TTEFGGH I LK F SNIIF NGL DPWSGG VLKN+S T++ALV KEG
Sbjct: 414 VTTEFGGHDIATTLKSFGSNIIFSNGLLDPWSGGSVLKNLSDTIVALVTKEG 465
>gi|222635987|gb|EEE66119.1| hypothetical protein OsJ_22160 [Oryza sativa Japonica Group]
Length = 393
Score = 458 bits (1178), Expect = e-126, Method: Compositional matrix adjust.
Identities = 224/394 (56%), Positives = 283/394 (71%), Gaps = 7/394 (1%)
Query: 115 MYDVAPKFKALL--VFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
M+D+AP F ALL + +HR+YG+S P+G + GYL+STQALAD+A LI
Sbjct: 1 MFDIAPSFGALLHDLSKQHRFYGESKPFGNESN---SSPEKLGYLTSTQALADFAVLITS 57
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK NL+A SPVVVFGGSYGGMLA+WFRLKYPHV IGA+ASSAPIL FD I SF
Sbjct: 58 LKHNLSAVSSPVVVFGGSYGGMLASWFRLKYPHVTIGAVASSAPILQFDYITPWSSFYEA 117
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAF 291
++QD++S S NC+ VIK +W I+E GL +L K FR CK+ K++ + +WL TAF
Sbjct: 118 VSQDYKSESFNCFSVIKAAWDLIDERGSTDAGLLQLSKTFRACKTVKSVYSFRNWLWTAF 177
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
VYTAM DYPTP+NFL LPA+P+KEMCK I G D+ K + AAS+YYNY+G CF
Sbjct: 178 VYTAMVDYPTPANFLMNLPAYPIKEMCKIIHGFPAGADIVDKAFAAASLYYNYTGDQTCF 237
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
L DPHGLS WGWQACTEM+M N +S+F Y+ ++ C ++YGV PRP+
Sbjct: 238 QLEDGEDPHGLSGWGWQACTEMVMPMTISN-ESMFPPFTFTYEGKSDDCFQSYGVRPRPH 296
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
WITTE+GG++I LVLKRF SNIIF NG+RDPWS GGVLKNIS +++ALV ++GAHH+D R
Sbjct: 297 WITTEYGGNRIDLVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTEKGAHHLDFR 356
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+TK+DP W+ + RR+EV+II WI QY +DLAQ
Sbjct: 357 SATKDDPDWVVEQRRQEVKIIQGWIDQYNEDLAQ 390
>gi|356570877|ref|XP_003553610.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Glycine max]
Length = 349
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 229/354 (64%), Positives = 273/354 (77%), Gaps = 14/354 (3%)
Query: 18 LTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQ 77
TI+ A I P FPSS + P + ++S G+Y K+ TQILDHFNYNPQSYQTFQQ
Sbjct: 3 FTITFAHILPRFPSSVVHP---AIDVTSRSAQNGVYTKKFFTQILDHFNYNPQSYQTFQQ 59
Query: 78 RYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIE--HRYYG 135
RYLINDT+WGG K+NAPIF YTGNEGDIEWFAQN GFM++ AP FKALLVFIE HRYYG
Sbjct: 60 RYLINDTYWGGDKSNAPIFFYTGNEGDIEWFAQNPGFMFETAPYFKALLVFIEEXHRYYG 119
Query: 136 KSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGML 195
KS P+GGN+E A N+ST GYLSST LIIDLKKNL+AT SPVVVFGGSYGG++
Sbjct: 120 KSFPFGGNEEDANANSSTLGYLSSTL-------LIIDLKKNLSATYSPVVVFGGSYGGII 172
Query: 196 AAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQI 255
AWFR+KYPHVAIGALASSAPIL F ++VSP ++++IITQD++S SENCYKVIKGSWKQI
Sbjct: 173 LAWFRMKYPHVAIGALASSAPILQFLDLVSPNTYTDIITQDYKSESENCYKVIKGSWKQI 232
Query: 256 EETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPV 314
E+TA+KPGGLE+L K+FRICK+ + A+ SWL A AM DYP PSNFL LPA PV
Sbjct: 233 EDTARKPGGLEQLWKSFRICKNYISAGALVSWLQMALGIAAMIDYPAPSNFLAHLPASPV 292
Query: 315 KEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDP-HGLSEWGW 367
+++ ID+ GN+ F KLY AAS++YNYSGTA CF L+ +D +S+ GW
Sbjct: 293 RKVMALIDNLSVGNEAFTKLYAAASIFYNYSGTAICFGLDNTTDTLERISKAGW 346
>gi|225436410|ref|XP_002271961.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
Length = 505
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 221/454 (48%), Positives = 299/454 (65%), Gaps = 16/454 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++T Y+ Q LDHFNY P+SY TFQQRY++N +WGG+ +APIF Y G E D+++
Sbjct: 60 FQTFYYNQTLDHFNYRPESYITFQQRYVVNFKYWGGANASAPIFAYLGAEADLDYDLSGI 119
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ D A +FKALLV+IEHRYYGKSIPYG +E A+KNAST GY +S QA+ADYA ++I
Sbjct: 120 GFLTDNAHQFKALLVYIEHRYYGKSIPYGSREE-AFKNASTLGYFNSAQAIADYAEVLIY 178
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+KK L A +SPV+V G SYGGMLA+WFRLKYPHVA+GALASSAPIL FD+I + +I
Sbjct: 179 IKKKLLAENSPVIVVGASYGGMLASWFRLKYPHVALGALASSAPILYFDDITPHNGYFSI 238
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAF 291
+T+DFR SE+CYK I SW +I+ A +P GL L K FRIC N ++ +L +
Sbjct: 239 VTKDFREASESCYKTISESWSEIDRVASEPNGLSILSKKFRICGQLNNSDELKEFLERTY 298
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
A D P P++PV +C ID G+D+ ++++ + G C+
Sbjct: 299 SIAAQYDAP---------PSYPVTVVCGGIDGAPEGSDILSRIFAGVVA---FRGNMSCY 346
Query: 352 DLNGDSD--PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR 409
+ +D + WGWQ C+E+++ G D++F + + ++ C YGV PR
Sbjct: 347 YTSNTTDYPIETIQGWGWQTCSEIVIPIGRGVNDTMFPLAPFNLTTFSQDCTSLYGVAPR 406
Query: 410 PNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVD 469
P+WITT +GGH I L+L+RFASNIIF NGLRDP+S GGVLKNIS +VLA++ G+H +D
Sbjct: 407 PHWITTYYGGHDIELILRRFASNIIFSNGLRDPYSIGGVLKNISNSVLAILTVNGSHCLD 466
Query: 470 LRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
L + DP+WL R+ EVEII WI+QY+ DL
Sbjct: 467 LLPAASTDPEWLVMQRKAEVEIIEGWIAQYYVDL 500
>gi|297734878|emb|CBI17112.3| unnamed protein product [Vitis vinifera]
Length = 624
Score = 447 bits (1149), Expect = e-123, Method: Compositional matrix adjust.
Identities = 221/454 (48%), Positives = 299/454 (65%), Gaps = 16/454 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++T Y+ Q LDHFNY P+SY TFQQRY++N +WGG+ +APIF Y G E D+++
Sbjct: 179 FQTFYYNQTLDHFNYRPESYITFQQRYVVNFKYWGGANASAPIFAYLGAEADLDYDLSGI 238
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ D A +FKALLV+IEHRYYGKSIPYG +E A+KNAST GY +S QA+ADYA ++I
Sbjct: 239 GFLTDNAHQFKALLVYIEHRYYGKSIPYGSREE-AFKNASTLGYFNSAQAIADYAEVLIY 297
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+KK L A +SPV+V G SYGGMLA+WFRLKYPHVA+GALASSAPIL FD+I + +I
Sbjct: 298 IKKKLLAENSPVIVVGASYGGMLASWFRLKYPHVALGALASSAPILYFDDITPHNGYFSI 357
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAF 291
+T+DFR SE+CYK I SW +I+ A +P GL L K FRIC N ++ +L +
Sbjct: 358 VTKDFREASESCYKTISESWSEIDRVASEPNGLSILSKKFRICGQLNNSDELKEFLERTY 417
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
A D P P++PV +C ID G+D+ ++++ + G C+
Sbjct: 418 SIAAQYDAP---------PSYPVTVVCGGIDGAPEGSDILSRIFAGVVA---FRGNMSCY 465
Query: 352 DLNGDSD--PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR 409
+ +D + WGWQ C+E+++ G D++F + + ++ C YGV PR
Sbjct: 466 YTSNTTDYPIETIQGWGWQTCSEIVIPIGRGVNDTMFPLAPFNLTTFSQDCTSLYGVAPR 525
Query: 410 PNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVD 469
P+WITT +GGH I L+L+RFASNIIF NGLRDP+S GGVLKNIS +VLA++ G+H +D
Sbjct: 526 PHWITTYYGGHDIELILRRFASNIIFSNGLRDPYSIGGVLKNISNSVLAILTVNGSHCLD 585
Query: 470 LRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
L + DP+WL R+ EVEII WI+QY+ DL
Sbjct: 586 LLPAASTDPEWLVMQRKAEVEIIEGWIAQYYVDL 619
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 36/80 (45%), Positives = 47/80 (58%)
Query: 368 QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLK 427
Q C E +M + D++F + + C YGV P P+W TT +GGH I L+L
Sbjct: 2 QRCNETVMPLRRGDSDTMFPLVPFNLTTFIQECISIYGVSPWPHWATTYYGGHNIELILH 61
Query: 428 RFASNIIFFNGLRDPWSGGG 447
RFASNIIF NGLRDP++ G
Sbjct: 62 RFASNIIFSNGLRDPYAVPG 81
>gi|225436414|ref|XP_002272152.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
Length = 493
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 217/448 (48%), Positives = 299/448 (66%), Gaps = 16/448 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ T ++ Q LDHFNY P+SY+TFQQRY++N +WGG+ +++PIFVYTG+E I A
Sbjct: 48 FVTYFYNQTLDHFNYKPESYRTFQQRYIMNSEYWGGANSSSPIFVYTGDEASITAVAAFA 107
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A +F LL++IEHRYYG S+P+G E A+ N ST GY +STQALADYA LI +
Sbjct: 108 GFIVELASRFNGLLLYIEHRYYGDSVPFGSKDE-AFSNTSTLGYFTSTQALADYAELITN 166
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LKKNL+A + PV+ GGSYGGMLA+WFRLKYPH+ IGALASSAPIL FD+I ++ I
Sbjct: 167 LKKNLSAENCPVIAIGGSYGGMLASWFRLKYPHIVIGALASSAPILYFDDITPGNAYHVI 226
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAF 291
+T+DFR SE+CY I+ SW +I++ A +P GL L + F C+ + ++ +L+ +
Sbjct: 227 VTKDFRETSESCYSTIRDSWSEIDKVAAEPNGLANLSQIFMTCEPLNSSQELKYYLALCY 286
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
V +A D P PA+PVK++C AID G D+ ++ AA + N S C
Sbjct: 287 VVSAQNDNP---------PAYPVKKVCDAIDGAPEGTDIIGRV--AAGL--NASVGPPCH 333
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
+ D P SEW WQ CTEM+M G D++F+ D + + C++ +GV PRP+
Sbjct: 334 FVY-DFKPSNRSEWTWQTCTEMVMPIGHGANDTMFQAWPFDLNNHTKTCQDLFGVTPRPH 392
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
WITTEFGGH I V+ FASNIIF NGLRDP+S GGVL++IS +V+A+ +GAH +DL
Sbjct: 393 WITTEFGGHDIKSVVGNFASNIIFSNGLRDPYSAGGVLQDISDSVVAIYTDKGAHCLDLS 452
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQY 499
T DP WL + +EV+IIG W+++Y
Sbjct: 453 TPTATDPDWLVSQQEKEVKIIGLWLAEY 480
>gi|225436405|ref|XP_002271797.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
Length = 510
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 229/488 (46%), Positives = 309/488 (63%), Gaps = 26/488 (5%)
Query: 25 IFPTFPSSRITPEKLSSLISSSKDSQGL-------YKTKYHTQILDHFNYNPQSYQTFQQ 77
IFPT ++ TP KL L + ++S+ ++T ++ Q LDHFNY P+SY TFQQ
Sbjct: 29 IFPTCATA--TPSKLPRLSTILRESEIFSELISDDFQTFFYNQTLDHFNYRPESYYTFQQ 86
Query: 78 RYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKS 137
RY++N +WGG+ +APIF Y G E +++ GF D A +FKALLV+IEHRYYG+S
Sbjct: 87 RYVMNFKYWGGANASAPIFAYLGAEAALDFDLTGVGFPVDNALQFKALLVYIEHRYYGQS 146
Query: 138 IPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAA 197
IP+G +E A KNAST GY +S QA+ADYA ++ +KK L A +SPV+V GGSYGGMLA+
Sbjct: 147 IPFGSREE-ALKNASTRGYFNSAQAIADYAEVLEYIKKKLLAENSPVIVIGGSYGGMLAS 205
Query: 198 WFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEE 257
WFRLKYPHVA+GALASSAPIL FD+I + +I+T+DFR SE+CY I+ SW +I+
Sbjct: 206 WFRLKYPHVALGALASSAPILYFDDITPQNGYYSIVTKDFREASESCYSTIRESWSEIDR 265
Query: 258 TAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKE 316
A +P GL L K FR C K+ ++ +L T + A ++P P +PV
Sbjct: 266 VASEPNGLSILSKKFRTCAELNKSNELKDYLETMYAVAAQYNHP---------PRYPVTV 316
Query: 317 MCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE-WGWQACTEMIM 375
+C ID G+D+ ++++ Y G + C+ N +P SE W WQ C+EM+M
Sbjct: 317 VCGGIDGAPEGSDILSRIFAGVVA---YRGNSSCY--NTSVNPTETSEGWRWQTCSEMVM 371
Query: 376 LTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIF 435
G + D++F S + + C Y V PRP+WITT +GGH I L+L RFASNIIF
Sbjct: 372 PIGRGDNDTMFPPSPFNLTTFIQACTSLYDVPPRPHWITTYYGGHDIKLILHRFASNIIF 431
Query: 436 FNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKW 495
NGLRDP+S GVLKNIS TVLA+ G+H +D+ + DP+WL R+ EVEII W
Sbjct: 432 SNGLRDPYSSAGVLKNISHTVLAIHTVNGSHCLDILPAKSTDPEWLIMQRKTEVEIIESW 491
Query: 496 ISQYFQDL 503
I+QY DL
Sbjct: 492 IAQYHADL 499
>gi|147792728|emb|CAN64374.1| hypothetical protein VITISV_018664 [Vitis vinifera]
Length = 502
Score = 443 bits (1140), Expect = e-122, Method: Compositional matrix adjust.
Identities = 229/488 (46%), Positives = 309/488 (63%), Gaps = 26/488 (5%)
Query: 25 IFPTFPSSRITPEKLSSLISSSKDSQGL-------YKTKYHTQILDHFNYNPQSYQTFQQ 77
IFPT ++ TP KL L + ++S+ ++T ++ Q LDHFNY P+SY TFQQ
Sbjct: 21 IFPTCATA--TPSKLPRLSTILRESEIFSELISDDFQTFFYNQTLDHFNYRPESYYTFQQ 78
Query: 78 RYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKS 137
RY++N +WGG+ +APIF Y G E +++ GF D A +FKALLV+IEHRYYG+S
Sbjct: 79 RYVMNFKYWGGANASAPIFAYLGAEAALDFDLTGVGFPVDNALQFKALLVYIEHRYYGQS 138
Query: 138 IPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAA 197
IP+G +E A KNAST GY +S QA+ADYA ++ +KK L A +SPV+V GGSYGGMLA+
Sbjct: 139 IPFGSREE-ALKNASTRGYFNSAQAIADYAEVLEYIKKKLLAENSPVIVIGGSYGGMLAS 197
Query: 198 WFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEE 257
WFRLKYPHVA+GALASSAPIL FD+I + +I+T+DFR SE+CY I+ SW +I+
Sbjct: 198 WFRLKYPHVALGALASSAPILYFDDITPQNGYYSIVTKDFREASESCYSTIRESWSEIDR 257
Query: 258 TAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKE 316
A +P GL L K FR C K+ ++ +L T + A ++P P +PV
Sbjct: 258 VASEPNGLSILSKKFRTCAELNKSNELKDYLETMYAVAAQYNHP---------PRYPVTV 308
Query: 317 MCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE-WGWQACTEMIM 375
+C ID G+D+ ++++ Y G + C+ N +P SE W WQ C+EM+M
Sbjct: 309 VCGGIDGAPEGSDILSRIFAGVVA---YRGNSSCY--NTSVNPTETSEGWRWQTCSEMVM 363
Query: 376 LTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIF 435
G + D++F S + + C Y V PRP+WITT +GGH I L+L RFASNIIF
Sbjct: 364 PIGRGDNDTMFPPSPFNLTTFIQACTSLYDVPPRPHWITTYYGGHDIKLILHRFASNIIF 423
Query: 436 FNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKW 495
NGLRDP+S GVLKNIS TVLA+ G+H +D+ + DP+WL R+ EVEII W
Sbjct: 424 SNGLRDPYSSAGVLKNISHTVLAIHTVNGSHCLDILPAKSTDPEWLIMQRKTEVEIIESW 483
Query: 496 ISQYFQDL 503
I+QY DL
Sbjct: 484 IAQYHADL 491
>gi|359479403|ref|XP_003632268.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Vitis vinifera]
Length = 503
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 226/500 (45%), Positives = 313/500 (62%), Gaps = 20/500 (4%)
Query: 7 FLSFCLLFSST-LTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHF 65
L + F +T ++ + KI P S + L + S+ + ++T Y+ Q LDHF
Sbjct: 16 ILVLVIFFCATCVSATQPKILPKL--SVLGRTFLREPATFSESNSQDFQTFYYNQTLDHF 73
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
NY P+SY TFQ RY++N +WGG+ +APIF Y G E D++ GF+ D A +FKAL
Sbjct: 74 NYRPESYTTFQHRYVMNFKYWGGANASAPIFAYLGAEEDLDXILSGLGFLTDNAHRFKAL 133
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVV 185
LV+IEHRY GKSIP+G +E A KNAS GY +S QA+ADYA ++I +KK L A +SPV+
Sbjct: 134 LVYIEHRYCGKSIPFGSREE-ALKNASIRGYFNSAQAIADYAEVLIYIKKKLLAENSPVI 192
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCY 245
V GGSYGGMLA+WFRLKYPHVA+G LASSAPIL F++I + +I+T+DFR SE+CY
Sbjct: 193 VVGGSYGGMLASWFRLKYPHVALGVLASSAPILYFEDITPQNGYYSIVTKDFREASESCY 252
Query: 246 KVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAFVYTAMTDYPTPSN 304
K I+ SW +I+ A +P G+ L K FR C N ++ +L T +Y Y P
Sbjct: 253 KTIRESWSEIDRVASEPNGISILSKKFRTCDRLNNSDELKDYLDT--IYCTAAQYNDP-- 308
Query: 305 FLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE 364
P +PV C ID G+D+ ++++ Y G + C+ + +P SE
Sbjct: 309 -----PMYPVTMACSGIDGAPEGSDILSRIFAGVVA---YGGNSSCYTTS--HNPTETSE 358
Query: 365 -WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIG 423
W WQ C+EM+M G + D++F + + + C YGV PRP+W+TT +GGH I
Sbjct: 359 GWAWQTCSEMVMPIGRGDNDTMFPPAPFNLTTFIQECTSIYGVSPRPHWVTTYYGGHNIE 418
Query: 424 LVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKD 483
L+L+RFASNIIF NGLRDP+S GGVLKNIS +VLA++ G+H +D+ +T DP+WL
Sbjct: 419 LILRRFASNIIFSNGLRDPYSSGGVLKNISDSVLAILTVNGSHCLDILPATSTDPEWLVM 478
Query: 484 VRRREVEIIGKWISQYFQDL 503
R+ EVE+I WI+QY+ DL
Sbjct: 479 QRKAEVEVIESWIAQYYADL 498
>gi|359479405|ref|XP_002272041.2| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Vitis vinifera]
Length = 491
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 228/484 (47%), Positives = 308/484 (63%), Gaps = 18/484 (3%)
Query: 18 LTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQ 77
+T+ + ++P + + + S +++ S + T Y+ Q LDHFNY P+SY FQQ
Sbjct: 17 MTVCVSAMYPRTARLGMVSKSMKSALNAELSSD--FVTYYYNQTLDHFNYRPESYTNFQQ 74
Query: 78 RYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKS 137
RYLIN +WGG+ +++PIFVYTG+EG I A GFM D+A +FK LL++IEHRYYG S
Sbjct: 75 RYLINSAYWGGANSSSPIFVYTGDEGSITGAAAFAGFMVDLASRFKGLLLYIEHRYYGDS 134
Query: 138 IPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAA 197
+P+ +K+IA+ N ST GY SSTQALADYA LI +LKKNL+A + PV+ GGSYGGMLA+
Sbjct: 135 VPFR-SKDIAFNNTSTLGYFSSTQALADYAELITNLKKNLSAENCPVIAIGGSYGGMLAS 193
Query: 198 WFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEE 257
WFRLKYPHV IGALASSAPIL FD+I + +I+T+DFR SE CY IK SW +I+E
Sbjct: 194 WFRLKYPHVVIGALASSAPILYFDDITPHNGYDSIVTKDFRETSETCYTTIKQSWSEIDE 253
Query: 258 TAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEM 317
A +P GL L + FR C+ N + + L + Y A Y P PA V ++
Sbjct: 254 VAGQPNGLANLSQIFRTCE-PLNSSQQLKLYLEYTYEASAQYDNP-------PAHYVSDI 305
Query: 318 CKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLT 377
C AID G + ++ A V N S C + D P +S W WQ CTEM+M
Sbjct: 306 CNAIDGAPEGTSILGRV--AEGV--NASAGPPCHRIY-DFQPSNMSGWLWQTCTEMVMPF 360
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYG--VDPRPNWITTEFGGHKIGLVLKRFASNIIF 435
G D++F+ S D + + C++ +G V PRP+WITTEFGGH I VL FASNIIF
Sbjct: 361 GRGENDTMFQASPFDLNNYTKTCQDIFGASVTPRPHWITTEFGGHNIKSVLGNFASNIIF 420
Query: 436 FNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKW 495
NGLRDP+S GGVL++IS++V+A+ +GAH +DL DP WL R +E++I+ W
Sbjct: 421 SNGLRDPYSIGGVLQDISESVVAVYTLKGAHCLDLGTPMPSDPDWLVAQRDKEIKIVALW 480
Query: 496 ISQY 499
+++Y
Sbjct: 481 LAEY 484
>gi|356541970|ref|XP_003539445.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 509
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 220/473 (46%), Positives = 307/473 (64%), Gaps = 15/473 (3%)
Query: 33 RITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNN 92
R T E+ + SS S KT Y+TQ LDHFNY P SY TFQQRY++N +WGG+K++
Sbjct: 35 RSTKERDPEISSSLHLSDDDLKTFYYTQRLDHFNYRPDSYHTFQQRYMVNFKYWGGAKSS 94
Query: 93 APIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAS 152
APIF + G EG ++ A+ GF+ D AP+F AL+VFIEHRYYGKSIP+G ++E A +NAS
Sbjct: 95 APIFAFFGAEGPVDEDAKYIGFLRDNAPQFNALIVFIEHRYYGKSIPFGSSEE-AMRNAS 153
Query: 153 TTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALA 212
T GY +S QA+ADYA++++ +KK L+A +SP++V GGSYGGMLA+WFRLKYPH+A+GALA
Sbjct: 154 TRGYFNSAQAIADYAAVLLHIKKTLSAQNSPIIVIGGSYGGMLASWFRLKYPHIALGALA 213
Query: 213 SSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
SSAPIL F+ I + I+T+DF+ SE+CY+ I+ SW +I+ AKKP GL L K F
Sbjct: 214 SSAPILYFNGIAPQAGYYYIVTKDFKETSESCYQTIRKSWSEIDRVAKKPNGLSILSKRF 273
Query: 273 RIC-KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF 331
+ C K K+ ++ +L + + A +YP+ PVK +C AID D+
Sbjct: 274 KTCDKLNKSFDLKDYLDSLYTDAAQYNYPS---------EHPVKIVCGAIDAAAKKTDIL 324
Query: 332 AKLYGAASVYYNYSGTAKCFDLNGDSDP-HGLSEWGWQACTEMIMLTGGDNKDSIFEESE 390
+++ Y + C+D+N + P W WQ C+E+IM G + DS+F +
Sbjct: 325 GQIFEGVVAYKQHRS---CYDMNEYNHPTESFLGWRWQTCSEIIMPIGHEKNDSMFPPAP 381
Query: 391 EDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
+ + C+ YGV P+P+W+TT +GG + L+L RFASNIIF NGLRDP+S GGVL+
Sbjct: 382 FNMKTFVQECRSLYGVLPQPHWVTTYYGGPDLKLILHRFASNIIFSNGLRDPYSSGGVLE 441
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
+IS TV+A+ G H +D++ DPQWL R EV+II WI++Y DL
Sbjct: 442 SISNTVVAVTTVNGCHCLDIQSRKANDPQWLVTQRNTEVKIIKGWIAEYKADL 494
>gi|148906489|gb|ABR16397.1| unknown [Picea sitchensis]
Length = 508
Score = 440 bits (1131), Expect = e-121, Method: Compositional matrix adjust.
Identities = 237/498 (47%), Positives = 310/498 (62%), Gaps = 23/498 (4%)
Query: 4 RFIFLSFCLLFSS-TLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQIL 62
RF+ L CLL SS + + N + P+ +S L SS + Y T + Q L
Sbjct: 28 RFLILVGCLLDSSYSFNLDNWFMNPSLLTSARGKLLTMKLKSSRRPLSLDYTTNQYMQTL 87
Query: 63 DHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKF 122
DHF + P Y+TF QRYL+N T+WGG +NN+PIFV GNE DI G M + A F
Sbjct: 88 DHFTFRPDGYRTFPQRYLVNKTYWGGPQNNSPIFVCLGNEEDIITQLPYFGIMTEHAADF 147
Query: 123 KALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDS 182
+AL+VFIEHRYYG S+P+G E +Y NAST GY S++QALADYA +I DLKKNL+A D
Sbjct: 148 RALIVFIEHRYYGTSMPFGSQDE-SYANASTLGYYSASQALADYAIVITDLKKNLSADDC 206
Query: 183 PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSE 242
PVVVFGGSYGGMLAAW RLKYPH+ IGALASS+PIL F+++ ++ ++T+DFR S+
Sbjct: 207 PVVVFGGSYGGMLAAWLRLKYPHITIGALASSSPILYFEDMTPHDAYDRVVTKDFRDASD 266
Query: 243 NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF-VYTAMTDYPT 301
CY+ IK SW ++E+ A +PGGL+KL K F C+S + SW + +Y A+
Sbjct: 267 ICYRRIKESWAEMEKVASQPGGLQKLAKLFNTCQS-----LSSWTDLFYWIYPAL-QVSA 320
Query: 302 PSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHG 361
NF VK +C+ I+ G D+ +L AA Y G C +L+
Sbjct: 321 QYNFAE------VKAICRVINSQARGTDILTRL--AAGAEYANEGLG-CLNLSTT----- 366
Query: 362 LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHK 421
LS W WQ C+EM++ + ++F + D + C + YGV PRP WITTEFGGH
Sbjct: 367 LSGWDWQTCSEMVIPLAPNANTTMFPSAPFDLKSYFSGCFKTYGVLPRPYWITTEFGGHN 426
Query: 422 IGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWL 481
I VLKRF SNIIF NGLRDP+S GGVLKNIS +++A+ KEG H D+R ST +DP WL
Sbjct: 427 IKRVLKRFGSNIIFSNGLRDPYSSGGVLKNISDSIVAITAKEGVHCEDIRSSTNDDPNWL 486
Query: 482 KDVRRREVEIIGKWISQY 499
K+ R++E+ II KWI Y
Sbjct: 487 KEQRQKEINIIRKWIIDY 504
>gi|224143318|ref|XP_002324914.1| predicted protein [Populus trichocarpa]
gi|222866348|gb|EEF03479.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 440 bits (1131), Expect = e-121, Method: Compositional matrix adjust.
Identities = 226/475 (47%), Positives = 302/475 (63%), Gaps = 18/475 (3%)
Query: 33 RITPEKLSSLISSSKDSQGLY---KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGS 89
R+ P S S+S + +Y +T Y+ Q LDHFNY P S+ FQQRY+IN +WGG+
Sbjct: 5 RVKPLGRISRNSASFAASSIYHNLQTFYYNQTLDHFNYRPDSFDMFQQRYVINSKYWGGA 64
Query: 90 KNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
+NAPIFVY G E +E + G + + A +FKAL V+IEHRYYGKSIP+G E A+K
Sbjct: 65 NSNAPIFVYFGEEAPLENDFGDIGILAENAHRFKALQVYIEHRYYGKSIPFGSRNE-AFK 123
Query: 150 NASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIG 209
NAST GY +S QALADYA +II + + SPV+V G SYGGMLA+WFRLKYPH+A+G
Sbjct: 124 NASTLGYFNSAQALADYAEIIIHVNEKFHVQRSPVIVVGASYGGMLASWFRLKYPHIALG 183
Query: 210 ALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQ 269
ALASSAPIL F +I +++ +I+T+DFR S++C+ IK SW I++ A +P GL L
Sbjct: 184 ALASSAPILYFTDITPAHAYVSIVTKDFREDSQSCHDTIKKSWTVIDKIASEPDGLSILS 243
Query: 270 KAFRICKSEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN 328
K F CK N + + +L A +Y A Y P P++PV +CK+ID+P GN
Sbjct: 244 KKFETCKPLNNSSELTDYL--AGIYMAAAQYDAP-------PSYPVTMVCKSIDEPSFGN 294
Query: 329 DVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEE 388
D+ +++ Y G C+ + W WQ C +M++ G N DS+F+
Sbjct: 295 DILGRIFAGMVA---YQGELPCYVNEPTKETETDVGWSWQTCADMVIPFGISN-DSMFQP 350
Query: 389 SEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV 448
D +A CK+ YGV PRP+W+TT FGGH I L+LKRF SNIIF NGLRDP+S GGV
Sbjct: 351 YPFDLNAYINDCKDEYGVPPRPHWVTTYFGGHDIKLILKRFGSNIIFSNGLRDPYSSGGV 410
Query: 449 LKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
L+NIS +V+A+ +G+H +D+ +TK DPQWL R+ EV II KWI YF DL
Sbjct: 411 LQNISDSVVAITTVKGSHCLDVLATTKSDPQWLVAQRKEEVRIIRKWIRNYFSDL 465
>gi|297734875|emb|CBI17109.3| unnamed protein product [Vitis vinifera]
Length = 975
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 224/472 (47%), Positives = 302/472 (63%), Gaps = 21/472 (4%)
Query: 38 KLSSLISSSKDSQGL----YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNA 93
KLS+++ S+ L ++T ++ Q LDHFNY P+SY TFQQRY++N +WGG+ +A
Sbjct: 15 KLSTILRESEIFSELISDDFQTFFYNQTLDHFNYRPESYYTFQQRYVMNFKYWGGANASA 74
Query: 94 PIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
PIF Y G E +++ GF D A +FKALLV+IEHRYYG+SIP+G +E A KNAST
Sbjct: 75 PIFAYLGAEAALDFDLTGVGFPVDNALQFKALLVYIEHRYYGQSIPFGSREE-ALKNAST 133
Query: 154 TGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALAS 213
GY +S QA+ADYA ++ +KK L A +SPV+V GGSYGGMLA+WFRLKYPHVA+GALAS
Sbjct: 134 RGYFNSAQAIADYAEVLEYIKKKLLAENSPVIVIGGSYGGMLASWFRLKYPHVALGALAS 193
Query: 214 SAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFR 273
SAPIL FD+I + +I+T+DFR SE+CY I+ SW +I+ A +P GL L K FR
Sbjct: 194 SAPILYFDDITPQNGYYSIVTKDFREASESCYSTIRESWSEIDRVASEPNGLSILSKKFR 253
Query: 274 ICKS-EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA 332
C K+ ++ +L T + A ++P P +PV +C ID G+D+ +
Sbjct: 254 TCAELNKSNELKDYLETMYAVAAQYNHP---------PRYPVTVVCGGIDGAPEGSDILS 304
Query: 333 KLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE-WGWQACTEMIMLTGGDNKDSIFEESEE 391
+++ Y G + C+ N +P SE W WQ C+EM+M G + D++F S
Sbjct: 305 RIFAGVVA---YRGNSSCY--NTSVNPTETSEGWRWQTCSEMVMPIGRGDNDTMFPPSPF 359
Query: 392 DYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN 451
+ + C Y V PRP+WITT +GGH I L+L RFASNIIF NGLRDP+S GVLKN
Sbjct: 360 NLTTFIQACTSLYDVPPRPHWITTYYGGHDIKLILHRFASNIIFSNGLRDPYSSAGVLKN 419
Query: 452 ISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
IS TVLA+ G+H +D+ + DP+WL R+ EVEII WI+QY DL
Sbjct: 420 ISHTVLAIHTVNGSHCLDILPAKSTDPEWLIMQRKTEVEIIESWIAQYHADL 471
Score = 434 bits (1116), Expect = e-119, Method: Compositional matrix adjust.
Identities = 224/505 (44%), Positives = 312/505 (61%), Gaps = 24/505 (4%)
Query: 7 FLSFCLLFSSTLTISNAKIFPTFPSSR---ITPEKLSSLISSSKDSQGLYKTKYHTQILD 63
FL L+ S ++ + + P SR PE + S KD KT ++ Q LD
Sbjct: 487 FLIPTLILSCCVSAAQFNVPRLGPLSRGILRNPEPAAVSESFYKD----LKTFFYAQTLD 542
Query: 64 HFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFK 123
HFNY P+SY+TF+QRY++N HWGG+K APIF Y G E ++ N GF+ D A +F
Sbjct: 543 HFNYRPESYKTFRQRYVMNFKHWGGAKAGAPIFAYLGAEAPLDGDLVNIGFVNDNAARFN 602
Query: 124 ALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSP 183
ALL++IEHRYYGKSIP+G K +A KNAST GY +S QA+ADYA++++ +KK L A +SP
Sbjct: 603 ALLIYIEHRYYGKSIPFGSTK-VALKNASTLGYFNSAQAIADYAAVLMHVKKRLHAQNSP 661
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN 243
V+V GGSYGGMLA+WFRLKYPH+A+GALASSAPIL FD I + +I+T+DFR SE+
Sbjct: 662 VIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDEIAPEIGYYSIVTKDFREASES 721
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPTP 302
CY+ I+ SW +I+ A KP GL L K F+ C E + ++ +L + + A + P
Sbjct: 722 CYRTIRRSWSEIDRIASKPNGLSILSKRFKTCAHLESSFELKDYLDSIYAEAAQYNEP-- 779
Query: 303 SNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLY-GAASVYYNYSGTAKCFDLNGDSDP-H 360
P +PV +CK I+ D +++ G ++ +G C+D + P
Sbjct: 780 -------PTYPVTVVCKGINGASKRTDTLGRIFHGLVAI----AGKRSCYDTKEFNYPTE 828
Query: 361 GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGH 420
W WQ C+EM++ G D++F+ + + + C Y V PRP+W+TT +GG
Sbjct: 829 TYLGWRWQKCSEMVLPIGHATNDTMFQPEPFNLNRFIKECNSLYSVSPRPHWVTTYYGGR 888
Query: 421 KIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQW 480
I L+L RFASNIIF NGLRDP+S GGVL+NIS T++A+ + G+H +D+ S K DPQW
Sbjct: 889 DIKLILHRFASNIIFSNGLRDPYSSGGVLENISDTLVAVYTRHGSHCLDILPSQKSDPQW 948
Query: 481 LKDVRRREVEIIGKWISQYFQDLAQ 505
L R+ EVEII W+ +Y+ DLA
Sbjct: 949 LVMQRKMEVEIIKGWMDKYYTDLAN 973
>gi|225448871|ref|XP_002270231.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
gi|296083471|emb|CBI23429.3| unnamed protein product [Vitis vinifera]
Length = 503
Score = 437 bits (1123), Expect = e-120, Method: Compositional matrix adjust.
Identities = 224/500 (44%), Positives = 316/500 (63%), Gaps = 19/500 (3%)
Query: 12 LLFSSTLTISNAKIFPTFP----SSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNY 67
L +++++ + +K P SS E SSL + S + ++T ++ Q LDHFNY
Sbjct: 15 LFITASVSATPSKKIPRLGVLRGSSLSVLEGSSSLRTVSVNLSENFQTFFYPQTLDHFNY 74
Query: 68 NPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLV 127
P+SY TFQ RY++N +WGG+++ A IFVY G E D++ + GF+ D +F ALLV
Sbjct: 75 RPESYTTFQHRYMVNFNYWGGARSAAQIFVYLGEESDLDKDINSIGFLVDNGARFGALLV 134
Query: 128 FIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVF 187
+IEHRYYGKS P+G ++ + +NAS GY +S QALADYA +II+LKKNL+A SPV+V
Sbjct: 135 YIEHRYYGKSNPFGSMQK-SLQNASRRGYFNSGQALADYAEVIINLKKNLSADSSPVIVV 193
Query: 188 GGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKV 247
GGSYGG+LAAWFRLKYPHVA+GALASSAPIL FD+I + +++T+DFR SE+CY
Sbjct: 194 GGSYGGLLAAWFRLKYPHVALGALASSAPILYFDDITPQDGYYSLVTKDFRDFSESCYNT 253
Query: 248 IKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPTPSNFL 306
IK SW +I++ A + GL L K FR CK E ++ +L T + A D P
Sbjct: 254 IKDSWAEIDKAAAEANGLLNLSKKFRTCKPLESASQLKDYLETMYSIAAQYDRP------ 307
Query: 307 NPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE-W 365
P +PV +C ID G D+ +++ G C+D+ S P E W
Sbjct: 308 ---PMYPVTVVCNGIDGGLQGTDILDRIFSGIVA---SRGNKSCYDMGQSSFPSETEEGW 361
Query: 366 GWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLV 425
WQ C+E+++ G + D++F + D+ A CK +YGV PRP+WIT+ +GGH I L+
Sbjct: 362 NWQVCSELVIPIGRGSNDTMFPAAPFDFKEYADSCKYSYGVTPRPHWITSYYGGHNIKLI 421
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
LKRF SNIIF NGLRDP+S GGVL++IS +++A+ G+H +D+ ST++DP WL R
Sbjct: 422 LKRFGSNIIFSNGLRDPYSSGGVLEDISHSIIAVHTPRGSHCLDILPSTEDDPNWLVLQR 481
Query: 486 RREVEIIGKWISQYFQDLAQ 505
E+EII W+ +Y++DL Q
Sbjct: 482 NVEIEIIHGWLLKYYEDLLQ 501
>gi|297734879|emb|CBI17113.3| unnamed protein product [Vitis vinifera]
Length = 623
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 224/487 (45%), Positives = 306/487 (62%), Gaps = 21/487 (4%)
Query: 18 LTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQ 77
+T+ + ++P + + + S +++ S + T Y+ Q LDHFNY P+SY FQQ
Sbjct: 17 MTVCVSAMYPRTARLGMVSKSMKSALNAELSSD--FVTYYYNQTLDHFNYRPESYTNFQQ 74
Query: 78 RYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVF---IEHRYY 134
RYLIN +WGG+ +++PIFVYTG+EG I A GFM D+A +FK LL++ ++HRYY
Sbjct: 75 RYLINSAYWGGANSSSPIFVYTGDEGSITGAAAFAGFMVDLASRFKGLLLYYLILQHRYY 134
Query: 135 GKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGM 194
G S+P+ +K+IA+ N ST GY SSTQALADYA LI +LKKNL+A + PV+ GGSYGGM
Sbjct: 135 GDSVPFR-SKDIAFNNTSTLGYFSSTQALADYAELITNLKKNLSAENCPVIAIGGSYGGM 193
Query: 195 LAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQ 254
LA+WFRLKYPHV IGALASSAPIL FD+I + +I+T+DFR SE CY IK SW +
Sbjct: 194 LASWFRLKYPHVVIGALASSAPILYFDDITPHNGYDSIVTKDFRETSETCYTTIKQSWSE 253
Query: 255 IEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPV 314
I+E A +P GL L + FR C+ N + + L + Y A Y P PA V
Sbjct: 254 IDEVAGQPNGLANLSQIFRTCE-PLNSSQQLKLYLEYTYEASAQYDNP-------PAHYV 305
Query: 315 KEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMI 374
++C AID G + ++ N S C + D P +S W WQ CTEM+
Sbjct: 306 SDICNAIDGAPEGTSILGRVAEGV----NASAGPPCHRIY-DFQPSNMSGWLWQTCTEMV 360
Query: 375 MLTGGDNKDSIFEESEEDYDARARYCKEAYG--VDPRPNWITTEFGGHKIGLVLKRFASN 432
M G D++F+ S D + + C++ +G V PRP+WITTEFGGH I VL FASN
Sbjct: 361 MPFGRGENDTMFQASPFDLNNYTKTCQDIFGASVTPRPHWITTEFGGHNIKSVLGNFASN 420
Query: 433 IIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEII 492
IIF NGLRDP+S GGVL++IS++V+A+ +GAH +DL DP WL R +E++I+
Sbjct: 421 IIFSNGLRDPYSIGGVLQDISESVVAVYTLKGAHCLDLGTPMPSDPDWLVAQRDKEIKIV 480
Query: 493 GKWISQY 499
W+++Y
Sbjct: 481 ALWLAEY 487
Score = 44.3 bits (103), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK 279
SE+CYK I+ SW +I + A +P GL L K F CK E+
Sbjct: 532 SESCYKTIQESWPEINKVASEPNGLLILSKKFGTCKGEQ 570
>gi|225436403|ref|XP_002271733.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Vitis vinifera]
Length = 503
Score = 434 bits (1115), Expect = e-119, Method: Compositional matrix adjust.
Identities = 224/504 (44%), Positives = 312/504 (61%), Gaps = 24/504 (4%)
Query: 7 FLSFCLLFSSTLTISNAKIFPTFPSSR---ITPEKLSSLISSSKDSQGLYKTKYHTQILD 63
FL L+ S ++ + + P SR PE + S KD KT ++ Q LD
Sbjct: 15 FLIPTLILSCCVSAAQFNVPRLGPLSRGILRNPEPAAVSESFYKD----LKTFFYAQTLD 70
Query: 64 HFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFK 123
HFNY P+SY+TF+QRY++N HWGG+K APIF Y G E ++ N GF+ D A +F
Sbjct: 71 HFNYRPESYKTFRQRYVMNFKHWGGAKAGAPIFAYLGAEAPLDGDLVNIGFVNDNAARFN 130
Query: 124 ALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSP 183
ALL++IEHRYYGKSIP+G K +A KNAST GY +S QA+ADYA++++ +KK L A +SP
Sbjct: 131 ALLIYIEHRYYGKSIPFGSTK-VALKNASTLGYFNSAQAIADYAAVLMHVKKRLHAQNSP 189
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN 243
V+V GGSYGGMLA+WFRLKYPH+A+GALASSAPIL FD I + +I+T+DFR SE+
Sbjct: 190 VIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDEIAPEIGYYSIVTKDFREASES 249
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPTP 302
CY+ I+ SW +I+ A KP GL L K F+ C E + ++ +L + + A + P
Sbjct: 250 CYRTIRRSWSEIDRIASKPNGLSILSKRFKTCAHLESSFELKDYLDSIYAEAAQYNEP-- 307
Query: 303 SNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLY-GAASVYYNYSGTAKCFDLNGDSDP-H 360
P +PV +CK I+ D +++ G ++ +G C+D + P
Sbjct: 308 -------PTYPVTVVCKGINGASKRTDTLGRIFHGLVAI----AGKRSCYDTKEFNYPTE 356
Query: 361 GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGH 420
W WQ C+EM++ G D++F+ + + + C Y V PRP+W+TT +GG
Sbjct: 357 TYLGWRWQKCSEMVLPIGHATNDTMFQPEPFNLNRFIKECNSLYSVSPRPHWVTTYYGGR 416
Query: 421 KIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQW 480
I L+L RFASNIIF NGLRDP+S GGVL+NIS T++A+ + G+H +D+ S K DPQW
Sbjct: 417 DIKLILHRFASNIIFSNGLRDPYSSGGVLENISDTLVAVYTRHGSHCLDILPSQKSDPQW 476
Query: 481 LKDVRRREVEIIGKWISQYFQDLA 504
L R+ EVEII W+ +Y+ DLA
Sbjct: 477 LVMQRKMEVEIIKGWMDKYYTDLA 500
>gi|449456064|ref|XP_004145770.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
gi|449496213|ref|XP_004160074.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 499
Score = 431 bits (1107), Expect = e-118, Method: Compositional matrix adjust.
Identities = 228/501 (45%), Positives = 311/501 (62%), Gaps = 19/501 (3%)
Query: 7 FLSFCL-LFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHF 65
+L F L + S+ +T + +I P R ++ SS D +KT Y+ Q LDHF
Sbjct: 11 WLPFILFILSNCVTATQYRIPRLSPIGRTFLHNAEAIPSSISDD---FKTFYYNQTLDHF 67
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
NY P+SY F RY+IN +WGG+ ++API Y G EG +E GFM D A +F AL
Sbjct: 68 NYRPESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNAIGFMTDNAARFDAL 127
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVV 185
LV+IEHRYYGKS+P+G +E A KNAST GY SS QA+ADYA+++I LK+ A DSPV+
Sbjct: 128 LVYIEHRYYGKSMPFGSREE-ALKNASTLGYFSSAQAIADYAAVLIHLKQKYHAKDSPVI 186
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCY 245
V GGSYGGMLAAWFRLKYPHVA+GALASSAPIL F++I + +I T+DFR VSE CY
Sbjct: 187 VLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFEDITPHNGYYSIATKDFREVSETCY 246
Query: 246 KVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPTPSN 304
+ I+ SW +IE KP GL L K F+ C + +E +L + + A ++P
Sbjct: 247 ETIRDSWSKIEIIGSKPNGLSILSKEFKTCSPLNSSSQLEDYLWSMYAGAAQYNHP---- 302
Query: 305 FLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE 364
P +PV +C ID G+ + +K+ AA V + Y G C+++ S+
Sbjct: 303 -----PRYPVTRICGGIDGASPGSGIISKV--AAGV-FAYKGNLSCYNIGPRSETETDVG 354
Query: 365 WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGL 424
W WQ C+EM+M N D++F D + YC + YGV RP+W+TT +GG+ I L
Sbjct: 355 WRWQRCSEMVMPLSTTN-DTMFPPITFDLKSFVDYCYQLYGVSSRPHWVTTYYGGNDIKL 413
Query: 425 VLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDV 484
+L+RF SNIIF NGLRDP+S GGVL+N+S ++LA+ +G+H +D+ + + DPQWL
Sbjct: 414 ILQRFGSNIIFSNGLRDPYSSGGVLQNLSDSLLAVHTPKGSHCLDILRANETDPQWLVKQ 473
Query: 485 RRREVEIIGKWISQYFQDLAQ 505
R EV II WIS+Y+ DL +
Sbjct: 474 RETEVRIIEGWISKYYADLEK 494
>gi|356498058|ref|XP_003517871.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 471
Score = 430 bits (1106), Expect = e-118, Method: Compositional matrix adjust.
Identities = 220/455 (48%), Positives = 293/455 (64%), Gaps = 23/455 (5%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
KT Y Q+LDHFNY P+SY TFQQRYLIN +WGG+ ++APIF Y G E I+ G
Sbjct: 23 KTFYFKQVLDHFNYRPESYTTFQQRYLINFKYWGGANSSAPIFAYFGAESPIDNSPNGVG 82
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
F+ D A F ALLV+IEHRYYGKS+ +G +E A KNAST GY +S QALADYAS++ +
Sbjct: 83 FLTDNAASFNALLVYIEHRYYGKSVQFGSREE-ALKNASTIGYFNSAQALADYASILKHV 141
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
KK L A +SPV+V GGSYGGMLA+WFRLKYPH+AIGALASSAPIL FD I + + +
Sbjct: 142 KKTLHAKNSPVIVIGGSYGGMLASWFRLKYPHLAIGALASSAPILYFDKITPQNGYYSTV 201
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFV 292
T+D+R SE CY+ + SW +I A +P GL L + F C + ++ + +L + +V
Sbjct: 202 TRDYREASETCYETVLKSWSEIRRIASQPNGLVTLSQRFNTCHTLNQSYELIDYLRSTYV 261
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFD 352
Y A + P P +PV +C ID G+D+ +K+Y G + C
Sbjct: 262 YAAQYNQP---------PRYPVSMICGGIDGESLGSDILSKIYAGIVA---LRGNSTC-K 308
Query: 353 LNGDSDPHGLSE----WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDP 408
+NG P +SE W WQ C+EM++ G N D++FE + A CKE YGV P
Sbjct: 309 VNG---PTNVSETTVGWRWQTCSEMVIPIGIGN-DTMFEPIPFNLTRYAEGCKEQYGVSP 364
Query: 409 RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHV 468
RP+W+TT +GGH I LVL+R SNIIF NGLRDP+S GGVL NIS +++A+ G+H +
Sbjct: 365 RPHWVTTYYGGHNIKLVLRRLGSNIIFSNGLRDPYSIGGVLDNISDSIVAVHTVNGSHCL 424
Query: 469 DLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
DL + + DP WL + R++EV+II +WI+QY+ DL
Sbjct: 425 DLLRANQSDPGWLVEQRKKEVKIIKRWITQYYADL 459
>gi|224109994|ref|XP_002333168.1| predicted protein [Populus trichocarpa]
gi|222835051|gb|EEE73500.1| predicted protein [Populus trichocarpa]
Length = 446
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 214/451 (47%), Positives = 299/451 (66%), Gaps = 16/451 (3%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y+TQ LDHFNY P+SY TF+QRY+I+ +WGG+ +APIFV+ G E D++ GF+
Sbjct: 3 YYTQTLDHFNYKPESYTTFRQRYVIDFRYWGGANTSAPIFVFFGAEEDLDDDLDAIGFLS 62
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
D AP FKALL++IEHRYYG+SIP+G KE A KNA T GYL+S QA+ADYA++I+ LKK
Sbjct: 63 DNAPHFKALLIYIEHRYYGRSIPFGSRKE-ALKNAETLGYLNSAQAMADYAAVIMHLKKK 121
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
+A +SPV+V GGSYGGML +WFRLKYPH+A+GALASSAPIL FD+I + +I+T+D
Sbjct: 122 YSAKNSPVIVIGGSYGGMLTSWFRLKYPHIALGALASSAPILYFDDISPQEGYYSIVTKD 181
Query: 237 FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-KSEKNLAIESWLSTAFVYTA 295
F+ SE+CY I+ SW +IE+ A KP GL L K F+ C + +E +L + + A
Sbjct: 182 FKETSESCYNTIRKSWGEIEKIASKPNGLSILSKKFKTCFPLNRTFELEDFLDSIYFEAA 241
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG-NDVFAKLYGAASVYYNYSGTAKCFDLN 354
DYP P FPV +C I+ D+ ++ +V Y G C D+N
Sbjct: 242 QYDYP---------PEFPVSIVCGGINKASAARTDILDRI---LAVVVAYMGNRSCHDMN 289
Query: 355 GDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWIT 414
+ P + EW WQ C+E++M G ++ +S+F + + + + CK +GV P+P+WIT
Sbjct: 290 AFNYPDAIYEWRWQTCSEIVMPIGHES-NSMFPPAPFNLNDYIKDCKSLFGVLPQPHWIT 348
Query: 415 TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFST 474
T +GGH I L+L+RF SNIIF NGLRDP+S GGVL NIS +++A+ G+H +D++ ++
Sbjct: 349 TYYGGHDIKLILQRFGSNIIFSNGLRDPYSSGGVLNNISDSIVAVSTVNGSHCLDIQRAS 408
Query: 475 KEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
DP WL R+ EV+II WIS+Y+ DL +
Sbjct: 409 PSDPHWLVMQRKIEVKIIEGWISKYYTDLLE 439
>gi|449456174|ref|XP_004145825.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 491
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 220/454 (48%), Positives = 292/454 (64%), Gaps = 16/454 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+KT Y Q LDHFNY P+SY TF QRY+IN +WGG+ ++API Y G E I+
Sbjct: 50 FKTFYFNQTLDHFNYRPESYTTFPQRYIINFKYWGGANSSAPILAYLGPEAPIDSAMNVI 109
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM D A KF ALLV+IEHRYYGKSIP+G KE A +NAST GY +S QALADYA+++I
Sbjct: 110 GFMTDNAVKFNALLVYIEHRYYGKSIPFGSRKE-ALRNASTLGYFNSAQALADYAAILIH 168
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+KK +A SPV+V GGSYGGMLA WFRLKYPHVA+GALASSAPIL F++I + I
Sbjct: 169 VKKEFSAKYSPVIVIGGSYGGMLATWFRLKYPHVALGALASSAPILYFNDITPENGYYVI 228
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK-NLAIESWLSTAF 291
+T+DFR VS+ CY+ I+ SW +IE A + GL L K F+ C + + +E++L F
Sbjct: 229 VTKDFREVSQTCYESIRESWSEIETVASQSNGLSVLDKVFKTCSPLRSSTQLENYL--WF 286
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
+Y + Y PS +PV +C AID + N K+ AA V + Y G C+
Sbjct: 287 MYASAAQYNHPSR-------YPVNRICDAIDQTYS-NGTLGKI--AAGV-FAYRGELSCY 335
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
+ W WQ C+EM+M N D++F D+++ + YC + YGV PRP+
Sbjct: 336 INEPINTTETTVGWQWQRCSEMVMPISTGN-DTMFPSETFDHESFSIYCNQLYGVTPRPH 394
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
W+TT +GGH I L+L RFASNIIF NGL+DP+S GGVL NIS ++LA+ G+H +D+
Sbjct: 395 WVTTYYGGHDIHLILHRFASNIIFSNGLKDPYSIGGVLHNISDSLLAVYTANGSHCLDIL 454
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+ + DP+WL R+ EV II +WI +Y+ DLA
Sbjct: 455 TANRMDPEWLVTQRKTEVGIIKEWIDEYYADLAN 488
>gi|147820652|emb|CAN65360.1| hypothetical protein VITISV_036071 [Vitis vinifera]
Length = 580
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 217/500 (43%), Positives = 312/500 (62%), Gaps = 19/500 (3%)
Query: 12 LLFSSTLTISNAKIFPTFP----SSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNY 67
L +++++ + +K P SS E SSL + S + ++T ++ Q LDHFNY
Sbjct: 15 LFITASVSATPSKKIPRLGVLRGSSLSVLEGSSSLRTVSVNLSENFQTFFYPQTLDHFNY 74
Query: 68 NPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLV 127
P+SY TFQ RY++N +WGG+++ A IFVY G E D++ + GF+ + +F ALLV
Sbjct: 75 RPESYTTFQHRYMVNFNYWGGARSAAXIFVYLGEESDLDKDINSIGFLVENGARFGALLV 134
Query: 128 FIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVF 187
+IEHRYYGKS P+G ++ + +NA GY +S QALAD+A +II+LKKNL+A SPV+V
Sbjct: 135 YIEHRYYGKSNPFGSMQK-SLQNAGQRGYFNSGQALADFAEVIINLKKNLSADSSPVIVV 193
Query: 188 GGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKV 247
GGS GG+LAAWFRLKYPHVA+GALASSAPIL FD+I + +++T+DFR SE+CY
Sbjct: 194 GGSDGGLLAAWFRLKYPHVALGALASSAPILYFDDITPQDGYYSLVTKDFRDFSESCYNT 253
Query: 248 IKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPTPSNFL 306
IK SW +I++ A + GL L K FR CK E ++ +L T + A D P
Sbjct: 254 IKDSWAEIDKAAAEANGLLNLSKKFRTCKPLESASQLKDYLETMYSIAAQYDRP------ 307
Query: 307 NPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE-W 365
P +PV +C ID G D+ +++ G C+D+ S P E W
Sbjct: 308 ---PMYPVTIVCNGIDGGLQGTDILGRIFSGIVA---SRGNKSCYDMGQSSFPSETEEGW 361
Query: 366 GWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLV 425
WQ C+E+++ G + D++F + D+ CK +YGV PRP+WIT+ +GGH I L+
Sbjct: 362 NWQVCSELVIPIGRGSNDTMFPAAPFDFKEYDDSCKYSYGVTPRPHWITSYYGGHNIKLI 421
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
LKRF SNIIF NGLRDP+S GGVL++IS +++A+ G+H +D+ S ++DP WL R
Sbjct: 422 LKRFGSNIIFSNGLRDPYSSGGVLEDISHSIIAVHTPRGSHCLDILPSREDDPNWLVLQR 481
Query: 486 RREVEIIGKWISQYFQDLAQ 505
E+EI+ W+ +Y++DL Q
Sbjct: 482 NVEIEILHGWLLKYYEDLLQ 501
>gi|224109890|ref|XP_002315345.1| predicted protein [Populus trichocarpa]
gi|222864385|gb|EEF01516.1| predicted protein [Populus trichocarpa]
Length = 513
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 216/469 (46%), Positives = 293/469 (62%), Gaps = 18/469 (3%)
Query: 38 KLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFV 97
K+ SL S ++ ++T Y+ Q LDHFNY P+SY+TFQ RY+++ HW G APIFV
Sbjct: 53 KVQSLAPSDQE----FRTFYYNQTLDHFNYRPESYKTFQHRYVVSFKHWRGPDTMAPIFV 108
Query: 98 YTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYL 157
Y G E + G + D A +F AL V+IEHR+YG+SIP+ +E A K+A+ GY
Sbjct: 109 YLGEESSLNDDLGYIGILSDNAARFGALQVYIEHRFYGESIPFVSREE-ALKDANLRGYF 167
Query: 158 SSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
SS Q LADYA +I+ +KK +A SPV+VFGGSYGGMLAAWFRLKYPHVA+GALASSAP+
Sbjct: 168 SSAQTLADYAEVILHIKKKHSADSSPVIVFGGSYGGMLAAWFRLKYPHVALGALASSAPV 227
Query: 218 LNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS 277
L FDNI + ++T+DF+ SE+CYK IK SW +I++ A K GL LQK F CK
Sbjct: 228 LYFDNITPSNGYYTVVTKDFKESSESCYKTIKQSWFEIDKVAAKADGLSILQKKFNTCKP 287
Query: 278 -EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYG 336
E ++++L + F A D P P +PV +CK ID G+DV +++
Sbjct: 288 LEAATELKNFLDSLFSVAAQYDRP---------PRYPVDLVCKGIDSAPEGSDVLDRIFS 338
Query: 337 AASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR 396
Y+ G C++L+ L W WQ C+E+++ G + D++F D
Sbjct: 339 GIVAYF---GKKPCYNLDAFFSSETLEGWTWQTCSELVIPIGRGSNDTMFPAEPFDLKEY 395
Query: 397 ARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTV 456
CK A+GV PRP+WITT +GGH VL+RF SNIIF NGLRDP+S GGVL+NIS ++
Sbjct: 396 IEECKSAFGVPPRPHWITTYYGGHHFKEVLRRFGSNIIFSNGLRDPYSSGGVLENISDSI 455
Query: 457 LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
LA+ +GAH +D+ +T DP W+ R E+EII WI +Y+QDL +
Sbjct: 456 LAVYTTKGAHCMDILPATIGDPDWVVLQRNIEIEIINGWILKYYQDLLE 504
>gi|449496306|ref|XP_004160099.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 491
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 219/454 (48%), Positives = 292/454 (64%), Gaps = 16/454 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+KT Y Q LDHFNY P+SY TF QRY+IN +WGG+ ++API Y G E I+
Sbjct: 50 FKTFYFNQTLDHFNYRPESYTTFPQRYIINFKYWGGAYSSAPILAYLGPEAPIDSAMNVI 109
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM D A KF ALLV+IEHRYYGKSIP+G KE A +NAST GY +S QA+ADYA+++I
Sbjct: 110 GFMTDNAVKFNALLVYIEHRYYGKSIPFGSRKE-ALRNASTLGYFNSAQAIADYAAILIH 168
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+KK +A SPV+V GGSYGGMLA WFRLKYPHVA+GALASSAPIL F++I + I
Sbjct: 169 VKKEFSAKYSPVIVIGGSYGGMLATWFRLKYPHVALGALASSAPILYFNDITPENGYYVI 228
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK-NLAIESWLSTAF 291
+T+DFR VS+ CY+ I+ SW +IE A + GL L K F+ C + + +E++L F
Sbjct: 229 VTKDFREVSQTCYESIRESWSEIETVASQSNGLSVLDKEFKTCSPLRSSTQLENYL--WF 286
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
+Y + Y PS +PV +C AID + N K+ AA V + Y G C+
Sbjct: 287 MYASAAQYNHPSR-------YPVNRICDAIDQTYS-NGTLGKI--AAGV-FAYRGELSCY 335
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
+ W WQ C+EM+M N D++F D+++ + YC + YGV PRP+
Sbjct: 336 INEPINTTETTVGWQWQRCSEMVMPISTGN-DTMFPSETFDHESFSIYCNQLYGVTPRPH 394
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
W+TT +GGH I L+L RFASNIIF NGL+DP+S GGVL NIS ++LA+ G+H +D+
Sbjct: 395 WVTTYYGGHDIHLILHRFASNIIFSNGLKDPYSIGGVLHNISDSLLAVYTANGSHCLDIL 454
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+ + DP+WL R+ EV II +WI +Y+ DLA
Sbjct: 455 TANRMDPEWLVTQRKTEVGIIKEWIDEYYADLAN 488
>gi|255565519|ref|XP_002523750.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537054|gb|EEF38690.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
Length = 491
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 207/453 (45%), Positives = 285/453 (62%), Gaps = 17/453 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+ Y+TQ LDHFNYNPQSY TFQQRY++N +WGG+ ++PIFVYTG E D+ + +
Sbjct: 53 YEIHYYTQTLDHFNYNPQSYATFQQRYILNFKYWGGANTSSPIFVYTGEEVDVTY---DV 109
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
+ +A +FKALL++IEHRYYG+S+P+G + A++N+ST GYLSS QALADYA ++ D
Sbjct: 110 DTILHLAARFKALLLYIEHRYYGESMPFGSEDQ-AFQNSSTLGYLSSEQALADYAQVVTD 168
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+KK L+A + P + G SYGGMLAAWFRLKYPH+ IG+LASS+PIL FD+I + +
Sbjct: 169 VKKKLSAENCPAIAVGASYGGMLAAWFRLKYPHIVIGSLASSSPILYFDDITPQNGYHVV 228
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAF 291
+T+D+R SE+CY IK SW +I+ A +P GL L F C + + +L +
Sbjct: 229 VTKDYRDTSESCYNTIKQSWSEIDRVAAQPNGLLTLSNMFNTCSPLNSSTELREYLEILY 288
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
V A D P P PV+ C+ ID G D+ ++ A + + C+
Sbjct: 289 VSAAQYDNP---------PYNPVQNTCRGIDGAPPGTDILGRI--VAGLKSRIPSWSSCY 337
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
D+ D S W WQ CTEM+ G +++F+ S D + C + +G+ PRP
Sbjct: 338 DV-PTWDLSNKSAWDWQTCTEMVFPIGYGYNETMFQPSPFDINNYTEACVQVFGIKPRPQ 396
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
W TTEFGGH I VL FASNIIF NGLRDPWS GGVL++IS TV+A+ + GAH +DL
Sbjct: 397 WATTEFGGHDIKTVLGNFASNIIFANGLRDPWSAGGVLEDISDTVVAVYTEYGAHCLDLY 456
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQDLA 504
T DP WL + R +E++II WI++Y+ L+
Sbjct: 457 PPTPNDPDWLVEQRDKEIKIIAAWIAEYYAKLS 489
>gi|356543520|ref|XP_003540208.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Glycine max]
Length = 511
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 219/514 (42%), Positives = 322/514 (62%), Gaps = 29/514 (5%)
Query: 1 MATRFIFLSFCLLFSSTLTISNAKIFPTFPSS---RITPEKLSSLISSSKDSQGLYKTKY 57
M ++++ F +S+ LT+S++ P T ++++++ D++ T Y
Sbjct: 6 MLSKWLLFIFLTYYSTYLTVSHSLKIPRLSPIAEWETTLHNRATVVATDTDTENRV-TFY 64
Query: 58 HTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYD 117
+ Q+LDHF P+SY+TFQQRYLIN +WGG+ ++APIF Y G E I+ + GF+ D
Sbjct: 65 YKQVLDHFKXRPESYKTFQQRYLINFKYWGGANSSAPIFAYLGAEEPIDGSPELIGFLTD 124
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
A F AL+V+IEHRYYGKS+P+G +E A KNAST GY +S QA+ADYAS++I +KK L
Sbjct: 125 NAASFNALIVYIEHRYYGKSVPFGSREE-ALKNASTIGYFNSAQAIADYASVLIHIKKTL 183
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF 237
A SPV+V GGSYGGMLA+WFRLKYPH+AIGALASSAPIL FDNI + +++++DF
Sbjct: 184 HAQKSPVIVIGGSYGGMLASWFRLKYPHLAIGALASSAPILYFDNITPQDGYYSVVSRDF 243
Query: 238 RSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAM 296
R SE CY+ I SW +I+ A +P GL L + F C+ ++ ++ +L + +A
Sbjct: 244 REASETCYQTILKSWSEIDRVASQPKGLSILSQRFNTCRPLNESSELKDYLINMYASSAQ 303
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGD 356
++P P +PV +C ID G+D+ +K+Y G C +NG
Sbjct: 304 YNHP---------PRYPVTVICGGIDRASFGSDILSKIYAGLVA---LRGNTTC-KVNG- 349
Query: 357 SDPHGLSE----WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNW 412
P +SE W WQ C+EM++ G N +++F+ + A+ CK+ +GV PRP+W
Sbjct: 350 --PIIVSETTLGWRWQTCSEMVIPIGIGN-NTMFQPDPFSLKSYAKDCKKQFGVSPRPHW 406
Query: 413 ITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL-- 470
+TT +GGH I LVL++F SNIIF NGLRDP+S GGVLKNIS +++A+ G+H +D+
Sbjct: 407 VTTYYGGHSIELVLQKFGSNIIFSNGLRDPYSSGGVLKNISDSLVAIHTVNGSHCLDILG 466
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLA 504
+ DP+WL + +RE+ ++ WI+QY+ DL
Sbjct: 467 ANANHSDPEWLVEQWKREIMVMKGWIAQYYDDLV 500
>gi|255565523|ref|XP_002523752.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537056|gb|EEF38692.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
Length = 501
Score = 421 bits (1081), Expect = e-115, Method: Compositional matrix adjust.
Identities = 227/510 (44%), Positives = 315/510 (61%), Gaps = 34/510 (6%)
Query: 5 FIFLSFCLLFSSTLT-ISNAKIFPTFPSSRITPEKL--SSLISSSKDSQGLYKTKYHTQI 61
F+ F L SST ++ PT P PE++ S L+S +T ++ Q
Sbjct: 11 FVLGLFILSASSTAKPFDIPRLSPTGPRIVQDPEEIFISELVSDD------LETFFYNQT 64
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
LDHFNYNP+SY+TFQQRY+I+ +WGG+ +++PIFVY G E ++ GF+ D A +
Sbjct: 65 LDHFNYNPESYETFQQRYIISSKYWGGANSSSPIFVYFGAEAPLDGDLTVIGFLADNAAQ 124
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATD 181
F ALL++IEHRYYGKS+P+G E A KN S GY +S QA+ADYA +II +KKNL A +
Sbjct: 125 FNALLLYIEHRYYGKSVPFGSQGE-ALKNGSIRGYFNSAQAIADYAEIIIHVKKNLQAEN 183
Query: 182 SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS 241
SPV+V GGSYGGMLA+WFRLKYPH+A+GALASSAP+L FD+I + +I ++DFR S
Sbjct: 184 SPVIVIGGSYGGMLASWFRLKYPHLALGALASSAPVLYFDDITPQDGYYSIASRDFREAS 243
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAFVYTAMTDYP 300
ENCYK I+ SW +I+ A P GL+ L K F+ CK + ++ L + +Y+ Y
Sbjct: 244 ENCYKTIQKSWAEIDGVASMPKGLDVLSKKFKTCKPLTDSDELKDRLDS--MYSGAAQYN 301
Query: 301 TPSNFLNPLPAFPVKEMCKAIDD-PKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDP 359
P P +PV +C ID + ND K++ Y G C+ + P
Sbjct: 302 KP-------PTYPVNIICSGIDGAASSSNDTLDKIFAGVVA---YRGNRSCY----INPP 347
Query: 360 HGLSE----WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITT 415
LSE W WQ C+EM++ G N D++F S D + + C YGV PRP+W+TT
Sbjct: 348 TNLSETSVGWRWQTCSEMVIPIGRGN-DTMFPPSPFDLNGYVQDCNAIYGVRPRPHWVTT 406
Query: 416 EFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTK 475
+GGH I L+L+RF SNIIF NG+RDP+S GGVL++IS T+LA+ G+H +D+ + +
Sbjct: 407 YYGGHSIKLILQRFGSNIIFSNGIRDPYSSGGVLEDISDTILAVHTANGSHCLDILIANE 466
Query: 476 -EDPQWLKDVRRREVEIIGKWISQYFQDLA 504
DP+WL R+ E+ II WIS+Y+ DL+
Sbjct: 467 TTDPEWLVAQRKTEINIIKGWISKYYDDLS 496
>gi|255565527|ref|XP_002523754.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537058|gb|EEF38694.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
Length = 517
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 210/472 (44%), Positives = 309/472 (65%), Gaps = 22/472 (4%)
Query: 38 KLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGG-SKNNAPIF 96
K+++ +SSSKD T ++ Q LDHFN+ P+SY TF+QRY+IN +W G ++ + P+
Sbjct: 56 KITTELSSSKD----ILTFFYPQKLDHFNFKPESYATFKQRYVINFKYWSGPNRTSVPLL 111
Query: 97 VYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY 156
V+ G E +IE GF+ D+A FK LLV+IEHRYYGKSIP+G +KE+ ++NAS+ GY
Sbjct: 112 VFFGAEENIEDDVDGIGFLNDIASHFKGLLVYIEHRYYGKSIPFGSSKEV-FRNASSLGY 170
Query: 157 LSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP 216
SS QA+ADYA++I+ +KK +A SPV+V GGSYGGMLA+WFRLKYPHVA+GALASSAP
Sbjct: 171 FSSAQAIADYAAIIMHVKKKYSAKTSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAP 230
Query: 217 ILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
IL F++I + +I+T+DF+ SE+CY+ I+ SW +IE+ A K GL L K F+ C
Sbjct: 231 ILYFEDIAPRNGYYSIVTKDFKETSESCYQTIRKSWAEIEKVASKRNGLSILSKKFKTCN 290
Query: 277 S-EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLY 335
++ ++ +L + +Y+ Y P P +PV +C ID G DV +++
Sbjct: 291 PLKRTFELKDYLDS--IYSEAAQYNDP-------PRYPVTIVCGGIDGAPKGTDVLGRIF 341
Query: 336 GAASVYYNYSGTAKCFDLNGDSDPHGLSE--WGWQACTEMIMLTGGDNKDSIFEESEEDY 393
Y G C+D+NG + P + W WQ C+E++M G + ++++F S +
Sbjct: 342 AGVVAYM---GDRSCYDVNGYNHPTDATSLAWRWQTCSELVMPIGHE-RNTMFPTSPFNL 397
Query: 394 DARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS 453
++ + CK YGV P+P+W+T +GGH + L+L RFASNIIF NGL+DP+S GGVL+NIS
Sbjct: 398 NSYTQKCKALYGVLPQPHWVTNYYGGHDLKLILHRFASNIIFSNGLKDPYSSGGVLENIS 457
Query: 454 KTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+++A+ G+H +D++ + DP WL R+ E+EII WIS+Y DL +
Sbjct: 458 DSIVAISTVNGSHCLDIQQTQPTDPHWLVMQRKAEIEIIQGWISKYNIDLHE 509
>gi|449496302|ref|XP_004160098.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 514
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 215/454 (47%), Positives = 290/454 (63%), Gaps = 16/454 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+KT Y+ Q LDHFNY P+SY TF QRY+IN +WGG ++APIF Y G E I+
Sbjct: 73 FKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWGGPNSSAPIFAYLGAEAPIDDDLDFI 132
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM D A +F ALL++IEHRYYGKSIP+ E A NAST GY +S QA+ADYA+++I
Sbjct: 133 GFMTDNAIQFNALLIYIEHRYYGKSIPFRSRDE-ALGNASTLGYFNSAQAIADYAAILIH 191
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+KK A SPV+V GGSYGGMLA+WFRLKYPHVA+GALASSAPIL FD+I + ++
Sbjct: 192 VKKEFHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYSV 251
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN-LAIESWLSTAF 291
+T+DFR +SE CY+ IK SW +IE A +P GL L + F+ C+ + +E +L + +
Sbjct: 252 VTKDFRGLSETCYETIKKSWSEIETVAYQPNGLSILDQEFKTCRPLRGYFELEDYLWSMY 311
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
A ++P P +PV +C AID + N +K+ AA V + + G+ C+
Sbjct: 312 ASAAQYNHP---------PKYPVTRICDAIDGTYSVNGTLSKI--AAGV-FAFRGSVSCY 359
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
++ W WQ+C+EM+M G D D +F S D + YC YGV PRP+
Sbjct: 360 INEPRNETETDVGWRWQSCSEMVMPIGSD--DDMFPPSPFDLQSVINYCNRLYGVPPRPH 417
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
W TT +GGH I LVL+RF SNIIF NGL+DP+S GVL NIS ++LA+ G+H +D+
Sbjct: 418 WATTYYGGHDIRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLLAVYTTNGSHCLDIL 477
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+ + DP+WL R+ EV II WIS+Y+ DL +
Sbjct: 478 KAHETDPEWLVRQRKTEVGIIKGWISKYYADLKK 511
>gi|449456172|ref|XP_004145824.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 507
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 215/454 (47%), Positives = 290/454 (63%), Gaps = 16/454 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+KT Y+ Q LDHFNY P+SY TF QRY+IN +WGG ++APIF Y G E I+
Sbjct: 66 FKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWGGPNSSAPIFAYLGAEAPIDDDLDFI 125
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM D A +F ALL++IEHRYYGKSIP+ E A NAST GY +S QA+ADYA+++I
Sbjct: 126 GFMTDNAIQFNALLIYIEHRYYGKSIPFRSRDE-ALGNASTLGYFNSAQAIADYAAILIH 184
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+KK A SPV+V GGSYGGMLA+WFRLKYPHVA+GALASSAPIL FD+I + ++
Sbjct: 185 VKKEFHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYSV 244
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN-LAIESWLSTAF 291
+T+DFR +SE CY+ IK SW +IE A +P GL L + F+ C+ + +E +L + +
Sbjct: 245 VTKDFRGLSETCYETIKKSWSEIETVAYQPNGLSILDQEFKTCRPLRGYFELEDYLWSMY 304
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
A ++P P +PV +C AID + N +K+ AA V + + G+ C+
Sbjct: 305 ASAAQYNHP---------PKYPVTRICDAIDGTYSVNGTLSKI--AAGV-FAFRGSVSCY 352
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
++ W WQ+C+EM+M G D D +F S D + YC YGV PRP+
Sbjct: 353 INEPRNETETDVGWRWQSCSEMVMPIGSD--DDMFPPSPFDLQSVINYCNRLYGVPPRPH 410
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
W TT +GGH I LVL+RF SNIIF NGL+DP+S GVL NIS ++LA+ G+H +D+
Sbjct: 411 WATTYYGGHDIRLVLQRFGSNIIFSNGLKDPYSIAGVLHNISDSLLAVYTTNGSHCLDIL 470
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+ + DP+WL R+ EV II WIS+Y+ DL +
Sbjct: 471 KAHETDPEWLVRQRKTEVGIIKGWISEYYADLKK 504
>gi|224104755|ref|XP_002313554.1| predicted protein [Populus trichocarpa]
gi|222849962|gb|EEE87509.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 208/457 (45%), Positives = 294/457 (64%), Gaps = 25/457 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ +TQ LDHFN+ P+SY TFQQRY++N +WGG+ ++PIFVY G E D+ QN
Sbjct: 11 FEVHNYTQSLDHFNFKPESYATFQQRYILNYKYWGGANTSSPIFVYLGAEIDV---TQNL 67
Query: 113 GF-MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
+ D+A +FK LL+++EHRYYG S+P+G E A++N+ST GYLSS QALADYA +I+
Sbjct: 68 DLSIVDLAARFKGLLLYVEHRYYGVSMPFGSEDE-AFQNSSTFGYLSSEQALADYAQVIV 126
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
D+KK+L+A + P + GGSYGGMLA+WFRLKYPH+ IG+LASSAPIL FD+I +
Sbjct: 127 DVKKDLSAENCPAIAIGGSYGGMLASWFRLKYPHIVIGSLASSAPILYFDDITPQNGYHV 186
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNLAIESWLS 288
I+++DFR SE+CY I+ SW +I+ A + GL L F C S K+L + +
Sbjct: 187 IVSKDFRETSESCYNTIQQSWAEIDRVASETNGLLNLSNIFTTCSPLNSSKDLKV--YTE 244
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA--ASVYYNYSG 346
A+++ A D P P++PV ++C AID +G D+ ++ ASV+ N
Sbjct: 245 IAYMWAAQLDNP---------PSYPVNKICSAIDGAPSGTDILGRVAAGVNASVFGNSCH 295
Query: 347 TAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
+A L+ S S W WQ CTEM+ G +++F+ D + + C + +G+
Sbjct: 296 SASGSGLSRKS----ASAWEWQTCTEMVFPMGYGENETMFQSDPLDINNYTKECVDVFGI 351
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PRP+WITTEFGGH I VL FASNIIF NGLRDPWS GGVL++IS +V+AL ++G+H
Sbjct: 352 KPRPHWITTEFGGHDIKTVLGNFASNIIFSNGLRDPWSAGGVLEDISDSVVALYTEQGSH 411
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
+DL T DP WL +R +E +II W+++Y+ L
Sbjct: 412 CLDLYPPTSSDPDWLLALRDKENKIIAYWLAEYYAKL 448
>gi|224104759|ref|XP_002313555.1| predicted protein [Populus trichocarpa]
gi|222849963|gb|EEE87510.1| predicted protein [Populus trichocarpa]
Length = 499
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 217/458 (47%), Positives = 301/458 (65%), Gaps = 23/458 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++T ++ Q LDHFNY P+SY TF QRYLIN +WGG+ +API VY G E I+
Sbjct: 59 FETFFYNQTLDHFNYRPESYDTFLQRYLINSKYWGGANASAPILVYLGAEAPIDGDLDAV 118
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ D A +F +LLV++EHRYYGKSIP+G +E A KNAST GY +S QA+ADYA++II
Sbjct: 119 GFLVDTAVEFNSLLVYVEHRYYGKSIPFGSREE-ALKNASTLGYFNSAQAIADYAAIIIH 177
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+KK L A DSPV+V GGSYGGMLA+WFRLKYPH+A+GALASSAP+L FD+I Y + +
Sbjct: 178 IKKTLQAKDSPVIVIGGSYGGMLASWFRLKYPHIALGALASSAPVLYFDDITPQYGYYAL 237
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAF 291
+++DFR SE CY+ I+ SW++I+E A KP GL L K F+ C + + +++ L + +
Sbjct: 238 VSKDFRGASETCYQTIRESWEEIDEVASKPDGLSILSKKFKTCNPLTDASELKNHLDSMY 297
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
A + P P +PV ++C ID G+D+ +++G Y G C+
Sbjct: 298 ANAAQYNKP---------PTYPVNKVCGGIDGCGFGDDLLGRVFGGLVA---YKGNRSCY 345
Query: 352 DLNGDSDPHGLSE----WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD 407
++P SE W WQ C+EM+M G N DS+F D A CK Y V
Sbjct: 346 ----VNEPTNQSETSVGWRWQTCSEMVMPIGYGN-DSMFPPDPFDLKAYIEDCKSLYDVT 400
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHH 467
PR +W+TT +GGH I L+L+RFASNIIF NGLRDP+S GGVL+NIS TV+A+ G+H
Sbjct: 401 PRFHWVTTYYGGHSIRLILQRFASNIIFSNGLRDPYSSGGVLENISDTVVAVKTVNGSHC 460
Query: 468 VDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+D+ F+ + DP+WL R+ E++II +WI++Y+ DL++
Sbjct: 461 LDILFAKETDPEWLVAQRKTEIKIIKEWINKYYADLSR 498
>gi|356499942|ref|XP_003518794.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Glycine max]
Length = 494
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 226/505 (44%), Positives = 312/505 (61%), Gaps = 35/505 (6%)
Query: 10 FCLLFSSTL-TISNAKIFPTFPSSRITPEKLSSLISS--SKDSQGLYKTKYHTQILDHFN 66
+ LL S T T+ AK T P PE +SL + D++ + KT Y Q+LDHFN
Sbjct: 10 WLLLISLTHSTLLTAKHSLTIPRMSPIPEWETSLHDHPVATDAEEV-KTFYFKQVLDHFN 68
Query: 67 YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALL 126
Y P+SY TFQQRYL+N +WGG+ ++APIF Y G E I+ GF+ D A F ALL
Sbjct: 69 YRPESYTTFQQRYLVNFKYWGGANSSAPIFAYFGAESPIDNSPNGIGFLTDNAASFNALL 128
Query: 127 VFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVV 186
V+IEHRYYGKS+P+G +E A KNAST GY +S QALADYA+++ +KK L A +SPV+V
Sbjct: 129 VYIEHRYYGKSVPFGSREE-ALKNASTIGYFNSAQALADYAAILEHIKKTLHAQNSPVIV 187
Query: 187 FGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYK 246
GGSYGGMLA+WFRLKYPH+ +GALAS+APIL FD I + +++T+D+R SE CY+
Sbjct: 188 IGGSYGGMLASWFRLKYPHLTVGALASAAPILYFDKITPQNGYYSVVTRDYRDASETCYE 247
Query: 247 VIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPTPSNF 305
I SW +I A +P GL L F C + ++ + +L + +VY A + P
Sbjct: 248 TILKSWSEIHRVASQPNGLVTLSHRFNTCHTVNQSYELIDYLRSTYVYAAQYNQP----- 302
Query: 306 LNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE- 364
P +PV E+C ID G+D+ +K+Y + G C +NG P +SE
Sbjct: 303 ----PRYPVSEICGGIDGASLGSDILSKIYAGVVALW---GNNTC-KVNG---PTNVSET 351
Query: 365 ---WGWQACTEM-IMLTGGDNKDSIFEESEEDYDAR-ARYCKEAYGVDPRPNWITTEFGG 419
W WQ C+EM I ++ G+N ++FE+ R A CK+ YGV PRP+W+TT +GG
Sbjct: 352 SVGWRWQTCSEMAIPISIGNN--TMFEQPIPFNLTRYAEGCKKQYGVSPRPHWVTTYYGG 409
Query: 420 HKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQ 479
H I LVL+RF SNIIF NGLRDP+S GGVL S +++A+ G+H +DL + +
Sbjct: 410 HNIKLVLRRFGSNIIFSNGLRDPYSIGGVLDYTSDSIVAVNTVNGSHCLDLLRA-----R 464
Query: 480 WLKDVRRREVEIIGKWISQYFQDLA 504
WL + ++E+ II WI+QY+ D++
Sbjct: 465 WLVEQXKKEIRIIKSWITQYYDDVS 489
>gi|350588336|ref|XP_003129755.3| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Sus scrofa]
Length = 493
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 217/510 (42%), Positives = 308/510 (60%), Gaps = 29/510 (5%)
Query: 4 RFIFLSFCLLFSSTLTISNAKIFPTFPSSR-ITPEKLSSLISSSKDSQGLYKTKYHTQIL 62
R + L+F + ++T P P+ R + LS+ SS Y Y Q +
Sbjct: 3 RLLLLAFLMFGAAT---------PVPPALRAFSSLHLSNSFSSRPAVAMKYSIHYIQQKV 53
Query: 63 DHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKF 122
DHF +N +TF+QRYLI DTHW K+ I YTGNEGDI WF NTGFM+DVA +
Sbjct: 54 DHFGFNTD--KTFKQRYLIADTHW--RKDGGSILFYTGNEGDIIWFCNNTGFMWDVAEEL 109
Query: 123 KALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDS 182
KA+LVF EHRYYG+S+P+G + ++K++ +L+S QALAD+A LI LK+ + T++
Sbjct: 110 KAMLVFAEHRYYGESLPFGAH---SFKDSRHLNFLTSEQALADFAELIRHLKRKIPGTEN 166
Query: 183 -PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS 241
PV+ GGSYGGMLAAWFR+KYPH+ +GALASSAPI +F+N+V F +T+DFR
Sbjct: 167 QPVIALGGSYGGMLAAWFRMKYPHMVVGALASSAPIWHFENLVPCGVFMKTVTKDFRESG 226
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC----KSEKNLAIESWLSTAFVYTAMT 297
NC + I+ SW I A+K GL L +A +C S+ ++ W+S +V AM
Sbjct: 227 PNCSETIRRSWDAINRLARKGTGLHWLSEALHLCTPLTNSQDVQRLKDWISETWVNLAMV 286
Query: 298 DYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCFDLNGD 356
DYP S+FL PLPA+P+K +C+ + +P + + + ++ A +VYYNYSG A+C +++
Sbjct: 287 DYPYESDFLQPLPAWPIKVVCQYLKNPNVSDSLLVQNIFQALNVYYNYSGQARCLNISET 346
Query: 357 SDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITT 415
+ G W +QACTEM+M + D +FE + D + C +GV PRP WITT
Sbjct: 347 ATSSLGALGWSYQACTEMVMPFCSNGIDDMFEPNSWDLKEFSDDCFRQWGVRPRPYWITT 406
Query: 416 EFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTK 475
+GG I +NIIF NG DPWSGGGV KN++ T++A+V +GAHH+DLR +T
Sbjct: 407 VYGGKNIS-----SHTNIIFSNGELDPWSGGGVTKNVTDTLVAIVIPKGAHHLDLRANTA 461
Query: 476 EDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
DP + R EV + +WI ++ L +
Sbjct: 462 FDPTTVLLARSLEVRYMKQWIKDFYASLRR 491
>gi|326914542|ref|XP_003203584.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Meleagris
gallopavo]
Length = 483
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 205/456 (44%), Positives = 291/456 (63%), Gaps = 21/456 (4%)
Query: 51 GLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQ 110
G Y T+Y TQ +DHF ++ TFQQRYLI D HW K+N PI YTGNEGDI WF
Sbjct: 36 GPYVTRYLTQQIDHFGFDEN--LTFQQRYLIADQHW--QKDNGPILFYTGNEGDITWFCN 91
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
NTGFM+DVA + A+LVF EHRYYG+S+P+G ++ ++ YL+S QALAD+A LI
Sbjct: 92 NTGFMWDVAEELNAMLVFAEHRYYGESLPFGNE---SFSDSKHLNYLTSEQALADFAVLI 148
Query: 171 IDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
LK+ + A SPV+ GGSYGGMLAAWFR+KYPHV +GALA+SAPI F ++V ++
Sbjct: 149 EYLKETIAGARHSPVIAIGGSYGGMLAAWFRMKYPHVVVGALAASAPIWQFGDLVPCGAY 208
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN----LAIES 285
I+T DF+ C + I+ SW I + GL+ L + F +C KN L ++S
Sbjct: 209 FTIVTNDFKKSGTGCSESIRNSWNAINHLSSTDAGLQWLSRTFHLCSPLKNLQDALMLKS 268
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNY 344
WLS ++ AM DYP ++FL PLPA+P++E+CK + DP + + + ++ A ++YYNY
Sbjct: 269 WLSETWIDLAMVDYPYKADFLEPLPAWPIQEVCKFLKDPSLSDKLLLQNVFQAINIYYNY 328
Query: 345 SGTAKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKE 402
+G A CFD++ ++ L + GW Q CTEM+M D +FE + D++A + C +
Sbjct: 329 TGEASCFDMS-ETATKNLGQLGWYYQVCTEMVMPMCTDGVHDMFEPQKWDFEALSDECFK 387
Query: 403 AYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
+GV PRP+WI + +GG I SNIIF NG DPWS GGV +NI+ +++A+V
Sbjct: 388 LWGVRPRPSWILSMYGGKNIS-----SHSNIIFSNGGLDPWSAGGVTQNITDSLVAIVIP 442
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+GAHH+DLR DP+ +++ R EV + +WI++
Sbjct: 443 DGAHHLDLRSRNPCDPKSVQEARALEVHYMKQWIAK 478
>gi|147820345|emb|CAN63232.1| hypothetical protein VITISV_004830 [Vitis vinifera]
Length = 468
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 200/411 (48%), Positives = 271/411 (65%), Gaps = 16/411 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++T Y+ Q LDHFNY P+SY TFQQRY++N +WGG+ +APIF Y G E D+++
Sbjct: 60 FQTFYYNQTLDHFNYRPESYITFQQRYVVNFKYWGGANASAPIFAYLGXEADLDYDLSGI 119
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ D A +FKALLV+IEHRYYGKSIP G +E A+KNAST GY +S QA+ADYA ++I
Sbjct: 120 GFLTDNAHQFKALLVYIEHRYYGKSIPXGSREE-AFKNASTLGYFNSAQAIADYAEVLIY 178
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+KK L A +SPV+V G SYGGMLA+WFRLKYPHVA+GALASSAPIL FD+I + +I
Sbjct: 179 IKKKLLAENSPVIVVGASYGGMLASWFRLKYPHVALGALASSAPILYFDDITPHNGYFSI 238
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAF 291
+T+DFR SE+CYK I SW +I+ A +P GL L K FRIC N ++ +L +
Sbjct: 239 VTKDFREASESCYKTISESWSEIDRVASEPNGLSILSKKFRICGQLNNSDELKEFLERTY 298
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
A D P P++PV +C ID G+D+ ++++ + G C+
Sbjct: 299 SIAAQYDAP---------PSYPVTVVCGGIDGAPEGSDILSRIFAGVVA---FRGNMSCY 346
Query: 352 DLNGDSD--PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR 409
+ +D + WGWQ C+E+++ G D++F + + ++ C YGV PR
Sbjct: 347 YTSNTTDYPIETIQGWGWQTCSEIVIPIGRXVNDTMFPLAPFNLTTFSQDCTSLYGVAPR 406
Query: 410 PNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
P+WITT +GGH I L+L+RFASNIIF NGLRDP+S GGVLKNIS +VLA++
Sbjct: 407 PHWITTYYGGHDIELILRRFASNIIFSNGLRDPYSIGGVLKNISNSVLAIL 457
>gi|225709542|gb|ACO10617.1| Lysosomal Pro-X carboxypeptidase precursor [Caligus rogercresseyi]
Length = 492
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 208/449 (46%), Positives = 288/449 (64%), Gaps = 18/449 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
YKT+++ ++DHF + S TF+QRYLINDTHW S++ PIF+Y GNEGD E FAQNT
Sbjct: 41 YKTEWYENLVDHFGFAINS--TFKQRYLINDTHWD-SQHGGPIFLYAGNEGDSEAFAQNT 97
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+D+AP+F AL++FIEHRYYGKS+P+G K+ + GYL+S QALADYA + +
Sbjct: 98 GFMWDIAPEFNALIIFIEHRYYGKSLPFG--KDSLKPDPKMNGYLTSEQALADYARFVTE 155
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
K A DSPV+VFGGSYGGMLAAW R+KYPH+ GA+A SAP+ FD +F
Sbjct: 156 FKSTRKGAKDSPVIVFGGSYGGMLAAWMRIKYPHIVNGAIAGSAPVAQFDTPC--LNFGR 213
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTA 290
I+T D+ S++C VI SW I++ K GL++LQ ++C K++ ++S+L+
Sbjct: 214 IVTSDYSFYSKSCSGVISMSWAAIDQVGKDDKGLQRLQSLLKLCSKPKSVEPLKSFLTDV 273
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL---YGAASVYYNYSGT 347
+ AM +YP P+ FL PLP PVK +C+ + D A L Y +VY NYSG
Sbjct: 274 WTNVAMMNYPYPTEFLMPLPGNPVKYICRKMSPTTVPTDPVAILKYVYEGLNVYANYSGK 333
Query: 348 AKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD 407
AKC D+ G++D G W +Q+CTEM+M +N D +FE+S+ ++ A+ C+E + V
Sbjct: 334 AKCIDM-GNADQIGADMWDYQSCTEMVMPFCYNNVDDMFEKSDWNFTTYAQGCQERWKVT 392
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHH 467
PRP +G K+ + ASNIIF NGL DPWS GG++K+IS +V++++ EGAHH
Sbjct: 393 PRPKMADIMYGSKKL-----KAASNIIFSNGLLDPWSSGGIMKSISDSVVSIIIPEGAHH 447
Query: 468 VDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+DLR S DP + R+ E I KWI
Sbjct: 448 LDLRGSNPNDPVSVIHARKLERSFIRKWI 476
>gi|363729398|ref|XP_425654.3| PREDICTED: lysosomal Pro-X carboxypeptidase [Gallus gallus]
Length = 482
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 203/453 (44%), Positives = 287/453 (63%), Gaps = 20/453 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y T+Y TQ +DHF ++ TFQQRYLI D HW K+N PI YTGNEGDI WF NT
Sbjct: 38 YLTRYLTQQIDHFGFDEN--LTFQQRYLIADQHW--KKDNGPILFYTGNEGDITWFCNNT 93
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + A+LVF EHRYYG+S+P+G ++ ++ YL+S QALAD+A LI
Sbjct: 94 GFMWDVAEELNAMLVFAEHRYYGESLPFGNE---SFSDSKHLNYLTSEQALADFAVLIEY 150
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK+ + A SPV+ GGSYGGMLAAWFR+KYPHV +GALA+SAPI F ++V ++
Sbjct: 151 LKETIAGARHSPVIAIGGSYGGMLAAWFRMKYPHVVVGALAASAPIWQFGDLVPCGAYFT 210
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL----AIESWL 287
I+T DF+ C + I+ SW I + GL+ L + F +C KNL +++WL
Sbjct: 211 IVTNDFKKSGTGCSESIRNSWNAINHLSSTDVGLQWLSRTFHLCSPLKNLQDAAMLKTWL 270
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGT 347
S ++ AM DYP ++FL PLPA+P++E+CK + P + V A+ + A ++YYNY+G
Sbjct: 271 SETWIDLAMVDYPYKADFLQPLPAWPIREVCKFLRTPVSLTIVAAEYFQAINIYYNYTGE 330
Query: 348 AKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
A CFD++ ++ L + GW Q CTEM+M D +FE + D++A + C +G
Sbjct: 331 ASCFDMS-ETATKNLGQLGWYYQVCTEMVMPMCTDGVHDMFEPQKWDFEALSDECFRLWG 389
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
V PRP+WI + +GG I SNIIF NG DPWS GGV +N++ +++A+V +GA
Sbjct: 390 VRPRPSWILSMYGGKNIS-----SHSNIIFSNGGLDPWSAGGVTQNVTDSLVAIVIPDGA 444
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
HH+DLR DP+ ++ R EV + +WI++
Sbjct: 445 HHLDLRSRNPCDPKSVQQARALEVHYMKQWIAE 477
>gi|225708852|gb|ACO10272.1| Lysosomal Pro-X carboxypeptidase precursor [Caligus rogercresseyi]
Length = 492
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 208/449 (46%), Positives = 288/449 (64%), Gaps = 18/449 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
YKT+++ ++DHF + S TF+QRYLINDTHW S++ PIF+Y GNEGDIE FAQNT
Sbjct: 41 YKTEWYENLVDHFGFAINS--TFKQRYLINDTHWD-SQHGGPIFLYAGNEGDIEAFAQNT 97
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+D+AP+F AL++FIEHRYYGKS+P+G K+ + GYL+S QALADYA + +
Sbjct: 98 GFMWDIAPEFNALIIFIEHRYYGKSLPFG--KDSLKPDPKMNGYLTSEQALADYARFVTE 155
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
K A DSPV+VFGGSYGGMLAAW R+KYPH+ GA+A SAP+ FD +F
Sbjct: 156 FKSTRKGAKDSPVIVFGGSYGGMLAAWMRIKYPHIVNGAIAGSAPVAQFDTPC--LNFGR 213
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTA 290
I+T D+ S++C VI SW I++ K GL++LQ ++C K++ ++S+L+
Sbjct: 214 IVTSDYSFYSKSCSGVISKSWAAIDQVGKNDKGLQRLQSLLKLCSKPKSVEPLKSFLTDV 273
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL---YGAASVYYNYSGT 347
+ AM +YP P+ FL PLP PVK +C + D A L Y +VY NYSG
Sbjct: 274 WTSVAMMNYPYPTEFLMPLPGNPVKYICGKMSPTTVPTDPVAILKYVYEGLNVYANYSGK 333
Query: 348 AKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD 407
AKC D++ ++D G W +Q+CTEM+M +N D +FE+S+ ++ A+ C+E + V
Sbjct: 334 AKCIDMD-NADQIGADMWDYQSCTEMVMPFCYNNVDDMFEKSDWNFTTYAQGCQERWKVT 392
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHH 467
PRP +G K+ + ASNIIF NGL DPWS GG++K+IS +V++++ EGAHH
Sbjct: 393 PRPKMADIMYGSKKL-----KAASNIIFSNGLLDPWSSGGIMKSISDSVVSIIIPEGAHH 447
Query: 468 VDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+DLR S DP + R+ E I KWI
Sbjct: 448 LDLRGSNPNDPVSVIHARKLERSFIRKWI 476
>gi|195997817|ref|XP_002108777.1| hypothetical protein TRIADDRAFT_19181 [Trichoplax adhaerens]
gi|190589553|gb|EDV29575.1| hypothetical protein TRIADDRAFT_19181 [Trichoplax adhaerens]
Length = 463
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 206/462 (44%), Positives = 291/462 (62%), Gaps = 20/462 (4%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
S S S Y+TKY Q LDHF+ P + FQQRYLI+ +W K +PIF YTGNEG
Sbjct: 11 SDSFQSNVKYQTKYFKQRLDHFS--PADDRKFQQRYLISQKYW---KKGSPIFFYTGNEG 65
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
DI WFA+NTGFM+D+AP+F A+L+F+EHRYYGK++P+G + ++K+ YLSS QAL
Sbjct: 66 DITWFAKNTGFMWDIAPEFNAMLIFVEHRYYGKTLPFGKD---SFKDKEHLAYLSSEQAL 122
Query: 164 ADYASLIIDLKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDN 222
AD+A I+D K T +S V+ FGGSYGGML AW R+KYP++ GA+A+SAPI +
Sbjct: 123 ADFAQFIVDFKAETHGTQNSSVIAFGGSYGGMLTAWLRIKYPNIVDGAIAASAPIWQLEG 182
Query: 223 IVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL- 281
+ FS+I+T F+ C K I+ SWK I + G L ++C K+
Sbjct: 183 LTPCDRFSSIVTNTFKLAYPECPKNIRNSWKVIRKLGSTKSGRHTLSTTLKLCNPLKSPS 242
Query: 282 ---AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGA 337
A+ +WLS+ +V A DYP P+NFL PLPA PVK +C ++ P + + + +
Sbjct: 243 DVDALVNWLSSIWVNLAEVDYPYPANFLEPLPAKPVKAVCASLQKPLANDTLLIRGIAKG 302
Query: 338 ASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR 396
+VY+N++G +KCF+ + D+ H G++ W +Q+CTEM++ D +FE + + A
Sbjct: 303 LNVYFNHTGNSKCFNTDQDATSHLGIAGWNFQSCTEMVLPVCMDGVHDMFEPTPFNLTAM 362
Query: 397 ARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTV 456
A CKE YGV RP W T +GG +I + ASNI+F NG DPWSGGGVLK++SK++
Sbjct: 363 AAICKEQYGVRTRPFWAQTLYGGKRI-----KAASNIVFSNGNLDPWSGGGVLKSLSKSL 417
Query: 457 LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+A+V + GAHH+DLR + K+DP +K R+ E I KWI +
Sbjct: 418 VAIVIEGGAHHLDLRHANKDDPATVKHARKIEKMHIAKWIKE 459
>gi|255565521|ref|XP_002523751.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537055|gb|EEF38691.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
Length = 489
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 198/453 (43%), Positives = 292/453 (64%), Gaps = 21/453 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+ Y+TQ LDHFNY P+SY TFQQRY++N +WGG+ ++PIF+YTG E ++ + +
Sbjct: 53 YEIHYYTQTLDHFNYKPESYATFQQRYILNFKYWGGANTSSPIFLYTGAEENLIYHVDRS 112
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
+ ++A +F+ LL++IEHRYYG+S+P+G ++E A +N+ST GYLSS QALADYA +I D
Sbjct: 113 --IVELAARFRGLLLYIEHRYYGESMPFG-SEEQALQNSSTLGYLSSEQALADYAQVITD 169
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+KKNL+A + P + G SYGGMLAAWFRLKYPH+ IG+LASS+PIL FD+I +P + ++
Sbjct: 170 VKKNLSAENCPAIAVGASYGGMLAAWFRLKYPHIVIGSLASSSPILYFDDI-TPQNGYHV 228
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAF 291
+++ R + E+C+ IK SW +I+ A +P GL L F C+ + A + +L +
Sbjct: 229 LSR--RILDESCHNTIKQSWSEIDRVAAQPNGLLTLSNMFNTCRPLVSSAEFKEYLELLY 286
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLY-GAASVYYNYSGTAKC 350
+ A D P P PV+ C+ ID G D+ ++ G +S
Sbjct: 287 ITAAQYDNP---------PDNPVQSTCRGIDGAPPGTDILGRIVEGLNGRIPGWSSCHDI 337
Query: 351 FDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
F L ++ W WQ CTEM+ G + +++F+ S D + + C + +G+ PRP
Sbjct: 338 FTLELSNN----GSWDWQTCTEMVFPIGYGDNETMFQPSPFDINNYKKECLQVFGIKPRP 393
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
+W+TTEFGGH I VL FASNIIF NGLRDPWS GGVL++IS +++A+ + GAH +DL
Sbjct: 394 HWVTTEFGGHDIKTVLGNFASNIIFANGLRDPWSAGGVLEDISDSIVAVYTEHGAHCLDL 453
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
ST +DP WL + R +E++II W+++Y+ L
Sbjct: 454 YPSTPDDPNWLVEQREKEIKIIAAWLAEYYAKL 486
>gi|357451981|ref|XP_003596267.1| Lysosomal Pro-X carboxypeptidase [Medicago truncatula]
gi|355485315|gb|AES66518.1| Lysosomal Pro-X carboxypeptidase [Medicago truncatula]
Length = 479
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 223/508 (43%), Positives = 306/508 (60%), Gaps = 45/508 (8%)
Query: 7 FLSFCLLF----SSTLTISNAKIFPT---FPSSRITPEKLSSLISSSKDSQGLYKTKYHT 59
F+SF LLF ++T +N+ P F S+ T + + + ++D Q + +
Sbjct: 4 FISFYLLFIFFLCFSVTTTNSLTLPRLSPFSESKTTEYQNTKTFNLNEDMQPYF----YE 59
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHFNY SY+TF+QRY+IN +WGG+ ++APIF Y G E DI GFM D A
Sbjct: 60 QTLDHFNYLSDSYKTFKQRYIINFNYWGGANSSAPIFAYLGGEDDI---VNTLGFMTDNA 116
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
FKALLV+IEHRYYGKS+P ++ GYL+S QALADYA +++ LK +L A
Sbjct: 117 TSFKALLVYIEHRYYGKSVP---------SFNASYGYLNSAQALADYAEVLLYLKDSLHA 167
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRS 239
SPV+V GGSY GMLAAWFRLKYPH+AIGALASSAP+L FDNI ++++IT+DF+
Sbjct: 168 QKSPVIVVGGSYAGMLAAWFRLKYPHIAIGALASSAPLLYFDNITPASGYNDVITRDFQE 227
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTD 298
S+ CY+ I+ SW +I+E A KP GL L K F C E+ ++ +L + A +
Sbjct: 228 TSKTCYETIRNSWFEIDEVASKPNGLNFLSKKFNTCYPLEQPGELKDYLGNMYQKAAQYN 287
Query: 299 YPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGD-- 356
P P +C+AID G+D+ +++YG Y G KC ++N D
Sbjct: 288 DP------------PTTTICEAIDRASYGDDILSRIYGGMVASY---GNKKC-NVNPDKY 331
Query: 357 SDPHGLSEWGWQACTEMIMLTG-GDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITT 415
+ W WQ CTE++M G GD+ S+F+ ++ + A CK+ +GV PRP+WIT+
Sbjct: 332 TGAKPFDRWRWQTCTEIVMPIGIGDS--SLFQPKPFNFTSFAENCKKDFGVQPRPHWITS 389
Query: 416 EFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTK 475
+GG I LVLKRF SNIIF NGLRDP+S GGVL NIS +++AL G H D+ + +
Sbjct: 390 YYGGQDIQLVLKRFGSNIIFSNGLRDPYSSGGVLNNISDSLVALPTVNGYHCQDIVPAIE 449
Query: 476 EDPQWLKDVRRREVEIIGKWISQYFQDL 503
DP WL R EVEII WI +Y+ ++
Sbjct: 450 SDPAWLVHQRNTEVEIIQSWIKKYYDEV 477
>gi|224043680|ref|XP_002188561.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Taeniopygia guttata]
Length = 479
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 202/456 (44%), Positives = 288/456 (63%), Gaps = 21/456 (4%)
Query: 51 GLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQ 110
G Y T+Y +Q +DHF ++ +TFQQRYL+ D HW K+N PI YTGNEGDIEWF
Sbjct: 32 GPYLTRYLSQQIDHFGFDEN--RTFQQRYLLADQHW--KKDNGPILFYTGNEGDIEWFCN 87
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
NTGFM+DVA + A+LVF EHRYYG+S+P+G ++ ++ YL+S QALAD+A L+
Sbjct: 88 NTGFMWDVAEELNAMLVFAEHRYYGESLPFGNE---SFSDSKHLNYLTSEQALADFAVLV 144
Query: 171 IDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
LK + A SPV+ GGSYGGMLAAWFR+KYPHV +GALA+SAPI F ++V +F
Sbjct: 145 EYLKTTIAGAQHSPVIAIGGSYGGMLAAWFRMKYPHVVVGALAASAPIWQFADLVPCGTF 204
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL----AIES 285
+I+T DF+ + C + I+ SW I + GL+ L F +C K L +++
Sbjct: 205 FSIVTNDFKKSGKGCSESIRNSWNAINHLSSTDAGLQWLSNTFHLCSPLKTLQDAAVLKN 264
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNY 344
WLS +V AM +YP ++FL PLPA+P++E+CK + DP + + + ++ A ++YYNY
Sbjct: 265 WLSETWVNLAMVNYPYKADFLQPLPAWPIQEVCKFLKDPSLSDKLLLQNVFQAVNLYYNY 324
Query: 345 SGTAKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKE 402
+G A C D++ + L E GW QACTE++M D +FE + D+DA + C
Sbjct: 325 TGEASCLDVS-QTATKSLGEMGWYYQACTEIVMPLCTDGVHDMFEPQKWDFDALSEECYR 383
Query: 403 AYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
+GV PR +WI + +GG I SNIIF NG DPWS GGV +NIS +++A+V
Sbjct: 384 MWGVRPRLSWILSMYGGKNIS-----SHSNIIFSNGGLDPWSAGGVTQNISNSLVAVVIP 438
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+GAHH+DLR DP+ ++ R E+ ++ +WI +
Sbjct: 439 DGAHHLDLRSHNPLDPKSVQQARAMEICLMKEWIEK 474
>gi|431838478|gb|ELK00410.1| Lysosomal Pro-X carboxypeptidase [Pteropus alecto]
Length = 497
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 211/488 (43%), Positives = 292/488 (59%), Gaps = 20/488 (4%)
Query: 26 FPTFPSSR-ITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDT 84
P P+ R + LS+ +S Y Y Q +DHF +N +TF QRYLI D
Sbjct: 20 IPVPPAVRALNNLHLSTSFTSHPAIVMEYSIHYIQQKVDHFGFNTD--KTFNQRYLIADK 77
Query: 85 HWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNK 144
HW K+ I YTGNEGDI WF NTGFM+DVA + KA+LVF EHRYYGKS+P+G +
Sbjct: 78 HW--KKDGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGKSLPFGSS- 134
Query: 145 EIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKY 203
++K++ +L+S QALAD+A LI L+K + A + PV+ GGSYGGMLAAWFR+KY
Sbjct: 135 --SFKDSRHLNFLTSEQALADFAELITHLRKTIPGAKNQPVIALGGSYGGMLAAWFRMKY 192
Query: 204 PHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPG 263
PH+ GALA+SAPI FDNIV F I+T D+R NC + I+ SW I A+
Sbjct: 193 PHMVAGALAASAPIWQFDNIVPCGVFMEIVTTDYRKSGPNCAECIRRSWDAINRIAENGT 252
Query: 264 GLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK 319
GL L + +C K+ ++ W+S ++ A+ DYP SNFL PLPA+PVK +C+
Sbjct: 253 GLPWLSETLHLCTPLKHTQDIQNLKDWISETWINLAVVDYPYESNFLQPLPAWPVKVVCQ 312
Query: 320 AIDDPKTGN-DVFAKLYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLT 377
+ DP + + ++ A ++YYNY+G A+C +++ S G W +QACTEM+M
Sbjct: 313 YLKDPSVSDPQLLQNIFQALNIYYNYTGQARCLNMSETSTSSLGSLGWSYQACTEMVMPF 372
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFN 437
+ D +FE D + + YC +GV PRP+WITT +GG I +NIIF N
Sbjct: 373 CTNGIDDMFEPHSWDLEEFSDYCFRQWGVRPRPSWITTLYGGKNISS-----HTNIIFSN 427
Query: 438 GLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G DPWSGGGV K+I+ T++A+ EGAHH+DLR + DP + R EV + +WI
Sbjct: 428 GDLDPWSGGGVTKDITDTLVAITIPEGAHHLDLRANNAFDPTTVVLARSMEVRYMKQWIK 487
Query: 498 QYFQDLAQ 505
++ +L +
Sbjct: 488 DFYANLRK 495
>gi|348565561|ref|XP_003468571.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cavia porcellus]
Length = 494
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 212/466 (45%), Positives = 287/466 (61%), Gaps = 20/466 (4%)
Query: 42 LISSSKDSQGL--YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYT 99
LISS+ S + Y Y Q +DHF +N QTF+QRYLI+ HW K+ I YT
Sbjct: 33 LISSTSQSAIVQNYSVLYFQQKVDHFGFN--YLQTFKQRYLISAKHW--EKDGGSILFYT 88
Query: 100 GNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSS 159
GNEGDI WF NTGFM+DVA + KA+LVF EHRYYG+S+P+G + ++K++ +L+S
Sbjct: 89 GNEGDIVWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGAD---SFKDSKRLNFLTS 145
Query: 160 TQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL 218
QALAD+A LI LK + A D PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI
Sbjct: 146 EQALADFAELIRHLKTTIPGAGDQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIW 205
Query: 219 NFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS- 277
F+N+V +F I+T DFR NC K I+ SW I A GL+ L K R+C
Sbjct: 206 QFENLVPCDTFMKIVTTDFRKSDPNCPKSIRRSWDAINRLATTRSGLQWLTKTLRLCNPL 265
Query: 278 --EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKL 334
+ + W+ +V AMTDYP SNF+ PLPA+P+K +C+ + DP + + +
Sbjct: 266 TLDDVQHFKQWILETWVNLAMTDYPYESNFMQPLPAWPIKVVCQYLTDPNVSDSQLLQNI 325
Query: 335 YGAASVYYNYSGTAKCFDLNGD-SDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDY 393
+ A +VYYNYSG A+C +++ S G+S W +Q+CTEM+M + + +FE +
Sbjct: 326 FQALNVYYNYSGQAECLNISETASSSLGISGWSYQSCTEMVMPFCANGVEDMFEPFSWNL 385
Query: 394 DARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS 453
+ C +GV P P+WITT +GG I SNIIF NG DPWSGGGV K+IS
Sbjct: 386 KQFSDECFNQWGVRPNPSWITTLYGGKNISS-----HSNIIFSNGDLDPWSGGGVTKDIS 440
Query: 454 KTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
T++A+ EGAHH+DLR + DP + R EVE + KWI+ +
Sbjct: 441 DTLVAINIAEGAHHLDLRAYSAYDPASVLLARSLEVEYMKKWITDF 486
>gi|327269259|ref|XP_003219412.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Anolis
carolinensis]
Length = 500
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 210/456 (46%), Positives = 286/456 (62%), Gaps = 19/456 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y T + Q +DHF + +TF+QRYLI + HW ++ I YTGNEGDI WFA NT
Sbjct: 50 YHTCFIGQKIDHFGFYEN--RTFKQRYLIAEQHW--KRDVGSILFYTGNEGDITWFANNT 105
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM++VA + A+LVF EHRYYG S+P+G NK ++ +A YLSS QALAD+A L+
Sbjct: 106 GFMWNVAEELDAILVFAEHRYYGVSLPFG-NK--SFSDAKHLNYLSSAQALADFAVLVQH 162
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK + A D+PV+ GGSYGGMLAAWFR+KYPH+ IGALA+SAPI FD++V +F +
Sbjct: 163 LKATIPGAQDTPVIAIGGSYGGMLAAWFRMKYPHIVIGALAASAPIWQFDSLVPCGTFYS 222
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNLA-IESWL 287
I+TQDF+ C + I+ SW I A GL L FR+C K+E + A + WL
Sbjct: 223 IVTQDFKKSGNCCSESIRNSWAAINRLASTEEGLRWLSSTFRLCTPLKTETDAATFKGWL 282
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSG 346
+V AM DYP ++FL PLPA+P++ +CK + +PK + + + ++ A +VYYNY+G
Sbjct: 283 GETWVNLAMVDYPYKADFLQPLPAWPIQVVCKYLKNPKLPDKLLLQNIFQAVNVYYNYTG 342
Query: 347 TAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
A C +L + G+ W +QACTEM+M D +FE + D+ + C + +G
Sbjct: 343 HASCLNLTQTATKSLGIQGWYYQACTEMVMPMCSDGVSDMFEPQKWDFHTYSEECFKNWG 402
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
V PRP+WI T +GG I SNIIF NG DPWSGGGV KNI+ T++A+V EGA
Sbjct: 403 VRPRPSWIPTFYGGKNISA-----HSNIIFSNGGLDPWSGGGVTKNITNTLVAVVIPEGA 457
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
HH+DLR +T DP + R EV + +W+ QY Q
Sbjct: 458 HHLDLRSNTPFDPDSVLQARLLEVHYMRQWLRQYHQ 493
>gi|354489833|ref|XP_003507065.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cricetulus
griseus]
gi|344240433|gb|EGV96536.1| Lysosomal Pro-X carboxypeptidase [Cricetulus griseus]
Length = 492
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 206/456 (45%), Positives = 285/456 (62%), Gaps = 18/456 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q +DHF + + +TF+QRYLI D HW N I YTGNEGDI WF NT
Sbjct: 46 YSVLYFQQKVDHFGF--YNSRTFKQRYLIADKHW--KANGGTILFYTGNEGDIVWFCNNT 101
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + KA+LVF EHRYYG+S+P+G E ++K+A +L+S QALAD+A LI
Sbjct: 102 GFMWDVAKELKAMLVFAEHRYYGESLPFG---EESFKDAQHLDFLTSEQALADFAELIKH 158
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+ +GALASSAPI FD++V F
Sbjct: 159 LKRTVPGAKNQPVIAIGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFDDLVPCGVFMK 218
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS---EKNLAIESWLS 288
I+T+DFR S C + I+ SWK I + L+ L K F +C E ++ WLS
Sbjct: 219 IVTEDFRKSSPQCSESIRRSWKAINRHSTTGSDLQWLTKTFHLCSPLNFEDITTLKEWLS 278
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGT 347
+V AM DYP ++FL PLPA+PV +C+ + +P + V + ++ A ++YYNYSG
Sbjct: 279 ETWVNLAMVDYPYEASFLQPLPAWPVTVVCQYLKNPNVSDTVLLQNIFQALNIYYNYSGQ 338
Query: 348 AKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
A+C +++ + G + WG+Q+CTE++M + D +FE D+D + C +GV
Sbjct: 339 AQCLNISQTATSSLGATGWGYQSCTELVMPICSNGVDDMFESYIWDFDKFSNDCFNQWGV 398
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PR NWI T +GG I SNIIF NG DPWSGGGV K+I+ T++A+ GAH
Sbjct: 399 KPRLNWIITLYGGKNISS-----HSNIIFSNGDLDPWSGGGVTKDITDTLVAINIPNGAH 453
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
H+DLR ST DP + R EV+ + KWI+ ++++
Sbjct: 454 HLDLRASTAFDPSTVLLSRSLEVKHMKKWIADFYKN 489
>gi|395521107|ref|XP_003764661.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Sarcophilus harrisii]
Length = 450
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 203/445 (45%), Positives = 283/445 (63%), Gaps = 19/445 (4%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF ++ TF+QRYLI D +W NN I YTGNEGDI WF +NTGFM+DVA +
Sbjct: 14 VDHFGFDVN--LTFKQRYLIADQYW--KNNNGVILFYTGNEGDITWFCKNTGFMWDVAEE 69
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-AT 180
KA+LVF EHRYYG+S+P+G ++ ++ YL++ QALAD+A LI LKK + A
Sbjct: 70 LKAMLVFAEHRYYGESLPFGNQ---SFSDSKHLNYLTAEQALADFAVLIEYLKKTIPGAK 126
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
+ PV+ GGSYGGMLAAWFR+KYPH+ IGALA+SAPI F ++VS F I+T DF+
Sbjct: 127 NRPVIAIGGSYGGMLAAWFRMKYPHIVIGALAASAPIWQFTDLVSCGKFFEIVTNDFKKS 186
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKN-LAIESWLSTAFVYTAM 296
C + I+GSWK I+ A GL+ + +AF +C +SE + + ++SWL+ +V AM
Sbjct: 187 GSKCSETIQGSWKAIDRLASTGEGLQWISEAFHLCNPLRSEMDVITLKSWLAETWVNLAM 246
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCFDLNG 355
DYP SNFL PLPA+P+KE+CK + D + V + ++ A ++YYNYSG A C + +
Sbjct: 247 VDYPYSSNFLMPLPAWPIKEVCKYLTDSHASDKVLVQNIFKAVNIYYNYSGEASCLNTSQ 306
Query: 356 DSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWIT 414
+ G+ W +QACTEM+M +FE D+ A + C + +GV PRP WI
Sbjct: 307 TATSSLGIQGWNYQACTEMVMPICATGISDMFEPQAWDFAAYSDQCFQEWGVRPRPLWIP 366
Query: 415 TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFST 474
T FGG I SNIIF NG DPW GGV +NI+ T++++V +GAHH+DLR
Sbjct: 367 TVFGGKNISS-----HSNIIFSNGALDPWYAGGVNENITDTLISIVIPDGAHHLDLRARN 421
Query: 475 KEDPQWLKDVRRREVEIIGKWISQY 499
+DP+ + R EV+ + KWI+++
Sbjct: 422 ADDPESVLLARIMEVDYMKKWITKF 446
>gi|297734880|emb|CBI17114.3| unnamed protein product [Vitis vinifera]
Length = 428
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 194/396 (48%), Positives = 263/396 (66%), Gaps = 16/396 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ T ++ Q LDHFNY P+SY+TFQQRY++N +WGG+ +++PIFVYTG+E I A
Sbjct: 48 FVTYFYNQTLDHFNYKPESYRTFQQRYIMNSEYWGGANSSSPIFVYTGDEASITAVAAFA 107
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A +F LL++IEHRYYG S+P+G E A+ N ST GY +STQALADYA LI +
Sbjct: 108 GFIVELASRFNGLLLYIEHRYYGDSVPFGSKDE-AFSNTSTLGYFTSTQALADYAELITN 166
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LKKNL+A + PV+ GGSYGGMLA+WFRLKYPH+ IGALASSAPIL FD+I ++ I
Sbjct: 167 LKKNLSAENCPVIAIGGSYGGMLASWFRLKYPHIVIGALASSAPILYFDDITPGNAYHVI 226
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAF 291
+T+DFR SE+CY I+ SW +I++ A +P GL L + F C+ + ++ +L+ +
Sbjct: 227 VTKDFRETSESCYSTIRDSWSEIDKVAAEPNGLANLSQIFMTCEPLNSSQELKYYLALCY 286
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
V +A D P PA+PVK++C AID G D+ ++ AA + N S C
Sbjct: 287 VVSAQNDNP---------PAYPVKKVCDAIDGAPEGTDIIGRV--AAGL--NASVGPPCH 333
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
+ D P SEW WQ CTEM+M G D++F+ D + + C++ +GV PRP+
Sbjct: 334 FVY-DFKPSNRSEWTWQTCTEMVMPIGHGANDTMFQAWPFDLNNHTKTCQDLFGVTPRPH 392
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
WITTEFGGH I V+ FASNIIF NGLRDP+S GG
Sbjct: 393 WITTEFGGHDIKSVVGNFASNIIFSNGLRDPYSAGG 428
>gi|410972567|ref|XP_003992730.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Felis catus]
Length = 497
Score = 394 bits (1012), Expect = e-107, Method: Compositional matrix adjust.
Identities = 200/455 (43%), Positives = 282/455 (61%), Gaps = 19/455 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q +DHF + +TF+QRYL+ D HW K++ I YTGNEGDI WF NT
Sbjct: 48 YTVHYLQQKVDHFGFTTD--KTFKQRYLLADEHW--KKDDGSILFYTGNEGDIVWFCNNT 103
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + KA+LVF EHRYYG+S+P+G + ++K++ YL+S QALAD+A LI
Sbjct: 104 GFMWDVAEELKAMLVFAEHRYYGESLPFGND---SFKDSRYLNYLTSEQALADFAVLIKY 160
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI +F N+V F
Sbjct: 161 LKRTIPGAKNQPVIALGGSYGGMLAAWFRMKYPHMVVGALAASAPIWHFGNLVPCGVFMK 220
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN----LAIESWL 287
I+T+DFR NC + I SW I A+ GL L +A +C +N ++ W+
Sbjct: 221 IVTEDFRKSGPNCSETIHNSWGAITRLARTGSGLHWLSEALHLCTPLRNSQDVQHLKGWI 280
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSG 346
S ++ AM DYP SNFL PLPA+P+K +CK +++P + + ++ A ++YYNYSG
Sbjct: 281 SETWINLAMVDYPYESNFLQPLPAWPIKVVCKYLNNPSLSDAQLLENIFQALNIYYNYSG 340
Query: 347 TAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
A+C +++ + + G W +QACTEM+M + D +FE ++ + C + +G
Sbjct: 341 QARCLNISETTTSNLGTQGWSYQACTEMVMPFCTNGIDDMFEPRSWNFREYSDDCFKQWG 400
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
V PRP WITT +GG I SNI+F NG DPWSGGGV K+I+ T++A+ EGA
Sbjct: 401 VRPRPTWITTVYGGRNIS-----SHSNIVFSNGELDPWSGGGVTKDITDTLVAITIPEGA 455
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
HH+DLR DP + R EV + +WI ++
Sbjct: 456 HHLDLRARNAFDPTTVLLARSLEVRHMKQWIRDFY 490
>gi|296471920|tpg|DAA14035.1| TPA: prolylcarboxypeptidase precursor [Bos taurus]
Length = 499
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 206/460 (44%), Positives = 286/460 (62%), Gaps = 19/460 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y +Y Q +DHF +N +TF+QRYLI D +W ++ I YTGNEGDI WF NT
Sbjct: 50 YSIRYIQQKVDHFGFNID--RTFKQRYLIADNYW--KEDGGSILFYTGNEGDIIWFCNNT 105
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+D+A + KA+LVF EHRYYG+S+P+G + ++ ++ +L++ QALAD+A LI
Sbjct: 106 GFMWDIAEEMKAMLVFAEHRYYGESLPFGAD---SFSDSRHLNFLTTEQALADFAKLIRY 162
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+ +GALASSAPI F+++V F
Sbjct: 163 LKRTIPGARNQPVIALGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFNDLVPCDIFMK 222
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC----KSEKNLAIESWL 287
I+T DF NC + I+ SW I AKK GL L +A +C KS+ ++ W+
Sbjct: 223 IVTTDFSQSGPNCSESIRRSWDAINRLAKKGPGLRWLSEALHLCTPLTKSQDVQRLKDWI 282
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSG 346
S +V AM DYP SNFL PLPA+PVK +C+ + V + ++ A +VYYNYSG
Sbjct: 283 SETWVNVAMVDYPYESNFLQPLPAWPVKVVCQYFKYSNVPDTVMVQNIFQALNVYYNYSG 342
Query: 347 TAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
AKC +++ + G+ W +QACTEM+M T D D +FE + + C + +G
Sbjct: 343 QAKCLNVSETATSSLGVLGWSYQACTEMVMPTCSDGVDDMFEPHSWNMKEYSDDCFKQWG 402
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
V PRP+WI T +GG I +NIIF NG DPWSGGGV K+I+ T+LA+V GA
Sbjct: 403 VRPRPSWIPTMYGGKNIS-----SHTNIIFSNGELDPWSGGGVTKDITDTLLAIVIPNGA 457
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
HH+DLR S DP ++ R EV+ + +WIS ++ L +
Sbjct: 458 HHLDLRASNALDPVSVQLTRSLEVKYMKQWISDFYVRLRK 497
>gi|291234129|ref|XP_002737003.1| PREDICTED: prolylcarboxypeptidase isoform 1 preproprotein-like
[Saccoglossus kowalevskii]
Length = 501
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 203/454 (44%), Positives = 283/454 (62%), Gaps = 22/454 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+T Y Q LDHFN+ + TF QRYL++D +W S PIF YTGNEGDI WF NT
Sbjct: 57 YQTLYFKQTLDHFNF--ANNGTFSQRYLLSDDYWNSS---GPIFFYTGNEGDITWFCNNT 111
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF++D+AP+FKA+++F EHRYYG+S+P+G ++ + GYL+S QALAD+A+LI
Sbjct: 112 GFIWDIAPQFKAMVIFAEHRYYGESLPFGNE---SFSDLEHVGYLTSEQALADFATLIKY 168
Query: 173 LKKNLTATD-SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+K + D SPV+VFGGSYGGM+AAWFR+KYP++ GALA+SAPI F + +
Sbjct: 169 IKSSRPGADKSPVIVFGGSYGGMMAAWFRMKYPNIVDGALAASAPIWQFPGLTPCNTLFT 228
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWL 287
IITQDF +C + I SW I + G + L AF +C K A + SWL
Sbjct: 229 IITQDFVKAGRDCAETIHKSWNAINRMKDQESGRQWLTMAFHLCTPLKTPADVNDLSSWL 288
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND-VFAKLYGAASVYYNYSG 346
S + AM DYP P++FL PLPA+P+K+ C + + + + + + VYYN +G
Sbjct: 289 SNTWFNLAMVDYPYPASFLEPLPAWPIKKTCSYMTNSSLLDKPLLNSVASSLQVYYNTTG 348
Query: 347 TAKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
+CF+++ D+ L E GW Q+CTEM+M + D +F ++ ++D + C+ +
Sbjct: 349 KTQCFNISQDA-VSSLGELGWSYQSCTEMVMPSCSDGVHDMFPPNKWNFDDFVKECQNTW 407
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
GV PR +WI T +GG I +SNIIF NG DPWSGGGVL ++S+T++A+V K+G
Sbjct: 408 GVTPRADWIVTHYGGKAITA-----SSNIIFSNGELDPWSGGGVLHSLSETLIAIVIKDG 462
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
AHH+DLR K DPQ + D R +E I KWI Q
Sbjct: 463 AHHLDLRSKDKGDPQSVIDARNQEKYHITKWIQQ 496
>gi|417401904|gb|JAA47816.1| Putative lysosomal pro-x carboxypeptidase [Desmodus rotundus]
Length = 497
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 204/460 (44%), Positives = 283/460 (61%), Gaps = 19/460 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q +DHF + ++ +TF+QRYLI D HW K+ I YTGNEGDI WF NT
Sbjct: 48 YSIHYTEQKVDHFGF--KTDKTFKQRYLIADQHW--KKDGGSILFYTGNEGDIIWFCNNT 103
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + KA+LVF EHRYYGKS+P+G N ++K++ +L+S QALAD+ LI
Sbjct: 104 GFMWDVAEELKAMLVFAEHRYYGKSLPFGAN---SFKDSRHLNFLTSEQALADFGELIRH 160
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+ GALA+SAPI F++IV F
Sbjct: 161 LKRTIPGAENQPVIAVGGSYGGMLAAWFRMKYPHMVAGALAASAPIWQFEDIVPCGVFME 220
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC----KSEKNLAIESWL 287
I+T DFR NC + I+ SW I + GL L +A +C S+ ++ W+
Sbjct: 221 IVTTDFRKSGPNCSESIRRSWDAINRLTRTGTGLHWLSEALHLCTPLTNSQDVQHLKDWI 280
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSG 346
S +V AM DYP S+FL PLPA+P+K +C+ + +P + + ++ A +VYYNYSG
Sbjct: 281 SETWVNLAMVDYPYESDFLQPLPAWPIKVVCQYLKNPHVSDTQLLQNIFQALNVYYNYSG 340
Query: 347 TAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
A+C +++ + G W +QACTEM+M + D +FE D + C + +G
Sbjct: 341 QARCLNMSETATGSLGTLGWSYQACTEMVMPFCTNGIDDMFEPHSWDLKEFSDDCFKQWG 400
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
V PRP+WITT +GG I +NIIF NG DPWSGGGV KNI+ T++A+ EGA
Sbjct: 401 VTPRPSWITTMYGGKNIS-----SHTNIIFSNGDLDPWSGGGVTKNITDTLVAITIPEGA 455
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
HH+DLR + DP+ L R EV+ + +WI ++ L +
Sbjct: 456 HHLDLRANNAFDPKSLLLARSLEVKYMKQWIRDFYASLRK 495
>gi|118404640|ref|NP_001072639.1| prolylcarboxypeptidase precursor [Xenopus (Silurana) tropicalis]
gi|115313632|gb|AAI23913.1| prolylcarboxypeptidase (angiotensinase C) [Xenopus (Silurana)
tropicalis]
Length = 499
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 203/453 (44%), Positives = 288/453 (63%), Gaps = 19/453 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+T Y TQ +DHF + + TF+QRYL++DT+W K PI YTGNEGDI F NT
Sbjct: 53 YETYYFTQQVDHFGFYEDA--TFKQRYLVSDTYW--RKPGGPILFYTGNEGDITLFCNNT 108
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + A+LVF EHRYYG+S+P+G ++A+ + YL+S QALAD+A L+
Sbjct: 109 GFMWDVAEEMGAMLVFAEHRYYGESMPFG---DLAFSDPKHLNYLTSEQALADFAVLLRY 165
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
K A +SPV+ GGSYGGMLAAWFR+KYP V +GA+ASSAPI F+++V +
Sbjct: 166 FKATTEGAQNSPVIALGGSYGGMLAAWFRMKYPDVVVGAIASSAPIWQFEDLVPCNEYYQ 225
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNLAI-ESWL 287
++T DF+ C + ++ SW I A+ GL+ L AF +C KS++++A+ + WL
Sbjct: 226 VVTNDFKKSGPGCSESVRNSWAAINRMAETTDGLQWLSNAFHLCCPLKSKEDVAVFKGWL 285
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSG 346
S +V AM DYP P++FL PLPA+P++ +CK + DPK+G+ D+ ++ A +VYYNY+G
Sbjct: 286 SETWVSLAMVDYPYPASFLEPLPAWPIQVVCKFLKDPKSGDKDLLQDIFQAVNVYYNYTG 345
Query: 347 TAKCFDLNGD-SDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
C + + S G W +QACTEM+M D +FE D A + C + +G
Sbjct: 346 DTPCLNTSQTASGSLGDLGWSYQACTEMVMPFCSDGVSDMFEPQSWDLQAFSDECYKQWG 405
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
V PR +W+TT +GG I SNIIF NG DPWS GGV +++S ++LA++ EGA
Sbjct: 406 VRPRGSWVTTVYGGKNI-----SSHSNIIFSNGGLDPWSAGGVKESLSDSLLAVLIPEGA 460
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
HH+DLR + +DP+ + R EVE + KWI+Q
Sbjct: 461 HHLDLRSNNADDPKSVLLARSLEVEYMKKWIAQ 493
>gi|344293754|ref|XP_003418585.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Loxodonta africana]
Length = 503
Score = 390 bits (1002), Expect = e-106, Method: Compositional matrix adjust.
Identities = 208/482 (43%), Positives = 291/482 (60%), Gaps = 32/482 (6%)
Query: 26 FPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTH 85
PT P+SR + + S++ Y Q +DHF ++ +TF+QRYLI D +
Sbjct: 36 LPTRPTSRPSVARNYSIL-------------YFKQKVDHFGFDIN--KTFKQRYLIADKY 80
Query: 86 WGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKE 145
W K+ I YTGNEGDI WF NTGFM+DVA + KA+LVF EHRYYG+S+P+G N
Sbjct: 81 W--KKDGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGAN-- 136
Query: 146 IAYKNASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYP 204
++K++ +L+S QALAD+A LI LK+ + A + PV+ GGSYGGMLAAWFR+KYP
Sbjct: 137 -SFKDSRHLNFLTSEQALADFAKLIKHLKQTIPGAENQPVIAVGGSYGGMLAAWFRMKYP 195
Query: 205 HVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGG 264
H+ +GALA+SAPI F+++V F I+T+DF+ NC + I+ SW I A G
Sbjct: 196 HLVVGALAASAPIWQFEDLVPCGIFMEIVTRDFKKSGPNCSESIRRSWNVINRLATTSSG 255
Query: 265 LEKLQKAFRICKSEKNL----AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKA 320
L+ L +A +C + ++ W+S +V AM DYP NFL PLPA+P+K +C+
Sbjct: 256 LQWLSEALHLCSPLTDFQDFRMLKDWISETWVNLAMVDYPYECNFLQPLPAWPIKVVCQY 315
Query: 321 IDDPKTGNDVFAK-LYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTG 378
DPK + + + L+ A +VYYNYSG KC +++ + G W +QACTE+IM
Sbjct: 316 FKDPKVSDQLLVQNLFQALNVYYNYSGQVKCLNISETATSSLGSLGWSYQACTEIIMPFC 375
Query: 379 GDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNG 438
+ D +FE D + + C + +GV PRP WITT +GG I NIIF NG
Sbjct: 376 TNGVDDMFEPRSWDLEELSDDCFKEWGVRPRPFWITTLYGGKNISS-----HRNIIFSNG 430
Query: 439 LRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
DPWSGGGV KNI+ T++A+ +GAHH+DLR S DP+ + R EV + +WIS
Sbjct: 431 ELDPWSGGGVTKNITDTLVAINIPDGAHHLDLRASNALDPKTVLLARSLEVRYMKQWISD 490
Query: 499 YF 500
++
Sbjct: 491 FY 492
>gi|354489839|ref|XP_003507068.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cricetulus
griseus]
Length = 494
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 209/485 (43%), Positives = 295/485 (60%), Gaps = 34/485 (7%)
Query: 22 NAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLI 81
N ++FPTF D Y Y Q +DHF + + +TF+QRYLI
Sbjct: 31 NLEVFPTF----------------QPDEANNYSVFYFEQKVDHFGF--YNTKTFKQRYLI 72
Query: 82 NDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYG 141
D +W + I YTGNEGDI WF+ +TGFM+DVA K KALLVF EHRYYG+S+P+G
Sbjct: 73 ADRYW--KTYDGVILFYTGNEGDITWFSNHTGFMWDVAEKLKALLVFAEHRYYGESLPFG 130
Query: 142 GNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDS-PVVVFGGSYGGMLAAWFR 200
++K++ +L+S QALAD+A LI L++ + DS PV+ GGSYGG+LAAWFR
Sbjct: 131 AE---SFKDSKHLNFLTSEQALADFAELIRHLRRTVPGADSQPVIAIGGSYGGVLAAWFR 187
Query: 201 LKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAK 260
+KYPH+ IGALA+SAPI F+ ++ F I+T+DFR + C + I SW I +
Sbjct: 188 MKYPHLVIGALAASAPIWQFEELIPCGMFMKIVTEDFRKSGKKCSESILRSWGAINRLSN 247
Query: 261 KPGGLEKLQKAFRIC---KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEM 317
GL L KAF +C SE ++ W+S +V AM +YP P+NFL PLP++P+K +
Sbjct: 248 TGSGLPWLTKAFHLCSSLNSEDVQRLKDWISETWVNLAMVNYPYPANFLKPLPSWPLKVV 307
Query: 318 CKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCFDLNGDS-DPHGLSEWGWQACTEMIM 375
C+ + +P + + + +Y A +VYYNYSG ++C +++ + G WG+Q CTE+I+
Sbjct: 308 CQYLTNPFMSDSLLVQNIYQALNVYYNYSGQSRCNNISETTIGSLGSRVWGYQTCTEIIL 367
Query: 376 LTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIF 435
+ D +FE D D + C + +GV PRP+WITT +GG I R SNIIF
Sbjct: 368 PFCTNGVDDMFEARAWDLDKYSDDCYKQWGVRPRPSWITTLYGGKDI-----RSHSNIIF 422
Query: 436 FNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKW 495
NG DPWSGGGV +++S T++A+ EGAHH+DLR ST+ DP L+ R EV + +W
Sbjct: 423 SNGDLDPWSGGGVTEDLSDTLVAVNIPEGAHHLDLRSSTETDPPSLQLARSVEVRHMKQW 482
Query: 496 ISQYF 500
IS ++
Sbjct: 483 ISDFY 487
>gi|260814189|ref|XP_002601798.1| hypothetical protein BRAFLDRAFT_279167 [Branchiostoma floridae]
gi|229287100|gb|EEN57810.1| hypothetical protein BRAFLDRAFT_279167 [Branchiostoma floridae]
Length = 481
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 206/458 (44%), Positives = 280/458 (61%), Gaps = 23/458 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y TKY TQ +DHF++ TF QRYLIN ++ G+ PIF+YTGNEGDI F NT
Sbjct: 40 YDTKYFTQPVDHFSFT--RTDTFDQRYLINMKYFEGT--GGPIFLYTGNEGDITMFCDNT 95
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+D+APKFKAL+VF EHRYYG+S+PYG + +YK+ GYL++ QALAD+A LI
Sbjct: 96 GFMWDIAPKFKALVVFAEHRYYGESMPYGKD---SYKDPEHLGYLTAEQALADFARLITH 152
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK ++ A DSPVV FGGSYGGMLAAWFR+KYP IG+LA+SAP+ F+ + S +
Sbjct: 153 LKASIPGAADSPVVAFGGSYGGMLAAWFRMKYPSSVIGSLAASAPVWQFEGLTPCGSSYS 212
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAIESWL 287
IIT+DF+ S C I SW + + + G EKL F +C + + SWL
Sbjct: 213 IITEDFQKGSPGCETYIHKSWDLLTQMGQTAAGREKLSSMFSLCSPLNTTTDVATMSSWL 272
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGT 347
+ + AM +YP P++FL PLP +PVKE+C I ++V + A +YYNY+G
Sbjct: 273 LSTWFNLAMVNYPYPASFLEPLPGWPVKEVCSLI---TASSNVLEGIAAGAMLYYNYTGQ 329
Query: 348 AKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
A C ++ +S L + GW Q C+EM M + D +F + CK +
Sbjct: 330 APCLNIE-ESAVSSLGDLGWSFQFCSEMAMPSCTDGVHDMFPLQPWNLTTYISQCKSTWK 388
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
V PRP WI +FGG I ASNIIF NGL DPWS GGV++++S +++A+ +GA
Sbjct: 389 VTPRPYWILQQFGGKNI-----TAASNIIFSNGLLDPWSAGGVMESLSDSLVAITIADGA 443
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
HH+DLR S DPQ + + R +EV+II +W+ Y+ L
Sbjct: 444 HHLDLRSSNPADPQSVIEAREQEVQIIREWLQDYYSSL 481
>gi|383408887|gb|AFH27657.1| lysosomal Pro-X carboxypeptidase isoform 1 preproprotein [Macaca
mulatta]
Length = 496
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 204/471 (43%), Positives = 287/471 (60%), Gaps = 22/471 (4%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P +SL + +K+ Y Y Q +DHF +N + +TF QRYL+ D +W KN I
Sbjct: 35 PTNPTSLPAVAKN----YSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSI 86
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
YTGNEGDI WF NTGFM+DVA + KA+LVF EHRYYG+S+P+G N ++K++
Sbjct: 87 LFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLN 143
Query: 156 YLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
+L+S QALAD+A LI LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+A+GALA+S
Sbjct: 144 FLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMAVGALAAS 203
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
API F+++V F I+T DFR +C + I+ SW I + GL+ L A +
Sbjct: 204 APIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIRRSWDAINRLSSTGSGLQWLTGALHL 263
Query: 275 CK---SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF 331
C S+ ++ W+S +V AM DYP SNFL PLPA+P+K +C+ + +P + +
Sbjct: 264 CSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLL 323
Query: 332 AK-LYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEES 389
+ ++ A +VYYNYSG KC +++ + G W +QACTE++M + D +FE
Sbjct: 324 LQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPH 383
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
+ + C E +GV PRP+WITT +GG I SNI+F NG DPWSGGGV
Sbjct: 384 SWNLKELSDECFEQWGVRPRPSWITTMYGGKNIS-----SHSNIVFSNGELDPWSGGGVT 438
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
K+I+ T++A+ EGAHH+DLR DP + R EV + WI ++
Sbjct: 439 KDITDTLVAVTISEGAHHLDLRAKNALDPTSVLLARALEVRHMKNWIRDFY 489
>gi|84000301|ref|NP_001033253.1| lysosomal Pro-X carboxypeptidase precursor [Bos taurus]
gi|122137021|sp|Q2TA14.1|PCP_BOVIN RecName: Full=Lysosomal Pro-X carboxypeptidase; AltName:
Full=Proline carboxypeptidase; AltName:
Full=Prolylcarboxypeptidase; Short=PRCP; Flags:
Precursor
gi|83405390|gb|AAI11172.1| Prolylcarboxypeptidase (angiotensinase C) [Bos taurus]
Length = 499
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 205/460 (44%), Positives = 285/460 (61%), Gaps = 19/460 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y +Y Q +DHF +N +TF+QRYLI D +W ++ I YTGNEGDI WF NT
Sbjct: 50 YSIRYIQQKVDHFGFNID--RTFKQRYLIADNYW--KEDGGSILFYTGNEGDIIWFCNNT 105
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+D+A + KA+LVF EHRYYG+S+P+G + ++ ++ +L++ QALAD+A LI
Sbjct: 106 GFMWDIAEEMKAMLVFAEHRYYGESLPFGAD---SFSDSRHLNFLTTEQALADFAKLIRY 162
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK+ + A + V+ GGSYGGMLAAWFR+KYPH+ +GALASSAPI F+++V F
Sbjct: 163 LKRTIPGARNQHVIALGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFNDLVPCDIFMK 222
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC----KSEKNLAIESWL 287
I+T DF NC + I+ SW I AKK GL L +A +C KS+ ++ W+
Sbjct: 223 IVTTDFSQSGPNCSESIRRSWDAINRLAKKGTGLRWLSEALHLCTPLTKSQDVQRLKDWI 282
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSG 346
S +V AM DYP SNFL PLPA+PVK +C+ + V + ++ A +VYYNYSG
Sbjct: 283 SETWVNVAMVDYPYESNFLQPLPAWPVKVVCQYFKYSNVPDTVMVQNIFQALNVYYNYSG 342
Query: 347 TAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
AKC +++ + G+ W +QACTEM+M T D D +FE + + C + +G
Sbjct: 343 QAKCLNVSETATSSLGVLGWSYQACTEMVMPTCSDGVDDMFEPHSWNMKEYSDDCFKQWG 402
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
V PRP+WI T +GG I +NIIF NG DPWSGGGV K+I+ T+LA+V GA
Sbjct: 403 VRPRPSWIPTMYGGKNIS-----SHTNIIFSNGELDPWSGGGVTKDITDTLLAIVIPNGA 457
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
HH+DLR S DP ++ R EV+ + +WIS ++ L +
Sbjct: 458 HHLDLRASNALDPVSVQLTRSLEVKYMKQWISDFYVRLRK 497
>gi|403287756|ref|XP_003935096.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Saimiri boliviensis
boliviensis]
Length = 496
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 210/505 (41%), Positives = 301/505 (59%), Gaps = 26/505 (5%)
Query: 2 ATRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQI 61
A RF+ LSF + TI+ S + P +SL + +K+ Y Y Q
Sbjct: 5 ALRFLLLSF---LAPWATIALRPALRALGSLHL-PTNPTSLPAVAKN----YSVLYFQQK 56
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF +N + +TF QRYL+ D +W KN I YTGNEGDI WF NTGFM+DVA +
Sbjct: 57 VDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEE 112
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-AT 180
KA+LVF EHRYYG+S+P+G + ++K++ +L+S QALAD+A LI +K+ + A
Sbjct: 113 LKAMLVFAEHRYYGESLPFG---DSSFKDSRHLNFLTSEQALADFAELIKHMKRTIPGAE 169
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
+ PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI F+++V F I+T DFR
Sbjct: 170 NQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKS 229
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAFVYTAMT 297
+C + I+ SW I + GL+ L +A +C SE ++ W+S +V AM
Sbjct: 230 GPHCSESIRRSWDAINRLSSTGSGLQWLSEALHLCSPLTSEDVRHLKDWISETWVNLAMV 289
Query: 298 DYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCFDLNGD 356
DYP SNFL PLPA+P+K +C+ + P + + + ++ A +VYYNYSG AKC +++
Sbjct: 290 DYPYASNFLQPLPAWPIKVVCQYLKSPNVSDLLLLQNIFQALNVYYNYSGQAKCLNISET 349
Query: 357 SDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITT 415
+ G W +QACTE++M + D +FE + + C + +GV PRP+WITT
Sbjct: 350 ATGSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGVRPRPSWITT 409
Query: 416 EFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTK 475
+GG I +NI+F NG DPWSGGGV K+I+ T++A+ EGAHH+DLR
Sbjct: 410 MYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHLDLRAKNA 464
Query: 476 EDPQWLKDVRRREVEIIGKWISQYF 500
DP + R EV + WI +++
Sbjct: 465 LDPTSVLLARSLEVRHMKNWIREFY 489
>gi|355752501|gb|EHH56621.1| hypothetical protein EGM_06072 [Macaca fascicularis]
Length = 516
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 199/450 (44%), Positives = 279/450 (62%), Gaps = 18/450 (4%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y T ++DHF +N + +TF QRYL+ D +W KN I YTGNEGDI WF NTGFM+
Sbjct: 72 YKTPLVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMW 127
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
DVA + KA+LVF EHRYYG+S+P+G N ++K++ +L+S QALAD+A LI LK+
Sbjct: 128 DVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLNFLTSEQALADFAELIKHLKRT 184
Query: 177 LT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
+ A + PV+ GGSYGGMLAAWFR+KYPH+A+GALA+SAPI F+++V F I+T
Sbjct: 185 IPGAENQPVIAIGGSYGGMLAAWFRMKYPHMAVGALAASAPIWQFEDLVPCGVFMKIVTT 244
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAFV 292
DFR +C + I+ SW I + GL+ L A +C S+ ++ W+S +V
Sbjct: 245 DFRKSGPHCSESIRRSWDAINRLSSTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWV 304
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCF 351
AM DYP SNFL PLPA+P+K +C+ + +P + + + ++ A +VYYNYSG KC
Sbjct: 305 NLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCL 364
Query: 352 DLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
+++ + G W +QACTE++M + D +FE + + C E +GV PRP
Sbjct: 365 NISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDECFEQWGVRPRP 424
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
+WITT +GG I SNI+F NG DPWSGGGV K+I+ T++A+ EGAHH+DL
Sbjct: 425 SWITTMYGGKNIS-----SHSNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHLDL 479
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
R DP + R EV + WI ++
Sbjct: 480 RAKNALDPTSVLLARALEVRHMKNWIRDFY 509
>gi|2827710|emb|CAA16683.1| lysosomal Pro-X carboxypeptidase - like protein [Arabidopsis
thaliana]
Length = 499
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 215/486 (44%), Positives = 281/486 (57%), Gaps = 81/486 (16%)
Query: 1 MATRF----IFLSFCLLFSST-LTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYK- 54
MA+ F IF F L+F S ++S++K+ P FP R T + + I + + Y+
Sbjct: 1 MASHFCLLLIFTFFTLVFPSNGSSLSSSKLLPRFP--RYTFQNREARIQQFRGDRNEYRY 58
Query: 55 -TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
TK+ +Q LDHF++ F QRYLIN HW G+ PIF+Y GNEGDIEWFA N+G
Sbjct: 59 ETKFFSQQLDHFSF--ADLPKFSQRYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSG 116
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
F++D+APKF ALLVF EHRYYG+S+PYG +E AYKNA+T YL++ QALAD+A + DL
Sbjct: 117 FIWDIAPKFGALLVFPEHRYYGESMPYGSREE-AYKNATTLSYLTTEQALADFAVFVTDL 175
Query: 174 KKNLTATDSPVVVFGGSYGG--------------MLAAWFRLKYPHVAIGALASSAPILN 219
K+NL+A PVV+FGGSYGG +LAAW RLKYPH+AIGALASSAPIL
Sbjct: 176 KRNLSAEACPVVLFGGSYGGSNNCVFVFVVIDATVLAAWMRLKYPHIAIGALASSAPILQ 235
Query: 220 FDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK 279
F+++V P +F +I + DF+ S +C+ IK SW I +K GL +L K F C+
Sbjct: 236 FEDVVPPETFYDIASNDFKRESSSCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLN 295
Query: 280 NL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAA 338
+ + WL +A+ Y AM DYP P++F+ PLP P++E+C+ ID + + ++Y
Sbjct: 296 STDDLSDWLDSAYSYLAMVDYPYPADFMMPLPGHPIREVCRKIDGAGSNASILDRIYAGI 355
Query: 339 SVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
SVYYNY+G CF L D DPHGL W WQ
Sbjct: 356 SVYYNYTGNVDCFKL--DDDPHGLDGWNWQ------------------------------ 383
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
I LK F SNIIF NGL DPWSGG VLKN+S T++A
Sbjct: 384 ----------------------DIATTLKSFGSNIIFSNGLLDPWSGGSVLKNLSDTIVA 421
Query: 459 LVEKEG 464
LV KEG
Sbjct: 422 LVTKEG 427
>gi|166064014|ref|NP_001099751.2| lysosomal Pro-X carboxypeptidase precursor [Rattus norvegicus]
Length = 493
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 198/456 (43%), Positives = 281/456 (61%), Gaps = 18/456 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q +DHF ++ +TF+QRYL+ D HW +N I YTGNEGDI WF NT
Sbjct: 45 YSVHYFQQKVDHFGFS--DTRTFKQRYLVADKHW--QRNGGSILFYTGNEGDIVWFCNNT 100
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + KA+LVF EHRYYGKS+P+G + ++K++ +L+S QALAD+A LI
Sbjct: 101 GFMWDVAEELKAMLVFAEHRYYGKSLPFGRD---SFKDSQHLNFLTSEQALADFAELIRH 157
Query: 173 LKKNLTATD-SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK+ + T+ PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI D + F
Sbjct: 158 LKETIPGTEGQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWQLDGMAPCGEFMK 217
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLS 288
I+T+DF +C + I+ SW I + GL+ L+ +C SEK ++ W++
Sbjct: 218 IVTKDFSQSGPHCSESIRRSWSVINRLSDTGSGLQWLRDTLHLCSPLTSEKMPTLKGWIA 277
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGT 347
+V AM +YP NFL PLPA+P+KE+C+ + +P + V + ++ A +VYYNYSG
Sbjct: 278 ETWVNLAMVNYPYACNFLQPLPAWPIKEVCRYLRNPNVSDTVLLQNIFQALNVYYNYSGQ 337
Query: 348 AKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
KC +++ + G + W +QACTEM+M + D +FE D + + C +GV
Sbjct: 338 TKCLNISQTTTSSLGSTGWSFQACTEMVMPFCTNGIDDMFEPYLWDLETYSNDCFNQWGV 397
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PRP+W+TT +GG I SNIIF NG DPWSGGGV K+I+ T++A+ EGAH
Sbjct: 398 KPRPHWMTTMYGGKNIS-----SHSNIIFSNGDLDPWSGGGVTKDITDTLVAINIPEGAH 452
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
H+DLR DP + R EV+ + KWI+ ++ +
Sbjct: 453 HLDLRAHNAFDPSSVLLSRLLEVKHMKKWIADFYSN 488
>gi|386781752|ref|NP_001248178.1| lysosomal Pro-X carboxypeptidase precursor [Macaca mulatta]
gi|380789243|gb|AFE66497.1| lysosomal Pro-X carboxypeptidase isoform 1 preproprotein [Macaca
mulatta]
gi|384941696|gb|AFI34453.1| lysosomal Pro-X carboxypeptidase isoform 1 preproprotein [Macaca
mulatta]
Length = 496
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 203/471 (43%), Positives = 287/471 (60%), Gaps = 22/471 (4%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P +SL + +K+ Y Y Q +DHF +N + +TF QRYL+ D +W KN I
Sbjct: 35 PTNPTSLPAVAKN----YSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSI 86
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
YTGNEGDI WF NTGFM+DVA + KA+LVF EHRYYG+S+P+G N ++K++
Sbjct: 87 LFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLN 143
Query: 156 YLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
+L+S QALAD+A LI LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+A+GALA+S
Sbjct: 144 FLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMAVGALAAS 203
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
API F+++V F I+T DF+ +C + I+ SW I + GL+ L A +
Sbjct: 204 APIWQFEDLVPCGVFMKIVTTDFKKSGPHCSESIRRSWDAINRLSSTGSGLQWLTGALHL 263
Query: 275 CK---SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF 331
C S+ ++ W+S +V AM DYP SNFL PLPA+P+K +C+ + +P + +
Sbjct: 264 CSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLL 323
Query: 332 AK-LYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEES 389
+ ++ A +VYYNYSG KC +++ + G W +QACTE++M + D +FE
Sbjct: 324 LQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPH 383
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
+ + C E +GV PRP+WITT +GG I SNI+F NG DPWSGGGV
Sbjct: 384 SWNLKELSDECFEQWGVRPRPSWITTMYGGKNIS-----SHSNIVFSNGELDPWSGGGVT 438
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
K+I+ T++A+ EGAHH+DLR DP + R EV + WI ++
Sbjct: 439 KDITDTLVAVTISEGAHHLDLRAKNALDPTSVLLARALEVRHMKNWIRDFY 489
>gi|426251523|ref|XP_004019471.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Ovis aries]
Length = 497
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 209/488 (42%), Positives = 293/488 (60%), Gaps = 22/488 (4%)
Query: 27 PTFPSSRITPEKL--SSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDT 84
P PS R P L S+ S Y +Y Q +DHF +N +TF+QRYLI D
Sbjct: 21 PVSPSLR-APSSLPWSTSFGSRPTITPKYSIRYIQQKVDHFGFNID--RTFKQRYLIADN 77
Query: 85 HWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNK 144
+W ++ I YTGNEGDI WF NTGFM+D+A + KA+LVF EHRYYG+S+P+G +
Sbjct: 78 YW--KEDGGSILFYTGNEGDIIWFCNNTGFMWDIAEEMKAMLVFAEHRYYGESLPFGAD- 134
Query: 145 EIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKY 203
++ ++ +L++ QALAD+A LI LK+ + A + PV+ GGSYGGMLAAWFR+KY
Sbjct: 135 --SFSDSRHLNFLTTEQALADFAKLIKYLKRTIPGARNQPVIALGGSYGGMLAAWFRMKY 192
Query: 204 PHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPG 263
PH+ +GALASSAPI F+++V F I+T DF NC + I+ SW I AKK
Sbjct: 193 PHLVVGALASSAPIWQFNDLVPCDVFMKIVTTDFSQSGPNCSESIRRSWDAINRLAKKGP 252
Query: 264 GLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK 319
GL L +A +C L ++ W++ +V AM DYP SNFL PLPA+PVK +C+
Sbjct: 253 GLRWLSEALHLCTPLTKLQDVQHLKDWIAETWVNLAMVDYPYESNFLQPLPAWPVKVVCQ 312
Query: 320 AIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLT 377
+ + + ++ A +VYYNYSG AKC +++ + G+ W +QACTEM+M T
Sbjct: 313 YFKYSNVPDTLLLRNIFQALNVYYNYSGQAKCLNVSETATSSLGVQGWSYQACTEMVMPT 372
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFN 437
D D +FE + + C + +GV PRP+WI T +GG I +NIIF N
Sbjct: 373 CSDGVDDMFEPHSWNMKEYSDDCFKQWGVRPRPSWIPTMYGGKNIS-----SHTNIIFSN 427
Query: 438 GLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G DPWSGGGV K+I+ T+LA+V GAHH+DLR S DP ++ R EV+ + +W++
Sbjct: 428 GELDPWSGGGVTKDITDTLLAIVIPNGAHHLDLRASNALDPVSVQLTRSLEVKYMKQWVT 487
Query: 498 QYFQDLAQ 505
++ L +
Sbjct: 488 DFYVRLRK 495
>gi|296216949|ref|XP_002754795.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Callithrix
jacchus]
Length = 496
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 199/454 (43%), Positives = 280/454 (61%), Gaps = 18/454 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q +DHF +N + +TF QRYL+ D +W KN I YTGNEGDI WF NT
Sbjct: 48 YSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNT 103
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + KA+LVF EHRYYG+S+P+G + ++K++ +L+S QALAD+A LI
Sbjct: 104 GFMWDVAEELKAMLVFAEHRYYGESLPFG---DSSFKDSRHLNFLTSEQALADFAELIKH 160
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI F+++V F
Sbjct: 161 LKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMK 220
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLS 288
I+T DFR +C + I SW I + GL+ L +A +C SE ++ W+S
Sbjct: 221 IVTTDFRKSGPHCSESIHRSWDAINRLSSTGSGLQWLSEALHLCSPLTSEDIQHLKDWIS 280
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGT 347
+V AM DYP SNFL PLPA+P+K +C+ + +P + + + ++ A +VYYNYSG
Sbjct: 281 ETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDLLLLQNIFQALNVYYNYSGQ 340
Query: 348 AKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
AKC +++ + G W +QACTE++M + D +FE + + YC + +GV
Sbjct: 341 AKCLNISETATGSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDYCFQQWGV 400
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PRP+WITT +GG I +NI+F NG DPWSGGGV K+I+ T++A+ EGAH
Sbjct: 401 RPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAH 455
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
H+DLR DP + R EV + WI ++
Sbjct: 456 HLDLRAKNALDPTSVLLARVLEVRHMKNWIRDFY 489
>gi|349604443|gb|AEP99992.1| Lysosomal Pro-X carboxypeptidase-like protein, partial [Equus
caballus]
Length = 441
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 199/452 (44%), Positives = 280/452 (61%), Gaps = 21/452 (4%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF +N +TF+QRYLI D HW K+ I YTGNEGDI WF NTGFM+D+A +
Sbjct: 1 VDHFGFNTD--KTFKQRYLIADKHW--KKDGGSILFYTGNEGDITWFCNNTGFMWDMAEE 56
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-AT 180
KA+LVF EHRYYG+S+P+G N ++K+++ +L+S QALAD+A LI LK+ + A
Sbjct: 57 LKAMLVFAEHRYYGESLPFGAN---SFKDSTHLNFLTSEQALADFAKLIKHLKRTVPGAK 113
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
+ PV+ GGSYGGMLAAWFR+KYPH+ +GALASSAPI F+++V F I+T DFR
Sbjct: 114 NQPVIALGGSYGGMLAAWFRMKYPHMVVGALASSAPIWQFEDLVPCGVFMEIVTADFRKS 173
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC----KSEKNLAIESWLSTAFVYTAM 296
NC + I+ SW I GL L +A R+C S+ ++ W++ +V AM
Sbjct: 174 GPNCSESIRRSWDAIRRLTSTGPGLHWLSEALRLCSPLTNSQDVEHLKDWITETWVNLAM 233
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSGTAKCFDLNG 355
DYP SNFL PLPA+P+K +C+ + +P + + ++ A +VYYNYSG KC +++
Sbjct: 234 VDYPYESNFLQPLPAWPIKVVCQYLKNPNVSDPQLLENIFQALNVYYNYSGQVKCLNMS- 292
Query: 356 DSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWI 413
++ L GW QACTEM+M + D +FE + + C + +GV PRP+WI
Sbjct: 293 ETATRSLGSLGWSYQACTEMVMPFCTNGIDDMFEPHSWNLKEFSDDCFKQWGVRPRPSWI 352
Query: 414 TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFS 473
TT +GG I +NIIF NG DPWSGGGV +N++ T++A+ EGAHH+DLR +
Sbjct: 353 TTMYGGKNISS-----HTNIIFSNGDLDPWSGGGVTENVTDTLVAITIPEGAHHLDLRAN 407
Query: 474 TKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
DP + R EV + +WI ++ L +
Sbjct: 408 NAFDPTSVLLARSLEVRYMKQWIKNFYASLRK 439
>gi|402894798|ref|XP_003910532.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Papio
anubis]
Length = 496
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 203/471 (43%), Positives = 286/471 (60%), Gaps = 22/471 (4%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P +SL + +K+ Y Y Q +DHF +N + +TF QRYL+ D +W KN I
Sbjct: 35 PTNPTSLPAVAKN----YSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSI 86
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
YTGNEGDI WF NTGFM+DVA + KA+LVF EHRYYG+S+P+G N ++K++
Sbjct: 87 LFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLN 143
Query: 156 YLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
+L+S QALAD+A LI LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+A+GALA+S
Sbjct: 144 FLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMAVGALAAS 203
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
API F+++V F I+T DFR C + I+ SW I + GL+ L A +
Sbjct: 204 APIWQFEDLVPCGVFMKIVTTDFRKSGPYCSESIRRSWDAINRLSNTGSGLQWLTGALHL 263
Query: 275 CK---SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF 331
C S+ ++ W+S +V AM DYP SNFL PLPA+P+K +C+ + +P + +
Sbjct: 264 CSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLL 323
Query: 332 AK-LYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEES 389
+ ++ A +VYYNYSG KC +++ + G W +QACTE++M + D +FE
Sbjct: 324 LQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPH 383
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
+ + C E +GV PRP+WITT +GG I +NI+F NG DPWSGGGV
Sbjct: 384 SWNLKELSDECFEQWGVRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVT 438
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
K+I+ T++A+ EGAHH+DLR DP + R EV + WI ++
Sbjct: 439 KDITDTLVAVTISEGAHHLDLRAKNALDPTSVLLARALEVRHMKNWIRDFY 489
>gi|449271455|gb|EMC81816.1| Lysosomal Pro-X carboxypeptidase, partial [Columba livia]
Length = 437
Score = 387 bits (993), Expect = e-104, Method: Compositional matrix adjust.
Identities = 200/443 (45%), Positives = 277/443 (62%), Gaps = 21/443 (4%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF ++ TFQQRYLI D HW KNN PI YTGNEGDI WF+ NTGFM+DVA +
Sbjct: 2 VDHFGFDDN--LTFQQRYLIADQHW--KKNNGPILFYTGNEGDITWFSNNTGFMWDVAQE 57
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-AT 180
A+LVF EHRYYG+S+P+G +Y ++ YL+S QALAD+A LI LK + A
Sbjct: 58 LNAMLVFAEHRYYGESLPFGNE---SYSDSKRLNYLTSEQALADFAVLIEHLKSTIAGAR 114
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
SPV+ GGSY GMLAAWFR+KYPH+ +GALA+SAPI F ++V +F +I+T+DF+
Sbjct: 115 YSPVIAIGGSYRGMLAAWFRMKYPHLVVGALAASAPIWQFGDLVPCGAFFSIVTKDFKRS 174
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL----AIESWLSTAFVYTAM 296
C + I+ SW I+ + GL L K F +C KNL ++ LS ++ AM
Sbjct: 175 GTGCAESIRNSWSAIDRLSSTDEGLLWLSKTFHLCSPLKNLEDVAVLKRGLSDTWINLAM 234
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCFDLNG 355
+YP S+FL PLPA+P++E+CK + DP + + + ++ A ++YYNYSG CFD+
Sbjct: 235 MNYPYKSDFLEPLPAWPIEEVCKFLKDPSLSDKLLLQNVFRAINIYYNYSGQILCFDM-A 293
Query: 356 DSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWI 413
++ L GW Q CTEM+M D + +FE + D AR+ C + +GV PRP+WI
Sbjct: 294 ETATKSLGHLGWNYQTCTEMVMPLCTDGVNDMFEPQKWDLKARSEECYKLWGVRPRPSWI 353
Query: 414 TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFS 473
+ +GG I SNIIF NG DPWS GGV +NIS +++A++ EGAHH+DLR
Sbjct: 354 ISMYGGKNI-----HSHSNIIFSNGGMDPWSAGGVTRNISHSLVAIMIPEGAHHLDLRGR 408
Query: 474 TKEDPQWLKDVRRREVEIIGKWI 496
DP+ ++ R E+ + +WI
Sbjct: 409 NPSDPKSVQQARDLELCYMKQWI 431
>gi|114639541|ref|XP_001175153.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 7 [Pan
troglodytes]
gi|397502802|ref|XP_003822032.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Pan
paniscus]
gi|410223852|gb|JAA09145.1| prolylcarboxypeptidase (angiotensinase C) [Pan troglodytes]
gi|410254528|gb|JAA15231.1| prolylcarboxypeptidase (angiotensinase C) [Pan troglodytes]
gi|410308824|gb|JAA33012.1| prolylcarboxypeptidase (angiotensinase C) [Pan troglodytes]
gi|410334959|gb|JAA36426.1| prolylcarboxypeptidase (angiotensinase C) [Pan troglodytes]
Length = 496
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 201/471 (42%), Positives = 287/471 (60%), Gaps = 22/471 (4%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P +SL + +K+ Y Y Q +DHF +N + +TF QRYL+ D +W KN I
Sbjct: 35 PTNPTSLPAVAKN----YSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSI 86
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
YTGNEGDI WF NTGFM+DVA + KA+LVF EHRYYG+S+P+G N ++K++
Sbjct: 87 LFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLN 143
Query: 156 YLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
+L+S QALAD+A LI LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+S
Sbjct: 144 FLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAAS 203
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
API F+++V F I+T DFR +C + I+ SW+ I + GL+ L A +
Sbjct: 204 APIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIRRSWEAINRLSNTGSGLQWLTGALHL 263
Query: 275 CK---SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF 331
C S+ ++ W+S +V AM DYP SNFL PLPA+P+K +C+ + +P + +
Sbjct: 264 CSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLL 323
Query: 332 AK-LYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEES 389
+ ++ A +VYYNYSG KC +++ + G W +QACTE++M + D +FE
Sbjct: 324 LQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPH 383
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
+ + C + +GV PRP+WITT +GG I +NI+F NG DPWSGGGV
Sbjct: 384 SWNLKELSDDCFQQWGVRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVT 438
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
K+I+ T++A+ EGAHH+DLR DP + R EV + WI ++
Sbjct: 439 KDITDTLVAVTISEGAHHLDLRTKNALDPTSVLLARSLEVRHMKNWIRDFY 489
>gi|402894800|ref|XP_003910533.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Papio
anubis]
Length = 516
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 198/450 (44%), Positives = 278/450 (61%), Gaps = 18/450 (4%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y T ++DHF +N + +TF QRYL+ D +W KN I YTGNEGDI WF NTGFM+
Sbjct: 72 YKTPLVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMW 127
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
DVA + KA+LVF EHRYYG+S+P+G N ++K++ +L+S QALAD+A LI LK+
Sbjct: 128 DVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLNFLTSEQALADFAELIKHLKRT 184
Query: 177 LT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
+ A + PV+ GGSYGGMLAAWFR+KYPH+A+GALA+SAPI F+++V F I+T
Sbjct: 185 IPGAENQPVIAIGGSYGGMLAAWFRMKYPHMAVGALAASAPIWQFEDLVPCGVFMKIVTT 244
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAFV 292
DFR C + I+ SW I + GL+ L A +C S+ ++ W+S +V
Sbjct: 245 DFRKSGPYCSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWV 304
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCF 351
AM DYP SNFL PLPA+P+K +C+ + +P + + + ++ A +VYYNYSG KC
Sbjct: 305 NLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCL 364
Query: 352 DLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
+++ + G W +QACTE++M + D +FE + + C E +GV PRP
Sbjct: 365 NISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDECFEQWGVRPRP 424
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
+WITT +GG I +NI+F NG DPWSGGGV K+I+ T++A+ EGAHH+DL
Sbjct: 425 SWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHLDL 479
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
R DP + R EV + WI ++
Sbjct: 480 RAKNALDPTSVLLARALEVRHMKNWIRDFY 509
>gi|338726996|ref|XP_001917253.2| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X carboxypeptidase
[Equus caballus]
Length = 519
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 201/461 (43%), Positives = 283/461 (61%), Gaps = 23/461 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
YKT + +DHF +N +TF+QRYLI D HW K+ I YTGNEGDI WF NT
Sbjct: 72 YKTPL--RXVDHFGFNTD--KTFKQRYLIADKHW--KKDGGSILFYTGNEGDITWFCNNT 125
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+D+A + KA+LVF EHRYYG+S+P+G N ++K+++ +L+S QALAD+A LI
Sbjct: 126 GFMWDMAEELKAMLVFAEHRYYGESLPFGAN---SFKDSTHLNFLTSEQALADFAKLIKH 182
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+ +GALASSAPI F+++V F
Sbjct: 183 LKRTVPGAKNQPVIALGGSYGGMLAAWFRMKYPHMVVGALASSAPIWQFEDLVPCGVFME 242
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC----KSEKNLAIESWL 287
I+T DFR NC + I+ SW I GL L +A R+C S+ ++ W+
Sbjct: 243 IVTADFRKSGPNCSESIRRSWDAIRRLTSTGPGLHWLSEALRLCSPLTNSQDVEHLKDWI 302
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSG 346
+ +V AM DYP SNFL PLPA+P+K +C+ + +P + + ++ A +VYYNYS
Sbjct: 303 TETWVNLAMVDYPYESNFLQPLPAWPIKVVCQYLKNPNVSDPQLLENIFQALNVYYNYSD 362
Query: 347 TAKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
KC +++ ++ L GW QACTEM+M + D +FE + + C + +
Sbjct: 363 QVKCLNMS-ETATRSLGSLGWSYQACTEMVMPFCTNGIDDMFEPHSWNLKEFSDDCFKQW 421
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
GV PRP+WITT +GG I +NIIF NG DPWSGGGV +N++ T++A+ EG
Sbjct: 422 GVRPRPSWITTMYGGKNIS-----SHTNIIFSNGDLDPWSGGGVTENVTDTLVAITIPEG 476
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
AHH+DLR + DP + R EV + +WI ++ L +
Sbjct: 477 AHHLDLRANNAFDPTSVLLARSLEVRYMKQWIKNFYASLRK 517
>gi|114639539|ref|XP_001175149.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 5 [Pan
troglodytes]
gi|397502804|ref|XP_003822033.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Pan
paniscus]
Length = 517
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 196/450 (43%), Positives = 279/450 (62%), Gaps = 18/450 (4%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y T ++DHF +N + +TF QRYL+ D +W KN I YTGNEGDI WF NTGFM+
Sbjct: 73 YKTPLVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMW 128
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
DVA + KA+LVF EHRYYG+S+P+G N ++K++ +L+S QALAD+A LI LK+
Sbjct: 129 DVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLNFLTSEQALADFAELIKHLKRT 185
Query: 177 LT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
+ A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI F+++V F I+T
Sbjct: 186 IPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTT 245
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAFV 292
DFR +C + I+ SW+ I + GL+ L A +C S+ ++ W+S +V
Sbjct: 246 DFRKSGPHCSESIRRSWEAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWV 305
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCF 351
AM DYP SNFL PLPA+P+K +C+ + +P + + + ++ A +VYYNYSG KC
Sbjct: 306 NLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCL 365
Query: 352 DLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
+++ + G W +QACTE++M + D +FE + + C + +GV PRP
Sbjct: 366 NISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGVRPRP 425
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
+WITT +GG I +NI+F NG DPWSGGGV K+I+ T++A+ EGAHH+DL
Sbjct: 426 SWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHLDL 480
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
R DP + R EV + WI ++
Sbjct: 481 RTKNALDPTSVLLARSLEVRHMKNWIRDFY 510
>gi|345322922|ref|XP_001513964.2| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Ornithorhynchus
anatinus]
Length = 742
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 196/453 (43%), Positives = 290/453 (64%), Gaps = 19/453 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+T++ Q +DHF ++ TF+QRYL++D HW ++ I YTGNEGDI WF NT
Sbjct: 46 YETRFFPQKVDHFGFDLDL--TFKQRYLVSDQHW--REDGGSILFYTGNEGDITWFCNNT 101
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM++VA + +A+LVF EHRYYG+S+P+G + ++ ++ YL+S QALAD+A LI
Sbjct: 102 GFMWEVAEELQAMLVFAEHRYYGESLPFG---DQSFSDSKHLNYLTSEQALADFAVLIEH 158
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK + A +SPV+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI F ++V F
Sbjct: 159 LKATIPGAQNSPVISIGGSYGGMLAAWIRMKYPHLVVGALAASAPIWQFGDLVPCGRFFE 218
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKN-LAIESWL 287
I+T DF+ C + I+ SW + + GL+ L F +C +S+++ +A+++W+
Sbjct: 219 IVTNDFKKSGAGCSETIRASWDAVGRISSTEEGLQWLSHTFHLCSPLRSQQDAVALKTWM 278
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSG 346
S+ +V AM DYP ++FL PLPA+PV+ +CK + DPK+ +DV + ++ A +VYYNY+G
Sbjct: 279 SSTWVDLAMVDYPYETDFLQPLPAWPVQAVCKYLKDPKSADDVLLQNIFQALNVYYNYTG 338
Query: 347 TAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
C + + + G+ W +QACTEM+M D + +FE D+ A + C + +G
Sbjct: 339 KTSCLNTSETVTGNLGMQGWSYQACTEMVMPLCTDGINDMFEPQPWDFKALSDECFKLWG 398
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
V PR WI T +GG I SNIIF NG DPWSGGGV +NI+ T++A+V EGA
Sbjct: 399 VRPRLFWIPTVYGGKNIS-----SHSNIIFSNGALDPWSGGGVNENITDTLVAVVIPEGA 453
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
HH+DLR + DP+ + R EV +I +W+++
Sbjct: 454 HHLDLRANNPYDPKSVLQARAAEVHLIKQWVAK 486
>gi|426369948|ref|XP_004051942.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Gorilla
gorilla gorilla]
Length = 496
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 201/471 (42%), Positives = 286/471 (60%), Gaps = 22/471 (4%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P +SL + +K+ Y Y Q +DHF +N + +TF QRYL+ D +W KN I
Sbjct: 35 PTNPTSLPAVAKN----YSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSI 86
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
YTGNEGDI WF NTGFM+DVA + KA+LVF EHRYYG+S+P+G N ++K++
Sbjct: 87 LFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLN 143
Query: 156 YLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
+L+S QALAD+A LI LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+S
Sbjct: 144 FLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAAS 203
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
API F+++V F I+T DFR +C + I+ SW I + GL+ L A +
Sbjct: 204 APIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIRRSWDAINRLSNTGSGLQWLTGALHL 263
Query: 275 CK---SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF 331
C S+ ++ W+S +V AM DYP SNFL PLPA+P+K +C+ + +P + +
Sbjct: 264 CSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLL 323
Query: 332 AK-LYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEES 389
+ ++ A +VYYNYSG KC +++ + G W +QACTE++M + D +FE
Sbjct: 324 LQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPH 383
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
+ + C + +GV PRP+WITT +GG I +NI+F NG DPWSGGGV
Sbjct: 384 SWNLKELSDDCFQQWGVRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVT 438
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
K+I+ T++A+ EGAHH+DLR DP + R EV + WI ++
Sbjct: 439 KDITDTLVAVTISEGAHHLDLRTKNALDPTSVLLARSLEVRHMKNWIRDFY 489
>gi|443709977|gb|ELU04397.1| hypothetical protein CAPTEDRAFT_174591 [Capitella teleta]
Length = 487
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 199/449 (44%), Positives = 286/449 (63%), Gaps = 19/449 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y T Y TQ +DHF + + T++QRYL+ND HW + +PIF YTGNEG I+WF NT
Sbjct: 37 YTTHYITQKVDHFGF--ANDNTYKQRYLLNDQHW---RPGSPIFFYTGNEGAIDWFCNNT 91
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
G M++ AP F A+L+F EHRYYG+S+PYG NK ++ + + YL+S QALAD+ SLI D
Sbjct: 92 GIMWEWAPSFNAMLIFAEHRYYGESLPYG-NK--SFDSPNHLNYLTSEQALADFVSLIAD 148
Query: 173 LKKNLTATD-SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+K+ + AT SPVV FGGSYGGMLAAW R+KYP +GA A+SAPI F ++V F+
Sbjct: 149 VKQRMPATSKSPVVAFGGSYGGMLAAWLRMKYPSAVVGAFAASAPIWEFGDLVPLGGFAV 208
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTA 290
+ T+ + S + NC +I+ SW +++ A G E L A +C K+ A ++SWLS+
Sbjct: 209 VTTKSYASANPNCPIIIRRSWSVMDQLASSVEGREFLASALGLCNPVKSSAEVKSWLSST 268
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSGTAK 349
++ AM +YP +NFL PLPA+PVK +C + D + ++ + A VYYNY+G+A
Sbjct: 269 WINLAMANYPYKANFLEPLPAWPVKAICSHLSDINLDHKELVHAVRHAVDVYYNYTGSAS 328
Query: 350 CFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD 407
C+ +S L + GW Q+CTEM+M D + +FE S + A C++ + +
Sbjct: 329 CYK-TSESATGNLGDQGWDIQSCTEMVMPMSNDGVNDMFEPSPWNMTAVTEDCQKKFKLT 387
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHH 467
PRP+WI ++GG I SNIIF NGL DPWS GGV+++IS++++A+V +GAHH
Sbjct: 388 PRPDWIIRQYGGRNISA-----HSNIIFSNGLLDPWSAGGVMQSISESLVAIVIADGAHH 442
Query: 468 VDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
VDLR S +DP ++ R++E II W+
Sbjct: 443 VDLRSSHPDDPISVQMARKKEKAIIAHWL 471
>gi|426369950|ref|XP_004051943.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Gorilla
gorilla gorilla]
Length = 517
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 196/450 (43%), Positives = 278/450 (61%), Gaps = 18/450 (4%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y T ++DHF +N + +TF QRYL+ D +W KN I YTGNEGDI WF NTGFM+
Sbjct: 73 YKTPLVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMW 128
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
DVA + KA+LVF EHRYYG+S+P+G N ++K++ +L+S QALAD+A LI LK+
Sbjct: 129 DVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLNFLTSEQALADFAELIKHLKRT 185
Query: 177 LT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
+ A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI F+++V F I+T
Sbjct: 186 IPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTT 245
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAFV 292
DFR +C + I+ SW I + GL+ L A +C S+ ++ W+S +V
Sbjct: 246 DFRKSGPHCSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWV 305
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCF 351
AM DYP SNFL PLPA+P+K +C+ + +P + + + ++ A +VYYNYSG KC
Sbjct: 306 NLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCL 365
Query: 352 DLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
+++ + G W +QACTE++M + D +FE + + C + +GV PRP
Sbjct: 366 NISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGVRPRP 425
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
+WITT +GG I +NI+F NG DPWSGGGV K+I+ T++A+ EGAHH+DL
Sbjct: 426 SWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHLDL 480
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
R DP + R EV + WI ++
Sbjct: 481 RTKNALDPTSVLLARSLEVRHMKNWIRDFY 510
>gi|197098906|ref|NP_001125428.1| lysosomal Pro-X carboxypeptidase precursor [Pongo abelii]
gi|68565632|sp|Q5RBU7.1|PCP_PONAB RecName: Full=Lysosomal Pro-X carboxypeptidase; AltName:
Full=Proline carboxypeptidase; AltName:
Full=Prolylcarboxypeptidase; Short=PRCP; Flags:
Precursor
gi|55728021|emb|CAH90763.1| hypothetical protein [Pongo abelii]
Length = 496
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 201/471 (42%), Positives = 285/471 (60%), Gaps = 22/471 (4%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P +SL + +K+ Y Y Q +DHF +N + +TF QRYL+ D +W KN I
Sbjct: 35 PTNPTSLPAVAKN----YSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSI 86
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
YTGNEGDI WF NTGFM+DVA + KA+LVF EHRYYG+S+P+G N +K++
Sbjct: 87 LFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDN---TFKDSRHLN 143
Query: 156 YLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
+L+S QALAD+A LI LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+S
Sbjct: 144 FLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAAS 203
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
API F+++V F I+T DFR +C + I+ SW I + GL+ L A +
Sbjct: 204 APIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIRRSWDAINRLSNTGSGLQWLTGALHL 263
Query: 275 CK---SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF 331
C S+ ++ W+S +V AM DYP SNFL PLPA+P+K +C+ + +P + +
Sbjct: 264 CSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLL 323
Query: 332 AK-LYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEES 389
+ ++ A +VYYNYSG KC +++ + G W +QACTE++M + D +FE
Sbjct: 324 LQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPH 383
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
+ + C + +GV PRP+WITT +GG I +NI+F NG DPWSGGGV
Sbjct: 384 SWNLKELSDDCFQQWGVRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVT 438
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
K+I+ T++A+ EGAHH+DLR DP + R EV + WI ++
Sbjct: 439 KDITDTLVAVTISEGAHHLDLRTKNALDPTSVLLARSLEVRHMKNWIRDFY 489
>gi|4826940|ref|NP_005031.1| lysosomal Pro-X carboxypeptidase isoform 1 preproprotein [Homo
sapiens]
gi|1172047|sp|P42785.1|PCP_HUMAN RecName: Full=Lysosomal Pro-X carboxypeptidase; AltName:
Full=Angiotensinase C; AltName: Full=Lysosomal
carboxypeptidase C; AltName: Full=Proline
carboxypeptidase; AltName: Full=Prolylcarboxypeptidase;
Short=PRCP; Flags: Precursor
gi|431321|gb|AAA99891.1| prolylcarboxypeptidase [Homo sapiens]
gi|16306648|gb|AAH01500.1| Prolylcarboxypeptidase (angiotensinase C) [Homo sapiens]
gi|119595480|gb|EAW75074.1| prolylcarboxypeptidase (angiotensinase C), isoform CRA_a [Homo
sapiens]
gi|119595482|gb|EAW75076.1| prolylcarboxypeptidase (angiotensinase C), isoform CRA_a [Homo
sapiens]
gi|123992784|gb|ABM83994.1| prolylcarboxypeptidase (angiotensinase C) [synthetic construct]
gi|123999550|gb|ABM87320.1| prolylcarboxypeptidase (angiotensinase C) [synthetic construct]
gi|197692241|dbj|BAG70084.1| prolylcarboxypeptidase isoform 1 preproprotein [Homo sapiens]
gi|197692495|dbj|BAG70211.1| prolylcarboxypeptidase isoform 1 preproprotein [Homo sapiens]
Length = 496
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 201/471 (42%), Positives = 285/471 (60%), Gaps = 22/471 (4%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P +SL + +K+ Y Y Q +DHF +N + +TF QRYL+ D +W KN I
Sbjct: 35 PTNPTSLPAVAKN----YSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSI 86
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
YTGNEGDI WF NTGFM+DVA + KA+LVF EHRYYG+S+P+G N ++K++
Sbjct: 87 LFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLN 143
Query: 156 YLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
+L+S QALAD+A LI LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+S
Sbjct: 144 FLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAAS 203
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
API F+++V F I+T DFR +C + I SW I + GL+ L A +
Sbjct: 204 APIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHL 263
Query: 275 CK---SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF 331
C S+ ++ W+S +V AM DYP SNFL PLPA+P+K +C+ + +P + +
Sbjct: 264 CSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLL 323
Query: 332 AK-LYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEES 389
+ ++ A +VYYNYSG KC +++ + G W +QACTE++M + D +FE
Sbjct: 324 LQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPH 383
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
+ + C + +GV PRP+WITT +GG I +NI+F NG DPWSGGGV
Sbjct: 384 SWNLKELSDDCFQQWGVRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVT 438
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
K+I+ T++A+ EGAHH+DLR DP + R EV + WI ++
Sbjct: 439 KDITDTLVAVTISEGAHHLDLRTKNALDPMSVLLARSLEVRHMKNWIRDFY 489
>gi|297812427|ref|XP_002874097.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297319934|gb|EFH50356.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 501
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 216/505 (42%), Positives = 300/505 (59%), Gaps = 29/505 (5%)
Query: 10 FCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLI-------SSSKDSQGLYKTKYHTQIL 62
LL SST T F +F S+I +SS + S+ K + K Y Q L
Sbjct: 8 LLLLISSTST----SYFISFAHSKIARLGISSKMLKNAPDGSTQKIDESDLKMYYFNQTL 63
Query: 63 DHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKF 122
DHF + P+SY TFQQRY IN +HWGG+K NAPI + G E ++ GF+ D P
Sbjct: 64 DHFTFTPKSYMTFQQRYAINSSHWGGAKANAPILAFLGEESSLDSDLSGIGFLRDNGPHL 123
Query: 123 KALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDS 182
KALLV+IEHRYYGK++P+G +E A KNAST GYL++ QALADYA++++ +K+ + S
Sbjct: 124 KALLVYIEHRYYGKTMPFGSAEE-ALKNASTLGYLNAAQALADYAAILLHVKEKYSTKHS 182
Query: 183 PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSE 242
P++V GGSYGGMLAAWFRLKYPH+A+GALASSAP+L F++ + I+T+ F+ SE
Sbjct: 183 PIIVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDTRPKIGYYYIVTKVFKETSE 242
Query: 243 NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPT 301
CY I+ SWK+I+ A KP GL L K F+ C + I+ +L T +Y Y
Sbjct: 243 RCYTRIRKSWKEIDRVAVKPNGLSILSKTFKTCAPLNGSFNIKDFLDT--IYAEAVQY-- 298
Query: 302 PSNFLNPLPAFPVKEMCKAI--DDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDP 359
N P + V +C AI + P + + +++ G C+D N S P
Sbjct: 299 -----NRGPTYWVANVCNAINANTPNRKDTLLDRIFAGVVALI---GNRTCYDTNMFSQP 350
Query: 360 -HGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFG 418
+ W WQ+C+E+++ G D +D++F + + + C+ YGV PRP+WITT FG
Sbjct: 351 TNNHIAWRWQSCSEIVIPVGYDKQDTMFPTAPFNMTSYIDGCESYYGVPPRPHWITTYFG 410
Query: 419 GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDP 478
+ L+L+RF SNIIF NGL DP+S GGVL++IS TV+A + G+H D+ KEDP
Sbjct: 411 IQDVKLILRRFGSNIIFSNGLSDPYSVGGVLEDISDTVVA-ITTNGSHCQDISLKNKEDP 469
Query: 479 QWLKDVRRREVEIIGKWISQYFQDL 503
QWL R +E+++I WIS Y DL
Sbjct: 470 QWLVMQREKEIKVIDSWISTYQNDL 494
>gi|224058951|ref|XP_002299660.1| predicted protein [Populus trichocarpa]
gi|222846918|gb|EEE84465.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 220/453 (48%), Positives = 297/453 (65%), Gaps = 17/453 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++T +H Q LDHFNY P+SY F QRYLIN +WGG+ +API VY G E I+
Sbjct: 56 FETFFHNQTLDHFNYRPESYDKFPQRYLINSKYWGGANVSAPILVYLGAEEPIDEDLAAV 115
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ D A +F +LLVFIEHRYYGKSIP+G +E A K+AS GY +S QA+ADYA++II
Sbjct: 116 GFLVDNAVQFNSLLVFIEHRYYGKSIPFGSREE-ALKDASKLGYFNSAQAIADYAAIIIH 174
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+K+ L A SPV+V GGSYGGMLA+WFRLKYPH+A+GALASSAPIL FD+I + +I
Sbjct: 175 IKETLRAQYSPVIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDDITPQDGYYSI 234
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAF 291
+T+DFR SE CY+ IK SW +I+E A KP GL L K F+ C + + ++ L T +
Sbjct: 235 VTKDFREASETCYQTIKTSWSEIDELASKPDGLSMLSKKFKTCTPLADASELKDHLDTMY 294
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
A + P P +PV E+CK ID G+D+ ++++G Y G C+
Sbjct: 295 ASAAQYNRP---------PTYPVNEVCKGIDGGGFGDDILSRIFGGLVA---YKGNLSCY 342
Query: 352 DLNGDSDPHGLS-EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
+N +DP + W WQ C+EM + G N +S+F D + CK YGV RP
Sbjct: 343 -VNAHTDPSETTVGWRWQTCSEMAIPIGVGN-NSMFPPDPFDLEDYIENCKSLYGVPTRP 400
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
+WITT +GGH I L+L+RFASNIIF NGLRDP+S GGVL+NIS TV+A+ G+H +D+
Sbjct: 401 HWITTYYGGHSIKLILQRFASNIIFSNGLRDPYSSGGVLENISDTVVAVKTVNGSHCLDI 460
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
F+ + DP+WL R+ E++II +WI++Y+ DL
Sbjct: 461 LFAKETDPEWLVTQRKIEIKIIKEWINKYYVDL 493
>gi|33469015|ref|NP_082519.1| lysosomal Pro-X carboxypeptidase precursor [Mus musculus]
gi|68565642|sp|Q7TMR0.2|PCP_MOUSE RecName: Full=Lysosomal Pro-X carboxypeptidase; AltName:
Full=Proline carboxypeptidase; AltName:
Full=Prolylcarboxypeptidase; Short=PRCP; Flags:
Precursor
gi|26342108|dbj|BAC34716.1| unnamed protein product [Mus musculus]
gi|74193542|dbj|BAE20697.1| unnamed protein product [Mus musculus]
Length = 491
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 198/457 (43%), Positives = 279/457 (61%), Gaps = 18/457 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q +DHF + +TF+QRYL+ D HW +N I YTGNEGDI WF NT
Sbjct: 46 YSVLYFEQKVDHFGF--ADMRTFKQRYLVADKHW--QRNGGSILFYTGNEGDIVWFCNNT 101
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + KA+LVF EHRYYG+S+P+G + ++K++ +L+S QALAD+A LI
Sbjct: 102 GFMWDVAEELKAMLVFAEHRYYGESLPFGQD---SFKDSQHLNFLTSEQALADFAELIRH 158
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
L+K + A PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI D +V F
Sbjct: 159 LEKTIPGAQGQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWQLDGMVPCGEFMK 218
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLS 288
I+T DFR C + I+ SW I++ + GL+ L +C SEK ++ W++
Sbjct: 219 IVTNDFRKSGPYCSESIRKSWNVIDKLSGSGSGLQSLTNILHLCSPLTSEKIPTLKGWIA 278
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGT 347
+V AM +YP NFL PLPA+P+KE+C+ + +P + V + ++ A SVYYNYSG
Sbjct: 279 ETWVNLAMVNYPYACNFLQPLPAWPIKEVCQYLKNPNVSDTVLLQNIFQALSVYYNYSGQ 338
Query: 348 AKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
A C +++ + G W +QACTEM+M + D +FE D + + C +GV
Sbjct: 339 AACLNISQTTTSSLGSMGWSFQACTEMVMPFCTNGIDDMFEPFLWDLEKYSNDCFNQWGV 398
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PRP+W+TT +GG I SNIIF NG DPWSGGGV ++I+ T++A+ +GAH
Sbjct: 399 KPRPHWMTTMYGGKNIS-----SHSNIIFSNGELDPWSGGGVTRDITDTLVAINIHDGAH 453
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
H+DLR DP + R EV+ + KWI ++ ++
Sbjct: 454 HLDLRAHNAFDPSSVLLSRLLEVKHMKKWILDFYSNI 490
>gi|20072291|gb|AAH26424.1| Prcp protein, partial [Mus musculus]
Length = 451
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 198/457 (43%), Positives = 279/457 (61%), Gaps = 18/457 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q +DHF + +TF+QRYL+ D HW +N I YTGNEGDI WF NT
Sbjct: 6 YSVLYFEQKVDHFGF--ADMRTFKQRYLVADKHW--QRNGGSILFYTGNEGDIVWFCNNT 61
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + KA+LVF EHRYYG+S+P+G + ++K++ +L+S QALAD+A LI
Sbjct: 62 GFMWDVAEELKAMLVFAEHRYYGESLPFGQD---SFKDSQHLNFLTSEQALADFAELIRH 118
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
L+K + A PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI D +V F
Sbjct: 119 LEKTIPGAQGQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWQLDGMVPCGEFMK 178
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLS 288
I+T DFR C + I+ SW I++ + GL+ L +C SEK ++ W++
Sbjct: 179 IVTNDFRKSGPYCSESIRKSWNVIDKLSGSGSGLQSLTNILHLCSPLTSEKIPTLKGWIA 238
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGT 347
+V AM +YP NFL PLPA+P+KE+C+ + +P + V + ++ A SVYYNYSG
Sbjct: 239 ETWVNLAMVNYPYACNFLQPLPAWPIKEVCQYLKNPNVSDTVLLQNIFQALSVYYNYSGQ 298
Query: 348 AKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
A C +++ + G W +QACTEM+M + D +FE D + + C +GV
Sbjct: 299 AACLNISQTTTSSLGSMGWSFQACTEMVMPFCTNGIDDMFEPFLWDLEKYSNDCFNQWGV 358
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PRP+W+TT +GG I SNIIF NG DPWSGGGV ++I+ T++A+ +GAH
Sbjct: 359 KPRPHWMTTMYGGKNISS-----HSNIIFSNGELDPWSGGGVTRDITDTLVAINIHDGAH 413
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
H+DLR DP + R EV+ + KWI ++ ++
Sbjct: 414 HLDLRAHNAFDPSSVLLSRLLEVKHMKKWILDFYSNI 450
>gi|32967631|gb|AAH55022.1| Prcp protein [Mus musculus]
gi|74147590|dbj|BAE38679.1| unnamed protein product [Mus musculus]
Length = 491
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 198/457 (43%), Positives = 279/457 (61%), Gaps = 18/457 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q +DHF + +TF+QRYL+ D HW +N I YTGNEGDI WF NT
Sbjct: 46 YSVLYFEQKVDHFGF--ADMRTFKQRYLVADKHW--QRNGGSILFYTGNEGDIVWFCNNT 101
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + KA+LVF EHRYYG+S+P+G + ++K++ +L+S QALAD+A LI
Sbjct: 102 GFMWDVAEELKAMLVFAEHRYYGESLPFGQD---SFKDSQHLNFLTSEQALADFAELIRH 158
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
L+K + A PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI D +V F
Sbjct: 159 LEKTIPGAQGQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWQLDGMVPCGEFMK 218
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLS 288
I+T DFR C + I+ SW I++ + GL+ L +C SEK ++ W++
Sbjct: 219 IVTNDFRKSGPYCSESIRKSWNVIDKLSGSGSGLQSLTNILHLCSPLTSEKIPTLKGWIA 278
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGT 347
+V AM +YP NFL PLPA+P+KE+C+ + +P + V + ++ A SVYYNYSG
Sbjct: 279 ETWVNLAMVNYPYACNFLQPLPAWPIKEVCQYLKNPNVSDTVLLQNIFQALSVYYNYSGQ 338
Query: 348 AKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
A C +++ + G W +QACTEM+M + D +FE D + + C +GV
Sbjct: 339 AACLNISQTTTSSLGSMGWSFQACTEMVMPFCTNGIDDMFEPFLWDLEKYSNDCFNQWGV 398
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PRP+W+TT +GG I SNIIF NG DPWSGGGV ++I+ T++A+ +GAH
Sbjct: 399 KPRPHWMTTMYGGKNIS-----SHSNIIFSNGKLDPWSGGGVTRDITDTLVAINIPDGAH 453
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
H+DLR DP + R EV+ + KWI ++ ++
Sbjct: 454 HLDLRAHNAFDPSSVLLSRLLEVKHMKKWILDFYSNI 490
>gi|189066514|dbj|BAG35764.1| unnamed protein product [Homo sapiens]
Length = 496
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 201/471 (42%), Positives = 284/471 (60%), Gaps = 22/471 (4%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P +SL + +K+ Y Y Q +DHF +N + +TF QRYL+ D +W KN I
Sbjct: 35 PTNPTSLPAVAKN----YSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSI 86
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
YTGNEGDI WF NTGFM+DVA KA+LVF EHRYYG+S+P+G N ++K++
Sbjct: 87 LFYTGNEGDIIWFCNNTGFMWDVAEDLKAMLVFAEHRYYGESLPFGDN---SFKDSRHLN 143
Query: 156 YLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
+L+S QALAD+A LI LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+S
Sbjct: 144 FLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAAS 203
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
API F+++V F I+T DFR +C + I SW I + GL+ L A +
Sbjct: 204 APIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHL 263
Query: 275 CK---SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF 331
C S+ ++ W+S +V AM DYP SNFL PLPA+P+K +C+ + +P + +
Sbjct: 264 CSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLL 323
Query: 332 AK-LYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEES 389
+ ++ A +VYYNYSG KC +++ + G W +QACTE++M + D +FE
Sbjct: 324 LQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPH 383
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
+ + C + +GV PRP+WITT +GG I +NI+F NG DPWSGGGV
Sbjct: 384 SWNLKELSDDCFQQWGVRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVT 438
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
K+I+ T++A+ EGAHH+DLR DP + R EV + WI ++
Sbjct: 439 KDITDTLVAVTISEGAHHLDLRTKNALDPMSVLLARSLEVRHMKNWIRDFY 489
>gi|117306169|ref|NP_955450.2| lysosomal Pro-X carboxypeptidase isoform 2 preproprotein [Homo
sapiens]
gi|119595481|gb|EAW75075.1| prolylcarboxypeptidase (angiotensinase C), isoform CRA_b [Homo
sapiens]
Length = 517
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 196/450 (43%), Positives = 277/450 (61%), Gaps = 18/450 (4%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y T ++DHF +N + +TF QRYL+ D +W KN I YTGNEGDI WF NTGFM+
Sbjct: 73 YKTPLVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMW 128
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
DVA + KA+LVF EHRYYG+S+P+G N ++K++ +L+S QALAD+A LI LK+
Sbjct: 129 DVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLNFLTSEQALADFAELIKHLKRT 185
Query: 177 LT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
+ A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI F+++V F I+T
Sbjct: 186 IPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTT 245
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAFV 292
DFR +C + I SW I + GL+ L A +C S+ ++ W+S +V
Sbjct: 246 DFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWV 305
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCF 351
AM DYP SNFL PLPA+P+K +C+ + +P + + + ++ A +VYYNYSG KC
Sbjct: 306 NLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCL 365
Query: 352 DLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
+++ + G W +QACTE++M + D +FE + + C + +GV PRP
Sbjct: 366 NISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGVRPRP 425
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
+WITT +GG I +NI+F NG DPWSGGGV K+I+ T++A+ EGAHH+DL
Sbjct: 426 SWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHLDL 480
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
R DP + R EV + WI ++
Sbjct: 481 RTKNALDPMSVLLARSLEVRHMKNWIRDFY 510
>gi|426251525|ref|XP_004019472.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Ovis aries]
Length = 517
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 199/456 (43%), Positives = 283/456 (62%), Gaps = 19/456 (4%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
+ T ++DHF +N +TF+QRYLI D +W ++ I YTGNEGDI WF NTGFM+
Sbjct: 72 HKTPLVDHFGFNID--RTFKQRYLIADNYW--KEDGGSILFYTGNEGDIIWFCNNTGFMW 127
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
D+A + KA+LVF EHRYYG+S+P+G + ++ ++ +L++ QALAD+A LI LK+
Sbjct: 128 DIAEEMKAMLVFAEHRYYGESLPFGAD---SFSDSRHLNFLTTEQALADFAKLIKYLKRT 184
Query: 177 LT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
+ A + PV+ GGSYGGMLAAWFR+KYPH+ +GALASSAPI F+++V F I+T
Sbjct: 185 IPGARNQPVIALGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFNDLVPCDVFMKIVTT 244
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAF 291
DF NC + I+ SW I AKK GL L +A +C L ++ W++ +
Sbjct: 245 DFSQSGPNCSESIRRSWDAINRLAKKGPGLRWLSEALHLCTPLTKLQDVQHLKDWIAETW 304
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKC 350
V AM DYP SNFL PLPA+PVK +C+ + + + ++ A +VYYNYSG AKC
Sbjct: 305 VNLAMVDYPYESNFLQPLPAWPVKVVCQYFKYSNVPDTLLLRNIFQALNVYYNYSGQAKC 364
Query: 351 FDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR 409
+++ + G+ W +QACTEM+M T D D +FE + + C + +GV PR
Sbjct: 365 LNVSETATSSLGVQGWSYQACTEMVMPTCSDGVDDMFEPHSWNMKEYSDDCFKQWGVRPR 424
Query: 410 PNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVD 469
P+WI T +GG I +NIIF NG DPWSGGGV K+I+ T+LA+V GAHH+D
Sbjct: 425 PSWIPTMYGGKNIS-----SHTNIIFSNGELDPWSGGGVTKDITDTLLAIVIPNGAHHLD 479
Query: 470 LRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
LR S DP ++ R EV+ + +W++ ++ L +
Sbjct: 480 LRASNALDPVSVQLTRSLEVKYMKQWVTDFYVRLRK 515
>gi|321468146|gb|EFX79132.1| hypothetical protein DAPPUDRAFT_225197 [Daphnia pulex]
Length = 477
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 199/458 (43%), Positives = 287/458 (62%), Gaps = 21/458 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+KT + Q +DHF++ Q T+ QRYL+N T+W + PIF YTGNEGDIEWFAQNT
Sbjct: 22 WKTFFFKQQVDHFSFANQ--DTYPQRYLVNSTYW--KRGGGPIFFYTGNEGDIEWFAQNT 77
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+D+A +F A+LVF EHRYYG+S+PYG NK +Y +A GYL+S QALAD+A L+
Sbjct: 78 GFMWDIAEEFGAMLVFAEHRYYGQSLPYG-NK--SYSDAKYLGYLTSEQALADFAELVAY 134
Query: 173 LKK-NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+K N A DSPV+ FGGSYGGML+AW R+KYPH+ G++A+SAPIL F + +F+
Sbjct: 135 IKSTNSGAVDSPVIAFGGSYGGMLSAWMRIKYPHIITGSIAASAPILQFTGLTPCDAFNR 194
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC----KSEKNLAIESWL 287
++T DF S S C + I+ SWK + + G E L+ + +C ++ + ++ WL
Sbjct: 195 VVTADFASASTECSETIRKSWKSLSSILSQDDGKEWLRTHWNLCVPLNGTDDVVNLKDWL 254
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSG 346
+ + AM +YP +NFL PLPA+PVK +C+ + + + + +L+ SVY N++G
Sbjct: 255 TNVWTNLAMVNYPYAANFLAPLPAYPVKAVCEHLTNSSLDDHSLLDELFKGLSVYANFTG 314
Query: 347 TAKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
KC D++ +D L + GW QACTEM+M GD + +FE + + + C + +
Sbjct: 315 QTKCLDVSQQAD-QSLGDMGWDFQACTEMVMPMCGDGINDMFEAQPWNIEKYSERCLKKW 373
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
V+PRP +GG I +SNI+F NGL DPWS GGV K++S +++A++ EG
Sbjct: 374 KVNPRPLMAPLIYGGKNISS-----SSNIVFSNGLLDPWSTGGVTKSLSDSIVAIIIPEG 428
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
AHH+DLR + DP + R E + IGKWIS +D
Sbjct: 429 AHHLDLRAADPNDPPSVVKAREIEKQFIGKWISSVKKD 466
>gi|190702395|gb|ACE75287.1| prolylcarboxypeptidase [Glyptapanteles flavicoxis]
Length = 497
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 201/476 (42%), Positives = 289/476 (60%), Gaps = 21/476 (4%)
Query: 37 EKLSSLISSSKDSQGLYK--TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAP 94
+ +++ I ++ S YK TK +DHF+++ + TF+ RY +NDT W KN AP
Sbjct: 26 QVITATIETTNSSSSPYKYVTKKFIVPVDHFSFSLNN--TFEMRYFVNDT-WKSGKN-AP 81
Query: 95 IFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT 154
IF YTGNEG +E FA NTGFM+D+AP F AL+VF EHRYYG+S+P+G ++ N
Sbjct: 82 IFFYTGNEGVLETFAANTGFMWDIAPTFGALIVFAEHRYYGESMPFGNK---SFDNVKNL 138
Query: 155 GYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
GYL+S QALADY LI+ LK + + SPV+ FGGSYGGML+AWFR+KYPH+ GA+A+S
Sbjct: 139 GYLTSQQALADYVDLIVHLKSDPSLKHSPVIAFGGSYGGMLSAWFRMKYPHIINGAIAAS 198
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
AP+L F +FS I+T DFR+V NC KVI+ SW+ I+ G + + F++
Sbjct: 199 APVLQFTGYTDCQAFSRIVTSDFRAVHPNCEKVIRKSWESIKNLTSTDDGKKWISTKFKV 258
Query: 275 CKS---EKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK--TGN 328
C S E+N + +L + + AM +YP ++FL+PLPAFP++E CK ++D +
Sbjct: 259 CGSLTTEENFKTFKDFLLSVYSNLAMVNYPYATDFLSPLPAFPIREFCKFVNDTNLSSDK 318
Query: 329 DVFAKLYGAASVYYNYSGTAKCFDL-NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFE 387
DV L ++Y NY+G KC DL + + D + W +QACTEM+M D + +FE
Sbjct: 319 DVMTNLQKGINLYSNYTGRLKCLDLSDPEPDLGAANGWNYQACTEMVMPMCNDGVNDMFE 378
Query: 388 ESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
++ ++ C + V P+P + +G + ASNI+F NGL DPWS GG
Sbjct: 379 PEPWNFTKYSQDCFTHFNVTPKPELVCDMYGCDDLST-----ASNIVFSNGLLDPWSSGG 433
Query: 448 VLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
VL+N+S + +A++ EGAHH+DLR S DP + R I KWI ++ +DL
Sbjct: 434 VLRNLSSSAVAIIIPEGAHHLDLRGSHPSDPYSVVKAREYHAYSIKKWIREHREDL 489
>gi|73988045|ref|XP_533994.2| PREDICTED: lysosomal Pro-X carboxypeptidase [Canis lupus
familiaris]
Length = 497
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 204/486 (41%), Positives = 288/486 (59%), Gaps = 20/486 (4%)
Query: 26 FPTFPSSR-ITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDT 84
P P+ + ++ + S+ +S + Y Y Q +DHF + +TF+QRYLI D
Sbjct: 20 IPVPPAVKALSSLRWSTSFTSRHFTGAKYSVHYILQKVDHFGFAVD--KTFKQRYLIADE 77
Query: 85 HWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNK 144
HW K+ I YTGNEGDI WF NTGFM+DVA + KA+LVF EHRYYG+S+P+G N
Sbjct: 78 HW--KKDGGSILFYTGNEGDIIWFCNNTGFMWDVAEEMKAMLVFAEHRYYGESLPFGNN- 134
Query: 145 EIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKY 203
++K++ YL+S QALAD+A LI LK+ + A + PV+ GGSYGGMLAAWFR+KY
Sbjct: 135 --SFKDSRHLNYLTSEQALADFAMLIKHLKRTIPGAKNQPVIAIGGSYGGMLAAWFRMKY 192
Query: 204 PHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPG 263
PH+ +GALA+SAPI F N+VS F I+T DF+ NC + I+ SW I ++
Sbjct: 193 PHMVVGALAASAPIWQFGNLVSCGVFMEIVTTDFKKSGPNCSESIRRSWDAINRYSRTGA 252
Query: 264 GLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK 319
GL L +A +C N ++ W+S ++ AM DYP S+FL PLP +P+K +C+
Sbjct: 253 GLGWLSEALHLCTPLTNTQDVQHLKIWISETWINMAMVDYPYESDFLQPLPPWPIKVVCQ 312
Query: 320 AIDDPKTGN-DVFAKLYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLT 377
+ +P + + ++ A +VYYNYSG A C +++ + G W +QACTEM+M
Sbjct: 313 YLRNPNVPDAQLLQNIFQALNVYYNYSGQASCLNVSETTTSSLGTQGWSYQACTEMVMPF 372
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFN 437
+ D +FE + + C + +GV PRP WI T +GG I +NIIF N
Sbjct: 373 CTNGIDDMFEPHSWNLREFSDDCFKQWGVRPRPAWIITTYGGKNISA-----HTNIIFSN 427
Query: 438 GLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G DPWSGGGV K+I+ T++A+ EGAHH+DLR DP + R EV + +WI
Sbjct: 428 GELDPWSGGGVTKDITDTLVAITIPEGAHHLDLRARNAFDPTAVLLARSLEVRHMKQWIK 487
Query: 498 QYFQDL 503
Y+ +L
Sbjct: 488 DYYANL 493
>gi|300193275|pdb|3N2Z|B Chain B, The Structure Of Human Prolylcarboxypeptidase At 2.80
Angstroms Resolution
Length = 446
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 197/454 (43%), Positives = 277/454 (61%), Gaps = 18/454 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q +DHF +N + +TF QRYL+ D +W KN I YTGNEGDI WF NT
Sbjct: 3 YSVLYFQQKVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNT 58
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + KA+LVF EHRYYG+S+P+G N ++K++ +L+S QALAD+A LI
Sbjct: 59 GFMWDVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLNFLTSEQALADFAELIKH 115
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK+ + A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI F+++V F
Sbjct: 116 LKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMK 175
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLS 288
I+T DFR +C + I SW I + GL+ L A +C S+ ++ W+S
Sbjct: 176 IVTTDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWIS 235
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGT 347
+V AM DYP SNFL PLPA+P+K +C+ + +P + + + ++ A +VYYNYSG
Sbjct: 236 ETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQ 295
Query: 348 AKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
KC +++ + G W +QACTE++M + D +FE + + C + +GV
Sbjct: 296 VKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGV 355
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PRP+WITT +GG I +NI+F NG DPWSGGGV K+I+ T++A+ EGAH
Sbjct: 356 RPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAH 410
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
H+DLR DP + R EV + WI ++
Sbjct: 411 HLDLRTKNALDPMSVLLARSLEVRHMKNWIRDFY 444
>gi|21537161|gb|AAM61502.1| prolylcarboxypeptidase-like protein [Arabidopsis thaliana]
Length = 502
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 200/464 (43%), Positives = 289/464 (62%), Gaps = 17/464 (3%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
S+ K + K Y Q LDHF + P+SY TFQQRY I+ THWGG+K NAPI + G E
Sbjct: 45 STQKVDESDLKMYYFNQTLDHFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEES 104
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
++ GF+ D P+ ALLV+IEHRYYG+++P+G +E A KNAST GYL++ QAL
Sbjct: 105 SLDSDLAAIGFLRDNGPRLNALLVYIEHRYYGETMPFGSAEE-ALKNASTLGYLNAAQAL 163
Query: 164 ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
ADYA++++ +K+ + SP++V GGSYGGMLAAWFRLKYPH+A+GALASSAP+L F++
Sbjct: 164 ADYAAILLHVKEKYSTNHSPIIVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDT 223
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLA 282
+ + I+T+ F+ SE CY I+ SWK+I+ A KP GL L K F+ C +
Sbjct: 224 RPKFGYYYIVTKVFKEASERCYNTIRNSWKEIDRVAGKPNGLSILSKQFKTCDPLNGSFD 283
Query: 283 IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI--DDPKTGNDVFAKLYGAASV 340
I+ +L T +Y Y N P F V ++C AI + P ++ +++
Sbjct: 284 IKDFLDT--IYAEAVQY-------NRGPNFWVAKVCNAINANPPNRRYNLLDRIFAGVVA 334
Query: 341 YYNYSGTAKCFDLNGDSDPHGLS-EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
G C+D + P + W WQ+C+E++M G D +D++F + + +
Sbjct: 335 LV---GNRTCYDTKMFAQPTNNNIAWRWQSCSEIVMPVGYDKQDTMFPTAPFNMTSYIDG 391
Query: 400 CKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLAL 459
CK +GV PRP+WITT FG ++ L+L++F SNIIF NGL DP+S GGVL++IS T++A+
Sbjct: 392 CKSYHGVTPRPHWITTYFGIQEVKLILQKFGSNIIFSNGLSDPYSVGGVLEDISDTLVAI 451
Query: 460 VEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
K G+H +D+ +KEDP+WL R +E+++I WIS Y DL
Sbjct: 452 TTKNGSHCLDITLKSKEDPEWLVIQREKEIKVIDSWISTYQNDL 495
>gi|47209068|emb|CAF90249.1| unnamed protein product [Tetraodon nigroviridis]
Length = 487
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 203/454 (44%), Positives = 271/454 (59%), Gaps = 20/454 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
YKT Y Q +DHF + TF+QRYL+ND HW + PIF YTGNEGDI WF NT
Sbjct: 46 YKTLYFDQKIDHFGFLEDG--TFKQRYLVNDKHW--QQPGGPIFFYTGNEGDITWFCNNT 101
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+++A +F A+LVF EHRYYG+S+P+G + +Y + YL+S QALAD+A LI +
Sbjct: 102 GFMWEIAEEFGAMLVFAEHRYYGESLPFGHD---SYSDNKHLNYLTSEQALADFAVLIQN 158
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK L A +SPV+ GGSYGGMLAAWFR+KYPH+ +GALASSAPI F +V F
Sbjct: 159 LKSTLPGAQNSPVIAIGGSYGGMLAAWFRMKYPHMVVGALASSAPIWQFPGMVPCGDFYK 218
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN----LAIESWL 287
I+TQDF NC K I SWK IE + GL+ L + F +C KN +SWL
Sbjct: 219 IVTQDFARSGSNCDKNIGMSWKAIENVSSTASGLQWLSEEFGLCTPLKNRNDVAGFKSWL 278
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND--VFAKLYGAASVYYNYS 345
+V AM DYP +NFL PLPA+P++ +CK + + D + + A VYYNY+
Sbjct: 279 QETWVNLAMVDYPYEANFLQPLPAWPIQVVCKYLSFSHSVPDDHLLHGVSQATKVYYNYT 338
Query: 346 GTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
G++ C + + + G W +QACTEM+M D +FE E ++ A + C +
Sbjct: 339 GSSPCLNTSQTATGSLGFIGWFYQACTEMVMPMCTDGVRDMFEPEEWNFQAFSDECNAMF 398
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
G PR W +GG I SNIIF NG DPWS GGV NI++++++++ +G
Sbjct: 399 GARPRAEWARAVYGGKDIAA-----HSNIIFSNGGLDPWSAGGVTHNITESLVSILIPDG 453
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
AHH+DLR+S DP ++ R EV KWI Q
Sbjct: 454 AHHLDLRYSNDRDPPSVRAARALEVNYFRKWIKQ 487
>gi|301780038|ref|XP_002925438.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Ailuropoda
melanoleuca]
Length = 520
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 198/449 (44%), Positives = 277/449 (61%), Gaps = 19/449 (4%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF + +TF+QRYLI D HW KN I YTGNEGDI WF NTGFM+DVA +
Sbjct: 80 VDHFGFTAD--ETFKQRYLIADEHW--KKNGGSILFYTGNEGDITWFCNNTGFMWDVADQ 135
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-AT 180
KA+LVF EHRYYG+S+P+G NK ++K++ +L+S QALAD+A LI LK+ + A
Sbjct: 136 LKAMLVFAEHRYYGESLPFG-NK--SFKDSRHLNFLTSEQALADFAVLIKHLKRTIPGAK 192
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
+ PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI +F N+V F I+T+DF+
Sbjct: 193 NQPVIAVGGSYGGMLAAWFRMKYPHMVVGALAASAPIWHFGNLVPCGVFMEIVTKDFKRS 252
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAM 296
NC + I+ SW I A+ GL L +A +C + N +++W+S ++ AM
Sbjct: 253 GPNCSETIRSSWDAINRFARTGAGLRWLSEALGLCTALTNTEDVQRLKAWISETWINLAM 312
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSGTAKCFDLNG 355
DYP S+FL PLPA+P+K +C+ + +P + + ++ A +VYYNYSG A C +++
Sbjct: 313 VDYPYESDFLQPLPAWPIKVVCQYLKNPSVSDAQLLQNIFQALNVYYNYSGQASCLNISE 372
Query: 356 DSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWIT 414
+ G W +QACTEM+M + D +FE + + C + +GV PRP WI
Sbjct: 373 TTTSSLGTQGWSYQACTEMVMPFCTNGIDDMFEPHLWNLGEFSDDCFKQWGVRPRPAWII 432
Query: 415 TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFST 474
T +GG I +NIIF NG DPWSGGGV K+I+ T++A+ EGAHH+DLR
Sbjct: 433 TMYGGKNIS-----SHTNIIFSNGELDPWSGGGVTKDITDTLVAITIPEGAHHLDLRARN 487
Query: 475 KEDPQWLKDVRRREVEIIGKWISQYFQDL 503
DP + R EV + +WI ++ L
Sbjct: 488 AFDPTTVLLARSLEVRHMKQWIRDFYASL 516
>gi|320166581|gb|EFW43480.1| prolylcarboxypeptidase [Capsaspora owczarzaki ATCC 30864]
Length = 496
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 198/451 (43%), Positives = 274/451 (60%), Gaps = 19/451 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y K+ TQ LDHF ++ + TFQQRYLI+ +W G P+F YTGNEGDIEWFA NT
Sbjct: 60 YDVKWFTQTLDHFRFDTNA--TFQQRYLISTANWNGY---GPMFFYTGNEGDIEWFADNT 114
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF++++A ++ AL+VF EHRYYG+++P+G +K + GYL++ QALAD+A LI
Sbjct: 115 GFVWEIAAEYNALVVFAEHRYYGQTMPFG-DKSF---DLDKVGYLTTEQALADFAILIPA 170
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK L + PVV FGGSYGGMLA WFRLKYP+V GA+A+SAPI+ F ++ S F+ I
Sbjct: 171 LKAQLNVPNLPVVAFGGSYGGMLAGWFRLKYPNVVDGAIAASAPIVYFQDLTSTEIFNEI 230
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES---WLST 289
T DF C +I+ + +++ +K GL+ + KAF++C + + + WL
Sbjct: 231 ATNDFALTDARCPNIIRDGFSKVDALSKTAAGLQSISKAFKLCGTLQPADYATFIGWLEA 290
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG-NDVFAKLYGAASVYYNYSGTA 348
Y AMTDYP SNFL P+PA+PV CKA+ G +D A L A VYYNY+G
Sbjct: 291 GLTYMAMTDYPYASNFLQPMPAWPVDASCKALLSTFAGTSDSVAALNTAVGVYYNYTGQT 350
Query: 349 KCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD 407
C +++ + G+ W +Q+CTEM+M G D +F + D + YCK+ + V
Sbjct: 351 ACNNISSQATSDLGVLGWDYQSCTEMVMPMGSDGIHDMFPAAPWDLKSFNEYCKKRWNVV 410
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHH 467
PRP W T FGG I SNI+F NG+ DPWSGG + + S+T++ + +GAHH
Sbjct: 411 PRPTWAATSFGGFNITA-----GSNIVFSNGMLDPWSGGSITQIQSQTLVVVNIPKGAHH 465
Query: 468 VDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+DLR S DPQ + D R E I +WIS+
Sbjct: 466 LDLRSSNPADPQDVIDARNVERAQISRWISR 496
>gi|281339633|gb|EFB15217.1| hypothetical protein PANDA_014943 [Ailuropoda melanoleuca]
Length = 441
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 198/449 (44%), Positives = 277/449 (61%), Gaps = 19/449 (4%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF + +TF+QRYLI D HW KN I YTGNEGDI WF NTGFM+DVA +
Sbjct: 1 VDHFGFTAD--ETFKQRYLIADEHW--KKNGGSILFYTGNEGDITWFCNNTGFMWDVADQ 56
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-AT 180
KA+LVF EHRYYG+S+P+G NK ++K++ +L+S QALAD+A LI LK+ + A
Sbjct: 57 LKAMLVFAEHRYYGESLPFG-NK--SFKDSRHLNFLTSEQALADFAVLIKHLKRTIPGAK 113
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
+ PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI +F N+V F I+T+DF+
Sbjct: 114 NQPVIAVGGSYGGMLAAWFRMKYPHMVVGALAASAPIWHFGNLVPCGVFMEIVTKDFKRS 173
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAM 296
NC + I+ SW I A+ GL L +A +C + N +++W+S ++ AM
Sbjct: 174 GPNCSETIRSSWDAINRFARTGAGLRWLSEALGLCTALTNTEDVQRLKAWISETWINLAM 233
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSGTAKCFDLNG 355
DYP S+FL PLPA+P+K +C+ + +P + + ++ A +VYYNYSG A C +++
Sbjct: 234 VDYPYESDFLQPLPAWPIKVVCQYLKNPSVSDAQLLQNIFQALNVYYNYSGQASCLNISE 293
Query: 356 DSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWIT 414
+ G W +QACTEM+M + D +FE + + C + +GV PRP WI
Sbjct: 294 TTTSSLGTQGWSYQACTEMVMPFCTNGIDDMFEPHLWNLGEFSDDCFKQWGVRPRPAWII 353
Query: 415 TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFST 474
T +GG I +NIIF NG DPWSGGGV K+I+ T++A+ EGAHH+DLR
Sbjct: 354 TMYGGKNISS-----HTNIIFSNGELDPWSGGGVTKDITDTLVAITIPEGAHHLDLRARN 408
Query: 475 KEDPQWLKDVRRREVEIIGKWISQYFQDL 503
DP + R EV + +WI ++ L
Sbjct: 409 AFDPTTVLLARSLEVRHMKQWIRDFYASL 437
>gi|355713391|gb|AES04658.1| prolylcarboxypeptidase [Mustela putorius furo]
Length = 496
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 198/458 (43%), Positives = 280/458 (61%), Gaps = 19/458 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q +DHF ++ +TF+QRYLI D +W KN I YTGNEGDI WF NT
Sbjct: 48 YSVHYIQQKVDHFGFSAD--KTFKQRYLIADAYW--KKNGGSILFYTGNEGDITWFCNNT 103
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + KA+LVF EHRYYG+S+P+G NK +++++ +L+S QALAD+A LI
Sbjct: 104 GFMWDVADQLKAMLVFAEHRYYGESLPFG-NK--SFRDSRHLNFLTSEQALADFAVLIKH 160
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LKK + A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI +F N+V F
Sbjct: 161 LKKTIPGAKNQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWHFGNLVPCGVFME 220
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWL 287
I+T+DF+ NC + I+ SW I ++ GL L +A +C + N +++WL
Sbjct: 221 IVTKDFKRGGPNCSETIRSSWDAINRFSRTGAGLRWLSEALDLCTALTNAQDVQHLKAWL 280
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSG 346
S ++ AM DYP +FL PLPA+P+K +C+ + +P + + ++ A +VYYNYSG
Sbjct: 281 SETWINLAMVDYPYECDFLQPLPAWPIKVVCQYLKNPSVSDAQLLQNIFQALNVYYNYSG 340
Query: 347 TAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
A C +++ + + G W +QACTEMIM + D +FE + + C + +
Sbjct: 341 QASCLNISETTTSNLGTQGWSYQACTEMIMPFCTNGIDDMFEPHAWNLREFSDDCFKQWS 400
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
V PRP WI T +GG I +NIIF NG DPWSGGGV K+I+ T++A+ +GA
Sbjct: 401 VRPRPAWIITMYGGKNISS-----HTNIIFSNGELDPWSGGGVTKDITDTLVAITIPDGA 455
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
HH+DLR DP + R EV + +WI ++ L
Sbjct: 456 HHLDLRARNAFDPTTVILARSLEVRHMKQWIRNFYASL 493
>gi|30688688|ref|NP_851059.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|10177233|dbj|BAB10607.1| prolylcarboxypeptidase-like protein [Arabidopsis thaliana]
gi|332005705|gb|AED93088.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 502
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 199/464 (42%), Positives = 288/464 (62%), Gaps = 17/464 (3%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
S+ K + K Y Q LDHF + P+SY TFQQRY I+ THWGG+K NAPI + G E
Sbjct: 45 STQKVDESNLKMYYFNQTLDHFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEES 104
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
++ GF+ D P+ ALLV+IEHRYYG+++P+G +E A KNAST GYL++ QAL
Sbjct: 105 SLDSDLAAIGFLRDNGPRLNALLVYIEHRYYGETMPFGSAEE-ALKNASTLGYLNAAQAL 163
Query: 164 ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
ADYA++++ +K+ + SP++V GGSYGGMLAAWFRLKYPH+A+GALASSAP+L F++
Sbjct: 164 ADYAAILLHVKEKYSTNHSPIIVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDT 223
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLA 282
+ + I+T+ F+ SE CY I+ SW +I+ A KP GL L K F+ C +
Sbjct: 224 RPKFGYYYIVTKVFKEASERCYNTIRNSWIEIDRVAGKPNGLSILSKQFKTCAPLNGSFD 283
Query: 283 IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI--DDPKTGNDVFAKLYGAASV 340
I+ +L T +Y Y N P F V ++C AI + P ++ +++
Sbjct: 284 IKDFLDT--IYAEAVQY-------NRGPNFWVAKVCNAINANPPNRRYNLLDRIFAGVVA 334
Query: 341 YYNYSGTAKCFDLNGDSDPHGLS-EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
G C+D + P + W WQ+C+E++M G D +D++F + + +
Sbjct: 335 LV---GNRTCYDTKMFAQPTNNNIAWRWQSCSEIVMPVGYDKQDTMFPTAPFNMTSYIDG 391
Query: 400 CKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLAL 459
CK +GV PRP+WITT FG ++ L+L++F SNIIF NGL DP+S GGVL++IS T++A+
Sbjct: 392 CKSYHGVTPRPHWITTYFGIQEVKLILQKFGSNIIFSNGLSDPYSVGGVLEDISDTLVAI 451
Query: 460 VEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
K G+H +D+ +KEDP+WL R +E+++I WIS Y DL
Sbjct: 452 TTKNGSHCLDITLKSKEDPEWLVIQREKEIKVIDSWISTYQNDL 495
>gi|224109998|ref|XP_002333169.1| predicted protein [Populus trichocarpa]
gi|222835052|gb|EEE73501.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 217/453 (47%), Positives = 296/453 (65%), Gaps = 17/453 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++T ++ Q LDHFNY P+SY F QRYLIN +WGG+ +API V+ G E I+
Sbjct: 56 FETFFYNQTLDHFNYRPESYDKFPQRYLINSKYWGGANVSAPILVFLGAEEPIDEDLAAV 115
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ D A +F +LLVFIEHRYYGKSIP+G +E A K+AS GY +S QA+ADYA++II
Sbjct: 116 GFLVDNAVQFNSLLVFIEHRYYGKSIPFGSREE-ALKDASKLGYFNSAQAIADYAAIIIH 174
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+K+ L A SPV+V GGSYGGMLA+WFRLKYPH+A+GALASSAPIL FD+I + +I
Sbjct: 175 IKETLRAQYSPVIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDDITPQDGYYSI 234
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAF 291
+T+DFR SE CY+ IK SW +I+E A KP GL L K F+ C + + ++ L T +
Sbjct: 235 VTKDFREASETCYQTIKTSWSEIDELASKPDGLSMLSKKFKTCNPLTDASELKDHLDTMY 294
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
A + P P +PV E+CK ID G+D ++++G Y+G C+
Sbjct: 295 ASAAQYNRP---------PTYPVNEVCKGIDGGGFGDDTLSRIFGGLVA---YNGNLSCY 342
Query: 352 DLNGDSDPHGLS-EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
+N +DP + W WQ C+EM + G N +S+F D +CK YGV RP
Sbjct: 343 -VNAHTDPSETTVGWQWQKCSEMAIPIGVGN-NSMFPPDPFDLKDYIEHCKSLYGVTTRP 400
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
+W+TT +GGH I L+L+RFASNIIF NGLRDP+S GGVL+NIS TV+A+ G+H +D+
Sbjct: 401 HWVTTYYGGHSIKLILQRFASNIIFSNGLRDPYSSGGVLENISDTVVAVKTVNGSHCLDI 460
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
F+ + DP WL R+ E++II +WI++Y+ DL
Sbjct: 461 LFAEENDPAWLVTQRKIEIKIIKEWINKYYADL 493
>gi|441645350|ref|XP_004093164.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X carboxypeptidase
[Nomascus leucogenys]
Length = 517
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 195/450 (43%), Positives = 276/450 (61%), Gaps = 18/450 (4%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y T ++DHF +N + +TF QRYL+ D +W KN I YTGNEGDI WF NTGFM+
Sbjct: 73 YKTPLVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMW 128
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
DVA +A+LVF EHRYYG+S+P+G N ++K++ +L+S QALAD A LI LK+
Sbjct: 129 DVAEDXEAMLVFAEHRYYGESLPFGDN---SFKDSRHLNFLTSEQALADLAELIKHLKRT 185
Query: 177 LT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
+ A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI F+++V F I+T
Sbjct: 186 IPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTT 245
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAFV 292
DFR +C + I+ SW I + GL+ L A +C S+ ++ W+S +V
Sbjct: 246 DFRKSGPHCSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWV 305
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCF 351
AM DYP SNFL PLPA+P+K +C+ + +P + + + ++ A +VYYNYSG KC
Sbjct: 306 NLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCL 365
Query: 352 DLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
+++ + G W +QACTE++M + D +FE + + C + +GV PRP
Sbjct: 366 NISETATSSLGTLGWSYQACTEIVMPFCTNGVDDMFEPHSWNLKELSDDCFQLWGVRPRP 425
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
+WITT +GG I +NI+F NG DPWSGGGV K+I+ T++A+ EGAHH+DL
Sbjct: 426 SWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHLDL 480
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
R DP + R EV + WI ++
Sbjct: 481 RTKNALDPTSVLLARSLEVRHMKNWIRDFY 510
>gi|340374641|ref|XP_003385846.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Amphimedon
queenslandica]
Length = 490
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 216/509 (42%), Positives = 285/509 (55%), Gaps = 36/509 (7%)
Query: 1 MATRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQ 60
MATR S L I ++ S + P L +L + +S Y+T Y Q
Sbjct: 1 MATRISSFSLFLF-----------ILVSYTSGLLNPRSLKNLPRGNSESSYEYQTLYFKQ 49
Query: 61 ILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAP 120
+DHFN+ +S TF QRYL+ND W N PIF Y GNEGDI WFA NTGF++D+AP
Sbjct: 50 PIDHFNF--ESNVTFSQRYLLNDAFWD-KDNGGPIFFYCGNEGDITWFANNTGFVWDIAP 106
Query: 121 KFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTAT 180
+FKAL+VF EHRYYG ++P+G +Y N ST GYL+S QALAD+ LI DLK
Sbjct: 107 EFKALVVFAEHRYYGNTLPFGAE---SYANLSTLGYLTSEQALADFVLLINDLKGKYG-- 161
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
D PVV FGGSYGGML+AW R+KYP V +G++A+SAPI F + + II+
Sbjct: 162 DVPVVAFGGSYGGMLSAWIRMKYPSVVVGSIAASAPIWQFPGLCDCGKANEIISSTMSQY 221
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN-LAIESWLSTAFVYTAMTDY 299
S NCY + SW I +T GGL L F +C+ K+ L + SWL + AM +Y
Sbjct: 222 STNCYNNVLSSWDIINKTGTTSGGLSLLSTTFSLCQPLKDSLTLMSWLQDVWFNLAMMNY 281
Query: 300 PTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAAS----VYYNYSGTAKCFDLNG 355
P P+NF+ PLPA+P+ C+ + T D KL A S VYYNYSG + C DLN
Sbjct: 282 PYPANFMEPLPAWPLNVTCQKLSQTFTTEDDDIKLLQALSTAMQVYYNYSGQSSCLDLNK 341
Query: 356 DSDPH-GLSEWGWQACTEMIMLT---GGDNKDSIFEESEEDYDARARYCKEAY-GVDPRP 410
+S G W +Q CTEM M GGDN F + + + + C++++ G+ PRP
Sbjct: 342 ESSTDLGAKGWSYQYCTEMAMPMCSKGGDND--AFPKQQWTVNNYVKNCQDSFPGIQPRP 399
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
WI + G I SNI+F NG DPWS G VL NIS +++A++ +GAHH+DL
Sbjct: 400 YWIEKVYNGKNISAF-----SNIVFSNGDLDPWSAGVVLDNISDSLIAVIINDGAHHLDL 454
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQY 499
R + D +K R + I KWI Y
Sbjct: 455 REANPMDTDSVKAARNIHKDNINKWIGGY 483
>gi|193787211|dbj|BAG52417.1| unnamed protein product [Homo sapiens]
Length = 517
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 195/450 (43%), Positives = 274/450 (60%), Gaps = 18/450 (4%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y T ++DHF +N + +TF QRYL+ D +W KN I YTGNEGDI WF NT FM+
Sbjct: 73 YKTPLVDHFGFN--TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTEFMW 128
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
DVA + KA+LVF EHRYYG+S+P+G N ++K++ +L+S QALAD+A LI LK+
Sbjct: 129 DVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLNFLTSEQALADFAELIKHLKRT 185
Query: 177 LT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
+ A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI F+++V F I+T
Sbjct: 186 IPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTT 245
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAFV 292
DFR +C + I SW I GL+ L A +C S+ ++ W+S +V
Sbjct: 246 DFRKSGPHCSESIHRSWDAINRLLNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWV 305
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCF 351
AM DYP SNFL PLPA P+K +C+ + +P + + + ++ A +VYYNYSG KC
Sbjct: 306 NLAMVDYPYASNFLQPLPARPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCL 365
Query: 352 DLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
+++ + G W +QACTE++M + D +FE + + C + +GV PRP
Sbjct: 366 NISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGVRPRP 425
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
+WITT +GG I +NI+F NG DPWSGGGV K+I+ T++A+ EGAHH+DL
Sbjct: 426 SWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHLDL 480
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
R DP + R EV + WI ++
Sbjct: 481 RTKNALDPMSVLLARSLEVRHMKNWIRDFY 510
>gi|380027987|ref|XP_003697693.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Apis florea]
Length = 491
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 199/461 (43%), Positives = 282/461 (61%), Gaps = 17/461 (3%)
Query: 46 SKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
+++++ Y+ K +DHF+++ TF+ RYLIN T W + NNAPIF YTGNEG+I
Sbjct: 40 TQNTKYKYEIKTIDMPVDHFSFSVSD--TFKLRYLINGT-WQKT-NNAPIFFYTGNEGNI 95
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
E FAQNTGFM+D+AP+F+ALLVF EHRYYG+S+PY +Y + + GYL+S QALAD
Sbjct: 96 EIFAQNTGFMWDIAPEFEALLVFAEHRYYGESMPYSNK---SYTDLNHLGYLTSQQALAD 152
Query: 166 YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
Y LI LK +SP++VFGGSYGGML+AW R+KYPH+ GA+ASSAPIL F I
Sbjct: 153 YIDLIQYLKSKPKYKNSPIIVFGGSYGGMLSAWIRMKYPHIVQGAIASSAPILQFTGITE 212
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNL- 281
SF I+T DF+ NC K+I+ SW I G + L +++C K+E N+
Sbjct: 213 CESFLRIVTSDFKKAHSNCPKLIRKSWNIITNITSTNEGKKWLSDNWKLCQPLKNENNIE 272
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI-DDPKTGNDVFAKLYGAASV 340
+ S+L + AM +YP +NFL PLPA+P+ +CK + ++ TG D+ + A ++
Sbjct: 273 QLISYLQDIYTNLAMVNYPYKANFLAPLPAYPINAVCKHLTNESLTGTDLLIAIKNAINI 332
Query: 341 YYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
+ NY+ KC +LN S W +QACTEM+M D + +F+ + D + C
Sbjct: 333 FTNYTSETKCLNLNNSSPQLDAIGWSFQACTEMVMPICSDGINDMFKPHTWNLDEYSNDC 392
Query: 401 KEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
+ Y V P+PN I E+G + A+NI+F NGL DPWS GGVL+N+S + +A++
Sbjct: 393 IKQYSVKPQPNLICEEYGCKDLST-----ATNIVFSNGLMDPWSSGGVLRNLSSSAIAII 447
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
E AHH+DLR S DP + R+ I KWI++Y +
Sbjct: 448 IPESAHHLDLRSSNPNDPFSVVLARKYHRFFITKWINEYHE 488
>gi|410910456|ref|XP_003968706.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Takifugu
rubripes]
Length = 500
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 205/478 (42%), Positives = 283/478 (59%), Gaps = 25/478 (5%)
Query: 33 RITPEKLSSLISSSKDSQGL---YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGS 89
R +L + S S G+ YKT Y Q +DHF + TF+QRYLI D HW
Sbjct: 23 RALKSQLFTRRGGSSSSAGIPISYKTLYFEQKIDHFGFLEDG--TFKQRYLIADKHW--Q 78
Query: 90 KNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
+ PIF YTGNEGDI WF NTGFM+++A +F A+LVF EHRYYG+S+P+G + +Y
Sbjct: 79 QPGGPIFFYTGNEGDITWFCNNTGFMWEIAEEFGAMLVFAEHRYYGESLPFGAD---SYS 135
Query: 150 NASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
+ YL+S QALAD+A L+ +LK A +SPV+ GGSYGGMLAAWFR+KYPH+ +
Sbjct: 136 DNKHLNYLTSEQALADFAVLVQNLKSTFPGAQNSPVIAVGGSYGGMLAAWFRMKYPHIVV 195
Query: 209 GALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKL 268
GALASSAPI F +V F I+TQDF +C K I+ SWK IE + GL+ L
Sbjct: 196 GALASSAPIWQFPGMVPCGDFYKIVTQDFAKSGSDCDKNIRMSWKAIENVSSTASGLQWL 255
Query: 269 QKAFRIC---KSEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP 324
+ F +C KS+ ++A ++WL +V AM DYP +NFL PLP +P++ +CK +
Sbjct: 256 SEEFGLCAPLKSKSDIAGFKNWLQETWVNLAMVDYPYEANFLQPLPPWPIQAVCKYLSFS 315
Query: 325 KTGNDVFAKLYG---AASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGD 380
+ D + L+G A VYYNY+G++ C + + + G W +QACTEM+M D
Sbjct: 316 GSVPD-YHLLHGVSQATKVYYNYTGSSPCLNTSQTATGSLGFLGWFYQACTEMVMPMCTD 374
Query: 381 NKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLR 440
+FE E ++ A + C +G PR +W T +GG I SNIIF NG
Sbjct: 375 GIQDMFEPEEWNFQAFSDDCNARFGARPRADWAGTVYGGKDIAA-----HSNIIFSNGGL 429
Query: 441 DPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
DPWS GGV NI+ ++++++ +GAHH+DLR++ DP ++ R EV KWI
Sbjct: 430 DPWSAGGVNHNITDSLISILIPDGAHHLDLRYTNDHDPPSVRAARALEVNYFHKWIKH 487
>gi|190702489|gb|ACE75375.1| prolylcarboxypeptidase [Glyptapanteles indiensis]
Length = 497
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 199/467 (42%), Positives = 285/467 (61%), Gaps = 19/467 (4%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
++S S Y TK +DHF+++ + TF+ RY +NDT W KN APIF YTGNEG
Sbjct: 35 TNSNSSPYKYVTKKFIVPVDHFSFSLNN--TFEMRYFVNDT-WKNGKN-APIFFYTGNEG 90
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
+E FA NTGFM+++AP F AL+VF EHRYYG+S+P+G NK ++ N GYL+S QAL
Sbjct: 91 VLETFAANTGFMWEIAPTFGALIVFAEHRYYGESMPFG-NK--SFDNVKNLGYLTSQQAL 147
Query: 164 ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
ADY LI+ LK + + SPV+ FGGSYGGML+AWFR+KYPH+ GA+A+SAP+L F
Sbjct: 148 ADYVDLIVHLKSDPSLKHSPVIAFGGSYGGMLSAWFRMKYPHIINGAIAASAPVLQFTGY 207
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS---EKN 280
+FS I+T DFR+V NC KVI+ SW+ I+ G + + F++C S E+N
Sbjct: 208 TDCQAFSRIVTSDFRAVHPNCEKVIRKSWESIKNLTSTDDGKKWISSEFKVCGSLTTEEN 267
Query: 281 L-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK--TGNDVFAKLYGA 337
+ +L + + AM +YP ++FL+PLPA+P++E CK +++ + DV +
Sbjct: 268 FETFKYFLLSVYSNLAMVNYPYATDFLSPLPAYPIREFCKFVNNTNLSSDKDVMTNIQKG 327
Query: 338 ASVYYNYSGTAKCFDL-NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR 396
++Y NY+G KC DL N + D S W +QACTEM+M D + +FE ++
Sbjct: 328 INLYSNYTGKLKCLDLSNPEPDLGAASGWDYQACTEMVMPMCNDGVNDMFEPDPWNFTKY 387
Query: 397 ARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTV 456
++ C + V P+P + +G + ASNI+F NGL DPWS GGVL+N+S +
Sbjct: 388 SQDCFTHFNVTPKPELVCDTYGCDDLST-----ASNIVFSNGLLDPWSSGGVLRNLSSSA 442
Query: 457 LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
+A++ EGAHH+DLR S DP + R I KWI ++ + L
Sbjct: 443 VAIIIPEGAHHLDLRGSHPADPYSVVKAREYHAYSIKKWIREHREGL 489
>gi|297734876|emb|CBI17110.3| unnamed protein product [Vitis vinifera]
Length = 405
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 188/393 (47%), Positives = 255/393 (64%), Gaps = 17/393 (4%)
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ D A +FKALLV+IEHRY GKSIP+G +E A KNAS GY +S QA+ADYA ++I
Sbjct: 23 GFLTDNAHRFKALLVYIEHRYCGKSIPFGSREE-ALKNASIRGYFNSAQAIADYAEVLIY 81
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+KK L A +SPV+V GGSYGGMLA+WFRLKYPHVA+G LASSAPIL F++I + +I
Sbjct: 82 IKKKLLAENSPVIVVGGSYGGMLASWFRLKYPHVALGVLASSAPILYFEDITPQNGYYSI 141
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-AIESWLSTAF 291
+T+DFR SE+CYK I+ SW +I+ A +P G+ L K FR C N ++ +L T
Sbjct: 142 VTKDFREASESCYKTIRESWSEIDRVASEPNGISILSKKFRTCDRLNNSDELKDYLDT-- 199
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
+Y Y P P +PV C ID G+D+ ++++ Y G + C+
Sbjct: 200 IYCTAAQYNDP-------PMYPVTMACSGIDGAPEGSDILSRIFAGVVA---YGGNSSCY 249
Query: 352 DLNGDSDPHGLSE-WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
+ +P SE W WQ C+EM+M G + D++F + + + C YGV PRP
Sbjct: 250 TTS--HNPTETSEGWAWQTCSEMVMPIGRGDNDTMFPPAPFNLTTFIQECTSIYGVSPRP 307
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
+W+TT +GGH I L+L+RFASNIIF NGLRDP+S GGVLKNIS +VLA++ G+H +D+
Sbjct: 308 HWVTTYYGGHNIELILRRFASNIIFSNGLRDPYSSGGVLKNISDSVLAILTVNGSHCLDI 367
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
+T DP+WL R+ EVE+I WI+QY+ DL
Sbjct: 368 LPATSTDPEWLVMQRKAEVEVIESWIAQYYADL 400
>gi|297812429|ref|XP_002874098.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297319935|gb|EFH50357.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 427
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 195/434 (44%), Positives = 270/434 (62%), Gaps = 18/434 (4%)
Query: 74 TFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRY 133
TFQQRY I+ HW G+K NAPI + G E +E GF+ D AP FKAL V+IEHRY
Sbjct: 2 TFQQRYAIDAKHWAGAKANAPILAFLGLEASLETDLAAFGFLSDNAPHFKALKVYIEHRY 61
Query: 134 YGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGG 193
YGK+IP+G KE A KNAST GYL+S QALADYA++++ +K+ +AT SP++V GGSYGG
Sbjct: 62 YGKTIPFGSAKE-AMKNASTLGYLNSAQALADYAAILLHIKEKYSATHSPIIVVGGSYGG 120
Query: 194 MLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWK 253
MLAAWFRLKYPH+A+GALASSAP+L F++ + + IIT+ F+ ++ CY I+ SW+
Sbjct: 121 MLAAWFRLKYPHIALGALASSAPLLYFEDTRPKFGYYYIITKVFKETNKRCYNTIRKSWE 180
Query: 254 QIEETAKKPGGLEKLQKAFRICKS--EKNLAIESWLSTAFVYTAM-TDYPTPSNFLNPLP 310
+I+ A KP GL L K F+ C S ++ ++ +L T + T D +N N +
Sbjct: 181 EIDRVASKPNGLLILSKKFKTCASPLSRSFDLKDFLDTVYAETVQYNDGVWVTNVCNAIN 240
Query: 311 AFPVKEMCKAIDDPKTGNDVFAKLYGAASVY-YNYSGTAKCFDLNGDSDPHGLSEWGWQA 369
A P +D G L G+ S Y NYS D+ W WQ
Sbjct: 241 ANPPNRKIDILDRIFAG---VVALTGSQSCYNTNYSVQVTNNDM----------AWRWQC 287
Query: 370 CTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRF 429
C+E+++ G D +D++++ S + + C+ +YGV PRP+WITT FG + L+L+RF
Sbjct: 288 CSEIVVPVGHDKQDTMYQTSPFNMTSYIEDCESSYGVSPRPHWITTYFGIQNVKLILQRF 347
Query: 430 ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREV 489
SNIIF NGL DP+S GGVL++IS TV+A+ K G+H D+ +K DP+WL R +E+
Sbjct: 348 GSNIIFSNGLSDPYSVGGVLEDISDTVVAITTKNGSHCQDINLKSKGDPEWLVMQREKEI 407
Query: 490 EIIGKWISQYFQDL 503
++I WIS Y DL
Sbjct: 408 KVINSWISTYQNDL 421
>gi|156369614|ref|XP_001628070.1| predicted protein [Nematostella vectensis]
gi|156215037|gb|EDO36007.1| predicted protein [Nematostella vectensis]
Length = 436
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 198/449 (44%), Positives = 281/449 (62%), Gaps = 21/449 (4%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
LDHFN+ ++ TF QRYL+N +W + PIF YTGNEGDI WFA NTGFM+D A +
Sbjct: 1 LDHFNF--RTSATFSQRYLVNIANW---RKGGPIFFYTGNEGDITWFANNTGFMWDNAKE 55
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-AT 180
F A+LVF EHRYYG+++P+G + +Y++ GYLSS QALAD+A+LI +K AT
Sbjct: 56 FGAMLVFAEHRYYGETLPFG---KRSYESPKYLGYLSSEQALADFATLIRHIKLTTPGAT 112
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF-RS 239
SPV+ GGSYGGML++W R+KYP++ ALA+SAPIL F + F+ I+T+DF R
Sbjct: 113 GSPVIAIGGSYGGMLSSWIRMKYPNLVTAALAASAPILYFQGLTPCEGFNEIVTKDFHRD 172
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNLA-IESWLSTAFVYTA 295
++C I+ SW IE+ G + L F C K++ N+ ++ WLS + A
Sbjct: 173 GGDSCVNSIRKSWSVIEKLGATQSGRKTLTSVFNTCSPIKTKYNVTQLQDWLSETWANLA 232
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCFDLN 354
M +YP + FL PLPA+P+K++C + D + K + GA VYYN SG AKCF+L+
Sbjct: 233 MVNYPYAATFLEPLPAWPIKKVCSHLTDADLPDVALLKAVAGAVGVYYNSSGQAKCFNLS 292
Query: 355 GDS-DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWI 413
+ G W +QACTEM+M D + +F+ + D++A A C+ +YGV PR W+
Sbjct: 293 QQAVSSLGDKGWDFQACTEMVMPMCSDGVNDMFKPVKWDFEAFAESCQGSYGVKPRQYWV 352
Query: 414 TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFS 473
++GG I SNIIF NGL DPW GGV K++S +++A++ +EGAHH+DLR S
Sbjct: 353 EVQYGGRDISA-----HSNIIFSNGLLDPWFAGGVTKSLSPSLVAVLVEEGAHHLDLRHS 407
Query: 474 TKEDPQWLKDVRRREVEIIGKWISQYFQD 502
DP L R+ E E + +WIS+Y+++
Sbjct: 408 NPADPPSLIKARQTEKEYLHRWISEYYKN 436
>gi|348525386|ref|XP_003450203.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Oreochromis
niloticus]
Length = 502
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 198/452 (43%), Positives = 274/452 (60%), Gaps = 20/452 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
YKT Y Q +DHF + TF+QRYL++D +W + PI YTGNEGDI WF NT
Sbjct: 50 YKTFYFDQKIDHFGFLEDG--TFKQRYLLSDKYW--QQPGGPILFYTGNEGDITWFCNNT 105
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+++A + A+LVF EHRYYG+S+P+G + +Y+++ YL+S QALAD+A LI +
Sbjct: 106 GFMWEIAEELDAMLVFAEHRYYGESLPFGQD---SYRDSKHLNYLTSEQALADFAVLIQN 162
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK L A SPV+ GGSYGGML+AWFR+KYPHV +GALASSAPI F +V F
Sbjct: 163 LKGTLPGAQHSPVIAVGGSYGGMLSAWFRMKYPHVVVGALASSAPIWQFPGMVPCGDFYK 222
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN----LAIESWL 287
+TQDF NC I+ SWK + + GL+ L + F +C KN L +SWL
Sbjct: 223 TVTQDFAKSGINCDANIRKSWKAVNNVSSSASGLQWLSEEFSLCAPLKNKNDVLTFKSWL 282
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND--VFAKLYGAASVYYNYS 345
+V AM DYP +NFL PLP +P++ +CK + T +D + + AA VYYNY+
Sbjct: 283 QETWVNLAMVDYPYEANFLQPLPRWPIQVVCKYLSFDSTVSDYQLLQGVAQAAKVYYNYT 342
Query: 346 GTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
G++ C + + + G W +QACTEM+M D +FE E ++ A + CK +
Sbjct: 343 GSSPCLNTSQTATSSLGYLGWFYQACTEMVMPMCTDGVQDMFEPEEWNFQAFSDECKAMF 402
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
G+ PR +W T +GG +I SNIIF NG DPWS GGV NIS ++++++ +G
Sbjct: 403 GIRPRADWAGTVYGGKEISS-----HSNIIFSNGGLDPWSSGGVTSNISHSLVSIMIPDG 457
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
AHH+DLR+S DP ++ R EV+ +WI
Sbjct: 458 AHHLDLRYSNDLDPPSVRAARDLEVKYFREWI 489
>gi|297812433|ref|XP_002874100.1| hypothetical protein ARALYDRAFT_910290 [Arabidopsis lyrata subsp.
lyrata]
gi|297319937|gb|EFH50359.1| hypothetical protein ARALYDRAFT_910290 [Arabidopsis lyrata subsp.
lyrata]
Length = 495
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 189/450 (42%), Positives = 285/450 (63%), Gaps = 19/450 (4%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
+ Q LDHF Y P SY+ F+QRY +N +W G K NAPI Y G E ++ GF+
Sbjct: 58 FFEQTLDHFTYTPGSYKKFRQRYAVNSKYWEGGKTNAPILAYLGAESSLDSELSVLGFLK 117
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
D AP FKAL+V+IEHR+YG+++P+G +E KNA T GYL++ QALADYA++++ +K+
Sbjct: 118 DNAPHFKALMVYIEHRFYGETMPFGSAEE-TLKNAKTLGYLNAAQALADYAAILLHIKET 176
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
+A SPV+V GGSYGGMLAAWF+LKYPH+A+GALASSAP+L F++ + + + I+T+
Sbjct: 177 YSAKHSPVIVIGGSYGGMLAAWFKLKYPHIALGALASSAPLLYFEDTLPKHGYFYIVTKV 236
Query: 237 FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTA 295
F+ S+ C+ I+ SW +I+ A KP GL L K F++C + ++S+LS + TA
Sbjct: 237 FKETSQKCHNKIRKSWDEIDRIAAKPNGLSILSKKFKLCNPLNDTIELKSYLSNIYAGTA 296
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDD--PKTGNDVFAKLYGAASVYYNYSGTAKCFDL 353
+ NP + V +C+AI+ P T +D+ +++ G C+ +
Sbjct: 297 QYNN-------NP---YSVASLCEAINTSPPNTKSDLLDQIFAGVVA---SGGNISCYGM 343
Query: 354 NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWI 413
D + W WQ+C+EM+M G + +D++F+ + + + C+ YGV PRP+W+
Sbjct: 344 --DQITNDARAWTWQSCSEMVMPIGYEKEDTMFQPKPFNMSSFTKNCESQYGVSPRPHWV 401
Query: 414 TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFS 473
T FG + L+ +RF +NIIF NGL DP+S GGVL++IS TV+A+ ++G+H D+
Sbjct: 402 TAYFGSQDVKLIFRRFGNNIIFSNGLLDPYSVGGVLEDISDTVIAITTRDGSHCQDIVLK 461
Query: 474 TKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
+KEDP+WL + R +EV+II WIS Y +DL
Sbjct: 462 SKEDPEWLVEQREKEVKIIDSWISTYQKDL 491
>gi|350415944|ref|XP_003490799.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Bombus impatiens]
Length = 494
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 197/459 (42%), Positives = 278/459 (60%), Gaps = 23/459 (5%)
Query: 52 LYKTKYHTQI------LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
LY KY +I +DHF+++ TF+ RYL+N+T W K +APIF YTGNEG+I
Sbjct: 41 LYSGKYKYEIKTIDMPVDHFSFSVPD--TFKLRYLVNNT-WQ-IKKDAPIFFYTGNEGNI 96
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
E FAQNTGFM+D+AP+F ALL+F EHRYYG+S+PYG +Y + + GYL+S QALAD
Sbjct: 97 ENFAQNTGFMWDIAPEFGALLIFAEHRYYGESMPYGNK---SYMDLNHLGYLTSRQALAD 153
Query: 166 YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
Y LI +K SPV+VFGGSYGGML+AW R+KYPHV GA+ASSAPIL F +
Sbjct: 154 YVDLIQYVKSKPEYKYSPVIVFGGSYGGMLSAWIRMKYPHVVQGAIASSAPILQFSGVTE 213
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA--- 282
+F I+T DF++ NC K+I+ SW I G E L ++++C+ KN +
Sbjct: 214 CEAFVRIVTSDFKTAHTNCPKLIRRSWNTIINITSSDKGREWLADSWKLCQPLKNSSDVQ 273
Query: 283 -IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLYGAASV 340
+ S+L ++ A+ +YP +NFL PLPA+PVK +CK + + G ++ ++ A ++
Sbjct: 274 QLMSYLEDIYINLAVVNYPYEANFLTPLPAYPVKAVCKHLTNQSLVGTELLIAIHKAINI 333
Query: 341 YYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
+ NYS KC +LN G W +QACTEM+M D + +F+ + D C
Sbjct: 334 FTNYSSETKCLNLNDSVPQLGAVGWPFQACTEMVMPICSDGINDMFKPRSWNLDRYINDC 393
Query: 401 KEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
+ Y V P+PN + ++G + A+NI+F NGL DPWS GGVL+N+S + +A++
Sbjct: 394 MKQYSVKPQPNLVCEQYGCKDLST-----ATNIVFSNGLMDPWSSGGVLQNLSSSAVAVI 448
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
E AHH+DLR S DP + R+ I KWI +Y
Sbjct: 449 IPESAHHLDLRSSNANDPYSVVLARKYHRFFIKKWIQEY 487
>gi|383861707|ref|XP_003706326.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Megachile
rotundata]
Length = 493
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 204/502 (40%), Positives = 294/502 (58%), Gaps = 25/502 (4%)
Query: 7 FLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHT--QILDH 64
F + L+F + ++ IFP + S + + YK +Y T +DH
Sbjct: 3 FNALLLIFVALWQLTVQHIFPNIHDGNFKRQFRSRI----ELVNAKYKYEYKTIDMPVDH 58
Query: 65 FNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKA 124
F++ S F+ RYL+NDT W + NNAPIF YTGNEGDIE FAQN+GFM+D+AP+F A
Sbjct: 59 FDF--ASVDKFKLRYLMNDT-WVKT-NNAPIFFYTGNEGDIEGFAQNSGFMWDIAPEFGA 114
Query: 125 LLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPV 184
LL+F EHRYYG+S+PYG +Y + GYLSS QALADY LI L+ + SPV
Sbjct: 115 LLIFAEHRYYGESMPYGNK---SYTDIKYLGYLSSEQALADYVDLIQYLRSDSKHKHSPV 171
Query: 185 VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENC 244
+VFGGSYGGML+AW R+KYPH+ GA+A SAPIL F FS I+T DF+ NC
Sbjct: 172 IVFGGSYGGMLSAWMRMKYPHIVQGAIACSAPILQFTTECE--VFSRIVTSDFKMAHRNC 229
Query: 245 YKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN----LAIESWLSTAFVYTAMTDYP 300
K+I+ SW I G + L + +++C+ KN +S+LS + AM +YP
Sbjct: 230 PKLIRKSWNAINNITSNDEGKKWLSENWKLCQPLKNENDVEVFKSYLSDIYGNFAMVNYP 289
Query: 301 TPSNFLNPLPAFPVKEMCKAI-DDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDP 359
S+FL PLP FP+K C+ + ++ G ++ L+ A +V+ NY+G KC +LN +
Sbjct: 290 YASDFLAPLPPFPIKVACEYLTNESLVGKELLTALHDAVNVFTNYTGETKCLNLNTSTPQ 349
Query: 360 HGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGG 419
W +QACTEM+M D + +FE + +++ ++ C++ + + P+PN ++G
Sbjct: 350 LNDRGWHFQACTEMVMPLCSDGINDMFEPTPWNFEEYSKECEKLFSIKPQPNMACNQYGC 409
Query: 420 HKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQ 479
+ A+NI+F NGL DPWS GGVL+N+S++ +A++ EGAHH+DLR S DP
Sbjct: 410 EDLST-----ATNIVFSNGLLDPWSSGGVLRNLSESAIAIIIPEGAHHLDLRGSHTNDPF 464
Query: 480 WLKDVRRREVEIIGKWISQYFQ 501
+ R I KWI +Y +
Sbjct: 465 SVVIARNYHRYYINKWIQEYHE 486
>gi|395814759|ref|XP_003780909.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Otolemur
garnettii]
Length = 495
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 189/459 (41%), Positives = 284/459 (61%), Gaps = 18/459 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q +DHF +N S +TF QRYLI + +W K+ I YTGNEGDI WF NT
Sbjct: 48 YSVLYFQQKIDHFGFN--SVKTFNQRYLIANEYW--KKDGGSILFYTGNEGDIVWFCNNT 103
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + KA+LVF EHRYYG+S+P+G N ++K++ YL+S QALAD+A LI
Sbjct: 104 GFMWDVAQELKAMLVFAEHRYYGESLPFGKN---SFKDSRHLNYLTSEQALADFAELIRH 160
Query: 173 LKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
L++ + + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI F+++V F
Sbjct: 161 LQETIPGVKNQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMK 220
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK--SEKNLA-IESWLS 288
I+T DF+ C + I+ SW I + + GL+ L + +C + +++ ++ W++
Sbjct: 221 IVTSDFKRSGPYCSETIRKSWNVINQLSTTSRGLQYLTEVLHLCTPLTPRDVQHLKDWIA 280
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGT 347
+V AM DYP PS+FL PLPA+P+K +C+ + +P + + + ++ A ++YYNYSG
Sbjct: 281 ETWVNLAMVDYPYPSSFLQPLPAWPIKVVCQYLKNPNVSDSLMIQNIFQALNIYYNYSGE 340
Query: 348 AKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
A+C +++ + + G W +QACTE++M + D +FE + + C + +GV
Sbjct: 341 AQCLNVSTTTTSNLGSLGWSYQACTEIVMPFCTNGIDDMFEPHAWNLKEFSDECFKQWGV 400
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
P P+WITT +GG I +NI+F NG DPWSGGGV ++++ T++A EGAH
Sbjct: 401 RPSPSWITTMYGGKNI-----NSHTNIVFSNGELDPWSGGGVTEDLTDTLVAFTISEGAH 455
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
H+DLR S DP ++ R E + + WI +++ Q
Sbjct: 456 HLDLRASNALDPISVQLARTLETKHVKNWIRDFYRSGLQ 494
>gi|260808833|ref|XP_002599211.1| hypothetical protein BRAFLDRAFT_200045 [Branchiostoma floridae]
gi|229284488|gb|EEN55223.1| hypothetical protein BRAFLDRAFT_200045 [Branchiostoma floridae]
Length = 436
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 200/449 (44%), Positives = 280/449 (62%), Gaps = 27/449 (6%)
Query: 57 YHTQILDHFNY-NPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
Y TQ+ DHF++ NP TF RYL+NDT++ K+ PIF YTGNEGDIE F +NTG +
Sbjct: 4 YPTQV-DHFSFANPD---TFLLRYLVNDTYF---KDGGPIFFYTGNEGDIEGFVKNTGLL 56
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
++AP+F A+++F EHRYYG+S+PYG E ++K+ + GYL+STQALAD+A LI L+K
Sbjct: 57 MEMAPRFGAMVIFAEHRYYGQSMPYG---EESFKDPAHLGYLTSTQALADFAVLITRLRK 113
Query: 176 NLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
+ A +SPV FGGSYGGMLAAW R+KYPH+ G+LAS+A I + I ++S++ T
Sbjct: 114 TASGAANSPVFAFGGSYGGMLAAWIRMKYPHLVAGSLASAASIFQYPGITDCEAYSHVAT 173
Query: 235 QDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAF 291
+ F+ + C I+ SW I++ K GL+ L FR+C+ ++ A+ WL +
Sbjct: 174 RTFQRSAAGCPGRIRSSWDIIDDLGKTGAGLQNLTSMFRLCERLTADDMPALVDWLVNLW 233
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
+ A+ DYP P+NFL PLPA+PVKE C I + DV + + AA +Y+NY+G C
Sbjct: 234 MIYALIDYPYPANFLTPLPAWPVKEACHLI---VSNEDVLSGVAAAAKLYFNYTGQTACL 290
Query: 352 DLNGDSDP-HGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
D+ S P +G W +QACTE I D + +F D A + C E + V PRP
Sbjct: 291 DI---SKPYYGGLAWQYQACTEQIEPICSDGVNDMFPAIPWDLSAFSEACYERWKVRPRP 347
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
+W TE+ G I ASNIIF NG DPWS G VLK++S +++A+V ++ AHH+DL
Sbjct: 348 HWAVTEYWGRNISA-----ASNIIFSNGDLDPWSAGCVLKSLSDSLVAIVMEDAAHHLDL 402
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQY 499
R S DP + R +E +II KWI +Y
Sbjct: 403 RPSNPADPPSVIKARAQEADIIEKWIQEY 431
>gi|340710015|ref|XP_003393594.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Bombus
terrestris]
Length = 494
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 197/459 (42%), Positives = 278/459 (60%), Gaps = 23/459 (5%)
Query: 52 LYKTKYHTQI------LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
LY KY +I +DHF+++ TF+ RYL+N+T W K+ APIF YTGNEG+I
Sbjct: 41 LYSGKYKYEIKTINMPVDHFSFSVPD--TFKLRYLVNNT-WQIRKD-APIFFYTGNEGNI 96
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
E FAQNTGFM+D+AP+F ALL+F EHRYYG+S+PY +Y + + GYL+S QALAD
Sbjct: 97 ENFAQNTGFMWDIAPEFGALLIFAEHRYYGESMPYNNK---SYMDLNHLGYLTSRQALAD 153
Query: 166 YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
Y LI +K SPV+VFGGSYGGML+AW R+KYPHV GA+ASSAPIL F +
Sbjct: 154 YVDLIQYVKSKPEYKYSPVIVFGGSYGGMLSAWIRMKYPHVVQGAIASSAPILQFSGVTE 213
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA--- 282
+F I+T DF++ NC K+I+ SW I G E L ++++C+ KN +
Sbjct: 214 CEAFVRIVTSDFKTAHTNCPKLIRRSWSTIINITSSDKGREWLADSWKLCQPLKNSSDVQ 273
Query: 283 -IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLYGAASV 340
+ S+L + A+ +YP +NFL PLPA+PVK +CK + + G ++ ++ A ++
Sbjct: 274 QLISYLEDIYTNLAVVNYPYEANFLTPLPAYPVKAVCKHLTNQSLVGTELLIAIHKAINI 333
Query: 341 YYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
+ NY+G KC +LN G W +QACTEM+M D + +F+ + D + C
Sbjct: 334 FTNYTGETKCLNLNDSVPQLGAVGWPFQACTEMVMPICSDGINDMFKPRSWNLDRYTKDC 393
Query: 401 KEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
+ Y V P+PN I ++G + A+NI+F NGL DPWS GGVL+N+S + +A++
Sbjct: 394 IKQYSVKPQPNLICEQYGCKDLST-----ATNIVFSNGLMDPWSSGGVLQNLSSSAVAII 448
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
E AHH+DLR S DP + R+ I KWI +Y
Sbjct: 449 IPESAHHLDLRSSDANDPYSVILARKYHRFFIKKWIQEY 487
>gi|242006450|ref|XP_002424063.1| Lysosomal Pro-X carboxypeptidase, putative [Pediculus humanus
corporis]
gi|212507369|gb|EEB11325.1| Lysosomal Pro-X carboxypeptidase, putative [Pediculus humanus
corporis]
Length = 457
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 194/444 (43%), Positives = 271/444 (61%), Gaps = 18/444 (4%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF++ + +TF RYLINDT+W PIF YTGNEGDIE FAQNTGFM+++APK
Sbjct: 17 VDHFSF--VTNETFNIRYLINDTYWNNK--TGPIFFYTGNEGDIEVFAQNTGFMWEIAPK 72
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTAT- 180
F ALL+F EHRYYGKS+PYG + + GYL+S QALADY LI L N +
Sbjct: 73 FNALLIFAEHRYYGKSLPYGNK---TFSDPKYLGYLTSEQALADYVDLIAHLTWNDNKSY 129
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
+PV+ FGGSYGGMLAA+ R+KYPH+ GA+ASSAPI F + FS I+T DF
Sbjct: 130 KNPVIAFGGSYGGMLAAYIRMKYPHMVAGAIASSAPIWQFTGLTPCDVFSRIVTSDFEIE 189
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAM 296
+++C I+ SW I GL+ + +++C+ KN +++WLS + AM
Sbjct: 190 NKSCSFNIRRSWNIINNITSNEDGLKWITNEWKLCRPLKNSTDVRDLKNWLSDVYNNLAM 249
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLYGAASVYYNYSGTAKCFDLNG 355
+YP P+NFL PLP +P+++ CK + + TG D+ LY + +VY+NY+G++KC + +
Sbjct: 250 VNYPYPTNFLTPLPGYPIRKFCKKLQNTTATGKDLLNLLYKSVTVYFNYTGSSKCLNFDD 309
Query: 356 DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITT 415
+ G W +QACTEM+M + +FE E D C++ + V P+P I
Sbjct: 310 STPSLGADLWDYQACTEMVMPMCQNGNTDMFEPQEWDLLKFTHECEKKWKVTPKPYLIEK 369
Query: 416 EFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTK 475
+GG + ASNIIF NGL DPW+GGGVLK SKT+ ++ E AHH+DLR +T
Sbjct: 370 LYGGKDLST-----ASNIIFSNGLLDPWAGGGVLKVNSKTIKVVLIPEAAHHLDLRATTP 424
Query: 476 EDPQWLKDVRRREVEIIGKWISQY 499
DP + + R+ +II WI ++
Sbjct: 425 IDPLSVIESRKLYKKIIYLWIKEH 448
>gi|328778095|ref|XP_623670.3| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Apis mellifera]
Length = 492
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 197/461 (42%), Positives = 277/461 (60%), Gaps = 23/461 (4%)
Query: 52 LYKTKYHTQI------LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
L+ KY +I +DHF+++ TF+ RYLIN T W + NNAPIF YTGNEG+I
Sbjct: 41 LHSAKYRYEIKTIDMPVDHFSFS--VLNTFKLRYLINGT-WQKT-NNAPIFFYTGNEGNI 96
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
E FAQNTGFM+D+AP+F ALLVF EHRYYG+S+PY +Y + + GYL+S QALAD
Sbjct: 97 ETFAQNTGFMWDIAPEFGALLVFAEHRYYGESMPYNNK---SYADLNHLGYLTSQQALAD 153
Query: 166 YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
Y LI LK +SP++VFGGSYGGML+AW R+KYPH+ GA+ASSAPIL F I
Sbjct: 154 YVDLIQYLKSKPKYKNSPIIVFGGSYGGMLSAWIRIKYPHIVQGAIASSAPILQFTGITE 213
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN----L 281
SF I+T DF+ NC K+I+ SW I G + L +++C+ KN
Sbjct: 214 CESFLRIVTSDFKKAHSNCPKLIRKSWNIIMNITSTNEGKKWLSDNWKLCQPLKNENDIE 273
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI-DDPKTGNDVFAKLYGAASV 340
+ S+L + AM +YP +NFL PLPA+P+ +CK + ++ TG ++ + A ++
Sbjct: 274 QLISYLQDIYTNLAMVNYPYKANFLAPLPAYPINAVCKHLTNESLTGIELLIAIKNAINI 333
Query: 341 YYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
+ NY+ KC +LN + W +QACTEM+M D + +F+ + D + C
Sbjct: 334 FTNYTSETKCLNLNNSTPQLDAIGWSFQACTEMVMPICSDGINDMFKPHTWNLDEYSNDC 393
Query: 401 KEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
+ Y V P+PN I ++G + A+NI+F NGL DPWS GGVL+N+S + +A++
Sbjct: 394 IKQYSVKPQPNLICEKYGCKDLST-----ATNIVFSNGLMDPWSSGGVLRNLSSSAIAII 448
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
E AHH+DLR S DP + R+ I KWI++Y +
Sbjct: 449 IPESAHHLDLRSSNPNDPFSVVLARKYHRFFIKKWINEYHE 489
>gi|432889721|ref|XP_004075329.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Oryzias latipes]
Length = 504
Score = 369 bits (948), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 191/454 (42%), Positives = 275/454 (60%), Gaps = 21/454 (4%)
Query: 52 LYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN 111
+YKT Y Q ++HF + TF+QRYL+ D HW + + PI YTGNEGDI WF N
Sbjct: 51 IYKTFYFNQRINHFGFLEDG--TFKQRYLVADKHW--QEPDGPILFYTGNEGDITWFCNN 106
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
TGFM+++A + A+LVF EHRYYG+S+P+G + +Y ++ YL+S QALAD+A LI
Sbjct: 107 TGFMWEIAEELGAMLVFAEHRYYGESLPFGQD---SYSDSKHLNYLTSEQALADFAVLIQ 163
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+LK + +SPV+ GGSYGGML+AW R+KYP+V +GALA+SAPI F +V F
Sbjct: 164 NLKSKMP--ESPVISVGGSYGGMLSAWLRMKYPNVVVGALAASAPIWQFPRMVGCGDFYK 221
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN----LAIESWL 287
I+TQDF +NC I+ SWK I + GL+ L + F +C K + + WL
Sbjct: 222 IVTQDFAKSGQNCDVTIRRSWKAINNLSSTASGLQWLSEEFSLCSPLKTKTDAVIFKGWL 281
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK--AIDDPKTGNDVFAKLYGAASVYYNYS 345
+V AM DYP ++FL PLPA+P++ +CK A D + D+ + AA VYYNY+
Sbjct: 282 QETWVNLAMVDYPYEASFLQPLPAWPIQVVCKYLAFDSKASDYDLLHGVSQAAKVYYNYT 341
Query: 346 GTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
G++ C + + + G W +QACTEM+M D +FE + ++ A + C +
Sbjct: 342 GSSSCLNTSQTATSSLGALGWTYQACTEMVMPMCTDGVQDMFEPEDWNFQAFSDECYSMF 401
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
GV PR +W T +GG I SNIIF NG DPWS GGV NI++++++++ +G
Sbjct: 402 GVRPREDWAGTLYGGKDISS-----HSNIIFSNGGLDPWSAGGVTYNITESLVSIMIPDG 456
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
AHH+DLR+ST DP ++ R E++ +WI +
Sbjct: 457 AHHLDLRYSTDLDPPSVRAARALELKFFQEWIDK 490
>gi|395814761|ref|XP_003780910.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Otolemur
garnettii]
Length = 515
Score = 369 bits (946), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 186/451 (41%), Positives = 282/451 (62%), Gaps = 18/451 (3%)
Query: 61 ILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAP 120
++DHF +N S +TF QRYLI + +W K+ I YTGNEGDI WF NTGFM+DVA
Sbjct: 76 LVDHFGFN--SVKTFNQRYLIANEYW--KKDGGSILFYTGNEGDIVWFCNNTGFMWDVAQ 131
Query: 121 KFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTAT 180
+ KA+LVF EHRYYG+S+P+G N ++K++ YL+S QALAD+A LI L++ +
Sbjct: 132 ELKAMLVFAEHRYYGESLPFGKN---SFKDSRHLNYLTSEQALADFAELIRHLQETIPGV 188
Query: 181 -DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRS 239
+ PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI F+++V F I+T DF+
Sbjct: 189 KNQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTSDFKR 248
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK--SEKNLA-IESWLSTAFVYTAM 296
C + I+ SW I + + GL+ L + +C + +++ ++ W++ +V AM
Sbjct: 249 SGPYCSETIRKSWNVINQLSTTSRGLQYLTEVLHLCTPLTPRDVQHLKDWIAETWVNLAM 308
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCFDLNG 355
DYP PS+FL PLPA+P+K +C+ + +P + + + ++ A ++YYNYSG A+C +++
Sbjct: 309 VDYPYPSSFLQPLPAWPIKVVCQYLKNPNVSDSLMIQNIFQALNIYYNYSGEAQCLNVST 368
Query: 356 DSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWIT 414
+ + G W +QACTE++M + D +FE + + C + +GV P P+WIT
Sbjct: 369 TTTSNLGSLGWSYQACTEIVMPFCTNGIDDMFEPHAWNLKEFSDECFKQWGVRPSPSWIT 428
Query: 415 TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFST 474
T +GG I +NI+F NG DPWSGGGV ++++ T++A EGAHH+DLR S
Sbjct: 429 TMYGGKNI-----NSHTNIVFSNGELDPWSGGGVTEDLTDTLVAFTISEGAHHLDLRASN 483
Query: 475 KEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
DP ++ R E + + WI +++ Q
Sbjct: 484 ALDPISVQLARTLETKHVKNWIRDFYRSGLQ 514
>gi|328702869|ref|XP_001949478.2| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Acyrthosiphon
pisum]
Length = 507
Score = 367 bits (943), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 201/477 (42%), Positives = 275/477 (57%), Gaps = 41/477 (8%)
Query: 52 LYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN 111
+YKT+Y T +DHF++ TF+ +YLINDT W + N PIF Y GNEG IE F +N
Sbjct: 22 VYKTEYFTVPVDHFSFTNN--DTFRMKYLINDTFW--ERENGPIFFYAGNEGAIEMFCEN 77
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
TGFM+++A +F+AL+VF EHRYYG S+PYG ++ + GYL+S QALADY LI
Sbjct: 78 TGFMWEIAEEFRALVVFAEHRYYGVSMPYGNR---SFDDIGRVGYLTSQQALADYVDLIT 134
Query: 172 DLKKNLTATD-----------------------SPVVVFGGSYGGMLAAWFRLKYPHVAI 208
L+ N + ++ +PV+ FGGSYGGMLAAWFR+KYP +
Sbjct: 135 YLRHNGSYSNRELYQTGDIYDTAVGDTPAPSASNPVIAFGGSYGGMLAAWFRIKYPAIIE 194
Query: 209 GALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKL 268
GA+ASSAPI F + +F + + + S C I SWK I+ K G L
Sbjct: 195 GAIASSAPIWQFTGMTPCNAFYRVTSSVYTDTSAECGLTILASWKAIDNVTKTVDGKAWL 254
Query: 269 QKAFRICKSEKN----LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP 324
+ + +C N ++ W+S +V AM +YP P+NFL PLP PVKE+CK + +
Sbjct: 255 SQKWNLCSPLTNDNDVSILKEWVSELYVNLAMINYPYPANFLTPLPGHPVKEVCKPMKNH 314
Query: 325 KTGN-DVFAKLYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNK 382
K + + ++ SVY+NY+GT+KC D+ S P G W +Q+CTE++M +
Sbjct: 315 KEDDFTLLGSVFRGLSVYFNYTGTSKCLDILTSSAPTLGEKGWSYQSCTEIVMPMCSNGI 374
Query: 383 DSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDP 442
IFE+ +++ ARYC E +GV P I +GG + ASNIIF NGL DP
Sbjct: 375 KDIFEKKPWNFEENARYCFETFGVQPSIYAIEKTYGGKNLNA-----ASNIIFSNGLLDP 429
Query: 443 WSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
WS GGVL+NISKTVLA+V E AHH+DLR S +DP+ + R+ I KWI Y
Sbjct: 430 WSSGGVLQNISKTVLAVVIPESAHHLDLRASHAQDPESVIKARKFYKNWIKKWIFHY 486
>gi|224109990|ref|XP_002333167.1| predicted protein [Populus trichocarpa]
gi|222835050|gb|EEE73499.1| predicted protein [Populus trichocarpa]
Length = 514
Score = 361 bits (927), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 208/519 (40%), Positives = 291/519 (56%), Gaps = 76/519 (14%)
Query: 17 TLTISNAKIFPTFPSS-----RITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQS 71
T + AK T P R+ ++ S D +G ++T ++ Q LDHFNY P+S
Sbjct: 18 TTATATAKRLNTIPRHSPIGPRVWRDQPDKTTSCEVDEEG-FETCFYNQTLDHFNYRPES 76
Query: 72 YQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEH 131
Y TF QRY+IN +WGG+ NA I VY G E I+ + GF+ D A +FK+LLV IEH
Sbjct: 77 YDTFPQRYVINSKYWGGA--NASILVYLGAEASIDRYLDAGGFLVDNAVQFKSLLVVIEH 134
Query: 132 RYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSY 191
RYYG+SIP G + GY +S QALADYA++II +KK L A SPV+V GGSY
Sbjct: 135 RYYGQSIPPG--------SWGKRGYFNSAQALADYAAIIIHIKKTLRAQYSPVIVIGGSY 186
Query: 192 GGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGS 251
GGMLA+WFRLKYPH+A+GALASSAPIL FD+I ++ +++T+ FR SE CY+ IK S
Sbjct: 187 GGMLASWFRLKYPHIALGALASSAPILYFDDITPQDAYYSVVTKAFREASETCYQTIKTS 246
Query: 252 WKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLP 310
W +I+E A KP GL L F CK+ + + ++ +L + Y A + P P
Sbjct: 247 WSEIDELASKPDGLSMLSTKFNTCKNLTDASELKDYLRLMYAYAAQNNSP---------P 297
Query: 311 AFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDS---DPHGLSEWGW 367
+PV E+CK IDD +G+D+ ++++ YY G C+ N ++ +W W
Sbjct: 298 TYPVNEVCKGIDDDASGDDILSRIFRGVVAYY---GNRTCY-FNNNAYAYQSEATLDWSW 353
Query: 368 QA--------CTEMIMLTG--------------GDNKDSIFEESEEDYDARARYCKEAYG 405
Q C+ I L NK ++ +E +R C+ A+
Sbjct: 354 QQSLSFSYIDCSLGISLITINKYKKTYNPIIWPSKNKPTLNQE-------LSRPCRNAFM 406
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
+ I L+L+RF SNIIF NGLRDP+S GGVL NIS +++A+ G+
Sbjct: 407 IP-------------DIKLILQRFGSNIIFSNGLRDPYSSGGVLNNISDSIVAVTTVNGS 453
Query: 466 HHVDL-RFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
H +D+ R + DP WL R++EVEII WI+QY++DL
Sbjct: 454 HCLDIQRANPSTDPDWLVMQRKKEVEIIEGWITQYYEDL 492
>gi|198429605|ref|XP_002127825.1| PREDICTED: similar to prolylcarboxypeptidase [Ciona intestinalis]
Length = 494
Score = 360 bits (925), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 192/465 (41%), Positives = 285/465 (61%), Gaps = 17/465 (3%)
Query: 46 SKDSQGLYKTK-YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD 104
+K+ +Y T+ Y Q LDHF++ + TF QRY +++ +W +K + PIF YTGNEGD
Sbjct: 24 TKNEGYIYPTELYFKQNLDHFDFTINA--TFTQRYFVSEQYW--TKMDGPIFFYTGNEGD 79
Query: 105 IEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALA 164
IE F +NTG M+D+AP FKA++VF EHRYYGKS P+ GN + + K YL++ QALA
Sbjct: 80 IELFIKNTGLMWDIAPMFKAMVVFAEHRYYGKSKPF-GNLKPSTKTIKEFSYLTAEQALA 138
Query: 165 DYASLIIDLKK-NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
D+A L+ +K + A +SPVVVFGGSYGGML+AWFRLKYPH+ GA+A+SAP+L F +
Sbjct: 139 DFAILVKHIKSTDSKAKNSPVVVFGGSYGGMLSAWFRLKYPHIVTGAIAASAPVLYFPST 198
Query: 224 VSPYSFSNIITQDFRSVS--ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKN 280
V ++ +T +F SV E C I+ WK + ETAKKPGGL+ L + F +C + +
Sbjct: 199 VKCSQYNEAVTNNFLSVQNGETCVANIRNVWKTMNETAKKPGGLKLLGEIFHLCSAINSS 258
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLYGAAS 339
A+ES++ F AM DYP +NFL+ +PA+PV + C+ + +P G D+ ++ A
Sbjct: 259 TAVESFIKDIFGNMAMVDYPYANNFLSNIPAWPVNKTCQHLSEPNLQGLDLLQAMHSAIG 318
Query: 340 VYYNYSGTAKCFDL-NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
VY NY+G+ KC+++ ++ + W + C M+M + +F ++ +
Sbjct: 319 VYQNYTGSVKCYNVKTTETSKLSTTLWNYMTCGAMVMPFCANGVTDMFPVKNWTQESFDK 378
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
C + YG+ RP W T+FGG K A NI+F NGL DPW GGV + S++V++
Sbjct: 379 SCFKKYGIKSRPEWALTDFGGSK-----AVEAGNIVFTNGLLDPWHVGGVPEMKSESVVS 433
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
++ AHH+DLR + DP + + R+ +V+ I KWIS + L
Sbjct: 434 ILMWGAAHHLDLRHANDADPSSVVEARKTQVKHIAKWISSTQERL 478
>gi|195998996|ref|XP_002109366.1| hypothetical protein TRIADDRAFT_21030 [Trichoplax adhaerens]
gi|190587490|gb|EDV27532.1| hypothetical protein TRIADDRAFT_21030 [Trichoplax adhaerens]
Length = 469
Score = 360 bits (923), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 197/465 (42%), Positives = 271/465 (58%), Gaps = 22/465 (4%)
Query: 41 SLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTG 100
+L++ + YKTKY QI+DHF++ +S T++QRYL+ND HW K PIF YTG
Sbjct: 18 TLLAQGTQAAVKYKTKYFDQIIDHFDW--KSNATYRQRYLMNDDHW--DKGTGPIFFYTG 73
Query: 101 NEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSST 160
NEG I F QN+G ++D+AP+F+AL+VF EHRYYGKS+P+G + ++K G L+S
Sbjct: 74 NEGGIVGFWQNSGLLFDLAPQFRALIVFGEHRYYGKSLPFGKD---SFK-PKNLGLLTSE 129
Query: 161 QALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
QALADYA L+ LKK+L A VV FGGSYGGML AW RLKYP++ LA+SAP+
Sbjct: 130 QALADYAVLLTSLKKSLNANKCKVVAFGGSYGGMLTAWMRLKYPNIIDAGLAASAPLYMA 189
Query: 221 DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN 280
+VSP F +T+D++ + C I+ ++ + E AK +K+ K F +C K
Sbjct: 190 GGVVSPNFFFPAVTKDYQDANPKCVPNIRKAFSAVLEMAKSKSK-QKVAKIFNVCNKLKT 248
Query: 281 LA----IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI---DDPKTGNDVFAK 333
A + W+ FV AM DYP P++F PLPAFPV CK I P G K
Sbjct: 249 SADVKQLIGWIRNGFVSMAMGDYPYPASFFGPLPAFPVNASCKYIVNASHPIQGMAKAMK 308
Query: 334 L-YGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEED 392
L YG+ + Y C D G W +QACTE+++ +NK +F
Sbjct: 309 LFYGSKKCHDIYKQYVHCADPTGCGTGASAIPWDYQACTEILLPGSTNNKTDMFPPIPFT 368
Query: 393 YDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
R +YC + YGV PRPNW+ T+F +++ + ASNIIF NG DPW GG+LK+
Sbjct: 369 SKIRKQYCLKKYGVTPRPNWVATQFWANRL-----KGASNIIFSNGNLDPWKNGGILKSP 423
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
S +++A+ GAHH+DLR K DP + R+ E ++I KWIS
Sbjct: 424 SSSLVAIQIPHGAHHLDLRGKNKNDPASVIKARKMEAKLIKKWIS 468
>gi|332025885|gb|EGI66041.1| Lysosomal Pro-X carboxypeptidase [Acromyrmex echinatior]
Length = 484
Score = 360 bits (923), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 194/456 (42%), Positives = 277/456 (60%), Gaps = 25/456 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+ K +DHF++ Q TF+ RYLINDT W + NAPIF YTGNEG+IE FAQNT
Sbjct: 45 YEIKTFDVRVDHFSFAVQD--TFKLRYLINDT-WR-KQQNAPIFFYTGNEGNIEVFAQNT 100
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF++++APKF AL++F EHRYYG+S+PYG ++ N GYL+S QALADY LI
Sbjct: 101 GFLWEIAPKFDALVIFAEHRYYGESLPYGNQ---SFANLQHRGYLTSQQALADYVELIAH 157
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK SPV+VFGGSYGGML+AW R+KYPHV GA+ASSAP+L F ++V F+ I
Sbjct: 158 LKSQPRYEHSPVIVFGGSYGGMLSAWMRMKYPHVVQGAIASSAPLLQFTDVVDCEVFARI 217
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAIESWLS 288
T D+++ + C K+I+ SW I G + L +++C+ +E ++++L
Sbjct: 218 TTSDYKAANPTCSKLIQKSWNTITNVTSNDEGKKWLSDNWKLCEPLKTAEDVKTLKNFLQ 277
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLYGAASVYYNYSGT 347
++ AM +YP +NFL PLP P+ C+ + + TG + L+GA SVY NY+G
Sbjct: 278 EVYIDLAMVNYPYETNFLAPLPGNPINVFCQHLTNVSLTGKPLLLALHGAISVYTNYTGK 337
Query: 348 AKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
C + +++P GL + GW QACTEM+M D + +FE + + C + Y
Sbjct: 338 TNCISMK-NAEP-GLDDQGWDYQACTEMVMPICTDGINDMFEPVKWNITDYNNICFKKYS 395
Query: 406 VDPRPNWITTEFGGHKIGLVLKRF--ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKE 463
V P+P + E+G K F ASNIIF NGL DPW+ GGVL+N+S++ +A++ +
Sbjct: 396 VSPQPYLVCEEYG-------CKNFNSASNIIFSNGLLDPWASGGVLRNLSQSAIAILMPD 448
Query: 464 GAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
AHH+DLR + DP ++ R+ I +WI +Y
Sbjct: 449 AAHHLDLRETNSNDPYSVRMTRKFHQFWIYQWIREY 484
>gi|61651780|ref|NP_001013333.1| dipeptidyl peptidase 2 precursor [Danio rerio]
gi|60416102|gb|AAH90719.1| Dipeptidyl-peptidase 7 [Danio rerio]
Length = 500
Score = 359 bits (922), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 194/474 (40%), Positives = 280/474 (59%), Gaps = 29/474 (6%)
Query: 37 EKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIF 96
E S+ I+S +D +K KY QILDHFNYN T+ QRYLI D +W K PIF
Sbjct: 34 ENYSNEITSHRDLSLDFKEKYFKQILDHFNYNSLGNGTYDQRYLITDKYW--KKGYGPIF 91
Query: 97 VYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY 156
YTGNEGDI FA+N+GFM ++A ALL+F EHRYYGKS+P+G N ++K G
Sbjct: 92 FYTGNEGDISEFARNSGFMVELAAAQGALLIFAEHRYYGKSLPFGKN---SFK-IPEVGL 147
Query: 157 LSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP 216
L+ QALADYA +I +LK+ L PV+VFGGSYGGML+ + R++YP++ GALA+SAP
Sbjct: 148 LTVEQALADYAVMITELKEELGGQTCPVIVFGGSYGGMLSVYMRIRYPNIVAGALAASAP 207
Query: 217 ILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
IL+ + P F +T DF + C ++G+++++ A++ + ++Q AF +CK
Sbjct: 208 ILSTAGLGDPRQFFQDVTADFEKFNPACRNAVQGAFQKLNTLAQQKDYI-RIQSAFSLCK 266
Query: 277 ---SEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA 332
S K++ + +L AF AM DYP ++F+ +PAFPVK C+ + + G D+ +
Sbjct: 267 TPSSPKDIHQLNGFLRNAFTMMAMLDYPYSTHFMGSMPAFPVKVACEIMLN---GTDLMS 323
Query: 333 KLYGAASVYYNYSGTAKCFDLNG----DSDPHGLS------EWGWQACTEMIMLTGGDNK 382
L + YN +G C+DL +DP G W +QACTE+ M +N
Sbjct: 324 ALRDTVGIVYNNTGELTCYDLYSLYVECADPTGCGLGFNSYAWDYQACTEIEMCFESNNV 383
Query: 383 DSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDP 442
+F R +YC +GV PRP W+ T+F G+ + ASNIIF NG DP
Sbjct: 384 TDMFPAMPFTEQQREQYCSNRWGVVPRPGWLKTQFWGNDLST-----ASNIIFSNGDLDP 438
Query: 443 WSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
W+ GG+ K++S +++A+ EGAHH+DLR S DP+ + R++E EII +W+
Sbjct: 439 WANGGIRKSLSPSLIAITIPEGAHHLDLRESNPADPESVIVARKKEAEIIAQWV 492
>gi|390335012|ref|XP_797434.3| PREDICTED: dipeptidyl peptidase 2-like [Strongylocentrotus
purpuratus]
Length = 487
Score = 357 bits (916), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 201/508 (39%), Positives = 289/508 (56%), Gaps = 54/508 (10%)
Query: 10 FCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNP 69
CLL S L + +A P +P YK K+ Q +DHFN
Sbjct: 13 LCLLIS--LLVCSANPLPRYP----------------------YKEKFFDQFIDHFNSES 48
Query: 70 QSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFI 129
QTF+QRY + D +W K + PIF YTGNEG IE F NTG+++D+AP+F AL++F
Sbjct: 49 HGKQTFRQRYFVTDDYW--QKGSGPIFFYTGNEGAIESFFDNTGYIFDIAPEFGALVIFA 106
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGG 189
EHRYYGKS+P G N+ N G L+ QALADYA+LI LK+ DSP+VVFGG
Sbjct: 107 EHRYYGKSLPLG-NQSFTPAN---LGLLTVEQALADYATLITSLKEEPGLQDSPLVVFGG 162
Query: 190 SYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIK 249
SYGGML+A+ R+KYPH+ GALA+SAP+ + N S F +T+DF V NC +++
Sbjct: 163 SYGGMLSAYMRMKYPHLVAGALAASAPVYSTANESSQSYFFQTVTKDFYDVDTNCPDLVR 222
Query: 250 GSWKQIEETA-KKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAMTDYPTPSN 304
+ +++ A + GL K+ AF +CKS + A + W+ +F Y AM DYP ++
Sbjct: 223 AGFATLDQLAGQGTSGLNKISTAFHLCKSLVSKADYYHLLGWIRNSFTYQAMLDYPYSAS 282
Query: 305 FLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA-KCFDLNGD----SDP 359
F+ LPA PV C + + T + A+ A+ +YYN S T +CFD+ + +DP
Sbjct: 283 FMGSLPANPVNVSCHMLLNASTPLEGLAQ---ASGLYYNTSSTPLQCFDIFAEFIQCADP 339
Query: 360 HGL------SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWI 413
G + W +QACTE+ + G + K +F + D R YC+ Y V PRP+W+
Sbjct: 340 TGCGLGTDSTAWDYQACTEITLPAGSNGKTDMFPDMPFTPDMRTSYCQSVYNVTPRPDWL 399
Query: 414 TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFS 473
+ + G GL +SNIIF NGL DPW GGV K++S++++A+ GAHH+DLR S
Sbjct: 400 SIQ--GFGKGLAS---SSNIIFSNGLLDPWRLGGVAKDLSESIIAIPVPGGAHHLDLRGS 454
Query: 474 TKEDPQWLKDVRRREVEIIGKWISQYFQ 501
+DP+ + + R +E II W+ +++
Sbjct: 455 NPKDPESVIEARSQEKLIIKGWVEAWYR 482
>gi|347964904|ref|XP_309189.4| AGAP000994-PA [Anopheles gambiae str. PEST]
gi|333466532|gb|EAA04920.4| AGAP000994-PA [Anopheles gambiae str. PEST]
Length = 481
Score = 355 bits (911), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 191/474 (40%), Positives = 278/474 (58%), Gaps = 22/474 (4%)
Query: 39 LSSLISSSKDSQG--LYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKN--NAP 94
++ L ++ +D +Y+TK +DHF Y ++ TF+ RYLINDT+ G + +P
Sbjct: 10 IAGLSAAVRDGTATYVYQTKTIDVPIDHFTYTGEA--TFKLRYLINDTYAPGGADLPASP 67
Query: 95 IFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT 154
I Y GNEGDIE FAQNTGFM+++APK KA L+F+EHR+YG S+P+G +Y +
Sbjct: 68 ILFYAGNEGDIELFAQNTGFMWELAPKLKATLLFVEHRFYGHSLPFG---NASYDSPKNL 124
Query: 155 GYLSSTQALADYASLIIDLK-KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALAS 213
GYL+S QALAD+A ++ L N T PV+ FGGSYGGMLAAW R+KYPH+ GA+A+
Sbjct: 125 GYLTSEQALADFALVLRTLNPPNGTTRARPVIAFGGSYGGMLAAWIRIKYPHLVAGAIAA 184
Query: 214 SAPILNFDNIVSPYSFSNIITQDFR-SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
SAP+ F + F+ I+T ++ + + +C I+ SW ++ + GL L + F
Sbjct: 185 SAPVRQFAGVTDCGIFNQILTSVYQVAYTADCADNIRRSWTTLQNYSTSADGLRLLNEKF 244
Query: 273 RICK-----SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG 327
+ C ++ + +L+ + AM +YP PS+FL P+PA+PV+E C + TG
Sbjct: 245 KFCTNLTKGTDVTETLFDYLTDVYGNLAMINYPYPSSFLAPVPAYPVREFCGRLAQNYTG 304
Query: 328 NDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSI 385
++ L A S+YYNY G A C ++N D G+S+ GW QACTEM+M D +
Sbjct: 305 VELLDHLQSALSIYYNYDGKAACLNINSSYDGTGISDRGWDFQACTEMVMPICADGVHDM 364
Query: 386 FEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSG 445
F + + A C + YGV PRP +GG L +NI+F NGL DPWSG
Sbjct: 365 FPPQQWNMQTYADKCFKKYGVHPRPANALLNYGGE----FLDASITNIVFSNGLLDPWSG 420
Query: 446 GGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
GGVL++ ++ V ++ EGAHH+DLR S DP + R V+ I KW+++Y
Sbjct: 421 GGVLRSSNENVKIVLIPEGAHHLDLRASNPADPASVTRARAVHVQNIQKWLTEY 474
>gi|193718325|ref|XP_001949511.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Acyrthosiphon
pisum]
Length = 469
Score = 352 bits (904), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 196/459 (42%), Positives = 271/459 (59%), Gaps = 27/459 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y T Y T +DHF+Y TF+ +YLIND +W N PIF YTGNEG IE F NT
Sbjct: 22 YTTNYITVPVDHFSYTND--DTFELKYLINDKYW--DVNKGPIFFYTGNEGRIEDFCDNT 77
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM++++ +FKAL+VF EHRYYG+S+PYG N ++ + GYL+S QA+AD+ LI
Sbjct: 78 GFMWEISREFKALVVFAEHRYYGESMPYGVN---SFDDKEKLGYLTSQQAIADFVDLIKY 134
Query: 173 LKKNLTATD---SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
L+ + + +PV FGGSYGGMLAAWFR+KYP GA+ASSAPI F + F
Sbjct: 135 LRDDALSVGRRPNPVFAFGGSYGGMLAAWFRIKYPAYVEGAIASSAPIWQFTGMTPCNDF 194
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAIES 285
+ + +R+ S C I SWK I + G L +++CK S+ ++
Sbjct: 195 YKVTSSVYRNASAECGLTISASWKAINNVTESDSGKTWLTDNWKLCKPLKDSDDVARLKY 254
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI----DDPKTGNDVFAKLYGAASVY 341
W + +V AM +YP +NFL PLPA P+KE+CK++ +D KT + ++ SVY
Sbjct: 255 WATDLYVALAMVNYPYEANFLGPLPANPIKEVCKSMTNHTEDDKT---LLMSVFHGLSVY 311
Query: 342 YNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIM-LTGGDNKDSIFEESEEDYDARARYC 400
+NY+G+A C +L+ + ++ W +QACTEMIM L +D IFE D+ + A YC
Sbjct: 312 FNYTGSASCLNLSSAFSDNTMNGWDYQACTEMIMPLCTNGGEDDIFEAYPWDFKSYAEYC 371
Query: 401 KEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
+ Y V P + + ++GG + + ASNIIF NGL DPWSGGGVLK+IS +V AL+
Sbjct: 372 ENRYDVIPTTDDVEKQYGGKNL-----KAASNIIFSNGLLDPWSGGGVLKSISSSVRALL 426
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+GAHH+DLR S D + R+ I KWI Y
Sbjct: 427 IPDGAHHLDLRASNPNDTSSVIHARKTIKHWITKWIFDY 465
>gi|440799092|gb|ELR20153.1| dipeptidyl-peptidase family protein [Acanthamoeba castellanii str.
Neff]
Length = 498
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 193/468 (41%), Positives = 269/468 (57%), Gaps = 39/468 (8%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHW-----GGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
Q +DHFN Q TF+QRYL +W GG PIF YTGNEGDI F N+GF
Sbjct: 38 QTIDHFNIETQP-ATFRQRYLTFSNYWSSANHGGELRRGPIFFYTGNEGDITAFWDNSGF 96
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
++++A + AL+VF EHRYYGK+ P+G +Y GYLS QALADYA+LI LK
Sbjct: 97 VFELAKSYGALVVFGEHRYYGKTYPFGSGGPDSYSK-EHIGYLSVEQALADYATLIEHLK 155
Query: 175 KNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS-------- 225
L A+ SPV+ FGGSYGGML+AWFR+KYP V GALA+SAPIL N+ S
Sbjct: 156 STLPGASHSPVIAFGGSYGGMLSAWFRMKYPQVVDGALAASAPILWSTNVSSATTGPDSK 215
Query: 226 -PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIE 284
P + +T DFR+ E C +++ ++ ++ + A+ P GL + K F +CK+ +E
Sbjct: 216 RPPGYFETVTNDFRAADERCPGLVQQAFAKMLQLAQTPSGLAAIAKQFSLCKNVLPHEVE 275
Query: 285 S---WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVY 341
W+ AF AM DYP P+ F+ PLPA+P+K C + + +DV L AA +Y
Sbjct: 276 HLILWVVNAFGNMAMMDYPYPTGFMAPLPAYPIKVACHLMLN--NTDDVLRGLAQAAGLY 333
Query: 342 YNYSGTAKCFDL-----------NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESE 390
YN S +CF++ + P G S W +QAC E++ +N +F +
Sbjct: 334 YNSSSPLQCFNIWDEFVECADQTGCGTGPAGQS-WDYQACGEIVYYPNTNNVTDMFPPRD 392
Query: 391 EDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
+C+ +G+ PRP W+ T GG I R+AS IIF NGL DPW GGG L+
Sbjct: 393 WTLADLNAHCQRTWGITPRPTWLKTYTGGENI-----RYASRIIFSNGLLDPWHGGGFLE 447
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
++S +++A++ K+GAHH+DLR S DP + + R E IIGKW+++
Sbjct: 448 SLSDSLIAIIIKDGAHHLDLRSSDPRDPPSVVEARNHEARIIGKWLAE 495
>gi|91088397|ref|XP_972807.1| PREDICTED: similar to prolylcarboxypeptidase [Tribolium castaneum]
gi|270012192|gb|EFA08640.1| hypothetical protein TcasGA2_TC006303 [Tribolium castaneum]
Length = 478
Score = 350 bits (898), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 189/456 (41%), Positives = 281/456 (61%), Gaps = 25/456 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+TKY +LDHF++ + TF+ +YLINDT W N+ PIF YTGNEG +E FA+NT
Sbjct: 20 YETKYFEVLLDHFSFTNNA--TFKLKYLINDTFW---TNDGPIFFYTGNEGTVENFAENT 74
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+D+AP F AL+VF EHRYYG+S+P+G + ++ + S GYL+S+QALAD+ L I+
Sbjct: 75 GFMFDIAPSFNALVVFAEHRYYGESLPFGND---SFVSPSHIGYLTSSQALADFVDL-IN 130
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+ ++ PV+ FGGSYGGMLA+W R+KYP +GA+A+SAPI F+ F +
Sbjct: 131 YLQTMSLEKVPVIAFGGSYGGMLASWLRMKYPASVVGAIAASAPIWQFETPCE--DFYKV 188
Query: 233 ITQDFR-SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIES---WL 287
+T+ ++ +V+++C +I SW + ++ P G L A+++C E + +E+ W
Sbjct: 189 VTRVYQEAVAKDCPLLITKSWTALRNISESPEGKAWLSDAWQLCSPLETSADVETLIGWY 248
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND--VFAKLYGAASVYYNYS 345
S V AM +YP ++FL PLP FPVK C + +D + L A +Y N++
Sbjct: 249 SEILVNMAMVNYPYSTSFLAPLPPFPVKTFCSQLTQANIVDDKSLVMALGDALQIYTNFT 308
Query: 346 GTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
T C +N ++ G W +QACTEMIM + D +FE DY A C E +G
Sbjct: 309 ETTTCNKINQTAEALGEEGWYFQACTEMIMPMCSIDGD-MFENDPWDYGKYASQCFEKWG 367
Query: 406 VD-PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
V+ P E+GG +I + ASNI+F NGL DPWS GGVLKN+S++V++++ +G
Sbjct: 368 VNQTHPELPVLEYGGKEI-----KAASNIVFSNGLLDPWSSGGVLKNVSESVVSVIIPDG 422
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
AHH+DLR K+DP+ + + R+ V+ I KWI +++
Sbjct: 423 AHHIDLRGGNKDDPETVIEARQFHVDNIKKWIMEFY 458
>gi|50540452|ref|NP_001002694.1| lysosomal Pro-X carboxypeptidase precursor [Danio rerio]
gi|49903247|gb|AAH76507.1| Prolylcarboxypeptidase (angiotensinase C) [Danio rerio]
Length = 490
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 199/454 (43%), Positives = 271/454 (59%), Gaps = 22/454 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y T Y Q +DHF + TF+QRYL+ND HW K+ PI YTGNEGDI WF NT
Sbjct: 47 YNTLYIDQQIDHFGFLENG--TFKQRYLLNDQHW--HKDGGPILFYTGNEGDITWFCNNT 102
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + ALLVF EHRYYG+S+P+G E +Y NA YL+S Q LAD+A LI
Sbjct: 103 GFMWDVAEELGALLVFAEHRYYGESLPFG---EESYSNAKYLNYLTSEQVLADFAVLIKA 159
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LKK+ A S V+ GGSYGGMLAAW R+KYP+ +GALA+SAPI F IV F
Sbjct: 160 LKKSQPGAEKSSVIAIGGSYGGMLAAWLRMKYPNAVVGALAASAPIWQF--IVPCGEFYR 217
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNL-AIESWL 287
++T+DF NC I+ SW I+ + GL+ L + F +C KS++++ ++WL
Sbjct: 218 VVTRDFTISGSNCSSSIRSSWAAIDRLSATGEGLQWLSQTFGLCGPLKSQEDVFGFKAWL 277
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND--VFAKLYGAASVYYNYS 345
+V AM DYP ++FL PLPA+PVK +CK + K +D + + A VYYNY+
Sbjct: 278 QETWVNLAMVDYPYEADFLQPLPAWPVKVVCKNLQFNKGVSDKQLLNGVSQAVRVYYNYT 337
Query: 346 GTAKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
G A C + + + + G W +Q+CTEM+M D + +FE + A + C +
Sbjct: 338 GDAVCLNTSKTATGNLGFLGWFYQSCTEMVMPMCTDGINDMFEPQPWSFQAFSDECYNQF 397
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
GV PR +W T +GG I SNIIF NG DPW GGV K++S+++LA++ G
Sbjct: 398 GVRPREDWAETVYGGRNI-----HAHSNIIFSNGNLDPWMSGGVTKSLSESLLAIMIDGG 452
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
AHH+DLR++ + DPQ + R EV+ +WI
Sbjct: 453 AHHLDLRYNNELDPQSVIKARSLEVQYFKQWIKH 486
>gi|307189813|gb|EFN74085.1| Lysosomal Pro-X carboxypeptidase [Camponotus floridanus]
Length = 453
Score = 348 bits (894), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 185/443 (41%), Positives = 267/443 (60%), Gaps = 16/443 (3%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF++ Q TF RYLINDT W + NAPIF YTGNEG IE FA+NTGF++++APK
Sbjct: 18 VDHFSFAVQ--DTFNLRYLINDT-WCKTVKNAPIFFYTGNEGRIELFAENTGFLWEIAPK 74
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATD 181
F AL++F EHRYYG+S+PYG ++ N GYL+S QALADY LI L+
Sbjct: 75 FGALVIFAEHRYYGESLPYGNQ---SFANPRYLGYLTSQQALADYVELIGYLRSKEGFEF 131
Query: 182 SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS 241
SPV+VFGGSYGGML+AW R+KYPH+ GA+A+SAPIL F +IV +F+ I T D+ +
Sbjct: 132 SPVIVFGGSYGGMLSAWMRIKYPHIVQGAIAASAPILQFTDIVECDAFARIATSDYYVSN 191
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNLAI-ESWLSTAFVYTAMT 297
C ++I+ +WK I E G + L +++C K+E+++ + + +L ++ AM
Sbjct: 192 PTCPQLIRKAWKTITEVTSNDEGKKWLSSNWKLCEPLKTEEDVQVLKDFLQEIYINMAMV 251
Query: 298 DYPTPSNFLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLYGAASVYYNYSGTAKCFDLNGD 356
+YP +NFL PLPA P+ C+ + + TG + + LY A +VY NY+G A C
Sbjct: 252 NYPYETNFLAPLPANPINVFCQHLTNSSLTGKPLLSTLYRAINVYTNYTGKANCTFTKNT 311
Query: 357 SDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE 416
+ G W +QACTEM+M D + +FE + ++D C + Y V +P+ +
Sbjct: 312 TPTLGDEGWDYQACTEMVMPMCTDGTNDMFEPATWNFDDYNNTCFKKYSVSSQPHLACQQ 371
Query: 417 FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKE 476
+G + V +NI F NGL DPW+ GGVL+N+S + A++ + AHH+DLR S
Sbjct: 372 YGCKNLDTV-----TNIHFSNGLLDPWTSGGVLRNLSSSATAIIIPDAAHHLDLRESNSN 426
Query: 477 DPQWLKDVRRREVEIIGKWISQY 499
D + R+ I KWI++Y
Sbjct: 427 DSYDVILTRKFHQYSIKKWIAEY 449
>gi|195437264|ref|XP_002066561.1| GK24560 [Drosophila willistoni]
gi|194162646|gb|EDW77547.1| GK24560 [Drosophila willistoni]
Length = 480
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 189/467 (40%), Positives = 269/467 (57%), Gaps = 18/467 (3%)
Query: 40 SSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYT 99
+ +++S + Y+ K LDHF++ + +F RYL N++ +PIF YT
Sbjct: 21 NGFVTASDNQPFKYEIKEFQVPLDHFSFLKNA--SFNIRYLYNNSFADKGNKRSPIFFYT 78
Query: 100 GNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSS 159
GNEGDIEWFAQNTGF++++A K A++VF EHRYYGKS+P+G N Y +
Sbjct: 79 GNEGDIEWFAQNTGFLWELAEKQGAVVVFAEHRYYGKSLPFGPNT-FNKTMPENLAYFTV 137
Query: 160 TQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILN 219
Q L D+A LI LK D PVV FGGSYGGMLAAWFR+KYPH+ IG+LA+SAPIL
Sbjct: 138 EQTLEDFALLITYLKN---GADLPVVAFGGSYGGMLAAWFRMKYPHIVIGSLAASAPILQ 194
Query: 220 FDNIVSPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSE 278
F I F+ I T F + + NC I SWK IE A G +++ AF +C
Sbjct: 195 FPGITPCDIFNKITTSVFHTAYNGNCTVNIGKSWKAIENVASTDAGKKQISDAFHLCNPL 254
Query: 279 KN----LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD-PKTGNDVFAK 333
K + ++ + AM +YP ++FL PLPA+PV++MC + + KT D+
Sbjct: 255 KGPDDLTKLLDYIELVYGNLAMANYPYNNSFLAPLPAYPVRQMCFYLKELHKTDADLLQA 314
Query: 334 LYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDY 393
+ A +VY NY+GT KC D++ +S+ S W Q+C +M+M ++ D+++ S D+
Sbjct: 315 MANALAVYTNYTGTVKCLDISSNSNADD-SGWNIQSCNQMVMPFCSNSSDTMYRTSTWDF 373
Query: 394 DARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS 453
+ C Y + P+PN I +GG + + SNIIF NGL DPWSGGGVL+ +
Sbjct: 374 KEVSENCVRDYHLTPKPNDIILRYGGRDLSSI-----SNIIFSNGLLDPWSGGGVLQAPN 428
Query: 454 KTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
V ++ EGAHH+DLR S DP + D R++E II WI +++
Sbjct: 429 DRVHVIIIPEGAHHLDLRKSEPADPPSVIDARQKEATIIASWIEEFY 475
>gi|195385893|ref|XP_002051639.1| GJ16686 [Drosophila virilis]
gi|194148096|gb|EDW63794.1| GJ16686 [Drosophila virilis]
Length = 469
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 191/455 (41%), Positives = 258/455 (56%), Gaps = 19/455 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y K + LDHF++ S ++F RYL ND++ + + +PIF YTGNEGDIEWFAQN+
Sbjct: 26 YAIKEFSVPLDHFSF--LSNESFSIRYLYNDSYVDKANSQSPIFFYTGNEGDIEWFAQNS 83
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF++++A K +AL+VF EHRYYGKS+P+GG+ K Y + Q L DYA LI
Sbjct: 84 GFIWELAAKLRALVVFAEHRYYGKSMPFGGDTFNTSK-PEHLAYFTVEQTLEDYALLITF 142
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+ PVV FGGSYGGMLAAWFR+KYPH+ IGALA+SAPIL F + F I
Sbjct: 143 LR---NGQQLPVVAFGGSYGGMLAAWFRMKYPHIVIGALAASAPILQFPGLTPCDIFEKI 199
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN----LAIESWL 287
T F + + NC I SWK IE A G ++L F +C KN +L
Sbjct: 200 TTSVFETAYNANCSANIGKSWKAIESLAATDAGKKQLGDMFHLCDPVKNGDDLNKFMDYL 259
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSG 346
+ AM +YP S FL PLPA+PV++ C + D + + D+ A L A VY NY+G
Sbjct: 260 EEVYGNLAMVNYPYNSTFLAPLPAYPVRQFCYYLKDLQPNDTDLLASLASALGVYTNYTG 319
Query: 347 TAKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
+KC D + GW Q C +M+M +N D+++ S D + C +
Sbjct: 320 ASKCVDYKDSNSGANYDSRGWDIQTCNQMVMPFCANNSDTMYRTSTWDLKKFSEKCYTKF 379
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
+ P+P+ I +GG + ASNIIF NGL DPWSGGGVL+ + V ++ EG
Sbjct: 380 HLTPKPHDILLRYGGRNL-----EAASNIIFSNGLLDPWSGGGVLRAPNNQVHVIILPEG 434
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
AHH+DLR S DP + D R++E II +WI +
Sbjct: 435 AHHLDLRQSHPSDPASVTDARKKEAAIIEQWIRDF 469
>gi|66267347|gb|AAH95721.1| Dpp7 protein [Danio rerio]
Length = 487
Score = 346 bits (888), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 190/472 (40%), Positives = 277/472 (58%), Gaps = 29/472 (6%)
Query: 37 EKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIF 96
E S+ I+S +D +K KY QILDHFNYN T+ QRYLI D +W K PIF
Sbjct: 25 ENYSNEITSHRDLSLDFKEKYFKQILDHFNYNSLGNGTYDQRYLITDKYW--KKGYGPIF 82
Query: 97 VYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY 156
YTGNEGDI FA+N+GFM ++A ALL+F EHRYYGKS+P+G N ++K G
Sbjct: 83 FYTGNEGDISEFARNSGFMVELAAAQGALLIFAEHRYYGKSLPFGKN---SFK-IPEVGL 138
Query: 157 LSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP 216
L+ QALADYA +I +LK+ L PV+VFGGSYGGML+ + R++YP++ GALA+SAP
Sbjct: 139 LTVEQALADYAVMITELKEELGGQTCPVIVFGGSYGGMLSVYMRIRYPNIVAGALAASAP 198
Query: 217 ILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
IL+ + P F +T DF + C ++G+++++ A++ + ++Q AF +CK
Sbjct: 199 ILSTAGLGDPRQFFQDVTADFEKFNPACRDAVQGAFQKLNTLAQQKDYI-RIQSAFSLCK 257
Query: 277 ---SEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA 332
S K++ + +L AF AM DYP ++F+ +PAFPVK C+ + + G D+ +
Sbjct: 258 TPSSPKDIHQLNGFLRNAFTMMAMLDYPYSTHFMGSMPAFPVKVACEIMLN---GTDLMS 314
Query: 333 KLYGAASVYYNYSGTAKCFDLNG----DSDPH----GLSEWGWQACTEMIMLTGGDNKDS 384
L + YN +G C+DL +DP G + + W EM + +N
Sbjct: 315 ALRDTVGIVYNNTGELTCYDLYSLYVECADPTGCGLGFNSYAWDYQIEMCFES--NNVTD 372
Query: 385 IFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS 444
+F R +YC +GV PRP W+ T+F G+ + ASNIIF NG DPW+
Sbjct: 373 MFPAMPFTEQQREQYCSNRWGVVPRPGWLKTQFWGNDLST-----ASNIIFSNGDLDPWA 427
Query: 445 GGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
GG+ K++S +++A+ EGAHH+DLR S DP+ + R++E EII +W+
Sbjct: 428 NGGIRKSLSPSLIAITIPEGAHHLDLRESNPADPESVIVARKKEAEIIAQWV 479
>gi|170055552|ref|XP_001863632.1| lysosomal Pro-X carboxypeptidase [Culex quinquefasciatus]
gi|167875507|gb|EDS38890.1| lysosomal Pro-X carboxypeptidase [Culex quinquefasciatus]
Length = 475
Score = 345 bits (885), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 199/471 (42%), Positives = 278/471 (59%), Gaps = 28/471 (5%)
Query: 41 SLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTG 100
+L++ ++ Y+TKY LDHF+Y +S TF+ RYL NDT+ + PI YTG
Sbjct: 14 ALLTGQSSAKYEYQTKYLDVPLDHFSYVNESV-TFKLRYLTNDTY--NPDGSGPILFYTG 70
Query: 101 NEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSST 160
NEGDIE FAQNTGFM+++APK KA LVF EHR+YGK++P+G +Y++ GYL+S
Sbjct: 71 NEGDIELFAQNTGFMWELAPKLKASLVFAEHRFYGKTLPFG---NASYESPRHLGYLTSE 127
Query: 161 QALADYASLIIDLK-KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILN 219
QALAD+A L+ + N T PVV FGGSYGGMLAAWFR+KYPH+ GA+A+SAPI
Sbjct: 128 QALADFAYLLAQINPSNRTVRARPVVAFGGSYGGMLAAWFRMKYPHMVAGAIAASAPIRQ 187
Query: 220 FDNIVSPYSFSNIITQDFR-SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSE 278
FD F+ I+T F + ++ C I SW ++ + GL+ L + F+ C +
Sbjct: 188 FDTDCG--VFNQILTSVFSVAYTKECSLNIGRSWDVLKNYSSSADGLKTLNEKFKFCSNL 245
Query: 279 KNLAIESWLSTAFVY-------TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF 331
E T F Y AM +YP S+FL P+PA+PV+E C + TG ++
Sbjct: 246 TK--AEDVTGTFFDYLIDVYGNLAMINYPYNSSFLAPVPAYPVREFCGRLAQNYTGLELL 303
Query: 332 AKLYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEMI--MLTGGDNKDSIFEE 388
KL A S+YYNY+G AKC ++N D G WG+Q+CTEM+ M G ++D +F +
Sbjct: 304 TKLQEALSIYYNYNGQAKCLNINSSYDSSMGDLGWGFQSCTEMVMPMCAEGGSRD-MFPK 362
Query: 389 SEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV 448
D + C + +GV PRP +GG + ASNI+F NGL DPWSGGGV
Sbjct: 363 QPWDEKKFSDDCFKKFGVRPRPIAPAYIYGGQYL-----EGASNIVFSNGLLDPWSGGGV 417
Query: 449 LKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
L++ + + ++ EGAHH+DLR + DP ++ R+ V+ I W+ QY
Sbjct: 418 LRSGNSNIKIVLIPEGAHHIDLRAADDNDPGSVRSARQIHVQNIQLWLKQY 468
>gi|348535188|ref|XP_003455083.1| PREDICTED: dipeptidyl peptidase 2-like [Oreochromis niloticus]
Length = 492
Score = 344 bits (883), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 195/510 (38%), Positives = 279/510 (54%), Gaps = 43/510 (8%)
Query: 1 MATRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQ 60
M RFIF S L+F + + P + + D+ + K+ TQ
Sbjct: 1 MNKRFIFSSVALVFC-------VDVLHSLPHTYFQSHQ-------HHDTDPRFTEKFFTQ 46
Query: 61 ILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAP 120
LDHFN+N TF QRYLI D +W K PIF YTGNEG+I FA N+GF+ ++A
Sbjct: 47 TLDHFNFNSMGNGTFNQRYLITDQYW--EKGFGPIFFYTGNEGNIWEFALNSGFITELAA 104
Query: 121 KFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTAT 180
+ +AL++F EHRYYGKS+P+ E N L+ QALADYA +I +LK+ L AT
Sbjct: 105 QQRALVIFAEHRYYGKSLPF----EKDSFNIPQVSLLTVEQALADYAIMITELKQQLGAT 160
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
D PV+VFGGSYGGML+ + R+KYP++ GALA+SAPIL+ + F +T DF SV
Sbjct: 161 DCPVIVFGGSYGGMLSVYMRIKYPNIVAGALAASAPILSTAGLGDSREFFRDVTSDFESV 220
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAIESWLSTAFVYTAM 296
S +C ++G++ Q++E A+ +Q AF +CK ++ + +L AF AM
Sbjct: 221 SSDCTDAVRGAFHQLKELAQSQ-EYHHIQSAFALCKPLSSAQDIHQLNGFLRNAFTLMAM 279
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNG- 355
DYP ++F+ +PA PVK C+ + G+D+ L A + YN +G CFDL
Sbjct: 280 LDYPYSTHFIGNMPANPVKVACETM---LRGSDLLGNLRDTAGIVYNATGVLTCFDLYSL 336
Query: 356 ---DSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
+DP G W +QACTE+ + +N +F + R YC + +GV
Sbjct: 337 YLECADPTGCGLGFDSLAWDYQACTEIELCYESNNVTDMFPPMPFTEEDRRLYCSKRWGV 396
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PRP W+ +F G + ASNIIF NG DPW+ GGV ++S +++AL GAH
Sbjct: 397 VPRPGWLNIQFWGDALST-----ASNIIFSNGDLDPWANGGVRTSLSDSLIALNISGGAH 451
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
H+DLR S + DP + R+ E ++I W+
Sbjct: 452 HLDLRGSNEADPVSVISARKTEADLIALWV 481
>gi|328868406|gb|EGG16784.1| peptidase S28 family protein [Dictyostelium fasciculatum]
Length = 505
Score = 343 bits (881), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 184/453 (40%), Positives = 266/453 (58%), Gaps = 26/453 (5%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWG-----GSKNNAPIFVYTGNEGDIEWFAQN 111
Y+ Q LDHFN+ Q+ QTF QRYLI+DT+W S+ N PI YTGNEGDI WF QN
Sbjct: 59 YYIQKLDHFNF--QTQQTFPQRYLISDTYWNKPSSNDSQCNGPILFYTGNEGDIVWFYQN 116
Query: 112 TGFMYDV-APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
+ F+ +V A + ALL F EHRYYG+++P+G + Y +S QALADYA++I
Sbjct: 117 SQFITNVLAQELGALLFFAEHRYYGQTLPFGNESTVP----ENLQYCTSEQALADYATII 172
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI-VSPYSF 229
+ ++L + PV+ GGSYGGMLA+W R+KYP++ GALA+SAPIL F P F
Sbjct: 173 PQVLEDLGGLNCPVISVGGSYGGMLASWMRMKYPNIVDGALAASAPILYFLGTGADPEGF 232
Query: 230 SNIITQDFRSVSEN--CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWL 287
+ I T DF S + C I+G++ +I E A+KP G E L + F +C + + +W+
Sbjct: 233 NEIATNDFAQTSADGSCATRIRGAFTEISEIAEKPNGDELLSEMFSLCGVQSVDDLVNWI 292
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGT 347
+ Y AM DYP P+ FL P+P +P+ C A++ + D L +YYN SG
Sbjct: 293 ESGLTYMAMADYPYPAAFLEPMPGWPINASCAAMEPVQ---DDIQALLQVLHIYYNSSGQ 349
Query: 348 A-KCFDLNG-DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
A C++++ + G W +QACTEM+M + +F S D D+ C++ +G
Sbjct: 350 AGSCYNVSVFTTGALGSDVWDYQACTEMVMPMSSNGVQDMFPASSFDLDSLITSCQQQWG 409
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
V P P WIT +GG + +SNI+F NG+ DPW GGV++N ++ L++ GA
Sbjct: 410 VTPDPYWITNYYGGSQ-----NVQSSNIVFSNGILDPWRAGGVIENGNEIYAVLIDG-GA 463
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
HH+DLR + +DPQ + + R E ++I KW +Q
Sbjct: 464 HHLDLRMPSPQDPQSVINARALETQLITKWSNQ 496
>gi|157110322|ref|XP_001651053.1| prolylcarboxypeptidase [Aedes aegypti]
gi|108878777|gb|EAT43002.1| AAEL005521-PA [Aedes aegypti]
Length = 476
Score = 342 bits (878), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 198/468 (42%), Positives = 277/468 (59%), Gaps = 24/468 (5%)
Query: 42 LISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGN 101
L+++ ++ Y+ KY LDHF Y +S TF+ RYL+NDT+ PI YTGN
Sbjct: 18 LMTAQISAKYEYQIKYLDVPLDHFTYVNESV-TFKMRYLVNDTY--NPDGKGPILFYTGN 74
Query: 102 EGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQ 161
EGDIE FAQNTGFM+D+APK KA LVF EHR+YGK++P+G +Y++ GYLSS Q
Sbjct: 75 EGDIENFAQNTGFMWDIAPKLKASLVFAEHRFYGKTLPFG---NASYESPKHLGYLSSEQ 131
Query: 162 ALADYASLIIDLK-KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
ALAD+A L+ ++ N + PV+ FGGSYGGMLAAWFR+KYPH+ GA+ASSAPI F
Sbjct: 132 ALADFADLLAEINPSNRSTRGRPVIAFGGSYGGMLAAWFRMKYPHLVTGAIASSAPIRQF 191
Query: 221 DNIVSPYSFSNIITQDFR-SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---- 275
D FS I+T F + + C I SW ++ + GL+ L+ F+ C
Sbjct: 192 DTDCG--VFSQILTSVFSVAYTRECSLNIARSWDTLKNYSSTADGLKTLKDKFKFCGNIT 249
Query: 276 KSEK-NLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL 334
K+E +L+ + AM +YP S+FL P+PA+PV+E C + TG ++ +L
Sbjct: 250 KAEDITETFFDYLTDVYGNLAMINYPYNSSFLAPVPAYPVREFCGRLAQNFTGLELIGRL 309
Query: 335 YGAASVYYNYSGTAKCFDL-NGDSDPHGLSEWGWQACTE--MIMLTGGDNKDSIFEESEE 391
A S+Y NY+G KC ++ N D G S W +Q+CTE M M + G KD +F +
Sbjct: 310 QEALSIYSNYTGKTKCLNIANAYDDNMGTSGWDFQSCTEMWMPMCSEGSGKD-MFPKKAW 368
Query: 392 DYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN 451
D + C + +GV P+ T +G GL L ASNI+F NGL DPWSGGGVL++
Sbjct: 369 DEKKFSDDCFKKFGVRPKKTVALTTYG----GLYLDG-ASNIVFSNGLMDPWSGGGVLQS 423
Query: 452 ISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+ + ++ EGAHH+DLR + + DP ++ R+ ++ I WI QY
Sbjct: 424 PNNAIKVVLIPEGAHHIDLRAADENDPGSVRGARQVHLQNIQMWIKQY 471
>gi|219362661|ref|NP_001136852.1| uncharacterized protein LOC100217003 [Zea mays]
gi|194697358|gb|ACF82763.1| unknown [Zea mays]
Length = 319
Score = 342 bits (877), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 160/303 (52%), Positives = 208/303 (68%), Gaps = 3/303 (0%)
Query: 199 FRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEET 258
RLKYPH+AIGALASSAPIL F++IV F ++++ DFR S +C+ IK SWK++++
Sbjct: 1 MRLKYPHIAIGALASSAPILQFEDIVPSTIFYDLVSDDFRRESLSCFLTIKDSWKELDDQ 60
Query: 259 AKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEM 317
A + GL KL K F +C++ K + WLS+A+ Y AM DYP PS FL PLPA P+KE+
Sbjct: 61 ANEQDGLLKLSKTFHLCQTLKTSGDLSDWLSSAYSYLAMVDYPLPSEFLMPLPANPIKEV 120
Query: 318 CKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLT 377
C+ ID G ++Y +VYYNY+GT CFDLN DPHG+ W WQACTEM+M
Sbjct: 121 CRNIDSQPKGTGTLERIYAGVNVYYNYTGTVDCFDLN--DDPHGMGGWDWQACTEMVMPM 178
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFN 437
+ S++ + DY + A C ++YGV PRP WITTEFGGH I VL+ F SNIIFFN
Sbjct: 179 SYSEQRSMYPPYKFDYASYAEDCIKSYGVRPRPKWITTEFGGHNITKVLENFGSNIIFFN 238
Query: 438 GLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
GL DPWSGGGVLKNIS++V+A+V GAHH+DLR +T +DP WL +R E+EII W+
Sbjct: 239 GLLDPWSGGGVLKNISESVVAIVAPLGAHHIDLRPATPDDPDWLVALRESELEIISGWLW 298
Query: 498 QYF 500
Y+
Sbjct: 299 DYY 301
>gi|198473985|ref|XP_001356509.2| GA15377 [Drosophila pseudoobscura pseudoobscura]
gi|198138198|gb|EAL33573.2| GA15377 [Drosophila pseudoobscura pseudoobscura]
Length = 474
Score = 342 bits (876), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 194/458 (42%), Positives = 265/458 (57%), Gaps = 28/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+ K LDHF++ S TF RYL ND+ + PIF YTGNEGDIE FAQNT
Sbjct: 34 YEIKEFQVPLDHFSF--LSNATFNIRYLYNDSFVDKKNAHTPIFFYTGNEGDIELFAQNT 91
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST---TGYLSSTQALADYASL 169
GFM+++A K +ALL+F EHRYYGKS+P+G + NAS Y + Q L DYA L
Sbjct: 92 GFMWELAEKQRALLIFAEHRYYGKSLPFGAST----FNASMPDHLAYFTVEQTLEDYAML 147
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
I L+ +L PVV FGGSYGGMLAAWFR+KYPH+ GALA+SAPIL F I F
Sbjct: 148 ITFLRNDLPL---PVVAFGGSYGGMLAAWFRMKYPHLVAGALAASAPILQFPGITDCDIF 204
Query: 230 SNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IE 284
I+T F++ + NC I SWK E G +++ AF +C KN A
Sbjct: 205 YRIVTSVFQNAYNSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKNFL 264
Query: 285 SWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD-PKTGNDVFAKLYGAASVYYN 343
++ + AM +YP S+FL PLPA+PV+++C + D ++ D+ + A +VY N
Sbjct: 265 DYIEEVYGNLAMVNYPYNSSFLAPLPAYPVRQVCFYLKDLHQSDADLLHAMASALAVYTN 324
Query: 344 YSGTAKCFD--LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
Y+G+ KC D +N ++D G W Q C +M+M ++ DS+F S ++ + C
Sbjct: 325 YTGSVKCLDTSVNSNADDSG---WNVQTCNQMVMPFCSNSTDSMFRPSSWNFKEFSDKCY 381
Query: 402 EAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE 461
+ Y + P+P I +GG I A+NIIF NGL DPWSGGGVL+ + V ++
Sbjct: 382 KDYRLTPKPYDIILRYGGRNI-----ETATNIIFSNGLLDPWSGGGVLQAPNNKVDIIIL 436
Query: 462 KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
EGAHH+DLR S DP ++D R +E II +WI ++
Sbjct: 437 PEGAHHLDLRNSNPADPPSVRDARNKEASIIARWIHEF 474
>gi|351704787|gb|EHB07706.1| Dipeptidyl-peptidase 2 [Heterocephalus glaber]
Length = 515
Score = 341 bits (875), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 184/460 (40%), Positives = 269/460 (58%), Gaps = 35/460 (7%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ Y Q+LDHFN+ +TF QR+L++D W ++ PIF YTGNEGD+ FA N+
Sbjct: 39 FREYYFEQLLDHFNFESFGNKTFSQRFLVSDKFW--RRSEGPIFFYTGNEGDVWGFANNS 96
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYL---SSTQALADYASL 169
GF+ ++A + + LLVF EHRYYGKS+P+G ++ GYL + QALAD+A L
Sbjct: 97 GFLVELAQQQEGLLVFAEHRYYGKSLPFGAQ-------STQRGYLKLLTVEQALADFAVL 149
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
+ L+++L D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP++ + Y F
Sbjct: 150 LQALRRDLGTQDAPAIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVIAVSGLGDSYQF 209
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWL-- 287
+T DF S C + ++ +++QI + + G +++ + F C+S L + L
Sbjct: 210 FRDVTADFYGQSPKCAQGVRDAFQQIRDLFLQ-GAYDRISREFGTCQSLSGLEDLTQLFV 268
Query: 288 --STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYS 345
AF AM DYP P++FL PLPA PVK C D + A L A + YN S
Sbjct: 269 FARNAFTVLAMMDYPYPTDFLVPLPANPVKAGC---DRLLSEAQRIAGLRSLAGLVYNAS 325
Query: 346 GTAKCFDL----NGDSDPHGL------SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
GT +C+D+ +DP G W +QACTE+ + +N +F +
Sbjct: 326 GTERCYDIYQLYRSCADPTGCGTGSDAQAWDYQACTEINLTFDSNNVTDMFPALPFTEEL 385
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
R +YC + +GV PRP+W+ T FGG I + ASNIIF NG DPW+GGG+ +N+S +
Sbjct: 386 RQQYCLDTWGVWPRPDWLQTSFGGSDI-----KGASNIIFSNGDLDPWAGGGIQRNLSAS 440
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKW 495
V+A+V + GAHH+DLR S EDP + +VR E +I +W
Sbjct: 441 VVAVVIRGGAHHLDLRASHPEDPASVVEVRELEASLIREW 480
>gi|225706950|gb|ACO09321.1| Dipeptidyl-peptidase 2 precursor [Osmerus mordax]
Length = 490
Score = 341 bits (875), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 187/461 (40%), Positives = 269/461 (58%), Gaps = 29/461 (6%)
Query: 50 QGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA 109
Q +K KY TQILDHFN+N T+ QRYLI D +W + PIF YTGNEGDI FA
Sbjct: 37 QPQFKEKYFTQILDHFNFNSMGNGTYDQRYLITDQYW--KRGYGPIFFYTGNEGDIWEFA 94
Query: 110 QNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL 169
N+GF+ ++A KAL++F EHRYYGKS+P+G + + G L+ QALAD+A +
Sbjct: 95 LNSGFITELAADQKALVIFAEHRYYGKSLPFGQDS----FSIPEVGLLTVEQALADFAVM 150
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
I LK L A++ PV+VFGGSYGGML+ + RL+YP++ GALA+SAPIL+ + F
Sbjct: 151 ITALKPQLGASECPVIVFGGSYGGMLSVYMRLRYPNIVAGALAASAPILSTAGMGDSRQF 210
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNL-AIES 285
+T DF S + C ++G+++++++ ++ ++Q AF +CK S+K++ +
Sbjct: 211 FQDVTHDFESYAPECRDAVRGAFQKLQDLSEVE-DYSRIQAAFSLCKPPSSQKDIHQLNG 269
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYS 345
L AF AM DYP ++F+ +PA PVK C D +G D+ L A + YN +
Sbjct: 270 LLRNAFTLMAMLDYPYSTHFMGNMPANPVKVGC---DIMLSGADLLQALRDTAGIVYNST 326
Query: 346 GTAKCFDLNG----DSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
G KCFDL +DP G W +QACTE+ + +N +F
Sbjct: 327 GILKCFDLYSLYVECADPTGCGLGFNSLAWDYQACTEIELCFESNNVTDMFPPMPFTEAH 386
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
R +YC + +GV PRP W+ T+F G + ASNIIF NG DPW+ GGV K++S +
Sbjct: 387 REQYCSKRWGVIPRPGWLKTQFWGSALSS-----ASNIIFSNGDLDPWANGGVRKSLSSS 441
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
++A+ EGAHH+DLR S DP + R+ E + I +W+
Sbjct: 442 LVAINISEGAHHLDLRSSNDADPLSVITARKTEADTISQWV 482
>gi|290997696|ref|XP_002681417.1| lysosomal carboxypeptidase [Naegleria gruberi]
gi|284095041|gb|EFC48673.1| lysosomal carboxypeptidase [Naegleria gruberi]
Length = 528
Score = 340 bits (872), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 192/469 (40%), Positives = 270/469 (57%), Gaps = 32/469 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
YKT Y Q LDHF++ + +TF+QRYL+ D+ G + PIF YTGNEGDI F +NT
Sbjct: 60 YKTLYFDQKLDHFDFT--NDKTFKQRYLVCDSFVGKMTPSTPIFFYTGNEGDIVTFYENT 117
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
G M+D AP+F AL++F+EHRYYG S P+G +N +LSS QALADY+ I +
Sbjct: 118 GLMFDTAPQFNALIIFVEHRYYGVSNPFGPVNSFTPENIK---WLSSEQALADYSYFITE 174
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+ +PV+ FGGSYGGML++W+R+KYPH+ GA+A+SAPI F + +P ++ I
Sbjct: 175 MFGLDEKRTNPVIAFGGSYGGMLSSWWRMKYPHIVDGAIAASAPIFQFTGLTAPNVYNQI 234
Query: 233 ITQDFRSVS------ENCYKVIKGSWKQIEE--TAKKPGGLEKLQKAFRIC------KSE 278
T+DF+ S E C VIK + + P L+KL FRIC ++
Sbjct: 235 CTEDFKKSSNLAKYHETCDAVIKNGLSILNQYYQNNNPQILQKLSSQFRICSPGIRTSAD 294
Query: 279 KNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID----DPKTGNDVFAK- 333
N I SWL+ A+ M DYP P+NFL PLP +P++ +C I K D + K
Sbjct: 295 VNQLI-SWLTQAYNTLPMIDYPYPANFLMPLPGYPIQVICDRISTIMASEKVTTDTYLKA 353
Query: 334 LYGAASVYYNYSGTAKCFDLNG-DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEED 392
+ A+VYYNY+GT+ C +L DS G W +Q C EM+M G + +F + D
Sbjct: 354 VLEGANVYYNYTGTSTCNNLTQPDSPSLGDDGWEYQTCNEMVMPIGNYPQTDMFIPAPWD 413
Query: 393 YDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK-- 450
A YC++ + V PR NW T +GG + L A+NI+F NG DPW GGGVL
Sbjct: 414 LQAWISYCQQKWKVTPRTNWAITNYGGKRAILE----ATNIVFSNGDLDPWHGGGVLPGM 469
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+++ V + + GAHH+DLR S DPQ ++ R EV+ I W++++
Sbjct: 470 KVNEKVKVVYIEGGAHHLDLRSSNPLDPQSVRLARALEVKEITAWLNEF 518
>gi|195147472|ref|XP_002014704.1| GL19317 [Drosophila persimilis]
gi|194106657|gb|EDW28700.1| GL19317 [Drosophila persimilis]
Length = 474
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 193/458 (42%), Positives = 264/458 (57%), Gaps = 28/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+ K LDHF++ S TF RYL ND+ + PIF YTGNEGDIE FAQNT
Sbjct: 34 YEIKEFQVPLDHFSF--LSNATFNIRYLYNDSFVDKKNAHTPIFFYTGNEGDIELFAQNT 91
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST---TGYLSSTQALADYASL 169
GFM+++A K +ALL+F EHRYYGKS+P+G + NAS Y + Q L DYA L
Sbjct: 92 GFMWELAEKQRALLIFAEHRYYGKSLPFGAST----FNASMPDHLAYFTVEQTLEDYAML 147
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
I L+ +L PVV FGGSYGGMLAAWFR+KYPH+ GALA+SAPIL F I F
Sbjct: 148 ITFLRNDLPL---PVVAFGGSYGGMLAAWFRMKYPHLVAGALAASAPILQFPGITDCDIF 204
Query: 230 SNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IE 284
I+T F++ + NC I SWK E G +++ AF +C KN A
Sbjct: 205 YRIVTSVFQNAYNSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKNFL 264
Query: 285 SWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD-PKTGNDVFAKLYGAASVYYN 343
++ + AM +YP S+FL PLP +PV+++C + D ++ D+ + A +VY N
Sbjct: 265 DYIEEVYGNLAMVNYPYNSSFLAPLPPYPVRQVCFYLKDLHQSDADLLHAMASALAVYTN 324
Query: 344 YSGTAKCFD--LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
Y+G+ KC D +N ++D G W Q C +M+M ++ DS+F S ++ + C
Sbjct: 325 YTGSVKCLDTSVNSNADDSG---WNVQTCNQMVMPFCSNSTDSMFRPSSWNFKEFSDKCY 381
Query: 402 EAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE 461
+ Y + P+P I +GG I A+NIIF NGL DPWSGGGVL+ + V ++
Sbjct: 382 KDYRLTPKPYDIILRYGGRNI-----ETATNIIFSNGLLDPWSGGGVLQAPNNKVDIIIL 436
Query: 462 KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
EGAHH+DLR S DP ++D R +E II +WI ++
Sbjct: 437 PEGAHHLDLRNSNPADPPSVRDARNKEASIIARWIHEF 474
>gi|91092240|ref|XP_971305.1| PREDICTED: similar to prolylcarboxypeptidase [Tribolium castaneum]
gi|270014428|gb|EFA10876.1| hypothetical protein TcasGA2_TC001698 [Tribolium castaneum]
Length = 488
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 188/459 (40%), Positives = 267/459 (58%), Gaps = 25/459 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y TK+ LDHF++ + TF+ +YLIND+ W ++ PIF YTGNEG +E FA+NT
Sbjct: 19 YTTKFIDVPLDHFSFTNNA--TFKLKYLINDSFW---IDDGPIFFYTGNEGAVETFAENT 73
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF++D+AP F AL+VF EHRYYG ++P+G ++ N G+L+S+QALADY LI
Sbjct: 74 GFIFDIAPTFNALIVFAEHRYYGATLPFG---NASFSNPGHLGFLTSSQALADYVYLINH 130
Query: 173 LKKNLTATD----SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS 228
L+ ++ PVV FGGSYGGMLAAW R+KYP +GA+A+SAPI F + +
Sbjct: 131 LQTTHQRSEYLSKVPVVAFGGSYGGMLAAWLRMKYPASVVGAIAASAPIWQFQGLTPCEN 190
Query: 229 FSNIITQDFRS-VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIE--- 284
F+ I++ +++ V ++C I+ SWK I G L KA+++C K+ I+
Sbjct: 191 FNRIVSNVYKTAVDDDCSAPIQKSWKIIRNITANDDGKAWLTKAWKLCSPLKSSDIDDLL 250
Query: 285 SWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND--VFAKLYGAASVYY 342
W S V AM +YP P+ FL PLPAFPV+ C + K +D + + A +Y
Sbjct: 251 EWYSEILVNMAMVNYPYPTKFLAPLPAFPVRNFCYKLTGEKITDDKSLVTAIGNALEIYT 310
Query: 343 NYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKE 402
N++ KC ++N + G W +QACTEMIM D+ D +FE D+ + C
Sbjct: 311 NFTKATKCNNINQTAASLGEEGWDFQACTEMIMPMCSDDND-MFENQSWDFKKYSDKCYT 369
Query: 403 AYGV-DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE 461
+GV E+GG I ASNI+F NGL DPWS GGVL NIS TV +++
Sbjct: 370 KWGVRQTNAELPILEYGGKDITA-----ASNIVFSNGLLDPWSSGGVLSNISSTVSSVII 424
Query: 462 KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
EGAHH+DLR + DP+ + + R+ V I KWI+ ++
Sbjct: 425 PEGAHHLDLRGENRNDPKSVIEARQFHVSSIRKWITDFY 463
>gi|330843655|ref|XP_003293764.1| hypothetical protein DICPUDRAFT_158674 [Dictyostelium purpureum]
gi|325075873|gb|EGC29712.1| hypothetical protein DICPUDRAFT_158674 [Dictyostelium purpureum]
Length = 503
Score = 339 bits (869), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 182/457 (39%), Positives = 263/457 (57%), Gaps = 26/457 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNA-----PIFVYTGNEGDIEW 107
YK ++ Q LDHFN+ Q+ F QRYLI+DT+W ++ PI YTGNEGDI W
Sbjct: 54 YKEYWYMQTLDHFNF--QTKGQFAQRYLISDTYWNKPSPSSKVCSGPIIFYTGNEGDIVW 111
Query: 108 FAQNTGFMYDV-APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADY 166
F +N+ F+ +V A + ALL F EHRYYG+++P+G N+ + +N TGYL+S QALADY
Sbjct: 112 FYENSQFITNVLAKEMGALLFFAEHRYYGETLPFG-NESLTPEN---TGYLTSEQALADY 167
Query: 167 ASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI-VS 225
A LI + +L A PV+ GGSYGGML AWFR+KYP++ LA+SAPIL F S
Sbjct: 168 AELIPSVLADLGAEHCPVISVGGSYGGMLTAWFRMKYPNIVDAGLAASAPILMFYKTGAS 227
Query: 226 PYSFSNIITQDFRSVSE--NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA- 282
F+ I T DF+ SE C I+ ++ I E +++ GGL++L F +C S +
Sbjct: 228 QEGFNQIATDDFKQTSEEGTCASRIRNAFNSIMEISQQTGGLQQLTNTFSLCDSLNQVGD 287
Query: 283 IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY 342
+ +W+ + Y AM DYP P+ FL P+P +P+ C+A+ T D L +VYY
Sbjct: 288 LVNWIESGLTYMAMADYPYPAGFLEPMPGYPINVSCEAM---ATTTDDIQGLLKVLNVYY 344
Query: 343 NYSGTAKCFDLNG-DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
NY+GT +C++ + + G W QAC EMIM + +F + + YC+
Sbjct: 345 NYTGTTQCYNTSVFTTSALGSDAWDVQACNEMIMPISSNGVQDMFPAAPFNLQQLTSYCQ 404
Query: 402 EAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE 461
+ +G+ P NWITT +GG SN+IF NG+ DPW GGVLK+ +V+ ++
Sbjct: 405 QTWGITPGVNWITTYYGGSNFT------TSNLIFSNGVLDPWRAGGVLKDYGDSVIHIII 458
Query: 462 KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
GAHH+DLR + DP + R E +++ W ++
Sbjct: 459 DGGAHHLDLRMPNEADPDSVIQARITETKLLQMWANE 495
>gi|357606522|gb|EHJ65099.1| putative Lysosomal Pro-X carboxypeptidase [Danaus plexippus]
Length = 467
Score = 338 bits (866), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 187/452 (41%), Positives = 263/452 (58%), Gaps = 18/452 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++TK+ LDHF + Q +TF +YLIN+ +W K PIF YTGNEG IE FA++T
Sbjct: 20 FETKWFNVPLDHFGF--QRNETFNIKYLINEEYW--DKGGGPIFFYTGNEGQIEVFAKHT 75
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+D+A +FKA LVF EHRYYG+S+P+G NK + + GYL+S QALADYA LI
Sbjct: 76 GFMWDIAEEFKAKLVFAEHRYYGQSMPFG-NKSL---DNEHIGYLTSEQALADYADLINY 131
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+ N PV+ FGGSYGGML+A+ R+KYPH+ GA+A+SAPI + +V F I
Sbjct: 132 LQGNKQRPTYPVIAFGGSYGGMLSAYIRIKYPHLVTGAIAASAPIHMYPGMVPCEVFHRI 191
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES---WLST 289
+T F+ E C K I+ SW + + + + L K + +C+ K + + +L +
Sbjct: 192 VTSSFKIADEKCVKNIRSSWGVLRKFLESQNNTDWLHKNWNLCEPVKPADVNTLMEFLQS 251
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAK 349
+ AM +YP PS+FL PLPA PV+ +C+ +++ +G + + VY NY G A
Sbjct: 252 MYETLAMVNYPFPSDFLLPLPAQPVRVVCQYLNETLSGQKLIEAIGKVIKVYSNYDGKAP 311
Query: 350 CFDLNGDSDPHGL--SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD 407
C D D L S W +QACTEMIM +FE S ++ A C Y V
Sbjct: 312 CVDYKKGDDFGNLDASGWDYQACTEMIMPMCTTGNQDMFEPSPWNFTKYAEDCHRKYNVY 371
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHH 467
PR ++GG ++ R A+NI+F NGL DPW+GGG+L +IS +V A+V + AHH
Sbjct: 372 PRQEAARIQYGGDRL-----RAATNIVFSNGLLDPWAGGGILNSISNSVKAVVIIDAAHH 426
Query: 468 VDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+DL S DP +K R + I KWI ++
Sbjct: 427 LDLMPSNPADPNSVKLARNIHKQNIDKWIREF 458
>gi|147792727|emb|CAN64373.1| hypothetical protein VITISV_018663 [Vitis vinifera]
Length = 702
Score = 337 bits (865), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 164/372 (44%), Positives = 233/372 (62%), Gaps = 17/372 (4%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
KT ++ Q LDHFNY P+SY+TF+QRY++N HWGG+K APIF Y G E ++ N G
Sbjct: 63 KTFFYAQTLDHFNYRPESYKTFRQRYVMNFKHWGGAKAGAPIFAYLGAEAPLDGDLVNIG 122
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
F+ D A +F ALL++IEHRYYGKSIP+G K +A KNAST GY +S QA+ADYA++++ +
Sbjct: 123 FVNDNAARFNALLIYIEHRYYGKSIPFGSTK-VALKNASTLGYFNSAQAIADYAAVLMHV 181
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
KK L A +SPV+V GGSYGGMLA+WFRLKYPH+A+GALASSAPIL FD I + +I+
Sbjct: 182 KKRLHAQNSPVIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDEIAPEIGYYSIV 241
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFV 292
T+DFR SE+CY+ I+ SW +I+ A KP GL L K F+ C E + ++ +L + +
Sbjct: 242 TKDFREASESCYRTIRRSWSEIDRIASKPNGLSILSKRFKTCAHLESSFELKDYLDSIYA 301
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLY-GAASVYYNYSGTAKCF 351
A + P P +PV +CK I+ D +++ G ++ +G C+
Sbjct: 302 EAAQYNEP---------PTYPVTVVCKGINGASKRTDTLGRIFHGLVAI----AGKRSCY 348
Query: 352 DLNGDSDP-HGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
D + P W WQ C+EM++ G D++F+ + + + C Y V PRP
Sbjct: 349 DTKEFNYPTETYLGWRWQKCSEMVLPIGHATNDTMFQPEPFNLNRFIKECNSLYSVSPRP 408
Query: 411 NWITTEFGGHKI 422
+W+TT +GG +
Sbjct: 409 HWVTTYYGGRTL 420
>gi|114627680|ref|XP_528471.2| PREDICTED: dipeptidyl peptidase 2 isoform 5 [Pan troglodytes]
Length = 492
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 182/458 (39%), Positives = 268/458 (58%), Gaps = 29/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ ++ Q LDHFN+ +TF QR+L++D W + PIF YTGNEGD+ FA N+
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFW--VRGEGPIFFYTGNEGDVWAFANNS 88
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + ALLVF EHRYYGKS+P+G + T L+ QALAD+A L+
Sbjct: 89 GFVAELAAEQGALLVFAEHRYYGKSLPFGAQS----RQRGHTELLTVEQALADFAELLRA 144
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP+L + F
Sbjct: 145 LRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRD 204
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + ++ F C+ EK+L + +
Sbjct: 205 VTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 263
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++FL PLPA PVK C D + L A + YN SG+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGC---DRLLSEAQRITGLRALAGLVYNASGSE 320
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ + +DP G W +QACTE+ + +N +F + + R R
Sbjct: 321 HCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQR 380
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+ +N+S +V+A
Sbjct: 381 YCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIA 435
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + GAHH+DLR S EDP + + R+ E IIG+W+
Sbjct: 436 VTIQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 473
>gi|194878910|ref|XP_001974147.1| GG21569 [Drosophila erecta]
gi|190657334|gb|EDV54547.1| GG21569 [Drosophila erecta]
Length = 470
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 183/455 (40%), Positives = 265/455 (58%), Gaps = 22/455 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+ K LDHF++ +TF RYL ND+ S + PIF YTGNEGDIE FAQNT
Sbjct: 30 YEIKEFQVPLDHFSFLIN--ETFNIRYLYNDSFVDKSNDRTPIFFYTGNEGDIELFAQNT 87
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+++ A + +AL++F EHRYYGKS+P+G + Y + QAL DYA LI
Sbjct: 88 GFLWEQAERQRALVIFAEHRYYGKSLPFGSST-FNTSMPEHLAYFTVEQALEDYAMLITF 146
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+ + PVV FGGSYGGMLAAWFR+KYPH+ GALA+SAPIL F I F I
Sbjct: 147 LRND---RQLPVVAFGGSYGGMLAAWFRMKYPHLVAGALAASAPILQFSGITDCDIFYKI 203
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNL-AIESWL 287
+T F + ++NC I SWK E G +++ AF +C K++++L +++
Sbjct: 204 VTSVFENAYNKNCSANIAKSWKLFETLGASEAGKKQISDAFHLCNALKTDEDLKKFLNYV 263
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSG 346
+ AM +YP S+FL PLPA+PV+++C + + +T + + + A +VY NY+
Sbjct: 264 EEVYTNLAMVNYPYNSSFLAPLPAYPVRQVCYYLKELQTTDASLLHAMSSALAVYTNYTQ 323
Query: 347 TAKCFDL--NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
+AKC D+ N ++D G W Q+C +M+M ++ +++F S ++ A C + Y
Sbjct: 324 SAKCLDISVNSNADESG---WDIQSCNQMVMPICSNSSETMFRTSSWNFKDYAEKCYKNY 380
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
+ P+P I +GG + SNIIF NGL DPWSGGGVL+ + V ++ EG
Sbjct: 381 RLTPKPYDIILRYGGRNL-----EATSNIIFSNGLLDPWSGGGVLQTPNDKVFVIILPEG 435
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
AHH+DLR + DP ++D R +E II +WI +
Sbjct: 436 AHHLDLRHTDPADPPSVRDARDKEAAIIARWIQDF 470
>gi|195118432|ref|XP_002003741.1| GI18077 [Drosophila mojavensis]
gi|193914316|gb|EDW13183.1| GI18077 [Drosophila mojavensis]
Length = 469
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 185/455 (40%), Positives = 257/455 (56%), Gaps = 19/455 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+ K + LDHF++ S +F RYL ND++ PIF YTGNEGDIEWFAQN+
Sbjct: 26 YEIKEFSVPLDHFSF--LSNASFNIRYLANDSYVDKKNPQPPIFFYTGNEGDIEWFAQNS 83
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF++++A + +AL++F EHRYYGKS+PYG + K Y + Q L DYA LI
Sbjct: 84 GFVWELAAQQRALVIFAEHRYYGKSLPYGADTFNTSK-PEHLAYFTVEQTLEDYAQLITY 142
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+ PVV FGGSYGGMLAAWFR+KYPH+ +GALA+SAPIL F + F+ I
Sbjct: 143 LRN---GKQLPVVAFGGSYGGMLAAWFRMKYPHIVVGALAASAPILQFSGLTPCDIFNKI 199
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL----AIESWL 287
T F + + NC I SWK E A G ++L F +C + KN +L
Sbjct: 200 TTAVFENAYNANCTANIGRSWKVFESMAATDAGKKQLSDIFHVCDAIKNADDLNNFMDYL 259
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD-PKTGNDVFAKLYGAASVYYNYSG 346
+ AM +YP S FL+PLPA PV+++C + D + D+ A + A +VY NY+G
Sbjct: 260 EEVYGNLAMANYPYNSTFLSPLPANPVRQLCFYLKDLHQNDTDLVASMASALAVYTNYTG 319
Query: 347 TAKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
+ KC D + GW Q+C +M+M +N +++F S D+ + C + +
Sbjct: 320 SVKCLDYKDSNAGASYDGTGWEIQSCNQMVMPMCANNNNTMFRTSNWDFKKVSDDCYKQF 379
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
+ PRP I +GG + ASNIIF NGL DPWSGGGVL+ + + ++ EG
Sbjct: 380 HLIPRPYDIVMRYGGRNL-----EAASNIIFSNGLLDPWSGGGVLQAPNNQINVIILPEG 434
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
AHH+DLR S DP + R+ E +II +WI +
Sbjct: 435 AHHLDLRQSNPLDPPSVTVARQLEAKIIAQWIKDF 469
>gi|410224782|gb|JAA09610.1| dipeptidyl-peptidase 7 [Pan troglodytes]
gi|410251824|gb|JAA13879.1| dipeptidyl-peptidase 7 [Pan troglodytes]
Length = 492
Score = 335 bits (859), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 182/458 (39%), Positives = 267/458 (58%), Gaps = 29/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ ++ Q LDHFN+ +TF QR+L++D W + PIF YTGNEGD+ FA N+
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFW--VRGEGPIFFYTGNEGDVWAFANNS 88
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + ALLVF EHRYYGKS+P+G T L+ QALAD+A L+
Sbjct: 89 GFVAELAAEQGALLVFAEHRYYGKSLPFGAQS----TQRGHTELLTVEQALADFAELLRA 144
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP+L + F
Sbjct: 145 LRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRD 204
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + ++ F C+ EK+L + +
Sbjct: 205 VTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 263
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++FL PLPA PVK C D + L A + YN SG+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGC---DRLLSEAQRITGLRALAGLVYNASGSE 320
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ + +DP G W +QACTE+ + +N +F + + R R
Sbjct: 321 HCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQR 380
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+ +N+S +V+A
Sbjct: 381 YCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIA 435
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + GAHH+DLR S EDP + + R+ E IIG+W+
Sbjct: 436 VTIQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 473
>gi|20129649|ref|NP_610037.1| CG2493, isoform A [Drosophila melanogaster]
gi|442628593|ref|NP_001260630.1| CG2493, isoform B [Drosophila melanogaster]
gi|7298683|gb|AAF53897.1| CG2493, isoform A [Drosophila melanogaster]
gi|18447221|gb|AAL68201.1| GH14278p [Drosophila melanogaster]
gi|220945264|gb|ACL85175.1| CG2493-PA [synthetic construct]
gi|220954996|gb|ACL90041.1| CG2493-PA [synthetic construct]
gi|440213994|gb|AGB93165.1| CG2493, isoform B [Drosophila melanogaster]
Length = 475
Score = 335 bits (859), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 187/465 (40%), Positives = 264/465 (56%), Gaps = 19/465 (4%)
Query: 42 LISSSKDSQGL-YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTG 100
LI+ SQ Y+ K LDHF++ + TF RYL ND+ S PIF YTG
Sbjct: 23 LIAGCDCSQRFKYEIKEFQVPLDHFSFLINA--TFNIRYLYNDSFVDKSNARTPIFFYTG 80
Query: 101 NEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSST 160
NEGDIE FAQNTGF+++ A + +AL++F EHRYYGKS+P+G + Y +
Sbjct: 81 NEGDIELFAQNTGFLWEQAERQRALVIFAEHRYYGKSLPFGSST-FNTSLPEHLAYFTVE 139
Query: 161 QALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
Q L DYA LI L+ + PVV FGGSYGGMLAAWFR+KYPH+ GALA+SAP+L F
Sbjct: 140 QTLEDYAMLITFLRND---RQMPVVAFGGSYGGMLAAWFRMKYPHLVNGALAASAPVLQF 196
Query: 221 DNIVSPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK 279
I F I+T F++ +ENC I SWK E G +++ AF +C + K
Sbjct: 197 PGITDCDIFYRIVTSVFQNAYNENCTLNIAKSWKLFETLGASEAGKKQISDAFHLCNALK 256
Query: 280 N----LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD-PKTGNDVFAKL 334
N ++ + AM +YP S+FL PLPA+PV+++C + + T D+ +
Sbjct: 257 NDDDLKKFLDYVEEVYSNLAMVNYPYNSSFLAPLPAYPVRQVCYYLKELHSTDADLLHAM 316
Query: 335 YGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYD 394
A +VY NY+ +AKC D++ +S+ S W Q+C +M+M + +++F S ++
Sbjct: 317 SSALAVYTNYTQSAKCLDISVNSNADD-SGWNIQSCNQMVMPICSNGSETMFRTSSWNFK 375
Query: 395 ARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
A C + Y + P+P I +GG + A+NIIF NGL DPWSGGGVL+ +
Sbjct: 376 DYAEKCYKNYRLTPKPYDIILRYGGRNL-----EAATNIIFSNGLLDPWSGGGVLQAPND 430
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
V ++ EGAHH+DLR S DP ++D R +E II +WI +
Sbjct: 431 KVFVIILPEGAHHLDLRHSDPADPPSVRDARDKEAAIIARWIQDF 475
>gi|403301430|ref|XP_003941393.1| PREDICTED: dipeptidyl peptidase 2 [Saimiri boliviensis boliviensis]
Length = 513
Score = 335 bits (859), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 182/457 (39%), Positives = 264/457 (57%), Gaps = 28/457 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ +Y Q LDHFN+ +TF QR L++D W + PIF YTGNEGD+ FA N+
Sbjct: 53 FQERYFQQRLDHFNFERLGNKTFPQRVLVSDKFW--IRGEGPIFFYTGNEGDVWNFANNS 110
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + ALLVF EHRYYGKS+P+G TG L+ QALAD+A L+
Sbjct: 111 GFIAELAAEQGALLVFAEHRYYGKSLPFGERS----TQRGHTGLLTVEQALADFAELLRA 166
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D P + FGGSYGGML+A+ R+KYPH+ GALA+SAPIL + F
Sbjct: 167 LRRDLGAQDVPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPILAVAGLGDSTQFFRD 226
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS--EKNLA-IESWLST 289
+T DF C + ++ +++QI++ + G +K+ F C+ EK+L + +
Sbjct: 227 VTADFEDQGPKCTQAVREAFRQIKDLFLQ-GAYDKVSWEFGTCQPLLEKDLTQLFVFARN 285
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAK 349
AF AM DYP P++FL PLPA PVK C D + L + YN SG+
Sbjct: 286 AFTVLAMMDYPYPTDFLGPLPANPVKVGC---DRLLSEAQRIVGLRALTGLVYNASGSEH 342
Query: 350 CFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
C+D+ + +DP G W +QACTE+ + +N +F + + R Y
Sbjct: 343 CYDIYRLYHNCADPTGCGTGPDARAWDYQACTEINLTLASNNMTDMFPDLPFTEELRREY 402
Query: 400 CKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLAL 459
C++ +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+ +N+S +V+A+
Sbjct: 403 CRDTWGVWPRPDWLQTNFWGGDL-----RAASNIIFSNGNLDPWAGGGIQRNLSTSVIAV 457
Query: 460 VEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ GAHH+DLR S EDP + + R+ E +IG+W+
Sbjct: 458 TIQGGAHHLDLRASHPEDPASVVEARKLEAAVIGEWV 494
>gi|14010871|ref|NP_114179.1| dipeptidyl peptidase 2 precursor [Rattus norvegicus]
gi|13626317|sp|Q9EPB1.1|DPP2_RAT RecName: Full=Dipeptidyl peptidase 2; AltName: Full=Dipeptidyl
aminopeptidase II; AltName: Full=Dipeptidyl peptidase 7;
AltName: Full=Dipeptidyl peptidase II; Short=DPP II;
AltName: Full=Quiescent cell proline dipeptidase; Flags:
Precursor
gi|9796394|dbj|BAB11691.1| dipeptidyl peptidase II [Rattus norvegicus]
gi|10119950|dbj|BAB13500.1| dipeptidyl peptidase II [Rattus norvegicus]
gi|51260653|gb|AAH78783.1| Dipeptidylpeptidase 7 [Rattus norvegicus]
gi|149039379|gb|EDL93599.1| dipeptidylpeptidase 7, isoform CRA_a [Rattus norvegicus]
Length = 500
Score = 335 bits (859), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 182/459 (39%), Positives = 268/459 (58%), Gaps = 29/459 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ Y Q +DHFN+ S +TF QR+L++D W PIF YTGNEGDI A N+
Sbjct: 41 FRENYFEQYMDHFNFESFSNKTFGQRFLVSDKFW--KMGEGPIFFYTGNEGDIWSLANNS 98
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + +ALLVF EHRYYGKS+P+G + T L+ QALAD+A L+
Sbjct: 99 GFIVELAAQQEALLVFAEHRYYGKSLPFG----VQSTQRGYTQLLTVEQALADFAVLLQA 154
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+ NL D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP++ + +P F
Sbjct: 155 LRHNLGVQDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVIAVAGLGNPDQFFRD 214
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + + + F C+ S K+L + +
Sbjct: 215 VTADFYGQSPKCAQAVRDAFQQIKDLFLQ-GAYDTISQNFGTCQSLSSPKDLTQLFGFAR 273
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P+NFL PLPA PVK C+ + G + L A + YN SG
Sbjct: 274 NAFTVLAMMDYPYPTNFLGPLPANPVKVGCERL--LSEGQRIMG-LRALAGLVYNSSGME 330
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
CFD+ +DP G W +QACTE+ + +N +F E + R +
Sbjct: 331 PCFDIYQMYQSCADPTGCGTGSNARAWDYQACTEINLTFDSNNVTDMFPEIPFSDELRQQ 390
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PRP+W+ T F G + + ASNIIF NG DPW+GGG+ +N+S +++A
Sbjct: 391 YCLDTWGVWPRPDWLQTSFWGGDL-----KAASNIIFSNGDLDPWAGGGIQRNLSTSIIA 445
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ + GAHH+DLR S EDP + +VR+ E +I +W++
Sbjct: 446 VTIQGGAHHLDLRASNSEDPPSVVEVRKLEATLIREWVA 484
>gi|15080291|gb|AAH11907.1| Dipeptidyl-peptidase 7 [Homo sapiens]
gi|16877430|gb|AAH16961.1| Dipeptidyl-peptidase 7 [Homo sapiens]
gi|123981114|gb|ABM82386.1| dipeptidyl-peptidase 7 [synthetic construct]
gi|123995923|gb|ABM85563.1| dipeptidyl-peptidase 7 [synthetic construct]
gi|158257656|dbj|BAF84801.1| unnamed protein product [Homo sapiens]
Length = 492
Score = 335 bits (858), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 182/458 (39%), Positives = 267/458 (58%), Gaps = 29/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ ++ Q LDHFN+ +TF QR+L++D W + PIF YTGNEGD+ FA N+
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFW--VRGEGPIFFYTGNEGDVWAFANNS 88
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + ALLVF EHRYYGKS+P+G T L+ QALAD+A L+
Sbjct: 89 GFVAELAAERGALLVFAEHRYYGKSLPFGAQS----TQRGHTELLTVEQALADFAELLRA 144
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP+L + F
Sbjct: 145 LRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRD 204
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + ++ F C+ EK+L + +
Sbjct: 205 VTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 263
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++FL PLPA PVK C D + L A + YN SG+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGC---DRLLSEAQRITGLRALAGLVYNASGSE 320
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ + +DP G W +QACTE+ + +N +F + + R R
Sbjct: 321 HCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQR 380
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+ +N+S +V+A
Sbjct: 381 YCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIA 435
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + GAHH+DLR S EDP + + R+ E IIG+W+
Sbjct: 436 VTIQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 473
>gi|195580410|ref|XP_002080033.1| GD21700 [Drosophila simulans]
gi|194192042|gb|EDX05618.1| GD21700 [Drosophila simulans]
Length = 475
Score = 335 bits (858), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 187/465 (40%), Positives = 266/465 (57%), Gaps = 19/465 (4%)
Query: 42 LISSSKDSQGL-YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTG 100
LI+ SQ Y+ K LDHF++ + TF RYL ND+ S PIF YTG
Sbjct: 23 LIAGCDCSQRFKYEIKEFQVPLDHFSFLINA--TFSIRYLYNDSFVDKSNARTPIFFYTG 80
Query: 101 NEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSST 160
NEGDIE FAQNTGF+++ A + +AL++F EHRYYGKS+P+G + Y +
Sbjct: 81 NEGDIELFAQNTGFLWEQAERQRALVIFAEHRYYGKSLPFGSST-FNTSLPEHLAYFTVE 139
Query: 161 QALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
Q L DYA LI L+ + PVV FGGSYGGMLAAWFR+KYPH+ GALA+SAP+L F
Sbjct: 140 QTLEDYAMLITFLRND---RQMPVVAFGGSYGGMLAAWFRMKYPHLVTGALAASAPVLQF 196
Query: 221 DNIVSPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---K 276
I F I+T F++ +ENC I SWK E G +++ AF +C K
Sbjct: 197 PGITDCDIFYRIVTSVFQNAYNENCTLNIAKSWKLFETLGASEAGKKQISDAFHLCNALK 256
Query: 277 SEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD-PKTGNDVFAKL 334
++ +L ++ + AM +YP S+FL PLPA+PV+++C + + T D+ +
Sbjct: 257 TDDDLKKFLDYVEEVYSNLAMVNYPYNSSFLAPLPAYPVRQVCYYLKELHSTDADLLHAM 316
Query: 335 YGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYD 394
A +VY NY+ +AKC D++ +S+ S W Q+C +M+M + +++F S ++
Sbjct: 317 SSALAVYTNYTQSAKCLDISVNSNADD-SGWNIQSCNQMVMPICSNGSETMFRTSSWNFK 375
Query: 395 ARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
A C + Y + P+P I +GG + A+NIIF NGL DPWSGGGVL+ +
Sbjct: 376 DYAEKCYKNYRLTPKPYDIILRYGGRNL-----EAATNIIFSNGLLDPWSGGGVLQAPND 430
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
V ++ EGAHH+DLR S DP ++D R +E II +WI +
Sbjct: 431 KVFVIILPEGAHHLDLRHSDPADPPSVRDARDKEAAIIARWIQDF 475
>gi|195351854|ref|XP_002042435.1| GM23324 [Drosophila sechellia]
gi|194124304|gb|EDW46347.1| GM23324 [Drosophila sechellia]
Length = 475
Score = 334 bits (856), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 187/465 (40%), Positives = 266/465 (57%), Gaps = 19/465 (4%)
Query: 42 LISSSKDSQGL-YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTG 100
LI+ SQ Y+ K LDHF++ + TF RYL ND+ S PIF YTG
Sbjct: 23 LIAGCDCSQRFKYEIKEFQVPLDHFSFLINA--TFNIRYLYNDSFVDKSNARTPIFFYTG 80
Query: 101 NEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSST 160
NEGDIE FAQNTGF+++ A + +AL++F EHRYYGKS+P+G + Y +
Sbjct: 81 NEGDIELFAQNTGFLWEQAERQRALVIFAEHRYYGKSLPFGSST-FNTSLPEHLAYFTVE 139
Query: 161 QALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
Q L DYA LI L+ + PVV FGGSYGGMLAAWFR+KYPH+ GALA+SAP+L F
Sbjct: 140 QTLEDYAMLITFLRND---RQMPVVAFGGSYGGMLAAWFRMKYPHLVTGALAASAPVLQF 196
Query: 221 DNIVSPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---K 276
I F I+T F++ +ENC I SWK E G +++ AF +C K
Sbjct: 197 PGITDCDIFYRIVTSVFQNAYNENCTLNIGKSWKLFETLGASEAGKKQISDAFHLCNVLK 256
Query: 277 SEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD-PKTGNDVFAKL 334
++ +L ++ + AM +YP S+FL PLPA+PV+++C + + T D+ +
Sbjct: 257 TDDDLKKFLDYVEEVYSNLAMVNYPYNSSFLAPLPAYPVRQVCYYLKELHSTDADLLHAM 316
Query: 335 YGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYD 394
A +VY NY+ +AKC D++ +S+ S W Q+C +M+M + +++F S ++
Sbjct: 317 SSALAVYTNYTQSAKCLDISVNSNADD-SGWNIQSCNQMVMPICSNGSETMFRTSSWNFK 375
Query: 395 ARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
A C + Y + P+P I +GG + A+NIIF NGL DPWSGGGVL+ +
Sbjct: 376 DYAEKCYKNYRLTPKPYDIILRYGGRNL-----EAATNIIFSNGLLDPWSGGGVLQAPND 430
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
V ++ EGAHH+DLR S DP ++D R +E II +WI +
Sbjct: 431 KVFVIILPEGAHHLDLRHSDPADPPSVRDARDKEAAIIARWIQDF 475
>gi|195484754|ref|XP_002090813.1| GE12590 [Drosophila yakuba]
gi|194176914|gb|EDW90525.1| GE12590 [Drosophila yakuba]
Length = 470
Score = 334 bits (856), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 182/453 (40%), Positives = 263/453 (58%), Gaps = 18/453 (3%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+ K LDHF++ + TF RYL ND+ S PIF YTGNEGDIE FAQNT
Sbjct: 30 YEIKEFQVPLDHFSFLINA--TFNIRYLYNDSFVDKSNARTPIFFYTGNEGDIELFAQNT 87
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+++ A + +AL++F EHRYYGKS+P+G + Y + Q L DYA LI
Sbjct: 88 GFLWEQAERQRALVIFAEHRYYGKSLPFGSST-FNTSLPEHLAYFTVEQTLEDYAMLITF 146
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L + PVV FGGSYGGMLAAWFR+KYPH+ GALA+SAPIL F I F I
Sbjct: 147 LTND---RQMPVVAFGGSYGGMLAAWFRMKYPHLVTGALAASAPILQFSGITDCDIFYRI 203
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNL-AIESWL 287
+T F++ + NC I SWK E G +++ AF +C K++++L ++
Sbjct: 204 VTSVFQNAYNTNCTANIAKSWKLFETLGASEAGKKQISDAFHLCNALKTDEDLKKFLDYV 263
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSG 346
+ AM +YP S+FL PLPA+PV+++C + + +T + D+ + A +VY NY+
Sbjct: 264 EEVYSNLAMVNYPYNSSFLAPLPAYPVRQVCYYLRELQTTDADLLHAMSSALAVYTNYTQ 323
Query: 347 TAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
+AKC D++ +S+ S W Q+C +M+M ++ +++F S ++ A C + Y +
Sbjct: 324 SAKCLDISVNSNADD-SGWNIQSCNQMVMPICSNSSETMFRTSSWNFKDYAEKCYKNYRL 382
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
P+P I +GG + +NIIF NGL DPWSGGGVL++ + V ++ EGAH
Sbjct: 383 TPKPYDIILRYGGRNL-----EATTNIIFSNGLLDPWSGGGVLQSPNDKVFVIILPEGAH 437
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
H+DLR S DP ++D R +E II +WI +
Sbjct: 438 HLDLRHSDPADPPSVRDAREKEAAIIARWIQDF 470
>gi|443733182|gb|ELU17643.1| hypothetical protein CAPTEDRAFT_152094 [Capitella teleta]
Length = 472
Score = 333 bits (855), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 193/479 (40%), Positives = 279/479 (58%), Gaps = 35/479 (7%)
Query: 34 ITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSY--QTFQQRYLINDTHWGGSKN 91
I E LS +++S +Q YK ++ Q +DHFN Y +T++QRYLI D W K
Sbjct: 13 IIAEFLSIPLTAS--AQTPYKEQFFEQTIDHFNSYWAQYGKRTYKQRYLIQDKWWTPGK- 69
Query: 92 NAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
PIF YTGNEGDI F NTGFM+++APKF AL+VF EHRYYGKS+P+G E ++K
Sbjct: 70 -GPIFFYTGNEGDIATFWNNTGFMFEIAPKFNALIVFAEHRYYGKSLPFG---ERSFKQP 125
Query: 152 STTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGAL 211
+ LSS QALAD+A L+ LK +L ATD V+ FGGSYGGML+A+ R+KYP++ G++
Sbjct: 126 YIS-LLSSQQALADFAVLLNHLKPSLNATDCKVIAFGGSYGGMLSAYMRIKYPNLIDGSI 184
Query: 212 ASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA 271
A+SAP+ S F +T DF++ C K+I+ + ++ + GL+K+
Sbjct: 185 AASAPVYLIGGDSSRDFFFEDVTADFQAA--GCDKLIRDGFSKMASMSSTTDGLKKISSH 242
Query: 272 FRICKSEKNLA----IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG 327
F +CK K + WL AF AM DYP P++F++ +PA+PV CKA+ +
Sbjct: 243 FMLCKYMKTTSDFTHFLGWLRNAFTLMAMMDYPYPTDFMSKMPAWPVNAGCKAMKNVTCP 302
Query: 328 NDVFAKLYGAASVYYNYSGTAKCFDLNGD----SDPHGLS------EWGWQACTEMIMLT 377
A L AS+ Y Y C D+ D +DP G W +QACT+ +M +
Sbjct: 303 VKGLADL---ASIVYPYKPDG-CHDIWTDFVDCADPTGCGTGPDSYAWDYQACTDFLMPS 358
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFN 437
G +NK +F + R YC++ +GV P W F G + ++ NI+F N
Sbjct: 359 GTNNKTDMFPILPFTMEQRNSYCEKRWGVTPDVEWTKLSFWGKDL-----KYTGNIVFSN 413
Query: 438 GLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
GL DPW GGVL+++S +++A+ KEGAHH+DLR S + DP+ +K R++E++II W+
Sbjct: 414 GLLDPWHRGGVLEDLSDSLIAITIKEGAHHLDLRASNEHDPESVKVARQKEIDIITHWL 472
>gi|332373508|gb|AEE61895.1| unknown [Dendroctonus ponderosae]
Length = 487
Score = 333 bits (854), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 192/467 (41%), Positives = 274/467 (58%), Gaps = 31/467 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ TKY LDHF++ + TF+ RYL+ND+++ N+ PIF YTGNEGDI FAQNT
Sbjct: 22 FVTKYIDMPLDHFSFTTNT--TFKLRYLVNDSYF---SNDQPIFFYTGNEGDISMFAQNT 76
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF++++A K AL++F EHR+YG+++P+G +Y + T GYLSS QALADY LI +
Sbjct: 77 GFLFELAEKMGALIIFAEHRFYGETLPFGNE---SYSSPKTLGYLSSQQALADYVYLIDN 133
Query: 173 LKKNLTATDS-----PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
L+K S PV+ FGGSYGGMLAAW R+KYP+ +GA+ASSAPI F I
Sbjct: 134 LQKKYVNELSSLRKLPVISFGGSYGGMLAAWLRIKYPYSVLGAIASSAPIWQFQGITPCQ 193
Query: 228 SFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNL-A 282
+F+ I+T ++ S C + IK SW + A G + + K F +C KS+ +L
Sbjct: 194 NFNKIVTDVIGALGSATCVETIKKSWSIFRKIASTDIGKQNISKTFDLCTSLKSDDDLNT 253
Query: 283 IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND--VFAKLYGAASV 340
+WLS + +YP P++FL PLP PV+E C +D ND + L +
Sbjct: 254 FLNWLSEMYTMIVEVNYPYPNSFLVPLPGNPVREFCSRMDSVNYNNDDGLIKALSTGVQL 313
Query: 341 YYNYSGTAKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
+ NY+GT KC ++ + P L E GW QACT+MIM ++D +FE + ++ +
Sbjct: 314 FTNYTGTTKCNNIGQTASP-SLGELGWDFQACTDMIMPMCSTDED-LFENAAWNFTEYSD 371
Query: 399 YCKEAYGVDPRPNWI-TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL 457
C + +GV PR + EFGG +I ASNI+F NGL DPWS GGV+ N+S V
Sbjct: 372 DCYKQFGVRPRNEEVPILEFGGTEI-----ETASNIVFSNGLLDPWSSGGVIANVSAQVW 426
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ-YFQDL 503
+++ GAHH DLR + + D +K R + I KW+ + YF++L
Sbjct: 427 SILMPNGAHHSDLRSANELDADDVKSARLFHEKHIQKWLDKFYFRNL 473
>gi|296191232|ref|XP_002743536.1| PREDICTED: dipeptidyl peptidase 2 [Callithrix jacchus]
Length = 492
Score = 333 bits (854), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 182/458 (39%), Positives = 264/458 (57%), Gaps = 29/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ +Y Q+LDHFN+ +TF QR+L++D W + PIF YTGNEGD+ FA N+
Sbjct: 31 FQERYFQQLLDHFNFESFGNKTFPQRFLVSDRFW--IRGKGPIFFYTGNEGDVWVFANNS 88
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + ALLVF EHRYYGKS+P+G TG L+ QALAD+A L+
Sbjct: 89 GFIAELAAEQGALLVFAEHRYYGKSLPFGERS----TQRGYTGLLTVEQALADFAELLRA 144
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D P + FGGSYGGML+A+ R+KYPH+ GALA+SAP+L + F
Sbjct: 145 LRRDLGAQDVPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSTQFFRD 204
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF C + ++ ++ QI + + G +K+ F C+ EK+L + +
Sbjct: 205 VTADFEGQGPKCTQAVREAFWQIRDLFLQ-GAYDKVSWEFGTCQPLLDEKDLTQLFMFAR 263
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++FL PLPA PVK C + + A L A + YN SG+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCDLL---LSEAQRIAGLRALAGLVYNASGSE 320
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ + +DP G W +QACTE+ ++ +N +F + R +
Sbjct: 321 HCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLMFASNNVTDMFPVLPFTDELRQQ 380
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+ +N+S +V+A
Sbjct: 381 YCLDTWGVWPRPDWLQTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIQRNLSTSVIA 435
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + GAHH+DLR S EDP + R E +IG+W+
Sbjct: 436 VTIQGGAHHLDLRASHPEDPDSVVKARILEATVIGEWV 473
>gi|167888448|gb|ACA09612.1| prolyl carboxypeptidase [Tenebrio molitor]
Length = 488
Score = 333 bits (854), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 189/460 (41%), Positives = 276/460 (60%), Gaps = 26/460 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y TKY +DHF++ + TF+ +YLIN++ W ++ PIF YTGNEG IE FA+N
Sbjct: 18 YTTKYFEVPVDHFSFTNNA--TFKLKYLINNSFW---VDDGPIFFYTGNEGTIENFAENM 72
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+D+AP+F ALLVF EHRYYG+S+P+G +Y + + GYL+S QALADY LI
Sbjct: 73 GFMFDIAPQFNALLVFAEHRYYGESLPFGDE---SYADPARLGYLTSNQALADYVDLINY 129
Query: 173 LKKNLT----ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS 228
L+ + + PVV FGGSYGGMLA+W R+K+P +GA+ASSAPI F + +
Sbjct: 130 LQTTRSRSSYSNKVPVVAFGGSYGGMLASWLRMKFPASVVGAIASSAPIWQFQGLTPCEN 189
Query: 229 FSNIITQDFRS-VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNL-AI 283
F+ I+T +++ + ++C + SWK I G L +++C K++ ++ +
Sbjct: 190 FNRIVTDVYKTALDDDCSVPLSRSWKVIRNITSNDAGKAWLSAQWKLCTPLKTQSDVDTL 249
Query: 284 ESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND--VFAKLYGAASVY 341
+W S V AM +YP P++FL PLPA+PV+ C + + +T +D + + A +Y
Sbjct: 250 VNWFSEIVVNMAMVNYPYPTSFLAPLPAYPVRSFCYKMTESQTVDDETLLTTIGAALEIY 309
Query: 342 YNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
NY+ T KC +N + G W +QACTEMIM + D +FE S D+D + C
Sbjct: 310 TNYTQTTKCNVINETAAALGEDAWDFQACTEMIMPMCSTD-DDMFENSPWDFDTYSENCY 368
Query: 402 EAYGV-DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
+ +GV P E+GG +I ASNI+F NGL DPWS GGVL N+S +V A++
Sbjct: 369 KKWGVKQTHPELPILEYGGKEISS-----ASNIVFSNGLLDPWSSGGVLSNVSSSVSAVI 423
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
EGAHH+DLR K+DP+ + + R+ + I KWI+ Y+
Sbjct: 424 IPEGAHHLDLRGENKDDPESVIEARQFHIRNIKKWITDYY 463
>gi|197099620|ref|NP_001127163.1| dipeptidyl peptidase 2 precursor [Pongo abelii]
gi|55725338|emb|CAH89533.1| hypothetical protein [Pongo abelii]
Length = 492
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 182/458 (39%), Positives = 267/458 (58%), Gaps = 29/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ ++ Q LDHFN+ +TF QR+L++D W + PIF YTGNEGD+ FA N+
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFW--IRGEGPIFFYTGNEGDVWAFANNS 88
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + ALLVF EHRYYGKS+P+G T L+ QALAD+A L+
Sbjct: 89 GFVAELAAEQGALLVFAEHRYYGKSLPFGAQS----TRRGHTELLTVEQALADFAELLRA 144
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP+L + F
Sbjct: 145 LRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRD 204
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + ++ F C+ EK+L + +
Sbjct: 205 VTADFEGQSPKCTQGVREAFRQIKDLFLR-GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 263
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++FL PLPA PVK C D + L A + YN SG+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGC---DRLLSEAQRITGLRALAGLVYNASGSE 320
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ + +DP G W +QACTE+ + +N +F + + R +
Sbjct: 321 HCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQQ 380
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+ KN+S +V+A
Sbjct: 381 YCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRKNLSASVIA 435
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + GAHH+DLR S EDP + + R+ E IIG+W+
Sbjct: 436 VTIQGGAHHLDLRASHPEDPVSVVETRKLEATIIGEWV 473
>gi|380790295|gb|AFE67023.1| dipeptidyl peptidase 2 preproprotein [Macaca mulatta]
gi|383414091|gb|AFH30259.1| dipeptidyl peptidase 2 preproprotein [Macaca mulatta]
gi|384940796|gb|AFI34003.1| dipeptidyl peptidase 2 preproprotein [Macaca mulatta]
Length = 492
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 182/458 (39%), Positives = 266/458 (58%), Gaps = 29/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ ++ Q LDHFN+ QTF QR+L++D W + PIF YTGNEGD+ FA N+
Sbjct: 31 FQERFFQQRLDHFNFERFGNQTFPQRFLVSDRFW--IRGEGPIFFYTGNEGDVWAFANNS 88
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + ALLVF EHRYYGKS+P+G T L+ QALAD+A L+
Sbjct: 89 GFIAELAAEQGALLVFAEHRYYGKSLPFGERS----TQRGHTELLTVEQALADFAELLRA 144
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP+L + F
Sbjct: 145 LRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRD 204
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ ++ QI++ + G ++++ F C+ EK+L + +
Sbjct: 205 VTADFEGQSPKCTQGVREAFGQIKDLFLQ-GAYDRVRWEFGTCQPLSDEKDLTQLFMFAR 263
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++FL PLPA PVK C D + L A + YN SG+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGC---DRLLSEAQRITGLRALAGLVYNASGSE 320
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ +DP G W +QACTE+ + +N +F + + R +
Sbjct: 321 HCYDIYRLYRSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDERRQQ 380
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+ +N+S +V+A
Sbjct: 381 YCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIA 435
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + GAHH+DLR S EDP + + R+ E IIG+W+
Sbjct: 436 ITIQGGAHHLDLRASHPEDPVSVVEARKLEATIIGEWV 473
>gi|62420888|ref|NP_037511.2| dipeptidyl peptidase 2 preproprotein [Homo sapiens]
gi|212276510|sp|Q9UHL4.3|DPP2_HUMAN RecName: Full=Dipeptidyl peptidase 2; AltName: Full=Dipeptidyl
aminopeptidase II; AltName: Full=Dipeptidyl peptidase 7;
AltName: Full=Dipeptidyl peptidase II; Short=DPP II;
AltName: Full=Quiescent cell proline dipeptidase; Flags:
Precursor
Length = 492
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 181/458 (39%), Positives = 266/458 (58%), Gaps = 29/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ ++ Q LDHFN+ +TF QR+L++D W + PIF YTGNEGD+ FA N+
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFW--VRGEGPIFFYTGNEGDVWAFANNS 88
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
F+ ++A + ALLVF EHRYYGKS+P+G T L+ QALAD+A L+
Sbjct: 89 AFVAELAAERGALLVFAEHRYYGKSLPFGAQS----TQRGHTELLTVEQALADFAELLRA 144
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP+L + F
Sbjct: 145 LRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRD 204
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + ++ F C+ EK+L + +
Sbjct: 205 VTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 263
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++FL PLPA PVK C D + L A + YN SG+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGC---DRLLSEAQRITGLRALAGLVYNASGSE 320
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ + +DP G W +QACTE+ + +N +F + + R R
Sbjct: 321 HCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQR 380
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+ +N+S +V+A
Sbjct: 381 YCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIA 435
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + GAHH+DLR S EDP + + R+ E IIG+W+
Sbjct: 436 VTIQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 473
>gi|6465985|gb|AAF12747.1|AF154502_1 quiescent cell proline dipeptidase [Homo sapiens]
Length = 492
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 181/458 (39%), Positives = 266/458 (58%), Gaps = 29/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ ++ Q LDHFN+ +TF QR+L++D W + P F YTGNEGD+ FA N+
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFW--VRGEGPTFFYTGNEGDVWAFANNS 88
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + ALLVF EHRYYGKS+P+G T L+ QALAD+A L+
Sbjct: 89 GFVAELAAERGALLVFAEHRYYGKSLPFGAQS----TQRGHTELLTVEQALADFAELLRA 144
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP+L + F
Sbjct: 145 LRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRD 204
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + ++ F C+ EK+L + +
Sbjct: 205 VTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 263
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++FL PLPA PVK C D + L A + YN SG+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGC---DRLLSEAQRITGLRALAGLVYNASGSE 320
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ + +DP G W +QACTE+ + +N +F + + R R
Sbjct: 321 HCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQR 380
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+ +N+S +V+A
Sbjct: 381 YCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIA 435
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + GAHH+DLR S EDP + + R+ E IIG+W+
Sbjct: 436 VTIQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 473
>gi|261278872|pdb|3JYH|A Chain A, Human Dipeptidyl Peptidase Dpp7
gi|261278873|pdb|3JYH|B Chain B, Human Dipeptidyl Peptidase Dpp7
gi|261278874|pdb|3JYH|C Chain C, Human Dipeptidyl Peptidase Dpp7
gi|261278875|pdb|3JYH|D Chain D, Human Dipeptidyl Peptidase Dpp7
gi|301015979|pdb|3N0T|A Chain A, Human Dipeptidil Peptidase Dpp7 Complexed With Inhibitor
Gsk237826a
gi|301015980|pdb|3N0T|B Chain B, Human Dipeptidil Peptidase Dpp7 Complexed With Inhibitor
Gsk237826a
gi|301015981|pdb|3N0T|C Chain C, Human Dipeptidil Peptidase Dpp7 Complexed With Inhibitor
Gsk237826a
gi|301015982|pdb|3N0T|D Chain D, Human Dipeptidil Peptidase Dpp7 Complexed With Inhibitor
Gsk237826a
Length = 469
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 181/458 (39%), Positives = 266/458 (58%), Gaps = 29/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ ++ Q LDHFN+ +TF QR+L++D W + PIF YTGNEGD+ FA N+
Sbjct: 8 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFW--VRGEGPIFFYTGNEGDVWAFANNS 65
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
F+ ++A + ALLVF EHRYYGKS+P+G T L+ QALAD+A L+
Sbjct: 66 AFVAELAAERGALLVFAEHRYYGKSLPFGAQS----TQRGHTELLTVEQALADFAELLRA 121
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP+L + F
Sbjct: 122 LRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRD 181
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + ++ F C+ EK+L + +
Sbjct: 182 VTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 240
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++FL PLPA PVK C D + L A + YN SG+
Sbjct: 241 NAFTVLAMMDYPYPTDFLGPLPANPVKVGC---DRLLSEAQRITGLRALAGLVYNASGSE 297
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ + +DP G W +QACTE+ + +N +F + + R R
Sbjct: 298 HCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQR 357
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+ +N+S +V+A
Sbjct: 358 YCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIA 412
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + GAHH+DLR S EDP + + R+ E IIG+W+
Sbjct: 413 VTIQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 450
>gi|194759919|ref|XP_001962194.1| GF14561 [Drosophila ananassae]
gi|190615891|gb|EDV31415.1| GF14561 [Drosophila ananassae]
Length = 471
Score = 332 bits (852), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 189/457 (41%), Positives = 260/457 (56%), Gaps = 25/457 (5%)
Query: 56 KYHTQI------LDHFNYNPQSYQTFQQRYLINDTHWGGSKN-NAPIFVYTGNEGDIEWF 108
KYH I LDHF++ S TF RYL ND+ N PI YTGNEGDIE F
Sbjct: 27 KYHYLIKEFVVPLDHFSF--LSNATFSIRYLYNDSFVRDQSNARTPILFYTGNEGDIELF 84
Query: 109 AQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYAS 168
AQNTGF+++ A + AL+VF EHRYYGKS+P+G Y + Q L DYA
Sbjct: 85 AQNTGFLWEEAERKHALVVFAEHRYYGKSLPFG-KSTFNTSMPEHLAYFTVEQTLEDYAM 143
Query: 169 LIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS 228
LI L+ T PVV FGGSYGGMLAAWFR+KYPH+A+GALA+SAPIL F I
Sbjct: 144 LITYLRN---GTQRPVVTFGGSYGGMLAAWFRMKYPHLAVGALAASAPILQFPGITDCDI 200
Query: 229 FSNIITQDF-RSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNL-AI 283
F I+T F + + NC I SW+ E G +++ AF +C KS+ +L
Sbjct: 201 FYRIVTSVFANAYNSNCTVNIAKSWRVFETLGGTDAGKKQISDAFNLCSPVKSDADLKKF 260
Query: 284 ESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYY 342
++ + AM +YP S+FL PLPAFPV+++C + D T + D+ + A +VY
Sbjct: 261 LDYVEEVYSNLAMVNYPYSSSFLAPLPAFPVRQVCFYLKDLHTTDADLLHTMASALAVYT 320
Query: 343 NYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKE 402
N++GT KC D++ S+ S W Q C +M+M + +++F S +++ + C +
Sbjct: 321 NFTGTTKCLDISATSNADE-SGWNIQTCNQMVMPFCSNGTETMFRPSTWNFNEFSEKCYK 379
Query: 403 AYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
Y + P+P I +GG I A+NIIF NGL DPWSGGGVL+ + + ++
Sbjct: 380 DYRLTPKPYDIILRYGGKNI-----EAATNIIFSNGLLDPWSGGGVLQAPNDKIHVIILP 434
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
EGAHH+DLR S DP ++D R +E II +WI ++
Sbjct: 435 EGAHHLDLRRSDPADPPSVRDARNKESAIIARWIEEF 471
>gi|224058949|ref|XP_002299659.1| predicted protein [Populus trichocarpa]
gi|222846917|gb|EEE84464.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 332 bits (852), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 169/372 (45%), Positives = 232/372 (62%), Gaps = 28/372 (7%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++T +H Q LDHFNY P+SY F QRYLIN +WGG+ +API VY G E I+ +
Sbjct: 1 FETFFHNQTLDHFNYRPESYDKFPQRYLINSKYWGGANVSAPILVYLGAEASIDGYRDAA 60
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ D A +FK+LLVFIEHRYYG S P G GY SS QALADYA++IID
Sbjct: 61 GFLDDNAVQFKSLLVFIEHRYYGHSFPPGA--------WGKRGYFSSAQALADYAAIIID 112
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+K+N +A SPV+V GGSYGGMLA+WFRLKYPH+A+GALASSAPIL FD+I ++ ++
Sbjct: 113 IKENRSAQYSPVIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDDITPQDAYYSV 172
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAF 291
+T++FR SE CY+ IK SW +I+E A KP GL L F CK+ + + ++ +L +
Sbjct: 173 VTKEFREASETCYQTIKTSWSEIDELASKPDGLSMLSTKFNTCKNLTDASELKDYLRLMY 232
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
Y A + P P +PV E+CK IDD +G+D+ ++++ YY G C+
Sbjct: 233 AYAAQNNSP---------PTYPVNEVCKGIDDDASGDDILSRIFRGVVAYY---GNRTCY 280
Query: 352 DLNGDS---DPHGLSEWGWQACTEMIMLTG-GDNKDSIFEESEEDYDARARYCKEAYGVD 407
N ++ +W WQ C+EM++ G GDN S+F+ + + CK YGV
Sbjct: 281 -FNNNAYAYQSEATLDWSWQRCSEMVIPLGVGDN--SMFQPNPFNLTDHIERCKSLYGVR 337
Query: 408 PRPNWITTEFGG 419
PRP+W+TT +GG
Sbjct: 338 PRPHWVTTYYGG 349
>gi|198425804|ref|XP_002127477.1| PREDICTED: similar to Dipeptidyl-peptidase 2 precursor
(Dipeptidyl-peptidase II) (DPP II) (Dipeptidyl
aminopeptidase II) (Quiescent cell proline dipeptidase)
(Dipeptidyl peptidase 7) [Ciona intestinalis]
Length = 494
Score = 331 bits (849), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 184/464 (39%), Positives = 261/464 (56%), Gaps = 29/464 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y TKY Q +DHFN+ T+ QRYLI+D HW K P+ Y GNEGDI F +
Sbjct: 21 YHTKYFEQFVDHFNFQSNGNATYMQRYLISDEHWVAGK--GPMLFYAGNEGDIVGFKDAS 78
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
G + + APK A++VF EHR+YG S+P+G N KN G LS QA+ADYA L+
Sbjct: 79 GLLTETAPKLGAMVVFAEHRFYGTSLPFG-NDSFIDKN---IGLLSIEQAMADYAYLLKH 134
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LK + A D P++ FGGSYGG+LAA+ R+KYP++ GALA+SAPI +P+ F
Sbjct: 135 LKSSYNADDIPIIAFGGSYGGILAAYMRIKYPNLITGALAASAPIYWTSGEGNPHGFWKS 194
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNLAIESWLST 289
+T F +E C +K + + + A++ G + + K F+ C KS + + W+
Sbjct: 195 VTTIFGH-NEGCVNRVKEGFAETAKYAQQ-GKYDVISKGFKTCSQVKSSSLMHLYGWVRN 252
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAK 349
+F AM +YP P+NF PLPAFPV C+ + N + A S+ YN +G
Sbjct: 253 SFTQLAMANYPYPANFFGPLPAFPVNVACEKM---LKANTAIEGMLEATSLLYNGTGDKD 309
Query: 350 CFDLNGD----SDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
CFD+ + SDP G S W +Q CTE+++ G N +F + R +Y
Sbjct: 310 CFDIYEEYIECSDPTGCSLGLAARSWDYQGCTEIVLPGGSTNITDMFPAIPFTPEIRKKY 369
Query: 400 CKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLAL 459
C+ V PR NW+ F + + +SNIIF NG DPW GG+L ++S TV+AL
Sbjct: 370 CETHQRVTPRRNWLALNFWTDNL-----KLSSNIIFSNGDLDPWKDGGILHDLSPTVVAL 424
Query: 460 VEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
+ K GAHH+DLR S +DP + +VR+ EVEII WI+Q+++ +
Sbjct: 425 LVKGGAHHLDLRGSNPQDPPSVIEVRKHEVEIISGWIAQHWEKI 468
>gi|427785257|gb|JAA58080.1| Putative lysosomal pro-x carboxypeptidase-like protein
[Rhipicephalus pulchellus]
Length = 467
Score = 331 bits (848), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 190/462 (41%), Positives = 269/462 (58%), Gaps = 19/462 (4%)
Query: 40 SSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYT 99
+ ++S+S D Y+ ++ +DHF Y + T++ RYL D +W PIF YT
Sbjct: 14 ACVVSTSADYA--YEVRFFETKVDHFGY--ANNDTYKMRYLFADQYW--DHQGGPIFFYT 67
Query: 100 GNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSS 159
GNEG I FA N+G M+D AP+F+ALL+F EHRYYGKS+PYG + ++K+ + GYL+
Sbjct: 68 GNEGSITTFANNSGLMWDWAPEFRALLIFAEHRYYGKSMPYGND---SFKSPAHLGYLTV 124
Query: 160 TQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL 218
QALADYA L+ ++K L A DS VV FGGSYGGMLAAWFR+KYPHV ALA+SAPIL
Sbjct: 125 EQALADYADLLQYIRKTLPGARDSQVVSFGGSYGGMLAAWFRMKYPHVTAAALAASAPIL 184
Query: 219 NFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK-- 276
F +I + S +IT FR SE C + I+ SW I+ G + +++ F +C
Sbjct: 185 QFQDITPCGAQSAVITGAFRKDSELCVEAIRKSWGIIKNMTSTVEGAQAVRERFHLCGNY 244
Query: 277 SEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-L 334
+ KN + WL+ + AM +YP ++FL P+P PV+E CK ++ +D L
Sbjct: 245 TPKNYTNLRDWLTDLYANLAMVNYPYDNSFLAPVPGHPVREACKFLNRTFDNDDALLDGL 304
Query: 335 YGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYD 394
Y A SV+ NY+G +C DL+ S W +Q+C EM+M D + +F++ + D +
Sbjct: 305 YQAISVFQNYTGQTQCNDLSKSSGTLDADGWNYQSCNEMVMPLCSDGVNDMFDKQDWDLN 364
Query: 395 ARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
+ C++ + V P FGG I ASNIIF NG DPWS GGVL+ IS
Sbjct: 365 EVRKKCEKDFHVTPDVYKAALIFGGRNIAA-----ASNIIFSNGDLDPWSAGGVLETISD 419
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+++A+ + AHH+DLR S DP + R E + I KW+
Sbjct: 420 SLIAIYMEGAAHHLDLRSSNPADPDSVVRARALEKKYITKWL 461
>gi|449530679|ref|XP_004172321.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Cucumis sativus]
Length = 440
Score = 330 bits (847), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 180/409 (44%), Positives = 249/409 (60%), Gaps = 18/409 (4%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
S+ + S GL T + Q LDHFNY PQSY TF QRY+I+ +W G PIF Y G E
Sbjct: 45 STPQQSDGL-ATFXYKQPLDHFNYQPQSYVTFDQRYIIDFKYWEGINPKTPIFAYLGAES 103
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
DI+ GF A ++KA+ V++EHR+YGKSIP+G + E A KN S GY +S QAL
Sbjct: 104 DIDNDVPYVGFPLRFASQYKAMSVYLEHRFYGKSIPFG-SLEKAMKNGSIRGYFNSAQAL 162
Query: 164 ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
ADYA L++ +KK SP++V G SYGGMLA+WFRLKYPH+A+GALASSAPIL FDNI
Sbjct: 163 ADYAELLLHIKKMFAYDTSPIIVMGASYGGMLASWFRLKYPHIALGALASSAPILYFDNI 222
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETA-KKPGGLEKLQKAFRIC-KSEKNL 281
+ +I+++ F+ S+ C+ I+ SW +I+ A K GGL L K F+ C K + +
Sbjct: 223 TPQDGYYSIVSKSFKETSKTCHDTIRRSWGEIDRIAGKTQGGLSILSKQFKTCGKLKTSS 282
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID-DPKTGNDVFAKLYGAASV 340
I++ + + F A + P + PV+ +C AID + K ++V ++
Sbjct: 283 EIKNLMDSVFTMAAQYNDPYEN---------PVRGICVAIDEEAKKKSNVIKQVVAGVIA 333
Query: 341 YYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTG--GDNKDSIFEESEEDYDARAR 398
Y ++ +DP L+++GWQ C+EM+M G G +K+S+F S ++
Sbjct: 334 YLGERPCYDVYEFGYPNDP--LNQYGWQVCSEMVMPIGSSGRDKNSMFPPSPFQFNDFKT 391
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
CK+ YGV PRP+WITT +GG I LVL RF SNIIF NGL+DP+S GG
Sbjct: 392 MCKDLYGVTPRPHWITTFYGGQDIKLVLHRFGSNIIFSNGLKDPYSSGG 440
>gi|260828789|ref|XP_002609345.1| hypothetical protein BRAFLDRAFT_128878 [Branchiostoma floridae]
gi|229294701|gb|EEN65355.1| hypothetical protein BRAFLDRAFT_128878 [Branchiostoma floridae]
Length = 489
Score = 330 bits (846), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 186/483 (38%), Positives = 275/483 (56%), Gaps = 34/483 (7%)
Query: 34 ITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNA 93
+ P S+ + YK +Y Q +D+FN+ QT+ Q+ L++D +W K
Sbjct: 12 VAPLTFVCCSSTPNPADTPYKVRYVEQYVDNFNFPSYGQQTYMQKVLVSDAYW--EKREG 69
Query: 94 PIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
PIF YTGNEG I F + +GF+ ++A KFKALLVF EHRYYG+S+P+ GN+ +N
Sbjct: 70 PIFFYTGNEGPITAFWEASGFVKELAAKFKALLVFAEHRYYGESLPF-GNQSFTKEN--- 125
Query: 154 TGYLSSTQALADYASLIIDLKKNLTATD---SPVVVFGGSYGGMLAAWFRLKYPHVAIGA 210
G LS QA+ADYA L+ L+ +L P++ FGGSYGGML+A+ R KYP++ GA
Sbjct: 126 IGLLSVEQAMADYARLMTALRTHLDCKSPDVCPIITFGGSYGGMLSAYMRFKYPNLVAGA 185
Query: 211 LASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQK 270
LA+SAPI + + F +T+DFR C ++ ++ ++EE GGL+++
Sbjct: 186 LAASAPIYLVAGLTEGHQFFQDVTEDFRKSDARCPLKVQSAYFEMEELGA--GGLKEISD 243
Query: 271 AFRICK---SEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKT 326
F++C +K+L+ + W+ +F AM DYP P++F LPA PV C I
Sbjct: 244 RFQLCSPLTDKKDLSHLYGWVRNSFTTLAMLDYPYPTDFEAKLPANPVNVACGLI---LN 300
Query: 327 GNDVFAKLYGAASVYYNYS-GTAKCFDLNGD----SDPHGL------SEWGWQACTEMIM 375
+D+ L AA + YN + GT +CFD+ + +DP G + W +QACTE+ +
Sbjct: 301 SSDLLKGLSQAAGLAYNGTDGTLECFDIFDEFVACADPTGCGLGDDSTAWDYQACTEVSL 360
Query: 376 LTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIF 435
L +N +F +ARA YC+ YGV PRP W+ +F G I +SNIIF
Sbjct: 361 LESTNNVTDMFPPDNYTAEARAEYCRTKYGVTPRPGWMGVQFWGKNI-----LSSSNIIF 415
Query: 436 FNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKW 495
NG DPW GGVL N+S +++A+ + GAHH+DLR + DP + R++E +IG+W
Sbjct: 416 SNGDLDPWRRGGVLTNLSSSLVAITIEGGAHHLDLRSTNPADPPSVTRARQQEEALIGQW 475
Query: 496 ISQ 498
IS+
Sbjct: 476 ISR 478
>gi|432876424|ref|XP_004073042.1| PREDICTED: dipeptidyl peptidase 2-like [Oryzias latipes]
Length = 480
Score = 330 bits (845), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 183/471 (38%), Positives = 262/471 (55%), Gaps = 37/471 (7%)
Query: 40 SSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYT 99
S+ S D + + KY +Q++DHFN+N +TF QRYLI D W +++ P+F YT
Sbjct: 22 SAFFQSRTDGEPQFTEKYFSQVVDHFNFNSLGNRTFNQRYLITDRFW--RRSSGPVFFYT 79
Query: 100 GNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSS 159
GNEGDI FA N+GF+ ++A + +AL++F EHRYYG+S+P+G N + G L+
Sbjct: 80 GNEGDIWEFALNSGFIMELAAQQEALVIFAEHRYYGRSLPFGNNS----FSIPEVGLLTV 135
Query: 160 TQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILN 219
QALADYA +I +LK L A SPV+ FGGSYGGML+ + RLKYP++ GALA+SAPIL+
Sbjct: 136 EQALADYALMITELKLQLGAAQSPVIAFGGSYGGMLSVYMRLKYPNIVAGALAASAPILS 195
Query: 220 FDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK--- 276
+ P F +T DF VS C + +++Q+ E A++ +Q +C+
Sbjct: 196 TAGLGDPRQFFRDVTADFERVSPACRGAVTAAFQQLREAAER-RDYSHIQAELSLCQPPS 254
Query: 277 -SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLY 335
++ + L AF AM DYP ++F+ LPA PV KTG+ + +
Sbjct: 255 SAQDVHQLYGLLRNAFTLMAMLDYPYSTHFMGSLPANPV----------KTGSGL-CVMS 303
Query: 336 GAASVYYNYSGTAKCFDLNG----DSDPHGLS------EWGWQACTEMIMLTGGDNKDSI 385
+ YN SG CFDL +DP G W +QACTE+ M +N +
Sbjct: 304 TKEWMVYNSSGLLPCFDLYSLYVECADPTGCGLGFNSLAWDYQACTEIEMCYESNNVTDM 363
Query: 386 FEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSG 445
F DAR YC + +GV PRP W+ +F G + ASNIIF NG DPW+
Sbjct: 364 FPAMTFTEDARQLYCSKRWGVQPRPGWLRLQFWGDDLSA-----ASNIIFSNGDLDPWAN 418
Query: 446 GGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
GGV ++S +++A+ GAHH+DLR S DP+ + R+ E E+I W+
Sbjct: 419 GGVRTSLSPSLIAINISGGAHHLDLRGSNAADPESVIKARKMEAELIAAWV 469
>gi|402895977|ref|XP_003911085.1| PREDICTED: dipeptidyl peptidase 2 [Papio anubis]
Length = 492
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 178/458 (38%), Positives = 265/458 (57%), Gaps = 29/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ ++ Q LDHFN+ QTF QR+L++D W + PIF YTGNEGD+ FA N+
Sbjct: 31 FQERFFQQRLDHFNFERFGNQTFPQRFLVSDRFW--IRGEGPIFFYTGNEGDVWAFANNS 88
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + ALLVF EHRYYGKS+P+G T L+ QALAD+A L+
Sbjct: 89 GFIAELAAEQGALLVFAEHRYYGKSLPFGERS----TQRGHTELLTVEQALADFAELLRA 144
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP+L + F
Sbjct: 145 LRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRD 204
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ ++ QI++ + G ++++ F C+ +K+L + +
Sbjct: 205 VTADFEGQSPKCTQGVREAFGQIKDLFLQ-GAYDRVRWEFGTCQPLSDKKDLTQLFMFAR 263
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++FL PLPA PVK C D + L A + YN SG+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGC---DRLLSEAQRITGLRALAGLVYNASGSE 320
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ +DP G W +QACTE+ + +N +F + + R +
Sbjct: 321 HCYDIYRLYRSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDERRQQ 380
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PRP+W+ T F G + + ASNIIF NG DPW+GGG+ +N+S +V+A
Sbjct: 381 YCLDTWGVWPRPDWLLTSFWGGDL-----KAASNIIFSNGNLDPWAGGGIRRNLSASVIA 435
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + AHH+DLR S EDP + + R+ E +IG+W+
Sbjct: 436 ITIQGAAHHLDLRASHPEDPVSVVEARKLEATVIGEWV 473
>gi|195091774|ref|XP_001997565.1| GH13951 [Drosophila grimshawi]
gi|193906104|gb|EDW04971.1| GH13951 [Drosophila grimshawi]
Length = 472
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 187/468 (39%), Positives = 266/468 (56%), Gaps = 21/468 (4%)
Query: 45 SSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD 104
+S S+ Y+ K LDHF++ S +F RYL N++ + N IF YTGNEGD
Sbjct: 17 TSAQSKFEYEIKEFQVPLDHFSF--LSNASFSIRYLYNES-YADKSNPKSIFFYTGNEGD 73
Query: 105 IEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALA 164
IEWFA+N+GF++++A K +A++VF EHRYYGKS+P+G + A K Y + Q L
Sbjct: 74 IEWFAKNSGFVWELAEKERAIVVFAEHRYYGKSLPFGSDTFNASK-PEHLAYFTVEQTLE 132
Query: 165 DYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIV 224
DYA LI L+ PVV FGGSYGGMLAAWFR+KYPH+ +GALA+SAPIL F+ +
Sbjct: 133 DYAMLITFLRN---GRQLPVVAFGGSYGGMLAAWFRIKYPHIVVGALAASAPILQFEGLT 189
Query: 225 SPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN--- 280
F+ I T F + + NC I SW E A G ++L F +C+ KN
Sbjct: 190 PCDIFNEITTSVFNTAYNANCSANIGKSWHAFETMAATDAGKKQLGDKFHLCEPIKNGDD 249
Query: 281 -LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAA 338
+L + AM++YP +NFL PLPA PV++ C + D D+ + + A
Sbjct: 250 LNQFMDYLEDVYANLAMSNYPYNTNFLAPLPANPVRQFCLYLKDLHADEADLVSSMASAL 309
Query: 339 SVYYNYSGTAKCFDL---NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
+VY NYSG+ C D N S S W Q+C +M+M ++ ++++ S D
Sbjct: 310 AVYTNYSGSDPCVDYKNSNASSATVDSSGWEIQSCNQMVMPFCANSSNTMYRTSTWDLKK 369
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
+ C + + + P+P I +GG + ASNIIF NGL DPWSGGGVL + +
Sbjct: 370 VSDKCYKKFQLTPKPFDIVLRYGGRDLVA-----ASNIIFSNGLLDPWSGGGVLMSPNDR 424
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
+ ++ EGAHH+DLR S DP + D R++E IIG+WIS++++ L
Sbjct: 425 IHIIILPEGAHHLDLRQSNPMDPASVVDARQKEAAIIGQWISEFYRYL 472
>gi|195034070|ref|XP_001988820.1| GH11372 [Drosophila grimshawi]
gi|193904820|gb|EDW03687.1| GH11372 [Drosophila grimshawi]
Length = 472
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 187/468 (39%), Positives = 266/468 (56%), Gaps = 21/468 (4%)
Query: 45 SSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD 104
+S S+ Y+ K LDHF++ S +F RYL N++ + N IF YTGNEGD
Sbjct: 17 TSAQSKFEYEIKEFQVPLDHFSF--LSNASFSIRYLYNES-YADKSNPKSIFFYTGNEGD 73
Query: 105 IEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALA 164
IEWFA+N+GF++++A K +A++VF EHRYYGKS+P+G + A K Y + Q L
Sbjct: 74 IEWFAKNSGFVWELAEKERAIVVFAEHRYYGKSLPFGSDTFNASK-PEHLAYFTVEQTLE 132
Query: 165 DYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIV 224
DYA LI L+ PVV FGGSYGGMLAAWFR+KYPH+ +GALA+SAPIL F+ +
Sbjct: 133 DYALLITFLRN---GRQLPVVAFGGSYGGMLAAWFRIKYPHIVVGALAASAPILQFEGLT 189
Query: 225 SPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN--- 280
F+ I T F + + NC I SW E A G ++L F +C+ KN
Sbjct: 190 PCDIFNEITTSVFNTAYNANCSANIGKSWHAFETMAATDAGKKQLGDKFHLCEPIKNGDD 249
Query: 281 -LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAA 338
+L + AM++YP +NFL PLPA PV++ C + D D+ + + A
Sbjct: 250 LNQFMDYLEDVYANLAMSNYPYNTNFLAPLPANPVRQFCLYLKDLHADEADLVSSMASAL 309
Query: 339 SVYYNYSGTAKCFDL---NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
+VY NYSG+ C D N S S W Q+C +M+M ++ ++++ S D
Sbjct: 310 AVYTNYSGSDPCVDYKNSNASSATVDSSGWEIQSCNQMVMPFCANSSNTMYRTSTWDLKK 369
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
+ C + + + P+P I +GG + ASNIIF NGL DPWSGGGVL + +
Sbjct: 370 VSDKCYKKFQLTPKPFDIVLRYGGRDLVA-----ASNIIFSNGLLDPWSGGGVLMSPNDR 424
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
+ ++ EGAHH+DLR S DP + D R++E IIG+WIS++++ L
Sbjct: 425 IHIIILPEGAHHLDLRQSNPMDPASVVDARQKEAAIIGQWISEFYRYL 472
>gi|339245559|ref|XP_003378705.1| putative serine protease pcp-1 [Trichinella spiralis]
gi|316972372|gb|EFV56050.1| putative serine protease pcp-1 [Trichinella spiralis]
Length = 826
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 187/475 (39%), Positives = 270/475 (56%), Gaps = 56/475 (11%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF++ ++ TF+ RYLIN +W + + PIF Y GNEG +E FA+NTGFM++ A
Sbjct: 35 VDHFSF--ENSDTFRLRYLINTENW--NSDGGPIFFYCGNEGSVEGFAENTGFMWENAKD 90
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-AT 180
F A++VF EHRYYGKS+P+G +++S G L+S QA+ADYA LI LK N+T A
Sbjct: 91 FGAMVVFAEHRYYGKSLPFGN------ESSSNLGKLNSEQAMADYAVLINWLKTNITGAK 144
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
S V+ FGGSYGGMLAAW R KYPH+ GA+A+SAP+ F + SFS+I T+ +R+
Sbjct: 145 SSAVIAFGGSYGGMLAAWMRTKYPHLVDGAIAASAPVAQFSGMTVCSSFSDITTEVYRNA 204
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK-----SEKNLA-IESWLSTAFVYT 294
S +C IK SW I + K G L K FR+C S+KN+ + +WL+ +
Sbjct: 205 SPSCALSIKRSWPIIRKWGKTAEGRLDLAKMFRLCNETQFTSKKNVTQLVNWLTDIYGTL 264
Query: 295 AMTDYPTPSNFLNPLPAFPVKE-------------------------------MCKAIDD 323
AM +YP + FL P+PA+PVK C+ ++D
Sbjct: 265 AMVNYPYATEFLKPVPAWPVKASDEDSVCTVVVGLFFFLIIHPRACICKRICVACQFLND 324
Query: 324 PKTG-NDVFAKLYGAASVYYNYSGTAKCFDLNGD-SDPHGLSEWGWQACTEMIMLTGGDN 381
+ G ++ ++Y S+Y N++G C L D D W +QACTEM+M +
Sbjct: 325 TEVGETELLHRIYSTISIYTNFTGKKPCNLLENDYGDSVDGKLWDYQACTEMVMPM-CNT 383
Query: 382 KDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRD 441
KDS+FE+S+ + + C E + V PRP+W +GG K+ A+N++F NG D
Sbjct: 384 KDSMFEQSDWNLTEFSDECFEKFKVRPRPDWAIINYGGRKL-----ESATNVVFSNGWLD 438
Query: 442 PWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
PW GGG++ + + V AL+ ++GAHH DLR S D ++ VR E+ + KWI
Sbjct: 439 PWRGGGIVNSHFRGVAALIVEDGAHHYDLRGSNSADTASVQTVRLLELGFMRKWI 493
>gi|156403949|ref|XP_001640170.1| predicted protein [Nematostella vectensis]
gi|156227303|gb|EDO48107.1| predicted protein [Nematostella vectensis]
Length = 446
Score = 325 bits (834), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 193/462 (41%), Positives = 267/462 (57%), Gaps = 35/462 (7%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+KT Q +DHFN+ TF+QRYL + +W G PIF Y+GNEG I F +N+
Sbjct: 3 FKTGTFEQTVDHFNFIQSG--TFKQRYLYTEKYWDG---KGPIFFYSGNEGGITGFWENS 57
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+++ A F AL++F EHRYYG+S+P+G + ++K GYLS QALAD+A+LI
Sbjct: 58 GFVFEAAKNFSALVIFGEHRYYGESLPFGQD---SFK-IENIGYLSIEQALADFATLIPA 113
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LKK A + PVV FGGSYGGML+A+ R KYP+V ALA+SAPI ++ F
Sbjct: 114 LKKQFKAEEKPVVSFGGSYGGMLSAYLRFKYPNVIQAALAASAPIYFIADLSIRDFFFPA 173
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPG--GLEKLQKAFRICKSEKNL----AIESW 286
+T+DF++ C +++ + +++ KK G GL+ + KAF++CK K+ + W
Sbjct: 174 VTRDFKNADPKCPDLVRAGFIELDNL-KKEGLKGLDAISKAFKLCKPLKSADQINHLIGW 232
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYN-YS 345
+ AF AM DYP ++FL PLPA PV CK + T +D + L AA + YN S
Sbjct: 233 IRNAFTIIAMCDYPYATDFLAPLPANPVNYACKLL---ATASDRLSGLADAAGLAYNGTS 289
Query: 346 GTAKCFDLNGD----SDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
GT KCFD + +DP G W +QACTE+ M G +N +F D
Sbjct: 290 GTLKCFDPWTEFVECADPTGCGLGNANLAWDYQACTELPMPAGTNNVTDMFPVLPWTLDM 349
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
RA YC++ + V PR W G I ASNIIF NG DPW GGVLK++S +
Sbjct: 350 RADYCQKHWQVKPRLEWPGISLWGRDIST-----ASNIIFSNGNLDPWRPGGVLKSVSPS 404
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
++A++ + GAHH+DLR S EDP + R E+E+I KWIS
Sbjct: 405 LVAVLVEGGAHHLDLRSSNPEDPPSVVAAREMELELIRKWIS 446
>gi|405944913|pdb|4EBB|A Chain A, Structure Of Dpp2
gi|405944914|pdb|4EBB|B Chain B, Structure Of Dpp2
Length = 472
Score = 325 bits (832), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 179/458 (39%), Positives = 262/458 (57%), Gaps = 29/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ ++ Q LDHFN+ +TF QR+L++D W + PIF YTGNEGD+ FA N+
Sbjct: 5 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFW--VRGEGPIFFYTGNEGDVWAFANNS 62
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
F+ ++A + ALLVF EHRYYGKS+P+G T L+ QALAD+A L+
Sbjct: 63 AFVAELAAERGALLVFAEHRYYGKSLPFGAQS----TQRGHTELLTVEQALADFAELLRA 118
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D+P + FGGSYGG L+A+ R KYPH+ GALA+SAP+L + F
Sbjct: 119 LRRDLGAQDAPAIAFGGSYGGXLSAYLRXKYPHLVAGALAASAPVLAVAGLGDSNQFFRD 178
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + ++ F C+ EK+L + +
Sbjct: 179 VTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFXFAR 237
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF A DYP P++FL PLPA PVK C D + L A + YN SG+
Sbjct: 238 NAFTVLAXXDYPYPTDFLGPLPANPVKVGC---DRLLSEAQRITGLRALAGLVYNASGSE 294
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ + +DP G W +QACTE+ + +N F + + R R
Sbjct: 295 HCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDXFPDLPFTDELRQR 354
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+ +N+S +V+A
Sbjct: 355 YCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIA 409
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + GAHH+DLR S EDP + + R+ E IIG+W+
Sbjct: 410 VTIQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 447
>gi|30688698|ref|NP_197677.2| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|332005706|gb|AED93089.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 439
Score = 324 bits (830), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 174/408 (42%), Positives = 249/408 (61%), Gaps = 17/408 (4%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
S+ K + K Y Q LDHF + P+SY TFQQRY I+ THWGG+K NAPI + G E
Sbjct: 45 STQKVDESNLKMYYFNQTLDHFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEES 104
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
++ GF+ D P+ ALLV+IEHRYYG+++P+G +E A KNAST GYL++ QAL
Sbjct: 105 SLDSDLAAIGFLRDNGPRLNALLVYIEHRYYGETMPFGSAEE-ALKNASTLGYLNAAQAL 163
Query: 164 ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
ADYA++++ +K+ + SP++V GGSYGGMLAAWFRLKYPH+A+GALASSAP+L F++
Sbjct: 164 ADYAAILLHVKEKYSTNHSPIIVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDT 223
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLA 282
+ + I+T+ F+ SE CY I+ SW +I+ A KP GL L K F+ C +
Sbjct: 224 RPKFGYYYIVTKVFKEASERCYNTIRNSWIEIDRVAGKPNGLSILSKQFKTCAPLNGSFD 283
Query: 283 IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI--DDPKTGNDVFAKLYGAASV 340
I+ +L T +Y Y N P F V ++C AI + P ++ +++
Sbjct: 284 IKDFLDT--IYAEAVQY-------NRGPNFWVAKVCNAINANPPNRRYNLLDRIFAGVVA 334
Query: 341 YYNYSGTAKCFDLNGDSDPHGLS-EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
G C+D + P + W WQ+C+E++M G D +D++F + + +
Sbjct: 335 LV---GNRTCYDTKMFAQPTNNNIAWRWQSCSEIVMPVGYDKQDTMFPTAPFNMTSYIDG 391
Query: 400 CKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
CK +GV PRP+WITT FG ++ L+L++F SNIIF NGL DP+S GG
Sbjct: 392 CKSYHGVTPRPHWITTYFGIQEVKLILQKFGSNIIFSNGLSDPYSVGG 439
>gi|221044030|dbj|BAH13692.1| unnamed protein product [Homo sapiens]
Length = 391
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 165/392 (42%), Positives = 239/392 (60%), Gaps = 14/392 (3%)
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
M+DVA + KA+LVF EHRYYG+S+P+G N ++K++ +L+S QALAD+A LI LK
Sbjct: 1 MWDVAEELKAMLVFAEHRYYGESLPFGDN---SFKDSRHLNFLTSEQALADFAELIKHLK 57
Query: 175 KNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
+ + A + PV+ GGSYGGMLAAWFR+KYPH+ +GALA+SAPI F+++V F I+
Sbjct: 58 RTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIV 117
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTA 290
T DFR +C + I SW I + GL+ L A +C S+ ++ W+S
Sbjct: 118 TTDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISET 177
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAK 349
+V AM DYP SNFL PLPA+P+K +C+ + +P + + + ++ A +VYYNYSG K
Sbjct: 178 WVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVK 237
Query: 350 CFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDP 408
C +++ + G W +QACTE++M + D +FE + + C + +GV P
Sbjct: 238 CLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGVRP 297
Query: 409 RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHV 468
RP+WITT +GG I +NI+F NG DPWSGGGV K+I+ T++A+ EGAHH+
Sbjct: 298 RPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKDITDTLVAVTISEGAHHL 352
Query: 469 DLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
DLR DP + R EV + WI ++
Sbjct: 353 DLRTKNALDPMSVLLARSLEVRHMKNWIRDFY 384
>gi|431899040|gb|ELK07410.1| Dipeptidyl-peptidase 2 [Pteropus alecto]
Length = 521
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 183/490 (37%), Positives = 276/490 (56%), Gaps = 43/490 (8%)
Query: 47 KDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE 106
+D ++ Y Q+LDHFN+ +TF QR+LI+D W + P+F YTGNEGD+
Sbjct: 30 RDPDPNFREDYFEQLLDHFNFERFGNKTFLQRFLISDKFW--KRGEGPLFFYTGNEGDVW 87
Query: 107 WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADY 166
+FA N+ F+ ++A + +AL+VF EHRYYGKS+P+G T L+ QALAD+
Sbjct: 88 FFANNSRFILELAMQQEALVVFAEHRYYGKSLPFGEQS----TQRGHTELLTVEQALADF 143
Query: 167 ASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSP 226
A L+ L+++ A D P + FGGSYGGML+A+ R+KYPH+ GALA+SAP+++ +
Sbjct: 144 ARLLRSLRQDFKARDVPAIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVSVAGLSDS 203
Query: 227 YSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA- 282
+ F ++ F + S C + ++ +++QI++ + G E+L + F C+ K+LA
Sbjct: 204 HQFFRDLSVIFENQSPECAQGVRDAFRQIKDLFLQ-GAYEELSREFGTCQLVTDWKSLAQ 262
Query: 283 IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKE----MCKAIDDPKTGNDVFAK----- 333
+ + AF+ AM +YP P++F LPA PV+ G V K
Sbjct: 263 LFGFARNAFIMLAMLNYPYPTDFTGHLPANPVQARPPLQGTGRGPAALGQSVGCKRLLSE 322
Query: 334 ------LYGAASVYYNYSGTAKCFDL------NGDSDPHGLS----EWGWQACTEMIMLT 377
L A + YN SGT C+++ D+ GL W +QACTEM +
Sbjct: 323 THRIRGLQALAGLLYNSSGTEPCYNIYQQYQACADATGCGLGPNAKAWDYQACTEMNLAF 382
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFN 437
+N+ IF + + D R +YC+E +GV PR +W+ T FGG + R ASNIIF N
Sbjct: 383 SSNNRTDIFPDLQFTEDLRQQYCQETWGVYPRRDWLLTNFGGADL-----RAASNIIFSN 437
Query: 438 GLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G DPW+GGG+ N+S +VLA+ GAHH+DLR S EDP +++ R+ E +I KW++
Sbjct: 438 GDLDPWAGGGIRSNLSASVLAITIHGGAHHLDLRASHPEDPMSVREARKLEATVIHKWVT 497
Query: 498 --QYFQDLAQ 505
++ Q L Q
Sbjct: 498 AARHKQQLQQ 507
>gi|307108156|gb|EFN56397.1| hypothetical protein CHLNCDRAFT_144962 [Chlorella variabilis]
Length = 496
Score = 322 bits (825), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 187/476 (39%), Positives = 267/476 (56%), Gaps = 48/476 (10%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW--GGSKNNAPIFVYTGNEGDIEWFAQN 111
K ++ LDHF + +Y F+QR+ + D +W GGS +F+Y GNE D+ + N
Sbjct: 18 KERWRETRLDHFTWVNPTY--FKQRFFVCDEYWRPGGS-----VFLYIGNEADVTLYLNN 70
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
TG M+++APK+ A+LVF EHRYYG+S P+ + + K+ + +L+S QA+ DYA+L+
Sbjct: 71 TGLMWELAPKYDAMLVFAEHRYYGQSKPFPAS--VLRKHMA---WLTSEQAMGDYATLLW 125
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALA-SSAPILNFDNIVSPY--- 227
+LK+ L D PV+ FGGSYGGML WFR+KYPH+ G +A S+API + PY
Sbjct: 126 ELKRELGDPDVPVIGFGGSYGGMLGTWFRMKYPHLVDGVIAGSAAPIWTYKGENPPYDPG 185
Query: 228 SFSNIITQDFR---SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC------KSE 278
SF+ I+TQD +E C ++ +WK ++ G + + A R+C E
Sbjct: 186 SFAKIVTQDASPEGGSAEACADNVRAAWKLLDRWGSSEEGRQSISAAMRLCPESAVESGE 245
Query: 279 KNLAIESWLSTAFVY-----TAMTDYPTPSNFL-----NPLPAFPVKEMCKAIDDPK-TG 327
A+ W S+A+ Y AM +YP PS ++ PLPAFPV+ C + +P G
Sbjct: 246 DATALCDWASSAWDYLASAARAMGNYPYPSVYIVNGAQPPLPAFPVRVACGHLAEPGLDG 305
Query: 328 NDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE---WGWQACTEMIMLTGGDNKDS 384
+ L AA V+YN++G CF +P + WG+Q CTE + D
Sbjct: 306 EALLEGLARAAGVFYNHTGDLPCFSFKQGPNPETDEDADFWGYQYCTEQFQVFSKDGVHD 365
Query: 385 IFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS 444
+F E A + CK+ +GV+PRP W T E+GG ++G ASNI+F NGL DPWS
Sbjct: 366 MFWEEPFSTKAAIQDCKDGWGVEPRPLWATIEWGGKRLGA-----ASNIVFSNGLLDPWS 420
Query: 445 GGGVLKNISKT--VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
GGGVL NIS+ ++A+V EGAHH+DL FS DP + R E I KWI+Q
Sbjct: 421 GGGVLANISQANDLVAVVIPEGAHHLDLMFSHPLDPPSVTAARAFEEHYIAKWIAQ 476
>gi|348574464|ref|XP_003473010.1| PREDICTED: dipeptidyl peptidase 2-like [Cavia porcellus]
Length = 507
Score = 319 bits (818), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 183/461 (39%), Positives = 267/461 (57%), Gaps = 35/461 (7%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+ Y Q+LDHFN+ +TF QR+L++D W K PIF YTGNEGD+ FA N+
Sbjct: 42 YREYYFEQLLDHFNFESYGNKTFHQRFLMSDKFWKQPK--GPIFFYTGNEGDVWVFANNS 99
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + +ALL+F EHRYYGKS+P+G L+ QALAD+A L+
Sbjct: 100 GFLVELAQQQEALLIFAEHRYYGKSLPFGAQS----TQHGFMQLLTVEQALADFAVLLQV 155
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A DSP + FGGSYGGML+A+ R+KYPH+ GALA+SAP++ +V Y F
Sbjct: 156 LRQDLCAQDSPTITFGGSYGGMLSAYMRIKYPHLVAGALAASAPVVAVAGLVDSYQFFRD 215
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAIESWLS 288
+T DF S S C + ++ ++++I + G E++ + F C+ SE + +
Sbjct: 216 VTADFYSQSPKCVQAVREAFQEIRNLYLQ-GAHERISREFGTCQLLSGSEDLTQLFMFAR 274
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI---DDPKTGNDVFAKLYGAASVYYNYS 345
AF AM DYP ++FL PLPA PVK C + P TG L A + YN S
Sbjct: 275 NAFTVLAMMDYPYHTDFLVPLPANPVKVGCDILLNEAQPITG------LRMLAGMIYNTS 328
Query: 346 GTAKCFDL----NGDSDPHGLSE------WGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
G C+D+ + +DP G W +QACTE+ + +N +F +
Sbjct: 329 GMEHCYDIYQLYHSCADPTGCGSGSDAQAWDYQACTEINLTFSSNNVSDMFPTLLFTEEL 388
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
R +YC E +GV PRPNW+ T FGG + + A+ IIF NG DPW+GGG+ +N+S++
Sbjct: 389 REQYCLEKWGVWPRPNWLQTSFGGGDL-----KGATKIIFSNGDLDPWAGGGIHRNLSES 443
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
V+A++ + GAHH+DLR S EDP + + R+ E +I +W+
Sbjct: 444 VIAVMIQGGAHHLDLRASHPEDPASVVEARKLEAGLIWEWV 484
>gi|238006194|gb|ACR34132.1| unknown [Zea mays]
gi|413943526|gb|AFW76175.1| putative serine peptidase S28 family protein [Zea mays]
Length = 372
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 170/332 (51%), Positives = 217/332 (65%), Gaps = 23/332 (6%)
Query: 21 SNAKIFPTFPSSRITPEKLSSLISSSKD--------SQGLYKTKYHTQILDHFNYNPQSY 72
SN+ P F + +L SSS + S + Y Q+LDHF + P +
Sbjct: 44 SNSNPRPAFTPPLLKRHQLRRPSSSSAELVAGPANASTKPFTAHYFPQLLDHFAFTPNAS 103
Query: 73 QTFQQRYLINDTHW------GGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALL 126
F+ +YL+NDT W G P+FVYTGNEGDIEWFA NTGFM+D+AP F ALL
Sbjct: 104 TVFRHKYLLNDTFWRRPGAGAGDDGPGPLFVYTGNEGDIEWFATNTGFMFDIAPTFGALL 163
Query: 127 VFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVV 186
VFIEHR+YG+S P+G + +Y++A T GYL+STQALAD+A +I LK++L A +PVVV
Sbjct: 164 VFIEHRFYGESKPFGND---SYRSAETLGYLTSTQALADFAVVIRGLKRDLGAEAAPVVV 220
Query: 187 FGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYK 246
FGGSYGGMLA+WFRLKYPHVAIGALASSAPIL FD+I SF + ++QDF+S S NC+
Sbjct: 221 FGGSYGGMLASWFRLKYPHVAIGALASSAPILQFDHITPWSSFYDAVSQDFKSESSNCFG 280
Query: 247 VIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK-NLAIESWLSTAFVYTAMTDYPTPSNF 305
VI+ +W ++E GL L K FR CK+ K +I +WL TAF YTAM DYPTP+NF
Sbjct: 281 VIRAAWDVLDERGATDKGLLDLSKLFRACKTVKYAYSIRNWLWTAFTYTAMVDYPTPANF 340
Query: 306 LNPLPAFPVKEMCKA-----IDDPKTGNDVFA 332
L LPA+PVKE+ +D T ++ FA
Sbjct: 341 LENLPAYPVKEVVHLSAKVNLDSSSTDHEAFA 372
>gi|302854868|ref|XP_002958938.1| hypothetical protein VOLCADRAFT_70104 [Volvox carteri f.
nagariensis]
gi|300255730|gb|EFJ40018.1| hypothetical protein VOLCADRAFT_70104 [Volvox carteri f.
nagariensis]
Length = 557
Score = 318 bits (816), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 177/451 (39%), Positives = 249/451 (55%), Gaps = 34/451 (7%)
Query: 74 TFQQRYLINDTHW---GGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIE 130
TF QR+ + HW G ++ PIF Y GNE D+ + NTG M++ AP F+A+LVF E
Sbjct: 1 TFLQRFFVCANHWRRRGPDGSSGPIFFYLGNEADVTLYLNNTGLMWEGAPDFEAMLVFAE 60
Query: 131 HRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGS 190
HRYYG+S+PYG N GYL + QA+ADYA LI+++K+ A S V+ FGGS
Sbjct: 61 HRYYGESVPYGKNVR------KHMGYLMAEQAMADYAELIMEIKEEYDAEGSAVIGFGGS 114
Query: 191 YGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY---SFSNIITQDFRSVSEN---C 244
YGGMLAAW RLKYPH GA+A+SAPI NF P+ SF+ +T D ++ + C
Sbjct: 115 YGGMLAAWMRLKYPHALDGAIAASAPIWNFLGETPPFDSGSFAKGVTYDASELAGSAPAC 174
Query: 245 YKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS------EKNLAIESWLSTAFVYTAMTD 298
++ +W + G L A ++C + + + WL+ A+ AM +
Sbjct: 175 IDNVRATWGLLRIYGGDEHGRAFLADALQLCSGAGPQNEDDVILLREWLAAAWDMMAMGN 234
Query: 299 YPTPSNFL----NPLPAFPVKEMCKAID-DPKTGNDVFAKLYGAASVYYNYSGTAKCFDL 353
+P PS+++ LPAFPV+ C +D D D+ + A V+YN++G CFD
Sbjct: 235 FPYPSSYITNGHGRLPAFPVRVACSYLDSDSFNDEDLLEAMSLAVGVFYNHTGDVSCFDP 294
Query: 354 NGDSDP---HGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
+DP H W +Q C EM+M D +F D A C++ +G+ PRP
Sbjct: 295 LSGTDPDSDHDADFWDFQWCAEMLMPFSKDGASDMFWREPFDLAAATSACQQHWGISPRP 354
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
TTE+GG +I SNI+F NGL DPW GGGVL++IS ++ A++ EGAHH+DL
Sbjct: 355 LRATTEWGGRRISA-----GSNIVFSNGLLDPWHGGGVLEDISDSMPAVIIPEGAHHLDL 409
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
FS +DP + +VRR + E I WI Q
Sbjct: 410 MFSHPDDPLSVVEVRRFQREAIRDWIEMRLQ 440
>gi|66804725|ref|XP_636095.1| peptidase S28 family protein [Dictyostelium discoideum AX4]
gi|60464439|gb|EAL62586.1| peptidase S28 family protein [Dictyostelium discoideum AX4]
Length = 513
Score = 316 bits (810), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 182/460 (39%), Positives = 258/460 (56%), Gaps = 30/460 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNN-----APIFVYTGNEGDIEW 107
Y+ + Q LDHFN+ QS F QRYL++D +W N PI YTGNEGDI
Sbjct: 62 YQELFFLQTLDHFNF--QSKGEFAQRYLVSDVYWKKPSPNDKVCQGPILFYTGNEGDITL 119
Query: 108 FAQNTGFMYDV-APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADY 166
F N+ F+ +V A + ALL+F EHRYYG+S+P+G + + GYL+S QALADY
Sbjct: 120 FYDNSQFVTNVLAQEMNALLIFAEHRYYGESLPFGNDS----WTSDNIGYLTSEQALADY 175
Query: 167 ASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI-VS 225
A LI + + A PV+ GGSYGGML AWFR+KYP++ GALA+SAPIL+F N V+
Sbjct: 176 AQLIPAVLSEMGAEHCPVLSVGGSYGGMLTAWFRMKYPNIVDGALAASAPILSFLNTGVN 235
Query: 226 PYSFSNIITQDFRSVSE--NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKN 280
P +F+ I T DF+ S C I+ + I + + GL +L K F +C ++ N
Sbjct: 236 PETFNKIATDDFKDTSSEGTCASRIRSALNDIVTISTQSNGLAQLSKTFSVCGAPLTDVN 295
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV 340
I +W+ +A Y AM DYP P+NFL P+P +P+ C A+ + D L V
Sbjct: 296 DLI-NWIESALTYMAMADYPYPANFLEPMPGYPINVSCSALAQQE---DDIQGLLEVLHV 351
Query: 341 YYNYSGTA-KCFDLNG-DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
YYNY+G A C++++ + G + W +QACTEM+M D + F S +
Sbjct: 352 YYNYTGQAGTCYNMSVFTTGALGDASWNYQACTEMVMPVSSDGVNDFFPPSPFSLSDLTQ 411
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
C++ + P P WITT +GG A+NIIF NG+ D W GG+L+ S +++A
Sbjct: 412 QCQQQFQTTPDPYWITTYYGGSNFS------ATNIIFSNGVLDVWRSGGILETRSDSIVA 465
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
L + GAHH+DLR+ DP + R E +++ W S+
Sbjct: 466 LTIEGGAHHLDLRYPNPLDPPSVTQAREIESKLLQLWASE 505
>gi|417411236|gb|JAA52063.1| Putative dipeptidyl peptidase 2, partial [Desmodus rotundus]
Length = 502
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 172/463 (37%), Positives = 262/463 (56%), Gaps = 35/463 (7%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ Y Q LDHFN+ QTF QR+L+++ W + P+F YTGNEGD+ FA N+
Sbjct: 28 FRESYFEQFLDHFNFERFGNQTFPQRFLVSEKFW--KRGEGPLFFYTGNEGDVWAFANNS 85
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY---LSSTQALADYASL 169
GF+ ++A + AL+VF EHRYYGKS+P+G + ++ G+ L+ QALAD+A L
Sbjct: 86 GFILELAAQQGALVVFAEHRYYGKSLPFG-------ERSTQRGHVELLTVEQALADFARL 138
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
+ L+++L A D P V FGGSYGGML+A+ R+KYPH+ GALA+SAP++ + Y F
Sbjct: 139 LQALRRDLGAQDVPAVAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVLVAGLGDAYQF 198
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS----EKNLAIES 285
++ DF C + ++ +++QI++ G + + +AF +C+ + + +
Sbjct: 199 FRDVSLDFEGQGPKCAQGVRDAFRQIKDLFLL-GAYDVVSQAFGLCRPLSGWKDLVQLFG 257
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYS 345
+ AF AM DYP P++FL LPA PV+ C D +D L A + YN S
Sbjct: 258 FARNAFTVLAMMDYPYPTDFLGHLPANPVQVAC---DRLLNESDRIEGLRALAGLVYNSS 314
Query: 346 GTAKCFDL----NGDSDPHGLSE------WGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
G C+D+ +DP G W +QACTE+ + +N +F E +
Sbjct: 315 GEQSCYDVYLQYRACADPTGCGSGPDARAWDYQACTEINLAFSSNNLTDLFPELLFTEEL 374
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
R +YC + +GV PR +W+ T FGG + + ASNIIF NG DPW+ GG+ N+S +
Sbjct: 375 RQQYCLDTWGVWPRRDWLHTSFGGADL-----KAASNIIFSNGDLDPWARGGIQSNLSAS 429
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+LA+ GAHH+DLR S DP + + R+ E I +W+++
Sbjct: 430 ILAITIHGGAHHLDLRASHPADPMSVLEARQMEATFIRQWVAE 472
>gi|241753992|ref|XP_002401170.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215508367|gb|EEC17821.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 493
Score = 315 bits (806), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 174/456 (38%), Positives = 259/456 (56%), Gaps = 19/456 (4%)
Query: 50 QGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA 109
Q YK + +DHF ++ S TF RY + D +W + PIF YTGNE IE F
Sbjct: 25 QVTYKMRTFRTKIDHFTFH--SSDTFVMRYAVADQYW--DFDGGPIFFYTGNENAIENFI 80
Query: 110 QNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL 169
+TG M++ AP+FKA+LVF EHR+YG+S+P+G + ++ GYLS+ Q LADYA L
Sbjct: 81 NHTGLMWEWAPEFKAMLVFAEHRFYGESMPFGNR---SLESPHHLGYLSTDQVLADYADL 137
Query: 170 IIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS 228
II LK+++ A++SPV+ FGGSYGGML+AW R++YPH+ +LASSAP+ F +V S
Sbjct: 138 IIHLKESVRGASESPVISFGGSYGGMLSAWIRIRYPHLVSASLASSAPVHMFPGLVPCSS 197
Query: 229 FSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS---EKNLAIES 285
+ ++T+ FR S C I+ SW +E G + +Q F +C+S E +
Sbjct: 198 LNRVLTETFRRESPVCSATIRKSWPILEAKFSTVEGRKSIQDKFHLCQSLQEEDYVTFRD 257
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND-VFAKLYGAASVYYNY 344
+L + A+ +Y PS FL PLP +PVKE CK + ++ + + ++++N
Sbjct: 258 FLHDVYSNMALVNYADPSVFLTPLPGYPVKESCKFLTRSFVNDEAIVDAVSQVVNIFFNT 317
Query: 345 SGTAKCFDLNGDSDPHG--LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKE 402
+GT +C D+N D L W +Q CTE++M T D +F + + C++
Sbjct: 318 TGTRQCNDINAFHDVLNSVLIAWDFQGCTELVMPTCSDGLTDMFYPLIWNVTETIQDCQQ 377
Query: 403 AYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
+ V P +GG + ASNIIF NG DPW+G G++++IS TV+A+V
Sbjct: 378 RFNVTPDLYKAVMTYGGRNM-----ESASNIIFSNGDADPWAGVGLMESISDTVVAIVIP 432
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
AHH DLRFS++E+P +K R E + I WI +
Sbjct: 433 GAAHHYDLRFSSREEPLAVKAARGLEKKYIRDWIRK 468
>gi|395844464|ref|XP_003794980.1| PREDICTED: dipeptidyl peptidase 2 [Otolemur garnettii]
Length = 558
Score = 314 bits (805), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 178/461 (38%), Positives = 265/461 (57%), Gaps = 35/461 (7%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ +Y Q+LDHFN+ +TF QR+L++D W + PIF YTGNEGD+ FA ++
Sbjct: 99 FGERYFEQLLDHFNFERFGNKTFPQRFLVSDKFW--DRAEGPIFFYTGNEGDVWSFANHS 156
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY---LSSTQALADYASL 169
GF+ ++A + ALLVF EHRYYGKS+P+G K ++ GY L+ QALAD+A L
Sbjct: 157 GFIVELAAQEAALLVFAEHRYYGKSLPFG-------KRSTQRGYMELLTVEQALADFAVL 209
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
+ L+++L A D+P + FGGSYGGML+A+ R+KYPH+ +GALA+SAP++ + P F
Sbjct: 210 LQALQRDLGAQDAPTIAFGGSYGGMLSAYLRMKYPHLVVGALAASAPVIAAAGLGDPDQF 269
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWL-- 287
+T F S S C + ++ +++QI + + G + + + F C+ ++ L
Sbjct: 270 FRDVTAAFESQSPKCAQGVRDAFQQIRDLFLQ-GDYDTVSREFSTCQPLSGPKDQTQLFV 328
Query: 288 --STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYS 345
AF AM +YP ++F+ LPA PVK C + + L A + YN S
Sbjct: 329 FTRNAFTVLAMMNYPYATDFMGHLPANPVKVGCNLL---LSEAQRITGLRALAGLVYNAS 385
Query: 346 GTAKCFDL----NGDSDPHGLSE------WGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
GT C+D+ + +DP G W +QACTE+ + +N +F
Sbjct: 386 GTEPCYDIYQLYHSCADPTGCGSGPDAKAWDYQACTEISLTFASNNVTDMFPVLPFTEAQ 445
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
R +YC+EA+GV PRP+W+ T F G + + SNIIF NG DPW+GGG+ +N+S +
Sbjct: 446 RQQYCQEAWGVWPRPDWLHTNFWGGDL-----KATSNIIFSNGDLDPWAGGGIQQNLSAS 500
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
V+A+ GAHH+DLR S EDP + + RR E +IGKW+
Sbjct: 501 VVAITIPGGAHHLDLRASHPEDPHSVVEARRLEAALIGKWV 541
>gi|334312261|ref|XP_001374504.2| PREDICTED: dipeptidyl peptidase 2-like [Monodelphis domestica]
Length = 513
Score = 314 bits (805), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 176/461 (38%), Positives = 258/461 (55%), Gaps = 37/461 (8%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ +Y QILDHFN+ TF QR+L+ + W K PIF YTGNE DI FA N+
Sbjct: 68 FQERYFEQILDHFNFESYGSSTFLQRFLVTEKFW--KKGTGPIFFYTGNEADIWAFANNS 125
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
F+ ++A +AL++F EHRYYGKS+P+G TG L+ QALAD+A LI
Sbjct: 126 NFILELAAVEEALVIFAEHRYYGKSLPFGDQS----TRKGNTGLLTVEQALADFAVLIQT 181
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LKK D PV+ FGGSYGGML+A+ R+KYP++ GALA+SAP+++ I + F
Sbjct: 182 LKKEYE--DVPVITFGGSYGGMLSAYMRMKYPNLVAGALAASAPVVSIAGIGNSSQFFRD 239
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWL----S 288
+T DF + S C + ++ +++ I + + +++ + C N + + L
Sbjct: 240 VTTDFENHSPKCAQRVREAFRMIRDLYLEQA-FDRIHQDMGTCTQPSNDSAITQLFEFAR 298
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI---DDPKTGNDVFAKLYGAASVYYNYS 345
AF +M DYP P++F+ PA PVK C + ++P G L A + YN S
Sbjct: 299 NAFTMISMMDYPYPTDFMGHFPANPVKVGCDRLLRAENPIQG------LRALAGLLYNAS 352
Query: 346 GTAKCFDL----NGDSDPHGL------SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
GT CFD+ +DP G W +QACTE+ + +N +F E D
Sbjct: 353 GTEPCFDIYQLYQKCADPTGCGTGPSAEAWDYQACTEINLTFDSNNITDMFPEIPFTSDL 412
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
R +YC +GV PR +W+ T FGG+ + + ASNIIF NG DPW+GGG+ N+S +
Sbjct: 413 REKYCFARWGVQPRKSWMLTNFGGNNL-----KAASNIIFSNGDLDPWAGGGIKTNLSSS 467
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+++L + GAHH+DLR S DP+ + +VR+ E E I +W+
Sbjct: 468 LISLTIRGGAHHLDLRGSNPADPESVIEVRKLEAEYIHEWV 508
>gi|9858825|gb|AAG01154.1|AF285235_1 quiescent cell proline dipeptidase precursor [Mus musculus]
Length = 506
Score = 314 bits (804), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 181/459 (39%), Positives = 267/459 (58%), Gaps = 29/459 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ Y Q +DHFN+ +TF QR+L++D W PIF YTGNEGDI FA N+
Sbjct: 41 FHENYFEQYMDHFNFESFGNKTFGQRFLVSDKFW--KMGEGPIFFYTGNEGDIWSFANNS 98
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM ++A + +ALLVF EHRYYGKS+P+G + T L+ QALAD+A L+
Sbjct: 99 GFMVELAAQQEALLVFAEHRYYGKSLPFG----VQSTQRGYTQLLTVEQALADFAVLLQA 154
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP++ + Y F
Sbjct: 155 LRQDLGVHDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVAVAGLGDSYQFFRD 214
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + + + F C+ S K+L + +
Sbjct: 215 VTADFYGQSPKCAQAVRDAFQQIKDLFLQ-GAYDTISQNFGTCQSLSSPKDLTQLFGFAR 273
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++FL PLPA PVK C+ + + G + L A + YN SGT
Sbjct: 274 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCQRLLN--EGQRIMG-LRALAGLVYNSSGTE 330
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ +DP G W +QACTE+ + +N +F E + R +
Sbjct: 331 PCYDIYRLYQSCADPTGCGTGSDARAWDYQACTEINLTFDSNNVTDMFPEIPFSEELRQQ 390
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PR +W+ T F G + + ASNIIF NG DPW+GGG+ N+S +V+A
Sbjct: 391 YCLDTWGVWPRQDWLQTSFWGGDL-----KAASNIIFSNGDLDPWAGGGIQSNLSTSVIA 445
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ + GAHH+DLR S EDP + +VR+ E +I +W++
Sbjct: 446 VTIQGGAHHLDLRASNSEDPPSVVEVRKLESTLIREWVA 484
>gi|31981425|ref|NP_114031.2| dipeptidyl peptidase 2 precursor [Mus musculus]
gi|341940461|sp|Q9ET22.2|DPP2_MOUSE RecName: Full=Dipeptidyl peptidase 2; AltName: Full=Dipeptidyl
aminopeptidase II; AltName: Full=Dipeptidyl peptidase 7;
AltName: Full=Dipeptidyl peptidase II; Short=DPP II;
AltName: Full=Quiescent cell proline dipeptidase; Flags:
Precursor
gi|20072621|gb|AAH27205.1| Dipeptidylpeptidase 7 [Mus musculus]
gi|148676290|gb|EDL08237.1| dipeptidylpeptidase 7, isoform CRA_d [Mus musculus]
Length = 506
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 181/459 (39%), Positives = 267/459 (58%), Gaps = 29/459 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ Y Q +DHFN+ +TF QR+L++D W PIF YTGNEGDI FA N+
Sbjct: 41 FHENYFEQYMDHFNFESFGNKTFGQRFLVSDKFW--KMGEGPIFFYTGNEGDIWSFANNS 98
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM ++A + +ALLVF EHRYYGKS+P+G + T L+ QALAD+A L+
Sbjct: 99 GFMVELAAQQEALLVFAEHRYYGKSLPFG----VQSTQRGYTQLLTVEQALADFAVLLQA 154
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP++ + Y F
Sbjct: 155 LRQDLGVHDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVAVAGLGDSYQFFRD 214
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + + + F C+ S K+L + +
Sbjct: 215 VTADFYGQSPKCAQAVRDAFQQIKDLFLQ-GAYDTISQNFGTCQSLSSPKDLTQLFGFAR 273
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++FL PLPA PVK C+ + + G + L A + YN SGT
Sbjct: 274 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCQRLLN--EGQRIMG-LRALAGLVYNSSGTE 330
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ +DP G W +QACTE+ + +N +F E + R +
Sbjct: 331 PCYDIYRLYQSCADPTGCGTGSDARAWDYQACTEINLTFDSNNVTDMFPEIPFSEELRQQ 390
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PR +W+ T F G + + ASNIIF NG DPW+GGG+ N+S +V+A
Sbjct: 391 YCLDTWGVWPRQDWLQTSFWGGDL-----KAASNIIFSNGDLDPWAGGGIQSNLSTSVIA 445
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ + GAHH+DLR S EDP + +VR+ E +I +W++
Sbjct: 446 VTIQGGAHHLDLRASNSEDPPSVVEVRKLESTLIREWVA 484
>gi|157279995|ref|NP_001098513.1| dipeptidyl peptidase 2 precursor [Bos taurus]
gi|151556163|gb|AAI49046.1| DPP7 protein [Bos taurus]
Length = 488
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 180/478 (37%), Positives = 271/478 (56%), Gaps = 37/478 (7%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P L + KD + ++ Y Q+LDHFN+ +TF QR+L+ + W ++ PI
Sbjct: 21 PRGLEARAHRPKDPE--FQEAYFEQLLDHFNFERFGNKTFLQRFLMTEKFW--NRGEGPI 76
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
F YTGNEGD+ FA N+GF+ ++A + AL+VF EHRYYGKS+P+G + ++ G
Sbjct: 77 FFYTGNEGDVWSFANNSGFILELAEQQGALVVFAEHRYYGKSLPFG-------ERSTWRG 129
Query: 156 Y---LSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALA 212
Y L+ QALAD+A L+ L++ L A D+P + FGGSYGGML+A+ R+KYPH+ GALA
Sbjct: 130 YTELLTVEQALADFAGLLRALRQELEAPDAPAIAFGGSYGGMLSAYLRIKYPHLVAGALA 189
Query: 213 SSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
+SAP+++ + PY F ++ DF+ S C + ++ +++QI + ++ G + + F
Sbjct: 190 ASAPVVSAAGLGDPYQFFQDVSADFQGQSPECARAVQDAFRQIRDLFQQ-GAPHVVSQEF 248
Query: 273 RICK---SEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN 328
C+ K+L + + AF AM DYP ++F+ LPA PV+ C + + +
Sbjct: 249 GTCQPLSGPKDLTQLFGFARNAFTVLAMMDYPYATDFVGHLPAHPVQVGCSRL---LSES 305
Query: 329 DVFAKLYGAASVYYNYSGTAKCFDL----NGDSDPHGLS------EWGWQACTEMIMLTG 378
A L A + YN SG C+D+ +DP G W +QACTE+ +
Sbjct: 306 SRIAGLRALAGLVYNSSGIEPCYDIYLQYQACADPTGCGLGSDAKAWDYQACTEISLTFS 365
Query: 379 GDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNG 438
+N +F E R +YC + +GV PR +W+ T FGG + ASNIIF NG
Sbjct: 366 SNNVSDLFPELPFTEAQRQQYCLDTWGVWPRQDWLQTSFGGGDLTA-----ASNIIFSNG 420
Query: 439 LRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
DPW+ GG+ N+S +VLA+ GAHH+DLR S +DP + + RR E +IGKW+
Sbjct: 421 DLDPWARGGIQSNLSASVLAITIHGGAHHLDLRGSHPDDPASVVEARRLEAALIGKWV 478
>gi|241161688|ref|XP_002408973.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215494442|gb|EEC04083.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 409
Score = 311 bits (798), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 177/420 (42%), Positives = 241/420 (57%), Gaps = 19/420 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y T Y T +DHF Y + TF+ RYL+ D +W + PIF YTGNEGDIE FA NT
Sbjct: 4 YDTYYFTTKVDHFGY--ANNDTFKMRYLVADQYW--DHDGGPIFFYTGNEGDIEVFANNT 59
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
G M+D AP+FKALL+F EHRYYGKS+PYG KE +++ S GYL+ QALADYA L+
Sbjct: 60 GLMWDWAPEFKALLIFAEHRYYGKSMPYG--KE-SFEGPSRHGYLTVEQALADYADLLTH 116
Query: 173 LKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
K ++ A DS VV FGGSYGGMLAAWFRLKYP G +SAPIL F I + +
Sbjct: 117 FKAHVPAAGDSKVVSFGGSYGGMLAAWFRLKYPPRDHG--VTSAPILQFTGITPCNALNE 174
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS---EKNLAIESWLS 288
++T+ F S+ C I+ S++ + A G + L++ FR+C++ + W
Sbjct: 175 VVTKAFAKESDQCTSAIRTSFEVMRNQAATEEGAKALEEKFRLCQALAPSNYTVLRDWFV 234
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND-VFAKLYGAASVYYNYSGT 347
+ AM +YP + FL P+P PVKE CK ++ T + + +Y A SV+ NY+G
Sbjct: 235 DVYTNLAMVNYPYANEFLGPVPGHPVKEACKFLEKNFTDDQSLLDGIYQAISVFQNYTGQ 294
Query: 348 AKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD 407
C DL + W Q+C EM+M + K +F ++ + C++ +G+
Sbjct: 295 THCNDLTNSAGTLDAGGWDIQSCNEMVMPMCSNGKTDMFFDNPWIIENVTAKCEQKFGLT 354
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHH 467
P + FGG I ASNIIF NG DPWSGGGVLK++S +++AL E AHH
Sbjct: 355 PDVDKAALIFGGKNISA-----ASNIIFSNGDIDPWSGGGVLKSLSDSLIALYMTEAAHH 409
>gi|47206852|emb|CAF90612.1| unnamed protein product [Tetraodon nigroviridis]
Length = 463
Score = 311 bits (797), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 176/448 (39%), Positives = 252/448 (56%), Gaps = 29/448 (6%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P L + L+ Y TQ LDHFN+N TF+QRYL+ D +W + + P+
Sbjct: 24 PRLLQAARPPGLQGAALFTESYFTQTLDHFNFNSYGNGTFRQRYLVADRYW--RRGHGPL 81
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
F YTGNEGDI FA N+GF+ ++A + AL+VF EHRYYGKS+P+G A G
Sbjct: 82 FFYTGNEGDIWDFALNSGFITELAAQQGALVVFAEHRYYGKSLPFGD----ASFQVPEVG 137
Query: 156 YLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
L+ QALADYA LI L++ L AT PV+VFGGSYGGML+ + RL+YP++ GALA+SA
Sbjct: 138 LLTVEQALADYALLISQLREQLAATRCPVIVFGGSYGGMLSVYMRLRYPNLVAGALAASA 197
Query: 216 PILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC 275
P+L+ + P F +T DF+SV C ++G+++Q+ E A+ +QK +C
Sbjct: 198 PVLSTAGLGEPTQFFRDVTADFQSVEPQCTGAVRGAFQQLRELAEDQ-DYGAIQKKLSLC 256
Query: 276 K---SEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF 331
+ S ++++ + L AF AM DYP ++F+ LPA PVK C+ + +++
Sbjct: 257 QRPSSPQDVSQLYGLLRNAFTLMAMLDYPYSTHFMGSLPANPVKVACQTM---LRASELL 313
Query: 332 AKLYGAASVYYNYSGTAKCFDLN----GDSDPHGL----SEWGW--QACTEMIMLTGGDN 381
L AA + YN SG C DL +DP G + W W QACTE+ + +N
Sbjct: 314 TNLRDAAGLVYNASGQLGCLDLYRLYVQCADPTGCGLGPNSWAWDYQACTEVDLCFHSNN 373
Query: 382 KDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRD 441
+F + R YC + + V PRP W+ T+F G + ASNIIF NG D
Sbjct: 374 VTDMFPPMSFGEEQRRAYCSQRWSVLPRPRWLRTQFWGDALST-----ASNIIFSNGDLD 428
Query: 442 PWSGGGVLKNISKTVLALVEKEGAHHVD 469
PW+ GGV K++S +++A+ AHH+D
Sbjct: 429 PWANGGVRKSLSPSLIAINIPGAAHHLD 456
>gi|354507428|ref|XP_003515758.1| PREDICTED: dipeptidyl peptidase 2-like [Cricetulus griseus]
gi|344258871|gb|EGW14975.1| Dipeptidyl-peptidase 2 [Cricetulus griseus]
Length = 506
Score = 311 bits (796), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 181/458 (39%), Positives = 261/458 (56%), Gaps = 29/458 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ Y Q +DHFN+ +TF QR+L++D W K PIF YTGNEGDI FA N+
Sbjct: 41 FRENYFEQYMDHFNFESFGNKTFAQRFLVSDKFWKMGK--GPIFFYTGNEGDIWTFANNS 98
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM ++A + +ALLVF EHRYYGKS+P+G + T L+ QALAD+A L+
Sbjct: 99 GFMVELAAQQEALLVFAEHRYYGKSLPFG----LQSTQRGYTQLLTVEQALADFAVLLQA 154
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L D P + FGGSYGGML+A+ R+KYPH+ GALA+SAP++ + Y F
Sbjct: 155 LRQDLKVQDIPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVAVAGLGESYQFFRD 214
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI + + G + ++K F C+ S K+L + +
Sbjct: 215 VTADFYGQSPKCAQAVRDAFQQIRDLFLQ-GAHDTIRKNFGTCQSLSSSKDLTQLFVFAR 273
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AMTDYP P+ FL LPA PVK C+ + G + L + YN SG
Sbjct: 274 NAFTVLAMTDYPYPTEFLGNLPANPVKVACERM--LSKGQRIMG-LRALVGLVYNSSGME 330
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
CFD+ +DP G W +QACTE+ + +N +F + R
Sbjct: 331 PCFDIYRLYQSCADPTGCGTGSNAKAWDYQACTEINLTFDSNNVTDMFPVIPFSDELRQE 390
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC +GV PRP+W+ T F G + + ASNIIF NG DPW+GGG+ +N+S +++A
Sbjct: 391 YCLHTWGVWPRPDWLRTSFWGGDL-----KAASNIIFSNGDLDPWAGGGIRRNLSTSIIA 445
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ GAHH+DLR S DPQ + +VR+ E +I +W+
Sbjct: 446 VTIHGGAHHLDLRASNSADPQSVVEVRKLEAALIREWV 483
>gi|410990145|ref|XP_004001310.1| PREDICTED: dipeptidyl peptidase 2 [Felis catus]
Length = 549
Score = 308 bits (789), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 174/459 (37%), Positives = 268/459 (58%), Gaps = 29/459 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ Y Q+LDHFN+ +TF QR+L+++ W + PIF YTGNEG++ FA N+
Sbjct: 36 FQEGYFEQLLDHFNFERFGNKTFLQRFLVSEKFW--KRGEGPIFFYTGNEGNVWSFANNS 93
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + AL++F EHRYYGKS+P+G E + + T L+ QALAD+A L+
Sbjct: 94 GFILELAAQQGALVIFAEHRYYGKSLPFG---ERSTQRGHTE-LLTVEQALADFARLLNA 149
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D+P +VFGGSYGGML+A+ R+KYPH+ GALA+SAP++ + Y F
Sbjct: 150 LRRDLGAQDTPAIVFGGSYGGMLSAYMRIKYPHLVAGALAASAPVVAVAGLGDSYQFFRD 209
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
++ DF S C + ++ +++QI++ + G + + + F C+ +K+L + +
Sbjct: 210 VSADFEGQSPKCAQGVRDAFRQIKDLFIQ-GAYDTVSQEFGTCQPLSGQKDLTQLFGFAR 268
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++F+ LPA PVK C D + L A + YN SGT
Sbjct: 269 NAFTVLAMMDYPYPTDFIGHLPANPVKVGC---DRLLSETQRIKGLRALAGLVYNSSGTE 325
Query: 349 KCFDL----NGDSDPHGLSE------WGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ +DP G W +QACTE+ + +N +F + R +
Sbjct: 326 PCYDIYRQYQACADPTGCGSGPDAKAWDYQACTEINLTFSSNNVTDLFPDLPFTEGLRQQ 385
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PR +W+ FG + + ASNIIF NG DPW+GGG+ +N+S +VLA
Sbjct: 386 YCLDTWGVWPRRDWLRISFGAGDL-----KAASNIIFSNGDLDPWAGGGIQRNLSTSVLA 440
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ + GAHH+DLR S DP +++ RR E +IG+W++
Sbjct: 441 VTIRGGAHHLDLRASHPRDPTSVREARRLEARLIGEWVA 479
>gi|395506514|ref|XP_003757577.1| PREDICTED: dipeptidyl peptidase 2 [Sarcophilus harrisii]
Length = 465
Score = 307 bits (787), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 173/462 (37%), Positives = 255/462 (55%), Gaps = 29/462 (6%)
Query: 49 SQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWF 108
SQ ++ +Y QILDHFN+ TF QR+L+ + W K PIF YTGNE D+ F
Sbjct: 14 SQVDFQERYFEQILDHFNFESYGNNTFLQRFLVTEKFW--KKGTGPIFFYTGNEADVWAF 71
Query: 109 AQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYAS 168
A N F+ ++A +AL++F EHRYYGKS+P+G + T L+ QALAD+A
Sbjct: 72 ASNCDFILELASAEEALVIFAEHRYYGKSLPFG----VQSTRKGNTSLLTVEQALADFAV 127
Query: 169 LIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS 228
LI L+K A + PV+ FGGSYGGML+A+ R+KYP++ GALA+SAP+L+ I
Sbjct: 128 LIQALQKEYKAENVPVITFGGSYGGMLSAYMRMKYPNLVAGALAASAPVLSIAGIGDSSQ 187
Query: 229 FSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWL- 287
F +T DF + S C + ++ +++ I++ + +K+ + C + + + L
Sbjct: 188 FFRDVTADFENYSPKCVQGVREAFRLIKDLYLQR-AFDKISQEMGTCTQPSSDSAITQLF 246
Query: 288 ---STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNY 344
AF M DYP P++F+ PA PVK C + K L G ++YN
Sbjct: 247 EFARNAFTMITMMDYPYPTDFMGHFPANPVKVGCDRLLSAKNQIQGLRDLVG---LFYNT 303
Query: 345 SGTAKCFDL----NGDSDPHGL------SEWGWQACTEMIMLTGGDNKDSIFEESEEDYD 394
SGT CFD+ + +DP G W +QACTE+ + +N +F + +
Sbjct: 304 SGTEPCFDIYKLYHKCADPTGCGTGPSAEAWDYQACTEINLTFDSNNVTDMFPKIPFTDE 363
Query: 395 ARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
R +YC +GV PR +W+ T F G + + ASNIIF NG DPW+GGG+ N+S
Sbjct: 364 LREKYCFTRWGVRPRKSWMQTNFWGKNL-----KAASNIIFSNGDLDPWAGGGIRSNLSS 418
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+++AL + GAHH+DLR S EDP + +VR+ E I +W+
Sbjct: 419 SLIALTIQGGAHHLDLRGSNPEDPVSVIEVRKLEAAYIHEWV 460
>gi|449666912|ref|XP_004206448.1| PREDICTED: dipeptidyl peptidase 2-like [Hydra magnipapillata]
Length = 478
Score = 307 bits (786), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 175/459 (38%), Positives = 256/459 (55%), Gaps = 22/459 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQT--FQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQ 110
Y KY Q +DHFN+ Q+ F+QRYLI+D +W SK P+ YTGNEG IE F +
Sbjct: 22 YVEKYFVQFIDHFNFLGQAGANGQFKQRYLISDKYW--SKGKGPVLFYTGNEGSIENFWE 79
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
NTGF++++A K K L++F EHRYYGKS+P+G + + G+L+ QALAD+A+LI
Sbjct: 80 NTGFVFELAQKLKGLVIFGEHRYYGKSLPFGNDS----FTPANIGFLTIDQALADFAALI 135
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
LKK++ A + V FGGSYGGML A+ R KYPH+ G +ASSAP L F
Sbjct: 136 QHLKKSMGADNCSVFAFGGSYGGMLTAYMRYKYPHIVDGGVASSAPFLTIAGKRPRSEFF 195
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKK-PGGLEKLQKAFRICKSEKNL-----AIE 284
+T+ FR NC ++ ++ Q+ + GL++L+K F +C+ + +
Sbjct: 196 QTVTETFRKADSNCPSSVQIAFTQLMDLFNSGKEGLQQLEKVFSLCEGQMTRPFLEKQMI 255
Query: 285 SWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK--AIDDPKTG-NDVFAKLYGAASVY 341
+W AF +M DYP P+ F+ LP PV+ C ++D G + LYG +
Sbjct: 256 AWARNAFTLLSMVDYPYPAKFMADLPGHPVELACSYMQVEDKLAGLAKITDLLYGKPANC 315
Query: 342 YN-YSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
+N Y C D G W +QACTEMI+ G +N +F + + R YC
Sbjct: 316 HNLYEEYVSCSDPTGCGTGPDNPPWDYQACTEMILPGGSNNITDMFPHLDFTLEMRQHYC 375
Query: 401 KEAYGVD-PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLAL 459
+ +G+ R NW+ T++ G L + AS IIF NG DPW GGVL+++S +++A+
Sbjct: 376 SKRWGLGYSRLNWLATQYWG---SLNDIKKASRIIFPNGDLDPWHTGGVLEDLSDSLIAI 432
Query: 460 VEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ + GAHH+DLR S DP+ + D R + EII W+ +
Sbjct: 433 MVEGGAHHLDLRGSNPADPKSVIDARNKITEIITGWMHE 471
>gi|73967473|ref|XP_848703.1| PREDICTED: dipeptidyl peptidase 2 isoform 1 [Canis lupus
familiaris]
Length = 497
Score = 306 bits (785), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 181/457 (39%), Positives = 271/457 (59%), Gaps = 35/457 (7%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y Q+LDHFN+ +TFQQR+L+++ W + PIF YTGNEG++ FA N+GF+
Sbjct: 40 YFEQLLDHFNFERFGNKTFQQRFLVSEKFW--KRGKGPIFFYTGNEGNVWSFANNSGFIL 97
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY---LSSTQALADYASLIIDL 173
++A + +AL++F EHRYYGKS+P+G + ++ GY L+ QALAD+A L++ L
Sbjct: 98 ELAAQQEALVIFAEHRYYGKSLPFG-------EQSTRRGYTELLTVEQALADFARLLLAL 150
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
+++L A DSP + FGGSYGGML+A+ R+KYPH+ GALA+SAP++ + Y F +
Sbjct: 151 RRDLGAQDSPAIAFGGSYGGMLSAYMRIKYPHLVAGALAASAPVVAVAGLGDSYQFFRDV 210
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLST 289
+ DF S C + ++ +++QI++ + G + + + F C+ S K+L + +
Sbjct: 211 SADFEGQSPKCAQGVRDAFQQIQDLCFQ-GACDVVSREFGTCQPLSSRKDLTQLFGFARN 269
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAK 349
AF AM DYP P++F+ LPA PVK C D + + L A + YN SGT
Sbjct: 270 AFTVLAMMDYPYPTHFIAHLPANPVKVGC---DRLLSESQSIKGLRALAGLVYNSSGTVP 326
Query: 350 CFDL----NGDSDPHGLSE------WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
C+D+ +DP G W +QACTE+ + +N +F E R +Y
Sbjct: 327 CYDIYLQYQACADPTGCGSGPNAKAWDYQACTEINLTFSSNNVTDLFPELPFTDALRQQY 386
Query: 400 CKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLAL 459
C + +GV PR +W+ T FGG + R ASNI+F NG DPW+GGG+ N+S TVLA+
Sbjct: 387 CLDTWGVWPRRDWLQTSFGGDDL-----RGASNILFSNGDLDPWAGGGIRSNLSATVLAI 441
Query: 460 VEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ GAHH+DLR S EDP +++ RR E +IG+W+
Sbjct: 442 TIQGGAHHLDLRASHPEDPASVREARRFEARLIGEWV 478
>gi|326438061|gb|EGD83631.1| hypothetical protein PTSG_04239 [Salpingoeca sp. ATCC 50818]
Length = 528
Score = 306 bits (785), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 186/489 (38%), Positives = 257/489 (52%), Gaps = 54/489 (11%)
Query: 56 KYHTQILDHFNY-NPQSYQ-TFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+ TQ +DHFN+ P + + T++QRY I D + S PIF Y GNE D+ + NTG
Sbjct: 50 NFFTQNIDHFNWAKPLNDKFTYRQRYFICDQYADLSNPKTPIFFYFGNEDDVTLYVNNTG 109
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
M++ A +KALLVF EHRYYGKS P+ +L++ QA+ADYA+LI DL
Sbjct: 110 LMWENAASYKALLVFAEHRYYGKSKPFPAGTP------GCMNWLTTEQAMADYATLIRDL 163
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS---FS 230
K++L T +PV+ FGGSYGGMLAA+FR KYP + G +A SAPI F + Y F+
Sbjct: 164 KQDLNLTPAPVIGFGGSYGGMLAAYFRRKYPDIVDGVIAGSAPIWAFSGLTPAYDYYGFN 223
Query: 231 NIITQDFRS---VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAI 283
NII D S S++C K +I A G L + R+CK + I
Sbjct: 224 NIIADDASSKGGASDHCRNNFKAIQPRIMAIASTQHGRHMLSQQLRLCKPLASDQDAYNI 283
Query: 284 ESWLSTAFVYTAMTDYPTPSNFL----NPLPAFPVKEMCKAIDDPK-TGNDV--FAKLYG 336
W A+ Y AM D+P S ++ LP +PV+E CK + DP+ ND + L
Sbjct: 284 LLWAQNAWAYMAMGDFPYASGYIVHGRGKLPPYPVREACKPLSDPQLPANDTKFISALRD 343
Query: 337 AASVYYNYSGTAKCFDLNGDSD-------PHGL----------------SEWGWQACTEM 373
A VYYNY+ T CFDL + PH L +WG+Q CTEM
Sbjct: 344 AMDVYYNYTHTEPCFDLFPATSIPRLGHHPHHLLSRPRPAAAVAAAQCTGDWGYQFCTEM 403
Query: 374 IMLTG-GDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASN 432
+M + G KD + D + + C++ +GV PRP W + V SN
Sbjct: 404 VMPSSQGGPKDMFWPALPFDLNETIKQCQQQWGVTPRPLWAPLNLASKDLTDV-----SN 458
Query: 433 IIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEII 492
++ NG DPW GGV+ N+S +V+A+V + GAHH+DL FS DP + RR E++ I
Sbjct: 459 MVLSNGGLDPWRAGGVVTNVSDSVVAVVIESGAHHIDLMFSDPADPPDVIAARRLELQHI 518
Query: 493 GKWISQYFQ 501
+WI+Q+ Q
Sbjct: 519 SRWINQHNQ 527
>gi|391328440|ref|XP_003738697.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Metaseiulus
occidentalis]
Length = 459
Score = 305 bits (780), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 177/451 (39%), Positives = 255/451 (56%), Gaps = 21/451 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+T + +DHF++ +F+ R L +D ++ S+ P+F YTGNEGDIE F NT
Sbjct: 20 YQTFWFETKIDHFSFARN--DSFKMRVLYSDKYFDSSEP-GPVFFYTGNEGDIETFTNNT 76
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
G M+D A FKALL+F EHR+YGKS+P+G YK GYL++ QALAD+A LI
Sbjct: 77 GLMWDWAADFKALLIFAEHRFYGKSMPFGDKSYDTYKQ---YGYLTAEQALADFADLIQH 133
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+K N VV FGGSYGGML+AW R+KYP + A+A+SAPIL F ++ + F I
Sbjct: 134 VKNNWPV--KKVVAFGGSYGGMLSAWMRIKYPWLIDAAIAASAPILQFQDVTACGVFDKI 191
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKK-PGGLEKLQKAFRICKS---EKNLAIESWLS 288
+T+ F SE C I+ SW +E+ K G +++ FRIC++ A+ WL
Sbjct: 192 VTKAFAKASERCADNIRRSWIALEKLGKDGENGSALIRENFRICQNVLPSNYTAVRDWLH 251
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID-DPKTGNDVFAKLYGAASVYYNYSGT 347
+ AM +YP +NFL +P PV+ C +D D ++ +Y A +V++N+SG
Sbjct: 252 DTYGNLAMINYPYGTNFLKKVPGHPVQVSCSFLDKDFHCDAELLKGVYQAINVFHNFSGD 311
Query: 348 AKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
+C D+ G+S +S+ GW Q C EM+M GD ++ +F ++ + C++ YG
Sbjct: 312 TQCNDV-GNSGGDNISDAGWNIQTCNEMVMPFCGDGQEDMFYPYSWNFTQFRKDCEKKYG 370
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
+ P N FGG I ASNI+F NG DPW GGGVLK ++ T+ ++ + GA
Sbjct: 371 MTPDLNIARRMFGGRDISA-----ASNIVFSNGDLDPWCGGGVLKQLNPTLPVVIIEGGA 425
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
HH DLR ++ D + R E E I WI
Sbjct: 426 HHYDLRSASPLDTPAVISARNVEKEYIKLWI 456
>gi|255579501|ref|XP_002530593.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
gi|223529841|gb|EEF31773.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
Length = 327
Score = 304 bits (779), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 158/346 (45%), Positives = 205/346 (59%), Gaps = 28/346 (8%)
Query: 157 LSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP 216
+ QALAD+A LI DLK+NLTA D PVV+FGGSYGGMLAAW RLKYPH+AIGALA+SAP
Sbjct: 1 MKPEQALADFAVLITDLKRNLTAEDCPVVLFGGSYGGMLAAWMRLKYPHIAIGALAASAP 60
Query: 217 ILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
IL F+N+V F +I++ DF+ S C+ IK SW I K GL KL + F +C
Sbjct: 61 ILQFENVVPHEIFYDIVSNDFKRESSRCFNTIKESWNAIASEGLKENGLVKLSRTFHMCS 120
Query: 277 SEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLY 335
+ + WL +A+ Y AM DYP P+ F+ PLP P++E+ K + V+ L
Sbjct: 121 DLNSTDELADWLESAYSYLAMVDYPYPAEFMMPLPGHPIREVTFYFSTSKLSHIVYPCL- 179
Query: 336 GAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
H L W C+EM+M +S+F + +Y +
Sbjct: 180 ------------------------HILDFRNW--CSEMVMPMASSKYESMFPTYDFNYTS 213
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
+ C + + V RP WI TEFGG I L++F SNIIF NGL DPWSGG VL+NIS T
Sbjct: 214 FEKQCWDDFRVVSRPRWIMTEFGGQDIKTSLEKFGSNIIFSNGLLDPWSGGSVLQNISDT 273
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
V+ALV +EGAHH+DLR ST EDP WL + R EV++I WI+ + Q
Sbjct: 274 VVALVTEEGAHHIDLRPSTPEDPDWLVEQRATEVKLIEGWINDHNQ 319
>gi|440795690|gb|ELR16807.1| lysosomal ProX carboxypeptidase [Acanthamoeba castellanii str.
Neff]
Length = 489
Score = 304 bits (778), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 170/465 (36%), Positives = 258/465 (55%), Gaps = 26/465 (5%)
Query: 52 LYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNA---PIFVYTGNEGDIEWF 108
+Y+T Y Q LDHFN+ + T++QR+L+ D +W GS PIF YTGNE + +
Sbjct: 38 VYRTLYFDQTLDHFNFATKP-ATYKQRFLMADDYWRGSYPGGCPGPIFFYTGNESPVTDY 96
Query: 109 AQNTGFMYDV-APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
GF V APK ALLVF EHRY+G+S+P+G + YLS QALADYA
Sbjct: 97 YAGAGFFTQVLAPKHNALLVFAEHRYFGESMPFGSKS----FDPEKISYLSPEQALADYA 152
Query: 168 SLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSP 226
LI LK+ L A + PV FGGSYGG+L AWFR KYP + +G L++SAP+ + +SP
Sbjct: 153 VLITHLKETLPHAKNCPVFAFGGSYGGILTAWFRSKYPDIVMGGLSASAPLAFYGTGISP 212
Query: 227 YSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL----A 282
Y+F++ + F C ++ ++ +++ + G + AF++C + A
Sbjct: 213 YAFTDSASDTFAQARPGCAPLVSRAFDVLQKLSATSDGRARFSAAFKLCSPLNSQADAEA 272
Query: 283 IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN--DVFAKLYG-AAS 339
+ +W+ + + AM DYP +N+ LP +PV C + + T N DV A+ + A
Sbjct: 273 VINWVDSGLIGMAMLDYPFATNYGISLPGWPVNRTCDRLLEKATSNDDDVLAEAFAYAIG 332
Query: 340 VYYNYSGTAKCFDLNGDSDPHG-LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
V+YN +G C+D+N D G W + CTE+ + +G IF + + +
Sbjct: 333 VFYNNTGAHTCYDINRDVPDWGKCCGWDYLHCTEVYIPSG---SSGIFPRAAYNLTSDIA 389
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
C++ +GV RPNW +FGG + +SNIIF NGL DPW GVL ++S +++A
Sbjct: 390 QCQQQFGVTLRPNWARIQFGGFNL-----TSSSNIIFSNGLLDPWHTSGVLHSLSDSLVA 444
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
+V E AHH+DL + EDP +++ R +E +I KW+++Y+ L
Sbjct: 445 IVIPEAAHHLDLWAPSPEDPVYVQRAREQEAMLIEKWLNEYWAKL 489
>gi|281206058|gb|EFA80247.1| peptidase S28 family protein [Polysphondylium pallidum PN500]
Length = 481
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 179/462 (38%), Positives = 254/462 (54%), Gaps = 31/462 (6%)
Query: 53 YKTKYHTQILDHFN-YNPQSYQTFQQRYLINDTHWG---GSKNNAPIFVYTGNEGDIEWF 108
Y+ Y+ Q LDHFN YN S F QRYL++DT+W G +P+ YTGNEGDI WF
Sbjct: 29 YEEFYYMQTLDHFNFYNKGS---FPQRYLVSDTYWTRPTGPICESPVLFYTGNEGDIVWF 85
Query: 109 AQNTGFMYDV-APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
+N+ F+ +V A + ALLVF EHR+YG+++P+G + + GYL+S QALADYA
Sbjct: 86 YENSQFVTNVLAKEMGALLVFAEHRFYGETMPFGNSSSLP----ENIGYLTSEQALADYA 141
Query: 168 SLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI-VSP 226
LI + +L + PV+ GGSYGGMLA+WFR+KYP++ GALA+SAPIL F +
Sbjct: 142 QLIPAVLSDLGGSHCPVLAVGGSYGGMLASWFRMKYPNIIDGALAASAPILYFLGTNANS 201
Query: 227 YSFSNIITQDFRSVSE--NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-I 283
F+ I T DF S C I ++ +I + + GL L K F +C+ L+ +
Sbjct: 202 EGFNEIATIDFAETSSEGTCATRIHSAFNEITQMSNTQNGLNVLTKTFSLCEELTELSDL 261
Query: 284 ESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYN 343
+WL A Y AM DYP P+NFL P+P P+ C + D L +VY+N
Sbjct: 262 INWLEAAITYMAMADYPYPANFLEPMPGNPINVSCSLL---AKETDNIQGLVQVMNVYFN 318
Query: 344 YSGTA-KCFDLNG-DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
SG A +C +++ + G W +QACTEM+M + F + +YC+
Sbjct: 319 SSGQAGQCNNVSVYTTGALGSDGWDYQACTEMVMPISANGVQDFFPAAPWSLSQLTQYCQ 378
Query: 402 EAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN-ISKTVLALV 460
+ + V P PNWITT + G + SNIIF NG+ DPW GGV+ N ++ ++
Sbjct: 379 QTWQVTPNPNWITTYYQGQDLSQT-----SNIIFSNGVLDPWRAGGVVSNDGGNDIITII 433
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
GAHH+DLR DP + + R E K++ Q+ Q+
Sbjct: 434 IDGGAHHLDLRMPNSADPTAVTNARVLET----KYLMQFAQE 471
>gi|322792044|gb|EFZ16143.1| hypothetical protein SINV_09817 [Solenopsis invicta]
Length = 446
Score = 303 bits (776), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 178/462 (38%), Positives = 258/462 (55%), Gaps = 41/462 (8%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF++ Q+ TF+ RYLINDT +++N+PIF YTGNEG+IE FA+NTGF++++AP
Sbjct: 2 VDHFSFAVQN--TFKLRYLINDTFTRKTEDNSPIFFYTGNEGNIEVFAENTGFIWEIAPS 59
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATD 181
F AL+VF EHRYYG+S+PYG ++ + GYL+S QALADY LI LK
Sbjct: 60 FDALVVFAEHRYYGESLPYGNR---SFADPQHLGYLTSEQALADYVDLIHHLKSQPEYKL 116
Query: 182 SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS 241
SPV+VFGGSYGGML+AW RLKYPHV G +L+ D+ + +I D
Sbjct: 117 SPVIVFGGSYGGMLSAWMRLKYPHVVQGY------VLSLDDSNTLSCLMRVIRSDNNFGL 170
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAIESWLSTAFVYTAMT 297
C + ++ ++ + G + L +++C+ E ++ +L + AM
Sbjct: 171 SRCESNVSAAYPEVVAHNNEHEGKKWLSDNWKLCEPLKTGENVTTLKDFLQEVLINLAMV 230
Query: 298 DYPTPSNFLNPLPAFP---------------VKEMCKAIDDPK-TGNDVFAKLYGAASVY 341
DYP +NFL PLPA P V+ C+ + + G + + LY A SVY
Sbjct: 231 DYPYEANFLAPLPANPINLQILKSMISIFTIVQAFCQHLTNASLMGKPLLSALYRALSVY 290
Query: 342 YNYSGTAKCFDL-NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
NY+G A C + + S W +QACTEM+M D + +FE +E + +A C
Sbjct: 291 TNYTGKASCTSIKSAQSGLDADQGWDYQACTEMVMPICFDGVNDMFEPTEWNINAYNSTC 350
Query: 401 KEAYGVDPRPNWITTEFG--GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS-KTVL 457
+ Y + +P I E+G H G ASNIIF NGL DPW+GGGV+ NIS ++V+
Sbjct: 351 FKKYSISSQPYQICKEYGCSAHFPG------ASNIIFSNGLLDPWTGGGVVANISAESVI 404
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
++V + AHH+DLR + +DP + R+ I KWI ++
Sbjct: 405 SIVMPDAAHHLDLRETNPKDPYDVTLARKFHQFSIKKWIKEH 446
>gi|307213053|gb|EFN88584.1| Lysosomal Pro-X carboxypeptidase [Harpegnathos saltator]
Length = 404
Score = 300 bits (769), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 157/393 (39%), Positives = 228/393 (58%), Gaps = 14/393 (3%)
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+++ A +F AL+VF EHRYYG+S+P+G NK A + GYL+S Q LADY LI
Sbjct: 19 GFLWETAQQFGALVVFAEHRYYGESLPFG-NKSFA--DPQHLGYLTSQQVLADYVELIQY 75
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+ SPV++FGGSYGGML+AW R+KYPH+ GA+A+SAPIL F IV +F+ I
Sbjct: 76 LRSKPGYKRSPVILFGGSYGGMLSAWMRMKYPHIVQGAIAASAPILQFTGIVKCEAFARI 135
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAIESWLS 288
+T DFR+ + C K+I+ SW I E G + L +++C+ +E ++ +L
Sbjct: 136 VTSDFRASNPTCAKLIRQSWNTITEVTSNDEGKKWLSDNWKLCEPLKTAEHVKTLKDFLQ 195
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLYGAASVYYNYSGT 347
+ AM DYP +NFL PLP P+ C+ + + TG + L+GA S+Y NY+G
Sbjct: 196 EVYTDLAMVDYPYETNFLTPLPGNPINVFCRHLTNSSLTGKPLLLALHGAVSIYTNYTGK 255
Query: 348 AKCFDL-NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
A C N + W +QACTEM+M D + +FE S+ D+ C + Y V
Sbjct: 256 ATCISTENAEPGLDAAQGWDYQACTEMVMPMCPDGVNDMFEPSKWDFKDYNNTCFKKYSV 315
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
P+P+ + ++G + V +NI F NGL DPW+ GGVL+N+S + +A++ + AH
Sbjct: 316 TPQPHRVCQQYGCENLSTV-----TNINFSNGLLDPWTSGGVLRNLSSSAIAILIPDAAH 370
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
H+DLR S DP + R+ I KWI +Y
Sbjct: 371 HLDLRESNSNDPYSVILTRKFHRYSIKKWIDEY 403
>gi|441623431|ref|XP_004088909.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 2 [Nomascus
leucogenys]
Length = 485
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 168/458 (36%), Positives = 254/458 (55%), Gaps = 36/458 (7%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ ++ Q LDHFN+ +TF QR+L++D W ++ PIF YTGNEGD+ FA N+
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFW--TQGKGPIFFYTGNEGDVWAFANNS 88
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + ALL+F EH + QALAD+A L+
Sbjct: 89 GFVAELAAEQGALLIFAEH-----------VGARQGRGRGXXXXXXVEQALADFAELLRA 137
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP+L + F
Sbjct: 138 LRRDLGXQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRD 197
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ ++++I++ + G + ++ F C+ EK+L + +
Sbjct: 198 VTADFEGQSPKCTQGVREAFRRIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFRFAR 256
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AF AM DYP P++FL PLPA PVK C D + L A + YN SG+
Sbjct: 257 NAFTVLAMMDYPYPTDFLGPLPANPVKVGC---DRLLSEAQRITGLRALAGLVYNASGSQ 313
Query: 349 KCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C+D+ + +DP G W +QACTE+ + +N +F + R +
Sbjct: 314 HCYDIYRLYHSCADPTGCGTGPNARAWDYQACTEINLTFASNNVTDMFPNLPFTEELRQQ 373
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
YC + +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+ +N+S +V+A
Sbjct: 374 YCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIA 428
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + GAHH+DLR S EDP + + R+ E IIG+W+
Sbjct: 429 VTIQGGAHHLDLRASHPEDPVSVVEARKLEATIIGEWV 466
>gi|224072899|ref|XP_002190034.1| PREDICTED: dipeptidyl peptidase 2 [Taeniopygia guttata]
Length = 468
Score = 297 bits (761), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 166/420 (39%), Positives = 235/420 (55%), Gaps = 25/420 (5%)
Query: 90 KNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
K PIF YTGNEGDI FAQN+ F++++A + +AL++F EHRYYGKS+P+G +
Sbjct: 40 KGFGPIFFYTGNEGDIWTFAQNSDFIFELAEEQQALVIFAEHRYYGKSLPFG----LEST 95
Query: 150 NASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIG 209
T L+ QALADYA LI +LK+ A D PV+ FGGSYGGML+A+ R+KYP+V G
Sbjct: 96 QLKKTALLTVEQALADYAVLITELKQQFGAADCPVIAFGGSYGGMLSAYLRMKYPNVVAG 155
Query: 210 ALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLE--- 266
ALA+SAP+L+ + P F +T DF+ S C ++ +++QI++ E
Sbjct: 156 ALAASAPLLSVAGLGDPTQFFRDVTADFQKSSLGCVTAVRKAFQQIKDLCLSGAYDEISS 215
Query: 267 KLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKT 326
K+ +I E + + AF AM DYP ++F+ LPA PVK C+ I K
Sbjct: 216 KMATCNKISNKEDVYQLFGFARNAFTMMAMMDYPYKTDFMGDLPANPVKVGCEQIIAHK- 274
Query: 327 GNDVFAKLYGAASVYYNYSGTAKCFDL----NGDSDPHGL------SEWGWQACTEMIML 376
D L V+YN SG A+C+++ +DP G W +Q CTE+ +
Sbjct: 275 --DPIEGLTALVGVFYNSSGLAQCYNIYQLYQSCADPTGCGTGSDAEAWDYQVCTEINLT 332
Query: 377 TGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFF 436
+N +F E R +YC + V PR +W+ T F G + + ASNIIF
Sbjct: 333 FDSNNVTDMFPEMPFTEAMREQYCWNKWHVRPRAHWLQTNFWGEDL-----KSASNIIFS 387
Query: 437 NGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
NG DPW+GGG+ ++S +++AL K GAHH+DLR S DP + +VRR E II W+
Sbjct: 388 NGDLDPWAGGGINSSLSPSLIALTIKGGAHHLDLRGSNPADPPSVTEVRRLEAGIISSWV 447
>gi|118099097|ref|XP_415570.2| PREDICTED: dipeptidyl peptidase 2 [Gallus gallus]
Length = 495
Score = 295 bits (756), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 171/453 (37%), Positives = 247/453 (54%), Gaps = 33/453 (7%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q+ DHF + +TF QRYL++ W K PIF YTGNEG+I FA+N+ F++++A
Sbjct: 39 QVRDHFRFEAGGNETFPQRYLLSAKFW--KKGFGPIFFYTGNEGNIWTFAENSDFIFELA 96
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
+ +AL++F EHRYYGKS+P+G + T L+ QALADYA LI +LK+ A
Sbjct: 97 EQQQALVIFAEHRYYGKSLPFG----LESTQLKNTHLLTVEQALADYAVLITELKQQYGA 152
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRS 239
PV+ FGGSYGGML+A+ R+KYP+V GALA+SAP+L+ + P F +T DF+
Sbjct: 153 AGCPVIAFGGSYGGMLSAYMRMKYPNVVDGALAASAPVLSVAGLGDPTQFFRDVTADFQK 212
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLE---KLQKAFRICKSEKNLAIESWLSTAFVYTAM 296
C ++ +++QI + E K+ +I E + + AF AM
Sbjct: 213 SIPGCVPAVQRAFQQIRDLFLSGAYDEISSKMATCSKISSKEDLYQLFGFARNAFTMIAM 272
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAI---DDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL 353
DYP ++F+ LPA PVK C+ I DP G A L G V+YN SG+A+C+D+
Sbjct: 273 MDYPYKTDFMGHLPANPVKVGCEQILAHTDPIQG---LAALVG---VFYNSSGSAQCYDV 326
Query: 354 ----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEA 403
+DP G W +Q CTE+ + +N +F E R +YC
Sbjct: 327 YRLYRPCADPTGCGTGADAEAWDYQVCTEINLTFNSNNVTDMFPEMPFTEAMREQYCWSR 386
Query: 404 YGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKE 463
+ V PR W+ F G + + ASNIIF NG DPW+GGG+ ++S ++ A+ +
Sbjct: 387 WRVRPRAQWLRINFWGGDL-----KSASNIIFSNGDLDPWAGGGINSSLSPSLTAVTIQG 441
Query: 464 GAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
GAHH+DLR DP + + R+ E II W+
Sbjct: 442 GAHHLDLRGHNPADPPSVIEARKLEASIISNWV 474
>gi|403352351|gb|EJY75686.1| Lysosomal Pro-X carboxypeptidase [Oxytricha trifallax]
Length = 503
Score = 295 bits (755), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 187/491 (38%), Positives = 268/491 (54%), Gaps = 37/491 (7%)
Query: 39 LSSLISSSKDSQGLYKTKYHTQI-LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFV 97
L ++++ L +T Y ++ LDHF S TF +YL + +W + PIF
Sbjct: 14 LGLIVAAPPGGYPLKRTNYTFEVPLDHFASGGNS-PTFNIKYLADAQYW--NPMEGPIFF 70
Query: 98 YTGNEGDIEWFAQNTGFMYDV-APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY 156
Y GNEG +E F N+GF+ DV AP+ +AL++F EHRY+G S P+ +K++A + +
Sbjct: 71 YAGNEGKVEGFWDNSGFLTDVLAPQHQALIIFGEHRYFGDSFPF--DKKVAL-DKDHNKW 127
Query: 157 LSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP 216
L+ QA+ DY LI +++ A+D PVVVFGGSYGGMLA+W R+KYP GA ASSAP
Sbjct: 128 LTVEQAMMDYVLLIKEIRYIYGASDKPVVVFGGSYGGMLASWLRMKYPATFQGAYASSAP 187
Query: 217 ILNF-DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAK-KPGGLEKLQKAFRI 274
IL F D+ V +F +IITQDF + ++NC +IK +W + + + +P L+ F
Sbjct: 188 ILYFKDSGVPQSAFGDIITQDFYAANQNCPSIIKEAWGYLMDIKENRPTDYPALKTIFNT 247
Query: 275 CKSEKNLA----IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD-----PK 325
C A + ++L F Y AMTDYP ++FLNP+PA PV C + D PK
Sbjct: 248 CTDITVKADVDNLYTYLMNGFSYMAMTDYPYETSFLNPMPANPVNAACTKLKDIPYPAPK 307
Query: 326 TGNDVFAKLYG-----------AASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQ--ACTE 372
+ L A+SVY+NY G C D++ L GW AC +
Sbjct: 308 SEKSTVGALSARETLVLQGVQDASSVYFNYKGQTPCNDISNSDATGQLDGAGWDVLACNQ 367
Query: 373 MIMLTGGDNKDSIFEESEE-DYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFAS 431
+ M T + KDS+F ++ D A C + YG+ PR W+ FGG I + +
Sbjct: 368 LAMPT-TNGKDSMFLVNDPFDEKAFNADCMQKYGLTPRYGWVWDTFGGQNIQKDFQAH-T 425
Query: 432 NIIFFNGLRDPWSGGGVLKNIS--KTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREV 489
NIIF NG DPWS GGV NI+ T+ +++ + AHH++LR DP +K VR
Sbjct: 426 NIIFTNGNLDPWSAGGVTANITGNPTINSILLEGSAHHLELRAPNDADPDDVKKVRGTIS 485
Query: 490 EIIGKWISQYF 500
+ IGKW+S+YF
Sbjct: 486 DTIGKWVSEYF 496
>gi|302832479|ref|XP_002947804.1| hypothetical protein VOLCADRAFT_41600 [Volvox carteri f.
nagariensis]
gi|300267152|gb|EFJ51337.1| hypothetical protein VOLCADRAFT_41600 [Volvox carteri f.
nagariensis]
Length = 451
Score = 295 bits (754), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 170/468 (36%), Positives = 257/468 (54%), Gaps = 42/468 (8%)
Query: 54 KTKYHTQILDHFNYNPQSYQT-FQQRYLINDTHW--GGSKNNAPIFVYTGNEGDIEWFAQ 110
+ + TQ LDHF+++P T FQQRY + D W G + PIF Y GNE D+ +
Sbjct: 1 QESWFTQRLDHFHHHPGGNDTTFQQRYFLCDKFWSRGPGGSRGPIFFYAGNEADVTLYVN 60
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
TG +++ A +F AL++F EHRYYGK+ P G + +S YLS QALADY+ LI
Sbjct: 61 ATGLIWEHAEEFGALVLFAEHRYYGKTQPLGPDSW-----SSDPTYLSVEQALADYSVLI 115
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS--PYS 228
++ + DSPV+ FGGSYGGMLAAW RLKYPH+ GA+A+SAP+ F + P
Sbjct: 116 WNITRTTGGEDSPVIAFGGSYGGMLAAWLRLKYPHLVTGAVAASAPVGAFPGVPGWQPSK 175
Query: 229 FSNIITQDFRSVS---ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNL 281
F ++T D + + C ++ ++ + + G L + R+CK + +
Sbjct: 176 FWEVVTYDATASAGAVPECSSNVRAAFGHVMALGRTATGRAALGRLLRLCKPIEDEDAAV 235
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFL-----NPLPAFPVKEMCKAIDDPKTGNDVFAK--- 333
+ WL AF AM +YP PS+++ +PLPA+P++ C + G + A
Sbjct: 236 GVAYWLQGAFDAFAMGNYPYPSSYISDDPAHPLPAWPMRAACTRL----AGRGLRASDLA 291
Query: 334 LYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIML-----TGGDNKDSIFEE 388
L AA V YN +G +C+ + G W +Q CTE++ T G + D +++
Sbjct: 292 LRDAAGVLYNVTGRVQCYTVETSGPAAG--PWDYQWCTELMAQLPYYPTNGIS-DMFWDQ 348
Query: 389 SEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV 448
D +A ++C+ +GV PRP W +GG + R+ASNI+F NGL DPWS GV
Sbjct: 349 GPFDLEAINQHCEAMWGVRPRPFWSAITYGG-----LDYRYASNIVFSNGLYDPWSAYGV 403
Query: 449 LKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
L +IS +V+A++ EGAHH+DL +S DP ++ R+ E++ + W+
Sbjct: 404 LTDISDSVVAVIIPEGAHHLDLMYSHPSDPPSVRAARQIEMDYVRNWV 451
>gi|324507364|gb|ADY43126.1| Prolyl carboxy peptidase like protein 5 [Ascaris suum]
Length = 534
Score = 294 bits (753), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 188/467 (40%), Positives = 255/467 (54%), Gaps = 51/467 (10%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF++ +TF RYLIN ++ PIF YTGNEG+IE FA NTGFM+D+A +
Sbjct: 55 IDHFSFADN--RTFHLRYLINTDYF---IKYGPIFFYTGNEGNIEGFASNTGFMWDIAAE 109
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL--TA 179
F A +VF EHRYYGK+ P+G +Y + S GYLSS QALADYA LI L+ A
Sbjct: 110 FGAAIVFAEHRYYGKTHPFGNE---SYASVSNLGYLSSEQALADYAHLIQYLRNERLKNA 166
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS-FSNIITQDFR 238
+S V+ FGGSYGGMLAAW R+KYPH+ GA+A+SAP+ F P F NI+ + F
Sbjct: 167 INSTVIAFGGSYGGMLAAWIRIKYPHLVEGAIAASAPVFWFPQTNVPEDIFDNIVKRSF- 225
Query: 239 SVSENC-YKVIKGSWKQIEETAKKPGG------LEKLQKAFRICKSEKNLAIESWLSTAF 291
V+ C I +W IEE A G L KL++ + KSE +++++ +F
Sbjct: 226 -VNSGCKADAIIAAWSAIEELANSEQGRTYLNSLFKLEEKSFLQKSEDVNFLKAFIRESF 284
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAID--DPKTGNDVFAKLYGAASVYYNYSGTAK 349
AM +YP PS FL PLP +PVK C + + KT LY ++YYN++G K
Sbjct: 285 ESMAMVNYPYPSEFLAPLPGWPVKVACGFFNSTEMKTREHHAESLYSMVNLYYNFTGEKK 344
Query: 350 CFDLNGD----------SDPHGLSEWGWQACTEMIM--LTGGDNKDSIFEESEEDYDARA 397
+N D DP G W WQACTEM+M G D +++
Sbjct: 345 TLCVNPDVCSDSAYGALGDPLG---WPWQACTEMVMQLCASGPPNDFFWKDCPFTVKGVI 401
Query: 398 RYCKEAYGVD------PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG--VL 449
C++ +G RP+W +G H ASNIIF NG DPWSGGG +
Sbjct: 402 EGCEKQFGKIGYTKQLTRPDWAILNYGSH------YPCASNIIFSNGYLDPWSGGGWSLK 455
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
++++++ ++GAHH DLR S +D + +K+ RR E IGKWI
Sbjct: 456 PQTVGSLVSIIIEDGAHHYDLRGSHPKDTEAVKEARRLERIYIGKWI 502
>gi|301103554|ref|XP_002900863.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
gi|262101618|gb|EEY59670.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
Length = 542
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 176/467 (37%), Positives = 256/467 (54%), Gaps = 38/467 (8%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
K+ TQ LDHF+ +YQ QRY + D + + +F Y GNE D+E + +TG M
Sbjct: 85 KFLTQTLDHFDVGAPTYQ---QRYFVCDKQF---RPGGVMFFYVGNEADVELYLNHTGLM 138
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
++ A +F A+LVF EHRY+GKS+P+G + K YLS+ QALAD+A LI LK
Sbjct: 139 WENADEFGAMLVFAEHRYFGKSVPFGKDVTKHMK------YLSTEQALADFAVLITYLKT 192
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY---SFSNI 232
D PV+ FGGSYGGML +W R+KYPH+ G +A SAPIL+F P SF I
Sbjct: 193 EWK-LDIPVIGFGGSYGGMLGSWLRMKYPHIIDGVIAGSAPILSFLGDEVPLDKGSFERI 251
Query: 233 ITQDFRS---VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA------I 283
+T D + NC I+ +W +++ G ++L++A +C S K + +
Sbjct: 252 VTFDASEEAGSAPNCVPNIRRTWPAMKKLGDTEDGRKQLKRALSLCDSVKLESRKDVDEV 311
Query: 284 ESWLSTAFVYTAMTDYPTPSNFL----NPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAA- 338
W +AF Y AM +YP PS+++ + LPA+PV+ C + D +D A L A
Sbjct: 312 MDWAKSAFDYMAMGNYPYPSSYIMNGVSVLPAYPVRVACSFVADEFAPDDEVALLSAFAK 371
Query: 339 --SVYYNYSGTAKCFDLNGDSDPHGLSE--WGWQACTEMIMLTGGDNKDSIFEESEEDYD 394
VYYN + +C++LN S+ L W + C E+ D + +F ++
Sbjct: 372 SLGVYYNSTKHQECYELNAASNESALDSDFWDYIFCAEIYQPQNVDGVNDMFWSIPWNFT 431
Query: 395 ARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
A CK +GV+ RP W TT++GG K + ASNI+F NG DPWSG GVL+N S
Sbjct: 432 ADNENCKREWGVEIRPLWATTQYGGRKA----LKAASNIVFSNGNYDPWSGTGVLQNYSD 487
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
+V+AL + GAHH+DL FS + D + VR E + + KW ++++
Sbjct: 488 SVVALSVEGGAHHLDLMFSNELDTASVLAVREAEKQHMHKWAREFYE 534
>gi|148674771|gb|EDL06718.1| prolylcarboxypeptidase (angiotensinase C) [Mus musculus]
Length = 437
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 170/457 (37%), Positives = 244/457 (53%), Gaps = 72/457 (15%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q +DHF + +TF+QRYL+ D HW +N I YTGNEGDI WF NT
Sbjct: 46 YSVLYFEQKVDHFGF--ADMRTFKQRYLVADKHW--QRNGGSILFYTGNEGDIVWFCNNT 101
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + KA+LVF EHRYYG+S+P+G + ++K++ +L+S QALAD+A LI
Sbjct: 102 GFMWDVAEELKAMLVFAEHRYYGESLPFGQD---SFKDSQHLNFLTSEQALADFAELIRH 158
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
L+K + A PV+ GGSYG G LA+
Sbjct: 159 LEKTIPGAQGQPVIAIGGSYG----------------GMLAA------------------ 184
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLS 288
W +++ GGL+ L +C SEK ++ W++
Sbjct: 185 --------------------WFRMKYPHIVVGGLQSLTNILHLCSPLTSEKIPTLKGWIA 224
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGT 347
+V AM +YP NFL PLPA+P+KE+C+ + +P + V + ++ A SVYYNYSG
Sbjct: 225 ETWVNLAMVNYPYACNFLQPLPAWPIKEVCQYLKNPNVSDTVLLQNIFQALSVYYNYSGQ 284
Query: 348 AKCFDLNGDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
A C +++ + G W +QACTEM+M + D +FE D + + C +GV
Sbjct: 285 AACLNISQTTTSSLGSMGWSFQACTEMVMPFCTNGIDDMFEPFLWDLEKYSNDCFNQWGV 344
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
PRP+W+TT +GG I SNIIF NG DPWSGGGV ++I+ T++A+ +GAH
Sbjct: 345 KPRPHWMTTMYGGKNIS-----SHSNIIFSNGELDPWSGGGVTRDITDTLVAINIHDGAH 399
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
H+DLR DP + R EV+ + KWI ++ ++
Sbjct: 400 HLDLRAHNAFDPSSVLLSRLLEVKHMKKWILDFYSNI 436
>gi|256084265|ref|XP_002578351.1| family S28 unassigned peptidase (S28 family) [Schistosoma mansoni]
gi|360045230|emb|CCD82778.1| family S28 unassigned peptidase (S28 family) [Schistosoma mansoni]
Length = 472
Score = 293 bits (749), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 180/459 (39%), Positives = 248/459 (54%), Gaps = 33/459 (7%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K Y Q LDHFN+ ++ TF+QRYL D + K N PIF Y GNEG I+ F NTG
Sbjct: 29 KENYFDQTLDHFNFQARNL-TFKQRYLYEDKWF---KPNGPIFFYCGNEGGIDGFWNNTG 84
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
++++AP F A ++F EHRYYGKS+P+ + + Y YLS QALADYA LI +
Sbjct: 85 LIFELAPSFNAFVLFAEHRYYGKSLPFNTSFQQPY-----IQYLSIDQALADYAYLIEGI 139
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K T S VV FGGSYGGMLAA+ R KYPH+ GALASSAP+ + + F +
Sbjct: 140 KSTFNMTRSLVVAFGGSYGGMLAAYMRAKYPHIIKGALASSAPVRWVAGEGNFHDFFESV 199
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPG-GLEKLQKAFRICKSEKN----LAIESWLS 288
T+D+R C + IK ++ + ++KP G ++L R+CK +N + W
Sbjct: 200 TKDYRDADPKCSEKIKNAFNLAVQLSQKPDIGYKQLSNDLRLCKPIQNDFEFYWVLKWAR 259
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AFV AM DYP ++F+ LPA PV CK + D+ L A V+YN S +
Sbjct: 260 NAFVMMAMLDYPYKASFMASLPANPVNVSCK---NALAVTDLIPTLREAVGVFYNSSQSL 316
Query: 349 KCFDLN------GDSDPHGLSE----WGWQACTEMIMLTGGDN-KDSIFEESEEDYDARA 397
CFD D GL W +Q+CTEM + D+ + +F
Sbjct: 317 PCFDYKTQFIECADITGCGLGSDSLAWDFQSCTEMNLHDDSDSTTNDMFVSLPLTKQQVT 376
Query: 398 RYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL 457
YC+ +GV P N ++T +G + + + +SNIIF NG DPW GGG+L + S+ V+
Sbjct: 377 SYCQRRWGVTPAFNQLSTFYGDN-----IWKTSSNIIFSNGNLDPWMGGGILTDQSEKVI 431
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+LV GAHH+DLR DP + VR+ EV+ I W+
Sbjct: 432 SLVLDGGAHHLDLRSPDPNDPPSARQVRQIEVQTIRSWL 470
>gi|226472758|emb|CAX71065.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 472
Score = 293 bits (749), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 178/459 (38%), Positives = 250/459 (54%), Gaps = 33/459 (7%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K Y Q LDHF++ ++ TF+QRYL D + K N PIF Y GNEG+I F NTG
Sbjct: 29 KENYFDQTLDHFSFQARNL-TFKQRYLYEDKWF---KPNGPIFFYCGNEGEIGGFWNNTG 84
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
++++AP F A ++F EHRYYGKS+P+ + + Y YLS QALADYA LI +
Sbjct: 85 LVFELAPSFNAFILFAEHRYYGKSLPFDKSFQQPY-----IQYLSIGQALADYAYLIEGI 139
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K T SPVV FGGSYGGMLAA+ R KYPH+ GALA+SAP+ + + F +
Sbjct: 140 KSKFNMTRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAV 199
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPG-GLEKLQKAFRICKSEKN----LAIESWLS 288
T+D+ C + IK ++ + ++KP G ++L + R+C+ +N I W
Sbjct: 200 TKDYHDADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWILKWAR 259
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AFV AM DYP ++F+ LP PV CK + + D + L A V+YN S +
Sbjct: 260 NAFVMMAMLDYPYKASFMASLPPNPVNVSCK---NALSAIDPISTLREAVGVFYNSSQSL 316
Query: 349 KCFDLN------GDSDPHGLSE----WGWQACTEMIMLTGGDNKDS-IFEESEEDYDARA 397
CFD D GL W +Q+CTEM + D+ S +F
Sbjct: 317 MCFDYKTQFVECADITGCGLGNDSLAWDFQSCTEMNLHDDSDSTTSDMFTSLPLTKQQVT 376
Query: 398 RYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL 457
YC++ +GV P N ++T FG + + + ASNIIF NG DPW GGG+L + S+ V+
Sbjct: 377 SYCQQKWGVTPAFNQLSTFFGDY-----IWKTASNIIFSNGNLDPWMGGGILTDQSEKVI 431
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+L+ GAHH+DLR DP + +R+ EV+ I W+
Sbjct: 432 SLMLDGGAHHLDLRSPDPNDPPSARQIRQIEVQTIRSWL 470
>gi|226472752|emb|CAX71062.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
gi|226472756|emb|CAX71064.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 480
Score = 291 bits (746), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 177/459 (38%), Positives = 249/459 (54%), Gaps = 33/459 (7%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K Y Q LDHF++ ++ TF+QRYL D + K N PIF Y GNEG+I F NTG
Sbjct: 37 KENYFNQTLDHFSFQARNL-TFKQRYLYEDKWF---KPNGPIFFYCGNEGEIGGFWNNTG 92
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
++++AP F A ++F EHRYYGKS+P+ + + Y YLS QALADYA LI +
Sbjct: 93 LVFELAPSFNAFILFAEHRYYGKSLPFDKSFQQPY-----IQYLSIGQALADYAYLIEGI 147
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K T SPVV FGGSYGGMLAA+ R KYPH+ GALA+SAP+ + + F +
Sbjct: 148 KNKFNMTRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAV 207
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPG-GLEKLQKAFRICKSEKN----LAIESWLS 288
T+D+ C + IK ++ + ++KP G ++L + R+C+ +N + W
Sbjct: 208 TKDYHDADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWMLKWAR 267
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AFV AM DYP ++F+ LP PV CK + + D L A V+YN S +
Sbjct: 268 NAFVMMAMLDYPYKASFMASLPPNPVNVSCK---NALSAIDPIPTLREAVGVFYNSSQSL 324
Query: 349 KCFDLN------GDSDPHGLSE----WGWQACTEMIMLTGGDNKDS-IFEESEEDYDARA 397
CFD D GL W +Q+CTEM + D+ S +F
Sbjct: 325 MCFDYKTQFIECADITGCGLGNDSLAWDFQSCTEMNLHDDSDSTTSDMFTSLPLTKQQVT 384
Query: 398 RYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL 457
YC++ +GV P N ++T FG + + + ASNIIF NG DPW GGG+L + S+ V+
Sbjct: 385 SYCQQKWGVTPAFNQLSTFFGDY-----IWKTASNIIFSNGNLDPWMGGGILTDQSEKVI 439
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+L+ GAHH+DLR DP + +R+ EV+ I W+
Sbjct: 440 SLMLDGGAHHLDLRSPDPNDPPSARQIRQIEVQTIRSWL 478
>gi|412993304|emb|CCO16837.1| predicted protein [Bathycoccus prasinos]
Length = 563
Score = 291 bits (746), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 179/479 (37%), Positives = 267/479 (55%), Gaps = 45/479 (9%)
Query: 54 KTKYHTQILDHFNYNPQSYQT--------FQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
+ ++ Q L HF++N + + F+ RY + + + ++PIF+YTGNE ++
Sbjct: 78 EERWIQQPLSHFSWNSEEEEERGGEGGGEFKTRYFVCSEFY---RKDSPIFLYTGNEANV 134
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
E + +NTG M++ A F ALLVF EHRYYGKS P + E N +T +L+S +ALAD
Sbjct: 135 ESYLENTGLMWENAEHFNALLVFAEHRYYGKSSPMSDDDE-EDTNKNTLKHLNSMEALAD 193
Query: 166 YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDN--- 222
YASL+ +L++ + V+ FGGSYGGMLA+W R+KYPHV GA+A+SAPI FD
Sbjct: 194 YASLVRELREEYEDAVA-VIAFGGSYGGMLASWMRMKYPHVVDGAIAASAPIYAFDGEDP 252
Query: 223 IVSPYSFS--NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGG--LEKLQKAFRICKS- 277
V P +F+ + T C K I+ ++ + ++ + L+ L+ FR C
Sbjct: 253 PVDPNAFARGSTYTAMVSGHGAECPKRIQDAFTLLIDSGDESDKIYLDVLKHTFRACDDI 312
Query: 278 EKNLAIESWLSTAFVYTAMTDYPTPSNFL----NPLPAFPVKEMCK--AIDDPKTGND-- 329
E + W +A Y AM DYP S ++ LPA+P+K +C +DDP N
Sbjct: 313 ESPYEVAEWAQSALDYIAMGDYPVESGYMLSGKGTLPAWPMKVVCNEMMVDDPNASNSTS 372
Query: 330 --VFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE-----WGWQACTEMIM---LTGG 379
+ L A S+YYN + T +CF + GD P+ ++ WG+Q C+EM M TGG
Sbjct: 373 LALLENLREAVSIYYNATKTEQCFTI-GDPSPNDDTKATEDLWGYQYCSEMFMPMETTGG 431
Query: 380 DNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGL 439
+N ++ S + RYC++AY V PRP + +GG K+ ++ FASNI+F NG+
Sbjct: 432 EN--DMYWLSSWNETNEFRYCRDAYDVQPRPYFAQETYGGRKM---VENFASNIVFSNGM 486
Query: 440 RDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
DPW GVL+ + V+ + EGAHH DL FS+K DP +K R EV+ I +W+ +
Sbjct: 487 LDPWHLLGVLETSNPRVVLVKIDEGAHHNDLMFSSKNDPTSVKRARLLEVKEIQRWVDE 545
>gi|56756895|gb|AAW26619.1| SJCHGC02147 protein [Schistosoma japonicum]
Length = 472
Score = 291 bits (745), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 177/459 (38%), Positives = 249/459 (54%), Gaps = 33/459 (7%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K Y Q LDHF++ ++ TF+QRYL D + K N PIF Y GNEG+I F NTG
Sbjct: 29 KENYFDQTLDHFSFQARNL-TFKQRYLYEDKWF---KPNGPIFFYCGNEGEIGGFWNNTG 84
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
++++AP F A ++F EHRYYGKS+P+ + + Y YLS QALADYA LI +
Sbjct: 85 LVFELAPSFNAFILFAEHRYYGKSLPFDKSFQQPY-----IQYLSIGQALADYAYLIEGI 139
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K T SPVV FGGSYGGMLAA+ R KYPH+ GALA+SAP+ + + F +
Sbjct: 140 KSKFNMTRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAV 199
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPG-GLEKLQKAFRICKSEKN----LAIESWLS 288
T+D+ C + IK ++ + ++KP G ++L + R+C+ +N + W
Sbjct: 200 TKDYHDADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIQNDFEFYWMLKWAR 259
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AFV AM DYP ++F+ LP PV CK + + D L A V+YN S +
Sbjct: 260 NAFVMMAMLDYPYKASFMASLPPNPVNVSCK---NALSAIDPIPTLREAVGVFYNSSQSL 316
Query: 349 KCFDLN------GDSDPHGLSE----WGWQACTEMIMLTGGDNKDS-IFEESEEDYDARA 397
CFD D GL W +Q+CTEM + D+ S +F
Sbjct: 317 MCFDYKTQFIECADITGCGLGNDSLAWDFQSCTEMNLHDDSDSTTSDMFTSLPLTKQQVT 376
Query: 398 RYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL 457
YC++ +GV P N ++T FG + + + ASNIIF NG DPW GGG+L + S+ V+
Sbjct: 377 SYCQQKWGVTPAFNQLSTFFGDY-----IWKTASNIIFSNGNLDPWMGGGILTDQSEKVI 431
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+L+ GAHH+DLR DP + +R+ EV+ I W+
Sbjct: 432 SLMLDGGAHHLDLRSPDPNDPPSARQIRQIEVQTIRSWL 470
>gi|226472754|emb|CAX71063.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 472
Score = 291 bits (745), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 177/459 (38%), Positives = 249/459 (54%), Gaps = 33/459 (7%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K Y Q LDHF++ ++ TF+QRYL D + K N PIF Y GNEG+I F NTG
Sbjct: 29 KENYFNQTLDHFSFQARNL-TFKQRYLYEDKWF---KPNGPIFFYCGNEGEIGGFWNNTG 84
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
++++AP F A ++F EHRYYGKS+P+ + + Y YLS QALADYA LI +
Sbjct: 85 LVFELAPSFNAFILFAEHRYYGKSLPFDKSFQQPY-----IQYLSIGQALADYAYLIEGI 139
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K T SPVV FGGSYGGMLAA+ R KYPH+ GALA+SAP+ + + F +
Sbjct: 140 KNKFNMTRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAV 199
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPG-GLEKLQKAFRICKSEKN----LAIESWLS 288
T+D+ C + IK ++ + ++KP G ++L + R+C+ +N + W
Sbjct: 200 TKDYHDADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWMLKWAR 259
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AFV AM DYP ++F+ LP PV CK + + D L A V+YN S +
Sbjct: 260 NAFVMMAMLDYPYKASFMASLPPNPVNVSCK---NALSAIDPIPTLREAVGVFYNSSQSL 316
Query: 349 KCFDLN------GDSDPHGLSE----WGWQACTEMIMLTGGDNKDS-IFEESEEDYDARA 397
CFD D GL W +Q+CTEM + D+ S +F
Sbjct: 317 MCFDYKTQFIECADITGCGLGNDSLAWDFQSCTEMNLHDDSDSTTSDMFTSLPLTKQQVT 376
Query: 398 RYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL 457
YC++ +GV P N ++T FG + + + ASNIIF NG DPW GGG+L + S+ V+
Sbjct: 377 SYCQQKWGVTPAFNQLSTFFGDY-----IWKTASNIIFSNGNLDPWMGGGILTDQSEKVI 431
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+L+ GAHH+DLR DP + +R+ EV+ I W+
Sbjct: 432 SLMLDGGAHHLDLRSPDPNDPPSARQIRQIEVQTIRSWL 470
>gi|226472750|emb|CAX71061.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 480
Score = 291 bits (745), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 177/459 (38%), Positives = 249/459 (54%), Gaps = 33/459 (7%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K Y Q LDHF++ ++ TF+QRYL D + K N PIF Y GNEG+I F NTG
Sbjct: 37 KENYFDQTLDHFSFQARNL-TFKQRYLYEDKWF---KPNGPIFFYCGNEGEIGGFWNNTG 92
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
++++AP F A ++F EHRYYGKS+P+ + + Y YLS QALADYA LI +
Sbjct: 93 LVFELAPSFNAFILFAEHRYYGKSLPFDKSFQQPY-----IQYLSIGQALADYAYLIEGI 147
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K T SPVV FGGSYGGMLAA+ R KYPH+ GALA+SAP+ + + F +
Sbjct: 148 KSKFNMTRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAV 207
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPG-GLEKLQKAFRICKSEKN----LAIESWLS 288
T+D+ C + IK ++ + ++KP G ++L + R+C+ +N + W
Sbjct: 208 TKDYHDADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWMLKWAR 267
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AFV AM DYP ++F+ LP PV CK + + D L A V+YN S +
Sbjct: 268 NAFVMMAMLDYPYKASFMASLPPNPVNVSCK---NALSAIDPIPTLREAVGVFYNSSQSL 324
Query: 349 KCFDLN------GDSDPHGLSE----WGWQACTEMIMLTGGDNKDS-IFEESEEDYDARA 397
CFD D GL W +Q+CTEM + D+ S +F
Sbjct: 325 MCFDYKTQFIECADITGCGLGNDSLAWDFQSCTEMNLHDDSDSTTSDMFTSLPLTKQQVT 384
Query: 398 RYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL 457
YC++ +GV P N ++T FG + + + ASNIIF NG DPW GGG+L + S+ V+
Sbjct: 385 SYCRQKWGVTPAFNQLSTFFGDY-----IWKTASNIIFSNGNLDPWMGGGILTDQSEKVI 439
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+L+ GAHH+DLR DP + +R+ EV+ I W+
Sbjct: 440 SLMLDGGAHHLDLRSPDPNDPPSARQIRQIEVQTIRSWL 478
>gi|226472746|emb|CAX71059.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
gi|226472748|emb|CAX71060.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
gi|226472760|emb|CAX71066.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 480
Score = 291 bits (745), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 177/459 (38%), Positives = 249/459 (54%), Gaps = 33/459 (7%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K Y Q LDHF++ ++ TF+QRYL D + K N PIF Y GNEG+I F NTG
Sbjct: 37 KENYFDQTLDHFSFQARNL-TFKQRYLYEDKWF---KPNGPIFFYCGNEGEIGGFWNNTG 92
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
++++AP F A ++F EHRYYGKS+P+ + + Y YLS QALADYA LI +
Sbjct: 93 LVFELAPSFNAFILFAEHRYYGKSLPFDKSFQQPY-----IQYLSIGQALADYAYLIEGI 147
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K T SPVV FGGSYGGMLAA+ R KYPH+ GALA+SAP+ + + F +
Sbjct: 148 KSKFNMTRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAV 207
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPG-GLEKLQKAFRICKSEKN----LAIESWLS 288
T+D+ C + IK ++ + ++KP G ++L + R+C+ +N + W
Sbjct: 208 TKDYHDADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWMLKWAR 267
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
AFV AM DYP ++F+ LP PV CK + + D L A V+YN S +
Sbjct: 268 NAFVMMAMLDYPYKASFMASLPPNPVNVSCK---NALSAIDPIPTLREAVGVFYNSSQSL 324
Query: 349 KCFDLN------GDSDPHGLSE----WGWQACTEMIMLTGGDNKDS-IFEESEEDYDARA 397
CFD D GL W +Q+CTEM + D+ S +F
Sbjct: 325 MCFDYKTQFIECADITGCGLGNDSLAWDFQSCTEMNLHDDSDSTTSDMFTSLPLTKQQVT 384
Query: 398 RYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL 457
YC++ +GV P N ++T FG + + + ASNIIF NG DPW GGG+L + S+ V+
Sbjct: 385 SYCQQKWGVTPAFNQLSTFFGDY-----IWKTASNIIFSNGNLDPWMGGGILTDQSEKVI 439
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+L+ GAHH+DLR DP + +R+ EV+ I W+
Sbjct: 440 SLMLDGGAHHLDLRSPDPNDPPSARQIRQIEVQTIRSWL 478
>gi|440790409|gb|ELR11692.1| prolylcarboxypeptidase isoform 2 preproprotein [Acanthamoeba
castellanii str. Neff]
Length = 506
Score = 290 bits (743), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 172/470 (36%), Positives = 251/470 (53%), Gaps = 43/470 (9%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNA---PIFVYTGNEGDIEWFA 109
Y+T Y Q LDHFN+ Q T++QR+L+ D +W GS PIF YTGNE + +
Sbjct: 60 YRTLYFDQTLDHFNFATQP-ATYKQRFLLADEYWRGSYPGGCPGPIFFYTGNEAPVTDYY 118
Query: 110 QNTGFMYDV-APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYAS 168
+GF V APK ALLVF E S+P+G + YLS QALADYA
Sbjct: 119 SASGFFTQVLAPKHNALLVFAE------SMPFGSKS----FDPEKISYLSPEQALADYAV 168
Query: 169 LIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
LI LK+ L A + PV FGGSYGG+L AWFR+KYP + +G LA+SAP+ + +SPY
Sbjct: 169 LITHLKETLPHARNCPVFAFGGSYGGILTAWFRMKYPDIVMGGLAASAPLSFYGTGISPY 228
Query: 228 SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC----KSEKNLAI 283
+F+N + F C +I +++ ++ + P G E+ KAF++C + A+
Sbjct: 229 AFTNSASDTFAQARLGCAPLIAQAFETLQRFSATPEGCERFAKAFKLCGPLNSQAEASAV 288
Query: 284 ESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN---DVFAKLYG-AAS 339
W+ AM DYP SN+ LPA+PV + C I + N D+ A+ G A
Sbjct: 289 VYWVEMGLASMAMLDYPFASNYGVSLPAWPVNKTCDRILEKAANNNDPDILAEALGYAIG 348
Query: 340 VYYNYSGTAKCFDLNGDSDPHGLSEW----GWQ--ACTEMIMLTGGDNKDSIFEESEEDY 393
V+YN +G C+D+ D + EW GW CTE+ + G F + +
Sbjct: 349 VFYNNTGDHSCYDIKRD-----VPEWEKCCGWDYLHCTEVYIPIG---FSGFFPHATYNL 400
Query: 394 DARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS 453
A C++ +G+ RPNW ++GG I +SNIIF NGL DPW GVL ++S
Sbjct: 401 TADIEQCRQKFGITLRPNWARIQYGGFNI-----TSSSNIIFSNGLLDPWHSSGVLHSLS 455
Query: 454 KTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
++++++ E HH+DL + EDP +++ R +E +I KW+ +Y+ L
Sbjct: 456 DSLISIMIPEAGHHLDLWAPSPEDPIYIQRAREQEAVLIDKWLKEYWAKL 505
>gi|224058953|ref|XP_002299661.1| predicted protein [Populus trichocarpa]
gi|222846919|gb|EEE84466.1| predicted protein [Populus trichocarpa]
Length = 437
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 156/370 (42%), Positives = 216/370 (58%), Gaps = 33/370 (8%)
Query: 17 TLTISNAKIFPTFP-----SSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQS 71
T + AK T P R+ ++ S D +G ++T ++ Q LDHFNY P+S
Sbjct: 86 TTATATAKRLNTIPRHSPIGPRVWRDQPDKTTSCEVDEEG-FETCFYNQTLDHFNYRPES 144
Query: 72 YQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEH 131
Y TF QRY+IN +WGG+ NA I VY G E I+ + GF+ D A +FK+LLVFIEH
Sbjct: 145 YDTFPQRYVINSKYWGGA--NASILVYLGAEASIDGYRDAAGFLDDNAVQFKSLLVFIEH 202
Query: 132 RYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSY 191
RYYG S P G GY SS QALADYA++IID+K+N +A SPV+V GGSY
Sbjct: 203 RYYGHSFPPGA--------WGKRGYFSSAQALADYAAIIIDIKENRSAQYSPVIVIGGSY 254
Query: 192 GGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGS 251
GGMLA+WFRLKYPH+A+GALASSAPIL FD+I ++ +++T++FR SE CY+ IK S
Sbjct: 255 GGMLASWFRLKYPHIALGALASSAPILYFDDITPQDAYYSVVTKEFREASETCYQTIKTS 314
Query: 252 WKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLP 310
W +I+E A KP GL L F CK+ + + ++ +L + Y A + P P
Sbjct: 315 WSEIDELASKPDGLSMLSTKFNTCKNLTDASELKDYLRLMYAYAAQNNSP---------P 365
Query: 311 AFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDS---DPHGLSEWGW 367
+PV E+CK IDD +G+D+ ++++ YY G C+ N ++ +W W
Sbjct: 366 TYPVNEVCKGIDDDASGDDILSRIFRGVVAYY---GNRTCY-FNNNAYAYQSEATLDWSW 421
Query: 368 QACTEMIMLT 377
Q +T
Sbjct: 422 QYSNHYFCIT 431
>gi|428182777|gb|EKX51637.1| hypothetical protein GUITHDRAFT_157202 [Guillardia theta CCMP2712]
Length = 481
Score = 288 bits (738), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 185/473 (39%), Positives = 262/473 (55%), Gaps = 47/473 (9%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLI-NDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ + Q+LDHF++ S +QQRY + +T + A IF Y GNEG++E + +NT
Sbjct: 4 REAWFEQVLDHFSWRNDS--RWQQRYYVCQETEQQLANPAATIFFYCGNEGNVEMYIRNT 61
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
G M++ A F A+L+F EHRYYGKS+P+G + +A++ YLS QALADYA L+ D
Sbjct: 62 GLMFENAKSFSAMLIFAEHRYYGKSLPFGNDF-----SAASLRYLSHEQALADYAVLLDD 116
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI-VSPY---S 228
K+ + V+ FGGSYGGML+AWFR+KYPH+ GA+A+SAP+L+F + P+
Sbjct: 117 FKRKHKMVRAKVIAFGGSYGGMLSAWFRMKYPHIVEGAVAASAPVLSFHSSDKGPWRSEK 176
Query: 229 FSNIITQDFRSVS---ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK-----SEKN 280
+ I+T+D + E C +++ SW I+ G E L FR+C+ E N
Sbjct: 177 YWEIVTRDASGAAGSDERCVPLVRQSWPIIDSMGASESGRESLAALFRLCEPLASPGEVN 236
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLN----PLPAFPVKEMCKAID---DPKTGNDVFAK 333
++ +++ AF AM +YP PS++L LP +PV+E CK + D K N +
Sbjct: 237 -DLKLFIAMAFDTMAMGNYPFPSDYLTGGIGKLPPWPVREACKLLSSCGDCKAEN-LLDS 294
Query: 334 LYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMI---MLTGGDNKDSIFEESE 390
L A S+ YN SG C L DS G+ W +Q CTEM+ D K +F
Sbjct: 295 LCSAISLLYNASGDQACLHLPEDSSYAGI--WDFQWCTEMLPQETYFKRDGKRDMFFPFS 352
Query: 391 EDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
++CK YGV PR WI +GG + +KR ASNIIF NG DPW+ GGV
Sbjct: 353 ISSKEIDQHCKSKYGVIPRRGWIEQLYGGLE---GIKR-ASNIIFSNGEFDPWAAGGV-- 406
Query: 451 NISKT-------VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
N+S+ V A+ +EGAHH+DL FS DP +K R+ EV I KW+
Sbjct: 407 NVSEVKDAAARGVEAVWIEEGAHHLDLFFSHPADPPSVKKARQMEVACISKWL 459
>gi|449671875|ref|XP_002154818.2| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Hydra
magnipapillata]
Length = 460
Score = 288 bits (737), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 176/465 (37%), Positives = 257/465 (55%), Gaps = 52/465 (11%)
Query: 46 SKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
+K+S Y T + Q LDHF++ + YQ F QRYLIND ++ K AP+F YTGNEGDI
Sbjct: 29 AKESGYYYTTHWFPQTLDHFSFRSEDYQ-FAQRYLINDDYF---KPGAPVFFYTGNEGDI 84
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGN--KEIAYKNASTTGYLS---ST 160
WF NTGFM+D+A +F A+LVF EHRYYG+S+P+G + K A+ + G++ T
Sbjct: 85 TWFCNNTGFMWDIAEEFSAMLVFAEHRYYGESMPFGSDSYKVKAFVDGG-GGFIKLGIGT 143
Query: 161 QALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
+A Y S I + N + GMLAAWFR+KYP +GA++SSAPIL F
Sbjct: 144 IDVASYFSDDITTRSNYSE-------------GMLAAWFRMKYPASVVGAISSSAPILAF 190
Query: 221 DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN 280
++ + + NC+ + ++ T G K+ + F++CK K
Sbjct: 191 VDMND--------CELYYLKFYNCF-----GYAVLDYT-----GRLKISELFKLCKPLKT 232
Query: 281 L----AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLY 335
+ +W S +V AM +YP P+NFL LPA+P+ E+CK + + TG+++ L
Sbjct: 233 FDDVYNLNNWFSEVWVNLAMVNYPYPANFLEDLPAWPINEVCKHLQNSNATGDELIRNLV 292
Query: 336 GAASVYYNYSGTAKCFDLNGD-SDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYD 394
A +VY+N++G + C ++ S G W +QACTEM M D ++ + D+D
Sbjct: 293 NAVNVYFNFTGQSSCLNIEQQASGSLGDQGWDFQACTEMAMPLCQDGIRDMWLPYKYDFD 352
Query: 395 ARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
A CK+ +GV R W +++GG + ASNI+F NG DPWSG GVLK+ S
Sbjct: 353 DFATSCKQKWGVTTRKYWSQSQYGGFNL-----NGASNIVFSNGKLDPWSGYGVLKSQSP 407
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
T+ ++ +GAHH+DLR S DPQ + D R I WI++Y
Sbjct: 408 TIKVVMIDDGAHHLDLRKSNSLDPQSVIDARNIHKSNIHSWINEY 452
>gi|405950118|gb|EKC18122.1| Lysosomal Pro-X carboxypeptidase [Crassostrea gigas]
Length = 576
Score = 287 bits (734), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 150/329 (45%), Positives = 206/329 (62%), Gaps = 11/329 (3%)
Query: 156 YLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
+L+S QALADYA LI +K ++ ++ S V+ FGGSYGGMLAAWFR+KYP+V G+LA+S
Sbjct: 17 FLTSEQALADYAVLIKHIKSSIPGSSQSKVIAFGGSYGGMLAAWFRMKYPNVVQGSLAAS 76
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
API F +F +T FR S C IK WK + TA + GGL KL + F +
Sbjct: 77 APIWTFRKDADCDAFDRTVTGTFRKSSSTCVDNIKALWKTLNTTASQTGGLAKLSEMFHL 136
Query: 275 CKSEKN----LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV 330
CK K+ +++W+ +A VY AM DYP PS FL PLPA+PVKE C+ I P G+++
Sbjct: 137 CKPLKSADDVTTLKNWIVSALVYLAMVDYPYPSKFLAPLPAWPVKETCRPILTPLNGDNL 196
Query: 331 FAKLYGAASVYYNYSGTAKCFDL-NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEES 389
+ A +V+YNY+G+ CFD+ +GD G+S W +Q+CTEM+ + + K +FE+S
Sbjct: 197 IIGMAKAMNVFYNYTGSTSCFDIGSGDIPNLGISGWDYQSCTEMVAPSCSNGKTDMFEKS 256
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
D+ C + + V P NWI T++ G + ASNIIF NGL DPWS GGVL
Sbjct: 257 AWDFKEYTNGCLKNWKVKPDINWIETQYWGKNLSA-----ASNIIFSNGLLDPWSSGGVL 311
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDP 478
K+ S +V+A++ GAHH+DLR S K DP
Sbjct: 312 KSQSDSVVAILIPNGAHHLDLRGSNKADP 340
>gi|167533602|ref|XP_001748480.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772999|gb|EDQ86644.1| predicted protein [Monosiga brevicollis MX1]
Length = 508
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 171/481 (35%), Positives = 249/481 (51%), Gaps = 55/481 (11%)
Query: 60 QILDHFNYNP---QSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Q +DHFN+ Q+ TFQQRY + D ++ + +FVY GNE DI + +TG M+
Sbjct: 34 QTIDHFNWGAPLGQAQTTFQQRYFVYDKYY--KPGSGALFVYFGNEDDITLYINHTGLMW 91
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
+ A F A L+FIEHRYYGKS P+ A +L+S QA+ADYA L+ K
Sbjct: 92 ENAKDFGAYLIFIEHRYYGKSQPFSPG------TAGCMNWLTSEQAMADYAVLLRWFKAT 145
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS---FSNII 233
D P + FGGSYGGMLAAWFR K+P V G +++SAPI F N+ Y F+ I+
Sbjct: 146 HQMEDVPTIGFGGSYGGMLAAWFRRKFPDVVDGVISASAPIWAFANLTPAYDDDGFAQIV 205
Query: 234 TQDFRSVSENCYKV---IKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL----AIESW 286
T D S K K I +TA GL L FR+C ++L ++ W
Sbjct: 206 TNDATPASGAAAACAANFKQGQKLIIDTASSAAGLANLTSIFRLCNPLQSLNDAYSLLYW 265
Query: 287 LSTAFVYTAMTDYPTPSNF----LNPLPAFPVKEMCKAIDD---PKTGNDVFAKLYGAAS 339
+ + Y AM ++P PS++ L LPA+PV+ C+++ D P D+ + A
Sbjct: 266 VQEPWSYMAMGNFPYPSSYLLHGLGMLPAWPVRVACESLADSSLPDQPPDLLDAMRAALD 325
Query: 340 VYYNYSGTAKCFDLNGDSDPHGL---------------------SEWGWQACTEMIM-LT 377
+YYNY+ C+DL+ + L +W +Q CTEM+M T
Sbjct: 326 IYYNYTHAETCYDLSDAPETATLMRPRKAFLRQQGTLGGPEACTGDWDYQYCTEMVMPST 385
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFN 437
G +KD + + D + C+ +GV PR NW TT + + +N++F N
Sbjct: 386 QGTDKDMFWPLEKFDLASLTASCQSTWGVKPRQNWATTYLASKDLTDL-----TNVVFSN 440
Query: 438 GLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G DPW GGV++N+S +V++++ GAHH+DL FST EDP+ + R EV + +W+
Sbjct: 441 GHYDPWRAGGVVQNVSDSVVSIIIPSGAHHIDLMFSTPEDPEDVTVARAFEVSHMRRWVD 500
Query: 498 Q 498
Q
Sbjct: 501 Q 501
>gi|308482020|ref|XP_003103214.1| CRE-PCP-5 protein [Caenorhabditis remanei]
gi|308260319|gb|EFP04272.1| CRE-PCP-5 protein [Caenorhabditis remanei]
Length = 507
Score = 286 bits (732), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 188/518 (36%), Positives = 281/518 (54%), Gaps = 47/518 (9%)
Query: 6 IFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHF 65
I +S +L ++T + + P ++ P+K D + Y+ Y +DHF
Sbjct: 3 ILISLAILLATTHAFTRLQD----PVTQKGPQKFEK-----ADGKYKYEEGYLKAPIDHF 53
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
++ Y+ F RY +N ++ ++ PI YTGNEG +E FA+NTGFM+D+AP+ KA
Sbjct: 54 SFT-NDYE-FDLRYFLNTDNY---ESGGPILFYTGNEGSLEAFAENTGFMWDLAPELKAA 108
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT---ATDS 182
+VF+EHR+YGKS P+ K +Y + GYLSS QALAD+A L + +N A +S
Sbjct: 109 VVFVEHRFYGKSQPF---KNQSYTDIRNLGYLSSQQALADFA-LSVQFFRNEKIKGAKNS 164
Query: 183 PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF-DNIVSPYSFSNIITQDFRSVS 241
V+ FGGSYGGML+AWFR+KYPH+ GA+A+SAP+ F D+ + + I+T+ F
Sbjct: 165 AVIAFGGSYGGMLSAWFRIKYPHIVDGAIAASAPVFWFTDSNIPEDVYDFIVTRAFLDSG 224
Query: 242 ENCYKVIKGSWKQIEETAKKPGG------LEKLQKAFRICKSEKNLAIESWLSTAFVYTA 295
N V KG W ++E AK G L KL ++ + ++ ++ + A
Sbjct: 225 CNRKAVEKG-WIALDELAKTDSGRQYLNVLYKLDPKSKLENKDDVSFLKQYIRESMEAMA 283
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDDP-KTGNDVFAKLYGAASVYYNYSG-------- 346
M +YP P++FL+ LP++PVKE CK P K+ + +LY ++YYNY+G
Sbjct: 284 MVNYPYPTSFLSSLPSWPVKEACKFASQPGKSQEESAEQLYNIVNLYYNYTGDKSTHCAN 343
Query: 347 TAKCFDLNGD-SDPHGLSEWGWQACTEMIM-LTG-GDNKDSIFEESEEDYDARARYCKEA 403
AKC G DP G W +Q CTEM+M L G G D +++ + A YCK+
Sbjct: 344 AAKCDSAYGSLGDPLG---WPFQTCTEMVMPLCGSGYPNDFFWKDCPFTTEKYAEYCKQT 400
Query: 404 YG-VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV--LKNISKTVLALV 460
+ + + + GG G ASNI+F NG DPWSGGG + +V++++
Sbjct: 401 FAQISYNKTLLRPQAGGRAFGATSLPSASNIVFSNGYLDPWSGGGYDHSDKVQGSVISVI 460
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
K+GAHH DLR + +D + +K VR E I KWI +
Sbjct: 461 LKQGAHHYDLRGAHPQDTEEVKKVRAMETAAIKKWIKE 498
>gi|402591692|gb|EJW85621.1| hypothetical protein WUBG_03470, partial [Wuchereria bancrofti]
Length = 478
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 179/473 (37%), Positives = 259/473 (54%), Gaps = 57/473 (12%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF+++ + F+ RYLIN H+ +N PIF YTGNEG++E FAQNTG M+D+AP+
Sbjct: 17 IDHFSFHDN--RVFRLRYLINTEHF---VSNGPIFFYTGNEGNVELFAQNTGLMWDLAPE 71
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN--LTA 179
F A+++F EHR+YGKS P+ GNK +Y GYLSS QAL D+A LI LK L A
Sbjct: 72 FNAVIIFAEHRFYGKSQPF-GNK--SYATIRNLGYLSSEQALGDFALLIYHLKNKRLLVA 128
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI---VSPYSFSNIITQD 236
+S V+ FGGSYGGMLAAW R+KYPH+ G++ASSAP+ F ++ V +++ I+ +
Sbjct: 129 QNSSVIAFGGSYGGMLAAWMRIKYPHLVEGSIASSAPVFWFIDMSRSVPQDAYNRIVKRS 188
Query: 237 FRSVSENCY-KVIKGSWKQIEETAKKPGGLEKLQKAFRICK------SEKNLAIESWLST 289
F +S C K I W ++ + G L F + K S + ++ +L
Sbjct: 189 F--LSSGCIEKNILDGWIALKNLSLTTTGRAYLNGLFHLDKKSYLKISTDWIMLKEYLED 246
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND--VFAKLYGAASVYYNYSGT 347
F AM +YP P+N+L LP +PVK C+ + K +D + +YG ++YYNY+G
Sbjct: 247 IFGSMAMVNYPYPTNYLAQLPGWPVKVACQFFNSNKAKSDEELAQSMYGIMNLYYNYTGQ 306
Query: 348 AKCF----DLNGDS------DPHGLSEWGWQACTEMIM--LTGGDNKDSIFEESEEDYDA 395
K F ++ DS DP G W WQ+CTEMIM + G D +
Sbjct: 307 KKTFCIKPNVCNDSAYGALGDPFG---WTWQSCTEMIMQQCSSGPPNDFFIKNCPFSLKG 363
Query: 396 RARYCKEAYG------VDPRPNWITTEFGGHKIGLVLKRF--ASNIIFFNGLRDPWSGGG 447
+ YC +G RP+W +G R+ A+NI+F NG DPWS GG
Sbjct: 364 QELYCINTFGKLGYTKALMRPHWSILNYG--------NRYPTATNIVFSNGYLDPWSAGG 415
Query: 448 --VLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ + ++++++ K+GAHH DLR + D +KD RR E I W+ +
Sbjct: 416 WSLKSRVMGSLISIIIKDGAHHYDLRGKHQLDTNSVKDARRLEKFYIKYWLKE 468
>gi|324508835|gb|ADY43728.1| Serine protease pcp-1 [Ascaris suum]
Length = 484
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 177/456 (38%), Positives = 247/456 (54%), Gaps = 45/456 (9%)
Query: 77 QRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGK 136
+YL N+T++ K PIF Y GNE IE FA+NTG M+D+AP+F A +VF EHRYYG+
Sbjct: 1 MKYLYNNTYY---KIGGPIFFYAGNEAAIEGFAENTGIMFDLAPRFNASIVFAEHRYYGE 57
Query: 137 SIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL--TATDSPVVVFGGSYGGM 194
S P+G +++Y + G+L+STQA+AD+A + K N+ ++D+PV+ FGGSYGGM
Sbjct: 58 SKPFG---DLSYSDVKNLGFLTSTQAMADFAKFLPHFKANVLNCSSDTPVIAFGGSYGGM 114
Query: 195 LAAWFRLKYPHVAIGALASSAPILNFDNI-VSPYSFSNIITQDFRSVSENCYKVIKGSWK 253
LAAWFR+KYPH+ GA ASSAP+L F V P +F ++T+DF N V K ++
Sbjct: 115 LAAWFRIKYPHIVTGAWASSAPVLLFKGANVDPGAFDKVVTEDFIEAGCNREAVYK-AFN 173
Query: 254 QIEETAKKPGGLEKLQKAFRICKSEKNLA-------IESWLSTAFVYTAMTDYPTPSNFL 306
I E A GL L + F I +++ NL + S++ AF Y AM +YP P++FL
Sbjct: 174 AIHELASTTAGLTFLNEMFVI-EAKSNLTQASDADYLVSFIREAFGYLAMVNYPYPTSFL 232
Query: 307 NPLPAFPVKEMCKAI-----DDPKTGNDVFAKLYGAASVYYNYSGTAK--------CFDL 353
PLP +PVKE CK P T D+ LY +++YYNY+GT C D
Sbjct: 233 LPLPGWPVKEACKRAQAAFPQTPTTNRDLVNYLYIISNLYYNYTGTVATNCVKTSVCGDQ 292
Query: 354 NGDSDPHGLSEWGWQACTEMI--MLTGGDNKDSIFEESEED---YDARARYCKEAY-GVD 407
W WQ+CTE++ M G + D ++E ++ + YC + +
Sbjct: 293 ATAESGDDAFGWPWQSCTELVIEMCARGGSNDFFYDECQQAGGVLNLITDYCLTTFSSIG 352
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS-------KTVLALV 460
N++ + GL ASNIIF NG DPWS GGV N S V
Sbjct: 353 YNKNFLFELGAPIQYGLEFSA-ASNIIFTNGNLDPWSVGGVFANTSGIQQASENGVYTYF 411
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ AHH+DLR DP +K+ R + V II W+
Sbjct: 412 IEGSAHHLDLRQPNTCDPAPVKNARFQIVNIIDCWV 447
>gi|449281466|gb|EMC88535.1| Dipeptidyl-peptidase 2, partial [Columba livia]
Length = 437
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 163/420 (38%), Positives = 231/420 (55%), Gaps = 25/420 (5%)
Query: 90 KNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
K PIF YTGNEGDI FAQN+ F++++A + +AL++F EHRYYGKS+P+G +
Sbjct: 9 KGFGPIFFYTGNEGDIWNFAQNSDFIFELAEEQQALVIFAEHRYYGKSLPFG----LESM 64
Query: 150 NASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIG 209
TG L+ QALADYA LI +LK+ A D PV+ FGGSYGGML+A+ R+KYP++ G
Sbjct: 65 QPKNTGLLTVEQALADYAVLITELKQQYGAADCPVIAFGGSYGGMLSAYMRMKYPNIVSG 124
Query: 210 ALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLE--- 266
ALA+SAP+L+ + P F +T DF S C ++ +++QI++ + E
Sbjct: 125 ALAASAPLLSVAGLGDPTQFFRDVTADFDKSSPGCVPAVRKAFQQIKDLFLRGAYDEISS 184
Query: 267 KLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKT 326
K+ +I E + + AF AM DYP ++F+ LPA PVK C I T
Sbjct: 185 KMATCNKISTKEDVYQLFGFARNAFTMMAMMDYPYKTDFMGHLPANPVKVGCDQI---LT 241
Query: 327 GNDVFAKLYGAASVYYNYSGTAKCFDL----NGDSDPHGLS------EWGWQACTEMIML 376
D L V YN S +A+C+D+ +DP G W +Q CTE+ +
Sbjct: 242 HADPIRGLAALVGVLYNSSSSAQCYDIYQLYQSCADPTGCGIGSDAEAWDYQVCTEINLT 301
Query: 377 TGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFF 436
+N +F E R +YC + V PR W+ F G + + ASNIIF
Sbjct: 302 FNSNNVTDMFPEMPFTEAMREQYCWSRWRVRPRARWLQINFWGGDL-----KSASNIIFS 356
Query: 437 NGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
NG DPW+GGG+ ++S +++AL K GAHH+DLR DP + +VR+ E II W+
Sbjct: 357 NGDLDPWAGGGINSSLSSSLIALTIKGGAHHLDLRGHNPADPPSVTEVRKLEASIINHWV 416
>gi|326428644|gb|EGD74214.1| hypothetical protein PTSG_06225 [Salpingoeca sp. ATCC 50818]
Length = 501
Score = 281 bits (720), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 171/469 (36%), Positives = 245/469 (52%), Gaps = 44/469 (9%)
Query: 57 YHTQILDHFNYNP----QSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ Q DHF + P + TFQQR I D +W + N PIF Y GNEGD+E + +T
Sbjct: 46 WYNQTTDHFQWRPSGTAEEPLTFQQRVFICDQYWDKT-NPGPIFFYAGNEGDVELYVNHT 104
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKS--IP-YGGNKEIAYKNASTTGYLSSTQALADYASL 169
G M++ AP F+ALLVF EHR+YGK+ P G E YK YL+ QA+ADYA L
Sbjct: 105 GLMWESAPMFRALLVFAEHRFYGKTQLTPGASGPSEHQYK------YLTHDQAMADYAHL 158
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY-- 227
+ LK++ S +VFGGSYGGMLAAW R+KYP GA+A+SAPIL F + P+
Sbjct: 159 LYHLKRDRNCESSKTIVFGGSYGGMLAAWLRMKYPQTFDGAIAASAPILAFPGMTPPFDS 218
Query: 228 -SFSNIITQD---FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAI 283
+ ++T+D + C ++ +WK++ K G + L FR C S
Sbjct: 219 NGYWQVVTRDATPAAGAAPACENNMRNAWKELFSRGKTESGRKSLSTLFRTC-SPVTSED 277
Query: 284 ESWLSTAFVY-----TAMTDYPTPSNFLN----PLPAFPVKEMCKAIDDPK-TGNDVFAK 333
++W F+ AM +YP PSN+L LPA+PV CK + G+ + +
Sbjct: 278 DTWRLAMFLLLSIDTLAMGNYPYPSNYLTGGGPKLPAYPVVAACKPLAKKDLKGDALLSA 337
Query: 334 LYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMI---MLTGGDNKDSIFEESE 390
L A+VY N + CFD+ W +Q CTE++ + +F
Sbjct: 338 LRDGAAVYANATQDLTCFDIPDQKHVEQDGIWDYQWCTELMPQETYFSLNGTTDMFWAQP 397
Query: 391 EDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
+D +C+ YG+ PR +W+ ++G GL ASNI+F NGL DPWS GGV
Sbjct: 398 QDMAFVRDHCRTKYGIVPREDWMAVKYG----GLNALPAASNIVFSNGLLDPWSSGGVKH 453
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKEDP---QWLKDVRRREVEIIGKWI 496
NIS ++ A++ GAHH+DL F+ +D W ++ R + I KW+
Sbjct: 454 NISDSITAIILPHGAHHIDLFFTNPQDTWDITWARNYHRAQ---IAKWL 499
>gi|66805245|ref|XP_636355.1| peptidase S28 family protein [Dictyostelium discoideum AX4]
gi|60464723|gb|EAL62849.1| peptidase S28 family protein [Dictyostelium discoideum AX4]
Length = 513
Score = 281 bits (720), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 165/461 (35%), Positives = 248/461 (53%), Gaps = 32/461 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNA-------PIFVYTGNEGDI 105
Y + Q LDHFN+ Y F QR LI D ++ N P+ + GNEGD+
Sbjct: 56 YTLLWFNQTLDHFNFETSGY--FNQRVLIIDQYFNEKSKNEIDQICTKPLIFFCGNEGDV 113
Query: 106 EWFAQNTGFMYD-VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALA 164
+F +N+ F+ + +A + AL++F EHRYYG+S+P+G +Y N + YLSS QALA
Sbjct: 114 TFFYENSLFITNTLAQEMNALVIFAEHRYYGESLPFGNQ---SYTNENFQ-YLSSEQALA 169
Query: 165 DYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIV 224
DY+ +I + K A + PV GSYGG LAAW RLKYP + GALASSAP+L++
Sbjct: 170 DYSKIIPSILKQYNALNCPVFTTSGSYGGDLAAWMRLKYPFIVDGALASSAPLLSYMGTG 229
Query: 225 SPYS-FSNIITQDFRSVSEN--CYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC----KS 277
PY F +T DF+ S++ C I+ ++ +E AK G ++ +F++C +
Sbjct: 230 VPYDVFPVGVTNDFKETSQDGSCAIKIRNAFNDLETIAKADNGFNEISTSFKLCTPINSN 289
Query: 278 EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA 337
+ + W+ + F Y +M DYP P++FL P+ PV E C I+ D+ +
Sbjct: 290 DDFQSFLGWVESGFSYMSMADYPYPASFLEPMMGNPVNETCNLINQLDNSIDI---IMSG 346
Query: 338 ASVYYNYSG-TAKCFDLNGDSDPHG-LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
+YYNY+G +CF+ N + G L W +Q+CTE + +F S +
Sbjct: 347 LQIYYNYTGQMMQCFNTNIFIEDQGMLIPWSYQSCTEFVFPFTTTGIKDMFYYSPFNLTE 406
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV-LKNISK 454
C+E Y V P PNW+T+ +GG +SNIIF NG+ D W G G+ + + SK
Sbjct: 407 YIENCQEEYNVTPDPNWVTSVYGGTP-----NFPSSNIIFSNGVLDGWHGAGINVTDYSK 461
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKW 495
++A++ AHH+DLR S DPQ + D R E++ + +W
Sbjct: 462 NIIAILIPGAAHHLDLRGSNPLDPQSITDARLLELKYLTEW 502
>gi|391339066|ref|XP_003743874.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Metaseiulus
occidentalis]
Length = 476
Score = 281 bits (720), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 171/490 (34%), Positives = 264/490 (53%), Gaps = 24/490 (4%)
Query: 18 LTISNAKI--FPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTF 75
++IS A++ F + L ++ SS+ + + +T + +DHF Y + TF
Sbjct: 1 MSISTARVCLLLQFLPVVLVCSLLGAVSSSTARRRFVTETTWFNVPIDHFGY--YNNNTF 58
Query: 76 QQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYG 135
R L N+ ++ +K PIF+Y GNEGDI F NTG ++D A +F ALLVF EHRYYG
Sbjct: 59 PLRVLYNNEYFNHTKP-GPIFLYAGNEGDIALFVYNTGLLWDWAEEFGALLVFAEHRYYG 117
Query: 136 KSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATD-SPVVVFGGSYGGM 194
KS+PYG + + K+ S GYL+ QALAD+A +I ++K+ S VV FGGSY GM
Sbjct: 118 KSMPYGRD---SLKDVSYYGYLTVDQALADFAHVISEIKETWPGVQKSKVVAFGGSYAGM 174
Query: 195 LAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV-SENCYKVIKGSWK 253
LAAW R+KYP + AL+S API + +V +F++ + + F + + C I+ SWK
Sbjct: 175 LAAWLRMKYPWLVEAALSSGAPIRLYQGLVGCNAFNDGVARAFLAEGGKKCVNNIRKSWK 234
Query: 254 QIEETAKKPGGLEKLQKAFRICKSEKNLA-----IESWLSTAFVYTAMTDYPTPSNFLNP 308
+E + G + K F +C +KN+A + W+ ++V AM +YP
Sbjct: 235 ALERFKESEEGTNFIFKKFHVC--QKNIASGFAQVRDWIYGSYVNLAMHNYPYGLE-TRR 291
Query: 309 LPAFPVKEMCKAID-DPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGW 367
L +P++ C + D + D+ + +Y A +VY+N+SG C +++ H S W
Sbjct: 292 LSTYPIRLACAFLQKDFQKDEDLLSAIYDAVNVYHNHSGVVHCNNVDDVYGEHIGSAWQV 351
Query: 368 QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLK 427
Q C E+++ + K I D YCK YG+ P P+ + T F G KI +
Sbjct: 352 QNCNELVLPYCANGKSDISYPFSWKLDGIKAYCKRRYGMTPNPSRVRTMFAGDKISV--- 408
Query: 428 RFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRR 487
ASNI+F NG DPWS GG+LK++ T+ A++ + GAHH DL+ +D + ++ R
Sbjct: 409 --ASNIVFSNGDLDPWSSGGILKSLGPTLPAIIVRGGAHHYDLKGDHPDDTEEVRKARNT 466
Query: 488 EVEIIGKWIS 497
I W++
Sbjct: 467 AKNYIKTWLA 476
>gi|341900594|gb|EGT56529.1| CBN-PCP-5 protein [Caenorhabditis brenneri]
Length = 507
Score = 281 bits (718), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 183/494 (37%), Positives = 267/494 (54%), Gaps = 43/494 (8%)
Query: 30 PSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGS 89
P ++ P+K + I K Y+ Y +DHF++ F RY +N ++
Sbjct: 23 PVTQRGPQKFENSIGKYK-----YEVGYLKVPIDHFSFTND--MEFNLRYFLNTDNY--- 72
Query: 90 KNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
++ PI YTGNEG +E FA+NTGFM+D+AP+ KA +VF+EHR+YGKS P+G +Y
Sbjct: 73 ESGGPILFYTGNEGSLEAFAENTGFMWDLAPELKAAVVFVEHRFYGKSQPFGNQ---SYT 129
Query: 150 NASTTGYLSSTQALADYASLIIDLKKNLT---ATDSPVVVFGGSYGGMLAAWFRLKYPHV 206
+ GYLSS QALAD+A L + KN A S V+ FGGSYGGML+AWFR+KYPH+
Sbjct: 130 DIRRLGYLSSQQALADFA-LSVQFFKNEKIKGAQKSAVIAFGGSYGGMLSAWFRIKYPHI 188
Query: 207 AIGALASSAPILNF-DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGG- 264
GA+A+SAP+ F D+ + + I+T+ F N K I +W ++E +K G
Sbjct: 189 VDGAIAASAPVFWFTDSNIPEDVYDFIVTRAFLDAGCN-RKAIDKAWLALDELSKSDSGR 247
Query: 265 -----LEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK 319
L KL ++ + ++ ++ + AM +YP P++FL+ LP++PVKE CK
Sbjct: 248 RYLNILYKLDPKSKLENKDDIGFLKQYIRESMEAMAMVNYPYPTSFLSSLPSWPVKEACK 307
Query: 320 AIDDP-KTGNDVFAKLYGAASVYYNYSG--------TAKCFDLNGD-SDPHGLSEWGWQA 369
P K+ + +LY ++YYNY+G AKC G DP G W +Q
Sbjct: 308 FASQPGKSQEESAEQLYNIVNLYYNYTGDKSTHCANAAKCDSAYGSLGDPLG---WPFQT 364
Query: 370 CTEMIM-LTGGDNKDSIFEESEEDYDAR-ARYCKEAYG-VDPRPNWITTEFGGHKIGLVL 426
CTEM+M L G + F + A+ A YC + + + + GG G
Sbjct: 365 CTEMVMPLCGSGYPNDFFWKDCPFTTAKYAEYCMQTFSQISYNKTLLRPLAGGRAFGATS 424
Query: 427 KRFASNIIFFNGLRDPWSGGGVLKN--ISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDV 484
ASNI+F NG DPWSGGG + + TV++++ K+GAHH DLR + +D + +K V
Sbjct: 425 LPSASNIVFSNGYLDPWSGGGYEHSDKVQGTVVSVILKQGAHHYDLRGAHPQDTEEVKKV 484
Query: 485 RRREVEIIGKWISQ 498
R E I KWI +
Sbjct: 485 RAMETTNIKKWIKE 498
>gi|312080065|ref|XP_003142442.1| hypothetical protein LOAG_06858 [Loa loa]
gi|307762394|gb|EFO21628.1| hypothetical protein LOAG_06858 [Loa loa]
Length = 509
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 174/468 (37%), Positives = 260/468 (55%), Gaps = 53/468 (11%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF+++ +TF+ RYLIN ++ +N PIF YTGNEG++E FAQNTG M+D+AP+
Sbjct: 52 IDHFSFHDN--RTFRLRYLINTDYFA---HNGPIFFYTGNEGNVELFAQNTGLMWDLAPQ 106
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL--TA 179
A++VF EHR+YGKS P+G I +N GYLSS QAL D+A LI LK A
Sbjct: 107 LNAMVVFAEHRFYGKSQPFGNKSYITIQN---FGYLSSEQALGDFALLINHLKNKYLSMA 163
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSP-YSFSNIITQDFR 238
+S V+ FGGSYGGMLAAW R+KYPH+ G++ASSAP+ F ++ P ++S+I+ + F
Sbjct: 164 QNSSVIAFGGSYGGMLAAWMRIKYPHLVEGSIASSAPVFWFIDMSVPDDAYSHIVKRSF- 222
Query: 239 SVSENCYK--VIKGSWKQIEETAKKPGGLEKLQKAFRI-----CKSEKN-LAIESWLSTA 290
V+ C + +I G W ++ + G + L + F + KS + + ++ +L
Sbjct: 223 -VNSGCIERNIING-WIALKNLSSTASGRDYLNRLFHLDKKSYLKSNTDWIMLKEYLEDI 280
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID--DPKTGNDVFAKLYGAASVYYNYSGTA 348
F AM +YP PSN+L LP +PVK C+ + + +T ++ +YG ++YYNY+G
Sbjct: 281 FQSMAMVNYPYPSNYLAKLPGWPVKVACQFFNNTNKQTDKELAQSMYGIMNLYYNYTGQK 340
Query: 349 KCFDLN----------GDSDPHGLSEWGWQACTEMIM--LTGGDNKDSIFEESEEDYDAR 396
+ F ++ DP G W WQ+CTEMIM + G D + + +
Sbjct: 341 EQFCIDPKVCKDTAYEALGDPIG---WSWQSCTEMIMQLCSSGPPNDFFIKNCPFTLEDQ 397
Query: 397 ARYCKEAYGV------DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG--V 448
YC A+G RP+W +G A+NIIF NG DPWS GG +
Sbjct: 398 ESYCINAFGKLGYTKNLMRPHWSILNYGNQ------YPTATNIIFSNGYLDPWSAGGWSL 451
Query: 449 LKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ +++++ K+GAHH DLR + D + +K+ R E I W+
Sbjct: 452 KSQLIGPLISIIIKDGAHHYDLRGEHQLDTKSVKEARLLEKLCIKHWL 499
>gi|391347711|ref|XP_003748099.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Metaseiulus
occidentalis]
Length = 468
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 164/448 (36%), Positives = 242/448 (54%), Gaps = 20/448 (4%)
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
+Y Q +DHF ++ + TF+Q+YL++D + + PIF Y G E ++E A+ TG
Sbjct: 28 VQYFEQRVDHFGFHKR--DTFRQKYLMSDKTF---QAGGPIFFYCGGEMNVELHARQTGL 82
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
M+ A +F+AL+VF EHRYYG+S+PYG + ++ + GYLS+ QALADYA+++ LK
Sbjct: 83 MFTWAREFRALVVFAEHRYYGESLPYG---DASFYGSERRGYLSTEQALADYAAILSHLK 139
Query: 175 KNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
N T AT S +VV+G Y GMLA W R+KYPH+A A ASSAPI + V F +
Sbjct: 140 ANHTGATKSEIVVWGAGYSGMLAVWMRVKYPHIAKLAYASSAPIGFYSGEVPCGKFLKAV 199
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES---WLSTA 290
T FRS SE C + I+ W ++ A G+ L AF C+ + I + WL +
Sbjct: 200 TSVFRSESETCVQSIRRIWNVLQTMATSRDGMAHLADAFNTCQPVRGDNISNLFRWLKES 259
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI-DDPKTGNDVFAKLYGAASVYYNYSGTAK 349
F +M D+P ++ LPA+PVKE C + + ++ D+ + A SV YN++G
Sbjct: 260 FRTISMLDFPYETDLFGKLPAYPVKEACSKLWNHQQSDRDLMRAAHKAVSVLYNHTGEVV 319
Query: 350 CFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR 409
C+ L D+ + WG+QACTE+IM D D +F D R C +GV P
Sbjct: 320 CYSL--DNTLRNNAGWGFQACTELIMPVCSDGMDDMFNPKSWDLKKVQRKCLNKFGVWPD 377
Query: 410 PNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVD 469
+ +GG GL NI+ N RDPW GG+L ++ + + + GAH D
Sbjct: 378 DQRLKRIYGG-ATGLAT---VDNIVVTNNQRDPWYDGGILTG-TEGITVISIRNGAHGHD 432
Query: 470 LRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+R ++DP + R R +I + IS
Sbjct: 433 MRTPHEKDPISVTWARSRVRAVIRRTIS 460
>gi|17557065|ref|NP_498718.1| Protein PCP-5, isoform b [Caenorhabditis elegans]
gi|351020563|emb|CCD62539.1| Protein PCP-5, isoform b [Caenorhabditis elegans]
Length = 568
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 186/518 (35%), Positives = 279/518 (53%), Gaps = 47/518 (9%)
Query: 6 IFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHF 65
IF+S +L ++T ++ + P ++ K I K +G K +D F
Sbjct: 64 IFISLAILIATTHCLTLLRD----PVTQNGASKFEKSIGKYKYEEGYLKAP-----IDPF 114
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
+ F RY +N H+ + PI YTGNEG +E FA+NTGFM+D+AP+ KA
Sbjct: 115 AFTND--LEFDLRYFLNIDHY---ETGGPILFYTGNEGSLEAFAENTGFMWDLAPELKAA 169
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT---ATDS 182
+VF+EHR+YGKS P+ K +Y + GYLSS QALAD+A L + KN A S
Sbjct: 170 VVFVEHRFYGKSQPF---KNESYTDIRHLGYLSSQQALADFA-LSVQFFKNEKIKGAQKS 225
Query: 183 PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF-DNIVSPYSFSNIITQDFRSVS 241
V+ FGGSYGGML+AWFR+KYPH+ GA+A+SAP+ F D+ + + I+T+ F
Sbjct: 226 AVIAFGGSYGGMLSAWFRIKYPHIVDGAIAASAPVFWFTDSNIPEDVYDFIVTRAFLDAG 285
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI-----CKSEKNLA-IESWLSTAFVYTA 295
N K I+ W ++E AK G + L +++ +++ ++ ++ ++ + A
Sbjct: 286 CN-RKAIEKGWIALDELAKSDSGRQYLNVLYKLDPKSKLENKDDIGFLKQYIRESMEAMA 344
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDDP-KTGNDVFAKLYGAASVYYNYSG-------- 346
M +YP P++FL+ LPA+PVKE CK+ P KT + +LY ++YYNY+G
Sbjct: 345 MVNYPYPTSFLSSLPAWPVKEACKSASQPGKTQEESAEQLYKIVNLYYNYTGDKSTHCAN 404
Query: 347 TAKCFDLNGD-SDPHGLSEWGWQACTEMIM-LTG-GDNKDSIFEESEEDYDARARYCKEA 403
AKC G DP G W +Q CTEM+M L G G D +++ + A +C +
Sbjct: 405 AAKCDSAYGSLGDPLG---WPFQTCTEMVMPLCGSGYPNDFFWKDCPFTSEKYAEFCMQT 461
Query: 404 Y-GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV--LKNISKTVLALV 460
+ + + GG G ASNI+F NG DPWSGGG + +V++++
Sbjct: 462 FSSIHYNKTLLRPLAGGLAFGATSLPSASNIVFSNGYLDPWSGGGYDHSDKVQGSVISVI 521
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
K+GAHH DLR + +D + +K VR E + I KWI +
Sbjct: 522 LKQGAHHYDLRGAHPQDTEEVKKVRAMETQAIKKWIKE 559
>gi|17557063|ref|NP_498719.1| Protein PCP-5, isoform a [Caenorhabditis elegans]
gi|466050|sp|P34676.1|PCP5_CAEEL RecName: Full=Prolyl carboxy peptidase like protein 5; Flags:
Precursor
gi|351020562|emb|CCD62538.1| Protein PCP-5, isoform a [Caenorhabditis elegans]
Length = 507
Score = 279 bits (713), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 186/518 (35%), Positives = 279/518 (53%), Gaps = 47/518 (9%)
Query: 6 IFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHF 65
IF+S +L ++T ++ + P ++ K I K +G K +D F
Sbjct: 3 IFISLAILIATTHCLTLLRD----PVTQNGASKFEKSIGKYKYEEGYLKAP-----IDPF 53
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
+ F RY +N H+ + PI YTGNEG +E FA+NTGFM+D+AP+ KA
Sbjct: 54 AFTND--LEFDLRYFLNIDHY---ETGGPILFYTGNEGSLEAFAENTGFMWDLAPELKAA 108
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT---ATDS 182
+VF+EHR+YGKS P+ K +Y + GYLSS QALAD+A L + KN A S
Sbjct: 109 VVFVEHRFYGKSQPF---KNESYTDIRHLGYLSSQQALADFA-LSVQFFKNEKIKGAQKS 164
Query: 183 PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF-DNIVSPYSFSNIITQDFRSVS 241
V+ FGGSYGGML+AWFR+KYPH+ GA+A+SAP+ F D+ + + I+T+ F
Sbjct: 165 AVIAFGGSYGGMLSAWFRIKYPHIVDGAIAASAPVFWFTDSNIPEDVYDFIVTRAFLDAG 224
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI-----CKSEKNLA-IESWLSTAFVYTA 295
N K I+ W ++E AK G + L +++ +++ ++ ++ ++ + A
Sbjct: 225 CN-RKAIEKGWIALDELAKSDSGRQYLNVLYKLDPKSKLENKDDIGFLKQYIRESMEAMA 283
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDDP-KTGNDVFAKLYGAASVYYNYSG-------- 346
M +YP P++FL+ LPA+PVKE CK+ P KT + +LY ++YYNY+G
Sbjct: 284 MVNYPYPTSFLSSLPAWPVKEACKSASQPGKTQEESAEQLYKIVNLYYNYTGDKSTHCAN 343
Query: 347 TAKCFDLNGD-SDPHGLSEWGWQACTEMIM-LTG-GDNKDSIFEESEEDYDARARYCKEA 403
AKC G DP G W +Q CTEM+M L G G D +++ + A +C +
Sbjct: 344 AAKCDSAYGSLGDPLG---WPFQTCTEMVMPLCGSGYPNDFFWKDCPFTSEKYAEFCMQT 400
Query: 404 Y-GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV--LKNISKTVLALV 460
+ + + GG G ASNI+F NG DPWSGGG + +V++++
Sbjct: 401 FSSIHYNKTLLRPLAGGLAFGATSLPSASNIVFSNGYLDPWSGGGYDHSDKVQGSVISVI 460
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
K+GAHH DLR + +D + +K VR E + I KWI +
Sbjct: 461 LKQGAHHYDLRGAHPQDTEEVKKVRAMETQAIKKWIKE 498
>gi|325188924|emb|CCA23453.1| lysosomal ProX carboxypeptidase putative [Albugo laibachii Nc14]
Length = 544
Score = 279 bits (713), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 177/467 (37%), Positives = 260/467 (55%), Gaps = 45/467 (9%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHFN + +F QRY D + + +A +F Y GNE ++ + +TG+M++ A
Sbjct: 88 QRLDHFNVAQNA--SFPQRYFFCDPYELNAAIDA-VFFYLGNEAEVTLYLNHTGWMWENA 144
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
+FKA L+F EHRY+G+SIP+ KE +N G+LSS QALADYA+LI +K+N T
Sbjct: 145 WEFKAALIFAEHRYFGRSIPFP--KESIRQN---MGFLSSEQALADYAALITSIKQNRTH 199
Query: 180 TD-SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS---FSNIITQ 235
+P + FGGSYGGMLAAWFR+KYPH+ G +A+SAP+L F P F+ + T
Sbjct: 200 LQRAPFIGFGGSYGGMLAAWFRVKYPHIIDGVIAASAPVLAFMGDQRPVDMEGFARVSTF 259
Query: 236 DFR---SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK-----SEKNL-AIESW 286
D S NC I+ SW+ + + +K G EKL K F++C SEK+ A+ W
Sbjct: 260 DATMGAGASSNCASNIRQSWQSMWKLSKTLQGREKLSKIFQLCNDAILHSEKDAEAMIMW 319
Query: 287 LSTAFVYTAMTDYPTPSNFL----NPLPAFPVKEMCKAIDD----PKTGNDVFAKLYGAA 338
AF Y +M +YP P++++ + LP++PV+ C + D PK + + +
Sbjct: 320 AKEAFDYMSMGNYPYPTSYIMNGESTLPSYPVRVACGFLSDAFVVPKEEDTLLEAFVRSI 379
Query: 339 SVYYNYSGTAKCFDLNGDSDPHGLSE--WGWQACTEMIMLTGGDNKDSIF-----EESEE 391
VYYN + C D+ S+ W + C+E+ M + D +F +SE+
Sbjct: 380 GVYYNSTKQKSCHDMKPASEKSRRDADFWDYIYCSELYMPSTTDGIHDMFWPVAWNQSED 439
Query: 392 DYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN 451
+ + C + +GV RP W T+FGG K L+R ASNI+F NG DPWS GV K+
Sbjct: 440 NAN-----CIKTWGVSLRPFWAVTQFGGLK---ALQR-ASNIVFSNGNYDPWSATGVTKS 490
Query: 452 ISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
IS +V+ + GAHH+DL FS DP ++ R+ E + I +WI +
Sbjct: 491 ISSSVVYIPVPGGAHHIDLFFSNDLDPPEVRKARQLERQNIRRWIRR 537
>gi|440913231|gb|ELR62707.1| Dipeptidyl peptidase 2, partial [Bos grunniens mutus]
Length = 413
Score = 277 bits (709), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 162/425 (38%), Positives = 240/425 (56%), Gaps = 42/425 (9%)
Query: 89 SKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAY 148
++ PIF YTGNEGD+ FA N+GF+ ++A + AL+VF EHRYYGKS+P+G
Sbjct: 10 NRGEGPIFFYTGNEGDVWSFANNSGFILELAEQQGALVVFAEHRYYGKSLPFG------- 62
Query: 149 KNASTTGY---LSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPH 205
+ ++ GY L+ QALAD+A L+ L++ L A D+P + FGGSYGGML+A+ R+KYPH
Sbjct: 63 ERSTWRGYTELLTVEQALADFAGLLRALRQELEAPDAPAIAFGGSYGGMLSAYLRIKYPH 122
Query: 206 VAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGL 265
+ GALA+SAP+++ + PY F ++ DF+ S C + ++ +++QI + +
Sbjct: 123 LVAGALAASAPVVSAAGLGDPYQFFQDVSADFQGQSPECARAVQDAFRQIRDLFQ----- 177
Query: 266 EKLQKAFRICK---SEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI 321
Q F C+ K+L + + AF AM DYP ++F+ LPA PV C +
Sbjct: 178 ---QGEFGTCQPLSGPKDLTQLFGFARNAFTVLAMMDYPYATDFVGHLPAHPVG--CSRL 232
Query: 322 DDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL----NGDSDPHGLS------EWGWQACT 371
+ + A L A + YN SG C+D+ +DP G W +Q CT
Sbjct: 233 ---LSESSRIAGLRALAGLVYNSSGIEPCYDIYLQYQACADPTGCGLGSDAKAWDYQVCT 289
Query: 372 EMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFAS 431
E+ + +N +F E R +YC + +GV PR +W+ T FGG + AS
Sbjct: 290 EISLTFSSNNVSDLFPELPFTEAQRQQYCLDTWGVWPRQDWLQTSFGGGDL-----TAAS 344
Query: 432 NIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEI 491
NIIF NG DPW+ GG+ N+S +VLA+ + GAHH+DLR S +DP + + RR E +
Sbjct: 345 NIIFSNGDLDPWARGGIQSNLSASVLAIAIRGGAHHLDLRGSHPDDPASVVEARRLEAAL 404
Query: 492 IGKWI 496
IGKW+
Sbjct: 405 IGKWV 409
>gi|256070800|ref|XP_002571730.1| lysosomal Pro-Xaa carboxypeptidase (S28 family) [Schistosoma
mansoni]
gi|360043145|emb|CCD78557.1| lysosomal Pro-Xaa carboxypeptidase (S28 family) [Schistosoma
mansoni]
Length = 498
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 178/497 (35%), Positives = 253/497 (50%), Gaps = 36/497 (7%)
Query: 12 LLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQS 71
L+FS L +S + FP S I K+ Y+TKY +DHF++ +
Sbjct: 5 LIFSFLLPLSFGLLTRHFPWSTI------------KNPDFTYETKYFWTRVDHFSF--VN 50
Query: 72 YQTFQQRYLINDTHW--GGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFI 129
+ F RYLIN+ + GG PI YTGNEG IE FA+N+GF++ ++ + A +VF
Sbjct: 51 DEKFLIRYLINNESFTPGG-----PILFYTGNEGPIETFAENSGFIWKLSRELNASVVFA 105
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA-TDSPVVVFG 188
EHRYYG S+P+G N ++K+ GYL++ Q LADY LI LK N + SPV+ FG
Sbjct: 106 EHRYYGTSLPFGNN---SFKDRRHFGYLTAEQTLADYVLLINQLKANYSCFASSPVIAFG 162
Query: 189 GSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF-RSVSENCYKV 247
GSYGGML+AW R KYP+ GA+ASSAP+ F + FS IT F + ENC K
Sbjct: 163 GSYGGMLSAWIRQKYPNQIAGAIASSAPVWLFPGLSDCNGFSMTITNSFLKYGGENCVKN 222
Query: 248 IKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAFVYTAMTDYPTPSNFL 306
I+ SW I + + G E L F IC ++ I +LS +M +YP P++ +
Sbjct: 223 IQLSWSNIVDIGQSVDGKELLTHMFNICTPLTDVQNIIDYLSDFLGIISMVNYPYPASLI 282
Query: 307 NPLPAFPVKEMCKAIDDPKTGNDVFAK---LYGAASVYYNYSGTAKCFDLNGDSDPHGLS 363
LP +PVK +C + + V + L A NY+G C D++ + ++
Sbjct: 283 LALPEWPVKYLCTNLSEYDPQQPVVTRISLLAKAVLSLTNYTGNQTCLDISMNLPNVDVN 342
Query: 364 EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIG 423
W QAC EM+ T +I S D + YC+ +GV PR W EF +
Sbjct: 343 GWDLQACMEMVTPTCASGPVNIMPPSNWDLKTYSIYCQNRFGVSPRVEWPKVEFWSKSVY 402
Query: 424 LVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE-KEGAHHVDLRFSTKEDPQWLK 482
V +NIIF NG DPWS + N ++ + AHH+DLR DPQ +
Sbjct: 403 TV-----TNIIFSNGEIDPWSAFSITNNSYVPFATVINMSDAAHHLDLRTPNSADPQSVV 457
Query: 483 DVRRREVEIIGKWISQY 499
+ R E + I +WI ++
Sbjct: 458 EAREIEKQKIIQWIKEW 474
>gi|324509048|gb|ADY43812.1| Serine protease pcp-1 [Ascaris suum]
Length = 593
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 195/533 (36%), Positives = 273/533 (51%), Gaps = 71/533 (13%)
Query: 12 LLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSK--DSQGLY--KTKYHTQILDHFNY 67
LL SS + + +A TF + P L + D + +Y T Y+ +D+F +
Sbjct: 7 LLASSFIVVCSA----TFRRLTVDPVALRERKRAEPRVDDETIYGWSTAYYDVPIDNFAF 62
Query: 68 NPQSYQTFQQRYLINDTHW--GGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
S QT++ +YL N T++ GG PIF YTGNEG IE FA+NTG M+D+A KFKA
Sbjct: 63 T--SAQTYRMKYLYNLTYYELGG-----PIFFYTGNEGSIEEFAKNTGIMFDLAEKFKAA 115
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL--TATDSP 183
+ F EHRYYG S+P+G I+Y NA+ GYLSSTQALAD+A LI +K ++ D+P
Sbjct: 116 VFFAEHRYYGASMPFG---NISYTNANYLGYLSSTQALADFAKLITFIKTDVLKCPPDTP 172
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFD-NIVSPYSFSNIITQDFRSVSE 242
V+ FGGSYGGMLAAW R+KYPH+ GA +SSAP+L F+ VSP +F + + F +
Sbjct: 173 VIAFGGSYGGMLAAWLRMKYPHIVSGAWSSSAPLLYFEGGNVSPSAFEKAVKEVFINAGC 232
Query: 243 NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI------CKSEKNLAIESWLSTAFVYTAM 296
N I + I+ G + L FRI +S + + W+ A Y AM
Sbjct: 233 N-ENTIANGLEAIKNLMNTAEGRQFLNDLFRIDPTSTLTESTDSDFLVEWIWAAMDYMAM 291
Query: 297 TDYPTPSNFLNPLPAFPVKEMCK--AIDDPKTGNDVFAKLYGAASVYYNYSGTA--KCFD 352
+YP PSNFL PLP +PVK C+ A + LY ++VYYN++G C +
Sbjct: 292 VNYPYPSNFLQPLPGWPVKYSCENFARSEITDARQAATALYQISNVYYNFTGNVATNCVN 351
Query: 353 LN--GDSDPHGL---SEWGWQACTEMIMLTGGDN-KDSIFEESEEDYDA----------- 395
N G+S L + W WQ CTE++++ + + F+ E+Y+
Sbjct: 352 WNVCGESAIANLGADAGWSWQTCTELVLMMCSEGPPNDFFDNQCENYNGPVEIQIALCAA 411
Query: 396 ---RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN- 451
RA + +E V N + E+G ASNIIF NG DPWS GGV N
Sbjct: 412 EFTRAGWNREFLDV----NAVAIEYGFDYAA------ASNIIFTNGNLDPWSPGGVYANT 461
Query: 452 ------ISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ + AHH+DLR DP +K+ R + II W+ +
Sbjct: 462 PGIQEATKNGIYTFLIDGSAHHLDLRQPNTCDPPSVKNARFQITNIIDCWVHK 514
>gi|268575392|ref|XP_002642675.1| C. briggsae CBR-TAG-282 protein [Caenorhabditis briggsae]
Length = 505
Score = 275 bits (702), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 176/470 (37%), Positives = 260/470 (55%), Gaps = 36/470 (7%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y +DHF++ Y+ F RY +N H+ ++ PI YTGNEG +E FA+NT
Sbjct: 41 YDEGYLKVPIDHFSFT-NDYE-FDLRYFLNTDHY---ESGGPILFYTGNEGSLESFAENT 95
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA--SLI 170
G M+D+AP+ KA +VF+EHR+YGKS P+ K +Y + GYLSS QALAD+A +
Sbjct: 96 GLMWDLAPELKAAVVFVEHRFYGKSQPF---KNQSYTDIRHLGYLSSQQALADFALSAQF 152
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF-DNIVSPYSF 229
+K A S V+ FGGSYGGML+AWFR+KYPH+ GA+A+SAP+ F D+ + +
Sbjct: 153 FRNEKIKGAQTSAVIAFGGSYGGMLSAWFRIKYPHIVDGAIAASAPVFWFTDSNIPEDVY 212
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI-----CKSEKNLA-I 283
I+T+ F N V KG W ++E AK G + L +++ +S+ ++ +
Sbjct: 213 DFIVTRAFLDAGCNRKAVEKG-WIALDELAKSDSGRQYLNVLYKLDPKSKLESKDDIGFL 271
Query: 284 ESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP-KTGNDVFAKLYGAASVYY 342
+ ++ A AM +YP P++FL+ LP +PVKE CK + P K+ + +LY ++YY
Sbjct: 272 KQYIREAMEAMAMVNYPYPTSFLSSLPGWPVKEACKYANAPGKSQEESAEQLYNIVNLYY 331
Query: 343 NYSG--------TAKCFD-LNGDSDPHGLSEWGWQACTEMIM-LTG-GDNKDSIFEESEE 391
N++G AKC DP G W +Q CTEM+M L G G D +++
Sbjct: 332 NFTGDKTTHCANAAKCDSAYEALGDPLG---WPFQTCTEMVMPLCGSGYPNDFFWKDCPF 388
Query: 392 DYDARARYCKEA-YGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV-- 448
+ YC++ Y + + GG G ASNI+F NG DPWSGGG
Sbjct: 389 TTEKYGDYCQQTFYKIGYNKTLLRPLAGGRAFGATSLPSASNIVFSNGYLDPWSGGGYDH 448
Query: 449 LKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ +V++++ K+GAHH DLR + +D + +K VR E I KWI +
Sbjct: 449 SDRVQGSVISVILKQGAHHYDLRGAHPQDTEEVKKVRAMETSAIKKWIKE 498
>gi|348686510|gb|EGZ26325.1| hypothetical protein PHYSODRAFT_327243 [Phytophthora sojae]
Length = 543
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 170/471 (36%), Positives = 250/471 (53%), Gaps = 46/471 (9%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
K+ TQ LDHF+ +Y QRY + D H+ + +F Y GNE D+E + +TG M
Sbjct: 81 KFFTQTLDHFDVGAPTYL---QRYFVCDRHF---RPGGVMFFYVGNEADVELYLNHTGLM 134
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
++ A +F A+LVF EHRY+GKS+P+G N + YLS+ QALADYA LI LK+
Sbjct: 135 WENADEFGAMLVFAEHRYFGKSVPFGRNVTKHMR------YLSTEQALADYAVLITRLKE 188
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF--DNIVSPY-SFSNI 232
D PV+ FGGSYGGML +WFR+KYPH+ G +A+SAPIL++ D + +S +
Sbjct: 189 EWQ-RDIPVIGFGGSYGGMLGSWFRMKYPHIIDGVIAASAPILSYFGDEVAHDLRGYSQV 247
Query: 233 ITQDFRSV---SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK------SEKNLAI 283
T D ++NC ++ +W + K G KL++A +C+ E A+
Sbjct: 248 TTFDASPAAGSAQNCVPNVRRAWPTMRAFGKTTSGRRKLKEALVLCEDTPLDTDEAIDAV 307
Query: 284 ESWLSTAFVYTAMTDYPTPSNFL----NPLPAFPVKEMCK----AIDDPKTGN----DVF 331
W +F AM +YP S+++ + LPA+P++ C A D+ + G+ F
Sbjct: 308 MQWAKDSFDSMAMGNYPYASSYIMNGVSELPAYPMRVACSHLQDAFDETEDGDFKLLRAF 367
Query: 332 AKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE--WGWQACTEMIMLTGGDNKDSIFEES 389
AK G VYYN + +CF L S + W + C E+ T D +F +
Sbjct: 368 AKTIG---VYYNSTKDKECFQLKAPSAEDAVDSDFWDYIYCAELYGPTTTDGVADMFWYA 424
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
+Y A C +G+D R W T FGG + V ASNI+F NG DP S GVL
Sbjct: 425 PWNYTADNASCHAEWGIDARIAWPTIHFGGRRFLEV----ASNIVFSNGNYDPCSATGVL 480
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
+N S +V+A++ GAHH+DL FS DP+ +K R E + + +W +++
Sbjct: 481 QNYSDSVVAVLIDGGAHHLDLMFSNPLDPEPVKAARAAEKQHMKRWADEFY 531
>gi|313235771|emb|CBY11221.1| unnamed protein product [Oikopleura dioica]
Length = 484
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 161/475 (33%), Positives = 239/475 (50%), Gaps = 35/475 (7%)
Query: 31 SSRITPEKLSSLISSSKDSQGL--YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGG 88
++ + P K SSL S S + ++ +Y DHF+ ++ Q + R + +D +
Sbjct: 11 AAALKPLKYSSLESYSDFCSEISTFEAEY-----DHFS--TRNTQKIEIRVITDDRFY-- 61
Query: 89 SKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAY 148
+ P+ YTGNEGD++ F +NTGFM + A LVF+EHRYYGKSIP N
Sbjct: 62 -QAGGPVLFYTGNEGDVQLFCENTGFMRKAGKELNAKLVFMEHRYYGKSIPDDKNL---- 116
Query: 149 KNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
YLS+ QALADYA ++ LK + PV+ GGSYGGMLAA+FR+KYP++
Sbjct: 117 -------YLSAEQALADYAEYLVHLKS--SGVTGPVIAMGGSYGGMLAAYFRIKYPNLVA 167
Query: 209 GALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN--CYKVIKGSWKQIEETAKKPGGLE 266
GA+A SAP+ + F + T+ F + C I+ SW+ I+ G
Sbjct: 168 GAIAGSAPVKFLPGLFDCRGFYRVTTRTFTNTPSGHFCSDNIRKSWETIKLIGAHMVGKR 227
Query: 267 KLQKAFRICKSEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK 325
L + FR C ++ + +L + AM DYP P+NF+ +P +PV C +D
Sbjct: 228 TLSEVFRTCDPITDVEPLLDFLEDVWGTLAMMDYPYPTNFVGDVPGWPVNVACSHLDHDI 287
Query: 326 TGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSI 385
++ L AASVYYNY+G C DL + G + W +Q CTE + D K+ +
Sbjct: 288 NQEELLEPLRDAASVYYNYTGDLACLDLGDEGGDLGYNNWYFQTCTEFVFPFCSDGKEDM 347
Query: 386 FEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSG 445
F D+ + + C++ +G PR +W F + + IIF NGL DPWS
Sbjct: 348 FRVHTYDFPSYSTNCQQTFGTTPREHWAEMFFSVETMKTI-----GGIIFSNGLLDPWSS 402
Query: 446 GGVL--KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
GGVL + + +GAHH+DLR DP+ + R + I+ WI++
Sbjct: 403 GGVLTQEEAGPRNYIFILSKGAHHLDLRADNPADPEEVTLARTEYISIMKNWIAE 457
>gi|313246245|emb|CBY35176.1| unnamed protein product [Oikopleura dioica]
Length = 484
Score = 272 bits (695), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 163/475 (34%), Positives = 242/475 (50%), Gaps = 35/475 (7%)
Query: 31 SSRITPEKLSSLISSSKDSQGL--YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGG 88
++ + P K SSL S S + ++ +Y DHF+ ++ Q + R + +D +
Sbjct: 11 TAALKPLKYSSLESYSDFCSEISTFEAEY-----DHFS--TRNTQKIEIRVITDDRFY-- 61
Query: 89 SKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAY 148
+ P+ YTGNEGD++ F +NTGFM + A LVF+EHRYYGKSIP N
Sbjct: 62 -QAGGPVLFYTGNEGDVQLFCENTGFMRKAGKELNAKLVFMEHRYYGKSIPDDKNL---- 116
Query: 149 KNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
YLS+ QALADYA ++ LK + PV+ GGSYGGMLAA+FR+KYP++
Sbjct: 117 -------YLSAEQALADYAEYLVHLKS--SGVTGPVIAMGGSYGGMLAAYFRIKYPNLVA 167
Query: 209 GALASSAPILNFDNIVSPYSFSNIITQDFRSV-SEN-CYKVIKGSWKQIEETAKKPGGLE 266
GA+A SAP+ + F + T+ F + SE+ C I+ SW+ I+ G
Sbjct: 168 GAIAGSAPVKFLPGLFDCRGFYRVTTRTFTNTPSEHFCSDNIRKSWETIKLIGAHMVGKR 227
Query: 267 KLQKAFRICKSEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK 325
L + FR C+ ++ + +L + AM DYP P+NF+ +P +PV C +D
Sbjct: 228 TLSEVFRTCEPITDVEPLLDFLEDVWGTLAMMDYPYPTNFVGDVPGWPVNVACSHLDHDI 287
Query: 326 TGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSI 385
++ L AASVYYNY+G C DL + G + W +Q CTE + D K+ +
Sbjct: 288 NQEELLEPLRDAASVYYNYTGDLACLDLGDEGGDLGYNNWYFQTCTEFVFPFCSDGKEDM 347
Query: 386 FEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSG 445
F D+ + C++ +G PR +W F + + IIF NGL DPWS
Sbjct: 348 FRVHTYDFPTYSTNCQQTFGTTPREHWAEMFFSVETMKTI-----GGIIFSNGLLDPWSS 402
Query: 446 GGVL--KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
GGVL + + +GAHH+DLR DP+ + R + I+ WI++
Sbjct: 403 GGVLTQEEAGPRNYIFILSKGAHHLDLRADNPADPEEVTLARTEYISIMKNWIAE 457
>gi|313236608|emb|CBY11867.1| unnamed protein product [Oikopleura dioica]
Length = 471
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 169/467 (36%), Positives = 253/467 (54%), Gaps = 38/467 (8%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+TK+ Q +D+FNY +T++ RYL+N T + + APIF YTGNEG I+ FA NT
Sbjct: 23 YETKWIDQRVDNFNYYLD--KTYKMRYLVN-TDFVKDEKTAPIFFYTGNEGPIDSFAANT 79
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM + A + A +V+ EHRYYG+S+PY GN +N + YLS ALAD+A LI++
Sbjct: 80 GFMNEFAEEENAFIVYAEHRYYGQSLPY-GNSSFTPENMA---YLSVENALADFAQLIVE 135
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LKK P++ FGGSYGG+L+ + R+ YP++ GALA+S+P+ + + F
Sbjct: 136 LKKTYKG---PLICFGGSYGGLLSMYMRMTYPNLVNGALAASSPVYWISAMGDSHGFWVK 192
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK--SEKN-LAIESWLST 289
T+DF + + C I+ + +++ K + K R C+ +E N + + W
Sbjct: 193 TTEDFSTALDKCEDTIRAGFAALDKM-KNDKDWAGITKTMRTCQNITEDNYMHMLGWARN 251
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMC-KAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
A AM DYP P+NF LP PVKE C +A+ +TG D + AA + YN + +
Sbjct: 252 AMATMAMMDYPYPTNFEAALPGNPVKESCVRAV--AETGADSIRE---AAGLVYNGTDPS 306
Query: 349 K---CFDLNGD----SDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
K CFD+ + SDP G W +Q CT+ ++ G D K +F + D D
Sbjct: 307 KYKQCFDIMEEYVYCSDPTGCGTGPQALAWDYQCCTQQVLPGGTDGKTDMFPLIKFDVDD 366
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
RA YC + +GV P +W+ ++ + SN IF NG DPW GGV ++
Sbjct: 367 RAAYCNKTWGVVPDRDWLRIKYWADNL-----EATSNTIFSNGDLDPWGPGGVTHDLRHD 421
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
+ A + GAHH DLR S D Q + +VR+ + I W++Q++ +
Sbjct: 422 LPAPLVHGGAHHYDLRGSNSGDTQDVLNVRQFHRDTIRDWMAQFYAE 468
>gi|346467697|gb|AEO33693.1| hypothetical protein [Amblyomma maculatum]
Length = 348
Score = 269 bits (687), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 152/351 (43%), Positives = 208/351 (59%), Gaps = 13/351 (3%)
Query: 93 APIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAS 152
PIF YTGNEG I FA NTG M+D AP+F+ALL+F EHRYYGKS+PYG + ++++ S
Sbjct: 5 GPIFFYTGNEGSITTFANNTGLMWDWAPEFRALLIFAEHRYYGKSMPYG---DRSFESPS 61
Query: 153 TTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGAL 211
GYL+ Q LADYA L++ ++ L A +S VV FGGSYGGMLAAWFR+KYPHV AL
Sbjct: 62 HLGYLTVEQTLADYADLLLYIRSTLPGAGNSQVVSFGGSYGGMLAAWFRMKYPHVTAAAL 121
Query: 212 ASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA 271
A+SAPIL F I + + ++T+ FR SE C + I+ SW+ IE + G + +
Sbjct: 122 AASAPILQFQGITPCGALNAVVTRAFRMESELCAEAIRSSWELIENRSSTEEGAADIAER 181
Query: 272 FRICK--SEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN 328
F IC + KN + WL+ + AM +YP P+ FLNP+P PV+E CK ++ +
Sbjct: 182 FHICGNYTHKNYTKLRDWLTDQYTNLAMVNYPYPNEFLNPVPGHPVREACKFLNRTFDSS 241
Query: 329 DVFAK-LYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFE 387
DV + +Y A SV+ NY+G +C DL S W +Q+C EM+M D D +F
Sbjct: 242 DVLLESIYQAFSVFTNYTGQTQCNDLCKGSGTLDADGWDYQSCNEMVMPMCSDGVDDMFY 301
Query: 388 ESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNG 438
+++ D C++ + V P N FGG I +SNIIF NG
Sbjct: 302 KNDWDLKKVREKCEKDFHVTPDVNKAVLIFGGRNISA-----SSNIIFSNG 347
>gi|170574568|ref|XP_001892871.1| Serine carboxypeptidase S28 family protein [Brugia malayi]
gi|158601365|gb|EDP38293.1| Serine carboxypeptidase S28 family protein [Brugia malayi]
Length = 567
Score = 268 bits (685), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 176/483 (36%), Positives = 247/483 (51%), Gaps = 61/483 (12%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y + +DHF Y + F +YL N +++ + P+F Y GNEGDIE FAQNTG ++
Sbjct: 54 YQSMPIDHFTYRNNEF--FSLKYLANYSYF---LCDGPLFFYAGNEGDIEAFAQNTGIIW 108
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
D+AP+F A +VF EHRYYG S PYG + +Y + GYL+ Q LAD+A LI LK +
Sbjct: 109 DLAPRFHAAIVFAEHRYYGNSKPYG---KRSYMDVLRLGYLNDIQVLADFAQLITFLKTD 165
Query: 177 LT-------ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI-VSPYS 228
T+ PV+VFGGSYGGMLAAW R+KYPH+ GA ASSAP+ NF ++P S
Sbjct: 166 QEELGFCPPGTEIPVIVFGGSYGGMLAAWLRMKYPHIVDGAWASSAPLRNFYGTGINPES 225
Query: 229 FSNIITQDFRSVSENC-YKVIKGSWKQIEETAKKPGGLEKLQKAFR------ICKSEKNL 281
SN+ T ++ V+ C +KV + I++ +K G KL + FR + S+
Sbjct: 226 VSNVTTTNY--VTNGCDWKVFSEGFVAIQKLSKTEEGRMKLNQIFRSKPGFEMKNSDDFT 283
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA-KLYGAASV 340
++ S++ +A Y A+ DYP P+NFL PLP FPVK C+ ++ A +LY +V
Sbjct: 284 SLYSYIYSAVFYMAVIDYPYPTNFLTPLPGFPVKYACQYATKXGINDEELAEQLYNVINV 343
Query: 341 YYNYSG-------TAKCFDLNGDSDPHGLSEWGWQACTEMIM------------LTGGDN 381
YYNY+G T+ C L+ + + W WQ CT M M L DN
Sbjct: 344 YYNYTGQLTDHCFTSNCTILSPFQNNDEVIAWNWQCCTSMTMQNCDRSGENDFFLNTCDN 403
Query: 382 KDSIFEESEEDYDARARYCKEAY-GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLR 440
D + S +YC E + + N+ + G++ SN IF NG
Sbjct: 404 PDGLINIS-------IKYCTELFKDIGYSSNFYKLHDTMIRYGMIYNA-TSNTIFSNGNL 455
Query: 441 DPWSGGGVLKN-------ISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIG 493
DPWS GV +N + V + AHH+DLR DP + R + II
Sbjct: 456 DPWSASGVYENSPGITNAMRNGVYIFYMSDAAHHLDLRTPNTCDPPSVTYERFQVTNIIK 515
Query: 494 KWI 496
W+
Sbjct: 516 CWV 518
>gi|255081901|ref|XP_002508169.1| predicted protein [Micromonas sp. RCC299]
gi|226523445|gb|ACO69427.1| predicted protein [Micromonas sp. RCC299]
Length = 590
Score = 268 bits (685), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 167/467 (35%), Positives = 254/467 (54%), Gaps = 55/467 (11%)
Query: 81 INDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPY 140
++D G +K+ PIF YTGNE ++E + TG M++ A F A+LVF EHRYYG+S P
Sbjct: 127 LDDLDAGFTKSGPPIFFYTGNEANVELYLNATGLMWEHAESFGAVLVFAEHRYYGESKPK 186
Query: 141 GGNKEIAYKNASTTG--------------YLSSTQALADYASLIIDLKKNLTATDSPVVV 186
++ +AS G YL+S QA+ADYA+LI +LK + A D+PV
Sbjct: 187 PKEEDGNALDASNLGGIIPGHLKKKGQYPYLTSEQAMADYATLIRELKAEIRAPDAPVFA 246
Query: 187 FGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF---DNIVSPYSFSNIITQDFRSVSEN 243
FGGSYGGMLA W RLKY +V GA+A SAP+ +F D V P +F++ +T D + +
Sbjct: 247 FGGSYGGMLATWMRLKYANVVDGAVAGSAPVWSFVGEDPPVDPGAFADGVTMDATAAGGS 306
Query: 244 ---CYKVIKGSWKQI---EETAKKPGGLEKLQKAFRICKS------EKNLAIESWLSTAF 291
C ++ ++ ++ ET K ++ R+C L + W AF
Sbjct: 307 PPACAPNVRAAFAELLRRSETDPK-----SIKAPMRLCDDTPLGSPTDVLDVALWAQGAF 361
Query: 292 VYTAMTDYPTPSNFL----NPLPAFPVKEMC-KAIDDPKT----GNDVFAKLYGAASVYY 342
Y AM ++P S+++ LP +P + C A+ DPK G+ + + L A VYY
Sbjct: 362 DYLAMGNFPYASSYILNGDGTLPPYPFRVACGGAMADPKLLNKGGDALLSALADAVGVYY 421
Query: 343 NYSGTAKCFDL----NGDSDPHGLSEWGWQACTEMIMLTGGDN-KDSIFEESEEDYDARA 397
NYS T +CFD N DSD G W +Q CTEM M D +D F + + DA
Sbjct: 422 NYSKTQECFDTRHGSNDDSDEDG-ELWDYQYCTEMFMPMSRDGVRDMFFPQPWNETDAVL 480
Query: 398 RYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL 457
C+ +GV P+ W TT FGG ++ +ASN+++ NG DPW+G GV +++S +++
Sbjct: 481 E-CERRWGVRPKTLWATTVFGGRRLS-----WASNVVWTNGYLDPWAGLGVQESLSPSLV 534
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLA 504
A++ GAHH+D +S DP+ + + R+ ++ ++ +WI +Q +A
Sbjct: 535 AMMLPGGAHHLDFMWSNDLDPEPVVEARKTQMRLLRQWILNKYQAVA 581
>gi|348680585|gb|EGZ20401.1| hypothetical protein PHYSODRAFT_558788 [Phytophthora sojae]
Length = 566
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 168/499 (33%), Positives = 262/499 (52%), Gaps = 56/499 (11%)
Query: 41 SLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQ-TFQQRYLINDTHWGGSKNNAPIFVYT 99
+L S+D L + K+ TQ LDHF N S + +F+QRY + +N IF Y
Sbjct: 74 TLPDGSRDLLSLCQEKFITQELDHFRANGGSSEGSFEQRYFVCSPE-SFDPSNGSIFFYV 132
Query: 100 GNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSS 159
GNE D+ + +TG M++ A F AL+VF EHRY+GKS+P+G +LS+
Sbjct: 133 GNEADVTLYLNHTGLMWENAVAFNALIVFAEHRYFGKSVPFG------LDVLEHMEFLST 186
Query: 160 TQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILN 219
QALADYA LI LKK L D PV+ FGGSYGGML WFR+KYPH+ G +A+SAP++N
Sbjct: 187 QQALADYAVLIEALKKQL-GVDVPVIGFGGSYGGMLGTWFRMKYPHIIDGVIAASAPVVN 245
Query: 220 F----DNIVSPYSFSNIITQDFRS---VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
F D+ +F+ ++T D + NC ++ + E+ + G ++L +
Sbjct: 246 FLGDPDHPADTEAFNRVVTFDMSEEAGAAPNCIPNLRRALTAAIESVQTQDGRKQLAELL 305
Query: 273 RICKSEK---NLAIESWLSTAFVYTAMTDYPTPSNFLN----PLPAFPVKEMCKAIDDPK 325
+C ++ + + S + A+ AM +YP P++++ LP +P++ C+ +
Sbjct: 306 HLCDTDSLSTSDDVVSIAAEAYGDLAMGNYPYPTSYIMDGNVDLPGYPMRAACEPLAGDF 365
Query: 326 TGNDVFAKLYG---AASVYYNYSGTAKCF---------DLNGDSDPHGLSE-------WG 366
+D + + VYYN + + CF + + SD ++ WG
Sbjct: 366 AEDDDLGLINAFRESIDVYYNATKSESCFFPPAPKKTVNESATSDEAKQAKIDQKGNFWG 425
Query: 367 WQACTEMIMLTGGDNKDSIF-----EESEEDYDARARYCKEAYGVDPRPNWITTEFGGHK 421
+ C+E+ M D I+ +S++D D C E +GV +PNW TE+GG K
Sbjct: 426 YLECSELYMPMSSDGVSDIYPAVPVNQSKDDAD-----CFEQWGVHLKPNWAQTEYGGMK 480
Query: 422 IGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWL 481
R SNI+F NG DPWSG VL+++S +V+A+ GAHH+DL FS DP +
Sbjct: 481 A----LRATSNIVFSNGNFDPWSGLDVLQSLSPSVVAVPVPGGAHHLDLFFSHPLDPPAV 536
Query: 482 KDVRRREVEIIGKWISQYF 500
+ R+ E++ I +WI +++
Sbjct: 537 TEARQTELKYIRQWIDEFY 555
>gi|301097471|ref|XP_002897830.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
gi|262106578|gb|EEY64630.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
Length = 569
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 169/502 (33%), Positives = 257/502 (51%), Gaps = 50/502 (9%)
Query: 35 TPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQ-TFQQRYLINDTHWGGSKNNA 93
TP S S++ L + K+ TQ LDHF + +S + TF RY + S N
Sbjct: 71 TPTSFPSSPDGSRNLLSLCEEKFITQSLDHFRADGKSSEGTFDMRYFVCSPD-NFSPTNG 129
Query: 94 PIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
IF Y GNE D+ + +TG M++ A F AL+VF EHRY+GKS+P+G
Sbjct: 130 SIFFYVGNEADVTLYLNHTGLMWENAAAFNALIVFAEHRYFGKSVPFG------LDVLDH 183
Query: 154 TGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALAS 213
+LS+ QA+ADYA LI LK++L D PV+ FGGSYGGML WFR+KYPH+ G +A
Sbjct: 184 MEFLSTQQAMADYAVLIEMLKRDLK-VDVPVIGFGGSYGGMLGTWFRMKYPHIIDGIIAG 242
Query: 214 SAPILNF----DNIVSPYSFSNIITQDFRS---VSENCYKVIKGSWKQIEETAKKPGGLE 266
SAP+ NF D+ P +F+ ++T D + NC I+ + + G +
Sbjct: 243 SAPVANFFGDPDHPADPEAFNRVVTFDMSEDAGAATNCIPNIRRALNTAVAMSGTKSGRK 302
Query: 267 KLQKAFRICKS---EKNLAIESWLSTAFVYTAMTDYPTPSNFLNP----LPAFPVKEMCK 319
+L + +C + + + + S S A+ A+ +YP P++++ LP +P++ C+
Sbjct: 303 ELAELLHLCDADSLQSSDKVISIASEAYGDLAVGNYPYPTSYIMDGKVDLPGYPMRTACE 362
Query: 320 AI-----DDPKTGNDVFAKLYGAASVYYNYSGTAKCF---------DLNGDSDP------ 359
+D K+G + + +VYYN S + C D SD
Sbjct: 363 PFAGVFTEDDKSG--LIRAFRESIAVYYNASKSESCLFPVSPVKTIDELDTSDAAKQARI 420
Query: 360 -HGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFG 418
H + WG+ C+E+ M D + +F + C E +GV +P W EFG
Sbjct: 421 DHKGNFWGYLECSELYMPMSSDGVNDVFPTVAVNESQDNAACFEKWGVHLKPRWAQFEFG 480
Query: 419 GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDP 478
G K R ASNI+F NG DPWSG GVL+++S +V+A+ GAHH+DL F+ DP
Sbjct: 481 GMKA----LRAASNIVFSNGNFDPWSGLGVLESLSPSVVAVPVPGGAHHLDLFFTHPSDP 536
Query: 479 QWLKDVRRREVEIIGKWISQYF 500
+ + R E++ I +WI++++
Sbjct: 537 PAVTEARNTELKYIRQWINEFY 558
>gi|393909177|gb|EJD75348.1| serine carboxypeptidase S28 family protein [Loa loa]
Length = 542
Score = 265 bits (677), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 178/477 (37%), Positives = 253/477 (53%), Gaps = 49/477 (10%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y + +DHFNY ++ TF +YL+N +++ + P+F Y GNEGDIE FAQ TG M+
Sbjct: 13 YQSMPIDHFNY--RNLDTFGLKYLVNYSYFNC---DGPLFFYAGNEGDIETFAQMTGIMW 67
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
D+AP F A +VF EHRYYG+S P+G + +Y + GYL+ QALAD+A LI LK +
Sbjct: 68 DLAPLFNAAIVFAEHRYYGESQPFG---KRSYMDVLRLGYLNEIQALADFAELISFLKTD 124
Query: 177 L-------TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI-VSPYS 228
T+ PV+VFGGSYGGMLAAW R+KYPH+ GA ASSAP+ F ++P S
Sbjct: 125 QKELGFCPMGTEIPVIVFGGSYGGMLAAWLRMKYPHIVDGAWASSAPLRIFYGTGINPES 184
Query: 229 FSNIITQDFRSVSENC-YKVIKGSWKQIEETAKKPGGLEKLQKAFRI-----CKSEKN-L 281
S IT ++ ++ C KV + IE+ +K G KL + F KS + +
Sbjct: 185 VSRTITTNY--LTSGCDRKVFSDGFVAIEKMSKTEEGRMKLNRIFHAKPGFEMKSYNDFM 242
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA-KLYGAASV 340
++ S++ +A Y AMTDYP P++F PLP +PVK +C+ T + A ++Y +V
Sbjct: 243 SLYSYIYSAIFYMAMTDYPYPADFFEPLPGYPVKYVCQYAKKAATNEENLAEQIYSIINV 302
Query: 341 YYNYSG--TAKCFDLNG------DSDPHGLSEWGWQACTEM---IMLTGGDNK--DSIFE 387
YYNY+G T CF N +D ++ W WQ+CT + I GGDN + +
Sbjct: 303 YYNYTGQLTDNCFTSNCTTPSPIQNDDEDIA-WNWQSCTSLTIQICDRGGDNDFFLNTCD 361
Query: 388 ESEEDYDARARYCKEAY-GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
S + + C E + + N+ + G++ SNIIF NG DPWS G
Sbjct: 362 NSGDPVSTNIKLCTELFKDIGYNNNFYKLHDVTIRYGMIYNT-TSNIIFSNGNLDPWSAG 420
Query: 447 GVLKN-------ISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
GV +N + V AHH+D R DP + R + V II W+
Sbjct: 421 GVYENSPGIMEAMKNGVYIFYMLGAAHHLDFRTPNTCDPPSVTHERFQVVNIIKCWV 477
>gi|298710358|emb|CBJ31975.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 596
Score = 265 bits (676), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 169/518 (32%), Positives = 245/518 (47%), Gaps = 97/518 (18%)
Query: 54 KTKYHTQILDHFNYNPQSYQ----TFQQRYLINDTHWG----GSKNN------------- 92
+ ++ TQ LDHF + P S TFQQRY + +WG GS N+
Sbjct: 46 EERFFTQTLDHFRHTPVSEHDDDNTFQQRYFVCREYWGPTGGGSANSPRGQEDGASTSSS 105
Query: 93 -----------------------------------APIFVYTGNEGDIEWFAQNTGFMYD 117
PIF YTGNE D+ + + +G M++
Sbjct: 106 SSTSTSTSRRRKSNRGTAAEGGAGRGGEQAVPGAPGPIFFYTGNEADVSLYLEASGLMWE 165
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGG-NKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
AP F ALLVF EHR+YG+S+P+G +K + +T G + QALADYA L+ LK+
Sbjct: 166 NAPAFNALLVFAEHRFYGESLPFGAPDKRREFLRQATAG---TPQALADYARLVTALKQE 222
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
L A +PV+ FGGSYGGMLA+W RLKYPH+ GA+A+SAP+L + + P T +
Sbjct: 223 LGAEGAPVIAFGGSYGGMLASWLRLKYPHIVHGAIAASAPVLALEGLHRP-------TPN 275
Query: 237 FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAM 296
+ +E + + A P G L + + W AF Y AM
Sbjct: 276 PEAFAETVTAAAGPAGGAADSCAANPRGDGAL------------VELAWWARAAFDYLAM 323
Query: 297 TDYPTPSNFL-----NPLPAFPVKEMCKAIDDP---KTGNDVF-AKLYGAASVYYNYSGT 347
++P + ++ LP +P++E C + DP +DV L A VYYN +G
Sbjct: 324 GNFPYATGYMLNSGEVELPPWPLREACSYLADPTLQAEDDDVLLGALADAIGVYYNATGE 383
Query: 348 AKCFDLNGDSDPHG---LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
CF ++ W WQACTEM M D K +F ++ D A+A C E +
Sbjct: 384 VGCFTPAAGANNASSVDADNWNWQACTEMSMPMSTDGKRDMFWRNDWDPVAQAAQCMEQF 443
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL--KNISKTVLALVEK 462
GV P W E+GG+ + +N++F NG DPWSG GV+ + V ++
Sbjct: 444 GVSPGEGWGAAEYGGYDAWSQV----TNVVFSNGRLDPWSGMGVVDQRRAGGGVEVIMMD 499
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
+ AHH+DL F DPQ + D RR E++ + +W+ +
Sbjct: 500 QAAHHLDLFFEHPLDPQDVLDARRVEMDFVERWVDMAY 537
>gi|301096651|ref|XP_002897422.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
gi|262107113|gb|EEY65165.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
Length = 568
Score = 264 bits (675), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 174/482 (36%), Positives = 253/482 (52%), Gaps = 57/482 (11%)
Query: 60 QILDHFNYNPQSY--------------QTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
Q +DHF++ P T++QRYL+N W S AP+F YTGNEGD+
Sbjct: 98 QRIDHFSWLPAEAVDAADPNAAPSGLPATYKQRYLLNTQFWDPSDKKAPVFFYTGNEGDV 157
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
+A +TG +++ A FKAL+VF EHRYYGKS P+G K GYL+ QALAD
Sbjct: 158 TLYANHTGLIWENAQTFKALVVFAEHRYYGKSFPFGD------KYMDHLGYLTHDQALAD 211
Query: 166 YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
YA LI ++K A + PV+ FGGSYGGML+AWFR+KYP + GA+A+SAPI F +
Sbjct: 212 YAELIYHVQKKYDALNHPVIAFGGSYGGMLSAWFRMKYPSIIAGAIAASAPIYGFGGFPA 271
Query: 226 --PYSFSNIITQDFR---SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC----- 275
+ ++T+D ++NC + +W QI E A+ G L FR+C
Sbjct: 272 FDGQKYWQVVTRDASPAAGAAKNCVPNARKAWPQIFELAQTENGRSTLSSIFRLCEPLTT 331
Query: 276 -KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLN----PLPAFPVKEMCKAI--DDPKT-- 326
+ ++LA+ AF AM ++P PS++L LPA+PV+E C + D P +
Sbjct: 332 EQQGEDLAMSVLF--AFDTLAMGNFPYPSSYLTGGAVDLPAWPVREACSHLAGDFPASTL 389
Query: 327 -----GNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMI---MLTG 378
+ L AA+V++N +G CF + D G+ W +Q CTEM+
Sbjct: 390 RQENVDTKLLEALRDAANVFHNATGDLTCFKIPTLWDYDGI--WDYQYCTEMLPQETYFS 447
Query: 379 GDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNG 438
+ + +F ++ +C+ + P PN I +G + R ASNI+F NG
Sbjct: 448 TNGETDMFWSRNTTFEEIRAHCQRDWHTTPDPNGIRVSYGDDML-----RSASNIVFSNG 502
Query: 439 LRDPWSGGGVLKNISKTVLALVE-KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
L DPWS GVL + +VE EGAHH+DL FS +DP + R+ E+ +I KW+
Sbjct: 503 LLDPWSSAGVLHAPQDAKVTIVEIAEGAHHLDLFFSHPKDPPSVIAARKTEIRMIQKWVD 562
Query: 498 QY 499
++
Sbjct: 563 EF 564
>gi|341900697|gb|EGT56632.1| hypothetical protein CAEBREN_05149 [Caenorhabditis brenneri]
Length = 568
Score = 264 bits (674), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 169/478 (35%), Positives = 247/478 (51%), Gaps = 48/478 (10%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y LDHF + +TF R + N+T + K PIF YTGNEG +E F + TG M+
Sbjct: 48 YKGMRLDHFTWGDT--RTFDLRVMWNNTFY---KEGGPIFFYTGNEGGLESFEKATGMMF 102
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
D+AP F A ++F EHR+YG++ P+G + +YKN + GYL+S QALADYA L+ +LK++
Sbjct: 103 DLAPMFNAAIIFAEHRFYGQTQPFGKD---SYKNLANIGYLTSEQALADYAELLTELKRD 159
Query: 177 LT------ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF-DNIVSPYSF 229
+ D+PV+ FGGSYGGML+AWFR KYPH+ GA A SAP++ D V P +F
Sbjct: 160 NNRMGKTFSQDTPVISFGGSYGGMLSAWFRQKYPHLVKGAWAGSAPLIYMHDGGVDPGAF 219
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLE--------KLQKAFRICKSEKNL 281
NI ++ + N Y ++ +W + G + KL I +
Sbjct: 220 DNITSRTYVDRGCNRY-ILANAWNATIRLSSTDAGRQWLNNNNVFKLDPRTPIKTAADGW 278
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG---NDVFAKLYGAA 338
+ S+L A Y AM DYP P+ FL PLPA+PV C ++ T D+ + + AA
Sbjct: 279 NLNSYLREAIEYMAMVDYPYPTGFLEPLPAWPVDAACGYMNATGTSFSDQDLVSAVANAA 338
Query: 339 SVYYNYSGTAK---CFD--LNGDSDPHGLSE----WGWQACTEMI--MLTGGDNKDSIFE 387
++YYNY+ A C D + GD GL + W WQ C+E+I M G + D +
Sbjct: 339 NIYYNYNKNASFTYCIDYSICGDQGTGGLGDDQLGWPWQECSEIIMGMCARGGSNDVFWN 398
Query: 388 ESEED-YDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLK---RFASNIIFFNGLRDPW 443
E ++ YD + C +G NW T + + + +SN+I G DPW
Sbjct: 399 ECPDNIYDDLKQGCISIFG---SMNWTTANWNIDAVKTLYGYDLSGSSNLILTQGHLDPW 455
Query: 444 SGGGVLK---NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
SGGG N ++ + + AHH+DLR DP + + R + V I+ W+ +
Sbjct: 456 SGGGYTADQTNAARGIYVMEIPGSAHHLDLRTPNTCDPNTVTNARFQIVSILKCWVDK 513
>gi|449443602|ref|XP_004139566.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 325
Score = 262 bits (669), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 141/322 (43%), Positives = 198/322 (61%), Gaps = 17/322 (5%)
Query: 131 HRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGS 190
HR+YGKSIP+G + E A KN S GY +S QALADYA L++ +KK SP++V G S
Sbjct: 16 HRFYGKSIPFG-SLEKAMKNGSIRGYFNSAQALADYAELLLHIKKMFAYDTSPIIVMGAS 74
Query: 191 YGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKG 250
YGGMLA+WFRLKYPH+A+GALASSAPIL FDNI + +I+++ F+ S+ C+ I+
Sbjct: 75 YGGMLASWFRLKYPHIALGALASSAPILYFDNITPQDGYYSIVSKSFKETSKTCHDTIRR 134
Query: 251 SWKQIEETAKKP-GGLEKLQKAFRIC-KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNP 308
SW +I+ A K GGL L K F+ C K + + I++ + + F A + P +
Sbjct: 135 SWGEIDRIAGKTRGGLSILSKQFKTCGKLKTSSEIKNLMDSVFTMAAQYNDPYEN----- 189
Query: 309 LPAFPVKEMCKAID-DPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGW 367
PV+ +C AID + K ++V ++ Y ++ +DP L+++GW
Sbjct: 190 ----PVRGICVAIDEEAKKKSNVIKQVVAGVIAYLGERPCYDVYEFGYPNDP--LNQYGW 243
Query: 368 QACTEMIMLTG--GDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLV 425
Q C+EM+M G G +K+S+F S ++ CK+ YGV PRP+WITT +GG I LV
Sbjct: 244 QVCSEMVMPIGSSGRDKNSMFPPSPFQFNDFKTMCKDLYGVTPRPHWITTFYGGQDIKLV 303
Query: 426 LKRFASNIIFFNGLRDPWSGGG 447
L RF SNIIF NGL+DP+S GG
Sbjct: 304 LHRFGSNIIFSNGLKDPYSSGG 325
>gi|290981802|ref|XP_002673620.1| predicted protein [Naegleria gruberi]
gi|284087205|gb|EFC40876.1| predicted protein [Naegleria gruberi]
Length = 503
Score = 261 bits (666), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 165/503 (32%), Positives = 258/503 (51%), Gaps = 57/503 (11%)
Query: 42 LISSSKDSQGL--YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNA------ 93
LISS SQ + Y+T + TQ LDHF++ + TF QRY + D + S ++
Sbjct: 16 LISSQCISQDIPPYQTGFFTQRLDHFDFT--NIATFPQRYFVCDLYVKHSTRSSVIVDDN 73
Query: 94 ---------PIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNK 144
PI Y GNEG +E F +NTG ++++A + AL++FIEHR+YGK+IP
Sbjct: 74 NLIQIDPFIPIIAYPGNEGALEEFYENTGLVFELAKYYGALVIFIEHRFYGKTIP----- 128
Query: 145 EIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYP 204
N YL+ QA D A + +P+++ GGSYGG LAAW R K+P
Sbjct: 129 ----PNQDPQRYLTIEQATHDLAVFFTENFGLDEKRKNPIILVGGSYGGDLAAWMRFKFP 184
Query: 205 HVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSE-------NCYKVIKGSW----K 253
H+ G++A+SAPIL F+ I PY + I T+ +R+++ C +K + K
Sbjct: 185 HLIDGSIAASAPILFFNGITPPYLAAQIATEAYRNLTNFQIYPKMTCDSAVKKGFEFLSK 244
Query: 254 QIEETAKKPGGLEKLQKAFRICKSEK-NLAIE---SWLSTAFVYTAMTDYPTPSNFLNPL 309
E T K L+ L + FR+C K NL ++ S+++ +F A +YP P+NF N L
Sbjct: 245 YFESTTSKE-QLQMLSRKFRLCNEMKSNLEVKVLASYIAFSFEVLAQANYPYPTNFFNNL 303
Query: 310 PAFPVKEMCKAI-------DDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGL 362
PA+PV +C +I + ++ + F L+ +++ NY+G CF+ +
Sbjct: 304 PAWPVNGLCTSIAKHLATSPNLESEDLYFTILFDGVNLFQNYTGDKSCFNTSNLGGGLQW 363
Query: 363 SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKI 422
+ W Q C EMI+ +G +F + + + + C Y +P+P W+ T FGG +
Sbjct: 364 NSWSLQLCNEMIIPSGFYPSTDMFFSNPYNLKVQMKACMSKYKFNPQPYWLATYFGGKR- 422
Query: 423 GLVLKRFASNIIFFNGLRDPWSGGGVLKNI--SKTVLALVEKEGAHHVDLRFSTKEDPQW 480
L SNIIF NG D G V K + S +++ + ++G HH+D+R+S DPQ
Sbjct: 423 --ALTE-HSNIIFSNGQYDAVRAGSVEKGMKTSPSIIPIFIEQGGHHLDIRWSNPNDPQS 479
Query: 481 LKDVRRREVEIIGKWISQYFQDL 503
+K R E + +G WI ++ L
Sbjct: 480 VKIAREIEFKYVGIWIQKFLNSL 502
>gi|125576268|gb|EAZ17490.1| hypothetical protein OsJ_33021 [Oryza sativa Japonica Group]
Length = 259
Score = 258 bits (658), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 124/245 (50%), Positives = 164/245 (66%), Gaps = 5/245 (2%)
Query: 264 GLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD 323
G +L + F +CK + I + A +Y +M DYPTPSNFL LPA+PV+E+C+AID
Sbjct: 10 GRARLNQTFNMCKGNVD-DIPGLVEKALIYGSMMDYPTPSNFLTSLPAYPVREICRAIDK 68
Query: 324 PKTGNDVFAKLYGAASVYYNYSGTAKCFDLNG--DSDPHGL-SEWGWQACTEMIM-LTGG 379
P +GND +++ A ++YYN +G CF G D DP+G+ W WQACTE+IM ++ G
Sbjct: 69 PTSGNDTVSRIKDAMTIYYNSTGGLACFPGAGAEDDDPYGMFPGWTWQACTEVIMTMSYG 128
Query: 380 DNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGL 439
++F + A C GV PRP+WI + FGG+ I VLKR SNIIFFNGL
Sbjct: 129 IGNATVFPPDPFNLTAYLAGCLATTGVPPRPHWIQSYFGGYDIRNVLKRSGSNIIFFNGL 188
Query: 440 RDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
RDPWS GG+LK+IS +++ALVE +G HHVDLRFSTKEDP+WLK VRR+E+ II W+ QY
Sbjct: 189 RDPWSAGGILKSISNSIIALVEPKGGHHVDLRFSTKEDPEWLKKVRRQEMRIIADWLKQY 248
Query: 500 FQDLA 504
+ D A
Sbjct: 249 YSDEA 253
>gi|242096524|ref|XP_002438752.1| hypothetical protein SORBIDRAFT_10g025540 [Sorghum bicolor]
gi|241916975|gb|EER90119.1| hypothetical protein SORBIDRAFT_10g025540 [Sorghum bicolor]
Length = 306
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 129/231 (55%), Positives = 158/231 (68%), Gaps = 15/231 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW-------GGSKNNAPIFVYTGNEGDI 105
+ Y Q LDHF + P + F Q+YL+NDT W GGS P+FV+TG E DI
Sbjct: 84 FTEHYFPQELDHFTFRPNASTVFYQKYLVNDTFWRRSSGRKGGS--TGPLFVFTGGETDI 141
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
E A N GFM+D+APKF ALLVFIEHR+YG+S+P+ N A GYL+STQALAD
Sbjct: 142 ESIAINAGFMFDIAPKFGALLVFIEHRFYGESMPFRSNSTEAL------GYLTSTQALAD 195
Query: 166 YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
+A LI LK+NL+A +PVVVFGGSYGGMLA+WFRLKYPHV IGALASSAPIL FD I
Sbjct: 196 FAILITSLKQNLSAETAPVVVFGGSYGGMLASWFRLKYPHVTIGALASSAPILQFDYITP 255
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
SF ++++QD++S S NC+ VIK +W +EE GL +L K FR CK
Sbjct: 256 WSSFYDVVSQDYKSESLNCFSVIKAAWDVLEERGSNDNGLLELSKLFRACK 306
>gi|348676451|gb|EGZ16269.1| hypothetical protein PHYSODRAFT_560767 [Phytophthora sojae]
Length = 574
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 175/506 (34%), Positives = 257/506 (50%), Gaps = 53/506 (10%)
Query: 34 ITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNY------NPQSY--------QTFQQRY 79
+ + L + + S G + Q +DHF++ +P + T++QRY
Sbjct: 78 VHTQHLVATADIPRSSLGTTSPLWFEQRIDHFSWLAAEALDPSNAGAAPSGLPATYKQRY 137
Query: 80 LINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIP 139
L+N W AP+F YTGNEGD+ +A +TG +++ A FKAL+VF EHRYYGKS P
Sbjct: 138 LLNTQFWDPKDKKAPVFFYTGNEGDVTLYANHTGLIWENAKAFKALVVFAEHRYYGKSFP 197
Query: 140 YGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWF 199
+G K Y++ QALADY LI L+K A + PV+ FGGSYGGML+AWF
Sbjct: 198 FGD------KYMDHLAYVTHDQALADYTELIYHLQKKYDAFNHPVIAFGGSYGGMLSAWF 251
Query: 200 RLKYPHVAIGALASSAPILNFDNIVS--PYSFSNIITQDFRSV---SENCYKVIKGSWKQ 254
R+KYP++ GA+A+SAPI F + + ++T+D + NC K SW Q
Sbjct: 252 RMKYPNIIAGAIAASAPIYGFGGFPAFDGQKYWQVVTRDASPAAGSAANCVPNAKKSWAQ 311
Query: 255 IEETAKKPGGLEKLQKAFRIC----KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLN--- 307
I E AK G L FR+C E+ + + AF AM D+P PS++L
Sbjct: 312 IFELAKTEDGRATLSSLFRLCTPLASEEQGEDLAMSVLFAFDTLAMGDFPYPSSYLTGGA 371
Query: 308 -PLPAFPVKEMCKAIDDP-------KTGND--VFAKLYGAASVYYNYSGTAKCFDLNGDS 357
LPA+PV++ C + K G D + L AA+V++N + CF +
Sbjct: 372 VDLPAWPVRQACSHLAGEFPTPSLRKDGVDTTLLEALRNAANVFHNATKDLACFKIPTLW 431
Query: 358 DPHGLSEWGWQACTEMI---MLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWIT 414
D G+ W +Q CTEM+ + + +F ++ +C+ + P + I
Sbjct: 432 DYDGI--WDYQYCTEMLPQETYFSTNGETDMFWPRNTTFEEIRAHCQRDWHTTPDQDGIR 489
Query: 415 TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE-KEGAHHVDLRFS 473
+G + R ASNI+F NGL DPWS GVL + +VE EGAHH+DL FS
Sbjct: 490 VSYGDEML-----RSASNIVFSNGLLDPWSSAGVLHAPKDAKVTIVEIAEGAHHLDLFFS 544
Query: 474 TKEDPQWLKDVRRREVEIIGKWISQY 499
+DP + R+ EV++I +WI ++
Sbjct: 545 HPKDPPSVIAARKTEVKMIHQWIDEF 570
>gi|432102770|gb|ELK30246.1| Lysosomal Pro-X carboxypeptidase [Myotis davidii]
Length = 353
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 130/281 (46%), Positives = 176/281 (62%), Gaps = 12/281 (4%)
Query: 39 LSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVY 98
LS+ +S Y Y Q +DHF + ++ +TF QRYLI D HW K I Y
Sbjct: 33 LSTSFTSRSSVVMKYSIHYFEQKIDHFGF--KNDKTFNQRYLIADQHW--RKEGGSILFY 88
Query: 99 TGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLS 158
TGNEGDI WF NTGFM+DVA + KA+LVF EHRYYG+S+P+G + +++++ +L+
Sbjct: 89 TGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGQSLPFGAD---SFQDSRHLNFLT 145
Query: 159 STQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
S QALAD+A LI LK+ + A + P + GGSYGGMLAAWFR+KYPH+ +GALA+SAPI
Sbjct: 146 SEQALADFAELIKHLKRTIPGAENQPFIALGGSYGGMLAAWFRMKYPHIVVGALAASAPI 205
Query: 218 LNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS 277
F++IV F I+T DF+ NC + I+ SW I + GL+ L +A +C
Sbjct: 206 WQFEDIVPCGVFMKIVTTDFKRSGPNCSESIQSSWDAINRLTRNGTGLDWLSEALHLCTP 265
Query: 278 EKNLA----IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPV 314
KN ++ W+S +V AM DYP SNFL PLPA+P+
Sbjct: 266 LKNAQDVQHLKDWISETWVNLAMVDYPYESNFLQPLPAWPI 306
>gi|116242320|gb|ABJ89817.1| lysosomal pro-X carboxypeptidase [Clonorchis sinensis]
Length = 434
Score = 254 bits (649), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 157/416 (37%), Positives = 222/416 (53%), Gaps = 20/416 (4%)
Query: 93 APIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAS 152
PI YTGNEG IE FA+NTGFM+++A + KA ++F EHR+YG S+P+ + ++K+
Sbjct: 1 GPILFYTGNEGAIETFAENTGFMWEIAEELKAAVLFAEHRFYGSSLPFVND---SFKDPQ 57
Query: 153 TTGYLSSTQALADYASLIIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGAL 211
GYL++ QALADYASL+ LK ++ +SPV+ FGGSYGGML+AWFR KYP++ GA+
Sbjct: 58 HFGYLTAEQALADYASLVQYLKSSVKDFENSPVIAFGGSYGGMLSAWFRYKYPNLIAGAI 117
Query: 212 ASSAPILNFDNIVSPYSFSNIITQDFR-SVSENCYKVIKGSWKQIEETAKKPGGLEKLQK 270
A+SAPI F N+ + F + T+ F S S C K + W I AK+ G E L+
Sbjct: 118 AASAPIWLFPNMSNCAGFYDTTTRAFSTSGSTVCTKNVALVWDSIRTVAKQHSGHELLRL 177
Query: 271 AFRICK---SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG 327
F++C E+ L +L AM +YP ++F+ P PVK CK + D
Sbjct: 178 MFQLCDPLPDEQKLI--DYLIDFLGTLAMVNYPYEASFIGTFPGEPVKYFCKGLSDAVHR 235
Query: 328 NDVFAKLYG---AASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDS 384
+ + A NY+ C L GD W Q C EM N +
Sbjct: 236 DVDVDVVQRVATAVRSLTNYTKNQSCISLEGDLPGLDAKAWTLQTCLEMTTPMCS-NGEG 294
Query: 385 IFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS 444
+F E D ++ C + + V PR NW EF G I + A+NI+F NG DPWS
Sbjct: 295 MFPSLEWDPVVFSQSCFDKFAVRPRLNWSAVEFWGKNI-----KTATNIVFSNGDLDPWS 349
Query: 445 GGGVLKNISKTVLALVE-KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
GVL + ++ GAHH+DLR + DP + D R+RE++ I WI ++
Sbjct: 350 AFGVLTDDQAPGCNVIRIPSGAHHLDLRAKNELDPADVVDARQRELQHIKDWIDEW 405
>gi|241161686|ref|XP_002408972.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215494441|gb|EEC04082.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 360
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 139/308 (45%), Positives = 182/308 (59%), Gaps = 14/308 (4%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT--GFMYDVA 119
+DHF Y + TF+ RYL+ D +W + PIF YTGNE DIE FA + G M++ A
Sbjct: 20 VDHFGY--ANNDTFKMRYLVADQYW--DHDGGPIFFYTGNEADIEVFANKSYSGLMWEWA 75
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT- 178
P+FKALL+F EHRYYGKS+PYG ++K S GYL++ QALADYA L+ K ++
Sbjct: 76 PEFKALLIFAEHRYYGKSMPYGNE---SFKGPSRHGYLTAEQALADYADLLTHFKADVPG 132
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
A DS VV FGGSYGGMLAAWFRLKYPHV ALASSAPIL F + +FS ++T+ F
Sbjct: 133 AGDSKVVSFGGSYGGMLAAWFRLKYPHVTTAALASSAPILQFTGMTPCNAFSEVVTKAFA 192
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAFVYTA 295
S C I+ S++ I + A G + L+K FR+CK + + W+ F Y A
Sbjct: 193 KESNQCTNAIRTSFELIRKQAATEEGAKALKKQFRLCKPLAPSNDTVLRDWIRNVFAYLA 252
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG-NDVFAKLYGAASVYYNYSGTAKCFDLN 354
M +YP S P P PVKE CK + T + +Y A SV NY+G C DL+
Sbjct: 253 MVNYPYASKLTLPAPGHPVKEACKFLKKNFTDVQSLLDGIYRAISVLTNYTGKIHCNDLS 312
Query: 355 GDSDPHGL 362
++ G+
Sbjct: 313 DNAGTPGI 320
>gi|268575348|ref|XP_002642653.1| C. briggsae CBR-PCP-1.2 protein [Caenorhabditis briggsae]
Length = 564
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 166/477 (34%), Positives = 237/477 (49%), Gaps = 50/477 (10%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y LDHF + +TF R + N+T + + PIF YTGNEG + F TG M+
Sbjct: 44 YKNMRLDHFTWGDT--RTFDMRIMWNNTFY---QPGGPIFFYTGNEGAVSTFEVATGMMF 98
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
D+AP F A ++F EHR+YG + P+G +Y N + GYL+S QALADYA L+ +LK++
Sbjct: 99 DLAPMFNASIIFAEHRFYGATQPFGNQ---SYANLANVGYLTSEQALADYAELLTELKRD 155
Query: 177 LTA------TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF-DNIVSPYSF 229
DS V+ FGGSYGGML+AWFR KYPH+ GA A SAP++ D V P +F
Sbjct: 156 NNQFGKTFHRDSQVISFGGSYGGMLSAWFRQKYPHIVKGAWAGSAPLIYMHDGGVDPGAF 215
Query: 230 SNIITQDFRSVSENCYK-VIKGSWKQIEETAKKPGG--------LEKLQKAFRICKSEKN 280
NI ++ + V C + ++ +W + + G + KL I
Sbjct: 216 DNITSRTY--VENGCNRFILANAWNAVLNLSSTDSGRAWLNNNPVFKLDPRTPINNQTDG 273
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMC---KAIDDPKTGNDVFAKLYGA 337
+ +++ A Y AM DYP P+ FL PLP +PV C A + + D+ + A
Sbjct: 274 WNLNAYMREAIEYMAMVDYPYPTGFLEPLPGWPVTVACGYMNATGESFSDQDLVTAVANA 333
Query: 338 ASVYYNYSGTAK---CFDLN--GDSDPHGLSE----WGWQACTEMIM---LTGGDNKDSI 385
A+VYYNY+ A C D N GD GL + W WQ C+E+IM +GG N
Sbjct: 334 ANVYYNYNQNANFTWCIDFNICGDQGTGGLGDDALGWPWQECSEIIMAMCASGGANDVFW 393
Query: 386 FEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLK---RFASNIIFFNGLRDP 442
E + YD + C +G W T + + + +SN+I G DP
Sbjct: 394 SECGDNIYDTLKQGCVSIFG---SMKWTTANWNIDAVKTLYGYDLSGSSNLILTQGHLDP 450
Query: 443 WSGGGVLKNISKTVLALVEKE---GAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
WSGGG + + T + E AHH+DLR DP + + R + V+I+ W+
Sbjct: 451 WSGGGYKVDQTNTARGIYVMEIPGSAHHLDLRQPNTCDPNTVVNARYQIVQILKCWV 507
>gi|260829575|ref|XP_002609737.1| hypothetical protein BRAFLDRAFT_78548 [Branchiostoma floridae]
gi|229295099|gb|EEN65747.1| hypothetical protein BRAFLDRAFT_78548 [Branchiostoma floridae]
Length = 422
Score = 252 bits (643), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 147/425 (34%), Positives = 227/425 (53%), Gaps = 40/425 (9%)
Query: 12 LLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQS 71
L+ +++++ A +F TFP +S+++D ++T+Y Q LDHFN+
Sbjct: 7 LVVRHSVSLAVAVLFVTFPH-----------VSATRDPP--HETRYFNQYLDHFNFASHG 53
Query: 72 YQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEH 131
+TFQ+R L++D W + PIF YTGNEG I GF+ D+A KF+AL+VF+EH
Sbjct: 54 AETFQERVLVSDAFW---RKEGPIFFYTGNEGPITSIWNEVGFIKDLAEKFEALIVFVEH 110
Query: 132 RYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATD--SPVVVFGG 189
RYYG+S+P+G E + N G L+ QALADYA LI +L + PV+ FGG
Sbjct: 111 RYYGESLPFG---ETTF-NKENMGLLTVEQALADYAVLITNLTASYCEDPDVCPVIAFGG 166
Query: 190 SYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIK 249
SYGG+L+A+ RLKYP++ GALASSA + + +T+DFR + C + ++
Sbjct: 167 SYGGVLSAFMRLKYPNLVAGALASSANVYMSAGLTPGNELFQDVTEDFRRYNPRCPERVR 226
Query: 250 GSWKQIEETA-KKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAMTDYPTPSN 304
+ ++E A + GL ++ R+C ++ A + W+ AF AM D P +
Sbjct: 227 EGFAEMERLAGQGKQGLHEISSRMRLCSPLQHHADLVNMYRWVREAFTVLAMEDLPYSIS 286
Query: 305 FLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGD----SDPH 360
LPA+PV C + +D + A + YN++ CFDL+ D +DP
Sbjct: 287 NGPSLPAYPVNASCDLL---LKASDGIEGILQAVGMLYNFTSNLTCFDLHRDFVPCADPT 343
Query: 361 GLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWIT 414
G S W +Q CTE+ +L +N +F + RA +C++ +GV PRP W++
Sbjct: 344 GCSLMPGAQAWDYQTCTEISLLESTNNVTDMFPPDAFTEETRAVHCRQRWGVTPRPGWLS 403
Query: 415 TEFGG 419
T+F G
Sbjct: 404 TQFWG 408
>gi|268575350|ref|XP_002642654.1| C. briggsae CBR-PCP-1.1 protein [Caenorhabditis briggsae]
Length = 512
Score = 251 bits (642), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 166/477 (34%), Positives = 238/477 (49%), Gaps = 50/477 (10%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y LDHF + +TF R + N+T++ + PIF YTGNEG + F TG M+
Sbjct: 14 YKGMRLDHFTWG--DTRTFDLRIMWNNTYY---QPGGPIFFYTGNEGAVSTFEVATGMMF 68
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
D+AP F A ++F EHR+YG + P+G +Y N + GYL+S QALADYA L+ +LK++
Sbjct: 69 DLAPMFNASIIFAEHRFYGATQPFGNQ---SYANLANVGYLTSEQALADYAELLTELKRD 125
Query: 177 LTA------TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF-DNIVSPYSF 229
DS V+ FGGSYGGML+AWFR KYPH+ GA A SAP++ D V P +F
Sbjct: 126 NNQFGKTFHRDSQVISFGGSYGGMLSAWFRQKYPHIVKGAWAGSAPLIYMHDGGVDPGAF 185
Query: 230 SNIITQDFRSVSENCYK-VIKGSWKQIEETAKKPGG--------LEKLQKAFRICKSEKN 280
NI ++ + V C + ++ +W + + G + KL I
Sbjct: 186 DNITSRTY--VENGCNRFILANAWNAVLNLSSTDSGRAWLNNNPVFKLDPRTPINNQTDG 243
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMC---KAIDDPKTGNDVFAKLYGA 337
+ +++ A Y AM DYP P+ FL PLP +PV C A + + D+ + A
Sbjct: 244 WNLNAYMREAIEYMAMVDYPYPTGFLEPLPGWPVTVACGYMNATGESFSDQDLVTAVANA 303
Query: 338 ASVYYNYSGTAK---CFDLN--GDSDPHGLSE----WGWQACTEMIM---LTGGDNKDSI 385
A+VYYNY+ A C D N GD GL + W WQ C+E+IM +GG N
Sbjct: 304 ANVYYNYNQNANFTWCIDFNICGDQGTGGLGDDALGWPWQECSEIIMAMCASGGANDVFW 363
Query: 386 FEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLK---RFASNIIFFNGLRDP 442
E + YD + C +G W T + + + +SN+I G DP
Sbjct: 364 SECGDNIYDTLKQGCVSIFG---SMKWTTANWNIDAVKTLYGYDLSGSSNLILTQGHLDP 420
Query: 443 WSGGGVLKNISKTVLALVEKE---GAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
WSGGG + + T + E AHH+DLR DP + + R + V+I+ W+
Sbjct: 421 WSGGGYKVDQTNTARGIYVMEIPGSAHHLDLRQPNTCDPNTVVNARYQIVQILKCWV 477
>gi|118396082|ref|XP_001030384.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89284685|gb|EAR82721.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 495
Score = 251 bits (642), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 158/473 (33%), Positives = 244/473 (51%), Gaps = 37/473 (7%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
YKT Y Q ++H + TF+Q+YL+ D + K PI Y GNEG IE F NT
Sbjct: 20 YKTYYFDQKVNHEGFEMNDL-TFKQKYLVKDDFYRYDK--GPILFYCGNEGPIEMFYNNT 76
Query: 113 GFM-YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
GF + +A + L+VF+EHRY+G+S P+G +E K YL+S QAL DY +
Sbjct: 77 GFQTHTLAKELNGLVVFMEHRYFGESWPFGNEEESLKK--GNNKYLTSLQALNDYVVFLN 134
Query: 172 DLKKNLTATDS--PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI--VSPY 227
KK+L D PV+ GGSYGGMLAAW R+K+P+V +LA+SAPI F N ++
Sbjct: 135 WFKKSLGCADDECPVIAIGGSYGGMLAAWIRMKFPNVVDASLAASAPIYQFLNREGLNQT 194
Query: 228 SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGG-----------LEKLQKAFRICK 276
F +IIT+++ C I +++ + P + +A C+
Sbjct: 195 LFYSIITRNY--AQNGCSDKIHQAYQYLTNIIDSPMSTKYFKYQYDSIFANISQAMNTCE 252
Query: 277 SEKN----LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMC---KAIDDPKTGND 329
N + +++ TA+ Y AMT+YP S+FL +PA+P C +A++ T +
Sbjct: 253 PITNSTGLTQLRTYMDTAYSYMAMTNYPQASSFLRSMPAWPANASCIPMEAVNSNSTVFE 312
Query: 330 VFAKLYGAASVYYNYSGTAKCFDL----NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSI 385
+F+ + + +YNY +A C D+ +G SD + +S W AC++M++ + K +
Sbjct: 313 LFSAIKLSTDTFYNYDQSANCSDISQGDDGASD-NDMSGWNILACSDMVLPMASNGKTDM 371
Query: 386 FEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSG 445
F +++ +C YGV P +W +GG L ++ F SNI F NG+ DPWSG
Sbjct: 372 FYNQPWNFEQYKEWCNYTYGVTPNYDWALDFYGGRN-DLEMENF-SNIFFSNGMLDPWSG 429
Query: 446 GGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
G + +S+ + AHH DLR + DP+ + R E++ + KWI
Sbjct: 430 GSPTEYLSEDLPTNYMYASAHHNDLRLPQEGDPESVVQGRELEIKYLKKWIQH 482
>gi|118487656|gb|ABK95653.1| unknown [Populus trichocarpa]
Length = 213
Score = 251 bits (641), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 116/210 (55%), Positives = 150/210 (71%), Gaps = 1/210 (0%)
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNG 355
M +YPT +NF+ PLPA+PV+ MCK ID +G +++ AAS+YYNYS KCF L
Sbjct: 1 MVNYPTEANFMMPLPAYPVQAMCKIIDGFPSGASKITRVFAAASLYYNYSRAEKCFKLEH 60
Query: 356 DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITT 415
D HGL W WQACTEM+M +++S+F S Y A C + +GV PRP+WITT
Sbjct: 61 GPDAHGLHGWNWQACTEMVMPMTC-SEESMFPTSSFSYKEFAEDCMKTFGVKPRPHWITT 119
Query: 416 EFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTK 475
EFGG +I LVLKR NIIF NG++DPWS GGVLKNIS +++ALV ++GAHHVD R +TK
Sbjct: 120 EFGGKRIDLVLKRSGGNIIFSNGMQDPWSRGGVLKNISSSIIALVTEKGAHHVDFRSATK 179
Query: 476 EDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+DP+WLK++RR+EVEII WI QY+ DL +
Sbjct: 180 DDPEWLKELRRQEVEIIQGWIDQYYPDLKR 209
>gi|403340721|gb|EJY69653.1| Lysosomal carboxypeptidase [Oxytricha trifallax]
Length = 477
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 168/462 (36%), Positives = 237/462 (51%), Gaps = 25/462 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+TK +DHF + TF+ RYLIND + G PI Y GNEG I F N+
Sbjct: 15 YQTKQFDADIDHFTTQGSTTNTFKLRYLINDKYVTG-PGPWPILFYCGNEGIITDFYDNS 73
Query: 113 GFMYDVAPKFKAL-LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
GF+ +VF EHRYYG+S+P+G + ++K + +L+ QA+ DY L+
Sbjct: 74 GFVTTTLATATNALVVFAEHRYYGQSMPFGKD---SFKPGNVN-FLTIDQAMMDYVKLLQ 129
Query: 172 DLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
+K + T+SPV+ FGGSYGGM+AAW R++YP + GA ASSAPIL F VSPY+F+
Sbjct: 130 FIKASDNRFTNSPVIAFGGSYGGMIAAWIRMRYPQIIYGAHASSAPILFFPGTVSPYAFN 189
Query: 231 NIITQDFRSVSEN--CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWL- 287
+ T+ ++S +++ C I+ +K + + A K++ F C + + +L
Sbjct: 190 ELATRSYQSATQDGRCAANIQYGFKILNQWAADNTTYAKIKDYFNACVAPASTDDIQFLL 249
Query: 288 ---STAFVYTAMTDYPTPSNFLNPLPAFPVKEMC-KAIDDPKTGNDVFAKLYGAASVYYN 343
S A A +YP +NF LPA PV+ C K D K+ +D + K G A +
Sbjct: 250 GEISDALGTMAQVNYPYDTNFTRFLPANPVQTACTKGAVDQKSDDDGYVK--GLAQAFMV 307
Query: 344 YSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEA 403
Y G KC D G S WG+Q C EM+M K +F D D A C +
Sbjct: 308 YHGD-KCVSFKPDPS-DGTSGWGYQVCNEMVMPIAQSGKTDMFLPQPWDPDQFASDC-AS 364
Query: 404 YGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI----SKTVLAL 459
G+ P+ ++I FGG L SNIIF NG DPW GGVL K V+ L
Sbjct: 365 MGLKPQFDFILDSFGGRNTNLDFAH-VSNIIFSNGDLDPWRAGGVLPGTLAKNDKIVVRL 423
Query: 460 VEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
+ K AHH++LR DPQ + D R +I WI+ Y++
Sbjct: 424 I-KNSAHHLELRLPNAADPQDVTDARTAFTTVITGWIADYWK 464
>gi|449467104|ref|XP_004151265.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like, partial [Cucumis
sativus]
Length = 359
Score = 247 bits (631), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 140/329 (42%), Positives = 197/329 (59%), Gaps = 18/329 (5%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
S+ + S GL T Y+ Q LDHFNY PQSY TF QRY+I+ +W G PIF Y G E
Sbjct: 45 STPQQSDGL-ATFYYKQPLDHFNYQPQSYVTFDQRYIIDFKYWEGINPKTPIFAYLGAES 103
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
DI+ GF A ++KA+ V++EHR+YGKSIP+G + E A KN S GY +S QAL
Sbjct: 104 DIDNDVPYVGFPLRFASQYKAMSVYLEHRFYGKSIPFG-SLEKAMKNGSIRGYFNSAQAL 162
Query: 164 ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
ADYA L++ +KK SP++V G SYGGMLA+WFRLKYPH+A+GALASSAPIL FDNI
Sbjct: 163 ADYAELLLHIKKMFAYDTSPIIVMGASYGGMLASWFRLKYPHIALGALASSAPILYFDNI 222
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETA-KKPGGLEKLQKAFRIC-KSEKNL 281
+ +I+++ F+ S+ C+ I+ SW +I+ A K GGL L K F+ C K + +
Sbjct: 223 TPQDGYYSIVSKSFKETSKTCHDTIRRSWGEIDRIAGKTRGGLSILSKQFKTCGKLKTSS 282
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID-DPKTGNDVFAKLYGAASV 340
I++ + + F A + P + PV+ +C AID + K ++V ++
Sbjct: 283 EIKNLMDSVFTMAAQYNDPYEN---------PVRGICVAIDEEAKKKSNVIKQVVAGVIA 333
Query: 341 YYNYSGTAKCFDLNGDSDPHG-LSEWGWQ 368
Y G C+D+ P+ L+++GWQ
Sbjct: 334 YL---GERPCYDVYEFGYPNDPLNQYGWQ 359
>gi|444521185|gb|ELV13126.1| Dipeptidyl peptidase 2 [Tupaia chinensis]
Length = 471
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 160/463 (34%), Positives = 240/463 (51%), Gaps = 59/463 (12%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ +Y Q+LDHFN+ +TF+QR L++D W + PIF YTGNEGDI FA N+
Sbjct: 43 FRERYFDQLLDHFNFERFGNRTFRQRVLVSDRFW--RRGEGPIFFYTGNEGDIWTFANNS 100
Query: 113 GFMYDVAPKFKALLVFIEH-RYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
GF+ ++A + ALLVF EH R + +P E+
Sbjct: 101 GFLAELAAQQAALLVFAEHLRGDAQRLP---EDEVPPPGGGGA----------------- 140
Query: 172 DLKKNLTATD-SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
D KK T +P SYGGML+A+ R+KYPH+ GALA+SAP++ + Y F
Sbjct: 141 DSKKGPAVTRVTP------SYGGMLSAYLRMKYPHLVAGALAASAPVIAVAGLGDSYQFF 194
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESW 286
+T DF S S C + ++ +++Q+++ + G E + + F C+ ++L + ++
Sbjct: 195 RDVTTDFTSQSPECAQGVREAFRQMKDLFLQ-GAHETVSREFGTCQPLSGPRDLTQLFAF 253
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI---DDPKTGNDVFAKLYGAASVYYN 343
AF AM DYP P++F+ PA PVK C + P G L + YN
Sbjct: 254 ARNAFTVLAMMDYPYPTSFMGHFPAHPVKVGCARLLREAQPIPG------LRALTGLVYN 307
Query: 344 YSGTAKCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDY 393
SGT C+D+ SDP G W +QACTE+ + +N +F
Sbjct: 308 SSGTEPCYDIYRQYRSCSDPTGCGTGPDAKAWDYQACTEINLTFSSNNVTDMFPALPFTD 367
Query: 394 DARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS 453
R +YC + +GV PR +W+ T F G + + ASNIIF NG DPW+GGG+ +N+S
Sbjct: 368 LQRRQYCLDTWGVWPRHDWLRTSFWGGDL-----KAASNIIFSNGDLDPWAGGGIRRNVS 422
Query: 454 KTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+V+A+ + GAHH+DLR S EDP + VR+ E ++ +W+
Sbjct: 423 ASVIAVTVRGGAHHLDLRASQPEDPASVVQVRQLEATLVREWV 465
>gi|340508129|gb|EGR33905.1| serine carboxypeptidase s28 family protein, putative
[Ichthyophthirius multifiliis]
Length = 429
Score = 244 bits (623), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 140/422 (33%), Positives = 225/422 (53%), Gaps = 21/422 (4%)
Query: 95 IFVYTGNEGDIEWFAQNTGFMYDVAPK-FKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
I Y GNEG IE F +NTGF+ + K KAL++++EHRY+G+S P+G K K
Sbjct: 1 IIFYCGNEGPIEMFYKNTGFVTQILSKELKALVLYMEHRYFGESQPFGDEKTSLQK--GN 58
Query: 154 TGYLSSTQALADYASLIIDLKKNLTATD--SPVVVFGGSYGGMLAAWFRLKYPHVAIGAL 211
YL+S QAL+DY +I +KK+L + P++ GGSYGGMLAAW R+K+P++ +L
Sbjct: 59 NQYLTSIQALSDYVEFLIYIKKSLQCQEKECPIIAVGGSYGGMLAAWIRMKFPNLVDASL 118
Query: 212 ASSAPILNFDNI--VSPYSFSNIITQDF--RSVSENCYKVIKGSWKQIEETAKKPGGLEK 267
A+SAPI F N + + IIT ++ R + Y++++ + + ++ ++
Sbjct: 119 AASAPIFQFLNRENLDQTKYFQIITNNYPCRDKIKTAYQILQNLLNEKNKILEQNNIFQQ 178
Query: 268 LQKAFRICKSEKN----LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMC---KA 320
+ +A +C+ KN L + +++ A+ Y AMT+YP + FL LP +P C +
Sbjct: 179 ISQAMGLCQPLKNNTDVLNLRNYMDNAYSYMAMTNYPQETTFLKHLPPWPANFSCIFFQN 238
Query: 321 IDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDP---HGLSEWGWQACTEMIMLT 377
I + D+F+ + + Y++ C D++ + W +C +M++
Sbjct: 239 ITQQSSVFDLFSAVRNSTRTLYDFDQKNNCADISQADQTVSDDNMEGWDILSCADMVLPM 298
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFN 437
+ K +F S D + + C++AYGV P PNW +GG +K F SNI F N
Sbjct: 299 FSNGKTDMFYNSTWDLETYKQNCRKAYGVSPNPNWALNFYGGRN-DQEMKGF-SNIFFSN 356
Query: 438 GLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G+ DPWSGG + I++ + ++ AHH DLR + DPQ + R+ E+ I KWI
Sbjct: 357 GMLDPWSGGSPTQFINENLPVFYMEQAAHHNDLRLPAQGDPQSVIQGRKLEIFYIKKWIR 416
Query: 498 QY 499
Y
Sbjct: 417 FY 418
>gi|326930125|ref|XP_003211202.1| PREDICTED: dipeptidyl peptidase 2-like [Meleagris gallopavo]
Length = 432
Score = 243 bits (619), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 149/424 (35%), Positives = 222/424 (52%), Gaps = 50/424 (11%)
Query: 90 KNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
K PIF YTGNEGDI FA+N+ F++++A + +AL++F EHRYYGKS+P+G + + K
Sbjct: 21 KGFGPIFFYTGNEGDIWTFAENSDFIFELAEQQQALVIFAEHRYYGKSLPFG-LESMQIK 79
Query: 150 NASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIG 209
N T L+ QALADYA LI +LK+ A PV+ FGGSYGGML+A+ R+KYP+V G
Sbjct: 80 N---THLLTVEQALADYAVLITELKQQYGAAGCPVIAFGGSYGGMLSAYLRMKYPNVVDG 136
Query: 210 ALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQ 269
ALA+SAP+L+ + P F +T DF+ C ++ +++QI + G +++
Sbjct: 137 ALAASAPVLSVAGLGDPTQFFRDVTADFQKSIPGCVTAVRRAFQQIRDLFLS-GAYDEIS 195
Query: 270 KAFRICK---SEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI---D 322
C S+K++ + + AF AM DYP ++F+ LPA PVK C+ I
Sbjct: 196 SKMATCNKISSKKDVYQLFGFARNAFTMIAMMDYPYKTDFMGHLPANPVKVGCEQILAHT 255
Query: 323 DPKTGNDVFAKLYGAASVYYNYSGTAKCFDL----NGDSDPHGLS------EWGWQACTE 372
DP G A L G V+YN SG+ +C+D+ +DP G W +QACTE
Sbjct: 256 DPIQG---LAALVG---VFYNSSGSVQCYDVYQLYRPCADPTGCGTGADAEAWDYQACTE 309
Query: 373 MIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASN 432
+ + +N +F E R +YC + V PR W+ F G + +
Sbjct: 310 INLTFNSNNVTDMFPEMPFTEAMREQYCWSRWRVRPRAQWLQINFWGGDQAIAIN----- 364
Query: 433 IIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEII 492
++S ++ A+ + GAHH+DLR DP + + R+ E II
Sbjct: 365 -----------------SSLSPSLTAVTIQGGAHHLDLRGHNPADPPSVIEARKLEASII 407
Query: 493 GKWI 496
W+
Sbjct: 408 SSWV 411
>gi|308481892|ref|XP_003103150.1| CRE-PCP-1 protein [Caenorhabditis remanei]
gi|308260255|gb|EFP04208.1| CRE-PCP-1 protein [Caenorhabditis remanei]
Length = 564
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 165/487 (33%), Positives = 237/487 (48%), Gaps = 56/487 (11%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y LDHF + +TF R + N+T + K PIF YTGNEG +E F TG M+
Sbjct: 44 YKNMRLDHFTWG--DTRTFDMRVMWNNTFY---KPGGPIFFYTGNEGGLESFVTATGIMF 98
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK- 175
D+AP + A ++F EHR+YG++ P+G N +Y + GYL+S QALADYA L+ +LK+
Sbjct: 99 DLAPMYNASIIFAEHRFYGQTQPFGNN---SYATLANVGYLTSEQALADYAELLTELKRQ 155
Query: 176 ----NLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF-DNIVSPYSF 229
NLT D+ ++ FGGSYGGML+AWFR KYPH+ GA A SAP++ D V P +F
Sbjct: 156 PNQFNLTFQKDTQIISFGGSYGGMLSAWFRQKYPHIVKGAWAGSAPLIYMHDGGVDPGAF 215
Query: 230 SNIITQDFRSVSENCYK-VIKGSWKQIEETAKKPGGLE--------KLQKAFRICKSEKN 280
NI ++ + V C + ++ W + + G + L I
Sbjct: 216 DNITSRTY--VDNGCNRFILANVWNAVLNLSNTDAGRQWLNNNAVFTLDPRTPIRNQTDG 273
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKT---GNDVFAKLYGA 337
+ ++L A Y AM DYP P+ FL PLPA+PV C ++ T + + A
Sbjct: 274 WNLNAYLREAIEYMAMVDYPYPTGFLEPLPAWPVAVACGYMNATGTTFSDQQLVTMVANA 333
Query: 338 ASVYYNYSGTAK---------CFDLNGDSDPHGLSEWGWQACTEMIMLT---GGDNKDSI 385
A++YYNY+ C D W WQ C+E+IM GG N
Sbjct: 334 ANIYYNYNKDPNFKYCIDYSVCGDQGTGGLGGDQLGWPWQECSEIIMAMCARGGSNDVFW 393
Query: 386 FEESEEDYDARARYCKEAYG-VDPRP-NW----ITTEFGGHKIGLVLKRFASNIIFFNGL 439
E YD + C +G + P NW + T +G G +SN+I G
Sbjct: 394 SECGANIYDVLKQECVSIFGSMGWTPSNWNIDAVKTLYGYDLSG------SSNLILTQGH 447
Query: 440 RDPWSGGGV---LKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
DPWSGGG N ++ + L AHH+DLR DP + + R + ++I+ W+
Sbjct: 448 LDPWSGGGYKVDQNNAARGIYVLEIPGSAHHLDLRQPNTCDPNTVTNARFQIIQILNCWV 507
Query: 497 SQYFQDL 503
+ D+
Sbjct: 508 NPNCNDV 514
>gi|426363699|ref|XP_004048972.1| PREDICTED: dipeptidyl peptidase 2 [Gorilla gorilla gorilla]
Length = 495
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 155/469 (33%), Positives = 234/469 (49%), Gaps = 48/469 (10%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ ++ Q LDHFN+ +TF QR+L++D W + PIF YTG A +
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFW--VRGEGPIFFYTGTR------ASGS 82
Query: 113 GFMYDVA-PKFKALLVFIEHRYYGKSIPYGG-NKEIAYKNASTTGYLSSTQALADYASLI 170
GF VA P+ + L G + E + G S + +
Sbjct: 83 GFRLSVASPRKPSHLSSQPAAQLGCAATLAAFPSESCTRTWPRVGICGSQKCRPPCGTCG 142
Query: 171 IDLKKN---------LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFD 221
+K LT SP SYGGML+A+ R+KYPH+ GALA+SAP+L
Sbjct: 143 PLRRKRGNEVLGAAALTLCPSP------SYGGMLSAYLRMKYPHLVAGALAASAPVLAVA 196
Query: 222 NIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SE 278
+ F +T DF S C + ++ +++QI++ + G + +++ F C+ E
Sbjct: 197 GLGDSNQFFRDVTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRREFGTCQPLSDE 255
Query: 279 KNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA 337
K+L + + AF AM DYP P++FL PLPA PVK C D + L
Sbjct: 256 KDLTQLFMFARNAFTVLAMMDYPYPTDFLGPLPANPVKVGC---DRLLSEAQRITGLRAL 312
Query: 338 ASVYYNYSGTAKCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFE 387
A + YN SG+ C+D+ + +DP G W +QACTE+ + +N +F
Sbjct: 313 AGLVYNASGSEHCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFP 372
Query: 388 ESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
+ + R RYC + +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG
Sbjct: 373 DLPFTDELRQRYCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGG 427
Query: 448 VLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ +N+S +V+A+ + GAHH+DLR S EDP + + R+ E +IG+W+
Sbjct: 428 IQRNLSASVIAVTIQGGAHHLDLRASHPEDPASVVEARKLEATVIGEWV 476
>gi|268619142|gb|ACZ13347.1| prolyl carboxy peptidase-like protein [Bursaphelenchus xylophilus]
Length = 401
Score = 238 bits (608), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 143/333 (42%), Positives = 200/333 (60%), Gaps = 32/333 (9%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF + + F RY IN T++ + PIF YTGNEG +E FA+NTGF++D+AP+
Sbjct: 52 IDHFAF--ADTREFPLRYFINLTYY---EPGGPIFFYTGNEGKLEVFAENTGFIWDIAPE 106
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT--A 179
+KA +VF EHR+YG S+P+G E +YK+ GYL+S QALAD+A +I LK A
Sbjct: 107 YKAAIVFTEHRFYGNSLPFG---EDSYKHIKNLGYLTSEQALADFADVITYLKTQRIPQA 163
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI-VSPYSFSNIITQDFR 238
T SPV+VFGGSYGGMLAAWFR+KYPH+A GA+A+SAP+L F N V ++NI T+ F+
Sbjct: 164 THSPVIVFGGSYGGMLAAWFRIKYPHLADGAIAASAPLLWFQNTGVRQDGYANITTRTFK 223
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK-------NLAIESWLSTAF 291
+S ++ S+ + AK G + L K ++ KS + N+ + + +
Sbjct: 224 -LSGCDLTHLRASFDAMRTLAKTEDGRDHLNKVLKLGKSSEFEHSHDYNILV-NIFADVM 281
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAI-----DDPKTGNDVFAKLYGAASVYYNYSG 346
M DYP P+NF +PA+PVK+MC+ DDPK + LY +++YN SG
Sbjct: 282 GNVVMIDYPYPTNFFAQVPAWPVKKMCEKFNGDIPDDPK---ETVKPLYDILNIFYNTSG 338
Query: 347 TAKCFDLNG---DSDPHG-LSEWGWQACTEMIM 375
+ F L G +D G + W WQ CTEMIM
Sbjct: 339 KLEEFCLRGPDCGNDQLGAMDGWNWQICTEMIM 371
>gi|390365524|ref|XP_783661.3| PREDICTED: lysosomal Pro-X carboxypeptidase-like
[Strongylocentrotus purpuratus]
Length = 622
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 185/312 (59%), Gaps = 11/312 (3%)
Query: 195 LAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQ 254
LAAW R+KYP+ GA+A+SAP+ F + + I++DF++ ++ CY + SW
Sbjct: 312 LAAWMRIKYPNAIAGAIAASAPVWQFTGLTPCNTQYLTISKDFQAANQLCYDSVHMSWDV 371
Query: 255 IEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAMTDYPTPSNFLNPLP 310
I + G KL +A ++C K A + SWL+ ++ AM DYP P+NFL PLP
Sbjct: 372 ITRIGQTASGRTKLAQAMKLCNPLKTTADVDGLISWLAGSWFNLAMVDYPYPANFLEPLP 431
Query: 311 AFPVKEMCKAIDDPKTGND-VFAKLYGAASVYYNYSGTAKCFDLNGDSDPH-GLSEWGWQ 368
AFP+KE+C P +D + A+L GA VYYNY+ + +CF+L+ D+ G W +Q
Sbjct: 432 AFPIKEVCSYFKTPSPTDDQLLAELTGALGVYYNYTSSIQCFNLSQDATASLGDLGWSFQ 491
Query: 369 ACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKR 428
ACTEM+M D + +F +YDA+ CK + V PRPNWI ++FGG I
Sbjct: 492 ACTEMVMPFCADGVNDMFYSMPWNYDAQVAACKAQWNVTPRPNWIVSQFGGKNITA---- 547
Query: 429 FASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRRE 488
+SNI F NGL DPW GGVL ++S T++A + +GAHH+DLR K DP + VR +E
Sbjct: 548 -SSNIFFSNGLLDPWHLGGVLTDLSDTLVAGIIPDGAHHLDLRGKNKLDPPSVIAVRNQE 606
Query: 489 VEIIGKWISQYF 500
E I +WI++++
Sbjct: 607 RENINRWIAEWW 618
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 126/328 (38%), Positives = 172/328 (52%), Gaps = 56/328 (17%)
Query: 10 FCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNP 69
CL+ S + S +F R KL + ISS S +K +Y Q +DHF++
Sbjct: 13 MCLVVCSGFSSSACVLF-----GRRINNKLEAKISSQGCSHP-HKEEYFEQQVDHFSFT- 65
Query: 70 QSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFI 129
+ TFQ RYL++D W PIF YTGNEGDI WF QNTGF++D+A ++KA+++F
Sbjct: 66 -NSDTFQMRYLVSDELW---TKGGPIFFYTGNEGDITWFCQNTGFVWDLAVEYKAIVIFA 121
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT--ATDSPVVVF 187
EHRYYGKS+PYG + +YK+A+ GYL++ QALAD+A + K N A SPVV F
Sbjct: 122 EHRYYGKSLPYGND---SYKDAAHLGYLTAEQALADFAVFLDWYKANTRGGAAGSPVVAF 178
Query: 188 GGSYGGMLAAWFRLKYPHVAIGALASSAPILNF-----------DNIVSPYSFSNIITQD 236
GGSYGGMLAAW R+KYP+ G + D I+ P +++
Sbjct: 179 GGSYGGMLAAWMRIKYPNAIAGDDKDCDDDDDDNNNSSDDKDINDKIIKPLHVNHV---- 234
Query: 237 FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFV 292
+ G KL +A ++C K A + SWL+ ++
Sbjct: 235 ---------------------SPLTASGRTKLAQAMKLCNPLKTTADVDGLISWLAGSWF 273
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKA 320
AM DYP P+NFL PLPAFP+K A
Sbjct: 274 NLAMVDYPYPANFLEPLPAFPIKSFFAA 301
>gi|17556861|ref|NP_498688.1| Protein PCP-1 [Caenorhabditis elegans]
gi|21431885|sp|P34610.2|PCP1_CAEEL RecName: Full=Putative serine protease pcp-1; Flags: Precursor
gi|351020794|emb|CCD62762.1| Protein PCP-1 [Caenorhabditis elegans]
Length = 565
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 161/484 (33%), Positives = 239/484 (49%), Gaps = 64/484 (13%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y LDHF + +TF R + N+T + K PIF YTGNEG +E F TG M+
Sbjct: 46 YKNMKLDHFTWGDT--RTFDMRVMWNNTFY---KPGGPIFFYTGNEGGLESFVTATGMMF 100
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
D+AP F A ++F EHR+YG++ P+G +Y + + GYL+S QALADYA L+ +LK++
Sbjct: 101 DLAPMFNASIIFAEHRFYGQTQPFGNQ---SYASLANVGYLTSEQALADYAELLTELKRD 157
Query: 177 LT------ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFD-NIVSPYSF 229
+ V+ FGGSYGGML+AWFR KYPH+ GA A SAP++ + V P +F
Sbjct: 158 NNQFKMTFPAATQVISFGGSYGGMLSAWFRQKYPHIVKGAWAGSAPLIYMNGGGVDPGAF 217
Query: 230 SNIITQDFRSVSENCYK-VIKGSWKQIEETAKKPGGLE--------KLQKAFRICKSEKN 280
+I ++ + + C + ++ +W + G + KL +I
Sbjct: 218 DHITSRTY--IDNGCNRFILANAWNATLNLSSTDAGRQWLNNNTVFKLDPRTKIRNQTDG 275
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG---NDVFAKLYGA 337
+ ++L A Y AM DYP P+ FL PLPA+PV C ++ T D+ + A
Sbjct: 276 WNLNAYLREAIEYMAMVDYPYPTGFLEPLPAWPVTVACGYMNANGTSFSDKDLVKAVANA 335
Query: 338 ASVYYNYSGTAK---CFDLN---------GDSDPHGLSEWGWQACTEMI--MLTGGDNKD 383
A++YYNY+ C D + D G W WQ C+E+I M G + D
Sbjct: 336 ANIYYNYNRDPNFTYCIDFSICGDQGTGGLGGDELG---WPWQECSEIIMAMCASGGSND 392
Query: 384 SIFEESEED-YDARARYCKEAY---GVDPRPNW----ITTEFGGHKIGLVLKRFASNIIF 435
+ E +D Y + C + G P+ NW + T +G G +SN+I
Sbjct: 393 VFWNECGKDIYQTLQQGCVSIFKSMGWTPK-NWNIDAVKTLYGYDLSG------SSNLIL 445
Query: 436 FNGLRDPWSGGGV---LKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEII 492
G DPWSGGG N ++ + L AHH+DLR DP + + R + ++I+
Sbjct: 446 TQGHLDPWSGGGYKVDQNNAARGIYVLEIPGSAHHLDLRQPNTCDPNTVTNARFQIIQIL 505
Query: 493 GKWI 496
W+
Sbjct: 506 KCWV 509
>gi|388515463|gb|AFK45793.1| unknown [Lotus japonicus]
Length = 212
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 107/206 (51%), Positives = 140/206 (67%), Gaps = 2/206 (0%)
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNG 355
M +YP PS FL LP P+KE+C+ ID G + ++Y +VYYNY+G AKCF+L
Sbjct: 1 MVNYPYPSEFLMTLPGHPIKEVCRRIDQGPAGTSILERIYEGVNVYYNYTGEAKCFEL-- 58
Query: 356 DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITT 415
D DPHGLS W WQACTEM+M + S+F E +Y + C +++GV+PRP WITT
Sbjct: 59 DDDPHGLSGWNWQACTEMVMPMSSSQESSMFPPYEYNYTSYLEDCIKSFGVEPRPKWITT 118
Query: 416 EFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTK 475
EFGGH I LK+F SNIIF N L DPWSGG VL+NI ++++ LV KEGAHH++LR ST
Sbjct: 119 EFGGHNILAPLKKFGSNIIFSNSLLDPWSGGSVLQNIFESIVFLVTKEGAHHINLRASTG 178
Query: 476 EDPQWLKDVRRREVEIIGKWISQYFQ 501
DP WL + R E+++I WIS Y++
Sbjct: 179 NDPDWLVEQRATEIKLIQGWISDYYR 204
>gi|344309381|ref|XP_003423355.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 2-like
[Loxodonta africana]
Length = 579
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 152/454 (33%), Positives = 223/454 (49%), Gaps = 73/454 (16%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ Y Q+LDHFN+ +TF+QR+L+++ W +N PIF YTGNEGD+ FA N+
Sbjct: 165 FREGYFEQLLDHFNFERFGNKTFRQRFLVSEKFW--KRNEGPIFFYTGNEGDVWSFANNS 222
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + AL+VF EHRYYGKS+P+G T L+ QALAD+A L+
Sbjct: 223 GFILELAAREAALVVFAEHRYYGKSLPFGAQS----TQRGRTELLTVEQALADFAVLLQA 278
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+ + A D+P + F G L+A + P S P + ++P S S
Sbjct: 279 LRASFGAQDAPAIAF-GGRSANLSAPGGIVSPSSPHSPSVRSGPPPESASRMTPLS-SQD 336
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
+TQ F + + N + V+
Sbjct: 337 LTQLF-AFARNXFTVL-------------------------------------------- 351
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFD 352
AM DYP P+NFL PLPA PV+ C + K L G + YN SGT C+
Sbjct: 352 --AMMDYPYPTNFLGPLPANPVRAGCDRLLSEKQSIVGLRALVG---LLYNSSGTEPCYS 406
Query: 353 L----NGDSDPHGLSE------WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKE 402
+ +DP G W +QACTE+ + +N +F E R +YC +
Sbjct: 407 IYRQYRSCADPTGCGSGPDAEAWDYQACTEINLTFASNNVSDMFPELLFTESLRQQYCLD 466
Query: 403 AYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
+GV PR +W+ T F G ++ + ASNIIF NG DPW+GGG+ N+S +V+A+ +
Sbjct: 467 TWGVWPRRDWLRTSFWGAEL-----KAASNIIFSNGDLDPWAGGGIRSNLSASVIAVTIQ 521
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
GAHH+DLR S DP + + R+ E +I +W+
Sbjct: 522 GGAHHLDLRESNPADPASVVEARKLEAALIHEWV 555
>gi|313242000|emb|CBY34184.1| unnamed protein product [Oikopleura dioica]
Length = 402
Score = 232 bits (592), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 140/414 (33%), Positives = 206/414 (49%), Gaps = 33/414 (7%)
Query: 31 SSRITPEKLSSLISSSKDSQGL--YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGG 88
++ + P K SSL S S + ++ +Y DHF+ ++ Q + R + +D +
Sbjct: 11 AAALKPLKYSSLESYSDFCSEISTFEAEY-----DHFS--TRNTQKIEIRVITDDRFY-- 61
Query: 89 SKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAY 148
+ P+ YTGNEGD++ F +NTGFM + A LVF+EHRYYGKSIP N
Sbjct: 62 -QAGGPVLFYTGNEGDVQLFCENTGFMRKAGKELNAKLVFMEHRYYGKSIPDDKNL---- 116
Query: 149 KNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
YLS+ QALADYA ++ LK + PV+ GGSYGGMLAA+FR+KYP++
Sbjct: 117 -------YLSAEQALADYAEYLVHLKS--SGVTGPVIAMGGSYGGMLAAYFRIKYPNLVA 167
Query: 209 GALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN--CYKVIKGSWKQIEETAKKPGGLE 266
GA+A SAP+ + F + T+ F + C I+ SW+ I+ G
Sbjct: 168 GAIAGSAPVKFLPGLFDCRGFYRVTTRTFTNTPSGHFCSDNIRKSWETIKLIGAHMVGKR 227
Query: 267 KLQKAFRICKSEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK 325
L + FR C ++ + +L + AM DYP P+NF+ +P +PV C +D
Sbjct: 228 TLSEVFRTCDPITDVEPLLDFLENVWGTLAMMDYPYPTNFVGDVPGWPVNVACSHLDHDI 287
Query: 326 TGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSI 385
++ L AASVYYNY+G C DL + G W +Q CTE + D K+ +
Sbjct: 288 NQEELLEPLRDAASVYYNYTGDLACLDLGDEGGDLGYDNWYFQTCTEFVFPFCSDGKEDM 347
Query: 386 FEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGL 439
F D+ + C++ +G PR +W F + + IIF NGL
Sbjct: 348 FRVHTYDFPTYSTNCQQTFGTTPREHWAEMFFSVETMKTI-----GGIIFSNGL 396
>gi|413933337|gb|AFW67888.1| hypothetical protein ZEAMMB73_712720 [Zea mays]
Length = 773
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 135/333 (40%), Positives = 175/333 (52%), Gaps = 71/333 (21%)
Query: 62 LDHFNY-----NPQSYQTFQQRYLI-NDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
LDHF++ ++ FQQRYL+ D+ W G PIF Y GNEGDI WFA N+G +
Sbjct: 482 LDHFSFPGVEDEDEAVVFFQQRYLVGRDSGWAGP--GGPIFFYCGNEGDIAWFAANSGLI 539
Query: 116 YDVAPKFKA--------------LLVFIE---------HRYYGKSIPYGGNKEIAYKNAS 152
+D AP+F A L F+ HRYY +S+P+G +K AY ++
Sbjct: 540 WDAAPRFAARGNRSSAASLVSYSYLFFVLKTFRLKRYIHRYYRESMPFG-SKAKAYSDSK 598
Query: 153 TTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALA 212
YL++ QALAD+ L+ DLK+NL+A SPVV+FGGSYGGMLAAW RLKYPH+AIG L
Sbjct: 599 FPTYLTAEQALADFVVLLTDLKRNLSAEGSPVVLFGGSYGGMLAAWMRLKYPHIAIGVL- 657
Query: 213 SSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
+ + R++ Y +I S+ I T K G L
Sbjct: 658 ----------------HHQLRSCSLRTLFLLLYSMI--SYLMILGTLKTSGDL------- 692
Query: 273 RICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA 332
WLS+A+ Y AM DYP PS FL PLPA P+KE+C ID G
Sbjct: 693 -----------SDWLSSAYSYLAMVDYPLPSEFLRPLPANPIKEVCGNIDSQPKGIGTLE 741
Query: 333 KLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEW 365
++Y +VYYNY+ CFDLN DPHG+ W
Sbjct: 742 RIYAGVNVYYNYTDIVDCFDLN--DDPHGMGGW 772
>gi|255584372|ref|XP_002532920.1| catalytic, putative [Ricinus communis]
gi|223527313|gb|EEF29462.1| catalytic, putative [Ricinus communis]
Length = 245
Score = 228 bits (582), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 107/207 (51%), Positives = 144/207 (69%), Gaps = 1/207 (0%)
Query: 69 PQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVF 128
PQSY TFQQ+Y+I+ HW G++ +APIF Y G E + GF++D A KF AL VF
Sbjct: 39 PQSYATFQQKYVISFKHWTGAQASAPIFAYLGEESPLNADIHGIGFLFDNAAKFGALTVF 98
Query: 129 IEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFG 188
IEHR+YG SIP+ +E A NA+ GY +S QALADYA +++++K L+A SP++V G
Sbjct: 99 IEHRFYGDSIPFVSRQE-ALANATLRGYFNSAQALADYAEILLNIKLILSAETSPIIVIG 157
Query: 189 GSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVI 248
GSYGGMLA+WFRLKYPH+A+GALASSAPIL FDNI ++ +++T+D+R SE+C I
Sbjct: 158 GSYGGMLASWFRLKYPHIALGALASSAPILYFDNITPSDAYYSLVTKDYRDASESCSNTI 217
Query: 249 KGSWKQIEETAKKPGGLEKLQKAFRIC 275
K SW ++ A + GL L + F C
Sbjct: 218 KESWLELARVASQENGLSILSEKFHTC 244
>gi|413943524|gb|AFW76173.1| putative serine peptidase S28 family protein, partial [Zea mays]
Length = 266
Score = 224 bits (571), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 118/216 (54%), Positives = 147/216 (68%), Gaps = 17/216 (7%)
Query: 21 SNAKIFPTFPSSRITPEKLSSLISSSKD--------SQGLYKTKYHTQILDHFNYNPQSY 72
SN+ P F + +L SSS + S + Y Q+LDHF + P +
Sbjct: 44 SNSNPRPAFTPPLLKRHQLRRPSSSSAELVAGPANASTKPFTAHYFPQLLDHFAFTPNAS 103
Query: 73 QTFQQRYLINDTHW------GGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALL 126
F+ +YL+NDT W G P+FVYTGNEGDIEWFA NTGFM+D+AP F ALL
Sbjct: 104 TVFRHKYLLNDTFWRRPGAGAGDDGPGPLFVYTGNEGDIEWFATNTGFMFDIAPTFGALL 163
Query: 127 VFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVV 186
VFIEHR+YG+S P+G + +Y++A T GYL+STQALAD+A +I LK++L A +PVVV
Sbjct: 164 VFIEHRFYGESKPFGND---SYRSAETLGYLTSTQALADFAVVIRGLKRDLGAEAAPVVV 220
Query: 187 FGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDN 222
FGGSYGGMLA+WFRLKYPHVAIGALASSAPIL FD+
Sbjct: 221 FGGSYGGMLASWFRLKYPHVAIGALASSAPILQFDH 256
>gi|323451280|gb|EGB07158.1| hypothetical protein AURANDRAFT_2013, partial [Aureococcus
anophagefferens]
Length = 451
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 158/471 (33%), Positives = 245/471 (52%), Gaps = 50/471 (10%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG-FM 115
Y TQ LDHF ++ +TF Q+ L++D W + P+ +Y GNEG IE F N+G M
Sbjct: 1 YVTQQLDHFRFD--ETRTFSQKLLVHDA-W--HRPGGPLLMYFGNEGAIEDFYGNSGGLM 55
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYG----GNKEIAYKNASTTGYLSSTQALADYASLII 171
+++APK A + F+EHRYYG S+P+G G+ E+A+ L+ QALAD A ++
Sbjct: 56 FELAPKLNASVAFLEHRYYGSSLPFGNASYGSDELAF--------LTVEQALADMALVLA 107
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+ L A D P V+FGGSYGGMLAAWF LKYPH+A GA+A+SAP+ + F +
Sbjct: 108 TSSEILGAADGPAVLFGGSYGGMLAAWFMLKYPHLAAGAVAASAPVDLYPGEGKERPFFD 167
Query: 232 IITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESW 286
+ + + S C ++ + + AK G + L ++FR C+ + + S+
Sbjct: 168 AGLEVYGTYGSAACEADLRAALAALAAAAKTAAGRDALARSFRTCEPLPDPVDGDRLASY 227
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSG 346
++ A AM DYP S F+ P+PA PV+ C + +KL GA V+ N++G
Sbjct: 228 VNGALSTLAMLDYPYASAFVAPMPANPVRVACGRV---AAAPSAASKLKGAVDVFLNHTG 284
Query: 347 TAKCFDLN-------GDSDPHGLSE----WGWQACTEMIM--LTGGDNKDSIFEESEEDY 393
C+D G L W +QACTE+ + LT D +S +
Sbjct: 285 ETACYDARRELLAAPGAPPLRALGAIDRPWNYQACTELPLEPLT-SDGFGFFVPQSPKAL 343
Query: 394 DARARYCKEAYGVDPRPNWITTEFG-GHKIGLVLKRFASNIIFFNGLRDPWSGGGV---L 449
C++ +GV PRP+W+ FG G ++ L+ N++F +G +DPW GGV
Sbjct: 344 AEVEAACRDRFGVAPRPDWLRQSFGDGAQLAASLR----NVVFTDGDKDPWRVGGVPGDA 399
Query: 450 KNISK--TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ +S+ +V+ ++ + AHH DL S D + R E E + +W+++
Sbjct: 400 RALSRDGSVVHVLIADAAHHQDLFASDPADSPGVVAARVLEFEHVSRWVAR 450
>gi|149039380|gb|EDL93600.1| dipeptidylpeptidase 7, isoform CRA_b [Rattus norvegicus]
Length = 329
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 121/294 (41%), Positives = 173/294 (58%), Gaps = 14/294 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ Y Q +DHFN+ S +TF QR+L++D W PIF YTGNEGDI A N+
Sbjct: 41 FRENYFEQYMDHFNFESFSNKTFGQRFLVSDKFW--KMGEGPIFFYTGNEGDIWSLANNS 98
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + +ALLVF EHRYYGKS+P+G + T L+ QALAD+A L+
Sbjct: 99 GFIVELAAQQEALLVFAEHRYYGKSLPFG----VQSTQRGYTQLLTVEQALADFAVLLQA 154
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+ NL D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP++ + +P F
Sbjct: 155 LRHNLGVQDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVIAVAGLGNPDQFFRD 214
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + + + F C+ S K+L + +
Sbjct: 215 VTADFYGQSPKCAQAVRDAFQQIKDLFLQ-GAYDTISQNFGTCQSLSSPKDLTQLFGFAR 273
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI---DDPKTGNDVFAKLYGAAS 339
AF AM DYP P+NFL PLPA PVK C+ + G A LYG S
Sbjct: 274 NAFTVLAMMDYPYPTNFLGPLPANPVKVGCERLLSEGQRIMGLRALAGLYGDQS 327
>gi|358333026|dbj|GAA51627.1| lysosomal Pro-X carboxypeptidase [Clonorchis sinensis]
Length = 412
Score = 222 bits (565), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 141/394 (35%), Positives = 205/394 (52%), Gaps = 20/394 (5%)
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
M+++A + KA ++F EHR+YG S+P+ + ++K+ GYL++ QALADYASL+ LK
Sbjct: 1 MWEIAEELKAAVLFAEHRFYGSSLPFVND---SFKDPQHFGYLTAEQALADYASLVQYLK 57
Query: 175 KNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
++ +SPV+ FGGSYGGML+AWFR KYP++ GA+A+SAPI F N+ + F +
Sbjct: 58 SSVKDFENSPVIAFGGSYGGMLSAWFRYKYPNLIAGAIAASAPIWLFPNMSNCAGFYDTT 117
Query: 234 TQDFR-SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLST 289
T+ F S S C K + W I AK+ G E L+ F++C E+ L +L
Sbjct: 118 TRAFSTSGSTVCTKNVALVWDSIRTVAKQHSGHELLRLMFQLCDPLPDEQKLI--DYLID 175
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYG---AASVYYNYSG 346
AM +YP ++F+ P PVK CK + D + + A NY+
Sbjct: 176 FLGTLAMVNYPYEASFIGTFPGEPVKYFCKGLSDAVHRDVDVDVVQRVATAVRSLTNYTK 235
Query: 347 TAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
C L GD W Q C EM N + +F E D ++ C + + V
Sbjct: 236 NQSCISLEGDLPGLDAKAWTLQTCLEMTTPMCS-NGEGMFPSLEWDPVVFSQSCFDKFAV 294
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE-KEGA 465
PR NW EF G I + A+NI+F NG DPWS GVL + ++ GA
Sbjct: 295 RPRLNWSAVEFWGKNI-----KTATNIVFSNGDLDPWSAFGVLTDDQAPGCNVIRIPSGA 349
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
HH+DLR + DP + D R+RE++ I WI ++
Sbjct: 350 HHLDLRAKNELDPADVVDARQRELQHIKDWIDEW 383
>gi|323453591|gb|EGB09462.1| hypothetical protein AURANDRAFT_23831 [Aureococcus anophagefferens]
Length = 472
Score = 221 bits (564), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 166/473 (35%), Positives = 231/473 (48%), Gaps = 55/473 (11%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWG-GS--------KNNAPIFVYTGNEG 103
Y+TK+HTQ L H + + T+QQRYL+NDT WG GS P+ Y+GNEG
Sbjct: 20 YETKWHTQSLTHAKGDDR---TYQQRYLVNDTFWGKGSAPLWRDDDSCPGPVLFYSGNEG 76
Query: 104 DIEWFAQNTGFMYD-VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
++ F GFM D +APK+ A ++ E RYYG S+P+G N +N YLS+
Sbjct: 77 PVDGFWPANGFMTDYLAPKWGAYVLMAEARYYGASLPFG-NASWTPENVQ---YLSTELI 132
Query: 163 LADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF-- 220
LADYA L+ +LK +L PVV FGGSYGG L +FRL YP V +G LA+SAPI +
Sbjct: 133 LADYARLLTELKSSLQGC--PVVSFGGSYGGTLTTFFRLTYPDVVVGGLAASAPIGYYDP 190
Query: 221 ----DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
D+ V Y+FS+II +D+ + C I+ + + A P E L F +C
Sbjct: 191 AHWKDHGVDAYTFSDIIARDYDDAAPGCLDAIRATTDALN--AASP---EALVDLFHLCD 245
Query: 277 SEKNLAIESWL-STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLY 335
+ + L A DYP + +PA+PV C + T A+L
Sbjct: 246 AAALGPTRAALWQYALESLPQLDYP---RAVGSIPAWPVNHTCHLLARASTAA---ARLR 299
Query: 336 GAASVYYNYSGTA--KCF------DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFE 387
AA V GT CF D G WG+Q+CTE + +K
Sbjct: 300 VAAEVQAMVLGTGGETCFPALVEGPGGVPGDGPGPDSWGYQSCTENLHEFSSKSK---VR 356
Query: 388 ESEEDYDARARYCKEAY-GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
+ D++A+A C + P P +T +GG++I + +N+IF NGL DPW GG
Sbjct: 357 DYTFDFEAQASLCGSLFDDTTPDPRRLTALYGGYEIPAKV----TNVIFSNGLLDPWHGG 412
Query: 447 GVL--KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G N + + V +GAHH DLR +DP +K R E IG W++
Sbjct: 413 GFYPSDNADASNVFCVMPKGAHHGDLRKPEADDPADIKACRALEEATIGGWLA 465
>gi|118346549|ref|XP_977067.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89288520|gb|EAR86508.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 502
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 148/480 (30%), Positives = 243/480 (50%), Gaps = 43/480 (8%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+TKY Q++DH + +TF+Q+YLI D ++ K PI Y GNE +++
Sbjct: 19 YQTKYFDQLVDHIGFETGD-KTFKQKYLIKDDYYRYDK--GPILFYCGNEAPVDFSFGGA 75
Query: 113 GFMYD-VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
GFM+ +A + AL+VF+EHRY+G+S P+G KE ++K + YL+S QA+ DYA ++
Sbjct: 76 GFMHTTLAQELNALVVFMEHRYFGESQPFGTEKE-SFKKGNNK-YLTSFQAINDYAKFLV 133
Query: 172 DLKKNLTATDS--PVVVFGG----------SYGGMLAAWFRLKYPHVAIGALASSAPILN 219
KK+L D PVV FG SYGGML+AW R+K+P + +LASSAPI
Sbjct: 134 WFKKSLGCGDDECPVVAFGALSNIFINYKASYGGMLSAWIRMKFPEIIDVSLASSAPIFL 193
Query: 220 FDNI--VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKP-----------GGLE 266
++N + F I+T + C I + + + P L
Sbjct: 194 YENREGIDETLFYKIVTDTYE--QNGCNTQIHRAMNILTDLINSPVPSFLFKIQNKKILN 251
Query: 267 KLQKAFRICK---SEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMC---K 319
++ + + CK + NL + S++ A+ Y +M +YP +F++ +PA+P C +
Sbjct: 252 EINEGMKTCKPITDQDNLDVLRSYIDQAYSYMSMFNYPQEGHFVSKMPAWPANYSCTPFE 311
Query: 320 AIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLN-GDSDPHGLSEWGWQACTEMIMLTG 378
AI+D T + +F + + VYY++ +C + N G + S + C +++
Sbjct: 312 AINDKSTISQLFQAVKKSVDVYYDFEEQKECTNFNTGSTGEINTSAYEILTCADIVQPIH 371
Query: 379 GDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNG 438
+ +F + D D+ +YC+E +G+ P +++ +GG K +K+F + IIF NG
Sbjct: 372 PNGVTDMFYDQPWDKDSYQQYCQETFGLTPNYDYVLNFYGG-KNDEEMKQF-TRIIFSNG 429
Query: 439 LRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
L DPW G K IS + + AH DLR D + + R +E + I +WI +
Sbjct: 430 LLDPWQSGSPTKYISDDLPIINMYAAAHCSDLRLPQNGDVESVIQARIQEEKYIKQWIQE 489
>gi|413954868|gb|AFW87517.1| putative serine peptidase S28 family protein [Zea mays]
Length = 528
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 125/251 (49%), Positives = 156/251 (62%), Gaps = 20/251 (7%)
Query: 213 SSAPI-LNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGS--WKQIEETAKKPGGLEKLQ 269
SSAPI L D+I SF + ++QDF+S S NC+ VIK + W ++E A GGL L
Sbjct: 287 SSAPIILQLDHITPWSSFYDAVSQDFKSESMNCFSVIKATWGWDALDERAASDGGLLDLS 346
Query: 270 KAFRICKSEK-NLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN 328
K FR CK+ K +I +WL TAF YTAM DYPTP+NFL LPA+PVKEMCK ID G
Sbjct: 347 KLFRACKTLKYAYSIRNWLWTAFAYTAMVDYPTPANFLQNLPAYPVKEMCKIIDGFPAGA 406
Query: 329 DVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEE 388
DV K + AAS+YYNY+G C S G W +C M + + +S+F
Sbjct: 407 DVLDKAFAAASLYYNYTGDQTCTASMAGS---GRLARRW-SCGPMTV-----SNESMFPP 457
Query: 389 SEEDYDARARYCKEAY-------GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRD 441
S Y+ R+ C +++ GV PRP+W+TTE+GGHKI VLKRF SNIIF NG+RD
Sbjct: 458 STFSYEERSDECFQSWGGGGGGGGVRPRPHWVTTEYGGHKIEKVLKRFGSNIIFSNGMRD 517
Query: 442 PWSGGGVLKNI 452
PWS GGVLKNI
Sbjct: 518 PWSRGGVLKNI 528
>gi|242096522|ref|XP_002438751.1| hypothetical protein SORBIDRAFT_10g025530 [Sorghum bicolor]
gi|241916974|gb|EER90118.1| hypothetical protein SORBIDRAFT_10g025530 [Sorghum bicolor]
Length = 192
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 106/191 (55%), Positives = 133/191 (69%), Gaps = 3/191 (1%)
Query: 317 MCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGL--SEWGWQACTEMI 374
MCK ID D+ K++ AAS++YNY+G C + P L S W WQACTEMI
Sbjct: 1 MCKIIDGFPANADILEKVFAAASLFYNYTGDQTCNHIEDKDSPRSLDLSYWRWQACTEMI 60
Query: 375 MLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNII 434
M N +S+F S Y+ R YC +GV PRP+WITTE+GG+KI VLKRF SNII
Sbjct: 61 MPMSSSN-ESMFPPSTFSYEDRFNYCFRFFGVRPRPHWITTEYGGYKIDKVLKRFGSNII 119
Query: 435 FFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGK 494
F NG+RDPWSGGGVLKNIS +++ALV ++GAHH+DLR +TK+DP W+ + RR+EVEII
Sbjct: 120 FSNGMRDPWSGGGVLKNISSSIVALVTEKGAHHLDLRGATKDDPDWVIEQRRQEVEIIQG 179
Query: 495 WISQYFQDLAQ 505
WI QY+QD AQ
Sbjct: 180 WIDQYYQDTAQ 190
>gi|241753998|ref|XP_002401172.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215508369|gb|EEC17823.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 329
Score = 218 bits (555), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 127/340 (37%), Positives = 187/340 (55%), Gaps = 28/340 (8%)
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
G M++ AP+F ALLVF EHRYYGKS+P+G ++++ S GYL+S QALADYA L++
Sbjct: 1 GLMWEWAPEFNALLVFAEHRYYGKSMPFGNR---SFESPSKLGYLTSEQALADYADLLLH 57
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
LK L A SPVV FGGSYGG+L+AWFR+KYPH+ ALASSAP+ F +V ++S
Sbjct: 58 LKAKLPGAEKSPVVAFGGSYGGLLSAWFRIKYPHLITAALASSAPVNMFPGLVPCSTYSI 117
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK--SEKNLAI-ESWLS 288
IT+ FR VSE C + I+ SW +E G + LQ+ F +C+ + N + W+
Sbjct: 118 AITEAFRRVSELCTQAIRQSWSPLEAMGATEKGTKTLQEKFNLCQGLNPGNYTVFRDWIR 177
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND--VFAKLYGAASVYYNYSG 346
+ A+ +YP P + + PLP PVK +C A+ GN + + A ++++N +G
Sbjct: 178 DTYAVLALVNYPEPGSLITPLPGSPVKAVCDALTK-AIGNRSAMVDAVAAAVNLFFNSTG 236
Query: 347 TAKCFDLNGDSDPHGLSEWGWQ-----------ACTEMIMLTGGDNKDSIFEESEEDYDA 395
T KC D++ + W +Q CTE++M D +F S ++
Sbjct: 237 TRKCHDVSIFQS--AVPSWRFQVSHLCTLAYNAGCTELVMPVCSDGVTDMFYPSSWNFTE 294
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIF 435
C+E +GV P +GG + A+NI+F
Sbjct: 295 VTAKCRETFGVTPDIYKSVMLYGGGHLAR-----ATNIVF 329
>gi|62089160|dbj|BAD93024.1| Dipeptidyl-peptidase II precursor variant [Homo sapiens]
Length = 377
Score = 218 bits (554), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 113/273 (41%), Positives = 166/273 (60%), Gaps = 11/273 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ ++ Q LDHFN+ +TF QR+L++D W + PIF YTGNEGD+ FA N+
Sbjct: 30 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFW--VRGEGPIFFYTGNEGDVWAFANNS 87
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + ALLVF EHRYYGKS+P+G T L+ QALAD+A L+
Sbjct: 88 GFVAELAAERGALLVFAEHRYYGKSLPFGAQS----TQRGHTELLTVEQALADFAELLRA 143
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP+L + F
Sbjct: 144 LRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRD 203
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + ++ F C+ EK+L + +
Sbjct: 204 VTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 262
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI 321
AF AM DYP P++FL PLPA PVK C +
Sbjct: 263 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCDRL 295
>gi|323450637|gb|EGB06517.1| hypothetical protein AURANDRAFT_71973 [Aureococcus anophagefferens]
Length = 939
Score = 218 bits (554), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 153/430 (35%), Positives = 222/430 (51%), Gaps = 27/430 (6%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
+Q LDHF++ + TF+QR ++ HW PIFVY GNE D+ + TG M++
Sbjct: 40 SQNLDHFDFTTNA--TFEQRVFVHADHW---SPGGPIFVYCGNEDDVTLYVNATGLMWEH 94
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
A F A+LVF+EHRYYG+++P+G A YLS QALAD + + +K
Sbjct: 95 AAAFGAMLVFVEHRYYGETLPFGA----ASFEPGRLRYLSHEQALADLVNALRRIKATYG 150
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFD-NIVSPYSFSNIITQDF 237
A ++ V FGGSYGGMLAAW R+KYP +GA+A+SAPIL FD + ++ ++T+D
Sbjct: 151 AENAKTVAFGGSYGGMLAAWLRMKYPAAVVGAVAASAPILAFDGDGFDGEAYWEVVTRDA 210
Query: 238 RSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLST-AFVYTAM 296
+ + + + + L + FR C + + + L+ AF AM
Sbjct: 211 TAAAGAAPACAANVREAFSALFR--ADRDDLSRIFRTCGPVADRSRLALLALFAFDTMAM 268
Query: 297 TDYPTPSNFLN----PLPAFPVKEMCKAIDDPKTGND-VFAKLYGAASVYYNYSGTAKCF 351
+YP S +L LPAFPV+ C+ + P G++ + A L AA V+YN SG C
Sbjct: 269 GNYPYESTYLTHGEVALPAFPVRAACEHLAGPLDGDEALLAALAAAAGVFYNASGALACN 328
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTG---GDNKDSIFEESEEDYDARARYCKEAYGVDP 408
+L D + G+ W WQ CTE + D +F + + +C+ YGV P
Sbjct: 329 ELPADVEEDGI--WDWQYCTETLPQETYFPRDGVRDMFWPAPANDSWVDAHCEAKYGVAP 386
Query: 409 RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHV 468
R WI +GG + G+ A+NI+F NG DPWS GGV + GAHH+
Sbjct: 387 RRRWIADSYGG-RAGVAA---ATNIVFSNGALDPWSAGGVADAAGGATETVRIDLGAHHL 442
Query: 469 DLRFSTKEDP 478
DL F+ +DP
Sbjct: 443 DLMFAHPDDP 452
>gi|167534702|ref|XP_001749026.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772450|gb|EDQ86101.1| predicted protein [Monosiga brevicollis MX1]
Length = 433
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 141/422 (33%), Positives = 213/422 (50%), Gaps = 38/422 (9%)
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY--LSSTQALAD 165
+ +TG M++ A FKAL+VF EHR++G+S G ST+ Y S QA+AD
Sbjct: 10 YVDHTGLMWENAADFKALIVFAEHRFFGQSQVTPGAD-----GPSTSEYPLFSVEQAMAD 64
Query: 166 YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
Y + + K+N + DSPV+VFGGSYGGMLAAW R+KYP +GA+A+SAPI F
Sbjct: 65 YNHFLFEFKQNRSIEDSPVIVFGGSYGGMLAAWLRIKYPETFLGAVAASAPISGFAGQQP 124
Query: 226 PY---SFSNIITQDFRSVSENCYKV---IKGSWKQIEETAKKPGGLEKLQKAFRICK--- 276
+ ++ ++T+D + ++ S+ + +T G L FR+CK
Sbjct: 125 EWDSNTYWQVVTRDATPAAGAPAACADNVRNSFVTLFKTGASESGRAHLSDLFRLCKPLG 184
Query: 277 -SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNP----LPAFPVKEMCKAIDDPK-TGNDV 330
+ A+ W++ A+ AM D+P PSN+L LPA+PV C+ + TG+ +
Sbjct: 185 STSDVQALAMWIAYAWDTMAMGDFPYPSNYLTSNGPMLPAYPVTAACQHLATANLTGDAL 244
Query: 331 FAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLT---GGDNKDSIFE 387
+ AASV+ N + +C D+ D D W W CTE + D + +F
Sbjct: 245 LQGVLAAASVFTNATANLQCNDVPFD-DVEQDGIWDWLFCTETMHQETYFSLDGQRDMFW 303
Query: 388 ESEEDYDARARYCKEAYGVDPR--------PNWITTEFGGHK----IGLVLKRFASNIIF 435
+ +C + YGV PR N E G H+ A N++
Sbjct: 304 SQPWNTTFINDHCFKKYGVTPRYTKVAERYGNPEAIEVGAHRSTTLSLQAALAAAGNVVL 363
Query: 436 FNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKW 495
NGL DPWS G+ +NIS TV A++ + GAHH+DL FST +DP + R E++ + +W
Sbjct: 364 SNGLLDPWSSAGIKRNISNTVTAIILEHGAHHLDLMFSTPQDPVDVSFARWFEMQRVREW 423
Query: 496 IS 497
++
Sbjct: 424 LA 425
>gi|156348420|ref|XP_001621842.1| hypothetical protein NEMVEDRAFT_v1g990 [Nematostella vectensis]
gi|156208137|gb|EDO29742.1| predicted protein [Nematostella vectensis]
Length = 283
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 124/296 (41%), Positives = 177/296 (59%), Gaps = 19/296 (6%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+KT Q +DHFN+ TF+QRYL + +W G PIF Y+GNEG I F +N+
Sbjct: 1 FKTGTFEQTVDHFNFIQSG--TFKQRYLYTEKYWDG---KGPIFFYSGNEGGITGFWENS 55
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+++ A F AL++F EHRYYG+S+P+G + ++K GYLS QALAD+A+LI
Sbjct: 56 GFVFEAAKNFSALVIFGEHRYYGESLPFGQD---SFK-IENIGYLSIEQALADFATLIPA 111
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LKK A + PVV FGGSYGGML+A+ R KYP+V ALA+SAPI ++ F
Sbjct: 112 LKKQFKAEEKPVVSFGGSYGGMLSAYLRFKYPNVIQAALAASAPIYFIADLSIRDFFFPA 171
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPG--GLEKLQKAFRICKSEKNL----AIESW 286
+T+DF++ C +++ + +++ KK G GL+ + KAF++CK K+ + W
Sbjct: 172 VTRDFKNADPKCPDLVRAGFIELDNL-KKEGLKGLDAISKAFKLCKPLKSADQINHLIGW 230
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY 342
+ AF AM DYP ++FL PLPA PV CK + T +D + L AA + Y
Sbjct: 231 IRNAFTIIAMCDYPYATDFLAPLPANPVNYACKLL---ATASDRLSGLADAAGLAY 283
>gi|216296557|gb|ACJ72063.1| prolylcarboxypeptidase [Trichoplax adhaerens]
Length = 254
Score = 214 bits (545), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 114/244 (46%), Positives = 153/244 (62%), Gaps = 12/244 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
YKTKY QI+DHF++ +S T++QRYL+ND HW K PIF YTGNEG I F QN+
Sbjct: 13 YKTKYFDQIIDHFDW--KSNATYRQRYLMNDDHW--DKGTGPIFFYTGNEGGIVGFWQNS 68
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
G ++D+AP+F+AL+VF EHRYYGKS+P+G + KN L+S QALADYA L+
Sbjct: 69 GLLFDLAPQFRALIVFGEHRYYGKSLPFGKD-SFKPKNLEL---LTSEQALADYAVLLTS 124
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
LKK+L A VV FGGSYGGML AW RLKYP++ LA+SAP+ +VSP F
Sbjct: 125 LKKSLNANKCKVVAFGGSYGGMLTAWMRLKYPNIIDAGLAASAPLYMAGGVVSPNFFFPA 184
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLS 288
+T+D++ + C I+ ++ + E AK G +K+ K F +C K A + W+
Sbjct: 185 VTKDYQDANPKCVPNIRKAFSAVLEMAKSKSGKQKVAKIFNVCNKLKTSADVKQLIGWIR 244
Query: 289 TAFV 292
FV
Sbjct: 245 NGFV 248
>gi|302758754|ref|XP_002962800.1| hypothetical protein SELMODRAFT_404105 [Selaginella moellendorffii]
gi|300169661|gb|EFJ36263.1| hypothetical protein SELMODRAFT_404105 [Selaginella moellendorffii]
Length = 674
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 101/165 (61%), Positives = 127/165 (76%), Gaps = 11/165 (6%)
Query: 77 QRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGK 136
Q YL++ W G + API VY GN+GDI WFA+NTGFM+D+A F+ALLVF EHR+YGK
Sbjct: 18 QHYLLHSASWSGGASGAPILVYCGNKGDIVWFAENTGFMFDIAHLFRALLVFPEHRFYGK 77
Query: 137 SIPYGGN---KEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGG 193
S P+GG KE+A+ S+ QALAD+A+LI+DLK+NL+A SPVVVFGGSYGG
Sbjct: 78 SQPFGGQNGPKELAF--------CSAEQALADFATLILDLKRNLSAQASPVVVFGGSYGG 129
Query: 194 MLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
MLAAWFRLKYPH+AIGALASSAPIL F+NIV ++ +I++ F+
Sbjct: 130 MLAAWFRLKYPHIAIGALASSAPILQFENIVPYTTYYDIVSNAFK 174
>gi|397492345|ref|XP_003817084.1| PREDICTED: dipeptidyl peptidase 2 [Pan paniscus]
Length = 406
Score = 212 bits (539), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 120/320 (37%), Positives = 179/320 (55%), Gaps = 23/320 (7%)
Query: 191 YGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKG 250
YGGML+A+ R+KYPH+ GALA+SAP+L + F +T DF S C + ++
Sbjct: 77 YGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTADFEGQSPKCTQGVRE 136
Query: 251 SWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLSTAFVYTAMTDYPTPSNFL 306
+++QI++ + G + ++ F C+ EK+L + + AF AM DYP P++FL
Sbjct: 137 AFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFARNAFTVLAMMDYPYPTDFL 195
Query: 307 NPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL----NGDSDPHGL 362
PLPA PVK C D + L A + YN SG+ C+D+ + +DP G
Sbjct: 196 GPLPANPVKVGC---DRLLSEAQRITGLRALAGLVYNASGSEHCYDIYRLYHSCADPTGC 252
Query: 363 S------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE 416
W +QACTE+ + +N +F + + R RYC + +GV PRP+W+ T
Sbjct: 253 GTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQRYCLDTWGVWPRPDWLLTS 312
Query: 417 FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKE 476
F G + R ASNIIF NG DPW+GGG+ +N+S +V+A+ + GAHH+DLR S E
Sbjct: 313 FWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIAVTIQGGAHHLDLRASHPE 367
Query: 477 DPQWLKDVRRREVEIIGKWI 496
DP + + R+ E IIG+W+
Sbjct: 368 DPASVVEARKLEATIIGEWV 387
>gi|417400793|gb|JAA47320.1| Putative dipeptidyl peptidase 2 [Desmodus rotundus]
Length = 429
Score = 211 bits (538), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 110/294 (37%), Positives = 173/294 (58%), Gaps = 17/294 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ Y Q LDHFN+ QTF QR+L+++ W + P+F YTGNEGD+ FA N+
Sbjct: 40 FRESYFEQFLDHFNFERFGNQTFPQRFLVSEKFW--KRGEGPLFFYTGNEGDVWAFANNS 97
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY---LSSTQALADYASL 169
GF+ ++A + AL+VF EHRYYGKS+P+G + ++ G+ L+ QALAD+A L
Sbjct: 98 GFILELAAQQGALVVFAEHRYYGKSLPFG-------ERSTQRGHVELLTVEQALADFARL 150
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
+ L+++L A D P V FGGSYGGML+A+ R+KYPH+ GALA+SAP++ + Y F
Sbjct: 151 LQALRRDLGAQDVPAVAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVLVAGLGDAYQF 210
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN----LAIES 285
++ DF C + ++ +++QI++ + G + + +AF +C+ + +
Sbjct: 211 FRDVSLDFEGQGPKCAQGVRDAFRQIKDLFLQ-GAYDVVSQAFGLCRPLSGWKDLVQLFG 269
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAAS 339
+ AF AM DYP P++FL LPA PV+ C + + + L GA++
Sbjct: 270 FARNAFTVLAMMDYPYPTDFLGHLPANPVQVACDRLLNESDRIEGLRALAGASA 323
>gi|296491747|tpg|DAA33780.1| TPA: dipeptidyl peptidase 7 [Bos taurus]
Length = 335
Score = 211 bits (538), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 121/325 (37%), Positives = 189/325 (58%), Gaps = 22/325 (6%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P L + KD + ++ Y Q+LDHFN+ +TF QR+L+ + W ++ PI
Sbjct: 21 PRGLEARAHRPKDPE--FQEAYFEQLLDHFNFERFGNKTFLQRFLMTEKFW--NRGEGPI 76
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
F YTGNEGD+ FA N+GF+ ++A + AL+VF EHRYYGKS+P+G + ++ G
Sbjct: 77 FFYTGNEGDVWSFANNSGFILELAEQQGALVVFAEHRYYGKSLPFG-------ERSTWRG 129
Query: 156 Y---LSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALA 212
Y L+ QALAD+A L+ L++ L A D+P + FGGSYGGML+A+ R+KYPH+ GALA
Sbjct: 130 YTELLTVEQALADFAGLLRALRQELEAPDAPAIAFGGSYGGMLSAYLRIKYPHLVAGALA 189
Query: 213 SSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
+SAP+++ + PY F ++ DF+ S C + ++ +++QI + ++ G + + F
Sbjct: 190 ASAPVVSAAGLGDPYQFFQDVSADFQGQSPECARAVQDAFRQIRDLFQQ-GAPHVVSQEF 248
Query: 273 RICK---SEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN 328
C+ K+L + + AF AM DYP ++F+ LPA PV+ C + + +
Sbjct: 249 GTCQPLSGPKDLTQLFGFARNAFTVLAMMDYPYATDFVGHLPAHPVQVGCSRL---LSES 305
Query: 329 DVFAKLYGAASVYYNYSGTAKCFDL 353
A L A + YN SG C+D+
Sbjct: 306 SRIAGLRALAGLVYNSSGIEPCYDI 330
>gi|414880844|tpg|DAA57975.1| TPA: hypothetical protein ZEAMMB73_592594 [Zea mays]
Length = 761
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 101/197 (51%), Positives = 139/197 (70%), Gaps = 2/197 (1%)
Query: 123 KALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDS 182
K + +HRYY +S+P+G +K AY ++ + YL++ QALAD+A + DLK+NL+A S
Sbjct: 525 KTMEKLSKHRYYRESMPFG-SKAKAYIDSKSLAYLTAKQALADFAVQLTDLKRNLSAEGS 583
Query: 183 PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSE 242
PVV+FG SYGGMLAAW RLKYPH+AIGALASSAPIL F++IV F ++++ DFR S
Sbjct: 584 PVVLFGDSYGGMLAAWIRLKYPHIAIGALASSAPILQFEDIVPSTIFYDLVSDDFRRESL 643
Query: 243 NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAFVYTAMTDYPT 301
+C+ IK SWK++++ A K GL KL K F +C++ K + WLS+A+ Y AM DYP
Sbjct: 644 SCFLKIKDSWKELDDQANKQDGLLKLSKTFHLCQTLKTSGDLSDWLSSAYSYLAMVDYPL 703
Query: 302 PSNFLNPLPAFPVKEMC 318
S FL PLPA P+K++
Sbjct: 704 SSKFLRPLPANPIKKLV 720
>gi|148676287|gb|EDL08234.1| dipeptidylpeptidase 7, isoform CRA_a [Mus musculus]
Length = 329
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 173/294 (58%), Gaps = 14/294 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ Y Q +DHFN+ +TF QR+L++D W PIF YTGNEGDI FA N+
Sbjct: 41 FHENYFEQYMDHFNFESFGNKTFGQRFLVSDKFW--KMGEGPIFFYTGNEGDIWSFANNS 98
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM ++A + +ALLVF EHRYYGKS+P+G + T L+ QALAD+A L+
Sbjct: 99 GFMVELAAQQEALLVFAEHRYYGKSLPFG----VQSTQRGYTQLLTVEQALADFAVLLQA 154
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L D+P + FGGSYGGML+A+ R+KYPH+ GALA+SAP++ + Y F
Sbjct: 155 LRQDLGVHDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVAVAGLGDSYQFFRD 214
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
+T DF S C + ++ +++QI++ + G + + + F C+ S K+L + +
Sbjct: 215 VTADFYGQSPKCAQAVRDAFQQIKDLFLQ-GAYDTISQNFGTCQSLSSPKDLTQLFGFAR 273
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK---TGNDVFAKLYGAAS 339
AF AM DYP P++FL PLPA PVK C+ + + G A LYG S
Sbjct: 274 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCQRLLNEGQRIMGLRALAGLYGDQS 327
>gi|119608755|gb|EAW88349.1| dipeptidyl-peptidase 7 [Homo sapiens]
Length = 327
Score = 205 bits (521), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 117/317 (36%), Positives = 176/317 (55%), Gaps = 23/317 (7%)
Query: 194 MLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWK 253
ML+A+ R+KYPH+ GALA+SAP+L + F +T DF S C + ++ +++
Sbjct: 1 MLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTADFEGQSPKCTQGVREAFR 60
Query: 254 QIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPL 309
QI++ + G + ++ F C+ EK+L + + AF AM DYP P++FL PL
Sbjct: 61 QIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFARNAFTVLAMMDYPYPTDFLGPL 119
Query: 310 PAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL----NGDSDPHGLS-- 363
PA PVK C D + L A + YN SG+ C+D+ + +DP G
Sbjct: 120 PANPVKVGC---DRLLSEAQRITGLRALAGLVYNASGSEHCYDIYRLYHSCADPTGCGTG 176
Query: 364 ----EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGG 419
W +QACTE+ + +N +F + + R RYC + +GV PRP+W+ T F G
Sbjct: 177 PDARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQRYCLDTWGVWPRPDWLLTSFWG 236
Query: 420 HKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQ 479
+ R ASNIIF NG DPW+GGG+ +N+S +V+A+ + GAHH+DLR S EDP
Sbjct: 237 GDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIAVTIQGGAHHLDLRASHPEDPA 291
Query: 480 WLKDVRRREVEIIGKWI 496
+ + R+ E IIG+W+
Sbjct: 292 SVVEARKLEATIIGEWV 308
>gi|355684809|gb|AER97524.1| dipeptidyl-peptidase 7 [Mustela putorius furo]
Length = 316
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 112/273 (41%), Positives = 168/273 (61%), Gaps = 11/273 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ Y Q+LDHFN+ +TF QR+L+++ W K PIF YTGNEGD+ FA N+
Sbjct: 36 FQEGYFEQLLDHFNFERFGNKTFPQRFLVSEKFW--KKGKGPIFFYTGNEGDVWSFANNS 93
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GF+ ++A + +AL+VF EHRYYGKS+P+G T L+ QALAD+A LI
Sbjct: 94 GFIQELAAQQEALVVFAEHRYYGKSLPFGDRS----TRRGHTELLTVEQALADFARLIRA 149
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L+++L A DSPV+ FGGSYGGML+A+ R+KYPH+ GALA+SAP++ + Y F
Sbjct: 150 LQRDLGAYDSPVIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVAVAGLGDSYQFFRD 209
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLS 288
++ DF S C + ++ +++QI++ + G + + + F C+ S ++L + +
Sbjct: 210 VSADFEGQSPKCAQGVRDAFRQIKDLFSQ-GAYDTVSREFGTCQPLSSREDLTQLFGFAR 268
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI 321
AF AM DYP P++F+ LPA PVK C +
Sbjct: 269 NAFTVLAMMDYPYPTDFIGHLPANPVKVGCDRL 301
>gi|358253095|dbj|GAA51984.1| dipeptidyl peptidase 2, partial [Clonorchis sinensis]
Length = 593
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 115/297 (38%), Positives = 164/297 (55%), Gaps = 17/297 (5%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K +Y TQ +DH N+ P + T++ RYL D + K+ PIF Y GNEGDI F N+G
Sbjct: 308 KEQYFTQRVDHMNFQPANI-TYRMRYLYEDKWY---KSGGPIFFYCGNEGDIFGFWNNSG 363
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
F++ +A K A++VF EHRYYGKS+P+ + Y +LS Q LADYA+LI L
Sbjct: 364 FIFHLASKMDAMVVFAEHRYYGKSLPFKNSFSQPY-----IQFLSIEQTLADYANLIQHL 418
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K+ ++ V+ FGGSYGGMLAA+ R YPH+ GA+ASSAP+ + + + F +
Sbjct: 419 KEKYGRDNTAVIAFGGSYGGMLAAYMRASYPHLVAGAIASSAPVNWVAGLGNIHQFFEHV 478
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKP-GGLEKLQKAFRICKSEKNLA----IESWLS 288
T D+ V+ C +K ++ +E + L + K ++CK + + W
Sbjct: 479 TDDYNQVNPQCVARVKNAYDLLERMVMEDIRALASISKQMKLCKPMHTIFDFVWMLKWSR 538
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYS 345
AFV M DYPT + F++ LPA+PV C I DV + L A SV+YN S
Sbjct: 539 NAFVMMTMLDYPTDNTFISQLPAYPVNVSCAKI---LAAPDVISALRDAVSVWYNSS 592
>gi|384253205|gb|EIE26680.1| hypothetical protein COCSUDRAFT_11918 [Coccomyxa subellipsoidea
C-169]
Length = 395
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 135/406 (33%), Positives = 203/406 (50%), Gaps = 57/406 (14%)
Query: 3 TRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLIS-SSKDSQGLYKTKYHTQ- 60
R L CL+ S + ++ + P K SS +S + + K K +T+
Sbjct: 2 ARITPLVLCLIICSAFSGGQCQVRRSRKGPPRIPYKNSSRTQLTSNGREPITKCKVYTRD 61
Query: 61 -ILDHFNYN--PQSYQTFQQRYLINDTHWGGSKN--NAPIFVYTGNEGDIEWFAQNTGFM 115
LDHF++ P + TF+QRY + + HW K+ PIF Y GNE D+ + TG M
Sbjct: 62 ATLDHFSWATPPDNRTTFKQRYFLCNDHWKSHKDGTRGPIFFYVGNEADVTLYLNATGLM 121
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
++ A F ALLVF EHRYYG+S P+ K + + YL+S QA+AD+A LI++LK+
Sbjct: 122 WENAAAFGALLVFAEHRYYGESKPF--KKALRHH----MQYLTSEQAMADFAELIMELKE 175
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY---SFSNI 232
+L A S V+ FGGSYGGMLA W R+KYPH+ GA+A SAPI ++ Y S++ I
Sbjct: 176 DLGAQSSAVIGFGGSYGGMLATWMRIKYPHILDGAIAGSAPIWSYLGEEPAYDSGSYAKI 235
Query: 233 ITQDFRSV---SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLST 289
+T D + C ++ W Q G +QK IC +
Sbjct: 236 VTADASEAGGSAPACASNVREVWNQ---------GSWAVQK---ICLDMRR--------- 274
Query: 290 AFVYTAMTDYPTPSNFL----NPLPAFPVKEMCKAI-DDPKTGNDVFAKLYGAASVYYNY 344
+YP PS+++ LPA+PV+ C+++ + G+D+ + A V+YNY
Sbjct: 275 -------GNYPYPSSYILNGNGILPAYPVRVACESLRQEDLAGSDLLSAFADALGVFYNY 327
Query: 345 SGTAKCFDL----NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIF 386
+ +C+D N ++D G S W +Q CTE D K ++
Sbjct: 328 TEDVECYDFGAGPNPETDEDG-SFWDYQWCTEQFQPFSKDGKHDMY 372
>gi|440803021|gb|ELR23935.1| serine carboxypeptidase [Acanthamoeba castellanii str. Neff]
Length = 481
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 144/466 (30%), Positives = 222/466 (47%), Gaps = 29/466 (6%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
S+S + L K +++TQ LDHFN Q +T++QRY INDT W S PIF G EG
Sbjct: 27 SASSVGRDLPKEQWYTQRLDHFN--GQETRTWKQRYFINDTFWNPSAP-GPIFFQMGGEG 83
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
+ M K AL+V +EHR+YG S P + + + +LSS QAL
Sbjct: 84 AVSGEDVVLLQMVQYGIKHGALMVTLEHRFYGTSQP------LPDLSIESLRFLSSEQAL 137
Query: 164 ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
AD A ++ LK A SP++ FG SY G LAAWFRLKYPHV ++ASSAP+ +
Sbjct: 138 ADAAEFLLWLKDQYQAPKSPIITFGCSYPGALAAWFRLKYPHVTYASVASSAPV---EAT 194
Query: 224 VSPYSFSNIITQDFRS-VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN-L 281
+ + + +++ Q V + C IK + + + PGG KLQ F C +N L
Sbjct: 195 LDFFEYLDVVDQSLEYFVGDKCVANIKQATTAVSQLMASPGGRAKLQSLFNFCGPIQNEL 254
Query: 282 AIESWLST-AFVYTAMTDYPTPSNFLNPLPA-FPVKEMCKAIDDPKTG----NDVFAKLY 335
I ++ S+ A + Y + NPL + K M + DP T +++F +
Sbjct: 255 DIANFYSSLAGNWMGTVQYNDENG--NPLDVIYLCKIMTQPGVDPLTAYVNISNIFLRSQ 312
Query: 336 GAASVYYNYS-GTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYD 394
+ + +Y+ A+ D + + G+ +W +Q CTE D+ F +
Sbjct: 313 DQSCLDVSYADAIAQLRDTSAAAAGVGIRQWVYQTCTEFGYFQTSDSDGQPFGDGMP-LK 371
Query: 395 ARARYCKEAYGV--DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
C++A+G+ PR N +GG + SNI+F NG DPW + K+I
Sbjct: 372 FSLDQCRDAFGLIDPPRINATNHIYGGRNLP---AWGPSNILFVNGNIDPWHALSITKSI 428
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
S ++ + AH ++ + + DP L R+ I +W++Q
Sbjct: 429 SPSLTTVFINGTAHCANVLPAHENDPPSLVQARKDIQAQIDQWLAQ 474
>gi|159489554|ref|XP_001702762.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280784|gb|EDP06541.1| predicted protein [Chlamydomonas reinhardtii]
Length = 419
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 129/371 (34%), Positives = 178/371 (47%), Gaps = 71/371 (19%)
Query: 32 SRITPEKLSSLISSS--KDSQGLYKTKYHTQILDHFNYNPQS--YQTFQQRYLINDTHWG 87
SR SS+ + S KD K + LDHF+ P + TF QRY + +HW
Sbjct: 44 SRKLQRPFSSVATHSLRKDLLAQCKLQLRNASLDHFSRVPPAGDVTTFPQRYFVCASHWQ 103
Query: 88 GSKNNA---PIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNK 144
+ PIF Y GNE D+ + NTG M++ A F A+LVF EHRYYG+S+PYG
Sbjct: 104 RENPDGTPGPIFFYLGNEADVTLYLNNTGLMWESAADFGAMLVFAEHRYYGESVPYG--- 160
Query: 145 EIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVV-FGGSYGGMLAAWFRLKY 203
E K+ GYLS+ Q LI++LK+ V+ FGGSYGGMLAAW RLKY
Sbjct: 161 EAVKKH---MGYLSAEQ-------LIMELKEQFQLPQGTAVIGFGGSYGGMLAAWMRLKY 210
Query: 204 PHVAIGALASSAPILNFDNIVSPY---SFSNIITQDFRSVSEN---CYKVIKGSWKQIEE 257
PHV GA+A+SAPI NF V + SF+ +T D ++ + C +K +W +
Sbjct: 211 PHVLDGAIAASAPIWNFLGEVPAFDAGSFAKGVTYDASELAGSAPACVDNVKATWDVMAS 270
Query: 258 TAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL----NPLPAFP 313
G AM ++P PS+++ LPAFP
Sbjct: 271 YGNDDAGRGG---------------------------AMGNFPYPSSYITNGHGQLPAFP 303
Query: 314 VKEMCKAIDDPKTGNDVFA------KLYGAASVYYNYSGTAKCFDLNGDSDP---HGLSE 364
V+ C +P G D + + A V+YN++G +CFD +DP H +
Sbjct: 304 VRAAC----EPLAGGDDWVDADLLDAMAAAVGVFYNHTGDLECFDPFAGTDPDSDHDANW 359
Query: 365 WGWQACTEMIM 375
W +Q C EM+M
Sbjct: 360 WDYQWCAEMLM 370
>gi|76156383|gb|AAX27594.2| SJCHGC06818 protein [Schistosoma japonicum]
Length = 271
Score = 178 bits (452), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 102/232 (43%), Positives = 140/232 (60%), Gaps = 10/232 (4%)
Query: 46 SKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
+KDSQ Y+TKY +DHF++ F+ +YLIN+ + + PI YTGNEG I
Sbjct: 27 NKDSQFKYETKYFRTKIDHFSFVTDG--EFEIKYLINNESF---SSGGPILFYTGNEGAI 81
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
E FA+N+GF++ +A + A +VF EHRYYG S+P+G + ++K+ GYL++ QALAD
Sbjct: 82 ETFAENSGFIWKLAEELNASVVFAEHRYYGTSLPFGND---SFKDRQYFGYLTAEQALAD 138
Query: 166 YASLIIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIV 224
Y LI LK N + SPV+ FGGSYGGML+AW R KYP+ GA+ASSAP+ F +
Sbjct: 139 YVLLINQLKVNYSCFASSPVISFGGSYGGMLSAWIRQKYPNQIAGAIASSAPVWLFPGLS 198
Query: 225 SPYSFSNIITQDF-RSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC 275
FS + T F + +NC K I+ SW I + + G E L F IC
Sbjct: 199 DCNGFSLVATNSFLKYGGDNCVKNIQHSWSNIVDIGQSFDGKELLTNMFNIC 250
>gi|222615542|gb|EEE51674.1| hypothetical protein OsJ_33022 [Oryza sativa Japonica Group]
Length = 184
Score = 178 bits (451), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 82/133 (61%), Positives = 106/133 (79%), Gaps = 2/133 (1%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
LDHFN P S TF+QRYL+N T WGG+ AP+FVY GNEGD+ FA NTGFM++ AP+
Sbjct: 41 LDHFNELPASNGTFRQRYLVNGTFWGGAA--APVFVYAGNEGDVALFASNTGFMWEAAPR 98
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATD 181
F+A+LVF+EHRYYG+S+P+GG + A+ +AS GYL++ QALAD+A LI+ LK NLTA
Sbjct: 99 FRAMLVFVEHRYYGESLPFGGTRAAAFADASAAGYLTTAQALADFAELILSLKSNLTACK 158
Query: 182 SPVVVFGGSYGGM 194
+PVV+FGGSYGG+
Sbjct: 159 APVVIFGGSYGGI 171
>gi|241753990|ref|XP_002401169.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215508366|gb|EEC17820.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 261
Score = 177 bits (450), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 93/239 (38%), Positives = 140/239 (58%), Gaps = 11/239 (4%)
Query: 84 THWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGN 143
TH+ PIF Y G+E +E + NTG +++ A +FKA+++F EHR+YG+S+P
Sbjct: 27 THFWEPTKTGPIFFYCGHELPLEDYINNTGLLWNWAREFKAMIIFSEHRFYGQSLP---- 82
Query: 144 KEIAYKNASTTGYL---SSTQALADYASLIIDLKKNLTATDS-PVVVFGGSYGGMLAAWF 199
++ + T YL S+ QALADYA LI+ +K+ + D PV+ FGG YGGMLAA+F
Sbjct: 83 DDVRTSHLPTVPYLNYFSAVQALADYAHLILHIKETVHRADKVPVIAFGGFYGGMLAAYF 142
Query: 200 RLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETA 259
RLKYPH+ GALASSAP+ F +V +F + +T+ FR S C K I+ SW +
Sbjct: 143 RLKYPHLIAGALASSAPVQMFPGLVPCSAFDHTLTKAFRRESAACAKAIRKSWPHLLSLT 202
Query: 260 KKPGGLEKLQKAFRICKS---EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVK 315
+ + +++C + + + W+ +++ AM +YP PS+ L LPA PVK
Sbjct: 203 DSSKKAHEFSRKYKMCDNLTPQAQRMLLDWMHDTYLHLAMFNYPEPSSRLTSLPANPVK 261
>gi|294881794|ref|XP_002769498.1| Dipeptidyl-peptidase 2 precursor, putative [Perkinsus marinus ATCC
50983]
gi|239872962|gb|EER02216.1| Dipeptidyl-peptidase 2 precursor, putative [Perkinsus marinus ATCC
50983]
Length = 455
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 146/466 (31%), Positives = 207/466 (44%), Gaps = 94/466 (20%)
Query: 59 TQILDHFNY----NPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
TQ DHF++ NP FQQRY + + P+F Y GNEG +E + +TG
Sbjct: 34 TQDRDHFSFGEGGNPGK---FQQRYFTFKDFY---RPGGPLFFYVGNEGPVEIYVNHTGL 87
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
M+++ A LVF EHRYYGK+ Y + YL+ QALADY S++I+
Sbjct: 88 MWELGSDLGAFLVFAEHRYYGKTQVYSDGTPDCLR------YLTIEQALADY-SVMINTY 140
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIV---SPYSFSN 231
+ ++ + FGGSYGGMLA+ FR KYPH+ GA+A+SAPI + S +F+
Sbjct: 141 TRIASSLIATIAFGGSYGGMLASAFRYKYPHIIDGAIAASAPIFAIGGVTPEPSKTAFNE 200
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-------AIE 284
IIT+D V A R C + +L +
Sbjct: 201 IITRDAGPVC-----------------------------AQRWCDNSSHLNSSDLANQMV 231
Query: 285 SWLSTAFVYTAMTDYPTPSNFLNP---------LPAFPVKEMCKAID------DPKTGND 329
+W + + Y AM ++P PSN++ LPA PV+ C+ + K G
Sbjct: 232 AWATAPWAYLAMGNFPFPSNYITAAMNVGGGADLPAHPVRVACEPFERLDTMPTKKEGAH 291
Query: 330 VFAKLYGAASVYYNYSGTAKC-----FDLNGDSDPHGL--SEWGWQACTEMIM-LTGGDN 381
+ A L + S+YYN SG C D G P G ++G+Q CTEM G +
Sbjct: 292 IRA-LAESLSIYYNASGELACNSFAETDGGGAPIPEGSCKGDYGFQTCTEMPWGQDSGTD 350
Query: 382 KDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRD 441
+D + SE D D C + YGV + W +F + + SN+ F NG D
Sbjct: 351 RDMFWPPSEFDPDDYKAECLQKYGVTTK-AWAGLQFLRNMADALASM--SNVFFSNGKFD 407
Query: 442 PWSGGGVLKNI-----------SKTVLALVEKEGAHHVDLRFSTKE 476
PW I K+V + V + GAHH DL F+ E
Sbjct: 408 PWGVSASEDQIPQGVDCTVMYCPKSVASFVMETGAHHSDLMFTRDE 453
>gi|307200054|gb|EFN80400.1| Putative serine protease K12H4.7 [Harpegnathos saltator]
Length = 495
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 144/473 (30%), Positives = 207/473 (43%), Gaps = 57/473 (12%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
TQ LDHFN P+ +T+ RY N + N PI + G E I G MY++
Sbjct: 49 TQPLDHFN--PRENRTWSMRYYENSALL---RANGPILITIGGEWTISTGFLQGGLMYEI 103
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
A ++ + EHR+YGKS P +AS YLS QALAD A+ I KK
Sbjct: 104 ASVHGGMMYYTEHRFYGKSRPTKDT------SASNLRYLSVDQALADLANFIETKKKEKN 157
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
+SPV+VFGGSY G +A W RLKYPH+ GALASSAPI + Y + ++T+
Sbjct: 158 LENSPVIVFGGSYAGNMATWARLKYPHLIQGALASSAPIYAKADF---YEYYEVVTRSLG 214
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-----KSEKNLAIESWLSTAFVY 293
S C +K +++ +EE GG EKL+ F +C KS +L S A V+
Sbjct: 215 RHSAQCVADVKTAFESVEELLAAQGGPEKLKVYFDLCNVPDVKSPSDLG-SLMNSLAEVF 273
Query: 294 TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL 353
+ Y N + A +C + T + + L A V N + CFD+
Sbjct: 274 AEIVQYDKVENGRTKIAA-----LCAEM----TATHLGSPLQRLARVIANSDPGSACFDM 324
Query: 354 --------------NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEE-------SEED 392
+ + + +W Q CTE + SIF ++
Sbjct: 325 SYKNVIKKYRDISWDSPAAASAMRQWYHQTCTEYGYYQTTSSDKSIFGTLFPLSYFTDMC 384
Query: 393 YDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
D Y E +D R +GG + L +N+IF NG DPW VL+++
Sbjct: 385 IDLYGDYYNEKL-LDSRVKRTNMMYGGQRPDL------TNVIFTNGDIDPWHALSVLEDL 437
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+ A++ +H DL D + LK R + IIGKW+S + L +
Sbjct: 438 NAYAPAILINGSSHCRDLYSDADTDVEDLKKARAKVRSIIGKWLSSVKEVLVR 490
>gi|156537791|ref|XP_001608051.1| PREDICTED: putative serine protease K12H4.7-like [Nasonia
vitripennis]
Length = 476
Score = 176 bits (445), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 140/466 (30%), Positives = 211/466 (45%), Gaps = 62/466 (13%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-WFAQNTGF 114
K+ Q LDHFN+ +T+Q RY D ++ G PIF+ G E I F QN G
Sbjct: 47 KWIEQPLDHFNHRDN--RTWQMRYYEEDKYFNGI---GPIFIMLGGEWTINPGFLQN-GL 100
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
M+D+A + AL+ + EHRYYGKS P + ++ N YL+ QALAD A I + K
Sbjct: 101 MHDLAKQHGALMFYTEHRYYGKSYP---TQNMSSDNMQ---YLNVDQALADVAYFIDNRK 154
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
TDS V+VFGGSY G +AAW R+KYPH+ G++ASSAP+ + Y + ++
Sbjct: 155 SEYNITDSKVIVFGGSYAGNMAAWIRIKYPHLIQGSVASSAPVYAKADF---YEYYEVVA 211
Query: 235 QDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYT 294
R C ++ ++ + EE GG EK+QK F ICK+
Sbjct: 212 NSLRRHDSQCALDVENAFDETEELLVTEGGPEKIQKIFNICKTPN-------------VN 258
Query: 295 AMTDYPTPSNFLNPLPAFPVK------------EMCKAIDDPKTGNDVFAKLY----GAA 338
+MTD NFL+ + A V+ ++C + G + Y G
Sbjct: 259 SMTDVGYFMNFLSEVFASAVQYNKVVNGMSNIGQLCDTMTSASIGKPIERLAYLIRSGPK 318
Query: 339 SVYYNYSGTAKCFDLNGDS-DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARA 397
+Y K DL S + +W +Q CTE ++ S F + D
Sbjct: 319 CKDVDYKDMIK--DLRMSSWSTSAMRQWYFQTCTEFGYYQTANSSKSAFGRL-VNLDFFV 375
Query: 398 RYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
CK+ YG +D + +GG + N+IF NG DPW VLK
Sbjct: 376 NICKDVYGDYYERELLDSGISRTNIMYGGRLPDI------KNVIFVNGDVDPWHALSVLK 429
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
++++ A++ + +H DL+ + D L+ R++ I+ W+
Sbjct: 430 DVNEFSPAILIQGSSHCQDLQADSAGDVPELRTARKKIRNIVSGWL 475
>gi|428184604|gb|EKX53459.1| hypothetical protein GUITHDRAFT_133167 [Guillardia theta CCMP2712]
Length = 467
Score = 175 bits (444), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 139/473 (29%), Positives = 210/473 (44%), Gaps = 101/473 (21%)
Query: 74 TFQQRYLINDTHWG------GSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLV 127
T++QRY + +K IF Y GNE +E + NTG M++ A +F A++V
Sbjct: 9 TYRQRYFVCKCDQASVRITDATKKLQTIFFYFGNEDSVELYVNNTGLMWESASEFDAVMV 68
Query: 128 FIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTAT------- 180
F+EHRYYGKS+ + +E +L++ QAL D + + LK N
Sbjct: 69 FLEHRYYGKSVLFEPGRE------GCMEFLTTDQALLDASQFLSTLKANPKEILPKKISK 122
Query: 181 --DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY---SFSNIITQ 235
P++ FGGSYGGM+A+WFR+++PH+ G +A SAPIL+F+ + Y + ++TQ
Sbjct: 123 KPVGPIIGFGGSYGGMIASWFRMRFPHLIDGVIAGSAPILSFEGLRPAYDNGGYMRVVTQ 182
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-------------------- 275
D S C + + K + ++ G E L++A +C
Sbjct: 183 D---ASPLCARNFLRAQKAVYSVSQSEEGREFLKEALLLCVPPAAWRQAVLTGAVRCEDF 239
Query: 276 KSEKNLAIE--------------SWLSTAFVYTAMTDYPTPSNFLNP----LPAFPVKEM 317
E+ L + SW+ + + AM +YP S++L LP P+
Sbjct: 240 AREEILVLAPPMLPSSYALQNLISWMQDPWSFLAMGEYPYSSSYLTHGKCNLPPKPLSFA 299
Query: 318 CKAI----DDPKTGNDVFAKLYG--------------AASVYYNYSGTAKCFDLNGDSD- 358
C+ + DD T +DV A V+YN S +CF NG+S
Sbjct: 300 CQYLEMYSDDGITYSDVCCNRQAKSIDLPELLNATRYALGVFYNCSQDVQCF-FNGNSKQ 358
Query: 359 ---PHGL-------SEWGWQACTEMIMLTGGDNKDSIFEESEE-DYDARARYCKEAYGVD 407
P + S+ G + + GD F + ++ CKEA+GV
Sbjct: 359 MQIPRKMGSIKRFHSKEGRKKAEGADLSCVGDWGYQWFWPPHLWNAESAIAGCKEAWGVK 418
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
PR +W FG + ASNI+F NGL DPW GGVLKN+S V A+
Sbjct: 419 PRSHWAVVRFGDRDLSA-----ASNILFSNGLLDPWYVGGVLKNVSSNVRAVC 466
>gi|340715503|ref|XP_003396251.1| PREDICTED: putative serine protease K12H4.7-like [Bombus
terrestris]
Length = 479
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 156/519 (30%), Positives = 230/519 (44%), Gaps = 67/519 (12%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSSRIT-PEKLSSLISSSKDSQGLYKTKYHTQILD 63
IFLSF +L T + + F + P LS S+ ++ L+ Q +D
Sbjct: 1 MIFLSFLVLLCFTCRLGHGVGLHGFHFHGLEEPRSLSK--SACENITELWIR----QPVD 54
Query: 64 HFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFK 123
HFN +T+ RY N ++ K N PI + G E I G MY++A +
Sbjct: 55 HFNIRDN--RTWLMRYYENSRYF---KKNGPILIMIGGEWAISKGFLEAGLMYELATTYN 109
Query: 124 ALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSP 183
A++ + EHRYYGKS P ++ + +N YLS QALAD A I K++ +S
Sbjct: 110 AIMYYTEHRYYGKSKP---TEDTSSRNLQ---YLSVDQALADLAYFIETRKRDENLRNSK 163
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN 243
V+VFGGSY G +A W RLKYPH+ GALASSAP+L V Y + ++T+ R S+
Sbjct: 164 VIVFGGSYAGNVATWVRLKYPHLVQGALASSAPVL---AKVDFYEYYEVVTESLRRYSQK 220
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-----KSEKNLA-IESWLSTAFVYTAMT 297
C +K ++ ++EE GG +KL K F +C S K+L + L+ F +
Sbjct: 221 CVNEVKAAFDEVEELLAIKGGAQKLTKYFNLCDVPDVHSFKDLGHFGNLLAEEFA--GIV 278
Query: 298 DYPTPSNFLNPLPAF-----------PVKEMCKAIDDP-KTGNDVFAKLYGAASVYYNYS 345
Y N + A P++ + + D K + + K VY N +
Sbjct: 279 QYDKVENNRTKIAACCENMTAMYLGSPLQRLAHLVSDKDKCLKNNYKKF---VEVYRNET 335
Query: 346 GTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
DS P W +Q CTE +++ S+F + C + YG
Sbjct: 336 ---------WDSQPDITRLWFYQTCTEYGYYQTTNSRRSVF-GTLFPLPYFTGLCTDLYG 385
Query: 406 -------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
+ R T +GG + L N+IF NG DPW VLK+++ A
Sbjct: 386 YYYGNRFLYTRIGRTNTMYGGLRPDL------QNVIFTNGDVDPWHTLSVLKDLNAFSPA 439
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
++ K +H DL D + L R R +IIG WIS
Sbjct: 440 ILIKGSSHCRDLYSDLDTDAEDLIRARARVRKIIGTWIS 478
>gi|391335934|ref|XP_003742341.1| PREDICTED: putative serine protease K12H4.7-like [Metaseiulus
occidentalis]
Length = 486
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 138/483 (28%), Positives = 232/483 (48%), Gaps = 63/483 (13%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
S+S+ S G + ++HTQ ++HF+ P +T++QRY++ND + + P+F+ G EG
Sbjct: 36 STSRHSFGEAEIRWHTQRMNHFD--PADRRTWKQRYMVNDEFY---REGGPVFLLLGGEG 90
Query: 104 D--IEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQ 161
+ I W +NT M +A K AL+ +EHR+YG+S P +++ +N YLSS Q
Sbjct: 91 EASISWVEKNTHVML-MAKKHNALVFQLEHRFYGQSRP---TSDLSTENLV---YLSSEQ 143
Query: 162 ALADYASL--IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILN 219
ALAD A +I ++NL+ D+ VVFGGSY G LAAWF+LKYPH+A+GA+ASSAP+L
Sbjct: 144 ALADAAHFRNVITNRRNLSP-DAKWVVFGGSYSGSLAAWFKLKYPHLAVGAVASSAPLL- 201
Query: 220 FDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---K 276
I+ + ++ S+ +C +K ++ ++ A + + F+ C
Sbjct: 202 --AIIDFQDYVRVVRD---SLGSSCSAKVKDGFQALQVKAARRSSWPDIDNEFKTCVPFD 256
Query: 277 SEKNLAIESWLST-AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLY 335
+L ++++ T A + + Y + + ++C+ +++ T L
Sbjct: 257 GYNSLNLQNFFQTIAGNFEGIVQY-NKDQRMEGRTNISIDDLCRLMENAPT------PLE 309
Query: 336 GAASV--YYNYSGTAKCFDLN-------------GDSDPHGLSEWGWQACTEMIMLTGGD 380
G ASV S +KC D + G +W +Q C E D
Sbjct: 310 GLASVNDLLLESTDSKCLDYDYAKFVREMRNVSYSSVAAEGGRQWTYQTCVEFGFFQSSD 369
Query: 381 NKDSIFEESEEDYDARARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNI 433
+D F + + + C++ + +D TE+GG + L +N+
Sbjct: 370 AEDQPFGDLFP-VELFIQQCRDIFDDFFDQSMLDNAIFRTNTEYGGQQPKL------TNV 422
Query: 434 IFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIG 493
F NG DPW +LKN+S +V A + AH D+ + ED + L R++ +
Sbjct: 423 TFPNGSIDPWHALSILKNLSDSVTAHFIEGTAHCADMYPPSAEDDETLTAGRQKIEAEVA 482
Query: 494 KWI 496
KW+
Sbjct: 483 KWL 485
>gi|110749179|ref|XP_623676.2| PREDICTED: putative serine protease K12H4.7-like [Apis mellifera]
Length = 478
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 146/464 (31%), Positives = 209/464 (45%), Gaps = 63/464 (13%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHFN P+ +T+ RYL N + K N PI + G E I G MY++A
Sbjct: 51 QPLDHFN--PRDNRTWSMRYLENSRFF---KENGPILIMIGGEWAISKGFLRAGLMYELA 105
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
A + + EHRYYGKS P + + +N YLS QALAD A I KK+ +
Sbjct: 106 SNHSASMYYTEHRYYGKSKP---TNDTSSRNLQ---YLSVDQALADLAYFIKTKKKDESR 159
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRS 239
+S V+VFGGSY G +A+W RLKYPH+ GALASSAP+L + Y ++T+ R
Sbjct: 160 RNSTVIVFGGSYAGNVASWARLKYPHLIQGALASSAPVLAKLDFNEYYE---VVTESLRR 216
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-----KSEKNLA-IESWLSTAFVY 293
SE C + IK ++ ++EE G ++L++ F +C KS +LA S L+ +F
Sbjct: 217 YSEKCVEEIKTAFDEVEELLYIENGPQRLKQYFNLCDVPNIKSFNDLAHFGSLLAESFAS 276
Query: 294 TAMTD---------YPTPSNFLNPLPAFPVKEMCKAIDDP----KTGNDVFAKLYGAASV 340
D N P++ + + K D F LY +
Sbjct: 277 VVQYDKVENGRTKIASCCENMTATYLGSPLQRLAHFVSSKDKCLKNNYDKFVTLYRNET- 335
Query: 341 YYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
+N S + +W +Q CTE D+ SIF S C
Sbjct: 336 -WNQSDIMR--------------QWYYQTCTEYGYYQTTDSTRSIF-GSLFPLPYFTNIC 379
Query: 401 KEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS 453
++ YG ++ R +GG + L N+IF NG DPW VL++++
Sbjct: 380 QDLYGEYYNRDFLNNRIKRTNMMYGGLRPDL------RNVIFTNGDVDPWHALSVLQDLN 433
Query: 454 KTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
A++ K +H DL + D + L R R EIIG WIS
Sbjct: 434 AFSPAVLIKGSSHCRDLYSDSNTDAEDLIRARVRIREIIGSWIS 477
>gi|350422894|ref|XP_003493318.1| PREDICTED: putative serine protease K12H4.7-like [Bombus impatiens]
Length = 478
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 143/463 (30%), Positives = 211/463 (45%), Gaps = 60/463 (12%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q +DHFN T+ RY N ++ K N PI + G E I G MY++A
Sbjct: 50 QPVDHFNVRNNC--TWLMRYYENSRYF---KKNGPILIMIGGEWAISKGFLEAGLMYELA 104
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
+ A++ + EHRYYGKS P ++ + +N YLS QALAD A I KK+
Sbjct: 105 SAYNAIMYYTEHRYYGKSKP---TEDTSSRNLQ---YLSVDQALADLAYFIETRKKDEKL 158
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRS 239
+S V+VFGGSY G +A W RLKYPH+ GALASSAP+L V Y + ++T+ R
Sbjct: 159 RNSKVIVFGGSYAGNVATWVRLKYPHLVQGALASSAPVL---AKVDFYEYYEVVTESLRR 215
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-----KSEKNLA-IESWLSTAFVY 293
S+ C +K ++ +EE GG +KL++ F +C S K+LA + + L+ F
Sbjct: 216 HSQKCMDEVKAAFDDVEELLAIQGGAQKLKEYFNLCDVPDVHSFKDLAHLGNLLAEEF-- 273
Query: 294 TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL 353
++ Y N + A C+ + G+ + + + S KC
Sbjct: 274 ASIVQYDKVENNRTKIAA-----CCENMTASYLGSPL-------QRLAHLVSNKDKCLKN 321
Query: 354 N------------GDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
N DS P + W +Q CTE +++ S+F S C
Sbjct: 322 NYNKFVEVYRNEIWDSQPDIMRLWFYQTCTEYGYYQTTNSRKSVF-GSLFPLPYFTGLCT 380
Query: 402 EAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
+ YG + R T +GG + L N+IF NG DPW VL+++++
Sbjct: 381 DLYGYYYGNRFLYTRIGRTNTMYGGLRPDL------QNVIFTNGDVDPWHALSVLQDLNE 434
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
A++ K +H DL D + L R R +IIG WIS
Sbjct: 435 FSPAILIKGSSHCRDLYSDLDTDVEDLIRARARVRKIIGTWIS 477
>gi|410931333|ref|XP_003979050.1| PREDICTED: dipeptidyl peptidase 2-like, partial [Takifugu rubripes]
Length = 317
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 104/320 (32%), Positives = 157/320 (49%), Gaps = 29/320 (9%)
Query: 191 YGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKG 250
YGGML+ + RL+YP++ GALA+SAPIL+ + P F FR V+ +
Sbjct: 1 YGGMLSVYMRLRYPNMVAGALAASAPILSTAGLGDPSQF-------FRDVTAVSPVCLSA 53
Query: 251 SWKQIEETAKKPGGLEKLQKAFRICKS----EKNLAIESWLSTAFVYTAMTDYPTPSNFL 306
+ ++Q+ +CK E + L AF AM DYP ++F+
Sbjct: 54 CLSLSALLSLLLPDYRRIQEKLSLCKPPSSPEDIYQLNGLLRNAFTLMAMLDYPYSTHFM 113
Query: 307 NPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNG----DSDPHGL 362
+PA PVK C+ + + + L A + YN +G CFDL +DP G
Sbjct: 114 GNMPANPVKVACETM---LRASGLLENLRDTAGIVYNSTGALGCFDLYSLYVQCADPTGC 170
Query: 363 S------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE 416
W +QACTE+ + +N+ +F R YC + + V PRP W+ T+
Sbjct: 171 GLGSNSLAWDYQACTEINLCYDSNNETDMFPPMTFGETERNIYCSKRWAVLPRPRWLQTQ 230
Query: 417 FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKE 476
F G + ASNIIF NG DPW+ GGV K++S +++A+ GAHH+DLR S
Sbjct: 231 FWGDALSA-----ASNIIFSNGDLDPWANGGVRKSLSSSLIAVNIPGGAHHLDLRGSNDA 285
Query: 477 DPQWLKDVRRREVEIIGKWI 496
DP+ + R+ E ++I +W+
Sbjct: 286 DPESVIKARKTEADLIAQWV 305
>gi|242020732|ref|XP_002430805.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212516008|gb|EEB18067.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 478
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 139/494 (28%), Positives = 216/494 (43%), Gaps = 62/494 (12%)
Query: 31 SSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSK 90
SS ITPE+ KY Q LDHF+ P + +T+ QRY +ND+ + +
Sbjct: 21 SSEITPEE-----------------KYFVQKLDHFD--PTNTKTWNQRYFVNDSFY---Q 58
Query: 91 NNAPIFVYTGNEGDI--EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAY 148
N P F+ G EG+ +W T D A K+ A V +EHR+YGKS P +++
Sbjct: 59 PNGPFFLMIGGEGEASPKWMVNGTWL--DYAKKYNAYCVMVEHRFYGKSHP---TEDLGV 113
Query: 149 KNASTTGYLSSTQALADYASLIIDLKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVA 207
KN YLSS QAL D A I L L P +V GGSY G LAAW RLKYPH+
Sbjct: 114 KNLK---YLSSEQALGDLAYFISSLNNKLNIFPPPKWIVMGGSYPGSLAAWMRLKYPHLV 170
Query: 208 IGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEK 267
+GA+++S P+L N + +++ S + C I+ KQI P G
Sbjct: 171 LGAVSTSGPLLALINF---EEYFDVVKDSLSSYNPECVTAIEAGTKQIMSLLIHPLGQRS 227
Query: 268 LQKAFRIC------KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI 321
L K F++C + N + L+ F + + + + +C +
Sbjct: 228 LFKMFKLCDPLELNNEDDNSNLFESLAGNFAGVVQYNKDNRHDQNSGGSDLTIDYLCDIM 287
Query: 322 DDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGD-------------SDPHGLSEWGWQ 368
+ G ++ +L V N + KC D D G +W +Q
Sbjct: 288 LNQSLGKEI-NRLAVVNEVVLNKTTKEKCLDYKYDKMIKQMQLTDWKSEVAEGGRQWTYQ 346
Query: 369 ACTEM-IMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLV-- 425
CTE T N ++ + + C + +G+ N TE G + ++
Sbjct: 347 TCTEFGFFQTSSLNTTKQMFGNKFPPEFFLKQCTDIFGIKYNAN--LTEEGIIRTNMIYG 404
Query: 426 -LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDV 484
L A NI+F +G DPW G+ K + A+ + AH ++ S+++DP L D
Sbjct: 405 GLNLVADNIVFVHGSIDPWHALGITKTLRPGAPAIYIQGTAHCANMYPSSEKDPPQLVDA 464
Query: 485 RRREVEIIGKWISQ 498
R++ ++IG+W+ +
Sbjct: 465 RKQIEQLIGEWLKE 478
>gi|291236175|ref|XP_002738015.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 515
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 149/532 (28%), Positives = 241/532 (45%), Gaps = 65/532 (12%)
Query: 1 MATRFIFLSFCLLFSSTLTISNAKIFP------TFPSSRITPEKLSSLISSSKDSQGLY- 53
M T + LSF +F + +I + + R KL S +S +Y
Sbjct: 1 MKTLLLVLSFTCVFEISNSILGTHFWKFQEKVGQYKKKRFLKNKLRWQKVFSPNSHLMYS 60
Query: 54 --KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN 111
+ Y Q LDHF+ P ++QRY +N T+W +K N P+F++ G EG + +
Sbjct: 61 EFEDLYLEQPLDHFD--PLVTDIYEQRYWVNPTYW--NKENGPVFLFIGGEGALGAYDVE 116
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
G D+A K+ AL+ +EHR+YG SI G K YLSS QALAD AS
Sbjct: 117 EGEHVDLAKKYGALIFAVEHRFYGASINKDGLK------LEYLQYLSSQQALADLASFHR 170
Query: 172 DLKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL---NFD--NIVS 225
T S + + FGGSY G L+AWFRLKYPH+ GA+ASSAP+ NF+ N V
Sbjct: 171 FATSKYNITQSNIWICFGGSYPGSLSAWFRLKYPHLVYGAIASSAPVRVVKNFEGYNQVV 230
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES 285
S ++ + + S+N I ++K I++ K + L+ F+ C + I S
Sbjct: 231 AASLADPVVKGSLKCSDN----IAAAFKIIDQKIKDK-QFDTLKADFKSCNN-----ISS 280
Query: 286 WLSTA-FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNY 344
+ TA F+ + + N +P + V +C+ + P + D K ++Y +
Sbjct: 281 YNDTALFLNNLAGIFMGIVQYNNEMPDWNVAAVCQNMTQPASPYDNLVKF---TTIYLDG 337
Query: 345 SGTAKCFDLNGDS------------DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEED 392
G +CFD + D+ + G+ +W +Q C++ D + D
Sbjct: 338 MG-QECFDNSYDNFIQELEDTTPTEEGVGVRQWTYQTCSQFGYYQTCDQNTTCLFSPLID 396
Query: 393 YDARARYCKEAYG-----VDPRPNWITTEFGG-HKIGLVLKRFASNIIFFNGLRDPWSGG 446
+ C +G VD + ++ + +G H G + I+F NG DPW
Sbjct: 397 LKSSLEVCTTVFGIHGKIVDKQVDFTNSYYGANHPKG-------TRIVFVNGSIDPWHAL 449
Query: 447 GVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
VL+N S + +++ AH +++ DP L + R++ IG+W+++
Sbjct: 450 SVLRNESPSQISIYINGTAHCANMKSQQPTDPPSLVEARQKIDAQIGEWLNE 501
>gi|323451276|gb|EGB07154.1| hypothetical protein AURANDRAFT_28266, partial [Aureococcus
anophagefferens]
Length = 311
Score = 171 bits (433), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 113/308 (36%), Positives = 169/308 (54%), Gaps = 26/308 (8%)
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG- 113
Y TQ LDHF ++ +TF Q+ L++D W + P+ +Y GNEG IE F N+G
Sbjct: 20 VAYVTQQLDHFRFD--ETRTFSQKLLVHDA-W--HRPGGPLLMYFGNEGAIEDFYGNSGG 74
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYG----GNKEIAYKNASTTGYLSSTQALADYASL 169
M+++APK A + F+EHRYYG S+P+G G+ E+A+ L+ QALAD A +
Sbjct: 75 LMFELAPKLNASVAFLEHRYYGSSLPFGNASYGSDELAF--------LTVEQALADMALV 126
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
+ + L A D P V+FGGSYGGMLAAWF LKYPH+A GA+A+SAP+ + F
Sbjct: 127 LATSSEILGAADGPAVLFGGSYGGMLAAWFMLKYPHLAAGAVAASAPVDLYPGEGKERPF 186
Query: 230 SNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IE 284
+ + + + S C ++ + + AK G + L ++FR C+ + +
Sbjct: 187 FDAGLEVYGTYGSAACEADLRAALAALAAAAKTAEGRDALARSFRTCEPLPDPVDGDRLT 246
Query: 285 SWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNY 344
S+++ A AM DYP S F+ P+PA PV+ C + +KL GA V+ N+
Sbjct: 247 SYVNGALSTLAMLDYPYASAFVAPMPANPVRVACGRV---AAAPSAASKLKGAVDVFLNH 303
Query: 345 SGTAKCFD 352
+G C+D
Sbjct: 304 TGETACYD 311
>gi|312090033|ref|XP_003146464.1| hypothetical protein LOAG_10893 [Loa loa]
Length = 390
Score = 171 bits (432), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 121/348 (34%), Positives = 173/348 (49%), Gaps = 34/348 (9%)
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI-VSPYSFSNIITQDF 237
T+ PV+VFGGSYGGMLAAW R+KYPH+ GA ASSAP+ F ++P S S IT ++
Sbjct: 2 GTEIPVIVFGGSYGGMLAAWLRMKYPHIVDGAWASSAPLRIFYGTGINPESVSRTITTNY 61
Query: 238 RSVSENC-YKVIKGSWKQIEETAKKPGGLEKLQKAFRI-----CKSEKN-LAIESWLSTA 290
++ C KV + IE+ +K G KL + F KS + +++ S++ +A
Sbjct: 62 --LTSGCDRKVFSDGFVAIEKMSKTEEGRMKLNRIFHAKPGFEMKSYNDFMSLYSYIYSA 119
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA-KLYGAASVYYNYSG--T 347
Y AMTDYP P++F PLP +PVK +C+ T + A ++Y +VYYNY+G T
Sbjct: 120 IFYMAMTDYPYPADFFEPLPGYPVKYVCQYAKKAATNEENLAEQIYSIINVYYNYTGQLT 179
Query: 348 AKCFDLNGDSDPHGLSE------WGWQACTEM---IMLTGGDNK--DSIFEESEEDYDAR 396
CF N + P + W WQ+CT + I GGDN + + S +
Sbjct: 180 DNCFTSNC-TTPSPIQNDDEDIAWNWQSCTSLTIQICDRGGDNDFFLNTCDNSGDPVSTN 238
Query: 397 ARYCKEAY-GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN---- 451
+ C E + + N+ + G++ SNIIF NG DPWS GGV +N
Sbjct: 239 IKLCTELFKDIGYNNNFYKLHDVTIRYGMIYNT-TSNIIFSNGNLDPWSAGGVYENSPGI 297
Query: 452 ---ISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ V AHH+D R DP + R + V II W+
Sbjct: 298 MEAMKNGVYIFYMLGAAHHLDFRTPNTCDPPSVTHERFQVVNIIKCWV 345
>gi|156717482|ref|NP_001096281.1| protease, serine, 16 (thymus) precursor [Xenopus (Silurana)
tropicalis]
gi|134025460|gb|AAI35535.1| LOC100124847 protein [Xenopus (Silurana) tropicalis]
Length = 506
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 141/458 (30%), Positives = 221/458 (48%), Gaps = 42/458 (9%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y Q LDHFN S TF QRY IN+ +W + N P+F+Y G E + F+ +G
Sbjct: 64 YIAQPLDHFNRRNNS--TFNQRYWINEEYW--NHPNGPVFLYIGGESSLSEFSVLSGEHV 119
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYAS--LIIDLK 174
D+A +ALLV +EHRYYG SI G + +N +LSS QALAD AS + I K
Sbjct: 120 DLAQTHRALLVSLEHRYYGSSINPDG---LTLENIR---FLSSQQALADLASFHMFISQK 173
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFD--NIVSPYSF 229
NLT ++ + FGGSY G L+AWFRLK+PH+ A+ASSAP+ L+F N V +S
Sbjct: 174 YNLTRQNT-WICFGGSYPGSLSAWFRLKFPHLVYAAVASSAPVRAELDFTGYNKVVAWSL 232
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLST 289
++ + SE C +K ++ ++ +K G + +L+K F C S + S T
Sbjct: 233 ADPVIGG----SEKCLDAVKEGFQAVDSLLQK-GNITQLEKDFYSCGSLQG----SDDYT 283
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCK--------AIDDPKTGNDVFAKLYGAASVY 341
FV + + PA V+++C+ A + ++ N ++ G + VY
Sbjct: 284 EFVGNLADIFMGAVQYNGMSPASNVQQICQLMTIKDNSAYEGLRSVNRMYMDFMGLSCVY 343
Query: 342 YNYSGTAKCFDLNGDSDPH---GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
+++ + DL+ G +W +Q CTE ++ F S ++
Sbjct: 344 NSHAKSVA--DLSSTKLSLVGVGERQWFYQTCTEFGYYQTCEDPSCPF-SSLITLKSQLD 400
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
C + + V + +F G + +S IIF NG DPW VLKN S++ +A
Sbjct: 401 LCSQIFQVPTESVLQSVQFTNEFYGADHPK-SSRIIFVNGDVDPWHALSVLKNQSRSEIA 459
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
++ +H ++ S DP L++ R+ + W+
Sbjct: 460 ILINGTSHCANMSPSHTSDPLSLQEARKEIAAQVATWL 497
>gi|187447082|emb|CAO84855.1| ENSANGG00000016082 protein [Anopheles gambiae]
Length = 227
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 89/226 (39%), Positives = 132/226 (58%), Gaps = 9/226 (3%)
Query: 155 GYLSSTQALADYASLIIDLKK-NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALAS 213
GYL+S QALAD+A ++ L N T PV+ FGGSYGGMLAAW R+KYPH+ GA+A+
Sbjct: 2 GYLTSEQALADFALVLRTLNPPNGTTRARPVIAFGGSYGGMLAAWIRIKYPHLVAGAIAA 61
Query: 214 SAPILNFDNIVSPYSFSNIITQDFR-SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
SAP+ F + F+ I+T ++ + + +C I+ SW ++ + GL L + F
Sbjct: 62 SAPVRQFAGVTDCGIFNQILTSVYQVAYTADCADNIRRSWTTLQNYSTSADGLRLLNEKF 121
Query: 273 RICK-----SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG 327
+ C ++ + +L+ + AM +YP PS+FL P+PA+PV+E C + TG
Sbjct: 122 KFCTNLTKGTDVTETLFDYLTDVYGNXAMINYPYPSSFLAPVPAYPVREFCGRLAQNYTG 181
Query: 328 NDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGW--QACT 371
++ L A S+YYNY G A C ++N D G+S+ GW QACT
Sbjct: 182 VELLDHLQSALSIYYNYDGKAACLNINSSYDGTGISDRGWDFQACT 227
>gi|187447074|emb|CAO84851.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447076|emb|CAO84852.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447078|emb|CAO84853.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447080|emb|CAO84854.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447084|emb|CAO84856.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447086|emb|CAO84857.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447088|emb|CAO84858.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447090|emb|CAO84859.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447092|emb|CAO84860.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447094|emb|CAO84861.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447096|emb|CAO84862.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447098|emb|CAO84863.1| ENSANGG00000016082 protein [Anopheles gambiae]
Length = 227
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 89/226 (39%), Positives = 132/226 (58%), Gaps = 9/226 (3%)
Query: 155 GYLSSTQALADYASLIIDLKK-NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALAS 213
GYL+S QALAD+A ++ L N T PV+ FGGSYGGMLAAW R+KYPH+ GA+A+
Sbjct: 2 GYLTSEQALADFALVLRTLNPPNGTTRARPVIAFGGSYGGMLAAWIRIKYPHLVAGAIAA 61
Query: 214 SAPILNFDNIVSPYSFSNIITQDFR-SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
SAP+ F + F+ I+T ++ + + +C I+ SW ++ + GL L + F
Sbjct: 62 SAPVRQFAGVTDCGIFNQILTSVYQVAYTADCADNIRRSWTTLQNYSTSADGLRLLNEKF 121
Query: 273 RICK-----SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG 327
+ C ++ + +L+ + AM +YP PS+FL P+PA+PV+E C + TG
Sbjct: 122 KFCTNLTKGTDVTETLFDYLTDVYGNLAMINYPYPSSFLAPVPAYPVREFCGRLAQNYTG 181
Query: 328 NDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGW--QACT 371
++ L A S+YYNY G A C ++N D G+S+ GW QACT
Sbjct: 182 VELLDHLQSALSIYYNYDGKAACLNINSSYDGTGISDRGWDFQACT 227
>gi|291240192|ref|XP_002740004.1| PREDICTED: predicted protein-like, partial [Saccoglossus
kowalevskii]
Length = 381
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/256 (37%), Positives = 139/256 (54%), Gaps = 22/256 (8%)
Query: 263 GGLEKLQKAFRICK---SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK 319
GGL+KL K R+CK ++ + W+ + AM DYP P+ FL P+PA+PVK C
Sbjct: 53 GGLQKLTKILRLCKPMTKDRLNHVNGWIRNSLTNLAMFDYPYPTTFLVPVPAYPVKVACS 112
Query: 320 AIDDPKTGNDVFAKLYGAASVYYNYS-GTAKCFDLNGD----SDPHGLS------EWGWQ 368
I +D L AA +YYN + G+ KCFD++ + +DP G W +Q
Sbjct: 113 YI---MNSSDPLVGLVQAAGLYYNGTKGSLKCFDVDTEFVECADPTGCGVGPDSMAWDYQ 169
Query: 369 ACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKR 428
ACTE +M G + K +F + R YC++ + + PR +W+ G I
Sbjct: 170 ACTESMMPAGSNGKTDMFPDLPFTLKMRDEYCEKKWNIVPRNDWLNVHLWGKDI-----L 224
Query: 429 FASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRRE 488
ASNI+F NG DPW GGVL ++S +++A++ GAHH+DLR + DPQ + + R+ E
Sbjct: 225 TASNIVFANGSLDPWRRGGVLSSLSDSLIAVLIDGGAHHLDLRGANPLDPQSVLNARQEE 284
Query: 489 VEIIGKWISQYFQDLA 504
V+ I KWI Q + LA
Sbjct: 285 VKYIQKWIEQEQEALA 300
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 35/58 (60%), Gaps = 4/58 (6%)
Query: 132 RYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGG 189
RYYGKS+P+G + I G+LS QA+ADYA LI LK L A PV+ FGG
Sbjct: 1 RYYGKSLPFGDSSFIL----GNIGFLSIEQAMADYAVLIHYLKIKLNAAKCPVIAFGG 54
>gi|281212302|gb|EFA86462.1| hypothetical protein PPL_00255 [Polysphondylium pallidum PN500]
Length = 482
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 132/444 (29%), Positives = 208/444 (46%), Gaps = 39/444 (8%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T + TQ++DHF+ P + +TFQQ+Y + D ++ G+ PIF + E + +F
Sbjct: 46 ETFWFTQLVDHFD--PNNDETFQQQYQVIDDYFDGT---GPIFFFLAGEAPMGFFNFQEV 100
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+++ A KF AL V +EHR+YG S P + + N YL+S QALAD A+ +
Sbjct: 101 QIWNWADKFNALYVVLEHRFYGASNP---TNDFSTPNLR---YLTSQQALADAANFLTSF 154
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS-FSNI 232
K +PVVVFG SY G L+AWFRLKYP + + ++A S P+L N YS FSN
Sbjct: 155 KAERGLESAPVVVFGCSYSGALSAWFRLKYPQLVVASVAPSGPVLAQLNYTGYYSQFSN- 213
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
S + +C + + QI G ++L K F C + +N + F+
Sbjct: 214 ------SAAPDCVAAAQTATNQIMALTTSKSGRDQLAKTFNSCSNLENPRDLYY----FL 263
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV---YYNYSGTAK 349
YT +T+ ++ +N P + + C+ + + D +A++ Y+
Sbjct: 264 YT-LTEALGSADQMNNPPTWGLNTTCQTLTQTSSLLDNWAQIVAGGQTGCQDYSLKSFID 322
Query: 350 CFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY---GV 406
D G W WQ C E + S+F + + + + ++C+E + G+
Sbjct: 323 SMRKTNSKDQDGSRSWLWQTCVEFGYFSTTYPGTSVFPPT-LNVEEQVKWCEEIFDIKGM 381
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK-NISKTVLALVEKEGA 465
P W + +GG +I SNI+F NGL DPW V + N+ TV A E
Sbjct: 382 TPNIAWTNSYYGGQQIQ------GSNIMFTNGLLDPWHLLSVNEPNLEGTVQA-ATYEAG 434
Query: 466 HHVDLRFSTKEDPQWLKDVRRREV 489
H L ST DP L R +++
Sbjct: 435 HCGTLIQSTSIDPPSLIAARAQKL 458
>gi|118346661|ref|XP_976865.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89288576|gb|EAR86564.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 480
Score = 168 bits (426), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 147/481 (30%), Positives = 219/481 (45%), Gaps = 57/481 (11%)
Query: 37 EKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIF 96
E + I +Q +Y Y+TQ+LDHFN N Q +T+QQRY I + + N +F
Sbjct: 25 EVFKTFIDGKNITQSIY---YYTQVLDHFNPNDQ--RTWQQRYAIYSDEY--NPVNGTVF 77
Query: 97 VYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY 156
VY G EG + + G+M ++A KF AL + +EHR+YG S P+G + E +Y N + Y
Sbjct: 78 VYIGGEGKQKGLSPGLGWMVELAKKFSALFLIVEHRFYGASQPFGKD-ENSYSNQNL-AY 135
Query: 157 LSSTQALADYASLIID---LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALAS 213
LS QAL D A +I + L+ + + + P + GGSY G ++AWFR KYPH+ +GALAS
Sbjct: 136 LSVEQALEDLAQIIANFKTLRLHGLSENVPFITIGGSYPGAVSAWFRSKYPHLVVGALAS 195
Query: 214 SAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPG-GLEKLQKAF 272
SA IL ++ Y + I RS + C + I+ KQ+E G EK+ + F
Sbjct: 196 SAVILPVEDF-QQYDYQ-IYLSTLRS-GQWCPQNIQAFNKQLESILVNGGEQAEKIIQQF 252
Query: 273 RICKSEKNLAIESWLS-TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF 331
+ NL + +LS +Y+ + Y S +C T D
Sbjct: 253 ----NATNLRQDEFLSFFGDLYSGLVQYGRRS------------LLCNFFAQNTTFYDQL 296
Query: 332 AKLYGAASV--------YYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKD 383
+Y A V Y Y+ T + D D G +W WQ CTE N+
Sbjct: 297 NSIYQYAIVQGNQPIEAYDTYTLTNTTY----DEDAAG-RQWVWQTCTEFGWFQTA-NQV 350
Query: 384 SIFEESEEDYDARARYCKEAYG---VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLR 440
+ D + C A+ DP FGG KIG A+NI+F NG+
Sbjct: 351 QPMRSKQVDLNFYRYICNVAFDGEHDDPDITANVNRFGGLKIG------ATNIVFTNGIE 404
Query: 441 DPWSGGGVLKNISKTVLALVEKEG-AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
D W + ++ + + AH + R DP L+ R++ I +WI Q+
Sbjct: 405 DEWQWASLRQSTPQLTSIFNNCDNCAHCQEFRTPKPTDPPGLQSTRKQVEAIFAQWIHQF 464
Query: 500 F 500
+
Sbjct: 465 Y 465
>gi|332026663|gb|EGI66772.1| Putative serine protease K12H4.7 [Acromyrmex echinatior]
Length = 481
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 137/463 (29%), Positives = 210/463 (45%), Gaps = 56/463 (12%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
TQ LDHFN+ + +T+ RY N K N PI + G E +I G MY++
Sbjct: 50 TQPLDHFNH--RDNRTWSMRYKENSAFL---KKNGPILIMIGGEWEITNGFLQGGLMYEL 104
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
K+ L+ + EHR+YG+S P K+I+ +N YL++ QALAD A I KK
Sbjct: 105 GVKYHGLMYYTEHRFYGQSRP---TKDISTENLQ---YLNADQALADLAYFIDTKKKEKN 158
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
S V+V GGSY G +AAW RLKYPH+ GALASSAP+ + Y + ++T
Sbjct: 159 LEKSIVIVVGGSYAGNMAAWARLKYPHLIQGALASSAPVRAKADF---YEYYEVVTDALG 215
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-----KSEKNLAIESWLST-AFV 292
S+ C + +K ++ +EE G +KL+ F++C +S +L +++T + +
Sbjct: 216 KYSKTCIESVKTAFASVEELLAMRAGPQKLKLLFKLCHVPDVRSSSDLGY--FMNTLSEI 273
Query: 293 YTAMTDYPTPSNFLNPLPAF-----------PVKEMCKAIDDPKTGNDVFAKLYGAASVY 341
+ + Y N + A P++ + + + K NDV Y
Sbjct: 274 FAGVVQYNKIENSETGIAALCNKMTAKHLGSPLQRLARIFSNQKKCNDV---------NY 324
Query: 342 YNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
N+ T + + + + +W Q CTE ++ SIF + + C
Sbjct: 325 NNFLKTYREISWDSPAATSIMRQWYHQTCTEYGYYQTTNSNKSIFGKLFP-LNYFINLCT 383
Query: 402 EAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
+ YG +D +GG L N+IF NG DPW VL++++
Sbjct: 384 DLYGDYHNKKILDSHVRRTNIMYGGKLPDL------RNVIFTNGNSDPWHPLSVLQDLNA 437
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
A+V +H DL DP LK R + +IIGKWIS
Sbjct: 438 FSPAIVINGSSHCRDLYSDVTTDPDNLKAARAKIRKIIGKWIS 480
>gi|91078858|ref|XP_972061.1| PREDICTED: similar to thymus-specific serine protease [Tribolium
castaneum]
Length = 501
Score = 168 bits (425), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 144/520 (27%), Positives = 237/520 (45%), Gaps = 52/520 (10%)
Query: 9 SFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQ-GLYKTKYHTQILDHFNY 67
S+ LL + S + F + R+ L + K+S + ++ TQ LDHFN
Sbjct: 4 SWLLLLTLFYISSEVSSWRIFRNGRMVGGNLGEPKCNCKESSIKEVQEEWFTQNLDHFN- 62
Query: 68 NPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD--IEWFAQNTGFMYDVAPKFKAL 125
P T++QR+ ND + KN P+F+ G EG+ I+W Q G + A KF AL
Sbjct: 63 -PTDETTWKQRFYSNDQFFD-PKNGGPVFLMIGGEGEASIKWMTQ--GAWVNYAEKFGAL 118
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPV 184
+ +EHRYYGKS P +++ +N YL+S QALAD A+ I + + + D
Sbjct: 119 MFQLEHRYYGKSHP---TDDLSTQNLK---YLTSQQALADLATFITAMNEKYSLPPDVKW 172
Query: 185 VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENC 244
+ FGGSY G LAAW R KYPH+ GA+++S P+L V + +I + + S++C
Sbjct: 173 IAFGGSYPGSLAAWLRFKYPHLVHGAMSASGPLL---AQVDFKDYFRVIKESLATHSDDC 229
Query: 245 YKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTD------ 298
++ QI K+ G L + F++C +N +I + + +Y + D
Sbjct: 230 VTAVQQGVDQIGVLLKQEIGQANLNELFKLCDPVQN-SINNEKDISNLYETIADDFAGVV 288
Query: 299 -YPTPSNFLNPLPA-FPVKEMCKAIDDPKTG---------NDVFAKLYGAASVYYNYSGT 347
Y + +P A + +C + + G N+V Y + YNY
Sbjct: 289 QYNKDNRVGSPAGANITIDVVCDIMVNQTIGPPVNRLAKVNEVLLSAYDQKCLDYNYDKM 348
Query: 348 AKCF-DLNGDSDP-HGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
+++ DS+ G +W +Q CTE D + IF + + D + C + +G
Sbjct: 349 INNLRNVSWDSEASEGGRQWTYQTCTEFGFYQTSDYEPQIFGD-QFSVDFFIQQCTDIFG 407
Query: 406 -------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
++ T +GG I + SN++F +G DPW G+ K I + A
Sbjct: 408 SIYDEDFLNSATERTNTYYGGLDIEV------SNVVFVHGSIDPWHALGITKTIDEEAPA 461
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ + AH ++ D LK+ R + + +IG W++Q
Sbjct: 462 IYIEGTAHCANMYPPADTDLPQLKEAREQILNLIGTWLAQ 501
>gi|270004132|gb|EFA00580.1| hypothetical protein TcasGA2_TC003450 [Tribolium castaneum]
Length = 473
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 135/474 (28%), Positives = 221/474 (46%), Gaps = 51/474 (10%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD--IEWFAQN 111
+ ++ TQ LDHFN P T++QR+ ND + KN P+F+ G EG+ I+W Q
Sbjct: 22 QEEWFTQNLDHFN--PTDETTWKQRFYSNDQFFD-PKNGGPVFLMIGGEGEASIKWMTQ- 77
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
G + A KF AL+ +EHRYYGKS P +++ +N YL+S QALAD A+ I
Sbjct: 78 -GAWVNYAEKFGALMFQLEHRYYGKSHP---TDDLSTQNLK---YLTSQQALADLATFIT 130
Query: 172 DLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
+ + + D + FGGSY G LAAW R KYPH+ GA+++S P+L V +
Sbjct: 131 AMNEKYSLPPDVKWIAFGGSYPGSLAAWLRFKYPHLVHGAMSASGPLL---AQVDFKDYF 187
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
+I + + S++C ++ QI K+ G L + F++C +N +I + +
Sbjct: 188 RVIKESLATHSDDCVTAVQQGVDQIGVLLKQEIGQANLNELFKLCDPVQN-SINNEKDIS 246
Query: 291 FVYTAMTD-------YPTPSNFLNPLPA-FPVKEMCKAIDDPKTG---------NDVFAK 333
+Y + D Y + +P A + +C + + G N+V
Sbjct: 247 NLYETIADDFAGVVQYNKDNRVGSPAGANITIDVVCDIMVNQTIGPPVNRLAKVNEVLLS 306
Query: 334 LYGAASVYYNYSGTAKCF-DLNGDSDP-HGLSEWGWQACTEMIMLTGGDNKDSIFEESEE 391
Y + YNY +++ DS+ G +W +Q CTE D + IF + +
Sbjct: 307 AYDQKCLDYNYDKMINNLRNVSWDSEASEGGRQWTYQTCTEFGFYQTSDYEPQIFGD-QF 365
Query: 392 DYDARARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS 444
D + C + +G ++ T +GG I + SN++F +G DPW
Sbjct: 366 SVDFFIQQCTDIFGSIYDEDFLNSATERTNTYYGGLDIEV------SNVVFVHGSIDPWH 419
Query: 445 GGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
G+ K I + A+ + AH ++ D LK+ R + + +IG W++Q
Sbjct: 420 ALGITKTIDEEAPAIYIEGTAHCANMYPPADTDLPQLKEAREQILNLIGTWLAQ 473
>gi|330799265|ref|XP_003287667.1| hypothetical protein DICPUDRAFT_151801 [Dictyostelium purpureum]
gi|325082345|gb|EGC35830.1| hypothetical protein DICPUDRAFT_151801 [Dictyostelium purpureum]
Length = 467
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 125/446 (28%), Positives = 209/446 (46%), Gaps = 37/446 (8%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T + Q +DH+++ + T++Q+Y++ D ++ GS PIF+Y E + +F
Sbjct: 46 QTFWFDQKIDHYDFFNNN--TYKQQYIVVDDYFDGS---GPIFIYLAGEAPMGFFGFQEV 100
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+ + A +F AL + IEHR+YGKS P ++++ N YL+S QALAD A+ +
Sbjct: 101 QVVEWAKQFGALFIVIEHRFYGKSYP---TQDLSTNNLK---YLTSQQALADAANFLSTY 154
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS-FSNI 232
K ++P VVFG SY G L++WFRLKYP +AI ++A S P+L N Y+ FSN
Sbjct: 155 KAENDLVENPTVVFGCSYSGALSSWFRLKYPQLAIASIAPSGPVLAQLNFTGYYAQFSN- 213
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
S + NC + + + QI + K G+++L+K F C S +N + F+
Sbjct: 214 ------SAAPNCVQAAQTATNQIMQLTKSSAGIKQLEKTFNSCHSLENPRDLYY----FL 263
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFD 352
YT +T+ ++ +N P + + C +A++ A N + +
Sbjct: 264 YT-LTEALGSADQMNNPPTWILNSTCGTFLQNDNLLTNWAQIVNAGQTGCNDYRLSTFIE 322
Query: 353 LNGD---SDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY---GV 406
+ S G W +Q C E + S+F + + + + ++C+E + G+
Sbjct: 323 QMREIKISQQDGSRSWVYQTCVEFGYFSTTYEGTSVFPPT-LNVEEQVKWCEEIFDVPGM 381
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
P +W +GG A+N++F NGL DPW V + + E H
Sbjct: 382 TPNIDWTNAYYGGQNTQ------ATNVMFTNGLLDPWHLLSVNSDNEAGTVRAATYEAGH 435
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEII 492
L T EDP L + R V +
Sbjct: 436 CASLIQETSEDPISLVNAREEVVSFL 461
>gi|195427485|ref|XP_002061807.1| GK16991 [Drosophila willistoni]
gi|194157892|gb|EDW72793.1| GK16991 [Drosophila willistoni]
Length = 512
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 130/465 (27%), Positives = 219/465 (47%), Gaps = 39/465 (8%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI--EWFAQNTGFMYD 117
Q LDHF P + QT+QQRY +N+ ++ + + APIF+ G EG+ +W + Y
Sbjct: 65 QRLDHFQ--PSNTQTWQQRYFVNEDYYR-NDSTAPIFLMIGGEGEASKKWMHEGAWIHY- 120
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
A F AL + +EHR+YGKS P K+++ N YL+S QALAD A+ + +K
Sbjct: 121 -AEHFSALCIQLEHRFYGKSHP---TKDLSTSNLV---YLTSEQALADLANFVAAMKVKY 173
Query: 178 TATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
DS + FGGSY G LAAW R KYPH+ G+++SS P+L V + +++
Sbjct: 174 DLKDSQKWIAFGGSYPGSLAAWAREKYPHLIYGSISSSGPLL---AEVDFKEYFDVVKAS 230
Query: 237 FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK-----NLAIESWLST-A 290
S +C + + S+ Q+E K G+ L K F+ C K +L I ++ A
Sbjct: 231 LASYKPDCVEAVTRSFGQVEILLKHMIGMANLDKTFKTCTPIKDSIHNDLDISNFFENLA 290
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG---------NDVFAKLYGAASVY 341
+ + Y + +P + ++C + + G ND+ + A+ +
Sbjct: 291 GNFAGVVQYNKDN---SPHATITIDDICDVMLNTTMGPPVTRLAAVNDLLLQQSNASCLD 347
Query: 342 YNYSGT-AKCFDLNGDSD-PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
Y Y A +++ DS+ G +W +Q C E +NK F + D R
Sbjct: 348 YKYDKMIADMKNISWDSETAQGARQWTYQTCNEFGFYQTSENKADTFGD-RFGVDFFIRQ 406
Query: 400 CKEAYGVDPRPNWITTEFG-GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
C++ Y D ++ G +K LK +N+++ +G DPW G++K I+
Sbjct: 407 CQDVYSSDMDARYLEQVVGQTNKHYGALKPETTNVLYIHGSIDPWHALGLIKPINAKTPT 466
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
+ + AH ++ K DP L + R + ++ + +++Y +L
Sbjct: 467 IYIEGTAHCANMYEPVKSDPPQLVEARNKILKYLANLLTEYDSEL 511
>gi|443725197|gb|ELU12877.1| hypothetical protein CAPTEDRAFT_173918 [Capitella teleta]
Length = 517
Score = 165 bits (417), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 140/474 (29%), Positives = 216/474 (45%), Gaps = 54/474 (11%)
Query: 50 QGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA 109
+G + +Y Q LDHF+ PQ +++QRY +N W G + P+F+Y G EG +
Sbjct: 64 RGPFIDEYFEQPLDHFD--PQVSGSYKQRYWVNADFWSGKE--GPVFLYIGGEGGLTSMT 119
Query: 110 QNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL 169
G D+A K+KAL+ +EHR+YG+S+ G K + YLSS QALAD A
Sbjct: 120 VQAGEHVDLAKKYKALIFAVEHRFYGESLNDDGLK------LESLQYLSSQQALADLAKF 173
Query: 170 --IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFD--N 222
++ K NLT D+ V FGGSY G L+AWFR+KYPH+ A+ASSAP+ ++F N
Sbjct: 174 HAVMSQKYNLT-DDNHWVCFGGSYPGALSAWFRIKYPHLVHAAVASSAPVRALVDFQGYN 232
Query: 223 IVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEK 279
V S S I S+ C +K ++ I++ K G L +L+ F C EK
Sbjct: 233 DVVAASLSATIVNG----SDKCLSQVKEAFSTIDQMLDK-GNLLQLENDFYSCAPLDGEK 287
Query: 280 NL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAA 338
++ S ++ AF+ + +P + +C+ + D +A L
Sbjct: 288 DIYQFTSNVADAFMGVV--------QYNQEIPGQSIAGLCEQM---TASADSYANLRKLF 336
Query: 339 SVYYNYSGTAKCFDLNGDSDPH--------------GLSEWGWQACTEMIMLTGGDNKDS 384
+ N S KC D + S GL +W +Q CT+ D +
Sbjct: 337 RRFLNES-DQKCSDNSWSSAIAQMSNTTVDRGGFGVGLRQWIYQTCTQFGYYQSCDVNTT 395
Query: 385 IFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS 444
C E +G+ + + +F G + + I+F NG DPW
Sbjct: 396 CPFSRYMGLVPNLDICTEVFGIGGKSTYGRVDFTNAYYGSDQPK-GTRIVFVNGSIDPWH 454
Query: 445 GGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
VLK++S A+ + AH ++ + DP L R++ E+IG W++Q
Sbjct: 455 ALSVLKDLSGGQHAIFIEGTAHCANMNSNQPWDPPQLLKARKQTDELIGSWLNQ 508
>gi|326429444|gb|EGD75014.1| hypothetical protein PTSG_07238 [Salpingoeca sp. ATCC 50818]
Length = 502
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 136/469 (28%), Positives = 212/469 (45%), Gaps = 66/469 (14%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI--EWFAQNTG 113
K+ TQ +DHFN PQ +TFQQ+Y +N T+ K P+F+ G EG W +T
Sbjct: 67 KWFTQKVDHFN--PQDTRTFQQQYQVNATY---HKQGGPVFLMLGGEGPASPRWLEIDTA 121
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
M A + A++V +EHR+YGKS P+ K+++ + YLSS QALAD A+ +
Sbjct: 122 IMI-YARQHDAVVVQLEHRFYGKSQPF---KDLSTDHLQ---YLSSEQALADAANFLTSF 174
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
+P VVFGGSY G LAA+FR KYPH+ GA+++S+P+ +V + + ++
Sbjct: 175 MPG-----APAVVFGGSYSGALAAFFRSKYPHLVNGAISTSSPVY---ALVDFHQYHEVV 226
Query: 234 TQDFRSVSEN---CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
+V N C I + ++I+ K G ++L K F +C + ++ + + T
Sbjct: 227 RNSLATVPHNGSHCSAAIATATEKIQTMLKTTNGRKQLAKDFNLC-GDSDVTHDDDIETL 285
Query: 291 FVYTA-----MTDYPTPSNFLNPLPAFP-VKEMCKAIDDPKTGNDVFAK-------LYGA 337
F A + Y +N P + ++C + T ND +A L G
Sbjct: 286 FTNLAGNIDGVVQYNLDNNHFEGRTKVPTITDVCAVM--AATPNDPYAAYANLQKYLTGG 343
Query: 338 ASV---YYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYD 394
+ Y N K L+ D G+ +W +Q C E + D F +
Sbjct: 344 ECIETSYANMIAEMKNTSLSSDV-AGGMRQWIYQTCVEFGFYQTSEGNDKPFLNTIS-LK 401
Query: 395 ARARYCKEAYGV-DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS 453
C + YGV P NW +GG+ + +NI++ NGL DPW +S
Sbjct: 402 YNLDQCSDIYGVPGPNVNWTNANYGGYDVA------GTNIVYVNGLIDPWHA------LS 449
Query: 454 KTVLAL-------VEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKW 495
+T AL V + AH ++ + +DP L R +G W
Sbjct: 450 RTDTALPDGCDAIVIPQTAHCANMYPPSPDDPPALTRARETISSYLGVW 498
>gi|149068959|gb|EDM18511.1| prolylcarboxypeptidase (angiotensinase C) (predicted) [Rattus
norvegicus]
Length = 193
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 80/151 (52%), Positives = 105/151 (69%), Gaps = 8/151 (5%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y Y Q +DHF ++ +TF+QRYL+ D HW +N I YTGNEGDI WF NT
Sbjct: 45 YSVHYFQQKVDHFGFS--DTRTFKQRYLVADKHW--QRNGGSILFYTGNEGDIVWFCNNT 100
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
GFM+DVA + KA+LVF EHRYYGKS+P+G + ++K++ +L+S QALAD+A LI
Sbjct: 101 GFMWDVAEELKAMLVFAEHRYYGKSLPFGRD---SFKDSQHLNFLTSEQALADFAELIRH 157
Query: 173 LKKNLTATD-SPVVVFGGSYGGMLAAWFRLK 202
LK+ + T+ PV+ GGSYGGMLAAWFR+K
Sbjct: 158 LKETIPGTEGQPVIAIGGSYGGMLAAWFRMK 188
>gi|195013134|ref|XP_001983807.1| GH15372 [Drosophila grimshawi]
gi|193897289|gb|EDV96155.1| GH15372 [Drosophila grimshawi]
Length = 505
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 139/487 (28%), Positives = 220/487 (45%), Gaps = 48/487 (9%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P K+ SL S D + L+ Q LDHF + + QT+QQRY IND H+ + +NAP+
Sbjct: 37 PSKVLSL-QRSLDVEELW----FEQRLDHFKADNR--QTWQQRYFINDQHYV-NDSNAPV 88
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
F+ G EG+ N G A F AL + +EHR+YGKS P G + S
Sbjct: 89 FIMIGGEGEATKKWMNEGAWIHYAEHFGALCIQLEHRFYGKSHPTGD------LSTSNLA 142
Query: 156 YLSSTQALADYASLIIDLKK--NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALAS 213
YLSS QALAD A+ + +K N+ AT + FGGSY G LAAW R KYP + GA++S
Sbjct: 143 YLSSEQALADLANFVSAMKSKYNMKATQK-WIAFGGSYPGSLAAWAREKYPDLIDGAISS 201
Query: 214 SAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFR 273
S P+L V + ++ S +C + + S+ Q+E K G L + F+
Sbjct: 202 SGPLL---AEVDFRQYFEVVKASLASYKPDCVEAVTRSFAQVEILLKHMIGQRNLDEKFK 258
Query: 274 ICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL---------NPLPAFPVKEMCKAIDDP 324
C K+ +IE+ L A ++ + NF +P + E+C + +
Sbjct: 259 TCTPLKD-SIETPLDIANLFENLA-----GNFAGVVQYNKDNSPHANITIDEVCDVMLNT 312
Query: 325 KTG---------NDVFAKLYGAASVYYNYSGT-AKCFDLNGDSD-PHGLSEWGWQACTEM 373
+ G ND+ K + + Y Y A +++ DS+ G+ +W +Q C E
Sbjct: 313 QLGPPVTRLAAVNDMLLKQSKSKCLDYKYDKMIADMKNVSWDSEVAKGMRQWTYQTCNEF 372
Query: 374 IMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE-FGGHKIGLVLKRFASN 432
DNK F + D R C + + N++ +K LK +
Sbjct: 373 GFYQTSDNKSDTFGD-RFGVDFFVRQCADIFSERMDANFVEQAVLATNKFYGALKPDTTQ 431
Query: 433 IIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEII 492
+++ +G DPW G+ + + + + AH ++ DP+ LK R + ++ +
Sbjct: 432 VLYVHGSIDPWHALGLYVSPNAKTPTIYIEGTAHCANMYEPVDSDPEQLKAARNKILKFL 491
Query: 493 GKWISQY 499
K + Y
Sbjct: 492 AKLLDGY 498
>gi|322785978|gb|EFZ12594.1| hypothetical protein SINV_07473 [Solenopsis invicta]
Length = 494
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 139/496 (28%), Positives = 224/496 (45%), Gaps = 52/496 (10%)
Query: 28 TFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWG 87
TF R L + SSK S L K ++ TQ LDHFN P ++QRY +N +
Sbjct: 22 TFLRGRSKGGNLGEPVVSSKKS--LPKEQWFTQYLDHFN--PTDVHVWKQRYFVNSDFY- 76
Query: 88 GSKNNAPIFVYTGNEG--DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKE 145
K N PIF+ G EG + +W + Y A +F A+ ++EHR+YGKS P +
Sbjct: 77 --KPNGPIFLMIGAEGIANPKWMIEGQWIEY--AKEFGAMCFYLEHRFYGKSHP---TSD 129
Query: 146 IAYKNASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYP 204
++ KN YLSS QALAD A I + ++ +VFGGSYGG LAAW R KYP
Sbjct: 130 LSVKNLV---YLSSEQALADLAYFIQSVNIGYKFPNNAKWIVFGGSYGGSLAAWMRAKYP 186
Query: 205 HVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGG 264
H+ GA+++S P+L + Y ++T + SE C VI+ + QI G
Sbjct: 187 HLVHGAVSASGPLLAQIDFEEYYI---VVTNALKGYSEKCVDVIQDANSQINMLLHHVTG 243
Query: 265 LEKLQKAFRICK-----SEKNLAIESWLST-AFVYTAMTDYPTPSNFLNPLPAFPVKEMC 318
+++QK F++C K + I + T A + + Y + + ++ +C
Sbjct: 244 QQQIQKKFKLCDPIDPGHTKKVDISNLYETLASNFAGIVQYNKDNRQSSQTANITIETVC 303
Query: 319 KAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGL-------------SEW 365
+ D K G + Y + + + KC D D H L +W
Sbjct: 304 DILVDEKIGKSIDRLAYVSNMIL--DATKEKCLDYRYDKMIHELRNVTWASEQAEGGRQW 361
Query: 366 GWQACTEMIMLTGGDNKDSIFEESEEDYDAR--ARYCKEAYGVDPRPNWITTEFGGHKIG 423
+Q CTE + +I+ ++ + A + C + +G PR N G +
Sbjct: 362 MYQTCTEFGFFQTSTGQSNIYGDT---FPAAFFVQQCLDIFG--PRYNLNLLTSGVTRTN 416
Query: 424 LV---LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQW 480
++ L +N++F +G DPW G+ ++ + A+ K AH ++ ++ D
Sbjct: 417 ILYGALNLQVTNVVFVHGSIDPWHVLGITESANPQAPAIYIKGTAHCANMYPPSEHDMPQ 476
Query: 481 LKDVRRREVEIIGKWI 496
LK+ R + +I +W+
Sbjct: 477 LKEARIQIQGLIKQWL 492
>gi|171847157|gb|AAI61646.1| LOC100145763 protein [Xenopus (Silurana) tropicalis]
Length = 494
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 134/471 (28%), Positives = 216/471 (45%), Gaps = 66/471 (14%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q L+HF+ + +QQRY +NDT+ ++ P+F+ G EG +G A
Sbjct: 56 QRLNHFS--GADTRVWQQRYFMNDTY---RQSGGPVFLMIGGEGPANPAWMTSGAWLTYA 110
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
K AL + +EHR+YGKS P ++I+ N YLSS QALAD A +K+ L
Sbjct: 111 EKLGALCLMLEHRFYGKSHP---TQDISTDNLH---YLSSQQALADIAHFQTVMKEKLGL 164
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNIITQD 236
DS VVFGGSY G L+AW+R+KYPH+A A+ASSAP+ +NF + ++
Sbjct: 165 ADSKWVVFGGSYPGSLSAWYRMKYPHLAHMAVASSAPVKVQVNFSE------YLEVVQLA 218
Query: 237 FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAM 296
+C + IK + +++ + +KL FR+C+ L I SW+ TA++ +
Sbjct: 219 LSLNHSDCPEAIKMASEEVSKLLVL-SSFQKLTDDFRLCEL---LQINSWMDTAYLLDTL 274
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV---YYNYSGTA-KCFD 352
+ N +N + + + D V + S+ YY Y+ K +
Sbjct: 275 AE-----NIMNVVQYNKDNREFEGVKDTSITIQVVCDIMANKSLGTPYYRYANVIRKILE 329
Query: 353 LNGD--------------------SDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEED 392
+NG+ S G +W +Q CTE D+ F
Sbjct: 330 VNGEKCMDASYKKFIQDMRNITWASVSLGGRQWMYQTCTEFGFFQSTDSAAQPFSGIPLS 389
Query: 393 YDARARYCKEAYGVDPRPNWI------TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
Y + C + +G + + + T E+ G GL +K S IIF NGL DPW
Sbjct: 390 Y--HVQQCSDIFGPEYNLSMVTDSVQQTNEYYG---GLDIK--GSRIIFPNGLIDPWHRL 442
Query: 447 GVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G+ ++S ++A+ + AH ++ + EDP + R+ +++ KW++
Sbjct: 443 GINADLSGDLIAIQMEGAAHCANMYPARLEDPPSVPSARQYIFQLLTKWLN 493
>gi|321477664|gb|EFX88622.1| hypothetical protein DAPPUDRAFT_191142 [Daphnia pulex]
Length = 493
Score = 161 bits (407), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 145/489 (29%), Positives = 225/489 (46%), Gaps = 67/489 (13%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+ S + L ++ Q LDHF+ + +T++QRY NDT + G ++P+F+ G E
Sbjct: 36 VELSSSNTNLPPDQWFDQKLDHFDV--VNSKTWKQRYHTNDTFFKG---DSPVFLMIGGE 90
Query: 103 GDI--EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSST 160
G+ +W Q G + A +F AL +EHRYYGKS P K ++ KN +LSS
Sbjct: 91 GEASPKWMVQ--GMWIEWAKQFNALCFQLEHRYYGKSHP---TKNMSTKNLK---FLSSE 142
Query: 161 QALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI--- 217
QALAD A I KK L +++ +VFGGSY G LAAWFRLKYPH+A GA+ASSAP+
Sbjct: 143 QALADLAYFIEAKKKELKLSNNKWIVFGGSYPGSLAAWFRLKYPHLAHGAVASSAPLFAK 202
Query: 218 LNFDNIVSPYSFSNIITQDFRSVSEN--CYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC 275
+NF + ++T ++ S++ C I+ + +E+ + + + K F +C
Sbjct: 203 INFKEYLG------VVTNALQTTSQSSKCTNAIEQATIALEDEIQSTSCCKTMDKLFSLC 256
Query: 276 ---KSEKNLAIESWLST-AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF 331
+ L + ++ T A + + Y + + +C + DPK G+ +
Sbjct: 257 DPLDTTNKLDVANFFETLAGNFEGVVQYNKDNRDFKG-ANITMDVLCDMMTDPKIGSPL- 314
Query: 332 AKLYGAASVYYNYSGTAKCFDLNGD-------------SDPHGLSEWGWQACTEMIMLTG 378
A+ +V +G KC D D S G +W +Q CTE
Sbjct: 315 ARYAAVNNVLLGTTG-EKCLDYKYDKFLIDMRSTDWNSSASEGGRQWTYQTCTEFGYYQS 373
Query: 379 GDNKDSIFEESEEDYDARARYCKEAYG----------VDPRPNWITTEFGGHKIGLVLKR 428
D +D F + + R C + +G R N+I +GG +GL L R
Sbjct: 374 SDLEDQPFGK-RFPIEFSVRQCSDIFGGKFNYKLLKNAVARTNFI---YGG--LGLKLDR 427
Query: 429 FASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRRE 488
+F NG DPWS G+ N + V ++ AH D+ + +D L + R
Sbjct: 428 ----TVFPNGSVDPWSALGITSNTTGNVAIFIQGT-AHCADMYPPSPKDSLELTEARNVI 482
Query: 489 VEIIGKWIS 497
+ WIS
Sbjct: 483 ESHLRSWIS 491
>gi|218198654|gb|EEC81081.1| hypothetical protein OsI_23904 [Oryza sativa Indica Group]
Length = 232
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 82/154 (53%), Positives = 98/154 (63%), Gaps = 12/154 (7%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW-----GGSKNNAPIFVYTGNEGDIEW 107
+ Y Q LDHF + P + F Q+YL+NDT W G PIFVYTGNEGDIEW
Sbjct: 82 FTAHYFPQELDHFTFTPNASALFYQKYLVNDTFWRRSAAAGETPAGPIFVYTGNEGDIEW 141
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
FA NTGFM+D+AP F ALL HR+YG+S P+G + GYL+STQALAD+A
Sbjct: 142 FATNTGFMFDIAPSFGALL----HRFYGESKPFGNESN---SSPEKLGYLTSTQALADFA 194
Query: 168 SLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRL 201
LI LK NL+A SPVVVFGGSYGG + +L
Sbjct: 195 VLITSLKHNLSAVSSPVVVFGGSYGGSKSPIIKL 228
>gi|301624087|ref|XP_002941341.1| PREDICTED: putative serine protease K12H4.7 [Xenopus (Silurana)
tropicalis]
Length = 516
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 133/471 (28%), Positives = 216/471 (45%), Gaps = 66/471 (14%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q L+HF+ + +QQRY +NDT+ ++ P+F+ G EG +G A
Sbjct: 78 QRLNHFS--GADTRVWQQRYFMNDTY---RQSGGPVFLMIGGEGPANPAWMTSGAWLTYA 132
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
K AL + +EHR+YGKS P ++I+ N YLSS QALAD A +K+ L
Sbjct: 133 EKLGALCLMLEHRFYGKSHP---TQDISTDNLH---YLSSQQALADIAHFQTVMKEKLGL 186
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNIITQD 236
DS VVFGGSY G L+AW+R+KYPH+A A+ASSAP+ +NF + ++
Sbjct: 187 ADSKWVVFGGSYPGSLSAWYRMKYPHLAHMAVASSAPVKVQVNFSE------YLEVVQLA 240
Query: 237 FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAM 296
+C + IK + +++ + +KL FR+C+ L I SW+ TA++ +
Sbjct: 241 LSLNHSDCPEAIKMASEEVSKLLVL-SSFQKLTDDFRLCEL---LQINSWMDTAYLLDTL 296
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV---YYNYSGTAK-CFD 352
+ N +N + + + D V + S+ YY Y+ + +
Sbjct: 297 AE-----NIMNVVQYNKDNREFEGVKDTSITIQVVCDIMANKSLGTPYYRYANVIRTILE 351
Query: 353 LNGD--------------------SDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEED 392
+NG+ S G +W +Q CTE D+ F
Sbjct: 352 VNGEKCMDASYKKFIQDMRNITWASVSLGGRQWMYQTCTEFGFFQSTDSAAQPFSGIPLS 411
Query: 393 YDARARYCKEAYGVDPRPNWI------TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
Y + C + +G + + + T E+ G GL +K S IIF NGL DPW
Sbjct: 412 Y--HVQQCSDIFGPEYNLSMVTDSVQQTNEYYG---GLDIK--GSRIIFPNGLIDPWHRL 464
Query: 447 GVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G+ ++S ++A+ + AH ++ + EDP + R+ +++ KW++
Sbjct: 465 GINADLSGDLIAIQMEGAAHCANMYPARLEDPPSVPSARQYIFQLLTKWLN 515
>gi|327276291|ref|XP_003222903.1| PREDICTED: thymus-specific serine protease-like [Anolis
carolinensis]
Length = 511
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 134/458 (29%), Positives = 213/458 (46%), Gaps = 47/458 (10%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDH+N Q+ TF QRY +N W ++ P+F++ G EG + +A G +A
Sbjct: 70 QHLDHYNKKNQA--TFNQRYWVNAGFW---RHGGPVFLFIGGEGRLSEYAVLKGHHVTLA 124
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL--IIDLKKNL 177
K+ ALL+ +EHR+YG S+ E+ + YLSS QAL+D S I K L
Sbjct: 125 EKYGALLLALEHRFYGGSL----KPEMLEDD--NLQYLSSQQALSDLVSFHQFISKKYKL 178
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFD--NIVSPYSFSNI 232
T ++ + FGGSY G LAAWFRLK+PH+ GA+ASSAP+ L+F + V S SN
Sbjct: 179 TPNNT-WICFGGSYPGSLAAWFRLKFPHLVFGAVASSAPVRAQLDFKGYHKVVAASLSNP 237
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
+ S S+ C + ++ +EE + G L+KL + FR C + L WL V
Sbjct: 238 VI----SGSKQCLDAVTEAFSAVEELVRS-GQLDKLDQDFRSCLPLEGLKDSLWLIKNLV 292
Query: 293 --YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKC 350
+ A+ Y + V +C+ + + G+ + +L ++ + + KC
Sbjct: 293 SMFMAIVQYNGER-----VEWANVGRICEIMTNHSAGS-TYQRLVATNNIVLS-AMRLKC 345
Query: 351 FD------LNGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
D + ++P S +W +Q CTE ++ F +
Sbjct: 346 LDNSYAAFIEKMTNPKFFSMNMVVRQWIFQTCTEFGYFQTCEDPACPFSRL-VNLRFEMD 404
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
CK+ + + R F G + AS ++F NG DPW VLK++S + LA
Sbjct: 405 VCKQVFNISDRSAQEAVSFTNEYYGANHPK-ASRVLFVNGDIDPWHVLSVLKDLSPSELA 463
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+V +H ++ DP L + R++ +G+W+
Sbjct: 464 IVITGTSHCANMESPLPTDPLPLVEARKKITAQVGEWL 501
>gi|383865215|ref|XP_003708070.1| PREDICTED: putative serine protease K12H4.7-like [Megachile
rotundata]
Length = 479
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 136/454 (29%), Positives = 207/454 (45%), Gaps = 42/454 (9%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHFN P+ +T+ RYL N + K PI + G E +I TG MY++A
Sbjct: 51 QPLDHFN--PRDNRTWSMRYLENSKY---HKEGGPIMIMIGGEWEISTGFLTTGLMYEIA 105
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
A++ + EHRYYG+S P ++I+ KN YLS QALAD A I K+
Sbjct: 106 STHGAMMYYTEHRYYGQSKP---TEDISSKNLQ---YLSVDQALADLAYFIETKKEQDHL 159
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRS 239
+S V+V GGSY G +AAW RLKYPH+ GALASSAP+ + Y + ++T+ R
Sbjct: 160 RNSTVIVIGGSYAGSMAAWARLKYPHLIQGALASSAPVFAKADF---YEYYEVVTESIRR 216
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-----KSEKNLA-IESWLSTAFVY 293
+E C IK ++ +E+ G ++L+ F +C KS ++ + + L+ F
Sbjct: 217 QNEKCADDIKAAFDAVEKLLFTKNGPKRLKTYFHLCDAPDVKSHNDIGHLMNTLAEGFA- 275
Query: 294 TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL 353
+ Y + A K ++ P + NY K +
Sbjct: 276 -GIVQYDNVEKNQTKIAACCDKMTATSLGSPLQRLAHVITDGNKKCIENNYQKFVKQYS- 333
Query: 354 NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR--ARYCKEAYGVDPRPN 411
NG +W +Q C+E ++K+SIF + R C + YG N
Sbjct: 334 NGTWKNDISRQWYYQTCSEFGYYQTTNSKNSIFGSL---FPLRFFTDLCVDLYGDYYNEN 390
Query: 412 WITTE-------FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
++ T +GG + L N+IF NG DPW VL+N++ A++ K
Sbjct: 391 FLDTSIRRTNIMYGGLRPDL------RNVIFTNGDIDPWHKLSVLQNLNADSPAILIKGS 444
Query: 465 AHHVDLRFSTKE-DPQWLKDVRRREVEIIGKWIS 497
+H DL + D + L + R +IIG W++
Sbjct: 445 SHCRDLYSDNLDTDAKDLVNARANVRKIIGTWLA 478
>gi|166240237|ref|XP_635876.2| peptidase S28 family protein [Dictyostelium discoideum AX4]
gi|165988498|gb|EAL62376.2| peptidase S28 family protein [Dictyostelium discoideum AX4]
Length = 476
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 135/485 (27%), Positives = 225/485 (46%), Gaps = 80/485 (16%)
Query: 48 DSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW 107
D G + ++ TQ LDHFN+ Q+ QTFQQ+Y +ND ++ KN PI +Y EG +
Sbjct: 34 DGSGSFPAQWFTQTLDHFNF--QNNQTFQQKYYVNDQYYN-YKNGGPIILYINGEGPVSS 90
Query: 108 --FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
++ + G + A ++V +EHR+YG+S P+ E+ +N YLS QAL D
Sbjct: 91 PPYSSDDGVVI-YAQALNCMIVTLEHRFYGESSPF---SELTIENLQ---YLSHQQALED 143
Query: 166 YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDN 222
A+ ++D + L +V GGSY G L+AWFR+KYPH+ +G++ASS IL+F
Sbjct: 144 LATFVVDFQSKLVGAGH-IVTIGGSYSGALSAWFRIKYPHITVGSIASSGVVHSILDFTA 202
Query: 223 IVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA 282
+ S+ +V C K ++ E+ G E+ K +I ++E +
Sbjct: 203 FDAYVSY---------AVGPECTKALQAVTSAAEDEYFAGGIREQQMK--QILQAESLVD 251
Query: 283 IES---WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK----AIDDPKTGNDVFAKLY 335
I WL+ + + Y + E+C AI+ +G D+
Sbjct: 252 IGDFFYWLADSMMEGDQYGY--------------IDELCSPLVDAINSGTSGIDLI---- 293
Query: 336 GAASVYYNYS--------GT----AKCFDLNGDSDPHGLSE--WGWQACTEMIMLTGGDN 381
+VY NY+ GT + + N DP ++ W +Q C+ + + +
Sbjct: 294 ---TVYSNYTINTWGKVLGTPDEYSTAWQQNVTYDPSKSADRAWWYQTCSSLGWMQAAPS 350
Query: 382 KDSIFEESEEDYDARARYCKEAYGV---DPRPNWITTEFGGHKIGLVLKRFASNIIFFNG 438
++SI S + +C++ +G P N + T++GG + +L +NI+F NG
Sbjct: 351 ENSI-RSSLVNMTYFQTHCQQLFGQAIWPPNVNAVNTQYGGDQSNPLLNAAGTNILFTNG 409
Query: 439 LRDPWSGGGVLK----NISKTVLALVEKEGAHHVDLRFSTK--EDPQWLKDVRRREVEII 492
DPWS ++ N+ + + K G H VDLR + P L VR ++ I
Sbjct: 410 HADPWSQASIVNSNYPNVEPSAMTTCRKCG-HCVDLRGCPGGCDLPNNLDQVRSLSLKSI 468
Query: 493 GKWIS 497
+W++
Sbjct: 469 AQWLN 473
>gi|54648564|gb|AAH85041.1| LOC495469 protein, partial [Xenopus laevis]
Length = 502
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 145/487 (29%), Positives = 219/487 (44%), Gaps = 86/487 (17%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y Q LDHFN + T+ QRY IN+ +W + + P+F+Y G E + F+ +G
Sbjct: 60 YIVQPLDHFNRLNNA--TYNQRYWINEQYW--NHPDGPVFLYIGGESSLSEFSVLSGEHI 115
Query: 117 DVAPKFKALLVFIEHRYYGKSI-PYGGNKEIAYKNASTTGYLSSTQALADYAS--LIIDL 173
++A +ALLV +EHRYYG SI P G E +LSS QALAD AS + I
Sbjct: 116 ELAQTHRALLVSLEHRYYGSSINPDGLTLE-------NIKFLSSQQALADLASFHMFISQ 168
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFD--NIVSPYS 228
K NLT ++ + FGGSY G L+AWFRLK+PH+ A+ASSAP+ L+F N V S
Sbjct: 169 KYNLTRQNT-WICFGGSYPGSLSAWFRLKFPHLVYAAVASSAPVRAELDFTGYNKVVALS 227
Query: 229 FSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWL- 287
++ + SE C +K ++ ++ +K G + +++K F C S + S
Sbjct: 228 LADPVIGG----SEKCLDAVKKGFQAVDSLLQK-GNVTQMEKDFYSCGSLQGSDDYSEFV 282
Query: 288 -STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK--AIDDPKTGNDVFAKLYGAASVYYNY 344
+ A ++ Y S P V+ +C+ +I D N + L +Y N
Sbjct: 283 GNLADIFMGAVQYNGMS------PVSNVQYICQLMSIKD----NSAYEGLRSVNRMYMNS 332
Query: 345 SGTAKCFDLNGDSDPH-----------GLSEWGWQACTE----------------MIMLT 377
G + + + S G +W +Q CTE +I L
Sbjct: 333 MGLSCISNSHAKSVADLSSTKLSLIGVGERQWYYQTCTEFGYYQTCEDPSCPFSPLITLK 392
Query: 378 GG-DNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFF 436
D IF+ E ++ E YG D RP +S IIF
Sbjct: 393 SQLDLCSQIFQVPTESVLQSVQFTNEFYGAD-RPK------------------SSRIIFV 433
Query: 437 NGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
NG DPW VLKN S++ +A++ +H +++ S DP L++ R+ + W+
Sbjct: 434 NGDVDPWHALSVLKNQSRSEIAILINGTSHCANMKPSQTSDPLSLQEARKEIATQVATWL 493
Query: 497 SQYFQDL 503
Q+L
Sbjct: 494 KSAQQEL 500
>gi|384249050|gb|EIE22532.1| hypothetical protein COCSUDRAFT_16137, partial [Coccomyxa
subellipsoidea C-169]
Length = 291
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 151/284 (53%), Gaps = 26/284 (9%)
Query: 99 TGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLS 158
GNE +E + TGFM+++ + ALL+F EHRYYG S P G + YLS
Sbjct: 1 AGNEDAVEVYVNFTGFMWELGREMNALLIFAEHRYYGDSQPLGPSSL-----DRDPSYLS 55
Query: 159 STQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL 218
QALAD+A+LI +K+ A DSPV+ FGGSYGGMLAAW R KYP+ GA+A SAP+
Sbjct: 56 IEQALADFATLIYHVKEKHGARDSPVIAFGGSYGGMLAAWLRAKYPNAVQGAIAGSAPVG 115
Query: 219 NFDNIVSPYSFSNIITQDFR---SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC 275
+ ++T D +++C + ++++ ++ + L FR+C
Sbjct: 116 AY-----------VVTYDASPEAGAAKHCRANVHSFFQELLADKERASFWQHLADVFRLC 164
Query: 276 ---KSEKNLA-IESWLSTAFVYTAMTDYPTPSNFL-NPLPAFPVKEMCKAIDDPK-TGND 329
+S K++ + W+ AF AM +YP PS ++ LPA+P++ C + D K + D
Sbjct: 165 LAPESGKDVENVAYWVQGAFDSFAMGNYPYPSTYMGGALPAWPMRAACDHLADEKPSKED 224
Query: 330 VFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGL-SEWGWQACTE 372
+ + A + YN +G A C++ P G + W +Q CTE
Sbjct: 225 LLQGMAAAVGLLYNATGDAPCYNATQLVGPAGPGATWMFQWCTE 268
>gi|291241533|ref|XP_002740665.1| PREDICTED: Prolyl Carboxy Peptidase like family member (pcp-2)-like
[Saccoglossus kowalevskii]
Length = 500
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 137/477 (28%), Positives = 218/477 (45%), Gaps = 62/477 (12%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW--GGSKNNAPIFVYTGNEGDI--EWFA 109
+ ++ +Q LDH YN +T+QQRY I+D+H+ GG P+F+ G EG + +W
Sbjct: 51 QEQWISQRLDH--YNDADLRTWQQRYYIDDSHYIAGG-----PVFLNIGGEGPLNSKWLM 103
Query: 110 QNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL 169
T ++ A K+ AL + +EHRYYGKS P + + YLSS QALAD A
Sbjct: 104 AETTWI-QYAMKYGALCLLVEHRYYGKSHP------TVDVSTDSLQYLSSEQALADLAYF 156
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
+ + L T++ + FGGSY G LAAWFR+KYPH+ GA+A+SAP+L N +
Sbjct: 157 RNYIGEKLNITNNKWIAFGGSYSGNLAAWFRIKYPHLVDGAVATSAPVLAKLNFT---EY 213
Query: 230 SNIITQDFRS--VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIE--- 284
++ S E C K I+ + +++ + G + LQ F++C + ++
Sbjct: 214 LEVVRDSLASSKAGEACNKNIQAAVIDMQKKLQTTEGEKLLQNIFQVCGPINSTELKDVQ 273
Query: 285 ---SWLSTAFVYTAMTDYPTPSNFLNPLPA--FPVKEMCKAIDDPKTGNDVFAKLYGAAS 339
S +S F + Y + + +C + D G+ + Y A +
Sbjct: 274 NFHSLVSGNF--EGVVQYNRDNREFEGAVGTNITLDTLCDIMVDESIGDPLHR--YAAVN 329
Query: 340 VYYNYSGTAKCFDLNGD-------------SDPHGLSEWGWQACTEMIMLTGGDNKDSIF 386
+ KC D++ D S G +W +Q CTE D + F
Sbjct: 330 TLMLQTYQTKCLDISYDNMIQEMRQNSWNSSAAEGGKQWVYQTCTEFGYYQTSDAINQPF 389
Query: 387 EESEEDYDARARYCKEAYGVDPRPNWIT-------TEFGGHKIGLVLKRFASNIIFFNGL 439
+ + C++ YG +T T +G GL LK +N++F NG
Sbjct: 390 GHNFP-LSFSLQQCQDIYGKQFNQTTLTAGIKSTNTNYG----GLGLK--TNNVVFPNGS 442
Query: 440 RDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
DPW G+ +++S++V A+ K AH ++ +D LK R+ +IGKWI
Sbjct: 443 IDPWHALGITQDVSQSVTAIYIKGTAHCANMYPEKADDLPQLKQARKTIEILIGKWI 499
>gi|260797277|ref|XP_002593630.1| hypothetical protein BRAFLDRAFT_284130 [Branchiostoma floridae]
gi|229278856|gb|EEN49641.1| hypothetical protein BRAFLDRAFT_284130 [Branchiostoma floridae]
Length = 464
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 133/466 (28%), Positives = 211/466 (45%), Gaps = 41/466 (8%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
++ TQ LDH YN +T+QQRY +NDT + K P+F+ G EG + TG
Sbjct: 17 QWVTQRLDH--YNDADLRTWQQRYFVNDTFY---KPGGPVFLMIGGEGTADPIWMVTGSW 71
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
+ A +F AL + +EHRYYGKS P ++ + +N YLSS QALAD A + +
Sbjct: 72 IEYAKEFHALCLMLEHRYYGKSHP---TEDTSVENLQ---YLSSEQALADLAYFRNYMAE 125
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
++ TD+ + FGGSY G L+AWFRLKYPH+ GA+A+S P+L + V + ++
Sbjct: 126 KMSLTDNKWITFGGSYPGSLSAWFRLKYPHLVAGAVATSGPLLAELDFV---EYVEVVRD 182
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIES---WLSTAF 291
+ C K I+ + +++ + G+EKL K F +C + N+A + W + +
Sbjct: 183 SLATTGPECNKNIQEATDAVKQMLETQEGVEKLNKLFNLCSPLDVNVANDVANFWSTISG 242
Query: 292 VYTAMTDYPTPSNFLNPLPA--FPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAK 349
+ + Y + + +C ++D V + Y A + + K
Sbjct: 243 NFMGVVQYNKDNREFEGAVGTNITLDVLCNIMNDNSKHMPVEER-YAAVNYLMMETYKQK 301
Query: 350 CFDLN-------------GDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR 396
C D++ + G +W +Q CTE D+ + F +
Sbjct: 302 CLDVSYSAMIKELRETSWESAASEGGRQWIYQTCTEFGFYQTSDSPNQPFGDGFP-LSFS 360
Query: 397 ARYCKEAYGVDPRPNWITTEFGGHKIGL---VLKRFASNIIFFNGLRDPWSGGGVLKNIS 453
+ C + YG P+ N T G + LK + I+F NG DPW G+ ++
Sbjct: 361 LQQCSDIYG--PQFNQSTLMEGIRRTNTNYGALKIAGTKIVFPNGSIDPWHALGITEDPK 418
Query: 454 KTVLALVEKEG-AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
T L V +G AH ++ + DP L R I W+SQ
Sbjct: 419 GTDLHAVYIKGTAHCANMYPARFGDPPELTQARETIKGHIQDWLSQ 464
>gi|328868233|gb|EGG16611.1| hypothetical protein DFA_07589 [Dictyostelium fasciculatum]
Length = 479
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 137/450 (30%), Positives = 208/450 (46%), Gaps = 52/450 (11%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T + Q DHF+ T++Q+Y + D + S+ NAP+F++ E + +F
Sbjct: 53 QTFWFDQQQDHFDQTNNI--TWKQQYQVIDDWFDPSQPNAPVFIFLAGEAPMGFFNFQEV 110
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+ A +FKAL V +EHR+YG+S P +++ N YL+S QALAD A+ +
Sbjct: 111 QIRAWAQEFKALYVILEHRFYGQSYP---TNDLSTHNLK---YLTSQQALADAANFLTTF 164
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS-FSNI 232
K D+ VVFG SY G L+AWFRLKYP + +G++A S P+L N Y+ F+N
Sbjct: 165 KSERGIADNQAVVFGCSYSGALSAWFRLKYPQLVVGSVAPSGPVLAQLNYTGYYAQFTN- 223
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKK-PGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
S +C + + Q+ + K+ G+++L+K F C S KN + F
Sbjct: 224 ------SAPTSCVNAAQQASDQVMQLIKQGDKGIKQLEKTFNSCSSLKNGRDLYY----F 273
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
VY+ + NP P + + C + +D+ L A + +N KC
Sbjct: 274 VYSIVEALGGADQMNNP-PTWTLNSTCNTL---SQNSDL---LVNWAEI-FNQGLDDKCN 325
Query: 352 DLNGDS-----------DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
D S D G W +Q C E + S+F + + + ++C
Sbjct: 326 DFTLRSFIDQARKTRINDQDGTRSWVFQTCAEFGYFSTTYPGSSVF-PGLLNVEEQVKWC 384
Query: 401 KEAY---GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK-NISKTV 456
+E Y G+ P +W + +GG +I SNI+F NGL DPW V + NI TV
Sbjct: 385 QEIYDVPGMTPNIDWTNSYYGGQEIK------GSNIMFSNGLLDPWHLLSVNQDNIDGTV 438
Query: 457 LALVEKEGAHHVDLRFSTKEDPQWLKDVRR 486
A V E H L ST DP L D R+
Sbjct: 439 KA-VTYEAGHCGTLIASTTIDPPSLVDARQ 467
>gi|159470599|ref|XP_001693444.1| predicted protein [Chlamydomonas reinhardtii]
gi|158282947|gb|EDP08698.1| predicted protein [Chlamydomonas reinhardtii]
Length = 264
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 100/274 (36%), Positives = 157/274 (57%), Gaps = 25/274 (9%)
Query: 94 PIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
PIF Y GNE ++E + TG +++ A F AL++F+EHRYYGK+ P+G +
Sbjct: 2 PIFFYAGNEANVELYVNVTGLIWENAQAFGALIIFVEHRYYGKTQPFGPDSW-----QVD 56
Query: 154 TGYLSSTQALADYASLIIDLKKNLTA---TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGA 210
YL+ QALADYA+L+ LK + A DSPV+ FGGSYGGML+AW R+KYPH+ GA
Sbjct: 57 PSYLTVEQALADYAALLWHLKADSPAGGAADSPVIAFGGSYGGMLSAWMRVKYPHIIAGA 116
Query: 211 LASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQK 270
+A+SAP+ F +V+ Y + + C ++ ++ + + ++ G L +
Sbjct: 117 VAASAPVAAFPGLVT-YDATPAAGS-----APECVTNVRLAFGNLRQLSRFAEGRAALSQ 170
Query: 271 AFRICK--SEKNLAIES--WLSTAFVYTAMTDYPTPSNFLN-----PLPAFPVKEMCK-- 319
R+CK +++ A+++ WL AF AM +YP PS++++ PLPA+P++ C
Sbjct: 171 LLRLCKPLADEGEALDAAYWLQGAFDAFAMGNYPFPSSYISDNPDRPLPAWPMRAACTHM 230
Query: 320 AIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL 353
+ + + L AA+V YN +G +CFDL
Sbjct: 231 VVGRSTKPSTLIEALRDAAAVLYNVTGDVQCFDL 264
>gi|118361433|ref|XP_001013945.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89295712|gb|EAR93700.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 873
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 130/470 (27%), Positives = 217/470 (46%), Gaps = 42/470 (8%)
Query: 47 KDSQGLYKTKYHTQI---LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
KD + K + T+I DHFN + +T+ QRY + D ++ N + +Y E
Sbjct: 415 KDQKNKKKYLFSTRIQNLTDHFNIT--NNRTWSQRYWVLDQYYNPQ--NGSVLLYICGEY 470
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
+ F +A KF +L++ +EHR+YG S+P+G ++ + N L+ QAL
Sbjct: 471 TCPGIPEERQFPILLAQKFSSLVLVLEHRFYGNSMPFG-DQSMKQHNLYL---LNVDQAL 526
Query: 164 ADYASLIIDLKKNL---TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
AD A I +K + P + GGSY G ++AWFR KYPH+ +GALASSA +
Sbjct: 527 ADLAYFITYVKDHHLHGVQNHIPWLTIGGSYPGAMSAWFRYKYPHLTVGALASSAVV--- 583
Query: 221 DNIVSPYSF-SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK 279
+ I+ Y +I RS E C + I ++ + P +++K F +E
Sbjct: 584 NAILDYYQMDQQVILSALRS-GEKCAQSIHDLNIYVQNLLQNPTSAYEIKKQF---NAEH 639
Query: 280 NLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFP-VKEMCKAIDDPKTGNDVFAKLYGAA 338
E ++T M Y + + N +P +++ ++I + N+V YG+
Sbjct: 640 LNNGEFLYFYTDIFTGMVQYGSRTVLCNQTLNYPTIEQQYQSILNYTKENNVTVNYYGS- 698
Query: 339 SVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
YY + T D + G +W WQ CTE N + +E + D
Sbjct: 699 --YYLRNDTY-------DPENDGSRQWTWQYCTEFGFFQTCSNPQTGSRSTEVNLDMFTN 749
Query: 399 YCKEAYGVD--PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTV 456
+CK+++ D P P+ + ++GG + A+N+I NG+ DPW G+ ++ V
Sbjct: 750 FCKQSFTQDIFPNPSRVNIQYGGVNLK------ATNLILTNGIEDPWRWAGLQQSSGDIV 803
Query: 457 LALVE-KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
L++ + AH VDL + D LK R + VE +WI +++ L +
Sbjct: 804 SYLIDCDDCAHCVDLYTPKETDALVLKQTREKIVEHFSQWIKEHYDSLEK 853
Score = 42.0 bits (97), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 44/191 (23%), Positives = 76/191 (39%), Gaps = 30/191 (15%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLI--NDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
++ Q DHF+ N ++ + QRY I N+ G I + + + F
Sbjct: 32 FNEQRYDHFSNN---FELWDQRYFIAKNEKSQNGQLGKVNIIFVCDKDLTHDILSCIPPF 88
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+ +E RYYG+S PY Y Y S Q +AD A + LK
Sbjct: 89 FDSQRRNSDVNIFLLEMRYYGESQPYSS----RYLGIDYLSYQSIQQNIADIALFVSFLK 144
Query: 175 K-NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K N+ ++DS ++KYPH+ G +A ++ ++N ++ ++ I+
Sbjct: 145 KDNMVSSDSK----------------KIKYPHLIDGVIAFNSQLVN----INYEQYNQIL 184
Query: 234 TQDFRSVSENC 244
Q + C
Sbjct: 185 DQQLSQTNPQC 195
>gi|345323658|ref|XP_001512518.2| PREDICTED: putative serine protease K12H4.7-like [Ornithorhynchus
anatinus]
Length = 489
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 134/456 (29%), Positives = 200/456 (43%), Gaps = 66/456 (14%)
Query: 78 RYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYG 135
+Y INDT + K P+F+ G EG + W T Y A K AL + +EHR+YG
Sbjct: 65 QYFINDTFY---KTGGPVFLMIGGEGPANPAWMHHGTWLTY--AEKLGALCLLLEHRFYG 119
Query: 136 KSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGML 195
KS P ++++ N YLSS QALAD A + + L D+ V FGGSY G L
Sbjct: 120 KSHP---TQDLSTSNLH---YLSSRQALADLAHFRTVMAEKLGLVDNKWVAFGGSYPGSL 173
Query: 196 AAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQI 255
AAWFRLKYPH+ A+A+SAPI N + ++ + R+ C ++++ + +
Sbjct: 174 AAWFRLKYPHLVDVAVATSAPIYAVMNFP---EYLEVVQKSLRTFRRECPEIVREASDTV 230
Query: 256 EETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLP----- 310
E K K+ K F +C+ L I+S + +A+ ++ SNF++ +
Sbjct: 231 VEMLKYKANYFKITKDFNLCEQ---LQIKSKMDSAYFLESLA-----SNFMDVVQYNEDN 282
Query: 311 -AFP--------VKEMCKAIDDPKTGN--DVFA----------KLYGAASVYYNYSGTAK 349
AF + +C + D G D +A ++ + Y NY
Sbjct: 283 RAFEGVKGTNITINVLCDIMSDISLGTPYDRYATVVRLLLNTFEMTCLDASYSNYVQEMT 342
Query: 350 CFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR 409
F +G + G +W +Q CTE D+K F Y + C + YG +
Sbjct: 343 NFSWDGPAATGG-RQWVYQTCTEFGFFQSTDSKKQPFSGFPLHY--HLQQCSDIYGQEFN 399
Query: 410 PNWITT-------EFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
I +GG I S I+F NG DPWS GV+ +ISK + A K
Sbjct: 400 NTLIANAIRDTNENYGGFNIT------GSRIVFPNGSIDPWSALGVISDISKDLPAAFIK 453
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
AH + ED L + R R ++ KW+ Q
Sbjct: 454 GTAHCAIMYPERTEDSAELVNARVRVFRLLQKWLKQ 489
>gi|156354267|ref|XP_001623320.1| predicted protein [Nematostella vectensis]
gi|156210006|gb|EDO31220.1| predicted protein [Nematostella vectensis]
Length = 502
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 133/473 (28%), Positives = 217/473 (45%), Gaps = 72/473 (15%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q +DHF + P+ +T+ QRY +N W G + P+ +Y G E + G + D+A
Sbjct: 63 QYIDHFEFTPRP-RTYLQRYWMNRAFWKGP--DGPVLLYVGGESVLSGGYIAGGHIVDIA 119
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
++ ALL +EHRYYGKS +G K KN YLSS ALAD A + K
Sbjct: 120 KEYGALLFAVEHRYYGKSNFFGCLKT---KNMR---YLSSQLALADLAQFVAHAKNKFGL 173
Query: 180 TD-SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI------LNFDNIVSPYSFSNI 232
TD + + +GGSY G L+AWFR+KYPH+ IGA+ASSAP+ +++N+V+ S +
Sbjct: 174 TDKNKWITYGGSYPGSLSAWFRIKYPHLVIGAVASSAPVEAQTDFKDYNNVVASSLSSPL 233
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWL---ST 289
+ S+ C I+ ++K ++ + L+K F C L ++W+ +
Sbjct: 234 V-----GGSKLCMHNIEEAFKFVDRLLDTK-NFKTLEKDFIACNDISKLN-DTWMFASNL 286
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--NYSGT 347
A + + Y N +P + +CK + N+ Y + S+ Y T
Sbjct: 287 AGFFMGLVQYN------NQVPGINIAYVCKQM------NNASRSPYKSLSILYKQQIQKT 334
Query: 348 AKCFDLNGDS----------DPHG---LSEWGWQACTEM-IMLTGGDNKDSIFEESE--- 390
A C D + ++ DP G + +W +Q+CT+ T +F +
Sbjct: 335 ASCSDFSYENFMKTVKTQKRDPDGFDMIRQWYYQSCTQFGYFQTCEPGTHCVFSKRLGII 394
Query: 391 EDYDARARYCKEAYGV-----DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSG 445
D D C+E + + R N+ +GG + S I+F NG DPW
Sbjct: 395 NDMD----LCQEVFEIALGQLKARINFTNEYYGGKRPR------GSKIVFVNGSIDPWHS 444
Query: 446 GGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
V+ N + + +A+ +H ++ + DP L + RRR I+G+W+ +
Sbjct: 445 LSVVTNQTSSEVAVFIPGTSHCANMGANQPNDPPALVEARRRVTAIVGEWLKE 497
>gi|126631396|gb|AAI33756.1| LOC495469 protein [Xenopus laevis]
Length = 505
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 143/486 (29%), Positives = 223/486 (45%), Gaps = 84/486 (17%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y Q LDHFN + T+ QRY IN+ +W + + P+F+Y G E + F+ +G
Sbjct: 63 YIVQPLDHFNRLNNA--TYNQRYWINEQYW--NHPDGPVFLYIGGESSLSEFSVLSGEHI 118
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYAS--LIIDLK 174
++A +ALLV +EHRYYG SI G + +N +LSS QALAD AS + I K
Sbjct: 119 ELAQTHRALLVSLEHRYYGSSINPDG---LTLENIK---FLSSQQALADLASFHMFISQK 172
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFD--NIVSPYSF 229
NLT ++ + FGGSY G L+AWFRLK+PH+ A+ASSAP+ L+F N V S
Sbjct: 173 YNLTRQNT-WICFGGSYPGSLSAWFRLKFPHLVYAAVASSAPVRAELDFTGYNKVVALSL 231
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWL-- 287
++ + SE C +K ++ ++ +K G + +++K F C S + S
Sbjct: 232 ADPVIGG----SEKCLDAVKKGFQAVDSLLQK-GNVTQMEKDFYSCGSLQGSDDYSEFVG 286
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK--AIDDPKTGNDVFAKLYGAASVYYNYS 345
+ A ++ Y N ++P+ V+ +C+ +I D N + L +Y N
Sbjct: 287 NLADIFMGAVQY----NGMSPVSN--VQYICQLMSIKD----NSAYEGLRSVNRMYMNSM 336
Query: 346 GTAKCFDLNGDSDPH-----------GLSEWGWQACTE----------------MIMLTG 378
G + + + S G +W +Q CTE +I L
Sbjct: 337 GLSCISNSHAKSVADLSSTKLSLIGVGERQWYYQTCTEFGYYQTCEDPSCPFSPLITLKS 396
Query: 379 G-DNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFN 437
D IF+ E ++ E YG D +P +S IIF N
Sbjct: 397 QLDLCSQIFQVPTESVLQSVQFTNEFYGAD-QPK------------------SSRIIFVN 437
Query: 438 GLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G DPW VLKN S++ +A++ +H +++ S DP L++ R+ + W+
Sbjct: 438 GDVDPWHALSVLKNQSRSEIAILINGTSHCANMKPSQTSDPLSLQEARKEIATQVATWLK 497
Query: 498 QYFQDL 503
Q+L
Sbjct: 498 SAQQEL 503
>gi|322795209|gb|EFZ18031.1| hypothetical protein SINV_09833 [Solenopsis invicta]
Length = 584
Score = 158 bits (399), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 138/467 (29%), Positives = 203/467 (43%), Gaps = 56/467 (11%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
TQ LDHFN+ + +T+ RY N KN PI + G E +I G MY++
Sbjct: 50 TQPLDHFNH--RDNRTWSMRYKENSLFL---KNGGPILIMIGGEWEITDGYLQGGLMYEI 104
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
K+ L+ + EHR+YG+S P K+I+ +N YL++ QALAD A I KK
Sbjct: 105 GVKYGGLMYYTEHRFYGQSKP---TKDISSENLQ---YLNADQALADLAYFIETKKKEKN 158
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
S V+V GGSY G +AAW RLKYPH+ GALASSAP+ + Y + ++T+
Sbjct: 159 LEKSTVIVVGGSYAGNMAAWARLKYPHLIQGALASSAPVQAKADF---YEYYEVVTKSLG 215
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK-----SEKNLA-IESWLSTAFV 292
SE C + +K ++ +EE K G +L+ F +C+ S +L + LS F
Sbjct: 216 RHSEKCVENVKIAFASVEELLAKQSGAIELKYLFNLCEVPDINSASDLGYFMNMLSEIFA 275
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYS-GTAKCF 351
D + +C + TG + + L A V+ Y KC
Sbjct: 276 GIVQYDKIEKGE-------TNIATLCHNM----TGEHLGSPLQRLAHVFLMYQKDQHKCV 324
Query: 352 DLNGD--------------SDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARA 397
D++ + + + +W Q CTE ++ SIF + D
Sbjct: 325 DVSYNNFVKTYRNVSWDSLAATSIMRQWYHQTCTEYGYFQTTNSDKSIF-GTLFPLDYYV 383
Query: 398 RYC---KEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
C +D R +GG L N+IF NG DPW VL +++
Sbjct: 384 NLCIDFNNGKWLDSRVKRTNIMYGGQLPDL------RNVIFTNGDIDPWHSLSVLHDLNA 437
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
A+ +H D+ DP LK R + IIGKWIS + +
Sbjct: 438 FSPAIFINGSSHCRDMLSDVATDPDDLKKARAKIRSIIGKWISSHLK 484
>gi|80476792|gb|AAI08760.1| MGC85068 protein [Xenopus laevis]
Length = 457
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 133/456 (29%), Positives = 212/456 (46%), Gaps = 42/456 (9%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
Q LDHFN ++ T+ QRY IN+ +W + + P+F+Y G EG + F+ +G ++
Sbjct: 17 VQPLDHFNR--RNNGTYNQRYWINEQYW--NYPDGPVFLYIGGEGSLSEFSVLSGEHVEL 72
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYAS--LIIDLKKN 176
A +ALLV +EHR+YG SI G + +N +LSS QALAD AS + I K N
Sbjct: 73 AQTHRALLVSLEHRFYGSSINIDG---LTLENIK---FLSSQQALADLASFHMFISQKYN 126
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFD--NIVSPYSFSN 231
LT ++ + FGGSY G L+AWFRLK+PH+ A+ASSAP+ L+F N V +S ++
Sbjct: 127 LTRQNT-WICFGGSYPGSLSAWFRLKFPHLVYAAVASSAPVRAELDFTGYNKVVAWSLAD 185
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
+ SE C +K + ++ +K G + +L+K F C S + S T F
Sbjct: 186 PVIGG----SEKCLDAVKEGFHAVDSLIQK-GNVTQLEKDFYSCGSLQG----SDDYTEF 236
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
V + + P V+ +C+ + N + L +Y N G +
Sbjct: 237 VGNLADIFMGAVQYNGMSPISNVQNICQLMTTKD--NSAYEGLRSVNKMYMNSMGLSCIS 294
Query: 352 DLNGDSDPH-----------GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
+ + S G +W +Q CTE ++ F ++ C
Sbjct: 295 NSHAKSVADLSSTKLSLIGVGERQWYYQTCTEFGYYQTCEDPSCPFSPLIT-LKSQLDLC 353
Query: 401 KEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
+ + V + +F G + +S IIF NG DPW VLKN S++ +A+
Sbjct: 354 FQIFQVPTESVLQSVQFTNEFYGADFPK-SSRIIFVNGDVDPWHALSVLKNQSRSEIAIF 412
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+H ++ S+ DP L++ R+ + W+
Sbjct: 413 INGTSHCANMNPSSTSDPLSLQEARKEIATQVATWL 448
>gi|307171940|gb|EFN63566.1| Putative serine protease K12H4.7 [Camponotus floridanus]
Length = 415
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 133/435 (30%), Positives = 198/435 (45%), Gaps = 58/435 (13%)
Query: 90 KNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
K N+ I + G E +I G MY++ K+ AL+ + EHRYYGKS P K+ + +
Sbjct: 11 KKNSSILIMIGGEWEISNGFLQGGLMYEIGSKYNALMYYTEHRYYGKSKP---TKDTSTE 67
Query: 150 NASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIG 209
N YL+ QALAD A I KK +S V+VFGGSY G +AAW RLKYPH+ G
Sbjct: 68 NLQ---YLNVDQALADLAYFIETKKKEKNLENSTVIVFGGSYAGNMAAWARLKYPHLIQG 124
Query: 210 ALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQ 269
ALASSAP+ + Y + ++T+ +E C + +K ++ IEE GG +KL+
Sbjct: 125 ALASSAPVYAKADF---YEYYEVVTKSLGKYNEKCVEDVKIAFDSIEELLTAEGGPDKLK 181
Query: 270 KAFRIC-----KSEKNLAIESWL--STAFVYTAMTDYPTPSNFLNPLPAF---------- 312
F +C KS +L +L + A V+ A Y N + A
Sbjct: 182 LYFNLCNVPNVKSSSDLG---YLMNTLAEVFAATVQYDNIENGQTKIAALCNIMTAAHLG 238
Query: 313 -PVKEMCKAIDDPKTGNDVFAKLYGA-ASVYYNYS-GTAKCFDLNGDSDPHGLSEWGWQA 369
P++ + + P D+ + Y + Y N S +A D+ + +W Q
Sbjct: 239 SPLQRLAHIVSKP----DMCIENYNSFIEKYRNISWDSAAAQDI--------MRQWYHQT 286
Query: 370 CTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-------VDPRPNWITTEFGGHKI 422
C+E +SIF + + C + YG ++ R +GG
Sbjct: 287 CSEYGYYQTTSANNSIFG-TLFPLNYYIDMCTDLYGDYSNDKILNSRVRRTNIMYGGQLP 345
Query: 423 GLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLK 482
+ +N+IF NG DPW VLK+++ A+V K +H D+ D + LK
Sbjct: 346 DI------TNVIFTNGDVDPWHPLSVLKDLNAFSPAIVIKGSSHCRDIYSDVDTDLEDLK 399
Query: 483 DVRRREVEIIGKWIS 497
R R +II KWIS
Sbjct: 400 KARARIRDIISKWIS 414
>gi|148236037|ref|NP_001089218.1| uncharacterized protein LOC734265 precursor [Xenopus laevis]
gi|57920938|gb|AAH89148.1| MGC85068 protein [Xenopus laevis]
Length = 506
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 133/456 (29%), Positives = 212/456 (46%), Gaps = 42/456 (9%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
Q LDHFN ++ T+ QRY IN+ +W + + P+F+Y G EG + F+ +G ++
Sbjct: 66 VQPLDHFNR--RNNGTYNQRYWINEQYW--NYPDGPVFLYIGGEGSLSEFSVLSGEHVEL 121
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYAS--LIIDLKKN 176
A +ALLV +EHR+YG SI G + +N +LSS QALAD AS + I K N
Sbjct: 122 AQTHRALLVSLEHRFYGSSINIDG---LTLENIK---FLSSQQALADLASFHMFISQKYN 175
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFD--NIVSPYSFSN 231
LT ++ + FGGSY G L+AWFRLK+PH+ A+ASSAP+ L+F N V +S ++
Sbjct: 176 LTRQNT-WICFGGSYPGSLSAWFRLKFPHLVYAAVASSAPVRAELDFTGYNKVVAWSLAD 234
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
+ SE C +K + ++ +K G + +L+K F C S + S T F
Sbjct: 235 PVIGG----SEKCLDAVKEGFHAVDSLIQK-GNVTQLEKDFYSCGSLQG----SDDYTEF 285
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
V + + P V+ +C+ + N + L +Y N G +
Sbjct: 286 VGNLADIFMGAVQYNGMSPISNVQNICQLMTTKD--NSAYEGLRSVNKMYMNSMGLSCIS 343
Query: 352 DLNGDSDPH-----------GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
+ + S G +W +Q CTE ++ F ++ C
Sbjct: 344 NSHAKSVADLSSTKLSLIGVGERQWYYQTCTEFGYYQTCEDPSCPFSPL-ITLKSQLDLC 402
Query: 401 KEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
+ + V + +F G + +S IIF NG DPW VLKN S++ +A+
Sbjct: 403 FQIFQVPTESVLQSVQFTNEFYGADFPK-SSRIIFVNGDVDPWHALSVLKNQSRSEIAIF 461
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+H ++ S+ DP L++ R+ + W+
Sbjct: 462 INGTSHCANMNPSSTSDPLSLQEARKEIATQVATWL 497
>gi|118787231|ref|XP_315944.3| AGAP005914-PA [Anopheles gambiae str. PEST]
gi|116126698|gb|EAA11647.3| AGAP005914-PA [Anopheles gambiae str. PEST]
Length = 502
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 127/460 (27%), Positives = 203/460 (44%), Gaps = 63/460 (13%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDH +P + T+QQRY +ND ++ S NAP+F+ G EG+ + G A
Sbjct: 59 QQLDH--NDPTNAATWQQRYYVNDQYFNASDPNAPVFLMIGGEGEATARWMHEGAWIRYA 116
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL--KKNL 177
K AL +EHR+YGKS P ++++ S+ YL+S QALAD A I+ + K L
Sbjct: 117 EKHGALCFQLEHRFYGKSRP---TEDLS---TSSLAYLTSEQALADLAYFIVAMNDKYQL 170
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF 237
+ FGGSY G LAAW R KYP + GA++SS P+L + V Y + +T+
Sbjct: 171 EPHRHRWIAFGGSYPGSLAAWLREKYPSLVHGAISSSGPLLAKIDFVEYY---DTVTRSL 227
Query: 238 RSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMT 297
S +C + ++ +++Q+E K G L + F++C + +IE+ L A ++ A+
Sbjct: 228 ERYSADCVRAVRSAFQQVETLLKHMIGQRTLNEKFQLCDPVER-SIENPLDIANLFEAIA 286
Query: 298 DYPTPSNFL---------NPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
SNF +P + E+C + + G V ++L + S T
Sbjct: 287 -----SNFAGVVQYNKDNSPHATVTIDEVCDVMVNQTIGAPV-SRLAEVNRILLKQSNTT 340
Query: 349 KCFDLN-------------GDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
C D G S G +W +Q C E +N S+F + +
Sbjct: 341 -CLDFVYDKSIEEMRNTSWGSSQASGARQWTYQTCNEFGFYQTSNNASSVFGD-RFPVEF 398
Query: 396 RARYCKEAYGVDPRPNWITTEFGGHKIGLVLKR----------FASNIIFFNGLRDPWSG 445
R C + YG T FG + + R +N+++ +G DPW
Sbjct: 399 FVRQCVDVYG---------TRFGAESLARAVYRTNTNYGALDPATTNVLYVHGNIDPWHR 449
Query: 446 GGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
G+ ++ + ++ AH ++ DP LK R
Sbjct: 450 LGLTESNDIHMPTILIDGTAHCANMYEPKDSDPPQLKQAR 489
>gi|66803795|ref|XP_635725.1| hypothetical protein DDB_G0290409 [Dictyostelium discoideum AX4]
gi|60464058|gb|EAL62220.1| hypothetical protein DDB_G0290409 [Dictyostelium discoideum AX4]
Length = 469
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 124/449 (27%), Positives = 210/449 (46%), Gaps = 43/449 (9%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q +DH++Y + TF+QRY++ D ++ G + PIF Y E + +F + + A
Sbjct: 54 QNVDHYDYFNNN--TFKQRYIVVDDYFTG---DGPIFFYLAGEAPMGFFGFQEVQVVNWA 108
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
F AL + +EHRYYG+S P +++ N YL+S QAL+D A+ + K++
Sbjct: 109 QDFGALFIVLEHRYYGESYPV---DDLSTHNLK---YLTSQQALSDAANFLSTYKQDNNL 162
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS-FSNIITQDFR 238
D+ VVVFG SY G L+AWFRLKYP++ + ++A S P+L N Y+ FSN
Sbjct: 163 IDNQVVVFGCSYSGALSAWFRLKYPNLVVASVAPSGPVLAQLNYTGYYAQFSN------- 215
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTD 298
S +C + + +I + G ++L+K F C S + + + F+Y ++TD
Sbjct: 216 SAQPDCVAATQQATNEIMQLIANESGRKQLEKTFNSCHSLDDPRDQYY----FLY-SITD 270
Query: 299 YPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYN------YSGTAKCFD 352
S+ +N P + + C+ + + +A++ N + +
Sbjct: 271 ALGGSDQMNNPPTWILNSTCQMLLQNTNYVNNWAQIVNVGQTQCNDFRLKSFIEQLRDIS 330
Query: 353 LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY---GVDPR 409
+N +SD W +Q C E + S+F + + + ++C+E Y G+ P
Sbjct: 331 INDNSDNR---MWTYQTCVEFGYFSTAYPGTSVFPPV-LNVEEQTKWCEEIYDIPGMTPN 386
Query: 410 PNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVD 469
+ +GG I SNI+F NGL DPW V ++ + V E H
Sbjct: 387 IDATNNYYGGQNIQ------GSNIMFTNGLLDPWHLLSVNEDNQAGTVKAVTYEAGHCGS 440
Query: 470 LRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
L +T +DP L + R+ + + +SQ
Sbjct: 441 LIATTNDDPISLTNARQEVLSFLKLVLSQ 469
>gi|56757429|gb|AAW26883.1| SJCHGC06819 protein [Schistosoma japonicum]
Length = 331
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/316 (32%), Positives = 149/316 (47%), Gaps = 19/316 (6%)
Query: 194 MLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF-RSVSENCYKVIKGSW 252
ML+AW R KYP+ GA+ASSAP+ F + FS + T F + +NC K I+ SW
Sbjct: 1 MLSAWIRQKYPNQIAGAIASSAPVWLFPGLSDCNGFSLVATNSFLKYGGDNCVKNIQHSW 60
Query: 253 KQIEETAKKPGGLEKLQKAFRICKSEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPLPA 311
I + + G E L F IC ++ I +LS +M +YP P+NFL LPA
Sbjct: 61 SNIVDIGQSFDGKELLTNMFNICTPLTDVQNIIDYLSDYLGTISMVNYPYPANFLGTLPA 120
Query: 312 FPVKEMCK--AIDDPK----TGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEW 365
+PVK +C + DP+ T + AK A + NY+G C D++G W
Sbjct: 121 WPVKYLCSNLTVYDPQQPVVTRISLLAKAILALT---NYTGNQNCLDISGSLPGIDAKAW 177
Query: 366 GWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLV 425
Q C EM +I D ++ + YC++ YG+ PR NW EF + +
Sbjct: 178 EIQTCMEMTTPMCASGAVNIMPPVNWDLNSFSAYCQKQYGISPRVNWPKVEFWSKSVDTI 237
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK--EGAHHVDLRFSTKEDPQWLKD 483
+NI+F NG DPW + N S A V + AHH+DLR DP +
Sbjct: 238 -----TNIVFSNGEIDPWFALSI-TNSSYVPFATVINIADAAHHLDLRTPNPADPDSVVK 291
Query: 484 VRRREVEIIGKWISQY 499
R E + I +WI ++
Sbjct: 292 ARTLEKQKIIQWIKEW 307
>gi|328874647|gb|EGG23012.1| Putative serine protease [Dictyostelium fasciculatum]
Length = 490
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 131/454 (28%), Positives = 193/454 (42%), Gaps = 46/454 (10%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV- 118
Q +DHF+ P + TF+Q+Y INDT+W + P+F G EG I N F+ +
Sbjct: 64 QQVDHFD--PLNQDTFKQQYFINDTYW---RPGGPVFFVLGGEGPISPGYVNGHFVVNTY 118
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
A F AL+V EHR+YG S P+ + L++ QALADYA+ +
Sbjct: 119 AQLFDALIVACEHRFYGYSSPH------PTLDTKHLHLLTTEQALADYANFRQFIAAKYN 172
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
S + FGGSY G L+AW RLKYP + GA+A+SAP + + F+ +
Sbjct: 173 TGSSKWISFGGSYSGSLSAWLRLKYPQLIDGAIATSAP------VEAQLDFTQYLEVVSA 226
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMT 297
S+ C ++K QI G +++ F C L I +++ + + +A++
Sbjct: 227 SIGPACSAIVKNV-TQIVTQMIANGQTSQVESLFNTCDPISSELDIATFMES--LTSAVS 283
Query: 298 DYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYG----------AASVYYNYSGT 347
+ +N N + MC + FA L S Y G
Sbjct: 284 EIVQYNNDNNNYSFANITTMCDMLSKGNNQLQAFADLNNKYNDFNGDNCTTSSYEKMIGQ 343
Query: 348 AKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-- 405
+ +NG + L W WQ CTE G + F ++ D + C + +G
Sbjct: 344 MQETQVNGPNAATRL--WTWQCCTEYAYFQTGQSALQPFSDTLT-LDYFIQQCTDTFGPP 400
Query: 406 ---VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
P +WI E+GG I S IF NGL DPW GV+ S +V +
Sbjct: 401 GYTYQPNIDWIINEYGGKNIQ------TSQTIFPNGLVDPWHVLGVMNTTSSSVYTITIS 454
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
GAH DL D L RR E+++I I
Sbjct: 455 TGAHCSDLYPPLPTDSDDLVLARRMEIDLISTVI 488
>gi|90077758|dbj|BAE88559.1| unnamed protein product [Macaca fascicularis]
Length = 229
Score = 155 bits (391), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 84/216 (38%), Positives = 120/216 (55%), Gaps = 18/216 (8%)
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKC 350
F AM DYP P++FL PLPA PVK C D + L A + YN SG+ C
Sbjct: 3 FTVLAMMDYPYPTDFLGPLPANPVKVGC---DRLLSETQRITGLRALAGLVYNASGSEHC 59
Query: 351 FDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
+D+ +DP G W +QACTE+ + +N +F + + R +YC
Sbjct: 60 YDIYRLYRSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDERRQQYC 119
Query: 401 KEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
+ +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+ +N+S +V+A+
Sbjct: 120 LDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIAIT 174
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ GAHH+DLR S EDP + + R+ E +IG+W+
Sbjct: 175 IQGGAHHLDLRASHPEDPVSVVEARKLEATVIGEWV 210
>gi|407038295|gb|EKE39045.1| serine carboxypeptidase (S28) family protein [Entamoeba nuttalli
P19]
Length = 466
Score = 154 bits (390), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 140/475 (29%), Positives = 205/475 (43%), Gaps = 73/475 (15%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
+ TQ +DHF+ + +T RY INDT + AP+ V G EG A F+
Sbjct: 32 RVSTQPIDHFDLTNK--KTINIRYFINDTIYS---KEAPLLVDLGGEGTQRAAAVGGRFV 86
Query: 116 YD-VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+ A K+ +L++ IEHR+YGKS+P GG + GYLS+ QAL DY +I +K
Sbjct: 87 INKYAEKYNSLMLAIEHRFYGKSVPEGGLSQ------ENLGYLSAAQALEDYVMIINQIK 140
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
K T PV+VFGGSY G LA W R KYP+V A+ASSAP+ Y F ++I
Sbjct: 141 KEYQIT-GPVIVFGGSYSGNLATWIRQKYPNVVYAAVASSAPVYATSTF---YEFLDVIY 196
Query: 235 QDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNLAIESWLSTAF 291
D + E C K + + IEE K G +L+ F+ C K E +L I
Sbjct: 197 ND---MGEKCGNAWKEATESIEELFKTDSGKAQLKNDFKTCTEIKEEDDLTI----LIQQ 249
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAI-DDPKTGNDVFAKLYGAASVYYNYSGTAKC 350
+ M +YP N + ++ +C + + KT + L A + +
Sbjct: 250 IQATMVNYPQ----YNGSYSLTIEGVCNVLTTEGKTAYENMVDLMSHAFNEFGFKCAPSS 305
Query: 351 F-DL-----NGDSDPHG-----LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR- 398
+ D+ N ++ G W WQ C+E S F+ E R
Sbjct: 306 YADMLTDMANTKTEEEGNRLASTRSWAWQICSEY----------SYFQPVNESLPFSKRL 355
Query: 399 -------YCKEAYGVDP-----RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
CK+ + VD R + +GG+K A+N+ + +G DPWS
Sbjct: 356 NNEFYYLLCKDIFNVDKQRLDRRVHHTNLMYGGYKPK------ATNVAYTSGSTDPWSPL 409
Query: 447 GVLKNISKTV--LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+ + + A K AH DL DP+ LK R + I + IS+Y
Sbjct: 410 AKHETLPSDINCYASYIKGTAHCADLYAEKDTDPEQLKQQRMETAQFIDELISRY 464
>gi|281206134|gb|EFA80323.1| Putative serine protease [Polysphondylium pallidum PN500]
Length = 484
Score = 154 bits (390), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 144/470 (30%), Positives = 219/470 (46%), Gaps = 73/470 (15%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
++++ Q +DH Y+P + TF+Q+Y +NDT+W P+F+ G EG +
Sbjct: 52 QSQWFDQQVDH--YDPLNTATFKQQYFVNDTYW---TTGGPVFLLLGGEGPASVTSVTGH 106
Query: 114 FMYDV-APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL--I 170
F+ + A +F AL+V +EHR+YGKS P +K +A + + L++ QALAD+A+
Sbjct: 107 FVINTYAQQFGALIVSVEHRFYGKSSP---SKTLATEYLN---LLTTQQALADFANFRQF 160
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPY 227
I K N+ +T + V FGGSY G L+AW RLKYP + A+A+SAP+ L+F P
Sbjct: 161 IAAKYNVPST-TKWVSFGGSYSGSLSAWLRLKYPQLIDAAIATSAPVQPQLDF-----PE 214
Query: 228 SFSNIITQDFRSVSENCYKVI---KGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIE 284
F + RSV C I Q+ +T +K ++K F C I
Sbjct: 215 YFEVVA----RSVGPACSARIAEVTNLVTQMLQTDRK-----TVEKLFNTCD-----PIV 260
Query: 285 SWLSTAFVYTAMTDYPTP----SNFLNPLPAFPVKEMCKAID--DPKTG----NDVFAKL 334
S A + +++D + +N N F + MC +D DP N+ F +
Sbjct: 261 SSDDVATFFESLSDGISEIVQYNNDNNKYTMFNISHMCSLLDGGDPLQSFVNFNNEFNQF 320
Query: 335 YG---AASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEE 391
G S Y + + ++NG++ L W WQ CTE ++ + F S
Sbjct: 321 SGNKCTQSSYKSMIAQMRETEVNGENAAGRL--WTWQTCTEYGYFQTSESPNQPFSSSIT 378
Query: 392 DYDARARYCKEAYGVDPRPN---------WITTEFGGHKIGLVLKRFASNIIFFNGLRDP 442
D + C + +G P+P+ WI T+FGG I SN IF NGL DP
Sbjct: 379 -LDWFLQQCADIFG--PKPDGKPYLPAIEWIETDFGGRNIQ------TSNTIFPNGLIDP 429
Query: 443 WSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEII 492
W GVL + ++ + GAH DL D + L R+ EV +I
Sbjct: 430 WHILGVLNATTSSISTAIIPLGAHCSDLYPPLPTDNEALVLARQMEVNLI 479
>gi|125980321|ref|XP_001354185.1| GA22150 [Drosophila pseudoobscura pseudoobscura]
gi|195174498|ref|XP_002028010.1| GL15050 [Drosophila persimilis]
gi|54642489|gb|EAL31237.1| GA22150 [Drosophila pseudoobscura pseudoobscura]
gi|194115732|gb|EDW37775.1| GL15050 [Drosophila persimilis]
Length = 508
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 136/490 (27%), Positives = 227/490 (46%), Gaps = 54/490 (11%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P K+ +L S DS+ L+ Q LDHF ++ +T+QQRY +N ++ + + API
Sbjct: 44 PSKIPTL-QRSLDSEDLW----FEQRLDHFQ--ARNTRTWQQRYFVNADYYR-NDSTAPI 95
Query: 96 FVYTGNEGDI--EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
F+ G EG+ +W + Y A F AL + +EHR+YGKS P ++++ N +
Sbjct: 96 FLMIGGEGEASAKWMREGAWVHY--AEHFDALCIQLEHRFYGKSHP---TRDLSTANLA- 149
Query: 154 TGYLSSTQALADYASLI--IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGAL 211
+LSS QALAD A+ + + +K NL T V FGGSY G LAAW R KYPH+ G++
Sbjct: 150 --FLSSEQALADLANFVAAMKVKYNLAETQK-WVAFGGSYPGSLAAWAREKYPHLIYGSI 206
Query: 212 ASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA 271
++S P+L V + ++ + + +C + + S+ Q+E K G L +
Sbjct: 207 STSGPLL---AEVDFREYFEVVKASLATYNPDCVEAVTRSFTQVEILLKHMIGQRNLDEK 263
Query: 272 FRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL---------NPLPAFPVKEMCKAID 322
F+ C K+ +IE+ L A ++ + NF +P + E+C +
Sbjct: 264 FKTCTPIKD-SIENQLDIASLFENLA-----GNFAGVVQYNKDNSPHATITIDEICDVML 317
Query: 323 DPKTG---------NDVFAKLYGAASVYYNYSG-TAKCFDLNGDSD-PHGLSEWGWQACT 371
+ +G ND+ K A + Y Y + + + DS+ G+ +W +Q C
Sbjct: 318 NMTSGPPVTRLGVVNDMLLKQSNATCLDYKYDKMVSDMRNASWDSEAAKGMRQWTYQTCN 377
Query: 372 EMIMLTGGDNKDSIFEESEEDYDARARYCKEAY--GVDPRPNWITTEFGGHKIGLVLKRF 429
E +NK F + D R C + + +D R G LK
Sbjct: 378 EFGFYQTSENKTDTFGD-RFGVDFFIRQCMDVFSNSMDARYLQNVVSQTNKHYG-ALKPE 435
Query: 430 ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREV 489
+N+++ +G DPW G++K+ + + + AH ++ TK DP L R + V
Sbjct: 436 TTNVLYVHGSIDPWHALGLVKSSNAATPTIFIEGTAHCANMYEPTKTDPPQLVAARNKIV 495
Query: 490 EIIGKWISQY 499
+ + K + Y
Sbjct: 496 KYLAKLLEGY 505
>gi|330790734|ref|XP_003283451.1| hypothetical protein DICPUDRAFT_147119 [Dictyostelium purpureum]
gi|325086716|gb|EGC40102.1| hypothetical protein DICPUDRAFT_147119 [Dictyostelium purpureum]
Length = 487
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 133/456 (29%), Positives = 214/456 (46%), Gaps = 61/456 (13%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHW--GGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV- 118
+DH YNP + +TF+QRY +NDT+W GG P+F+ G EG I F+ +
Sbjct: 64 VDH--YNPLNTETFKQRYYVNDTYWTPGG-----PVFLVLGGEGPISPSYVTGHFVVNYY 116
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
AP F AL+V +EHR+YG S P G +A +N YLS+ QALADYA+ + K+
Sbjct: 117 APMFDALIVAVEHRFYGASTPKG---NLATENLK---YLSTQQALADYANFVQFFKQKYN 170
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
DS V FGGSY G L+AW RLKYP++ A+A+SAP+ +V + +++
Sbjct: 171 TGDSKWVSFGGSYSGNLSAWLRLKYPNLIDAAIATSAPV---KPVVDFPEYFEVVS---N 224
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL-STAFVYTAM 296
S+ +C + K + + G +++ K F C +L I +++ S + + +
Sbjct: 225 SIGPSCSAFVANITKTVTDMINN-GQNDQVAKLFNACDPIVSDLDIATFMESLSGGISEI 283
Query: 297 TDYPTPSN---FLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA----- 348
Y +N F N + MC+ + G+D + Y +SG+
Sbjct: 284 VQYNLDNNAYTFTN------ITAMCEEFEQ---GSDPMQTFIDFNNRYNTFSGSPCTLSS 334
Query: 349 --KCFDLNGDSDPHGLS----EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKE 402
K + DP ++ W WQ CTE G++ F S D C +
Sbjct: 335 YEKSVIYQQNIDPANVNASSRSWNWQCCTEYGYYQTGESPSQPF-SSTITLDYFINMCTD 393
Query: 403 AYGVD-----PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN-ISKTV 456
+G + P+ ++I T++G I +SNI+ +G DPWS GV + + +V
Sbjct: 394 VFGPEGFVYKPQVDYIITDYGSTNIQ------SSNIVMASGTIDPWSFLGVHQTPLKSSV 447
Query: 457 LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEII 492
++ + GAH +L + D + R E+++I
Sbjct: 448 QPILIQGGAHCSELYMPKEHDLPDVVTARLVEIQLI 483
>gi|347965569|ref|XP_321914.5| AGAP001240-PA [Anopheles gambiae str. PEST]
gi|333470451|gb|EAA01781.5| AGAP001240-PA [Anopheles gambiae str. PEST]
Length = 496
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 123/460 (26%), Positives = 213/460 (46%), Gaps = 34/460 (7%)
Query: 51 GLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQ 110
G +TK+ Q LDHF+ PQ+ T+ RY+ N H+ P+F+Y G E +I +
Sbjct: 58 GPVETKHIMQRLDHFD--PQNVNTWSMRYMANGEHY---VEGGPLFIYVGGEWEISEGSI 112
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
+ G +YD+A + K L + EHR+YG+S P + K YL+ QALAD A +
Sbjct: 113 SRGHVYDMAAELKGYLFYTEHRFYGQSHPTVDLRTDKLK------YLNIDQALADLAHFV 166
Query: 171 IDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
++++K + A S V++ GGSY + +WFR KYPH+ GA ASSAP+ V +
Sbjct: 167 VEMRKTIPGAEKSGVIMIGGSYSATMVSWFRQKYPHLINGAWASSAPVF---AKVEFTEY 223
Query: 230 SNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLS 288
I+T+ R V ++C I+ + +Q EE + G + + F++C ++ + L
Sbjct: 224 KEIVTESIRLVGGQSCADRIERAIRQTEELLDR-GEYASVAQEFQLC---SDVDLSQPLD 279
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV--FAKLY--GAASVYYNY 344
+++++D + + ++ +C+ I+D +D+ AKL G S N
Sbjct: 280 RMNFFSSLSD--EFAGVVQYHSTGDIEGVCQVIEDATITDDMQALAKLVTRGLTSTNCNS 337
Query: 345 SGTAKCFDL------NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
G D N + + +W +Q C E + IF S D +
Sbjct: 338 YGYKAMVDYYKNTAWNEGAAMSSMRQWLYQTCAEYGWYQISGSSKQIFGSSFP-VDLFVK 396
Query: 399 YCKEAY-GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL 457
C + Y G + + + I +N+ F G DPW G+ ++++
Sbjct: 397 LCGDLYDGFFDKTRMMNNADRTNVIYGGWNPEVTNVFFTQGQLDPWRAMGIQQDLNDQSP 456
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
A+V AH DL T +D ++ + + +E++ KW++
Sbjct: 457 AVVIPGAAHCADLSSITAQDSAEMRAAKEKILELVKKWLA 496
>gi|195126042|ref|XP_002007483.1| GI12975 [Drosophila mojavensis]
gi|193919092|gb|EDW17959.1| GI12975 [Drosophila mojavensis]
Length = 507
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 140/517 (27%), Positives = 228/517 (44%), Gaps = 60/517 (11%)
Query: 11 CLLF-SSTLTISNAKIFPTF---PSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFN 66
CL+ SS+L N + F PSS T ++ S D + L+ Q L+HF
Sbjct: 16 CLIAQSSSLGFRNGRHMNGFLGQPSSMPTLQR-------SLDVEDLW----FEQRLNHFK 64
Query: 67 YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI--EWFAQNTGFMYDVAPKFKA 124
P +T+QQRY +ND + + + AP+F+ G EG+ W + Y A F A
Sbjct: 65 --PDDTRTWQQRYFVNDAFYR-NDSQAPVFLMIGGEGEATKNWMREGAWIHY--AEHFGA 119
Query: 125 LLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPV 184
L + +EHR+YGKS P + + S YLSS QALAD + + +K+ DS
Sbjct: 120 LCIQLEHRFYGKSHP------TSDLSNSNLAYLSSEQALADLGNFVSAMKRQYNMADSQK 173
Query: 185 -VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN 243
+ FGGSY G LAAW R KYPH+ GA++SS P+L V + ++ S
Sbjct: 174 WIAFGGSYPGSLAAWAREKYPHLIDGAISSSGPLL---AQVDFTQYFEVVKASLASYKPE 230
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPS 303
C + + + Q+E K G L + F+ C K+ +IE+ L A ++ + S
Sbjct: 231 CVEAVSRGFAQVEILLKHMIGQRNLDEKFKTCTPLKD-SIENKLDIANLFENIA-----S 284
Query: 304 NFL---------NPLPAFPVKEMCKAIDDPKTG---------NDVFAKLYGAASVYYNYS 345
NF +P + ++C + + G ND+ K + Y Y
Sbjct: 285 NFAGVVQYNKDNSPHAKITIDQICDVMLNTTMGPPVTRLGAVNDLLLKQSKTKCLDYKYD 344
Query: 346 GT-AKCFDLNGDSD-PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEA 403
A+ +++ DS+ G+ +W +Q CTE +NK F + D R C +
Sbjct: 345 KMIAEMKNVSWDSEVAKGMRQWTYQTCTEFGFYQTSENKSDTFGD-RFGIDFFIRQCMDI 403
Query: 404 YGVDPRPNWITTEFG-GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
+ ++ +K LK +N+++ +G DPW G+ + + A+ +
Sbjct: 404 FSDRMNGKFLEQAVAQTNKYYGALKPGTTNVLYVHGSIDPWHALGLYVSTNSNTPAIYIE 463
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
AH ++ DP LK+ R + ++ + K + Y
Sbjct: 464 GTAHCANMYEPANCDPPQLKEARNKILKYLAKLLDGY 500
>gi|167383690|ref|XP_001736631.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165900889|gb|EDR27114.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 466
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 137/469 (29%), Positives = 205/469 (43%), Gaps = 67/469 (14%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYD- 117
TQ +DHF+ + +T RY INDT + AP+ V G EG + A F+ +
Sbjct: 35 TQPIDHFDLTNK--KTINIRYFINDTIYS---KEAPLLVDLGGEGPQKAAAVGGRFVINK 89
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
A K+ +L++ IEHR+YGKS+P GG + GYLS+ QAL DY +I +KK
Sbjct: 90 YAEKYNSLMLAIEHRFYGKSVPEGGLSQ------ENLGYLSAAQALEDYIMIINQIKKEY 143
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF 237
T PV+VFGGSY G LAAW R KYP+V A+ASSAP+ Y F ++I D
Sbjct: 144 QVT-GPVIVFGGSYSGNLAAWIRQKYPNVVYAAVASSAPVYATSTF---YEFLDVIYND- 198
Query: 238 RSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMT 297
+ E C K + + IEE K G +L+ F C ++ N + + + M
Sbjct: 199 --MGEKCGNAWKEATESIEELFKTDSGKAQLKSDFNAC-TDINGEDDLTILIQQIQATMI 255
Query: 298 DYPTPSNFLNPLPAFPVKEMCKAI-DDPKTGNDVFAKLYGAASVYYNYSGTAKCF-DL-- 353
+YP N + ++ +C + + KT + +L A + + + D+
Sbjct: 256 NYPQ----YNGSYSLTIEGVCSILTTEGKTAYENMVELMNHAFNEFGFKCAPSSYADMLT 311
Query: 354 ---NGDSDPHG-----LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR------- 398
N +D G W WQ C+E S F+ E R
Sbjct: 312 DMANTKTDEEGNRLASTRSWAWQICSEY----------SYFQPVNESLPFSKRLNNEFYY 361
Query: 399 -YCKEAYGVDP-----RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
CK+ + VD R + +GG++ A+N+ + +G DPWS + +
Sbjct: 362 LLCKDIFNVDKQRLDRRVHHTNLMYGGYQPK------ATNVAYTSGSTDPWSPLAKHETL 415
Query: 453 SKTVLALVE--KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+ K AH DL DP+ LK R + I + IS+Y
Sbjct: 416 PSDINCYASHIKGTAHCADLYAEKDTDPEQLKQQRIETAQFIDELISRY 464
>gi|332021809|gb|EGI62155.1| Putative serine protease K12H4.7 [Acromyrmex echinatior]
Length = 493
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 139/515 (26%), Positives = 235/515 (45%), Gaps = 62/515 (12%)
Query: 12 LLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQS 71
L+ S L+I+ A + TF R L + I SS ++ Q LDHF+ P +
Sbjct: 9 LIVFSILSITTA--WRTFLRGRSKGGNLGNPILSSDTP--FPTDQWFLQYLDHFD--PTN 62
Query: 72 YQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMYDVAPKFKALLVFI 129
+QQR+ +N + K N PIF+ G EG + W + Y A +F A+ ++
Sbjct: 63 VNDWQQRFFVNVDFY---KPNGPIFLMIGAEGTANASWMVEGEWIEY--AKEFGAMCFYL 117
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGG 189
EHRYYGKS P +++ KN YLSS QALAD A I + +L ++ +VFGG
Sbjct: 118 EHRYYGKSHP---TIDLSVKNLM---YLSSEQALADLAYFIASVNVDL-PRNTKWIVFGG 170
Query: 190 SYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIK 249
SYGG LAAW R KYPH+ GA+++S P+L + Y ++T + S+ C ++I+
Sbjct: 171 SYGGSLAAWMRAKYPHLVHGAVSTSGPLLAQIDFSEYYQ---VVTNALKEYSDQCVRIIQ 227
Query: 250 GSWKQIEETAKKPGGLEKLQKAFRICK------------SEKNLAIESWLSTAFVYTAMT 297
+ Q+ G +++QK FR+C S A+ S ++ Y
Sbjct: 228 EANSQLNIMLHHTVGQQQIQKKFRLCDPIDPGYTKLVDISNLYEALASNFASVVQYNK-- 285
Query: 298 DYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDS 357
D S+F+N ++ +C + D K + +L S+ N + KC D D+
Sbjct: 286 DNRQSSSFVN----ITIETVCDILVDEKIEKAI-DRLAYVNSMILN-ATKEKCLDYRYDT 339
Query: 358 DPHGL-------------SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
H L +W +Q C+E + +F ++ + A+ C + +
Sbjct: 340 MIHELRNITWTSEQAEGGRQWIYQTCSEFGFFQTSTGQIDVF-GNKFPVEFFAQQCVDIF 398
Query: 405 GVDPRPNWITTEFGGHKIGLV---LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE 461
G P+ N + + ++ L +N++F +G DPW G++++ + A+
Sbjct: 399 G--PKYNMDLLKSAVTRTNILYGALNLQVTNVVFVHGSVDPWHVLGIVQSSNPQAPAIYI 456
Query: 462 KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
AH ++ ++ D LK+ R+ +I +W+
Sbjct: 457 NGTAHCANMYPPSERDMPQLKEARKVIRGLIKQWL 491
>gi|67472594|ref|XP_652089.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
gi|56468897|gb|EAL46703.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
Length = 466
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 140/472 (29%), Positives = 202/472 (42%), Gaps = 73/472 (15%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYD- 117
TQ +DHF+ + +T RY INDT + AP+ V G EG A F+ +
Sbjct: 35 TQPIDHFDLTNK--KTINIRYFINDTIYS---KEAPLLVDLGGEGTQRAAAVGGRFVINK 89
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
A K+ +L++ IEHR+YGKS+P GG + GYLS+ QAL DY +I +KK
Sbjct: 90 YAEKYNSLMLAIEHRFYGKSVPEGGLSQ------ENLGYLSAAQALEDYIMIINQIKKEY 143
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF 237
T PV+VFGGSY G LA W R KYP+V A+ASSAP+ Y F ++I D
Sbjct: 144 QIT-GPVIVFGGSYSGNLATWIRQKYPNVVYAAVASSAPVYATSTF---YEFLDVIYND- 198
Query: 238 RSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNLAIESWLSTAFVYT 294
+ E C K + IEE K G +L+ F+ C K E +L I +
Sbjct: 199 --MGEKCGNAWKEATDSIEELFKTDSGKAQLKNDFKTCTEIKEEDDLTI----LIQQIQA 252
Query: 295 AMTDYPTPSNFLNPLPAFPVKEMCKAI-DDPKTGNDVFAKLYGAASVYYNYSGTAKCF-D 352
M +YP N + ++ +C + + KT + L A + + + D
Sbjct: 253 TMVNYPQ----YNGSYSLTIEGVCNILTTEGKTAYENMVDLMSHAFNEFGFECAPSSYAD 308
Query: 353 L-----NGDSDPHG-----LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR---- 398
+ N ++ G W WQ C+E S F+ E R
Sbjct: 309 MLTDMANTKTEEEGNRLASTRSWAWQICSEY----------SYFQPVNESLPFSKRLNNE 358
Query: 399 ----YCKEAYGVDP-----RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
CK+ + VD R +GG+K A+N+ + +G DPWS
Sbjct: 359 FYYLLCKDIFNVDKQRLDRRVYHTNLMYGGYKPK------ATNVAYTSGSTDPWSPLAKH 412
Query: 450 KNISKTV--LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+ + + A K AH DL DP+ LK R + I + IS+Y
Sbjct: 413 ETLPSDINCYASYIKGTAHCADLYAEKDTDPEQLKQQRMETAQFIDELISRY 464
>gi|449706553|gb|EMD46378.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
KU27]
Length = 466
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 140/472 (29%), Positives = 202/472 (42%), Gaps = 73/472 (15%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYD- 117
TQ +DHF+ + +T RY INDT + AP+ V G EG A F+ +
Sbjct: 35 TQPIDHFDLTNK--KTINIRYFINDTIYS---KEAPLLVDLGGEGTQRAAAVGGRFVINK 89
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
A K+ +L++ IEHR+YGKS+P GG + GYLS+ QAL DY +I +KK
Sbjct: 90 YAEKYNSLMLAIEHRFYGKSVPEGGLSQ------ENLGYLSAAQALEDYIMIINQIKKEY 143
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF 237
T PV+VFGGSY G LA W R KYP+V A+ASSAP+ Y F ++I D
Sbjct: 144 QIT-GPVIVFGGSYSGNLATWIRQKYPNVVYAAVASSAPVYATSTF---YEFLDVIYND- 198
Query: 238 RSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNLAIESWLSTAFVYT 294
+ E C K + IEE K G +L+ F+ C K E +L I +
Sbjct: 199 --MGEKCGNAWKEATDSIEELFKTDSGKAQLKNDFKTCTEIKEEDDLTI----LIQQIQA 252
Query: 295 AMTDYPTPSNFLNPLPAFPVKEMCKAI-DDPKTGNDVFAKLYGAASVYYNYSGTAKCF-D 352
M +YP N + ++ +C + + KT + L A + + + D
Sbjct: 253 TMVNYPQ----YNGSYSLTIEGVCNILTTEGKTAYENMVDLMSHAFNEFGFECAPSSYAD 308
Query: 353 L-----NGDSDPHG-----LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR---- 398
+ N ++ G W WQ C+E S F+ E R
Sbjct: 309 MLTDMANTKTEEEGNRLASTRSWAWQICSEY----------SYFQPVNESLPFSKRLNNE 358
Query: 399 ----YCKEAYGVDP-----RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
CK+ + VD R +GG+K A+N+ + +G DPWS
Sbjct: 359 FYYLLCKDIFNVDKQRLDRRVYHTNLMYGGYKPK------ATNVAYTSGSTDPWSPLAKH 412
Query: 450 KNISKTV--LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+ + + A K AH DL DP+ LK R + I + IS+Y
Sbjct: 413 ETLPSDINCYASYIKGTAHCADLYAEKDTDPEQLKQQRMETAQFIDELISRY 464
>gi|357612135|gb|EHJ67826.1| putative thymus-specific serine protease [Danaus plexippus]
Length = 494
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 138/498 (27%), Positives = 216/498 (43%), Gaps = 58/498 (11%)
Query: 28 TFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWG 87
F R L + + + KDS L KY Q LDH + Q Y ++QRY +N++ +
Sbjct: 23 NFRLGRSKHGNLGAPVGADKDS--LPPNKYFLQKLDHSSPTDQRY--WEQRYFVNESFYD 78
Query: 88 GSKNNAPIFVYTGNEG--DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKE 145
+ N P+F+ G EG D W + T D A FKAL + +EHRYYG+S P +
Sbjct: 79 FN-NPGPVFLMIGGEGTADPRWMVKGTWI--DYAIHFKALCILLEHRYYGQSRP---TMD 132
Query: 146 IAYKNASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYP 204
++ KN YLSS QALAD A I + D VVFGGSY G LAAW RLKYP
Sbjct: 133 LSVKNLQ---YLSSYQALADLAYFINAMNNKYKFNKDVKWVVFGGSYPGSLAAWMRLKYP 189
Query: 205 HVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS--ENCYKVIKGSWKQIEETAK-K 261
H+ A++SS P++ N + + ++ R + E C +K + KQI+E K
Sbjct: 190 HLVHAAVSSSGPLVAKVNFM---EYFQVVVNALREKTGGEECVGQVKLAHKQIQEIIKTD 246
Query: 262 PGGLEKLQKAFRIC----KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNP--LPAFPVK 315
P +E + FR+C K+ +N + S A + + Y + +
Sbjct: 247 PATIE---REFRVCEPFSKASQNDMKNFYNSIADDFADLVQYNEDNRISGDKMYKNLTIN 303
Query: 316 EMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDS----------DPHGLSEW 365
+C + +P G F KL S+ N S C D D+ G +W
Sbjct: 304 SVCDMLTEP-GGKPAFKKLAAYNSIVLNKSNQT-CLDYGYDNMIKEMRNISWGSEGGRQW 361
Query: 366 GWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-------VDPRPNWITTEFG 418
+Q CTE ++ +F + ++ + CK+ +G ++ W +++G
Sbjct: 362 MYQTCTEFGFYQTSSSEIEVFGDFSLEFFIQQ--CKDVFGSKFNDAFINDAAKWTNSDYG 419
Query: 419 GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDP 478
G I A +++ +G DPW G+ A+ + AH ++ ++K D
Sbjct: 420 GLNIP------AKRVVYVHGSIDPWHALGMTTTEENDAPAIFIRGTAHCANMYPASKNDN 473
Query: 479 QWLKDVRRREVEIIGKWI 496
L R + W+
Sbjct: 474 PGLVSARMEVRSYLESWL 491
>gi|402591691|gb|EJW85620.1| serine carboxypeptidase S28 family protein [Wuchereria bancrofti]
Length = 183
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 86/187 (45%), Positives = 115/187 (61%), Gaps = 21/187 (11%)
Query: 41 SLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTG 100
++I +S D+ L+ T I++ NY + +YL N +++ + P+F Y G
Sbjct: 7 TIIRNSSDN--LWMTTEINAIMESMNYE------WDVKYLANYSYF---LCDGPLFFYAG 55
Query: 101 NEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSST 160
NEGDIE FAQNTG ++D+AP+F A +VF EHRYYG S PYG +Y + GYL+
Sbjct: 56 NEGDIEAFAQNTGIIWDLAPRFHAAIVFAEHRYYGDSKPYGKR---SYMDVLRLGYLNDI 112
Query: 161 QALADYASLIIDLKKN-------LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALAS 213
Q LAD+A LI LK + T+ PV+VFGGSYGGMLA W R+KYPH+ GA AS
Sbjct: 113 QVLADFAQLITFLKIDDEELGFCPPGTEIPVIVFGGSYGGMLATWLRMKYPHIVDGAWAS 172
Query: 214 SAPILNF 220
SAP+ NF
Sbjct: 173 SAPLRNF 179
>gi|410910698|ref|XP_003968827.1| PREDICTED: thymus-specific serine protease-like [Takifugu rubripes]
Length = 493
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 129/481 (26%), Positives = 220/481 (45%), Gaps = 69/481 (14%)
Query: 52 LYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFA 109
++ ++ +Q LDHF+ + + ++ QRY ++ + K + P+F+ G EG + W
Sbjct: 48 VFDEQWFSQRLDHFSADSREWK---QRYFLSQAFY---KPDGPVFLMIGGEGPANPAWMQ 101
Query: 110 QNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL 169
T Y A K AL + +EHR+YGKS P +++ N +LSS QALAD A
Sbjct: 102 YGTWLTY--AEKLGALCLMLEHRFYGKSRP---TSDLSTDNLR---FLSSRQALADLAHF 153
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
+ + L T++ V FGGSY G LAAWFRLKYPH+ A+A+SAP+ N +
Sbjct: 154 RTTIAEALGLTNAKWVAFGGSYPGSLAAWFRLKYPHMVHAAVATSAPVRATVNFP---EY 210
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLST 289
++ + SV C ++K + + E K+P + + K F +C L I++ + +
Sbjct: 211 LEVVWRSLASVDVECPLLVKKASDTLAELLKEPKTYDNITKDFNLC---SKLQIQTEMDS 267
Query: 290 A-FVYTAMTDYPTPSNFLNPLPAFP--------VKEMCKAIDDPKTGNDVFAKLYGAASV 340
A F+ T ++ + AF +K +C + D G + +A+ Y A +
Sbjct: 268 AQFLETLAGNFMDVVQYNEDNRAFEGVVGTNVTIKVLCGMMRDGSVG-EPYAR-YAAVAR 325
Query: 341 YYNYSGTAKCFDLNGDS-------------DPHGLSEWGWQACTEMIMLTGGDNKDSIF- 386
+ + + KC D + D+ G +W +Q C E D+ + F
Sbjct: 326 FMLDTLSIKCLDSSFDAYVRDMTNTSWDGPAAGGGRQWVYQTCAEFGFFQSSDSPNQPFT 385
Query: 387 --------EESEEDYDARARYCKEAYGVDPRPNWITTE-FGGHKIGLVLKRFASNIIFFN 437
++ E+ Y+ A EA T E +GG+ I +S I+F N
Sbjct: 386 GFPLMFQVKQCEQFYNISAEMVAEAVAQ-------TNEYYGGYDIR------SSKIVFAN 432
Query: 438 GLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G DPW G+ ++I++ + A+ + AH ++ + ED L R ++ +W+
Sbjct: 433 GDVDPWHALGITQDITRDLPAVFIQGTAHCANMYPARSEDLPQLTLARDHIFLLLQQWLK 492
Query: 498 Q 498
Q
Sbjct: 493 Q 493
>gi|344292452|ref|XP_003417941.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 482
Score = 151 bits (382), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 125/479 (26%), Positives = 210/479 (43%), Gaps = 54/479 (11%)
Query: 48 DSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE- 106
++G + T+Y Q LDHF+ N + + QRYLIND + K P+F+ G +
Sbjct: 30 QTKGTHPTRYFQQKLDHFSKNCS--RLWPQRYLINDAFY---KRGGPVFLLIGGFETLSE 84
Query: 107 -WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
W A N ++ A + AL + +EHR+YG S P G + ++ YLSS QALAD
Sbjct: 85 SWIAINKTWV-TYAERLGALFLLLEHRFYGHSQPTGD------LSTASLQYLSSRQALAD 137
Query: 166 YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
+ + + + T + V+FG SY G LA W R+K+P + A+ SSAPI N
Sbjct: 138 IVNFRTKIAEKMRLTKNKWVLFGCSYAGSLAVWSRIKHPDLFAAAVGSSAPIQAKANF-- 195
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAI-- 283
Y + ++ + + + C++ IK ++ Q+ + K KL F++CK K +
Sbjct: 196 -YEYLEVVQRSLATHNSKCFQTIKEAFDQVAKMMKLSVYYSKLANDFKLCKPAKTYSAMD 254
Query: 284 ESWLSTAFVY---TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV 340
+++ ++ A+ N +F + E+C + + G+ + + +
Sbjct: 255 KAYFLERLIFPVEVAVQHNRNEKNHKGEQLSFTMDELCDIMANTSLGSPYYRYVRIIHLI 314
Query: 341 Y-YNYSGTAKCFDLN-------------GDSDPHGLSEWGWQACTEMIMLTGGDNKDSIF 386
+ + YS CF N +P + ++ +Q+CTE D+K+ F
Sbjct: 315 FKHKYS---PCFAANYRQKLQTLLNSSINHHNPTKVRQYFYQSCTEFGFFFTTDSKNQPF 371
Query: 387 EESEEDYDARARYCKEAYGVDPRPNWITT-------EFGGHKIGLVLKRFASNIIFFNGL 439
Y + C + +G + + T +GG + S IIF NG
Sbjct: 372 TGLPLSYFVQQ--CSDLFGPKFNNDSLNTGVMSTNAYYGGFNVT------GSKIIFPNGS 423
Query: 440 RDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
DPW G+ K+ISK + A+ K H D+ D L R + I+ KW+ Q
Sbjct: 424 FDPWHPLGITKDISKDLPAVFIKGAVHCADIYKQKDTDSAELIQAREKIFRILQKWLKQ 482
>gi|157120503|ref|XP_001653636.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108874911|gb|EAT39136.1| AAEL009038-PA, partial [Aedes aegypti]
Length = 495
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 135/472 (28%), Positives = 219/472 (46%), Gaps = 60/472 (12%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI--EWFAQN 111
+T + Q+LDH +P + T++QRY +ND ++ ++N P+F+ G EG+ W +
Sbjct: 47 QTLWFDQLLDH--NDPTNPTTWKQRYNVNDEYFKNEEDNGPVFLMIGGEGEATARWMTEG 104
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
Y A ++ AL +EHR+YGKS P ++++ KN + YL+S QALAD A I
Sbjct: 105 AWIRY--AQEYGALCFQLEHRFYGKSHP---TEDLSTKNLA---YLTSEQALADLAYFIE 156
Query: 172 DLKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
+K+ S + FGGSY G LAAW R KYP++ G+++SS P+L + +
Sbjct: 157 AMKQKYQLGRSNRWIAFGGSYPGSLAAWLREKYPYLVYGSISSSGPLL---AKIDFKEYF 213
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLST 289
++ S C ++ + Q+E K G L F++C EK++A E LS+
Sbjct: 214 EVVADSLNRYSPECVANVRSAMSQVETLLKHMIGQRNLNDKFKLCDPVEKSIANELDLSS 273
Query: 290 AFVYTAMTDYPTPSNF---LNPLPAFPVKEMCKAIDDPKTG---------NDVFAKLYGA 337
F A SNF ++P + E+C + + G N++ K A
Sbjct: 274 LFEAIA-------SNFAGVVHPHARLTIDEVCDVMMNQTIGAPVSRLAVVNEMVMKQDKA 326
Query: 338 ASVYYNYSGTAKCF-DLNGDSD-PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
+ Y Y T K + + DSD G +W +Q C E DN +S+F + + A
Sbjct: 327 KCLDYVYDKTIKQMQNTSWDSDVASGARQWIYQTCNEFGFYQTSDNAESVFGDR---FPA 383
Query: 396 R--ARYCKEAYGVD------PRPNWIT-TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
R C + YG R + T T +G L +N+++ +G DPW
Sbjct: 384 EFFTRQCADVYGRRFDQKALSRAVYRTNTNYGA------LNPSTTNVLYVHGSIDPWHRL 437
Query: 447 GVLKNISKTVLALVEKEGAHHVDLRFSTKED--PQWLKDVRRREVEIIGKWI 496
G+ ++ + + + +G H + KED PQ LK R + IG+ +
Sbjct: 438 GLTES-NDIQMPTIFIDGTAHCANMYEPKEDDFPQ-LKAARVQISSFIGRLL 487
>gi|123703648|ref|NP_001074031.1| uncharacterized protein LOC556307 precursor [Danio rerio]
gi|120538664|gb|AAI29321.1| Zgc:158605 [Danio rerio]
Length = 488
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 132/477 (27%), Positives = 216/477 (45%), Gaps = 69/477 (14%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTG 113
++ Q LDHFN + ++QRY +ND+ + + P+F+ G EG + W T
Sbjct: 47 QWFIQRLDHFN--GADSRVWKQRYFVNDSFY---RVGGPVFLMIGGEGPANPAWMQYGTW 101
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
Y A K AL + +EHR+YGKS P ++++ +N +LSS QALAD A
Sbjct: 102 LTY--AQKLGALCLLLEHRFYGKSHP---TEDLSTENLR---FLSSRQALADLAHFRTVT 153
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFS 230
T+S V FGGSY G LAAWFRLKYPH+ ++A+SAP+ +NF +
Sbjct: 154 AAARGLTNSKWVAFGGSYPGSLAAWFRLKYPHLVHASVATSAPVHASVNFPE------YL 207
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
++ + + + C ++K + + E P + + K FR+C L I+S + +A
Sbjct: 208 EVVWRSLAAENPECPLLVKKASDTLLERLSDPKTYDNITKDFRLC---SKLQIQSKMDSA 264
Query: 291 FVYTAMTDYPTPSNFLNPLP------AFP--------VKEMCKAIDDPKTGN--DVFA-- 332
++ ++ NF++ + AF +K +C + D G+ D +A
Sbjct: 265 YLLESLA-----GNFMDVVQYNEDNRAFEGAVGTNITIKVLCGVMLDSSLGDPYDRYAAV 319
Query: 333 -----KLYGAASVYYNYSGTAKCFDLNGDSDPH--GLSEWGWQACTEMIMLTGGDNKDSI 385
K + + + Y + + S P G +W +Q CTE D+ +
Sbjct: 320 ARLMQKTFSQSCINTQYKSFIQDISNSSWSGPEAGGGRQWVYQTCTEFGFYQSTDSPNQP 379
Query: 386 FEESEEDYDARARYCKEAY----GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRD 441
F Y + C + Y +D E+GG+ I ++ I+F NG D
Sbjct: 380 FSGFPLGY--HLQQCADIYNLSTSLDEAIQQTNEEYGGYDIK------STRIVFPNGSID 431
Query: 442 PWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
PW GV K+IS + A+ K AH ++ + ED L R R ++ KW+++
Sbjct: 432 PWHALGVTKDISGDLPAVFIKGTAHCANMYPARAEDLPQLGLARDRIFILLQKWLAE 488
>gi|313212780|emb|CBY36704.1| unnamed protein product [Oikopleura dioica]
Length = 395
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 104/344 (30%), Positives = 164/344 (47%), Gaps = 32/344 (9%)
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF 237
T + P+ F + G + L YP++ GALA+S+P+ + + F T+DF
Sbjct: 66 TGNEGPIDSFAANTG----KFNELTYPNLVNGALAASSPVYWISAMGDSHGFWVKTTEDF 121
Query: 238 RSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK--SEKN-LAIESWLSTAFVYT 294
+ + C I+ + +++ K + K R C+ +E N + + W A
Sbjct: 122 STALDKCEDTIRAGFAALDKM-KNDKDWAGITKTMRTCQNITEDNYMHMLGWARNAMATM 180
Query: 295 AMTDYPTPSNFLNPLPAFPVKEMC-KAIDDPKTGNDVFAKLYGAASVYYNYSGTAK---C 350
AM DYP P+NF LP PVK C +A+ +TG D + AA + YN + +K C
Sbjct: 181 AMMDYPYPTNFEAALPGNPVKASCVRAM--AETGADSIRE---AAGLVYNGTDPSKYKQC 235
Query: 351 FDLNGD----SDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
FD+ + SDP G W +Q CT+ ++ G D K +F + D D RA YC
Sbjct: 236 FDIMEEYVYCSDPTGCGTGPEALAWDYQCCTQQVLPGGTDGKTDMFPVIKFDVDDRAAYC 295
Query: 401 KEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
+ +GV P +W+ ++ + SN IF NG DPW GGV ++ + A +
Sbjct: 296 NKTWGVVPDRDWLRIKYWADNL-----EATSNTIFSNGDLDPWGPGGVTHDLRHDLPAPL 350
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLA 504
GAHH DLR S D Q + +VR+ + I W++Q++ + A
Sbjct: 351 VHGGAHHYDLRGSNSGDTQDVLNVRQFHRDTIRDWMAQFYAEQA 394
Score = 53.1 bits (126), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 41/61 (67%), Gaps = 3/61 (4%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y+TK+ Q +D+FNY +T++ RYL+N T + + APIF YTGNEG I+ FA NT
Sbjct: 23 YETKWIDQRVDNFNYYLN--KTYKMRYLVN-TDFVKDEKTAPIFFYTGNEGPIDSFAANT 79
Query: 113 G 113
G
Sbjct: 80 G 80
>gi|410910676|ref|XP_003968816.1| PREDICTED: thymus-specific serine protease-like [Takifugu rubripes]
Length = 509
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 132/483 (27%), Positives = 214/483 (44%), Gaps = 62/483 (12%)
Query: 50 QGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA 109
Q + + K H Q LDHF+ PQ +TF QR+ +N+ G + P+F+Y G EG I F
Sbjct: 53 QHVTQGKIH-QPLDHFH--PQDRRTFPQRFFVNEAFCRGP--DGPVFLYIGGEGPIFEFD 107
Query: 110 QNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL 169
G D+A + ALL+ +EHR+YG SI G K +N LSS QALAD +
Sbjct: 108 VLAGHHVDMAREHGALLLALEHRFYGDSINPDGLKTENLEN------LSSKQALADLVAF 161
Query: 170 IIDLKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVS 225
+ ++ + + FGGSY G L+AWFR ++PH+ GA+ASSAP+ L+F S
Sbjct: 162 HQHISQSFNLSQRNTWISFGGSYSGSLSAWFRGQFPHLVFGAVASSAPVKATLDF----S 217
Query: 226 PYSFSNIITQDFRSV--SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL-- 281
YS + ++ +V S C +K ++ + E A G + ++ F C++ KNL
Sbjct: 218 AYSDTVGLSLANEAVGGSAKCLDAVKEAFAAV-EAALMMGNVSQVASDFGCCQTPKNLDD 276
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP----KTGNDVFAKLYGA 337
IE A ++ Y F++ + ++C+ + + D + +L
Sbjct: 277 QIELMQELAGIFMGAVQYNEEGVFMS------ISDLCEVMTSTNGTCQKAADAYNRLVKL 330
Query: 338 ASVYYNYSGTAKCFDLNG--------DSDPHG----LSEWGWQACTEMIMLTGGDNKDSI 385
A VY + + C D++ D+ PH + +W +Q CTE ++
Sbjct: 331 AQVYRSIT-EEPCLDVSHEKTLKHLMDTSPHAGRRSVRQWTYQTCTEFGFFQTCEDTTCP 389
Query: 386 FEESEEDYDARARYCKEAYGVDPRPN------WITTEFGGHKIGLVLKRFASNIIFFNGL 439
F C +GV RP+ + T +GG +++ NG
Sbjct: 390 F-SGMVTLQFETEVCPTVFGVS-RPSLARQIAFTNTYYGGDSPR------THRVLYINGG 441
Query: 440 RDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
DPW V ++ S + +E + AH D+R D L+ R + +W+
Sbjct: 442 IDPWKELSVTQDRSGDQVVFIE-DTAHCADMRSQRVTDRSSLQTARAEIERHVTEWLKMA 500
Query: 500 FQD 502
Q+
Sbjct: 501 SQE 503
>gi|196014354|ref|XP_002117036.1| hypothetical protein TRIADDRAFT_63413 [Trichoplax adhaerens]
gi|190580258|gb|EDV20342.1| hypothetical protein TRIADDRAFT_63413 [Trichoplax adhaerens]
Length = 439
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 132/454 (29%), Positives = 201/454 (44%), Gaps = 33/454 (7%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHF++ + + QRY +NDT K PIF+ G EG G A
Sbjct: 2 QKLDHFDHTNTA--VWSQRYFVNDTF---HKKGGPIFLMLGGEGPASPVWNVAGAWQIYA 56
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD--YASLIIDLKKNL 177
K A+ + IEHRYYG+S P ++ + YLSS QALAD Y KN+
Sbjct: 57 KKLNAITIQIEHRYYGQSHP------VSDASTPNLKYLSSEQALADAAYFREYFMTSKNM 110
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF 237
+A D+ +VFGGSY G L+AW R KYPH+ ++A+SAPIL V + ++T+
Sbjct: 111 SA-DTKWIVFGGSYSGALSAWLRTKYPHLFHASVATSAPIL---AKVDFEQYLQVVTKSL 166
Query: 238 RSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK--SEKNLAIESWL-STAFVYT 294
++ C K I + I+ K G +KL + F+ CK S+ I +++ S A +
Sbjct: 167 QTAGMACTKNIHNATTIIQGMIKTSAGRKKLSQMFKTCKPLSKDPNDISTFMQSLAGNFE 226
Query: 295 AMTDY-PTPSNFLNPLPAFPVKEMCKAID--DPKTG----NDVFAKLYGAASVYYNYSGT 347
+ Y + F PA + ++CK ++ P G N + K G Y
Sbjct: 227 GIVQYNKDNTGFERHTPATTLTDLCKIMEKNKPLDGYVKVNSLILKQNGQKCNDVVYKDM 286
Query: 348 AKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD 407
K + W +Q CTE D+K F + C + +G
Sbjct: 287 IKQMQQSKFQPGIAGRLWFYQTCTEFGYYQTSDSKKQSFGNMFP-IKYWVQQCADVFGKK 345
Query: 408 PRPNWITTEF---GGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
P++I E + L +K + I+F NG DPW G+LK+ T + K
Sbjct: 346 FSPSYINGEITMTNNYYGALAIK--GTRIVFPNGSIDPWHALGLLKSTDATRPTIFIKGT 403
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
AH ++ T +DP L+ R + + + KW+ +
Sbjct: 404 AHCANMYPPTSKDPAGLRQARTKILGYLTKWLKE 437
>gi|340715980|ref|XP_003396483.1| PREDICTED: putative serine protease K12H4.7-like [Bombus
terrestris]
Length = 493
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 132/501 (26%), Positives = 216/501 (43%), Gaps = 47/501 (9%)
Query: 20 ISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRY 79
I+N+ + TF R L + I S + L ++ TQ LDHFN P + +QQRY
Sbjct: 14 INNSISWKTFMRGRSKYGNLGAPILS--EDHELPSAEWFTQFLDHFN--PTDARVWQQRY 69
Query: 80 LINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKS 137
+N ++ K P+F+ E + +W + Y A +F AL +EHR+YG+S
Sbjct: 70 FVNGEYY---KKGGPVFLMISGEAAANAKWMVEGQWIEY--AKQFGALCFQVEHRFYGQS 124
Query: 138 IPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLA 196
P ++ KN YLSS QALAD A I + N + + FGGSY G LA
Sbjct: 125 HP---TSDLGVKNLM---YLSSQQALADLAYFIQSMNINYKLPAGTKWIAFGGSYAGSLA 178
Query: 197 AWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIE 256
AW R KYPH+ GA+++S P+L + + ++ + SE C I + KQ
Sbjct: 179 AWLRYKYPHLVHGAVSASGPLL---AEIDFQEYFVVVENALKEYSEACVNAILEANKQFH 235
Query: 257 ETAKKPGGLEKLQKAFRIC-----KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPA 311
P G + + K F +C +++N + + A ++ + Y + + +
Sbjct: 236 IMLHHPIGQQGIAKKFILCDPINEHTKRNDISNLYETIASIFAGIVQYNKDNRNNSAMAN 295
Query: 312 FPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGL--------- 362
F + C + + G + +L ++ SG KC D D H L
Sbjct: 296 FTIDSACDILTNETLGIAI-NRLAILSTKILQASGE-KCLDYMYDKMIHKLRNITWASEE 353
Query: 363 ----SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFG 418
+W +Q CTE + +F E+ D + C + +G + + +
Sbjct: 354 AEGGRQWTYQTCTEFGFFQTSTARPKLFSETFP-VDFFVQQCVDIFGPRYNIHLLNSAVN 412
Query: 419 GHKI---GLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTK 475
I GL LK +N++F +G DPW G+ K+ + + + AH ++ +K
Sbjct: 413 RTNILYGGLDLK--TTNVVFVHGSIDPWHVLGITKSPNPQMPVIYIDGTAHCANMYPPSK 470
Query: 476 EDPQWLKDVRRREVEIIGKWI 496
DP LK R +I +W+
Sbjct: 471 NDPLQLKTARVEVGHLIDEWL 491
>gi|380014101|ref|XP_003691081.1| PREDICTED: putative serine protease K12H4.7-like [Apis florea]
Length = 494
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 137/513 (26%), Positives = 227/513 (44%), Gaps = 55/513 (10%)
Query: 12 LLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQS 71
LLF +L I+N+ + F R L + I S ++ L ++ TQ LDHF+ P
Sbjct: 7 LLFFISL-INNSISWQIFMRGRSKHGNLGAPILS--ENYKLPNEQWFTQFLDHFD--PTE 61
Query: 72 YQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI--EWFAQNTGFMYDVAPKFKALLVFI 129
+ +QQRY IN ++ K P+F+ EG +W + Y A +F AL +
Sbjct: 62 ARVWQQRYFINGEYY---KKGGPVFLMISGEGTATAKWMVKGQWIEY--AKQFGALCFQV 116
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI----IDLKKNLTATDSPVV 185
EHR+YGKS P +++ KN YLSS QALAD A I ID K D+ +
Sbjct: 117 EHRFYGKSHP---TSDLSVKNLK---YLSSQQALADLAYFIEIMNIDYK---LPNDTKWI 167
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCY 245
FGGSY G LAAW R KYPH+ GA+++S P+L + Y ++ + SE C
Sbjct: 168 AFGGSYAGSLAAWLRSKYPHLLYGAVSASGPLLAEIDFQEYYV---VVENALKQYSETCV 224
Query: 246 KVIKGSWKQIEETAKKPGGLEKLQKAFRIC------KSEKNLAIESWLSTAFVYTAMTDY 299
I + KQ + P G + + K F +C ++++ + + A + + Y
Sbjct: 225 NTIIEANKQFHIMLRHPIGQQGIAKKFVLCDPIDSGHTKRSDISNLYETIASNFAGIVQY 284
Query: 300 PTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLN----- 354
+ + + ++ C + + G + +L ++ N S KC D
Sbjct: 285 NKDNRNNSAMANLTIESACDILTNDSLGIAI-DRLAILSTKILNAS-KEKCLDYMYSKMI 342
Query: 355 --------GDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
+ + G +W +Q CTE + +F E+ D + C + +G
Sbjct: 343 HKLRNVTWANEEAEGGRQWMYQTCTEFGFFQTSTARPKLFSETFP-IDFFVQQCIDVFG- 400
Query: 407 DPRPNWITTEFGGHKIGLV---LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKE 463
PR N ++ ++ L +N++F +G DPW G+ K+ + + +
Sbjct: 401 -PRYNIHLLNSAINRTNILYGALNLKVTNVVFVHGSIDPWHVLGLTKSSNPQMPVIYIDG 459
Query: 464 GAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
AH ++ S+K+DP LK R + +I +W+
Sbjct: 460 TAHCANMYPSSKDDPPQLKTARVKIGNLISQWL 492
>gi|328866714|gb|EGG15097.1| hypothetical protein DFA_09921 [Dictyostelium fasciculatum]
Length = 495
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 143/490 (29%), Positives = 230/490 (46%), Gaps = 51/490 (10%)
Query: 33 RITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNN 92
+++ EKL L SS ++ ++ TQ +DHF+ Q + FQQRYL+ND W G
Sbjct: 40 QLSDEKL--LPSSDQELISKIDYEWFTQSVDHFDSANQ--KKFQQRYLVNDHFWDGK--- 92
Query: 93 APIFVYTGNEGDIEWFAQNTGFMYDV-APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
P+F+ EG + A TG Y V A + AL+V +EHRYYG S +A +N
Sbjct: 93 GPVFMMINGEGPMSLGAV-TGLQYVVWAKEVHALIVSLEHRYYGASFV---TDNLATENL 148
Query: 152 STTGYLSSTQALADYASLIIDLKKNLTAT--DSPVVVFGGSYGGMLAAWFRLKYPHVAIG 209
YL+ QALAD A + D N + S V FGGSY G L++WFR+KYP++
Sbjct: 149 I---YLTPQQALADNA-VFRDFIANTYSVPQTSKWVSFGGSYSGCLSSWFRIKYPNLVDY 204
Query: 210 ALASSAPILNFDNIVSPYSFSNIITQDFRSVS--ENCYKVIKGSWKQIEETAKKPGGLEK 267
A+ASSAP+ + ++ Y + ++ + S + C IK S ++I++ K+P GL+
Sbjct: 205 AIASSAPV---NPVIDFYQYLEVVQNALLTTSNGQQCVDRIKQSTQKIQDLLKQPNGLKT 261
Query: 268 LQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG 327
+ + F + K S + T M D T N + P V+ +C +++
Sbjct: 262 VSELFSLDPVLKTDDDISNFMQSLAGTFMGD--TQYNLIEG-PFKSVEALCLIMNN--YS 316
Query: 328 NDV---FAKLYGAAS----VYYNYSGTAKCF-DLNGDSDPHGLSEWGWQACTEMIMLTGG 379
ND + +++ A V +Y + + ++ D G +W +Q CT+
Sbjct: 317 NDSLTNYIQIWNNAQKGELVDVSYQSMIQEYANITNDDTNVGGRQWFFQTCTQFGYYQSS 376
Query: 380 DNKDSIFEESEEDYDARARYCKEAYGVD--PRPNWITTEFGGHKIGLVLKRFASNIIFFN 437
+ + F E D + + C + +G P NW E+GG L ASNI++ N
Sbjct: 377 TSNNHPFGHLFE-IDFQIKQCTDIFGFAFLPNVNWTILEYGG------LDPSASNIMYIN 429
Query: 438 GLRDPWSGGGVL--KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKW 495
G DPW G+L K S ++ L+ AH D+ P + ++ II K+
Sbjct: 430 GDIDPWHALGILDPKPASSSIQTLLIHGAAHTADMYIPRLFTPSTIAPAQK----IIFKY 485
Query: 496 ISQYFQDLAQ 505
+S + ++ Q
Sbjct: 486 LSDFLNNVNQ 495
>gi|328776766|ref|XP_395356.3| PREDICTED: putative serine protease K12H4.7-like [Apis mellifera]
Length = 494
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 139/513 (27%), Positives = 227/513 (44%), Gaps = 55/513 (10%)
Query: 12 LLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQS 71
LLF +L I+N+ + F R L + I S ++ L ++ TQ LDHF+ P
Sbjct: 7 LLFFISL-INNSISWQIFMRGRSKYGNLGAPILS--ENYKLPNEQWFTQFLDHFD--PTD 61
Query: 72 YQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGFMYDVAPKFKALLVFI 129
+ +QQRY IN ++ K P+F+ E +W + Y A +F AL +
Sbjct: 62 ARVWQQRYFINGEYY---KKGGPVFLMISGESTATAKWMVKGQWIEY--AKQFGALCFQV 116
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI----IDLKKNLTATDSPVV 185
EHR+YGKS P +++ KN YLSS QALAD A I ID K + D+ +
Sbjct: 117 EHRFYGKSHP---TSDLSVKNLK---YLSSQQALADLAYFIEIMNIDYK---LSNDTKWI 167
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCY 245
FGGSY G LAAW R KYPH+ GA+++S P+L + Y I+ + SE C
Sbjct: 168 AFGGSYAGSLAAWLRSKYPHLLHGAVSASGPLLAEIDFQEYYI---IVENALKQYSEACV 224
Query: 246 KVIKGSWKQIEETAKKPGGLEKLQKAFRICK------SEKNLAIESWLSTAFVYTAMTDY 299
I + KQ + P G + + K F +C +++N + + A + + Y
Sbjct: 225 NTIVEANKQFHIMLRHPIGQQGIVKKFVLCDPIDSGYTKRNDISNLYETLASNFAGIVQY 284
Query: 300 PTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDP 359
+ + + ++ C + + G + +L ++ N S KC D +
Sbjct: 285 NKDNRNNSAMANLTIESACDILTNDSLGIAI-DRLAILSTKILNAS-KEKCLDYTYNKMI 342
Query: 360 HGL-------------SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
H L +W +Q CTE + +F E+ D + C + +G
Sbjct: 343 HKLRNVTWASEEAEGGRQWMYQTCTEFGFFQTSTARPKLFSETFP-IDFFVQQCIDVFG- 400
Query: 407 DPRPNWITTEFGGHKIGLV---LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKE 463
PR N ++ ++ L +N++F +G DPW G+ K+ + + +
Sbjct: 401 -PRYNIHLLNSAINRTNILYGALNLQVTNVVFIHGSIDPWHVLGLTKSSNPQMPVIYING 459
Query: 464 GAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
AH ++ S+K+DP LK R + +I +W+
Sbjct: 460 TAHCANMYPSSKDDPPQLKTARVKIENLISQWL 492
>gi|432891041|ref|XP_004075519.1| PREDICTED: putative serine protease K12H4.7-like [Oryzias latipes]
Length = 489
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 127/477 (26%), Positives = 213/477 (44%), Gaps = 68/477 (14%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQN 111
+ ++ TQ LDHFN + ++QRY +N+ + K P+F+ G EG + W
Sbjct: 45 EEQWFTQKLDHFN--GADSRAWKQRYFLNEAFY---KPGGPVFLMIGGEGPANPAWMKNG 99
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
T +Y A K AL + +EHRYYGKS P +++ N YLSS QALAD A
Sbjct: 100 TWLIY--AEKLGALCLMLEHRYYGKSHP---TLDLSTNNLR---YLSSRQALADLAHFRT 151
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+ + T++ V FGGSY G LAAWFRLKYPH+ ++A+SAP+ V+ +
Sbjct: 152 VMGEAQGLTNNKWVAFGGSYPGSLAAWFRLKYPHLVHASVATSAPV---HATVNFPEYLE 208
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
++ + S + C ++K + + E K P + + K F +C L I++ + +A+
Sbjct: 209 VVWRSLASENAKCPLLVKKASDSLVEKLKDPETFDNITKDFNLC---SKLQIQTEMDSAY 265
Query: 292 VYTAMTDYPTPSNFLNPLP--------------AFPVKEMCKAIDDPKTGNDVFAKLYGA 337
+ NF++ + +K +C + DP G+ A
Sbjct: 266 FLEMLA-----GNFMDVVQYNEDNREFEGVQGGNITIKVLCDVMSDPSLGDAYARYAAVA 320
Query: 338 ASVYYNYSGTAKCFDLNGDS-------------DPHGLSEWGWQACTEMIMLTGGDNKDS 384
+ +S KC D + + +G +W +Q CTE D+ +
Sbjct: 321 RLLMDTFS--LKCLDASFSNYLRDMTNTSWEGPSANGGRQWVYQTCTEFGFYQSTDSPNQ 378
Query: 385 IFEESEEDYDARARYCKEAYGVD----PRPNWITTE-FGGHKIGLVLKRFASNIIFFNGL 439
F DY + + C + Y + + T E +GG+ I +S I+F NG
Sbjct: 379 PFTGFPLDY--QLKQCADFYNISAEQVAQAVAQTNEYYGGYNIK------SSRIVFPNGS 430
Query: 440 RDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
DPW G+ ++I+ + A+ K AH ++ + +D L R + ++ +W+
Sbjct: 431 IDPWHALGITQDITADLPAVFIKGTAHCANMYPARSQDLPQLSLARDHVLLLLQQWL 487
>gi|167518842|ref|XP_001743761.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777723|gb|EDQ91339.1| predicted protein [Monosiga brevicollis MX1]
Length = 448
Score = 149 bits (375), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 133/459 (28%), Positives = 204/459 (44%), Gaps = 48/459 (10%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI--EWFAQNTGFMYD 117
Q LDHFN PQ +TF+Q+Y +N T + K P+F+ G EG EW NT M
Sbjct: 12 QTLDHFN--PQDTRTFKQQYQVNRTFY---KAGGPLFLMLGGEGPASPEWLETNTAIML- 65
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID-LKKN 176
A + A++ IEHR+YG+S P+ ++++ N YLSS QALAD A+ I L+ N
Sbjct: 66 YAQQLNAVVAQIEHRFYGESQPF---EDLSVDNLR---YLSSEQALADAANFIQSFLEMN 119
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
D+ VV FGGSY G L+A+ R KYPHV A+A+S+P+L + V + ++ +
Sbjct: 120 GMPADTRVVSFGGSYSGALSAFLRTKYPHVVYAAVATSSPVLAKLDYV---EYHEVVGRS 176
Query: 237 FRSVS--ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK---NLAIESWLSTAF 291
+ + + C IKG+ +++ +L + +IC +L +++LS
Sbjct: 177 MGTSTHGQACVDQIKGALSKVDAMLADSSQWNQLAQDMKICSDTDLNVDLDKQTFLSNLI 236
Query: 292 VYTAMTDYPTPSNFLNPLPAFP-VKEMCKAIDDPKTGNDVF----AKLYGAASVYYNYSG 346
D N P V+ MC + + D F A L A+ + N G
Sbjct: 237 ---GNIDGIVQYNLDAKFQGRPTVQSMCDIMAPIEQDYDAFVALNAYLLNASDMECN-DG 292
Query: 347 TAKCF--DLNGDSDPH----GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
+ + F DL S G +W +Q C E D D F + C
Sbjct: 293 SYQSFVTDLRNTSLSSDFAGGTRQWIYQTCMEFAYFQTTDASDQPFGVGVP-LSYFEQQC 351
Query: 401 KEAYGVDPRP--NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS-KTVL 457
+ YG+ P P NW +GG ++ + II+ NG DPW V N + + L
Sbjct: 352 VDGYGLPPVPNVNWTNEFYGGQQVA------GTRIIYPNGSIDPWHALSVTSNTTIEDTL 405
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
A+ AH ++ + D L R + + W+
Sbjct: 406 AIFINGTAHCANMYPPSSSDLPGLTAARTSILNTLQTWL 444
>gi|221108130|ref|XP_002169972.1| PREDICTED: putative serine protease K12H4.7-like [Hydra
magnipapillata]
Length = 496
Score = 148 bits (374), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 134/472 (28%), Positives = 206/472 (43%), Gaps = 63/472 (13%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
++ TQ L+HF+ S T++QRY +ND ++ G P+F+ G EG + N G M
Sbjct: 53 QWFTQKLNHFDDADDS--TWKQRYYVNDEYFDG----GPVFLMIGGEGSLSSLWVNVGAM 106
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
D A + AL++ +EHR+YG+S P ++ + YLSS QALAD A ++
Sbjct: 107 VDYAKQHSALILGLEHRFYGESHP------LSDMSTENLKYLSSEQALADLAHFRNEMAL 160
Query: 176 NLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNI--VSPYSF 229
+ D + FGGSY G LAAW R KY H+ GA+ASSAPI LNF VS S
Sbjct: 161 KYSLNDKNRWIAFGGSYPGALAAWLRYKYQHLIYGAIASSAPIYAQLNFPQYLEVSTNSL 220
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAIES 285
S S C + + K +E GL KL K F+ CK +N+ +
Sbjct: 221 S----------SSRCRANVNAATKILESYLTTEEGLMKLSKDFKTCKPITNDLRNIQNFA 270
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPA-FPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNY 344
+ + + F + + +C + D + G D + + S+ N
Sbjct: 271 NNAANNFFGVIQYNKDNREFEGAIGTNITIDVLCGIMTDTQLG-DPYNRYVAVNSLIMN- 328
Query: 345 SGTAKCFDLN-------------GDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEE 391
+ KC D++ G S G +W +Q CTE D+ +F
Sbjct: 329 TYQQKCLDVSYEDYVESMKETDWGSSAGEGGRQWLYQTCTEFGYYQTTDSNKQVFGNMFP 388
Query: 392 DYDARARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS 444
D + C + +G + NW T +GG+K+ A I+F NG DPW
Sbjct: 389 -LDFFLKQCVDIFGDKFNESSISQGINWSNTNYGGYKMN------AKRIVFPNGSIDPWH 441
Query: 445 GGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
KN K ++++ AH ++ S+ +D L R+ ++I KW+
Sbjct: 442 ALSFTKN-EKDMISVFINGTAHCANMYPSSPDDSAELIKARQFIGDLITKWL 492
>gi|405945266|gb|EKC17248.1| Thymus-specific serine protease [Crassostrea gigas]
Length = 519
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 125/450 (27%), Positives = 207/450 (46%), Gaps = 52/450 (11%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q +DHF+ + + + Q+ IN +W K + PIF++ G EG + + +G ++A
Sbjct: 60 QPIDHFD--ALNSEMYNQKVYINTENW--IKPSGPIFLFIGGEGALSNRSAYSGHHVEMA 115
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
++ A++V EHR+YG SI G + +LSS Q LAD + +
Sbjct: 116 KRYGAMVVAAEHRFYGSSINDNG------LHLDQLEHLSSQQGLADLTRVHKYITDRYEL 169
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNIITQD 236
T + + FGGSY G L+AWFRLKYPH+ GA+ASSAP+ NF+ ++ ++ Q
Sbjct: 170 TSNKWISFGGSYPGALSAWFRLKYPHLVYGAVASSAPVQAQTNFE------GYNEVVAQS 223
Query: 237 FRSV----SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK--SEKNLAIESWLSTA 290
S+ C K I ++++I+ + ++ L+K F C SEKN + + A
Sbjct: 224 LTDSTVGGSQQCIKQIVEAFQRIDSMIQANQTVQ-LEKDFLSCGPLSEKNDQMVFVNNLA 282
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI----DDPKTGNDVF---AKLYGAASVYYN 343
++ + Y N +P ++ +CK + D K V+ KL + V +
Sbjct: 283 GIFAGVVQYN------NEVPGLNIQHLCKQMTKSDDSYKNLQMVYKMAMKLLNQSCVDNS 336
Query: 344 YSGTAKCFD---LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
YS F+ ++ + G+ +W WQ C++ D S ++ C
Sbjct: 337 YSNFLSQFNNQTVDQAASGVGIRQWTWQTCSQFGYYQTCDEGTSCPFSRLLTLESNLVIC 396
Query: 401 KEAYGVDPR--PNWI--TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK-NISKT 455
++ Y + P PN++ T E+ G + + ++F NG DPW VLK ++S
Sbjct: 397 RDIYKISPSSVPNFVEFTNEYYGAN-----RPKGTRVLFVNGSIDPWHFLSVLKSDVSLN 451
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
A+ AH D+ DPQ LKD R
Sbjct: 452 ETAVFINGTAHCADMASDRSTDPQSLKDAR 481
>gi|157132186|ref|XP_001662504.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108871255|gb|EAT35480.1| AAEL012351-PA [Aedes aegypti]
Length = 478
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 123/467 (26%), Positives = 216/467 (46%), Gaps = 56/467 (11%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+ K+ Q +D N++PQ+ T+ RY+ N H+ + P+F++ G E +I G
Sbjct: 45 EVKWIMQYVD--NFDPQNPSTWSMRYIQNGEHY---QPGGPLFIFLGGEWEISPGYVMYG 99
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
YD+A + A L + EHRYYG+S P + K +L+ QALAD A + ++
Sbjct: 100 HFYDMAKELGAHLFYTEHRYYGQSRPTASTRSDLLK------FLNIDQALADLAHFVEEM 153
Query: 174 KKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
++ + A +S V++ GGSY + AWFR KYPH+ G ASSAP+L + V + +
Sbjct: 154 RRAIPGAENSKVIMAGGSYSATMVAWFRQKYPHLVDGGWASSAPLLAKLDFV---EYKEV 210
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
+++ R V + C I+ +++QIE+ + +K+++ F++C N+ + L +A
Sbjct: 211 VSESIRLVGGDACADRIERAYEQIEDHLARE-EFDKVREEFKVC---NNINFANSLDSAM 266
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
++++DY + + ++ +C+ I D ND+ A + G +C
Sbjct: 267 FLSSISDY--FAGVVQYHSPGDIEGVCEIIMDDSIENDM------EALANWFIRGVNQCM 318
Query: 352 DLNGDSD---------PHG-----LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARA 397
D+ DS HG + W +Q C E +++ IF S D
Sbjct: 319 DMTYDSTIRYYRSIDWNHGANRGAMRPWLYQTCAEYGWYQTSGSENQIF-GSGFPVDLYV 377
Query: 398 RYCKEAYGVDPRPNWI------TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN 451
R C + Y P + T GH + +N+ F G DPW G+ ++
Sbjct: 378 RMCYDLYDYIFYPARLDANIKRTNTIYGH-----MNPEVTNVFFTQGQLDPWRPMGLQED 432
Query: 452 ISKTVLALVEKEGAHHVDL-RFSTKEDPQWLKDVRRREVEIIGKWIS 497
++ +V +H D+ S ++ P+ L + R E+I +WIS
Sbjct: 433 LNDQSPTVVIPMASHVADMGSISDRDSPEMLA-AKERVFELIKQWIS 478
>gi|237700855|gb|ACR16009.1| carboxypeptidase 3 [Mamestra configurata]
Length = 484
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 121/454 (26%), Positives = 202/454 (44%), Gaps = 42/454 (9%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
LDHF+ PQ+ F R++ N+ +GG + +PIF+ G E DI+ G M+++A +
Sbjct: 55 LDHFD--PQNPTEFLMRFMFNEQFFGG--DGSPIFIMVGGEWDIDHRWLLAGNMFEMARE 110
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATD 181
K V+ EHRYYG G K A A +L+ QALAD A I ++KK +
Sbjct: 111 NKGYQVYTEHRYYG------GTKIFANFTAENLRFLNIDQALADLAYFITEMKKQPRFAE 164
Query: 182 SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS 241
S VV++GGSY + WF+ +YPH+ +G +ASS PIL + P +
Sbjct: 165 SEVVLYGGSYAANMVMWFKKRYPHLVVGTVASSGPILA--KVDFPEYLEVVHEAFMLEGG 222
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-----KSEKNLAIESWLSTAFVYTAM 296
E C I+ ++ + G L++++R+C +E L V+ +
Sbjct: 223 EECIGHIRRGVEETIAAMQTESGRRLLEQSYRLCAPLDYDNENELG---------VFAGL 273
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPKT-GNDVFAKLYGAASVYYNYSGTAKCFDLNG 355
+ ++ P ++ +C++ D + G+ K G ++ S T+ C+ +N
Sbjct: 274 ISWTFSTSVQQARPG-TLQAVCQSFTDHTSYGSTPMEKFGGYIAIIRRLSDTS-CWSINY 331
Query: 356 DSDPHGLSE------WGWQACTEMIMLTGGDNKDSIFEESE-EDYDARARYCKEAYGVDP 408
DS +E W +Q CTE + F+ + D CK + D
Sbjct: 332 DSFLTSYNETTNSRAWYYQTCTEYGFYQTAPRSGTAFDGLKWLSLDFYVDICKRIF--DE 389
Query: 409 RPNWITTEFGGHKIGLV---LKRFASNIIFFNGLRDPWSGGGVLKN-ISKTVLALVEKEG 464
R + E G ++ L+ L+ +N I +G DPW GV KN IS+T
Sbjct: 390 RFDLAFVEDGAERVNLIFGGLEPVVNNTINIHGYIDPWRALGVYKNDISETSPTYTVNRA 449
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+H D++ + D + V++R + W+S+
Sbjct: 450 SHCFDMQGWLQSDTIEMTAVQQRARRTVASWLSK 483
>gi|327290302|ref|XP_003229862.1| PREDICTED: dipeptidyl peptidase 2-like [Anolis carolinensis]
Length = 380
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 91/269 (33%), Positives = 134/269 (49%), Gaps = 30/269 (11%)
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYP 300
S+ C++ + EE +++ EKL E + + AF AM DYP
Sbjct: 116 SDLCFRCSSWCLRPYEEISRRMSTCEKLSS------KEDIYQLFEFSRNAFTMIAMMDYP 169
Query: 301 TPSNFLNPLPAFPVKEMCK---AIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGD- 356
++F+ PA PVK C+ A DP G A L G ++YN +GT CFD+ +
Sbjct: 170 YKTDFMGHFPANPVKVGCELMLANKDPVRG---LAALCG---LFYNSTGTVPCFDIYKEY 223
Query: 357 ---SDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD 407
+DP G W +QACTE+ + +N +F E R YC +GV
Sbjct: 224 RKCADPTGCGTGSDADAWDYQACTEINLTFDSNNVTDMFPEIPFTEADRDSYCFRRWGVH 283
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHH 467
PRP+W+ T F G + ASNI+F NG DPW+GGG+ KN+S +++A+ GAHH
Sbjct: 284 PRPSWLDTSFWGSNLAA-----ASNIVFSNGDLDPWAGGGIRKNLSSSLIAITISGGAHH 338
Query: 468 VDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+DLR S DP +++ R +E +I W+
Sbjct: 339 LDLRASNPADPPSVREARLQEASLILDWV 367
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 41/82 (50%), Positives = 53/82 (64%), Gaps = 2/82 (2%)
Query: 50 QGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA 109
Q ++ KY Q LDHFN+ S +TF QRYLI D W + N PIF YTGNEGD+ F
Sbjct: 27 QTGFEEKYFDQWLDHFNFESYSNRTFPQRYLITDKFW--KRGNRPIFFYTGNEGDVWNFG 84
Query: 110 QNTGFMYDVAPKFKALLVFIEH 131
+N GF+ ++A + AL+VF EH
Sbjct: 85 ENCGFILELAGQQGALVVFAEH 106
>gi|348537748|ref|XP_003456355.1| PREDICTED: putative serine protease K12H4.7-like [Oreochromis
niloticus]
Length = 510
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 137/522 (26%), Positives = 220/522 (42%), Gaps = 75/522 (14%)
Query: 6 IFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHF 65
+FLSFC T+ IF F ++ T + L++ ++ TQ LDHF
Sbjct: 35 VFLSFC-----TVGEGRYSIFRRFKEAQDTKQG------------SLFEEQWFTQKLDHF 77
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMYDVAPKFK 123
N + ++Q Y IN+ + + P+F+ G EG + W T Y A K
Sbjct: 78 N--GADTRVWKQMYFINEAFY---RPGGPVFLMIGGEGPANPAWMEHGTWLTY--AEKLG 130
Query: 124 ALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSP 183
AL + +EHR+YGKS P +++ N +LSS QALAD A + K TD
Sbjct: 131 ALCLMLEHRFYGKSHP---TMDLSTDNLR---FLSSRQALADLAHFRTMIAKARGLTDRK 184
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN 243
V FGGSY G LAAWFRLKYPH+ ++A+SAP+ N + ++ + S +
Sbjct: 185 WVAFGGSYPGSLAAWFRLKYPHLVHASVATSAPVYATVNFP---EYLEVVWRSLASENTE 241
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPS 303
C ++K + + E K P + + K F +C L I++ + +A+ +
Sbjct: 242 CPLLVKKASDDLVERLKDPRTYDNITKDFNLC---SKLQIQTDMDSAYFLETLA-----G 293
Query: 304 NFLNPL--------------PAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAK 349
NF++ + +K +C + D G D +A+ Y A + + + K
Sbjct: 294 NFMDVVQYNEDNRGFEGLTGTNITIKVLCSVMTDSSLG-DPYAR-YAAVARLMMETFSMK 351
Query: 350 CFDLNGD---SDPHGLS----------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR 396
C D + SD S +W +Q C E D+ + F Y +
Sbjct: 352 CLDTSFSKYVSDMTNTSWDGPAAGAGRQWVYQTCAEFGFYQSTDSPNQPFAGFPLVYFLK 411
Query: 397 ARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTV 456
C Y + G R +S I+F NG DPW G+ ++IS +
Sbjct: 412 Q--CTNFYNISAEQVAEAVTQSNEYYGAYNIR-SSRIVFPNGSIDPWHALGITQDISADL 468
Query: 457 LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
A+ K +H ++ + +DP L R ++ +W+ Q
Sbjct: 469 PAIFIKGTSHCANMYPARSDDPPQLTLARDHIFLLLQQWLKQ 510
>gi|241176364|ref|XP_002399549.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215495171|gb|EEC04812.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 317
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 78/167 (46%), Positives = 105/167 (62%), Gaps = 9/167 (5%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF + + +T+Q RY I D W + P+F YTGNE E F + TG +++ AP
Sbjct: 20 VDHFTF--RDNRTYQMRYAIADQFW--DRKGGPVFFYTGNEDPYETFIKETGVIWEWAPD 75
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-AT 180
FKAL+VF EHR+YGKS+P+G + +Y++ GYL+S QALADYA L++ LK L A
Sbjct: 76 FKALIVFAEHRFYGKSLPFG---DESYQSPKNLGYLTSEQALADYAYLVVYLKTTLAGAA 132
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILN-FDNIVSP 226
S V FGGSYGGMLA WFR+KYPH+ L + F ++V P
Sbjct: 133 KSQFVAFGGSYGGMLATWFRIKYPHLIAATLRERPEVRQIFVSLVLP 179
>gi|332245708|ref|XP_003271995.1| PREDICTED: thymus-specific serine protease isoform 1 [Nomascus
leucogenys]
Length = 514
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 121/491 (24%), Positives = 216/491 (43%), Gaps = 72/491 (14%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +ND HW G + PIF++ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVSDR--RSFLQRYWVNDQHWVG--QDGPIFLHLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + +LSS A
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLR------FLSSRHA 156
Query: 163 LADYASLIIDLKKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---IL 218
LAD S + L + ++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L
Sbjct: 157 LADVVSARLALSRLFNVSSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVL 216
Query: 219 NFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLE-KLQKAFR 273
+F +++++++ S S C + ++ ++E + G + L+
Sbjct: 217 DFSE------YNDVVSRSLMSTEIGGSLECRAAVSVAFAEVERRLRSGGAAQAALRTELS 270
Query: 274 IC--------KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP- 324
+C ++E A+++ + Y P V+++C +
Sbjct: 271 VCGPLGRAENQAELLGALQALVGGVVQYDGQAGVP-----------LSVRQLCGLLLGGG 319
Query: 325 --KTGNDVFAKLYGAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQAC 370
++ + + L A + + G KC + ++P G +W +Q C
Sbjct: 320 GNRSHSTPYCGLRRAVKIVLHSLGQ-KCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTC 378
Query: 371 TEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIGLV 425
TE +N F + ++ C++ +G V + +GG G
Sbjct: 379 TEFGFYVTCENPRCPFSQLPA-LPSQLDLCEQVFGLSALSVAQAVAQTNSYYGGQTPG-- 435
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
A+ ++F NG DPW V + + + AL+ + G+H +D+ D L+ R
Sbjct: 436 ----ANQVLFVNGDTDPWHVLSVTQALGSSESALLIRTGSHCLDMAPERPSDSPSLRQGR 491
Query: 486 RREVEIIGKWI 496
+ + W+
Sbjct: 492 QNIFRQLQTWL 502
>gi|194752101|ref|XP_001958361.1| GF23556 [Drosophila ananassae]
gi|190625643|gb|EDV41167.1| GF23556 [Drosophila ananassae]
Length = 508
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 131/488 (26%), Positives = 214/488 (43%), Gaps = 50/488 (10%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P K+++L S +S+ L+ Q LDHF P ++++QRY +N H+ + + API
Sbjct: 42 PSKVATL-QRSLESEDLW----FEQRLDHFK--PSDTRSWKQRYYLNADHYR-NDSTAPI 93
Query: 96 FVYTGNEGDI--EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
F+ G EG+ +W + Y A F AL +EHR+YGKS P G + +
Sbjct: 94 FLMIGGEGEATAKWMREGAWVHY--AEHFGALCFQLEHRFYGKSHPTGD------LSTAN 145
Query: 154 TGYLSSTQALADYASLIIDLKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALA 212
YLSS QALAD A+ + +K +S V FGGSY G LAAW R KYPH+ G+++
Sbjct: 146 LAYLSSEQALADLANFVSAMKVKFNLAESQKWVAFGGSYPGSLAAWAREKYPHLIYGSIS 205
Query: 213 SSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
SS P+L V + ++ S +C + + S+ Q+E K G L + F
Sbjct: 206 SSGPLL---AEVDFKEYFEVVKASLASYKPDCVEAVTRSFAQVEILLKHMIGQRNLDEKF 262
Query: 273 RICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL---------NPLPAFPVKEMCKAIDD 323
+ C K+ AIE+ L A + + NF +P + ++C + +
Sbjct: 263 KTCTPIKD-AIENPLDIANFFENLA-----GNFAGVVQYNKDNSPHATITIDDICDVMLN 316
Query: 324 PKTG---------NDVFAKLYGAASVYYNYSG-TAKCFDLNGDSD-PHGLSEWGWQACTE 372
G N + K + Y Y A +++ DS+ G+ +W +Q C E
Sbjct: 317 TTIGPPVTRLGVVNGMLLKESNTTCLDYKYDKMVADMKNVSWDSETAKGMRQWTYQTCNE 376
Query: 373 MIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFG-GHKIGLVLKRFAS 431
+NK F + D R C + + ++ +K LK +
Sbjct: 377 FGFYQTSENKSDTFGD-RFGVDFFIRQCMDVFSESMDAKYLQQAVAQTNKHYGALKPETT 435
Query: 432 NIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEI 491
N+++ +G DPW G++K+ + + AH ++ K DP L R + +
Sbjct: 436 NVLYVHGSIDPWHALGLVKSANSATPTIYIDGTAHCANMYEPAKSDPPQLVAARNKITKY 495
Query: 492 IGKWISQY 499
+ K + Y
Sbjct: 496 LAKLLEGY 503
>gi|357626528|gb|EHJ76588.1| hypothetical protein KGM_15000 [Danaus plexippus]
Length = 413
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 122/427 (28%), Positives = 193/427 (45%), Gaps = 31/427 (7%)
Query: 78 RYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKS 137
RY N +W + N PIFV+ G E G ++++A + + + EHRYYG+S
Sbjct: 2 RYFENVLYW---QENGPIFVFLGGESASSPQWTRFGIIHELAKESQGAMYVTEHRYYGES 58
Query: 138 IPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAA 197
P KE +K YLSS QALAD A LI LK +S VVV GGSY G LAA
Sbjct: 59 KPKNLTKEDQFK------YLSSRQALADIAKLIHYLKLLPMYKNSKVVVIGGSYAGNLAA 112
Query: 198 WFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV-SENCYKVIKGSWKQIE 256
W ++ YP + A+ASSAP+L + + + +T+D+ + + C IK + +
Sbjct: 113 WMKVLYPDLVDAAVASSAPVLAKKDF---FEYLEKVTEDYETYGTHGCSDKIKNIFDRFH 169
Query: 257 ETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKE 316
+ + G+++L+K IC S IE+ F + + + S + + +K+
Sbjct: 170 QLLQSSEGIKQLKKEENICDSCDMSVIEN--QAVFFEVKTSIFMSNSQYGS---TKTIKQ 224
Query: 317 MCKAIDDPK------TGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQAC 370
C+ + D T N + +Y Y+Y + + D W +Q C
Sbjct: 225 HCEKLSDVSYDTKSLTDNSMLPIIYSEKLNCYDYDFNRMIQVMKSNDDLF----WIYQTC 280
Query: 371 TEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGG-HKIGLVLKRF 429
TE ++K IF+ ++ + C E +G D + +K+ L
Sbjct: 281 TEFGYYQTTNSKAQIFKNIPLEF--YIKICTEMFGNDFNETRVDQAVKNTNKLYGGLNPN 338
Query: 430 ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREV 489
+ ++F NG DPWS GVL+ +S A+V H DL + D + LK+ R+
Sbjct: 339 VTKVVFSNGNLDPWSTIGVLEGLSYDAPAVVIPRSTHCADLLPIFEPDNEELKEARKHIK 398
Query: 490 EIIGKWI 496
+I KWI
Sbjct: 399 YLIKKWI 405
>gi|281202858|gb|EFA77060.1| hypothetical protein PPL_09813 [Polysphondylium pallidum PN500]
Length = 487
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 127/458 (27%), Positives = 199/458 (43%), Gaps = 48/458 (10%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
++ TQ +DHFN Q QTFQQRY++ND +W G N P+F EG + TG
Sbjct: 60 QWFTQRVDHFNQANQ--QTFQQRYIVNDAYWNG---NGPVFFMLNGEGPMS-LGTVTGLQ 113
Query: 116 Y-DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+ + A +F AL+V +EHRY+G S ++++ N YLSS QALAD A+ +
Sbjct: 114 FVNWAQEFGALIVTLEHRYFGASFT---TEDLSTDNLQ---YLSSQQALADNAAFRQFIA 167
Query: 175 KNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
+ L S V FGGSY G L +WFR+KYP + +ASSAP+ + V+ Y + ++
Sbjct: 168 ETLNVPASSQWVSFGGSYSGALTSWFRIKYPALVDYTVASSAPV---NAEVNFYQYLEVV 224
Query: 234 TQDFRSVS--ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK--SEKNLAIESWLST 289
+ S + C I + +I+ + GL + F +C + +N S
Sbjct: 225 QNSLLATSNGQQCIDNIAAATGKIQAMLESADGLASVSNMFNLCPPLASQNDVANFMQSL 284
Query: 290 AFVYTAMTDY------PTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYY 342
A + + Y P+ N +M A DP T V+ G +
Sbjct: 285 AGNFMGVVQYNLEASGPSTQNLC---------DMMTAKGDPLTNYISVWNAFSGDECLDV 335
Query: 343 NYSGT-AKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
+Y + ++ D+ G W + CTE D+ + F + C
Sbjct: 336 SYDTVIEEMLNITNDATTIGGRMWFYMTCTEFGYFQSSDSPNQPFGNLFP-IGFSTQQCN 394
Query: 402 EAYGVD--PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLAL 459
+ +G D P NW T++G L A+NI++ NG DPW G+ N +
Sbjct: 395 DVFGFDFLPNTNWTHTDYGA------LSPVATNILYVNGDIDPWHSLGITTNPPTSPTPS 448
Query: 460 VEKEG-AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ G AH D+ + P L ++ + KW+
Sbjct: 449 LLIHGTAHCADMMIPNQYSPSTLVPAQQIIKSTLQKWL 486
>gi|115623598|ref|XP_783667.2| PREDICTED: putative serine protease K12H4.7-like
[Strongylocentrotus purpuratus]
Length = 492
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 133/485 (27%), Positives = 203/485 (41%), Gaps = 64/485 (13%)
Query: 46 SKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW--GGSKNNAPIFVYTGNEG 103
+K+ L + Q LDHFN +T+ QR+ +N T + GG P+F+ G EG
Sbjct: 40 AKEKYNLPPDMWFKQKLDHFN--DADLRTWNQRFFLNGTFYTPGG-----PVFLMIGGEG 92
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
+ G + A + KA + +EHR+YGKS P + ++ N YLSS QAL
Sbjct: 93 EANPVWMVEGAWMEYAKEMKAFCIMVEHRFYGKSHP---TENMSVDNLQ---YLSSEQAL 146
Query: 164 ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNF 220
AD A + + L + + FGGSY G L+AWFRLKYPH+ +GA+A+SAP+ L+F
Sbjct: 147 ADLAHFRTVIGQQLKFDTNKWISFGGSYPGSLSAWFRLKYPHLVVGAIATSAPVQAQLDF 206
Query: 221 DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN 280
++ S ++ V C I+ + IE G E L K F +C
Sbjct: 207 PEYLTVVRDSLATSR----VGSKCNDAIQAATHDIESLMMHMTGWEALAKIFNLCSPLNG 262
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAF-------PVKEMCKAIDDPKTGNDVFAK 333
++ F T ++ + AF + +C + + G ++
Sbjct: 263 TSMND--VANFYETLAGNFMETVQYNKDNKAFEGGSNNNAIDTLCDIMVNESIGTEL--S 318
Query: 334 LYGAASVYYNYSGTAKCFDLNGDSDPHGLSE-------------WGWQACTEMIMLTGGD 380
Y A S + +C D + D L E W +Q CTE D
Sbjct: 319 RYAAVSALLS---DGECVDASYDKMIADLRETSWKSEASAGGRQWTYQTCTEFGFYQTTD 375
Query: 381 NKDSIFEESEEDYDARARYCKEAYGV-------DPRPNWITTEFGGHKIGLVLKRFASNI 433
F + C++ YG N+ T +GG I A NI
Sbjct: 376 TDQQPFGR-HFPLSLSIQMCEDIYGKQFNKTTNQAGVNFTNTNYGGRDI------VAFNI 428
Query: 434 IFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIG 493
+F NG DPW G+ K+ + A+ K AH ++ ++ +DP L R + + I
Sbjct: 429 VFPNGSIDPWHALGITKS-TDMYSAIFIKGTAHCANMYPASPDDPAELTQARTQIRDTIQ 487
Query: 494 KWISQ 498
KW+SQ
Sbjct: 488 KWLSQ 492
>gi|443699452|gb|ELT98942.1| hypothetical protein CAPTEDRAFT_229193 [Capitella teleta]
Length = 459
Score = 144 bits (364), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 124/458 (27%), Positives = 205/458 (44%), Gaps = 62/458 (13%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTG 113
K+ TQ LDHF + ++T+ QR+ IND H+ K P+F+ G EG + EW Q
Sbjct: 48 KWFTQKLDHFTSS--DHRTWSQRFFINDEHY---KPGGPVFLMIGGEGAANPEWMVQGQ- 101
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
++ + AP+F AL V +EHR+YGKS P K++ + YLSS QALAD A+ +++
Sbjct: 102 WVQNYAPQFNALCVMLEHRFYGKSHP---TKDL---KVESLRYLSSEQALADLAAFRVNI 155
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
++ D+ + FGGSY G L+AWFR KYPH+ +++SSAP+L N P + +
Sbjct: 156 SESRGLADAKWIAFGGSYPGALSAWFRYKYPHLVYASVSSSAPMLAQLNFKGPKKVAG-L 214
Query: 234 TQDFRSVSENCYKVIKGSWKQI----EETAKKPGGLEKLQKAFRICKSE--KNLAIESWL 287
+ FR C + K + + A+ G+ + + R + N+ IE+
Sbjct: 215 EKYFRL----CEPIDASDAKDVSNLHDTIAQSIAGVIQYNRDNRAFEGAVGTNITIETIC 270
Query: 288 STAFVYTAMTDYPTPS-NFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSG 346
S + T P S +N L KE C + KT ++ + + + +
Sbjct: 271 S---IMTTKGSKPFESYAKVNSLLLDTYKEKCLDVSYNKTVQELREESWKSEA------- 320
Query: 347 TAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG- 405
G +W +Q CTE D F + C + YG
Sbjct: 321 ------------SEGGRQWTYQTCTEFGFYQTSDLTTQPFGQ-HFPLKFSTEQCADVYGT 367
Query: 406 ------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLAL 459
+ +W +E+GG+ I + + ++F NG DPW G+ ++++ A+
Sbjct: 368 EFTQTSIQSAVDWTNSEYGGYNITV------TRVVFVNGDIDPWHALGITRDLNAHSPAI 421
Query: 460 VEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ K AH ++ D L R +++ W+S
Sbjct: 422 LIKGTAHCANMYPDAPNDLPQLIRARESVKKLLTLWLS 459
>gi|350406141|ref|XP_003487670.1| PREDICTED: putative serine protease K12H4.7-like [Bombus impatiens]
Length = 493
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 130/502 (25%), Positives = 215/502 (42%), Gaps = 49/502 (9%)
Query: 20 ISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRY 79
I+N+ + TF R L + I S + L ++ TQ LDHFN P + +QQRY
Sbjct: 14 INNSISWETFMRGRSKYGNLGAPILS--EDHELPSAEWFTQFLDHFN--PTDARVWQQRY 69
Query: 80 LINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKS 137
+N ++ K P+F+ E + +W + Y A +F AL +EHR+YG+S
Sbjct: 70 FVNGEYY---KKGGPVFLMISGEAAANAKWMVEGQWIEY--AKQFGALCFQVEHRFYGQS 124
Query: 138 IPYGGNKEIAYKNASTTGYLSSTQALADYASLI--IDLKKNLTATDSPVVVFGGSYGGML 195
P ++ KN YLSS QALAD A I +++ L A + + FGGSY G L
Sbjct: 125 HP---TSDLGVKNLM---YLSSQQALADLAYFIQLMNINYKLPA-GTKWIAFGGSYAGSL 177
Query: 196 AAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQI 255
AAW R KYPH+ GA+++S P+L + + ++ + SE C I + KQ
Sbjct: 178 AAWLRYKYPHLVHGAVSASGPLL---AEIDFQEYFVVVENALKEHSEACVNAILEANKQF 234
Query: 256 EETAKKPGGLEKLQKAFRIC-----KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLP 310
P G + + K F +C +++N + + A ++ + Y + + +
Sbjct: 235 HIMLHHPIGQQGIAKKFILCDPINEHTKRNDISNLYETIASIFAGIVQYNKDNRNNSAMA 294
Query: 311 AFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGL-------- 362
+ C + + G + S + KC D + H L
Sbjct: 295 NLTIDSACDILTNETLG--IAIDRLAILSTKILQASEEKCLDYMYNKMIHKLRNITWASE 352
Query: 363 -----SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEF 417
+W +Q CTE + +F E+ D + C + +G + + +
Sbjct: 353 EAEGGRQWTYQTCTEFGFFQTSTARPKLFSETFP-VDFFVQQCIDIFGPRYNIHLLNSAV 411
Query: 418 GGHKI---GLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFST 474
I GL LK +N++F +G DPW G+ K+ + + A+ AH ++ +
Sbjct: 412 NRTNILYGGLDLK--TTNVVFVHGSIDPWHVLGITKSPNPQMPAIYIDGTAHCANMYPPS 469
Query: 475 KEDPQWLKDVRRREVEIIGKWI 496
K DP LK R +I +W+
Sbjct: 470 KNDPLQLKAARVEVGHLIDEWL 491
>gi|219130123|ref|XP_002185222.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403401|gb|EEC43354.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 283
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 151/294 (51%), Gaps = 47/294 (15%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLI-NDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
+ Q L+HF+ T+QQRY + ND + ++ +APIF+YTGNE +E + +TG +
Sbjct: 5 FFQQALNHFDLPRGQSGTYQQRYCVYND--FMVNETSAPIFLYTGNESPLEQYINHTGLI 62
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
++ A F A +VFIEHRY G+S+P + +S Y S+ QALAD+A + +LK
Sbjct: 63 WESAEAFGAQVVFIEHRYEGQSLP-------SPFISSCMAYSSTIQALADFARFV-ELKL 114
Query: 176 NLTATD------SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
+ D PV+ FGGSYGGML+AW R+KYP+ GA+A SAPI F P +F
Sbjct: 115 FVDTGDFSRLRRRPVIAFGGSYGGMLSAWLRMKYPNTIAGAIAGSAPIWGF-----PRNF 169
Query: 230 SNIITQDFRSVSE----------------NCYKVIKGSWKQIEETAKKPGGLEKLQKAFR 273
+ I +R + +C + +W I A+ GL+ L +FR
Sbjct: 170 PSKIDAAYRVIQHGLQQSYPPTLKPLDNNHCATNLLATWPLISALAQHHEGLQLLTSSFR 229
Query: 274 ICKSEKNL-AIESWLSTAFVYTAMTDYPTPSNFL--------NPLPAFPVKEMC 318
+C+ K+ + W + + A +P S+++ LPA+P++ C
Sbjct: 230 LCEVLKDGDTLIDWAQSPWFDLAEGSFPYSSSYIPFALTHKDAKLPAWPLQAAC 283
>gi|405960530|gb|EKC26449.1| Putative serine protease F56F10.1 [Crassostrea gigas]
Length = 446
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 132/474 (27%), Positives = 207/474 (43%), Gaps = 70/474 (14%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMYD 117
Q L HFNY +T+QQRY +NDT + K N PIF+ G EG + W Q G +
Sbjct: 5 QKLTHFNY--ADTRTWQQRYFVNDTFY---KPNGPIFLMIGGEGTANPAWMLQ--GAWIE 57
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI--IDLKK 175
A + A+ +EHRYYGKS P +++ N +LSS QALAD A I + K
Sbjct: 58 YAKTYHAICFLLEHRYYGKSHP---TPDLSVDNLQ---FLSSEQALADLAYFIQYVKHKY 111
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNI 232
NL + D ++ FGGS L+AWFR+KYPH+ GA+A+SAPI LNF + +
Sbjct: 112 NLMSKDQKLITFGGS----LSAWFRVKYPHLVDGAVATSAPIFAQLNFK------EYLQV 161
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
+ + C K IK + I + G + +K F +C L ++ + +
Sbjct: 162 VVSSLATTGPGCNKNIKMATDTITKMISTDTGRKSAEKMFNLCDP---LDTSQYVDGSNL 218
Query: 293 YTAMT-DYPTPSNFLNPLPAF--------PVKEMCKAIDDPKTGNDVFAKLYGAASVYYN 343
++ + ++ + AF + +C + D G+ V + Y +
Sbjct: 219 FSNLAGNFEGVVQYNKDNRAFEGAIGTNITIDTICGIMTDESNGSPV--ERYAKVNSLML 276
Query: 344 YSGTAKCFD------LNG-------DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESE 390
+ + KC D + G S G +W +Q CTE D D +
Sbjct: 277 STYSQKCLDNSYNKMIKGLQATAWNSSASEGGRQWMYQTCTEFGFFQSSDLGDVQPFGNF 336
Query: 391 EDYDARARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
+ + C + +G + N T +GG+ + A+ I+F NG DPW
Sbjct: 337 FNLKFSIQQCMDVFGAKFNQELIQMGINRTNTNYGGYGMR------ATKIVFPNGSIDPW 390
Query: 444 SGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G K++S A+ + AH ++ +T +D L R ++IG W+S
Sbjct: 391 HFLGFTKDLSMESPAIYIQGTAHCANMYPATSDDLPQLVQARATIEKLIGTWLS 444
>gi|109070008|ref|XP_001094051.1| PREDICTED: thymus-specific serine protease isoform 2 [Macaca
mulatta]
Length = 514
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 121/491 (24%), Positives = 216/491 (43%), Gaps = 72/491 (14%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +N+ HW G + PIF++ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVSDR--RSFLQRYWVNEQHWVG--EDGPIFLHLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + +LSS A
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLR------FLSSRHA 156
Query: 163 LADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---IL 218
LAD S + L + ++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L
Sbjct: 157 LADVVSARLALSRLFNISSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVL 216
Query: 219 NFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLE-KLQKAFR 273
+F +++++++ S S C + ++ ++E + G + L+
Sbjct: 217 DFSE------YNDVVSRSLMSTAIGGSLECRAAVSAAFAEVERRLRLGGAAQAALRSELS 270
Query: 274 IC--------KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP- 324
C ++E A+++ + A Y P V+++C +
Sbjct: 271 ACGTLGSAENQAELLGALQALVGGAVQYDGQAGAP-----------LSVRQLCGLLLGGG 319
Query: 325 --KTGNDVFAKLYGAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQAC 370
++ + + L A + + G KC + ++P G +W +Q C
Sbjct: 320 GNRSHSTPYCGLRRAVQIVLHSLGQ-KCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTC 378
Query: 371 TEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIGLV 425
TE N F + ++ C++ +G V + +GG G
Sbjct: 379 TEFGFYVTCGNPRCPFSQLPA-LPSQLDLCEKVFGLSALSVAQAVAQTNSYYGGQTPG-- 435
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
A+ ++F NG DPW V + + + AL+ + G+H +D+ D L+ R
Sbjct: 436 ----ANQVLFVNGDTDPWHVLSVTQALGSSESALLIRTGSHCLDMAPERPSDSPSLRLGR 491
Query: 486 RREVEIIGKWI 496
+ + + W+
Sbjct: 492 QNIFQQLQTWL 502
>gi|5031993|ref|NP_005856.1| thymus-specific serine protease [Homo sapiens]
gi|13633990|sp|Q9NQE7.2|TSSP_HUMAN RecName: Full=Thymus-specific serine protease; AltName: Full=Serine
protease 16; Flags: Precursor
gi|3510663|gb|AAC33563.1| thymus specific serine peptidase [Homo sapiens]
gi|119623493|gb|EAX03088.1| protease, serine, 16 (thymus), isoform CRA_a [Homo sapiens]
gi|162318714|gb|AAI56929.1| Protease, serine, 16 (thymus) [synthetic construct]
gi|162318946|gb|AAI56192.1| Protease, serine, 16 (thymus) [synthetic construct]
gi|307685491|dbj|BAJ20676.1| protease, serine, 16 [synthetic construct]
Length = 514
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 121/491 (24%), Positives = 216/491 (43%), Gaps = 72/491 (14%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +ND HW G + PIF++ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVSDR--RSFLQRYWVNDQHWVG--QDGPIFLHLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + +LSS A
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLR------FLSSRLA 156
Query: 163 LADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---IL 218
LAD S + L + ++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L
Sbjct: 157 LADVVSARLALSRLFNISSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVL 216
Query: 219 NFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLE-KLQKAFR 273
+F +++++++ S S C + ++ ++E + G + L+
Sbjct: 217 DFSE------YNDVVSRSLMSTAIGGSLECRAAVSVAFAEVERRLRSGGAAQAALRTELS 270
Query: 274 IC--------KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP- 324
C ++E A+++ + Y T P V+++C +
Sbjct: 271 ACGPLGRAENQAELLGALQALVGGVVQYDGQTGAP-----------LSVRQLCGLLLGGG 319
Query: 325 --KTGNDVFAKLYGAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQAC 370
++ + + L A + + G KC + ++P G +W +Q C
Sbjct: 320 GNRSHSTPYCGLRRAVQIVLHSLGQ-KCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTC 378
Query: 371 TEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIGLV 425
TE +N F + ++ C++ +G V + +GG G
Sbjct: 379 TEFGFYVTCENPRCPFSQLPA-LPSQLDLCEQVFGLSALSVAQAVAQTNSYYGGQTPG-- 435
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
A+ ++F NG DPW V + + + L+ + G+H +D+ D L+ R
Sbjct: 436 ----ANKVLFVNGDTDPWHVLSVTQALGSSESTLLIRTGSHCLDMAPERPSDSPSLRLGR 491
Query: 486 RREVEIIGKWI 496
+ + + W+
Sbjct: 492 QNIFQQLQTWL 502
>gi|344292450|ref|XP_003417940.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 484
Score = 143 bits (360), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 120/462 (25%), Positives = 207/462 (44%), Gaps = 44/462 (9%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI--EWFAQNTGFMYD 117
Q LDHF+ N S + + QRY ND + K P+F+ G + W + N ++
Sbjct: 44 QKLDHFSKN--STELWPQRYFFNDAFY---KPGGPVFLLIGGFETVCESWISTNNTWV-S 97
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
A + AL + +EHR+YG S P G + ++ YLSS QALAD + I + + +
Sbjct: 98 YAERLGALFLLLEHRFYGHSQPKGD------LSTASLHYLSSRQALADIVNFRIKVAEKV 151
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF 237
T + V FG SYGG LA W R+K+P + A+ SSAPI N Y + ++ +
Sbjct: 152 GLTKNKWVAFGCSYGGSLAVWSRIKHPDLFAAAVGSSAPIKAKANF---YEYLEVVQRSL 208
Query: 238 RSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAI--ESWLSTAFVY-- 293
+ + C++ ++ ++ Q+ + +L+K F +CK K + +++ V+
Sbjct: 209 ATHNSKCFQAVREAFGQVVKMLSLRKYHRQLRKDFTLCKPMKRFSEMDKAFFIEHLVFPF 268
Query: 294 -TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF- 351
+A+ D S++ +F + E+C + + G+ + + V + +CF
Sbjct: 269 KSAVQDNDNESDYEGIQVSFGMDELCDMMTNTSLGSPYY-RFVRLLHVLFK-DEYLRCFP 326
Query: 352 -------DLNGDS-----DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
++ DS +P ++ +Q CTE D+K+ +F Y +
Sbjct: 327 AQYEKKLEVYLDSSINHHNPTKARQYFYQCCTEFGFFHTTDSKNQLFTGLPLSYFVQQ-- 384
Query: 400 CKEAYGVDPRPNWITTEFGGHKIGLVLKRF---ASNIIFFNGLRDPWSGGGVLKNISKTV 456
C + +G P N+ + G F S IIF NG DPW G+ K+ISK +
Sbjct: 385 CSDFFG--PEFNYDSLNMGVMSTNAYYGGFKVTGSKIIFSNGSFDPWHPLGITKDISKDL 442
Query: 457 LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
A+ K H D+ D L R + +I+ +W+ +
Sbjct: 443 PAVFIKGAVHCADMYQQKDTDSAELIQAREKIFQILQQWLKE 484
>gi|297677423|ref|XP_002816600.1| PREDICTED: thymus-specific serine protease isoform 1 [Pongo abelii]
Length = 514
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 120/491 (24%), Positives = 215/491 (43%), Gaps = 72/491 (14%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +ND HW G + PIF++ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVS--NRRSFLQRYWVNDQHWVG--QDGPIFLHLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + +LSS A
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLR------FLSSRHA 156
Query: 163 LADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---IL 218
LAD S + L + ++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L
Sbjct: 157 LADVVSARLALSRLFNISSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVL 216
Query: 219 NFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLE-KLQKAFR 273
+F +++++++ S S C + ++ ++E + G + L+
Sbjct: 217 DFSE------YNDVVSRSLMSTAIGGSLECRAAVSVAFAEVERRLRSGGAAQAALRTELS 270
Query: 274 IC--------KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP- 324
C ++E A+++ + Y P V+++C +
Sbjct: 271 ACGPLGRAENQAELLGALQALVGGVVQYDGQAGAP-----------LSVRQLCGLLLGGG 319
Query: 325 --KTGNDVFAKLYGAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQAC 370
++ + + L A + + G KC + ++P G +W +Q C
Sbjct: 320 GNRSHSTPYCGLRRAVQIVLHSLGQ-KCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTC 378
Query: 371 TEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIGLV 425
TE +N F + ++ C++ +G V + +GG G
Sbjct: 379 TEFGFYVTCENPRCPFSQLPA-LPSQLDLCEQVFGLSALSVAQAVAQTNSYYGGQTPG-- 435
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
A+ ++F NG DPW V + + + L+ + G+H +D+ D L+ R
Sbjct: 436 ----ANQVLFVNGDTDPWHVLSVTQALGSSESTLLIRTGSHCLDMAPERPSDSPSLRLGR 491
Query: 486 RREVEIIGKWI 496
+ + + W+
Sbjct: 492 QNIFQQLQTWL 502
>gi|195376151|ref|XP_002046860.1| GJ13120 [Drosophila virilis]
gi|194154018|gb|EDW69202.1| GJ13120 [Drosophila virilis]
Length = 513
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 129/489 (26%), Positives = 215/489 (43%), Gaps = 52/489 (10%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P K+++L S D + L+ Q LDH P +T+QQRY +ND + + ++AP+
Sbjct: 45 PSKIATL-QQSMDVEDLW----FEQRLDHLQ--PDDTRTWQQRYFVNDAFYR-NDSHAPV 96
Query: 96 FVYTGNEGDI--EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
F+ G EG+ +W + Y A F AL + +EHR+YGKS P + + S
Sbjct: 97 FLMIGGEGEATKKWMHEGAWVRY--AEHFGALCIQLEHRFYGKSHP------TSDLSTSN 148
Query: 154 TGYLSSTQALADYASLIIDLKK--NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGAL 211
YLSS QALAD A+ + +K N+ A + FGGSY G LAAW R KYPH+ G++
Sbjct: 149 LAYLSSEQALADLANFVTTMKTKYNMDAKQK-WIAFGGSYPGSLAAWAREKYPHLIDGSI 207
Query: 212 ASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA 271
+SS P+L V + ++ S C + + Q+E K G L +
Sbjct: 208 SSSGPLL---AQVDFSQYFEVVKSSLASYKPECVEAVTRGIAQVEILLKHMIGQRNLDEK 264
Query: 272 FRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL---------NPLPAFPVKEMCKAID 322
F+ C K+ +IE+ L + ++ + NF +P + E+C +
Sbjct: 265 FKTCTPLKD-SIENQLDISNLFENIA-----GNFAGVVQYNKDNSPHANITIDEICDVML 318
Query: 323 DPKTG---------NDVFAKLYGAASVYYNYSGT-AKCFDLNGDSD-PHGLSEWGWQACT 371
+ G ND+ K + + Y Y A +++ DS+ G+ +W +Q CT
Sbjct: 319 NTTMGPPVTRLAAVNDMLLKQSESKCLDYKYEKMIADMKNVSWDSEAAKGMRQWTYQTCT 378
Query: 372 EMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFG-GHKIGLVLKRFA 430
E +NK F + D R C + + ++ +K LK
Sbjct: 379 EFGFYQTSENKSDTFGD-RFGVDFFIRQCMDIFSERMDGKFLEQAVAQTNKYYGALKPAT 437
Query: 431 SNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVE 490
+ +++ +G DPW G+ + + + + AH ++ DP LK R + ++
Sbjct: 438 TQVLYVHGSIDPWHALGLYVSPNANTPTIYIEGTAHCANMYEPVNSDPPQLKAARNKILK 497
Query: 491 IIGKWISQY 499
+ K + Y
Sbjct: 498 YLAKLLDGY 506
>gi|256574599|dbj|BAH98105.1| serine protease [Entamoeba invadens]
gi|440300845|gb|ELP93292.1| hypothetical protein EIN_056770 [Entamoeba invadens IP1]
Length = 468
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 144/504 (28%), Positives = 211/504 (41%), Gaps = 83/504 (16%)
Query: 29 FPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGG 88
F R + E+L + D+ GLY Q LDHF+ + Q Y +NDT++
Sbjct: 15 FQRVRFSKEELEA------DNTGLY----MDQPLDHFDLTNTKKISIQ--YFLNDTYFTP 62
Query: 89 SKNNAPIFVYTGNEGDIEWFAQNTGFMYD-VAPKFKALLVFIEHRYYGKSIPYGGNKEIA 147
P+FV G EG A F+ D A K+K +++ IEHR+YG+S+P GG +
Sbjct: 63 E---GPLFVDLGGEGAASAGAIGGKFVIDKYAQKYKGMMLAIEHRFYGRSLPVGGLSQ-- 117
Query: 148 YKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVA 207
GYLS QAL DY +I ++KK T PV+VFGGSY G LA W R KYP+V
Sbjct: 118 ----ENLGYLSGIQALEDYIHIISEIKKQNQIT-GPVIVFGGSYSGNLAVWIRQKYPNVV 172
Query: 208 IGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEK 267
A+ASSAP+L + F ++I +D + C K + IE+ K G+++
Sbjct: 173 YAAVASSAPLLATNQFTQ---FMDVIEKD---MGPQCAAAWKQANANIEQLYKTADGIKQ 226
Query: 268 LQKAFRICKSEKN-----LAIESWLSTAFVY-----------------TAMTDYPTPSNF 305
+Q F+ CK KN L ++ + Y +T TP N
Sbjct: 227 IQTDFKTCKDIKNDKDFTLFLQEIQANFISYPQYNNKKEKGKKCEDVCNILTGEDTPYNG 286
Query: 306 LNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEW 365
+ L F + +M K P + + + ++ + T FD+ + W
Sbjct: 287 MKKLVEFMLNDM-KLTCSPSSYDQMLIEM----------AKTEGEFDVTKPNSFASTRSW 335
Query: 366 GWQACTEMIMLTGGDNKDSIFEESEED--YDARARY------CKEAYGVDPRPNWITTEF 417
WQ C+E S F+ E +D R CK+ +GV +
Sbjct: 336 AWQICSEY----------SFFQPITETQPFDQRLNNDFYYANCKDIFGVSKEKLDKKIKH 385
Query: 418 GGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK-EGAHHV-DLRFSTK 475
G + R +N+ F +G DPWS + V EG H DL T+
Sbjct: 386 TNMMYGAMSPR-VTNVAFTSGSFDPWSPLAKHETQYNDVDCYASYIEGTSHCADLYSETE 444
Query: 476 EDPQWLKDVRRREVEIIGKWISQY 499
EDP L R I + I +Y
Sbjct: 445 EDPVQLNTQRTETAAFIDELIKRY 468
>gi|281210278|gb|EFA84445.1| hypothetical protein PPL_02477 [Polysphondylium pallidum PN500]
Length = 761
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 127/445 (28%), Positives = 198/445 (44%), Gaps = 58/445 (13%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
++ TQ LDHF+ Q+ +TFQQRYLIND +W G P+F+ EG + G
Sbjct: 60 QWFTQELDHFDQ--QNTKTFQQRYLINDQYWDGK---GPVFIMINGEGPMT-IGTVLGLK 113
Query: 116 Y-DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL- 173
Y D A +F AL+V +EHRYYG S +I+ +N YLSS QA + LI+ +
Sbjct: 114 YIDWAKQFNALVVALEHRYYGASF---ATPDISTENLQ---YLSSDQASKNIQRLILIIS 167
Query: 174 ---------------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL 218
K+ + S V FGGSY G L +WFRLKYP++ ++SSAP+L
Sbjct: 168 FFRLADNAVFRQFIAKQYNVTSSSKWVSFGGSYSGALTSWFRLKYPNLVDFTISSSAPVL 227
Query: 219 NFDNIVSPYSFSNIITQDFRSVS--ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
V Y + ++ + S + C I + ++I+ + GL+K+ F +C
Sbjct: 228 AE---VDFYQYLEVVQNSLLTTSKGQECVNNIASATQKIQTLLQTSDGLQKVSDLFDLCP 284
Query: 277 SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV---FAK 333
LA + +ST A T L A + +C+ + +P ND + K
Sbjct: 285 P---LATQDDISTFMQSLAGNFMGTVQYNLEAPGAATITNLCEIMTNPD--NDPITNYVK 339
Query: 334 LYGAASVYYNYSGTAKCFDL----NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEES 389
++ + + DL D+ G W +Q CTE D+ F
Sbjct: 340 VWQGFTDGCTDTSYETMIDLMKNNTNDASVEGGKMWFYQTCTEFGYYQSSDSTKQPFGNL 399
Query: 390 EEDYDARARYCKEAYGVDPRPN--WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
+ + C+E +G + PN W T++GG + A NI++ NG DPW G
Sbjct: 400 IP-IEYLTKQCQEVFGFNFTPNVEWTITKYGG------INPDADNILYVNGDIDPWHALG 452
Query: 448 VLKNISK---TVLALVEKEGAHHVD 469
+ I++ + +++ G H D
Sbjct: 453 ITTPITRKSPSSSSILLIHGTSHCD 477
>gi|29840883|gb|AAP05884.1| similar to NM_066318 prolylcarboxypeptidase (angiotensinase C) in
Homo sapiens [Schistosoma japonicum]
Length = 184
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 77/161 (47%), Positives = 106/161 (65%), Gaps = 9/161 (5%)
Query: 46 SKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
+KDSQ Y+TKY +DHF++ F+ +YLIN+ + + PI YTGNEG I
Sbjct: 27 NKDSQFKYETKYFRTKIDHFSFVTDG--EFEIKYLINNESF---SSGGPILFYTGNEGAI 81
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
E FA+N+GF++ +A + A +VF EHRYYG S+P+G + ++K+ GYL++ QALAD
Sbjct: 82 ETFAENSGFIWKLAEELNASVVFAEHRYYGTSLPFGND---SFKDRQHFGYLTAEQALAD 138
Query: 166 YASLIIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPH 205
Y LI LK N + SPV+ FGGSYGGML+AW R KYP+
Sbjct: 139 YVLLINQLKINYSCFASSPVISFGGSYGGMLSAWIRQKYPN 179
>gi|307196629|gb|EFN78126.1| Putative serine protease K12H4.7 [Harpegnathos saltator]
Length = 433
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 121/449 (26%), Positives = 201/449 (44%), Gaps = 44/449 (9%)
Query: 73 QTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE--WFAQNTGFMYDVAPKFKALLVFIE 130
+ ++QRY +N ++ K N PIF+ G EG+I+ W + G + A + A+ ++E
Sbjct: 2 RVWKQRYFVNSDYY---KLNGPIFLMIGAEGEIKPKWLVE--GLWIEYAKELGAMCFYVE 56
Query: 131 HRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGG 189
HRYYGKS P +++ KN YLSS ALAD A I + D+ +VFGG
Sbjct: 57 HRYYGKSHP---TVDLSVKNLM---YLSSELALADLAYFIESVNIGYKFPNDTKWIVFGG 110
Query: 190 SYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIK 249
SYGG LAAW RLKYPH GA+++S P+L + Y ++ + S+ C +
Sbjct: 111 SYGGSLAAWMRLKYPHFVHGAVSASGPLLALIDFQEYYV---VVEDALKQHSQQCVDTVA 167
Query: 250 GSWKQIEETAKKPGGLEKLQKAFRICK-----SEKNLAIES-WLSTAFVYTAMTDYPTPS 303
+ K+ G +++++ FR+C K + I + + S A + + Y +
Sbjct: 168 DANKEFHIMLHHLIGQKQIEEKFRLCDPIDPGHTKTIDISNLYESLASNFADIVQYNKDN 227
Query: 304 NFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGL- 362
+ V +C + + + G V +L S+ N + KC D D+ H L
Sbjct: 228 RQESKTANINVDTICDVLTNDELGRPV-DRLAYINSMILN-ATKEKCLDYRYDNMIHSLR 285
Query: 363 ------------SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
+W +Q CTE + +F E+ D + C + +G PR
Sbjct: 286 NITWASEQAEGGRQWMYQTCTEFGFFQTSTARPKLFSETFP-VDFYVQQCVDIFG--PRY 342
Query: 411 NWITTEFGGHKIGLV---LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHH 467
N + + ++ L +N++ +G DPW G+ K+ + A+ AH
Sbjct: 343 NLDMLKSAVTRTNILYGALNLQVTNVVHVHGSVDPWHVLGITKSSNPQAPAIYIDGTAHC 402
Query: 468 VDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
L S+++DP LK R +I +W+
Sbjct: 403 AILYPSSEKDPPQLKQARIVVKGLIKQWL 431
>gi|426351972|ref|XP_004043496.1| PREDICTED: thymus-specific serine protease [Gorilla gorilla
gorilla]
Length = 514
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 120/491 (24%), Positives = 215/491 (43%), Gaps = 72/491 (14%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +ND HW G + PIF++ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVSDR--RSFLQRYWVNDQHWVG--QDGPIFLHLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + +LSS A
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLR------FLSSRLA 156
Query: 163 LADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---IL 218
LAD S + L + ++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L
Sbjct: 157 LADVVSARLALSRLFNISSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVL 216
Query: 219 NFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLE-KLQKAFR 273
+F +++++++ S S C + ++ ++E + G + L+
Sbjct: 217 DFSE------YNDVVSRSLMSTAIGGSLECRAAVSVAFAEVERRLRSGGAAQAALRTELS 270
Query: 274 IC--------KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP- 324
C ++E A+++ + Y P V+++C +
Sbjct: 271 ACGPLGRAENQAELLGALQALVGGVVQYDGQAGAP-----------LSVRQLCGLLLGGG 319
Query: 325 --KTGNDVFAKLYGAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQAC 370
++ + + L A + + G KC + ++P G +W +Q C
Sbjct: 320 GNRSHSTPYCGLRRAVQIVLHSLGQ-KCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTC 378
Query: 371 TEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIGLV 425
TE +N F + ++ C++ +G V + +GG G
Sbjct: 379 TEFGFYVTCENPRCPFSQLPA-LPSQLDLCEQVFGLSALSVAQAVAQTNSYYGGQTPG-- 435
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
A+ ++F NG DPW V + + + L+ + G+H +D+ D L+ R
Sbjct: 436 ----ANQVLFVNGDTDPWHVLSVTQALGSSESTLLIRTGSHCLDMAPERPSDSPSLRLGR 491
Query: 486 RREVEIIGKWI 496
+ + + W+
Sbjct: 492 QNIFQQLQTWL 502
>gi|440796326|gb|ELR17435.1| Serine protease precursor, putative [Acanthamoeba castellanii str.
Neff]
Length = 509
Score = 142 bits (357), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 138/475 (29%), Positives = 219/475 (46%), Gaps = 60/475 (12%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
++ TQ LDHF+ P++ +QQRY NDT + + P+F+ G EG F+
Sbjct: 60 QWFTQALDHFD--PRNSAKWQQRYFTNDTFY---RPGGPVFLMLGGEGPASPIDVGGHFI 114
Query: 116 Y-DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+ A +F AL++ IEHR+YGKS+P ++++ N +L+S QALAD+A +
Sbjct: 115 LNEYAQRFNALVLSIEHRFYGKSVP---TRDLSNANLR---FLNSEQALADFAMFRQYIS 168
Query: 175 KNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFS 230
+ L + V FGGSY G L+AWFRLKYPH+ G+LA+SAP+ L+F ++
Sbjct: 169 EKLALPKTTKWVAFGGSYSGALSAWFRLKYPHLVDGSLATSAPVKAQLDFS------EYN 222
Query: 231 NIITQDFR-SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNLAI--E 284
++ + V E+C ++ + G ++LQK F +C +++ ++A+ E
Sbjct: 223 EVVQRSLEFFVGESCANRVREGTQAATNLLSSADGKKQLQKLFNLCTPIETDDDIALFFE 282
Query: 285 SWLST-AFVYTAMTDYPTPSNFLNPLPAFPV-----KEMCKAI---DDPKTG----NDVF 331
+ T A + D ++ +N + K MC DD N V+
Sbjct: 283 NMEGTVAQIVQYNNDNNNYNHGMNINKVIIIMLHWLKRMCDIFLKGDDALAAYAEFNTVY 342
Query: 332 AKLYGA---ASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEE 388
KL+G + Y Y K ++D W +Q C E G + F +
Sbjct: 343 NKLFGVDCTQTKYTEYVNQLKDVRTFPENDNAAGRSWTYQTCIEFGFYQTGSAANQPFSK 402
Query: 389 SEE-DYDARARYCKEAYG-VDPRPN--WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS 444
+ D+D + + + +P PN W T +G GL + +I NG DPW
Sbjct: 403 TVTLDWDI---FNIDPFNKAEPLPNIEWTNTFYG--STGLADPK----VILPNGSIDPWH 453
Query: 445 GGGVLKNISKT---VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
GVL + LA++ AH DL S+ +DP LKD R + V I +I
Sbjct: 454 ILGVLPETAVAHPGQLAVLINGTAHCADLYPSSADDPLSLKDARAKIVAAIASFI 508
>gi|328875430|gb|EGG23794.1| hypothetical protein DFA_05930 [Dictyostelium fasciculatum]
Length = 550
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 118/411 (28%), Positives = 192/411 (46%), Gaps = 50/411 (12%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
++ TQ LDHFN Q QTFQQRY+IND +W G P+F+ EG + A TG
Sbjct: 117 QWFTQRLDHFNTINQ--QTFQQRYVINDQYWNGK---GPVFIMINGEGPMS-LATVTGLQ 170
Query: 116 Y-DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL--IID 172
+ + A + AL++ +EHRYYG S +++ N + YL+ QALAD A+ +
Sbjct: 171 FVNWAQQSNALIISLEHRYYGASF---ATDDLSTDNLA---YLTPQQALADNAAFREFVA 224
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+ N+ AT S V FGGSY G L +WFR+KYP++ +ASS P+ + V+ Y + +
Sbjct: 225 VTFNVPAT-SKWVSFGGSYSGCLTSWFRIKYPNLVDYTVASSGPV---NAEVNFYQYLEV 280
Query: 233 ITQDFRSVS--ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI---CKSEKNLAIESWL 287
+ + + + C I + ++I+ +P GLE + F + +S+ ++A
Sbjct: 281 VQNSLLTATNGQQCVSNIAQATQKIQALLSQPNGLETVSDMFNLFPALESQNDVA-NFMQ 339
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGT 347
S A + + Y N P+ V+ +C+ + D + ND + Y
Sbjct: 340 SLAGNFMGVVQY----NLEEVGPS--VETLCQTMTD--SSNDALTNYIAIWNQYAQGETL 391
Query: 348 AKCFD--------LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
+D + D + G +W +Q C + D+ + F + + + +
Sbjct: 392 DVSYDTMISELTNVTNDQNIVGGRQWFFQTCAQFGFYQTSDSPNQPF-GNLFPLEFQIQQ 450
Query: 400 CKEAYGVD--PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV 448
C + +G D P NW +FGG L SN+I+ NG DPW G+
Sbjct: 451 CSDVFGFDFLPNVNWTLLDFGG------LNPVTSNVIYVNGDIDPWHSLGI 495
>gi|193662244|ref|XP_001949662.1| PREDICTED: putative serine protease K12H4.7-like [Acyrthosiphon
pisum]
Length = 509
Score = 141 bits (355), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 125/468 (26%), Positives = 210/468 (44%), Gaps = 46/468 (9%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
K+ Q LDHFN P +T++QRY +N ++ K + P+F+ G EG I +G
Sbjct: 45 KWFLQKLDHFN--PTDNRTWKQRYQVNQKYY---KKDGPVFLMIGGEGPISAKWMYSGAW 99
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
D A +F AL + +EHRYYGKS P ++++ KN YLSS QAL D A I++++
Sbjct: 100 IDYAKEFNALCLQLEHRYYGKSHP---TEDMSTKNLV---YLSSEQALTDLAEFIVNIRT 153
Query: 176 NL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
N T + V FGGSY G LAAW R+K+PH+ A++SS P+L + + ++
Sbjct: 154 NYDIPTTAKWVAFGGSYPGSLAAWLRMKFPHLVYAAVSSSGPLL---AKIDFKEYFKVVE 210
Query: 235 QDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAIESWLSTA 290
+ S +C IK + + I+ K G + ++ F++C + KN + + A
Sbjct: 211 NALATYSPDCVSQIKEANQMIDSQIKTIKGAKLIENKFKLCDPLDINTKNDVANLFETLA 270
Query: 291 FVYTAMTDYPTPSNFLNPLPA--FPVKEMCKA-IDDPKTG-----NDVFAKLYGAASVYY 342
+ + Y + F ++ +C ++ KT DV +KL ++
Sbjct: 271 GNFADIVQYNKDNRFYENFERSLVTLETLCDVMVNKSKTTPLDRYADVNSKLLSINNLTC 330
Query: 343 NYSGTAKCFD------LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEE---SEEDY 393
K D N DS G +W +Q CTE +D F + ++
Sbjct: 331 TQHVYTKMIDSYLNTSWNSDSAAGG-RQWTYQTCTEFGFYQTSSQEDHAFGDKFPAKFFI 389
Query: 394 DARARYCKEAYGVDPRPNWITTE---FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
D + + Y +D N I +G I + +I+ +G DPW G+
Sbjct: 390 DMCSDIFGKLYNLDLLSNAIKRSNMMYGELNIK------ENRVIYVHGSVDPWHALGITH 443
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+K +A+ + AH ++ + D LK+ R + +W+++
Sbjct: 444 TKTKNNVAIYIEGTAHCANMYPPSPTDLPQLKNARETIRAFLSEWLTE 491
>gi|344268141|ref|XP_003405921.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 429
Score = 141 bits (355), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 126/446 (28%), Positives = 197/446 (44%), Gaps = 46/446 (10%)
Query: 78 RYLINDTHWGGSKNNAPIFVYTGN-EGDIE-WFAQNTGFMYDVAPKFKALLVFIEHRYYG 135
RY IND + K P+F+ G E E W + N ++ A + AL + +EHR+YG
Sbjct: 5 RYFINDAFY---KPGGPVFLNIGGPETACESWISMNNTWV-TYAERLGALFLLLEHRFYG 60
Query: 136 KSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGML 195
S P G + ++ YLSS QALAD + + K + T++ V FG YGG L
Sbjct: 61 HSQPTGD------LSTASLRYLSSRQALADIVNFRTKIAKKMGLTENKWVAFGCWYGGFL 114
Query: 196 AAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQI 255
AAW R+KYP + A+ SSAP+ N Y + ++ + + + C++ +K ++KQ+
Sbjct: 115 AAWSRIKYPELFAAAVGSSAPMQAKANF---YEYLEVVQRSLATHNSECFQTVKEAFKQV 171
Query: 256 EETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF--------VYTAMTDYPTPSNFLN 307
+ K P KL+ F +CK L + S + AF V TA+ N+
Sbjct: 172 VKMMKLPEFYSKLEDDFTLCKP---LKLYSAMDKAFFLERLIFPVKTAVQLNKNKKNYKG 228
Query: 308 PLPAFPVKEMCKAIDDPKTGND----------VFAKLYGAASVYYNYSGTAKCF-DLNGD 356
+ ++C + + G+ +F Y + + NY K F D + D
Sbjct: 229 EQVFISMDDLCDMMANTSLGSPYHRYVRIIHLLFETEY-SPCLAANYKNKLKAFLDPSID 287
Query: 357 S-DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITT 415
+P + +Q CTE D+K+ F Y + C + + DP+ N+ +
Sbjct: 288 HYNPPTDRQQFYQFCTEFGFFQTTDSKNQPFTGLPLSYFVQQ--CSDFF--DPKFNYDSL 343
Query: 416 EFGGHKIGLVLKRF---ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRF 472
+ G F S IIF NG DPW G+ K+I+K + A+ K H DL
Sbjct: 344 KKGVKSTNAYYSGFKVTGSKIIFPNGSFDPWHVLGIPKDITKDLPAVFIKGAGHCADLYK 403
Query: 473 STKEDPQWLKDVRRREVEIIGKWISQ 498
D L R R +I+ KW+ Q
Sbjct: 404 QKDIDSTELIQARERIFQILQKWLKQ 429
>gi|356559327|ref|XP_003547951.1| PREDICTED: probable serine protease EDA2-like [Glycine max]
Length = 490
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 149/511 (29%), Positives = 215/511 (42%), Gaps = 90/511 (17%)
Query: 26 FPTFPSSRITPEKLSSLISSSKDSQGLYKT---KYHTQILDHFNYNPQSYQTFQQRYLIN 82
FP + P L + K SQG Y T ++ Q LDHF+ P + F+QRY
Sbjct: 24 FPALSYGVVPPRTLLN-----KLSQGSYLTTQEQWFNQTLDHFS--PYDHHQFRQRYFEF 76
Query: 83 DTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGG 142
++ + PIF+ G EG + ++ +A KF A +V +EHRYYGKS P+
Sbjct: 77 LDYF--RIPDGPIFLVIGGEGPCNGITND--YIGVLAKKFGAAMVTLEHRYYGKSSPFNS 132
Query: 143 NKEIAYKNASTTGYLSSTQALADYASL------IIDLKKNLTATDSPVVVFGGSYGGMLA 196
+ K YLSS QAL D A ++ K N T T++P VFGGSY G L+
Sbjct: 133 LETENLK------YLSSKQALFDLAVFRQYYQDSLNAKLNRTKTENPWFVFGGSYAGALS 186
Query: 197 AWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIE 256
AWFRLK+PH+ G+LASSA +L + Y+F+ Q S C V++ + + IE
Sbjct: 187 AWFRLKFPHLTCGSLASSAVVL------AVYNFTEYDQQIGESAGAECKAVLQETTQLIE 240
Query: 257 ETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKE 316
G ++L+ +F EK+ F+Y + NP +
Sbjct: 241 HKLATNG--KELKASFNADDLEKD--------GDFMYLIADAAAVAFQYGNP------DK 284
Query: 317 MCKAIDDPKTGNDVFAKLYGAASVYYNYSGT----AKCFD--------LNGDSDPHGLSE 364
+CK + + K + Y A V Y GT + +D +N DS
Sbjct: 285 VCKPMVEAKNAGEDLVDAY-AKYVKEYYIGTFGVNVQTYDQEYLKKTAINEDSSTR---L 340
Query: 365 WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE--FGGHKI 422
W +Q CTE+ + DSI SE D CK +G P+ T +GG KI
Sbjct: 341 WWFQVCTEVAFFQVAPSNDSI-RSSEIDAKYHMDLCKNIFGEGIFPDVDATNLYYGGTKI 399
Query: 423 GLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL---ALVEKEGAHHVDLRFSTK---- 475
S I+F NG +DPW K S L + AH D R +
Sbjct: 400 A------GSKIVFANGSQDPWRHAS--KQTSSPDLPSYTITCSNCAHCTDFRGCPQSPLV 451
Query: 476 --------EDPQWLKDVRRREVEIIGKWISQ 498
P + VR++ E + W+S+
Sbjct: 452 LEGNEKNCSSPDAVHKVRQQITEHMDLWLSE 482
>gi|330840912|ref|XP_003292451.1| hypothetical protein DICPUDRAFT_83068 [Dictyostelium purpureum]
gi|325077291|gb|EGC31013.1| hypothetical protein DICPUDRAFT_83068 [Dictyostelium purpureum]
Length = 457
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 126/459 (27%), Positives = 211/459 (45%), Gaps = 49/459 (10%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
Y + +Q LDHF+ ++ ++F QRY IND + + N PI +Y EG + T
Sbjct: 34 YTPFWFSQTLDHFD--DENTESFSQRYFINDQFYDYT-NGGPIILYINGEGPVSSAPCQT 90
Query: 113 G---FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL 169
G +Y A A++V +EHRYYG+S P+ +++ +N YLSS QAL D A
Sbjct: 91 GDGVVLYGQA--LNAMIVTLEHRYYGESTPF---QDLTTENLK---YLSSEQALNDLAIF 142
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
++ + L+ +V GGSY G L+AWFR+KYPH+ G++ASS + + I+ +F
Sbjct: 143 VVWFQSQLS-NAGKIVTIGGSYSGALSAWFRIKYPHITSGSIASSGVV---NAILQFTTF 198
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLE-KLQKAFRICKSEKNLAIESWLS 288
+ + E+C ++ K +EE G E K+++ F+ N WL+
Sbjct: 199 DEYVA---YAAGEDCSNALRLVTKAVEEQILAGGSAEQKVKQIFQAESLTDNGDFFYWLA 255
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAA-SVYYNYSGT 347
+ Y S +PL ++ M D T ++ ++G + YS
Sbjct: 256 DSMA--EGIQYGFHSQLCSPL----IEAMNNNGDMISTYSNYTINVWGQSLGTPEEYS-- 307
Query: 348 AKCFDLNGDSDPHGLS-EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
+ N +DP +W +Q C + SI S + +C++ +G+
Sbjct: 308 -TVWQQNTTADPAKADRQWWFQTCAALGYFQDAPLSGSI-RSSMVNMTYFKTHCQQVFGI 365
Query: 407 DPRPNW--ITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS---KTVLALVE 461
PN + +GG+ +NI++ NG +DPWS V++ IS ++V+ E
Sbjct: 366 PLWPNTAAVNIHYGGNNTA------GTNILYTNGSQDPWSRASVIQTISDSQQSVMVTCE 419
Query: 462 KEGAHHVDLRFSTK---EDPQWLKDVRRREVEIIGKWIS 497
G H VD+R S P + VR ++++ W+S
Sbjct: 420 NCG-HCVDIRASCPGGCAQPNNIAQVRALSIKLLESWLS 457
>gi|334328701|ref|XP_001372314.2| PREDICTED: thymus-specific serine protease-like [Monodelphis
domestica]
Length = 503
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 125/468 (26%), Positives = 203/468 (43%), Gaps = 60/468 (12%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LD FN + ++F+QRY +N HW AP+F++ G EG + + G +A
Sbjct: 56 QPLDPFNSS--DGRSFRQRYWVNVGHW--RPPRAPVFLHIGGEGSLGPSSVWKGHPGTLA 111
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK--NL 177
+ AL++ +EHR+YG+SIP G + + +LSS ALAD AS + L N+
Sbjct: 112 ASWGALVISLEHRFYGQSIPPRG------LDGAQLRFLSSRHALADVASARLRLSGIYNI 165
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNIIT 234
+A+ SP + FGGSY G LAAW RLKYPH+ A+ASSAP+ L+F + S +
Sbjct: 166 SAS-SPWIAFGGSYAGSLAAWARLKYPHLIWAAVASSAPVQAQLDFSGYN--WVVSRSLA 222
Query: 235 QDFRSVSENCYKVIKGSWKQIEETAKKPGGLEK-LQKAFRIC--------KSEKNLAIES 285
S C + + ++ ++++ + G +Q R C ++E +E
Sbjct: 223 DPQVGGSPKCQRAVAQAFSELDDRLSEGGETRAVVQAEVRACGSLEAPEDQAELLEQLEG 282
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI-----DDPKTG-NDVFAKLYGAAS 339
+ Y P V+ +C + P +G D +
Sbjct: 283 LVEGVVQYDQQVGAP-----------LDVRGLCHLVLANQSRGPLSGLQDAIQLVLQTLG 331
Query: 340 VYYNYSGTAKCFDLNGDSDPH----GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
+ S A D++P G +W +Q CTE + F + D
Sbjct: 332 LPCLPSSKAAALAELKDTNPQAASLGYRQWFYQTCTEFGYYITCKDPSCPFSRRKTLSD- 390
Query: 396 RARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
+ + C + +G+ P NW T +GG G A+ + F NG DPW VL+
Sbjct: 391 QLQLCAQVFGLSPTSVAQAVNWTNTYYGGWSPG------ATRVFFVNGDIDPWHVLSVLQ 444
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ + A++ + +H D+ DP L R+R V+ + W+ +
Sbjct: 445 ALGPSEPAMLMRGTSHCSDMAPPQPSDPPSLYLGRQRIVQQLKIWLQE 492
>gi|344268143|ref|XP_003405922.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 465
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 123/468 (26%), Positives = 204/468 (43%), Gaps = 59/468 (12%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI--EWFAQNTGFMYD 117
Q LDHF+ N + ++QRY IN+T + K P+F+ G I W + + ++
Sbjct: 28 QKLDHFSENGSPF--WEQRYFINNTFY---KPGGPVFLMIGGWMTIGTNWVSTDYTWI-T 81
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
A + A + +EHR+YG+S P G + ++ YL S Q LAD A ++ K +
Sbjct: 82 YAERLGAFCLALEHRFYGQSQPTGD------LSTASLRYLRSKQVLADIAYFRTEIAKKM 135
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNIIT 234
+ VVFGGSYGG LA W R+KYP++ A++SSAP+ +NF Y + +
Sbjct: 136 GLIKNKWVVFGGSYGGSLAVWSRIKYPNLFAAAVSSSAPVKVKVNF------YEYFEGVH 189
Query: 235 QDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK-NLAIES--WLSTAF 291
+ + C K +K + Q+ + K P +L+ F +C+ K N ++S ++
Sbjct: 190 SALATHNSECSKAVKDALGQVIKLLKFPKYYRRLKNDFMLCEHLKVNSVMDSTYFVENLL 249
Query: 292 VYTAMTDYPTPSNFLNPLPAFP-VKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKC 350
++ A N + + P + + C + + G+ + Y +G C
Sbjct: 250 IFLASIIQRNKDN--ETIKSTPNIDDFCDKMTNTSLGSPYYR--YARIMTIRFKNGNPSC 305
Query: 351 FDLN----------GDSDPHGLSE---WGWQACTEMIMLTGGDNKDSIFEESEEDYDARA 397
D N D D + E W +Q+CTE+ +++ F Y
Sbjct: 306 LDANYQNFLKSMSDSDLDKENIEEDRQWFYQSCTELGFFQTTHSRNHTFSGLPLRYFLSQ 365
Query: 398 RYCKEAYGVDPRPNWITTE-------FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
C +G + N +T +GG + S IIF NG DPW+ G+ K
Sbjct: 366 --CLGVFGSEFNYNSLTQSVQAINMYYGGFNVN------GSKIIFSNGSLDPWNALGITK 417
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+IS+ + A+ K G H D+ D L R + I+ KW+ +
Sbjct: 418 DISEDLPAVFIKGGEHCADMYQQKDTDSAELIQAREKIFHILQKWLKE 465
>gi|328718793|ref|XP_001947565.2| PREDICTED: putative serine protease K12H4.7-like [Acyrthosiphon
pisum]
Length = 490
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 125/466 (26%), Positives = 204/466 (43%), Gaps = 45/466 (9%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
K+ Q +DHFN P +T++QRY +N H+ K+ PIF+ G E +I +G
Sbjct: 46 KWFKQKVDHFN--PSDTRTWKQRYHMNLQHY---KHGGPIFLSIGGEEEITHNWMTSGAW 100
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI--IDL 173
+ A K A+ +EHRYYG+S P K K YL+ Q LAD + I I
Sbjct: 101 IEYAKKLNAMCFQLEHRYYGRSHPTDNLKTKNLK------YLTVEQVLADLETFISTISN 154
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
T ++ +VFGGSY G LAAW R+KYPH+ A++SS+P++ + F I
Sbjct: 155 DNEETLRNAKWIVFGGSYSGSLAAWLRMKYPHLVYAAVSSSSPLM---AKIDYKDFYMAI 211
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-----KSEKNLAIESWLS 288
+ + C I + I + + G + + K F+ C K++K+ + + +
Sbjct: 212 QNTISNYNPKCASNIIEATSTISDLLETSYGAKYVDKKFKTCSDMNIKNKKDKTV-FFNN 270
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
A + Y + N L A + ++C + D GN + + Y A
Sbjct: 271 LALPVALIIQYNNDNKKKNKL-ALSLVKLCDMMLDKSLGNPL--ERYVAVHKQLRSVNNQ 327
Query: 349 KCFDLNGDSDPHGLSEWGWQA--------------CTEMIMLTGGDNKDSIFEESEEDYD 394
C +N L E W A CT++ +N++ +F S D
Sbjct: 328 ICTSINYQDAISALKETSWNAENVKSGSRQWLYLICTQIGNFVTSNNRNDLFGNSIP-LD 386
Query: 395 ARARYCKEAYGVDPRPNWITTEFGGHKIGLV---LKRFASNIIFFNGLRDPWSGGGVLKN 451
C++ +G N + K ++ LK+ S II+ +G D WS G+++
Sbjct: 387 YYTGMCRDVFGKSFNANSLNAAV--RKTNMIHHDLKKKTSRIIYLHGTIDAWSTLGLIQP 444
Query: 452 ISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
++K +++V + G+H DL S D LK R+ + KW+S
Sbjct: 445 MTKHSVSIVIEGGSHCSDLYPSRSSDSPQLKKARKTVEFYLKKWLS 490
>gi|66821261|ref|XP_644128.1| hypothetical protein DDB_G0274663 [Dictyostelium discoideum AX4]
gi|60472271|gb|EAL70224.1| hypothetical protein DDB_G0274663 [Dictyostelium discoideum AX4]
Length = 487
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 124/463 (26%), Positives = 209/463 (45%), Gaps = 56/463 (12%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
++ T +DH Y+PQ+ TF+Q++ +NDT++ +P+F G EG + F+
Sbjct: 56 QWFTNRVDH--YDPQNRNTFKQKFYVNDTYY---TPGSPVFYILGGEGPVGASYVTGHFV 110
Query: 116 YD-VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
++ A KF ALLV IEHR+YG SIP G ++ +N YL++ QALADYA+ + L
Sbjct: 111 FNQYAQKFNALLVAIEHRFYGDSIPMG---SLSLENLK---YLTTQQALADYAAFVPFLT 164
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSN 231
+ S + FGGSY G L+ W RLKYP + A+A+SAP+ L+F +
Sbjct: 165 QKYNTGSSKWISFGGSYSGNLSGWLRLKYPQLISAAIATSAPVKAQLDFP------EYFE 218
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL-ST 289
+++Q S+ C ++ + Q T G +++Q+ F C L I +++ S
Sbjct: 219 VVSQ---SIGPTCSAIVS-NITQTVTTMLNNGQNDQVQQMFSACDPIVSKLDIATFMESL 274
Query: 290 AFVYTAMTDYPTPSN---FLNPLPAFPVKEMCKAID---DPKTG----NDVFAKLYGAAS 339
+ T Y +N F N + MC+ + DP N+ + + G+
Sbjct: 275 SSGITETVQYNLDNNNYTFTN------ITAMCERFEQSSDPMKEFIDFNNEYNQFSGSQC 328
Query: 340 VYYNYSGTAKCFDLNGDSDPHGLSE-WGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
+Y + + + + S W WQ CTE G +++ F S + +
Sbjct: 329 TLSSYEKSIQYLQSSNYKSANASSRSWNWQCCTEYGYWQTGSSQNQPF-SSAITLEYFTQ 387
Query: 399 YCKEAYG-----VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS 453
C + +G P +I ++GG I A+N+I+ G DPWS V +
Sbjct: 388 MCTDIFGPKGFVYQPAIQYILNDYGGTNIQ------ATNVIYERGTIDPWSVLSVQSPPN 441
Query: 454 KTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + G+H L +D + + R E+ +I +
Sbjct: 442 SESQVFLIQGGSHCSALYPPKPDDLPGVTEAREMEIALISSIV 484
>gi|170066899|ref|XP_001868268.1| thymus-specific serine protease [Culex quinquefasciatus]
gi|167863076|gb|EDS26459.1| thymus-specific serine protease [Culex quinquefasciatus]
Length = 485
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 123/471 (26%), Positives = 218/471 (46%), Gaps = 66/471 (14%)
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHW--GGSKNNAPIFVYTGNEGDIEWFAQNT 112
TK+ Q +D N++PQ+ T+ RY+ N ++ GG+ +F+Y G E I +
Sbjct: 53 TKWIMQKVD--NFDPQNPSTWSMRYMDNGEYYNPGGA-----LFIYVGGEWTINEGSLVR 105
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
G +D+A + A + + EHRYYG S P + + +L+ QALAD A + +
Sbjct: 106 GHFHDMARELGAYIFYTEHRYYGLSRPTANTRTDQMR------FLNVDQALADLAHFVEE 159
Query: 173 LKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+++ + A ++ V++ GGSY + AWFR KYPH+ GA ASSAP+L + +
Sbjct: 160 MRRTIPGAENAKVIMAGGSYSATMVAWFRQKYPHLINGAWASSAPLLAKLDFT---EYKE 216
Query: 232 IITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS---EKNLAIESWL 287
+++ R V + C ++ ++E+ K+ G +++ +AF +C S K L +++L
Sbjct: 217 VVSDSIRLVGGDACADRVQRGVAEVEDLIKQ-GSYDQVAQAFNLCASTDLTKTLDKQNFL 275
Query: 288 STAFVYTA-MTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSG 346
S+ Y A + Y P + ++ +C+ I+DP D+ A L G + SG
Sbjct: 276 SSISDYFAGVVQYHWPGD---------IEGVCEVINDPSYTTDMEA-LAG-----WFTSG 320
Query: 347 TAKCFDLNGDSD---------PHG-----LSEWGWQACTEMIMLTGGDNKDSIFEESEED 392
+ +C+D + DS HG + W +Q C E +++ IF S
Sbjct: 321 STRCYDASYDSMISYYRSTDWTHGANTGAMRPWFYQTCAEYGWYQTSGSENQIF-GSGFP 379
Query: 393 YDARARYCKEAYGVDPRPNWI------TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
+ R C + Y + T GH + +N+ F G DPW
Sbjct: 380 VELYIRMCADLYDYKFPERLLHVNVARTNTIYGH-----MNPEVTNVFFTQGQLDPWRPM 434
Query: 447 GVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G+ +++++ A+V +H DL + D ++ + R E+I W++
Sbjct: 435 GLQEDLNEHSPAVVIPLASHCADLSSISAADSPEMRAAKERVFELIKMWLA 485
>gi|145525753|ref|XP_001448693.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416248|emb|CAK81296.1| unnamed protein product [Paramecium tetraurelia]
Length = 462
Score = 139 bits (349), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 133/455 (29%), Positives = 211/455 (46%), Gaps = 46/455 (10%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
TQ+LDH +P + QT+QQRY + ++ +K + +Y E + + + N+ F + +
Sbjct: 35 TQLLDH--SDPANTQTWQQRYHVYSQYFNPTKGG--VILYICGEWNCQGVSDNS-FSFQL 89
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK-NL 177
A A+++ +EHR+YG+S P+G + + +N S YL+ QAL D A I+ +K+ L
Sbjct: 90 AKDLGAIVIALEHRFYGQSQPFGADS-WSLENLS---YLNVHQALDDLAYFILQMKRLKL 145
Query: 178 TATDS--PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
+ DS P GGSY G L+AWFR KYPH+ +G LASS I + ++ + F + I +
Sbjct: 146 HSIDSTLPWYAIGGSYPGALSAWFRYKYPHLTVGNLASSGVI---NTVLDFWEFDDQIRK 202
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTA 295
E C ++ +++ K + ++++R K N W+ T
Sbjct: 203 STSKSGEQCPLYLQLLNSFVDKNLKNFNTKQAFKESYRCGKMTDNEFRWFWVDTI---VQ 259
Query: 296 MTDYPTPSNFLNPLPAF-PVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLN 354
M S F L + V+ M + I + K YGA YY + T D N
Sbjct: 260 MVQQGKRSKFCQTLESLSSVERMAEYIREIALSQGDSYKQYGA---YYLRNET---IDEN 313
Query: 355 GDSDPHGLSEWGWQACTEMIML-TGGDNKDSIFEESEEDYDARARYCKEAY--GVDPRPN 411
+W +Q CTE+ L T NKDS+ E D +C +AY G P+
Sbjct: 314 SQH-----RQWYFQCCTEVAYLQTPPQNKDSL-RSYEMTLDWWREWCNDAYSQGEVVWPD 367
Query: 412 WITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV---LKNISKTVLALVE-KEGA 465
TE FGG K+ + ++I NG DPW + K+ SK L++ + +
Sbjct: 368 VRVTEAYFGGLKLNV------DHLIMTNGGEDPWQRASLPFARKDNSKVTTYLIDCDDCS 421
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
H VDL+ T DP L R +W +Q++
Sbjct: 422 HCVDLKAPTANDPAVLTQTRLDIKNKFKQWHAQFW 456
>gi|195492404|ref|XP_002093975.1| GE21585 [Drosophila yakuba]
gi|194180076|gb|EDW93687.1| GE21585 [Drosophila yakuba]
Length = 508
Score = 139 bits (349), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 130/492 (26%), Positives = 215/492 (43%), Gaps = 58/492 (11%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P K+ +L S+ S+ L+ Q LDHF + + +T+QQRY +N ++ + ++AP+
Sbjct: 42 PSKIPTL-QRSQQSEDLW----FEQRLDHFKSSDK--RTWQQRYFVNADYYR-NDSSAPV 93
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
F+ G EG+ G A F AL + +EHR+YGKS P A +
Sbjct: 94 FLMIGGEGEASAKWMREGAWVHYAEHFGALCLQLEHRFYGKSHP------TADLSTDNLR 147
Query: 156 YLSSTQALADYASLIIDLKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
YLSS QAL D AS + +K D + FGGSY G LAAW R K+P + G+++SS
Sbjct: 148 YLSSEQALEDLASFVTAMKVKFNLADGQKWIAFGGSYPGSLAAWAREKFPQLIHGSISSS 207
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
P+L V + ++ + C + + S+ Q+E K G L + F+
Sbjct: 208 GPLL---AEVDFKEYFEVVKASLAAYKPECVEAVTRSFAQVEILLKHMIGQRSLDEKFKT 264
Query: 275 CKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL---------NPLPAFPVKEMCKAIDDPK 325
C K+ +IE+ L A + + NF +P + ++C + +
Sbjct: 265 CTPIKD-SIENELDMANFFENLA-----GNFAGVVQYNKDNSPHATITIDDICDVMLNTT 318
Query: 326 TG---------NDVFAKLYGAASVYYNYSG-TAKCFDLNGDSD-PHGLSEWGWQACTEMI 374
G ND+ K + Y Y A +++ DS+ G+ +W +Q C E
Sbjct: 319 AGPPVTRLGLVNDMLLKESNTTCLDYKYEKMVADMKNVSWDSETAKGMRQWTYQTCHEFG 378
Query: 375 MLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWI------TTEFGGHKIGLVLKR 428
+N F + D R C + + + ++ T +F G LK
Sbjct: 379 FYQTSENPADTFGD-RFGVDFFIRQCMDVFSKNMDAKFLQLVVSGTNDFYG-----ALKP 432
Query: 429 FASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG-AHHVDLRFSTKEDPQWLKDVRRR 487
+N+++ +G DPW G++K+ S L + EG AH ++ K DP L R +
Sbjct: 433 NTTNVLYVHGSIDPWHALGLVKS-SNPALPTIYIEGTAHCANMYEPVKTDPPQLVAARNK 491
Query: 488 EVEIIGKWISQY 499
++ + K + Y
Sbjct: 492 ILKFLAKLLDGY 503
>gi|354500289|ref|XP_003512233.1| PREDICTED: thymus-specific serine protease [Cricetulus griseus]
Length = 509
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 122/492 (24%), Positives = 213/492 (43%), Gaps = 69/492 (14%)
Query: 49 SQGLY---KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
SQGL K + Q LD FN + + +TF QRY +ND H G +AP+F++ G EG +
Sbjct: 49 SQGLVAVPKQGWLEQPLDPFNTSDR--RTFLQRYWVNDRHRAG--QDAPVFLHIGGEGSL 104
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
+ G +AP + AL++ +EHR+YG S+P GG + + YLSS ALAD
Sbjct: 105 GPGSVMAGHPVALAPAWGALVISLEHRFYGLSMPSGG------LDMAQLRYLSSRHALAD 158
Query: 166 YASLIIDLKKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIV 224
AS L + L ++ SP + FGGSY G LA W RLK+PH+ A+ASSAP+ ++
Sbjct: 159 VASARQALSRLLNVSSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPL---SAVL 215
Query: 225 SPYSFSNIITQDFRSV----SENCYKVIKGSWKQIEETAKK-PGGLEKLQKAFRICKS-- 277
Y+++ ++ + V S+ C ++ ++E + P L++ C S
Sbjct: 216 DFYAYNEVVARSLSQVAIGGSQECLAAASAAFAEVERLLRAGPAAQAVLREELSACGSLD 275
Query: 278 ------EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP---KTGN 328
E A+++ + Y P V+++C + + +T +
Sbjct: 276 LTEDQGELLGALQALVGGTVQYDGQAGAP-----------LSVRQLCGLLLEDSGNRTHS 324
Query: 329 DVFAKLYGAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQACTEMIML 376
+ L A + G +C + ++P G +W +Q CTE
Sbjct: 325 TPYLGLRRAVQIVLRTMGQ-RCLSFSRAETVAQLRSTEPQVSGVGDRQWLYQTCTEFGYY 383
Query: 377 TGGDNKDSIFEESEEDYDARARYCKEAYGV-----DPRPNWITTEFGGHKIGLVLKRFAS 431
+ F + + C++ +G+ + +GG G A+
Sbjct: 384 VTCEGPQCPFSQLPA-LPSHLDLCEQVFGLSAASVAQAVAQTNSYYGGQTPG------AT 436
Query: 432 NIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEI 491
+++ NG DPW V +++ + A++ +H D+ D L+ R+R +
Sbjct: 437 QVLYVNGDTDPWHVLSVTQDLGPSEPAILIPSASHCFDMAPMRPSDSPSLRLGRQRIFQQ 496
Query: 492 IGKWISQYFQDL 503
+ W+ ++L
Sbjct: 497 LQVWLKDLKKNL 508
>gi|383860201|ref|XP_003705579.1| PREDICTED: putative serine protease K12H4.7-like [Megachile
rotundata]
Length = 494
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 129/476 (27%), Positives = 207/476 (43%), Gaps = 52/476 (10%)
Query: 52 LYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFA 109
L K ++ Q LDHFN P +QQRY +N ++ K P+F+ G EG + +W
Sbjct: 44 LPKEQWFPQFLDHFN--PTDAHVWQQRYFVNGDYY---KVGGPVFLMIGGEGAANAKWMV 98
Query: 110 QNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL 169
+ Y A +F AL +EHR+YGKS P +++ KN YLSS QALAD A
Sbjct: 99 EGQWIEY--AKEFGALCFQVEHRFYGKSHP---TSDLSVKNLM---YLSSEQALADLAYF 150
Query: 170 IIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS 228
+ + ++ + FGGSY G LAAW R KYPH+ GA+++S P+L + Y
Sbjct: 151 VQTVNTMYKLPNNTKWIAFGGSYAGSLAAWLREKYPHLVHGAVSASGPLLAQIDFQEYYV 210
Query: 229 FSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK-------SEKNL 281
++ + SE C I + Q + G + L K F +C K++
Sbjct: 211 ---VVENALKEYSEACVNAIVEANTQFHIMLRHRIGQQGLAKKFILCNPIDPGYTKTKDI 267
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVY 341
A + + A + + Y + + + F +K C + + G + +L ++
Sbjct: 268 A-NLYETIASDFAGIVQYNKNNRNNSAMANFTIKSACDILTNESLGLAI-DRLALISNKI 325
Query: 342 YNYSGTAKCFD------LN-------GDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEE 388
N + KC D +N D G +W +Q CTE + ++F
Sbjct: 326 LN-ATNKKCLDYVYNKMINELRNISWASEDAEGGRQWMYQTCTEFGFFQTSTARPNLFSN 384
Query: 389 SEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGL---VLKRFASNIIFFNGLRDPWSG 445
S D + C + +G PR N + ++ + L +N++F +G DPW
Sbjct: 385 SFP-VDFFVQQCLDIFG--PRYNIQLLKSAVNRTNIFYGALNLKVTNVVFVHGSVDPWHV 441
Query: 446 GGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
G+ K+ + A+ AH ++ +K D LK R + IG ISQ+ Q
Sbjct: 442 LGITKSSNPQAPAIYINGTAHCANMYPPSKSDVPELKVARAK----IGHLISQWLQ 493
>gi|170066901|ref|XP_001868269.1| thymus-specific serine protease [Culex quinquefasciatus]
gi|167863077|gb|EDS26460.1| thymus-specific serine protease [Culex quinquefasciatus]
Length = 484
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 121/490 (24%), Positives = 228/490 (46%), Gaps = 68/490 (13%)
Query: 30 PSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGS 89
P+ R+T ++S TK+ Q LD N++PQ+ T+ RY+ N H+
Sbjct: 41 PAKRVTRAQVS--------------TKWIKQKLD--NFDPQNPSTWSMRYMENGEHY--- 81
Query: 90 KNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
+P+F++ G E I + G YD+A + +A L + EHRYYG+S P + +
Sbjct: 82 VPGSPLFIFVGGEWTISAGSIQQGHFYDMAAEHRAYLFYTEHRYYGQSRPTVNTRTDQMR 141
Query: 150 NASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
+L+ QALAD A + ++++ + A +S V++ GGSY + WFR KYPH+
Sbjct: 142 ------FLNVDQALADLAHFVEEMRRTIPGAENSKVIMVGGSYSATMVVWFRQKYPHLVN 195
Query: 209 GALASSAPILNFDNIVSPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEK 267
G ASSAP+L + + ++++ R V + C ++ ++E+ K+ G ++
Sbjct: 196 GVWASSAPLLAKLDFT---EYKEVVSESIRLVGGDACADRVQRGVAEVEDLIKQ-GSYDQ 251
Query: 268 LQKAFRICKSEKNLA----IESWLST-AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID 322
+ +AF +C ++ +L+ + +LS+ + + + Y + ++ +CK I+
Sbjct: 252 VAQAFNLC-ADTDLSNTRDRQGFLSSISDTFAGVVQYHWSGD---------IEGVCKVIN 301
Query: 323 DPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSD---------PHG-----LSEWGWQ 368
DP D+ A L G ++ Y G+ +C D + +S HG + W +Q
Sbjct: 302 DPNYNTDMEA-LAG----WFTY-GSTRCLDASYESMISYYRNTDWTHGANTGSMRPWLYQ 355
Query: 369 ACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD-PRPNWITTEFGGHKIGLVLK 427
C E +++ IF S D ++C + Y P + + I +
Sbjct: 356 TCAEYGWYQTSGSENQIF-GSGFPVDLYIQWCADLYDNKFPESSMHANVARTNTIYGHMN 414
Query: 428 RFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRR 487
+N++F G DPW GV +++++ A+V +H DL + D ++ + +
Sbjct: 415 PEVTNVLFTQGQLDPWRPMGVQQDLNERSPAVVIPLASHVADLNSISDWDSTEMRAAKEK 474
Query: 488 EVEIIGKWIS 497
E++ W++
Sbjct: 475 ISELVRLWLA 484
>gi|339245689|ref|XP_003374478.1| putative serine protease [Trichinella spiralis]
gi|316972265|gb|EFV55948.1| putative serine protease [Trichinella spiralis]
Length = 490
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 135/513 (26%), Positives = 217/513 (42%), Gaps = 80/513 (15%)
Query: 27 PTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW 86
P PS RI L++ ++++ + Y Q LDHF++ Q T+ QRY +N ++
Sbjct: 17 PQKPSVRIIGVPLNTKVAANFN--------YFQQNLDHFHH--QQNITWLQRYWVNTKYY 66
Query: 87 GGSKNNAPIFVYTGNEGDI--EWFAQNTGFMYD---VAPKFKALLVFIEHRYYGKSIPYG 141
K P F+ G EG W ++ + D A F A+ +EHRYYG+S P
Sbjct: 67 ---KPGGPAFLLIGGEGPAISSWIQESEKYPKDWMKKAQTFGAICFMLEHRYYGESHPTD 123
Query: 142 GNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRL 201
K +L+S QALAD A+ I S + FGGSY G+L+ W RL
Sbjct: 124 NMK------TENLRWLTSDQALADVANFISYATTRYNLQGSRWITFGGSYAGLLSGWSRL 177
Query: 202 KYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKK 261
KYPH+ GA+ASSAP F V+ + + + + ++ + NC I+ + Q+ + +
Sbjct: 178 KYPHLITGAVASSAP---FHIKVNFHEYLDSVFDSIKNENANCASEIENAVLQMRKLLRT 234
Query: 262 PGGLEKLQKAFRICKSEKNLA---IESWLST-AFVYTAMTDYPTPSNFLNPLPAFPVKEM 317
G ++++ +CK K +++ +T A +++ + Y P+ A +K M
Sbjct: 235 EYGRREMKEKLSLCKKIKQAEGKDVQNLFATIADMFSFIVQYNQPNT------AMSMKNM 288
Query: 318 CKAIDDPKTGNDV---------FAKLYGAASVYYNYSGT-AKCFDLNGDSDPHGLSEWGW 367
C+ + D G+ V + A + + YS A+ D +W +
Sbjct: 289 CEKLTDLSGGDPVTRLGVIIRWMLRFTSALCLNFRYSDMIAELSDTKWTKS--STRQWQY 346
Query: 368 QACTEMIMLTGGDNKDSIFEESEEDYDARARY--CKEAYGVDPRPNWI-------TTEFG 418
Q CTE D+ F + YD + + CK+ +G + +G
Sbjct: 347 QTCTEFGYFQTTDSLQHAFFGT---YDLQFFFDICKDTFGKQFTEATVHRGAFEKNVRYG 403
Query: 419 GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN---ISKTV----------LALVEKEGA 465
G ++ SNII NG DPW G++ N +SK + LA +
Sbjct: 404 GKQLK------KSNIILINGSVDPWHRLGLVNNPHPLSKAILINGTILHKLLARTNISSS 457
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
H D+ DP+ L RR I WI Q
Sbjct: 458 HCADMYPELTSDPETLIKARREITGHIASWILQ 490
>gi|440799806|gb|ELR20849.1| protease, serine, 16 (thymus), putative [Acanthamoeba castellanii
str. Neff]
Length = 478
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 127/471 (26%), Positives = 201/471 (42%), Gaps = 63/471 (13%)
Query: 44 SSSKDSQGLYKTKYH-TQILDHFNYNPQSYQTFQQRYLINDTHW--GGSKNNAPIFVYTG 100
S+ + +GL + Y TQ +DHF+ P + +T+QQ+Y++ D ++ GG PIF++ G
Sbjct: 51 SAPTELKGLSQKVYQFTQKVDHFD--PLNGKTYQQKYIVTDDNYVPGG-----PIFLFLG 103
Query: 101 NEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSST 160
E +E+F T + +F AL + +EHR+YG S+P Y AS LSS
Sbjct: 104 GEAPVEFFDFQTVLPRSLTKQFGALYIALEHRFYGVSMPAHD-----YSTASL-ALLSSR 157
Query: 161 QALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI--- 217
QALAD A+ ++ K LT P VV+G SY G L+AWFR KYP++ +G++A S P+
Sbjct: 158 QALADAANFLVSFNKTLT-NPGPWVVWGCSYSGALSAWFRAKYPNLVVGSVAPSGPVYAS 216
Query: 218 LNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS 277
LNF +S + S C + +K + + G ++L + I S
Sbjct: 217 LNFTQYYGVFS---------TAASPQCVETVKRATAMLMAKLSTADGRKELTE---ISAS 264
Query: 278 EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA 337
+ E + F+ T F NP PA+P+ C + D+ A
Sbjct: 265 PQ----EHYY---FLLTLTEAIGGSDQFQNP-PAWPLNTTCNTM---MQSGDLLANWAQV 313
Query: 338 ASVYYNYSGTAKCFDLNGDSD--------PHGLSEWGWQACTEMIMLTGGDNKDSIFEES 389
+ C D N ++ W +Q CTE S+F
Sbjct: 314 VNQANGPKAPNACNDFNEETSYLKPLRQPTSSDRSWLFQQCTEFGFFMPTYPGTSVFPLM 373
Query: 390 EEDYDARARYCKEAYGVD---PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
D + + ++C+ +GV P +GG+ + SNI+F NG DPW
Sbjct: 374 --DLEHQVKWCQNVFGVSGMTPNTEGTNAYYGGYDLR------GSNILFTNGDADPWHTL 425
Query: 447 GVLKNISKTV-LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ K++ + V H + T +DP L+ R I +
Sbjct: 426 SITKDLPAPAGVRAVTYAAGHCAPMTQPTSQDPVSLQHARVVVANFIASLV 476
>gi|255645752|gb|ACU23369.1| unknown [Glycine max]
Length = 490
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 148/511 (28%), Positives = 214/511 (41%), Gaps = 90/511 (17%)
Query: 26 FPTFPSSRITPEKLSSLISSSKDSQGLYKT---KYHTQILDHFNYNPQSYQTFQQRYLIN 82
FP + P L + K SQG Y T ++ Q LDHF+ P + F+QRY
Sbjct: 24 FPALSYGVVPPRTLLN-----KLSQGSYLTTQEQWFNQTLDHFS--PYDHHQFRQRYFEF 76
Query: 83 DTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGG 142
++ + PIF+ G EG + ++ +A KF A +V +EHRYYGKS P+
Sbjct: 77 LDYF--RIPDGPIFLVIGGEGPCNGITND--YIGVLAKKFGAAMVTLEHRYYGKSSPFNS 132
Query: 143 NKEIAYKNASTTGYLSSTQALADYASL------IIDLKKNLTATDSPVVVFGGSYGGMLA 196
+ K YLSS QAL D A ++ K N T ++P VFGGSY G L+
Sbjct: 133 LETENLK------YLSSKQALFDLAVFRQYYQDSLNAKLNRTKIENPWFVFGGSYAGALS 186
Query: 197 AWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIE 256
AWFRLK+PH+ G+LASSA +L + Y+F+ Q S C V++ + + IE
Sbjct: 187 AWFRLKFPHLTCGSLASSAVVL------AVYNFTEYDQQIGESAGAECKAVLQETTQLIE 240
Query: 257 ETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKE 316
G ++L+ +F EK+ F+Y + NP +
Sbjct: 241 HKLATNG--KELKASFNADDLEKD--------GDFMYLIADAAAVAFQYGNP------DK 284
Query: 317 MCKAIDDPKTGNDVFAKLYGAASVYYNYSGT----AKCFD--------LNGDSDPHGLSE 364
+CK + + K + Y A V Y GT + +D +N DS
Sbjct: 285 VCKPMVEAKNAGEDLVDAY-AKYVKEYYIGTFGVNVQTYDQEYLKKTAINEDSSTR---L 340
Query: 365 WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE--FGGHKI 422
W +Q CTE+ + DSI SE D CK +G P+ T +GG KI
Sbjct: 341 WWFQVCTEVAFFQVAPSNDSI-RSSEIDAKYHMDLCKNIFGEGIFPDVDATNLYYGGTKI 399
Query: 423 GLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL---ALVEKEGAHHVDLRFSTK---- 475
S I+F NG +DPW K S L + AH D R +
Sbjct: 400 A------GSKIVFANGSQDPWRHAS--KQTSSPDLPSYTITCSNCAHCTDFRGCPQSPLV 451
Query: 476 --------EDPQWLKDVRRREVEIIGKWISQ 498
P + VR++ E + W+S+
Sbjct: 452 LEGNEKNCSSPDAVHKVRQQITEHMDLWLSE 482
>gi|195497934|ref|XP_002096311.1| GE25599 [Drosophila yakuba]
gi|194182412|gb|EDW96023.1| GE25599 [Drosophila yakuba]
Length = 473
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 127/484 (26%), Positives = 216/484 (44%), Gaps = 59/484 (12%)
Query: 33 RITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNN 92
RI E S I + D + +T + Q LDHF+ P +T+Q RY++ND + K+
Sbjct: 28 RIHEEPPLSTIQNRAD---VVQTLWIEQKLDHFD--PAETRTWQMRYMLNDALY---KSG 79
Query: 93 APIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAS 152
AP+F+Y G E +I G +YD+A + ALL + EHRYYG+S P +++ +N
Sbjct: 80 APLFIYLGGEWEISSGRITGGHLYDMAKEHNALLAYTEHRYYGQSKPL---PDLSNENIK 136
Query: 153 TTGYLSSTQALADYASLIIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGAL 211
YLS Q+LAD A I +K+N ++S V++ GGSY + WF+ YP + G
Sbjct: 137 ---YLSVNQSLADLAYFINTIKQNHEGLSESKVIIVGGSYSATMVTWFKKLYPDLVAGGW 193
Query: 212 ASSAPILNFDNIVSPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQK 270
ASSAP+ N V + + Q + + +CYK I+ ++E G E ++
Sbjct: 194 ASSAPLFAKVNFV---EYKEVTGQSIQQMGGSDCYKRIENGIAEMESMIATKRGAE-VKA 249
Query: 271 AFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV 330
++C+ + S L +++ ++D + + A ++ +C+ I G++
Sbjct: 250 ILKLCEP---FDVYSDLDVWTLFSEISD--IFAGVVQTHNAGQIEGVCQKI---MAGSND 301
Query: 331 FAKLYGAASVYYNYSGTAKCFDLNGDS----------DPHGLSEWGWQACTEM-IMLTGG 379
+ G + SG KC+DL+ D+ + + + +W +Q C E T G
Sbjct: 302 LNGVAGYLLDVFAESG-GKCYDLSYDAITGLLLDTNYNGNIMRQWMFQTCNEYGWYQTSG 360
Query: 380 DNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE-------FGGHKIGLVLKRFASN 432
+ Y C + YG +I+ + FGG G+ N
Sbjct: 361 STAQPFGTKFPVTY--YTTMCADLYGSQYSNEFISNQVSITNQFFGGLSPGV------EN 412
Query: 433 IIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEII 492
+ +G DPW G+ T++ E AH D + D +K + R E++
Sbjct: 413 VYLTHGQLDPWRAMGIQDESQATIIP----EHAHCKDFNSISSSDTAEMKASKERIAELV 468
Query: 493 GKWI 496
+W+
Sbjct: 469 REWV 472
>gi|344292454|ref|XP_003417942.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 505
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 129/507 (25%), Positives = 216/507 (42%), Gaps = 83/507 (16%)
Query: 46 SKDSQGLYKTKYHT--QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE- 102
S S+G++ ++ + Q LDHF+ N S Q + QRY IND + K P+F+ G
Sbjct: 28 SYSSKGMHPSRAGSFPQKLDHFSKN--SSQLWPQRYFINDAFY---KPGGPVFLMIGGAW 82
Query: 103 -GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQ 161
W + + ++ A + AL + +EHR+YG S P G + ++ YLSS Q
Sbjct: 83 IACESWVSISKTWV-TYAERLGALFLLLEHRFYGHSQPTGD------LSTASLHYLSSRQ 135
Query: 162 ALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFD 221
ALAD A+ ++ K + T + V +G SYGG LA W RLK+P + A+ SSAPI
Sbjct: 136 ALADIANFRTEIAKKMGLTKNNWVAYGCSYGGSLAVWSRLKHPDLFAAAVGSSAPIKAKA 195
Query: 222 NIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL 281
N Y + ++ + + + C++ +K ++ Q+ + + P KL++ F +C K L
Sbjct: 196 NF---YEYLEVVQRSLATHNSKCFQTVKEAFDQVVKMLRFPKYYRKLERDFTLC---KRL 249
Query: 282 AIESWLSTAFVYTAMTDYP--------------------TPSNFLNPLP----------A 311
+ S + A+ + +P TP P +
Sbjct: 250 KLYSAMDKAYFLERLI-FPVELAVQHNKNEKNDKILSPSTPGTLFIPPTCSRIALGEQLS 308
Query: 312 FPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLN-------------GDSD 358
F + E+C A+ + G+ + Y + + CF + D
Sbjct: 309 FSMDELCDAMANTSLGSPYHRYVRIILLTYKD--KYSPCFAAHYRTKLKILLDTSINHHD 366
Query: 359 PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE-- 416
P+ ++ +Q+CTE D+K+ F Y + C + +G + + T
Sbjct: 367 PNIDRQFFYQSCTEFGFFQTTDSKNQPFTGLPLSYFLQQ--CSDFFGPKFNNDSLNTGVI 424
Query: 417 -----FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
+GG + S IIF NG DPW G+ K+ISK + A+ K H D+
Sbjct: 425 STNAYYGGFNMT------GSKIIFPNGSFDPWHPLGITKDISKDLPAVFIKGAVHCADMF 478
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQ 498
D L R + +++ KW+ Q
Sbjct: 479 EQNDTDSAELIQAREKIFQLLQKWLKQ 505
>gi|301783465|ref|XP_002927151.1| PREDICTED: thymus-specific serine protease-like [Ailuropoda
melanoleuca]
Length = 476
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 122/486 (25%), Positives = 210/486 (43%), Gaps = 75/486 (15%)
Query: 37 EKLSSLISSSKDSQGLY-----------KTKYHTQILDHFNYNPQSYQTFQQRYLINDTH 85
+L I ++SQGL K + Q LD FN + Q ++F QRY +ND H
Sbjct: 28 RRLGEHIQQFQESQGLSLTLGPGAGALPKQGWLEQPLDPFNTSDQ--RSFLQRYWVNDQH 85
Query: 86 WGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKE 145
W + P+F++ G EG + + G +AP + AL++ +EHR+YG SIP G
Sbjct: 86 W--ASQRGPVFLHLGGEGSLRSGSVMRGHPAALAPAWGALVIGLEHRFYGLSIPAEG--- 140
Query: 146 IAYKNASTTGYLSSTQAL--ADYASLIIDLKKNL-TATDSPVVVFGGSYGGMLAAWFRLK 202
+ + +LSS AL AD AS L + +T SP + FGGSY G LAAW RLK
Sbjct: 141 ---LDVAQLRFLSSRHALPSADVASARRALARLFNVSTASPWICFGGSYAGSLAAWARLK 197
Query: 203 YPHVAIGALASSAP---ILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETA 259
+PH+ + ++ASSAP +L+F ++ ++++ S + C +
Sbjct: 198 FPHLILASVASSAPVRAVLDFSE------YNEVVSRSLTSTAPACSSL------------ 239
Query: 260 KKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK 319
GG E ++E A+++ + A Y A P L L P
Sbjct: 240 ---GGAED--------QAELLGALQALVGGAVQYDAQAGVPLSVRRLCGLLLGPSGSRSA 288
Query: 320 AIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPH----GLSEWGWQACTEMIM 375
+ + V + G + ++ + T + +DPH G +W +Q CTE
Sbjct: 289 SYHGLRRAVQVVMRSLGQRCLSFSRAETVAQLRV---TDPHVSGVGDRQWLYQTCTEFGF 345
Query: 376 LTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFA 430
++ F + ++ C++ +G+ + + +GG G +
Sbjct: 346 YVTCEDPRCPFSQLPA-LPSQLELCEQVFGLSTASIARAVSQTNSYYGGQTPG------S 398
Query: 431 SNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVE 490
++++F NG DPW V + + + AL+ +H +D+ D L+ R+ ++
Sbjct: 399 THVLFVNGDTDPWHVLSVTQALGPSESALLIPGASHCMDMAPERPSDSPSLRLGRQSILQ 458
Query: 491 IIGKWI 496
+ W+
Sbjct: 459 QLQTWL 464
>gi|290998882|ref|XP_002682009.1| peptidase S28 [Naegleria gruberi]
gi|284095635|gb|EFC49265.1| peptidase S28 [Naegleria gruberi]
Length = 434
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 123/466 (26%), Positives = 204/466 (43%), Gaps = 60/466 (12%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY-D 117
TQ LDHF+ PQ+ +TFQQR+ +NDT W G +F+ G EG F+ +
Sbjct: 2 TQRLDHFD--PQNTETFQQRFWVNDTMWQGKN----VFIIIGGEGPASSKYLTGHFVINE 55
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD---YASLIIDLK 174
K ALL +EHR+YG+S+P K +A N YL+S QAL D + SL++ K
Sbjct: 56 YGKKHGALLAALEHRFYGESVP---RKSLATDNLR---YLTSEQALQDLVEFRSLLVK-K 108
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
+ + V FGGSY G L+AW + KYPH+ +GA+ASS P + + F+ +
Sbjct: 109 YRMDEANVKFVCFGGSYSGNLSAWLKAKYPHLFVGAIASSGP------VEAKLEFNEYMM 162
Query: 235 QDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYT 294
S+ C ++ + IE+ P G +++ F +C E ++ + A +++
Sbjct: 163 TVANSIGPKCTDRVRKANDLIEQLIATPAGRQRVASMFNVCNPE---SMTNNDDIALLFS 219
Query: 295 AMTD-------YPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGT 347
+++D Y + N F V MC I+ + FA + Y S T
Sbjct: 220 SLSDGVCEVVQYNLDN---NGAQGFNVTSMCAIIESSDDALEGFANWVKTWNSYSQSSCT 276
Query: 348 AKCFD----LNGDSDPHGLSE------WGWQACTEMIMLTGGDNKDSIFEESEEDYDARA 397
+ D P +E W +Q C E F +
Sbjct: 277 QNLYSDFIKQMQDVRPWPANENAAGRSWTYQTCVEFGYYQNAVGPKQPF-SPRITLEWFV 335
Query: 398 RYCKEAYGVD---PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV---LKN 451
+ C + +G++ P ++ +G I +N +F G DPWS V +N
Sbjct: 336 QQCSQIFGINGMKPDIDFTNNMYGSKNIQ------TTNTVFSTGSVDPWSVLAVAQPTRN 389
Query: 452 ISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
I++ + ++ AH DL S+ +D L + R + +++ +W++
Sbjct: 390 IAQNYVYHMQGT-AHCADLYSSSPKDLPTLTNTRVQTQKLLDQWLA 434
>gi|440900328|gb|ELR51489.1| Thymus-specific serine protease [Bos grunniens mutus]
Length = 516
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 125/490 (25%), Positives = 215/490 (43%), Gaps = 76/490 (15%)
Query: 48 DSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW 107
DS L K + Q LD FN + + ++F QRY +ND HW + + P+F++ G EG +
Sbjct: 50 DSVTLPKEGWLEQPLDPFNASDR--RSFLQRYWVNDQHW--TSQDGPVFLHLGGEGSLGP 105
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
+ G ++AP + AL++ +EHR+YG SIP G + +LSS ALAD A
Sbjct: 106 GSVMRGHPANLAPIWGALVISLEHRFYGLSIPAEGLDMAQLR------FLSSRHALADAA 159
Query: 168 SLIIDLKK--NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---ILNF-- 220
S + L + N+++T SP + FGGSY G LAAW RLK+PH+ ++ASSAP IL+F
Sbjct: 160 SARLTLSRLFNVSST-SPWICFGGSYAGSLAAWARLKFPHLFFASIASSAPVRAILDFSK 218
Query: 221 -DNIVSPYSFSNII--TQDFRSVS-------ENCYKVIKGSWKQIEETAKKPGGLEKLQK 270
+++VS + I + + R + E + +G+W + G LE+ +
Sbjct: 219 YNDVVSRSLMNTAIGGSLECREAASAAFAEVERRLRASRGAWATLSVELGACGSLERAED 278
Query: 271 AFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK----- 325
++E A+++ + A Y P V+++C+ + +
Sbjct: 279 -----QAELLGALQALVGGAVQYDGQAGAP-----------LSVRQLCRLLLGDRGNCKG 322
Query: 326 --TGNDVFAKLYGAASVYYNYSGTAKCFDL------------NGDSDPHGLSEWGWQACT 371
+G+ + L A V + G +C + G +W +Q CT
Sbjct: 323 NCSGSAPYRGLRRAVQVVTHSLGQ-RCLSIPRAETVAQLRVTELQVSSVGDRQWLYQTCT 381
Query: 372 EMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIGLVL 426
E + F + + C++ +G V + +GG G
Sbjct: 382 EFGYYVTCEVPGCPFSQLPA-LPSELELCEQVFGLSTSSVAQAVAQTNSYYGGQTPG--- 437
Query: 427 KRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRR 486
A+ ++F NG DPW V + + + AL+ +H +D+ D L+ R+
Sbjct: 438 ---ATQVLFINGDTDPWHVLSVTQPLGSSEPALLIPSASHCLDMAPERPSDSPSLRLARQ 494
Query: 487 REVEIIGKWI 496
+ + + W+
Sbjct: 495 KVSQQLQTWL 504
>gi|47224819|emb|CAG06389.1| unnamed protein product [Tetraodon nigroviridis]
Length = 493
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 123/480 (25%), Positives = 209/480 (43%), Gaps = 49/480 (10%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+ D + + ++ Q LDHF+ + + ++ QRY +N+ + K P+F+ G E
Sbjct: 39 VRRETDLKTVSDEQWFIQRLDHFSADSREWK---QRYFVNEAFY---KPGGPVFLMIGGE 92
Query: 103 GDIE--WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSST 160
G W T Y A K A+ +EHR+YGKS P +++ N +LSS
Sbjct: 93 GAASPAWMQYGTWLTY--AEKLGAICFMLEHRFYGKSHP---TSDLSTDNLR---FLSSR 144
Query: 161 QALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
QALAD A + + T++ V FGGSY G LAAW RLKYPH+ A+A+SAPI
Sbjct: 145 QALADLAHFRTVIAEARGLTNAKWVAFGGSYPGSLAAWLRLKYPHLVHAAVATSAPI--- 201
Query: 221 DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN 280
V+ + ++ + S C ++K + + E K P + + K F +C
Sbjct: 202 HATVNFPEYLEVVWRSLASEDAACPVLVKNASDTLAELLKDPKTYDNITKDFNLC---SK 258
Query: 281 LAIESWLSTA-FVYTAMTDYPTPSNFLNPLPAFP--------VKEMCKAIDDPKTGNDVF 331
L I++ + +A F+ T ++ + AF +K +C + D G +
Sbjct: 259 LQIQTEMDSAQFLETLAGNFMEVVQYNEDNRAFEGALGTNVTIKVLCGVMRDSSLGAP-Y 317
Query: 332 AKLYGAASVYYNYSGTAKCFDLNGDS-------------DPHGLSEWGWQACTEMIMLTG 378
A+ Y A + + + KC D + D+ G +W +Q CTE
Sbjct: 318 AR-YAAVARLMLDTVSMKCLDSSFDAYVRDMTNSSWDGPAAGGGRQWVYQTCTEFGFFQS 376
Query: 379 GDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNG 438
D+ + F + + + C++ Y + G R +S I+F NG
Sbjct: 377 SDSPNQPFTGFPLRF--QVKQCEQFYNISAELLTAAVAQTNEYYGSYDIR-SSRIVFPNG 433
Query: 439 LRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
DPW G+ ++I++ + A+ + AH ++ + ED L R ++ +W+ Q
Sbjct: 434 AIDPWHALGITQDITQDLPAVFIQGTAHCANMYPARNEDLPQLVLARDHIFFLLQQWLKQ 493
>gi|195353655|ref|XP_002043319.1| GM26836 [Drosophila sechellia]
gi|194127433|gb|EDW49476.1| GM26836 [Drosophila sechellia]
Length = 473
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 121/464 (26%), Positives = 210/464 (45%), Gaps = 54/464 (11%)
Query: 52 LYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN 111
+ +T + Q LDHF+ P +T+Q RY++ND + K+ AP+F+Y G E +I
Sbjct: 44 VVQTLWIEQKLDHFD--PAETRTWQMRYMLNDALY---KSGAPLFIYLGGEWEISSGRIT 98
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
G +YD+A + ALL + EHRYYG+S P +++ +N YLS Q+LAD A I
Sbjct: 99 GGHLYDMAKEHNALLAYTEHRYYGQSKPL---PDLSNENIK---YLSVNQSLADLAHFIN 152
Query: 172 DLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
+K+N ++S V++ GGSY + WF+ YP + G ASSAP+L N V +
Sbjct: 153 TIKQNHEGLSESKVIIVGGSYSATMVTWFKKLYPDLVAGGWASSAPLLAKVNFV---EYK 209
Query: 231 NIITQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLST 289
+ Q + + CYK I+ ++E G E ++ ++C+ + S L
Sbjct: 210 EVTGQSIEQMGGSACYKRIENGIAEMETMIATKRGAE-VKALLKLCEP---FDVYSDLDV 265
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAK 349
+++ ++D + + A ++ +C+ I D G++ + G + SG K
Sbjct: 266 WTLFSEISD--IFAGVVQTHNAGQIEGVCEKIMD---GSNDLIGVAGYLLDVFEESG-GK 319
Query: 350 CFDLNGDS----------DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
C DL+ D+ + + + +W +Q C E ++ F ++
Sbjct: 320 CHDLSYDAITALLLDTNYNGNIMRQWIFQTCNEYGWYQTSGSRAQPF-GTKFPVTYYTTM 378
Query: 400 CKEAYGVDPRPNWITTE-------FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
C + YG D +I+ + FGG L N+ +G DPW G+
Sbjct: 379 CADLYGSDYSNEFISNQVTITNQFFGG------LSPNVENVYLTHGQLDPWRPMGIQDET 432
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
T++ E AH D + D ++ + R E++ +W+
Sbjct: 433 QATIIP----EHAHCKDFNSISSSDTAEMRASKERIAELVREWV 472
>gi|395859218|ref|XP_003801939.1| PREDICTED: thymus-specific serine protease [Otolemur garnettii]
Length = 771
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 114/458 (24%), Positives = 196/458 (42%), Gaps = 58/458 (12%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S D L K + Q+LD FN + ++F QRY +ND HW + + P+F++ G E
Sbjct: 47 LSLGPDDVALPKVGWLEQLLDPFNSS--DRRSFLQRYWVNDQHW--THQDGPVFLHLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP F AL++ +EHR+YG SIP GG + + +LSS A
Sbjct: 103 GSLGPGSVMRGHPAALAPAFGALVISLEHRFYGLSIPAGG------LDVAQLRFLSSRHA 156
Query: 163 LADYASLIIDLKKNLTATD-SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---IL 218
LAD S + L + + SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L
Sbjct: 157 LADVVSARLSLSRLFNVSSLSPWICFGGSYAGSLAAWARLKFPHLIFASIASSAPVRAVL 216
Query: 219 NFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLE-----KLQ 269
+F +++++++ +S+ S C + ++ ++E G + +L
Sbjct: 217 DFSE------YNDVVSRSLKSIAIGGSLECRAAVSAAFAEVERRLSAGGAVRAALRAELG 270
Query: 270 KAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND 329
+ +SE + L D PL + + D ++
Sbjct: 271 ACGSLSRSEDQAELLGELQALVGGAVQYD----GQIGAPLSVRQLCGLLLGGGDNRSHPT 326
Query: 330 VFAKLYGAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQACTEMIMLT 377
+ L A + G KC + ++P G +W +Q CTE
Sbjct: 327 PYGGLRRAVQIVMQSLGQ-KCLSFSRAETVAQLKATEPQVSGIGDRQWLYQTCTEFGFYV 385
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASN 432
++ F + + C++ +G+ + +GG G A+
Sbjct: 386 TCEDPMCPFSQLPA-LPSHLDLCEQVFGLSASSVAQAVAQTNSYYGGQTPG------ATQ 438
Query: 433 IIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
++F NG DPW V + + + AL+ +H +D+
Sbjct: 439 VLFVNGDTDPWHVLSVTQALGSSQSALLIPNASHCLDM 476
>gi|340376307|ref|XP_003386675.1| PREDICTED: putative serine protease K12H4.7-like [Amphimedon
queenslandica]
Length = 486
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 146/520 (28%), Positives = 222/520 (42%), Gaps = 60/520 (11%)
Query: 2 ATRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQI 61
A I L F L FS +L PTF + + SS+D + L K + TQ
Sbjct: 3 AETLIILLFFLSFSHSL--------PTFLRHKWLQREKYFRHLSSQDLK-LPKDLWFTQS 53
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMYDVA 119
DHF T+QQRY +ND+ W K N P+F+ G EG D +W + G M +A
Sbjct: 54 RDHFR--EVDTTTWQQRYWVNDSFW--DKENGPVFLMIGGEGEADPKWVVE--GEMMVLA 107
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
K+ AL +EHR+YGKS P G + + Y L+S QAL D A +++
Sbjct: 108 EKYHALAFQLEHRFYGKSQP-GADLSMDY-----ITLLNSRQALEDLAYFRMNMTTKYNM 161
Query: 180 TDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI-LNFDNIVSPYSFSNIITQDF 237
TD+ + FGGSY G LAAW R+KYP + G++ASSAPI FD Y + +++
Sbjct: 162 TDANRWIAFGGSYPGALAAWLRMKYPDIVYGSIASSAPIQAKFDF----YEYFEVVSASL 217
Query: 238 RSVSEN--CYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC------KSEKNLAIESWLST 289
C I ++ + + L K F + + K LA +S
Sbjct: 218 EQARNGPACVAAINAAFSLLNKLILDNSKWASLDKMFNLTVGLTTPEDGKMLA----MSL 273
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN--DVFAKLYGAASVYYN---Y 344
A ++ + Y S + + +C + + D A L + +++ N +
Sbjct: 274 AGIFAGIVQYNNDS---RGSAVYNMSLLCDVMTNDSISEPIDRLAHLSRSPALFDNDVSF 330
Query: 345 SGTAKCFDLNGDSDP-HGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEA 403
K F N DP G +W +Q C D+ F + D C +
Sbjct: 331 QDYVK-FLSNVTIDPAQGDRQWTYQTCDSFGYFQTADSPHQPF-GTLFDTALYTLICNQV 388
Query: 404 YGVDPRPNWI----TTEFGGHKIGLVLKRFA-SNIIFFNGLRDPWSGGGVLKNISKTVLA 458
+G + + T EF G GL + A +NI+F NG DPW V N+S++V A
Sbjct: 389 FGFTEKDIPVNVNNTNEFYG---GLNFNKTAVTNIVFPNGSIDPWHSLSVTANVSESVTA 445
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ AH ++ + D L R+ I+G W+++
Sbjct: 446 VFINGTAHCANMYPANPYDKPQLVAARKEIDAILGSWLAK 485
>gi|24659664|ref|NP_648067.2| CG9953 [Drosophila melanogaster]
gi|7295307|gb|AAF50628.1| CG9953 [Drosophila melanogaster]
Length = 508
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 120/464 (25%), Positives = 203/464 (43%), Gaps = 45/464 (9%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI--EWFAQNTGFMYD 117
Q LDHF + + +T+QQRY +N + + ++AP+F+ G EG+ +W + Y
Sbjct: 61 QRLDHFKSSDK--RTWQQRYFVNADFYR-NDSSAPVFLMIGGEGEASAKWMREGAWVHY- 116
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
A F AL + +EHR+YGKS P +++ +N YLSS QAL D AS + +K
Sbjct: 117 -AEHFGALCLQLEHRFYGKSHP---TADLSTENLH---YLSSEQALEDLASFVTAMKVKF 169
Query: 178 TATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
D + FGGSY G LAAW R KYP + G+++SS P+L V + ++
Sbjct: 170 NLGDGQKWIAFGGSYPGSLAAWAREKYPELIYGSISSSGPLL---AEVDFKEYFEVVKAS 226
Query: 237 FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAM 296
+ C + S+ Q+E K G L + F+ C K+ +IE+ L A + +
Sbjct: 227 LAAYKPECVDAVTRSFAQVEILLKHMIGQRSLDEKFKTCTPIKD-SIENDLDMANFFENL 285
Query: 297 TDYPTPSNFL---------NPLPAFPVKEMCKAIDDPKTG---------NDVFAKLYGAA 338
NF +P + ++C + + G ND+ K
Sbjct: 286 A-----GNFAGVVQYNKDNSPHATITIDDICDVMLNTTAGPPVTRLGLVNDMLLKESNTT 340
Query: 339 SVYYNYSG-TAKCFDLNGDSD-PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR 396
+ Y Y A +++ DS+ G+ +W +Q C E DN F + D
Sbjct: 341 CLDYKYDKMVADMKNVSWDSETAKGMRQWTYQTCHEFGFYQTSDNPADTFGD-RFGVDFF 399
Query: 397 ARYCKEAYGVDPRPNWITTEFGGHKIGL-VLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
R C + + + ++ LK +N+++ +G DPW G++K+ +
Sbjct: 400 IRQCMDVFSKNMNAKFLKLVVSATNDNYGALKPKTTNVLYVHGSIDPWHALGLVKSTNAA 459
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+ + + AH ++ K DP L R + ++ + K + Y
Sbjct: 460 LPTIYIEGTAHCANMYEPVKTDPPQLVAARNKILKFLAKLLDGY 503
>gi|162287351|ref|NP_001069798.2| thymus-specific serine protease precursor [Bos taurus]
gi|296474038|tpg|DAA16153.1| TPA: protease, serine, 16 [Bos taurus]
Length = 516
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 124/490 (25%), Positives = 212/490 (43%), Gaps = 76/490 (15%)
Query: 48 DSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW 107
DS L K + Q LD FN + + ++F QRY +ND HW + + P+F++ G EG +
Sbjct: 50 DSVTLPKEGWLEQPLDPFNASDR--RSFLQRYWVNDQHW--TSQDGPVFLHLGGEGSLGP 105
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
+ G ++AP + AL++ +EHR+YG SIP G + +LSS ALAD A
Sbjct: 106 GSVMRGHPANLAPIWGALVISLEHRFYGLSIPAEGLDMAQLR------FLSSRHALADAA 159
Query: 168 SLIIDLKK--NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---ILNF-- 220
S + L + N+++T SP + FGGSY G LAAW RLK+PH+ ++ASSAP IL+F
Sbjct: 160 SARLTLSRLFNVSST-SPWICFGGSYAGSLAAWARLKFPHLFFASIASSAPVRAILDFSK 218
Query: 221 -DNIVSPYSFSNIITQDFRSVS---------ENCYKVIKGSWKQIEETAKKPGGLEKLQK 270
+++VS + I + E + +G+W + G LE+ +
Sbjct: 219 YNDVVSRSLMNTAIGGSLECRAAASAAFAEVERRLRASRGAWATLSVELGACGSLERAED 278
Query: 271 AFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK----- 325
++E A+++ + A Y P V+++C+ + +
Sbjct: 279 -----QAELLGALQALVGGAVQYDGQAGAP-----------LSVRQLCRLLLGDRGNCKG 322
Query: 326 --TGNDVFAKLYGAASVYYNYSGTAKCFDL------------NGDSDPHGLSEWGWQACT 371
+G+ + L A V + G +C + G +W +Q CT
Sbjct: 323 NCSGSAPYRGLRRAVQVVTHSLGQ-RCLSIPRAETVAQLRVTELQVSSVGDRQWLYQTCT 381
Query: 372 EMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIGLVL 426
E + F + + C++ +G V + +GG G
Sbjct: 382 EFGYYVTCEVPGCPFSQLPA-LPSELELCEQVFGLSTSSVAQAVAQTNSYYGGQTPG--- 437
Query: 427 KRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRR 486
A+ ++F NG DPW V + + + AL+ +H +D+ D L+ R+
Sbjct: 438 ---ATQVLFINGDTDPWHVLSVTQPLGSSEPALLIPSASHCLDMAPERPSDSPSLRLARQ 494
Query: 487 REVEIIGKWI 496
+ + + W+
Sbjct: 495 KVSQQLQTWL 504
>gi|195338045|ref|XP_002035636.1| GM14810 [Drosophila sechellia]
gi|194128729|gb|EDW50772.1| GM14810 [Drosophila sechellia]
Length = 508
Score = 137 bits (344), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 120/462 (25%), Positives = 199/462 (43%), Gaps = 41/462 (8%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHF + + +T+QQRY +N + + ++AP+F+ G EG+ G A
Sbjct: 61 QRLDHFKSSDK--RTWQQRYFVNADFYR-NDSSAPVFLMIGGEGEASAKWMREGAWVHYA 117
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
F AL + +EHR+YGKS P +++ +N YLSS QAL D AS + +K
Sbjct: 118 EHFGALCLQLEHRFYGKSHP---TADLSTENLH---YLSSEQALEDLASFVTAMKVKFNL 171
Query: 180 TDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
D + FGGSY G LAAW R KYP + G+++SS P+L V + ++
Sbjct: 172 GDGQKWIAFGGSYPGSLAAWAREKYPQLIYGSISSSGPLL---AEVDFKEYFEVVKASLA 228
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTD 298
S C + S+ Q+E K G L + F+ C K+ +IE+ L A + +
Sbjct: 229 SYKPECLDAVTRSFAQVEILLKHMIGQRSLDEKFKTCTPIKD-SIENDLDMANFFENLA- 286
Query: 299 YPTPSNFL---------NPLPAFPVKEMCKAIDDPKTG---------NDVFAKLYGAASV 340
NF +P + ++C + + G ND+ K +
Sbjct: 287 ----GNFAGVVQYNKDNSPHAKITIDDICDVMLNTTAGPPVTRLGLVNDMLLKESNTTCL 342
Query: 341 YYNYSG-TAKCFDLNGDSD-PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
Y Y A +++ DS+ G+ +W +Q C E D F + D R
Sbjct: 343 DYKYDKMVADMKNVSWDSETAKGMRQWTYQTCHEFGFYQTSDKPADTFGD-RFGVDFFIR 401
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGL-VLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL 457
C + + + ++ LK +N+++ +G DPW G++K+ S +
Sbjct: 402 QCMDVFSKNMNAKFLKLVVSATNDNYGALKPRTTNVLYVHGSIDPWHALGLVKSTSPALP 461
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+ + AH ++ K DP L R + ++ + K + Y
Sbjct: 462 TIYIEGTAHCANMYEPVKTDPPQLVAARNKILKFLAKLLDGY 503
>gi|24648175|ref|NP_650802.1| CG3734 [Drosophila melanogaster]
gi|19527569|gb|AAL89899.1| RE36938p [Drosophila melanogaster]
gi|23171716|gb|AAF55662.2| CG3734 [Drosophila melanogaster]
gi|220948338|gb|ACL86712.1| CG3734-PA [synthetic construct]
Length = 473
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 119/473 (25%), Positives = 217/473 (45%), Gaps = 54/473 (11%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+ ++++ + +T + Q LDHF+ P+ +T+Q RY++ND + ++ AP+F+Y G E
Sbjct: 35 LPTTQNRADVVQTLWIEQKLDHFD--PEETRTWQMRYMLNDALY---QSGAPLFIYLGGE 89
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
+I G +YD+A + ALL + EHRYYG+S P +++ +N YL+ Q+
Sbjct: 90 WEISSGRITGGHLYDMAKEHNALLAYTEHRYYGQSKPL---PDLSNENIK---YLNVNQS 143
Query: 163 LADYASLIIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFD 221
LAD A I +K+N +DS V++ GGSY + WF+ YP + G ASSAP+L
Sbjct: 144 LADLAYFINTIKQNHEGLSDSKVIIVGGSYSATMVTWFKKLYPDLVAGGWASSAPLLAKV 203
Query: 222 NIVSPYSFSNIITQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN 280
N V + I Q + + CYK I+ ++E G E ++ ++C+
Sbjct: 204 NFV---EYKEITGQSIEQMGGSACYKRIENGIAEMETMIATKRGAE-VKALLKLCEP--- 256
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV 340
+ S L +++ ++D + + A ++ +C+ I G++ + G
Sbjct: 257 FDVYSDLDVWTLFSEISD--IFAGVVQTHNAGQIEGVCEKI---MAGSNDLIGVAGYLLD 311
Query: 341 YYNYSGTAKCFDLNGDS----------DPHGLSEWGWQACTEM-IMLTGGDNKDSIFEES 389
+ SG KC+DL+ D+ + + + +W +Q C E T G + +
Sbjct: 312 VFEESG-GKCYDLSYDAITALLLDTNYNGNIMRQWIFQTCNEYGWYQTSGSSAQPFGTKF 370
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFA------SNIIFFNGLRDPW 443
Y C + YG + + EF +++ + + F N+ +G DPW
Sbjct: 371 PVTY--YTTMCADLYGSE-----YSNEFISNQVSITNQFFGGLFPNVENVYLTHGQLDPW 423
Query: 444 SGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
G+ T++ E AH D + D ++ + R E++ +W+
Sbjct: 424 RAMGIQDETQATIIP----EHAHCKDFNSISSSDTAEMRASKERIAELVREWV 472
>gi|194865618|ref|XP_001971519.1| GG14395 [Drosophila erecta]
gi|190653302|gb|EDV50545.1| GG14395 [Drosophila erecta]
Length = 508
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 129/487 (26%), Positives = 211/487 (43%), Gaps = 48/487 (9%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P K+ +L S+ S+ L+ Q LDHF +T+QQRY +N + + ++AP+
Sbjct: 42 PSKIPTL-QGSQHSEDLW----FEQRLDHFK--SSDVRTWQQRYFVNADFYR-NDSSAPV 93
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
F+ G EG+ G A F AL + +EHR+YGKS P +++ +N
Sbjct: 94 FLMIGGEGEASAKWMREGAWVHYAEHFGALCLQLEHRFYGKSHP---TADLSTENLH--- 147
Query: 156 YLSSTQALADYASLIIDLKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
YLSS QAL D AS + +K D + FGGSY G LAAW R KYP + G+++SS
Sbjct: 148 YLSSEQALEDLASFVTAMKVKFNLADGQKWIAFGGSYPGSLAAWAREKYPQLIYGSISSS 207
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
P+L V + ++ + C + + S+ Q+E K G L + F+
Sbjct: 208 GPLL---AEVDFKEYFEVVKASLAAYKPECVEAVTRSFAQVEILLKHMIGQRSLDEKFKT 264
Query: 275 CKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL---------NPLPAFPVKEMCKAIDDPK 325
C K+ +IE+ L A + + NF +P + ++C + +
Sbjct: 265 CTPIKD-SIENDLDMANFFENLA-----GNFAGVVQYNKDNSPHATITIDDICDVMLNTT 318
Query: 326 TG---------NDVFAKLYGAASVYYNYSG-TAKCFDLNGDSD-PHGLSEWGWQACTEMI 374
G ND+ K + Y Y A +++ DS+ G+ +W +Q C E
Sbjct: 319 AGPPVTRLGLVNDMLLKEANTTCLDYKYEKMVADMKNVSWDSETAKGMRQWTYQTCHEFG 378
Query: 375 MLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGL-VLKRFASNI 433
N F + D R C + + + ++ G LK +N+
Sbjct: 379 FYQTSANPTDTFGD-RFGVDFFIRQCMDVFSKNMDAKFLQLVVSGTNDNYGALKPKTTNV 437
Query: 434 IFFNGLRDPWSGGGVLKNISKTVLALVEKEG-AHHVDLRFSTKEDPQWLKDVRRREVEII 492
++ +G DPW G++K+ S L + EG AH ++ K DP L R ++ +
Sbjct: 438 LYVHGSIDPWHALGLVKS-SNPALPTIYIEGTAHCANMYEPVKTDPPQLVAARNNILKFL 496
Query: 493 GKWISQY 499
K ++ Y
Sbjct: 497 AKLLNGY 503
>gi|320168054|gb|EFW44953.1| thymus-specific serine protease [Capsaspora owczarzaki ATCC 30864]
Length = 489
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 133/478 (27%), Positives = 205/478 (42%), Gaps = 59/478 (12%)
Query: 45 SSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD 104
+ + S G ++ TQ LDHFN ++ T+ Q+Y +N T WGG PIF G EG
Sbjct: 45 AGRRSLGDVPKQWFTQKLDHFNTFDET--TWLQKYYVNQTFWGGP--GYPIFFMIGGEGP 100
Query: 105 IEWFAQNTGFMYDV-APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
I+ T Y + A +KAL+V +EHR+YG+S+P A + + +L+S QAL
Sbjct: 101 ID-DRYVTAMDYVIYARTYKALMVTLEHRFYGESVP------TADYSVANLRFLTSQQAL 153
Query: 164 ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
AD A+ ++ A S V FGGSY G L+AW RLKYP++ G++++S P+ N
Sbjct: 154 ADAANFAANITLQFNAPTSSWVTFGGSYPGCLSAWARLKYPNLFQGSISTSGPVHAELNF 213
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKN 280
V + F + C +I + +I+ ++PGGL + K F +C S+ +
Sbjct: 214 VQYLEVVQASLEYFGGTT--CSSLITQATNKIQTLLQQPGGLSSVGKLFSVCVPLNSQDD 271
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFP-VKEMCKAI---DDPKTGNDVFAKLYG 336
+A F+ T + + N P + ++C + DP +L+
Sbjct: 272 VA-------NFMSTLAGNVMGVVQYNNEGRGGPTITDVCATMLSNSDPLQAYVNLNQLFL 324
Query: 337 AASVYYNYSGTAKCFDLN--------GDSDPH---GLSEWGWQACTEMIMLTGGDNKDSI 385
A SG C D+ D+ P G W WQ C E D+ S
Sbjct: 325 A-------SGNVTCLDVAYKPMIQQLQDTAPGADVGGRSWLWQTCQEFGFYQTTDSTFSH 377
Query: 386 FEESEEDYDARARYCKEAYGVD-------PRPNWITTEFGGHKIGLVLKRFASNIIFFNG 438
+ + C + +G + R NW +G L SN F +G
Sbjct: 378 VFGNLFPLSFSLQMCNDVFGTNYQQADFQKRLNWTNDYYGS------LDFSGSNTFFIHG 431
Query: 439 LRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
DPW G+ + S A + AH ++R DP L R + E IGK++
Sbjct: 432 SIDPWHALGIYQITSPVNSATLITGTAHCANMRMPLPSDPPQLVAARLQTSETIGKFL 489
>gi|308491795|ref|XP_003108088.1| hypothetical protein CRE_10077 [Caenorhabditis remanei]
gi|308248936|gb|EFO92888.1| hypothetical protein CRE_10077 [Caenorhabditis remanei]
Length = 794
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 133/496 (26%), Positives = 215/496 (43%), Gaps = 58/496 (11%)
Query: 42 LISSSKDSQG-LYKTKYHT-----QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
L++S+ D +G + + KY Q++DHF+ + T++QRY N + +K +
Sbjct: 32 LVASASDPEGPVSEDKYMVYSEIDQVVDHFSNTTSA--TWRQRYQYNSKFY--NKTVGYV 87
Query: 96 FVYTGNEGDI------EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
F+ G EG I +W M A +F A +EHR+YG G I +
Sbjct: 88 FLMLGGEGSINATNGDKWVRHEAETMMVWAAEFGAGAFQVEHRFYGSK----GFSPIGDQ 143
Query: 150 NASTTGYLSSTQALADYASLIIDLKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAI 208
+ L+ QALAD I + D P+ + FGGSY G L+AWFR YP +
Sbjct: 144 TTESLKLLTIDQALADIKEFINQMNALYFPLDKPIWITFGGSYPGSLSAWFRETYPEMTA 203
Query: 209 GALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETA-KKPGGLEK 267
GA++SS+ + F V Y ++ + +R+VS++C VIK +++Q+++ A P E
Sbjct: 204 GAVSSSSAVHVF---VDYYGYAINTEKTYRTVSDSCGDVIKTAFQQMQKKAYNGPDSREL 260
Query: 268 LQKAFRICKS--EKNL--AIESWLSTAFVYTAMTDYPTPSNFLNPL-PAFPVKEMCKAID 322
L+K F +C S E NL +I+ + + Y + T N N V C ++
Sbjct: 261 LKKTFNLCDSFDENNLSKSIQFFFQNVYGYFQGINQYTGDNRNNATRSGLGVPGACNILN 320
Query: 323 DPKTGNDVFAKLYGAASVYYNYSGTAKCF-------------DLNGDSDPHGLSEWGWQA 369
+ G+++ + A +Y+ G+ C D D W WQ
Sbjct: 321 NATLGDEITRVV--AVMDWYDSWGSPGCRPNSYTSFIKYYSDTTMPDDDRISTRSWIWQT 378
Query: 370 CTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD-------PRPNWITTEFGGHKI 422
CTE+ D + S D A C + +G + + + T++GG
Sbjct: 379 CTELGYYQTTDGGNGGIFGSTVPLDFFADQCIDLFGPEYTLDNTFKLIDQVRTKYGGAD- 437
Query: 423 GLVLKRFASNIIFFNGLRDPWSG-GGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWL 481
+N+ F NG DPW G N + V + + AH D+ + D Q L
Sbjct: 438 ----AYRGTNVCFPNGSFDPWQDLGHKATNTNNNVDSWLIDGTAHCADMYPARDSDKQSL 493
Query: 482 KDVRRREVEIIGKWIS 497
KD RRR + + +W+S
Sbjct: 494 KDARRRIHDQLSRWLS 509
>gi|170042651|ref|XP_001849031.1| thymus-specific serine protease [Culex quinquefasciatus]
gi|167866158|gb|EDS29541.1| thymus-specific serine protease [Culex quinquefasciatus]
Length = 499
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 124/484 (25%), Positives = 214/484 (44%), Gaps = 61/484 (12%)
Query: 47 KDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE 106
++ G KT + Q+LDH +P + T++QRY +ND ++ + + P+F+ G EG+
Sbjct: 42 REVSGDVKTLWFDQLLDH--NDPTNAATWKQRYYVNDAYFD-DRTSGPVFLMIGGEGEAT 98
Query: 107 WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADY 166
N G A + AL +EHR+YGKS P G + + GYL+S QALAD
Sbjct: 99 ARWMNEGAWIRYAKEHGALCFQLEHRFYGKSHPTGD------LSTANLGYLTSEQALADL 152
Query: 167 ASLIIDLKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
A + + + T + FGGSY G LAAW R KYP++ G+++SS P+L +
Sbjct: 153 AYFVEAMNEKYQLTQQNRWIAFGGSYPGSLAAWLREKYPYLVHGSVSSSGPLL---AKID 209
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES 285
+ ++ +S S C + ++ + Q+E K G + + F++C + +I +
Sbjct: 210 FKEYFQVVVNSLQSYSPQCVEDVRNAMGQVETLLKHMIGQRSINEKFKLCDPVEK-SISN 268
Query: 286 WLSTAFVYTAMTDYPTPSNFL---------NPLPAFPVKEMCKAIDDPKTG--------- 327
L A ++ A+ NF +P + ++C + + G
Sbjct: 269 SLDIASLFEAVA-----GNFAGVVQYNKDNSPHAKITIDQICDVMANQSLGAPVSRLAAV 323
Query: 328 NDVFAKLYGAASVYYNYSGTAKCF-DLNGDSD-PHGLSEWGWQACTEMIMLTGGDNKDSI 385
N++ G + Y Y T K + + +SD +G +W +Q C E D D +
Sbjct: 324 NEMVMTQDGVKCLDYVYDKTVKQMQNTSWESDVANGARQWTYQTCNEFGFYQTSDKPDLV 383
Query: 386 FEESEEDYDARARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNG 438
F + + R C + YG +D T +G L +N+++ +G
Sbjct: 384 FGD-RFPVEFFVRQCTDIYGKKFDDRTLDRAVYRTNTNYGA------LNPSTTNVLYVHG 436
Query: 439 LRDPWSGGGVLK--NISKTVLALVEKEGAHHVDLRFSTKED--PQWLKDVRRREVEIIGK 494
DPW G+ + +I+ + + EG H + K+D PQ LK+ R + I +
Sbjct: 437 SIDPWHRLGLTESNDINTPTIFI---EGTAHCANMYEPKDDDFPQ-LKNARVQINSFIER 492
Query: 495 WISQ 498
+ Q
Sbjct: 493 LLQQ 496
>gi|225718928|gb|ACO15310.1| serine protease K12H4.7 precursor [Caligus clemensi]
Length = 485
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 129/485 (26%), Positives = 212/485 (43%), Gaps = 58/485 (11%)
Query: 37 EKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIF 96
+KLSS + S D G++ Y+ Q LDHFN ++ +T+ QRY +N + G++ AP+F
Sbjct: 33 KKLSSPTALS-DRSGIF-ISYYNQTLDHFN--EENKKTWNQRYFVNTEFFNGTET-APVF 87
Query: 97 VYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG- 155
+ G EG G Y A + AL++ +EHR+YG S P +N ST
Sbjct: 88 LLIGGEGTASDSWMKYGAWYGYAKEVGALMIQLEHRFYGSSRPT--------ENMSTENL 139
Query: 156 -YLSSTQALADYASLIIDLKKNLTATDS-PVVVFGGSYGGMLAAWFRLKYPHVAIGALAS 213
+L+S QAL D I K+ + ++ V FGGSY G L+ W R YP + GAL+S
Sbjct: 140 KFLTSQQALEDIVEFIRFAKQQYSLNETNKWVTFGGSYPGSLSLWMRSLYPELISGALSS 199
Query: 214 SAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEE-TAKKPGGLEKLQKAF 272
SAP+ + V + ++ D C +K + +QI+ P G +K+ K F
Sbjct: 200 SAPV---EVKVDFEEYLGVVENDMNIRDPKCVPEVKKAIQQIQALIVSAPDGWKKVAKIF 256
Query: 273 RICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA 332
+C I+ S F + + + + + + + L + MC + + G+
Sbjct: 257 SLCDGWSGDNIQDLRS--FYASVLGAFYSSAQYDSVLNNDDLAHMCPYMSNEYFGDSSLE 314
Query: 333 KLYGAASVYYNYSGTAKCFDLN-------------GDSDPHGLSEWGWQACTEMIMLTGG 379
+L A+++ Y G+ C ++N + G +W +Q C E G
Sbjct: 315 RL--ASTLKGKYGGS--CLNVNYKDLLDFMTTEEWAHGEDVGYRQWVYQTCNEFGWYQTG 370
Query: 380 DNKDSIFEESEEDYDARARYCKEAYGVD-------PRPNWITTEFGGHKIGLVLKRFASN 432
+ S F + + C++ YG+D N+ +G L SN
Sbjct: 371 NIWGSFFP-----VEFYTQQCRDVYGMDFTDEIIASNANYTNIMYGSKNPPL------SN 419
Query: 433 IIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTK-EDPQWLKDVRRREVEI 491
I +G DPW G+L+++S++V + +H DL D + L R E
Sbjct: 420 TIITHGSFDPWHPMGILEDMSESVKTFIINGTSHCYDLYPPNPLSDSEELTRARNITFEH 479
Query: 492 IGKWI 496
I +WI
Sbjct: 480 IKRWI 484
>gi|340500169|gb|EGR27066.1| hypothetical protein IMG5_202470 [Ichthyophthirius multifiliis]
Length = 2048
Score = 136 bits (343), Expect = 2e-29, Method: Composition-based stats.
Identities = 132/458 (28%), Positives = 210/458 (45%), Gaps = 61/458 (13%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
L+HF+ P ++QRY I D ++ + N +F+ G EG + G++ +A +
Sbjct: 1609 LNHFD--PLGLIKWKQRYTIYDEYF--NPENGTVFISIGGEGQMAGITNGRGWLIQLAQE 1664
Query: 122 FKALLVFIEHRYYGKSIPYG-GNKEIAYKNASTTGYLSSTQALADYASLIIDLKK---NL 177
F A+++ +EHR+YG S P+G N+ + +N YL+ Q+LAD A+LI +K+ +
Sbjct: 1665 FSAIVISVEHRFYGVSQPFGYTNQSYSLENLQ---YLTVDQSLADLANLISKIKQKKLHK 1721
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF 237
+ +P + GGSY G ++AWFR KYPH+ +GALASSA + N I +DF
Sbjct: 1722 ISEINPFITIGGSYPGAMSAWFRYKYPHLTVGALASSAVV-------------NAI-EDF 1767
Query: 238 RSVSENCYKVIK--GSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTA 295
+ Y K G+W L+ Q F I K+ N L F T
Sbjct: 1768 QMYDYQVYLSTKKSGNWCP----------LKIQQFNFYIEKTLNNTLYAIKLRQKFNATM 1817
Query: 296 MTDYPTPSNFLN-------PLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV---YYNYS 345
+T+ S FLN L + + +I T +L ++V NYS
Sbjct: 1818 LTN----SEFLNFFADLYSGLVQYGQRTFLCSIFQNTTIEQQINRLADYSAVNQTAINYS 1873
Query: 346 GTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
T F+ D + +W +Q CT N+ + S+ + C++ +G
Sbjct: 1874 -TKTLFNTTYDPN-QAQRQWTFQTCTYFGFFQTA-NQINPMRSSKVNLRFFEDQCRQVFG 1930
Query: 406 VDPRPNW-ITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
+ P+ IT + G GL L+ A+NI+F NG D W + ++ V + + +
Sbjct: 1931 QNYVPDISITNSYLG---GLNLE--ATNIVFTNGSEDGWKWASLTQSKGSMVSLISDCDN 1985
Query: 465 -AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
AH V+L EDP LK+ RR + +WI Q+FQ
Sbjct: 1986 CAHGVELGVPKSEDPDNLKNTRRIVKILFKQWIDQHFQ 2023
>gi|224058955|ref|XP_002299662.1| predicted protein [Populus trichocarpa]
gi|222846920|gb|EEE84467.1| predicted protein [Populus trichocarpa]
Length = 153
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 68/134 (50%), Positives = 90/134 (67%), Gaps = 3/134 (2%)
Query: 373 MIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASN 432
+I L GDN S+FE D CK YGV PRP+W+TT +GGH I L+L+RF SN
Sbjct: 2 VIPLGVGDN--SMFEPDPFDLKDYIERCKSLYGVRPRPHWVTTYYGGHDIKLILQRFGSN 59
Query: 433 IIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL-RFSTKEDPQWLKDVRRREVEI 491
IIF NGLRDP+S GGVL NIS +++A+ G+H +D+ R + DP WL R++EVEI
Sbjct: 60 IIFSNGLRDPYSSGGVLNNISDSIVAVTTVNGSHCLDIQRANPSTDPDWLVMQRKKEVEI 119
Query: 492 IGKWISQYFQDLAQ 505
I WI+QY++DL +
Sbjct: 120 IEGWITQYYEDLYE 133
>gi|195588332|ref|XP_002083912.1| GD13982 [Drosophila simulans]
gi|194195921|gb|EDX09497.1| GD13982 [Drosophila simulans]
Length = 508
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 119/462 (25%), Positives = 200/462 (43%), Gaps = 41/462 (8%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHF + + +T+QQRY +N + + ++AP+F+ G EG+ G A
Sbjct: 61 QRLDHFKSSDK--RTWQQRYFVNADFYR-NDSSAPVFLMIGGEGEASAKWMREGAWVHYA 117
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
F AL + +EHR+YGKS P +++ +N YLSS QAL D AS + +K
Sbjct: 118 EHFGALCLQLEHRFYGKSHP---TADLSTENLH---YLSSEQALEDLASFVTAMKVKFNL 171
Query: 180 TDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
D + FGGSY G LAAW R KYP + G+++SS P+L V + ++
Sbjct: 172 GDGQKWIAFGGSYPGSLAAWAREKYPQLIYGSISSSGPLL---AEVDFKEYFEVVKASLV 228
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTD 298
S C + S+ Q+E K G L + F+ C K+ +IE+ L + + +
Sbjct: 229 SYKPECLDAVTRSFAQVEILLKHMIGQRSLDEKFKTCTPIKD-SIENDLDMSNFFENLA- 286
Query: 299 YPTPSNFL---------NPLPAFPVKEMCKAIDDPKTG---------NDVFAKLYGAASV 340
NF +P + + ++C + + G ND+ K +
Sbjct: 287 ----GNFAGVVQYNKDNSPHASITIDDICDVMLNTTAGPPVTRLGLVNDMLLKESNTTCL 342
Query: 341 YYNYSG-TAKCFDLNGDSDP-HGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
Y Y A +++ DS+ G+ +W +Q C E D F + D R
Sbjct: 343 DYKYDKMVADMKNVSWDSETGKGMRQWTYQTCHEFGFYQTSDKPADTFGD-RFGVDFFIR 401
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGL-VLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL 457
C + + + ++ LK +N+++ +G DPW G++K+ S +
Sbjct: 402 QCMDVFSKNMNAKFLKLVVSATNDNYGALKPKTTNVLYVHGSIDPWHALGLVKSTSPALP 461
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+ + AH ++ K DP L R + ++ + K + Y
Sbjct: 462 TIYIEGTAHCANMYEPVKTDPPQLVAARNKILKFLAKLLDGY 503
>gi|403308857|ref|XP_003944858.1| PREDICTED: thymus-specific serine protease isoform 1 [Saimiri
boliviensis boliviensis]
Length = 512
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 116/469 (24%), Positives = 204/469 (43%), Gaps = 50/469 (10%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + Q+L+ FN + + ++F QRY +ND HW G + PIF++ G EG + + G
Sbjct: 56 KVGWLEQLLNPFNVSDR--RSFLQRYWVNDQHWTG--QDGPIFLHLGGEGSLGPGSVMKG 111
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+AP + AL++ +EHR+YG SIP GG + + +LSS ALAD S + L
Sbjct: 112 HPAALAPAWGALVISLEHRFYGLSIPAGG------LDMAQLRFLSSRHALADVVSARLAL 165
Query: 174 KKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---ILNFDNIVSPYSF 229
+ ++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L+F +
Sbjct: 166 SRLFNVSSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDFSE------Y 219
Query: 230 SNIITQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLEKLQKA-FRICKSEKNLAIE 284
++++++ +S S C + ++ ++E + G +A C S +
Sbjct: 220 NDVVSRSLKSAAIGGSLECQAAVSTAFSEVERRLRAGGAARAALQAELNACGSLSRAEDQ 279
Query: 285 SWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNY 344
+ L A PL + + ++ + + L A + +
Sbjct: 280 AELLGALQALVGGAVQYDGQAGAPLSVRQLCGLLLGGGANRSRSTPYCGLRRAVQIVMHS 339
Query: 345 SGTAKCFDLN--------GDSDPH----GLSEWGWQACTEMIMLTGGDNKDSIFEESEED 392
G KC + ++P G +W +Q CTE +N F +
Sbjct: 340 LGQ-KCLSFSRAETVAQLRSTEPQVSGVGDRQWLYQTCTEFGFYVTCENPRCPFSQLPA- 397
Query: 393 YDARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
++ C++ +G+ P + +GG G A+ ++F NG DPW
Sbjct: 398 LPSQLDLCEQVFGLSPLSVAQAVAQTNSYYGGQTPG------ANQVLFVNGDTDPWHVLS 451
Query: 448 VLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
V + + + L+ G+H +D+ D L+ R+ + + W+
Sbjct: 452 VTQALGSSESVLLIPSGSHCLDMAPERPSDSPSLRLGRQSIFQQLQTWL 500
>gi|363814290|ref|NP_001242784.1| uncharacterized protein LOC100805858 precursor [Glycine max]
gi|255635884|gb|ACU18289.1| unknown [Glycine max]
Length = 488
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 130/453 (28%), Positives = 202/453 (44%), Gaps = 69/453 (15%)
Query: 12 LLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSK--DSQGLYKTKYHTQILDHFNYNP 69
++ ++ L++ FP + P L + +S K ++Q L+ Q LDHF+ P
Sbjct: 7 MITTALLSLLFVSSFPPLSYGVVPPRTLLNKLSEGKYLNTQELW----FDQTLDHFS--P 60
Query: 70 QSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFI 129
++ F+QRY ++ + PIF+ G EG + A + ++ +A KF A +V +
Sbjct: 61 YDHRQFRQRYYEFLDYF--RIPDGPIFLVIGGEGILNGVAND--YLAVLAKKFGAAMVTL 116
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IIDLKKNLTATDSP 183
EHRYYGKS P+ + K YLSS QAL+D A I+ K N ++P
Sbjct: 117 EHRYYGKSTPFNSLETENLK------YLSSKQALSDLAVFRQYYQDSINAKLNRAKIENP 170
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN 243
+FGGSY G L+AWFRLK+PH+ G+LASSA +L + Y+++ Q S
Sbjct: 171 WFIFGGSYSGALSAWFRLKFPHLTCGSLASSAVVL------AVYNYTEFDQQIGESAGPE 224
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMT-DYPTP 302
C + ++ + + IE G ++L+ +F E + +L+ A TA+ Y P
Sbjct: 225 CKEALQETTQLIEHKLATSG--KELKASFDAADLEIDGDFFYFLADA---TAIAFQYGNP 279
Query: 303 SNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--NYSGTAKCFD-------- 352
PL ++ K G D+ YY + + +D
Sbjct: 280 DKVCKPL-----------VEAKKAGEDLVDAYAKYVKEYYIGTFGTDVQTYDQKYLKRTA 328
Query: 353 LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNW 412
+N D+ W +Q CTE+ + DSI S+ D CK +G P+
Sbjct: 329 MNEDNSAR---LWWFQVCTEVAYFQVAPSNDSI-RSSKVDIKYHFDLCKNVFGEGIFPDV 384
Query: 413 ITTE--FGGHKIGLVLKRFASNIIFFNGLRDPW 443
T +GG KI S IIF NG +DPW
Sbjct: 385 DATNLYYGGTKIA------GSKIIFTNGSQDPW 411
>gi|17944482|gb|AAL48130.1| RH04336p [Drosophila melanogaster]
Length = 508
Score = 135 bits (340), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 119/464 (25%), Positives = 202/464 (43%), Gaps = 45/464 (9%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI--EWFAQNTGFMYD 117
Q LDHF + + +T+QQRY +N + + ++AP+F+ G EG+ +W + Y
Sbjct: 61 QRLDHFKSSDK--RTWQQRYFVNADFYR-NDSSAPVFLMIGGEGEASAKWMREGAWVHY- 116
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
A F AL + +EHR+YGKS P +++ +N YLSS QAL D AS + +K
Sbjct: 117 -AEHFGALCLRLEHRFYGKSHP---TADLSTENLH---YLSSEQALEDLASFVTAMKVKF 169
Query: 178 TATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
D + FGGSY G LAAW KYP + G+++SS P+L V + ++
Sbjct: 170 NLGDGQKWIAFGGSYPGSLAAWAHEKYPELIYGSISSSGPLL---AEVDFKEYFEVVKAS 226
Query: 237 FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAM 296
+ C + S+ Q+E K G L + F+ C K+ +IE+ L A + +
Sbjct: 227 LAAYKPECVDAVTRSFAQVEILLKHMIGQRSLDEKFKTCTPIKD-SIENDLDMANFFENL 285
Query: 297 TDYPTPSNFL---------NPLPAFPVKEMCKAIDDPKTG---------NDVFAKLYGAA 338
NF +P + ++C + + G ND+ K
Sbjct: 286 A-----GNFAGVVQYNKDNSPHATITIDDICDVMLNTTAGPPVTRLGLVNDMLLKESNTT 340
Query: 339 SVYYNYSG-TAKCFDLNGDSD-PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR 396
+ Y Y A +++ DS+ G+ +W +Q C E DN F + D
Sbjct: 341 CLDYKYDKMVADMKNVSWDSETAKGMRQWTYQTCHEFGFYQTSDNPADTFGD-RFGVDFF 399
Query: 397 ARYCKEAYGVDPRPNWITTEFGGHKIGL-VLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
R C + + + ++ LK +N+++ +G DPW G++K+ +
Sbjct: 400 IRQCMDVFSKNMNAKFLKLVVSATNDNYGALKPKTTNVLYVHGSIDPWHALGLVKSTNAA 459
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+ + + AH ++ K DP L R + ++ + K + Y
Sbjct: 460 LPTIYIEGTAHCANMYEPVKTDPPQLVAARNKILKFLAKLLDGY 503
>gi|145522514|ref|XP_001447101.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414601|emb|CAK79704.1| unnamed protein product [Paramecium tetraurelia]
Length = 461
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 127/459 (27%), Positives = 204/459 (44%), Gaps = 55/459 (11%)
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN--T 112
T + TQ LDH +P S + F+QRY + D + ++ + I G EW +
Sbjct: 35 TLWFTQKLDH--NDPTSKEVFRQRYHVYDDYVVRNQPESVILYICG-----EWTCDGIGS 87
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
G +D A + KAL++ +EHRY+G+S P+G + YL+ QAL D A I D
Sbjct: 88 GLTFDAAQQLKALVLVLEHRYFGQSQPFGD------WSTPNLKYLNIHQALDDIAYFIQD 141
Query: 173 LK-KNL--TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
+K K L ++P + GGSY G L+AWFR KYPH+ IG LASSA + + + +
Sbjct: 142 VKAKGLFNIKPNTPWIHLGGSYPGALSAWFRYKYPHLTIGGLASSAVV---KAVACYHDY 198
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIE-ETAKKPGGLEKLQKAFRICKSEKNLAIESWLS 288
+ S+ C I+ ++IE + K P ++ KA + IE
Sbjct: 199 DMQVYLSALESSQECVDRIQQVNEKIEADLIKSPNTIKAEFKASELTD------IEFLSM 252
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAA--SVYYNYSG 346
A +Y M S +MC+ ++ T +D F ++ A +V G
Sbjct: 253 IADIYAGMVQGRKRS------------KMCERLEGGATLDDWFKQVKEMALETVDQESYG 300
Query: 347 TAKCFDLNGDSDPHGLSEWGWQACTEM-IMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
+ D++ D + +W +Q C E+ T N + + D C+ +YG
Sbjct: 301 SEFLKDISIDFSKNS-RQWTYQTCIEVGYFQTANPNVEQSTRSQQLKLDFFRNLCEYSYG 359
Query: 406 VDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK---NISKTVLALV 460
+ P+ T FGG I + ++IF NG DPW + K V +
Sbjct: 360 ISIFPDEERTNAYFGGLDINV------DHLIFSNGSDDPWQHASITKWKQGKEYDVKYIK 413
Query: 461 EKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
K+ +H +DL+ + +DP L R+ + I +WI++Y
Sbjct: 414 CKDCSHCIDLKATKADDPPELTQARKEILAIFQQWINEY 452
>gi|312380581|gb|EFR26537.1| hypothetical protein AND_07341 [Anopheles darlingi]
Length = 519
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 126/493 (25%), Positives = 216/493 (43%), Gaps = 54/493 (10%)
Query: 24 KIFPTFPSSRITPEKLSSLISSSKDSQGLY----KTKYHTQILDHFNYNPQSYQTFQQRY 79
+ P PS +T K+ S + Q L+ ++ QILDH +P + T+QQRY
Sbjct: 34 RFHPDAPSRTVTRTKVDG-ASEALAGQQLHIEETPDQWFEQILDH--NDPTNEATWQQRY 90
Query: 80 LINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIP 139
+ND + S ++P+F+ G EG+ + G A AL +EHR+YGKS P
Sbjct: 91 YVNDQFFDSSNPHSPVFLMIGGEGEATARWMHEGAWIHYAETHGALCFQLEHRFYGKSHP 150
Query: 140 YGGNKEIAYKNASTTGYLSSTQALADYASLI--IDLKKNLTATDSPVVVFGGSYGGMLAA 197
+++ KN + YL+S QALAD A I ++ K L + + FGGSY G LAA
Sbjct: 151 ---TTDLSTKNLA---YLTSEQALADLAYFIEAMNEKYQLQPQTNLWIAFGGSYPGSLAA 204
Query: 198 WFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEE 257
W R KYP + G+++SS P+L + + Y + + + S S C + ++ + KQ E
Sbjct: 205 WLREKYPSLVHGSISSSGPLLAKIDFIEYY---DTVVRSLASYSPGCVEAVRSAMKQAET 261
Query: 258 TAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL---------NP 308
K G L F++C + +I++ L A ++ + SNF +P
Sbjct: 262 LLKHMIGQRTLNDKFKLCDPIER-SIDNPLDVASLFEGLA-----SNFAGVVQYNKDNSP 315
Query: 309 LPAFPVKEMCKAIDDPKTG---------NDVFAKLYGAASVYYNYSGTAKCF-DLNGDSD 358
+ E+C + + G N + + + + Y Y + + +++ DS+
Sbjct: 316 HATITIDEVCDVMMNTTIGAPVSRLAEVNRMLLEQGNQSCLDYVYDKSVRQMQNISWDSE 375
Query: 359 -PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR--ARYCKEAYGVDPRPNWITT 415
G +W +Q C E +N ++F + + A R C + YG R
Sbjct: 376 VASGARQWTFQTCNEFGFYQTSNNASAVFGDR---FPAEFFVRQCADIYGA--RFGEAAL 430
Query: 416 EFGGHKIGL---VLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRF 472
G ++ + L +N+++ +G DPW G+ ++ + AH ++
Sbjct: 431 ARGIYRTNVNYGALNPATTNVLYVHGSIDPWHRLGLTESNDLHTPVIFIDGTAHCANMYE 490
Query: 473 STKEDPQWLKDVR 485
+ D LK R
Sbjct: 491 PKESDFPQLKQAR 503
>gi|148700668|gb|EDL32615.1| protease, serine, 16 (thymus), isoform CRA_c [Mus musculus]
Length = 493
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 120/478 (25%), Positives = 204/478 (42%), Gaps = 68/478 (14%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + Q LD FN + + +TF QRY +ND H G + P+F++ G EG + + G
Sbjct: 41 KQGWLEQPLDPFNASDR--RTFLQRYWVNDQHRTG--QDVPVFLHIGGEGSLGPGSVMAG 96
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+AP + AL++ +EHR+YG S+P GG + + YLSS ALAD AS L
Sbjct: 97 HPAALAPAWGALVISLEHRFYGLSMPAGG------LDLALLRYLSSRHALADVASARQAL 150
Query: 174 KKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L ++ SP + FGGSY G LA W RLK+PH+ A+ASSAP+ +V +++ +
Sbjct: 151 SGLLNVSSSSPWICFGGSYAGSLATWARLKFPHLVFAAVASSAPL---SAVVDFSAYNQV 207
Query: 233 ITQDFRSV----SENCYKVIKGSWKQIEETAKK-PGGLEKLQKAFRIC--------KSEK 279
+ + V S C ++ ++E + P L++ C ++E
Sbjct: 208 VARSLTQVAIGGSLECLAAASTAFTEVERLLRAGPAAQAVLREELGACGSLDLTEDQAEL 267
Query: 280 NLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV----FAKLY 335
A+++ + Y P V+++C + K GN + L
Sbjct: 268 LGALQALVGGTVQYDGQAGAP-----------LSVRQLCGLLLG-KWGNRSRSTPYLGLR 315
Query: 336 GAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQACTEMIMLTGGDNKD 383
A + G KC + +++P G +W +Q CTE +
Sbjct: 316 RAVQIVLRSMGQ-KCLSFSRAETVAQLSNTEPQVSGVGDRQWLYQTCTEFGFYVTCEGLQ 374
Query: 384 SIFEESEEDYDARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASNIIFFNG 438
F + + C++ +G+ P + +GG G A+ ++F NG
Sbjct: 375 CPFSQLPA-LPFQLELCEQVFGLSPASVAQAVAQTNSYYGGQSPG------ATQVLFVNG 427
Query: 439 LRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
DPW V +++ + AL+ +H D+ D L+ R++ + + W+
Sbjct: 428 DTDPWHVLSVTQDLGLSEPALLIPSASHCFDMAPMRPSDSPSLRLGRQKISQQLQDWL 485
>gi|289163387|ref|YP_003453525.1| serine carboxypeptidase [Legionella longbeachae NSW150]
gi|288856560|emb|CBJ10365.1| Similar to eukaryotic serine carboxypeptidase S28 family protein
[Legionella longbeachae NSW150]
Length = 466
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 137/509 (26%), Positives = 224/509 (44%), Gaps = 69/509 (13%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDH 64
+FL C ST ++++A I + + E+ + LI+ + Y Q++DH
Sbjct: 9 LLFLGLC----STFSLTHAGIVERY--VQHMQEEKAPLIAEKS-----IQLAYFKQLIDH 57
Query: 65 FNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKA 124
N + TF QRY I++T+ G K+++P+F Y E A N G + + A KF A
Sbjct: 58 ---NNPATGTFSQRYYIDETY--GPKDDSPVFFYICGESACSKRALN-GAIRNYAQKFNA 111
Query: 125 LLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDS-- 182
LV +EHRYYG+S+P + + YL++ AL D A +++LT+ +
Sbjct: 112 KLVALEHRYYGESLPLNS------LSTNDLRYLTTEAALDDLAYF----QRHLTSEKNWH 161
Query: 183 -PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS 241
V FGGSY G L+A++RLKYP++ +GALASSAP++ +N + + +TQ
Sbjct: 162 GKWVAFGGSYPGSLSAYYRLKYPYLVVGALASSAPVMAKENFI---EYDAHVTQ---VAG 215
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPT 301
C ++ Q+E + K +++ F E ++ A A Y
Sbjct: 216 LQCANQMREVVTQVETSLKDTVTFAQIKSLFDAAAVED--PVDFLYLIADTGAAAVQYGM 273
Query: 302 PSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNG--DSDP 359
F + L P P G FAK +Y + + TA G +P
Sbjct: 274 RDAFCSSLSEHPT---------PLEGYAYFAK-----KLYKDMATTAVEMTAQGAMSENP 319
Query: 360 H------GLSEWGWQACTEMIMLTGGDNKDSIFEESEE-DYDARARYCKEAYGVDPRPNW 412
G+ +W +Q+C E + ++ S + D C+ +G+ +
Sbjct: 320 QDYQNGLGMRQWYYQSCKEYGYWQNANPNPALSTRSSLINLDYHYNVCQRLFGLTEPAH- 378
Query: 413 ITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK---NISKTVLA--LVEKEGAHH 467
T E ++ ASNI F NG DPWS + + N + L L++ E AH
Sbjct: 379 -TAELNNTFYFPLMNILASNIYFTNGENDPWSTLSLAERNGNATNPRLTYQLIQGE-AHC 436
Query: 468 VDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
DLR + D LKD R++ ++ +W+
Sbjct: 437 NDLRTPSSVDSDSLKDARKKMESLLTEWL 465
>gi|270158373|ref|ZP_06187030.1| putative serine carboxypeptidase [Legionella longbeachae D-4968]
gi|269990398|gb|EEZ96652.1| putative serine carboxypeptidase [Legionella longbeachae D-4968]
Length = 465
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 137/509 (26%), Positives = 224/509 (44%), Gaps = 69/509 (13%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDH 64
+FL C ST ++++A I + + E+ + LI+ + Y Q++DH
Sbjct: 8 LLFLGLC----STFSLTHAGIVERY--VQHMQEEKAPLIAEKS-----IQLAYFKQLIDH 56
Query: 65 FNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKA 124
N + TF QRY I++T+ G K+++P+F Y E A N G + + A KF A
Sbjct: 57 ---NNPATGTFSQRYYIDETY--GPKDDSPVFFYICGESACSKRALN-GAIRNYAQKFNA 110
Query: 125 LLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDS-- 182
LV +EHRYYG+S+P + + YL++ AL D A +++LT+ +
Sbjct: 111 KLVALEHRYYGESLPLNS------LSTNDLRYLTTEAALDDLAYF----QRHLTSEKNWH 160
Query: 183 -PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS 241
V FGGSY G L+A++RLKYP++ +GALASSAP++ +N + + +TQ
Sbjct: 161 GKWVAFGGSYPGSLSAYYRLKYPYLVVGALASSAPVMAKENFI---EYDAHVTQ---VAG 214
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPT 301
C ++ Q+E + K +++ F E ++ A A Y
Sbjct: 215 LQCANQMREVVTQVETSLKDTVTFAQIKSLFDAAAVED--PVDFLYLIADTGAAAVQYGM 272
Query: 302 PSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNG--DSDP 359
F + L P P G FAK +Y + + TA G +P
Sbjct: 273 RDAFCSSLSEHPT---------PLEGYAYFAK-----KLYKDMATTAVEMTAQGAMSENP 318
Query: 360 H------GLSEWGWQACTEMIMLTGGDNKDSIFEESEE-DYDARARYCKEAYGVDPRPNW 412
G+ +W +Q+C E + ++ S + D C+ +G+ +
Sbjct: 319 QDYQNGLGMRQWYYQSCKEYGYWQNANPNPALSTRSSLINLDYHYNVCQRLFGLTEPAH- 377
Query: 413 ITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK---NISKTVLA--LVEKEGAHH 467
T E ++ ASNI F NG DPWS + + N + L L++ E AH
Sbjct: 378 -TAELNNTFYFPLMNILASNIYFTNGENDPWSTLSLAERNGNATNPRLTYQLIQGE-AHC 435
Query: 468 VDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
DLR + D LKD R++ ++ +W+
Sbjct: 436 NDLRTPSSVDSDSLKDARKKMESLLTEWL 464
>gi|9506995|ref|NP_062302.1| thymus-specific serine protease precursor [Mus musculus]
gi|13633993|sp|Q9QXE5.1|TSSP_MOUSE RecName: Full=Thymus-specific serine protease; AltName: Full=Serine
protease 16; Flags: Precursor
gi|6706784|emb|CAB66137.1| thymus-specific serine peptidase [Mus musculus]
gi|26352940|dbj|BAC40100.1| unnamed protein product [Mus musculus]
gi|116138603|gb|AAI25482.1| Protease, serine, 16 (thymus) [Mus musculus]
gi|124297571|gb|AAI31956.1| Protease, serine, 16 (thymus) [Mus musculus]
gi|148700669|gb|EDL32616.1| protease, serine, 16 (thymus), isoform CRA_d [Mus musculus]
Length = 509
Score = 134 bits (338), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 123/485 (25%), Positives = 206/485 (42%), Gaps = 83/485 (17%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + Q LD FN + + +TF QRY +ND H G + P+F++ G EG + + G
Sbjct: 57 KQGWLEQPLDPFNASDR--RTFLQRYWVNDQHRTG--QDVPVFLHIGGEGSLGPGSVMAG 112
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+AP + AL++ +EHR+YG S+P GG + + YLSS ALAD AS L
Sbjct: 113 HPAALAPAWGALVISLEHRFYGLSMPAGG------LDLALLRYLSSRHALADVASARQAL 166
Query: 174 KKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L ++ SP + FGGSY G LA W RLK+PH+ A+ASSAP+ +V +++ +
Sbjct: 167 SGLLNVSSSSPWICFGGSYAGSLATWARLKFPHLVFAAVASSAPL---SAVVDFSAYNQV 223
Query: 233 ITQDFRSV----SENCYKVIKGSWKQIEETAKK-PGGLEKLQKAFRIC--------KSEK 279
+ + V S C ++ ++E + P L++ C ++E
Sbjct: 224 VARSLTQVAIGGSLECLAAASTAFTEVERLLRAGPAAQAVLREELGACGSLDLTEDQAEL 283
Query: 280 NLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV----FAKLY 335
A+++ + Y P V+++C + K GN + L
Sbjct: 284 LGALQALVGGTVQYDGQAGAP-----------LSVRQLCGLLLG-KWGNRSRSTPYLGLR 331
Query: 336 GAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQACTEMIMLTGGDNKD 383
A + G KC + +++P G +W +Q CTE +
Sbjct: 332 RAVQIVLRSMGQ-KCLSFSRAETVAQLSNTEPQVSGVGDRQWLYQTCTEFGFYVTCEGLQ 390
Query: 384 SIFEESEEDYDARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASNIIFFNG 438
F + + C++ +G+ P + +GG G A+ ++F NG
Sbjct: 391 CPFSQLPA-LPFQLELCEQVFGLSPASVAQAVAQTNSYYGGQSPG------ATQVLFVNG 443
Query: 439 LRDPWSGGGVLKNISKTVLALVEKEGAHHVD-----------LRFSTKEDPQ----WLKD 483
DPW V +++ + AL+ +H D LR ++ Q WLKD
Sbjct: 444 DTDPWHVLSVTQDLGLSEPALLIPSASHCFDMAPMRPSDSPSLRLGRQKISQQLQDWLKD 503
Query: 484 VRRRE 488
+++ +
Sbjct: 504 IKKSQ 508
>gi|114605978|ref|XP_001136226.1| PREDICTED: thymus-specific serine protease isoform 2 [Pan
troglodytes]
Length = 514
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 122/491 (24%), Positives = 215/491 (43%), Gaps = 72/491 (14%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +ND HW G + PIF+ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVSDR--RSFLQRYWVNDQHWVG--QDGPIFLLLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + +LSS A
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLR------FLSSRLA 156
Query: 163 LADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---IL 218
LAD S + L + ++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L
Sbjct: 157 LADVVSAHLALSRLFNISSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVL 216
Query: 219 NFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLE-KLQKAFR 273
+F +++++++ S S C + ++ ++E + G + LQ
Sbjct: 217 DFSE------YNDVVSRSLMSTAIGGSLECRAAVSVAFAEVERRLRSGGAAQAALQTELS 270
Query: 274 IC--------KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP- 324
C ++E A+++ + Y T P V+++C +
Sbjct: 271 ACGPLGRAENQAELLGALQALVGGVVQYDGQTGAP-----------LSVRQLCGLLLGGG 319
Query: 325 --KTGNDVFAKLYGAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQAC 370
++ + + L A + + G KC + ++P G +W +Q C
Sbjct: 320 GNRSHSTPYCGLRRAVQIVLHSLGQ-KCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTC 378
Query: 371 TEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIGLV 425
TE +N F + ++ C++ +G V + +GG G
Sbjct: 379 TEFGFYVTCENPRCPFSQLPA-LPSQLDLCEQVFGLSALSVAQAVAQTNSYYGGQTPG-- 435
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
A+ ++F NG DPW V + + + L+ + G+H +D+ D L+ R
Sbjct: 436 ----ANKVLFVNGDTDPWHVLSVTQALGSSESTLLIRTGSHCLDMAPERPSDSPSLRLGR 491
Query: 486 RREVEIIGKWI 496
+ + + W+
Sbjct: 492 QNIFQQLQTWL 502
>gi|397479343|ref|XP_003810983.1| PREDICTED: thymus-specific serine protease [Pan paniscus]
Length = 514
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 122/491 (24%), Positives = 215/491 (43%), Gaps = 72/491 (14%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +ND HW G + PIF+ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVSDR--RSFLQRYWVNDQHWVG--QDGPIFLLLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + +LSS A
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLR------FLSSRLA 156
Query: 163 LADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---IL 218
LAD S + L + ++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L
Sbjct: 157 LADVVSAHLALSRLFNISSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVL 216
Query: 219 NFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLE-KLQKAFR 273
+F +++++++ S S C + ++ ++E + G + LQ
Sbjct: 217 DFSE------YNDVVSRSLMSTAIGGSLECRAAVSVAFAEVERRLRSGGAAQAALQTELS 270
Query: 274 IC--------KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP- 324
C ++E A+++ + Y T P V+++C +
Sbjct: 271 ACGPLGRAENQAELLGALQALVGGVVQYDGQTGAP-----------LSVRQLCGLLLGGG 319
Query: 325 --KTGNDVFAKLYGAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQAC 370
++ + + L A + + G KC + ++P G +W +Q C
Sbjct: 320 GNRSHSTPYCGLRRAVQIVLHSLGQ-KCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTC 378
Query: 371 TEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIGLV 425
TE +N F + ++ C++ +G V + +GG G
Sbjct: 379 TEFGFYVTCENPRCPFSQLPA-LPSQLDLCEQVFGLSALSVAQAVAQTNSYYGGQTPG-- 435
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
A+ ++F NG DPW V + + + L+ + G+H +D+ D L+ R
Sbjct: 436 ----ANKVLFVNGDTDPWHVLSVTQALGSSESTLLIRTGSHCLDMAPERPSDSPSLRLGR 491
Query: 486 RREVEIIGKWI 496
+ + + W+
Sbjct: 492 QNIFQQLQTWL 502
>gi|67465637|ref|XP_648991.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
gi|56465317|gb|EAL43602.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
Length = 480
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 125/472 (26%), Positives = 206/472 (43%), Gaps = 52/472 (11%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG-DIEWFAQNT 112
KT +T LDHFN N Q F +Y I+ + + NAP+FV G EG + E QN
Sbjct: 38 KTLTYTVPLDHFNANNQI--DFDIQYFISTDYLDNNSPNAPLFVLLGGEGPEDETGLQNY 95
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
+ D+A K K L++ +EHR+YG S P ++ Y A QAL DY +I
Sbjct: 96 FVVTDLAKKHKGLMLSVEHRFYGASTPSLEMDKLIYCTAE--------QALMDYVEVISH 147
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+++ PV+V GGSY G LAAW R KYP+V GA ASSAP+ + +V Y + +
Sbjct: 148 VQEENNLVGHPVIVLGGSYSGNLAAWMRQKYPNVVEGAWASSAPV---EAVVDFYQYLEV 204
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
+ ++ +N ++ +++Q ++ G ++L K F C I+++ + +
Sbjct: 205 VQ---NALPKNTADLLSFAFEQWDKMTTTEEGRKELGKIFNTCTEFGEKDIQTFAES--I 259
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKE-MCKAIDDPKTGN--------------DVFAKLYGA 337
TA++ Y N N P++ + +C I++ D
Sbjct: 260 GTALSGY-VQYNSSNWKPSYESTDSICAEINEDIVNKYPLFIKEKYNPEWADKECTPSSQ 318
Query: 338 ASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARA 397
Y T+ + G+ D G S W +Q C + S+
Sbjct: 319 EESYKTLQNTSTYAE--GNEDASGRS-WFFQTCIAYGYYQAVSEQSSVKWGKLNQLQGSI 375
Query: 398 RYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
CK+ YG+D + I +GG K + +N+ F NG DPW GV ++
Sbjct: 376 DMCKDIYGIDKDTLYNAVDHINVRYGGKKPCV------TNVAFTNGNTDPWHALGVTESD 429
Query: 453 SK--TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
+ ++ L+++ +H DL + D LK R E++ + ++ Q+
Sbjct: 430 HQEGNLVQLIDR-TSHCSDLYSEKENDVPELKKARHNELKFFAQVLANVPQN 480
>gi|307196628|gb|EFN78125.1| Putative serine protease K12H4.7 [Harpegnathos saltator]
Length = 429
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 115/451 (25%), Positives = 198/451 (43%), Gaps = 52/451 (11%)
Query: 73 QTFQQRYLINDTHWGGSKNNAPIFVYTGNEG-DIEWFAQNTGFMYDVAPKFKALLVFIEH 131
+ ++QRY +N ++ K N P+F+ G E +W + G D A + A+ ++EH
Sbjct: 2 RVWKQRYFVNSDYY---KPNGPVFLMIGTEKIKPKWMVE--GLWIDYAKELGAMCFYVEH 56
Query: 132 RYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGS 190
RYYGKS P +++ N + +LSS AL D+A I ++ D+ +VFGGS
Sbjct: 57 RYYGKSHP---TVDLSTDNLT---FLSSEIALQDFAYFIRNINIEYKFPNDTKWIVFGGS 110
Query: 191 YGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKG 250
YGG LAAW RLKYPH GA+++S P+L + Y ++ + S+ C +
Sbjct: 111 YGGSLAAWMRLKYPHFVHGAVSASGPLLALIDFQEYYV---VVEDALKQHSQQCVDAVAN 167
Query: 251 SWKQIEETAKKPGGLEKLQKAFRIC------KSEKNLAIESWLSTAFVYTAMTDYPTPSN 304
+ + G E++ + FR+C + + L+ F Y +
Sbjct: 168 ANTEFHTMLHHLTGQEQIAEKFRLCDPIDPGHTADISNLYQSLANNFAYIVQNNKNNRQE 227
Query: 305 FLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGL-- 362
+ V +C + + + G V +L S+ N + KC D D+ H L
Sbjct: 228 --SKTANINVDTICDVLTNDELGRPV-DRLAYMNSMILN-ATKEKCLDYKYDNMIHSLRS 283
Query: 363 ----------SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG----VDP 408
+W +Q C+E+ + +F E+ D + C + +G +D
Sbjct: 284 INWNEQVEGERQWMYQTCSEVGFFQTSTARPKLFSETFP-VDFYVQQCVDIFGPSYNLDM 342
Query: 409 RPNWIT---TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
+ +T T +G L + SN++ +G DPW G+ K+ + +A+ + A
Sbjct: 343 LKSVVTRTNTLYGA------LNQKVSNVVHVHGSLDPWHTLGITKSSNHPQVAIYINDTA 396
Query: 466 HHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
H L S+++DP LK R +I +W+
Sbjct: 397 HCAILYPSSEKDPPQLKQARIVVKGLIKQWL 427
>gi|431892225|gb|ELK02665.1| Thymus-specific serine protease [Pteropus alecto]
Length = 515
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 124/478 (25%), Positives = 204/478 (42%), Gaps = 59/478 (12%)
Query: 37 EKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIF 96
E+ S + + L + Q LD FN + ++F QRY +N+ HW + + P+F
Sbjct: 39 ERSSPGLGPGPGAAALPAEGWLEQALDPFNAS--DRRSFLQRYWVNEQHW--ASRDGPVF 94
Query: 97 VYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY 156
++ G EG + A G +AP + AL++ +EHR+YG SIP GG ++A+ +
Sbjct: 95 LHLGGEGSLGPGAVMRGHPAALAPAWGALVIGLEHRFYGLSIPAGG-LDMAHLR-----F 148
Query: 157 LSSTQALADYASLIIDLKKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
LSS ALAD S + L + L ++ SP V FGGSY G LAAW RLK+PH+ A+ASSA
Sbjct: 149 LSSRHALADVVSARLALSRLLNVSSSSPWVCFGGSYAGSLAAWARLKFPHLVFAAVASSA 208
Query: 216 PILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC 275
P+ V +S N S + ++ + G L +
Sbjct: 209 PV----RAVLDFSAYNECRAAASSAFAEVARRLRAGEAARAALRAELGACASLDREED-- 262
Query: 276 KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV----F 331
++E A+++ + A Y P V+++C + GN +
Sbjct: 263 RAELLGALQALVGGAVQYDGQAGAP-----------LSVRQLCGLLQ--AGGNRSRPAPY 309
Query: 332 AKLYGAASVYYNYSGTAKCFDLNGD--------SDPH----GLSEWGWQACTEMIMLTGG 379
L A V + G KC + ++P G +W +Q CTE
Sbjct: 310 RGLRQAVQVVLHSLGQ-KCLSFSRAETVAQLRVTEPQVSSVGDRQWLYQTCTEFGFYVTC 368
Query: 380 DNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIGLVLKRFASNII 434
++ F + ++ C++ +G V + +GG G A+ ++
Sbjct: 369 EDPGCPFSQLPA-LPSQLELCEQVFGLSTSSVAQAVAQTNSYYGGQTPG------ATQVL 421
Query: 435 FFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEII 492
F NG DPW V + + +V AL+ +H +D+ D L+ R+ V+I+
Sbjct: 422 FVNGDADPWHVLSVTQALGPSVSALLIPSASHCLDMAPERPSDSPSLRLGRQSPVQIL 479
>gi|357483671|ref|XP_003612122.1| Thymus-specific serine protease [Medicago truncatula]
gi|355513457|gb|AES95080.1| Thymus-specific serine protease [Medicago truncatula]
Length = 478
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 145/498 (29%), Positives = 209/498 (41%), Gaps = 81/498 (16%)
Query: 35 TPEKLSSLISSSKDSQGLYKTK---YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKN 91
TP L +S S Y TK + Q LDH Y+P ++ FQQRY H+
Sbjct: 19 TPHLLRRRLSES----ARYLTKEELWFPQTLDH--YSPYDHRKFQQRYYEFLDHF--RIP 70
Query: 92 NAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
+ P+F+ E + + ++ +A KF A +V +EHRYYGKS P+ K +A KN
Sbjct: 71 DGPVFLVICGEYSCDGIRND--YIGVLAKKFGAAVVSLEHRYYGKSSPF---KSLATKNL 125
Query: 152 STTGYLSSTQALADYASL------IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPH 205
YLSS QAL D A ++ K N T D+P VFG SY G L+AWFRLK+PH
Sbjct: 126 R---YLSSKQALFDLAVFRQNYQDSLNAKLNRTNADNPWFVFGVSYPGALSAWFRLKFPH 182
Query: 206 VAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGL 265
+ G+LASSA +L + Y+F+ Q S C ++ + + IE G
Sbjct: 183 LTCGSLASSAVVL------AVYNFTEFDQQIGESAGVECKAALQETTRLIERKLVTNG-- 234
Query: 266 EKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK 325
KA + + +L I+ F+Y T + N P K + KA D +
Sbjct: 235 ----KALKASFNAADLEIDG----DFLYFLADAAVTAFQYGN--PDILCKPLVKAKKDGE 284
Query: 326 TGNDVFAKLY--------GAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLT 377
D +AK G ++ YN + S W +Q CTE+
Sbjct: 285 DLVDAYAKFIKEFYLGTEGESTQDYNQNNLKNAAITENSSG----RLWWFQVCTEVAYFQ 340
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIF 435
+ DSI S+ D CK +G P+ T +GG KI S I+F
Sbjct: 341 VAPSNDSI-RSSKVDTRYHLDLCKNVFGEGIFPDVDATNIYYGGTKIA------GSKIVF 393
Query: 436 FNGLRDPWSGGGVLKNISKTVL---ALVEKEGAHHVDLRFSTKE------------DPQW 480
NG +DPW K IS + + H D+R + P
Sbjct: 394 TNGSQDPWRRAS--KQISSPNMPSYTITCHNCGHGTDMRGCPQSPFNIEGNEKNCTSPDA 451
Query: 481 LKDVRRREVEIIGKWISQ 498
+ VR++ +E + W+SQ
Sbjct: 452 VHKVRQKIIEHMDLWLSQ 469
>gi|397626302|gb|EJK68100.1| hypothetical protein THAOC_10754 [Thalassiosira oceanica]
Length = 629
Score = 133 bits (334), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 133/506 (26%), Positives = 211/506 (41%), Gaps = 98/506 (19%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-----WFAQN 111
Y Q++DHF+ S T+ RY + ++GG + PIF+ G EG +E + ++
Sbjct: 151 YADQLVDHFD---GSTDTWDNRYYASSRYFGGPGH--PIFMVVGGEGSLEKMLYPFVNEH 205
Query: 112 TGFMYDVAPKFKALLVFIEHRYYG--KSIPYGGNKEIAYKNASTTGYLSSTQALADYASL 169
F F A +V IEHR+YG + +P +E+ T L+ QA+AD L
Sbjct: 206 LAF------HFGAAVVQIEHRFYGPYQPLPNATVEEL-------TELLTPQQAMADMVRL 252
Query: 170 IIDLKKNLTATD--------SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFD 221
K L PVV GG+Y G L+A FRL + A ASSAP+ +D
Sbjct: 253 TKHFKDELGCGGYDRTSPEYCPVVSVGGAYPGFLSAMFRLAHGDFVDVAYASSAPLKLYD 312
Query: 222 NIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSE--- 278
+ + +T+ +S C ++ + + EE +E +A +C
Sbjct: 313 QSAPQEVYYDTVTRAAERLSPGCADAVRSALVEAEEAIMASPSVESAVRAMNMCVDSIPE 372
Query: 279 -----KNLAIESWLSTAFVYT--AMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG---- 327
L + ++ AF + M YP PS L + C+ D G
Sbjct: 373 YIHDLDTLRQDVMMAVAFSFADFDMGAYP-PSEELG------MHRACRVFQDAAGGHYTS 425
Query: 328 -------------NDVFAKLYGAASVYYNYSGTAKCFDL-----------------NGDS 357
++ F K Y + GT CFDL +G
Sbjct: 426 VQRVADFFTLVGEDEEFEKKY---PQFVGEEGT-PCFDLSIFLPDGPNARIATSDWSGSG 481
Query: 358 DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY--GVDPRPNWITT 415
+ W +Q CT ++ G +++S+F YD YC+ Y G+ P+P +
Sbjct: 482 GGNDGKMWEFQLCTTLVEPI-GISEESMFPYRTWTYDDLTEYCQLRYGEGIVPQPLALVR 540
Query: 416 EFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTK 475
+ G + +K AS IIF NG +D WSG ++++S T+LAL + GA+H DL
Sbjct: 541 DLGFDDL---VKSNASRIIFTNGKQDMWSGASYVEDVSDTILALNFENGAYHSDLSHMGP 597
Query: 476 EDPQWLKDVRRREVE---IIGKWISQ 498
D + +D+R E I+G+W+ +
Sbjct: 598 SDTE-TEDIREGFAEITDILGRWLGE 622
>gi|328718795|ref|XP_001947661.2| PREDICTED: putative serine protease K12H4.7-like isoform 1
[Acyrthosiphon pisum]
gi|328718797|ref|XP_003246580.1| PREDICTED: putative serine protease K12H4.7-like isoform 2
[Acyrthosiphon pisum]
Length = 501
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 137/499 (27%), Positives = 216/499 (43%), Gaps = 60/499 (12%)
Query: 35 TPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAP 94
+PE L S+ + + ++ Q LDHFN P + +T++QRY +N ++ KN+ P
Sbjct: 33 SPESLRSMNT---------EDEWFIQKLDHFN--PTNNRTWKQRYQVNLENY---KNDGP 78
Query: 95 IFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT 154
+F+ G EG I ++G D A +F AL +EHRYYG+S P + S
Sbjct: 79 VFLMIGGEGKISDKWMHSGAWIDYAKEFNALCFQLEHRYYGESHPTED------MSTSNL 132
Query: 155 GYLSSTQALADYASLIIDLK--KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALA 212
YLSS QALAD A I+++K N+ +T + V FGGSY G LAAW R+KYPH+ A++
Sbjct: 133 VYLSSDQALADLAEFIVNIKIKYNIPST-AKWVAFGGSYPGTLAAWLRMKYPHLIHAAVS 191
Query: 213 SSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
SS P+L + + ++ + + C IK + + I K G + ++K F
Sbjct: 192 SSGPLL---AKIDFKEYFMVVENALATYNPECVSQIKQANQMINYYLKTDQGAKIIEKKF 248
Query: 273 RIC-----KSEKNLA-IESWLSTAFVYTAMTD-----YPTPSNFLNPLPAFPVKEMCKAI 321
++C K++K+++ + L+ F + Y P L + K+I
Sbjct: 249 KLCDPLDRKNDKDVSYLFEILADNFAVIVQYNKDNRHYKDPERSSVTLETLCDIMVNKSI 308
Query: 322 DDP----KTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPH---GLSEWGWQACTEMI 374
P N+ L A YS + LN D G +W +Q CTE
Sbjct: 309 PTPLDRYAAVNNKLLSLNKADCQDNIYSQLIDLY-LNTSWDSSAAGGGRQWTYQTCTEFG 367
Query: 375 MLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWIT-------TEFGGHKIGLVLK 427
D F + D C++ +G +T T FG I
Sbjct: 368 FYQTSSQDDHAFGHNFP-IDFFINMCQDIFGKSYNSELLTAAVERTNTMFGELNIR---- 422
Query: 428 RFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRR 487
S +IF +G DPW G+ K +K +A+ +H ++ D L R
Sbjct: 423 --DSRVIFVHGSVDPWHALGITKARTKNNVAIFINGTSHCANMYPPASSDLPELTQARTT 480
Query: 488 EVEIIGKWISQY-FQDLAQ 505
+ +W+++ F D AQ
Sbjct: 481 IRSYLREWLAENDFVDSAQ 499
>gi|344268145|ref|XP_003405923.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 504
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 122/479 (25%), Positives = 204/479 (42%), Gaps = 74/479 (15%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE--WFAQNTG 113
++ Q LDHF+ Q ++QRY IND + K P+F+ G G + W ++N
Sbjct: 64 RWFMQKLDHFD---QKEIFWRQRYFINDAFY---KPGGPVFLMIGGMGSAKRNWTSRNLP 117
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
F+ A + AL + +EHR+YG+S P G + ++ Y+ + Q L D A+ I +
Sbjct: 118 FV-AYAERLGALCLVLEHRFYGRSQPTGD------LSTASLRYIRNHQVLGDIANFRIKI 170
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFS 230
K + T + V FG YGG LA W R+KYP + A+ SSAP+ +NFD S
Sbjct: 171 AKLMGLTKNKWVAFGEFYGGSLAVWSRIKYPDLFAAAVGSSAPVKAEINFDEYFEEVQVS 230
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
+ + C + + +++ + KL++ F +C+ L I+S
Sbjct: 231 ------LDAHNSECSSSVYLALREVTKRLIHQKHYSKLKRDFMLCEP---LQIDSKQHAT 281
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA--ASVYYNYS--- 345
FV + +FL P+ + K K++ + + +D K+ +S Y+ Y+
Sbjct: 282 FVLENLM------SFLIPIVQYNKKR--KSVMNILSTDDFCKKMTETPLSSPYHRYARIM 333
Query: 346 ------GTAKCFDLNGDSDPHGLSEWG-------------WQACTEMIMLTGGDNKDSIF 386
C D N + +SE +Q CTE D+K F
Sbjct: 334 SNRIKNANLSCLDANYNHHLRRMSETSLNNGNILQVRQRLYQCCTEFGFFQTTDSKYQSF 393
Query: 387 EESEEDYDARARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGL 439
E Y + C + +G ++ + +GG + S IIF NG
Sbjct: 394 SELPLRYFLKQ--CSDVFGSEYSFSALNRSAQALNKYYGGFNVK------GSKIIFSNGS 445
Query: 440 RDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
DPWS G+ K+I+K A++ + AH D+ D L R + +I+ +W+ +
Sbjct: 446 LDPWSALGITKDINKNFRAVLIEGEAHCADMDEKMDSDSAELIQAREKIFQILQEWLKE 504
>gi|194899986|ref|XP_001979538.1| GG23239 [Drosophila erecta]
gi|190651241|gb|EDV48496.1| GG23239 [Drosophila erecta]
Length = 473
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 121/464 (26%), Positives = 206/464 (44%), Gaps = 58/464 (12%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T + Q LDHF+ P +T+Q RY++ND + K+ AP+F+Y G E +I G
Sbjct: 46 QTLWIEQKLDHFD--PAETRTWQMRYMLNDALY---KSGAPLFIYLGGEWEISAGRITGG 100
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+YD+A + ALL + EHRYYG+S P +++ +N YLS Q+LAD A I +
Sbjct: 101 HLYDMAKEHSALLAYTEHRYYGQSKPL---PDLSNENIK---YLSVNQSLADLAYFINTI 154
Query: 174 KKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
K+N ++S V++ GGSY + WF+ YP + G ASSAP+ N V + +
Sbjct: 155 KQNHEGLSESKVIIVGGSYSATMVTWFKKLYPDLVAGGWASSAPLFAKVNFVE---YKEV 211
Query: 233 ITQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
Q + + CYK I+ ++E G E ++ ++C+ + + L
Sbjct: 212 TGQSIEQMGGSACYKRIENGIAEMESMIATKRGAE-VKALLKLCEP---FDVYNDLDVWT 267
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNY--SGTAK 349
+++ ++D + + A ++ +C+ I ND L G A + A+
Sbjct: 268 LFSEISD--IFAGVVQTHNAGQIEGVCQVI--MAGSND----LNGVARYLLDVFEESDAQ 319
Query: 350 CFDLNGDS----------DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
C+DL+ D+ + + +W +Q C E D+ F ++
Sbjct: 320 CYDLSYDAITTLLLDTSYSNNIMRQWIFQTCNEYGWYQTSDSAAQPFG-TKFPVVYYTTM 378
Query: 400 CKEAYGVDPRPNWITTE-------FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
C + YG +I+ + FGG G+ N+ +G DPW G+
Sbjct: 379 CADLYGSQYSNEFISNQVVITNQYFGGLSPGV------ENVYLTHGQLDPWRAMGIQDEA 432
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
T+L E AH D + D +K + R E++ +W+
Sbjct: 433 QATIL----PEYAHCKDFNSISSSDTAEMKASKERIAELVREWV 472
>gi|149029310|gb|EDL84577.1| protease, serine, 16 (thymus), isoform CRA_b [Rattus norvegicus]
Length = 496
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 125/481 (25%), Positives = 206/481 (42%), Gaps = 67/481 (13%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + Q LD FN + + F QRY +ND H G +AP+F++ G EG + + TG
Sbjct: 41 KQGWLEQPLDPFNATDR--RIFLQRYWVNDQHRTG--QDAPVFLHIGGEGSLGPGSVMTG 96
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+AP + AL++ +EHR+YG S+P GG + + YLSS ALAD AS L
Sbjct: 97 HPAALAPAWGALVISLEHRFYGLSMPAGG------LDVALLHYLSSRHALADVASARQAL 150
Query: 174 KKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L ++ SP + FGGSY G LA W RLK+PH+ A+ASSAP+ + +V +++ +
Sbjct: 151 SGLLNVSSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPL---NAVVDFSAYNQV 207
Query: 233 ITQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLEK---LQKAFRICKSEKNLAIES 285
+ + V S C ++ ++E + GL L++ C S + ++
Sbjct: 208 VARSLTQVTIGGSLECLAAASTAFAEVERLLR--AGLASQAVLREELGACGSLDLIEDQA 265
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI---------DDPKTG-NDVFAKLY 335
L A +PL V+++C + P G ++
Sbjct: 266 ELLGALQALVGGTVQYDGQAGSPL---SVRQLCGLLLRTWGNRNHSTPYLGLRRAVQRVL 322
Query: 336 GAASVYYNYSGTAKCFDLNGDSDPH----GLSEWGWQACTEMIMLTGGDNKDSIFEESEE 391
+ S A+ +++P G +W +Q CTE + F +
Sbjct: 323 RSMGQKCLSSSRAETVAQLRNTEPQVSGVGDRQWLYQTCTEFGFYVTCEGLQCPFSQLPA 382
Query: 392 DYDARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
+ C++ +G+ P + +GG G A+ ++F NG DPW
Sbjct: 383 -LPFQLDLCEQVFGLSPSSVAQAVAQTNSYYGGQSPG------ATQVMFVNGDTDPWHVL 435
Query: 447 GVLKNISKTVLALVEKEGAHHVD-----------LRFSTKEDPQ----WLKDVRRREVEI 491
V +++ + AL+ +H +D LR ++ Q WLKD+RR +
Sbjct: 436 SVTQDLGPSEPALLIPSASHCLDMAPMRPSDSPSLRLGRQKIFQQLQVWLKDIRRARTDE 495
Query: 492 I 492
I
Sbjct: 496 I 496
>gi|76096362|ref|NP_001028882.1| thymus-specific serine protease precursor [Rattus norvegicus]
gi|75775148|gb|AAI04723.1| Protease, serine, 16 (thymus) [Rattus norvegicus]
gi|149029311|gb|EDL84578.1| protease, serine, 16 (thymus), isoform CRA_c [Rattus norvegicus]
Length = 512
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 125/481 (25%), Positives = 206/481 (42%), Gaps = 67/481 (13%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + Q LD FN + + F QRY +ND H G +AP+F++ G EG + + TG
Sbjct: 57 KQGWLEQPLDPFNATDR--RIFLQRYWVNDQHRTG--QDAPVFLHIGGEGSLGPGSVMTG 112
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+AP + AL++ +EHR+YG S+P GG + + YLSS ALAD AS L
Sbjct: 113 HPAALAPAWGALVISLEHRFYGLSMPAGG------LDVALLHYLSSRHALADVASARQAL 166
Query: 174 KKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L ++ SP + FGGSY G LA W RLK+PH+ A+ASSAP+ + +V +++ +
Sbjct: 167 SGLLNVSSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPL---NAVVDFSAYNQV 223
Query: 233 ITQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLEK---LQKAFRICKSEKNLAIES 285
+ + V S C ++ ++E + GL L++ C S + ++
Sbjct: 224 VARSLTQVTIGGSLECLAAASTAFAEVERLLR--AGLASQAVLREELGACGSLDLIEDQA 281
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI---------DDPKTG-NDVFAKLY 335
L A +PL V+++C + P G ++
Sbjct: 282 ELLGALQALVGGTVQYDGQAGSPL---SVRQLCGLLLRTWGNRNHSTPYLGLRRAVQRVL 338
Query: 336 GAASVYYNYSGTAKCFDLNGDSDPH----GLSEWGWQACTEMIMLTGGDNKDSIFEESEE 391
+ S A+ +++P G +W +Q CTE + F +
Sbjct: 339 RSMGQKCLSSSRAETVAQLRNTEPQVSGVGDRQWLYQTCTEFGFYVTCEGLQCPFSQLPA 398
Query: 392 DYDARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
+ C++ +G+ P + +GG G A+ ++F NG DPW
Sbjct: 399 -LPFQLDLCEQVFGLSPSSVAQAVAQTNSYYGGQSPG------ATQVMFVNGDTDPWHVL 451
Query: 447 GVLKNISKTVLALVEKEGAHHVD-----------LRFSTKEDPQ----WLKDVRRREVEI 491
V +++ + AL+ +H +D LR ++ Q WLKD+RR +
Sbjct: 452 SVTQDLGPSEPALLIPSASHCLDMAPMRPSDSPSLRLGRQKIFQQLQVWLKDIRRARTDE 511
Query: 492 I 492
I
Sbjct: 512 I 512
>gi|116788543|gb|ABK24916.1| unknown [Picea sitchensis]
Length = 489
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 137/471 (29%), Positives = 201/471 (42%), Gaps = 73/471 (15%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
+Q LDH Y+P ++ F QRY ++ ++ PIF+ E + A + +M +
Sbjct: 49 SQRLDH--YSPTDHRQFNQRYYEFLDYF--QAHDGPIFLKVCGEYSCDGIAND--YMAVL 102
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IID 172
A KF A +V +EHRYYGKS P+ KE N YLSS QAL D AS + +
Sbjct: 103 AKKFGAAIVSLEHRYYGKSSPF---KESTTHNLQ---YLSSKQALFDLASFRNYYQELTN 156
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
K+NL+ D+ VFG SY G L+AWFRLK+PH+ G+LASSA +L + Y+F++
Sbjct: 157 KKQNLSNYDNSWFVFGVSYPGALSAWFRLKFPHLTCGSLASSAVVL------AVYNFTDF 210
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEE-------TAKKPGGLEKLQKAFRICKSEKNLAIES 285
Q S C K ++ + E+ K G EKL+ + A+
Sbjct: 211 DRQIGESAGPGCKKALQEVTRLAEQGLSTNANAVKSLFGAEKLKNDGDFLYLLADAAV-- 268
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYS 345
TAF Y YP + P+ E + D + K Y + +Y
Sbjct: 269 ---TAFQYG----YPD-------VLCSPLVEASSSGKDLMVAYADYVKNYSSDGGVESY- 313
Query: 346 GTAKCFDLNGDSDPHGLS---EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKE 402
G + D+D +G+S W +Q CTE + DS+ S+ D C+
Sbjct: 314 GQQFLKNTTVDADSNGVSNSRSWWYQVCTEFAYFQVAPSNDSV-RSSKVDTKYHLDLCEN 372
Query: 403 AY--GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
+ G+ P + +GG I S IIF NG +DPW K+ ++
Sbjct: 373 VFGKGIYPEVDITNLYYGGTSIA------GSKIIFMNGSQDPWRHASKQKSSDNMPSYII 426
Query: 461 EKEGA-HHVDLR------------FSTKEDPQWLKDVRRREVEIIGKWISQ 498
H DLR S P + R++ VE I W+SQ
Sbjct: 427 TCHNCGHGTDLRGCPQSPSRIEGDASDCASPDVVHKARQQMVEHIDLWLSQ 477
>gi|355762458|gb|EHH61967.1| Thymus-specific serine protease [Macaca fascicularis]
Length = 514
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 119/485 (24%), Positives = 204/485 (42%), Gaps = 60/485 (12%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +N+ HW G + PIF++ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVSDR--RSFLQRYWVNEQHWVG--EDGPIFLHLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + +LSS A
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLR------FLSSRHA 156
Query: 163 LADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---IL 218
LAD S + L + ++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L
Sbjct: 157 LADVVSARLALSRLFNISSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVL 216
Query: 219 NF---DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC 275
+F +++VS S I + +L +
Sbjct: 217 DFSEYNDVVSRSLMSTAIGGSLEXXXXXXXXXXXXXXXXXXXXXXQAALRSELSACGTLG 276
Query: 276 KSEKNL----AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP---KTGN 328
+E A+++ + A Y P V+++C + ++ +
Sbjct: 277 SAENQAELLGALQALVGGAVQYDGQAGAP-----------LSVRQLCGLLLGGGGNRSHS 325
Query: 329 DVFAKLYGAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQACTEMIML 376
+ L A + + G KC + ++P G +W +Q CTE
Sbjct: 326 TPYCGLRRAVQIVLHSLGQ-KCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFY 384
Query: 377 TGGDNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIGLVLKRFAS 431
N F + ++ C++ +G V + +GG G A+
Sbjct: 385 VTCGNPRCPFSQLPA-LPSQLDLCEKVFGLSALSVAQAVAQTNSYYGGQTPG------AN 437
Query: 432 NIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEI 491
++F NG DPW V + + + AL+ + G+H +D+ D L+ R+ +
Sbjct: 438 QVLFVNGDTDPWHVLSVTQALGSSESALLIRTGSHCLDMAPERPSDSPSLRLGRQNIFQQ 497
Query: 492 IGKWI 496
+ W+
Sbjct: 498 LQTWL 502
>gi|156391046|ref|XP_001635580.1| predicted protein [Nematostella vectensis]
gi|156222675|gb|EDO43517.1| predicted protein [Nematostella vectensis]
Length = 444
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 115/454 (25%), Positives = 188/454 (41%), Gaps = 50/454 (11%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHF+ + +T++QR+ NDT ++P+F+ G EG I G M A
Sbjct: 21 QRLDHFD--DSNTETWKQRFYYNDTF--RKTKDSPVFLMVGGEGAISPVWVLIGNMMKYA 76
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
F A+ +EHR+YG+S P + K YL+S QALAD A+ +
Sbjct: 77 EGFGAMAFILEHRFYGQSHPRSDMSDANLK------YLNSEQALADLAAFRQAMSVKFNL 130
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRS 239
TDS + FGGSY G L+AW RLKYPH+ GA+ASSAP+L N + ++T +
Sbjct: 131 TDSKWISFGGSYPGSLSAWLRLKYPHLIHGAVASSAPVLAQLNFP---EYLEVVTASLET 187
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK--SEKNLAIESWLST--AFVYTA 295
+C K I + IEE G +KL FR+C+ + +N S S+ A ++
Sbjct: 188 TGPDCTKNIANATAAIEELLDADEGTKKLTNLFRVCEPLNRRNDNDVSTFSSNLAGLFMG 247
Query: 296 MTDYPTPSNFLNPLPA--FPVKEMCKAIDDPKTG---------NDVFAKLYGAASVYYNY 344
+ Y + +P + +C ++D G N + YG + +Y
Sbjct: 248 VVQYNKDNRAFEGVPGTNITIATVCGIMNDKSLGPALMRYAKLNSLILDTYGEKCLDASY 307
Query: 345 SGTAKCF-DLNGDSD-PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKE 402
+++ DS G +W +Q CTE ++ ++ D +
Sbjct: 308 QNAINSLRNVSWDSSAAEGGRQWTYQTCTEF----------GFYQTTDSDNQPFGKRFPL 357
Query: 403 AYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
Y + + F + +++ +N + G G+ + +
Sbjct: 358 KYSIQQCMDVFGEAFNSSNLASGIRQTNTN----------YGGKGIASSRDIVFVVFYPT 407
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
AH ++ + D LK R + I KW+
Sbjct: 408 GTAHCANMYPESDSDSPQLKQAREVIKQHIAKWL 441
>gi|224138414|ref|XP_002322808.1| predicted protein [Populus trichocarpa]
gi|222867438|gb|EEF04569.1| predicted protein [Populus trichocarpa]
Length = 124
Score = 132 bits (332), Expect = 4e-28, Method: Composition-based stats.
Identities = 62/114 (54%), Positives = 74/114 (64%), Gaps = 19/114 (16%)
Query: 383 DSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDP 442
DSIF SE +Y+ A +CK +GV+PRPNWIT EFGGH I VL+RF SN IFFNGLRDP
Sbjct: 26 DSIFPASEWNYNDTATFCKGFFGVEPRPNWITAEFGGHDIKRVLRRFGSNFIFFNGLRDP 85
Query: 443 WSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
WS GAHHVDLRF+ EDP+WL+DVR+ II +W+
Sbjct: 86 WS-------------------GAHHVDLRFAIGEDPKWLQDVRKGGASIIAEWL 120
>gi|301109703|ref|XP_002903932.1| serine protease family S28, putative [Phytophthora infestans T30-4]
gi|262096935|gb|EEY54987.1| serine protease family S28, putative [Phytophthora infestans T30-4]
Length = 528
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 131/493 (26%), Positives = 212/493 (43%), Gaps = 72/493 (14%)
Query: 28 TFPSSRITPEKLSSLISSSKDSQGLYKTK----YHTQILDHFNYNPQSYQTFQQRYLIND 83
TFPS +L L+ + +SQ L+KT+ + Q +DHF + S TFQQRY +
Sbjct: 20 TFPS-HAKHGRLWQLVQA--ESQLLFKTEAQQLWFNQTVDHFASD--SNATFQQRYYEVN 74
Query: 84 THWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGN 143
W SK + P+ +Y G EG +E GF++ +A KF A ++ +EHR+YG+SIP G
Sbjct: 75 KFW--SKPDGPVILYIGGEGAME--KAPAGFVHVIAQKFDAKILALEHRFYGRSIPNGDL 130
Query: 144 KEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPV-VVFGGSYGGMLAAWFRLK 202
Y+ YL+ QALAD + L A D+ + GGSY G L+AWFR+
Sbjct: 131 STENYR------YLTVQQALADLKHFKESYQSQLGAKDANQWIAIGGSYPGALSAWFRIA 184
Query: 203 YPHVAIGALASSAPILNFDNIVSP-YSFSNIITQDFRSVSENCYKVIKGSWKQIEE---- 257
YP + +L+SS +V P Y F Q + +C V++ + + E+
Sbjct: 185 YPDATVASLSSSG-------VVQPVYKFHQFDEQVALAAGPSCADVLRLTTEVFEKEVAS 237
Query: 258 ----TAKKPGGLEKLQKA--FRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPA 311
KK G + L A F + +A++ + + + ++ ++ +
Sbjct: 238 ANATAVKKLFGAQDLADADFFYMIADAAAMAVQYGHKDIVCNSMVGAFERNNSLVDSFAS 297
Query: 312 FPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACT 371
F + D++ +G+ Y KC + G S W WQ C+
Sbjct: 298 FTI--------------DMYGSSFGSECFY-----DTKCLADDRSRWGDGRS-WRWQKCS 337
Query: 372 EMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP----NWITTEFGG-HKIGLVL 426
++ + S+ + D D + C+ +G P + IT +GG H G
Sbjct: 338 QLAYFQVAPKEKSL-RSAMLDLDYHLKQCQTVFGDVVHPSEGVDEITKLYGGDHPNG--- 393
Query: 427 KRFASNIIFFNGLRDPWSGGGVLKNISKTVLA-LVEKEGAHHVDLRFSTKEDPQWLKDVR 485
I F NG DPW VL +S +A L + + H + P+ LK R
Sbjct: 394 ----HKIFFSNGGDDPWQRASVLDKLSDDQIANLAKCQLCGHCGDLSANPNVPEPLKKQR 449
Query: 486 RREVEIIGKWISQ 498
+ +E + KW+ +
Sbjct: 450 EQILEYLTKWLGE 462
>gi|301102560|ref|XP_002900367.1| serine protease family S28, putative [Phytophthora infestans T30-4]
gi|262102108|gb|EEY60160.1| serine protease family S28, putative [Phytophthora infestans T30-4]
Length = 526
Score = 132 bits (332), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 132/468 (28%), Positives = 207/468 (44%), Gaps = 57/468 (12%)
Query: 61 ILDHFNYNPQSYQT-FQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA-QNTGFMYDV 118
ILDHF P S ++ ++QRY N+ WGG P+F+Y G EG + A N F+Y +
Sbjct: 71 ILDHFA--PVSKRSKWKQRYQANEEFWGG--RGFPVFLYIGGEGPLGPKAITNRTFVYYL 126
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
A + +ALL+ +EHR+YGKS P ++++ N + YLSS QALAD A +
Sbjct: 127 AEQHRALLLALEHRFYGKSYP---TEDMSLPNLA---YLSSEQALADLAHFHSFVTDKYG 180
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
TD V FGGSY G LAAW +LKYP + G +ASSAP+ + + + ++ R
Sbjct: 181 LTDEKWVAFGGSYPGNLAAWVKLKYPALFAGTVASSAPVQAKTDF---FEYMEVVGDGLR 237
Query: 239 SV-SENCYKVIKGSWKQIEETAKK-PGGLEKLQKAFRICKSEKN----LAIESWLSTAFV 292
CY ++ + Q+ + G +K+ + F+ C N ES + AF
Sbjct: 238 YFGGGECYHEVEKAITQLGHLMDEGQKGRDKVAELFKPCYPMTNEFDDSVFESSVMGAFQ 297
Query: 293 YTA---------MTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYN 343
A MT +F P A V+++ I+ + G+ + +K GAA
Sbjct: 298 DIAQYNGIHEGVMTLSEVCEHFAKPGDA--VEKLASFINKTRVGDCLDSKFQGAA----- 350
Query: 344 YSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFE--ESEEDYDARARYCK 401
+GT + + +W +Q C E + S F + + + CK
Sbjct: 351 -NGTVEVLSRDQFDGKSSARQWVYQTCNEFGYFQTTTSVRSPFHGLRAVTEANVGTEICK 409
Query: 402 EAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI---SKTVLA 458
Y +D P+ +G+ ++R + F +G DPW V + S + A
Sbjct: 410 RVYQMDVAPDVAGANRDYGSLGIEVER----VTFPSGTIDPWHALAVQNSTILHSYSAEA 465
Query: 459 LVEKEGAHHVDLRFSTKEDP---QWLKDVRRREVEIIGKWISQYFQDL 503
+ + AH D+ + ++ D QW D VE W Y QD+
Sbjct: 466 VFIEGTAHCADMYYPSERDSLQLQWAHDKIAARVE----W---YLQDV 506
>gi|290562587|gb|ADD38689.1| serine protease K12H4.7 [Lepeophtheirus salmonis]
Length = 483
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 123/467 (26%), Positives = 202/467 (43%), Gaps = 58/467 (12%)
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
T ++ Q LDHFN ++ + ++QRY +N+ ++ K N P+F+ G EG + G
Sbjct: 49 TSFYDQTLDHFN--TKNKKAWKQRYFVNEENFK-DKENGPVFLKIGGEGTASIGSMKYGS 105
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG--YLSSTQALADYASLIID 172
Y+ A K AL++ +EHR+YG+S P +N ST YL+S QA+ D I
Sbjct: 106 WYEYAQKVGALMIQLEHRFYGESRPT--------ENLSTENLKYLTSQQAIEDIVEFIAH 157
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+K+ ++ + GGSY G L+ W R YP + GAL+SSAP+ + V + I
Sbjct: 158 IKEKYDIPNNKWITLGGSYPGSLSLWMRSLYPELIAGALSSSAPV---EAKVDFEEYLGI 214
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKK-PGGLEKLQKAFRIC-----KSEKNLAIESW 286
+ D R +C + K+ E G +K+ K +++C +EK++
Sbjct: 215 VNNDMRIRDPDCPAAVIEGIKETEALINSGKEGWQKVAKIYKLCPGWSGDNEKDVKT--- 271
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV------ 340
L + V T + S + + L V ++C + + G+ KL G
Sbjct: 272 LFGSIVET----FAGASQYDSTLSTNDVSQLCSHMKNSNFGDTNMEKLAGTLIAVNGGSC 327
Query: 341 ----YYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR 396
Y ++ + + + D D G +W +Q C E G+ S +
Sbjct: 328 INVKYEDFIDFMRNEEWSVDDD--GYRQWIFQTCNEFGWYQTGNLWGSFLP-----VEFF 380
Query: 397 ARYCKEAYGVDPRPNWI------TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
C + YG + + + +F G K SN I +G DPW G+L
Sbjct: 381 VEQCTDVYGAEFTSEKVYSSAKYSNDFYGAK-----NPSLSNTIITHGSFDPWHPMGILN 435
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKE-DPQWLKDVRRREVEIIGKWI 496
+++ +V A V +H DL+ + D L VR+ E I KWI
Sbjct: 436 DMNDSVKAFVINGTSHCFDLQPANPLFDSDQLTHVRKTTFEYIKKWI 482
>gi|229594586|ref|XP_001032708.3| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|225566764|gb|EAR85045.3| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 475
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 128/467 (27%), Positives = 193/467 (41%), Gaps = 56/467 (11%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y Q LDH Y P +T+ QRY + D HW + +Y EG+ N+ F
Sbjct: 34 YFQQKLDH--YAPLDNRTWAQRYFVMD-HWFNKTAQPLVILYICGEGECNGVQYNSSFTS 90
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYG-GNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
+A +++ +EHR+YGKS P+G GN A N YL++ QAL D A I +K
Sbjct: 91 KIAEIHNGIVLSLEHRFYGKSQPFGFGNDSYALPNLK---YLTAQQALNDLAWFIQYVKD 147
Query: 176 NL---TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
N + P + GGSY G L+AWFR K+PH+ IGALASSA + + + Y F
Sbjct: 148 NQLFGITPNMPWITIGGSYPGALSAWFRYKFPHLTIGALASSAVVNAYADF---YEFDQQ 204
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
I+ S NC +++ + KK +K Q L F
Sbjct: 205 ISDSLSKNSGNCRQIVHDINVNVTNILKKGTPQQKQQ-----------------LKAYFN 247
Query: 293 YTAMTDYPTPSNFL---NPLPAFPVKEMCK-AIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
T +TD +F+ + + V+ + A+ D N FA + + Y G
Sbjct: 248 STLITD----GDFMFYFSDITVMGVQYGSRVAMCDLLMSNQTFAGVLQNLATYALQVGVT 303
Query: 349 K----CFDLNGDSDPH--GLSEWGWQACTEM-IMLTGGDNKDSIFEESEEDYDARARYCK 401
+ L + H +W +Q C+E + T + E Y +C
Sbjct: 304 PDQYGAYYLRNTTYSHERNARQWYYQVCSEFGWLFTPAKHYPMRSEILTMSY--WTEWCN 361
Query: 402 EAY-GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
AY G P FGG I A+N+IF NG DPW + + +
Sbjct: 362 SAYDGAFPNTEVTNNYFGGLDIQ------ATNLIFTNGGEDPWQWASKRTPTLPGMQSYI 415
Query: 461 E--KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+ AH VDLR + D LK++R + + W ++++ +Q
Sbjct: 416 ADCDQCAHCVDLRTPSPNDSPILKEIRNKTLSSFATWKNEFYAKQSQ 462
>gi|145351417|ref|XP_001420076.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580309|gb|ABO98369.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 538
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 126/493 (25%), Positives = 211/493 (42%), Gaps = 56/493 (11%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
+S++ S G ++ + Q LDHF+ + ++ QRY +ND + ++ AP+FV G EG
Sbjct: 47 ASARTSIGAHERWFAEQRLDHFDNALNA--SWTQRYFVNDA-YASAERGAPVFVCVGGEG 103
Query: 104 ---DIEWF---AQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYL 157
D++ ++ +A K + L +EHR+YGKS P G + + +L
Sbjct: 104 PALDVDVAVDGGEHCAIATALAKKHRGLFFALEHRFYGKSQPTGD------LSVESLRFL 157
Query: 158 SSTQALAD------YASLIIDLK---KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
SS QAL D +A+ L+ +N S V+ FGGSY GMLAAW R+K+PHV
Sbjct: 158 SSAQALEDLVTFTRFAAAAYGLEIEPRNDGRKYSKVIAFGGSYPGMLAAWSRVKFPHVFH 217
Query: 209 GALASSAPILNFDNIVSPYS-FSNIITQDFRSVSENCYKVIKGSWK-QIEETAKKPGGLE 266
A+ASSAP+ ++ Y + + + S+ CY ++ ++ ++ E K G
Sbjct: 218 AAVASSAPVRAQIDMRGYYEVVGDALREKDVGGSDACYTAVENAFTVRLNEALKTSSGRR 277
Query: 267 KLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPT----PSNFLNPLPAFPVKEMCKAID 322
L+K F +C E + + A V AM +P PS + F + + C +
Sbjct: 278 ALEKQFNVCGDEALDGVGARDDFADVLRAM--FPAQNNDPSCLADDDSCFNIAKACTIM- 334
Query: 323 DPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDS----------DPHGLSE--WGWQAC 370
G D A L + + +C L+ ++ +P G E W WQ C
Sbjct: 335 -TSHGEDKLAALAAHVAAVFR----GECVSLDSEAYMRELKSEIPNPKGEGERQWTWQTC 389
Query: 371 TEMIMLTGGDNKDSI---FEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLK 427
TE + + +A + C + +GV + E + G +
Sbjct: 390 TEFAFFQTCEKSSKCPFKLDPPTMPLEAYHKICADVFGVSAEQTRLAVERSNARYGSITP 449
Query: 428 RFASNIIFFNGLRDPWSGGGVLKNI--SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
+ I+F +G DPW + + + AL+ K +HH D L + R
Sbjct: 450 -GGTRIMFPSGSIDPWIANSFVSDTFAPRFEPALIVKGASHHAWTHPPKDTDTDALVEAR 508
Query: 486 RREVEIIGKWISQ 498
V + KW+++
Sbjct: 509 AIIVGQVEKWLNE 521
>gi|326674064|ref|XP_002664605.2| PREDICTED: thymus-specific serine protease-like [Danio rerio]
Length = 581
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 117/455 (25%), Positives = 194/455 (42%), Gaps = 56/455 (12%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q +DHF+ Q+ +T Q Y +ND +W +++ P+F+Y G EG + F+ G ++A
Sbjct: 158 QPVDHFDR--QNDKTLPQTYFVNDVYW--QRSDGPVFLYIGGEGPLSKFSVLFGHHVEMA 213
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
+ ALLV +EHR+YG+SI G + ++ LSS QALAD A+ + + +
Sbjct: 214 ERHGALLVALEHRFYGESINPDGLETDKLRD------LSSQQALADLAAFHHYISQRFSL 267
Query: 180 TDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
+ + FGGSY G L+AW R K+PH+ GA+ASSAP+ DF
Sbjct: 268 SHRNTWISFGGSYAGALSAWLRGKFPHLIYGAVASSAPVY--------------AVLDFS 313
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWL--STAFVYTAM 296
S + C +KG++ + E A G ++ K F C++ L ++ L S A V+
Sbjct: 314 SYNR-CVAEVKGAFAAV-EAALLMGNETEVGKEFGCCETPLKLEDKTELLHSLADVFMGT 371
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAI---DDPK-TGNDVFAKLYGAASVYYNYSGTAKCFD 352
Y AF + E+C + DP+ D KL + Y C D
Sbjct: 372 VQYNEQG------VAFSIAELCDIMTNKSDPREEAYDRLVKLV----MMYRARENLPCLD 421
Query: 353 ---------LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEA 403
LN + +W +Q CTE ++ F ++ C
Sbjct: 422 VSHEKLFLELNNTTATSSYRQWFYQTCTEFGFYQTCEDDSCPFSR-RFTLQSQTELCSRL 480
Query: 404 YGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN--ISKTVLALVE 461
+ + ++ +F G + +++ NG DPW+ V+ N ++ +
Sbjct: 481 FNISQDSLLVSIDFTNQYYGGNQPQ-TQRVLYVNGNIDPWAALSVVWNETMADNDRVIFI 539
Query: 462 KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
AH +D+ + L R+R E + W+
Sbjct: 540 NGTAHCMDMNSEKSVNKPALHQARKRIEERVTTWL 574
>gi|183230234|ref|XP_656762.2| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
gi|169802986|gb|EAL51377.2| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
Length = 480
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 129/521 (24%), Positives = 221/521 (42%), Gaps = 64/521 (12%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDH 64
+ L+ C++ S N ++ S +T E L+ + S + +T LDH
Sbjct: 1 MLLLALCVICSYAKLSLNQQVMQQISQSFMTLE-LNEVESMT-----------YTVPLDH 48
Query: 65 FNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-WFAQNTGFMYDVAPKFK 123
FN N Q+ F +Y +N + NAP+FV G EG QN + +A K K
Sbjct: 49 FNANNQN--DFDIQYFVNKKFLDANDPNAPLFVLLGGEGPASPKVLQNNYVIDSLAKKHK 106
Query: 124 ALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSP 183
L++ +EHR+YG S P ++ Y ++ QAL DY +I +++ P
Sbjct: 107 GLMLSVEHRFYGASTPSLEMDKLI--------YCTAEQALMDYVEVISHVQEENNLVGHP 158
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN 243
V+V GGSY G LAAW R KYP+V GA ASSAP+ + +V Y + ++ ++ +N
Sbjct: 159 VIVLGGSYSGNLAAWMRQKYPNVVEGAWASSAPV---EAVVDFYQYLEVVQ---NALPKN 212
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPS 303
++ +++Q ++ G ++L K F C I+++ + + TA++ Y
Sbjct: 213 TADLLSFAFEQWDKMTTTEEGRKELGKIFNTCTEFGEKDIQTFAES--IGTALSGY-VQY 269
Query: 304 NFLNPLPAFPVKE-MCKAIDDPKTGN--------------DVFAKLYGAASVYYNYSGTA 348
N N P++ + +C I++ D Y T+
Sbjct: 270 NSSNWKPSYESTDSICAEINEDIVNKYPLFIKEKYNPEWADKECTPSSQEESYKTLQNTS 329
Query: 349 KCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDP 408
+ G+ D G S W +Q C + S+ CK+ YG+D
Sbjct: 330 TYAE--GNEDASGRS-WFFQTCIAYGYYQAVSEQSSVKWGKLNQLQGSIDMCKDIYGIDK 386
Query: 409 RP-----NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK--TVLALVE 461
+ I +GG K + +N+ F NG DPW GV ++ + ++ L++
Sbjct: 387 DTLYNAVDHINVRYGGKKPCV------TNVAFTNGNTDPWHALGVTESDHQEGNLVQLID 440
Query: 462 KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
+ +H DL + D LK R E++ + ++ Q+
Sbjct: 441 R-TSHCSDLYSEKENDVPELKKARHNELKFFAQVLANVPQN 480
>gi|167375432|ref|XP_001733641.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165905185|gb|EDR30254.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 480
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 127/476 (26%), Positives = 207/476 (43%), Gaps = 60/476 (12%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA-QNT 112
KT +T LDHFN N Q F +Y I+ + + NAP+FV G EG + QN
Sbjct: 38 KTLTYTVPLDHFNVNNQI--DFDIQYFISTDYLDNNSPNAPLFVLLGGEGPEDATGLQNY 95
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
+ D+A K K L++ +EHR+YG S P ++ Y A QAL DY +I
Sbjct: 96 FVVTDLAKKHKGLMLSVEHRFYGASTPSLEMDKLIYCTAE--------QALMDYVEVISH 147
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+++ PV+V GGSY G LAAW R KYP+V GA ASSAP+ + +V Y + +
Sbjct: 148 VQEENNLVGHPVIVLGGSYSGNLAAWMRQKYPNVVEGAWASSAPV---EAVVDFYQYLEV 204
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
+ ++ +N ++ ++++ +E G ++L K F C I+++ + +
Sbjct: 205 VQ---NALPKNTADLLSFAFEKWDEMTTTEEGRKELGKIFNTCTEFGEKDIQTFAES--I 259
Query: 293 YTAMTDYP--TPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYS-GTAK 349
TA++ Y SN+ + + +C I++ DV K YN G +
Sbjct: 260 GTALSGYVQYNSSNWKSSYES--TDSICTEINE-----DVVNKYPLFIKEKYNPEWGDKE 312
Query: 350 CFD----------------LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDY 393
C G+ G S W +Q C + S+
Sbjct: 313 CTSSSQEESYKTLQSTSTYAEGNEGAAGRS-WFFQTCIAYGYYQAVSEQSSVKWGKLNQL 371
Query: 394 DARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV 448
CK+ YG+D + I +GG K + +N+ F NG DPW GV
Sbjct: 372 QGSIDMCKDIYGIDKDTLYNAVDHINVRYGGKKPCV------TNVAFTNGNTDPWHALGV 425
Query: 449 LKNISK--TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
++ + ++ L+++ +H DL + D LK R E++ I + ++ Q+
Sbjct: 426 TESDHQEGNLVQLIDRT-SHCSDLYSEKETDVPELKKARHNELKFIAQVLANVPQN 480
>gi|348523600|ref|XP_003449311.1| PREDICTED: thymus-specific serine protease-like [Oreochromis
niloticus]
Length = 641
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 125/470 (26%), Positives = 211/470 (44%), Gaps = 61/470 (12%)
Query: 50 QGLYKTK--YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW 107
Q L++ K + Q LDHFN Q+ TF QR+ +N+ +W ++ P+F+Y G EG +
Sbjct: 52 QPLHQVKEGWIVQPLDHFNQ--QNSNTFPQRFFVNEAYW--QHHDGPVFLYIGGEGPLVE 107
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
+ TG D+A + ALL+ +EHR+YG SI G K + G LSS QALAD A
Sbjct: 108 YDVLTGHHSDMAEEHGALLLALEHRFYGDSINPDG-----LKTENLAG-LSSQQALADLA 161
Query: 168 SL--IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDN 222
+ I NLT ++ + FGGSY G L+AWFR K+P++ GA+ASSAP+ L+F
Sbjct: 162 TFHQYISQSFNLTHRNT-WISFGGSYSGALSAWFRGKFPNLVHGAVASSAPVKAKLDFSE 220
Query: 223 IVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL- 281
+ S + + SE C ++ ++ ++E A G + ++ F C+ K+
Sbjct: 221 YNNVVGLS--LLNEAVGGSEKCLSKVRQAFAAVKE-ALMSGNINQVASDFGCCQIPKDPY 277
Query: 282 -AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP----KTGNDVFAKLYG 336
IE S A ++ Y ++ + E+C + + + + + +L
Sbjct: 278 DQIELMQSLADIFMGAVQYNEEGVLMS------INELCGIMTNSSQEYQDEMEAYNRLVK 331
Query: 337 AASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQACTEMIMLTGGDNKDS 384
+ + Y ++ C D++ D+ H G +W +Q CTE ++
Sbjct: 332 LSQI-YRFTSKEPCLDISYEKSMKDLMDTSVHAGRRGERQWTYQTCTEFGFYQTCEDATC 390
Query: 385 IFEESEEDYDARARYCKEAYGVD-----PRPNWITTEFGGHKIGLVLKRFASNIIFFNGL 439
F D + + C +G+ R + T +GG +I++ NG
Sbjct: 391 PFSGMLTLQD-QTKLCTTLFGISQHSLPARIAFTNTYYGGD------NPHTHSILYVNGG 443
Query: 440 RDPWSGGGVLKNIS---KTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRR 486
DPW V+++ + + + K+ AH D+ D L R+
Sbjct: 444 IDPWKTLSVVQDGTEEGEEAQTVFIKDTAHCADMSSRRVTDRSSLTKARQ 493
>gi|355561421|gb|EHH18053.1| Thymus-specific serine protease [Macaca mulatta]
Length = 514
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 121/486 (24%), Positives = 204/486 (41%), Gaps = 62/486 (12%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +N+ HW G + PIF++ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVSDR--RSFLQRYWVNEQHWVG--EDGPIFLHLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + +LSS A
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLR------FLSSRHA 156
Query: 163 LADYASLIIDLKKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---IL 218
LAD S + L + ++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L
Sbjct: 157 LADVVSARLALSRLFNVSSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVL 216
Query: 219 NF---DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC 275
+F +++VS S I + L A
Sbjct: 217 DFSEYNDVVSRSLMSTAIGGSLEXXXXXXXXXXXXX-XXLRLGGAAQAALRSELSACGTL 275
Query: 276 KSEKNLA-----IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP---KTG 327
S +N A +++ + A Y P V+++C + ++
Sbjct: 276 GSAENQAELLGSLQALVGGAVQYDGQAGAP-----------LSVRQLCGLLLGGGGNRSH 324
Query: 328 NDVFAKLYGAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQACTEMIM 375
+ + L A + + G KC + ++P G +W +Q CTE
Sbjct: 325 STPYCGLRRAVQIVLHSLGQ-KCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGF 383
Query: 376 LTGGDNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIGLVLKRFA 430
N F + ++ C++ +G V + +GG G A
Sbjct: 384 YVTCGNPRCPFSQLPA-LPSQLDLCEKVFGLSALSVAQAVAQTNSYYGGQTPG------A 436
Query: 431 SNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVE 490
+ ++F NG DPW V + + + AL+ + G+H +D+ D L+ R+ +
Sbjct: 437 NQVLFVNGDTDPWHVLSVTQALGSSESALLIRTGSHCLDMAPERPSDSPSLRLGRQNIFQ 496
Query: 491 IIGKWI 496
+ W+
Sbjct: 497 QLQTWL 502
>gi|256574605|dbj|BAH98108.1| serine protease [Entamoeba invadens]
Length = 465
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 115/471 (24%), Positives = 200/471 (42%), Gaps = 43/471 (9%)
Query: 38 KLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFV 97
+L+ L S + + +T ++ +DHFN N + F+ +Y +++ + G+ ++P+FV
Sbjct: 21 RLNKLARSVETNTSEIETHTYSVPMDHFNANND--EEFEVKYFVSEKYLDGTDLHSPLFV 78
Query: 98 YTGNEGDIEWFAQNTGFMYD-VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY 156
G EG + ++ D +A + L++ IEHR+YG S P ++ Y A
Sbjct: 79 MLGGEGPESSKTLDNHYIIDTLAARTNGLMLAIEHRFYGDSTPSLKMDKLIYCTAE---- 134
Query: 157 LSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP 216
QA+ DY +I +++ D PV+V GGSY G LAAW R KYP+V GA ASSAP
Sbjct: 135 ----QAMMDYIEIITYIQETRNFIDHPVIVIGGSYSGNLAAWMRQKYPNVVDGAWASSAP 190
Query: 217 ILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
+ + V Y + ++ + + + + W Q+ T G ++L+K F C
Sbjct: 191 V---EAQVDFYQYLEVVQAGLPANTADLLSIAFEKWDQMTVT---ESGRKELKKVFNTCT 244
Query: 277 SEKNLAIESWLSTAFVYTAMTDY---PTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK 333
I+++ T A TD + + P F + M DP+
Sbjct: 245 DFGEDDIQTFAETIGTALAGTDSICNEIIEDVNSKYPNFVINRM-----DPEWAGSTCT- 298
Query: 334 LYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDY 393
+S+ +Y G +D W +Q C K S+
Sbjct: 299 ---PSSLDESYKGLMDTTLYKDGNDEAAGRSWVFQTCIAYGYYQVVSEKSSVKFGKLNKL 355
Query: 394 DARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV 448
D + C + Y +D + + I +GG + +N+ F NG DPW GV
Sbjct: 356 DGSIKMCHDIYNIDNQTLYNAVDHINVRYGGKNPKV------TNVAFTNGGTDPWHALGV 409
Query: 449 LKNISK--TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ + ++ L+++ +H DL + D LK R +E+ + ++
Sbjct: 410 TQQEGQDGNLVNLIDRT-SHCSDLYIEKETDVPALKLARHKELRFFDQVLA 459
>gi|255581402|ref|XP_002531509.1| catalytic, putative [Ricinus communis]
gi|223528862|gb|EEF30863.1| catalytic, putative [Ricinus communis]
Length = 482
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 137/477 (28%), Positives = 203/477 (42%), Gaps = 76/477 (15%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + QILDH Y+P ++ FQQRY ++ PIF+ E A +
Sbjct: 41 KELWFNQILDH--YSPYDHRRFQQRYYEYLDYFRAP--GGPIFLKICGESSCNGIAND-- 94
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL---- 169
++ +A KF A +V +EHRYYGKS P+ K KN YLSS QAL D A
Sbjct: 95 YISVLAKKFGAAVVSLEHRYYGKSTPF---KSSETKNLR---YLSSKQALFDLAVFRQHY 148
Query: 170 --IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
++LK N T ++P +VFG SY G L+AW+RLK+PH+ G++ASSA +L + Y
Sbjct: 149 QEALNLKLNRTNVENPWIVFGISYSGALSAWYRLKFPHLTCGSVASSAVVL------AVY 202
Query: 228 SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES-- 285
+F+ Q S C ++ + + ++E L +KA + + L I+
Sbjct: 203 NFTEFDQQIGESAGAECKAALQETTQLVDER------LASNRKAVKTLFNAAELEIDGDF 256
Query: 286 --WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD-----PKTGNDVFAKLYGAA 338
+L+ A V Y P +PL E KA +D K + + +G +
Sbjct: 257 LYFLADAAVIAFQ--YGNPDKLCSPL-----VEAKKAGEDLVEAYAKYVKEYYVGSFGVS 309
Query: 339 SVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
YN SD W +Q CTE+ + DSI S+ +
Sbjct: 310 VETYNQKHLKDTAINENSSD----RLWWFQVCTEVAYFQVAPSNDSI-RSSKVNTRYHLD 364
Query: 399 YCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK-- 454
CK +G P TT +GG KI S I+F NG +DPW K IS
Sbjct: 365 LCKNVFGEGIYPEVDTTNIYYGGTKIA------GSKIVFTNGSQDPWRHAS--KQISSPD 416
Query: 455 -TVLALVEKEGAHHVDLRFSTK------------EDPQWLKDVRRREVEIIGKWISQ 498
+ H D+R + P ++ VR++ +E I W+S+
Sbjct: 417 TPSYIITCHNCGHGTDMRGCPQSPLSLEGNAQNCSSPDAVQKVRQQVIEHIDLWLSE 473
>gi|410958405|ref|XP_003985809.1| PREDICTED: thymus-specific serine protease [Felis catus]
Length = 416
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 101/326 (30%), Positives = 160/326 (49%), Gaps = 43/326 (13%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + Q LD FN + Q ++F QRY +ND HW + + P+F++ G EG + + G
Sbjct: 57 KQGWLEQPLDPFNTSDQ--RSFLQRYWVNDQHW--ASRHGPVFLHLGGEGSLRPGSVTRG 112
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+AP + AL++ +EHR+YG SIP GG + + +LSS ALAD AS + L
Sbjct: 113 HPAALAPAWGALVIGLEHRFYGLSIPAGG------LDVAQLRFLSSRHALADVASARLAL 166
Query: 174 KKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---ILNF---DNIVSP 226
+ ++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L+F + +VS
Sbjct: 167 GRLFNVSSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDFSEYNEVVSR 226
Query: 227 YSFSNIITQD----FRSVSENCYKVIKG-SWKQIEETAKKPGGLEKLQKAFRICKSEKNL 281
S + S+ + C + + Q++ T + G+ Q ++ C
Sbjct: 227 SLTSAAVGGSPEVVMHSLGQRCLSFSRAETVAQLKVTESQVSGVGDRQWLYQTC------ 280
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVK-EMCKAIDDPKTGNDVFAKLYGAASV 340
T F + + PS + +PA P + E+C+ + T + A+ +
Sbjct: 281 -------TEFGFYVTCE--DPSCPFSRVPALPSQLELCQQVFGLSTSS--VAQAVTQTNS 329
Query: 341 YY--NYSGTAKCFDLNGDSDP-HGLS 363
YY G + +NGD+DP H LS
Sbjct: 330 YYGGQTPGATQVLFVNGDTDPWHALS 355
>gi|330803162|ref|XP_003289578.1| hypothetical protein DICPUDRAFT_36006 [Dictyostelium purpureum]
gi|325080335|gb|EGC33895.1| hypothetical protein DICPUDRAFT_36006 [Dictyostelium purpureum]
Length = 485
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 127/456 (27%), Positives = 206/456 (45%), Gaps = 57/456 (12%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
++ TQ +DHFN + TFQQRYLIND ++ G+ P+F+ EG + TG
Sbjct: 52 QWFTQNVDHFNI--VNTDTFQQRYLINDQYYDGT---GPVFIMINGEGPMG-LDTVTGLQ 105
Query: 116 YDV-APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL--IID 172
+ V A + AL+V +EHRYYG S +++ N +L+S QALAD A I
Sbjct: 106 FVVWAKQLNALIVSLEHRYYGASFV---TSDLSLDNLQ---FLNSQQALADNAVFREFIA 159
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSF 229
K N+ +T + V FGGSY G L +WFR+KYPH+ +ASS P+ +NF Y +
Sbjct: 160 QKYNIPST-TKWVSFGGSYSGALTSWFRIKYPHLVDITIASSGPVNPEVNF------YQY 212
Query: 230 SNIITQDFRSVS--ENCYKVIKGSWKQIEETAKKP--GGLEKLQKAFRICKSEKNLAIES 285
++ + + C + I + +++ ++ GG+E L F +C +N +
Sbjct: 213 LQVVQNSLQQTNGGAECVQNIAIATDKVQSLLQQDNYGGVETL---FDLCSQLENANDVA 269
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPA-FPVKEMCKAI----DDPKTGN-DVFAKLYGAAS 339
F+ + ++ + N P + +C + DP T ++ + G
Sbjct: 270 ----NFMNSLAGNFMGVVQYNNEEPGQVNTQNLCDIMTNNTQDPLTNYIQLWNQFAGGEC 325
Query: 340 VYYNYSG-TAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEE-DYDARA 397
V +YS A+ ++ D+ G W +Q CTE D S + + +
Sbjct: 326 VDVSYSSLVAESQNITNDATAIGGRMWMYQTCTEFGYYQSSDGASSTQPFGDLFGFAFQL 385
Query: 398 RYCKEAYGVD---PRPNWITTEFGGHKIGLVLKRFASNI---IFFNGLRDPWSGGGVLK- 450
+ C + +GV P NW TE+GG L S+I ++ NGL DPW G+
Sbjct: 386 QQCADIFGVPNMAPNTNWTLTEYGG------LSPAPSSITTTLYVNGLIDPWHALGITPV 439
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRR 486
++ +L+ AH D+ T P L ++
Sbjct: 440 SVPSIKNSLLITGTAHCADMMIPTSVSPSTLAPAQQ 475
>gi|194744691|ref|XP_001954826.1| GF16546 [Drosophila ananassae]
gi|190627863|gb|EDV43387.1| GF16546 [Drosophila ananassae]
Length = 473
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 121/476 (25%), Positives = 212/476 (44%), Gaps = 60/476 (12%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+ S+++ + +T + Q LDHF+ +T+Q RY++ND + ++ P+F+Y G E
Sbjct: 35 LPSNQNRADIVETLWIEQKLDHFD--EAETRTWQMRYMLNDAVY---QSGGPLFIYLGGE 89
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
+I G MYD+A + LL + EHRYYG+S P +++ +N YL+ Q+
Sbjct: 90 WEISSGRITGGHMYDMAKEHNGLLAYTEHRYYGQSKPL---PDLSNENIK---YLTVNQS 143
Query: 163 LADYASLIIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFD 221
LAD A I +K ++S V++ GGSY + WF+ YP + G ASSAP+
Sbjct: 144 LADLAHFITSIKATHEGLSESKVIIVGGSYSATMVTWFKKTYPDLVAGGWASSAPLFAKV 203
Query: 222 NIVSPYSFSNIITQDFRSVSEN-CYKVIKGSWKQIEET-AKKPGGLEKLQKAFRICKSEK 279
N V + I Q + + CYK I+ ++E+ A K GG +++ ++C+
Sbjct: 204 NFV---EYKEITGQSIELMGGSACYKRIENGIAEMEDMFATKRGG--EVKALLKLCEP-- 256
Query: 280 NLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAAS 339
+ S L +++ ++D + + A ++ +C+ I G+ + L G AS
Sbjct: 257 -FDVSSDLDIWTLFSEISD--IFAGVVQTHNAGQIEGVCQQI---MAGS---SDLIGVAS 307
Query: 340 VYYN--YSGTAKCFDLNGDS----------DPHGLSEWGWQACTEMIMLTGGDNKDSIFE 387
+ KC+DL+ D + + + +W +Q C E + D F
Sbjct: 308 YLLDEFAESGGKCYDLSYDGITGVLLDTNYNGNIMRQWIFQTCNEYGWYQTSGSADQPF- 366
Query: 388 ESEEDYDARARYCKEAYGVDPRPNWITTE-------FGGHKIGLVLKRFASNIIFFNGLR 440
++ C + YG +IT + +GG G+ N+ +G
Sbjct: 367 GTKFPVTYYTTMCADLYGSKYSNEFITNQVSTTNAYYGGLSPGV------ENVYLTHGQL 420
Query: 441 DPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
DPW G+ T+L E AH D + D +K + R E++ +W+
Sbjct: 421 DPWRAMGIQDADQATIL----PEYAHCKDFNSISSSDSAEMKASKERIAELVREWV 472
>gi|167390072|ref|XP_001739195.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897172|gb|EDR24409.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 480
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 134/525 (25%), Positives = 221/525 (42%), Gaps = 72/525 (13%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDH 64
+ L+ C++ S N K+ S +T E L+ + S + +T LDH
Sbjct: 1 MLLLALCVICSYAKLSLNQKVMQRISQSFMTLE-LNEVESMT-----------YTVPLDH 48
Query: 65 FNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-WFAQNTGFMYDVAPKFK 123
FN N Q+ F Y +N + NAP+FV G EG QN + +A K K
Sbjct: 49 FNANNQN--DFDIHYFVNKEFLDTNDPNAPLFVLLGGEGPASPKVLQNNYVIDSLAKKHK 106
Query: 124 ALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSP 183
L++ +EHR+YG S P ++ Y A QAL DY +I +++ P
Sbjct: 107 GLMLSVEHRFYGASTPSLEMDKLIYCTAE--------QALMDYVEVISHVQEENNLVGHP 158
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN 243
V+V GGSY G LAAW R KYP+V GA ASSAP+ + +V Y + ++ ++ +N
Sbjct: 159 VIVLGGSYSGNLAAWMRQKYPNVVEGAWASSAPV---EAVVDFYQYLEVVQ---NALPKN 212
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYP--T 301
++ ++++ +E G ++L K F C I+++ + + TA++ Y
Sbjct: 213 TADLLSFAFEKWDEMTTTEEGRKELGKIFNTCTEFGEKDIQTFAES--IGTALSGYVQYN 270
Query: 302 PSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYS-GTAKCFD-------- 352
SN+ + + +C I++ DV K YN G +C
Sbjct: 271 SSNWKSSYES--TDSICTEINE-----DVVNKYPLFIKEKYNPEWGDKECTSSSQEESYK 323
Query: 353 --------LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
G+ G S W +Q C + S+ CK+ Y
Sbjct: 324 TLQSTSTYAEGNEGAAGRS-WFFQTCIAYGYYQAVSEQSSVKWGKLNQLQGSIDMCKDIY 382
Query: 405 GVDPRP-----NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK--TVL 457
G+D + I +GG K + +N+ F NG DPW GV ++ + ++
Sbjct: 383 GIDKDTLYNAVDHINVRYGGKKPCV------TNVAFTNGNTDPWHALGVTESDHQEGNLV 436
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
L+++ +H DL + D LK R E++ I + ++ Q+
Sbjct: 437 QLIDR-TSHCSDLYSEKETDVPELKKARHNELKFIAQVLANVPQN 480
>gi|224108896|ref|XP_002333330.1| predicted protein [Populus trichocarpa]
gi|222836225|gb|EEE74646.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 142/512 (27%), Positives = 217/512 (42%), Gaps = 85/512 (16%)
Query: 25 IFPTFPSSRITPEKL-SSLISSSKDSQGLYKTK--YHTQILDHFNYNPQSYQTFQQRYLI 81
+F T + + P L S SS +S TK + Q LDH Y+P + FQQRY
Sbjct: 17 LFSTASHAYLAPRTLLYSFSKSSTNSNRSLTTKELWFNQTLDH--YSPFDHHKFQQRYYE 74
Query: 82 NDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYG 141
++ + PIF+ E + A + ++ +A KF A +V +EHRYYGKS P+
Sbjct: 75 FLDYF--RVPDGPIFLKICGESSCDGIAND--YIGVLAKKFGAAVVSLEHRYYGKSSPF- 129
Query: 142 GNKEIAYKNASTTGYLSSTQALADYASL------IIDLKKNLTATDSPVVVFGGSYGGML 195
K +N YLSS QAL D A ++LK N T ++P VFG SY G L
Sbjct: 130 --KSTTTENLR---YLSSKQALFDLAVFRQYYQESLNLKLNRTGVENPWFVFGVSYSGAL 184
Query: 196 AAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQI 255
+AWFRLK+PH+ G+LASSA +L + Y+++ Q S C ++ + + +
Sbjct: 185 SAWFRLKFPHLTCGSLASSAVVL------AVYNYTEFDQQIGESAGAECKAALQETTQLV 238
Query: 256 EETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVK 315
EE L +KA + L I+ F+Y T + NP
Sbjct: 239 EER------LASNKKAVKTLFDAAELEIDG----DFLYFLADAAATAFQYGNP------D 282
Query: 316 EMCKAIDDPKTGNDVFAKLYGAASVYYNYSG----TAKCFD--------LNGDSDPHGLS 363
++C + K + + Y A V +Y G + + +D LN +S G
Sbjct: 283 KLCPPLVQAKKDGEDLVEAY-AKYVKEDYVGSFGVSVQTYDQRHLKDTTLNENS---GDR 338
Query: 364 EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY--GVDPRPNWITTEFGGHK 421
W +Q CTE+ DSI S+ D CK+ + G+ P + +GG
Sbjct: 339 LWWFQVCTEVAYFQVAPANDSI-RSSQVDTRYHLDLCKKVFGEGIYPEVDKTNIYYGGTN 397
Query: 422 IGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL---ALVEKEGAHHVDLRFSTKE-- 476
+ S I+F NG +DPW K IS + + H D+R +
Sbjct: 398 MA------GSKIVFTNGSQDPWRHAS--KQISSPDMPSFVMSCHNCGHGTDMRGCPQSPF 449
Query: 477 ----------DPQWLKDVRRREVEIIGKWISQ 498
P ++ VR + +E + W+S+
Sbjct: 450 NIEGNARNCGSPDAVEKVRHQIIEKMDLWLSE 481
>gi|441593763|ref|XP_004087106.1| PREDICTED: thymus-specific serine protease isoform 2 [Nomascus
leucogenys]
Length = 541
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 118/512 (23%), Positives = 215/512 (41%), Gaps = 87/512 (16%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +ND HW G + PIF++ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVSDR--RSFLQRYWVNDQHWVG--QDGPIFLHLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + S+ + +
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLRFLSSRHAMGKSSG 162
Query: 163 ---------------LADYASLIIDLKKNL-TATDSPVVVFGGSYGGMLAAWFRLK---- 202
LAD S + L + ++ SP + FGGSY G LAAW RLK
Sbjct: 163 IPSDEDRPSPPFDPRLADVVSARLALSRLFNVSSSSPWICFGGSYAGSLAAWARLKVLRL 222
Query: 203 --YPHVAIGALASSAP---ILNFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWK 253
+PH+ ++ASSAP +L+F +++++++ S S C + ++
Sbjct: 223 LRFPHLIFASVASSAPVRAVLDFSE------YNDVVSRSLMSTEIGGSLECRAAVSVAFA 276
Query: 254 QIEETAKKPGGLE-KLQKAFRIC--------KSEKNLAIESWLSTAFVYTAMTDYPTPSN 304
++E + G + L+ +C ++E A+++ + Y P
Sbjct: 277 EVERRLRSGGAAQAALRTELSVCGPLGRAENQAELLGALQALVGGVVQYDGQAGVP---- 332
Query: 305 FLNPLPAFPVKEMCKAIDDP---KTGNDVFAKLYGAASVYYNYSGTAKCFDLN------- 354
V+++C + ++ + + L A + + G KC +
Sbjct: 333 -------LSVRQLCGLLLGGGGNRSHSTPYCGLRRAVKIVLHSLGQ-KCLSFSRAETVAQ 384
Query: 355 -GDSDPH----GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG---- 405
++P G +W +Q CTE +N F + ++ C++ +G
Sbjct: 385 LRSTEPQLSGVGDRQWLYQTCTEFGFYVTCENPRCPFSQLPA-LPSQLDLCEQVFGLSAL 443
Query: 406 -VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
V + +GG G A+ ++F NG DPW V + + + AL+ + G
Sbjct: 444 SVAQAVAQTNSYYGGQTPG------ANQVLFVNGDTDPWHVLSVTQALGSSESALLIRTG 497
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+H +D+ D L+ R+ + W+
Sbjct: 498 SHCLDMAPERPSDSPSLRQGRQNIFRQLQTWL 529
>gi|115482892|ref|NP_001065039.1| Os10g0511400 [Oryza sativa Japonica Group]
gi|110289389|gb|AAP54577.2| prolyl carboxypeptidase like protein, putative, expressed [Oryza
sativa Japonica Group]
gi|113639648|dbj|BAF26953.1| Os10g0511400 [Oryza sativa Japonica Group]
gi|215697793|dbj|BAG91986.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 507
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 124/476 (26%), Positives = 199/476 (41%), Gaps = 73/476 (15%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+ ++ Q LDHFN P ++ F+QRY ++ K PIF+Y E +
Sbjct: 53 EERWMDQTLDHFN--PTDHRQFKQRYYEFLDYYRAPK--GPIFLYICGESSCNGIPNS-- 106
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL---- 169
++ +A KF A +V EHRYYGKS P+ + + +N +LSS QAL D A
Sbjct: 107 YLAVMAKKFGAAVVSPEHRYYGKSSPF---ESLTTENLR---FLSSKQALFDLAVFRQYY 160
Query: 170 --IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
++ K N + DS VFGGSY G L+AWFRLK+PH+ G+LASS +L S Y
Sbjct: 161 QETLNAKYNRSGADSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL------SVY 214
Query: 228 SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWL 287
++++ Q S C ++ + K ++ G L+ + A + LA +
Sbjct: 215 NYTDFDKQIGESAGPECKAALQETTKLVD------GQLQSGRNAVKQLFGASTLANDGDF 268
Query: 288 STAFVYTAMT--DYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--N 343
A Y P +P+ ++ K G D+ YY
Sbjct: 269 LFLLADAAAIAFQYGNPDALCSPI-----------VEAKKNGTDLVETFARYVKDYYIGT 317
Query: 344 YSGTAKCFD---LNGDSDPHGLSE---WGWQACTEMIMLTGGDNKDSIFEESEEDYDARA 397
+ + +D L + P S W +Q C+E+ DS+ ++ D
Sbjct: 318 FGASVASYDQEYLKNTTPPPAESAYRLWWYQVCSEVAYFQVAPKNDSV-RSAKIDTRYHL 376
Query: 398 RYCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
C+ +G P+ T +GG +I S I+F NG +DPW K+ +
Sbjct: 377 DLCRNVFGEGVYPDVFMTNLYYGGTRIA------GSKIVFANGSQDPWRHASKQKSSKEL 430
Query: 456 VLALVE-------------KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
L+E + H++ S P+ + VR++ V+ I W+S+
Sbjct: 431 PSYLIECSNCGHCSDLSGCPQAPSHIEGDSSNCSSPEAVNKVRKQIVDHIDLWLSE 486
>gi|345314847|ref|XP_001512755.2| PREDICTED: dipeptidyl peptidase 2-like, partial [Ornithorhynchus
anatinus]
Length = 452
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 128/278 (46%), Gaps = 25/278 (8%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
++ +Y Q +DHF++ +T+ QRYLI + W K + P+F YTGNEGDI FA+N+
Sbjct: 31 FQERYFEQTVDHFDFETYGNRTYLQRYLITEKFW--KKGSGPLFFYTGNEGDIWNFAKNS 88
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
F+ ++A AL++F EHRYYGKS+P G + G L+ QALADYA LI
Sbjct: 89 DFILELAAAESALVIFAEHRYYGKSLPLGPGS----IRRGSMGPLTVEQALADYAVLIGA 144
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIG---ALASSAPILNFDNIVSPYSF 229
L++ L A P+V FGGS G A + H I + P V+P
Sbjct: 145 LQRQLGAAGLPLVAFGGSSGPKAEAKDPISALHTRISICQPVQEEGPRTGKCTGVAPRPP 204
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPG---------GLEKLQKAFRICKSEKN 280
+ ++ ++ +W PG + + + C +
Sbjct: 205 QTLPSESPTTLGAGHRPP---AWTGTPSDPGDPGRASLSPPPAAYDPISRGMATCHRLSD 261
Query: 281 LA----IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPV 314
A + + AF AM DYP P++F+ PA PV
Sbjct: 262 GADVDQLLEFARNAFAMIAMMDYPYPTDFMGHFPAHPV 299
Score = 44.7 bits (104), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/67 (37%), Positives = 34/67 (50%), Gaps = 18/67 (26%)
Query: 430 ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREV 489
ASNIIF NG DPW+GGGV E+ G+H DP ++D R+ E
Sbjct: 399 ASNIIFSNGDLDPWAGGGVSP----------ERPGSH--------PADPPSVRDARKLEA 440
Query: 490 EIIGKWI 496
+I +W+
Sbjct: 441 LLIHQWV 447
>gi|426250769|ref|XP_004019106.1| PREDICTED: thymus-specific serine protease [Ovis aries]
Length = 516
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 122/493 (24%), Positives = 215/493 (43%), Gaps = 82/493 (16%)
Query: 48 DSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW 107
DS L K + Q LD FN + + ++F QRY +ND HW + + P+F++ G EG +
Sbjct: 50 DSVTLPKEGWLEQPLDPFNASDR--RSFLQRYWVNDQHW--TSQDGPVFLHLGGEGSLGP 105
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
+ G ++AP + AL++ +EHR+YG SIP G + +LSS ALAD A
Sbjct: 106 GSVMRGHPANLAPIWGALVISLEHRFYGLSIPAEGLDMAQLR------FLSSRHALADAA 159
Query: 168 SLIIDLKK--NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDN 222
S + L + N+++T SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ L+F
Sbjct: 160 SARLTLSRLFNVSST-SPWICFGGSYAGSLAAWARLKFPHLFFASIASSAPVRATLDFSK 218
Query: 223 IVSPYSFSNIITQDFRSV----SENCYKVIKGSWKQIEET-----------AKKPGGLEK 267
+++++++ + S C ++ ++E + + G
Sbjct: 219 ------YNDVVSRSLMNTAIGGSLECRAAASAAFAEVERRGRASRGARAALSVELGACGS 272
Query: 268 LQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK-AIDDPK- 325
L++A ++E A+++ + A Y P V+++C+ + DP
Sbjct: 273 LERAED--QAELLGALQALVGGAVQYDGQAGAP-----------LSVRQLCRFLLGDPGN 319
Query: 326 -----TGNDVFAKLYGAASVYYNYSGTAKCFDLNG------------DSDPHGLSEWGWQ 368
+G + L A V + G +C ++ G +W +Q
Sbjct: 320 CRGNCSGPAPYRGLRRAVQVVTHGLGQ-RCLSISRAETVAQLRVTELQVSSVGDRQWLYQ 378
Query: 369 ACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIG 423
CTE + F + + C++ +G V + +GG G
Sbjct: 379 TCTEFGYYVTCEVPGCPFSQLPA-LPSELELCEQVFGLSTSSVAQAVAQTNSYYGGQTPG 437
Query: 424 LVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKD 483
A+ ++F NG DPW V + + + AL+ +H +D+ D L+
Sbjct: 438 ------ATQVLFVNGDTDPWHVLSVTQPLGPSEPALLIPSASHCLDMAPERPSDSPGLRL 491
Query: 484 VRRREVEIIGKWI 496
R++ + + W+
Sbjct: 492 ARQKISQQLQTWL 504
>gi|224006151|ref|XP_002292036.1| hypothetical protein THAPSDRAFT_29166 [Thalassiosira pseudonana
CCMP1335]
gi|220972555|gb|EED90887.1| hypothetical protein THAPSDRAFT_29166 [Thalassiosira pseudonana
CCMP1335]
Length = 553
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 133/498 (26%), Positives = 224/498 (44%), Gaps = 92/498 (18%)
Query: 48 DSQGLYKTKYHTQILDHFNYN----PQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
D+Q ++ Y Q+++HF+ + P+ + + RY + ++ G +PIF+ G EG
Sbjct: 85 DNQAMF---YADQLVNHFHTDRSITPKDAK-WSNRYYQSTKYYKGP--GSPIFLIVGGEG 138
Query: 104 DIEWFAQNTGFMY-----DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLS 158
A ++G +Y +A +F A ++ IEHR+YG P G + + +
Sbjct: 139 -----ALDSGILYPFVSEHLARRFGAAVIQIEHRFYGPFQPIVGREATVLELLELL---T 190
Query: 159 STQALADYASLIIDLKKNLTATD--------SPVVVFGGSYGGMLAAWFRLKYPHVAIGA 210
QALAD L K+ L ++ PV+ GGSY G L+A FRL YP +
Sbjct: 191 PQQALADMVQLTKHFKELLGCSEFDRHSKKYCPVISVGGSYPGFLSAMFRLVYPDFVDIS 250
Query: 211 LASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQK 270
ASSAP+ +D + + +I+T+ S C K ++ + ++ E K + K
Sbjct: 251 YASSAPLKLYDQTANQNVYYDIVTKAAEHTSPGCAKSVRDALEEASELILKAPSVIDAVK 310
Query: 271 AFRICKSE--------KNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID 322
+ +C K L + ++ F + DY + A+P
Sbjct: 311 SMSMCVDSIPEYIDNLKTLKEDVMMAIGFSF---ADY--------DMDAYP--------- 350
Query: 323 DPKTGNDVFAKLYGAASVY-YNYSGT----AKCFDLNGDSDPHGLSEW----------GW 367
G D+ LY A V+ +N S + AK F+L G +D E+ +
Sbjct: 351 ---PGKDL--GLYKACRVFQHNKSSSMEKVAKFFELLG-TDTEFEREYPTLVGEEEVPDF 404
Query: 368 QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG--VDPRPNWITTEFGGHKIGLV 425
Q CT ++ G +K S+F + + Y+ +YC+ YG V P+P + + G +
Sbjct: 405 QLCTTLVDPIGFSSK-SMFPKRKWTYEGLTKYCQSRYGSEVTPQPYALVEDMGFDDL--- 460
Query: 426 LKRFASNIIFFNGLRDPWSGGGVLKNISKT--VLALVEKEGAHHVDLRFSTKEDPQWLKD 483
+ + AS I+F NGL+D WSG L+ +S+ +L+L + GAHH DL D +D
Sbjct: 461 VGKGASRILFTNGLQDMWSGASYLETVSEANEILSLNFENGAHHSDLSHVGPSDNDS-ED 519
Query: 484 VRRREVE---IIGKWISQ 498
+R V+ I+ KW+ +
Sbjct: 520 IRLGFVKITNILAKWLDE 537
>gi|145491267|ref|XP_001431633.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398738|emb|CAK64235.1| unnamed protein product [Paramecium tetraurelia]
Length = 464
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 126/467 (26%), Positives = 200/467 (42%), Gaps = 71/467 (15%)
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG- 113
T++ TQ LDH +P S + F+QR I + + + A I G EW G
Sbjct: 35 TEWFTQKLDH--NDPTSQEVFKQRVHIYNEYVKDDQPEAVILYICG-----EWTCDGIGK 87
Query: 114 -FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG--YLSSTQALADYASLI 170
+D A + A+++ +EHRYYG+S P+ ++ ST YL+ QAL D A I
Sbjct: 88 GLTFDAAQQLNAVVLVLEHRYYGQSQPF--------EDWSTPNLKYLNIHQALDDIAYFI 139
Query: 171 IDLKKN---LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
+K N D+P + GGSY G L+AWFR KYPH+ IG LASSA + + +
Sbjct: 140 TSIKANGNYNIKPDTPWIHLGGSYPGALSAWFRYKYPHLTIGGLASSAVVRA---VACYH 196
Query: 228 SFSNIITQDFRSVSENCYKVIKGSWKQIE-ETAKKPGGLEKLQKAFRICKSEKNLAIESW 286
+ + S C I+ ++IE E A+ P + ++ AF + + IE
Sbjct: 197 EYDMQVYLSALESSTECADRIQQVNQKIEDELARDP---DAIKAAFGASELQD---IEFL 250
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPA--------FPVKEMCKAIDDPKTGNDVFAKLYGAA 338
A +Y M S + L VK+M + D ++ F +
Sbjct: 251 SMIADIYAGMVQGRKRSKMCDRLAKGSTVEEWFLEVKDMARETVDQESYGSEFLR----- 305
Query: 339 SVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEM-IMLTGGDNKDSIFEESEEDYDARA 397
+ ++S +++ +W +Q C E+ T N + E D
Sbjct: 306 DITIDFSKSSR--------------QWTYQTCIEVGYFQTANPNAEQSTRSQELVLDFFR 351
Query: 398 RYCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK---NI 452
+ C+ +Y + P+ T FGG I + ++IF NG DPW + K
Sbjct: 352 QLCEYSYDIPIFPDEDRTNAYFGGLDINV------DHLIFSNGSDDPWQHASITKWKQGK 405
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
V + K+ +H +DLR S+ EDP L R+ + +WI++Y
Sbjct: 406 EYDVKYIKCKDCSHCIDLRASSPEDPPELTKARQEILATFQQWINEY 452
>gi|343960995|dbj|BAK62087.1| dipeptidyl-peptidase 2 precursor [Pan troglodytes]
Length = 486
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 66/167 (39%), Positives = 98/167 (58%), Gaps = 15/167 (8%)
Query: 340 VYYNYSGTAKCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKDSIFEES 389
+ YN SG+ C+D+ + +DP G W +QACTE+ + +N +F +
Sbjct: 306 LVYNASGSEHCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDL 365
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
+ R RYC + +GV PRP+W+ T F G + R ASNIIF NG DPW+GGG+
Sbjct: 366 PFTDELRQRYCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIR 420
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+N+S +V+A+ + GAHH+DLR S EDP + + R+ E IIG+W+
Sbjct: 421 RNLSASVIAVTIQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 467
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 48/132 (36%), Positives = 75/132 (56%), Gaps = 5/132 (3%)
Query: 194 MLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWK 253
ML+A+ R+KYPH+ GALA+SAP+L + F +T DF S C + ++ +++
Sbjct: 1 MLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTADFEGQSPKCTQGVREAFR 60
Query: 254 QIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPL 309
QI++ + G + ++ F C+ EK+L + + AF AM DYP P++FL PL
Sbjct: 61 QIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFARNAFTVLAMMDYPYPTDFLGPL 119
Query: 310 PAFPVKEMCKAI 321
PA PVK C +
Sbjct: 120 PANPVKVGCDRL 131
>gi|145492429|ref|XP_001432212.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124399322|emb|CAK64815.1| unnamed protein product [Paramecium tetraurelia]
Length = 462
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 128/458 (27%), Positives = 205/458 (44%), Gaps = 52/458 (11%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG---FM 115
TQ+LDH +P + QT+QQRY + ++ +K +++ EW Q G
Sbjct: 35 TQLLDH--SDPANTQTWQQRYHVYSQYFNPTKGGVILYICG------EWNCQGVGDNSLS 86
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
+ +A A+++ +EHR+YG+S P+G + + +N S YL+ QAL D A I+ +K+
Sbjct: 87 FQLAKDLGAIVIALEHRFYGQSQPFGADS-WSLENLS---YLNVHQALDDLAYFILQMKR 142
Query: 176 -NLTATDS--PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L DS P GGSY G L+AWFR KYPH+ +G LASS I + I+ + F +
Sbjct: 143 LKLHNIDSTLPWYAIGGSYPGALSAWFRYKYPHLTVGNLASSGVI---NTILDYWQFDDQ 199
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
I + E C ++ +++ K + ++++ K N W T
Sbjct: 200 IRKSTSKSGEQCPLYLQLLNGYVDKKLKNFNTKQAFKESYNCEKMTDNEFRWFWADTI-- 257
Query: 293 YTAMTDYPTPSNFLNPLPAFP-VKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCF 351
M + F L + P ++ M + I + K YGA YY T
Sbjct: 258 -VQMIQSGQRTRFCQTLESLPSIEAMAEYIKEIALDQGDSYKQYGA---YYLRDETV--- 310
Query: 352 DLNGDSDPHGLSEWGWQACTEMIML-TGGDNKDSIFEESEEDYDARARYCKEAY--GVDP 408
D N + +W +Q C+E+ L T NK+S+ E D +C +AY G
Sbjct: 311 DQNS-----VIRQWYFQCCSELAYLQTPPQNKESL-RSYELTLDWWRVWCNDAYSQGEVI 364
Query: 409 RPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI---SKTVLALVE-K 462
P+ TE FGG + + ++I NG DPW + K SK + L++
Sbjct: 365 WPDVRATEAYFGGLNLNV------DHLIMTNGGEDPWQTASLTKATKANSKVITYLIDCD 418
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
+ AH VDL + ++P L R+ +W Q++
Sbjct: 419 DCAHCVDLGAPSAKEPAVLTSTRQAIKNTFKQWHDQFW 456
>gi|348682257|gb|EGZ22073.1| hypothetical protein PHYSODRAFT_495993 [Phytophthora sojae]
Length = 527
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 124/457 (27%), Positives = 196/457 (42%), Gaps = 67/457 (14%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
+Q LDHF+ + + TF+QRY D W + P+ +Y G EG +E GF++ +
Sbjct: 55 SQQLDHFSSDANA--TFKQRYYEVDEFWKAP--SGPVILYIGGEGALE--QAPAGFVHVI 108
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
A KF A +V +EHR+YGKS+P G Y+ YL+ QALAD ++ L
Sbjct: 109 AQKFGAKIVALEHRFYGKSVPNGDLSTANYR------YLTVQQALADLKHFKESYQRELG 162
Query: 179 ATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSP-YSFSNIITQD 236
A D+ + GGSY G L+AWFR+ YP + +L+SS +V P Y F Q
Sbjct: 163 AEDANQWIAIGGSYPGALSAWFRVAYPDTTVASLSSS-------GVVQPVYKFHQFDEQV 215
Query: 237 FRSVSENCYKVIKGSWKQIEE-----TAKKPGGLEKLQKA-----FRICKSEKNLAIESW 286
+ +C V++ + + E+ A K GL Q+ F + +A++
Sbjct: 216 ALAAGPSCADVLRLTTAEFEKEIASGNATKVKGLFGAQELADPDFFYMIADAAAMAVQYG 275
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSG 346
+ + + + + F + D++ +G+ Y
Sbjct: 276 HKDIVCESMVGAFERNVSLVESFANFTI--------------DMYGASFGSECFY----- 316
Query: 347 TAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG- 405
KC + G S W WQ C+++ + S+ + D D + CK +G
Sbjct: 317 DTKCLAHDQARWGDGRS-WRWQKCSQLAYFQVAPTEKSL-RAAMVDLDYHLKQCKTVFGD 374
Query: 406 -VDPRP--NWITTEFGG-HKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA-LV 460
V+P I+ +GG H G I F NG DPW VL +S +A L
Sbjct: 375 VVNPSEGVEEISKLYGGDHPTG-------HKIFFSNGGDDPWQRASVLDTLSDDEIANLA 427
Query: 461 EKEGAHHV-DLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ E H DLR + + P+ LK R + +E + KW+
Sbjct: 428 KCELCGHCGDLR-ANPDVPEPLKKQREQILEYLTKWL 463
>gi|47224854|emb|CAG06424.1| unnamed protein product [Tetraodon nigroviridis]
Length = 418
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 117/448 (26%), Positives = 193/448 (43%), Gaps = 73/448 (16%)
Query: 77 QRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGK 136
QR+L+N+ W + P+F+Y G EG I + G D+A + ALL+ +EHR+YG
Sbjct: 4 QRFLVNEAFW--RNPDGPVFLYIGGEGPIFEYDVLAGHHVDMAQQHSALLLALEHRFYGD 61
Query: 137 SIPYGGNKEIAYKNASTTGYLSSTQALADYASL--IIDLKKNLTATDSPVVVFGGSYGGM 194
S+ G K +LSS QALAD A I NL+ ++ + FGGSY G
Sbjct: 62 SVNPDGLK------TEHLAHLSSKQALADLAVFHQYISGSFNLSHGNT-WISFGGSYAGA 114
Query: 195 LAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGS 251
L+AWFR K+PH+ GA+ASSAP+ L+F + S++ T+ F +N K ++ +
Sbjct: 115 LSAWFRGKFPHLVFGAVASSAPVRATLDFSAYTNVMLLSSMKTRVFLH-HQNTGKAVQKA 173
Query: 252 WKQIEETAKKPGGLEKLQKAFRICKSEKNL--AIESWLSTAFVYTAMTDYPTPSNFLNPL 309
+ +E G ++ F C++ KNL IE + A V+ Y +++
Sbjct: 174 FTAVEAQL-MVGNASQVASDFGCCQTPKNLDDQIELMQNLADVFMGAVQYNEEGVYMS-- 230
Query: 310 PAFPVKEMCKAIDDP----KTGNDVFAKLYGAASVYYNYSGTAKCFDLNG--------DS 357
+ ++CK + + G D + L A +Y + + C D++ D+
Sbjct: 231 ----ISDLCKVMTRQNGTYEKGRDAYNSLVKLAQIYRSIT-EEPCLDISHEKTLRDLMDT 285
Query: 358 DPHG----LSEWGWQACTEMIMLTGGDNKDSIFEESEED---------YDARARYCKEAY 404
PH +W +Q CTE F+ EE+ + C +
Sbjct: 286 SPHAGRRSERQWTYQTCTEF----------GFFQTCEENTCPFSGMVTLQFQTEVCSSVF 335
Query: 405 GVDP-----RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK--TVL 457
G+ R + T +GG +++ NG DPW V+++ +
Sbjct: 336 GISQHSLPRRVAFTNTYYGGD------SPHTHRVLYVNGGIDPWKELSVIQDRGEGDEDQ 389
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
+ ++ AH D+ D + LK R
Sbjct: 390 VIFIEDTAHCADMMSRRLTDRRSLKTAR 417
>gi|170574566|ref|XP_001892870.1| Serine protease Z688.6 precursor [Brugia malayi]
gi|158601364|gb|EDP38292.1| Serine protease Z688.6 precursor, putative [Brugia malayi]
Length = 307
Score = 127 bits (319), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 84/259 (32%), Positives = 126/259 (48%), Gaps = 35/259 (13%)
Query: 264 GLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD 323
GL L K + S + ++ +L F AM +YP P+N+L LP +PVK C+ +
Sbjct: 50 GLFHLDKKSHLKISTDWIMLKEYLEDIFESMAMVNYPYPTNYLAELPGWPVKVACQFFNS 109
Query: 324 PKTGND--VFAKLYGAASVYYNYSGTAKCF----DLNGDS------DPHGLSEWGWQACT 371
K+ ND + +YG ++YYNY+G K F D+ DS DP G W WQ+CT
Sbjct: 110 NKSKNDEELAQSMYGIMNLYYNYTGQKKTFCIKPDVCNDSAYGALGDPFG---WPWQSCT 166
Query: 372 EMIM--LTGGDNKDSIFEESEEDYDARARYCKEAYG------VDPRPNWITTEFGGHKIG 423
EM+M + G D + + YC +G RP+W +G
Sbjct: 167 EMVMQQCSSGPPNDFFIKNCPFSLKGQELYCINTFGKLGYTKALMRPHWSILNYGN---- 222
Query: 424 LVLKRF--ASNIIFFNGLRDPWSGGG--VLKNISKTVLALVEKEGAHHVDLRFSTKEDPQ 479
R+ A+NI+F NG DPWS GG + + ++++++ K+GAHH DLR + D
Sbjct: 223 ----RYPTATNIVFSNGYLDPWSAGGWSLKSRVMGSLISIIIKDGAHHYDLRGKHQLDTN 278
Query: 480 WLKDVRRREVEIIGKWISQ 498
+KD RR E I W+ +
Sbjct: 279 SVKDARRLEKFYIKYWLKE 297
>gi|219121656|ref|XP_002181178.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407164|gb|EEC47101.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 544
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 123/451 (27%), Positives = 194/451 (43%), Gaps = 68/451 (15%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y Q LDHF N + + QRY +D ++ G + PIFV G E + G +Y
Sbjct: 94 YKEQTLDHFTPN-KDEAPWAQRYYQDDKYFAGPGH--PIFVIMGGEDAV------NGILY 144
Query: 117 -----DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY---LSSTQALADYAS 168
+A +F+A + +EHR+YGKS P K+ ST LS QALAD
Sbjct: 145 PFVSKHLAKRFRAHTLCLEHRFYGKSKPL--------KHPSTADLRRLLSPAQALADAVQ 196
Query: 169 LIIDLKKNL------TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDN 222
I +K L T + PVV GGSY G L+A R+ YP V ASSAP+ + +
Sbjct: 197 FIEYKRKQLGCGNKGTKSYCPVVTVGGSYPGFLSALLRIVYPDVVDIGYASSAPLHLYSH 256
Query: 223 IVSPYSFSNIITQDFRSVSENCYKVIKGSWKQI-EETAKKPGGLEKLQKAFRICKSEK-- 279
V+ ++ +TQ S C +K + + E+ + ++ +C +
Sbjct: 257 RVNKAAYFEKVTQVAEQASRGCAGAVKNALMDVTEKLLASKRSVAEVAFDLGVCVATVPD 316
Query: 280 --------NLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV- 330
I +ST F M YP P P + + C D K+ ++
Sbjct: 317 YIMDNEIFQQEIMMVVSTHFAEYNMGYYP-------PGPDQDLVQGCLIFQDTKSSSEQK 369
Query: 331 ---FAKLYGAASVYYNY--------SGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGG 379
F +L ++ +GT D +G D H W +Q+CT ++ G
Sbjct: 370 VSNFLRLREDFDECFDMQTELPPGPNGTISASDWSGVGDGHSGYMWDFQSCT--LLPECG 427
Query: 380 DNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGL 439
+ S+F + ++C+ +GV+P + EFG + V ++++F NG+
Sbjct: 428 MSDASMFPPRPWTIEWETQHCQVRFGVEPNLRQLVDEFGFADLSNV-----THLLFTNGI 482
Query: 440 RDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
D WS +L ++S++V A+ GAHH DL
Sbjct: 483 NDGWSVASILTDLSESVKAINFVNGAHHSDL 513
>gi|449457546|ref|XP_004146509.1| PREDICTED: probable serine protease EDA2-like [Cucumis sativus]
gi|449499940|ref|XP_004160960.1| PREDICTED: probable serine protease EDA2-like isoform 1 [Cucumis
sativus]
Length = 489
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 145/501 (28%), Positives = 216/501 (43%), Gaps = 72/501 (14%)
Query: 27 PTFPSSRITPEKLSSLISSSKDSQGLYKTK-YHTQILDHFNYNPQSYQTFQQRYLINDTH 85
P F S+ +TP + +SS+ S L +T+ + Q LDHF+ P ++ FQQRY +
Sbjct: 22 PAFISAHVTPRTVLHRLSST--SSFLNRTELWFNQTLDHFS--PYNHDKFQQRYYEFLDY 77
Query: 86 WGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKE 145
+ + PIF+ EG + + ++ +A KF A +V +EHRYYGKS P+ K
Sbjct: 78 F--RIPDGPIFLKICGEGPCNGISND--YLGVLAKKFGAAIVSLEHRYYGKSSPF---KS 130
Query: 146 IAYKNASTTGYLSSTQALADYASL------IIDLKKNLTATDSPVVVFGGSYGGMLAAWF 199
+ N YLSS QAL D A ++LK N ++P FG SY G L+AWF
Sbjct: 131 LTTNNLR---YLSSKQALFDLAVFRQYYQDSLNLKLNKKG-ENPWFFFGVSYPGALSAWF 186
Query: 200 RLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIE--- 256
RLK+PH+ G+LASSA +L + Y+F+ Q S C V++ + + IE
Sbjct: 187 RLKFPHLTCGSLASSAVVL------AVYNFTEFDQQIGESAGPECKAVLQETNRLIEQRF 240
Query: 257 ETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKE 316
ET KK +++ F + E + L+ A V Y P +PL K
Sbjct: 241 ETNKK-----EVKALFGAGELEIDGDFFYLLADAAVIA--FQYGNPDTLCSPL--VQAKN 291
Query: 317 MCKAIDD--PKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMI 374
+ D K D + +G++ YN K + W +Q CTE+
Sbjct: 292 AGNDLVDAYAKYVKDYYIGSFGSSVQTYN----QKYLKNTTPGEDSADRLWWFQVCTEVA 347
Query: 375 MLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASN 432
DS+ S+ D CK +G P+ TT +GG I S
Sbjct: 348 YFQVAPANDSM-RSSKVDAKYHLDLCKNVFGEGVYPDVDTTNIYYGGTGIA------GSK 400
Query: 433 IIFFNGLRDPWSGGGVLKNISKTVLA---LVEKEGAHHVDLRFSTK------------ED 477
I+F NG +DPW K IS + + H DLR +
Sbjct: 401 IVFTNGSQDPWRHAS--KQISSPEMPSFLMTCHNCGHGTDLRGCPQSHLNIEGNAQNCSS 458
Query: 478 PQWLKDVRRREVEIIGKWISQ 498
P + VR++ VE + W+S+
Sbjct: 459 PDAVHKVRQQLVEKMDLWLSE 479
>gi|241161684|ref|XP_002408971.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215494440|gb|EEC04081.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 201
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 73/197 (37%), Positives = 102/197 (51%), Gaps = 5/197 (2%)
Query: 194 MLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWK 253
MLAAWFRLKYPHV ALASSAPIL F I +FS ++T+ F S+ C I+ S++
Sbjct: 1 MLAAWFRLKYPHVTTAALASSAPILLFTGITPCSAFSEVLTKAFAKESDQCTNAIRTSFE 60
Query: 254 QIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLP 310
+ A G + L++ FR+CK + W + Y AM ++P S +P
Sbjct: 61 VTRKQAVTEEGAKALKEQFRLCKPLAPSNYTVLRDWFWDVYAYLAMFNHPYASKLPLLVP 120
Query: 311 AFPVKEMCKAID-DPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQA 369
PVKE CK ++ + + +Y A SV+ NY+G C DL + P L WG Q
Sbjct: 121 GHPVKEACKFLEKNFADDQSLLDGIYQAISVFTNYTGKTHCNDLPNSAVPL-LGGWGIQL 179
Query: 370 CTEMIMLTGGDNKDSIF 386
C EM+M + K +F
Sbjct: 180 CNEMVMPMCNNGKTDMF 196
>gi|449499944|ref|XP_004160961.1| PREDICTED: probable serine protease EDA2-like isoform 2 [Cucumis
sativus]
Length = 486
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 145/501 (28%), Positives = 216/501 (43%), Gaps = 72/501 (14%)
Query: 27 PTFPSSRITPEKLSSLISSSKDSQGLYKTK-YHTQILDHFNYNPQSYQTFQQRYLINDTH 85
P F S+ +TP + +SS+ S L +T+ + Q LDHF+ P ++ FQQRY +
Sbjct: 22 PAFISAHVTPRTVLHRLSST--SSFLNRTELWFNQTLDHFS--PYNHDKFQQRYYEFLDY 77
Query: 86 WGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKE 145
+ + PIF+ EG + + ++ +A KF A +V +EHRYYGKS P+ K
Sbjct: 78 F--RIPDGPIFLKICGEGPCNGISND--YLGVLAKKFGAAIVSLEHRYYGKSSPF---KS 130
Query: 146 IAYKNASTTGYLSSTQALADYASL------IIDLKKNLTATDSPVVVFGGSYGGMLAAWF 199
+ N YLSS QAL D A ++LK N ++P FG SY G L+AWF
Sbjct: 131 LTTNNLR---YLSSKQALFDLAVFRQYYQDSLNLKLNKKG-ENPWFFFGVSYPGALSAWF 186
Query: 200 RLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIE--- 256
RLK+PH+ G+LASSA +L + Y+F+ Q S C V++ + + IE
Sbjct: 187 RLKFPHLTCGSLASSAVVL------AVYNFTEFDQQIGESAGPECKAVLQETNRLIEQRF 240
Query: 257 ETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKE 316
ET KK +++ F + E + L+ A V Y P +PL K
Sbjct: 241 ETNKK-----EVKALFGAGELEIDGDFFYLLADAAVIA--FQYGNPDTLCSPL--VQAKN 291
Query: 317 MCKAIDD--PKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMI 374
+ D K D + +G++ YN K + W +Q CTE+
Sbjct: 292 AGNDLVDAYAKYVKDYYIGSFGSSVQTYN----QKYLKNTTPGEDSADRLWWFQVCTEVA 347
Query: 375 MLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASN 432
DS+ S+ D CK +G P+ TT +GG I S
Sbjct: 348 YFQVAPANDSM-RSSKVDAKYHLDLCKNVFGEGVYPDVDTTNIYYGGTGIA------GSK 400
Query: 433 IIFFNGLRDPWSGGGVLKNISKTVLA---LVEKEGAHHVDLRFSTK------------ED 477
I+F NG +DPW K IS + + H DLR +
Sbjct: 401 IVFTNGSQDPWRHAS--KQISSPEMPSFLMTCHNCGHGTDLRGCPQSHLNIEGNAQNCSS 458
Query: 478 PQWLKDVRRREVEIIGKWISQ 498
P + VR++ VE + W+S+
Sbjct: 459 PDAVHKVRQQLVEKMDLWLSE 479
>gi|323446372|gb|EGB02557.1| hypothetical protein AURANDRAFT_9133 [Aureococcus anophagefferens]
Length = 138
Score = 127 bits (318), Expect = 2e-26, Method: Composition-based stats.
Identities = 63/147 (42%), Positives = 86/147 (58%), Gaps = 9/147 (6%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
LDHF++ + TF+QR ++ HW PIF+Y GNE D+ + TG M++ A
Sbjct: 1 LDHFDFTTNA--TFEQRVFVHADHWAPG---GPIFLYCGNEDDVTLYVNATGLMWEHAAA 55
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATD 181
F A+LVF+EHRYYG ++P+G A YLS QALAD + + +K A +
Sbjct: 56 FGAMLVFVEHRYYGATLPFGA----ASFEPEHLRYLSHEQALADLVNALRRIKATYGAEN 111
Query: 182 SPVVVFGGSYGGMLAAWFRLKYPHVAI 208
+ V FGGSYGGMLAAW R+KYP +
Sbjct: 112 AKTVAFGGSYGGMLAAWLRMKYPAAVV 138
>gi|324503010|gb|ADY41315.1| Serine protease [Ascaris suum]
Length = 1068
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 116/458 (25%), Positives = 199/458 (43%), Gaps = 36/458 (7%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMY 116
T +DHF+ + TF QRY +N + ++ P F+ G EG +++W +
Sbjct: 599 TMPVDHFDLT--NMNTFDQRYWVNPQY---AQPGGPHFLVIGGEGRANVKWVTEPNLITM 653
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
+A KF A + +EHRYYG S P ++ +L++TQALAD A I+ + +
Sbjct: 654 SMARKFNATVYMLEHRYYGDSFP------TPDQSTENLRWLTATQALADLAQFIMTMNER 707
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
+ V FGGSY GML+AWFR YP +++GA+ASSAPI + V Y + ++
Sbjct: 708 YNLVNPKWVTFGGSYPGMLSAWFRQFYPQLSVGAVASSAPI---EAKVDFYEYLIVVENA 764
Query: 237 FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIE-SWLSTAFVYTA 295
R + C + +K ++ QI + + G L F + K E NL E + L + ++
Sbjct: 765 LRVFNATCAENVKLAFDQIHQLSLTRTGRVTLSNLFTL-KPEWNLTTEVTNLDIQYFFSI 823
Query: 296 MTD-YPTPSNFLNP-----LPAFPVKEMCK-AIDDPKTGNDVFAKL------YGAASVYY 342
+ D + + N ++E+C +++ KT + A + + + + Y
Sbjct: 824 LYDKFQGAVQYNNDNTGSYATGGGIREVCGYMLNNAKTPMENVADVNIYMTNFSSGTFSY 883
Query: 343 NYSGTAKCFDLNGDSDPHGLSE-WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
+ D D + S W +Q C E D ++IF + C+
Sbjct: 884 TDNNYQNYIDYLKDVNAKSSSRSWTYQTCNEFGFFQSTDVGENIF-GGPIPVNIFIDMCQ 942
Query: 402 EAYGVDPRPNWITTEFGGHKIGLVLKRF--ASNIIFFNGLRDPWSGGGVLKNISKTVLAL 459
+ YG P ++ + + + +N++F NG DPW + +V +
Sbjct: 943 DVYGSKFTPRFVYEAVDKSQRFYGGRDYFKGTNVLFTNGNIDPWHALSKYDG-NGSVTTV 1001
Query: 460 VEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ AH D+ ED L R E I +W++
Sbjct: 1002 LMNGTAHCADMYPPRDEDAADLAPTRELIGEKIAEWLA 1039
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 131/552 (23%), Positives = 219/552 (39%), Gaps = 100/552 (18%)
Query: 2 ATRFIFLSFCLLFS--STLTISNAKIFPTFPSSR---------ITPEKLSSLISSSKDSQ 50
+R I SF +L + +++ P F R I PE+L+ +
Sbjct: 3 VSRLILFSFAILIAKNDAFRQRSSRNLPYFIMGRPPHGLRPDPIKPEELNR-------AG 55
Query: 51 GLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD------ 104
+ +T Q LDHFN +T+ QRY N ++ K+ PIF+ G EG
Sbjct: 56 YVIQTATLPQRLDHFN--ASDARTWAQRYHYNFNYY---KSGGPIFLMLGGEGPETGSWC 110
Query: 105 -------IEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYL 157
I+W + +YD +EHR+YG+S P+ ++ YL
Sbjct: 111 VDEKLPYIQWAMSHNAAIYD-----------LEHRFYGQSRPF------PTQSIENLKYL 153
Query: 158 SSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
SS QA+ D A I + + + +VFGGSY G LAAW R K+P + IGA+ SS P+
Sbjct: 154 SSRQAIEDAAYFIRYINEQQKYVNPKWIVFGGSYSGALAAWLREKHPELVIGAVGSSGPV 213
Query: 218 ---LNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
L+F Y + ++ RS + C ++ + ++ + G + L + F +
Sbjct: 214 EAKLDF------YEYLEVVENALRSYAPECADAVQQGFTEMSKMIWTLEGRKNLSELFVL 267
Query: 275 CK--SEKNLA---IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND 329
+E L I+++ +T + Y + N + + E+C + + T D
Sbjct: 268 NPKLNETKLRYKDIQNFFATIYGYFQWAVQYSGDNAGSYAIGGGISEICPLMMN--TSMD 325
Query: 330 VFAKLYGA-------------ASVYYNYSGTAKCFDLNGDSDPHGL----SEWGWQACTE 372
++ SV +Y + F + DP G W WQ CTE
Sbjct: 326 YLNRIKSVIVYLTEFDSSISFTSVGIDYDEMIE-FLADERYDPSGYYSADRSWVWQTCTE 384
Query: 373 MIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRF--- 429
D +IF S + C + +G + I + I + K +
Sbjct: 385 FGYFQSTDLGRNIF-GSVTPVNLFVDMCTDTFGSAYKIQAIE-----NSIHMTRKYYGGK 438
Query: 430 ----ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
+N++ NG DPW G+ NI +V+ ++ AH D+ + +D L + R
Sbjct: 439 DHFKGTNVVLPNGDIDPWHALGLYSNIEPSVVPILIHGTAHCADMYPARTQDLPALTNAR 498
Query: 486 RREVEIIGKWIS 497
I KW++
Sbjct: 499 NIIASNINKWLN 510
>gi|428165662|gb|EKX34652.1| hypothetical protein GUITHDRAFT_158798 [Guillardia theta CCMP2712]
Length = 490
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 121/472 (25%), Positives = 200/472 (42%), Gaps = 51/472 (10%)
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD------IEWF 108
+++ +Q +DHF+ +P+ T+ QRY +N+T + + P+F+ G EG +
Sbjct: 27 SRWFSQRVDHFSDSPE---TWMQRYFVNETFF--RMGSGPVFLCVGGEGPPMTEQVVVTG 81
Query: 109 AQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYAS 168
+ M +A AL++ +EHRYYG+S P K+++ +N +LSS QAL D AS
Sbjct: 82 ENHCALMVHLARIHGALILALEHRYYGESHP---RKDLSVENMR---FLSSRQALEDIAS 135
Query: 169 LIIDLKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
++ + + FGGSY GMLAAW K+PH+ A++SSAP+ I++
Sbjct: 136 FHSHIRSAFAISSKQRWITFGGSYPGMLAAWSHAKFPHLFHAAVSSSAPV---QAILNMK 192
Query: 228 SFSNIITQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAI 283
++N++ DF S C IKG++ Q+ E G L+ F +C + +
Sbjct: 193 GYNNVVASDFADETLGGSMLCLNTIKGAFAQVGEYLLSYEGRRYLKTRFSVCGGDD--VL 250
Query: 284 ESWLSTAFVYTAMTD--YPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVY 341
E + A ++D P ++ P ++ CK + D G + + SV
Sbjct: 251 EDIKNRALFAETLSDPLIPQSNDPSCTSPLCDIRRQCKFLTDQSLGKPLDRLVAMMDSVR 310
Query: 342 YNYSGTAKCFDLNGDSDPHGLSE-----------WGWQACTEMIMLTGGD-NKDSIFEES 389
C D + GL + W +Q CTE D + F S
Sbjct: 311 -----DGSCLDTDYQMMLAGLQDIKISEDRTDRTWFYQTCTEFGFYQTCDPDSRCPFVSS 365
Query: 390 EEDYDA--RARYCKEAYGVD-PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
+ CK + + + E GL L+ + NIIF NG DPW
Sbjct: 366 PHLNNVYFSTDMCKVVFNMSFEKTAEFVRESNNEYGGLNLQSY--NIIFVNGGADPWKSQ 423
Query: 447 GVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+L + V ++ K +HH D +K RR + + ++Q
Sbjct: 424 SMLHPSNAYVQTVMVKGASHHFWTHPELPTDSTEVKHARRIIAQFVRDVLAQ 475
>gi|224097128|ref|XP_002310843.1| predicted protein [Populus trichocarpa]
gi|222853746|gb|EEE91293.1| predicted protein [Populus trichocarpa]
Length = 487
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 141/526 (26%), Positives = 220/526 (41%), Gaps = 92/526 (17%)
Query: 12 LLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTK--YHTQILDHFNYNP 69
L+F S L +S A P + + +S S +S+ T+ + Q LDHF+ P
Sbjct: 6 LVFISLLLLSTA------PYASLALRTQVHSLSESSNSKNYLTTQELWFNQTLDHFS--P 57
Query: 70 QSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFI 129
+ F QRY ++ S + PIF+ E + ++ +A KF A +V +
Sbjct: 58 FDHHKFPQRYYEFLDYFRIS--DGPIFLEICGESSCNGIVND--YISVLAKKFGAAVVSL 113
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL--------IIDLKKNLTATD 181
EHRYYG+S+P+ K +N +LSS QAL D A ++LK N T+ +
Sbjct: 114 EHRYYGRSLPF---KSTTTENLR---FLSSKQALFDLAVFRHTIHMQESLNLKLNRTSVE 167
Query: 182 SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS 241
+P VFGGSY G L+AWFRLK+PH+ G+LASSA +L N F+ Q S
Sbjct: 168 NPWFVFGGSYAGALSAWFRLKFPHLTCGSLASSAVVLAIHN------FTEFDQQIGESAG 221
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPT 301
C ++ + + +EE + + ++ F + E + +L+ A V Y
Sbjct: 222 AECKATLQETTQLVEE--RLASNKQAVKALFDAAELEIDGDFLYFLADAAVIA--FQYGN 277
Query: 302 PSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSG----TAKCFD----- 352
P PL + K G D+ A V +Y G + + +D
Sbjct: 278 PDKLCPPL-----------VQAKKDGEDLVEAY--AKYVKEDYVGSFGVSVQTYDQRHLK 324
Query: 353 ---LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY--GVD 407
LN +S G W +Q CTE+ DSI S+ D CK+ + G+
Sbjct: 325 DTTLNENS---GDRLWWFQVCTEVAYFQVAPANDSI-RSSQVDTRYHLDLCKKVFGEGIY 380
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVL---ALVEKEG 464
P + +GG + S I+F NG +DPW K IS + +
Sbjct: 381 PEVDKTNIYYGGTNMA------GSKIVFTNGSQDPWRHAS--KQISSPDMPSFVMSCHNC 432
Query: 465 AHHVDLRFSTKE------------DPQWLKDVRRREVEIIGKWISQ 498
H D+R + P ++ VR + +E + W+S+
Sbjct: 433 GHGTDMRGCPQSPFNIEGNARNCGSPDAVEKVRHQIIEKMDLWLSE 478
>gi|8574032|emb|CAB94769.1| protease, serine, 16 (thymus) [Homo sapiens]
gi|119623494|gb|EAX03089.1| protease, serine, 16 (thymus), isoform CRA_b [Homo sapiens]
Length = 541
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 118/512 (23%), Positives = 215/512 (41%), Gaps = 87/512 (16%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +ND HW G + PIF++ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVSDR--RSFLQRYWVNDQHWVG--QDGPIFLHLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + S+ + +
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLRFLSSRLAMGKSSG 162
Query: 163 ---------------LADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLK---- 202
LAD S + L + ++ SP + FGGSY G LAAW RLK
Sbjct: 163 IPSDEDRPSPPFDPRLADVVSARLALSRLFNISSSSPWICFGGSYAGSLAAWARLKVLRL 222
Query: 203 --YPHVAIGALASSAP---ILNFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWK 253
+PH+ ++ASSAP +L+F +++++++ S S C + ++
Sbjct: 223 LRFPHLIFASVASSAPVRAVLDFSE------YNDVVSRSLMSTAIGGSLECRAAVSVAFA 276
Query: 254 QIEETAKKPGGLE-KLQKAFRIC--------KSEKNLAIESWLSTAFVYTAMTDYPTPSN 304
++E + G + L+ C ++E A+++ + Y T P
Sbjct: 277 EVERRLRSGGAAQAALRTELSACGPLGRAENQAELLGALQALVGGVVQYDGQTGAP---- 332
Query: 305 FLNPLPAFPVKEMCKAIDDP---KTGNDVFAKLYGAASVYYNYSGTAKCFDLN------- 354
V+++C + ++ + + L A + + G KC +
Sbjct: 333 -------LSVRQLCGLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQ-KCLSFSRAETVAQ 384
Query: 355 -GDSDPH----GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG---- 405
++P G +W +Q CTE +N F + ++ C++ +G
Sbjct: 385 LRSTEPQLSGVGDRQWLYQTCTEFGFYVTCENPRCPFSQLPA-LPSQLDLCEQVFGLSAL 443
Query: 406 -VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
V + +GG G A+ ++F NG DPW V + + + L+ + G
Sbjct: 444 SVAQAVAQTNSYYGGQTPG------ANKVLFVNGDTDPWHVLSVTQALGSSESTLLIRTG 497
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+H +D+ D L+ R+ + + W+
Sbjct: 498 SHCLDMAPERPSDSPSLRLGRQNIFQQLQTWL 529
>gi|256574601|dbj|BAH98106.1| serine protease [Entamoeba invadens]
gi|440292588|gb|ELP85775.1| hypothetical protein EIN_281360 [Entamoeba invadens IP1]
Length = 479
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 116/479 (24%), Positives = 209/479 (43%), Gaps = 44/479 (9%)
Query: 31 SSRITPEK--LSSLISSSKDSQ-GLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWG 87
SS++T + L S+ +K+S+ +T +T LDHFN N + F+ Y I+ +
Sbjct: 12 SSKLTEHQRTLHSISKQAKESKKSTVETFKYTVPLDHFNANND--EEFEIVYFIDSQYLD 69
Query: 88 GSKNNAPIFVYTGNEG-DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEI 146
+ +PIF+ G EG + E QN + ++A K K L++ +EHR+YG S P + E+
Sbjct: 70 SASETSPIFILLGGEGPETEKVLQNNYVIDELAKKHKGLMLSVEHRFYGTSTP---SLEL 126
Query: 147 AYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHV 206
+T Y ++ QA+ DY +I +++ + PV+ GGSY G LA W R KYP++
Sbjct: 127 -----NTLKYCTAEQAMMDYVEVINYVQEMYSLVGHPVIALGGSYSGNLATWIRQKYPNI 181
Query: 207 AIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLE 266
G+ ASSAP+ + +V Y + ++ + + EN ++ ++++ +E G +
Sbjct: 182 IDGSWASSAPL---EAVVDFYEYLEVVQSN---LPENTATLLTLAFEKWDEMVVTESGRK 235
Query: 267 KLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD--- 323
+L K F C I+++ + + TA+ Y ++ + +C D+
Sbjct: 236 QLGKIFHTCTEFGEKDIQTF--SENIGTALAGYVQYNSSVWKKNYESTNSICYEFDEDIN 293
Query: 324 -----------PKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTE 372
K+G+D +S+ NY +D W +Q C
Sbjct: 294 TKYPMFIDKTNTKSGSDC-----TGSSLETNYKELRDTTTYEKGNDGASGRAWMFQTCVA 348
Query: 373 MIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASN 432
K ++ CK+ Y +D + + E + G + +N
Sbjct: 349 YGYYQAVSEKSNVMFGRMNKLQGSIDMCKDIYNIDNQTLYQAVEHINVRYGAKNPQ-VTN 407
Query: 433 IIFFNGLRDPWSGGGVLKNISKTVLALVE--KEGAHHVDLRFSTKEDPQWLKDVRRREV 489
+ F NG DPW G+ + + +V+ + +H DL + D LK R +E+
Sbjct: 408 VAFTNGGVDPWHALGITQQDAVDSSNIVQYIQTTSHCSDLYSEKETDAPELKRARHKEM 466
>gi|357518131|ref|XP_003629354.1| Thymus-specific serine protease [Medicago truncatula]
gi|355523376|gb|AET03830.1| Thymus-specific serine protease [Medicago truncatula]
Length = 455
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 130/471 (27%), Positives = 194/471 (41%), Gaps = 76/471 (16%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHF+ P ++ F+QRY ++ + PIF+ G E + ++ +A
Sbjct: 20 QTLDHFS--PYDHRQFRQRYYEFLDYFRAP--DGPIFLVIGGEATCNGIVND--YIGVLA 73
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IIDL 173
KF A +V +EHRYYG+S P+ K YLSS QAL D A ++
Sbjct: 74 KKFGAAVVSLEHRYYGESTPFDTFSTENLK------YLSSKQALFDLAVFRQYYQDSLNA 127
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K N + ++P FGGSY G L+AWFRLK+PH+ G+LASSA +L + F+
Sbjct: 128 KLNRSGVENPWFFFGGSYSGALSAWFRLKFPHLTCGSLASSAVVLAVQD------FAEFD 181
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVY 293
Q S C V++ + + +E G KA R + +L I+ F+Y
Sbjct: 182 QQIGESAGPECKAVLQETTQLVETKLADDG------KALRSIFNADDLEIDG----DFLY 231
Query: 294 TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA-ASVYY--NYSGTAKC 350
+ NP ++CK + D K + Y YY + T K
Sbjct: 232 YLADAAVIAFQYGNP------DKLCKPLVDAKNAGEDLVDAYAKYVKEYYVGTFGITPKS 285
Query: 351 FD--------LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKE 402
+D +N DS W +Q CTE+ + DSI S+ D CK
Sbjct: 286 YDQEYLKKTAINEDSSTR---LWWFQVCTEVAYFQVAPSNDSI-RSSKIDTKYHLDLCKN 341
Query: 403 AY--GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALV 460
+ GV P + +GG K+ S IIF NG +DPW + L+
Sbjct: 342 IFGDGVFPDVDATNLYYGGTKVA------GSKIIFTNGSQDPWRHASKQTSSPDLPSYLI 395
Query: 461 E-KEGAHHVDLRFSTK------------EDPQWLKDVRRREVEIIGKWISQ 498
+ H DLR + P + VR++ E + W+S+
Sbjct: 396 KCNNCGHCTDLRGCPQSPLVIEGNEKNCSSPDAVHKVRQKVQEDMDLWLSE 446
>gi|242039115|ref|XP_002466952.1| hypothetical protein SORBIDRAFT_01g017310 [Sorghum bicolor]
gi|241920806|gb|EER93950.1| hypothetical protein SORBIDRAFT_01g017310 [Sorghum bicolor]
Length = 510
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 128/469 (27%), Positives = 189/469 (40%), Gaps = 73/469 (15%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHFN P ++ F+QRY ++ N PIF+Y E N ++ +A
Sbjct: 64 QTLDHFN--PTDHRQFKQRYYEFLDYYRAP--NGPIFLYICGESSCSGIGNN--YLAVMA 117
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT---GYLSSTQALADYASL------I 170
KF A LV EHRYYGKS P+ N+ TT +LSS QAL D A
Sbjct: 118 KKFGAALVSPEHRYYGKSSPF---------NSLTTENLQFLSSKQALFDLAVFRQYYQET 168
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
++ K N + DS VFGGSY G L+AWFRLK+PH+ G+LASS +L + Y+F+
Sbjct: 169 LNAKYNRSGADSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL------AVYNFT 222
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
+ Q S C ++ ++ + G +++ F K E + L+ A
Sbjct: 223 DFDKQIGISAGPECKAALQEITGLVD--GQLQSGRNSVKELFGAPKLENDGDFLYLLADA 280
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--NYSGTA 348
Y P +PL + K G D+ YY + +
Sbjct: 281 AAIAFQ--YGNPDVLCSPL-----------AEAKKNGTDLVETFASYVKDYYIGKFGASV 327
Query: 349 KCFDL----NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
+D N W +Q C+E+ DS+ + D C+ +
Sbjct: 328 ASYDQEYLKNTTPAESSYRLWWYQVCSEVAYFQVAPKNDSV-RSPKIDTRYHLDLCRNVF 386
Query: 405 GVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
G P+ T +GG I S I+F NG +DPW K+ + L+E
Sbjct: 387 GEGVYPDVFMTNLYYGGTGIA------GSKIVFANGSQDPWRHASKQKSSDELPSYLIEC 440
Query: 463 EGAHH-------------VDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
E H ++ S P+ L VR+ V+ I W+S+
Sbjct: 441 ENCGHCSDLSGCPQAPSNIEGDSSKCSPPESLNKVRKEIVDHIDLWLSE 489
>gi|357603759|gb|EHJ63915.1| carboxypeptidase 3 [Danaus plexippus]
Length = 445
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 115/422 (27%), Positives = 185/422 (43%), Gaps = 56/422 (13%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF+ PQ+ +T+Q RY+ N+ +GG NN PIF+ G E +I+ G MY +A +
Sbjct: 53 IDHFD--PQNRETYQMRYMYNEEFFGG--NNYPIFIMVGGEWNIQPGWLLAGNMYLMAQE 108
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATD 181
+ L + EHRYYG+S+PY +L+ QALAD A I ++KK + +
Sbjct: 109 NRGYLFYTEHRYYGESLPY------TTFTTENLRFLNVDQALADLAYFISEIKKIPSFVN 162
Query: 182 SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV- 240
S VV++GGSY G + W + +YPH+ +G +ASS PI +I + ++ F S
Sbjct: 163 SKVVLYGGSYAGNMVLWLKQRYPHLVVGVVASSGPIKAQVDIP---GYLEVVHNAFLSEG 219
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-----KSEKNLA-IESWLSTAFVYT 294
+ C IK + G +Q+ +R+C S ++ +++ F +
Sbjct: 220 GQECVDTIKQGIADTIAAMETEDGRRSIQRIYRLCVPLDYSSRLSMGYFSGYITWTFSTS 279
Query: 295 AMTDYP-----TPSNFLNPL-PAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
T P NF N + + P++++ I D ++ ++ Y YN
Sbjct: 280 VQTARPGSLTAICQNFTNNVYGSTPMEQIGGYIADSRSISNCLNVTYDNYVASYN----- 334
Query: 349 KCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEE-SEEDYDARARYCKEAYG-- 405
K NG + W +Q CTE + F++ + D +CK +
Sbjct: 335 KTVPSNGKA-------WYYQTCTEYGYYQTAPKSGTAFDQLTWLDVPFYVDFCKRVFSEK 387
Query: 406 -----VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV----LKNISKTV 456
V + + FGG L +N I +G DPW GV LK S T+
Sbjct: 388 FTESFVMNAIDRVNLMFGG------LYPNVNNTINIHGDIDPWHVLGVYDRDLKETSPTI 441
Query: 457 LA 458
L
Sbjct: 442 LV 443
>gi|308808360|ref|XP_003081490.1| Prolylcarboxypeptidase (angiotensinase C) (ISS) [Ostreococcus
tauri]
gi|116059953|emb|CAL56012.1| Prolylcarboxypeptidase (angiotensinase C) (ISS) [Ostreococcus
tauri]
Length = 542
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 130/530 (24%), Positives = 217/530 (40%), Gaps = 90/530 (16%)
Query: 30 PSSRITPEKLSSLISSSKDSQGLYKT--------KYHTQILDHFNYNPQSYQTFQQRYLI 81
P++R+ + +S+ + ++G ++ ++ Q LDHF++ + + + QRY +
Sbjct: 25 PATRVDGLRRASVARALGGARGDFEINDDVEDAERWFDQTLDHFDHVDR--RRWSQRYFV 82
Query: 82 NDTHWGGSKNNAPIFVYTGNEGD------IEWFAQNTGFMYDVAPKFKALLVFIEHRYYG 135
N+ + + P+FV G EG + + G M D+A K + + + +EHR+YG
Sbjct: 83 NEGFVDKIEASTPVFVCVGGEGPALTARAVLDGGTHCGTMIDLAKKHRGIALALEHRFYG 142
Query: 136 KSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK----NLTATD-------SPV 184
S P G + + YL+S QAL D + + + T +D S V
Sbjct: 143 ASQPTGDLSRESLR------YLTSAQALEDVVAFVKYVADAYGLRTTPSDDGRNGSYSRV 196
Query: 185 VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV---- 240
+ FGGSY GMLAAW R+KYPH A+ASSAPI ++ Y +++ + R
Sbjct: 197 IAFGGSYPGMLAAWSRVKYPHAIHAAVASSAPIRAELDMRGYY---DVVGKALREKDVGG 253
Query: 241 SENCYKVIKGSWK-QIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDY 299
S+ C+ + +++ ++ E K P G L+ F +C A V AM +
Sbjct: 254 SDACFDAVSETFESELNEALKTPEGRRALETRFNVCGDAALDQFGGRDGFADVLRAM--F 311
Query: 300 PT----PSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNG 355
P PS + + + C + +TG +L ASV G++ C L+G
Sbjct: 312 PAQNNDPSCEMEDTSCLNIAKACTMMTRAETGK----RLDALASVVKVVFGSS-CVSLDG 366
Query: 356 DS----------DPHGLSE--WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY---- 399
+ +P G E W WQ CTE + F+ E+D +
Sbjct: 367 AAYMRELMSETPNPLGEGERQWTWQTCTEF----------AFFQTCEKDSGCPFKLDPPT 416
Query: 400 ---------CKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
C + +GV E + G + + I+F +G DPW +
Sbjct: 417 MPLSSYQWICAQVFGVSAEQTKNAVERSNARYGGITP-GGTRILFPSGSVDPWIANSFVS 475
Query: 451 NI--SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
N K A V +HH D + R R + + KW++Q
Sbjct: 476 NTFSPKWEPAFVVPGASHHAWTHPPKDTDSAAVVQARARIEKQVEKWMNQ 525
>gi|224150431|ref|XP_002336956.1| predicted protein [Populus trichocarpa]
gi|222837219|gb|EEE75598.1| predicted protein [Populus trichocarpa]
Length = 114
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 56/106 (52%), Positives = 77/106 (72%)
Query: 400 CKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLAL 459
CK +GV P+P+WITT +GGH I L+L+RF SNIIF NGLRDP+S GGVL NIS +++A+
Sbjct: 2 CKSLFGVLPQPHWITTYYGGHDIKLILQRFGSNIIFSNGLRDPYSSGGVLNNISDSIVAV 61
Query: 460 VEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQDLAQ 505
G+H +D++ + DP WL R+ EV+II WIS+Y+ DL +
Sbjct: 62 STVNGSHCLDIQRANPSDPHWLVMQRKIEVKIIEGWISKYYTDLLE 107
>gi|194899984|ref|XP_001979537.1| GG23250 [Drosophila erecta]
gi|190651240|gb|EDV48495.1| GG23250 [Drosophila erecta]
Length = 480
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 121/467 (25%), Positives = 202/467 (43%), Gaps = 68/467 (14%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
K+ TQ LD N++ + QT+Q RYL+ND +PIF+Y G E +IE + G
Sbjct: 57 KWITQKLD--NFDASNTQTYQMRYLVNDEF---QTQGSPIFIYLGGEWEIEKSMVSAGHW 111
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK- 174
YD+A + K +LV+ EHRYYG+S+P + + YL+ QALAD A+ I K
Sbjct: 112 YDMAEEHKGVLVYTEHRYYGQSVP------TSSMSTDNLKYLNVKQALADVANFIETFKA 165
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
+N +S V++ GGSY + WF+ YP + +G ASSAP+L V + ++
Sbjct: 166 ENPQLANSKVILAGGSYSATMVVWFKRLYPDLIVGGWASSAPLL---AKVDFTEYKEVVG 222
Query: 235 QDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL---ST 289
Q F + S+ CY IK ++E G E + R+C S + ++ W S
Sbjct: 223 QAFLQLGSQQCYDRIKNGIAELESMFANKRGAEA-KAMLRLCDSFDDQNDLDLWTLFSSI 281
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAK 349
+ +++ + Y + ++ + + DD T F VY+ + G +
Sbjct: 282 SNIFSGIAQYQSNNDIVYNC------DYIMTFDDDATAIANF--------VYWGW-GMGR 326
Query: 350 CFD----------LNGDSDPHGLSEWGWQACTEM--IMLTGGDNK-------DSIFEESE 390
C D L G + W +Q C E +G N+ +++
Sbjct: 327 CIDARYQGSVEYYLWGVENFDASRPWYYQTCNEYGWYQSSGSSNQPFGTKFPATLYINLC 386
Query: 391 EDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
D + +RY E V+ FGG + G+ NI +G DPW+ G
Sbjct: 387 GDVFS-SRYGNEQINVNAANT--NAYFGGMEPGV------ENIYMTHGALDPWNPMGHGV 437
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
T++A +H D D + ++ + + ++ +W++
Sbjct: 438 EQGATIIA----NASHCADFSSIKPTDSEEMRASKEKLAGLVRQWLA 480
>gi|359476844|ref|XP_003631898.1| PREDICTED: probable serine protease EDA2-like [Vitis vinifera]
Length = 477
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 131/466 (28%), Positives = 196/466 (42%), Gaps = 66/466 (14%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q +DHF+ P + F QRY ++ + PIF+ E + + ++ +A
Sbjct: 45 QTVDHFS--PLDHSKFPQRYYEFTDYF--RLPDGPIFLKICGEASCDGIPND--YISVLA 98
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IIDL 173
KF A +V +EHRYYGKS P+ + K YLSS QAL D A +++
Sbjct: 99 KKFGAAVVSLEHRYYGKSSPFRSLRTENLK------YLSSKQALFDLAVFRQYYQESLNV 152
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K N + ++P VFG SY G L+AWFRLK+PH+ G+LASSA +L + Y+F+
Sbjct: 153 KVNRSNVENPWFVFGVSYAGALSAWFRLKFPHLTCGSLASSAVVL------AIYNFTEFD 206
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVY 293
Q S C V++ +E+ G A + +L I++ F+Y
Sbjct: 207 RQIGESAGAECKAVLQEVNGLVEQRLAVDG------NAVKSLFGAASLKIDA----DFLY 256
Query: 294 TAMTDYPTPSNFLNP-LPAFPVKEMCKAIDDPKTGNDVFAKLY-----GAASVYYNYSGT 347
T + NP + P+ + K +D + + K Y G + YN
Sbjct: 257 LLADAAVTAFQYGNPDMLCSPLVQAKKDGEDLENAYATYVKEYYLGTFGVSIETYNQQHL 316
Query: 348 AKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD 407
+ SD W +Q CTE+ + DS+ SE + CK +G
Sbjct: 317 KRTNSAGDTSD----RLWWFQVCTEVAFFQVAPSDDSM-RSSEINTKYHLDLCKNVFGNG 371
Query: 408 PRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGA 465
P+ T +GG KI S IIF NG +DPW L + L+
Sbjct: 372 IYPDVDATNIYYGGTKIA------GSKIIFTNGSQDPWRHASKLTSTPDMPSFLISCHNC 425
Query: 466 -HHVDLR------FSTKED------PQWLKDVRRREVEIIGKWISQ 498
H DLR S + D P + VR++ +E I W+SQ
Sbjct: 426 GHGTDLRGCPQSPLSPEGDAKNCSSPDAVHKVRQQIIEHIDLWLSQ 471
>gi|414870775|tpg|DAA49332.1| TPA: putative serine peptidase S28 family protein [Zea mays]
Length = 509
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 130/469 (27%), Positives = 189/469 (40%), Gaps = 73/469 (15%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHFN P ++ F+QRY ++ N PIF+Y E N ++ VA
Sbjct: 63 QTLDHFN--PTDHRQFKQRYYEFLDYYRAP--NGPIFLYICGESTCNGIGNN--YLAVVA 116
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT---GYLSSTQALADYASL------I 170
KF A LV EHRYYGKS P+ N+ TT +LSS QAL D A
Sbjct: 117 KKFGAALVSPEHRYYGKSSPF---------NSLTTENLQFLSSKQALFDLAVFRQYYQET 167
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
++ K N + +S VFGGSY G L+AWFRLK+PH+ G+LASS +L + Y+F+
Sbjct: 168 LNAKYNRSGANSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL------AVYNFT 221
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
+ Q S C ++ ++ + G +++ F K E + L+ A
Sbjct: 222 DFDKQIGISAGPECKAALQEITGLVD--GQLQSGRNSVKELFGATKLENDGDFLYLLADA 279
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--NYSGTA 348
Y P +PL + K G D+ YY + +
Sbjct: 280 AAIA--FQYGNPDVLCSPL-----------AEAKKNGTDLVETFASYVKDYYIGKFGASV 326
Query: 349 KCFDL----NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
+D N W +Q C+E+ DS+ + D C+ +
Sbjct: 327 ASYDQQYLKNTTPAESSYRLWWYQVCSEVAYFQVAPKNDSV-RSPKIDTRYHLDLCRNVF 385
Query: 405 GVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE- 461
G P+ T +GG I S I+F NG +DPW K+ + L+E
Sbjct: 386 GEGVYPDVFMTNLYYGGTGIA------GSKIVFANGSQDPWRHASKQKSSDELPSYLIEC 439
Query: 462 KEGAHHVDLR------------FSTKEDPQWLKDVRRREVEIIGKWISQ 498
K H DL S P+ L VR+ V+ I W+S+
Sbjct: 440 KNCGHCSDLSGCPQAPSNIEGDSSKCSPPEALNKVRKEIVDHIDLWLSE 488
>gi|195391898|ref|XP_002054596.1| GJ22720 [Drosophila virilis]
gi|194152682|gb|EDW68116.1| GJ22720 [Drosophila virilis]
Length = 487
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 121/467 (25%), Positives = 207/467 (44%), Gaps = 61/467 (13%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T++ +Q LD+FN + + + + R LIN+ ++ + +PIF+Y G E IE A +G
Sbjct: 59 ETRWISQKLDNFNVSNE--EVWDDRVLINEDYF---VDGSPIFIYLGGEWKIEPSAITSG 113
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
D+A + LV+ EHR++G+SIP I + + Y + QALAD ++I L
Sbjct: 114 LWVDIAREHNGSLVYTEHRFFGESIP------IKPLSTANLKYQNVEQALADVVNVINVL 167
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
KK DS VV+ G SY +A W +L YP V +G+ ASSAP+ + V + ++
Sbjct: 168 KKEDKYKDSKVVISGCSYSATMAVWLKLLYPDVIVGSWASSAPL---EAKVDFKDYMKVV 224
Query: 234 TQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL---S 288
+ +R + + CY +I + Q E+ G E +K +C ++N + W +
Sbjct: 225 GKAYRELGGDYCYNIIDNATSQYEQLFASGNGTEA-KKILNLCDDFDENDEQDQWQIFST 283
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID--DPKTGNDVFA----KLYGAASVYY 342
A V+ + Y P N + + + C + DP V +L A V
Sbjct: 284 IANVFAGIAQYQKPEN-------YDLAQYCSVLRSFDPDDAVAVSKFVQWRLNYPACVNT 336
Query: 343 NYSGTAKCFDLNGDS-DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR--ARY 399
Y GT + + D+ D GL+ W +Q C E +K F S + A
Sbjct: 337 RYKGTVAYYKWSMDNYDGSGLA-WFYQTCREFGWFQSSGSKSQPFGSS---FPATLYTDT 392
Query: 400 CKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS--GGGVLK 450
C + +G ++ ++GG + N+ G DPWS G G+ +
Sbjct: 393 CHDVFGSGYSSARIERYIRATNKKYGG------VNPAVENVYMTQGGLDPWSKVGAGLAQ 446
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
N A + + +H D + D L+ + R +++ +W++
Sbjct: 447 N------ATIIPQASHCSDSGSISATDSPGLRAAKERLAKLVREWLA 487
>gi|226506854|ref|NP_001149727.1| prolyl carboxypeptidase like protein precursor [Zea mays]
gi|195629836|gb|ACG36559.1| prolyl carboxypeptidase like protein [Zea mays]
Length = 509
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 130/469 (27%), Positives = 189/469 (40%), Gaps = 73/469 (15%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHFN P ++ F+QRY ++ N PIF+Y E N ++ VA
Sbjct: 63 QTLDHFN--PTDHRQFKQRYYEFLDYYRAP--NGPIFLYICGESTCNGIGNN--YLAVVA 116
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT---GYLSSTQALADYASL------I 170
KF A LV EHRYYGKS P+ N+ TT +LSS QAL D A
Sbjct: 117 KKFGAALVSPEHRYYGKSSPF---------NSLTTENLQFLSSKQALFDLAVFRQYYQET 167
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
++ K N + +S VFGGSY G L+AWFRLK+PH+ G+LASS +L + Y+F+
Sbjct: 168 LNAKYNRSGANSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL------AVYNFT 221
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
+ Q S C ++ ++ + G +++ F K E + L+ A
Sbjct: 222 DFDKQIGISAGPECKAALQEITGLVD--GQLQSGRNSVKELFGATKLENDGDFLYLLADA 279
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--NYSGTA 348
Y P +PL + K G D+ YY + +
Sbjct: 280 AAIA--FQYGNPDVLCSPL-----------AEAKKNGTDLVETFASYVKDYYIGKFGASV 326
Query: 349 KCFDL----NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
+D N W +Q C+E+ DS+ + D C+ +
Sbjct: 327 ASYDQQYLKNTTPAESSYRLWWYQVCSEVAYFQVAPKNDSV-RSPKIDTRYHLDLCRNVF 385
Query: 405 GVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE- 461
G P+ T +GG I S I+F NG +DPW K+ + L+E
Sbjct: 386 GEGVYPDVFMTNLYYGGTGIA------GSKIVFANGSQDPWRHASKQKSSDELPSYLIEC 439
Query: 462 KEGAHHVDLR------------FSTKEDPQWLKDVRRREVEIIGKWISQ 498
K H DL S P+ L VR+ V+ I W+S+
Sbjct: 440 KNCGHCSDLSGCPQAPSNIEGDSSKCSPPEALNKVRKEIVDHIDLWLSE 488
>gi|325186496|emb|CCA21036.1| lysosomal ProX carboxypeptidase putative [Albugo laibachii Nc14]
Length = 250
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 67/150 (44%), Positives = 92/150 (61%), Gaps = 12/150 (8%)
Query: 59 TQILDHFNYNPQSYQT------FQQRYLINDTHWGGSKNNAPIFVYTGNEG-DIEWFAQN 111
TQ +D F++ P SY ++QRYLIN+ W + AP F YTGNE D+ +A +
Sbjct: 106 TQRVDQFSW-PSSYSRSQPPLFYKQRYLINNETWDPNDPKAPTFFYTGNEASDVSLYANH 164
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
TG M++ A FKAL+VF EHR+YG S P+ ++ I S Y + QA+ADYA L+
Sbjct: 165 TGLMWEYAAHFKALIVFAEHRFYGLSQPFNSSQLIP----SHLRYRTHEQAIADYALLLE 220
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRL 201
++K PV+ FGGSYGGML+AWFR+
Sbjct: 221 SIQKRFHGDRHPVITFGGSYGGMLSAWFRI 250
>gi|13634059|sp|P90893.2|YM9I_CAEEL RecName: Full=Putative serine protease F56F10.1; Flags: Precursor
gi|351062821|emb|CCD70865.1| Protein F56F10.1 [Caenorhabditis elegans]
Length = 540
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 126/471 (26%), Positives = 199/471 (42%), Gaps = 54/471 (11%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMY 116
TQ LDHF+ P + +T+ Q+Y N S+NN+ IF+ G EG + +W A
Sbjct: 60 TQKLDHFD--PYNTKTWNQKYFYNPVF---SRNNSIIFLMIGGEGPENGKWAANPNVQYL 114
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
A +F A + +EHR++G S P I S+ YL++ QALAD A I + +
Sbjct: 115 QWAKEFGADVFDLEHRFFGDSWP------IPDMQTSSLRYLTTQQALADLAFFIEFMNQQ 168
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNII 233
+ V FGGSY G LAAWFR KYP + +G++ASSAP+ L+F Y ++ ++
Sbjct: 169 YGFKNPRWVTFGGSYPGSLAAWFRQKYPQLTVGSVASSAPVNLKLDF------YEYAMVV 222
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC----KSEKNLAIESWLST 289
D R C + K ++ Q+++ A G L F + + L I ++
Sbjct: 223 EDDLRITDPKCAQATKDAFVQMQKLALTAEGRNSLNNHFNLQPPFDANTTKLDINNFFGN 282
Query: 290 AF-VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF----------------A 332
F Y MT Y + V++MC + + + V A
Sbjct: 283 IFNTYQGMTQYTYDGQSNSTHSDKTVRKMCDIMTNATETDVVMRVENLFLWFNQMEPASA 342
Query: 333 KLYGAASVYYNYSGTAKCFDLNG-DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEE 391
L + Y++ DLN D W W C E+ L + +++F
Sbjct: 343 NLTVMPNSYWDVISQVGSGDLNVLGPDGAAARGWMWLCCNEIGFLQTTNQGNNVFGTGVP 402
Query: 392 DYDARARYCKEAYGVDPRPNWITTEFGGHK-----IGLVLKRFASNIIFFNGLRDPWSGG 446
+ C + +G + + I GG+K G A+N++ NG DPW
Sbjct: 403 -LNLFIDMCTDMFGDSMKMSQI---MGGNKKSQNYYGGADFYNATNVVLPNGSLDPWHAL 458
Query: 447 GVLKNI-SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
G I S+++L + AH D+ S +P L R E + ++I
Sbjct: 459 GTYGTIKSQSLLPYLINGTAHCGDMYPSYDGEPGSLLAARAFVKENVRQFI 509
>gi|194744689|ref|XP_001954825.1| GF16547 [Drosophila ananassae]
gi|190627862|gb|EDV43386.1| GF16547 [Drosophila ananassae]
Length = 480
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 132/528 (25%), Positives = 227/528 (42%), Gaps = 82/528 (15%)
Query: 2 ATRFIFLSFCLLFS--STLTISNAKIFPTFPSS-----RITPEKLSSLISSSKDSQGLYK 54
A R + L+ L+ L S AK P + R P+++ + ++ ++
Sbjct: 3 AIRLVCLAIVLVLGVGHALDFSKAKDVPVLVKTLKNLNRGPPQQVVTKRANVQE------ 56
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
K+ TQ LD N++ + +T+Q RYLIND + +PIF+Y G E IE + G
Sbjct: 57 -KWITQKLD--NFDDSNTETYQMRYLINDEF---QTDGSPIFIYLGGEWTIEQSMVSAGH 110
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
YD+A + K +LV+ EHRYYG+SIP ++ +N YL QALAD A I LK
Sbjct: 111 WYDMAQEHKGVLVYTEHRYYGESIP---TTTMSTENLQ---YLHVKQALADVAHFITTLK 164
Query: 175 -KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
+N +S VV+ GGSY + WF+ YP + +G ASSAP+L V + ++
Sbjct: 165 SENAQLANSKVVLAGGSYSATMVVWFKRLYPDLVVGGWASSAPLL---AKVDFTEYKEVV 221
Query: 234 TQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS---EKNLAIESWL-S 288
+ F + + CY I+ ++E G E + R+C S + +L + S S
Sbjct: 222 GRAFLELGGQQCYNRIQNGIAELESLFANKRGAEA-RAMLRLCNSFDDQNDLDLWSLFGS 280
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
+ V++ + Y + ++ + + C + + A L VY+ + G
Sbjct: 281 ISNVFSGIAQYQSGND---------INDYCDYLLSFRDDATAIANL-----VYWAW-GVP 325
Query: 349 KCFD----------LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C D L G + W +Q C E +++ F ++
Sbjct: 326 SCIDATYASTVEYYLWGVENFDAGRPWYYQTCNEYGWYQSSGSRNQPF-GTKFPATLYTT 384
Query: 399 YCKEAYGVDPRPNWIT-------TEFGGHKIGLVLKRFASNIIFFNGLRDPWS--GGGVL 449
C + + +I+ +FGG + G+ N+ +G DPW+ G GV
Sbjct: 385 LCGDVFNSRYGNEYISNNVAQTNVDFGGMEPGV------ENVYMTHGALDPWNPMGHGVA 438
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ T++A +H D D ++ + + E++ +W++
Sbjct: 439 E--GATLIA----NASHCSDFGSIRSTDSAEMRASKEKLAELMREWLA 480
>gi|198437028|ref|XP_002124988.1| PREDICTED: similar to CG9953 CG9953-PA [Ciona intestinalis]
Length = 508
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 121/481 (25%), Positives = 202/481 (41%), Gaps = 75/481 (15%)
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
N++ + T++QRY N+ S N P+F+ G EG G +A + A+
Sbjct: 51 NFDSANVHTWKQRYFANNQF---STPNGPVFLSIGGEGPAGSIWMTKGHWVTMAKQTGAM 107
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVV 185
L IEHR+YGKS P + + LSS QALAD A+ I ++ +
Sbjct: 108 LFMIEHRFYGKSHP------TPDASLESLSVLSSEQALADIANFITNITAEYKLAGRKWI 161
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSPYSFSNIITQDFRSV-S 241
VFGGSY G LA W R KYPH+ GA+++SA PI+NFD + ++ + +++ S
Sbjct: 162 VFGGSYSGSLAIWARYKYPHLISGAVSASAPLHPIVNFD------GYQEVVQRSLQTLGS 215
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPT 301
C + + + +I K G + L F +C + + +++ V ++ D
Sbjct: 216 PKCVQNLANATTEITSLMKTTAGRKILTSEFNLCHALSDDVLDNQYFQESVAGSIQDVVQ 275
Query: 302 PS----NFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGD- 356
+ +F PAF + +C +DD G+ + K Y + C D +
Sbjct: 276 YNRDNMHFEGHGPAFNISYICHILDDVNLGSPL--KRYAEINRLTLKESNVSCLDSSYQK 333
Query: 357 --SDPH--------GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
SD G+ +W +Q CTE D+ F+ + + C++ +G+
Sbjct: 334 FVSDTKATSWDKATGMRQWLYQTCTEFGWFQSSDSTHQPFKGFPLKFSIQQ--CQDIFGI 391
Query: 407 DPRPNWI-------TTEFGGHKIGLVLKRFASNIIFFNGLRDPWS-----GGGVLKNISK 454
P+ I +TE G GL + +N+ +NGL DPWS G + N
Sbjct: 392 ---PSEIIYKGVQRSTENYG---GLSVAGLVTNVTLYNGLIDPWSDVSYMAGNLNLNPEN 445
Query: 455 TVL-------------------ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKW 495
T+L +L+ AH + ++ +D +LK R + +W
Sbjct: 446 TLLRPKRNVLSMSKSRTDVGIVSLIVPNTAHCAIMYPASDKDSIYLKKARLDVENAVKEW 505
Query: 496 I 496
+
Sbjct: 506 L 506
>gi|326532846|dbj|BAJ89268.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 522
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 122/468 (26%), Positives = 196/468 (41%), Gaps = 69/468 (14%)
Query: 59 TQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYD 117
+Q LDHF+ P ++ F+QRY D H N P+F+ E + N ++
Sbjct: 61 SQTLDHFS--PTDHRQFKQRYYEFLDYH---RAPNGPVFLNICGEASCSGISNN--YLAV 113
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------II 171
+A KF A LV EHRYYGKS P+ +++ +N +LSS QAL+D A +
Sbjct: 114 MAKKFGAALVSPEHRYYGKSSPF---EDLTTENLR---FLSSKQALSDLAVFRQYYQETL 167
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+ K N + D+ VFGGSY G L+AWFRLK+PH+ G+LASS +L + Y+F++
Sbjct: 168 NAKYNRSGADNSWFVFGGSYSGALSAWFRLKFPHLTCGSLASSGVVL------AVYNFTD 221
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
Q S C ++ + ++ + G +++ F E + L+ A
Sbjct: 222 FDRQIGESAGPECKAALQEITRLVD--GQLQSGNNSVKELFGAKMLENDGDFLYLLADAA 279
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--NYSGTAK 349
Y P +PL ++ K G D+ + YY + +
Sbjct: 280 AIA--FQYGNPDVLCSPL-----------VEAKKNGTDLVEAFAHYVNSYYVGRFKASVA 326
Query: 350 CFDL----NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
+D N W +Q C+E+ DS+ ++ D CK +G
Sbjct: 327 SYDQKYLKNTTPAESSYRLWWYQVCSEVSYFQVAPKNDSV-RSTKIDTRYHLDLCKNVFG 385
Query: 406 VDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKE 463
P+ T +GG +I S I+F NG +DPW K+ + L+E
Sbjct: 386 EGVYPDVSMTNLYYGGTRIA------GSKIVFANGSQDPWRHASKQKSSEELPSYLIECS 439
Query: 464 GAHH-------------VDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
H ++ S P+ + VR++ V+ I W+S+
Sbjct: 440 NCGHCTDISGCPQAPSNIEGDSSKCSSPEAVNKVRKQIVDHIDLWLSE 487
>gi|395736879|ref|XP_003776822.1| PREDICTED: thymus-specific serine protease isoform 2 [Pongo abelii]
Length = 541
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 120/518 (23%), Positives = 215/518 (41%), Gaps = 99/518 (19%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +ND HW G + PIF++ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVS--NRRSFLQRYWVNDQHWVG--QDGPIFLHLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + +LSS A
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLR------FLSSRHA 156
Query: 163 ---------------------LADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFR 200
LAD S + L + ++ SP + FGGSY G LAAW R
Sbjct: 157 MGKFSGIPSDEDRPSPPFDPRLADVVSARLALSRLFNISSSSPWICFGGSYAGSLAAWAR 216
Query: 201 LK------YPHVAIGALASSAP---ILNFDNIVSPYSFSNIITQDFRSV----SENCYKV 247
LK +PH+ ++ASSAP +L+F +++++++ S S C
Sbjct: 217 LKVLRLLRFPHLIFASVASSAPVRAVLDFSE------YNDVVSRSLMSTAIGGSLECRAA 270
Query: 248 IKGSWKQIEETAKKPGGLE-KLQKAFRIC--------KSEKNLAIESWLSTAFVYTAMTD 298
+ ++ ++E + G + L+ C ++E A+++ + Y
Sbjct: 271 VSVAFAEVERRLRSGGAAQAALRTELSACGPLGRAENQAELLGALQALVGGVVQYDGQAG 330
Query: 299 YPTPSNFLNPLPAFPVKEMCKAIDDP---KTGNDVFAKLYGAASVYYNYSGTAKCFDLN- 354
P V+++C + ++ + + L A + + G KC +
Sbjct: 331 AP-----------LSVRQLCGLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQ-KCLSFSR 378
Query: 355 -------GDSDPH----GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEA 403
++P G +W +Q CTE +N F + ++ C++
Sbjct: 379 AETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCENPRCPFSQLPA-LPSQLDLCEQV 437
Query: 404 YG-----VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
+G V + +GG G A+ ++F NG DPW V + + +
Sbjct: 438 FGLSALSVAQAVAQTNSYYGGQTPG------ANQVLFVNGDTDPWHVLSVTQALGSSEST 491
Query: 459 LVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
L+ + G+H +D+ D L+ R+ + + W+
Sbjct: 492 LLIRTGSHCLDMAPERPSDSPSLRLGRQNIFQQLQTWL 529
>gi|118487876|gb|ABK95760.1| unknown [Populus trichocarpa]
Length = 485
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 135/518 (26%), Positives = 213/518 (41%), Gaps = 78/518 (15%)
Query: 12 LLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTK--YHTQILDHFNYNP 69
L+F S L +S A P + + +S S +S+ T+ + Q LDHF+ P
Sbjct: 6 LVFISLLLLSTA------PYASLALRTQVHSLSESSNSKNYLTTQELWFNQTLDHFS--P 57
Query: 70 QSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFI 129
+ F QRY ++ S + PIF+ E + ++ +A KF A +V +
Sbjct: 58 FDHHKFPQRYYEFLDYFRIS--DGPIFLEICGESSCNGIVND--YISVLAKKFGAAVVSL 113
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IIDLKKNLTATDSP 183
EHRYYG+S+P+ K +N +LSS QAL D A ++LK N T+ ++P
Sbjct: 114 EHRYYGRSLPF---KSTTTENLR---FLSSKQALFDLAVFRQYYQESLNLKLNRTSVENP 167
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN 243
VFGGSY G L+AWFRLK+PH+ G+LASSA +L N F+ Q S
Sbjct: 168 WFVFGGSYAGALSAWFRLKFPHLTCGSLASSAVVLAIHN------FTEFDQQIGESAGAE 221
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPS 303
C ++ + + +EE L ++A + L I+ F+Y
Sbjct: 222 CKATLQETTQLVEER------LASNKQAVKALFDAAELEIDG----DFLYFLADAAVIAF 271
Query: 304 NFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA-ASVYY--NYSGTAKCFDLNGDSDPH 360
+ NP +C + K D + Y YY + + + ++ D
Sbjct: 272 QYGNP------DIVCSTLVKAKNNGDDLVEAYAKYVKEYYLGTFGSSVQTYNQKYLKDTS 325
Query: 361 -----GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY--GVDPRPNWI 413
G W +Q CTE+ + DSI S+ D CK + G+ P +
Sbjct: 326 LNKHTGDRLWWFQVCTEVAYFQVAPSNDSI-RSSKVDARYHLDLCKNVFGEGIYPEVDVT 384
Query: 414 TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHV-DLRF 472
+GG I S I+F NG +DPW + L+ H D+R
Sbjct: 385 NIYYGGTNIS------GSKIVFANGSQDPWRHASKQTSSPDMPSFLISCHNCGHCTDIRG 438
Query: 473 STK------------EDPQWLKDVRRREVEIIGKWISQ 498
+ P+ ++ VR + +E + W+S+
Sbjct: 439 CPQTPLSLEGNARNCSSPEAVEKVRHQIIEKMDLWLSE 476
>gi|326520219|dbj|BAK04034.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 122/468 (26%), Positives = 196/468 (41%), Gaps = 69/468 (14%)
Query: 59 TQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYD 117
+Q LDHF+ P ++ F+QRY D H N P+F+ E + N ++
Sbjct: 61 SQTLDHFS--PTDHRQFKQRYYEFLDYH---RAPNGPVFLNICGEASCSGISNN--YLAV 113
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------II 171
+A KF A LV EHRYYGKS P+ +++ +N +LSS QAL+D A +
Sbjct: 114 MAKKFGAALVSPEHRYYGKSSPF---EDLTTENLR---FLSSKQALSDLAVFRQYYQETL 167
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+ K N + D+ VFGGSY G L+AWFRLK+PH+ G+LASS +L + Y+F++
Sbjct: 168 NAKYNRSGADNSWFVFGGSYSGALSAWFRLKFPHLTCGSLASSGVVL------AVYNFTD 221
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
Q S C ++ + ++ + G +++ F E + L+ A
Sbjct: 222 FDRQIGESAGPECKAALQEITRLVD--GQLQSGNNSVKELFGAKMLENDGDFLYLLADAA 279
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--NYSGTAK 349
Y P +PL ++ K G D+ + YY + +
Sbjct: 280 AIAFQ--YGNPDVLCSPL-----------VEAKKNGTDLVEAFAHYVNSYYVGRFKASVA 326
Query: 350 CFDL----NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
+D N W +Q C+E+ DS+ ++ D CK +G
Sbjct: 327 SYDQKYLKNTTPAESSYRLWWYQVCSEVSYFQVAPKNDSV-RSTKIDTRYHLDLCKNVFG 385
Query: 406 VDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKE 463
P+ T +GG +I S I+F NG +DPW K+ + L+E
Sbjct: 386 EGVYPDVSMTNLYYGGTRIA------GSKIVFANGSQDPWRHASKQKSSEELPSYLIECS 439
Query: 464 GAHH-------------VDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
H ++ S P+ + VR++ V+ I W+S+
Sbjct: 440 NCGHCTDISGCPQAPSNIEGDSSKCSSPEAVNKVRKQIVDHIDLWLSE 487
>gi|195451231|ref|XP_002072825.1| GK13807 [Drosophila willistoni]
gi|194168910|gb|EDW83811.1| GK13807 [Drosophila willistoni]
Length = 481
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 132/509 (25%), Positives = 221/509 (43%), Gaps = 57/509 (11%)
Query: 9 SFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGL--YKTKYHTQILDHFN 66
SF L+ + TL ++ AK F T +KL S + Q + +T + Q LDHFN
Sbjct: 8 SFALILAVTL-LAQAKGDSIFQH---TFKKLHSEPPVPVNQQRVDQVETLWIEQKLDHFN 63
Query: 67 YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALL 126
+ +T+Q RY++N+ + ++ P+F++ G E +I +G MYD+A + K LL
Sbjct: 64 --DEDTRTWQMRYMLNEALY---ESGGPLFIFLGGEWEISTGRITSGHMYDMAKEHKGLL 118
Query: 127 VFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA-TDSPVV 185
+ EHR+YG+S P + + + YLS QALAD A I K N DS V+
Sbjct: 119 AYTEHRFYGESKP------LDDLSVESLEYLSVKQALADLAHFIRTQKANYAGLADSKVI 172
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN-C 244
+ GGSY + WF+ YP + G +SSAP+ N V + I Q V + C
Sbjct: 173 IVGGSYSASMVVWFKRTYPDLVAGGWSSSAPLYAKVNFV---EYKEITGQSIAQVGGSAC 229
Query: 245 YKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSN 304
Y I+ ++E+ G E ++ ++C+ + S L +++ ++D +
Sbjct: 230 YNRIEKGISELEQLLADKRGAE-VKALLKLCEP---FDVNSDLDVWTLFSEISD--IFAG 283
Query: 305 FLNPLPAFPVKEMC-KAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLS 363
+ A ++ +C K + +P V + L G G KC DL+ + L
Sbjct: 284 VVQTHNAGQIEGVCDKILSEPDDLIGVTSYLLGVFE-----QGGGKCNDLSYKAILSELL 338
Query: 364 E----------WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD------ 407
E W +Q C E + + F ++ C + YG
Sbjct: 339 ETKYTGNIMRQWIYQTCNEYGWYQTSGSSNQPF-GTKFPLTLYTTMCADIYGEKFSNEFI 397
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHH 467
W T E+ G L+ +I +G DPW G+ ++ S ++A V + AH
Sbjct: 398 TNQVWDTNEYFGR-----LEPGVYDIHITHGQLDPWRAMGI-QDESLELVATVIPDYAHC 451
Query: 468 VDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
D + D + ++D + E++ KW+
Sbjct: 452 KDFGSISDSDSEEMRDSKLAVAELVRKWV 480
>gi|335291865|ref|XP_003356607.1| PREDICTED: thymus-specific serine protease [Sus scrofa]
Length = 514
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 117/483 (24%), Positives = 205/483 (42%), Gaps = 78/483 (16%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + Q LD FN + + Q+F QRY +ND +W + + P+F++ G EG + + G
Sbjct: 58 KEGWLEQPLDPFNASDR--QSFLQRYWVNDQYW--TSQDGPVFLHLGGEGSLGPGSVMRG 113
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+AP + AL++ +EHR+YG SIP G + +LSS ALAD S + L
Sbjct: 114 HPAALAPVWGALVIGLEHRFYGLSIPAEGLGMAKLR------FLSSRHALADVVSARLAL 167
Query: 174 KKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---ILNFDNIVSPYSF 229
+ ++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L+F +
Sbjct: 168 TRLFNVSSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDFSE------Y 221
Query: 230 SNIITQDFRSV----SENCYKVIKGSWKQIE-ETAKKPGGLEKLQKAFRICKSEKNLA-- 282
++++++ + S C + ++ ++E L LQ C S + A
Sbjct: 222 NDVVSKSLMNTAIGGSPECRAAVSSAFAEVERRLRAGGAALAALQAELGACGSLRRAADR 281
Query: 283 ------IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI---DDPKTGNDVFAK 333
+++ + A Y P V+++C + D + +
Sbjct: 282 EELLGTLQALVGGAVQYDGQVGAP-----------LSVRQLCGLLLGDRDNSSSPAPYLG 330
Query: 334 LYGAASVYYNYSGTAKCFDLNGD--------SDPH----GLSEWGWQACTEMIMLTGGDN 381
L+ A V + G KC + ++P G +W +Q CTE ++
Sbjct: 331 LHRAVQVVTHSLGQ-KCLSFSRAETVAQLRVTEPQVSSLGERQWLYQTCTEFGYFVTCED 389
Query: 382 KDSIFEESEEDYDARARYCKEAYGVD--------PRPNWITTEFGGHKIGLVLKRFASNI 433
F + + C++ +G+ R N + +GG G A+ +
Sbjct: 390 PGCPFSQLPA-LPSHLELCEQVFGLSTSSIAQAVARTN---SYYGGQTPG------ATQV 439
Query: 434 IFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIG 493
+F NG DPW V + + + A++ +H +D+ D L+ R+ +
Sbjct: 440 LFVNGDMDPWHVLSVTQALGPSESAILIPGASHCLDMAPEKPSDSPSLRLARQNIFLQLQ 499
Query: 494 KWI 496
W+
Sbjct: 500 TWL 502
>gi|74004232|ref|XP_545414.2| PREDICTED: thymus-specific serine protease [Canis lupus familiaris]
Length = 521
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 120/483 (24%), Positives = 202/483 (41%), Gaps = 69/483 (14%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + Q LD FN + ++F QRY +ND HW + P+F++ G E + + G
Sbjct: 56 KQGWLEQPLDPFNAS--DTRSFLQRYWVNDQHW--TSQRGPVFLHLGGESSLRSGSVLRG 111
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+AP + AL++ +EHR+YG S+P GG + + +LSS ALAD AS + L
Sbjct: 112 HPTALAPAWGALVIGLEHRFYGLSVPAGG------LDVAQLRFLSSRHALADVASARLAL 165
Query: 174 KKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFD--NIV--S 225
+ ++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ L+F N V S
Sbjct: 166 ARLFNVSSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRATLDFSEYNEVRGS 225
Query: 226 PYSFSNIITQDF--RSVS-ENCYKVIKGSWKQIEETAKKPGGLEK-LQKAFRIC------ 275
++ R VS C ++ ++E G L+ C
Sbjct: 226 GRQVHAVLGGGLRPRGVSLPQCRAAASAAFAEVERRLHAGGATRSALRAELGACSPLDRA 285
Query: 276 --KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK 333
++E A+++ + A Y A P V+ +C + + ++ A
Sbjct: 286 EDQAELLGALQALVGGAVQYDAQAGVP-----------LSVRGLCGLLLEGRSNRSRSAP 334
Query: 334 LYG---AASVYYNYSGTAKCFDLN----------GDSDPHGLS--EWGWQACTEMIMLTG 378
+G A V + G +C + DS G+ +W +Q CTE
Sbjct: 335 YHGLRRAVQVVMHSLGQ-RCLSFSRAETVAQLKVTDSQVSGVGDRQWLYQTCTEFGFYIT 393
Query: 379 GDNKDSIFEESEEDYDARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASNI 433
+ F + ++ C++ +G+ + +GG G A+ +
Sbjct: 394 CEGPRCPFSQVPA-LPSQLELCEQVFGLSASSVVQAVAQTNSYYGGQTPG------ATQV 446
Query: 434 IFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIG 493
+F NG DPW + + AL+ +H +D+ D L+ R+ + +
Sbjct: 447 LFINGDTDPWHVLSITQASGPLESALLIPSASHCMDMAPDRPSDSPSLRLGRQSIFQQLQ 506
Query: 494 KWI 496
W+
Sbjct: 507 TWL 509
>gi|328718791|ref|XP_001947511.2| PREDICTED: putative serine protease K12H4.7-like [Acyrthosiphon
pisum]
Length = 500
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 137/517 (26%), Positives = 214/517 (41%), Gaps = 50/517 (9%)
Query: 10 FCLLFSSTLTISNAKIFPTFPS--SRITPEKLSSLISSSKDSQGL--YKTKYHTQILDHF 65
FCL++ +A+ F +F S P + S I +S + L + + Q LDHF
Sbjct: 5 FCLIWCCLWCGVSAR-FKSFTSLVGIGEPNRRPSSIGTSLRAATLNFIREDFFDQKLDHF 63
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
N P +T++QRY + H K P+F+ G E I G M + A KF A+
Sbjct: 64 N--PTDNRTWKQRYQSHSLH---HKIGGPVFMLLGGEEKISNAWLKDGSMMEYAEKFNAM 118
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVV 185
+EHRYYG S P N + YLS QALAD A I +K +
Sbjct: 119 CFQLEHRYYGDSYPTDN------LNTTNLKYLSIKQALADVAEFI-KVKSQNPLYKGKWI 171
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPI-LNFDNIVSPYSFSNIITQDFRSVSENC 244
+FGGSY G LAAW R YP++ A++SS+ I DNI + + + + C
Sbjct: 172 LFGGSYPGSLAAWARKTYPNLVHAAVSSSSVIKTRIDNI----DYFKVAEKALTDYNPKC 227
Query: 245 YKVIKGSWKQIEETAKKPGGLEKLQKAFRIC--------KSEKNLAIESWLSTAFVYTAM 296
I+ + I + G + +Q F++C K + L + + A
Sbjct: 228 VSNIRQATMMISDLLDSENGTKYVQSKFKVCYRINTNVKKDVRQLFQQLSIPIAETIQYN 287
Query: 297 TDYPTPSNFLNPLPAFPVKEMC-----KAIDDPKTGNDVFAK-----LYGAASVYYNYSG 346
D SN + +C +A+ P V+ K L S Y S
Sbjct: 288 RDNRYYSNM--EKADLSITALCDIMLNRALGSPFDRYVVYHKKVRHVLKRRCSSYSYQSL 345
Query: 347 TAKCFDLN--GDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
++N G S G +W +Q CTE+ + D +F + D C + +
Sbjct: 346 LQSNSEINWHGQSVKSGDRQWYYQLCTEIGNFVTSNEDDHLFGNNIP-IDFFIDLCTDVF 404
Query: 405 GVDPRPNWITTEFGGHKIGLV---LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE 461
G N + E HK ++ LK S +I+ +G DPW+G G+ + S +++
Sbjct: 405 GEHFDLNKL--EKAVHKTTMMYHGLKNTTSRVIYLHGSFDPWNGLGLTEPESDDSISINI 462
Query: 462 KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ +H DL S+ +DP L R + KW+++
Sbjct: 463 EGVSHCADLYTSSPKDPPQLSKARETVTFYLNKWLTE 499
>gi|281346012|gb|EFB21596.1| hypothetical protein PANDA_016894 [Ailuropoda melanoleuca]
Length = 509
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 124/503 (24%), Positives = 214/503 (42%), Gaps = 67/503 (13%)
Query: 37 EKLSSLISSSKDSQGLY-----------KTKYHTQILDHFNYNPQSYQTFQQRYLINDTH 85
+L I ++SQGL K + Q LD FN + Q ++F QRY +ND H
Sbjct: 23 RRLGEHIQQFQESQGLSLTLGPGAGALPKQGWLEQPLDPFNTSDQ--RSFLQRYWVNDQH 80
Query: 86 WGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGN-- 143
W + P+F++ G EG + + G +AP + AL++ +EHR+YG SIP G
Sbjct: 81 W--ASQRGPVFLHLGGEGSLRSGSVMRGHPAALAPAWGALVIGLEHRFYGLSIPAEGLDV 138
Query: 144 ---KEIAYKNASTTGYLS--STQAL--ADYASLIIDLKKNL-TATDSPVVVFGGSYGGML 195
+ ++ ++A G S Q+L AD AS L + +T SP + FGGSY G L
Sbjct: 139 AQLRFLSSRHAECAGTPSEEGPQSLPSADVASARRALARLFNVSTASPWICFGGSYAGSL 198
Query: 196 AAWFRLKYPHVAIGALASSAP---ILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSW 252
AAW RLK+PH+ + ++ASSAP +L+F ++ + T S+
Sbjct: 199 AAWARLKFPHLILASVASSAPVRAVLDFSE------YNEVRTAGGGCGGGRGCGEPPDSF 252
Query: 253 KQIEETAKKPGGLEKLQK--AFRICKS--------EKNLAIESWLSTAFVYTAMTDYPTP 302
+ GG++K + + C S E A+++ + A Y A P
Sbjct: 253 SR----GPTVGGVQKPNEHGCWPACSSLGGAEDQAELLGALQALVGGAVQYDAQAGVPLS 308
Query: 303 SNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPH-- 360
L L P + + V + G + ++ + T + +DPH
Sbjct: 309 VRRLCGLLLGPSGSRSASYHGLRRAVQVVMRSLGQRCLSFSRAETVAQLRV---TDPHVS 365
Query: 361 --GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP-----NWI 413
G +W +Q CTE ++ F + ++ C++ +G+ +
Sbjct: 366 GVGDRQWLYQTCTEFGFYVTCEDPRCPFSQLPA-LPSQLELCEQVFGLSTASIARAVSQT 424
Query: 414 TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFS 473
+ +GG G +++++F NG DPW V + + + AL+ +H +D+
Sbjct: 425 NSYYGGQTPG------STHVLFVNGDTDPWHVLSVTQALGPSESALLIPGASHCMDMAPE 478
Query: 474 TKEDPQWLKDVRRREVEIIGKWI 496
D L+ R+ ++ + W+
Sbjct: 479 RPSDSPSLRLGRQSILQQLQTWL 501
>gi|186516786|ref|NP_567998.3| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|145651776|gb|ABP88113.1| At4g36190 [Arabidopsis thaliana]
gi|332661229|gb|AEE86629.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 482
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 145/533 (27%), Positives = 226/533 (42%), Gaps = 100/533 (18%)
Query: 8 LSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTK-YHTQILDHFN 66
L F LL + T+ +S + +F + + P ++S +S K S+ L + + + TQ LDH
Sbjct: 5 LGFALLTTFTVLLS----YLSFSNGLLQPRRISHGLS--KSSKYLTRDELWFTQTLDH-- 56
Query: 67 YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALL 126
Y+P ++ F+QRY H + PIF+ EG N ++ +A KF A +
Sbjct: 57 YSPSDHRKFRQRYYEYLDHL--RVPDGPIFLMICGEGPCNGITNN--YISVLAKKFDAGI 112
Query: 127 VFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IIDLKKNLTA- 179
V +EHRYYGKS P+ K +A KN YLSS QAL+D A+ +++K N ++
Sbjct: 113 VSLEHRYYGKSSPF---KSLATKNLK---YLSSKQALSDLATFRQYYQDSLNVKFNRSSN 166
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRS 239
++P FG SY G L+AWFRLK+PH+ G+LASSA + + Y F Q S
Sbjct: 167 VENPWFFFGVSYSGALSAWFRLKFPHLTCGSLASSAVVR------AVYEFPEFDQQIAES 220
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVY------ 293
C ++ + K +E GL+ +A + + L +++ F+Y
Sbjct: 221 AGPECETALQETNKLLEL------GLKVNNRAVKALFNATELDVDA----DFLYLIADAG 270
Query: 294 TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV---FAK--------LYGAASVYY 342
Y P PL ++ K G D+ +AK ++G +S Y
Sbjct: 271 VMAIQYGNPDKLCVPL-----------VEAQKNGGDLVEAYAKYVREFCMGVFGQSSKTY 319
Query: 343 NYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKE 402
+ +D W +Q CTE+ DSI + + + CK
Sbjct: 320 SRKHLLDTAVTLESAD----RLWWFQVCTEVAYFQVAPANDSI-RSHQINTEYHLDLCKS 374
Query: 403 AY--GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA-- 458
+ GV P + +G KI A+ IIF NG +DPW K S L
Sbjct: 375 LFGKGVYPEVDATNLYYGSDKIA------ATKIIFTNGSQDPWRHAS--KQTSSPDLPSY 426
Query: 459 -LVEKEGAHHVDLRFSTK------------EDPQWLKDVRRREVEIIGKWISQ 498
+ H DLR + P + VR+ ++ I W+S+
Sbjct: 427 IMTCHNCGHGSDLRGCPQSAMVIEGDAQNCSSPDAVNKVRQHMIKHIDLWLSE 479
>gi|118346703|ref|XP_976866.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89288597|gb|EAR86585.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 485
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 120/456 (26%), Positives = 201/456 (44%), Gaps = 43/456 (9%)
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
T Y TQ +DHF+ P S T+ QR+ + + + N +F++ G EG + +G+
Sbjct: 40 TYYFTQKVDHFD--PSSTDTYNQRFTVYSEAFNPA--NGTVFIFIGGEGPQQGLTTGSGW 95
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
VA +F A+++ +EHR+YG S P+G ++ +L+ Q+LAD A I +K
Sbjct: 96 YMLVAQQFSAMVICVEHRFYGVSQPFGQGQDAW--TVDHLKFLTVDQSLADLAYFISYIK 153
Query: 175 KN--LTATD-SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
N L D +P + GGSY G ++AWFR KYPH+ IGA ASSA ++N Y +
Sbjct: 154 ANNFLRINDRNPFITVGGSYPGAMSAWFRYKYPHLTIGAHASSA-VVNAIMDFQQYDYQ- 211
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPG-GLEKLQKAFRICKSEKNLAIESWLST- 289
I T S E K+ K + + +EE + G + L+ F+ +NL + +LS
Sbjct: 212 IYTSTSLSGPECPIKIQKFN-EIVEEILTQNGEAAQNLKTLFK----AQNLQNDDFLSYF 266
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV--YYNYSGT 347
++ M Y + + E K + D +A G V Y S T
Sbjct: 267 GDLWAGMVQYGKRTVLCDLFAPDTFGEQLKLVVD-------YAITQGNQPVDGYDTQSLT 319
Query: 348 AKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESE--EDYDARARYCKEAYG 405
+ N +W WQ CT + + + + Y + + +
Sbjct: 320 NTTYVANESG-----RQWTWQVCTYFGWFQSANQVQPMRSRTVNLQFYQNQCNVAFQNFQ 374
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK--E 463
P+ + + T +GG + A NI+F NG+ D W + + + A++ +
Sbjct: 375 NFPKSDLVNTFYGGANLQ------AFNIVFTNGVEDEWQWASI-RYPQGNMDAIISNCTD 427
Query: 464 GAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
H V+ R+ ED L+ R ++ KWI+++
Sbjct: 428 CGHCVEFRYPKPEDSPQLQQTRASLIQHYTKWITEF 463
>gi|344247366|gb|EGW03470.1| Thymus-specific serine protease [Cricetulus griseus]
Length = 265
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 86/254 (33%), Positives = 131/254 (51%), Gaps = 17/254 (6%)
Query: 1 MATRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLY---KTKY 57
MA + +L F LL S A + ++ SS+ SQGL K +
Sbjct: 1 MAVKAPWLGFLLLVSLWGLSDPALLLRRLREHIQKFQESSSMNPGFGLSQGLVAVPKQGW 60
Query: 58 HTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYD 117
Q LD FN + + +TF QRY +ND H G +AP+F++ G EG + + G
Sbjct: 61 LEQPLDPFNTSDR--RTFLQRYWVNDRHRAG--QDAPVFLHIGGEGSLGPGSVMAGHPVA 116
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
+AP + AL++ +EHR+YG S+P GG + + YLSS ALAD AS L + L
Sbjct: 117 LAPAWGALVISLEHRFYGLSMPSGG------LDMAQLRYLSSRHALADVASARQALSRLL 170
Query: 178 -TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
++ SP + FGGSY G LA W RLK+PH+ A+ASSAP+ ++ Y+++ ++ +
Sbjct: 171 NVSSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPL---SAVLDFYAYNEVVARS 227
Query: 237 FRSVSENCYKVIKG 250
V+ + + G
Sbjct: 228 LSQVAIGGSQEVGG 241
>gi|299473646|emb|CBN78040.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 559
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 133/495 (26%), Positives = 205/495 (41%), Gaps = 82/495 (16%)
Query: 38 KLSSLISSSKDS---QGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAP 94
+L +L+ D+ +G + + Q LDHF+ Q+ + F QRY IN +W G+ + AP
Sbjct: 49 RLGALLDPLVDNGIPRGDVEELFVEQRLDHFDR--QNSRKFLQRYFINKKYWAGASSGAP 106
Query: 95 IFVYTGNEG-----DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
+F+ G EG ++ + + M ++AP+ AL++ +EHRYYGKS P + +
Sbjct: 107 VFLCVGGEGPPLEANVLSESVHCNDMLELAPEHNALVLAVEHRYYGKSNPGDDWATDSLR 166
Query: 150 NASTTGYLSSTQALADYASL--IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVA 207
+LSS QALAD +S + K+ LT + V +GGSY GMLA W RLKYPH+
Sbjct: 167 ------WLSSQQALADLSSFHGFLSDKEGLTGAEK-WVTWGGSYPGMLAGWARLKYPHLF 219
Query: 208 IGALASSAPI---LNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGG 264
A++SS+P+ L+F S D SE C ++ I E G
Sbjct: 220 HAAVSSSSPMKAQLDFPQYAEVMRDSLASGVDGVGGSEECASAVEAGHASIGELLLTEEG 279
Query: 265 LEKLQKAFRICKSEK----------------NLAIES--------WLSTAFVYTAMTDYP 300
+L F++C + L I+ + V MTD
Sbjct: 280 QLELVATFQLCDASSLQDEDARVLFAGDGVVTLPIQGNDPACNGMVCNIRAVCEIMTDAT 339
Query: 301 TPSNF--LNPLPAFPVKEMCKAIDDPKTGN---DVF--------AKLYG------AASVY 341
S L + +AI++P++ +VF A YG A +
Sbjct: 340 RGSEVERLAAIRKIQRSSDAEAIEEPRSSPSPCEVFPPKDMNTAAGRYGGRSRERGAGLR 399
Query: 342 YNYSGTAKCFDLNGDSDPHGLSE-WGWQACTEM-IMLTGGDNKDSIFEESEEDYDARARY 399
+Y G DP W +Q CTE T F + D
Sbjct: 400 RDYPGQEMKLLGRWAMDPSDPDRAWLYQTCTEFGFYQTCEVGTRCPFTQGLHTLDLDLAM 459
Query: 400 CKEAYGVDPRPNWITTE-------FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
C+EA+G+ R + + +GG + S +IF NG DPW GVL+
Sbjct: 460 CEEAFGI--RAEEVREQVRLTNLFYGGD------RPRGSRVIFPNGAIDPWHALGVLETP 511
Query: 453 SKTVLALVEKEGAHH 467
+ + A+ + +HH
Sbjct: 512 TPGLPAIYVEGASHH 526
>gi|145498935|ref|XP_001435454.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402586|emb|CAK68057.1| unnamed protein product [Paramecium tetraurelia]
Length = 452
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 128/466 (27%), Positives = 197/466 (42%), Gaps = 67/466 (14%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAP-----IFVYTGNEGDIEWFAQN 111
+ Q++DH Y+ + F QRY W +N P +F G I
Sbjct: 28 FEHQLVDH--YDKLNKNVFHQRY------WVVEENFVPETGVVLFQICGEYTCINDIKLR 79
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA---S 168
F+ +A +F AL++ +EHRYYGKS+P G KE + K+ + YLS+ QAL D A
Sbjct: 80 L-FIIQLAKEFNALIIILEHRYYGKSMPLG--KE-SLKDENLR-YLSTRQALDDLAYFQR 134
Query: 169 LIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS 228
++ KK+ + +P + GGSY G LAAW+R +YPH+ IGALASSA + S
Sbjct: 135 FMVLNKKHGIKSQNPWIAIGGSYPGALAAWYRYQYPHLVIGALASSAV------VESITD 188
Query: 229 FSNIITQDFRSVSEN---CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES 285
F TQ F S ++ C K ++ K E+ G E+ +++F +EK +E
Sbjct: 189 FKMFDTQIFLSAYKSGPQCAKDVQDMNKYAEQQILNQGTKEEFKRSF---GAEKLTDLEF 245
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN--DVFAKLYGAASVYYN 343
A + Y S E+CK + D D F + S
Sbjct: 246 LFFFADAQLLIIQYGGRS------------ELCKQLKDKSITEQIDYFRSVIEEGS---- 289
Query: 344 YSGTAKCFDLNGDSDPHGLS---EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
Y + N D + L+ +W +Q C+E+ +S+ + D +C
Sbjct: 290 YMEYGSYYLKNDKYDENNLTPSRQWMYQCCSELGWWQTSPLNNSV-RSTLIDIQFYKDFC 348
Query: 401 KEAYG-----VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
+G + P FGG+++ + N+I NG DPW VL N
Sbjct: 349 NSIFGGIRKNIFPDDQLANARFGGNELNV------DNLIMTNGNEDPWKWSSVLVNQGSI 402
Query: 456 VLALVEKEGAHH-VDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
+ + E + H V+L ED LK R+ + KWI ++
Sbjct: 403 LTYEINCENSGHCVELYTPKDEDCDQLKQARKDIISQFRKWIQDHY 448
>gi|392898873|ref|NP_500595.2| Protein F19C7.2 [Caenorhabditis elegans]
gi|373219710|emb|CCD69714.1| Protein F19C7.2 [Caenorhabditis elegans]
Length = 540
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 122/475 (25%), Positives = 203/475 (42%), Gaps = 52/475 (10%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI------EWFAQNT 112
TQ +DHF+ N + +QQRY N + +K +F+ G EG I +W
Sbjct: 54 TQKVDHFS-NGTNNGVWQQRYQYNSKFY--NKTTGYVFLMLGGEGSINVTNGDKWVRHEG 110
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
M +F+A +EHR+YG S Y I + ++ L+ QALAD I
Sbjct: 111 ETMMKWVAEFQAAAFQVEHRFYG-SKEY---SPIGDQTTASMKLLTIDQALADIKEFITQ 166
Query: 173 LKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+ D P+ V FGGSY G L+A+FR YP + GA++SS+ + F V Y ++
Sbjct: 167 MNALYFKDDKPIWVTFGGSYPGSLSAFFRETYPEMTAGAVSSSSAVHVF---VDYYEYAI 223
Query: 232 IITQDFRSVSENCYKVIKGSWKQ-IEETAKKPGGLEKLQKAFRIC----KSEKNLAIESW 286
+ +R+VS++C VIK +++ I + P L++ F +C +++ + +++ +
Sbjct: 224 NTEKTYRTVSDSCGDVIKVAFQNLITKAYSGPDSRALLKQRFNLCDNFDETKLSKSVQFF 283
Query: 287 LSTAFVYTAMTDYPTPSNFLNPL-PAFPVKEMCKAIDDPKTGNDVFAKLYGAASVY---- 341
+ Y + T N N V C ++D G+++ ++ ++Y
Sbjct: 284 FQNVYGYFQGINQYTGDNRNNATRSGLGVPAACNLLNDKTIGDEI-QRVIAVMNLYDSWY 342
Query: 342 ---------YNYSGTAKCFDLNG--DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESE 390
NY+ + + D D W WQ CTE+ D + S
Sbjct: 343 KPSDSGCRPNNYTAFIQAYSDTTMPDDDTISTRSWIWQTCTELGYYQTTDGGNGGIFGST 402
Query: 391 EDYDARARYCKEAYGVD-------PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
D A C + +G + + + T++GG +N+ F NG DPW
Sbjct: 403 VPLDFFADQCIDLFGPEYTLDNTFKLVDQVRTKYGGAD-----AYRGTNVCFPNGSFDPW 457
Query: 444 SGGGVLKNIS-KTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G G NI+ V + + AH D+ + D Q LKD R R + +W+S
Sbjct: 458 QGLGHTANITNNNVDSWLIDGTAHCADMYPARDSDKQSLKDARVRIHGHLSRWLS 512
>gi|357140818|ref|XP_003571960.1| PREDICTED: probable serine protease EDA2-like [Brachypodium
distachyon]
Length = 503
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 121/470 (25%), Positives = 199/470 (42%), Gaps = 73/470 (15%)
Query: 59 TQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYD 117
+Q LDHF+ P ++ F+QRY D H N P+F+ E + + ++
Sbjct: 56 SQTLDHFS--PTDHRQFKQRYYEFLDYH---RVPNGPVFLNICGESSCNGISNS--YLAV 108
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------II 171
+A KF A LV EHRYYGKS P+ K + +N +LSS QAL D A +
Sbjct: 109 IAKKFGAALVSPEHRYYGKSSPF---KSLTTENLR---FLSSKQALFDLAVFRQYYQETL 162
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+ K N + D+ VFGGSY G L+AWFRLK+PH+ G+ ASS +L + Y+F++
Sbjct: 163 NAKYNRSGADNSWFVFGGSYSGALSAWFRLKFPHLTCGSHASSGVVL------AVYNFTD 216
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAF--RICKSEKNLAIESWLST 289
Q S C + ++ + K ++ + G +++ F R+ +++ + +
Sbjct: 217 FDKQIGESAGPECKEALQETTKLVD--GQLQSGRNSVKQLFGARMLQNDGDFLYLLADAA 274
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--NYSGT 347
A + Y P +PL ++ K G D+ + YY + +
Sbjct: 275 AIAF----QYGNPDILCSPL-----------VEAKKNGTDLVEAFAHYVNKYYVGTFGAS 319
Query: 348 AKCFDL----NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEA 403
+D N W +Q C+E+ DS+ ++ D CK
Sbjct: 320 VASYDQQYLKNTTPAESSYRLWWYQVCSEVSYFQVAPKNDSV-RSAKIDTRYHLDLCKNV 378
Query: 404 YGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE 461
+G P+ T +GG +I S I+F NG +DPW K+ + L+E
Sbjct: 379 FGEGVYPDVSMTNLYYGGTRIA------GSKIVFANGSQDPWRHASKQKSSKEMPSYLIE 432
Query: 462 KEGAHH-------------VDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
H ++ S P+ + VR++ V+ I W+S+
Sbjct: 433 CSNCGHCSDLSGCPQAPSNIEGDSSKCSSPEAVNKVRKQIVDNIDLWLSE 482
>gi|240254469|ref|NP_179399.5| serine carboxypeptidase S28-like protein [Arabidopsis thaliana]
gi|322518656|sp|Q1PF50.2|EDA2_ARATH RecName: Full=Probable serine protease EDA2; AltName: Full=Protein
EMBRYO SAC DEVELOPMENT ARREST 2; Flags: Precursor
gi|330251630|gb|AEC06724.1| serine carboxypeptidase S28-like protein [Arabidopsis thaliana]
Length = 489
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 135/517 (26%), Positives = 219/517 (42%), Gaps = 89/517 (17%)
Query: 26 FPTFPSSRITPEKLSSLISSSKDSQGLYKTK---YHTQILDHFNYNPQSYQTFQQRYLIN 82
F T + + P +S +S S+ Y T + Q LDH +P ++ F+QRY
Sbjct: 19 FSTLSHALLHPSSVSHNVSRSR----YYMTTNELWFNQTLDH--ESPNDHRKFRQRYYEF 72
Query: 83 DTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGG 142
++ + P+F+ EG A + ++ +A KF+A +V +EHRYYGKS P+
Sbjct: 73 MDYFRSP--DGPMFMIICGEGPCSGIAND--YINVLAKKFQAGVVSLEHRYYGKSSPFNS 128
Query: 143 NKEIAYKNASTTGYLSSTQALADYASL------IIDLKKNLTA--TDSPVVVFGGSYGGM 194
+A +N YLSS QAL D AS ++ K N+++ +D+P FG SY G
Sbjct: 129 ---LATENLK---YLSSKQALYDLASFRQYYQESLNKKLNISSGGSDNPWFFFGISYSGA 182
Query: 195 LAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQ 254
L+AWFRLK+PH+ G+LASSA + + Y FS Q S + C ++ + K
Sbjct: 183 LSAWFRLKFPHLTCGSLASSAVVR------AIYEFSEFDQQIGESAGQECKLALQETNKL 236
Query: 255 IEETAKKPGGLEKLQKAFRICKSEKNLAIES--WLSTAFVYTAMTDYPTPSNFLNPLPAF 312
+E GL+ KA + + L +++ TA Y P PL
Sbjct: 237 LEL------GLKVKNKAVKSLFNATELDVDADFLYLTADAAVMAFQYGNPDKLCVPL--- 287
Query: 313 PVKEMCKAIDDPKTGNDVFA-----------KLYGAASVYYNYSGTAKCFDLNGDSDPHG 361
++ K G+D+ +++G YN + DS
Sbjct: 288 --------VEAKKNGSDLVVTYSTYVREYCMRIWGLRVRTYNRKHLRNTV-VTADS---A 335
Query: 362 LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD--PRPNWITTEFGG 419
W +QACTE+ DS+ + + CK +G D P+ + +GG
Sbjct: 336 YRLWWFQACTELGYFQVAPKYDSV-RSHQINTTFHLDLCKSLFGKDVYPKVDATNLYYGG 394
Query: 420 HKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE-KEGAHHVDLRFSTKED- 477
++ A+ IIF NG DPW + + +++ + H D+R +
Sbjct: 395 DRLA------ATKIIFTNGSEDPWRHASKQNSTHEMPSYIIKCRNCGHGSDIRGCPQSPM 448
Query: 478 -----------PQWLKDVRRREVEIIGKWISQYFQDL 503
P ++ VR++ VE I W+S+ Q +
Sbjct: 449 VIEGKSNNCSLPDYVNKVRQQMVEHIDLWLSECRQSI 485
>gi|297735044|emb|CBI17406.3| unnamed protein product [Vitis vinifera]
Length = 487
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 131/473 (27%), Positives = 198/473 (41%), Gaps = 73/473 (15%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q +DHF+ P + F QRY ++ + PIF+ E + + ++ +A
Sbjct: 45 QTVDHFS--PLDHSKFPQRYYEFTDYF--RLPDGPIFLKICGEASCDGIPND--YISVLA 98
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL---------- 169
KF A +V +EHRYYGKS P+ + + +N YLSS QAL D A
Sbjct: 99 KKFGAAVVSLEHRYYGKSSPF---RSLRTENLK---YLSSKQALFDLAVFRQYYQAKVVP 152
Query: 170 ---IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSP 226
+++K N + ++P VFG SY G L+AWFRLK+PH+ G+LASSA +L +
Sbjct: 153 IGESLNVKVNRSNVENPWFVFGVSYAGALSAWFRLKFPHLTCGSLASSAVVL------AI 206
Query: 227 YSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESW 286
Y+F+ Q S C V++ +E+ G A + +L I++
Sbjct: 207 YNFTEFDRQIGESAGAECKAVLQEVNGLVEQRLAVDG------NAVKSLFGAASLKIDA- 259
Query: 287 LSTAFVYTAMTDYPTPSNFLNP-LPAFPVKEMCKAIDDPKTGNDVFAKLY-----GAASV 340
F+Y T + NP + P+ + K +D + + K Y G +
Sbjct: 260 ---DFLYLLADAAVTAFQYGNPDMLCSPLVQAKKDGEDLENAYATYVKEYYLGTFGVSIE 316
Query: 341 YYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
YN + SD W +Q CTE+ + DS+ SE + C
Sbjct: 317 TYNQQHLKRTNSAGDTSD----RLWWFQVCTEVAFFQVAPSDDSM-RSSEINTKYHLDLC 371
Query: 401 KEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
K +G P+ T +GG KI S IIF NG +DPW L +
Sbjct: 372 KNVFGNGIYPDVDATNIYYGGTKIA------GSKIIFTNGSQDPWRHASKLTSTPDMPSF 425
Query: 459 LVEKEGA-HHVDLR------FSTKED------PQWLKDVRRREVEIIGKWISQ 498
L+ H DLR S + D P + VR++ +E I W+SQ
Sbjct: 426 LISCHNCGHGTDLRGCPQSPLSPEGDAKNCSSPDAVHKVRQQIIEHIDLWLSQ 478
>gi|307184675|gb|EFN71004.1| Putative serine protease F56F10.1 [Camponotus floridanus]
Length = 418
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 104/422 (24%), Positives = 191/422 (45%), Gaps = 41/422 (9%)
Query: 100 GNEG--DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYL 157
G EG +++W + Y A +F A+ ++EHR+YG S P +++ KN YL
Sbjct: 3 GAEGIANVKWMVEGQWIEY--AKEFGAMCFYLEHRFYGNSHP---TPDLSVKNLI---YL 54
Query: 158 SSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP 216
+S QALAD A I ++ + ++ +VFGGSYGG LAAW R+KYPH+ GA+++S P
Sbjct: 55 NSQQALADLAYFIQNINIEYKFSNNTKWIVFGGSYGGSLAAWMRIKYPHLVHGAVSTSGP 114
Query: 217 ILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
+L + + ++ + S+ C I +++++ + G +K++K F++C
Sbjct: 115 LL---AQIDFQEYFVVVANALKDYSQKCVDTIAEAYRELGILLRHVGSQQKIEKKFKLCD 171
Query: 277 -----SEKNLAIESWLST-AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV 330
K L I + T A + ++ Y + + ++ +C + D K G +
Sbjct: 172 PIDPGHTKKLDISNLYETLADNFASIVQYNKDNRQSSQTLNITIENVCDILVDEKIGIPI 231
Query: 331 FAKLYGAASVYYNYSGTAKCFDLNGD-------------SDPHGLSEWGWQACTEMIMLT 377
+L +++ N + KC D D G +W +Q CTE
Sbjct: 232 -DRLAYVSNMILN-ATKEKCLDYRYDKMIRELRNTTWTNEQAEGGRQWMYQTCTEFGFFQ 289
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKI---GLVLKRFASNII 434
+ ++F + + + C + +G + + + I GL LK +N++
Sbjct: 290 TSTAQPNLFSNNFP-VNFFVQQCTDIFGPRYNIDLLNSAVTRTNILYGGLNLK--VTNVV 346
Query: 435 FFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGK 494
F +G DPW G+ + + A+ AH ++ S++ D LK R + +I +
Sbjct: 347 FVHGSIDPWHVLGITTSSNPQAPAIYIDGTAHCANMYPSSENDMPQLKKARIQIKNLIKE 406
Query: 495 WI 496
W+
Sbjct: 407 WL 408
>gi|125778538|ref|XP_001360027.1| GA17650 [Drosophila pseudoobscura pseudoobscura]
gi|54639777|gb|EAL29179.1| GA17650 [Drosophila pseudoobscura pseudoobscura]
Length = 473
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 118/473 (24%), Positives = 206/473 (43%), Gaps = 54/473 (11%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+ S+++ + +T + Q LDHF+ + + +T+Q RY++ND + ++ P+F+Y G E
Sbjct: 35 LPSNQNRADVVQTLWIEQKLDHFDEDEK--RTWQMRYMLNDALY---QSGGPLFIYLGGE 89
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
+I G +YD+A + ALL + EHRYYG+S P +++ +N YL+ QA
Sbjct: 90 WEISAGRITGGHIYDMAKEHNALLAYTEHRYYGESKPL---PDLSNENIQ---YLNVRQA 143
Query: 163 LADYASLIIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFD 221
L D A I LK ++S V++ GGSY + WF+ +P + G ASSAP+
Sbjct: 144 LEDLAVFIRTLKATHEGLSESKVIIVGGSYSATMVTWFKKVHPDLVAGGWASSAPLFAKV 203
Query: 222 NIVSPYSFSNIITQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN 280
N V + I Q + + CY I+ ++E G E ++ ++C +
Sbjct: 204 NFV---EYKEITGQSIALMGGSACYNRIESGIAEMETMFATKRGAE-VKALLKLC---ER 256
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV 340
+ S L +++ ++D + + A ++ +C+ I ND+ V
Sbjct: 257 FDVYSDLDVWTLFSEISDL--FAGVVQTHNAGQIEAVCQKI--MSGSNDLIGVASYLLDV 312
Query: 341 YYNYSGTAKCFDLNGDS----------DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESE 390
+ G KC +L+ D+ + + +W +Q C E D+K F ++
Sbjct: 313 FSESGG--KCNELSYDAILSQLLDTSYTGNIMRQWIFQTCNEYGWYQTSDSKAQPF-GTK 369
Query: 391 EDYDARARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
C + YG +D R FGG G+ N+ +G DPW
Sbjct: 370 FPVALYTTMCGDIYGSQYSNEFIDSRVAATNDYFGGWTPGV------ENVYLTHGHLDPW 423
Query: 444 SGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
G+ TV+ E AH D + D ++ + R E++ +WI
Sbjct: 424 RAMGIQDEAQATVI----PEYAHCKDFNSISDTDTAEMRASKERIAELVREWI 472
>gi|324510461|gb|ADY44374.1| Serine protease [Ascaris suum]
Length = 529
Score = 122 bits (306), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 130/477 (27%), Positives = 201/477 (42%), Gaps = 52/477 (10%)
Query: 45 SSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD 104
SS +T Y TQ +DH N T+QQR N + +N + IF+ G E
Sbjct: 28 SSDQFAATIETFYVTQPIDHSN---PPLGTWQQRVQYNPRFY---RNESIIFLLIGGESP 81
Query: 105 I--EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
+W AQ A K+ A + +EHR++GKS PY K S+ Y + QA
Sbjct: 82 AAEKWVAQPNITYLRWAEKYGAAVFQLEHRFFGKSRPYNDLK------TSSLKYCTVDQA 135
Query: 163 LADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDN 222
L D AS I + + V FGGSY G L+AWF+++YP + +GA+ASSAP L F
Sbjct: 136 LEDLASFIRQMNAKYGYVNPRWVTFGGSYPGSLSAWFQVRYPDLTVGAVASSAP-LTF-- 192
Query: 223 IVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI--CKSEKN 280
++ Y ++ ++ R+ S C++ I + I A G E+L ++ +E
Sbjct: 193 LLDYYGYAMVMENVIRNTSAECHEKIGNAITVILNKALTVAGREELSTKLKLKPAFNETT 252
Query: 281 LAIESWLS-TAFVYTAMTD---YPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYG 336
L + + A+++ + + Y + + F V+ MC AI T D ++
Sbjct: 253 LTVRDLHNMMAYLFGGLQNVVQYTYDARNSITMGGFNVRNMCNAITS-STSTDPVIQMRA 311
Query: 337 AASVYYNY----SGTAK-------------CFDLNGDSDPHGLSEWGWQACTEMIMLTGG 379
Y + GT FD S+ + W W C E+ +L
Sbjct: 312 IIDWIYTFYPSDDGTIANRYSDLIGLLSNTTFDDENGSENAAMRGWMWLCCNELGVLQTT 371
Query: 380 DNKDSIFEESEEDYDARARYCKEAYG-----VDPRPNWITTEFGGHKIGLVLKRFASNII 434
D+ +IF + C +A+G V R N + ++ G A NI+
Sbjct: 372 DHGRNIFGNMLP-LNYFIDICIDAFGDTVNIVSIRDNNLAFR---NRYGDANNYKAKNIV 427
Query: 435 FFNGLRDPWSGGGVLKNISKTVLALVEKEG-AHHVDLRFSTKEDPQWLKDVRRREVE 490
NG DPW G +N + + EG AH D+ + E+P L V R E+E
Sbjct: 428 LPNGSFDPWHPLGTYENYPELHQKAILIEGTAHCADMYPAWSEEPSTLAPV-RAEIE 483
>gi|291410769|ref|XP_002721683.1| PREDICTED: protease, serine, 16 [Oryctolagus cuniculus]
Length = 505
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 75/186 (40%), Positives = 107/186 (57%), Gaps = 16/186 (8%)
Query: 52 LYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN 111
L K + Q LD FN + + ++F QRY +ND HW G + P+F++ G EG + +
Sbjct: 52 LPKVGWLEQPLDPFNTSDR--RSFLQRYWVNDQHWAG--QDGPVFLHLGGEGSLGPGSVM 107
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
TG +AP + AL++ +EHR+YG S+P GG + +LSS AL D AS +
Sbjct: 108 TGHPAALAPAWGALVIGLEHRFYGLSLPAGGLDLAQLR------FLSSRHALTDAASARL 161
Query: 172 DLKKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFD--NIVS 225
L + L ++ SP V FGGSY G LAAW RLK+PH+ ++ASSAP+ L+F N V
Sbjct: 162 ALSRLLNVSSSSPWVCFGGSYAGSLAAWARLKFPHLFFASVASSAPVRATLDFSEYNDVV 221
Query: 226 PYSFSN 231
S +N
Sbjct: 222 SRSLTN 227
>gi|167533847|ref|XP_001748602.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772843|gb|EDQ86489.1| predicted protein [Monosiga brevicollis MX1]
Length = 459
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 117/477 (24%), Positives = 205/477 (42%), Gaps = 52/477 (10%)
Query: 49 SQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG----- 103
S L ++ TQ LDHF+ + + T+QQ Y +N T++ + ++AP+++ G EG
Sbjct: 3 SSSLPAPRFVTQRLDHFDGSDTT--TWQQAYYVNSTYFQ-AGSDAPVYLCVGGEGPPLDG 59
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG---YLSST 160
+ + + ++ PK A++ +EHRYYG +N G YLSS
Sbjct: 60 SVVVASVHCNVAVELLPKTGAIMFALEHRYYGCH----NMSACPVENPLAKGALRYLSSR 115
Query: 161 QALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---I 217
QAL D A+ I +++ ++ +V FGGSY GMLA W RLKYPH+ ++ASSAP +
Sbjct: 116 QALGDLAAFISYIRQQYNLPNNKIVTFGGSYPGMLAGWARLKYPHLVHASVASSAPVEAV 175
Query: 218 LNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS 277
L+ +F+ ++ + S+ C I I + G L F + S
Sbjct: 176 LDMRGYYDVTAFAYSVSDNNVGGSDACRAAIATGHATIGQYFNSSSGRNTLANIFGLPAS 235
Query: 278 --EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPA-FPVKEMCKAIDDPKTGNDVF--- 331
EK S+ Y +P+ SN + A + +C+ + + G++V
Sbjct: 236 YFEKYDNQASFAGGGVAY-----FPSQSNDPSCTQAGCNINLICQVMTNTSLGDEVHRLA 290
Query: 332 ----AKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIML-TGGDNKDSIF 386
+L + + +++ K + ++D WG+Q CTE T D F
Sbjct: 291 VVRKQQLEWLPAAFESFA--TKTLRVGAEAD-----YWGYQTCTEFAFYQTCEVGSDCFF 343
Query: 387 EESEEDYDARARYCKEAYGVD---PRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRD 441
+ +A C+ +G+D + N I + +GG S +++ NG D
Sbjct: 344 TQGYLTLNATEAACQAEFGIDFTTVQQNVIASNAWYGGRNSA------GSCLMYPNGEVD 397
Query: 442 PWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
PW+ +L + + L+ +HH S D + R+ + + ++Q
Sbjct: 398 PWNSQSILNTTAPGITTLMVPGASHHAWTHPSAPSDQPSVVAARKTIADFVSNALNQ 454
>gi|432892297|ref|XP_004075751.1| PREDICTED: thymus-specific serine protease-like [Oryzias latipes]
Length = 460
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 126/454 (27%), Positives = 195/454 (42%), Gaps = 73/454 (16%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + Q LDHFN +TF Q++ +N+ HW + + P+F++ G EG I F G
Sbjct: 52 KESWLEQPLDHFNR--LKGKTFSQKFFVNEAHW--QRPDGPVFLFIGGEGPIFSFDVLAG 107
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL--II 171
++A + ALL+ +EHR+YG SI G K + LSS QALAD A+ I
Sbjct: 108 HHVEMAQQHGALLLAVEHRFYGDSINPDGLK------TESLADLSSQQALADLATFHGYI 161
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFD--NIVSP 226
NL++ + P + FGGSY G L+AWFR K+P + A+ASSAPI L+F N V
Sbjct: 162 CRSFNLSSRN-PWISFGGSYSGALSAWFRGKFPSLVYAAVASSAPIKAKLDFSDYNHVVG 220
Query: 227 YSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL--AIE 284
S N+ SE C+ ++ ++ +E G L ++ F C+ KNL IE
Sbjct: 221 LSLKNVAV----GGSEKCWAQVQQAFAAVEAELLT-GNLSQVAGDFNCCQIPKNLDDQIE 275
Query: 285 SWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNY 344
+ A ++ Y L+ VKE+C D+ + G Y
Sbjct: 276 LMQNLADLFMGTVQYNEEGVLLS------VKELC----------DLMTNVSGEDKDMEAY 319
Query: 345 SGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
S K DS QA T++ ++F S+ + + Y
Sbjct: 320 SRLIKLTQTCEDSTCPFSGLINLQAQTKLCT--------AVFGISQHSLPSNIAFTNSYY 371
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
G D E H+ +++ NG DPW V++ + V ++ ++
Sbjct: 372 GGD--------EPHTHR-----------VLYINGGIDPWKELSVVQG-GQEVQSVFIEDT 411
Query: 465 AHHVDL--RFSTKEDPQWLKDVRRREVEIIGKWI 496
AH D+ R K D L+ R+ + + W+
Sbjct: 412 AHCADMSSRRVVKRDS--LRRARQEIEKQVSDWL 443
>gi|307108517|gb|EFN56757.1| hypothetical protein CHLNCDRAFT_144219 [Chlorella variabilis]
Length = 303
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 114/214 (53%), Gaps = 25/214 (11%)
Query: 74 TFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRY 133
T++QRY + D +W APIF Y GNEG++ NTG M++ A F ALLVF EHRY
Sbjct: 79 TWRQRYFLCDQYWDREDPYAPIFFYAGNEGNVANGVNNTGLMWERAQAFGALLVFAEHRY 138
Query: 134 YGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGG 193
YG S P+G + + + +LS QA+ DY + + LK +L AT +PVV FGGSYGG
Sbjct: 139 YGNSWPFGKEESLTLEGLQ---FLSMEQAIEDYVTFLNWLKISLNATSAPVVAFGGSYGG 195
Query: 194 MLAAWFRLKYPHVAIGALASSAPILNF---DNIVSPYSFSNIITQDFRSV---------- 240
+L A R P A++SSAP+ + D P S+ ++T+D
Sbjct: 196 VLVAIMRATRPSSVQAAVSSSAPMRGWLLQDGGYDPGSYWEVVTRDASPAAGADPACVPN 255
Query: 241 SENCYKVI--------KGSWKQIEETAK-KPGGL 265
S+ + +I + SW+Q+E+ + PG +
Sbjct: 256 SQRLFPLIISLIESGKEASWRQVEQGLRICPGAI 289
>gi|195158018|ref|XP_002019891.1| GL12646 [Drosophila persimilis]
gi|194116482|gb|EDW38525.1| GL12646 [Drosophila persimilis]
Length = 473
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 118/473 (24%), Positives = 206/473 (43%), Gaps = 54/473 (11%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+ S+++ + +T + Q LDHF+ + + +T+Q RY++ND + ++ P+F+Y G E
Sbjct: 35 LPSNQNRADVVQTLWIEQKLDHFDEDEK--RTWQMRYMLNDALY---QSGGPLFIYLGGE 89
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
+I G +YD+A + ALL + EHRYYG+S P +++ +N YL+ QA
Sbjct: 90 WEISAGRITGGHIYDMAKEHNALLAYTEHRYYGESKPL---PDLSNENIQ---YLNVRQA 143
Query: 163 LADYASLIIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFD 221
L D A I LK ++S V++ GGSY + WF+ +P + G ASSAP+
Sbjct: 144 LEDLAVFIRTLKATHEGLSESKVIIVGGSYSATMVTWFKKVHPDLVAGGWASSAPLFAKV 203
Query: 222 NIVSPYSFSNIITQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN 280
N V + I Q + + CY I+ ++E G E ++ ++C +
Sbjct: 204 NFV---EYKEITGQSIALMGGSACYNRIESGIAEMETMFATKRGAE-VKALLKLC---ER 256
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV 340
+ S L +++ ++D + + A ++ +C+ I ND+ V
Sbjct: 257 FDVYSDLDVWTLFSEISDL--FAGVVQTHNAGQIEAVCQKI--MSGTNDLIGVASYLLDV 312
Query: 341 YYNYSGTAKCFDLNGDS----------DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESE 390
+ G KC +L+ D+ + + +W +Q C E D+K F ++
Sbjct: 313 FSESGG--KCNELSYDAILSQLLDTSYTGNIMRQWIFQTCNEYGWYQTSDSKAQPF-GTK 369
Query: 391 EDYDARARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
C + YG +D R FGG G+ N+ +G DPW
Sbjct: 370 FPVALYTTMCGDIYGSQYSNEFIDSRVAATNDYFGGWTPGV------ENVYLTHGHLDPW 423
Query: 444 SGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
G+ TV+ E AH D + D ++ + R E++ +WI
Sbjct: 424 RAMGIQDEAQATVI----PEYAHCKDFNSISDTDTAEMRASKERIAELVREWI 472
>gi|66801433|ref|XP_629642.1| hypothetical protein DDB_G0292476 [Dictyostelium discoideum AX4]
gi|60463021|gb|EAL61217.1| hypothetical protein DDB_G0292476 [Dictyostelium discoideum AX4]
Length = 485
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 120/459 (26%), Positives = 208/459 (45%), Gaps = 60/459 (13%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
Q +DHFN +TF QR+++N +W G+ P+F E ++E + N+
Sbjct: 52 VQKVDHFNLLDD--RTFFQRFVVNSKYWNGT---GPVFFIISGEQNMEASSVNSCQYTIW 106
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL- 177
A + AL+V +EHRYYG S ++++ N YL++ QALAD I K
Sbjct: 107 AKQLNALIVSLEHRYYGGSYV---TEDLSTDNLK---YLTTQQALADCVVFIDWFTKVYY 160
Query: 178 -TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
+ S ++ FGGSY G L+A+ +KYP ++ASSAP+ + +V+ Y + +I +
Sbjct: 161 HVPSSSKIISFGGSYAGTLSAYLAMKYPSKISFSVASSAPL---NPVVNFYQYMEVIQKS 217
Query: 237 FRSVS--ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYT 294
++ E C IK + +I E P + K F +C N+ ++ LST F++
Sbjct: 218 ILLLNNGEKCLNNIKLANNKIIEMIHDPILTYNITKLFGLC---SNIDFDNDLST-FMFE 273
Query: 295 AMTDYPTPSNFLNPLPAF-PVKEMCK-AIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFD 352
+ T + + N +P + + +C +DD K D + LY ++Y + +C D
Sbjct: 274 IANVWGTAAQYGNLVPGYISLDSLCNIMVDDSKEPLDNY--LY----IWYGMKNSDECND 327
Query: 353 -----------------LNGDSDPHGLS-EWGWQACTEMIMLTGGDNKDSIFEESEEDYD 394
LN ++ ++ +W +Q CTE D+ D F + +++
Sbjct: 328 VTYQTMIANFKYSQIDHLNTRNELFNMTRQWLFQCCTEFGFFITSDSYDQPF--TNFNFN 385
Query: 395 ARARYCKEAYGVDP--RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
+ + C + +G P +W E+GG R N++F + DPWS + K
Sbjct: 386 FQRQICIDVFGKKPTLSTSWTLVEYGGISPNYNSVR---NVLFVSSTNDPWSSLSISK-- 440
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEI 491
S ++ + G H D+ + + DV R + EI
Sbjct: 441 SNQYKIVIVENGTHCSDM---IPINEVSVPDVARAQNEI 476
>gi|308448682|ref|XP_003087721.1| hypothetical protein CRE_28002 [Caenorhabditis remanei]
gi|308253266|gb|EFO97218.1| hypothetical protein CRE_28002 [Caenorhabditis remanei]
Length = 513
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 118/439 (26%), Positives = 187/439 (42%), Gaps = 49/439 (11%)
Query: 95 IFVYTGNEGDI------EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAY 148
+F+ G EG I +W M A +F A +EHR+YG G I
Sbjct: 58 VFLMLGGEGSINGTNGDKWVRHEAETMMVWAAEFGAGAFQVEHRFYGSK----GFCPIGD 113
Query: 149 KNASTTGYLSSTQALADYASLIIDLKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVA 207
+ + L+ QALAD I + D P+ + FGGSY G L+A+FR YP +
Sbjct: 114 QTTESLKLLTIDQALADIKEFINQMNALYFPLDKPIWITFGGSYPGSLSAFFRETYPEMT 173
Query: 208 IGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETA-KKPGGLE 266
GA++SS+ + F V Y ++ + +R+VS++C VIK +++Q+++ A P E
Sbjct: 174 AGAVSSSSAVHVF---VDYYGYAINTEKTYRTVSDSCGDVIKTAFQQMQKKAYNGPDSRE 230
Query: 267 KLQKAFRICKS--EKNL--AIESWLSTAFVYTAMTDYPTPSNFLNPL-PAFPVKEMCKAI 321
L+ F +C S E NL +I+ + + Y + T N N V C +
Sbjct: 231 LLKTTFNLCDSFDENNLSKSIQFFFQNVYGYFQGINQYTGDNRNNATRSGLGVPGACDIL 290
Query: 322 DDPKTGNDVFAKLYGAASVYYN-YSGTAKCFDLNG--------------DSDPHGLSEWG 366
++P G+++ ++ S Y + YS +A N D D W
Sbjct: 291 NNPTLGDEI-TRVVAVMSWYDSWYSPSATGCRPNSYTSFIKYYSDTTMPDDDRISTRSWI 349
Query: 367 WQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD-------PRPNWITTEFGG 419
WQ CTE+ D + S D A C + +G + + + T++GG
Sbjct: 350 WQTCTELGYYQTTDGGNGGIFGSTVPLDFFADQCIDLFGPEYTLDNTFKLIDQVRTKYGG 409
Query: 420 HKIGLVLKRFASNIIFFNGLRDPWSG-GGVLKNISKTVLALVEKEGAHHVDLRFSTKEDP 478
+N+ F NG DPW G N + V + + AH D+ + D
Sbjct: 410 AD-----AYRGTNVCFPNGSFDPWQDLGHKANNTNNNVDSWLIDGTAHCADMYPARDSDK 464
Query: 479 QWLKDVRRREVEIIGKWIS 497
Q LKD RRR + + +W+S
Sbjct: 465 QSLKDARRRIHDHLSRWLS 483
>gi|328872043|gb|EGG20413.1| peptidase S28 family protein [Dictyostelium fasciculatum]
Length = 466
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 128/468 (27%), Positives = 212/468 (45%), Gaps = 55/468 (11%)
Query: 47 KDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE 106
KDS + ++ Q L+HF+ Q +TF QRY ND ++ SK PI +Y EG +
Sbjct: 38 KDSSS-FPIYWYNQTLNHFD--AQDSRTFMQRYYTNDAYYDYSKG-GPIILYINGEGPVS 93
Query: 107 W--FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALA 164
+ Q G + A A +V +EHRYYG S P+ ++++ +N +LSS QAL
Sbjct: 94 SPPYQQGDGVVV-YAQALGAYIVTLEHRYYGDSSPF---EDLSTENLK---FLSSRQALN 146
Query: 165 DYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFD 221
D A I D +KNL+ + + VV GGSY G L+AWFR+KYPH+ +G+++SS IL+F
Sbjct: 147 DLAVFISDFRKNLSLS-TEVVTIGGSYSGALSAWFRVKYPHITVGSVSSSGVVNAILDFT 205
Query: 222 NIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL 281
+F + + + C ++ +++E+ A G +++++ F ++
Sbjct: 206 ------AFDEWVA---YAAGDECATAMRQVTQEVEQ-AYFGGQADEIRQIFNAETLVEDG 255
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKT----GNDVFAKLYGA 337
WL+ + Y S +PL V M KT +V+ G
Sbjct: 256 DFFFWLADSNA--EGIQYGYHSQLCDPL----VDAMNNGTSLIKTYALYTANVWTGSLGT 309
Query: 338 ASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARA 397
+ Y D+N +D W +Q CTE ++SI S +
Sbjct: 310 PAEYATAWQQNTTHDINNSAD----RLWLYQTCTEFGYWQNAPAENSI-RSSIVNMTYWR 364
Query: 398 RYCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI--S 453
+C++ +G+ P+ T +GG++ +NIIF N +DPWS ++ S
Sbjct: 365 DHCEQVFGIALWPDVEATNEYYGGNQTA------GTNIIFVNSSQDPWSRASIITQQYPS 418
Query: 454 KTVLALVEKEGAHHVDLRFSTK---EDPQWLKDVRRREVEIIGKWISQ 498
+ V + H D+R + P L VR+ ++ I W++Q
Sbjct: 419 EPVAMVTCGNCGHCSDIRVDCPGGCDTPNNLAQVRQVTLQAIQSWLAQ 466
>gi|344292458|ref|XP_003417944.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 522
Score = 121 bits (303), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 135/524 (25%), Positives = 217/524 (41%), Gaps = 96/524 (18%)
Query: 47 KDSQGLYKTKYH------TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTG 100
+ ++G ++TK Q LDHF+ N S Q + QRY IND + K P+F+ G
Sbjct: 23 RRTRGAHQTKVMYPNFSFQQKLDHFSEN--SSQFWPQRYFINDAFY---KPGGPVFLMVG 77
Query: 101 N--EGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLS 158
W + N ++ A + AL + +EHR+YG S P G + ++ YLS
Sbjct: 78 GVWTASESWLSINKTWV-TYAQRLGALFLLLEHRFYGYSQPTGD------LSTASLQYLS 130
Query: 159 STQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL 218
S QALAD A+ + + + T++ V FG S G LA W R+K+P + A+ SSAPI
Sbjct: 131 SRQALADIANFRTQIAEKMGLTENKWVAFGCSSAGSLAVWSRIKHPELFAAAVGSSAPIQ 190
Query: 219 NFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKP---GGLE---KLQKAF 272
N Y + I+ + + + NC++ +K ++ QI + P G LE KL K
Sbjct: 191 AKANF---YEYLEIVQRSLATHNSNCFQAVKEAFGQIVKMLNLPRYYGKLENDFKLCKPM 247
Query: 273 RIC-----------------------KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPL 309
++ KS KN + +ST + M P +
Sbjct: 248 KLYSAMDKATFLDLLMFPVKIAVQYNKSIKNNKVGIVIST---FKDMDPDPQKGQVIGLQ 304
Query: 310 PA-------------------FPVKEMCKAIDDPKTGNDVFAKLYGAASVY--------- 341
F + E+C + + G+ + L A +Y
Sbjct: 305 SHRQNEAEPRCKTVLQGEQIFFSMDELCDTMTNTSLGSPYYRYLRIARLIYKHEYLLCFP 364
Query: 342 YNYSGTAKCF-DLNGDSDPHGLS-EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
+Y + + D + + D G+ ++ +Q CTE D+K+ F Y +
Sbjct: 365 AHYRRKLEVYLDSSINYDNPGIDRQFFYQCCTEFGFFHTTDSKNQPFTGMPLRYFVQQ-- 422
Query: 400 CKEAYGVDPRPNWITTEFG-----GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
C + +G P+ N+ + G H G + S IIF +G DPW G+ K+ISK
Sbjct: 423 CSDFFG--PQFNYDSLNMGVLSTNAHYGGFNVT--GSKIIFSSGSFDPWHVLGITKDISK 478
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ A+ K G H D+ D L R + I+ KW+ +
Sbjct: 479 DLPAVFIKGGVHCADVFEQKDTDSAELIQAREKIFRILRKWLKK 522
>gi|52789079|gb|AAT09104.1| serine peptidase [Bigelowiella natans]
Length = 546
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 119/448 (26%), Positives = 189/448 (42%), Gaps = 80/448 (17%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y +LDHF + + QRY ++ + WGG P+F+Y G EG + ++ FMY
Sbjct: 59 YKNALLDHFG-GLSDEKHWLQRYYVDSSQWGG--EGYPVFLYIGGEGP-QGPVSSSLFMY 114
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK- 175
++A + KAL++ +EHR+YG+S P + K +L+S QAL D A + +K
Sbjct: 115 ELAVEHKALVLALEHRFYGESRPVEDMSDANLK------FLTSHQALGDLARFVEYIKAY 168
Query: 176 ---------------NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL-N 219
+A +SP V FGGSY G LAAWF+LKYP V IG++ASSAP+
Sbjct: 169 DPNVNDAKSSPPLSLPASAQESPFVAFGGSYPGNLAAWFKLKYPSVVIGSVASSAPVFAE 228
Query: 220 FDNIVSPYSFSNIITQDFRSVSENCYKVIK---GSWKQIEETAKKPGGLEKLQKAFRICK 276
+D ++ S+ CY ++ + K + ++ G +K+ R C
Sbjct: 229 YDFAEYGGVVGRALSYPLIGGSDQCYSAVEKAVTTLKTLLDSTTPAGSSDKIPSYLRPCS 288
Query: 277 SEKN-LAIESWLSTAF-VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL 334
L + ++ + F + + Y N N P V ++C A+ D +D+ +L
Sbjct: 289 PIGGPLDLATYEAQIFGAFQGVVQY----NLENRPPY--VSDLCTAMTDGNDDDDILLRL 342
Query: 335 YGAASVYYNYSGTAKC----------------FDLNG-DSDPHGLSEWGWQACTEM--IM 375
+ Y G C F G D + +W +Q+C E
Sbjct: 343 VKTLKLVY---GGVTCMPSSFEKSVAPLQDAQFSQAGCDLSCSSMRQWIYQSCHEFGYFQ 399
Query: 376 LTGGD--NKDSIFEESEEDYDARARYCKEAYGVDPRPNW-----------ITTEFGGHKI 422
T GD N + F+ + +A A K AY + ++ T +G +
Sbjct: 400 TTTGDKMNPFAAFDTVTAE-NAGAAIRKAAYNLSASVDYAGPAANAEGLVANTAYGARNL 458
Query: 423 GLVLKRFASNIIFFNGLRDPWSGGGVLK 450
A NI NG DPW G++
Sbjct: 459 A------AHNITAVNGNMDPWHSLGIVN 480
>gi|19528413|gb|AAL90321.1| RE11624p [Drosophila melanogaster]
Length = 480
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/226 (34%), Positives = 114/226 (50%), Gaps = 17/226 (7%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+ K+ TQ LD N+N + QT+Q RYL+ND +PIF+Y G E +IE + G
Sbjct: 55 QEKWITQKLD--NFNASNTQTYQMRYLLNDEF---QTEGSPIFIYLGGEWEIEESMVSAG 109
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
YD+A + +LV+ EHRYYG+SIP + + YL QALAD A I
Sbjct: 110 HWYDMAQEHNGVLVYTEHRYYGQSIP------TSTMSTEDLKYLDVKQALADVAVFIETF 163
Query: 174 K-KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
K +N +S V++ GGSY + WF+ YP + +G ASSAPIL V + +
Sbjct: 164 KAENPQLANSKVILAGGSYSATMVVWFKRLYPDLIVGGWASSAPIL---AKVDFTEYKEV 220
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS 277
+ Q F + + CY I+ ++E G E + R+C S
Sbjct: 221 VGQAFLQLGGQKCYDRIENGIAELESMFANKRGAEA-RAMLRLCNS 265
>gi|123418467|ref|XP_001305332.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121886845|gb|EAX92402.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 504
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 119/457 (26%), Positives = 202/457 (44%), Gaps = 56/457 (12%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV- 118
Q LDHF+ +TF+QRY IN + SKN + VY G E + + YDV
Sbjct: 32 QKLDHFS---DLAETFKQRYYINTNYSKKSKN---LVVYIGGEAPL----LESSLKYDVQ 81
Query: 119 --APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
A K++++ +EHRY+G+SIP+G + +K YL+ QA+ D A+ I +K+N
Sbjct: 82 HIASVTKSVILALEHRYFGESIPHGNLELENFK------YLTVDQAIEDLANFITQMKQN 135
Query: 177 LT--ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
A+ ++ GGSY G L++ FR K+P + +G+ ASSAPI + +N +
Sbjct: 136 YCQDASKCKALMVGGSYPGALSSRFRQKHPELTLGSWASSAPIHSQNNFS---EYDKHEA 192
Query: 235 QDFRSVSENCYKVIKGSWKQIEE-TAKKPGGLEKLQKAFRICKSEK--NLAIESWLSTAF 291
+D++ CY ++K IE T K E++ + F + K + N +++ +
Sbjct: 193 EDYKDYG--CYDNALKAYKTIERITLLKNEKTEEMMEKFGVPKDAQFVNNSVDFLGMFSD 250
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND-----VFAKLYGAASVYYNYSG 346
VY+ Y + FL EMC+ ND V A + N+
Sbjct: 251 VYSYGNQYKAYNKFL--------LEMCEKFKKIDMSNDDEVINVMADTSNSIVGKDNFFN 302
Query: 347 TAKCFDLNGD--SDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
F N SD W + C E+ + + + CKE +
Sbjct: 303 NNIEFLKNTSIYSDSKSSRSWMYMTCNELGWFSSA---SGLLRSELLTIETSLDSCKELF 359
Query: 405 GVDPRPNW--ITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN-ISKTVLALVE 461
G P+ ++GG+ + + +++ N DPWS + +N K++++
Sbjct: 360 GFTQFPDTEKFNEKYGGYNPNV------TKVVYTNSHYDPWSELTMKRNDTEKSIISFNI 413
Query: 462 KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
K+G H DL + D ++LK VR ++ + W+ +
Sbjct: 414 KDGFHCDDLHDPSDGDSEYLKSVREETIKQLLAWMKE 450
>gi|24648177|ref|NP_650803.1| CG18493 [Drosophila melanogaster]
gi|23171717|gb|AAF55663.2| CG18493 [Drosophila melanogaster]
Length = 480
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/226 (34%), Positives = 114/226 (50%), Gaps = 17/226 (7%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+ K+ TQ LD N+N + QT+Q RYL+ND +PIF+Y G E +IE + G
Sbjct: 55 QEKWITQKLD--NFNASNTQTYQMRYLLNDEF---QTEGSPIFIYLGGEWEIEESMVSAG 109
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
YD+A + +LV+ EHRYYG+SIP + + YL QALAD A I
Sbjct: 110 HWYDMAQEHNGVLVYTEHRYYGQSIP------TSTMSTEDLKYLDVKQALADVAVFIETF 163
Query: 174 K-KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
K +N +S V++ GGSY + WF+ YP + +G ASSAPIL V + +
Sbjct: 164 KAENPQLANSKVILAGGSYSATMVVWFKRLYPDLIVGGWASSAPIL---AKVDFTEYKEV 220
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS 277
+ Q F + + CY I+ ++E G E + R+C S
Sbjct: 221 VGQAFLQLGGQKCYDRIENGIAELESMFANKRGAEA-RAMLRLCNS 265
>gi|268535772|ref|XP_002633021.1| Hypothetical protein CBG21793 [Caenorhabditis briggsae]
Length = 1085
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 119/481 (24%), Positives = 200/481 (41%), Gaps = 65/481 (13%)
Query: 50 QGLYKTKYHTQILDHF-NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIE 106
QG+++ Q +HF N NP FQQ++ N + W ++ P F+ G EG
Sbjct: 579 QGMFR-----QRENHFDNRNPD---FFQQKFYKN-SQW--AQPGGPNFLMIGGEGPEGPR 627
Query: 107 WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADY 166
W A K+ A + +EHR+YG S+ +N L+S Q L D
Sbjct: 628 WVLNENLTWLTYAKKYGATVFILEHRFYGDSL--------VGQNNDNFNVLTSLQMLYDL 679
Query: 167 ASLI--IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIV 224
A I ++++ T T +P + FGGSY G ++AW R +P + IGA+ASS P+ +
Sbjct: 680 AEFIKAVNIR---TGTSAPWITFGGSYSGAMSAWMREVFPELVIGAVASSGPVFAKTDF- 735
Query: 225 SPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFR--------ICK 276
Y + ++ + R+ + C I+ + ++ + G + L F+ +
Sbjct: 736 --YEYLMVVEKSIRTYDKTCADRIQSGFSTMQTMFQTKEGRQNLSDIFQLQPPFGDNVTD 793
Query: 277 SEKNL---AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI-DDPKTGNDVFA 332
++++ I A Y+ P + + + +MCK + +D T +
Sbjct: 794 TDQHYFFSNIYGNFQGAVQYSGDNTGPYAN-------GYGIPDMCKFMTNDSNTPLNNIV 846
Query: 333 KLYGAASVYYN----YSGTAKCFDLNGDSDPHGLSE---------WGWQACTEMIMLTGG 379
+ +V+YN Y+GT + DS H W WQ C E
Sbjct: 847 QFNEYMTVFYNNGRNYTGTDNKYQDMIDSLVHAQESGPESAASLLWTWQTCNEFGYFQSA 906
Query: 380 DNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFG--GHKIGLVLKRFASNIIFFN 437
D + IF S + + C + +G + +I + +K G +N++F N
Sbjct: 907 DTGNGIFG-SPTPVNMFVQMCMDVFGSTYQRIFIDNQIAQTNYKYGERQHYRGTNVVFPN 965
Query: 438 GLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G DPW G+ + +V+A + AH D+ + D LK VR IGKW+S
Sbjct: 966 GNVDPWHALGLYGSADDSVVAYLINGTAHCADMYPARAADVPGLKVVRDIIDTNIGKWLS 1025
Query: 498 Q 498
Q
Sbjct: 1026 Q 1026
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 109/468 (23%), Positives = 177/468 (37%), Gaps = 47/468 (10%)
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW--FAQNT 112
T Y Q LDH+N N TF QRY +++ + + F+Y GD E +
Sbjct: 47 TAYMIQNLDHYNGNASG--TFIQRYYYTESY---TLHQRTAFLYISVSGDFETSVITDDR 101
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
+ A +F A + +EHRYYG+S P +A ++++ YL+S QA+ D + I
Sbjct: 102 NPVVKSAKQFGATVFSLEHRYYGQSKP-----NVANFDSNSLRYLNSFQAIQDIVAFIKY 156
Query: 173 LKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
K D V++G YGG++AA R P V G +ASS+P+ + Y F
Sbjct: 157 ANKQFNMDPDVRWVLWGAGYGGVIAAEARKWNPDVVAGVIASSSPLTHV------YDFWQ 210
Query: 232 IITQDFRSVSEN----CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWL 287
++S+ CY+ I + I + P G + F++ +
Sbjct: 211 FNDHVQIAISQEGGQLCYQKIMQGFTDIRLAMRTPEGRSNISDLFQLNPRLDQTNLTYND 270
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLY---------GA 337
F M+ + F N + +C ID T V + Y
Sbjct: 271 IQTFYLAIMSPFQEMIQFNNDF-NIDIGALCTTIDQSTWTPMQVIWQAYVYLSTTVTGSV 329
Query: 338 ASVYYNYSGTAKCFDLNGDSDPHGLSE-WGWQACTEMIML-TGGDNKDSIFEESEEDYDA 395
+ +Y S P+ + W +Q CTE + T +N+ +F
Sbjct: 330 QPMITSYQTIVSDLGNQSASSPNIDNRMWQYQMCTEFGWIPTTNNNEQGLFGA----VIP 385
Query: 396 RARYCKEAYGVDPRPNWITTEFG------GHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
+ + + + P N T H G +N++F NG DPWS G
Sbjct: 386 TSLFLNMCFDIFPGANMDATTIRDLTIDYNHYYGSSYDYSGTNVVFTNGWYDPWSTLGKE 445
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+V+A V GA F + ++ + R E I W++
Sbjct: 446 TTADFSVVAYV-IPGASWASDMFPGSTNNTFIINAHRLMAENINIWVN 492
>gi|195109610|ref|XP_001999376.1| GI23100 [Drosophila mojavensis]
gi|193915970|gb|EDW14837.1| GI23100 [Drosophila mojavensis]
Length = 882
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/293 (30%), Positives = 147/293 (50%), Gaps = 23/293 (7%)
Query: 13 LFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSY 72
L S +++ K P F + ++ L ++ +G + K+ TQ LD F+ +
Sbjct: 14 LASQAISLELKKDVPVFVKTLKDMQRGPPLKMVKRELKG--EEKWITQPLDQFDETNK-- 69
Query: 73 QTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHR 132
+T++ RY IND +PIF++ G E + N G+ YD+A + K +L++ EHR
Sbjct: 70 ETYEMRYFINDEF---QTEGSPIFIFLGGEWEASLNMINDGYWYDLAKEHKGVLIYTEHR 126
Query: 133 YYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK-KNLTATDSPVVVFGGSY 191
YYG S+P K ++ ++ YL QALAD A+ I K +N ++S VV+ G SY
Sbjct: 127 YYGASVP---TKTMSLEDLK---YLHVKQALADVANFIKTFKSENAQLSNSKVVLSGCSY 180
Query: 192 GGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV-SENCYKVIKG 250
+A WF+ YP + +GA ASSAP+ V Y + ++ + FR + E CY I+
Sbjct: 181 SATMAVWFKRLYPDLVVGAWASSAPLF---AKVDFYEYKEVVGKAFRELGGEKCYNRIEK 237
Query: 251 SWKQIEETAKKPGGLEKLQKAFRICKS---EKNLAIESWL-STAFVYTAMTDY 299
++E K E + RIC + E +L + S S + V++++ Y
Sbjct: 238 GIAELESMFKNKRAAEA-RAMLRICSNFDHENDLDLWSLFGSISNVFSSLAQY 289
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 117/463 (25%), Positives = 203/463 (43%), Gaps = 56/463 (12%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T++ Q LD N++ + + QR LIN+ ++ + +PIF+Y G E I+ A +G
Sbjct: 457 ETRWFNQSLD--NFDDTNKNVWDQRVLINEDNF---VDGSPIFIYLGGEWAIDPSAITSG 511
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
D+A + LV+ EHR++G+SIP I + Y S QALAD ++I L
Sbjct: 512 LWVDIAKEHNGSLVYTEHRFFGESIP------ITPLSTKNLKYQSVEQALADVVNVIKVL 565
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K+ +S VV+ G SY +A WF+L YP V +G+ ASSAP+ D IV F I+
Sbjct: 566 KEEDKYKESKVVISGCSYSASMAVWFKLLYPDVIVGSWASSAPL---DAIVDFSDFMEIV 622
Query: 234 TQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL---S 288
+ +R + + CY +I + E+ + G + ++ +C S ++N + W S
Sbjct: 623 GRAYRQLGGDYCYDLIDNATSYYEDLFQTGQG-ARAKELLNLCDSFDENNERDQWQIFSS 681
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI----DDPKTGNDVFA--KLYGAASVYY 342
A ++ + Y P N + + + C + DD + F +L V
Sbjct: 682 IANIFAGIAQYQKPEN-------YDLAQYCSVLRSFDDDDASALSQFVQWRLGKPECVNT 734
Query: 343 NYSGTAKCFDLNGDS-DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
Y GT + + ++ D GL W +Q C + NK+ F S C
Sbjct: 735 RYQGTVDYYKWSKNNYDGSGLG-WFYQTCRQFGWFQSSANKNHPF-GSTFPATLYTDTCH 792
Query: 402 EAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
+ +G ++ ++GG + N+ +G D WS G
Sbjct: 793 DVFGSQYTSAKIEEYIQATNKKYGGKHPAV------ENVYMTHGGLDGWSRVG------- 839
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ A++ + +H D D L+ + R +E++ +W++
Sbjct: 840 SDSAIIIPQASHCSDSGSINPTDSAALRATKERLIELVREWLA 882
>gi|148700667|gb|EDL32614.1| protease, serine, 16 (thymus), isoform CRA_b [Mus musculus]
Length = 475
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 108/420 (25%), Positives = 179/420 (42%), Gaps = 68/420 (16%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + Q LD FN + + +TF QRY +ND H G + P+F++ G EG + + G
Sbjct: 57 KQGWLEQPLDPFNASDR--RTFLQRYWVNDQHRTG--QDVPVFLHIGGEGSLGPGSVMAG 112
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+AP + AL++ +EHR+YG S+P GG + + YLSS ALAD AS L
Sbjct: 113 HPAALAPAWGALVISLEHRFYGLSMPAGG------LDLALLRYLSSRHALADVASARQAL 166
Query: 174 KKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L ++ SP + FGGSY G LA W RLK+PH+ A+ASSAP+ +V +++ +
Sbjct: 167 SGLLNVSSSSPWICFGGSYAGSLATWARLKFPHLVFAAVASSAPL---SAVVDFSAYNQV 223
Query: 233 ITQDFRSV----SENCYKVIKGSWKQIEETAKK-PGGLEKLQKAFRIC--------KSEK 279
+ + V S C ++ ++E + P L++ C ++E
Sbjct: 224 VARSLTQVAIGGSLECLAAASTAFTEVERLLRAGPAAQAVLREELGACGSLDLTEDQAEL 283
Query: 280 NLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV----FAKLY 335
A+++ + Y P V+++C + K GN + L
Sbjct: 284 LGALQALVGGTVQYDGQAGAP-----------LSVRQLCGLLLG-KWGNRSRSTPYLGLR 331
Query: 336 GAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQACTEMIMLTGGDNKD 383
A + G KC + +++P G +W +Q CTE +
Sbjct: 332 RAVQIVLRSMGQ-KCLSFSRAETVAQLSNTEPQVSGVGDRQWLYQTCTEFGFYVTCEGLQ 390
Query: 384 SIFEESEEDYDARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASNIIFFNG 438
F + + C++ +G+ P + +GG G A+ ++F NG
Sbjct: 391 CPFSQLPA-LPFQLELCEQVFGLSPASVAQAVAQTNSYYGGQSPG------ATQVLFVNG 443
>gi|341898957|gb|EGT54892.1| hypothetical protein CAEBREN_15465 [Caenorhabditis brenneri]
Length = 825
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 125/475 (26%), Positives = 202/475 (42%), Gaps = 60/475 (12%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMY 116
TQ LDHF+ P + +T+ Q+Y N + S+NN+ IF+ G EG + +W A
Sbjct: 60 TQKLDHFD--PYNTKTWNQKYFYNPIY---SRNNSIIFLMIGGEGPENGKWAAYPQVQYL 114
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
A +F A + +EHR++G S P I S+ YL++ QALAD A I + +
Sbjct: 115 QWAKEFGADVFDLEHRFFGDSWP------IPDMETSSLRYLTTQQALADLAFFIESMNQQ 168
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNII 233
+ V FGGSY G L+AWFR KYP + +G++ASSAP+ L+F Y ++ ++
Sbjct: 169 YGFKNPRWVTFGGSYPGSLSAWFRQKYPQLTVGSVASSAPVNLKLDF------YEYAMVV 222
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC----KSEKNLAIESWLST 289
D + NC ++ ++ QI++ + G KL F + + L I ++
Sbjct: 223 QDDLQLTDANCAPAVRDAFTQIQQLSLTVDGRNKLNNYFNLQPPFDANTTKLDINNFFGN 282
Query: 290 AF-VYTAMTDYPTPSNFLNPLPAFPVKEMCKAI------DDPKTGNDVF----------A 332
F + MT Y + VK+MC+ + D K ++F
Sbjct: 283 LFNTFQGMTQYTYDGQSNSTHTDKTVKKMCEIMTNTTEPDKVKRVENLFLWFNQMEPAGP 342
Query: 333 KLYGAASVYYNYSGTAKCFDLNG-DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEE 391
L + Y++ DL D W W C E+ L + +++F
Sbjct: 343 DLSVMPNSYWDVIKQVGSGDLKVLGEDGAAARGWMWLCCNEIGFLQTTNQGNNVFGTGVP 402
Query: 392 DYDARARYCKEAYGVDPRPNWITTEFGGHK-----IGLVLKRFASNIIFFNGLRDPWSGG 446
+ C + +G + I G+K G A+N++ NG DPW
Sbjct: 403 -LNLFIDMCTDMFGDSMK---IKNIMAGNKKSQNYYGGADFYNATNVVLPNGSLDPWHAL 458
Query: 447 GVL-KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRR------REVEIIGK 494
G K +++ L + AH D+ S +PQ L R R+ I+G+
Sbjct: 459 GTYNKKEAQSQLPYLINGTAHCGDMYASYDGEPQSLLAARAFIKQNVRDFIILGQ 513
>gi|17539994|ref|NP_501599.1| Protein PCP-2 [Caenorhabditis elegans]
gi|3876284|emb|CAB05187.1| Protein PCP-2 [Caenorhabditis elegans]
Length = 1080
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 131/511 (25%), Positives = 212/511 (41%), Gaps = 71/511 (13%)
Query: 24 KIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLIND 83
K+F P PE S S D ++T Q DHFN Q+ FQQR+ N
Sbjct: 547 KVFLGRPPHGFLPE--SDFNMSPDDYPAGFETGSFRQRQDHFN--NQNADFFQQRFFKN- 601
Query: 84 THWGGSKNNAPIFVYTGNEG-DIEWFAQNTGFMYDV-APKFKALLVFIEHRYYGKSIPYG 141
T W +K P F+ G EG D + N Y + A K+ A + +EHR+YG+S G
Sbjct: 602 TQW--AKPGGPNFLMIGGEGPDKASWVLNENLPYLIWAKKYGATVYMLEHRFYGES-RVG 658
Query: 142 GNKEIAYKNASTTGYLSSTQALADYASLI--IDLKKNLTATDSPVVVFGGSYGGMLAAWF 199
N LSS Q + D A I +++K + T +P + FGGSY G+++AW
Sbjct: 659 DNTNF--------NRLSSLQMIYDIADFIRSVNIK---SGTSNPWITFGGSYSGLISAWT 707
Query: 200 RLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETA 259
R +P + +GA+ASSAP+ + Y + + RS + C I+ + +
Sbjct: 708 REVFPELVVGAVASSAPVFAKTDF---YEYLMVAENSIRSYNSTCADRIQEGFNSMRALF 764
Query: 260 KKPGGLEKLQKAFRI----CKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVK 315
GG + L F++ + ++ + S + + N + ++ +
Sbjct: 765 LTKGGRQTLSSMFKLDPPFADNVTDIDQHYFFSNIYSNFQGDVQYSGDNMGSYANSYGIP 824
Query: 316 EMCKAI-DDPKTG-------NDVFAKLYGAASVYY----------NYSGTAKCFDLNGDS 357
+MCK + +D T N+ A Y Y+ N+ AK D D+
Sbjct: 825 DMCKIMTNDSNTPLNNIVAFNEYMANFYNGGGPYFGLDNSYQDMINFLINAK--DFGPDA 882
Query: 358 DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY-------GVDPRP 410
+ L W WQ C+E D+ + IF S + + C + + +DP
Sbjct: 883 EASLL--WTWQTCSEFGYFQSADSGNGIF-GSPTPVNFFIQICMDVFNNYYQRSAIDPMV 939
Query: 411 NWITTEFGGHKIGLVLKRF---ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHH 467
+ +G +RF SN++F NG +DPW G+ +V++ + AH
Sbjct: 940 DNTNYMYG--------ERFHFRGSNVVFPNGNKDPWHALGLYYPTDSSVVSYLIDGTAHC 991
Query: 468 VDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
D+ + D LK VR + I W++Q
Sbjct: 992 ADMYPARDADVPGLKVVRDLIDQNIAIWLNQ 1022
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 109/474 (22%), Positives = 184/474 (38%), Gaps = 47/474 (9%)
Query: 48 DSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVY-TGNEGDIE 106
+S+ T Y Q LDH N TF QRYL + + + + F+Y +G EG
Sbjct: 38 NSETAMTTGYMAQNLDHLIGNASG--TFTQRYLYSQQY---TLHQRTAFLYVSGVEGPNV 92
Query: 107 WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADY 166
T + A +F A + +EHRYYG+S P + +A +L+S QA D
Sbjct: 93 VLDDRTPIV-KTAKQFGATIFTLEHRYYGESKP-----NVDKLDAYNLRHLNSFQATQDV 146
Query: 167 ASLI--IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILN-FDNI 223
S I +++ N+ D VV+G YGG++AA R P+ G +ASS P+ + +D
Sbjct: 147 ISFIKYANVQFNMDQ-DVRWVVWGIGYGGIIAAEARKLDPNSVSGVIASSTPLTHEYDFW 205
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAI 283
+ + ++ + S+ CY+ + + I E K P G + F++ A+
Sbjct: 206 RFNHRVAIVLAETGGSL---CYRKVANGFADIREAMKTPEGRLNISDLFQLNPRLNETAL 262
Query: 284 ESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLYGAASVYY 342
F + + F + + ++C ID T +V + Y S
Sbjct: 263 NYNDIQMFYLAIIAPFQEIVEFNDDF-DLSIADLCTTIDKSNWTNMEVVYQAYVYLST-- 319
Query: 343 NYSGTAKCFDLNGDSDPHGLSE------------WGWQACTEM-IMLTGGDNKDSIFEES 389
G A D++ L + W +Q CTE T DN+ +F
Sbjct: 320 TLDGFAGPMDISYQDFVDSLGDQSVDSGWIDNRIWQYQVCTEFGWFYTTNDNEQGLFGPV 379
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKI------GLVLKRFASNIIFFNGLRDPW 443
+ + + + + P N T I G +N +F NG+ DPW
Sbjct: 380 V----PASLFLNQCFDIFPDANLTATGLRDSIINYNNFYGSSYDYSGTNAVFTNGMNDPW 435
Query: 444 SGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G +V+A + + + D+ F + ++ E I W++
Sbjct: 436 RELGKTSTGDFSVVAYLIPDASTASDM-FPGNTNNSFIIQAHNLMTENINVWLN 488
>gi|125778536|ref|XP_001360026.1| GA14959 [Drosophila pseudoobscura pseudoobscura]
gi|195158020|ref|XP_002019892.1| GL12647 [Drosophila persimilis]
gi|54639776|gb|EAL29178.1| GA14959 [Drosophila pseudoobscura pseudoobscura]
gi|194116483|gb|EDW38526.1| GL12647 [Drosophila persimilis]
Length = 479
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 120/465 (25%), Positives = 199/465 (42%), Gaps = 60/465 (12%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+ K+ TQ LD N++ + +T++ RYL+ND + +PIF+Y G E +IE + G
Sbjct: 54 QEKWITQKLD--NFDDDNKETYEMRYLVNDEF---QEEGSPIFIYLGGEWEIEASMVSAG 108
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
YD+A + K +LV+ EHRYYG+S+P + + + YL QALAD A I
Sbjct: 109 HWYDLAEQHKGVLVYTEHRYYGESVP------TSTMSTANLKYLHVKQALADVAEFIKSF 162
Query: 174 K-KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
K ++ +S VV+ GGSY + WF+ YP + G ASSAPIL V+ + +
Sbjct: 163 KAEHPQLANSKVVLAGGSYSATMVVWFKRLYPDLVDGGWASSAPIL---AKVAFTEYKEV 219
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS---EKNLAIESWL- 287
+ Q F + + CY I+ ++E G E + R+C S + +L I S
Sbjct: 220 VGQAFLQLGGQKCYDRIQNGIAELESMFDNKRGAEA-RAMLRLCNSFDDKNDLDIWSLFG 278
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI-----DDPKTGNDVFAKLYGAASVYY 342
S + V+ Y P + ++ C + D N V+ + +
Sbjct: 279 SISNVFAGTAQYQRPGD---------IEYYCDYLLSFRDDATAIANFVYWGWGMPSCIDA 329
Query: 343 NYSGTAKCFDLNGDSDPHGLSEWGWQACTEM-IMLTGGDNKDSIFEESEEDYDARARYCK 401
YS T + L ++ W +Q C E T G K S+ C
Sbjct: 330 RYSSTVDYY-LWAVNNFDAGRPWYYQTCNEYGWYQTSGSAKQPF--GSKFPTAMYTTLCA 386
Query: 402 EAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS--GGGVLKNI 452
+ +G ++ +FGG + N+ +G DPW+ G GV +
Sbjct: 387 DVFGSQFSNEQINSNAAQTNLDFGG------MSPEVENVYMTHGALDPWNPMGHGVAE-- 438
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
T++A +H D + D ++ + R E++ +W++
Sbjct: 439 GATLIA----NASHCADFSSISASDSAEMRASKERLAELVSEWLA 479
>gi|344292448|ref|XP_003417939.1| PREDICTED: thymus-specific serine protease-like [Loxodonta
africana]
Length = 574
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 102/371 (27%), Positives = 171/371 (46%), Gaps = 51/371 (13%)
Query: 47 KDSQGLYKTK-YH-------TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVY 98
+ ++G YK+K H Q LDHF+ N S + + QRY ND + K P+F+
Sbjct: 23 RRTRGAYKSKGMHPSDDGSFQQKLDHFSKN--SSELWPQRYFFNDVFY---KPGGPVFLL 77
Query: 99 TGNEGDI--EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG- 155
G I W + N ++ A + ALL+ +EHR+YG S P G N ST
Sbjct: 78 IGGSDTICESWISTNNTWV-SYAERLGALLILLEHRFYGHSQPTG--------NVSTASL 128
Query: 156 -YLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
YLSS QALAD + I + + + T + V FG SYGG LA W R+K+P + A+ SS
Sbjct: 129 HYLSSRQALADIVNFRIKIAEKVGLTKNKWVAFGCSYGGSLAVWSRIKHPDLFAAAVGSS 188
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
AP+ N Y + ++ + + + C++ +K ++ Q+ + + +L+K F +
Sbjct: 189 APMQAKANF---YEYLEVVQRSLITHNRQCFQAVKEAFGQVMKMLRLRKYHRQLRKDFTL 245
Query: 275 CK-----SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND 329
CK SE + A+ L + + + + ++ +F + EMC + + G+
Sbjct: 246 CKPLKHYSEMDKAVFIDLLIFPLKSTVQENDNEKDYEREEDSFTMDEMCDIMAETSLGSP 305
Query: 330 VFAKLYGAASVY-YNYSGTAKCFD---------LNGDS----DPHGLSEWGWQACTEMIM 375
+ L V+ Y +CF L+ S +P ++ +Q+CTE
Sbjct: 306 YYRYLRIIQLVFKLEY---LRCFPAYYRKKLEMLSNSSIDQHNPKKARQFFYQSCTEFGF 362
Query: 376 LTGGDNKDSIF 386
D+K+ +F
Sbjct: 363 FQTTDSKNQLF 373
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 42/130 (32%), Positives = 70/130 (53%), Gaps = 3/130 (2%)
Query: 150 NASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIG 209
+ ++ YLSS QALAD + I + + + T + V FG SYGG LA W R+K+P +
Sbjct: 382 STASLRYLSSRQALADIVNFRIKIAEKMGLTKNKWVAFGCSYGGSLAVWSRIKHPDLFAA 441
Query: 210 ALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQ 269
A+ SSAPI N Y + ++ + + + C++ +K ++ +I + + KL+
Sbjct: 442 AVGSSAPIQAKANF---YEYLEVVQRSLTTHNNKCFQAVKEAFDEITKMLRLRKYYSKLE 498
Query: 270 KAFRICKSEK 279
FR+CK K
Sbjct: 499 YDFRLCKPLK 508
>gi|218184874|gb|EEC67301.1| hypothetical protein OsI_34292 [Oryza sativa Indica Group]
Length = 550
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 130/484 (26%), Positives = 204/484 (42%), Gaps = 95/484 (19%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
++ Q LDHF+ P ++ F+QRY D H GG P+F+ E + +
Sbjct: 52 RWMDQRLDHFS--PTDHRQFKQRYYEFADYHAGG----GPVFLRICGESSCNGIPND--Y 103
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL----- 169
+ +A KF A +V EHRYYGKS P+ + + +N +LSS QAL D A+
Sbjct: 104 LAVLAKKFGAAVVTPEHRYYGKSSPF---ESLTTENLR---FLSSKQALFDLAAFRQHYQ 157
Query: 170 -IIDLKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
I++ + N ++ D+P VFG SY G L+AWFRLK+PH+ G+LASS +L + Y
Sbjct: 158 EILNARYNRSSGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGVVL------AVY 211
Query: 228 SFSNIITQDFRSVSENCYKVIKGSWKQIEE-------TAKKPGGLEKLQKAFRICKSEKN 280
+F++ Q S C ++ + ++E + K G EKL K++ +
Sbjct: 212 NFTDFDKQVGDSAGPECKAALQEVTRLVDEQLRLDSRSVKALFGAEKL-------KNDGD 264
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG-----------ND 329
+ A + Y +P +PL I+ KTG D
Sbjct: 265 FLFFLADAAAIGF----QYGSPDAVCSPL-----------INAKKTGRSLVETYAQYVQD 309
Query: 330 VFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEES 389
F + +G Y+ + N D W +Q C+E+ DSI +
Sbjct: 310 FFIRRWGTTVSSYDQE-----YLKNTTPDDTSSRLWWFQVCSEVAYFQVAPKNDSI-RST 363
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
E + C+ +G P+ T +GG +I AS I+F NG +DPW
Sbjct: 364 EINTGYHLDLCRNVFGEGVYPDVFMTNLYYGGTRIA------ASKIVFTNGSQDPWRHAS 417
Query: 448 VLKNISKTVLALVE-KEGAHHVDLR------FSTKEDPQ------WLKDVRRREVEIIGK 494
K+ +++ + H DLR F + DP + VR++ I
Sbjct: 418 KQKSSKYMPSYIIKCRNCGHGTDLRGCPQLPFRIEGDPSNCSSPAAVSTVRKQIASHISL 477
Query: 495 WISQ 498
W+SQ
Sbjct: 478 WLSQ 481
>gi|308478046|ref|XP_003101235.1| hypothetical protein CRE_14134 [Caenorhabditis remanei]
gi|308263940|gb|EFP07893.1| hypothetical protein CRE_14134 [Caenorhabditis remanei]
Length = 1064
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 120/467 (25%), Positives = 202/467 (43%), Gaps = 43/467 (9%)
Query: 48 DSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG---- 103
D++G+ + Q +DHFN ++ + FQQ+Y N ++ P F+ G EG
Sbjct: 571 DTEGM-EIGMFRQRIDHFN--NKNTKFFQQKYFKNSRF---ARPGGPNFLMIGGEGPEYG 624
Query: 104 -DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
D+ N+ M A ++ + +EHR+YG S+ +N + LSS Q
Sbjct: 625 HDVNL---NSSIMRR-AEEYGGTVYVLEHRFYGDSV---------VENNTDLSTLSSLQM 671
Query: 163 LADYASLI--IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
L D A I ++ K + T +P + FGGSY G L+AW R +P + IGA+ASSAP+L
Sbjct: 672 LYDLAEFIKSVNFK---SETSNPWITFGGSYPGALSAWMREIFPDLVIGAIASSAPVLAK 728
Query: 221 DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN 280
+ Y + ++ F CY+ IK + +I E + G EKL F++ ++
Sbjct: 729 TDF---YEYMMVVENSFLRYDSACYQEIKNGFDEIHELFQTDSGREKLSVLFKLNPPFRD 785
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV 340
ES F + + + + V++ A+ N + + A V
Sbjct: 786 NISES-DKHFFFFDIIGPFQFAVQYAGRGSGGFVEDSKIAMLCRNITNGTQSSVENVAKV 844
Query: 341 ----YYNYSGTAKCFDLNGDSDPHGLSE---WGWQACTEMIMLTGGDNKDSIFEESEEDY 393
+ N S +D N +E W WQ C+E D+ +SIF + +
Sbjct: 845 VLDDFKNKSIIHSFYDKNKWKHMKKTNENYLWRWQTCSEFGYFQSADSGNSIF-GAMKPV 903
Query: 394 DARARYCKEAYGVDPRPNWI--TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN 451
+ + C E +G + I E ++ G V +N++F NG DPW G+ +
Sbjct: 904 SFQVQRCMEMFGKEYTRGKIEENVEATNYRYGGVDGFRGTNVVFINGDVDPWHILGLYNS 963
Query: 452 ISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
K+V++ + +H VD+ D +K R+ + I W+ Q
Sbjct: 964 TEKSVVSYLINGTSHCVDMYPPQDNDIDGVKIARKLVDDNIKVWLEQ 1010
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 127/501 (25%), Positives = 201/501 (40%), Gaps = 69/501 (13%)
Query: 1 MATRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQ 60
M IFL L S I N F ++ P +L+ IS S Y Q
Sbjct: 1 MILHSIFLYLIFLLSVHSIIPNHFHFSD-NFRKVDPSQLNEDISVS--------LSYLIQ 51
Query: 61 ILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF-MYDVA 119
LD + P+S + FQQRY ++ H+ +K A ++V +G + E +N G + A
Sbjct: 52 PLDPLS--PESTKKFQQRYRYSE-HFTSNKKTAFLYV-SGRDDFNEAVLKNDGSPLVKAA 107
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
+F A + +EHRYYG S P N + + +L S A+ D I+
Sbjct: 108 ERFGATIFALEHRYYGNSTPNFENF-----TSESLQHLDSYHAIQD-VIFFIEHANTQFK 161
Query: 180 TDSPV--VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF 237
DS V V+FG YGG++AA R P G +A SAPI + + + ++N + +
Sbjct: 162 MDSDVRWVLFGSGYGGIIAAETRKWDPITVSGVVAISAPI---EREMDFWKYNNKVEKTI 218
Query: 238 RSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK--SEKNLAIESWLSTAFVYTA 295
+CY IK ++Q+++ G +L F + +E++LA+ F +
Sbjct: 219 MKYDSSCYNKIKKGFRQVQDLMNFSEGRNELSDLFELNPPWNERDLALNEI--QMFYLSI 276
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF--------AKLYGAA----SVYYN 343
+ + + N L +K +C AI D + + +L G+ S Y
Sbjct: 277 IAPFQQVVQYDNQL-ELSIKGLCDAIHDSRDSVEAIHQAHVYLSTQLTGSMQQMNSTYEK 335
Query: 344 YSGTAKCFDLNGDSDPHGLSE-------WGWQACTEMIML-TGGDNKDSIFEES------ 389
Y LN D S W +Q CTE T DN+D +F
Sbjct: 336 YVNDLGSKILNCLKDYQHKSCLSAQKRFWQYQMCTEFGWFPTTNDNEDGLFGSVVPLSLF 395
Query: 390 -EEDYDARARYCKEAYGVDPRPNWITTE-FGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
+ +D K + R + + F G G +N +F NG DPW+ G
Sbjct: 396 FNQCFDIFPDLYKNETAIKIRDDIEKAKNFYGKYSG-------TNAVFINGENDPWTVLG 448
Query: 448 VLKNISK--TVLALVEKEGAH 466
+N+S +V+ L +H
Sbjct: 449 --RNVSDEFSVVTLTVPRASH 467
>gi|344268147|ref|XP_003405924.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 464
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 113/445 (25%), Positives = 190/445 (42%), Gaps = 81/445 (18%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTG--NEGDIEWFAQNTG 113
++ TQ LDHF+ S+ ++QRY IND + K P+F+ G W ++N
Sbjct: 52 RWFTQKLDHFDQKDLSF--WRQRYFINDAFY---KPGGPVFLMIGGMETAKRNWISRNLP 106
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
F+ A + AL + +EHR+YG S P G + ++ Y+ + Q L D + I +
Sbjct: 107 FI-AYAERLHALCLVLEHRFYGHSQPTGD------LSTASLRYIRNHQVLGDIVNFRIKI 159
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDN----IVSP 226
K + T + V FG YGG LA W R+KYP + A+ SSAP+ +NFD + +
Sbjct: 160 AKLMGLTKNKWVAFGEFYGGSLAVWSRIKYPDLFAAAVGSSAPVKVKINFDEYFEGVQTS 219
Query: 227 YSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESW 286
SNI C + ++ + ++ K P L+ F +C++ +N +S
Sbjct: 220 LDASNI----------KCSRAVQRALLEVIRMLKSPKSYSILKSDFMLCETPQN---KSS 266
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA--ASVYYNY 344
FV + +FL P+ + K K++ + + +D K+ +S Y+ Y
Sbjct: 267 QHIPFVLENLM------SFLIPIVQYNKKR--KSVMNILSIDDFCDKMTETPLSSPYHRY 318
Query: 345 SGTAK---------CFDLNGDSDPHGLSEWG-------------WQACTEMIMLTGGDNK 382
+ + C D N + LSE +Q CTE D++
Sbjct: 319 ARIVRNNIRNRNLSCLDANYNHRLRRLSETSLDKGNVDQARQRLYQCCTEFGFFQSTDSR 378
Query: 383 DSIFEESEEDYDARARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIF 435
F E Y C + +G + + + +GG + S IIF
Sbjct: 379 YQPFSELPIRYFLDK--CSDLFGSEYSFSSLRQSAEALNSYYGGFNVN------GSKIIF 430
Query: 436 FNGLRDPWSGGGVLKNISKTVLALV 460
+G DPW+ G+ ++ISK + A++
Sbjct: 431 SSGSLDPWNALGITRDISKNLRAVL 455
>gi|403308859|ref|XP_003944859.1| PREDICTED: thymus-specific serine protease isoform 2 [Saimiri
boliviensis boliviensis]
Length = 539
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 115/490 (23%), Positives = 201/490 (41%), Gaps = 65/490 (13%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + Q+L+ FN + + ++F QRY +ND HW G + PIF++ G EG + + G
Sbjct: 56 KVGWLEQLLNPFNVSDR--RSFLQRYWVNDQHWTG--QDGPIFLHLGGEGSLGPGSVMKG 111
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGG---------------NKEIAYKNASTTGYLS 158
+AP + AL++ +EHR+YG SIP GG K + L
Sbjct: 112 HPAALAPAWGALVISLEHRFYGLSIPAGGLDMAQLRFLSSRHAVGKSSGIPSDEDRPSLP 171
Query: 159 STQALADYASLIIDLKKNL-TATDSPVVVFGGSYGGMLAAWFRLK------YPHVAIGAL 211
S LAD S + L + ++ SP + FGGSY G LAAW RLK +PH+ ++
Sbjct: 172 SDPRLADVVSARLALSRLFNVSSSSPWICFGGSYAGSLAAWARLKVLGFLRFPHLIFASV 231
Query: 212 ASSAP---ILNFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWKQIEETAKKPGG 264
ASSAP +L+F +++++++ +S S C + ++ ++E + G
Sbjct: 232 ASSAPVRAVLDFSE------YNDVVSRSLKSAAIGGSLECQAAVSTAFSEVERRLRAGGA 285
Query: 265 LEKLQKA-FRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD 323
+A C S ++ L A PL + +
Sbjct: 286 ARAALQAELNACGSLSRAEDQAELLGALQALVGGAVQYDGQAGAPLSVRQLCGLLLGGGA 345
Query: 324 PKTGNDVFAKLYGAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQACT 371
++ + + L A + + G KC + ++P G +W +Q CT
Sbjct: 346 NRSRSTPYCGLRRAVQIVMHSLGQ-KCLSFSRAETVAQLRSTEPQVSGVGDRQWLYQTCT 404
Query: 372 EMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVL 426
E +N F + ++ C++ +G+ P + +GG G
Sbjct: 405 EFGFYVTCENPRCPFSQLPA-LPSQLDLCEQVFGLSPLSVAQAVAQTNSYYGGQTPG--- 460
Query: 427 KRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRR 486
A+ ++F NG DPW V + + + L+ G+H +D+ D L+ R+
Sbjct: 461 ---ANQVLFVNGDTDPWHVLSVTQALGSSESVLLIPSGSHCLDMAPERPSDSPSLRLGRQ 517
Query: 487 REVEIIGKWI 496
+ + W+
Sbjct: 518 SIFQQLQTWL 527
>gi|195391902|ref|XP_002054598.1| GJ22718 [Drosophila virilis]
gi|194152684|gb|EDW68118.1| GJ22718 [Drosophila virilis]
Length = 476
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 117/465 (25%), Positives = 194/465 (41%), Gaps = 60/465 (12%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T + Q LDHF+ +T+Q RY++ND + K P+F++ G E +I G
Sbjct: 49 QTLWIEQKLDHFD--ESETRTWQMRYMLNDVFF---KAGGPLFIFLGGEWEISTGRITAG 103
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
MYD+A + LL + EHRYYG+S P +++ +N YL QALAD A I
Sbjct: 104 HMYDMAKEHNGLLAYTEHRYYGESHPL---PDLSNENIQ---YLHVKQALADLAHFITTQ 157
Query: 174 KKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
K DS V++ GGSY + WF+ YP + +G ASSAP+ N V + +
Sbjct: 158 KATYEGLVDSKVIIVGGSYSATMVTWFKKTYPDLVVGGWASSAPLFAKMNFV---EYKEV 214
Query: 233 ITQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
Q + + CY I+ ++E G E ++ ++C + + S L
Sbjct: 215 TGQSIALMGGSACYNRIENGIAEMEAMIASKRGAE-VKALLKLC---ERFDVYSDLDIWT 270
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVY---YNYSGTA 348
+++ ++D + + A ++ +C+ I + L G +S + SG
Sbjct: 271 LFSEISD--IFAGLVQTHDAGNIEGVCQKIMAESS------DLVGVSSYILSEFEKSG-G 321
Query: 349 KCFDLNGDSDPHGLSE----------WGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
C DL+ D+ LSE W +Q C E + F ++
Sbjct: 322 NCHDLSYDAMIGVLSESRYTGNIMRQWIYQTCNEYGWYQTSGSSAQPF-GTKFPVTFYTT 380
Query: 399 YCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN 451
C + YG ++ R FGG L N+ +G DPW G+
Sbjct: 381 MCADLYGAQFTNSYIESRVAETNENFGG------LSPNVQNVYLTHGHLDPWRAMGIQDE 434
Query: 452 ISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
T++ E AH D + +D ++ + R E++ +W+
Sbjct: 435 TQATII----PEHAHCKDFGSISVDDTAEMRASKERIAELVREWL 475
>gi|66800673|ref|XP_629262.1| hypothetical protein DDB_G0293014 [Dictyostelium discoideum AX4]
gi|60462657|gb|EAL60859.1| hypothetical protein DDB_G0293014 [Dictyostelium discoideum AX4]
Length = 486
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 122/465 (26%), Positives = 204/465 (43%), Gaps = 62/465 (13%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTG 113
++ TQ +DHFN P + TFQQRYLIND +W G+ P+F+ EG DI Q
Sbjct: 52 QWFTQSVDHFN--PANPTTFQQRYLINDQYWDGT---GPVFIMINGEGPMDINTVTQLQF 106
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
++ A + AL+V +EHRYYG S ++++ +N +L+S QALAD A
Sbjct: 107 VVW--AKQVSALVVSLEHRYYGASFV---TEDLSLENLQ---WLNSAQALADNAVF---- 154
Query: 174 KKNLTAT------DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIV 224
+N A +S + FGGSY G L +WFR+KYPH+ +ASSAP+ +NF
Sbjct: 155 -RNFVAQQYNVPKESKWISFGGSYSGALTSWFRIKYPHLVDATIASSAPVNPEVNF---- 209
Query: 225 SPYSFSNIITQDFRSVSEN---CYKVIKGSWKQIEETAKKP--GGLEKLQKAFRICKSEK 279
Y + + + N C + I + ++I+ + GG++++ F +C
Sbjct: 210 --YQYLETVQTALLASKSNGNLCVENINIATQKIQALLSQDNYGGVDQM---FNLCTPLG 264
Query: 280 NLAIESWLSTAFVYTAMTDYPTPSNFLNPLPA-FPVKEMCKAI----DDPKTGN-DVFAK 333
N ++ ++T F+ + ++ + + P + +C + DP T ++ +
Sbjct: 265 N---QNDVAT-FMQSLAGNFMGVVQYNDEEPGQIDIDYLCNIMTNQSSDPLTNYIQIWDQ 320
Query: 334 LYGAASVYYNY-SGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGD--NKDSIFEESE 390
V +Y S A+ ++ D + G W +Q C E D + + F
Sbjct: 321 YADGECVDVSYASMIAQNQNVTNDENAIGGRMWFYQTCVEFGYYQSSDAPSANQPFGNLF 380
Query: 391 EDYDARARYCKEAYGVD---PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
+ + C +++G+ P NW TE+GG N ++ NG D W
Sbjct: 381 PFQPYQIQQCADSFGIPNMYPNVNWTITEYGGINPE---PSSVDNTLYVNGSNDEWHNLA 437
Query: 448 VLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEII 492
+L + L +H D+ T P L ++ E I
Sbjct: 438 ILPGNANAKNTLYIIGTSHCADMMIPTSVSPPTLAQAQQIIFEFI 482
>gi|170068625|ref|XP_001868940.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167864603|gb|EDS27986.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 366
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 79/253 (31%), Positives = 132/253 (52%), Gaps = 16/253 (6%)
Query: 47 KDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE 106
++ G KT + Q+LDH +P + T++QRY +ND ++ + + P+F+ G EG+
Sbjct: 42 REVSGDVKTLWFDQLLDH--NDPTNAATWKQRYYVNDAYFD-DRTSGPVFLMIGGEGEAT 98
Query: 107 WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADY 166
N G A + AL +EHR+YGKS P G + + GYL+S QALAD
Sbjct: 99 ARWMNEGAWIRYAKEHGALCFQLEHRFYGKSHPTGD------LSTANLGYLTSEQALADL 152
Query: 167 ASLI--IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIV 224
A + ++ K LTA + + FGGSY G LAAW R KYP++ G+++SS P+L +
Sbjct: 153 AYFVEAMNEKYQLTAQNR-WIAFGGSYPGSLAAWLREKYPYLVHGSVSSSGPLL---AKI 208
Query: 225 SPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIE 284
+ ++ +S S C + ++ + Q+E K G + + F++C + +I
Sbjct: 209 DFKEYFQVVVNSLQSYSPQCVEDVRNAMGQVETLLKHMIGQRSINEKFKLCDPVEK-SIS 267
Query: 285 SWLSTAFVYTAMT 297
+ L A ++ A+
Sbjct: 268 NSLDIASLFEAVA 280
>gi|195158022|ref|XP_002019893.1| GL12648 [Drosophila persimilis]
gi|194116484|gb|EDW38527.1| GL12648 [Drosophila persimilis]
Length = 485
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 121/463 (26%), Positives = 205/463 (44%), Gaps = 50/463 (10%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T++ T LD N+N + T++ R LIN+ H+ + +PIF+Y G E +IE A +G
Sbjct: 55 ETRWFTLKLD--NFNAANNATWKDRVLINEDHF---TDGSPIFIYLGGEWEIEPSAITSG 109
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
D+A + L++ EHR++GKS P I + Y S QALAD +I L
Sbjct: 110 LWVDIAKEHNGSLIYTEHRFFGKSFP------ITPLSTKNLKYQSVQQALADVVHIIKTL 163
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K DS V+V G SY +A W R YP + +G+ ASSAP+ + V + ++
Sbjct: 164 KLEDKYKDSKVIVSGCSYSATMATWIRKLYPDIILGSWASSAPL---EAKVDFKDYMEVV 220
Query: 234 TQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL---S 288
Q F + + CY +I + ++ + G K +K +C + N + W +
Sbjct: 221 GQAFEQLGGKYCYDLIDNATSYYQDLFEGGQG-AKAKKELNLCANFNVNSKQDRWQIFST 279
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND-VFAKLYGAASVYYNYSGT 347
A V+ + Y P N+ LP + + DD + + V +L + Y GT
Sbjct: 280 IANVFAGLAQYQKPGNY--DLPKYCSVLRSFSDDDAEALSKFVQWRLGYPTCLSVTYKGT 337
Query: 348 A---KCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEES-------EEDYDARA 397
K +N + D GL W +Q C+E D+++ F S + +D
Sbjct: 338 VNYYKWAKINYEDD-SGLP-WIYQTCSEFGWYQSSDSENQPFGSSFPATLYTDTCHDV-- 393
Query: 398 RYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS--GGGVLKNISKT 455
+ K ++ N T I + +K N+ + G DPWS G G+ +
Sbjct: 394 -FSKNYTLINIEANIAATNKDFQGIDIAVK----NVYWTQGGLDPWSKVGAGIAQG---- 444
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
A++ + +H DL + D L+ + + +++ W++Q
Sbjct: 445 --AIIIPQASHCSDLDSISANDSPELQASKLKLAQLVRDWLAQ 485
>gi|74013257|ref|XP_850041.1| PREDICTED: uncharacterized protein LOC608061 [Canis lupus
familiaris]
Length = 325
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 58/132 (43%), Positives = 82/132 (62%), Gaps = 5/132 (3%)
Query: 365 WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGL 424
W +QACTE+ + +N +F E R +YC + +GV PR +W+ T FGG +
Sbjct: 180 WDYQACTEINLTFSSNNVTDLFPELPFTDALRQQYCLDTWGVWPRRDWLQTSFGGDDL-- 237
Query: 425 VLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDV 484
R ASNI+F NG DPW+GGG+ N+S TVLA+ + GAHH+DLR S EDP +++
Sbjct: 238 ---RGASNILFSNGDLDPWAGGGIRSNLSATVLAITIQGGAHHLDLRASHPEDPASVREA 294
Query: 485 RRREVEIIGKWI 496
RR E +IG+W+
Sbjct: 295 RRFEARLIGEWV 306
>gi|149029309|gb|EDL84576.1| protease, serine, 16 (thymus), isoform CRA_a [Rattus norvegicus]
Length = 454
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 77/219 (35%), Positives = 118/219 (53%), Gaps = 20/219 (9%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + Q LD FN + + F QRY +ND H G +AP+F++ G EG + + TG
Sbjct: 57 KQGWLEQPLDPFNATDR--RIFLQRYWVNDQHRTG--QDAPVFLHIGGEGSLGPGSVMTG 112
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+AP + AL++ +EHR+YG S+P GG + + YLSS ALAD AS L
Sbjct: 113 HPAALAPAWGALVISLEHRFYGLSMPAGG------LDVALLHYLSSRHALADVASARQAL 166
Query: 174 KKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
L ++ SP + FGGSY G LA W RLK+PH+ A+ASSAP+ + +V +++ +
Sbjct: 167 SGLLNVSSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPL---NAVVDFSAYNQV 223
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA 271
+ + V+ I GS + + + +E+L +A
Sbjct: 224 VARSLTQVT------IGGSLECLAAASTAFAEVERLLRA 256
>gi|10140733|gb|AAG13566.1|AC073867_12 putative serine peptidase [Oryza sativa Japonica Group]
Length = 628
Score = 118 bits (296), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 117/458 (25%), Positives = 189/458 (41%), Gaps = 71/458 (15%)
Query: 72 YQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEH 131
++ F+QRY ++ K PIF+Y E + ++ +A KF A +V EH
Sbjct: 190 HRQFKQRYYEFLDYYRAPK--GPIFLYICGESSCNGIPNS--YLAVMAKKFGAAVVSPEH 245
Query: 132 RYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IIDLKKNLTATDSPVV 185
RYYGKS P+ + + +N +LSS QAL D A ++ K N + DS
Sbjct: 246 RYYGKSSPF---ESLTTENLR---FLSSKQALFDLAVFRQYYQETLNAKYNRSGADSSWF 299
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCY 245
VFGGSY G L+AWFRLK+PH+ G+LASS +L S Y++++ Q S C
Sbjct: 300 VFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL------SVYNYTDFDKQIGESAGPECK 353
Query: 246 KVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMT--DYPTPS 303
++ + K ++ G L+ + A + LA + A Y P
Sbjct: 354 AALQETTKLVD------GQLQSGRNAVKQLFGASTLANDGDFLFLLADAAAIAFQYGNPD 407
Query: 304 NFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--NYSGTAKCFD---LNGDSD 358
+P+ ++ K G D+ YY + + +D L +
Sbjct: 408 ALCSPI-----------VEAKKNGTDLVETFARYVKDYYIGTFGASVASYDQEYLKNTTP 456
Query: 359 PHGLSE---WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITT 415
P S W +Q C+E+ DS+ ++ D C+ +G P+ T
Sbjct: 457 PPAESAYRLWWYQVCSEVAYFQVAPKNDSV-RSAKIDTRYHLDLCRNVFGEGVYPDVFMT 515
Query: 416 E--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE------------ 461
+GG +I S I+F NG +DPW K+ + L+E
Sbjct: 516 NLYYGGTRIA------GSKIVFANGSQDPWRHASKQKSSKELPSYLIECSNCGHCSDLSG 569
Query: 462 -KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ H++ S P+ + VR++ V+ I W+S+
Sbjct: 570 CPQAPSHIEGDSSNCSSPEAVNKVRKQIVDHIDLWLSE 607
>gi|335310851|ref|XP_003362222.1| PREDICTED: dipeptidyl peptidase 2-like, partial [Sus scrofa]
Length = 214
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 70/207 (33%), Positives = 101/207 (48%), Gaps = 29/207 (14%)
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAK 349
AF AM DYP P++F+ +P P + C + S + +
Sbjct: 18 AFTLLAMMDYPYPTDFVGGIPLAP-QVACDRL----------------------LSKSQR 54
Query: 350 CFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR 409
L + P+ W +QACTE+ + +N I R +YC + +GV PR
Sbjct: 55 IKGLRELTGPNA-KAWDYQACTEINLTFSSNNVSDIXXXXXXXETLRQQYCLDMWGVWPR 113
Query: 410 PNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVD 469
+W+ T FGG ++ ASNIIF NG DPW+ GG+ N+S +VLA+ GAHH+D
Sbjct: 114 QDWLQTSFGGGEL-----TAASNIIFSNGDLDPWASGGIQSNLSASVLAITIHGGAHHLD 168
Query: 470 LRFSTKEDPQWLKDVRRREVEIIGKWI 496
LR S DP + + RR E +IGKW+
Sbjct: 169 LRASHPADPASVVEARRLEATLIGKWV 195
>gi|390461155|ref|XP_002746166.2| PREDICTED: thymus-specific serine protease [Callithrix jacchus]
Length = 521
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 111/481 (23%), Positives = 195/481 (40%), Gaps = 65/481 (13%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K + Q+LD FN + + ++F QRY +ND HW + + PIF++ G EG + + G
Sbjct: 56 KVGWLEQLLDPFNVSDR--RSFLQRYWVNDQHW--ASQDGPIFLHLGGEGSLGPGSVMKG 111
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGG---------------NKEIAYKNASTTGYLS 158
+AP AL++ +EHR+YG S+P GG K + L
Sbjct: 112 HPAALAPACGALVISLEHRFYGLSVPAGGLDMAQLRFLSSRHAVGKSSGIPSDEDRPSLP 171
Query: 159 STQALADYASLIIDLKKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
S LAD S + L + ++ SP + FGGSY G LAAW RLK +G +
Sbjct: 172 SDPRLADVVSARLALSRLFNVSSSSPWICFGGSYAGSLAAWARLKV----LG-------L 220
Query: 218 LNFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLE-KLQKAF 272
L F +++ F++++++ +S S C + ++ ++E + G + L+
Sbjct: 221 LRFPHLI----FASVVSRSLKSAAIGGSMECRAAVSAAFSEVERRLRTGGAAQAALRAEL 276
Query: 273 RICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA 332
C S ++ L A PL + + ++ + +
Sbjct: 277 SACGSLSRAEDQAELLGALQALVGGAVQYDGQAGAPLSVRQLCGLLLGGGANRSRSTPYC 336
Query: 333 KLYGAASVYYNYSGTAKCFDLN--------GDSDPH----GLSEWGWQACTEMIMLTGGD 380
L A + + G KC + ++P G +W +Q CTE +
Sbjct: 337 GLRRAVQIVTHSLGQ-KCLSFSRAETVAQLRSTEPQVSGVGDRQWLYQTCTEFGFYVTCE 395
Query: 381 NKDSIFEESEEDYDARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASNIIF 435
N F + ++ C++ +G+ P + +GG G A+ ++F
Sbjct: 396 NPRCPFSQLPA-LPSQLDLCEQVFGLSPLSVAQAVAQTNSYYGGQTPG------ANQVLF 448
Query: 436 FNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKW 495
NG DPW V + + + AL+ G+H +D+ D L+ RR + + W
Sbjct: 449 VNGDTDPWHVLSVTQALGSSESALLIPSGSHCLDMAPERPSDSPSLRLGRRNIFQQLQTW 508
Query: 496 I 496
+
Sbjct: 509 L 509
>gi|219127344|ref|XP_002183897.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404620|gb|EEC44566.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 526
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 130/477 (27%), Positives = 200/477 (41%), Gaps = 85/477 (17%)
Query: 57 YHTQILDHF-NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
+ Q++DH + + ++ + QR+ ++ ++ G +PIFV G EG IE +TGFM
Sbjct: 33 FDDQLVDHVASSHRHGHERWSQRFYLSHEYFKGP--GSPIFVIMGGEGAIE---PSTGFM 87
Query: 116 Y----DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG-------YLSSTQALA 164
Y +A F A+++ EHR+YG+S P + A G L+ QAL
Sbjct: 88 YPFILQLAQTFGAMVLQPEHRFYGQSQPV---TPAEIERARDDGKPDPRLKLLTVEQALH 144
Query: 165 DYASLIIDLKKNLTATD-------SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
D LI ++ + + PV+ GGSY G L+A RL++P V A A+SAP+
Sbjct: 145 DAVRLIHFVRDRVRCSRDRFSPRYCPVITVGGSYPGFLSAMARLRFPGVVDMAYAASAPM 204
Query: 218 LNFDNIVSPYSFSNII-----------TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLE 266
+ V Y++ N I +QD R ++ V + +I ETA G
Sbjct: 205 KFYAQQVDQYAYYNHIGTVAEQAFTGCSQDVRRALDDFRTVYESGQSKINETAI--GICS 262
Query: 267 KLQKAFRICKSEKNLAIESWLSTAFVYTA--MTDYPTPSNFLNPLPAFPVKEMCKAIDDP 324
A+ K E + + + M YP PSN + + C+ P
Sbjct: 263 GTVPAY--IKDPATFVQEVLMMVGYTFANHNMAFYP-PSNQTH------LGRACQTFASP 313
Query: 325 KTGNDVFAKLYGAASV----YYNYSGTAKCFDLN-------------GDSDPHGLS---- 363
K + AS+ N CFD+ GD G
Sbjct: 314 SLSTLDQLKTFLVASLAPRSTENQPDEETCFDMRKQLPSGRNATISAGDWSGVGTGASGE 373
Query: 364 EWGWQACT---EMIMLTGGD-NKD----SIFEESEEDYDARARYCKEAYG--VDPRPNWI 413
W +Q CT E I GGD N+D S+F + + +C++ +G V P PN +
Sbjct: 374 SWDFQTCTSLVESIGFAGGDGNQDAYGISMFPRRDWNISWLTSHCQQRFGDAVTPMPNTL 433
Query: 414 TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
+ ++ A+ I+F NG D WS G+ ++S T+LAL GAHH DL
Sbjct: 434 VNAW---NFDDLVAAGATRIVFTNGALDGWSVSGISHDLSDTLLALTFPNGAHHSDL 487
>gi|222613128|gb|EEE51260.1| hypothetical protein OsJ_32132 [Oryza sativa Japonica Group]
Length = 524
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 117/458 (25%), Positives = 189/458 (41%), Gaps = 71/458 (15%)
Query: 72 YQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEH 131
++ F+QRY ++ K PIF+Y E + ++ +A KF A +V EH
Sbjct: 86 HRQFKQRYYEFLDYYRAPK--GPIFLYICGESSCNGIPNS--YLAVMAKKFGAAVVSPEH 141
Query: 132 RYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IIDLKKNLTATDSPVV 185
RYYGKS P+ + + +N +LSS QAL D A ++ K N + DS
Sbjct: 142 RYYGKSSPF---ESLTTENLR---FLSSKQALFDLAVFRQYYQETLNAKYNRSGADSSWF 195
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCY 245
VFGGSY G L+AWFRLK+PH+ G+LASS +L S Y++++ Q S C
Sbjct: 196 VFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL------SVYNYTDFDKQIGESAGPECK 249
Query: 246 KVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMT--DYPTPS 303
++ + K ++ G L+ + A + LA + A Y P
Sbjct: 250 AALQETTKLVD------GQLQSGRNAVKQLFGASTLANDGDFLFLLADAAAIAFQYGNPD 303
Query: 304 NFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--NYSGTAKCFD---LNGDSD 358
+P+ ++ K G D+ YY + + +D L +
Sbjct: 304 ALCSPI-----------VEAKKNGTDLVETFARYVKDYYIGTFGASVASYDQEYLKNTTP 352
Query: 359 PHGLSE---WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITT 415
P S W +Q C+E+ DS+ ++ D C+ +G P+ T
Sbjct: 353 PPAESAYRLWWYQVCSEVAYFQVAPKNDSV-RSAKIDTRYHLDLCRNVFGEGVYPDVFMT 411
Query: 416 E--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE------------ 461
+GG +I S I+F NG +DPW K+ + L+E
Sbjct: 412 NLYYGGTRIA------GSKIVFANGSQDPWRHASKQKSSKELPSYLIECSNCGHCSDLSG 465
Query: 462 -KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ H++ S P+ + VR++ V+ I W+S+
Sbjct: 466 CPQAPSHIEGDSSNCSSPEAVNKVRKQIVDHIDLWLSE 503
>gi|71986744|ref|NP_500596.2| Protein F19C7.4 [Caenorhabditis elegans]
gi|373219711|emb|CCD69715.1| Protein F19C7.4 [Caenorhabditis elegans]
Length = 542
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 122/475 (25%), Positives = 205/475 (43%), Gaps = 52/475 (10%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI------EWFAQNT 112
TQ +DHF+ N + +QQ Y N + +K +F+ G E I W
Sbjct: 54 TQKVDHFS-NGTNIGVWQQHYQYNWKFY--NKTTGYVFLMIGGESSINKTNGDRWIRHEG 110
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
M +F+A +EHR+YG S Y I + ++ L+ QALAD I
Sbjct: 111 ETMMKWVAEFQAAAFQVEHRFYG-SKEY---SPIGDQTTASMKLLTIDQALADIKEFITQ 166
Query: 173 LKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+ D P+ V FGGSY G L+A+FR YP + GA++SS+ + F V Y ++
Sbjct: 167 INALYFKDDKPIWVTFGGSYPGSLSAFFRETYPEMTAGAVSSSSAVHVF---VDYYGYAI 223
Query: 232 IITQDFRSVSENCYKVIKGSWKQ-IEETAKKPGGLEKLQKAFRICKS--EKNL--AIESW 286
+ +R+VS++C VIK ++++ I + L++ F +C S E NL A++ +
Sbjct: 224 NTEKTYRTVSDSCGDVIKVAFQKLITKAYNGSDSRALLKQQFNLCDSFDETNLSKAVQFF 283
Query: 287 LSTAFVYTAMTDYPTPSNFLNPL-PAFPVKEMCKAIDDPKTGNDVFAKLYGAASVY---- 341
+ Y + + T N N V C +++ G++V ++ ++Y
Sbjct: 284 FQNVYGYFQIINQYTGDNKSNATRSGLGVPAACDLLNNATIGDEV-QRVIAVMNLYDSWF 342
Query: 342 ---------YNYSGTAKCFDLNG--DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESE 390
NY+ + + + + G W WQ CTE+ D + S
Sbjct: 343 KPSASGCRPNNYTAFIQAYSDTTMPNENVIGTRSWIWQTCTELGYYQTTDGGNGGIFGST 402
Query: 391 EDYDARARYCKEAYGVD-------PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
D A C + +G + + + T++GG +N++F NG DPW
Sbjct: 403 VPLDFFADQCIDLFGPEYTLDNTFKLVDQVRTKYGGAG-----TYRGTNVVFPNGSFDPW 457
Query: 444 SGGGV-LKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+G G N + V A + + +H D+ ++ D Q LKD R R + +W+S
Sbjct: 458 NGLGYKWNNTNNNVDAWLIEGTSHCADMYPASDSDKQSLKDARIRIHGHLSRWLS 512
>gi|195497936|ref|XP_002096312.1| GE25600 [Drosophila yakuba]
gi|194182413|gb|EDW96024.1| GE25600 [Drosophila yakuba]
Length = 480
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 76/226 (33%), Positives = 115/226 (50%), Gaps = 17/226 (7%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+ K+ TQ LD N++ + QT+ RYL+ND +PIF+Y G E +IE + G
Sbjct: 55 QEKWITQKLD--NFDASNSQTYPMRYLVNDEF---QTEGSPIFIYLGGEWEIENSMVSAG 109
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
YD+A + K +LV+ EHRYYG+S+P + + YL QALAD A I
Sbjct: 110 HWYDMAEEHKGVLVYTEHRYYGQSVP------TSTMSTDNLKYLDVKQALADVAVFIETF 163
Query: 174 K-KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
K +N ++S V++ GGSY + WF+ YP + +G ASSAP+L V + +
Sbjct: 164 KAENPQLSNSKVILAGGSYSATMVVWFKRLYPELIVGGWASSAPLL---AKVDFTEYKEV 220
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS 277
+ Q F + + CY I+ ++E G E + R+C S
Sbjct: 221 VGQAFLQLGGQKCYDRIENGIAELESMFANKRGAEA-KAMLRLCNS 265
>gi|323453612|gb|EGB09483.1| hypothetical protein AURANDRAFT_24425 [Aureococcus anophagefferens]
Length = 266
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/194 (34%), Positives = 96/194 (49%), Gaps = 20/194 (10%)
Query: 95 IFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT 154
++ YTGNE +E + NTG M++ + ALLV+ EHRY G S+P
Sbjct: 2 VWFYTGNESPVEEYVNNTGLMWETGAELGALLVWAEHRYEGGSVPACAGLR------DCL 55
Query: 155 GYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
Y S QALADYA +I L+ + D P V GGSYGGML++WFR KYP +GA+A S
Sbjct: 56 AYASVEQALADYAVVIDALRAEVG--DVPFVAVGGSYGGMLSSWFRFKYPTAVVGAIAGS 113
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVS-----ENCYKVIKGSWKQIEETAKKPG------ 263
AP+ F P S + + + C ++ +W +E+ A+
Sbjct: 114 APVWGFPLDAPPLDGSAVAISRGAGFAGGLPDDRCASNLRAAWPVLEDFARTASPAADGT 173
Query: 264 -GLEKLQKAFRICK 276
LE + AF +C+
Sbjct: 174 TALEAVAAAFDLCE 187
>gi|218184873|gb|EEC67300.1| hypothetical protein OsI_34290 [Oryza sativa Indica Group]
Length = 524
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 117/458 (25%), Positives = 188/458 (41%), Gaps = 71/458 (15%)
Query: 72 YQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEH 131
++ F QRY ++ K PIF+Y E + ++ +A KF A +V EH
Sbjct: 86 HRQFNQRYYEFLDYYRAPK--GPIFLYICGESSCNGIPNS--YLAVMAKKFGAAVVSPEH 141
Query: 132 RYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IIDLKKNLTATDSPVV 185
RYYGKS P+ + + +N +LSS QAL D A ++ K N + DS
Sbjct: 142 RYYGKSSPF---ESLTTENLR---FLSSKQALFDLAVFRQYYQETLNAKYNRSGADSSWF 195
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCY 245
VFGGSY G L+AWFRLK+PH+ G+LASS +L S Y++++ Q S C
Sbjct: 196 VFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL------SVYNYTDFDKQIGESAGPECK 249
Query: 246 KVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMT--DYPTPS 303
++ + K ++ G L+ + A + LA + A Y P
Sbjct: 250 AALQETTKLVD------GQLQSGRNAVKQLFGASTLANDGDFLFLLADAAAIAFQYGNPD 303
Query: 304 NFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--NYSGTAKCFD---LNGDSD 358
+P+ ++ K G D+ YY + + +D L +
Sbjct: 304 ALCSPI-----------VEAKKNGTDLVETFARYVKDYYIGTFGASVASYDQEYLKNTTP 352
Query: 359 PHGLSE---WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITT 415
P S W +Q C+E+ DS+ ++ D C+ +G P+ T
Sbjct: 353 PPAESAYRLWWYQVCSEVAYFQVAPKNDSV-RSAKIDTRYHLDLCRNVFGEGVYPDVFMT 411
Query: 416 E--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE------------ 461
+GG +I S I+F NG +DPW K+ + L+E
Sbjct: 412 NLYYGGTRIA------GSKIVFANGSQDPWRHASKQKSSKELPSYLIECSNCGHCSDLSG 465
Query: 462 -KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ H++ S P+ + VR++ V+ I W+S+
Sbjct: 466 CPQAPSHIEGDSSNCSSPEAVNKVRKQIVDHIDLWLSE 503
>gi|110289392|gb|ABB47879.2| prolyl carboxypeptidase like protein, putative, expressed [Oryza
sativa Japonica Group]
Length = 490
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 124/479 (25%), Positives = 201/479 (41%), Gaps = 81/479 (16%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ ++ Q LDHF+ P ++ F+QRY D H GG P+F+ E +
Sbjct: 51 EERWMDQRLDHFS--PTDHRQFKQRYYEFADYHAGG----GPVFLRICGESSCNGIPND- 103
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL--- 169
++ ++ KF A +V EHRYYGKS P+ + + +N +LSS QAL D +
Sbjct: 104 -YLAVLSKKFGAAVVTPEHRYYGKSSPF---ESLTTENLR---FLSSKQALFDLVAFRQH 156
Query: 170 ---IIDLKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
I++ + N ++ D+P VFG SY G L+AWFRLK+PH+ G+LASS +L +
Sbjct: 157 YQEILNARYNRSSGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGVVL------A 210
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES 285
Y+F++ Q S C ++ + ++E + K+ K++ +
Sbjct: 211 VYNFTDFDKQVGDSAGPECKAALQEVTRLVDEQLRLDSRSVKVLFGAEKLKNDGDFLFFL 270
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG-----------NDVFAKL 334
+ A + Y +P +PL I+ KTG D F +
Sbjct: 271 ADAAAIGF----QYGSPDAVCSPL-----------INAKKTGRSLVETYAQYVQDFFIRR 315
Query: 335 YGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYD 394
+G Y+ + N D W +Q C+E+ DSI +E +
Sbjct: 316 WGTTVSSYDQE-----YLKNTTPDDTSSRLWWFQVCSEVAYFQVAPKNDSI-RSTEINTG 369
Query: 395 ARARYCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
C+ +G P+ T +GG +I AS I+F NG +DPW K+
Sbjct: 370 YHLDLCRNVFGEGVYPDVFMTNLYYGGTRIA------ASKIVFTNGSQDPWRHASKQKSS 423
Query: 453 SKTVLALVE-KEGAHHVDLR------FSTKEDPQ------WLKDVRRREVEIIGKWISQ 498
+++ + H DLR F + DP + VR++ I W+SQ
Sbjct: 424 KYMPSYIIKCRNCGHGTDLRGCPQLPFRIEGDPSNCSSPAAVSTVRKQIASHISLWLSQ 482
>gi|432101950|gb|ELK29783.1| Thymus-specific serine protease [Myotis davidii]
Length = 399
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 103/314 (32%), Positives = 154/314 (49%), Gaps = 47/314 (14%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LD FN + + ++F QRY I+D HW + + P+F++ G EG + + G +A
Sbjct: 62 QPLDPFNASDR--RSFLQRYWISDQHW--ASRDGPVFLHLGGEGSLGPGSVMRGHPAALA 117
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
P AL++ +EHR+YG SIP GG N + +LSS ALAD S ++L + L
Sbjct: 118 PALGALVIGLEHRFYGLSIPAGG------LNTTQLRFLSSRHALADVVSARLELSRLLNV 171
Query: 180 TDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---ILNFDNIVSPYSFSNIITQ 235
+ S V FGGSY G LAAW RLK+PH+ A+ASSAP +L+F +++ ++T
Sbjct: 172 SSSSRWVCFGGSYAGSLAAWARLKFPHLLFAAVASSAPVRAVLDFS------AYNEVVTH 225
Query: 236 DFRSVSENCYKVIKG-SWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYT 294
S+ + C + + Q+ T G+ Q ++ C T F +
Sbjct: 226 ---SLGQRCLSFSRAETLAQLRVTEPPVSGVGDRQWLYQTC-------------TEFGFY 269
Query: 295 AMTDYPTPSNFLNPLPAFPVK-EMCKAIDDPKTGNDVFAKLYGAASVYYNYS---GTAKC 350
+ PT + LPA P + E+C+ + T + V A + YY GT
Sbjct: 270 VTCEDPTCP--FSKLPALPSQLELCEQVFGLSTWSTVQA--VAQTNSYYGGQTPVGTQVL 325
Query: 351 FDLNGDSDP-HGLS 363
F +NGD DP H LS
Sbjct: 326 F-VNGDMDPWHVLS 338
>gi|198455509|ref|XP_001360028.2| GA17653 [Drosophila pseudoobscura pseudoobscura]
gi|198133275|gb|EAL29180.2| GA17653 [Drosophila pseudoobscura pseudoobscura]
Length = 485
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 121/463 (26%), Positives = 204/463 (44%), Gaps = 50/463 (10%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T++ T LD N+N + T++ R LIN+ H+ + +PIF+Y G E +IE A +G
Sbjct: 55 ETRWFTLKLD--NFNAANNATWKDRVLINEDHF---TDGSPIFIYLGGEWEIEPSAITSG 109
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
D+A + L++ EHR++GKS P I + Y S QALAD +I L
Sbjct: 110 LWVDIAKEHNGSLIYTEHRFFGKSFP------ITPLSTKNLKYQSVQQALADVVHIIKTL 163
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K DS V+V G SY +A W R YP + +G+ ASSAP+ + V + ++
Sbjct: 164 KLEDKYKDSKVIVSGCSYSATMATWIRKLYPDIILGSWASSAPL---EAKVDFKDYMEVV 220
Query: 234 TQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL---S 288
Q F + + CY +I + ++ + G K +K +C + N + W +
Sbjct: 221 GQAFEQLGGKYCYDLIDNATSYYQDLFEGGQG-AKAKKELNLCANFNVNSKQDRWQIFST 279
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND-VFAKLYGAASVYYNYSGT 347
A V+ + Y P N+ LP + + DD + + V +L + Y GT
Sbjct: 280 IANVFAGLAQYQKPGNY--DLPKYCSVLRSFSDDDAEALSKFVQWRLGYPTCLSVTYKGT 337
Query: 348 A---KCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEES-------EEDYDARA 397
K +N + D GL W +Q C+E D+++ F S + +D
Sbjct: 338 VNYYKWAKINYEDD-SGLP-WIYQTCSEFGWYQSSDSENQPFGSSFPATLYTDTCHDV-- 393
Query: 398 RYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS--GGGVLKNISKT 455
+ K ++ N T I + +K N+ + G DPWS G G+ +
Sbjct: 394 -FSKNYTLINIEANIAATNKDFQGIDIAVK----NVYWTQGGLDPWSKVGAGIAQG---- 444
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
A + + +H DL + D L+ + + +++ W++Q
Sbjct: 445 --ATIIPQASHCSDLDSISANDSPELQASKLKLAQLVRDWLAQ 485
>gi|308489478|ref|XP_003106932.1| hypothetical protein CRE_17219 [Caenorhabditis remanei]
gi|308252820|gb|EFO96772.1| hypothetical protein CRE_17219 [Caenorhabditis remanei]
Length = 541
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 123/467 (26%), Positives = 200/467 (42%), Gaps = 57/467 (12%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQN 111
+ K TQ LDHF+ P + +T+ Q+Y N S+NN+ IF+ G EG + +W A
Sbjct: 55 QVKNFTQKLDHFD--PYNTKTWNQKYFYNPKF---SRNNSIIFLMIGGEGPENGKWAANP 109
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
A +F A + +EHR++G S P I ++ YL++ QALAD A I
Sbjct: 110 DVQYLQWAAEFGADVFDLEHRFFGDSWP------IPDMTTNSLRYLTTQQALADLAYFIE 163
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYS 228
+ + + V FGGSY G L+AWFR KYP + +G++ASSAP+ L+F Y
Sbjct: 164 SMNQLYGFKNPRWVTFGGSYPGSLSAWFRQKYPQLTVGSVASSAPVNLKLDF------YE 217
Query: 229 FSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI-----CKSEKNLAI 283
++ ++ D + NC +K ++ +I++ + G +L F + K+ K L I
Sbjct: 218 YAMVVEDDLKLTDVNCAPAVKDAFTKIQKLSLTAEGRNQLNGYFNLQPPFDGKTTK-LDI 276
Query: 284 ESWLSTAF-VYTAMTDYPTPSNFLNPLPAFPVKEMCKAI------DDPKTGNDVFA---- 332
++ F + MT Y + V++MC+ + D K ++F
Sbjct: 277 NNFFGNLFNTFQGMTQYTYDGQSNSTHSDKTVRKMCQIMTNTSEPDTVKRVENLFLWFNV 336
Query: 333 -------KLYGAASVYYNYSGTAKCFDLNG-DSDPHGLSEWGWQACTEMIMLTGGDNKDS 384
L + Y++ DL D W W C E+ L + ++
Sbjct: 337 MEPADPDHLTVMPNSYWDVIKQVGSGDLKVLGPDGAAARGWMWLCCNEIGFLQTTNQGNN 396
Query: 385 IFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHK-----IGLVLKRFASNIIFFNGL 439
+F S + C + +G + I G+K G A+N++ NG
Sbjct: 397 VF-GSGVTLNLFIDMCTDMFGDTMK---IKQIMAGNKKSQNYYGGADFYNATNVVLPNGS 452
Query: 440 RDPWSGGGVLKNI-SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
DPW G K + S+ +L + AH D+ S +P L R
Sbjct: 453 LDPWHALGTYKTVESQALLPYLINGTAHCGDMYPSYDGEPNSLPAAR 499
>gi|195109614|ref|XP_001999378.1| GI23097 [Drosophila mojavensis]
gi|193915972|gb|EDW14839.1| GI23097 [Drosophila mojavensis]
Length = 691
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 119/464 (25%), Positives = 197/464 (42%), Gaps = 58/464 (12%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T + Q LDHFN +T+Q RYL+ND + K P+F+Y G E I + G
Sbjct: 265 QTLWIEQKLDHFN--DSETRTWQMRYLLNDVFF---KAGGPMFIYLGGEWAISKGRISEG 319
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
MYD+A + LL + EHRYYG+S P + + + +L QALAD A I
Sbjct: 320 HMYDMAKEHNGLLAYTEHRYYGESHP------LPDLSNDSLQFLHVKQALADLAHFIKTQ 373
Query: 174 KKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
K + +DS V++ GGSY + WF+ YP + G ASSAP+ + V + I
Sbjct: 374 KASYKGLSDSKVIIVGGSYSAAMVTWFKRTYPDLVAGGWASSAPLFAKVDFV---EYKEI 430
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
Q + +CY I+ ++E G E ++ ++C+ + S L
Sbjct: 431 AGQSIVLMGGSDCYNRIQKGIAEMEAMFANKRGSE-VKALLKLCEP---FDVYSDLDVWN 486
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSG--TAK 349
+++ ++D S + ++ +C+ I G+D L G + + G T+K
Sbjct: 487 LFSEISD--IFSGVVQTHNTGQIEGVCQKI--MAEGSD----LVGLSKFLLSEFGESTSK 538
Query: 350 CFDLNGDSDPHGLS----------EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
C DL+ ++ LS +W +Q C E + F ++
Sbjct: 539 CNDLSYNAMIDTLSDTRYSGSVRRQWLFQTCNEYGWYQTSGSASQPF-GTKFPVTFYTTM 597
Query: 400 CKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
C + YG ++ R FGG L N+ F +G DPW G+
Sbjct: 598 CADLYGPQFSKSFIEARAAETNEYFGG------LTPKVENVYFSHGQLDPWRAMGIQDEK 651
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
T++ + AH DL + +D ++ + R E++ +W+
Sbjct: 652 QATIIP----KHAHCADLGSISIDDTAEMRASKERVAELVREWL 691
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 72/218 (33%), Positives = 107/218 (49%), Gaps = 20/218 (9%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T + Q LDHFN +T+Q RY++ND + K P+F+Y G E I + G
Sbjct: 49 QTLWIEQKLDHFN--DSETRTWQMRYMLNDVFF---KAGGPMFIYLGGEWAISKGRISEG 103
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
MYD+A + LL + EHRYYG+S P +++ +N YL QALAD A I
Sbjct: 104 HMYDMAKEHNGLLAYTEHRYYGESHPL---PDLSNENLR---YLHVKQALADLAHFITTQ 157
Query: 174 KKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIV-------- 224
K + +DS V++ GGSY + WF+ YP + G ASSAP+ + V
Sbjct: 158 KASYEGLSDSKVIIVGGSYSATMVTWFKRTYPDLVAGGWASSAPLFTKVDFVVKVKSTTM 217
Query: 225 SPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKP 262
+ Y IIT S+ + + ++K++ E P
Sbjct: 218 THYGVGVIITLLALGFSQANGNIFERTFKKLHEEPYVP 255
>gi|410040428|ref|XP_003950808.1| PREDICTED: thymus-specific serine protease [Pan troglodytes]
Length = 541
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 119/512 (23%), Positives = 214/512 (41%), Gaps = 87/512 (16%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +ND HW G + PIF+ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVSDR--RSFLQRYWVNDQHWVG--QDGPIFLLLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + S+ + +
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLRFLSSRLAMGKSSG 162
Query: 163 ---------------LADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLK---- 202
LAD S + L + ++ SP + FGGSY G LAAW RLK
Sbjct: 163 IPSDEDRPSPPFDPRLADVVSAHLALSRLFNISSSSPWICFGGSYAGSLAAWARLKVLRL 222
Query: 203 --YPHVAIGALASSAP---ILNFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWK 253
+PH+ ++ASSAP +L+F +++++++ S S C + ++
Sbjct: 223 LRFPHLIFASVASSAPVRAVLDFSE------YNDVVSRSLMSTAIGGSLECRAAVSVAFA 276
Query: 254 QIEETAKKPGGLE-KLQKAFRIC--------KSEKNLAIESWLSTAFVYTAMTDYPTPSN 304
++E + G + LQ C ++E A+++ + Y T P
Sbjct: 277 EVERRLRSGGAAQAALQTELSACGPLGRAENQAELLGALQALVGGVVQYDGQTGAP---- 332
Query: 305 FLNPLPAFPVKEMCKAIDDP---KTGNDVFAKLYGAASVYYNYSGTAKCFDLN------- 354
V+++C + ++ + + L A + + G KC +
Sbjct: 333 -------LSVRQLCGLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQ-KCLSFSRAETVAQ 384
Query: 355 -GDSDPH----GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG---- 405
++P G +W +Q CTE +N F + ++ C++ +G
Sbjct: 385 LRSTEPQLSGVGDRQWLYQTCTEFGFYVTCENPRCPFSQLPA-LPSQLDLCEQVFGLSAL 443
Query: 406 -VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG 464
V + +GG G A+ ++F NG DPW V + + + L+ + G
Sbjct: 444 SVAQAVAQTNSYYGGQTPG------ANKVLFVNGDTDPWHVLSVTQALGSSESTLLIRTG 497
Query: 465 AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+H +D+ D L+ R+ + + W+
Sbjct: 498 SHCLDMAPERPSDSPSLRLGRQNIFQQLQTWL 529
>gi|326436306|gb|EGD81876.1| thymus specific serine peptidase [Salpingoeca sp. ATCC 50818]
Length = 500
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 118/458 (25%), Positives = 187/458 (40%), Gaps = 80/458 (17%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG-----DIEWFAQ 110
+Y TQ DHF+ + T+QQ Y +NDT W G N AP+F+ G EG + +
Sbjct: 46 RYFTQWQDHFDGT--NVNTWQQAYYVNDTFWKGDAN-APVFLCVGGEGPPIDGSVVVSSV 102
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD----- 165
+ ++ P+ A++ +EHRYYG K +LSS QALAD
Sbjct: 103 HCNGAVEMLPETGAIMFAVEHRYYGCHNMSACPVTSFLKPKDALRFLSSRQALADLAGFH 162
Query: 166 -YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIV 224
YA+ LK + V FGGSY GMLA WFRLK+PH+ ++ASSAP+ IV
Sbjct: 163 AYATATYGLKPT-----NKWVSFGGSYPGMLAGWFRLKFPHLVHASVASSAPV---QAIV 214
Query: 225 SPYSFSNIITQDFRSVSEN-------CYKVIKGSWKQIEETAKKPGGLEKLQKAF----R 273
+++++ + + +VS N C K I I + G +L F +
Sbjct: 215 DMVGYNDVVAEAY-AVSNNNVGGSPACRKAIADGHAMIGQMFSSDSGRTRLANLFGHNAK 273
Query: 274 ICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV--- 330
+++ N A + A+ + A + P ++ PA + +C + + G++V
Sbjct: 274 WYENKLNQASFAGFGVAY-FPAQGNDPACTD-----PACNIGRICAVMTNTSLGDEVSRL 327
Query: 331 --------------FAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEM-IM 375
F + S+ + A+ D W WQ CTE
Sbjct: 328 AAIRNMQDEWLSQPFETVNRKHSLMHAAGNDAELPDF-----------WSWQVCTEFGFF 376
Query: 376 LTGGDNKDSIFEESEEDYDARARYCKEAYGVDP---RPNWITTE--FGGHKIGLVLKRFA 430
T F + + ++ +C +G+ R N + +GG G
Sbjct: 377 QTCEVGSKCFFTQGYDTLQSQMEFCSAVFGIPATKVRQNIADSNLYYGGRNSG------G 430
Query: 431 SNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHV 468
S +I+ NG DPW +L + + V L +HH
Sbjct: 431 SCLIYPNGEVDPWHAQSILNSTNPNVKTLWVPGASHHA 468
>gi|146425171|emb|CAM84574.1| intestinal prolyl carboxypeptidase 2 [Haemonchus contortus]
Length = 1143
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 122/477 (25%), Positives = 197/477 (41%), Gaps = 49/477 (10%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQ 110
Y+ Y TQ +DHFN N Y TF+QRY ND W +K N PIF+ G E D W
Sbjct: 602 YEAGYFTQPVDHFN-NKNPY-TFEQRYFKND-QW--AKPNGPIFLMIGGESERDSSWVLN 656
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
A +F A + +E RYYGKS + K + T YLSS Q L D A+ I
Sbjct: 657 ENLTYLKWADEFGATVYALEXRYYGKSDLFDSLDPAVSKKNTYTTYLSSLQMLYDVANFI 716
Query: 171 --IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVS 225
+D ++ ++FGGSY G LA W R +P + GA+ SSAP+ L+F
Sbjct: 717 RAVDAER---GQHGKWIMFGGSYAGSLALWMRRLFPDLVYGAIGSSAPLEAKLDF----- 768
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES 285
Y + ++ + RS SE+C I + I + G +L + F++ +++
Sbjct: 769 -YDYYQVVEKSIRSHSEDCAYAIAEGFDDIRQRLLTEKGRAQLTEIFKLNPPWDDVSDVF 827
Query: 286 WLSTAFVYTAMTDYPTPS------NFLNPLPAFPVKEMC--------KAIDDPKTGNDVF 331
+ F + + D + N + + + +MC K I N+
Sbjct: 828 EIDKQFFISNLVDMFASAVQYSGDNRGHYAHGYGIPDMCRIMTKQGRKPISSIAAFNEYM 887
Query: 332 AKLYGA----ASVYYNYSGTAKCF---DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDS 384
++ S++ +Y + + + + W WQ CTE D+ S
Sbjct: 888 TNMFTGDTEFESMFNSYDDLKRLLYKAQFSTNPKEAAGTLWLWQTCTEFGFYQTTDSGYS 947
Query: 385 IFEESEEDYDARARYCKEAYGV----DPRPNWITTEFGGHKIGLVLKRFASNIIFF-NGL 439
+F + + + C + +G+ + N T + G A ++ +G
Sbjct: 948 LF-GNLLPLNFYTQLCSDVFGLKTSYSAKBNRRATLSANKRYGGRFNYGADPMVVMTHGS 1006
Query: 440 RDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
DPW+ G + ++ K AH ++ + +D Q LKD R R I+ WI
Sbjct: 1007 LDPWNALGNITCDPADKCFMI-KGTAHCAEMYPARDKDEQDLKDTRERIRGILKSWI 1062
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 106/449 (23%), Positives = 179/449 (39%), Gaps = 52/449 (11%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQN 111
+ Y Q LDH + + T+ QRY + ++ K F+ G G DI W
Sbjct: 50 REAYFKQKLDHTKDDGEG--TWPQRYFYSQRYY--RKGGNVFFLMLGGMGVMDIGWVTNE 105
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI- 170
+ A L +EHR+YGKS P ++ +N + YL+ QA+ D A+ I
Sbjct: 106 KLPFVQWGKERGAQLYALEHRFYGKSRP---TPNLSVRNLA---YLTIDQAIGDVANFIK 159
Query: 171 -IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
++ K + D+ +VFGGSY LA W R KYP++ GA+ASS + + F
Sbjct: 160 EMNAKHRIXDEDAKWIVFGGSYAASLALWARQKYPNLIAGAVASSPLMRPRFDFWEGTQF 219
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI-----CKSEKNLAIE 284
+ I +R C + I+ +++Q+ + G ++ + + +N+ +
Sbjct: 220 AEDI---YRKTDATCAENIEIAFQQLADMLGSERGRSQVSELLKTKPRFWTAEHRNIQLL 276
Query: 285 SWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYN 343
+ + +A+ P N + +C ++D + GA +
Sbjct: 277 TSIQLNNFISAVQFRAGPY-MQNGTSLNNTEAVCTVMNDQSLDQITALXHINGARVLQSK 335
Query: 344 Y-----SGTAKCFD------LNGDSDPHGLSEWG----WQACTEM-IMLTGGDNKDSIFE 387
Y T +D L D D G + WQ CTE+ LT +SIF
Sbjct: 336 YLHDMPENTPADYDALLKYLLQKDFDEEGWASVDRASLWQRCTEIGTFLTTDGAINSIF- 394
Query: 388 ESEEDYDARARYCK------EAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRD 441
S D A C+ +A ++ T ++GG + +N++ NG D
Sbjct: 395 GSLVSIDFYADLCQVFGEEFDAQHIERAVAATTLKYGGAHM-----YKGTNVVIANGGAD 449
Query: 442 PWSGGGVLKNISKTVLALVEKEGAHHVDL 470
P + +I TV+ V K+ H D+
Sbjct: 450 PLHVLSKITSIDPTVVTYVVKDSFHCGDM 478
>gi|324509651|gb|ADY44052.1| Serine protease, partial [Ascaris suum]
Length = 526
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 123/482 (25%), Positives = 194/482 (40%), Gaps = 69/482 (14%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI--EWFAQNTGFMYD 117
Q LDHFN +T++QR N + N + +FV G E I +W M
Sbjct: 68 QPLDHFN--KSDTRTWEQRVQYNPMFY---NNQSVVFVLIGGESMINQKWVGNENVSMMQ 122
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
A +F A +EHR++G S P+ + Y ++ QALAD A I +
Sbjct: 123 WAKEFGAAAFQLEHRFFGYSRPF---PLVLTMTTEALVYCTTEQALADLAEFIQQMNAKY 179
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNIIT 234
+ + V FGGSY G L+AWFR KYP + +GA+ASSAP+ L+F Y +S ++
Sbjct: 180 SFVNPRWVTFGGSYPGSLSAWFRSKYPQLTVGAVASSAPLNLKLDF------YEYSMVVE 233
Query: 235 QDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC-----KSEKNLAIESWLST 289
R C+ ++ + IE+ G ++L + F + S L + +++S
Sbjct: 234 NVLRETDPECHWRVENAIAYIEKIMLTSTGRQQLNQVFNLQPPFDEASVTPLTLHNFMSN 293
Query: 290 AF-VYTAMTDYPTPSNFLNPLPAFPVKEMC---------------KAIDD------PKTG 327
+ ++ + Y + + V+ +C +A+ D P+TG
Sbjct: 294 LYTMFQGIVQYTYDGRNEHTMGGMNVRNLCNTVTKAPADEPLQQMRAVMDFVNSFYPQTG 353
Query: 328 NDV-------FAKLY-GAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGG 379
N V FA Y S+Y N + + +D W W C E+ L
Sbjct: 354 NCVGNVSACTFANSYEDIISLYGNVT-------YDESTDNAAYRGWMWLCCNEIGFLQTT 406
Query: 380 DNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKI----GLVLKRFASNIIF 435
D +IF E + C + +G P N T G G A+N+I
Sbjct: 407 DQGKNIFGEM-LPLNFYIDMCTDLFG--PSVNIETIAKGNAAAQKYYGRAEHYKATNVIL 463
Query: 436 FNGLRDPWSGGGVLKNISKT-VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGK 494
NG DPW G T + L+ AH D+ + ++P L R + E +
Sbjct: 464 PNGSLDPWHALGTYTEDKTTHQIPLLINGTAHCADMYPAYPDEPASLPAAREKIKEELAY 523
Query: 495 WI 496
+I
Sbjct: 524 YI 525
>gi|325186495|emb|CCA21035.1| lysosomal ProX carboxypeptidase putative [Albugo laibachii Nc14]
Length = 263
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 76/253 (30%), Positives = 121/253 (47%), Gaps = 20/253 (7%)
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAM 296
+ NC + I+ +W + A+ G +L K F +C+ +N + WL AF AM
Sbjct: 6 APNCAEKIRQAWPALFSMAENEPGRLQLAKIFHLCRPLQNETGIHHLALWLLNAFSVLAM 65
Query: 297 TDYPTPSNFLN----PLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSGTAKCF 351
+YP PS++L+ LPA+P++ C + D + + + + L+ A SV YN + C
Sbjct: 66 RNYPYPSSYLSNGEAQLPAWPMQSACSFLADQRPDSIALISSLFEAVSVLYNATKKMDCV 125
Query: 352 DLNGD-SDPHGLSEWGWQACTEMIMLT---GGDNKDSIFEESEEDYDARARYCKEAYGVD 407
DL D + G+ WG+ CTEM++ + +F ++C+ +G
Sbjct: 126 DLPRDMTSIDGI--WGFHYCTEMLLQETYFSSNGISDMFWNRTISAKFVQQHCQRVWGTK 183
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT--VLALVEKEGA 465
P P WI +G L ASNI+F NG+ DPW GV K+ + + L + A
Sbjct: 184 PDPEWIRIMYGDAD---TLLSAASNIVFTNGMLDPWRCCGVKKSQVRNNRIKVLKIENAA 240
Query: 466 HHVDLRFSTKEDP 478
HH+DL F +DP
Sbjct: 241 HHLDLFFHHVDDP 253
>gi|195569687|ref|XP_002102840.1| GD19284 [Drosophila simulans]
gi|194198767|gb|EDX12343.1| GD19284 [Drosophila simulans]
Length = 515
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 75/227 (33%), Positives = 120/227 (52%), Gaps = 17/227 (7%)
Query: 52 LYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN 111
+ +T + Q LDHF+ +T+Q RY++ND + K+ AP+F+Y G E +I
Sbjct: 44 VVQTLWIEQKLDHFD--AAETRTWQMRYMLNDALY---KSGAPLFIYLGGEWEISSGRIT 98
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
G +YD+A + ALL + EHRYYG+S P +++ +N YLS Q+LAD A I
Sbjct: 99 GGHLYDMAKEHNALLAYTEHRYYGQSKPL---PDLSNENIK---YLSVNQSLADLAHFIN 152
Query: 172 DLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
+K+N ++S V++ GGSY + WF+ YP + G ASSAP+L N V +
Sbjct: 153 TIKQNHEGLSESKVIIVGGSYSATMVTWFKKLYPDLVAGGWASSAPLLAKVNFV---EYK 209
Query: 231 NIITQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
+ Q + + CYK I+ ++E G E ++ ++C+
Sbjct: 210 EVTGQSIEQMGGSACYKRIENGIAEMETMIATKRGAE-VKALLKLCE 255
>gi|357146992|ref|XP_003574183.1| PREDICTED: probable serine protease EDA2-like [Brachypodium
distachyon]
Length = 489
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 126/485 (25%), Positives = 200/485 (41%), Gaps = 78/485 (16%)
Query: 48 DSQGLYKTK---YHTQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNEG 103
D G Y T+ + Q LDHF+ P ++ F+QR+ D H G P+F+ E
Sbjct: 41 DVAGRYLTREERWTNQRLDHFS--PTDHRQFKQRHFEFLDYHRAG----GPVFLRICGES 94
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
+ + ++ +A KF A +V EHRYYGKS P+ + + +N +LSS QAL
Sbjct: 95 SCDGIPND--YLAVLAKKFGAAVVTPEHRYYGKSSPF---ERLTTENLR---FLSSKQAL 146
Query: 164 ADYASL------IIDLKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP 216
D A ++ + N ++ D+P VFG SY G L+AWFRLK+PH+ G+LASS
Sbjct: 147 FDLAVFRQYYQDALNYRYNRSSGFDNPWFVFGVSYAGALSAWFRLKFPHLTCGSLASSGV 206
Query: 217 ILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
+L + Y+F++ Q +S C ++ + + +EE + K + K
Sbjct: 207 VL------AVYNFTDFDKQVGKSAGPECKAALQETTELVEEQLQSDSHSVKALFGAQTLK 260
Query: 277 SEKN--LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL 334
++ + + +T+F Y P +PL K G ++
Sbjct: 261 NDGDFLFLLADAAATSF------QYGNPDAVCSPL-----------TKAKKNGKNLLESY 303
Query: 335 YGAASVYY--NYSGTAKCFDL----NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEE 388
YY T +D N D W +Q C+E+ DS+
Sbjct: 304 AQFVRDYYIKKLGTTVSSYDQEYLKNTTPDDSSSRLWWFQVCSEVAYFQVAPKNDSV-RS 362
Query: 389 SEEDYDARARYCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
++ + CK +G P+ T +GG I AS I+F NG +DPW
Sbjct: 363 AKVNTRYNLDLCKNVFGEGVYPDVFMTNLYYGGTSIA------ASRIVFTNGSQDPWRHA 416
Query: 447 GVLKNISKTVLALVE-KEGAHHVDLR------------FSTKEDPQWLKDVRRREVEIIG 493
K+ L++ H DLR S P+ + VR++ + I
Sbjct: 417 SKQKSSEDMPSYLIKCSNCGHGTDLRGCPQLPFRIEGNSSDCTSPEAVSKVRKQIAKHID 476
Query: 494 KWISQ 498
W+SQ
Sbjct: 477 LWLSQ 481
>gi|320165589|gb|EFW42488.1| serine carboxypeptidase S28 [Capsaspora owczarzaki ATCC 30864]
Length = 491
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 140/500 (28%), Positives = 207/500 (41%), Gaps = 79/500 (15%)
Query: 30 PSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGS 89
P+S I S+L DSQ ++ Q++DHFN PQ +TF+QRY N ++
Sbjct: 36 PASLIQRRSASAL---GNDSQSVFN-----QLIDHFN--PQHRETFKQRYFENTDNF--D 83
Query: 90 KNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
N PIF+Y E + ++ ++ +F A +V +EHRYYG+S P+ A
Sbjct: 84 PVNGPIFLYICGEATCGGIPND--YIRVLSKQFNAAIVTLEHRYYGESSPF------AQL 135
Query: 150 NASTTGYLSSTQALADYASL-------IIDLK---KNLTATDSPVVVFGGSYGGMLAAWF 199
YL+S QA+ D A+ ++D++ + D+ +G SY G L+AWF
Sbjct: 136 TTPNLQYLTSRQAINDLAAFRDFYQHNVVDVRYAQQRAGRGDNLWFTYGVSYSGALSAWF 195
Query: 200 RLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEE-T 258
RLK+PH+ G+LASS + D ++S F +T RSV +C + + +E
Sbjct: 196 RLKFPHLTAGSLASSGVV---DVVLSFPEFDEQVT---RSVGSDCANALHAAMSGVEALL 249
Query: 259 AKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMT-DYPTPSNFLNPLPAFPVKEM 317
A P + L KA + NL S L+ + TA++ Y + PL
Sbjct: 250 AANPVATKVLFKATSL---SSNLDFLSMLADS---TALSVQYGHKDSMCPPL-------- 295
Query: 318 CKAIDDPKTGNDVFAKLYGAASVY-------YNYSGT-AKCFDLNGDSDPHGLSEWGWQA 369
+A + FA+ Y S Y ++YS K DS G +W +Q
Sbjct: 296 VQAFQAGQNMTLAFAQ-YVTTSFYTIFEVDPFSYSQEYLKQVQAGPDS---GARQWTYQT 351
Query: 370 CTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRF 429
C EM SI + D C+ +GV P T E+ G +
Sbjct: 352 CAEMGYFQVAPAGFSI-RSRQLTIDYYQSLCQNVFGVWPPVINATNEYYG------ARNI 404
Query: 430 ASNIIFF-NGLRDPWSGGG--VLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRR 486
AS FF NG +DPW V N + V H VD+R + Q D
Sbjct: 405 ASTQTFFTNGAQDPWQNVTLQVSNNPLRPTATAVCDNCGHGVDMRGCPQSPAQTNGDTSL 464
Query: 487 -----REVEIIGKWISQYFQ 501
V+ I I QY Q
Sbjct: 465 CKPDGSNVKAIQASIVQYLQ 484
>gi|79326354|ref|NP_001031795.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|51968458|dbj|BAD42921.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|332661231|gb|AEE86631.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 477
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 145/521 (27%), Positives = 218/521 (41%), Gaps = 80/521 (15%)
Query: 2 ATRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTK---YH 58
A F LS + S T+SN + P RI+ L SSK Y T+ +
Sbjct: 4 ALGFALLSIFAILLSLSTLSNGLLQP----RRIS----HGLTESSK-----YLTRDELWF 50
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
Q LDH Y+P ++ F+QRY H + PIF+ EG + ++ +
Sbjct: 51 NQTLDH--YSPSDHREFKQRYYEYLDHL--RVPDGPIFMMICGEGPCNGIPND--YITVL 104
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IID 172
A KF A +V +EHRYYGKS P+ K +A +N YLSS QAL D A+ ++
Sbjct: 105 AKKFDAGIVSLEHRYYGKSSPF---KSLATENLK---YLSSKQALFDLAAFRQYYQDSLN 158
Query: 173 LKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+K N + ++P FG SY G L+AWFRLK+PH+ G+LASSA + + Y F
Sbjct: 159 VKFNRSGDVENPWFFFGASYSGALSAWFRLKFPHLTCGSLASSAVVR------AVYEFPE 212
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
Q S C ++ + K +E GL+ +A + + L +++ F
Sbjct: 213 FDQQIGESAGPECKAALQETNKLLEL------GLKVNNRAVKALFNATELDVDA----DF 262
Query: 292 VYTAMTDYPTPSNFLNPLP-AFPVKEMCKAIDD-----PKTGNDVFAKLYGAASVYYNYS 345
+Y + NP P+ E K DD K + ++G +S Y+
Sbjct: 263 LYLIADAEVMAIQYGNPDKLCVPLVEAQKNRDDLVEAYAKYVREFCVGVFGLSSKTYSRK 322
Query: 346 GTAKCFDLNGDSDPHGLSE-WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
L+ P W +Q CTE+ DSI + + + CK +
Sbjct: 323 HL-----LDTAVTPESADRLWWFQVCTEVAYFQVAPANDSI-RSHQINTEYHLDLCKSLF 376
Query: 405 --GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA---L 459
GV P + +G +I A+ IIF NG +DPW K S L +
Sbjct: 377 GKGVYPEVDATNLYYGSDRIA------ATKIIFTNGSQDPWRHAS--KQTSSPELPSYIV 428
Query: 460 VEKEGAHHVDLRFSTKEDPQWL--KDVRRREVEIIGKWISQ 498
H DLR + P + VR+ V+ + W+S+
Sbjct: 429 TCHNCGHGSDLR-GCPQSPMVIGVNKVRQHIVDHMDLWLSE 468
>gi|402866199|ref|XP_003919573.1| PREDICTED: LOW QUALITY PROTEIN: thymus-specific serine protease
[Papio anubis]
Length = 501
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 108/459 (23%), Positives = 192/459 (41%), Gaps = 87/459 (18%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+S + L K + Q+LD FN + + ++F QRY +N+ HW G + PIF++ G E
Sbjct: 47 LSLGPGAAALPKVGWLEQLLDPFNVSDR--RSFLQRYWVNEQHWVG--EDGPIFLHLGGE 102
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
G + + G +AP + AL++ +EHR+YG SIP GG + + S+ + +
Sbjct: 103 GSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPAGGLEMAQLRFLSSRHAMGKSSG 162
Query: 163 ---------------LADYASLIIDLKKNL-TATDSPVVVFGGSYGGMLAAWFRLK---- 202
LAD S + L + ++ SP + FGGSY G LAAW RLK
Sbjct: 163 IPSDEDRPSSPSDPRLADVVSARLALSRLFNVSSSSPWICFGGSYAGSLAAWARLKVLXL 222
Query: 203 --YPHVAIGALASSAP---ILNFDNIVSPYSFSNIITQDFRSV----SENCYKVIKGSWK 253
+PH+ ++ASSAP +L+F +++++++ S S C + ++
Sbjct: 223 LRFPHLIFASVASSAPVRAVLDFSE------YNDVVSRSLMSTAIGGSLECRAAVSAAFA 276
Query: 254 QIEETAKKPGGLE-KLQKAFRIC--------KSEKNLAIESWLSTAFVYTAMTDYPTPSN 304
++E + G + L+ C ++E A+++ + A Y P
Sbjct: 277 EVERRLRLGGAAQAALRSELSACGTLGSAENQAELLGALQALVGGAVQYDGQAGAP---- 332
Query: 305 FLNPLPAFPVKEMCKAIDDP---KTGNDVFAKLYGAASVYYNYSGTAKCFDLN------- 354
V+++C + ++ + + L A + + G KC +
Sbjct: 333 -------LSVRQLCGLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQ-KCLSFSRAETVAQ 384
Query: 355 -GDSDPH----GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG---- 405
++P G +W +Q CTE N F + ++ C++ +G
Sbjct: 385 LRSTEPQLSGVGDRQWLYQTCTEFGFYVTCGNPRCPFSQLPA-LPSQLDLCEKVFGLSAL 443
Query: 406 -VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
V + +GG G A+ ++F NG DPW
Sbjct: 444 SVAQAVAQTNSYYGGQTPG------ANQVLFVNGDTDPW 476
>gi|322795214|gb|EFZ18036.1| hypothetical protein SINV_11633 [Solenopsis invicta]
Length = 413
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 80/235 (34%), Positives = 114/235 (48%), Gaps = 31/235 (13%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
TQ LDHFN+ +T+ RY N KN PI + G E I G MY++
Sbjct: 49 TQPLDHFNHREN--RTWSMRYKENSAFL---KNGGPILIMIGGEWQITDGYLQGGLMYEI 103
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
K+ L+ + EHR+YG+S P K+I+ +N YL++ Q LAD A I KK
Sbjct: 104 GVKYGGLMYYTEHRFYGQSKP---TKDISSENLQ---YLNADQGLADLAYFIETKKKEKN 157
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPH-----------------VAIGALASSAPILNFD 221
+S V+V GGSY G +AAW RLKYPH + GALASSAP+
Sbjct: 158 LENSTVIVVGGSYAGNMAAWARLKYPHLIQVTFALSLYQNWERYLKKGALASSAPVKAKA 217
Query: 222 NIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
+ Y F + + S+ C + +K ++ +EE K G +L+ F +C+
Sbjct: 218 DFYEYYEF---VAKSLGRHSKKCVENVKIAFASVEELLAKQCGAIELKYLFNLCE 269
>gi|403343430|gb|EJY71042.1| Serine carboxypeptidase S28 family protein [Oxytricha trifallax]
Length = 494
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 128/464 (27%), Positives = 189/464 (40%), Gaps = 60/464 (12%)
Query: 59 TQILDHFNYNPQS-YQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYD 117
TQI+DHF P + T+QQRY IND H+ K+N P+F+Y EG + + G+
Sbjct: 48 TQIVDHFPPTPTNDAATYQQRYFINDKHF--DKDNGPVFLYICGEGTCK-PPSDRGYPMQ 104
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI----IDL 173
+A +F A+ +EHR+YG S P A + YL++ QALAD A I +
Sbjct: 105 LAIEFGAMFYAVEHRFYGTSQP------TADWSTDNLKYLTAEQALADLAGFIDAQNAAI 158
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K V GGSY G L+AWF+ YP A A +SS IL + F I
Sbjct: 159 IKQYGGAARKWVTIGGSYPGALSAWFKQAYPDKAAAAWSSSGVILPIRDFT---DFDMDI 215
Query: 234 TQDFRSVSENCYKVIKGSWKQIE------ETAKKPGGLEKLQKAFRICKSEKNLAIESWL 287
Q C I+ QIE E K + L + F I K + N +
Sbjct: 216 FQATSRSGPECPAFIQSLTTQIETILKNQEQGKGGADFDYLCEVFGITK-DANFG-DFMF 273
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAF---PVKEMCKAIDDPKTGNDVFAKLYGAASVYYNY 344
A ++T Y L + K + G +F Y ++
Sbjct: 274 YVADIFTIGVQYGGRIELCQMLESIQFSTFKAQLPVLQQYAKGKGMFFGQYDRVAL---- 329
Query: 345 SGTAKCFDLNGDSDPHGLSEWGWQACTEM--IMLTGGDNKDSIFEESEEDYDARARYCKE 402
+ TA D N + +W WQ CTE + N S +D YC+
Sbjct: 330 AQTAYVTDNN-------MRQWTWQYCTEFGWFQIPAATNP---MRSSYIGHDYWVPYCEA 379
Query: 403 AYGV---DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV--LKN--ISKT 455
+G +P+ ++ ++GG I A NI F N + DPW G+ +KN K
Sbjct: 380 VFGAKIGEPKVDYYIQKYGGLDIK------ADNIFFANSIEDPWQYAGMRTVKNPETQKN 433
Query: 456 VLA-LVE-KEGAHHVDLRFSTKEDPQWLKDVRRR-EVEIIGKWI 496
++A L++ + H DL+ + D L R +I+ W+
Sbjct: 434 LVANLIDCNDCGHCQDLKTPSTGDAPALTIARANLRQQILITWL 477
>gi|388454936|ref|ZP_10137231.1| serine carboxypeptidase [Fluoribacter dumoffii Tex-KL]
Length = 467
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 120/464 (25%), Positives = 191/464 (41%), Gaps = 62/464 (13%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
K Y Q++DH N F QRY I++++ G + +AP+F Y E A N G
Sbjct: 46 KLGYFKQLIDH---NNPGTGNFYQRYYIDESY--GPEMDAPVFFYICGEAACSKRALN-G 99
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+ + A KF A LV +EHRYYG S+P+ + +L++ AL D A+ L
Sbjct: 100 AIRNYAQKFHAKLVALEHRYYGDSLPFNT------LSTEHLRFLTTEAALDDLAAFQRHL 153
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K N + V FGGSY G L+A++RLK+P++ +GALASSAP++ ++ + + +
Sbjct: 154 K-NERNWNGKWVAFGGSYPGSLSAYYRLKFPYLVVGALASSAPVMAKEDFI---EYDAHV 209
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVY 293
TQ C ++ + ++E + +++++ F S + ++ A
Sbjct: 210 TQ---VAGLKCAAQMREAVNEVEASLSDAAKWKEMKELFE--ASAVDDPVDFLYLIADTG 264
Query: 294 TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK-LYGAASVYYNYSGTAKCFD 352
A Y F L P P G FAK LY A +
Sbjct: 265 AAAVQYGMRDEFCTRLATSPT---------PLQGYAEFAKNLYKAMHINAVEMTAQGAMS 315
Query: 353 LNGDS--DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEE-DYDARARYCKEAYGVDPR 409
N + D G+ +W +Q+C E + S + D + C+ +G+
Sbjct: 316 ENPAAYKDGLGMRQWYYQSCKEYGYWQNAHPNPAFSTRSSLINLDYHHKICERLFGLTQP 375
Query: 410 PNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEG----- 464
N T E ++ SNI F NG DPWS L+L EK G
Sbjct: 376 VN--TEEINNTLYIPLMDTLTSNIYFTNGENDPWS-----------TLSLAEKNGNAINP 422
Query: 465 ----------AHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
AH DL + D L++ R+ ++ W+ +
Sbjct: 423 KLTYHLIQGAAHCDDLHSPSAIDSDSLREARKTMEILLANWLKK 466
>gi|195451235|ref|XP_002072826.1| GK13808 [Drosophila willistoni]
gi|194168911|gb|EDW83812.1| GK13808 [Drosophila willistoni]
Length = 445
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 124/469 (26%), Positives = 196/469 (41%), Gaps = 68/469 (14%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+ K+ Q LDHF+ + QT+Q RYL+ND + +PIF+Y G E IE + G
Sbjct: 20 EEKWIAQKLDHFD--ESNTQTYQMRYLVNDEF---QEEGSPIFIYLGGEWAIEDSMVSAG 74
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
YD+A + K +L++ EHRYYG+SIP + YL QALAD A I
Sbjct: 75 HWYDMAQEHKGVLIYTEHRYYGESIP------TTTMSTEHLQYLHVKQALADVAHFIETY 128
Query: 174 K-KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
K +N T+S V++ GGSY + WF+ YP + G ASSAP+L V + +
Sbjct: 129 KSENSQLTNSKVLLAGGSYSATMVVWFKRLYPDLVEGGWASSAPLL---AKVDFTEYKEV 185
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS--EKNLAIESWLST 289
+ + F + + CY I ++E + E + R+C S +KN ++ W
Sbjct: 186 VGRAFLQLGGQQCYNRINNGIAELESMFENNRAAEA-RAMLRLCSSFDDKN-DLDLW--- 240
Query: 290 AFVYTAMTDYPTPSNFLNPLPAF----PVKEMCK---AIDDPKTGNDVFAK-LYGAAS-V 340
T + + SN + + + ++ C + DD T FA +G S +
Sbjct: 241 -------TLFGSISNIFSGVAQYQRYGEIEYYCDFLLSFDDDATAIANFAYWAWGYPSCI 293
Query: 341 YYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
YSGT + + L + W +Q C E + + F C
Sbjct: 294 DARYSGTVEYY-LWSIENFSASRPWYYQTCNEYGWYQSSGSNNQPFGNKFPAL-LYTTLC 351
Query: 401 KEAYGVDPRPNWITTEFGGHKIGLVLKRF----------ASNIIFFNGLRDPWS--GGGV 448
+ +G T+F IGL + + NI +G DPW+ G GV
Sbjct: 352 ADVFG---------TQFTNENIGLSVSQTNIDFGGMAPEVENIYMTHGGLDPWNPMGHGV 402
Query: 449 LKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ A V +H D D ++ + R E++ +W++
Sbjct: 403 TEG------ATVITNASHCADFGSIDAGDTAEMRASKERLAELVREWLA 445
>gi|51970574|dbj|BAD43979.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
Length = 488
Score = 115 bits (287), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 145/531 (27%), Positives = 218/531 (41%), Gaps = 89/531 (16%)
Query: 2 ATRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTK---YH 58
A F LS + S T+SN + P RI+ L SSK Y T+ +
Sbjct: 4 ALGFALLSIFAILLSLSTLSNGLLQP----RRIS----HGLTESSK-----YLTRDELWF 50
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
Q LDH Y+P ++ F+QRY H + PIF+ EG + ++ +
Sbjct: 51 NQTLDH--YSPSDHREFKQRYYEYLDHL--RVPDGPIFMMICGEGPCNGIPND--YITVL 104
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IID 172
A KF A +V +EHRYYGKS P+ K +A +N YLSS QAL D A+ ++
Sbjct: 105 AKKFDAGIVSLEHRYYGKSSPF---KSLATENLK---YLSSKQALFDLAAFRQYYQDSLN 158
Query: 173 LKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+K N + ++P FG SY G L+AWFRLK+PH+ G+LASSA + + Y F
Sbjct: 159 VKFNRSGDVENPWFFFGASYSGALSAWFRLKFPHLTCGSLASSAVVR------AAYEFPE 212
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
Q S C ++ + K +E GL+ +A + + L +++ F
Sbjct: 213 FDQQIGESAGPECKAALQETNKLLEL------GLKVNNRAVKALFNATELDVDA----DF 262
Query: 292 VYTAMTDYPTPSNFLNPLP-AFPVKEMCKAIDD-----PKTGNDVFAKLYGAASVYYNYS 345
+Y + NP P+ E K DD K + ++G +S Y+
Sbjct: 263 LYLIADAEVMAIQYGNPDKLCVPLVEAQKNRDDLVEAYAKYVREFCVGVFGLSSKTYSRK 322
Query: 346 GTAKCFDLNGDSDPHGLSE-WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
L+ P W +Q CTE+ DSI + + + CK +
Sbjct: 323 HL-----LDTAVTPESADRLWWFQVCTEVAYFQVAPANDSI-RSHQINTEYHLDLCKSLF 376
Query: 405 --GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA---L 459
GV P + +G +I A+ IIF NG +DPW K S L +
Sbjct: 377 GKGVYPEVDATNLYYGSDRIA------ATKIIFTNGSQDPWRHAS--KQTSSPELPSYIV 428
Query: 460 VEKEGAHHVDLRFSTK------------EDPQWLKDVRRREVEIIGKWISQ 498
H DLR + P + VR+ V+ + W+S+
Sbjct: 429 TCHNCGHGSDLRGCPQSPMVIGGDSKNCSSPDAVNKVRQHIVDHMDLWLSE 479
>gi|146425169|emb|CAM84573.1| intestinal prolyl carboxypeptidase 1 [Haemonchus contortus]
Length = 1071
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 125/488 (25%), Positives = 201/488 (41%), Gaps = 60/488 (12%)
Query: 48 DSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DI 105
D+ Y+T Y TQ +DHFN Q+ TF Q+Y N+ W ++ PIF+ G EG
Sbjct: 578 DTPSEYETGYFTQPVDHFN--NQNPATFDQKYYKNE-QW--AREGGPIFLMIGGEGPSSA 632
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST------TGYLSS 159
+W A KF A +EHRYYG S + + + + T T YLSS
Sbjct: 633 KWILNENYTWLQWAKKFGATTYMLEHRYYGDS----DLQRLLFDSTDTKLKRTYTTYLSS 688
Query: 160 TQALADYASLI--IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
Q L D A+ I ID T +VFGGSY G LA W R +P++ GA+ SSAP+
Sbjct: 689 LQMLYDTANFIQAIDADNGKKGT---WIVFGGSYAGSLALWMRKLFPNLVHGAVGSSAPL 745
Query: 218 LNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS 277
+ + + + ++ R SE+C I ++ I E G E++ K F++
Sbjct: 746 ---EAKLDYHEYYQVVEASIRXYSEDCAYAIGEGFEDIHEKMLSERGREEISKTFKLNPP 802
Query: 278 EKNLAIESWLSTAFV-------YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK----- 325
+++ + F +TA Y N + ++CK + + +
Sbjct: 803 WDDVSDVFEIDKQFFFWNPMEQFTAAVQY-XGDNSGGYADGHGIPDLCKIMTNERRTPMA 861
Query: 326 -------------TGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTE 372
TG F + + Y + TA D + W WQ CTE
Sbjct: 862 RIAEFNEYMTRFFTGKPAFEYTFNS---YKEFVSTAYKAQFATDKKAAAGTLWLWQTCTE 918
Query: 373 MIMLTGGDNKDSIFEESEEDYDARARYCKEAYG--VD--PRPNWITTEFGGHKIGLVLKR 428
D+ S+F + + + C + +G +D N T ++ G K
Sbjct: 919 FGFYGTTDSGYSLF-GNPLPLNFFTQLCSDLFGWKIDYSAEMNRRATLNVNNRYGGRYKY 977
Query: 429 FASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRRE 488
+N++ G DPW+ G ++ ++ L+ K AH ++ + + D LK+ R +
Sbjct: 978 EKTNVVMTYGTLDPWTALGPVE-CKESENCLMIKGTAHCAEMYPAREADLPSLKEARSKI 1036
Query: 489 VEIIGKWI 496
II W+
Sbjct: 1037 ENIIEGWV 1044
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 116/454 (25%), Positives = 192/454 (42%), Gaps = 52/454 (11%)
Query: 49 SQGLY--KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--D 104
SQG + + Y Q LDH Q + + QRY N+ ++ N A F+ G G D
Sbjct: 29 SQGGFGGREAYLKQKLDH----TQEVKEWSQRYFYNNRYYRKGGNVA--FLMLGGMGVLD 82
Query: 105 IEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALA 164
I W +A + AL+ +EHR+YGKS P +++ KN YL+ QA+
Sbjct: 83 IGWVTNEKIPFVQMAKERGALMFALEHRFYGKSRP---TDDLSVKNLK---YLTIEQAIG 136
Query: 165 DYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKY--PHVAIGALASSAPILN--F 220
D + I ++ K + +VFGGSY G LA W R KY ++ GA+ASS PI+ F
Sbjct: 137 DIKTFIEEMNKKHKLENPKWIVFGGSYAGSLALWARDKYKDENLIAGAVASS-PIMRPKF 195
Query: 221 DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFR-----IC 275
D + + ++ + V + C + I+ + Q+ + G +L + F+ +
Sbjct: 196 DF----WEATQFAEKEIQKVDKKCGESIRIGFMQMIDMLGNQVGRSQLSELFKMRPRFLT 251
Query: 276 KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID----DPKTGNDVF 331
+N+ + + + +A+ P N ++ +K++C+ ++ D T +
Sbjct: 252 PDLRNIQLLNSIQLNNFISAVQFRGGPY-MQNGTHSYNLKQLCEIMNTETIDQLTALERV 310
Query: 332 AKLYGAASVYYNYSG--TAKCFD------LNGDSDPHGLSEWG----WQACTEMIMLTGG 379
+ + S Y N T FD L D D G + WQ CT++
Sbjct: 311 SNVRHLQSKYLNDMDKYTPVDFDALMKYLLKKDFDEEGWASVDRASLWQRCTQLGSFPTT 370
Query: 380 DNK-DSIFEESEEDYDARARYCKEAYGVDPRPNWI--TTEFGGHKIGLVLKRFASNIIFF 436
D +SIF S D A C + +G I T E G +N++
Sbjct: 371 DGAINSIF-GSLVSIDFYADLC-QVFGEKFNAEHIEMTVEETLQHYGGADNYKGTNVVIA 428
Query: 437 NGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL 470
NG DP+ L + TV+ + + G+H D+
Sbjct: 429 NGGSDPYHLLSKLSSRDPTVVTYLIEGGSHCGDM 462
>gi|268576509|ref|XP_002643234.1| Hypothetical protein CBG08099 [Caenorhabditis briggsae]
Length = 540
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 121/465 (26%), Positives = 200/465 (43%), Gaps = 64/465 (13%)
Query: 59 TQILDHFN-YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFM 115
TQ LDHF+ YN + T+ Q+Y N + S+NN+ IF+ G EG + W A+
Sbjct: 70 TQKLDHFDRYNTK---TWNQKYFYNPKY---SRNNSIIFLMIGGEGPENGRWAAKPEVQY 123
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
A +F A + +EHR++G S P I+ S+ YL++ QALAD A I + +
Sbjct: 124 LQWASEFGADVFDLEHRFFGDSWP------ISDMETSSLQYLTTQQALADLAYFIESMNQ 177
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNI 232
+ V FGGSY G L+AWFR KYP + +G++ASSAP+ L+F Y ++ +
Sbjct: 178 KYGFKNPRWVTFGGSYPGSLSAWFRQKYPELTVGSVASSAPVNLKLDF------YEYAMV 231
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI-----CKSEKNLAIESWL 287
+ D + C ++ ++ +I++ + G L F + K+ K L I ++
Sbjct: 232 VEDDLKLTDPQCAPAVRDAFTKIQQMSLTAEGRNSLNTYFNLQPPFDAKTTK-LDINNFF 290
Query: 288 STAF-VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV---------FAKLYGA 337
F + MT Y + V++MC+ + + N V F ++ A
Sbjct: 291 GNLFNTFQGMTQYTYDGQSNSTHSDKTVRKMCQIMTNATEPNTVKRVENLFLWFNQMEPA 350
Query: 338 A-------SVYYNYSGTAKCFDLN----GDSDPHGLSEWGWQACTEMIMLTGGDNKDSIF 386
+ Y++ +L G + G W W C E+ L + ++ F
Sbjct: 351 GPDLSVMPNSYWDVIAQVGSGNLTVLGEGGAAARG---WMWLCCNEIGFLQTTNQGNNAF 407
Query: 387 EESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKI-----GLVLKRFASNIIFFNGLRD 441
+ C + +G + I G+K+ G A+N++ NG D
Sbjct: 408 GTGVP-LNLFIDMCTDMFGDSMK---IKRIMSGNKLSQNYYGGADFYNATNVVLPNGSLD 463
Query: 442 PWSGGGVLK-NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVR 485
PW G K N ++ +L + AH D+ S +P L R
Sbjct: 464 PWHALGTYKTNDNQNLLPYLINGTAHCGDMYASYDGEPASLPAAR 508
>gi|18419800|ref|NP_567999.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|334187220|ref|NP_001190936.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|14334574|gb|AAK59466.1| unknown protein [Arabidopsis thaliana]
gi|22136934|gb|AAM91811.1| unknown protein [Arabidopsis thaliana]
gi|51970170|dbj|BAD43777.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|51970244|dbj|BAD43814.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|51970332|dbj|BAD43858.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|51970726|dbj|BAD44055.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|51970916|dbj|BAD44150.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|332661230|gb|AEE86630.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|332661232|gb|AEE86632.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 488
Score = 114 bits (286), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 145/531 (27%), Positives = 218/531 (41%), Gaps = 89/531 (16%)
Query: 2 ATRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTK---YH 58
A F LS + S T+SN + P RI+ L SSK Y T+ +
Sbjct: 4 ALGFALLSIFAILLSLSTLSNGLLQP----RRIS----HGLTESSK-----YLTRDELWF 50
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
Q LDH Y+P ++ F+QRY H + PIF+ EG + ++ +
Sbjct: 51 NQTLDH--YSPSDHREFKQRYYEYLDHL--RVPDGPIFMMICGEGPCNGIPND--YITVL 104
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IID 172
A KF A +V +EHRYYGKS P+ K +A +N YLSS QAL D A+ ++
Sbjct: 105 AKKFDAGIVSLEHRYYGKSSPF---KSLATENLK---YLSSKQALFDLAAFRQYYQDSLN 158
Query: 173 LKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+K N + ++P FG SY G L+AWFRLK+PH+ G+LASSA + + Y F
Sbjct: 159 VKFNRSGDVENPWFFFGASYSGALSAWFRLKFPHLTCGSLASSAVVR------AVYEFPE 212
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
Q S C ++ + K +E GL+ +A + + L +++ F
Sbjct: 213 FDQQIGESAGPECKAALQETNKLLEL------GLKVNNRAVKALFNATELDVDA----DF 262
Query: 292 VYTAMTDYPTPSNFLNPLP-AFPVKEMCKAIDD-----PKTGNDVFAKLYGAASVYYNYS 345
+Y + NP P+ E K DD K + ++G +S Y+
Sbjct: 263 LYLIADAEVMAIQYGNPDKLCVPLVEAQKNRDDLVEAYAKYVREFCVGVFGLSSKTYSRK 322
Query: 346 GTAKCFDLNGDSDPHGLSE-WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
L+ P W +Q CTE+ DSI + + + CK +
Sbjct: 323 HL-----LDTAVTPESADRLWWFQVCTEVAYFQVAPANDSI-RSHQINTEYHLDLCKSLF 376
Query: 405 --GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA---L 459
GV P + +G +I A+ IIF NG +DPW K S L +
Sbjct: 377 GKGVYPEVDATNLYYGSDRIA------ATKIIFTNGSQDPWRHAS--KQTSSPELPSYIV 428
Query: 460 VEKEGAHHVDLRFSTK------------EDPQWLKDVRRREVEIIGKWISQ 498
H DLR + P + VR+ V+ + W+S+
Sbjct: 429 TCHNCGHGSDLRGCPQSPMVIGGDSKNCSSPDAVNKVRQHIVDHMDLWLSE 479
>gi|51972041|dbj|BAD44685.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
Length = 488
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 145/531 (27%), Positives = 218/531 (41%), Gaps = 89/531 (16%)
Query: 2 ATRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTK---YH 58
A F LS + S T+SN + P RI+ L SSK Y T+ +
Sbjct: 4 ALGFALLSIFAILLSLSTLSNGLLQP----RRIS----HGLTESSK-----YLTRDELWF 50
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
Q LDH Y+P ++ F+QRY H + PIF+ EG + ++ +
Sbjct: 51 NQTLDH--YSPSDHREFKQRYYEYLDHL--RVPDGPIFMMICGEGPCNGIPND--YITVL 104
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IID 172
A KF A +V +EHRYYGKS P+ K +A +N YLSS QAL D A+ ++
Sbjct: 105 AKKFDAGIVSLEHRYYGKSSPF---KSLATENLK---YLSSKQALFDLAAFRQYYQDSLN 158
Query: 173 LKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+K N + ++P FG SY G L+AWFRLK+PH+ G+LASSA + + Y F
Sbjct: 159 VKFNRSGDVENPWFFFGASYSGALSAWFRLKFPHLTCGSLASSAVVR------AVYEFPE 212
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
Q S C ++ + K +E GL+ +A + + L +++ F
Sbjct: 213 FDQQIGESAGPECKAALQETNKLLEL------GLKVNNRAVKALFNATELDVDA----DF 262
Query: 292 VYTAMTDYPTPSNFLNPLP-AFPVKEMCKAIDD-----PKTGNDVFAKLYGAASVYYNYS 345
+Y + NP P+ E K DD K + ++G +S Y+
Sbjct: 263 LYLIADAEVMAIQYGNPDKLCVPLVEAQKNRDDLVEAYAKYVREFCVGVFGLSSKTYSRK 322
Query: 346 GTAKCFDLNGDSDPHGLSE-WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
L+ P W +Q CTE+ DSI + + + CK +
Sbjct: 323 HL-----LDTAVTPESADRLWWFQICTEVAYFQVAPANDSI-RSHQINTEYHLDLCKSLF 376
Query: 405 --GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA---L 459
GV P + +G +I A+ IIF NG +DPW K S L +
Sbjct: 377 GKGVYPEVDATNLYYGSDRIA------ATKIIFTNGSQDPWRHAS--KQTSSPELPSYIV 428
Query: 460 VEKEGAHHVDLRFSTK------------EDPQWLKDVRRREVEIIGKWISQ 498
H DLR + P + VR+ V+ + W+S+
Sbjct: 429 TCHNCGHGSDLRGCPQSPMVIGGDSKNCSSPDAVNKVRQHIVDHMDLWLSE 479
>gi|323454022|gb|EGB09893.1| hypothetical protein AURANDRAFT_10784, partial [Aureococcus
anophagefferens]
Length = 477
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 126/470 (26%), Positives = 194/470 (41%), Gaps = 95/470 (20%)
Query: 57 YHTQILDHFNYNPQS-YQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
YH +LDHF + S + + QRY ++++ WGG+ P+F+Y G EG + F+
Sbjct: 8 YHDALLDHFESDVASPTRKWSQRYYVDESFWGGA--GFPVFLYIGGEGPQGPMSPRM-FI 64
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK- 174
Y A + +ALLV +EHR+YG+S+P A + + YL+S QALAD A + +
Sbjct: 65 YAQAKEHRALLVTLEHRFYGESLP------TANMDDANLRYLASAQALADLARFRVYVSS 118
Query: 175 ----------------KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL 218
K DS + FGGSY G LAAWF+ KYP + G +ASSAP+
Sbjct: 119 YSPDAPDAASTPPLELKASPGMDSKWIAFGGSYPGDLAAWFKEKYPFLTAGVVASSAPVF 178
Query: 219 -NFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS 277
+D + + S +C ++ + +E+ KA C S
Sbjct: 179 AEYDFAQYSEVVGDALAYPLIGGSPSCADAVR---RGVEDLVAALEAGAAPPKALEPCGS 235
Query: 278 EKNLAIESWLSTAFVYTAMTDYPTPSNF-------LNPLPAFPVKEMCKAIDDPKTGNDV 330
I S + A Y+++ NF L P + V ++C A+D + +
Sbjct: 236 -----IASGVDRAQYYSSIF-----GNFQGVVQYNLEAGPPY-VSDVCDAVDGAPSPIEA 284
Query: 331 FAKLYGAASVYYNYSGTAKCFDLNGDSDP---------HGLS---EWGWQACTEMIMLTG 378
A AA+ ++ +GTA C + + D G+S +W WQ+C E
Sbjct: 285 LA----AATSLFSSNGTA-CLSSDFEKDYVSVLRNATFDGVSADRQWIWQSCNEFGFFQT 339
Query: 379 GDNKD--SIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHK---IGLVLKRF---- 429
K + F A C +GVD E+ G + GLV F
Sbjct: 340 ISPKSPFAAFGAYLNVSTAGRAVCSGGFGVD--------EYDGPRADAAGLVANAFYGGR 391
Query: 430 ---ASNIIFFNGLRDPWSGGGVLKNI---------SKTVLALVEKEGAHH 467
NI NG DPW G++ + + + +VE +G H
Sbjct: 392 TLQGINITAVNGNMDPWHSLGIVNDTDAYHAPSQRTSAGVHVVELDGTAH 441
>gi|91806190|gb|ABE65823.1| serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 417
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 118/448 (26%), Positives = 190/448 (42%), Gaps = 78/448 (17%)
Query: 92 NAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
+ P+F+ EG A + ++ +A KF+A +V +EHRYYGKS P+ +A +N
Sbjct: 8 DGPMFMIICGEGPCSGIAND--YINVLAKKFQAGVVSLEHRYYGKSSPFNS---LATENL 62
Query: 152 STTGYLSSTQALADYASL------IIDLKKNLTA--TDSPVVVFGGSYGGMLAAWFRLKY 203
YLSS QAL D AS ++ K N+++ +D+P FG SY G L+AWFRLK+
Sbjct: 63 K---YLSSKQALYDLASFRQYYQESLNKKLNISSGGSDNPWFFFGISYSGALSAWFRLKF 119
Query: 204 PHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPG 263
PH+ G+LASSA + + Y FS Q S + C ++ + K +E
Sbjct: 120 PHLTCGSLASSAVVR------AIYEFSEFDQQIGESAGQECKLALQETNKLLEL------ 167
Query: 264 GLEKLQKAFRICKSEKNLAIES--WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI 321
GL+ KA + + L +++ TA Y P PL +
Sbjct: 168 GLKVKNKAVKSLFNATELDVDADFLYLTADAAVMAFQYGNPDKLCVPL-----------V 216
Query: 322 DDPKTGNDVFA-----------KLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQAC 370
+ K G+D+ +++G YN + DS W +QAC
Sbjct: 217 EAKKNGSDLVVTYSTYVREYCMRIWGLRVRTYNRKHLRNTV-VTADS---AYRLWWFQAC 272
Query: 371 TEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD--PRPNWITTEFGGHKIGLVLKR 428
TE+ DS+ + + CK +G D P+ + +GG ++
Sbjct: 273 TELGYFQVAPKYDSV-RSHQINTTFHLDLCKSLFGKDVYPKVDATNLYYGGDRLA----- 326
Query: 429 FASNIIFFNGLRDPWSGGGVLKNISKTVLALVE-KEGAHHVDLRFSTKED---------- 477
A+ IIF NG DPW + + +++ + H D+R +
Sbjct: 327 -ATKIIFTNGSEDPWRHASKQNSTHEMPSYIIKCRNCGHGSDIRGCPQSPMVIEGKSNNC 385
Query: 478 --PQWLKDVRRREVEIIGKWISQYFQDL 503
P ++ VR++ VE I W+S+ Q +
Sbjct: 386 SLPDYVNKVRQQMVEHIDLWLSECRQSI 413
>gi|297802314|ref|XP_002869041.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297314877|gb|EFH45300.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 491
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 145/541 (26%), Positives = 222/541 (41%), Gaps = 113/541 (20%)
Query: 8 LSFCLLFSSTLTISNAKIFPTFPSSRITPEKLS-SLISSSKDSQGLYKTK---YHTQILD 63
+ F LL + T+ +S + + S + P ++S L SSK Y T+ + Q LD
Sbjct: 5 ICFALLTTFTILLSYSSL-----SDGLQPRRISHGLTESSK-----YLTRDELWFNQTLD 54
Query: 64 HFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFK 123
H Y+P ++ F+QRY H + PIF+ EG + ++ +A KF
Sbjct: 55 H--YSPSDHRKFRQRYYEYLDHL--RVPDGPIFMMICGEGPCNGIPND--YITVLAKKFD 108
Query: 124 ALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL----------IIDL 173
A +V +EHRYYGKS P+ K +A +N YLSS QAL D A+ +++
Sbjct: 109 AGIVSLEHRYYGKSSPF---KSLATENLK---YLSSKQALFDLAAFRQYYQARSNDSLNV 162
Query: 174 KKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
K N + ++P FG SY G L+AWFRLK+PH+ G+LASSA + + Y F
Sbjct: 163 KFNRSGNVENPWFFFGASYSGALSAWFRLKFPHLTCGSLASSAVVR------AVYEFPEF 216
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
Q S C ++ + K +E GL+ KA + + L +++ F+
Sbjct: 217 DQQIGESAGPECKAALQETNKLLEL------GLKVNNKAVKALFNATELDVDA----DFL 266
Query: 293 Y------TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV---FAK--------LY 335
Y Y P PL ++ K G D+ +AK ++
Sbjct: 267 YLIADAEVMAIQYGNPDKLCVPL-----------VEAHKNGGDLVEAYAKYVREFCMGVF 315
Query: 336 GAASVYYNYSGTAKCFDLNGDSDPHGLSE-WGWQACTEMIMLTGGDNKDSIFEESEEDYD 394
G +S Y+ L+ P W +Q CTE+ DSI + + +
Sbjct: 316 GLSSKTYSRKHL-----LDTAVTPESADRLWWFQVCTEVAYFQVAPANDSI-RSHQINTE 369
Query: 395 ARARYCKEAY--GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
CK + GV P + +G +I A+ IIF NG +DPW K
Sbjct: 370 YHLDLCKSLFGKGVYPEVDATNLYYGSDRIA------ATKIIFTNGSQDPWRHAS--KQT 421
Query: 453 SKTVLA---LVEKEGAHHVDLRFSTK------------EDPQWLKDVRRREVEIIGKWIS 497
S L + H DLR + P + VR+ VE + W+S
Sbjct: 422 SSPELPSYIVTCHNCGHGSDLRGCPQSPMVIEGDSKNCSSPDAVNKVRQHIVEHMDLWLS 481
Query: 498 Q 498
+
Sbjct: 482 E 482
>gi|444523275|gb|ELV13498.1| Thymus-specific serine protease [Tupaia chinensis]
Length = 393
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 78/226 (34%), Positives = 117/226 (51%), Gaps = 18/226 (7%)
Query: 52 LYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN 111
L K + Q LD FN ++F QRY +N HW G +AP+F++ G EG + +
Sbjct: 69 LPKAGWPQQPLDPFNAT--DGRSFLQRYWVNAQHWAG--QDAPVFLHLGGEGGLGPGSVM 124
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
G +AP + AL++ +EHR+YG S+P GG + +LSS ALAD AS +
Sbjct: 125 RGHPAALAPAWGALVISLEHRFYGLSVPAGGLGLAQLR------FLSSRHALADAASARL 178
Query: 172 DLKKNL-TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
+L + L + SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ + FS
Sbjct: 179 ELSRLLNVSASSPWICFGGSYAGSLAAWARLKFPHLVSASVASSAPVRAVLD------FS 232
Query: 231 NIITQDFRSVSENCYKVIKG-SWKQIEETAKKPGGLEKLQKAFRIC 275
RS+ C+ V + + Q+ T + + Q ++ C
Sbjct: 233 AYNEMVMRSLCLRCFSVSRAETVAQLRATEPQAAVVGDRQWLYQTC 278
>gi|330837832|ref|XP_003292053.1| hypothetical protein DICPUDRAFT_39962 [Dictyostelium purpureum]
gi|325077726|gb|EGC31420.1| hypothetical protein DICPUDRAFT_39962 [Dictyostelium purpureum]
Length = 481
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 110/432 (25%), Positives = 189/432 (43%), Gaps = 51/432 (11%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
K Q +DH+N+ + TF+QR+ ++ G N PIF G E +++ N
Sbjct: 49 KLFNQKIDHYNFQHGNL-TFKQRFFEYSNYYDG---NGPIFFVFGPEQELKEDYINNRQY 104
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
+ A A ++ +EHRYYGKSI + + YL+S QA+AD A I KK
Sbjct: 105 EEWAKTLNASIICLEHRYYGKSIF------TDHLTTESLQYLNSDQAIADVAYFITWYKK 158
Query: 176 NLTATDSPVVV-FGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSN 231
D V FG SYGG +AA F++KYPH+ ++SS P+ LNF + N
Sbjct: 159 ENKIDDGKRWVGFGASYGGTIAAQFKIKYPHLIDIIVSSSGPVSPELNFFQYLEI--VQN 216
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNLAIESWLS 288
I + + E C + I+ + +IEE K G L+ FR+C ++EK+ ++ + +
Sbjct: 217 TIISEVQD-GERCVENIRNATLEIEEII-KFGNHNLLKDKFRLCAPLENEKDFSLLEFTN 274
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP-KTGNDVFAKLYGAASVYYNYSGT 347
+ + Y + + ++++C +++ K+ D + +++ S
Sbjct: 275 SLVFMDTVQYYDSNKD--------KLQKICNILNNEFKSSLDNYIQIWLEVSY-----PN 321
Query: 348 AKCFDLNGDSD---------PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
KC ++N + H W +Q CTE ++K+ F S +
Sbjct: 322 VKCINVNYKNHIEIWKERNVDHQSKAWLYQTCTEYGYFMTTESKNQPF-GSLLNLQFYTD 380
Query: 399 YCKEAYGVD---PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
C++ +G+ P W ++GG KI +I++ N DPW N+ K
Sbjct: 381 MCQDIFGIRNMIPNTKWANDQYGGFKIN---SESIKSILYINSSLDPWYPLSFTPNMEKN 437
Query: 456 VLALVEKEGAHH 467
+ + +G H
Sbjct: 438 GINTLFIKGHSH 449
>gi|168067182|ref|XP_001785503.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662877|gb|EDQ49678.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 483
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 126/458 (27%), Positives = 190/458 (41%), Gaps = 52/458 (11%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHF+ Q + FQQRY ++ NAPIF+ E + ++ +A
Sbjct: 50 QRLDHFS--SQDRREFQQRYYEFLDYF--KDPNAPIFLRICGESTCSGIPND--YLLVLA 103
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IIDL 173
KF A +V +EHRYYG+S P+ +E+ N YLSS QAL D AS I+
Sbjct: 104 KKFGAAVVSLEHRYYGESSPF---EELTTDNLK---YLSSKQALFDLASYRNFYQESINK 157
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K N T ++P +VFG SY G L+AWFRLK+PH+ G+L+SS +L N ++
Sbjct: 158 KFNTTEKENPWIVFGVSYPGALSAWFRLKFPHLVRGSLSSSGVVLAVHN------YTAFD 211
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVY 293
Q S C ++ +++++ K++ F + + + + L+ A
Sbjct: 212 QQVAASAGPACANALRDVTQEVDKALT--SNSHKIKALFGVEQLKNDGDFRYLLADA--A 267
Query: 294 TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL 353
Y P PL A + F ++G + Y+ + L
Sbjct: 268 AEAFQYGNPDILCLPLVAAYSSGQNVVAAYAEFVKLFFFGIFGVNPISYD----QEHLKL 323
Query: 354 NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEES-EEDYDARARYCKEAYGVDPRPNW 412
G +W +Q CTE+ +++SI E Y C +G P
Sbjct: 324 TKSGPDTGDRQWWYQVCTEVAYFQVAPSQNSIRSPGVNEKY--HLDLCANVFGNGTYPEV 381
Query: 413 ITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS---KTVLALVEKEGAHH 467
T +GG I ASNI+F NG +DPW K IS + + + H
Sbjct: 382 DITNLYYGGSGIT------ASNIVFTNGSQDPWRHAS--KQISSPGEPAIIITCHNCGHG 433
Query: 468 VDLRFSTKEDPQWLKDVRR----REVEIIGKWISQYFQ 501
VDLR + Q D + EV + I+ Y Q
Sbjct: 434 VDLRGCPQSPHQIEGDATKCAKPNEVHKARQQIADYIQ 471
>gi|10140734|gb|AAG13567.1|AC073867_13 putative serine peptidase [Oryza sativa Japonica Group]
Length = 502
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 124/488 (25%), Positives = 201/488 (41%), Gaps = 88/488 (18%)
Query: 54 KTKYHTQILDHFNYNP---------QSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNEG 103
+ ++ Q LDHF+ Q ++ F+QRY D H GG P+F+ E
Sbjct: 51 EERWMDQRLDHFSPTRPRADVAGGVQDHRQFKQRYYEFADYHAGG----GPVFLRICGES 106
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
+ ++ ++ KF A +V EHRYYGKS P+ + + +N +LSS QAL
Sbjct: 107 SCNGIPND--YLAVLSKKFGAAVVTPEHRYYGKSSPF---ESLTTENLR---FLSSKQAL 158
Query: 164 ADYASL------IIDLKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP 216
D + I++ + N ++ D+P VFG SY G L+AWFRLK+PH+ G+LASS
Sbjct: 159 FDLVAFRQHYQEILNARYNRSSGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGV 218
Query: 217 ILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
+L + Y+F++ Q S C ++ + ++E + K+ K
Sbjct: 219 VL------AVYNFTDFDKQVGDSAGPECKAALQEVTRLVDEQLRLDSRSVKVLFGAEKLK 272
Query: 277 SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG--------- 327
++ + + A + Y +P +PL I+ KTG
Sbjct: 273 NDGDFLFFLADAAAIGF----QYGSPDAVCSPL-----------INAKKTGRSLVETYAQ 317
Query: 328 --NDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSI 385
D F + +G Y+ + N D W +Q C+E+ DSI
Sbjct: 318 YVQDFFIRRWGTTVSSYDQE-----YLKNTTPDDTSSRLWWFQVCSEVAYFQVAPKNDSI 372
Query: 386 FEESEEDYDARARYCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPW 443
+E + C+ +G P+ T +GG +I AS I+F NG +DPW
Sbjct: 373 -RSTEINTGYHLDLCRNVFGEGVYPDVFMTNLYYGGTRIA------ASKIVFTNGSQDPW 425
Query: 444 SGGGVLKNISKTVLALVE-KEGAHHVDLR------FSTKEDPQ------WLKDVRRREVE 490
K+ +++ + H DLR F + DP + VR++
Sbjct: 426 RHASKQKSSKYMPSYIIKCRNCGHGTDLRGCPQLPFRIEGDPSNCSSPAAVSTVRKQIAS 485
Query: 491 IIGKWISQ 498
I W+SQ
Sbjct: 486 HISLWLSQ 493
>gi|326427042|gb|EGD72612.1| hypothetical protein PTSG_04347 [Salpingoeca sp. ATCC 50818]
Length = 482
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 133/467 (28%), Positives = 196/467 (41%), Gaps = 74/467 (15%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y LDHF++ S TF+ RY I+D+ + KN + F Y G EG NTG
Sbjct: 62 YFDFFLDHFDH---SSPTFRGRYYIDDSQF---KNGSVCFFYMGGEG------PNTGIRN 109
Query: 117 D----VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
D +A ++KAL+V IEHR+YG S+P+ + + N YL+S QALAD A LI
Sbjct: 110 DYVSYLAKQYKALIVSIEHRFYGDSVPF---DDFSVTNLE---YLTSRQALADAAQLIKH 163
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
+ + T S FGGSY G L+AWFR+KYP V +G+L+SS + N + F+
Sbjct: 164 VNSSDTYKCSAWFAFGGSYSGALSAWFRVKYPDVIVGSLSSSGVV----NAI--LDFTAF 217
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
Q ++ +C K ++ E K KA +++ +++
Sbjct: 218 DVQVRNAIGFSCTKDLQRVTAAFETALNKSDKSNAHAKALFSVRADIPDGDFAYMLAD-- 275
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKE-MCKAIDDPKTGND------VFA----KLYGA---A 338
AM D + KE +C AI + D FA K +GA
Sbjct: 276 SAAMAD------------QYGSKEKLCSAISGLRNEKDDEIVMQTFANFTIKFWGADFGP 323
Query: 339 SVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
S +Y+ ++C N + W WQ C E+ +S+ S + +
Sbjct: 324 SCFYD----SECVRSNPAAWQPTARSWWWQKCHELAYWQNAPVVNSL-RMSLLSMNYHKQ 378
Query: 399 YCKEAY--GVDPRPNWITTEFGG-HKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
C+ + GV P +GG H G +NI F + DPW V +S
Sbjct: 379 RCEFMFAKGVFPDTQGTNKYYGGKHPNG-------TNIFFSDFSDDPWQQASVRTTLSPA 431
Query: 456 V-LALVEKEGAHH-VDLRFSTKE-DPQWLKDVRRREVEIIGKWISQY 499
+ LV G H +DL E DP LK R + + W+ Y
Sbjct: 432 LPYELVTCNGCGHCMDLHAPDDENDPNALKQGRVAFEKHLSTWLKPY 478
>gi|195062815|ref|XP_001996259.1| Pro-X carboxypeptidase [Drosophila grimshawi]
gi|193899754|gb|EDV98620.1| Pro-X carboxypeptidase [Drosophila grimshawi]
Length = 478
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 117/456 (25%), Positives = 186/456 (40%), Gaps = 52/456 (11%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
TQ LDHF+ + + +T+Q RYLIND +PIF+Y G E ++ G YD+
Sbjct: 58 TQPLDHFDESNE--KTYQMRYLINDEF---QTEGSPIFIYLGGEWEVSPGMIEKGHWYDL 112
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK-KNL 177
A + K LL++ EHRYYG S+P YL QALAD I LK +N
Sbjct: 113 AKEHKGLLIYTEHRYYGNSVP------TEKMTVDDLQYLHVKQALADVKHFITTLKSENA 166
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF 237
+S V++ GGSY + WF+ YP + +G ASSAP+L V + + + + F
Sbjct: 167 QLANSKVLLAGGSYSATMVVWFKRLYPDLVVGGWASSAPLL---AKVDFFEYKEVTGKAF 223
Query: 238 RSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS---EKNLAIESWL-STAFV 292
+ + CY I+ +E E + R+C S E +L + + S + V
Sbjct: 224 AELGGQKCYDRIQKGIADLEYMFDNKRSAEA-RSMLRLCSSFDHENDLDMWNLFGSISNV 282
Query: 293 YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKT--GNDVFAKLYGAASVYYNYSGTAKC 350
+ ++ Y P ++ F + DD T N V+ Y T
Sbjct: 283 FASLAQYQQPGE-IDYYCTFLL-----TFDDDATAIANFVYWAWGYETCTDARYQETVDY 336
Query: 351 FDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG----- 405
F L+ D W +Q C E + F ++ CK+ +
Sbjct: 337 F-LSAIEDFGAARPWYYQTCNEYGWYQSSRSLKQPF-GTKFPATFYIEMCKDVFSSKYGN 394
Query: 406 --VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS--GGGVLKNISKTVLALVE 461
+ +FGG ++ N+ +G DPWS G GV + A V
Sbjct: 395 EMIQSNTAQTNLDFGG------MEPNVENVYMTHGELDPWSAIGHGVAEG------ATVI 442
Query: 462 KEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ +H D + D ++ + R E++ +W++
Sbjct: 443 SKASHCNDFGSISPSDSSEMRASKERIAELVREWLA 478
>gi|51968542|dbj|BAD42963.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
Length = 462
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 136/500 (27%), Positives = 208/500 (41%), Gaps = 82/500 (16%)
Query: 34 ITPEKLS-SLISSSKDSQGLYKTK---YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGS 89
+ P ++S L SSK Y T+ + Q LDH Y+P ++ F+QRY H
Sbjct: 1 LQPRRISHGLTESSK-----YLTRDELWFNQTLDH--YSPSDHREFKQRYYEYLDHL--R 51
Query: 90 KNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
+ PIF+ EG + ++ +A KF A +V +EHRYYGKS P+ K +A +
Sbjct: 52 VPDGPIFMMICGEGPCNGIPND--YITVLAKKFDAGIVSLEHRYYGKSSPF---KSLATE 106
Query: 150 NASTTGYLSSTQALADYASL------IIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLK 202
N YLSS QAL D A+ +++K N + ++P FG SY G L+AWFRLK
Sbjct: 107 NLK---YLSSKQALFDLAAFRQYYQDSLNVKFNRSGDVENPWFFFGASYSGALSAWFRLK 163
Query: 203 YPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKP 262
+PH+ G+LASSA + + Y F Q S C ++ + K +E
Sbjct: 164 FPHLTCGSLASSAVVR------AVYEFPEFDQQIGESAGPECKAALQETNKLLEL----- 212
Query: 263 GGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLP-AFPVKEMCKAI 321
GL+ +A + + L +++ F+Y + NP P+ E K
Sbjct: 213 -GLKVNNRAVKALFNATELDVDA----DFLYLIADAEVMAIQYGNPDKLCVPLVEAQKNR 267
Query: 322 DD-----PKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE-WGWQACTEMIM 375
DD K + ++G +S Y+ L+ P W +Q CTE+
Sbjct: 268 DDLVEAYAKYVREFCVGVFGLSSKTYSRKHL-----LDTAVTPESADRLWWFQVCTEVAY 322
Query: 376 LTGGDNKDSIFEESEEDYDARARYCKEAY--GVDPRPNWITTEFGGHKIGLVLKRFASNI 433
DSI + + + CK + GV P + +G +I A+ I
Sbjct: 323 FQVAPANDSI-RSHQINTEYHLDLCKSLFGKGVYPEVDATNLYYGSDRIA------ATKI 375
Query: 434 IFFNGLRDPWSGGGVLKNISKTVLA---LVEKEGAHHVDLRFSTK------------EDP 478
IF NG +DPW K S L + H DLR + P
Sbjct: 376 IFTNGSQDPWRHAS--KQTSSPELPSYIVTCHNCGHGSDLRGCPQSPMVIGGDSKNCSSP 433
Query: 479 QWLKDVRRREVEIIGKWISQ 498
+ VR+ V+ + W+S+
Sbjct: 434 DAVNKVRQHIVDHMDLWLSE 453
>gi|413933853|gb|AFW68404.1| putative serine peptidase S28 family protein [Zea mays]
Length = 478
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 129/477 (27%), Positives = 200/477 (41%), Gaps = 81/477 (16%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
++ Q LDHF+ P ++ F+QRY D H P+F+ E + + +
Sbjct: 40 RWMNQRLDHFS--PTDHRQFKQRYFEFLDYH---RAPGGPVFLRICGESACDGIPND--Y 92
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL----- 169
+ +A KF A +V EHRYYGKS P+ K++ +N +LSS QAL D A
Sbjct: 93 LAVLAKKFGAAVVTPEHRYYGKSSPF---KQLTTENLR---FLSSKQALFDLAVFRQYYQ 146
Query: 170 -IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS 228
++ + N + D+P V G SY G L+AWFRLK+PH+ G+LASS +L + Y+
Sbjct: 147 ESLNARYNRSGFDNPWFVIGVSYSGALSAWFRLKFPHLTCGSLASSGVVL------AVYN 200
Query: 229 FSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQ-KAFRICKSEKNLAIESWL 287
+++ Q S C V+ ++I E + LE KA ++ KN
Sbjct: 201 YTDFDKQVGESAGPECKAVL----QEITELVDEQLRLESHSVKALFGAQTLKN------- 249
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--NYS 345
F++ T + NP P + KA + K + +A+ YY
Sbjct: 250 DGDFLFFLADAAATTFQYGNPDALCP--PLIKAKKNRKNLVEAYAQF---VKDYYIKKME 304
Query: 346 GTAKCFDLN--GDSDPHGLSE--WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
+D ++ PH S W +Q C+E+ DS+ + + C+
Sbjct: 305 TPPSSYDREYLKETTPHDSSSRLWWFQVCSEVAYFQVAPKNDSV-RSARINTRYHLDLCR 363
Query: 402 EAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLAL 459
+G P+ T +GG +I AS I+F NG +DPW K+ SK + +
Sbjct: 364 HVFGEGVYPDVFMTNLYYGGTRIA------ASKIVFTNGSQDPWRHASKQKS-SKDMPSY 416
Query: 460 VEK--EGAHHVDLRFSTKEDPQW----------------LKDVRRREVEIIGKWISQ 498
+ K H DLR PQW + R R + I W+SQ
Sbjct: 417 IMKCRNCGHGTDLRGC----PQWPFRIEGDASNCSSLAVVNTARERIAKHIDLWLSQ 469
>gi|157167876|ref|XP_001656138.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108871004|gb|EAT35229.1| AAEL012590-PA [Aedes aegypti]
Length = 489
Score = 111 bits (278), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 119/522 (22%), Positives = 215/522 (41%), Gaps = 71/522 (13%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDH 64
F FL + +S N F + R+ P L S ++ ++ L+ T Q+
Sbjct: 6 FAFLGIAVALASAAITINPSRPVLFGTHRVIPRNLGS--NAESNNTALWNTVNLRQVYT- 62
Query: 65 FNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKA 124
NPQ+ +F RY+ N+ H+ + PIF++ G +E G D+A +
Sbjct: 63 ---NPQNRNSFSMRYVTNNRHY---RRGGPIFLFVGGPWPLEAHLVEQGHFVDMAAEMNG 116
Query: 125 LLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTAT-DSP 183
LV E RYYG+SIP ++++ N YL + Q L++ A+ I LK+++ ++
Sbjct: 117 FLVANELRYYGESIPV---EDVSRNNFR---YLHNVQILSELATFIAHLKEDVVRDPNAK 170
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV-SE 242
V++ G Y LA W R ++PH+ G +SS + N F+ ++ ++ R +
Sbjct: 171 VILAGVGYSASLAQWMRQRFPHLIHGVWSSSGMVRASTNY---REFAEVVGENIRRFGGD 227
Query: 243 NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAF--VYTAMT 297
+CY I +++ E G + + F C+ + L +E++ F + +
Sbjct: 228 DCYSTIWRAFRTAENLIDA-GLSTTVDELFHTCRPIDAANALEVEAFFYGIFNEISREVV 286
Query: 298 DYPTPSNFLNPLPAFPVKEMCK---AIDDPKTGNDVFAKLYGA-------ASVYYNYSGT 347
D N +K+MC+ A DD + ++ + L G A + + T
Sbjct: 287 DADLRGN---------IKQMCEPLTASDDENSLLELASWLTGRFPNAECLAMDFQSIVDT 337
Query: 348 AKCFDLNGDSDPHGLSEWGWQACTEMIM-LTGG------------DNKDSIFEESEEDYD 394
D++ + G +W +Q CTE+ LT D I E+ +D+
Sbjct: 338 YNTIDVDSEIVKSGERQWMFQRCTELGWPLTAASQYQPFGRRFSTDLFHGICEQLFDDWL 397
Query: 395 ARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
R R+ + +GG + + N I G DPWS GV + I
Sbjct: 398 TRDRF-------EALIRQTNDYYGGARPDI------RNSISTQGTLDPWSFAGVREVIFN 444
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ ++ H D+ ++ED L+ + + I +W+
Sbjct: 445 NTYVTIIRDAFHGQDMASISEEDSIELRRSKEMVRDTIRRWV 486
>gi|388499696|gb|AFK37914.1| unknown [Lotus japonicus]
Length = 390
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 108/362 (29%), Positives = 165/362 (45%), Gaps = 47/362 (12%)
Query: 35 TPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAP 94
TP + L ++ D + ++ +Q LDH Y+P ++ FQQRY ++ + P
Sbjct: 23 TPHR-HRLSETATDRYLTKQEQWFSQTLDH--YSPYDHRKFQQRYYEFLDYF--RIPDGP 77
Query: 95 IFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT 154
+F+ E + ++ +A KF A +V +EHRYYGKS P+ K +A KN
Sbjct: 78 VFLVICGEYSCNGIRND--YIAVLAKKFGAAVVSLEHRYYGKSSPF---KSLATKNLR-- 130
Query: 155 GYLSSTQALADYASL------IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
YLSS QAL D A ++ K N T T++P VFG SY G L+AWFRLK+PH+
Sbjct: 131 -YLSSKQALFDLAVFRQNYQDSLNAKLNRTKTENPWFVFGVSYPGALSAWFRLKFPHLTC 189
Query: 209 GALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKL 268
G+LASSA +L + Y+F+ Q S C ++ + + IE+ G
Sbjct: 190 GSLASSAVVL------AVYNFTEFDQQIGESAGAECKAALQETTQLIEKKLATDG----- 238
Query: 269 QKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN 328
KA + + +L I+ F+Y T + NP +CK + + K
Sbjct: 239 -KALKASFNAADLEIDG----DFMYFLADAGITAFQYGNP------DILCKPLVEAKKDG 287
Query: 329 DVFAKLYGAASVYYNYSGTAKCFD---LNGDSDPHGLSE--WGWQACTEMIMLTGGDNKD 383
+ Y V NY + + +D L S S+ W +Q CTE+ + D
Sbjct: 288 EDLVDAY-VKYVKENYGESTESYDQENLKNTSVSENSSDRLWWFQVCTEVAYFQVAPSND 346
Query: 384 SI 385
SI
Sbjct: 347 SI 348
>gi|268576170|ref|XP_002643065.1| Hypothetical protein CBG22982 [Caenorhabditis briggsae]
Length = 509
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 119/472 (25%), Positives = 192/472 (40%), Gaps = 62/472 (13%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE--WFAQNTGFMY 116
TQ LDHF+ + +TFQQRY N+ + K+ P F+ G EG W + +
Sbjct: 64 TQTLDHFD--SSNTKTFQQRYYHNNQWY---KDGGPAFLMLGGEGPESSYWVSYPGLEIT 118
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK-K 175
++A K A + IEHR+YG++ P +++ N YLSS QA+ D A+ I + K
Sbjct: 119 NLAAKQGAWVFDIEHRFYGETKP---TSDMSVPNLK---YLSSAQAIEDAATFIKAMTLK 172
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
++ V FGGSY G LAAW R K+P + A+ SS P+ V + ++
Sbjct: 173 YPQLKNAKWVTFGGSYSGALAAWTRAKHPELVYAAVGSSGPV---QAEVDFKEYLEVVQN 229
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAIESWLSTAF 291
S C + + + + + G ++L+ AF C+ +KNL W +
Sbjct: 230 SITRNSTACAASVTAGFNLVAQLLQTTDGRKQLKTAFHTCQDIQLDDKNLKY-FWETVYS 288
Query: 292 VYTAMTDY--------PTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF---AKLYGAASV 340
Y + Y T N + + + + K ND F + YG +
Sbjct: 289 PYMEIVQYSGDAAGAFATQLTISNAICKYHLNAKTDTLTKMKQVNDYFNLVQEYYGCNDI 348
Query: 341 YYNYSGTAKCFDLNGDSD---PHGLSEWGWQACTEM--IMLTGGDNKDSIFEESEEDYDA 395
Y D D+ W WQ CTE T F + +
Sbjct: 349 NYQ-----AFIDFMADTSFGYAQSDRAWVWQTCTEFGYYQSTSSATAGPWFGGNA---NL 400
Query: 396 RARY----CKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS 444
A+Y C YG V ++ +GG + GL R I+ NG DPW
Sbjct: 401 PAQYYIDECTAIYGGAYNSQEVQTAVDYTNQYYGG-RDGLTTSR----ILLPNGDIDPWH 455
Query: 445 GGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
G L + + ++ +V AH D+ ++ D +L + R++ +++ W+
Sbjct: 456 ALGKLTSSTADIVPVVINGTAHCADMYGASSHDSIYLTNARQKISDVLDGWL 507
>gi|302797895|ref|XP_002980708.1| hypothetical protein SELMODRAFT_444586 [Selaginella moellendorffii]
gi|300151714|gb|EFJ18359.1| hypothetical protein SELMODRAFT_444586 [Selaginella moellendorffii]
Length = 1028
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 138/490 (28%), Positives = 205/490 (41%), Gaps = 76/490 (15%)
Query: 4 RFIFLSFCLLFSSTLTISNAKIFPTFPSSRI--TPEKLSSLISSSKDSQGLYKTKYHT-Q 60
R + + F LL +ST + S+A + PSSR+ PE L S +G + T Q
Sbjct: 2 RMLLVCFVLLATST-SFSSAFV----PSSRLGFKPEFLPE--GSRSPPRGKETVNFFTRQ 54
Query: 61 ILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE----GDIEWFAQNTGFMY 116
LDHF P+ + F Q+YL + +N PIF+ E GD T ++
Sbjct: 55 KLDHFA--PEDPRVFSQKYLELLDFF--RPHNGPIFLVMCGESTCTGDYV-----TTYVG 105
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI------ 170
+A F A +V +EHRYYG S P+ + N YL+S Q+L D+A I
Sbjct: 106 TLAESFGAAIVTVEHRYYGHSSPF------QHLNLHNLKYLTSKQSLFDHAVFIDYYQDL 159
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
I+ K N T + P +V GGSY G L+AWFRLK+PH+ G+ ASSA + + +S
Sbjct: 160 INQKYNKTEKN-PWIVIGGSYAGALSAWFRLKFPHLVAGSWASSAV------VEAILDYS 212
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
Q SV C + ++ + E + K F + N+ ++ L A
Sbjct: 213 AYDKQLGVSVGPKCKQALQEITRLTEHGLVENATEIKYLFGF---SPKDNITDDTLL--A 267
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKC 350
+V A + + +C + + N K Y N
Sbjct: 268 YVANAAA---------GEIQYGKIDALCDPLLKAEKSNRNLLKTYAKILDRINS------ 312
Query: 351 FDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP 410
D NG+ + W +Q CTE+ ++ S S + YC E +G P
Sbjct: 313 -DTNGNERDN--ESWDFQYCTEVGYFQVASDRKSSIRSSRINTQFFINYCAEQFGNGTFP 369
Query: 411 NWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK--EGAH 466
+ TT +GG I S I+F NG +DPW + SK + ALV + +H
Sbjct: 370 DVKTTNLYYGGWNIA------GSRIMFLNGSQDPWRHASK-QTSSKDMPALVLRCHTCSH 422
Query: 467 HVDLRFSTKE 476
VDL +E
Sbjct: 423 CVDLSAPCRE 432
>gi|194744685|ref|XP_001954823.1| GF16549 [Drosophila ananassae]
gi|190627860|gb|EDV43384.1| GF16549 [Drosophila ananassae]
Length = 489
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 111/461 (24%), Positives = 193/461 (41%), Gaps = 48/461 (10%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-WFAQNTGF 114
++ TQ LD N++ + T+ R IN+ H+ + +PIF+Y G E +I+ W NT
Sbjct: 58 RWITQKLD--NFDDDNNATWSDRIYINEKHF---VDGSPIFIYLGGEWEIQSWDISNT-L 111
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+ D+ K ++ EHR++GKSIP + Y + QALAD ++I LK
Sbjct: 112 LADITKKHNGTIITTEHRFFGKSIPI-----TPLSTENLEKYQNVNQALADVINVIQTLK 166
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
+ DS VV+ G SY G +AAW R YP + +G+ ASSAPI+ V + ++
Sbjct: 167 EEGKYKDSKVVISGCSYSGAMAAWIRKLYPDIIVGSWASSAPIV---AKVDFKDYFKVVG 223
Query: 235 QDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS----EKNLAIESWLST 289
+ ++++ + CY +I + E+ G ++ +K +C K + + +
Sbjct: 224 ESYQTLGGQYCYDLIDNATSYYEDLFANGKG-DQAKKELNLCDDFDPKNKRDRWQIFSTI 282
Query: 290 AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF----AKLYGAASVYYNYS 345
A ++ + Y P+N+ P ++ DD F + + +
Sbjct: 283 ANIFAGIAQYQIPANYDIPKQCSVLRSFS---DDDAEALSKFINWKVHEHTGECISATFD 339
Query: 346 GTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEES------EEDYDARARY 399
GT ++ D+ W +Q C+E + F S E+ +A
Sbjct: 340 GTTGYYEWAKDNYEDSDLPWFFQTCSEFGWFQSSGSSHQPFGSSFPSKLYEDTCEAVFGS 399
Query: 400 CKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS--GGGVLKNISKTVL 457
G+ EFGG I +N+ F G D W G GV +
Sbjct: 400 KYNTTGIRANAKATNAEFGGLDIDY------TNVYFVQGGLDGWKKVGAGVEEG------ 447
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
A + AH DL + D L +++ + ++ KW+++
Sbjct: 448 ATIIPSAAHCSDLGSISASDSPELVASKQKVIALVDKWLAE 488
>gi|344268154|ref|XP_003405927.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 486
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 120/480 (25%), Positives = 201/480 (41%), Gaps = 87/480 (18%)
Query: 77 QRYLINDTHWGGSKNNAPIFV-YTGNE-GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYY 134
++Y IN + K P+F+ GNE IEW +N ++ A + AL + +EHR+Y
Sbjct: 36 EKYYINYDFY---KPGGPVFLKVQGNEPASIEWIRRNFTWI-TYAQRLGALCLLLEHRFY 91
Query: 135 GKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGM 194
G S P ++++ +N YLSS QA+AD A + +++ T++ V+FGGSYGG
Sbjct: 92 GDSQP---TRDMSTENFRR--YLSSRQAVADIAEFRTVIAQSMNLTENKWVLFGGSYGGS 146
Query: 195 LAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQ 254
LA W R+K+P++ A+ SSA + V+ Y + +I + + + C K +K ++
Sbjct: 147 LAVWSRIKHPNLFAAAVTSSAMV---QAKVNFYEYFEVIHRALATHNRECLKAVKQAYGF 203
Query: 255 IEETAKKPGGLEKL----------------------QKAFRICKS-----EKNLAIESWL 287
+ PG +L +K IC S K+ ++
Sbjct: 204 VAAMLLLPGYHSRLILDYKFCEPLTIKSEMDQLYIIEKLMLICSSIVLSNRKSNTVKHAF 263
Query: 288 STAFVYTAMTDYPTPS-NFLNP----LPAFPVKEMCKAIDDPKTGN--DVFAKLYG---A 337
TA + A TP NP + + E C+ + + G+ +A++ +
Sbjct: 264 ETALINQANL---TPEVTLFNPNLGEMGEMSIDEFCETMTNTSLGSPYHRYARIINTMIS 320
Query: 338 ASVYYNYSGTAKCF-------DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESE 390
Y Y + K F + + G +W +Q C+E D K+S F
Sbjct: 321 NRRYRCYPASYKQFVEEYSNTSMERNKYRRG-RQWLYQTCSEFGWFYTPDLKNSSFSGLP 379
Query: 391 EDYDARARYCKEAYGVDPRPNWITTEFGGHKI------------GLVLKRFASNIIFFNG 438
Y + C + +G +F H + GL ++ S IIF NG
Sbjct: 380 TRYFVKR--CSDVFG---------PKFNNHSVFQGVMSTNKYYGGLNVR--GSKIIFSNG 426
Query: 439 LRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
DPW G+ K+IS + A+ K A D+ D LK R + + + KW+ +
Sbjct: 427 SNDPWHRLGITKDISADLPAVFIKGEAFCEDMAEPQDTDSAELKQAREKIFQTLKKWLRK 486
>gi|341893031|gb|EGT48966.1| hypothetical protein CAEBREN_19097 [Caenorhabditis brenneri]
Length = 511
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 116/465 (24%), Positives = 191/465 (41%), Gaps = 46/465 (9%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE--WFAQNTGFMY 116
TQ LDHF+ + +TFQQRY N+ + K+ P F+ G EG W + +
Sbjct: 66 TQTLDHFD--SSNGKTFQQRYYHNNQWY---KDGGPAFLMLGGEGPESSYWVSYPGLEIT 120
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK-K 175
++A K A + IEHR+YG++ P + + S YLSS QA+ D A+ I +K K
Sbjct: 121 NLAAKQNAWVFDIEHRFYGETKP------TSDMSVSNLKYLSSAQAIEDAAAFITAMKIK 174
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
++ V FGGSY G LAAW R K+P + A+ SS P+ V + ++
Sbjct: 175 YPMLANAKWVTFGGSYSGALAAWTRAKHPELVYAAVGSSGPV---QAEVDFKEYLEVVQN 231
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK--NLAIESWLSTAFV- 292
S C + + + K G ++L+ AF +C+ K + +++ + T +
Sbjct: 232 SITRNSTACADSVTQGFNLVASLLKTADGRKQLKSAFHLCQDIKLDDKSLKYFWETVYSP 291
Query: 293 YTAMTDY--------PTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL---YGAASVY 341
Y + Y T + + + + I K ND F + +G +
Sbjct: 292 YMEVVQYSGDAAGAFATQLTISHAICRYHLNSGSTTIQKMKQVNDYFNLVNDYFGCNDID 351
Query: 342 YNYSGTAKCFDLNGDSDPHGLSEWGWQACTEM------IMLTGGDNKDSIFEESEEDY-D 394
YN G + W WQ CTE T G + + Y D
Sbjct: 352 YN--GFINFMKDTSFGEAQSDRAWVWQTCTEFGYYQSTASATAGPWFGGVANLPAQYYID 409
Query: 395 ARARYCKEAYGVDPRPNWI--TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
AY N + T ++ G + L R I+ NG DPW G L +
Sbjct: 410 ECTAIYGAAYNTQEVQNAVDYTNQYYGGRDNLNTDR----ILLPNGDIDPWHALGKLNSN 465
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ ++ +V AH D+ ++ D +L + R++ ++ W++
Sbjct: 466 NTNIVPVVINGTAHCADMYGASSLDSIYLTNARQKIANVLAGWLN 510
>gi|148676288|gb|EDL08235.1| dipeptidylpeptidase 7, isoform CRA_b [Mus musculus]
Length = 212
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/162 (38%), Positives = 87/162 (53%), Gaps = 27/162 (16%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW---GGS-------------------- 89
+ Y Q +DHFN+ +TF QR+L++ W GS
Sbjct: 49 FHENYFEQYMDHFNFESFGNKTFGQRFLVSGECWLDRMGSYLRFPGLMETHLLPTDKFWK 108
Query: 90 KNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
PIF YTGNEGDI FA N+GFM ++A + +ALLVF EHRYYGKS+P+G +
Sbjct: 109 MGEGPIFFYTGNEGDIWSFANNSGFMVELAAQQEALLVFAEHRYYGKSLPFG----VQST 164
Query: 150 NASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSY 191
T L+ QALAD+A L+ L+++L D+P + FGG +
Sbjct: 165 QRGYTQLLTVEQALADFAVLLQALRQDLGVHDAPTIAFGGRW 206
>gi|226502825|ref|NP_001146300.1| uncharacterized protein LOC100279875 precursor [Zea mays]
gi|219886553|gb|ACL53651.1| unknown [Zea mays]
Length = 478
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 129/477 (27%), Positives = 199/477 (41%), Gaps = 81/477 (16%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
++ Q LDHF+ P ++ F+QRY D H P+F+ E + + +
Sbjct: 40 RWMNQRLDHFS--PTDHRQFKQRYFEFLDYH---RAPGGPVFLRICGESACDGIPND--Y 92
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL----- 169
+ +A KF A +V EHRYYGKS P+ K++ +N +LSS QAL D A
Sbjct: 93 LAVLAKKFGAAVVTPEHRYYGKSSPF---KQLTTENLR---FLSSKQALFDLAVFRQYYQ 146
Query: 170 -IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS 228
++ + N + D+P V G SY G L AWFRLK+PH+ G+LASS +L + Y+
Sbjct: 147 ESLNARYNRSGFDNPWFVIGVSYSGALNAWFRLKFPHLTCGSLASSGVVL------AVYN 200
Query: 229 FSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQ-KAFRICKSEKNLAIESWL 287
+++ Q S C V+ ++I E + LE KA ++ KN
Sbjct: 201 YTDFDKQVGESAGPECKAVL----QEITELVDEQLRLESHSVKALFGAQTLKN------- 249
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--NYS 345
F++ T + NP P + KA + K + +A+ YY
Sbjct: 250 DGDFLFFLADAAATTFQYGNPDALCP--PLIKAKKNRKNLVEAYAQF---VKDYYIKKME 304
Query: 346 GTAKCFDLN--GDSDPHGLSE--WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
+D ++ PH S W +Q C+E+ DS+ + + C+
Sbjct: 305 TPPSSYDREYLKETTPHDSSSRLWWFQVCSEVAYFQVAPKNDSV-RSARINTRYHLDLCR 363
Query: 402 EAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLAL 459
+G P+ T +GG +I AS I+F NG +DPW K+ SK + +
Sbjct: 364 HVFGEGVYPDVFMTNLYYGGTRIA------ASKIVFTNGSQDPWRHASKQKS-SKDMPSY 416
Query: 460 VEK--EGAHHVDLRFSTKEDPQW----------------LKDVRRREVEIIGKWISQ 498
+ K H DLR PQW + R R + I W+SQ
Sbjct: 417 IMKCRNCGHGTDLRGC----PQWPFRIEGDASNCSSLAVVNTARERIAKHIDLWLSQ 469
>gi|195391900|ref|XP_002054597.1| GJ22719 [Drosophila virilis]
gi|194152683|gb|EDW68117.1| GJ22719 [Drosophila virilis]
Length = 478
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 117/465 (25%), Positives = 196/465 (42%), Gaps = 60/465 (12%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+ K+ TQ LDHF+ + +T++ RY +ND + +PIF++ G E + G
Sbjct: 53 EEKWITQPLDHFD--ESNTKTYEMRYFLNDEF---QTDGSPIFIFLGGEWEASPGMIQQG 107
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
YD+A + +L++ EHRYYG+S+P + ++ +N YL QALAD A I
Sbjct: 108 HWYDMAKEHNGVLIYTEHRYYGESVP---TETMSLENLQ---YLHVKQALADVARFIETF 161
Query: 174 K-KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
K +N T+S V++ GGSY + WF+ YP + +G ASSAP+L V Y + +
Sbjct: 162 KSENAQLTNSKVLLAGGSYSATMVVWFKRLYPDLVVGGWASSAPLL---AKVDFYEYKEV 218
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESW---- 286
+ F + + CY I+ ++E E + RIC S + ++ W
Sbjct: 219 TGRAFLELGGQKCYDRIQNGIAELEYMFDNKRAAEA-RAMLRICSSFDHENDLDMWNLFG 277
Query: 287 -LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA----ASVY 341
+S F A T P + + + DD T FA Y A + +
Sbjct: 278 SISNIFASVAQTQSPGDIEYYC--------DFLLSFDDNATAIANFA--YWAWNYPSCID 327
Query: 342 YNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
Y T + + L G + W +Q C E + + F ++ CK
Sbjct: 328 ARYQETVEYY-LWGIDNFDSSRPWFYQTCNEYGWYQSSRSNNQPF-GNKFPATFNIELCK 385
Query: 402 EAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS--GGGVLKNI 452
+ + ++ +FGG L+ N+ +G DPWS G GV +
Sbjct: 386 DVFSSKFGNEQIESNIAQTNEDFGG------LEPNVENVYMTHGELDPWSAMGQGVAEG- 438
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
A V + +H D + D ++ + R E++ +W++
Sbjct: 439 -----ATVITKASHCTDFGSISSSDSSEMRASKERIAELVREWLA 478
>gi|157133206|ref|XP_001662800.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108870922|gb|EAT35147.1| AAEL012663-PA [Aedes aegypti]
Length = 485
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 120/522 (22%), Positives = 213/522 (40%), Gaps = 75/522 (14%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDH 64
F FL + +S N F + R+ P L S ++ ++ L+ T Q+
Sbjct: 6 FAFLGIAVALASAAITINPSRPVLFGTHRVIPRNLGS--NAESNNTALWNTVNLRQVYT- 62
Query: 65 FNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKA 124
NPQ+ +F RY+ N+ H+ + PIF++ G +E G D+A +
Sbjct: 63 ---NPQNRNSFSMRYVTNNRHY---RRGGPIFLFVGGPWPLEAHLVEQGHFVDMAAEMNG 116
Query: 125 LLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTAT-DSP 183
LV E RYYG+SIP ++++ N YL + Q L++ A+ I LK+++ ++
Sbjct: 117 FLVANELRYYGESIPV---EDVSRNNFR---YLHNVQILSELATFIAHLKEDVVRDPNAK 170
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV-SE 242
V++ G Y LA W R ++PH+ G +SS + N F+ ++ ++ R +
Sbjct: 171 VILAGVGYSASLAQWMRQRFPHLIHGVWSSSGMVRASTNY---REFAEVVGENIRRFGGD 227
Query: 243 NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAF--VYTAMT 297
+CY I +++ E G + + F C+ + L +E++ F + +
Sbjct: 228 DCYSTIWRAFRTAENLIDA-GLSTTVDELFHTCRPIDAANALEVEAFFYGIFNEISREVV 286
Query: 298 DYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL---- 353
D N +K+MC +P T +D L AS A+C +
Sbjct: 287 DADLRGN---------IKQMC----EPLTASDDENSLLELASWLTGRFPNAECLAMDFQS 333
Query: 354 ------NGDSDPHGLSEWGWQACTEMIM-LTGG------------DNKDSIFEESEEDYD 394
N + G +W +Q CTE+ LT D I E+ +D+
Sbjct: 334 IAQWSSNHEIVKSGERQWMFQRCTELGWPLTAASQYQPFGRRFSTDLFHGICEQLFDDWL 393
Query: 395 ARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
R R+ + +GG + + R++ I G DPWS GV + I
Sbjct: 394 TRDRF-------EALIRQTNDYYGGARPDI---RYS---ISTQGTLDPWSFAGVREVIFN 440
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ ++ H D+ ++ED L+ + + I +W+
Sbjct: 441 NTYVTIIRDAFHGQDMASISEEDSIELRRSKEMVRDTIRRWV 482
>gi|308451153|ref|XP_003088564.1| hypothetical protein CRE_09925 [Caenorhabditis remanei]
gi|308246539|gb|EFO90491.1| hypothetical protein CRE_09925 [Caenorhabditis remanei]
Length = 947
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 119/474 (25%), Positives = 191/474 (40%), Gaps = 51/474 (10%)
Query: 50 QGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEW 107
QG ++ K +HF+ Q TFQQ++ N W +K P F+ G EG W
Sbjct: 432 QGTFRQK-----QNHFS--NQDPNTFQQKFFKN-AQW--AKPGGPNFLMIGGEGPEGAGW 481
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
A K+ A + +EHR+YG S+ G N + LSS Q L D A
Sbjct: 482 VLNQDITYLTWAKKYGATVYLLEHRFYGDSV-VGDNTDFQL--------LSSLQMLYDLA 532
Query: 168 SLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
I ++ T T +P + FGGSY G L+AW R +P V +GA+ASS P+ + Y
Sbjct: 533 EFIREINYR-TGTSNPWITFGGSYSGALSAWMREVFPDVVVGAVASSGPVFAKTDF---Y 588
Query: 228 SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES-- 285
+ ++ R+ C I+ + + G + L F++ + ++
Sbjct: 589 EYLMVVENSIRTYDPTCADRIQSGFNTMRTMFLTKEGRKSLSDLFQLDPPFGDNVTDTDQ 648
Query: 286 --WLSTAF-VYTAMTDYPTPSNFLNPLPAFPVKEMCKAI--DDPKTGNDVFAKLYGAASV 340
+ S + + Y + N + + +MCK + DD N++ A S+
Sbjct: 649 HYFFSNVYGNFQGAVQY-SGDNAGAYANGYGIPDMCKIMTNDDNTPLNNIVA-FNQFMSI 706
Query: 341 YYNYSGTAKCFD----------LNGDS---DPHGLSEWGWQACTEMIMLTGGDNKDSIFE 387
+YN G D +N S D W WQ CTE D + IF
Sbjct: 707 FYNGGGNYTGMDNSYQDLVNYLINAQSFGPDAAAGLLWTWQTCTEFGYFQSADTGNGIF- 765
Query: 388 ESEEDYDARARYCKEAYGVDPRPNWITT--EFGGHKIGLVLKRFASNIIFFNGLRDPWSG 445
S + + C + + + N I + ++ + G SN++F NG DPW
Sbjct: 766 GSPTPVNLFVQMCMDVFSSYYQRNTIDSRIDYTNYMYGERYHFRGSNVVFPNGNVDPWHA 825
Query: 446 GGVLK-NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
G+ + +V++ + AH D+ + D L VR R + I W+ Q
Sbjct: 826 LGLYNTSTDPSVVSYLMNGTAHCADMYPARDADVPDLAIVRNRIDQNIAIWLGQ 879
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 72/335 (21%), Positives = 130/335 (38%), Gaps = 34/335 (10%)
Query: 185 VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILN-FDNIVSPYSFSNIITQDFRSVSEN 243
V++G YGG++AA R YP G +ASSAP+ + +D + I Q+ S+
Sbjct: 23 VLWGAGYGGVIAAEARKWYPDTVAGVIASSAPLTHQYDFWQFNSHVAMAIAQEGGSL--- 79
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPS 303
C +++ + I + + P G + F++ + F + +
Sbjct: 80 CSQMVTQGFADIRQAMRTPEGRSNVSDLFQLNPRLDQTNLNYNDIQQFYLAIIAPFQEVI 139
Query: 304 NFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLN--------- 354
F N + ++C +ID K+G ++ A VY++ + T L
Sbjct: 140 QFNNDF-NISIIDLCTSID--KSGWTPMQVIW-QAWVYFSTTVTGSVQPLVTSYQAVIND 195
Query: 355 -GDSDPHG----LSEWGWQACTEMIML-TGGDNKDSIFEESEEDYDARARYCKEAYGVDP 408
GD P+ W +Q CTE T ++ +F + + + + + P
Sbjct: 196 LGDQSPNAQFIDQRMWQYQMCTEFAWFYTTNSDEQGMFGAVV----PASIFLNQCFDLFP 251
Query: 409 RPNWITTEFGGHKI------GLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
N T I G +N++F NG DPW+ G +V+A +
Sbjct: 252 DSNLTPTSIRELVINYNNFYGSAYDYSGTNVVFTNGWYDPWNTLGKENTADFSVVAYLIP 311
Query: 463 EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+G+ D+ F D Q++ R +E I W++
Sbjct: 312 QGSWASDM-FPGDSDNQFIDVAHRLMIENINIWVN 345
>gi|302762512|ref|XP_002964678.1| hypothetical protein SELMODRAFT_63866 [Selaginella moellendorffii]
gi|300168407|gb|EFJ35011.1| hypothetical protein SELMODRAFT_63866 [Selaginella moellendorffii]
Length = 393
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 115/418 (27%), Positives = 182/418 (43%), Gaps = 70/418 (16%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIF-VYTGNEGDIEWFAQNTGFMYDV 118
Q LDHF P+ +TF Q+Y ++ P+F V G +AQ T DV
Sbjct: 1 QKLDHFT--PEDTRTFPQKYFELLDYF--EPQRGPMFLVMCGETSCPGGYAQLTS---DV 53
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
A +F A +V +EHR+YG+S P+ + N YL+ Q+L D+A I +K +
Sbjct: 54 AKEFGAAVVTLEHRFYGESSPF---HNLTVDNLK---YLTIQQSLLDHAEFIAFYQKVIN 107
Query: 179 AT-----DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSPYSFS 230
A D+P +V GGSY G L+AWFRLK+PH+ IG+ ASSA PIL++ S Y
Sbjct: 108 AKFQKDGDNPWLVIGGSYAGALSAWFRLKFPHLVIGSWASSAVVHPILSY----SAYDRQ 163
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
IT C +V++ +E+ + G ++ F + N+ ++++
Sbjct: 164 MGIT-----AGPECKRVLQNVTSIVEKALLENG--TAIKSFFDPNAVKVNVDFLAYVAEI 216
Query: 291 F-----------VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAAS 339
V+ +D + F ++C ++ + N+ KL
Sbjct: 217 IAVAVRKQLQRHVFVLFSDLFRFATIQAQSGRF--NQLCTSVLNASATNNA-TKLLVTKF 273
Query: 340 VYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
V++ S P+ W +Q CTEM + D +F + +
Sbjct: 274 VFHVQS-------------PNYQWAWKYQVCTEMGLFRVSSGPDGLF-SLQINTQYYLDQ 319
Query: 400 CKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
C + +G +P+ TT FGG KI S I+F NGL DPW + +NI+ +
Sbjct: 320 CSQMFGQGIQPDVATTNLLFGGAKIA------GSKIMFLNGLEDPWRHASI-QNITSS 370
>gi|281202572|gb|EFA76774.1| peptidase S28 family protein [Polysphondylium pallidum PN500]
Length = 463
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 122/477 (25%), Positives = 200/477 (41%), Gaps = 75/477 (15%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN- 111
++ +++ Q L+HF+ + +TF QRY +ND ++ K PI +Y EG +
Sbjct: 29 WEYQWYNQTLNHFD--AEDTRTFLQRYYVNDQYYD-YKKGGPIILYINGEGPVSSPPNKP 85
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
T A AL+V +EHRYYG S P+ +++ +N +LSS QAL D A I+
Sbjct: 86 TDGTVIYAQALGALIVTLEHRYYGDSSPFA---DLSTENLK---FLSSRQALNDLAIFIL 139
Query: 172 DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
D + + ++ GGSY G L+AWFR+KYPHV +G++ASS + + I+ +F
Sbjct: 140 DYRSTI-QNAGDIITIGGSYSGALSAWFRVKYPHVTVGSVASSGVV---NAILDFTAFDE 195
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKP---GGLEKLQKAFRICKSEKNLAIESWLS 288
+ + E C + + + TA++ G +++ F+ + WL+
Sbjct: 196 WVA---YAAGEQCADAL----RLVTSTAEQQIFGGNAAAVKQLFQAETLTDDGDFFYWLA 248
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGT- 347
+ Y PL + M D +T ++ ++G GT
Sbjct: 249 DSMA--EGIQYGYHDQLCTPL----INAMNNKGDMLETYSNYTINVWGTT------LGTP 296
Query: 348 ---AKCFDLNGDSDPHGLS-EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEA 403
A F N D + +W +Q CTE + SI + R +C
Sbjct: 297 GEYATLFQQNTTHDINKADRQWWFQTCTEFGYFQNAPAQGSIRSQMVNMTYHRT-HCANV 355
Query: 404 YGVDPRPNWITTE-----FGGHKIGLVLKRFASNIIFFN---------------GLRDPW 443
+G +P W TE +GG+ +NI+F N G +DPW
Sbjct: 356 FG---KPLWPNTEATNDYYGGNHTA------GTNIVFTNVSRKLEIRENNQSPIGSQDPW 406
Query: 444 SGGG--VLKNISKTVLALVEKEGAHHVDLRFSTK--EDPQWLKDVRRREVEIIGKWI 496
S V + ++ L + H VDLR P L VR++ ++II W+
Sbjct: 407 SRASITVQEYPTEPSLMVTCNNCGHCVDLRECPGGCSTPNNLDQVRQQTLKIIETWL 463
>gi|195062821|ref|XP_001996260.1| GH22290 [Drosophila grimshawi]
gi|193899755|gb|EDV98621.1| GH22290 [Drosophila grimshawi]
Length = 633
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 75/236 (31%), Positives = 120/236 (50%), Gaps = 17/236 (7%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
+ ++++ + +T + Q LDHF+ +T+Q RY++ND G K P+F++ G E
Sbjct: 39 LPANQNRADVVQTLWIEQKLDHFD--ESETRTWQMRYMLND---GFFKAGGPMFIFFGGE 93
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQA 162
I G MYD+A + LLV+ EHRYYG+S P +++ +N YL TQA
Sbjct: 94 WTISPGRITGGHMYDMAKEHNGLLVYTEHRYYGESHPL---PDLSNENIQ---YLHVTQA 147
Query: 163 LADYASLIIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFD 221
LAD A I K +DS V++ GGSY + WF+ YP + +G ASSAP+L
Sbjct: 148 LADLAHFITTQKTTYEGLSDSKVIIVGGSYSATMVTWFKKIYPDLVVGGWASSAPLLAKL 207
Query: 222 NIVSPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK 276
N + + I+ Q + +C K I+ ++E G E ++ ++C+
Sbjct: 208 NFL---EYKEIMGQSITLMGGADCNKRIENGIAEMETMFATKRGAE-VKALLKLCE 259
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 80/364 (21%), Positives = 142/364 (39%), Gaps = 49/364 (13%)
Query: 156 YLSSTQALADYASLIIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
YL TQALAD A I K +DS V++ GGSY + WF+ YP + +G ASS
Sbjct: 297 YLHVTQALADLAHFITTQKTTYEGLSDSKVIIVGGSYSATMVTWFKKIYPDLVVGGWASS 356
Query: 215 APILNFDNIVSPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFR 273
AP+L N + + I+ Q + +C K I+ ++E G E ++ +
Sbjct: 357 APLLAKLNFL---EYKEIMGQSITLMGGADCNKRIENGIAEMETMFATKRGAE-VKALLK 412
Query: 274 ICKSEKNLAIESWLSTAFVYTAMTDY---PTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV 330
+C ++ + S L ++ +++ ++ + K M ++ D +
Sbjct: 413 LC---EHFDVYSDLDVWTLFNEISEIFAGVVQGHYAGRIEGVCQKIMAESSDLIGVSKYI 469
Query: 331 FAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE----------WGWQACTEMIMLTGGD 380
+ + SG KC DL+ D+ L E W +Q C E
Sbjct: 470 LDE--------FEKSG-GKCNDLSYDAITTVLLESRYSGNIMRQWIYQTCNEYGWYQTSS 520
Query: 381 NKDSIFEESEEDYDARARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNI 433
+ F ++ C +AYG ++ + +FGG + N+
Sbjct: 521 SSAQPF-GTKFPLALFTTMCADAYGSQYTNSFIEKQVGNTNADFGG------MSPNVQNV 573
Query: 434 IFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIG 493
+G DPW G+ T++ E AH D + D ++ + R E++
Sbjct: 574 YLTHGQLDPWRAMGIQNEAQATII----PEHAHCTDFGSISDRDTAEMRASKERIAELVR 629
Query: 494 KWIS 497
+W++
Sbjct: 630 EWLN 633
>gi|440289971|gb|ELP83425.1| serine carboxypeptidase (S28) family protein, partial [Entamoeba
invadens IP1]
Length = 220
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/215 (31%), Positives = 109/215 (50%), Gaps = 17/215 (7%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYD-VAP 120
+DHFN N + F+ +Y +++ + G+ ++P+FV G EG + ++ D +A
Sbjct: 1 MDHFNANND--EEFEVKYFVSEKYLDGTDLHSPLFVMLGGEGPESSKTLDNHYIIDTLAA 58
Query: 121 KFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTAT 180
+ L++ IEHR+YG S P ++ Y A QA+ DY +I +++
Sbjct: 59 RTNGLMLAIEHRFYGDSTPSLKMDKLIYCTAE--------QAMMDYIEIITYIQETRNFI 110
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
D PV+V GGSY G LAAW R KYP+V GA ASSAP+ + V Y + ++ +
Sbjct: 111 DHPVIVIGGSYSGNLAAWMRQKYPNVVDGAWASSAPV---EAQVDFYQYLEVVQAGLPAN 167
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC 275
+ + + W Q+ T G ++L+K F C
Sbjct: 168 TADLLSIAFEKWDQMTVT---ESGRKELKKVFNTC 199
>gi|195497939|ref|XP_002096313.1| GE25601 [Drosophila yakuba]
gi|195497943|ref|XP_002096315.1| GE25604 [Drosophila yakuba]
gi|194182414|gb|EDW96025.1| GE25601 [Drosophila yakuba]
gi|194182416|gb|EDW96027.1| GE25604 [Drosophila yakuba]
Length = 485
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 112/464 (24%), Positives = 204/464 (43%), Gaps = 58/464 (12%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
++ TQ LD N++ + T+Q R IN+ ++ + +PIF+Y G E I+ +G
Sbjct: 55 RWITQKLD--NFDDSNNATWQDRIYINNKYF---VDGSPIFIYLGGEWAIDPSGITSGLW 109
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
D+A + L++ EHR++G+SIP ++ +N + Y S QALAD ++I LK+
Sbjct: 110 KDIAKQHNGSLLYTEHRFFGESIPI---TPLSTENLAK--YQSVEQALADVINVIATLKQ 164
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL---NFDNIVSPYSFSNI 232
DS VVV G SY +A W R YP + G+ ASSAPIL NF + + +
Sbjct: 165 EDKYKDSKVVVSGCSYSATMATWIRKMYPEIIRGSWASSAPILAKVNFKD------YMKV 218
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL--- 287
+ + + ++ + CY +I + E + G + K +C + + N + W
Sbjct: 219 VGESYATLGGQYCYDLIDNATSYYENLFEIGNGTQA-AKELNLCSNFDVNSDQDRWQIFS 277
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL----YGAASVYYN 343
+ A ++ + Y P + P ++E DD F + A +
Sbjct: 278 TIANIFAGIAQYQKPEKYDIPTYCSILREFS---DDDSVALSKFINWKINEHSGACLSTT 334
Query: 344 YSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKD---------SIFEESEEDYD 394
+ G+ ++ + D+ W +Q C+E ++ S++E++ E
Sbjct: 335 FKGSVGYYEWSKDNYQDSDLPWVFQTCSEFGWFQSSGSRSQPFGSTFPASLYEDTCEGVF 394
Query: 395 ARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS--GGGVLKNI 452
++Y ++ G+ +FGG + A+NI F G D WS G GV +
Sbjct: 395 G-SKY--DSDGIHANVRATNDDFGGLNVN------ATNIYFVQGALDGWSKVGAGVAQG- 444
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
A + +H D + D L +++ ++++G+W+
Sbjct: 445 -----ATIIPYASHCPDTGSISATDSAELVASKKKLIKLVGQWL 483
>gi|156547443|ref|XP_001605073.1| PREDICTED: putative serine protease K12H4.7-like [Nasonia
vitripennis]
Length = 378
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 100/386 (25%), Positives = 163/386 (42%), Gaps = 52/386 (13%)
Query: 144 KEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPV-VVFGGSYGGMLAAWFRLK 202
++++ KN YLSS QALAD A I ++ D+ ++FGGSY G LAAW R K
Sbjct: 13 RDLSVKNLV---YLSSEQALADVAYFIQGMQAAQQLPDTSRWIMFGGSYSGSLAAWMRAK 69
Query: 203 YPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKP 262
YPH+ GA+++S P+L + + I+ + ++ S+ C I + +Q+ +
Sbjct: 70 YPHLVHGAMSASGPLL---AQIDFQQYFIIVEESLKTHSQACVDTIAAAIRQVHIMLRHR 126
Query: 263 GGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPL---------PAFP 313
G + L+K F C + S A +Y + D NF + + +
Sbjct: 127 IGQQGLEKLFNFCDPIDS-GKTSQKDIANLYETLAD-----NFADVVQYNKDNRNESSIT 180
Query: 314 VKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGD-------------SDPH 360
+ +C + D K G V S KC D D
Sbjct: 181 IDMVCDVLVDEKKG--VPVNRLAEVSNMLLAKNKEKCLDYKYDKMISELRNITWAEQKNA 238
Query: 361 GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR------PNWIT 414
G +W +Q CTE + ++F E+ + + C + +G PR P I
Sbjct: 239 GGRQWTYQTCTEFGFFQTSTARPNLFSETFP-VEFFIQQCADIFG--PRFLHTLQPGVIR 295
Query: 415 TE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRF 472
T +G L L SN++F +G DPW GV K+ + A+ AH ++
Sbjct: 296 TNTMYG----ALDLPNIVSNVVFVHGSIDPWHALGVTKSSNPNAPAIFINGTAHCANVYP 351
Query: 473 STKEDPQWLKDVRRREVEIIGKWISQ 498
+K D L D R++ ++IG+W++Q
Sbjct: 352 PSKNDLPQLVDARKQVGQLIGQWLAQ 377
>gi|302766039|ref|XP_002966440.1| hypothetical protein SELMODRAFT_63867 [Selaginella moellendorffii]
gi|300165860|gb|EFJ32467.1| hypothetical protein SELMODRAFT_63867 [Selaginella moellendorffii]
Length = 393
Score = 109 bits (272), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 114/416 (27%), Positives = 180/416 (43%), Gaps = 66/416 (15%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIF-VYTGNEGDIEWFAQNTGFMYDV 118
Q LDHF P+ +TF Q+Y ++ P+F V G +AQ T DV
Sbjct: 1 QKLDHFT--PEDTRTFPQKYFELLDYF--EPQRGPMFLVMCGETSCPGGYAQLTS---DV 53
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
A +F A +V +EHR+YG+S P+ + N YL+ Q+L D+A I +K +
Sbjct: 54 AKEFGAAVVTLEHRFYGESSPF---HNLTVDNLK---YLTIQQSLLDHAEFIAFYQKVIN 107
Query: 179 AT-----DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSPYSFS 230
A D+P +V GGSY G L+AWFRLK+PH+ IG+ ASSA PIL++ S Y
Sbjct: 108 AKFQKDGDNPWLVIGGSYAGALSAWFRLKFPHLVIGSWASSAVVHPILSY----SAYDRQ 163
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
IT C +V++ +E+ + G ++ F + N+ ++++
Sbjct: 164 MGIT-----AGPECKRVLQNVTSIVEKALLENG--TAIKSFFDPNAVKVNVDFLAYVAEI 216
Query: 291 FVYTAMTDYPTPSNFLNP-LPAFPV--------KEMCKAIDDPKTGNDVFAKLYGAASVY 341
+ L+ L F ++C ++ + N+ KL ++
Sbjct: 217 IAVAVRKELQRHVFVLSSDLFRFATIQAQSGRFNQLCTSVLNASATNNA-TKLLVTKFIF 275
Query: 342 YNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
+ S P+ W +Q CTEM + D +F + + C
Sbjct: 276 HVQS-------------PNYQWAWKYQVCTEMGLFRVSSGPDGLF-SLQINTQYYLDQCS 321
Query: 402 EAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
+ +G RP+ TT FGG KI S I+F NG DPW + +NI+ +
Sbjct: 322 QMFGQGIRPDVTTTNLLFGGAKIA------GSKIMFLNGSEDPWRHASI-QNITSS 370
>gi|308478052|ref|XP_003101238.1| CRE-PCP-3 protein [Caenorhabditis remanei]
gi|308263943|gb|EFP07896.1| CRE-PCP-3 protein [Caenorhabditis remanei]
Length = 1095
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 120/482 (24%), Positives = 193/482 (40%), Gaps = 67/482 (13%)
Query: 50 QGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEW 107
QG ++ K +HF+ Q TFQQ++ N W +K P F+ G EG W
Sbjct: 578 QGTFRQK-----QNHFS--NQDPNTFQQKFFKN-AQW--AKPGGPNFLMIGGEGPEGAGW 627
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
A K+ A + +EHR+YG S+ G N + LSS Q L D A
Sbjct: 628 VLNQDITYLTWAKKYGATVYLLEHRFYGDSV-VGDNTDFQL--------LSSLQMLYDLA 678
Query: 168 SLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
I ++ T T +P + FGGSY G L+AW R +P V +GA+ASS P+ + Y
Sbjct: 679 EFIREINYR-TGTSNPWITFGGSYSGALSAWMREVFPDVVVGAVASSGPVFAKTDF---Y 734
Query: 228 SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES-- 285
+ ++ R+ C I+ + + G + L F++ + ++
Sbjct: 735 EYLMVVENSIRTYDPTCADRIQSGFNTMRTMFLTKEGRKSLSDLFQLDPPFGDNVTDTDQ 794
Query: 286 --WLSTAF-VYTAMTDYPTPSNFLNPLPAFPVKEMCKAI--DDPKTGNDVFAKLYGAASV 340
+ S + + Y + N + + +MCK + DD N++ A S+
Sbjct: 795 HYFFSNVYGNFQGAVQY-SGDNAGAYANGYGIPDMCKIMTNDDNTPLNNIVA-FNQFMSI 852
Query: 341 YYNYSGTAKCFD----------LNGDS---DPHGLSEWGWQACTEMIMLTGGDNKDSIFE 387
+YN G D +N S D W WQ CTE D + IF
Sbjct: 853 FYNGGGNYTGMDNSYQDLVNYLINAQSFGPDAAAGLLWTWQTCTEFGYFQSADTGNGIF- 911
Query: 388 ESEEDYDARARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRF---ASNIIFFN 437
S + + C + + +D R ++ +G +R+ SN++F N
Sbjct: 912 GSPTPVNLFVQMCMDVFSSYYQRSTIDSRIDYTNYMYG--------ERYHFRGSNVVFPN 963
Query: 438 GLRDPWSGGGVLK-NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
G DPW G+ + +V++ + AH D+ + D L VR R + I W+
Sbjct: 964 GNVDPWHALGLYNTSTDPSVVSYLMNGTAHCADMYPARDADVPDLAIVRNRIDQNIAIWL 1023
Query: 497 SQ 498
Q
Sbjct: 1024 GQ 1025
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 118/511 (23%), Positives = 194/511 (37%), Gaps = 68/511 (13%)
Query: 23 AKIFPTFPSSRITPEKLSSLISSSKDSQGL-YKTKYHTQILDHFNYNPQSYQTFQQRYLI 81
A F P+ E L+ +S+ + + T Y LD F N TF QRY
Sbjct: 13 ATAFSILPNHYHFKEYLNKAAKNSRGNDAISVTTGYLNTPLDQFVGNASG--TFSQRYFY 70
Query: 82 NDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYD-------VAPKFKALLVFIEHRYY 134
+ + + F+Y GD E T + D A +F A + +EHRYY
Sbjct: 71 TRQY---ALHQKVAFLYVSVSGDFE-----TSVITDERNPIVITAKQFGATVFSLEHRYY 122
Query: 135 GKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI--IDLKKNLTATDSPVVVFGGSYG 192
G S P N +T +L+S QA+ D + I +++ N+ D +++G YG
Sbjct: 123 GGSKP-----NFDKFNGTTLRHLNSYQAIMDLNAFIKYANVQFNMDP-DCRWILWGAGYG 176
Query: 193 GMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF----SNIITQDFRSVSENCYKVI 248
G++AA R YP G +ASSAP+ + Y F ++ T + CY+ +
Sbjct: 177 GIIAAEARKWYPDTVAGVIASSAPLTH------QYDFWQFNDHVQTAIMQEGGSLCYQKV 230
Query: 249 KGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNP 308
+ I + + P G + F++ + F + + F N
Sbjct: 231 AQGFADIRQAMRTPEGRSNVSDLFQLNPRLDQTNLNYNDIQQFYLAIIAPFQEVIQFNND 290
Query: 309 LPAFPVKEMCKAIDD-PKTGNDVFAKLYGAASVYYNYSGTAKCFDLN----------GDS 357
+ ++C +ID P T V + A VY++ + T L GD
Sbjct: 291 F-NISIIDLCTSIDKGPWTPMQVIWQ----AWVYFSTTVTGSVQPLVTSYQAVINDLGDQ 345
Query: 358 DPHG----LSEWGWQACTEMI-MLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNW 412
P+ W +Q CTE T +N+ +F + + + + + P N
Sbjct: 346 SPNAQFIDQRMWQYQMCTEFAWFYTTNNNEQGMFGAVV----PSSIFLNQCFDLFPDSNL 401
Query: 413 ITTEFGGHKI------GLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
T I G +N++F NG DPWS G + +V+ V +G+
Sbjct: 402 TPTSIRELVINYNNFYGSAYDYSGTNVVFTNGWYDPWSTLGKEFSADFSVVTYVIPQGSW 461
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
D F D ++ R +E I W++
Sbjct: 462 ASDF-FPGDSDNMFINTAHRLMIENINIWVN 491
>gi|326502560|dbj|BAJ95343.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 490
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 124/475 (26%), Positives = 193/475 (40%), Gaps = 72/475 (15%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ ++ +Q LDHF+ ++ F+QRY D H P+F+ E + +
Sbjct: 50 EERWMSQRLDHFS--SSDHRQFKQRYFEFLDYH---DDPTGPVFLRICGESSCDGIPND- 103
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
++ +A KF A +V EHRYYGKS P+ +LSS QAL D A
Sbjct: 104 -YLAVIAKKFGAAVVTPEHRYYGKSSPFDS------LTTDNLRFLSSKQALFDLAVFRQY 156
Query: 173 LKKNLTAT-------DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
++ L + D+P VFG SY G L+AWFRLK+PH+ G+LASS +L +
Sbjct: 157 YQEKLNSRYNRSAGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGVVL------A 210
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES 285
Y+F++ Q S C ++ + ++ K+ KA S KN
Sbjct: 211 VYNFTDFDKQVGDSAGPECKAALQEITRLVD---KQLLSDSHSVKALFGADSLKN----- 262
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA-ASVYY-- 342
F++ T + NP +C + + K + + Y YY
Sbjct: 263 --DGDFLFLLADAAATTFQYGNP------DALCSPLANAKKKGESLVETYAHFVKDYYIK 314
Query: 343 NYSGTAKCFDLN--GDSDPHGLSE--WGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
T +D ++ P S W +Q C+E+ DS+ ++ D
Sbjct: 315 KLGTTVSSYDQEYLKETTPDDSSSRLWWFQVCSEVAYFQVAPKNDSV-RSAQIDTRYNLD 373
Query: 399 YCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTV 456
CK YG P+ T +GG I AS I+F NG +DPW K+
Sbjct: 374 LCKNVYGEGVYPDVFMTNLYYGGTSIA------ASKIVFTNGSQDPWRHASKQKSSEGMP 427
Query: 457 LALVE-KEGAHHVDLR------FSTKED------PQWLKDVRRREVEIIGKWISQ 498
+++ H DLR F + D P+ + VR++ V+ I W+SQ
Sbjct: 428 SYIIKCSNCGHGTDLRGCPQLPFRIEGDSSNCTSPEAVNTVRKQIVKHIDLWLSQ 482
>gi|349805101|gb|AEQ18023.1| hypothetical protein [Hymenochirus curtipes]
Length = 294
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 101/335 (30%), Positives = 160/335 (47%), Gaps = 57/335 (17%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y Q LDHFN + T++QRY +N+ HW + + P+F+Y G EG + F+ +G
Sbjct: 1 YIAQPLDHFNRRNNA--TYRQRYWVNEEHW--RQPDGPVFLYIGGEGSLSEFSVLSGEHV 56
Query: 117 DVAPKFKALLVFIEHRYYGKSI-PYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
++A +ALLV +E +YG SI P G E + +LSS QALAD AS + +
Sbjct: 57 ELAQTHRALLVSLE-CFYGSSINPDGMTLE-------SLKFLSSQQALADLASFHLFISH 108
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFD--NIVSPYSFS 230
T + + FGGSY G L+AWF LK+PH+ ++ASSAP+ LNF N V +S +
Sbjct: 109 KYNLTRNTWICFGGSYPGSLSAWF-LKFPHLVYASVASSAPVRAELNFTGYNKVVAWSLA 167
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
+++ SE C L+ + K F++ E+ ++ T
Sbjct: 168 DLVIGG----SEKC--------------------LDAVIKGFQVGVGERQWYYQT--CTG 201
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA-----KLYGAASVYYNYS 345
F Y + P+ F L ++C + T + + + + YGA +
Sbjct: 202 FGYYQTCEDPS-CPFSTLLTLQSQLDLCSQVFQVPTESVLQSVQFTNEFYGA-----DRP 255
Query: 346 GTAKCFDLNGDSDP-HGLSEWGWQACTEMIMLTGG 379
+++ +NGD DP H LS Q+ +E+ +L G
Sbjct: 256 KSSRIIFVNGDVDPWHALSVLKNQSHSEIAILING 290
>gi|66816525|ref|XP_642272.1| hypothetical protein DDB_G0278299 [Dictyostelium discoideum AX4]
gi|60470342|gb|EAL68322.1| hypothetical protein DDB_G0278299 [Dictyostelium discoideum AX4]
Length = 635
Score = 108 bits (271), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 103/415 (24%), Positives = 167/415 (40%), Gaps = 35/415 (8%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q ++H +Y+ + TF+QR+ +N + +F EG + N +A
Sbjct: 78 QTINHLSYD--TIGTFEQRFSVNKKFVPINGKPKAVFFLVSGEGPLSSEIVNHNPFVQIA 135
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
+ +AL+V +E RYYG+S+P+ N S YL++ Q L D A+ +
Sbjct: 136 NETQALIVALELRYYGESMPFLN------MNNSNMAYLTTDQILEDLATFQVFFTNKYQL 189
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRS 239
D ++ G SY G ++AW+RLKYPH+ A+ASS+P + F+ + ++
Sbjct: 190 NDIKWIIMGCSYAGTISAWYRLKYPHLVTAAIASSSPFR------AELRFTEYDVKVRQN 243
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTD- 298
+ C K K + IE K K F + + LS A Y+ D
Sbjct: 244 LGAPCSKAFKNLFSYIEHLMIKNNSYVK--SKFTCERQLDDRMFLYLLSEALTYSVQYDA 301
Query: 299 -YPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL---- 353
+ S F P F K + + D+F+ ++ N S A ++L
Sbjct: 302 RFKIISGF---CPKF-----VKLTNSSEALLDMFSSYVKNMFLFQNVSCDA--YNLYEFA 351
Query: 354 NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWI 413
+ + D G W WQ C E DS +S + + CK YG RP
Sbjct: 352 SNEIDYSGTRSWTWQLCREYGWFMVPSGPDSFKPQSLGECWWQNDVCKTLYGRAMRP--- 408
Query: 414 TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHV 468
T + G ++ SN++F N DPWS + ++S + G H
Sbjct: 409 TVDRINMVYGSTNFKYISNVLFTNCGNDPWSTLSIDPSVSLPFSQQIHIPGESHC 463
>gi|330846607|ref|XP_003295109.1| hypothetical protein DICPUDRAFT_160267 [Dictyostelium purpureum]
gi|325074265|gb|EGC28366.1| hypothetical protein DICPUDRAFT_160267 [Dictyostelium purpureum]
Length = 547
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 109/444 (24%), Positives = 176/444 (39%), Gaps = 36/444 (8%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q ++H +Y+ + TF+QRY +N + +F EG + N ++A
Sbjct: 11 QTINHLSYD--TIGTFEQRYSVNKKFLASNGKPKAVFFLVSGEGPLSSEIVNHNPFVNIA 68
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
+ AL+V +E RYYG+S+P+ N S YL++ Q L D A+ +
Sbjct: 69 NETNALIVALELRYYGESMPFPN------MNNSNMAYLTTDQILEDLANFQVYFTNKYQL 122
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRS 239
D ++ G SY G ++AW+RLKYPH+ A+ASS+P + F+ + ++
Sbjct: 123 GDIKWIIMGCSYAGTISAWYRLKYPHLVTAAIASSSPFR------AELRFTEYDVKVRQN 176
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTD- 298
+ C K K + IE K K F + + LS A Y+ D
Sbjct: 177 LGAPCSKAFKNLFAYIEHLMLKNNSYVK--SKFTCERQLDDRMFLYLLSEALTYSVQYDA 234
Query: 299 -YPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL---- 353
+ S+F P F K + D+F+ ++ N S A ++L
Sbjct: 235 RFKIISSF---CPKF-----VKFTNSSDDLLDMFSAYVKNMFLFQNVSCDA--YNLYEFA 284
Query: 354 NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWI 413
+ D D G W WQ C E +S + + + CK YG RP
Sbjct: 285 SNDIDYSGTRSWTWQLCREYGWFMVPSGPESFKPQQLGECWWQNDVCKTLYGRAMRP--- 341
Query: 414 TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFS 473
T + G ++ SN++F N DPWS + N+ + G H S
Sbjct: 342 TVDRINMVYGSTNFKYISNVLFTNCGNDPWSTLSIDPNLVLPFSQQIYIPGESHCANWLS 401
Query: 474 TK-EDPQWLKDVRRREVEIIGKWI 496
+ D LK+ R + ++I
Sbjct: 402 EQPNDSLELKNARSLANSFLRQFI 425
>gi|2961378|emb|CAA18125.1| putative protein [Arabidopsis thaliana]
gi|7270572|emb|CAB80290.1| putative protein [Arabidopsis thaliana]
Length = 852
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 131/488 (26%), Positives = 202/488 (41%), Gaps = 97/488 (19%)
Query: 8 LSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKD---------SQGLYKTKYH 58
L F LL + T+ +S + +F + + P ++S +S S + L Y
Sbjct: 5 LGFALLTTFTVLLS----YLSFSNGLLQPRRISHGLSKSSNFCFSFVIEIQNSLQNPSYQ 60
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
Q+ F Q ++ F+QRY H + PIF+ EG N ++ +
Sbjct: 61 FQL---FLILFQDHRKFRQRYYEYLDHL--RVPDGPIFLMICGEGPCNGITNN--YISVL 113
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL--------- 169
A KF A +V +EHRYYGKS P+ K +A KN YLSS QAL+D A+
Sbjct: 114 AKKFDAGIVSLEHRYYGKSSPF---KSLATKNLK---YLSSKQALSDLATFRQYYQATCF 167
Query: 170 --------IIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
+++K N ++ ++P FG SY G L+AWFRLK+PH+ G+LASSA +
Sbjct: 168 GICLWMQDSLNVKFNRSSNVENPWFFFGVSYSGALSAWFRLKFPHLTCGSLASSAVVR-- 225
Query: 221 DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN 280
+ Y F Q S C ++ + K +E GL+ +A + +
Sbjct: 226 ----AVYEFPEFDQQIAESAGPECETALQETNKLLEL------GLKVNNRAVKALFNATE 275
Query: 281 LAIES---WLSTAFVYTAM----TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV--- 330
L +++ +L AM Y P PL ++ K G D+
Sbjct: 276 LDVDADFLYLIADAGVMAMFIWQIQYGNPDKLCVPL-----------VEAQKNGGDLVEA 324
Query: 331 FAK--------LYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNK 382
+AK ++G +S Y+ +D W +Q CTE+
Sbjct: 325 YAKYVREFCMGVFGQSSKTYSRKHLLDTAVTLESAD----RLWWFQVCTEVAYFQVAPAN 380
Query: 383 DSIFEESEEDYDARARYCKEAY--GVDPRPNWITTEFGGHKIG-----LVLKRFASNIIF 435
DSI + + + CK + GV P + +G KI L+L A+ IIF
Sbjct: 381 DSI-RSHQINTEYHLDLCKSLFGKGVYPEVDATNLYYGSDKIAVSSHLLILNFVATKIIF 439
Query: 436 FNGLRDPW 443
NG +DPW
Sbjct: 440 TNGSQDPW 447
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 100/200 (50%), Gaps = 29/200 (14%)
Query: 70 QSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFI 129
+ ++ F+QRY H + PIF+ EG + ++ +A KF A +V +
Sbjct: 500 KDHREFKQRYYEYLDHL--RVPDGPIFMMICGEGPCNGIPND--YITVLAKKFDAGIVSL 555
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA----------SLI--IDLKKNL 177
EHRYYGKS P+ K +A +N YLSS QAL D A SL+ +++K N
Sbjct: 556 EHRYYGKSSPF---KSLATENLK---YLSSKQALFDLAAFRQYYQASTSLMDSLNVKFNR 609
Query: 178 TA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
+ ++P FG SY G L+AWFRLK+PH+ G+LASSA + + Y F Q
Sbjct: 610 SGDVENPWFFFGASYSGALSAWFRLKFPHLTCGSLASSAVVR------AVYEFPEFDQQI 663
Query: 237 FRSVSENCYKVIKGSWKQIE 256
S C ++ + K +E
Sbjct: 664 GESAGPECKAALQETNKLLE 683
>gi|297832460|ref|XP_002884112.1| hypothetical protein ARALYDRAFT_480713 [Arabidopsis lyrata subsp.
lyrata]
gi|297329952|gb|EFH60371.1| hypothetical protein ARALYDRAFT_480713 [Arabidopsis lyrata subsp.
lyrata]
Length = 417
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 117/444 (26%), Positives = 185/444 (41%), Gaps = 80/444 (18%)
Query: 92 NAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
+ P+F+ EG A + ++ +A KF+A +V +EHRYYGKS P+ +A +N
Sbjct: 8 DGPLFMIICGEGPCSGIAND--YINVLAKKFQAGVVSLEHRYYGKSSPFNS---LATENL 62
Query: 152 STTGYLSSTQALADYA--------SLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKY 203
YLSS QAL D A SL + L + ++P FG SY G L+AWFRLK+
Sbjct: 63 K---YLSSKQALFDLAAFRQYYQESLNVKLNMSNGGNENPWFFFGISYSGALSAWFRLKF 119
Query: 204 PHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPG 263
PH+ G+LASSA + + Y F+ Q S + C ++ + K +E
Sbjct: 120 PHLTCGSLASSAVVR------AVYEFTEFDQQIGESAGQECKGALQETNKLLEL------ 167
Query: 264 GLEKLQKAFRICKSEKNLAIES--WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI 321
GL++ +KA + + L +++ TA Y P PL +
Sbjct: 168 GLKENRKAVKSLFNATELDVDADFLYLTADAAVMAFQYGNPDKLCVPL-----------V 216
Query: 322 DDPKTGNDVF-----------AKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQAC 370
+ K G+D+ + +G YN + DS W +Q C
Sbjct: 217 EAKKNGSDLVETYSKYVREYCMRFWGLRVRTYNRKHLRNTV-VTADS---AYRLWWFQVC 272
Query: 371 TEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY--GVDPRPNWITTEFGGHKIGLVLKR 428
TE+ DS+ + + CK + GV P+ + +GG ++
Sbjct: 273 TELGYFQVAPKNDSV-RSQQINTMFHLDLCKSLFGEGVYPKVDATNLYYGGDRLT----- 326
Query: 429 FASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK--EGAHHVDLRFSTKED--------- 477
A+ IIF NG DPW +N S + + + K H D+R +
Sbjct: 327 -ATKIIFTNGSEDPWRHASK-QNSSHEMPSYIIKCRNCGHGTDIRGCPQSPMVIEGKSNN 384
Query: 478 ---PQWLKDVRRREVEIIGKWISQ 498
P ++ VR++ VE I W+S+
Sbjct: 385 CSLPDYVNKVRQQMVEHIDLWLSE 408
>gi|170045812|ref|XP_001850488.1| lysosomal pro-X carboxypeptidase [Culex quinquefasciatus]
gi|167868716|gb|EDS32099.1| lysosomal pro-X carboxypeptidase [Culex quinquefasciatus]
Length = 466
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 125/481 (25%), Positives = 191/481 (39%), Gaps = 61/481 (12%)
Query: 51 GLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQ 110
G+ + + +DHF+ P++ TF RY ND H + PIFV G+ G IE
Sbjct: 19 GMVREGWFETRVDHFS--PRNMDTFSMRYYSNDEH---AYAKGPIFVIVGSNGPIETRYL 73
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG--YLSSTQALADYAS 168
G YD A A L EHRY+G S+P +AST +L+ QALAD A+
Sbjct: 74 REGLFYDTAYLEGAYLFANEHRYFGHSLP--------VDDASTENLDFLTVDQALADLAA 125
Query: 169 LIIDLKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPY 227
I L+ + + V++ G YGG LA WF ++PH++ G SS N D + Y
Sbjct: 126 WIHHLRHEVVGNPQAKVILMGWGYGGSLATWFHTQFPHLSDGVWVSSGNN-NADLNLPEY 184
Query: 228 --SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES 285
S N I + +CY I S+ + + + E L + F +C
Sbjct: 185 MESLGNTIGE---FGGRDCYSTIFSSFL-VAQNLIELDRSELLTEMFHLCDDLN--TDNR 238
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYS 345
W +TAF+ D ++ MC I DP+ GN LY + +
Sbjct: 239 WDTTAFLLGLQRD--IEDEMMHLRNTMSTTYMCVNIQDPEIGN----SLYSLRNWFAREH 292
Query: 346 GTAKCFDLNGDSDPHGLSEWGWQ--------------ACTEMIMLTGGDNKDSIFEESEE 391
+C +L D L E + CT + D+ F ++
Sbjct: 293 QYEQCVNLGFDHYMAPLLETNFDDEDLQAGHRQRLYLQCTALGFFPTTDSMYQPF-GTQI 351
Query: 392 DYDARARYCKEAYGVDPRPNWIT-------TEFGGHKIGLVLKRFASNIIFFNGLRDPWS 444
D D C++A+G P I+ FGG + + +N+ F +G DP
Sbjct: 352 DSDFYVEVCQQAFGF-PTEESISQGVFRTNARFGGRQPDI------ANVHFTHGDIDPMM 404
Query: 445 GGGVLKNISKTVLALVEKEGAHHVDLR-FSTKEDPQWLKDVRRREVEIIGKWISQYFQDL 503
G+ +++ A V DL + D L + + R +I WI Q F+ +
Sbjct: 405 LTGITSDLNDDAPATVIPNTFFAPDLESIDYENDSPELIEAKERTRTLIDIWIYQGFEPI 464
Query: 504 A 504
A
Sbjct: 465 A 465
>gi|26449329|dbj|BAC41792.1| putative prolyl carboxypeptidase [Arabidopsis thaliana]
Length = 281
Score = 108 bits (269), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 88/257 (34%), Positives = 134/257 (52%), Gaps = 32/257 (12%)
Query: 8 LSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTK-YHTQILDHFN 66
L F LL + T+ +S + +F + + P ++S +S K S+ L + + + TQ LDH
Sbjct: 5 LGFALLTTFTVLLS----YLSFSNGLLQPRRISHGLS--KSSKYLTRDELWFTQTLDH-- 56
Query: 67 YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALL 126
Y+P ++ F+QRY H + PIF+ EG N ++ +A KF A +
Sbjct: 57 YSPSDHRKFRQRYYEYLDHL--RVPDGPIFLMICGEGPCNGITNN--YISVLAKKFDAGI 112
Query: 127 VFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IIDLKKNLTA- 179
V +EHRYYGKS P+ K +A KN YLSS QAL+D A+ +++K N ++
Sbjct: 113 VSLEHRYYGKSSPF---KSLATKNLK---YLSSKQALSDLATFRQYYQDSLNVKFNRSSN 166
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRS 239
++P FG SY G L+AWFRLK+PH+ G+LASSA + + Y F Q S
Sbjct: 167 VENPWFFFGVSYSGALSAWFRLKFPHLTCGSLASSAV------VRAVYEFPEFDQQIAES 220
Query: 240 VSENCYKVIKGSWKQIE 256
C ++ + K +E
Sbjct: 221 AGPECETALQETNKLLE 237
>gi|308485479|ref|XP_003104938.1| hypothetical protein CRE_24446 [Caenorhabditis remanei]
gi|308257259|gb|EFP01212.1| hypothetical protein CRE_24446 [Caenorhabditis remanei]
Length = 510
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 116/468 (24%), Positives = 189/468 (40%), Gaps = 54/468 (11%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE--WFAQNTGFMY 116
TQ LDHF+ + +TF+QRY N+ + K+ P F+ G EG W + M
Sbjct: 64 TQTLDHFDSSVG--KTFKQRYWHNNQWY---KDGGPAFLMLGGEGPESSYWVSYPGLEMT 118
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
++A K A + IEHR+YG++ P + + S YLSS QA+ D A+ I K
Sbjct: 119 NLAAKQGAWVFDIEHRFYGETKP------TSDMSVSNLKYLSSAQAIEDAAAFI----KA 168
Query: 177 LTA-----TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+TA ++ V FGGSY G LAAW R K+P + A+ SS P+ V +
Sbjct: 169 MTAQYPQLANARWVTFGGSYSGALAAWTRAKHPELVYAAVGSSGPV---QAEVDFKEYLE 225
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAIESWL 287
++ S +C + + + K G ++L+ AF +C+ +K+L W
Sbjct: 226 VVQNSISRNSTDCATSVTAGFNLVASLLKTTDGRKQLKTAFHLCQDIQLDDKSLKY-FWE 284
Query: 288 STAFVYTAMTDY--------PTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF---AKLYG 336
+ Y + Y T + + + + I K ND F + +G
Sbjct: 285 TVYSPYMEVVQYSGDAAGSFATQLTISHAICRYHNNKATTTIQKMKQVNDYFNLVNEYFG 344
Query: 337 AASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEM------IMLTGGDNKDSIFEESE 390
+ YN G + W WQ CTE T G +
Sbjct: 345 CNDIDYN--GFISFMKDTSFGEAQSDRAWVWQTCTEFGYYQSTASATAGPWFGGVANLPA 402
Query: 391 EDYDARARYCKEAYGVDPRPNWITT--EFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV 448
+ Y C YG + T ++ G K + I+ NG DPW G
Sbjct: 403 QYYIDE---CTAIYGAAYNSQEVQTSVDYTNQYYGGRDKLNTARILLPNGDIDPWHALGK 459
Query: 449 LKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
L + + ++ +V AH D+ ++ D +L + R++ +++ W+
Sbjct: 460 LTSANSDIVPVVINGTAHCADMYGASSLDSVYLTNARQKISDVLDGWL 507
>gi|308452876|ref|XP_003089215.1| hypothetical protein CRE_21658 [Caenorhabditis remanei]
gi|308241640|gb|EFO85592.1| hypothetical protein CRE_21658 [Caenorhabditis remanei]
Length = 971
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 114/467 (24%), Positives = 204/467 (43%), Gaps = 42/467 (8%)
Query: 48 DSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE- 106
D++GL + Q +DHFN + FQQ+Y N ++ P F+ G E
Sbjct: 263 DTEGLEIGMFR-QRIDHFNNKNTKF--FQQKYFKNSRF---ARPGGPNFLMIGGESPAHG 316
Query: 107 -WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
+ + A ++ A++ +EHR+YG S+ +N + LSS Q L D
Sbjct: 317 SHVKNLSSAIMRRAKEYGAIVYLLEHRFYGDSV---------VENNTDLTTLSSLQMLYD 367
Query: 166 YASLI--IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
A I ++ K + T +P + FGGSY G L+AW R +P + IGA+ASSAP+L +
Sbjct: 368 IAEFIKSVNFK---SETSNPWITFGGSYPGALSAWMREIFPDLVIGAIASSAPVLAKTDF 424
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAI 283
Y + ++ F CY+ IK + +I E + G EKL F++ ++ I
Sbjct: 425 ---YEYMMVVENSFLIYDPACYQEIKNGFDEIHELFQTDSGREKLSDLFKLNPPFRD-NI 480
Query: 284 ESWLSTAFVYTAMTDYPTPSNFLNPLP-AF----PVKEMCKAIDD-PKTGNDVFAKL--- 334
F + + + + AF + +C+ I + ++ + AK+
Sbjct: 481 SEIDKHFFFFDIIGPFQGAVQYAGGGSGAFEENSEIAMLCRNITNGTQSSAENVAKVVLD 540
Query: 335 -YGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDY 393
+G S+ +++ + + + + L +W Q C+E D+ +SIF +
Sbjct: 541 DFGNKSLIHSFYDKNEWKKMKKKNRDY-LWKW--QTCSEFGYFQSADSGNSIF-GAMNPV 596
Query: 394 DARARYCKEAYGVDPRPNWI--TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN 451
+ + C + +G + I E ++ G V +N++F NG DPW G+ +
Sbjct: 597 SFQVQQCMDMFGKEYTRGKIEENVEATNYRYGGVDGFRGTNVVFINGDVDPWHILGLYNS 656
Query: 452 ISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
K+V++ + +H VD+ D +K R+ + I W+ Q
Sbjct: 657 TEKSVVSYLINGTSHCVDMYPPQDNDIDGVKIARKLVDDNIKVWLEQ 703
>gi|195062805|ref|XP_001996257.1| GH22293 [Drosophila grimshawi]
gi|193899752|gb|EDV98618.1| GH22293 [Drosophila grimshawi]
Length = 481
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 127/520 (24%), Positives = 224/520 (43%), Gaps = 66/520 (12%)
Query: 4 RFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLS-----SLISSSKDSQGLYKTKYH 58
R I L LL + + P+S T ++L LIS + + T++
Sbjct: 2 RHILLVIALLAPLASSAKLSSDVSKIPASVRTLKELHRGPPMQLISKRANVE----TRWI 57
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
+Q LD+F+ + + + + R LIN+ H+ + +PIF+Y G E +IE G D+
Sbjct: 58 SQKLDNFDEDNE--EVWDDRVLINEDHF---VDGSPIFIYLGGEWEIEPSPITAGHWVDI 112
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
A + LV+ EHR++G+S+P I + Y + QALAD ++I LK+
Sbjct: 113 ASEHNGSLVYTEHRFFGQSVP------IKPLTTANLKYQNVEQALADVVNVINVLKEEEK 166
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
+S VVV G SY +A W + YP V +G+ ASSAP+ V ++ ++ Q +R
Sbjct: 167 YKNSKVVVQGCSYSATMAVWIKKLYPDVIVGSWASSAPL---QAKVDFKAYMKVVGQAYR 223
Query: 239 SVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL---STAFVY 293
+ + CY +I + E+ + G + +K +C + +N + W + A +
Sbjct: 224 ELGGDYCYNIIDNATSFYEDLFEN-GQNAEAKKLLNLCDNFNENDQHDQWQIFSTIANIL 282
Query: 294 TAMTDYPTPSNF-----LNPLPAFPVKEMCK-------AIDDPKTGNDVFAKLYGAASVY 341
+ Y P+N+ + L +F + +D+P+ N V Y A Y
Sbjct: 283 AGLAQYQNPANYDLAKHCSVLRSFSTDDATALSKFIQWRLDNPECVNTV----YKATVKY 338
Query: 342 YNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR--ARY 399
Y ++ + +G S LS W +Q C E +K+ F S + A
Sbjct: 339 YKWA----MHNYDGSS----LS-WFFQTCNEFGWYQSSGSKNQPFGSS---FPATLYTDT 386
Query: 400 CKEAYGVDPRPNWITTEFG-GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA 458
CK+ +G I +K+ + N+ +G DPW G T++
Sbjct: 387 CKDVFGSKYTAAKIEKYISEKNKVYGGVNPNVENVYMTHGGLDPWHPVGAGAAQGATIIP 446
Query: 459 LVEKEGAHHVDL-RFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ +H D+ S K+ P L ++R +++ +W++
Sbjct: 447 ----QASHCSDMGSISAKDSPAMLAS-KQRVAQLVREWLA 481
>gi|170574564|ref|XP_001892869.1| Serine protease Z688.6 precursor [Brugia malayi]
gi|158601363|gb|EDP38291.1| Serine protease Z688.6 precursor, putative [Brugia malayi]
Length = 108
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 53/99 (53%), Positives = 68/99 (68%), Gaps = 5/99 (5%)
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
M+D+AP+F A ++F EHR+YGKS P+G +Y GYLSS QAL D+A LI LK
Sbjct: 1 MWDLAPEFNAAIIFAEHRFYGKSQPFGNE---SYATIRNLGYLSSEQALGDFALLIYHLK 57
Query: 175 --KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGAL 211
+ L A +S V+ FGGSYGGMLAAW R+KYPH+ G+
Sbjct: 58 NKRLLVAQNSSVIAFGGSYGGMLAAWMRIKYPHLVEGSF 96
>gi|160332814|emb|CAL69923.1| hypothetical protein [Plasmodiophora brassicae]
gi|162138601|emb|CAP58027.1| hypothetical protein [Plasmodiophora brassicae]
Length = 467
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 116/459 (25%), Positives = 185/459 (40%), Gaps = 61/459 (13%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q++DH + F QRY D W G + P+ + E F +A
Sbjct: 29 QLVDHTAIGGGGAR-FSQRYFRIDQFWSGP--DGPVILQLCGEYTCAGVTDGRQFPSALA 85
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL-- 177
++ AL++ +EHRY+GKS P+ ++ +N + YL++ QAL+D A D + +
Sbjct: 86 ERYGALVLVLEHRYFGKSSPF---SVLSPRNLT---YLTTFQALSDIACFT-DWYQRVHI 138
Query: 178 -TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS-FSNIITQ 235
A + + GGSY G LAAW+RLKYPH+ GALASSA +V+P++ F Q
Sbjct: 139 GRANANKWITIGGSYPGALAAWYRLKYPHLTAGALASSA-------VVAPFAEFPEFDEQ 191
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTA 295
S C ++ +E ++ G L A C S LS A
Sbjct: 192 VALSAGPECTHALQDITAMVEGALQEGGRLADEMNALFSC---------SQLSDADFLYL 242
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNG 355
+ D + P +C I ++ +D A + S + D +G
Sbjct: 243 IAD-----AMAEAIQYGPSVSLCDPIVQAESRDDRLAAFVEFVQGTFYASMSNSPGDYDG 297
Query: 356 DS--------DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-- 405
D+ D G +W W C E+ +SI + R R C++ +G
Sbjct: 298 DTMASHRWVPDSSG-RQWWWMKCNEVGWFQIAPGTNSIRSKRVNMEWHRDR-CEKLFGDV 355
Query: 406 --VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI------SKTVL 457
P + + E+ G + + SN++F NG+ DPW G +VL
Sbjct: 356 LAFPPPCHRASIEYSGFDMSV------SNVVFTNGVEDPWQWAGASAFSSSAHLRDSSVL 409
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + H VDL + +D L R + I +W+
Sbjct: 410 LINCSQCGHCVDLHTPSPDDAPALTTARSTIIAHIDRWL 448
>gi|449701650|gb|EMD42428.1| serine carboxypeptidase (S28) family protein, partial [Entamoeba
histolytica KU27]
Length = 195
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 73/214 (34%), Positives = 104/214 (48%), Gaps = 23/214 (10%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDH 64
+ L+ C++ S N ++ S +T E L+ + S + +T LDH
Sbjct: 1 MLLLALCVICSYAKLSLNQQVMQQISQSFMTLE-LNEVESMT-----------YTVPLDH 48
Query: 65 FNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-WFAQNTGFMYDVAPKFK 123
FN N Q+ F +Y +N + NAP+FV G EG QN + +A K K
Sbjct: 49 FNANNQN--DFDIQYFVNKKFLDANDPNAPLFVLLGGEGPASPKVLQNNYVIDSLAKKHK 106
Query: 124 ALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSP 183
L++ +EHR+YG S P ++ Y A QAL DY +I +++ P
Sbjct: 107 GLMLSVEHRFYGASTPSLEMDKLIYCTAE--------QALMDYVEVISHVQEENNLVGHP 158
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
V+V GGSY G LAAW R KYP+V GA ASSAP+
Sbjct: 159 VIVLGGSYSGNLAAWMRQKYPNVVEGAWASSAPV 192
>gi|341879672|gb|EGT35607.1| hypothetical protein CAEBREN_08421 [Caenorhabditis brenneri]
Length = 1088
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 112/464 (24%), Positives = 189/464 (40%), Gaps = 47/464 (10%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMYD 117
Q +DHFN Q+ FQQ++ N W ++ P F+ G EG W
Sbjct: 586 QRVDHFN--NQNANFFQQKFYKN-AQW--AQPGGPNFLMIGGEGPESSRWVLNENITYLT 640
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI--IDLKK 175
A K+ A + +EHR+YG S+ G N ++ L+S Q L D A I ++LK
Sbjct: 641 WAKKYGATVYLLEHRFYGDSL-VGDNNDL--------NTLNSLQMLYDLAEFIKSVNLK- 690
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
T T +P + FGGSY G ++AW R +P + +GA+ASS P+ + Y + ++
Sbjct: 691 --TGTSNPWITFGGSYSGAMSAWMREVFPDMVVGAVASSGPVFAKTDF---YEYLMVVEN 745
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC----KSEKNLAIESWLSTAF 291
R+ C I+ + + G + L F++ + ++ + S +
Sbjct: 746 SIRTYDRTCADRIQSGFNTMRTMFLTKEGRQNLSDIFQLLPPFGDNVTDIDQHYFFSNVY 805
Query: 292 -VYTAMTDYPTPSNFLNPLPAFPVKEMCKAI-DDPKTGNDVFAKLYGAASVYYN----YS 345
+ Y + N + + +MCK + +D T + +V+YN YS
Sbjct: 806 GNFQGAVQY-SGDNTGPYANGYGIPDMCKIMTNDTNTPLNNIVAFNQFMTVFYNGGVPYS 864
Query: 346 GTAKCF----DLNGDSDPHGLSE-----WGWQACTEMIMLTGGDNKDSIFEESEEDYDAR 396
G + D ++ G W WQ CTE D + IF S +
Sbjct: 865 GMDNNYQDLIDYLQNAQQFGPEAGAGLLWMWQTCTEFGYFQSADTGNGIF-GSPTPVNMY 923
Query: 397 ARYCKEAYGVDPRPNWI--TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
+ C + + + I + + +K G + +N++ NG DPW G+
Sbjct: 924 VQMCMDVFNSYYQRGTIDSSVSYTNYKYGSRDQYRGTNVVLPNGNVDPWHALGLYGAQDS 983
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+V+ + AH D+ + D LK VR + I W++Q
Sbjct: 984 SVVTYLINGTAHCADMYPARDADAPGLKIVRDLIDQNIAIWLNQ 1027
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 101/454 (22%), Positives = 172/454 (37%), Gaps = 54/454 (11%)
Query: 74 TFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW--FAQNTGFMYDVAPKFKALLVFIEH 131
TF QRY + + + F+Y GD E + A +F A + +EH
Sbjct: 63 TFSQRYFYTQDY---ALHQRVAFLYISVSGDFETSVITDERNPIVKTARQFGATVFSLEH 119
Query: 132 RYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPV--VVFGG 189
R+YG+S P + +AS T +L+S QA+ D I N D V +++G
Sbjct: 120 RFYGQSRP----NFDKFDSASLT-HLNSFQAIQDILHFI-RFANNKFQLDPDVRWILWGA 173
Query: 190 SYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV-SENCYKVI 248
YGG++AA R P + G +ASSAP+ + + + F++ + V + CY+ +
Sbjct: 174 GYGGIIAAEARKWDPKLVAGVVASSAPLTHKYDF---WEFNDQVAIILSQVGGQLCYQKV 230
Query: 249 KGSWKQIEETAKKPGGLEKLQKAFRICK--SEKNLAIES----WLSTAFVYTAMTDYPTP 302
+ I + + P G + F + + NL W+S + + Y
Sbjct: 231 AQGFADIGQAMRTPQGRANVSDLFGLVPRLDQTNLNYNDVQMFWMSVISPFQTLAQYNND 290
Query: 303 SNFLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLYGAASVYYN-------YSGTAKCFDLN 354
N + +MC ID T +V + Y S N SG +
Sbjct: 291 FN-------LSIGDMCTNIDKSNWTNMEVVYQTYVYLSRTLNDGQVLPLVSGYQDVINDL 343
Query: 355 GDSDPHGL----SEWGWQACTEM-IMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR 409
G+ P W +Q CTE T +N+ ++F + + + + P
Sbjct: 344 GNQSPDSPYLDNRMWQYQMCTEFGWFYTTNNNEQTLFGAVV----PTSLFLNLCFDLFPG 399
Query: 410 PNWITTEFG------GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKE 463
+T G +N++F NG DPWS G +V+A +
Sbjct: 400 AQLTSTSIRDIVDDYNRLYGAADDYSGTNVVFTNGWYDPWSRLGKESTADFSVVAYIIPS 459
Query: 464 GAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G+ D+ F + + + + E I W++
Sbjct: 460 GSWASDM-FPGDTNDTSIINAHKLVTENINVWVN 492
>gi|402588746|gb|EJW82679.1| hypothetical protein WUBG_06411 [Wuchereria bancrofti]
Length = 396
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/378 (26%), Positives = 160/378 (42%), Gaps = 45/378 (11%)
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN-- 176
A K+ A ++EHR++G S P+ + +YK YL+ QALAD + I+ + +
Sbjct: 8 AKKYGAACFYLEHRFFGASQPFEDHSMESYK------YLTVNQALADIKNFIVQMNEMFF 61
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
L ++FGGSYGG LAAWFR + I A+ SSA + V Y ++ +
Sbjct: 62 LDIEKPRWILFGGSYGGALAAWFREMNEELTIAAIVSSAVV---QAEVDYYDYTKNLEYV 118
Query: 237 FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK--SEKNLAIESWLSTAFVYT 294
+ + C + I+ S K + E G +L K F +C+ +E +A + A +
Sbjct: 119 LKEENAPCAETIRLSIKALIEKTYTVDGRAELGKVFNMCEPFTEPPIAKDIQFFLANILY 178
Query: 295 AMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLN 354
Y + LP V C I D + D ++ A +Y + +CFD +
Sbjct: 179 TFGGYIQYAGGCR-LP--DVSYFCDLITD-GSETDYIGVIWNAWKIYDQIFQSEECFDPS 234
Query: 355 GDSDPHGLSE-------------WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
+ LS+ W W CTE+ DN SIF S D A C
Sbjct: 235 YERHLEDLSDITFMDNEFASYRSWLWLCCTELGFFITTDNGKSIFGSSVS-LDYHADKCM 293
Query: 402 EAYGVDPRPNWITTEFGGHKIGLVLKRF-------ASNIIFFNGLRDPWSGGGVLK--NI 452
+ + V TE + L+ F +N +F +G DPW L +I
Sbjct: 294 DVFDVQ-----YDTERARTGVRNTLRTFGGYDNYKGTNTVFVSGSYDPWKSACCLNCTDI 348
Query: 453 SKTVLALVEKEGAHHVDL 470
++ V +++ + G+H VD+
Sbjct: 349 TRNVYSVIIEGGSHCVDV 366
>gi|312380006|gb|EFR26124.1| hypothetical protein AND_08001 [Anopheles darlingi]
Length = 506
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 117/474 (24%), Positives = 200/474 (42%), Gaps = 77/474 (16%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-WFAQNTGFMYD 117
T ++HF+ PQ TF+ YL ND ++ + P+FV G ++ +F +N+ F D
Sbjct: 63 TSRINHFD--PQDRSTFEFNYLTNDQYY---REGGPLFVVVGGHHRLDPYFLENSHFR-D 116
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
VA A L EHR++G S+P ++++ +N +L + QAL D I LK+ +
Sbjct: 117 VAALNGAFLANNEHRFFGTSVP---TEDLSSENLR---FLRTEQALFDLIEWIDFLKREV 170
Query: 178 TAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
++ V+V G SYGG LA W R ++P++ GA SSA + + FS T+D
Sbjct: 171 MGDPNARVIVHGFSYGGALATWARQRFPNIIDGAWGSSA------TVRATVEFSE-FTED 223
Query: 237 FRSV-----SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNLAIESWL- 287
F + S+ CY I ++ IE G E++ F C ++ L +E +L
Sbjct: 224 FGNTIRVKGSDECYSAIFRAFHTIENLLDA-GLTERVSSTFNTCDPIDADNALQVELFLH 282
Query: 288 --STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY--N 343
+ + + D+ + V+ +C + D D F A + Y
Sbjct: 283 LMTLSLEISMFEDFDIEN----------VQNVCNQLTD-----DSFDTSMEALAEYLKNR 327
Query: 344 YSGTAKCFDLN--------GDSD-------PHGLSEWGWQACTEMIMLTGGDNKDSIFEE 388
Y+ CFDL+ GD +GL + + CTE +++ F
Sbjct: 328 YADVRDCFDLSFENFISILGDESLDAPQNLDYGLRQLNYHICTEFGYFQSARSREQPF-G 386
Query: 389 SEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFA------SNIIFFNGLRDP 442
S+ YD C +G W+T+E + L F +N++F NG DP
Sbjct: 387 SKVTYDLFLEECAAVFG-----EWLTSEVLYDGVRLTNFHFGATDPRTTNVLFTNGGIDP 441
Query: 443 WSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + + + A V + D+R + D + + + + + I W+
Sbjct: 442 FRYVSITQYQNLLSNARVTPAAFYTEDIRSISGYDSEEMLETKHMVEQYITTWL 495
>gi|154413074|ref|XP_001579568.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121913776|gb|EAY18582.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 518
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 109/477 (22%), Positives = 184/477 (38%), Gaps = 89/477 (18%)
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
T Y Q LDHFN + +TF+QRY NDT + + + V+ G E I G
Sbjct: 21 TYYFDQFLDHFNTSDN--RTFKQRYYYNDT-FCQNTTTKKLIVFIGGEAAITERRVQKGA 77
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+A + + +V +EHRY+G+S P+ +E+ N YL+S QALAD A I
Sbjct: 78 YMKLAKETDSCVVALEHRYFGESQPF---EELITPNLK---YLTSDQALADLAYFIESFI 131
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
K + ++V GGSY G L+++FR+KYPH+A + ASS P+ +
Sbjct: 132 KIKYQSRPTILVVGGSYPGTLSSYFRMKYPHIADFSWASSPPLY--------------VK 177
Query: 235 QDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYT 294
DF +C +V+ ++ + +T +Y
Sbjct: 178 NDFWEYDAHCAEVL------------------------------GKISPKCLTNTKLIYD 207
Query: 295 AMTDYPTP-SNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY-NYSGTAKCFD 352
D+P +N++ P+ + D G + +Y + Y N +G + +D
Sbjct: 208 DFNDHPDHITNYIPFKPSVSHVSQLSILSDFIAGIVQYDNIYKLVTPYCENQNGDSPNYD 267
Query: 353 -----------LNGDSDPHGLSE-----------------WGWQACTEMIMLTGGDNKDS 384
+ G DP L + W W C E +
Sbjct: 268 SFHDYFYKYLEVEGVEDPSDLDDFALTNHSIHTDYADSLSWTWMTCNEFGWFQTASGQ-- 325
Query: 385 IFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS 444
++ D + C+ +GV P+ I + I F NG DPWS
Sbjct: 326 -LRPAKVDLNYSDLVCRTYFGVGISPDIDNNRSAKMDIYNAQNPATTMIYFSNGKTDPWS 384
Query: 445 GGGVLKNISKTVL---ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
V +N+ + ++ +H DL +P+ L R++ ++ + +W++
Sbjct: 385 VLSVSENVQNPPVGRYSVQINNASHCSDLGDEAAGEPEALTVARKQIMDTMARWLNH 441
>gi|344268151|ref|XP_003405926.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 431
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 100/386 (25%), Positives = 160/386 (41%), Gaps = 60/386 (15%)
Query: 56 KYHTQILDHFNYNPQSY--------------------QTFQQRYLINDTHWGGSKNNAPI 95
++ Q+L+HF+ N S+ + +QRY IND + K P+
Sbjct: 43 RWFQQMLNHFSKNGSSFWSQAVAIEAVTWGQLAKDQIASEEQRYFINDIFY---KPGGPV 99
Query: 96 FVYTGNEGDIEW---FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAS 152
F++ E NT Y A + AL + +EHR+YG S P G + +
Sbjct: 100 FLFIKGENAASRQMVTGNNTWTTY--AERLGALCLLLEHRFYGHSQPTGD------LSTA 151
Query: 153 TTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALA 212
+ YL+S QALAD + + K + T + V+FGGSYGG LA W R+K P + A+
Sbjct: 152 SLRYLNSRQALADIVNFRTQIAKTMGLTKNKWVIFGGSYGGSLAVWSRIKRPDLFFAAVG 211
Query: 213 SSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
SSAP+L V+ Y + I + + + C + ++ ++ I K P L F
Sbjct: 212 SSAPML---AKVNFYEYLEIFYRTLTTYNIECARAVEKAFTLIVALVKIPEYYSMLADDF 268
Query: 273 RICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN--DV 330
+C+ L I S + AF M + L + + C + D G+ +
Sbjct: 269 MLCEP---LKINSEMDAAFFLHKMIYFIRIE-----LNKITIDDFCDLMTDTSLGSPYNR 320
Query: 331 FAKLYGA-------ASVYYNYSGTAKCFDLNG---DSDPHGLSEWGWQACTEMIMLTGGD 380
FA + + +Y NYS + F L+ S G +W +Q CTE D
Sbjct: 321 FATIANMKIINKKWSCLYANYSAFLEYFSLSSLDMQSIRRG-RQWFYQCCTEFGFFQTTD 379
Query: 381 NKDSIFEESEEDYDARARYCKEAYGV 406
+ + F Y + C + +G
Sbjct: 380 SNNHPFSGLPLRYYVKR--CSDVFGT 403
>gi|195109612|ref|XP_001999377.1| GI23098 [Drosophila mojavensis]
gi|193915971|gb|EDW14838.1| GI23098 [Drosophila mojavensis]
Length = 386
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 105/429 (24%), Positives = 185/429 (43%), Gaps = 55/429 (12%)
Query: 78 RYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKS 137
RYL+ND + K P+F+Y G E I G MYD+A + LL + EHRYYG+S
Sbjct: 2 RYLLNDVFF---KAGGPMFIYLGGEWTISNGFITAGHMYDMAKEHNGLLAYTEHRYYGES 58
Query: 138 IPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA-TDSPVVVFGGSYGGMLA 196
P +++ +N +L QALAD A I K + +DS V++ GGSY +
Sbjct: 59 HPL---PDLSNENLR---FLHVKQALADLAHFIKTQKASYEGLSDSKVIIVGGSYSATMV 112
Query: 197 AWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN-CYKVIKGSWKQI 255
WF+ YP + +G ASSAP++ N + + ++ + + + CY I+ ++
Sbjct: 113 TWFKRTYPDLVVGGWASSAPVVAKVNF---FEYKEVMGESITLMGGSACYDRIEKGIAEL 169
Query: 256 EETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPV 314
E G E ++ ++C+ + N ++ W ++ +++ +N + + +
Sbjct: 170 ETMFANKRGAE-VKALLKLCEPFDVNNDLDIWT----LFNEISE--IFANVVQTHNSGRI 222
Query: 315 KEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMI 374
+ +C+ I TG+ L G +S + G A +G + L +W +Q C E
Sbjct: 223 EGVCQQI---MTGS---TDLIGVSSYLLSEFGKA-----SGST----LRQWIYQTCNEYG 267
Query: 375 MLTGGDNKDSIFEESEEDYDARARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLK 427
+ F ++ C + YG ++ R FGG L
Sbjct: 268 WYQTSSSSMQPF-GTKFPLALFTTMCADLYGPQFGESFIEDRAAETNEYFGG------LT 320
Query: 428 RFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRR 487
N+ F +G DPW G+ T++ E AH D + +D ++ + R
Sbjct: 321 PEVENVYFSHGQLDPWRAMGIQDEKQATIIP----EHAHCKDFGSISNDDTPEMRASKER 376
Query: 488 EVEIIGKWI 496
E++ +W+
Sbjct: 377 VAELVREWL 385
>gi|157119585|ref|XP_001659436.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108875273|gb|EAT39498.1| AAEL008698-PA [Aedes aegypti]
Length = 500
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 106/435 (24%), Positives = 174/435 (40%), Gaps = 67/435 (15%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
K+ T++ DHFN PQ+ TF+ Y ND + + PIF++ G ++ + G
Sbjct: 57 KFRTRV-DHFN--PQNRDTFEFEYYSNDEFY---RPGGPIFIFVGGNWPLDQYYIEHGHF 110
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
+D+A A + EHRYYG S P + + YL+ QA+ D A LI ++
Sbjct: 111 HDIANYENAWMFANEHRYYGHSFP------VPNLSVENLQYLTVEQAMVDLAELIYHVRH 164
Query: 176 NLTATD-SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSN 231
N+ D + V++ G Y G +A W R +YPH+ GA SS I NF ++
Sbjct: 165 NVVRDDDARVILLGTGYAGAIATWMRQRYPHLVEGAWVSSGQIEARFNFK------EYAM 218
Query: 232 IITQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESW- 286
I + R N CY I +++ E G + F C+ +E L +E++
Sbjct: 219 EIGELIRDYGTNECYSQIWRAFRTAENLMDA-GLANTVTDLFNTCERVDTETMLDVETFF 277
Query: 287 --LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNY 344
+ A +++ T + +E+C+ +++ D+ YY
Sbjct: 278 YNVKEALQRAILSEQDTET----------TEELCENLNNSTEATDLHTIANWIEDFYYYL 327
Query: 345 SGTAKCFDLNGDSDPHGLSEWG-------------WQACTEMIMLTGGDNKDSIFEESEE 391
FD + H E G +Q CTE D+ D F
Sbjct: 328 D--CMPFDFDTTVAAHQFEEIGYPENAILGLRQRVYQFCTEFGWFLTADSPDQPFGYRVT 385
Query: 392 DYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFA------SNIIFFNGLRDPWSG 445
Y +C+ +G +W+T+E + L F SN++F NG DP
Sbjct: 386 MY-FFLNFCRSVFG-----DWVTSEVVADGVHLTNMHFGGKNPRISNVLFTNGALDPVRD 439
Query: 446 GGVLKNISKTVLALV 460
G+ + + A+V
Sbjct: 440 VGITEYKQPSSDAIV 454
>gi|219130565|ref|XP_002185433.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403147|gb|EEC43102.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 538
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 128/481 (26%), Positives = 202/481 (41%), Gaps = 67/481 (13%)
Query: 31 SSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSK 90
++++T E SSL + + G +T+Y Q L+HF+ ++F QR+ +D + +
Sbjct: 39 NAKVTDE--SSLGADVPEFVGDVQTRYVNQQLNHFD--ASDTRSFAQRFFYSDRYARARE 94
Query: 91 NNAPI--FVYTGNEG---DIEWFAQN---TGFMYDVA-----PKFKALLVFIEHRYYGKS 137
N F+ G EG D + TG M ++A K L +EHRYYG+S
Sbjct: 95 ENRNTYAFLCVGGEGPALDESVLVDSVHCTGDMLELAHILFEDGHKVHLYALEHRYYGES 154
Query: 138 IPY---GG---NKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSY 191
P GG N+ + YLSSTQALAD A + + + T+ V FGGSY
Sbjct: 155 YPVFREGGCSKNRTTSPVTNQHLVYLSSTQALADLAHFV-NSRSLDGGTNIKWVTFGGSY 213
Query: 192 GGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV----SENCYKV 247
GM+AAW R KYPH+ A++SSAP+ ++ +++N +++ S S C V
Sbjct: 214 PGMMAAWARSKYPHLIHAAVSSSAPV---QAVLDFSAYNNHVSKVLASANVGGSSECLAV 270
Query: 248 IKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLN 307
+ + ++ L F +C + L E F+ + D T N N
Sbjct: 271 FQAAHGEVTRMVHDATQHAGLADMFGLCNATSLL--EDRNKELFLGDGLVDLHTQGN--N 326
Query: 308 P---LPAFPVKEMCKA-IDDPKTGNDVFAKLYGAASVYYNYSGTAKCFD------LNGDS 357
P + ++C+ ++D K+ F + A++ C + L+ S
Sbjct: 327 PSCDRDLCNIGKICRTLLNDAKS----FKPVTALANLAQRQRDRGACINIDWTGTLDYIS 382
Query: 358 DP-----HGLSEWGWQACTEMIMLTGGDNKDSI-----FEESEEDYD-ARARYCKEAYGV 406
DP GL W WQ CTE + + F ++D + R + K V
Sbjct: 383 DPVRGVEGGLRSWLWQTCTEFGFYQTCNMHSTCPYGRGFHRVDQDLEMCRVAFDKSGVQV 442
Query: 407 DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
+GG K+ AS I+ NG DPWS + ++S L + GA
Sbjct: 443 ATAVRSSMEAYGGWKME------ASRILSVNGDIDPWSELAI-HDVSNPQLPTYQVPGAS 495
Query: 467 H 467
H
Sbjct: 496 H 496
>gi|326520173|dbj|BAK04011.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 489
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 123/475 (25%), Positives = 193/475 (40%), Gaps = 72/475 (15%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ ++ +Q LDHF+ + ++ F+QRY D H P+F+ E + +
Sbjct: 49 EERWMSQRLDHFSSS--DHRQFKQRYFEFLDYH---DDPTGPVFLRICGESSCDGIPND- 102
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
++ +A KF A +V EHRYYGKS P+ +LSS QAL D A
Sbjct: 103 -YLAVIAKKFGAAVVTPEHRYYGKSSPFDS------LTTDNLRFLSSKQALFDLAVFRQY 155
Query: 173 LKKNLTAT-------DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
++ L + D+P VFG S G L+AWFRLK+PH+ G+LASS +L +
Sbjct: 156 YQEKLNSRYNRSAGFDNPWFVFGVSCSGALSAWFRLKFPHLTCGSLASSGVVL------A 209
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES 285
Y+F++ Q S C ++ + ++ K+ KA S KN
Sbjct: 210 VYNFTDFDKQVGDSAGPECKAALQEITRLVD---KQLLSDSHSVKALFGADSLKN----- 261
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA-ASVYY-- 342
F++ T + NP +C + + K + + Y YY
Sbjct: 262 --DGDFLFLLADAAATTFQYGNP------DALCSPLANAKKKGESLVETYAHFVKDYYIK 313
Query: 343 NYSGTAKCFDLN--GDSDPHGLSE--WGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
T +D ++ P S W +Q C+E+ DS+ ++ D
Sbjct: 314 KLGTTVSSYDQEYLKETTPDDSSSRLWWFQVCSEVAYFQVAPKNDSV-RSAQIDTRYNLD 372
Query: 399 YCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTV 456
CK YG P+ T +GG I AS I+F NG +DPW K+
Sbjct: 373 LCKNVYGEGVYPDVFMTNLYYGGTSIA------ASKIVFTNGSQDPWRHASKQKSSEGMP 426
Query: 457 LALVE-KEGAHHVDLR------FSTKED------PQWLKDVRRREVEIIGKWISQ 498
+++ H DLR F + D P+ + VR++ V+ I W+SQ
Sbjct: 427 SYIIKCSNCGHGTDLRGCPQLPFRIEGDSSNCTSPEAVNTVRKQIVKHIDLWLSQ 481
>gi|338718275|ref|XP_003363793.1| PREDICTED: LOW QUALITY PROTEIN: thymus-specific serine
protease-like [Equus caballus]
Length = 620
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 55/141 (39%), Positives = 82/141 (58%), Gaps = 9/141 (6%)
Query: 78 RYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKS 137
RY +ND HW + + P+F++ G EG + + G +AP + AL++ +EHR+YG S
Sbjct: 184 RYWVNDQHW--TAQDGPVFLHLGGEGSLWPGSVLRGHPATLAPSWGALVIGLEHRFYGLS 241
Query: 138 IPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL-TATDSPVVVFGGSYGGMLA 196
IP GG + +LSS AL D S + L + ++ SP + FGGSY G LA
Sbjct: 242 IPAGGLDMAQLR------FLSSRHALTDVVSARLALSRLFNVSSSSPWICFGGSYAGSLA 295
Query: 197 AWFRLKYPHVAIGALASSAPI 217
+W RLK+PH+ ++ASSAP+
Sbjct: 296 SWARLKFPHLIFASVASSAPV 316
>gi|297271491|ref|XP_001116882.2| PREDICTED: dipeptidyl peptidase 2-like, partial [Macaca mulatta]
Length = 142
Score = 105 bits (262), Expect = 6e-20, Method: Composition-based stats.
Identities = 56/147 (38%), Positives = 83/147 (56%), Gaps = 15/147 (10%)
Query: 334 LYGAASVYYNYSGTAKCFDL----NGDSDPHGLSE------WGWQACTEMIMLTGGDNKD 383
L A + YN SG+ C+D+ +DP G W +QACTE+ + +N
Sbjct: 1 LRALAGLVYNASGSEHCYDIYRLYRSCADPTGCGTGPDARAWDYQACTEINLTFASNNVT 60
Query: 384 SIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
+F + + R +YC + +GV PRP+W+ T F G + R ASNIIF NG DPW
Sbjct: 61 DMFPDLPFTDERRQQYCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPW 115
Query: 444 SGGGVLKNISKTVLALVEKEGAHHVDL 470
+GGG+ +N+S +V+A+ + GAHH+DL
Sbjct: 116 AGGGIRRNLSASVIAITIQGGAHHLDL 142
>gi|195062810|ref|XP_001996258.1| GH22292 [Drosophila grimshawi]
gi|193899753|gb|EDV98619.1| GH22292 [Drosophila grimshawi]
Length = 480
Score = 105 bits (261), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 111/464 (23%), Positives = 203/464 (43%), Gaps = 55/464 (11%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T++ +Q LD+F+ + + + R LIN+ ++ + +PIF+Y G E +IE G
Sbjct: 52 ETRWISQKLDNFDEGNE--EVWDDRVLINEDYF---VDGSPIFIYLGGEWEIEPSPITAG 106
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
D+A + LV+ EHR++G+S+P I + Y + QALAD ++I L
Sbjct: 107 HWVDIASEHNGSLVYTEHRFFGQSVP------IKPLTTANLKYQNVEQALADVVNVINVL 160
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K+ +S VVV G SY +A W + YP V +G+ ASSAP+ V ++ ++
Sbjct: 161 KEEEKYKNSKVVVQGCSYSATMAVWIKKLYPDVIVGSWASSAPL---QAKVDFKAYMKVV 217
Query: 234 TQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL---S 288
Q +R + + CY +I + E+ + G + +K +C + +N + W +
Sbjct: 218 GQAYRELGGDYCYNIIDNATSFYEDLFEN-GQNAEAKKLLNLCDNFNENDQHDQWQIFST 276
Query: 289 TAFVYTAMTDYPTPSNF-----LNPLPAFPVKEMCK-------AIDDPKTGNDVFAKLYG 336
A + + Y P+N+ + L +F + +D+P+ N V Y
Sbjct: 277 IANILAGLAQYQNPANYDLAKHCSVLRSFSTDDATALSKFIQWRLDNPECVNTV----YK 332
Query: 337 AASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR 396
A YY ++ + D GLS W +Q C E +K+ F S + A
Sbjct: 333 ATVKYYKWAMH--------NYDGSGLS-WFFQTCNEFGWYQSSGSKNQPFGSS---FPAT 380
Query: 397 --ARYCKEAYGVDPRPNWITTEFG-GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS 453
CK+ +G I +K+ + N+ +G DPW G
Sbjct: 381 LYTDTCKDVFGSKYTAAKIEKHISEKNKVYGGVNPNVENVYMTHGGLDPWHPVGAGAAQG 440
Query: 454 KTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
T++ + +H D+ + +D + ++R +++ +W++
Sbjct: 441 ATIIP----QASHCSDMGSISAKDSPAMIASKQRVAQLVREWLA 480
>gi|66772019|gb|AAY55321.1| IP12634p [Drosophila melanogaster]
Length = 490
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 108/466 (23%), Positives = 199/466 (42%), Gaps = 58/466 (12%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+ ++ TQ LD N++ + T+Q R IN+ ++ + +PIF+Y G E I+ +G
Sbjct: 58 EERWITQKLD--NFDDSNNATWQDRIYINNKYF---VDGSPIFIYLGGEWAIDPSGITSG 112
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
D+A + L++ EHR++G+SIP + Y S QALAD ++I L
Sbjct: 113 LWKDIAKQHNGSLLYTEHRFFGQSIPI-----TPLSTENLAKYQSVEQALADVINVIATL 167
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL---NFDNIVSPYSFS 230
K+ DS VVV G SY +A W R YP + G+ ASSAP+L NF + +
Sbjct: 168 KQEDKYKDSKVVVSGCSYSATMATWIRKLYPEIIRGSWASSAPLLAKVNFKD------YM 221
Query: 231 NIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL- 287
++ + + ++ + CY +I + E + G + + K +C + N + W
Sbjct: 222 KVVGESYATLGGQYCYDLIDNATSYYENLFEIGNGTQAV-KELNLCSNFNVNSEQDRWQI 280
Query: 288 --STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL----YGAASVY 341
+ A ++ + Y P + P ++E DD F + A +
Sbjct: 281 FSTIANIFAGIAQYQKPEKYDIPTYCSILREFS---DDDSVALSKFINWKINEHSGACLS 337
Query: 342 YNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKD---------SIFEESEED 392
+ G ++ + ++ W +Q C+E ++ +++E++ E
Sbjct: 338 TTFKGAVGYYEWSKENYQDSDLPWIFQTCSEFGWFQSSGSRSQPFGSTFPATLYEDTCEG 397
Query: 393 YDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS--GGGVLK 450
A+Y ++ G+ +FGG + A+NI F G D WS G GV +
Sbjct: 398 VFG-AKY--DSAGIHANIRATNDDFGGLNVN------ATNIYFVQGALDGWSKVGAGVAQ 448
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
A + +H D + D L +++ ++++ +W+
Sbjct: 449 G------ATIIPYASHCPDTGSISASDSAELVASKKKLIKLVAQWL 488
>gi|170045808|ref|XP_001850486.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868714|gb|EDS32097.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 501
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 112/446 (25%), Positives = 179/446 (40%), Gaps = 68/446 (15%)
Query: 24 KIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLIND 83
K+ +P + PE +++S+ + L++T+ +DHFN PQ+ TF+ Y ND
Sbjct: 32 KMLKAYPIPEV-PEDYD--LANSRSVRNLFRTR-----VDHFN--PQNRDTFELAYYSND 81
Query: 84 THWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGN 143
+ + PIF++ G + + G D+A A + EHRYYG S P
Sbjct: 82 EFY---RPGGPIFIFVGGNWAVNPYFIERGHFPDIAYMEGAWMFTNEHRYYGTSFPV--- 135
Query: 144 KEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSP-VVVFGGSYGGMLAAWFRLK 202
++++ N +L+ QA+ D A LI L+ N+ D+ VV+ G YGG +A W R +
Sbjct: 136 EDLSTPNLR---FLTVEQAMVDLAELIYHLRHNVVRDDNARVVLLGMGYGGAIATWMRQR 192
Query: 203 YPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKP 262
YPH+ G+ SS + N + +D + CY I +++ E
Sbjct: 193 YPHLVDGSWVSSGQVEARFNFKEHAVEVGELIRDHG--DDECYSRIWRAFRT-AEALMDA 249
Query: 263 GGLEKLQKAFRICKS---EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK 319
G E + FR C + E L +E+ F Y + L +C+
Sbjct: 250 GRTEIVTDMFRTCDAVDEENMLDVET-----FFYNVKE--VIQAEILLYQNVESTTRLCE 302
Query: 320 AIDDPKTGNDV--FAKLYGAASVYYNYSGTAKC--FDLNGDSDPH-------------GL 362
++D D+ A A Y+ +C FD + H GL
Sbjct: 303 TLNDSDESTDLQTLASWVNATFSYF------ECLPFDFESTVEAHSVLDIDSIENRYLGL 356
Query: 363 SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKI 422
+ +Q CTE D+ D F Y +CK YG +W+T E +
Sbjct: 357 RQRVYQFCTEFGWFLTADSDDQPFGYRVTMY-FFLNFCKATYG-----DWVTAEVVADGV 410
Query: 423 GLVLKRFA------SNIIFFNGLRDP 442
L F +N++F NG DP
Sbjct: 411 HLTNMHFGGQDPRIANVLFINGGLDP 436
>gi|24648179|ref|NP_650804.1| CG3739 [Drosophila melanogaster]
gi|7300510|gb|AAF55664.1| CG3739 [Drosophila melanogaster]
Length = 547
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 108/464 (23%), Positives = 198/464 (42%), Gaps = 58/464 (12%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
++ TQ LD N++ + T+Q R IN+ ++ + +PIF+Y G E I+ +G
Sbjct: 117 RWITQKLD--NFDDSNNATWQDRIYINNKYF---VDGSPIFIYLGGEWAIDPSGITSGLW 171
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
D+A + L++ EHR++G+SIP + Y S QALAD ++I LK+
Sbjct: 172 KDIAKQHNGSLLYTEHRFFGQSIPI-----TPLSTENLAKYQSVEQALADVINVIATLKQ 226
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL---NFDNIVSPYSFSNI 232
DS VVV G SY +A W R YP + G+ ASSAP+L NF + + +
Sbjct: 227 EDKYKDSKVVVSGCSYSATMATWIRKLYPEIIRGSWASSAPLLAKVNFKD------YMKV 280
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL--- 287
+ + + ++ + CY +I + E + G + + K +C + N + W
Sbjct: 281 VGESYATLGGQYCYDLIDNATSYYENLFEIGNGTQAV-KELNLCSNFNVNSEQDRWQIFS 339
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL----YGAASVYYN 343
+ A ++ + Y P + P ++E DD F + A +
Sbjct: 340 TIANIFAGIAQYQKPEKYDIPTYCSILREFS---DDDSVALSKFINWKINEHSGACLSTT 396
Query: 344 YSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKD---------SIFEESEEDYD 394
+ G ++ + ++ W +Q C+E ++ +++E++ E
Sbjct: 397 FKGAVGYYEWSKENYQDSDLPWIFQTCSEFGWFQSSGSRSQPFGSTFPATLYEDTCEGVF 456
Query: 395 ARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS--GGGVLKNI 452
A+Y ++ G+ +FGG + A+NI F G D WS G GV +
Sbjct: 457 G-AKY--DSAGIHANIRATNDDFGGLNVN------ATNIYFVQGALDGWSKVGAGVAQG- 506
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
A + +H D + D L +++ ++++ +W+
Sbjct: 507 -----ATIIPYASHCPDTGSISASDSAELVASKKKLIKLVAQWL 545
>gi|32564813|ref|NP_498758.2| Protein K12H4.7, isoform a [Caenorhabditis elegans]
gi|13638618|sp|P34528.2|YM67_CAEEL RecName: Full=Putative serine protease K12H4.7; Flags: Precursor
gi|351063156|emb|CCD71198.1| Protein K12H4.7, isoform a [Caenorhabditis elegans]
Length = 510
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 117/472 (24%), Positives = 191/472 (40%), Gaps = 62/472 (13%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE--WFAQNTGFMY 116
TQ LDHF+ + +TFQQRY N+ + K P F+ G EG W + +
Sbjct: 64 TQTLDHFDSSVG--KTFQQRYYHNNQWY---KAGGPAFLMLGGEGPESSYWVSYPGLEIT 118
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
++A K A + IEHR+YG++ P +++ N YLSS QA+ D A+ I K
Sbjct: 119 NLAAKQGAWVFDIEHRFYGETHP---TSDMSVPNLK---YLSSAQAIEDAAAFI----KA 168
Query: 177 LTA-----TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+TA ++ V FGGSY G LAAW R K+P + A+ SS P+ V +
Sbjct: 169 MTAKFPQLANAKWVTFGGSYSGALAAWTRAKHPELVYAAVGSSGPV---QAEVDFKEYLE 225
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAIESWL 287
++ S C + + + + G ++L+ AF +C+ +K+L W
Sbjct: 226 VVQNSITRNSTECAASVTQGFNLVASLLQTSDGRKQLKTAFHLCQDIQMDDKSLKY-FWE 284
Query: 288 STAFVYTAMTDY--------PTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL---YG 336
+ Y + Y T + + + + + K ND F ++ +G
Sbjct: 285 TVYSPYMEVVQYSGDAAGSFATQLTISHAICRYHINTKSTPLQKLKQVNDYFNQVSGYFG 344
Query: 337 AASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEM--IMLTGGDNKDSIFEESEEDYD 394
+ YN G + W WQ CTE T F +
Sbjct: 345 CNDIDYN--GFISFMKDETFGEAQSDRAWVWQTCTEFGYYQSTSSATAGPWFGGVS---N 399
Query: 395 ARARY----CKEAYGVDPRPNWI------TTEFGGHKIGLVLKRFASNIIFFNGLRDPWS 444
A+Y C YG + T ++ G + L R I+ NG DPW
Sbjct: 400 LPAQYYIDECTAIYGAAYNSQEVQTSVDYTNQYYGGRDNLNTDR----ILLPNGDIDPWH 455
Query: 445 GGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
G L + + ++ +V AH D+ ++ D +L + R+R +++ W+
Sbjct: 456 ALGKLTSSNSNIVPVVINGTAHCADMYGASSLDSMYLTNARQRISDVLDGWL 507
>gi|296817873|ref|XP_002849273.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238839726|gb|EEQ29388.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 544
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 140/564 (24%), Positives = 225/564 (39%), Gaps = 93/564 (16%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSS--RITPEKLSSLISSSKDSQGLYKTKYHTQIL 62
+ L+ CLL L + P P S R ++ + L++ S+D + +
Sbjct: 1 MVKLTACLLL---LVAAVQAKLPVTPISQLRAQAQRNNELVARSEDVNAQFPAHQIKIPI 57
Query: 63 DHF----NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW---FAQNTGFM 115
DHF Y P + + F RY + +H+ K P+ + G E D E F Q G +
Sbjct: 58 DHFPKSQRYEPHTMEKFNLRYWFDASHY---KEGGPVIILHGGETDGEGRIPFLQK-GIL 113
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK- 174
+A + V +EHRYYG S+P ++ + K+ +L++ QALAD A ++K
Sbjct: 114 AQLAQATNGIGVVMEHRYYGGSLP---TRDFSNKSLR---FLTTEQALADTAYFSQNIKF 167
Query: 175 -----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSP 226
NLTA + +V+GGSY G A+ R +YP V GA++SS I ++ P
Sbjct: 168 PGLEKYNLTAPGTAHIVYGGSYAGGQVAFLRTQYPDVFWGAISSSGVTKAIYDYWQYFEP 227
Query: 227 YSFSNI-----ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA-FRICKSEKN 280
+TQ+F + +N +I G + K GL +L+ A F S
Sbjct: 228 IRQEAPQDCVHVTQNFVDIVDNI--IIHGKNANTTQELKNLFGLGRLRDADFANALSS-- 283
Query: 281 LAIESWLSTAF--------VYTAMTDYPTPSNFLNPLPA---FPVKEMCKAIDDPKTGND 329
I W ST + Y +D T +L P+ A K + +A + +
Sbjct: 284 -GITGWQSTNWDPAISGKSFYKYCSDI-TSDRYLYPVTAEQKASAKRIIEAGGHGREYPE 341
Query: 330 VFAKLYGAASVYYNYSGTAKCFD--------LNG--------DSDPHGLSEWGWQACTEM 373
+ +L + N + A C + LN DS W WQ C E
Sbjct: 342 ILPQLLNFVG-WLNKNTLATCSEQGQTAEECLNSYDEDFYKQDSADQSWRAWPWQYCNEW 400
Query: 374 IML-TGGDNKDSIFEESEE--DYDARARYCKEAYGVDPRPNW-ITTEFGGHKIGLVLKRF 429
L TG +I D + CK+A+G+ N + ++G + + R
Sbjct: 401 GYLQTGSGTPKNIRPVISRLIDLPYTSNICKQAFGITKPSNVDLVNKYGA--LDIEYDRL 458
Query: 430 ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK------EGAHHVDL------RFSTKED 477
A F +G DPW GV ++ K + HH D + +
Sbjct: 459 A----FIDGASDPWKEAGVHATAARKRRTSTNKPYILIPQAVHHWDENGLYPNETTAELP 514
Query: 478 PQWLKDVRRREVEIIGKWISQYFQ 501
PQ +KDV+ E + +W+ ++ Q
Sbjct: 515 PQRIKDVQAEEARFVKEWMKEWKQ 538
>gi|262176822|gb|ACY27467.1| serine protease Pro1 [Plasmodiophora brassicae]
Length = 467
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 117/460 (25%), Positives = 190/460 (41%), Gaps = 63/460 (13%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q++DH + F QRY D W G + P+ + E F +A
Sbjct: 29 QLVDHTAIGGGGAR-FLQRYFRIDQFWSGP--DGPVILQLCGEYTCAGVTDGRQFPSALA 85
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL-- 177
++ AL++ +EHRY+GKS P+ ++ +N + YL++ QAL+D A D + +
Sbjct: 86 ERYGALVLVLEHRYFGKSSPF---SVLSPRNLT---YLTTFQALSDIACFT-DWYQRVHI 138
Query: 178 -TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS-FSNIITQ 235
A + + GGSY G LAAW+RLKYPH+ GALASSA +V+P++ F Q
Sbjct: 139 GRANANKWITIGGSYPGALAAWYRLKYPHLTAGALASSA-------VVAPFAEFPEFDEQ 191
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA-FVYT 294
S C ++ +E ++ G L KA C S LS A F+Y
Sbjct: 192 VASSAGPACTHALQDITAMVEGALQEGGRLADEMKALFSC---------SQLSDADFLYL 242
Query: 295 AMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL--YGAASVYYNYSGTAKCFD 352
+ + L +C I ++ +D A + + Y + S + +D
Sbjct: 243 IADAMAEAIQYGHRL------ALCDPIVQAESRDDRLAAFVEFVQGTFYASMSNSPGDYD 296
Query: 353 LNGDS------DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG- 405
+ + D G +W W C E+ +SI + R R C++ +G
Sbjct: 297 SDTMASHRWVPDSSG-RQWWWMKCNEVGWFQIAPGTNSIRSKRVNMEWHRDR-CEKLFGD 354
Query: 406 ---VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI------SKTV 456
P + + E+ G + + SN++F NG+ DPW G +V
Sbjct: 355 VLAFPPPCHRASIEYSGFDMSV------SNVVFTNGVEDPWQWAGASAFSSSAHLRDSSV 408
Query: 457 LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
L + + H VDL + +D L R + I +W+
Sbjct: 409 LLINCSQCGHCVDLHTPSPDDAPALTTARSTIIAHIDRWL 448
>gi|414870774|tpg|DAA49331.1| TPA: putative serine peptidase S28 family protein [Zea mays]
Length = 357
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 101/199 (50%), Gaps = 30/199 (15%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q LDHFN P ++ F+QRY ++ N PIF+Y E N ++ VA
Sbjct: 63 QTLDHFN--PTDHRQFKQRYYEFLDYYRAP--NGPIFLYICGESTCNGIGNN--YLAVVA 116
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT---GYLSSTQALADYASL------I 170
KF A LV EHRYYGKS P+ N+ TT +LSS QAL D A
Sbjct: 117 KKFGAALVSPEHRYYGKSSPF---------NSLTTENLQFLSSKQALFDLAVFRQYYQET 167
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
++ K N + +S VFGGSY G L+AWFRLK+PH+ G+LASS +L + Y+F+
Sbjct: 168 LNAKYNRSGANSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL------AVYNFT 221
Query: 231 NIITQDFRSVSENCYKVIK 249
+ Q S C ++
Sbjct: 222 DFDKQIGISAGPECKAALQ 240
>gi|297838667|ref|XP_002887215.1| hypothetical protein ARALYDRAFT_894677 [Arabidopsis lyrata subsp.
lyrata]
gi|297333056|gb|EFH63474.1| hypothetical protein ARALYDRAFT_894677 [Arabidopsis lyrata subsp.
lyrata]
Length = 274
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 128/323 (39%), Gaps = 115/323 (35%)
Query: 183 PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSE 242
P V GG+ AWFRLKYP++AIGALA SA L FD I+ + FR +S
Sbjct: 62 PTKVHGGN-----PAWFRLKYPYIAIGALAPSAHTLQFDKIIR----LEASMRPFRRISR 112
Query: 243 NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTP 302
+ + V S WL +AF+ TAM + PTP
Sbjct: 113 DLHSVTSAS---------------------------------RWLESAFIDTAMVNCPTP 139
Query: 303 SNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGL 362
SN + PLPA+PV+E+ I+D + + + AA NYS + CFD+ +DPHG
Sbjct: 140 SNSMAPLPAYPVEEI---IEDGEASK--LDRAFAAA----NYSDSENCFDIENQTDPHGF 190
Query: 363 SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKI 422
+ W +M+M N++ +F E D ++ G+ PR G +
Sbjct: 191 IQ--WLGLADMVMPINYSNQN-MFPPFENDDKGD----QDPTGLPPR-------IHGAEK 236
Query: 423 GLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLK 482
GL+ +R N
Sbjct: 237 GLLRRRPCVN-------------------------------------------------- 246
Query: 483 DVRRREVEIIGKWISQYFQDLAQ 505
+ RR+EV I KWIS+Y+ DL Q
Sbjct: 247 EQRRQEVAEIEKWISEYYTDLRQ 269
>gi|133930823|ref|NP_501598.2| Protein PCP-3 [Caenorhabditis elegans]
gi|119662054|emb|CAB05185.2| Protein PCP-3 [Caenorhabditis elegans]
Length = 1080
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 112/464 (24%), Positives = 182/464 (39%), Gaps = 48/464 (10%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMYD 117
Q DHF+ Q+ FQQ++ N W +K P F+ G EG W
Sbjct: 583 QRQDHFD--NQNADFFQQKFFKN-AQW--AKQGGPNFLMIGGEGPESARWVLNENITYLT 637
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI--IDLKK 175
A K+ A + +EHR+YG S+ G N L+S Q L D A I ++++
Sbjct: 638 WAKKYGATVYLLEHRFYGDSV-VGDNTNFKL--------LNSLQMLYDLAEFIKAVNIR- 687
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ 235
T T +P + FGGSY G ++AW R +P + +GA+ASS P+ + Y + ++
Sbjct: 688 --TGTSNPWITFGGSYSGAMSAWMREVFPDLVVGAVASSGPVYAKTDF---YEYLMVVEN 742
Query: 236 DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI----CKSEKNLAIESWLSTAF 291
R + C I+ + I G + L F++ S + + S +
Sbjct: 743 SVRRYNSKCADNIQSGFDAIRTLFLTKEGRQNLSSIFQLQPPFSDSVTDTDQHYFFSNVY 802
Query: 292 -VYTAMTDYPTPSNFLNPLPAFPVKEMCKAI--DDPKTGNDVFA------------KLYG 336
+ Y + N + + +MCK + D N++ A + G
Sbjct: 803 GNFQGAVQY-SGDNTGPYANGYGIPDMCKIMSNDSNTPLNNIVAFNQFMIIFYNGGQYTG 861
Query: 337 AASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR 396
+ Y N K G GL W WQ C+E D+ + IF S +
Sbjct: 862 MDNNYQNMITYLKTAQHYGPDSAAGL-LWTWQTCSEFGYFQSADSGNGIF-GSPTPVNMY 919
Query: 397 ARYCKEAYGVDPRPNWITTEFG--GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
+ C + + + I +K G +N++ NG DPW G+
Sbjct: 920 VQMCMDVFNNQYQRTSIDYSIANTNYKYGERFHYRGTNVVLPNGNVDPWHALGLYYPTDS 979
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+V++ + AH D+ + D LK VR + I KW++Q
Sbjct: 980 SVVSYLIDGTAHCADMYPARDADVPGLKVVRDLVDQNIAKWLNQ 1023
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 94/411 (22%), Positives = 158/411 (38%), Gaps = 42/411 (10%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGF 114
Y Q LDHF N TF QRY + + + F+Y +G + +
Sbjct: 47 YMLQSLDHFIGNASG--TFSQRYFYTQQY---TLHQRTAFLYVSADGVEEAAVISDERNP 101
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI--ID 172
+ A +F A + +EHRYYG+S P +A +L+S QA+ D S I ++
Sbjct: 102 IVKTAKQFGATIFSLEHRYYGQSRP-----NFDKFDAQNLRHLNSLQAILDIISFIKSVN 156
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
++ N+ D V++G YGG+LAA R P G +ASS+P+ + + + F++
Sbjct: 157 VQFNMDP-DVRWVLWGAGYGGILAAEARKWDPVTISGVIASSSPLTHLYDF---WQFNDQ 212
Query: 233 ITQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
+ F V CY ++ + I + + P G + F++ + F
Sbjct: 213 VATTFSQVGGGLCYNKVRQGFADIRQAMRTPEGRRNVSSLFQLNPRLDQTPLNYNDVQIF 272
Query: 292 VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLY---------GAASVY 341
+ + F N + +MC ID T +V + Y +
Sbjct: 273 YLLIIAPFQQIVQFNNDF-NISISDMCTTIDKSTWTNMEVVRQAYVYLSTTITGSVQPMV 331
Query: 342 YNYSGTAKCFDLNGDSDPH-GLSEWGWQACTEM-IMLTGGDNKDSIFEESEEDYDARARY 399
+Y S P+ W +Q CTE T +N++ +F + +
Sbjct: 332 TSYQTIVNDLGNQSASSPYLDQRMWQYQTCTEFGWFYTTNNNENGLFGAVVPG----SLF 387
Query: 400 CKEAYGVDPRPNWITTEFGGHKI------GLVLKRFASNIIFFNGLRDPWS 444
+ + + P N T I G +N +F NGL DPW+
Sbjct: 388 LNQCFDIFPDANLTATSIRDLVIEYNNYYGSAFDYSGTNAVFTNGLLDPWT 438
>gi|253743733|gb|EET00051.1| Thymus-specific serine protease precursor [Giardia intestinalis
ATCC 50581]
Length = 521
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 96/340 (28%), Positives = 148/340 (43%), Gaps = 43/340 (12%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
+H Q +DHFN P + + + QRY NDT++ K P+F+ G EG ++ G +
Sbjct: 59 FHDQRVDHFN--PVNTKKWSQRYYYNDTYY---KAGGPVFLMIGGEGPAT--PRDVGDYF 111
Query: 117 DV---APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+ A L V +EHR+YG S P ++S L S QALAD A+ + L
Sbjct: 112 SIDYFAKSMSGLKVALEHRFYGASFPS--------TDSSDLSLLRSDQALADIATFLAYL 163
Query: 174 KKNLTATDSP-VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
KK S VV GGSY G LAAW R+++P V A++SS P L + P +I
Sbjct: 164 KKEYNLPKSTKVVAVGGSYSGNLAAWARIQFPFVIDAAISSSGPYLAQTDY--PEYLQHI 221
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSE--KNLAIE--SWLS 288
Q +S + C +I + E L F++ + + A + S++S
Sbjct: 222 DAQIRKSGGDRCMDIISAAHTDAEYLLNHDKA--TLATIFKLKEESIYNSTAYDKASFMS 279
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID---DPKTGNDVFAKLYGAASVYYN-Y 344
T + + Y + +K+MCK+I+ D + + +L AS + Y
Sbjct: 280 TMGAPSGVVQYAKHDGYYTDTKDGDIKQMCKSIEAYYDAYDSGESYQQLKAYASWLLDYY 339
Query: 345 SGTAKCFDLNGDSDPHGLSE------------WGWQACTE 372
G+ + DL+ D L + W WQ C E
Sbjct: 340 GGSMEEIDLSFDGYIKALQDTSIDSEFAVDRSWMWQTCVE 379
>gi|302796113|ref|XP_002979819.1| hypothetical protein SELMODRAFT_111339 [Selaginella moellendorffii]
gi|300152579|gb|EFJ19221.1| hypothetical protein SELMODRAFT_111339 [Selaginella moellendorffii]
Length = 472
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 126/468 (26%), Positives = 194/468 (41%), Gaps = 61/468 (13%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
++TQ LDH Y Q +TF QRY ++ N P+F+ EG + +
Sbjct: 36 WYTQTLDH--YATQDDRTFSQRYYEFTDYFDAP--NGPVFLKICPEGPCVGIQNDYSAV- 90
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------I 170
+A +F A +V +EHRYYG+S P+ K A +N YLSS QAL D A+ +
Sbjct: 91 -LAKRFGAAIVSLEHRYYGQSSPF---KTHATENLI---YLSSKQALFDLAAFREYYQDL 143
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
I+ + N T +D+P +V G SY G L+AWF+LK+PH+A+G++ASS IV
Sbjct: 144 INHRTNST-SDNPWIVIGWSYAGALSAWFKLKFPHLAVGSVASSG-------IVQAIFDV 195
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
I Q S C ++ + E+ GL+K + + + + L ++
Sbjct: 196 RIHLQVAESAGATCSAALRAVTRLAEQ------GLKKNSMSTKALFNAEQLDVDGDFLYL 249
Query: 291 FVYTAMT--DYPTPSNFLNPL-PAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGT 347
A T Y P +PL A+ E A+ + L + + Y
Sbjct: 250 LADAATTAFQYGNPEILCSPLVAAYKRNEDLLAVYAKYVKDYYIDTLKSSINTY-----D 304
Query: 348 AKCFDLNGDSDPHGLSEWGW-QACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY-- 404
K N + H GW Q CTE+ SI + + C +
Sbjct: 305 QKHLKENLAAGDHSSDRLGWYQMCTELGYFQVAPANSSI-RSALINVKYHLDLCSNVFEN 363
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK-- 462
G P + +GG+KI I+F NG +DPW + S+ A V K
Sbjct: 364 GTFPEVDNTNLYYGGNKIR------GDKILFMNGSQDPWRHASK-QTSSRNEPAYVIKCQ 416
Query: 463 EGAHHVDLRFSTKEDPQW--------LKDVRRREVEIIGKWISQYFQD 502
AH VD+ + PQ +V + +I +IS++ +D
Sbjct: 417 NCAHGVDMLGCPQSPPQIGGNTSKCADPEVAQAGQRLIATYISRWLED 464
>gi|340387216|ref|XP_003392103.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like, partial
[Amphimedon queenslandica]
Length = 159
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/149 (40%), Positives = 89/149 (59%), Gaps = 11/149 (7%)
Query: 41 SLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTG 100
S+ S+ D Y T Y+TQ LD+FN N + +TF QR L +W KN+ +F Y G
Sbjct: 22 SVNSTKDDPIPPYTTSYYTQQLDNFNSNDK--RTFNQRILTAKQYW---KNDVLLF-YPG 75
Query: 101 NEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSST 160
NE I+ F NTGF++++A +F+AL+VF EHRYYG ++P+G N + YLS
Sbjct: 76 NEAPIDEFYNNTGFLFELAERFQALVVFAEHRYYGDTLPFGPQDTFTPANMA---YLSVG 132
Query: 161 QALADYASLIIDLKKNLTATDSPVVVFGG 189
QA+AD++ L+ D++ + V+V GG
Sbjct: 133 QAMADFSRLVQDIRDKMNI--KKVIVIGG 159
>gi|170045806|ref|XP_001850485.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868713|gb|EDS32096.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 485
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 104/425 (24%), Positives = 177/425 (41%), Gaps = 63/425 (14%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
S S+ +++T+ +DHFN PQ+ TF+ +Y ND ++ + PIF++ G
Sbjct: 51 SQSRSIGTMFRTR-----VDHFN--PQNRDTFEFQYYSNDEYY---QPGGPIFIFVGGNW 100
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
+E + G +D+A A L EHRYYG S+P ++++ N +L+ QAL
Sbjct: 101 PVEQYYIEHGHFHDIAYYENAWLFANEHRYYGSSLPV---EDLSTPNLR---FLTVEQAL 154
Query: 164 ADYASLIIDLKKNLTATDSP-VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI-LNFD 221
D LI ++ N+ D+ V++ G Y G +A W R +YPH+ G+ SS + FD
Sbjct: 155 VDLGELIYHIRHNVVRDDNARVILLGVGYAGAIATWMRQRYPHLVDGSWVSSGQVDARFD 214
Query: 222 NIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SE 278
+I ++ CY I +++ E G E + + F C E
Sbjct: 215 FGQHAVEVGGLIRD---HGNDECYSQIWRAFRTAEALLDA-GRTETVSELFNTCSPIDEE 270
Query: 279 KNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAA 338
L +E++ + + A+ + L+ +C+ +++ D L A
Sbjct: 271 NMLDVETFFFS--IKEAIQ-----TEVLSEQNVVYTDRLCQTLNNSTESTD----LQTLA 319
Query: 339 SVYYNYSGTAKC--FDLNGDSDPH-------------GLSEWGWQACTEMIMLTGGDNKD 383
+ +++ C FD ++ H GL + +Q CTE D+ D
Sbjct: 320 NWVHDHFYFLDCMPFDFESSTEIHSIVDVDAIENRILGLRQRYYQFCTEFGWFLTADSAD 379
Query: 384 SIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFA------SNIIFFN 437
F Y + +CK YG W+T E + L F +N++F N
Sbjct: 380 QPFGYRITMY-LFSNFCKAVYG-----EWLTAEVVADGVHLTNMHFGGKDPRIANVLFTN 433
Query: 438 GLRDP 442
G DP
Sbjct: 434 GGLDP 438
>gi|194744673|ref|XP_001954817.1| GF16552 [Drosophila ananassae]
gi|190627854|gb|EDV43378.1| GF16552 [Drosophila ananassae]
Length = 597
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 110/430 (25%), Positives = 183/430 (42%), Gaps = 46/430 (10%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q +DHF+ N T++ RY N + PI+++ G E I +TG +D+A
Sbjct: 55 QRVDHFDENNNG--TWKMRYFRNAKY---HNPQGPIYIFVGGEWTISPGLMSTGLTHDMA 109
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK-KNLT 178
+ +L + EHRYYG S+P+ GNK + L+ Q+LAD A I K N
Sbjct: 110 VENSGMLFYTEHRYYGLSLPH-GNKSLKVHQLKQ---LNLQQSLADLAFFIRHQKSNNPE 165
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
DS V++ GGSY G + W +YP + + ASSAP+L + Y + ++++ +
Sbjct: 166 LKDSKVILVGGSYSGSMVTWMTQRYPDLIAASWASSAPLLAKADF---YEYMDMVSSSVQ 222
Query: 239 -SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMT 297
S +NC + I ++ + + + + L F C K+ ++ L A + +
Sbjct: 223 LSYGQNCSQRISRGFEYLVKLFHE-NNIRTLLSKFNGC---KDYDPKNPLDRAAFFNGLG 278
Query: 298 DYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDV-FAK----LY--GAASVYYNYSGTAKC 350
+Y + + A+ + +C+++ ++V F K LY G S G +
Sbjct: 279 NY--FALIVQSYSAY-IPRLCESLMSLDASDEVAFIKFLELLYSEGRRSTACQDFGYSSM 335
Query: 351 FDLNGDSDPH--GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR--ARYCKEAYGV 406
L + D H G W +Q C E T +K S+ + + C++A+G
Sbjct: 336 LQLFSEEDDHGSGTRAWFYQTCNEFGWYTTTQSKSSLSAAFAKQVPLSYFEKLCQDAFGP 395
Query: 407 DPRPNWITT-------EFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLAL 459
+ P + EFGG G + +IF +G DPW G VL
Sbjct: 396 EQTPQKLARGIRQTNLEFGG--FGFNHSERYAQVIFTHGQLDPWRALGQQTGSQAIVLT- 452
Query: 460 VEKEGAHHVD 469
G HV+
Sbjct: 453 ----GYSHVE 458
>gi|198477695|ref|XP_002136487.1| GA23306 [Drosophila pseudoobscura pseudoobscura]
gi|198145255|gb|EDY71959.1| GA23306 [Drosophila pseudoobscura pseudoobscura]
Length = 292
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 71/223 (31%), Positives = 112/223 (50%), Gaps = 16/223 (7%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T++ T LD N+N + T++ R LIN+ H+ + +PIF+Y G E +IE A +G
Sbjct: 55 ETRWFTLKLD--NFNAANNATWKDRVLINEDHF---TDGSPIFIYLGGEWEIEPSAITSG 109
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
D+A + L++ EHR++GKS P I + Y S QALAD +I L
Sbjct: 110 LWVDIAKEHNGSLIYTEHRFFGKSFP------ITPLSTKNLKYQSVQQALADVVHIIKTL 163
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K DS V+V G SY +A W R YP + +G+ ASSAP+ + V + ++
Sbjct: 164 KLEDKYKDSKVIVSGCSYSATMATWIRKLYPDIILGSWASSAPL---EAKVDFKDYMEVV 220
Query: 234 TQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC 275
Q F + + CY +I + ++ + G K +K +C
Sbjct: 221 GQAFEQLGGKYCYDLIDNATSYYQDLFEGGQG-AKAKKELNLC 262
>gi|158291876|ref|XP_313404.4| AGAP003639-PA [Anopheles gambiae str. PEST]
gi|157017498|gb|EAA08831.4| AGAP003639-PA [Anopheles gambiae str. PEST]
Length = 507
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 113/464 (24%), Positives = 199/464 (42%), Gaps = 57/464 (12%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-WFAQNTGFMYD 117
T ++HF+ PQ+ TF+ YL ND ++ + P+F+ G + +F +N+ F D
Sbjct: 66 TSRVNHFD--PQNRDTFEFNYLHNDQYY---RQGGPLFIVVGGHYPVNPYFMENSHFR-D 119
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
VA A L EHRY+G+S P ++++ +N ++ + Q L D I LK+ +
Sbjct: 120 VAALEGAWLATNEHRYFGESYP---TEDLSTENLR---FMRTEQVLFDLIEWIDFLKREV 173
Query: 178 TAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNII 233
++ V++ G YGG LA W R ++P++ GA SSAP+ NF+ NII
Sbjct: 174 MGDPNARVILHGVGYGGSLATWARQRFPNIIDGAWGSSAPVRATTNFEEFA--VEVGNII 231
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVY 293
+ S+ CY I ++ E G E + + F C + ++ L +
Sbjct: 232 RE---RGSDQCYNRIFQAFHTAENLIDA-GRTEMISEMFNTCDP---VDTDNPLEVELFF 284
Query: 294 TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDL 353
AM + + + + +C A+ D + G + A +A + Y+ T +CFDL
Sbjct: 285 FAMM-FSLEAAMVEDYDIENIGRVCDALTDDEFGTGLEAL---SAFLLDRYADTRECFDL 340
Query: 354 N--------GDSDPH-------GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARAR 398
+ D D + GL + G+Q CTE + D F ++ YD
Sbjct: 341 SFENFIRYLTDVDINAPANVEFGLRQAGYQDCTEFGYFEMTTSPDQPF-GTKVTYDLFLA 399
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFA------SNIIFFNGLRDPWSGGGVLKNI 452
C+ AYG +W++ + + L F +N+++ NG DP + +
Sbjct: 400 ECQAAYG-----DWLSKDVVYEGVRLTNFHFGATDPRITNVLYTNGELDPLRAVSITEYT 454
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ A V D++ + D + L + + + I W+
Sbjct: 455 NLLANARVTPGAFIGQDIQSISGMDSEELLETKHMAEQYITTWL 498
>gi|341889106|gb|EGT45041.1| hypothetical protein CAEBREN_04354 [Caenorhabditis brenneri]
Length = 668
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 112/428 (26%), Positives = 174/428 (40%), Gaps = 44/428 (10%)
Query: 100 GNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSS 159
G GD +W M A +F A +EHR+YG S Y I + S+ L+
Sbjct: 10 GTNGD-KWVRHEAETMMTWAAEFGAAAFQVEHRFYG-SKDY---SPIGDQTPSSMKLLTI 64
Query: 160 TQALADYASLIIDLKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL 218
QALAD I + D P+ V FGGSY G L+AWFR YP + GA++SS+ +
Sbjct: 65 DQALADIKEFITQMNALYFKDDKPIWVTFGGSYPGSLSAWFRETYPEMTAGAVSSSSAVH 124
Query: 219 NFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWK-QIEETAKKPGGLEKLQKAFRICK- 276
F V Y ++ + +R+VS++C I +++ I++ L++ F +C
Sbjct: 125 VF---VDYYGYAINTEKTYRTVSDSCANTIGVAFQSMIQKAYAGKDSRILLKQQFNLCDD 181
Query: 277 -SEKNL--AIESWLSTAFVYTAMTDYPTPSNFLNPL-PAFPVKEMCKAIDDPKTGNDV-- 330
E NL +++ + + Y + T N N V C +++ K G+++
Sbjct: 182 FDENNLSKSLQFFFQNVYGYFQGINQYTGDNRNNATRSGLGVPAACDILNNNKNGDEINR 241
Query: 331 FAKLYGAASVYYNYSGTAKCFDLN-------------GDSDPHGLSEWGWQACTEMIMLT 377
++ +Y S T C N D D W WQ CTE+
Sbjct: 242 VVQVMNLYDSWYPPSATG-CRPNNYTSFIQYYSDTTMPDDDRISTRSWIWQTCTELGYYQ 300
Query: 378 GGDNKDSIFEESEEDYDARARYCKEAYGVD-------PRPNWITTEFGGHKIGLVLKRFA 430
D + S D A C + +G + + + T++GG
Sbjct: 301 TTDGGNRGIFGSTVPLDFFADQCIDLFGPEYTLDNTFKLIDQVRTKYGGAD-----AYRG 355
Query: 431 SNIIFFNGLRDPWSG-GGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREV 489
+N+ F NG DPW G N V + + AH D+ + D Q LKD R R
Sbjct: 356 TNVCFPNGSFDPWQDLGHKANNTDNNVDSWLIDGTAHCADMYPARDSDKQSLKDARVRIH 415
Query: 490 EIIGKWIS 497
+ KW+S
Sbjct: 416 GHLSKWLS 423
>gi|302807545|ref|XP_002985467.1| hypothetical protein SELMODRAFT_122204 [Selaginella moellendorffii]
gi|300146930|gb|EFJ13597.1| hypothetical protein SELMODRAFT_122204 [Selaginella moellendorffii]
Length = 472
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 129/477 (27%), Positives = 195/477 (40%), Gaps = 61/477 (12%)
Query: 49 SQGLYKTKYH--TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE 106
S L T H TQ LDH Y Q +TF QRY ++ N P+F+ EG
Sbjct: 26 SDALLTTDAHWYTQTLDH--YATQDDRTFSQRYYEFTDYFDAP--NGPVFLKICPEGPCV 81
Query: 107 WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADY 166
+ + +A +F A +V +EHRYYG+S P+ K A +N YLSS QAL D
Sbjct: 82 GIQNDYSAV--LAKRFGAAIVSLEHRYYGQSSPF---KIHATENLI---YLSSKQALFDL 133
Query: 167 ASL------IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
A+ +I+ + N T D+P +V G SY G L+AWF+LK+PH+A+G++ASS
Sbjct: 134 AAFREYYQDLINHRTNST-RDNPWIVIGWSYAGALSAWFKLKFPHLAVGSVASSG----- 187
Query: 221 DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN 280
IV I Q S C ++ + E+ GL+K + + + +
Sbjct: 188 --IVQAIFDVRIHLQVAESAGATCSAALRAVTRLAEQ------GLKKNSVSTKALFNAEQ 239
Query: 281 LAIESWLSTAFVYTAMT--DYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAA 338
L ++ A T Y P +PL A ++ K D + + ++
Sbjct: 240 LDVDGDFLYLLAEAATTAFQYGNPEILCSPLVAAYKRDEDLLAVYAKYVKDYYIDTFKSS 299
Query: 339 SVYYNYSGTAKCFDLNGDSDPHGLSEWGW-QACTEMIMLTGGDNKDSIFEESEEDYDARA 397
Y+ K N + H GW Q CTE+ +SI + +
Sbjct: 300 INTYD----QKHLKENLAAGDHSSDRLGWYQMCTELGYFQVAPANNSI-RSALINVKYHL 354
Query: 398 RYCKEAYGVDPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT 455
C +G P T +GG+KI I+F NG +DPW + S+
Sbjct: 355 DLCSNVFGNGTFPEVDDTNLCYGGNKIR------GDKILFMNGSQDPWRHASK-QTSSRN 407
Query: 456 VLALVEK--EGAHHVDLRFSTKEDPQWLKDVRRRE--------VEIIGKWISQYFQD 502
A V K H VD+ + PQ + + II +IS++ +D
Sbjct: 408 EPAYVIKCQNCGHGVDMLGCPQSPPQIRGNTSKCADPEAAQAGQRIIATYISRWLED 464
>gi|302807385|ref|XP_002985387.1| hypothetical protein SELMODRAFT_122090 [Selaginella moellendorffii]
gi|300146850|gb|EFJ13517.1| hypothetical protein SELMODRAFT_122090 [Selaginella moellendorffii]
Length = 481
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 124/472 (26%), Positives = 204/472 (43%), Gaps = 63/472 (13%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
++TQ LDH Y Q +TF QRY ++ N P+F+ EG + +
Sbjct: 39 WYTQTLDH--YATQDDRTFAQRYYEFTDYFDAP--NGPVFLKICGEGTCVGIQNDYSAV- 93
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------I 170
+A +F A +V +EHRYYG+S P+ K A +N YLSS QAL D A+ +
Sbjct: 94 -LAKRFGAAIVSLEHRYYGQSSPF---KSHATENLI---YLSSKQALFDLAAFREYYQDL 146
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
I+ + N T +D+P +V GGSY G L+AWF+LK+PH+A+G++ASS + + + F+
Sbjct: 147 INHRTNST-SDNPWIVMGGSYSGALSAWFKLKFPHLAVGSVASSGVVQ------AIFKFT 199
Query: 231 NIITQ-DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLST 289
Q V+E+ + + + A++ GL++ + + + + L ++
Sbjct: 200 KFDEQARLFLVAESAGATCSAALRAVTRLAEQ--GLKENSVSTKALFNAEQLDVDGDFLY 257
Query: 290 AFVYTAMT--DYPTPSNFLNPL-PAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSG 346
A Y P +PL A+ E A+ K D + + ++ Y+
Sbjct: 258 LLADAAAIAFQYGNPDILCSPLVAAYKRNEDLLAV-YAKYVKDYYIDTFKSSINTYDQKH 316
Query: 347 TAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY-- 404
+ GD L W +Q CTE+ +SI + + C +
Sbjct: 317 LKENL-AAGDHSSDRL--WWYQVCTEVAYFQAAPANNSI-RSALVNVKYHLDLCSNVFEN 372
Query: 405 GVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK-- 462
G P + +GG+KI I+F NG +DPW + S+ A V K
Sbjct: 373 GTFPEVDNTNLYYGGNKIR------GDKILFMNGSQDPWRHASK-QTSSRNEPAYVIKCQ 425
Query: 463 EGAHHVDLR------------FSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
AH VD+R S DP+ + ++I +IS++ +D
Sbjct: 426 NCAHCVDMRGCPQTPLQIGGNTSKCADPE----AAQAGQQLIATYISRWLED 473
>gi|95768136|gb|ABF57331.1| protease, serine, 16 [Bos taurus]
Length = 230
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 63/157 (40%), Positives = 91/157 (57%), Gaps = 13/157 (8%)
Query: 48 DSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW 107
DS L K + Q LD FN + + ++F QRY +ND HW + + P+F++ G EG +
Sbjct: 50 DSVTLPKEGWLEQPLDPFNASDR--RSFLQRYWVNDQHW--TSQDGPVFLHLGGEGSLGP 105
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA 167
+ G ++AP + AL++ +EHR+YG SIP G + +LSS ALAD A
Sbjct: 106 GSVMRGHPANLAPIWGALVISLEHRFYGLSIPAEGLDMAQLR------FLSSRHALADAA 159
Query: 168 SLIIDLKK--NLTATDSPVVVFGGSYGGMLAAWFRLK 202
S + L + N+++T SP + FGGSY G LAAW RLK
Sbjct: 160 SARLTLSRLFNVSST-SPWICFGGSYAGSLAAWARLK 195
>gi|15233057|ref|NP_189509.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|332643953|gb|AEE77474.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 199
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 100/195 (51%), Gaps = 16/195 (8%)
Query: 187 FGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYK 246
F G+ +LAAWF+LKYP++A+GALASSAP+L F++ + + + I+T+ F+ +S+ C+
Sbjct: 16 FHGAVHKVLAAWFKLKYPYIALGALASSAPLLYFEDTLPKHGYFYIVTKVFKEMSKECHN 75
Query: 247 VIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL 306
I SW +I+ A KP L L K F++C ++ IE +++Y Y
Sbjct: 76 KIHKSWDEIDRIAAKPNSLSILSKNFKLCNPLNDI-IELKSYVSYIYARTAQYSDNQ--- 131
Query: 307 NPLPAFPVKEMCKAIDD--PKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE 364
F V +C+AI+ P T +D+ +++ G C+ ++ S +
Sbjct: 132 -----FSVARLCEAINTSPPNTKSDLLDQIFAGVVA---SRGNISCYGMSSPSYQMTNDD 183
Query: 365 --WGWQACTEMIMLT 377
WGWQ + L+
Sbjct: 184 RAWGWQTIHISVTLS 198
>gi|194744683|ref|XP_001954822.1| GF16550 [Drosophila ananassae]
gi|190627859|gb|EDV43383.1| GF16550 [Drosophila ananassae]
Length = 489
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 99/464 (21%), Positives = 193/464 (41%), Gaps = 50/464 (10%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+ ++ TQ LD+F+ + + T++ R LIN+ ++ + +PIF+Y G E I+ +G
Sbjct: 56 EERWITQWLDNFDGDNNA--TWEDRILINEDYF---VDGSPIFIYLGGEWKIQPGDITSG 110
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
D+A + +V EHR++G+S+P + + Y + QALAD ++I +L
Sbjct: 111 LWVDIAKQHNGTIVTTEHRFFGESLPI-----TPFSTENLEKYQNVNQALADVINVIENL 165
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K+ DS +V+ G SY +A W R YP +G+ ASSAP++ V + +I
Sbjct: 166 KEEDKYKDSKIVIHGCSYSASMATWIRKLYPETILGSWASSAPLV---AKVDFKEYFKVI 222
Query: 234 TQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL---S 288
+ ++ + + CY +I + E+ G ++ +K +C + + + + W +
Sbjct: 223 GESYKVLGGQYCYDLIDNATSYYEDLFANGEG-DQAKKELNLCDNFDADNKRDRWQIFST 281
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI----DDPKTGNDVFAKL----YGAASV 340
A ++ + Y P+N + + + C + DD F + +
Sbjct: 282 IANIFAGIAQYQNPAN-------YDIAQYCSVLRSFSDDDSVALSKFINWRIHEHSGQCI 334
Query: 341 YYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEES------EEDYD 394
+ GT ++ D+ + W +Q C+E ++ F S E+ +
Sbjct: 335 SATFKGTTDYYEWAKDNYQDSMLPWFFQTCSEFGWFQSSGSRQQPFGSSFPSKLYEDTCE 394
Query: 395 ARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
G+ EFGG L +N+ F G D W G
Sbjct: 395 TVFGSKYNTAGIRANAKATNAEFGG------LDNDFTNVYFVQGQMDGWRKVGAGVEQGA 448
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
T++ +H D + D L +++ + ++ +W+ +
Sbjct: 449 TIIPY----ASHCPDGGSISASDSPELVASKQKIIALVAQWLEE 488
>gi|302790231|ref|XP_002976883.1| hypothetical protein SELMODRAFT_106030 [Selaginella moellendorffii]
gi|300155361|gb|EFJ21993.1| hypothetical protein SELMODRAFT_106030 [Selaginella moellendorffii]
Length = 393
Score = 102 bits (253), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 119/428 (27%), Positives = 177/428 (41%), Gaps = 72/428 (16%)
Query: 73 QTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHR 132
+ F QRY ++ PIF+ E Q T +A A++V IEHR
Sbjct: 3 RVFSQRYFEFLDYF--QPQQGPIFLALCGESTCGGGYQRTA--QALAKSLGAVVVTIEHR 58
Query: 133 YYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATD------SPVVV 186
YYG+S P+ + +YKN YL++ QAL DYA L ID +NL +P +V
Sbjct: 59 YYGQSYPF---QNFSYKNLK---YLTTQQALYDYA-LFIDYYENLVNLQYNKQGKNPWIV 111
Query: 187 FGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYK 246
GGSY G L+AWFRLK+PH+ + + ASS + V YS + Q SV C K
Sbjct: 112 VGGSYAGALSAWFRLKFPHLVVASWASSGVV----EAVLEYSAYD--EQVGISVGPECKK 165
Query: 247 VIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL 306
++ K EE K + + + L++ + ++ FV D P
Sbjct: 166 ALQEITKLAEEGLVTNATAVKSVFFAQKLRDDDFLSLVADIAAGFVQYGAIDMLCP---- 221
Query: 307 NPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLS--- 363
P+ E AI + +A++ G V + S + ++L ++ + +S
Sbjct: 222 ------PLLE---AIQNKTDLLMAYARIGG---VDSSSSDSYDAYELRRQAEANDISAKD 269
Query: 364 --EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP-----NWITTE 416
W +Q CTE+ DSI S + C +G + P NW
Sbjct: 270 TMSWNYQICTELAYFQVAPTNDSI-RSSRINLQYYIDICAVLFGPNTFPDVSAANW---N 325
Query: 417 FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA-------LVEKEGAHHVD 469
+GG I +S IIF NG +DPW ++ SKT + L + AH D
Sbjct: 326 YGGRDIA------SSRIIFLNGSQDPW------QHASKTTFSPGEPSFVLTCESCAHCED 373
Query: 470 LRFSTKED 477
L D
Sbjct: 374 LSMDCSSD 381
>gi|170045810|ref|XP_001850487.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868715|gb|EDS32098.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 726
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 94/381 (24%), Positives = 160/381 (41%), Gaps = 60/381 (15%)
Query: 24 KIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLIND 83
++ TFP PE +S++ + G K+ T++ DHF+ PQ+ TF+ Y ND
Sbjct: 33 RMLNTFPK----PEVPEGYVSTNPHTIGY---KFRTRV-DHFD--PQNRATFEFEYYSND 82
Query: 84 THWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGN 143
++ + PIF++ G + +E + G +D+A + A + EHRYYG S P
Sbjct: 83 EYY---ERGGPIFIFVGGDWPLEQYYIERGHFHDIAQRTNAWMFTNEHRYYGHSSP---- 135
Query: 144 KEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLK 202
++ + YL+ QA+ D A I L+ N+ T++ V++ G Y G +A W R +
Sbjct: 136 --VSDYSTENLRYLTVEQAMVDLAEWIFHLRNNVVRDTNAKVILLGTGYAGAIATWARQR 193
Query: 203 YPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKK 261
YPH+ G ASS ++ I + R + ++ CY I +++ E
Sbjct: 194 YPHLVDGVWASS-------------EYAEEIGELLRDLGNDECYSRIWRAFR-TAENLMD 239
Query: 262 PGGLEKLQKAFRICK---SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMC 318
G E + + F C+ E L +E++ V A+ D + A +C
Sbjct: 240 AGRTEIVTEMFNTCEPVDEENMLDVETFFFN--VKAALQDAVLRGQNVEATEA-----LC 292
Query: 319 KAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWG------------ 366
+A+++ D+ Y+ FD +D L+E G
Sbjct: 293 EALEESTEETDIQVIAQWLQEFYFFLDCMP--FDFEAHTDAFRLTEIGYPENANLGLRQR 350
Query: 367 -WQACTEMIMLTGGDNKDSIF 386
+Q CTE D+ D F
Sbjct: 351 VYQFCTEFGWFLTADSADQPF 371
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 73/299 (24%), Positives = 127/299 (42%), Gaps = 37/299 (12%)
Query: 74 TFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRY 133
TF+ Y ND ++ + PIF++ G + +E + G +D+A + A + EHRY
Sbjct: 400 TFEFEYYSNDEYY---ERGGPIFIFVGGDWPLEQYYIERGHFHDIAQRTNAWMFTNEHRY 456
Query: 134 YGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA-TDSPVVVFGGSYG 192
YG S P ++ + YL+ QA+ D A I L+ N+ T++ V++ G Y
Sbjct: 457 YGHSSP------VSDYSTENLRYLTVEQAMVDLAEWIFHLRNNVVRDTNAKVILLGTGYA 510
Query: 193 GMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV-SENCYKVIKGS 251
G +A W R +YPH+ G ASS ++ I + R + ++ CY I +
Sbjct: 511 GAIATWARQRYPHLVDGVWASS-------------EYAEEIGELLRDLGNDECYSRIWRA 557
Query: 252 WKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNP 308
++ E G E + + F C+ E L +E++ V A+ D +
Sbjct: 558 FR-TAENLMDAGRTETVTEMFNTCEPVDEENMLDVETFFFN--VKAALQDAVLRGQNVEA 614
Query: 309 LPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGW 367
A +C+A+++ D+ Y+ FD +D L+E G+
Sbjct: 615 TEA-----LCEALEESTEETDIQVIAQWLQEFYFFLDCMP--FDFEAHTDAFRLTEIGY 666
>gi|66770655|gb|AAY54639.1| IP12434p [Drosophila melanogaster]
Length = 486
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 108/466 (23%), Positives = 201/466 (43%), Gaps = 58/466 (12%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+ ++ TQ LD N + + T+Q R IN+ ++ + +PIF+Y G E I+ +G
Sbjct: 54 EERWITQKLD--NSDDSNNATWQDRIYINNKYF---VDGSPIFIYLGGEWAIDPSGITSG 108
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
D+A + L++ EHR++G+SIP ++ +N + Y S QALAD ++I L
Sbjct: 109 LWKDIAKQHNGSLLYTEHRFFGQSIPI---TPLSTENLAK--YQSVEQALADVINVIATL 163
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL---NFDNIVSPYSFS 230
K+ DS VV G SY +A W R YP + G+ ASSAP+L NF + +
Sbjct: 164 KQEDKYKDSKVVASGCSYSATMATWIRKLYPEIIRGSWASSAPLLAKVNFKD------YM 217
Query: 231 NIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL- 287
++ + + ++ + CY +I + E + G + + K +C + N + W
Sbjct: 218 KVVGESYATLGGQYCYDLIDNATSYYENLFEIGNGTQAV-KELNLCSNFNVNSEQDRWQI 276
Query: 288 --STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL----YGAASVY 341
+ A ++ + Y P + P ++E DD F + A +
Sbjct: 277 FSTIANIFAGIAQYQKPEKYDIPTYCSILREFS---DDDSVALSKFINWKINEHSGACLS 333
Query: 342 YNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKD---------SIFEESEED 392
+ G ++ + ++ W +Q C+E ++ +++E++ E
Sbjct: 334 TTFKGAVGYYEWSKENYQDSDLPWIFQTCSEFGWFQSSGSRSQPFGSTFPATLYEDTCEG 393
Query: 393 YDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS--GGGVLK 450
A+Y ++ G+ +FGG + A+NI F G D WS G GV +
Sbjct: 394 VFG-AKY--DSAGIHANIRATNDDFGGLNVN------ATNIYFVQGALDGWSKVGAGVAQ 444
Query: 451 NISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
A + +H D + D L +++ ++++ +W+
Sbjct: 445 G------ATIIPYASHCPDTGSISASDSAELVASKKKLIKLVAQWL 484
>gi|32564815|ref|NP_498759.2| Protein K12H4.7, isoform b [Caenorhabditis elegans]
gi|351063157|emb|CCD71199.1| Protein K12H4.7, isoform b [Caenorhabditis elegans]
Length = 473
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 114/463 (24%), Positives = 188/463 (40%), Gaps = 81/463 (17%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE--WFAQNTGFMY 116
TQ LDHF+ + +TFQQRY N+ + K P F+ G EG W + +
Sbjct: 64 TQTLDHFDSSVG--KTFQQRYYHNNQWY---KAGGPAFLMLGGEGPESSYWVSYPGLEIT 118
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
++A K A + IEHR+YG++ P +++ N YLSS QA+ D A+ I K
Sbjct: 119 NLAAKQGAWVFDIEHRFYGETHP---TSDMSVPNLK---YLSSAQAIEDAAAFI----KA 168
Query: 177 LTA-----TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+TA ++ V FGGSY G LAAW R K+P + A+ SS P+ V +
Sbjct: 169 MTAKFPQLANAKWVTFGGSYSGALAAWTRAKHPELVYAAVGSSGPV---QAEVDFKEYLE 225
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKNLAIESWL 287
++ S C + + + + G ++L+ AF +C+ +K+L W
Sbjct: 226 VVQNSITRNSTECAASVTQGFNLVASLLQTSDGRKQLKTAFHLCQDIQMDDKSLKY-FWE 284
Query: 288 STAFVYTAMTDY--------PTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAAS 339
+ Y + Y T + + + + + K ND F ++
Sbjct: 285 TVYSPYMEVVQYSGDAAGSFATQLTISHAICRYHINTKSTPLQKLKQVNDYFNQV----- 339
Query: 340 VYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY 399
SG C D+ D +G + KD F E++ D
Sbjct: 340 -----SGYFGCNDI----DYNGFISF---------------MKDETFGEAQYYIDE---- 371
Query: 400 CKEAYGVDPRPNWI------TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNIS 453
C YG + T ++ G + L R I+ NG DPW G L + +
Sbjct: 372 CTAIYGAAYNSQEVQTSVDYTNQYYGGRDNLNTDR----ILLPNGDIDPWHALGKLTSSN 427
Query: 454 KTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
++ +V AH D+ ++ D +L + R+R +++ W+
Sbjct: 428 SNIVPVVINGTAHCADMYGASSLDSMYLTNARQRISDVLDGWL 470
>gi|258566732|ref|XP_002584110.1| predicted protein [Uncinocarpus reesii 1704]
gi|237905556|gb|EEP79957.1| predicted protein [Uncinocarpus reesii 1704]
Length = 552
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/273 (27%), Positives = 133/273 (48%), Gaps = 33/273 (12%)
Query: 75 FQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA-----QNTGFMYDVAPKFKALLVFI 129
++ R+ IND+H+ K+ P+FV+ G E + + +A T F + +FK + +
Sbjct: 83 YKNRFWINDSHY---KSGGPVFVFDGGEANAQRYADYYLVNETSFFVHLLEEFKGMGIVW 139
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK------NLTATDSP 183
EHRYYG+S+P+ N + A YL++ QALAD K+ +LT +P
Sbjct: 140 EHRYYGESLPFPVNLDTP---AEHFQYLNNEQALADIPYFAERFKRENFPNDDLTPKSTP 196
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS--FSNIITQDFRSVS 241
V+ GGSY GM AA+ R KYP A A+SAP+ ++ Y + ++ ++
Sbjct: 197 WVMLGGSYPGMRAAFTRDKYPETIFAAFAASAPVQAQIDMSVYYEQVYRGLVAYGYK--- 253
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPT 301
NC K ++ ++K I+ + +++K F +E+N + + A ++ T
Sbjct: 254 -NCTKDVRAAYKYIDAQLRHRESAARIKKLFFGETAEQNNNGD--FTQALIWNWAT---- 306
Query: 302 PSNFLNPLPAFPVKEMCKAID-DPKTGNDVFAK 333
+ + P PV + C ++ DP+TG A+
Sbjct: 307 ---WQSAGPNGPVGQFCDWLETDPETGKTAPAE 336
>gi|341874575|gb|EGT30510.1| hypothetical protein CAEBREN_03046 [Caenorhabditis brenneri]
Length = 517
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 105/445 (23%), Positives = 180/445 (40%), Gaps = 45/445 (10%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE--WFAQNTGFMY 116
TQ +D N++ + T++QRY N + ++NN IF+ E W + T
Sbjct: 62 TQKVD--NFDASNAATYKQRYWYNSNY---TQNNNIIFLMIQGESPATDLWITKPTYQYL 116
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
A +F A + +EHR +G S PY + +Y + + TQALAD S I +
Sbjct: 117 QWAKEFGADVFQLEHRCFGNSRPY---PDTSYPSIKV---CTMTQALADIHSFIQQMNLQ 170
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
+ + FGGSY G L+A FR KYP +GA+ASSAP+ D + + ++ ++
Sbjct: 171 HNFRNPKWITFGGSYPGTLSALFRQKYPQDTVGAVASSAPL---DWTLDFFEYAMVVEDV 227
Query: 237 FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFR-----ICKSEKNLAIESWLSTAF 291
R S +C+K + ++ +++ + G++KL F + I+++ + +
Sbjct: 228 LRQTSTDCWKNVNQAFTNMQQLSLTVDGIQKLNTYFNLDPPFVAGKYTQHDIDNFFANVY 287
Query: 292 -VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYG------------AA 338
Y + Y L + +C ++D DV ++
Sbjct: 288 SFYQGVIQYTYDGRNAATLGGLNAQNLCAKMNDANVP-DVITRVNNTINWINQLNGDPVG 346
Query: 339 SVYYNYSGTAKC-----FDLNG--DSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEE 391
+YS K +D N D W W C E+ L D +IF+++
Sbjct: 347 PFQNSYSDMMKVLNDPKYDDNSAVPGDIAANRGWMWLCCNELGALQTTDQGRNIFQQTVP 406
Query: 392 DYDARARYCKEAYGVDPRPNWI--TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
C E +G D +I + + G A+N++ NG DPW G
Sbjct: 407 -MGYFIDMCTEMFGPDVGIKYIRDNNKQTLYNYGGADNYQATNVVLPNGAFDPWHVLGTY 465
Query: 450 KNISKTVLALVEKEGAHHVDLRFST 474
N + + + +GA H + T
Sbjct: 466 NNNTANHMTPLLIQGAAHCSDMYPT 490
>gi|302797683|ref|XP_002980602.1| hypothetical protein SELMODRAFT_113084 [Selaginella moellendorffii]
gi|300151608|gb|EFJ18253.1| hypothetical protein SELMODRAFT_113084 [Selaginella moellendorffii]
Length = 401
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 116/430 (26%), Positives = 171/430 (39%), Gaps = 68/430 (15%)
Query: 73 QTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHR 132
+ F QRY ++ PIF+ E Q T +A A +V IEHR
Sbjct: 3 RVFSQRYFEFLDYF--QPQQGPIFLALCGESTCRGGYQRTA--QALAKSLGAAVVTIEHR 58
Query: 133 YYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI------IDLKKNLTATDSPVVV 186
YYG+S P+ + +YKN YL++ QAL DYA I I+L+ N + P +V
Sbjct: 59 YYGQSYPF---QNFSYKNLK---YLTTQQALYDYALFIEYYQNLINLRYNKQGKN-PWIV 111
Query: 187 FGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYK 246
GGSY G L+AWFRLK+PH+ + + ASS + V YS + Q SV C K
Sbjct: 112 VGGSYAGALSAWFRLKFPHLVVASWASSGVV----EAVLEYSAYD--EQIGISVGPECKK 165
Query: 247 VIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL 306
++ K EE K + + + L++ + ++ FV D P
Sbjct: 166 ALQEITKLAEEGLVTNATAVKSVFFAQKLRDDDFLSLVADIAAGFVQYGAIDMLCP---- 221
Query: 307 NPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWG 366
P+ E + D + ++S +Y+ + + N S +S W
Sbjct: 222 ------PLLEAIQNKTDLLMAYARIGGVDSSSSDFYDAYKLRRQAEANDISAKDTMS-WN 274
Query: 367 WQACTEMIMLTGGDNKDSIFEESEEDYDARARY-------CKEAYGVDPRP-----NWIT 414
+Q CTE+ DSI + Y C +G + P NW
Sbjct: 275 YQICTELAYFQVAPTNDSIRSSRINLHVTSCCYVLDYIDICAVLFGPNTFPDVSAANW-- 332
Query: 415 TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLA-------LVEKEGAHH 467
+GG I +S IIF NG +DPW ++ SKT + L + AH
Sbjct: 333 -NYGGRDIA------SSRIIFLNGSQDPW------QHASKTTFSPGEPSFVLTCESCAHC 379
Query: 468 VDLRFSTKED 477
DL D
Sbjct: 380 EDLSMDCSSD 389
>gi|391873795|gb|EIT82803.1| hydrolytic enzymes of the alpha/beta hydrolase fold protein
[Aspergillus oryzae 3.042]
Length = 592
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 112/464 (24%), Positives = 184/464 (39%), Gaps = 76/464 (16%)
Query: 30 PSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGS 89
P S + S+++ + T+Y T +DH N S T+Q R+ +ND ++
Sbjct: 44 PRSLSKKNTVHSVLAKANTQIEKVTTEYITIPIDH---NDTSVGTYQNRFWVNDDYY--- 97
Query: 90 KNNAPIFVYTGNEGDIEWFAQN-----TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNK 144
K PI +Y E + E A+N F + A+ + EHRYYG S P+ ++
Sbjct: 98 KAGRPIIMYDAGETNAESIAKNHLTSSLSFFRKILEDTHAMGIIWEHRYYGNSTPFPISR 157
Query: 145 EIAYKNASTTGYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAW 198
+ ++ YL++ QAL D + + +LT + +P V+ GGSY G+ AA+
Sbjct: 158 DTPPEHFK---YLTTKQALEDIPYFARNFSRPKFAEHDLTPSSTPWVLVGGSYAGIRAAF 214
Query: 199 FRLKYPHVAIGALASSAPILNFDNIVSPYS--FSNIITQDFRSVSENCYKVIKGSWKQIE 256
R KYP V A +SSAP+ N+ Y + ++ F ENC K I + I+
Sbjct: 215 ARNKYPDVIFAAYSSSAPVQAQLNMSIYYDQVYRGLVGHGF----ENCAKDIHAALGYID 270
Query: 257 ETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKE 316
+ ++K F +++N + F T Y N+ P ++E
Sbjct: 271 QQLSNNHTAAAIKKLFFGPGADQN------SNEGFTAALATIYSYFQNYGLDGPEGTLRE 324
Query: 317 MCKAID-DPKT----GNDVFAKLYGAASVYYNYSG------------TAKCFDLNGDS-- 357
+C+ ++ DP T G D FA + G+ V ++ C L+ +
Sbjct: 325 LCEHLEVDPTTKEAAGPDGFAPVRGSKHVAERWAAWPAFTPLVNNFMETNCRGLSDPAKP 384
Query: 358 ---------DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD- 407
DP +S W WQ CTE + + + + C + +
Sbjct: 385 SCKLDMTYYDPDSIS-WSWQYCTEWGFYQSSNFGPHSLLSRYQTLEYQQEVCNNQFALAV 443
Query: 408 --------PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
P+ + E+GG I SN F G DPW
Sbjct: 444 ANGVLPSYPQTEALNKEYGGWNIR------PSNTFFTGGEFDPW 481
>gi|194899982|ref|XP_001979536.1| GG23262 [Drosophila erecta]
gi|190651239|gb|EDV48494.1| GG23262 [Drosophila erecta]
Length = 485
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 106/461 (22%), Positives = 192/461 (41%), Gaps = 52/461 (11%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
++ TQ LD+F+ + + T+Q R IN+ ++ + +PIF+Y G E I+ +G
Sbjct: 55 RWITQKLDNFDASNNA--TWQNRIYINNKYF---VDGSPIFIYLGGEWAIDPSGITSGLW 109
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
D+A + L++ EHR++G+SIP + Y S QALAD +++ LK+
Sbjct: 110 KDIAKQHNGSLLYTEHRFFGQSIPI-----TPLSTENLEKYQSVEQALADVINVLATLKQ 164
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL---NFDNIVSPYSFSNI 232
DS VVV G SY +A W R YP V G+ ASSAP+L NF + + +
Sbjct: 165 EDKYKDSKVVVSGCSYSATMATWIRKLYPDVIRGSWASSAPLLAKVNFKD------YMKV 218
Query: 233 ITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWL--- 287
+ + + + + CY +I + E + G + K +C + N + W
Sbjct: 219 VGESYSILGGQYCYDLIDNATSYYENLFEIGNGTQA-AKELNLCSNFNVNSEQDRWQIFS 277
Query: 288 STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL----YGAASVYYN 343
+ A ++ + Y P + P ++E DD F + A +
Sbjct: 278 TIANLFAGIAQYQKPEKYDIPKYCSILREFS---DDDSVALSKFINWKINEHSGACLSTT 334
Query: 344 YSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEES------EEDYDARA 397
+ G ++ + ++ W +Q C+E ++ F + E+ +
Sbjct: 335 FKGAVGYYEWSKENYEDSDLPWIFQTCSEFGWFQSSGSRSQPFGSTFPATLYEDTCEGVF 394
Query: 398 RYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS--GGGVLKNISKT 455
++ G+ +FGG + A+NI F G D WS G GV +
Sbjct: 395 GSKYDSAGIHANVRATNDDFGGLNVN------ATNIYFVQGALDGWSKVGAGVAQG---- 444
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
A + +H D + D L +++ ++++ +W+
Sbjct: 445 --ATIIPYASHCPDTGSISDTDSAELVASKKKLIKLVDQWL 483
>gi|238492811|ref|XP_002377642.1| serine peptidase, family S28, putative [Aspergillus flavus
NRRL3357]
gi|220696136|gb|EED52478.1| serine peptidase, family S28, putative [Aspergillus flavus
NRRL3357]
Length = 592
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 111/464 (23%), Positives = 184/464 (39%), Gaps = 76/464 (16%)
Query: 30 PSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGS 89
P S + S+++ + T+Y T +DH N S T+Q R+ +ND ++
Sbjct: 44 PRSLSKKNTVHSVLAKANTQIEKVTTEYITIPIDH---NDTSVGTYQNRFWVNDDYYEAG 100
Query: 90 KNNAPIFVYTGNEGDIEWFAQN-----TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNK 144
+ PI +Y E + E A+N F + A+ + EHRYYG S P+ ++
Sbjct: 101 R---PIIMYDAGETNAESIAKNHLTSSLSFFRKILEDTHAMGIIWEHRYYGNSTPFPISR 157
Query: 145 EIAYKNASTTGYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAW 198
+ ++ YL++ QAL D + + +LT + +P V+ GGSY G+ AA+
Sbjct: 158 DTPPEHFK---YLTTKQALEDIPYFARNFSRPKFAEHDLTPSSTPWVLVGGSYAGIRAAF 214
Query: 199 FRLKYPHVAIGALASSAPILNFDNIVSPYS--FSNIITQDFRSVSENCYKVIKGSWKQIE 256
R KYP V A +SSAP+ N+ Y + ++ F ENC K I + I+
Sbjct: 215 ARNKYPDVIFAAYSSSAPVQAQLNMSIYYDQVYRGLVGHGF----ENCAKDIHAALGYID 270
Query: 257 ETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKE 316
+ ++K F +++N + F T Y N+ P ++E
Sbjct: 271 QQLSNNHTAAAIKKLFFGPGADQN------SNEGFTAALATIYSYFQNYGLDGPEGTLRE 324
Query: 317 MCKAID-DPKT----GNDVFAKLYGAASVYYNYSG------------TAKCFDLNGDS-- 357
+C+ ++ DP T G D FA + G+ V ++ C L+ +
Sbjct: 325 LCEHLEVDPTTKEAAGPDGFAPVRGSKHVAERWAAWPAFTPLVNNFMETNCRGLSDPAKP 384
Query: 358 ---------DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD- 407
DP +S W WQ CTE + + + + C + +
Sbjct: 385 SCKLDMTYYDPDSIS-WSWQYCTEWGFYQSSNFGPHSLLSRYQTLEYQQEVCNNQFALAV 443
Query: 408 --------PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
P+ + E+GG I SN F G DPW
Sbjct: 444 ANGVLPSYPQTEALNKEYGGWNIR------PSNTFFTGGEFDPW 481
>gi|123479668|ref|XP_001322991.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121905847|gb|EAY10768.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 437
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 114/472 (24%), Positives = 200/472 (42%), Gaps = 56/472 (11%)
Query: 39 LSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVY 98
L SL+ S +++ Y K Q LDH N +++ Q Y + D G K I V
Sbjct: 5 LLSLVRSD-ENRTFYSFK---QTLDHENTGSETFD--QYYYEVTDHVVGQPK---AIIVK 55
Query: 99 TGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLS 158
G E D + + F +A ++ A+++ I+HR++GKSIP G +L+
Sbjct: 56 IGAESDKLVASGVSDFNAVLAKRYNAIVLTIQHRFFGKSIPQDG------LTVDKLKFLT 109
Query: 159 STQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL 218
QA+ DY + D +N + P +V GGSY G+L+A R KYP A++SS +
Sbjct: 110 VEQAVQDY-KVFHDYYQNEKKLNLPWLVVGGSYPGLLSALIRDKYPDDFKAAISSSGVLY 168
Query: 219 NFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSE 278
+N F QD S+ + C + + + QIE+ + P + F + +
Sbjct: 169 ATNN------FVEFDLQDAISMGQECAAIARQTRYQIEKLLEDPSDKAYVMNLFGVDTEK 222
Query: 279 KNLAIESWLS-TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL--Y 335
L +++ ++T Y S +PL ++ + G D + L Y
Sbjct: 223 YPLKDGEFMNFIGELFTLSLQYNNLSKVCSPL-----------VNARRLGYDTVSALATY 271
Query: 336 GAASVYYNYS-----GTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEE-- 388
Y N + TA ++ G ++ W W C ++ G + S+ E
Sbjct: 272 AKGWFYENQAKPQEYSTAHMRNITGPNNDQRC--WFWMTCNQLAYWQIGKGRLSLRGEKV 329
Query: 389 SEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV 448
++E ++ + CK+ + + P+ F G+ L R +I + +DPW+ V
Sbjct: 330 TKEVFEDQ---CKDVFDQEMHPD--VDAFNAKYSGIPLNR--DHIFYTTASQDPWTWTCV 382
Query: 449 LKNISKTVLALVEK----EGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+++ ++V E H DL +T DP+ L +R +E+ I W+
Sbjct: 383 TEDVKVNENSVVRTYAGPELGHCSDLDGATDNDPEDLVRIREQEILTIEHWL 434
>gi|110289391|gb|ABB47878.2| prolyl carboxypeptidase like protein, putative, expressed [Oryza
sativa Japonica Group]
gi|215706481|dbj|BAG93337.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 404
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 101/386 (26%), Positives = 172/386 (44%), Gaps = 45/386 (11%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+ ++ Q LDHF+ P ++ F+QRY D H GG P+F+ E +
Sbjct: 51 EERWMDQRLDHFS--PTDHRQFKQRYYEFADYHAGG----GPVFLRICGESSCNGIPND- 103
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL--- 169
++ ++ KF A +V EHRYYGKS P+ + + +N +LSS QAL D +
Sbjct: 104 -YLAVLSKKFGAAVVTPEHRYYGKSSPF---ESLTTENLR---FLSSKQALFDLVAFRQH 156
Query: 170 ---IIDLKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
I++ + N ++ D+P VFG SY G L+AWFRLK+PH+ G+LASS +L +
Sbjct: 157 YQEILNARYNRSSGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGVVL------A 210
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES 285
Y+F++ Q S C ++ + ++E + K+ K++ +
Sbjct: 211 VYNFTDFDKQVGDSAGPECKAALQEVTRLVDEQLRLDSRSVKVLFGAEKLKNDGDFLFFL 270
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD--PKTGNDVFAKLYGAASVYYN 343
+ A + Y +P +PL K+ +++ + + D F + +G Y+
Sbjct: 271 ADAAAIGF----QYGSPDAVCSPL--INAKKTGRSLVETYAQYVQDFFIRRWGTTVSSYD 324
Query: 344 YSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEA 403
+ + N D W +Q C+E+ DSI +E + C+
Sbjct: 325 -----QEYLKNTTPDDTSSRLWWFQVCSEVAYFQVAPKNDSI-RSTEINTGYHLDLCRNV 378
Query: 404 YGVDPRPNWITTE--FGGHKI-GLVL 426
+G P+ T +GG +I GL L
Sbjct: 379 FGEGVYPDVFMTNLYYGGTRIAGLSL 404
>gi|158298288|ref|XP_318471.4| AGAP004014-PA [Anopheles gambiae str. PEST]
gi|157014452|gb|EAA13580.4| AGAP004014-PA [Anopheles gambiae str. PEST]
Length = 469
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 120/489 (24%), Positives = 202/489 (41%), Gaps = 66/489 (13%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
I S + G+ + + +DHFN P++ F RY IND H + PIF+ G
Sbjct: 13 IGCSDVAHGMLRESWFETRVDHFN--PRNQDKFAMRYYINDEH---AYARGPIFIVVGAA 67
Query: 103 GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG--YLSST 160
I+ G YD+A A L E RY+G S P +NA T +L++
Sbjct: 68 EPIQTRWITEGLFYDIAYLEGAYLFANELRYFGYSRPV--------ENAETENLDFLNAD 119
Query: 161 QALADYASLIIDLKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILN 219
QALAD A I LK+ T ++ V++ G +YGG LA WFR KYPH+ G SS I
Sbjct: 120 QALADLAEWITYLKETYTYNPNAKVILMGTAYGGALATWFRQKYPHLVDGVWVSSGAI-- 177
Query: 220 FDNIVSPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK-- 276
+ + ++ + + R S+ CY I ++ + + G E L F +C+
Sbjct: 178 -EADFAFAGYNEALGESIRQYGSDACYSTIWSGFR-VAQNMAHLGLAELLSTEFHLCEPL 235
Query: 277 -SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-----DV 330
++ +L + AF+ D L + + ++EMC ++ + + D
Sbjct: 236 DTDNDLDVR-----AFLLGLRDDIEFEMLHLRNINS--IREMCAEMEQQRDSSLNALIDW 288
Query: 331 FAKLYG-AASVYYNYSGTAKCF---DLNGDSDPHGLSEWGWQACTEM-IMLTGGDNKDSI 385
FA+ + V+ N+ + + D N + G + + CTE T ++++
Sbjct: 289 FAREHQYEQCVHLNFDRYMERYLETDFNAANLQAGHRQRLYLQCTEEGFFPTTSESEEQP 348
Query: 386 FEESEEDYDARARYCKEAYGVDPRPNWIT------------TEFGGHKIGLVLKRFASNI 433
F D + C+ A+G W+T T FGG + + F
Sbjct: 349 FGNMVGP-DFFVKVCQRAFG-----EWLTEDVIFKQLKSTNTRFGGLQPAIERAHFT--- 399
Query: 434 IFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDL-RFSTKEDPQWLKDVRRREVEII 492
NG DP+ L++++ A + + DL + D + L + R +I
Sbjct: 400 ---NGGIDPYRVSSPLEDLNPKAPATLIPDTFVSPDLDSIDYENDSEELIAAKERTRTLI 456
Query: 493 GKWISQYFQ 501
WI + ++
Sbjct: 457 DTWIFEDYE 465
>gi|157119583|ref|XP_001659435.1| lysosomal pro-X carboxypeptidase, putative [Aedes aegypti]
gi|108875272|gb|EAT39497.1| AAEL008708-PA [Aedes aegypti]
Length = 467
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 117/477 (24%), Positives = 188/477 (39%), Gaps = 54/477 (11%)
Query: 49 SQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWF 108
+ G+ + + +DHFN P++ TF RY ND H S PIFV G+ G IE
Sbjct: 16 ANGMVREAWFETKVDHFN--PRNVDTFSMRYYSNDEH---SYPKGPIFVIVGSNGPIETR 70
Query: 109 AQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG--YLSSTQALADY 166
+ G YDVA A L EHRY+G S+P +AST +L+ QALAD
Sbjct: 71 YLSEGLFYDVAYLEGAFLFANEHRYFGHSLP--------VDDASTNNLDFLTIDQALADL 122
Query: 167 ASLIIDLKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI-LNFDNIV 224
A+ + +K + ++ V++ G YGG LA WF ++PH+ G SS + +FD
Sbjct: 123 AAFVHHIKHEVVRNPEAKVILMGYGYGGSLATWFHQQFPHLTNGVWVSSGTVEADFDLTG 182
Query: 225 SPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIE 284
S I + CY I ++ + + + L + F +C++ +
Sbjct: 183 YMESLGETIGE---FGGRGCYGTIFSGFR-VAQNLIAMDRADVLNEQFNLCEALDTDDVM 238
Query: 285 SWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNY 344
STAF+ ++ +MC ID+ + + + L + +
Sbjct: 239 D--STAFLLGLQR--AIEDEIMHLRNTQSTTDMCGIIDNEE--DTIENSLLALGNWFAEE 292
Query: 345 SGTAKCFDLN-------------GDSDPH-GLSEWGWQACTEMIMLTGGDNKDSIFEESE 390
C DL+ DSD G + + CT D+ F + +
Sbjct: 293 HQFETCVDLSFEAFMAPYMDTDFDDSDLQAGHRQRLYLQCTGTGFFATSDSFYQPFGD-Q 351
Query: 391 EDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFA------SNIIFFNGLRDPWS 444
D D C+ A+G +WI + ++ RF N F +G DP
Sbjct: 352 IDSDFYVEVCRHAFG-----DWINEDLIRAQVFRTNVRFGGKQPEIDNAHFTHGDIDPMM 406
Query: 445 GGGVLKNISKTVLALVEKEGAHHVDLR-FSTKEDPQWLKDVRRREVEIIGKWISQYF 500
G+++++++ A V H DL D L + +I WI + F
Sbjct: 407 VTGIVEDLNEEAEATVIPNTFHAPDLESIDYVYDSPELIAAKEHTRNLIDLWIFEDF 463
>gi|303322745|ref|XP_003071364.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240111066|gb|EER29219.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320032884|gb|EFW14834.1| extracelular serine carboxypeptidase [Coccidioides posadasii str.
Silveira]
Length = 543
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 133/563 (23%), Positives = 221/563 (39%), Gaps = 108/563 (19%)
Query: 11 CLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFN---- 66
CLL ++ + A P P TP + + D LY + + +DHF+
Sbjct: 8 CLLLWASAVHARA---PVIPIGEFTPRVKAPSALAGDDLTSLYPSHTISIPIDHFHTDDR 64
Query: 67 YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW---FAQNTGFMYDVAPKFK 123
Y P S TF+ RY + +H+ K+ P+ V G E D E F Q G + +A
Sbjct: 65 YAPHSNGTFELRYWFDASHY---KDGGPVIVLHGGETDGEGRLPFLQK-GILGQLAQATN 120
Query: 124 ALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL------KKNL 177
+ V +EHRYYG SIP ++ + KN +L++ QA+AD A ++ K+L
Sbjct: 121 GVGVVLEHRYYGTSIP---TEDFSTKNLR---FLTTEQAMADSAYFAKNVVFEGLEDKDL 174
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSPYSFSNIIT 234
TA ++P +++GGSY G A+ R++YP + GA++SS I ++ P
Sbjct: 175 TAPNTPYILYGGSYAGAQVAFLRVEYPDIFWGAISSSGVTKAIWHYWQYYEPTR------ 228
Query: 235 QDFRSVSENCYKVIKGSWKQIEETA---KKPGGLEKLQKAFRICKSEKN--------LAI 283
+ ++C K + ++ A K P ++L++ F + + N I
Sbjct: 229 ---KHAPQHCVKQTQTFVDLVDNIALRGKDPKVTQQLKEFFGLGELTHNDDFANVLAFGI 285
Query: 284 ESWLSTAF-------VYTAMTDYPTPSNFLNPLPAFPVKEMC----KAIDDPKTGNDVFA 332
W T + + D T L P+ + KE+ KA + G +
Sbjct: 286 SGWQGTNWDPDISRPTFAKYCDTITTDRLLQPV-SEEQKELATSLIKAGKYGRHGKKLKN 344
Query: 333 KLYGAASVYYNYSGTAKCFDLN-------GDSDPHGLSE---------WGWQACTEMIML 376
+L A+ + N + A C G DP W +Q CTE L
Sbjct: 345 RLLNYAA-WVNTTYAAPCLRRGSTLDTCFGTHDPEFYKRDDTTQEWRLWYYQVCTEWGFL 403
Query: 377 TGGDNKDSIFE---ESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNI 433
G + D + C+EAYG+ G + V K A +I
Sbjct: 404 QTGSGVPKFIKPLVSRMIDLEYTTIMCREAYGL----------HGEADVSRVNKWGAFDI 453
Query: 434 IF-----FNGLRDPWSGGGVLKNISKTVLALVEK------EGAHHVDLR--FSTKEDPQW 480
+ +G DPW ++ + V+K + HH D F + P +
Sbjct: 454 EYPRLAIVDGEADPWVEATPHATFARPRRSTVDKPFILIPDAVHHWDENGVFPNQTTPDF 513
Query: 481 ----LKDVRRREVEIIGKWISQY 499
+K V+ +E+ + KW+ +
Sbjct: 514 PPDRIKKVQAQEIAFVKKWLKDW 536
>gi|317157591|ref|XP_001825996.2| serine peptidase, family S28 [Aspergillus oryzae RIB40]
Length = 580
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 111/464 (23%), Positives = 184/464 (39%), Gaps = 76/464 (16%)
Query: 30 PSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGS 89
P S + S+++ + T+Y T +DH N S T+Q R+ +ND ++
Sbjct: 44 PRSLSKKNTVHSVLAKANTQIEKVTTEYITIPIDH---NDTSVGTYQNRFWVNDDYYEAG 100
Query: 90 KNNAPIFVYTGNEGDIEWFAQN-----TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNK 144
+ PI +Y E + E A+N F + A+ + EHRYYG S P+ ++
Sbjct: 101 R---PIIMYDAGETNAESIAKNHLTSSLSFFRKILEDTHAMGIIWEHRYYGNSTPFPISR 157
Query: 145 EIAYKNASTTGYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAW 198
+ ++ YL++ QAL D + + +LT + +P V+ GGSY G+ AA+
Sbjct: 158 DTPPEHFK---YLTTKQALEDIPYFARNFSRPKFAEHDLTPSSTPWVLVGGSYAGIRAAF 214
Query: 199 FRLKYPHVAIGALASSAPILNFDNIVSPYS--FSNIITQDFRSVSENCYKVIKGSWKQIE 256
R KYP V A +SSAP+ N+ Y + ++ F ENC K I + I+
Sbjct: 215 ARNKYPDVIFAAYSSSAPVQAQLNMSIYYDQVYRGLVGHGF----ENCAKDIHAALGYID 270
Query: 257 ETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKE 316
+ ++K F +++N + F T Y N+ P ++E
Sbjct: 271 QQLSNNHTAAAIKKLFFGPGADQN------SNEGFTAALATIYSYFQNYGLDGPEGTLRE 324
Query: 317 MCKAID-DPKT----GNDVFAKLYGAASVYYNYSG------------TAKCFDLNGDS-- 357
+C+ ++ DP T G D FA + G+ V ++ C L+ +
Sbjct: 325 LCEHLEVDPTTKEAAGPDGFAPVRGSKHVAERWAAWPAFTPLVNNFMETNCRGLSDPAKP 384
Query: 358 ---------DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD- 407
DP +S W WQ CTE + + + + C + +
Sbjct: 385 SCKLDMTYYDPDSIS-WSWQYCTEWGFYQSSNFGPHSLLSRYQTLEYQQEVCNNQFALAV 443
Query: 408 --------PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
P+ + E+GG I SN F G DPW
Sbjct: 444 ANGVLPSYPQTEALNKEYGGWNIR------PSNTFFTGGEFDPW 481
>gi|268535774|ref|XP_002633022.1| Hypothetical protein CBG21794 [Caenorhabditis briggsae]
Length = 1277
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 118/503 (23%), Positives = 190/503 (37%), Gaps = 50/503 (9%)
Query: 28 TFPSSRITPEKLSS------LISSSKDSQGLYKTKYHT----QILDHFNYNPQSYQTFQQ 77
TFP S+ + L + G+Y + T Q DHFN Q+ FQQ
Sbjct: 725 TFPESKFKKKMFLGRPPHGFLPEPDYNKDGVYPPGFETGTFYQRQDHFN--NQNPVHFQQ 782
Query: 78 RYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKS 137
++ N + W + + W A K+ A + +EHR+YG S
Sbjct: 783 KFYKN-SQWAQPGGPNFLMIGGEGPEGPGWVLNEQLTWIQYAKKYGATVYILEHRFYGDS 841
Query: 138 IPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAA 197
+I N S L S Q L D A I + N A +P + FGGSY G L+A
Sbjct: 842 -------KIDINN-SNFYLLHSLQMLYDLAEFIKAVNINSPAP-APWITFGGSYSGALSA 892
Query: 198 WFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEE 257
W R +P + IGA+ASS P+ + Y + ++ + R+ + C I+ + +
Sbjct: 893 WMREVFPELVIGAVASSGPVFAKTDF---YEYLMVVEKSIRTYDKTCADRIQSGFNTMRT 949
Query: 258 TAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV-----YTAMTDYPTPSNFLNPLPAF 312
G + L F++ + ++ F + Y + N +
Sbjct: 950 MFLTKEGRQNLSDLFQLKPAFGENVTDTDQHYFFSNIYGNFQGAVQY-SGDNAGAYANGY 1008
Query: 313 PVKEMCKAI-DDPKTGNDVFAKLYGAASVYYN----YSGTAKCF----------DLNGDS 357
+ +MCK + +D D + +V+YN YSG + + G
Sbjct: 1009 GIADMCKIMTNDSNIPLDNIVQFNEFMTVFYNDGDAYSGLDNNYQDYIDYMQNAQMFGPE 1068
Query: 358 DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEF 417
GL W WQ C E D+ + IF S + + C + + + N I T
Sbjct: 1069 YGAGLL-WTWQTCNEFGYFQSADSGNGIF-GSPTPVNMYVQMCMDIFNAYEQRNTIDTAI 1126
Query: 418 G--GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTK 475
G + G +N++F NG DPW G+ K+V++ + AH D+ +
Sbjct: 1127 GYTNYVYGERFHYRGTNVVFPNGNVDPWHALGLYYPTDKSVVSYLINGTAHCADMYPARD 1186
Query: 476 EDPQWLKDVRRREVEIIGKWISQ 498
D LK R I W+ +
Sbjct: 1187 ADLPGLKVARDLIDTNIAIWLKK 1209
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 108/471 (22%), Positives = 175/471 (37%), Gaps = 51/471 (10%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW--FAQN 111
T Y Q LDHFN +S +TF Q Y H+ + A F+Y GD E +
Sbjct: 228 NTGYMIQTLDHFN--SRSNETFVQTYYYTQ-HFALHQRTA--FLYVSVSGDFETTVISDE 282
Query: 112 TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLII 171
+ A +F A L +EHRYYG+S P + ++ +L+S QA+ D + I
Sbjct: 283 NNPVVKSARQFGATLFSLEHRYYGQSKP-----NVEKFDSFNLRFLNSFQAIQDIVAFIK 337
Query: 172 DLKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
K D V++G YGG++AA R P V G +ASS P + Y F
Sbjct: 338 YANKQFNLDPDVRWVLWGAGYGGVIAAEARKWNPDVVAGVIASSTP------LTHEYDFW 391
Query: 231 NIITQDFRSVSEN----CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESW 286
++S+ CY+ + + I + + P G + F++ +
Sbjct: 392 QFNDHVQMAISQEGGQLCYQKVAQGFADIRQAMRTPEGRSNISDLFQLNPRLDQTNLNYN 451
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLY---------G 336
F + + F N + +C ID T +V + Y G
Sbjct: 452 DVQMFYLAIIAPFQEMIQFNNDF-NIDIGALCTTIDQSTWTPMEVVWQAYVYFSNTVIGG 510
Query: 337 AASVYYNYSGTAKCFDLNGDSDPHGLSE---WGWQACTEM-IMLTGGDNKDSIFEESEED 392
+ +Y DL S + W +Q CTE T +N+ +F
Sbjct: 511 LQPLVTSYQAVID--DLGNQSVNAANIDNRMWQYQMCTEFGWFYTTNNNEQGLFGAVV-- 566
Query: 393 YDARARYCKEAYGVDPRPNWITTEFGGHKI------GLVLKRFASNIIFFNGLRDPWSGG 446
+ + + + P N T I G +N++F NG DPWS
Sbjct: 567 --PTSIFLNMCFDLFPGANLDATVIRDLTIDYNNLYGSAFDYSGTNVVFTNGWYDPWSRL 624
Query: 447 GVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
G +V+A + +G+ D+ F + ++ R + I W++
Sbjct: 625 GKESTGDFSVVAYIIPQGSWASDM-FPGDTNNTFIIQAHRLMADNINTWVN 674
>gi|158291878|ref|XP_001688441.1| AGAP003640-PA [Anopheles gambiae str. PEST]
gi|157017499|gb|EDO64163.1| AGAP003640-PA [Anopheles gambiae str. PEST]
Length = 505
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 121/513 (23%), Positives = 206/513 (40%), Gaps = 74/513 (14%)
Query: 23 AKIFPTFPSSRITPEKLSSLISSSKDSQGLYK---------TKYHTQILDHFNYNPQSYQ 73
AK P+ SR+ E L + ++ + + K T ++HF+ PQ+
Sbjct: 19 AKTVPSGKRSRLPMELLGGMFGANFQAPKVPKDYVPNPRIVEDRFTSRINHFD--PQNRD 76
Query: 74 TFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRY 133
TF+ +L ND ++ + P+F+ G F + D+A A L EHRY
Sbjct: 77 TFEFNFLWNDEYY---RPGGPLFIVVGGHHRTNPFFIDETHFKDIAALQGAFLATNEHRY 133
Query: 134 YGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTAT-DSPVVVFGGSYG 192
+G S+P ++++ N +L + Q L D I L++ + ++ V++ G SYG
Sbjct: 134 FGTSVP---TEDLSSDNLR---FLRTEQTLFDLIEWIDFLRREVMRDPNAKVILHGFSYG 187
Query: 193 GMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNIITQDFRSVSENCYKVIK 249
G LA+W R ++P++ GA SSA + +NF+ F N I S+ CY I
Sbjct: 188 GALASWARQRFPNIIDGAWVSSATVRATVNFEEFTE--DFGNTIRI---KGSDECYNSIF 242
Query: 250 GSWKQIEETAKKPGGLEKLQKAFRIC---KSEKNLAIESWLS--TAFVYTAMTDYPTPSN 304
++ E G + + F C +E +L +E +L T + +M D N
Sbjct: 243 RAF-HTAENLLDAGRTDIVSSMFNTCDPIDAENSLQVELFLHLMTLSLELSMFDDFNIEN 301
Query: 305 FLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLN--------GD 356
V+ +C + D + + A AA + YS CFDL+ GD
Sbjct: 302 ---------VQRVCNVLTDEQYETPMEAL---AAYLKDRYSEIRDCFDLSFENFISILGD 349
Query: 357 SD-------PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR 409
GL + + CTE ++D F S+ YD C +G
Sbjct: 350 ESVDAPQNAEFGLRQLNYHICTEFGFFQTAKSRDQPF-GSKVTYDLFLAECSAVFG---- 404
Query: 410 PNWITTEFGGHKIGLVLKRFA------SNIIFFNGLRDPWSGGGVLKNISKTVLALVEKE 463
W+T E + L F +N+++ NG DP+ + + + A V
Sbjct: 405 -EWLTQEVLYDGVRLTNFHFGATDPRITNVLYTNGGIDPFRHVSITEYTNLLANARVTPA 463
Query: 464 GAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ D+R + D + + + + E I W+
Sbjct: 464 AFYTEDIRAISGMDSEEMLETKHMAEEYITTWL 496
>gi|392868441|gb|EAS34227.2| extracelular serine carboxypeptidase [Coccidioides immitis RS]
Length = 541
Score = 99.0 bits (245), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 133/563 (23%), Positives = 221/563 (39%), Gaps = 108/563 (19%)
Query: 11 CLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFN---- 66
CLL ++ + A P P TP + + D LY + + +DHF+
Sbjct: 8 CLLLWASAVHARA---PVIPIGEFTPRVKAPSALAGDDLTSLYPSHTISIPIDHFHTDDR 64
Query: 67 YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW---FAQNTGFMYDVAPKFK 123
Y P S TF+ RY + +H+ K+ P+ V G E D E F Q G + +A
Sbjct: 65 YAPHSNGTFELRYWFDASHY---KDGGPVIVLHGGETDGEGRLPFLQK-GILGQLAQATN 120
Query: 124 ALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL------KKNL 177
+ V +EHRYYG SIP ++ + KN +L++ QA+AD A ++ K+L
Sbjct: 121 GVGVVLEHRYYGTSIP---TEDFSTKNLR---FLTTEQAMADSAYFAKNVVFEGLEDKDL 174
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSPYSFSNIIT 234
TA ++P +++GGSY G A+ R++YP + GA++SS I ++ P
Sbjct: 175 TAPNTPYILYGGSYAGAQVAFLRVEYPDIFWGAISSSGVTKAIWHYWQYYEPTR------ 228
Query: 235 QDFRSVSENCYKVIKGSWKQIEETA---KKPGGLEKLQKAFRICKSEKN--------LAI 283
+ ++C K + ++ A K P ++L++ F + + N I
Sbjct: 229 ---KHAPQHCVKQTQTFVDLVDNIALRGKDPKVTQQLKEFFGLGELTHNDDFANVLAFGI 285
Query: 284 ESWLSTAF-------VYTAMTDYPTPSNFLNPLPAFPVKEMC----KAIDDPKTGNDVFA 332
W T + + D T L P+ + KE+ KA + G +
Sbjct: 286 SGWQGTNWDPDISRPTFAKYCDTITTDRLLQPV-SEEQKELATSLIKAGKYGRHGKKLKN 344
Query: 333 KLYGAASVYYNYSGTAKCFDLN-------GDSDPHGLSE---------WGWQACTEMIML 376
+L A+ + N + A C G DP W +Q CTE L
Sbjct: 345 RLLNYAA-WVNTTYAAPCLRRGSTLDTCFGTHDPEFYKRDDTTQEWRLWYYQVCTEWGFL 403
Query: 377 TGGDNKDSIFE---ESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNI 433
G + D + C+EAYG+ G + V K A +I
Sbjct: 404 QTGSGVPKFIKPLVSRMIDLEYTTIMCREAYGL----------HGEADVSRVNKWGAFDI 453
Query: 434 IF-----FNGLRDPWSGGGVLKNISKTVLALVEK------EGAHHVDLR--FSTKEDPQW 480
+ +G DPW ++ + V+K + HH D F + P +
Sbjct: 454 EYPRLAIVDGEADPWVEATPHATFARPRRSTVDKPFILIPDAVHHWDENGVFPNQTTPDF 513
Query: 481 ----LKDVRRREVEIIGKWISQY 499
+K V+ +E+ + KW+ +
Sbjct: 514 PPDRIKKVQAQEIAFVKKWLKDW 536
>gi|344268156|ref|XP_003405928.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 521
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 103/404 (25%), Positives = 175/404 (43%), Gaps = 54/404 (13%)
Query: 90 KNNAPIFVYT-GNE-GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIA 147
K P+F+ G+E I+W ++ ++ A + AL + +EHR+YG S P
Sbjct: 51 KPGGPVFLMIEGHEPASIQWLKRSFTWI-TYAQRLGALCILLEHRFYGDSQPI------- 102
Query: 148 YKNASTTG---YLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYP 204
+N ST YLSS QA+AD A + +++ T++ VVFGG YGG LA W R+K+P
Sbjct: 103 -RNMSTEHLRRYLSSRQAVADIAEFRTVIAQSMNFTENKWVVFGGGYGGALAVWSRIKHP 161
Query: 205 HVAIGALASSAPI---LNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKK 261
++ A++SSA I +NF+ + +I + + + C + +K ++ +
Sbjct: 162 NLFAAAVSSSAMIQAKVNFN------EYFEVIYRTVDTHNSECLEAVKQAYGFVMAMLLL 215
Query: 262 PGGLEKLQKAFRICKSEKNLAIESWLSTAFV-------YTAMTDYPTPSN-FLNPLPAFP 313
P +L +++C+ K I+S + FV + A+ N + +
Sbjct: 216 PDYHSRLIFDYKLCEPFK---IKSEMDQLFVIEKLMLIFAAIVQNNKKKNTTVGIMRKMS 272
Query: 314 VKEMCKAIDDPKTGN--DVFAKLYGAA----------SVYYNYSGTAKCFDLNGDSDPHG 361
+ E C+ + + G+ +A++ + Y Y F + G
Sbjct: 273 IDEFCETMTNTSLGSPYHRYARIMNTMLRNKGYPCYPASYKQYVEEYSDFSFETNKYRRG 332
Query: 362 LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHK 421
+W +Q C E D K+S F Y + C + +G P+ N + G
Sbjct: 333 -RQWLYQCCNEFGWFYTTDLKNSSFTGLPTRYFVKK--CSDVFG--PKFNNDSVFQGVMS 387
Query: 422 IGLV---LKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
+ L S IIF NG DPW G+ K+IS + A+V K
Sbjct: 388 TNMYYGGLNVTGSKIIFSNGSNDPWHRLGITKDISANLRAVVIK 431
>gi|194899968|ref|XP_001979529.1| GG23317 [Drosophila erecta]
gi|190651232|gb|EDV48487.1| GG23317 [Drosophila erecta]
Length = 486
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 118/471 (25%), Positives = 201/471 (42%), Gaps = 55/471 (11%)
Query: 25 IFPTFPSSRITPEKLSS---LISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLI 81
TF R E L+ L SK+ + + Q +DHF+ + +T++ RY
Sbjct: 17 FLATFNPYRRNAELLNHEPVLQIRSKNELAAVEELWLDQKVDHFD--EHNNKTWRMRYYS 74
Query: 82 NDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYG 141
N ++ K PI+++ G E I +TG +D+A + +L + EHRYYG S+P+
Sbjct: 75 NAKYF---KPQGPIYIFVGGEWTISPGLLSTGLTHDMAVENSGMLFYTEHRYYGLSLPF- 130
Query: 142 GNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL-TATDSPVVVFGGSYGGMLAAWFR 200
GN+ N LS Q+LAD A I + N DS V++ GGSY G L AW
Sbjct: 131 GNESYRLNNLKQ---LSLHQSLADLAHFIRHQQSNTPEMEDSKVILVGGSYSGSLVAWMT 187
Query: 201 LKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR-SVSENCYKVIKGSWKQIEETA 259
YP + + ASSAP+L + + + ++ + R S NC I+ K ++
Sbjct: 188 QLYPDLIAASWASSAPLLAKADF---FEYMEMVDKSIRLSYGHNCSLRIE---KGLKFLV 241
Query: 260 KKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK 319
K G +++Q+ + ++ L A + + +Y + A+ + +C+
Sbjct: 242 KLFDG-DEIQELLYNLNGCEGYRPKNPLDRAAFFNGLGNYFAL--VVQSYSAY-IPRLCE 297
Query: 320 AIDDPKTGNDV----FAKLY------GAASVYYNYSGTAKCF--DLNGDSDPHGLSEWGW 367
+ +G+++ F KL + + YS + F DL+ S+ W +
Sbjct: 298 TLMSLDSGDELAFIEFLKLLYSEGRRSSECQDFGYSSMLELFSEDLDQSSETRA---WFY 354
Query: 368 QACTEMIMLTGGDNKDSIFE--ESEEDYDARARYCKEAYGVDPRPNWIT-------TEFG 418
Q C E T +K S + ++ + C++A+G + + + ++FG
Sbjct: 355 QTCNEFGWYTTTKSKSSASQAFANQVPLGYFEQLCQDAFGAEQTAHQLAQGVEQTNSKFG 414
Query: 419 GHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVD 469
G G + +IF +G DPWS G K VL G HV+
Sbjct: 415 G--CGFNQSERYAQVIFTHGALDPWSALGQQKGDQAIVLT-----GYSHVE 458
>gi|157119587|ref|XP_001659437.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108875274|gb|EAT39499.1| AAEL008702-PA [Aedes aegypti]
Length = 499
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 102/424 (24%), Positives = 168/424 (39%), Gaps = 61/424 (14%)
Query: 45 SSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD 104
+S+ + L++T+ +DHFN PQ TFQ Y ND ++ + PIF+ G
Sbjct: 50 NSRTTGHLFRTR-----VDHFN--PQKRDTFQFEYFSNDQYY---RPGGPIFIVVGGNFP 99
Query: 105 IEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALA 164
+ + G +D+A A + EHR+YG S+P ++++ +N YL+ Q +
Sbjct: 100 VSPYFLEHGHFHDIAFYENAWMFTNEHRFYGNSMP---TEDLSVENLR---YLTVEQTMV 153
Query: 165 DYASLIIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNF 220
D A I L++N+ ++ V++ G Y G +A W R +YPH+ GA SS I NF
Sbjct: 154 DLAEWIFHLRQNVVRDQNARVILLGTGYSGAIATWMRQRYPHLVEGAWVSSGQIEARFNF 213
Query: 221 DNIVSPYSFSNIITQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEK 279
++ + + R N CY I +++ E G + + F C+
Sbjct: 214 K------EYAEEVGELIRDYGSNECYSQIWRAFRTAENLIDAGLG-STVSELFNTCEPIV 266
Query: 280 NLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAAS 339
I F + T L+ E+C+ +++ D L A+
Sbjct: 267 TDDITMLDVETFFWHVKT--ALQRGVLDEQDTDTTNELCERLNNSTEATD----LQTIAN 320
Query: 340 VYYNYSGTAKC--FDLNGDSDPH-------------GLSEWGWQACTEMIMLTGGDNKDS 384
+ + C FD + D H GL + +Q CTE D+ D
Sbjct: 321 WVHEFYDFLDCMPFDFDAAIDAHQYVDPKVPENAVYGLRQRTYQLCTEFGWFLTADSHDQ 380
Query: 385 IFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFA------SNIIFFNG 438
F Y C+ YG +W+ ++ + L F SN+ F NG
Sbjct: 381 PFGYRVSMY-FFLNVCRSVYG-----DWLNSQVVYDGVHLTNMHFGGQNPRISNVFFTNG 434
Query: 439 LRDP 442
DP
Sbjct: 435 GLDP 438
>gi|157816660|gb|ABV82323.1| IP19978p [Drosophila melanogaster]
Length = 198
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/103 (49%), Positives = 67/103 (65%), Gaps = 3/103 (2%)
Query: 42 LISSSKDSQGL-YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTG 100
LI+ SQ Y+ K LDHF++ + TF RYL ND+ S PIF YTG
Sbjct: 22 LIAGCDCSQRFKYEIKEFQVPLDHFSFLINA--TFNIRYLYNDSFVDKSNARTPIFFYTG 79
Query: 101 NEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGN 143
NEGDIE FAQNTGF+++ A + +AL++F EHRYYGKS+P+G +
Sbjct: 80 NEGDIELFAQNTGFLWEQAERQRALVIFAEHRYYGKSLPFGSS 122
>gi|195109600|ref|XP_001999371.1| GI24473 [Drosophila mojavensis]
gi|193915965|gb|EDW14832.1| GI24473 [Drosophila mojavensis]
Length = 483
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 110/469 (23%), Positives = 189/469 (40%), Gaps = 67/469 (14%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG 113
+T++ Q LD+F+ +S + QR +IN+ ++ + +PIF+ G E I+ + +G
Sbjct: 57 ETRWFNQSLDNFDDTNKS--VWSQRVMINEENF---VDGSPIFLLLGGEWTIDPNSITSG 111
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
D+A + LV+ EHR++G SIP I + Y QALAD ++I L
Sbjct: 112 LWVDIAKEHNGSLVYTEHRFFGGSIP------ILPLSTENLKYHGVEQALADVVNVIKVL 165
Query: 174 KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
K+ +S VVV G SY +A W +L YP V +G ASSA + + V F ++
Sbjct: 166 KEEDKYKNSKVVVSGCSYSASMAVWLKLLYPDVIVGGWASSAVL---EAKVDFSDFMEVV 222
Query: 234 TQDFRSVSEN-CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLS--- 288
+ +R + + CY +I + E + G K +K +C S ++N + W
Sbjct: 223 GRAYRQLGGDYCYNLINNATSYYEHLFQTGQG-AKAKKLLNLCDSFDENNERDQWQIFSL 281
Query: 289 TAFVYTAMTDYPTPSNF--------LNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV 340
A ++ + Y P N+ L L + K + Y
Sbjct: 282 IANIFAGIAQYQKPENYDLARSCSVLRNLDIDDASALSKFVQYSLRQQGCHNARYQETVD 341
Query: 341 YY-----NYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDA 395
YY NY+G W +Q C + NK+ F S
Sbjct: 342 YYKWVKNNYNGNLHL-------------SWFYQTCRQFGWFQSSANKNHPF-GSTFPATL 387
Query: 396 RARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV 448
C++ +G ++ ++GG + N+ +G D WS G
Sbjct: 388 YTDMCRDVFGSQYTSAKIEEYIQATNKKYGGRNPAV------ENVYMTHGGLDGWSAVG- 440
Query: 449 LKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ S T++ +G+H D D L+ + R +E++ +W++
Sbjct: 441 --SDSATIIP----QGSHCFDSGSINPTDSPALRAAKERVIELVREWLA 483
>gi|123457340|ref|XP_001316398.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121899103|gb|EAY04175.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 440
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 100/450 (22%), Positives = 184/450 (40%), Gaps = 52/450 (11%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q++DH N TF+Q+++IN+ +GG ++PI + E D + F +A
Sbjct: 28 QLIDH---NHSETGTFKQKFVINN-QYGGP--DSPIILEISGESDGYYVGGVGDFEETLA 81
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
+F +V ++HR+YG+S P+ +E +N YLS QA+ D + + KK A
Sbjct: 82 KEFNCTVVTLQHRFYGESYPF---EESTTENLQ---YLSVEQAVEDISYFVDYYKKTYKA 135
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRS 239
+ +++GGSY G+L+A+ + K+ GA++SS +L + F++ Q S
Sbjct: 136 DKNKWLLYGGSYPGLLSAYTKSKFDSKFAGAISSSGVVL------AQKEFTDFDKQIEIS 189
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL---AIESWLSTAFVYTAM 296
+ C + + + I+ + G + + F E ++ + S A Y
Sbjct: 190 LGHQCAAACRTARRHIDTLLETEEGTQYVLNLFNANGVEPDIFRFVVGELFSIAPQYGHR 249
Query: 297 TDYPTP---SNFLNPLPAFPVKE-MCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFD 352
P S P + E G A Y AS+
Sbjct: 250 EALCGPMEGSLITGKDPMLVLAEFNNNFFIPNFIGKSTIANEYSTASL------------ 297
Query: 353 LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV--DPRP 410
D+ W WQ C+++ G K S+ + A+ C + +G+ +P
Sbjct: 298 --KDTKNKAARSWLWQTCSQLGWWQVGAGKTSL-RSPLLTTETFAKQCNDVFGLTDEPDT 354
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV----LKNISKTVLALVEKEGAH 466
+ ++GG L + A+NI++ G +DPW+ + + N + + H
Sbjct: 355 DAFNAKWGG------LDQTATNIVYLTGSQDPWTPVCITDEKVPNENAAAHTMTGPNVGH 408
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
D + DP +K R+ + ++ KW+
Sbjct: 409 CTDYHLPSNNDPADVKRTRQMVISLVKKWL 438
>gi|170045815|ref|XP_001850489.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868717|gb|EDS32100.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 490
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 112/464 (24%), Positives = 190/464 (40%), Gaps = 37/464 (7%)
Query: 40 SSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVY 98
S+L SSS + + + T ++ N NPQ + + RY + + + G PI ++
Sbjct: 17 SALPSSSDGHKRIVVENFFTTRVN--NLNPQRNERWTMRYFSVTEFYEAG----GPILIW 70
Query: 99 TGNEGDI-EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYL 157
G I E+ + +YD+A + + E R+YG+ N+ + L
Sbjct: 71 LGGNAPIQEYMIDESSLLYDLARQMNGAIFAFESRFYGQ------NRATEDVTVESLYLL 124
Query: 158 SSTQALADYASLIIDLKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA- 215
S+ Q + D A + L++N+ D+PV+V G YGG LA WFR+ YPH+A A +S
Sbjct: 125 STYQIMGDLAEFVTYLRRNVVHDEDAPVLVSGAGYGGALATWFRVHYPHLADAAWSSGGT 184
Query: 216 --PILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEK-LQKAF 272
+++F + + I + CY I ++ ++ GLE+ L + F
Sbjct: 185 HKAVMSFSEYAEAWGQTLI-----NYGGQACYNEIFVAFHVMQYLIDM--GLEEILFEKF 237
Query: 273 RICKS---EKNLAIESWLSTAF--VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD-PKT 326
+C + NL ++ + V + ++F N A M A+D
Sbjct: 238 NLCTEINLQDNLEVQHFFRMMMKAVQEFTLENDNLTDFTNFCDALMTPNMPTALDSFANW 297
Query: 327 GNDVFAKLYGAASVYYNYS-GTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSI 385
N V+AK +V + K D S G +W +Q C E D+
Sbjct: 298 FNTVYAKDEICTTVDLQSTIQNLKQSDWTSPSVQSGQRQWLYQKCAEFGWFPTTDSPYQP 357
Query: 386 FEESEEDYDARARYCKEAYGVDPRPNWIT--TEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
F S + C+E +G +P+ IT + H+ G L F +G DPW
Sbjct: 358 F-GSRVQTELYTEMCREVFGDFFQPDSITRAVQRFNHRFG-ELTPDVRRAHFTSGAEDPW 415
Query: 444 SGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRR 487
G+ + +S A V + R ++ED + L+ +RR
Sbjct: 416 RLVGIRETLSTRATADVISGELSGAETRTISEEDSEVLQAAKRR 459
>gi|195062790|ref|XP_001996254.1| GH22390 [Drosophila grimshawi]
gi|193899749|gb|EDV98615.1| GH22390 [Drosophila grimshawi]
Length = 497
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 73/213 (34%), Positives = 110/213 (51%), Gaps = 14/213 (6%)
Query: 11 CLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISS---SKDSQGLYKTKYHTQILDHFNY 67
C + L + A+ F TF + R T E L SS SK + + +Q LDHF+
Sbjct: 3 CFYLVTILVSTFAEQF-TFNAYRYTLELLLQEPSSGSYSKSDAAPVQELWLSQKLDHFD- 60
Query: 68 NPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLV 127
+ +T+Q RYL ND + K PI+++ G E I +TG +D+A + +L
Sbjct: 61 -ELNNKTWQMRYLRNDKY---HKPQGPIYIFVGGEWTITPGLLSTGLTHDMAVENAGILF 116
Query: 128 FIEHRYYGKSIPY-GGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN-LTATDSPVV 185
+ EHRYYG+S+P+ + ++ +N +L+ QALAD A I K + T S V+
Sbjct: 117 YTEHRYYGQSLPHNSSHNSMSLENLK---HLNLHQALADLACFIRYQKSHSANLTHSKVI 173
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL 218
+ GGSY G + AW YP + + ASSAP+L
Sbjct: 174 LIGGSYSGSMVAWMTQLYPELVTASWASSAPLL 206
>gi|308162690|gb|EFO65071.1| Thymus-specific serine protease precursor [Giardia lamblia P15]
Length = 522
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 92/339 (27%), Positives = 144/339 (42%), Gaps = 41/339 (12%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
+ Q +DHF+ P + + + QRY NDT++ K P+F+ G EG + F
Sbjct: 59 FREQHVDHFD--PMNTKKWSQRYYYNDTYY---KAGGPVFLMIGGEGPVTPKYVEDYFSI 113
Query: 117 D-VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
D A L V +EHR+YG S P + +++ L S QALAD A+ + LK+
Sbjct: 114 DYFAKNMNGLKVALEHRFYGASFPSTDSADLSL--------LRSDQALADIATFLAYLKR 165
Query: 176 NLTATDSP-VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
+S +V GGSY G LAAW R+++P + A++SS P L + P +I +
Sbjct: 166 EYNLPESTKIVAVGGSYSGNLAAWARIQFPFIISAAISSSGPYLAQTDY--PEYLQHIDS 223
Query: 235 QDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSE--KNLAIE--SWLSTA 290
Q + + C VI + K E L F++ + N + S++S
Sbjct: 224 QIRKYGGDRCMDVISAAHKDAEYLLSHDKA--TLAAIFKLREESIYNNTGYDKASFMSAM 281
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA-----KLYGAASVYYNYS 345
+ + Y + +K+MCKAI+ G D K Y + Y Y
Sbjct: 282 GAPSGVVQYAKHDGYYTTTKDGDIKQMCKAIEAYYDGYDAGESYRQLKAYSLWLLDY-YD 340
Query: 346 GTAKCFDLNGDSDPHGLSE------------WGWQACTE 372
G+ + DL+ D + + W WQ C E
Sbjct: 341 GSMEEIDLSFDGYIKAIQDTSIDSEFAVDRSWLWQTCVE 379
>gi|258569361|ref|XP_002543484.1| predicted protein [Uncinocarpus reesii 1704]
gi|237903754|gb|EEP78155.1| predicted protein [Uncinocarpus reesii 1704]
Length = 546
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 112/447 (25%), Positives = 180/447 (40%), Gaps = 72/447 (16%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQ----- 110
+Y T +DH NP+ T++ RY +ND ++ + P+ ++ G EGD + A
Sbjct: 71 EYATIPIDH--NNPR--YTYRNRYWVNDAYY---RPGGPVIIFDGGEGDAQGLANYYLED 123
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
T ++ + +F + + EHRYYG+S PY N AS YLS+ QAL D
Sbjct: 124 QTSYIVQLLQEFGGVGLVWEHRYYGQSNPYPVNDNTP---ASQLQYLSNEQALNDLPYFA 180
Query: 171 IDLKK-----NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
++ +LT +P V+ GGSY GM AA+ RLK+P AL+SSAP+ +
Sbjct: 181 RTFRRRSISYDLTPRSTPWVMIGGSYPGMRAAFSRLKHPDTIFAALSSSAPVQ------A 234
Query: 226 PYSFSNIITQDFRSVSE----NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL 281
FS Q +R + NC + ++ +++ I+ + +++ F +E+N
Sbjct: 235 RIDFSAYYEQVYRGLIAYGYGNCTRDMQAAYQYIDSQLAQQNTATYIKQLFLGPGAERNT 294
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID-DPKTGNDV----FAKLYG 336
+ V+ Y P V + C ++ DP+TG +A G
Sbjct: 295 HGVFTQALLAVWVTWQTYG---------PTGEVAQFCNWMETDPRTGRTAPAEGWAPTRG 345
Query: 337 AASVYYNYS-----------------GTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGG 379
+V ++ G C +DP +S W WQ C++
Sbjct: 346 VRAVVERFAAWPNFRSRVNAAFGSNCGKGDCDLKLTATDPAAIS-WAWQFCSQWGYFQTR 404
Query: 380 DNKDSIFEESEEDYDARAR-YCKEAYGVD-----PRPNWITTEFGGHKIGLVLKRFASNI 433
+ I +DY R Y + GV RP + + G + SN+
Sbjct: 405 NPSGIISIYQTDDYFQRELCYSQFPDGVSSGHLPARPG--VDQANNYTSGWYTR--PSNV 460
Query: 434 IFFNGLRDPWSGGGVLKNISKTVLALV 460
F G DPW+ L S A V
Sbjct: 461 FFTGGEFDPWNSLSTLSTESYAPRARV 487
>gi|17566516|ref|NP_507841.1| Protein PCP-4 [Caenorhabditis elegans]
gi|11064521|emb|CAC14390.1| Protein PCP-4 [Caenorhabditis elegans]
Length = 1042
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 127/506 (25%), Positives = 203/506 (40%), Gaps = 49/506 (9%)
Query: 13 LFSSTLTISNAKIFPT-FP-SSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQ 70
L + LT+S I PT FP +T + S S + T Y +Q LDHF+ + Q
Sbjct: 5 LLLAYLTMSAFSIIPTHFPFKEHLTKQPASPATSVT--------TGYLSQKLDHFSNDSQ 56
Query: 71 SYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMYDVAPKFKALLVF 128
+ F Q+Y T N F+Y EG +I + A +F A L
Sbjct: 57 VF--FTQQYFY--TERLSVSNQKVAFLYVNTEGNEEIAVMTDERSPVVKAAKRFGAQLFA 112
Query: 129 IEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI--IDLKKNLTATDSPVVV 186
++HRYYG S P N + AS YL+S QA+ D S I + + N+ D V+
Sbjct: 113 LKHRYYGASKPNFQNFD-----ASALRYLTSRQAIQDILSFIKYANTQFNMNP-DVRWVL 166
Query: 187 FGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV-SENCY 245
+G YGG+LAA R P GA++SSAP+ + + F++ + + NCY
Sbjct: 167 WGTGYGGILAAEARKTDPVAVSGAISSSAPLRRLYDF---WQFNDFVGNTLMQIGGSNCY 223
Query: 246 KVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTD-YPTPSN 304
++ + I + K G ++ F++ + + YTA+ +
Sbjct: 224 GRVQQGFADIRQAMKTTAGRSQISDLFQLNPRLDQTQL-GYNDIQMFYTAIIGPFQEIVQ 282
Query: 305 FLNPLPAFPVKEMCKAIDDPK-TGNDVFAKLY--------GAASVYYNYSGTAKCFDLNG 355
F N + +MC I + T +V + Y G+ S DL
Sbjct: 283 FNNDF-NISITDMCTIIANSSWTNMEVVRQAYVYLSTTLTGSVQPMTIASYQKVVNDLKN 341
Query: 356 DSDPHGLSE---WGWQACTEMIML-TGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPN 411
DS E W +Q CTE+ T +N+ +F + Y + + P +
Sbjct: 342 DSVSSPFVENRMWTYQICTELGWFPTTNNNEQGLFGAVV----PTSIYINQCSDIFPDAS 397
Query: 412 WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLR 471
T + +N++F NG DPWS G + +V+A V GA ++
Sbjct: 398 LTATSIRDSIVSSDSVYTGTNVVFTNGFYDPWSVLGQETSRDFSVVAYV-IPGASYLSDF 456
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWIS 497
F D Q+++ +E I W++
Sbjct: 457 FPGDSDNQYIQKAHDLMIENINIWVN 482
Score = 88.6 bits (218), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 120/501 (23%), Positives = 205/501 (40%), Gaps = 56/501 (11%)
Query: 23 AKIFPTFPSSRITPEKLSSLISSSKDS---QGLYKTKYHTQILDHF-NYNPQSYQTFQQR 78
++ P PS I + L++SS + QG ++ Q DHF N N FQQ+
Sbjct: 517 SRTLPRTPSRGIFHRRHLELLASSYPAGFEQGTFR-----QRQDHFDNLN---VDFFQQK 568
Query: 79 YLINDTHWGGSKNNAPIFVYTG-NEGDIEWFAQNTGFMYDV-APKFKALLVFIEHRYYGK 136
+ N + W ++ P F+ G E + E + N + + A K+ A + +EHR+YG
Sbjct: 569 FYKN-SQW--ARPGGPNFLMIGGQEAEGESWVLNEKLPWLISAQKYGATVYLLEHRFYGD 625
Query: 137 SIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLA 196
S+ N + LSS Q L D A I + T + +P + FG S+ L+
Sbjct: 626 SL---------VGNNTNLNLLSSLQVLYDSAEFIKAINYK-TQSSTPWITFGRSFP--LS 673
Query: 197 AWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIE 256
AW R +P + GA++SS IL + + + ++ R +C IK + +I
Sbjct: 674 AWTRAIFPDLVTGAVSSSGAILAKTDF---FEYLMVMETSIRKYDNSCADRIKSGFDEIR 730
Query: 257 ETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAM-TDYPTPSNFL--NPLP--- 310
G + L K F++ E+ F ++ + +++ F N P
Sbjct: 731 GLFLTSEGRQDLSKIFQLLPGFSENVTET--DQHFFFSNLYSNFQLAVQFSGDNSGPWAD 788
Query: 311 AFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYN----YSG-----TAKCFDLNGDSD-PH 360
+ + EMC+ + T D + + N Y+G TA ++L D
Sbjct: 789 GYGIPEMCRFMTGAGTPLDNIVAFNAYMTSFNNGGGTYTGMGNNYTAMIYNLKNSKDYGE 848
Query: 361 GLSE---WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITT-- 415
G+ W WQ CTE D+ +F S + C + +G + I +
Sbjct: 849 GVDPTLLWTWQTCTEYGGFQSADSGSGLFG-SPVPVSFLIQMCMDLFGNTYDRSKIDSLI 907
Query: 416 EFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTK 475
+F +K G SN++F NG DP+ G+ + +V++ + +H D+ +
Sbjct: 908 DFTNYKYGGRDNFKGSNVVFINGNIDPYHVLGLFNSPDSSVVSYLIDGSSHCADMFPARD 967
Query: 476 EDPQWLKDVRRREVEIIGKWI 496
D LK R + IG W+
Sbjct: 968 SDVPGLKVARDLVDQNIGVWL 988
>gi|357628484|gb|EHJ77798.1| hypothetical protein KGM_21289 [Danaus plexippus]
Length = 499
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 105/446 (23%), Positives = 177/446 (39%), Gaps = 38/446 (8%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
P +S SS+ ++G + L+HF+ + + TFQ RY N G I
Sbjct: 23 PPPEASARSSTNITEGWLPVR-----LNHFDAS--NTDTFQMRYYYNSQFSRGPY----I 71
Query: 96 FVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
++ G E I +G Y++A + A L + EHRYYG + P G +
Sbjct: 72 VIFVGGEWSISPGWVRSGLAYELAERIGAGLFYTEHRYYGLTRPTNGTTVAEMR------ 125
Query: 156 YLSSTQALADYASLIIDLKKNLTA----TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGAL 211
YLS QAL D A I ++ + ++ V +FG SY G +A W +L YPH+ +L
Sbjct: 126 YLSVDQALGDLAQFIEYVRSDDFEGGRFRNARVALFGCSYAGSMATWMKLGYPHLVRTSL 185
Query: 212 ASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA 271
+ S P+ + P I T S+ C I+ + K+I E + GL+ +
Sbjct: 186 SDSGPLHAQQDF--PEYLEVIATALRVQGSQQCVDDIESAMKRINELIETEAGLDTVSTL 243
Query: 272 FRIC----KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDP--K 325
F C +S +L+ W + + Y TP + +P K + DP +
Sbjct: 244 FNTCSRLRRSHLDLSTFFWYGITETFAYLVQYATPGD----IPRACDHITNKTLGDPIER 299
Query: 326 TGNDVFAKLYGAASVYYNY-SGTAKCFDLNGDSDPHGLSEWGWQACTEM-IMLTGGDNKD 383
+ V ++ Y + Y A + + DS + W +Q CTE T ++
Sbjct: 300 LSSWVTSQPYTQPCIESRYFEKVASHTNTSYDSPDATMRLWTYQTCTEYGWYQTTTSSRQ 359
Query: 384 SIFEESEEDYDARARYCKEAYGVDPRPNWITTEF-GGHKIGLVLKRFASNIIFFNGLRDP 442
+Y + CK+ + N + + +++ L+ ++ G DP
Sbjct: 360 PFLNTVPLEY--FHQMCKDFFNDSIDENLLRSAIVRTNRLFAGLEHLPDGVLSVGGGHDP 417
Query: 443 WSGGGVLKNISKTVLALVEKEGAHHV 468
WS G K + + +G H
Sbjct: 418 WSPVGPNKTHETHLAPVYVVDGVSHC 443
>gi|344299008|ref|XP_003421180.1| PREDICTED: thymus-specific serine protease, partial [Loxodonta
africana]
Length = 471
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 92/410 (22%), Positives = 172/410 (41%), Gaps = 71/410 (17%)
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL-TATDSPV 184
+V +EHR+YG SIP G + +LSS ALAD AS + L + ++ SP
Sbjct: 12 VVGLEHRFYGLSIPVRGLDMAQLR------FLSSRHALADVASAHLALSRLFNVSSSSPW 65
Query: 185 VVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---ILNFDNIVSPYSFSNIITQDFRSV- 240
+ FGGSY G LAAW RLK+PH+ ++ASSAP +L+F ++N++++ +
Sbjct: 66 ICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDFSE------YNNVVSRSLMNTA 119
Query: 241 ---SENCYKVIKGSWKQIEETAKKPGGLE-----KLQKAFRICKSEKNL----AIESWLS 288
S C+ ++ + E + G + +L R+ ++E A+++ +
Sbjct: 120 IGGSPECWSAASAAFAETERRLRAGGEAQAALRAELGACGRLSRAEDQAELLEALQALVG 179
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI----DDPKTGNDVFAKLYGAASVYYNY 344
Y P V+++C + D +T + L AA + +
Sbjct: 180 GTVQYNGQAGAP-----------LSVRQLCGLLVGGADRGRTAP--YRGLRRAAQIVMHS 226
Query: 345 SGTAKCFDLN--------GDSDPHGLS----EWGWQACTEMIMLTGGDNKDSIFEESEED 392
G +C + D++P G +W +Q CTE ++ F +
Sbjct: 227 LGQ-RCLSTSRAETVAQLKDTEPQGSGVGDRQWLYQTCTEFGFYITCEDPRCPFSQLPA- 284
Query: 393 YDARARYCKEAYGVDPRP-----NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
++ C++ +G+ + +GG A+ ++F NG DPW
Sbjct: 285 LPSQLGLCEQVFGLSASSVAQAIAQTNSYYGGQTP------RATQVLFVNGDADPWHVLS 338
Query: 448 VLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
V +++ A++ +H +D+ D L+ R++ + + W++
Sbjct: 339 VTQSLGPFESAVLIPNASHCLDMATERPSDSPSLRLARQKIFQQLQTWLT 388
>gi|310795241|gb|EFQ30702.1| serine carboxypeptidase S28 [Glomerella graminicola M1.001]
Length = 488
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 132/494 (26%), Positives = 186/494 (37%), Gaps = 95/494 (19%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE---GDIEWFAQNTGFMY 116
Q LDH N S TF+QRY N HWGG P+F+ G E G + + +N
Sbjct: 30 QFLDHTN---TSKGTFKQRYWWNAEHWGGP--GFPVFMVNGGETNAGRLTGYLENGTLAS 84
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD------YASLI 170
A K ++ IEHRYYG+S P+ A T YL QA+ D A L
Sbjct: 85 LYAETHKGAIILIEHRYYGESWPF------KTSTADTLQYLDVPQAIRDNIHFAQTADLP 138
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSP- 226
D K A SP V+ GGSY G LAAW + P ASSA I +F +P
Sbjct: 139 FDTNKGANANTSPWVLVGGSYAGALAAWTSVVAPGTFAAYHASSAVVQAIEDFWQFFTPI 198
Query: 227 -YSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEK----------LQKAFRIC 275
+ + D + V + +V+ Q + K+ GL+ LQK +
Sbjct: 199 EQAMPQNCSTDIKLVIQQVDRVLDNGTAQEVDAMKQKFGLDTLADHADFAWYLQKPIIMW 258
Query: 276 KSEKNLAIE--SWLSTAFVYTAMTDYPTPSNF-LNPLPAFPVKEMCKAIDDPKTGNDVFA 332
+ + +E W+ T+ + + S L M K ++ + +
Sbjct: 259 AYDPSKVLEFCDWIETSTNDGEVIEGREESGVGLEAAWTGYTSWMHKRYNE-QCREEESC 317
Query: 333 KLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMI--MLTGGDNKDSI-FEES 389
LYG A Y N D W WQ C E + TG + + S
Sbjct: 318 NLYGNAVGY------------NTPDDLDWSRSWVWQLCNEPLGWWHTGPSSSNGTGLVSS 365
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHK----IGLVLKRFA----------SNIIF 435
+ R R C + + GHK G+ + F N++F
Sbjct: 366 YVHLEHRKRQCDLRF----------PQTSGHKPAISKGITVSMFNEWTGGWNASYENVLF 415
Query: 436 FNGLRDPWSG---------GGVLKNISKTVLALVEKEGAHHV-DLRFSTKEDPQWLKDVR 485
+G DPW GG ++ K +V EGA HV D R S V
Sbjct: 416 IDGEFDPWKSATMSSDYRPGGPSESTEKAPRLVV--EGATHVPDFRLSKLN-----AKVV 468
Query: 486 RREVEIIGKWISQY 499
++EVEI+GKWI +
Sbjct: 469 QQEVEIMGKWIEAW 482
>gi|328865129|gb|EGG13515.1| hypothetical protein DFA_11276 [Dictyostelium fasciculatum]
Length = 576
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 118/460 (25%), Positives = 189/460 (41%), Gaps = 54/460 (11%)
Query: 43 ISSSKDSQGLYKTKYHT--QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTG 100
I S D+ + KT ++T Q +DH N + +TF QRY IN + + + G
Sbjct: 21 IYGSIDTHPISKTGFYTFNQRVDH---NGVNVKTFPQRYCINKSFVHKGAAPKSVMLVLG 77
Query: 101 NEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSST 160
EG I+ N VA ++++ +E RYYG+SIP + + YL++
Sbjct: 78 GEGPIDPEITNHIPFIGVANNTNSIIIALEIRYYGESIP------VPNMSTDNMQYLTTD 131
Query: 161 QALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
Q L D A + +V G SY G L+AW+R+KYP++A A+ASSAPI
Sbjct: 132 QILDDIAYFQTQFTNLYGLHNCKWIVMGCSYAGSLSAWYRMKYPNLAAAAIASSAPI--- 188
Query: 221 DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN 280
+VS + + + + ++ C K K +E+ + +++ F +
Sbjct: 189 RAVVSFHDYDRKVRE---ALGLQCTKQFKQILNHVEQQLRVNN--TSIKRKFTCDAKIDD 243
Query: 281 LAIESWLSTAFVYTAM--TDYPTPSNFLNPL--PAFPVKEMCKAIDDPKTGNDVFA---- 332
LS A Y+ + + SN PL + ++ D T +F
Sbjct: 244 KMFLFMLSEAISYSVQYNSRFKIISNICPPLIQSGSNIVKLLDIFADYITNMFLFKNGSC 303
Query: 333 ---KLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEM-IMLTGGDNKDSIFEE 388
LY AS +YSGT +W +Q C+E LT D+ S+
Sbjct: 304 NEYNLYSFASTKVDYSGT---------------RQWTYQLCSEYGWFLTASDSDLSLKSG 348
Query: 389 SEEDYDARARYCKEAYGVDPRP--NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
+ CK +G +P I E+G + ++ +N++F NG DPWS
Sbjct: 349 QINEQWWENEVCKIMFGSSMKPFVEKINLEYGIDNMKML-----TNVLFTNGGYDPWSSL 403
Query: 447 GVLKNISKTVLALVEKEGAHHVDLRFS-TKEDPQWLKDVR 485
V + ++ G H +S T ED Q LK+ R
Sbjct: 404 SVQSQCDTPLSNIISIPGESHCANWYSETPEDSQQLKNSR 443
>gi|157133204|ref|XP_001662799.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108870921|gb|EAT35146.1| AAEL012664-PA [Aedes aegypti]
Length = 493
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 128/529 (24%), Positives = 211/529 (39%), Gaps = 80/529 (15%)
Query: 6 IFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHF 65
+ LS + F++ SN F S + P + S S G Y T++
Sbjct: 7 VALSMVVAFAAAAIESNPARPVLFGSQKTAPPRGPP--SDSIVDNGNY-TEWRVFDQRQS 63
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
+ N S F RY+ N + + PIF++ G ++E G D+A + A
Sbjct: 64 HSNAHSVDMFPMRYVSNSKFY---RPGGPIFLFVGGPWELEQHFVEQGHFVDLAEENNAF 120
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK-KNLTATDSPV 184
+V E RYYG+S+P + + L QA D A LI+ ++ + L ++ V
Sbjct: 121 VVANEMRYYGESLP------VPNASRGNLRLLHIVQACTDIARLIVHIRYEVLRDPNARV 174
Query: 185 VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN- 243
+V G + G LA W RL+YPH+ G AS A + +N F+ + + R N
Sbjct: 175 IVAGVGFSGSLAHWTRLRYPHLIHGVWASGAMLQANENY---REFAEEVGEYIRRYGGND 231
Query: 244 CYKVIKGSWK--QIEETAKKPGGLEKLQKAFRIC---KSEKNLAIESWLSTAFVYTAMTD 298
CY + W+ + E G + + K F++C L +E AF Y +
Sbjct: 232 CYGAL---WRGFRTAENLIDAGQSQTVDKLFKVCTPINGTNPLDVE-----AFFYGIFNE 283
Query: 299 YPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDS- 357
SN L P ++ MC D T D + L G AS A+C ++ +S
Sbjct: 284 --VVSNTLRPNLRQNIRNMC----DTLTHEDHDSSLTGLASWITGQFPEAECLAMDLESI 337
Query: 358 -----------DPH--GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY 404
D H G +W +Q CTE+ D+ + F + + C+ +
Sbjct: 338 VQLFQETDWQHDVHKSGERQWFYQRCTELGWPLTADSMNQPF-GVRISSNLFQQLCQRVF 396
Query: 405 GVDPRPNWITTE------------FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNI 452
W+T++ +GG++ + RF + + +G DPW GV +
Sbjct: 397 D-----GWLTSDVFRSLVRQTNTLYGGNRPEM---RF---VFYTHGSLDPWRFTGVTTVL 445
Query: 453 SKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEI---IGKWISQ 498
V + H DL ++ D W D+RR + E+ I +W+ +
Sbjct: 446 YNNNYVNVIRGAIHGEDL--ASISDLDW-ADLRRSKEEVGETIRRWLER 491
>gi|355752907|gb|EHH56953.1| hypothetical protein EGM_06474, partial [Macaca fascicularis]
Length = 330
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 82/173 (47%), Gaps = 15/173 (8%)
Query: 334 LYGAASVYYNYSGTAKCFDL----NGDSDPHGLS------EWGWQACTEMIMLTGGDNKD 383
L A + YN SG+ C+D+ +DP G W +QACTE+ + +N
Sbjct: 144 LRALAGLVYNASGSEHCYDIYRLYRSCADPTGCGTGPDARAWDYQACTEINLTFASNNVT 203
Query: 384 SIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
+F + + R +YC + +GV PRP+W+ T F G + R SNIIF NG DPW
Sbjct: 204 DMFPDLPFTDERRQQYCLDTWGVWPRPDWLLTSFWGGDL-----RATSNIIFSNGNLDPW 258
Query: 444 SGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+GGGV + R S EDP + + R+ E +IG+W+
Sbjct: 259 AGGGVSPGLGPXXXXXXXXXXXXXXLPRASHPEDPVSVVEARKLEATVIGEWV 311
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 27/65 (41%), Positives = 40/65 (61%)
Query: 190 SYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIK 249
SYGGML+A+ R+KYPH+ GALA+SAP+L + F +T DF S C + ++
Sbjct: 1 SYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTADFEGQSPKCTQGVR 60
Query: 250 GSWKQ 254
++ Q
Sbjct: 61 EAFGQ 65
>gi|195569669|ref|XP_002102831.1| GD19291 [Drosophila simulans]
gi|194198758|gb|EDX12334.1| GD19291 [Drosophila simulans]
Length = 487
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 118/486 (24%), Positives = 200/486 (41%), Gaps = 58/486 (11%)
Query: 12 LLFSSTLTISNAKIFPTFPSSRITPEKLSS---LISSSKDSQGLYKTKYHTQILDHFNYN 68
L FS L TF R E L+ L SK+ + + Q +DHF+ N
Sbjct: 4 LNFSVQLIFCLPSFLATFNPYRRNVELLNHEPVLGICSKNELASVEELWLDQKVDHFDKN 63
Query: 69 PQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVF 128
+T++ RY N H+ K PI+++ G E I +TG +D+A + +L +
Sbjct: 64 NN--RTWKMRYYRNAKHF---KPQGPIYIFVGGEWTISPGLLSTGLTHDMAVENSGMLFY 118
Query: 129 IEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN-LTATDSPVVVF 187
EHRYYG S+P+G + + LS Q+LAD A I K N DS V++
Sbjct: 119 TEHRYYGLSLPFGHER----YQLNNLKQLSLHQSLADLAHFIRHQKSNGPEMEDSKVILV 174
Query: 188 GGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR-SVSENCYK 246
GGSY G L W YP + + ASSAP+L + + + ++ + + S +NC
Sbjct: 175 GGSYSGSLVTWMTQLYPDLIAASWASSAPLLAKADF---FEYMEVVGKSIQLSYGKNCSL 231
Query: 247 VIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL 306
I+ +K + AK G +++Q+ + + ++ L A + + +Y +
Sbjct: 232 RIEKGFKFL---AKLFDG-DEIQELLYNLNGCEGYSPKNPLDRAAFFNGLGNYFAL--VV 285
Query: 307 NPLPAFPVKEMCKAIDDPKTGNDV----FAKLYGAASVYYNYSGTAKCFDLNGDSDPHGL 362
+ + +C+ + + +++ F KL +Y ++ C D S
Sbjct: 286 QSYRSAYIPRLCETLMSLDSSDELAFIEFLKL-----LYSEGRRSSDCQDFGYSSMLELF 340
Query: 363 SE----------WGWQACTEMIMLTGGDNKDSIFE--ESEEDYDARARYCKEAYGVDPRP 410
+E W +Q C E T +K S + ++ + C++A+G +
Sbjct: 341 TEDSVQSSETRAWFYQTCNEFGWYTTTKSKSSASQAFANQVPLGYFEQLCQDAFGAEQTA 400
Query: 411 NWIT-------TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKE 463
+ ++F G G + +IF +G DPWS G K VL
Sbjct: 401 QQLAQGVEQTNSKFDG--FGFNQSERYAQVIFTHGELDPWSALGQQKGDQAIVLT----- 453
Query: 464 GAHHVD 469
G HV+
Sbjct: 454 GYSHVE 459
>gi|194744687|ref|XP_001954824.1| GF16548 [Drosophila ananassae]
gi|190627861|gb|EDV43385.1| GF16548 [Drosophila ananassae]
Length = 390
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 76/264 (28%), Positives = 122/264 (46%), Gaps = 16/264 (6%)
Query: 14 FSSTLTISNAKIFPTFPSSRITP-EKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSY 72
F L I PT +S P K++ L ++ ++ Q LD+F+ + +
Sbjct: 4 FFEALAILAVLSAPTVGASFKEPMPKVNRLPKEPMITRATVHERWINQKLDNFDEDNNA- 62
Query: 73 QTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHR 132
T+ R IN+ + + +PIF+Y G E + ++G D+A + +V EHR
Sbjct: 63 -TWSNRIFINEQDF---VDGSPIFIYLGGESEQLPSRISSGLWVDIAKQHNGTIVATEHR 118
Query: 133 YYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYG 192
+YGKS P Y + Y S QALAD ++I LK+ DS VV+ G SY
Sbjct: 119 FYGKSTPI-----TPYSTENLEKYQSINQALADVINVIQTLKEEDKYKDSKVVIHGCSYS 173
Query: 193 GMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV-SENCYKVIKGS 251
+AAW R YP + +G+ ASSAP++ V + +I + FR + + CY +I +
Sbjct: 174 ATMAAWIRKLYPDIIVGSWASSAPLV---AKVEFKEYFKVIGESFRILGGQYCYDLIDNA 230
Query: 252 WKQIEETAKKPGGLEKLQKAFRIC 275
E G ++ +K +C
Sbjct: 231 TNYYENLFANGKG-DQAKKELNLC 253
>gi|119487142|ref|XP_001262426.1| serine peptidase, family S28, putative [Neosartorya fischeri NRRL
181]
gi|119410583|gb|EAW20529.1| serine peptidase, family S28, putative [Neosartorya fischeri NRRL
181]
Length = 550
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 114/463 (24%), Positives = 188/463 (40%), Gaps = 73/463 (15%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
PE S+I+ S DS +Y ++DH + S T++ R+ +N+ + + +PI
Sbjct: 41 PEAFRSIIADSVDSTS--AAEYTEMLIDHED---PSVGTYRNRFWVNEDFY---VSGSPI 92
Query: 96 FVYTGNEGDIEW----FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
VY E E+ ++ ++ + +F A+ + EHRYYG S+P+ ++++ ++
Sbjct: 93 MVYDIGEATAEYSVSLLTNSSSWLSLLLQEFHAMGIVWEHRYYGDSLPFPVSQDMPVEHL 152
Query: 152 STTGYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYPH 205
YL++ QALAD + + +LT +P V+ GGSY G+ AA+ R KYP
Sbjct: 153 K---YLTTEQALADIPYFAANFSRPNHPDIDLTPRGTPWVMIGGSYPGIRAAFTRNKYPD 209
Query: 206 VAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS----ENCYKVIKGSWKQIEETAKK 261
A ASSAP+ N+ Y Q +R++ NC K I+ + K I++
Sbjct: 210 TIFAAYASSAPVQAQLNMSVYYE------QIYRAMVANGYSNCTKDIQAALKYIDDQLSN 263
Query: 262 PGGLEKLQKAFRICKSEKNLAIESWLSTAFV--------YTAMTDYPTPSNFLNPLPAFP 313
+++ F +EKN + +TA V + + + +F N L P
Sbjct: 264 KETSASIKRLFLGRDAEKNSNAD--FTTALVALYGPFQAHGLWSGNQSLHDFCNYLELDP 321
Query: 314 VKEMCKAIDD--PKTGNDVFA--------------KLYGAASVYYNYSGTAKCFDLNGDS 357
+ P G+ A ++YG N S C D + +
Sbjct: 322 ATNQSAGPEGLAPIHGSKYVAERWASFPYFISLVNRMYGTNCNGLNASEPLSC-DFSQTN 380
Query: 358 DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-------VDPRP 410
L W WQ CTE + S + + + C + + PRP
Sbjct: 381 TIPELISWTWQYCTEWGFFQSNNFGSHALLSSYQTLEYQQELCNRQFPNAVQAGILPPRP 440
Query: 411 NW--ITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN 451
+ EFGG I SN+ F G DPW VL +
Sbjct: 441 RTESLNEEFGGWTIR------PSNVYFSGGQFDPWRPLSVLSD 477
>gi|159117921|ref|XP_001709180.1| Thymus-specific serine protease precursor [Giardia lamblia ATCC
50803]
gi|157437295|gb|EDO81506.1| Thymus-specific serine protease precursor [Giardia lamblia ATCC
50803]
Length = 522
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 90/348 (25%), Positives = 146/348 (41%), Gaps = 59/348 (16%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG-----------DI 105
+ Q +DHF+ + + + QRY NDT++ K P+F+ G EG I
Sbjct: 59 FREQHVDHFD--STNTKKWSQRYYYNDTYY---KAGGPVFLMIGGEGPATPRDVGDYFSI 113
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
++FA+N L V +EHR+YG S P N++ L S QALAD
Sbjct: 114 DYFAKN----------MNGLKVALEHRFYGASFPS--------TNSANLSLLRSDQALAD 155
Query: 166 YASLIIDLKKNLTATD-SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIV 224
A+ + LK+ + + +V GGSY G LAAW R+++P + A++SS P L +
Sbjct: 156 IATFLAYLKREYNLPEGTKIVAVGGSYSGNLAAWARIQFPFIIDAAISSSGPYLAQTDY- 214
Query: 225 SPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK----SEKN 280
P +I +Q + + C +I + K E L F++ + +
Sbjct: 215 -PEYLQHIDSQVRKYGGDRCMDIISAAHKDAEYLLSHDKA--TLATIFKLKEESIYNSTG 271
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID---DPKTGNDVFAKLYGA 337
S++S + + Y + N +K+MCKAI+ D + + L
Sbjct: 272 YDKASFMSAMGAPSGVVQYAKHDGYYNTTKDGDIKQMCKAIEASYDSYDTGESYQDLKAY 331
Query: 338 ASVYYN-YSGTAKCFDLNGDSDPHGLSE------------WGWQACTE 372
AS + Y G+ + DL+ D + + W WQ C E
Sbjct: 332 ASWLLDYYGGSMEEIDLSFDGYIKAIQDTSIDSEFAVDRSWLWQTCVE 379
>gi|358370554|dbj|GAA87165.1| serine peptidase, family S28 [Aspergillus kawachii IFO 4308]
Length = 562
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 118/466 (25%), Positives = 182/466 (39%), Gaps = 81/466 (17%)
Query: 37 EKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIF 96
+ L++L+S S + L +T+Y T LDH N S T+Q R+ ++D + + +PIF
Sbjct: 52 QNLNTLVSHSAMAVAL-ETEYVTIPLDHDN---ASAGTYQNRFWVSDEFY---EPGSPIF 104
Query: 97 VYTGNEGDIE-----WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
VY E D E + F + +F A+ + EHRYYG S P I+Y++
Sbjct: 105 VYDTGEADGESIASAYLTSTLSFFREFLIEFNAMGIAWEHRYYGNSTP----APISYESP 160
Query: 152 STT-GYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYP 204
T YL++ QALAD + + +LT +P V+ GGSY G+ AA R +YP
Sbjct: 161 PETYQYLTTKQALADLPYFASNFSREKYPDVDLTPQGTPWVMVGGSYAGIRAALTRNEYP 220
Query: 205 HVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS----ENCYKVIKGSWKQIEETAK 260
A +SSAP+ N+ Y Q +R + NC I + + I++
Sbjct: 221 ETIFAAYSSSAPVEARVNMSVYYD------QVYRGMVADGWANCSADIHAALEYIDDQLS 274
Query: 261 KPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKA 320
+++ F +E N + + +Y+ Y + C+
Sbjct: 275 NEDTATSIKQLFFGPGAEANSNGDFTGALTAIYSYFQSYGMAGGIGG------LGAFCEY 328
Query: 321 ID-DPK----TGNDVFAKLYGAASV------------YYNYSGTAKCFDLNGD------- 356
++ DPK TG D A YG V N + C N
Sbjct: 329 LEVDPKTNGTTGPDGLAPTYGGQYVAERWAAWPTFLELVNLNMGTNCGPQNASQPIDCDF 388
Query: 357 SDPHG---LSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY------GV- 406
S P+G W WQ C+E + + + + C + G+
Sbjct: 389 SKPYGDPATITWTWQYCSEWGFFQANNEGPHSLASRYQSVEYQQEVCNRQFPDAVEKGLL 448
Query: 407 --DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
PR + I EFGG I SN+ F G DPW +L
Sbjct: 449 PPSPRADEINQEFGGWTIR------PSNVYFSGGEFDPWRSLSILS 488
>gi|193786915|dbj|BAG52238.1| unnamed protein product [Homo sapiens]
Length = 128
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 67/118 (56%), Gaps = 5/118 (4%)
Query: 383 DSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDP 442
D +FE + + C + +GV PRP+WITT +GG I +NI+F NG DP
Sbjct: 9 DDMFEPHSWNLKELSDDCFQQWGVRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDP 63
Query: 443 WSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
WSGGGV K+I+ T++A+ EGAHH+DLR DP + R EV + WI ++
Sbjct: 64 WSGGGVTKDITDTLVAVTISEGAHHLDLRTKNALDPMSVLLARSLEVRHMKNWIRDFY 121
>gi|392572044|gb|EIW65216.1| peptidase S28 [Trametes versicolor FP-101664 SS1]
Length = 528
Score = 94.7 bits (234), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 128/508 (25%), Positives = 209/508 (41%), Gaps = 116/508 (22%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG--DIEWFAQNTGFMY 116
TQ LDHF + TFQQRY ++D H+ K P+ V+ EG D +TG +
Sbjct: 64 TQPLDHFV---DTGFTFQQRYWLSDRHY---KPGGPVIVFEAGEGPGDERMPILDTGILN 117
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID---- 172
+A L + +EHRYYG+S+P + + +L++ QA AD A+ I +
Sbjct: 118 ILANATDGLAIVLEHRYYGESVP------VQNFTTDSLRWLNNEQAAADSANFIDNVTFP 171
Query: 173 -LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSN 231
+ +LTA +P + +GGSYGG AA R+ YP + GA+ASS + + + + +
Sbjct: 172 GIPGDLTAPGTPWIYYGGSYGGARAAHMRVLYPDLVFGAIASSGVV---HATLDDWRYFD 228
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAF 291
II Q S C Q+E T ++++ + ++ LAI+S F
Sbjct: 229 IIRQ---SAPAACI-------TQVERT------IDEVDRLITSPNAKTRLAIKS----VF 268
Query: 292 VYTAMTDYPTPSNFL-NPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA------ASVY--- 341
+T P ++ L NPL A+ DP G+ FA+ A A+V+
Sbjct: 269 GLQNVTYDPDFASLLSNPLGAWQSNNW-----DPAVGSTSFARFCAALGTPDNATVHTVQ 323
Query: 342 --------YNYS---------------GTAKCFDLNGDSDPHGLSE-------WGWQACT 371
+NY+ +CF D ++ W +Q CT
Sbjct: 324 GITVSNATFNYATYINRTISRECQPPQNQDECFGTITAPDQFKATDLSQTWRLWDFQVCT 383
Query: 372 EM-IMLTGGDNKDS---IFEESEEDYDARARYCKEAYG------VDPRPNW-ITTEFGGH 420
+ +T N + I + +DY + CK AY V P P+ + GG+
Sbjct: 384 QWGFFMTPPPNPATPRIISKLITQDY--ASLICKLAYDPGEHFQVPPEPDVEAVNKLGGY 441
Query: 421 KIGLVLKRFASNIIFFNGLRDPWSG----GGVLKNISKTVLALVE--KEGAHHVD---LR 471
I + +G DPW G + + T L + HH D L
Sbjct: 442 SIAY------DRLAIIDGQDDPWRGDTPHSPAARPRADTTLRPFKLIPLAVHHYDENGLA 495
Query: 472 FSTKEDPQWLKDVRRREVEIIGKWISQY 499
++E PQ ++ + ++E+E + W+ +
Sbjct: 496 DPSQEPPQ-IQAIHQQEIEFVKAWLKDF 522
>gi|302790399|ref|XP_002976967.1| hypothetical protein SELMODRAFT_416857 [Selaginella moellendorffii]
gi|300155445|gb|EFJ22077.1| hypothetical protein SELMODRAFT_416857 [Selaginella moellendorffii]
Length = 982
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 117/445 (26%), Positives = 179/445 (40%), Gaps = 70/445 (15%)
Query: 4 RFIFLSFCLLFSSTLTISNAKIFPTFPSSRI--TPEKLSSLISSSKDSQGLYKTKYHT-Q 60
R + + F LL +ST + S+A + P+SR+ PE L + S +G + T Q
Sbjct: 2 RVLLVCFVLLATST-SFSSAFV----PASRLGFKPEFLPA--GSRSPPRGKETVNFFTRQ 54
Query: 61 ILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAP 120
LDHF P+ + F Q+YL + N PIF+ E T ++ +A
Sbjct: 55 KLDHFA--PEDPRVFSQKYLELLDFF--RPRNGPIFLVMCGESTCTGNYVTT-YVGTLAE 109
Query: 121 KFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTAT 180
F A +V +EHRYYG S P+ + N YL+S Q+L D+A I +
Sbjct: 110 SFGAAIVTVEHRYYGHSSPF------QHLNLHNLKYLTSKQSLFDHAVFIDYYQAR---- 159
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSV 240
GSY G L+AWFRLK+PH+ G+ ASSA + + +S Q SV
Sbjct: 160 --------GSYAGALSAWFRLKFPHLVAGSWASSAV------VEAILDYSAYDKQLGVSV 205
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYP 300
C + ++ + E+ + K F + N+ ++ L +V A
Sbjct: 206 GPKCKQALQEITRLTEQGLVENATEIKYLFGF---SPQDNITDDTLLD--YVANAAA--- 257
Query: 301 TPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPH 360
+ + +C + + N K Y N D NG+ +
Sbjct: 258 ------GEIQYGKIDGLCDPLLKAEKSNRNLLKTYAKILERINN-------DTNGNERDN 304
Query: 361 GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE--FG 418
W +Q CTE+ ++ S S + YC E +G P+ TT +G
Sbjct: 305 --ESWDFQYCTEVGYFQVASDRKSSVRSSRINTQFFINYCSEQFGNGTFPDVKTTNLYYG 362
Query: 419 GHKIGLVLKRFASNIIFFNGLRDPW 443
G I S I+F NG +DPW
Sbjct: 363 GRNIA------GSRIMFLNGSQDPW 381
>gi|403416809|emb|CCM03509.1| predicted protein [Fibroporia radiculosa]
Length = 562
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 100/193 (51%), Gaps = 22/193 (11%)
Query: 30 PSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGS 89
P + + + L SL D + + TQ LDHF+ P TF QRY IN H+
Sbjct: 52 PHALVVQQPLQSL-----DEPSEFPAHWFTQPLDHFSKTPH---TFNQRYWINTRHYK-P 102
Query: 90 KNNAPIFVYTGNE--GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIA 147
+NAP+ V G E G+ +TG + +A + V +EHRYYG+S+P +A
Sbjct: 103 GSNAPVIVLDGGETSGEDRIPFLDTGIVEILAKATGGVGVVLEHRYYGESVP------VA 156
Query: 148 YKNASTTGYLSSTQALADYASLIIDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLK 202
+ +L++ Q+ AD A+ + ++K ++LTA +P + +GGSY G AA R+
Sbjct: 157 NLTTDSLRWLNNDQSAADSANFMANVKFPGIHEDLTAPGTPWIYYGGSYAGARAAHMRIL 216
Query: 203 YPHVAIGALASSA 215
YP + GA+ASS
Sbjct: 217 YPDLVYGAIASSG 229
>gi|123435014|ref|XP_001308906.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121890608|gb|EAX95976.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 527
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 109/402 (27%), Positives = 171/402 (42%), Gaps = 49/402 (12%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPK 121
+DHF+ + SY + +R+L N T + A +Y G E + G ++A +
Sbjct: 30 IDHFDTHDSSY--YMERFLENLTFVNKTFKKA--LLYIGGESTLSPRYVQAGSYLELAAR 85
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL--KKNLTA 179
A + +EHR++GKS+P+ + YK YL+ QALAD A I +L
Sbjct: 86 ENAAVFALEHRFFGKSMPFDQLTKENYK------YLTIPQALADLAEFIERYIYTHHLAD 139
Query: 180 TDS-PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
D V V GGSY G L++WFRLKYPH+A+ + ASSAP+ N N Y + +
Sbjct: 140 QDGVTVAVVGGSYPGALSSWFRLKYPHLAVASWASSAPV-NVKNDFPEY--DEYVAKRVN 196
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI---CKSEKNLAIESWLSTAFVYTA 295
++ C + + + I A K G K+ AF+ K E N I + A V +A
Sbjct: 197 LSADGCLERTRKVF-DISHEAVKSGDASKI-AAFKDKYGIKHETN-DISALYIIADVLSA 253
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAA--SVYYNYSGTAKCFDL 353
M Y + L+ + CK I + ++ ++ + +Y N + +DL
Sbjct: 254 MVQYNSRYGVLD--------QYCKKITESQSESE-YENIYVQTFKDFLKNNGQEPEDYDL 304
Query: 354 --NGDSDPHGLS----EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD 407
+DP + W + C E+ K S + D C+ +G+
Sbjct: 305 LQATSTDPTSATANSRSWSYMTCNEVGWFQTASGK---LRSSLLNIDYFTTVCQNLFGIS 361
Query: 408 -PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGV 448
N + +FG G + + F NG DPWS GV
Sbjct: 362 LADTNQVNYKFGNINPG------QTQVYFSNGDVDPWSTLGV 397
>gi|393233661|gb|EJD41230.1| peptidase S28 [Auricularia delicata TFB-10046 SS5]
Length = 497
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 121/491 (24%), Positives = 197/491 (40%), Gaps = 63/491 (12%)
Query: 49 SQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIE 106
++ + ++ Q LDHF+ + TF QRY +ND H+ ++ P+ V G E G+
Sbjct: 4 AEAKFPARWFRQPLDHFDR--KRRDTFLQRYWVNDRHY---RSGGPVIVLDGGETSGENR 58
Query: 107 WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADY 166
+TG + +A L V +EHRYYG+SIP + + +L + Q+ AD
Sbjct: 59 LPFLDTGIVDILAKATHGLGVVLEHRYYGRSIP------VLNLTTDSLRWLDNKQSAADS 112
Query: 167 ASLIIDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFD 221
A+ + ++K ++LTA +P + +GGSY G AA R+ YP + GA+ASSA +
Sbjct: 113 ATFMANVKFEGISEDLTAPGTPWIYYGGSYAGARAAHMRVLYPDLTFGAIASSA--VTHA 170
Query: 222 NIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNL 281
IV + + +I Q S C ++ S I+ + P L+K F + E +
Sbjct: 171 AIVY-WEYYEVIRQ---SAPTGCIARLERSIDIIDRVLQVPVLRRPLKKLFGLEDLEHDD 226
Query: 282 AIESWLST------------AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND 329
S L + A T D+ N + P+K +D
Sbjct: 227 DFASVLESPLGGWQGRNWDPAVSSTRFEDFCAVIEGDN--KSAPIKLGPFELDIGVLNYA 284
Query: 330 VFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLS------EWGWQACTE--MIMLTGGDN 381
+ K Y AA + +CF DS S W +Q CTE M T D
Sbjct: 285 AYIKQYVAARCPAG-NTVEQCFGSYDDSKYLDFSLSNDWRLWLFQVCTEWGYFMTTPPDP 343
Query: 382 KDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLV--LKRF---ASNIIFF 436
R C++AY P + T G V L F A + F
Sbjct: 344 AHPRIISRLITLSYVTRICRQAY---PPGRYFTVPRLPDVDGAVNSLGDFAIAADRLAFI 400
Query: 437 NGLRDPWSGGGVLKNISKTVLALVEK------EGAHHVDLR--FSTKEDPQWLKDVRRRE 488
+G DPW +K + + HH D + ++P+ ++++ E
Sbjct: 401 DGTADPWMPATPHSWHAKPRPDTTRRPFKWIVDAVHHWDENGLANPADEPRRIREIHYEE 460
Query: 489 VEIIGKWISQY 499
V+ + W+ ++
Sbjct: 461 VQFVRSWLREF 471
>gi|302795835|ref|XP_002979680.1| hypothetical protein SELMODRAFT_419387 [Selaginella moellendorffii]
gi|300152440|gb|EFJ19082.1| hypothetical protein SELMODRAFT_419387 [Selaginella moellendorffii]
Length = 462
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 111/413 (26%), Positives = 180/413 (43%), Gaps = 45/413 (10%)
Query: 73 QTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHR 132
+TF QRY ++ N P+F+ EG + + +A +F A +V +EHR
Sbjct: 34 RTFAQRYYEFTDYFDAP--NGPVFLKICGEGPCVGIQNDYSAV--LAKRFGAAIVSLEHR 89
Query: 133 YYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IIDLKKNLTATDSPVVV 186
YYG+S P+ K A +N YLSS QAL D A+ +I+ + N T +D+P +V
Sbjct: 90 YYGQSSPF---KSHATENLI---YLSSKQALFDLAAFREYYQDLINHRTNST-SDNPWIV 142
Query: 187 FGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQ-DFRSVSENCY 245
GGSY G L+AWF+LK+PH+A+G++ASS + + + F+ Q V+E+
Sbjct: 143 MGGSYSGALSAWFKLKFPHLAVGSVASSGVVQ------AIFKFTKFDEQAKLFLVAESAG 196
Query: 246 KVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMT--DYPTPS 303
+ + + A++ GL++ + + + + L ++ A Y P
Sbjct: 197 ATCSAALRAVTRLAEQ--GLKENSVSTKALFNAEQLDVDGDFLYFLADAAAIAFQYGNPD 254
Query: 304 NFLNPL-PAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGL 362
+PL A+ E A+ K D + + ++ Y+ + GD L
Sbjct: 255 ILCSPLVAAYKRNEDLLAV-YAKYVKDYYIDTFKSSINTYDQKHLKENL-AAGDHSSDRL 312
Query: 363 SEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE--FGGH 420
W +Q CTE+ +SI + + C +G P T +GG+
Sbjct: 313 --WWYQVCTEVAYFQAAPANNSI-RSALVNVKYHLDLCSNVFGNGTFPEVDNTNLYYGGN 369
Query: 421 KIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK--EGAHHVDLR 471
KI I+F NG +DPW + S+ A V K AH VD+R
Sbjct: 370 KIR------GDKILFMNGSQDPWRHASK-QTSSRNEPAYVIKCQNCAHCVDMR 415
>gi|315052106|ref|XP_003175427.1| hypothetical protein MGYG_02952 [Arthroderma gypseum CBS 118893]
gi|311340742|gb|EFQ99944.1| hypothetical protein MGYG_02952 [Arthroderma gypseum CBS 118893]
Length = 545
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 132/562 (23%), Positives = 218/562 (38%), Gaps = 93/562 (16%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPE--KLSSLISSSKDSQGLYKTKYHTQIL 62
+ + CLL L + P P S++ E + +L++ S+D + +
Sbjct: 1 MVKFTTCLLL---LVAAVQAKLPVTPISQLKAESHRTKALLARSEDVNAAFPAHTIKIPI 57
Query: 63 DHF----NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE----GDIEWFAQNTGF 114
DHF Y P + F RY + +H+ K P+ + G E G I + + G
Sbjct: 58 DHFPKSSRYEPHTTDKFDLRYWFDASHY---KEGGPVIILHGGETDGAGRIPFLQK--GI 112
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+ +A + V +EHRYYG S+P ++ + K+ +L++ QALAD A ++K
Sbjct: 113 LAQLAQATNGIGVIMEHRYYGGSLP---TRDFSNKSLR---FLTTEQALADTAYFSKNIK 166
Query: 175 ------KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVS 225
NLTA + +++GGSY G A+ R +YP + GA++SS I ++
Sbjct: 167 FPGLEKYNLTAPGTAHILYGGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDYWQYFE 226
Query: 226 PYSFSNI-----ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA-FRICKSEK 279
P +TQ+F + +N +I G + K GL +L+ A F S
Sbjct: 227 PIRQEAPQDCVHVTQNFVDIVDNI--IIHGKNANTIKELKNLFGLGRLRDADFANALSS- 283
Query: 280 NLAIESWLST----AFVYTAMTDYP---TPSNFLNPLPA---FPVKEMCKAIDDPKTGND 329
I W ST A + Y T +L P+ A K + +A + +
Sbjct: 284 --GITGWQSTNWDPAISGKSFYQYCGEITSDRYLYPVTAEQKASAKRIIEAGGHGRESPE 341
Query: 330 VFAKLYGAASVYYNYSGTAKCFD--------LNG--------DSDPHGLSEWGWQACTEM 373
+ +L + N S A C LN D+ W WQ C E
Sbjct: 342 ILPQLLNFVG-WLNKSTLASCAGQGQTAEECLNSYDEDFYKQDNADQSWRAWPWQYCNEW 400
Query: 374 IMLTGGDNKDSIFEESEE---DYDARARYCKEAYGV-DPRPNWITTEFGGHKIGLVLKRF 429
L G D + CK+A+G+ P + ++G + + R
Sbjct: 401 GYLQTGSGAPKNIRPVISRLIDLPYTSNICKQAFGITKPSNVELVNKYGA--LDIEYDRL 458
Query: 430 ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK------EGAHHVDL------RFSTKED 477
A F +G DPW GV ++ K E HH D + +
Sbjct: 459 A----FIDGASDPWKEAGVHAAAARKRPTSTNKPFILIPEAVHHWDENGLYPNETTAELP 514
Query: 478 PQWLKDVRRREVEIIGKWISQY 499
PQ +K+V+ E + +W+ ++
Sbjct: 515 PQRIKEVQAEEARFVKEWMKEW 536
>gi|195353641|ref|XP_002043312.1| GM26842 [Drosophila sechellia]
gi|194127426|gb|EDW49469.1| GM26842 [Drosophila sechellia]
Length = 487
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 119/486 (24%), Positives = 202/486 (41%), Gaps = 58/486 (11%)
Query: 12 LLFSSTLTISNAKIFPTFPSSRITPEKLSS---LISSSKDSQGLYKTKYHTQILDHFNYN 68
L FS L TF R E L+ L SK+ + + Q +DHF+ N
Sbjct: 4 LNFSVQLIFCLPSFLATFNPYRRNVELLNHEPVLGICSKNELASVEELWLDQKVDHFDKN 63
Query: 69 PQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVF 128
+T++ RY N H+ K PI+++ G E I +TG +D+A + +L +
Sbjct: 64 NN--RTWKMRYYRNAKHF---KPQGPIYIFVGGEWTISPGLLSTGLTHDMAVENSGMLFY 118
Query: 129 IEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN-LTATDSPVVVF 187
EHRYYG S+P+G +Y+ + LS Q+LAD A I K N DS V++
Sbjct: 119 TEHRYYGLSLPFGHE---SYQ-LNNLKQLSLHQSLADLAHFIRHQKSNGPEMEDSKVILV 174
Query: 188 GGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR-SVSENCYK 246
GGSY G L W YP + + ASSAP+L + + + ++ + + S +NC
Sbjct: 175 GGSYSGSLVTWMTQLYPDLIAASWASSAPLLAKADF---FEYMEVVGKSIQLSYGKNCSL 231
Query: 247 VIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL 306
I+ +K + AK G +++Q+ + + ++ L A + + +Y +
Sbjct: 232 RIEKGFKFL---AKLFDG-DEIQELLYNLNGCEGYSPKNPLDRAAFFNGLGNYFAL--VV 285
Query: 307 NPLPAFPVKEMCKAIDDPKTGNDV----FAKLYGAASVYYNYSGTAKCFDLNGDSDPHGL 362
+ + +C+ + + +++ F KL +Y ++ C D S
Sbjct: 286 QSYRSAYIPRLCETLMSLDSSDELAFIEFLKL-----LYSEGRRSSDCQDFGYSSMLELF 340
Query: 363 SE----------WGWQACTEMIMLTGGDNKDSIFE--ESEEDYDARARYCKEAYGVDPRP 410
+E W +Q C E T +K S + ++ + C++A+G +
Sbjct: 341 TEDSVQSSETRAWFYQTCNEFGWYTTTKSKSSASQAFANQVPLGYFEQLCQDAFGAEQTA 400
Query: 411 NWIT-------TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKE 463
+ ++F G G + +IF +G DPWS G K VL
Sbjct: 401 QQLAQGVEQTNSKFDG--FGFNQSERYAQVIFTHGELDPWSALGQQKGDQAIVLT----- 453
Query: 464 GAHHVD 469
G HV+
Sbjct: 454 GYSHVE 459
>gi|268552333|ref|XP_002634149.1| Hypothetical protein CBG01710 [Caenorhabditis briggsae]
Length = 507
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 86/307 (28%), Positives = 138/307 (44%), Gaps = 30/307 (9%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-----WFAQNTG 113
TQ +DHF N + T+ QRY N + +K +F+ G EG I+ W
Sbjct: 33 TQYVDHFANNTSA--TWLQRYQYNSKFY--NKTVGYVFLMLGGEGAIDPPGDKWVRHEGE 88
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT---GYLSSTQALADYASLI 170
M A +F A +EHR+YG +KE + TT L+ QALAD I
Sbjct: 89 TMMVWAKEFGAAAFQVEHRFYG-------SKEFSPLGDQTTESLKLLTIDQALADIKEFI 141
Query: 171 IDLKKNLTATDSPV-VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
+ K D P+ + FGGSY G L+AWFR YP + GA++SS+ + F V Y +
Sbjct: 142 NQMNKMYFPNDKPIWITFGGSYPGSLSAWFRETYPEMTAGAVSSSSAVHVF---VDYYGY 198
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA-FRICK--SEKNL--AIE 284
+ + +R+VS C +VI+ ++ +I++ A L K F +C E NL +++
Sbjct: 199 AINTEKTYRTVSAPCAEVIRTAFTEIQKKAYNGSDARALLKGTFNLCDDFDENNLSKSVQ 258
Query: 285 SWLSTAFVY-TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYN 343
+ + Y + Y + V C +++ G +V ++ + Y +
Sbjct: 259 FFFQNIYGYFQGINQYTGDNRNTATRNGLGVPGACAILNNATIGEEV-DRVVAVLNWYDS 317
Query: 344 YSGTAKC 350
+SG C
Sbjct: 318 FSGPLTC 324
Score = 42.0 bits (97), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 6/88 (6%)
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT-VLALVEKEGAHHVD 469
+W+ T++GG +N+ F NG DPW G N++ V + + AH D
Sbjct: 395 HWMRTKYGGAD-----AYRGTNVCFPNGSFDPWQDLGHKMNVTNNNVDSWLIDGTAHCAD 449
Query: 470 LRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ + D Q LKD R+R + + +W+S
Sbjct: 450 MYPARDSDKQSLKDARQRIHDHLARWLS 477
>gi|104531986|gb|ABF72901.1| CG3734-like [Belgica antarctica]
Length = 184
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/166 (36%), Positives = 93/166 (56%), Gaps = 14/166 (8%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI-EWFAQNT 112
+TK+ Q L+HF+ PQ ++ +Q RY+ N ++ PIF+Y G E I E + +++
Sbjct: 24 ETKWIEQPLNHFD--PQDHRVWQMRYMENREF---LQDGGPIFIYVGGEWTISEGWLRSS 78
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID 172
F Y +A + L + EHRYYG+S P +++ N +L+ QALAD A I
Sbjct: 79 HFHY-MAEQLNGTLYYTEHRYYGESHP---TEDLTVDNLR---FLNIDQALADLAHFITH 131
Query: 173 LKKNLTA-TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
+K+ +S V++ G SY + WF KYPH+A GA +SSAP+
Sbjct: 132 IKQTTPELQNSGVILIGASYSATMVTWFMQKYPHLARGAWSSSAPL 177
>gi|195158030|ref|XP_002019897.1| GL12651 [Drosophila persimilis]
gi|194116488|gb|EDW38531.1| GL12651 [Drosophila persimilis]
Length = 482
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/207 (30%), Positives = 103/207 (49%), Gaps = 12/207 (5%)
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
N++ + QT++ RYL N + +N PIF++ G E I +TG +D+A + +
Sbjct: 59 NFDALNNQTWKMRYLRNGKY---HRNQGPIFIFVGGEWSISPGFLSTGLTHDMAVENSGM 115
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI-IDLKKNLTATDSPV 184
L + EHRYYG+S+P+G +LS Q+LAD A I +N S V
Sbjct: 116 LFYTEHRYYGQSLPHGKES----FRVDKLQHLSIYQSLADLAHFIRFQKSENPRMKQSEV 171
Query: 185 VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR-SVSEN 243
++ GGSY G + AW YP + + ASSAP+L + + + + + R S +N
Sbjct: 172 ILVGGSYSGSMVAWMTQLYPDLIAASWASSAPLLAKADF---HEYMEVASNSIRLSYGQN 228
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQK 270
C I+ ++ + + ++ E LQK
Sbjct: 229 CTTRIQKGFQHLTKLFEENQIPELLQK 255
>gi|195109598|ref|XP_001999370.1| GI24472 [Drosophila mojavensis]
gi|193915964|gb|EDW14831.1| GI24472 [Drosophila mojavensis]
Length = 499
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 71/210 (33%), Positives = 108/210 (51%), Gaps = 11/210 (5%)
Query: 10 FCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNP 69
F LLF+ ++ S + P S + ++ SS + +D+ + + + TQ LDHF
Sbjct: 10 FTLLFALFVSGSLKALNPYRHSWELLLQEPSSGPYTREDAAAV-QELWLTQNLDHFEAGD 68
Query: 70 QSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFI 129
+T+Q RY N + K P++++ G E I +TG +D+A + +L +
Sbjct: 69 N--RTWQMRYFRNAKY---HKPQGPMYIFLGGEWTITPGLLSTGLTHDMAVENAGILFYT 123
Query: 130 EHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK-KNLTATDSPVVVFG 188
EHRYYG+S P+ N + KN YL+ QALAD A I K ++ T S V++ G
Sbjct: 124 EHRYYGQSWPFENNN-LTVKNLK---YLNLHQALADVAHFIRYQKSQSANLTHSKVILIG 179
Query: 189 GSYGGMLAAWFRLKYPHVAIGALASSAPIL 218
GSY G +AAW YP + ASSAP+L
Sbjct: 180 GSYSGSMAAWMTHLYPELVAAVWASSAPLL 209
>gi|198455513|ref|XP_001360029.2| GA11106, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198133277|gb|EAL29181.2| GA11106, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 482
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/207 (30%), Positives = 103/207 (49%), Gaps = 12/207 (5%)
Query: 66 NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKAL 125
N++ + QT++ RYL N + +N PIF++ G E I +TG +D+A + +
Sbjct: 59 NFDALNNQTWKMRYLRNGKY---HRNQGPIFIFVGGEWSISPGFLSTGLTHDMAVENSGM 115
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI-IDLKKNLTATDSPV 184
L + EHRYYG+S+P+G +LS Q+LAD A I +N S V
Sbjct: 116 LFYTEHRYYGQSLPHGKES----FRVDKLQHLSIYQSLADLAHFIRFQKSENPRMKQSEV 171
Query: 185 VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR-SVSEN 243
++ GGSY G + AW YP + + ASSAP+L + + + + + R S +N
Sbjct: 172 ILVGGSYSGSMVAWMTQLYPDLIAASWASSAPLLAKADF---HEYMEVASNSIRLSYGQN 228
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQK 270
C I+ ++ + + ++ E LQK
Sbjct: 229 CTTRIQKGFQHLTKLFEENQIPELLQK 255
>gi|157119581|ref|XP_001659434.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108875271|gb|EAT39496.1| AAEL008699-PA [Aedes aegypti]
Length = 512
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 119/529 (22%), Positives = 219/529 (41%), Gaps = 75/529 (14%)
Query: 6 IFLSFCLLFSSTLTISNAKIFPTFPSSR-ITPEKLSSLISSSKDS-----QGLYKTKYHT 59
+ ++ L+ + S ++ P SR + + ++ SS+K + + + + T
Sbjct: 5 VLVTLALVVMTATAASVSRRVPNKELSRTLMQQVMTKRFSSTKPAVDDPAKRIVIENFFT 64
Query: 60 QILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTG-NEGDIEWFAQNTGFMYD 117
+DHFN Q+ + RY + D + G PI ++ G N+ + + +YD
Sbjct: 65 TRVDHFN--SQNTAEWTLRYFAVTDYYMPG----GPILIFLGGNQPILTSMVDESTLIYD 118
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGY--LSSTQALADYASLIIDLKK 175
+A + + E R+YG+S ++AST L++ Q LAD A + LK+
Sbjct: 119 MAREMNGAVYAFESRFYGQSF--------VTEDASTENLSLLNTDQILADLAEFVQYLKR 170
Query: 176 N-LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSPYSFSN 231
+ L ++PV+V G YGG LA WFR++YPH+A A +SS +++F + +
Sbjct: 171 DVLKNPNAPVMVSGSEYGGALATWFRVRYPHLAQAAWSSSGYHHALMDFQEFSEAWGQTL 230
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLS 288
I S+ CY I ++ ++ G + L F IC E + + + S
Sbjct: 231 I-----DHGSQECYNDIFVAFHVMQNLIDIGLG-DILYDKFNICSPIDPENRIQVMYFFS 284
Query: 289 TAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI--DDPKTGNDVFAKLYGAASVYYNYSG 346
+ TA+ Y ++ LN ++C+ I DD T D FA + ++
Sbjct: 285 V--LMTAVEIYTLRNHDLNDF-----ADVCQDITDDDFPTALDAFANWFNT-----KFAE 332
Query: 347 TAKCFDLNGDSDPHGLSEWGWQACTEMI----MLTGGDNKDSIFEESEEDY--------- 393
C + D+ S+ W M+ + N+ F ++ D+
Sbjct: 333 DIGCVVTDVDTMVEAFSQPAWDDAFTMMGARQAMYQMCNEFGWFFTTDSDFQPFGSRVYL 392
Query: 394 DARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASN------IIFFNGLRDPWSGGG 447
+ + C+ +G +WI+ E + RF N + F NG DPW
Sbjct: 393 ELYSETCRMVFG-----DWISYESIYYATQRANNRFGGNDPRITEVHFTNGAEDPWRMIS 447
Query: 448 VLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ + + LA V DL ++ D + L++V+RR ++ ++
Sbjct: 448 ITSDRNALALADVIPRELSSSDLPAISENDSEELQEVKRRVKALMSTYL 496
>gi|297734877|emb|CBI17111.3| unnamed protein product [Vitis vinifera]
Length = 149
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 52/137 (37%), Positives = 71/137 (51%), Gaps = 21/137 (15%)
Query: 194 MLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWK 253
+LA+W RLKYPHVA+GALASSAPIL FD+I SE CY I+ SW
Sbjct: 32 LLASWLRLKYPHVALGALASSAPILYFDDITPQ-----------NEASEICYNTIRESWS 80
Query: 254 QIEETAKKPGGLEKLQKAFRICKS-EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAF 312
+I++ A +P GL L K FR C + ++ +L + A ++P P +
Sbjct: 81 EIDKVASEPDGLSILSKKFRTCTELSTSDELKDYLDETYSVAAQYNHP---------PRY 131
Query: 313 PVKEMCKAIDDPKTGND 329
PV +C ID G+D
Sbjct: 132 PVTVVCGGIDGAPEGSD 148
>gi|242039113|ref|XP_002466951.1| hypothetical protein SORBIDRAFT_01g017300 [Sorghum bicolor]
gi|241920805|gb|EER93949.1| hypothetical protein SORBIDRAFT_01g017300 [Sorghum bicolor]
Length = 401
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 106/407 (26%), Positives = 172/407 (42%), Gaps = 58/407 (14%)
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
+A KF A +V EHRYYGKS P+ K++ +N +LSS QAL D A +++L
Sbjct: 18 LAKKFGAAVVTPEHRYYGKSSPF---KQLTTENLR---FLSSKQALFDLAVFRQYYQESL 71
Query: 178 TAT-------DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
A D+P V G SY G L+AWFRLK+PH+ G+LASS +L + Y+++
Sbjct: 72 NARYNRSSGFDNPWFVIGISYAGALSAWFRLKFPHLTCGSLASSGVVL------AVYNYT 125
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
+ Q S C +++ + ++E + K + K++ + +L+ A
Sbjct: 126 DFDKQVGESAGPQCKAILQEITELVDEQLRLDSHSVKTLFGAQTLKNDGDFLF--FLADA 183
Query: 291 FVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKC 350
T + NP P ++ KA + K + +A+ +
Sbjct: 184 AAITF--------QYGNPDALCP--QLIKAKKNRKNLVEAYAQFVKGFYI-KEMETPPSS 232
Query: 351 FDLN--GDSDPHGLSE--WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
+D ++ P S W +Q C+E+ DS+ + + C+ +G
Sbjct: 233 YDREYLKETTPEDSSTRLWWFQVCSEVAYFQVAPKNDSV-RSARVNTKYHLDLCRYVFGE 291
Query: 407 DPRPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE-KE 463
P+ T +GG I AS I+F NG +DPW K+ L++ +
Sbjct: 292 GVYPDVFMTNLYYGGTGIA------ASKIVFTNGSQDPWRHASKQKSSKDMPSYLMKCRN 345
Query: 464 GAHHVDLR------FSTKED------PQWLKDVRRREVEIIGKWISQ 498
H DLR F T+ D P + VR+R + I W+ Q
Sbjct: 346 CGHGTDLRGCPQWPFRTEGDSSNCSSPVAVSTVRQRIAKHIDLWLKQ 392
>gi|317034919|ref|XP_001400740.2| serine peptidase, family S28 [Aspergillus niger CBS 513.88]
Length = 562
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 112/462 (24%), Positives = 179/462 (38%), Gaps = 73/462 (15%)
Query: 37 EKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIF 96
+ L+SLIS S + L +T+Y T +DH N S T+Q R+ ++D + + PIF
Sbjct: 52 QNLNSLISHSAMATAL-ETEYATIPIDH---NNASAGTYQNRFWVSDEFY---QPGNPIF 104
Query: 97 VYTGNEGDIEWFAQN-----TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
VY E D AQ+ F + +F A+ + EHRYYG S P ++Y+
Sbjct: 105 VYDTGESDGGSIAQSYLTSTLSFFREFLIEFNAMGIAWEHRYYGNSTP----APVSYETP 160
Query: 152 STT-GYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYP 204
YL++ QALAD + + +LT +P ++ GGSY G+ AA R +YP
Sbjct: 161 PEAWQYLTTKQALADLPYFASNFSREKYPDMDLTPQGTPWIMVGGSYAGIRAALTRKEYP 220
Query: 205 HVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGG 264
A +SS+P+ N+ + Y + S NC I + + I++
Sbjct: 221 ETIFAAFSSSSPVEAQVNMSAYY--DQVYRGMVASGWTNCSADIHAALEYIDDQLSDEDT 278
Query: 265 LEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID-D 323
+++ F +E N + + +Y Y + C+ ++ D
Sbjct: 279 ATSVKQLFFGSGAETNSNGDFTAALTAIYGYFQSYGMAGGIGG------LGAFCEYLEID 332
Query: 324 PK----TGNDVFAKLYGAASVYYNYSGTAKCFDL-----NGDSDPHGLSE---------- 364
PK TG D A YG V ++ +L + P S+
Sbjct: 333 PKTNGTTGPDGLAPTYGGQYVAERWAAWPTFLELVNLNMGTNCGPQDASQPIDCDFSKPY 392
Query: 365 -------WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY--GVD-------P 408
W WQ C+E ++ + + + C + VD P
Sbjct: 393 GDPSAITWTWQYCSEWGFFQANNDGPHSLASRYQSVEYQQEVCNRQFPDAVDKGLLPPSP 452
Query: 409 RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
R + + EFGG I SN+ F G DPW +L
Sbjct: 453 RADDVNQEFGGWTIR------PSNVYFSGGEFDPWRSLSILS 488
>gi|327298966|ref|XP_003234176.1| hypothetical protein TERG_04769 [Trichophyton rubrum CBS 118892]
gi|326463070|gb|EGD88523.1| hypothetical protein TERG_04769 [Trichophyton rubrum CBS 118892]
Length = 553
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 132/555 (23%), Positives = 216/555 (38%), Gaps = 91/555 (16%)
Query: 8 LSFCLLFSSTLTISNAKIFPTFPSSRITPE--KLSSLISSSKDSQGLYKTKYHTQILDHF 65
L+ CLL L + P P S++ E + +L++ S+D + +DHF
Sbjct: 21 LTACLLL---LVAAVQAKLPVTPISQLRAESHRNKALVARSQDVNAAFPAHTIQIPIDHF 77
Query: 66 ----NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE----GDIEWFAQNTGFMYD 117
Y P + + F RY + +H+ K P+ + G E G I + + G +
Sbjct: 78 PKSSRYEPHTTEKFNLRYWFDASHY---KEGGPVIILHGGETSGEGRIPFLQK--GILAQ 132
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK--- 174
+A + V +EHRYYG S+P + N S +L++ QALAD A ++K
Sbjct: 133 LAQATNGIGVIMEHRYYGGSLPTPD-----FSNKSLR-FLTTEQALADTAYFSKNIKFPG 186
Query: 175 ---KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSPYS 228
NLTA + +V+GGSY G A+ R +YP + GA++SS I ++ P
Sbjct: 187 LEKYNLTAPGTAHIVYGGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDYWQYFEPIR 246
Query: 229 FSNI-----ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA-FRICKSEKNLA 282
+TQ+F + +N +I G + K GL +L+ A F S
Sbjct: 247 QEAPQDCVHVTQNFVDIVDNI--IIHGKNANTTKELKNLFGLGRLRDADFANALSS---G 301
Query: 283 IESWLSTAF-------VYTAMTDYPTPSNFLNPLPA---FPVKEMCKAIDDPKTGNDVFA 332
I W ST + + T +L P+ A K + +A + ++
Sbjct: 302 ITGWQSTNWDPAISGKSFYQYCSEITSDRYLYPVTAQQKASAKRIIEAGGHGREAPEILP 361
Query: 333 KLYG-----AASVYYNYSGTAKCFD--LNG--------DSDPHGLSEWGWQACTEMIML- 376
+L S + SG + + LN D+ W WQ C E L
Sbjct: 362 QLLNFVGWLNKSTLESCSGQGQTAEECLNSYDEAFYKQDNADQSWRAWPWQYCNEWGYLQ 421
Query: 377 TGGDNKDSIFEESEE--DYDARARYCKEAYGVDPRPNW-ITTEFGGHKIGLVLKRFASNI 433
TG +I D + CK+A+G+ N + ++G I +
Sbjct: 422 TGSGTPKNIRPVISRLIDLPYTSNICKQAFGITKPSNVDLVNKYGAFDIEY------DRL 475
Query: 434 IFFNGLRDPWSGGGVLKNISKTVLALVEK------EGAHHVDL------RFSTKEDPQWL 481
F +G DPW GV ++ K + HH D + + PQ +
Sbjct: 476 AFIDGGSDPWKEAGVHATAARKRGTSTNKPFILIPDSVHHWDENGLYPNETTAELPPQRI 535
Query: 482 KDVRRREVEIIGKWI 496
K+V+ E + +W+
Sbjct: 536 KEVQAEEARFVKEWM 550
>gi|281212417|gb|EFA86577.1| hypothetical protein PPL_00378 [Polysphondylium pallidum PN500]
Length = 518
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 98/402 (24%), Positives = 160/402 (39%), Gaps = 40/402 (9%)
Query: 95 IFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT 154
+F+ G EG I VA + KAL++ +E RYYGKSIP + +
Sbjct: 15 VFLILGGEGPIVPEMTRRMPFISVANESKALVIALELRYYGKSIP------VPDLSTDNL 68
Query: 155 GYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
YLS+ Q L D A I+ + T++ +V G SY G LAAW+R+KYPH+ A++SS
Sbjct: 69 MYLSTDQILEDIAEFQIEFSRQFGLTEAKWIVMGCSYAGTLAAWYRMKYPHMVGAAISSS 128
Query: 215 APILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI 274
AP + + F + ++ C K + IE + + ++ F
Sbjct: 129 AP------LKAVTRFDAYDKKVRAALGPKCSSAFKSLFDHIEYELMELKN-QSIKDVFSC 181
Query: 275 CKSEKNLAIESWLSTAFVYTAMTDYPTPSNFL-NPLPAFPVKEMCKAIDDPKTGNDVFAK 333
+S + LS + Y+ Y + L N P F I + +
Sbjct: 182 NRSIDDRMFLFMLSESLSYSV--QYNSKFKLLANICPLF--------IKHSNNMSALLDM 231
Query: 334 LYGAASVYYNYSGTAKCFD------LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFE 387
G + + GT C D N + D G +W WQ C+E K+ +
Sbjct: 232 FIGYIKNMFLFQGTT-CDDYNIFTYANTEIDYSGTRQWTWQMCSEYGWFLVASEKNVTLK 290
Query: 388 ESEEDYD-ARARYCKEAYGVDPRP--NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS 444
S + + C+ +G +P I +G I + S++++ NG DPWS
Sbjct: 291 SSLLNETWWQNEVCRILFGRPMKPFVEKINLLYGPDNIKQL-----SSVLYTNGDLDPWS 345
Query: 445 GGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQW-LKDVR 485
V + + ++ G H + ++ W LK+ R
Sbjct: 346 TLSVSTSCDAPISNILNIAGESHCANWYGETQEDSWDLKNAR 387
>gi|344292456|ref|XP_003417943.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 440
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 85/337 (25%), Positives = 145/337 (43%), Gaps = 43/337 (12%)
Query: 150 NASTTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIG 209
+ ++ YLSS QALAD A+ + + + T + V FG SYGG LA W R+K+P +
Sbjct: 119 STASLHYLSSRQALADIANFRTQIAEKMGLTRNKWVAFGCSYGGSLAVWSRIKHPDLFAA 178
Query: 210 ALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQ 269
A+ SSAPI N Y + ++ + + + C++ +K ++ Q+ + K P KL+
Sbjct: 179 AVGSSAPIQAIANF---YEYLEVVQRSLATHNSKCFQAVKEAFDQVVKMLKLPKYYSKLE 235
Query: 270 KAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLP----------AFPVKEMCK 319
F + S Y ++ ++ T + +PL +F + E+C
Sbjct: 236 DDF----------MYSIFEVKKCYCSLKNFSTQT--FSPLKMIINKNGEQLSFSMDELCD 283
Query: 320 AIDDPKTGN------DVFAKLYGAASVYYNYSGTAKCFDLNGDS-----DPHGLSEWGWQ 368
+ + G+ + LY ++ + K ++ DS +P ++ +Q
Sbjct: 284 MMANTSLGSPYYRYIRIIQLLYNHEYLHCFPAHYRKKLEVYLDSYINHQNPAIGRQFFYQ 343
Query: 369 ACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKR 428
+CTE D+K+ F Y C + +G P N+ + G
Sbjct: 344 SCTEFGFFQTTDSKNLTFTGLPLSYFVEQ--CADFFG--PEFNYDSLHTGVMSTNAYYGG 399
Query: 429 F---ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK 462
F S IIF NG DPW G+ K+ISK + A+ K
Sbjct: 400 FNVTGSKIIFPNGSFDPWHPLGITKDISKDLPAVFIK 436
>gi|134081410|emb|CAK46451.1| unnamed protein product [Aspergillus niger]
Length = 542
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 112/462 (24%), Positives = 179/462 (38%), Gaps = 73/462 (15%)
Query: 37 EKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIF 96
+ L+SLIS S + L +T+Y T +DH N S T+Q R+ ++D + + PIF
Sbjct: 32 QNLNSLISHSAMATAL-ETEYATIPIDH---NNASAGTYQNRFWVSDEFY---QPGNPIF 84
Query: 97 VYTGNEGDIEWFAQN-----TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
VY E D AQ+ F + +F A+ + EHRYYG S P ++Y+
Sbjct: 85 VYDTGESDGGSIAQSYLTSTLSFFREFLIEFNAMGIAWEHRYYGNSTP----APVSYETP 140
Query: 152 STT-GYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYP 204
YL++ QALAD + + +LT +P ++ GGSY G+ AA R +YP
Sbjct: 141 PEAWQYLTTKQALADLPYFASNFSREKYPDMDLTPQGTPWIMVGGSYAGIRAALTRKEYP 200
Query: 205 HVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGG 264
A +SS+P+ N+ + Y + S NC I + + I++
Sbjct: 201 ETIFAAFSSSSPVEAQVNMSAYY--DQVYRGMVASGWTNCSADIHAALEYIDDQLSDEDT 258
Query: 265 LEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID-D 323
+++ F +E N + + +Y Y + C+ ++ D
Sbjct: 259 ATSVKQLFFGSGAETNSNGDFTAALTAIYGYFQSYGMAGGIGG------LGAFCEYLEID 312
Query: 324 PK----TGNDVFAKLYGAASVYYNYSGTAKCFDL-----NGDSDPHGLSE---------- 364
PK TG D A YG V ++ +L + P S+
Sbjct: 313 PKTNGTTGPDGLAPTYGGQYVAERWAAWPTFLELVNLNMGTNCGPQDASQPIDCDFSKPY 372
Query: 365 -------WGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY--GVD-------P 408
W WQ C+E ++ + + + C + VD P
Sbjct: 373 GDPSAITWTWQYCSEWGFFQANNDGPHSLASRYQSVEYQQEVCNRQFPDAVDKGLLPPSP 432
Query: 409 RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLK 450
R + + EFGG I SN+ F G DPW +L
Sbjct: 433 RADDVNQEFGGWTIR------PSNVYFSGGEFDPWRSLSILS 468
>gi|330803268|ref|XP_003289630.1| hypothetical protein DICPUDRAFT_154024 [Dictyostelium purpureum]
gi|325080278|gb|EGC33840.1| hypothetical protein DICPUDRAFT_154024 [Dictyostelium purpureum]
Length = 226
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 63/155 (40%), Positives = 88/155 (56%), Gaps = 16/155 (10%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
++ TQ +DHFN + TFQQRYLIND ++ G+ P+F+ EG + TG
Sbjct: 63 QWFTQNVDHFNI--VNTDTFQQRYLINDQYYDGT---GPVFIMINGEGPMG-LDTVTGLQ 116
Query: 116 YDV-APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL--IID 172
+ V A + AL+V +EHRYYG S +++ N +L+S QALAD A I
Sbjct: 117 FVVWAKQLNALIVSLEHRYYGASFV---TSDLSLDNLQ---FLNSQQALADNAVFREFIA 170
Query: 173 LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVA 207
K N+ +T + V FGGSY G L +WFR+KYPH+
Sbjct: 171 QKYNIPST-TKWVSFGGSYSGALTSWFRIKYPHLV 204
>gi|302807541|ref|XP_002985465.1| hypothetical protein SELMODRAFT_424499 [Selaginella moellendorffii]
gi|300146928|gb|EFJ13595.1| hypothetical protein SELMODRAFT_424499 [Selaginella moellendorffii]
Length = 458
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 118/466 (25%), Positives = 189/466 (40%), Gaps = 71/466 (15%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
++TQ LDH Y Q +TF QRY ++ N P+F+ EG + +
Sbjct: 36 WYTQTLDH--YATQDDRTFSQRYYEFTDYFDAP--NGPVFLKICPEGPCVGIQNDYSAV- 90
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------I 170
+A +F A +V +EHRYYG+S P+ K A +N YLSS QAL D A+ +
Sbjct: 91 -LAKRFGAAIVSLEHRYYGQSSPF---KTHATENLI---YLSSKQALYDLAAFREYYQDL 143
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
I+ + N T D+P +V G SY G L+AWF+LK+PH+A+G++ASS +
Sbjct: 144 INHRTNST-RDNPWIVIGWSYAGALSAWFKLKFPHLAVGSVASSGIVA------------ 190
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTA 290
S C ++ + E+ GL+K + + + L ++
Sbjct: 191 -------ESAGATCSAALRAVTRLAEQ------GLKKNSVFTKALFNAEQLDVDGDFLYL 237
Query: 291 FVYTAMT--DYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA 348
A T Y P +PL A ++ K D + + ++ Y+
Sbjct: 238 LAEAATTAFQYGNPEILCSPLVAAYKRDEDLLAVYAKYVKDYYIDTFKSSINTYDQKHLK 297
Query: 349 KCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDP 408
+ GD L W +Q CTE+ +SI + + C +G
Sbjct: 298 ENL-AAGDHSSDRL--WWYQMCTELGYFQVAPANNSI-RSALINVKYHLDLCSNVFGNGT 353
Query: 409 RPNWITTE--FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK--EG 464
P +T +GG++ I+F NG +DPW + S+ A V K
Sbjct: 354 FPEVDSTNLYYGGNR--------GDKILFMNGSQDPWRHASK-QTSSRNEPAYVIKCQNC 404
Query: 465 AHHVDLRFSTKEDPQWLKDVRRRE--------VEIIGKWISQYFQD 502
H VD+ + PQ + + II +IS++ +D
Sbjct: 405 GHGVDMLGCPQSPPQIGGNTSKCADPEAAQAGQRIIATYISRWLED 450
>gi|15144318|gb|AAK84459.1|AC087192_20 putative serine peptidase [Oryza sativa Japonica Group]
Length = 267
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 65/191 (34%), Positives = 98/191 (51%), Gaps = 22/191 (11%)
Query: 72 YQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEH 131
++ F+QRY ++ K PIF+Y E + ++ +A KF A +V EH
Sbjct: 26 HRQFKQRYYEFLDYYRAPK--GPIFLYICGESSCNGIPNS--YLAVMAKKFGAAVVSPEH 81
Query: 132 RYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL------IIDLKKNLTATDSPVV 185
RYYGKS P+ + + +N +LSS QAL D A ++ K N + DS
Sbjct: 82 RYYGKSSPF---ESLTTENLR---FLSSKQALFDLAVFRQYYQETLNAKYNRSGADSSWF 135
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCY 245
VFGGSY G L+AWFRLK+PH+ G+LASS +L S Y++++ Q S C
Sbjct: 136 VFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL------SVYNYTDFDKQIGESAGPECK 189
Query: 246 KVIKGSWKQIE 256
++ + K ++
Sbjct: 190 AALQETTKLVD 200
>gi|326482280|gb|EGE06290.1| extracelular serine carboxypeptidase [Trichophyton equinum CBS
127.97]
Length = 545
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 130/562 (23%), Positives = 215/562 (38%), Gaps = 93/562 (16%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPE--KLSSLISSSKDSQGLYKTKYHTQIL 62
+ L+ CLL L + P P S++ E + +L++ S+D + +
Sbjct: 1 MVKLTTCLLL---LVAAVQAKLPVTPISQLRAESHRNKALVARSQDVNAAFPAHTIQIPI 57
Query: 63 DHF----NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE----GDIEWFAQNTGF 114
DHF Y P + F RY + +H+ K P+ + G E G I + + G
Sbjct: 58 DHFPKSSRYEPHTTAKFNLRYWFDASHY---KEGGPVIILHGGETSGEGRIPFLQK--GI 112
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+ +A + V +EHRYYG S+P ++ + K+ +L++ QALAD A ++K
Sbjct: 113 LAQLAQATNGIGVIMEHRYYGGSLP---TRDFSNKSLR---FLTTEQALADTAYFSKNIK 166
Query: 175 ------KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVS 225
NLTA + +++GGSY G A+ R +YP + GA++SS I ++
Sbjct: 167 FPGLEKYNLTAPGTAHILYGGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDYWQYFE 226
Query: 226 PYSFSNI-----ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA-FRICKSEK 279
P +TQ+F + +N +I G K GL +L+ A F S
Sbjct: 227 PIRQEAPQDCVHVTQNFVDIVDNI--IIHGKNANTTRELKNLFGLGRLRDADFANALSS- 283
Query: 280 NLAIESWLST----AFVYTAMTDYP---TPSNFLNPLPA---FPVKEMCKAIDDPKTGND 329
I W ST A + Y T +L P+ A K + +A + +
Sbjct: 284 --GITGWQSTNWDPAISGKSFYQYCGEITSDRYLYPVTAEQKASAKRIIEAGGHGREAPE 341
Query: 330 VFAKLYGAASVYYNYSGTAKCFD--------LNG--------DSDPHGLSEWGWQACTEM 373
+ +L + N S C LN D+ W WQ C E
Sbjct: 342 ILPQLLNFVG-WLNKSTLESCSGQRQTAEECLNSYDEAFYKQDNADQSWRAWPWQYCNEW 400
Query: 374 IMLTGGDNKDSIFEESEE---DYDARARYCKEAYGVDPRPNW-ITTEFGGHKIGLVLKRF 429
L G D + CK+A+G+ N + ++G I
Sbjct: 401 GYLQTGSGAPKNIRPVISRLIDLPYTSNICKQAFGITKPSNVDLVNKYGAFDIEY----- 455
Query: 430 ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK------EGAHHVDL------RFSTKED 477
+ F +G DPW GV ++ K + HH D + +
Sbjct: 456 -DRLAFIDGGSDPWKEAGVHATAARKRPTSTNKPFILIPDAVHHWDENGLYPNETTAELP 514
Query: 478 PQWLKDVRRREVEIIGKWISQY 499
PQ +K+V+ E + +W+ ++
Sbjct: 515 PQRIKEVQAEEARFVKEWMKEW 536
>gi|159122200|gb|EDP47322.1| serine peptidase, family S28, putative [Aspergillus fumigatus
A1163]
Length = 560
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 114/464 (24%), Positives = 187/464 (40%), Gaps = 75/464 (16%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
PE S+I+ DS G HT++L ++ S T++ R+ +N+ + + +PI
Sbjct: 51 PETFRSMIA---DSVGSISAAEHTEML--IDHEDPSVGTYRNRFWVNEDFY---ISGSPI 102
Query: 96 FVYTGNEGDIEW----FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
VY E E+ ++ ++ + +F A+ + EHRYYG S+PY ++++ ++
Sbjct: 103 MVYDIGEATAEYSVSLLTNSSSWLSLLLQEFNAMGIVWEHRYYGGSLPYPVSQDMPVEHL 162
Query: 152 STTGYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYPH 205
YL++ QALAD + + +LT +P ++ GGSY G+ AA R KYP
Sbjct: 163 K---YLTTEQALADIPYFAANFSRLNHPDFDLTPRGTPWIMIGGSYPGIRAAITRNKYPD 219
Query: 206 VAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS----ENCYKVIKGSWKQIEETAKK 261
A ASSAP+ N+ Y Q +R++ NC K I+ + K I+
Sbjct: 220 TIFAAYASSAPVQAQLNMSVYYE------QIYRAIVGNGYSNCTKDIQAALKYIDGQLSN 273
Query: 262 PGGLEKLQKAFRICKSEKNLAIESWLSTAFV-----------YTAMTDYPTPSNFLNPLP 310
+++ F +EKN + +TA V ++ P N+L P
Sbjct: 274 KRTFALIKRLFLGPDAEKNSNED--FTTALVTLYGPFQAHGLWSGNQSLPDFCNYLELDP 331
Query: 311 AFPVKEMCKAIDDPKTGNDVFA--------------KLYGAASVYYNYSGTAKCFDLNGD 356
A + + P G+ A ++YG N S C D +
Sbjct: 332 ATNQSAGPEGL-SPIHGSKYVAERWASFPYFISLVNRMYGTNCNGLNASEPLSC-DFSQT 389
Query: 357 SDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-------VDPR 409
+ L W WQ CTE + S + + + C + + PR
Sbjct: 390 NTIPELISWTWQYCTEWGFFQSNNFGSHALLSSYQTLEYQQELCYRQFPNAVQAGVLPPR 449
Query: 410 PNW--ITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN 451
P + +FGG I SN+ F G DPW VL +
Sbjct: 450 PQTESLNEKFGGWTIR------PSNVYFSGGQFDPWRTFSVLSD 487
>gi|157167874|ref|XP_001656137.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108871003|gb|EAT35228.1| AAEL012589-PA, partial [Aedes aegypti]
Length = 459
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 114/467 (24%), Positives = 188/467 (40%), Gaps = 77/467 (16%)
Query: 68 NPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLV 127
N S F RY+ N + + PIF++ G ++E G D+A + A +V
Sbjct: 32 NAHSVDMFPMRYVSNSKFY---RPGGPIFLFVGGPWELEQHFVEQGHFVDLAEENNAFVV 88
Query: 128 FIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK-KNLTATDSPVVV 186
E RYYG+S+P + + L QA D A LI+ ++ + L ++ V+V
Sbjct: 89 ANEMRYYGESLP------VPNASRGNLRLLHIVQACTDIARLIVHIRYEVLRDPNARVIV 142
Query: 187 FGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN-CY 245
G + G LA W RL+YPH+ G AS A + +N F+ + + R N CY
Sbjct: 143 AGVGFSGSLAHWTRLRYPHLIHGVWASGAMLQANENY---REFAEEVGEYIRRYGGNDCY 199
Query: 246 KVIKGSWK--QIEETAKKPGGLEKLQKAFRIC---KSEKNLAIESWLSTAFVYTAMTDYP 300
+ W+ + E G + + F++C L +E AF Y +
Sbjct: 200 GAL---WRGFRTAENLIDAGQSQTVDTLFKVCTPINGTNPLDVE-----AFFYGIFNEVV 251
Query: 301 TPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDS--- 357
T N L P ++ MC D T D + L G AS A+C ++ +S
Sbjct: 252 T--NTLRPNLRQNIRNMC----DTLTHEDHDSSLTGLASWITGQFPEAECLAMDLESIVQ 305
Query: 358 ---------DPH--GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGV 406
D H G +W +Q CTE+ D+ + F + + C+ +
Sbjct: 306 LFQETDWQHDVHKSGERQWFYQRCTELGWPLTADSMNQPF-GVRISANLFQQLCQRVFD- 363
Query: 407 DPRPNWITTE------------FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK 454
W+T++ +GG++ + RF + + +G DPW GV +
Sbjct: 364 ----GWLTSDVFRSLVRQTNTLYGGNRPEM---RF---VFYTHGSLDPWRFTGVTTVLYN 413
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEI---IGKWISQ 498
V + H DL ++ D W D+RR + E+ I +W+ +
Sbjct: 414 NNYVNVIRGAIHGEDL--ASISDLDW-ADLRRSKEEVGETIRRWLER 457
>gi|70982075|ref|XP_746566.1| serine peptidase, family S28 [Aspergillus fumigatus Af293]
gi|66844189|gb|EAL84528.1| serine peptidase, family S28, putative [Aspergillus fumigatus
Af293]
Length = 560
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 114/464 (24%), Positives = 187/464 (40%), Gaps = 75/464 (16%)
Query: 36 PEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
PE S+I+ DS G HT++L ++ S T++ R+ +N+ + + +PI
Sbjct: 51 PETFRSMIA---DSVGSISAAEHTEML--IDHEDPSVGTYRNRFWVNEDFY---ISGSPI 102
Query: 96 FVYTGNEGDIEW----FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
VY E E+ ++ ++ + +F A+ + EHRYYG S+PY ++++ ++
Sbjct: 103 MVYDIGEATAEYSVSLLTNSSSWLSLLLQEFNAMGIVWEHRYYGGSLPYPVSQDMPVEHL 162
Query: 152 STTGYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYPH 205
YL++ QALAD + + +LT +P ++ GGSY G+ AA R KYP
Sbjct: 163 K---YLTTEQALADIPYFAANFSRLNHPDFDLTPRGTPWIMIGGSYPGIRAAITRNKYPD 219
Query: 206 VAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS----ENCYKVIKGSWKQIEETAKK 261
A ASSAP+ N+ Y Q +R++ NC K I+ + K I+
Sbjct: 220 TIFAAYASSAPVQAQLNMSVYYE------QIYRAIVGNGYSNCTKDIQAALKYIDGQLSN 273
Query: 262 PGGLEKLQKAFRICKSEKNLAIESWLSTAFV-----------YTAMTDYPTPSNFLNPLP 310
+++ F +EKN + +TA V ++ P N+L P
Sbjct: 274 KRTFALIKRLFLGPDAEKNSNED--FTTALVTLYGPFQAHGLWSGNQSLPDFCNYLELDP 331
Query: 311 AFPVKEMCKAIDDPKTGNDVFA--------------KLYGAASVYYNYSGTAKCFDLNGD 356
A + + P G+ A ++YG N S C D +
Sbjct: 332 ATNQSAGPEGL-SPIHGSKYVAERWASFPYFISLVNRMYGTNCNGLNASEPLSC-DFSQT 389
Query: 357 SDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG-------VDPR 409
+ L W WQ CTE + S + + + C + + PR
Sbjct: 390 NTIPELISWTWQYCTEWGFFQSNNFGSHALLSSYQTLEYQQELCYRQFPNAVQAGVLPPR 449
Query: 410 PNW--ITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN 451
P + +FGG I SN+ F G DPW VL +
Sbjct: 450 PQTESLNEKFGGWTIR------PSNVYFSGGQFDPWRTFSVLSD 487
>gi|123446346|ref|XP_001311925.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121893752|gb|EAX98995.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 436
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 103/448 (22%), Positives = 185/448 (41%), Gaps = 42/448 (9%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
+Q +DH +PQ TF+QRY + + + +N ++ G E D +M +
Sbjct: 22 SQNIDH--SDPQK-GTFKQRY---EALFDYTTDNKTAILFIGGESDTFRPRAFNDYMATL 75
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
+F A +EHRY+G+S P +++Y N YL+ A+ D + + + +
Sbjct: 76 CKEFNAAFFMLEHRYFGESFP----TDLSYPNIK---YLTVDNAIDDLYNFKVKMVEQYK 128
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
TDS ++ GGSY G+L+A+ R KYP ++ASS ++ +N + + Q
Sbjct: 129 MTDSKWILVGGSYPGLLSAYTRAKYPKEFHASIASSGVVIASNN------YEDFDRQIAI 182
Query: 239 SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESW-LSTAFVYTAMT 297
S+ ++C V + ++ +E + + L F + EK E++ L +++
Sbjct: 183 SLGQSCASVAREIRRRTDELLETDP--DWLLATFNMTGLEK----ENFPLVLGEIFSLGA 236
Query: 298 DYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDS 357
Y PL + + K ++F Y + YS + L+ S
Sbjct: 237 QYGRRQQLCGPLEDTLITGADPVMAIAKYTREIFTPNYADDDIIGTYSNSR----LSVTS 292
Query: 358 DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR-PNWITTE 416
P+G W W C E+ + ++ + D CK + + + P+ T+
Sbjct: 293 TPNGPRAWLWMTCNELAYWQVNSGRLTL-RSKKVTQDFFLNQCKTVFSDEMKTPD---TD 348
Query: 417 FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT------VLALVEKEGAHHVDL 470
K G +LK+ S I + G +DPW+ S V +V +E H DL
Sbjct: 349 AWNQKWGDLLKK-TSRIYYLTGSQDPWTPVCYTAEDSDKIGPNCYVHTIVGQEIGHCRDL 407
Query: 471 RFSTKEDPQWLKDVRRREVEIIGKWISQ 498
DP L R +I +W+++
Sbjct: 408 SSPQPSDPTDLTRTREHVKAVIHRWLAE 435
>gi|345568628|gb|EGX51521.1| hypothetical protein AOL_s00054g220 [Arthrobotrys oligospora ATCC
24927]
Length = 547
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 137/570 (24%), Positives = 236/570 (41%), Gaps = 102/570 (17%)
Query: 1 MATRFIFLSFCLLFSSTLTISN---AKIFPTF--PSSRITPEKL-------SSLISSSKD 48
M F+ + +L S +L + + P+F P R E++ +L S+S
Sbjct: 1 MRRGFVATAVAVLASVSLVLGEEGFPRYQPSFQPPGLRKRYEEILKVELEKRALDSASNP 60
Query: 49 SQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE----GD 104
+ Y + Q +DHF+ P + TFQQRY I+ TH+ K PIFV G E G
Sbjct: 61 NAIPYTELWFPQKVDHFD--PSNNNTFQQRYWIS-THF--YKPGGPIFVLDGGETSGAGR 115
Query: 105 IEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALA 164
+E+ TG + + + +EHRYYGKS + +N +L++ Q+L
Sbjct: 116 VEYM--QTGIGRYITEYLGGIGIVLEHRYYGKSYV---TPNLTVENLK---WLNTAQSLK 167
Query: 165 DYASLIIDLKKNLTAT-------DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
D A +L K L A ++P + +GGSY G +A+ +++YP + G+LASSA
Sbjct: 168 DNAYFAENLWKELPANLSHIRPDNAPFISYGGSYAGAKSAFLQIEYPEIYYGSLASSAVT 227
Query: 218 LNFDNIVSPYSFSNIITQDFRSVSEN-CYKVIKGSWKQIEETAKKPG----------GLE 266
+N + R V + C +++ + ++I+ +K G G++
Sbjct: 228 WAGENFWE-------YNEPVRKVGDPICIAILEETSRRIDTYSKVGGPIWTDFKGLFGVK 280
Query: 267 KLQKA------------FRICKSEKN-LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFP 313
+L+ +++ +KN + W T D P FP
Sbjct: 281 RLEDRDAANFIFGILGIWQLGNWDKNQRGAKDWDRFCANITEGVDEDYSK------PGFP 334
Query: 314 VKEMCKAID-DPKTGNDVFAKL----YGAASVYYNYSGTAKCFDLNGDSDPHGLSE---- 364
K + D D + G FA+ G++ + +DL G + LSE
Sbjct: 335 EKVLRSRPDFDLEEGIHNFARWTRASLGSSCGDEDLVKCVGTWDLGG-YEGESLSESWKL 393
Query: 365 WGWQACTEM-IMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTE---FGGH 420
W WQ CTE ++ G D R C+ ++ + P T++ +GG
Sbjct: 394 WTWQVCTEWGYLIPGAPTGFPTLVPRIVDLKYLRRICQNSFKLPPDFVIDTSKVLRYGGF 453
Query: 421 KIGLVLKRFASNIIFFNGLRDPW------SGGGVLKNISKTVLALVEKEGAHHVDLR--F 472
+ A +++ +G DPW S K+ S + L++ E HH D
Sbjct: 454 NLT------APRLMYIDGTHDPWLYATPHSINSPQKDRSDRLSILIQ-EAWHHNDENGLG 506
Query: 473 STKEDPQWLKDVRRREVEIIGKWISQYFQD 502
++P ++DV RE+E++ WI ++ ++
Sbjct: 507 DISKEPLRIQDVHFREIEVVRDWILEFHKN 536
>gi|302664611|ref|XP_003023934.1| extracellular serine carboxypeptidase, putative [Trichophyton
verrucosum HKI 0517]
gi|291187955|gb|EFE43316.1| extracellular serine carboxypeptidase, putative [Trichophyton
verrucosum HKI 0517]
Length = 536
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 131/558 (23%), Positives = 214/558 (38%), Gaps = 91/558 (16%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPE--KLSSLISSSKDSQGLYKTKYHTQIL 62
+ L+ CLL L + P P S++ E + +L++ S+D + +
Sbjct: 1 MVKLTACLLL---LVAAVQAKLPVTPISQLRAESHRNKALVARSQDVNAAFPAHTIQIPI 57
Query: 63 DHF----NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE----GDIEWFAQNTGF 114
DHF Y P + + F RY + +H+ K P+ + G E G I + + G
Sbjct: 58 DHFPKSSRYEPHTTEKFNLRYWFDASHY---KEGGPVIILHGGETSGEGRIPFLQK--GI 112
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+ +A + V +EHRYYG S+P + N S +L++ QALAD A ++K
Sbjct: 113 LAQLAQATNGIGVIMEHRYYGGSLP-----TPDFSNKSLR-FLTTEQALADTAYFSKNIK 166
Query: 175 ------KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVS 225
NLTA + +++GGSY G A+ R +YP + GA++SS I ++
Sbjct: 167 FPGLEKYNLTAPGTAHILYGGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDYWQYFE 226
Query: 226 PYSFSNI-----ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA-FRICKSEK 279
P +TQ+F + +N +I G K GL +L+ A F S
Sbjct: 227 PIRQEAPQDCVHVTQNFVDIVDNI--IIHGKNANTTRELKNLFGLGRLRDADFANALSS- 283
Query: 280 NLAIESWLST----AFVYTAMTDYP---TPSNFLNPLPA---FPVKEMCKAIDDPKTGND 329
I W ST A + Y T +L P+ A K + +A + +
Sbjct: 284 --GITGWQSTNWDPAISGKSFYQYCGEITSDRYLYPVTAQQKASAKRIIEAGGHGREAPE 341
Query: 330 VFAKLYG-----AASVYYNYSGTAKCFD--LNG--------DSDPHGLSEWGWQACTEMI 374
+ +L S + SG + + LN D+ W WQ C E
Sbjct: 342 ILPQLLNFVGWLNKSTLESCSGQGQTAEECLNSYDEAFYKQDNADQSWRAWPWQYCNEWG 401
Query: 375 MLTGGDNKDSIFEESEE---DYDARARYCKEAYGVDPRPNW-ITTEFGGHKIGLVLKRFA 430
L G D + CK+A+G+ N + ++G I
Sbjct: 402 YLQTGSGAPKNIRPVISRLIDLPYTSNICKQAFGITKPSNVDLVNKYGAFDIEY------ 455
Query: 431 SNIIFFNGLRDPWSGGGVLKNISKTVLALVEK------EGAHHVDL------RFSTKEDP 478
+ F +G DPW GV ++ K + HH D + + P
Sbjct: 456 DRLAFVDGGSDPWKEAGVHATAARKRGTSTNKPFILIPDAVHHWDENGLYPNETTAELPP 515
Query: 479 QWLKDVRRREVEIIGKWI 496
Q +K+V+ E + +W+
Sbjct: 516 QRIKEVQAEEARFVKEWM 533
>gi|302689763|ref|XP_003034561.1| hypothetical protein SCHCODRAFT_15001 [Schizophyllum commune H4-8]
gi|300108256|gb|EFI99658.1| hypothetical protein SCHCODRAFT_15001 [Schizophyllum commune H4-8]
Length = 555
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 124/249 (49%), Gaps = 23/249 (9%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQ 110
+ ++ Q LDHFN ++ TF QRY +N H+ NAP+ V G E G+
Sbjct: 74 FPAQWFEQPLDHFNN--ETGDTFGQRYWVNKRHYV-PGTNAPVIVLDGGETSGEDRLPFL 130
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
+TG + + + V +EHRYYG++ P + + +L++ Q+ AD A+ +
Sbjct: 131 DTGIVEILTRATGGVGVVLEHRYYGETKP------VQNLTTDSLRFLTNEQSAADSANFM 184
Query: 171 IDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
++K ++LTA +P + +GGSY G AA R+ YP + GA+ASSA + + D ++
Sbjct: 185 ANVKFDGIDEDLTAPGTPWIYYGGSYAGARAAHMRVLYPDLVFGAIASSA-VTHAD--IT 241
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA-FRICKSEKNLAIE 284
+ + +I RS E C + ++ S + I+ + G + KA F + E +
Sbjct: 242 NWQYMEVIR---RSAPEACARHLENSIQTIDAVLARGGFPSRALKALFGLSGLEHDDDFA 298
Query: 285 SWLSTAFVY 293
S LS+ Y
Sbjct: 299 SVLSSPLGY 307
>gi|268579387|ref|XP_002644676.1| Hypothetical protein CBG14660 [Caenorhabditis briggsae]
Length = 512
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 87/409 (21%), Positives = 171/409 (41%), Gaps = 39/409 (9%)
Query: 110 QNTGFMY-DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYAS 168
QN + Y A +F A + +EHR +G+S PY K+++Y N + +QA+AD +
Sbjct: 110 QNPNYQYLKWAKEFGADVFQLEHRCFGQSRPY---KDLSYPNIKVC---TMSQAIADIHN 163
Query: 169 LIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS 228
I + + + FGGSY G L+A FR ++P +GA+ASSAP+ D + +
Sbjct: 164 FIGQMNIQYNFRNPKWITFGGSYPGTLSALFRQQHPEDTVGAVASSAPL---DWTLDFFE 220
Query: 229 FSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK-----SEKNLAI 283
++ ++ S +C++ +K ++ ++++ + G+++L F + I
Sbjct: 221 YAMVVEDVLNQTSTDCWQNVKDAFYKMQQLSLTKQGIQQLNAYFDLSPPFVDGQYTQHDI 280
Query: 284 ESWLSTAF-VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAA---- 338
+++ + + + + Y L V+++C ++D K DV ++
Sbjct: 281 DNFFANLYGFFQGVVQYTYDGRNAATLNGLNVQQLCNKMND-KNVPDVINRVNNTVNWIN 339
Query: 339 -----------SVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFE 387
+ Y + + + W W C E+ L D +IF+
Sbjct: 340 QMNGDTVGPFQNSYKDMMAVLANVSYDNSGELAANRGWMWLCCNELGALQTTDQGRNIFQ 399
Query: 388 ESEEDYDARARYCKEAYGVDPRPNWI--TTEFGGHKIGLVLKRFASNIIFFNGLRDPWSG 445
++ C + +G D +I + +K G A+N++ NG DPW
Sbjct: 400 QTVP-MGYFIDMCTDMFGPDVGIKFIRDNNKQTLYKYGGADNYQATNVVLPNGAFDPWHV 458
Query: 446 GGVLKNISKTVLALVEKEGAHHVDLRFST----KEDPQWLKDVRRREVE 490
G N + + + +GA H + T D Q +D+ +E++
Sbjct: 459 LGTYNNNTANHMTPLLIQGAAHCSDMYPTYAGEPADLQRNRDIIHKELK 507
>gi|195391886|ref|XP_002054590.1| GJ24539 [Drosophila virilis]
gi|194152676|gb|EDW68110.1| GJ24539 [Drosophila virilis]
Length = 489
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 106/413 (25%), Positives = 166/413 (40%), Gaps = 44/413 (10%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
TQ LDHF+ + +T+Q RY N + + PI+++ G E I +TG +D+
Sbjct: 58 TQRLDHFD--GLNNKTWQMRYFRNAKY---HRPQGPIYIFLGGEWTITPGLLSTGLTHDM 112
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
A + +L + EHRYYG+S PY N ++ ++ +LS QALAD A I K + +
Sbjct: 113 AVENAGILFYTEHRYYGQSWPY-ENDSLSLEHLK---HLSLHQALADLAHFIRYQKSHSS 168
Query: 179 -ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF 237
T S V++ GGSY G +AAW YP + + ASSAP+L + Y + ++
Sbjct: 169 NLTHSKVILVGGSYSGSMAAWMTHLYPELVTASWASSAPLLAKADF---YEYMQLVENSL 225
Query: 238 R-SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS---EKNLAIESWLSTAFVY 293
S NC + I+ + + E L + C S +L ++ S Y
Sbjct: 226 NLSYGNNCTQRIERGLNHLVKLFNTNETTELL-TSLNACPSYDANNSLDRMTFFSGIGNY 284
Query: 294 TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLY--GAASVYYN------YS 345
A+ ++ +PA M D + LY G YN Y
Sbjct: 285 FAL----VVQSYSAYIPALCETLMSLHCSDEEALQKFLQLLYLDGKRLGEYNNCLDFSYQ 340
Query: 346 GTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARY----CK 401
+ F D G W +Q C + T ++ S + Y C
Sbjct: 341 SMLQLFSEYSDRSS-GTRAWFYQTCNQFGWYTTTSRINTTSTSSTFGREVPLWYFEQLCH 399
Query: 402 EAYGVDPRP-------NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
+A+G P + +FGG L+ ++F +G DPW G
Sbjct: 400 DAFGSRQTPATLALGIAQMNAQFGGLSFDQSLR--YREVVFTHGELDPWRALG 450
>gi|326475086|gb|EGD99095.1| hypothetical protein TESG_06450 [Trichophyton tonsurans CBS 112818]
Length = 565
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 129/559 (23%), Positives = 214/559 (38%), Gaps = 93/559 (16%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPE--KLSSLISSSKDSQGLYKTKYHTQIL 62
+ L+ CLL L + P P S++ E + +L++ S+D + +
Sbjct: 1 MVKLTTCLLL---LVAAVQAKLPVTPISQLRAESHRNKALVARSQDVNAAFPAHTIQIPI 57
Query: 63 DHF----NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE----GDIEWFAQNTGF 114
DHF Y P + F RY + +H+ K P+ + G E G I + + G
Sbjct: 58 DHFPKSSRYEPHTTAKFNLRYWFDASHY---KEGGPVIILHGGETSGEGRIPFLQK--GI 112
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+ +A + V +EHRYYG S+P ++ + K+ +L++ QALAD A ++K
Sbjct: 113 LAQLAQATNGIGVIMEHRYYGGSLP---TRDFSNKSLR---FLTTEQALADTAYFSKNIK 166
Query: 175 ------KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVS 225
NLTA + +++GGSY G A+ R +YP + GA++SS I ++
Sbjct: 167 FPGLEKYNLTAPGTAHILYGGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDYWQYFE 226
Query: 226 PYSFSNI-----ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA-FRICKSEK 279
P +TQ+F + +N +I G K GL +L+ A F S
Sbjct: 227 PIRQEAPQDCVHVTQNFVDIVDNI--IIHGKNANTTRELKNLFGLGRLRDADFANALSS- 283
Query: 280 NLAIESWLST----AFVYTAMTDYP---TPSNFLNPLPA---FPVKEMCKAIDDPKTGND 329
I W ST A + Y T +L P+ A K + +A + +
Sbjct: 284 --GITGWQSTNWDPAISGKSFYQYCGEITSDRYLYPVTAEQKASAKRIIEAGGHGREAPE 341
Query: 330 VFAKLYGAASVYYNYSGTAKCFD--------LNG--------DSDPHGLSEWGWQACTEM 373
+ +L + N S C LN D+ W WQ C E
Sbjct: 342 ILPQLLNFVG-WLNKSTLESCSGQRQTAEECLNSYDEAFYKQDNADQSWRAWPWQYCNEW 400
Query: 374 IMLTGGDNKDSIFEESEE---DYDARARYCKEAYGVDPRPNW-ITTEFGGHKIGLVLKRF 429
L G+ D + CK+A+G+ N + ++G I
Sbjct: 401 GYLQTGNGAPKNIRPVISRLIDLPYTSNICKQAFGITKPSNVDLVNKYGAFDIEY----- 455
Query: 430 ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK------EGAHHVDL------RFSTKED 477
+ F +G DPW GV ++ K + HH D + +
Sbjct: 456 -DRLAFIDGGSDPWKEAGVHATAARKRPTSTNKPFILIPDAVHHWDENGLYPNETTAELP 514
Query: 478 PQWLKDVRRREVEIIGKWI 496
PQ +++V+ E + +W+
Sbjct: 515 PQRIREVQAEEARFVKEWM 533
>gi|227202540|dbj|BAH56743.1| AT5G22860 [Arabidopsis thaliana]
Length = 171
Score = 91.3 bits (225), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 41/87 (47%), Positives = 53/87 (60%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
S+ K + K Y Q LDHF + P+SY TFQQRY I+ THWGG+K NAPI + G E
Sbjct: 45 STQKVDESNLKMYYFNQTLDHFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEES 104
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIE 130
++ GF+ D P+ ALLV+IE
Sbjct: 105 SLDSDLAAIGFLRDNGPRLNALLVYIE 131
>gi|302500786|ref|XP_003012386.1| extracelular serine carboxypeptidase, putative [Arthroderma
benhamiae CBS 112371]
gi|291175944|gb|EFE31746.1| extracelular serine carboxypeptidase, putative [Arthroderma
benhamiae CBS 112371]
Length = 716
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 131/558 (23%), Positives = 214/558 (38%), Gaps = 91/558 (16%)
Query: 5 FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPE--KLSSLISSSKDSQGLYKTKYHTQIL 62
+ L+ CLL L + P P S++ E + +L++ S+D + +
Sbjct: 1 MVKLTACLLL---LVAAVQAKLPVTPISQLRAESHRNKALVARSQDVNAAFPAHTIQIPI 57
Query: 63 DHF----NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE----GDIEWFAQNTGF 114
DHF Y P + + F RY + +H+ K P+ + G E G I + + G
Sbjct: 58 DHFPKSSRYEPHTTEKFNLRYWFDASHY---KEGGPVIILHGGETSGEGRIPFLQK--GI 112
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+ +A + V +EHRYYG S+P + N S +L++ QALAD A ++K
Sbjct: 113 LAQLAQATNGIGVIMEHRYYGGSLPTPD-----FSNKSLR-FLTTEQALADTAYFSKNIK 166
Query: 175 ------KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVS 225
NLTA + +++GGSY G A+ R +YP + GA++SS I ++
Sbjct: 167 FPGLEKYNLTAPGTAHILYGGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDYWQYFE 226
Query: 226 PYSFSNI-----ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA-FRICKSEK 279
P +TQ+F + +N +I G K GL +L+ A F S
Sbjct: 227 PIRQEAPQDCVHVTQNFVDIVDNI--IINGKNANTTRELKNLFGLGRLRDADFANALSS- 283
Query: 280 NLAIESWLST----AFVYTAMTDYP---TPSNFLNPLPA---FPVKEMCKAIDDPKTGND 329
I W ST A + Y T +L P+ A K + +A + +
Sbjct: 284 --GITGWQSTNWDPAISGKSFYQYCGEITSDRYLYPVTAQQKASAKRIIEAGGHGREAPE 341
Query: 330 VFAKLYG-----AASVYYNYSGTAKCFD--LNG--------DSDPHGLSEWGWQACTEMI 374
+ +L S + SG + + LN D+ W WQ C E
Sbjct: 342 ILPQLLNFVGWLNKSTLESCSGQGQTAEECLNSYDEAFYKQDNADQSWRAWPWQYCNEWG 401
Query: 375 MLTGGDNKDSIFEESEE---DYDARARYCKEAYGVDPRPNW-ITTEFGGHKIGLVLKRFA 430
L G D + CK+A+G+ N + ++G I
Sbjct: 402 YLQTGSGAPKNIRPVISRLIDLPYTSNICKQAFGITKPSNVDLVNKYGAFDIEY------ 455
Query: 431 SNIIFFNGLRDPWSGGGVLKNISKTVLALVEK------EGAHHVDL------RFSTKEDP 478
+ F +G DPW GV ++ K + HH D + + P
Sbjct: 456 DRLAFVDGGSDPWKEAGVHATAARKRGTSTNKPFILIPDAVHHWDENGLYPNETTAELPP 515
Query: 479 QWLKDVRRREVEIIGKWI 496
Q +K+V+ E + +W+
Sbjct: 516 QRIKEVQAEEARFVKEWM 533
>gi|119189871|ref|XP_001245542.1| hypothetical protein CIMG_04983 [Coccidioides immitis RS]
Length = 533
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 125/529 (23%), Positives = 209/529 (39%), Gaps = 105/529 (19%)
Query: 45 SSKDSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTG 100
+ D LY + + +DHF+ Y P S TF+ RY + +H+ K+ P+ V G
Sbjct: 31 AGDDLTSLYPSHTISIPIDHFHTDDRYAPHSNGTFELRYWFDASHY---KDGGPVIVLHG 87
Query: 101 NEGDIEW---FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYL 157
E D E F Q G + +A + V +EHRYYG SIP ++ + KN +L
Sbjct: 88 GETDGEGRLPFLQK-GILGQLAQATNGVGVVLEHRYYGTSIP---TEDFSTKNLR---FL 140
Query: 158 SSTQALADYASLIIDL------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGAL 211
++ QA+AD A ++ K+LTA ++P +++GGSY G A+ R++YP + GA+
Sbjct: 141 TTEQAMADSAYFAKNVVFEGLEDKDLTAPNTPYILYGGSYAGAQVAFLRVEYPDIFWGAI 200
Query: 212 ASSA---PILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETA---KKPGGL 265
+SS I ++ P + ++C K + ++ A K P
Sbjct: 201 SSSGVTKAIWHYWQYYEPTR---------KHAPQHCVKQTQTFVDLVDNIALRGKDPKVT 251
Query: 266 EKLQKAFRICKSEKN--------LAIESWLSTAF-------VYTAMTDYPTPSNFLNPLP 310
++L++ F + + N I W T + + D T L P+
Sbjct: 252 QQLKEFFGLGELTHNDDFANVLAFGISGWQGTNWDPDISRPTFAKYCDTITTDRLLQPV- 310
Query: 311 AFPVKEMC----KAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLN-------GDSDP 359
+ KE+ KA + G + +L A+ + N + A C G DP
Sbjct: 311 SEEQKELATSLIKAGKYGRHGKKLKNRLLNYAA-WVNTTYAAPCLRRGSTLDTCFGTHDP 369
Query: 360 HGLSE---------WGWQACTEMIMLTGGDNKDSIFE---ESEEDYDARARYCKEAYGVD 407
W +Q CTE L G + D + C+EAYG+
Sbjct: 370 EFYKRDDTTQEWRLWYYQVCTEWGFLQTGSGVPKFIKPLVSRMIDLEYTTIMCREAYGL- 428
Query: 408 PRPNWITTEFGGHKIGLVLKRFASNIIF-----FNGLRDPWSGGGVLKNISKTVLALVEK 462
G + V K A +I + +G DPW ++ + V+K
Sbjct: 429 ---------HGEADVSRVNKWGAFDIEYPRLAIVDGEADPWVEATPHATFARPRRSTVDK 479
Query: 463 ------EGAHHVDLR--FSTKEDPQW----LKDVRRREVEIIGKWISQY 499
+ HH D F + P + +K V+ +E+ + KW+ +
Sbjct: 480 PFILIPDAVHHWDENGVFPNQTTPDFPPDRIKKVQAQEIAFVKKWLKDW 528
>gi|299119157|gb|ADJ11381.1| GA15377 [Drosophila affinis]
Length = 167
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 79/154 (51%), Gaps = 10/154 (6%)
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAM 296
+ NC I SWK E G +++ AF +C+ KN A ++ + AM
Sbjct: 9 NSNCTANIGRSWKTFETLGGTEAGKKQISDAFHLCQPIKNDADLKKFLDYIEEVYANLAM 68
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN-DVFAKLYGAASVYYNYSGTAKCFDLNG 355
+YP S+FL PLPA+PV+++C + D + D+ + A +VY NY+G+AKC D+
Sbjct: 69 VNYPYNSSFLAPLPAYPVRQVCFYLKDLHQNDADLLHAMASALAVYTNYTGSAKCLDI-- 126
Query: 356 DSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFE 387
SD E GW Q C +M+M + D++F
Sbjct: 127 -SDTSNADEAGWNVQTCNQMVMPFCSNGTDTMFR 159
>gi|402220386|gb|EJU00458.1| peptidase S28 [Dacryopinax sp. DJM-731 SS1]
Length = 504
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 118/489 (24%), Positives = 200/489 (40%), Gaps = 88/489 (17%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE--WFAQNTGFMYD 117
Q LDHF+ QS Q + QRY +N ++ P+F++ EG E + +TG +
Sbjct: 43 QPLDHFSS--QSPQ-WAQRYWLNARYY---TPGGPVFLFDTGEGPGEDRFGVLDTGIVAI 96
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK--- 174
+A + + V +EHRYYG+S+P ++ + + +L++ QA AD A+ + +
Sbjct: 97 LARETGGMAVVLEHRYYGQSMP------VSNLSTDSLRFLNNAQAAADSANFMRSVHFPG 150
Query: 175 --KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNI 232
++++A + P + +GGSYGG AA R+ YP + GA+ASSA + N + ++ Y + I
Sbjct: 151 VDEDVSALNRPWIYYGGSYGGARAAHMRVLYPELVWGAIASSA-VTNAE--INNYEYFEI 207
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFV 292
I D+ S +C ++ S I+ P L K+ N + LS
Sbjct: 208 IA-DY--ASPHCISALRASISLIDTHLSSP-FLAPFMKSLFSLSPLSNSDFAAILSAPLG 263
Query: 293 YTAMTDYPT-----PSNFLNPLPAFPVKEMCKAIDDPKTGNDVF-AKLYGAASVYYNY-- 344
++ P + +E CK + GN + + A Y Y
Sbjct: 264 AWQGKNWDEREAHRPGEW---------EEFCKCLTR-GVGNGMGKVAAWMATRNYAQYIK 313
Query: 345 ----------SGTAKCFDLNGDSDP---------HGLSEWGWQACTEMIMLTGGDNKDSI 385
A CF G DP W +Q CTE SI
Sbjct: 314 DTVVSECPEGMTVADCF---GSDDPTKFQNTSITESWRSWAFQVCTEWGYFFAAPPYPSI 370
Query: 386 FEESEEDYDARARYCKEAYGVDPR---PNW----ITTEFGGHKIGLVLKRFASNIIFFNG 438
+ + DA A++C+ A+ P W I E G + I A + +G
Sbjct: 371 VSQHLQ-IDANAKHCQAAFPNGEHYIIPAWPNVSIVNELGSYGIA------AERLALIDG 423
Query: 439 LRDPW------SGGGVLKNISKTVLALVEKEGAHHVDLR--FSTKEDPQWLKDVRRREVE 490
DPW S ++N + + + G HH D +++P +++ V E+
Sbjct: 424 EWDPWRPRTPHSIHAPVRNDTISEPFMQIPAGVHHYDENGLADPRKEPLYIQKVHHEEIG 483
Query: 491 IIGKWISQY 499
+ W++Q+
Sbjct: 484 FVKAWLAQW 492
>gi|237840317|ref|XP_002369456.1| serine carboxypeptidase S28, putative [Toxoplasma gondii ME49]
gi|211967120|gb|EEB02316.1| serine carboxypeptidase S28, putative [Toxoplasma gondii ME49]
Length = 738
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 76/145 (52%), Gaps = 20/145 (13%)
Query: 94 PIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
PIFVY G EG + G + ++ F A L +EHRYYG S P + + +
Sbjct: 274 PIFVYIGGEGPLSSLEVRQGLLAEMGDIFGASLYALEHRYYGDSHPRPDSSVVNLQ---- 329
Query: 154 TGYLSSTQALADYASLIIDLKK--------NLTATDSPVVVFGGSYGGMLAAWFRLKYPH 205
+L+S QAL D A+ + +K+ +L D PVVVFG SY G LAA+ R KYP
Sbjct: 330 --WLTSHQALGDLAAFVAHVKQQEAEEHPQDLAPEDVPVVVFGCSYPGSLAAYARAKYPA 387
Query: 206 VAIGALASSAPI------LNFDNIV 224
+GA++SS+P+ FD +V
Sbjct: 388 SILGAVSSSSPVEASALFQAFDRVV 412
>gi|221504075|gb|EEE29752.1| serine carboxypeptidase, putative [Toxoplasma gondii VEG]
Length = 738
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 76/145 (52%), Gaps = 20/145 (13%)
Query: 94 PIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
PIFVY G EG + G + ++ F A L +EHRYYG S P + + +
Sbjct: 274 PIFVYIGGEGPLSSLEVKQGLLAEMGDIFGASLYALEHRYYGDSHPRPDSSVVNLQ---- 329
Query: 154 TGYLSSTQALADYASLIIDLKK--------NLTATDSPVVVFGGSYGGMLAAWFRLKYPH 205
+L+S QAL D A+ + +K+ +L D PVVVFG SY G LAA+ R KYP
Sbjct: 330 --WLTSHQALGDLAAFVAHVKQQEAEEHPQDLAPEDVPVVVFGCSYPGSLAAYARAKYPA 387
Query: 206 VAIGALASSAPI------LNFDNIV 224
+GA++SS+P+ FD +V
Sbjct: 388 SILGAVSSSSPVEASALFQAFDRVV 412
>gi|221483143|gb|EEE21467.1| serine carboxypeptidase, putative [Toxoplasma gondii GT1]
Length = 738
Score = 90.1 bits (222), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 54/145 (37%), Positives = 76/145 (52%), Gaps = 20/145 (13%)
Query: 94 PIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
PIFVY G EG + G + ++ F A L +EHRYYG S P + + +
Sbjct: 274 PIFVYIGGEGPLSSLEVKQGLLAEMGDIFGASLYALEHRYYGDSHPRPDSSVVNLQ---- 329
Query: 154 TGYLSSTQALADYASLIIDLKK--------NLTATDSPVVVFGGSYGGMLAAWFRLKYPH 205
+L+S QAL D A+ + +K+ +L D PVVVFG SY G LAA+ R KYP
Sbjct: 330 --WLTSHQALGDLAAFVAHVKQQEAEEHPQDLAPEDVPVVVFGCSYPGSLAAYARAKYPA 387
Query: 206 VAIGALASSAPI------LNFDNIV 224
+GA++SS+P+ FD +V
Sbjct: 388 SILGAVSSSSPVEASALFQAFDRVV 412
>gi|255938516|ref|XP_002560028.1| Pc14g00330 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584649|emb|CAP74174.1| Pc14g00330 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 557
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 110/441 (24%), Positives = 181/441 (41%), Gaps = 77/441 (17%)
Query: 54 KTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-----WF 108
+ +Y + +DH N S ++ RY +++ H+ K + PIFV+ E E +
Sbjct: 65 EAEYVSIPIDHSN---SSVGHYRNRYWVSEEHY---KEDGPIFVFDVGESTAEPAGQTYL 118
Query: 109 AQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYAS 168
+ ++ F Y + +F + + EHRYYG S+PY + ++ ++ YL++ QALAD
Sbjct: 119 SNSSTFFYQLVKEFGGIGIVWEHRYYGDSLPYNVSLDMEPEHLQ---YLNNKQALADIPY 175
Query: 169 LIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI----- 217
+ +LT +P V+ GGSY GM AA+ R YP A ASSAP+
Sbjct: 176 FAAQFTRQDYSDVDLTPAGTPWVMVGGSYAGMRAAFTRQSYPDTIYAAFASSAPVEARID 235
Query: 218 --LNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEET-AKKPGGLEKLQKAFRI 274
+ FD + + ++T NC + IK + + I+E +K +++ F
Sbjct: 236 MSVYFDQV-----YDGMVTYGHL----NCTRDIKAALEYIDEQLSKSESAAAAIKREFFG 286
Query: 275 CKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID-DPKT------- 326
+EKN + + +Y Y + L +F C+ ++ DPKT
Sbjct: 287 EGAEKNSNGDFTAALVTIYNFFQSYGMGGG-VGSLESF-----CEHMETDPKTSAAAPSE 340
Query: 327 ------GNDVFAKLYGAASVY---YNYSGTAKCFDLNG-----------DSDPHGLSEWG 366
G A+ Y + V+ N + C L ++P +S W
Sbjct: 341 GFAPTRGKKYVAERYASWPVFTQVVNMNMKTNCKKLETSEPLECDLGPPSANPDTIS-WT 399
Query: 367 WQACTEMIMLT----GGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKI 422
WQ CTE G + S ++ E + R+ EA P TE +
Sbjct: 400 WQYCTEWGYFQTNNFGAHSLLSKYQTLEYAQEYCNRFFPEAVAKGLFPKHPQTEATNAET 459
Query: 423 GLVLKRFASNIIFFNGLRDPW 443
G R SN+ + G DPW
Sbjct: 460 GGWSIR-PSNVYWSGGQFDPW 479
>gi|426196115|gb|EKV46044.1| hypothetical protein AGABI2DRAFT_193941 [Agaricus bisporus var.
bisporus H97]
Length = 552
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 122/511 (23%), Positives = 205/511 (40%), Gaps = 93/511 (18%)
Query: 53 YKTKYHTQILDHFNY-NPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFA 109
++ ++ Q LDHF+ NP TF+QRY ++ H+ ++ AP+FV G E G
Sbjct: 64 FEPQWFEQPLDHFDESNPH---TFKQRYWVSKRHYK-ARQGAPVFVLDGGETSGANRLPF 119
Query: 110 QNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG--YLSSTQALADYA 167
+TG + +A + L V +EHRYYG+SI A +N +T +L++ QA AD A
Sbjct: 120 LDTGIVDILARATEGLGVILEHRYYGESI--------AVENLTTDALRWLNNEQAAADSA 171
Query: 168 SLIIDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDN 222
+ + +K ++LTA +P + +GGSY G +A ++ YP + GA+ASSA
Sbjct: 172 NFMAKVKFNGIEEDLTAPSTPWIYYGGSYAGARSAHMKILYPDLVYGAIASSAVT---HA 228
Query: 223 IVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEK--LQKAFRICKSEKN 280
V + + II ++ C ++ S K I+ K L+ F + +
Sbjct: 229 AVENWQYMEIIR---KAADPKCSHHLENSIKIIDSILLGHSKFFKSHLKGLFGLANLTHD 285
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAK------- 333
S +S+ ++ + + + C+A++ P G+ + AK
Sbjct: 286 DDFASTISSPLGSWQSKNWDPEESSVR------FDQFCEALNAPVHGHGISAKATDLPFG 339
Query: 334 ------------------LYGAASVYYNYSGTAK-----CFDLNGDS--DPHGLSE---- 364
L A + N + + CF DS GL E
Sbjct: 340 DPARMVEVADGLTLDLSILNYGAYIKNNTASRCRTTLDDCFGTYDDSKYQQTGLDEDWRL 399
Query: 365 WGWQACTE--MIMLTGGDNKDSIFEESEEDYDARARYCKEAY------GVDPRPNWITTE 416
W +Q CT+ T D K ++ C++AY V P PN
Sbjct: 400 WMFQVCTQWGYFTTTPPDPKQPRILSRLITLPHESKICQQAYLPGDFFQVPPLPNVTAVN 459
Query: 417 FGGHKIGLVLKRFASNIIFFNGLRDPW------SGGGVLKNISKTVLALVEKEGAHHVDL 470
G A ++F +G DPW S G ++ + + HH D
Sbjct: 460 MLGD-----FDIVADRLMFVDGEVDPWRPDTPHSDGAKPRDDTILRPFKLIPGAVHHWDE 514
Query: 471 R--FSTKEDPQWLKDVRRREVEIIGKWISQY 499
++P ++D+ EV II +W++ +
Sbjct: 515 NGLADVTQEPVVIQDIHAEEVAIIEQWLADW 545
>gi|409079210|gb|EKM79572.1| hypothetical protein AGABI1DRAFT_114101 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 552
Score = 89.7 bits (221), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 124/511 (24%), Positives = 205/511 (40%), Gaps = 93/511 (18%)
Query: 53 YKTKYHTQILDHFNY-NPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFA 109
++ ++ Q LDHF+ NP TF+QRY ++ H+ ++ AP+FV G E G
Sbjct: 64 FEPQWFEQPLDHFDESNPH---TFKQRYWVSKRHYK-ARQGAPVFVLDGGETSGANRLPF 119
Query: 110 QNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG--YLSSTQALADYA 167
+TG + +A + L V +EHRYYG+SI A +N +T +L++ QA AD A
Sbjct: 120 LDTGIVDILARATEGLGVILEHRYYGESI--------AVENLTTDALRWLNNEQAAADSA 171
Query: 168 SLIIDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDN 222
+ + +K ++LTA +P + +GGSY G +A ++ YP + GA+ASSA
Sbjct: 172 NFMAKVKFEGIEEDLTAPSTPWIYYGGSYAGARSAHMKILYPDLVYGAIASSAVT---HA 228
Query: 223 IVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEK--LQKAFRICKSEKN 280
V + + II ++ C ++ S K I+ K L+ F + +
Sbjct: 229 AVENWQYMEIIR---KAADPKCSHHLENSIKIIDSILLGHSKFFKSHLKGLFGLANLTHD 285
Query: 281 LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKL------ 334
S +S+ ++ + + + C+A++ P G+ + AK
Sbjct: 286 DDFASTISSPLGSWQSKNWDPEESSVR------FDQFCEALNAPVHGHGISAKATDLPFG 339
Query: 335 --------------------YGA----ASVYYNYSGTAKCFDLNGDS--DPHGLSE---- 364
YGA + S CF DS GL E
Sbjct: 340 DPARMVEVADGLTLDLSILNYGAYIKNNTASRCRSTLDDCFGTYDDSKYQQTGLDEDWRL 399
Query: 365 WGWQACTE--MIMLTGGDNKDSIFEESEEDYDARARYCKEAY------GVDPRPNWITTE 416
W +Q CT+ T D K ++ C++AY V P PN
Sbjct: 400 WMFQVCTQWGYFTTTPPDPKQPRILSRLITLPHESKICQQAYLPGEFFQVPPLPNVTAVN 459
Query: 417 FGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK----TVLALVE--KEGAHHVDL 470
G A ++F +G DPW + +K T+L + HH D
Sbjct: 460 MLGD-----FDIAADRLMFVDGEVDPWRPDTPHSDDAKPRDDTILRPFKLIPGAVHHWDE 514
Query: 471 R--FSTKEDPQWLKDVRRREVEIIGKWISQY 499
++P ++D+ EV II +W++ +
Sbjct: 515 NGLADVTQEPVVIQDIHAEEVAIIEQWLADW 545
>gi|299119191|gb|ADJ11398.1| GA15377 [Drosophila pseudoobscura]
Length = 167
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 81/156 (51%), Gaps = 10/156 (6%)
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAM 296
+ NC I SWK E G +++ AF +C KN A ++ + AM
Sbjct: 9 NSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKNFLDYIEEVYGNLAM 68
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDD-PKTGNDVFAKLYGAASVYYNYSGTAKCFD--L 353
+YP S+FL PLPA+PV+++C + D ++ D+ + A +VY NY+G+ KC D +
Sbjct: 69 VNYPYNSSFLAPLPAYPVRQVCFYLKDLHQSDADLLHAMASALAVYTNYTGSVKCLDTSV 128
Query: 354 NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEES 389
N ++D G W Q C +M+M ++ DS+F S
Sbjct: 129 NSNADDSG---WNVQTCNQMVMPFCSNSSDSMFRPS 161
>gi|299119193|gb|ADJ11399.1| GA15377 [Drosophila pseudoobscura]
gi|299119195|gb|ADJ11400.1| GA15377 [Drosophila pseudoobscura]
gi|299119197|gb|ADJ11401.1| GA15377 [Drosophila pseudoobscura]
gi|299119199|gb|ADJ11402.1| GA15377 [Drosophila pseudoobscura]
gi|299119201|gb|ADJ11403.1| GA15377 [Drosophila pseudoobscura]
gi|299119203|gb|ADJ11404.1| GA15377 [Drosophila pseudoobscura]
gi|299119205|gb|ADJ11405.1| GA15377 [Drosophila pseudoobscura]
gi|299119207|gb|ADJ11406.1| GA15377 [Drosophila pseudoobscura]
gi|299119209|gb|ADJ11407.1| GA15377 [Drosophila pseudoobscura]
gi|299119211|gb|ADJ11408.1| GA15377 [Drosophila pseudoobscura]
gi|299119213|gb|ADJ11409.1| GA15377 [Drosophila pseudoobscura]
gi|299119215|gb|ADJ11410.1| GA15377 [Drosophila pseudoobscura]
gi|299119217|gb|ADJ11411.1| GA15377 [Drosophila pseudoobscura]
gi|299119219|gb|ADJ11412.1| GA15377 [Drosophila pseudoobscura]
Length = 167
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 81/156 (51%), Gaps = 10/156 (6%)
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAM 296
+ NC I SWK E G +++ AF +C KN A ++ + AM
Sbjct: 9 NSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKNFLDYIEEVYGNLAM 68
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDD-PKTGNDVFAKLYGAASVYYNYSGTAKCFD--L 353
+YP S+FL PLPA+PV+++C + D ++ D+ + A +VY NY+G+ KC D +
Sbjct: 69 VNYPYNSSFLAPLPAYPVRQVCFYLKDLHQSDADLLHAMASALAVYTNYTGSVKCLDTSV 128
Query: 354 NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEES 389
N ++D G W Q C +M+M ++ DS+F S
Sbjct: 129 NSNADDSG---WNVQTCNQMVMPFCSNSTDSMFRPS 161
>gi|17550650|ref|NP_508903.1| Protein C26B9.5 [Caenorhabditis elegans]
gi|351058287|emb|CCD65711.1| Protein C26B9.5 [Caenorhabditis elegans]
Length = 516
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 96/450 (21%), Positives = 184/450 (40%), Gaps = 55/450 (12%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD-IEWFAQNTGFMY- 116
+Q +D+F+ N + + QRY N T ++N +F+ E + + N + Y
Sbjct: 62 SQKVDNFDANNNA--MYNQRYWYNPTF---TQNKNIVFLMIQGEAPATDTWISNPNYQYL 116
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
A +F A + +EHR +G+S PY K + TQALAD + I + +
Sbjct: 117 QWAKEFGADVFQLEHRCFGQSRPYPDTSMPGIK------VCTMTQALADIHNFIQQMNRR 170
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
+ + FGGSY G L+A FR +YP +GA+ASSAP+ D + + ++ ++
Sbjct: 171 FNFQNPKWITFGGSYPGTLSALFRQQYPADTVGAVASSAPL---DWTLDFFEYAMVVEDV 227
Query: 237 FRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-----EKNLAIESWLSTAF 291
+ S +C++ + ++ +++ + G+++L F + + I+++ + +
Sbjct: 228 LKKTSVDCWRNVNQAFLNMQQLSLTKAGIQQLNTYFNLVPAFVDGQYTQHDIDNFFANVY 287
Query: 292 -VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKC 350
+ + Y L +++C ++D T DV ++ + +G
Sbjct: 288 SFFQGVVQYTYDGRNNATLNGLNAQQLCNKMND-ATVPDVITRVNNTINWINQMNGDPVG 346
Query: 351 FDLNGDSD----------------PHGLSE---WGWQACTEMIMLTGGDNKDSIFEESEE 391
N SD P ++ W W C E+ L D +IF+++
Sbjct: 347 PFQNSYSDMMTVLANASYDDNSAVPGDIAANRGWMWLCCNELGALQTTDQGRNIFQQTVP 406
Query: 392 DYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRF-------ASNIIFFNGLRDPWS 444
C + +G D I ++ L ++ A+N++ NG DPW
Sbjct: 407 -LGYFIDMCTDMFGAD-----IGIKYVRDNNKQTLYKYKGADNYQATNVVLPNGAFDPWH 460
Query: 445 GGGVLKNISKTVLALVEKEGAHHVDLRFST 474
G N + + + +GA H + T
Sbjct: 461 VLGTYNNDTANHMTPLLIQGAAHCSDMYPT 490
>gi|425770000|gb|EKV08475.1| Serine peptidase, family S28, putative [Penicillium digitatum Pd1]
gi|425771690|gb|EKV10127.1| Serine peptidase, family S28, putative [Penicillium digitatum
PHI26]
Length = 553
Score = 89.4 bits (220), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 111/451 (24%), Positives = 185/451 (41%), Gaps = 69/451 (15%)
Query: 39 LSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVY 98
+ ++ +S DS + K +Y + +DH N S +Q RY +++ ++ K P+FVY
Sbjct: 47 VHAIANSQLDS--IIKAEYVSLPIDHSN---SSVGYYQNRYWVSEDNY---KEGGPVFVY 98
Query: 99 TGNEGDIEWFAQ-----NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
E E AQ +T F Y + +F + + EHRYYG S+PY + + ++
Sbjct: 99 DVGEASAESSAQAYLGNSTTFFYQMVQEFGGIGIVWEHRYYGDSLPYNVSLHMQPEHLL- 157
Query: 154 TGYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVA 207
YL++ QALAD + + +LT +P V+ GGSY GM +A+ R YP
Sbjct: 158 --YLNNEQALADIPFFAANFTRRNYSDVDLTPGGTPWVMVGGSYSGMRSAFTRHLYPETI 215
Query: 208 IGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQI-EETAKKPGGLE 266
+ ASSAP+ I F + NC + ++ + + + E+ +K
Sbjct: 216 YASYASSAPVEA--RIDMSVYFDQVYDGLVAYGHLNCTRDVQAALEYVDEQLSKSTSSAA 273
Query: 267 KLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID-DPK 325
++KAF ++ N + + A VY Y + L +F C ++ DPK
Sbjct: 274 AIKKAFFGEGADNNSNGDFTAALAIVYNYFQSYGMGGG-VGSLESF-----CAHMETDPK 327
Query: 326 TGNDV----FAKLYG---AASVYYNYSGTAKCFDLNGD--------------------SD 358
T FA G AA Y ++ + ++N D SD
Sbjct: 328 TNQTAPPQGFADSRGKQYAAERYASWPAFTELVNMNLDTNCKKLETSEALTCDLSQPSSD 387
Query: 359 PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY------GVDPRPNW 412
P +S W WQ CT+ + + + YC A+ G+ P+ +
Sbjct: 388 PDTIS-WTWQYCTQWGFFQTNNFGHRSLLSKYQTLEYAQEYCNRAFPEAIKKGLFPK-HP 445
Query: 413 ITTEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
+ E G ++ SN+ + G DPW
Sbjct: 446 LVDEANAETGGWTIR--PSNVYWSGGQFDPW 474
>gi|299119159|gb|ADJ11382.1| GA15377 [Drosophila miranda]
gi|299119161|gb|ADJ11383.1| GA15377 [Drosophila miranda]
gi|299119163|gb|ADJ11384.1| GA15377 [Drosophila miranda]
gi|299119165|gb|ADJ11385.1| GA15377 [Drosophila miranda]
gi|299119167|gb|ADJ11386.1| GA15377 [Drosophila miranda]
gi|299119171|gb|ADJ11388.1| GA15377 [Drosophila miranda]
gi|299119173|gb|ADJ11389.1| GA15377 [Drosophila miranda]
gi|299119175|gb|ADJ11390.1| GA15377 [Drosophila miranda]
gi|299119177|gb|ADJ11391.1| GA15377 [Drosophila miranda]
gi|299119179|gb|ADJ11392.1| GA15377 [Drosophila miranda]
gi|299119181|gb|ADJ11393.1| GA15377 [Drosophila miranda]
gi|299119183|gb|ADJ11394.1| GA15377 [Drosophila miranda]
gi|299119185|gb|ADJ11395.1| GA15377 [Drosophila miranda]
gi|299119187|gb|ADJ11396.1| GA15377 [Drosophila miranda]
gi|299119189|gb|ADJ11397.1| GA15377 [Drosophila miranda]
Length = 167
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 82/160 (51%), Gaps = 10/160 (6%)
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAM 296
+ NC I SWK E G +++ AF +C KN A ++ + AM
Sbjct: 9 NSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKKFLDYIEEVYGNLAM 68
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDD-PKTGNDVFAKLYGAASVYYNYSGTAKCFDL-- 353
+YP S+FL PLPA+PV+++C + D ++ D+ + A +VY NY+G+ KC D+
Sbjct: 69 VNYPYNSSFLAPLPAYPVRQVCFYLKDLHQSDADLLHAMASALAVYTNYTGSVKCLDISV 128
Query: 354 NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDY 393
N ++D G W Q C +M+M + DS+F S ++
Sbjct: 129 NSNADDSG---WNVQTCNQMVMPFCSNGTDSMFRPSSWNF 165
>gi|303314813|ref|XP_003067415.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240107083|gb|EER25270.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320037761|gb|EFW19698.1| serine peptidase, family S28 [Coccidioides posadasii str. Silveira]
Length = 555
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 87/354 (24%), Positives = 155/354 (43%), Gaps = 69/354 (19%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA-----QNTGFMY 116
+DH N + + ++ R+ +ND+ + K+ P+F++ G E + + +A T F
Sbjct: 77 IDHENPDAK----YKNRFWVNDSKY---KSGGPVFLFDGGEANAQRYADFYLVNETSFFV 129
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK- 175
+ +F + + EHRYYG+S P+ N + A YL++ QALAD + K+
Sbjct: 130 QLLEEFHGMGIVWEHRYYGESNPFPVNLDTP---AEHFQYLNNEQALADIPYFAKNFKRE 186
Query: 176 -----NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
+LT +P V+ GGSY GM AA+ R +YP + A+ AP+ + S
Sbjct: 187 NFPDDDLTPKSTPWVMIGGSYPGMRAAFTRDQYPETIFASFAACAPVQ------AQIDMS 240
Query: 231 NIITQDFRSVSE----NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESW 286
Q +R + NC K ++ ++K I+ ++ +++K F ++ N +
Sbjct: 241 VYYEQVYRGLVAYGYGNCTKDVRAAYKYIDSKLRRGESAAEIKKLFLGDTAQNNTNGD-- 298
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID-DPKTGN----DVFAKLYGAASVY 341
+ A ++T T + + P V + C ++ DPKT + +A GA +V
Sbjct: 299 FTQALIWTWAT-------WQSQGPDGGVGQFCNWLETDPKTNKTAPAEGWAPTKGAKAVV 351
Query: 342 YNYSG------------TAKCFDLNGD-----------SDPHGLSEWGWQACTE 372
++ C N D +DP G++ W WQ C+E
Sbjct: 352 ERFAAWPGLVPRVNAAFETNCKGENPDEPTMCNLGKRVADPSGIA-WTWQYCSE 404
>gi|299119221|gb|ADJ11413.1| GA15377 [Drosophila pseudoobscura]
Length = 167
Score = 89.0 bits (219), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 81/156 (51%), Gaps = 10/156 (6%)
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAM 296
+ NC I SWK E G +++ AF +C KN A ++ + AM
Sbjct: 9 NSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKNFLDYIEEVYGNLAM 68
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDD-PKTGNDVFAKLYGAASVYYNYSGTAKCFD--L 353
+YP S+FL PLPA+PV+++C + D ++ D+ + A +VY NY+G+ KC D +
Sbjct: 69 VNYPYNSSFLAPLPAYPVRQVCFYLKDLHQSDADLLHGMASALAVYTNYTGSVKCLDTSV 128
Query: 354 NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEES 389
N ++D G W Q C +M+M ++ DS+F S
Sbjct: 129 NSNADDSG---WNVQTCNQMVMPFCSNSTDSMFRPS 161
>gi|395324539|gb|EJF56977.1| hypothetical protein DICSQDRAFT_174353 [Dichomitus squalens
LYAD-421 SS1]
Length = 467
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 116/457 (25%), Positives = 180/457 (39%), Gaps = 59/457 (12%)
Query: 73 QTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHR 132
QTF+Q+Y++N T++ K PI Y NE +T + D A + L +EHR
Sbjct: 10 QTFKQQYILNATYF---KEGGPILFYQSNEA-TTITCPDTLILADWAKEIGGLTATLEHR 65
Query: 133 YYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSY 191
Y+G+S+P+G N +N YL+ + D + I +K N+T A++S +V G SY
Sbjct: 66 YFGQSLPFG-NDSYTQENFK---YLTLENVMQDAVNFIDFIKSNVTGASNSKAIVVGRSY 121
Query: 192 GGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGS 251
GG L+A FR YP V GA A S P F + + Q + S + IK +
Sbjct: 122 GGTLSAIFRQNYPDVFYGAWAVSGPFYAFGDST---EIGQEVQQTYLRQSYTAFSRIKQA 178
Query: 252 WKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES------WLSTAFVYTAMTDYP-TPSN 304
+ ++ G L K +C++ + WL A Y +T + PS
Sbjct: 179 FSNVKSLVAS-GDEPTLAKELSLCQAPNVTDVAGAVTFNYWLVGA--YDILTQFSFMPSY 235
Query: 305 FLN-PLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTA-------KCFD---- 352
F N P PV I+D + A L ++++ Y G A C D
Sbjct: 236 FHNVSGPVLPV-----VINDTLSAPSPLAALN--QTLWHAYGGDAVAVPSGKPCLDHTIA 288
Query: 353 ----LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDP 408
+N + P W + G + S D A C + V
Sbjct: 289 LPSSINIAAVPFSWVRCNWVPLNNALDPRGIWQIGAPLPPSASDPSAG---CTALWNVTT 345
Query: 409 RPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWS-------GGGVLKNISKTVLALVE 461
P ++I R ++ +IF G DP + G V + + V V
Sbjct: 346 PPGAAIK--AKYRIADADLRASTRVIFSVGELDPTTSVAQNGLGDAVGTDPHQAVRVFVA 403
Query: 462 KEGAHHVDL-RFSTKEDPQWLKDVRRREVEIIGKWIS 497
G H DL ++ D Q + D R E+ +I W++
Sbjct: 404 G-GGHGQDLEQYDPGADWQSVVDARNIELNVIKGWLN 439
>gi|389740212|gb|EIM81403.1| peptidase S28 [Stereum hirsutum FP-91666 SS1]
Length = 545
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 130/495 (26%), Positives = 202/495 (40%), Gaps = 79/495 (15%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWG-GSKNNAPIFVYTGNE--GDIEWFAQNTGFM 115
TQ LDHF + TF QRY ++ H+ GS P+ V G E G+ +TG +
Sbjct: 71 TQPLDHFYGSTNG--TFPQRYWVSTRHYTPGSNATVPVIVLDGGETSGEDRLPYLDTGIV 128
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK- 174
+A + V +EHRYYG S+ G + + N +L++ QAL D A+ + ++K
Sbjct: 129 DILAEATGGVGVVLEHRYYGDSV---GVPDFSTDNLR---WLNNEQALEDSANFMRNVKF 182
Query: 175 ----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS----APILNFDNIVSP 226
++LTA +P + FGGSY G AA ++ YP + GA+ASS A I N++ +
Sbjct: 183 EGIDEDLTAPGTPWIYFGGSYAGARAAHMKVLYPDIVYGAIASSGVTHAAITNWEYMDVI 242
Query: 227 YSFSNIITQDFRSVSENCYKVIKGSWKQIEETA-KKPGGLEKLQKAFRICKSEKNLAIES 285
F+ + D + + GS + E +A K GLE L +K+ A S
Sbjct: 243 RQFATVECSDNLVQTVSTVDKYLGSNNETERSAIKAIFGLEDL-------PDDKDFA--S 293
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND-VFAKLYGAASVYYNY 344
L T Y ++ P+ F + G+D V + + NY
Sbjct: 294 VLETPLEYWQSKNW-DPAVGSTVFDDFCTLLNNETTTTSTNGSDSVTVQGLEVNNALVNY 352
Query: 345 SGTAK---------------CFDLNGDSDPHGLS------EWGWQACTEMIMLTGGDNKD 383
+G K CF +S +G W +Q CT+ LT D
Sbjct: 353 AGWIKDNLVAGCTGTNTIEQCFGTQDESQYNGTDLYQTWRLWTFQYCTQWGYLTTAPPSD 412
Query: 384 SI--FEESEEDYDARARYCKEAYG------VDPRPNWITTEFGGHKIGLVLKRFASNIIF 435
I D + + C++A+ V P PN + G G+ R A
Sbjct: 413 DIPAIISRVLDLEYEHKICEQAFAPGEFFTVPPLPNVTSVNVLG-DFGIAADRLA----I 467
Query: 436 FNGLRDPW------SGGGVLKNISKTVLALVE--KEGAHHVD---LRFSTKEDPQWLKDV 484
+G DPW S G +N + TVL + HH D L + E P+ ++ +
Sbjct: 468 VDGEIDPWRPDTPHSQYGA-QNRTDTVLRPFKLIPNAVHHYDENGLANHSAEPPE-IQAI 525
Query: 485 RRREVEIIGKWISQY 499
+EVE + W+ +
Sbjct: 526 HEQEVEFVKAWLEDW 540
>gi|345317349|ref|XP_003429868.1| PREDICTED: dipeptidyl peptidase 2-like, partial [Ornithorhynchus
anatinus]
Length = 209
Score = 88.2 bits (217), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 70/129 (54%), Gaps = 5/129 (3%)
Query: 190 SYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIK 249
SYGGML+A+ RLKYPH+ GALA+SAP+L+ + P F +T DF + S C ++
Sbjct: 81 SYGGMLSAYMRLKYPHLVTGALAASAPVLSVAGLGDPRQFFRDVTADFENFSPKCSGAVR 140
Query: 250 GSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAMTDYPTPSNF 305
++ QI + A + + + + C + A + + AF AM DYP P++F
Sbjct: 141 EAFGQIWDLALR-QAYDPISRGMATCHRLSDGADVDQLLEFARNAFAMIAMMDYPYPTDF 199
Query: 306 LNPLPAFPV 314
+ PA PV
Sbjct: 200 MGHFPAHPV 208
>gi|290997480|ref|XP_002681309.1| predicted protein [Naegleria gruberi]
gi|284094933|gb|EFC48565.1| predicted protein [Naegleria gruberi]
Length = 469
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 108/442 (24%), Positives = 181/442 (40%), Gaps = 61/442 (13%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVA 119
Q +DHFN + TF QRY G S ++ +Y E N ++ + A
Sbjct: 39 QRIDHFN--ALNTDTFPQRYY-KFVPEGVSASSPNHLLYICPEATCGGTPNN--YVKNYA 93
Query: 120 PKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTA 179
+ KA + +EHR+YGKS+PY K + N YL + ALAD + I + +
Sbjct: 94 MELKATIYTLEHRFYGKSVPYKSMKTVNMAN-----YLKTEMALADLSVFIEYIATLPSD 148
Query: 180 TDSP--VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDF 237
++P ++ G SY G L+A+F +KYPH+ GAL+SS + ++I+ Y+F + Q
Sbjct: 149 NNTPHQFIIVGCSYPGALSAFFSMKYPHLVKGALSSSGVV---NSILDFYTFDMHVQQ-- 203
Query: 238 RSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMT 297
+ C ++ + +E+ P L R ++ ++ I L F A
Sbjct: 204 -AAGPECTALLTRATSIMEKM--NPTNL------LRDFQAPADMDIRD-LFLLFGDIAGE 253
Query: 298 DYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND-------------VFAKLYGAASVYYNY 344
++ E+C A+ T D F K++ + + Y Y
Sbjct: 254 SVQYGYHY----------ELCNAMKSGNTNLDEVIYQNFHNYSLNFFYKVFETSPLDY-Y 302
Query: 345 SGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESE-EDYDARARYCKEA 403
+G + + +G W Q C+E+ K+ S D D CK+
Sbjct: 303 NGAIGNDNYDPSQGANGGRSWWLQTCSELSYFNTAPPKNLPSIRSRWLDLDYFYDKCKKI 362
Query: 404 YGVDPRPN--WITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE 461
+G +PN ++ ++G ++ L +F NG +DPW GV + K +
Sbjct: 363 FGYPIKPNTDFVNNQYGAKQL---LNTTTGRTVFVNGSQDPWLRAGVDIDPKKFSFLIEC 419
Query: 462 KEGAHHVDLRFSTKEDPQWLKD 483
H VDLR P W ++
Sbjct: 420 NNCGHCVDLRGC----PSWAQN 437
>gi|401398762|ref|XP_003880396.1| protein F23B2.12, partially confirmed by transcript evidence,
related [Neospora caninum Liverpool]
gi|325114806|emb|CBZ50362.1| protein F23B2.12, partially confirmed by transcript evidence,
related [Neospora caninum Liverpool]
Length = 684
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 48/132 (36%), Positives = 72/132 (54%), Gaps = 14/132 (10%)
Query: 94 PIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
P+FVY G EG + G + ++ F A + +EHRYYG S P + +
Sbjct: 224 PVFVYIGGEGPLSSMEVKQGLLAEMGEAFGASVYALEHRYYGDSHPRPDS------SVPN 277
Query: 154 TGYLSSTQALADYASLIIDLKK--------NLTATDSPVVVFGGSYGGMLAAWFRLKYPH 205
+L+S QAL D A+ + +K+ N++ D PV+VFG SY G LAA+ R KYP
Sbjct: 278 LQWLTSHQALGDLAAFVAHVKREQAEQHPQNVSPEDIPVIVFGCSYPGSLAAYARSKYPA 337
Query: 206 VAIGALASSAPI 217
+GA++SS+P+
Sbjct: 338 SILGAISSSSPV 349
>gi|294873112|ref|XP_002766517.1| prolylcarboxypeptidase, putative [Perkinsus marinus ATCC 50983]
gi|239867444|gb|EEQ99234.1| prolylcarboxypeptidase, putative [Perkinsus marinus ATCC 50983]
Length = 133
Score = 88.2 bits (217), Expect = 1e-14, Method: Composition-based stats.
Identities = 53/118 (44%), Positives = 69/118 (58%), Gaps = 12/118 (10%)
Query: 124 ALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID--LKKNLTATD 181
A LVF EHRYYGK+ Y + YL+ QALADY S++ID K+
Sbjct: 16 AFLVFAEHRYYGKTQVYSDGTPDCLR------YLTIEQALADY-SVLIDYIFDKHDLPPS 68
Query: 182 SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIV---SPYSFSNIITQD 236
+ + FGGSYGGMLA+ FR KYPH+ GA+A+SAPI + S +F+ IIT+D
Sbjct: 69 TATIAFGGSYGGMLASAFRYKYPHIVDGAIAASAPIFAIGGVTPEPSKTAFNEIITRD 126
>gi|119175285|ref|XP_001239900.1| hypothetical protein CIMG_09521 [Coccidioides immitis RS]
gi|392870094|gb|EAS28654.2| serine peptidase, family S28 [Coccidioides immitis RS]
Length = 555
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 86/354 (24%), Positives = 155/354 (43%), Gaps = 69/354 (19%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA-----QNTGFMY 116
+DH N + + ++ R+ +ND+ + K+ P+F++ G E + + +A T F
Sbjct: 77 IDHENPDAK----YKNRFWVNDSKY---KSGGPVFLFDGGEANAQRYADFYLVNETSFFV 129
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK- 175
+ +F + + EHRYYG+S P+ N + A YL++ QALAD + K+
Sbjct: 130 QLLEEFHGMGIVWEHRYYGESNPFPVNLDTP---AEHFQYLNNEQALADIPYFAKNFKRE 186
Query: 176 -----NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
+LT +P V+ GGSY GM AA+ R +YP + A+ AP+ + S
Sbjct: 187 NFPDDDLTPKSTPWVMIGGSYPGMRAAFTRDQYPETIFASFAACAPVQ------AQVDMS 240
Query: 231 NIITQDFRSVSE----NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESW 286
Q +R + NC K ++ ++K ++ ++ +++K F ++ N +
Sbjct: 241 VYYEQVYRGLVAYGYGNCTKDVRAAYKYMDSKLRRGESAAEIKKLFLGDTAQNNTNGD-- 298
Query: 287 LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID-DPKTGN----DVFAKLYGAASVY 341
+ A ++T T + + P V + C ++ DPKT + +A GA +V
Sbjct: 299 FTQALIWTWAT-------WQSQGPDGGVGQFCNWLETDPKTNKTAPAEGWAPTKGAKAVV 351
Query: 342 YNYSG------------TAKCFDLNGD-----------SDPHGLSEWGWQACTE 372
++ C N D +DP G++ W WQ C+E
Sbjct: 352 ERFAAWPGLVPRVNAAFETNCKGENPDEPTMCNLGKRVADPSGIA-WTWQYCSE 404
>gi|312066657|ref|XP_003136374.1| hypothetical protein LOAG_00786 [Loa loa]
gi|307768457|gb|EFO27691.1| hypothetical protein LOAG_00786 [Loa loa]
Length = 512
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 111/465 (23%), Positives = 193/465 (41%), Gaps = 58/465 (12%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGFMY- 116
Q LDHF+ N + ++Q Y + + +++ +F+ G E D W N G Y
Sbjct: 69 QKLDHFDNN--DGRKWRQFYTHRKSPY--QRSDGAVFLIVGGEDGADRAWLT-NQGLPYV 123
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
+A + A + +EHR+YG S P + + YL + QA+ D + ++ +
Sbjct: 124 QLADQINASIFMLEHRFYGSSRP------TIDTSIQSLKYLDAKQAVEDIDRFVQEINQR 177
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNII 233
T+ + FGGSY G LAAW R K+P A+ASSAP+ LNF + F I
Sbjct: 178 EKLTNPKWITFGGSYSGNLAAWAREKHPRSIRAAVASSAPLQAKLNFKD------FERQI 231
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI-------CKSEKNLA---- 282
+ C VI+ ++++ + + G KL K FR+ S+K++A
Sbjct: 232 EKIIEKKDTKCVAVIRKLFQKMRQMSTTHEGRRKLVKIFRLDDSLIRPAVSDKDVANFFL 291
Query: 283 -IESWLSTAFVYTAMT-----DYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYG 336
I +++S +++ + D T + L P E + + + + +
Sbjct: 292 VISNYISFIVMHSGINVKDHRDLLTLDVMCSKLTHSPSLESIRELIGMVMASQGKSS-HS 350
Query: 337 AASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDAR 396
A + YN S D ++ W +Q C E + + +F +
Sbjct: 351 AIDIGYN-SFLDFMRDERWNTRNAQPRAWLYQNCHEFGHFRTSEEANGLFGGTLPLSFFL 409
Query: 397 ARYCKEAYG-------VDPRPNWITTEFGGHKIGLVLKRF-ASNIIFFNGLRDPWSGGGV 448
AR CK+ +G + R FGG+ + F A+++I NG DPW+ G+
Sbjct: 410 AR-CKDVFGGHFSLENTEKRIAETNEYFGGN------QHFQATDVILSNGSDDPWTLLGI 462
Query: 449 LKNISKTVLALVEKEGAHHV-DLRFSTKEDPQWLKDVRRREVEII 492
S ++ EG HV D + D L++ + + + +I
Sbjct: 463 YNGTSAIGNFVICIEGTSHVADFYPPSNFDSNALRNAQYKIIRVI 507
>gi|32351098|gb|AAP74974.1| thymus specific serine peptidase [Homo sapiens]
Length = 155
Score = 87.8 bits (216), Expect = 1e-14, Method: Composition-based stats.
Identities = 52/157 (33%), Positives = 86/157 (54%), Gaps = 20/157 (12%)
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
+AP + AL++ +EHR+YG SIP GG + + +LSS ALAD S + L +
Sbjct: 9 LAPAWGALVISLEHRFYGLSIPAGGLEMAQLR------FLSSRLALADVVSARLALSRLF 62
Query: 178 T-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAP---ILNFDNIVSPYSFSNII 233
++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP +L+F +++++
Sbjct: 63 NISSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDFS------EYNDVV 116
Query: 234 TQDFRSV----SENCYKVIKGSWKQIEETAKKPGGLE 266
++ S S C + ++ ++E + G +
Sbjct: 117 SRSLMSTAIGGSLECRAAVSVAFAEVERRLRSGGAAQ 153
>gi|395529078|ref|XP_003766647.1| PREDICTED: thymus-specific serine protease-like [Sarcophilus
harrisii]
Length = 323
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 61/168 (36%), Positives = 91/168 (54%), Gaps = 19/168 (11%)
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK-- 175
+AP + AL++ +EHR+YG S+P GG + +LSS ALAD AS + L +
Sbjct: 93 LAPHWGALVISLEHRFYGHSVPPGGLGLEQLR------FLSSRHALADVASARVHLSRIY 146
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNI 232
N++A+ SP V FGGSY G LAAW RLK+PH+ A+ASSAP+ L+F + S
Sbjct: 147 NISAS-SPWVSFGGSYAGSLAAWARLKFPHLIWAAVASSAPVQAQLDFSSYN--RGVSRS 203
Query: 233 ITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLE---KLQKAFRICKS 277
+ S C + + ++ +++ + G E LQ A R C +
Sbjct: 204 LADPTVGGSLKCRRAVALAFSELDRGLSE--GTEARAALQSAVRACGA 249
>gi|170092076|ref|XP_001877260.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164648753|gb|EDR12996.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 565
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 93/170 (54%), Gaps = 16/170 (9%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQ 110
++ + Q LDHF+ + + TF QRY +N H+ SKN AP+ V G E G+
Sbjct: 68 FRPLWFKQPLDHFSTS--NKHTFHQRYWVNTRHYKPSKN-APVIVLDGGETSGEDRLPFL 124
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
+TG + +A + V +EHRYYGKSIP ++ + + +L++ Q+ AD A+ +
Sbjct: 125 DTGIVEILARATGGVGVVLEHRYYGKSIP------VSNFSTDSLRWLNNAQSAADSANFM 178
Query: 171 IDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
+ K ++L A +P + +GGSY G AA R+ YP + GA++SS
Sbjct: 179 RNFKIDGIDEDLRAPHTPWIYYGGSYAGARAAHMRVLYPDLVYGAISSSG 228
>gi|145490518|ref|XP_001431259.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398363|emb|CAK63861.1| unnamed protein product [Paramecium tetraurelia]
Length = 446
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/161 (32%), Positives = 83/161 (51%), Gaps = 5/161 (3%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
+ Q++DH Y+ + QTF+QRY + ++ + + E + F
Sbjct: 27 FEHQLIDH--YDRTNTQTFRQRYWTVEEYF--QPEGGAVLFWICGEYTCPGIRKERLFPV 82
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKN 176
++A KAL+V +EHRYYGKS+P+ + + +N G + LA + I+ K
Sbjct: 83 ELAQTHKALIVVLEHRYYGKSMPFDEDA-LRLENLKYLGIRQALDDLAYFQLHIVQGKFF 141
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
P + GGSY G +AAW+R +YPH+ +GALASSA +
Sbjct: 142 GVRESHPWIAIGGSYPGAMAAWYRYQYPHLVVGALASSAVV 182
Score = 55.1 bits (131), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 42/151 (27%), Positives = 65/151 (43%), Gaps = 15/151 (9%)
Query: 358 DPHGLS---EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD----PRP 410
D H L +W +Q CTE+ +DS+ D + +YCK+ +G + P
Sbjct: 298 DEHNLRSSRQWKYQCCTEVGWWQTAPEQDSL-RSDRLDLEFYRQYCKDIFGEELKLWPDE 356
Query: 411 NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISK-TVLALVEKEGAHHVD 469
+ FGG + + NIIF NG DPW +++ K V + H V+
Sbjct: 357 DLGNAYFGGFDLQV------DNIIFTNGDEDPWKWVSIIEEKGKFNVYHINCANAGHCVE 410
Query: 470 LRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
L T +D LK R +I G WI +++
Sbjct: 411 LYTPTDQDCDQLKQARIEISQIFGNWIREHY 441
>gi|391865279|gb|EIT74569.1| hydrolytic enzymes of the alpha/beta hydrolase fold protein
[Aspergillus oryzae 3.042]
Length = 573
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 130/552 (23%), Positives = 213/552 (38%), Gaps = 125/552 (22%)
Query: 33 RITPEKLSSLISSSKDSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGG 88
++TPE S++ + Y Y + +DHF+ Y P S F RY + ++
Sbjct: 38 QLTPESAVKQQSTTAE----YPEYYLSVPVDHFHNDSRYAPHSDDHFDLRYWFDAQYY-- 91
Query: 89 SKNNAPIFVYTGNEGD----IEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNK 144
K P+FV E D + +Q G + ++A + + V +EHRYYGKS P
Sbjct: 92 -KEGGPVFVIAAGETDATDRFPFLSQ--GIVTELASAYNGIGVILEHRYYGKSYP----- 143
Query: 145 EIAYKNASTTGYLSSTQALADYASLIIDL------KKNLTATDSPVVVFGGSYGGMLAAW 198
+A +LS+ QALADYA ++ NLT+ +P + +GGSY G A+
Sbjct: 144 -VANLTTENIRFLSTDQALADYAYFASNVVFPGLEHVNLTSKTTPWIAYGGSYAGAFVAF 202
Query: 199 FRLKYPHVAIGALASSA---PILNFDNIVSP---YSFSNII--TQDFRSVSENCYKVIKG 250
R YP V GA++SS I++F P + S+ I TQ F V +
Sbjct: 203 LRKLYPDVYWGAVSSSGVTEAIIDFWQYYEPIRKFGPSDCIWSTQTFIDVVDRIL----- 257
Query: 251 SWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLST---------------AFVYTA 295
I+ K G ++L+ + + + +++ S LS+ ++ + +
Sbjct: 258 ----IDHADNKTLG-QQLKTSLGVSPDKDDVSFVSLLSSGLDSFQSRNWDSNIGSYSFRS 312
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV-----YYNYSGTAKC 350
D T S+ L +P E K P + Y A + + N+ G K
Sbjct: 313 YCDNITSSDLL-----YPDTEAVK----PTIQELIEVAGYDATNSSFVNGFLNHVGLFKK 363
Query: 351 ----------FDLNGDSDPHGL-----SEWGWQACTEM-IMLTGGDNKDSIFEESEE--D 392
+ + SDP L + W +Q CTE ++G I D
Sbjct: 364 SITSDEDSAETEQSDSSDPKSLPKDDGTSWEYQVCTEWGYFMSGASVPKDIMPLISRVLD 423
Query: 393 YDARARYCKEAYGVDPRPNWIT-TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN 451
+ + +C+ YG+ PN + GG + +GL DPW +
Sbjct: 424 VASVSTFCETTYGITSPPNVTNINKHGGFNFSY------PRVAIIDGLADPWRDATPHAD 477
Query: 452 ISKT---------VLALVEKE--------GAHHVDLRFSTKED-------PQWLKDVRRR 487
+K +L V E HH D +K D P + +V++
Sbjct: 478 GTKERESTDDEPFILIDVPAEDVWDGIRGAVHHWDQNGLSKTDEGKGQEPPAAIVEVQKE 537
Query: 488 EVEIIGKWISQY 499
+ IG W+ Q+
Sbjct: 538 VLRFIGVWLDQW 549
>gi|348672163|gb|EGZ11983.1| hypothetical protein PHYSODRAFT_355172 [Phytophthora sojae]
Length = 375
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 96/373 (25%), Positives = 153/373 (41%), Gaps = 49/373 (13%)
Query: 156 YLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
YLSS QALAD A + TD V FGGSY G LAAW RLKYP + G +ASSA
Sbjct: 8 YLSSEQALADLAHFHAFVTDKYGLTDEKWVAFGGSYPGNLAAWVRLKYPALFAGTVASSA 67
Query: 216 PILNFDNIVSPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLE---KLQKA 271
P+ + Y + ++ R CY ++ + Q+ + GG E K+ +
Sbjct: 68 PVHAKTDF---YEYMEVVGDGLRYFGGGECYHEVEQAITQLGQLMD--GGKEDRGKVDEL 122
Query: 272 FRICKSEKNLAIESWLSTAFV--------YTA-----MTDYPTPSNFLNPLPAFPVKEMC 318
F+ C N +S T+ + Y A MT +F P A V+++
Sbjct: 123 FKPCYPMANEFDDSVFETSVMGALQDIAQYNAIHEGVMTLDEVCQHFAKPGDA--VEKLA 180
Query: 319 KAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTG 378
I+ + G+ + +K GAA +GT + + +W +Q C E
Sbjct: 181 SFIEKTRVGDCLDSKFEGAA------NGTVEVLSRDQFDGKSSARQWVYQTCNEFGYFQT 234
Query: 379 GDNKDSIFE--ESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFF 436
+ S F ++ + + CK YG++ P+ +G+ ++R + F
Sbjct: 235 TTSVRSPFHALKAVTEANVGTEICKRVYGMNVAPDVAGANLDYGSLGIEVER----VTFP 290
Query: 437 NGLRDPWSGGGVLKNI---SKTVLALVEKEGAHHVDLRFSTKEDP---QWLKDVRRREVE 490
+G DPW V + S + A+ + AH D+ + ++ D QW D
Sbjct: 291 SGTIDPWHALAVQNSTKLHSFSAEAVFIEGTAHCADMYYPSERDSPQLQWAHDK------ 344
Query: 491 IIGKWISQYFQDL 503
I + Y QD+
Sbjct: 345 -IAARVGWYLQDV 356
>gi|299119169|gb|ADJ11387.1| GA15377 [Drosophila miranda]
Length = 167
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 82/160 (51%), Gaps = 10/160 (6%)
Query: 241 SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA----IESWLSTAFVYTAM 296
+ NC I SWK E G +++ AF +C KN A ++ + AM
Sbjct: 9 NSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKKFLDYIEEVYGNLAM 68
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDD-PKTGNDVFAKLYGAASVYYNYSGTAKCFDL-- 353
+YP S+FL PLPA+PV+++C + D ++ ++ + A +VY NY+G+ KC D+
Sbjct: 69 VNYPYNSSFLAPLPAYPVRQVCFYLKDLHQSDANLLHAMASALAVYTNYTGSVKCLDISV 128
Query: 354 NGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDY 393
N ++D G W Q C +M+M + DS+F S ++
Sbjct: 129 NSNADDSG---WNVQTCNQMVMPFCSNGTDSMFRPSSWNF 165
>gi|392591874|gb|EIW81201.1| peptidase S28 [Coniophora puteana RWD-64-598 SS2]
Length = 570
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 118/510 (23%), Positives = 198/510 (38%), Gaps = 94/510 (18%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQ 110
+ ++ Q +DHF+ TF QRY +N H+ P+ V G E G+
Sbjct: 86 FPDQWFNQPVDHFS---NDSATFAQRYWVNARHYTPGAG-GPVIVLDGGETSGEDRLPFL 141
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
+TG + + + V +EHRYYGKSIP + + +L++ QA AD A+ +
Sbjct: 142 DTGIVEILTRVTGGVGVVLEHRYYGKSIP------VPNFSTDNLRWLNNAQAAADSANFM 195
Query: 171 IDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
+K ++LTA + P + +GGSY G AA R+ YP + GA+ASSA V+
Sbjct: 196 ATVKFDGIDEDLTAPEVPWIYYGGSYAGARAAHMRVMYPELVFGAIASSA--------VT 247
Query: 226 PYSFSN-IITQDFRSVSE-NCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAI 283
S SN + R+ ++ C + ++ S + I+ P ++ F + E +
Sbjct: 248 HASLSNWEYMEIIRTAADPTCSRHLENSIRTIDHLLSMPHTRHAIKALFALGGLEHDEDF 307
Query: 284 ESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGN--------------- 328
S L ++ N + E C+A++ P G
Sbjct: 308 ASLLEGPLGAWQSKNWDPAVN------SEGFDEFCEALNKPILGKVHVAELPYEHEDRMV 361
Query: 329 DVFAKLYGAASVY--YNY-------------SGTAKCFDLNGDSDPHGLS------EWGW 367
D+ L G A + +NY + +CF DS G S W +
Sbjct: 362 DLSEILPGLAVDFSIFNYAKYIRENYVSRCETNVEECFGTYDDSKFQGTSLDEDWRLWLF 421
Query: 368 QACTEMIMLTGG--DNKDSIFEESEEDYDARARYCKEAYG------VDPRPN-WITTEFG 418
Q CTE + D + +R C++AY V P PN + E G
Sbjct: 422 QVCTEWGYFSTAPPDEDQPRIVSKLLTLEYESRICQQAYPPGKHMVVPPMPNVTVVNELG 481
Query: 419 GHKIGLVLKRFASNIIFFNGLRDPWS----GGGVLKNISKTVLALVE--KEGAHHVD--- 469
I A + +G DPW + T++ + G HH D
Sbjct: 482 DFAIA------ADRLAIIDGEVDPWRPCTPHSEYAADREDTLMRPFKLIPNGVHHYDEYG 535
Query: 470 LRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
L+ S + +P ++ + + + +W++ +
Sbjct: 536 LK-SLENEPPEIRQIHGEMIAFVMEWLNNF 564
>gi|242794965|ref|XP_002482483.1| extracelular serine carboxypeptidase, putative [Talaromyces
stipitatus ATCC 10500]
gi|218719071|gb|EED18491.1| extracelular serine carboxypeptidase, putative [Talaromyces
stipitatus ATCC 10500]
Length = 561
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/245 (27%), Positives = 114/245 (46%), Gaps = 28/245 (11%)
Query: 43 ISSSKDSQGLYKTKYHTQI-LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFV 97
+++ DS + Y+ + +DHF+ Y P S TF RY + +H+ K P+FV
Sbjct: 38 LNAKVDSSAVVYHAYNLSVPIDHFHNESRYEPHSNGTFPLRYWFDASHY---KEGGPVFV 94
Query: 98 YTGNE--GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
E G+ G + +A + V +EHRYYG SIP K+ + +
Sbjct: 95 LESGETSGEDRLPYLQKGLVAQLAQLTNGIAVVLEHRYYGASIP---TKDFS---TESLR 148
Query: 156 YLSSTQALADYASLIIDL------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIG 209
+L++ Q LAD A ++ +NLT+ P + +GGSY G + A+ R+ YP V G
Sbjct: 149 FLTTEQGLADVAYFAQNIVYPGFEDQNLTSRHVPYIAYGGSYAGAMVAFLRVTYPDVFFG 208
Query: 210 ALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQ 269
A+ASSA + IV + + I R+ +NC ++ ++ A ++ L+
Sbjct: 209 AIASSAVT---EAIVDYWQYWEPIR---RNAPQNCIHTVENLTGVLDNLAHNTSAVKDLE 262
Query: 270 KAFRI 274
F +
Sbjct: 263 TLFGL 267
>gi|169775711|ref|XP_001822322.1| extracellular serine carboxypeptidase [Aspergillus oryzae RIB40]
gi|238502353|ref|XP_002382410.1| extracellular serine carboxypeptidase, putative [Aspergillus flavus
NRRL3357]
gi|83771057|dbj|BAE61189.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220691220|gb|EED47568.1| extracellular serine carboxypeptidase, putative [Aspergillus flavus
NRRL3357]
Length = 573
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 130/552 (23%), Positives = 213/552 (38%), Gaps = 125/552 (22%)
Query: 33 RITPEKLSSLISSSKDSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGG 88
++TPE S++ + Y Y + +DHF+ Y P S F RY + ++
Sbjct: 38 QLTPESAVKQQSTTAE----YPEYYLSVPVDHFHNDSRYAPHSDDHFDLRYWFDAQYY-- 91
Query: 89 SKNNAPIFVYTGNEGD----IEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNK 144
K P+FV E D + +Q G + ++A + + V +EHRYYGKS P
Sbjct: 92 -KEGGPVFVIAAGETDATDRFPFLSQ--GIVTELASAYNGIGVILEHRYYGKSYP----- 143
Query: 145 EIAYKNASTTGYLSSTQALADYASLIIDL------KKNLTATDSPVVVFGGSYGGMLAAW 198
+A +LS+ QALADYA ++ NLT+ +P + +GGSY G A+
Sbjct: 144 -VANLTTENIRFLSTDQALADYAYFASNVVFPGLEHVNLTSKTTPWIAYGGSYAGAFVAF 202
Query: 199 FRLKYPHVAIGALASSA---PILNFDNIVSP---YSFSNII--TQDFRSVSENCYKVIKG 250
R YP V GA++SS I++F P + S+ I TQ F V +
Sbjct: 203 LRKLYPDVYWGAVSSSGVTEAIIDFWQYYEPIRKFGPSDCIWSTQTFIDVVDRIL----- 257
Query: 251 SWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLST---------------AFVYTA 295
I+ K G ++L+ + + + +++ S LS+ ++ + +
Sbjct: 258 ----IDHADNKTLG-QQLKTSLGVSPDKDDVSFVSLLSSGLDSFQSRNWDSNIGSYSFRS 312
Query: 296 MTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV-----YYNYSGTAKC 350
D T S+ L +P E K P + Y A + + N+ G K
Sbjct: 313 YCDNITSSDLL-----YPDTEAVK----PTIQELIEVAGYDATNSSFVNGFLNHVGLFKK 363
Query: 351 ----------FDLNGDSDPHGL-----SEWGWQACTEM-IMLTGGDNKDSIFEESEE--D 392
+ + SDP L + W +Q CTE ++G I D
Sbjct: 364 SISSDEDSAETEQSDSSDPKSLPKDDGTSWEYQVCTEWGYFMSGASVPKDIMPLISRVLD 423
Query: 393 YDARARYCKEAYGVDPRPNWIT-TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKN 451
+ + +C+ YG+ PN + GG + +GL DPW +
Sbjct: 424 VASVSTFCETTYGITSPPNVTNINKHGGFNFSY------PRVAIIDGLADPWRDATPHAD 477
Query: 452 ISKT---------VLALVEKE--------GAHHVDLRFSTKED-------PQWLKDVRRR 487
+K +L V E HH D +K D P + +V++
Sbjct: 478 GTKERESTDDEPFILIDVPAEDVWDGIRGAVHHWDQNGLSKTDEGKGQEPPAAIVEVQKE 537
Query: 488 EVEIIGKWISQY 499
+ IG W+ Q+
Sbjct: 538 VLRFIGVWLDQW 549
>gi|380487520|emb|CCF37988.1| serine carboxypeptidase S28 [Colletotrichum higginsianum]
Length = 516
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 140/500 (28%), Positives = 194/500 (38%), Gaps = 104/500 (20%)
Query: 60 QILDHFNYNPQSYQ-TFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYD- 117
Q+LDH Q ++ TF+QRY N HWGG P+F+ G E D F TG++ +
Sbjct: 53 QLLDH----TQPWRGTFKQRYWWNAEHWGGP--GYPVFLINGGESDAAGF---TGYLENG 103
Query: 118 -----VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL------ADY 166
A K ++ IEHRYYG+S PY A T L QA+ A+
Sbjct: 104 TVTGLYAETHKGAVILIEHRYYGESWPY------KTSTADTLQLLEVPQAIYDNIYFAET 157
Query: 167 ASLIID---LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
A+L D K A SP V+ GGSY G LAAW + P ASSA + ++
Sbjct: 158 AALPFDQGTTDKGANADKSPWVLIGGSYAGALAAWTSVIAPGTFAAYHASSAVVQAIED- 216
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLE--KLQKAFRICKSEKNL 281
F T +++ NC IK K+++ + E +++ F + E +
Sbjct: 217 -----FWQFFTPIEQALPRNCSADIKLVIKEVDAVLDRGSDAEILAMKEEFGLETLEDHG 271
Query: 282 AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA-------KL 334
+L + A TD V + C I+ T V A L
Sbjct: 272 DFAYYLQKPVI--AWTDSEK-----------AVLDFCDWIETSTTNGQVAAGCEQSGVGL 318
Query: 335 YGAASVY-------YNYSGTA-KCFDLNGDS----DPHGLSEWG----WQACTEMI--ML 376
A + Y YN + A + DL GD+ P L EWG WQ C E +
Sbjct: 319 EAAWAGYTSWMHRRYNETCEAEEACDLYGDAVGYNRPTDL-EWGRSWVWQLCNEPLGWWH 377
Query: 377 TG--GDNKDSIFEESEEDYDARARYCK----EAYGVDPR-PNWITTEFGGHKIGLVLKRF 429
TG N SI S R R C +++G P T G F
Sbjct: 378 TGPPESNGTSIV-SSHVRLXHRZRQCDLRFPQSFGHRPAVSEGFTVAMFNEWTGGWNATF 436
Query: 430 ASNIIFFNGLRDPWS----------GGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQ 479
++F +G DPW GG L ++ LV K G H D + S K
Sbjct: 437 -DKVLFCDGEFDPWRSATMSSDYRPGGPXLS--TEAAPRLVVKGGNHVPDFQLSEKN--- 490
Query: 480 WLKDVRRREVEIIGKWISQY 499
+V +EV IIG+WI +
Sbjct: 491 --AEVVAQEVAIIGRWIEAW 508
>gi|170040233|ref|XP_001847911.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167863799|gb|EDS27182.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 512
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 113/466 (24%), Positives = 197/466 (42%), Gaps = 73/466 (15%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNEGDIE-WFAQNTG 113
+ T +DHFN Q+ + RYL + D + G PI ++ G I+ + +
Sbjct: 65 NFFTTRIDHFN--AQNTDEWTLRYLAVTDWYQPG----GPILIWLGGYMPIQPYMVDESS 118
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+YD+A + + E RY+G+S ++++ +N +L++ Q LAD A + L
Sbjct: 119 LIYDMAREMHGAVYAFETRYFGQSWI---TEDVSTENLR---FLNADQVLADLAEFVAYL 172
Query: 174 KKNLTATD-SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSPYSF 229
K+++ + + V+V G YGG LA WFR++YPH++ A +SS +++F +
Sbjct: 173 KRDVLRNEYAHVLVSGVGYGGSLATWFRVRYPHLSDAAWSSSGVHNALVDFQEFAEDWGQ 232
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLST 289
+ I DF S+ CY I ++ + + G + L + +C + E L
Sbjct: 233 TLI---DFG--SQECYNEIFVAFN-VMQNLIDAGREDILYERLDLCDE---IDTEDRLEV 283
Query: 290 AFVYTAM---TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSG 346
+F + M ++ T S L AF ++C I TG D+ L A + N
Sbjct: 284 SFFFLTMMSSVEFSTLST--GNLTAF--ADVCNDI----TGVDMPTALDAFADWFNN--- 332
Query: 347 TAKCFDLNGDSDPHGLSEW----GW--------------QACTEMIMLTGGDNKDSIFEE 388
D ++DP +W W Q CTE+ + D+ F
Sbjct: 333 KLHADDDCAEADPEVFIDWLREDDWESEWVQKGARQLFYQECTELGLFMTTDSDLQPF-G 391
Query: 389 SEEDYDARARYCKEAYGVDPRPNWIT-------TEFGGHKIGLVLKRFASNIIFFNGLRD 441
+ D C+E +G WIT T+ ++ G + R + I F NG +
Sbjct: 392 NRVGLDMWTDLCQEVFG-----EWITFESIYYATQRSNNRFGALNPR-VNFIHFTNGAEN 445
Query: 442 PWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRR 487
P +L +++ L+ V D R+ + D + L+ V+RR
Sbjct: 446 PIRRVAILNDLNPEALSDVIPNEMFGSDTRYISVNDSEELQAVKRR 491
>gi|194390064|dbj|BAG60548.1| unnamed protein product [Homo sapiens]
Length = 213
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/132 (36%), Positives = 75/132 (56%), Gaps = 5/132 (3%)
Query: 194 MLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWK 253
ML+A+ R+KYPH+ GALA+SAP+L + F +T DF S C + ++ +++
Sbjct: 1 MLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTADFEGQSPKCTQGVREAFR 60
Query: 254 QIEETAKKPGGLEKLQKAFRICK---SEKNLA-IESWLSTAFVYTAMTDYPTPSNFLNPL 309
QI++ + G + ++ F C+ EK+L + + AF AM DYP P++FL PL
Sbjct: 61 QIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFARNAFTVLAMMDYPYPTDFLGPL 119
Query: 310 PAFPVKEMCKAI 321
PA PVK C +
Sbjct: 120 PANPVKVGCDRL 131
>gi|389739084|gb|EIM80278.1| peptidase S28 [Stereum hirsutum FP-91666 SS1]
Length = 550
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 104/449 (23%), Positives = 176/449 (39%), Gaps = 78/449 (17%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-----WFAQNTGF-- 114
+DHF + +F R+ +NDT++ ++ P+F++ E D E + + G
Sbjct: 55 IDHFGTT-NNTDSFANRFWVNDTYY---ESGGPVFLFDSGEQDAEPLLPYYLQEYHGLSA 110
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+A ++ L + EHR+YG S+P+ N A AS +L++ QAL D +
Sbjct: 111 TMRLAKRYNGLAILWEHRFYGDSLPFPVN---ANTTASQWQFLTTEQALEDVIFFANNFN 167
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
+L + +P + GGSY G+ + R + P ASSAP+ ++ S Y +
Sbjct: 168 SSLHPSTTPWIFLGGSYPGIRGSLLRQRNPSTIFATWASSAPVQAQVDMASYYKAAE--- 224
Query: 235 QDFRSVSENCYKVIKGSWKQIEETAKKPG----GLEKLQKAFRICKSE-----------K 279
RS++ NC + ++ET G E+ + F++ K+
Sbjct: 225 ---RSLTRNCSADWVAVTRFVDETLGGSGEGANETERTEMKFKLLKARLSGPGGNTSGAA 281
Query: 280 NLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAF-------PVKEMCKAID-----DPKTG 327
NL +E T+ V A S ++PL + + C ++ +P
Sbjct: 282 NLTMEQANRTSNVDAA-------SILMDPLDFYQYYGFNASLLPFCNILETQNFTEPALE 334
Query: 328 NDVFAKLYGAASVYYNYSGTAKCFDLN----GDSDPHGLSEWGWQACTEMIMLTGGDNKD 383
+ + A + ++ D + G DP W WQ C+E GD +
Sbjct: 335 SGI-ASTQEVEVAFESFLTALAEIDYDSITGGADDPVADRSWMWQYCSEYGFYQRGDPNN 393
Query: 384 SI-FEESEEDYDARARYCKEAY--GVDPRPN-WITTEFGGHKIGLVLKRFASNIIFFNGL 439
+ E S D C + G+ P P ++GG + SNI+F NG
Sbjct: 394 PLSIETSFLSLDLYQSQCNSTFPEGLPPSPAVGNINKYGGWDMA------PSNILFTNGE 447
Query: 440 RDPWSGGGVLKNISKTVLALVEKEGAHHV 468
DPW G LA +E H +
Sbjct: 448 FDPWRTMG---------LASIEDNAPHRL 467
>gi|407919835|gb|EKG13057.1| Peptidase S28 [Macrophomina phaseolina MS6]
Length = 563
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 114/249 (45%), Gaps = 31/249 (12%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA-----QNTGFMY 116
LDHF+ + F+ R+ + ++ + G+ + P+F+Y E D A +T F
Sbjct: 69 LDHFSGDAG---FFENRFWVAESGYKGAGH--PVFIYDAGETDASQNALFRLQNSTSFFK 123
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK- 175
+ F + + EHRYYG+S+P N + A ++ YL+S QALAD + +
Sbjct: 124 QIVDAFGGIGIVWEHRYYGESVPVNINLDTAPEDFI---YLTSEQALADVPVFAANFSRA 180
Query: 176 -----NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
+LT + +P + GGSY GM AA+ R YP + ASSAP+ N +S Y F
Sbjct: 181 NFPDIDLTPSSTPWIFIGGSYPGMRAAFMREYYPETIFASFASSAPV-QAQNDMSVY-FE 238
Query: 231 NIITQDFRSVSE----NCYKVIKGSWKQIEETAKKPGGLEKLQKAF--RICKSEKNLAIE 284
+ +R ++ NC K I + ++E + P L+ F R N
Sbjct: 239 PV----YRGMNAYGFGNCSKDIHAAINYMDELMENPAAAATLKTKFLGRNADKNSNAGFG 294
Query: 285 SWLSTAFVY 293
LST F Y
Sbjct: 295 DALSTIFWY 303
>gi|212536138|ref|XP_002148225.1| extracelular serine carboxypeptidase, putative [Talaromyces
marneffei ATCC 18224]
gi|210070624|gb|EEA24714.1| extracelular serine carboxypeptidase, putative [Talaromyces
marneffei ATCC 18224]
Length = 561
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 67/245 (27%), Positives = 115/245 (46%), Gaps = 28/245 (11%)
Query: 43 ISSSKDSQGLYKTKYHTQI-LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFV 97
+S+ DS Y+ + +DHF+ Y P S TF RY + +H+ K P+FV
Sbjct: 38 LSTEIDSSAAIFHAYNLSVPIDHFHNESRYEPHSSDTFALRYWFDASHY---KEGGPVFV 94
Query: 98 YTGNE--GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG 155
E G+ G + +A + V +EHRYYG SIP K+ + ++
Sbjct: 95 LESGETSGEDRLPYLQKGLISQLAQLTNGIAVVLEHRYYGTSIP---TKDFSTESLR--- 148
Query: 156 YLSSTQALADYASLIIDL------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIG 209
+L++ QALAD A ++ ++LT+ P + +GGSY G + A+ R+ YP V G
Sbjct: 149 FLTTEQALADVAYFAQNIIYPGLEDQSLTSNFVPYIAYGGSYAGAMVAFLRVTYPDVFFG 208
Query: 210 ALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQ 269
A+ASSA + IV + + I R+ ++C + ++ ++ ++ L+
Sbjct: 209 AIASSAVT---EAIVDYWQYWEPIR---RNAPQDCVRTVENLTGVLDSLTSNTAAIQDLE 262
Query: 270 KAFRI 274
F +
Sbjct: 263 TLFGL 267
>gi|258565711|ref|XP_002583600.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237907301|gb|EEP81702.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 533
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 127/547 (23%), Positives = 210/547 (38%), Gaps = 86/547 (15%)
Query: 11 CLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFN---- 66
CLL + + + P T E S + +D LY + + LDHF
Sbjct: 8 CLLLWAATIQARPPVVPIGEYGLFT-ESSSDYGLAKRDLTDLYPEQTISIPLDHFQNEDR 66
Query: 67 YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE----GDIEWFAQNTGFMYDVAPKF 122
Y P S TF+ RY + +H+ K P+ + G E G + + + G + +A
Sbjct: 67 YEPHSNATFKLRYWYDASHY---KKGGPVIILHGGETSGQGRLPFLQK--GMLAQLAKAT 121
Query: 123 KALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL------KKN 176
+ V +EHRYYG SIP K+ + KN +L++ QA+AD A ++ K+
Sbjct: 122 NGVGVVLEHRYYGTSIP---TKDFSTKNLR---FLTTEQAMADSAYFSKNVVFKGLEDKD 175
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSP---YSFS 230
LTA +P +++GGSY G A R++YP + GA++SS I ++ P ++
Sbjct: 176 LTAPKTPHILYGGSYAGAQVALLRVEYPEIFWGAISSSGVTKAIWHYWQYYEPTRQHAPQ 235
Query: 231 NII--TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLS 288
+ + TQ F + +N ++G ++ + K+ L L + + I W S
Sbjct: 236 HCVKQTQTFVDLVDNI--ALRGRNRETTQKLKEFFSLGDLTHTDDFA-NVLSFGISGWQS 292
Query: 289 TAF-------VYTAMTDYPTPSNFLNPLPAFPVK---EMCKAIDDPKTGNDVFAKLYGAA 338
T + + D T L P+ K E+ KA + G + +L A
Sbjct: 293 TNWDPAISGKNFFRYCDTITTDRLLYPVSDEQQKLATELIKAGKYGRHGRQLKNRLLNYA 352
Query: 339 SVYYNYSGTAKCFDLN-------GDSDP---------HGLSEWGWQACTEMIMLTGGDNK 382
+ Y N + A C G DP W +Q CTE L G
Sbjct: 353 A-YVNATYVAPCRRRGSTLDTCFGTHDPAFYKRDDTRQEWRLWNYQVCTEWGFLQTGSGV 411
Query: 383 DSIFEESEE---DYDARARYCKEAYGVDPRPN-WITTEFGGHKIGLVLKRFASNIIFFNG 438
+ D C+EAY + + ++GG I + +G
Sbjct: 412 PKFIKPLVSRLIDLKYTTTTCREAYNLHGEADVRRINKYGGFNIEY------PRLAIIDG 465
Query: 439 LRDPWSGGGVLKNISKTVLALVEK------EGAHHVDLRFSTKEDPQWLKDVRRREVEII 492
DPW ++ + ++K + HH KE Q +E+E +
Sbjct: 466 EADPWKEASPHATFARPRRSTLDKPFILIPDAVHHYFPPDRIKERSQ------AQEIEFV 519
Query: 493 GKWISQY 499
KW+ +
Sbjct: 520 KKWLKDW 526
>gi|67541941|ref|XP_664738.1| hypothetical protein AN7134.2 [Aspergillus nidulans FGSC A4]
gi|40742196|gb|EAA61386.1| hypothetical protein AN7134.2 [Aspergillus nidulans FGSC A4]
Length = 1157
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 108/449 (24%), Positives = 174/449 (38%), Gaps = 94/449 (20%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHW--GGSKNNAPIFVYTGNEGD----IEWFAQNTGFM 115
+DH +NP + TF+ RY +ND ++ GG P+ +Y E D + N+ F+
Sbjct: 87 IDH--HNP-AIGTFRNRYWVNDAYYVPGG-----PVVLYDVGEADGEPSVAHLRSNSSFL 138
Query: 116 YDVAPKFKALLVFIEHR-----------------YYGKSIPYGGNKEIAYKNASTTGYLS 158
+ +F A+ + EHR YYG S+PY N ++ YL+
Sbjct: 139 PPILQEFGAIGIVWEHRSVVSTKVETRIEIDRNRYYGDSLPYPVNNNTLPEHLI---YLT 195
Query: 159 STQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALA 212
+ QALAD + + + +LT +P V+ GGSY G AA+ R +YP + A
Sbjct: 196 TRQALADIPAFAENFTRPGLQEYDLTPASTPWVMIGGSYPGARAAFARNEYPDTIFASFA 255
Query: 213 SSAPILNFDNIVSPYS--FSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQK 270
+SAP+ N+ Y + +++ FRS + N I + + I+ + +++
Sbjct: 256 ASAPVQAQINMSIYYEQVYRSMVANGFRSCASN----IHAALEYIDNQLSQNDTAASIKQ 311
Query: 271 AFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID-DPKTGND 329
F +EKN + + A +Y + Y PA + E C+ ++ DP TG
Sbjct: 312 LFFGPGAEKNSNEDFTHALAVIYGSFQAYGMDG------PAGSLGEFCRYLESDPVTGQP 365
Query: 330 V----FAKLYG--------------AASVYYNYSGTAKCFD--------LNGDSDPHGLS 363
A +G + NY K D LN +
Sbjct: 366 AETEGLALRHGYKHLAERWAEWPIFTQLINVNYETNCKGLDKSVAPSCELNKPTTNPTAI 425
Query: 364 EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY------GV---DPRPNWIT 414
W WQ CTE + + + + C + G+ PR + +
Sbjct: 426 AWTWQYCTEWGFYQSNNEGVHSLLSRYQTLEFQQVMCNRQFPEAVKNGLLPPQPRVDALN 485
Query: 415 TEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
EFGG I SN+ F G DPW
Sbjct: 486 AEFGGWNIR------PSNVYFSAGEFDPW 508
>gi|312080063|ref|XP_003142441.1| hypothetical protein LOAG_06857 [Loa loa]
Length = 109
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 43/85 (50%), Positives = 59/85 (69%), Gaps = 5/85 (5%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY 116
Y + +DHFNY ++ TF +YL+N +++ + P+F Y GNEGDIE FAQ TG M+
Sbjct: 13 YQSMPIDHFNY--RNLDTFGLKYLVNYSYFNC---DGPLFFYAGNEGDIETFAQMTGIMW 67
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYG 141
D+AP F A +VF EHRYYG+S P+G
Sbjct: 68 DLAPLFNAAIVFAEHRYYGESQPFG 92
>gi|350639257|gb|EHA27611.1| hypothetical protein ASPNIDRAFT_121842 [Aspergillus niger ATCC
1015]
Length = 488
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 106/441 (24%), Positives = 169/441 (38%), Gaps = 73/441 (16%)
Query: 59 TQI-LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN-----T 112
TQI +DH N S T+Q R+ ++D + + PIFVY E D AQ+
Sbjct: 1 TQIPIDH---NNASAGTYQNRFWVSDEFY---QPGNPIFVYDTGESDGGSIAQSYLTSTL 54
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT-GYLSSTQALADYASLII 171
F + +F A+ + EHRYYG S P ++Y++ T YL++ QALAD
Sbjct: 55 SFFREFLIEFNAMGIAWEHRYYGNSTP----APVSYESPPETWQYLTTKQALADLPYFAS 110
Query: 172 DLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
+ + +LT +P ++ GGSY G+ AA R +YP A +SS+P+ N+ +
Sbjct: 111 NFSREKYPDVDLTPQGTPWIMVGGSYAGIRAALTRKEYPETIFAAFSSSSPVEAQVNMSA 170
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES 285
Y + S NC I + + I++ +++ F +E N +
Sbjct: 171 YY--DQVYRGMVASGWTNCSADIHAALEYIDDQLSDEDTATSIKQLFFGSGAETNSNGDF 228
Query: 286 WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID-DPK----TGNDVFAKLYGAASV 340
+ +Y Y + C+ ++ DPK TG D A YG V
Sbjct: 229 TAALTAIYGYFQSYGMAGGIGG------LGAFCEYLEIDPKTNGTTGPDGLAPTYGGQYV 282
Query: 341 YYNYSGTAKCFDL-----NGDSDPHGLSE-----------------WGWQACTEMIMLTG 378
++ +L + P S+ W WQ C+E
Sbjct: 283 AERWAAWPTFLELVNLNMGTNCGPQDASQPIDCDFSKPYGDPAAITWTWQYCSEWGFFQA 342
Query: 379 GDNKDSIFEESEEDYDARARYCKEAY--GVD-------PRPNWITTEFGGHKIGLVLKRF 429
++ + + + C + VD PR + + EFGG I
Sbjct: 343 NNDGPHSLASRYQSVEYQQEVCNRQFPDAVDKGLLPPSPRADDVNQEFGGWTIR------ 396
Query: 430 ASNIIFFNGLRDPWSGGGVLK 450
SN+ F G DPW +L
Sbjct: 397 PSNVYFSGGEFDPWRSLSILS 417
>gi|384485422|gb|EIE77602.1| hypothetical protein RO3G_02306 [Rhizopus delemar RA 99-880]
Length = 242
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 59/173 (34%), Positives = 82/173 (47%), Gaps = 20/173 (11%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQ 110
Y ++ Q +DHF N TF+ RY N T W + P+ +Y E D F
Sbjct: 14 YGPFFYDQPVDHFLENST---TFKHRYWAN-TEW--YQPGGPVLIYNAGETAADQRSFLV 67
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
FM ++ +++ +EHR+YG S+P A L++ QAL D AS I
Sbjct: 68 IDSFMAELTKSLNGIIIVMEHRFYGLSLPSSNF------TAKELATLNTAQALEDIASFI 121
Query: 171 IDLKKNLTATDSP------VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
+K D P +V+GGSY G LAAW RLKYP + A+ SSAP+
Sbjct: 122 RYVKIPNFEVDLPPAPETRYIVYGGSYSGNLAAWMRLKYPDIVFAAVPSSAPV 174
>gi|294884993|ref|XP_002771171.1| Dipeptidyl-peptidase 2 precursor, putative [Perkinsus marinus ATCC
50983]
gi|239874541|gb|EER02987.1| Dipeptidyl-peptidase 2 precursor, putative [Perkinsus marinus ATCC
50983]
Length = 371
Score = 85.9 bits (211), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 81/340 (23%), Positives = 133/340 (39%), Gaps = 35/340 (10%)
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS--FSNIITQDFRSVS 241
V+ GGSY G LA FRL+YP + A ASS+P + S + + + +T S+
Sbjct: 22 VIAVGGSYSGFLALAFRLRYPKLVYAAYASSSPGRLYSQEASRFDGRYYSRVTDAADSIR 81
Query: 242 ENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK-----SEKNLAIE--SWLSTAFVYT 294
NC + ++ E+ + +IC E L E + F
Sbjct: 82 SNCSNSVIKAFDDFVHRYAGRVTFEQAKNELKICNPEVFGREDGLFEELVQMVRMEFSGA 141
Query: 295 AMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY-------NYSG- 346
M YP SN P +C ++ A G V N G
Sbjct: 142 NMASYPPSSNS-------PTYMLCTMVEQSGIAGIFEAMTQGRRCVDVTQHLPSPNKHGV 194
Query: 347 -TAKCFDLNGDSDPHGLSEWGWQACTEMI-MLTGGDNKDSIFEESEEDYDA-RARYCKEA 403
+A C D G W WQ CT+++ ++ +F D YC+E+
Sbjct: 195 YSASCGDWTGCGVGRAGRSWDWQTCTQLVEHISSYGPPSDMFPPRRFSVDQWLNAYCEES 254
Query: 404 YGVDPRPNWITTEFGGHKIG-------LVLKRFASNIIFFNGLRDPWSGGGVLKNISKTV 456
+G + N+ T H++ L S ++F NG D W+ G V +N+S T+
Sbjct: 255 FGNNVFHNFSDTT-REHRLNDLWGFDEATLPDITSRVLFVNGGMDGWTAGAVTRNLSDTI 313
Query: 457 LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
++L+ GAHH +++ + D + R + +I+ W+
Sbjct: 314 ISLMIPSGAHHSEMKDPSDNDTSDMIAARDQIDDILSLWL 353
>gi|32351096|gb|AAP74971.1| thymus specific serine peptidase [Homo sapiens]
Length = 164
Score = 85.5 bits (210), Expect = 5e-14, Method: Composition-based stats.
Identities = 56/160 (35%), Positives = 85/160 (53%), Gaps = 14/160 (8%)
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
+AP + AL++ +EHR+YG SIP GG + + +LSS ALAD S + L +
Sbjct: 9 LAPAWGALVISLEHRFYGLSIPAGGLEMAQLR------FLSSRLALADVVSARLALSRLF 62
Query: 178 T-ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
++ SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ V +S N I
Sbjct: 63 NISSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPV----RAVLDFSEYNDIV-- 116
Query: 237 FRSVSENCYKVIKG-SWKQIEETAKKPGGLEKLQKAFRIC 275
S+ + C + + Q+ T + G+ Q ++ C
Sbjct: 117 LHSLGQKCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTC 156
>gi|259483536|tpe|CBF79006.1| TPA: serine peptidase, family S28, putative (AFU_orthologue;
AFUA_4G03790) [Aspergillus nidulans FGSC A4]
Length = 557
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 107/449 (23%), Positives = 175/449 (38%), Gaps = 94/449 (20%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHW--GGSKNNAPIFVYTGNEGD----IEWFAQNTGFM 115
+DH +NP + TF+ RY +ND ++ GG P+ +Y E D + N+ F+
Sbjct: 87 IDH--HNP-AIGTFRNRYWVNDAYYVPGG-----PVVLYDVGEADGEPSVAHLRSNSSFL 138
Query: 116 YDVAPKFKALLVFIEHR-----------------YYGKSIPYGGNKEIAYKNASTTGYLS 158
+ +F A+ + EHR YYG S+PY N ++ YL+
Sbjct: 139 PPILQEFGAIGIVWEHRSVVSTKVETRIEIDRNRYYGDSLPYPVNNNTLPEHLI---YLT 195
Query: 159 STQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALA 212
+ QALAD + + + +LT +P V+ GGSY G AA+ R +YP + A
Sbjct: 196 TRQALADIPAFAENFTRPGLQEYDLTPASTPWVMIGGSYPGARAAFARNEYPDTIFASFA 255
Query: 213 SSAPILNFDNIVSPYS--FSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQK 270
+SAP+ N+ Y + +++ FRS + N I + + I+ + +++
Sbjct: 256 ASAPVQAQINMSIYYEQVYRSMVANGFRSCASN----IHAALEYIDNQLSQNDTAASIKQ 311
Query: 271 AFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID-DPKTGND 329
F +EKN + + A +Y + Y PA + E C+ ++ DP TG
Sbjct: 312 LFFGPGAEKNSNEDFTHALAVIYGSFQAYGMDG------PAGSLGEFCRYLESDPVTGQP 365
Query: 330 V-------------FAKLYGAASVY-----YNYSGTAKCFD--------LNGDSDPHGLS 363
A+ + ++ NY K D LN +
Sbjct: 366 AETEGLALRHGYKHLAERWAEWPIFTQLINVNYETNCKGLDKSVAPSCELNKPTTNPTAI 425
Query: 364 EWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY------GV---DPRPNWIT 414
W WQ CTE + + + + C + G+ PR + +
Sbjct: 426 AWTWQYCTEWGFYQSNNEGVHSLLSRYQTLEFQQVMCNRQFPEAVKNGLLPPQPRVDALN 485
Query: 415 TEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
EFGG I SN+ F G DPW
Sbjct: 486 AEFGGWNIR------PSNVYFSAGEFDPW 508
>gi|353242348|emb|CCA74000.1| related to serine protease [Piriformospora indica DSM 11827]
Length = 548
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 116/503 (23%), Positives = 193/503 (38%), Gaps = 92/503 (18%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGFMYD 117
Q LDH + S TF+QRY +N H+ K P+ V G E G+ +TG
Sbjct: 71 QPLDH---DDPSSPTFEQRYWVNTRHY---KKGGPVIVIDGGETSGEDRLPFLDTGIADI 124
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG--YLSSTQALADYASLIIDLK- 174
+A L V +EHRYYG+S+P KN +T +L++ QAL+D A + +
Sbjct: 125 LAKATHGLGVILEHRYYGESVPV--------KNLTTDSLRWLNNYQALSDSARFMKHVNF 176
Query: 175 --------------KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
NL A +SP + +GGSY G AA R+ YP + GA+ASSA
Sbjct: 177 SSNLFPSSVSSETISNLKAPNSPWIYYGGSYAGARAAHMRVLYPEIVFGAIASSAVT--- 233
Query: 221 DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN 280
++ + + +II + C +I+ S ++ E ++ F + + +
Sbjct: 234 HATINNWEYMDIIRT---AAPPACSALIQASMSHVDAMIVNATLREPVKALFGLQGLKHD 290
Query: 281 ---LAIESW-------------------------LSTAFVYTAMTDYPTPSNFLNPLPAF 312
+++ SW L++ +A+ S + LP
Sbjct: 291 TDFVSVLSWPLGFFQGKNWDPNVDERGFDEFCSKLTSKGDDSALAKL--ESEWTTSLPGI 348
Query: 313 PVKEMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTE 372
P++ + K ++ L A + GT +G S W +Q CTE
Sbjct: 349 PIE--VAVLRYAKYIRNLVEALCPATEDQEDCFGTHNDEKFHGTSLQETWRLWTFQVCTE 406
Query: 373 --MIMLTGGDNKDSIFEESEEDYDARARYCKEAY------GVDPRPNWITTEFGGHKIGL 424
+ D + D ++ C++A+ V PN +T GL
Sbjct: 407 WGYFITAPPDPEWPSMISRLTDLHYESKICRQAFPPGEFMHVPKWPN-VTVVNALGDFGL 465
Query: 425 VLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK------EGAHHVDLR--FSTKE 476
R A F +G DPW + V + +G HH D KE
Sbjct: 466 THSRLA----FIDGEIDPWRPCTPHSQYAPERTDTVSRPFKLIPDGVHHHDQNGLKKHKE 521
Query: 477 DPQWLKDVRRREVEIIGKWISQY 499
+P+ ++ + EV+ + W+ ++
Sbjct: 522 EPKHIQKIHHEEVKFVKAWLKEW 544
>gi|347970283|ref|XP_313407.5| AGAP003642-PA [Anopheles gambiae str. PEST]
gi|333468860|gb|EAA08815.5| AGAP003642-PA [Anopheles gambiae str. PEST]
Length = 476
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 110/470 (23%), Positives = 195/470 (41%), Gaps = 63/470 (13%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
++ T++ DHF+ Q+ TF+ Y+ N ++ + PIF+ G + + G
Sbjct: 42 RFRTRV-DHFDV--QNRATFEFNYVSNGEYY---RPGGPIFIVVGGNNALNAYFIENGLF 95
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
+D+A + L EHRYYG+S P + +A +LS QAL D I L++
Sbjct: 96 HDIARRQGGWLFSNEHRYYGRSSP------VEDYSAPNMRFLSVEQALIDLIEWIDHLRR 149
Query: 176 NLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIIT 234
+ ++ V++ G YGG +A W R ++P + GA S+A ++ V + +
Sbjct: 150 EVVRDPNAKVILHGLGYGGAVAIWARQRFPSLIDGAYGSTASVI---ARVDFAEYGEDMG 206
Query: 235 QDFRSVS-ENCYKVIKGSWK--QIEETAKKPGGLEKLQKAFRIC---KSEKNLAIESW-- 286
+ R++ ++CY ++ W+ + E G +L + FR C +++ L IE++
Sbjct: 207 ETIRTLGHDDCYGIV---WRGFRTAENLIDAGLYGRLSEMFRTCVPLRADDPLTIETFFY 263
Query: 287 -LSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI--DDPKTGNDV----FAKLYGA-A 338
L ++F M +P + V MC + D +T +V F + YGA
Sbjct: 264 GLKSSF-EAEMFGQASPDS---------VTRMCAELLADPAETALEVLANFFERRYGAFD 313
Query: 339 SVYYNY-SGTAKCFD----LNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDY 393
V +++ S A D + ++D G+ + +Q CTE + S F + Y
Sbjct: 314 CVPFDFESNIASALDEEVGVPNNAD-FGIRQRTYQLCTEFGWFLTSSSGGSPF-GTRITY 371
Query: 394 DARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFA------SNIIFFNGLRDPWSGGG 447
C+ +G WI + L F +N+++ N DP
Sbjct: 372 RYFIDTCRAVFG-----EWIDQSVVYDGVRLTNLHFGADDPRVTNVVYVNAQHDPTRFVS 426
Query: 448 VLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWIS 497
+ + A V K +D T D + L VR V + W+S
Sbjct: 427 LTDYTNLLANAFVIKGAVVSLDWMAETPLDSEDLLRVREEIVGYVVSWLS 476
>gi|170045817|ref|XP_001850490.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868718|gb|EDS32101.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 487
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 103/462 (22%), Positives = 184/462 (39%), Gaps = 49/462 (10%)
Query: 57 YHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN-TGFM 115
Y T L+HFN+ Q + + RYL H+ + PI + G + N + +
Sbjct: 29 YFTTRLNHFNH--QQREDWTLRYLSVTEHY---RPGGPILIRLSGNGPVRRDMINESSLI 83
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK 175
++A + + E R+YG S P N +L + Q +AD II LK+
Sbjct: 84 TELAREMGGAVYAFETRFYGMSKPTND------VNTEIMRFLKTDQIMADLVEFIIYLKR 137
Query: 176 NLTATDS-PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSPYSFSN 231
++ ++ PV+V G YGG LA WFR++YPH+ A +S +L+F + +S +
Sbjct: 138 DVFRDENMPVLVSGAGYGGALATWFRVRYPHMGDAAWSSGGYHEAVLDFSDFAESWSETL 197
Query: 232 IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS---EKNLAIESWLS 288
I D+ S+ CY + ++ + + G + L + IC E +L ++ + S
Sbjct: 198 I---DYG--SQQCYNELFVAFN-VMQNLIDIGMTDMLYEKLNICTEIDPEDSLQVQYFFS 251
Query: 289 TAFVYTAM--------TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASV 340
T M TD+ T + L A E + + D + +V
Sbjct: 252 TLMTAVEMHTLRNRNLTDFSTFCDELLSTEAATAVEAFTTWFNSRFTEDEGCIVTDLQTV 311
Query: 341 YYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYC 400
+ T D + +G +W +Q C E ++ F + + A C
Sbjct: 312 VDAFRQT----DWDSPFVANGARQWMYQQCAEFGWFQTTNSAYQPF-GNRVTVELHAEIC 366
Query: 401 KEAYGVDPRPNWITTEFGGHKIGLVLKRFAS------NIIFFNGLRDPWSGGGVLKNISK 454
+ +G W T E RFA+ I F +G DPW + +++
Sbjct: 367 RMVFG-----EWATEETILQAAERTNNRFAALTPNTRRIHFTSGAEDPWRAVTIREDLGG 421
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
A V +++ +++ D + L+ +RR ++ +I
Sbjct: 422 RATADVIPGILSGSEMKATSENDSEELRAAKRRIKTVLKDYI 463
>gi|395326580|gb|EJF58988.1| hypothetical protein DICSQDRAFT_128566 [Dichomitus squalens
LYAD-421 SS1]
Length = 490
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 123/492 (25%), Positives = 192/492 (39%), Gaps = 66/492 (13%)
Query: 30 PSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGS 89
P S I + SS Q T++ Q +DH + N S QTF+QRY I+ +++
Sbjct: 22 PGSHILAARQSSAGDPGGAVQCPADTQWFDQPIDHASTN--SSQTFKQRYQIDTSNF--- 76
Query: 90 KNNAPIFVYTGNEG-DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAY 148
K PI Y E DI ++ D A + ++ +EHRY+G+S+P+G N
Sbjct: 77 KEGGPILFYQSPEATDIACISEL--LFMDWAKELGGIVATLEHRYFGQSLPFG-NNSYTL 133
Query: 149 KNASTTGYLSSTQALADYASLIIDLKKNLT-ATDSPVVVFGGSYGGMLAAWFRLKYPHVA 207
N + + D + +KKN+T A S +V GGSYGG LA FR YP
Sbjct: 134 DNLKP---FTLDNVMQDAVHFLDFVKKNVTGAAKSKTIVAGGSYGGFLAPVFRQNYPDTF 190
Query: 208 IGALASSAPILNFDNI----VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPG 263
GA + P + + +++ N + + S + I+ + Q+++ G
Sbjct: 191 FGAWGIAGPFRSLGTVDEVGAELHNWYNYVQSTYAHRSLEAFDRIRNGFAQVKQLIDT-G 249
Query: 264 GLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD 323
L K +C N + + L+T F ++ Y T S F PA ++D
Sbjct: 250 HNATLTKELSLCHPPSNSSDD--LAT-FASFLVSSYTTMSQFNGLPPAVFFNVSGNSLD- 305
Query: 324 PKTGNDVFAK---LYGAASVYYNYSG---TAKCFD-LNGDSDPHGLSE--WGWQACTEMI 374
ND A L G ++ G C + + + G+ E + W C
Sbjct: 306 -VVVNDTLAAPTPLAGINQTLWHAHGLDAVNGCLNYTDAQNSGFGVQEIPFMWAQCNWFP 364
Query: 375 MLTGGDNKDSIFEESEEDY---DARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRF-- 429
+ N DSIF + + C+ + + T+ G VL R+
Sbjct: 365 LNLAIAN-DSIFNIGSPGLGMSSSPSATCETLFNL--------TQVNGAD---VLTRYNV 412
Query: 430 -------ASNIIFFNGLRDP---------WSGGGVLKNISKTVLALVEKEGAHHVDLRFS 473
+++IIF DP W G V +I K+V+ V G H DL
Sbjct: 413 TREDIGNSTHIIFSENEYDPTTSVAVPPDWLGDNVSTDIDKSVVLFVAGTG-HGQDLARP 471
Query: 474 TKEDPQWLKDVR 485
DP L VR
Sbjct: 472 NATDPSSLTAVR 483
>gi|393238402|gb|EJD45939.1| peptidase S28 [Auricularia delicata TFB-10046 SS5]
Length = 535
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 116/497 (23%), Positives = 196/497 (39%), Gaps = 83/497 (16%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW--GGSKNNAPIFVYTGNE--GDIEWF 108
+ ++ Q LDHF+ + TF QRY ++D H+ GG P+ V E G+
Sbjct: 46 FPARWFRQPLDHFDRAKRD--TFLQRYWVSDRHYLPGG-----PVIVLDCGETNGEDRLP 98
Query: 109 AQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYAS 168
+TG + +A L V +EHRYYG S+P + + +L++ QA AD A+
Sbjct: 99 FLDTGIVDILAKATHGLGVVLEHRYYGSSVP------VLNLTTDSLRWLNNKQAAADSAT 152
Query: 169 LIIDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
+ +++ +LTA +P + +GGSY G AA R+ YP + GA+ASSA + +I
Sbjct: 153 FMANVRFEGIDDDLTAPGTPWIYYGGSYAGGRAAHMRILYPDLVFGAIASSAAV--HASI 210
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAI 283
V + + +I Q+ C + ++GS I+ + P ++ F + E +
Sbjct: 211 VY-WEYFEVIRQN---APAGCMRRLEGSIDIIDRVLQVPVLRRPFKRLFGLEDLEHDDDF 266
Query: 284 ESWLSTAFV-YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY 342
S L + A P S + E C AI+ + +
Sbjct: 267 ASVLLDPLRGWQARNWDPATS-------STQFDEFCAAIEGDNKNAPIRLGPFELDVGVL 319
Query: 343 NYSGTAK---------------CFDLNGDSDPHGLS------EWGWQACTEM-IMLTGGD 380
NY+ K CF DS +S W +Q CTE + LT
Sbjct: 320 NYAEYIKTLVAALCPEGKTVEECFGTYDDSQYQNVSLEEYWRAWIFQVCTEWGLFLTSPP 379
Query: 381 N--KDSIFEESEEDYDARARYCKEAY------GVDPRP--NWITTEFGGHKIGLVLKRFA 430
N + I R C+++Y V P P + GG I A
Sbjct: 380 NPARPRIISRLIT-LPYATRICRQSYPPGEHFTVPPLPDVDGAVNSLGGFDIA------A 432
Query: 431 SNIIFFNGLRDPWSGGGVLKNISKTVLALVEK------EGAHHVDLRFSTK--EDPQWLK 482
+ F +G DPW ++ + + HH D T ++P+ ++
Sbjct: 433 DRLAFVDGTADPWVPITTHSWYARPRADTTRRPFKWIIDAVHHWDENGLTNPADEPRRIR 492
Query: 483 DVRRREVEIIGKWISQY 499
++ +V+ + W+ ++
Sbjct: 493 EIHFDQVQFVKAWLKEF 509
>gi|392565253|gb|EIW58430.1| hypothetical protein TRAVEDRAFT_47584 [Trametes versicolor
FP-101664 SS1]
Length = 534
Score = 85.1 bits (209), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 104/441 (23%), Positives = 172/441 (39%), Gaps = 66/441 (14%)
Query: 47 KDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE 106
D + LDHF + TF+ R+ +NDT++ K P+F++ E + E
Sbjct: 17 NDETATNNAHFFEMPLDHFG---NTTGTFKNRFWVNDTYY---KPGGPVFLFDSGEQNAE 70
Query: 107 -----WFAQNTGF--MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSS 159
+ + G +A ++ + + EHR+YG S+P+ N A +L++
Sbjct: 71 PLLPYYLQEYHGLSATMRLAKRYSGVAILWEHRFYGDSLPFPVNGNTT---AEQWQFLNT 127
Query: 160 TQALADYA------SLIIDLKKNLTATDSPV-------VVFGGSYGGMLAAWFRLKYPHV 206
QAL D SL + +ATD+P+ V GGSY G+ A R++ P
Sbjct: 128 EQALEDVVYFANRFSLTGGHALSTSATDNPLHPSKTPWVWLGGSYPGVRGALLRVRNPET 187
Query: 207 AIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLE 266
ASSAP+ ++ + Y + RS++ NC + +++T E
Sbjct: 188 IFAVWASSAPVHAQVDMAAYYKAAE------RSLTRNCSADWVAVTRFVDDTLANGTAEE 241
Query: 267 KLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNF-------LNPLPAFPVKEMCK 319
+ FR+ + + + + +A P NF + LP V E
Sbjct: 242 TAELKFRLLSARSDGVTKERAANTSAVSAAGVLMDPLNFYQYYGFEASLLPFCNVLESRN 301
Query: 320 AIDDP-------KTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDS-DPHGLSEWGWQACT 371
+ P G DV + +A NY + GD+ DP W WQ C+
Sbjct: 302 STITPFETGLAANLGTDVALDAFLSAIREVNYDA------IPGDADDPVQDRSWMWQYCS 355
Query: 372 EMIMLTGGDNKDSI-FEESEEDYDARARYCKEAY--GVDPRPN-WITTEFGGHKIGLVLK 427
E GD +++ E S + C + G+ P P ++GG +
Sbjct: 356 EYGFYQRGDPNNTLSIETSFLSLELFQDECNTTFPRGLPPSPAVQKVNKYGGWDMT---- 411
Query: 428 RFASNIIFFNGLRDPWSGGGV 448
SN++F NG DPW G+
Sbjct: 412 --PSNVLFTNGEFDPWRTMGL 430
>gi|390603317|gb|EIN12709.1| peptidase S28 [Punctularia strigosozonata HHB-11173 SS5]
Length = 575
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 118/507 (23%), Positives = 200/507 (39%), Gaps = 82/507 (16%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQ 110
++ + Q LDHF + QT++QRY IN H+ ++AP+ V G E G
Sbjct: 84 FEAYWFEQPLDHFAR--EDTQTWRQRYWINTRHYK-PNSSAPVIVLDGGETSGANRLPFL 140
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
+TG + +A + V +EHRYYG+SIP A + +L + Q+ AD A +
Sbjct: 141 DTGIVEILAKATGGVGVVLEHRYYGRSIPVDNLSTDALR------FLDNAQSAADSARFM 194
Query: 171 IDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
+K ++LTA +P + +GGSY G AA ++ YP + GA+ASS ++
Sbjct: 195 SHVKFEGIEEDLTAPHAPWIYYGGSYAGARAAHMKVLYPELVFGAIASSGVT---HAALT 251
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEK-----LQKAFRICKSEKN 280
+ + +II ++ C ++ + I+ PG + L+ F + + E +
Sbjct: 252 IWEYMDIIRV---AMDPTCSSNLQSAIASIDNILTHPGPTGRILRRALKSLFGLGELEHD 308
Query: 281 LAIESWLSTAF---------VYTAMTDYPTPSNFLN-PLPAFPVKEMCKAIDDPKTGNDV 330
S LS T + LN P P + DD K +V
Sbjct: 309 EDFASILSQPLGSWQAKNWDPEVGSTVFDEFCAALNKPFWPLPSEHEMLPYDDNKRSIEV 368
Query: 331 FAKLYGAASV--YYNY------------SGTAKCFDLNGDSDPHGLS------EWGWQAC 370
L ++ Y NY +CF + D+ S +W +Q C
Sbjct: 369 TGALTVDYTIVNYANYIKKHVVSRCPKDMTVEECFGTSDDTKYQDTSLDQNWRKWLFQVC 428
Query: 371 TEM-IMLTGGDNKDSIFEESE-EDYDARARYCKEAYG------VDPRPNWITT-EFGGHK 421
TE T + D S+ + +++ C++A+ V P PN + G
Sbjct: 429 TEWGYFFTAPPDPDYPRIVSKLHTLEYQSKICRQAFPPGKHFIVPPLPNITAVNDLGSFD 488
Query: 422 IGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKE------GAHHVD---LRF 472
I A + +G DPW + +E+ G HH D LR
Sbjct: 489 IA------ADKLAIIDGEVDPWRPATPHSRYAAEREDTIERPFKLIPGGVHHYDEFGLR- 541
Query: 473 STKEDPQWLKDVRRREVEIIGKWISQY 499
+ +E+P + + R + + W++ +
Sbjct: 542 NIEEEPAEILKIHREMIRFVTAWLNDW 568
>gi|389645554|ref|XP_003720409.1| hypothetical protein MGG_15423 [Magnaporthe oryzae 70-15]
gi|351640178|gb|EHA48042.1| hypothetical protein MGG_15423 [Magnaporthe oryzae 70-15]
Length = 564
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 64/227 (28%), Positives = 108/227 (47%), Gaps = 24/227 (10%)
Query: 1 MATRFIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQ 60
M + I ++ + + +A P+F + R + ++ + ++ ++G Y + +
Sbjct: 1 MLAKMIIQRLAMVVAVAGQVCSAA-RPSFMTPRFYAHQQANEVVAA--AEGRYPARNISV 57
Query: 61 ILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGF 114
+DHF+ Y P S TF+ RY + +H+ N P+ V G E G G
Sbjct: 58 PVDHFHNDTSYEPHSNDTFELRYWFDASHY---VNGGPVIVLLGGETSGAERLPFMEKGI 114
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL- 173
+Y +A + + V +EHRYYG S P + +N +L++ QALAD A ++
Sbjct: 115 LYRLARATRGMAVVLEHRYYGASFP---TPNLTTENLR---FLTTDQALADTAYFAKNVV 168
Query: 174 -----KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
+NLT+ +P +GGSY G AA+ R YP V GA++SS
Sbjct: 169 FHGYENRNLTSHTTPYFAYGGSYAGAFAAFVRKLYPDVFWGAISSSG 215
>gi|115387739|ref|XP_001211375.1| predicted protein [Aspergillus terreus NIH2624]
gi|114195459|gb|EAU37159.1| predicted protein [Aspergillus terreus NIH2624]
Length = 564
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 110/467 (23%), Positives = 184/467 (39%), Gaps = 84/467 (17%)
Query: 37 EKLSSLISS-SKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPI 95
E +L+S S D GL I DH N S T+Q R+ + + + +PI
Sbjct: 51 ESFQNLLSHPSNDKTGLSPELVSIPI-DH---NDPSVGTYQNRFWVTEQFY---VPGSPI 103
Query: 96 FVYTGNEGDIEWF----AQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
FV+ G E + E + N F + + FK + + EHRYYG+S P+ + ++
Sbjct: 104 FVFDGGESNAEVYKSHLTSNQSFFWHILKDFKGMGILWEHRYYGESRPFPVSLSTPPEHM 163
Query: 152 STTGYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYPH 205
YL++ QAL D + + +L +P ++ GGSY G+ AA R +YP
Sbjct: 164 E---YLTTRQALEDIPYFAKNFSRPNHPDVDLRPQSTPWIMVGGSYAGIRAALARSEYPE 220
Query: 206 VAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSEN----CYKVIKGSWKQIEETAKK 261
A ASSAP+ + F+N+ +R + N C + I+ +++ I++ K
Sbjct: 221 TIYAAYASSAPVE--AQVDMSIYFNNV----YRGMEANGLGDCIQSIQAAYQYIDDQLGK 274
Query: 262 PGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDY--PTPSNFLNPLPAFPVKEMCK 319
+++ F +E N + + +Y + Y P+N +++ C
Sbjct: 275 EDTAAAVKRLFFGPGAENNTNEGFTAALSHLYDSFQGYGLTGPAN-------SSLQDFCN 327
Query: 320 AIDDPKTG------NDVFAKLYGAAS-------------VYYNYSGTAK---------CF 351
+ KT +F + G A+ V YN++ K C
Sbjct: 328 YMGFGKTNRTAHLTRSLFRRDSGKAAAERWASWAPLLSIVNYNFNTNCKQQNETIPLSCD 387
Query: 352 DLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAY------G 405
+DP ++ WGWQ CTE + + + + C + + G
Sbjct: 388 FEESTTDPDMIA-WGWQYCTEWGFFQSNNFGPRALLSKYQTLEYQQLVCNKQFPEAVKKG 446
Query: 406 V---DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
V PR + EFGG + SN F G DPW+ V+
Sbjct: 447 VLPSQPRADATNAEFGGWGVR------PSNTFFTVGEFDPWTTLSVM 487
>gi|123463102|ref|XP_001316920.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121899640|gb|EAY04697.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 491
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 64/224 (28%), Positives = 106/224 (47%), Gaps = 26/224 (11%)
Query: 42 LISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGN 101
L+S ++ +T +TQ++DHF P F QRY +N + S+N I +Y G
Sbjct: 2 LLSFVSIARSTVQTLSYTQMVDHFARKPTY---FTQRYFVNSDYANKSRN---IILYLGG 55
Query: 102 EGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQ 161
+++ G + ++A + K++++ +EHRY+GKS+P + + Y S Q
Sbjct: 56 ANELDPNEITPGPILEIASQTKSVIIGLEHRYFGKSVPTVNMSQFNMQ------YCSVPQ 109
Query: 162 ALADYASLIIDLK-KNLTATDSPVVVF---GGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
A+ D S ++ K +N T+ F G YGG LA W + +GA ASSAP+
Sbjct: 110 AILDIKSFVLQGKIRNDYCTEPDFCKFFLMGKGYGGGLATWASTGFKRFYLGAWASSAPL 169
Query: 218 LNFDNIVS-----PYSFSNIITQDFRSVSENCYKVIKGSWKQIE 256
++ + Y NI + + NCYKV+ + IE
Sbjct: 170 VSINTFTQYDQKEAYFLGNITIE-----ATNCYKVMHDVYNTIE 208
>gi|440471426|gb|ELQ40436.1| hypothetical protein OOU_Y34scaffold00435g4 [Magnaporthe oryzae
Y34]
gi|440488251|gb|ELQ67984.1| hypothetical protein OOW_P131scaffold00272g5 [Magnaporthe oryzae
P131]
Length = 400
Score = 85.1 bits (209), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 99/200 (49%), Gaps = 23/200 (11%)
Query: 27 PTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLIN 82
P+F + R + ++ + ++ ++G Y + + +DHF+ Y P S TF+ RY +
Sbjct: 22 PSFMTPRFYAHQQANEVVAA--AEGRYPARNISVPVDHFHNDTSYEPHSNDTFELRYWFD 79
Query: 83 DTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPY 140
+H+ N P+ V G E G G +Y +A + + V +EHRYYG S P
Sbjct: 80 ASHY---VNGGPVIVLLGGETSGAERLPFMEKGILYRLARATRGMAVVLEHRYYGASFP- 135
Query: 141 GGNKEIAYKNASTTGYLSSTQALADYASLIIDL------KKNLTATDSPVVVFGGSYGGM 194
+ +N +L++ QALAD A ++ +NLT+ +P +GGSY G
Sbjct: 136 --TPNLTTENLR---FLTTDQALADTAYFAKNVVFHGYENRNLTSHTTPYFAYGGSYAGA 190
Query: 195 LAAWFRLKYPHVAIGALASS 214
AA+ R YP V GA++SS
Sbjct: 191 FAAFVRKLYPDVFWGAISSS 210
>gi|336370214|gb|EGN98555.1| hypothetical protein SERLA73DRAFT_183618 [Serpula lacrymans var.
lacrymans S7.3]
gi|336382973|gb|EGO24123.1| hypothetical protein SERLADRAFT_470898 [Serpula lacrymans var.
lacrymans S7.9]
Length = 555
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 63/203 (31%), Positives = 104/203 (51%), Gaps = 18/203 (8%)
Query: 20 ISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRY 79
+ NA P S + + L ++ S++ + + +Y Q LDHF+ + +TF QRY
Sbjct: 42 LDNAIARPKDTSHSLFVQDLDAIQHSTEKFEE-FPEQYFRQPLDHFS---NTSETFGQRY 97
Query: 80 LINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKS 137
IN H+ P+ V G E G+ +TG + +A + V +EHRYYG S
Sbjct: 98 WINTRHYTPGAG-GPVIVLDGGETSGEDRIPFLDTGIVEILARATGGVGVVLEHRYYGSS 156
Query: 138 IPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK-----KNLTATDSPVVVFGGSYG 192
IP ++ + +L++ Q+ AD A+ + ++K ++LTA ++P + +GGSY
Sbjct: 157 IP------VSNFSTDNLRWLNNEQSAADSANFMANVKFPGIEEDLTAPNTPWIYYGGSYA 210
Query: 193 GMLAAWFRLKYPHVAIGALASSA 215
G AA R+ YP + GA+ASS
Sbjct: 211 GARAAHMRVLYPELVYGAIASSG 233
>gi|123399993|ref|XP_001301579.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121882777|gb|EAX88649.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 496
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/159 (37%), Positives = 82/159 (51%), Gaps = 12/159 (7%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDV 118
TQ LDHF+ + Q +TF QRY + N + +F Y G E + + D+
Sbjct: 9 TQKLDHFDASSQ--ETFNQRYY--KITKNSTANVSALFFYIGGEAPLIGKRMLSLAPVDL 64
Query: 119 APKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLT 178
A K A+L +EHR++G S P E YL+ Q LAD A I +K++
Sbjct: 65 AEKNNAVLFGLEHRFFGNSAPTNLTIE-------NLKYLTIEQGLADLAHFINAMKQDYD 117
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
T + V GGSY G L++WFRL YPH+A + ASSAP+
Sbjct: 118 HT-VRIGVIGGSYPGALSSWFRLLYPHLADVSWASSAPV 155
>gi|121711132|ref|XP_001273182.1| serine peptidase, putative [Aspergillus clavatus NRRL 1]
gi|119401332|gb|EAW11756.1| serine peptidase, putative [Aspergillus clavatus NRRL 1]
Length = 531
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 124/525 (23%), Positives = 194/525 (36%), Gaps = 97/525 (18%)
Query: 40 SSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYT 99
SSL+S + + + Q++DH N TFQQR+ + W G +P+ ++T
Sbjct: 36 SSLLSGNARFRQVRGNTTFDQLIDHDN---PELGTFQQRFWWSSEFWKGP--GSPVVLFT 90
Query: 100 GNEGDIEWFAQNTGFMYD------VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
E D + TG++ + A + ++ +EHRY+G S PY N T
Sbjct: 91 PGEADAPGY---TGYLTNQTLPGRFAQEIGGAVILLEHRYWGTSSPY------TNLNTET 141
Query: 154 TGYLSSTQALADY------ASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVA 207
YL+ Q++AD L D + A +P V+ GGSY G L+AW P
Sbjct: 142 LQYLTLEQSIADLTHFAKTVDLAFDSNHSSNADKAPWVLTGGSYSGALSAWTASTAPGTF 201
Query: 208 IGALASSAP---ILNFDNIVSPY--SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKP 262
+SSAP I NF P + D V E +V K K+ ++ K+
Sbjct: 202 WAYHSSSAPVEAIYNFWQYFVPVVEGMPRNCSMDVSRVVEYVDQVYKSGDKRRQQKLKEM 261
Query: 263 GGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID 322
GL LQ + +N W S +F YT +++ + + V+ K +
Sbjct: 262 FGLGALQHFDDFAAALEN-GPWLWQSNSF-YTGYSEFYQFCDMVE-----NVQPGAKTVP 314
Query: 323 DPKTGN-------------DVFAKLYGAASVYYNYSGTAKCFDLNGDSDP---------H 360
P+ F Y A Y+ CFD + S+P
Sbjct: 315 GPQGVGLEKALKGYASWFKSSFLPGYCAGFGYWTDKLAIDCFDTHKPSNPIFTDQSLANT 374
Query: 361 GLSEWGWQACTEMIML--TGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFG 418
G +W W C E + G + + R C+ + E
Sbjct: 375 GNRQWTWLLCNEPLFYWQDGAPPTEITVVSRLVSAEYWQRQCQLYF----------PEIN 424
Query: 419 GHKIGLVLKRFASNI---------------IFFNGLRDPW--SG-------GGVLKNISK 454
GH G + AS++ I+ NG DPW SG GG L++ +
Sbjct: 425 GHTYGSAEGKRASDVNKWTKGWDSTDTKRLIWTNGQYDPWRDSGVSSVFRPGGPLRSTKQ 484
Query: 455 TVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+ ++ G H DLR + ++ V EV I W+ +Y
Sbjct: 485 APVQVIPG-GFHCSDLRLRNGQVNAGVQKVIDNEVAQIKAWVKEY 528
>gi|299743114|ref|XP_001835550.2| hypothetical protein CC1G_13075 [Coprinopsis cinerea okayama7#130]
gi|298405506|gb|EAU86268.2| hypothetical protein CC1G_13075 [Coprinopsis cinerea okayama7#130]
Length = 564
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 95/177 (53%), Gaps = 16/177 (9%)
Query: 46 SKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--G 103
SK+ ++ ++ Q LDHF+ S + QR+ +N H+ + AP+ V G E G
Sbjct: 67 SKEKNYDFRAQWFEQPLDHFDNT--SDHRWHQRFWVNSRHYK-PRPGAPVIVLDGGETSG 123
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
+ +TG + +A + + +EHRYYG SIP +A + + +L++ Q+
Sbjct: 124 EERLPFLDTGIVNILAKATGGIGIVLEHRYYGDSIP------VANFSTDSLRWLNNAQSA 177
Query: 164 ADYASLIIDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
AD A+ + ++K +++TA +P + +GGSY G AA ++ YP + GA+ASSA
Sbjct: 178 ADSANFMRNVKLDSIQEDITAPGTPWIYYGGSYAGARAAHMKIIYPDIVYGAIASSA 234
>gi|330944433|ref|XP_003306377.1| hypothetical protein PTT_19512 [Pyrenophora teres f. teres 0-1]
gi|311316149|gb|EFQ85539.1| hypothetical protein PTT_19512 [Pyrenophora teres f. teres 0-1]
Length = 543
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 97/191 (50%), Gaps = 22/191 (11%)
Query: 37 EKLSSLISSSKDSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNN 92
++ S S+ D + LY + + +D F+ Y P S +F RY +DT++ K
Sbjct: 35 QRRSLAKSADTDPKLLYPERNISVPVDFFHNETRYEPHSNDSFNLRYWFDDTYY---KPG 91
Query: 93 APIFVYTGNEGDIEW---FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
P+FV G E D F Q G ++ V L V +EHRYYGKS P ++ K
Sbjct: 92 GPVFVLLGGETDGAGRLPFLQK-GIVHQVIKATNGLGVILEHRYYGKSFPV---PDLTTK 147
Query: 150 NASTTGYLSSTQALADYASLIIDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYP 204
N +L++ Q+LA+ ++K +LTA ++P VV+GGSY G AA+ R+ YP
Sbjct: 148 NMR---FLTTEQSLAEIDYFAKNVKFEGIDADLTAPNTPWVVYGGSYAGAQAAFLRVVYP 204
Query: 205 HVAIGALASSA 215
GA++SS
Sbjct: 205 ETFWGAISSSG 215
>gi|336275849|ref|XP_003352678.1| hypothetical protein SMAC_01511 [Sordaria macrospora k-hell]
gi|380094568|emb|CCC07948.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 570
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 93/187 (49%), Gaps = 25/187 (13%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVY 98
I +S D LY + +DHF+ Y P + TF RY + T++ K P+ V
Sbjct: 47 IQASTDPTLLYPARTIEVPVDHFHNDTKYEPHTNDTFDLRYWFDATYY---KKGGPVIVL 103
Query: 99 TGNE----GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT 154
E G +++ + G +Y +A + V +EHRYYGKS+P + + KN
Sbjct: 104 AAGETSGVGRLQFLQK--GIVYQLAKATGGVGVILEHRYYGKSLP---TSDFSTKNLR-- 156
Query: 155 GYLSSTQALADYASLIIDLK------KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
+L++ QALAD ++K +LTA ++P + +GGSY G A+ R YP V
Sbjct: 157 -FLTTDQALADTVYFAKNVKFAGLEHLDLTAPNTPYIAYGGSYAGAFVAFLRKLYPDVYW 215
Query: 209 GALASSA 215
GA++SS
Sbjct: 216 GAISSSG 222
>gi|85112594|ref|XP_964374.1| hypothetical protein NCU00831 [Neurospora crassa OR74A]
gi|28926153|gb|EAA35138.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 561
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 92/187 (49%), Gaps = 25/187 (13%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVY 98
I S D LY + +DHF+ Y P + TF RY + T++ K P+ V
Sbjct: 51 IQESTDPTLLYPARTIKVPVDHFHNDTKYEPHTNDTFDLRYWFDATYY---KKGGPVIVL 107
Query: 99 TGNE----GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT 154
E G +++ + G +Y +A + V +EHRYYGKS+P + + KN
Sbjct: 108 AAGETSGVGRLQFLQK--GIVYQLAKATGGVGVILEHRYYGKSLP---TSDFSTKNLR-- 160
Query: 155 GYLSSTQALADYASLIIDLK------KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
+L++ QALAD ++K +LTA ++P + +GGSY G A+ R YP V
Sbjct: 161 -FLTTDQALADTVYFAKNVKFAGLEHLDLTAPNTPYIAYGGSYAGAFVAFLRKLYPDVYW 219
Query: 209 GALASSA 215
GA++SS
Sbjct: 220 GAISSSG 226
>gi|350295890|gb|EGZ76867.1| peptidase S28 [Neurospora tetrasperma FGSC 2509]
Length = 562
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 92/187 (49%), Gaps = 25/187 (13%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVY 98
I S D LY + +DHF+ Y P + TF RY + T++ K P+ V
Sbjct: 50 IQESTDPTLLYPARTIKVPVDHFHNDTKYEPHTNDTFNLRYWFDATYY---KKGGPVIVL 106
Query: 99 TGNE----GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT 154
E G +++ + G +Y +A + V +EHRYYGKS+P + + KN
Sbjct: 107 AAGETSGVGRLQFLQK--GIVYQLAKATGGVGVILEHRYYGKSLP---TSDFSTKNLR-- 159
Query: 155 GYLSSTQALADYASLIIDLK------KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
+L++ QALAD ++K +LTA ++P + +GGSY G A+ R YP V
Sbjct: 160 -FLTTDQALADTVYFAKNVKFAGLEHLDLTAPNTPYIAYGGSYAGAFVAFLRKLYPDVYW 218
Query: 209 GALASSA 215
GA++SS
Sbjct: 219 GAISSSG 225
>gi|336463818|gb|EGO52058.1| hypothetical protein NEUTE1DRAFT_118619 [Neurospora tetrasperma
FGSC 2508]
Length = 569
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 92/187 (49%), Gaps = 25/187 (13%)
Query: 43 ISSSKDSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVY 98
I S D LY + +DHF+ Y P + TF RY + T++ K P+ V
Sbjct: 47 IQESTDPTLLYPARTIKVPVDHFHNDTKYEPHTNDTFDLRYWFDATYY---KKGGPVIVL 103
Query: 99 TGNE----GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT 154
E G +++ + G +Y +A + V +EHRYYGKS+P + + KN
Sbjct: 104 AAGETSGVGRLQFLQK--GIVYQLAKATGGVGVILEHRYYGKSLP---TSDFSTKNLR-- 156
Query: 155 GYLSSTQALADYASLIIDLK------KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
+L++ QALAD ++K +LTA ++P + +GGSY G A+ R YP V
Sbjct: 157 -FLTTDQALADTVYFAKNVKFAGLEHLDLTAPNTPYIAYGGSYAGAFVAFLRKLYPDVYW 215
Query: 209 GALASSA 215
GA++SS
Sbjct: 216 GAISSSG 222
>gi|294911623|ref|XP_002778023.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239886144|gb|EER09818.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 457
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 104/469 (22%), Positives = 188/469 (40%), Gaps = 51/469 (10%)
Query: 51 GLYKTKYHTQILDHFNYN-PQSYQTFQQRYLINDTHWGGSKNNAPI-FVYTGNEG----- 103
G + Y Q++DHF N + + + Q+YL ND + + P+ +YTG E
Sbjct: 8 GNHTYMYCDQLVDHFTDNRAHAREKWCQKYLYNDEFSSRDRCHRPVVLLYTGGESPGLSD 67
Query: 104 DIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQAL 163
DI + M +A + A+ + +EHRYYG P K+++ K T + QAL
Sbjct: 68 DIVTASNVADDMMSLAKEIGAVAMALEHRYYGVEKP---TKKLSRKVLEKT--FTVDQAL 122
Query: 164 ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNI 223
AD A ++ V FGGSY G++AAW R YP ++ ++A + +
Sbjct: 123 ADVARFRDYAATKYNLENAQFVTFGGSYPGVVAAWARAVYPESSLQLYNNAAADAFANEL 182
Query: 224 VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS---EKN 280
V S C IK + ++ + + KL++ F IC + E++
Sbjct: 183 VGG--------------SIACATAIKQAHAEVGQMLEDEKLRRKLERTFNICGTNMLEEH 228
Query: 281 LAIESWLSTAFV-YTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAAS 339
W + + ++ ++ P L ++++C DPK + + G A
Sbjct: 229 DNRRLWTAEGVLSFSVQSNDPRCEGDL-----CNIEKICSRFTDPKRPASL---VEGLAE 280
Query: 340 VYYNYSGTAKCFDLN---------GDSDPHGLSEWGWQACTEM-IMLTGGDNKDSIFEES 389
V + S T +C D++ +S + W +Q C E T +K+ ++
Sbjct: 281 V--SRSRTKECVDVDFEEVARMYRNESYADWMKMWVFQTCNEFGFYQTCDSSKNCLWPPR 338
Query: 390 EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVL 449
D + C+ + P + K G L AS I+ NG DPW ++
Sbjct: 339 LNDLKWNMKLCEIGWDFTPEEISANIQHTNRKYG-GLSLNASRILSVNGGVDPWHRLALV 397
Query: 450 KNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
+ + + +HH ++E Q + R +++ +W+ +
Sbjct: 398 TSDNYERPTIWVPGASHHYWTHRGSEEVDQNIARARSGIRDVVKQWLEE 446
>gi|206598107|gb|ACI15917.1| serine carboxylase [Bodo saltans]
Length = 461
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 108/462 (23%), Positives = 170/462 (36%), Gaps = 66/462 (14%)
Query: 55 TKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGF 114
Y Q++DH N + TFQQR+ + + + A +++ G+ E G+
Sbjct: 31 VNYFEQLIDHSN---AALGTFQQRWWGDLSAFTNQSEYAMLYI----NGEGEAHGSPDGY 83
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
A + +EHRYYG+S+P N S YL+ ALAD + + L+
Sbjct: 84 PAVYGRNISAAMFGLEHRYYGESMP------APLTNRSMLNYLTVENALADLEAFRLYLQ 137
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI-LNFDNIVSPYSFSNII 233
+ + + GGSY G L+AW + YP + A +SS + FD Y+F I
Sbjct: 138 ATVLKKEVKWFICGGSYSGALSAWSKATYPASYLAAWSSSGVVNARFDY----YAFDGHI 193
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAF----RICKSEKNLAIESWLST 289
+V C K I+ + Q P + F K + + +
Sbjct: 194 VSVLPAV---CEKAIRSVFDQFSAAYDDPTQRAAMMAIFGTPAYFTKEDMAWMLADGSAM 250
Query: 290 AFVYTA---MTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYG---AASVYYN 343
A Y + + D P + NP + + L+G +S YY
Sbjct: 251 AVQYGSKNYLCDSIVPLSKTNPFEQYAT---------------IIKALWGESFTSSCYY- 294
Query: 344 YSGTAKCFDLNGDSDPHGLSEWGW--QACTEMIMLTGGDNKDSIFEESEEDYDARARYCK 401
+ +C SD + + W Q C+++ G + DY C+
Sbjct: 295 ---STECLSNAQYSDQWAAAGYAWVYQCCSQLAYWQSGYPNSLRLDVITTDY--YINQCR 349
Query: 402 EAYGVDPRPNWIT--TEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKT---V 456
A+G + P+ T +FGG A+N+I G DPW GV + V
Sbjct: 350 SAFGQNTFPDTYTFNAKFGGATPN------ATNVIALQGSDDPWQTAGVQAPLGPNYPEV 403
Query: 457 LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQ 498
LA G H DL DP L R V + W+ +
Sbjct: 404 LAQCNGCG-HCGDLMSPLPTDPASLTAQREAIVNYLDLWLGR 444
>gi|388852407|emb|CCF54022.1| uncharacterized protein [Ustilago hordei]
Length = 656
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 65/229 (28%), Positives = 109/229 (47%), Gaps = 24/229 (10%)
Query: 43 ISSSKDSQ--GLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW---GGSKNNAPIFV 97
+S++K S+ + + YH Q LDHF+ Q+ F+QR+ + H+ +N + +
Sbjct: 126 VSATKRSKKHNIKEPLYHKQPLDHFDNTTQA--QFEQRFFYSTRHYKPASARRNGEAVPI 183
Query: 98 YTGNEGDIEWFAQ----NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA-- 151
Y + G+ + A+ +TG + + + + +EHRYYG S+P E+ +A
Sbjct: 184 YILDSGEADATARIPFLDTGILDIFSKATGGIGIVLEHRYYGTSLP--NRTELGSGDAWG 241
Query: 152 -STTGYLSSTQALADYASLI--IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
+L++ QAL D A I +D+ + ++ +GGSY G AA R YP +
Sbjct: 242 VDQLRWLTNKQALQDSADFIRNLDIPGTDNSEKRKIIYYGGSYPGARAAHMRFLYPELVH 301
Query: 209 GALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEE 257
GA+ASSA + D P F I R NC + I+ + I+E
Sbjct: 302 GAIASSAVVTAVDEF--PEYFYPIA----RGAPTNCSQAIQAAIAGIDE 344
>gi|303319727|ref|XP_003069863.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240109549|gb|EER27718.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320034152|gb|EFW16097.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 554
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 120/255 (47%), Gaps = 28/255 (10%)
Query: 39 LSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVY 98
L+S I S ++ + +Y +DH N + ++ RY +ND ++ + P+ ++
Sbjct: 54 LASAIDSLAETSAVV-AEYANIPIDHRNPG----RMYRNRYWVNDQYY---QPGGPVVIF 105
Query: 99 TGNEGDIEWFAQ-----NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
E + + FA T ++ + +F + + EHRYYG+S+PY N + + A+
Sbjct: 106 DTGETNGQAFADYYLVDPTSYIVQLLREFHGVGLVWEHRYYGESLPYPVNGQTS---AAQ 162
Query: 154 TGYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVA 207
YL+ QAL D ++ +LT +P ++ GGSY GM AA+ RLKYP
Sbjct: 163 FQYLTLEQALQDLPYFARTFRRPRLPNADLTPRSTPWIMVGGSYPGMRAAFSRLKYPDTI 222
Query: 208 IGALASSAPILNFDNIVSPYS--FSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGL 265
A +SSAP ++ Y + ++ + NC + + +++ I+ P
Sbjct: 223 FAAFSSSAPAQARIDMSVYYEQVYRGLVAYGY----GNCTRDVNAAYRYIDAQLANPSTA 278
Query: 266 EKLQKAFRICKSEKN 280
++++ F +E+N
Sbjct: 279 AQIKRQFLGPGAEQN 293
>gi|119183270|ref|XP_001242695.1| hypothetical protein CIMG_06591 [Coccidioides immitis RS]
gi|392865604|gb|EAS31403.2| serine peptidase, family S28 [Coccidioides immitis RS]
Length = 554
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 66/257 (25%), Positives = 120/257 (46%), Gaps = 32/257 (12%)
Query: 39 LSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVY 98
L+S I S ++ + +Y +DH N + ++ RY +ND ++ + P+ ++
Sbjct: 54 LASAIDSLAETSAVV-AEYANIPIDHRNPG----RMYRNRYWVNDEYY---QPGGPVVIF 105
Query: 99 TGNEGDIEWFAQ-----NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST 153
E + + FA T ++ + +F + + EHRYYG+S+PY N + + A+
Sbjct: 106 DTGETNGQAFADYYLVDPTSYIVQLLREFHGVGLVWEHRYYGESLPYPVNGQTS---AAQ 162
Query: 154 TGYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVA 207
YL+ QAL D ++ +LT +P ++ GGSY GM AA+ RLKYP
Sbjct: 163 FQYLTLEQALQDLPYFARTFRRPRLPNADLTPRSTPWIMVGGSYPGMRAAFSRLKYPDTI 222
Query: 208 IGALASSAPILNFDNIVSPYSFSNIITQDFRSVSE----NCYKVIKGSWKQIEETAKKPG 263
A +SSAP ++ Y Q +R + NC + + +++ I+ P
Sbjct: 223 FAAFSSSAPAQARIDMSVYYE------QVYRGLVAYGYGNCTRDVNAAYRYIDAQLANPS 276
Query: 264 GLEKLQKAFRICKSEKN 280
++++ F +E+N
Sbjct: 277 TAAQIKRQFLGPGAEQN 293
>gi|189209519|ref|XP_001941092.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977185|gb|EDU43811.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 543
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 69/226 (30%), Positives = 111/226 (49%), Gaps = 29/226 (12%)
Query: 8 LSFCLLFSSTL----TISNAKIFPTFPSSRITPEKLSSLIS-SSKDSQGLYKTKYHTQIL 62
+ F ++ ++ L T S K+ P + R + SL + D LY + + +
Sbjct: 1 MRFSVVVAAVLGGIQTASALKLSPRSLAERDEQVQRRSLAKRADTDPTLLYPERNISVPV 60
Query: 63 DHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE----GDIEWFAQNTGF 114
D F+ Y P S ++F RY +DT++ K P+FV G E G + + + G
Sbjct: 61 DFFHNETRYEPHSNESFNLRYWFDDTYY---KPGGPVFVLLGGETNGAGRLPFLQK--GI 115
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
++ V L V +EHRYYGKS P ++ KN +L++ Q+LA+ ++K
Sbjct: 116 VHQVIKATNGLGVILEHRYYGKSFPV---PDLTTKNMR---FLTTEQSLAEIDYFAKNVK 169
Query: 175 -----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
+LTA ++P VV+GGSY G AA+ R+ YP GA++SS
Sbjct: 170 FEGIDADLTAPNTPWVVYGGSYAGAQAAFLRVVYPETFWGAISSSG 215
>gi|224001020|ref|XP_002290182.1| serine protease [Thalassiosira pseudonana CCMP1335]
gi|220973604|gb|EED91934.1| serine protease, partial [Thalassiosira pseudonana CCMP1335]
Length = 398
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 96/399 (24%), Positives = 159/399 (39%), Gaps = 45/399 (11%)
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI-IDLKKNLTATDSPV 184
L IEHRYY G + +N YLSS QA+ D + ++ ++
Sbjct: 17 LYAIEHRYYDD----GSADDNHDENQFDYTYLSSRQAVKDIVEFMDVEDAAASNTNNNTW 72
Query: 185 VVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR-SVSEN 243
++FGGSY GML+AW RL +P GA+A+SAP+ + Y + D R SE
Sbjct: 73 ILFGGSYPGMLSAWARLLHPETIHGAVANSAPVQPQLDFYQYYDHVALDLVDERVGGSEE 132
Query: 244 CYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPS 303
C ++ + +Q+ + LE++ F +C L AFV + P+ S
Sbjct: 133 CKRIFVEAHEQVVAVFDEVNPLEEVATLFNVCGGADMLRASRRNMEAFVGDGLIRVPSQS 192
Query: 304 NFLNPLPAFPVKE-------MCKAIDDPKTGN------DVFAKL----YGAASVYYNYSG 346
N P+F E +C I + N ++ A L G +Y+
Sbjct: 193 ND----PSFEGAEKISLNFVICNTIIEEHMSNPNHSSMEILANLKKIQNGLLCEEVDYNE 248
Query: 347 TAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSI-----FEESEEDYDARARYCK 401
+ D + + L+ W +Q C E + + F E D + +C+
Sbjct: 249 LIEFMTTRTDENANDLA-WLYQTCNEFGFYQTCNVGSTCPYGRGFHLIERDLE----FCQ 303
Query: 402 EAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVE 461
+G+D + + + +G ++ IIF NG DPW+ V + + V V
Sbjct: 304 IVFGIDTVTQNVASSL-EYYVGGSENTASNRIIFVNGDVDPWTELAVTETSNGDVENTVM 362
Query: 462 KEGAHHVDLRFSTKE----DPQWLKDVRRREVEIIGKWI 496
GA H F T E D + + R+ + + +W+
Sbjct: 363 VPGASH---HFWTHEIKDTDDKAVVAARQAIYDTVSEWL 398
>gi|315056299|ref|XP_003177524.1| hypothetical protein MGYG_01598 [Arthroderma gypseum CBS 118893]
gi|311339370|gb|EFQ98572.1| hypothetical protein MGYG_01598 [Arthroderma gypseum CBS 118893]
Length = 551
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 107/449 (23%), Positives = 173/449 (38%), Gaps = 57/449 (12%)
Query: 34 ITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNA 93
+ PE+ +L++ S + + +T +DH T++ R+ IN+ + K
Sbjct: 48 LRPEQFKALMADSNFAGPVAETV--DMPIDH---KSNKTGTYKHRFWINEQDY---KPGG 99
Query: 94 PIFVYTGNEGDIEWFA-----QNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAY 148
P+FV+ E + +A T F + KF + + EHRYYG+S PY I
Sbjct: 100 PVFVFDCGEAAGQRYADRYLFNETNFFRQLTQKFHGIGIIFEHRYYGESTPY----PITV 155
Query: 149 KNASTT-GYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAWFRL 201
K YL + QALAD D K+ +L +P V+ GGSY GM AA+ R
Sbjct: 156 KTPPEHFKYLDNDQALADLPYFAKDFKRAAFPKNDLRPNATPWVMVGGSYPGMRAAFTRD 215
Query: 202 KYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSE----NCYKVIKGSWKQIEE 257
+YP + ASSAP+ ++ Y Q +R + NC K I +++ I+
Sbjct: 216 RYPETIYASWASSAPVQAKIDMAVYYE------QVYRGLVAYGWGNCTKDIHAAYRYIDR 269
Query: 258 TAKKPGGLEKLQKAFRICKSEK--NLAIESWLSTAFV-YTAMTDYPTPSNFLNPLPAFPV 314
+ ++K F ++K N + L A+ + + F N L P
Sbjct: 270 QLSRKDSAAAIKKLFLGEGADKASNGDFTAALIVAYAGWQSSGADGQVGKFCNWLEVDPK 329
Query: 315 KEMCKAIDD--PKTGNDVFAKLYGA---------ASVYYNYSGTAK-----CFDLNGDSD 358
+ PK G+ A+ + A A+ N T K C D
Sbjct: 330 TNKTAPAEGWAPKLGDKAMAERFAAWPTLAEMVNANAMTNCKQTDKSKPLECKLDKPSED 389
Query: 359 PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFG 418
P +S W WQ C+E G + + +C + + ++
Sbjct: 390 PDFIS-WIWQYCSEWGYYQGVNFPQHAILSRYQTNQYNQEFCYRQFPTGVKSGYLPIAPQ 448
Query: 419 GHKIGLVLKRF---ASNIIFFNGLRDPWS 444
HK K + SN+ + G DPW+
Sbjct: 449 THKTNRATKGWHMRPSNVYWSGGQYDPWN 477
>gi|395331113|gb|EJF63495.1| peptidase S28 [Dichomitus squalens LYAD-421 SS1]
Length = 574
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 104/212 (49%), Gaps = 36/212 (16%)
Query: 31 SSRITPEKLSSLISSSKDSQ---------------GLYKTKYHTQILDHFNYNPQSY--- 72
++R+ P L+ +D + G++ ++ Q LDHF +
Sbjct: 46 AARVRPRAQDPLLVQGQDGREDYPVSNAQAESKWGGVFPERWFEQPLDHFAEGKGAQAET 105
Query: 73 QTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGFMYDVAPKFKALLVFIE 130
+T++QRY +N H+ + AP+FV G E G+ +TG +A + V +E
Sbjct: 106 ETWRQRYWVNTRHYVPGPD-APVFVIDGGETSGEDRLGFLDTGIADILARATGGVGVVLE 164
Query: 131 HRYYGKSIPYGGNKEIAYKNASTTG--YLSSTQALADYASLIIDLK-----KNLTATDSP 183
HRYYG+S P KN +T +L++ Q+ AD A+ + ++K ++LTA + P
Sbjct: 165 HRYYGESRPV--------KNLTTDSLRFLNNAQSAADSANFMANVKFPGIDEDLTAPNHP 216
Query: 184 VVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
+ +GGSY G AA ++ YP + GA+ASS
Sbjct: 217 WIYYGGSYAGARAAHMKVLYPDLVWGAVASSG 248
>gi|358054024|dbj|GAA99823.1| hypothetical protein E5Q_06526 [Mixia osmundae IAM 14324]
Length = 1068
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 120/501 (23%), Positives = 201/501 (40%), Gaps = 90/501 (17%)
Query: 62 LDHFNYNPQ---SYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE----WFAQNTGF 114
++HF +P+ + +TF+ RY +N H+ K + ++ EG + N F
Sbjct: 601 INHFPGDPKYQPTNETFKLRYFVNADHY---KPGGAVLIWNAGEGSADDQTAAIFSNRTF 657
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA-----SL 169
+Y++ ++ + +EHRYYGKSIP + + YL+ QALAD+ +
Sbjct: 658 IYNLTQSTNSVGIVLEHRYYGKSIP------MPSFSTDDLQYLTVEQALADWEYFAKNAE 711
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSP 226
+ L + +T +P++ G SY G LAAW + YP G +ASSA IL+F V+P
Sbjct: 712 LPTLPQLITQNKAPLIYLGASYSGALAAWQSVVYPTTFWGYIASSAVTVSILDFAAYVNP 771
Query: 227 YSFSNIITQDFRSVSENCYKVIKGSWKQIE---ETAKKPG----GLEKLQKAFRI---CK 276
+DF +C + + I+ E+ +KP GL + Q F
Sbjct: 772 -------VRDF--APRDCVANLSAALDLIDTTSESMRKPLQSIFGLPQDQLEFVDFVNVL 822
Query: 277 SEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYG 336
+E L +S ++A D + + + + A +P + +
Sbjct: 823 AEGPLEWQSSSGNQREFSAFCDAISTGDGVQASDTLLEESGLAAASEPAR-----SAVLR 877
Query: 337 AASVYYNYSGTA-------KCFDLNGDSDPHGLSE-------WGWQACTEMIMLTGGD-- 380
AS N + T KCF N +D L++ W WQ C++M G+
Sbjct: 878 YASYITNRTLTDCEDGNLNKCFGTNNATDLQ-LADLDQDWRLWAWQTCSQMAFFMTGNVP 936
Query: 381 NKDSIFEESEEDYDARARYCKEAY------GVDPRPNWITT-EFGGHKIGLVLKRFASNI 433
++ D R C+ + + P+ + ++GG+ G+ R A
Sbjct: 937 TGEAAIMSKHIDLAYYKRVCQTTFPPGTFNKLPEVPDMASVLKYGGY--GIRHPRLA--- 991
Query: 434 IFFNGLRDPWS-----GGGVLKNISKTVLA-LVEKEGAHHVDLRFSTKEDPQWL---KDV 484
F +G DPW IS T L+ + G H D F + P +
Sbjct: 992 -FVDGTEDPWRPCTPHADAAPPRISTTDEPYLLVQGGVHGAD--FGGRYGPNQTATNNEA 1048
Query: 485 RRREVEIIGKWISQYFQDLAQ 505
R E+ I+ WI+ + Q L Q
Sbjct: 1049 RATELSILQSWIADW-QRLHQ 1068
>gi|347971076|ref|XP_318472.5| AGAP004015-PA [Anopheles gambiae str. PEST]
gi|333469620|gb|EAA43688.5| AGAP004015-PA [Anopheles gambiae str. PEST]
Length = 497
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 111/470 (23%), Positives = 187/470 (39%), Gaps = 59/470 (12%)
Query: 51 GLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQ 110
G ++TK +DHFN PQ+ TF+ Y N+ + + PIF++ G + +
Sbjct: 56 GRFRTK-----IDHFN--PQNRDTFEFSYFSNNEFY---RPGGPIFIFVGGNFAMTTYYI 105
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
G +YD A + A L EHRYYG S P + + +L S QAL D I
Sbjct: 106 EHGLLYDTAARDGAWLFTNEHRYYGASTP------VPDYSTENLRFLKSEQALMDLIEWI 159
Query: 171 IDLKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL-NFDNIVSPYS 228
L+ + ++ VV+ G Y G LA W R ++P + GA + A +L +FD
Sbjct: 160 DYLRNTVVGDPNAKVVLMGTGYAGALATWARQRFPSIIDGAWGAGATVLASFDFQEHAGD 219
Query: 229 FSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEK-LQKAFRIC---KSEKNLAIE 284
+I R CY +I +++ + GL++ + C + K L +E
Sbjct: 220 IGEMIR---RFGGNECYSMIWVAFRTAQYLID--AGLDQTVTSLLNTCEPIEPGKLLDVE 274
Query: 285 SWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYY-N 343
T F + + L ++++C+A+ + T L G +VYY N
Sbjct: 275 ----TLFYHLKLA---IQEAMLGQQSTAKIRDVCEAMMN-STEETALHDLAGWLNVYYAN 326
Query: 344 YSGTAKCFDLNGDS-------DPH----GLSEWGWQACTEMIMLTGGDNKDSIFEESEED 392
FD N ++ P GL + +QACTE D + F +
Sbjct: 327 LPCNPFDFDTNMEAAQVLQPGAPENALLGLRQTQYQACTEFGWFRTTDLDEQPFGDRVTM 386
Query: 393 YDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRF------ASNIIFFNGLRDPWSGG 446
+ + C+ +G W+T + L + ++N++F NG DP
Sbjct: 387 HFFLSA-CRALFG-----EWVTDAVIYEGVRLTNLHYGGQDPRSTNVLFTNGEFDPNRLV 440
Query: 447 GVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
+ I+ A V + ++ +ED L +++ IG W+
Sbjct: 441 SITSYINPLSYAYVVPNEFLYSEIYSIAEEDSTELVTIKQSIQSFIGLWL 490
>gi|145246822|ref|XP_001395660.1| extracellular serine carboxypeptidase [Aspergillus niger CBS
513.88]
gi|134080382|emb|CAK46303.1| unnamed protein product [Aspergillus niger]
Length = 569
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 101/440 (22%), Positives = 175/440 (39%), Gaps = 86/440 (19%)
Query: 62 LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD----IEWFAQNTG 113
+DHF+ Y P + TF RY ++ +H+ + P+FV E D I + +Q G
Sbjct: 60 IDHFHDESRYEPHTNATFGLRYWLDTSHY---QPGGPVFVIAAGETDGSDRIPFLSQ--G 114
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+ +A + + + +EHRYYG+S P+ + +N +L++ QALADYA ++
Sbjct: 115 VVTQLAAAYHGIGLILEHRYYGESYPF---TNLTTENIR---FLTTEQALADYAYFASNI 168
Query: 174 ------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIV 224
+LTA +P + +GGSY G A+ R YP + GA++SS I+++
Sbjct: 169 VFPGLEDLDLTAATTPWIAYGGSYAGAFVAFLRKLYPELYWGAVSSSGVTEAIIDYWEYY 228
Query: 225 SPYSFSNIITQDFRSVSENCYKVIKGSWKQIE----ETAKKPGGLEKLQKAFRICKSEKN 280
P C I+ S ++ + A ++L+ AF S N
Sbjct: 229 EPIRLYG---------PSQCISTIQTSIDIVDRILIDHADNKTLAQQLKSAF--GGSATN 277
Query: 281 LAIESWLST------------------AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID 322
L+ + ++S+ ++ A + T S+ L P A P+ K +
Sbjct: 278 LSNQDFVSSLSYGLDSFQSRNWDKTIGTPLFRAFCNNITNSDLLYPEAAEPISASVKQLV 337
Query: 323 -----DPKTGNDV-----FAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE-----WGW 367
DP V + + V+ + S + ++ + P LS+ W +
Sbjct: 338 QIAGYDPSNTTLVNNFLNWIGFLSTSKVFADSSSESSSSNITNFTTPQPLSKDSGTSWNY 397
Query: 368 QACTEMIMLTGGDNKDSIFE---ESEEDYDARARYCKEAYGVDPRPN-WITTEFGGHKIG 423
Q C E T G + S + D + +C +G+ P+ +FGG
Sbjct: 398 QVCVEWGYFTTGSSVPSTTKPLISRLLDLPYLSSFCPNTFGIKTPPDVQRINQFGGFNFS 457
Query: 424 LVLKRFASNIIFFNGLRDPW 443
+ GL DPW
Sbjct: 458 Y------PRVAIIGGLADPW 471
>gi|350636991|gb|EHA25349.1| hypothetical protein ASPNIDRAFT_56689 [Aspergillus niger ATCC 1015]
Length = 569
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 102/440 (23%), Positives = 175/440 (39%), Gaps = 86/440 (19%)
Query: 62 LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD----IEWFAQNTG 113
+DHF+ Y P + TF RY ++ +H+ + P+FV E D I + +Q G
Sbjct: 60 IDHFHNESRYEPHTNATFGLRYWLDTSHY---QPGGPVFVIAAGETDGSDRIPFLSQ--G 114
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+ +A + + + +EHRYYG+S P+ + +N +L++ QALADYA ++
Sbjct: 115 VVTQLAAAYHGIGLILEHRYYGESYPF---TNLTTENIR---FLTTEQALADYAYFASNI 168
Query: 174 ------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIV 224
+LTA +P + +GGSY G A+ R YP + GA++SS I+++
Sbjct: 169 VFPGLEDLDLTAATTPWIAYGGSYAGAFVAFLRKLYPELYWGAVSSSGVTEAIIDYWEYY 228
Query: 225 SPYSFSNIITQDFRSVSENCYKVIKGSWKQIE----ETAKKPGGLEKLQKAFRICKSEKN 280
P C I+ S ++ + A ++L+ AF S N
Sbjct: 229 EPIRLYG---------PSQCISTIQTSIDIVDRILIDHADNKTLAQQLKSAF--GGSATN 277
Query: 281 LAIESWLST------------------AFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAID 322
L+ + ++S+ ++ A + T S+ L P A P+ K +
Sbjct: 278 LSNQDFVSSLSYGLDSFQSRNWDKSIGTPLFRAFCNNITNSDLLYPETAEPISASVKQLV 337
Query: 323 -----DPKTGNDVFAKL-----YGAASVYYNYSGTAKCFDLNGDSDPHGLSE-----WGW 367
DP V L + V+ + S + ++ + P LS+ W +
Sbjct: 338 QIAGYDPSNTTLVTNLLNWIGFLSTSKVFADSSSESSSSNITNFTTPQPLSKDSGTSWNY 397
Query: 368 QACTEMIMLTGGDNKDSIFE---ESEEDYDARARYCKEAYGVDPRPN-WITTEFGGHKIG 423
Q C E T G + S + D + +C +G+ P+ +FGG
Sbjct: 398 QVCVEWGYFTTGSSVPSTTKPLISRLLDLPYLSSFCPNTFGIKTPPDVQRINQFGGFNFS 457
Query: 424 LVLKRFASNIIFFNGLRDPW 443
+ GL DPW
Sbjct: 458 Y------PRVAIIGGLADPW 471
>gi|169621811|ref|XP_001804315.1| hypothetical protein SNOG_14118 [Phaeosphaeria nodorum SN15]
gi|111057235|gb|EAT78355.1| hypothetical protein SNOG_14118 [Phaeosphaeria nodorum SN15]
Length = 582
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 117/448 (26%), Positives = 176/448 (39%), Gaps = 69/448 (15%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
+S+ + + +Y T LDHF+ + +++ TF RY + + K P+F+Y EG
Sbjct: 70 ASAPPATEIIPEEYVTLPLDHFDPS-KNHGTFNNRYWAASSSY---KPGGPVFIYDVGEG 125
Query: 104 DIEWFA-----QNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLS 158
+ A +T F + K+ + + EHR+YG S P GG I A +L+
Sbjct: 126 NASTNALFRIQNSTSFFKQIVDKYNGIGIVWEHRFYGNSSP-GGPVNIDTP-AEQFRFLN 183
Query: 159 STQALADYASLI--IDLKK-----NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGAL 211
+ Q+LAD A+ LK LT +P V GGSY GM AA+ R KYP +
Sbjct: 184 TEQSLADVAAFASQFSLKNRGINYTLTPETTPWVFVGGSYPGMRAAFMREKYPDTIYASY 243
Query: 212 ASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQI----EETAKKPGGLEK 267
ASSAP+ ++ + F I + NC + I+ + + I + +K ++
Sbjct: 244 ASSAPVQA--SVDQSFYFEPIWRGMQKYGFGNCSRDIQAATRYIDGVFDRGSKNNAAADQ 301
Query: 268 LQKAFRICKSEKN--LAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK 325
L+ F +EKN L+T FV + L +++C I+
Sbjct: 302 LKIMFLGKGAEKNSHATFADALTTVFVTWQSYGMEGGNTGL--------RKLCDWIETGN 353
Query: 326 TGNDVFAKLY---------GAASVYY-----------NYSGTAKCFDLNGD-------SD 358
N A Y G AS Y N SG A D+ GD +D
Sbjct: 354 GTNTTSAPSYDQKIPQAVQGWASFPYFAKNVNMYLETNCSGKA---DVVGDCDLDRKFTD 410
Query: 359 PHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR---PNWITT 415
P +S W WQ CT+ + + C + PR P W
Sbjct: 411 PAMIS-WTWQYCTQWGYFQSANLGPRQLVSKYNSLVHQHDICHRQFPDAPRDLFPEWPAV 469
Query: 416 EFGGHKIGLVLKRFASNIIFFNGLRDPW 443
+ K G R SN + NG DPW
Sbjct: 470 DQTNRKFGGWSIR-PSNTYWSNGEFDPW 496
>gi|347970281|ref|XP_001230929.2| AGAP003641-PA [Anopheles gambiae str. PEST]
gi|333468859|gb|EAU76915.2| AGAP003641-PA [Anopheles gambiae str. PEST]
Length = 328
Score = 81.6 bits (200), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 65/222 (29%), Positives = 108/222 (48%), Gaps = 23/222 (10%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-WFAQNTGFMYD 117
T ++HF+ PQ+ TF+ YL ND ++ + P+F+ G + +F +N+ F D
Sbjct: 66 TSRVNHFD--PQNRDTFEFNYLHNDQYY---RQGGPLFIVVGGHYPVNPYFMENSHFR-D 119
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
VA A L EHRY+G+S P ++++ +N ++ + Q L D I L++ +
Sbjct: 120 VAALEGAWLATNEHRYFGESYP---TEDLSTENLR---FMRTEQVLFDLIEWIDFLRREV 173
Query: 178 TAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNII 233
++ V++ G YGG LA W R ++P++ GA SSAP+ NF+ NII
Sbjct: 174 MGDPNARVILHGVGYGGSLATWARQRFPNIIDGAWGSSAPVRATTNFEEFA--VEVGNII 231
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC 275
+ S+ CY I ++ E G E + + F C
Sbjct: 232 RE---RGSDQCYNRIFQAFHTAENLIDA-GRTEMISEMFNTC 269
>gi|397579651|gb|EJK51289.1| hypothetical protein THAOC_29548 [Thalassiosira oceanica]
Length = 618
Score = 81.6 bits (200), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 123/277 (44%), Gaps = 60/277 (21%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTH--WGGSKNNAPIFVYTGNEG---DIEWFAQN--- 111
Q+LDHF+ + TF QRY +D + GG+ +A F+ G EG D +
Sbjct: 74 QVLDHFSK--EDGVTFGQRYFTSDRYVSEGGTGEDAVNFLCVGGEGPSLDASVLVNSVHC 131
Query: 112 TGFMYDVAP------KFKALLVFIEHRYYGKSIPYGGNKEIAY---KNASTTGYLSSTQA 162
TG M ++A + + +EHRYYG+SIP E K +
Sbjct: 132 TGDMVELAKLLHEEHGWDVRMYALEHRYYGESIPSPKKGEGGLRSPKEGGDGPDGGDKKG 191
Query: 163 LADYASL-----IIDLKKNLTATD--SPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
AD+A L ++D+ +T+TD + V FGGSY GML+AW L +P A++SS+
Sbjct: 192 DADFAHLSSRQAVLDIVNFVTSTDPHNRWVAFGGSYPGMLSAWSHLLHPSKIYAAVSSSS 251
Query: 216 PI---LNF----DNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQI------------- 255
P+ L+F D + S + +++ S C V++G Q+
Sbjct: 252 PLQVTLDFGRYNDRVASDLADADV------GGSGECLAVVEGGHAQVAAALEADGKKSDP 305
Query: 256 --EETAKKPG------GLEKLQKAFRICKSEKNLAIE 284
+ +K PG GL+K+ + F +C L +E
Sbjct: 306 GSDPKSKSPGKKGGEVGLDKVAEMFDVCGGADTLRVE 342
>gi|67900488|ref|XP_680500.1| hypothetical protein AN7231.2 [Aspergillus nidulans FGSC A4]
gi|40742088|gb|EAA61278.1| hypothetical protein AN7231.2 [Aspergillus nidulans FGSC A4]
gi|259483425|tpe|CBF78805.1| TPA: hypothetical serine carboxypeptidase (Eurofung) [Aspergillus
nidulans FGSC A4]
Length = 519
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 57/168 (33%), Positives = 85/168 (50%), Gaps = 23/168 (13%)
Query: 62 LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD----IEWFAQNTG 113
+DHF+ Y P S TF RY + T++ + P+FV E D E+ +Q G
Sbjct: 27 IDHFHNETRYAPHSNGTFNLRYWFDSTYY---QPGGPVFVIAAGETDGEDRFEFLSQ--G 81
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+ +A + L V +EHRYYG+S P+ G A +LS+ Q+LADYA +
Sbjct: 82 IVTQLAEAYNGLGVILEHRYYGESYPFPG----ADVTVDELRFLSTEQSLADYAYFAKHV 137
Query: 174 ------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
+LTA ++P + +GGSY G A+ R YP + GA++SS
Sbjct: 138 IFPGLEAYDLTAPNTPWIAYGGSYAGAQVAFMRKLYPSIFHGAVSSSG 185
>gi|392565931|gb|EIW59107.1| peptidase S28 [Trametes versicolor FP-101664 SS1]
Length = 548
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 70/256 (27%), Positives = 118/256 (46%), Gaps = 29/256 (11%)
Query: 28 TFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWG 87
PS + E LS+L + + + TQ LDHF S TF+QRY I+ H+
Sbjct: 48 ALPSEDDSDELLSAL-------ERKFPAHWFTQPLDHFTN--ASGHTFEQRYWISTRHYR 98
Query: 88 GSKNNAPIFVYTGNE--GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKE 145
+ +AP+ V G E G +TG + + + V +EHRYYG++IP
Sbjct: 99 -PRPDAPVIVLDGGETSGRDRLPFLDTGIVEILTKATGGVGVILEHRYYGRTIP------ 151
Query: 146 IAYKNASTTGYLSSTQALADYASLIIDLK-----KNLTATDSPVVVFGGSYGGMLAAWFR 200
+ + +L++ Q+ AD A+ + ++K ++L A + P + +GGSY G AA +
Sbjct: 152 VQNFTTDSLRWLNNAQSAADSANFMANVKFPGIDEDLAAPNHPWIYYGGSYAGARAAHMK 211
Query: 201 LKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAK 260
+ YP + GA+ASS + + ++ I R+ C + ++ S K I+
Sbjct: 212 ILYPDLVYGAIASSGVT---HAALELWEYAETIR---RAADATCAQHLENSIKIIDALLD 265
Query: 261 KPGGLEKLQKAFRICK 276
P L+ F + K
Sbjct: 266 VPVTKYPLKALFGLAK 281
>gi|409081101|gb|EKM81460.1| hypothetical protein AGABI1DRAFT_125845 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 588
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 68/246 (27%), Positives = 115/246 (46%), Gaps = 53/246 (21%)
Query: 34 ITPEKLSSLISSSKDSQG---LYKTKYHTQI-LDHFNYNPQSYQTFQQRYLINDTHWGGS 89
+TP L L +S +S G +Y +I +DHF ++ QTF+ R+ +N T+W
Sbjct: 46 LTPTPLERLSFASDESSGDPIADGGEYFIEIPVDHFEN--KTTQTFKNRFWVNATYW--- 100
Query: 90 KNNAPIFVYTGNEGDIE----WFAQN---TGFMYDVAPKFKALLVFIEHRYYGKSIPYGG 142
++ P+FV+ E D E ++ Q + +A ++ + + EHR+YG S+P+
Sbjct: 101 EDGGPVFVFDSGEQDAEPLLPYYLQEYHGQSAVMRLAERYNGVAILWEHRFYGVSLPFPV 160
Query: 143 NKEIAYKNASTTG----YLSSTQALADYASLIIDLKKN--------------------LT 178
N+ +TTG +L++ QAL D+ +K+ +
Sbjct: 161 NR-------NTTGDQWQFLNTEQALEDFIFFANSFRKSSSDRQIPSKGDIRNDPLALPIH 213
Query: 179 ATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFR 238
+ +P V GGSY G+ AA R++ P V A ASSAP+ ++ S Y + R
Sbjct: 214 PSGTPWVFLGGSYPGIRAAHLRIRNPEVVYAAWASSAPVQAEVDMASYYKAAE------R 267
Query: 239 SVSENC 244
S++ NC
Sbjct: 268 SLTRNC 273
>gi|308512235|ref|XP_003118300.1| hypothetical protein CRE_00736 [Caenorhabditis remanei]
gi|308238946|gb|EFO82898.1| hypothetical protein CRE_00736 [Caenorhabditis remanei]
Length = 526
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 109/452 (24%), Positives = 180/452 (39%), Gaps = 50/452 (11%)
Query: 59 TQILDHFN-YNPQSY--QTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFM 115
TQ +D+F+ N +Y Q QRY N +++ +KN + + + W + N +
Sbjct: 62 TQKVDNFDATNGATYIQQMLLQRYWYN-SNFTQNKNIVFLMIQGESPATDTWIS-NPNYQ 119
Query: 116 Y-DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
Y A +F A + +EHR +GKS PY K + +QALAD S I +
Sbjct: 120 YLQWAKEFGADVFQLEHRCFGKSRPYPDTSMPGIK------VCTMSQALADIHSFIGKMN 173
Query: 175 KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS---- 230
+ + FGGSY G L+A FR +YP +GA+ASSAP+ D + + FS
Sbjct: 174 DKYNFRNPKWITFGGSYPGTLSALFRQQYPQDTVGAVASSAPL---DWTLDFFGFSITKK 230
Query: 231 NIITQDFR-SVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS-----EKNLAIE 284
+I+T FR S+ C + + P E++ FR+ + L I+
Sbjct: 231 SIVTVFFRHSMLAKCEPSLYQHATTQSHKNRNP-TTEQILFNFRLTPAFVDGQYTQLDID 289
Query: 285 SWLSTAFVY-TAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAASVYYN 343
++ + + Y + Y L + +CK ++D T DV ++
Sbjct: 290 NFFANVYSYFQGVIQYTYDGRNAATLGGLNAQNLCKKMNDVNTP-DVITRVKNTIDWVNT 348
Query: 344 YSG-----------------TAKCFDLNGDSDPHGLSE--WGWQACTEMIMLTGGDNKDS 384
+G +D +GD + W W C E+ L D +
Sbjct: 349 LNGDPVGGLDNSYSDMIAVLANATYDDSGDIGMFSAANRGWMWLCCNELGALQTTDQGRN 408
Query: 385 IFEESEEDYDARARYCKEAYGVDPRPNWI--TTEFGGHKIGLVLKRFASNIIFFNGLRDP 442
IF+++ C +G D +I + +K G A+N++ NG DP
Sbjct: 409 IFQQTVP-MGYYIDMCTAMFGADVGIKFIRDNNKQTLYKYGGADNYQATNVVLPNGAFDP 467
Query: 443 WSGGGVLKNISKTVLALVEKEGAHHVDLRFST 474
W G N + + + +GA H + T
Sbjct: 468 WHVLGTYNNNTANHMTPLLIQGAAHCSDMYPT 499
>gi|241753987|ref|XP_002401168.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215508365|gb|EEC17819.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 186
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/186 (27%), Positives = 86/186 (46%), Gaps = 18/186 (9%)
Query: 318 CKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE---WGWQACTEMI 374
C + + + + ++ A+++YN++G CF++ H E W +Q CTE++
Sbjct: 1 CDILTEAYDDDSLVEAVFKVANMFYNHTGDLVCFNI-----LHYFKENPVWNFQMCTELV 55
Query: 375 MLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRF--ASN 432
+ + + + C++ Y V P+P + TE+G RF +SN
Sbjct: 56 TTQCSNGDADMLYVRQWNLKKIRENCEKLYKVKPQPRKLYTEYGT--------RFWNSSN 107
Query: 433 IIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEII 492
IIF NG DPW+ G L ++TV+ ++ E AH DL F D L R +E +
Sbjct: 108 IIFSNGEFDPWTSLGYLSPKTETVIPILIGESAHQEDLAFGAPADRHDLTRAREQERRHV 167
Query: 493 GKWISQ 498
KWI +
Sbjct: 168 RKWIEE 173
>gi|158298286|ref|XP_001689125.1| AGAP004013-PA [Anopheles gambiae str. PEST]
gi|157014451|gb|EDO63478.1| AGAP004013-PA [Anopheles gambiae str. PEST]
Length = 496
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 123/518 (23%), Positives = 213/518 (41%), Gaps = 67/518 (12%)
Query: 12 LLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSS---KDSQ--GLYKTKYHTQILDHFN 66
LL ++ T AK+ + P +TP L+ L S++ K SQ ++ T +DHFN
Sbjct: 7 LLLAACGTALAAKL--STPPKALTPSFLNRLRSATVGLKPSQRNANITEEFFTTEVDHFN 64
Query: 67 YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG-FMYDVAPKFKAL 125
Q T+ RYL H+ P+ ++ + ++ + G + ++A
Sbjct: 65 --NQDLTTWSNRYLALMDHF---VEGGPMLIFLTGDAPLDPSMIDDGTLINEMARDLGGA 119
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTATDSPV- 184
+ +E R+YGKS P G + L++ Q LAD A ++ L++ T ++P
Sbjct: 120 VFALETRFYGKSQPVGD------LTVESLRLLNTDQILADVADFVVHLRR--TVINNPFA 171
Query: 185 --VVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDNIVSPYSFSNIITQDFRS 239
+V G GG LA WFR++YPH+ +SS + +F S ++ + I
Sbjct: 172 HPLVTGTGLGGGLATWFRVRYPHLVDATWSSSGYIQAVFDFQEFSSGWAETAITVG---- 227
Query: 240 VSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICK---SEKNLAIESWLSTAFVYTAM 296
S CY I ++ + + G E L + F +C SE +A+ + S + T++
Sbjct: 228 -SNECYNRIFIAF-HVAQNLIDAGFGEVLYEKFNLCSPIDSEDRMAVAYFFSV--LMTSI 283
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAI--DDPKTGNDVFAKLYGAASV---------YYNYS 345
Y + ++ K +C I +D T D FA + V + +
Sbjct: 284 ELYTLRNGNIDEF-----KTVCDDITNNDFTTSLDAFANWFNQQFVSDAGCIIVSFDQFI 338
Query: 346 GTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYG 405
T K + + G ++ +Q CTE D+ F E + C+ +G
Sbjct: 339 ETLKETSASAEISMTGERQFLYQQCTEYGWFITTDSDLQPFGE-RVTMELYLEMCRRVFG 397
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASN------IIFFNGLRDPWSGGGVLKNISKTVLAL 459
+WI+ E + +RF + I F NG DPW V+ +++ LA
Sbjct: 398 -----DWISLELMFQSTTRMNERFGGDRPNVMQIHFTNGAFDPWRYLSVVSDLNAYALAD 452
Query: 460 VEKEGAHHVDL-RFSTKEDPQWLKDVRRREVEIIGKWI 496
V DL S + D L +V+RR E++ ++
Sbjct: 453 VIPGELAGADLGAISEENDSTELVEVKRRLKELLESYL 490
>gi|449548946|gb|EMD39912.1| hypothetical protein CERSUDRAFT_81235 [Ceriporiopsis subvermispora
B]
Length = 555
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 118/500 (23%), Positives = 197/500 (39%), Gaps = 86/500 (17%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQ 110
+ + TQ LDHF+ + F+QRY IN H+ S NAP+ V G E G+
Sbjct: 85 FPAHWFTQPLDHFS---NTTSKFRQRYWINTRHYK-SGTNAPVIVLDGGETSGEDRLPFL 140
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
+TG + +A + V +EHR S+P + + + +L++ QA AD A+ +
Sbjct: 141 DTGIVEILAKATGGVGVVLEHR----SLP------VTELSTDSLRWLNNDQAAADSANFM 190
Query: 171 IDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
++K +++TA P + +GGSY G AA ++ YP + GA+ASSA +
Sbjct: 191 ANVKFPGIDEDITAPGHPWIYYGGSYAGARAAHMKILYPELVYGAIASSAVT---HATLE 247
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLAIES 285
+ + II R+ C ++ + + I+ K P ++L+ F + + + S
Sbjct: 248 NWEYMEIIR---RAADPKCSHSLEVAIESIDTVLKLPVFGQRLKALFGLADLQHDEDFVS 304
Query: 286 WLSTAF-VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAAS----- 339
L + + A N+ + + E C A++ P G+ A + S
Sbjct: 305 LLESPLGAWQA-------KNWDPEVGSTRFDEFCNALNKPVFGSPYSASEHMVTSEGGLT 357
Query: 340 ---VYYNYSGTAK---------------CFDLNGDSDPHGLSE------WGWQACTEMIM 375
V +NY+ K CF D G S W +Q CTE
Sbjct: 358 VPLVVHNYAKYIKEHVVSRCPEDMSVEDCFGTWDDIKFQGTSTDETWRLWVFQVCTEWGY 417
Query: 376 LTGGDNKDSIFEESE-EDYDARARYCKEAY------GVDPRPNWITTEFGGHKIGLVLKR 428
+ K S D ++ C++AY V PN T G+
Sbjct: 418 FSTAPPKGHPRIVSRLLTLDYESKVCQQAYPPGKHFAVPLLPNVTTVNVLGN-----FDI 472
Query: 429 FASNIIFFNGLRDPWS----GGGVLKNISKTVLALVE--KEGAHHVD---LRFSTKEDPQ 479
A + +G DPW + T+L + G HH D LR +E P+
Sbjct: 473 AADRLAIIDGEVDPWRPDTPHSEYATDREDTLLRPFKLIPNGVHHYDEFGLRNMYEEPPE 532
Query: 480 WLKDVRRREVEIIGKWISQY 499
L+ + +E + W+ +
Sbjct: 533 ILQ-IHEEMIEFVTAWLKDW 551
>gi|4406810|gb|AAD20118.1| unknown protein [Arabidopsis thaliana]
Length = 365
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 89/369 (24%), Positives = 149/369 (40%), Gaps = 64/369 (17%)
Query: 165 DYASLIIDLKKNLTA--TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDN 222
DY + ++ K N+++ +D+P FG SY G L+AWFRLK+PH+ G+LASSA +
Sbjct: 27 DYINESLNKKLNISSGGSDNPWFFFGISYSGALSAWFRLKFPHLTCGSLASSAVVR---- 82
Query: 223 IVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKNLA 282
+ Y FS Q S + C ++ + K +E GL+ KA + + L
Sbjct: 83 --AIYEFSEFDQQIGESAGQECKLALQETNKLLEL------GLKVKNKAVKSLFNATELD 134
Query: 283 IES--WLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA-------- 332
+++ TA Y P PL ++ K G+D+
Sbjct: 135 VDADFLYLTADAAVMAFQYGNPDKLCVPL-----------VEAKKNGSDLVVTYSTYVRE 183
Query: 333 ---KLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKDSIFEES 389
+++G YN + DS W +QACTE+ DS+
Sbjct: 184 YCMRIWGLRVRTYNRKHLRNTV-VTADS---AYRLWWFQACTELGYFQVAPKYDSV-RSH 238
Query: 390 EEDYDARARYCKEAYGVD--PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGG 447
+ + CK +G D P+ + +GG ++ A+ IIF NG DPW
Sbjct: 239 QINTTFHLDLCKSLFGKDVYPKVDATNLYYGGDRLA------ATKIIFTNGSEDPWRHAS 292
Query: 448 VLKNISKTVLALVE-KEGAHHVDLRFSTKED------------PQWLKDVRRREVEIIGK 494
+ + +++ + H D+R + P ++ VR++ VE I
Sbjct: 293 KQNSTHEMPSYIIKCRNCGHGSDIRGCPQSPMVIEGKSNNCSLPDYVNKVRQQMVEHIDL 352
Query: 495 WISQYFQDL 503
W+S+ Q +
Sbjct: 353 WLSECRQSI 361
>gi|358369966|dbj|GAA86579.1| extracelular serine carboxypeptidase [Aspergillus kawachii IFO
4308]
Length = 569
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 85/168 (50%), Gaps = 25/168 (14%)
Query: 62 LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGD----IEWFAQNTG 113
+DHF+ Y P + TF RY ++ +H+ + P+FV E D I + +Q G
Sbjct: 60 IDHFHNESRYEPHTNATFGLRYWLDTSHY---QPGGPVFVIAAGETDGSDRIPFLSQ--G 114
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL 173
+ +A + + + +EHRYYG+S P+ A +L++ QALADYA ++
Sbjct: 115 VVTQLAAAYNGVALILEHRYYGESYPF------ANLTTENIRFLTTEQALADYAYFASNI 168
Query: 174 ------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
+LTA +P + +GGSY G A+ R YP + GA++SS
Sbjct: 169 VFPGLEHLDLTAATTPWIAYGGSYAGAFVAFLRKLYPDLYWGAVSSSG 216
>gi|405962352|gb|EKC28042.1| Dipeptidyl-peptidase 2 [Crassostrea gigas]
Length = 342
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 51/81 (62%), Gaps = 2/81 (2%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNT 112
+K Y Q +DHFN+ F++RYL+ D W PIF YTGNEG IE F NT
Sbjct: 24 FKELYIDQYVDHFNFVSYGETIFKERYLLQDQWW--KPGVGPIFFYTGNEGSIEEFWDNT 81
Query: 113 GFMYDVAPKFKALLVFIEHRY 133
GF++D+AP+F AL+VF EH +
Sbjct: 82 GFVFDIAPEFNALVVFAEHMH 102
>gi|195569681|ref|XP_002102837.1| GD19287 [Drosophila simulans]
gi|194198764|gb|EDX12340.1| GD19287 [Drosophila simulans]
Length = 530
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 79/162 (48%), Gaps = 12/162 (7%)
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK-KN 176
+A + K +LV+ EHRYYG+S+P + + YL QALAD A I K +N
Sbjct: 1 MAQEHKGVLVYTEHRYYGQSVP------TSTMSTDDLKYLDVKQALADVAVFIETFKAEN 54
Query: 177 LTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQD 236
+S V++ GGSY + WF+ YP + +G ASSAP+L V + ++ Q
Sbjct: 55 PQLANSKVILAGGSYSATMVVWFKRLYPDLIVGGWASSAPLL---AKVDFTEYKEVVGQA 111
Query: 237 FRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS 277
F + + CY I+ ++E G E + R+C S
Sbjct: 112 FLQLGGQKCYDRIENGIAELESMFANKRGAEA-RAMLRLCNS 152
Score = 66.6 bits (161), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 78/361 (21%), Positives = 143/361 (39%), Gaps = 42/361 (11%)
Query: 156 YLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
Y S QALAD ++I LK+ DS VVV G SY +A W R YP + G+ ASSA
Sbjct: 190 YQSVDQALADVINVIATLKQEDKYKDSKVVVSGCSYSATMATWIRKLYPEIIRGSWASSA 249
Query: 216 PIL---NFDNIVSPYSFSNIITQDFRSV-SENCYKVIKGSWKQIEETAKKPGGLEKLQKA 271
P+L NF + + ++ + + ++ + CY +I + E + G + + K
Sbjct: 250 PLLAKVNFKD------YMKVVGESYATLGGQYCYDLIDNATSYYENLFEIGNGTQAV-KE 302
Query: 272 FRICKS-EKNLAIESWL---STAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTG 327
+C + N + W + A ++ + Y P + P ++E DD
Sbjct: 303 LNLCSNFNVNSEQDRWQIFSTIANIFAGIAQYQKPEKYDIPTYCSILREFS---DDDSVA 359
Query: 328 NDVFAKL----YGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIMLTGGDNKD 383
F + A + + G ++ + ++ W +Q C+E ++
Sbjct: 360 LSKFINWKINEHSGACLSTTFKGAVGYYEWSKENYQDSDLPWIFQTCSEFGWFQSSGSRS 419
Query: 384 SIFEES------EEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFN 437
F + E+ + ++ G+ +FGG + A+NI F
Sbjct: 420 QPFGSTFPATLYEDTCEGVFGSKYDSAGIHANIRATNDDFGGLNVN------ATNIYFVQ 473
Query: 438 GLRDPWS--GGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKW 495
G D WS G GV + A + +H D + D L +++ ++++ +W
Sbjct: 474 GALDGWSKVGAGVAQG------ATIIPYASHCPDTGSISASDSAELVASKKKLIKLVAQW 527
Query: 496 I 496
+
Sbjct: 528 L 528
>gi|344268139|ref|XP_003405920.1| PREDICTED: hypothetical protein LOC100665381 [Loxodonta africana]
Length = 431
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/185 (27%), Positives = 89/185 (48%), Gaps = 14/185 (7%)
Query: 152 STTGYLSSTQALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGAL 211
++ YLSS QALAD + + + + T + V FG SYGG LA W R+K+P + A+
Sbjct: 199 ASLHYLSSRQALADIVNFRTQIAEKMGLTRNKWVAFGCSYGGSLAVWSRIKHPDLFAAAV 258
Query: 212 ASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKA 271
SSAPI N Y + ++ + + + C++ +K + Q+ + K P KL+
Sbjct: 259 GSSAPIQAKANF---YEYLEVVQRSLATHNSTCFQAVKEVFGQVVKMLKLPKYYSKLEND 315
Query: 272 FRICKSEKNLAIESWLSTAF--------VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDD 323
F +CK + + S + AF + +A+ + N+ +F + E+C + +
Sbjct: 316 FTLCKP---MRLYSAMDKAFFLDRLMFPLKSAVQNNRNEKNYKGEQISFSIDELCDMMAN 372
Query: 324 PKTGN 328
G+
Sbjct: 373 TSLGS 377
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 60/140 (42%), Gaps = 7/140 (5%)
Query: 360 HGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGG 419
H + ++ +Q CTE D+K+ F Y + C + +G P N+ + G
Sbjct: 58 HRVRQYFYQCCTEFGFFHTTDSKNQPFTGLPLSYFVQQ--CSDFFG--PNFNYDSLNTGV 113
Query: 420 HKIGLVLKRF---ASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKE 476
F S IIF NG DPW G+ KNIS+ +LA+ + H D+
Sbjct: 114 MSTNEYYGGFNVTGSKIIFPNGSFDPWHPLGITKNISEDLLAVFIEGAVHCADVYEQKDT 173
Query: 477 DPQWLKDVRRREVEIIGKWI 496
D L R + +I+ KW+
Sbjct: 174 DSAELIQAREKIFQILQKWL 193
>gi|83774740|dbj|BAE64863.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 569
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 107/464 (23%), Positives = 175/464 (37%), Gaps = 87/464 (18%)
Query: 30 PSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGS 89
P S + S+++ + T+Y T +DH N S T+Q R+ +ND ++
Sbjct: 44 PRSLSKKNTVHSVLAKANTQIEKVTTEYITIPIDH---NDTSVGTYQNRFWVNDDYYEAG 100
Query: 90 KNNAPIFVYTGNEGDIEWFAQN-----TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNK 144
+ PI +Y E + E A+N F + A+ + EH + P K
Sbjct: 101 R---PIIMYDAGETNAESIAKNHLTSSLSFFRKILEDTHAMGIIWEH---SRDTPPEHFK 154
Query: 145 EIAYKNASTTGYLSSTQALADYASLIIDLKK------NLTATDSPVVVFGGSYGGMLAAW 198
YL++ QAL D + + +LT + +P V+ GGSY G+ AA+
Sbjct: 155 -----------YLTTKQALEDIPYFARNFSRPKFAEHDLTPSSTPWVLVGGSYAGIRAAF 203
Query: 199 FRLKYPHVAIGALASSAPILNFDNIVSPYS--FSNIITQDFRSVSENCYKVIKGSWKQIE 256
R KYP V A +SSAP+ N+ Y + ++ F ENC K I + I+
Sbjct: 204 ARNKYPDVIFAAYSSSAPVQAQLNMSIYYDQVYRGLVGHGF----ENCAKDIHAALGYID 259
Query: 257 ETAKKPGGLEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKE 316
+ ++K F +++N + F T Y N+ P ++E
Sbjct: 260 QQLSNNHTAAAIKKLFFGPGADQN------SNEGFTAALATIYSYFQNYGLDGPEGTLRE 313
Query: 317 MCKAID-DPKT----GNDVFAKLYGAASVYYNYSG------------TAKCFDLNGDS-- 357
+C+ ++ DP T G D FA + G+ V ++ C L+ +
Sbjct: 314 LCEHLEVDPTTKEAAGPDGFAPVRGSKHVAERWAAWPAFTPLVNNFMETNCRGLSDPAKP 373
Query: 358 ---------DPHGLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVD- 407
DP +S W WQ CTE + + + + C + +
Sbjct: 374 SCKLDMTYYDPDSIS-WSWQYCTEWGFYQSSNFGPHSLLSRYQTLEYQQEVCNNQFALAV 432
Query: 408 --------PRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPW 443
P+ + E+GG I SN F G DPW
Sbjct: 433 ANGVLPSYPQTEALNKEYGGWNIR------PSNTFFTGGEFDPW 470
>gi|116192591|ref|XP_001222108.1| hypothetical protein CHGG_06013 [Chaetomium globosum CBS 148.51]
gi|88181926|gb|EAQ89394.1| hypothetical protein CHGG_06013 [Chaetomium globosum CBS 148.51]
Length = 553
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 89/183 (48%), Gaps = 25/183 (13%)
Query: 47 KDSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE 102
KD Q LY + +DHF+ Y P S +TF RY + +H+ K P+ V G E
Sbjct: 44 KDLQDLYPARTLKVPVDHFHNDTLYEPHSNETFPLRYWFDASHY---KKGGPVIVLQGGE 100
Query: 103 ----GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLS 158
G + + + G + +A L V +EHRYYG+S P + + +N +L+
Sbjct: 101 TNGAGRLPFLQK--GIVAKLAQATHGLGVILEHRYYGESFP---TPDFSTENLR---FLT 152
Query: 159 STQALADYASLIIDL------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALA 212
+ QALAD A + +LT+ +P + +GGSY G A+ R YP V GA++
Sbjct: 153 TDQALADMAFFAEHVVFEGLEHLDLTSAKNPYIAYGGSYAGAFVAFLRKLYPDVYWGAIS 212
Query: 213 SSA 215
SS
Sbjct: 213 SSG 215
>gi|307108516|gb|EFN56756.1| hypothetical protein CHLNCDRAFT_144218 [Chlorella variabilis]
Length = 232
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/224 (26%), Positives = 99/224 (44%), Gaps = 21/224 (9%)
Query: 294 TAMTDYPTPSNFLN-----PLPAFPVKEMCKAIDDPKTGNDVFAKLYGAA-SVYYNYSGT 347
+M +YP S + P PA+P + +C + G++ GAA SV +N + +
Sbjct: 2 ASMGNYPYRSAYFTGNPDFPYPAWPARVVCSQLAGSFQGDEELLAAGGAAISVIFNVTQS 61
Query: 348 AKCFDLNGDSDPHGLSEWG---WQACTEMIM-----LTGGDNKDSIFEESEE-DYDARAR 398
C+D L G +Q CT+ + T G +D + + + A
Sbjct: 62 VPCYDYAFAQSSTSLGAPGSYSYQTCTQFQLNSIWFGTNGAPRDMFWRAATPFNRSALDA 121
Query: 399 YCKEAYGVDPRPNWITTEFGGHKIGLVLKRFA---SNIIFFNGLRDPWSGGGVLKNISKT 455
C A+G P+ + GL +FA +N++F NGL DPW G L+ ++ +
Sbjct: 122 SCVAAFGGVVLPHIGEMHL---RYGLFPDQFAAAATNVVFSNGLLDPWGSAGYLEGLAPS 178
Query: 456 VLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQY 499
+ A+V +GAHHVDL F+ DP + R + + WI +
Sbjct: 179 LPAVVLPQGAHHVDLMFADPADPPQFAEARDEIMGHVRTWIDDW 222
>gi|367020898|ref|XP_003659734.1| hypothetical protein MYCTH_2297119 [Myceliophthora thermophila ATCC
42464]
gi|347007001|gb|AEO54489.1| hypothetical protein MYCTH_2297119 [Myceliophthora thermophila ATCC
42464]
Length = 561
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 67/216 (31%), Positives = 98/216 (45%), Gaps = 27/216 (12%)
Query: 13 LFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFN----YN 68
LF ++LT+ + + R ++L + S D Q LY +DHF+ Y
Sbjct: 24 LFRTSLTLRDFRE----QLERRQARDGAALEARSSDLQDLYPAHTLQVPVDHFHNDSLYE 79
Query: 69 PQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW---FAQNTGFMYDVAPKFKAL 125
P S +TF RY + +H+ K PI V E D F Q G + +A L
Sbjct: 80 PHSSETFPLRYWFDASHY---KKGGPIIVLQSGETDGVGRLPFLQK-GIVAQLARATNGL 135
Query: 126 LVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL------KKNLTA 179
V +EHRYYG+SIP + +L++ QALAD A + +LT+
Sbjct: 136 GVILEHRYYGESIP------TPDFSTEKLRFLTTDQALADMAYFARHVVFKGLEHLDLTS 189
Query: 180 TDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
+P + +GGSY G A+ R YP V GA++SS
Sbjct: 190 AKNPYIAYGGSYAGAFVAFLRKLYPDVYWGAISSSG 225
>gi|322711665|gb|EFZ03238.1| serine peptidase [Metarhizium anisopliae ARSEF 23]
Length = 538
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 126/524 (24%), Positives = 179/524 (34%), Gaps = 91/524 (17%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEG 103
S +K S G Q++DH N F QRY N W G +PI + E
Sbjct: 40 SLTKRSHGPTHEGVFQQLIDH---NKPELGRFSQRYWYNADDWAGP--GSPIILNAPAEH 94
Query: 104 DIEWF-AQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG--YLSST 160
+ F A A ++ +EHRY+GKS P+ N +TT YL+
Sbjct: 95 EANAFHATKNSLAGRFAQTNGGAVIVLEHRYWGKSSPF--------DNLTTTNLEYLNLD 146
Query: 161 QALADY------ASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
A+ D L DL T +P V+ G SY G LAAW P SS
Sbjct: 147 NAIHDLIYFAHNVELPFDLAGTSKPTKAPWVLTGCSYAGALAAWTHHLAPGTFWAYHCSS 206
Query: 215 A---PILNFDNIVSPY--SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQ 269
A I NF P + + D + V ++ ++ K + KK GLE L
Sbjct: 207 AVVEAIPNFWKYNEPIKEAMPKNCSTDMQGVMKHIDGILSDGTKDEKHALKKKFGLESLT 266
Query: 270 KAFRICKSEKNLAIESWLSTAFVYT-------AMTDY------PTPSNFLNPLPAFPVKE 316
+ ++ W T F T M DY P S ++ PA P E
Sbjct: 267 HDDDFGAALAG-GLQKWQRTVFFKTKKPNALYQMCDYLENVFPPKSSRIISDRPAVPGPE 325
Query: 317 ---MCKAIDD-PKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPH---------GLS 363
KA+D K +V+ A Y+ + TA C D+N +P
Sbjct: 326 GVGTSKALDGFAKWSKEVYLPGECAEFGYWADNNTAACMDMNNKDNPMYTDLSVNNTANR 385
Query: 364 EWGWQACTEMIMLTGGDNKDSI--FEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHK 421
+W W C E D I D C+ + PR G
Sbjct: 386 QWYWLLCNEPFEWWQVSGPDDITGLASKHAGLDYAHMQCRNMF---PRE-------GNRT 435
Query: 422 IGLVLKRFA---------------SNIIFFNGLRDPWSG---------GGVLKNISKTVL 457
GL L R A + +++ NG DPW GG L +
Sbjct: 436 YGLKLGRTARETNRRTGGWGRVKTTRLMWVNGELDPWRAATVSADQRPGGPL-TFTPEAP 494
Query: 458 ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQ 501
V G H D+ E L+ V + + +W+ +Y++
Sbjct: 495 VWVLPGGVHCSDMLTRNAEANPALRRVVEDVLGTMKRWVDEYYK 538
>gi|393214541|gb|EJD00034.1| peptidase S28 [Fomitiporia mediterranea MF3/22]
Length = 424
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 101/422 (23%), Positives = 172/422 (40%), Gaps = 63/422 (14%)
Query: 61 ILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-----WFAQNTGF- 114
+LDHF S +TF+ RY IN T++ + P+F++ E + E + + G
Sbjct: 2 LLDHFG---NSTETFKNRYWINGTYY---EPGGPVFIFDSGEQNAEPLLPYYLQEYHGLS 55
Query: 115 -MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT----GYLSSTQALADYASL 169
+A +++ L + EHR+YG S+P+ N I +TT YL++ QAL D
Sbjct: 56 ATMRLAKRYRGLAILWEHRFYGASLPFPVNA-ITLNQENTTFDQWKYLTTDQALEDVVFF 114
Query: 170 IIDL---------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
+ + + +P + GGSY G+ A R++ P V + ASSAP+
Sbjct: 115 ANSFPTSSNDPASQPAIHPSITPWIWLGGSYPGVRGALMRVRNPEVIFASWASSAPVHAQ 174
Query: 221 DNIVSPY-----SFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRIC 275
++ S Y S + + D+ +V++ +V+ G+ + K +L +C
Sbjct: 175 VDMASYYKAAERSLTRNCSADWVAVTKCVDQVLNGTDETRRADMKFDLEFARLSGKVLLC 234
Query: 276 KSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLY 335
+ +IE L+T + Y + P F N L E DP + +
Sbjct: 235 WT---CSIE--LTTVYKYYGFAESLLP--FCNLL------ETQNFTADPLESG--ISNAH 279
Query: 336 GAASVYYNYSGTAKCFDLNG----DSDPHGLSEWGWQACTEMIMLTGGDNKDSI-FEESE 390
G + + + D + DP W WQ C+E GD + + E S
Sbjct: 280 GIQAAFDAFLAALAELDYDSIPGEPDDPVTDRSWMWQYCSEYGFYQRGDPNNPLSIETSF 339
Query: 391 EDYDARARYCKEAYGV----DPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGG 446
+ C A+ V P + I ++G ++ SN++F NG DPW
Sbjct: 340 LSLELFQEQCNSAFPVGLPTSPAVDHI-NKYGSWEMS------PSNVLFTNGEFDPWRTM 392
Query: 447 GV 448
G+
Sbjct: 393 GL 394
>gi|302503352|ref|XP_003013636.1| serine peptidase, family S28, putative [Arthroderma benhamiae CBS
112371]
gi|302666638|ref|XP_003024916.1| serine peptidase, family S28, putative [Trichophyton verrucosum HKI
0517]
gi|291177201|gb|EFE32996.1| serine peptidase, family S28, putative [Arthroderma benhamiae CBS
112371]
gi|291188993|gb|EFE44305.1| serine peptidase, family S28, putative [Trichophyton verrucosum HKI
0517]
Length = 551
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/214 (28%), Positives = 99/214 (46%), Gaps = 27/214 (12%)
Query: 74 TFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA-----QNTGFMYDVAPKFKALLVF 128
T++ R+ IN+ + K P+FV+ E + +A T F + KF + +
Sbjct: 83 TYKHRFWINEQDY---KPGGPVFVFDCGEAAGQRYADRYLFNETNFFRQLTKKFHGIGII 139
Query: 129 IEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK------NLTATDS 182
EHRYYG+S P+ E ++ YL++ QALAD D K+ +L +
Sbjct: 140 FEHRYYGESTPFPVTVETPPEHFQ---YLNNDQALADLPYFAKDFKRKSFPNDDLRPNAT 196
Query: 183 PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSE 242
P V+ GGSY GM +A+ R +YP + ASSAP+ ++ Y Q +R +
Sbjct: 197 PWVMVGGSYPGMRSAFTRDRYPETIYASWASSAPVQAQIDMAVYYE------QVYRGLVA 250
Query: 243 ----NCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
NC K I+ +++ I+ + ++K F
Sbjct: 251 YGWGNCTKDIRAAYRYIDRQLSRNDTAAAIKKLF 284
>gi|71002774|ref|XP_756068.1| extracelular serine carboxypeptidase [Aspergillus fumigatus Af293]
gi|66853706|gb|EAL94030.1| extracelular serine carboxypeptidase, putative [Aspergillus
fumigatus Af293]
Length = 572
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 82/166 (49%), Gaps = 21/166 (12%)
Query: 62 LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGFM 115
+DHF+ Y P + F RY + +H+ K P+F+ E G + + G +
Sbjct: 64 IDHFHNESRYEPHTRDHFNLRYWFDASHY---KEGGPVFLIAAGETNGRDRFPFLSHGIV 120
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL-- 173
+A + L V +EHRYYG+S P+ ++ KN +LS+ QA+ADYA ++
Sbjct: 121 TQLAKTYNGLGVILEHRYYGESYPFA---DLTTKNIR---FLSTEQAMADYAYFASNVVF 174
Query: 174 ----KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
NLTA P + +GGSY G A+ R YP V G ++SS
Sbjct: 175 PGLEHLNLTADAVPWIGYGGSYAGAFVAFLRKVYPEVFFGVVSSSG 220
>gi|167536011|ref|XP_001749678.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771826|gb|EDQ85487.1| predicted protein [Monosiga brevicollis MX1]
Length = 495
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 64/163 (39%), Positives = 87/163 (53%), Gaps = 23/163 (14%)
Query: 65 FNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTG----FMYDVAP 120
N++ TF RY +D H+ + F G E A N G ++ D+A
Sbjct: 90 LNHDDSRMGTFSCRYYTSDLHY--DREQGVCFFEMGGE------APNNGIGNDYIADLAK 141
Query: 121 KFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNLTAT 180
++KAL V IEHR+YG+S+P + + N YL+S QALAD A+LI + N T
Sbjct: 142 RYKALQVSIEHRFYGESVP---GDDFSVDNLH---YLTSRQALADAAALIDHV--NRTYH 193
Query: 181 DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNF 220
+ FGGSY G L+AWFR KYPH+ GAL+SS +LNF
Sbjct: 194 CRKWMAFGGSYSGALSAWFRTKYPHIIDGALSSSGVVNAVLNF 236
>gi|121717092|ref|XP_001276006.1| extracelular serine carboxypeptidase, putative [Aspergillus
clavatus NRRL 1]
gi|119404163|gb|EAW14580.1| extracelular serine carboxypeptidase, putative [Aspergillus
clavatus NRRL 1]
Length = 582
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 100/209 (47%), Gaps = 39/209 (18%)
Query: 34 ITPEKLS-SLISSSKDSQGLYKTKYHTQIL-------------DHFN----YNPQSYQTF 75
++P +L L+ ++++ + K ++ +Q L DHF+ Y P S +F
Sbjct: 22 LSPLELQFPLLHELRNAKSIEKGEFRSQALSVSFAEHNFSVPVDHFHNESRYEPHSDDSF 81
Query: 76 QQRYLINDTHWGGSKNNAPIFVYTGNEGD----IEWFAQNTGFMYDVAPKFKALLVFIEH 131
RY + +H+ K P+F+ E D + +Q G + +A + L V +EH
Sbjct: 82 NLRYWFDASHY---KEGGPVFLIAAGETDATDRFPFLSQ--GIVAQLAKTYNGLGVILEH 136
Query: 132 RYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL------KKNLTATDSPVV 185
RYYG+S P+ + +N +LS+ QALADYA ++ NLTA P +
Sbjct: 137 RYYGESYPF---VNLTVENIR---FLSTEQALADYAHFASNVAFPGLEHLNLTAGAVPWI 190
Query: 186 VFGGSYGGMLAAWFRLKYPHVAIGALASS 214
+GGSY G A+ R YP + G ++SS
Sbjct: 191 GYGGSYAGAFVAFLRKVYPDIFFGVVSSS 219
>gi|451856190|gb|EMD69481.1| hypothetical protein COCSADRAFT_32183 [Cochliobolus sativus ND90Pr]
Length = 548
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 92/181 (50%), Gaps = 24/181 (13%)
Query: 48 DSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE- 102
D LY + + +D F+ Y P S +F RY +DT++ K P+FV E
Sbjct: 49 DPSLLYPERNLSVPVDFFHNETRYEPHSNGSFNLRYWFDDTYY---KPGGPVFVLLSGET 105
Query: 103 ---GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSS 159
G + + + G ++ V L V +EHRYYGKS P +++ KN +L++
Sbjct: 106 SGVGRLPFLQK--GIVHQVIKATNGLGVILEHRYYGKSFPV---PDLSTKNMR---FLTT 157
Query: 160 TQALADYASLIIDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
Q+LA+ ++K +LTA ++P VV+GGSY G AA+ R+ YP GA++SS
Sbjct: 158 EQSLAEIDYFARNVKFDGIDADLTAPNTPWVVYGGSYAGAQAAFLRVVYPETFWGAISSS 217
Query: 215 A 215
Sbjct: 218 G 218
>gi|346980149|gb|EGY23601.1| hypothetical protein VDAG_05039 [Verticillium dahliae VdLs.17]
Length = 560
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 59/166 (35%), Positives = 80/166 (48%), Gaps = 21/166 (12%)
Query: 62 LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGFM 115
+DHF+ Y P S TF RY IN H+ + P+F+ E G+ + G +
Sbjct: 54 VDHFHNETRYEPHSNDTFPLRYWINKKHY---RPGGPVFLLASGEMTGEDRLDYLDHGII 110
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA-----SLI 170
A L + +EHRYYG S P +A + +LS+ QALAD A
Sbjct: 111 AMFAKATHGLGLVLEHRYYGTSFP------VANVSIPNLRFLSTEQALADTAFFAEHVTF 164
Query: 171 IDLK-KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
DL+ + L TD P + FGGSY G AA+ R YP V GA++SS
Sbjct: 165 PDLEHEELGPTDVPWIAFGGSYAGAFAAFLRKLYPDVFWGAISSSG 210
>gi|296822298|ref|XP_002850262.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238837816|gb|EEQ27478.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 548
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 73/283 (25%), Positives = 125/283 (44%), Gaps = 48/283 (16%)
Query: 16 STLTISNAKIFPTF---------PSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFN 66
STL ++N+K+ P + PE+ +++++ S+ + + +T F+
Sbjct: 21 STLGLANSKLSRDLHDAAMMGLDPDVALRPEQFATMMAGSQSANPVAET---------FD 71
Query: 67 YNPQ-SYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN-----TGFMYDVAP 120
P T++ R+ IN+ + K P+FV+ E + +A N T F
Sbjct: 72 VWPLFKSGTYKHRFWINEEDY---KPGGPVFVFDCGEAAGQRYANNYLYNETNFFRQFTK 128
Query: 121 KFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT-GYLSSTQALADYASLIIDLKK---- 175
KF + + EHRYYG+S P+ I+ K YL++ QALAD K+
Sbjct: 129 KFNGVGIVFEHRYYGESTPF----PISVKTPPEHFQYLNNDQALADLPYFAKSFKRAAFP 184
Query: 176 --NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNII 233
+L +P ++ GGSY GM AA+ R +YP + ASSAP+ ++ Y
Sbjct: 185 NNDLRPNATPWIMVGGSYPGMRAAFTRDRYPETIYASWASSAPVQAQIDMAVYYE----- 239
Query: 234 TQDFRSVSE----NCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
Q +R + NC K I +++ I+ + ++K F
Sbjct: 240 -QVYRGLVAYGWGNCTKDIHAAYRYIDRQLSRGDTAAAIKKLF 281
>gi|449548436|gb|EMD39403.1| hypothetical protein CERSUDRAFT_93435 [Ceriporiopsis subvermispora
B]
Length = 549
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 105/438 (23%), Positives = 171/438 (39%), Gaps = 79/438 (18%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-----WFAQNTGF-- 114
LDHF + TF+ RY INDT++ K+ P+F++ E + E + + G
Sbjct: 38 LDHFG---NTSGTFKNRYWINDTYY---KSGGPVFLFDSGEQNAEPLLPYYLQEYHGLSA 91
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYA------- 167
+A ++ L + EHR+YG S+P+ N + +L++ QAL D
Sbjct: 92 TMQLAKRYGGLAILWEHRFYGDSLPFPVNDNTTIEQWQ---FLTTEQALEDVVYFANSFS 148
Query: 168 -------SLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNF 220
D + + + +P + GGSY G+ A R++ P ASSAP+
Sbjct: 149 LLSSPKSESSSDQQNPIHPSVTPWIWLGGSYPGVRGALLRVRNPETIFAVWASSAPVHAQ 208
Query: 221 DNIVSPY-----SFSNIITQDFRSVSENCYKVIK-GSWKQIEETAKK--------PGGLE 266
++ S Y S + + D+ +V++ +++ GS +Q E + PGG +
Sbjct: 209 VDMSSYYKAAERSLTRNCSADWVAVTKYVDDILRNGSAQQQAEMKYRLLKARLDGPGGND 268
Query: 267 KLQKAFRICKSEKNLAIESWLSTA------FVYTAMTDYPTPSNFLNPLP-----AFPVK 315
F I K N++ + + Y P F N L A P++
Sbjct: 269 TGSSGFTI-KDAANISNDEAAGVLMDPLNFYQYYGFQASILP--FCNNLETRNFTALPLE 325
Query: 316 EMCKAIDDPKTGNDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSEWGWQACTEMIM 375
A D D F A NY D DP W WQ C+E
Sbjct: 326 SGLAAAHDVGVALDSFL----TALAELNYDAIPGSPD-----DPVADMSWMWQYCSEYGF 376
Query: 376 LTGGDNKDSI-FEESEEDYDARARYCKEAYGV----DPRPNWITTEFGGHKIGLVLKRFA 430
GD + + E S + + C +G+ P+ + I ++GG +
Sbjct: 377 YQRGDPNNPLSIETSFISLELFQQQCNATFGMGLPTSPQVDHI-NKYGGWNMT------P 429
Query: 431 SNIIFFNGLRDPWSGGGV 448
SN++F NG DPW G+
Sbjct: 430 SNVLFTNGEFDPWRTMGL 447
>gi|302410211|ref|XP_003002939.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261357963|gb|EEY20391.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 445
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 59/166 (35%), Positives = 80/166 (48%), Gaps = 21/166 (12%)
Query: 62 LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGFM 115
+DHF+ Y P S TF RY IN H+ + P+F+ E G+ + G +
Sbjct: 54 VDHFHNETRYEPHSNGTFPLRYWINKKHY---RPGGPVFLLASGETTGEDRLGYLDHGII 110
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL-----I 170
A L + +EHRYYG S P +A + +LS+ QALAD A
Sbjct: 111 AMFAEATHGLGLVLEHRYYGTSFP------VANVSIPNLRFLSTEQALADTAFFAEHVTF 164
Query: 171 IDLK-KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
DL+ + L TD P + FGGSY G AA+ R YP V GA++SS
Sbjct: 165 PDLEHEELGPTDVPWIAFGGSYAGAFAAFLRKLYPDVFWGAVSSSG 210
>gi|443896181|dbj|GAC73525.1| hydrolytic enzymes of the alpha/beta hydrolase fold [Pseudozyma
antarctica T-34]
Length = 655
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 78/255 (30%), Positives = 121/255 (47%), Gaps = 29/255 (11%)
Query: 39 LSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW---GGSKNNAPI 95
L + ++ S S + + YHTQ LDHF+ Q+ FQQR+ + H+ K+ +
Sbjct: 123 LVADVAPSVASHNISEPAYHTQPLDHFDNTTQA--QFQQRFFYSTRHYKPASARKHGEAV 180
Query: 96 FVYTGNEGDIEWFAQ----NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
+Y + G+ + A+ +TG + ++ + + +EHRYYG S+P E+ A
Sbjct: 181 PIYILDSGEADARARIPFLDTGILDILSEATGGIGIVLEHRYYGTSLP--NRTELGPGEA 238
Query: 152 ---STTGYLSSTQALADYASLIIDLKKNLTATDSP----VVVFGGSYGGMLAAWFRLKYP 204
+L++ QAL D A I L N+ TD+ V+ +GGSY G +A RL YP
Sbjct: 239 WGVDQLRWLTNKQALEDSADFIRHL--NIPGTDNAEKRKVIYYGGSYPGARSAHMRLLYP 296
Query: 205 HVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEE-TAKKP- 262
+ GA+ASSA + D P F I R NC + I+ + I+E A P
Sbjct: 297 ELVHGAIASSAVVAAVDEF--PEYFYPIA----RGAPTNCSQAIQAAIAGIDEIVAPNPL 350
Query: 263 -GGLEKLQKAFRICK 276
GG + + A R K
Sbjct: 351 TGGDQPDRDADRTNK 365
>gi|328854272|gb|EGG03405.1| extracellular serine carboxypeptidase [Melampsora larici-populina
98AG31]
Length = 542
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 119/524 (22%), Positives = 207/524 (39%), Gaps = 123/524 (23%)
Query: 51 GLYKTKYHTQILDHFNYN-PQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEW 107
G + + Q++ H + N +S TF QRY + T++ + P+F+ G E G
Sbjct: 64 GPFPAHHFPQLITHSDPNLDESQSTFAQRYWFDTTYY---QKGGPVFLLDGGETNGQDRL 120
Query: 108 FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYG--GNKEIAYKNA----STTGYLSSTQ 161
G + ++ + + +EHRYYG+S P+ N+ + Y N + Y S
Sbjct: 121 PYLQDGILSILSKATHGIGIILEHRYYGQSFPFKDLSNESLRYLNTRESLDDSAYFSQHI 180
Query: 162 ALADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---L 218
L + L ++TA +P + +GGSY G AA+ YP + G++ASSA I +
Sbjct: 181 VLPGHEDL------DITAPGTPWIYYGGSYAGAKAAFMMKLYPDLIWGSIASSAVIHAQV 234
Query: 219 NFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSE 278
+F P E C + + + I+ RI S
Sbjct: 235 DFWQYYEPIRI---------HAPETCIEPLIIITRSID----------------RILLSN 269
Query: 279 KNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFA------ 332
++AI S L F +TD+ +F+N L A P+ + DP+ N F
Sbjct: 270 DSMAIMS-LKDLFGLANVTDH---RDFVNVL-ASPIGTWQERNWDPRISNHEFETYCDSL 324
Query: 333 ---------KLYGAA----------------------SVYYNYSGTAKCFDLN------- 354
K + + S Y S ++KC L+
Sbjct: 325 KRNPSPEPIKTFSTSLSLLQRFFEVEENFPLDSLLGYSNYIKNSISSKCEKLDQDECFGT 384
Query: 355 GDSDPHGLS-------EWGWQACTEMIMLTGGDN--KDSIFEE-SEEDYDARARYCKEAY 404
G+ H + W WQ CTE +DS+ + +Y++R C+ A+
Sbjct: 385 GNLTAHQIDSLDQTWRSWMWQVCTEWGYFQNSSPGLQDSLVSKLITLEYNSRP--CQLAF 442
Query: 405 G--VDPRPNWI-TTEFGGHKIGLVLKRFASNIIFFNGLRDPW----SGGGVLKNISKTVL 457
G + PN ++G + + ++ + F +G DPW +L+N +T+
Sbjct: 443 GSNIPKTPNTTQVNQYGDYDLD------SNRLAFIDGSHDPWIYMTVHSPLLQN--RTIR 494
Query: 458 -ALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
V G HH D + +P+ +++V +REV + +W+ Q++
Sbjct: 495 DGFVIDGGIHHWDENGNGDGEPEGIRNVHQREVGWVREWVRQFY 538
>gi|346327475|gb|EGX97071.1| extracelular serine carboxypeptidase, putative [Cordyceps militaris
CM01]
Length = 594
Score = 78.6 bits (192), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 127/503 (25%), Positives = 189/503 (37%), Gaps = 82/503 (16%)
Query: 62 LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE----GDIEWFAQNTG 113
+DHF+ Y P S+ TF RY ++ TH+ + P+ V E G + + + G
Sbjct: 60 VDHFHNETKYAPHSHDTFPLRYWLDTTHY---RPGGPVIVLHSGEFDSAGRLAYL--DHG 114
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAST--TGYLSSTQALADYASLII 171
+ +A L + +EHRYYG S P +AST +L++ QALAD A
Sbjct: 115 IVPLLAAATGGLGLVLEHRYYGTSWPV--------PDASTHHMRFLTTAQALADTAYFAR 166
Query: 172 DL------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
+ NLTA +P +++GGSY G LAA R YP V G ++SS D
Sbjct: 167 HVAFPGLEHVNLTAPAAPWIIYGGSYAGGLAAMARKLYPDVFWGGISSSGVTAAVDRFWE 226
Query: 226 PYSFSNIITQDFRSVSENCYKVI------KGSWKQIEE--TAKKPGGLEKLQKAFRICKS 277
+ D S ++ I G ++++ T GL+ + A I +
Sbjct: 227 YHEAFRHFAPDGCSDAQQALTDIVDAILFGGDQDEVDDLKTMFHLAGLQDDEFATTITGA 286
Query: 278 EKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVF------ 331
L +W +T A++ Y L A KA K G
Sbjct: 287 LSGLQSTNW-ATEDDEDALSFYCAAITSTARLFASTAHLAAKARHFTKLGGHGHNLEQRS 345
Query: 332 AKLYGAASVYYNYSGTAKCFDLNGDSDPHGLSE----------------WGWQACTEMIM 375
A+L A N AK G S+P S+ W WQ CTE
Sbjct: 346 AQLLNWAGYIRNMDKKAKRSSCKGLSNPECYSQRHFPDETAISNDMYRPWLWQTCTEWGY 405
Query: 376 LTGGDN--KDSIFEESEEDYDA-RARYCKEAYGVDPRPNW-ITTEFGGHKIGLVLKRFAS 431
G++ KD + S A C+ + + P+ I + GG
Sbjct: 406 FQNGEDVPKDRLPLLSRAVTVAYSTSNCRRFFNITTPPDVDIINQHGGFNFSY------P 459
Query: 432 NIIFFNGLRDPWSGG-----GVLKNISKTVLA-LVEKEGAHHVD---LRFSTKED---PQ 479
+ +G +DPW G +S T LV G HH D R E P
Sbjct: 460 RLAIIDGRQDPWRAATPHADGQPDRVSTTSEPYLVIDWGVHHWDEFGARPGLHEKGLPPP 519
Query: 480 WLKDVRRREVEIIGKWISQYFQD 502
+ D +RR+VE + W+ ++ D
Sbjct: 520 QVVDNQRRQVEFVRAWLKEWRAD 542
>gi|345560973|gb|EGX44090.1| hypothetical protein AOL_s00210g251 [Arthrobotrys oligospora ATCC
24927]
Length = 635
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 67/235 (28%), Positives = 100/235 (42%), Gaps = 27/235 (11%)
Query: 48 DSQGLYKTKYHTQI-LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE 106
D LY TK+ +DH P++ + F Y + D H+ K PIF+Y E +
Sbjct: 36 DITPLYPTKHRLLTPVDH--QGPRTGEQFNLTYWVYDKHY---KKGGPIFLYLSGETTLS 90
Query: 107 WFAQNTGF----MYDVAPKFKALLVFIEHRYYGKSIPY-----GGNKEIAYKNASTTGYL 157
+ T +YD+ KF L + ++HRYYG S P G + A YL
Sbjct: 91 DYVAGTFLNGSRVYDLQEKFGGLGIALQHRYYGDSTPQSAWGTGASGITIDTPAEKLRYL 150
Query: 158 SSTQALAD---------YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAI 208
+ AL D Y S + +L SP VV GGSY G +A++ R YP
Sbjct: 151 RTDLALQDVKFLADNFNYTSERVPAGTDLRGKGSPWVVLGGSYAGNMASYLRKLYPDTFF 210
Query: 209 GALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPG 263
A AS AP+ + P + +I+ + S C K + + + I++ K G
Sbjct: 211 AAYASGAPVEA--RTMMPM-YWDIVAKSIGSTEPACVKNMNSAMRYIDQELAKGG 262
>gi|134077081|emb|CAK45422.1| endoprotease Endo-Pro-Aspergillus niger
gi|350629684|gb|EHA18057.1| S28 peptidase [Aspergillus niger ATCC 1015]
Length = 526
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 124/524 (23%), Positives = 193/524 (36%), Gaps = 80/524 (15%)
Query: 33 RITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNN 92
R+ P+ +S SS S Y Q+LDH +NP+ TF QRY + +WGG
Sbjct: 26 RLVPKPISRPASSK--SAATTGEAYFEQLLDH--HNPEK-GTFSQRYWWSTEYWGGP--G 78
Query: 93 APIFVYTGNEGDI-----EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIA 147
+P+ ++ N G++ E + N A + + ++ IEHRY+G S PY E+
Sbjct: 79 SPVVLF--NPGEVSADGYEGYLTNDTLTGVYAQEIQGAVILIEHRYWGDSSPY----EVL 132
Query: 148 YKNASTTGYLSSTQAL------ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRL 201
NA T YL+ Q++ A+ L D A ++P V+ GGSY G L AW
Sbjct: 133 --NAETLQYLTLDQSILDMTYFAETVKLQFDNSSRSNAQNAPWVMVGGSYSGALTAWTES 190
Query: 202 KYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKK 261
P A+SAP+ Y F + +++NC K + + +++ K
Sbjct: 191 IAPGTFWAYHATSAPVEAI------YDFWQYFYPIQQGMAQNCSKDVSLVAEYVDKIGKN 244
Query: 262 PGGLEK--LQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPTP-SNFLNPLPAFPVKEMC 318
E+ L++ F + E + L D+ T S+F A E
Sbjct: 245 GTAKEQQELKELFGLGAVEHYDDFAAVLPNGPYLWQDNDFVTGYSSFFQFCDAVEGVEAG 304
Query: 319 KAIDDPKTG--------------NDVFAKLYGAASVYYNYSGTAKCFDLNGDSDPHGLS- 363
A+ G N Y A+ Y+ + CFD S P
Sbjct: 305 AAVTPGPEGVGLEKALANYANWFNSTILPNYCASYGYWTDEWSVACFDSYNASSPIFTDT 364
Query: 364 --------EWGWQACTEMIML--TGGDNKDSIFEESEEDYDARARYCK--------EAYG 405
+W W C E G S R C YG
Sbjct: 365 SVGNPVDRQWEWFLCNEPFFWWQDGAPEGTSTIVPRLVSASYWQRQCPLYFPEVNGYTYG 424
Query: 406 VDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRDPW--SG-------GGVLKNISKTV 456
N + G + R + +I+ NG DPW SG GG L + +
Sbjct: 425 SAKGKN--SATVNSWTGGWDMTRNTTRLIWTNGQYDPWRDSGVSSTFRPGGPLVSTANEP 482
Query: 457 LALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYF 500
+ ++ G H DL + ++ V EV+ I +W+ +Y+
Sbjct: 483 VQIIPG-GFHCSDLYMEDYYANEGVRKVVDNEVKQIKEWVEEYY 525
>gi|156058175|ref|XP_001595011.1| hypothetical protein SS1G_04819 [Sclerotinia sclerotiorum 1980]
gi|154702604|gb|EDO02343.1| hypothetical protein SS1G_04819 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 440
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 78/277 (28%), Positives = 124/277 (44%), Gaps = 36/277 (12%)
Query: 41 SLISSSKDSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIF 96
S IS S+ YK Q +DHF Y P + TF+QRY + T++ K PI+
Sbjct: 4 SAISISRREALAYKAHTIDQPIDHFPNDPMYAPHTNATFKQRYWFDATYY---KPGGPIY 60
Query: 97 VYTGNEGDIEWFAQN--TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT 154
+Y G E + ++ N TG + + L + +E+RYYG+S P+ N STT
Sbjct: 61 LYIGGETNGQYRFSNLQTGIIQILMEATNGLGIILENRYYGESFPF---------NTSTT 111
Query: 155 ---GYLSSTQALADYASL-----IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHV 206
YL++ Q +AD A + + ++TA ++ +++GGS G A YP V
Sbjct: 112 DQLAYLTNQQTVADNAYFAQHVSLPGVNASITAPNTKWILYGGSLAGGQTALSVKIYPEV 171
Query: 207 AIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEE--TAKKPGG 264
G +ASSAPI +V + N I R ++C I G + + +A
Sbjct: 172 FFGGIASSAPI---KAVVGYPEWYNPIQ---RLGPQDCISSINGIIDKFDALISANNTQA 225
Query: 265 LEKLQKAFRICKSEKNLAIESWLSTAFVYTAMTDYPT 301
+++ + F + N ++ AF DYPT
Sbjct: 226 IKQFKSLFGLEALTDNRDFA--MTIAFPLGGPMDYPT 260
>gi|339232872|ref|XP_003381553.1| serine carboxypeptidase S28 family protein [Trichinella spiralis]
gi|316979630|gb|EFV62393.1| serine carboxypeptidase S28 family protein [Trichinella spiralis]
Length = 484
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 97/201 (48%), Gaps = 23/201 (11%)
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHW--GGSKNNAPIFVYTGNEGDIEWFAQNTGFMYD 117
Q L HF+ QT+ Q + D H GG+ +++ + + + + G + +
Sbjct: 60 QRLSHFDSTIN--QTWNQSSTVCDLHHQKGGA---VVVYIQSRDSPSVPSCTYSAGLLSE 114
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKKNL 177
++ + A++V R++G + P G + YLS + LAD A L+ L+
Sbjct: 115 ISKQLNAVVVTFVPRFFGINKPTGS------ASVDNLKYLSVEEVLADLAHLVHSLRSKY 168
Query: 178 TATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI---LNFDNIVSPYSFSNIIT 234
+ VVV G ++GG LA WFRLKYPH+ GA+AS AP+ L F +V I
Sbjct: 169 PDSGKTVVV-GTAHGGNLAIWFRLKYPHLCDGAIASGAPLETTLGFGRLVDG------IF 221
Query: 235 QDFRSVSENCYKVIKGSWKQI 255
+ ++ NC + ++ S+ Q+
Sbjct: 222 ERLDNIRPNCARALRDSFAQL 242
>gi|452003194|gb|EMD95651.1| hypothetical protein COCHEDRAFT_1221399 [Cochliobolus
heterostrophus C5]
Length = 548
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 91/181 (50%), Gaps = 24/181 (13%)
Query: 48 DSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE- 102
D LY + + +D F+ Y P S +F RY +DT++ K P+FV E
Sbjct: 49 DPSLLYPERNLSVPVDFFHNEPRYEPHSNGSFNLRYWFDDTYY---KPGGPVFVLLSGET 105
Query: 103 ---GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSS 159
G + + + G ++ V L V +EHRYYG S P +++ KN +L++
Sbjct: 106 SGVGRLPFLQK--GIVHQVIKATNGLGVILEHRYYGTSFPV---PDLSTKNMR---FLTT 157
Query: 160 TQALADYASLIIDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
QALA+ ++K +LTA ++P VV+GGSY G AA+ R+ YP GA++SS
Sbjct: 158 EQALAEIDYFARNVKFDGIDADLTAPNTPWVVYGGSYAGAQAAFLRVVYPETFWGAISSS 217
Query: 215 A 215
Sbjct: 218 G 218
>gi|402083145|gb|EJT78163.1| hypothetical protein GGTG_03265 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 564
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 116/509 (22%), Positives = 195/509 (38%), Gaps = 98/509 (19%)
Query: 62 LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGFM 115
+DHF+ Y P S + FQ RY + +++ ++ PI V G E G G +
Sbjct: 65 IDHFHNESSYAPHSDEFFQLRYWFDASYY---RDGGPIIVLLGGETSGADRLPFMEKGIL 121
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL-- 173
+A + V +EHRYYG+S P + + S +L++ QALAD A ++
Sbjct: 122 AKLAEATGGVSVILEHRYYGESFP------VPDLSISNLRFLTTDQALADTAFFARNVIF 175
Query: 174 ----KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
++LT+ ++P +GGSY G AA+ R YP GA++SS L + Y
Sbjct: 176 EGFEDRDLTSGNTPYFTYGGSYAGAFAAFLRKLYPEDYWGAISSSGVTLAVADYWQYYEA 235
Query: 230 SN--------IITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRI------- 274
+ Q V +N ++ GS K G +L+ AF +
Sbjct: 236 QRLFAPPDCVVTIQKLTHVVDN---IVTGS--------NKFTGSARLRDAFGLPNVTYLP 284
Query: 275 ---------CKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPK 325
K + L + +S VY + T + P + + +++
Sbjct: 285 DFVNVLANGIKGWQELNWDLAVSNPDVYQFCNNI-TSDTLVYPQTERLTPAVRRLLEEGG 343
Query: 326 TGNDV------------FAKLYGAASVYYNYSGTAKCF------DLNGDSDPHGLSEWGW 367
G++V F L A CF D D W +
Sbjct: 344 YGDEVDRLAPRMLNYIGFVNLTEVAPCARRGQTQESCFGNRNLGDYKDDDITQTWRSWEY 403
Query: 368 QACTEM-IMLTGGDNKDSIFEESEE--DYDARARYCKEAYGVDPRPNWIT-TEFGGHKIG 423
Q CT+ LTG + D D C+ A+ + P+ + + GG
Sbjct: 404 QVCTQWGYFLTGASVPQTQLPLVSRLLDIDYLGFACEHAFNITKPPDVESINKHGG---- 459
Query: 424 LVLKRFASNIIFFNGLRDP--WSGGGVLKNISKTVLA-----LVEKEGAHHVD----LRF 472
L + +G DP W+G + ++T A L++ + HH D L
Sbjct: 460 --LNFSYPRVALIDGEADPWLWAGTHAPEAPARTSTASEPFILIKGKAVHHWDENGLLAN 517
Query: 473 STKED--PQWLKDVRRREVEIIGKWISQY 499
T + P+ + +V+RRE+EI+ +W+ ++
Sbjct: 518 ETTAELPPRQIVEVQRREMEIVREWMKEW 546
>gi|302795496|ref|XP_002979511.1| hypothetical protein SELMODRAFT_419277 [Selaginella moellendorffii]
gi|300152759|gb|EFJ19400.1| hypothetical protein SELMODRAFT_419277 [Selaginella moellendorffii]
Length = 905
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/107 (44%), Positives = 64/107 (59%), Gaps = 19/107 (17%)
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI------ 170
D+A +FKA +V +EHRYYG S P K YL+ Q+LAD+A+ I
Sbjct: 618 DLARQFKAGVVTLEHRYYGYSFPSKDFK-----------YLTVEQSLADHAAFIEYYQTF 666
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
I+ K N A + +V GGSY G L+AWFRLKYPH+ +G+ ASSA +
Sbjct: 667 INKKCNKHA--NKWIVIGGSYSGALSAWFRLKYPHLVVGSWASSAVV 711
>gi|119482365|ref|XP_001261211.1| extracelular serine carboxypeptidase, putative [Neosartorya
fischeri NRRL 181]
gi|119409365|gb|EAW19314.1| extracelular serine carboxypeptidase, putative [Neosartorya
fischeri NRRL 181]
Length = 572
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 81/166 (48%), Gaps = 21/166 (12%)
Query: 62 LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGFM 115
+DHF+ Y P + F RY + +H+ K P+F+ E G + + G +
Sbjct: 64 IDHFHNESRYEPHTGDHFNLRYWFDASHY---KEGGPVFLIAAGETNGRNRFPFLSHGIV 120
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL-- 173
+A + L V +EHRYYG+S P+ + KN +LS+ QA+ADYA ++
Sbjct: 121 TQLAKTYNGLGVILEHRYYGESYPFAN---LTTKNIR---FLSTEQAMADYAYFASNVVF 174
Query: 174 ----KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
NLTA P + +GGSY G A+ R YP V G ++SS
Sbjct: 175 PGLEHLNLTADTVPWIGYGGSYAGAFVAFLRKVYPEVFFGVVSSSG 220
>gi|451856412|gb|EMD69703.1| hypothetical protein COCSADRAFT_76240 [Cochliobolus sativus ND90Pr]
Length = 537
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 113/432 (26%), Positives = 173/432 (40%), Gaps = 78/432 (18%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIE-----WFAQNTGFMY 116
L+HF + TF RY + + K+ PIF+Y E D E + + F
Sbjct: 32 LNHFGSDAG---TFLNRYWVYSEKY---KSGGPIFIYDTGESDAEPSSAFFLSDPRSFFN 85
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTT--GYLSSTQALADYASLIID-- 172
+ +F + + EHR YGKS+P E + + S YL+ QALAD +
Sbjct: 86 QILEEFNGIGIAWEHRMYGKSVP-----ENFFNDTSLDRFKYLTVPQALADVDAFAKQFS 140
Query: 173 ---LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
+ LTA + V G SY G AAW R KYP + A++A + ++ S Y+
Sbjct: 141 LPYINATLTADHTLWVFIGASYSGGRAAWVRNKYPDTIYASWAAAATVEAMVDM-SYYAD 199
Query: 230 SNIITQDFRSVSENCYKVIKGSWKQIEETAK-KPGGLEKLQKAFRICKSEKNLAIESWLS 288
+ + R NC + I+ + + I+ P KL++ F + N++ ++
Sbjct: 200 AVWTGMNARGFG-NCTRDIQAAIRYIDHIMDTDPQATAKLKEQF-LGIGSANISNVDFVG 257
Query: 289 TAFVYTA------MTDYPTPSNFLNPLPAFPVKEMCKAID-DPKTGN----DVFAKLYGA 337
T V+ + M Y F +++ C ++ DPKT D +A GA
Sbjct: 258 TFDVFYSDWQIYDMDGY-----------RFSLRKFCDMLETDPKTNQTAPKDGWAPTKGA 306
Query: 338 AS------VYYNYSGT------AKCFDLNGDS-----------DPHGLSEWGWQACTEMI 374
S VY +YS T A+C NG S DP+ LS W WQACTE
Sbjct: 307 KSVVDKWAVYPDYSKTVIQDSDAQC-ATNGTSSNNCSSDVQTVDPNYLS-WRWQACTEWG 364
Query: 375 MLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPR---PNWITTEFGGHKIGLVLKRFAS 431
L + E +++ C + + P P W + +G R S
Sbjct: 365 YLQSANVGPYQLVSKYETLESQRDACHQLFDKSPPSVFPEWPKVQEFNKALGGWHIR-PS 423
Query: 432 NIIFFNGLRDPW 443
N + G DPW
Sbjct: 424 NTYWSGGEFDPW 435
>gi|310800016|gb|EFQ34909.1| serine carboxypeptidase S28 [Glomerella graminicola M1.001]
Length = 558
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 120/501 (23%), Positives = 195/501 (38%), Gaps = 86/501 (17%)
Query: 62 LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGFM 115
+DHF+ Y P S + F RY + H+ + P+ + E G + G +
Sbjct: 63 IDHFHNDSRYEPHSDEYFNLRYWFDAKHY---RPGGPVIILAAGETDGKDRLPFLDHGIL 119
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD------YASL 169
+A + V +EHRYYGKS P +++ +N +LS+ QALAD + S
Sbjct: 120 SILAKATGGVGVVLEHRYYGKSFPV---PDLSTENLR---FLSTDQALADTVYFAKHISF 173
Query: 170 IIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSF 229
NLTA +P +V+GGSY G AA+ R YP V G ++SS + Y
Sbjct: 174 PGHEDLNLTAPGTPYLVYGGSYAGAFAAFLRKLYPDVFWGGISSSGVTAAIIDYWEYYEG 233
Query: 230 SNIITQ-DFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSEKN-------- 280
+ + D ++N +V+ E KP + +L++ F + + +
Sbjct: 234 ARLFAPGDCAETTQNLTQVVDNVLLAKSEQV-KPSHVSQLKELFGLGPLQDDDFASTISY 292
Query: 281 ----LAIESW----LSTAF---VYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGND 329
L +W ST+F T +D + + PA V+E+ +G D
Sbjct: 293 GISGLQSTNWDPASDSTSFGTYCATVSSDSVLFGSTRHLRPA--VEELLFV-----SGAD 345
Query: 330 V--FAKLYGAASVYYNY-------SGTAKCFDLNGDSDPHGLS-------EWGWQACTEM 373
V +L A NY KCF +S W +Q CTE
Sbjct: 346 VKLTNRLLNYAGYVRNYVKKGCRGGDLVKCFSSRKREGYQNISVHQGYERSWFYQVCTEW 405
Query: 374 IMLTGGDN--KDSIFEESEE-DYDARARYCKEAYGVDPRPNW-ITTEFGGHKIGLVLKRF 429
G +D + S D + + YC+EA+ + PN + GG
Sbjct: 406 GYFQTGSGVPEDQLPLVSRVIDVEYSSIYCREAFNITKPPNVDAINKHGGFNFSY----- 460
Query: 430 ASNIIFFNGLRDPWSGG-----GVLKNISKTVLALVEKEGAHHVDLRFSTKED------P 478
+ +G DPW G+ + + + L+ + G HH D E+ P
Sbjct: 461 -PRVAIVDGEADPWRPATPHKIGLDRKSTTSEPFLLIELGVHHWDENGVKPENVTPDFPP 519
Query: 479 QWLKDVRRREVEIIGKWISQY 499
+K V+ +EVE + W+ ++
Sbjct: 520 ASIKKVQAQEVEFVTAWMKEW 540
>gi|294876612|ref|XP_002767728.1| Lysosomal Pro-X carboxypeptidase, putative [Perkinsus marinus ATCC
50983]
gi|239869546|gb|EER00446.1| Lysosomal Pro-X carboxypeptidase, putative [Perkinsus marinus ATCC
50983]
Length = 300
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/225 (27%), Positives = 99/225 (44%), Gaps = 21/225 (9%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMY----- 116
LDHF+ + T Y ++ H+ +K+ IF G E + +G +Y
Sbjct: 30 LDHFSLVAKQ-PTIPLHYWLDTEHYDPAKDQCAIFYIMGGESPLP----ESGVIYPFISE 84
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIID-LKK 175
+A + L++ EHR+YG SIP K + Y LS Q+L D+A+++ L+
Sbjct: 85 RLAREHNGLVIESEHRFYGSSIPQSYEKSLPY--------LSVEQSLMDHATVLRHTLET 136
Query: 176 NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYS--FSNII 233
A V+ GGSY G LA FRL+YP + A ASS+P + S + + + +
Sbjct: 137 VENANRCRVIAVGGSYSGFLALAFRLRYPKLVYAAYASSSPGRLYSQEASRFDGRYYSRV 196
Query: 234 TQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKSE 278
T S+ NC + ++ E+ + +IC E
Sbjct: 197 TDAADSIRSNCSNSVIKAFDDFVHRYAGRVTFEQAKNELKICNPE 241
>gi|348554926|ref|XP_003463275.1| PREDICTED: thymus-specific serine protease-like [Cavia porcellus]
Length = 629
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 65/119 (54%), Gaps = 8/119 (6%)
Query: 27 PTFPSS---RITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLIND 83
P PS+ R E++ L S+ G + + Q LD FN + +TF QRY +ND
Sbjct: 225 PAHPSTLLLRRLGERVQRLRGSAGPDPG-PRAGWLQQPLDPFNSS--DDRTFLQRYWVND 281
Query: 84 THWGGSKNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGG 142
HW G +AP+F++ G EG + + TG +AP AL++ +EHR+YG S+P GG
Sbjct: 282 RHWAG--GDAPVFLHLGGEGSLGPGSVMTGHPEALAPALGALVISLEHRFYGLSVPAGG 338
Score = 38.9 bits (89), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 33/156 (21%), Positives = 62/156 (39%), Gaps = 16/156 (10%)
Query: 356 DSDPH----GLSEWGWQACTEMIMLTGGDNKDSIFEESEEDYDARARYCKEAYGVDPRP- 410
D++P G +W +Q CTE +N + F + + C++ +G+
Sbjct: 474 DTEPQASGLGDRQWLYQTCTEFGFYVTCENAECPFPQFPA-LPSHLELCEQVFGLSASSV 532
Query: 411 ----NWITTEFGGHKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAH 466
+ +GG G A+ ++F NG DPW V + + AL+ +H
Sbjct: 533 ARAVTQTNSYYGGQAPG------ATRVLFVNGDIDPWHMLSVTQASGGSKAALLIPGASH 586
Query: 467 HVDLRFSTKEDPQWLKDVRRREVEIIGKWISQYFQD 502
D+ DP L+ R+ + + W+ Q ++
Sbjct: 587 CADMAPERPSDPTSLRSGRQSILHQLQTWLGQVKEE 622
>gi|396459958|ref|XP_003834591.1| similar to serine peptidase [Leptosphaeria maculans JN3]
gi|312211141|emb|CBX91226.1| similar to serine peptidase [Leptosphaeria maculans JN3]
Length = 549
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 90/176 (51%), Gaps = 22/176 (12%)
Query: 52 LYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW 107
LY + + +D F+ Y P TF RY + T++ K P+FV G E D E
Sbjct: 55 LYPERNLSVPIDFFHNETRYEPHENGTFDLRYWFDATYY---KPGGPVFVLLGGETDGEG 111
Query: 108 ---FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALA 164
F Q G ++ V L V +EHRYYGKS P ++ KN +L++ Q+LA
Sbjct: 112 RLPFLQK-GIVHQVIKATGGLGVILEHRYYGKSFPV---PDLTTKNMR---FLTTEQSLA 164
Query: 165 --DYASLIID---LKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
DY + + + +LTA ++P +V+GGSY G AA+ R+ YP GA++SS
Sbjct: 165 EIDYFARHVKFTGIDADLTAPNTPWIVYGGSYAGAQAAFVRVVYPDTFWGAISSSG 220
>gi|169603686|ref|XP_001795264.1| hypothetical protein SNOG_04851 [Phaeosphaeria nodorum SN15]
gi|111066122|gb|EAT87242.1| hypothetical protein SNOG_04851 [Phaeosphaeria nodorum SN15]
Length = 353
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/184 (31%), Positives = 91/184 (49%), Gaps = 29/184 (15%)
Query: 60 QILDHF----NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN--TG 113
Q +DHF Y P + TF+QRY+ + +++ K P+F+Y G E +E N TG
Sbjct: 40 QPIDHFPESDRYVPHTNDTFKQRYVFDSSYY---KPGGPVFLYIGGETSVESRFSNLQTG 96
Query: 114 FMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG---YLSSTQALADYA--- 167
+ + KF + V +E+RYYGKS PY STT +L++ Q +AD A
Sbjct: 97 IIQILMEKFNGIGVILENRYYGKSYPY---------KTSTTDELRFLTTEQTIADNAYFR 147
Query: 168 --SLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PILNFDN 222
+ + ++L+ D P +++GGS G A+ Y + G + SSA +LN+
Sbjct: 148 QHATFPGVNESLSGPDVPWIMYGGSLAGAHTAFTMKTYNSIFAGGIGSSATTQALLNYPQ 207
Query: 223 IVSP 226
SP
Sbjct: 208 WYSP 211
>gi|361132051|gb|EHL03666.1| putative serine protease K12H4.7 [Glarea lozoyensis 74030]
Length = 577
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 135/558 (24%), Positives = 209/558 (37%), Gaps = 96/558 (17%)
Query: 6 IFLSFCLLFSSTLT------ISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHT 59
IF +F LF S T + ++ P P I P ++ I + D+ G + +
Sbjct: 49 IFSAFAALFVSGATALDGEFVHLGRLIP--PVEAIDPGLVT--IIAQNDAMG---SGFFD 101
Query: 60 QILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE---GDIEWFAQNTGFMY 116
Q+LDH N S TF+Q++ N W G +PI ++T E + + N +
Sbjct: 102 QLLDHKN---PSKGTFKQKFWWNIEFWNGP--GSPIVMFTPGEIAAANYGAYLTNATVIG 156
Query: 117 DVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADY------ASLI 170
A + K ++ +EHR++G+S PY N+ T L+ Q++AD+ A L
Sbjct: 157 LYAQEIKGAVIMVEHRFWGESSPY------QTLNSETLQLLTLEQSIADFVYFAKVAPLP 210
Query: 171 IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFS 230
D KK+ A +P V GGSY G LAAW P ASSAP+ D+ + +
Sbjct: 211 FDTKKS-NADKAPWVFSGGSYSGALAAWIESTSPGTFWAYHASSAPVQAIDDY---WQYF 266
Query: 231 NIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEK--------LQKAFRICKSEKNLA 282
+ I Q + +NC K + +++ EK LQ + L
Sbjct: 267 SPIQQ---GMPKNCSKDLSLVIDYMDKVWNTGSAAEKLALKTKFGLQGLAQPADVMATLE 323
Query: 283 IESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGAA---- 338
WL + +T T Y S F A A+ G V L G A
Sbjct: 324 YGPWLWQSNSFT--TGY---SGFFQFCDAIENVTAGAAVTPDANGVGVQTALEGYAKWTK 378
Query: 339 ---------SVYYNYSGTAKCFDLNGDSDPHGLS---------EWGWQACTEMI--MLTG 378
S Y + T +C D S+ +W W C E G
Sbjct: 379 AKLLPGFCQSYGYTDANTVECLDTYNPSNKIFTDRSVGNAIDLQWQWMLCNEPFGYWQNG 438
Query: 379 GDNKDSIFEESEEDYDARARYCKE--------AYGVDPRPNWITTEFGGHKIGLVLKRFA 430
+ R C YG P+ + H G L++
Sbjct: 439 APRGKPSIVSRLVNSAYWQRQCALFFPTVNGFTYGSAISPDNNIHQVNKHTQGWRLEK-T 497
Query: 431 SNIIFFNGLRDPW--SG-------GGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWL 481
+ +I+ NG DPW SG GG L++ + L ++ G H DLR + +
Sbjct: 498 TRLIWTNGEFDPWKTSGMSSEYRPGGPLQSTPEHPLNVIPG-GFHCSDLRLRNAQANAGV 556
Query: 482 KDVRRREVEIIGKWISQY 499
+ V +V I KW++++
Sbjct: 557 QAVVDAQVAQIVKWVAEW 574
>gi|148676289|gb|EDL08236.1| dipeptidylpeptidase 7, isoform CRA_c [Mus musculus]
Length = 179
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 34/70 (48%), Positives = 50/70 (71%)
Query: 428 RFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRR 487
+ ASNIIF NG DPW+GGG+ N+S +V+A+ + GAHH+DLR S EDP + +VR+
Sbjct: 88 KAASNIIFSNGDLDPWAGGGIQSNLSTSVIAVTIQGGAHHLDLRASNSEDPPSVVEVRKL 147
Query: 488 EVEIIGKWIS 497
E +I +W++
Sbjct: 148 ESTLIREWVA 157
>gi|238486428|ref|XP_002374452.1| dipeptidyl-peptidase II, putative [Aspergillus flavus NRRL3357]
gi|220699331|gb|EED55670.1| dipeptidyl-peptidase II, putative [Aspergillus flavus NRRL3357]
Length = 566
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 60/202 (29%), Positives = 98/202 (48%), Gaps = 18/202 (8%)
Query: 31 SSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSK 90
++R PEK++ + S+ S + + +DH + S T+Q RY ++ + K
Sbjct: 46 TARFNPEKIAE-TAISRGSGSEVPARRISIPIDHED---PSMGTYQNRYWVSADFY---K 98
Query: 91 NNAPIFVYTGNEGDIEWFAQN-----TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKE 145
P+FV EG+ AQ+ F + +F L + EHRYYG S+P+ N
Sbjct: 99 PGGPVFVLDAGEGNAYSVAQSYLGGSDNFFAEYLKEFNGLGLVWEHRYYGDSLPFPVNTS 158
Query: 146 IAYKNASTTGYLSSTQALAD---YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLK 202
++ YL+++QALAD +A +L+ SP ++ GGSY GM AA+ R +
Sbjct: 159 TPNEHFK---YLTNSQALADLPYFAEKFTLNGTDLSPKSSPWIMLGGSYPGMRAAFTRNE 215
Query: 203 YPHVAIGALASSAPILNFDNIV 224
YP + A SAP+ + N+
Sbjct: 216 YPDTIFASFAMSAPVEAWVNMT 237
>gi|409040075|gb|EKM49563.1| hypothetical protein PHACADRAFT_265104 [Phanerochaete carnosa
HHB-10118-sp]
Length = 561
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/171 (31%), Positives = 94/171 (54%), Gaps = 19/171 (11%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQ 110
++ + Q L+HF+ N + T+ QRY IN H+ +AP+ V G E G+
Sbjct: 64 FEPHWFRQPLNHFSNNSE---TWLQRYWINTRHYKPG-THAPVIVIDGGETSGENRLPFL 119
Query: 111 NTGFMYDVAPK-FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASL 169
+TG + D+ PK + V +EHRY+G+S+P + + +L++ QA AD A+
Sbjct: 120 DTG-IADILPKEIGGIGVILEHRYHGESLP------VQNFTTDSLRFLNNDQAAADSANF 172
Query: 170 IIDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
+ ++K +++T+ +P + +GGSY G +A ++ YP + GA+ASSA
Sbjct: 173 MANVKFPGVDEDITSPGNPWIYYGGSYAGARSAHMKMLYPELVYGAIASSA 223
>gi|312383569|gb|EFR28610.1| hypothetical protein AND_03264 [Anopheles darlingi]
Length = 490
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 88/180 (48%), Gaps = 18/180 (10%)
Query: 46 SKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
S+ + L ++ T++ DHF+ PQ+ TF+ Y ND + + PI+++ G +
Sbjct: 56 SEGNPRLIGARFRTRV-DHFD--PQNRDTFEFNYYSNDEFY---QPGGPIYIFVGGNFQL 109
Query: 106 EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG--YLSSTQAL 163
+ G +YD A + A L EHRYYG S P +N ST +L + Q L
Sbjct: 110 TTYYIEHGLLYDTAARDHAWLFTNEHRYYGTSTP--------VENYSTENLRFLHTEQVL 161
Query: 164 ADYASLIIDLKKNLTAT-DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPIL-NFD 221
D I L+ + ++ V++ G Y G LA W R ++P++ GA S A +L +FD
Sbjct: 162 TDLIEWIDHLRNEVVRDPNAKVILMGVGYAGALATWARQRFPNIVDGAWGSGATVLASFD 221
>gi|171684281|ref|XP_001907082.1| hypothetical protein [Podospora anserina S mat+]
gi|170942101|emb|CAP67753.1| unnamed protein product [Podospora anserina S mat+]
Length = 528
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 60/194 (30%), Positives = 94/194 (48%), Gaps = 26/194 (13%)
Query: 37 EKLSSLIS-SSKDSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKN 91
E ++LI D + LY + +DHF+ Y P + +TF RY + +H+ +
Sbjct: 9 EMRTALIHPRDTDLEKLYPAQTIQMPIDHFHNDSIYEPHTNETFLLRYWFDASHY---QP 65
Query: 92 NAPIFVYTGNE----GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIA 147
P+ V G E G + + + G + ++ L V EHRYYG+S P + +
Sbjct: 66 GGPVIVLQGGETDGAGRLRYLQK--GIVAQLSQATNGLGVIFEHRYYGESHP---TDDFS 120
Query: 148 YKNASTTGYLSSTQALADYASLIIDL------KKNLTATDSPVVVFGGSYGGMLAAWFRL 201
KN +L++ QALAD A ++ NLT+ + P + +GGSY G + A+ R
Sbjct: 121 TKNLR---FLTTDQALADQAYFAQNVVFPGLEHLNLTSHNVPYIAYGGSYAGSVVAFLRK 177
Query: 202 KYPHVAIGALASSA 215
YP V GA+ASS
Sbjct: 178 LYPDVYWGAIASSG 191
>gi|451852466|gb|EMD65761.1| hypothetical protein COCSADRAFT_140099 [Cochliobolus sativus
ND90Pr]
Length = 531
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 59/182 (32%), Positives = 87/182 (47%), Gaps = 26/182 (14%)
Query: 50 QGLYKTKYHTQILDHF----NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDI 105
Q Y Q +DHF Y P + TF+QRY + +++ K P+F+Y E
Sbjct: 35 QWAYDAHTIDQPIDHFPHSDRYVPHTNGTFKQRYFFDKSYY---KPGGPVFLYLAGETWG 91
Query: 106 EWFAQN--TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTG---YLSST 160
EW N TG + + KF L V +E+RYYG S PY N +TT +L++
Sbjct: 92 EWTLDNLQTGIIQILMKKFNGLGVILENRYYGYSFPY---------NTTTTDELRFLTTE 142
Query: 161 QALADYASLIIDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
Q +AD +K +L++ D+P V++GGS G A+ Y + G +ASSA
Sbjct: 143 QTIADNEYFRQHVKFPGVDADLSSPDTPWVMYGGSLAGSQVAFTMKMYNEIFAGGIASSA 202
Query: 216 PI 217
I
Sbjct: 203 TI 204
>gi|156058698|ref|XP_001595272.1| hypothetical protein SS1G_03361 [Sclerotinia sclerotiorum 1980]
gi|154701148|gb|EDO00887.1| hypothetical protein SS1G_03361 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 535
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 66/220 (30%), Positives = 97/220 (44%), Gaps = 26/220 (11%)
Query: 16 STLTISNAKIFPTFPSSRITP--EKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQ 73
S L + A P ++ P E +S + + + + TQ+LDH N S
Sbjct: 10 SLLVATAAAEHPFLKLRKLVPPVEADDEFPASINAATNITGSAFFTQLLDHEN---PSKG 66
Query: 74 TFQQRYLINDTHWGGSKNNAPIFVYTGNE---GDIEWFAQNTGFMYDVAPKFKALLVFIE 130
TFQQ++ N +W G +PI +T E + + N A + K +V +E
Sbjct: 67 TFQQKFWWNSENWAGP--GSPIVFFTPGEIAAAEYGAYLTNVTVTGLFAQEVKGAVVMVE 124
Query: 131 HRYYGKSIPYGGNKEIAYKNASTTG--YLSSTQALADYA------SLIIDLKKNLTATDS 182
HRY+G+S PY N +TT YL+ QA+AD+ L D + A +
Sbjct: 125 HRYWGESSPY--------DNLTTTNLQYLNLKQAIADFVHFAKTVDLPFDTNHSSNAAAA 176
Query: 183 PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDN 222
P ++ GGSY G LAAW P ASSAP+ +N
Sbjct: 177 PWILSGGSYSGALAAWTESTSPGTFWAYHASSAPVQAINN 216
>gi|326475271|gb|EGD99280.1| hypothetical protein TESG_06549 [Trichophyton tonsurans CBS 112818]
gi|326480381|gb|EGE04391.1| serine peptidase [Trichophyton equinum CBS 127.97]
Length = 551
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 59/214 (27%), Positives = 98/214 (45%), Gaps = 27/214 (12%)
Query: 74 TFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA-----QNTGFMYDVAPKFKALLVF 128
T++ R+ IN+ + K P+FV+ E + +A F + KF + +
Sbjct: 83 TYKHRFWINEQDY---KPGGPVFVFDCGEAAGQRYADKYLFNEANFFRQLTKKFHGIGII 139
Query: 129 IEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK------NLTATDS 182
EHRYYG+S P+ E ++ YL++ QALAD + K+ +L +
Sbjct: 140 FEHRYYGESTPFPVTVETPPEHFQ---YLNNDQALADLPFFAKEFKRKAFPNDDLRPNAT 196
Query: 183 PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSE 242
P V+ GGSY GM AA+ R +YP + ASSAP+ ++ Y Q +R +
Sbjct: 197 PWVMVGGSYPGMRAAFTRDRYPETIYASWASSAPVQAQIDMAVYYE------QVYRGLVA 250
Query: 243 ----NCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
NC K I+ +++ I+ + ++K F
Sbjct: 251 YGWGNCTKDIRAAYRYIDRQLSRNDTAAAIKKLF 284
>gi|71003550|ref|XP_756441.1| hypothetical protein UM00294.1 [Ustilago maydis 521]
gi|46096046|gb|EAK81279.1| hypothetical protein UM00294.1 [Ustilago maydis 521]
Length = 583
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 65/235 (27%), Positives = 106/235 (45%), Gaps = 30/235 (12%)
Query: 39 LSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHW--GGSKNNA--- 93
+ ++ + K + + YH Q LDHF+ Q+ F QR+ + H+ ++N
Sbjct: 121 VDAISPNKKSKHDIKEPAYHRQPLDHFDNTTQA--QFDQRFFYSTRHYKPASARNKGEAV 178
Query: 94 PIFVYTGNEGDIEWFAQ--NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNA 151
PI++ E D +TG + ++ + + +EHRYYG S+P
Sbjct: 179 PIYILDSGEADATARIPFLDTGILDILSKATGGIGIVLEHRYYGTSLP----NRTDLGPG 234
Query: 152 STTG-----YLSSTQALADYASLIIDLKKNLTATDSP----VVVFGGSYGGMLAAWFRLK 202
T G +L++ QAL D A I L ++ TD+ ++ +GGSY G +A RL
Sbjct: 235 DTWGVDQLRWLTNKQALEDSADFIRHL--SIPGTDNSEKRKIIYYGGSYPGARSAHMRLL 292
Query: 203 YPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEE 257
YP + GA+ASSA + D P F + R NC + I+ + I+E
Sbjct: 293 YPELVHGAIASSAVVTAVDEF--PEYFYPVA----RGAPTNCSQAIQAAIAGIDE 341
>gi|317144261|ref|XP_001820001.2| serine peptidase, family S28 [Aspergillus oryzae RIB40]
Length = 566
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 95/195 (48%), Gaps = 18/195 (9%)
Query: 31 SSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSK 90
++R PEK++ + S+ S + + +DH + S T+Q RY ++ + K
Sbjct: 46 TARFNPEKIAE-TAISRGSGSEVPARRISIPIDHED---PSMGTYQNRYWVSADFY---K 98
Query: 91 NNAPIFVYTGNEGDIEWFAQN-----TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKE 145
P+FV EG+ AQ+ F + +F L + EHRYYG S+P+ N
Sbjct: 99 PGGPVFVLDAGEGNAYSVAQSYLGGSDNFFAEYLKEFNGLGLVWEHRYYGDSLPFPVNTS 158
Query: 146 IAYKNASTTGYLSSTQALAD---YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLK 202
++ YL+++QALAD +A +L+ SP ++ GGSY GM AA+ R +
Sbjct: 159 TPNEHFK---YLTNSQALADLPYFAEKFTLNGTDLSPKSSPWIMLGGSYPGMRAAFTRNE 215
Query: 203 YPHVAIGALASSAPI 217
YP + A SAP+
Sbjct: 216 YPDTIFASFAMSAPV 230
>gi|391867859|gb|EIT77098.1| hypothetical protein Ao3042_06734 [Aspergillus oryzae 3.042]
Length = 566
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 59/195 (30%), Positives = 95/195 (48%), Gaps = 18/195 (9%)
Query: 31 SSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSK 90
++R PEK++ + S+ S + + +DH + S T+Q RY ++ + K
Sbjct: 46 TARFNPEKIAE-TAISRGSGSEVPARRISIPIDHED---PSMGTYQNRYWVSADFY---K 98
Query: 91 NNAPIFVYTGNEGDIEWFAQN-----TGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKE 145
P+FV EG+ AQ+ F + +F L + EHRYYG S+P+ N
Sbjct: 99 PGGPVFVLDAGEGNAYSVAQSYLGGSDNFFAEYLKEFNGLGLVWEHRYYGDSLPFPVNTS 158
Query: 146 IAYKNASTTGYLSSTQALAD---YASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRLK 202
++ YL+++QALAD +A +L+ SP ++ GGSY GM AA+ R +
Sbjct: 159 TPNEHFK---YLTNSQALADLPYFAEKFTLNGTDLSPKSSPWIMLGGSYPGMRAAFTRNE 215
Query: 203 YPHVAIGALASSAPI 217
YP + A SAP+
Sbjct: 216 YPDTIFASFAMSAPV 230
>gi|317030966|ref|XP_001392567.2| serine peptidase [Aspergillus niger CBS 513.88]
Length = 508
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 71/252 (28%), Positives = 112/252 (44%), Gaps = 34/252 (13%)
Query: 33 RITPEKLSSLISSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNN 92
R+ P+ +S SS S Y Q+LDH +NP+ TF QRY + +WGG
Sbjct: 26 RLVPKPISRPASSK--SAATTGEAYFEQLLDH--HNPEK-GTFSQRYWWSTEYWGGP--G 78
Query: 93 APIFVYTGNEGDI-----EWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIA 147
+P+ ++ N G++ E + N A + + ++ IEHRY+G S PY E+
Sbjct: 79 SPVVLF--NPGEVSADGYEGYLTNDTLTGVYAQEIQGAVILIEHRYWGDSSPY----EVL 132
Query: 148 YKNASTTGYLSSTQAL------ADYASLIIDLKKNLTATDSPVVVFGGSYGGMLAAWFRL 201
NA T YL+ Q++ A+ L D A ++P V+ GGSY G L AW
Sbjct: 133 --NAETLQYLTLDQSILDMTYFAETVKLQFDNSSRSNAQNAPWVMVGGSYSGALTAWTES 190
Query: 202 KYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKK 261
P A+SAP+ Y F + +++NC K + + +++ K
Sbjct: 191 IAPGTFWAYHATSAPVEAI------YDFWQYFYPIQQGMAQNCSKDVSLVAEYVDKIGKN 244
Query: 262 PGGLEKLQKAFR 273
G K Q+ +
Sbjct: 245 --GTAKEQQELK 254
>gi|328723991|ref|XP_003248003.1| PREDICTED: putative serine protease K12H4.7-like [Acyrthosiphon
pisum]
Length = 253
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 89/199 (44%), Gaps = 19/199 (9%)
Query: 90 KNNAPIFVYTGNEGDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYK 149
K P+F+ G I +G + A F A +EHRYYG S P
Sbjct: 7 KMGGPVFLLVGGSEKILHSWMISGAWIEYAQIFNAACFQLEHRYYGMSHPTDD------L 60
Query: 150 NASTTGYLSSTQALADYASLI--IDLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVA 207
N S YLS+ Q LAD A I I ++KN + V FG SY G L AW LKYPH+
Sbjct: 61 NTSNLVYLSTEQVLADLAIFINTISIEKNQLLGSAKWVGFGSSYSGSLVAWLILKYPHLV 120
Query: 208 IGALASSAPI---LNFDNIVSPYSFSNIITQDFRSVSENCYKV-IKGSWKQIEETAKKPG 263
A++SS+P+ ++F+ + Q SV Y++ I+ + K I + +
Sbjct: 121 YAAVSSSSPLTAKIHFEEYF-------MAVQKTLSVYNQKYELNIRQANKIISDQLQTDY 173
Query: 264 GLEKLQKAFRICKSEKNLA 282
G + +Q F C N A
Sbjct: 174 GAKYIQTKFNTCAHNLNNA 192
>gi|328860679|gb|EGG09784.1| extracellular serine carboxypeptidase [Melampsora larici-populina
98AG31]
Length = 603
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 121/546 (22%), Positives = 217/546 (39%), Gaps = 109/546 (19%)
Query: 33 RITPEKLSSLIS--SSKDSQGLYKTKYHTQILDHF----NYNPQSYQTFQQRYLINDTHW 86
RI P + S +SKDS Q + HF Y+P FQQRY ++ +
Sbjct: 75 RIIPSNPDLIKSRPTSKDSTATPVAHVFEQKISHFPTDPKYSPHIDGNFQQRYWVDTRFY 134
Query: 87 GGSKNNAPIFVYTGNE----GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGG 142
K P+F+ E I + Q G + ++ F + + +E RYYG S P
Sbjct: 135 ---KPGGPVFLLDAGEISGQSRIPFLQQ--GIIRLLSETFHGVGLILEMRYYGASFP--- 186
Query: 143 NKEIAYKNASTTGYLSSTQALADYASLIIDL------KKNLTATDSPVVVFGG------- 189
K+++ ++ +L + Q+LAD A ++ NLTA +P + +GG
Sbjct: 187 TKDLSTESLR---FLDTKQSLADAAYFAQNIVFPGLESHNLTAPGTPWIYYGGKFNSLSH 243
Query: 190 --------SYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVS 241
SY G AA+ ++ YP + GA+ SSA + + S+S+ ++
Sbjct: 244 NTAFCESGSYAGAKAAYMKVTYPDLIWGAIGSSA------TVKAIVSYSDYFRTVEKTAD 297
Query: 242 ENCYKVIKGSWKQIEET--AKKPGGLEKLQKAFRI---CKSEKNLAIESWLSTAFVYTAM 296
C K ++ + + ++ +K + K++ F + + +++ +WL +
Sbjct: 298 PECVKGLQVAMEIVDRLIGSKDMKAISKMKADFGVEGLSHIDDFVSLTTWLVATYQEQNW 357
Query: 297 TDYPTPSNFLNPLPAFPVKEMCKAIDDPKTGNDVFAKLYGA-ASVYYNY----------S 345
P + + E+ K KT F ++Y A A++ +Y
Sbjct: 358 DPNSDPKSRVWSQSC----EIIKKAAQLKTPRGSFPEVYYAFANLTTHYVSRKNICPTPE 413
Query: 346 GTAKCF---DLNGDSDPHGLSE----WGWQACTEMIML-----TGGDNKDSIFEESEEDY 393
CF DL+ S + L++ W +Q C E GG + S +
Sbjct: 414 DPESCFGTYDLSA-SQANDLTQTWRSWMYQCCREEAFFPTNTAVGGSSIVSRYLT----L 468
Query: 394 DARARYCKEAYG------VDPRPNW-ITTEFGGHKIGLVLKRFASNIIFFNGLRDPW--- 443
D R CK+++ + P P+ + +GG+ I A + F +G RDPW
Sbjct: 469 DYNTRQCKQSFPPGAHNRIPPIPDVSVPNSYGGYDIA------ADRLAFIDGRRDPWLYA 522
Query: 444 SGGGVLKNISKTVLA---LVEKEGAHHVDLR-----FSTKEDPQWLKDVRRREVEIIGKW 495
+ L K+ L + + G HH D S + P+ ++ ++ E+E I W
Sbjct: 523 TPHSPLAKPRKSTLTRPFYLIQNGVHHYDENGLPGGLSNQTLPEEIRAIQLFEIEFITSW 582
Query: 496 ISQYFQ 501
++ + Q
Sbjct: 583 LNDFKQ 588
>gi|302688585|ref|XP_003033972.1| hypothetical protein SCHCODRAFT_256599 [Schizophyllum commune H4-8]
gi|300107667|gb|EFI99069.1| hypothetical protein SCHCODRAFT_256599 [Schizophyllum commune H4-8]
Length = 546
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 62/236 (26%), Positives = 111/236 (47%), Gaps = 23/236 (9%)
Query: 53 YKTKYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQ 110
+ ++ Q LDHFN ++ TF QRY + H+ + P+ V G E G+
Sbjct: 63 FPEQWFEQPLDHFNN--ETGDTFLQRYWFSKRHYT-PGSGGPVIVLDGGETSGEGRLPFL 119
Query: 111 NTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLI 170
+TG + + + V +EHRYYG++ P + + +L++ Q+ AD A +
Sbjct: 120 DTGIVEILTRATGGVGVILEHRYYGETQP------VQNLTTDSLRFLNNDQSAADSAYFM 173
Query: 171 IDLK-----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVS 225
++K ++LTA +P + +GGSY G +A R+ YP + GA+ASS I+
Sbjct: 174 ANVKFDGIDEDLTAPGTPWIYYGGSYAGARSAHMRVLYPDLVYGAIASSGVT---HAIIE 230
Query: 226 PYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPG-GLEKLQKAFRICKSEKN 280
+ + I ++ NC ++ S + I++ + G + L+ F + E N
Sbjct: 231 DWEYMETIR---KAAPANCSLGLENSIQTIDDILTEGGSAADDLKALFGLAGLEHN 283
>gi|327294299|ref|XP_003231845.1| serine peptidase, family S28 [Trichophyton rubrum CBS 118892]
gi|326465790|gb|EGD91243.1| serine peptidase, family S28 [Trichophyton rubrum CBS 118892]
Length = 551
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 99/214 (46%), Gaps = 27/214 (12%)
Query: 74 TFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFA-----QNTGFMYDVAPKFKALLVF 128
T++ R+ IN+ + K P+FV+ E + +A T F + KF + +
Sbjct: 83 TYKHRFWINEQDY---KPGGPVFVFDCGEAAGQRYADRYLFNETNFFRQLTKKFHGIGII 139
Query: 129 IEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLKK------NLTATDS 182
EHRYYG+S P+ + ++ YL++ QALAD + K+ +L +
Sbjct: 140 FEHRYYGESTPFPVTVKTPPEHFQ---YLNNDQALADLPYFAKEFKRKAFPNNDLRPNAT 196
Query: 183 PVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSE 242
P V+ GGSY GM +A+ R +YP + ASSAP+ ++ Y Q +R +
Sbjct: 197 PWVMVGGSYPGMRSAFTRDRYPETIYASWASSAPVQAQIDMAVYYE------QVYRGLVA 250
Query: 243 ----NCYKVIKGSWKQIEETAKKPGGLEKLQKAF 272
NC K I+ +++ I+ + ++K F
Sbjct: 251 YGWGNCTKDIRAAYRYIDRQLSRNDTAAAIKKLF 284
>gi|90018240|gb|ABD83933.1| hypothetical protein [Ictalurus punctatus]
Length = 138
Score = 75.9 bits (185), Expect = 5e-11, Method: Composition-based stats.
Identities = 43/135 (31%), Positives = 77/135 (57%), Gaps = 8/135 (5%)
Query: 204 PHVAIGALASSAPILNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPG 263
P++ GALA+SAPI++ + F +T DF++ + C +K ++++++ A++
Sbjct: 1 PNIVAGALAASAPIVSTAGLGDSGQFFRDVTADFQNYNPACKDAVKAAFQKLQTLAQQQD 60
Query: 264 GLEKLQKAFRICK---SEKNL-AIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCK 319
++Q AF +CK S K+L + +L AF AM DYP + F++ +PAFPVK C+
Sbjct: 61 -YARIQSAFSLCKTPSSNKDLHQLNGFLRNAFTLLAMMDYPYATIFMSKMPAFPVKVACE 119
Query: 320 AIDDPKTGNDVFAKL 334
+ + G +V + L
Sbjct: 120 VMLN---GTEVLSAL 131
>gi|115385535|ref|XP_001209314.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114187761|gb|EAU29461.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 556
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 52/174 (29%), Positives = 85/174 (48%), Gaps = 21/174 (12%)
Query: 53 YKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIE 106
Y Y + +DHF+ Y P + + F RY + ++ + P+FV E G+
Sbjct: 45 YPAHYLSVPIDHFHNESRYEPHTDKHFPLRYWFDAQYY---QPGGPVFVIAAGETSGEDR 101
Query: 107 WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADY 166
+ + G + +A K+ L V +EHRYYG S P+ S +LS+ QA+ADY
Sbjct: 102 FPFLSQGIVTQLAEKYHGLGVILEHRYYGDSYPFDN------LTTSNIRFLSTEQAVADY 155
Query: 167 ASLIIDL------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
A ++ +L+ ++P + +GGSY G ++ R YP V GA++SS
Sbjct: 156 AYFASNVVFPGLDHVDLSPENTPWIAYGGSYAGAFVSFLRKLYPDVYWGAVSSS 209
>gi|239615378|gb|EEQ92365.1| extracelular serine carboxypeptidase [Ajellomyces dermatitidis
ER-3]
Length = 552
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 64/231 (27%), Positives = 103/231 (44%), Gaps = 28/231 (12%)
Query: 1 MATR-FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHT 59
MA R + L L+F+ ++ + P P + S S D Y
Sbjct: 1 MAIRSMVALRLVLVFALIQSVVASVRLPNNP--MLIGLSTPSTESDKDDLTARYPVHKIK 58
Query: 60 QILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW---FAQNT 112
+DHF Y P + + F RY + +H+ K PI V G E + E F Q
Sbjct: 59 IPIDHFRSDPRYEPHTEEKFDVRYWFDASHY---KKGGPIIVLHGGETNGEGRLPFLQK- 114
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD---YASL 169
G + ++ L V +EHRYYG+S P A + + +L++ QALAD +A
Sbjct: 115 GIVKILSEATNGLGVILEHRYYGESFP------TANLSTESLRFLTTEQALADSAYFAQN 168
Query: 170 II-----DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
++ D+ ++P +++GGSY G A+ R++YP + GA++SS
Sbjct: 169 VVFEGFEDVDLTAKGGNAPWIIYGGSYAGAQVAFLRVEYPDIFWGAISSSG 219
>gi|302890357|ref|XP_003044063.1| hypothetical protein NECHADRAFT_66929 [Nectria haematococca mpVI
77-13-4]
gi|256724982|gb|EEU38350.1| hypothetical protein NECHADRAFT_66929 [Nectria haematococca mpVI
77-13-4]
Length = 542
Score = 75.5 bits (184), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 82/166 (49%), Gaps = 21/166 (12%)
Query: 62 LDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNE--GDIEWFAQNTGFM 115
+DHF+ Y P S + F RY + ++ K P+ + E G+ + G +
Sbjct: 53 VDHFHNDTLYEPHSDEKFPLRYWFDAQYY---KEGGPVIILASGETSGEDRLPFLDHGIL 109
Query: 116 YDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK- 174
+A + V +EHRYYG S P ++ KN +LS+ QALAD A ++K
Sbjct: 110 KMLAEATNGIGVILEHRYYGTSFPV---PDLKTKNLR---FLSTEQALADTAYFAENVKF 163
Query: 175 -----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
+NLTA +P +++GGSY G AA+ R YP + G ++SS
Sbjct: 164 PGLEERNLTAASTPYIIYGGSYAGAFAAFARKIYPDIFWGGISSSG 209
>gi|261199212|ref|XP_002626007.1| extracelular serine carboxypeptidase [Ajellomyces dermatitidis
SLH14081]
gi|239594215|gb|EEQ76796.1| extracelular serine carboxypeptidase [Ajellomyces dermatitidis
SLH14081]
Length = 574
Score = 75.1 bits (183), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 64/231 (27%), Positives = 103/231 (44%), Gaps = 28/231 (12%)
Query: 1 MATR-FIFLSFCLLFSSTLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHT 59
MA R + L L+F+ ++ + P P + S S D Y
Sbjct: 1 MAIRSMVALRLVLVFALIQSVVASVRLPNNPM--LIGLSTPSTESDKDDLTARYPVHKIK 58
Query: 60 QILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW---FAQNT 112
+DHF Y P + + F RY + +H+ K PI V G E + E F Q
Sbjct: 59 IPIDHFRSDPRYEPHTEEKFDVRYWFDASHY---KKGGPIIVLHGGETNGEGRLPFLQK- 114
Query: 113 GFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD---YASL 169
G + ++ L V +EHRYYG+S P A + + +L++ QALAD +A
Sbjct: 115 GIVKILSEATNGLGVILEHRYYGESFP------TANLSTESLRFLTTEQALADSAYFAQN 168
Query: 170 II-----DLKKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
++ D+ ++P +++GGSY G A+ R++YP + GA++SS
Sbjct: 169 VVFEGFEDVDLTAKGGNAPWIIYGGSYAGAQVAFLRVEYPDIFWGAISSSG 219
>gi|225557666|gb|EEH05952.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 559
Score = 75.1 bits (183), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 63/222 (28%), Positives = 102/222 (45%), Gaps = 32/222 (14%)
Query: 10 FCLLFS-STLTISNAKIFPTFPSSRITPEKLSSLISSSKDSQGLYKTKYHTQILDHF--- 65
LLF+ L +++ ++ P S PE + D + Y ++DHF
Sbjct: 11 LALLFAFMQLAVASIRVDWQDPLSIPRPE------TDDADLKARYPVHKFKTLIDHFPSD 64
Query: 66 -NYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW---FAQNTGFMYDVAPK 121
Y P + + F RY + +H+ K P+ V G E + E F Q G + ++
Sbjct: 65 PRYEPHTGEKFDLRYWFDASHY---KKGGPVIVLHGGEANGEGRLPFLQK-GIVKILSEA 120
Query: 122 FKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDL------KK 175
L V +EHRYYG+S P A + + +L++ QALAD A ++
Sbjct: 121 THGLGVILEHRYYGQSFP------TANLSTESLRFLTTEQALADSADFAQNVVFEGFEDV 174
Query: 176 NLTAT--DSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSA 215
+LTA ++P + +GGSY G A+ R +YP + GA++SS
Sbjct: 175 DLTAKGGNAPWIAYGGSYAGAQVAFLRAQYPDIFWGAISSSG 216
Score = 40.0 bits (92), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 70/158 (44%), Gaps = 24/158 (15%)
Query: 365 WGWQACTEMIMLTGGDN--KDSIFEESEE-DYDARARYCKEAYGVDPRPNWI--TTEFGG 419
W +Q CTE L G +D + S D + + C+EA+G+ RP+ + ++GG
Sbjct: 396 WPYQVCTEWGFLQVGSTVPQDQMPMVSRLIDLEYASIMCREAFGI-YRPSDVNRVNKYGG 454
Query: 420 HKIGLVLKRFASNIIFFNGLRDPWSGGGVLKNISKTVLALVEK-----EGA-HHVDLRFS 473
I + F +G DPW +++ + +EK EGA HH D
Sbjct: 455 FDIEY------ERLAFIDGEVDPWRPATPHSEMARPRKSTLEKPFILIEGAGHHWDENGM 508
Query: 474 TKED------PQWLKDVRRREVEIIGKWISQYFQDLAQ 505
+ P+ + D + +EV+ + KW+ + Q+L Q
Sbjct: 509 LANESTPMLPPKAVADAQAQEVKFVKKWMKDWEQELRQ 546
>gi|303285344|ref|XP_003061962.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456373|gb|EEH53674.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 154
Score = 75.1 bits (183), Expect = 9e-11, Method: Composition-based stats.
Identities = 41/115 (35%), Positives = 67/115 (58%), Gaps = 6/115 (5%)
Query: 382 KDSIFEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFASNIIFFNGLRD 441
+D +++ + + AR C++ +GV PR W TT+FGG ++ AS I + NG D
Sbjct: 9 RDMFWKQPWNESEQIAR-CEKRWGVTPRVGWATTQFGGRRLS-----DASRIAWSNGDLD 62
Query: 442 PWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
PWS GV N+S +++A+ + GAHH+D +S +D + +K R E +I +WI
Sbjct: 63 PWSRLGVNANVSDSLVAVPVRGGAHHLDFMWSHPDDIESVKRARAMEGRLIAEWI 117
>gi|342878892|gb|EGU80177.1| hypothetical protein FOXB_09306 [Fusarium oxysporum Fo5176]
Length = 537
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 57/189 (30%), Positives = 89/189 (47%), Gaps = 21/189 (11%)
Query: 39 LSSLISSSKDSQGLYKTKYHTQILDHFN----YNPQSYQTFQQRYLINDTHWGGSKNNAP 94
+ +L + + G K + +DHF+ Y P S + F RY + H+ + P
Sbjct: 23 IPALSARASSDSGSIKVHNISVPVDHFHNETKYEPHSDKKFPLRYWFDAQHY---REGGP 79
Query: 95 IFVYTGNE--GDIEWFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNAS 152
+ + E G+ G + +A + V +EHRYYG S P ++ KN
Sbjct: 80 VIILASGETSGEDRIPFLEHGILKMLANATGGVGVILEHRYYGTSFPV---PDLKTKNLR 136
Query: 153 TTGYLSSTQALADYASLIIDL------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHV 206
+LS+ QALAD A + K NLTA+++P +++GGSY G AA+ R YP V
Sbjct: 137 ---FLSTEQALADTAYFAEHVKFPGLEKHNLTASNTPYIIYGGSYAGAFAAFARKIYPEV 193
Query: 207 AIGALASSA 215
G ++SS
Sbjct: 194 FWGGISSSG 202
>gi|171695588|ref|XP_001912718.1| hypothetical protein [Podospora anserina S mat+]
gi|170948036|emb|CAP60200.1| unnamed protein product [Podospora anserina S mat+]
Length = 563
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 132/522 (25%), Positives = 196/522 (37%), Gaps = 116/522 (22%)
Query: 56 KYHTQILDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEW-------- 107
++ TQ +DH NP+ T+QQ Y +N WGG +P+ + T E I W
Sbjct: 74 EWFTQPIDH--SNPE-LGTWQQLYCVNPAKWGGP--GSPVVLMTPGETPI-WGSITPSRG 127
Query: 108 --FAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALAD 165
F NT A A V IEHRY+G S PY G +A T YL+ QA D
Sbjct: 128 YSFLDNTTMTGLYAQAIGAATVVIEHRYFGGSSPYDG------FDAETLQYLTMDQAAMD 181
Query: 166 YASLIIDL------KKNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILN 219
+ ++ K AT P V FG SY L +W YP V SSA +
Sbjct: 182 MVNFAQNVVFPFEDGKTSVATKVPWVWFGTSYAATLGSWIEHTYPGVFYAFHLSSAIVQA 241
Query: 220 -------FDNI---VSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQ 269
+D I + Y T VSE K + S + E ++ L+
Sbjct: 242 NTENWYYYDTIRKGIDSYRGDTRCTVALNEVSEYVDKYLLASNRNETE-------VQALK 294
Query: 270 KAFRICKSEKNLAIESWLSTAFVYTAMTDYPTPSNFLNPLPAFPVKEMCKAI--DDPKTG 327
F ++ ++T F Y T+ + + L EMC AI +
Sbjct: 295 LLFGASFPIEDDDFAYAIATPFRYWQETN-----GYRDIL------EMCDAIVGSNESEE 343
Query: 328 NDVFAKLYGAASVYYNY------SGTAKCFDLNGDSDP--------------------HG 361
NDV + G+ Y Y T + + G DP +
Sbjct: 344 NDVLGTVPGSVGNYAAYFRMNFRQSTCEYLNTWGQEDPLWCLNTHYEWNPYFIARTLGNP 403
Query: 362 LSEWGWQACTEMIM-LTGGDNKDSIFEESEE-DYDARARYCK--------EAYGV--DPR 409
W W C E I G K+S+ S + D R C+ E YG
Sbjct: 404 WRTWYWFLCNEPIASWATGAPKESLSMVSRKIDAQYWQRQCELHFPATHGEKYGSAKGKT 463
Query: 410 PNWITTEFGGHKIGLVLKRFASNIIFFNG---LRDPWSGGGVLKNISKTVLALVEKEGAH 466
P + E GG R ++ +I+ +G + GGVL++ + L++ A
Sbjct: 464 PTTLNEETGG------WLRNSTRVIWTSGGTSMSSEIRTGGVLQSTDSVSVFLIKN--AV 515
Query: 467 HVDLRFST--------KEDPQWLKDVRRREVEIIGKWISQYF 500
H D F+T K +P+ +K V+ + V I+ KW+++Y+
Sbjct: 516 HGDDAFTTRALGNLNIKPNPEVVK-VQEQSVLIVKKWVAEYY 556
>gi|212545206|ref|XP_002152757.1| serine peptidase, family S28, putative [Talaromyces marneffei ATCC
18224]
gi|210065726|gb|EEA19820.1| serine peptidase, family S28, putative [Talaromyces marneffei ATCC
18224]
Length = 608
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/167 (32%), Positives = 83/167 (49%), Gaps = 20/167 (11%)
Query: 62 LDHFNYNPQSYQTFQQRYLINDTHWGGSKNNAPIFVYTGNEGDIEWFAQN----TGFMYD 117
+DH N+ + T++ RY + ++ K P+F+Y E AQ+ F +
Sbjct: 75 IDHDNH---TIGTYKNRYWVTTKYY---KPGGPVFLYDVGESSAYNSAQHMLGEAAFFKE 128
Query: 118 VAPKFKALLVFIEHRYYGKSIPYGG-NKEIAYKNASTTGYLSSTQALADYASLIIDLKK- 175
+F L + EHRYYG+S+P G N + +N YL+ TQA+AD D +
Sbjct: 129 FLEEFGGLGIVWEHRYYGESLPMGPINADTPAENFK---YLTHTQAIADIPYFAQDFSRP 185
Query: 176 -----NLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 217
+L+ +P ++ GGSY GM AA+ R +YP A ASSAP+
Sbjct: 186 ELPSQDLSPKGTPWIMIGGSYSGMRAAFTRDEYPQSIYAAYASSAPV 232
>gi|170040235|ref|XP_001847912.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167863800|gb|EDS27183.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 492
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 108/477 (22%), Positives = 184/477 (38%), Gaps = 71/477 (14%)
Query: 44 SSSKDSQGLYKTKYHTQILDHFNYNPQSYQTFQQRYL-INDTHWGGSKNNAPIFVYTGNE 102
+S S+ + + T +DHF+ PQ+ + RYL + D + G PI ++ G
Sbjct: 51 ASEAPSKRIVIENFFTTRIDHFD--PQNTAEWTLRYLAVTDYYQPG----GPILIWLGGN 104
Query: 103 GDIE-WFAQNTGFMYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQ 161
I+ + + +YD+A + +G + + + +L++ Q
Sbjct: 105 APIQPYMVDESSLIYDMAREM-----------------HGASWVTSDTSTENLRFLNTDQ 147
Query: 162 ALADYASLIIDLKKNLTATDSP-VVVFGGSYGGMLAAWFRLKYPHVAIGALASSA---PI 217
LAD A + L++ +T ++ V+V G YGG LA WFR++YPH+A A +S +
Sbjct: 148 ILADLAEFVTYLRREVTRNENAHVLVSGVGYGGSLATWFRVRYPHLADAAWSSGGLHNAL 207
Query: 218 LNFDNIVSPYSFSNIITQDFRSVSENCYKVIKGSWKQIEETAKKPGGLEKLQKAFRICKS 277
++F + + I DF S+ CY I ++ + + G E L + +C
Sbjct: 208 MDFQEFAEAWGQTLI---DFG--SQECYNEIFVAF-HVMQNLIDAGREEILHERLNLCTE 261
Query: 278 EKNLAIESWLSTAFVY-TAMTDYPTPSNFLNPLPAFPVKEMCKAID--DPKTGNDVFAKL 334
+ E L F + T MT +F E+C + D T D FA
Sbjct: 262 ---IDTEDRLQVQFFFITMMTSIELLDSF---------SEVCNDLTGVDTPTALDSFADW 309
Query: 335 YG---AASVYYNYSGTAKCFDLNGDSD------PHGLSEWGWQACTEMIMLTGGDNKDSI 385
+ A A D D D G + +Q CTE D+
Sbjct: 310 FNNKFHAQDDCAVIDPATFIDWLRDDDWYSPFVQMGARQIFYQECTEFGWFLTTDSDQQP 369
Query: 386 FEESEEDYDARARYCKEAYGVDPRPNWITTEFGGHKIGLVLKRFAS------NIIFFNGL 439
F + DA + C +G +WI E H RF + NI F NG
Sbjct: 370 F-GNRVTVDAYSELCTRVFG-----DWIMFESIYHGTQRANNRFGALAPNVNNIHFTNGG 423
Query: 440 RDPWSGGGVLKNISKTVLALVEKEGAHHVDLRFSTKEDPQWLKDVRRREVEIIGKWI 496
DP+ + +++ L + D+ + D + L V+RR ++ ++
Sbjct: 424 EDPFRMLSIRNDLNAQALHDLIPNELIGSDVWAISDLDSKELVAVKRRLKSVLANYL 480
>gi|449540369|gb|EMD31362.1| hypothetical protein CERSUDRAFT_89238 [Ceriporiopsis subvermispora
B]
Length = 520
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/165 (33%), Positives = 84/165 (50%), Gaps = 23/165 (13%)
Query: 59 TQILDHFNYNPQSYQTFQQRYLINDTHW--GGSKNNAPIFVYTGNEGDIEWFAQ--NTGF 114
TQ LDHF + TF+QRY ++ H+ GG PI V+ G E E +TG
Sbjct: 59 TQPLDHFE---DTGVTFEQRYWVSTRHYVPGG-----PIVVFDGGEASAEERLPILDTGI 110
Query: 115 MYDVAPKFKALLVFIEHRYYGKSIPYGGNKEIAYKNASTTGYLSSTQALADYASLIIDLK 174
+ + L + +EHRYYG S+ G N +L++ QAL D A + +++
Sbjct: 111 VDILTNATGGLGIILEHRYYGASV---GVTNFTTDNLR---WLNNDQALEDSAVFMTNVQ 164
Query: 175 -----KNLTATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASS 214
+N+TA P + +GGSY G +A +++YP + GA+ASS
Sbjct: 165 IPGISENITAPGRPWIYYGGSYAGARSAIMKVRYPDIVYGAIASS 209
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.415
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,635,877,949
Number of Sequences: 23463169
Number of extensions: 386925117
Number of successful extensions: 905079
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1029
Number of HSP's successfully gapped in prelim test: 288
Number of HSP's that attempted gapping in prelim test: 899548
Number of HSP's gapped (non-prelim): 1776
length of query: 505
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 358
effective length of database: 8,910,109,524
effective search space: 3189819209592
effective search space used: 3189819209592
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)