BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 038117
(500 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359479403|ref|XP_003632268.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Vitis vinifera]
Length = 503
Score = 560 bits (1442), Expect = e-157, Method: Compositional matrix adjust.
Identities = 276/509 (54%), Positives = 360/509 (70%), Gaps = 21/509 (4%)
Query: 3 SSQFSFQWLL----LMVIFVSTSFHANGLKLRPRLGRIRRSRILE----QKDSNHGFETF 54
S + S QWL+ L++ F +T A K+ P+L + R+ + E + ++ F+TF
Sbjct: 5 SPKVSLQWLISILVLVIFFCATCVSATQPKILPKLSVLGRTFLREPATFSESNSQDFQTF 64
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTY 112
+YNQT+DHF+Y PESY TF RYV+N K+WGG N+ PI A+LGAE +D + GF
Sbjct: 65 YYNQTLDHFNYRPESYTTFQHRYVMNFKYWGGANASAPIFAYLGAEEDLDXILSGLGFLT 124
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
+NAH+FKAL+V +EHRY GKS+PFGSR+ ALKNA RGYFNSAQA+ADYA +L++IK K
Sbjct: 125 DNAHRFKALLVYIEHRYCGKSIPFGSREEALKNASIRGYFNSAQAIADYAEVLIYIKKKL 184
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
A ++PVI +G SYGG LA+WFRLKYPHV +G LASSAPILY+ DITP Y+S+V+KDF
Sbjct: 185 LAENSPVIVVGGSYGGMLASWFRLKYPHVALGVLASSAPILYFEDITPQNGYYSIVTKDF 244
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQ 292
R+ SE C++TI++SWSEID +AS+P+G++ILSKKF+TC L N+ ELKD LDT+Y AAQ
Sbjct: 245 REASESCYKTIRESWSEIDRVASEPNGISILSKKFRTCDRLNNSDELKDYLDTIYCTAAQ 304
Query: 293 YDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDES 352
Y+ P PV C+ I+ AP G DIL +I AGVV G NS CY E+
Sbjct: 305 YNDPPMYPVTMACSGIDGAPE-GSDILSRIFAGVVAY------GGNSSCY-TTSHNPTET 356
Query: 353 DEGWEWQSCSEMVVPMGK-DKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHD 411
EGW WQ+CSEMV+P+G+ D ++M+ P P+NLT +I+ C YGVSPRP WV TYYGGH+
Sbjct: 357 SEGWAWQTCSEMVMPIGRGDNDTMFPPAPFNLTTFIQECTSIYGVSPRPHWVTTYYGGHN 416
Query: 412 IKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPD 471
I+LILRR SNIIFSNG+RDP+S GG +K + D + GSHCLD+ A +DP+
Sbjct: 417 IELILRRFASNIIFSNGLRDPYSSGGVLK--NISDSVLAILTVNGSHCLDILPATSTDPE 474
Query: 472 WLVQQRKTEVKIMQGWITQYYDDFKAINK 500
WLV QRK EV++++ WI QYY D AI K
Sbjct: 475 WLVMQRKAEVEVIESWIAQYYADLHAITK 503
>gi|225436405|ref|XP_002271797.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
Length = 510
Score = 557 bits (1436), Expect = e-156, Method: Compositional matrix adjust.
Identities = 281/503 (55%), Positives = 357/503 (70%), Gaps = 16/503 (3%)
Query: 3 SSQFSFQWLL--LMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTI 60
S S QWLL + +I T A KL PRL I R + + + F+TFFYNQT+
Sbjct: 13 SLMISLQWLLSLMFLIIFPTCATATPSKL-PRLSTILRESEIFSELISDDFQTFFYNQTL 71
Query: 61 DHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQF 118
DHF+Y PESY TF QRYV+N K+WGG N+ PI A+LGAEA +D ++ GF +NA QF
Sbjct: 72 DHFNYRPESYYTFQQRYVMNFKYWGGANASAPIFAYLGAEAALDFDLTGVGFPVDNALQF 131
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
KAL+V +EHRYYG+S+PFGSR+ ALKNA RGYFNSAQA+ADYA +L +IK K A ++P
Sbjct: 132 KALLVYIEHRYYGQSIPFGSREEALKNASTRGYFNSAQAIADYAEVLEYIKKKLLAENSP 191
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VI IG SYGG LA+WFRLKYPHV +G+LASSAPILY+ DITP Y+S+V+KDFR+ SE
Sbjct: 192 VIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQNGYYSIVTKDFREASES 251
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSN 298
C+ TI++SWSEID +AS+P+GL+ILSKKF+TC L + ELKD L+T+Y+ AAQY+ P
Sbjct: 252 CYSTIRESWSEIDRVASEPNGLSILSKKFRTCAELNKSNELKDYLETMYAVAAQYNHPPR 311
Query: 299 IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEW 358
PV +C I+ AP G DIL +I AGVV Y GN+S CY N E+ EGW W
Sbjct: 312 YPVTVVCGGIDGAPE-GSDILSRIFAGVV-----AYRGNSS-CY-NTSVNPTETSEGWRW 363
Query: 359 QSCSEMVVPMGK-DKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILR 417
Q+CSEMV+P+G+ D ++M+ P P+NLT +I+ C Y V PRP W+ TYYGGHDIKLIL
Sbjct: 364 QTCSEMVMPIGRGDNDTMFPPSPFNLTTFIQACTSLYDVPPRPHWITTYYGGHDIKLILH 423
Query: 418 RSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQR 477
R SNIIFSNG+RDP+S G +K L+ +N GSHCLD+ AK +DP+WL+ QR
Sbjct: 424 RFASNIIFSNGLRDPYSSAGVLKNISHTVLAI-HTVN-GSHCLDILPAKSTDPEWLIMQR 481
Query: 478 KTEVKIMQGWITQYYDDFKAINK 500
KTEV+I++ WI QY+ D A K
Sbjct: 482 KTEVEIIESWIAQYHADLDATRK 504
>gi|147792728|emb|CAN64374.1| hypothetical protein VITISV_018664 [Vitis vinifera]
Length = 502
Score = 556 bits (1434), Expect = e-156, Method: Compositional matrix adjust.
Identities = 281/503 (55%), Positives = 357/503 (70%), Gaps = 16/503 (3%)
Query: 3 SSQFSFQWLL--LMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTI 60
S S QWLL + +I T A KL PRL I R + + + F+TFFYNQT+
Sbjct: 5 SLMISLQWLLSLMFLIIFPTCATATPSKL-PRLSTILRESEIFSELISDDFQTFFYNQTL 63
Query: 61 DHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQF 118
DHF+Y PESY TF QRYV+N K+WGG N+ PI A+LGAEA +D ++ GF +NA QF
Sbjct: 64 DHFNYRPESYYTFQQRYVMNFKYWGGANASAPIFAYLGAEAALDFDLTGVGFPVDNALQF 123
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
KAL+V +EHRYYG+S+PFGSR+ ALKNA RGYFNSAQA+ADYA +L +IK K A ++P
Sbjct: 124 KALLVYIEHRYYGQSIPFGSREEALKNASTRGYFNSAQAIADYAEVLEYIKKKLLAENSP 183
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VI IG SYGG LA+WFRLKYPHV +G+LASSAPILY+ DITP Y+S+V+KDFR+ SE
Sbjct: 184 VIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQNGYYSIVTKDFREASES 243
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSN 298
C+ TI++SWSEID +AS+P+GL+ILSKKF+TC L + ELKD L+T+Y+ AAQY+ P
Sbjct: 244 CYSTIRESWSEIDRVASEPNGLSILSKKFRTCAELNKSNELKDYLETMYAVAAQYNHPPR 303
Query: 299 IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEW 358
PV +C I+ AP G DIL +I AGVV Y GN+S CY N E+ EGW W
Sbjct: 304 YPVTVVCGGIDGAPE-GSDILSRIFAGVV-----AYRGNSS-CY-NTSVNPTETSEGWRW 355
Query: 359 QSCSEMVVPMGK-DKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILR 417
Q+CSEMV+P+G+ D ++M+ P P+NLT +I+ C Y V PRP W+ TYYGGHDIKLIL
Sbjct: 356 QTCSEMVMPIGRGDNDTMFPPSPFNLTTFIQACTSLYDVPPRPHWITTYYGGHDIKLILH 415
Query: 418 RSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQR 477
R SNIIFSNG+RDP+S G +K L+ +N GSHCLD+ AK +DP+WL+ QR
Sbjct: 416 RFASNIIFSNGLRDPYSSAGVLKNISHTVLAI-HTVN-GSHCLDILPAKSTDPEWLIMQR 473
Query: 478 KTEVKIMQGWITQYYDDFKAINK 500
KTEV+I++ WI QY+ D A K
Sbjct: 474 KTEVEIIESWIAQYHADLDATRK 496
>gi|255565523|ref|XP_002523752.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537056|gb|EEF38692.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
Length = 501
Score = 546 bits (1407), Expect = e-152, Method: Compositional matrix adjust.
Identities = 267/495 (53%), Positives = 349/495 (70%), Gaps = 16/495 (3%)
Query: 9 QWLLLMVIFVSTSFHANGL---KLRPRLGRIRR--SRILEQKDSNHGFETFFYNQTIDHF 63
QW +L + +S S A +L P RI + I + + ETFFYNQT+DHF
Sbjct: 9 QWFVLGLFILSASSTAKPFDIPRLSPTGPRIVQDPEEIFISELVSDDLETFFYNQTLDHF 68
Query: 64 SYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKAL 121
+Y PESY+TF QRY+I+SK+WGG NS PI + GAEAP+D ++ + GF +NA QF AL
Sbjct: 69 NYNPESYETFQQRYIISSKYWGGANSSSPIFVYFGAEAPLDGDLTVIGFLADNAAQFNAL 128
Query: 122 IVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIA 181
++ +EHRYYGKSVPFGS+ ALKN RGYFNSAQA+ADYA I++H+K A ++PVI
Sbjct: 129 LLYIEHRYYGKSVPFGSQGEALKNGSIRGYFNSAQAIADYAEIIIHVKKNLQAENSPVIV 188
Query: 182 IGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQ 241
IG SYGG LA+WFRLKYPH+ +G+LASSAP+LY+ DITP Y+S+ S+DFR+ SE C++
Sbjct: 189 IGGSYGGMLASWFRLKYPHLALGALASSAPVLYFDDITPQDGYYSIASRDFREASENCYK 248
Query: 242 TIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPV 301
TI+KSW+EID +AS P GL +LSKKFKTC PL ++ ELKD LD++YS AAQY+ P PV
Sbjct: 249 TIQKSWAEIDGVASMPKGLDVLSKKFKTCKPLTDSDELKDRLDSMYSGAAQYNKPPTYPV 308
Query: 302 KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSC 361
IC+ I+ A + +D L KI AGVV Y GN S CYIN E+ GW WQ+C
Sbjct: 309 NIICSGIDGAASSSNDTLDKIFAGVV-----AYRGNRS-CYINPPTNLSETSVGWRWQTC 362
Query: 362 SEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTS 421
SEMV+P+G+ ++M+ P P++L Y+++C YGV PRP WV TYYGGH IKLIL+R S
Sbjct: 363 SEMVIPIGRGNDTMFPPSPFDLNGYVQDCNAIYGVRPRPHWVTTYYGGHSIKLILQRFGS 422
Query: 422 NIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKK-SDPDWLVQQRKTE 480
NIIFSNG+RDP+S GG ++ D + GSHCLD+ A + +DP+WLV QRKTE
Sbjct: 423 NIIFSNGIRDPYSSGGVLEDIS--DTILAVHTANGSHCLDILIANETTDPEWLVAQRKTE 480
Query: 481 VKIMQGWITQYYDDF 495
+ I++GWI++YYDD
Sbjct: 481 INIIKGWISKYYDDL 495
>gi|297734875|emb|CBI17109.3| unnamed protein product [Vitis vinifera]
Length = 975
Score = 544 bits (1401), Expect = e-152, Method: Compositional matrix adjust.
Identities = 270/488 (55%), Positives = 351/488 (71%), Gaps = 15/488 (3%)
Query: 16 IFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQ 75
+F+ + F + ++ +L I R + + + F+TFFYNQT+DHF+Y PESY TF Q
Sbjct: 1 MFIVSRF--CNINIKTKLSTILRESEIFSELISDDFQTFFYNQTLDHFNYRPESYYTFQQ 58
Query: 76 RYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKS 133
RYV+N K+WGG N+ PI A+LGAEA +D ++ GF +NA QFKAL+V +EHRYYG+S
Sbjct: 59 RYVMNFKYWGGANASAPIFAYLGAEAALDFDLTGVGFPVDNALQFKALLVYIEHRYYGQS 118
Query: 134 VPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATW 193
+PFGSR+ ALKNA RGYFNSAQA+ADYA +L +IK K A ++PVI IG SYGG LA+W
Sbjct: 119 IPFGSREEALKNASTRGYFNSAQAIADYAEVLEYIKKKLLAENSPVIVIGGSYGGMLASW 178
Query: 194 FRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNI 253
FRLKYPHV +G+LASSAPILY+ DITP Y+S+V+KDFR+ SE C+ TI++SWSEID +
Sbjct: 179 FRLKYPHVALGALASSAPILYFDDITPQNGYYSIVTKDFREASESCYSTIRESWSEIDRV 238
Query: 254 ASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPN 313
AS+P+GL+ILSKKF+TC L + ELKD L+T+Y+ AAQY+ P PV +C I+ AP
Sbjct: 239 ASEPNGLSILSKKFRTCAELNKSNELKDYLETMYAVAAQYNHPPRYPVTVVCGGIDGAPE 298
Query: 314 CGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGK-DK 372
G DIL +I AGVV Y GN+S CY N E+ EGW WQ+CSEMV+P+G+ D
Sbjct: 299 -GSDILSRIFAGVV-----AYRGNSS-CY-NTSVNPTETSEGWRWQTCSEMVMPIGRGDN 350
Query: 373 NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
++M+ P P+NLT +I+ C Y V PRP W+ TYYGGHDIKLIL R SNIIFSNG+RDP
Sbjct: 351 DTMFPPSPFNLTTFIQACTSLYDVPPRPHWITTYYGGHDIKLILHRFASNIIFSNGLRDP 410
Query: 433 FSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+S G +K L+ +N GSHCLD+ AK +DP+WL+ QRKTEV+I++ WI QY+
Sbjct: 411 YSSAGVLKNISHTVLAI-HTVN-GSHCLDILPAKSTDPEWLIMQRKTEVEIIESWIAQYH 468
Query: 493 DDFKAINK 500
D A K
Sbjct: 469 ADLDATRK 476
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 265/506 (52%), Positives = 348/506 (68%), Gaps = 22/506 (4%)
Query: 2 KSSQFSFQWL--LLMVIFVSTSFHANGLKLRPRLGRIRRS--RILEQKDSNHGF----ET 53
K + +S QWL L+ + +S A + PRLG + R R E + F +T
Sbjct: 476 KRTLYSLQWLPFLIPTLILSCCVSAAQFNV-PRLGPLSRGILRNPEPAAVSESFYKDLKT 534
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFT 111
FFY QT+DHF+Y PESY+TF QRYV+N K WGG + PI A+LGAEAP+D ++ GF
Sbjct: 535 FFYAQTLDHFNYRPESYKTFRQRYVMNFKHWGGAKAGAPIFAYLGAEAPLDGDLVNIGFV 594
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+NA +F AL++ +EHRYYGKS+PFGS K ALKNA GYFNSAQA+ADYA++L+H+K +
Sbjct: 595 NDNAARFNALLIYIEHRYYGKSIPFGSTKVALKNASTLGYFNSAQAIADYAAVLMHVKKR 654
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
+A ++PVI IG SYGG LA+WFRLKYPH+ +G+LASSAPILY+ +I P Y+S+V+KD
Sbjct: 655 LHAQNSPVIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDEIAPEIGYYSIVTKD 714
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA 291
FR+ SE C++TI++SWSEID IASKP+GL+ILSK+FKTC L+++ ELKD LD++Y+EAA
Sbjct: 715 FREASESCYRTIRRSWSEIDRIASKPNGLSILSKRFKTCAHLESSFELKDYLDSIYAEAA 774
Query: 292 QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-INEDRTGD 350
QY+ P PV +C I N + D L +I G+V CY E
Sbjct: 775 QYNEPPTYPVTVVCKGI-NGASKRTDTLGRIFHGLVAI------AGKRSCYDTKEFNYPT 827
Query: 351 ESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGG 409
E+ GW WQ CSEMV+P+G N +M+QPEP+NL ++IK C Y VSPRP WV TYYGG
Sbjct: 828 ETYLGWRWQKCSEMVLPIGHATNDTMFQPEPFNLNRFIKECNSLYSVSPRPHWVTTYYGG 887
Query: 410 HDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSD 469
DIKLIL R SNIIFSNG+RDP+S GG ++ + D + GSHCLD+ ++KSD
Sbjct: 888 RDIKLILHRFASNIIFSNGLRDPYSSGGVLE--NISDTLVAVYTRHGSHCLDILPSQKSD 945
Query: 470 PDWLVQQRKTEVKIMQGWITQYYDDF 495
P WLV QRK EV+I++GW+ +YY D
Sbjct: 946 PQWLVMQRKMEVEIIKGWMDKYYTDL 971
>gi|225436410|ref|XP_002271961.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
Length = 505
Score = 543 bits (1400), Expect = e-152, Method: Compositional matrix adjust.
Identities = 274/499 (54%), Positives = 357/499 (71%), Gaps = 18/499 (3%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILE----QKDSNHGFETFFYNQTIDHFSYG 66
+L +VIF ST+ + RLG + R+ + E + + F+TF+YNQT+DHF+Y
Sbjct: 16 ILALVIFFSTTCVSATQPHLLRLGVLGRTFLREPATFSESVSEEFQTFYYNQTLDHFNYR 75
Query: 67 PESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVI 124
PESY TF QRYV+N K+WGG N+ PI A+LGAEA +D ++ GF +NAHQFKAL+V
Sbjct: 76 PESYITFQQRYVVNFKYWGGANASAPIFAYLGAEADLDYDLSGIGFLTDNAHQFKALLVY 135
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGA 184
+EHRYYGKS+P+GSR+ A KNA GYFNSAQA+ADYA +L++IK K A ++PVI +GA
Sbjct: 136 IEHRYYGKSIPYGSREEAFKNASTLGYFNSAQAIADYAEVLIYIKKKLLAENSPVIVVGA 195
Query: 185 SYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIK 244
SYGG LA+WFRLKYPHV +G+LASSAPILY+ DITP Y S+V+KDFR+ SE C++TI
Sbjct: 196 SYGGMLASWFRLKYPHVALGALASSAPILYFDDITPHNGYFSIVTKDFREASESCYKTIS 255
Query: 245 KSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRI 304
+SWSEID +AS+P+GL+ILSKKF+ C L N+ ELK+ L+ YS AAQYD P + PV +
Sbjct: 256 ESWSEIDRVASEPNGLSILSKKFRICGQLNNSDELKEFLERTYSIAAQYDAPPSYPVTVV 315
Query: 305 CNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTG--DESDEGWEWQSCS 362
C I+ AP G DIL +I AGVV + GN S CY + T E+ +GW WQ+CS
Sbjct: 316 CGGIDGAPE-GSDILSRIFAGVV-----AFRGNMS-CYYTSNTTDYPIETIQGWGWQTCS 368
Query: 363 EMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTS 421
E+V+P+G+ N +M+ P+NLT + ++C YGV+PRP W+ TYYGGHDI+LILRR S
Sbjct: 369 EIVIPIGRGVNDTMFPLAPFNLTTFSQDCTSLYGVAPRPHWITTYYGGHDIELILRRFAS 428
Query: 422 NIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEV 481
NIIFSNG+RDP+S GG +K L+ +N GSHCLDL A +DP+WLV QRK EV
Sbjct: 429 NIIFSNGLRDPYSIGGVLKNISNSVLAI-LTVN-GSHCLDLLPAASTDPEWLVMQRKAEV 486
Query: 482 KIMQGWITQYYDDFKAINK 500
+I++GWI QYY D AI K
Sbjct: 487 EIIEGWIAQYYVDLHAITK 505
>gi|297734878|emb|CBI17112.3| unnamed protein product [Vitis vinifera]
Length = 624
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 274/499 (54%), Positives = 357/499 (71%), Gaps = 18/499 (3%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILE----QKDSNHGFETFFYNQTIDHFSYG 66
+L +VIF ST+ + RLG + R+ + E + + F+TF+YNQT+DHF+Y
Sbjct: 135 ILALVIFFSTTCVSATQPHLLRLGVLGRTFLREPATFSESVSEEFQTFYYNQTLDHFNYR 194
Query: 67 PESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVI 124
PESY TF QRYV+N K+WGG N+ PI A+LGAEA +D ++ GF +NAHQFKAL+V
Sbjct: 195 PESYITFQQRYVVNFKYWGGANASAPIFAYLGAEADLDYDLSGIGFLTDNAHQFKALLVY 254
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGA 184
+EHRYYGKS+P+GSR+ A KNA GYFNSAQA+ADYA +L++IK K A ++PVI +GA
Sbjct: 255 IEHRYYGKSIPYGSREEAFKNASTLGYFNSAQAIADYAEVLIYIKKKLLAENSPVIVVGA 314
Query: 185 SYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIK 244
SYGG LA+WFRLKYPHV +G+LASSAPILY+ DITP Y S+V+KDFR+ SE C++TI
Sbjct: 315 SYGGMLASWFRLKYPHVALGALASSAPILYFDDITPHNGYFSIVTKDFREASESCYKTIS 374
Query: 245 KSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRI 304
+SWSEID +AS+P+GL+ILSKKF+ C L N+ ELK+ L+ YS AAQYD P + PV +
Sbjct: 375 ESWSEIDRVASEPNGLSILSKKFRICGQLNNSDELKEFLERTYSIAAQYDAPPSYPVTVV 434
Query: 305 CNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTG--DESDEGWEWQSCS 362
C I+ AP G DIL +I AGVV + GN S CY + T E+ +GW WQ+CS
Sbjct: 435 CGGIDGAPE-GSDILSRIFAGVV-----AFRGNMS-CYYTSNTTDYPIETIQGWGWQTCS 487
Query: 363 EMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTS 421
E+V+P+G+ N +M+ P+NLT + ++C YGV+PRP W+ TYYGGHDI+LILRR S
Sbjct: 488 EIVIPIGRGVNDTMFPLAPFNLTTFSQDCTSLYGVAPRPHWITTYYGGHDIELILRRFAS 547
Query: 422 NIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEV 481
NIIFSNG+RDP+S GG +K L+ +N GSHCLDL A +DP+WLV QRK EV
Sbjct: 548 NIIFSNGLRDPYSIGGVLKNISNSVLAI-LTVN-GSHCLDLLPAASTDPEWLVMQRKAEV 605
Query: 482 KIMQGWITQYYDDFKAINK 500
+I++GWI QYY D AI K
Sbjct: 606 EIIEGWIAQYYVDLHAITK 624
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 45/85 (52%), Positives = 60/85 (70%), Gaps = 2/85 (2%)
Query: 359 QSCSEMVVPMGK-DKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILR 417
Q C+E V+P+ + D ++M+ P+NLT +I+ C YGVSP P W TYYGGH+I+LIL
Sbjct: 2 QRCNETVMPLRRGDSDTMFPLVPFNLTTFIQECISIYGVSPWPHWATTYYGGHNIELILH 61
Query: 418 RSTSNIIFSNGMRDPFSR-GGWVKT 441
R SNIIFSNG+RDP++ G W T
Sbjct: 62 RFASNIIFSNGLRDPYAVPGNWKST 86
>gi|224143318|ref|XP_002324914.1| predicted protein [Populus trichocarpa]
gi|222866348|gb|EEF03479.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 540 bits (1391), Expect = e-151, Method: Compositional matrix adjust.
Identities = 263/479 (54%), Positives = 339/479 (70%), Gaps = 13/479 (2%)
Query: 25 NGLKLRPRLGRI-RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKF 83
N +++P LGRI R S H +TF+YNQT+DHF+Y P+S+ F QRYVINSK+
Sbjct: 2 NNPRVKP-LGRISRNSASFAASSIYHNLQTFYYNQTLDHFNYRPDSFDMFQQRYVINSKY 60
Query: 84 WGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKA 141
WGG NS PI + G EAP++++ G ENAH+FKAL V +EHRYYGKS+PFGSR
Sbjct: 61 WGGANSNAPIFVYFGEEAPLENDFGDIGILAENAHRFKALQVYIEHRYYGKSIPFGSRNE 120
Query: 142 ALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHV 201
A KNA GYFNSAQALADYA I++H+ +K++ +PVI +GASYGG LA+WFRLKYPH+
Sbjct: 121 AFKNASTLGYFNSAQALADYAEIIIHVNEKFHVQRSPVIVVGASYGGMLASWFRLKYPHI 180
Query: 202 VIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLA 261
+G+LASSAPILY+ DITP Y S+V+KDFR+ S+ C TIKKSW+ ID IAS+PDGL+
Sbjct: 181 ALGALASSAPILYFTDITPAHAYVSIVTKDFREDSQSCHDTIKKSWTVIDKIASEPDGLS 240
Query: 262 ILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCK 321
ILSKKF+TC PL N++EL D L +Y AAQYD P + PV +C +I+ P+ G+DIL +
Sbjct: 241 ILSKKFETCKPLNNSSELTDYLAGIYMAAAQYDAPPSYPVTMVCKSIDE-PSFGNDILGR 299
Query: 322 IAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPW 381
I AG+V Y G CY+NE E+D GW WQ+C++MV+P G +SM+QP P+
Sbjct: 300 IFAGMV-----AYQG-ELPCYVNEPTKETETDVGWSWQTCADMVIPFGISNDSMFQPYPF 353
Query: 382 NLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKT 441
+L YI +CK++YGV PRP WV TY+GGHDIKLIL+R SNIIFSNG+RDP+S GG ++
Sbjct: 354 DLNAYINDCKDEYGVPPRPHWVTTYFGGHDIKLILKRFGSNIIFSNGLRDPYSSGGVLQ- 412
Query: 442 YHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
+ D + GSHCLD+ KSDP WLV QRK EV+I++ WI Y+ D A K
Sbjct: 413 -NISDSVVAITTVKGSHCLDVLATTKSDPQWLVAQRKEEVRIIRKWIRNYFSDLDACEK 470
>gi|356543520|ref|XP_003540208.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Glycine max]
Length = 511
Score = 536 bits (1380), Expect = e-149, Method: Compositional matrix adjust.
Identities = 270/512 (52%), Positives = 349/512 (68%), Gaps = 22/512 (4%)
Query: 1 MKSSQFSFQWLLLMVIFVSTSFHANGLKLR-PRLGRI--------RRSRILEQKDSNHGF 51
MK +WLL + + +++ L+ PRL I R+ ++
Sbjct: 1 MKHHPMLSKWLLFIFLTYYSTYLTVSHSLKIPRLSPIAEWETTLHNRATVVATDTDTENR 60
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSG 109
TF+Y Q +DHF PESY+TF QRY+IN K+WGG NS PI A+LGAE PID + +L G
Sbjct: 61 VTFYYKQVLDHFKXRPESYKTFQQRYLINFKYWGGANSSAPIFAYLGAEEPIDGSPELIG 120
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
F +NA F ALIV +EHRYYGKSVPFGSR+ ALKNA GYFNSAQA+ADYAS+L+HIK
Sbjct: 121 FLTDNAASFNALIVYIEHRYYGKSVPFGSREEALKNASTIGYFNSAQAIADYASVLIHIK 180
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
+A +PVI IG SYGG LA+WFRLKYPH+ IG+LASSAPILY+ +ITP Y+SVVS
Sbjct: 181 KTLHAQKSPVIVIGGSYGGMLASWFRLKYPHLAIGALASSAPILYFDNITPQDGYYSVVS 240
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
+DFR+ SE C+QTI KSWSEID +AS+P GL+ILS++F TC PL ++ELKD L +Y+
Sbjct: 241 RDFREASETCYQTILKSWSEIDRVASQPKGLSILSQRFNTCRPLNESSELKDYLINMYAS 300
Query: 290 AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTG 349
+AQY+ P PV IC I+ A + G DIL KI AG+V N+ C +N
Sbjct: 301 SAQYNHPPRYPVTVICGGIDRA-SFGSDILSKIYAGLVALR------GNTTCKVNGPIIV 353
Query: 350 DESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGG 409
E+ GW WQ+CSEMV+P+G N+M+QP+P++L Y K+CK+Q+GVSPRP WV TYYGG
Sbjct: 354 SETTLGWRWQTCSEMVIPIGIGNNTMFQPDPFSLKSYAKDCKKQFGVSPRPHWVTTYYGG 413
Query: 410 HDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL--DEAKK 467
H I+L+L++ SNIIFSNG+RDP+S GG +K + D + GSHCLD+ A
Sbjct: 414 HSIELVLQKFGSNIIFSNGLRDPYSSGGVLK--NISDSLVAIHTVNGSHCLDILGANANH 471
Query: 468 SDPDWLVQQRKTEVKIMQGWITQYYDDFKAIN 499
SDP+WLV+Q K E+ +M+GWI QYYDD A+N
Sbjct: 472 SDPEWLVEQWKREIMVMKGWIAQYYDDLVALN 503
>gi|449456172|ref|XP_004145824.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 507
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 263/473 (55%), Positives = 344/473 (72%), Gaps = 18/473 (3%)
Query: 31 PRLGRI-----RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG 85
PRL + SR+L + F+T++YNQT+DHF+Y PESY TFPQRY+IN K+WG
Sbjct: 42 PRLSPVGEKFLHHSRVLNSLPLD-DFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWG 100
Query: 86 GGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAAL 143
G NS PI A+LGAEAPIDD++ GF +NA QF AL++ +EHRYYGKS+PF SR AL
Sbjct: 101 GPNSSAPIFAYLGAEAPIDDDLDFIGFMTDNAIQFNALLIYIEHRYYGKSIPFRSRDEAL 160
Query: 144 KNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVI 203
NA GYFNSAQA+ADYA+IL+H+K +++A ++PVI IG SYGG LA+WFRLKYPHV +
Sbjct: 161 GNASTLGYFNSAQAIADYAAILIHVKKEFHANYSPVIVIGGSYGGMLASWFRLKYPHVAL 220
Query: 204 GSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAIL 263
G+LASSAPILY+ DITP Y+SVV+KDFR SE C++TIKKSWSEI+ +A +P+GL+IL
Sbjct: 221 GALASSAPILYFDDITPQDGYYSVVTKDFRGLSETCYETIKKSWSEIETVAYQPNGLSIL 280
Query: 264 SKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA 323
++FKTC PL+ EL+D L ++Y+ AAQY+ P PV RIC+AI+ + + L KIA
Sbjct: 281 DQEFKTCRPLRGYFELEDYLWSMYASAAQYNHPPKYPVTRICDAIDGTYSV-NGTLSKIA 339
Query: 324 AGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNL 383
AGV + G+ S CYINE R E+D GW WQSCSEMV+P+G D + M+ P P++L
Sbjct: 340 AGV-----FAFRGSVS-CYINEPRNETETDVGWRWQSCSEMVMPIGSD-DDMFPPSPFDL 392
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
I C YGV PRP W TYYGGHDI+L+L+R SNIIFSNG++DP+S G + ++
Sbjct: 393 QSVINYCNRLYGVPPRPHWATTYYGGHDIRLVLQRFGSNIIFSNGLKDPYSIAGVL--HN 450
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
D + GSHCLD+ +A ++DP+WLV+QRKTEV I++GWI++YY D K
Sbjct: 451 ISDSLLAVYTTNGSHCLDILKAHETDPEWLVRQRKTEVGIIKGWISEYYADLK 503
>gi|449496302|ref|XP_004160098.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 514
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 263/473 (55%), Positives = 344/473 (72%), Gaps = 18/473 (3%)
Query: 31 PRLGRI-----RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG 85
PRL + SR+L + F+T++YNQT+DHF+Y PESY TFPQRY+IN K+WG
Sbjct: 49 PRLSPVGEKFLHHSRVLNSLPLD-DFKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWG 107
Query: 86 GGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAAL 143
G NS PI A+LGAEAPIDD++ GF +NA QF AL++ +EHRYYGKS+PF SR AL
Sbjct: 108 GPNSSAPIFAYLGAEAPIDDDLDFIGFMTDNAIQFNALLIYIEHRYYGKSIPFRSRDEAL 167
Query: 144 KNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVI 203
NA GYFNSAQA+ADYA+IL+H+K +++A ++PVI IG SYGG LA+WFRLKYPHV +
Sbjct: 168 GNASTLGYFNSAQAIADYAAILIHVKKEFHANYSPVIVIGGSYGGMLASWFRLKYPHVAL 227
Query: 204 GSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAIL 263
G+LASSAPILY+ DITP Y+SVV+KDFR SE C++TIKKSWSEI+ +A +P+GL+IL
Sbjct: 228 GALASSAPILYFDDITPQDGYYSVVTKDFRGLSETCYETIKKSWSEIETVAYQPNGLSIL 287
Query: 264 SKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA 323
++FKTC PL+ EL+D L ++Y+ AAQY+ P PV RIC+AI+ + + L KIA
Sbjct: 288 DQEFKTCRPLRGYFELEDYLWSMYASAAQYNHPPKYPVTRICDAIDGTYSV-NGTLSKIA 346
Query: 324 AGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNL 383
AGV + G+ S CYINE R E+D GW WQSCSEMV+P+G D + M+ P P++L
Sbjct: 347 AGV-----FAFRGSVS-CYINEPRNETETDVGWRWQSCSEMVMPIGSD-DDMFPPSPFDL 399
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
I C YGV PRP W TYYGGHDI+L+L+R SNIIFSNG++DP+S G + ++
Sbjct: 400 QSVINYCNRLYGVPPRPHWATTYYGGHDIRLVLQRFGSNIIFSNGLKDPYSIAGVL--HN 457
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
D + GSHCLD+ +A ++DP+WLV+QRKTEV I++GWI++YY D K
Sbjct: 458 ISDSLLAVYTTNGSHCLDILKAHETDPEWLVRQRKTEVGIIKGWISKYYADLK 510
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 34/42 (80%), Gaps = 1/42 (2%)
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQ 485
FF L+ + +++GSHCLD+ +A ++DP+WLV QRKTEV I++
Sbjct: 4 FFTLTDTL-ISVGSHCLDILKAHETDPEWLVTQRKTEVGIVK 44
>gi|449456064|ref|XP_004145770.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
gi|449496213|ref|XP_004160074.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 499
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 258/497 (51%), Positives = 353/497 (71%), Gaps = 16/497 (3%)
Query: 10 WLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQK----DSNHGFETFFYNQTIDHFSY 65
WL ++ +S A ++ PRL I R+ + + + F+TF+YNQT+DHF+Y
Sbjct: 11 WLPFILFILSNCVTATQYRI-PRLSPIGRTFLHNAEAIPSSISDDFKTFYYNQTLDHFNY 69
Query: 66 GPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIV 123
PESY FP RY+IN K+WGG NS PILA+LGAE P++ ++ GF +NA +F AL+V
Sbjct: 70 RPESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNAIGFMTDNAARFDALLV 129
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIG 183
+EHRYYGKS+PFGSR+ ALKNA GYF+SAQA+ADYA++L+H+K KY+A +PVI +G
Sbjct: 130 YIEHRYYGKSMPFGSREEALKNASTLGYFSSAQAIADYAAVLIHLKQKYHAKDSPVIVLG 189
Query: 184 ASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTI 243
SYGG LA WFRLKYPHV +G+LASSAPILY+ DITP Y+S+ +KDFR+ SE C++TI
Sbjct: 190 GSYGGMLAAWFRLKYPHVALGALASSAPILYFEDITPHNGYYSIATKDFREVSETCYETI 249
Query: 244 KKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKR 303
+ SWS+I+ I SKP+GL+ILSK+FKTC+PL ++++L+D L ++Y+ AAQY+ P PV R
Sbjct: 250 RDSWSKIEIIGSKPNGLSILSKEFKTCSPLNSSSQLEDYLWSMYAGAAQYNHPPRYPVTR 309
Query: 304 ICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSE 363
IC I+ A + G I+ K+AAGV Y GN S CY R+ E+D GW WQ CSE
Sbjct: 310 ICGGIDGA-SPGSGIISKVAAGV-----FAYKGNLS-CYNIGPRSETETDVGWRWQRCSE 362
Query: 364 MVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNI 423
MV+P+ ++M+ P ++L ++ C + YGVS RP WV TYYGG+DIKLIL+R SNI
Sbjct: 363 MVMPLSTTNDTMFPPITFDLKSFVDYCYQLYGVSSRPHWVTTYYGGNDIKLILQRFGSNI 422
Query: 424 IFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKI 483
IFSNG+RDP+S GG ++ + D + GSHCLD+ A ++DP WLV+QR+TEV+I
Sbjct: 423 IFSNGLRDPYSSGGVLQ--NLSDSLLAVHTPKGSHCLDILRANETDPQWLVKQRETEVRI 480
Query: 484 MQGWITQYYDDFKAINK 500
++GWI++YY D + K
Sbjct: 481 IEGWISKYYADLEKSKK 497
>gi|255565527|ref|XP_002523754.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537058|gb|EEF38694.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
Length = 517
Score = 523 bits (1348), Expect = e-146, Method: Compositional matrix adjust.
Identities = 264/505 (52%), Positives = 349/505 (69%), Gaps = 27/505 (5%)
Query: 6 FSFQWLLLMVIFVSTSFHANGLKLR-PRLGRIRRSRILEQK------DSNHGFETFFYNQ 58
FSF LL +IF SFH + ++ + PRL +R+S I Q S+ TFFY Q
Sbjct: 21 FSF---LLTIIF---SFHVSSIQCKTPRLRALRKSHIQSQPKITTELSSSKDILTFFYPQ 74
Query: 59 TIDHFSYGPESYQTFPQRYVINSKFWGGGNS---PILAFLGAEAPIDDNIQLSGFTYENA 115
+DHF++ PESY TF QRYVIN K+W G N P+L F GAE I+D++ GF + A
Sbjct: 75 KLDHFNFKPESYATFKQRYVINFKYWSGPNRTSVPLLVFFGAEENIEDDVDGIGFLNDIA 134
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT 175
FK L+V +EHRYYGKS+PFGS K +NA GYF+SAQA+ADYA+I++H+K KY+A
Sbjct: 135 SHFKGLLVYIEHRYYGKSIPFGSSKEVFRNASSLGYFSSAQAIADYAAIIMHVKKKYSAK 194
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
+PVI IG SYGG LA+WFRLKYPHV +G+LASSAPILY+ DI P Y+S+V+KDF++T
Sbjct: 195 TSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFEDIAPRNGYYSIVTKDFKET 254
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDT 295
SE C+QTI+KSW+EI+ +ASK +GL+ILSKKFKTC PLK T ELKD LD++YSEAAQY+
Sbjct: 255 SESCYQTIRKSWAEIEKVASKRNGLSILSKKFKTCNPLKRTFELKDYLDSIYSEAAQYND 314
Query: 296 PSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-INE-DRTGDESD 353
P PV +C I+ AP G D+L +I AGVV Y G+ S CY +N + D +
Sbjct: 315 PPRYPVTIVCGGIDGAPK-GTDVLGRIFAGVV-----AYMGDRS-CYDVNGYNHPTDATS 367
Query: 354 EGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIK 413
W WQ+CSE+V+P+G ++N+M+ P+NL Y + CK YGV P+P WV YYGGHD+K
Sbjct: 368 LAWRWQTCSELVMPIGHERNTMFPTSPFNLNSYTQKCKALYGVLPQPHWVTNYYGGHDLK 427
Query: 414 LILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWL 473
LIL R SNIIFSNG++DP+S GG ++ + D + GSHCLD+ + + +DP WL
Sbjct: 428 LILHRFASNIIFSNGLKDPYSSGGVLE--NISDSIVAISTVNGSHCLDIQQTQPTDPHWL 485
Query: 474 VQQRKTEVKIMQGWITQYYDDFKAI 498
V QRK E++I+QGWI++Y D +
Sbjct: 486 VMQRKAEIEIIQGWISKYNIDLHEL 510
>gi|356541970|ref|XP_003539445.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 509
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 267/511 (52%), Positives = 355/511 (69%), Gaps = 23/511 (4%)
Query: 1 MKSSQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRS---RILEQKDSNH----GFET 53
M S SFQWL L + +S + A + PRLG +RS R E S H +T
Sbjct: 1 MGSVLPSFQWLSLFFLIISVNVCAFKI---PRLGTWQRSTKERDPEISSSLHLSDDDLKT 57
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFT 111
F+Y Q +DHF+Y P+SY TF QRY++N K+WGG S PI AF GAE P+D++ + GF
Sbjct: 58 FYYTQRLDHFNYRPDSYHTFQQRYMVNFKYWGGAKSSAPIFAFFGAEGPVDEDAKYIGFL 117
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+NA QF ALIV +EHRYYGKS+PFGS + A++NA RGYFNSAQA+ADYA++LLHIK
Sbjct: 118 RDNAPQFNALIVFIEHRYYGKSIPFGSSEEAMRNASTRGYFNSAQAIADYAAVLLHIKKT 177
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
+A ++P+I IG SYGG LA+WFRLKYPH+ +G+LASSAPILY+ I P Y+ +V+KD
Sbjct: 178 LSAQNSPIIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFNGIAPQAGYYYIVTKD 237
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA 291
F++TSE C+QTI+KSWSEID +A KP+GL+ILSK+FKTC L + +LKD LD++Y++AA
Sbjct: 238 FKETSESCYQTIRKSWSEIDRVAKKPNGLSILSKRFKTCDKLNKSFDLKDYLDSLYTDAA 297
Query: 292 QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-INEDRTGD 350
QY+ PS PVK +C AI+ A DIL +I GVV Y + S CY +NE
Sbjct: 298 QYNYPSEHPVKIVCGAIDAAAK-KTDILGQIFEGVV-----AYKQHRS-CYDMNEYNHPT 350
Query: 351 ESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGG 409
ES GW WQ+CSE+++P+G +KN SM+ P P+N+ +++ C+ YGV P+P WV TYYGG
Sbjct: 351 ESFLGWRWQTCSEIIMPIGHEKNDSMFPPAPFNMKTFVQECRSLYGVLPQPHWVTTYYGG 410
Query: 410 HDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSD 469
D+KLIL R SNIIFSNG+RDP+S GG +++ ++ + +N G HCLD+ K +D
Sbjct: 411 PDLKLILHRFASNIIFSNGLRDPYSSGGVLESISNTVVAVTT-VN-GCHCLDIQSRKAND 468
Query: 470 PDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
P WLV QR TEVKI++GWI +Y D A+ K
Sbjct: 469 PQWLVTQRNTEVKIIKGWIAEYKADLIALTK 499
>gi|225436403|ref|XP_002271733.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Vitis vinifera]
Length = 503
Score = 522 bits (1344), Expect = e-145, Method: Compositional matrix adjust.
Identities = 265/505 (52%), Positives = 347/505 (68%), Gaps = 22/505 (4%)
Query: 3 SSQFSFQWL--LLMVIFVSTSFHANGLKLRPRLGRIRRS--RILEQKDSNHGF----ETF 54
S +S QWL L+ + +S A + PRLG + R R E + F +TF
Sbjct: 5 SLSYSLQWLPFLIPTLILSCCVSAAQFNV-PRLGPLSRGILRNPEPAAVSESFYKDLKTF 63
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTY 112
FY QT+DHF+Y PESY+TF QRYV+N K WGG + PI A+LGAEAP+D ++ GF
Sbjct: 64 FYAQTLDHFNYRPESYKTFRQRYVMNFKHWGGAKAGAPIFAYLGAEAPLDGDLVNIGFVN 123
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
+NA +F AL++ +EHRYYGKS+PFGS K ALKNA GYFNSAQA+ADYA++L+H+K +
Sbjct: 124 DNAARFNALLIYIEHRYYGKSIPFGSTKVALKNASTLGYFNSAQAIADYAAVLMHVKKRL 183
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
+A ++PVI IG SYGG LA+WFRLKYPH+ +G+LASSAPILY+ +I P Y+S+V+KDF
Sbjct: 184 HAQNSPVIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDEIAPEIGYYSIVTKDF 243
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQ 292
R+ SE C++TI++SWSEID IASKP+GL+ILSK+FKTC L+++ ELKD LD++Y+EAAQ
Sbjct: 244 REASESCYRTIRRSWSEIDRIASKPNGLSILSKRFKTCAHLESSFELKDYLDSIYAEAAQ 303
Query: 293 YDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-INEDRTGDE 351
Y+ P PV +C I N + D L +I G+V CY E E
Sbjct: 304 YNEPPTYPVTVVCKGI-NGASKRTDTLGRIFHGLVAI------AGKRSCYDTKEFNYPTE 356
Query: 352 SDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGH 410
+ GW WQ CSEMV+P+G N +M+QPEP+NL ++IK C Y VSPRP WV TYYGG
Sbjct: 357 TYLGWRWQKCSEMVLPIGHATNDTMFQPEPFNLNRFIKECNSLYSVSPRPHWVTTYYGGR 416
Query: 411 DIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDP 470
DIKLIL R SNIIFSNG+RDP+S GG ++ + D + GSHCLD+ ++KSDP
Sbjct: 417 DIKLILHRFASNIIFSNGLRDPYSSGGVLE--NISDTLVAVYTRHGSHCLDILPSQKSDP 474
Query: 471 DWLVQQRKTEVKIMQGWITQYYDDF 495
WLV QRK EV+I++GW+ +YY D
Sbjct: 475 QWLVMQRKMEVEIIKGWMDKYYTDL 499
>gi|449456174|ref|XP_004145825.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 491
Score = 520 bits (1338), Expect = e-145, Method: Compositional matrix adjust.
Identities = 258/477 (54%), Positives = 341/477 (71%), Gaps = 18/477 (3%)
Query: 31 PRLGRI-----RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG 85
PRL I S+ LE S+ F+TF++NQT+DHF+Y PESY TFPQRY+IN K+WG
Sbjct: 26 PRLSPIGEKFLHHSKALELPPSD-DFKTFYFNQTLDHFNYRPESYTTFPQRYIINFKYWG 84
Query: 86 GGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAAL 143
G NS PILA+LG EAPID + + GF +NA +F AL+V +EHRYYGKS+PFGSRK AL
Sbjct: 85 GANSSAPILAYLGPEAPIDSAMNVIGFMTDNAVKFNALLVYIEHRYYGKSIPFGSRKEAL 144
Query: 144 KNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVI 203
+NA GYFNSAQALADYA+IL+H+K +++A ++PVI IG SYGG LATWFRLKYPHV +
Sbjct: 145 RNASTLGYFNSAQALADYAAILIHVKKEFSAKYSPVIVIGGSYGGMLATWFRLKYPHVAL 204
Query: 204 GSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAIL 263
G+LASSAPILY+ DITP Y+ +V+KDFR+ S+ C+++I++SWSEI+ +AS+ +GL++L
Sbjct: 205 GALASSAPILYFNDITPENGYYVIVTKDFREVSQTCYESIRESWSEIETVASQSNGLSVL 264
Query: 264 SKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA 323
K FKTC+PL+++T+L++ L +Y+ AAQY+ PS PV RIC+AI+ + G L KIA
Sbjct: 265 DKVFKTCSPLRSSTQLENYLWFMYASAAQYNHPSRYPVNRICDAIDQTYSNGT--LGKIA 322
Query: 324 AGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNL 383
AGV Y G S CYINE E+ GW+WQ CSEMV+P+ ++M+ E ++
Sbjct: 323 AGV-----FAYRGELS-CYINEPINTTETTVGWQWQRCSEMVMPISTGNDTMFPSETFDH 376
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
+ C + YGV+PRP WV TYYGGHDI LIL R SNIIFSNG++DP+S GG + ++
Sbjct: 377 ESFSIYCNQLYGVTPRPHWVTTYYGGHDIHLILHRFASNIIFSNGLKDPYSIGGVL--HN 434
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
D + GSHCLD+ A + DP+WLV QRKTEV I++ WI +YY D K
Sbjct: 435 ISDSLLAVYTANGSHCLDILTANRMDPEWLVTQRKTEVGIIKEWIDEYYADLANYKK 491
>gi|224104759|ref|XP_002313555.1| predicted protein [Populus trichocarpa]
gi|222849963|gb|EEE87510.1| predicted protein [Populus trichocarpa]
Length = 499
Score = 518 bits (1334), Expect = e-144, Method: Compositional matrix adjust.
Identities = 267/484 (55%), Positives = 346/484 (71%), Gaps = 17/484 (3%)
Query: 20 TSFHANGLKLRPRLGRIRRSRILEQKDSNHG------FETFFYNQTIDHFSYGPESYQTF 73
T+ A L PRL I L+ D G FETFFYNQT+DHF+Y PESY TF
Sbjct: 22 TTATAKRLHTIPRLSPIGPRVWLDHPDQILGESVREDFETFFYNQTLDHFNYRPESYDTF 81
Query: 74 PQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYG 131
QRY+INSK+WGG N+ PIL +LGAEAPID ++ GF + A +F +L+V +EHRYYG
Sbjct: 82 LQRYLINSKYWGGANASAPILVYLGAEAPIDGDLDAVGFLVDTAVEFNSLLVYVEHRYYG 141
Query: 132 KSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELA 191
KS+PFGSR+ ALKNA GYFNSAQA+ADYA+I++HIK A +PVI IG SYGG LA
Sbjct: 142 KSIPFGSREEALKNASTLGYFNSAQAIADYAAIIIHIKKTLQAKDSPVIVIGGSYGGMLA 201
Query: 192 TWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEID 251
+WFRLKYPH+ +G+LASSAP+LY+ DITP Y+++VSKDFR SE C+QTI++SW EID
Sbjct: 202 SWFRLKYPHIALGALASSAPVLYFDDITPQYGYYALVSKDFRGASETCYQTIRESWEEID 261
Query: 252 NIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENA 311
+ASKPDGL+ILSKKFKTC PL + +ELK+ LD++Y+ AAQY+ P PV ++C I+
Sbjct: 262 EVASKPDGLSILSKKFKTCNPLTDASELKNHLDSMYANAAQYNKPPTYPVNKVCGGIDGC 321
Query: 312 PNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKD 371
GDD+L ++ G+V Y GN S CY+NE E+ GW WQ+CSEMV+P+G
Sbjct: 322 -GFGDDLLGRVFGGLV-----AYKGNRS-CYVNEPTNQSETSVGWRWQTCSEMVMPIGYG 374
Query: 372 KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRD 431
+SM+ P+P++L YI++CK Y V+PR WV TYYGGH I+LIL+R SNIIFSNG+RD
Sbjct: 375 NDSMFPPDPFDLKAYIEDCKSLYDVTPRFHWVTTYYGGHSIRLILQRFASNIIFSNGLRD 434
Query: 432 PFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
P+S GG ++ + D + GSHCLD+ AK++DP+WLV QRKTE+KI++ WI +Y
Sbjct: 435 PYSSGGVLE--NISDTVVAVKTVNGSHCLDILFAKETDPEWLVAQRKTEIKIIKEWINKY 492
Query: 492 YDDF 495
Y D
Sbjct: 493 YADL 496
>gi|356498058|ref|XP_003517871.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 471
Score = 517 bits (1332), Expect = e-144, Method: Compositional matrix adjust.
Identities = 254/449 (56%), Positives = 319/449 (71%), Gaps = 11/449 (2%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSG 109
+TF++ Q +DHF+Y PESY TF QRY+IN K+WGG NS PI A+ GAE+PID++ G
Sbjct: 23 KTFYFKQVLDHFNYRPESYTTFQQRYLINFKYWGGANSSAPIFAYFGAESPIDNSPNGVG 82
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
F +NA F AL+V +EHRYYGKSV FGSR+ ALKNA GYFNSAQALADYASIL H+K
Sbjct: 83 FLTDNAASFNALLVYIEHRYYGKSVQFGSREEALKNASTIGYFNSAQALADYASILKHVK 142
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
+A ++PVI IG SYGG LA+WFRLKYPH+ IG+LASSAPILY+ ITP Y+S V+
Sbjct: 143 KTLHAKNSPVIVIGGSYGGMLASWFRLKYPHLAIGALASSAPILYFDKITPQNGYYSTVT 202
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
+D+R+ SE C++T+ KSWSEI IAS+P+GL LS++F TC L + EL D L + Y
Sbjct: 203 RDYREASETCYETVLKSWSEIRRIASQPNGLVTLSQRFNTCHTLNQSYELIDYLRSTYVY 262
Query: 290 AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTG 349
AAQY+ P PV IC I+ + G DIL KI AG+V NS C +N
Sbjct: 263 AAQYNQPPRYPVSMICGGID-GESLGSDILSKIYAGIVALRG------NSTCKVNGPTNV 315
Query: 350 DESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGG 409
E+ GW WQ+CSEMV+P+G ++M++P P+NLT+Y + CKEQYGVSPRP WV TYYGG
Sbjct: 316 SETTVGWRWQTCSEMVIPIGIGNDTMFEPIPFNLTRYAEGCKEQYGVSPRPHWVTTYYGG 375
Query: 410 HDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSD 469
H+IKL+LRR SNIIFSNG+RDP+S GG + + D + GSHCLDL A +SD
Sbjct: 376 HNIKLVLRRLGSNIIFSNGLRDPYSIGGVLD--NISDSIVAVHTVNGSHCLDLLRANQSD 433
Query: 470 PDWLVQQRKTEVKIMQGWITQYYDDFKAI 498
P WLV+QRK EVKI++ WITQYY D A+
Sbjct: 434 PGWLVEQRKKEVKIIKRWITQYYADLDAL 462
>gi|449496306|ref|XP_004160099.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 491
Score = 517 bits (1331), Expect = e-144, Method: Compositional matrix adjust.
Identities = 255/477 (53%), Positives = 342/477 (71%), Gaps = 18/477 (3%)
Query: 31 PRLGRI-----RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG 85
PRL I S+ LE S+ F+TF++NQT+DHF+Y PESY TFPQRY+IN K+WG
Sbjct: 26 PRLSPIGEKFLHHSKALELPPSD-DFKTFYFNQTLDHFNYRPESYTTFPQRYIINFKYWG 84
Query: 86 GG--NSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAAL 143
G ++PILA+LG EAPID + + GF +NA +F AL+V +EHRYYGKS+PFGSRK AL
Sbjct: 85 GAYSSAPILAYLGPEAPIDSAMNVIGFMTDNAVKFNALLVYIEHRYYGKSIPFGSRKEAL 144
Query: 144 KNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVI 203
+NA GYFNSAQA+ADYA+IL+H+K +++A ++PVI IG SYGG LATWFRLKYPHV +
Sbjct: 145 RNASTLGYFNSAQAIADYAAILIHVKKEFSAKYSPVIVIGGSYGGMLATWFRLKYPHVAL 204
Query: 204 GSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAIL 263
G+LASSAPILY+ DITP Y+ +V+KDFR+ S+ C+++I++SWSEI+ +AS+ +GL++L
Sbjct: 205 GALASSAPILYFNDITPENGYYVIVTKDFREVSQTCYESIRESWSEIETVASQSNGLSVL 264
Query: 264 SKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA 323
K+FKTC+PL+++T+L++ L +Y+ AAQY+ PS PV RIC+AI+ + G L KIA
Sbjct: 265 DKEFKTCSPLRSSTQLENYLWFMYASAAQYNHPSRYPVNRICDAIDQTYSNGT--LGKIA 322
Query: 324 AGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNL 383
AGV Y G S CYINE E+ GW+WQ CSEMV+P+ ++M+ E ++
Sbjct: 323 AGV-----FAYRGELS-CYINEPINTTETTVGWQWQRCSEMVMPISTGNDTMFPSETFDH 376
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
+ C + YGV+PRP WV TYYGGHDI LIL R SNIIFSNG++DP+S GG + ++
Sbjct: 377 ESFSIYCNQLYGVTPRPHWVTTYYGGHDIHLILHRFASNIIFSNGLKDPYSIGGVL--HN 434
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
D + GSHCLD+ A + DP+WLV QRKTEV I++ WI +YY D K
Sbjct: 435 ISDSLLAVYTANGSHCLDILTANRMDPEWLVTQRKTEVGIIKEWIDEYYADLANYKK 491
>gi|356499942|ref|XP_003518794.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Glycine max]
Length = 494
Score = 513 bits (1320), Expect = e-142, Method: Compositional matrix adjust.
Identities = 269/506 (53%), Positives = 340/506 (67%), Gaps = 31/506 (6%)
Query: 1 MKSSQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHG---------F 51
MK S+QWLLL+ + ST A PR+ S I E + S H
Sbjct: 1 MKHPSLSYQWLLLISLTHSTLLTAKHSLTIPRM-----SPIPEWETSLHDHPVATDAEEV 55
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSG 109
+TF++ Q +DHF+Y PESY TF QRY++N K+WGG NS PI A+ GAE+PID++ G
Sbjct: 56 KTFYFKQVLDHFNYRPESYTTFQQRYLVNFKYWGGANSSAPIFAYFGAESPIDNSPNGIG 115
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
F +NA F AL+V +EHRYYGKSVPFGSR+ ALKNA GYFNSAQALADYA+IL HIK
Sbjct: 116 FLTDNAASFNALLVYIEHRYYGKSVPFGSREEALKNASTIGYFNSAQALADYAAILEHIK 175
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
+A ++PVI IG SYGG LA+WFRLKYPH+ +G+LAS+APILY+ ITP Y+SVV+
Sbjct: 176 KTLHAQNSPVIVIGGSYGGMLASWFRLKYPHLTVGALASAAPILYFDKITPQNGYYSVVT 235
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
+D+RD SE C++TI KSWSEI +AS+P+GL LS +F TC + + EL D L + Y
Sbjct: 236 RDYRDASETCYETILKSWSEIHRVASQPNGLVTLSHRFNTCHTVNQSYELIDYLRSTYVY 295
Query: 290 AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTG 349
AAQY+ P PV IC I+ A + G DIL KI AGVV GNN+ C +N
Sbjct: 296 AAQYNQPPRYPVSEICGGIDGA-SLGSDILSKIYAGVVAL-----WGNNT-CKVNGPTNV 348
Query: 350 DESDEGWEWQSCSEMVVPMGKDKNSMY-QPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYG 408
E+ GW WQ+CSEM +P+ N+M+ QP P+NLT+Y + CK+QYGVSPRP WV TYYG
Sbjct: 349 SETSVGWRWQTCSEMAIPISIGNNTMFEQPIPFNLTRYAEGCKKQYGVSPRPHWVTTYYG 408
Query: 409 GHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKS 468
GH+IKL+LRR SNIIFSNG+RDP+S GG V Y + +N GSHCLDL A+
Sbjct: 409 GHNIKLVLRRFGSNIIFSNGLRDPYSIGG-VLDYTSDSIVAVNTVN-GSHCLDLLRAR-- 464
Query: 469 DPDWLVQQRKTEVKIMQGWITQYYDD 494
WLV+Q K E++I++ WITQYYDD
Sbjct: 465 ---WLVEQXKKEIRIIKSWITQYYDD 487
>gi|224109994|ref|XP_002333168.1| predicted protein [Populus trichocarpa]
gi|222835051|gb|EEE73500.1| predicted protein [Populus trichocarpa]
Length = 446
Score = 511 bits (1315), Expect = e-142, Method: Compositional matrix adjust.
Identities = 252/447 (56%), Positives = 322/447 (72%), Gaps = 10/447 (2%)
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFT 111
F+Y QT+DHF+Y PESY TF QRYVI+ ++WGG N+ PI F GAE +DD++ GF
Sbjct: 2 FYYTQTLDHFNYKPESYTTFRQRYVIDFRYWGGANTSAPIFVFFGAEEDLDDDLDAIGFL 61
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+NA FKAL++ +EHRYYG+S+PFGSRK ALKNA GY NSAQA+ADYA++++H+K K
Sbjct: 62 SDNAPHFKALLIYIEHRYYGRSIPFGSRKEALKNAETLGYLNSAQAMADYAAVIMHLKKK 121
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
Y+A ++PVI IG SYGG L +WFRLKYPH+ +G+LASSAPILY+ DI+P Y+S+V+KD
Sbjct: 122 YSAKNSPVIVIGGSYGGMLTSWFRLKYPHIALGALASSAPILYFDDISPQEGYYSIVTKD 181
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA 291
F++TSE C+ TI+KSW EI+ IASKP+GL+ILSKKFKTC PL T EL+D LD++Y EAA
Sbjct: 182 FKETSESCYNTIRKSWGEIEKIASKPNGLSILSKKFKTCFPLNRTFELEDFLDSIYFEAA 241
Query: 292 QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDE 351
QYD P PV +C I A DIL +I A VV Y GN S +N D
Sbjct: 242 QYDYPPEFPVSIVCGGINKASAARTDILDRILAVVV-----AYMGNRSCHDMNAFNYPDA 296
Query: 352 SDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHD 411
E W WQ+CSE+V+P+G + NSM+ P P+NL YIK+CK +GV P+P W+ TYYGGHD
Sbjct: 297 IYE-WRWQTCSEIVMPIGHESNSMFPPAPFNLNDYIKDCKSLFGVLPQPHWITTYYGGHD 355
Query: 412 IKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPD 471
IKLIL+R SNIIFSNG+RDP+S GG + + D + GSHCLD+ A SDP
Sbjct: 356 IKLILQRFGSNIIFSNGLRDPYSSGGVLN--NISDSIVAVSTVNGSHCLDIQRASPSDPH 413
Query: 472 WLVQQRKTEVKIMQGWITQYYDDFKAI 498
WLV QRK EVKI++GWI++YY D +
Sbjct: 414 WLVMQRKIEVKIIEGWISKYYTDLLEV 440
>gi|224058951|ref|XP_002299660.1| predicted protein [Populus trichocarpa]
gi|222846918|gb|EEE84465.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 497 bits (1280), Expect = e-138, Method: Compositional matrix adjust.
Identities = 272/485 (56%), Positives = 346/485 (71%), Gaps = 18/485 (3%)
Query: 20 TSFHANGLKLRPRLGRIRRSRILEQKDSN-------HGFETFFYNQTIDHFSYGPESYQT 72
T+ A L PRL I +Q D FETFF+NQT+DHF+Y PESY
Sbjct: 18 TTATAKRLNTIPRLSPIGPRVWRDQPDKTTLGEFDGEDFETFFHNQTLDHFNYRPESYDK 77
Query: 73 FPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
FPQRY+INSK+WGG N +PIL +LGAE PID+++ GF +NA QF +L+V +EHRYY
Sbjct: 78 FPQRYLINSKYWGGANVSAPILVYLGAEEPIDEDLAAVGFLVDNAVQFNSLLVFIEHRYY 137
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGEL 190
GKS+PFGSR+ ALK+A GYFNSAQA+ADYA+I++HIK+ A ++PVI IG SYGG L
Sbjct: 138 GKSIPFGSREEALKDASKLGYFNSAQAIADYAAIIIHIKETLRAQYSPVIVIGGSYGGML 197
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI 250
A+WFRLKYPH+ +G+LASSAPILY+ DITP Y+S+V+KDFR+ SE C+QTIK SWSEI
Sbjct: 198 ASWFRLKYPHIALGALASSAPILYFDDITPQDGYYSIVTKDFREASETCYQTIKTSWSEI 257
Query: 251 DNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIEN 310
D +ASKPDGL++LSKKFKTCTPL + +ELKD LDT+Y+ AAQY+ P PV +C I+
Sbjct: 258 DELASKPDGLSMLSKKFKTCTPLADASELKDHLDTMYASAAQYNRPPTYPVNEVCKGIDG 317
Query: 311 APNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGK 370
GDDIL +I G+V Y GN S CY+N E+ GW WQ+CSEM +P+G
Sbjct: 318 G-GFGDDILSRIFGGLV-----AYKGNLS-CYVNAHTDPSETTVGWRWQTCSEMAIPIGV 370
Query: 371 DKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMR 430
NSM+ P+P++L YI+NCK YGV RP W+ TYYGGH IKLIL+R SNIIFSNG+R
Sbjct: 371 GNNSMFPPDPFDLEDYIENCKSLYGVPTRPHWITTYYGGHSIKLILQRFASNIIFSNGLR 430
Query: 431 DPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
DP+S GG ++ + D + GSHCLD+ AK++DP+WLV QRK E+KI++ WI +
Sbjct: 431 DPYSSGGVLE--NISDTVVAVKTVNGSHCLDILFAKETDPEWLVTQRKIEIKIIKEWINK 488
Query: 491 YYDDF 495
YY D
Sbjct: 489 YYVDL 493
>gi|224109998|ref|XP_002333169.1| predicted protein [Populus trichocarpa]
gi|222835052|gb|EEE73501.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 497 bits (1279), Expect = e-138, Method: Compositional matrix adjust.
Identities = 271/485 (55%), Positives = 345/485 (71%), Gaps = 18/485 (3%)
Query: 20 TSFHANGLKLRPRLGRIRRSRILEQKDSN-------HGFETFFYNQTIDHFSYGPESYQT 72
T+ A L PRL I +Q D FETFFYNQT+DHF+Y PESY
Sbjct: 18 TTATAKRLNTIPRLSPIGPRVWRDQPDKTTLGEFDGEDFETFFYNQTLDHFNYRPESYDK 77
Query: 73 FPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
FPQRY+INSK+WGG N +PIL FLGAE PID+++ GF +NA QF +L+V +EHRYY
Sbjct: 78 FPQRYLINSKYWGGANVSAPILVFLGAEEPIDEDLAAVGFLVDNAVQFNSLLVFIEHRYY 137
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGEL 190
GKS+PFGSR+ ALK+A GYFNSAQA+ADYA+I++HIK+ A ++PVI IG SYGG L
Sbjct: 138 GKSIPFGSREEALKDASKLGYFNSAQAIADYAAIIIHIKETLRAQYSPVIVIGGSYGGML 197
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI 250
A+WFRLKYPH+ +G+LASSAPILY+ DITP Y+S+V+KDFR+ SE C+QTIK SWSEI
Sbjct: 198 ASWFRLKYPHIALGALASSAPILYFDDITPQDGYYSIVTKDFREASETCYQTIKTSWSEI 257
Query: 251 DNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIEN 310
D +ASKPDGL++LSKKFKTC PL + +ELKD LDT+Y+ AAQY+ P PV +C I+
Sbjct: 258 DELASKPDGLSMLSKKFKTCNPLTDASELKDHLDTMYASAAQYNRPPTYPVNEVCKGIDG 317
Query: 311 APNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGK 370
GDD L +I G+V Y+GN S CY+N E+ GW+WQ CSEM +P+G
Sbjct: 318 G-GFGDDTLSRIFGGLV-----AYNGNLS-CYVNAHTDPSETTVGWQWQKCSEMAIPIGV 370
Query: 371 DKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMR 430
NSM+ P+P++L YI++CK YGV+ RP WV TYYGGH IKLIL+R SNIIFSNG+R
Sbjct: 371 GNNSMFPPDPFDLKDYIEHCKSLYGVTTRPHWVTTYYGGHSIKLILQRFASNIIFSNGLR 430
Query: 431 DPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
DP+S GG ++ + D + GSHCLD+ A+++DP WLV QRK E+KI++ WI +
Sbjct: 431 DPYSSGGVLE--NISDTVVAVKTVNGSHCLDILFAEENDPAWLVTQRKIEIKIIKEWINK 488
Query: 491 YYDDF 495
YY D
Sbjct: 489 YYADL 493
>gi|225448871|ref|XP_002270231.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
gi|296083471|emb|CBI23429.3| unnamed protein product [Vitis vinifera]
Length = 503
Score = 493 bits (1269), Expect = e-137, Method: Compositional matrix adjust.
Identities = 249/507 (49%), Positives = 339/507 (66%), Gaps = 31/507 (6%)
Query: 10 WLLLMVIFVSTSFHANGLKLRPRLGRIRRSR--ILEQKDS--------NHGFETFFYNQT 59
WL+L+ F++ S A K PRLG +R S +LE S + F+TFFY QT
Sbjct: 11 WLILL--FITASVSATPSKKIPRLGVLRGSSLSVLEGSSSLRTVSVNLSENFQTFFYPQT 68
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSP--ILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHF+Y PESY TF RY++N +WGG S I +LG E+ +D +I GF +N +
Sbjct: 69 LDHFNYRPESYTTFQHRYMVNFNYWGGARSAAQIFVYLGEESDLDKDINSIGFLVDNGAR 128
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
F AL+V +EHRYYGKS PFGS + +L+NA RGYFNS QALADYA +++++K +A +
Sbjct: 129 FGALLVYIEHRYYGKSNPFGSMQKSLQNASRRGYFNSGQALADYAEVIINLKKNLSADSS 188
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
PVI +G SYGG LA WFRLKYPHV +G+LASSAPILY+ DITP Y+S+V+KDFRD SE
Sbjct: 189 PVIVVGGSYGGLLAAWFRLKYPHVALGALASSAPILYFDDITPQDGYYSLVTKDFRDFSE 248
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPS 297
C+ TIK SW+EID A++ +GL LSKKF+TC PL++ ++LKD L+T+YS AAQYD P
Sbjct: 249 SCYNTIKDSWAEIDKAAAEANGLLNLSKKFRTCKPLESASQLKDYLETMYSIAAQYDRPP 308
Query: 298 NIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-INEDRTGDESDEGW 356
PV +CN I+ G DIL +I +G+V + N CY + + E++EGW
Sbjct: 309 MYPVTVVCNGIDGGLQ-GTDILDRIFSGIVASR------GNKSCYDMGQSSFPSETEEGW 361
Query: 357 EWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLI 415
WQ CSE+V+P+G+ N +M+ P++ +Y +CK YGV+PRP W+ +YYGGH+IKLI
Sbjct: 362 NWQVCSELVIPIGRGSNDTMFPAAPFDFKEYADSCKYSYGVTPRPHWITSYYGGHNIKLI 421
Query: 416 LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS---QDLNLGSHCLDLDEAKKSDPDW 472
L+R SNIIFSNG+RDP+S GG ++ D+S S GSHCLD+ + + DP+W
Sbjct: 422 LKRFGSNIIFSNGLRDPYSSGGVLE-----DISHSIIAVHTPRGSHCLDILPSTEDDPNW 476
Query: 473 LVQQRKTEVKIMQGWITQYYDDFKAIN 499
LV QR E++I+ GW+ +YY+D N
Sbjct: 477 LVLQRNVEIEIIHGWLLKYYEDLLQSN 503
>gi|147820652|emb|CAN65360.1| hypothetical protein VITISV_036071 [Vitis vinifera]
Length = 580
Score = 490 bits (1261), Expect = e-136, Method: Compositional matrix adjust.
Identities = 247/503 (49%), Positives = 338/503 (67%), Gaps = 31/503 (6%)
Query: 10 WLLLMVIFVSTSFHANGLKLRPRLGRIRRSR--ILEQKDS--------NHGFETFFYNQT 59
WL+L+ F++ S A K PRLG +R S +LE S + F+TFFY QT
Sbjct: 11 WLILL--FITASVSATPSKKIPRLGVLRGSSLSVLEGSSSLRTVSVNLSENFQTFFYPQT 68
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSP--ILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHF+Y PESY TF RY++N +WGG S I +LG E+ +D +I GF EN +
Sbjct: 69 LDHFNYRPESYTTFQHRYMVNFNYWGGARSAAXIFVYLGEESDLDKDINSIGFLVENGAR 128
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
F AL+V +EHRYYGKS PFGS + +L+NA RGYFNS QALAD+A +++++K +A +
Sbjct: 129 FGALLVYIEHRYYGKSNPFGSMQKSLQNAGQRGYFNSGQALADFAEVIINLKKNLSADSS 188
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
PVI +G S GG LA WFRLKYPHV +G+LASSAPILY+ DITP Y+S+V+KDFRD SE
Sbjct: 189 PVIVVGGSDGGLLAAWFRLKYPHVALGALASSAPILYFDDITPQDGYYSLVTKDFRDFSE 248
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPS 297
C+ TIK SW+EID A++ +GL LSKKF+TC PL++ ++LKD L+T+YS AAQYD P
Sbjct: 249 SCYNTIKDSWAEIDKAAAEANGLLNLSKKFRTCKPLESASQLKDYLETMYSIAAQYDRPP 308
Query: 298 NIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-INEDRTGDESDEGW 356
PV +CN I+ G DIL +I +G+V + N CY + + E++EGW
Sbjct: 309 MYPVTIVCNGIDGGLQ-GTDILGRIFSGIVASR------GNKSCYDMGQSSFPSETEEGW 361
Query: 357 EWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLI 415
WQ CSE+V+P+G+ N +M+ P++ +Y +CK YGV+PRP W+ +YYGGH+IKLI
Sbjct: 362 NWQVCSELVIPIGRGSNDTMFPAAPFDFKEYDDSCKYSYGVTPRPHWITSYYGGHNIKLI 421
Query: 416 LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS---QDLNLGSHCLDLDEAKKSDPDW 472
L+R SNIIFSNG+RDP+S GG ++ D+S S GSHCLD+ +++ DP+W
Sbjct: 422 LKRFGSNIIFSNGLRDPYSSGGVLE-----DISHSIIAVHTPRGSHCLDILPSREDDPNW 476
Query: 473 LVQQRKTEVKIMQGWITQYYDDF 495
LV QR E++I+ GW+ +YY+D
Sbjct: 477 LVLQRNVEIEILHGWLLKYYEDL 499
>gi|224109890|ref|XP_002315345.1| predicted protein [Populus trichocarpa]
gi|222864385|gb|EEF01516.1| predicted protein [Populus trichocarpa]
Length = 513
Score = 486 bits (1252), Expect = e-135, Method: Compositional matrix adjust.
Identities = 239/495 (48%), Positives = 334/495 (67%), Gaps = 21/495 (4%)
Query: 13 LMVIFVSTSFHANGLKLRPRLG---------RIRRSRILEQKDSNHGFETFFYNQTIDHF 63
L+ +F +T PRLG + + ++ S+ F TF+YNQT+DHF
Sbjct: 17 LVFLFSTTRVFCASPSKVPRLGVHGPYGARNHLGKVKVQSLAPSDQEFRTFYYNQTLDHF 76
Query: 64 SYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKAL 121
+Y PESY+TF RYV++ K W G ++ PI +LG E+ ++D++ G +NA +F AL
Sbjct: 77 NYRPESYKTFQHRYVVSFKHWRGPDTMAPIFVYLGEESSLNDDLGYIGILSDNAARFGAL 136
Query: 122 IVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIA 181
V +EHR+YG+S+PF SR+ ALK+A RGYF+SAQ LADYA ++LHIK K++A +PVI
Sbjct: 137 QVYIEHRFYGESIPFVSREEALKDANLRGYFSSAQTLADYAEVILHIKKKHSADSSPVIV 196
Query: 182 IGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQ 241
G SYGG LA WFRLKYPHV +G+LASSAP+LY+ +ITP Y++VV+KDF+++SE C++
Sbjct: 197 FGGSYGGMLAAWFRLKYPHVALGALASSAPVLYFDNITPSNGYYTVVTKDFKESSESCYK 256
Query: 242 TIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPV 301
TIK+SW EID +A+K DGL+IL KKF TC PL+ TELK+ LD+++S AAQYD P PV
Sbjct: 257 TIKQSWFEIDKVAAKADGLSILQKKFNTCKPLEAATELKNFLDSLFSVAAQYDRPPRYPV 316
Query: 302 KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSC 361
+C I++AP G D+L +I +G+V Y G CY + E+ EGW WQ+C
Sbjct: 317 DLVCKGIDSAPE-GSDVLDRIFSGIV-----AYFGKKP-CYNLDAFFSSETLEGWTWQTC 369
Query: 362 SEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRST 420
SE+V+P+G+ N +M+ EP++L +YI+ CK +GV PRP W+ TYYGGH K +LRR
Sbjct: 370 SELVIPIGRGSNDTMFPAEPFDLKEYIEECKSAFGVPPRPHWITTYYGGHHFKEVLRRFG 429
Query: 421 SNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTE 480
SNIIFSNG+RDP+S GG ++ + D + G+HC+D+ A DPDW+V QR E
Sbjct: 430 SNIIFSNGLRDPYSSGGVLE--NISDSILAVYTTKGAHCMDILPATIGDPDWVVLQRNIE 487
Query: 481 VKIMQGWITQYYDDF 495
++I+ GWI +YY D
Sbjct: 488 IEIINGWILKYYQDL 502
>gi|147820345|emb|CAN63232.1| hypothetical protein VITISV_004830 [Vitis vinifera]
Length = 468
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 242/439 (55%), Positives = 317/439 (72%), Gaps = 16/439 (3%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILE----QKDSNHGFETFFYNQTIDHFSYG 66
+L +VIF ST+ + RLG + R+ + E + + F+TF+YNQT+DHF+Y
Sbjct: 16 ILALVIFFSTTCVSATQPHLLRLGVLGRTFLREPATFSESVSEEFQTFYYNQTLDHFNYR 75
Query: 67 PESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVI 124
PESY TF QRYV+N K+WGG N+ PI A+LG EA +D ++ GF +NAHQFKAL+V
Sbjct: 76 PESYITFQQRYVVNFKYWGGANASAPIFAYLGXEADLDYDLSGIGFLTDNAHQFKALLVY 135
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGA 184
+EHRYYGKS+P GSR+ A KNA GYFNSAQA+ADYA +L++IK K A ++PVI +GA
Sbjct: 136 IEHRYYGKSIPXGSREEAFKNASTLGYFNSAQAIADYAEVLIYIKKKLLAENSPVIVVGA 195
Query: 185 SYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIK 244
SYGG LA+WFRLKYPHV +G+LASSAPILY+ DITP Y S+V+KDFR+ SE C++TI
Sbjct: 196 SYGGMLASWFRLKYPHVALGALASSAPILYFDDITPHNGYFSIVTKDFREASESCYKTIS 255
Query: 245 KSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRI 304
+SWSEID +AS+P+GL+ILSKKF+ C L N+ ELK+ L+ YS AAQYD P + PV +
Sbjct: 256 ESWSEIDRVASEPNGLSILSKKFRICGQLNNSDELKEFLERTYSIAAQYDAPPSYPVTVV 315
Query: 305 CNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTG--DESDEGWEWQSCS 362
C I+ AP G DIL +I AGVV + GN S CY + T E+ +GW WQ+CS
Sbjct: 316 CGGIDGAPE-GSDILSRIFAGVV-----AFRGNMS-CYYTSNTTDYPIETIQGWGWQTCS 368
Query: 363 EMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTS 421
E+V+P+G+ N +M+ P+NLT + ++C YGV+PRP W+ TYYGGHDI+LILRR S
Sbjct: 369 EIVIPIGRXVNDTMFPLAPFNLTTFSQDCTSLYGVAPRPHWITTYYGGHDIELILRRFAS 428
Query: 422 NIIFSNGMRDPFSRGGWVK 440
NIIFSNG+RDP+S GG +K
Sbjct: 429 NIIFSNGLRDPYSIGGVLK 447
>gi|297734876|emb|CBI17110.3| unnamed protein product [Vitis vinifera]
Length = 405
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 228/393 (58%), Positives = 289/393 (73%), Gaps = 11/393 (2%)
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
GF +NAH+FKAL+V +EHRY GKS+PFGSR+ ALKNA RGYFNSAQA+ADYA +L++I
Sbjct: 23 GFLTDNAHRFKALLVYIEHRYCGKSIPFGSREEALKNASIRGYFNSAQAIADYAEVLIYI 82
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
K K A ++PVI +G SYGG LA+WFRLKYPHV +G LASSAPILY+ DITP Y+S+V
Sbjct: 83 KKKLLAENSPVIVVGGSYGGMLASWFRLKYPHVALGVLASSAPILYFEDITPQNGYYSIV 142
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYS 288
+KDFR+ SE C++TI++SWSEID +AS+P+G++ILSKKF+TC L N+ ELKD LDT+Y
Sbjct: 143 TKDFREASESCYKTIRESWSEIDRVASEPNGISILSKKFRTCDRLNNSDELKDYLDTIYC 202
Query: 289 EAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRT 348
AAQY+ P PV C+ I+ AP G DIL +I AGVV G NS CY
Sbjct: 203 TAAQYNDPPMYPVTMACSGIDGAPE-GSDILSRIFAGVVAY------GGNSSCY-TTSHN 254
Query: 349 GDESDEGWEWQSCSEMVVPMGK-DKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYY 407
E+ EGW WQ+CSEMV+P+G+ D ++M+ P P+NLT +I+ C YGVSPRP WV TYY
Sbjct: 255 PTETSEGWAWQTCSEMVMPIGRGDNDTMFPPAPFNLTTFIQECTSIYGVSPRPHWVTTYY 314
Query: 408 GGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKK 467
GGH+I+LILRR SNIIFSNG+RDP+S GG +K + D + GSHCLD+ A
Sbjct: 315 GGHNIELILRRFASNIIFSNGLRDPYSSGGVLK--NISDSVLAILTVNGSHCLDILPATS 372
Query: 468 SDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
+DP+WLV QRK EV++++ WI QYY D AI K
Sbjct: 373 TDPEWLVMQRKAEVEVIESWIAQYYADLHAITK 405
>gi|357451981|ref|XP_003596267.1| Lysosomal Pro-X carboxypeptidase [Medicago truncatula]
gi|355485315|gb|AES66518.1| Lysosomal Pro-X carboxypeptidase [Medicago truncatula]
Length = 479
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 238/491 (48%), Positives = 326/491 (66%), Gaps = 31/491 (6%)
Query: 12 LLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDS-----NHGFETFFYNQTIDHFSYG 66
LL + F+ S PRL S+ E +++ N + +FY QT+DHF+Y
Sbjct: 9 LLFIFFLCFSVTTTNSLTLPRLSPFSESKTTEYQNTKTFNLNEDMQPYFYEQTLDHFNYL 68
Query: 67 PESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVI 124
+SY+TF QRY+IN +WGG NS PI A+LG E DD + GF +NA FKAL+V
Sbjct: 69 SDSYKTFKQRYIINFNYWGGANSSAPIFAYLGGE---DDIVNTLGFMTDNATSFKALLVY 125
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGA 184
+EHRYYGKSVP NA + GY NSAQALADYA +LL++KD +A +PVI +G
Sbjct: 126 IEHRYYGKSVP-------SFNASY-GYLNSAQALADYAEVLLYLKDSLHAQKSPVIVVGG 177
Query: 185 SYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIK 244
SY G LA WFRLKYPH+ IG+LASSAP+LY+ +ITP + Y+ V+++DF++TS+ C++TI+
Sbjct: 178 SYAGMLAAWFRLKYPHIAIGALASSAPLLYFDNITPASGYNDVITRDFQETSKTCYETIR 237
Query: 245 KSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRI 304
SW EID +ASKP+GL LSKKF TC PL+ ELKD L +Y +AAQY+ P P I
Sbjct: 238 NSWFEIDEVASKPNGLNFLSKKFNTCYPLEQPGELKDYLGNMYQKAAQYNDP---PTTTI 294
Query: 305 CNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDR-TGDESDEGWEWQSCSE 363
C AI+ A + GDDIL +I G+V + N +C +N D+ TG + + W WQ+C+E
Sbjct: 295 CEAIDRA-SYGDDILSRIYGGMVASYG------NKKCNVNPDKYTGAKPFDRWRWQTCTE 347
Query: 364 MVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNI 423
+V+P+G +S++QP+P+N T + +NCK+ +GV PRP W+ +YYGG DI+L+L+R SNI
Sbjct: 348 IVMPIGIGDSSLFQPKPFNFTSFAENCKKDFGVQPRPHWITSYYGGQDIQLVLKRFGSNI 407
Query: 424 IFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKI 483
IFSNG+RDP+S GG + + D + G HC D+ A +SDP WLV QR TEV+I
Sbjct: 408 IFSNGLRDPYSSGGVLN--NISDSLVALPTVNGYHCQDIVPAIESDPAWLVHQRNTEVEI 465
Query: 484 MQGWITQYYDD 494
+Q WI +YYD+
Sbjct: 466 IQSWIKKYYDE 476
>gi|297812433|ref|XP_002874100.1| hypothetical protein ARALYDRAFT_910290 [Arabidopsis lyrata subsp.
lyrata]
gi|297319937|gb|EFH50359.1| hypothetical protein ARALYDRAFT_910290 [Arabidopsis lyrata subsp.
lyrata]
Length = 495
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 246/493 (49%), Positives = 334/493 (67%), Gaps = 19/493 (3%)
Query: 11 LLLMVIFVSTSF-HANGLKLRPRLGRIRRS---RILEQKDSNHGFETFFYNQTIDHFSYG 66
LLL+VIF + S H+ +L ++R + L K ++ G E FF+ QT+DHF+Y
Sbjct: 10 LLLLVIFSTISCTHSKEARLSVFPKKLRYTFDGEKLHYKFADLGIEIFFFEQTLDHFTYT 69
Query: 67 PESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVI 124
P SY+ F QRY +NSK+W GG N+PILA+LGAE+ +D + + GF +NA FKAL+V
Sbjct: 70 PGSYKKFRQRYAVNSKYWEGGKTNAPILAYLGAESSLDSELSVLGFLKDNAPHFKALMVY 129
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGA 184
+EHR+YG+++PFGS + LKNA+ GY N+AQALADYA+ILLHIK+ Y+A H+PVI IG
Sbjct: 130 IEHRFYGETMPFGSAEETLKNAKTLGYLNAAQALADYAAILLHIKETYSAKHSPVIVIGG 189
Query: 185 SYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIK 244
SYGG LA WF+LKYPH+ +G+LASSAP+LY+ D P Y +V+K F++TS++C I+
Sbjct: 190 SYGGMLAAWFKLKYPHIALGALASSAPLLYFEDTLPKHGYFYIVTKVFKETSQKCHNKIR 249
Query: 245 KSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRI 304
KSW EID IA+KP+GL+ILSKKFK C PL +T ELK L +Y+ AQY+ + V +
Sbjct: 250 KSWDEIDRIAAKPNGLSILSKKFKLCNPLNDTIELKSYLSNIYAGTAQYNN-NPYSVASL 308
Query: 305 CNAIENA-PNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSE 363
C AI + PN D+L +I AGVV + G N CY + T D W WQSCSE
Sbjct: 309 CEAINTSPPNTKSDLLDQIFAGVVAS------GGNISCYGMDQITNDA--RAWTWQSCSE 360
Query: 364 MVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSN 422
MV+P+G +K ++M+QP+P+N++ + KNC+ QYGVSPRP WV Y+G D+KLI RR +N
Sbjct: 361 MVMPIGYEKEDTMFQPKPFNMSSFTKNCESQYGVSPRPHWVTAYFGSQDVKLIFRRFGNN 420
Query: 423 IIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVK 482
IIFSNG+ DP+S GG ++ D + GSHC D+ K DP+WLV+QR+ EVK
Sbjct: 421 IIFSNGLLDPYSVGGVLEDIS--DTVIAITTRDGSHCQDIVLKSKEDPEWLVEQREKEVK 478
Query: 483 IMQGWITQYYDDF 495
I+ WI+ Y D
Sbjct: 479 IIDSWISTYQKDL 491
>gi|297812427|ref|XP_002874097.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297319934|gb|EFH50356.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 501
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 242/507 (47%), Positives = 331/507 (65%), Gaps = 24/507 (4%)
Query: 6 FSFQWLLLMVIFVSTSFHANGLKLR-PRLGRIRRSRILE-------QKDSNHGFETFFYN 57
++ LLL++ STS+ + + RLG S++L+ QK + +++N
Sbjct: 3 LAYSILLLLISSTSTSYFISFAHSKIARLGI--SSKMLKNAPDGSTQKIDESDLKMYYFN 60
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSGFTYENA 115
QT+DHF++ P+SY TF QRY INS WGG N+PILAFLG E+ +D ++ GF +N
Sbjct: 61 QTLDHFTFTPKSYMTFQQRYAINSSHWGGAKANAPILAFLGEESSLDSDLSGIGFLRDNG 120
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT 175
KAL+V +EHRYYGK++PFGS + ALKNA GY N+AQALADYA+ILLH+K+KY+
Sbjct: 121 PHLKALLVYIEHRYYGKTMPFGSAEEALKNASTLGYLNAAQALADYAAILLHVKEKYSTK 180
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
H+P+I IG SYGG LA WFRLKYPH+ +G+LASSAP+LY+ D P Y+ +V+K F++T
Sbjct: 181 HSPIIVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDTRPKIGYYYIVTKVFKET 240
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDT 295
SE C+ I+KSW EID +A KP+GL+ILSK FKTC PL + +KD LDT+Y+EA QY+
Sbjct: 241 SERCYTRIRKSWKEIDRVAVKPNGLSILSKTFKTCAPLNGSFNIKDFLDTIYAEAVQYNR 300
Query: 296 PSNIPVKRICNAIE-NAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-INEDRTGDESD 353
V +CNAI N PN D +L +I AGVV N CY N +
Sbjct: 301 GPTYWVANVCNAINANTPNRKDTLLDRIFAGVVALI------GNRTCYDTNMFSQPTNNH 354
Query: 354 EGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDI 412
W WQSCSE+V+P+G DK ++M+ P+N+T YI C+ YGV PRP W+ TY+G D+
Sbjct: 355 IAWRWQSCSEIVIPVGYDKQDTMFPTAPFNMTSYIDGCESYYGVPPRPHWITTYFGIQDV 414
Query: 413 KLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDW 472
KLILRR SNIIFSNG+ DP+S GG ++ ++ + + GSHC D+ K DP W
Sbjct: 415 KLILRRFGSNIIFSNGLSDPYSVGGVLEDISDTVVAITTN---GSHCQDISLKNKEDPQW 471
Query: 473 LVQQRKTEVKIMQGWITQYYDDFKAIN 499
LV QR+ E+K++ WI+ Y +D + +N
Sbjct: 472 LVMQREKEIKVIDSWISTYQNDLRDLN 498
>gi|297812429|ref|XP_002874098.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297319935|gb|EFH50357.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 427
Score = 457 bits (1176), Expect = e-126, Method: Compositional matrix adjust.
Identities = 227/434 (52%), Positives = 302/434 (69%), Gaps = 14/434 (3%)
Query: 71 QTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHR 128
TF QRY I++K W G N+PILAFLG EA ++ ++ GF +NA FKAL V +EHR
Sbjct: 1 MTFQQRYAIDAKHWAGAKANAPILAFLGLEASLETDLAAFGFLSDNAPHFKALKVYIEHR 60
Query: 129 YYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGG 188
YYGK++PFGS K A+KNA GY NSAQALADYA+ILLHIK+KY+ATH+P+I +G SYGG
Sbjct: 61 YYGKTIPFGSAKEAMKNASTLGYLNSAQALADYAAILLHIKEKYSATHSPIIVVGGSYGG 120
Query: 189 ELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWS 248
LA WFRLKYPH+ +G+LASSAP+LY+ D P Y+ +++K F++T++ C+ TI+KSW
Sbjct: 121 MLAAWFRLKYPHIALGALASSAPLLYFEDTRPKFGYYYIITKVFKETNKRCYNTIRKSWE 180
Query: 249 EIDNIASKPDGLAILSKKFKTC-TPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNA 307
EID +ASKP+GL ILSKKFKTC +PL + +LKD LDTVY+E QY+ + V +CNA
Sbjct: 181 EIDRVASKPNGLLILSKKFKTCASPLSRSFDLKDFLDTVYAETVQYN--DGVWVTNVCNA 238
Query: 308 IE-NAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVV 366
I N PN DIL +I AGVV G+ S N +D W WQ CSE+VV
Sbjct: 239 INANPPNRKIDILDRIFAGVV-----ALTGSQSCYNTNYSVQVTNNDMAWRWQCCSEIVV 293
Query: 367 PMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIF 425
P+G DK ++MYQ P+N+T YI++C+ YGVSPRP W+ TY+G ++KLIL+R SNIIF
Sbjct: 294 PVGHDKQDTMYQTSPFNMTSYIEDCESSYGVSPRPHWITTYFGIQNVKLILQRFGSNIIF 353
Query: 426 SNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQ 485
SNG+ DP+S GG ++ D + GSHC D++ K DP+WLV QR+ E+K++
Sbjct: 354 SNGLSDPYSVGGVLE--DISDTVVAITTKNGSHCQDINLKSKGDPEWLVMQREKEIKVIN 411
Query: 486 GWITQYYDDFKAIN 499
WI+ Y +D + +N
Sbjct: 412 SWISTYQNDLRDLN 425
>gi|225436414|ref|XP_002272152.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
Length = 493
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 222/444 (50%), Positives = 300/444 (67%), Gaps = 14/444 (3%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLS 108
F T+FYNQT+DHF+Y PESY+TF QRY++NS++WGG NS PI + G EA I +
Sbjct: 48 FVTYFYNQTLDHFNYKPESYRTFQQRYIMNSEYWGGANSSSPIFVYTGDEASITAVAAFA 107
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
GF E A +F L++ +EHRYYG SVPFGS+ A N GYF S QALADYA ++ ++
Sbjct: 108 GFIVELASRFNGLLLYIEHRYYGDSVPFGSKDEAFSNTSTLGYFTSTQALADYAELITNL 167
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
K +A + PVIAIG SYGG LA+WFRLKYPH+VIG+LASSAPILY+ DITP YH +V
Sbjct: 168 KKNLSAENCPVIAIGGSYGGMLASWFRLKYPHIVIGALASSAPILYFDDITPGNAYHVIV 227
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYS 288
+KDFR+TSE C+ TI+ SWSEID +A++P+GLA LS+ F TC PL ++ ELK L Y
Sbjct: 228 TKDFRETSESCYSTIRDSWSEIDKVAAEPNGLANLSQIFMTCEPLNSSQELKYYLALCYV 287
Query: 289 EAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRT 348
+AQ D P PVK++C+AI+ AP G DI+ ++AAG+ + C+ D
Sbjct: 288 VSAQNDNPPAYPVKKVCDAIDGAPE-GTDIIGRVAAGLNAS-------VGPPCHFVYDFK 339
Query: 349 GDESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYY 407
E W WQ+C+EMV+P+G N +M+Q P++L + K C++ +GV+PRP W+ T +
Sbjct: 340 PSNRSE-WTWQTCTEMVMPIGHGANDTMFQAWPFDLNNHTKTCQDLFGVTPRPHWITTEF 398
Query: 408 GGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKK 467
GGHDIK ++ SNIIFSNG+RDP+S GG ++ D + + G+HCLDL
Sbjct: 399 GGHDIKSVVGNFASNIIFSNGLRDPYSAGGVLQDIS--DSVVAIYTDKGAHCLDLSTPTA 456
Query: 468 SDPDWLVQQRKTEVKIMQGWITQY 491
+DPDWLV Q++ EVKI+ W+ +Y
Sbjct: 457 TDPDWLVSQQEKEVKIIGLWLAEY 480
>gi|255565519|ref|XP_002523750.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537054|gb|EEF38690.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
Length = 491
Score = 452 bits (1162), Expect = e-124, Method: Compositional matrix adjust.
Identities = 231/498 (46%), Positives = 320/498 (64%), Gaps = 19/498 (3%)
Query: 1 MKSSQFSFQWLLLMVIFV-STSFHANGLKLRPRLGRIRRSRILE-QKDSNHGFETFFYNQ 58
M S F ++LL++ V +++ H L R G ++R E +E +Y Q
Sbjct: 1 MASPSFQLCFVLLLLAAVCASAVHPRELTRLTRFGGVKRFAASEFSYQLPPEYEIHYYTQ 60
Query: 59 TIDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAH 116
T+DHF+Y P+SY TF QRY++N K+WGG N SPI + G E + ++ A
Sbjct: 61 TLDHFNYNPQSYATFQQRYILNFKYWGGANTSSPIFVYTGEEVDVTYDVDT---ILHLAA 117
Query: 117 QFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATH 176
+FKAL++ +EHRYYG+S+PFGS A +N+ GY +S QALADYA ++ +K K +A +
Sbjct: 118 RFKALLLYIEHRYYGESMPFGSEDQAFQNSSTLGYLSSEQALADYAQVVTDVKKKLSAEN 177
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
P IA+GASYGG LA WFRLKYPH+VIGSLASS+PILY+ DITP YH VV+KD+RDTS
Sbjct: 178 CPAIAVGASYGGMLAAWFRLKYPHIVIGSLASSSPILYFDDITPQNGYHVVVTKDYRDTS 237
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTP 296
E C+ TIK+SWSEID +A++P+GL LS F TC+PL ++TEL++ L+ +Y AAQYD P
Sbjct: 238 ESCYNTIKQSWSEIDRVAAQPNGLLTLSNMFNTCSPLNSSTELREYLEILYVSAAQYDNP 297
Query: 297 SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDE-G 355
PV+ C I+ AP G DIL +I AG+ + S CY + T D S++
Sbjct: 298 PYNPVQNTCRGIDGAPP-GTDILGRIVAGLKSRIP-----SWSSCY--DVPTWDLSNKSA 349
Query: 356 WEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKL 414
W+WQ+C+EMV P+G N +M+QP P+++ Y + C + +G+ PRP W T +GGHDIK
Sbjct: 350 WDWQTCTEMVFPIGYGYNETMFQPSPFDINNYTEACVQVFGIKPRPQWATTEFGGHDIKT 409
Query: 415 ILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLV 474
+L SNIIF+NG+RDP+S GG ++ D + G+HCLDL +DPDWLV
Sbjct: 410 VLGNFASNIIFANGLRDPWSAGGVLE--DISDTVVAVYTEYGAHCLDLYPPTPNDPDWLV 467
Query: 475 QQRKTEVKIMQGWITQYY 492
+QR E+KI+ WI +YY
Sbjct: 468 EQRDKEIKIIAAWIAEYY 485
>gi|30688688|ref|NP_851059.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|10177233|dbj|BAB10607.1| prolylcarboxypeptidase-like protein [Arabidopsis thaliana]
gi|332005705|gb|AED93088.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 502
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 226/461 (49%), Positives = 315/461 (68%), Gaps = 13/461 (2%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPI 101
QK + +++NQT+DHF++ PESY TF QRY I+S WGG N+PILAFLG E+ +
Sbjct: 47 QKVDESNLKMYYFNQTLDHFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEESSL 106
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
D ++ GF +N + AL+V +EHRYYG+++PFGS + ALKNA GY N+AQALADY
Sbjct: 107 DSDLAAIGFLRDNGPRLNALLVYIEHRYYGETMPFGSAEEALKNASTLGYLNAAQALADY 166
Query: 162 ASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
A+ILLH+K+KY+ H+P+I IG SYGG LA WFRLKYPH+ +G+LASSAP+LY+ D P
Sbjct: 167 AAILLHVKEKYSTNHSPIIVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDTRPK 226
Query: 222 TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
Y+ +V+K F++ SE C+ TI+ SW EID +A KP+GL+ILSK+FKTC PL + ++KD
Sbjct: 227 FGYYYIVTKVFKEASERCYNTIRNSWIEIDRVAGKPNGLSILSKQFKTCAPLNGSFDIKD 286
Query: 282 GLDTVYSEAAQYDTPSNIPVKRICNAIE-NAPNCGDDILCKIAAGVVEADSLEYDGNNSR 340
LDT+Y+EA QY+ N V ++CNAI N PN ++L +I AGVV N
Sbjct: 287 FLDTIYAEAVQYNRGPNFWVAKVCNAINANPPNRRYNLLDRIFAGVVALVG------NRT 340
Query: 341 CYINEDRTGDESDE-GWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSP 398
CY + ++ W WQSCSE+V+P+G DK ++M+ P+N+T YI CK +GV+P
Sbjct: 341 CYDTKMFAQPTNNNIAWRWQSCSEIVMPVGYDKQDTMFPTAPFNMTSYIDGCKSYHGVTP 400
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
RP W+ TY+G ++KLIL++ SNIIFSNG+ DP+S GG ++ D + GSH
Sbjct: 401 RPHWITTYFGIQEVKLILQKFGSNIIFSNGLSDPYSVGGVLEDIS--DTLVAITTKNGSH 458
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAIN 499
CLD+ K DP+WLV QR+ E+K++ WI+ Y +D + +N
Sbjct: 459 CLDITLKSKEDPEWLVIQREKEIKVIDSWISTYQNDLRDLN 499
>gi|21537161|gb|AAM61502.1| prolylcarboxypeptidase-like protein [Arabidopsis thaliana]
Length = 502
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 226/461 (49%), Positives = 315/461 (68%), Gaps = 13/461 (2%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPI 101
QK + +++NQT+DHF++ PESY TF QRY I+S WGG N+PILAFLG E+ +
Sbjct: 47 QKVDESDLKMYYFNQTLDHFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEESSL 106
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
D ++ GF +N + AL+V +EHRYYG+++PFGS + ALKNA GY N+AQALADY
Sbjct: 107 DSDLAAIGFLRDNGPRLNALLVYIEHRYYGETMPFGSAEEALKNASTLGYLNAAQALADY 166
Query: 162 ASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
A+ILLH+K+KY+ H+P+I IG SYGG LA WFRLKYPH+ +G+LASSAP+LY+ D P
Sbjct: 167 AAILLHVKEKYSTNHSPIIVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDTRPK 226
Query: 222 TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
Y+ +V+K F++ SE C+ TI+ SW EID +A KP+GL+ILSK+FKTC PL + ++KD
Sbjct: 227 FGYYYIVTKVFKEASERCYNTIRNSWKEIDRVAGKPNGLSILSKQFKTCDPLNGSFDIKD 286
Query: 282 GLDTVYSEAAQYDTPSNIPVKRICNAIE-NAPNCGDDILCKIAAGVVEADSLEYDGNNSR 340
LDT+Y+EA QY+ N V ++CNAI N PN ++L +I AGVV N
Sbjct: 287 FLDTIYAEAVQYNRGPNFWVAKVCNAINANPPNRRYNLLDRIFAGVVAL------VGNRT 340
Query: 341 CYINEDRTGDESDE-GWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSP 398
CY + ++ W WQSCSE+V+P+G DK ++M+ P+N+T YI CK +GV+P
Sbjct: 341 CYDTKMFAQPTNNNIAWRWQSCSEIVMPVGYDKQDTMFPTAPFNMTSYIDGCKSYHGVTP 400
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
RP W+ TY+G ++KLIL++ SNIIFSNG+ DP+S GG ++ D + GSH
Sbjct: 401 RPHWITTYFGIQEVKLILQKFGSNIIFSNGLSDPYSVGGVLEDIS--DTLVAITTKNGSH 458
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAIN 499
CLD+ K DP+WLV QR+ E+K++ WI+ Y +D + +N
Sbjct: 459 CLDITLKSKEDPEWLVIQREKEIKVIDSWISTYQNDLRDLN 499
>gi|359479405|ref|XP_002272041.2| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Vitis vinifera]
Length = 491
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 230/470 (48%), Positives = 309/470 (65%), Gaps = 25/470 (5%)
Query: 32 RLGRIRRS-RILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS- 89
RLG + +S + + + F T++YNQT+DHF+Y PESY F QRY+INS +WGG NS
Sbjct: 30 RLGMVSKSMKSALNAELSSDFVTYYYNQTLDHFNYRPESYTNFQQRYLINSAYWGGANSS 89
Query: 90 -PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARH 148
PI + G E I +GF + A +FK L++ +EHRYYG SVPF S+ A N
Sbjct: 90 SPIFVYTGDEGSITGAAAFAGFMVDLASRFKGLLLYIEHRYYGDSVPFRSKDIAFNNTST 149
Query: 149 RGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLAS 208
GYF+S QALADYA ++ ++K +A + PVIAIG SYGG LA+WFRLKYPHVVIG+LAS
Sbjct: 150 LGYFSSTQALADYAELITNLKKNLSAENCPVIAIGGSYGGMLASWFRLKYPHVVIGALAS 209
Query: 209 SAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFK 268
SAPILY+ DITP Y S+V+KDFR+TSE C+ TIK+SWSEID +A +P+GLA LS+ F+
Sbjct: 210 SAPILYFDDITPHNGYDSIVTKDFRETSETCYTTIKQSWSEIDEVAGQPNGLANLSQIFR 269
Query: 269 TCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVE 328
TC PL ++ +LK L+ Y +AQYD P V ICNAI+ AP G IL ++A GV
Sbjct: 270 TCEPLNSSQQLKLYLEYTYEASAQYDNPPAHYVSDICNAIDGAPE-GTSILGRVAEGVNA 328
Query: 329 ADSLEYDGNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKY 386
+ C+ I + + + S GW WQ+C+EMV+P G+ +N +M+Q P++L Y
Sbjct: 329 SA-------GPPCHRIYDFQPSNMS--GWLWQTCTEMVMPFGRGENDTMFQASPFDLNNY 379
Query: 387 IKNCKEQYG--VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHF 444
K C++ +G V+PRP W+ T +GGH+IK +L SNIIFSNG+RDP+S GG ++
Sbjct: 380 TKTCQDIFGASVTPRPHWITTEFGGHNIKSVLGNFASNIIFSNGLRDPYSIGGVLQ---- 435
Query: 445 FDLSFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
D+S S G+HCLDL SDPDWLV QR E+KI+ W+ +Y
Sbjct: 436 -DISESVVAVYTLKGAHCLDLGTPMPSDPDWLVAQRDKEIKIVALWLAEY 484
>gi|224109990|ref|XP_002333167.1| predicted protein [Populus trichocarpa]
gi|222835050|gb|EEE73499.1| predicted protein [Populus trichocarpa]
Length = 514
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 240/477 (50%), Positives = 310/477 (64%), Gaps = 34/477 (7%)
Query: 31 PRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP 90
PR+ R + + + GFET FYNQT+DHF+Y PESY TFPQRYVINSK+WGG N+
Sbjct: 38 PRVWRDQPDKTTSCEVDEEGFETCFYNQTLDHFNYRPESYDTFPQRYVINSKYWGGANAS 97
Query: 91 ILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRG 150
IL +LGAEA ID + GF +NA QFK+L+V++EHRYYG+S+P GS RG
Sbjct: 98 ILVYLGAEASIDRYLDAGGFLVDNAVQFKSLLVVIEHRYYGQSIPPGSWG-------KRG 150
Query: 151 YFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
YFNSAQALADYA+I++HIK A ++PVI IG SYGG LA+WFRLKYPH+ +G+LASSA
Sbjct: 151 YFNSAQALADYAAIIIHIKKTLRAQYSPVIVIGGSYGGMLASWFRLKYPHIALGALASSA 210
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
PILY+ DITP Y+SVV+K FR+ SE C+QTIK SWSEID +ASKPDGL++LS KF TC
Sbjct: 211 PILYFDDITPQDAYYSVVTKAFREASETCYQTIKTSWSEIDELASKPDGLSMLSTKFNTC 270
Query: 271 TPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEAD 330
L + +ELKD L +Y+ AAQ ++P PV +C I++ + GDDIL +I GVV
Sbjct: 271 KNLTDASELKDYLRLMYAYAAQNNSPPTYPVNEVCKGIDDDAS-GDDILSRIFRGVV--- 326
Query: 331 SLEYDGNNSRCYINEDRTGDESDE--GWEWQS--------CSEMVVPMGKDK-NSMYQPE 379
Y GN + CY N + +S+ W WQ CS + + +K Y P
Sbjct: 327 --AYYGNRT-CYFNNNAYAYQSEATLDWSWQQSLSFSYIDCSLGISLITINKYKKTYNPI 383
Query: 380 PWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWV 439
W +K ++ R ++++ DIKLIL+R SNIIFSNG+RDP+S GG +
Sbjct: 384 IWP-SKNKPTLNQELSRPCRNAFMIP-----DIKLILQRFGSNIIFSNGLRDPYSSGGVL 437
Query: 440 KTYHFFDLSFSQDLNLGSHCLDLDEAKKS-DPDWLVQQRKTEVKIMQGWITQYYDDF 495
+ D + GSHCLD+ A S DPDWLV QRK EV+I++GWITQYY+D
Sbjct: 438 N--NISDSIVAVTTVNGSHCLDIQRANPSTDPDWLVMQRKKEVEIIEGWITQYYEDL 492
>gi|297734879|emb|CBI17113.3| unnamed protein product [Vitis vinifera]
Length = 623
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 231/473 (48%), Positives = 310/473 (65%), Gaps = 28/473 (5%)
Query: 32 RLGRIRRS-RILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS- 89
RLG + +S + + + F T++YNQT+DHF+Y PESY F QRY+INS +WGG NS
Sbjct: 30 RLGMVSKSMKSALNAELSSDFVTYYYNQTLDHFNYRPESYTNFQQRYLINSAYWGGANSS 89
Query: 90 -PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIV---ILEHRYYGKSVPFGSRKAALKN 145
PI + G E I +GF + A +FK L++ IL+HRYYG SVPF S+ A N
Sbjct: 90 SPIFVYTGDEGSITGAAAFAGFMVDLASRFKGLLLYYLILQHRYYGDSVPFRSKDIAFNN 149
Query: 146 ARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGS 205
GYF+S QALADYA ++ ++K +A + PVIAIG SYGG LA+WFRLKYPHVVIG+
Sbjct: 150 TSTLGYFSSTQALADYAELITNLKKNLSAENCPVIAIGGSYGGMLASWFRLKYPHVVIGA 209
Query: 206 LASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSK 265
LASSAPILY+ DITP Y S+V+KDFR+TSE C+ TIK+SWSEID +A +P+GLA LS+
Sbjct: 210 LASSAPILYFDDITPHNGYDSIVTKDFRETSETCYTTIKQSWSEIDEVAGQPNGLANLSQ 269
Query: 266 KFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAG 325
F+TC PL ++ +LK L+ Y +AQYD P V ICNAI+ AP G IL ++A G
Sbjct: 270 IFRTCEPLNSSQQLKLYLEYTYEASAQYDNPPAHYVSDICNAIDGAPE-GTSILGRVAEG 328
Query: 326 VVEADSLEYDGNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNL 383
V + C+ I + + + S GW WQ+C+EMV+P G+ +N +M+Q P++L
Sbjct: 329 VNASA-------GPPCHRIYDFQPSNMS--GWLWQTCTEMVMPFGRGENDTMFQASPFDL 379
Query: 384 TKYIKNCKEQYG--VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKT 441
Y K C++ +G V+PRP W+ T +GGH+IK +L SNIIFSNG+RDP+S GG ++
Sbjct: 380 NNYTKTCQDIFGASVTPRPHWITTEFGGHNIKSVLGNFASNIIFSNGLRDPYSIGGVLQ- 438
Query: 442 YHFFDLSFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
D+S S G+HCLDL SDPDWLV QR E+KI+ W+ +Y
Sbjct: 439 ----DISESVVAVYTLKGAHCLDLGTPMPSDPDWLVAQRDKEIKIVALWLAEY 487
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 24/43 (55%), Positives = 33/43 (76%)
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE 278
SE C++TI++SW EI+ +AS+P+GL ILSKKF TC + T E
Sbjct: 532 SESCYKTIQESWPEINKVASEPNGLLILSKKFGTCKGEQKTQE 574
>gi|224104755|ref|XP_002313554.1| predicted protein [Populus trichocarpa]
gi|222849962|gb|EEE87509.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 220/451 (48%), Positives = 296/451 (65%), Gaps = 25/451 (5%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLS 108
FE Y Q++DHF++ PESY TF QRY++N K+WGG N SPI +LGAE + N+ LS
Sbjct: 11 FEVHNYTQSLDHFNFKPESYATFQQRYILNYKYWGGANTSSPIFVYLGAEIDVTQNLDLS 70
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
+ A +FK L++ +EHRYYG S+PFGS A +N+ GY +S QALADYA +++ +
Sbjct: 71 --IVDLAARFKGLLLYVEHRYYGVSMPFGSEDEAFQNSSTFGYLSSEQALADYAQVIVDV 128
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
K +A + P IAIG SYGG LA+WFRLKYPH+VIGSLASSAPILY+ DITP YH +V
Sbjct: 129 KKDLSAENCPAIAIGGSYGGMLASWFRLKYPHIVIGSLASSAPILYFDDITPQNGYHVIV 188
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYS 288
SKDFR+TSE C+ TI++SW+EID +AS+ +GL LS F TC+PL ++ +LK + Y
Sbjct: 189 SKDFRETSESCYNTIQQSWAEIDRVASETNGLLNLSNIFTTCSPLNSSKDLKVYTEIAYM 248
Query: 289 EAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRT 348
AAQ D P + PV +IC+AI+ AP+ G DIL ++AAGV N+ + N +
Sbjct: 249 WAAQLDNPPSYPVNKICSAIDGAPS-GTDILGRVAAGV-----------NASVFGNSCHS 296
Query: 349 GD------ESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPS 401
+S WEWQ+C+EMV PMG +N +M+Q +P ++ Y K C + +G+ PRP
Sbjct: 297 ASGSGLSRKSASAWEWQTCTEMVFPMGYGENETMFQSDPLDINNYTKECVDVFGIKPRPH 356
Query: 402 WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLD 461
W+ T +GGHDIK +L SNIIFSNG+RDP+S GG ++ D + GSHCLD
Sbjct: 357 WITTEFGGHDIKTVLGNFASNIIFSNGLRDPWSAGGVLE--DISDSVVALYTEQGSHCLD 414
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
L SDPDWL+ R E KI+ W+ +YY
Sbjct: 415 LYPPTSSDPDWLLALRDKENKIIAYWLAEYY 445
>gi|147792727|emb|CAN64373.1| hypothetical protein VITISV_018663 [Vitis vinifera]
Length = 702
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 222/434 (51%), Positives = 294/434 (67%), Gaps = 23/434 (5%)
Query: 3 SSQFSFQWL--LLMVIFVSTSFHANGLKLRPRLGRIRRS--RILEQKDSNHGF----ETF 54
S +S QWL L+ + +S A + PRLG + R R E + F +TF
Sbjct: 7 SLSYSLQWLPFLIPTLILSCCVSAAQFNV-PRLGPLSRGILRNPEPAAVSESFYKDLKTF 65
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTY 112
FY QT+DHF+Y PESY+TF QRYV+N K WGG + PI A+LGAEAP+D ++ GF
Sbjct: 66 FYAQTLDHFNYRPESYKTFRQRYVMNFKHWGGAKAGAPIFAYLGAEAPLDGDLVNIGFVN 125
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
+NA +F AL++ +EHRYYGKS+PFGS K ALKNA GYFNSAQA+ADYA++L+H+K +
Sbjct: 126 DNAARFNALLIYIEHRYYGKSIPFGSTKVALKNASTLGYFNSAQAIADYAAVLMHVKKRL 185
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
+A ++PVI IG SYGG LA+WFRLKYPH+ +G+LASSAPILY+ +I P Y+S+V+KDF
Sbjct: 186 HAQNSPVIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDEIAPEIGYYSIVTKDF 245
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQ 292
R+ SE C++TI++SWSEID IASKP+GL+ILSK+FKTC L+++ ELKD LD++Y+EAAQ
Sbjct: 246 REASESCYRTIRRSWSEIDRIASKPNGLSILSKRFKTCAHLESSFELKDYLDSIYAEAAQ 305
Query: 293 YDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-INEDRTGDE 351
Y+ P PV +C I N + D L +I G+V CY E E
Sbjct: 306 YNEPPTYPVTVVCKGI-NGASKRTDTLGRIFHGLVAI------AGKRSCYDTKEFNYPTE 358
Query: 352 SDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGH 410
+ GW WQ CSEMV+P+G N +M+QPEP+NL ++IK C Y VSPRP WV TYYGG
Sbjct: 359 TYLGWRWQKCSEMVLPIGHATNDTMFQPEPFNLNRFIKECNSLYSVSPRPHWVTTYYGG- 417
Query: 411 DIKLILRRSTSNII 424
+ +L ST ++
Sbjct: 418 --RTLLSLSTQGMV 429
>gi|255565521|ref|XP_002523751.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537055|gb|EEF38691.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
Length = 489
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 227/499 (45%), Positives = 317/499 (63%), Gaps = 33/499 (6%)
Query: 7 SFQWLLLMVIFV---STSFHANGLKLRPRLG---RIRRSRILEQKDSNHGFETFFYNQTI 60
SFQ +++V+ V +++ H L R G R S Q S+ +E +Y QT+
Sbjct: 5 SFQLCMVLVLLVPACASALHPRKLTRTTRFGGEKRFAASEFSYQLPSD--YEIHYYTQTL 62
Query: 61 DHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQF 118
DHF+Y PESY TF QRY++N K+WGG N SPI + GAE + ++ S E A +F
Sbjct: 63 DHFNYKPESYATFQQRYILNFKYWGGANTSSPIFLYTGAEENLIYHVDRS--IVELAARF 120
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
+ L++ +EHRYYG+S+PFGS + AL+N+ GY +S QALADYA ++ +K +A + P
Sbjct: 121 RGLLLYIEHRYYGESMPFGSEEQALQNSSTLGYLSSEQALADYAQVITDVKKNLSAENCP 180
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
IA+GASYGG LA WFRLKYPH+VIGSLASS+PILY+ DITP YH V+S+ D E
Sbjct: 181 AIAVGASYGGMLAAWFRLKYPHIVIGSLASSSPILYFDDITPQNGYH-VLSRRILD--ES 237
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSN 298
C TIK+SWSEID +A++P+GL LS F TC PL ++ E K+ L+ +Y AAQYD P +
Sbjct: 238 CHNTIKQSWSEIDRVAAQPNGLLTLSNMFNTCRPLVSSAEFKEYLELLYITAAQYDNPPD 297
Query: 299 IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNN---SRCYINEDRTGDESDEG 355
PV+ C I+ AP G DIL +I G+ +G S C+ + T + S+ G
Sbjct: 298 NPVQSTCRGIDGAPP-GTDILGRIVEGL--------NGRIPGWSSCH--DIFTLELSNNG 346
Query: 356 -WEWQSCSEMVVPMG-KDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIK 413
W+WQ+C+EMV P+G D +M+QP P+++ Y K C + +G+ PRP WV T +GGHDIK
Sbjct: 347 SWDWQTCTEMVFPIGYGDNETMFQPSPFDINNYKKECLQVFGIKPRPHWVTTEFGGHDIK 406
Query: 414 LILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWL 473
+L SNIIF+NG+RDP+S GG ++ D + G+HCLDL + DP+WL
Sbjct: 407 TVLGNFASNIIFANGLRDPWSAGGVLEDIS--DSIVAVYTEHGAHCLDLYPSTPDDPNWL 464
Query: 474 VQQRKTEVKIMQGWITQYY 492
V+QR+ E+KI+ W+ +YY
Sbjct: 465 VEQREKEIKIIAAWLAEYY 483
>gi|449530679|ref|XP_004172321.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Cucumis sativus]
Length = 440
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 211/414 (50%), Positives = 287/414 (69%), Gaps = 14/414 (3%)
Query: 31 PRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN-- 88
P LG RR+ + S+ G TF Y Q +DHF+Y P+SY TF QRY+I+ K+W G N
Sbjct: 34 PVLGVQRRAFQSTPQQSD-GLATFXYKQPLDHFNYQPQSYVTFDQRYIIDFKYWEGINPK 92
Query: 89 SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARH 148
+PI A+LGAE+ ID+++ GF A Q+KA+ V LEHR+YGKS+PFGS + A+KN
Sbjct: 93 TPIFAYLGAESDIDNDVPYVGFPLRFASQYKAMSVYLEHRFYGKSIPFGSLEKAMKNGSI 152
Query: 149 RGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLAS 208
RGYFNSAQALADYA +LLHIK + +P+I +GASYGG LA+WFRLKYPH+ +G+LAS
Sbjct: 153 RGYFNSAQALADYAELLLHIKKMFAYDTSPIIVMGASYGGMLASWFRLKYPHIALGALAS 212
Query: 209 SAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPD-GLAILSKKF 267
SAPILY+ +ITP Y+S+VSK F++TS+ C TI++SW EID IA K GL+ILSK+F
Sbjct: 213 SAPILYFDNITPQDGYYSIVSKSFKETSKTCHDTIRRSWGEIDRIAGKTQGGLSILSKQF 272
Query: 268 KTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVV 327
KTC LK ++E+K+ +D+V++ AAQY+ P PV+ IC AI+ +++ ++ AGV+
Sbjct: 273 KTCGKLKTSSEIKNLMDSVFTMAAQYNDPYENPVRGICVAIDEEAKKKSNVIKQVVAGVI 332
Query: 328 EADSLEYDGNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPM---GKDKNSMYQPEPWNL 383
Y G CY + E ++ + WQ CSEMV+P+ G+DKNSM+ P P+
Sbjct: 333 -----AYLGERP-CYDVYEFGYPNDPLNQYGWQVCSEMVMPIGSSGRDKNSMFPPSPFQF 386
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
+ CK+ YGV+PRP W+ T+YGG DIKL+L R SNIIFSNG++DP+S GG
Sbjct: 387 NDFKTMCKDLYGVTPRPHWITTFYGGQDIKLVLHRFGSNIIFSNGLKDPYSSGG 440
>gi|297734880|emb|CBI17114.3| unnamed protein product [Vitis vinifera]
Length = 428
Score = 417 bits (1071), Expect = e-114, Method: Compositional matrix adjust.
Identities = 202/390 (51%), Positives = 269/390 (68%), Gaps = 12/390 (3%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLS 108
F T+FYNQT+DHF+Y PESY+TF QRY++NS++WGG NS PI + G EA I +
Sbjct: 48 FVTYFYNQTLDHFNYKPESYRTFQQRYIMNSEYWGGANSSSPIFVYTGDEASITAVAAFA 107
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
GF E A +F L++ +EHRYYG SVPFGS+ A N GYF S QALADYA ++ ++
Sbjct: 108 GFIVELASRFNGLLLYIEHRYYGDSVPFGSKDEAFSNTSTLGYFTSTQALADYAELITNL 167
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
K +A + PVIAIG SYGG LA+WFRLKYPH+VIG+LASSAPILY+ DITP YH +V
Sbjct: 168 KKNLSAENCPVIAIGGSYGGMLASWFRLKYPHIVIGALASSAPILYFDDITPGNAYHVIV 227
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYS 288
+KDFR+TSE C+ TI+ SWSEID +A++P+GLA LS+ F TC PL ++ ELK L Y
Sbjct: 228 TKDFRETSESCYSTIRDSWSEIDKVAAEPNGLANLSQIFMTCEPLNSSQELKYYLALCYV 287
Query: 289 EAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRT 348
+AQ D P PVK++C+AI+ AP G DI+ ++AAG+ + C+ D
Sbjct: 288 VSAQNDNPPAYPVKKVCDAIDGAPE-GTDIIGRVAAGLNASV-------GPPCHFVYDFK 339
Query: 349 GDESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYY 407
E W WQ+C+EMV+P+G N +M+Q P++L + K C++ +GV+PRP W+ T +
Sbjct: 340 PSNRSE-WTWQTCTEMVMPIGHGANDTMFQAWPFDLNNHTKTCQDLFGVTPRPHWITTEF 398
Query: 408 GGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
GGHDIK ++ SNIIFSNG+RDP+S GG
Sbjct: 399 GGHDIKSVVGNFASNIIFSNGLRDPYSAGG 428
>gi|224058949|ref|XP_002299659.1| predicted protein [Populus trichocarpa]
gi|222846917|gb|EEE84464.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 206/363 (56%), Positives = 261/363 (71%), Gaps = 18/363 (4%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLS 108
FETFF+NQT+DHF+Y PESY FPQRY+INSK+WGG N +PIL +LGAEA ID +
Sbjct: 1 FETFFHNQTLDHFNYRPESYDKFPQRYLINSKYWGGANVSAPILVYLGAEASIDGYRDAA 60
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
GF +NA QFK+L+V +EHRYYG S P G+ RGYF+SAQALADYA+I++ I
Sbjct: 61 GFLDDNAVQFKSLLVFIEHRYYGHSFPPGAWG-------KRGYFSSAQALADYAAIIIDI 113
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
K+ +A ++PVI IG SYGG LA+WFRLKYPH+ +G+LASSAPILY+ DITP Y+SVV
Sbjct: 114 KENRSAQYSPVIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDDITPQDAYYSVV 173
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYS 288
+K+FR+ SE C+QTIK SWSEID +ASKPDGL++LS KF TC L + +ELKD L +Y+
Sbjct: 174 TKEFREASETCYQTIKTSWSEIDELASKPDGLSMLSTKFNTCKNLTDASELKDYLRLMYA 233
Query: 289 EAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRT 348
AAQ ++P PV +C I++ + GDDIL +I GVV Y GN + CY N +
Sbjct: 234 YAAQNNSPPTYPVNEVCKGIDDDAS-GDDILSRIFRGVV-----AYYGNRT-CYFNNNAY 286
Query: 349 GDESDE--GWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTY 406
+S+ W WQ CSEMV+P+G NSM+QP P+NLT +I+ CK YGV PRP WV TY
Sbjct: 287 AYQSEATLDWSWQRCSEMVIPLGVGDNSMFQPNPFNLTDHIERCKSLYGVRPRPHWVTTY 346
Query: 407 YGG 409
YGG
Sbjct: 347 YGG 349
>gi|30688698|ref|NP_197677.2| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|332005706|gb|AED93089.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 439
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 204/399 (51%), Positives = 280/399 (70%), Gaps = 11/399 (2%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPI 101
QK + +++NQT+DHF++ PESY TF QRY I+S WGG N+PILAFLG E+ +
Sbjct: 47 QKVDESNLKMYYFNQTLDHFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEESSL 106
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
D ++ GF +N + AL+V +EHRYYG+++PFGS + ALKNA GY N+AQALADY
Sbjct: 107 DSDLAAIGFLRDNGPRLNALLVYIEHRYYGETMPFGSAEEALKNASTLGYLNAAQALADY 166
Query: 162 ASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
A+ILLH+K+KY+ H+P+I IG SYGG LA WFRLKYPH+ +G+LASSAP+LY+ D P
Sbjct: 167 AAILLHVKEKYSTNHSPIIVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDTRPK 226
Query: 222 TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
Y+ +V+K F++ SE C+ TI+ SW EID +A KP+GL+ILSK+FKTC PL + ++KD
Sbjct: 227 FGYYYIVTKVFKEASERCYNTIRNSWIEIDRVAGKPNGLSILSKQFKTCAPLNGSFDIKD 286
Query: 282 GLDTVYSEAAQYDTPSNIPVKRICNAIE-NAPNCGDDILCKIAAGVVEADSLEYDGNNSR 340
LDT+Y+EA QY+ N V ++CNAI N PN ++L +I AGVV N
Sbjct: 287 FLDTIYAEAVQYNRGPNFWVAKVCNAINANPPNRRYNLLDRIFAGVVALVG------NRT 340
Query: 341 CYINEDRTGDESDE-GWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSP 398
CY + ++ W WQSCSE+V+P+G DK ++M+ P+N+T YI CK +GV+P
Sbjct: 341 CYDTKMFAQPTNNNIAWRWQSCSEIVMPVGYDKQDTMFPTAPFNMTSYIDGCKSYHGVTP 400
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
RP W+ TY+G ++KLIL++ SNIIFSNG+ DP+S GG
Sbjct: 401 RPHWITTYFGIQEVKLILQKFGSNIIFSNGLSDPYSVGG 439
>gi|148906489|gb|ABR16397.1| unknown [Picea sitchensis]
Length = 508
Score = 389 bits (1000), Expect = e-105, Method: Compositional matrix adjust.
Identities = 209/493 (42%), Positives = 296/493 (60%), Gaps = 26/493 (5%)
Query: 8 FQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNH------GFETFFYNQTID 61
F L+ ++ S SF+ + + P L R ++L K + + T Y QT+D
Sbjct: 29 FLILVGCLLDSSYSFNLDNWFMNPSLLTSARGKLLTMKLKSSRRPLSLDYTTNQYMQTLD 88
Query: 62 HFSYGPESYQTFPQRYVINSKFWGG--GNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
HF++ P+ Y+TFPQRY++N +WGG NSPI LG E I + G E+A F+
Sbjct: 89 HFTFRPDGYRTFPQRYLVNKTYWGGPQNNSPIFVCLGNEEDIITQLPYFGIMTEHAADFR 148
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV 179
ALIV +EHRYYG S+PFGS+ + NA GY++++QALADYA ++ +K +A PV
Sbjct: 149 ALIVFIEHRYYGTSMPFGSQDESYANASTLGYYSASQALADYAIVITDLKKNLSADDCPV 208
Query: 180 IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEEC 239
+ G SYGG LA W RLKYPH+ IG+LASS+PILY+ D+TP Y VV+KDFRD S+ C
Sbjct: 209 VVFGGSYGGMLAAWLRLKYPHITIGALASSSPILYFEDMTPHDAYDRVVTKDFRDASDIC 268
Query: 240 FQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNI 299
++ IK+SW+E++ +AS+P GL L+K F TC L + T+L + +AQY+
Sbjct: 269 YRRIKESWAEMEKVASQPGGLQKLAKLFNTCQSLSSWTDLFYWIYPALQVSAQYNFAE-- 326
Query: 300 PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQ 359
VK IC I N+ G DIL ++AAG EY C +N T GW+WQ
Sbjct: 327 -VKAICRVI-NSQARGTDILTRLAAGA------EYANEGLGC-LNLSTTL----SGWDWQ 373
Query: 360 SCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRR 418
+CSEMV+P+ + N +M+ P++L Y C + YGV PRP W+ T +GGH+IK +L+R
Sbjct: 374 TCSEMVIPLAPNANTTMFPSAPFDLKSYFSGCFKTYGVLPRPYWITTEFGGHNIKRVLKR 433
Query: 419 STSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRK 478
SNIIFSNG+RDP+S GG +K + D + G HC D+ + DP+WL +QR+
Sbjct: 434 FGSNIIFSNGLRDPYSSGGVLK--NISDSIVAITAKEGVHCEDIRSSTNDDPNWLKEQRQ 491
Query: 479 TEVKIMQGWITQY 491
E+ I++ WI Y
Sbjct: 492 KEINIIRKWIIDY 504
>gi|312282209|dbj|BAJ33970.1| unnamed protein product [Thellungiella halophila]
Length = 494
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 197/496 (39%), Positives = 294/496 (59%), Gaps = 23/496 (4%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
L + F ++ G +R+++ + S FET ++ Q +DHFS+ PESY
Sbjct: 9 FLFFISFAEATYPPGGFH---HFSSLRQNKKASKSKSELPFETLYFPQNLDHFSFRPESY 65
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+ F Q+Y+INS+FW G PI + G E ID +GF + A +F AL+V +EHR+Y
Sbjct: 66 KVFHQKYLINSRFWRKGG-PIFVYTGNEGDIDWFASNTGFMSDIAPKFGALLVFIEHRFY 124
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGEL 190
G+S PFG K + K+A GY NS QALADYA ++ +K ++ +PV+ G SYGG L
Sbjct: 125 GESTPFG--KKSHKSAETLGYLNSQQALADYAILIRSLKQNLSSEASPVVVFGGSYGGML 182
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI 250
A WFRLKYPH+ IG+LASSAPIL++ +I P T+++ +S+DF+D S CF+ IK+SW E+
Sbjct: 183 AAWFRLKYPHIAIGALASSAPILHFDNIVPLTSFYDAISQDFKDASVNCFEVIKRSWQEL 242
Query: 251 DNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYSEAAQYDTPSNI-------PV 301
+ +++ GL LSKKF+TC L + +D L + +Y+ Y T +N PV
Sbjct: 243 EAVSNMKHGLPELSKKFRTCKGLHSQYAARDWLMSAFIYTAMVNYATAANFMAPLPGYPV 302
Query: 302 KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS-RCYINEDRTGDESDEGWEWQS 360
+++C I+ P ++ AA SL Y+ + S +C+ E T D +GW WQ+
Sbjct: 303 EQMCKIIDGFPRGSSNLDRAFAAA-----SLYYNYSGSEKCFELEQPTDDHGLDGWGWQA 357
Query: 361 CSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRST 420
C+EMV+PM SM+ P + + + C +YGV PRP W+ T +GG I+ +L+R
Sbjct: 358 CTEMVMPMSCSNQSMFPPYDNDYEAFKEQCMSRYGVKPRPHWITTEFGGKRIETVLKRFG 417
Query: 421 SNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTE 480
SNIIFSNGM+DP+SRGG +K ++ G+H DL A K DP+WL +QR+ E
Sbjct: 418 SNIIFSNGMQDPWSRGGVLKNISSSIIALVT--KKGAHHADLRAASKGDPEWLKEQRRQE 475
Query: 481 VKIMQGWITQYYDDFK 496
V I++ WI++Y+ +
Sbjct: 476 VAIIEKWISEYHRALR 491
>gi|30682358|ref|NP_850050.1| alpha/beta-hydrolase domain-containing protein [Arabidopsis
thaliana]
gi|16648801|gb|AAL25591.1| At2g24280/F27D4.19 [Arabidopsis thaliana]
gi|22655366|gb|AAM98275.1| At2g24280/F27D4.19 [Arabidopsis thaliana]
gi|330252462|gb|AEC07556.1| alpha/beta-hydrolase domain-containing protein [Arabidopsis
thaliana]
Length = 494
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 197/496 (39%), Positives = 295/496 (59%), Gaps = 23/496 (4%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
L I ++ G L +R + + + FET ++ Q +DHFS+ P+SY
Sbjct: 9 FLFFSIVAEATYSPGGFH---HLSSLRLKKKVSKSKHELPFETRYFPQNLDHFSFTPDSY 65
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+ F Q+Y+IN++FW G PI + G E ID +GF + A +F+AL+V +EHR+Y
Sbjct: 66 KVFHQKYLINNRFWRKGG-PIFVYTGNEGDIDWFASNTGFMLDIAPKFRALLVFIEHRFY 124
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGEL 190
G+S PFG K + K+A GY NS QALADYA ++ +K ++ +PV+ G SYGG L
Sbjct: 125 GESTPFG--KKSHKSAETLGYLNSQQALADYAILIRSLKQNLSSEASPVVVFGGSYGGML 182
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI 250
A WFRLKYPH+ IG+LASSAPIL++ +I P T+++ +S+DF+D S CF+ IK+SW E+
Sbjct: 183 AAWFRLKYPHITIGALASSAPILHFDNIVPLTSFYDAISQDFKDASINCFKVIKRSWEEL 242
Query: 251 DNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYSEAAQYDTPSNI-------PV 301
+ +++ +GL LSKKF+TC L + +D L VY+ Y T +N PV
Sbjct: 243 EAVSTMKNGLQELSKKFRTCKGLHSQYSARDWLSGAFVYTAMVNYPTAANFMAPLPGYPV 302
Query: 302 KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS-RCYINEDRTGDESDEGWEWQS 360
+++C I+ P ++ AA SL Y+ + S +C+ E +T D +GW++Q+
Sbjct: 303 EQMCKIIDGFPRGSSNLDRAFAAA-----SLYYNYSGSEKCFEMEQQTDDHGLDGWQYQA 357
Query: 361 CSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRST 420
C+EMV+PM SM P + + + C +YGV PRP W+ T +GG I+ +L+R
Sbjct: 358 CTEMVMPMSCSNQSMLPPYENDSEAFQEQCMTRYGVKPRPHWITTEFGGMRIETVLKRFG 417
Query: 421 SNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTE 480
SNIIFSNGM+DP+SRGG +K ++ G+H DL A K DP+WL +QR+ E
Sbjct: 418 SNIIFSNGMQDPWSRGGVLKNISSSIVALVT--KKGAHHADLRAATKDDPEWLKEQRRQE 475
Query: 481 VKIMQGWITQYYDDFK 496
V I++ WI++YY D +
Sbjct: 476 VAIIEKWISEYYRDLR 491
>gi|356503833|ref|XP_003520707.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 508
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 203/509 (39%), Positives = 301/509 (59%), Gaps = 24/509 (4%)
Query: 6 FSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKD---SNHGFETFFYNQTIDH 62
F F + L +F + S + + PR S L+Q+ N + T F+ Q +DH
Sbjct: 5 FQFIIITLFSLFSAPSLTFAFVPILPRFPSSAVSAELKQRSHSSQNGLYRTKFFTQILDH 64
Query: 63 FSYGPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSGFTYENAHQFKA 120
F++ P+S TF QRY+IN FWGG N+PI + G E I+ Q +GF +ENA F+A
Sbjct: 65 FNFNPQSNHTFQQRYLINDTFWGGAKNNAPIFVYTGNEGNIEWFTQNTGFMFENAPSFQA 124
Query: 121 LIVILEHRYYGKSVPFGSRK-AALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV 179
L+V +EHR+YGKS+PFG K A N GY +S QALADYA++++ +K +AT +PV
Sbjct: 125 LLVFIEHRFYGKSIPFGGNKTVAYANTSTLGYLSSTQALADYATLIIDLKKNLSATDSPV 184
Query: 180 IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEEC 239
+ G SYGG LA WFR+KYPHV IG+LASSAPIL++ + +++++++DFR SE C
Sbjct: 185 VVFGGSYGGMLAAWFRMKYPHVAIGALASSAPILHFMGLVSPDIFNNIITQDFRSESENC 244
Query: 240 FQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYSEAAQYDTPS 297
++ IK SW ID+ A+KP G+ +L K F+ C L+ L +Y+ Y TPS
Sbjct: 245 YKVIKGSWDLIDDTANKPGGMELLRKTFRICNDDFGPGSLEGWLRAAWIYTAMTDYPTPS 304
Query: 298 NI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCYINEDRTG 349
N PVK++C AI+++ G++ L K+ A A S+ Y+ +RC+ +D +
Sbjct: 305 NFLNPLPAYPVKKMCEAIDSSV-TGNNRLAKLHA----AASVYYNYTGKARCFDLDDNSD 359
Query: 350 DESDEGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYG 408
GW+WQ+C+EM++P+ G +K S++ W+ C Y V PRP W+ T +G
Sbjct: 360 PHDLGGWQWQACTEMIMPVGGSNKESIFPEYEWSYEARASWCDFFYNVQPRPHWITTEFG 419
Query: 409 GHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKS 468
GH ++ +L+RS SNIIF NG+RDP+S GG +KT ++ G+H +DL + K
Sbjct: 420 GHAVERVLKRSASNIIFFNGLRDPWSGGGVLKTISKTLVAIVA--KKGAHHVDLRFSSKE 477
Query: 469 DPDWLVQQRKTEVKIMQGWITQYYDDFKA 497
DP WL RK EV I+ WI+QY+ D ++
Sbjct: 478 DPQWLKDVRKLEVNIIASWISQYHQDLQS 506
>gi|297821719|ref|XP_002878742.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297324581|gb|EFH55001.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 495
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 197/496 (39%), Positives = 291/496 (58%), Gaps = 23/496 (4%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
L I ++ G L +R+ + + FET ++ Q +DHF + PESY
Sbjct: 9 FLFFSIVAEATYSPGGFH---HLSSLRQQKKASKSKQELPFETRYFPQNLDHFGFTPESY 65
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
F Q+Y+INS+FW G PI + G E ID +GF + A +F+AL+V +EHR+Y
Sbjct: 66 TVFHQKYLINSRFWRKGG-PIFVYTGNEGDIDWFASNTGFMSDIAPKFQALLVFIEHRFY 124
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGEL 190
G+S PFG K + K+A GY +S QALADYA ++ +K ++ +PV+ G SYGG L
Sbjct: 125 GESTPFG--KKSHKSAETLGYLSSQQALADYAILIRSLKQNLSSEASPVVVFGGSYGGML 182
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI 250
A WFRLKYPH+ IG+LASSAPIL++ +I P T+++ +S+DF+D S CF IK+SW E+
Sbjct: 183 AAWFRLKYPHITIGALASSAPILHFDNIVPLTSFYDAISQDFKDASINCFTVIKRSWEEL 242
Query: 251 DNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYSEAAQYDTPSNI-------PV 301
+ +++ +GL LSKKF+TC L++ +D L VY+ Y T +N PV
Sbjct: 243 EAVSTMKNGLQELSKKFRTCKGLQSKYSARDWLSGAFVYTAMVNYPTAANFMAPLPGYPV 302
Query: 302 KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS-RCYINEDRTGDESDEGWEWQS 360
+++C I+ P ++ AA SL Y+ + S +C+ E +T D GW++Q+
Sbjct: 303 EQMCKIIDGFPRGSSNLDRAFAAA-----SLYYNYSGSEKCFEMEQQTDDHGLNGWQYQA 357
Query: 361 CSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRST 420
C+EMV+PM SM P + + + C YGV PRP W+ T +GG I+ +L+R
Sbjct: 358 CTEMVMPMSCSNQSMLPPYDNDYEAFQEQCMSTYGVKPRPHWITTEFGGKRIETVLKRFG 417
Query: 421 SNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTE 480
SNIIFSNGM+DP+SRGG +K ++ G+H DL A K DP+WL +QR+ E
Sbjct: 418 SNIIFSNGMQDPWSRGGVLKNISSSIVALVT--KKGAHHADLRAATKDDPEWLKEQRRQE 475
Query: 481 VKIMQGWITQYYDDFK 496
V I++ WI++YY D +
Sbjct: 476 VSIIEKWISEYYRDLR 491
>gi|255579351|ref|XP_002530520.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
gi|223529924|gb|EEF31852.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
Length = 508
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 197/485 (40%), Positives = 294/485 (60%), Gaps = 27/485 (5%)
Query: 31 PRLGRIRRSRILEQKDS------NHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW 84
PR ++ +QK + N ++ F+ QT+DHF++ P+SYQTF QRY+IN +W
Sbjct: 26 PRFSSSINTQPADQKKTSLSTTPNKLYKEKFFTQTLDHFNFNPKSYQTFQQRYLINDTYW 85
Query: 85 GG--GNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRK-A 141
G N+PI + G E I+ Q +GF ++NA +F AL+V +EHR+YGKS+PFG K
Sbjct: 86 AGPKNNAPIFMYTGNEGEIEWFAQNTGFMFDNAPKFNALLVFVEHRFYGKSIPFGGNKEV 145
Query: 142 ALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHV 201
A NA GY S Q+LADYA+++ +K+ +AT +PV+ G SYGG LA WFRLKYPHV
Sbjct: 146 AYSNASTLGYLTSTQSLADYATLITDLKNNLSATDSPVVVFGGSYGGMLAAWFRLKYPHV 205
Query: 202 VIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLA 261
IG+LASSAPIL + +IT +++++++ DFR SE C++ IK SW +I++ A++ GL
Sbjct: 206 TIGALASSAPILGFVNITSPYSFNNIITHDFRSESENCYKVIKGSWQQIEDTANQHGGLE 265
Query: 262 ILSKKFKTCTPLKNTTELKDGLDT--VYSEAAQYDTPSN-------IPVKRICNAIENAP 312
L K FK C + L++ L T VYS Y TPSN PVK++C AI++ P
Sbjct: 266 KLRKSFKICKNYISAGSLENWLSTAWVYSAMTDYPTPSNFLNPLPAFPVKQMCKAIDD-P 324
Query: 313 NCGDDILCKIAAGVVEADSLEYD-GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPM-GK 370
G+D K A A S+ Y+ + C+ +D + GW+WQ+C+EM++P G
Sbjct: 325 TAGNDTFAKFHA----AASVYYNYSGTATCFDLDDDSDPHGLGGWDWQACTEMILPTGGS 380
Query: 371 DKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMR 430
S++ W+ + CK ++ + PRP+W+ T +GGH+IK+ L+R SNIIF NG+R
Sbjct: 381 TAESIFPASEWDYNDRVTYCKLRFDIDPRPNWITTEFGGHNIKMALKRFASNIIFFNGLR 440
Query: 431 DPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
DP+S GG ++ ++ ++ G+H +DL A DP WL + R+ EVKI+ W++
Sbjct: 441 DPWSGGGVLEDISKSLVALVEE--KGAHHVDLRFATSEDPKWLQEVRQKEVKIIAKWLSD 498
Query: 491 YYDDF 495
YY D
Sbjct: 499 YYQDL 503
>gi|356570875|ref|XP_003553609.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 507
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 204/508 (40%), Positives = 299/508 (58%), Gaps = 25/508 (4%)
Query: 8 FQWLL-LMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKD---SNHGFETFFYNQTIDHF 63
FQ++L L +F S + PR S L+Q+ N + T F+ Q +DHF
Sbjct: 5 FQFILTLFSLFSVPSLTFAFAPILPRFPSSAVSAELKQRSHSSQNGLYRTKFFTQILDHF 64
Query: 64 SYGPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSGFTYENAHQFKAL 121
++ P+S TF QRY+IN FWGG N+PI + G E I+ Q +GF +ENA F+AL
Sbjct: 65 NFNPQSNHTFQQRYLINDTFWGGAKNNAPIFVYTGNEGNIEWFTQNTGFMFENAPSFQAL 124
Query: 122 IVILEHRYYGKSVPFGSRK-AALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVI 180
+V +EHR+YGKS+PFG K A N GY +S QALADYA++++ +K +AT +PV+
Sbjct: 125 LVFIEHRFYGKSIPFGGNKTVAYANTSTLGYLSSTQALADYAALIIDLKKNLSATDSPVV 184
Query: 181 AIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECF 240
G SYGG LA WFR+KYPHV IG+LASSAPIL++ + + S++++DFR SE C+
Sbjct: 185 VFGGSYGGMLAAWFRMKYPHVAIGALASSAPILHFMGMVSPDIFISIITQDFRSESENCY 244
Query: 241 QTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYSEAAQYDTPSN 298
+ IK SW ID+ A+KP G+ +L K F+ C L+ L +Y+ Y TPSN
Sbjct: 245 KVIKGSWDLIDDTANKPGGMELLRKTFRICNDDFGPDSLEGWLRAAWIYTAMTDYPTPSN 304
Query: 299 I-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCYINEDRTGD 350
PVK++C AI+++ G++ L K+ A A ++ Y+ + C+ +D +
Sbjct: 305 FLNPLPAYPVKKMCEAIDSSV-TGNNRLAKLYA----AANVYYNYTGKATCFDLDDNSDP 359
Query: 351 ESDEGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGG 409
GW+WQ+C+EM++P+ G +K S++ W+ C Y V PRP W+ T +GG
Sbjct: 360 HDLGGWQWQACTEMIMPVGGSNKESIFPEYEWSYEARASWCDFFYNVQPRPHWITTEFGG 419
Query: 410 HDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSD 469
H I+ +L+RS SNIIF NG+RDP+S GG +KT ++ G+H +DL + K D
Sbjct: 420 HAIERVLKRSASNIIFFNGLRDPWSAGGVLKTISKTIVAIVA--KKGAHHVDLRYSSKED 477
Query: 470 PDWLVQQRKTEVKIMQGWITQYYDDFKA 497
P WL RK EV I+ WI+QY+ D ++
Sbjct: 478 PQWLKDVRKQEVNIIASWISQYHQDLQS 505
>gi|225440787|ref|XP_002281618.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Vitis vinifera]
Length = 507
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 199/485 (41%), Positives = 293/485 (60%), Gaps = 26/485 (5%)
Query: 28 KLRPRL-GRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG 86
K+ PR I R L +E ++ Q +DHF+Y P+SY+TF QRY+IN K+WGG
Sbjct: 27 KITPRFPSSIVRPEQLSVSSQTELYEAKYFTQLLDHFNYQPQSYRTFQQRYLINDKYWGG 86
Query: 87 GN--SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGS-RKAAL 143
+ +PI + G E I+ Q +GF ++ A F+AL+V +EHR+YGKS+PFG A
Sbjct: 87 ADKLAPIFVYTGNEGDIEWFAQNTGFMFDTAPHFQALLVFIEHRFYGKSIPFGGDTNVAY 146
Query: 144 KNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVI 203
NA GY +S QALADYA++++ +K +AT++PV+ G SYGG LA WFRLKYPHV I
Sbjct: 147 SNASTLGYLSSTQALADYATLIIDLKKNLSATNSPVVVFGGSYGGMLAAWFRLKYPHVAI 206
Query: 204 GSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAIL 263
G+LASSAPIL + +IT T+++++++DFR SE C++ IK SW +I+ A K GL +L
Sbjct: 207 GALASSAPILNFENITSPYTFNNIITQDFRSESENCYKVIKGSWEQIEETAMKNGGLEVL 266
Query: 264 SKKFKTCTPLKNTTELKDGLDT--VYSEAAQYDTPSNI-------PVKRICNAIENAPNC 314
K F+ C + +++ L T VY+ Y TPSN PVK++C AI++ P
Sbjct: 267 RKSFRICKNYISGGAIENWLSTAYVYTAMTDYPTPSNFLSPMPAYPVKQMCKAIDD-PAQ 325
Query: 315 GDDILCKI--AAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPM-GKD 371
G+D K+ AA V Y G + C+ D + W WQ+C+EM++P+ +
Sbjct: 326 GNDTFAKLYGAANVY----YNYTG-TAACFDLADDSDPHGLGEWTWQACTEMIMPVNANN 380
Query: 372 KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRD 431
+ S++ WN + CK + + PRP+W+ T +GGHDIK +L+R SNIIF NG+RD
Sbjct: 381 EESIFPVSTWNYSNRAAFCKFAFDIEPRPNWITTEFGGHDIKRVLKRFGSNIIFFNGLRD 440
Query: 432 PFSRGGWVKTYHFFDLSF-SQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
P+S GG +++ ++ ++D G+H +DL A DP+WL RK EVKI+ W ++
Sbjct: 441 PWSGGGVLESISKSIVAIVAKD---GAHHVDLRFATSEDPEWLQDVRKREVKIITKWFSE 497
Query: 491 YYDDF 495
YY D
Sbjct: 498 YYHDL 502
>gi|148909204|gb|ABR17702.1| unknown [Picea sitchensis]
Length = 509
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 201/484 (41%), Positives = 292/484 (60%), Gaps = 29/484 (5%)
Query: 29 LRPR-LGR---IRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW 84
LRPR LG + R + Q S + +ET ++ Q +DHFS+ TF QRY+IN K+W
Sbjct: 29 LRPRTLGEFSTLNREKSFLQS-SQYEYETKYFTQRLDHFSFKNHKNSTFQQRYLINDKYW 87
Query: 85 GGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAA 142
G PI + G E ID +GF ++ A QF AL+V EHRYYG+S+P+GS+ A
Sbjct: 88 LGAERMGPIFYYCGNEGYIDWFAVNTGFMWDIAPQFGALLVFPEHRYYGESMPYGSQSMA 147
Query: 143 LKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVV 202
K+ Y + QALAD+A++++ +K +A PV+ G SYGG LA W RLKYPH+
Sbjct: 148 YKDGDSLSYLTAEQALADFATLIVDLKKNLSAEACPVVLFGGSYGGMLAAWMRLKYPHIA 207
Query: 203 IGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAI 262
IG+LASSAPIL + DI P T++++VS DF+ SE CF+ I++SW ++ + +GL
Sbjct: 208 IGALASSAPILQFEDIVPSDTFYNLVSNDFKRESENCFKVIQQSWKALETYGERDEGLQN 267
Query: 263 LSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTP---------SNIPVKRICNAIENAPN 313
LSKKF+ C L +T E++D L++ YS A D P P+K +C I++ N
Sbjct: 268 LSKKFRMCRDLNSTDEIEDWLNSAYSNLAMVDYPYPASFLMPLPAYPIKEVCKVIDSFSN 327
Query: 314 CGDDILCKIAAGVVEADSLEYD-GNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKD 371
G D+L +I AGV S+ Y+ +C+ +N+D G + GW WQ+C+EMV+PM +
Sbjct: 328 -GTDVLDRIFAGV----SVYYNYTGEEKCFDVNDDPHG---ENGWNWQACTEMVMPMSSN 379
Query: 372 KNSMYQPE-PWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMR 430
S P+ +++ Y KNC +GV PRP W+ T +GG DIK +L+ SNIIFSNG+
Sbjct: 380 PESSIFPQFTFDIESYTKNCLNMFGVEPRPHWITTEFGGQDIKRVLKNYGSNIIFSNGLL 439
Query: 431 DPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
DP+S GG ++ ++ + LG+H LDL A ++DP WLV+QR E+KI+ W+ +
Sbjct: 440 DPWSGGGVLQNISNSIVALVTE--LGAHHLDLRAATENDPLWLVEQRNAEMKIINKWMNE 497
Query: 491 YYDD 494
YY D
Sbjct: 498 YYQD 501
>gi|297740163|emb|CBI30345.3| unnamed protein product [Vitis vinifera]
Length = 485
Score = 369 bits (946), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 196/476 (41%), Positives = 289/476 (60%), Gaps = 25/476 (5%)
Query: 36 IRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILA 93
I R L +E ++ Q +DHF+Y P+SY+TF QRY+IN K+WGG + +PI
Sbjct: 14 IVRPEQLSVSSQTELYEAKYFTQLLDHFNYQPQSYRTFQQRYLINDKYWGGADKLAPIFV 73
Query: 94 FLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGS-RKAALKNARHRGYF 152
+ G E I+ Q +GF ++ A F+AL+V +EHR+YGKS+PFG A NA GY
Sbjct: 74 YTGNEGDIEWFAQNTGFMFDTAPHFQALLVFIEHRFYGKSIPFGGDTNVAYSNASTLGYL 133
Query: 153 NSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
+S QALADYA++++ +K +AT++PV+ G SYGG LA WFRLKYPHV IG+LASSAPI
Sbjct: 134 SSTQALADYATLIIDLKKNLSATNSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPI 193
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP 272
L + +IT T+++++++DFR SE C++ IK SW +I+ A K GL +L K F+ C
Sbjct: 194 LNFENITSPYTFNNIITQDFRSESENCYKVIKGSWEQIEETAMKNGGLEVLRKSFRICKN 253
Query: 273 LKNTTELKDGLDT--VYSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKI- 322
+ +++ L T VY+ Y TPSN PVK++C AI++ P G+D K+
Sbjct: 254 YISGGAIENWLSTAYVYTAMTDYPTPSNFLSPMPAYPVKQMCKAIDD-PAQGNDTFAKLY 312
Query: 323 -AAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPM-GKDKNSMYQPEP 380
AA V Y G + C+ D + W WQ+C+EM++P+ ++ S++
Sbjct: 313 GAANVY----YNYTG-TAACFDLADDSDPHGLGEWTWQACTEMIMPVNANNEESIFPVST 367
Query: 381 WNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
WN + CK + + PRP+W+ T +GGHDIK +L+R SNIIF NG+RDP+S GG ++
Sbjct: 368 WNYSNRAAFCKFAFDIEPRPNWITTEFGGHDIKRVLKRFGSNIIFFNGLRDPWSGGGVLE 427
Query: 441 TYHFFDLSF-SQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
+ ++ ++D G+H +DL A DP+WL RK EVKI+ W ++YY D
Sbjct: 428 SISKSIVAIVAKD---GAHHVDLRFATSEDPEWLQDVRKREVKIITKWFSEYYHDL 480
>gi|326491047|dbj|BAK05623.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 511
Score = 367 bits (943), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 197/464 (42%), Positives = 280/464 (60%), Gaps = 36/464 (7%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ET +Y Q +DHF+ P SY TFPQRY++N +WGG +P+ + G E I+ +GF
Sbjct: 59 YETRYYTQRLDHFNAAPVSYSTFPQRYLVNGTYWGGKTAPVFVYAGNEGSIELFTNNTGF 118
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+E A +F+A++V +EHRYYG+SVPFGS +AA KNA GY ++ QA+AD+A+++ +K
Sbjct: 119 MWELAPRFRAMLVFIEHRYYGRSVPFGSEEAAFKNASTMGYLSTTQAVADFATLVQSLKA 178
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL-YYGDITPWTTYHSVVS 229
+A APV+ G SYGG LA W R+KYPHVVIG++ASSAPIL +YG P+ ++ ++S
Sbjct: 179 NLSAPAAPVVVFGGSYGGMLAAWMRMKYPHVVIGAVASSAPILGFYGMADPY-AFYDIIS 237
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-LKDGLDT--V 286
DF+ S+ C + SW E+DN S G A L++ FK C +T E + D LDT V
Sbjct: 238 NDFKSESKNCHDVLMNSWKELDNALSNDAGRAQLNRTFKMCR--GSTVEAIPDMLDTAIV 295
Query: 287 YSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNN 338
YS Y T S PVK IC AI++ P G D +I +A ++ Y+ N
Sbjct: 296 YSAMTDYPTESGFLTHLPAYPVKEICRAIDH-PKSGKDTFSRIK----DALTVYYNYTGN 350
Query: 339 SRCYINEDRTGDESDE-------GWEWQSCSEMVV-PMGKDKNSMYQPEPWNLTKYIKNC 390
+ C+ GD S++ GW+WQ+C+EM++ G S+ PEP+N TK + C
Sbjct: 351 AHCF------GDASEDDPYGMFNGWDWQACTEMILMSYGVRNRSVLPPEPFNFTKLLDGC 404
Query: 391 KEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
+ G+ PRP W+ T +GG DI +L+RS SNIIF NG+RDP+S GG +K L+
Sbjct: 405 RASTGLPPRPYWIPTEFGGFDIANVLKRSASNIIFFNGLRDPWSSGGVLKDISRSILALV 464
Query: 451 QDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDD 494
+ GSH +DL + K DP WL Q R+ E +I+ W+ QYY D
Sbjct: 465 EP--KGSHHVDLRFSSKDDPHWLKQVREKETRIIAHWLNQYYKD 506
>gi|118487801|gb|ABK95724.1| unknown [Populus trichocarpa]
Length = 500
Score = 367 bits (942), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 184/460 (40%), Positives = 288/460 (62%), Gaps = 20/460 (4%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
++T ++ Q +DHF++ P+S + F Q+Y++NS +W G +PI + G E I+ +GF
Sbjct: 50 YKTHYFPQVLDHFTFQPKSSKIFYQKYLVNSHYWHRG-APIFVYTGNEGDIEWFAANTGF 108
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+ A +F+AL+V +EHR+YG+S+PFG++ + K+A GY NS QALAD+A ++ +K
Sbjct: 109 LLDIAPKFRALLVFIEHRFYGESMPFGNK--SYKSAETLGYLNSQQALADFALLIRSLKH 166
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
++ +PV+ G SYGG LA WFRLKYPH+ IG+LASSAPIL + DITPW++++ VS+
Sbjct: 167 NLSSEASPVVVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFDDITPWSSFYDAVSQ 226
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYS 288
DF++ S C++ IK SW+E++ ++++ +GLA LS+ F+ C L + + + L + VY+
Sbjct: 227 DFKEASLNCYEVIKGSWAELEALSAQNEGLAELSRTFRACQDLHSLDSVWEWLWSAFVYT 286
Query: 289 EAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS-R 340
Y T +N PV+ +C I+ P+ I V A SL Y+ + + +
Sbjct: 287 AMVNYPTEANFMMPLPAYPVQAMCKIIDGFPSGASKI-----TRVFAAASLYYNYSRAEK 341
Query: 341 CYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP 400
C+ E GW WQ+C+EMV+PM + SM+ ++ ++ ++C + +GV PRP
Sbjct: 342 CFKLEHGPDAHGLHGWNWQACTEMVMPMTCSEESMFPTSSFSYKEFAEDCMKTFGVKPRP 401
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL 460
W+ T +GG I L+L+RS NIIFSNGM+DP+SRGG +K ++ + G+H +
Sbjct: 402 HWITTEFGGKRIDLVLKRSGGNIIFSNGMQDPWSRGGVLKNISSSIIALVTE--KGAHHV 459
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
D A K DP+WL + R+ EV+I+QGWI QYY D K NK
Sbjct: 460 DFRSATKDDPEWLKELRRQEVEIIQGWIDQYYPDLKRTNK 499
>gi|359484787|ref|XP_003633162.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Vitis vinifera]
gi|297735899|emb|CBI18675.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 365 bits (938), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 183/454 (40%), Positives = 277/454 (61%), Gaps = 20/454 (4%)
Query: 53 TFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY 112
T ++ Q +DHF++ P+S F Q+Y+IN+++W G +PI + G E ID +GF
Sbjct: 58 TQYFPQLLDHFTFTPKSSTIFYQKYLINTQYWTHG-APIFVYTGNEGDIDWFASNTGFLL 116
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
+ A F+A++V +EHR+YG+S+PFG K + K+ GY NS QALAD+A ++ +K
Sbjct: 117 DIAPSFRAMLVFIEHRFYGESMPFG--KESYKSPETLGYLNSQQALADFAVLIRSLKQNL 174
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
++ +PV+ G SYGG LA WFRLKYPHV IG+LASSAPIL + DITPW++++ VS+DF
Sbjct: 175 SSEASPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILQFDDITPWSSFYDAVSQDF 234
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYSEA 290
++ S C++ IK SW+E+D +++K GLA +S+ F+TC + + +D L + VY+
Sbjct: 235 KEASLNCYEVIKGSWAELDAMSAKEGGLAEVSRTFRTCKDINSVYSARDWLWSAFVYTAM 294
Query: 291 AQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCY 342
Y T +N PV+ +C I+ P+ ++ AA SL Y+ +C+
Sbjct: 295 VNYPTKANFMMPLPAYPVEEMCKIIDRFPHGATNVSRAFAAA-----SLYYNYSGTEKCF 349
Query: 343 INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSW 402
E+ GW+WQ+C+EMV+P+ SM+ P + ++ C +YGV PRP W
Sbjct: 350 DLENGKDAHGLHGWDWQACTEMVMPLTCSNESMFPPSSFEYKEFADECTRKYGVMPRPHW 409
Query: 403 VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL 462
+ T +GG I+ +L+RS SNIIFSNGM+DP+SRG +K ++ G+H +D
Sbjct: 410 ITTEFGGSRIEQVLKRSASNIIFSNGMQDPWSRGSVLKNISASIVALVT--KKGAHHVDF 467
Query: 463 DEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
A K DPDWL++QR+ EV+I+Q WI Y D K
Sbjct: 468 RFATKEDPDWLIEQRRQEVEILQKWIHDYNADLK 501
>gi|449443023|ref|XP_004139280.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
gi|449493651|ref|XP_004159394.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 499
Score = 365 bits (936), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 194/461 (42%), Positives = 285/461 (61%), Gaps = 23/461 (4%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
FET FY Q +DHF++ P+S + F Q+Y+IN ++W G +PI + G E I+ +GF
Sbjct: 49 FETRFYPQLLDHFTFTPKSSKIFYQKYLINEEYWRNG-APIFVYTGNEGDIEWFAANTGF 107
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+ A +F AL+V +EHR+YG+S PFG+ + +A GY S QALADYA ++ +K
Sbjct: 108 LPDIAPEFHALLVFIEHRFYGESTPFGND--SYNSAETLGYLTSQQALADYAVLIRSLKQ 165
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
++ +PV+ G SYGG LA WFRLKYPH+ IG+LASSAPIL++ +I PW++++ VS+
Sbjct: 166 NLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDNIVPWSSFYDAVSQ 225
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYS 288
DF+D S CF+ IK SW+E+ S+ +GLA LSK F+TC L + + ++D L + VY+
Sbjct: 226 DFKDASLNCFEVIKGSWTELQQEFSE-EGLAELSKTFRTCKNLHSVSSVQDWLWSAFVYT 284
Query: 289 EAAQYDTPSNI-------PVKRICNAIEN-APNCGDDILCKIAAGVVEADSLEYD-GNNS 339
Y T +N PV+ +C I+ AP L K A A SL Y+ +
Sbjct: 285 SMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSK--LNKAFA----AASLYYNYSHGE 338
Query: 340 RCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPR 399
+C+ E+ GW WQ+C+EMV+PM SM+ P ++ ++ +CK++YGVSPR
Sbjct: 339 KCFNVENGPDLHGLSGWNWQACTEMVMPMTCSNQSMFPPSKFDYEEFATDCKKKYGVSPR 398
Query: 400 PSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHC 459
P W+ T YGG I+ +L+R SNIIFSNGM+DP+SRGG ++ ++ + G+H
Sbjct: 399 PHWITTEYGGERIEEVLKRFGSNIIFSNGMQDPWSRGGVLRNISTSIVALVTE--KGAHH 456
Query: 460 LDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
+D A K DPDWLV+QR+ EV+I+ WI ++Y D K K
Sbjct: 457 VDFRSATKDDPDWLVEQRRQEVEIIHQWINEHYADMKQDKK 497
>gi|224142419|ref|XP_002324555.1| predicted protein [Populus trichocarpa]
gi|222865989|gb|EEF03120.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 363 bits (933), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 182/456 (39%), Positives = 286/456 (62%), Gaps = 20/456 (4%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
++T ++ Q +DHF++ P+S + F Q+Y++NS +W G +PI + G E I+ +GF
Sbjct: 14 YKTHYFPQVLDHFTFQPKSSKIFYQKYLVNSHYWHRG-APIFVYTGNEGDIEWFAANTGF 72
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+ A +F+AL+V +EHR+YG+S+PFG++ + K+A GY NS QALAD+A ++ +K
Sbjct: 73 LLDIAPKFRALLVFIEHRFYGESMPFGNK--SYKSAETLGYLNSQQALADFALLIRSLKH 130
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
++ +PV+ G SYGG LA WFRLKYPH+ IG+LASSAPIL + DITPW++++ VS+
Sbjct: 131 NLSSEASPVVVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFDDITPWSSFYDAVSQ 190
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYS 288
DF++ S C++ IK SW+E++ ++++ +GLA LS+ F+ C L + + + L + VY+
Sbjct: 191 DFKEASLNCYEVIKGSWAELEALSAQNEGLAELSRTFRACQDLHSLDSVWEWLWSAFVYT 250
Query: 289 EAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS-R 340
Y T +N PV+ +C I+ P+ I V A SL Y+ + + +
Sbjct: 251 AMVNYPTEANFMMPLPAYPVQAMCKIIDGFPSGASKI-----TRVFAAASLYYNYSRAEK 305
Query: 341 CYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP 400
C+ E GW WQ+C+EMV+PM + SM+ ++ ++ ++C + +GV PRP
Sbjct: 306 CFKLEHGPDAHGLHGWNWQACTEMVMPMTCSEESMFPTSSFSYKEFAEDCMKTFGVKPRP 365
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL 460
W+ T +GG I L+L+RS NIIFSNGM+DP+SRGG +K ++ + G+H +
Sbjct: 366 HWITTEFGGKRIDLVLKRSGGNIIFSNGMQDPWSRGGVLKNISSSIIALVTE--KGAHHV 423
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
D A K DP+WL + R+ EV+I+QGWI QYY D K
Sbjct: 424 DFRSATKDDPEWLKELRRQEVEIIQGWIDQYYPDLK 459
>gi|255565027|ref|XP_002523506.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537213|gb|EEF38845.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
Length = 501
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 185/456 (40%), Positives = 280/456 (61%), Gaps = 20/456 (4%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
++T ++ Q +DHF++ P Y+ F Q+Y+I+S++W +PI + G E I+ +GF
Sbjct: 50 YKTRYFPQLLDHFTFQPNGYKIFYQKYLISSQYWHK-EAPIFVYTGNEGDIEWFAANTGF 108
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+ A +F+AL+V +EHR+YG+S+PFG K + K+A GY NS QALAD+A ++ +K
Sbjct: 109 LLDIAPKFRALLVFIEHRFYGESMPFG--KDSYKSAETLGYLNSQQALADFAVLIRSLKQ 166
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
++ +PV+ G SYGG LATWFRLKYPH+ IG+LASSAPIL + DITPW++++ VS+
Sbjct: 167 NLSSEASPVVVFGGSYGGMLATWFRLKYPHIAIGALASSAPILQFDDITPWSSFYDAVSQ 226
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYS 288
DFR+ S C++ IK SW+E++ ++S+ +GL LS+ F+TC L + + D L + VYS
Sbjct: 227 DFREASLNCYEVIKGSWAELETLSSQKEGLIELSRTFRTCKDLHSLDSVWDWLWSAYVYS 286
Query: 289 EAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSR 340
Y T +N PVK +C I+ P + + V A SL Y+ +
Sbjct: 287 AMVNYPTEANFMKPLPAYPVKEMCKIIDGFPAGASKV-----SRVFAAASLYYNYSRGEK 341
Query: 341 CYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP 400
C+ E+ GW WQ+C+EMV+PM K SM+ P ++ ++ + CK+++GV PR
Sbjct: 342 CFQLENVPDAHGLHGWNWQACTEMVMPMTCSKESMFPPSGYDYKEFAEECKKKFGVMPRQ 401
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL 460
W+ T +GG I +L+R SNIIFSNGM DP+SRGG +K ++ G+H +
Sbjct: 402 HWITTEFGGKRIDKVLKRFGSNIIFSNGMEDPWSRGGVLKNISSSIIAIVT--KKGAHHV 459
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
D A K DP+WL + RK E++I+ WI +YY D K
Sbjct: 460 DFRSATKDDPNWLKEMRKQEIQIIGRWIDEYYADLK 495
>gi|297802936|ref|XP_002869352.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297315188|gb|EFH45611.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 497
Score = 361 bits (927), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 190/456 (41%), Positives = 279/456 (61%), Gaps = 25/456 (5%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F+T ++ Q +DHFS+ PESY+ F Q+Y+I+S FW G PI + G E I+ +GF
Sbjct: 51 FQTRYFPQNLDHFSFQPESYRIFHQKYLISSHFWRKGG-PIFVYTGNEGDIEWFASNTGF 109
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+ A +F+AL+V +EHR+YG+S P A+ GY NS QALADYA ++ +K
Sbjct: 110 MLDIAPKFQALLVFIEHRFYGESKPHNL-------AKTLGYLNSQQALADYAILIRSLKQ 162
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
++ +PV+ G SYGG LA WFRLKYPH+ IG+LASSAPIL + I P +++++VVS+
Sbjct: 163 NLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILQFDKIVPSSSFYNVVSQ 222
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEA 290
DF+D S CF+ IKKSW E++ ++ DGL LSKKF+TC L L+T +++
Sbjct: 223 DFKDASLNCFEVIKKSWRELEVFSTMKDGLQELSKKFRTCKDLHAVYLASRWLETAFTDT 282
Query: 291 AQ--YDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSR- 340
A Y TP+N PV+ +C I+ P ++ AA SL Y+ + S
Sbjct: 283 AMVNYPTPANFMAPLPAYPVEEMCKIIDWFPLEASNLDRAFAAA-----SLYYNYSGSEN 337
Query: 341 CYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP 400
C+ E++T GW WQ+C+EMV+P+ SM+QP ++ ++C ++YGV PRP
Sbjct: 338 CFDIENQTDPHGLNGWYWQACTEMVMPISCSNQSMFQPFEYDEKVDQEDCLKEYGVKPRP 397
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL 460
W+ T +GGH I+++L+R SNIIFSNGM+DP+SR G +K ++F G+H
Sbjct: 398 HWITTEFGGHRIEMVLKRFGSNIIFSNGMQDPWSREGVLKNISSSIIAFVT--KKGAHHT 455
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
DL A K DP+WL +QR+ EV ++ WI++YY D +
Sbjct: 456 DLRAATKDDPEWLKEQRRQEVAEIEKWISEYYSDLR 491
>gi|115469214|ref|NP_001058206.1| Os06g0647400 [Oryza sativa Japonica Group]
gi|51535425|dbj|BAD37324.1| putative prolylcarboxypeptidase isoform 1 [Oryza sativa Japonica
Group]
gi|113596246|dbj|BAF20120.1| Os06g0647400 [Oryza sativa Japonica Group]
gi|215737061|dbj|BAG95990.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 539
Score = 358 bits (920), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 193/467 (41%), Positives = 270/467 (57%), Gaps = 26/467 (5%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW----GGGNSP---ILAFLGAEAPIDD 103
F ++ Q +DHF++ P + F Q+Y++N FW G +P I + G E I+
Sbjct: 82 FTAHYFPQELDHFTFTPNASAVFYQKYLVNDTFWRRSAAAGETPAGPIFVYTGNEGDIEW 141
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
+GF ++ A F AL+V +EHR+YG+S PFG+ + + GY S QALAD+A
Sbjct: 142 FATNTGFMFDIAPSFGALLVFIEHRFYGESKPFGNESNS--SPEKLGYLTSTQALADFAV 199
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
++ +K +A +PV+ G SYGG LA+WFRLKYPHV IG++ASSAPIL + ITPW++
Sbjct: 200 LITSLKHNLSAVSSPVVVFGGSYGGMLASWFRLKYPHVTIGAVASSAPILQFDYITPWSS 259
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL 283
++ VS+D++ S CF IK +W ID S GL LSK F+ C +K+ ++ L
Sbjct: 260 FYEAVSQDYKSESFNCFSVIKAAWDLIDERGSTDAGLLQLSKTFRACKTVKSVYSFRNWL 319
Query: 284 DT--VYSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEY 334
T VY+ Y TP+N P+K +C I P G DI+ K A A SL Y
Sbjct: 320 WTAFVYTAMVDYPTPANFLMNLPAYPIKEMCKIIHGFP-AGADIVDKAFA----AASLYY 374
Query: 335 D-GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
+ + C+ ED GW WQ+C+EMV+PM SM+ P + +C +
Sbjct: 375 NYTGDQTCFQLEDGEDPHGLSGWGWQACTEMVMPMTISNESMFPPFTFTYEGKSDDCFQS 434
Query: 394 YGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL 453
YGV PRP W+ T YGG+ I L+L+R SNIIFSNGMRDP+SRGG +K ++ +
Sbjct: 435 YGVRPRPHWITTEYGGNRIDLVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTE- 493
Query: 454 NLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
G+H LD A K DPDW+V+QR+ EVKI+QGWI QY +D I+K
Sbjct: 494 -KGAHHLDFRSATKDDPDWVVEQRRQEVKIIQGWIDQYNEDLAQISK 539
>gi|218198653|gb|EEC81080.1| hypothetical protein OsI_23902 [Oryza sativa Indica Group]
Length = 539
Score = 358 bits (918), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 193/467 (41%), Positives = 270/467 (57%), Gaps = 26/467 (5%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW----GGGNSP---ILAFLGAEAPIDD 103
F ++ Q +DHF++ P + F Q+Y++N FW G +P I + G E I+
Sbjct: 82 FTAHYFPQELDHFTFTPNASALFYQKYLVNDTFWRRSAAAGETPAGPIFVYTGNEGDIEW 141
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
+GF + A F AL+V +EHR+YG+S PFG+ + + GY S QALAD+A
Sbjct: 142 FATNTGFMFHIAPSFGALLVFIEHRFYGESKPFGNESNS--SPEKLGYLTSTQALADFAV 199
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
++ +K +A +PV+ G SYGG LA+WFRLKYPHV IG++ASSAPIL + ITPW++
Sbjct: 200 LITSLKHNLSAVSSPVVVFGGSYGGMLASWFRLKYPHVTIGAVASSAPILQFDYITPWSS 259
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL 283
++ VS+D++ S CF IK +W ID S GL LSK F+ C +K+ ++ L
Sbjct: 260 FYDAVSQDYKSESFNCFSVIKAAWDLIDEKGSTDAGLLQLSKTFRACKTVKSVYSFRNWL 319
Query: 284 DT--VYSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEY 334
T VY+ Y TP+N P+K +C I+ P G DI+ K A A SL Y
Sbjct: 320 WTAFVYTAMVDYPTPANFLMNLPAYPIKEMCKIIDGFP-AGADIVDKAFA----AASLYY 374
Query: 335 D-GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
+ + C+ ED GW WQ+C+EMV+PM SM+ P + +C +
Sbjct: 375 NYTGDQTCFQLEDGEDPHGLSGWGWQACTEMVMPMTISNESMFPPFTFTYEGKSDDCFQS 434
Query: 394 YGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL 453
YGV PRP W+ T YGG+ I L+L+R SNIIFSNGMRDP+SRGG +K ++ +
Sbjct: 435 YGVRPRPHWITTEYGGNRIDLVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTE- 493
Query: 454 NLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
G+H LD A K DPDW+V+QR+ EVKI+QGWI QY +D I+K
Sbjct: 494 -KGAHHLDFRSATKDDPDWVVEQRRQEVKIIQGWIDQYNEDLAQISK 539
>gi|242096520|ref|XP_002438750.1| hypothetical protein SORBIDRAFT_10g025520 [Sorghum bicolor]
gi|241916973|gb|EER90117.1| hypothetical protein SORBIDRAFT_10g025520 [Sorghum bicolor]
Length = 558
Score = 357 bits (917), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 190/462 (41%), Positives = 271/462 (58%), Gaps = 26/462 (5%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW-------GGGNSPILAFLGAEAPIDD 103
F ++ Q +DHF++ P + F Q+Y++N FW G G P+ + G E I+
Sbjct: 101 FTVHYFAQELDHFTFTPNASMVFRQKYLLNDTFWRRPSAGDGDGAGPLFVYTGNEGDIEW 160
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
+GF ++ A +F AL+V +EHR+YG+S PFG+ + K+A GY S QALAD+A
Sbjct: 161 FATNTGFMFDIAPKFGALLVFIEHRFYGESKPFGND--SYKSAETLGYLTSTQALADFAI 218
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
++ +K +A APV+ G SYGG LA+WFRLKYPHV IG++ASSAPIL + ITPW++
Sbjct: 219 LIRSLKKNLSAEAAPVVVFGGSYGGMLASWFRLKYPHVTIGAVASSAPILQFDYITPWSS 278
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL 283
++ VS+DF+ S CF IK +W +D S GL LSK F+ C +K +++ L
Sbjct: 279 FYDGVSQDFKSESLNCFSVIKGTWDVLDERGSTDKGLLDLSKLFRACKTVKYAYSIRNWL 338
Query: 284 DTVYSEAAQ--YDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEY 334
T +S A Y TP+N PVK +C I+ P G DIL K A A SL Y
Sbjct: 339 WTAFSYTAMVDYPTPANFLENLPAYPVKEMCKIIDGFPT-GADILEKAFA----AASLYY 393
Query: 335 D-GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
+ + C ED +GW+WQ+C+EM++PM SM+ P ++ + C +
Sbjct: 394 NYTGDQTCNKIEDGDDPHGLDGWQWQACTEMIMPMTVSNESMFPPSSFSYDERSDECFQS 453
Query: 394 YGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL 453
+GV PRP W+ T YGG+ I +L+R SNIIFSNGMRDP+SRGG +K ++ +
Sbjct: 454 WGVRPRPHWITTEYGGYKIDKVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTE- 512
Query: 454 NLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
G+H LD + K DPDW+++QR+ EV I+QGWI QY+ D
Sbjct: 513 -KGAHHLDFRSSTKGDPDWVIEQRRQEVDIIQGWIDQYHQDM 553
>gi|326513984|dbj|BAJ92142.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 535
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 188/460 (40%), Positives = 267/460 (58%), Gaps = 21/460 (4%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW--GGGNSPILAFLGAEAPIDDNIQLS 108
F ++ Q +DHF++ P + F Q+Y++N FW P+ + G E I+ +
Sbjct: 85 FTAHYFQQELDHFTFTPNASNLFSQKYLLNDTFWRRKPAAGPLFVYTGNEGDIEWFATNT 144
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
GF ++ A F AL+V +EHR+YG+S PFG+ + K+A GY S QALAD+A ++ +
Sbjct: 145 GFLFDIAPDFGALLVFIEHRFYGESKPFGND--SYKSADTLGYLTSTQALADFAVLITSL 202
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
K + APV+ G SYGG LA+WFRLKYPHV +G++ASSAPIL + DITPW++++ V
Sbjct: 203 KHNLSTVDAPVVVFGGSYGGMLASWFRLKYPHVAMGAVASSAPILQFDDITPWSSFYDTV 262
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--V 286
S+DF+ S CF IK +W +D+ S GL LSK F+ C +K+ L D L T
Sbjct: 263 SQDFKSESLNCFSVIKAAWDVLDDRGSNHTGLLELSKTFRACKTVKSADSLGDWLSTAFT 322
Query: 287 YSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNN 338
Y+ Y TP+N PVK +C I++ P G DI+ K A A SL Y+ +
Sbjct: 323 YTAMVDYPTPANFMMNLPAYPVKEMCKIIDSFPT-GADIIDKAFA----AASLYYNYTGD 377
Query: 339 SRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSP 398
+C+ E +GW+WQ+C+EMV+PM SM+ P ++ C YGV P
Sbjct: 378 QKCFQVEGDDDPHGLDGWDWQACTEMVMPMIVSNESMFPPSSFSYENNSDACLADYGVRP 437
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
R +W+ T YGGH I +L+R SNIIFSNGMRDP+SRGG +K ++ + G+H
Sbjct: 438 RMNWITTEYGGHKIDKVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTE--KGAH 495
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAI 498
LD K DPDW+V+QR+ EV+I+ GWI QY D +
Sbjct: 496 HLDFRSETKDDPDWVVEQRRQEVEIIHGWIDQYNKDIAQM 535
>gi|356563482|ref|XP_003549991.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 513
Score = 356 bits (914), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 198/504 (39%), Positives = 294/504 (58%), Gaps = 28/504 (5%)
Query: 13 LMVIFVSTSFHANGLKLR--PR-LGRIR---RSRILEQKDSNHGFETFFYNQTIDHFSYG 66
L +I + S+ A L L P+ LG+ R+ + +E ++ Q +DHFS+
Sbjct: 17 LSIIIIVLSYPAQPLALNHSPKFLGKFAATARTHSNSEPPPQFHYEKRYFQQRLDHFSFS 76
Query: 67 PESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVI 124
TFPQRY+I+++ W G + PI + G E I+ Q +GF +E A +F A++V
Sbjct: 77 --ELPTFPQRYLISTEHWVGPHRLGPIFFYCGNEGDIEWFAQNTGFVWEIAPRFGAMVVF 134
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGA 184
EHRYYG+SVP+GS + A KNA Y + QALAD++ ++ ++K Y+A PV+ G
Sbjct: 135 PEHRYYGESVPYGSAEEAYKNATTLSYLTAEQALADFSVLITYLKHNYSAKDCPVVLFGG 194
Query: 185 SYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIK 244
SYGG LA W RLKYPH+ +G+LASSAPIL + DI P T++ +VS F+ S CF IK
Sbjct: 195 SYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYDLVSNAFKRESFTCFNYIK 254
Query: 245 KSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTP-------- 296
+SW+EI + +GL +L+K F C LK T +L D + YS A + P
Sbjct: 255 QSWNEIASTGQTNNGLELLTKTFNLCQKLKRTKDLYDWAEAAYSYLAMVNYPYPAEFMMT 314
Query: 297 -SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEG 355
P++ +C I+ P G IL +I GV Y G ++C+ +D S G
Sbjct: 315 LPEHPIREVCRRIDGGP-AGTSILERIYEGV--NVYYNYTG-EAKCFELDDDPHGMS--G 368
Query: 356 WEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKL 414
WEWQ+C+EMV+PM + +SM+ P +N T C +++GV PRP W+ T +GGHDI
Sbjct: 369 WEWQACTEMVMPMSSSQESSMFPPYEYNYTSIQAECLKKFGVKPRPRWITTEFGGHDIHA 428
Query: 415 ILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLV 474
L++ SNIIFSNG+ DP+S GG ++ +S + G+H +DL + K+DPDWLV
Sbjct: 429 TLKKFGSNIIFSNGLLDPWSGGGVLQNISESVVSLVTE--EGAHHIDLRSSTKNDPDWLV 486
Query: 475 QQRKTEVKIMQGWITQYYDDFKAI 498
+QR+TE+K+++GWI+ Y+ +A+
Sbjct: 487 EQRETEIKLIEGWISDYHQKNEAM 510
>gi|357123237|ref|XP_003563318.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Brachypodium
distachyon]
Length = 536
Score = 356 bits (914), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 190/477 (39%), Positives = 272/477 (57%), Gaps = 29/477 (6%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW----------GGGNSPILA 93
+K + + F ++ Q +DHF++ P + + F Q+Y++N FW P+
Sbjct: 69 EKKTKNPFRAHYFPQELDHFTFTPNASRIFYQKYLVNDTFWRKPTGKGRGAAAAPGPVFV 128
Query: 94 FLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFN 153
+ G E I+ SGF ++ A +F AL+V +EHR+YG+S PFG+ + +A GY
Sbjct: 129 YTGNEGDIEWFATNSGFLFDIAPKFSALLVFIEHRFYGESKPFGNE--SYGSAATLGYLT 186
Query: 154 SAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL 213
S QALAD+A ++ +K +A APV+ G SYGG LA+WFRLKYPHV +G+LASSAPIL
Sbjct: 187 STQALADFAVLITSLKHNLSAPVAPVVVFGGSYGGMLASWFRLKYPHVAMGALASSAPIL 246
Query: 214 YYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL 273
+ DITPW++++ S+DF+ S+ CF IK W +D S GL LSK F+ C +
Sbjct: 247 QFDDITPWSSFYDAASQDFKSESKNCFSVIKAVWDVLDERGSNDKGLLQLSKTFRACKTV 306
Query: 274 KNTTELKDGLDT--VYSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAA 324
K L + L T VY+ Y TP+N PVK +C I+ P G DI+ K
Sbjct: 307 KYVDSLSNWLWTAFVYTAMVDYPTPANFLMNLPAYPVKEMCKIIDAFPP-GADIVDK--- 362
Query: 325 GVVEADSLEYD-GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNL 383
A SL Y+ + +C+ E GW WQ+C+EMV+PM SM+ P ++
Sbjct: 363 -AFSAASLYYNYTGDQKCFDVEGGDDPHGLSGWGWQACTEMVMPMTVSNKSMFPPSSFSY 421
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
+ + C YGV PR W+ T YGGH I +L+R SNIIFSNGMRDP+SRGG +K
Sbjct: 422 EEKSEGCLASYGVRPRMHWITTEYGGHKIDKVLKRFGSNIIFSNGMRDPWSRGGVLKNIS 481
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
++ + G+H LD A K DPDW+++QR+ EV+I+QGWI QY D +++
Sbjct: 482 SSIIALVTE--KGAHHLDFRTATKDDPDWVIEQRRQEVEIIQGWIDQYNKDIAHMSQ 536
>gi|326510399|dbj|BAJ87416.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 524
Score = 356 bits (913), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 188/460 (40%), Positives = 267/460 (58%), Gaps = 21/460 (4%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW--GGGNSPILAFLGAEAPIDDNIQLS 108
F ++ Q +DHF++ P + F Q+Y++N FW P+ + G E I+ +
Sbjct: 74 FTAHYFQQELDHFTFTPNASNLFSQKYLLNDTFWRRKPAAGPLFVYTGNEGDIEWFATNT 133
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
GF ++ A F AL+V +EHR+YG+S PFG+ + K+A GY S QALAD+A ++ +
Sbjct: 134 GFLFDIAPDFGALLVFIEHRFYGESKPFGND--SYKSADTLGYLTSTQALADFAVLITSL 191
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
K + APV+ G SYGG LA+WFRLKYPHV +G++ASSAPIL + DITPW++++ V
Sbjct: 192 KHNLSTVDAPVVVFGGSYGGMLASWFRLKYPHVAMGAVASSAPILQFDDITPWSSFYDTV 251
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--V 286
S+DF+ S CF IK +W +D+ S GL LSK F+ C +K+ L D L T
Sbjct: 252 SQDFKSESLNCFSVIKAAWDVLDDRGSNHTGLLELSKTFRACKTVKSADSLGDWLSTAFT 311
Query: 287 YSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNN 338
Y+ Y TP+N PVK +C I++ P G DI+ K A A SL Y+ +
Sbjct: 312 YTAMVDYPTPANFMMNLPAYPVKEMCKIIDSFPT-GADIIDKAFA----AASLYYNYTGD 366
Query: 339 SRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSP 398
+C+ E +GW+WQ+C+EMV+PM SM+ P ++ C YGV P
Sbjct: 367 QKCFQVEGDDDPHGLDGWDWQACTEMVMPMIVSNESMFPPSSFSYENNSDACLADYGVRP 426
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
R +W+ T YGGH I +L+R SNIIFSNGMRDP+SRGG +K ++ + G+H
Sbjct: 427 RMNWITTEYGGHKIDKVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTE--KGAH 484
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAI 498
LD K DPDW+V+QR+ EV+I+ GWI QY D +
Sbjct: 485 HLDFRSETKDDPDWVVEQRRQEVEIIHGWIDQYNKDIAQM 524
>gi|115440161|ref|NP_001044360.1| Os01g0767100 [Oryza sativa Japonica Group]
gi|53792444|dbj|BAD53352.1| putative prolylcarboxypeptidase, isoform 1 [Oryza sativa Japonica
Group]
gi|53793554|dbj|BAD53324.1| putative prolylcarboxypeptidase, isoform 1 [Oryza sativa Japonica
Group]
gi|113533891|dbj|BAF06274.1| Os01g0767100 [Oryza sativa Japonica Group]
Length = 517
Score = 355 bits (911), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 198/477 (41%), Positives = 276/477 (57%), Gaps = 26/477 (5%)
Query: 30 RPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQT----FPQRYVINSKFWG 85
RPR G + E+ + +ET ++ Q +DHFS+ E + F QRY++
Sbjct: 31 RPRRGVVGA----EEAVRGYDYETRYFRQRLDHFSFLEEEGEEGDGFFQQRYLVGRGGGW 86
Query: 86 GGNS-PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALK 144
G PI + G E I SG +E A +F AL+V EHRYYG+S+PFGS+ A
Sbjct: 87 AGAGGPIFFYCGNEGDIAWFAANSGLVWEAATRFAALVVFAEHRYYGESMPFGSKDKAYN 146
Query: 145 NARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIG 204
N++ Y + QALADYA +L +K ++ +PV+ G SYGG LA W RLKYPH+ +G
Sbjct: 147 NSKSLAYLTAEQALADYAVLLTDLKKNLSSEGSPVVLFGGSYGGMLAAWMRLKYPHIAVG 206
Query: 205 SLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILS 264
+LASSAPIL + D+ P T ++ +VS DF+ S CFQTIK SW +D + DGL LS
Sbjct: 207 ALASSAPILQFEDVVPSTIFYDLVSNDFKRESLSCFQTIKDSWKALDAQGNGQDGLLKLS 266
Query: 265 KKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNI---------PVKRICNAIENAPNCG 315
K F C +KNT EL D L + YS A D P P+K +C I+N P+ G
Sbjct: 267 KTFHLCKTIKNTGELSDWLSSAYSYLAMVDYPMPADFMMPLPGNPIKELCTKIDNQPD-G 325
Query: 316 DDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSM 375
IL +I AGV Y G +N+D G +GW+WQ+C+EMV+PM ++SM
Sbjct: 326 TSILERIYAGV--NVYYNYTGTVDCFDLNDDPHG---MDGWDWQACTEMVMPMSYSEDSM 380
Query: 376 YQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSR 435
+ + +N T Y K+C +GV PRP W+ T +GGH+I L+L R SNIIF NG+ DP+S
Sbjct: 381 FPADKFNYTSYEKDCINSFGVEPRPQWITTEFGGHNISLVLERFGSNIIFFNGLLDPWSG 440
Query: 436 GGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
GG +K + + + LG+H +DL A K DPDWLV+ R++E+ I+ GW++ YY
Sbjct: 441 GGVLK--NISESVVAIIAPLGAHHIDLRPASKDDPDWLVRLRESELGIISGWLSDYY 495
>gi|356505400|ref|XP_003521479.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 504
Score = 355 bits (911), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 192/454 (42%), Positives = 276/454 (60%), Gaps = 21/454 (4%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSGFTY 112
F+ QT+DHF+Y P+SYQTF QRY+IN +WGG N+PI ++G E I+ Q +GF +
Sbjct: 56 FFTQTLDHFNYNPQSYQTFQQRYLINDTYWGGAKNNAPIFVYMGNEGDIEWFAQNTGFMF 115
Query: 113 ENAHQFKALIVILEHRYYGKSVPFG-SRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
E A FKAL+V +EHRYYGKS PFG + + A N GY +S QALADYA++++ +K+
Sbjct: 116 ETAPYFKALLVFIEHRYYGKSFPFGGNEEVADANTTTVGYMSSTQALADYATLIIDLKNN 175
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
+AT +PV+ +G SYGG LA WFR+KYPHV IG+LASSAPIL + D+ TY ++++D
Sbjct: 176 LSATDSPVVVVGGSYGGMLAAWFRMKYPHVAIGALASSAPILQFLDLVSPYTYTDIITQD 235
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA 291
++ SE C++ IK SW +I++ A KP GL L K F+ C + L L AA
Sbjct: 236 YKSESENCYKVIKGSWKQIEDTAQKPGGLEQLRKSFRICKHYISAGALVYWLQMALGSAA 295
Query: 292 QYDTPS---------NIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY 342
D P+ PV+++C AI+N + AA + + Y G + C+
Sbjct: 296 MTDYPTPSVFLAPLPAYPVRKMCEAIDNLSAVNETFTKLYAAANIFYN---YTG-TATCF 351
Query: 343 INEDRTGDESDEGWEWQSCSEMVVPMG-KDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPS 401
++ T GW+WQ+C+E+++P+G ++ SM+ P W L CK Y + PRP
Sbjct: 352 FLDNTTAPLG--GWDWQACTELIMPLGANNEGSMFPPYKWKLRDVEFYCKRVYHIQPRPH 409
Query: 402 WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLD 461
W+ T +GGHDIK +L+RS SNIIF NG+RDP+S GG +K + + + G+H +D
Sbjct: 410 WITTEFGGHDIKRVLKRSGSNIIFFNGLRDPWSGGGVLK--NISETIVAIVAKEGAHHVD 467
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
L + DP+WL RK E+KI+ WI+QYY D
Sbjct: 468 LRFSTTEDPEWLKDIRKREIKIIANWISQYYQDL 501
>gi|357152979|ref|XP_003576298.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Brachypodium
distachyon]
Length = 517
Score = 354 bits (908), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 188/481 (39%), Positives = 277/481 (57%), Gaps = 28/481 (5%)
Query: 33 LGRIRRSRILEQKDSNHG------FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG 86
L I R R + + G +ET +Y Q +DHF P SY+TFPQRY++N +WGG
Sbjct: 40 LTHIHRRRPIVAAAAGGGVPTQVRYETKYYEQRLDHFDALPASYRTFPQRYLVNGTYWGG 99
Query: 87 GNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNA 146
SP+ + G E ++ +GF +E A +F+AL++ +EHRYYGKS PFGS +AA +N
Sbjct: 100 KTSPVFLYAGNEGNVELFTNNTGFMWELAPRFRALLLFVEHRYYGKSFPFGSEEAAFRNT 159
Query: 147 RHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSL 206
GY + QA+AD A+++ +K +A APVI G SYGG LA W R+KYPHVV+G++
Sbjct: 160 STVGYLTTTQAVADLATLVQSLKSNLSAHAAPVIVFGGSYGGMLAAWVRMKYPHVVMGAV 219
Query: 207 ASSAPIL-YYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSK 265
ASSAPIL +YG P+ ++ V+S DF+ S+ C + KSW E+D S G A L+
Sbjct: 220 ASSAPILGFYGLADPY-AFYDVISNDFKSESKNCHDVLMKSWGELDKALSNDAGRADLNS 278
Query: 266 KFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSN---------IPVKRICNAIENAPNCGD 316
FK C + D LDT + +A D P++ PVK +C AI++ P G+
Sbjct: 279 TFKMCR-ASTVDAIPDLLDTALTYSAMTDYPTSSGFLTPLPPYPVKEMCRAIDH-PKSGN 336
Query: 317 DILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESD--EGWEWQSCSEMVV-PMGKDKN 373
D +I + + Y+ + + + D +GW+WQ+C+EM++ G
Sbjct: 337 DTFARIKGAL----DVYYNHTGAEPCLGDATESDPYGMFDGWDWQACTEMILMTYGVRNG 392
Query: 374 SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPF 433
+++ PEP+N T + C+ G+ PRP W+ T +GG DIK +LRRS SNIIF NG+RDP+
Sbjct: 393 TVFPPEPFNFTDLLAGCRASTGLPPRPYWIPTEFGGFDIKHVLRRSASNIIFFNGLRDPW 452
Query: 434 SRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
S GG +K+ ++ + GSH +DL + K DP+WL Q R E +I+ W+ QYY
Sbjct: 453 SSGGVLKSISNSIIALVEP--KGSHHVDLRFSTKEDPEWLKQVRIKETRIIAHWLRQYYK 510
Query: 494 D 494
+
Sbjct: 511 E 511
>gi|224088466|ref|XP_002308456.1| predicted protein [Populus trichocarpa]
gi|222854432|gb|EEE91979.1| predicted protein [Populus trichocarpa]
Length = 461
Score = 348 bits (893), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 175/397 (44%), Positives = 254/397 (63%), Gaps = 19/397 (4%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSGFTY 112
F+ Q +DH+++ P+SY+TF QRY+IN K+WGG N+PI + G E I+ Q +GF +
Sbjct: 53 FFTQVLDHYTFRPQSYKTFQQRYLINDKYWGGAEKNAPIFLYTGNEGDIEWFAQNTGFIF 112
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRK-AALKNARHRGYFNSAQALADYASILLHIKDK 171
+ A FK L+V +EHR+YGKS+PFG K A N+ GY S QALADYA++++ +K
Sbjct: 113 DIAPHFKPLLVFIEHRFYGKSMPFGGNKEVAYSNSSTLGYLTSTQALADYATLIIDLKKN 172
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
+AT +PV+ G SYGG LA WFRLKYPHV IG+LASS+PIL + +IT ++++++++D
Sbjct: 173 LSATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSSPILNFENITSPYSFNNIITQD 232
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYSE 289
FR SE C++ IK+SW EI++ AS+P GL IL F+ C + + L+ L T VY+
Sbjct: 233 FRGESENCYKVIKRSWQEIEDTASQPGGLEILRSSFRICRNSISASSLQSWLYTALVYTA 292
Query: 290 AAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRC 341
Y TPSN PVK +C AI++ P G++ K+ A S+ Y+ N+ C
Sbjct: 293 MTDYPTPSNFLNPMPAYPVKEMCKAIDD-PKTGNNTFAKLYG----AASVYYNYSGNATC 347
Query: 342 YINEDRTGDESDEGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP 400
+ +D + GW WQ+C+EM++P G +K+S++ WN CK +GV PRP
Sbjct: 348 FNLDDDSDPHGLGGWSWQACTEMILPTSGNNKDSIFPASEWNYDDRASFCKAYFGVEPRP 407
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
+W+ +GGHDIK +L+R SNIIF NG+RDP+S GG
Sbjct: 408 NWITAEFGGHDIKRVLKRFGSNIIFFNGLRDPWSGGG 444
>gi|224058953|ref|XP_002299661.1| predicted protein [Populus trichocarpa]
gi|222846919|gb|EEE84466.1| predicted protein [Populus trichocarpa]
Length = 437
Score = 348 bits (893), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 179/333 (53%), Positives = 231/333 (69%), Gaps = 16/333 (4%)
Query: 29 LRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN 88
+ PR+ R + + + GFET FYNQT+DHF+Y PESY TFPQRYVINSK+WGG N
Sbjct: 104 IGPRVWRDQPDKTTSCEVDEEGFETCFYNQTLDHFNYRPESYDTFPQRYVINSKYWGGAN 163
Query: 89 SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARH 148
+ IL +LGAEA ID +GF +NA QFK+L+V +EHRYYG S P G+
Sbjct: 164 ASILVYLGAEASIDGYRDAAGFLDDNAVQFKSLLVFIEHRYYGHSFPPGAWG-------K 216
Query: 149 RGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLAS 208
RGYF+SAQALADYA+I++ IK+ +A ++PVI IG SYGG LA+WFRLKYPH+ +G+LAS
Sbjct: 217 RGYFSSAQALADYAAIIIDIKENRSAQYSPVIVIGGSYGGMLASWFRLKYPHIALGALAS 276
Query: 209 SAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFK 268
SAPILY+ DITP Y+SVV+K+FR+ SE C+QTIK SWSEID +ASKPDGL++LS KF
Sbjct: 277 SAPILYFDDITPQDAYYSVVTKEFREASETCYQTIKTSWSEIDELASKPDGLSMLSTKFN 336
Query: 269 TCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVE 328
TC L + +ELKD L +Y+ AAQ ++P PV +C I++ + GDDIL +I GVV
Sbjct: 337 TCKNLTDASELKDYLRLMYAYAAQNNSPPTYPVNEVCKGIDDDAS-GDDILSRIFRGVV- 394
Query: 329 ADSLEYDGNNSRCYINEDRTGDESDE--GWEWQ 359
Y GN + CY N + +S+ W WQ
Sbjct: 395 ----AYYGNRT-CYFNNNAYAYQSEATLDWSWQ 422
>gi|226503900|ref|NP_001142279.1| uncharacterized protein LOC100274448 precursor [Zea mays]
gi|194689380|gb|ACF78774.1| unknown [Zea mays]
gi|194707984|gb|ACF88076.1| unknown [Zea mays]
gi|413943525|gb|AFW76174.1| putative serine peptidase S28 family protein [Zea mays]
Length = 542
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 187/467 (40%), Positives = 268/467 (57%), Gaps = 27/467 (5%)
Query: 47 SNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW--------GGGNSPILAFLGAE 98
S F ++ Q +DHF++ P + F +Y++N FW G P+ + G E
Sbjct: 80 STKPFTAHYFPQLLDHFAFTPNASTVFRHKYLLNDTFWRRPGAGAGDDGPGPLFVYTGNE 139
Query: 99 APIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL 158
I+ +GF ++ A F AL+V +EHR+YG+S PFG+ + ++A GY S QAL
Sbjct: 140 GDIEWFATNTGFMFDIAPTFGALLVFIEHRFYGESKPFGND--SYRSAETLGYLTSTQAL 197
Query: 159 ADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDI 218
AD+A ++ +K A APV+ G SYGG LA+WFRLKYPHV IG+LASSAPIL + I
Sbjct: 198 ADFAVVIRGLKRDLGAEAAPVVVFGGSYGGMLASWFRLKYPHVAIGALASSAPILQFDHI 257
Query: 219 TPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE 278
TPW++++ VS+DF+ S CF I+ +W +D + GL LSK F+ C +K
Sbjct: 258 TPWSSFYDAVSQDFKSESSNCFGVIRAAWDVLDERGATDKGLLDLSKLFRACKTVKYAYS 317
Query: 279 LKDGLDTV--YSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEA 329
+++ L T Y+ Y TP+N PVK +C I+ P G D+L K A A
Sbjct: 318 IRNWLWTAFTYTAMVDYPTPANFLENLPAYPVKEMCKTIDAFP-AGADVLEKAFA----A 372
Query: 330 DSLEYD-GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIK 388
SL Y+ + C ED +GW+WQ+C+EM++PM SM+ P ++
Sbjct: 373 ASLYYNYTGDQACNKIEDGDDPHGLDGWQWQACTEMIMPMTISNESMFPPSAFSYDDRSD 432
Query: 389 NCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
C + +GV PRP W+ T YGG+ I +L+R SNIIFSNGMRDP+SRGG +K ++
Sbjct: 433 ECFQSWGVRPRPHWITTEYGGYKIDKVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIA 492
Query: 449 FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
+ G+H LDL + K DPDWL++QR+ EV+I+QGWI QY+ D
Sbjct: 493 LVTE--KGAHHLDLRSSTKGDPDWLIEQRRQEVEIIQGWIDQYHQDM 537
>gi|22328106|ref|NP_201377.2| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|95147306|gb|ABF57288.1| At5g65760 [Arabidopsis thaliana]
gi|110736177|dbj|BAF00060.1| lysosomal Pro-X carboxypeptidase [Arabidopsis thaliana]
gi|332010719|gb|AED98102.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 515
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 204/504 (40%), Positives = 284/504 (56%), Gaps = 33/504 (6%)
Query: 11 LLLMVIFVSTSFHANGLKLR-----PRLGRI----RRSRILE-QKDSN-HGFETFFYNQT 59
LLL+ F + F +NG L PR R R +RI + + D N + +ET F++Q
Sbjct: 7 LLLIFTFFTLVFPSNGSSLSSSKLLPRFPRYTFQNREARIQQFRGDRNEYRYETKFFSQQ 66
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHFS+ F QRY+INS W G ++ PI + G E I+ SGF ++ A +
Sbjct: 67 LDHFSFA--DLPKFSQRYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSGFIWDIAPK 124
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
F AL+V EHRYYG+S+P+GSR+ A KNA Y + QALAD+A + +K +A
Sbjct: 125 FGALLVFPEHRYYGESMPYGSREEAYKNATTLSYLTTEQALADFAVFVTDLKRNLSAEAC 184
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
PV+ G SYGG LA W RLKYPH+ IG+LASSAPIL + D+ P T++ + S DF+ S
Sbjct: 185 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDVVPPETFYDIASNDFKRESS 244
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTP- 296
CF TIK SW I K +GL L+K F C L +T +L D LD+ YS A D P
Sbjct: 245 SCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPY 304
Query: 297 --------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRT 348
P++ +C I+ A IL +I AG+ + Y GN +++D
Sbjct: 305 PADFMMPLPGHPIREVCRKIDGA-GSNASILDRIYAGI--SVYYNYTGNVDCFKLDDDPH 361
Query: 349 GDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYY 407
G +GW WQ+C+EMV+PM ++ NSM+ +N + Y + C + V+PRP WV T +
Sbjct: 362 G---LDGWNWQACTEMVMPMSSNQENSMFPGYGFNYSSYKEECWNTFRVNPRPKWVTTEF 418
Query: 408 GGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKK 467
GGHDI L+ SNIIFSNG+ DP+S G +K + D + G+H LDL +
Sbjct: 419 GGHDIATTLKSFGSNIIFSNGLLDPWSGGSVLK--NLSDTIVALVTKEGAHHLDLRPSTP 476
Query: 468 SDPDWLVQQRKTEVKIMQGWITQY 491
DP WLV QR+ E++++QGWI Y
Sbjct: 477 EDPKWLVDQREAEIRLIQGWIETY 500
>gi|326499536|dbj|BAJ86079.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 526
Score = 346 bits (887), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 187/456 (41%), Positives = 260/456 (57%), Gaps = 21/456 (4%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW--GGGNSPILAFLGAEAPIDDNIQLS 108
F ++ Q +DHF++ P + F Q+Y++N FW P+ + G E I+ +
Sbjct: 76 FTAHYFQQELDHFTFTPNASNLFSQKYLLNDTFWRRKPAAGPLFVYTGNEGDIEWFATNT 135
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
GF ++ A F AL+V +EHR+YG+S PFG+ + K+A GY S QALAD+A ++ +
Sbjct: 136 GFMFDIAPDFGALLVFIEHRFYGESKPFGND--SYKSADTLGYLTSTQALADFAVLITSL 193
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
K +A APV+ G SYGG LA+WFRLKYPHV IG+LASSAPIL + DITPW++++ V
Sbjct: 194 KQNLSAVDAPVVVFGGSYGGMLASWFRLKYPHVAIGALASSAPILQFEDITPWSSFYEAV 253
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--V 286
S+DF+ S CF IK W + S GL LSK F+ C + L D L T
Sbjct: 254 SEDFKSESLNCFSVIKAVWDVLTVRGSNDTGLLELSKTFRACKTVLLPNSLLDWLSTAFT 313
Query: 287 YSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNN 338
Y+ Y TP+N PVK +C I++ P G D++ K A A SL Y+ +
Sbjct: 314 YTAMVDYPTPANFMQNLPAYPVKEMCKIIDSFP-AGADVVEKAFA----AASLYYNYTGD 368
Query: 339 SRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSP 398
+C+ E GW WQ+C+EMV+PM SM+ P ++ + + C Y V P
Sbjct: 369 QKCFEVEGGDDPHGLSGWGWQACTEMVMPMTVSNESMFPPSGFSYEEKSEGCIAAYDVRP 428
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
R W+ T YGGH I +L+R SNIIFSN MRDP+SRGG +K ++ + G+H
Sbjct: 429 RMHWITTEYGGHKIDKVLKRFGSNIIFSNEMRDPWSRGGVLKNISSSIIALVTE--KGAH 486
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDD 494
LD A K DPDW+V+QR+ EV+I+ GWI QY D
Sbjct: 487 HLDFRSATKDDPDWVVEQRRQEVEIIHGWIDQYNKD 522
>gi|449437430|ref|XP_004136495.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 502
Score = 345 bits (886), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 183/459 (39%), Positives = 276/459 (60%), Gaps = 22/459 (4%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLS 108
++T F+ Q +DHF++ P+SYQ+F QRY+IN +WGG NSPI + G E I+ Q +
Sbjct: 49 YQTSFFTQILDHFNFNPQSYQSFQQRYLINDTYWGGAAHNSPIFVYTGNEGNIEWFAQNT 108
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRK-AALKNARHRGYFNSAQALADYASILLH 167
GF + A F+AL+V +EHR+YGKS+PFG + A N+ GY +S QALADYA+++
Sbjct: 109 GFLLQYAPHFRALVVFIEHRFYGKSIPFGGDEDVANSNSSMLGYLSSTQALADYATLITD 168
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+K +A +PV+ G SYGG LA WFRLKYPH+ +G+LASSAPIL +IT ++++
Sbjct: 169 LKKNLSAVDSPVLVFGGSYGGMLAAWFRLKYPHIALGALASSAPILQLENITSPYAFNNI 228
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT-- 285
V++DF+ S+ C+ IK+SW ID ++ P G +L K FK C + +K+ L T
Sbjct: 229 VTQDFKSESQNCYSVIKESWHLIDITSTHPQGPQLLRKSFKFCKEAE-AESIKNWLSTAI 287
Query: 286 VYSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNN 338
+Y+ Y TPSN PVK++C AI++ P G+D K+ A ++ Y+
Sbjct: 288 IYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDD-PRSGNDSFTKLYG----AANIYYNFTG 342
Query: 339 S-RCYINEDRTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGV 396
+ C+ +D + W WQ+C+EM++P G + K S++ W+ + CK + V
Sbjct: 343 TVTCFDLDDDSDPHDLGDWSWQACTEMILPTGGNTKESIFPASTWHFADRFQFCKTFFDV 402
Query: 397 SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
PR W+ T++GGH+I+ +L+R SNIIF NG+RDP+S GG +K ++ G
Sbjct: 403 EPRRIWIPTHFGGHNIERVLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVA--KEG 460
Query: 457 SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
+H +DL + DP WL RK EV I++ W++QYY D
Sbjct: 461 AHHVDLRFSNPDDPKWLKDVRKQEVNIIEDWLSQYYLDL 499
>gi|195643982|gb|ACG41459.1| lysosomal Pro-X carboxypeptidase precursor [Zea mays]
Length = 542
Score = 344 bits (883), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 186/467 (39%), Positives = 268/467 (57%), Gaps = 27/467 (5%)
Query: 47 SNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW--------GGGNSPILAFLGAE 98
S F ++ Q +DHF++ P + F +Y++N FW G P+ + G E
Sbjct: 80 STKPFTAHYFPQLLDHFAFTPNASTVFRHKYLLNDTFWRRPGAGAGDDGPGPLFVYTGNE 139
Query: 99 APIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL 158
I+ +GF ++ A F AL+V +EH++YG+S PFG+ + ++A GY S QAL
Sbjct: 140 GDIEWFATNTGFMFDIAPTFGALLVFIEHQFYGESKPFGND--SYRSAETLGYLTSTQAL 197
Query: 159 ADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDI 218
AD+A ++ +K A APV+ G SYGG LA+WFRLKYPHV IG+LASSAPIL + I
Sbjct: 198 ADFAVVIRGLKRDLGAEAAPVVVFGGSYGGMLASWFRLKYPHVAIGALASSAPILQFDHI 257
Query: 219 TPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE 278
TPW++++ VS+DF+ S CF I+ +W +D + GL LSK F+ C +K
Sbjct: 258 TPWSSFYDAVSQDFKSESSNCFGVIRAAWDVLDERGATDKGLLDLSKLFRACKTVKYAYS 317
Query: 279 LKDGLDTV--YSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEA 329
+++ L T Y+ Y TP+N PVK +C I+ P G D+L K A A
Sbjct: 318 IRNWLWTAFTYTAMVDYPTPANFLENLPAYPVKEMCKTIDAFP-AGADVLEKAFA----A 372
Query: 330 DSLEYD-GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIK 388
SL Y+ + C ED +GW+WQ+C+EM++PM SM+ P ++
Sbjct: 373 ASLYYNYTGDQACNKIEDGDDPHGLDGWQWQACTEMIMPMTISNESMFPPSAFSYDDRSD 432
Query: 389 NCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
C + +GV PRP W+ T YGG+ I +L+R SNIIFSNGMRDP+SRGG +K ++
Sbjct: 433 ECFQSWGVRPRPHWITTEYGGYKIDKVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIA 492
Query: 449 FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
+ G+H LDL + K DPDWL++QR+ EV+I+QGWI QY+ D
Sbjct: 493 LVTE--KGAHHLDLRSSTKGDPDWLIEQRRQEVEIIQGWIDQYHQDM 537
>gi|224095007|ref|XP_002310325.1| predicted protein [Populus trichocarpa]
gi|222853228|gb|EEE90775.1| predicted protein [Populus trichocarpa]
Length = 515
Score = 342 bits (877), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 187/465 (40%), Positives = 271/465 (58%), Gaps = 22/465 (4%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPI 101
Q+ + +E+ ++ Q +DHFS+ + FPQRY+IN+ W G PI + G E I
Sbjct: 56 QEQQQYRYESKYFYQQLDHFSF--LNLPKFPQRYLINTDHWAGPERRGPIFLYCGNEGDI 113
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
+ +GF +E A F A+++ EHRYYG+S+P+G+R+ A KNA Y + QALAD+
Sbjct: 114 EWFAVNTGFVWEIAPLFGAMVLFPEHRYYGESMPYGNREEAYKNASTLSYLTAEQALADF 173
Query: 162 ASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
A ++ +K +A PV+ G SYGG LA W RLKYPHV IG+LASSAPIL + DI P
Sbjct: 174 AVLITDLKRNLSAQACPVVLFGGSYGGMLAAWMRLKYPHVAIGALASSAPILQFEDIVPP 233
Query: 222 TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
T++++VS DF+ S CF TIK+SW + + K +GL L+K F C LK+T +L +
Sbjct: 234 ETFYNIVSNDFKRESTSCFNTIKESWDALLSEGLKKNGLVQLTKTFHLCRELKSTEDLAN 293
Query: 282 GLDTVYSEAAQYDTP---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSL 332
LD+ YS A D P P+ +C I+ P+ G IL +I G+ +
Sbjct: 294 WLDSAYSYLAMVDYPYPSSFMMPLPGYPIGEVCKRIDGCPD-GTSILERIFEGI--SIYY 350
Query: 333 EYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCK 391
Y G +++D G +GW WQ+C+EMV+PM N SM+ +N + Y + C
Sbjct: 351 NYTGELHCFELDDDPHG---LDGWNWQACTEMVMPMSSSHNASMFPTYDFNYSSYQEGCW 407
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
E++GV PRP W+ T +GG DIK L SNIIFSNG+ DP+S G ++ + + +
Sbjct: 408 EEFGVIPRPRWITTEFGGQDIKTALETFGSNIIFSNGLLDPWSGGSVLQ--NISETVVAL 465
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
G+H +DL + DPDWLV+QR+TEVK+++GWI Y + K
Sbjct: 466 VTEEGAHHIDLRPSTPEDPDWLVEQRETEVKLIKGWIDGYLKEKK 510
>gi|449521545|ref|XP_004167790.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 501
Score = 342 bits (876), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 181/459 (39%), Positives = 274/459 (59%), Gaps = 23/459 (5%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLS 108
++T F+ Q +DHF++ P+SYQ+F QRY+IN +WGG NSPI + G E I+ Q +
Sbjct: 49 YQTSFFTQILDHFNFNPQSYQSFQQRYLINDTYWGGAAHNSPIFVYTGNEGNIEWFAQNT 108
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRK-AALKNARHRGYFNSAQALADYASILLH 167
GF + A F+AL+V +EHR+YGKS+PFG + A N+ GY +S QALADYA+++
Sbjct: 109 GFLLQYAPHFRALVVFIEHRFYGKSIPFGGDEDVANSNSSMLGYLSSTQALADYATLITD 168
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+K +A +PV+ G SYGG LA WFRLKYPH+ +G+LASSAPIL +IT ++++
Sbjct: 169 LKKNLSAVDSPVLVFGGSYGGMLAAWFRLKYPHIALGALASSAPILQLENITSPYAFNNI 228
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT-- 285
V++DF+ S+ C+ IK+SW ID ++ P G +L K F+ +K+ L T
Sbjct: 229 VTQDFKSESQNCYSVIKESWHLIDITSTHPQGPQLLRKSFQILE--AEAESIKNWLSTAI 286
Query: 286 VYSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNN 338
+Y+ Y TPSN PVK++C AI++ P G+D K+ A ++ Y+
Sbjct: 287 IYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDD-PRSGNDSFTKLYG----AANIYYNFTG 341
Query: 339 S-RCYINEDRTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGV 396
+ C+ +D + W WQ+C+EM++P G + K S++ W+ + CK + V
Sbjct: 342 TVTCFDLDDDSDPHDLGDWSWQACTEMILPTGGNTKESIFPASTWHFADRFQFCKTFFDV 401
Query: 397 SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
PR W+ T++GGH+I+ +L+R SNIIF NG+RDP+S GG +K ++ G
Sbjct: 402 EPRRIWIPTHFGGHNIERVLKRFGSNIIFFNGLRDPWSGGGVLKNISSTIIAIVA--KEG 459
Query: 457 SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
+H +DL + DP WL RK EV I++ W++QYY D
Sbjct: 460 AHHVDLRFSNPDDPKWLKDVRKQEVNIIEDWLSQYYLDL 498
>gi|168047270|ref|XP_001776094.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672604|gb|EDQ59139.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 505
Score = 341 bits (875), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 197/499 (39%), Positives = 289/499 (57%), Gaps = 28/499 (5%)
Query: 13 LMVIFVSTSFHANGLKLRPR--LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
++++ +S F + RP +G + + + Q + + ++ Q IDHFS+ E+
Sbjct: 1 MLLLALSGLFTSAARPPRPLFPIGLLGGRKNVGQSSVTYAVD--YFTQVIDHFSFRREA- 57
Query: 71 QTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHR 128
+F QRY+I ++W G PI + G E ++ + +GF +E A F ALI+ EHR
Sbjct: 58 -SFQQRYLIEKRYWKGAADRGPIFMYCGNEGDVEWFAKNTGFLWEIAPSFGALILFPEHR 116
Query: 129 YYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGG 188
YYGKS+P+G+ +A+ K+A S QALAD+A++++ +K A +PV+ G SYGG
Sbjct: 117 YYGKSMPYGTMEASYKDADSLSTLTSEQALADFATLVIDLKKNLTAAASPVVLFGGSYGG 176
Query: 189 ELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWS 248
LA+W RLKYPH+ IG++A+SAPIL + DI P T++ +VS DF+ S CF I++SW
Sbjct: 177 MLASWMRLKYPHIAIGAVAASAPILQFEDIVPSDTFYKIVSADFKRESASCFNYIRESWG 236
Query: 249 EIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQ--YDTPSNI------- 299
ID IASK GL LS +F C L + EL++ L + YS A Y P+N
Sbjct: 237 VIDKIASKNGGLHDLSTQFHMCRDLNASWELENWLSSAYSYVAMVDYPIPTNFITPLPAY 296
Query: 300 PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCYINEDRTGDE-SDEGWE 357
PV+ IC I++ P G DIL +I AG S+ Y+ + C+ D D+ GW+
Sbjct: 297 PVREICRVIDSLPE-GSDILSRIFAGA----SVYYNYSGQAECFQPSDPGNDDLGVTGWD 351
Query: 358 W-QSCSEMVVP-MGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLI 415
W Q+C+EM++P NSM+QP W+L I+ C + YGV PRP+W+ T YGG DIK +
Sbjct: 352 WQQACTEMIMPMSSNSSNSMFQPYDWDLEGNIQYCMKTYGVRPRPNWITTNYGGKDIKAV 411
Query: 416 LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQ 475
L+ SNI+FSNG+ DP+S GG ++ ++ G+H LDL + K DPDWLV+
Sbjct: 412 LKDFGSNIVFSNGLLDPWSGGGVLENISSSIIALVAP--EGAHHLDLRASTKIDPDWLVE 469
Query: 476 QRKTEVKIMQGWITQYYDD 494
QR TEV + WI Y D
Sbjct: 470 QRATEVAYITKWIAAYNSD 488
>gi|302758120|ref|XP_002962483.1| hypothetical protein SELMODRAFT_78660 [Selaginella moellendorffii]
gi|300169344|gb|EFJ35946.1| hypothetical protein SELMODRAFT_78660 [Selaginella moellendorffii]
Length = 474
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 186/465 (40%), Positives = 281/465 (60%), Gaps = 35/465 (7%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTY 112
++ QT+DHF+ G + FPQRY++++ W GG S PI + G E I + +GF +
Sbjct: 24 YFQQTLDHFNVG--NITLFPQRYLLHNASWSGGASGAPIFVYCGNEGDIVWFAENTGFMF 81
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
+ A F AL+V EHR+YGKS PFG + + + ++ QALAD+A+++L +K
Sbjct: 82 DIAPLFGALLVFPEHRFYGKSQPFGGQNGP----KELAFCSAEQALADFATLILDLKRNL 137
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
+A +PV+ G SYGG LA WFRLKYPH+ IG+LASSAPIL + +I P+TT++ +VS F
Sbjct: 138 SAQASPVVVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENIVPYTTFYDIVSNAF 197
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQ 292
+ E+CF+ I+ SW+ I A + +GL LS+ F C+ KN EL + L++ YS A
Sbjct: 198 KREGEKCFEIIRNSWTAITEAAEQQNGLRNLSQDFHMCSDFKNADELINWLESAYSYLAM 257
Query: 293 YDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCY 342
+ P +N PV+++C A+ N+P IL +I AGV ++ Y+ ++C+
Sbjct: 258 ANYPYAANFTMPLPAHPVRKVCQAMVNSP-VASSILQRIYAGV----NVYYNFTGAAKCF 312
Query: 343 -INEDRTGDESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRP 400
+++D G GW WQSC+EMV+PM + N SMY P W+ + + C E YG PRP
Sbjct: 313 DLDDDPHG---LSGWNWQSCTEMVMPMSSNSNTSMYPPFEWDGEAWSRFCWENYGAIPRP 369
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRG--GWVKTYHFFDLSFSQDL----- 453
SWV T +GGHDIK +LR SNI+FSNG+ DP+S G ++ F S S +
Sbjct: 370 SWVTTEFGGHDIKSVLRNFGSNIVFSNGLLDPWSGGRQAMNDSFRFVLESISSTILAFVT 429
Query: 454 NLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAI 498
G+H LD ++K DP WL++QR++EV+ ++ W+++Y+ + I
Sbjct: 430 KEGAHHLDFRWSRKDDPQWLIEQRESEVREIKRWLSEYHQNVFVI 474
>gi|302758752|ref|XP_002962799.1| hypothetical protein SELMODRAFT_78251 [Selaginella moellendorffii]
gi|300169660|gb|EFJ36262.1| hypothetical protein SELMODRAFT_78251 [Selaginella moellendorffii]
Length = 470
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 185/452 (40%), Positives = 274/452 (60%), Gaps = 29/452 (6%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTY 112
++ QT+DHF+ G + FPQRY++++ W GG S PI + G E I + +GF +
Sbjct: 24 YFQQTLDHFNVG--NITLFPQRYLLHNASWSGGASGAPIFVYCGNEGDIVWFAENTGFMF 81
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
+ A F AL+V EHR+YGKS PFG + + + ++ QALAD+A+++L +K
Sbjct: 82 DIAPLFGALLVFPEHRFYGKSQPFGGQNGP----KELAFCSAEQALADFATLILDLKRNL 137
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
+A +PV+ G SYGG LA WFRLKYPH+ IG+LASSAPIL + +I P+TTY+ +VS F
Sbjct: 138 SAQASPVVVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENIVPYTTYYDIVSNAF 197
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQ 292
+ EECF+ I+ SW+ I A + +GL LS+ F C+ KN EL + L++ YS A
Sbjct: 198 KREGEECFEIIRNSWTAITEAAEQQNGLRNLSQDFHMCSDFKNADELINWLESAYSYLAM 257
Query: 293 YDTP---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCY 342
+ P PV+++C A+ N+P IL +I AGV ++ Y+ ++C+
Sbjct: 258 ANYPYAANFTMPLPAYPVRKVCQAMVNSP-VASSILQRIYAGV----NVYYNFTGAAKCF 312
Query: 343 -INEDRTGDESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRP 400
+++D G GW WQSC+EMV+PM + + SMY P W+ + + C E+YG PRP
Sbjct: 313 DLDDDPHG---LSGWNWQSCTEMVMPMSSNSSTSMYPPFEWDGEAWSRFCWEKYGAIPRP 369
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL 460
SWV T +GGHDIK LR SNI+FSNG+ DP+S G F S D + G+H L
Sbjct: 370 SWVTTEFGGHDIKSALRNFGSNIVFSNGLLDPWSGGRQAMNDSFRLRSSLIDFD-GAHHL 428
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
D ++K DP WL++QR+ EV+ ++ W+++Y+
Sbjct: 429 DFRWSRKDDPQWLIEQRELEVREIKRWLSEYH 460
>gi|413924933|gb|AFW64865.1| putative serine peptidase S28 family protein [Zea mays]
Length = 534
Score = 340 bits (871), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 184/471 (39%), Positives = 274/471 (58%), Gaps = 31/471 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ET +Y Q +DHF+ P SY TF QRY++N FWGG +PI + G E +D +GF
Sbjct: 63 YETRWYTQRLDHFNSAPASYATFQQRYLVNDTFWGGPTAPIFLYAGNEGDVDLFTNNTGF 122
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFG-SRKAALKNARHRGYFNSAQALADYASILLHIK 169
+E+A +F+AL+V +EHRYYG+S+PFG +R AA ++AR +GY QALADYAS +L +K
Sbjct: 123 MWESAPRFRALLVFVEHRYYGESMPFGGTRAAAFRDARTKGYLTVTQALADYASFVLSLK 182
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL-YYGDITPWTTYHSVV 228
+A APV+ G SYGG LA W RLKYPH+V+G++ASSAPIL +YG + P+ Y +
Sbjct: 183 ANLSAPAAPVVVFGGSYGGMLAAWMRLKYPHIVMGAVASSAPILSFYGIVDPYAFYDR-I 241
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC--TPLKNTTELKDGLDTV 286
+ DF+ S+ C+ ++KSW +D+ + +G A L + F C + +++ L + V
Sbjct: 242 TDDFKSESKHCYDVLRKSWDVLDDALATKEGQAQLRRTFTMCNGSSVQDIPSLLESA-VV 300
Query: 287 YSEAAQYDTPSNI-------PVKRICNAIE-----------NAPNCG--DDILCKIAAGV 326
Y+ Y TPS PV+ +C AI+ A N G + + V
Sbjct: 301 YAAMTDYPTPSGFLTPLPAYPVRAMCRAIDASRAESAEASSGAANDGNSNSTAQLTLSQV 360
Query: 327 VEADSLEYDGNN-SRCY-INEDRTGDESDEGWEWQSCSE-MVVPMGKDKNSMYQPEPWNL 383
+A + Y+ + C+ ED +GW WQ+C+E MV+ G + QP P+N
Sbjct: 361 RDAMDVYYNHTGAAACFRAEEDDDPHGMYDGWNWQACTEVMVMAYGIRDGGVLQPSPFNF 420
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
T + +C+ G+ PRP W+ T +GG DI +L++S SNI+F NG+RDP+S GG +K+
Sbjct: 421 TDVVDSCRNYTGLPPRPFWIETEFGGFDIGNVLKKSASNIVFFNGLRDPWSTGGVLKSIS 480
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDD 494
++ + GSH +DL + K DP+WL Q R E +I+ W+ QYY D
Sbjct: 481 DSIIALVEP--KGSHHVDLRFSSKEDPEWLKQVRVKETRIIARWLKQYYSD 529
>gi|125572151|gb|EAZ13666.1| hypothetical protein OsJ_03583 [Oryza sativa Japonica Group]
Length = 549
Score = 339 bits (870), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 198/509 (38%), Positives = 276/509 (54%), Gaps = 58/509 (11%)
Query: 30 RPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQT----FPQRYVINSKFWG 85
RPR G + E+ + +ET ++ Q +DHFS+ E + F QRY++
Sbjct: 31 RPRRGVVGA----EEAVRGYDYETRYFRQRLDHFSFLEEEGEEGDGFFQQRYLVGRGGGW 86
Query: 86 GGNS-PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVI-------------------- 124
G PI + G E I SG +E A +F AL+V
Sbjct: 87 AGAGGPIFFYCGNEGDIAWFAANSGLVWEAATRFAALVVFAEANLSSHLAPVSVFFLGCA 146
Query: 125 ------------LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
L+HRYYG+S+PFGS+ A N++ Y + QALADYA +L +K
Sbjct: 147 LIVRDPIPFTCSLQHRYYGESMPFGSKDKAYNNSKSLAYLTAEQALADYAVLLTDLKKNL 206
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
++ +PV+ G SYGG LA W RLKYPH+ +G+LASSAPIL + D+ P T ++ +VS DF
Sbjct: 207 SSEGSPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDVVPSTIFYDLVSNDF 266
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQ 292
+ S CFQTIK SW +D + DGL LSK F C +KNT EL D L + YS A
Sbjct: 267 KRESLSCFQTIKDSWKALDAQGNGQDGLLKLSKTFHLCKTIKNTGELSDWLSSAYSYLAM 326
Query: 293 YDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYI 343
D P P+K +C I+N P+ G IL +I AGV Y G +
Sbjct: 327 VDYPMPADFMMPLPGNPIKELCTKIDNQPD-GTSILERIYAGV--NVYYNYTGTVDCFDL 383
Query: 344 NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
N+D G +GW+WQ+C+EMV+PM ++SM+ + +N T Y K+C +GV PRP W+
Sbjct: 384 NDDPHG---MDGWDWQACTEMVMPMSYSEDSMFPADKFNYTSYEKDCINSFGVEPRPQWI 440
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
T +GGH+I L+L R SNIIF NG+ DP+S GG +K + + LG+H +DL
Sbjct: 441 TTEFGGHNISLVLERFGSNIIFFNGLLDPWSGGGVLKNIS--ESVVAIIAPLGAHHIDLR 498
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
A K DPDWLV+ R++E+ I+ GW++ YY
Sbjct: 499 PASKDDPDWLVRLRESELGIISGWLSDYY 527
>gi|20197495|gb|AAM15096.1| putative prolylcarboxypeptidase [Arabidopsis thaliana]
Length = 476
Score = 339 bits (870), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 183/501 (36%), Positives = 279/501 (55%), Gaps = 51/501 (10%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
L I ++ G L +R + + + FET ++ Q +DHFS+ P+SY
Sbjct: 9 FLFFSIVAEATYSPGGFH---HLSSLRLKKKVSKSKHELPFETRYFPQNLDHFSFTPDSY 65
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+ F Q+Y+IN++FW G PI + G E ID +GF + A +F+AL+V +EHR+Y
Sbjct: 66 KVFHQKYLINNRFWRKGG-PIFVYTGNEGDIDWFASNTGFMLDIAPKFRALLVFIEHRFY 124
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGEL 190
G+S PFG K + K+A GY NS QALADYA ++ +K ++ +PV+ G SYGG L
Sbjct: 125 GESTPFG--KKSHKSAETLGYLNSQQALADYAILIRSLKQNLSSEASPVVVFGGSYGGML 182
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI 250
A WFRLKYPH+ IG+LASSAPIL++ +I P T+++ +S+DF+D S CF+ IK+SW E+
Sbjct: 183 AAWFRLKYPHITIGALASSAPILHFDNIVPLTSFYDAISQDFKDASINCFKVIKRSWEEL 242
Query: 251 DNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYSEAAQYDTPSNI-------PV 301
+ +++ +GL LSKKF+TC L + +D L VY+ Y T +N PV
Sbjct: 243 EAVSTMKNGLQELSKKFRTCKGLHSQYSARDWLSGAFVYTAMVNYPTAANFMAPLPGYPV 302
Query: 302 KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS-RCYINEDRTGDESDEGWEWQS 360
+++C I+ P ++ AA SL Y+ + S +C+ E +T D +GW++Q+
Sbjct: 303 EQMCKIIDGFPRGSSNLDRAFAAA-----SLYYNYSGSEKCFEMEQQTDDHGLDGWQYQA 357
Query: 361 CSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGG-----HDIKLI 415
C+EMV+PM SM P + + + C +YGV PRP W+ T +GG H I +
Sbjct: 358 CTEMVMPMSCSNQSMLPPYENDSEAFQEQCMTRYGVKPRPHWITTEFGGMMDYFHQIFRV 417
Query: 416 LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQ 475
L+ +S+I+ + G+H DL A K DP+WL +
Sbjct: 418 LKNISSSIV-------------------------ALVTKKGAHHADLRAATKDDPEWLKE 452
Query: 476 QRKTEVKIMQGWITQYYDDFK 496
QR+ EV I++ WI++YY D +
Sbjct: 453 QRRQEVAIIEKWISEYYRDLR 473
>gi|125527844|gb|EAY75958.1| hypothetical protein OsI_03875 [Oryza sativa Indica Group]
Length = 549
Score = 338 bits (868), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 198/509 (38%), Positives = 276/509 (54%), Gaps = 58/509 (11%)
Query: 30 RPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQT----FPQRYVINSKFWG 85
RPR G + E+ + +ET ++ Q +DHFS+ E + F QRY++
Sbjct: 31 RPRRGVVGA----EEAVRGYDYETRYFRQRLDHFSFLEEEGEEGDGFFQQRYLVGRGGGW 86
Query: 86 GGNS-PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVI-------------------- 124
G PI + G E I SG +E A +F AL+V
Sbjct: 87 AGAGGPIFFYCGNEGDIAWFAANSGLVWEAATRFAALVVFAEANLSSHLAPVSVFFLGCA 146
Query: 125 ------------LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
L+HRYYG+S+PFGS+ A N++ Y + QALADYA +L +K
Sbjct: 147 LIVRDPIPFTCSLQHRYYGESMPFGSKDKAYNNSKSLAYLTAEQALADYAVLLTDLKKNL 206
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
++ +PV+ G SYGG LA W RLKYPH+ +G+LASSAPIL + D+ P T ++ +VS DF
Sbjct: 207 SSEGSPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDVVPSTIFYDLVSNDF 266
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQ 292
+ S CFQTIK SW +D + DGL LSK F C +KNT EL D L + YS A
Sbjct: 267 KRESLICFQTIKDSWKALDAQGNGQDGLLKLSKTFHLCKTIKNTGELSDWLSSAYSYLAM 326
Query: 293 YDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYI 343
D P P+K +C I+N P+ G IL +I AGV Y G +
Sbjct: 327 VDYPMPADFMMPLPGNPIKELCTKIDNQPD-GTSILERIYAGV--NVYYNYTGTVDCFDL 383
Query: 344 NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
N+D G +GW+WQ+C+EMV+PM ++SM+ + +N T Y K+C +GV PRP W+
Sbjct: 384 NDDPHG---MDGWDWQACTEMVMPMSYSEDSMFPADKFNYTSYEKDCINSFGVEPRPQWI 440
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
T +GGH+I L+L R SNIIF NG+ DP+S GG +K + + LG+H +DL
Sbjct: 441 TTEFGGHNISLVLERFGSNIIFFNGLLDPWSGGGVLKNIS--ESVVAIIAPLGAHHIDLR 498
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
A K DPDWLV+ R++E+ I+ GW++ YY
Sbjct: 499 PASKDDPDWLVRLRESELGIISGWLSDYY 527
>gi|115484263|ref|NP_001065793.1| Os11g0156200 [Oryza sativa Japonica Group]
gi|62701876|gb|AAX92949.1| At2g24280/F27D4.19 [Oryza sativa Japonica Group]
gi|77548743|gb|ABA91540.1| Serine carboxypeptidase S28 family protein, expressed [Oryza sativa
Japonica Group]
gi|113644497|dbj|BAF27638.1| Os11g0156200 [Oryza sativa Japonica Group]
Length = 511
Score = 338 bits (868), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 180/464 (38%), Positives = 275/464 (59%), Gaps = 33/464 (7%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ET ++ Q +DHF+ P S TF QRY++N FWGG +P+ + G E + +GF
Sbjct: 53 YETRYFTQRLDHFNELPASNGTFRQRYLVNGTFWGGAAAPVFVYAGNEGDVALFASNTGF 112
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFG-SRKAALKNARHRGYFNSAQALADYASILLHIK 169
+E A +F+A++V +EHRYYG+S+PFG +R AA +A GY +AQALAD+A ++L +K
Sbjct: 113 MWEAAPRFRAMLVFVEHRYYGESLPFGGTRAAAFADASAAGYLTTAQALADFAELILSLK 172
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A APV+ G SYGG LA W R+KYPH+V+G++ASSAPIL ++ ++++VVS
Sbjct: 173 SNLTACKAPVVIFGGSYGGMLAAWMRMKYPHIVMGAVASSAPILGLNGLSDPYSFYNVVS 232
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL---DTV 286
DF+ S+ C+ ++ SWSE+ + G A L++ F C K + GL +
Sbjct: 233 NDFKSESKHCYDVLRNSWSEMYKALATDAGRARLNQTFNMC---KGNVDDIPGLVEKALI 289
Query: 287 YSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS 339
Y Y TPSN PV+ IC AI+ P G+D + +I +A ++ Y+
Sbjct: 290 YGSMMDYPTPSNFLTSLPAYPVREICRAIDK-PTSGNDTVSRIK----DAMTIYYNSTGG 344
Query: 340 -RCYINEDRTGDESDE------GWEWQSCSEMVVPM--GKDKNSMYQPEPWNLTKYIKNC 390
C+ G E D+ GW WQ+C+E+++ M G +++ P+P+NLT Y+ C
Sbjct: 345 LACFPG---AGAEDDDPYGMFPGWTWQACTEVIMTMSYGIGNATVFPPDPFNLTAYLAGC 401
Query: 391 KEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
GV PRP W+ +Y+GG+DI+ +L+RS SNIIF NG+RDP+S GG +K+ ++
Sbjct: 402 LATTGVPPRPHWIQSYFGGYDIRNVLKRSGSNIIFFNGLRDPWSAGGILKSISNSIIALV 461
Query: 451 QDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDD 494
+ G H +DL + K DP+WL + R+ E++I+ W+ QYY D
Sbjct: 462 EP--KGGHHVDLRFSTKEDPEWLKKVRRQEMRIIADWLKQYYSD 503
>gi|125533461|gb|EAY80009.1| hypothetical protein OsI_35177 [Oryza sativa Indica Group]
Length = 511
Score = 338 bits (866), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 180/464 (38%), Positives = 275/464 (59%), Gaps = 33/464 (7%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ET ++ Q +DHF+ P S TF QRY++N FWGG +P+ + G E + +GF
Sbjct: 53 YETRYFTQRLDHFNELPASNGTFRQRYLVNGTFWGGAAAPVFVYAGNEGDVALFASNTGF 112
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFG-SRKAALKNARHRGYFNSAQALADYASILLHIK 169
+E A +F+A++V +EHRYYG+S+PFG +R AA +A GY AQALAD+A ++L +K
Sbjct: 113 MWEAAPRFRAMLVFVEHRYYGESLPFGGTRAAAFADASAAGYLTPAQALADFAELILSLK 172
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A APV+ G SYGG LA W R+KYPH+V+G++ASSAPIL ++ ++++VVS
Sbjct: 173 SNLTACKAPVVIFGGSYGGMLAAWMRMKYPHIVMGAVASSAPILGLNGLSDPYSFYNVVS 232
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL---DTV 286
DF+ S+ C+ ++ SWSE+ + G A L++ F C K + GL +
Sbjct: 233 NDFKSESKHCYDVLRNSWSEMYKALATDAGRARLNQTFNMC---KGNVDDIPGLVEKALI 289
Query: 287 YSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS 339
Y Y TPSN PV+ IC AI+ P G+D + +I +A ++ Y+
Sbjct: 290 YGSMMDYPTPSNFLTSLPAYPVREICRAIDK-PTSGNDTVSRIK----DAMTIYYNSTGG 344
Query: 340 -RCYINEDRTGDESDE------GWEWQSCSEMVVPM--GKDKNSMYQPEPWNLTKYIKNC 390
C+ G E D+ GW WQ+C+E+++ M G +++ P+P+NLT Y+ +C
Sbjct: 345 LACFPG---AGAEDDDPYGMFPGWTWQACTEVIMTMSYGIGNATVFPPDPFNLTAYLADC 401
Query: 391 KEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
GV PRP W+ +Y+GG+DI+ +L+RS SNIIF NG+RDP+S GG +K+ ++
Sbjct: 402 LATTGVPPRPHWIQSYFGGYDIRNVLKRSGSNIIFFNGLRDPWSAGGILKSISNSIIALV 461
Query: 451 QDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDD 494
+ G H +DL + K DP+WL + R+ E++I+ W+ QYY D
Sbjct: 462 EP--KGGHHVDLRFSTKEDPEWLKKVRRQEMRIIADWLKQYYSD 503
>gi|10177334|dbj|BAB10683.1| lysosomal Pro-X carboxypeptidase [Arabidopsis thaliana]
Length = 529
Score = 337 bits (864), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 204/518 (39%), Positives = 283/518 (54%), Gaps = 47/518 (9%)
Query: 11 LLLMVIFVSTSFHANGLKLR-----PRLGRI----RRSRILE-QKDSN-HGFETFFYNQT 59
LLL+ F + F +NG L PR R R +RI + + D N + +ET F++Q
Sbjct: 7 LLLIFTFFTLVFPSNGSSLSSSKLLPRFPRYTFQNREARIQQFRGDRNEYRYETKFFSQQ 66
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHFS+ F QRY+INS W G ++ PI + G E I+ SGF ++ A +
Sbjct: 67 LDHFSFA--DLPKFSQRYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSGFIWDIAPK 124
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
F AL+V EHRYYG+S+P+GSR+ A KNA Y + QALAD+A + +K +A
Sbjct: 125 FGALLVFPEHRYYGESMPYGSREEAYKNATTLSYLTTEQALADFAVFVTDLKRNLSAEAC 184
Query: 178 PVIAIGASYGGE--------------LATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
PV+ G SYGG LA W RLKYPH+ IG+LASSAPIL + D+ P T
Sbjct: 185 PVVLFGGSYGGSNNCVFVFVVIDATVLAAWMRLKYPHIAIGALASSAPILQFEDVVPPET 244
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL 283
++ + S DF+ S CF TIK SW I K +GL L+K F C L +T +L D L
Sbjct: 245 FYDIASNDFKRESSSCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWL 304
Query: 284 DTVYSEAAQYDTP---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEY 334
D+ YS A D P P++ +C I+ A IL +I AG+ + Y
Sbjct: 305 DSAYSYLAMVDYPYPADFMMPLPGHPIREVCRKIDGA-GSNASILDRIYAGI--SVYYNY 361
Query: 335 DGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQ 393
GN +++D G +GW WQ+C+EMV+PM ++ NSM+ +N + Y + C
Sbjct: 362 TGNVDCFKLDDDPHG---LDGWNWQACTEMVMPMSSNQENSMFPGYGFNYSSYKEECWNT 418
Query: 394 YGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL 453
+ V+PRP WV T +GGHDI L+ SNIIFSNG+ DP+S G +K D +
Sbjct: 419 FRVNPRPKWVTTEFGGHDIATTLKSFGSNIIFSNGLLDPWSGGSVLKNLS--DTIVALVT 476
Query: 454 NLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
G+H LDL + DP WLV QR+ E++++QGWI Y
Sbjct: 477 KEGAHHLDLRPSTPEDPKWLVDQREAEIRLIQGWIETY 514
>gi|449443602|ref|XP_004139566.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 325
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 165/316 (52%), Positives = 224/316 (70%), Gaps = 11/316 (3%)
Query: 127 HRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASY 186
HR+YGKS+PFGS + A+KN RGYFNSAQALADYA +LLHIK + +P+I +GASY
Sbjct: 16 HRFYGKSIPFGSLEKAMKNGSIRGYFNSAQALADYAELLLHIKKMFAYDTSPIIVMGASY 75
Query: 187 GGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKS 246
GG LA+WFRLKYPH+ +G+LASSAPILY+ +ITP Y+S+VSK F++TS+ C TI++S
Sbjct: 76 GGMLASWFRLKYPHIALGALASSAPILYFDNITPQDGYYSIVSKSFKETSKTCHDTIRRS 135
Query: 247 WSEIDNIASKP-DGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRIC 305
W EID IA K GL+ILSK+FKTC LK ++E+K+ +D+V++ AAQY+ P PV+ IC
Sbjct: 136 WGEIDRIAGKTRGGLSILSKQFKTCGKLKTSSEIKNLMDSVFTMAAQYNDPYENPVRGIC 195
Query: 306 NAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-INEDRTGDESDEGWEWQSCSEM 364
AI+ +++ ++ AGV+ Y G CY + E ++ + WQ CSEM
Sbjct: 196 VAIDEEAKKKSNVIKQVVAGVI-----AYLGERP-CYDVYEFGYPNDPLNQYGWQVCSEM 249
Query: 365 VVPM---GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTS 421
V+P+ G+DKNSM+ P P+ + CK+ YGV+PRP W+ T+YGG DIKL+L R S
Sbjct: 250 VMPIGSSGRDKNSMFPPSPFQFNDFKTMCKDLYGVTPRPHWITTFYGGQDIKLVLHRFGS 309
Query: 422 NIIFSNGMRDPFSRGG 437
NIIFSNG++DP+S GG
Sbjct: 310 NIIFSNGLKDPYSSGG 325
>gi|413954869|gb|AFW87518.1| putative serine peptidase S28 family protein [Zea mays]
Length = 1052
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 188/492 (38%), Positives = 269/492 (54%), Gaps = 62/492 (12%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW-----GGGNSPILAFLGAEAPIDDNI 105
F ++ Q +DHFS+ P + F Q+Y++N FW GG P+L ++G EA I+
Sbjct: 79 FTVHYFQQELDHFSFTPNASTVFYQKYLVNDTFWRRPGGGGTAGPLLVYVGGEADIECIA 138
Query: 106 QLSGFTYENAHQFKALIVILEH---------------------------RYYGKSVPFGS 138
GF ++ A F AL+V +EH R+YG+S+PFG+
Sbjct: 139 HNVGFMFDIAPTFGALLVFVEHIEYIFGDLNIGPQKDMARVVWWSKRKHRFYGESLPFGN 198
Query: 139 RKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKY 198
+A+ GY S QALAD A ++ +K +A +PV+ G SYGG LA+WFRLKY
Sbjct: 199 -----NSAQALGYLTSTQALADLAILITDLKRNLSAETSPVVIFGGSYGGMLASWFRLKY 253
Query: 199 PHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPD 258
PHV IG+LASSAPIL + ITPW++++ VVS+D++ S CF IK +W ++ S +
Sbjct: 254 PHVTIGALASSAPILQFDYITPWSSFYDVVSQDYKSESLNCFSVIKAAWDVLEERGSNGN 313
Query: 259 GLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI 318
GL LSK F+ C +K ++ L T + A D P+ + +EN P
Sbjct: 314 GLVELSKLFRACKTVKYADSIRRWLRTAFVSIAMMDYPTP------ASFLENLPAYPVKE 367
Query: 319 LCKIAAGV-VEADSLEYDGNNSRCYINEDRTGDES-----DEG---------WEWQSCSE 363
+CKI G +AD LE + Y N TGD++ DEG W WQ+C+E
Sbjct: 368 MCKIVDGFPADADILEKVFAAASLYYN--YTGDQTCNQIEDEGNPRCLNLNYWGWQACTE 425
Query: 364 MVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNI 423
+++PM +SM+ P ++ C + +GV PRP W+ T YGG+ I +L+R SNI
Sbjct: 426 LMMPMSSANDSMFPPHTFSYEDKANYCLQTFGVRPRPHWITTEYGGYRIDEVLKRFGSNI 485
Query: 424 IFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKI 483
IFSNGMRDP+SRGG +K ++ + G+H LDL A K DPDW+ +QR+ EV+I
Sbjct: 486 IFSNGMRDPWSRGGVLKNISSSIVALVTE--KGAHHLDLRSATKDDPDWVTEQRRQEVEI 543
Query: 484 MQGWITQYYDDF 495
+ GWI QYY D
Sbjct: 544 IHGWIDQYYRDM 555
>gi|326489169|dbj|BAK01568.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 514
Score = 335 bits (859), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 187/456 (41%), Positives = 267/456 (58%), Gaps = 26/456 (5%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP-ILAFLGAEAPIDDNIQLSG 109
FET ++ Q +DHFS+ E + F QRY++ G I + G E I SG
Sbjct: 52 FETRYFRQRLDHFSFSGEE-EFFQQRYLVGRAGGWAGPGGPIFFYCGNEGDIAWFAANSG 110
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
+E A +F AL+V EHRYYG+S+PFGS++ A ++R Y + QALADYA +L +K
Sbjct: 111 LVWEAAPRFAALVVFAEHRYYGESMPFGSKEKAYNSSRSMAYLTAEQALADYAVLLTDLK 170
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
++ +PV+ G SYGG LA W RLKYPH+ +G+LASSAPIL + DI P T ++ +VS
Sbjct: 171 RNLSSESSPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPDTIFYDLVS 230
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
DF+ S CFQTIK SW E+D + DGL LSK F C L T L D L + YS
Sbjct: 231 NDFKRESLSCFQTIKDSWKELDEQGNGQDGLLKLSKTFHLCQTLNTTGALSDWLSSAYSY 290
Query: 290 AAQYD----------TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS 339
A D P+N P+K +C I+ P IL +I AGV ++ Y+ +
Sbjct: 291 LAMVDYPMPSEFLMPLPAN-PIKEVCRNIDKQPE-RSSILERIYAGV----NIYYNYTGT 344
Query: 340 -RCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGV 396
C+ +++D G GW+WQ+C+EMV+PM + SM+ P+ ++ Y +C + +GV
Sbjct: 345 VHCFDLDDDPHG---MGGWDWQACTEMVMPMSSSEGLSMFPPDEFDYALYADDCVKNFGV 401
Query: 397 SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
PRP W+ T +GGH+I +L + +SNIIF NG+ DP+S GG +K D + LG
Sbjct: 402 RPRPRWISTEFGGHNISSVLEKFSSNIIFFNGLLDPWSGGGVLKNIS--DSVVAIVAPLG 459
Query: 457 SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+H +DL A K DPDWLV R++E++I+ GW++ +Y
Sbjct: 460 AHHIDLRPATKEDPDWLVSLRESELEIISGWLSDHY 495
>gi|413952315|gb|AFW84964.1| putative serine peptidase S28 family protein [Zea mays]
Length = 517
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 190/464 (40%), Positives = 267/464 (57%), Gaps = 28/464 (6%)
Query: 48 NHGFETFFYNQTIDHFSYG-------PESYQTFPQRYVINSKFWGGGNS-PILAFLGAEA 99
++ +ET ++ Q +DHFS+ E+ F QRY++ G PI + G E
Sbjct: 45 DYEYETRYFRQRLDHFSFPGVGDEDEDEAAAFFQQRYLVGRGGGWAGPGGPIFFYCGNEG 104
Query: 100 PIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALA 159
I SG +E A +F AL+V EHRYYG+S+PFGS+ A +++ Y + QALA
Sbjct: 105 DIAWFAANSGLVWEAAPRFAALVVFAEHRYYGESMPFGSKAKAYSDSKSLAYLTAEQALA 164
Query: 160 DYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDIT 219
D+A +L +K +A +PV+ G SYGG LA W RLKYPH+ IG+LASSAPIL + DI
Sbjct: 165 DFAVLLTDLKRNLSAEGSPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 224
Query: 220 PWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTEL 279
P T ++ +VS DFR S CF TIK SW E+D+ A++ DGL LSK F C LK + +L
Sbjct: 225 PSTIFYDLVSDDFRRESLSCFLTIKDSWKELDDQANEQDGLLKLSKTFHLCQTLKTSGDL 284
Query: 280 KDGLDTVYSEAAQYD----------TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEA 329
D L + YS A D P+N P+K +C I++ P G L +I AGV
Sbjct: 285 SDWLSSAYSYLAMVDYPLPSEFLMPLPAN-PIKEVCRNIDSQPK-GTGTLERIYAGV--N 340
Query: 330 DSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMG-KDKNSMYQPEPWNLTKYIK 388
Y G +N+D G GW+WQ+C+EMV+PM ++ SMY P ++ Y +
Sbjct: 341 VYYNYTGTVDCFDLNDDPHG---MGGWDWQACTEMVMPMSYSEQRSMYPPYKFDYASYAE 397
Query: 389 NCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+C + YGV PRP W+ T +GGH+I +L SNIIF NG+ DP+S GG +K + +
Sbjct: 398 DCIKSYGVRPRPKWITTEFGGHNITKVLENFGSNIIFFNGLLDPWSGGGVLK--NISESV 455
Query: 449 FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+ LG+H +DL A DPDWLV R++E++I+ GW+ YY
Sbjct: 456 VAIVAPLGAHHIDLRPATPDDPDWLVALRESELEIISGWLWDYY 499
>gi|242058825|ref|XP_002458558.1| hypothetical protein SORBIDRAFT_03g035680 [Sorghum bicolor]
gi|241930533|gb|EES03678.1| hypothetical protein SORBIDRAFT_03g035680 [Sorghum bicolor]
Length = 518
Score = 333 bits (854), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 194/486 (39%), Positives = 273/486 (56%), Gaps = 27/486 (5%)
Query: 31 PRLGRIR-RSRILEQKDSNHGFETFFYNQTIDHFSYG-----PESYQTFPQRYVINSKFW 84
PR R + RSR ++ +ET ++ Q +DHFS+ E+ F QRY++
Sbjct: 30 PRFPRHQHRSRPGVNGVGDYEYETRYFRQRLDHFSFPGVADEDEAAAFFQQRYLVGRGGG 89
Query: 85 GGGNS-PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAAL 143
G PI + G E I SG +E A +F AL+V EHRYYG+S+PFGS+ A
Sbjct: 90 WAGPGGPIFFYCGNEGDIAWFASNSGLVWEAAPRFAALVVFAEHRYYGESMPFGSKAKAY 149
Query: 144 KNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVI 203
+++ Y + QALAD+A +L +K +A +PV+ G SYGG LA W RLKYPH+ +
Sbjct: 150 NDSKSLAYLTAEQALADFAVLLTDLKRNLSAEGSPVVLFGGSYGGMLAAWMRLKYPHISV 209
Query: 204 GSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAIL 263
G+L+SSAPIL + DI P T ++ +VS DFR S CF TIK SW E+D+ ++ GL L
Sbjct: 210 GALSSSAPILQFEDIVPSTIFYDLVSDDFRRESLSCFLTIKDSWKELDDQGNEQGGLLKL 269
Query: 264 SKKFKTCTPLKNTTELKDGLDTVYSEAAQYD----------TPSNIPVKRICNAIENAPN 313
SK F C LK + +L D L + YS A D P+N P+K +C I++ P
Sbjct: 270 SKTFHLCQTLKTSGDLSDWLSSAYSYLAMVDYPLPSEFLMPLPAN-PIKEVCRNIDSQPE 328
Query: 314 CGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMG-KDK 372
G L +I AGV Y G +N+D G GW+WQ+C+EMV+PM +
Sbjct: 329 -GTSTLERIYAGV--NVYYNYTGTVGCFDLNDDPHG---MGGWDWQACTEMVMPMSYSEG 382
Query: 373 NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
SMY P ++ Y C + YGV PRP W+ T +GGH+I +L + SNIIF NG+ DP
Sbjct: 383 RSMYPPYKFDYPSYADGCIKSYGVRPRPRWITTEFGGHNITTVLEKFGSNIIFFNGLLDP 442
Query: 433 FSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+S GG +K + + + LG+H +DL A DPDWLV R++E+KI+ GW++ YY
Sbjct: 443 WSGGGVLK--NISESVIAIVAPLGAHHIDLRPATPDDPDWLVALRESELKIISGWLSDYY 500
Query: 493 DDFKAI 498
A+
Sbjct: 501 GAGGAV 506
>gi|147788546|emb|CAN61013.1| hypothetical protein VITISV_036738 [Vitis vinifera]
Length = 554
Score = 332 bits (851), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 186/480 (38%), Positives = 280/480 (58%), Gaps = 37/480 (7%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRL-GRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPES 69
L + +F +S + K+ PR I R L +E ++ Q +DHF+Y P+S
Sbjct: 10 LPCIFLFSISSSGFSAAKITPRFPSSIVRPEQLSVSSQTELYEAKYFTQILDHFNYQPQS 69
Query: 70 YQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEH 127
Y+TF QRY+IN K+WGG + +PI + G E I+ Q +GF ++ A F+AL+V +EH
Sbjct: 70 YRTFQQRYLINDKYWGGADKLAPIFVYTGNEGDIEWFAQNTGFMFDTAPHFQALLVFIEH 129
Query: 128 RYYGKSVPFGS-RKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASY 186
R+YGKS+PFG A NA GY +S QALADYA++++ +K +AT++PV+ G SY
Sbjct: 130 RFYGKSIPFGGDTNVAYSNASTLGYLSSTQALADYATLIIDLKKNLSATNSPVVVFGGSY 189
Query: 187 GGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKS 246
GG LA WFRLKYPHV IG+LASSAPIL + +IT T+++++++DF + S
Sbjct: 190 GGMLAAWFRLKYPHVAIGALASSAPILNFENITSPYTFNNIITQDF-----------QGS 238
Query: 247 WSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYSEAAQYDTPSNI----- 299
W +I+ A K GL +L K F+ C + +++ L T VY+ Y TPSN
Sbjct: 239 WEQIEETAMKNGGLEVLRKSFRICKNYISGGAIENWLSTAYVYTAMTDYPTPSNFLSPMP 298
Query: 300 --PVKRICNAIENAPNCGDDILCKI--AAGVVEADSLEYDGNNSRCYINEDRTGDESDEG 355
PVK++C AI++ P G+D K+ AA V Y G + C+ D +
Sbjct: 299 AYPVKQMCKAIDD-PAXGNDTFAKLYGAANVY----YNYTG-TAACFDLADDSDPHGLGE 352
Query: 356 WEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKL 414
W WQ+C+EM++P+ ++ S++ WN + CK + + PRP+W+ T +GGHDIK
Sbjct: 353 WTWQACTEMIMPVNANNEESIFPVSTWNYSNRAAFCKFAFDIEPRPNWITTEFGGHDIKR 412
Query: 415 ILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF-SQDLNLGSHCLDLDEAKKSDPDWL 473
+L+R SNIIF NG+RDP+S GG +++ ++ ++D G+H +DL A DP+WL
Sbjct: 413 VLKRFGSNIIFFNGLRDPWSGGGVLESISKSIVAIVAKD---GAHHVDLRFATSEDPEWL 469
>gi|359497044|ref|XP_002263389.2| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
gi|296085719|emb|CBI29519.3| unnamed protein product [Vitis vinifera]
Length = 510
Score = 330 bits (847), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 179/454 (39%), Positives = 266/454 (58%), Gaps = 22/454 (4%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLS 108
+ET ++ Q +DHFS F QRY+I+++ W G + PI + G E I+ +
Sbjct: 58 YETRYFEQRLDHFSIA--DLPKFRQRYLISTRHWTGPDRMGPIFLYCGNEGDIEWFAANT 115
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
GF ++ A +F A+++ EHRYYG+S+P+GSR A NA Y + QALAD+A ++ ++
Sbjct: 116 GFVWDMAPRFGAMVLFPEHRYYGESMPYGSRDKAYANAASLSYLTAEQALADFAVLVTNL 175
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
K +A PV+ G SYGG LA W RLKYPH+ IG+LASSAPIL + DI P T++ +V
Sbjct: 176 KRNLSAEGCPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYDIV 235
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYS 288
S +F+ S CF TIKKSW + + K DGL L+K F+ C LK T +L D LD+ YS
Sbjct: 236 SNNFKRESISCFDTIKKSWDVLISEGQKNDGLKQLTKAFRLCRDLKRTEDLYDWLDSAYS 295
Query: 289 EAAQ--YDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS 339
A Y PS+ P+K +C +++ P G +L +I GV + Y G
Sbjct: 296 FLAMVNYPYPSDFLMPLPGHPIKEVCRKMDSCPE-GTSVLERIFEGV--SVYYNYTGKVE 352
Query: 340 RCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSP 398
+++D G +GW WQ+C+EMV+PM + +SM+ +N + + + C + + V P
Sbjct: 353 CFQLDDDPHG---MDGWNWQACTEMVMPMASSRESSMFPTYDYNYSSFQEECWKDFSVKP 409
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
RP+W+ T +GGH+ K L+ SNIIFSNG+ DP+S G ++ + + G+H
Sbjct: 410 RPTWITTEFGGHEFKTTLKVFGSNIIFSNGLLDPWSGGSVLQNIS--ETVVALVTEEGAH 467
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+DL + DPDWLV+QR EVK+++GWI Y+
Sbjct: 468 HIDLRSSTAEDPDWLVEQRAFEVKLIKGWIEDYH 501
>gi|297794189|ref|XP_002864979.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297310814|gb|EFH41238.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 514
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 198/503 (39%), Positives = 277/503 (55%), Gaps = 32/503 (6%)
Query: 11 LLLMVIFVSTSFHANGLKLR--------PRLGRIRRSRILE-QKDSN-HGFETFFYNQTI 60
LLL+ F + F +NG L PR R RI + + D N + +ET F++Q +
Sbjct: 7 LLLIFTFFTLLFPSNGSSLSSSKLLPRFPRYTSRNRGRIQQFRGDRNEYRYETKFFSQQL 66
Query: 61 DHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQF 118
DHFS+ FPQRY+INS +W G ++ PI + G E I+ SGF ++ A +F
Sbjct: 67 DHFSFA--DLPKFPQRYLINSDYWLGASALGPIFLYCGNEGDIEWFATNSGFIWDIAPKF 124
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
AL+V E R +P+GS + A KNA Y + QALAD+A + +K +A P
Sbjct: 125 GALLVFPEVRSCLFCMPYGSMEEAYKNATTLSYLTTEQALADFAVFVTDLKRNLSAEACP 184
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
V+ G SYGG LA W RLKYPH+ IG+LASSAPIL + DI P T++ + S DF+ S
Sbjct: 185 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYDIASNDFKRESSS 244
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTP-- 296
CF TIK SW I K +GL L+K F C L +T +L D LD+ YS A D P
Sbjct: 245 CFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPYP 304
Query: 297 -------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTG 349
P++ +C I+ A + IL +I AG+ + Y GN +++D G
Sbjct: 305 ADFMMPLPGHPIREVCRKIDGA-HSDASILDRIFAGI--SVYYNYTGNVDCFKLDDDPHG 361
Query: 350 DESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYG 408
+GW WQ+C+EMV+PM ++ SM+ +N + Y + C + V+PRP WV T +G
Sbjct: 362 ---LDGWNWQACTEMVMPMSSNQEKSMFPAYDFNYSSYKEECWNTFRVNPRPKWVTTEFG 418
Query: 409 GHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKS 468
GHDI+ L+ SNIIFSNGM DP+S G +K + + G+H LDL +
Sbjct: 419 GHDIETTLKLFGSNIIFSNGMLDPWSGGSVLKNLS--NTIVALVTKEGAHHLDLRPSTPE 476
Query: 469 DPDWLVQQRKTEVKIMQGWITQY 491
DP WLV QR+ E++++QGWI Y
Sbjct: 477 DPKWLVDQREAEIQLIQGWIETY 499
>gi|357136615|ref|XP_003569899.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Brachypodium
distachyon]
Length = 512
Score = 328 bits (842), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 186/456 (40%), Positives = 262/456 (57%), Gaps = 26/456 (5%)
Query: 51 FETFFYNQTIDHFSY--GPESYQTFPQRYVINSKFWGGGNSP-ILAFLGAEAPIDDNIQL 107
F+T ++ Q +DHFS+ G ES+Q QRY++ G I + G E I
Sbjct: 49 FDTRYFRQRLDHFSFSGGEESFQ---QRYLVGRAGGWAGPGGPIFFYCGNEGDIAWFAAN 105
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
SG +E A +F AL+V EHRYYG+S+PFGS++ A N+R Y + QA+ADYA +L
Sbjct: 106 SGLVWEAAPRFAALVVFAEHRYYGESMPFGSKEKAYNNSRSLAYLTAEQAIADYAVLLTD 165
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+K ++ +PV+ G SYGG LA W RLKYPH+ +G+LASSAPIL + DI P T ++ +
Sbjct: 166 LKRNLSSESSPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPQTIFYDI 225
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
VS DF+ S CFQTIK SW E+D + DGL LSK F C LK T L D L++ Y
Sbjct: 226 VSNDFKRESLSCFQTIKDSWKELDEQGNGQDGLLKLSKTFHLCQTLKTTGGLSDWLNSAY 285
Query: 288 SEAAQYD----------TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGN 337
A D PSN P+K +C I+ + G L +I AGV Y G
Sbjct: 286 GFLAMVDYPMSSDFLMPLPSN-PIKEVCRNIDKQLD-GTSTLERIYAGV--NIYYNYTGT 341
Query: 338 NSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGV 396
+++D G GW+WQ+C+EMV+PM ++ SM+ P ++ Y +C + +GV
Sbjct: 342 VDCFDLDDDPHG---MGGWQWQACTEMVMPMSSSESLSMFPPYEFDYASYADDCVKSFGV 398
Query: 397 SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
P P W+ T +GGH+I +L + SNIIF NG+ DP+S GG +K ++ LG
Sbjct: 399 RPSPRWITTEFGGHNISAVLEKFGSNIIFFNGLLDPWSGGGVLKNISGSVVAIVAP--LG 456
Query: 457 SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+H +DL A K DPDWLV R++E+ I+ GW++ +Y
Sbjct: 457 AHHIDLRPATKEDPDWLVSLRESELGIISGWLSDHY 492
>gi|413952316|gb|AFW84965.1| putative serine peptidase S28 family protein [Zea mays]
Length = 515
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 188/464 (40%), Positives = 265/464 (57%), Gaps = 30/464 (6%)
Query: 48 NHGFETFFYNQTIDHFSYG-------PESYQTFPQRYVINSKFWGGGNS-PILAFLGAEA 99
++ +ET ++ Q +DHFS+ E+ F QRY++ G PI + G E
Sbjct: 45 DYEYETRYFRQRLDHFSFPGVGDEDEDEAAAFFQQRYLVGRGGGWAGPGGPIFFYCGNEG 104
Query: 100 PIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALA 159
I SG +E A +F AL+V EHRYYG+S+PFGS+ A +++ Y + QALA
Sbjct: 105 DIAWFAANSGLVWEAAPRFAALVVFAEHRYYGESMPFGSKAKAYSDSKSLAYLTAEQALA 164
Query: 160 DYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDIT 219
D+A +L +K +A +PV+ G SYGG W RLKYPH+ IG+LASSAPIL + DI
Sbjct: 165 DFAVLLTDLKRNLSAEGSPVVLFGGSYGG--TAWMRLKYPHIAIGALASSAPILQFEDIV 222
Query: 220 PWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTEL 279
P T ++ +VS DFR S CF TIK SW E+D+ A++ DGL LSK F C LK + +L
Sbjct: 223 PSTIFYDLVSDDFRRESLSCFLTIKDSWKELDDQANEQDGLLKLSKTFHLCQTLKTSGDL 282
Query: 280 KDGLDTVYSEAAQYD----------TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEA 329
D L + YS A D P+N P+K +C I++ P G L +I AGV
Sbjct: 283 SDWLSSAYSYLAMVDYPLPSEFLMPLPAN-PIKEVCRNIDSQPK-GTGTLERIYAGV--N 338
Query: 330 DSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMG-KDKNSMYQPEPWNLTKYIK 388
Y G +N+D G GW+WQ+C+EMV+PM ++ SMY P ++ Y +
Sbjct: 339 VYYNYTGTVDCFDLNDDPHG---MGGWDWQACTEMVMPMSYSEQRSMYPPYKFDYASYAE 395
Query: 389 NCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+C + YGV PRP W+ T +GGH+I +L SNIIF NG+ DP+S GG +K + +
Sbjct: 396 DCIKSYGVRPRPKWITTEFGGHNITKVLENFGSNIIFFNGLLDPWSGGGVLK--NISESV 453
Query: 449 FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+ LG+H +DL A DPDWLV R++E++I+ GW+ YY
Sbjct: 454 VAIVAPLGAHHIDLRPATPDDPDWLVALRESELEIISGWLWDYY 497
>gi|242070053|ref|XP_002450303.1| hypothetical protein SORBIDRAFT_05g003460 [Sorghum bicolor]
gi|241936146|gb|EES09291.1| hypothetical protein SORBIDRAFT_05g003460 [Sorghum bicolor]
Length = 553
Score = 325 bits (834), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 185/486 (38%), Positives = 273/486 (56%), Gaps = 53/486 (10%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLS 108
+ET Y Q +DHF+ P SY TF QRY+IN FWGG + +PI + G E ID +
Sbjct: 74 YETRRYTQRLDHFNSLPSSYATFQQRYLINDTFWGGRSRTAPIFLYAGNEGDIDLFTNNT 133
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFG-SRKAALKNARHRGYFNSAQALADYASILLH 167
GF +E A +F+A++V +EHRYYG+S+PFG +R+AA ++A +GY QALADYAS +L
Sbjct: 134 GFMWEAAPRFRAMLVFVEHRYYGESMPFGGTREAAFRDAATKGYLTVTQALADYASFVLS 193
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL-YYGDITPWTTYHS 226
+K + APV+ G SYGG LA W RLKYPHVV+G++ASSAPIL +YG + P+ Y
Sbjct: 194 LKANLSVPAAPVVVFGGSYGGMLAAWMRLKYPHVVMGAVASSAPILSFYGIVDPYAFYDR 253
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
++ D++ S+ C+ ++KSW +D+ + +G A L + F C + ++ L+
Sbjct: 254 -INDDYKSESKNCYDVLRKSWDVLDDALATKEGQAQLRRTFNMCNG-SSVWDIPSLLENA 311
Query: 287 YSEAAQYDTPSN---------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGN 337
EAA D P+ PV+ +C AI D + AA + S DGN
Sbjct: 312 MVEAAMTDYPTTSGFLTPLPAYPVRAMCRAI-------DARHQQAAASTSTSGSGNDDGN 364
Query: 338 NS-----------------RCYINED------RTGDESD-----EGWEWQSCSE-MVVPM 368
NS Y N R ++ D +GW WQ+C+E MV+
Sbjct: 365 NSSSSTALLLLSEQVRDAMNVYYNHTGGAACFRAEEDDDPYGLYDGWNWQACTEVMVMAY 424
Query: 369 GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNG 428
G ++ QP P+N T+ + +C+ G+ PRP W+ T +GG+DI +L++S SNIIF NG
Sbjct: 425 GVRDGTVLQPAPFNFTEVVDDCRNVTGLPPRPFWIETEFGGYDIANVLKKSASNIIFFNG 484
Query: 429 MRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+RDP+S GG +K+ ++ + G+H +DL + K DP+WL + R E +I+ W+
Sbjct: 485 LRDPWSTGGVLKSISDSIIALVEP--KGAHHVDLRFSSKEDPEWLKKVRVKETRIIARWL 542
Query: 489 TQYYDD 494
QYY D
Sbjct: 543 KQYYSD 548
>gi|356514372|ref|XP_003525880.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Glycine max]
Length = 597
Score = 324 bits (830), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 185/496 (37%), Positives = 284/496 (57%), Gaps = 19/496 (3%)
Query: 12 LLMVIFVSTSFHANGLKLRPR-LGRIR-RSRILEQKDSNHGFETFFYNQTIDHFSYGPES 69
L ++I +S S LK P+ LG+ +R + +ET Q++DHFS+
Sbjct: 88 LSVIIVLSYSAQPLALKHWPKFLGKFAATARTHSEPPPQFHYETRCIQQSLDHFSFS--E 145
Query: 70 YQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEH 127
TFPQRY+I+++ W G P+ + G E I+ Q +G +E A +F A++V EH
Sbjct: 146 LPTFPQRYLISTEHWVGPRRLGPVFFYSGNEDDIEWFAQNTGVVWEIAPRFGAMVVFPEH 205
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYG 187
+YYG+SVP+GS + A KN Y S QAL D++ ++ +K ++ PV G SYG
Sbjct: 206 QYYGESVPYGSAEEAYKNVTTLSYLTSEQALVDFSVVIADLKHNFSTKDCPVFLFGGSYG 265
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSW 247
G LA W RLKYPHV +G+LASSAPIL + DI P T++ +VS F+ S CF IK+SW
Sbjct: 266 GMLAAWMRLKYPHVAVGALASSAPILQFEDIVPPETFYDLVSNAFKRESFICFNYIKQSW 325
Query: 248 SEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQ--YDTPSNIPVKRIC 305
+E+ + +GL +L+K F C L T +L D ++ YS A Y P+ +
Sbjct: 326 NEMASAGQTNNGLELLTKTFNLCQKLNRTKDLYDWVEAAYSYLAMVNYPYPAEFMMTLPE 385
Query: 306 NAIENAPNCGDD-ILCKIAAGVVEADSLEYD-GNNSRCYINEDRTGDESDEGWEWQSCSE 363
+ I + IL +I GV ++ Y+ ++C+ +D S GW+WQ+C+E
Sbjct: 386 HPIREVSMVSNSYILERIYEGV----NVYYNYTGEAKCFELDDDPHGMS--GWDWQACTE 439
Query: 364 MVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSN 422
M++PM + +SM+ P + T + C +++GV PRP W+ T +GGHDI L++ SN
Sbjct: 440 MIMPMSSSQESSMFLPYEYXYTSIQEECLKKFGVKPRPKWITTEFGGHDIHATLKKFGSN 499
Query: 423 IIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVK 482
IIFSNG+ DP+S G ++ +S + G+H +DL + K+DPDWLV+QR+TE+K
Sbjct: 500 IIFSNGLLDPWSGGSILQNISESVVSLVTE--EGAHHIDLRSSTKNDPDWLVEQRETEIK 557
Query: 483 IMQGWITQYYDDFKAI 498
+++GWI+ Y+ KA+
Sbjct: 558 LIEGWISDYHQKNKAM 573
>gi|449467104|ref|XP_004151265.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like, partial [Cucumis
sativus]
Length = 359
Score = 321 bits (823), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 158/300 (52%), Positives = 217/300 (72%), Gaps = 4/300 (1%)
Query: 31 PRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN-- 88
P LG RR+ + S+ G TF+Y Q +DHF+Y P+SY TF QRY+I+ K+W G N
Sbjct: 34 PVLGVQRRAFQSTPQQSD-GLATFYYKQPLDHFNYQPQSYVTFDQRYIIDFKYWEGINPK 92
Query: 89 SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARH 148
+PI A+LGAE+ ID+++ GF A Q+KA+ V LEHR+YGKS+PFGS + A+KN
Sbjct: 93 TPIFAYLGAESDIDNDVPYVGFPLRFASQYKAMSVYLEHRFYGKSIPFGSLEKAMKNGSI 152
Query: 149 RGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLAS 208
RGYFNSAQALADYA +LLHIK + +P+I +GASYGG LA+WFRLKYPH+ +G+LAS
Sbjct: 153 RGYFNSAQALADYAELLLHIKKMFAYDTSPIIVMGASYGGMLASWFRLKYPHIALGALAS 212
Query: 209 SAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKP-DGLAILSKKF 267
SAPILY+ +ITP Y+S+VSK F++TS+ C TI++SW EID IA K GL+ILSK+F
Sbjct: 213 SAPILYFDNITPQDGYYSIVSKSFKETSKTCHDTIRRSWGEIDRIAGKTRGGLSILSKQF 272
Query: 268 KTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVV 327
KTC LK ++E+K+ +D+V++ AAQY+ P PV+ IC AI+ +++ ++ AGV+
Sbjct: 273 KTCGKLKTSSEIKNLMDSVFTMAAQYNDPYENPVRGICVAIDEEAKKKSNVIKQVVAGVI 332
>gi|222635987|gb|EEE66119.1| hypothetical protein OsJ_22160 [Oryza sativa Japonica Group]
Length = 393
Score = 315 bits (806), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 172/401 (42%), Positives = 236/401 (58%), Gaps = 21/401 (5%)
Query: 112 YENAHQFKALIVIL--EHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
++ A F AL+ L +HR+YG+S PFG+ + + GY S QALAD+A ++ +K
Sbjct: 2 FDIAPSFGALLHDLSKQHRFYGESKPFGNESNS--SPEKLGYLTSTQALADFAVLITSLK 59
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
+A +PV+ G SYGG LA+WFRLKYPHV IG++ASSAPIL + ITPW++++ VS
Sbjct: 60 HNLSAVSSPVVVFGGSYGGMLASWFRLKYPHVTIGAVASSAPILQFDYITPWSSFYEAVS 119
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VY 287
+D++ S CF IK +W ID S GL LSK F+ C +K+ ++ L T VY
Sbjct: 120 QDYKSESFNCFSVIKAAWDLIDERGSTDAGLLQLSKTFRACKTVKSVYSFRNWLWTAFVY 179
Query: 288 SEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNS 339
+ Y TP+N P+K +C I P G DI+ K A A SL Y+ +
Sbjct: 180 TAMVDYPTPANFLMNLPAYPIKEMCKIIHGFP-AGADIVDKAFA----AASLYYNYTGDQ 234
Query: 340 RCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPR 399
C+ ED GW WQ+C+EMV+PM SM+ P + +C + YGV PR
Sbjct: 235 TCFQLEDGEDPHGLSGWGWQACTEMVMPMTISNESMFPPFTFTYEGKSDDCFQSYGVRPR 294
Query: 400 PSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHC 459
P W+ T YGG+ I L+L+R SNIIFSNGMRDP+SRGG +K ++ + G+H
Sbjct: 295 PHWITTEYGGNRIDLVLKRFGSNIIFSNGMRDPWSRGGVLKNISSSIIALVTE--KGAHH 352
Query: 460 LDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
LD A K DPDW+V+QR+ EVKI+QGWI QY +D I+K
Sbjct: 353 LDFRSATKDDPDWVVEQRRQEVKIIQGWIDQYNEDLAQISK 393
>gi|413954870|gb|AFW87519.1| putative serine peptidase S28 family protein [Zea mays]
Length = 552
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 185/498 (37%), Positives = 266/498 (53%), Gaps = 67/498 (13%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP-------ILAFLGAEAPIDD 103
F T ++ Q +DHF++ P + F Q+Y++N FW + + G E I+
Sbjct: 73 FTTHYFPQELDHFTFTPNASMLFRQKYLVNDTFWRRPRRGGGGGAGPLFVYTGNEGDIEW 132
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
+GF ++ A +F AL+V +EHR+YG+S+PFG + +A GY S QALAD+A
Sbjct: 133 FATNTGFMFDIAPKFGALLVFIEHRFYGESLPFGDD--SYSSAETEGYLTSTQALADFAI 190
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
++ +K +A APV+ G SYGG LA+WFRLKYPHV IG+LASSAPIL + ITPW +
Sbjct: 191 LITGLKRNLSAETAPVVVFGGSYGGMLASWFRLKYPHVAIGALASSAPILQFDHITPWNS 250
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL 283
+ VS+D++ S CF IK +W +D S GL LSK F+ C +K+ + L
Sbjct: 251 FSDAVSQDYKSESLNCFSVIKAAWDVLDERGSTDTGLLELSKLFRACKSVKHADSIAGWL 310
Query: 284 DTVYSEAAQ--YDTPSNI-------PVK---RICNAIENAPNCGDDILCKIAAGVVEADS 331
T ++ A Y TP+N PVK ++C I+ P G DIL K+ A A +
Sbjct: 311 QTAFTYTAMVDYPTPANFLMDLPAYPVKEPIQMCKIIDGFP-AGADILEKVFA----AAN 365
Query: 332 LEYDGNNSRCYINEDRTGDESDEGWE----------------WQSCSEMVVPMGKDKNSM 375
L Y + TGD++ E WQ+C+EM++PM SM
Sbjct: 366 LYY-----------NYTGDQACNQIESDSDDSSSSLGLSGWGWQACTEMIMPMSTSNASM 414
Query: 376 YQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSR 435
+ P ++ C + GV PRP W+ T YGG+ I +L+R SNIIFSNGMRDP+SR
Sbjct: 415 FPPSSFSYEDTSNACFQSTGVRPRPHWITTEYGGYRIDKVLKRFGSNIIFSNGMRDPWSR 474
Query: 436 GGWVKTYHFFDLSF----SQDLNLG----------SHCLDLDEAKKSDPDWLVQQRKTEV 481
GG +K ++ + NL +H LDL A DPDW+ +QR+ EV
Sbjct: 475 GGVLKNISSSIVALVTEKGEHGNLRRKKKKKKKKWAHHLDLRFATDEDPDWVTEQRRQEV 534
Query: 482 KIMQGWITQYYDDFKAIN 499
+I++GWI QY+ D ++
Sbjct: 535 EIIEGWIDQYHRDMAQVS 552
>gi|327269259|ref|XP_003219412.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Anolis
carolinensis]
Length = 500
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 187/505 (37%), Positives = 281/505 (55%), Gaps = 39/505 (7%)
Query: 10 WLLLMVI-FVSTSFHANGLKLRPRLGRIRR----SRILEQKDSNHGFETFFYNQTIDHFS 64
W++L V+ V+T + G+ PRL R +RI + H T F Q IDHF
Sbjct: 7 WMVLTVVSLVATLGYVEGVLGHPRLHAASRPDPANRIFQAPREYH---TCFIGQKIDHFG 63
Query: 65 YGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVI 124
+ +TF QRY+I + W IL + G E I +GF + A + A++V
Sbjct: 64 F--YENRTFKQRYLIAEQHWKRDVGSILFYTGNEGDITWFANNTGFMWNVAEELDAILVF 121
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIG 183
EHRYYG S+PFG++ + +A+H Y +SAQALAD+A ++ H+K A PVIAIG
Sbjct: 122 AEHRYYGVSLPFGNK--SFSDAKHLNYLSSAQALADFAVLVQHLKATIPGAQDTPVIAIG 179
Query: 184 ASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTI 243
SYGG LA WFR+KYPH+VIG+LA+SAPI + + P T++S+V++DF+ + C ++I
Sbjct: 180 GSYGGMLAAWFRMKYPHIVIGALAASAPIWQFDSLVPCGTFYSIVTQDFKKSGNCCSESI 239
Query: 244 KKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTVYSEAAQYDTPSNI- 299
+ SW+ I+ +AS +GL LS F+ CTPLK T+ K L + A D P
Sbjct: 240 RNSWAAINRLASTEEGLRWLSSTFRLCTPLKTETDAATFKGWLGETWVNLAMVDYPYKAD 299
Query: 300 --------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDE 351
P++ +C ++N P D +L + + +A ++ Y+ +N +T +
Sbjct: 300 FLQPLPAWPIQVVCKYLKN-PKLPDKLLLQ---NIFQAVNVYYNYTGHASCLNLTQTATK 355
Query: 352 S--DEGWEWQSCSEMVVPMGKDKNS-MYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYG 408
S +GW +Q+C+EMV+PM D S M++P+ W+ Y + C + +GV PRPSW+ T+YG
Sbjct: 356 SLGIQGWYYQACTEMVMPMCSDGVSDMFEPQKWDFHTYSEECFKNWGVRPRPSWIPTFYG 415
Query: 409 GHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKS 468
G +I + SNIIFSNG DP+S GG K + + + + G+H LDL
Sbjct: 416 GKNIS-----AHSNIIFSNGGLDPWSGGGVTK--NITNTLVAVVIPEGAHHLDLRSNTPF 468
Query: 469 DPDWLVQQRKTEVKIMQGWITQYYD 493
DPD ++Q R EV M+ W+ QY+
Sbjct: 469 DPDSVLQARLLEVHYMRQWLRQYHQ 493
>gi|18700101|gb|AAL77662.1| AT5g65760/MPA24_11 [Arabidopsis thaliana]
Length = 491
Score = 311 bits (796), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 185/453 (40%), Positives = 256/453 (56%), Gaps = 31/453 (6%)
Query: 11 LLLMVIFVSTSFHANGLKLR-----PRLGRI----RRSRILE-QKDSN-HGFETFFYNQT 59
LLL+ F + F +NG L PR R R +RI + + D N + +ET F++Q
Sbjct: 7 LLLIFTFFTLVFPSNGSSLSSSKLLPRFPRYTFQNREARIQQFRGDRNEYRYETKFFSQQ 66
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHFS+ F QRY+INS W G ++ PI + G E I+ SGF ++ A +
Sbjct: 67 LDHFSFA--DLPKFSQRYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSGFIWDIAPK 124
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
F AL+V EHRYYG+S+P+GSR+ A KNA Y + QALAD+A + +K +A
Sbjct: 125 FGALLVFPEHRYYGESMPYGSREEAYKNATTLSYLTTEQALADFAVFVTDLKRNLSAEAC 184
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
PV+ G SYGG LA W RLKYPH+ IG+LASSAPIL + D+ P T++ + S DF+ S
Sbjct: 185 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDVVPPETFYDIASNDFKRESS 244
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTP- 296
CF TIK SW I K +GL L+K F C L +T +L D LD+ YS A D P
Sbjct: 245 SCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPY 304
Query: 297 --------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRT 348
P++ +C I+ A IL +I AG+ + Y GN +++D
Sbjct: 305 PADFMMPLPGHPIREVCRKIDGA-GSNASILDRIYAGI--SVYYNYTGNVDCFKLDDDPH 361
Query: 349 GDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYY 407
G +GW WQ+C+EMV+PM ++ NSM+ +N + Y + C + V+PRP WV T +
Sbjct: 362 G---LDGWNWQACTEMVMPMSSNQENSMFPGYGFNYSSYKEECWNTFRVNPRPKWVTTEF 418
Query: 408 GGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
GGHDI L+ SNIIFSNG+ DP+S G +K
Sbjct: 419 GGHDIATTLKSFGSNIIFSNGLLDPWSGGSVLK 451
>gi|321468146|gb|EFX79132.1| hypothetical protein DAPPUDRAFT_225197 [Daphnia pulex]
Length = 477
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 172/461 (37%), Positives = 267/461 (57%), Gaps = 29/461 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
++TFF+ Q +DHFS+ + T+PQRY++NS +W G PI + G E I+ Q +GF
Sbjct: 22 WKTFFFKQQVDHFSFANQ--DTYPQRYLVNSTYWKRGGGPIFFYTGNEGDIEWFAQNTGF 79
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A +F A++V EHRYYG+S+P+G++ + +A++ GY S QALAD+A ++ +IK
Sbjct: 80 MWDIAEEFGAMLVFAEHRYYGQSLPYGNK--SYSDAKYLGYLTSEQALADFAELVAYIKS 137
Query: 171 -KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A +PVIA G SYGG L+ W R+KYPH++ GS+A+SAPIL + +TP ++ VV+
Sbjct: 138 TNSGAVDSPVIAFGGSYGGMLSAWMRIKYPHIITGSIAASAPILQFTGLTPCDAFNRVVT 197
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTV 286
DF S EC +TI+KSW + +I S+ DG L + C PL T + LKD L V
Sbjct: 198 ADFASASTECSETIRKSWKSLSSILSQDDGKEWLRTHWNLCVPLNGTDDVVNLKDWLTNV 257
Query: 287 YSEAAQYDTP---------SNIPVKRICNAIENAPNCGDDILCKIAAGV-VEADSLEYDG 336
++ A + P PVK +C + N+ +L ++ G+ V A+ + G
Sbjct: 258 WTNLAMVNYPYAANFLAPLPAYPVKAVCEHLTNSSLDDHSLLDELFKGLSVYAN---FTG 314
Query: 337 NNSRCYINEDRTGDESDEGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYG 395
+++ D GW++Q+C+EMV+PM G N M++ +PWN+ KY + C +++
Sbjct: 315 QTKCLDVSQQADQSLGDMGWDFQACTEMVMPMCGDGINDMFEAQPWNIEKYSERCLKKWK 374
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V+PRP YGG +I S+SNI+FSNG+ DP+S GG K+ D + +
Sbjct: 375 VNPRPLMAPLIYGGKNIS-----SSSNIVFSNGLLDPWSTGGVTKSLS--DSIVAIIIPE 427
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
G+H LDL A +DP +V+ R+ E + + WI+ D K
Sbjct: 428 GAHHLDLRAADPNDPPSVVKAREIEKQFIGKWISSVKKDKK 468
>gi|350588336|ref|XP_003129755.3| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Sus scrofa]
Length = 493
Score = 306 bits (785), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 171/462 (37%), Positives = 260/462 (56%), Gaps = 31/462 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ + Q +DHF + + +TF QRY+I W IL + G E I +GF
Sbjct: 44 YSIHYIQQKVDHFGFNTD--KTFKQRYLIADTHWRKDGGSILFYTGNEGDIIWFCNNTGF 101
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG+ + K++RH + S QALAD+A ++ H+K
Sbjct: 102 MWDVAEELKAMLVFAEHRYYGESLPFGAH--SFKDSRHLNFLTSEQALADFAELIRHLKR 159
Query: 171 KYNAT-HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
K T + PVIA+G SYGG LA WFR+KYPH+V+G+LASSAPI ++ ++ P + V+
Sbjct: 160 KIPGTENQPVIALGGSYGGMLAAWFRMKYPHMVVGALASSAPIWHFENLVPCGVFMKTVT 219
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTV 286
KDFR++ C +TI++SW I+ +A K GL LS+ CTPL N+ + LKD +
Sbjct: 220 KDFRESGPNCSETIRRSWDAINRLARKGTGLHWLSEALHLCTPLTNSQDVQRLKDWISET 279
Query: 287 YSEAAQYDTP---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-G 336
+ A D P P+K +C ++N PN D +L + + +A ++ Y+
Sbjct: 280 WVNLAMVDYPYESDFLQPLPAWPIKVVCQYLKN-PNVSDSLLVQ---NIFQALNVYYNYS 335
Query: 337 NNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQY 394
+RC I+E T GW +Q+C+EMV+P + + M++P W+L ++ +C Q+
Sbjct: 336 GQARCLNISETATSSLGALGWSYQACTEMVMPFCSNGIDDMFEPNSWDLKEFSDDCFRQW 395
Query: 395 GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
GV PRP W+ T YGG +I S +NIIFSNG DP+S GG K + D + +
Sbjct: 396 GVRPRPYWITTVYGGKNIS-----SHTNIIFSNGELDPWSGGGVTK--NVTDTLVAIVIP 448
Query: 455 LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
G+H LDL DP ++ R EV+ M+ WI +Y +
Sbjct: 449 KGAHHLDLRANTAFDPTTVLLARSLEVRYMKQWIKDFYASLR 490
>gi|403287756|ref|XP_003935096.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Saimiri boliviensis
boliviensis]
Length = 496
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 167/461 (36%), Positives = 264/461 (57%), Gaps = 30/461 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + + +TF QRY++ K+W IL + G E I +GF
Sbjct: 48 YSVLYFQQKVDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG ++ K++RH + S QALAD+A ++ H+K
Sbjct: 106 MWDVAEELKAMLVFAEHRYYGESLPFGD--SSFKDSRHLNFLTSEQALADFAELIKHMKR 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+
Sbjct: 164 TIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DFR + C ++I++SW I+ ++S GL LS+ C+PL ++ LKD + +
Sbjct: 224 TDFRKSGPHCSESIRRSWDAINRLSSTGSGLQWLSEALHLCSPLTSEDVRHLKDWISETW 283
Query: 288 SEAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GN 337
A D P SN P+K +C ++ +PN D +L + + +A ++ Y+
Sbjct: 284 VNLAMVDYPYASNFLQPLPAWPIKVVCQYLK-SPNVSDLLLLQ---NIFQALNVYYNYSG 339
Query: 338 NSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
++C I+E TG GW +Q+C+E+V+P + + M++P WNL + +C +Q+G
Sbjct: 340 QAKCLNISETATGSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWG 399
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRPSW+ T YGG +I S +NI+FSNG DP+S GG T D + ++
Sbjct: 400 VRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGG--VTKDITDTLVAVTISE 452
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
G+H LDL DP ++ R EV+ M+ WI ++Y +
Sbjct: 453 GAHHLDLRAKNALDPTSVLLARSLEVRHMKNWIREFYGSVR 493
>gi|296216949|ref|XP_002754795.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Callithrix
jacchus]
Length = 496
Score = 305 bits (780), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 169/458 (36%), Positives = 261/458 (56%), Gaps = 30/458 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + + +TF QRY++ K+W IL + G E I +GF
Sbjct: 48 YSVLYFQQKVDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG ++ K++RH + S QALAD+A ++ H+K
Sbjct: 106 MWDVAEELKAMLVFAEHRYYGESLPFGD--SSFKDSRHLNFLTSEQALADFAELIKHLKR 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+
Sbjct: 164 TIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DFR + C ++I +SW I+ ++S GL LS+ C+PL ++ LKD + +
Sbjct: 224 TDFRKSGPHCSESIHRSWDAINRLSSTGSGLQWLSEALHLCSPLTSEDIQHLKDWISETW 283
Query: 288 SEAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GN 337
A D P SN P+K +C ++N PN D +L + + +A ++ Y+
Sbjct: 284 VNLAMVDYPYASNFLQPLPAWPIKVVCQYLKN-PNVSDLLLLQ---NIFQALNVYYNYSG 339
Query: 338 NSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
++C I+E TG GW +Q+C+E+V+P + + M++P WNL + C +Q+G
Sbjct: 340 QAKCLNISETATGSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDYCFQQWG 399
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRPSW+ T YGG +I S +NI+FSNG DP+S GG K D + ++
Sbjct: 400 VRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKD--ITDTLVAVTISE 452
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
G+H LDL DP ++ R EV+ M+ WI +YD
Sbjct: 453 GAHHLDLRAKNALDPTSVLLARVLEVRHMKNWIRDFYD 490
>gi|260814189|ref|XP_002601798.1| hypothetical protein BRAFLDRAFT_279167 [Branchiostoma floridae]
gi|229287100|gb|EEN57810.1| hypothetical protein BRAFLDRAFT_279167 [Branchiostoma floridae]
Length = 481
Score = 305 bits (780), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 180/496 (36%), Positives = 269/496 (54%), Gaps = 35/496 (7%)
Query: 14 MVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTF 73
++I +S + L PR R + + Q + ++T ++ Q +DHFS+ TF
Sbjct: 7 ILICLSVAHTGGTHLLNPRFPRPKGPALKPQ----YSYDTKYFTQPVDHFSF--TRTDTF 60
Query: 74 PQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKS 133
QRY+IN K++ G PI + G E I +GF ++ A +FKAL+V EHRYYG+S
Sbjct: 61 DQRYLINMKYFEGTGGPIFLYTGNEGDITMFCDNTGFMWDIAPKFKALVVFAEHRYYGES 120
Query: 134 VPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGELAT 192
+P+G K + K+ H GY + QALAD+A ++ H+K A +PV+A G SYGG LA
Sbjct: 121 MPYG--KDSYKDPEHLGYLTAEQALADFARLITHLKASIPGAADSPVVAFGGSYGGMLAA 178
Query: 193 WFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDN 252
WFR+KYP VIGSLA+SAP+ + +TP + +S++++DF+ S C I KSW +
Sbjct: 179 WFRMKYPSSVIGSLAASAPVWQFEGLTPCGSSYSIITEDFQKGSPGCETYIHKSWDLLTQ 238
Query: 253 IASKPDGLAILSKKFKTCTPLKNTTELKDG----LDTVYSEA-AQYDTPSNI-------P 300
+ G LS F C+PL TT++ L T ++ A Y P++ P
Sbjct: 239 MGQTAAGREKLSSMFSLCSPLNTTTDVATMSSWLLSTWFNLAMVNYPYPASFLEPLPGWP 298
Query: 301 VKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQS 360
VK +C+ I + N +L IAAG + Y G I E D GW +Q
Sbjct: 299 VKEVCSLITASSN----VLEGIAAGAML--YYNYTGQAPCLNIEESAVSSLGDLGWSFQF 352
Query: 361 CSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRS 419
CSEM +P D + M+ +PWNLT YI CK + V+PRP W+L +GG +I +
Sbjct: 353 CSEMAMPSCTDGVHDMFPLQPWNLTTYISQCKSTWKVTPRPYWILQQFGGKNIT-----A 407
Query: 420 TSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKT 479
SNIIFSNG+ DP+S GG +++ D + + G+H LDL + +DP +++ R+
Sbjct: 408 ASNIIFSNGLLDPWSAGGVMESLS--DSLVAITIADGAHHLDLRSSNPADPQSVIEAREQ 465
Query: 480 EVKIMQGWITQYYDDF 495
EV+I++ W+ YY
Sbjct: 466 EVQIIREWLQDYYSSL 481
>gi|148909163|gb|ABR17682.1| unknown [Picea sitchensis]
Length = 501
Score = 304 bits (779), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 189/512 (36%), Positives = 282/512 (55%), Gaps = 51/512 (9%)
Query: 11 LLLMVIFVSTSF-HANGLKLRPRLGRIRRSR---------ILEQKDSNHGFETFFYNQTI 60
LL +VI + S + + L+ L R +R + S ++ +Y Q +
Sbjct: 10 LLCVVILILVSIGTSESVPLKSSLIRFSPTRKWNRGAATVLASASVSAAKYDVRYYTQIL 69
Query: 61 DHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQF 118
DHFS+ PESYQTF Q+Y+INS WGG + SPI + G E I+ + +GF ++ A QF
Sbjct: 70 DHFSFVPESYQTFQQKYLINSDHWGGASAKSPIFVYTGNEGFIEWFTENTGFMFDIAPQF 129
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
KA++V +EHR+YG S+PFGS+KAA N+ G+ +SAQALAD+A+++ +K +A +P
Sbjct: 130 KAMLVFIEHRFYGHSMPFGSQKAAYSNSSTLGFLSSAQALADFATLITDLKKNLSAEDSP 189
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
V+ G S G ++SSAPILY+ +ITP ++ VS+DFR SE
Sbjct: 190 VVVFGGS-----------------YGGISSSAPILYFDNITPIGSFDDTVSEDFRSESEN 232
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKN---TTELKDGLDTVYSEAAQYDT 295
CF+ IK SW+ ID + S P+GL L K + C + L L Y AA D
Sbjct: 233 CFKVIKGSWNVIDEMTSTPEGLKSLRKALRICKSNSDNYIAGNLAGWLYDAYYTAAMTDY 292
Query: 296 PSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINED 346
P PVK++C AI+N P+ D+L ++ GV Y G +S C+ +
Sbjct: 293 PVAANFVQPLPAYPVKQMCKAIDN-PSGSTDLLSQL-YGVANV-YYNYTGRSS-CF--DI 346
Query: 347 RTGD-ESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL 404
R D ++GW++Q+C+EMV+PM D K SM+ ++ + + +C+ YGV PR W+
Sbjct: 347 RPSDPHGEDGWQFQACTEMVMPMADDPKKSMFPNSTFDYQERVDSCESAYGVQPRRHWIT 406
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDE 464
T YGGH IK +L+ SNIIF NG+RDP+S GG ++ + + + G+H +D
Sbjct: 407 TQYGGHHIKRVLKNFASNIIFFNGLRDPWSGGGVLEDIN--ESVVAIVAKEGAHHVDFRF 464
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
A K DP WL R E+ I++ W+ QYY + +
Sbjct: 465 ATKDDPQWLKDARTKEISIIKSWLQQYYMELQ 496
>gi|354489839|ref|XP_003507068.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cricetulus
griseus]
Length = 494
Score = 304 bits (779), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 171/468 (36%), Positives = 271/468 (57%), Gaps = 30/468 (6%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
Q D + + F++ Q +DHF + + +TF QRY+I ++W + IL + G E I
Sbjct: 39 QPDEANNYSVFYFEQKVDHFGF--YNTKTFKQRYLIADRYWKTYDGVILFYTGNEGDITW 96
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
+GF ++ A + KAL+V EHRYYG+S+PFG+ + K+++H + S QALAD+A
Sbjct: 97 FSNHTGFMWDVAEKLKALLVFAEHRYYGESLPFGAE--SFKDSKHLNFLTSEQALADFAE 154
Query: 164 ILLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
++ H++ A PVIAIG SYGG LA WFR+KYPH+VIG+LA+SAPI + ++ P
Sbjct: 155 LIRHLRRTVPGADSQPVIAIGGSYGGVLAAWFRMKYPHLVIGALAASAPIWQFEELIPCG 214
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELK 280
+ +V++DFR + ++C ++I +SW I+ +++ GL L+K F C+ L ++ LK
Sbjct: 215 MFMKIVTEDFRKSGKKCSESILRSWGAINRLSNTGSGLPWLTKAFHLCSSLNSEDVQRLK 274
Query: 281 DGLDTVYSEAAQ--YDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADS 331
D + + A Y P+N P+K +C + N P D +L + + +A +
Sbjct: 275 DWISETWVNLAMVNYPYPANFLKPLPSWPLKVVCQYLTN-PFMSDSLLVQ---NIYQALN 330
Query: 332 LEYD-GNNSRC-YINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIK 388
+ Y+ SRC I+E G W +Q+C+E+++P + + M++ W+L KY
Sbjct: 331 VYYNYSGQSRCNNISETTIGSLGSRVWGYQTCTEIILPFCTNGVDDMFEARAWDLDKYSD 390
Query: 389 NCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+C +Q+GV PRPSW+ T YGG DI RS SNIIFSNG DP+S GG T D
Sbjct: 391 DCYKQWGVRPRPSWITTLYGGKDI-----RSHSNIIFSNGDLDPWSGGG--VTEDLSDTL 443
Query: 449 FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
+ ++ G+H LDL + ++DP L R EV+ M+ WI+ +Y +++
Sbjct: 444 VAVNIPEGAHHLDLRSSTETDPPSLQLARSVEVRHMKQWISDFYGNYR 491
>gi|383408887|gb|AFH27657.1| lysosomal Pro-X carboxypeptidase isoform 1 preproprotein [Macaca
mulatta]
Length = 496
Score = 304 bits (778), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 168/458 (36%), Positives = 257/458 (56%), Gaps = 30/458 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + + +TF QRY++ K+W IL + G E I +GF
Sbjct: 48 YSVLYFQQKVDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K
Sbjct: 106 MWDVAEELKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKR 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIAIG SYGG LA WFR+KYPH+ +G+LA+SAPI + D+ P + +V+
Sbjct: 164 TIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMAVGALAASAPIWQFEDLVPCGVFMKIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DFR + C ++I++SW I+ ++S GL L+ C+PL ++ LKD + +
Sbjct: 224 TDFRKSGPHCSESIRRSWDAINRLSSTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETW 283
Query: 288 SEAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GN 337
A D P SN P+K +C ++N PN D +L + + +A ++ Y+
Sbjct: 284 VNLAMVDYPYASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSG 339
Query: 338 NSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
+C I+E T GW +Q+C+E+V+P + + M++P WNL + C EQ+G
Sbjct: 340 QVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDECFEQWG 399
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRPSW+ T YGG +I S SNI+FSNG DP+S GG K D + ++
Sbjct: 400 VRPRPSWITTMYGGKNIS-----SHSNIVFSNGELDPWSGGGVTKD--ITDTLVAVTISE 452
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
G+H LDL DP ++ R EV+ M+ WI +YD
Sbjct: 453 GAHHLDLRAKNALDPTSVLLARALEVRHMKNWIRDFYD 490
>gi|156369614|ref|XP_001628070.1| predicted protein [Nematostella vectensis]
gi|156215037|gb|EDO36007.1| predicted protein [Nematostella vectensis]
Length = 436
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 173/455 (38%), Positives = 271/455 (59%), Gaps = 39/455 (8%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF++ + TF QRY++N W G PI + G E I +GF ++NA +F
Sbjct: 1 LDHFNF--RTSATFSQRYLVNIANWRKGG-PIFFYTGNEGDITWFANNTGFMWDNAKEFG 57
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHAP 178
A++V EHRYYG+++PFG R + ++ ++ GY +S QALAD+A+++ HIK AT +P
Sbjct: 58 AMLVFAEHRYYGETLPFGKR--SYESPKYLGYLSSEQALADFATLIRHIKLTTPGATGSP 115
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF-RDTSE 237
VIAIG SYGG L++W R+KYP++V +LA+SAPILY+ +TP ++ +V+KDF RD +
Sbjct: 116 VIAIGGSYGGMLSSWIRMKYPNLVTAALAASAPILYFQGLTPCEGFNEIVTKDFHRDGGD 175
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK---NTTELKDGLDTVYSEAAQYD 294
C +I+KSWS I+ + + G L+ F TC+P+K N T+L+D L ++ A +
Sbjct: 176 SCVNSIRKSWSVIEKLGATQSGRKTLTSVFNTCSPIKTKYNVTQLQDWLSETWANLAMVN 235
Query: 295 TPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGN-NSRCY-I 343
P P+K++C+ + +A + D L K AG V + Y+ + ++C+ +
Sbjct: 236 YPYAATFLEPLPAWPIKKVCSHLTDA-DLPDVALLKAVAGAV---GVYYNSSGQAKCFNL 291
Query: 344 NEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSW 402
++ D+GW++Q+C+EMV+PM D N M++P W+ + ++C+ YGV PR W
Sbjct: 292 SQQAVSSLGDKGWDFQACTEMVMPMCSDGVNDMFKPVKWDFEAFAESCQGSYGVKPRQYW 351
Query: 403 VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL---GSHC 459
V YGG DI + SNIIFSNG+ DP+ GG K+ LS S L G+H
Sbjct: 352 VEVQYGGRDIS-----AHSNIIFSNGLLDPWFAGGVTKS-----LSPSLVAVLVEEGAHH 401
Query: 460 LDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDD 494
LDL + +DP L++ R+TE + + WI++YY +
Sbjct: 402 LDLRHSNPADPPSLIKARQTEKEYLHRWISEYYKN 436
>gi|114639541|ref|XP_001175153.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 7 [Pan
troglodytes]
gi|397502802|ref|XP_003822032.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Pan
paniscus]
gi|410223852|gb|JAA09145.1| prolylcarboxypeptidase (angiotensinase C) [Pan troglodytes]
gi|410254528|gb|JAA15231.1| prolylcarboxypeptidase (angiotensinase C) [Pan troglodytes]
gi|410308824|gb|JAA33012.1| prolylcarboxypeptidase (angiotensinase C) [Pan troglodytes]
gi|410334959|gb|JAA36426.1| prolylcarboxypeptidase (angiotensinase C) [Pan troglodytes]
Length = 496
Score = 303 bits (776), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 166/458 (36%), Positives = 259/458 (56%), Gaps = 30/458 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + + +TF QRY++ K+W IL + G E I +GF
Sbjct: 48 YSVLYFQQKVDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K
Sbjct: 106 MWDVAEELKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKR 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+
Sbjct: 164 TIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DFR + C ++I++SW I+ +++ GL L+ C+PL ++ LKD + +
Sbjct: 224 TDFRKSGPHCSESIRRSWEAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETW 283
Query: 288 SEAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GN 337
A D P SN P+K +C ++N PN D +L + + +A ++ Y+
Sbjct: 284 VNLAMVDYPYASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSG 339
Query: 338 NSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
+C I+E T GW +Q+C+E+V+P + + M++P WNL + +C +Q+G
Sbjct: 340 QVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWG 399
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRPSW+ T YGG +I S +NI+FSNG DP+S GG T D + ++
Sbjct: 400 VRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGG--VTKDITDTLVAVTISE 452
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
G+H LDL DP ++ R EV+ M+ WI +YD
Sbjct: 453 GAHHLDLRTKNALDPTSVLLARSLEVRHMKNWIRDFYD 490
>gi|426369948|ref|XP_004051942.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Gorilla
gorilla gorilla]
Length = 496
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 166/458 (36%), Positives = 259/458 (56%), Gaps = 30/458 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + + +TF QRY++ K+W IL + G E I +GF
Sbjct: 48 YSVLYFQQKVDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K
Sbjct: 106 MWDVAEELKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKR 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+
Sbjct: 164 TIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DFR + C ++I++SW I+ +++ GL L+ C+PL ++ LKD + +
Sbjct: 224 TDFRKSGPHCSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETW 283
Query: 288 SEAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GN 337
A D P SN P+K +C ++N PN D +L + + +A ++ Y+
Sbjct: 284 VNLAMVDYPYASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSG 339
Query: 338 NSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
+C I+E T GW +Q+C+E+V+P + + M++P WNL + +C +Q+G
Sbjct: 340 QVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWG 399
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRPSW+ T YGG +I S +NI+FSNG DP+S GG K D + ++
Sbjct: 400 VRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTK--DITDTLVAVTISE 452
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
G+H LDL DP ++ R EV+ M+ WI +YD
Sbjct: 453 GAHHLDLRTKNALDPTSVLLARSLEVRHMKNWIRDFYD 490
>gi|386781752|ref|NP_001248178.1| lysosomal Pro-X carboxypeptidase precursor [Macaca mulatta]
gi|380789243|gb|AFE66497.1| lysosomal Pro-X carboxypeptidase isoform 1 preproprotein [Macaca
mulatta]
gi|384941696|gb|AFI34453.1| lysosomal Pro-X carboxypeptidase isoform 1 preproprotein [Macaca
mulatta]
Length = 496
Score = 303 bits (775), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 167/458 (36%), Positives = 257/458 (56%), Gaps = 30/458 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + + +TF QRY++ K+W IL + G E I +GF
Sbjct: 48 YSVLYFQQKVDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K
Sbjct: 106 MWDVAEELKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKR 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIAIG SYGG LA WFR+KYPH+ +G+LA+SAPI + D+ P + +V+
Sbjct: 164 TIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMAVGALAASAPIWQFEDLVPCGVFMKIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DF+ + C ++I++SW I+ ++S GL L+ C+PL ++ LKD + +
Sbjct: 224 TDFKKSGPHCSESIRRSWDAINRLSSTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETW 283
Query: 288 SEAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GN 337
A D P SN P+K +C ++N PN D +L + + +A ++ Y+
Sbjct: 284 VNLAMVDYPYASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSG 339
Query: 338 NSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
+C I+E T GW +Q+C+E+V+P + + M++P WNL + C EQ+G
Sbjct: 340 QVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDECFEQWG 399
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRPSW+ T YGG +I S SNI+FSNG DP+S GG K D + ++
Sbjct: 400 VRPRPSWITTMYGGKNIS-----SHSNIVFSNGELDPWSGGGVTKD--ITDTLVAVTISE 452
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
G+H LDL DP ++ R EV+ M+ WI +YD
Sbjct: 453 GAHHLDLRAKNALDPTSVLLARALEVRHMKNWIRDFYD 490
>gi|197098906|ref|NP_001125428.1| lysosomal Pro-X carboxypeptidase precursor [Pongo abelii]
gi|68565632|sp|Q5RBU7.1|PCP_PONAB RecName: Full=Lysosomal Pro-X carboxypeptidase; AltName:
Full=Proline carboxypeptidase; AltName:
Full=Prolylcarboxypeptidase; Short=PRCP; Flags:
Precursor
gi|55728021|emb|CAH90763.1| hypothetical protein [Pongo abelii]
Length = 496
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 166/458 (36%), Positives = 258/458 (56%), Gaps = 30/458 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + + +TF QRY++ K+W IL + G E I +GF
Sbjct: 48 YSVLYFQQKVDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG K++RH + S QALAD+A ++ H+K
Sbjct: 106 MWDVAEELKAMLVFAEHRYYGESLPFGDN--TFKDSRHLNFLTSEQALADFAELIKHLKR 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+
Sbjct: 164 TIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DFR + C ++I++SW I+ +++ GL L+ C+PL ++ LKD + +
Sbjct: 224 TDFRKSGPHCSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETW 283
Query: 288 SEAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GN 337
A D P SN P+K +C ++N PN D +L + + +A ++ Y+
Sbjct: 284 VNLAMVDYPYASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSG 339
Query: 338 NSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
+C I+E T GW +Q+C+E+V+P + + M++P WNL + +C +Q+G
Sbjct: 340 QVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWG 399
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRPSW+ T YGG +I S +NI+FSNG DP+S GG K D + ++
Sbjct: 400 VRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTK--DITDTLVAVTISE 452
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
G+H LDL DP ++ R EV+ M+ WI +YD
Sbjct: 453 GAHHLDLRTKNALDPTSVLLARSLEVRHMKNWIRDFYD 490
>gi|4826940|ref|NP_005031.1| lysosomal Pro-X carboxypeptidase isoform 1 preproprotein [Homo
sapiens]
gi|1172047|sp|P42785.1|PCP_HUMAN RecName: Full=Lysosomal Pro-X carboxypeptidase; AltName:
Full=Angiotensinase C; AltName: Full=Lysosomal
carboxypeptidase C; AltName: Full=Proline
carboxypeptidase; AltName: Full=Prolylcarboxypeptidase;
Short=PRCP; Flags: Precursor
gi|431321|gb|AAA99891.1| prolylcarboxypeptidase [Homo sapiens]
gi|16306648|gb|AAH01500.1| Prolylcarboxypeptidase (angiotensinase C) [Homo sapiens]
gi|119595480|gb|EAW75074.1| prolylcarboxypeptidase (angiotensinase C), isoform CRA_a [Homo
sapiens]
gi|119595482|gb|EAW75076.1| prolylcarboxypeptidase (angiotensinase C), isoform CRA_a [Homo
sapiens]
gi|123992784|gb|ABM83994.1| prolylcarboxypeptidase (angiotensinase C) [synthetic construct]
gi|123999550|gb|ABM87320.1| prolylcarboxypeptidase (angiotensinase C) [synthetic construct]
gi|197692241|dbj|BAG70084.1| prolylcarboxypeptidase isoform 1 preproprotein [Homo sapiens]
gi|197692495|dbj|BAG70211.1| prolylcarboxypeptidase isoform 1 preproprotein [Homo sapiens]
Length = 496
Score = 301 bits (771), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 166/458 (36%), Positives = 258/458 (56%), Gaps = 30/458 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + + +TF QRY++ K+W IL + G E I +GF
Sbjct: 48 YSVLYFQQKVDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K
Sbjct: 106 MWDVAEELKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKR 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+
Sbjct: 164 TIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DFR + C ++I +SW I+ +++ GL L+ C+PL ++ LKD + +
Sbjct: 224 TDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETW 283
Query: 288 SEAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GN 337
A D P SN P+K +C ++N PN D +L + + +A ++ Y+
Sbjct: 284 VNLAMVDYPYASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSG 339
Query: 338 NSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
+C I+E T GW +Q+C+E+V+P + + M++P WNL + +C +Q+G
Sbjct: 340 QVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWG 399
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRPSW+ T YGG +I S +NI+FSNG DP+S GG K D + ++
Sbjct: 400 VRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKD--ITDTLVAVTISE 452
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
G+H LDL DP ++ R EV+ M+ WI +YD
Sbjct: 453 GAHHLDLRTKNALDPMSVLLARSLEVRHMKNWIRDFYD 490
>gi|402894798|ref|XP_003910532.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Papio
anubis]
Length = 496
Score = 301 bits (770), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 166/458 (36%), Positives = 257/458 (56%), Gaps = 30/458 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + + +TF QRY++ K+W IL + G E I +GF
Sbjct: 48 YSVLYFQQKVDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K
Sbjct: 106 MWDVAEELKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKR 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIAIG SYGG LA WFR+KYPH+ +G+LA+SAPI + D+ P + +V+
Sbjct: 164 TIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMAVGALAASAPIWQFEDLVPCGVFMKIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DFR + C ++I++SW I+ +++ GL L+ C+PL ++ LKD + +
Sbjct: 224 TDFRKSGPYCSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETW 283
Query: 288 SEAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GN 337
A D P SN P+K +C ++N PN D +L + + +A ++ Y+
Sbjct: 284 VNLAMVDYPYASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSG 339
Query: 338 NSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
+C I+E T GW +Q+C+E+V+P + + M++P WNL + C EQ+G
Sbjct: 340 QVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDECFEQWG 399
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRPSW+ T YGG +I S +NI+FSNG DP+S GG K D + ++
Sbjct: 400 VRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTK--DITDTLVAVTISE 452
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
G+H LDL DP ++ R EV+ M+ WI +YD
Sbjct: 453 GAHHLDLRAKNALDPTSVLLARALEVRHMKNWIRDFYD 490
>gi|326914542|ref|XP_003203584.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Meleagris
gallopavo]
Length = 483
Score = 300 bits (769), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 170/456 (37%), Positives = 261/456 (57%), Gaps = 31/456 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ T + Q IDHF + + TF QRY+I + W N PIL + G E I +GF
Sbjct: 38 YVTRYLTQQIDHFGF--DENLTFQQRYLIADQHWQKDNGPILFYTGNEGDITWFCNNTGF 95
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + A++V EHRYYG+S+PFG+ + +++H Y S QALAD+A ++ ++K+
Sbjct: 96 MWDVAEELNAMLVFAEHRYYGESLPFGNE--SFSDSKHLNYLTSEQALADFAVLIEYLKE 153
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A H+PVIAIG SYGG LA WFR+KYPHVV+G+LA+SAPI +GD+ P Y ++V+
Sbjct: 154 TIAGARHSPVIAIGGSYGGMLAAWFRMKYPHVVVGALAASAPIWQFGDLVPCGAYFTIVT 213
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTV 286
DF+ + C ++I+ SW+ I++++S GL LS+ F C+PLKN + LK L
Sbjct: 214 NDFKKSGTGCSESIRNSWNAINHLSSTDAGLQWLSRTFHLCSPLKNLQDALMLKSWLSET 273
Query: 287 YSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-G 336
+ + A D P P++ +C +++ P+ D +L + V +A ++ Y+
Sbjct: 274 WIDLAMVDYPYKADFLEPLPAWPIQEVCKFLKD-PSLSDKLLLQ---NVFQAINIYYNYT 329
Query: 337 NNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQY 394
+ C+ ++E T + GW +Q C+EMV+PM D + M++P+ W+ C + +
Sbjct: 330 GEASCFDMSETATKNLGQLGWYYQVCTEMVMPMCTDGVHDMFEPQKWDFEALSDECFKLW 389
Query: 395 GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
GV PRPSW+L+ YGG +I S SNIIFSNG DP+S GG T + D + +
Sbjct: 390 GVRPRPSWILSMYGGKNIS-----SHSNIIFSNGGLDPWSAGG--VTQNITDSLVAIVIP 442
Query: 455 LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
G+H LDL DP + + R EV M+ WI +
Sbjct: 443 DGAHHLDLRSRNPCDPKSVQEARALEVHYMKQWIAK 478
>gi|300193275|pdb|3N2Z|B Chain B, The Structure Of Human Prolylcarboxypeptidase At 2.80
Angstroms Resolution
Length = 446
Score = 300 bits (769), Expect = 8e-79, Method: Compositional matrix adjust.
Identities = 166/458 (36%), Positives = 258/458 (56%), Gaps = 30/458 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + + +TF QRY++ K+W IL + G E I +GF
Sbjct: 3 YSVLYFQQKVDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGF 60
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K
Sbjct: 61 MWDVAEELKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKR 118
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+
Sbjct: 119 TIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVT 178
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DFR + C ++I +SW I+ +++ GL L+ C+PL ++ LKD + +
Sbjct: 179 TDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETW 238
Query: 288 SEAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GN 337
A D P SN P+K +C ++N PN D +L + + +A ++ Y+
Sbjct: 239 VNLAMVDYPYASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSG 294
Query: 338 NSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
+C I+E T GW +Q+C+E+V+P + + M++P WNL + +C +Q+G
Sbjct: 295 QVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWG 354
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRPSW+ T YGG +I S +NI+FSNG DP+S GG T D + ++
Sbjct: 355 VRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGG--VTKDITDTLVAVTISE 407
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
G+H LDL DP ++ R EV+ M+ WI +YD
Sbjct: 408 GAHHLDLRTKNALDPMSVLLARSLEVRHMKNWIRDFYD 445
>gi|189066514|dbj|BAG35764.1| unnamed protein product [Homo sapiens]
Length = 496
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 166/458 (36%), Positives = 257/458 (56%), Gaps = 30/458 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + + +TF QRY++ K+W IL + G E I +GF
Sbjct: 48 YSVLYFQQKVDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A KA++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K
Sbjct: 106 MWDVAEDLKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKR 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+
Sbjct: 164 TIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DFR + C ++I +SW I+ +++ GL L+ C+PL ++ LKD + +
Sbjct: 224 TDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETW 283
Query: 288 SEAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GN 337
A D P SN P+K +C ++N PN D +L + + +A ++ Y+
Sbjct: 284 VNLAMVDYPYASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSG 339
Query: 338 NSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
+C I+E T GW +Q+C+E+V+P + + M++P WNL + +C +Q+G
Sbjct: 340 QVKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWG 399
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRPSW+ T YGG +I S +NI+FSNG DP+S GG K D + ++
Sbjct: 400 VRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKD--ITDTLVAVTISE 452
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
G+H LDL DP ++ R EV+ M+ WI +YD
Sbjct: 453 GAHHLDLRTKNALDPMSVLLARSLEVRHMKNWIRDFYD 490
>gi|355752501|gb|EHH56621.1| hypothetical protein EGM_06072 [Macaca fascicularis]
Length = 516
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 167/449 (37%), Positives = 253/449 (56%), Gaps = 30/449 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + + +TF QRY++ K+W IL + G E I +GF ++ A + K
Sbjct: 77 VDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELK 134
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K A + P
Sbjct: 135 AMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQP 192
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VIAIG SYGG LA WFR+KYPH+ +G+LA+SAPI + D+ P + +V+ DFR +
Sbjct: 193 VIAIGGSYGGMLAAWFRMKYPHMAVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPH 252
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTP 296
C ++I++SW I+ ++S GL L+ C+PL ++ LKD + + A D P
Sbjct: 253 CSESIRRSWDAINRLSSTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYP 312
Query: 297 --SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCY-INE 345
SN P+K +C ++N PN D +L + + +A ++ Y+ +C I+E
Sbjct: 313 YASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSGQVKCLNISE 368
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL 404
T GW +Q+C+E+V+P + + M++P WNL + C EQ+GV PRPSW+
Sbjct: 369 TATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDECFEQWGVRPRPSWIT 428
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDE 464
T YGG +I S SNI+FSNG DP+S GG K D + ++ G+H LDL
Sbjct: 429 TMYGGKNIS-----SHSNIVFSNGELDPWSGGGVTKD--ITDTLVAVTISEGAHHLDLRA 481
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
DP ++ R EV+ M+ WI +YD
Sbjct: 482 KNALDPTSVLLARALEVRHMKNWIRDFYD 510
>gi|417401904|gb|JAA47816.1| Putative lysosomal pro-x carboxypeptidase [Desmodus rotundus]
Length = 497
Score = 298 bits (764), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 169/462 (36%), Positives = 258/462 (55%), Gaps = 31/462 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ + Q +DHF + ++ +TF QRY+I + W IL + G E I +GF
Sbjct: 48 YSIHYTEQKVDHFGF--KTDKTFKQRYLIADQHWKKDGGSILFYTGNEGDIIWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYGKS+PFG+ + K++RH + S QALAD+ ++ H+K
Sbjct: 106 MWDVAEELKAMLVFAEHRYYGKSLPFGAN--SFKDSRHLNFLTSEQALADFGELIRHLKR 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIA+G SYGG LA WFR+KYPH+V G+LA+SAPI + DI P + +V+
Sbjct: 164 TIPGAENQPVIAVGGSYGGMLAAWFRMKYPHMVAGALAASAPIWQFEDIVPCGVFMEIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTV 286
DFR + C ++I++SW I+ + GL LS+ CTPL N+ + LKD +
Sbjct: 224 TDFRKSGPNCSESIRRSWDAINRLTRTGTGLHWLSEALHLCTPLTNSQDVQHLKDWISET 283
Query: 287 YSEAAQYDTP---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-G 336
+ A D P P+K +C ++N P+ D L + + +A ++ Y+
Sbjct: 284 WVNLAMVDYPYESDFLQPLPAWPIKVVCQYLKN-PHVSDTQLLQ---NIFQALNVYYNYS 339
Query: 337 NNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQY 394
+RC ++E TG GW +Q+C+EMV+P + + M++P W+L ++ +C +Q+
Sbjct: 340 GQARCLNMSETATGSLGTLGWSYQACTEMVMPFCTNGIDDMFEPHSWDLKEFSDDCFKQW 399
Query: 395 GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
GV+PRPSW+ T YGG +I S +NIIFSNG DP+S GG K + D + +
Sbjct: 400 GVTPRPSWITTMYGGKNIS-----SHTNIIFSNGDLDPWSGGGVTK--NITDTLVAITIP 452
Query: 455 LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
G+H LDL DP L+ R EVK M+ WI +Y +
Sbjct: 453 EGAHHLDLRANNAFDPKSLLLARSLEVKYMKQWIRDFYASLR 494
>gi|355713391|gb|AES04658.1| prolylcarboxypeptidase [Mustela putorius furo]
Length = 496
Score = 298 bits (764), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 164/462 (35%), Positives = 255/462 (55%), Gaps = 31/462 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ + Q +DHF + + +TF QRY+I +W IL + G E I +GF
Sbjct: 48 YSVHYIQQKVDHFGFSAD--KTFKQRYLIADAYWKKNGGSILFYTGNEGDITWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A Q KA++V EHRYYG+S+PFG++ + +++RH + S QALAD+A ++ H+K
Sbjct: 106 MWDVADQLKAMLVFAEHRYYGESLPFGNK--SFRDSRHLNFLTSEQALADFAVLIKHLKK 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI ++G++ P + +V+
Sbjct: 164 TIPGAKNQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWHFGNLVPCGVFMEIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTV 286
KDF+ C +TI+ SW I+ + GL LS+ CT L N + LK L
Sbjct: 224 KDFKRGGPNCSETIRSSWDAINRFSRTGAGLRWLSEALDLCTALTNAQDVQHLKAWLSET 283
Query: 287 YSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSL--EYD 335
+ A D P P+K +C ++N P+ D ++ + +A ++ Y
Sbjct: 284 WINLAMVDYPYECDFLQPLPAWPIKVVCQYLKN-PSVSD---AQLLQNIFQALNVYYNYS 339
Query: 336 GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQY 394
G S I+E T + +GW +Q+C+EM++P + + M++P WNL ++ +C +Q+
Sbjct: 340 GQASCLNISETTTSNLGTQGWSYQACTEMIMPFCTNGIDDMFEPHAWNLREFSDDCFKQW 399
Query: 395 GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
V PRP+W++T YGG +I S +NIIFSNG DP+S GG K D + +
Sbjct: 400 SVRPRPAWIITMYGGKNIS-----SHTNIIFSNGELDPWSGGGVTK--DITDTLVAITIP 452
Query: 455 LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
G+H LDL DP ++ R EV+ M+ WI +Y +
Sbjct: 453 DGAHHLDLRARNAFDPTTVILARSLEVRHMKQWIRNFYASLR 494
>gi|363729398|ref|XP_425654.3| PREDICTED: lysosomal Pro-X carboxypeptidase [Gallus gallus]
Length = 482
Score = 297 bits (761), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 170/456 (37%), Positives = 256/456 (56%), Gaps = 32/456 (7%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ T + Q IDHF + + TF QRY+I + W N PIL + G E I +GF
Sbjct: 38 YLTRYLTQQIDHFGF--DENLTFQQRYLIADQHWKKDNGPILFYTGNEGDITWFCNNTGF 95
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + A++V EHRYYG+S+PFG+ + +++H Y S QALAD+A ++ ++K+
Sbjct: 96 MWDVAEELNAMLVFAEHRYYGESLPFGNE--SFSDSKHLNYLTSEQALADFAVLIEYLKE 153
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A H+PVIAIG SYGG LA WFR+KYPHVV+G+LA+SAPI +GD+ P Y ++V+
Sbjct: 154 TIAGARHSPVIAIGGSYGGMLAAWFRMKYPHVVVGALAASAPIWQFGDLVPCGAYFTIVT 213
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTV 286
DF+ + C ++I+ SW+ I++++S GL LS+ F C+PLKN + LK L
Sbjct: 214 NDFKKSGTGCSESIRNSWNAINHLSSTDVGLQWLSRTFHLCSPLKNLQDAAMLKTWLSET 273
Query: 287 YSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-G 336
+ + A D P P++ +C + + L +AA +A ++ Y+
Sbjct: 274 WIDLAMVDYPYKADFLQPLPAWPIREVCKFLRTPVS-----LTIVAAEYFQAINIYYNYT 328
Query: 337 NNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQY 394
+ C+ ++E T + GW +Q C+EMV+PM D + M++P+ W+ C +
Sbjct: 329 GEASCFDMSETATKNLGQLGWYYQVCTEMVMPMCTDGVHDMFEPQKWDFEALSDECFRLW 388
Query: 395 GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
GV PRPSW+L+ YGG +I S SNIIFSNG DP+S GG T + D + +
Sbjct: 389 GVRPRPSWILSMYGGKNIS-----SHSNIIFSNGGLDPWSAGG--VTQNVTDSLVAIVIP 441
Query: 455 LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
G+H LDL DP + Q R EV M+ WI +
Sbjct: 442 DGAHHLDLRSRNPCDPKSVQQARALEVHYMKQWIAE 477
>gi|114639539|ref|XP_001175149.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 5 [Pan
troglodytes]
gi|397502804|ref|XP_003822033.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Pan
paniscus]
Length = 517
Score = 297 bits (761), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 165/449 (36%), Positives = 255/449 (56%), Gaps = 30/449 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + + +TF QRY++ K+W IL + G E I +GF ++ A + K
Sbjct: 78 VDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELK 135
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K A + P
Sbjct: 136 AMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQP 193
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+ DFR +
Sbjct: 194 VIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPH 253
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTP 296
C ++I++SW I+ +++ GL L+ C+PL ++ LKD + + A D P
Sbjct: 254 CSESIRRSWEAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYP 313
Query: 297 --SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCY-INE 345
SN P+K +C ++N PN D +L + + +A ++ Y+ +C I+E
Sbjct: 314 YASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSGQVKCLNISE 369
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL 404
T GW +Q+C+E+V+P + + M++P WNL + +C +Q+GV PRPSW+
Sbjct: 370 TATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGVRPRPSWIT 429
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDE 464
T YGG +I S +NI+FSNG DP+S GG K D + ++ G+H LDL
Sbjct: 430 TMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKD--ITDTLVAVTISEGAHHLDLRT 482
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
DP ++ R EV+ M+ WI +YD
Sbjct: 483 KNALDPTSVLLARSLEVRHMKNWIRDFYD 511
>gi|426369950|ref|XP_004051943.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Gorilla
gorilla gorilla]
Length = 517
Score = 297 bits (761), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 165/449 (36%), Positives = 255/449 (56%), Gaps = 30/449 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + + +TF QRY++ K+W IL + G E I +GF ++ A + K
Sbjct: 78 VDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELK 135
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K A + P
Sbjct: 136 AMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQP 193
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+ DFR +
Sbjct: 194 VIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPH 253
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTP 296
C ++I++SW I+ +++ GL L+ C+PL ++ LKD + + A D P
Sbjct: 254 CSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYP 313
Query: 297 --SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCY-INE 345
SN P+K +C ++N PN D +L + + +A ++ Y+ +C I+E
Sbjct: 314 YASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSGQVKCLNISE 369
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL 404
T GW +Q+C+E+V+P + + M++P WNL + +C +Q+GV PRPSW+
Sbjct: 370 TATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGVRPRPSWIT 429
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDE 464
T YGG +I S +NI+FSNG DP+S GG K D + ++ G+H LDL
Sbjct: 430 TMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKD--ITDTLVAVTISEGAHHLDLRT 482
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
DP ++ R EV+ M+ WI +YD
Sbjct: 483 KNALDPTSVLLARSLEVRHMKNWIRDFYD 511
>gi|242096526|ref|XP_002438753.1| hypothetical protein SORBIDRAFT_10g025550 [Sorghum bicolor]
gi|241916976|gb|EER90120.1| hypothetical protein SORBIDRAFT_10g025550 [Sorghum bicolor]
Length = 569
Score = 297 bits (761), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 183/510 (35%), Positives = 262/510 (51%), Gaps = 85/510 (16%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW---GGGNS-----PILAFLGAEAPID 102
F T ++ Q +DHF++ P + F Q+Y++N FW GGN P+ + G E I+
Sbjct: 76 FTTHYFPQELDHFTFTPNASMVFRQKYLVNDTFWRRPSGGNGTAGAGPLFVYTGNEGDIE 135
Query: 103 DNIQLSGFTYENAHQFKALIVILE-----------------HRYYGKSVP---------- 135
+GF ++ A +F AL+V +E + G P
Sbjct: 136 WFATNTGFMFDIAPKFGALLVFIECLNAENMIQSKIFSFCFLKSRGPRCPEPGPRPPLAL 195
Query: 136 -----FGSRKAA-LKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGE 189
+R AA L+ G +A ALAD+A ++ +K +A APV+ G SYGG
Sbjct: 196 PLPLARQARSAAGLRGLPTAG--RAAVALADFAILITSLKQNLSAKTAPVVVFGGSYGGM 253
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
LA+WFRLKYPHV IG+LASSAPIL + ITPW+++ +S+D++ S CF IK +W
Sbjct: 254 LASWFRLKYPHVAIGALASSAPILQFDYITPWSSFSDAISQDYKSESLNCFSVIKATWDV 313
Query: 250 IDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT--VYSEAAQYDTPSNI-------P 300
+D + GL LSK F+ C LK + L T VY+ Y TP+N P
Sbjct: 314 LDERGANDRGLLELSKLFRACKTLKYADSIAGWLQTAFVYTAMVDYPTPANFLKNLPAYP 373
Query: 301 VKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEW-- 358
VK +C I+ P DIL K+ A A SL Y + TGD++ E+
Sbjct: 374 VKEMCKIIDGFP-ANADILEKVFA----AASLYY-----------NFTGDQTCNQIEYDD 417
Query: 359 -------------QSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT 405
Q+C+EM++PM SM+ P+ ++ C + YGV PRP W+ T
Sbjct: 418 DSSSSLGLSGWGWQACTEMIMPMSDSNESMFPPDTFSYEDTSNTCFQLYGVRPRPHWITT 477
Query: 406 YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEA 465
YGG+ I +L+R SNIIFSNG+RDP+SRGG +K ++ + G+H LDL A
Sbjct: 478 EYGGYKIDKVLKRFGSNIIFSNGLRDPWSRGGVLKDISSSIVALVTE--KGAHHLDLRSA 535
Query: 466 KKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
K DPDW+++QR+ EV+I+ GWI QY+ D
Sbjct: 536 TKDDPDWVIEQRRQEVEIIHGWIDQYHQDM 565
>gi|301780038|ref|XP_002925438.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Ailuropoda
melanoleuca]
Length = 520
Score = 297 bits (761), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 166/453 (36%), Positives = 254/453 (56%), Gaps = 31/453 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + + +TF QRY+I + W IL + G E I +GF ++ A Q K
Sbjct: 80 VDHFGFTAD--ETFKQRYLIADEHWKKNGGSILFYTGNEGDITWFCNNTGFMWDVADQLK 137
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A++V EHRYYG+S+PFG++ + K++RH + S QALAD+A ++ H+K A + P
Sbjct: 138 AMLVFAEHRYYGESLPFGNK--SFKDSRHLNFLTSEQALADFAVLIKHLKRTIPGAKNQP 195
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VIA+G SYGG LA WFR+KYPH+V+G+LA+SAPI ++G++ P + +V+KDF+ +
Sbjct: 196 VIAVGGSYGGMLAAWFRMKYPHMVVGALAASAPIWHFGNLVPCGVFMEIVTKDFKRSGPN 255
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTVYSEAAQYDT 295
C +TI+ SW I+ A GL LS+ CT L NT + LK + + A D
Sbjct: 256 CSETIRSSWDAINRFARTGAGLRWLSEALGLCTALTNTEDVQRLKAWISETWINLAMVDY 315
Query: 296 P---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSL--EYDGNNSRCYIN 344
P P+K +C ++N P+ D ++ + +A ++ Y G S I+
Sbjct: 316 PYESDFLQPLPAWPIKVVCQYLKN-PSVSD---AQLLQNIFQALNVYYNYSGQASCLNIS 371
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
E T +GW +Q+C+EMV+P + + M++P WNL ++ +C +Q+GV PRP+W+
Sbjct: 372 ETTTSSLGTQGWSYQACTEMVMPFCTNGIDDMFEPHLWNLGEFSDDCFKQWGVRPRPAWI 431
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
+T YGG +I S +NIIFSNG DP+S GG T D + + G+H LDL
Sbjct: 432 ITMYGGKNIS-----SHTNIIFSNGELDPWSGGG--VTKDITDTLVAITIPEGAHHLDLR 484
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
DP ++ R EV+ M+ WI +Y +
Sbjct: 485 ARNAFDPTTVLLARSLEVRHMKQWIRDFYASLR 517
>gi|431838478|gb|ELK00410.1| Lysosomal Pro-X carboxypeptidase [Pteropus alecto]
Length = 497
Score = 297 bits (760), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 172/462 (37%), Positives = 258/462 (55%), Gaps = 31/462 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ + Q +DHF + + +TF QRY+I K W IL + G E I +GF
Sbjct: 48 YSIHYIQQKVDHFGFNTD--KTFNQRYLIADKHWKKDGGSILFYTGNEGDIIWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYGKS+PFGS ++ K++RH + S QALAD+A ++ H++
Sbjct: 106 MWDVAEELKAMLVFAEHRYYGKSLPFGS--SSFKDSRHLNFLTSEQALADFAELITHLRK 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIA+G SYGG LA WFR+KYPH+V G+LA+SAPI + +I P + +V+
Sbjct: 164 TIPGAKNQPVIALGGSYGGMLAAWFRMKYPHMVAGALAASAPIWQFDNIVPCGVFMEIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTV 286
D+R + C + I++SW I+ IA GL LS+ CTPLK+T + LKD +
Sbjct: 224 TDYRKSGPNCAECIRRSWDAINRIAENGTGLPWLSETLHLCTPLKHTQDIQNLKDWISET 283
Query: 287 YSEAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-G 336
+ A D P SN PVK +C +++ P+ D L + + +A ++ Y+
Sbjct: 284 WINLAVVDYPYESNFLQPLPAWPVKVVCQYLKD-PSVSDPQLLQ---NIFQALNIYYNYT 339
Query: 337 NNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQY 394
+RC ++E T GW +Q+C+EMV+P + + M++P W+L ++ C Q+
Sbjct: 340 GQARCLNMSETSTSSLGSLGWSYQACTEMVMPFCTNGIDDMFEPHSWDLEEFSDYCFRQW 399
Query: 395 GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
GV PRPSW+ T YGG +I S +NIIFSNG DP+S GG K D + +
Sbjct: 400 GVRPRPSWITTLYGGKNIS-----SHTNIIFSNGDLDPWSGGGVTK--DITDTLVAITIP 452
Query: 455 LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
G+H LDL DP +V R EV+ M+ WI +Y + +
Sbjct: 453 EGAHHLDLRANNAFDPTTVVLARSMEVRYMKQWIKDFYANLR 494
>gi|395814759|ref|XP_003780909.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Otolemur
garnettii]
Length = 495
Score = 296 bits (759), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 171/484 (35%), Positives = 270/484 (55%), Gaps = 36/484 (7%)
Query: 29 LRPRLGRIRRSRILEQKDSNHGFET-----FFYNQTIDHFSYGPESYQTFPQRYVINSKF 83
LRP L + S L+ + H F T ++ Q IDHF + S +TF QRY+I +++
Sbjct: 22 LRPALMTLG-SLHLQTNPTAHPFVTKDYSVLYFQQKIDHFGF--NSVKTFNQRYLIANEY 78
Query: 84 WGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAAL 143
W IL + G E I +GF ++ A + KA++V EHRYYG+S+PFG K +
Sbjct: 79 WKKDGGSILFYTGNEGDIVWFCNNTGFMWDVAQELKAMLVFAEHRYYGESLPFG--KNSF 136
Query: 144 KNARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVV 202
K++RH Y S QALAD+A ++ H+++ + PVIAIG SYGG LA WFR+KYPH+V
Sbjct: 137 KDSRHLNYLTSEQALADFAELIRHLQETIPGVKNQPVIAIGGSYGGMLAAWFRMKYPHMV 196
Query: 203 IGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAI 262
+G+LA+SAPI + D+ P + +V+ DF+ + C +TI+KSW+ I+ +++ GL
Sbjct: 197 VGALAASAPIWQFEDLVPCGVFMKIVTSDFKRSGPYCSETIRKSWNVINQLSTTSRGLQY 256
Query: 263 LSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTP---------SNIPVKRICNAIENA 311
L++ CTPL ++ LKD + + A D P P+K +C ++N
Sbjct: 257 LTEVLHLCTPLTPRDVQHLKDWIAETWVNLAMVDYPYPSSFLQPLPAWPIKVVCQYLKN- 315
Query: 312 PNCGDDILCKIAAGVVEADSLEYD-GNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMG 369
PN D ++ + + +A ++ Y+ ++C ++ T + GW +Q+C+E+V+P
Sbjct: 316 PNVSDSLMIQ---NIFQALNIYYNYSGEAQCLNVSTTTTSNLGSLGWSYQACTEIVMPFC 372
Query: 370 KDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNG 428
+ + M++P WNL ++ C +Q+GV P PSW+ T YGG +I S +NI+FSNG
Sbjct: 373 TNGIDDMFEPHAWNLKEFSDECFKQWGVRPSPSWITTMYGGKNIN-----SHTNIVFSNG 427
Query: 429 MRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
DP+S GG T D + ++ G+H LDL + DP + R E K ++ WI
Sbjct: 428 ELDPWSGGG--VTEDLTDTLVAFTISEGAHHLDLRASNALDPISVQLARTLETKHVKNWI 485
Query: 489 TQYY 492
+Y
Sbjct: 486 RDFY 489
>gi|281339633|gb|EFB15217.1| hypothetical protein PANDA_014943 [Ailuropoda melanoleuca]
Length = 441
Score = 296 bits (759), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 166/453 (36%), Positives = 254/453 (56%), Gaps = 31/453 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + + +TF QRY+I + W IL + G E I +GF ++ A Q K
Sbjct: 1 VDHFGFTAD--ETFKQRYLIADEHWKKNGGSILFYTGNEGDITWFCNNTGFMWDVADQLK 58
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A++V EHRYYG+S+PFG++ + K++RH + S QALAD+A ++ H+K A + P
Sbjct: 59 AMLVFAEHRYYGESLPFGNK--SFKDSRHLNFLTSEQALADFAVLIKHLKRTIPGAKNQP 116
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VIA+G SYGG LA WFR+KYPH+V+G+LA+SAPI ++G++ P + +V+KDF+ +
Sbjct: 117 VIAVGGSYGGMLAAWFRMKYPHMVVGALAASAPIWHFGNLVPCGVFMEIVTKDFKRSGPN 176
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTVYSEAAQYDT 295
C +TI+ SW I+ A GL LS+ CT L NT + LK + + A D
Sbjct: 177 CSETIRSSWDAINRFARTGAGLRWLSEALGLCTALTNTEDVQRLKAWISETWINLAMVDY 236
Query: 296 P---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSL--EYDGNNSRCYIN 344
P P+K +C ++N P+ D ++ + +A ++ Y G S I+
Sbjct: 237 PYESDFLQPLPAWPIKVVCQYLKN-PSVSD---AQLLQNIFQALNVYYNYSGQASCLNIS 292
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
E T +GW +Q+C+EMV+P + + M++P WNL ++ +C +Q+GV PRP+W+
Sbjct: 293 ETTTSSLGTQGWSYQACTEMVMPFCTNGIDDMFEPHLWNLGEFSDDCFKQWGVRPRPAWI 352
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
+T YGG +I S +NIIFSNG DP+S GG T D + + G+H LDL
Sbjct: 353 ITMYGGKNIS-----SHTNIIFSNGELDPWSGGG--VTKDITDTLVAITIPEGAHHLDLR 405
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
DP ++ R EV+ M+ WI +Y +
Sbjct: 406 ARNAFDPTTVLLARSLEVRHMKQWIRDFYASLR 438
>gi|118404640|ref|NP_001072639.1| prolylcarboxypeptidase precursor [Xenopus (Silurana) tropicalis]
gi|115313632|gb|AAI23913.1| prolylcarboxypeptidase (angiotensinase C) [Xenopus (Silurana)
tropicalis]
Length = 499
Score = 296 bits (758), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 182/499 (36%), Positives = 267/499 (53%), Gaps = 35/499 (7%)
Query: 13 LMVIFVSTSFHANGLKLRPRLGRIRRSRIL----EQKDSNHG--FETFFYNQTIDHFSYG 66
+M +F A L L +R SR + SN G +ET+++ Q +DHF +
Sbjct: 9 VMASLPGCAFLAWNLLLLSAAHSVRLSRQFGSAQQYGSSNPGPSYETYYFTQQVDHFGFY 68
Query: 67 PESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILE 126
++ TF QRY+++ +W PIL + G E I +GF ++ A + A++V E
Sbjct: 69 EDA--TFKQRYLVSDTYWRKPGGPILFYTGNEGDITLFCNNTGFMWDVAEEMGAMLVFAE 126
Query: 127 HRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHAPVIAIGAS 185
HRYYG+S+PFG A + +H Y S QALAD+A +L + K A ++PVIA+G S
Sbjct: 127 HRYYGESMPFG--DLAFSDPKHLNYLTSEQALADFAVLLRYFKATTEGAQNSPVIALGGS 184
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
YGG LA WFR+KYP VV+G++ASSAPI + D+ P Y+ VV+ DF+ + C ++++
Sbjct: 185 YGGMLAAWFRMKYPDVVVGAIASSAPIWQFEDLVPCNEYYQVVTNDFKKSGPGCSESVRN 244
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTEL---KDGLDTVYSEAAQYDTP------ 296
SW+ I+ +A DGL LS F C PLK+ ++ K L + A D P
Sbjct: 245 SWAAINRMAETTDGLQWLSNAFHLCCPLKSKEDVAVFKGWLSETWVSLAMVDYPYPASFL 304
Query: 297 ---SNIPVKRICNAIENAPNCGD-DILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDES 352
P++ +C +++ P GD D+L I V Y G+ ++ +G
Sbjct: 305 EPLPAWPIQVVCKFLKD-PKSGDKDLLQDIFQAV--NVYYNYTGDTPCLNTSQTASGSLG 361
Query: 353 DEGWEWQSCSEMVVPMGKDKNS-MYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHD 411
D GW +Q+C+EMV+P D S M++P+ W+L + C +Q+GV PR SWV T YGG +
Sbjct: 362 DLGWSYQACTEMVMPFCSDGVSDMFEPQSWDLQAFSDECYKQWGVRPRGSWVTTVYGGKN 421
Query: 412 IKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPD 471
I S SNIIFSNG DP+S GG VK D + + G+H LDL DP
Sbjct: 422 IS-----SHSNIIFSNGGLDPWSAGG-VKE-SLSDSLLAVLIPEGAHHLDLRSNNADDPK 474
Query: 472 WLVQQRKTEVKIMQGWITQ 490
++ R EV+ M+ WI Q
Sbjct: 475 SVLLARSLEVEYMKKWIAQ 493
>gi|117306169|ref|NP_955450.2| lysosomal Pro-X carboxypeptidase isoform 2 preproprotein [Homo
sapiens]
gi|119595481|gb|EAW75075.1| prolylcarboxypeptidase (angiotensinase C), isoform CRA_b [Homo
sapiens]
Length = 517
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 165/449 (36%), Positives = 254/449 (56%), Gaps = 30/449 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + + +TF QRY++ K+W IL + G E I +GF ++ A + K
Sbjct: 78 VDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELK 135
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K A + P
Sbjct: 136 AMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQP 193
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+ DFR +
Sbjct: 194 VIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPH 253
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTP 296
C ++I +SW I+ +++ GL L+ C+PL ++ LKD + + A D P
Sbjct: 254 CSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYP 313
Query: 297 --SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCY-INE 345
SN P+K +C ++N PN D +L + + +A ++ Y+ +C I+E
Sbjct: 314 YASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSGQVKCLNISE 369
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL 404
T GW +Q+C+E+V+P + + M++P WNL + +C +Q+GV PRPSW+
Sbjct: 370 TATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGVRPRPSWIT 429
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDE 464
T YGG +I S +NI+FSNG DP+S GG K D + ++ G+H LDL
Sbjct: 430 TMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKD--ITDTLVAVTISEGAHHLDLRT 482
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
DP ++ R EV+ M+ WI +YD
Sbjct: 483 KNALDPMSVLLARSLEVRHMKNWIRDFYD 511
>gi|402894800|ref|XP_003910533.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Papio
anubis]
Length = 516
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 165/449 (36%), Positives = 253/449 (56%), Gaps = 30/449 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + + +TF QRY++ K+W IL + G E I +GF ++ A + K
Sbjct: 77 VDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELK 134
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K A + P
Sbjct: 135 AMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQP 192
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VIAIG SYGG LA WFR+KYPH+ +G+LA+SAPI + D+ P + +V+ DFR +
Sbjct: 193 VIAIGGSYGGMLAAWFRMKYPHMAVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPY 252
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTP 296
C ++I++SW I+ +++ GL L+ C+PL ++ LKD + + A D P
Sbjct: 253 CSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYP 312
Query: 297 --SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCY-INE 345
SN P+K +C ++N PN D +L + + +A ++ Y+ +C I+E
Sbjct: 313 YASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSGQVKCLNISE 368
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL 404
T GW +Q+C+E+V+P + + M++P WNL + C EQ+GV PRPSW+
Sbjct: 369 TATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDECFEQWGVRPRPSWIT 428
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDE 464
T YGG +I S +NI+FSNG DP+S GG K D + ++ G+H LDL
Sbjct: 429 TMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKD--ITDTLVAVTISEGAHHLDLRA 481
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
DP ++ R EV+ M+ WI +YD
Sbjct: 482 KNALDPTSVLLARALEVRHMKNWIRDFYD 510
>gi|410972567|ref|XP_003992730.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Felis catus]
Length = 497
Score = 295 bits (756), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 166/458 (36%), Positives = 259/458 (56%), Gaps = 31/458 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ + Q +DHF + + +TF QRY++ + W + IL + G E I +GF
Sbjct: 48 YTVHYLQQKVDHFGFTTD--KTFKQRYLLADEHWKKDDGSILFYTGNEGDIVWFCNNTGF 105
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG+ + K++R+ Y S QALAD+A ++ ++K
Sbjct: 106 MWDVAEELKAMLVFAEHRYYGESLPFGND--SFKDSRYLNYLTSEQALADFAVLIKYLKR 163
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + PVIA+G SYGG LA WFR+KYPH+V+G+LA+SAPI ++G++ P + +V+
Sbjct: 164 TIPGAKNQPVIALGGSYGGMLAAWFRMKYPHMVVGALAASAPIWHFGNLVPCGVFMKIVT 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTV 286
+DFR + C +TI SW I +A GL LS+ CTPL+N+ + LK +
Sbjct: 224 EDFRKSGPNCSETIHNSWGAITRLARTGSGLHWLSEALHLCTPLRNSQDVQHLKGWISET 283
Query: 287 YSEAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-G 336
+ A D P SN P+K +C + N P+ D ++ + +A ++ Y+
Sbjct: 284 WINLAMVDYPYESNFLQPLPAWPIKVVCKYLNN-PSLSD---AQLLENIFQALNIYYNYS 339
Query: 337 NNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQY 394
+RC I+E T + +GW +Q+C+EMV+P + + M++P WN +Y +C +Q+
Sbjct: 340 GQARCLNISETTTSNLGTQGWSYQACTEMVMPFCTNGIDDMFEPRSWNFREYSDDCFKQW 399
Query: 395 GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
GV PRP+W+ T YGG +I S SNI+FSNG DP+S GG K D + +
Sbjct: 400 GVRPRPTWITTVYGGRNIS-----SHSNIVFSNGELDPWSGGGVTKD--ITDTLVAITIP 452
Query: 455 LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
G+H LDL DP ++ R EV+ M+ WI +Y
Sbjct: 453 EGAHHLDLRARNAFDPTTVLLARSLEVRHMKQWIRDFY 490
>gi|354489833|ref|XP_003507065.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cricetulus
griseus]
gi|344240433|gb|EGV96536.1| Lysosomal Pro-X carboxypeptidase [Cricetulus griseus]
Length = 492
Score = 295 bits (756), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 168/463 (36%), Positives = 262/463 (56%), Gaps = 30/463 (6%)
Query: 49 HGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLS 108
+ + ++ Q +DHF + + +TF QRY+I K W IL + G E I +
Sbjct: 44 NNYSVLYFQQKVDHFGF--YNSRTFKQRYLIADKHWKANGGTILFYTGNEGDIVWFCNNT 101
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
GF ++ A + KA++V EHRYYG+S+PFG + K+A+H + S QALAD+A ++ H+
Sbjct: 102 GFMWDVAKELKAMLVFAEHRYYGESLPFGEE--SFKDAQHLDFLTSEQALADFAELIKHL 159
Query: 169 KDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
K A + PVIAIG SYGG LA WFR+KYPH+V+G+LASSAPI + D+ P + +
Sbjct: 160 KRTVPGAKNQPVIAIGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFDDLVPCGVFMKI 219
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDT 285
V++DFR +S +C ++I++SW I+ ++ L L+K F C+PL ++ T LK+ L
Sbjct: 220 VTEDFRKSSPQCSESIRRSWKAINRHSTTGSDLQWLTKTFHLCSPLNFEDITTLKEWLSE 279
Query: 286 VYSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD- 335
+ A D P PV +C ++N PN D +L + + +A ++ Y+
Sbjct: 280 TWVNLAMVDYPYEASFLQPLPAWPVTVVCQYLKN-PNVSDTVLLQ---NIFQALNIYYNY 335
Query: 336 GNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQ 393
++C I++ T GW +QSC+E+V+P+ + + M++ W+ K+ +C Q
Sbjct: 336 SGQAQCLNISQTATSSLGATGWGYQSCTELVMPICSNGVDDMFESYIWDFDKFSNDCFNQ 395
Query: 394 YGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL 453
+GV PR +W++T YGG +I S SNIIFSNG DP+S GG K D + ++
Sbjct: 396 WGVKPRLNWIITLYGGKNIS-----SHSNIIFSNGDLDPWSGGGVTK--DITDTLVAINI 448
Query: 454 NLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
G+H LDL + DP ++ R EVK M+ WI +Y + K
Sbjct: 449 PNGAHHLDLRASTAFDPSTVLLSRSLEVKHMKKWIADFYKNAK 491
>gi|338726996|ref|XP_001917253.2| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X carboxypeptidase
[Equus caballus]
Length = 519
Score = 295 bits (755), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 168/455 (36%), Positives = 256/455 (56%), Gaps = 31/455 (6%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+ +DHF + + +TF QRY+I K W IL + G E I +GF ++ A +
Sbjct: 77 RXVDHFGFNTD--KTFKQRYLIADKHWKKDGGSILFYTGNEGDITWFCNNTGFMWDMAEE 134
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATH 176
KA++V EHRYYG+S+PFG+ + K++ H + S QALAD+A ++ H+K A +
Sbjct: 135 LKAMLVFAEHRYYGESLPFGAN--SFKDSTHLNFLTSEQALADFAKLIKHLKRTVPGAKN 192
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
PVIA+G SYGG LA WFR+KYPH+V+G+LASSAPI + D+ P + +V+ DFR +
Sbjct: 193 QPVIALGGSYGGMLAAWFRMKYPHMVVGALASSAPIWQFEDLVPCGVFMEIVTADFRKSG 252
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTVYSEAAQY 293
C ++I++SW I + S GL LS+ + C+PL N+ + LKD + + A
Sbjct: 253 PNCSESIRRSWDAIRRLTSTGPGLHWLSEALRLCSPLTNSQDVEHLKDWITETWVNLAMV 312
Query: 294 DTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN 344
D P SN P+K +C ++N PN D L + + +A ++ Y+ ++ +N
Sbjct: 313 DYPYESNFLQPLPAWPIKVVCQYLKN-PNVSDPQLLE---NIFQALNVYYNYSDQVKCLN 368
Query: 345 EDRTGDES--DEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPS 401
T S GW +Q+C+EMV+P + + M++P WNL ++ +C +Q+GV PRPS
Sbjct: 369 MSETATRSLGSLGWSYQACTEMVMPFCTNGIDDMFEPHSWNLKEFSDDCFKQWGVRPRPS 428
Query: 402 WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLD 461
W+ T YGG +I S +NIIFSNG DP+S GG T + D + + G+H LD
Sbjct: 429 WITTMYGGKNIS-----SHTNIIFSNGDLDPWSGGG--VTENVTDTLVAITIPEGAHHLD 481
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
L DP ++ R EV+ M+ WI +Y +
Sbjct: 482 LRANNAFDPTSVLLARSLEVRYMKQWIKNFYASLR 516
>gi|349604443|gb|AEP99992.1| Lysosomal Pro-X carboxypeptidase-like protein, partial [Equus
caballus]
Length = 441
Score = 294 bits (753), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 168/453 (37%), Positives = 254/453 (56%), Gaps = 31/453 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + + +TF QRY+I K W IL + G E I +GF ++ A + K
Sbjct: 1 VDHFGFNTD--KTFKQRYLIADKHWKKDGGSILFYTGNEGDITWFCNNTGFMWDMAEELK 58
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A++V EHRYYG+S+PFG+ + K++ H + S QALAD+A ++ H+K A + P
Sbjct: 59 AMLVFAEHRYYGESLPFGAN--SFKDSTHLNFLTSEQALADFAKLIKHLKRTVPGAKNQP 116
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VIA+G SYGG LA WFR+KYPH+V+G+LASSAPI + D+ P + +V+ DFR +
Sbjct: 117 VIALGGSYGGMLAAWFRMKYPHMVVGALASSAPIWQFEDLVPCGVFMEIVTADFRKSGPN 176
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTVYSEAAQYDT 295
C ++I++SW I + S GL LS+ + C+PL N+ + LKD + + A D
Sbjct: 177 CSESIRRSWDAIRRLTSTGPGLHWLSEALRLCSPLTNSQDVEHLKDWITETWVNLAMVDY 236
Query: 296 P--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINED 346
P SN P+K +C ++N PN D L + + +A ++ Y+ + +N
Sbjct: 237 PYESNFLQPLPAWPIKVVCQYLKN-PNVSDPQLLE---NIFQALNVYYNYSGQVKCLNMS 292
Query: 347 RTGDES--DEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
T S GW +Q+C+EMV+P + + M++P WNL ++ +C +Q+GV PRPSW+
Sbjct: 293 ETATRSLGSLGWSYQACTEMVMPFCTNGIDDMFEPHSWNLKEFSDDCFKQWGVRPRPSWI 352
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
T YGG +I S +NIIFSNG DP+S GG T + D + + G+H LDL
Sbjct: 353 TTMYGGKNIS-----SHTNIIFSNGDLDPWSGGG--VTENVTDTLVAITIPEGAHHLDLR 405
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
DP ++ R EV+ M+ WI +Y +
Sbjct: 406 ANNAFDPTSVLLARSLEVRYMKQWIKNFYASLR 438
>gi|73988045|ref|XP_533994.2| PREDICTED: lysosomal Pro-X carboxypeptidase [Canis lupus
familiaris]
Length = 497
Score = 294 bits (752), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 165/455 (36%), Positives = 253/455 (55%), Gaps = 31/455 (6%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF + + +TF QRY+I + W IL + G E I +GF ++ A +
Sbjct: 55 QKVDHFGFAVD--KTFKQRYLIADEHWKKDGGSILFYTGNEGDIIWFCNNTGFMWDVAEE 112
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATH 176
KA++V EHRYYG+S+PFG+ + K++RH Y S QALAD+A ++ H+K A +
Sbjct: 113 MKAMLVFAEHRYYGESLPFGNN--SFKDSRHLNYLTSEQALADFAMLIKHLKRTIPGAKN 170
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI +G++ + +V+ DF+ +
Sbjct: 171 QPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFGNLVSCGVFMEIVTTDFKKSG 230
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTVYSEAAQY 293
C ++I++SW I+ + GL LS+ CTPL NT + LK + + A
Sbjct: 231 PNCSESIRRSWDAINRYSRTGAGLGWLSEALHLCTPLTNTQDVQHLKIWISETWINMAMV 290
Query: 294 DTPSN---------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSL--EYDGNNSRCY 342
D P P+K +C + N PN D ++ + +A ++ Y G S
Sbjct: 291 DYPYESDFLQPLPPWPIKVVCQYLRN-PNVPD---AQLLQNIFQALNVYYNYSGQASCLN 346
Query: 343 INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPS 401
++E T +GW +Q+C+EMV+P + + M++P WNL ++ +C +Q+GV PRP+
Sbjct: 347 VSETTTSSLGTQGWSYQACTEMVMPFCTNGIDDMFEPHSWNLREFSDDCFKQWGVRPRPA 406
Query: 402 WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLD 461
W++T YGG +I + +NIIFSNG DP+S GG K D + + G+H LD
Sbjct: 407 WIITTYGGKNIS-----AHTNIIFSNGELDPWSGGGVTKD--ITDTLVAITIPEGAHHLD 459
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
L DP ++ R EV+ M+ WI YY + +
Sbjct: 460 LRARNAFDPTAVLLARSLEVRHMKQWIKDYYANLR 494
>gi|426251523|ref|XP_004019471.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Ovis aries]
Length = 497
Score = 293 bits (751), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 168/461 (36%), Positives = 259/461 (56%), Gaps = 31/461 (6%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +DHF + + +TF QRY+I +W IL + G E I +GF ++
Sbjct: 52 YIQQKVDHFGFNID--RTFKQRYLIADNYWKEDGGSILFYTGNEGDIIWFCNNTGFMWDI 109
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-N 173
A + KA++V EHRYYG+S+PFG+ + ++RH + + QALAD+A ++ ++K
Sbjct: 110 AEEMKAMLVFAEHRYYGESLPFGAD--SFSDSRHLNFLTTEQALADFAKLIKYLKRTIPG 167
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
A + PVIA+G SYGG LA WFR+KYPH+V+G+LASSAPI + D+ P + +V+ DF
Sbjct: 168 ARNQPVIALGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFNDLVPCDVFMKIVTTDFS 227
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP---LKNTTELKDGLDTVYSEA 290
+ C ++I++SW I+ +A K GL LS+ CTP L++ LKD + +
Sbjct: 228 QSGPNCSESIRRSWDAINRLAKKGPGLRWLSEALHLCTPLTKLQDVQHLKDWIAETWVNL 287
Query: 291 AQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSR 340
A D P SN PVK +C + + N D +L + + +A ++ Y+ ++
Sbjct: 288 AMVDYPYESNFLQPLPAWPVKVVCQYFKYS-NVPDTLLLR---NIFQALNVYYNYSGQAK 343
Query: 341 CY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSP 398
C ++E T +GW +Q+C+EMV+P D + M++P WN+ +Y +C +Q+GV P
Sbjct: 344 CLNVSETATSSLGVQGWSYQACTEMVMPTCSDGVDDMFEPHSWNMKEYSDDCFKQWGVRP 403
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
RPSW+ T YGG +I S +NIIFSNG DP+S GG K D + + G+H
Sbjct: 404 RPSWIPTMYGGKNIS-----SHTNIIFSNGELDPWSGGGVTK--DITDTLLAIVIPNGAH 456
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAIN 499
LDL + DP + R EVK M+ W+T +Y + +N
Sbjct: 457 HLDLRASNALDPVSVQLTRSLEVKYMKQWVTDFYVRLRKMN 497
>gi|348525386|ref|XP_003450203.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Oreochromis
niloticus]
Length = 502
Score = 293 bits (750), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 177/493 (35%), Positives = 262/493 (53%), Gaps = 35/493 (7%)
Query: 12 LLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQ 71
+L + FV A +L RLG + S + ++TF+++Q IDHF + +
Sbjct: 16 ILTLCFVCLHVTALKSQLFTRLGGLPYS-----TEPPVSYKTFYFDQKIDHFGFLEDG-- 68
Query: 72 TFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYG 131
TF QRY+++ K+W PIL + G E I +GF +E A + A++V EHRYYG
Sbjct: 69 TFKQRYLLSDKYWQQPGGPILFYTGNEGDITWFCNNTGFMWEIAEELDAMLVFAEHRYYG 128
Query: 132 KSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGEL 190
+S+PFG + + ++++H Y S QALAD+A ++ ++K A H+PVIA+G SYGG L
Sbjct: 129 ESLPFG--QDSYRDSKHLNYLTSEQALADFAVLIQNLKGTLPGAQHSPVIAVGGSYGGML 186
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI 250
+ WFR+KYPHVV+G+LASSAPI + + P ++ V++DF + C I+KSW +
Sbjct: 187 SAWFRMKYPHVVVGALASSAPIWQFPGMVPCGDFYKTVTQDFAKSGINCDANIRKSWKAV 246
Query: 251 DNIASKPDGLAILSKKFKTCTPLKNTTEL---KDGLDTVYSEAAQYDTPSNI-------- 299
+N++S GL LS++F C PLKN ++ K L + A D P
Sbjct: 247 NNVSSSASGLQWLSEEFSLCAPLKNKNDVLTFKSWLQETWVNLAMVDYPYEANFLQPLPR 306
Query: 300 -PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDE--GW 356
P++ +C + D L + GV +A + Y+ S +N +T S GW
Sbjct: 307 WPIQVVCKYLSFDSTVSDYQLLQ---GVAQAAKVYYNYTGSSPCLNTSQTATSSLGYLGW 363
Query: 357 EWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLI 415
+Q+C+EMV+PM D M++PE WN + CK +G+ PR W T YGG +I
Sbjct: 364 FYQACTEMVMPMCTDGVQDMFEPEEWNFQAFSDECKAMFGIRPRADWAGTVYGGKEIS-- 421
Query: 416 LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQ 475
S SNIIFSNG DP+S GG T + S + G+H LDL + DP +
Sbjct: 422 ---SHSNIIFSNGGLDPWSSGG--VTSNISHSLVSIMIPDGAHHLDLRYSNDLDPPSVRA 476
Query: 476 QRKTEVKIMQGWI 488
R EVK + WI
Sbjct: 477 ARDLEVKYFREWI 489
>gi|344293754|ref|XP_003418585.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Loxodonta africana]
Length = 503
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 163/477 (34%), Positives = 266/477 (55%), Gaps = 32/477 (6%)
Query: 33 LGRIRR-SRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPI 91
LGR+ +R + + ++ Q +DHF + + +TF QRY+I K+W I
Sbjct: 31 LGRLHLPTRPTSRPSVARNYSILYFKQKVDHFGF--DINKTFKQRYLIADKYWKKDGGSI 88
Query: 92 LAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY 151
L + G E I +GF ++ A + KA++V EHRYYG+S+PFG+ + K++RH +
Sbjct: 89 LFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGAN--SFKDSRHLNF 146
Query: 152 FNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
S QALAD+A ++ H+K A + PVIA+G SYGG LA WFR+KYPH+V+G+LA+SA
Sbjct: 147 LTSEQALADFAKLIKHLKQTIPGAENQPVIAVGGSYGGMLAAWFRMKYPHLVVGALAASA 206
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
PI + D+ P + +V++DF+ + C ++I++SW+ I+ +A+ GL LS+ C
Sbjct: 207 PIWQFEDLVPCGIFMEIVTRDFKKSGPNCSESIRRSWNVINRLATTSSGLQWLSEALHLC 266
Query: 271 TPLKNTTE---LKDGLDTVYSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDI 318
+PL + + LKD + + A D P P+K +C ++ P D +
Sbjct: 267 SPLTDFQDFRMLKDWISETWVNLAMVDYPYECNFLQPLPAWPIKVVCQYFKD-PKVSDQL 325
Query: 319 LCKIAAGVVEADSLEYD-GNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSM 375
L + + +A ++ Y+ +C I+E T GW +Q+C+E+++P + + M
Sbjct: 326 LVQ---NLFQALNVYYNYSGQVKCLNISETATSSLGSLGWSYQACTEIIMPFCTNGVDDM 382
Query: 376 YQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSR 435
++P W+L + +C +++GV PRP W+ T YGG +I S NIIFSNG DP+S
Sbjct: 383 FEPRSWDLEELSDDCFKEWGVRPRPFWITTLYGGKNIS-----SHRNIIFSNGELDPWSG 437
Query: 436 GGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
GG K + D + ++ G+H LDL + DP ++ R EV+ M+ WI+ +Y
Sbjct: 438 GGVTK--NITDTLVAINIPDGAHHLDLRASNALDPKTVLLARSLEVRYMKQWISDFY 492
>gi|348565561|ref|XP_003468571.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cavia porcellus]
Length = 494
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 173/486 (35%), Positives = 259/486 (53%), Gaps = 33/486 (6%)
Query: 25 NGLKLRP---RLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINS 81
N + LRP LG + Q + ++ Q +DHF + QTF QRY+I++
Sbjct: 17 NAVALRPVLKSLGDLHLISSTSQSAIVQNYSVLYFQQKVDHFGF--NYLQTFKQRYLISA 74
Query: 82 KFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKA 141
K W IL + G E I +GF ++ A + KA++V EHRYYG+S+PFG+
Sbjct: 75 KHWEKDGGSILFYTGNEGDIVWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGAD-- 132
Query: 142 ALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYPH 200
+ K+++ + S QALAD+A ++ H+K A PVIAIG SYGG LA WFR+KYPH
Sbjct: 133 SFKDSKRLNFLTSEQALADFAELIRHLKTTIPGAGDQPVIAIGGSYGGMLAAWFRMKYPH 192
Query: 201 VVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGL 260
+V+G+LA+SAPI + ++ P T+ +V+ DFR + C ++I++SW I+ +A+ GL
Sbjct: 193 MVVGALAASAPIWQFENLVPCDTFMKIVTTDFRKSDPNCPKSIRRSWDAINRLATTRSGL 252
Query: 261 AILSKKFKTCTPLK--NTTELKDGLDTVYSEAAQYDTP--SNI-------PVKRICNAIE 309
L+K + C PL + K + + A D P SN P+K +C +
Sbjct: 253 QWLTKTLRLCNPLTLDDVQHFKQWILETWVNLAMTDYPYESNFMQPLPAWPIKVVCQYLT 312
Query: 310 NAPNCGDDILCKIAAGVVEADSL--EYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVP 367
+ PN D L + + +A ++ Y G I+E + GW +QSC+EMV+P
Sbjct: 313 D-PNVSDSQLLQ---NIFQALNVYYNYSGQAECLNISETASSSLGISGWSYQSCTEMVMP 368
Query: 368 MGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFS 426
+ M++P WNL ++ C Q+GV P PSW+ T YGG +I S SNIIFS
Sbjct: 369 FCANGVEDMFEPFSWNLKQFSDECFNQWGVRPNPSWITTLYGGKNIS-----SHSNIIFS 423
Query: 427 NGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQG 486
NG DP+S GG K D + ++ G+H LDL DP ++ R EV+ M+
Sbjct: 424 NGDLDPWSGGGVTKDIS--DTLVAINIAEGAHHLDLRAYSAYDPASVLLARSLEVEYMKK 481
Query: 487 WITQYY 492
WIT ++
Sbjct: 482 WITDFH 487
>gi|193787211|dbj|BAG52417.1| unnamed protein product [Homo sapiens]
Length = 517
Score = 292 bits (747), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 164/449 (36%), Positives = 252/449 (56%), Gaps = 30/449 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + + +TF QRY++ K+W IL + G E I + F ++ A + K
Sbjct: 78 VDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTEFMWDVAEELK 135
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K A + P
Sbjct: 136 AMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQP 193
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+ DFR +
Sbjct: 194 VIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPH 253
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTP 296
C ++I +SW I+ + + GL L+ C+PL ++ LKD + + A D P
Sbjct: 254 CSESIHRSWDAINRLLNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYP 313
Query: 297 --SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCY-INE 345
SN P+K +C ++N PN D +L + + +A ++ Y+ +C I+E
Sbjct: 314 YASNFLQPLPARPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSGQVKCLNISE 369
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL 404
T GW +Q+C+E+V+P + + M++P WNL + +C +Q+GV PRPSW+
Sbjct: 370 TATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGVRPRPSWIT 429
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDE 464
T YGG +I S +NI+FSNG DP+S GG K D + ++ G+H LDL
Sbjct: 430 TMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKD--ITDTLVAVTISEGAHHLDLRT 482
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
DP ++ R EV+ M+ WI +YD
Sbjct: 483 KNALDPMSVLLARSLEVRHMKNWIRDFYD 511
>gi|296471920|tpg|DAA14035.1| TPA: prolylcarboxypeptidase precursor [Bos taurus]
Length = 499
Score = 291 bits (746), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 167/461 (36%), Positives = 258/461 (55%), Gaps = 31/461 (6%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +DHF + + +TF QRY+I +W IL + G E I +GF ++
Sbjct: 54 YIQQKVDHFGFNID--RTFKQRYLIADNYWKEDGGSILFYTGNEGDIIWFCNNTGFMWDI 111
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-N 173
A + KA++V EHRYYG+S+PFG+ + ++RH + + QALAD+A ++ ++K
Sbjct: 112 AEEMKAMLVFAEHRYYGESLPFGAD--SFSDSRHLNFLTTEQALADFAKLIRYLKRTIPG 169
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
A + PVIA+G SYGG LA WFR+KYPH+V+G+LASSAPI + D+ P + +V+ DF
Sbjct: 170 ARNQPVIALGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFNDLVPCDIFMKIVTTDFS 229
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVYSEA 290
+ C ++I++SW I+ +A K GL LS+ CTPL ++ LKD + +
Sbjct: 230 QSGPNCSESIRRSWDAINRLAKKGPGLRWLSEALHLCTPLTKSQDVQRLKDWISETWVNV 289
Query: 291 AQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSR 340
A D P SN PVK +C + + N D ++ + + +A ++ Y+ ++
Sbjct: 290 AMVDYPYESNFLQPLPAWPVKVVCQYFKYS-NVPDTVMVQ---NIFQALNVYYNYSGQAK 345
Query: 341 CY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSP 398
C ++E T GW +Q+C+EMV+P D + M++P WN+ +Y +C +Q+GV P
Sbjct: 346 CLNVSETATSSLGVLGWSYQACTEMVMPTCSDGVDDMFEPHSWNMKEYSDDCFKQWGVRP 405
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
RPSW+ T YGG +I S +NIIFSNG DP+S GG K D + + G+H
Sbjct: 406 RPSWIPTMYGGKNIS-----SHTNIIFSNGELDPWSGGGVTK--DITDTLLAIVIPNGAH 458
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAIN 499
LDL + DP + R EVK M+ WI+ +Y + +N
Sbjct: 459 HLDLRASNALDPVSVQLTRSLEVKYMKQWISDFYVRLRKMN 499
>gi|449271455|gb|EMC81816.1| Lysosomal Pro-X carboxypeptidase, partial [Columba livia]
Length = 437
Score = 291 bits (746), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 167/445 (37%), Positives = 253/445 (56%), Gaps = 31/445 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + + TF QRY+I + W N PIL + G E I +GF ++ A +
Sbjct: 2 VDHFGF--DDNLTFQQRYLIADQHWKKNNGPILFYTGNEGDITWFSNNTGFMWDVAQELN 59
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A++V EHRYYG+S+PFG+ + +++ Y S QALAD+A ++ H+K A ++P
Sbjct: 60 AMLVFAEHRYYGESLPFGNE--SYSDSKRLNYLTSEQALADFAVLIEHLKSTIAGARYSP 117
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VIAIG SY G LA WFR+KYPH+V+G+LA+SAPI +GD+ P + S+V+KDF+ +
Sbjct: 118 VIAIGGSYRGMLAAWFRMKYPHLVVGALAASAPIWQFGDLVPCGAFFSIVTKDFKRSGTG 177
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTVYSEAAQYDT 295
C ++I+ SWS ID ++S +GL LSK F C+PLKN + LK GL + A +
Sbjct: 178 CAESIRNSWSAIDRLSSTDEGLLWLSKTFHLCSPLKNLEDVAVLKRGLSDTWINLAMMNY 237
Query: 296 P---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCY-IN 344
P P++ +C +++ P+ D +L + V A ++ Y+ C+ +
Sbjct: 238 PYKSDFLEPLPAWPIEEVCKFLKD-PSLSDKLLLQ---NVFRAINIYYNYSGQILCFDMA 293
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
E T GW +Q+C+EMV+P+ D N M++P+ W+L + C + +GV PRPSW+
Sbjct: 294 ETATKSLGHLGWNYQTCTEMVMPLCTDGVNDMFEPQKWDLKARSEECYKLWGVRPRPSWI 353
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
++ YGG +I S SNIIFSNG DP+S GG + ++ + G+H LDL
Sbjct: 354 ISMYGGKNI-----HSHSNIIFSNGGMDPWSAGGVTRNISHSLVAIM--IPEGAHHLDLR 406
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWI 488
SDP + Q R E+ M+ WI
Sbjct: 407 GRNPSDPKSVQQARDLELCYMKQWI 431
>gi|426251525|ref|XP_004019472.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Ovis aries]
Length = 517
Score = 291 bits (746), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 178/503 (35%), Positives = 275/503 (54%), Gaps = 42/503 (8%)
Query: 19 STSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFF----YNQT--IDHFSYGPESYQT 72
STSF + RP + R ++QK F +++T +DHF + + +T
Sbjct: 35 STSFGS-----RPTITPKYSIRYIQQKALGGHLHEFIIQVNHHKTPLVDHFGFNID--RT 87
Query: 73 FPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGK 132
F QRY+I +W IL + G E I +GF ++ A + KA++V EHRYYG+
Sbjct: 88 FKQRYLIADNYWKEDGGSILFYTGNEGDIIWFCNNTGFMWDIAEEMKAMLVFAEHRYYGE 147
Query: 133 SVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGELA 191
S+PFG+ + ++RH + + QALAD+A ++ ++K A + PVIA+G SYGG LA
Sbjct: 148 SLPFGAD--SFSDSRHLNFLTTEQALADFAKLIKYLKRTIPGARNQPVIALGGSYGGMLA 205
Query: 192 TWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEID 251
WFR+KYPH+V+G+LASSAPI + D+ P + +V+ DF + C ++I++SW I+
Sbjct: 206 AWFRMKYPHLVVGALASSAPIWQFNDLVPCDVFMKIVTTDFSQSGPNCSESIRRSWDAIN 265
Query: 252 NIASKPDGLAILSKKFKTCTP---LKNTTELKDGLDTVYSEAAQYDTP--SNI------- 299
+A K GL LS+ CTP L++ LKD + + A D P SN
Sbjct: 266 RLAKKGPGLRWLSEALHLCTPLTKLQDVQHLKDWIAETWVNLAMVDYPYESNFLQPLPAW 325
Query: 300 PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCY-INEDRTGDESDEGWE 357
PVK +C + + N D +L + + +A ++ Y+ ++C ++E T +GW
Sbjct: 326 PVKVVCQYFKYS-NVPDTLLLR---NIFQALNVYYNYSGQAKCLNVSETATSSLGVQGWS 381
Query: 358 WQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLIL 416
+Q+C+EMV+P D + M++P WN+ +Y +C +Q+GV PRPSW+ T YGG +I
Sbjct: 382 YQACTEMVMPTCSDGVDDMFEPHSWNMKEYSDDCFKQWGVRPRPSWIPTMYGGKNIS--- 438
Query: 417 RRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQ 476
S +NIIFSNG DP+S GG K D + + G+H LDL + DP +
Sbjct: 439 --SHTNIIFSNGELDPWSGGGVTK--DITDTLLAIVIPNGAHHLDLRASNALDPVSVQLT 494
Query: 477 RKTEVKIMQGWITQYYDDFKAIN 499
R EVK M+ W+T +Y + +N
Sbjct: 495 RSLEVKYMKQWVTDFYVRLRKMN 517
>gi|224043680|ref|XP_002188561.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Taeniopygia guttata]
Length = 479
Score = 291 bits (745), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 164/456 (35%), Positives = 262/456 (57%), Gaps = 31/456 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ T + +Q IDHF + + +TF QRY++ + W N PIL + G E I+ +GF
Sbjct: 34 YLTRYLSQQIDHFGF--DENRTFQQRYLLADQHWKKDNGPILFYTGNEGDIEWFCNNTGF 91
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + A++V EHRYYG+S+PFG+ + +++H Y S QALAD+A ++ ++K
Sbjct: 92 MWDVAEELNAMLVFAEHRYYGESLPFGNE--SFSDSKHLNYLTSEQALADFAVLVEYLKT 149
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A H+PVIAIG SYGG LA WFR+KYPHVV+G+LA+SAPI + D+ P T+ S+V+
Sbjct: 150 TIAGAQHSPVIAIGGSYGGMLAAWFRMKYPHVVVGALAASAPIWQFADLVPCGTFFSIVT 209
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTV 286
DF+ + + C ++I+ SW+ I++++S GL LS F C+PLK + LK+ L
Sbjct: 210 NDFKKSGKGCSESIRNSWNAINHLSSTDAGLQWLSNTFHLCSPLKTLQDAAVLKNWLSET 269
Query: 287 YSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-- 335
+ A + P P++ +C +++ P+ D +L + V +A +L Y+
Sbjct: 270 WVNLAMVNYPYKADFLQPLPAWPIQEVCKFLKD-PSLSDKLLLQ---NVFQAVNLYYNYT 325
Query: 336 GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQY 394
G S +++ T + GW +Q+C+E+V+P+ D + M++P+ W+ + C +
Sbjct: 326 GEASCLDVSQTATKSLGEMGWYYQACTEIVMPLCTDGVHDMFEPQKWDFDALSEECYRMW 385
Query: 395 GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
GV PR SW+L+ YGG +I S SNIIFSNG DP+S GG T + + + +
Sbjct: 386 GVRPRLSWILSMYGGKNIS-----SHSNIIFSNGGLDPWSAGG--VTQNISNSLVAVVIP 438
Query: 455 LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
G+H LDL DP + Q R E+ +M+ WI +
Sbjct: 439 DGAHHLDLRSHNPLDPKSVQQARAMEICLMKEWIEK 474
>gi|345322922|ref|XP_001513964.2| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Ornithorhynchus
anatinus]
Length = 742
Score = 291 bits (744), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 163/456 (35%), Positives = 256/456 (56%), Gaps = 31/456 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ET F+ Q +DHF + + TF QRY+++ + W IL + G E I +GF
Sbjct: 46 YETRFFPQKVDHFGF--DLDLTFKQRYLVSDQHWREDGGSILFYTGNEGDITWFCNNTGF 103
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+E A + +A++V EHRYYG+S+PFG + + +++H Y S QALAD+A ++ H+K
Sbjct: 104 MWEVAEELQAMLVFAEHRYYGESLPFGDQ--SFSDSKHLNYLTSEQALADFAVLIEHLKA 161
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A ++PVI+IG SYGG LA W R+KYPH+V+G+LA+SAPI +GD+ P + +V+
Sbjct: 162 TIPGAQNSPVISIGGSYGGMLAAWIRMKYPHLVVGALAASAPIWQFGDLVPCGRFFEIVT 221
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK---NTTELKDGLDTV 286
DF+ + C +TI+ SW + I+S +GL LS F C+PL+ + LK + +
Sbjct: 222 NDFKKSGAGCSETIRASWDAVGRISSTEEGLQWLSHTFHLCSPLRSQQDAVALKTWMSST 281
Query: 287 YSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSL--EYD 335
+ + A D P PV+ +C +++ P DD+L + + +A ++ Y
Sbjct: 282 WVDLAMVDYPYETDFLQPLPAWPVQAVCKYLKD-PKSADDVLLQ---NIFQALNVYYNYT 337
Query: 336 GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQY 394
G S +E TG+ +GW +Q+C+EMV+P+ D N M++P+PW+ C + +
Sbjct: 338 GKTSCLNTSETVTGNLGMQGWSYQACTEMVMPLCTDGINDMFEPQPWDFKALSDECFKLW 397
Query: 395 GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
GV PR W+ T YGG +I S SNIIFSNG DP+S GG + D + +
Sbjct: 398 GVRPRLFWIPTVYGGKNIS-----SHSNIIFSNGALDPWSGGG--VNENITDTLVAVVIP 450
Query: 455 LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
G+H LDL DP ++Q R EV +++ W+ +
Sbjct: 451 EGAHHLDLRANNPYDPKSVLQARAAEVHLIKQWVAK 486
>gi|441645350|ref|XP_004093164.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X carboxypeptidase
[Nomascus leucogenys]
Length = 517
Score = 290 bits (742), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 163/449 (36%), Positives = 252/449 (56%), Gaps = 30/449 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + + +TF QRY++ K+W IL + G E I +GF ++ A +
Sbjct: 78 VDHFGF--NTVKTFNQRYLVADKYWKKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEDXE 135
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A++V EHRYYG+S+PFG + K++RH + S QALAD A ++ H+K A + P
Sbjct: 136 AMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADLAELIKHLKRTIPGAENQP 193
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+ DFR +
Sbjct: 194 VIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPH 253
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTP 296
C ++I++SW I+ +++ GL L+ C+PL ++ LKD + + A D P
Sbjct: 254 CSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYP 313
Query: 297 --SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCY-INE 345
SN P+K +C ++N PN D +L + + +A ++ Y+ +C I+E
Sbjct: 314 YASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSGQVKCLNISE 369
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL 404
T GW +Q+C+E+V+P + + M++P WNL + +C + +GV PRPSW+
Sbjct: 370 TATSSLGTLGWSYQACTEIVMPFCTNGVDDMFEPHSWNLKELSDDCFQLWGVRPRPSWIT 429
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDE 464
T YGG +I S +NI+FSNG DP+S GG K D + ++ G+H LDL
Sbjct: 430 TMYGGKNIS-----SHTNIVFSNGELDPWSGGGVTKD--ITDTLVAVTISEGAHHLDLRT 482
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
DP ++ R EV+ M+ WI +YD
Sbjct: 483 KNALDPTSVLLARSLEVRHMKNWIRDFYD 511
>gi|242006450|ref|XP_002424063.1| Lysosomal Pro-X carboxypeptidase, putative [Pediculus humanus
corporis]
gi|212507369|gb|EEB11325.1| Lysosomal Pro-X carboxypeptidase, putative [Pediculus humanus
corporis]
Length = 457
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 166/466 (35%), Positives = 265/466 (56%), Gaps = 39/466 (8%)
Query: 46 DSNHGFETFF-YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN 104
D N F F+ + +DHFS+ + +TF RY+IN +W PI + G E I+
Sbjct: 2 DMNTNFIVFYLFINYVDHFSF--VTNETFNIRYLINDTYWNNKTGPIFFYTGNEGDIEVF 59
Query: 105 IQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
Q +GF +E A +F AL++ EHRYYGKS+P+G++ + ++ GY S QALADY +
Sbjct: 60 AQNTGFMWEIAPKFNALLIFAEHRYYGKSLPYGNK--TFSDPKYLGYLTSEQALADYVDL 117
Query: 165 LLHIKDKYNATHA-PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+ H+ N ++ PVIA G SYGG LA + R+KYPH+V G++ASSAPI + +TP
Sbjct: 118 IAHLTWNDNKSYKNPVIAFGGSYGGMLAAYIRMKYPHMVAGAIASSAPIWQFTGLTPCDV 177
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT---ELK 280
+ +V+ DF ++ C I++SW+ I+NI S DGL ++ ++K C PLKN+T +LK
Sbjct: 178 FSRIVTSDFEIENKSCSFNIRRSWNIINNITSNEDGLKWITNEWKLCRPLKNSTDVRDLK 237
Query: 281 DGLDTVYSEAAQ--YDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADS 331
+ L VY+ A Y P+N P+++ C ++N G D+L + V
Sbjct: 238 NWLSDVYNNLAMVNYPYPTNFLTPLPGYPIRKFCKKLQNTTATGKDLLNLLYKSVTVY-- 295
Query: 332 LEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNS-MYQPEPWNLTKYIKNC 390
Y G +S+C +D T + W++Q+C+EMV+PM ++ N+ M++P+ W+L K+ C
Sbjct: 296 FNYTG-SSKCLNFDDSTPSLGADLWDYQACTEMVMPMCQNGNTDMFEPQEWDLLKFTHEC 354
Query: 391 KEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
++++ V+P+P + YGG D+ + SNIIFSNG+ DP++ GG +K S
Sbjct: 355 EKKWKVTPKPYLIEKLYGGKDLS-----TASNIIFSNGLLDPWAGGGVLKVN-------S 402
Query: 451 QDLNL-----GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+ + + +H LDL DP +++ RK KI+ WI ++
Sbjct: 403 KTIKVVLIPEAAHHLDLRATTPIDPLSVIESRKLYKKIIYLWIKEH 448
>gi|20072291|gb|AAH26424.1| Prcp protein, partial [Mus musculus]
Length = 451
Score = 289 bits (740), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 164/461 (35%), Positives = 251/461 (54%), Gaps = 30/461 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + +TF QRY++ K W IL + G E I +GF
Sbjct: 6 YSVLYFEQKVDHFGFA--DMRTFKQRYLVADKHWQRNGGSILFYTGNEGDIVWFCNNTGF 63
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG + + K+++H + S QALAD+A ++ H++
Sbjct: 64 MWDVAEELKAMLVFAEHRYYGESLPFG--QDSFKDSQHLNFLTSEQALADFAELIRHLEK 121
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + P + +V+
Sbjct: 122 TIPGAQGQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWQLDGMVPCGEFMKIVT 181
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DFR + C ++I+KSW+ ID ++ GL L+ C+PL + LK + +
Sbjct: 182 NDFRKSGPYCSESIRKSWNVIDKLSGSGSGLQSLTNILHLCSPLTSEKIPTLKGWIAETW 241
Query: 288 SEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSL--EYDG 336
A + P P+K +C ++N PN D +L + + +A S+ Y G
Sbjct: 242 VNLAMVNYPYACNFLQPLPAWPIKEVCQYLKN-PNVSDTVLLQ---NIFQALSVYYNYSG 297
Query: 337 NNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
+ I++ T GW +Q+C+EMV+P + + M++P W+L KY +C Q+G
Sbjct: 298 QAACLNISQTTTSSLGSMGWSFQACTEMVMPFCTNGIDDMFEPFLWDLEKYSNDCFNQWG 357
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRP W+ T YGG +I S SNIIFSNG DP+S GG T D + +++
Sbjct: 358 VKPRPHWMTTMYGGKNIS-----SHSNIIFSNGELDPWSGGG--VTRDITDTLVAINIHD 410
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
G+H LDL DP ++ R EVK M+ WI +Y + +
Sbjct: 411 GAHHLDLRAHNAFDPSSVLLSRLLEVKHMKKWILDFYSNIQ 451
>gi|33469015|ref|NP_082519.1| lysosomal Pro-X carboxypeptidase precursor [Mus musculus]
gi|68565642|sp|Q7TMR0.2|PCP_MOUSE RecName: Full=Lysosomal Pro-X carboxypeptidase; AltName:
Full=Proline carboxypeptidase; AltName:
Full=Prolylcarboxypeptidase; Short=PRCP; Flags:
Precursor
gi|26342108|dbj|BAC34716.1| unnamed protein product [Mus musculus]
gi|74193542|dbj|BAE20697.1| unnamed protein product [Mus musculus]
Length = 491
Score = 289 bits (739), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 164/461 (35%), Positives = 251/461 (54%), Gaps = 30/461 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + +TF QRY++ K W IL + G E I +GF
Sbjct: 46 YSVLYFEQKVDHFGFA--DMRTFKQRYLVADKHWQRNGGSILFYTGNEGDIVWFCNNTGF 103
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG + + K+++H + S QALAD+A ++ H++
Sbjct: 104 MWDVAEELKAMLVFAEHRYYGESLPFG--QDSFKDSQHLNFLTSEQALADFAELIRHLEK 161
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + P + +V+
Sbjct: 162 TIPGAQGQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWQLDGMVPCGEFMKIVT 221
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DFR + C ++I+KSW+ ID ++ GL L+ C+PL + LK + +
Sbjct: 222 NDFRKSGPYCSESIRKSWNVIDKLSGSGSGLQSLTNILHLCSPLTSEKIPTLKGWIAETW 281
Query: 288 SEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSL--EYDG 336
A + P P+K +C ++N PN D +L + + +A S+ Y G
Sbjct: 282 VNLAMVNYPYACNFLQPLPAWPIKEVCQYLKN-PNVSDTVLLQ---NIFQALSVYYNYSG 337
Query: 337 NNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
+ I++ T GW +Q+C+EMV+P + + M++P W+L KY +C Q+G
Sbjct: 338 QAACLNISQTTTSSLGSMGWSFQACTEMVMPFCTNGIDDMFEPFLWDLEKYSNDCFNQWG 397
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRP W+ T YGG +I S SNIIFSNG DP+S GG T D + +++
Sbjct: 398 VKPRPHWMTTMYGGKNIS-----SHSNIIFSNGELDPWSGGG--VTRDITDTLVAINIHD 450
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
G+H LDL DP ++ R EVK M+ WI +Y + +
Sbjct: 451 GAHHLDLRAHNAFDPSSVLLSRLLEVKHMKKWILDFYSNIQ 491
>gi|395814761|ref|XP_003780910.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Otolemur
garnettii]
Length = 515
Score = 289 bits (739), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 160/448 (35%), Positives = 255/448 (56%), Gaps = 30/448 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + S +TF QRY+I +++W IL + G E I +GF ++ A + K
Sbjct: 77 VDHFGF--NSVKTFNQRYLIANEYWKKDGGSILFYTGNEGDIVWFCNNTGFMWDVAQELK 134
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A++V EHRYYG+S+PFG K + K++RH Y S QALAD+A ++ H+++ + P
Sbjct: 135 AMLVFAEHRYYGESLPFG--KNSFKDSRHLNYLTSEQALADFAELIRHLQETIPGVKNQP 192
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+ DF+ +
Sbjct: 193 VIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTSDFKRSGPY 252
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTP 296
C +TI+KSW+ I+ +++ GL L++ CTPL ++ LKD + + A D P
Sbjct: 253 CSETIRKSWNVINQLSTTSRGLQYLTEVLHLCTPLTPRDVQHLKDWIAETWVNLAMVDYP 312
Query: 297 ---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCY-INE 345
P+K +C ++N PN D ++ + + +A ++ Y+ ++C ++
Sbjct: 313 YPSSFLQPLPAWPIKVVCQYLKN-PNVSDSLMIQ---NIFQALNIYYNYSGEAQCLNVST 368
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL 404
T + GW +Q+C+E+V+P + + M++P WNL ++ C +Q+GV P PSW+
Sbjct: 369 TTTSNLGSLGWSYQACTEIVMPFCTNGIDDMFEPHAWNLKEFSDECFKQWGVRPSPSWIT 428
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDE 464
T YGG +I S +NI+FSNG DP+S GG T D + ++ G+H LDL
Sbjct: 429 TMYGGKNIN-----SHTNIVFSNGELDPWSGGG--VTEDLTDTLVAFTISEGAHHLDLRA 481
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+ DP + R E K ++ WI +Y
Sbjct: 482 SNALDPISVQLARTLETKHVKNWIRDFY 509
>gi|47209068|emb|CAF90249.1| unnamed protein product [Tetraodon nigroviridis]
Length = 487
Score = 288 bits (738), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 177/496 (35%), Positives = 260/496 (52%), Gaps = 35/496 (7%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
L+L+V S+ A L+ +L RR + ++T +++Q IDHF + +
Sbjct: 11 LVLLVCLGSSPVRA----LKSQL-FTRRGGSPSSSGISISYKTLYFDQKIDHFGFLEDG- 64
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
TF QRY++N K W PI + G E I +GF +E A +F A++V EHRYY
Sbjct: 65 -TFKQRYLVNDKHWQQPGGPIFFYTGNEGDITWFCNNTGFMWEIAEEFGAMLVFAEHRYY 123
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGE 189
G+S+PFG + + +H Y S QALAD+A ++ ++K A ++PVIAIG SYGG
Sbjct: 124 GESLPFGHD--SYSDNKHLNYLTSEQALADFAVLIQNLKSTLPGAQNSPVIAIGGSYGGM 181
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
LA WFR+KYPH+V+G+LASSAPI + + P ++ +V++DF + C + I SW
Sbjct: 182 LAAWFRMKYPHMVVGALASSAPIWQFPGMVPCGDFYKIVTQDFARSGSNCDKNIGMSWKA 241
Query: 250 IDNIASKPDGLAILSKKFKTCTPLKNTTEL---KDGLDTVYSEAAQYDTPSNI------- 299
I+N++S GL LS++F CTPLKN ++ K L + A D P
Sbjct: 242 IENVSSTASGLQWLSEEFGLCTPLKNRNDVAGFKSWLQETWVNLAMVDYPYEANFLQPLP 301
Query: 300 --PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDR--TGDESDEG 355
P++ +C + + + DD L GV +A + Y+ S +N + TG G
Sbjct: 302 AWPIQVVCKYLSFSHSVPDDHLLH---GVSQATKVYYNYTGSSPCLNTSQTATGSLGFIG 358
Query: 356 WEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKL 414
W +Q+C+EMV+PM D M++PE WN + C +G PR W YGG DI
Sbjct: 359 WFYQACTEMVMPMCTDGVRDMFEPEEWNFQAFSDECNAMFGARPRAEWARAVYGGKDIA- 417
Query: 415 ILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLV 474
+ SNIIFSNG DP+S GG T++ + S + G+H LDL + DP +
Sbjct: 418 ----AHSNIIFSNGGLDPWSAGG--VTHNITESLVSILIPDGAHHLDLRYSNDRDPPSVR 471
Query: 475 QQRKTEVKIMQGWITQ 490
R EV + WI Q
Sbjct: 472 AARALEVNYFRKWIKQ 487
>gi|190702395|gb|ACE75287.1| prolylcarboxypeptidase [Glyptapanteles flavicoxis]
Length = 497
Score = 288 bits (738), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 159/450 (35%), Positives = 248/450 (55%), Gaps = 27/450 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHFS+ + TF RY +N + G N+PI + G E ++ +GF ++ A F
Sbjct: 53 VDHFSFSLNN--TFEMRYFVNDTWKSGKNAPIFFYTGNEGVLETFAANTGFMWDIAPTFG 110
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV 179
ALIV EHRYYG+S+PFG++ + N ++ GY S QALADY +++H+K + H+PV
Sbjct: 111 ALIVFAEHRYYGESMPFGNK--SFDNVKNLGYLTSQQALADYVDLIVHLKSDPSLKHSPV 168
Query: 180 IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEEC 239
IA G SYGG L+ WFR+KYPH++ G++A+SAP+L + T + +V+ DFR C
Sbjct: 169 IAFGGSYGGMLSAWFRMKYPHIINGAIAASAPVLQFTGYTDCQAFSRIVTSDFRAVHPNC 228
Query: 240 FQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVYSEAAQYDTP 296
+ I+KSW I N+ S DG +S KFK C L +N KD L +VYS A + P
Sbjct: 229 EKVIRKSWESIKNLTSTDDGKKWISTKFKVCGSLTTEENFKTFKDFLLSVYSNLAMVNYP 288
Query: 297 ---------SNIPVKRICNAIENAPNCGD-DILCKIAAGVVEADSLEYDGNNSRCYINED 346
P++ C + + D D++ + G+ + Y G +++
Sbjct: 289 YATDFLSPLPAFPIREFCKFVNDTNLSSDKDVMTNLQKGINLYSN--YTGRLKCLDLSDP 346
Query: 347 RTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT 405
+ GW +Q+C+EMV+PM D N M++PEPWN TKY ++C + V+P+P V
Sbjct: 347 EPDLGAANGWNYQACTEMVMPMCNDGVNDMFEPEPWNFTKYSQDCFTHFNVTPKPELVCD 406
Query: 406 YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEA 465
YG D+ + SNI+FSNG+ DP+S GG ++ + + + + G+H LDL +
Sbjct: 407 MYGCDDLS-----TASNIVFSNGLLDPWSSGGVLR--NLSSSAVAIIIPEGAHHLDLRGS 459
Query: 466 KKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
SDP +V+ R+ ++ WI ++ +D
Sbjct: 460 HPSDPYSVVKAREYHAYSIKKWIREHREDL 489
>gi|84000301|ref|NP_001033253.1| lysosomal Pro-X carboxypeptidase precursor [Bos taurus]
gi|122137021|sp|Q2TA14.1|PCP_BOVIN RecName: Full=Lysosomal Pro-X carboxypeptidase; AltName:
Full=Proline carboxypeptidase; AltName:
Full=Prolylcarboxypeptidase; Short=PRCP; Flags:
Precursor
gi|83405390|gb|AAI11172.1| Prolylcarboxypeptidase (angiotensinase C) [Bos taurus]
Length = 499
Score = 288 bits (737), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 166/461 (36%), Positives = 257/461 (55%), Gaps = 31/461 (6%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +DHF + + +TF QRY+I +W IL + G E I +GF ++
Sbjct: 54 YIQQKVDHFGFNID--RTFKQRYLIADNYWKEDGGSILFYTGNEGDIIWFCNNTGFMWDI 111
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-N 173
A + KA++V EHRYYG+S+PFG+ + ++RH + + QALAD+A ++ ++K
Sbjct: 112 AEEMKAMLVFAEHRYYGESLPFGAD--SFSDSRHLNFLTTEQALADFAKLIRYLKRTIPG 169
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
A + VIA+G SYGG LA WFR+KYPH+V+G+LASSAPI + D+ P + +V+ DF
Sbjct: 170 ARNQHVIALGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFNDLVPCDIFMKIVTTDFS 229
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVYSEA 290
+ C ++I++SW I+ +A K GL LS+ CTPL ++ LKD + +
Sbjct: 230 QSGPNCSESIRRSWDAINRLAKKGTGLRWLSEALHLCTPLTKSQDVQRLKDWISETWVNV 289
Query: 291 AQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSR 340
A D P SN PVK +C + + N D ++ + + +A ++ Y+ ++
Sbjct: 290 AMVDYPYESNFLQPLPAWPVKVVCQYFKYS-NVPDTVMVQ---NIFQALNVYYNYSGQAK 345
Query: 341 CY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSP 398
C ++E T GW +Q+C+EMV+P D + M++P WN+ +Y +C +Q+GV P
Sbjct: 346 CLNVSETATSSLGVLGWSYQACTEMVMPTCSDGVDDMFEPHSWNMKEYSDDCFKQWGVRP 405
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
RPSW+ T YGG +I S +NIIFSNG DP+S GG K D + + G+H
Sbjct: 406 RPSWIPTMYGGKNIS-----SHTNIIFSNGELDPWSGGGVTK--DITDTLLAIVIPNGAH 458
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAIN 499
LDL + DP + R EVK M+ WI+ +Y + +N
Sbjct: 459 HLDLRASNALDPVSVQLTRSLEVKYMKQWISDFYVRLRKMN 499
>gi|166064014|ref|NP_001099751.2| lysosomal Pro-X carboxypeptidase precursor [Rattus norvegicus]
Length = 493
Score = 288 bits (736), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 169/484 (34%), Positives = 256/484 (52%), Gaps = 33/484 (6%)
Query: 29 LRPRL---GRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG 85
+RPRL G S + ++ Q +DHF + +TF QRY++ K W
Sbjct: 20 IRPRLKTLGSPHLSTNTPDPAVPSSYSVHYFQQKVDHFGF--SDTRTFKQRYLVADKHWQ 77
Query: 86 GGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKN 145
IL + G E I +GF ++ A + KA++V EHRYYGKS+PFG + + K+
Sbjct: 78 RNGGSILFYTGNEGDIVWFCNNTGFMWDVAEELKAMLVFAEHRYYGKSLPFG--RDSFKD 135
Query: 146 ARHRGYFNSAQALADYASILLHIKDKYNATHA-PVIAIGASYGGELATWFRLKYPHVVIG 204
++H + S QALAD+A ++ H+K+ T PVIAIG SYGG LA WFR+KYPH+V+G
Sbjct: 136 SQHLNFLTSEQALADFAELIRHLKETIPGTEGQPVIAIGGSYGGMLAAWFRMKYPHIVVG 195
Query: 205 SLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILS 264
+LA+SAPI + P + +V+KDF + C ++I++SWS I+ ++ GL L
Sbjct: 196 ALAASAPIWQLDGMAPCGEFMKIVTKDFSQSGPHCSESIRRSWSVINRLSDTGSGLQWLR 255
Query: 265 KKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTPSNI---------PVKRICNAIENAPN 313
C+PL + LK + + A + P P+K +C + N PN
Sbjct: 256 DTLHLCSPLTSEKMPTLKGWIAETWVNLAMVNYPYACNFLQPLPAWPIKEVCRYLRN-PN 314
Query: 314 CGDDILCKIAAGVVEADSLEYD-GNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKD 371
D +L + + +A ++ Y+ ++C I++ T GW +Q+C+EMV+P +
Sbjct: 315 VSDTVLLQ---NIFQALNVYYNYSGQTKCLNISQTTTSSLGSTGWSFQACTEMVMPFCTN 371
Query: 372 K-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMR 430
+ M++P W+L Y +C Q+GV PRP W+ T YGG +I S SNIIFSNG
Sbjct: 372 GIDDMFEPYLWDLETYSNDCFNQWGVKPRPHWMTTMYGGKNIS-----SHSNIIFSNGDL 426
Query: 431 DPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
DP+S GG K D + ++ G+H LDL DP ++ R EVK M+ WI
Sbjct: 427 DPWSGGGVTK--DITDTLVAINIPEGAHHLDLRAHNAFDPSSVLLSRLLEVKHMKKWIAD 484
Query: 491 YYDD 494
+Y +
Sbjct: 485 FYSN 488
>gi|32967631|gb|AAH55022.1| Prcp protein [Mus musculus]
gi|74147590|dbj|BAE38679.1| unnamed protein product [Mus musculus]
Length = 491
Score = 288 bits (736), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 164/461 (35%), Positives = 250/461 (54%), Gaps = 30/461 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + +TF QRY++ K W IL + G E I +GF
Sbjct: 46 YSVLYFEQKVDHFGFA--DMRTFKQRYLVADKHWQRNGGSILFYTGNEGDIVWFCNNTGF 103
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG + + K+++H + S QALAD+A ++ H++
Sbjct: 104 MWDVAEELKAMLVFAEHRYYGESLPFG--QDSFKDSQHLNFLTSEQALADFAELIRHLEK 161
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + P + +V+
Sbjct: 162 TIPGAQGQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWQLDGMVPCGEFMKIVT 221
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVY 287
DFR + C ++I+KSW+ ID ++ GL L+ C+PL + LK + +
Sbjct: 222 NDFRKSGPYCSESIRKSWNVIDKLSGSGSGLQSLTNILHLCSPLTSEKIPTLKGWIAETW 281
Query: 288 SEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSL--EYDG 336
A + P P+K +C ++N PN D +L + + +A S+ Y G
Sbjct: 282 VNLAMVNYPYACNFLQPLPAWPIKEVCQYLKN-PNVSDTVLLQ---NIFQALSVYYNYSG 337
Query: 337 NNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYG 395
+ I++ T GW +Q+C+EMV+P + + M++P W+L KY +C Q+G
Sbjct: 338 QAACLNISQTTTSSLGSMGWSFQACTEMVMPFCTNGIDDMFEPFLWDLEKYSNDCFNQWG 397
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V PRP W+ T YGG +I S SNIIFSNG DP+S GG T D + ++
Sbjct: 398 VKPRPHWMTTMYGGKNIS-----SHSNIIFSNGKLDPWSGGG--VTRDITDTLVAINIPD 450
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
G+H LDL DP ++ R EVK M+ WI +Y + +
Sbjct: 451 GAHHLDLRAHNAFDPSSVLLSRLLEVKHMKKWILDFYSNIQ 491
>gi|291234129|ref|XP_002737003.1| PREDICTED: prolylcarboxypeptidase isoform 1 preproprotein-like
[Saccoglossus kowalevskii]
Length = 501
Score = 288 bits (736), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 163/480 (33%), Positives = 264/480 (55%), Gaps = 33/480 (6%)
Query: 28 KLRPRLG-RIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG 86
+ RP LG +I+ + S + ++T ++ QT+DHF++ TF QRY+++ +W
Sbjct: 33 RFRPGLGVQIKPDHGSRLRLSKYNYQTLYFKQTLDHFNFANNG--TFSQRYLLSDDYWNS 90
Query: 87 GNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNA 146
+ PI + G E I +GF ++ A QFKA+++ EHRYYG+S+PFG+ + +
Sbjct: 91 -SGPIFFYTGNEGDITWFCNNTGFIWDIAPQFKAMVIFAEHRYYGESLPFGNE--SFSDL 147
Query: 147 RHRGYFNSAQALADYASILLHIKD-KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGS 205
H GY S QALAD+A+++ +IK + A +PVI G SYGG +A WFR+KYP++V G+
Sbjct: 148 EHVGYLTSEQALADFATLIKYIKSSRPGADKSPVIVFGGSYGGMMAAWFRMKYPNIVDGA 207
Query: 206 LASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSK 265
LA+SAPI + +TP T +++++DF +C +TI KSW+ I+ + + G L+
Sbjct: 208 LAASAPIWQFPGLTPCNTLFTIITQDFVKAGRDCAETIHKSWNAINRMKDQESGRQWLTM 267
Query: 266 KFKTCTPLKNTTELKD---GLDTVYSEAAQYDTP---------SNIPVKRICNAIENAPN 313
F CTPLK ++ D L + A D P P+K+ C+ + N+
Sbjct: 268 AFHLCTPLKTPADVNDLSSWLSNTWFNLAMVDYPYPASFLEPLPAWPIKKTCSYMTNSSL 327
Query: 314 CGDDILCKIAAGVVEADSLEYDGNN-SRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKD 371
+L +A+ + + Y+ ++C+ I++D + GW +QSC+EMV+P D
Sbjct: 328 LDKPLLNSVASSL----QVYYNTTGKTQCFNISQDAVSSLGELGWSYQSCTEMVMPSCSD 383
Query: 372 K-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMR 430
+ M+ P WN ++K C+ +GV+PR W++T+YGG I ++SNIIFSNG
Sbjct: 384 GVHDMFPPNKWNFDDFVKECQNTWGVTPRADWIVTHYGGKAIT-----ASSNIIFSNGEL 438
Query: 431 DPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
DP+S GG + + + + + G+H LDL K DP ++ R E + WI Q
Sbjct: 439 DPWSGGGVLHSLS--ETLIAIVIKDGAHHLDLRSKDKGDPQSVIDARNQEKYHITKWIQQ 496
>gi|410910456|ref|XP_003968706.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Takifugu
rubripes]
Length = 500
Score = 284 bits (726), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 173/494 (35%), Positives = 259/494 (52%), Gaps = 33/494 (6%)
Query: 13 LMVIFVSTSFHANGLKLRPRLGRI--RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
L+ FV T F L +R ++ RR ++T ++ Q IDHF + +
Sbjct: 7 LLTTFVLT-FCLGSLAVRALKSQLFTRRGGSSSSAGIPISYKTLYFEQKIDHFGFLEDG- 64
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
TF QRY+I K W PI + G E I +GF +E A +F A++V EHRYY
Sbjct: 65 -TFKQRYLIADKHWQQPGGPIFFYTGNEGDITWFCNNTGFMWEIAEEFGAMLVFAEHRYY 123
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGE 189
G+S+PFG+ + + +H Y S QALAD+A ++ ++K + A ++PVIA+G SYGG
Sbjct: 124 GESLPFGAD--SYSDNKHLNYLTSEQALADFAVLVQNLKSTFPGAQNSPVIAVGGSYGGM 181
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
LA WFR+KYPH+V+G+LASSAPI + + P ++ +V++DF + +C + I+ SW
Sbjct: 182 LAAWFRMKYPHIVVGALASSAPIWQFPGMVPCGDFYKIVTQDFAKSGSDCDKNIRMSWKA 241
Query: 250 IDNIASKPDGLAILSKKFKTCTPLKNTTEL---KDGLDTVYSEAAQYDTPSNI------- 299
I+N++S GL LS++F C PLK+ +++ K+ L + A D P
Sbjct: 242 IENVSSTASGLQWLSEEFGLCAPLKSKSDIAGFKNWLQETWVNLAMVDYPYEANFLQPLP 301
Query: 300 --PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDR--TGDESDEG 355
P++ +C + + + D L GV +A + Y+ S +N + TG G
Sbjct: 302 PWPIQAVCKYLSFSGSVPDYHLLH---GVSQATKVYYNYTGSSPCLNTSQTATGSLGFLG 358
Query: 356 WEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKL 414
W +Q+C+EMV+PM D M++PE WN + +C ++G PR W T YGG DI
Sbjct: 359 WFYQACTEMVMPMCTDGIQDMFEPEEWNFQAFSDDCNARFGARPRADWAGTVYGGKDIA- 417
Query: 415 ILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLV 474
+ SNIIFSNG DP+S GG ++ D S + G+H LDL DP +
Sbjct: 418 ----AHSNIIFSNGGLDPWSAGG--VNHNITDSLISILIPDGAHHLDLRYTNDHDPPSVR 471
Query: 475 QQRKTEVKIMQGWI 488
R EV WI
Sbjct: 472 AARALEVNYFHKWI 485
>gi|443709977|gb|ELU04397.1| hypothetical protein CAPTEDRAFT_174591 [Capitella teleta]
Length = 487
Score = 284 bits (726), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 161/463 (34%), Positives = 251/463 (54%), Gaps = 35/463 (7%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ T + Q +DHF + ++ T+ QRY++N + W G SPI + G E ID +G
Sbjct: 37 YTTHYITQKVDHFGFANDN--TYKQRYLLNDQHWRPG-SPIFFYTGNEGAIDWFCNNTGI 93
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+E A F A+++ EHRYYG+S+P+G++ + + H Y S QALAD+ S++ +K
Sbjct: 94 MWEWAPSFNAMLIFAEHRYYGESLPYGNK--SFDSPNHLNYLTSEQALADFVSLIADVKQ 151
Query: 171 KYNAT-HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
+ AT +PV+A G SYGG LA W R+KYP V+G+ A+SAPI +GD+ P + V +
Sbjct: 152 RMPATSKSPVVAFGGSYGGMLAAWLRMKYPSAVVGAFAASAPIWEFGDLVPLGGFAVVTT 211
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
K + + C I++SWS +D +AS +G L+ C P+K++ E+K L + +
Sbjct: 212 KSYASANPNCPIIIRRSWSVMDQLASSVEGREFLASALGLCNPVKSSAEVKSWLSSTWIN 271
Query: 290 AAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSR 340
A + P PVK IC+ + + N L V+ Y G+ S
Sbjct: 272 LAMANYPYKANFLEPLPAWPVKAICSHLSDI-NLDHKELVHAVRHAVDV-YYNYTGSASC 329
Query: 341 CYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPR 399
+E TG+ D+GW+ QSC+EMV+PM D N M++P PWN+T ++C++++ ++PR
Sbjct: 330 YKTSESATGNLGDQGWDIQSCTEMVMPMSNDGVNDMFEPSPWNMTAVTEDCQKKFKLTPR 389
Query: 400 PSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL---- 455
P W++ YGG +I + SNIIFSNG+ DP+S GG ++ S S+ L
Sbjct: 390 PDWIIRQYGGRNIS-----AHSNIIFSNGLLDPWSAGGVMQ-------SISESLVAIVIA 437
Query: 456 -GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKA 497
G+H +DL + DP + RK E I+ W+ KA
Sbjct: 438 DGAHHVDLRSSHPDDPISVQMARKKEKAIIAHWLGYQPKSLKA 480
>gi|395521107|ref|XP_003764661.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Sarcophilus harrisii]
Length = 450
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 160/448 (35%), Positives = 254/448 (56%), Gaps = 31/448 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + + TF QRY+I ++W N IL + G E I + +GF ++ A + K
Sbjct: 14 VDHFGF--DVNLTFKQRYLIADQYWKNNNGVILFYTGNEGDITWFCKNTGFMWDVAEELK 71
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A++V EHRYYG+S+PFG++ + +++H Y + QALAD+A ++ ++K A + P
Sbjct: 72 AMLVFAEHRYYGESLPFGNQ--SFSDSKHLNYLTAEQALADFAVLIEYLKKTIPGAKNRP 129
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VIAIG SYGG LA WFR+KYPH+VIG+LA+SAPI + D+ + +V+ DF+ + +
Sbjct: 130 VIAIGGSYGGMLAAWFRMKYPHIVIGALAASAPIWQFTDLVSCGKFFEIVTNDFKKSGSK 189
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTVYSEAAQYDT 295
C +TI+ SW ID +AS +GL +S+ F C PL++ + LK L + A D
Sbjct: 190 CSETIQGSWKAIDRLASTGEGLQWISEAFHLCNPLRSEMDVITLKSWLAETWVNLAMVDY 249
Query: 296 P--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINED 346
P SN P+K +C + ++ + D +L + + +A ++ Y+ + +N
Sbjct: 250 PYSSNFLMPLPAWPIKEVCKYLTDS-HASDKVLVQ---NIFKAVNIYYNYSGEASCLNTS 305
Query: 347 RTGDES--DEGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
+T S +GW +Q+C+EMV+P+ + M++P+ W+ Y C +++GV PRP W+
Sbjct: 306 QTATSSLGIQGWNYQACTEMVMPICATGISDMFEPQAWDFAAYSDQCFQEWGVRPRPLWI 365
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
T +GG +I S SNIIFSNG DP+ GG + D S + G+H LDL
Sbjct: 366 PTVFGGKNIS-----SHSNIIFSNGALDPWYAGG--VNENITDTLISIVIPDGAHHLDLR 418
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWITQY 491
DP+ ++ R EV M+ WIT++
Sbjct: 419 ARNADDPESVLLARIMEVDYMKKWITKF 446
>gi|432889721|ref|XP_004075329.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Oryzias latipes]
Length = 504
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 166/454 (36%), Positives = 248/454 (54%), Gaps = 33/454 (7%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
++TF++NQ I+HF + + TF QRY++ K W + PIL + G E I +GF
Sbjct: 52 YKTFYFNQRINHFGFLEDG--TFKQRYLVADKHWQEPDGPILFYTGNEGDITWFCNNTGF 109
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+E A + A++V EHRYYG+S+PFG + + +++H Y S QALAD+A ++ ++K
Sbjct: 110 MWEIAEELGAMLVFAEHRYYGESLPFG--QDSYSDSKHLNYLTSEQALADFAVLIQNLKS 167
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
K +PVI++G SYGG L+ W R+KYP+VV+G+LA+SAPI + + ++ +V++
Sbjct: 168 K--MPESPVISVGGSYGGMLSAWLRMKYPNVVVGALAASAPIWQFPRMVGCGDFYKIVTQ 225
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTVY 287
DF + + C TI++SW I+N++S GL LS++F C+PLK T+ K L +
Sbjct: 226 DFAKSGQNCDVTIRRSWKAINNLSSTASGLQWLSEEFSLCSPLKTKTDAVIFKGWLQETW 285
Query: 288 SEAAQYDTPSNI---------PVKRICNAIENAPNCGD-DILCKIAAGVVEADSLEYDGN 337
A D P P++ +C + D D+L GV +A + Y+
Sbjct: 286 VNLAMVDYPYEASFLQPLPAWPIQVVCKYLAFDSKASDYDLL----HGVSQAAKVYYNYT 341
Query: 338 NSRCYINEDRTGDES--DEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQY 394
S +N +T S GW +Q+C+EMV+PM D M++PE WN + C +
Sbjct: 342 GSSSCLNTSQTATSSLGALGWTYQACTEMVMPMCTDGVQDMFEPEDWNFQAFSDECYSMF 401
Query: 395 GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
GV PR W T YGG DI S SNIIFSNG DP+S GG TY+ + S +
Sbjct: 402 GVRPREDWAGTLYGGKDIS-----SHSNIIFSNGGLDPWSAGG--VTYNITESLVSIMIP 454
Query: 455 LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
G+H LDL + DP + R E+K Q WI
Sbjct: 455 DGAHHLDLRYSTDLDPPSVRAARALELKFFQEWI 488
>gi|195997817|ref|XP_002108777.1| hypothetical protein TRIADDRAFT_19181 [Trichoplax adhaerens]
gi|190589553|gb|EDV29575.1| hypothetical protein TRIADDRAFT_19181 [Trichoplax adhaerens]
Length = 463
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 170/460 (36%), Positives = 260/460 (56%), Gaps = 32/460 (6%)
Query: 47 SNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQ 106
SN ++T ++ Q +DHFS P + F QRY+I+ K+W G SPI + G E I +
Sbjct: 16 SNVKYQTKYFKQRLDHFS--PADDRKFQQRYLISQKYWKKG-SPIFFYTGNEGDITWFAK 72
Query: 107 LSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILL 166
+GF ++ A +F A+++ +EHRYYGK++PFG K + K+ H Y +S QALAD+A ++
Sbjct: 73 NTGFMWDIAPEFNAMLIFVEHRYYGKTLPFG--KDSFKDKEHLAYLSSEQALADFAQFIV 130
Query: 167 HIKDKYNAT-HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
K + + T ++ VIA G SYGG L W R+KYP++V G++A+SAPI +TP +
Sbjct: 131 DFKAETHGTQNSSVIAFGGSYGGMLTAWLRIKYPNIVDGAIAASAPIWQLEGLTPCDRFS 190
Query: 226 SVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDG 282
S+V+ F+ EC + I+ SW I + S G LS K C PLK+ ++ L +
Sbjct: 191 SIVTNTFKLAYPECPKNIRNSWKVIRKLGSTKSGRHTLSTTLKLCNPLKSPSDVDALVNW 250
Query: 283 LDTVYSEAAQ--YDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLE 333
L +++ A+ Y P+N PVK +C +++ P D +L + G+ + ++
Sbjct: 251 LSSIWVNLAEVDYPYPANFLEPLPAKPVKAVCASLQK-PLANDTLLIR---GIAKGLNVY 306
Query: 334 YD-GNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNC 390
++ NS+C+ ++D T GW +QSC+EMV+P+ D + M++P P+NLT C
Sbjct: 307 FNHTGNSKCFNTDQDATSHLGIAGWNFQSCTEMVLPVCMDGVHDMFEPTPFNLTAMAAIC 366
Query: 391 KEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
KEQYGV RP W T YGG IK + SNI+FSNG DP+S GG +K+ ++
Sbjct: 367 KEQYGVRTRPFWAQTLYGGKRIK-----AASNIVFSNGNLDPWSGGGVLKSLSKSLVAIV 421
Query: 451 QDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+ G+H LDL A K DP + RK E + WI +
Sbjct: 422 --IEGGAHHLDLRHANKDDPATVKHARKIEKMHIAKWIKE 459
>gi|190702489|gb|ACE75375.1| prolylcarboxypeptidase [Glyptapanteles indiensis]
Length = 497
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 161/450 (35%), Positives = 248/450 (55%), Gaps = 35/450 (7%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHFS+ + TF RY +N + G N+PI + G E ++ +GF +E A F
Sbjct: 53 VDHFSFSLNN--TFEMRYFVNDTWKNGKNAPIFFYTGNEGVLETFAANTGFMWEIAPTFG 110
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV 179
ALIV EHRYYG+S+PFG++ + N ++ GY S QALADY +++H+K + H+PV
Sbjct: 111 ALIVFAEHRYYGESMPFGNK--SFDNVKNLGYLTSQQALADYVDLIVHLKSDPSLKHSPV 168
Query: 180 IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEEC 239
IA G SYGG L+ WFR+KYPH++ G++A+SAP+L + T + +V+ DFR C
Sbjct: 169 IAFGGSYGGMLSAWFRMKYPHIINGAIAASAPVLQFTGYTDCQAFSRIVTSDFRAVHPNC 228
Query: 240 FQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVYSEAAQYDTP 296
+ I+KSW I N+ S DG +S +FK C L +N K L +VYS A + P
Sbjct: 229 EKVIRKSWESIKNLTSTDDGKKWISSEFKVCGSLTTEENFETFKYFLLSVYSNLAMVNYP 288
Query: 297 ---------SNIPVKRICNAIENAPNCGD-DILCKIAAGVVEADSLEYDGNNSRCYINED 346
P++ C + N D D++ I G+ + Y G +C D
Sbjct: 289 YATDFLSPLPAYPIREFCKFVNNTNLSSDKDVMTNIQKGINLYSN--YTG-KLKCL---D 342
Query: 347 RTGDESD----EGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPS 401
+ E D GW++Q+C+EMV+PM D N M++P+PWN TKY ++C + V+P+P
Sbjct: 343 LSNPEPDLGAASGWDYQACTEMVMPMCNDGVNDMFEPDPWNFTKYSQDCFTHFNVTPKPE 402
Query: 402 WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLD 461
V YG D+ + SNI+FSNG+ DP+S GG ++ + + + + G+H LD
Sbjct: 403 LVCDTYGCDDLS-----TASNIVFSNGLLDPWSSGGVLR--NLSSSAVAIIIPEGAHHLD 455
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
L + +DP +V+ R+ ++ WI ++
Sbjct: 456 LRGSHPADPYSVVKAREYHAYSIKKWIREH 485
>gi|332025885|gb|EGI66041.1| Lysosomal Pro-X carboxypeptidase [Acromyrmex echinatior]
Length = 484
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 163/466 (34%), Positives = 249/466 (53%), Gaps = 33/466 (7%)
Query: 42 LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPI 101
L + + +E ++ +DHFS+ + TF RY+IN + N+PI + G E I
Sbjct: 36 LSLLSARYKYEIKTFDVRVDHFSFAVQD--TFKLRYLINDTWRKQQNAPIFFYTGNEGNI 93
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
+ Q +GF +E A +F AL++ EHRYYG+S+P+G++ A N +HRGY S QALADY
Sbjct: 94 EVFAQNTGFLWEIAPKFDALVIFAEHRYYGESLPYGNQSFA--NLQHRGYLTSQQALADY 151
Query: 162 ASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
++ H+K + H+PVI G SYGG L+ W R+KYPHVV G++ASSAP+L + D+
Sbjct: 152 VELIAHLKSQPRYEHSPVIVFGGSYGGMLSAWMRMKYPHVVQGAIASSAPLLQFTDVVDC 211
Query: 222 TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE--- 278
+ + + D++ + C + I+KSW+ I N+ S +G LS +K C PLK +
Sbjct: 212 EVFARITTSDYKAANPTCSKLIQKSWNTITNVTSNDEGKKWLSDNWKLCEPLKTAEDVKT 271
Query: 279 LKDGLDTVYSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEA 329
LK+ L VY + A + P P+ C + N G +L + + +
Sbjct: 272 LKNFLQEVYIDLAMVNYPYETNFLAPLPGNPINVFCQHLTNVSLTGKPLLLALHGAI--S 329
Query: 330 DSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIK 388
Y G + C ++ D+GW++Q+C+EMV+P+ D N M++P WN+T Y
Sbjct: 330 VYTNYTG-KTNCISMKNAEPGLDDQGWDYQACTEMVMPICTDGINDMFEPVKWNITDYNN 388
Query: 389 NCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
C ++Y VSP+P V YG + S SNIIFSNG+ DP++ GG ++ +LS
Sbjct: 389 ICFKKYSVSPQPYLVCEEYGCKNFN-----SASNIIFSNGLLDPWASGGVLR-----NLS 438
Query: 449 FSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
S L +H LDL E +DP + RK + WI +Y
Sbjct: 439 QSAIAILMPDAAHHLDLRETNSNDPYSVRMTRKFHQFWIYQWIREY 484
>gi|350415944|ref|XP_003490799.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Bombus impatiens]
Length = 494
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 156/467 (33%), Positives = 251/467 (53%), Gaps = 27/467 (5%)
Query: 43 EQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID 102
E + +E + +DHFS+ TF RY++N+ + ++PI + G E I+
Sbjct: 40 ELYSGKYKYEIKTIDMPVDHFSFSVPD--TFKLRYLVNNTWQIKKDAPIFFYTGNEGNIE 97
Query: 103 DNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA 162
+ Q +GF ++ A +F AL++ EHRYYG+S+P+G++ + + H GY S QALADY
Sbjct: 98 NFAQNTGFMWDIAPEFGALLIFAEHRYYGESMPYGNK--SYMDLNHLGYLTSRQALADYV 155
Query: 163 SILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
++ ++K K ++PVI G SYGG L+ W R+KYPHVV G++ASSAPIL + +T
Sbjct: 156 DLIQYVKSKPEYKYSPVIVFGGSYGGMLSAWIRMKYPHVVQGAIASSAPILQFSGVTECE 215
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT---EL 279
+ +V+ DF+ C + I++SW+ I NI S G L+ +K C PLKN++ +L
Sbjct: 216 AFVRIVTSDFKTAHTNCPKLIRRSWNTIINITSSDKGREWLADSWKLCQPLKNSSDVQQL 275
Query: 280 KDGLDTVYSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEAD 330
L+ +Y A + P PVK +C + N G ++L I + +
Sbjct: 276 MSYLEDIYINLAVVNYPYEANFLTPLPAYPVKAVCKHLTNQSLVGTELLIAIHKAI---N 332
Query: 331 SLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKN 389
+ ++C D GW +Q+C+EMV+P+ D N M++P WNL +YI +
Sbjct: 333 IFTNYSSETKCLNLNDSVPQLGAVGWPFQACTEMVMPICSDGINDMFKPRSWNLDRYIND 392
Query: 390 CKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
C +QY V P+P+ V YG D+ + +NI+FSNG+ DP+S GG ++ + +
Sbjct: 393 CMKQYSVKPQPNLVCEQYGCKDLS-----TATNIVFSNGLMDPWSSGGVLQ--NLSSSAV 445
Query: 450 SQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
+ + +H LDL + +DP +V RK ++ WI +Y D K
Sbjct: 446 AVIIPESAHHLDLRSSNANDPYSVVLARKYHRFFIKKWIQEYRDKNK 492
>gi|50540452|ref|NP_001002694.1| lysosomal Pro-X carboxypeptidase precursor [Danio rerio]
gi|49903247|gb|AAH76507.1| Prolylcarboxypeptidase (angiotensinase C) [Danio rerio]
Length = 490
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 181/507 (35%), Positives = 266/507 (52%), Gaps = 44/507 (8%)
Query: 1 MKSSQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNH---GFETFFYN 57
+ SS SF+ LL++IF+ F L L+PR R +Q S H + T + +
Sbjct: 3 LTSSALSFR--LLILIFI---FSCKALALKPRF----LGRNTDQSFSQHRSVSYNTLYID 53
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q IDHF + TF QRY++N + W PIL + G E I +GF ++ A +
Sbjct: 54 QQIDHFGFLENG--TFKQRYLLNDQHWHKDGGPILFYTGNEGDITWFCNNTGFMWDVAEE 111
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI-KDKYNATH 176
AL+V EHRYYG+S+PFG + + NA++ Y S Q LAD+A ++ + K + A
Sbjct: 112 LGALLVFAEHRYYGESLPFG--EESYSNAKYLNYLTSEQVLADFAVLIKALKKSQPGAEK 169
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
+ VIAIG SYGG LA W R+KYP+ V+G+LA+SAPI + I P ++ VV++DF +
Sbjct: 170 SSVIAIGGSYGGMLAAWLRMKYPNAVVGALAASAPIWQF--IVPCGEFYRVVTRDFTISG 227
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTEL---KDGLDTVYSEAAQY 293
C +I+ SW+ ID +++ +GL LS+ F C PLK+ ++ K L + A
Sbjct: 228 SNCSSSIRSSWAAIDRLSATGEGLQWLSQTFGLCGPLKSQEDVFGFKAWLQETWVNLAMV 287
Query: 294 DTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN 344
D P PVK +C ++ D L GV +A + Y+ +N
Sbjct: 288 DYPYEADFLQPLPAWPVKVVCKNLQFNKGVSDKQLLN---GVSQAVRVYYNYTGDAVCLN 344
Query: 345 EDR--TGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPS 401
+ TG+ GW +QSC+EMV+PM D N M++P+PW+ + C Q+GV PR
Sbjct: 345 TSKTATGNLGFLGWFYQSCTEMVMPMCTDGINDMFEPQPWSFQAFSDECYNQFGVRPRED 404
Query: 402 WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLD 461
W T YGG +I + SNIIFSNG DP+ GG K+ L+ D G+H LD
Sbjct: 405 WAETVYGGRNI-----HAHSNIIFSNGNLDPWMSGGVTKSLSESLLAIMID--GGAHHLD 457
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L + DP +++ R EV+ + WI
Sbjct: 458 LRYNNELDPQSVIKARSLEVQYFKQWI 484
>gi|340710015|ref|XP_003393594.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Bombus
terrestris]
Length = 494
Score = 275 bits (702), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 156/467 (33%), Positives = 252/467 (53%), Gaps = 27/467 (5%)
Query: 43 EQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID 102
E + +E N +DHFS+ TF RY++N+ + ++PI + G E I+
Sbjct: 40 ELYSGKYKYEIKTINMPVDHFSFSVPD--TFKLRYLVNNTWQIRKDAPIFFYTGNEGNIE 97
Query: 103 DNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA 162
+ Q +GF ++ A +F AL++ EHRYYG+S+P+ ++ + + H GY S QALADY
Sbjct: 98 NFAQNTGFMWDIAPEFGALLIFAEHRYYGESMPYNNK--SYMDLNHLGYLTSRQALADYV 155
Query: 163 SILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
++ ++K K ++PVI G SYGG L+ W R+KYPHVV G++ASSAPIL + +T
Sbjct: 156 DLIQYVKSKPEYKYSPVIVFGGSYGGMLSAWIRMKYPHVVQGAIASSAPILQFSGVTECE 215
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG 282
+ +V+ DF+ C + I++SWS I NI S G L+ +K C PLKN+++++
Sbjct: 216 AFVRIVTSDFKTAHTNCPKLIRRSWSTIINITSSDKGREWLADSWKLCQPLKNSSDVQQL 275
Query: 283 ---LDTVYSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEAD 330
L+ +Y+ A + P PVK +C + N G ++L I +
Sbjct: 276 ISYLEDIYTNLAVVNYPYEANFLTPLPAYPVKAVCKHLTNQSLVGTELLIAIHKAI--NI 333
Query: 331 SLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKN 389
Y G ++C D GW +Q+C+EMV+P+ D N M++P WNL +Y K+
Sbjct: 334 FTNYTG-ETKCLNLNDSVPQLGAVGWPFQACTEMVMPICSDGINDMFKPRSWNLDRYTKD 392
Query: 390 CKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
C +QY V P+P+ + YG D+ + +NI+FSNG+ DP+S GG ++ + +
Sbjct: 393 CIKQYSVKPQPNLICEQYGCKDLS-----TATNIVFSNGLMDPWSSGGVLQ--NLSSSAV 445
Query: 450 SQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
+ + +H LDL + +DP ++ RK ++ WI +Y D K
Sbjct: 446 AIIIPESAHHLDLRSSDANDPYSVILARKYHRFFIKKWIQEYRDKNK 492
>gi|380027987|ref|XP_003697693.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Apis florea]
Length = 491
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 156/473 (32%), Positives = 258/473 (54%), Gaps = 31/473 (6%)
Query: 35 RIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAF 94
+I + R+ + ++ + +E + +DHFS+ TF RY+IN + N+PI +
Sbjct: 33 KIEQMRLTQ--NTKYKYEIKTIDMPVDHFSFSVSD--TFKLRYLINGTWQKTNNAPIFFY 88
Query: 95 LGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNS 154
G E I+ Q +GF ++ A +F+AL+V EHRYYG+S+P+ ++ + + H GY S
Sbjct: 89 TGNEGNIEIFAQNTGFMWDIAPEFEALLVFAEHRYYGESMPYSNK--SYTDLNHLGYLTS 146
Query: 155 AQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILY 214
QALADY ++ ++K K ++P+I G SYGG L+ W R+KYPH+V G++ASSAPIL
Sbjct: 147 QQALADYIDLIQYLKSKPKYKNSPIIVFGGSYGGMLSAWIRMKYPHIVQGAIASSAPILQ 206
Query: 215 YGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK 274
+ IT ++ +V+ DF+ C + I+KSW+ I NI S +G LS +K C PLK
Sbjct: 207 FTGITECESFLRIVTSDFKKAHSNCPKLIRKSWNIITNITSTNEGKKWLSDNWKLCQPLK 266
Query: 275 NTT---ELKDGLDTVYSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKI 322
N +L L +Y+ A + P P+ +C + N G D+L I
Sbjct: 267 NENNIEQLISYLQDIYTNLAMVNYPYKANFLAPLPAYPINAVCKHLTNESLTGTDLLIAI 326
Query: 323 AAGVVEADSLEYDGNNSRCYINEDRTGDESDE-GWEWQSCSEMVVPMGKDK-NSMYQPEP 380
+ + + ++C +N + + + D GW +Q+C+EMV+P+ D N M++P
Sbjct: 327 KNAI---NIFTNYTSETKC-LNLNNSSPQLDAIGWSFQACTEMVMPICSDGINDMFKPHT 382
Query: 381 WNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
WNL +Y +C +QY V P+P+ + YG D+ + +NI+FSNG+ DP+S GG ++
Sbjct: 383 WNLDEYSNDCIKQYSVKPQPNLICEEYGCKDLS-----TATNIVFSNGLMDPWSSGGVLR 437
Query: 441 TYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
+ + + + +H LDL + +DP +V RK + WI +Y++
Sbjct: 438 --NLSSSAIAIIIPESAHHLDLRSSNPNDPFSVVLARKYHRFFITKWINEYHE 488
>gi|328778095|ref|XP_623670.3| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Apis mellifera]
Length = 492
Score = 271 bits (694), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 161/503 (32%), Positives = 265/503 (52%), Gaps = 32/503 (6%)
Query: 6 FSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRIL--EQKDSNHGFETFFYNQTIDHF 63
F+ L + ++ ST H + L G ++S + E + + +E + +DHF
Sbjct: 4 FNISLLFFITLWQSTLQH---IFLNKFYGNYQKSLNIQNELHSAKYRYEIKTIDMPVDHF 60
Query: 64 SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIV 123
S+ TF RY+IN + N+PI + G E I+ Q +GF ++ A +F AL+V
Sbjct: 61 SFS--VLNTFKLRYLINGTWQKTNNAPIFFYTGNEGNIETFAQNTGFMWDIAPEFGALLV 118
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIG 183
EHRYYG+S+P+ ++ A + H GY S QALADY ++ ++K K ++P+I G
Sbjct: 119 FAEHRYYGESMPYNNKSYA--DLNHLGYLTSQQALADYVDLIQYLKSKPKYKNSPIIVFG 176
Query: 184 ASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTI 243
SYGG L+ W R+KYPH+V G++ASSAPIL + IT ++ +V+ DF+ C + I
Sbjct: 177 GSYGGMLSAWIRIKYPHIVQGAIASSAPILQFTGITECESFLRIVTSDFKKAHSNCPKLI 236
Query: 244 KKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG---LDTVYSEAAQYDTPSNI- 299
+KSW+ I NI S +G LS +K C PLKN +++ L +Y+ A + P
Sbjct: 237 RKSWNIIMNITSTNEGKKWLSDNWKLCQPLKNENDIEQLISYLQDIYTNLAMVNYPYKAN 296
Query: 300 --------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDE 351
P+ +C + N G ++L I + + + ++C + T
Sbjct: 297 FLAPLPAYPINAVCKHLTNESLTGIELLIAIKNAI---NIFTNYTSETKCLNLNNSTPQL 353
Query: 352 SDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGH 410
GW +Q+C+EMV+P+ D N M++P WNL +Y +C +QY V P+P+ + YG
Sbjct: 354 DAIGWSFQACTEMVMPICSDGINDMFKPHTWNLDEYSNDCIKQYSVKPQPNLICEKYGCK 413
Query: 411 DIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDP 470
D+ + +NI+FSNG+ DP+S GG ++ + + + + +H LDL + +DP
Sbjct: 414 DLS-----TATNIVFSNGLMDPWSSGGVLR--NLSSSAIAIIIPESAHHLDLRSSNPNDP 466
Query: 471 DWLVQQRKTEVKIMQGWITQYYD 493
+V RK ++ WI +Y++
Sbjct: 467 FSVVLARKYHRFFIKKWINEYHE 489
>gi|383861707|ref|XP_003706326.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Megachile
rotundata]
Length = 493
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 156/472 (33%), Positives = 255/472 (54%), Gaps = 30/472 (6%)
Query: 35 RIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAF 94
R RSRI E ++ + +E + +DHF + S F RY++N + N+PI +
Sbjct: 32 RQFRSRI-ELVNAKYKYEYKTIDMPVDHFDFA--SVDKFKLRYLMNDTWVKTNNAPIFFY 88
Query: 95 LGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNS 154
G E I+ Q SGF ++ A +F AL++ EHRYYG+S+P+G++ + + ++ GY +S
Sbjct: 89 TGNEGDIEGFAQNSGFMWDIAPEFGALLIFAEHRYYGESMPYGNK--SYTDIKYLGYLSS 146
Query: 155 AQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILY 214
QALADY ++ +++ H+PVI G SYGG L+ W R+KYPH+V G++A SAPIL
Sbjct: 147 EQALADYVDLIQYLRSDSKHKHSPVIVFGGSYGGMLSAWMRMKYPHIVQGAIACSAPILQ 206
Query: 215 YGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK 274
+ T + +V+ DF+ C + I+KSW+ I+NI S +G LS+ +K C PLK
Sbjct: 207 F--TTECEVFSRIVTSDFKMAHRNCPKLIRKSWNAINNITSNDEGKKWLSENWKLCQPLK 264
Query: 275 NTTEL---KDGLDTVYSEAAQYDTP---------SNIPVKRICNAIENAPNCGDDILCKI 322
N ++ K L +Y A + P P+K C + N G ++L +
Sbjct: 265 NENDVEVFKSYLSDIYGNFAMVNYPYASDFLAPLPPFPIKVACEYLTNESLVGKELLTAL 324
Query: 323 AAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPW 381
V + Y G ++C T +D GW +Q+C+EMV+P+ D N M++P PW
Sbjct: 325 HDAVNVFTN--YTG-ETKCLNLNTSTPQLNDRGWHFQACTEMVMPLCSDGINDMFEPTPW 381
Query: 382 NLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKT 441
N +Y K C++ + + P+P+ YG D+ + +NI+FSNG+ DP+S GG ++
Sbjct: 382 NFEEYSKECEKLFSIKPQPNMACNQYGCEDLS-----TATNIVFSNGLLDPWSSGGVLR- 435
Query: 442 YHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
+ + + + + G+H LDL + +DP +V R + WI +Y++
Sbjct: 436 -NLSESAIAIIIPEGAHHLDLRGSHTNDPFSVVIARNYHRYYINKWIQEYHE 486
>gi|221044030|dbj|BAH13692.1| unnamed protein product [Homo sapiens]
Length = 391
Score = 267 bits (683), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 148/397 (37%), Positives = 228/397 (57%), Gaps = 28/397 (7%)
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
++ A + KA++V EHRYYG+S+PFG + K++RH + S QALAD+A ++ H+K
Sbjct: 2 WDVAEELKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRT 59
Query: 172 Y-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
A + PVIAIG SYGG LA WFR+KYPH+V+G+LA+SAPI + D+ P + +V+
Sbjct: 60 IPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTT 119
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYS 288
DFR + C ++I +SW I+ +++ GL L+ C+PL ++ LKD + +
Sbjct: 120 DFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWV 179
Query: 289 EAAQYDTP--SNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNN 338
A D P SN P+K +C ++N PN D +L + + +A ++ Y+
Sbjct: 180 NLAMVDYPYASNFLQPLPAWPIKVVCQYLKN-PNVSDSLLLQ---NIFQALNVYYNYSGQ 235
Query: 339 SRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGV 396
+C I+E T GW +Q+C+E+V+P + + M++P WNL + +C +Q+GV
Sbjct: 236 VKCLNISETATSSLGTLGWSYQACTEVVMPFCTNGVDDMFEPHSWNLKELSDDCFQQWGV 295
Query: 397 SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
PRPSW+ T YGG +I S +NI+FSNG DP+S GG T D + ++ G
Sbjct: 296 RPRPSWITTMYGGKNIS-----SHTNIVFSNGELDPWSGGG--VTKDITDTLVAVTISEG 348
Query: 457 SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
+H LDL DP ++ R EV+ M+ WI +YD
Sbjct: 349 AHHLDLRTKNALDPMSVLLARSLEVRHMKNWIRDFYD 385
>gi|347964904|ref|XP_309189.4| AGAP000994-PA [Anopheles gambiae str. PEST]
gi|333466532|gb|EAA04920.4| AGAP000994-PA [Anopheles gambiae str. PEST]
Length = 481
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 166/479 (34%), Positives = 264/479 (55%), Gaps = 44/479 (9%)
Query: 39 SRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN----SPILAF 94
S + + + ++T + IDHF+Y E+ TF RY+IN + GG SPIL +
Sbjct: 14 SAAVRDGTATYVYQTKTIDVPIDHFTYTGEA--TFKLRYLINDTYAPGGADLPASPILFY 71
Query: 95 LGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNS 154
G E I+ Q +GF +E A + KA ++ +EHR+YG S+PFG+ A+ + ++ GY S
Sbjct: 72 AGNEGDIELFAQNTGFMWELAPKLKATLLFVEHRFYGHSLPFGN--ASYDSPKNLGYLTS 129
Query: 155 AQALADYASILLHIKDKYNATHA-PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL 213
QALAD+A +L + T A PVIA G SYGG LA W R+KYPH+V G++A+SAP+
Sbjct: 130 EQALADFALVLRTLNPPNGTTRARPVIAFGGSYGGMLAAWIRIKYPHLVAGAIAASAPVR 189
Query: 214 YYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP 272
+ +T ++ +++ ++ + +C I++SW+ + N ++ DGL +L++KFK CT
Sbjct: 190 QFAGVTDCGIFNQILTSVYQVAYTADCADNIRRSWTTLQNYSTSADGLRLLNEKFKFCTN 249
Query: 273 LKNTTE----LKDGLDTVYSEAAQ--YDTPSNI-------PVKRICNAIENAPNCGDDIL 319
L T+ L D L VY A Y PS+ PV+ C + G ++L
Sbjct: 250 LTKGTDVTETLFDYLTDVYGNLAMINYPYPSSFLAPVPAYPVREFCGRLAQN-YTGVELL 308
Query: 320 CKIAAGVVEADSLEYDGNNSRCYINEDRTGDE-SDEGWEWQSCSEMVVPMGKDK-NSMYQ 377
+ + + + YDG + IN G SD GW++Q+C+EMV+P+ D + M+
Sbjct: 309 DHLQSAL--SIYYNYDGKAACLNINSSYDGTGISDRGWDFQACTEMVMPICADGVHDMFP 366
Query: 378 PEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
P+ WN+ Y C ++YGV PRP+ L YGG L S +NI+FSNG+ DP+S GG
Sbjct: 367 PQQWNMQTYADKCFKKYGVHPRPANALLNYGGE----FLDASITNIVFSNGLLDPWSGGG 422
Query: 438 WVKTYHFFDLSFSQDLNL-----GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
++ S ++++ + G+H LDL + +DP + + R V+ +Q W+T+Y
Sbjct: 423 VLR-------SSNENVKIVLIPEGAHHLDLRASNPADPASVTRARAVHVQNIQKWLTEY 474
>gi|320166581|gb|EFW43480.1| prolylcarboxypeptidase [Capsaspora owczarzaki ATCC 30864]
Length = 496
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 160/464 (34%), Positives = 248/464 (53%), Gaps = 35/464 (7%)
Query: 43 EQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID 102
E+ + ++ ++ QT+DHF + ++ TF QRY+I++ W G P+ + G E I+
Sbjct: 52 ERLKGQYNYDVKWFTQTLDHFRF--DTNATFQQRYLISTANWNG-YGPMFFYTGNEGDIE 108
Query: 103 DNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA 162
+GF +E A ++ AL+V EHRYYG+++PFG + L GY + QALAD+A
Sbjct: 109 WFADNTGFVWEIAAEYNALVVFAEHRYYGQTMPFGDKSFDLDKV---GYLTTEQALADFA 165
Query: 163 SILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
++ +K + N + PV+A G SYGG LA WFRLKYP+VV G++A+SAPI+Y+ D+T
Sbjct: 166 ILIPALKAQLNVPNLPVVAFGGSYGGMLAGWFRLKYPNVVDGAIAASAPIVYFQDLTSTE 225
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---- 278
++ + + DF T C I+ +S++D ++ GL +SK FK C L+
Sbjct: 226 IFNEIATNDFALTDARCPNIIRDGFSKVDALSKTAAGLQSISKAFKLCGTLQPADYATFI 285
Query: 279 --LKDGLDTVYSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKI--AAGVV 327
L+ GL Y Y SN PV C A+ + D + + A GV
Sbjct: 286 GWLEAGL--TYMAMTDYPYASNFLQPMPAWPVDASCKALLSTFAGTSDSVAALNTAVGVY 343
Query: 328 EADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKY 386
Y G + I+ T D GW++QSC+EMV+PMG D + M+ PW+L +
Sbjct: 344 ----YNYTGQTACNNISSQATSDLGVLGWDYQSCTEMVMPMGSDGIHDMFPAAPWDLKSF 399
Query: 387 IKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
+ CK+++ V PRP+W T +GG +I + SNI+FSNGM DP+S GG +
Sbjct: 400 NEYCKKRWNVVPRPTWAATSFGGFNIT-----AGSNIVFSNGMLDPWS-GGSITQIQSQT 453
Query: 447 LSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
L ++ G+H LDL + +DP ++ R E + WI++
Sbjct: 454 LVVV-NIPKGAHHLDLRSSNPADPQDVIDARNVERAQISRWISR 496
>gi|340374641|ref|XP_003385846.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Amphimedon
queenslandica]
Length = 490
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 170/504 (33%), Positives = 254/504 (50%), Gaps = 42/504 (8%)
Query: 4 SQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHF 63
S FS +L ++V + S + LK PR +S++ ++T ++ Q IDHF
Sbjct: 6 SSFSL-FLFILVSYTSGLLNPRSLKNLPR----------GNSESSYEYQTLYFKQPIDHF 54
Query: 64 SYGPESYQTFPQRYVINSKFWGGGNS-PILAFLGAEAPIDDNIQLSGFTYENAHQFKALI 122
++ ES TF QRY++N FW N PI + G E I +GF ++ A +FKAL+
Sbjct: 55 NF--ESNVTFSQRYLLNDAFWDKDNGGPIFFYCGNEGDITWFANNTGFVWDIAPEFKALV 112
Query: 123 VILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAI 182
V EHRYYG ++PFG+ A N GY S QALAD+ ++ +K KY PV+A
Sbjct: 113 VFAEHRYYGNTLPFGAESYA--NLSTLGYLTSEQALADFVLLINDLKGKYG--DVPVVAF 168
Query: 183 GASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQT 242
G SYGG L+ W R+KYP VV+GS+A+SAPI + + + ++S S C+
Sbjct: 169 GGSYGGMLSAWIRMKYPSVVVGSIAASAPIWQFPGLCDCGKANEIISSTMSQYSTNCYNN 228
Query: 243 IKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQ--YDTPSNI- 299
+ SW I+ + GL++LS F C PLK++ L L V+ A Y P+N
Sbjct: 229 VLSSWDIINKTGTTSGGLSLLSTTFSLCQPLKDSLTLMSWLQDVWFNLAMMNYPYPANFM 288
Query: 300 ------PVKRICNAIENAPNCGDDILCKIAAGVVEADSL--EYDGNNSRCYINEDRTGDE 351
P+ C + DD + K+ + A + Y G +S +N++ + D
Sbjct: 289 EPLPAWPLNVTCQKLSQTFTTEDDDI-KLLQALSTAMQVYYNYSGQSSCLDLNKESSTDL 347
Query: 352 SDEGWEWQSCSEMVVPM---GKDKNSMYQPEPWNLTKYIKNCKEQY-GVSPRPSWVLTYY 407
+GW +Q C+EM +PM G D N + + W + Y+KNC++ + G+ PRP W+ Y
Sbjct: 348 GAKGWSYQYCTEMAMPMCSKGGD-NDAFPKQQWTVNNYVKNCQDSFPGIQPRPYWIEKVY 406
Query: 408 GGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKK 467
G +I + SNI+FSNG DP+S G V + D + +N G+H LDL EA
Sbjct: 407 NGKNIS-----AFSNIVFSNGDLDPWSAG--VVLDNISDSLIAVIINDGAHHLDLREANP 459
Query: 468 SDPDWLVQQRKTEVKIMQGWITQY 491
D D + R + WI Y
Sbjct: 460 MDTDSVKAARNIHKDNINKWIGGY 483
>gi|225708852|gb|ACO10272.1| Lysosomal Pro-X carboxypeptidase precursor [Caligus rogercresseyi]
Length = 492
Score = 261 bits (667), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 168/472 (35%), Positives = 247/472 (52%), Gaps = 34/472 (7%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS-PI 91
LG + +E+ + ++T +Y +DHF + S TF QRY+IN W + PI
Sbjct: 23 LGGVNVVHGVEELQGYYTYKTEWYENLVDHFGFAINS--TFKQRYLINDTHWDSQHGGPI 80
Query: 92 LAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALK-NARHRG 150
+ G E I+ Q +GF ++ A +F ALI+ +EHRYYGKS+PFG K +LK + + G
Sbjct: 81 FLYAGNEGDIEAFAQNTGFMWDIAPEFNALIIFIEHRYYGKSLPFG--KDSLKPDPKMNG 138
Query: 151 YFNSAQALADYASILLHIKD-KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASS 209
Y S QALADYA + K + A +PVI G SYGG LA W R+KYPH+V G++A S
Sbjct: 139 YLTSEQALADYARFVTEFKSTRKGAKDSPVIVFGGSYGGMLAAWMRIKYPHIVNGAIAGS 198
Query: 210 APILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKT 269
AP+ + TP + +V+ D+ S+ C I KSW+ ID + GL L K
Sbjct: 199 APVAQFD--TPCLNFGRIVTSDYSFYSKSCSGVISKSWAAIDQVGKNDKGLQRLQSLLKL 256
Query: 270 CTPLKNTTELKDGLDTVYSEAAQYDTP---------SNIPVKRICNAIE--NAPNCGDDI 318
C+ K+ LK L V++ A + P PVK IC + P I
Sbjct: 257 CSKPKSVEPLKSFLTDVWTSVAMMNYPYPTEFLMPLPGNPVKYICGKMSPTTVPTDPVAI 316
Query: 319 LCKIAAGV-VEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMG-KDKNSMY 376
L + G+ V A+ Y G ++C I+ D + W++QSC+EMV+P + + M+
Sbjct: 317 LKYVYEGLNVYAN---YSG-KAKC-IDMDNADQIGADMWDYQSCTEMVMPFCYNNVDDMF 371
Query: 377 QPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRG 436
+ WN T Y + C+E++ V+PRP YG +K + SNIIFSNG+ DP+S G
Sbjct: 372 EKSDWNFTTYAQGCQERWKVTPRPKMADIMYGSKKLK-----AASNIIFSNGLLDPWSSG 426
Query: 437 GWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
G +K+ D S + G+H LDL + +DP ++ RK E ++ WI
Sbjct: 427 GIMKSIS--DSVVSIIIPEGAHHLDLRGSNPNDPVSVIHARKLERSFIRKWI 476
>gi|170055552|ref|XP_001863632.1| lysosomal Pro-X carboxypeptidase [Culex quinquefasciatus]
gi|167875507|gb|EDS38890.1| lysosomal Pro-X carboxypeptidase [Culex quinquefasciatus]
Length = 475
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 171/484 (35%), Positives = 262/484 (54%), Gaps = 48/484 (9%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL 92
LG + + Q + + ++T + + +DHFSY ES TF RY+ N + G+ PIL
Sbjct: 8 LGLAVLALLTGQSSAKYEYQTKYLDVPLDHFSYVNESV-TFKLRYLTNDTYNPDGSGPIL 66
Query: 93 AFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYF 152
+ G E I+ Q +GF +E A + KA +V EHR+YGK++PFG+ A+ ++ RH GY
Sbjct: 67 FYTGNEGDIELFAQNTGFMWELAPKLKASLVFAEHRFYGKTLPFGN--ASYESPRHLGYL 124
Query: 153 NSAQALADYASILLHIKDKYNATHA-PVIAIGASYGGELATWFRLKYPHVVIGSLASSAP 211
S QALAD+A +L I A PV+A G SYGG LA WFR+KYPH+V G++A+SAP
Sbjct: 125 TSEQALADFAYLLAQINPSNRTVRARPVVAFGGSYGGMLAAWFRMKYPHMVAGAIAASAP 184
Query: 212 ILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
I + T ++ +++ F ++EC I +SW + N +S DGL L++KFK C
Sbjct: 185 IRQFD--TDCGVFNQILTSVFSVAYTKECSLNIGRSWDVLKNYSSSADGLKTLNEKFKFC 242
Query: 271 TPLKNTTELK----DGLDTVYSEAAQYDTPSN---------IPVKRICNAIENAPN-CGD 316
+ L ++ D L VY A + P N PV+ C + A N G
Sbjct: 243 SNLTKAEDVTGTFFDYLIDVYGNLAMINYPYNSSFLAPVPAYPVREFCGRL--AQNYTGL 300
Query: 317 DILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDES--DEGWEWQSCSEMVVPMGKDKNS 374
++L K+ EA S+ Y+ N +N + + D S D GW +QSC+EMV+PM + S
Sbjct: 301 ELLTKLQ----EALSIYYNYNGQAKCLNINSSYDSSMGDLGWGFQSCTEMVMPMCAEGGS 356
Query: 375 --MYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
M+ +PW+ K+ +C +++GV PRP YGG ++ SNI+FSNG+ DP
Sbjct: 357 RDMFPKQPWDEKKFSDDCFKKFGVRPRPIAPAYIYGGQYLE-----GASNIVFSNGLLDP 411
Query: 433 FSRGGWVKTYHFFDLSFSQDLNL-----GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGW 487
+S GG ++ S + ++ + G+H +DL A +DP + R+ V+ +Q W
Sbjct: 412 WSGGGVLR-------SGNSNIKIVLIPEGAHHIDLRAADDNDPGSVRSARQIHVQNIQLW 464
Query: 488 ITQY 491
+ QY
Sbjct: 465 LKQY 468
>gi|195437264|ref|XP_002066561.1| GK24560 [Drosophila willistoni]
gi|194162646|gb|EDW77547.1| GK24560 [Drosophila willistoni]
Length = 480
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 159/449 (35%), Positives = 240/449 (53%), Gaps = 32/449 (7%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHFS+ + +F RY+ N+ F GN SPI + G E I+ Q +GF +E A +
Sbjct: 43 LDHFSFLKNA--SFNIRYLYNNSFADKGNKRSPIFFYTGNEGDIEWFAQNTGFLWELAEK 100
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
A++V EHRYYGKS+PFG + YF Q L D+A ++ ++K N
Sbjct: 101 QGAVVVFAEHRYYGKSLPFGPNTFNKTMPENLAYFTVEQTLEDFALLITYLK---NGADL 157
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-S 236
PV+A G SYGG LA WFR+KYPH+VIGSLA+SAPIL + ITP ++ + + F +
Sbjct: 158 PVVAFGGSYGGMLAAWFRMKYPHIVIGSLAASAPILQFPGITPCDIFNKITTSVFHTAYN 217
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK---NTTELKDGLDTVYSEAAQY 293
C I KSW I+N+AS G +S F C PLK + T+L D ++ VY A
Sbjct: 218 GNCTVNIGKSWKAIENVASTDAGKKQISDAFHLCNPLKGPDDLTKLLDYIELVYGNLAMA 277
Query: 294 DTPSN---------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN 344
+ P N PV+++C ++ D+L +A + A Y G I+
Sbjct: 278 NYPYNNSFLAPLPAYPVRQMCFYLKELHKTDADLLQAMANAL--AVYTNYTGTVKCLDIS 335
Query: 345 EDRTGDESDEGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
+ D D GW QSC++MV+P ++MY+ W+ + +NC Y ++P+P+ +
Sbjct: 336 SNSNAD--DSGWNIQSCNQMVMPFCSNSSDTMYRTSTWDFKEVSENCVRDYHLTPKPNDI 393
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
+ YGG D+ S SNIIFSNG+ DP+S GG ++ + D + G+H LDL
Sbjct: 394 ILRYGGRDLS-----SISNIIFSNGLLDPWSGGGVLQAPN--DRVHVIIIPEGAHHLDLR 446
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+++ +DP ++ R+ E I+ WI ++Y
Sbjct: 447 KSEPADPPSVIDARQKEATIIASWIEEFY 475
>gi|198429605|ref|XP_002127825.1| PREDICTED: similar to prolylcarboxypeptidase [Ciona intestinalis]
Length = 494
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 152/453 (33%), Positives = 236/453 (52%), Gaps = 32/453 (7%)
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYE 113
++ Q +DHF + + TF QRY ++ ++W + PI + G E I+ I+ +G ++
Sbjct: 35 LYFKQNLDHFDFTINA--TFTQRYFVSEQYWTKMDGPIFFYTGNEGDIELFIKNTGLMWD 92
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK-Y 172
A FKA++V EHRYYGKS PFG+ K + K + Y + QALAD+A ++ HIK
Sbjct: 93 IAPMFKAMVVFAEHRYYGKSKPFGNLKPSTKTIKEFSYLTAEQALADFAILVKHIKSTDS 152
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
A ++PV+ G SYGG L+ WFRLKYPH+V G++A+SAP+LY+ + Y+ V+ +F
Sbjct: 153 KAKNSPVVVFGGSYGGMLSAWFRLKYPHIVTGAIAASAPVLYFPSTVKCSQYNEAVTNNF 212
Query: 233 RDT--SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEA 290
E C I+ W ++ A KP GL +L + F C+ + ++T ++ + ++
Sbjct: 213 LSVQNGETCVANIRNVWKTMNETAKKPGGLKLLGEIFHLCSAINSSTAVESFIKDIFGNM 272
Query: 291 AQYDTP------SNI---PVKRICNAIENAPNCGDDILCKI--AAGVVEADSLEYDGNNS 339
A D P SNI PV + C + G D+L + A GV + Y G +
Sbjct: 273 AMVDYPYANNFLSNIPAWPVNKTCQHLSEPNLQGLDLLQAMHSAIGVYQ----NYTG-SV 327
Query: 340 RCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVS 397
+CY + T S W + +C MV+P + M+ + W + K+C ++YG+
Sbjct: 328 KCYNVKTTETSKLSTTLWNYMTCGAMVMPFCANGVTDMFPVKNWTQESFDKSCFKKYGIK 387
Query: 398 PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG- 456
RP W LT +GG NI+F+NG+ DP+ GG S L G
Sbjct: 388 SRPEWALTDFGGSK-----AVEAGNIVFTNGLLDPWHVGG---VPEMKSESVVSILMWGA 439
Query: 457 SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
+H LDL A +DP +V+ RKT+VK + WI+
Sbjct: 440 AHHLDLRHANDADPSSVVEARKTQVKHIAKWIS 472
>gi|328702869|ref|XP_001949478.2| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Acyrthosiphon
pisum]
Length = 507
Score = 258 bits (660), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 159/486 (32%), Positives = 250/486 (51%), Gaps = 49/486 (10%)
Query: 42 LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPI 101
L + + ++T ++ +DHFS+ + TF +Y+IN FW N PI + G E I
Sbjct: 14 LHLANGGYVYKTEYFTVPVDHFSF--TNNDTFRMKYLINDTFWERENGPIFFYAGNEGAI 71
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
+ + +GF +E A +F+AL+V EHRYYG S+P+G+R + + GY S QALADY
Sbjct: 72 EMFCENTGFMWEIAEEFRALVVFAEHRYYGVSMPYGNR--SFDDIGRVGYLTSQQALADY 129
Query: 162 ASILLHIK--------------DKYN---------ATHAPVIAIGASYGGELATWFRLKY 198
++ +++ D Y+ + PVIA G SYGG LA WFR+KY
Sbjct: 130 VDLITYLRHNGSYSNRELYQTGDIYDTAVGDTPAPSASNPVIAFGGSYGGMLAAWFRIKY 189
Query: 199 PHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPD 258
P ++ G++ASSAPI + +TP ++ V S + DTS EC TI SW IDN+ D
Sbjct: 190 PAIIEGAIASSAPIWQFTGMTPCNAFYRVTSSVYTDTSAECGLTILASWKAIDNVTKTVD 249
Query: 259 GLAILSKKFKTCTPLKNTTE---LKDGLDTVYSEAAQ--YDTPSNI-------PVKRICN 306
G A LS+K+ C+PL N + LK+ + +Y A Y P+N PVK +C
Sbjct: 250 GKAWLSQKWNLCSPLTNDNDVSILKEWVSELYVNLAMINYPYPANFLTPLPGHPVKEVCK 309
Query: 307 AIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVV 366
++N +L + G+ + Y G + I ++GW +QSC+E+V+
Sbjct: 310 PMKNHKEDDFTLLGSVFRGL--SVYFNYTGTSKCLDILTSSAPTLGEKGWSYQSCTEIVM 367
Query: 367 PMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIF 425
PM + +++ +PWN + + C E +GV P + YGG ++ + SNIIF
Sbjct: 368 PMCSNGIKDIFEKKPWNFEENARYCFETFGVQPSIYAIEKTYGGKNL-----NAASNIIF 422
Query: 426 SNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQ 485
SNG+ DP+S GG ++ + + + +H LDL + DP+ +++ RK ++
Sbjct: 423 SNGLLDPWSSGGVLQ--NISKTVLAVVIPESAHHLDLRASHAQDPESVIKARKFYKNWIK 480
Query: 486 GWITQY 491
WI Y
Sbjct: 481 KWIFHY 486
>gi|195385893|ref|XP_002051639.1| GJ16686 [Drosophila virilis]
gi|194148096|gb|EDW63794.1| GJ16686 [Drosophila virilis]
Length = 469
Score = 257 bits (657), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 155/454 (34%), Positives = 241/454 (53%), Gaps = 33/454 (7%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYE 113
++ +DHFS+ S ++F RY+ N + NS PI + G E I+ Q SGF +E
Sbjct: 31 FSVPLDHFSF--LSNESFSIRYLYNDSYVDKANSQSPIFFYTGNEGDIEWFAQNSGFIWE 88
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A + +AL+V EHRYYGKS+PFG H YF Q L DYA ++ ++ N
Sbjct: 89 LAAKLRALVVFAEHRYYGKSMPFGGDTFNTSKPEHLAYFTVEQTLEDYALLITFLR---N 145
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
PV+A G SYGG LA WFR+KYPH+VIG+LA+SAPIL + +TP + + + F
Sbjct: 146 GQQLPVVAFGGSYGGMLAAWFRMKYPHIVIGALAASAPILQFPGLTPCDIFEKITTSVFE 205
Query: 234 DT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK---DGLDTVYSE 289
+ C I KSW I+++A+ G L F C P+KN +L D L+ VY
Sbjct: 206 TAYNANCSANIGKSWKAIESLAATDAGKKQLGDMFHLCDPVKNGDDLNKFMDYLEEVYGN 265
Query: 290 AAQYDTPSN---------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSR 340
A + P N PV++ C +++ D+L +A+ + Y G S+
Sbjct: 266 LAMVNYPYNSTFLAPLPAYPVRQFCYYLKDLQPNDTDLLASLASAL--GVYTNYTG-ASK 322
Query: 341 C--YINEDRTGDESDEGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYGVS 397
C Y + + + GW+ Q+C++MV+P + ++MY+ W+L K+ + C ++ ++
Sbjct: 323 CVDYKDSNSGANYDSRGWDIQTCNQMVMPFCANNSDTMYRTSTWDLKKFSEKCYTKFHLT 382
Query: 398 PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGS 457
P+P +L YGG +++ + SNIIFSNG+ DP+S GG ++ + + L G+
Sbjct: 383 PKPHDILLRYGGRNLE-----AASNIIFSNGLLDPWSGGGVLRAPN--NQVHVIILPEGA 435
Query: 458 HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
H LDL ++ SDP + RK E I++ WI +
Sbjct: 436 HHLDLRQSHPSDPASVTDARKKEAAIIEQWIRDF 469
>gi|357606522|gb|EHJ65099.1| putative Lysosomal Pro-X carboxypeptidase [Danaus plexippus]
Length = 467
Score = 257 bits (656), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 155/463 (33%), Positives = 247/463 (53%), Gaps = 40/463 (8%)
Query: 48 NHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL 107
++ FET ++N +DHF + + +TF +Y+IN ++W G PI + G E I+ +
Sbjct: 17 DYKFETKWFNVPLDHFGF--QRNETFNIKYLINEEYWDKGGGPIFFYTGNEGQIEVFAKH 74
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
+GF ++ A +FKA +V EHRYYG+S+PFG++ +L N H GY S QALADYA ++ +
Sbjct: 75 TGFMWDIAEEFKAKLVFAEHRYYGQSMPFGNK--SLDN-EHIGYLTSEQALADYADLINY 131
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
++ PVIA G SYGG L+ + R+KYPH+V G++A+SAPI Y + P +H +
Sbjct: 132 LQGNKQRPTYPVIAFGGSYGGMLSAYIRIKYPHLVTGAIAASAPIHMYPGMVPCEVFHRI 191
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK--NTTELKDGLDT 285
V+ F+ E+C + I+ SW + + L K + C P+K + L + L +
Sbjct: 192 VTSSFKIADEKCVKNIRSSWGVLRKFLESQNNTDWLHKNWNLCEPVKPADVNTLMEFLQS 251
Query: 286 VYSEAAQ--YDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDG 336
+Y A Y PS+ PV+ +C + N G ++ I V++ S YDG
Sbjct: 252 MYETLAMVNYPFPSDFLLPLPAQPVRVVCQYL-NETLSGQKLIEAIGK-VIKVYS-NYDG 308
Query: 337 NNSRC--YINEDRTGDESDEGWEWQSCSEMVVPMGKDKNS-MYQPEPWNLTKYIKNCKEQ 393
+ C Y D G+ GW++Q+C+EM++PM N M++P PWN TKY ++C +
Sbjct: 309 -KAPCVDYKKGDDFGNLDASGWDYQACTEMIMPMCTTGNQDMFEPSPWNFTKYAEDCHRK 367
Query: 394 YGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL 453
Y V PR YGG + R+ +NI+FSNG+ DP++ GG + S S +
Sbjct: 368 YNVYPRQEAARIQYGGDRL-----RAATNIVFSNGLLDPWAGGGILN-------SISNSV 415
Query: 454 NL-----GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+H LDL + +DP+ + R + + WI ++
Sbjct: 416 KAVVIIDAAHHLDLMPSNPADPNSVKLARNIHKQNIDKWIREF 458
>gi|255584372|ref|XP_002532920.1| catalytic, putative [Ricinus communis]
gi|223527313|gb|EEF29462.1| catalytic, putative [Ricinus communis]
Length = 245
Score = 255 bits (651), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 121/207 (58%), Positives = 159/207 (76%), Gaps = 2/207 (0%)
Query: 66 GPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIV 123
GP+SY TF Q+YVI+ K W G ++PI A+LG E+P++ +I GF ++NA +F AL V
Sbjct: 38 GPQSYATFQQKYVISFKHWTGAQASAPIFAYLGEESPLNADIHGIGFLFDNAAKFGALTV 97
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIG 183
+EHR+YG S+PF SR+ AL NA RGYFNSAQALADYA ILL+IK +A +P+I IG
Sbjct: 98 FIEHRFYGDSIPFVSRQEALANATLRGYFNSAQALADYAEILLNIKLILSAETSPIIVIG 157
Query: 184 ASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTI 243
SYGG LA+WFRLKYPH+ +G+LASSAPILY+ +ITP Y+S+V+KD+RD SE C TI
Sbjct: 158 GSYGGMLASWFRLKYPHIALGALASSAPILYFDNITPSDAYYSLVTKDYRDASESCSNTI 217
Query: 244 KKSWSEIDNIASKPDGLAILSKKFKTC 270
K+SW E+ +AS+ +GL+ILS+KF TC
Sbjct: 218 KESWLELARVASQENGLSILSEKFHTC 244
>gi|440799092|gb|ELR20153.1| dipeptidyl-peptidase family protein [Acanthamoeba castellanii str.
Neff]
Length = 498
Score = 254 bits (649), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 160/473 (33%), Positives = 245/473 (51%), Gaps = 49/473 (10%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN-------SPILAFLGAEAPIDDNIQL 107
+++QTIDHF+ + TF QRY+ S +W N PI + G E I
Sbjct: 35 WFDQTIDHFNIETQP-ATFRQRYLTFSNYWSSANHGGELRRGPIFFYTGNEGDITAFWDN 93
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
SGF +E A + AL+V EHRYYGK+ PFGS + H GY + QALADYA+++ H
Sbjct: 94 SGFVFELAKSYGALVVFGEHRYYGKTYPFGSGGPDSYSKEHIGYLSVEQALADYATLIEH 153
Query: 168 IKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT--- 223
+K A+H+PVIA G SYGG L+ WFR+KYP VV G+LA+SAPIL+ +++ TT
Sbjct: 154 LKSTLPGASHSPVIAFGGSYGGMLSAWFRMKYPQVVDGALAASAPILWSTNVSSATTGPD 213
Query: 224 ------YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KN 275
Y V+ DFR E C ++++++++ +A P GLA ++K+F C +
Sbjct: 214 SKRPPGYFETVTNDFRAADERCPGLVQQAFAKMLQLAQTPSGLAAIAKQFSLCKNVLPHE 273
Query: 276 TTELKDGLDTVYSEAAQYDTP------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEA 329
L + + A D P + +P I A N DD+L G+ +A
Sbjct: 274 VEHLILWVVNAFGNMAMMDYPYPTGFMAPLPAYPIKVACHLMLNNTDDVL----RGLAQA 329
Query: 330 DSLEYDGNNSRCYIN--------EDRTG---DESDEGWEWQSCSEMV-VPMGKDKNSMYQ 377
L Y+ ++ N D+TG + + W++Q+C E+V P + M+
Sbjct: 330 AGLYYNSSSPLQCFNIWDEFVECADQTGCGTGPAGQSWDYQACGEIVYYPNTNNVTDMFP 389
Query: 378 PEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
P W L +C+ +G++PRP+W+ TY GG +I R S IIFSNG+ DP+ GG
Sbjct: 390 PRDWTLADLNAHCQRTWGITPRPTWLKTYTGGENI-----RYASRIIFSNGLLDPWHGGG 444
Query: 438 WVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
++++ D + + G+H LDL + DP +V+ R E +I+ W+ +
Sbjct: 445 FLES--LSDSLIAIIIKDGAHHLDLRSSDPRDPPSVVEARNHEARIIGKWLAE 495
>gi|225709542|gb|ACO10617.1| Lysosomal Pro-X carboxypeptidase precursor [Caligus rogercresseyi]
Length = 492
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 167/474 (35%), Positives = 246/474 (51%), Gaps = 38/474 (8%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS-PI 91
LG + +E+ + ++T +Y +DHF + S TF QRY+IN W + PI
Sbjct: 23 LGGVNVVHGVEELQDYYTYKTEWYENLVDHFGFAINS--TFKQRYLINDTHWDSQHGGPI 80
Query: 92 LAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALK-NARHRG 150
+ G E + Q +GF ++ A +F ALI+ +EHRYYGKS+PFG K +LK + + G
Sbjct: 81 FLYAGNEGDSEAFAQNTGFMWDIAPEFNALIIFIEHRYYGKSLPFG--KDSLKPDPKMNG 138
Query: 151 YFNSAQALADYASILLHIKD-KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASS 209
Y S QALADYA + K + A +PVI G SYGG LA W R+KYPH+V G++A S
Sbjct: 139 YLTSEQALADYARFVTEFKSTRKGAKDSPVIVFGGSYGGMLAAWMRIKYPHIVNGAIAGS 198
Query: 210 APILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKT 269
AP+ + TP + +V+ D+ S+ C I SW+ ID + GL L K
Sbjct: 199 APVAQFD--TPCLNFGRIVTSDYSFYSKSCSGVISMSWAAIDQVGKDDKGLQRLQSLLKL 256
Query: 270 CTPLKNTTELKDGLDTVYSEAAQYDTP---------SNIPVKRICNAIE--NAPNCGDDI 318
C+ K+ LK L V++ A + P PVK IC + P I
Sbjct: 257 CSKPKSVEPLKSFLTDVWTNVAMMNYPYPTEFLMPLPGNPVKYICRKMSPTTVPTDPVAI 316
Query: 319 LCKIAAGV-VEADSLEYDGNNSRC--YINEDRTGDESDEGWEWQSCSEMVVPMG-KDKNS 374
L + G+ V A+ Y G ++C N D+ G + W++QSC+EMV+P + +
Sbjct: 317 LKYVYEGLNVYAN---YSG-KAKCIDMGNADQIGADM---WDYQSCTEMVMPFCYNNVDD 369
Query: 375 MYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFS 434
M++ WN T Y + C+E++ V+PRP YG +K + SNIIFSNG+ DP+S
Sbjct: 370 MFEKSDWNFTTYAQGCQERWKVTPRPKMADIMYGSKKLK-----AASNIIFSNGLLDPWS 424
Query: 435 RGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
GG +K+ D S + G+H LDL + +DP ++ RK E ++ WI
Sbjct: 425 SGGIMKSIS--DSVVSIIIPEGAHHLDLRGSNPNDPVSVIHARKLERSFIRKWI 476
>gi|307189813|gb|EFN74085.1| Lysosomal Pro-X carboxypeptidase [Camponotus floridanus]
Length = 453
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 155/454 (34%), Positives = 242/454 (53%), Gaps = 36/454 (7%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGG-GNSPILAFLGAEAPIDDNIQLSGFTYENAHQF 118
+DHFS+ + TF RY+IN + N+PI + G E I+ + +GF +E A +F
Sbjct: 18 VDHFSFAVQ--DTFNLRYLINDTWCKTVKNAPIFFYTGNEGRIELFAENTGFLWEIAPKF 75
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
AL++ EHRYYG+S+P+G++ A N R+ GY S QALADY ++ +++ K +P
Sbjct: 76 GALVIFAEHRYYGESLPYGNQSFA--NPRYLGYLTSQQALADYVELIGYLRSKEGFEFSP 133
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VI G SYGG L+ W R+KYPH+V G++A+SAPIL + DI + + + D+ ++
Sbjct: 134 VIVFGGSYGGMLSAWMRIKYPHIVQGAIAASAPILQFTDIVECDAFARIATSDYYVSNPT 193
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTVYSEAAQ--- 292
C Q I+K+W I + S +G LS +K C PLK + LKD L +Y A
Sbjct: 194 CPQLIRKAWKTITEVTSNDEGKKWLSSNWKLCEPLKTEEDVQVLKDFLQEIYINMAMVNY 253
Query: 293 -YDT------PSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINE 345
Y+T P+N P+ C + N+ G +L + + + Y G + C +
Sbjct: 254 PYETNFLAPLPAN-PINVFCQHLTNSSLTGKPLLSTLYRAINVYTN--YTG-KANCTFTK 309
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL 404
+ T DEGW++Q+C+EMV+PM D N M++P WN Y C ++Y VS +P
Sbjct: 310 NTTPTLGDEGWDYQACTEMVMPMCTDGTNDMFEPATWNFDDYNNTCFKKYSVSSQPHLAC 369
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL---GSHCLD 461
YG ++ + +NI FSNG+ DP++ GG ++ +LS S + +H LD
Sbjct: 370 QQYGCKNLDTV-----TNIHFSNGLLDPWTSGGVLR-----NLSSSATAIIIPDAAHHLD 419
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
L E+ +D ++ RK ++ WI +Y F
Sbjct: 420 LRESNSNDSYDVILTRKFHQYSIKKWIAEYQRKF 453
>gi|91088397|ref|XP_972807.1| PREDICTED: similar to prolylcarboxypeptidase [Tribolium castaneum]
gi|270012192|gb|EFA08640.1| hypothetical protein TcasGA2_TC006303 [Tribolium castaneum]
Length = 478
Score = 253 bits (645), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 155/463 (33%), Positives = 258/463 (55%), Gaps = 39/463 (8%)
Query: 48 NHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL 107
++ +ET ++ +DHFS+ + TF +Y+IN FW + PI + G E +++ +
Sbjct: 17 SYDYETKYFEVLLDHFSFTNNA--TFKLKYLINDTFWTN-DGPIFFYTGNEGTVENFAEN 73
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
+GF ++ A F AL+V EHRYYG+S+PFG+ + + H GY S+QALAD+ ++ +
Sbjct: 74 TGFMFDIAPSFNALVVFAEHRYYGESLPFGND--SFVSPSHIGYLTSSQALADFVDLINY 131
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
++ + PVIA G SYGG LA+W R+KYP V+G++A+SAPI + TP ++ V
Sbjct: 132 LQ-TMSLEKVPVIAFGGSYGGMLASWLRMKYPASVVGAIAASAPIWQFE--TPCEDFYKV 188
Query: 228 VSKDFRD-TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
V++ +++ +++C I KSW+ + NI+ P+G A LS ++ C+PL+ + ++ + L
Sbjct: 189 VTRVYQEAVAKDCPLLITKSWTALRNISESPEGKAWLSDAWQLCSPLETSADV-ETLIGW 247
Query: 287 YSE----AAQYDTPSN---------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLE 333
YSE A + P + PVK C+ + A N DD +A G D+L+
Sbjct: 248 YSEILVNMAMVNYPYSTSFLAPLPPFPVKTFCSQLTQA-NIVDDKSLVMALG----DALQ 302
Query: 334 YDGN---NSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNC 390
N + C +EGW +Q+C+EM++PM M++ +PW+ KY C
Sbjct: 303 IYTNFTETTTCNKINQTAEALGEEGWYFQACTEMIMPMCSIDGDMFENDPWDYGKYASQC 362
Query: 391 KEQYGVS-PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
E++GV+ P + YGG +IK + SNI+FSNG+ DP+S GG +K +
Sbjct: 363 FEKWGVNQTHPELPVLEYGGKEIK-----AASNIVFSNGLLDPWSSGGVLKNVS--ESVV 415
Query: 450 SQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
S + G+H +DL K DP+ +++ R+ V ++ WI ++Y
Sbjct: 416 SVIIPDGAHHIDLRGGNKDDPETVIEARQFHVDNIKKWIMEFY 458
>gi|427785257|gb|JAA58080.1| Putative lysosomal pro-x carboxypeptidase-like protein
[Rhipicephalus pulchellus]
Length = 467
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 159/458 (34%), Positives = 245/458 (53%), Gaps = 33/458 (7%)
Query: 47 SNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQ 106
+++ +E F+ +DHF Y + T+ RY+ ++W PI + G E I
Sbjct: 21 ADYAYEVRFFETKVDHFGYA--NNDTYKMRYLFADQYWDHQGGPIFFYTGNEGSITTFAN 78
Query: 107 LSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILL 166
SG ++ A +F+AL++ EHRYYGKS+P+G+ + K+ H GY QALADYA +L
Sbjct: 79 NSGLMWDWAPEFRALLIFAEHRYYGKSMPYGND--SFKSPAHLGYLTVEQALADYADLLQ 136
Query: 167 HIKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
+I+ A + V++ G SYGG LA WFR+KYPHV +LA+SAPIL + DITP
Sbjct: 137 YIRKTLPGARDSQVVSFGGSYGGMLAAWFRMKYPHVTAAALAASAPILQFQDITPCGAQS 196
Query: 226 SVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC---TPLKNTTELKDG 282
+V++ FR SE C + I+KSW I N+ S +G + ++F C TP KN T L+D
Sbjct: 197 AVITGAFRKDSELCVEAIRKSWGIIKNMTSTVEGAQAVRERFHLCGNYTP-KNYTNLRDW 255
Query: 283 LDTVYSEAAQYDTPSN---------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSL- 332
L +Y+ A + P + PV+ C + + D +L G+ +A S+
Sbjct: 256 LTDLYANLAMVNYPYDNSFLAPVPGHPVREACKFLNRTFDNDDALL----DGLYQAISVF 311
Query: 333 -EYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNC 390
Y G ++C +G +GW +QSC+EMV+P+ D N M+ + W+L + K C
Sbjct: 312 QNYTG-QTQCNDLSKSSGTLDADGWNYQSCNEMVMPLCSDGVNDMFDKQDWDLNEVRKKC 370
Query: 391 KEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
++ + V+P +GG +I + SNIIFSNG DP+S GG ++T D +
Sbjct: 371 EKDFHVTPDVYKAALIFGGRNIA-----AASNIIFSNGDLDPWSAGGVLETIS--DSLIA 423
Query: 451 QDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ +H LDL + +DPD +V+ R E K + W+
Sbjct: 424 IYMEGAAHHLDLRSSNPADPDSVVRARALEKKYITKWL 461
>gi|91092240|ref|XP_971305.1| PREDICTED: similar to prolylcarboxypeptidase [Tribolium castaneum]
gi|270014428|gb|EFA10876.1| hypothetical protein TcasGA2_TC001698 [Tribolium castaneum]
Length = 488
Score = 252 bits (644), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 155/466 (33%), Positives = 249/466 (53%), Gaps = 41/466 (8%)
Query: 49 HGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLS 108
+ + T F + +DHFS+ + TF +Y+IN FW + PI + G E ++ + +
Sbjct: 17 YNYTTKFIDVPLDHFSFTNNA--TFKLKYLINDSFWID-DGPIFFYTGNEGAVETFAENT 73
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
GF ++ A F ALIV EHRYYG ++PFG+ A+ N H G+ S+QALADY ++ H+
Sbjct: 74 GFIFDIAPTFNALIVFAEHRYYGATLPFGN--ASFSNPGHLGFLTSSQALADYVYLINHL 131
Query: 169 KDKYNATH----APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTY 224
+ + + PV+A G SYGG LA W R+KYP V+G++A+SAPI + +TP +
Sbjct: 132 QTTHQRSEYLSKVPVVAFGGSYGGMLAAWLRMKYPASVVGAIAASAPIWQFQGLTPCENF 191
Query: 225 HSVVSKDFRD-TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL 283
+ +VS ++ ++C I+KSW I NI + DG A L+K +K C+PLK ++++ D L
Sbjct: 192 NRIVSNVYKTAVDDDCSAPIQKSWKIIRNITANDDGKAWLTKAWKLCSPLK-SSDIDDLL 250
Query: 284 DTVYSE------AAQYDTPSN-------IPVKRICNAIENAPNCGDDILCKIAAGVVEAD 330
+ YSE Y P+ PV+ C + G+ I + +
Sbjct: 251 EW-YSEILVNMAMVNYPYPTKFLAPLPAFPVRNFCYKLT-----GEKITDDKSLVTAIGN 304
Query: 331 SLEYDGN---NSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYI 387
+LE N ++C +EGW++Q+C+EM++PM D N M++ + W+ KY
Sbjct: 305 ALEIYTNFTKATKCNNINQTAASLGEEGWDFQACTEMIMPMCSDDNDMFENQSWDFKKYS 364
Query: 388 KNCKEQYGVSPRPSWV-LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
C ++GV + + + YGG DI + SNI+FSNG+ DP+S GG +
Sbjct: 365 DKCYTKWGVRQTNAELPILEYGGKDIT-----AASNIVFSNGLLDPWSSGGVLSNIS--S 417
Query: 447 LSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
S + G+H LDL ++DP +++ R+ V ++ WIT +Y
Sbjct: 418 TVSSVIIPEGAHHLDLRGENRNDPKSVIEARQFHVSSIRKWITDFY 463
>gi|339245559|ref|XP_003378705.1| putative serine protease pcp-1 [Trichinella spiralis]
gi|316972372|gb|EFV56050.1| putative serine protease pcp-1 [Trichinella spiralis]
Length = 826
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 158/484 (32%), Positives = 251/484 (51%), Gaps = 64/484 (13%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E++F N +DHFS+ E+ TF RY+IN++ W PI + G E ++ + +GF
Sbjct: 27 ESWFENMPVDHFSF--ENSDTFRLRYLINTENWNSDGGPIFFYCGNEGSVEGFAENTGFM 84
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+ENA F A++V EHRYYGKS+PFG+ ++ + G NS QA+ADYA ++ +K
Sbjct: 85 WENAKDFGAMVVFAEHRYYGKSLPFGNESSS-----NLGKLNSEQAMADYAVLINWLKTN 139
Query: 172 YN-ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
A + VIA G SYGG LA W R KYPH+V G++A+SAP+ + +T +++ + ++
Sbjct: 140 ITGAKSSAVIAFGGSYGGMLAAWMRTKYPHLVDGAIAASAPVAQFSGMTVCSSFSDITTE 199
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC-----TPLKNTTELKDGLDT 285
+R+ S C +IK+SW I +G L+K F+ C T KN T+L + L
Sbjct: 200 VYRNASPSCALSIKRSWPIIRKWGKTAEGRLDLAKMFRLCNETQFTSKKNVTQLVNWLTD 259
Query: 286 VYSEAAQYDTP------SNIPV-------------------------------KRICNAI 308
+Y A + P +P KRIC A
Sbjct: 260 IYGTLAMVNYPYATEFLKPVPAWPVKASDEDSVCTVVVGLFFFLIIHPRACICKRICVAC 319
Query: 309 E--NAPNCGD-DILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESD-EGWEWQSCSEM 364
+ N G+ ++L +I + + + + G C + E+ GD D + W++Q+C+EM
Sbjct: 320 QFLNDTEVGETELLHRIYSTI--SIYTNFTGKKP-CNLLENDYGDSVDGKLWDYQACTEM 376
Query: 365 VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNII 424
V+PM K+SM++ WNLT++ C E++ V PRP W + YGG ++ S +N++
Sbjct: 377 VMPMCNTKDSMFEQSDWNLTEFSDECFEKFKVRPRPDWAIINYGGRKLE-----SATNVV 431
Query: 425 FSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIM 484
FSNG DP+ RGG + HF ++ + + G+H DL + +D + R E+ M
Sbjct: 432 FSNGWLDPW-RGGGIVNSHFRGVA-ALIVEDGAHHYDLRGSNSADTASVQTVRLLELGFM 489
Query: 485 QGWI 488
+ WI
Sbjct: 490 RKWI 493
>gi|167888448|gb|ACA09612.1| prolyl carboxypeptidase [Tenebrio molitor]
Length = 488
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 148/464 (31%), Positives = 248/464 (53%), Gaps = 34/464 (7%)
Query: 48 NHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL 107
++ + T ++ +DHFS+ + TF +Y+IN+ FW + PI + G E I++ +
Sbjct: 15 SYNYTTKYFEVPVDHFSFTNNA--TFKLKYLINNSFWVD-DGPIFFYTGNEGTIENFAEN 71
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
GF ++ A QF AL+V EHRYYG+S+PFG A + GY S QALADY ++ +
Sbjct: 72 MGFMFDIAPQFNALLVFAEHRYYGESLPFGDESYA--DPARLGYLTSNQALADYVDLINY 129
Query: 168 IKDKYN----ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
++ + + PV+A G SYGG LA+W R+K+P V+G++ASSAPI + +TP
Sbjct: 130 LQTTRSRSSYSNKVPVVAFGGSYGGMLASWLRMKFPASVVGAIASSAPIWQFQGLTPCEN 189
Query: 224 YHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG 282
++ +V+ ++ ++C + +SW I NI S G A LS ++K CTPLK +++ D
Sbjct: 190 FNRIVTDVYKTALDDDCSVPLSRSWKVIRNITSNDAGKAWLSAQWKLCTPLKTQSDV-DT 248
Query: 283 LDTVYSE------AAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEA 329
L +SE Y P++ PV+ C + + D+ L +E
Sbjct: 249 LVNWFSEIVVNMAMVNYPYPTSFLAPLPAYPVRSFCYKMTESQTVDDETLLTTIGAALEI 308
Query: 330 DSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKN 389
+ Y ++C + + ++ W++Q+C+EM++PM + M++ PW+ Y +N
Sbjct: 309 YT-NYT-QTTKCNVINETAAALGEDAWDFQACTEMIMPMCSTDDDMFENSPWDFDTYSEN 366
Query: 390 CKEQYGV-SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
C +++GV P + YGG +I S SNI+FSNG+ DP+S GG +
Sbjct: 367 CYKKWGVKQTHPELPILEYGGKEIS-----SASNIVFSNGLLDPWSSGG--VLSNVSSSV 419
Query: 449 FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+ + G+H LDL K DP+ +++ R+ ++ ++ WIT YY
Sbjct: 420 SAVIIPEGAHHLDLRGENKDDPESVIEARQFHIRNIKKWITDYY 463
>gi|193718325|ref|XP_001949511.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Acyrthosiphon
pisum]
Length = 469
Score = 251 bits (642), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 156/476 (32%), Positives = 249/476 (52%), Gaps = 41/476 (8%)
Query: 41 ILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP 100
I+ S + + T + +DHFSY + TF +Y+IN K+W PI + G E
Sbjct: 12 IIFHVTSGYNYTTNYITVPVDHFSYTND--DTFELKYLINDKYWDVNKGPIFFYTGNEGR 69
Query: 101 IDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALAD 160
I+D +GF +E + +FKAL+V EHRYYG+S+P+G + + GY S QA+AD
Sbjct: 70 IEDFCDNTGFMWEISREFKALVVFAEHRYYGESMPYGVN--SFDDKEKLGYLTSQQAIAD 127
Query: 161 YASILLHIKDKYNATHA---PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGD 217
+ ++ +++D + PV A G SYGG LA WFR+KYP V G++ASSAPI +
Sbjct: 128 FVDLIKYLRDDALSVGRRPNPVFAFGGSYGGMLAAWFRIKYPAYVEGAIASSAPIWQFTG 187
Query: 218 ITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT 277
+TP ++ V S +R+ S EC TI SW I+N+ G L+ +K C PLK++
Sbjct: 188 MTPCNDFYKVTSSVYRNASAECGLTISASWKAINNVTESDSGKTWLTDNWKLCKPLKDSD 247
Query: 278 E---LKDGLDTVYSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAG 325
+ LK +Y A + P P+K +C ++ N +L + G
Sbjct: 248 DVARLKYWATDLYVALAMVNYPYEANFLGPLPANPIKEVCKSMTNHTEDDKTLLMSVFHG 307
Query: 326 VVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKD--KNSMYQPEPWNL 383
+ + Y G+ S C D + GW++Q+C+EM++P+ + ++ +++ PW+
Sbjct: 308 L--SVYFNYTGSAS-CLNLSSAFSDNTMNGWDYQACTEMIMPLCTNGGEDDIFEAYPWDF 364
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
Y + C+ +Y V P V YGG ++K + SNIIFSNG+ DP+S GG +K+
Sbjct: 365 KSYAEYCENRYDVIPTTDDVEKQYGGKNLK-----AASNIIFSNGLLDPWSGGGVLKS-- 417
Query: 444 FFDLSFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
+S S L G+H LDL + +D ++ RKT ++ WIT++ D++
Sbjct: 418 ---ISSSVRALLIPDGAHHLDLRASNPNDTSSVIHARKT----IKHWITKWIFDYR 466
>gi|194878910|ref|XP_001974147.1| GG21569 [Drosophila erecta]
gi|190657334|gb|EDV54547.1| GG21569 [Drosophila erecta]
Length = 470
Score = 251 bits (640), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 157/485 (32%), Positives = 251/485 (51%), Gaps = 36/485 (7%)
Query: 26 GLKLRPRLGRIRRSRILEQKDSN--HGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKF 83
G +R L + ++ D + + +E + +DHFS+ +TF RY+ N F
Sbjct: 3 GATVRAALSAVLGILVIAGCDCSQRYTYEIKEFQVPLDHFSFLIN--ETFNIRYLYNDSF 60
Query: 84 WGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKA 141
N +PI + G E I+ Q +GF +E A + +AL++ EHRYYGKS+PFGS
Sbjct: 61 VDKSNDRTPIFFYTGNEGDIELFAQNTGFLWEQAERQRALVIFAEHRYYGKSLPFGSSTF 120
Query: 142 ALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHV 201
H YF QAL DYA ++ ++ N PV+A G SYGG LA WFR+KYPH+
Sbjct: 121 NTSMPEHLAYFTVEQALEDYAMLITFLR---NDRQLPVVAFGGSYGGMLAAWFRMKYPHL 177
Query: 202 VIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGL 260
V G+LA+SAPIL + IT ++ +V+ F + ++ C I KSW + + + G
Sbjct: 178 VAGALAASAPILQFSGITDCDIFYKIVTSVFENAYNKNCSANIAKSWKLFETLGASEAGK 237
Query: 261 AILSKKFKTCTPLKNTTELKDGL---DTVYSEAAQYDTPSN---------IPVKRICNAI 308
+S F C LK +LK L + VY+ A + P N PV+++C +
Sbjct: 238 KQISDAFHLCNALKTDEDLKKFLNYVEEVYTNLAMVNYPYNSSFLAPLPAYPVRQVCYYL 297
Query: 309 ENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-INEDRTGDESDEGWEWQSCSEMVVP 367
+ +L +++ + + +++C I+ + DES GW+ QSC++MV+P
Sbjct: 298 KELQTTDASLLHAMSSALAVYTNYT---QSAKCLDISVNSNADES--GWDIQSCNQMVMP 352
Query: 368 MGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFS 426
+ + + +M++ WN Y + C + Y ++P+P ++ YGG +++ +TSNIIFS
Sbjct: 353 ICSNSSETMFRTSSWNFKDYAEKCYKNYRLTPKPYDIILRYGGRNLE-----ATSNIIFS 407
Query: 427 NGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQG 486
NG+ DP+S GG ++T + D F L G+H LDL +DP + R E I+
Sbjct: 408 NGLLDPWSGGGVLQTPN--DKVFVIILPEGAHHLDLRHTDPADPPSVRDARDKEAAIIAR 465
Query: 487 WITQY 491
WI +
Sbjct: 466 WIQDF 470
>gi|194759919|ref|XP_001962194.1| GF14561 [Drosophila ananassae]
gi|190615891|gb|EDV31415.1| GF14561 [Drosophila ananassae]
Length = 471
Score = 250 bits (638), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 151/449 (33%), Positives = 234/449 (52%), Gaps = 33/449 (7%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKF---WGGGNSPILAFLGAEAPIDDNIQLSGFTYENAH 116
+DHFS+ S TF RY+ N F +PIL + G E I+ Q +GF +E A
Sbjct: 39 LDHFSF--LSNATFSIRYLYNDSFVRDQSNARTPILFYTGNEGDIELFAQNTGFLWEEAE 96
Query: 117 QFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATH 176
+ AL+V EHRYYGKS+PFG H YF Q L DYA ++ +++ N T
Sbjct: 97 RKHALVVFAEHRYYGKSLPFGKSTFNTSMPEHLAYFTVEQTLEDYAMLITYLR---NGTQ 153
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT- 235
PV+ G SYGG LA WFR+KYPH+ +G+LA+SAPIL + IT ++ +V+ F +
Sbjct: 154 RPVVTFGGSYGGMLAAWFRMKYPHLAVGALAASAPILQFPGITDCDIFYRIVTSVFANAY 213
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLD---TVYSEAAQ 292
+ C I KSW + + G +S F C+P+K+ +LK LD VYS A
Sbjct: 214 NSNCTVNIAKSWRVFETLGGTDAGKKQISDAFNLCSPVKSDADLKKFLDYVEEVYSNLAM 273
Query: 293 YDTPSN---------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYI 343
+ P + PV+++C +++ D+L +A+ + A + G I
Sbjct: 274 VNYPYSSSFLAPLPAFPVRQVCFYLKDLHTTDADLLHTMASAL--AVYTNFTGTTKCLDI 331
Query: 344 NEDRTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSW 402
+ DES GW Q+C++MV+P + +M++P WN ++ + C + Y ++P+P
Sbjct: 332 SATSNADES--GWNIQTCNQMVMPFCSNGTETMFRPSTWNFNEFSEKCYKDYRLTPKPYD 389
Query: 403 VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL 462
++ YGG +I+ + +NIIFSNG+ DP+S GG ++ + D L G+H LDL
Sbjct: 390 IILRYGGKNIE-----AATNIIFSNGLLDPWSGGGVLQAPN--DKIHVIILPEGAHHLDL 442
Query: 463 DEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+ +DP + R E I+ WI ++
Sbjct: 443 RRSDPADPPSVRDARNKESAIIARWIEEF 471
>gi|195091774|ref|XP_001997565.1| GH13951 [Drosophila grimshawi]
gi|193906104|gb|EDW04971.1| GH13951 [Drosophila grimshawi]
Length = 472
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 155/450 (34%), Positives = 238/450 (52%), Gaps = 31/450 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGN-SPILAFLGAEAPIDDNIQLSGFTYENAHQF 118
+DHFS+ S +F RY+ N + N I + G E I+ + SGF +E A +
Sbjct: 34 LDHFSF--LSNASFSIRYLYNESYADKSNPKSIFFYTGNEGDIEWFAKNSGFVWELAEKE 91
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
+A++V EHRYYGKS+PFGS H YF Q L DYA ++ ++ N P
Sbjct: 92 RAIVVFAEHRYYGKSLPFGSDTFNASKPEHLAYFTVEQTLEDYAMLITFLR---NGRQLP 148
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-SE 237
V+A G SYGG LA WFR+KYPH+V+G+LA+SAPIL + +TP ++ + + F +
Sbjct: 149 VVAFGGSYGGMLAAWFRIKYPHIVVGALAASAPILQFEGLTPCDIFNEITTSVFNTAYNA 208
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK---DGLDTVYSEAAQYD 294
C I KSW + +A+ G L KF C P+KN +L D L+ VY+ A +
Sbjct: 209 NCSANIGKSWHAFETMAATDAGKKQLGDKFHLCEPIKNGDDLNQFMDYLEDVYANLAMSN 268
Query: 295 TPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC-YIN 344
P N PV++ C +++ D++ +A+ + A Y G++ Y N
Sbjct: 269 YPYNTNFLAPLPANPVRQFCLYLKDLHADEADLVSSMASAL--AVYTNYSGSDPCVDYKN 326
Query: 345 EDRTGDESD-EGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSW 402
+ + D GWE QSC++MV+P N+MY+ W+L K C +++ ++P+P
Sbjct: 327 SNASSATVDSSGWEIQSCNQMVMPFCANSSNTMYRTSTWDLKKVSDKCYKKFQLTPKPFD 386
Query: 403 VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL 462
++ YGG D+ + SNIIFSNG+ DP+S GG + + + D L G+H LDL
Sbjct: 387 IVLRYGGRDL-----VAASNIIFSNGLLDPWSGGGVLMSPN--DRIHIIILPEGAHHLDL 439
Query: 463 DEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
++ DP +V R+ E I+ WI+++Y
Sbjct: 440 RQSNPMDPASVVDARQKEAAIIGQWISEFY 469
>gi|195484754|ref|XP_002090813.1| GE12590 [Drosophila yakuba]
gi|194176914|gb|EDW90525.1| GE12590 [Drosophila yakuba]
Length = 470
Score = 249 bits (636), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 149/448 (33%), Positives = 235/448 (52%), Gaps = 32/448 (7%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHFS+ + TF RY+ N F N +PI + G E I+ Q +GF +E A +
Sbjct: 39 LDHFSFLINA--TFNIRYLYNDSFVDKSNARTPIFFYTGNEGDIELFAQNTGFLWEQAER 96
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
+AL++ EHRYYGKS+PFGS H YF Q L DYA ++ + N
Sbjct: 97 QRALVIFAEHRYYGKSLPFGSSTFNTSLPEHLAYFTVEQTLEDYAMLITFLT---NDRQM 153
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-S 236
PV+A G SYGG LA WFR+KYPH+V G+LA+SAPIL + IT ++ +V+ F++ +
Sbjct: 154 PVVAFGGSYGGMLAAWFRMKYPHLVTGALAASAPILQFSGITDCDIFYRIVTSVFQNAYN 213
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLD---TVYSEAAQY 293
C I KSW + + + G +S F C LK +LK LD VYS A
Sbjct: 214 TNCTANIAKSWKLFETLGASEAGKKQISDAFHLCNALKTDEDLKKFLDYVEEVYSNLAMV 273
Query: 294 DTPSN---------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN 344
+ P N PV+++C + D+L +++ + + +++C ++
Sbjct: 274 NYPYNSSFLAPLPAYPVRQVCYYLRELQTTDADLLHAMSSALAVYTNYT---QSAKC-LD 329
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKDKN-SMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
+ D GW QSC++MV+P+ + + +M++ WN Y + C + Y ++P+P +
Sbjct: 330 ISVNSNADDSGWNIQSCNQMVMPICSNSSETMFRTSSWNFKDYAEKCYKNYRLTPKPYDI 389
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
+ YGG +++ +T+NIIFSNG+ DP+S GG +++ + D F L G+H LDL
Sbjct: 390 ILRYGGRNLE-----ATTNIIFSNGLLDPWSGGGVLQSPN--DKVFVIILPEGAHHLDLR 442
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+ +DP + R+ E I+ WI +
Sbjct: 443 HSDPADPPSVRDAREKEAAIIARWIQDF 470
>gi|195034070|ref|XP_001988820.1| GH11372 [Drosophila grimshawi]
gi|193904820|gb|EDW03687.1| GH11372 [Drosophila grimshawi]
Length = 472
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 155/450 (34%), Positives = 238/450 (52%), Gaps = 31/450 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGN-SPILAFLGAEAPIDDNIQLSGFTYENAHQF 118
+DHFS+ S +F RY+ N + N I + G E I+ + SGF +E A +
Sbjct: 34 LDHFSF--LSNASFSIRYLYNESYADKSNPKSIFFYTGNEGDIEWFAKNSGFVWELAEKE 91
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
+A++V EHRYYGKS+PFGS H YF Q L DYA ++ ++ N P
Sbjct: 92 RAIVVFAEHRYYGKSLPFGSDTFNASKPEHLAYFTVEQTLEDYALLITFLR---NGRQLP 148
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-SE 237
V+A G SYGG LA WFR+KYPH+V+G+LA+SAPIL + +TP ++ + + F +
Sbjct: 149 VVAFGGSYGGMLAAWFRIKYPHIVVGALAASAPILQFEGLTPCDIFNEITTSVFNTAYNA 208
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK---DGLDTVYSEAAQYD 294
C I KSW + +A+ G L KF C P+KN +L D L+ VY+ A +
Sbjct: 209 NCSANIGKSWHAFETMAATDAGKKQLGDKFHLCEPIKNGDDLNQFMDYLEDVYANLAMSN 268
Query: 295 TPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC-YIN 344
P N PV++ C +++ D++ +A+ + A Y G++ Y N
Sbjct: 269 YPYNTNFLAPLPANPVRQFCLYLKDLHADEADLVSSMASAL--AVYTNYSGSDPCVDYKN 326
Query: 345 EDRTGDESD-EGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSW 402
+ + D GWE QSC++MV+P N+MY+ W+L K C +++ ++P+P
Sbjct: 327 SNASSATVDSSGWEIQSCNQMVMPFCANSSNTMYRTSTWDLKKVSDKCYKKFQLTPKPFD 386
Query: 403 VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL 462
++ YGG D+ + SNIIFSNG+ DP+S GG + + + D L G+H LDL
Sbjct: 387 IVLRYGGRDL-----VAASNIIFSNGLLDPWSGGGVLMSPN--DRIHIIILPEGAHHLDL 439
Query: 463 DEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
++ DP +V R+ E I+ WI+++Y
Sbjct: 440 RQSNPMDPASVVDARQKEAAIIGQWISEFY 469
>gi|20129649|ref|NP_610037.1| CG2493, isoform A [Drosophila melanogaster]
gi|442628593|ref|NP_001260630.1| CG2493, isoform B [Drosophila melanogaster]
gi|7298683|gb|AAF53897.1| CG2493, isoform A [Drosophila melanogaster]
gi|18447221|gb|AAL68201.1| GH14278p [Drosophila melanogaster]
gi|220945264|gb|ACL85175.1| CG2493-PA [synthetic construct]
gi|220954996|gb|ACL90041.1| CG2493-PA [synthetic construct]
gi|440213994|gb|AGB93165.1| CG2493, isoform B [Drosophila melanogaster]
Length = 475
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 149/448 (33%), Positives = 236/448 (52%), Gaps = 32/448 (7%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHFS+ + TF RY+ N F N +PI + G E I+ Q +GF +E A +
Sbjct: 44 LDHFSFLINA--TFNIRYLYNDSFVDKSNARTPIFFYTGNEGDIELFAQNTGFLWEQAER 101
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
+AL++ EHRYYGKS+PFGS H YF Q L DYA ++ ++ N
Sbjct: 102 QRALVIFAEHRYYGKSLPFGSSTFNTSLPEHLAYFTVEQTLEDYAMLITFLR---NDRQM 158
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-S 236
PV+A G SYGG LA WFR+KYPH+V G+LA+SAP+L + IT ++ +V+ F++ +
Sbjct: 159 PVVAFGGSYGGMLAAWFRMKYPHLVNGALAASAPVLQFPGITDCDIFYRIVTSVFQNAYN 218
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLD---TVYSEAAQY 293
E C I KSW + + + G +S F C LKN +LK LD VYS A
Sbjct: 219 ENCTLNIAKSWKLFETLGASEAGKKQISDAFHLCNALKNDDDLKKFLDYVEEVYSNLAMV 278
Query: 294 DTPSN---------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN 344
+ P N PV+++C ++ + D+L +++ + + +++C ++
Sbjct: 279 NYPYNSSFLAPLPAYPVRQVCYYLKELHSTDADLLHAMSSALAVYTNYT---QSAKC-LD 334
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
+ D GW QSC++MV+P+ + +M++ WN Y + C + Y ++P+P +
Sbjct: 335 ISVNSNADDSGWNIQSCNQMVMPICSNGSETMFRTSSWNFKDYAEKCYKNYRLTPKPYDI 394
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
+ YGG +++ + +NIIFSNG+ DP+S GG ++ + D F L G+H LDL
Sbjct: 395 ILRYGGRNLE-----AATNIIFSNGLLDPWSGGGVLQAPN--DKVFVIILPEGAHHLDLR 447
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+ +DP + R E I+ WI +
Sbjct: 448 HSDPADPPSVRDARDKEAAIIARWIQDF 475
>gi|195118432|ref|XP_002003741.1| GI18077 [Drosophila mojavensis]
gi|193914316|gb|EDW13183.1| GI18077 [Drosophila mojavensis]
Length = 469
Score = 249 bits (635), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 152/455 (33%), Positives = 241/455 (52%), Gaps = 43/455 (9%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHFS+ S +F RY+ N + N PI + G E I+ Q SGF +E A Q
Sbjct: 35 LDHFSF--LSNASFNIRYLANDSYVDKKNPQPPIFFYTGNEGDIEWFAQNSGFVWELAAQ 92
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
+AL++ EHRYYGKS+P+G+ H YF Q L DYA ++ +++ N
Sbjct: 93 QRALVIFAEHRYYGKSLPYGADTFNTSKPEHLAYFTVEQTLEDYAQLITYLR---NGKQL 149
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-S 236
PV+A G SYGG LA WFR+KYPH+V+G+LA+SAPIL + +TP ++ + + F + +
Sbjct: 150 PVVAFGGSYGGMLAAWFRMKYPHIVVGALAASAPILQFSGLTPCDIFNKITTAVFENAYN 209
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK---DGLDTVYSEAAQY 293
C I +SW +++A+ G LS F C +KN +L D L+ VY A
Sbjct: 210 ANCTANIGRSWKVFESMAATDAGKKQLSDIFHVCDAIKNADDLNNFMDYLEEVYGNLAMA 269
Query: 294 DTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN 344
+ P N PV+++C +++ D++ +A+ + A Y G + +C
Sbjct: 270 NYPYNSTFLSPLPANPVRQLCFYLKDLHQNDTDLVASMASAL--AVYTNYTG-SVKCLDY 326
Query: 345 EDRTGDES--DEGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPS 401
+D S GWE QSC++MV+PM + N+M++ W+ K +C +Q+ + PRP
Sbjct: 327 KDSNAGASYDGTGWEIQSCNQMVMPMCANNNNTMFRTSNWDFKKVSDDCYKQFHLIPRPY 386
Query: 402 WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL-----G 456
++ YGG +++ + SNIIFSNG+ DP+S GG ++ + + +N+ G
Sbjct: 387 DIVMRYGGRNLE-----AASNIIFSNGLLDPWSGGGVLQ-------APNNQINVIILPEG 434
Query: 457 SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+H LDL ++ DP + R+ E KI+ WI +
Sbjct: 435 AHHLDLRQSNPLDPPSVTVARQLEAKIIAQWIKDF 469
>gi|195580410|ref|XP_002080033.1| GD21700 [Drosophila simulans]
gi|194192042|gb|EDX05618.1| GD21700 [Drosophila simulans]
Length = 475
Score = 248 bits (634), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 148/448 (33%), Positives = 235/448 (52%), Gaps = 32/448 (7%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHFS+ + TF RY+ N F N +PI + G E I+ Q +GF +E A +
Sbjct: 44 LDHFSFLINA--TFSIRYLYNDSFVDKSNARTPIFFYTGNEGDIELFAQNTGFLWEQAER 101
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
+AL++ EHRYYGKS+PFGS H YF Q L DYA ++ ++ N
Sbjct: 102 QRALVIFAEHRYYGKSLPFGSSTFNTSLPEHLAYFTVEQTLEDYAMLITFLR---NDRQM 158
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-S 236
PV+A G SYGG LA WFR+KYPH+V G+LA+SAP+L + IT ++ +V+ F++ +
Sbjct: 159 PVVAFGGSYGGMLAAWFRMKYPHLVTGALAASAPVLQFPGITDCDIFYRIVTSVFQNAYN 218
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLD---TVYSEAAQY 293
E C I KSW + + + G +S F C LK +LK LD VYS A
Sbjct: 219 ENCTLNIAKSWKLFETLGASEAGKKQISDAFHLCNALKTDDDLKKFLDYVEEVYSNLAMV 278
Query: 294 DTPSN---------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN 344
+ P N PV+++C ++ + D+L +++ + + +++C ++
Sbjct: 279 NYPYNSSFLAPLPAYPVRQVCYYLKELHSTDADLLHAMSSALAVYTNYT---QSAKC-LD 334
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
+ D GW QSC++MV+P+ + +M++ WN Y + C + Y ++P+P +
Sbjct: 335 ISVNSNADDSGWNIQSCNQMVMPICSNGSETMFRTSSWNFKDYAEKCYKNYRLTPKPYDI 394
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
+ YGG +++ + +NIIFSNG+ DP+S GG ++ + D F L G+H LDL
Sbjct: 395 ILRYGGRNLE-----AATNIIFSNGLLDPWSGGGVLQAPN--DKVFVIILPEGAHHLDLR 447
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+ +DP + R E I+ WI +
Sbjct: 448 HSDPADPPSVRDARDKEAAIIARWIQDF 475
>gi|328868406|gb|EGG16784.1| peptidase S28 family protein [Dictyostelium fasciculatum]
Length = 505
Score = 248 bits (633), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 158/477 (33%), Positives = 255/477 (53%), Gaps = 43/477 (9%)
Query: 37 RRSRILEQKD-SNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG-------N 88
R + +E+ D S F ++Y Q +DHF++ ++ QTFPQRY+I+ +W N
Sbjct: 40 REFKGVEENDPSPPPFSEYYYIQKLDHFNF--QTQQTFPQRYLISDTYWNKPSSNDSQCN 97
Query: 89 SPILAFLGAEAPIDDNIQLSGF-TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNAR 147
PIL + G E I Q S F T A + AL+ EHRYYG+++PFG+ +N +
Sbjct: 98 GPILFYTGNEGDIVWFYQNSQFITNVLAQELGALLFFAEHRYYGQTLPFGNESTVPENLQ 157
Query: 148 HRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLA 207
Y S QALADYA+I+ + + + PVI++G SYGG LA+W R+KYP++V G+LA
Sbjct: 158 ---YCTSEQALADYATIIPQVLEDLGGLNCPVISVGGSYGGMLASWMRMKYPNIVDGALA 214
Query: 208 SSAPILYY-GDITPWTTYHSVVSKDFRDTSEE--CFQTIKKSWSEIDNIASKPDGLAILS 264
+SAPILY+ G ++ + + DF TS + C I+ +++EI IA KP+G +LS
Sbjct: 215 ASAPILYFLGTGADPEGFNEIATNDFAQTSADGSCATRIRGAFTEISEIAEKPNGDELLS 274
Query: 265 KKFKTCTPLKNTTELKDGLDT--VYSEAAQYDTPSNI-------PVKRICNAIENAPNCG 315
+ F C +++ +L + +++ Y A Y P+ P+ C A+E
Sbjct: 275 EMFSLCG-VQSVDDLVNWIESGLTYMAMADYPYPAAFLEPMPGWPINASCAAMEPV---Q 330
Query: 316 DDILCKIAAGVVEADSLEYDGNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-N 373
DDI + + +S G CY ++ TG + W++Q+C+EMV+PM +
Sbjct: 331 DDIQALLQVLHIYYNS---SGQAGSCYNVSVFTTGALGSDVWDYQACTEMVMPMSSNGVQ 387
Query: 374 SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPF 433
M+ ++L I +C++Q+GV+P P W+ YYGG +SNI+FSNG+ DP+
Sbjct: 388 DMFPASSFDLDSLITSCQQQWGVTPDPYWITNYYGGSQ-----NVQSSNIVFSNGILDPW 442
Query: 434 SRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
GG ++ + ++ ++ G+H LDL DP ++ R E +++ W Q
Sbjct: 443 RAGGVIENGNEI---YAVLIDGGAHHLDLRMPSPQDPQSVINARALETQLITKWSNQ 496
>gi|195351854|ref|XP_002042435.1| GM23324 [Drosophila sechellia]
gi|194124304|gb|EDW46347.1| GM23324 [Drosophila sechellia]
Length = 475
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 148/448 (33%), Positives = 235/448 (52%), Gaps = 32/448 (7%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHFS+ + TF RY+ N F N +PI + G E I+ Q +GF +E A +
Sbjct: 44 LDHFSFLINA--TFNIRYLYNDSFVDKSNARTPIFFYTGNEGDIELFAQNTGFLWEQAER 101
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
+AL++ EHRYYGKS+PFGS H YF Q L DYA ++ ++ N
Sbjct: 102 QRALVIFAEHRYYGKSLPFGSSTFNTSLPEHLAYFTVEQTLEDYAMLITFLR---NDRQM 158
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-S 236
PV+A G SYGG LA WFR+KYPH+V G+LA+SAP+L + IT ++ +V+ F++ +
Sbjct: 159 PVVAFGGSYGGMLAAWFRMKYPHLVTGALAASAPVLQFPGITDCDIFYRIVTSVFQNAYN 218
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLD---TVYSEAAQY 293
E C I KSW + + + G +S F C LK +LK LD VYS A
Sbjct: 219 ENCTLNIGKSWKLFETLGASEAGKKQISDAFHLCNVLKTDDDLKKFLDYVEEVYSNLAMV 278
Query: 294 DTPSN---------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN 344
+ P N PV+++C ++ + D+L +++ + + +++C ++
Sbjct: 279 NYPYNSSFLAPLPAYPVRQVCYYLKELHSTDADLLHAMSSALAVYTNYT---QSAKC-LD 334
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
+ D GW QSC++MV+P+ + +M++ WN Y + C + Y ++P+P +
Sbjct: 335 ISVNSNADDSGWNIQSCNQMVMPICSNGSETMFRTSSWNFKDYAEKCYKNYRLTPKPYDI 394
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
+ YGG +++ + +NIIFSNG+ DP+S GG ++ + D F L G+H LDL
Sbjct: 395 ILRYGGRNLE-----AATNIIFSNGLLDPWSGGGVLQAPN--DKVFVIILPEGAHHLDLR 447
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+ +DP + R E I+ WI +
Sbjct: 448 HSDPADPPSVRDARDKEAAIIARWIQDF 475
>gi|281206058|gb|EFA80247.1| peptidase S28 family protein [Polysphondylium pallidum PN500]
Length = 481
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 153/456 (33%), Positives = 253/456 (55%), Gaps = 38/456 (8%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN-----SPILAFLGAEAPIDDNI 105
+E F+Y QT+DHF++ + +FPQRY+++ +W SP+L + G E I
Sbjct: 29 YEEFYYMQTLDHFNFYNKG--SFPQRYLVSDTYWTRPTGPICESPVLFYTGNEGDIVWFY 86
Query: 106 QLSGF-TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
+ S F T A + AL+V EHR+YG+++PFG+ + +N GY S QALADYA +
Sbjct: 87 ENSQFVTNVLAKEMGALLVFAEHRFYGETMPFGNSSSLPENI---GYLTSEQALADYAQL 143
Query: 165 LLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYY-GDITPWTT 223
+ + +H PV+A+G SYGG LA+WFR+KYP+++ G+LA+SAPILY+ G
Sbjct: 144 IPAVLSDLGGSHCPVLAVGGSYGGMLASWFRMKYPNIIDGALAASAPILYFLGTNANSEG 203
Query: 224 YHSVVSKDFRDTSEE--CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
++ + + DF +TS E C I +++EI +++ +GL +L+K F C L ++L +
Sbjct: 204 FNEIATIDFAETSSEGTCATRIHSAFNEITQMSNTQNGLNVLTKTFSLCEELTELSDLIN 263
Query: 282 GLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAA-------GVVEADSLEY 334
L+ + A D P N +E P ++ C + A G+V+ ++ +
Sbjct: 264 WLEAAITYMAMADYP------YPANFLEPMPGNPINVSCSLLAKETDNIQGLVQVMNVYF 317
Query: 335 D--GNNSRC-YINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQP-EPWNLTKYIKNC 390
+ G +C ++ TG +GW++Q+C+EMV+P+ + + P PW+L++ + C
Sbjct: 318 NSSGQAGQCNNVSVYTTGALGSDGWDYQACTEMVMPISANGVQDFFPAAPWSLSQLTQYC 377
Query: 391 KEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
++ + V+P P+W+ TYY G D+ TSNIIFSNG+ DP+ GG V + + +
Sbjct: 378 QQTWQVTPNPNWITTYYQGQDLS-----QTSNIIFSNGVLDPWRAGG-VVSNDGGNDIIT 431
Query: 451 QDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVK-IMQ 485
++ G+H LDL +DP + R E K +MQ
Sbjct: 432 IIIDGGAHHLDLRMPNSADPTAVTNARVLETKYLMQ 467
>gi|198473985|ref|XP_001356509.2| GA15377 [Drosophila pseudoobscura pseudoobscura]
gi|198138198|gb|EAL33573.2| GA15377 [Drosophila pseudoobscura pseudoobscura]
Length = 474
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 151/449 (33%), Positives = 238/449 (53%), Gaps = 34/449 (7%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHFS+ S TF RY+ N F N +PI + G E I+ Q +GF +E A +
Sbjct: 43 LDHFSF--LSNATFNIRYLYNDSFVDKKNAHTPIFFYTGNEGDIELFAQNTGFMWELAEK 100
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
+AL++ EHRYYGKS+PFG+ H YF Q L DYA ++ ++ N
Sbjct: 101 QRALLIFAEHRYYGKSLPFGASTFNASMPDHLAYFTVEQTLEDYAMLITFLR---NDLPL 157
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-S 236
PV+A G SYGG LA WFR+KYPH+V G+LA+SAPIL + IT ++ +V+ F++ +
Sbjct: 158 PVVAFGGSYGGMLAAWFRMKYPHLVAGALAASAPILQFPGITDCDIFYRIVTSVFQNAYN 217
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLD---TVYSEAAQY 293
C I +SW + + G +S F C P+KN +LK+ LD VY A
Sbjct: 218 SNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKNFLDYIEEVYGNLAMV 277
Query: 294 DTPSN---------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN 344
+ P N PV+++C +++ D+L +A+ + A Y G + +C ++
Sbjct: 278 NYPYNSSFLAPLPAYPVRQVCFYLKDLHQSDADLLHAMASAL--AVYTNYTG-SVKC-LD 333
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
+ D GW Q+C++MV+P + +SM++P WN ++ C + Y ++P+P +
Sbjct: 334 TSVNSNADDSGWNVQTCNQMVMPFCSNSTDSMFRPSSWNFKEFSDKCYKDYRLTPKPYDI 393
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKT-YHFFDLSFSQDLNLGSHCLDL 462
+ YGG +I+ + +NIIFSNG+ DP+S GG ++ + D+ L G+H LDL
Sbjct: 394 ILRYGGRNIE-----TATNIIFSNGLLDPWSGGGVLQAPNNKVDIII---LPEGAHHLDL 445
Query: 463 DEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+ +DP + R E I+ WI ++
Sbjct: 446 RNSNPADPPSVRDARNKEASIIARWIHEF 474
>gi|307213053|gb|EFN88584.1| Lysosomal Pro-X carboxypeptidase [Harpegnathos saltator]
Length = 404
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 139/401 (34%), Positives = 216/401 (53%), Gaps = 30/401 (7%)
Query: 107 LSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILL 166
+ GF +E A QF AL+V EHRYYG+S+PFG++ A + +H GY S Q LADY ++
Sbjct: 17 IKGFLWETAQQFGALVVFAEHRYYGESLPFGNKSFA--DPQHLGYLTSQQVLADYVELIQ 74
Query: 167 HIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHS 226
+++ K +PVI G SYGG L+ W R+KYPH+V G++A+SAPIL + I +
Sbjct: 75 YLRSKPGYKRSPVILFGGSYGGMLSAWMRMKYPHIVQGAIAASAPILQFTGIVKCEAFAR 134
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK---NTTELKDGL 283
+V+ DFR ++ C + I++SW+ I + S +G LS +K C PLK + LKD L
Sbjct: 135 IVTSDFRASNPTCAKLIRQSWNTITEVTSNDEGKKWLSDNWKLCEPLKTAEHVKTLKDFL 194
Query: 284 DTVYSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEY 334
VY++ A D P P+ C + N+ G +L + V + Y
Sbjct: 195 QEVYTDLAMVDYPYETNFLTPLPGNPINVFCRHLTNSSLTGKPLLLALHGAV--SIYTNY 252
Query: 335 DGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQ 393
G + G ++ +GW++Q+C+EMV+PM D N M++P W+ Y C ++
Sbjct: 253 TGKATCISTENAEPGLDAAQGWDYQACTEMVMPMCPDGVNDMFEPSKWDFKDYNNTCFKK 312
Query: 394 YGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL 453
Y V+P+P V YG ++ + +NI FSNG+ DP++ GG ++ +LS S
Sbjct: 313 YSVTPQPHRVCQQYGCENLSTV-----TNINFSNGLLDPWTSGGVLR-----NLSSSAIA 362
Query: 454 NL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
L +H LDL E+ +DP ++ RK ++ WI +Y
Sbjct: 363 ILIPDAAHHLDLRESNSNDPYSVILTRKFHRYSIKKWIDEY 403
>gi|195147472|ref|XP_002014704.1| GL19317 [Drosophila persimilis]
gi|194106657|gb|EDW28700.1| GL19317 [Drosophila persimilis]
Length = 474
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 151/449 (33%), Positives = 238/449 (53%), Gaps = 34/449 (7%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHFS+ S TF RY+ N F N +PI + G E I+ Q +GF +E A +
Sbjct: 43 LDHFSF--LSNATFNIRYLYNDSFVDKKNAHTPIFFYTGNEGDIELFAQNTGFMWELAEK 100
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
+AL++ EHRYYGKS+PFG+ H YF Q L DYA ++ ++ N
Sbjct: 101 QRALLIFAEHRYYGKSLPFGASTFNASMPDHLAYFTVEQTLEDYAMLITFLR---NDLPL 157
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-S 236
PV+A G SYGG LA WFR+KYPH+V G+LA+SAPIL + IT ++ +V+ F++ +
Sbjct: 158 PVVAFGGSYGGMLAAWFRMKYPHLVAGALAASAPILQFPGITDCDIFYRIVTSVFQNAYN 217
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLD---TVYSEAAQY 293
C I +SW + + G +S F C P+KN +LK+ LD VY A
Sbjct: 218 SNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKNFLDYIEEVYGNLAMV 277
Query: 294 DTPSN---------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN 344
+ P N PV+++C +++ D+L +A+ + A Y G + +C ++
Sbjct: 278 NYPYNSSFLAPLPPYPVRQVCFYLKDLHQSDADLLHAMASAL--AVYTNYTG-SVKC-LD 333
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
+ D GW Q+C++MV+P + +SM++P WN ++ C + Y ++P+P +
Sbjct: 334 TSVNSNADDSGWNVQTCNQMVMPFCSNSTDSMFRPSSWNFKEFSDKCYKDYRLTPKPYDI 393
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKT-YHFFDLSFSQDLNLGSHCLDL 462
+ YGG +I+ + +NIIFSNG+ DP+S GG ++ + D+ L G+H LDL
Sbjct: 394 ILRYGGRNIE-----TATNIIFSNGLLDPWSGGGVLQAPNNKVDIII---LPEGAHHLDL 445
Query: 463 DEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+ +DP + R E I+ WI ++
Sbjct: 446 RNSNPADPPSVRDARNKEASIIARWIHEF 474
>gi|260808833|ref|XP_002599211.1| hypothetical protein BRAFLDRAFT_200045 [Branchiostoma floridae]
gi|229284488|gb|EEN55223.1| hypothetical protein BRAFLDRAFT_200045 [Branchiostoma floridae]
Length = 436
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 154/450 (34%), Positives = 237/450 (52%), Gaps = 34/450 (7%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
FY +DHFS+ + TF RY++N ++ G PI + G E I+ ++ +G E
Sbjct: 3 FYPTQVDHFSFA--NPDTFLLRYLVNDTYFKDGG-PIFFYTGNEGDIEGFVKNTGLLMEM 59
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI-KDKYN 173
A +F A+++ EHRYYG+S+P+G + K+ H GY S QALAD+A ++ + K
Sbjct: 60 APRFGAMVIFAEHRYYGQSMPYGEE--SFKDPAHLGYLTSTQALADFAVLITRLRKTASG 117
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
A ++PV A G SYGG LA W R+KYPH+V GSLAS+A I Y IT Y V ++ F+
Sbjct: 118 AANSPVFAFGGSYGGMLAAWIRMKYPHLVAGSLASAASIFQYPGITDCEAYSHVATRTFQ 177
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK--NTTELKDGLDTVYSEAA 291
++ C I+ SW ID++ GL L+ F+ C L + L D L ++ A
Sbjct: 178 RSAAGCPGRIRSSWDIIDDLGKTGAGLQNLTSMFRLCERLTADDMPALVDWLVNLWMIYA 237
Query: 292 --QYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY 342
Y P+N PVK C+ I + +D+L +AA Y G +
Sbjct: 238 LIDYPYPANFLTPLPAWPVKEACHLIVS----NEDVLSGVAAAA--KLYFNYTGQTACLD 291
Query: 343 INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPS 401
I++ G + W++Q+C+E + P+ D N M+ PW+L+ + + C E++ V PRP
Sbjct: 292 ISKPYYGGLA---WQYQACTEQIEPICSDGVNDMFPAIPWDLSAFSEACYERWKVRPRPH 348
Query: 402 WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLD 461
W +T Y G +I + SNIIFSNG DP+S G +K+ D + + +H LD
Sbjct: 349 WAVTEYWGRNIS-----AASNIIFSNGDLDPWSAGCVLKSLS--DSLVAIVMEDAAHHLD 401
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
L + +DP +++ R E I++ WI +Y
Sbjct: 402 LRPSNPADPPSVIKARAQEADIIEKWIQEY 431
>gi|66804725|ref|XP_636095.1| peptidase S28 family protein [Dictyostelium discoideum AX4]
gi|60464439|gb|EAL62586.1| peptidase S28 family protein [Dictyostelium discoideum AX4]
Length = 513
Score = 245 bits (625), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 162/468 (34%), Positives = 255/468 (54%), Gaps = 52/468 (11%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN-------SPILAFLGAEAPID- 102
++ F+ QT+DHF++ +S F QRY+++ +W + PIL + G E I
Sbjct: 62 YQELFFLQTLDHFNF--QSKGEFAQRYLVSDVYWKKPSPNDKVCQGPILFYTGNEGDITL 119
Query: 103 --DNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALAD 160
DN Q T A + AL++ EHRYYG+S+PFG+ N GY S QALAD
Sbjct: 120 FYDNSQF--VTNVLAQEMNALLIFAEHRYYGESLPFGNDSWTSDNI---GYLTSEQALAD 174
Query: 161 YASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGD--I 218
YA ++ + + A H PV+++G SYGG L WFR+KYP++V G+LA+SAPIL + + +
Sbjct: 175 YAQLIPAVLSEMGAEHCPVLSVGGSYGGMLTAWFRMKYPNIVDGALAASAPILSFLNTGV 234
Query: 219 TPWTTYHSVVSKDFRDTSEE--CFQTIKKSWSEIDNIASKPDGLAILSKKFKTC-TPLKN 275
P T++ + + DF+DTS E C I+ + ++I I+++ +GLA LSK F C PL +
Sbjct: 235 NP-ETFNKIATDDFKDTSSEGTCASRIRSALNDIVTISTQSNGLAQLSKTFSVCGAPLTD 293
Query: 276 TTELKDGLDT--VYSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGV 326
+L + +++ Y A Y P+N P+ C+A+ DDI G+
Sbjct: 294 VNDLINWIESALTYMAMADYPYPANFLEPMPGYPINVSCSALAQQE---DDI-----QGL 345
Query: 327 VEADSLEYD--GNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWN 382
+E + Y+ G CY ++ TG D W +Q+C+EMV+P+ D N + P P++
Sbjct: 346 LEVLHVYYNYTGQAGTCYNMSVFTTGALGDASWNYQACTEMVMPVSSDGVNDFFPPSPFS 405
Query: 383 LTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
L+ + C++Q+ +P P W+ TYYGG + S +NIIFSNG+ D + GG ++T
Sbjct: 406 LSDLTQQCQQQFQTTPDPYWITTYYGGSNF------SATNIIFSNGVLDVWRSGGILETR 459
Query: 443 HFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
D + + G+H LDL DP + Q R+ E K++Q W ++
Sbjct: 460 S--DSIVALTIEGGAHHLDLRYPNPLDPPSVTQAREIESKLLQLWASE 505
>gi|157110322|ref|XP_001651053.1| prolylcarboxypeptidase [Aedes aegypti]
gi|108878777|gb|EAT43002.1| AAEL005521-PA [Aedes aegypti]
Length = 476
Score = 244 bits (623), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 165/482 (34%), Positives = 256/482 (53%), Gaps = 44/482 (9%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL 92
LG + + Q + + ++ + + +DHF+Y ES TF RY++N + G PIL
Sbjct: 11 LGLLVICLMTAQISAKYEYQIKYLDVPLDHFTYVNESV-TFKMRYLVNDTYNPDGKGPIL 69
Query: 93 AFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYF 152
+ G E I++ Q +GF ++ A + KA +V EHR+YGK++PFG+ A+ ++ +H GY
Sbjct: 70 FYTGNEGDIENFAQNTGFMWDIAPKLKASLVFAEHRFYGKTLPFGN--ASYESPKHLGYL 127
Query: 153 NSAQALADYASILLHIKDKYNATHA-PVIAIGASYGGELATWFRLKYPHVVIGSLASSAP 211
+S QALAD+A +L I +T PVIA G SYGG LA WFR+KYPH+V G++ASSAP
Sbjct: 128 SSEQALADFADLLAEINPSNRSTRGRPVIAFGGSYGGMLAAWFRMKYPHLVTGAIASSAP 187
Query: 212 ILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
I + T + +++ F + EC I +SW + N +S DGL L KFK C
Sbjct: 188 IRQFD--TDCGVFSQILTSVFSVAYTRECSLNIARSWDTLKNYSSTADGLKTLKDKFKFC 245
Query: 271 ---TPLKNTTE-LKDGLDTVYSEAAQYDTPSN---------IPVKRICNAIENAPN-CGD 316
T ++ TE D L VY A + P N PV+ C + A N G
Sbjct: 246 GNITKAEDITETFFDYLTDVYGNLAMINYPYNSSFLAPVPAYPVREFCGRL--AQNFTGL 303
Query: 317 DILCKIAAGVVEADSL--EYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPM-----G 369
+++ ++ EA S+ Y G I + GW++QSC+EM +PM G
Sbjct: 304 ELIGRLQ----EALSIYSNYTGKTKCLNIANAYDDNMGTSGWDFQSCTEMWMPMCSEGSG 359
Query: 370 KDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGM 429
KD M+ + W+ K+ +C +++GV P+ + LT YGG + SNI+FSNG+
Sbjct: 360 KD---MFPKKAWDEKKFSDDCFKKFGVRPKKTVALTTYGG-----LYLDGASNIVFSNGL 411
Query: 430 RDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
DP+S GG +++ + + + G+H +DL A ++DP + R+ ++ +Q WI
Sbjct: 412 MDPWSGGGVLQSPN--NAIKVVLIPEGAHHIDLRAADENDPGSVRGARQVHLQNIQMWIK 469
Query: 490 QY 491
QY
Sbjct: 470 QY 471
>gi|332373508|gb|AEE61895.1| unknown [Dendroctonus ponderosae]
Length = 487
Score = 242 bits (618), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 152/463 (32%), Positives = 246/463 (53%), Gaps = 32/463 (6%)
Query: 49 HGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLS 108
+GF T + + +DHFS+ + TF RY++N ++ + PI + G E I Q +
Sbjct: 20 YGFVTKYIDMPLDHFSFTTNT--TFKLRYLVNDSYFSN-DQPIFFYTGNEGDISMFAQNT 76
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
GF +E A + ALI+ EHR+YG+++PFG+ + + + GY +S QALADY ++ ++
Sbjct: 77 GFLFELAEKMGALIIFAEHRFYGETLPFGNE--SYSSPKTLGYLSSQQALADYVYLIDNL 134
Query: 169 KDKY-----NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+ KY + PVI+ G SYGG LA W R+KYP+ V+G++ASSAPI + ITP
Sbjct: 135 QKKYVNELSSLRKLPVISFGGSYGGMLAAWLRIKYPYSVLGAIASSAPIWQFQGITPCQN 194
Query: 224 YHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK-- 280
++ +V+ S C +TIKKSWS IAS G +SK F CT LK+ +L
Sbjct: 195 FNKIVTDVIGALGSATCVETIKKSWSIFRKIASTDIGKQNISKTFDLCTSLKSDDDLNTF 254
Query: 281 -DGLDTVYSEAAQYDTP---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEAD 330
+ L +Y+ + + P PV+ C+ +++ DD L K + V+
Sbjct: 255 LNWLSEMYTMIVEVNYPYPNSFLVPLPGNPVREFCSRMDSVNYNNDDGLIKALSTGVQLF 314
Query: 331 SLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNC 390
+ Y G I + + + GW++Q+C++M++PM +++ WN T+Y +C
Sbjct: 315 T-NYTGTTKCNNIGQTASPSLGELGWDFQACTDMIMPMCSTDEDLFENAAWNFTEYSDDC 373
Query: 391 KEQYGVSPRPSWV-LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
+Q+GV PR V + +GG +I+ + SNI+FSNG+ DP+S GG + +
Sbjct: 374 YKQFGVRPRNEEVPILEFGGTEIE-----TASNIVFSNGLLDPWSSGGVIANVS--AQVW 426
Query: 450 SQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
S + G+H DL A + D D + R K +Q W+ ++Y
Sbjct: 427 SILMPNGAHHSDLRSANELDADDVKSARLFHEKHIQKWLDKFY 469
>gi|195998996|ref|XP_002109366.1| hypothetical protein TRIADDRAFT_21030 [Trichoplax adhaerens]
gi|190587490|gb|EDV27532.1| hypothetical protein TRIADDRAFT_21030 [Trichoplax adhaerens]
Length = 469
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 156/460 (33%), Positives = 238/460 (51%), Gaps = 42/460 (9%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
++T +++Q IDHF + +S T+ QRY++N W G PI + G E I Q SG
Sbjct: 30 YKTKYFDQIIDHFDW--KSNATYRQRYLMNDDHWDKGTGPIFFYTGNEGGIVGFWQNSGL 87
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A QF+ALIV EHRYYGKS+PFG KN G S QALADYA +L +K
Sbjct: 88 LFDLAPQFRALIVFGEHRYYGKSLPFGKDSFKPKNL---GLLTSEQALADYAVLLTSLKK 144
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
NA V+A G SYGG L W RLKYP+++ LA+SAP+ G + + V+K
Sbjct: 145 SLNANKCKVVAFGGSYGGMLTAWMRLKYPNIIDAGLAASAPLYMAGGVVSPNFFFPAVTK 204
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV---Y 287
D++D + +C I+K++S + +A K ++K F C LK + ++K + + +
Sbjct: 205 DYQDANPKCVPNIRKAFSAVLEMA-KSKSKQKVAKIFNVCNKLKTSADVKQLIGWIRNGF 263
Query: 288 SEAAQYDTP---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNN 338
A D P PV C I NA + G+ +A L Y
Sbjct: 264 VSMAMGDYPYPASFFGPLPAFPVNASCKYIVNASH--------PIQGMAKAMKLFYGSKK 315
Query: 339 S----RCYIN-EDRTG---DESDEGWEWQSCSEMVVPMG-KDKNSMYQPEPWNLTKYIKN 389
+ Y++ D TG S W++Q+C+E+++P +K M+ P P+ +
Sbjct: 316 CHDIYKQYVHCADPTGCGTGASAIPWDYQACTEILLPGSTNNKTDMFPPIPFTSKIRKQY 375
Query: 390 CKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
C ++YGV+PRP+WV T + + +K SNIIFSNG DP+ GG +K+ ++
Sbjct: 376 CLKKYGVTPRPNWVATQFWANRLK-----GASNIIFSNGNLDPWKNGGILKSPSSSLVAI 430
Query: 450 SQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
+ G+H LDL K+DP +++ RK E K+++ WI+
Sbjct: 431 --QIPHGAHHLDLRGKNKNDPASVIKARKMEAKLIKKWIS 468
>gi|330843655|ref|XP_003293764.1| hypothetical protein DICPUDRAFT_158674 [Dictyostelium purpureum]
gi|325075873|gb|EGC29712.1| hypothetical protein DICPUDRAFT_158674 [Dictyostelium purpureum]
Length = 503
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 150/463 (32%), Positives = 246/463 (53%), Gaps = 44/463 (9%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN-------SPILAFLGAEAPIDD 103
++ ++Y QT+DHF++ ++ F QRY+I+ +W + PI+ + G E I
Sbjct: 54 YKEYWYMQTLDHFNF--QTKGQFAQRYLISDTYWNKPSPSSKVCSGPIIFYTGNEGDIVW 111
Query: 104 NIQLSGF-TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA 162
+ S F T A + AL+ EHRYYG+++PFG+ +N GY S QALADYA
Sbjct: 112 FYENSQFITNVLAKEMGALLFFAEHRYYGETLPFGNESLTPENT---GYLTSEQALADYA 168
Query: 163 SILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL-YYGDITPW 221
++ + A H PVI++G SYGG L WFR+KYP++V LA+SAPIL +Y
Sbjct: 169 ELIPSVLADLGAEHCPVISVGGSYGGMLTAWFRMKYPNIVDAGLAASAPILMFYKTGASQ 228
Query: 222 TTYHSVVSKDFRDTSEE--CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTEL 279
++ + + DF+ TSEE C I+ +++ I I+ + GL L+ F C L +L
Sbjct: 229 EGFNQIATDDFKQTSEEGTCASRIRNAFNSIMEISQQTGGLQQLTNTFSLCDSLNQVGDL 288
Query: 280 KDGLDT--VYSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEAD 330
+ +++ Y A Y P+ P+ C A+ DDI G+++
Sbjct: 289 VNWIESGLTYMAMADYPYPAGFLEPMPGYPINVSCEAMATTT---DDI-----QGLLKVL 340
Query: 331 SLEYD-GNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYI 387
++ Y+ ++CY + T + W+ Q+C+EM++P+ + M+ P+NL +
Sbjct: 341 NVYYNYTGTTQCYNTSVFTTSALGSDAWDVQACNEMIMPISSNGVQDMFPAAPFNLQQLT 400
Query: 388 KNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDL 447
C++ +G++P +W+ TYYGG + +TSN+IFSNG+ DP+ GG +K Y D
Sbjct: 401 SYCQQTWGITPGVNWITTYYGGSNF------TTSNLIFSNGVLDPWRAGGVLKDYG--DS 452
Query: 448 SFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
++ G+H LDL ++DPD ++Q R TE K++Q W +
Sbjct: 453 VIHIIIDGGAHHLDLRMPNEADPDSVIQARITETKLLQMWANE 495
>gi|241753992|ref|XP_002401170.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215508367|gb|EEC17821.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 493
Score = 234 bits (597), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 144/451 (31%), Positives = 243/451 (53%), Gaps = 31/451 (6%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+ IDHF++ S TF RY + ++W PI + G E I++ I +G +E A
Sbjct: 33 FRTKIDHFTF--HSSDTFVMRYAVADQYWDFDGGPIFFYTGNENAIENFINHTGLMWEWA 90
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-A 174
+FKA++V EHR+YG+S+PFG+R +L++ H GY ++ Q LADYA +++H+K+ A
Sbjct: 91 PEFKAMLVFAEHRFYGESMPFGNR--SLESPHHLGYLSTDQVLADYADLIIHLKESVRGA 148
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
+ +PVI+ G SYGG L+ W R++YPH+V SLASSAP+ + + P ++ + V+++ FR
Sbjct: 149 SESPVISFGGSYGGMLSAWIRIRYPHLVSASLASSAPVHMFPGLVPCSSLNRVLTETFRR 208
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK--NTTELKDGLDTVYSEAA- 291
S C TI+KSW ++ S +G + KF C L+ + +D L VYS A
Sbjct: 209 ESPVCSATIRKSWPILEAKFSTVEGRKSIQDKFHLCQSLQEEDYVTFRDFLHDVYSNMAL 268
Query: 292 -QYDTPS-------NIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYI 343
Y PS PVK C + + D+ + + VV ++ ++ +R
Sbjct: 269 VNYADPSVFLTPLPGYPVKESCKFLTRS-FVNDEAIVDAVSQVV---NIFFNTTGTRQCN 324
Query: 344 NEDRTGDESDE---GWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPR 399
+ + D + W++Q C+E+V+P D M+ P WN+T+ I++C++++ V+P
Sbjct: 325 DINAFHDVLNSVLIAWDFQGCTELVMPTCSDGLTDMFYPLIWNVTETIQDCQQRFNVTPD 384
Query: 400 PSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHC 459
+ YGG +++ S SNIIFSNG DP++ G +++ D + + +H
Sbjct: 385 LYKAVMTYGGRNME-----SASNIIFSNGDADPWAGVGLMESIS--DTVVAIVIPGAAHH 437
Query: 460 LDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
DL + + +P + R E K ++ WI +
Sbjct: 438 YDLRFSSREEPLAVKAARGLEKKYIRDWIRK 468
>gi|219362661|ref|NP_001136852.1| uncharacterized protein LOC100217003 [Zea mays]
gi|194697358|gb|ACF82763.1| unknown [Zea mays]
Length = 319
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 132/310 (42%), Positives = 181/310 (58%), Gaps = 20/310 (6%)
Query: 194 FRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNI 253
RLKYPH+ IG+LASSAPIL + DI P T ++ +VS DFR S CF TIK SW E+D+
Sbjct: 1 MRLKYPHIAIGALASSAPILQFEDIVPSTIFYDLVSDDFRRESLSCFLTIKDSWKELDDQ 60
Query: 254 ASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYD----------TPSNIPVKR 303
A++ DGL LSK F C LK + +L D L + YS A D P+N P+K
Sbjct: 61 ANEQDGLLKLSKTFHLCQTLKTSGDLSDWLSSAYSYLAMVDYPLPSEFLMPLPAN-PIKE 119
Query: 304 ICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSE 363
+C I++ P G L +I AGV Y G +N+D G GW+WQ+C+E
Sbjct: 120 VCRNIDSQPK-GTGTLERIYAGV--NVYYNYTGTVDCFDLNDDPHG---MGGWDWQACTE 173
Query: 364 MVVPMG-KDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSN 422
MV+PM ++ SMY P ++ Y ++C + YGV PRP W+ T +GGH+I +L SN
Sbjct: 174 MVMPMSYSEQRSMYPPYKFDYASYAEDCIKSYGVRPRPKWITTEFGGHNITKVLENFGSN 233
Query: 423 IIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVK 482
IIF NG+ DP+S GG +K ++ LG+H +DL A DPDWLV R++E++
Sbjct: 234 IIFFNGLLDPWSGGGVLKNISESVVAIVAP--LGAHHIDLRPATPDDPDWLVALRESELE 291
Query: 483 IMQGWITQYY 492
I+ GW+ YY
Sbjct: 292 IISGWLWDYY 301
>gi|148674771|gb|EDL06718.1| prolylcarboxypeptidase (angiotensinase C) [Mus musculus]
Length = 437
Score = 231 bits (590), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 153/451 (33%), Positives = 229/451 (50%), Gaps = 64/451 (14%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q +DHF + +TF QRY++ K W IL + G E I +GF
Sbjct: 46 YSVLYFEQKVDHFGFA--DMRTFKQRYLVADKHWQRNGGSILFYTGNEGDIVWFCNNTGF 103
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG + + K+++H + S QALAD+A ++ H++
Sbjct: 104 MWDVAEELKAMLVFAEHRYYGESLPFG--QDSFKDSQHLNFLTSEQALADFAELIRHLEK 161
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A PVIAIG SYGG LA WFR+KYPH+V+G L S IL+ +P T+
Sbjct: 162 TIPGAQGQPVIAIGGSYGGMLAAWFRMKYPHIVVGGLQSLTNILHL--CSPLTS------ 213
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
E T+ K W IA LA+++ Y
Sbjct: 214 --------EKIPTL-KGW-----IAETWVNLAMVN----------------------YPY 237
Query: 290 AAQYDTP-SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSL--EYDGNNSRCYINED 346
A + P P+K +C ++N PN D +L + + +A S+ Y G + I++
Sbjct: 238 ACNFLQPLPAWPIKEVCQYLKN-PNVSDTVLLQ---NIFQALSVYYNYSGQAACLNISQT 293
Query: 347 RTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT 405
T GW +Q+C+EMV+P + + M++P W+L KY +C Q+GV PRP W+ T
Sbjct: 294 TTSSLGSMGWSFQACTEMVMPFCTNGIDDMFEPFLWDLEKYSNDCFNQWGVKPRPHWMTT 353
Query: 406 YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEA 465
YGG +I S SNIIFSNG DP+S GG T D + +++ G+H LDL
Sbjct: 354 MYGGKNIS-----SHSNIIFSNGELDPWSGGG--VTRDITDTLVAINIHDGAHHLDLRAH 406
Query: 466 KKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
DP ++ R EVK M+ WI +Y + +
Sbjct: 407 NAFDPSSVLLSRLLEVKHMKKWILDFYSNIQ 437
>gi|403301430|ref|XP_003941393.1| PREDICTED: dipeptidyl peptidase 2 [Saimiri boliviensis boliviensis]
Length = 513
Score = 231 bits (588), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 149/462 (32%), Positives = 235/462 (50%), Gaps = 30/462 (6%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
++ + GF+ ++ Q +DHF++ +TFPQR +++ KFW G PI + G E + +
Sbjct: 46 RRAPDPGFQERYFQQRLDHFNFERLGNKTFPQRVLVSDKFWIRGEGPIFFYTGNEGDVWN 105
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
SGF E A + AL+V EHRYYGKS+PFG R H G QALAD+A
Sbjct: 106 FANNSGFIAELAAEQGALLVFAEHRYYGKSLPFGERST---QRGHTGLLTVEQALADFAE 162
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+L ++ A P IA G SYGG L+ + R+KYPH+V G+LA+SAPIL + T
Sbjct: 163 LLRALRRDLGAQDVPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPILAVAGLGDSTQ 222
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKD 281
+ V+ DF D +C Q +++++ +I ++ + +S +F TC PL K+ T+L
Sbjct: 223 FFRDVTADFEDQGPKCTQAVREAFRQIKDLFLQ-GAYDKVSWEFGTCQPLLEKDLTQLFV 281
Query: 282 GLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLE---YDGNN 338
++ A D P P + N G D L A +V +L Y+ +
Sbjct: 282 FARNAFTVLAMMDYP--YPTDFLGPLPANPVKVGCDRLLSEAQRIVGLRALTGLVYNASG 339
Query: 339 S-------RCYIN-EDRTG---DESDEGWEWQSCSEMVVPMGKDKNS-MYQPEPWNLTKY 386
S R Y N D TG W++Q+C+E+ + + + + M+ P+
Sbjct: 340 SEHCYDIYRLYHNCADPTGCGTGPDARAWDYQACTEINLTLASNNMTDMFPDLPFTEELR 399
Query: 387 IKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
+ C++ +GV PRP W+ T + G D+ R+ SNIIFSNG DP++ GG +
Sbjct: 400 REYCRDTWGVWPRPDWLQTNFWGGDL-----RAASNIIFSNGNLDPWAGGGIQRNLSTSV 454
Query: 447 LSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
++ + + G+H LDL + DP +V+ RK E ++ W+
Sbjct: 455 IAVT--IQGGAHHLDLRASHPEDPASVVEARKLEAAVIGEWV 494
>gi|290997696|ref|XP_002681417.1| lysosomal carboxypeptidase [Naegleria gruberi]
gi|284095041|gb|EFC48673.1| lysosomal carboxypeptidase [Naegleria gruberi]
Length = 528
Score = 231 bits (588), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 147/485 (30%), Positives = 246/485 (50%), Gaps = 34/485 (7%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG--GGNSP 90
L R + ++ + ++T +++Q +DHF + + +TF QRY++ F G ++P
Sbjct: 42 LSRKQSNKSTRMASALPPYKTLYFDQKLDHFDFTND--KTFKQRYLVCDSFVGKMTPSTP 99
Query: 91 ILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRG 150
I + G E I + +G ++ A QF ALI+ +EHRYYG S PFG + +
Sbjct: 100 IFFYTGNEGDIVTFYENTGLMFDTAPQFNALIIFVEHRYYGVSNPFGPVNSF--TPENIK 157
Query: 151 YFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
+ +S QALADY+ + + PVIA G SYGG L++W+R+KYPH+V G++A+SA
Sbjct: 158 WLSSEQALADYSYFITEMFGLDEKRTNPVIAFGGSYGGMLSSWWRMKYPHIVDGAIAASA 217
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTS------EECFQTIKKSWSEIDNI--ASKPDGLAI 262
PI + +T Y+ + ++DF+ +S E C IK S ++ + P L
Sbjct: 218 PIFQFTGLTAPNVYNQICTEDFKKSSNLAKYHETCDAVIKNGLSILNQYYQNNNPQILQK 277
Query: 263 LSKKFKTCTPLKNTT----ELKDGLDTVYSEAAQYDTP---------SNIPVKRICNAIE 309
LS +F+ C+P T+ +L L Y+ D P P++ IC+ I
Sbjct: 278 LSSQFRICSPGIRTSADVNQLISWLTQAYNTLPMIDYPYPANFLMPLPGYPIQVICDRIS 337
Query: 310 NAPNCGDDILCKIAAGVVEADSLEYD--GNNSRCYINEDRTGDESDEGWEWQSCSEMVVP 367
V+E ++ Y+ G ++ + + + D+GWE+Q+C+EMV+P
Sbjct: 338 TIMASEKVTTDTYLKAVLEGANVYYNYTGTSTCNNLTQPDSPSLGDDGWEYQTCNEMVMP 397
Query: 368 MGK-DKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFS 426
+G + M+ P PW+L +I C++++ V+PR +W +T YGG + IL +NI+FS
Sbjct: 398 IGNYPQTDMFIPAPWDLQAWISYCQQKWKVTPRTNWAITNYGGK--RAILE--ATNIVFS 453
Query: 427 NGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQG 486
NG DP+ GG + + + G+H LDL + DP + R EVK +
Sbjct: 454 NGDLDPWHGGGVLPGMKVNEKVKVVYIEGGAHHLDLRSSNPLDPQSVRLARALEVKEITA 513
Query: 487 WITQY 491
W+ ++
Sbjct: 514 WLNEF 518
>gi|2827710|emb|CAA16683.1| lysosomal Pro-X carboxypeptidase - like protein [Arabidopsis
thaliana]
Length = 499
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 153/421 (36%), Positives = 216/421 (51%), Gaps = 49/421 (11%)
Query: 11 LLLMVIFVSTSFHANGLKLR-----PRLGRI----RRSRILE-QKDSN-HGFETFFYNQT 59
LLL+ F + F +NG L PR R R +RI + + D N + +ET F++Q
Sbjct: 7 LLLIFTFFTLVFPSNGSSLSSSKLLPRFPRYTFQNREARIQQFRGDRNEYRYETKFFSQQ 66
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHFS+ F QRY+INS W G ++ PI + G E I+ SGF ++ A +
Sbjct: 67 LDHFSFA--DLPKFSQRYLINSDHWLGASALGPIFLYCGNEGDIEWFATNSGFIWDIAPK 124
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
F AL+V EHRYYG+S+P+GSR+ A KNA Y + QALAD+A + +K +A
Sbjct: 125 FGALLVFPEHRYYGESMPYGSREEAYKNATTLSYLTTEQALADFAVFVTDLKRNLSAEAC 184
Query: 178 PVIAIGASYGGE--------------LATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
PV+ G SYGG LA W RLKYPH+ IG+LASSAPIL + D+ P T
Sbjct: 185 PVVLFGGSYGGSNNCVFVFVVIDATVLAAWMRLKYPHIAIGALASSAPILQFEDVVPPET 244
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL 283
++ + S DF+ S CF TIK SW I K +GL L+K F C L +T +L D L
Sbjct: 245 FYDIASNDFKRESSSCFNTIKDSWDAIIAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWL 304
Query: 284 DTVYSEAAQYDTP---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEY 334
D+ YS A D P P++ +C I+ A IL +I AG+ + Y
Sbjct: 305 DSAYSYLAMVDYPYPADFMMPLPGHPIREVCRKIDGA-GSNASILDRIYAGI--SVYYNY 361
Query: 335 DGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKD---KNSMYQPEPWNLTKYIKNCK 391
GN +++D G +GW WQ + + G + N + +PW+ +KN
Sbjct: 362 TGNVDCFKLDDDPHG---LDGWNWQDIATTLKSFGSNIIFSNGLL--DPWSGGSVLKNLS 416
Query: 392 E 392
+
Sbjct: 417 D 417
>gi|156403949|ref|XP_001640170.1| predicted protein [Nematostella vectensis]
gi|156227303|gb|EDO48107.1| predicted protein [Nematostella vectensis]
Length = 446
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 148/466 (31%), Positives = 239/466 (51%), Gaps = 49/466 (10%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F+T + QT+DHF++ TF QRY+ K+W G PI + G E I + SGF
Sbjct: 3 FKTGTFEQTVDHFNFIQSG--TFKQRYLYTEKYWDG-KGPIFFYSGNEGGITGFWENSGF 59
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+E A F AL++ EHRYYG+S+PFG ++N GY + QALAD+A+++ +K
Sbjct: 60 VFEAAKNFSALVIFGEHRYYGESLPFGQDSFKIENI---GYLSIEQALADFATLIPALKK 116
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
++ A PV++ G SYGG L+ + R KYP+V+ +LA+SAPI + D++ + V++
Sbjct: 117 QFKAEEKPVVSFGGSYGGMLSAYLRFKYPNVIQAALAASAPIYFIADLSIRDFFFPAVTR 176
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKP-DGLAILSKKFKTCTPLKNTTELKDGL------ 283
DF++ +C ++ + E+DN+ + GL +SK FK C PLK+ ++ +
Sbjct: 177 DFKNADPKCPDLVRAGFIELDNLKKEGLKGLDAISKAFKLCKPLKSADQINHLIGWIRNA 236
Query: 284 -------DTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDG 336
D Y+ P+N PV C + A D L +G+ +A L Y+G
Sbjct: 237 FTIIAMCDYPYATDFLAPLPAN-PVNYACKLLATA----SDRL----SGLADAAGLAYNG 287
Query: 337 NNS--RCYIN-------EDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNL 383
+ +C+ D TG ++ W++Q+C+E+ +P G + M+ PW L
Sbjct: 288 TSGTLKCFDPWTEFVECADPTGCGLGNANLAWDYQACTELPMPAGTNNVTDMFPVLPWTL 347
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
C++ + V PR W G DI + SNIIFSNG DP+ GG +K+
Sbjct: 348 DMRADYCQKHWQVKPRLEWPGISLWGRDIS-----TASNIIFSNGNLDPWRPGGVLKSVS 402
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
+ + G+H LDL + DP +V R+ E+++++ WI+
Sbjct: 403 --PSLVAVLVEGGAHHLDLRSSNPEDPPSVVAAREMELELIRKWIS 446
>gi|390335012|ref|XP_797434.3| PREDICTED: dipeptidyl peptidase 2-like [Strongylocentrotus
purpuratus]
Length = 487
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 145/472 (30%), Positives = 235/472 (49%), Gaps = 52/472 (11%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
F++Q IDHF+ QTF QRY + +W G+ PI + G E I+ +G+ ++
Sbjct: 36 FFDQFIDHFNSESHGKQTFRQRYFVTDDYWQKGSGPIFFYTGNEGAIESFFDNTGYIFDI 95
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A +F AL++ EHRYYGKS+P G++ N G QALADYA+++ +K++
Sbjct: 96 APEFGALVIFAEHRYYGKSLPLGNQSFTPAN---LGLLTVEQALADYATLITSLKEEPGL 152
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
+P++ G SYGG L+ + R+KYPH+V G+LA+SAP+ + + + + V+KDF D
Sbjct: 153 QDSPLVVFGGSYGGMLSAYMRMKYPHLVAGALAASAPVYSTANESSQSYFFQTVTKDFYD 212
Query: 235 TSEECFQTIKKSWSEIDNIASK-PDGLAILSKKFKTCTPL-------------KNTTELK 280
C ++ ++ +D +A + GL +S F C L +N+ +
Sbjct: 213 VDTNCPDLVRAGFATLDQLAGQGTSGLNKISTAFHLCKSLVSKADYYHLLGWIRNSFTYQ 272
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS- 339
LD YS + P+N PV C+ + NA G+ +A L Y+ +++
Sbjct: 273 AMLDYPYSASFMGSLPAN-PVNVSCHMLLNA--------STPLEGLAQASGLYYNTSSTP 323
Query: 340 -RCY-------INEDRTG---DESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYI 387
+C+ D TG W++Q+C+E+ +P G + K M+ P+
Sbjct: 324 LQCFDIFAEFIQCADPTGCGLGTDSTAWDYQACTEITLPAGSNGKTDMFPDMPFTPDMRT 383
Query: 388 KNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDL 447
C+ Y V+PRP W+ G + S+SNIIFSNG+ DP+ GG K DL
Sbjct: 384 SYCQSVYNVTPRPDWLSIQGFGKGLA-----SSSNIIFSNGLLDPWRLGGVAK-----DL 433
Query: 448 S---FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFK 496
S + + G+H LDL + DP+ +++ R E I++GW+ +Y K
Sbjct: 434 SESIIAIPVPGGAHHLDLRGSNPKDPESVIEARSQEKLIIKGWVEAWYRGHK 485
>gi|66805245|ref|XP_636355.1| peptidase S28 family protein [Dictyostelium discoideum AX4]
gi|60464723|gb|EAL62849.1| peptidase S28 family protein [Dictyostelium discoideum AX4]
Length = 513
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 154/487 (31%), Positives = 239/487 (49%), Gaps = 58/487 (11%)
Query: 35 RIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS----- 89
RI + ++ + ++NQT+DHF++ Y F QR +I +++ +
Sbjct: 40 RINKVKLNLSDPVPTPYTLLWFNQTLDHFNFETSGY--FNQRVLIIDQYFNEKSKNEIDQ 97
Query: 90 ----PILAFLGAEAPIDDNIQLSGFTYEN--------AHQFKALIVILEHRYYGKSVPFG 137
P++ F G E + F YEN A + AL++ EHRYYG+S+PFG
Sbjct: 98 ICTKPLIFFCGNEGDVT-------FFYENSLFITNTLAQEMNALVIFAEHRYYGESLPFG 150
Query: 138 SRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLK 197
++ +N + Y +S QALADY+ I+ I +YNA + PV SYGG+LA W RLK
Sbjct: 151 NQSYTNENFQ---YLSSEQALADYSKIIPSILKQYNALNCPVFTTSGSYGGDLAAWMRLK 207
Query: 198 YPHVVIGSLASSAPIL-YYGDITPWTTYHSVVSKDFRDTSEE--CFQTIKKSWSEIDNIA 254
YP +V G+LASSAP+L Y G P+ + V+ DF++TS++ C I+ ++++++ IA
Sbjct: 208 YPFIVDGALASSAPLLSYMGTGVPYDVFPVGVTNDFKETSQDGSCAIKIRNAFNDLETIA 267
Query: 255 SKPDGLAILSKKFKTCTPLKNTTELKDGLDTV-----YSEAAQYDTPSNI-------PVK 302
+G +S FK CTP+ + + + L V Y A Y P++ PV
Sbjct: 268 KADNGFNEISTSFKLCTPINSNDDFQSFLGWVESGFSYMSMADYPYPASFLEPMMGNPVN 327
Query: 303 RICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDE-GWEWQSC 361
CN I N D I+ + Y G +C+ D+ W +QSC
Sbjct: 328 ETCNLINQLDNSIDIIMSGLQI------YYNYTGQMMQCFNTNIFIEDQGMLIPWSYQSC 381
Query: 362 SEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRST 420
+E V P M+ P+NLT+YI+NC+E+Y V+P P+WV + YGG +
Sbjct: 382 TEFVFPFTTTGIKDMFYYSPFNLTEYIENCQEEYNVTPDPNWVTSVYGGTP-----NFPS 436
Query: 421 SNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTE 480
SNIIFSNG+ D + G + + + + +H LDL + DP + R E
Sbjct: 437 SNIIFSNGVLDGW-HGAGINVTDYSKNIIAILIPGAAHHLDLRGSNPLDPQSITDARLLE 495
Query: 481 VKIMQGW 487
+K + W
Sbjct: 496 LKYLTEW 502
>gi|391328440|ref|XP_003738697.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Metaseiulus
occidentalis]
Length = 459
Score = 228 bits (580), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 139/456 (30%), Positives = 240/456 (52%), Gaps = 33/456 (7%)
Query: 49 HGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN-SPILAFLGAEAPIDDNIQL 107
+ ++TF++ IDHFS+ +F R + + K++ P+ + G E I+
Sbjct: 18 YDYQTFWFETKIDHFSFARN--DSFKMRVLYSDKYFDSSEPGPVFFYTGNEGDIETFTNN 75
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
+G ++ A FKAL++ EHR+YGKS+PFG + + + GY + QALAD+A ++ H
Sbjct: 76 TGLMWDWAADFKALLIFAEHRFYGKSMPFGDK--SYDTYKQYGYLTAEQALADFADLIQH 133
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+K+ + V+A G SYGG L+ W R+KYP ++ ++A+SAPIL + D+T + +
Sbjct: 134 VKNNWPVKK--VVAFGGSYGGMLSAWMRIKYPWLIDAAIAASAPILQFQDVTACGVFDKI 191
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASK-PDGLAILSKKFKTCTPL--KNTTELKDGLD 284
V+K F SE C I++SW ++ + +G A++ + F+ C + N T ++D L
Sbjct: 192 VTKAFAKASERCADNIRRSWIALEKLGKDGENGSALIRENFRICQNVLPSNYTAVRDWLH 251
Query: 285 TVYSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD 335
Y A + P PV+ C+ ++ +C ++L GV +A ++ ++
Sbjct: 252 DTYGNLAMINYPYGTNFLKKVPGHPVQVSCSFLDKDFHCDAELL----KGVYQAINVFHN 307
Query: 336 -GNNSRCYINEDRTGDE-SDEGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKE 392
+++C + GD SD GW Q+C+EMV+P G + M+ P WN T++ K+C++
Sbjct: 308 FSGDTQCNDVGNSGGDNISDAGWNIQTCNEMVMPFCGDGQEDMFYPYSWNFTQFRKDCEK 367
Query: 393 QYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD 452
+YG++P + +GG DI + SNI+FSNG DP+ GG +K +
Sbjct: 368 KYGMTPDLNIARRMFGGRDIS-----AASNIVFSNGDLDPWCGGGVLKQLN--PTLPVVI 420
Query: 453 LNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ G+H DL A D ++ R E + ++ WI
Sbjct: 421 IEGGAHHYDLRSASPLDTPAVISARNVEKEYIKLWI 456
>gi|440795690|gb|ELR16807.1| lysosomal ProX carboxypeptidase [Acanthamoeba castellanii str.
Neff]
Length = 489
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 146/465 (31%), Positives = 243/465 (52%), Gaps = 40/465 (8%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW-----GGGNSPILAFLGAEAPIDDNI 105
+ T +++QT+DHF++ + T+ QR+++ +W GG PI + G E+P+ D
Sbjct: 39 YRTLYFDQTLDHFNFATKP-ATYKQRFLMADDYWRGSYPGGCPGPIFFYTGNESPVTDYY 97
Query: 106 QLSGF-TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
+GF T A + AL+V EHRY+G+S+PFGS+ + Y + QALADYA +
Sbjct: 98 AGAGFFTQVLAPKHNALLVFAEHRYFGESMPFGSKSF---DPEKISYLSPEQALADYAVL 154
Query: 165 LLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYG-DITPWT 222
+ H+K+ +A + PV A G SYGG L WFR KYP +V+G L++SAP+ +YG I+P+
Sbjct: 155 ITHLKETLPHAKNCPVFAFGGSYGGILTAWFRSKYPDIVMGGLSASAPLAFYGTGISPY- 213
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK-- 280
+ S F C + +++ + +++ DG A S FK C+PL + + +
Sbjct: 214 AFTDSASDTFAQARPGCAPLVSRAFDVLQKLSATSDGRARFSAAFKLCSPLNSQADAEAV 273
Query: 281 -DGLDTVYSEAAQYDTP---------SNIPVKRICN-AIENAPNCGDDILCKIAAGVVEA 329
+ +D+ A D P PV R C+ +E A + DD+L A A
Sbjct: 274 INWVDSGLIGMAMLDYPFATNYGISLPGWPVNRTCDRLLEKATSNDDDVL---AEAFAYA 330
Query: 330 DSLEYDGNNSR-CY-INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYI 387
+ Y+ + CY IN D GW++ C+E+ +P G + ++ +NLT I
Sbjct: 331 IGVFYNNTGAHTCYDINRDVPDWGKCCGWDYLHCTEVYIPSG--SSGIFPRAAYNLTSDI 388
Query: 388 KNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDL 447
C++Q+GV+ RP+W +GG ++ S+SNIIFSNG+ DP+ G + + D
Sbjct: 389 AQCQQQFGVTLRPNWARIQFGGFNLT-----SSSNIIFSNGLLDPWHTSGVLHSLS--DS 441
Query: 448 SFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+ + +H LDL DP ++ + R+ E +++ W+ +Y+
Sbjct: 442 LVAIVIPEAAHHLDLWAPSPEDPVYVQRAREQEAMLIEKWLNEYW 486
>gi|380790295|gb|AFE67023.1| dipeptidyl peptidase 2 preproprotein [Macaca mulatta]
gi|383414091|gb|AFH30259.1| dipeptidyl peptidase 2 preproprotein [Macaca mulatta]
gi|384940796|gb|AFI34003.1| dipeptidyl peptidase 2 preproprotein [Macaca mulatta]
Length = 492
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 148/457 (32%), Positives = 227/457 (49%), Gaps = 31/457 (6%)
Query: 50 GFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSG 109
GF+ F+ Q +DHF++ QTFPQR++++ +FW G PI + G E + SG
Sbjct: 30 GFQERFFQQRLDHFNFERFGNQTFPQRFLVSDRFWIRGEGPIFFYTGNEGDVWAFANNSG 89
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
F E A + AL+V EHRYYGKS+PFG R H QALAD+A +L ++
Sbjct: 90 FIAELAAEQGALLVFAEHRYYGKSLPFGERS---TQRGHTELLTVEQALADFAELLRALR 146
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A AP IA G SYGG L+ + R+KYPH+V G+LA+SAP+L + + V+
Sbjct: 147 RDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVT 206
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTV 286
DF S +C Q +++++ +I ++ + + +F TC PL K+ T+L
Sbjct: 207 ADFEGQSPKCTQGVREAFGQIKDLFLQ-GAYDRVRWEFGTCQPLSDEKDLTQLFMFARNA 265
Query: 287 YSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEYDGNNSR-CY 342
++ A D P P + N G D L A G+ L Y+ + S CY
Sbjct: 266 FTVLAMMDYP--YPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVYNASGSEHCY 323
Query: 343 -------INEDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCK 391
D TG W++Q+C+E+ + + M+ P+ + + C
Sbjct: 324 DIYRLYRSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDERRQQYCL 383
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+ +GV PRP W+LT + G D+ R+ SNIIFSNG DP++ GG + ++ +
Sbjct: 384 DTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIAIT- 437
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ G+H LDL + DP +V+ RK E I+ W+
Sbjct: 438 -IQGGAHHLDLRASHPEDPVSVVEARKLEATIIGEWV 473
>gi|114627680|ref|XP_528471.2| PREDICTED: dipeptidyl peptidase 2 isoform 5 [Pan troglodytes]
Length = 492
Score = 225 bits (573), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 148/466 (31%), Positives = 234/466 (50%), Gaps = 37/466 (7%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
++ + GF+ F+ Q +DHF++ +TFPQR++++ +FW G PI + G E +
Sbjct: 24 RRAPDPGFQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGEGPIFFYTGNEGDVWA 83
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY---FNSAQALAD 160
SGF E A + AL+V EHRYYGKS+PFG++ +R RG+ QALAD
Sbjct: 84 FANNSGFVAELAAEQGALLVFAEHRYYGKSLPFGAQ------SRQRGHTELLTVEQALAD 137
Query: 161 YASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP 220
+A +L ++ A AP IA G SYGG L+ + R+KYPH+V G+LA+SAP+L +
Sbjct: 138 FAELLRALRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGD 197
Query: 221 WTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTT 277
+ V+ DF S +C Q +++++ +I ++ + + +F TC PL K+ T
Sbjct: 198 SNQFFRDVTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLT 256
Query: 278 ELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEY 334
+L ++ A D P P + N G D L A G+ L Y
Sbjct: 257 QLFMFARNAFTVLAMMDYP--YPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVY 314
Query: 335 DGNNSR-CY-------INEDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWN 382
+ + S CY D TG W++Q+C+E+ + + M+ P+
Sbjct: 315 NASGSEHCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFT 374
Query: 383 LTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
+ C + +GV PRP W+LT + G D+ R+ SNIIFSNG DP++ GG +
Sbjct: 375 DELRQRYCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNL 429
Query: 443 HFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
++ + + G+H LDL + DP +V+ RK E I+ W+
Sbjct: 430 SASVIAVT--IQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 473
>gi|443733182|gb|ELU17643.1| hypothetical protein CAPTEDRAFT_152094 [Capitella teleta]
Length = 472
Score = 224 bits (570), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 148/462 (32%), Positives = 234/462 (50%), Gaps = 51/462 (11%)
Query: 55 FYNQTIDHFS-----YGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSG 109
F+ QTIDHF+ YG +Y+ QRY+I K+W G PI + G E I +G
Sbjct: 34 FFEQTIDHFNSYWAQYGKRTYK---QRYLIQDKWWTPGKGPIFFYTGNEGDIATFWNNTG 90
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
F +E A +F ALIV EHRYYGKS+PFG R + +S QALAD+A +L H+K
Sbjct: 91 FMFEIAPKFNALIVFAEHRYYGKSLPFGERSF---KQPYISLLSSQQALADFAVLLNHLK 147
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
NAT VIA G SYGG L+ + R+KYP+++ GS+A+SAP+ G + + V+
Sbjct: 148 PSLNATDCKVIAFGGSYGGMLSAYMRIKYPNLIDGSIAASAPVYLIGGDSSRDFFFEDVT 207
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNT---TELKDGLDTV 286
DF+ + C + I+ +S++ +++S DGL +S F C +K T T L
Sbjct: 208 ADFQ--AAGCDKLIRDGFSKMASMSSTTDGLKKISSHFMLCKYMKTTSDFTHFLGWLRNA 265
Query: 287 YSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGN 337
++ A D P PV C A++N + C + G+ + S+ Y
Sbjct: 266 FTLMAMMDYPYPTDFMSKMPAWPVNAGCKAMKN-------VTCPV-KGLADLASIVYPYK 317
Query: 338 NSRCY---------INEDRTGDESDE-GWEWQSCSEMVVPMG-KDKNSMYQPEPWNLTKY 386
C+ + G D W++Q+C++ ++P G +K M+ P+ + +
Sbjct: 318 PDGCHDIWTDFVDCADPTGCGTGPDSYAWDYQACTDFLMPSGTNNKTDMFPILPFTMEQR 377
Query: 387 IKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
C++++GV+P W + G D+K T NI+FSNG+ DP+ RGG ++ D
Sbjct: 378 NSYCEKRWGVTPDVEWTKLSFWGKDLKY-----TGNIVFSNGLLDPWHRGGVLEDLS--D 430
Query: 447 LSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ + G+H LDL + + DP+ + R+ E+ I+ W+
Sbjct: 431 SLIAITIKEGAHHLDLRASNEHDPESVKVARQKEIDIITHWL 472
>gi|15080291|gb|AAH11907.1| Dipeptidyl-peptidase 7 [Homo sapiens]
gi|16877430|gb|AAH16961.1| Dipeptidyl-peptidase 7 [Homo sapiens]
gi|123981114|gb|ABM82386.1| dipeptidyl-peptidase 7 [synthetic construct]
gi|123995923|gb|ABM85563.1| dipeptidyl-peptidase 7 [synthetic construct]
gi|158257656|dbj|BAF84801.1| unnamed protein product [Homo sapiens]
Length = 492
Score = 224 bits (570), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 146/463 (31%), Positives = 230/463 (49%), Gaps = 31/463 (6%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
++ + GF+ F+ Q +DHF++ +TFPQR++++ +FW G PI + G E +
Sbjct: 24 RRAPDPGFQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGEGPIFFYTGNEGDVWA 83
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
SGF E A + AL+V EHRYYGKS+PFG++ H QALAD+A
Sbjct: 84 FANNSGFVAELAAERGALLVFAEHRYYGKSLPFGAQS---TQRGHTELLTVEQALADFAE 140
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+L ++ A AP IA G SYGG L+ + R+KYPH+V G+LA+SAP+L +
Sbjct: 141 LLRALRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQ 200
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELK 280
+ V+ DF S +C Q +++++ +I ++ + + +F TC PL K+ T+L
Sbjct: 201 FFRDVTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLF 259
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEYDGN 337
++ A D P P + N G D L A G+ L Y+ +
Sbjct: 260 MFARNAFTVLAMMDYP--YPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVYNAS 317
Query: 338 NSR-CY-------INEDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTK 385
S CY D TG W++Q+C+E+ + + M+ P+
Sbjct: 318 GSEHCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDEL 377
Query: 386 YIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
+ C + +GV PRP W+LT + G D+ R+ SNIIFSNG DP++ GG +
Sbjct: 378 RQRYCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSAS 432
Query: 446 DLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
++ + + G+H LDL + DP +V+ RK E I+ W+
Sbjct: 433 VIAVT--IQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 473
>gi|410224782|gb|JAA09610.1| dipeptidyl-peptidase 7 [Pan troglodytes]
gi|410251824|gb|JAA13879.1| dipeptidyl-peptidase 7 [Pan troglodytes]
Length = 492
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 146/463 (31%), Positives = 230/463 (49%), Gaps = 31/463 (6%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
++ + GF+ F+ Q +DHF++ +TFPQR++++ +FW G PI + G E +
Sbjct: 24 RRAPDPGFQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGEGPIFFYTGNEGDVWA 83
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
SGF E A + AL+V EHRYYGKS+PFG++ H QALAD+A
Sbjct: 84 FANNSGFVAELAAEQGALLVFAEHRYYGKSLPFGAQS---TQRGHTELLTVEQALADFAE 140
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+L ++ A AP IA G SYGG L+ + R+KYPH+V G+LA+SAP+L +
Sbjct: 141 LLRALRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQ 200
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELK 280
+ V+ DF S +C Q +++++ +I ++ + + +F TC PL K+ T+L
Sbjct: 201 FFRDVTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLF 259
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEYDGN 337
++ A D P P + N G D L A G+ L Y+ +
Sbjct: 260 MFARNAFTVLAMMDYP--YPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVYNAS 317
Query: 338 NSR-CY-------INEDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTK 385
S CY D TG W++Q+C+E+ + + M+ P+
Sbjct: 318 GSEHCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDEL 377
Query: 386 YIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
+ C + +GV PRP W+LT + G D+ R+ SNIIFSNG DP++ GG +
Sbjct: 378 RQRYCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSAS 432
Query: 446 DLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
++ + + G+H LDL + DP +V+ RK E I+ W+
Sbjct: 433 VIAVT--IQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 473
>gi|197099620|ref|NP_001127163.1| dipeptidyl peptidase 2 precursor [Pongo abelii]
gi|55725338|emb|CAH89533.1| hypothetical protein [Pongo abelii]
Length = 492
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 147/463 (31%), Positives = 230/463 (49%), Gaps = 31/463 (6%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
++ + GF+ F+ Q +DHF++ +TFPQR++++ +FW G PI + G E +
Sbjct: 24 RRAPDPGFQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWIRGEGPIFFYTGNEGDVWA 83
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
SGF E A + AL+V EHRYYGKS+PFG++ H QALAD+A
Sbjct: 84 FANNSGFVAELAAEQGALLVFAEHRYYGKSLPFGAQS---TRRGHTELLTVEQALADFAE 140
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+L ++ A AP IA G SYGG L+ + R+KYPH+V G+LA+SAP+L +
Sbjct: 141 LLRALRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQ 200
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELK 280
+ V+ DF S +C Q +++++ +I ++ + + +F TC PL K+ T+L
Sbjct: 201 FFRDVTADFEGQSPKCTQGVREAFRQIKDLFLR-GAYDTVRWEFGTCQPLSDEKDLTQLF 259
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEYDGN 337
++ A D P P + N G D L A G+ L Y+ +
Sbjct: 260 MFARNAFTVLAMMDYP--YPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVYNAS 317
Query: 338 NSR-CY-------INEDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTK 385
S CY D TG W++Q+C+E+ + + M+ P+
Sbjct: 318 GSEHCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDEL 377
Query: 386 YIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
+ C + +GV PRP W+LT + G D+ R+ SNIIFSNG DP++ GG K
Sbjct: 378 RQQYCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRKNLSAS 432
Query: 446 DLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
++ + + G+H LDL + DP +V+ RK E I+ W+
Sbjct: 433 VIAVT--IQGGAHHLDLRASHPEDPVSVVETRKLEATIIGEWV 473
>gi|402895977|ref|XP_003911085.1| PREDICTED: dipeptidyl peptidase 2 [Papio anubis]
Length = 492
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 146/457 (31%), Positives = 226/457 (49%), Gaps = 31/457 (6%)
Query: 50 GFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSG 109
GF+ F+ Q +DHF++ QTFPQR++++ +FW G PI + G E + SG
Sbjct: 30 GFQERFFQQRLDHFNFERFGNQTFPQRFLVSDRFWIRGEGPIFFYTGNEGDVWAFANNSG 89
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
F E A + AL+V EHRYYGKS+PFG R H QALAD+A +L ++
Sbjct: 90 FIAELAAEQGALLVFAEHRYYGKSLPFGERS---TQRGHTELLTVEQALADFAELLRALR 146
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A AP IA G SYGG L+ + R+KYPH+V G+LA+SAP+L + + V+
Sbjct: 147 RDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVT 206
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTV 286
DF S +C Q +++++ +I ++ + + +F TC PL K+ T+L
Sbjct: 207 ADFEGQSPKCTQGVREAFGQIKDLFLQ-GAYDRVRWEFGTCQPLSDKKDLTQLFMFARNA 265
Query: 287 YSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEYDGNNSR-CY 342
++ A D P P + N G D L A G+ L Y+ + S CY
Sbjct: 266 FTVLAMMDYP--YPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVYNASGSEHCY 323
Query: 343 -------INEDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCK 391
D TG W++Q+C+E+ + + M+ P+ + + C
Sbjct: 324 DIYRLYRSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDERRQQYCL 383
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+ +GV PRP W+LT + G D+K + SNIIFSNG DP++ GG + ++ +
Sbjct: 384 DTWGVWPRPDWLLTSFWGGDLK-----AASNIIFSNGNLDPWAGGGIRRNLSASVIAIT- 437
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ +H LDL + DP +V+ RK E ++ W+
Sbjct: 438 -IQGAAHHLDLRASHPEDPVSVVEARKLEATVIGEWV 473
>gi|118396082|ref|XP_001030384.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89284685|gb|EAR82721.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 495
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 155/481 (32%), Positives = 242/481 (50%), Gaps = 43/481 (8%)
Query: 41 ILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP 100
IL + ++T++++Q ++H + TF Q+Y++ F+ PIL + G E P
Sbjct: 10 ILAISCNGQDYKTYYFDQKVNHEGFEMNDL-TFKQKYLVKDDFYRYDKGPILFYCGNEGP 68
Query: 101 IDDNIQLSGF-TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALA 159
I+ +GF T+ A + L+V +EHRY+G+S PFG+ + +LK ++ Y S QAL
Sbjct: 69 IEMFYNNTGFQTHTLAKELNGLVVFMEHRYFGESWPFGNEEESLKKGNNK-YLTSLQALN 127
Query: 160 DYASILLHIKDKYNAT--HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGD 217
DY L K PVIAIG SYGG LA W R+K+P+VV SLA+SAPI + +
Sbjct: 128 DYVVFLNWFKKSLGCADDECPVIAIGGSYGGMLAAWIRMKFPNVVDASLAASAPIYQFLN 187
Query: 218 ITPW--TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDG-----------LAILS 264
T ++S++++++ C I +++ + NI P A +S
Sbjct: 188 REGLNQTLFYSIITRNYAQNG--CSDKIHQAYQYLTNIIDSPMSTKYFKYQYDSIFANIS 245
Query: 265 KKFKTCTPLKNT---TELKDGLDTVYSEAAQYDTPSNI---------PVKRICNAIENAP 312
+ TC P+ N+ T+L+ +DT YS A + P P C +E A
Sbjct: 246 QAMNTCEPITNSTGLTQLRTYMDTAYSYMAMTNYPQASSFLRSMPAWPANASCIPME-AV 304
Query: 313 NCGDDILCKIAAGVVEADSL-EYD--GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMG 369
N + +A + D+ YD N S +D D GW +CS+MV+PM
Sbjct: 305 NSNSTVFELFSAIKLSTDTFYNYDQSANCSDISQGDDGASDNDMSGWNILACSDMVLPMA 364
Query: 370 KD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGG-HDIKLILRRSTSNIIFSN 427
+ K M+ +PWN +Y + C YGV+P W L +YGG +D+++ + SNI FSN
Sbjct: 365 SNGKTDMFYNQPWNFEQYKEWCNYTYGVTPNYDWALDFYGGRNDLEM---ENFSNIFFSN 421
Query: 428 GMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGW 487
GM DP+S GG Y DL + + +H DL ++ DP+ +VQ R+ E+K ++ W
Sbjct: 422 GMLDPWS-GGSPTEYLSEDLP-TNYMYASAHHNDLRLPQEGDPESVVQGRELEIKYLKKW 479
Query: 488 I 488
I
Sbjct: 480 I 480
>gi|241161688|ref|XP_002408973.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215494442|gb|EEC04083.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 409
Score = 222 bits (566), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 142/408 (34%), Positives = 211/408 (51%), Gaps = 31/408 (7%)
Query: 49 HGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLS 108
+ ++T+++ +DHF Y + TF RY++ ++W PI + G E I+ +
Sbjct: 2 YSYDTYYFTTKVDHFGYA--NNDTFKMRYLVADQYWDHDGGPIFFYTGNEGDIEVFANNT 59
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
G ++ A +FKAL++ EHRYYGKS+P+G K + + GY QALADYA +L H
Sbjct: 60 GLMWDWAPEFKALLIFAEHRYYGKSMPYG--KESFEGPSRHGYLTVEQALADYADLLTHF 117
Query: 169 KDKYNAT-HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
K A + V++ G SYGG LA WFRLKYP G +SAPIL + ITP + V
Sbjct: 118 KAHVPAAGDSKVVSFGGSYGGMLAAWFRLKYPPRDHG--VTSAPILQFTGITPCNALNEV 175
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDT 285
V+K F S++C I+ S+ + N A+ +G L +KF+ C L N T L+D
Sbjct: 176 VTKAFAKESDQCTSAIRTSFEVMRNQAATEEGAKALEEKFRLCQALAPSNYTVLRDWFVD 235
Query: 286 VYSEAAQYDTP---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSL--EY 334
VY+ A + P PVK C +E N DD + G+ +A S+ Y
Sbjct: 236 VYTNLAMVNYPYANEFLGPVPGHPVKEACKFLEK--NFTDDQ--SLLDGIYQAISVFQNY 291
Query: 335 DGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQ 393
G + C + G GW+ QSC+EMV+PM + K M+ PW + C+++
Sbjct: 292 TG-QTHCNDLTNSAGTLDAGGWDIQSCNEMVMPMCSNGKTDMFFDNPWIIENVTAKCEQK 350
Query: 394 YGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKT 441
+G++P +GG +I + SNIIFSNG DP+S GG +K+
Sbjct: 351 FGLTPDVDKAALIFGGKNIS-----AASNIIFSNGDIDPWSGGGVLKS 393
>gi|6465985|gb|AAF12747.1|AF154502_1 quiescent cell proline dipeptidase [Homo sapiens]
Length = 492
Score = 221 bits (564), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 145/463 (31%), Positives = 229/463 (49%), Gaps = 31/463 (6%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
++ + GF+ F+ Q +DHF++ +TFPQR++++ +FW G P + G E +
Sbjct: 24 RRAPDPGFQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGEGPTFFYTGNEGDVWA 83
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
SGF E A + AL+V EHRYYGKS+PFG++ H QALAD+A
Sbjct: 84 FANNSGFVAELAAERGALLVFAEHRYYGKSLPFGAQS---TQRGHTELLTVEQALADFAE 140
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+L ++ A AP IA G SYGG L+ + R+KYPH+V G+LA+SAP+L +
Sbjct: 141 LLRALRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQ 200
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELK 280
+ V+ DF S +C Q +++++ +I ++ + + +F TC PL K+ T+L
Sbjct: 201 FFRDVTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLF 259
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEYDGN 337
++ A D P P + N G D L A G+ L Y+ +
Sbjct: 260 MFARNAFTVLAMMDYP--YPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVYNAS 317
Query: 338 NSR-CY-------INEDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTK 385
S CY D TG W++Q+C+E+ + + M+ P+
Sbjct: 318 GSEHCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDEL 377
Query: 386 YIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
+ C + +GV PRP W+LT + G D+ R+ SNIIFSNG DP++ GG +
Sbjct: 378 RQRYCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSAS 432
Query: 446 DLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
++ + + G+H LDL + DP +V+ RK E I+ W+
Sbjct: 433 VIAVT--IQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 473
>gi|296191232|ref|XP_002743536.1| PREDICTED: dipeptidyl peptidase 2 [Callithrix jacchus]
Length = 492
Score = 221 bits (564), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 145/457 (31%), Positives = 228/457 (49%), Gaps = 31/457 (6%)
Query: 50 GFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSG 109
GF+ ++ Q +DHF++ +TFPQR++++ +FW G PI + G E + SG
Sbjct: 30 GFQERYFQQLLDHFNFESFGNKTFPQRFLVSDRFWIRGKGPIFFYTGNEGDVWVFANNSG 89
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
F E A + AL+V EHRYYGKS+PFG R + G QALAD+A +L ++
Sbjct: 90 FIAELAAEQGALLVFAEHRYYGKSLPFGERS---TQRGYTGLLTVEQALADFAELLRALR 146
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A P IA G SYGG L+ + R+KYPH+V G+LA+SAP+L + T + V+
Sbjct: 147 RDLGAQDVPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSTQFFRDVT 206
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTV 286
DF +C Q +++++ +I ++ + +S +F TC PL K+ T+L
Sbjct: 207 ADFEGQGPKCTQAVREAFWQIRDLFLQ-GAYDKVSWEFGTCQPLLDEKDLTQLFMFARNA 265
Query: 287 YSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEYDGNNSR-CY 342
++ A D P P + N G D+L A AG+ L Y+ + S CY
Sbjct: 266 FTVLAMMDYP--YPTDFLGPLPANPVKVGCDLLLSEAQRIAGLRALAGLVYNASGSEHCY 323
Query: 343 -------INEDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCK 391
D TG W++Q+C+E+ + + M+ P+ + C
Sbjct: 324 DIYRLYHSCADPTGCGTGPDARAWDYQACTEINLMFASNNVTDMFPVLPFTDELRQQYCL 383
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+ +GV PRP W+ T + G D+ R+ SNIIFSNG DP++ GG + ++ +
Sbjct: 384 DTWGVWPRPDWLQTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIQRNLSTSVIAVT- 437
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ G+H LDL + DPD +V+ R E ++ W+
Sbjct: 438 -IQGGAHHLDLRASHPEDPDSVVKARILEATVIGEWV 473
>gi|62420888|ref|NP_037511.2| dipeptidyl peptidase 2 preproprotein [Homo sapiens]
gi|212276510|sp|Q9UHL4.3|DPP2_HUMAN RecName: Full=Dipeptidyl peptidase 2; AltName: Full=Dipeptidyl
aminopeptidase II; AltName: Full=Dipeptidyl peptidase 7;
AltName: Full=Dipeptidyl peptidase II; Short=DPP II;
AltName: Full=Quiescent cell proline dipeptidase; Flags:
Precursor
Length = 492
Score = 221 bits (564), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 145/463 (31%), Positives = 229/463 (49%), Gaps = 31/463 (6%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
++ + GF+ F+ Q +DHF++ +TFPQR++++ +FW G PI + G E +
Sbjct: 24 RRAPDPGFQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGEGPIFFYTGNEGDVWA 83
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
S F E A + AL+V EHRYYGKS+PFG++ H QALAD+A
Sbjct: 84 FANNSAFVAELAAERGALLVFAEHRYYGKSLPFGAQS---TQRGHTELLTVEQALADFAE 140
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+L ++ A AP IA G SYGG L+ + R+KYPH+V G+LA+SAP+L +
Sbjct: 141 LLRALRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQ 200
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELK 280
+ V+ DF S +C Q +++++ +I ++ + + +F TC PL K+ T+L
Sbjct: 201 FFRDVTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLF 259
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEYDGN 337
++ A D P P + N G D L A G+ L Y+ +
Sbjct: 260 MFARNAFTVLAMMDYP--YPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVYNAS 317
Query: 338 NSR-CY-------INEDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTK 385
S CY D TG W++Q+C+E+ + + M+ P+
Sbjct: 318 GSEHCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDEL 377
Query: 386 YIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
+ C + +GV PRP W+LT + G D+ R+ SNIIFSNG DP++ GG +
Sbjct: 378 RQRYCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSAS 432
Query: 446 DLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
++ + + G+H LDL + DP +V+ RK E I+ W+
Sbjct: 433 VIAVT--IQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 473
>gi|261278872|pdb|3JYH|A Chain A, Human Dipeptidyl Peptidase Dpp7
gi|261278873|pdb|3JYH|B Chain B, Human Dipeptidyl Peptidase Dpp7
gi|261278874|pdb|3JYH|C Chain C, Human Dipeptidyl Peptidase Dpp7
gi|261278875|pdb|3JYH|D Chain D, Human Dipeptidyl Peptidase Dpp7
gi|301015979|pdb|3N0T|A Chain A, Human Dipeptidil Peptidase Dpp7 Complexed With Inhibitor
Gsk237826a
gi|301015980|pdb|3N0T|B Chain B, Human Dipeptidil Peptidase Dpp7 Complexed With Inhibitor
Gsk237826a
gi|301015981|pdb|3N0T|C Chain C, Human Dipeptidil Peptidase Dpp7 Complexed With Inhibitor
Gsk237826a
gi|301015982|pdb|3N0T|D Chain D, Human Dipeptidil Peptidase Dpp7 Complexed With Inhibitor
Gsk237826a
Length = 469
Score = 221 bits (563), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 145/457 (31%), Positives = 226/457 (49%), Gaps = 31/457 (6%)
Query: 50 GFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSG 109
GF+ F+ Q +DHF++ +TFPQR++++ +FW G PI + G E + S
Sbjct: 7 GFQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGEGPIFFYTGNEGDVWAFANNSA 66
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
F E A + AL+V EHRYYGKS+PFG++ H QALAD+A +L ++
Sbjct: 67 FVAELAAERGALLVFAEHRYYGKSLPFGAQS---TQRGHTELLTVEQALADFAELLRALR 123
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A AP IA G SYGG L+ + R+KYPH+V G+LA+SAP+L + + V+
Sbjct: 124 RDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVT 183
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTV 286
DF S +C Q +++++ +I ++ + + +F TC PL K+ T+L
Sbjct: 184 ADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFARNA 242
Query: 287 YSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEYDGNNSR-CY 342
++ A D P P + N G D L A G+ L Y+ + S CY
Sbjct: 243 FTVLAMMDYP--YPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVYNASGSEHCY 300
Query: 343 -------INEDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCK 391
D TG W++Q+C+E+ + + M+ P+ + C
Sbjct: 301 DIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQRYCL 360
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+ +GV PRP W+LT + G D+ R+ SNIIFSNG DP++ GG + ++ +
Sbjct: 361 DTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIAVT- 414
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ G+H LDL + DP +V+ RK E I+ W+
Sbjct: 415 -IQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 450
>gi|255579501|ref|XP_002530593.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
gi|223529841|gb|EEF31773.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
Length = 327
Score = 221 bits (563), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 129/337 (38%), Positives = 181/337 (53%), Gaps = 25/337 (7%)
Query: 156 QALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYY 215
QALAD+A ++ +K A PV+ G SYGG LA W RLKYPH+ IG+LA+SAPIL +
Sbjct: 5 QALADFAVLITDLKRNLTAEDCPVVLFGGSYGGMLAAWMRLKYPHIAIGALAASAPILQF 64
Query: 216 GDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKN 275
++ P ++ +VS DF+ S CF TIK+SW+ I + K +GL LS+ F C+ L +
Sbjct: 65 ENVVPHEIFYDIVSNDFKRESSRCFNTIKESWNAIASEGLKENGLVKLSRTFHMCSDLNS 124
Query: 276 TTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD 335
T EL D L++ YS A D P P + + P G I + Y
Sbjct: 125 TDELADWLESAYSYLAMVDYP--YPAEFM------MPLPGHPI----------REVTFYF 166
Query: 336 GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQY 394
+ +I W CSEMV+PM K SM+ +N T + K C + +
Sbjct: 167 STSKLSHIVYPCLHILDFRNW----CSEMVMPMASSKYESMFPTYDFNYTSFEKQCWDDF 222
Query: 395 GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
V RP W++T +GG DIK L + SNIIFSNG+ DP+S G ++ D +
Sbjct: 223 RVVSRPRWIMTEFGGQDIKTSLEKFGSNIIFSNGLLDPWSGGSVLQNIS--DTVVALVTE 280
Query: 455 LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
G+H +DL + DPDWLV+QR TEVK+++GWI +
Sbjct: 281 EGAHHIDLRPSTPEDPDWLVEQRATEVKLIEGWINDH 317
>gi|256070800|ref|XP_002571730.1| lysosomal Pro-Xaa carboxypeptidase (S28 family) [Schistosoma
mansoni]
gi|360043145|emb|CCD78557.1| lysosomal Pro-Xaa carboxypeptidase (S28 family) [Schistosoma
mansoni]
Length = 498
Score = 219 bits (559), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 146/470 (31%), Positives = 234/470 (49%), Gaps = 32/470 (6%)
Query: 45 KDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSK-FWGGGNSPILAFLGAEAPIDD 103
K+ + +ET ++ +DHFS+ + + F RY+IN++ F GG PIL + G E PI+
Sbjct: 28 KNPDFTYETKYFWTRVDHFSFVND--EKFLIRYLINNESFTPGG--PILFYTGNEGPIET 83
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
+ SGF ++ + + A +V EHRYYG S+PFG+ + K+ RH GY + Q LADY
Sbjct: 84 FAENSGFIWKLSRELNASVVFAEHRYYGTSLPFGNN--SFKDRRHFGYLTAEQTLADYVL 141
Query: 164 ILLHIKDKYNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
++ +K Y+ +PVIA G SYGG L+ W R KYP+ + G++ASSAP+ + ++
Sbjct: 142 LINQLKANYSCFASSPVIAFGGSYGGMLSAWIRQKYPNQIAGAIASSAPVWLFPGLSDCN 201
Query: 223 TYHSVVSKDF-RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
+ ++ F + E C + I+ SWS I +I DG +L+ F CTPL + + D
Sbjct: 202 GFSMTITNSFLKYGGENCVKNIQLSWSNIVDIGQSVDGKELLTHMFNICTPLTDVQNIID 261
Query: 282 GLDTVYS--EAAQYDTPSNI-------PVKRIC-NAIENAPNCGDDILCKIAAGVVEADS 331
L Y P+++ PVK +C N E P ++ +I+ S
Sbjct: 262 YLSDFLGIISMVNYPYPASLILALPEWPVKYLCTNLSEYDPQ--QPVVTRISLLAKAVLS 319
Query: 332 LEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNC 390
L N C + GW+ Q+C EMV P ++ P W+L Y C
Sbjct: 320 LTNYTGNQTCLDISMNLPNVDVNGWDLQACMEMVTPTCASGPVNIMPPSNWDLKTYSIYC 379
Query: 391 KEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
+ ++GVSPR W + + + +NIIFSNG DP+S + + F+
Sbjct: 380 QNRFGVSPRVEWPKVEFWSKSV-----YTVTNIIFSNGEIDPWSAFSITNNSY---VPFA 431
Query: 451 QDLNL--GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAI 498
+N+ +H LDL +DP +V+ R+ E + + WI ++ +K +
Sbjct: 432 TVINMSDAAHHLDLRTPNSADPQSVVEAREIEKQKIIQWIKEWNLKYKLL 481
>gi|391347711|ref|XP_003748099.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Metaseiulus
occidentalis]
Length = 468
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 134/430 (31%), Positives = 220/430 (51%), Gaps = 32/430 (7%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSK-FWGGGNSPILAFLGAEAPIDDNIQLSGFTYE 113
++ Q +DHF + TF Q+Y+++ K F GG PI + G E ++ + + +G +
Sbjct: 30 YFEQRVDHFGF--HKRDTFRQKYLMSDKTFQAGG--PIFFYCGGEMNVELHARQTGLMFT 85
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A +F+AL+V EHRYYG+S+P+G A+ + RGY ++ QALADYA+IL H+K +
Sbjct: 86 WAREFRALVVFAEHRYYGESLPYGD--ASFYGSERRGYLSTEQALADYAAILSHLKANHT 143
Query: 174 -ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
AT + ++ GA Y G LA W R+KYPH+ + ASSAPI +Y P + V+ F
Sbjct: 144 GATKSEIVVWGAGYSGMLAVWMRVKYPHIAKLAYASSAPIGFYSGEVPCGKFLKAVTSVF 203
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK--NTTELKDGLDTVYSEA 290
R SE C Q+I++ W+ + +A+ DG+A L+ F TC P++ N + L L +
Sbjct: 204 RSESETCVQSIRRIWNVLQTMATSRDGMAHLADAFNTCQPVRGDNISNLFRWLKESFRTI 263
Query: 291 AQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC 341
+ D P PVK C+ + N D++ +A S+ Y+
Sbjct: 264 SMLDFPYETDLFGKLPAYPVKEACSKLWNHQQSDRDLM----RAAHKAVSVLYNHTGEVV 319
Query: 342 YINEDRTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRP 400
+ D T ++ GW +Q+C+E+++P+ D + M+ P+ W+L K + C ++GV P
Sbjct: 320 CYSLDNT-LRNNAGWGFQACTELIMPVCSDGMDDMFNPKSWDLKKVQRKCLNKFGVWPDD 378
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL 460
+ YGG + NI+ +N RDP+ GG + + + G+H
Sbjct: 379 QRLKRIYGG----ATGLATVDNIVVTNNQRDPWYDGGILTGTEGITVI---SIRNGAHGH 431
Query: 461 DLDEAKKSDP 470
D+ + DP
Sbjct: 432 DMRTPHEKDP 441
>gi|198425804|ref|XP_002127477.1| PREDICTED: similar to Dipeptidyl-peptidase 2 precursor
(Dipeptidyl-peptidase II) (DPP II) (Dipeptidyl
aminopeptidase II) (Quiescent cell proline dipeptidase)
(Dipeptidyl peptidase 7) [Ciona intestinalis]
Length = 494
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 154/470 (32%), Positives = 243/470 (51%), Gaps = 51/470 (10%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ T ++ Q +DHF++ T+ QRY+I+ + W G P+L + G E I SG
Sbjct: 21 YHTKYFEQFVDHFNFQSNGNATYMQRYLISDEHWVAGKGPMLFYAGNEGDIVGFKDASGL 80
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
E A + A++V EHR+YG S+PFG+ KN G + QA+ADYA +L H+K
Sbjct: 81 LTETAPKLGAMVVFAEHRFYGTSLPFGNDSFIDKNI---GLLSIEQAMADYAYLLKHLKS 137
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYY-GDITPWTTYHSVVS 229
YNA P+IA G SYGG LA + R+KYP+++ G+LA+SAPI + G+ P + SV +
Sbjct: 138 SYNADDIPIIAFGGSYGGILAAYMRIKYPNLITGALAASAPIYWTSGEGNPHGFWKSVTT 197
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT--ELKDGLDTVY 287
+E C +K+ ++E A + ++SK FKTC+ +K+++ L + +
Sbjct: 198 --IFGHNEGCVNRVKEGFAETAKYAQQGK-YDVISKGFKTCSQVKSSSLMHLYGWVRNSF 254
Query: 288 SE--AAQYDTPSN-------IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNN 338
++ A Y P+N PV C + A + G++EA SL Y+G
Sbjct: 255 TQLAMANYPYPANFFGPLPAFPVNVACEKMLKANTAIE--------GMLEATSLLYNGTG 306
Query: 339 SR-C------YIN-EDRTGDE---SDEGWEWQSCSEMVVPMGK-DKNSMYQPEPWNLTKY 386
+ C YI D TG + W++Q C+E+V+P G + M+ P+
Sbjct: 307 DKDCFDIYEEYIECSDPTGCSLGLAARSWDYQGCTEIVLPGGSTNITDMFPAIPFTPEIR 366
Query: 387 IKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
K C+ V+PR +W+ + ++KL +SNIIFSNG DP+ GG + D
Sbjct: 367 KKYCETHQRVTPRRNWLALNFWTDNLKL-----SSNIIFSNGDLDPWKDGGILH-----D 416
Query: 447 LS---FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
LS + + G+H LDL + DP +++ RK EV+I+ GWI Q+++
Sbjct: 417 LSPTVVALLVKGGAHHLDLRGSNPQDPPSVIEVRKHEVEIISGWIAQHWE 466
>gi|322792044|gb|EFZ16143.1| hypothetical protein SINV_09817 [Solenopsis invicta]
Length = 446
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 152/464 (32%), Positives = 224/464 (48%), Gaps = 51/464 (10%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFW--GGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DHFS+ ++ TF RY+IN F NSPI + G E I+ + +GF +E A
Sbjct: 2 VDHFSFAVQN--TFKLRYLINDTFTRKTEDNSPIFFYTGNEGNIEVFAENTGFIWEIAPS 59
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
F AL+V EHRYYG+S+P+G+R A + +H GY S QALADY ++ H+K + +
Sbjct: 60 FDALVVFAEHRYYGESLPYGNRSFA--DPQHLGYLTSEQALADYVDLIHHLKSQPEYKLS 117
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
PVI G SYGG L+ W RLKYPHVV G +L D + V+ D
Sbjct: 118 PVIVFGGSYGGMLSAWMRLKYPHVVQGY------VLSLDDSNTLSCLMRVIRSDNNFGLS 171
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK---NTTELKDGLDTVYSEAAQYD 294
C + ++ E+ ++ +G LS +K C PLK N T LKD L V A D
Sbjct: 172 RCESNVSAAYPEVVAHNNEHEGKKWLSDNWKLCEPLKTGENVTTLKDFLQEVLINLAMVD 231
Query: 295 TP------SNIP------------------VKRICNAIENAPNCGDDILCKIAAGVVEAD 330
P + +P V+ C + NA G +L + + +
Sbjct: 232 YPYEANFLAPLPANPINLQILKSMISIFTIVQAFCQHLTNASLMGKPLLSALYRAL--SV 289
Query: 331 SLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKN 389
Y G S I ++G ++D+GW++Q+C+EMV+P+ D N M++P WN+ Y
Sbjct: 290 YTNYTGKASCTSIKSAQSGLDADQGWDYQACTEMVMPICFDGVNDMFEPTEWNINAYNST 349
Query: 390 CKEQYGVSPRPSWVLTYYG--GHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDL 447
C ++Y +S +P + YG H SNIIFSNG+ DP++ GG V +
Sbjct: 350 CFKKYSISSQPYQICKEYGCSAH------FPGASNIIFSNGLLDPWTGGGVVANIS-AES 402
Query: 448 SFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
S + +H LDL E DP + RK ++ WI ++
Sbjct: 403 VISIVMPDAAHHLDLRETNPKDPYDVTLARKFHQFSIKKWIKEH 446
>gi|61651780|ref|NP_001013333.1| dipeptidyl peptidase 2 precursor [Danio rerio]
gi|60416102|gb|AAH90719.1| Dipeptidyl-peptidase 7 [Danio rerio]
Length = 500
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 144/471 (30%), Positives = 235/471 (49%), Gaps = 37/471 (7%)
Query: 39 SRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAE 98
+ I +D + F+ ++ Q +DHF+Y T+ QRY+I K+W G PI + G E
Sbjct: 38 NEITSHRDLSLDFKEKYFKQILDHFNYNSLGNGTYDQRYLITDKYWKKGYGPIFFYTGNE 97
Query: 99 APIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL 158
I + + SGF E A AL++ EHRYYGKS+PFG + G QAL
Sbjct: 98 GDISEFARNSGFMVELAAAQGALLIFAEHRYYGKSLPFGKNSFKIPEV---GLLTVEQAL 154
Query: 159 ADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDI 218
ADYA ++ +K++ PVI G SYGG L+ + R++YP++V G+LA+SAPIL +
Sbjct: 155 ADYAVMITELKEELGGQTCPVIVFGGSYGGMLSVYMRIRYPNIVAGALAASAPILSTAGL 214
Query: 219 TPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC-TP----- 272
+ V+ DF + C ++ ++ +++ +A + D + I S F C TP
Sbjct: 215 GDPRQFFQDVTADFEKFNPACRNAVQGAFQKLNTLAQQKDYIRIQS-AFSLCKTPSSPKD 273
Query: 273 -------LKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKI--A 323
L+N + LD YS P+ PVK C + N G D++ +
Sbjct: 274 IHQLNGFLRNAFTMMAMLDYPYSTHFMGSMPA-FPVKVACEIMLN----GTDLMSALRDT 328
Query: 324 AGVVEADSLEYDGNNSRCYINE--DRTG---DESDEGWEWQSCSEMVVPM-GKDKNSMYQ 377
G+V ++ E + E D TG + W++Q+C+E+ + + M+
Sbjct: 329 VGIVYNNTGELTCYDLYSLYVECADPTGCGLGFNSYAWDYQACTEIEMCFESNNVTDMFP 388
Query: 378 PEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
P+ + + C ++GV PRP W+ T + G+D+ + SNIIFSNG DP++ GG
Sbjct: 389 AMPFTEQQREQYCSNRWGVVPRPGWLKTQFWGNDLS-----TASNIIFSNGDLDPWANGG 443
Query: 438 WVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
K+ ++ + + G+H LDL E+ +DP+ ++ RK E +I+ W+
Sbjct: 444 IRKSLSPSLIAIT--IPEGAHHLDLRESNPADPESVIVARKKEAEIIAQWV 492
>gi|405944913|pdb|4EBB|A Chain A, Structure Of Dpp2
gi|405944914|pdb|4EBB|B Chain B, Structure Of Dpp2
Length = 472
Score = 218 bits (556), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 145/457 (31%), Positives = 226/457 (49%), Gaps = 31/457 (6%)
Query: 50 GFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSG 109
GF+ F+ Q +DHF++ +TFPQR++++ +FW G PI + G E + S
Sbjct: 4 GFQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGEGPIFFYTGNEGDVWAFANNSA 63
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
F E A + AL+V EHRYYGKS+PFG++ H QALAD+A +L ++
Sbjct: 64 FVAELAAERGALLVFAEHRYYGKSLPFGAQS---TQRGHTELLTVEQALADFAELLRALR 120
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A AP IA G SYGG L+ + R KYPH+V G+LA+SAP+L + + V+
Sbjct: 121 RDLGAQDAPAIAFGGSYGGXLSAYLRXKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVT 180
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTV 286
DF S +C Q +++++ +I ++ + + +F TC PL K+ T+L
Sbjct: 181 ADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFXFARNA 239
Query: 287 YSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEYDGNNSR-CY 342
++ A D P P + N G D L A G+ L Y+ + S CY
Sbjct: 240 FTVLAXXDYP--YPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVYNASGSEHCY 297
Query: 343 -------INEDRTG---DESDEGWEWQSCSEMVVPMGKDKNSMYQPE-PWNLTKYIKNCK 391
D TG W++Q+C+E+ + + + P+ P+ + C
Sbjct: 298 DIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDXFPDLPFTDELRQRYCL 357
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+ +GV PRP W+LT + G D+ R+ SNIIFSNG DP++ GG + ++ +
Sbjct: 358 DTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIAVT- 411
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ G+H LDL + DP +V+ RK E I+ W+
Sbjct: 412 -IQGGAHHLDLRASHPEDPASVVEARKLEATIIGEWV 447
>gi|260828789|ref|XP_002609345.1| hypothetical protein BRAFLDRAFT_128878 [Branchiostoma floridae]
gi|229294701|gb|EEN65355.1| hypothetical protein BRAFLDRAFT_128878 [Branchiostoma floridae]
Length = 489
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 139/469 (29%), Positives = 238/469 (50%), Gaps = 50/469 (10%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
++ + Q +D+F++ QT+ Q+ +++ +W PI + G E PI + SGF
Sbjct: 31 YKVRYVEQYVDNFNFPSYGQQTYMQKVLVSDAYWEKREGPIFFYTGNEGPITAFWEASGF 90
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK- 169
E A +FKAL+V EHRYYG+S+PFG++ +N G + QA+ADYA ++ ++
Sbjct: 91 VKELAAKFKALLVFAEHRYYGESLPFGNQSFTKENI---GLLSVEQAMADYARLMTALRT 147
Query: 170 --DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
D + P+I G SYGG L+ + R KYP++V G+LA+SAPI +T +
Sbjct: 148 HLDCKSPDVCPIITFGGSYGGMLSAYMRFKYPNLVAGALAASAPIYLVAGLTEGHQFFQD 207
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD------ 281
V++DFR + C ++ ++ E++ + + GL +S +F+ C+PL + +L
Sbjct: 208 VTEDFRKSDARCPLKVQSAYFEMEELGAG--GLKEISDRFQLCSPLTDKKDLSHLYGWVR 265
Query: 282 -------GLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEY 334
LD Y + P+N PV C I N+ D+L G+ +A L Y
Sbjct: 266 NSFTTLAMLDYPYPTDFEAKLPAN-PVNVACGLILNS----SDLL----KGLSQAAGLAY 316
Query: 335 DGNNS--RCY-------INEDRTG---DESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPW 381
+G + C+ D TG + W++Q+C+E+ ++ + M+ P+ +
Sbjct: 317 NGTDGTLECFDIFDEFVACADPTGCGLGDDSTAWDYQACTEVSLLESTNNVTDMFPPDNY 376
Query: 382 NLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKT 441
+ C+ +YGV+PRP W+ + G +I S+SNIIFSNG DP+ RGG +
Sbjct: 377 TAEARAEYCRTKYGVTPRPGWMGVQFWGKNI-----LSSSNIIFSNGDLDPWRRGGVLTN 431
Query: 442 YHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
++ + + G+H LDL +DP + + R+ E ++ WI++
Sbjct: 432 LSSSLVAIT--IEGGAHHLDLRSTNPADPPSVTRARQQEEALIGQWISR 478
>gi|440790409|gb|ELR11692.1| prolylcarboxypeptidase isoform 2 preproprotein [Acanthamoeba
castellanii str. Neff]
Length = 506
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 149/464 (32%), Positives = 238/464 (51%), Gaps = 43/464 (9%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW-----GGGNSPILAFLGAEAPIDDNI 105
+ T +++QT+DHF++ + T+ QR+++ ++W GG PI + G EAP+ D
Sbjct: 60 YRTLYFDQTLDHFNFATQP-ATYKQRFLLADEYWRGSYPGGCPGPIFFYTGNEAPVTDYY 118
Query: 106 QLSGF-TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
SGF T A + AL+V E S+PFGS+ + Y + QALADYA +
Sbjct: 119 SASGFFTQVLAPKHNALLVFAE------SMPFGSKSF---DPEKISYLSPEQALADYAVL 169
Query: 165 LLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGD-ITPWT 222
+ H+K+ +A + PV A G SYGG L WFR+KYP +V+G LA+SAP+ +YG I+P+
Sbjct: 170 ITHLKETLPHARNCPVFAFGGSYGGILTAWFRMKYPDIVMGGLAASAPLSFYGTGISPYA 229
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---- 278
+S S F C I +++ + ++ P+G +K FK C PL + E
Sbjct: 230 FTNS-ASDTFAQARLGCAPLIAQAFETLQRFSATPEGCERFAKAFKLCGPLNSQAEASAV 288
Query: 279 ---LKDGLDTV----YSEAAQYDTP-SNIPVKRICNAI-ENAPNCGDDILCKIAAGVVEA 329
++ GL ++ Y A+ Y PV + C+ I E A N D + A G
Sbjct: 289 VYWVEMGLASMAMLDYPFASNYGVSLPAWPVNKTCDRILEKAANNNDPDILAEALGYAIG 348
Query: 330 DSLEYDGNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIK 388
G++S CY I D E GW++ C+E+ +P+G + + +NLT I+
Sbjct: 349 VFYNNTGDHS-CYDIKRDVPEWEKCCGWDYLHCTEVYIPIG--FSGFFPHATYNLTADIE 405
Query: 389 NCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
C++++G++ RP+W YGG +I S+SNIIFSNG+ DP+ G + + D
Sbjct: 406 QCRQKFGITLRPNWARIQYGGFNIT-----SSSNIIFSNGLLDPWHSSGVLHSLS--DSL 458
Query: 449 FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
S + H LDL DP ++ + R+ E ++ W+ +Y+
Sbjct: 459 ISIMIPEAGHHLDLWAPSPEDPIYIQRAREQEAVLIDKWLKEYW 502
>gi|348535188|ref|XP_003455083.1| PREDICTED: dipeptidyl peptidase 2-like [Oreochromis niloticus]
Length = 492
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 149/467 (31%), Positives = 230/467 (49%), Gaps = 37/467 (7%)
Query: 43 EQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID 102
+ D++ F F+ QT+DHF++ TF QRY+I ++W G PI + G E I
Sbjct: 31 QHHDTDPRFTEKFFTQTLDHFNFNSMGNGTFNQRYLITDQYWEKGFGPIFFYTGNEGNIW 90
Query: 103 DNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA 162
+ SGF E A Q +AL++ EHRYYGKS+PF N QALADYA
Sbjct: 91 EFALNSGFITELAAQQRALVIFAEHRYYGKSLPFEKDSF---NIPQVSLLTVEQALADYA 147
Query: 163 SILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
++ +K + AT PVI G SYGG L+ + R+KYP++V G+LA+SAPIL +
Sbjct: 148 IMITELKQQLGATDCPVIVFGGSYGGMLSVYMRIKYPNIVAGALAASAPILSTAGLGDSR 207
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP---------- 272
+ V+ DF S +C ++ ++ ++ +A + I S F C P
Sbjct: 208 EFFRDVTSDFESVSSDCTDAVRGAFHQLKELAQSQEYHHIQS-AFALCKPLSSAQDIHQL 266
Query: 273 ---LKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKI--AAGVV 327
L+N L LD YS + P+N PVK C + G D+L + AG+V
Sbjct: 267 NGFLRNAFTLMAMLDYPYSTHFIGNMPAN-PVKVACETMLR----GSDLLGNLRDTAGIV 321
Query: 328 -EADSLEYDGNNSRCYIN-EDRTG---DESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPW 381
A + + Y+ D TG W++Q+C+E+ + + M+ P P+
Sbjct: 322 YNATGVLTCFDLYSLYLECADPTGCGLGFDSLAWDYQACTEIELCYESNNVTDMFPPMPF 381
Query: 382 NLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKT 441
C +++GV PRP W+ + G + + SNIIFSNG DP++ GG V+T
Sbjct: 382 TEEDRRLYCSKRWGVVPRPGWLNIQFWGDALS-----TASNIIFSNGDLDPWANGG-VRT 435
Query: 442 YHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
D + +++ G+H LDL + ++DP ++ RKTE ++ W+
Sbjct: 436 -SLSDSLIALNISGGAHHLDLRGSNEADPVSVISARKTEADLIALWV 481
>gi|417411236|gb|JAA52063.1| Putative dipeptidyl peptidase 2, partial [Desmodus rotundus]
Length = 502
Score = 214 bits (546), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 143/462 (30%), Positives = 227/462 (49%), Gaps = 39/462 (8%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F ++ Q +DHF++ QTFPQR++++ KFW G P+ + G E + SGF
Sbjct: 28 FRESYFEQFLDHFNFERFGNQTFPQRFLVSEKFWKRGEGPLFFYTGNEGDVWAFANNSGF 87
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
E A Q AL+V EHRYYGKS+PFG R H QALAD+A +L ++
Sbjct: 88 ILELAAQQGALVVFAEHRYYGKSLPFGERS---TQRGHVELLTVEQALADFARLLQALRR 144
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
A P +A G SYGG L+ + R+KYPH+V G+LA+SAP++ + + VS
Sbjct: 145 DLGAQDVPAVAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVLVAGLGDAYQFFRDVSL 204
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVY 287
DF +C Q ++ ++ +I ++ ++S+ F C PL K+ +L +
Sbjct: 205 DFEGQGPKCAQGVRDAFRQIKDLFLL-GAYDVVSQAFGLCRPLSGWKDLVQLFGFARNAF 263
Query: 288 SEAAQYDTP------SNIPVKRICNAIENAPNCGDDIL-CKIAAGVVEADSLEYDGNNSR 340
+ A D P ++P + A + N D I + AG+V S E
Sbjct: 264 TVLAMMDYPYPTDFLGHLPANPVQVACDRLLNESDRIEGLRALAGLVYNSSGE-----QS 318
Query: 341 CY-------INEDRTGDESD---EGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKN- 389
CY D TG S W++Q+C+E+ + + + PE T+ ++
Sbjct: 319 CYDVYLQYRACADPTGCGSGPDARAWDYQACTEINLAFSSNNLTDLFPE-LLFTEELRQQ 377
Query: 390 -CKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
C + +GV PR W+ T +GG D+K + SNIIFSNG DP++RGG L+
Sbjct: 378 YCLDTWGVWPRRDWLHTSFGGADLK-----AASNIIFSNGDLDPWARGGIQSNLSASILA 432
Query: 449 FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+ ++ G+H LDL + +DP +++ R+ E ++ W+ +
Sbjct: 433 IT--IHGGAHHLDLRASHPADPMSVLEARQMEATFIRQWVAE 472
>gi|356570877|ref|XP_003553610.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Glycine max]
Length = 349
Score = 214 bits (545), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 126/320 (39%), Positives = 177/320 (55%), Gaps = 30/320 (9%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGG--GNSPILAFLGAEAPIDDNIQLSGFTY 112
F+ Q +DHF+Y P+SYQTF QRY+IN +WGG N+PI + G E I+ Q GF +
Sbjct: 39 FFTQILDHFNYNPQSYQTFQQRYLINDTYWGGDKSNAPIFFYTGNEGDIEWFAQNPGFMF 98
Query: 113 ENAHQFKALIVILE--HRYYGKSVPFGSRKA-ALKNARHRGYFNSAQALADYASILLHIK 169
E A FKAL+V +E HRYYGKS PFG + A N+ GY +S + D +K
Sbjct: 99 ETAPYFKALLVFIEEXHRYYGKSFPFGGNEEDANANSSTLGYLSSTLLIID-------LK 151
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
+AT++PV+ G SYGG + WFR+KYPHV IG+LASSAPIL + D+ TY +++
Sbjct: 152 KNLSATYSPVVVFGGSYGGIILAWFRMKYPHVAIGALASSAPILQFLDLVSPNTYTDIIT 211
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
+D++ SE C++ IK SW +I++ A KP GL L K F+ C + L L
Sbjct: 212 QDYKSESENCYKVIKGSWKQIEDTARKPGGLEQLWKSFRICKNYISAGALVSWLQMALGI 271
Query: 290 AAQ--YDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSR 340
AA Y PSN PV+++ I+N + G++ K+ A A S+ Y+ + +
Sbjct: 272 AAMIDYPAPSNFLAHLPASPVRKVMALIDNL-SVGNEAFTKLYA----AASIFYNYSGTA 326
Query: 341 CYINEDRTGDE----SDEGW 356
D T D S GW
Sbjct: 327 ICFGLDNTTDTLERISKAGW 346
>gi|351704787|gb|EHB07706.1| Dipeptidyl-peptidase 2 [Heterocephalus glaber]
Length = 515
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 142/458 (31%), Positives = 228/458 (49%), Gaps = 37/458 (8%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F +++ Q +DHF++ +TF QR++++ KFW PI + G E + SGF
Sbjct: 39 FREYYFEQLLDHFNFESFGNKTFSQRFLVSDKFWRRSEGPIFFYTGNEGDVWGFANNSGF 98
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY---FNSAQALADYASILLH 167
E A Q + L+V EHRYYGKS+PFG++ RGY QALAD+A +L
Sbjct: 99 LVELAQQQEGLLVFAEHRYYGKSLPFGAQST------QRGYLKLLTVEQALADFAVLLQA 152
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
++ AP IA G SYGG L+ + R+KYPH+V G+LA+SAP++ + +
Sbjct: 153 LRRDLGTQDAPAIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVIAVSGLGDSYQFFRD 212
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC---TPLKNTTELKDGLD 284
V+ DF S +C Q ++ ++ +I ++ + +S++F TC + L++ T+L
Sbjct: 213 VTADFYGQSPKCAQGVRDAFQQIRDLFLQ-GAYDRISREFGTCQSLSGLEDLTQLFVFAR 271
Query: 285 TVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEYDGNNS-R 340
++ A D P P + N G D L A AG+ L Y+ + + R
Sbjct: 272 NAFTVLAMMDYP--YPTDFLVPLPANPVKAGCDRLLSEAQRIAGLRSLAGLVYNASGTER 329
Query: 341 CY---------INEDRTGDESD-EGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKN 389
CY + G SD + W++Q+C+E+ + + M+ P+ +
Sbjct: 330 CYDIYQLYRSCADPTGCGTGSDAQAWDYQACTEINLTFDSNNVTDMFPALPFTEELRQQY 389
Query: 390 CKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
C + +GV PRP W+ T +GG DIK SNIIFSNG DP++ GG + ++
Sbjct: 390 CLDTWGVWPRPDWLQTSFGGSDIK-----GASNIIFSNGDLDPWAGGGIQRNLSASVVAV 444
Query: 450 SQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGW 487
+ G+H LDL + DP +V+ R+ E +++ W
Sbjct: 445 V--IRGGAHHLDLRASHPEDPASVVEVRELEASLIREW 480
>gi|301103554|ref|XP_002900863.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
gi|262101618|gb|EEY59670.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
Length = 542
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 160/528 (30%), Positives = 252/528 (47%), Gaps = 68/528 (12%)
Query: 11 LLLMVIF----------VSTSFHANGLKLRPRLGR-IRRSRILEQKDSNHGFETFFYNQT 59
LLL +F V+T+F + L P L + + +L Q D F QT
Sbjct: 40 LLLATVFNWKGDESTAPVTTTFDQHHL---PTLNMDVVPTNLLAQCDEK------FLTQT 90
Query: 60 IDHFSYGPESYQTFPQRY-VINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQF 118
+DHF G +YQ QRY V + +F GG + ++G EA ++ + +G +ENA +F
Sbjct: 91 LDHFDVGAPTYQ---QRYFVCDKQFRPGG--VMFFYVGNEADVELYLNHTGLMWENADEF 145
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
A++V EHRY+GKSVPFG +H Y ++ QALAD+A ++ ++K ++ P
Sbjct: 146 GAMLVFAEHRYFGKSVPFGKDVT-----KHMKYLSTEQALADFAVLITYLKTEWK-LDIP 199
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL-YYGDITPWT--TYHSVVSKDFRD- 234
VI G SYGG L +W R+KYPH++ G +A SAPIL + GD P ++ +V+ D +
Sbjct: 200 VIGFGGSYGGMLGSWLRMKYPHIIDGVIAGSAPILSFLGDEVPLDKGSFERIVTFDASEE 259
Query: 235 --TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-----KNTTELKDGLDTVY 287
++ C I+++W + + DG L + C + K+ E+ D + +
Sbjct: 260 AGSAPNCVPNIRRTWPAMKKLGDTEDGRKQLKRALSLCDSVKLESRKDVDEVMDWAKSAF 319
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEAD-----------SLEYDG 336
A + P P I N + P + C A D SL
Sbjct: 320 DYMAMGNYP--YPSSYIMNGVSVLPAYPVRVACSFVADEFAPDDEVALLSAFAKSLGVYY 377
Query: 337 NNSR---CY-INEDRTGDESD-EGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNC 390
N+++ CY +N D + W++ C+E+ P D N M+ PWN T +NC
Sbjct: 378 NSTKHQECYELNAASNESALDSDFWDYIFCAEIYQPQNVDGVNDMFWSIPWNFTADNENC 437
Query: 391 KEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
K ++GV RP W T YGG ++ SNI+FSNG DP+S G ++ Y D +
Sbjct: 438 KREWGVEIRPLWATTQYGGRKA----LKAASNIVFSNGNYDPWSGTGVLQNYS--DSVVA 491
Query: 451 QDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAI 498
+ G+H LDL + + D ++ R+ E + M W ++Y+ A+
Sbjct: 492 LSVEGGAHHLDLMFSNELDTASVLAVREAEKQHMHKWAREFYERKAAV 539
>gi|313246245|emb|CBY35176.1| unnamed protein product [Oikopleura dioica]
Length = 484
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 146/477 (30%), Positives = 231/477 (48%), Gaps = 40/477 (8%)
Query: 27 LKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG 86
L L L ++ S + D TF DHFS + Q R + + +F+
Sbjct: 8 LSLTAALKPLKYSSLESYSDFCSEISTF--EAEYDHFS--TRNTQKIEIRVITDDRFYQA 63
Query: 87 GNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNA 146
G P+L + G E + + +GF + + A +V +EHRYYGKS+P KN
Sbjct: 64 GG-PVLFYTGNEGDVQLFCENTGFMRKAGKELNAKLVFMEHRYYGKSIPDD------KNL 116
Query: 147 RHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSL 206
Y ++ QALADYA L+H+K + PVIA+G SYGG LA +FR+KYP++V G++
Sbjct: 117 ----YLSAEQALADYAEYLVHLKS--SGVTGPVIAMGGSYGGMLAAYFRIKYPNLVAGAI 170
Query: 207 ASSAPILYYGDITPWTTYHSVVSKDFRDTSEE--CFQTIKKSWSEIDNIASKPDGLAILS 264
A SAP+ + + ++ V ++ F +T E C I+KSW I I + G LS
Sbjct: 171 AGSAPVKFLPGLFDCRGFYRVTTRTFTNTPSEHFCSDNIRKSWETIKLIGAHMVGKRTLS 230
Query: 265 KKFKTCTPLKNTTELKDGLDTVYSEAAQYD--TPSNI-------PVKRICNAIENAPNCG 315
+ F+TC P+ + L D L+ V+ A D P+N PV C+ +++ N
Sbjct: 231 EVFRTCEPITDVEPLLDFLEDVWGTLAMMDYPYPTNFVGDVPGWPVNVACSHLDHDINQE 290
Query: 316 DDILCKIAAGVVEADSLEYD-GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKD-KN 373
+ + + +A S+ Y+ + C D GD W +Q+C+E V P D K
Sbjct: 291 E-----LLEPLRDAASVYYNYTGDLACLDLGDEGGDLGYNNWYFQTCTEFVFPFCSDGKE 345
Query: 374 SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPF 433
M++ ++ Y NC++ +G +PR W ++ +K I IIFSNG+ DP+
Sbjct: 346 DMFRVHTYDFPTYSTNCQQTFGTTPREHWAEMFFSVETMKTI-----GGIIFSNGLLDPW 400
Query: 434 SRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
S GG + ++ L+ G+H LDL +DP+ + R + IM+ WI +
Sbjct: 401 SSGGVLTQEEAGPRNYIFILSKGAHHLDLRADNPADPEEVTLARTEYISIMKNWIAE 457
>gi|225706950|gb|ACO09321.1| Dipeptidyl-peptidase 2 precursor [Osmerus mordax]
Length = 490
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 141/459 (30%), Positives = 231/459 (50%), Gaps = 37/459 (8%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F+ ++ Q +DHF++ T+ QRY+I ++W G PI + G E I + SGF
Sbjct: 40 FKEKYFTQILDHFNFNSMGNGTYDQRYLITDQYWKRGYGPIFFYTGNEGDIWEFALNSGF 99
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
E A KAL++ EHRYYGKS+PFG ++ G QALAD+A ++ +K
Sbjct: 100 ITELAADQKALVIFAEHRYYGKSLPFGQDSFSIPEV---GLLTVEQALADFAVMITALKP 156
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ A+ PVI G SYGG L+ + RL+YP++V G+LA+SAPIL + + V+
Sbjct: 157 QLGASECPVIVFGGSYGGMLSVYMRLRYPNIVAGALAASAPILSTAGMGDSRQFFQDVTH 216
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP-------------LKNTT 277
DF + EC ++ ++ ++ +++ D + + F C P L+N
Sbjct: 217 DFESYAPECRDAVRGAFQKLQDLSEVED-YSRIQAAFSLCKPPSSQKDIHQLNGLLRNAF 275
Query: 278 ELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKI--AAGVV-EADSLEY 334
L LD YS + P+N PVK C+ + + G D+L + AG+V + +
Sbjct: 276 TLMAMLDYPYSTHFMGNMPAN-PVKVGCDIMLS----GADLLQALRDTAGIVYNSTGILK 330
Query: 335 DGNNSRCYIN-EDRTG---DESDEGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKN 389
+ Y+ D TG + W++Q+C+E+ + + M+ P P+ +
Sbjct: 331 CFDLYSLYVECADPTGCGLGFNSLAWDYQACTEIELCFESNNVTDMFPPMPFTEAHREQY 390
Query: 390 CKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
C +++GV PRP W+ T + G + S SNIIFSNG DP++ GG K+ ++
Sbjct: 391 CSKRWGVIPRPGWLKTQFWGSALS-----SASNIIFSNGDLDPWANGGVRKSLSSSLVAI 445
Query: 450 SQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+++ G+H LDL + +DP ++ RKTE + W+
Sbjct: 446 --NISEGAHHLDLRSSNDADPLSVITARKTEADTISQWV 482
>gi|226472752|emb|CAX71062.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
gi|226472756|emb|CAX71064.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 480
Score = 212 bits (539), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 150/457 (32%), Positives = 230/457 (50%), Gaps = 41/457 (8%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
++NQT+DHFS+ + TF QRY+ K W N PI + G E I +G +E
Sbjct: 40 YFNQTLDHFSFQARNL-TFKQRYLYEDK-WFKPNGPIFFYCGNEGEIGGFWNNTGLVFEL 97
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A F A I+ EHRYYGKS+PF + Y + QALADYA ++ IK+K+N
Sbjct: 98 APSFNAFILFAEHRYYGKSLPFDKSF----QQPYIQYLSIGQALADYAYLIEGIKNKFNM 153
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
T +PV+A G SYGG LA + R KYPH+V G+LA+SAP+ + + + V+KD+ D
Sbjct: 154 TRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAVTKDYHD 213
Query: 235 TSEECFQTIKKSWSEIDNIASKPD-GLAILSKKFKTCTPLKNTTE-------------LK 280
+C + IK +++ ++ KPD G LS++ + C P++N E +
Sbjct: 214 ADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWMLKWARNAFVMM 273
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI-LCKIAAGVVEADS-----LEY 334
LD Y + P N PV C +NA + D I + A GV S +Y
Sbjct: 274 AMLDYPYKASFMASLPPN-PVNVSC---KNALSAIDPIPTLREAVGVFYNSSQSLMCFDY 329
Query: 335 DGNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDKNS--MYQPEPWNLTKYIKNCK 391
C I G++S W++QSC+EM + D + M+ P + C+
Sbjct: 330 KTQFIECADITGCGLGNDS-LAWDFQSCTEMNLHDDSDSTTSDMFTSLPLTKQQVTSYCQ 388
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+++GV+P + + T++G + + ++ SNIIFSNG DP+ GG + + S
Sbjct: 389 QKWGVTPAFNQLSTFFGDY-----IWKTASNIIFSNGNLDPWMGGGILTDQS--EKVISL 441
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L+ G+H LDL +DP Q R+ EV+ ++ W+
Sbjct: 442 MLDGGAHHLDLRSPDPNDPPSARQIRQIEVQTIRSWL 478
>gi|226472754|emb|CAX71063.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 472
Score = 212 bits (539), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 150/457 (32%), Positives = 230/457 (50%), Gaps = 41/457 (8%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
++NQT+DHFS+ + TF QRY+ K W N PI + G E I +G +E
Sbjct: 32 YFNQTLDHFSFQARNL-TFKQRYLYEDK-WFKPNGPIFFYCGNEGEIGGFWNNTGLVFEL 89
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A F A I+ EHRYYGKS+PF + Y + QALADYA ++ IK+K+N
Sbjct: 90 APSFNAFILFAEHRYYGKSLPFDKSF----QQPYIQYLSIGQALADYAYLIEGIKNKFNM 145
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
T +PV+A G SYGG LA + R KYPH+V G+LA+SAP+ + + + V+KD+ D
Sbjct: 146 TRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAVTKDYHD 205
Query: 235 TSEECFQTIKKSWSEIDNIASKPD-GLAILSKKFKTCTPLKNTTE-------------LK 280
+C + IK +++ ++ KPD G LS++ + C P++N E +
Sbjct: 206 ADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWMLKWARNAFVMM 265
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI-LCKIAAGVVEADS-----LEY 334
LD Y + P N PV C +NA + D I + A GV S +Y
Sbjct: 266 AMLDYPYKASFMASLPPN-PVNVSC---KNALSAIDPIPTLREAVGVFYNSSQSLMCFDY 321
Query: 335 DGNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDKNS--MYQPEPWNLTKYIKNCK 391
C I G++S W++QSC+EM + D + M+ P + C+
Sbjct: 322 KTQFIECADITGCGLGNDS-LAWDFQSCTEMNLHDDSDSTTSDMFTSLPLTKQQVTSYCQ 380
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+++GV+P + + T++G + + ++ SNIIFSNG DP+ GG + + S
Sbjct: 381 QKWGVTPAFNQLSTFFGDY-----IWKTASNIIFSNGNLDPWMGGGILTDQS--EKVISL 433
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L+ G+H LDL +DP Q R+ EV+ ++ W+
Sbjct: 434 MLDGGAHHLDLRSPDPNDPPSARQIRQIEVQTIRSWL 470
>gi|118099097|ref|XP_415570.2| PREDICTED: dipeptidyl peptidase 2 [Gallus gallus]
Length = 495
Score = 211 bits (537), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 150/473 (31%), Positives = 229/473 (48%), Gaps = 53/473 (11%)
Query: 45 KDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN 104
+ + G E + Q DHF + +TFPQRY++++KFW G PI + G E I
Sbjct: 26 QPAGPGLEEQLFPQVRDHFRFEAGGNETFPQRYLLSAKFWKKGFGPIFFYTGNEGNIWTF 85
Query: 105 IQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
+ S F +E A Q +AL++ EHRYYGKS+PFG LKN QALADYA +
Sbjct: 86 AENSDFIFELAEQQQALVIFAEHRYYGKSLPFGLESTQLKNTH---LLTVEQALADYAVL 142
Query: 165 LLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTY 224
+ +K +Y A PVIA G SYGG L+ + R+KYP+VV G+LA+SAP+L + T +
Sbjct: 143 ITELKQQYGAAGCPVIAFGGSYGGMLSAYMRMKYPNVVDGALAASAPVLSVAGLGDPTQF 202
Query: 225 HSVVSKDFRDTSEECFQTIKKSWSEI---------DNIASKPDGLAILSKK---FKTCTP 272
V+ DF+ + C +++++ +I D I+SK + +S K ++
Sbjct: 203 FRDVTADFQKSIPGCVPAVQRAFQQIRDLFLSGAYDEISSKMATCSKISSKEDLYQLFGF 262
Query: 273 LKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSL 332
+N + +D Y P+N PVK C I D + +AA V +
Sbjct: 263 ARNAFTMIAMMDYPYKTDFMGHLPAN-PVKVGCEQIL----AHTDPIQGLAALV----GV 313
Query: 333 EYDGNNS-RCY---------INEDRTGDESD-EGWEWQSCSEMVVPMGKDK-NSMYQPEP 380
Y+ + S +CY + G +D E W++Q C+E+ + + M+ P
Sbjct: 314 FYNSSGSAQCYDVYRLYRPCADPTGCGTGADAEAWDYQVCTEINLTFNSNNVTDMFPEMP 373
Query: 381 WNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
+ + C ++ V PR W+ + G D+K S SNIIFSNG DP++ GG
Sbjct: 374 FTEAMREQYCWSRWRVRPRAQWLRINFWGGDLK-----SASNIIFSNGDLDPWAGGG--- 425
Query: 441 TYHFFDLSFSQDLNL-----GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ S S L G+H LDL +DP +++ RK E I+ W+
Sbjct: 426 ----INSSLSPSLTAVTIQGGAHHLDLRGHNPADPPSVIEARKLEASIISNWV 474
>gi|313235771|emb|CBY11221.1| unnamed protein product [Oikopleura dioica]
Length = 484
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 145/477 (30%), Positives = 230/477 (48%), Gaps = 40/477 (8%)
Query: 27 LKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG 86
L L L ++ S + D TF DHFS + Q R + + +F+
Sbjct: 8 LSLAAALKPLKYSSLESYSDFCSEISTF--EAEYDHFS--TRNTQKIEIRVITDDRFYQA 63
Query: 87 GNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNA 146
G P+L + G E + + +GF + + A +V +EHRYYGKS+P KN
Sbjct: 64 GG-PVLFYTGNEGDVQLFCENTGFMRKAGKELNAKLVFMEHRYYGKSIPDD------KNL 116
Query: 147 RHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSL 206
Y ++ QALADYA L+H+K + PVIA+G SYGG LA +FR+KYP++V G++
Sbjct: 117 ----YLSAEQALADYAEYLVHLKS--SGVTGPVIAMGGSYGGMLAAYFRIKYPNLVAGAI 170
Query: 207 ASSAPILYYGDITPWTTYHSVVSKDFRDTSEE--CFQTIKKSWSEIDNIASKPDGLAILS 264
A SAP+ + + ++ V ++ F +T C I+KSW I I + G LS
Sbjct: 171 AGSAPVKFLPGLFDCRGFYRVTTRTFTNTPSGHFCSDNIRKSWETIKLIGAHMVGKRTLS 230
Query: 265 KKFKTCTPLKNTTELKDGLDTVYSEAAQYD--TPSNI-------PVKRICNAIENAPNCG 315
+ F+TC P+ + L D L+ V+ A D P+N PV C+ +++ N
Sbjct: 231 EVFRTCDPITDVEPLLDFLEDVWGTLAMMDYPYPTNFVGDVPGWPVNVACSHLDHDINQE 290
Query: 316 DDILCKIAAGVVEADSLEYD-GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKD-KN 373
+ + + +A S+ Y+ + C D GD W +Q+C+E V P D K
Sbjct: 291 E-----LLEPLRDAASVYYNYTGDLACLDLGDEGGDLGYNNWYFQTCTEFVFPFCSDGKE 345
Query: 374 SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPF 433
M++ ++ Y NC++ +G +PR W ++ +K I IIFSNG+ DP+
Sbjct: 346 DMFRVHTYDFPSYSTNCQQTFGTTPREHWAEMFFSVETMKTI-----GGIIFSNGLLDPW 400
Query: 434 SRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
S GG + ++ L+ G+H LDL +DP+ + R + IM+ WI +
Sbjct: 401 SSGGVLTQEEAGPRNYIFILSKGAHHLDLRADNPADPEEVTLARTEYISIMKNWIAE 457
>gi|66267347|gb|AAH95721.1| Dpp7 protein [Danio rerio]
Length = 487
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 144/472 (30%), Positives = 231/472 (48%), Gaps = 43/472 (9%)
Query: 39 SRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAE 98
+ I +D + F+ ++ Q +DHF+Y T+ QRY+I K+W G PI + G E
Sbjct: 29 NEITSHRDLSLDFKEKYFKQILDHFNYNSLGNGTYDQRYLITDKYWKKGYGPIFFYTGNE 88
Query: 99 APIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL 158
I + + SGF E A AL++ EHRYYGKS+PFG + G QAL
Sbjct: 89 GDISEFARNSGFMVELAAAQGALLIFAEHRYYGKSLPFGKNSFKIPEV---GLLTVEQAL 145
Query: 159 ADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDI 218
ADYA ++ +K++ PVI G SYGG L+ + R++YP++V G+LA+SAPIL +
Sbjct: 146 ADYAVMITELKEELGGQTCPVIVFGGSYGGMLSVYMRIRYPNIVAGALAASAPILSTAGL 205
Query: 219 TPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC-TP----- 272
+ V+ DF + C ++ ++ +++ +A + D + I S F C TP
Sbjct: 206 GDPRQFFQDVTADFEKFNPACRDAVQGAFQKLNTLAQQKDYIRIQS-AFSLCKTPSSPKD 264
Query: 273 -------LKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKI--A 323
L+N + LD YS P+ PVK C + N G D++ +
Sbjct: 265 IHQLNGFLRNAFTMMAMLDYPYSTHFMGSMPA-FPVKVACEIMLN----GTDLMSALRDT 319
Query: 324 AGVVEADSLEYDGNNSRCYINE--DRTG---DESDEGWEWQ--SCSEMVVPMGKDKNSMY 376
G+V ++ E + E D TG + W++Q C E + M+
Sbjct: 320 VGIVYNNTGELTCYDLYSLYVECADPTGCGLGFNSYAWDYQIEMCFE-----SNNVTDMF 374
Query: 377 QPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRG 436
P+ + + C ++GV PRP W+ T + G+D+ + SNIIFSNG DP++ G
Sbjct: 375 PAMPFTEQQREQYCSNRWGVVPRPGWLKTQFWGNDLS-----TASNIIFSNGDLDPWANG 429
Query: 437 GWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
G K+ ++ + + G+H LDL E+ +DP+ ++ RK E +I+ W+
Sbjct: 430 GIRKSLSPSLIAIT--IPEGAHHLDLRESNPADPESVIVARKKEAEIIAQWV 479
>gi|14010871|ref|NP_114179.1| dipeptidyl peptidase 2 precursor [Rattus norvegicus]
gi|13626317|sp|Q9EPB1.1|DPP2_RAT RecName: Full=Dipeptidyl peptidase 2; AltName: Full=Dipeptidyl
aminopeptidase II; AltName: Full=Dipeptidyl peptidase 7;
AltName: Full=Dipeptidyl peptidase II; Short=DPP II;
AltName: Full=Quiescent cell proline dipeptidase; Flags:
Precursor
gi|9796394|dbj|BAB11691.1| dipeptidyl peptidase II [Rattus norvegicus]
gi|10119950|dbj|BAB13500.1| dipeptidyl peptidase II [Rattus norvegicus]
gi|51260653|gb|AAH78783.1| Dipeptidylpeptidase 7 [Rattus norvegicus]
gi|149039379|gb|EDL93599.1| dipeptidylpeptidase 7, isoform CRA_a [Rattus norvegicus]
Length = 500
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 143/458 (31%), Positives = 231/458 (50%), Gaps = 33/458 (7%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F ++ Q +DHF++ S +TF QR++++ KFW G PI + G E I SGF
Sbjct: 41 FRENYFEQYMDHFNFESFSNKTFGQRFLVSDKFWKMGEGPIFFYTGNEGDIWSLANNSGF 100
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY---FNSAQALADYASILLH 167
E A Q +AL+V EHRYYGKS+PFG + + RGY QALAD+A +L
Sbjct: 101 IVELAAQQEALLVFAEHRYYGKSLPFGVQ------STQRGYTQLLTVEQALADFAVLLQA 154
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
++ AP IA G SYGG L+ + R+KYPH+V G+LA+SAP++ + +
Sbjct: 155 LRHNLGVQDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVIAVAGLGNPDQFFRD 214
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLD 284
V+ DF S +C Q ++ ++ +I ++ + +S+ F TC L K+ T+L
Sbjct: 215 VTADFYGQSPKCAQAVRDAFQQIKDLFLQ-GAYDTISQNFGTCQSLSSPKDLTQLFGFAR 273
Query: 285 TVYSEAAQYDTP------SNIPVKRICNAIENAPNCGDDI--LCKIAAGVVEADSLEYDG 336
++ A D P +P + E + G I L +A V + +E
Sbjct: 274 NAFTVLAMMDYPYPTNFLGPLPANPVKVGCERLLSEGQRIMGLRALAGLVYNSSGMEPCF 333
Query: 337 NNSRCYIN-EDRTG---DESDEGWEWQSCSEMVVPMGKDKNSMYQPE-PWNLTKYIKNCK 391
+ + Y + D TG + W++Q+C+E+ + + + PE P++ + C
Sbjct: 334 DIYQMYQSCADPTGCGTGSNARAWDYQACTEINLTFDSNNVTDMFPEIPFSDELRQQYCL 393
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+ +GV PRP W+ T + G D+K + SNIIFSNG DP++ GG + ++ +
Sbjct: 394 DTWGVWPRPDWLQTSFWGGDLK-----AASNIIFSNGDLDPWAGGGIQRNLSTSIIAVT- 447
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
+ G+H LDL + DP +V+ RK E +++ W+
Sbjct: 448 -IQGGAHHLDLRASNSEDPPSVVEVRKLEATLIREWVA 484
>gi|226472750|emb|CAX71061.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 480
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 41/457 (8%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+++QT+DHFS+ + TF QRY+ K W N PI + G E I +G +E
Sbjct: 40 YFDQTLDHFSFQARNL-TFKQRYLYEDK-WFKPNGPIFFYCGNEGEIGGFWNNTGLVFEL 97
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A F A I+ EHRYYGKS+PF + Y + QALADYA ++ IK K+N
Sbjct: 98 APSFNAFILFAEHRYYGKSLPFDKSF----QQPYIQYLSIGQALADYAYLIEGIKSKFNM 153
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
T +PV+A G SYGG LA + R KYPH+V G+LA+SAP+ + + + V+KD+ D
Sbjct: 154 TRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAVTKDYHD 213
Query: 235 TSEECFQTIKKSWSEIDNIASKPD-GLAILSKKFKTCTPLKNTTE-------------LK 280
+C + IK +++ ++ KPD G LS++ + C P++N E +
Sbjct: 214 ADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWMLKWARNAFVMM 273
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI-LCKIAAGVVEADS-----LEY 334
LD Y + P N PV C +NA + D I + A GV S +Y
Sbjct: 274 AMLDYPYKASFMASLPPN-PVNVSC---KNALSAIDPIPTLREAVGVFYNSSQSLMCFDY 329
Query: 335 DGNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDKNS--MYQPEPWNLTKYIKNCK 391
C I G++S W++QSC+EM + D + M+ P + C+
Sbjct: 330 KTQFIECADITGCGLGNDS-LAWDFQSCTEMNLHDDSDSTTSDMFTSLPLTKQQVTSYCR 388
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+++GV+P + + T++G + + ++ SNIIFSNG DP+ GG + + S
Sbjct: 389 QKWGVTPAFNQLSTFFGDY-----IWKTASNIIFSNGNLDPWMGGGILTDQS--EKVISL 441
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L+ G+H LDL +DP Q R+ EV+ ++ W+
Sbjct: 442 MLDGGAHHLDLRSPDPNDPPSARQIRQIEVQTIRSWL 478
>gi|226472758|emb|CAX71065.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 472
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 41/457 (8%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+++QT+DHFS+ + TF QRY+ K W N PI + G E I +G +E
Sbjct: 32 YFDQTLDHFSFQARNL-TFKQRYLYEDK-WFKPNGPIFFYCGNEGEIGGFWNNTGLVFEL 89
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A F A I+ EHRYYGKS+PF + Y + QALADYA ++ IK K+N
Sbjct: 90 APSFNAFILFAEHRYYGKSLPFDKSF----QQPYIQYLSIGQALADYAYLIEGIKSKFNM 145
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
T +PV+A G SYGG LA + R KYPH+V G+LA+SAP+ + + + V+KD+ D
Sbjct: 146 TRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAVTKDYHD 205
Query: 235 TSEECFQTIKKSWSEIDNIASKPD-GLAILSKKFKTCTPLKNTTE-------------LK 280
+C + IK +++ ++ KPD G LS++ + C P++N E +
Sbjct: 206 ADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWILKWARNAFVMM 265
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI-LCKIAAGVVEADS-----LEY 334
LD Y + P N PV C +NA + D I + A GV S +Y
Sbjct: 266 AMLDYPYKASFMASLPPN-PVNVSC---KNALSAIDPISTLREAVGVFYNSSQSLMCFDY 321
Query: 335 DGNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDKNS--MYQPEPWNLTKYIKNCK 391
C I G++S W++QSC+EM + D + M+ P + C+
Sbjct: 322 KTQFVECADITGCGLGNDS-LAWDFQSCTEMNLHDDSDSTTSDMFTSLPLTKQQVTSYCQ 380
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+++GV+P + + T++G + + ++ SNIIFSNG DP+ GG + + S
Sbjct: 381 QKWGVTPAFNQLSTFFGDY-----IWKTASNIIFSNGNLDPWMGGGILTDQS--EKVISL 433
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L+ G+H LDL +DP Q R+ EV+ ++ W+
Sbjct: 434 MLDGGAHHLDLRSPDPNDPPSARQIRQIEVQTIRSWL 470
>gi|226472746|emb|CAX71059.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
gi|226472748|emb|CAX71060.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
gi|226472760|emb|CAX71066.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 480
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 41/457 (8%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+++QT+DHFS+ + TF QRY+ K W N PI + G E I +G +E
Sbjct: 40 YFDQTLDHFSFQARNL-TFKQRYLYEDK-WFKPNGPIFFYCGNEGEIGGFWNNTGLVFEL 97
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A F A I+ EHRYYGKS+PF + Y + QALADYA ++ IK K+N
Sbjct: 98 APSFNAFILFAEHRYYGKSLPFDKSF----QQPYIQYLSIGQALADYAYLIEGIKSKFNM 153
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
T +PV+A G SYGG LA + R KYPH+V G+LA+SAP+ + + + V+KD+ D
Sbjct: 154 TRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAVTKDYHD 213
Query: 235 TSEECFQTIKKSWSEIDNIASKPD-GLAILSKKFKTCTPLKNTTE-------------LK 280
+C + IK +++ ++ KPD G LS++ + C P++N E +
Sbjct: 214 ADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWMLKWARNAFVMM 273
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI-LCKIAAGVVEADS-----LEY 334
LD Y + P N PV C +NA + D I + A GV S +Y
Sbjct: 274 AMLDYPYKASFMASLPPN-PVNVSC---KNALSAIDPIPTLREAVGVFYNSSQSLMCFDY 329
Query: 335 DGNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDKNS--MYQPEPWNLTKYIKNCK 391
C I G++S W++QSC+EM + D + M+ P + C+
Sbjct: 330 KTQFIECADITGCGLGNDS-LAWDFQSCTEMNLHDDSDSTTSDMFTSLPLTKQQVTSYCQ 388
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+++GV+P + + T++G + + ++ SNIIFSNG DP+ GG + + S
Sbjct: 389 QKWGVTPAFNQLSTFFGDY-----IWKTASNIIFSNGNLDPWMGGGILTDQS--EKVISL 441
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L+ G+H LDL +DP Q R+ EV+ ++ W+
Sbjct: 442 MLDGGAHHLDLRSPDPNDPPSARQIRQIEVQTIRSWL 478
>gi|56756895|gb|AAW26619.1| SJCHGC02147 protein [Schistosoma japonicum]
Length = 472
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 149/457 (32%), Positives = 229/457 (50%), Gaps = 41/457 (8%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+++QT+DHFS+ + TF QRY+ K W N PI + G E I +G +E
Sbjct: 32 YFDQTLDHFSFQARNL-TFKQRYLYEDK-WFKPNGPIFFYCGNEGEIGGFWNNTGLVFEL 89
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A F A I+ EHRYYGKS+PF + Y + QALADYA ++ IK K+N
Sbjct: 90 APSFNAFILFAEHRYYGKSLPFDKSF----QQPYIQYLSIGQALADYAYLIEGIKSKFNM 145
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
T +PV+A G SYGG LA + R KYPH+V G+LA+SAP+ + + + V+KD+ D
Sbjct: 146 TRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAVTKDYHD 205
Query: 235 TSEECFQTIKKSWSEIDNIASKPD-GLAILSKKFKTCTPLKNTTE-------------LK 280
+C + IK +++ ++ KPD G LS++ + C P++N E +
Sbjct: 206 ADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIQNDFEFYWMLKWARNAFVMM 265
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI-LCKIAAGVVEADS-----LEY 334
LD Y + P N PV C +NA + D I + A GV S +Y
Sbjct: 266 AMLDYPYKASFMASLPPN-PVNVSC---KNALSAIDPIPTLREAVGVFYNSSQSLMCFDY 321
Query: 335 DGNNSRCY-INEDRTGDESDEGWEWQSCSEMVVPMGKDKNS--MYQPEPWNLTKYIKNCK 391
C I G++S W++QSC+EM + D + M+ P + C+
Sbjct: 322 KTQFIECADITGCGLGNDS-LAWDFQSCTEMNLHDDSDSTTSDMFTSLPLTKQQVTSYCQ 380
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+++GV+P + + T++G + + ++ SNIIFSNG DP+ GG + + S
Sbjct: 381 QKWGVTPAFNQLSTFFGDY-----IWKTASNIIFSNGNLDPWMGGGILTDQS--EKVISL 433
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L+ G+H LDL +DP Q R+ EV+ ++ W+
Sbjct: 434 MLDGGAHHLDLRSPDPNDPPSARQIRQIEVQTIRSWL 470
>gi|340508129|gb|EGR33905.1| serine carboxypeptidase s28 family protein, putative
[Ichthyophthirius multifiliis]
Length = 429
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 134/432 (31%), Positives = 226/432 (52%), Gaps = 39/432 (9%)
Query: 91 ILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHR 149
I+ + G E PI+ + +GF + + + KAL++ +EHRY+G+S PFG K +L+ ++
Sbjct: 1 IIFYCGNEGPIEMFYKNTGFVTQILSKELKALVLYMEHRYFGESQPFGDEKTSLQKGNNQ 60
Query: 150 GYFNSAQALADYASILLHIKDKYNATH--APVIAIGASYGGELATWFRLKYPHVVIGSLA 207
Y S QAL+DY L++IK P+IA+G SYGG LA W R+K+P++V SLA
Sbjct: 61 -YLTSIQALSDYVEFLIYIKKSLQCQEKECPIIAVGGSYGGMLAAWIRMKFPNLVDASLA 119
Query: 208 SSAPILYY--GDITPWTTYHSVVSKDF--RDTSEECFQTIKKSWSEIDNIASKPDGLAIL 263
+SAPI + + T Y +++ ++ RD + +Q ++ +E + I + + +
Sbjct: 120 ASAPIFQFLNRENLDQTKYFQIITNNYPCRDKIKTAYQILQNLLNEKNKILEQNNIFQQI 179
Query: 264 SKKFKTCTPLKNTTE---LKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILC 320
S+ C PLKN T+ L++ +D YS A + P +++ P + C
Sbjct: 180 SQAMGLCQPLKNNTDVLNLRNYMDNAYSYMAMTNYPQETTF------LKHLPPWPANFSC 233
Query: 321 KIAAGVVEADSL---------------EYDGNNSRCYINE-DRT-GDESDEGWEWQSCSE 363
+ + S+ ++D N+ I++ D+T D++ EGW+ SC++
Sbjct: 234 IFFQNITQQSSVFDLFSAVRNSTRTLYDFDQKNNCADISQADQTVSDDNMEGWDILSCAD 293
Query: 364 MVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSN 422
MV+PM K M+ W+L Y +NC++ YGVSP P+W L +YGG + + + + SN
Sbjct: 294 MVLPMFSNGKTDMFYNSTWDLETYKQNCRKAYGVSPNPNWALNFYGGRNDQEM--KGFSN 351
Query: 423 IIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVK 482
I FSNGM DP+S G + + F + +H DL + DP ++Q RK E+
Sbjct: 352 IFFSNGMLDPWSGGSPTQFINENLPVFY--MEQAAHHNDLRLPAQGDPQSVIQGRKLEIF 409
Query: 483 IMQGWITQYYDD 494
++ WI Y ++
Sbjct: 410 YIKKWIRFYENN 421
>gi|412993304|emb|CCO16837.1| predicted protein [Bathycoccus prasinos]
Length = 563
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 149/483 (30%), Positives = 242/483 (50%), Gaps = 55/483 (11%)
Query: 58 QTIDHFSYGPESYQT--------FPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSG 109
Q + HFS+ E + F RY + S+F+ +SPI + G EA ++ ++ +G
Sbjct: 84 QPLSHFSWNSEEEEERGGEGGGEFKTRYFVCSEFYRK-DSPIFLYTGNEANVESYLENTG 142
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
+ENA F AL+V EHRYYGKS P N + NS +ALADYAS++ ++
Sbjct: 143 LMWENAEHFNALLVFAEHRYYGKSSPMSDDDEEDTNKNTLKHLNSMEALADYASLVRELR 202
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP--------- 220
++Y A VIA G SYGG LA+W R+KYPHVV G++A+SAPI + P
Sbjct: 203 EEYEDAVA-VIAFGGSYGGMLASWMRMKYPHVVDGAIAASAPIYAFDGEDPPVDPNAFAR 261
Query: 221 WTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDG--LAILSKKFKTCTPLKNTTE 278
+TY ++VS EC + I+ +++ + + + D L +L F+ C +++ E
Sbjct: 262 GSTYTAMVSGH----GAECPKRIQDAFTLLIDSGDESDKIYLDVLKHTFRACDDIESPYE 317
Query: 279 LKDGLDTVYSEAAQYDTPSNI-------------PVKRICNAIE-NAPNCGDDILCKIAA 324
+ + + A D P P+K +CN + + PN + +
Sbjct: 318 VAEWAQSALDYIAMGDYPVESGYMLSGKGTLPAWPMKVVCNEMMVDDPNASNSTSLALLE 377
Query: 325 GVVEADSLEYDGNNS-RCYINEDRTGDE----SDEGWEWQSCSEMVVPMGKD--KNSMYQ 377
+ EA S+ Y+ + +C+ D + ++ +++ W +Q CSEM +PM +N MY
Sbjct: 378 NLREAVSIYYNATKTEQCFTIGDPSPNDDTKATEDLWGYQYCSEMFMPMETTGGENDMYW 437
Query: 378 PEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
WN T + C++ Y V PRP + YGG + ++ SNI+FSNGM DP+ G
Sbjct: 438 LSSWNETNEFRYCRDAYDVQPRPYFAQETYGG---RKMVENFASNIVFSNGMLDPWHLLG 494
Query: 438 WVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKA 497
++T + ++ G+H DL + K+DP + + R EVK +Q W+ D+ +A
Sbjct: 495 VLETSN--PRVVLVKIDEGAHHNDLMFSSKNDPTSVKRARLLEVKEIQRWV----DEARA 548
Query: 498 INK 500
+ K
Sbjct: 549 MKK 551
>gi|395844464|ref|XP_003794980.1| PREDICTED: dipeptidyl peptidase 2 [Otolemur garnettii]
Length = 558
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 149/471 (31%), Positives = 237/471 (50%), Gaps = 51/471 (10%)
Query: 46 DSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNI 105
D N F ++ Q +DHF++ +TFPQR++++ KFW PI + G E +
Sbjct: 94 DINPDFGERYFEQLLDHFNFERFGNKTFPQRFLVSDKFWDRAEGPIFFYTGNEGDVWSFA 153
Query: 106 QLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY---FNSAQALADYA 162
SGF E A Q AL+V EHRYYGKS+PFG R RGY QALAD+A
Sbjct: 154 NHSGFIVELAAQEAALLVFAEHRYYGKSLPFGKRST------QRGYMELLTVEQALADFA 207
Query: 163 SILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
+L ++ A AP IA G SYGG L+ + R+KYPH+V+G+LA+SAP++ +
Sbjct: 208 VLLQALQRDLGAQDAPTIAFGGSYGGMLSAYLRMKYPHLVVGALAASAPVIAAAGLGDPD 267
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTEL 279
+ V+ F S +C Q ++ ++ +I ++ + D +S++F TC PL K+ T+L
Sbjct: 268 QFFRDVTAAFESQSPKCAQGVRDAFQQIRDLFLQGD-YDTVSREFSTCQPLSGPKDQTQL 326
Query: 280 ----KDGLDTVYSEAAQYDT------PSNIPVKRICNAIENAPNCGDDILCKIAAGVVEA 329
++ + Y T P+N PVK CN + + + G+
Sbjct: 327 FVFTRNAFTVLAMMNYPYATDFMGHLPAN-PVKVGCNLLLSE--------AQRITGLRAL 377
Query: 330 DSLEYDGNNSR-CY-------INEDRTGDESD---EGWEWQSCSEMVVPMGKDK-NSMYQ 377
L Y+ + + CY D TG S + W++Q+C+E+ + + M+
Sbjct: 378 AGLVYNASGTEPCYDIYQLYHSCADPTGCGSGPDAKAWDYQACTEISLTFASNNVTDMFP 437
Query: 378 PEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
P+ + + C+E +GV PRP W+ T + G D+K +TSNIIFSNG DP++ GG
Sbjct: 438 VLPFTEAQRQQYCQEAWGVWPRPDWLHTNFWGGDLK-----ATSNIIFSNGDLDPWAGGG 492
Query: 438 WVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ ++ + + G+H LDL + DP +V+ R+ E ++ W+
Sbjct: 493 IQQNLSASVVAIT--IPGGAHHLDLRASHPEDPHSVVEARRLEAALIGKWV 541
>gi|157279995|ref|NP_001098513.1| dipeptidyl peptidase 2 precursor [Bos taurus]
gi|151556163|gb|AAI49046.1| DPP7 protein [Bos taurus]
Length = 488
Score = 208 bits (529), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 147/466 (31%), Positives = 236/466 (50%), Gaps = 51/466 (10%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F+ ++ Q +DHF++ +TF QR+++ KFW G PI + G E + SGF
Sbjct: 36 FQEAYFEQLLDHFNFERFGNKTFLQRFLMTEKFWNRGEGPIFFYTGNEGDVWSFANNSGF 95
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY---FNSAQALADYASILLH 167
E A Q AL+V EHRYYGKS+PFG R RGY QALAD+A +L
Sbjct: 96 ILELAEQQGALVVFAEHRYYGKSLPFGERST------WRGYTELLTVEQALADFAGLLRA 149
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
++ + A AP IA G SYGG L+ + R+KYPH+V G+LA+SAP++ + +
Sbjct: 150 LRQELEAPDAPAIAFGGSYGGMLSAYLRIKYPHLVAGALAASAPVVSAAGLGDPYQFFQD 209
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLD 284
VS DF+ S EC + ++ ++ +I ++ + ++S++F TC PL K+ T+L
Sbjct: 210 VSADFQGQSPECARAVQDAFRQIRDLFQQ-GAPHVVSQEFGTCQPLSGPKDLTQLFGFAR 268
Query: 285 TVYSEAAQYDTPSNI---------PVKRICNAI--ENAPNCGDDILCKIAAGVVEADSLE 333
++ A D P PV+ C+ + E++ G L +A V + +E
Sbjct: 269 NAFTVLAMMDYPYATDFVGHLPAHPVQVGCSRLLSESSRIAG---LRALAGLVYNSSGIE 325
Query: 334 --YD--------GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPE-PWN 382
YD + + C + D + W++Q+C+E+ + + S PE P+
Sbjct: 326 PCYDIYLQYQACADPTGCGLGSDA------KAWDYQACTEISLTFSSNNVSDLFPELPFT 379
Query: 383 LTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
+ + C + +GV PR W+ T +GG D+ + SNIIFSNG DP++RGG
Sbjct: 380 EAQRQQYCLDTWGVWPRQDWLQTSFGGGDLT-----AASNIIFSNGDLDPWARGGIQSNL 434
Query: 443 HFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L+ + ++ G+H LDL + DP +V+ R+ E ++ W+
Sbjct: 435 SASVLAIT--IHGGAHHLDLRGSHPDDPASVVEARRLEAALIGKWV 478
>gi|334312261|ref|XP_001374504.2| PREDICTED: dipeptidyl peptidase 2-like [Monodelphis domestica]
Length = 513
Score = 208 bits (529), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 154/491 (31%), Positives = 243/491 (49%), Gaps = 42/491 (8%)
Query: 26 GLKLRPRLGRIRRS----RILEQKDSNHG-------FETFFYNQTIDHFSYGPESYQTFP 74
GL+L P G R + L Q D + F+ ++ Q +DHF++ TF
Sbjct: 32 GLRLYPSRGDTRGECKLHKPLTQYDPSPAKGTPQVDFQERYFEQILDHFNFESYGSSTFL 91
Query: 75 QRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSV 134
QR+++ KFW G PI + G EA I S F E A +AL++ EHRYYGKS+
Sbjct: 92 QRFLVTEKFWKKGTGPIFFYTGNEADIWAFANNSNFILELAAVEEALVIFAEHRYYGKSL 151
Query: 135 PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWF 194
PFG + N G QALAD+A ++ +K +Y PVI G SYGG L+ +
Sbjct: 152 PFGDQSTRKGNT---GLLTVEQALADFAVLIQTLKKEYE--DVPVITFGGSYGGMLSAYM 206
Query: 195 RLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIA 254
R+KYP++V G+LA+SAP++ I + + V+ DF + S +C Q +++++ I ++
Sbjct: 207 RMKYPNLVAGALAASAPVVSIAGIGNSSQFFRDVTTDFENHSPKCAQRVREAFRMIRDLY 266
Query: 255 SKPDGLAILSKKFKTCTPLKN---TTELKDGLDTVYSEAAQYDTPSNI---------PVK 302
+ + + TCT N T+L + ++ + D P PVK
Sbjct: 267 LE-QAFDRIHQDMGTCTQPSNDSAITQLFEFARNAFTMISMMDYPYPTDFMGHFPANPVK 325
Query: 303 RICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN-EDRTG---DESDEGWEW 358
C+ + A N L +A + A E + + Y D TG S E W++
Sbjct: 326 VGCDRLLRAENPIQG-LRALAGLLYNASGTEPCFDIYQLYQKCADPTGCGTGPSAEAWDY 384
Query: 359 QSCSEMVVPMGKDKNSMYQPE-PWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILR 417
Q+C+E+ + + + PE P+ K C ++GV PR SW+LT +GG+++K
Sbjct: 385 QACTEINLTFDSNNITDMFPEIPFTSDLREKYCFARWGVQPRKSWMLTNFGGNNLK---- 440
Query: 418 RSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQR 477
+ SNIIFSNG DP++ GG +KT L S + G+H LDL + +DP+ +++ R
Sbjct: 441 -AASNIIFSNGDLDPWA-GGGIKTNLSSSL-ISLTIRGGAHHLDLRGSNPADPESVIEVR 497
Query: 478 KTEVKIMQGWI 488
K E + + W+
Sbjct: 498 KLEAEYIHEWV 508
>gi|431899040|gb|ELK07410.1| Dipeptidyl-peptidase 2 [Pteropus alecto]
Length = 521
Score = 208 bits (529), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 143/480 (29%), Positives = 233/480 (48%), Gaps = 47/480 (9%)
Query: 45 KDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN 104
+D + F ++ Q +DHF++ +TF QR++I+ KFW G P+ + G E +
Sbjct: 30 RDPDPNFREDYFEQLLDHFNFERFGNKTFLQRFLISDKFWKRGEGPLFFYTGNEGDVWFF 89
Query: 105 IQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
S F E A Q +AL+V EHRYYGKS+PFG + H QALAD+A +
Sbjct: 90 ANNSRFILELAMQQEALVVFAEHRYYGKSLPFGEQST---QRGHTELLTVEQALADFARL 146
Query: 165 LLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTY 224
L ++ + A P IA G SYGG L+ + R+KYPH+V G+LA+SAP++ ++ +
Sbjct: 147 LRSLRQDFKARDVPAIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVSVAGLSDSHQF 206
Query: 225 HSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC---TPLKNTTELKD 281
+S F + S EC Q ++ ++ +I ++ + LS++F TC T K+ +L
Sbjct: 207 FRDLSVIFENQSPECAQGVRDAFRQIKDLFLQ-GAYEELSREFGTCQLVTDWKSLAQLFG 265
Query: 282 GLDTVYSEAAQ--YDTPSNIPVKRICNAIENAP----------NCGDDILCKIAA----- 324
+ A Y P++ N ++ P G + CK
Sbjct: 266 FARNAFIMLAMLNYPYPTDFTGHLPANPVQARPPLQGTGRGPAALGQSVGCKRLLSETHR 325
Query: 325 --GVVEADSLEYDGNNSR-CY-------INEDRTG---DESDEGWEWQSCSEMVVPMGKD 371
G+ L Y+ + + CY D TG + + W++Q+C+EM + +
Sbjct: 326 IRGLQALAGLLYNSSGTEPCYNIYQQYQACADATGCGLGPNAKAWDYQACTEMNLAFSSN 385
Query: 372 KNSMYQPEPWNLTKYIKN--CKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGM 429
+ P+ T+ ++ C+E +GV PR W+LT +GG D+ R+ SNIIFSNG
Sbjct: 386 NRTDIFPD-LQFTEDLRQQYCQETWGVYPRRDWLLTNFGGADL-----RAASNIIFSNGD 439
Query: 430 RDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
DP++ GG L+ + ++ G+H LDL + DP + + RK E ++ W+T
Sbjct: 440 LDPWAGGGIRSNLSASVLAIT--IHGGAHHLDLRASHPEDPMSVREARKLEATVIHKWVT 497
>gi|302854868|ref|XP_002958938.1| hypothetical protein VOLCADRAFT_70104 [Volvox carteri f.
nagariensis]
gi|300255730|gb|EFJ40018.1| hypothetical protein VOLCADRAFT_70104 [Volvox carteri f.
nagariensis]
Length = 557
Score = 207 bits (528), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 145/456 (31%), Positives = 226/456 (49%), Gaps = 60/456 (13%)
Query: 72 TFPQRYVINSKFW-----GGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILE 126
TF QR+ + + W G + PI +LG EA + + +G +E A F+A++V E
Sbjct: 1 TFLQRFFVCANHWRRRGPDGSSGPIFFYLGNEADVTLYLNNTGLMWEGAPDFEAMLVFAE 60
Query: 127 HRYYGKSVPFGSRKAALKNAR-HRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGAS 185
HRYYG+SVP+G KN R H GY + QA+ADYA +++ IK++Y+A + VI G S
Sbjct: 61 HRYYGESVPYG------KNVRKHMGYLMAEQAMADYAELIMEIKEEYDAEGSAVIGFGGS 114
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE------EC 239
YGG LA W RLKYPH + G++A+SAPI + TP S D SE C
Sbjct: 115 YGGMLAAWMRLKYPHALDGAIAASAPIWNFLGETPPFDSGSFAKGVTYDASELAGSAPAC 174
Query: 240 FQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-----LKDGLDTVYSEAAQYD 294
++ +W + G A L+ + C+ E L++ L + A +
Sbjct: 175 IDNVRATWGLLRIYGGDEHGRAFLADALQLCSGAGPQNEDDVILLREWLAAAWDMMAMGN 234
Query: 295 TP-------------SNIPVKRICNAIENAPNCGDDIL--CKIAAGVVEADSLEYDGNNS 339
P PV+ C+ +++ +D+L +A GV + G+ S
Sbjct: 235 FPYPSSYITNGHGRLPAFPVRVACSYLDSDSFNDEDLLEAMSLAVGVF----YNHTGDVS 290
Query: 340 RCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDKNS-MYQPEPWNLTKYIKNCKEQYG 395
C+ T +SD W++Q C+EM++P KD S M+ EP++L C++ +G
Sbjct: 291 -CFDPLSGTDPDSDHDADFWDFQWCAEMLMPFSKDGASDMFWREPFDLAAATSACQQHWG 349
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
+SPRP T +GG I + SNI+FSNG+ DP+ GG ++ D+S S +
Sbjct: 350 ISPRPLRATTEWGGRRIS-----AGSNIVFSNGLLDPWHGGGVLE-----DISDSMPAVI 399
Query: 456 ---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
G+H LDL + DP +V+ R+ + + ++ WI
Sbjct: 400 IPEGAHHLDLMFSHPDDPLSVVEVRRFQREAIRDWI 435
>gi|428182777|gb|EKX51637.1| hypothetical protein GUITHDRAFT_157202 [Guillardia theta CCMP2712]
Length = 481
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 154/471 (32%), Positives = 240/471 (50%), Gaps = 55/471 (11%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP---ILAFLGAEAPIDDNIQLSGFT 111
++ Q +DHFS+ +S + QRY + + +P I + G E ++ I+ +G
Sbjct: 7 WFEQVLDHFSWRNDS--RWQQRYYVCQETEQQLANPAATIFFYCGNEGNVEMYIRNTGLM 64
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+ENA F A+++ EHRYYGKS+PFG+ +A A R Y + QALADYA +L K K
Sbjct: 65 FENAKSFSAMLIFAEHRYYGKSLPFGNDFSA---ASLR-YLSHEQALADYAVLLDDFKRK 120
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYY--GDITPWTT--YHSV 227
+ A VIA G SYGG L+ WFR+KYPH+V G++A+SAP+L + D PW + Y +
Sbjct: 121 HKMVRAKVIAFGGSYGGMLSAWFRMKYPHIVEGAVAASAPVLSFHSSDKGPWRSEKYWEI 180
Query: 228 VSKDFRDTS---EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD--- 281
V++D + E C +++SW ID++ + G L+ F+ C PL + E+ D
Sbjct: 181 VTRDASGAAGSDERCVPLVRQSWPIIDSMGASESGRESLAALFRLCEPLASPGEVNDLKL 240
Query: 282 ----GLDTVYSEAAQYDTPSNI-----------PVKRICNAIENAPNCGDDILCKIAAGV 326
DT+ Y PS+ PV+ C + +CGD + +
Sbjct: 241 FIAMAFDTM--AMGNYPFPSDYLTGGIGKLPPWPVREACKLLS---SCGDCKAENLLDSL 295
Query: 327 VEADSLEYDGNNSRCYINEDRTGDESDEG-WEWQSCSEMV---VPMGKD-KNSMYQPEPW 381
A SL Y+ + + ++ D S G W++Q C+EM+ +D K M+ P
Sbjct: 296 CSAISLLYNASGDQACLHLPE--DSSYAGIWDFQWCTEMLPQETYFKRDGKRDMFFPFSI 353
Query: 382 NLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKT 441
+ + ++CK +YGV PR W+ YGG ++ I R SNIIFSNG DP++ GG V
Sbjct: 354 SSKEIDQHCKSKYGVIPRRGWIEQLYGG--LEGIKR--ASNIIFSNGEFDPWAAGG-VNV 408
Query: 442 YHFFDLSF----SQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
D + + + G+H LDL + +DP + + R+ EV + W+
Sbjct: 409 SEVKDAAARGVEAVWIEEGAHHLDLFFSHPADPPSVKKARQMEVACISKWL 459
>gi|403352351|gb|EJY75686.1| Lysosomal Pro-X carboxypeptidase [Oxytricha trifallax]
Length = 503
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 147/479 (30%), Positives = 237/479 (49%), Gaps = 41/479 (8%)
Query: 45 KDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN 104
K +N+ FE +DHF+ G S TF +Y+ ++++W PI + G E ++
Sbjct: 28 KRTNYTFEV-----PLDHFASGGNS-PTFNIKYLADAQYWNPMEGPIFFYAGNEGKVEGF 81
Query: 105 IQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
SGF + A Q +ALI+ EHRY+G S PF + A K+ H + QA+ DY
Sbjct: 82 WDNSGFLTDVLAPQHQALIIFGEHRYFGDSFPFDKKVALDKD--HNKWLTVEQAMMDYVL 139
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDI-TPWT 222
++ I+ Y A+ PV+ G SYGG LA+W R+KYP G+ ASSAPILY+ D P +
Sbjct: 140 LIKEIRYIYGASDKPVVVFGGSYGGMLASWLRMKYPATFQGAYASSAPILYFKDSGVPQS 199
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNI-ASKPDGLAILSKKFKTCTPLKNT----- 276
+ ++++DF ++ C IK++W + +I ++P L F TCT +
Sbjct: 200 AFGDIITQDFYAANQNCPSIIKEAWGYLMDIKENRPTDYPALKTIFNTCTDITVKADVDN 259
Query: 277 --TELKDGLDTVYSEAAQYDT------PSNIPVKRICNAIEN----APNCGDDILCKIAA 324
T L +G + Y+T P+N PV C +++ AP + ++A
Sbjct: 260 LYTYLMNGFSYMAMTDYPYETSFLNPMPAN-PVNAACTKLKDIPYPAPKSEKSTVGALSA 318
Query: 325 -------GVVEADSL--EYDGNNSRCYI-NEDRTGDESDEGWEWQSCSEMVVPMGKDKNS 374
GV +A S+ Y G I N D TG GW+ +C+++ +P K+S
Sbjct: 319 RETLVLQGVQDASSVYFNYKGQTPCNDISNSDATGQLDGAGWDVLACNQLAMPTTNGKDS 378
Query: 375 MYQP-EPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPF 433
M+ +P++ + +C ++YG++PR WV +GG +I+ + T NIIF+NG DP+
Sbjct: 379 MFLVNDPFDEKAFNADCMQKYGLTPRYGWVWDTFGGQNIQKDFQAHT-NIIFTNGNLDPW 437
Query: 434 SRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
S GG S L +H L+L +DPD + + R T + W+++Y+
Sbjct: 438 SAGGVTANITGNPTINSILLEGSAHHLELRAPNDADPDDVKKVRGTISDTIGKWVSEYF 496
>gi|256084265|ref|XP_002578351.1| family S28 unassigned peptidase (S28 family) [Schistosoma mansoni]
gi|360045230|emb|CCD82778.1| family S28 unassigned peptidase (S28 family) [Schistosoma mansoni]
Length = 472
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 149/459 (32%), Positives = 226/459 (49%), Gaps = 45/459 (9%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+++QT+DHF++ + TF QRY+ K W N PI + G E ID +G +E
Sbjct: 32 YFDQTLDHFNFQARNL-TFKQRYLYEDK-WFKPNGPIFFYCGNEGGIDGFWNNTGLIFEL 89
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A F A ++ EHRYYGKS+PF + + Y + QALADYA ++ IK +N
Sbjct: 90 APSFNAFVLFAEHRYYGKSLPFNTSF----QQPYIQYLSIDQALADYAYLIEGIKSTFNM 145
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
T + V+A G SYGG LA + R KYPH++ G+LASSAP+ + + + V+KD+RD
Sbjct: 146 TRSLVVAFGGSYGGMLAAYMRAKYPHIIKGALASSAPVRWVAGEGNFHDFFESVTKDYRD 205
Query: 235 TSEECFQTIKKSWSEIDNIASKPD-GLAILSKKFKTCTPLKNTTE-------------LK 280
+C + IK +++ ++ KPD G LS + C P++N E +
Sbjct: 206 ADPKCSEKIKNAFNLAVQLSQKPDIGYKQLSNDLRLCKPIQNDFEFYWVLKWARNAFVMM 265
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI-LCKIAAGVVEADS-----LEY 334
LD Y + P+N PV C +NA D I + A GV S +Y
Sbjct: 266 AMLDYPYKASFMASLPAN-PVNVSC---KNALAVTDLIPTLREAVGVFYNSSQSLPCFDY 321
Query: 335 DGNNSRCYINEDRTG---DESDEGWEWQSCSEMVVPMGKDK--NSMYQPEPWNLTKYIKN 389
C D TG W++QSC+EM + D N M+ P +
Sbjct: 322 KTQFIEC---ADITGCGLGSDSLAWDFQSCTEMNLHDDSDSTTNDMFVSLPLTKQQVTSY 378
Query: 390 CKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
C+ ++GV+P + + T+YG + + +++SNIIFSNG DP+ GG + +
Sbjct: 379 CQRRWGVTPAFNQLSTFYGDN-----IWKTSSNIIFSNGNLDPWMGGGILTDQS--EKVI 431
Query: 450 SQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
S L+ G+H LDL +DP Q R+ EV+ ++ W+
Sbjct: 432 SLVLDGGAHHLDLRSPDPNDPPSARQVRQIEVQTIRSWL 470
>gi|312080065|ref|XP_003142442.1| hypothetical protein LOAG_06858 [Loa loa]
gi|307762394|gb|EFO21628.1| hypothetical protein LOAG_06858 [Loa loa]
Length = 509
Score = 204 bits (520), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 159/512 (31%), Positives = 247/512 (48%), Gaps = 56/512 (10%)
Query: 15 VIFVSTSFHANG--LKLRPRLGRIRRSRILEQKDSNHGF---ETFFYNQTIDHFSYGPES 69
+I+V+ NG L+L + RI L+QK + + E + N IDHFS+
Sbjct: 6 IIYVTILLIGNGNALRLYDPVTRIE----LQQKKGDKNYKWSEEWLENVPIDHFSF--HD 59
Query: 70 YQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRY 129
+TF RY+IN+ ++ N PI + G E ++ Q +G ++ A Q A++V EHR+
Sbjct: 60 NRTFRLRYLINTDYFAH-NGPIFFYTGNEGNVELFAQNTGLMWDLAPQLNAMVVFAEHRF 118
Query: 130 YGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY--NATHAPVIAIGASYG 187
YGKS PFG++ + ++ GY +S QAL D+A ++ H+K+KY A ++ VIA G SYG
Sbjct: 119 YGKSQPFGNK--SYITIQNFGYLSSEQALGDFALLINHLKNKYLSMAQNSSVIAFGGSYG 176
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDIT-PWTTYHSVVSKDFRDTSEECFQ-TIKK 245
G LA W R+KYPH+V GS+ASSAP+ ++ D++ P Y +V + F ++ C + I
Sbjct: 177 GMLAAWMRIKYPHLVEGSIASSAPVFWFIDMSVPDDAYSHIVKRSFVNSG--CIERNIIN 234
Query: 246 SWSEIDNIASKPDGLAILSKKFKT--CTPLKNTTE---LKDGLDTVYSEAAQ--YDTPSN 298
W + N++S G L++ F + LK+ T+ LK+ L+ ++ A Y PSN
Sbjct: 235 GWIALKNLSSTASGRDYLNRLFHLDKKSYLKSNTDWIMLKEYLEDIFQSMAMVNYPYPSN 294
Query: 299 I-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDE 351
PVK C N D L + G++ Y G + I+ D
Sbjct: 295 YLAKLPGWPVKVACQFFNNTNKQTDKELAQSMYGIMNL-YYNYTGQKEQFCIDPKVCKDT 353
Query: 352 SDE------GWEWQSCSEMVVPM---GKDKNSMYQPEPWNLTKYIKNCKEQYGVSP---- 398
+ E GW WQSC+EM++ + G + + P+ L C +G
Sbjct: 354 AYEALGDPIGWSWQSCTEMIMQLCSSGPPNDFFIKNCPFTLEDQESYCINAFGKLGYTKN 413
Query: 399 --RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
RP W + YG + +NIIFSNG DP+S GGW S + G
Sbjct: 414 LMRPHWSILNYGNQ------YPTATNIIFSNGYLDPWSAGGWSLKSQLIGPLISIIIKDG 467
Query: 457 SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+H DL + D + + R E ++ W+
Sbjct: 468 AHHYDLRGEHQLDTKSVKEARLLEKLCIKHWL 499
>gi|432876424|ref|XP_004073042.1| PREDICTED: dipeptidyl peptidase 2-like [Oryzias latipes]
Length = 480
Score = 204 bits (519), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 141/465 (30%), Positives = 229/465 (49%), Gaps = 43/465 (9%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
+ D F +++Q +DHF++ +TF QRY+I +FW + P+ + G E I +
Sbjct: 28 RTDGEPQFTEKYFSQVVDHFNFNSLGNRTFNQRYLITDRFWRRSSGPVFFYTGNEGDIWE 87
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
SGF E A Q +AL++ EHRYYG+S+PFG+ ++ G QALADYA
Sbjct: 88 FALNSGFIMELAAQQEALVIFAEHRYYGRSLPFGNNSFSIPEV---GLLTVEQALADYAL 144
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
++ +K + A +PVIA G SYGG L+ + RLKYP++V G+LA+SAPIL +
Sbjct: 145 MITELKLQLGAAQSPVIAFGGSYGGMLSVYMRLKYPNIVAGALAASAPILSTAGLGDPRQ 204
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP----------- 272
+ V+ DF S C + ++ ++ A + D + + + C P
Sbjct: 205 FFRDVTADFERVSPACRGAVTAAFQQLREAAERRD-YSHIQAELSLCQPPSSAQDVHQLY 263
Query: 273 --LKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEAD 330
L+N L LD YS P+N PVK G ++ V +
Sbjct: 264 GLLRNAFTLMAMLDYPYSTHFMGSLPAN-PVKT---------GSGLCVMSTKEWMVYNSS 313
Query: 331 SLEYDGNNSRCYIN-EDRTG---DESDEGWEWQSCSEMVVPMGKDKNSMYQPEP-WNLTK 385
L + Y+ D TG + W++Q+C+E + M + N++ P T+
Sbjct: 314 GLLPCFDLYSLYVECADPTGCGLGFNSLAWDYQACTE--IEMCYESNNVTDMFPAMTFTE 371
Query: 386 YIKN--CKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
+ C +++GV PRP W+ + G D+ + SNIIFSNG DP++ GG V+T
Sbjct: 372 DARQLYCSKRWGVQPRPGWLRLQFWGDDLS-----AASNIIFSNGDLDPWANGG-VRTSL 425
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L + +++ G+H LDL + +DP+ +++ RK E +++ W+
Sbjct: 426 SPSL-IAINISGGAHHLDLRGSNAADPESVIKARKMEAELIAAWV 469
>gi|167533602|ref|XP_001748480.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772999|gb|EDQ86644.1| predicted protein [Monosiga brevicollis MX1]
Length = 508
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 147/483 (30%), Positives = 232/483 (48%), Gaps = 61/483 (12%)
Query: 56 YNQTIDHFSYGP---ESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY 112
+ QTIDHF++G ++ TF QRY + K++ G+ + + G E I I +G +
Sbjct: 32 FEQTIDHFNWGAPLGQAQTTFQQRYFVYDKYYKPGSGALFVYFGNEDDITLYINHTGLMW 91
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
ENA F A ++ +EHRYYGKS PF A N + S QA+ADYA +L K +
Sbjct: 92 ENAKDFGAYLIFIEHRYYGKSQPFSPGTAGCMN-----WLTSEQAMADYAVLLRWFKATH 146
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW---TTYHSVVS 229
P I G SYGG LA WFR K+P VV G +++SAPI + ++TP + +V+
Sbjct: 147 QMEDVPTIGFGGSYGGMLAAWFRRKFPDVVDGVISASAPIWAFANLTPAYDDDGFAQIVT 206
Query: 230 KDFRDTSEECFQ---TIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
D S K+ I + AS GLA L+ F+ C PL++ + L V
Sbjct: 207 NDATPASGAAAACAANFKQGQKLIIDTASSAAGLANLTSIFRLCNPLQSLNDAYSLLYWV 266
Query: 287 -----YSEAAQYDTPSNI-----------PVKRICNAIENA--PNCGDDILCKIAAGV-- 326
Y + PS+ PV+ C ++ ++ P+ D+L + A +
Sbjct: 267 QEPWSYMAMGNFPYPSSYLLHGLGMLPAWPVRVACESLADSSLPDQPPDLLDAMRAALDI 326
Query: 327 ----VEADSLEYDGNNS----------RCYINEDRT--GDESDEG-WEWQSCSEMVVP-- 367
A++ YD +++ + ++ + T G E+ G W++Q C+EMV+P
Sbjct: 327 YYNYTHAETC-YDLSDAPETATLMRPRKAFLRQQGTLGGPEACTGDWDYQYCTEMVMPST 385
Query: 368 MGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSN 427
G DK+ + E ++L +C+ +GV PR +W TY D+ + +N++FSN
Sbjct: 386 QGTDKDMFWPLEKFDLASLTASCQSTWGVKPRQNWATTYLASKDLTDL-----TNVVFSN 440
Query: 428 GMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGW 487
G DP+ GG V+ D S + G+H +DL + DP+ + R EV M+ W
Sbjct: 441 GHYDPWRAGGVVQNVS--DSVVSIIIPSGAHHIDLMFSTPEDPEDVTVARAFEVSHMRRW 498
Query: 488 ITQ 490
+ Q
Sbjct: 499 VDQ 501
>gi|346467697|gb|AEO33693.1| hypothetical protein [Amblyomma maculatum]
Length = 348
Score = 201 bits (511), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 127/359 (35%), Positives = 197/359 (54%), Gaps = 27/359 (7%)
Query: 85 GGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALK 144
G PI + G E I +G ++ A +F+AL++ EHRYYGKS+P+G R + +
Sbjct: 1 GPPGGPIFFYTGNEGSITTFANNTGLMWDWAPEFRALLIFAEHRYYGKSMPYGDR--SFE 58
Query: 145 NARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVI 203
+ H GY Q LADYA +LL+I+ A ++ V++ G SYGG LA WFR+KYPHV
Sbjct: 59 SPSHLGYLTVEQTLADYADLLLYIRSTLPGAGNSQVVSFGGSYGGMLAAWFRMKYPHVTA 118
Query: 204 GSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAIL 263
+LA+SAPIL + ITP ++VV++ FR SE C + I+ SW I+N +S +G A +
Sbjct: 119 AALAASAPILQFQGITPCGALNAVVTRAFRMESELCAEAIRSSWELIENRSSTEEGAADI 178
Query: 264 SKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTP---------SNIPVKRICNAIENAP 312
+++F C KN T+L+D L Y+ A + P PV+ C +
Sbjct: 179 AERFHICGNYTHKNYTKLRDWLTDQYTNLAMVNYPYPNEFLNPVPGHPVREACKFLNRTF 238
Query: 313 NCGDDILCKIAAGVVEADSL--EYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGK 370
+ D +L + +A S+ Y G ++C +G +GW++QSC+EMV+PM
Sbjct: 239 DSSDVLL----ESIYQAFSVFTNYTG-QTQCNDLCKGSGTLDADGWDYQSCNEMVMPMCS 293
Query: 371 DK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNG 428
D + M+ W+L K + C++ + V+P + + +GG +I ++SNIIFSNG
Sbjct: 294 DGVDDMFYKNDWDLKKVREKCEKDFHVTPDVNKAVLIFGGRNIS-----ASSNIIFSNG 347
>gi|302832479|ref|XP_002947804.1| hypothetical protein VOLCADRAFT_41600 [Volvox carteri f.
nagariensis]
gi|300267152|gb|EFJ51337.1| hypothetical protein VOLCADRAFT_41600 [Volvox carteri f.
nagariensis]
Length = 451
Score = 201 bits (511), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 136/466 (29%), Positives = 221/466 (47%), Gaps = 50/466 (10%)
Query: 55 FYNQTIDHFSYGPESYQT-FPQRYVINSKFW----GGGNSPILAFLGAEAPIDDNIQLSG 109
++ Q +DHF + P T F QRY + KFW GG PI + G EA + + +G
Sbjct: 4 WFTQRLDHFHHHPGGNDTTFQQRYFLCDKFWSRGPGGSRGPIFFYAGNEADVTLYVNATG 63
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
+E+A +F AL++ EHRYYGK+ P G + Y + QALADY+ ++ +I
Sbjct: 64 LIWEHAEEFGALVLFAEHRYYGKTQPLGPDSWSSDPT----YLSVEQALADYSVLIWNIT 119
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW--TTYHSV 227
+PVIA G SYGG LA W RLKYPH+V G++A+SAP+ + + W + + V
Sbjct: 120 RTTGGEDSPVIAFGGSYGGMLAAWLRLKYPHLVTGAVAASAPVGAFPGVPGWQPSKFWEV 179
Query: 228 VSKDFRDTS---EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE------ 278
V+ D ++ EC ++ ++ + + G A L + + C P+++
Sbjct: 180 VTYDATASAGAVPECSSNVRAAFGHVMALGRTATGRAALGRLLRLCKPIEDEDAAVGVAY 239
Query: 279 -LKDGLDTV------YSEAAQYDTPSNI----PVKRICNAIENAPNCGDDILCKIAAGVV 327
L+ D Y + D P++ P++ C + D+ + AAGV+
Sbjct: 240 WLQGAFDAFAMGNYPYPSSYISDDPAHPLPAWPMRAACTRLAGRGLRASDLALRDAAGVL 299
Query: 328 EADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVV-----PMGKDKNSMYQPEPWN 382
+ +CY E T + W++Q C+E++ P + + P++
Sbjct: 300 YNVT-----GRVQCYTVE--TSGPAAGPWDYQWCTELMAQLPYYPTNGISDMFWDQGPFD 352
Query: 383 LTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
L ++C+ +GV PRP W YGG D R SNI+FSNG+ DP+S G +
Sbjct: 353 LEAINQHCEAMWGVRPRPFWSAITYGGLDY-----RYASNIVFSNGLYDPWSAYGVLTD- 406
Query: 443 HFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
D + + G+H LDL + SDP + R+ E+ ++ W+
Sbjct: 407 -ISDSVVAVIIPEGAHHLDLMYSHPSDPPSVRAARQIEMDYVRNWV 451
>gi|449671875|ref|XP_002154818.2| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Hydra
magnipapillata]
Length = 460
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 144/502 (28%), Positives = 240/502 (47%), Gaps = 67/502 (13%)
Query: 9 QWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPE 68
WL LM++F+ + L P+ R + K+S + + T ++ QT+DHFS+ E
Sbjct: 1 MWLRLMILFIGIGALISNKPLFPKQNR-------KAKESGYYYTTHWFPQTLDHFSFRSE 53
Query: 69 SYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHR 128
YQ F QRY+IN ++ G +P+ + G E I +GF ++ A +F A++V EHR
Sbjct: 54 DYQ-FAQRYLINDDYFKPG-APVFFYTGNEGDITWFCNNTGFMWDIAEEFSAMLVFAEHR 111
Query: 129 YYGKSVPFGSRKAALKNARH--RGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASY 186
YYG+S+PFGS +K G+ D AS + I ++Y
Sbjct: 112 YYGESMPFGSDSYKVKAFVDGGGGFIKLGIGTIDVASYF-----------SDDITTRSNY 160
Query: 187 G-GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
G LA WFR+KYP V+G+++SSAPIL + D+ Y+ CF
Sbjct: 161 SEGMLAAWFRMKYPASVVGAISSSAPILAFVDMNDCELYYLKFYN--------CFGYAVL 212
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEA------AQYDTPSNI 299
++ G +S+ FK C PLK ++ + L+ +SE Y P+N
Sbjct: 213 DYT----------GRLKISELFKLCKPLKTFDDVYN-LNNWFSEVWVNLAMVNYPYPANF 261
Query: 300 -------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDES 352
P+ +C ++N+ GD+++ + V + G +S I + +G
Sbjct: 262 LEDLPAWPINEVCKHLQNSNATGDELIRNLVNAV--NVYFNFTGQSSCLNIEQQASGSLG 319
Query: 353 DEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHD 411
D+GW++Q+C+EM +P+ +D M+ P ++ + +CK+++GV+ R W + YGG +
Sbjct: 320 DQGWDFQACTEMAMPLCQDGIRDMWLPYKYDFDDFATSCKQKWGVTTRKYWSQSQYGGFN 379
Query: 412 IKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH-FFDLSFSQDLNLGSHCLDLDEAKKSDP 470
+ SNI+FSNG DP+S G +K+ + D G+H LDL ++ DP
Sbjct: 380 L-----NGASNIVFSNGKLDPWSGYGVLKSQSPTIKVVMIDD---GAHHLDLRKSNSLDP 431
Query: 471 DWLVQQRKTEVKIMQGWITQYY 492
++ R + WI +Y+
Sbjct: 432 QSVIDARNIHKSNIHSWINEYH 453
>gi|348676451|gb|EGZ16269.1| hypothetical protein PHYSODRAFT_560767 [Phytophthora sojae]
Length = 574
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 151/484 (31%), Positives = 231/484 (47%), Gaps = 59/484 (12%)
Query: 54 FFYNQTIDHFSY-------------GPESY-QTFPQRYVINSKFWGGGN--SPILAFLGA 97
++ Q IDHFS+ P T+ QRY++N++FW + +P+ + G
Sbjct: 100 LWFEQRIDHFSWLAAEALDPSNAGAAPSGLPATYKQRYLLNTQFWDPKDKKAPVFFYTGN 159
Query: 98 EAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQA 157
E + +G +ENA FKAL+V EHRYYGKS PFG K H Y QA
Sbjct: 160 EGDVTLYANHTGLIWENAKAFKALVVFAEHRYYGKSFPFGD-----KYMDHLAYVTHDQA 214
Query: 158 LADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGD 217
LADY ++ H++ KY+A + PVIA G SYGG L+ WFR+KYP+++ G++A+SAPI +G
Sbjct: 215 LADYTELIYHLQKKYDAFNHPVIAFGGSYGGMLSAWFRMKYPNIIAGAIAASAPIYGFGG 274
Query: 218 ITPW--TTYHSVVSKDFR---DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP 272
+ Y VV++D ++ C KKSW++I +A DG A LS F+ CTP
Sbjct: 275 FPAFDGQKYWQVVTRDASPAAGSAANCVPNAKKSWAQIFELAKTEDGRATLSSLFRLCTP 334
Query: 273 LKNTTELKDGLDTV---YSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEA 329
L + + +D +V + A D P P + + P C AG
Sbjct: 335 LASEEQGEDLAMSVLFAFDTLAMGDFP--YPSSYLTGGAVDLPAWPVRQACSHLAGEFPT 392
Query: 330 DSLEYDG----------NNSRCYINEDR--------TGDESDEGWEWQSCSEMVVP---- 367
SL DG N + + N + T + D W++Q C+EM+
Sbjct: 393 PSLRKDGVDTTLLEALRNAANVFHNATKDLACFKIPTLWDYDGIWDYQYCTEMLPQETYF 452
Query: 368 MGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSN 427
+ M+ P + +C+ + +P + YG + RS SNI+FSN
Sbjct: 453 STNGETDMFWPRNTTFEEIRAHCQRDWHTTPDQDGIRVSYGDE-----MLRSASNIVFSN 507
Query: 428 GMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGW 487
G+ DP+S G + ++ + + G+H LDL + DP ++ RKTEVK++ W
Sbjct: 508 GLLDPWSSAGVLHAPKDAKVTIVE-IAEGAHHLDLFFSHPKDPPSVIAARKTEVKMIHQW 566
Query: 488 ITQY 491
I ++
Sbjct: 567 IDEF 570
>gi|391339066|ref|XP_003743874.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Metaseiulus
occidentalis]
Length = 476
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 136/455 (29%), Positives = 223/455 (49%), Gaps = 34/455 (7%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN-SPILAFLGAEAPIDDNIQLSGF 110
ET ++N IDHF Y + TFP R + N++++ PI + G E I + +G
Sbjct: 39 ETTWFNVPIDHFGY--YNNNTFPLRVLYNNEYFNHTKPGPIFLYAGNEGDIALFVYNTGL 96
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A +F AL+V EHRYYGKS+P+G + +LK+ + GY QALAD+A ++ IK+
Sbjct: 97 LWDWAEEFGALLVFAEHRYYGKSMPYG--RDSLKDVSYYGYLTVDQALADFAHVISEIKE 154
Query: 171 KYNATH-APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
+ + V+A G SY G LA W R+KYP +V +L+S API Y + ++ V+
Sbjct: 155 TWPGVQKSKVVAFGGSYAGMLAAWLRMKYPWLVEAALSSGAPIRLYQGLVGCNAFNDGVA 214
Query: 230 KDF-RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP--LKNTTELKDGLDTV 286
+ F + ++C I+KSW ++ +G + KKF C +++D +
Sbjct: 215 RAFLAEGGKKCVNNIRKSWKALERFKESEEGTNFIFKKFHVCQKNIASGFAQVRDWIYGS 274
Query: 287 YSEAAQYDTP--------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNN 338
Y A ++ P S P++ C ++ +D+L I D++ N+
Sbjct: 275 YVNLAMHNYPYGLETRRLSTYPIRLACAFLQKDFQKDEDLLSAI------YDAVNVYHNH 328
Query: 339 S---RCYINEDRTGDESDEGWEWQSCSEMVVP-MGKDKNSMYQPEPWNLTKYIKNCKEQY 394
S C +D G+ W+ Q+C+E+V+P K+ + P W L CK +Y
Sbjct: 329 SGVVHCNNVDDVYGEHIGSAWQVQNCNELVLPYCANGKSDISYPFSWKLDGIKAYCKRRY 388
Query: 395 GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
G++P PS V T + G I + SNI+FSNG DP+S GG +K+ + +
Sbjct: 389 GMTPNPSRVRTMFAGDKISV-----ASNIVFSNGDLDPWSSGGILKS--LGPTLPAIIVR 441
Query: 455 LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
G+H DL D + + + R T ++ W+
Sbjct: 442 GGAHHYDLKGDHPDDTEEVRKARNTAKNYIKTWLA 476
>gi|242096524|ref|XP_002438752.1| hypothetical protein SORBIDRAFT_10g025540 [Sorghum bicolor]
gi|241916975|gb|EER90119.1| hypothetical protein SORBIDRAFT_10g025540 [Sorghum bicolor]
Length = 306
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 97/227 (42%), Positives = 139/227 (61%), Gaps = 12/227 (5%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW-------GGGNSPILAFLGAEAPIDD 103
F ++ Q +DHF++ P + F Q+Y++N FW GG P+ F G E I+
Sbjct: 84 FTEHYFPQELDHFTFRPNASTVFYQKYLVNDTFWRRSSGRKGGSTGPLFVFTGGETDIES 143
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
+GF ++ A +F AL+V +EHR+YG+S+PF S + GY S QALAD+A
Sbjct: 144 IAINAGFMFDIAPKFGALLVFIEHRFYGESMPFRS-----NSTEALGYLTSTQALADFAI 198
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
++ +K +A APV+ G SYGG LA+WFRLKYPHV IG+LASSAPIL + ITPW++
Sbjct: 199 LITSLKQNLSAETAPVVVFGGSYGGMLASWFRLKYPHVTIGALASSAPILQFDYITPWSS 258
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
++ VVS+D++ S CF IK +W ++ S +GL LSK F+ C
Sbjct: 259 FYDVVSQDYKSESLNCFSVIKAAWDVLEERGSNDNGLLELSKLFRAC 305
>gi|47206852|emb|CAF90612.1| unnamed protein product [Tetraodon nigroviridis]
Length = 463
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 140/449 (31%), Positives = 220/449 (48%), Gaps = 34/449 (7%)
Query: 12 LLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQ 71
L++ + + + G PRL + R L+ F ++ QT+DHF++
Sbjct: 5 LVLGVLALSGWCVRGSGPPPRLLQAARPPGLQGAAL---FTESYFTQTLDHFNFNSYGNG 61
Query: 72 TFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYG 131
TF QRY++ ++W G+ P+ + G E I D SGF E A Q AL+V EHRYYG
Sbjct: 62 TFRQRYLVADRYWRRGHGPLFFYTGNEGDIWDFALNSGFITELAAQQGALVVFAEHRYYG 121
Query: 132 KSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELA 191
KS+PFG + G QALADYA ++ ++++ AT PVI G SYGG L+
Sbjct: 122 KSLPFGDASFQVPEV---GLLTVEQALADYALLISQLREQLAATRCPVIVFGGSYGGMLS 178
Query: 192 TWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEID 251
+ RL+YP++V G+LA+SAP+L + T + V+ DF+ +C ++ ++ ++
Sbjct: 179 VYMRLRYPNLVAGALAASAPVLSTAGLGEPTQFFRDVTADFQSVEPQCTGAVRGAFQQLR 238
Query: 252 NIASKPDGLAILSKKFKTC----TP---------LKNTTELKDGLDTVYSEAAQYDTPSN 298
+A D AI KK C +P L+N L LD YS P+N
Sbjct: 239 ELAEDQDYGAI-QKKLSLCQRPSSPQDVSQLYGLLRNAFTLMAMLDYPYSTHFMGSLPAN 297
Query: 299 IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS-RCYIN-EDRTG---DESD 353
PVK C + A ++ + AAG+V S + + R Y+ D TG +
Sbjct: 298 -PVKVACQTMLRASELLTNL--RDAAGLVYNASGQLGCLDLYRLYVQCADPTGCGLGPNS 354
Query: 354 EGWEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDI 412
W++Q+C+E+ + + M+ P + + C +++ V PRP W+ T + G +
Sbjct: 355 WAWDYQACTEVDLCFHSNNVTDMFPPMSFGEEQRRAYCSQRWSVLPRPRWLRTQFWGDAL 414
Query: 413 KLILRRSTSNIIFSNGMRDPFSRGGWVKT 441
+ SNIIFSNG DP++ GG K+
Sbjct: 415 S-----TASNIIFSNGDLDPWANGGVRKS 438
>gi|449666912|ref|XP_004206448.1| PREDICTED: dipeptidyl peptidase 2-like [Hydra magnipapillata]
Length = 478
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 143/467 (30%), Positives = 231/467 (49%), Gaps = 52/467 (11%)
Query: 55 FYNQTIDHFSYGPESYQT--FPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY 112
++ Q IDHF++ ++ F QRY+I+ K+W G P+L + G E I++ + +GF +
Sbjct: 26 YFVQFIDHFNFLGQAGANGQFKQRYLISDKYWSKGKGPVLFYTGNEGSIENFWENTGFVF 85
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
E A + K L++ EHRYYGKS+PFG+ N G+ QALAD+A+++ H+K
Sbjct: 86 ELAQKLKGLVIFGEHRYYGKSLPFGNDSFTPANI---GFLTIDQALADFAALIQHLKKSM 142
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
A + V A G SYGG L + R KYPH+V G +ASSAP L P + + V++ F
Sbjct: 143 GADNCSVFAFGGSYGGMLTAYMRYKYPHIVDGGVASSAPFLTIAGKRPRSEFFQTVTETF 202
Query: 233 RDTSEECFQTIKKSWSEI-DNIASKPDGLAILSKKFKTCT-----PL---------KNTT 277
R C +++ +++++ D S +GL L K F C P +N
Sbjct: 203 RKADSNCPSSVQIAFTQLMDLFNSGKEGLQQLEKVFSLCEGQMTRPFLEKQMIAWARNAF 262
Query: 278 ELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGN 337
L +D Y D P + PV+ C+ ++ +D L AG+ + L Y G
Sbjct: 263 TLLSMVDYPYPAKFMADLPGH-PVELACSYMQV-----EDKL----AGLAKITDLLY-GK 311
Query: 338 NSRC------YIN-EDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKY 386
+ C Y++ D TG + W++Q+C+EM++P G + M+ + L
Sbjct: 312 PANCHNLYEEYVSCSDPTGCGTGPDNPPWDYQACTEMILPGGSNNITDMFPHLDFTLEMR 371
Query: 387 IKNCKEQYGVS-PRPSWVLTYYGG--HDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
C +++G+ R +W+ T Y G +DIK S IIF NG DP+ GG ++
Sbjct: 372 QHYCSKRWGLGYSRLNWLATQYWGSLNDIK-----KASRIIFPNGDLDPWHTGGVLEDLS 426
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
D + + G+H LDL + +DP ++ R +I+ GW+ +
Sbjct: 427 --DSLIAIMVEGGAHHLDLRGSNPADPKSVIDARNKITEIITGWMHE 471
>gi|348680585|gb|EGZ20401.1| hypothetical protein PHYSODRAFT_558788 [Phytophthora sojae]
Length = 566
Score = 198 bits (503), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 150/482 (31%), Positives = 234/482 (48%), Gaps = 60/482 (12%)
Query: 55 FYNQTIDHF-SYGPESYQTFPQRYVINS-KFWGGGNSPILAFLGAEAPIDDNIQLSGFTY 112
F Q +DHF + G S +F QRY + S + + N I ++G EA + + +G +
Sbjct: 90 FITQELDHFRANGGSSEGSFEQRYFVCSPESFDPSNGSIFFYVGNEADVTLYLNHTGLMW 149
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
ENA F ALIV EHRY+GKSVPFG L H + ++ QALADYA +L+ K
Sbjct: 150 ENAVAFNALIVFAEHRYFGKSVPFG-----LDVLEHMEFLSTQQALADYA-VLIEALKKQ 203
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL-YYGDIT-PWTT--YHSVV 228
PVI G SYGG L TWFR+KYPH++ G +A+SAP++ + GD P T ++ VV
Sbjct: 204 LGVDVPVIGFGGSYGGMLGTWFRMKYPHIIDGVIAASAPVVNFLGDPDHPADTEAFNRVV 263
Query: 229 SKDFRD---TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT 285
+ D + + C ++++ + DG L++ C ++ D + +
Sbjct: 264 TFDMSEEAGAAPNCIPNLRRALTAAIESVQTQDGRKQLAELLHLCD--TDSLSTSDDVVS 321
Query: 286 VYSEA------AQYDTPSN-----------IPVKRICNAIENAPNCGDDILCKIAAGVVE 328
+ +EA Y P++ P++ C + A + +D + E
Sbjct: 322 IAAEAYGDLAMGNYPYPTSYIMDGNVDLPGYPMRAACEPL--AGDFAEDDDLGLINAFRE 379
Query: 329 ADSLEYDGNNSR-CY--------INEDRTGDESDEG--------WEWQSCSEMVVPMGKD 371
+ + Y+ S C+ +NE T DE+ + W + CSE+ +PM D
Sbjct: 380 SIDVYYNATKSESCFFPPAPKKTVNESATSDEAKQAKIDQKGNFWGYLECSELYMPMSSD 439
Query: 372 KNS-MYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMR 430
S +Y P N +K +C EQ+GV +P+W T YGG + R+TSNI+FSNG
Sbjct: 440 GVSDIYPAVPVNQSKDDADCFEQWGVHLKPNWAQTEYGG----MKALRATSNIVFSNGNF 495
Query: 431 DPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
DP+S G + + G+H LDL + DP + + R+TE+K ++ WI +
Sbjct: 496 DPWS--GLDVLQSLSPSVVAVPVPGGAHHLDLFFSHPLDPPAVTEARQTELKYIRQWIDE 553
Query: 491 YY 492
+Y
Sbjct: 554 FY 555
>gi|301096651|ref|XP_002897422.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
gi|262107113|gb|EEY65165.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
Length = 568
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 147/481 (30%), Positives = 233/481 (48%), Gaps = 61/481 (12%)
Query: 58 QTIDHFSYGPESY--------------QTFPQRYVINSKFWGGGN--SPILAFLGAEAPI 101
Q IDHFS+ P T+ QRY++N++FW + +P+ + G E +
Sbjct: 98 QRIDHFSWLPAEAVDAADPNAAPSGLPATYKQRYLLNTQFWDPSDKKAPVFFYTGNEGDV 157
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
+G +ENA FKAL+V EHRYYGKS PFG K H GY QALADY
Sbjct: 158 TLYANHTGLIWENAQTFKALVVFAEHRYYGKSFPFGD-----KYMDHLGYLTHDQALADY 212
Query: 162 ASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
A ++ H++ KY+A + PVIA G SYGG L+ WFR+KYP ++ G++A+SAPI +G +
Sbjct: 213 AELIYHVQKKYDALNHPVIAFGGSYGGMLSAWFRMKYPSIIAGAIAASAPIYGFGGFPAF 272
Query: 222 --TTYHSVVSKDFR---DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNT 276
Y VV++D ++ C +K+W +I +A +G + LS F+ C PL T
Sbjct: 273 DGQKYWQVVTRDASPAAGAAKNCVPNARKAWPQIFELAQTENGRSTLSSIFRLCEPL--T 330
Query: 277 TELKDGLDTVYSEAAQYDTPS----NIPVKRICNAIENAPNCGDDILCKIAAGVVEADSL 332
TE + G D S +DT + P + + P C AG A +L
Sbjct: 331 TE-QQGEDLAMSVLFAFDTLAMGNFPYPSSYLTGGAVDLPAWPVREACSHLAGDFPASTL 389
Query: 333 EYDGNNSRCY--------INEDRTGD----------ESDEGWEWQSCSEMVVP----MGK 370
+ +++ + + TGD + D W++Q C+EM+
Sbjct: 390 RQENVDTKLLEALRDAANVFHNATGDLTCFKIPTLWDYDGIWDYQYCTEMLPQETYFSTN 449
Query: 371 DKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMR 430
+ M+ + +C+ + +P P+ + YG + RS SNI+FSNG+
Sbjct: 450 GETDMFWSRNTTFEEIRAHCQRDWHTTPDPNGIRVSYGDD-----MLRSASNIVFSNGLL 504
Query: 431 DPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
DP+S G + ++ + + G+H LDL + DP ++ RKTE++++Q W+ +
Sbjct: 505 DPWSSAGVLHAPQDAKVTIVE-IAEGAHHLDLFFSHPKDPPSVIAARKTEIRMIQKWVDE 563
Query: 491 Y 491
+
Sbjct: 564 F 564
>gi|348686510|gb|EGZ26325.1| hypothetical protein PHYSODRAFT_327243 [Phytophthora sojae]
Length = 543
Score = 197 bits (500), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 140/493 (28%), Positives = 242/493 (49%), Gaps = 52/493 (10%)
Query: 36 IRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFL 95
+ S +L Q D F+ QT+DHF G +Y QRY + + + G + ++
Sbjct: 69 VAPSNLLAQCDEK------FFTQTLDHFDVGAPTYL---QRYFVCDRHFRPGGV-MFFYV 118
Query: 96 GAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSA 155
G EA ++ + +G +ENA +F A++V EHRY+GKSVPFG +H Y ++
Sbjct: 119 GNEADVELYLNHTGLMWENADEFGAMLVFAEHRYFGKSVPFGRNVT-----KHMRYLSTE 173
Query: 156 QALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL-Y 214
QALADYA ++ +K+++ PVI G SYGG L +WFR+KYPH++ G +A+SAPIL Y
Sbjct: 174 QALADYAVLITRLKEEWQ-RDIPVIGFGGSYGGMLGSWFRMKYPHIIDGVIAASAPILSY 232
Query: 215 YGDITP--WTTYHSVVSKDFR---DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKT 269
+GD Y V + D +++ C ++++W + G L +
Sbjct: 233 FGDEVAHDLRGYSQVTTFDASPAAGSAQNCVPNVRRAWPTMRAFGKTTSGRRKLKEALVL 292
Query: 270 C--TPLKNTTEL-------KDGLDTV----YSEAAQY------DTPSNIPVKRICNAIEN 310
C TPL + KD D++ Y A+ Y + P+ P++ C+ +++
Sbjct: 293 CEDTPLDTDEAIDAVMQWAKDSFDSMAMGNYPYASSYIMNGVSELPA-YPMRVACSHLQD 351
Query: 311 APNCGDDILCKIAAGVVEADSLEYDGNNSR-CYINEDRTGDESDEG--WEWQSCSEMVVP 367
A + +D K+ + + Y+ + C+ + + +++ + W++ C+E+ P
Sbjct: 352 AFDETEDGDFKLLRAFAKTIGVYYNSTKDKECFQLKAPSAEDAVDSDFWDYIYCAELYGP 411
Query: 368 MGKDKNS-MYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFS 426
D + M+ PWN T +C ++G+ R +W ++GG + SNI+FS
Sbjct: 412 TTTDGVADMFWYAPWNYTADNASCHAEWGIDARIAWPTIHFGGRRFLEV----ASNIVFS 467
Query: 427 NGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQG 486
NG DP S G ++ Y D + ++ G+H LDL + DP+ + R E + M+
Sbjct: 468 NGNYDPCSATGVLQNYS--DSVVAVLIDGGAHHLDLMFSNPLDPEPVKAARAAEKQHMKR 525
Query: 487 WITQYYDDFKAIN 499
W ++Y A++
Sbjct: 526 WADEFYAHKAALD 538
>gi|238006194|gb|ACR34132.1| unknown [Zea mays]
gi|413943526|gb|AFW76175.1| putative serine peptidase S28 family protein [Zea mays]
Length = 372
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 105/274 (38%), Positives = 155/274 (56%), Gaps = 16/274 (5%)
Query: 47 SNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW--------GGGNSPILAFLGAE 98
S F ++ Q +DHF++ P + F +Y++N FW G P+ + G E
Sbjct: 80 STKPFTAHYFPQLLDHFAFTPNASTVFRHKYLLNDTFWRRPGAGAGDDGPGPLFVYTGNE 139
Query: 99 APIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL 158
I+ +GF ++ A F AL+V +EHR+YG+S PFG+ + ++A GY S QAL
Sbjct: 140 GDIEWFATNTGFMFDIAPTFGALLVFIEHRFYGESKPFGND--SYRSAETLGYLTSTQAL 197
Query: 159 ADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDI 218
AD+A ++ +K A APV+ G SYGG LA+WFRLKYPHV IG+LASSAPIL + I
Sbjct: 198 ADFAVVIRGLKRDLGAEAAPVVVFGGSYGGMLASWFRLKYPHVAIGALASSAPILQFDHI 257
Query: 219 TPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE 278
TPW++++ VS+DF+ S CF I+ +W +D + GL LSK F+ C +K
Sbjct: 258 TPWSSFYDAVSQDFKSESSNCFGVIRAAWDVLDERGATDKGLLDLSKLFRACKTVKYAYS 317
Query: 279 LKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAP 312
+++ L T ++ A D P+ N +EN P
Sbjct: 318 IRNWLWTAFTYTAMVDYPTP------ANFLENLP 345
>gi|395506514|ref|XP_003757577.1| PREDICTED: dipeptidyl peptidase 2 [Sarcophilus harrisii]
Length = 465
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 136/463 (29%), Positives = 225/463 (48%), Gaps = 45/463 (9%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F+ ++ Q +DHF++ TF QR+++ KFW G PI + G EA + F
Sbjct: 18 FQERYFEQILDHFNFESYGNNTFLQRFLVTEKFWKKGTGPIFFYTGNEADVWAFASNCDF 77
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
E A +AL++ EHRYYGKS+PFG + N QALAD+A ++ ++
Sbjct: 78 ILELASAEEALVIFAEHRYYGKSLPFGVQSTRKGNT---SLLTVEQALADFAVLIQALQK 134
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+Y A + PVI G SYGG L+ + R+KYP++V G+LA+SAP+L I + + V+
Sbjct: 135 EYKAENVPVITFGGSYGGMLSAYMRMKYPNLVAGALAASAPVLSIAGIGDSSQFFRDVTA 194
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP-------------LKNTT 277
DF + S +C Q +++++ I ++ + +S++ TCT +N
Sbjct: 195 DFENYSPKCVQGVREAFRLIKDLYLQ-RAFDKISQEMGTCTQPSSDSAITQLFEFARNAF 253
Query: 278 ELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGN 337
+ +D Y P+N PVK C+ + +A N G+ + L Y+ +
Sbjct: 254 TMITMMDYPYPTDFMGHFPAN-PVKVGCDRLLSAKN--------QIQGLRDLVGLFYNTS 304
Query: 338 NSR-CY-------INEDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTK 385
+ C+ D TG S E W++Q+C+E+ + + M+ P+
Sbjct: 305 GTEPCFDIYKLYHKCADPTGCGTGPSAEAWDYQACTEINLTFDSNNVTDMFPKIPFTDEL 364
Query: 386 YIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
K C ++GV PR SW+ T + G ++K + SNIIFSNG DP++ GG
Sbjct: 365 REKYCFTRWGVRPRKSWMQTNFWGKNLK-----AASNIIFSNGDLDPWAGGGIRSNLSSS 419
Query: 446 DLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
++ + + G+H LDL + DP +++ RK E + W+
Sbjct: 420 LIALT--IQGGAHHLDLRGSNPEDPVSVIEVRKLEAAYIHEWV 460
>gi|324507364|gb|ADY43126.1| Prolyl carboxy peptidase like protein 5 [Ascaris suum]
Length = 534
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 154/524 (29%), Positives = 245/524 (46%), Gaps = 58/524 (11%)
Query: 15 VIFVSTSFHANGLKLR-PRLGRIRRSRILEQKDSNHGF-ETFFYNQTIDHFSYGPESYQT 72
VIF+ + ++LR P + + N+ + E +F N IDHFS+ +T
Sbjct: 8 VIFLCSVAFVQSIRLRDPVTFEFTSKAYIGLSEPNYKWTEEWFDNMPIDHFSFADN--RT 65
Query: 73 FPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGK 132
F RY+IN+ ++ PI + G E I+ +GF ++ A +F A IV EHRYYGK
Sbjct: 66 FHLRYLINTDYFIK-YGPIFFYTGNEGNIEGFASNTGFMWDIAAEFGAAIVFAEHRYYGK 124
Query: 133 SVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY--NATHAPVIAIGASYGGEL 190
+ PFG+ A + + GY +S QALADYA ++ +++++ NA ++ VIA G SYGG L
Sbjct: 125 THPFGNESYA--SVSNLGYLSSEQALADYAHLIQYLRNERLKNAINSTVIAFGGSYGGML 182
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDI-TPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
A W R+KYPH+V G++A+SAP+ ++ P + ++V + F ++ + I +WS
Sbjct: 183 AAWIRIKYPHLVEGAIAASAPVFWFPQTNVPEDIFDNIVKRSFVNSGCKA-DAIIAAWSA 241
Query: 250 IDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEA-----------AQYDTPSN 298
I+ +A+ G L+ FK L+ + L+ D + +A Y PS
Sbjct: 242 IEELANSEQGRTYLNSLFK----LEEKSFLQKSEDVNFLKAFIRESFESMAMVNYPYPSE 297
Query: 299 I-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDE 351
PVK C + + + +V + G +N D D
Sbjct: 298 FLAPLPGWPVKVACGFFNSTEMKTREHHAESLYSMVNL-YYNFTGEKKTLCVNPDVCSDS 356
Query: 352 S------DEGWEWQSCSEMVVPM---GKDKNSMYQPEPWNLTKYIKNCKEQYG------V 396
+ GW WQ+C+EMV+ + G + ++ P+ + I+ C++Q+G
Sbjct: 357 AYGALGDPLGWPWQACTEMVMQLCASGPPNDFFWKDCPFTVKGVIEGCEKQFGKIGYTKQ 416
Query: 397 SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
RP W + YG H SNIIFSNG DP+S GGW S + G
Sbjct: 417 LTRPDWAILNYGSH------YPCASNIIFSNGYLDPWSGGGWSLKPQTVGSLVSIIIEDG 470
Query: 457 SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
+H DL + D + + + R+ E + WI +D K NK
Sbjct: 471 AHHYDLRGSHPKDTEAVKEARRLERIYIGKWI---HDAMKRFNK 511
>gi|116242320|gb|ABJ89817.1| lysosomal pro-X carboxypeptidase [Clonorchis sinensis]
Length = 434
Score = 195 bits (495), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 122/414 (29%), Positives = 209/414 (50%), Gaps = 20/414 (4%)
Query: 90 PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHR 149
PIL + G E I+ + +GF +E A + KA ++ EHR+YG S+PF + + K+ +H
Sbjct: 2 PILFYTGNEGAIETFAENTGFMWEIAEELKAAVLFAEHRFYGSSLPFVND--SFKDPQHF 59
Query: 150 GYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLAS 208
GY + QALADYAS++ ++K + ++PVIA G SYGG L+ WFR KYP+++ G++A+
Sbjct: 60 GYLTAEQALADYASLVQYLKSSVKDFENSPVIAFGGSYGGMLSAWFRYKYPNLIAGAIAA 119
Query: 209 SAPILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKF 267
SAPI + +++ ++ ++ F + S C + + W I +A + G +L F
Sbjct: 120 SAPIWLFPNMSNCAGFYDTTTRAFSTSGSTVCTKNVALVWDSIRTVAKQHSGHELLRLMF 179
Query: 268 KTCTPLKNTTELKDGLDTVYSEAAQYDTPSNI---------PVKRICNAIENAPNCGDDI 318
+ C PL + +L D L A + P PVK C + +A + D+
Sbjct: 180 QLCDPLPDEQKLIDYLIDFLGTLAMVNYPYEASFIGTFPGEPVKYFCKGLSDAVHRDVDV 239
Query: 319 LCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQP 378
Y N S + D G ++ + W Q+C EM PM + M+
Sbjct: 240 DVVQRVATAVRSLTNYTKNQSCISLEGDLPGLDA-KAWTLQTCLEMTTPMCSNGEGMFPS 298
Query: 379 EPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGW 438
W+ + ++C +++ V PR +W + G +IK + +NI+FSNG DP+S G
Sbjct: 299 LEWDPVVFSQSCFDKFAVRPRLNWSAVEFWGKNIK-----TATNIVFSNGDLDPWSAFGV 353
Query: 439 VKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+ + + + G+H LDL + DP +V R+ E++ ++ WI +++
Sbjct: 354 LTDDQAPGCNVIR-IPSGAHHLDLRAKNELDPADVVDARQRELQHIKDWIDEWH 406
>gi|354507428|ref|XP_003515758.1| PREDICTED: dipeptidyl peptidase 2-like [Cricetulus griseus]
gi|344258871|gb|EGW14975.1| Dipeptidyl-peptidase 2 [Cricetulus griseus]
Length = 506
Score = 194 bits (494), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 145/473 (30%), Positives = 236/473 (49%), Gaps = 38/473 (8%)
Query: 35 RIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAF 94
+ R R+L+ F ++ Q +DHF++ +TF QR++++ KFW G PI +
Sbjct: 30 QARADRVLDPD-----FRENYFEQYMDHFNFESFGNKTFAQRFLVSDKFWKMGKGPIFFY 84
Query: 95 LGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY--- 151
G E I SGF E A Q +AL+V EHRYYGKS+PFG + + RGY
Sbjct: 85 TGNEGDIWTFANNSGFMVELAAQQEALLVFAEHRYYGKSLPFGLQ------STQRGYTQL 138
Query: 152 FNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAP 211
QALAD+A +L ++ P IA G SYGG L+ + R+KYPH+V G+LA+SAP
Sbjct: 139 LTVEQALADFAVLLQALRQDLKVQDIPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAP 198
Query: 212 ILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
++ + + V+ DF S +C Q ++ ++ +I ++ + + K F TC
Sbjct: 199 VVAVAGLGESYQFFRDVTADFYGQSPKCAQAVRDAFQQIRDLFLQ-GAHDTIRKNFGTCQ 257
Query: 272 PL---KNTTELKDGLDTVYSEAAQYDTP------SNIPVKRICNAIENAPNCGDDI--LC 320
L K+ T+L ++ A D P N+P + A E + G I L
Sbjct: 258 SLSSSKDLTQLFVFARNAFTVLAMTDYPYPTEFLGNLPANPVKVACERMLSKGQRIMGLR 317
Query: 321 KIAAGVVEADSLEYDGNNSRCYIN-EDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSM 375
+ V + +E + R Y + D TG + + W++Q+C+E+ + + M
Sbjct: 318 ALVGLVYNSSGMEPCFDIYRLYQSCADPTGCGTGSNAKAWDYQACTEINLTFDSNNVTDM 377
Query: 376 YQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSR 435
+ P++ + C +GV PRP W+ T + G D+K + SNIIFSNG DP++
Sbjct: 378 FPVIPFSDELRQEYCLHTWGVWPRPDWLRTSFWGGDLK-----AASNIIFSNGDLDPWAG 432
Query: 436 GGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
GG + ++ + ++ G+H LDL + +DP +V+ RK E +++ W+
Sbjct: 433 GGIRRNLSTSIIAVT--IHGGAHHLDLRASNSADPQSVVEVRKLEAALIREWV 483
>gi|326438061|gb|EGD83631.1| hypothetical protein PTSG_04239 [Salpingoeca sp. ATCC 50818]
Length = 528
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 144/494 (29%), Positives = 228/494 (46%), Gaps = 76/494 (15%)
Query: 55 FYNQTIDHFSYGPESYQ--TFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGF 110
F+ Q IDHF++ T+ QRY I ++ N +PI + G E + + +G
Sbjct: 51 FFTQNIDHFNWAKPLNDKFTYRQRYFICDQYADLSNPKTPIFFYFGNEDDVTLYVNNTGL 110
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+ENA +KAL+V EHRYYGKS PF + N + + QA+ADYA+++ +K
Sbjct: 111 MWENAASYKALLVFAEHRYYGKSKPFPAGTPGCMN-----WLTTEQAMADYATLIRDLKQ 165
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH---SV 227
N T APVI G SYGG LA +FR KYP +V G +A SAPI + +TP Y+ ++
Sbjct: 166 DLNLTPAPVIGFGGSYGGMLAAYFRRKYPDIVDGVIAGSAPIWAFSGLTPAYDYYGFNNI 225
Query: 228 VSKDFRD---TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL- 283
++ D S+ C K I IAS G +LS++ + C PL + + + L
Sbjct: 226 IADDASSKGGASDHCRNNFKAIQPRIMAIASTQHGRHMLSQQLRLCKPLASDQDAYNILL 285
Query: 284 --DTVYSEAAQYDTP-------------SNIPVKRICNAIENAPNCGDDILCKIAAGVVE 328
++ A D P PV+ C + + +D K + + +
Sbjct: 286 WAQNAWAYMAMGDFPYASGYIVHGRGKLPPYPVREACKPLSDPQLPANDT--KFISALRD 343
Query: 329 ADSLEYD-------------------GNNSRCYINEDR----------TGDESDEGWEWQ 359
A + Y+ G++ ++ R TGD W +Q
Sbjct: 344 AMDVYYNYTHTEPCFDLFPATSIPRLGHHPHHLLSRPRPAAAVAAAQCTGD-----WGYQ 398
Query: 360 SCSEMVVP--MGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILR 417
C+EMV+P G K+ + P++L + IK C++Q+GV+PRP W D+
Sbjct: 399 FCTEMVMPSSQGGPKDMFWPALPFDLNETIKQCQQQWGVTPRPLWAPLNLASKDLT---- 454
Query: 418 RSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQR 477
SN++ SNG DP+ GG V D + + G+H +DL + +DP ++ R
Sbjct: 455 -DVSNMVLSNGGLDPWRAGGVVTNVS--DSVVAVVIESGAHHIDLMFSDPADPPDVIAAR 511
Query: 478 KTEVKIMQGWITQY 491
+ E++ + WI Q+
Sbjct: 512 RLELQHISRWINQH 525
>gi|73967473|ref|XP_848703.1| PREDICTED: dipeptidyl peptidase 2 isoform 1 [Canis lupus
familiaris]
Length = 497
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 146/470 (31%), Positives = 236/470 (50%), Gaps = 41/470 (8%)
Query: 44 QKDSNHGFETFF----YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEA 99
+ ++ G E+ F + Q +DHF++ +TF QR++++ KFW G PI + G E
Sbjct: 25 ETGAHRGVESHFWEGYFEQLLDHFNFERFGNKTFQQRFLVSEKFWKRGKGPIFFYTGNEG 84
Query: 100 PIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY---FNSAQ 156
+ SGF E A Q +AL++ EHRYYGKS+PFG + + RGY Q
Sbjct: 85 NVWSFANNSGFILELAAQQEALVIFAEHRYYGKSLPFG------EQSTRRGYTELLTVEQ 138
Query: 157 ALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYG 216
ALAD+A +LL ++ A +P IA G SYGG L+ + R+KYPH+V G+LA+SAP++
Sbjct: 139 ALADFARLLLALRRDLGAQDSPAIAFGGSYGGMLSAYMRIKYPHLVAGALAASAPVVAVA 198
Query: 217 DITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--- 273
+ + VS DF S +C Q ++ ++ +I ++ + ++S++F TC PL
Sbjct: 199 GLGDSYQFFRDVSADFEGQSPKCAQGVRDAFQQIQDLCFQ-GACDVVSREFGTCQPLSSR 257
Query: 274 KNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILC---KIAAGVVEAD 330
K+ T+L ++ A D P P I + N G D L + G+
Sbjct: 258 KDLTQLFGFARNAFTVLAMMDYP--YPTHFIAHLPANPVKVGCDRLLSESQSIKGLRALA 315
Query: 331 SLEYDGNNS-RCY-------INEDRTGDESD---EGWEWQSCSEMVVPMGKDKNSMYQPE 379
L Y+ + + CY D TG S + W++Q+C+E+ + + + PE
Sbjct: 316 GLVYNSSGTVPCYDIYLQYQACADPTGCGSGPNAKAWDYQACTEINLTFSSNNVTDLFPE 375
Query: 380 -PWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGW 438
P+ + C + +GV PR W+ T +GG D+ R SNI+FSNG DP++ GG
Sbjct: 376 LPFTDALRQQYCLDTWGVWPRRDWLQTSFGGDDL-----RGASNILFSNGDLDPWAGGGI 430
Query: 439 VKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L+ + + G+H LDL + DP + + R+ E +++ W+
Sbjct: 431 RSNLSATVLAIT--IQGGAHHLDLRASHPEDPASVREARRFEARLIGEWV 478
>gi|441623431|ref|XP_004088909.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 2 [Nomascus
leucogenys]
Length = 485
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 136/463 (29%), Positives = 222/463 (47%), Gaps = 38/463 (8%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
++ + GF+ F+ Q +DHF++ +TFPQR++++ +FW G PI + G E +
Sbjct: 24 RRAPDPGFQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWTQGKGPIFFYTGNEGDVWA 83
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
SGF E A + AL++ EH G+R+ + QALAD+A
Sbjct: 84 FANNSGFVAELAAEQGALLIFAEH--------VGARQGRGRGXXXXXXVE--QALADFAE 133
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+L ++ AP IA G SYGG L+ + R+KYPH+V G+LA+SAP+L +
Sbjct: 134 LLRALRRDLGXQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQ 193
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELK 280
+ V+ DF S +C Q +++++ I ++ + + +F TC PL K+ T+L
Sbjct: 194 FFRDVTADFEGQSPKCTQGVREAFRRIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLF 252
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEYDGN 337
++ A D P P + N G D L A G+ L Y+ +
Sbjct: 253 RFARNAFTVLAMMDYP--YPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVYNAS 310
Query: 338 NSR-CY-------INEDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTK 385
S+ CY D TG + W++Q+C+E+ + + M+ P+
Sbjct: 311 GSQHCYDIYRLYHSCADPTGCGTGPNARAWDYQACTEINLTFASNNVTDMFPNLPFTEEL 370
Query: 386 YIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
+ C + +GV PRP W+LT + G D+ R+ SNIIFSNG DP++ GG +
Sbjct: 371 RQQYCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSAS 425
Query: 446 DLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
++ + + G+H LDL + DP +V+ RK E I+ W+
Sbjct: 426 VIAVT--IQGGAHHLDLRASHPEDPVSVVEARKLEATIIGEWV 466
>gi|241753998|ref|XP_002401172.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215508369|gb|EEC17823.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 329
Score = 191 bits (486), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 122/342 (35%), Positives = 183/342 (53%), Gaps = 38/342 (11%)
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
G +E A +F AL+V EHRYYGKS+PFG+R + ++ GY S QALADYA +LLH+
Sbjct: 1 GLMWEWAPEFNALLVFAEHRYYGKSMPFGNR--SFESPSKLGYLTSEQALADYADLLLHL 58
Query: 169 KDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
K K A +PV+A G SYGG L+ WFR+KYPH++ +LASSAP+ + + P +TY
Sbjct: 59 KAKLPGAEKSPVVAFGGSYGGLLSAWFRIKYPHLITAALASSAPVNMFPGLVPCSTYSIA 118
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK--NTTELKDGLDT 285
+++ FR SE C Q I++SWS ++ + + G L +KF C L N T +D +
Sbjct: 119 ITEAFRRVSELCTQAIRQSWSPLEAMGATEKGTKTLQEKFNLCQGLNPGNYTVFRDWIRD 178
Query: 286 VYSEAA--QYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDG 336
Y+ A Y P ++ PVK +C+A+ A G+ + V A +L ++
Sbjct: 179 TYAVLALVNYPEPGSLITPLPGSPVKAVCDALTKA--IGNR--SAMVDAVAAAVNLFFNS 234
Query: 337 NNSRCYINEDRTGDESDEGWEWQ-----------SCSEMVVPMGKDK-NSMYQPEPWNLT 384
+R ++ + W +Q C+E+V+P+ D M+ P WN T
Sbjct: 235 TGTR-KCHDVSIFQSAVPSWRFQVSHLCTLAYNAGCTELVMPVCSDGVTDMFYPSSWNFT 293
Query: 385 KYIKNCKEQYGVSPRP-SWVLTYYGGHDIKLILRRSTSNIIF 425
+ C+E +GV+P V+ Y GGH L R+T NI+F
Sbjct: 294 EVTAKCRETFGVTPDIYKSVMLYGGGH-----LARAT-NIVF 329
>gi|410990145|ref|XP_004001310.1| PREDICTED: dipeptidyl peptidase 2 [Felis catus]
Length = 549
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 139/457 (30%), Positives = 224/457 (49%), Gaps = 31/457 (6%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F+ ++ Q +DHF++ +TF QR++++ KFW G PI + G E + SGF
Sbjct: 36 FQEGYFEQLLDHFNFERFGNKTFLQRFLVSEKFWKRGEGPIFFYTGNEGNVWSFANNSGF 95
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
E A Q AL++ EHRYYGKS+PFG R H QALAD+A +L ++
Sbjct: 96 ILELAAQQGALVIFAEHRYYGKSLPFGERST---QRGHTELLTVEQALADFARLLNALRR 152
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
A P I G SYGG L+ + R+KYPH+V G+LA+SAP++ + + VS
Sbjct: 153 DLGAQDTPAIVFGGSYGGMLSAYMRIKYPHLVAGALAASAPVVAVAGLGDSYQFFRDVSA 212
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVY 287
DF S +C Q ++ ++ +I ++ + +S++F TC PL K+ T+L +
Sbjct: 213 DFEGQSPKCAQGVRDAFRQIKDLFIQ-GAYDTVSQEFGTCQPLSGQKDLTQLFGFARNAF 271
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCGDDIL---CKIAAGVVEADSLEYDGNNSR-CY- 342
+ A D P P I + N G D L + G+ L Y+ + + CY
Sbjct: 272 TVLAMMDYP--YPTDFIGHLPANPVKVGCDRLLSETQRIKGLRALAGLVYNSSGTEPCYD 329
Query: 343 ------INEDRTGDESD---EGWEWQSCSEMVVPMGKDKNSMYQPE-PWNLTKYIKNCKE 392
D TG S + W++Q+C+E+ + + + P+ P+ + C +
Sbjct: 330 IYRQYQACADPTGCGSGPDAKAWDYQACTEINLTFSSNNVTDLFPDLPFTEGLRQQYCLD 389
Query: 393 QYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD 452
+GV PR W+ +G D+K + SNIIFSNG DP++ GG + L+ +
Sbjct: 390 TWGVWPRRDWLRISFGAGDLK-----AASNIIFSNGDLDPWAGGGIQRNLSTSVLAVT-- 442
Query: 453 LNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
+ G+H LDL + DP + + R+ E +++ W+
Sbjct: 443 IRGGAHHLDLRASHPRDPTSVREARRLEARLIGEWVA 479
>gi|403340721|gb|EJY69653.1| Lysosomal carboxypeptidase [Oxytricha trifallax]
Length = 477
Score = 191 bits (485), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 142/478 (29%), Positives = 231/478 (48%), Gaps = 38/478 (7%)
Query: 36 IRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG-GNSPILAF 94
+ S +L+ ++ ++T ++ IDHF+ + TF RY+IN K+ G G PIL +
Sbjct: 3 LESSNVLQ---ADPDYQTKQFDADIDHFTTQGSTTNTFKLRYLINDKYVTGPGPWPILFY 59
Query: 95 LGAEAPIDDNIQLSGFTYENAHQFKALIVIL-EHRYYGKSVPFGSRKAALKNARHRGYFN 153
G E I D SGF +V+ EHRYYG+S+PFG N +
Sbjct: 60 CGNEGIITDFYDNSGFVTTTLATATNALVVFAEHRYYGQSMPFGKDSFKPGNVN---FLT 116
Query: 154 SAQALADYASILLHIKDKYNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
QA+ DY +L IK N T++PVIA G SYGG +A W R++YP ++ G+ ASSAPI
Sbjct: 117 IDQAMMDYVKLLQFIKASDNRFTNSPVIAFGGSYGGMIAAWIRMRYPQIIYGAHASSAPI 176
Query: 213 LYY-GDITPWTTYHSVVSKDFRDTSEE--CFQTIKKSWSEIDNIASKPDGLAILSKKFKT 269
L++ G ++P+ ++ + ++ ++ +++ C I+ + ++ A+ A + F
Sbjct: 177 LFFPGTVSPY-AFNELATRSYQSATQDGRCAANIQYGFKILNQWAADNTTYAKIKDYFNA 235
Query: 270 CTPLKNTT-------ELKDGLDTVYSEAAQYDT------PSNIPVKRICNAIENAPNCGD 316
C +T E+ D L T+ YDT P+N PV+ C D
Sbjct: 236 CVAPASTDDIQFLLGEISDALGTMAQVNYPYDTNFTRFLPAN-PVQTACTKGAVDQKSDD 294
Query: 317 DILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKD-KNSM 375
D K G+ +A + Y G+ + + G GW +Q C+EMV+P+ + K M
Sbjct: 295 DGYVK---GLAQA-FMVYHGDKCVSFKPDPSDG---TSGWGYQVCNEMVMPIAQSGKTDM 347
Query: 376 YQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSR 435
+ P+PW+ ++ +C G+ P+ ++L +GG + L SNIIFSNG DP+
Sbjct: 348 FLPQPWDPDQFASDCASM-GLKPQFDFILDSFGGRNTNLDFAH-VSNIIFSNGDLDPWRA 405
Query: 436 GGWVK-TYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
GG + T D + + +H L+L +DP + R ++ GWI Y+
Sbjct: 406 GGVLPGTLAKNDKIVVRLIKNSAHHLELRLPNAADPQDVTDARTAFTTVITGWIADYW 463
>gi|449281466|gb|EMC88535.1| Dipeptidyl-peptidase 2, partial [Columba livia]
Length = 437
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 136/435 (31%), Positives = 211/435 (48%), Gaps = 43/435 (9%)
Query: 78 VINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFG 137
VI++KFW G PI + G E I + Q S F +E A + +AL++ EHRYYGKS+PFG
Sbjct: 1 VISAKFWKKGFGPIFFYTGNEGDIWNFAQNSDFIFELAEEQQALVIFAEHRYYGKSLPFG 60
Query: 138 SRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLK 197
KN G QALADYA ++ +K +Y A PVIA G SYGG L+ + R+K
Sbjct: 61 LESMQPKNT---GLLTVEQALADYAVLITELKQQYGAADCPVIAFGGSYGGMLSAYMRMK 117
Query: 198 YPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKP 257
YP++V G+LA+SAP+L + T + V+ DF +S C ++K++ +I ++ +
Sbjct: 118 YPNIVSGALAASAPLLSVAGLGDPTQFFRDVTADFDKSSPGCVPAVRKAFQQIKDLFLR- 176
Query: 258 DGLAILSKKFKTCTPL-------------KNTTELKDGLDTVYSEAAQYDTPSNIPVKRI 304
+S K TC + +N + +D Y P+N PVK
Sbjct: 177 GAYDEISSKMATCNKISTKEDVYQLFGFARNAFTMMAMMDYPYKTDFMGHLPAN-PVKVG 235
Query: 305 CNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY---------INEDRTGDESD-E 354
C+ I D + +AA V L ++++CY + G SD E
Sbjct: 236 CDQILTHA----DPIRGLAALV---GVLYNSSSSAQCYDIYQLYQSCADPTGCGIGSDAE 288
Query: 355 GWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIK 413
W++Q C+E+ + + M+ P+ + C ++ V PR W+ + G D+K
Sbjct: 289 AWDYQVCTEINLTFNSNNVTDMFPEMPFTEAMREQYCWSRWRVRPRARWLQINFWGGDLK 348
Query: 414 LILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWL 473
S SNIIFSNG DP++ GG + ++ + + G+H LDL +DP +
Sbjct: 349 -----SASNIIFSNGDLDPWAGGGINSSLSSSLIALT--IKGGAHHLDLRGHNPADPPSV 401
Query: 474 VQQRKTEVKIMQGWI 488
+ RK E I+ W+
Sbjct: 402 TEVRKLEASIINHWV 416
>gi|9858825|gb|AAG01154.1|AF285235_1 quiescent cell proline dipeptidase precursor [Mus musculus]
Length = 506
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 145/479 (30%), Positives = 232/479 (48%), Gaps = 48/479 (10%)
Query: 35 RIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAF 94
+ R R+L+ F ++ Q +DHF++ +TF QR++++ KFW G PI +
Sbjct: 30 QARADRVLDPD-----FHENYFEQYMDHFNFESFGNKTFGQRFLVSDKFWKMGEGPIFFY 84
Query: 95 LGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY--- 151
G E I SGF E A Q +AL+V EHRYYGKS+PFG + + RGY
Sbjct: 85 TGNEGDIWSFANNSGFMVELAAQQEALLVFAEHRYYGKSLPFGVQ------STQRGYTQL 138
Query: 152 FNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAP 211
QALAD+A +L ++ AP IA G SYGG L+ + R+KYPH+V G+LA+SAP
Sbjct: 139 LTVEQALADFAVLLQALRQDLGVHDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAP 198
Query: 212 ILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
++ + + V+ DF S +C Q ++ ++ +I ++ + +S+ F TC
Sbjct: 199 VVAVAGLGDSYQFFRDVTADFYGQSPKCAQAVRDAFQQIKDLFLQ-GAYDTISQNFGTCQ 257
Query: 272 PL---KNTTELKDGLDTVYSEAAQYDTP------SNIPVKRICNAIENAPNCGDDIL-CK 321
L K+ T+L ++ A D P +P + + N G I+ +
Sbjct: 258 SLSSPKDLTQLFGFARNAFTVLAMMDYPYPTDFLGPLPANPVKVGCQRLLNEGQRIMGLR 317
Query: 322 IAAGVVEADSLEYDGNNSRCY---------INEDRTGDESD-EGWEWQSCSEMVVPMGKD 371
AG+V S CY + G SD W++Q+C+E+ + +
Sbjct: 318 ALAGLVYNSS-----GTEPCYDIYRLYQSCADPTGCGTGSDARAWDYQACTEINLTFDSN 372
Query: 372 KNSMYQPE-PWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMR 430
+ PE P++ + C + +GV PR W+ T + G D+K + SNIIFSNG
Sbjct: 373 NVTDMFPEIPFSEELRQQYCLDTWGVWPRQDWLQTSFWGGDLK-----AASNIIFSNGDL 427
Query: 431 DPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
DP++ GG ++ + + G+H LDL + DP +V+ RK E +++ W+
Sbjct: 428 DPWAGGGIQSNLSTSVIAVT--IQGGAHHLDLRASNSEDPPSVVEVRKLESTLIREWVA 484
>gi|402591692|gb|EJW85621.1| hypothetical protein WUBG_03470, partial [Wuchereria bancrofti]
Length = 478
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 147/474 (31%), Positives = 228/474 (48%), Gaps = 49/474 (10%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E + N IDHFS+ + F RY+IN++ + N PI + G E ++ Q +G
Sbjct: 9 EEWLGNVPIDHFSF--HDNRVFRLRYLINTEHFVS-NGPIFFYTGNEGNVELFAQNTGLM 65
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
++ A +F A+I+ EHR+YGKS PFG++ A R+ GY +S QAL D+A ++ H+K+K
Sbjct: 66 WDLAPEFNAVIIFAEHRFYGKSQPFGNKSYA--TIRNLGYLSSEQALGDFALLIYHLKNK 123
Query: 172 --YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDIT---PWTTYHS 226
A ++ VIA G SYGG LA W R+KYPH+V GS+ASSAP+ ++ D++ P Y+
Sbjct: 124 RLLVAQNSSVIAFGGSYGGMLAAWMRIKYPHLVEGSIASSAPVFWFIDMSRSVPQDAYNR 183
Query: 227 VVSKDFRDTSEECFQ-TIKKSWSEIDNIASKPDGLAILSKKFKT--CTPLKNTTE---LK 280
+V + F S C + I W + N++ G A L+ F + LK +T+ LK
Sbjct: 184 IVKRSF--LSSGCIEKNILDGWIALKNLSLTTTGRAYLNGLFHLDKKSYLKISTDWIMLK 241
Query: 281 DGLDTVYSEAAQ--YDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADS 331
+ L+ ++ A Y P+N PVK C + D+ L + G++
Sbjct: 242 EYLEDIFGSMAMVNYPYPTNYLAQLPGWPVKVACQFFNSNKAKSDEELAQSMYGIMNL-Y 300
Query: 332 LEYDGNNSRCYI-----NEDRTGDESDE-GWEWQSCSEMVVPM---GKDKNSMYQPEPWN 382
Y G I N+ G D GW WQSC+EM++ G + + P++
Sbjct: 301 YNYTGQKKTFCIKPNVCNDSAYGALGDPFGWTWQSCTEMIMQQCSSGPPNDFFIKNCPFS 360
Query: 383 LTKYIKNCKEQYG------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRG 436
L C +G RP W + YG + +NI+FSNG DP+S G
Sbjct: 361 LKGQELYCINTFGKLGYTKALMRPHWSILNYGNR------YPTATNIVFSNGYLDPWSAG 414
Query: 437 GWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
GW S + G+H DL + D + + R+ E ++ W+ +
Sbjct: 415 GWSLKSRVMGSLISIIIKDGAHHYDLRGKHQLDTNSVKDARRLEKFYIKYWLKE 468
>gi|31981425|ref|NP_114031.2| dipeptidyl peptidase 2 precursor [Mus musculus]
gi|341940461|sp|Q9ET22.2|DPP2_MOUSE RecName: Full=Dipeptidyl peptidase 2; AltName: Full=Dipeptidyl
aminopeptidase II; AltName: Full=Dipeptidyl peptidase 7;
AltName: Full=Dipeptidyl peptidase II; Short=DPP II;
AltName: Full=Quiescent cell proline dipeptidase; Flags:
Precursor
gi|20072621|gb|AAH27205.1| Dipeptidylpeptidase 7 [Mus musculus]
gi|148676290|gb|EDL08237.1| dipeptidylpeptidase 7, isoform CRA_d [Mus musculus]
Length = 506
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 145/479 (30%), Positives = 232/479 (48%), Gaps = 48/479 (10%)
Query: 35 RIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAF 94
+ R R+L+ F ++ Q +DHF++ +TF QR++++ KFW G PI +
Sbjct: 30 QARADRVLDPD-----FHENYFEQYMDHFNFESFGNKTFGQRFLVSDKFWKMGEGPIFFY 84
Query: 95 LGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY--- 151
G E I SGF E A Q +AL+V EHRYYGKS+PFG + + RGY
Sbjct: 85 TGNEGDIWSFANNSGFMVELAAQQEALLVFAEHRYYGKSLPFGVQ------STQRGYTQL 138
Query: 152 FNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAP 211
QALAD+A +L ++ AP IA G SYGG L+ + R+KYPH+V G+LA+SAP
Sbjct: 139 LTVEQALADFAVLLQALRQDLGVHDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAP 198
Query: 212 ILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
++ + + V+ DF S +C Q ++ ++ +I ++ + +S+ F TC
Sbjct: 199 VVAVAGLGDSYQFFRDVTADFYGQSPKCAQAVRDAFQQIKDLFLQ-GAYDTISQNFGTCQ 257
Query: 272 PL---KNTTELKDGLDTVYSEAAQYDTP------SNIPVKRICNAIENAPNCGDDIL-CK 321
L K+ T+L ++ A D P +P + + N G I+ +
Sbjct: 258 SLSSPKDLTQLFGFARNAFTVLAMMDYPYPTDFLGPLPANPVKVGCQRLLNEGQRIMGLR 317
Query: 322 IAAGVVEADSLEYDGNNSRCY---------INEDRTGDESD-EGWEWQSCSEMVVPMGKD 371
AG+V S CY + G SD W++Q+C+E+ + +
Sbjct: 318 ALAGLVYNSS-----GTEPCYDIYRLYQSCADPTGCGTGSDARAWDYQACTEINLTFDSN 372
Query: 372 KNSMYQPE-PWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMR 430
+ PE P++ + C + +GV PR W+ T + G D+K + SNIIFSNG
Sbjct: 373 NVTDMFPEIPFSEELRQQYCLDTWGVWPRQDWLQTSFWGGDLK-----AASNIIFSNGDL 427
Query: 431 DPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
DP++ GG ++ + + G+H LDL + DP +V+ RK E +++ W+
Sbjct: 428 DPWAGGGIQSNLSTSVIAVT--IQGGAHHLDLRASNSEDPPSVVEVRKLESTLIREWVA 484
>gi|307108156|gb|EFN56397.1| hypothetical protein CHLNCDRAFT_144962 [Chlorella variabilis]
Length = 496
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 137/467 (29%), Positives = 225/467 (48%), Gaps = 52/467 (11%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF++ +Y F QR+ + ++W G S + ++G EA + + +G +E A ++
Sbjct: 26 LDHFTWVNPTY--FKQRFFVCDEYWRPGGS-VFLYIGNEADVTLYLNNTGLMWELAPKYD 82
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV 179
A++V EHRYYG+S PF + + L+ +H + S QA+ DYA++L +K + PV
Sbjct: 83 AMLVFAEHRYYGQSKPFPA--SVLR--KHMAWLTSEQAMGDYATLLWELKRELGDPDVPV 138
Query: 180 IAIGASYGGELATWFRLKYPHVVIGSLA-SSAPI-LYYGDITPW--TTYHSVVSKDFR-- 233
I G SYGG L TWFR+KYPH+V G +A S+API Y G+ P+ ++ +V++D
Sbjct: 139 IGFGGSYGGMLGTWFRMKYPHLVDGVIAGSAAPIWTYKGENPPYDPGSFAKIVTQDASPE 198
Query: 234 -DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-----KNTTELKDGLDTVY 287
++E C ++ +W +D S +G +S + C ++ T L D + +
Sbjct: 199 GGSAEACADNVRAAWKLLDRWGSSEEGRQSISAAMRLCPESAVESGEDATALCDWASSAW 258
Query: 288 SEAA-------QYDTPS------------NIPVKRICNAIENAPNCGDDILCKIAAGVVE 328
A Y PS PV+ C + G+ +L G+
Sbjct: 259 DYLASAARAMGNYPYPSVYIVNGAQPPLPAFPVRVACGHLAEPGLDGEALL----EGLAR 314
Query: 329 ADSLEYDGNNS-RCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDK-NSMYQPEPWNL 383
A + Y+ C+ + E+DE W +Q C+E KD + M+ EP++
Sbjct: 315 AAGVFYNHTGDLPCFSFKQGPNPETDEDADFWGYQYCTEQFQVFSKDGVHDMFWEEPFST 374
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
I++CK+ +GV PRP W +GG + + SNI+FSNG+ DP+S GG +
Sbjct: 375 KAAIQDCKDGWGVEPRPLWATIEWGGKRLG-----AASNIVFSNGLLDPWSGGGVLANIS 429
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+ + + G+H LDL + DP + R E + WI Q
Sbjct: 430 QANDLVAVVIPEGAHHLDLMFSHPLDPPSVTAARAFEEHYIAKWIAQ 476
>gi|268575392|ref|XP_002642675.1| C. briggsae CBR-TAG-282 protein [Caenorhabditis briggsae]
Length = 505
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 150/510 (29%), Positives = 244/510 (47%), Gaps = 46/510 (9%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
+LL ++F +TS ++LR + R +++ E K + ++ + IDHFS+ +
Sbjct: 5 ILLAILFANTSCF---VRLRDPITRKSKNQYFE-KLGKYKYDEGYLKVPIDHFSFTND-- 58
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
F RY +N+ + G PIL + G E ++ + +G ++ A + KA +V +EHR+Y
Sbjct: 59 YEFDLRYFLNTDHYESGG-PILFYTGNEGSLESFAENTGLMWDLAPELKAAVVFVEHRFY 117
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY--NATHAPVIAIGASYGG 188
GKS PF + + + RH GY +S QALAD+A +++ A + VIA G SYGG
Sbjct: 118 GKSQPF--KNQSYTDIRHLGYLSSQQALADFALSAQFFRNEKIKGAQTSAVIAFGGSYGG 175
Query: 189 ELATWFRLKYPHVVIGSLASSAPILYYGDIT-PWTTYHSVVSKDFRDTSEEC-FQTIKKS 246
L+ WFR+KYPH+V G++A+SAP+ ++ D P Y +V++ F D C + ++K
Sbjct: 176 MLSAWFRIKYPHIVDGAIAASAPVFWFTDSNIPEDVYDFIVTRAFLDAG--CNRKAVEKG 233
Query: 247 WSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG---LDTVYSEAAQ------YDTPS 297
W +D +A G L+ +K K+ E KD L EA + Y P+
Sbjct: 234 WIALDELAKSDSGRQYLNVLYKLDP--KSKLESKDDIGFLKQYIREAMEAMAMVNYPYPT 291
Query: 298 NI-------PVKRICNAIENAPNCGDDILCKIAAGVVE------ADSLEYDGNNSRCYIN 344
+ PVK C NAP + + +V D + N ++C
Sbjct: 292 SFLSSLPGWPVKEACK-YANAPGKSQEESAEQLYNIVNLYYNFTGDKTTHCANAAKCDSA 350
Query: 345 EDRTGDESDEGWEWQSCSEMVVPM---GKDKNSMYQPEPWNLTKYIKNCKEQ-YGVSPRP 400
+ GD GW +Q+C+EMV+P+ G + ++ P+ KY C++ Y +
Sbjct: 351 YEALGDPL--GWPFQTCTEMVMPLCGSGYPNDFFWKDCPFTTEKYGDYCQQTFYKIGYNK 408
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL 460
+ + GG S SNI+FSNG DP+S GG+ + S L G+H
Sbjct: 409 TLLRPLAGGRAFGATSLPSASNIVFSNGYLDPWSGGGYDHSDRVQGSVISVILKQGAHHY 468
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
DL A D + + + R E ++ WI +
Sbjct: 469 DLRGAHPQDTEEVKKVRAMETSAIKKWIKE 498
>gi|432102770|gb|ELK30246.1| Lysosomal Pro-X carboxypeptidase [Myotis davidii]
Length = 353
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 97/250 (38%), Positives = 147/250 (58%), Gaps = 8/250 (3%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ ++ Q IDHF + ++ +TF QRY+I + W IL + G E I +GF
Sbjct: 47 YSIHYFEQKIDHFGF--KNDKTFNQRYLIADQHWRKEGGSILFYTGNEGDIIWFCNNTGF 104
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A + KA++V EHRYYG+S+PFG+ + +++RH + S QALAD+A ++ H+K
Sbjct: 105 MWDVAEELKAMLVFAEHRYYGQSLPFGAD--SFQDSRHLNFLTSEQALADFAELIKHLKR 162
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + P IA+G SYGG LA WFR+KYPH+V+G+LA+SAPI + DI P + +V+
Sbjct: 163 TIPGAENQPFIALGGSYGGMLAAWFRMKYPHIVVGALAASAPIWQFEDIVPCGVFMKIVT 222
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTV 286
DF+ + C ++I+ SW I+ + GL LS+ CTPLKN + LKD +
Sbjct: 223 TDFKRSGPNCSESIQSSWDAINRLTRNGTGLDWLSEALHLCTPLKNAQDVQHLKDWISET 282
Query: 287 YSEAAQYDTP 296
+ A D P
Sbjct: 283 WVNLAMVDYP 292
>gi|224072899|ref|XP_002190034.1| PREDICTED: dipeptidyl peptidase 2 [Taeniopygia guttata]
Length = 468
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 136/452 (30%), Positives = 217/452 (48%), Gaps = 42/452 (9%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
Y+ + HF P +++ +KFW G PI + G E I Q S F +E A
Sbjct: 19 YSASTFHFILFPVAFE---------AKFWKKGFGPIFFYTGNEGDIWTFAQNSDFIFELA 69
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT 175
+ +AL++ EHRYYGKS+PFG LK QALADYA ++ +K ++ A
Sbjct: 70 EEQQALVIFAEHRYYGKSLPFGLESTQLKKT---ALLTVEQALADYAVLITELKQQFGAA 126
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
PVIA G SYGG L+ + R+KYP+VV G+LA+SAP+L + T + V+ DF+ +
Sbjct: 127 DCPVIAFGGSYGGMLSAYLRMKYPNVVAGALAASAPLLSVAGLGDPTQFFRDVTADFQKS 186
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-------------KNTTELKDG 282
S C ++K++ +I ++ +S K TC + +N +
Sbjct: 187 SLGCVTAVRKAFQQIKDLCLS-GAYDEISSKMATCNKISNKEDVYQLFGFARNAFTMMAM 245
Query: 283 LDTVYSEAAQYDTPSNIPVKRICNAI--ENAPNCGDDILCKI---AAGVVEADSLEYDGN 337
+D Y D P+N PVK C I P G L + ++G+ + ++ Y
Sbjct: 246 MDYPYKTDFMGDLPAN-PVKVGCEQIIAHKDPIEGLTALVGVFYNSSGLAQCYNI-YQLY 303
Query: 338 NSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGV 396
S TG ++ E W++Q C+E+ + + M+ P+ + C ++ V
Sbjct: 304 QSCADPTGCGTGSDA-EAWDYQVCTEINLTFDSNNVTDMFPEMPFTEAMREQYCWNKWHV 362
Query: 397 SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
PR W+ T + G D+K S SNIIFSNG DP++ GG + ++ + + G
Sbjct: 363 RPRAHWLQTNFWGEDLK-----SASNIIFSNGDLDPWAGGGINSSLSPSLIALT--IKGG 415
Query: 457 SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+H LDL + +DP + + R+ E I+ W+
Sbjct: 416 AHHLDLRGSNPADPPSVTEVRRLEAGIISSWV 447
>gi|348574464|ref|XP_003473010.1| PREDICTED: dipeptidyl peptidase 2-like [Cavia porcellus]
Length = 507
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 139/467 (29%), Positives = 237/467 (50%), Gaps = 39/467 (8%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
Q ++ + +++ Q +DHF++ +TF QR++++ KFW PI + G E +
Sbjct: 35 QHSTDPEYREYYFEQLLDHFNFESYGNKTFHQRFLMSDKFWKQPKGPIFFYTGNEGDVWV 94
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY---FNSAQALAD 160
SGF E A Q +AL++ EHRYYGKS+PFG++ + G+ QALAD
Sbjct: 95 FANNSGFLVELAQQQEALLIFAEHRYYGKSLPFGAQ------STQHGFMQLLTVEQALAD 148
Query: 161 YASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP 220
+A +L ++ A +P I G SYGG L+ + R+KYPH+V G+LA+SAP++ +
Sbjct: 149 FAVLLQVLRQDLCAQDSPTITFGGSYGGMLSAYMRIKYPHLVAGALAASAPVVAVAGLVD 208
Query: 221 WTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTT 277
+ V+ DF S +C Q +++++ EI N+ + +S++F TC L ++ T
Sbjct: 209 SYQFFRDVTADFYSQSPKCVQAVREAFQEIRNLYLQ-GAHERISREFGTCQLLSGSEDLT 267
Query: 278 ELKDGLDTVYSEAAQYDTPSNI---------PVKRICNAI--ENAPNCGDDILCKIAAGV 326
+L ++ A D P + PVK C+ + E P G L +A +
Sbjct: 268 QLFMFARNAFTVLAMMDYPYHTDFLVPLPANPVKVGCDILLNEAQPITG---LRMLAGMI 324
Query: 327 VEADSLEYDGNNSRCYIN-EDRTGDESD---EGWEWQSCSEMVVPMGKDKNS-MYQPEPW 381
+E+ + + Y + D TG S + W++Q+C+E+ + + S M+ +
Sbjct: 325 YNTSGMEHCYDIYQLYHSCADPTGCGSGSDAQAWDYQACTEINLTFSSNNVSDMFPTLLF 384
Query: 382 NLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKT 441
+ C E++GV PRP+W+ T +GG D+K + IIFSNG DP++ GG +
Sbjct: 385 TEELREQYCLEKWGVWPRPNWLQTSFGGGDLK-----GATKIIFSNGDLDPWAGGGIHR- 438
Query: 442 YHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ + + + G+H LDL + DP +V+ RK E ++ W+
Sbjct: 439 -NLSESVIAVMIQGGAHHLDLRASHPEDPASVVEARKLEAGLIWEWV 484
>gi|326428644|gb|EGD74214.1| hypothetical protein PTSG_06225 [Salpingoeca sp. ATCC 50818]
Length = 501
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 137/442 (30%), Positives = 212/442 (47%), Gaps = 48/442 (10%)
Query: 55 FYNQTIDHFSYGP----ESYQTFPQRYVINSKFWGGGN-SPILAFLGAEAPIDDNIQLSG 109
+YNQT DHF + P E TF QR I ++W N PI + G E ++ + +G
Sbjct: 46 WYNQTTDHFQWRPSGTAEEPLTFQQRVFICDQYWDKTNPGPIFFYAGNEGDVELYVNHTG 105
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
+E+A F+AL+V EHR+YGK+ + A+ + Y QA+ADYA +L H+K
Sbjct: 106 LMWESAPMFRALLVFAEHRFYGKTQL--TPGASGPSEHQYKYLTHDQAMADYAHLLYHLK 163
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP---WTTYHS 226
N + I G SYGG LA W R+KYP G++A+SAPIL + +TP Y
Sbjct: 164 RDRNCESSKTIVFGGSYGGMLAAWLRMKYPQTFDGAIAASAPILAFPGMTPPFDSNGYWQ 223
Query: 227 VVSKD---FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE----- 278
VV++D + C ++ +W E+ + G LS F+TC+P+ + +
Sbjct: 224 VVTRDATPAAGAAPACENNMRNAWKELFSRGKTESGRKSLSTLFRTCSPVTSEDDTWRLA 283
Query: 279 --LKDGLDTVYSEAAQYDTPSN-----------IPVKRICNAIENAPNCGDDILCKIAAG 325
L +DT+ Y PSN PV C + GD +L + G
Sbjct: 284 MFLLLSIDTL--AMGNYPYPSNYLTGGGPKLPAYPVVAACKPLAKKDLKGDALLSALRDG 341
Query: 326 -VVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMG----KDKNSMYQPEP 380
V A++ + + C+ D+ E D W++Q C+E++ M+ +P
Sbjct: 342 AAVYANATQ----DLTCFDIPDQKHVEQDGIWDYQWCTELMPQETYFSLNGTTDMFWAQP 397
Query: 381 WNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
++ +C+ +YG+ PR W+ YGG L + SNI+FSNG+ DP+S GG VK
Sbjct: 398 QDMAFVRDHCRTKYGIVPREDWMAVKYGG----LNALPAASNIVFSNGLLDPWSSGG-VK 452
Query: 441 TYHFFDLSFSQDLNLGSHCLDL 462
++ D + L G+H +DL
Sbjct: 453 -HNISDSITAIILPHGAHHIDL 473
>gi|325188924|emb|CCA23453.1| lysosomal ProX carboxypeptidase putative [Albugo laibachii Nc14]
Length = 544
Score = 187 bits (476), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 143/470 (30%), Positives = 228/470 (48%), Gaps = 57/470 (12%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKF-WGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +DHF+ + +FPQRY + + +LG EA + + +G+ +EN
Sbjct: 86 FKQRLDHFNVAQNA--SFPQRYFFCDPYELNAAIDAVFFYLGNEAEVTLYLNHTGWMWEN 143
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A +FKA ++ EHRY+G+S+PF K +++ ++ G+ +S QALADYA+++ IK N
Sbjct: 144 AWEFKAALIFAEHRYFGRSIPFP--KESIR--QNMGFLSSEQALADYAALITSIKQ--NR 197
Query: 175 TH---APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL-YYGDITPWTTYHSVVSK 230
TH AP I G SYGG LA WFR+KYPH++ G +A+SAP+L + GD P
Sbjct: 198 THLQRAPFIGFGGSYGGMLAAWFRVKYPHIIDGVIAASAPVLAFMGDQRPVDMEGFARVS 257
Query: 231 DFRDT-----SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE------- 278
F T S C I++SW + ++ G LSK F+ C +E
Sbjct: 258 TFDATMGAGASSNCASNIRQSWQSMWKLSKTLQGREKLSKIFQLCNDAILHSEKDAEAMI 317
Query: 279 --LKDGLDTVYSEAAQYDTPSNI-----------PVKRICNAIENA---PNCGDDILCKI 322
K+ D Y Y P++ PV+ C + +A P D +L
Sbjct: 318 MWAKEAFD--YMSMGNYPYPTSYIMNGESTLPSYPVRVACGFLSDAFVVPKEEDTLL--- 372
Query: 323 AAGVVEADSLEYDGNNSRCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDK-NSMYQP 378
V + + Y+ + + ++S W++ CSE+ +P D + M+ P
Sbjct: 373 -EAFVRSIGVYYNSTKQKSCHDMKPASEKSRRDADFWDYIYCSELYMPSTTDGIHDMFWP 431
Query: 379 EPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGW 438
WN ++ NC + +GVS RP W +T +GG +K + R SNI+FSNG DP+S G
Sbjct: 432 VAWNQSEDNANCIKTWGVSLRPFWAVTQFGG--LKALQR--ASNIVFSNGNYDPWSATGV 487
Query: 439 VKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
K+ + + G+H +DL + DP + + R+ E + ++ WI
Sbjct: 488 TKSISSSVVYIP--VPGGAHHIDLFFSNDLDPPEVRKARQLERQNIRRWI 535
>gi|260829575|ref|XP_002609737.1| hypothetical protein BRAFLDRAFT_78548 [Branchiostoma floridae]
gi|229295099|gb|EEN65747.1| hypothetical protein BRAFLDRAFT_78548 [Branchiostoma floridae]
Length = 422
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 119/391 (30%), Positives = 206/391 (52%), Gaps = 29/391 (7%)
Query: 45 KDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN 104
+D H ET ++NQ +DHF++ +TF +R +++ FW PI + G E PI
Sbjct: 31 RDPPH--ETRYFNQYLDHFNFASHGAETFQERVLVSDAFWRK-EGPIFFYTGNEGPITSI 87
Query: 105 IQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
GF + A +F+ALIV +EHRYYG+S+PFG N + G QALADYA +
Sbjct: 88 WNEVGFIKDLAEKFEALIVFVEHRYYGESLPFGETTF---NKENMGLLTVEQALADYAVL 144
Query: 165 LLHIKDKY--NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
+ ++ Y + PVIA G SYGG L+ + RLKYP++V G+LASSA + +TP
Sbjct: 145 ITNLTASYCEDPDVCPVIAFGGSYGGVLSAFMRLKYPNLVAGALASSANVYMSAGLTPGN 204
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK-PDGLAILSKKFKTCTPLKNTTELKD 281
V++DFR + C + +++ ++E++ +A + GL +S + + C+PL++ +L +
Sbjct: 205 ELFQDVTEDFRRYNPRCPERVREGFAEMERLAGQGKQGLHEISSRMRLCSPLQHHADLVN 264
Query: 282 G---LDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEADSLEYD 335
+ ++ A D P +I A +C D+L K + G+++A + Y+
Sbjct: 265 MYRWVREAFTVLAMEDLPYSISNGPSLPAYPVNASC--DLLLKASDGIEGILQAVGMLYN 322
Query: 336 -GNNSRCY-------INEDRTGDE---SDEGWEWQSCSEM-VVPMGKDKNSMYQPEPWNL 383
+N C+ D TG + W++Q+C+E+ ++ + M+ P+ +
Sbjct: 323 FTSNLTCFDLHRDFVPCADPTGCSLMPGAQAWDYQTCTEISLLESTNNVTDMFPPDAFTE 382
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKL 414
+C++++GV+PRP W+ T + G +L
Sbjct: 383 ETRAVHCRQRWGVTPRPGWLSTQFWGKGQRL 413
>gi|118346549|ref|XP_977067.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89288520|gb|EAR86508.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 502
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 136/483 (28%), Positives = 236/483 (48%), Gaps = 55/483 (11%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
++T +++Q +DH + +TF Q+Y+I ++ PIL + G EAP+D + +GF
Sbjct: 19 YQTKYFDQLVDHIGFETGD-KTFKQKYLIKDDYYRYDKGPILFYCGNEAPVDFSFGGAGF 77
Query: 111 TYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
+ A + AL+V +EHRY+G+S PFG+ K + K ++ Y S QA+ DYA L+ K
Sbjct: 78 MHTTLAQELNALVVFMEHRYFGESQPFGTEKESFKKGNNK-YLTSFQAINDYAKFLVWFK 136
Query: 170 DKYNA--THAPVIAIGA----------SYGGELATWFRLKYPHVVIGSLASSAPILYYGD 217
PV+A GA SYGG L+ W R+K+P ++ SLASSAPI Y +
Sbjct: 137 KSLGCGDDECPVVAFGALSNIFINYKASYGGMLSAWIRMKFPEIIDVSLASSAPIFLYEN 196
Query: 218 I--TPWTTYHSVVSKDFRDTSEE--CFQTIKKSWSEIDNIASKP-----------DGLAI 262
T ++ +V+ DT E+ C I ++ + + ++ + P L
Sbjct: 197 REGIDETLFYKIVT----DTYEQNGCNTQIHRAMNILTDLINSPVPSFLFKIQNKKILNE 252
Query: 263 LSKKFKTCTPLK---NTTELKDGLDTVYSEAAQYDTPSN---------IPVKRICNAIEN 310
+++ KTC P+ N L+ +D YS + ++ P P C E
Sbjct: 253 INEGMKTCKPITDQDNLDVLRSYIDQAYSYMSMFNYPQEGHFVSKMPAWPANYSCTPFEA 312
Query: 311 APNCGDDILCKIAAGVVEADSLEYDGNNSR-CY-INEDRTGDESDEGWEWQSCSEMVVPM 368
+ + ++ V ++ + YD + C N TG+ + +E +C+++V P+
Sbjct: 313 IND--KSTISQLFQAVKKSVDVYYDFEEQKECTNFNTGSTGEINTSAYEILTCADIVQPI 370
Query: 369 GKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSN 427
+ M+ +PW+ Y + C+E +G++P +VL +YGG + + + + + IIFSN
Sbjct: 371 HPNGVTDMFYDQPWDKDSYQQYCQETFGLTPNYDYVLNFYGGKNDEEM--KQFTRIIFSN 428
Query: 428 GMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGW 487
G+ DP+ G K Y DL ++ +HC DL + D + ++Q R E K ++ W
Sbjct: 429 GLLDPWQSGSPTK-YISDDLPII-NMYAAAHCSDLRLPQNGDVESVIQARIQEEKYIKQW 486
Query: 488 ITQ 490
I +
Sbjct: 487 IQE 489
>gi|290981802|ref|XP_002673620.1| predicted protein [Naegleria gruberi]
gi|284087205|gb|EFC40876.1| predicted protein [Naegleria gruberi]
Length = 503
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 138/488 (28%), Positives = 219/488 (44%), Gaps = 61/488 (12%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVI-----------------NSKFWGGGNSPILA 93
++T F+ Q +DHF + + TFPQRY + N+ PI+A
Sbjct: 29 YQTGFFTQRLDHFDF--TNIATFPQRYFVCDLYVKHSTRSSVIVDDNNLIQIDPFIPIIA 86
Query: 94 FLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFN 153
+ G E +++ + +G +E A + AL++ +EHR+YGK++P N + Y
Sbjct: 87 YPGNEGALEEFYENTGLVFELAKYYGALVIFIEHRFYGKTIP--------PNQDPQRYLT 138
Query: 154 SAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL 213
QA D A P+I +G SYGG+LA W R K+PH++ GS+A+SAPIL
Sbjct: 139 IEQATHDLAVFFTENFGLDEKRKNPIILVGGSYGGDLAAWMRFKFPHLIDGSIAASAPIL 198
Query: 214 YYGDITPWTTYHSVVSKDFRDTSE-------ECFQTIKKSW---SEIDNIASKPDGLAIL 263
++ ITP + ++ +R+ + C +KK + S+ + + L +L
Sbjct: 199 FFNGITPPYLAAQIATEAYRNLTNFQIYPKMTCDSAVKKGFEFLSKYFESTTSKEQLQML 258
Query: 264 SKKFKTCTPLKNTTELKD-----GLDTVYSEAAQYDTPSNI-------PVKRICNAI--- 308
S+KF+ C +K+ E+K A Y P+N PV +C +I
Sbjct: 259 SRKFRLCNEMKSNLEVKVLASYIAFSFEVLAQANYPYPTNFFNNLPAWPVNGLCTSIAKH 318
Query: 309 -ENAPNC-GDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVV 366
+PN +D+ I V Y G+ S C+ + G W Q C+EM++
Sbjct: 319 LATSPNLESEDLYFTILFDGVNLFQ-NYTGDKS-CFNTSNLGGGLQWNSWSLQLCNEMII 376
Query: 367 PMG-KDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIF 425
P G M+ P+NL +K C +Y +P+P W+ TY+GG K L SNIIF
Sbjct: 377 PSGFYPSTDMFFSNPYNLKVQMKACMSKYKFNPQPYWLATYFGG---KRALTEH-SNIIF 432
Query: 426 SNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQ 485
SNG D G K + G H LD+ + +DP + R+ E K +
Sbjct: 433 SNGQYDAVRAGSVEKGMKTSPSIIPIFIEQGGHHLDIRWSNPNDPQSVKIAREIEFKYVG 492
Query: 486 GWITQYYD 493
WI ++ +
Sbjct: 493 IWIQKFLN 500
>gi|308482020|ref|XP_003103214.1| CRE-PCP-5 protein [Caenorhabditis remanei]
gi|308260319|gb|EFP04272.1| CRE-PCP-5 protein [Caenorhabditis remanei]
Length = 507
Score = 185 bits (469), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 144/480 (30%), Positives = 233/480 (48%), Gaps = 44/480 (9%)
Query: 42 LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPI 101
E+ D + +E + IDHFS+ + F RY +N+ + G PIL + G E +
Sbjct: 32 FEKADGKYKYEEGYLKAPIDHFSFTND--YEFDLRYFLNTDNYESGG-PILFYTGNEGSL 88
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
+ + +GF ++ A + KA +V +EHR+YGKS PF + + + R+ GY +S QALAD+
Sbjct: 89 EAFAENTGFMWDLAPELKAAVVFVEHRFYGKSQPF--KNQSYTDIRNLGYLSSQQALADF 146
Query: 162 ASILLHIKDKY--NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDIT 219
A + +++ A ++ VIA G SYGG L+ WFR+KYPH+V G++A+SAP+ ++ D
Sbjct: 147 ALSVQFFRNEKIKGAKNSAVIAFGGSYGGMLSAWFRIKYPHIVDGAIAASAPVFWFTDSN 206
Query: 220 -PWTTYHSVVSKDFRDTSEEC-FQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT 277
P Y +V++ F D+ C + ++K W +D +A G L+ +K K+
Sbjct: 207 IPEDVYDFIVTRAFLDSG--CNRKAVEKGWIALDELAKTDSGRQYLNVLYKLDP--KSKL 262
Query: 278 ELKDG---LDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAA--GVVEADSL 332
E KD L E+ + N P + + + P+ CK A+ G + +S
Sbjct: 263 ENKDDVSFLKQYIRESMEAMAMVNYPYP--TSFLSSLPSWPVKEACKFASQPGKSQEESA 320
Query: 333 EYDGNNSRCYINEDRTGDESDE------------------GWEWQSCSEMVVPM---GKD 371
E N Y N TGD+S GW +Q+C+EMV+P+ G
Sbjct: 321 EQLYNIVNLYYN--YTGDKSTHCANAAKCDSAYGSLGDPLGWPFQTCTEMVMPLCGSGYP 378
Query: 372 KNSMYQPEPWNLTKYIKNCKEQYG-VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMR 430
+ ++ P+ KY + CK+ + +S + + GG S SNI+FSNG
Sbjct: 379 NDFFWKDCPFTTEKYAEYCKQTFAQISYNKTLLRPQAGGRAFGATSLPSASNIVFSNGYL 438
Query: 431 DPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
DP+S GG+ + S L G+H DL A D + + + R E ++ WI +
Sbjct: 439 DPWSGGGYDHSDKVQGSVISVILKQGAHHYDLRGAHPQDTEEVKKVRAMETAAIKKWIKE 498
>gi|405950118|gb|EKC18122.1| Lysosomal Pro-X carboxypeptidase [Crassostrea gigas]
Length = 576
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 129/368 (35%), Positives = 187/368 (50%), Gaps = 33/368 (8%)
Query: 145 NARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVI 203
+ + + S QALADYA ++ HIK ++ + VIA G SYGG LA WFR+KYP+VV
Sbjct: 11 DPKKMNFLTSEQALADYAVLIKHIKSSIPGSSQSKVIAFGGSYGGMLAAWFRMKYPNVVQ 70
Query: 204 GSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAIL 263
GSLA+SAPI + + V+ FR +S C IK W ++ AS+ GLA L
Sbjct: 71 GSLAASAPIWTFRKDADCDAFDRTVTGTFRKSSSTCVDNIKALWKTLNTTASQTGGLAKL 130
Query: 264 SKKFKTCTPLK---NTTELKDGLDT--VYSEAAQYDTPSNI-------PVKRICNAIENA 311
S+ F C PLK + T LK+ + + VY Y PS PVK C I
Sbjct: 131 SEMFHLCKPLKSADDVTTLKNWIVSALVYLAMVDYPYPSKFLAPLPAWPVKETCRPILTP 190
Query: 312 PNCGDDILCKIAAGVVEADSL--EYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVP-M 368
N GD+++ G+ +A ++ Y G+ S I + GW++QSC+EMV P
Sbjct: 191 LN-GDNLI----IGMAKAMNVFYNYTGSTSCFDIGSGDIPNLGISGWDYQSCTEMVAPSC 245
Query: 369 GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNG 428
K M++ W+ +Y C + + V P +W+ T Y G ++ + SNIIFSNG
Sbjct: 246 SNGKTDMFEKSAWDFKEYTNGCLKNWKVKPDINWIETQYWGKNL-----SAASNIIFSNG 300
Query: 429 MRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDP-DWLVQQRKTE----VKI 483
+ DP+S GG +K+ D + + G+H LDL + K+DP D +V TE +
Sbjct: 301 LLDPWSSGGVLKSQS--DSVVAILIPNGAHHLDLRGSNKADPADVIVMFFTTEYFCSLTC 358
Query: 484 MQGWITQY 491
GW+ Y
Sbjct: 359 TLGWVDSY 366
>gi|341900594|gb|EGT56529.1| CBN-PCP-5 protein [Caenorhabditis brenneri]
Length = 507
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 154/521 (29%), Positives = 249/521 (47%), Gaps = 49/521 (9%)
Query: 11 LLLMVIFVSTSFHANGL-KLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPES 69
LL+ I ++T +GL +LR + + R + E + +E + IDHFS+ +
Sbjct: 4 FLLVAILIAT---CHGLVRLRDPVTQ-RGPQKFENSIGKYKYEVGYLKVPIDHFSFTND- 58
Query: 70 YQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRY 129
F RY +N+ + G PIL + G E ++ + +GF ++ A + KA +V +EHR+
Sbjct: 59 -MEFNLRYFLNTDNYESGG-PILFYTGNEGSLEAFAENTGFMWDLAPELKAAVVFVEHRF 116
Query: 130 YGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY--NATHAPVIAIGASYG 187
YGKS PFG++ + + R GY +S QALAD+A + K++ A + VIA G SYG
Sbjct: 117 YGKSQPFGNQ--SYTDIRRLGYLSSQQALADFALSVQFFKNEKIKGAQKSAVIAFGGSYG 174
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDIT-PWTTYHSVVSKDFRDTSEEC-FQTIKK 245
G L+ WFR+KYPH+V G++A+SAP+ ++ D P Y +V++ F D C + I K
Sbjct: 175 GMLSAWFRIKYPHIVDGAIAASAPVFWFTDSNIPEDVYDFIVTRAFLDAG--CNRKAIDK 232
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG---LDTVYSEAAQYDTPSNIPVK 302
+W +D ++ G L+ +K K+ E KD L E+ + N P
Sbjct: 233 AWLALDELSKSDSGRRYLNILYKLDP--KSKLENKDDIGFLKQYIRESMEAMAMVNYPYP 290
Query: 303 RICNAIENAPNCGDDILCKIAA--GVVEADSLEYDGNNSRCYINEDRTGDESDE------ 354
+ + + P+ CK A+ G + +S E N Y N TGD+S
Sbjct: 291 --TSFLSSLPSWPVKEACKFASQPGKSQEESAEQLYNIVNLYYN--YTGDKSTHCANAAK 346
Query: 355 ------------GWEWQSCSEMVVPM---GKDKNSMYQPEPWNLTKYIKNCKEQYG-VSP 398
GW +Q+C+EMV+P+ G + ++ P+ KY + C + + +S
Sbjct: 347 CDSAYGSLGDPLGWPFQTCTEMVMPLCGSGYPNDFFWKDCPFTTAKYAEYCMQTFSQISY 406
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
+ + GG S SNI+FSNG DP+S GG+ + S L G+H
Sbjct: 407 NKTLLRPLAGGRAFGATSLPSASNIVFSNGYLDPWSGGGYEHSDKVQGTVVSVILKQGAH 466
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAIN 499
DL A D + + + R E ++ WI + + + N
Sbjct: 467 HYDLRGAHPQDTEEVKKVRAMETTNIKKWIKEKARNVRRFN 507
>gi|313242000|emb|CBY34184.1| unnamed protein product [Oikopleura dioica]
Length = 402
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 128/416 (30%), Positives = 203/416 (48%), Gaps = 40/416 (9%)
Query: 27 LKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG 86
L L L ++ S + D TF DHFS + Q R + + +F+
Sbjct: 8 LSLAAALKPLKYSSLESYSDFCSEISTF--EAEYDHFS--TRNTQKIEIRVITDDRFYQA 63
Query: 87 GNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNA 146
G P+L + G E + + +GF + + A +V +EHRYYGKS+P KN
Sbjct: 64 GG-PVLFYTGNEGDVQLFCENTGFMRKAGKELNAKLVFMEHRYYGKSIP------DDKNL 116
Query: 147 RHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSL 206
Y ++ QALADYA L+H+K + PVIA+G SYGG LA +FR+KYP++V G++
Sbjct: 117 ----YLSAEQALADYAEYLVHLKS--SGVTGPVIAMGGSYGGMLAAYFRIKYPNLVAGAI 170
Query: 207 ASSAPILYYGDITPWTTYHSVVSKDFRDTSEE--CFQTIKKSWSEIDNIASKPDGLAILS 264
A SAP+ + + ++ V ++ F +T C I+KSW I I + G LS
Sbjct: 171 AGSAPVKFLPGLFDCRGFYRVTTRTFTNTPSGHFCSDNIRKSWETIKLIGAHMVGKRTLS 230
Query: 265 KKFKTCTPLKNTTELKDGLDTVYSEAAQYD--TPSNI-------PVKRICNAIENAPNCG 315
+ F+TC P+ + L D L+ V+ A D P+N PV C+ +++ N
Sbjct: 231 EVFRTCDPITDVEPLLDFLENVWGTLAMMDYPYPTNFVGDVPGWPVNVACSHLDHDIN-Q 289
Query: 316 DDILCKIAAGVVEADSLEYD-GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKD-KN 373
+++L + +A S+ Y+ + C D GD + W +Q+C+E V P D K
Sbjct: 290 EELLEPLR----DAASVYYNYTGDLACLDLGDEGGDLGYDNWYFQTCTEFVFPFCSDGKE 345
Query: 374 SMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGM 429
M++ ++ Y NC++ +G +PR W ++ +K I IIFSNG+
Sbjct: 346 DMFRVHTYDFPTYSTNCQQTFGTTPREHWAEMFFSVETMKTI-----GGIIFSNGL 396
>gi|241161686|ref|XP_002408972.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215494441|gb|EEC04082.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 360
Score = 181 bits (459), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 104/264 (39%), Positives = 144/264 (54%), Gaps = 18/264 (6%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID--DNIQLSGFTYENAHQ 117
+DHF Y + TF RY++ ++W PI + G EA I+ N SG +E A +
Sbjct: 20 VDHFGYA--NNDTFKMRYLVADQYWDHDGGPIFFYTGNEADIEVFANKSYSGLMWEWAPE 77
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATH 176
FKAL++ EHRYYGKS+P+G+ + K GY + QALADYA +L H K D A
Sbjct: 78 FKALLIFAEHRYYGKSMPYGNE--SFKGPSRHGYLTAEQALADYADLLTHFKADVPGAGD 135
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
+ V++ G SYGG LA WFRLKYPHV +LASSAPIL + +TP + VV+K F S
Sbjct: 136 SKVVSFGGSYGGMLAAWFRLKYPHVTTAALASSAPILQFTGMTPCNAFSEVVTKAFAKES 195
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYD 294
+C I+ S+ I A+ +G L K+F+ C PL N T L+D + V++ A +
Sbjct: 196 NQCTNAIRTSFELIRKQAATEEGAKALKKQFRLCKPLAPSNDTVLRDWIRNVFAYLAMVN 255
Query: 295 TP---------SNIPVKRICNAIE 309
P PVK C ++
Sbjct: 256 YPYASKLTLPAPGHPVKEACKFLK 279
>gi|358333026|dbj|GAA51627.1| lysosomal Pro-X carboxypeptidase [Clonorchis sinensis]
Length = 412
Score = 181 bits (459), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 114/392 (29%), Positives = 197/392 (50%), Gaps = 20/392 (5%)
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+E A + KA ++ EHR+YG S+PF + + K+ +H GY + QALADYAS++ ++K
Sbjct: 2 WEIAEELKAAVLFAEHRFYGSSLPFVND--SFKDPQHFGYLTAEQALADYASLVQYLKSS 59
Query: 172 Y-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ ++PVIA G SYGG L+ WFR KYP+++ G++A+SAPI + +++ ++ ++
Sbjct: 60 VKDFENSPVIAFGGSYGGMLSAWFRYKYPNLIAGAIAASAPIWLFPNMSNCAGFYDTTTR 119
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
F + S C + + W I +A + G +L F+ C PL + +L D L
Sbjct: 120 AFSTSGSTVCTKNVALVWDSIRTVAKQHSGHELLRLMFQLCDPLPDEQKLIDYLIDFLGT 179
Query: 290 AAQYDTPSNI---------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSR 340
A + P PVK C + +A + D+ Y N S
Sbjct: 180 LAMVNYPYEASFIGTFPGEPVKYFCKGLSDAVHRDVDVDVVQRVATAVRSLTNYTKNQSC 239
Query: 341 CYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP 400
+ D G ++ + W Q+C EM PM + M+ W+ + ++C +++ V PR
Sbjct: 240 ISLEGDLPGLDA-KAWTLQTCLEMTTPMCSNGEGMFPSLEWDPVVFSQSCFDKFAVRPRL 298
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL 460
+W + G +IK + +NI+FSNG DP+S G + + + + G+H L
Sbjct: 299 NWSAVEFWGKNIK-----TATNIVFSNGDLDPWSAFGVLTDDQAPGCNVIR-IPSGAHHL 352
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
DL + DP +V R+ E++ ++ WI +++
Sbjct: 353 DLRAKNELDPADVVDARQRELQHIKDWIDEWH 384
>gi|440913231|gb|ELR62707.1| Dipeptidyl peptidase 2, partial [Bos grunniens mutus]
Length = 413
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 136/433 (31%), Positives = 212/433 (48%), Gaps = 56/433 (12%)
Query: 82 KFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKA 141
KFW G PI + G E + SGF E A Q AL+V EHRYYGKS+PFG R
Sbjct: 7 KFWNRGEGPIFFYTGNEGDVWSFANNSGFILELAEQQGALVVFAEHRYYGKSLPFGERST 66
Query: 142 ALKNARHRGY---FNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKY 198
RGY QALAD+A +L ++ + A AP IA G SYGG L+ + R+KY
Sbjct: 67 ------WRGYTELLTVEQALADFAGLLRALRQELEAPDAPAIAFGGSYGGMLSAYLRIKY 120
Query: 199 PHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPD 258
PH+V G+LA+SAP++ + + VS DF+ S EC + ++ ++ +I ++ +
Sbjct: 121 PHLVAGALAASAPVVSAAGLGDPYQFFQDVSADFQGQSPECARAVQDAFRQIRDLFQQ-- 178
Query: 259 GLAILSKKFKTCTPL---KNTTELKDGLDTVYSEAAQYDTP------SNIPVKRI-CNAI 308
+F TC PL K+ T+L ++ A D P ++P + C+ +
Sbjct: 179 ------GEFGTCQPLSGPKDLTQLFGFARNAFTVLAMMDYPYATDFVGHLPAHPVGCSRL 232
Query: 309 --ENAPNCGDDILCKIAAGVVEADSLE--YD--------GNNSRCYINEDRTGDESDEGW 356
E++ G L +A V + +E YD + + C + D + W
Sbjct: 233 LSESSRIAG---LRALAGLVYNSSGIEPCYDIYLQYQACADPTGCGLGSDA------KAW 283
Query: 357 EWQSCSEMVVPMGKDKNSMYQPE-PWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLI 415
++Q C+E+ + + S PE P+ + + C + +GV PR W+ T +GG D+
Sbjct: 284 DYQVCTEISLTFSSNNVSDLFPELPFTEAQRQQYCLDTWGVWPRQDWLQTSFGGGDLT-- 341
Query: 416 LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQ 475
+ SNIIFSNG DP++RGG L+ + + G+H LDL + DP +V+
Sbjct: 342 ---AASNIIFSNGDLDPWARGGIQSNLSASVLAIA--IRGGAHHLDLRGSHPDDPASVVE 396
Query: 476 QRKTEVKIMQGWI 488
R+ E ++ W+
Sbjct: 397 ARRLEAALIGKWV 409
>gi|301097471|ref|XP_002897830.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
gi|262106578|gb|EEY64630.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
Length = 569
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 143/483 (29%), Positives = 227/483 (46%), Gaps = 56/483 (11%)
Query: 52 ETFFYNQTIDHF-SYGPESYQTFPQRYVINSK-FWGGGNSPILAFLGAEAPIDDNIQLSG 109
E F Q++DHF + G S TF RY + S + N I ++G EA + + +G
Sbjct: 90 EEKFITQSLDHFRADGKSSEGTFDMRYFVCSPDNFSPTNGSIFFYVGNEADVTLYLNHTG 149
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
+ENA F ALIV EHRY+GKSVPFG L H + ++ QA+ADYA +L+ +
Sbjct: 150 LMWENAAAFNALIVFAEHRYFGKSVPFG-----LDVLDHMEFLSTQQAMADYA-VLIEML 203
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL-YYGD---ITPWTTYH 225
+ PVI G SYGG L TWFR+KYPH++ G +A SAP+ ++GD ++
Sbjct: 204 KRDLKVDVPVIGFGGSYGGMLGTWFRMKYPHIIDGIIAGSAPVANFFGDPDHPADPEAFN 263
Query: 226 SVVSKDFRD---TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG 282
VV+ D + + C I+++ + ++ G L++ C ++ + D
Sbjct: 264 RVVTFDMSEDAGAATNCIPNIRRALNTAVAMSGTKSGRKELAELLHLCD--ADSLQSSDK 321
Query: 283 LDTVYSEAAQYDTPSN--IPVKRICNAIENAPNCGDDILCKIAAGVVEAD---------- 330
+ ++ SEA N P I + + P C+ AGV D
Sbjct: 322 VISIASEAYGDLAVGNYPYPTSYIMDGKVDLPGYPMRTACEPFAGVFTEDDKSGLIRAFR 381
Query: 331 ---SLEYDGNNSR-CY--------INEDRTGDESDEG--------WEWQSCSEMVVPMGK 370
++ Y+ + S C I+E T D + + W + CSE+ +PM
Sbjct: 382 ESIAVYYNASKSESCLFPVSPVKTIDELDTSDAAKQARIDHKGNFWGYLECSELYMPMSS 441
Query: 371 DK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGM 429
D N ++ N ++ C E++GV +P W +GG + R+ SNI+FSNG
Sbjct: 442 DGVNDVFPTVAVNESQDNAACFEKWGVHLKPRWAQFEFGG----MKALRAASNIVFSNGN 497
Query: 430 RDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
DP+S G +++ + + G+H LDL SDP + + R TE+K ++ WI
Sbjct: 498 FDPWSGLGVLESLS--PSVVAVPVPGGAHHLDLFFTHPSDPPAVTEARNTELKYIRQWIN 555
Query: 490 QYY 492
++Y
Sbjct: 556 EFY 558
>gi|298710358|emb|CBJ31975.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 596
Score = 177 bits (450), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 143/520 (27%), Positives = 222/520 (42%), Gaps = 85/520 (16%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQ----TFPQRYVINSKFWG--GGNS-------- 89
++ S E F+ QT+DHF + P S TF QRY + ++WG GG S
Sbjct: 38 RRPSTADCEERFFTQTLDHFRHTPVSEHDDDNTFQQRYFVCREYWGPTGGGSANSPRGQE 97
Query: 90 --------------------------------------------PILAFLGAEAPIDDNI 105
PI + G EA + +
Sbjct: 98 DGASTSSSSSTSTSTSRRRKSNRGTAAEGGAGRGGEQAVPGAPGPIFFYTGNEADVSLYL 157
Query: 106 QLSGFTYENAHQFKALIVILEHRYYGKSVPFGS---RKAALKNARHRGYFNSAQALADYA 162
+ SG +ENA F AL+V EHR+YG+S+PFG+ R+ L+ A + QALADYA
Sbjct: 158 EASGLMWENAPAFNALLVFAEHRFYGESLPFGAPDKRREFLRQATA----GTPQALADYA 213
Query: 163 SILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
++ +K + A APVIA G SYGG LA+W RLKYPH+V G++A+SAP+L + T
Sbjct: 214 RLVTALKQELGAEGAPVIAFGGSYGGMLASWLRLKYPHIVHGAIAASAPVLALEGLHRPT 273
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD- 281
+E + D+ A+ P G L + + + +
Sbjct: 274 PNPEAF-------AETVTAAAGPAGGAADSCAANPRGDGALVELAWWARAAFDYLAMGNF 326
Query: 282 GLDTVYS-EAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS- 339
T Y + + + P P++ C+ + + +D + A + +A + Y+
Sbjct: 327 PYATGYMLNSGEVELPP-WPLREACSYLADPTLQAEDDDVLLGA-LADAIGVYYNATGEV 384
Query: 340 RCYINEDRTGDESD---EGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYG 395
C+ + S + W WQ+C+EM +PM D K M+ W+ C EQ+G
Sbjct: 385 GCFTPAAGANNASSVDADNWNWQACTEMSMPMSTDGKRDMFWRNDWDPVAQAAQCMEQFG 444
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
VSP W YGG+D +N++FSNG DP+S G V ++
Sbjct: 445 VSPGEGWGAAEYGGYDAW----SQVTNVVFSNGRLDPWSGMGVVDQRRAGGGVEVIMMDQ 500
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
+H LDL DP ++ R+ E+ ++ W+ Y +
Sbjct: 501 AAHHLDLFFEHPLDPQDVLDARRVEMDFVERWVDMAYGAY 540
>gi|324509048|gb|ADY43812.1| Serine protease pcp-1 [Ascaris suum]
Length = 593
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 153/524 (29%), Positives = 245/524 (46%), Gaps = 58/524 (11%)
Query: 8 FQWLLLMVIFVSTSFHANGLKLRPRLGRIR-RSRILEQKDSNHGFETFFYNQTIDHFSYG 66
F LL V S L + P R R R+ ++ +G+ T +Y+ ID+F++
Sbjct: 4 FGLLLASSFIVVCSATFRRLTVDPVALRERKRAEPRVDDETIYGWSTAYYDVPIDNFAF- 62
Query: 67 PESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILE 126
S QT+ +Y+ N ++ G PI + G E I++ + +G ++ A +FKA + E
Sbjct: 63 -TSAQTYRMKYLYNLTYYELG-GPIFFYTGNEGSIEEFAKNTGIMFDLAEKFKAAVFFAE 120
Query: 127 HRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY--NATHAPVIAIGA 184
HRYYG S+PFG+ + NA + GY +S QALAD+A ++ IK PVIA G
Sbjct: 121 HRYYGASMPFGN--ISYTNANYLGYLSSTQALADFAKLITFIKTDVLKCPPDTPVIAFGG 178
Query: 185 SYGGELATWFRLKYPHVVIGSLASSAPILYY--GDITPWTTYHSVVSKDFRDTSEECFQ- 241
SYGG LA W R+KYPH+V G+ +SSAP+LY+ G+++P + + V + F + C +
Sbjct: 179 SYGGMLAAWLRMKYPHIVSGAWSSSAPLLYFEGGNVSP-SAFEKAVKEVFINAG--CNEN 235
Query: 242 TIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV--------YSEAAQY 293
TI I N+ + +G L+ F+ P TE D V Y Y
Sbjct: 236 TIANGLEAIKNLMNTAEGRQFLNDLFRI-DPTSTLTESTDSDFLVEWIWAAMDYMAMVNY 294
Query: 294 DTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD--GNNSRCYIN 344
PSN PVK C EN + A + + ++ Y+ GN + +N
Sbjct: 295 PYPSNFLQPLPGWPVKYSC---ENFARSEITDARQAATALYQISNVYYNFTGNVATNCVN 351
Query: 345 EDRTGDE------SDEGWEWQSCSEMVVPMGKD--KNSMY--QPEPWN--LTKYIKNCKE 392
+ G+ +D GW WQ+C+E+V+ M + N + Q E +N + I C
Sbjct: 352 WNVCGESAIANLGADAGWSWQTCTELVLMMCSEGPPNDFFDNQCENYNGPVEIQIALCAA 411
Query: 393 QYGVSPRPSWVLTYYGGHDIKL---ILRRSTSNIIFSNGMRDPFSRGG-WVKTYHFFDLS 448
++ R W + + + + + SNIIF+NG DP+S GG + T + +
Sbjct: 412 EFT---RAGWNREFLDVNAVAIEYGFDYAAASNIIFTNGNLDPWSPGGVYANTPGIQEAT 468
Query: 449 ----FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
++ ++ +H LDL + DP + R I+ W+
Sbjct: 469 KNGIYTFLIDGSAHHLDLRQPNTCDPPSVKNARFQITNIIDCWV 512
>gi|17557063|ref|NP_498719.1| Protein PCP-5, isoform a [Caenorhabditis elegans]
gi|466050|sp|P34676.1|PCP5_CAEEL RecName: Full=Prolyl carboxy peptidase like protein 5; Flags:
Precursor
gi|351020562|emb|CCD62538.1| Protein PCP-5, isoform a [Caenorhabditis elegans]
Length = 507
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 40/478 (8%)
Query: 42 LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPI 101
E+ + +E + ID F++ + F RY +N + G PIL + G E +
Sbjct: 32 FEKSIGKYKYEEGYLKAPIDPFAFTND--LEFDLRYFLNIDHYETGG-PILFYTGNEGSL 88
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
+ + +GF ++ A + KA +V +EHR+YGKS PF + + + RH GY +S QALAD+
Sbjct: 89 EAFAENTGFMWDLAPELKAAVVFVEHRFYGKSQPF--KNESYTDIRHLGYLSSQQALADF 146
Query: 162 ASILLHIKDKY--NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDIT 219
A + K++ A + VIA G SYGG L+ WFR+KYPH+V G++A+SAP+ ++ D
Sbjct: 147 ALSVQFFKNEKIKGAQKSAVIAFGGSYGGMLSAWFRIKYPHIVDGAIAASAPVFWFTDSN 206
Query: 220 -PWTTYHSVVSKDFRDTSEEC-FQTIKKSWSEIDNIASKPDGLAILSKKFKT--CTPLKN 275
P Y +V++ F D C + I+K W +D +A G L+ +K + L+N
Sbjct: 207 IPEDVYDFIVTRAFLDAG--CNRKAIEKGWIALDELAKSDSGRQYLNVLYKLDPKSKLEN 264
Query: 276 TTE-------LKDGLDTVYSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCK 321
+ +++ ++ + Y P++ PVK C + ++ +
Sbjct: 265 KDDIGFLKQYIRESMEAM--AMVNYPYPTSFLSSLPAWPVKEACKSASQPGKTQEESAEQ 322
Query: 322 IAAGV-----VEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPM---GKDKN 373
+ V D + N ++C GD GW +Q+C+EMV+P+ G +
Sbjct: 323 LYKIVNLYYNYTGDKSTHCANAAKCDSAYGSLGDPL--GWPFQTCTEMVMPLCGSGYPND 380
Query: 374 SMYQPEPWNLTKYIKNCKEQY-GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
++ P+ KY + C + + + + + GG S SNI+FSNG DP
Sbjct: 381 FFWKDCPFTSEKYAEFCMQTFSSIHYNKTLLRPLAGGLAFGATSLPSASNIVFSNGYLDP 440
Query: 433 FSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+S GG+ + S L G+H DL A D + + + R E + ++ WI +
Sbjct: 441 WSGGGYDHSDKVQGSVISVILKQGAHHYDLRGAHPQDTEEVKKVRAMETQAIKKWIKE 498
>gi|17557065|ref|NP_498718.1| Protein PCP-5, isoform b [Caenorhabditis elegans]
gi|351020563|emb|CCD62539.1| Protein PCP-5, isoform b [Caenorhabditis elegans]
Length = 568
Score = 174 bits (440), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 134/478 (28%), Positives = 227/478 (47%), Gaps = 40/478 (8%)
Query: 42 LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPI 101
E+ + +E + ID F++ + F RY +N + G PIL + G E +
Sbjct: 93 FEKSIGKYKYEEGYLKAPIDPFAFTND--LEFDLRYFLNIDHYETGG-PILFYTGNEGSL 149
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
+ + +GF ++ A + KA +V +EHR+YGKS PF + + + RH GY +S QALAD+
Sbjct: 150 EAFAENTGFMWDLAPELKAAVVFVEHRFYGKSQPF--KNESYTDIRHLGYLSSQQALADF 207
Query: 162 ASILLHIKDKY--NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDIT 219
A + K++ A + VIA G SYGG L+ WFR+KYPH+V G++A+SAP+ ++ D
Sbjct: 208 ALSVQFFKNEKIKGAQKSAVIAFGGSYGGMLSAWFRIKYPHIVDGAIAASAPVFWFTDSN 267
Query: 220 -PWTTYHSVVSKDFRDTSEEC-FQTIKKSWSEIDNIASKPDGLAILSKKFKT--CTPLKN 275
P Y +V++ F D C + I+K W +D +A G L+ +K + L+N
Sbjct: 268 IPEDVYDFIVTRAFLDAG--CNRKAIEKGWIALDELAKSDSGRQYLNVLYKLDPKSKLEN 325
Query: 276 TTE-------LKDGLDTVYSEAAQYDTPSNI-------PVKRICNAIENAPNCGDDILCK 321
+ +++ ++ + Y P++ PVK C + ++ +
Sbjct: 326 KDDIGFLKQYIRESMEAM--AMVNYPYPTSFLSSLPAWPVKEACKSASQPGKTQEESAEQ 383
Query: 322 IAAGV-----VEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPM---GKDKN 373
+ V D + N ++C GD GW +Q+C+EMV+P+ G +
Sbjct: 384 LYKIVNLYYNYTGDKSTHCANAAKCDSAYGSLGDPL--GWPFQTCTEMVMPLCGSGYPND 441
Query: 374 SMYQPEPWNLTKYIKNCKEQY-GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
++ P+ KY + C + + + + + GG S SNI+FSNG DP
Sbjct: 442 FFWKDCPFTSEKYAEFCMQTFSSIHYNKTLLRPLAGGLAFGATSLPSASNIVFSNGYLDP 501
Query: 433 FSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+S GG+ + S L G+H DL A D + + + R E + ++ WI +
Sbjct: 502 WSGGGYDHSDKVQGSVISVILKQGAHHYDLRGAHPQDTEEVKKVRAMETQAIKKWIKE 559
>gi|255081901|ref|XP_002508169.1| predicted protein [Micromonas sp. RCC299]
gi|226523445|gb|ACO69427.1| predicted protein [Micromonas sp. RCC299]
Length = 590
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 133/456 (29%), Positives = 217/456 (47%), Gaps = 61/456 (13%)
Query: 90 PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVP-------------- 135
PI + G EA ++ + +G +E+A F A++V EHRYYG+S P
Sbjct: 140 PIFFYTGNEANVELYLNATGLMWEHAESFGAVLVFAEHRYYGESKPKPKEEDGNALDASN 199
Query: 136 -FGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWF 194
G LK Y S QA+ADYA+++ +K + A APV A G SYGG LATW
Sbjct: 200 LGGIIPGHLKKKGQYPYLTSEQAMADYATLIRELKAEIRAPDAPVFAFGGSYGGMLATWM 259
Query: 195 RLKYPHVVIGSLASSAPIL-YYGDITPWT--TYHSVVSKDFR---DTSEECFQTIKKSWS 248
RLKY +VV G++A SAP+ + G+ P + V+ D + C ++ +++
Sbjct: 260 RLKYANVVDGAVAGSAPVWSFVGEDPPVDPGAFADGVTMDATAAGGSPPACAPNVRAAFA 319
Query: 249 EIDNIASKPDGLAILSKKFKTC--TPLKNTTELKD-------GLDTV----YSEAAQY-- 293
E+ S+ D +I + C TPL + T++ D D + + A+ Y
Sbjct: 320 ELLR-RSETDPKSI-KAPMRLCDDTPLGSPTDVLDVALWAQGAFDYLAMGNFPYASSYIL 377
Query: 294 ---DTPSNIPVKRICNAIENAPNC----GDDILCKIAAGVVEADSLEYD-GNNSRCYINE 345
T P + C P GD +L +A +A + Y+ C+
Sbjct: 378 NGDGTLPPYPFRVACGGAMADPKLLNKGGDALLSALA----DAVGVYYNYSKTQECFDTR 433
Query: 346 DRTGDESDEG---WEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPS 401
+ D+SDE W++Q C+EM +PM +D M+ P+PWN T + C+ ++GV P+
Sbjct: 434 HGSNDDSDEDGELWDYQYCTEMFMPMSRDGVRDMFFPQPWNETDAVLECERRWGVRPKTL 493
Query: 402 WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLD 461
W T +GG + SN++++NG DP++ G ++ ++ L G+H LD
Sbjct: 494 WATTVFGGRRLSW-----ASNVVWTNGYLDPWAGLGVQESLSPSLVAMM--LPGGAHHLD 546
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKA 497
+ DP+ +V+ RKT++++++ WI Y A
Sbjct: 547 FMWSNDLDPEPVVEARKTQMRLLRQWILNKYQAVAA 582
>gi|281212302|gb|EFA86462.1| hypothetical protein PPL_00255 [Polysphondylium pallidum PN500]
Length = 482
Score = 171 bits (434), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 143/494 (28%), Positives = 228/494 (46%), Gaps = 65/494 (13%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
L+L +++++ L R R + R L + ETF++ Q +DHF P +
Sbjct: 7 LVLFIVYLAAVSAVINTPLYKR--RFAKERQLLPQPPLMSNETFWFTQLVDHFD--PNND 62
Query: 71 QTFPQRY-VINSKFWGGGNSPILAFLGAEAPID----DNIQLSGFTYENAHQFKALIVIL 125
+TF Q+Y VI+ F G G PI FL EAP+ +Q+ + A +F AL V+L
Sbjct: 63 ETFQQQYQVIDDYFDGTG--PIFFFLAGEAPMGFFNFQEVQI----WNWADKFNALYVVL 116
Query: 126 EHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGAS 185
EHR+YG S P + + N R Y S QALAD A+ L K + APV+ G S
Sbjct: 117 EHRFYGASNP--TNDFSTPNLR---YLTSQQALADAANFLTSFKAERGLESAPVVVFGCS 171
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
Y G L+ WFRLKYP +V+ S+A S P+L + +T Y+S S + +C +
Sbjct: 172 YSGALSAWFRLKYPQLVVASVAPSGPVLAQLN---YTGYYSQFSN---SAAPDCVAAAQT 225
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIP---VK 302
+ ++I + + G L+K F +C+ L+N +L L T+ D +N P +
Sbjct: 226 ATNQIMALTTSKSGRDQLAKTFNSCSNLENPRDLYYFLYTLTEALGSADQMNNPPTWGLN 285
Query: 303 RICNAIENAPNCGDDILCKIAAGVVEAD--SLEYDGNNSRCYINEDRTGDESDEGWEWQS 360
C + + D+ +A G SL+ ++ R ++D+ G S W WQ+
Sbjct: 286 TTCQTLTQTSSLLDNWAQIVAGGQTGCQDYSLKSFIDSMRKTNSKDQDGSRS---WLWQT 342
Query: 361 CSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY---GVSPRPSWVLTYYGGHDIKLILR 417
C E + P N+ + +K C+E + G++P +W +YYGG I+
Sbjct: 343 CVEFGYFSTTYPGTSVFPPTLNVEEQVKWCEEIFDIKGMTPNIAWTNSYYGGQQIQ---- 398
Query: 418 RSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL----------GSHCLDLDEAKK 467
SNI+F+NG+ DP +H ++ + NL HC L ++
Sbjct: 399 --GSNIMFTNGLLDP---------WHLLSVN---EPNLEGTVQAATYEAGHCGTLIQSTS 444
Query: 468 SDPDWLVQQRKTEV 481
DP L+ R ++
Sbjct: 445 IDPPSLIAARAQKL 458
>gi|216296557|gb|ACJ72063.1| prolylcarboxypeptidase [Trichoplax adhaerens]
Length = 254
Score = 169 bits (429), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 90/230 (39%), Positives = 130/230 (56%), Gaps = 5/230 (2%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
++T +++Q IDHF + +S T+ QRY++N W G PI + G E I Q SG
Sbjct: 13 YKTKYFDQIIDHFDW--KSNATYRQRYLMNDDHWDKGTGPIFFYTGNEGGIVGFWQNSGL 70
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
++ A QF+ALIV EHRYYGKS+PFG KN S QALADYA +L +K
Sbjct: 71 LFDLAPQFRALIVFGEHRYYGKSLPFGKDSFKPKNLE---LLTSEQALADYAVLLTSLKK 127
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
NA V+A G SYGG L W RLKYP+++ LA+SAP+ G + + V+K
Sbjct: 128 SLNANKCKVVAFGGSYGGMLTAWMRLKYPNIIDAGLAASAPLYMAGGVVSPNFFFPAVTK 187
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK 280
D++D + +C I+K++S + +A G ++K F C LK + ++K
Sbjct: 188 DYQDANPKCVPNIRKAFSAVLEMAKSKSGKQKVAKIFNVCNKLKTSADVK 237
>gi|313236608|emb|CBY11867.1| unnamed protein product [Oikopleura dioica]
Length = 471
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 131/470 (27%), Positives = 224/470 (47%), Gaps = 54/470 (11%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG-GNSPILAFLGAEAPIDDNIQLSG 109
+ET + +Q +D+F+Y + +T+ RY++N+ F +PI + G E PID +G
Sbjct: 23 YETKWIDQRVDNFNYYLD--KTYKMRYLVNTDFVKDEKTAPIFFYTGNEGPIDSFAANTG 80
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
F E A + A IV EHRYYG+S+P+G+ +N Y + ALAD+A +++ +K
Sbjct: 81 FMNEFAEEENAFIVYAEHRYYGQSLPYGNSSFTPEN---MAYLSVENALADFAQLIVELK 137
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
Y P+I G SYGG L+ + R+ YP++V G+LA+S+P+ + + + +
Sbjct: 138 KTYKG---PLICFGGSYGGLLSMYMRMTYPNLVNGALAASSPVYWISAMGDSHGFWVKTT 194
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL------------KNTT 277
+DF ++C TI+ ++ +D + + D A ++K +TC + +N
Sbjct: 195 EDFSTALDKCEDTIRAGFAALDKMKNDKD-WAGITKTMRTCQNITEDNYMHMLGWARNAM 253
Query: 278 ELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGN 337
+D Y + P N PVK C + G D + EA L Y+G
Sbjct: 254 ATMAMMDYPYPTNFEAALPGN-PVKESC--VRAVAETGAD-------SIREAAGLVYNGT 303
Query: 338 N----SRCY-INE------DRTGDESDE---GWEWQSCSEMVVPMGKD-KNSMYQPEPWN 382
+ +C+ I E D TG + W++Q C++ V+P G D K M+ ++
Sbjct: 304 DPSKYKQCFDIMEEYVYCSDPTGCGTGPQALAWDYQCCTQQVLPGGTDGKTDMFPLIKFD 363
Query: 383 LTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
+ C + +GV P W+ Y +++ +TSN IFSNG DP+ GG
Sbjct: 364 VDDRAAYCNKTWGVVPDRDWLRIKYWADNLE-----ATSNTIFSNGDLDPWGPGGVTHDL 418
Query: 443 HFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
DL + ++ G+H DL + D ++ R+ ++ W+ Q+Y
Sbjct: 419 R-HDLP-APLVHGGAHHYDLRGSNSGDTQDVLNVRQFHRDTIRDWMAQFY 466
>gi|414880844|tpg|DAA57975.1| TPA: hypothetical protein ZEAMMB73_592594 [Zea mays]
Length = 761
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 85/178 (47%), Positives = 112/178 (62%)
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
K + + +HRYY +S+PFGS+ A +++ Y + QALAD+A L +K +A +P
Sbjct: 525 KTMEKLSKHRYYRESMPFGSKAKAYIDSKSLAYLTAKQALADFAVQLTDLKRNLSAEGSP 584
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
V+ G SYGG LA W RLKYPH+ IG+LASSAPIL + DI P T ++ +VS DFR S
Sbjct: 585 VVLFGDSYGGMLAAWIRLKYPHIAIGALASSAPILQFEDIVPSTIFYDLVSDDFRRESLS 644
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTP 296
CF IK SW E+D+ A+K DGL LSK F C LK + +L D L + YS A D P
Sbjct: 645 CFLKIKDSWKELDDQANKQDGLLKLSKTFHLCQTLKTSGDLSDWLSSAYSYLAMVDYP 702
>gi|156348420|ref|XP_001621842.1| hypothetical protein NEMVEDRAFT_v1g990 [Nematostella vectensis]
gi|156208137|gb|EDO29742.1| predicted protein [Nematostella vectensis]
Length = 283
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 137/230 (59%), Gaps = 7/230 (3%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F+T + QT+DHF++ TF QRY+ K+W G PI + G E I + SGF
Sbjct: 1 FKTGTFEQTVDHFNFIQSG--TFKQRYLYTEKYWDG-KGPIFFYSGNEGGITGFWENSGF 57
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+E A F AL++ EHRYYG+S+PFG ++N GY + QALAD+A+++ +K
Sbjct: 58 VFEAAKNFSALVIFGEHRYYGESLPFGQDSFKIENI---GYLSIEQALADFATLIPALKK 114
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
++ A PV++ G SYGG L+ + R KYP+V+ +LA+SAPI + D++ + V++
Sbjct: 115 QFKAEEKPVVSFGGSYGGMLSAYLRFKYPNVIQAALAASAPIYFIADLSIRDFFFPAVTR 174
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKP-DGLAILSKKFKTCTPLKNTTEL 279
DF++ +C ++ + E+DN+ + GL +SK FK C PLK+ ++
Sbjct: 175 DFKNADPKCPDLVRAGFIELDNLKKEGLKGLDAISKAFKLCKPLKSADQI 224
>gi|328874647|gb|EGG23012.1| Putative serine protease [Dictyostelium fasciculatum]
Length = 490
Score = 165 bits (417), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 141/504 (27%), Positives = 223/504 (44%), Gaps = 62/504 (12%)
Query: 15 VIFVSTSFHA-NGLKLRPRLGRIR----RSRILEQKDSNHGFETFFYNQTIDHFSYGPES 69
V+ S++F A + LR +R R+ ++EQ+ S +F +Q +DHF P +
Sbjct: 17 VVICSSAFLAVESMPLRNEQSLVRGLRHRNVMIEQRRSPSPLALWF-DQQVDHFD--PLN 73
Query: 70 YQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEHR 128
TF Q+Y IN +W G P+ LG E PI F A F ALIV EHR
Sbjct: 74 QDTFKQQYFINDTYWRPG-GPVFFVLGGEGPISPGYVNGHFVVNTYAQLFDALIVACEHR 132
Query: 129 YYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGG 188
+YG S P + + +H + QALADYA+ I KYN + I+ G SY G
Sbjct: 133 FYGYSSPHPTL-----DTKHLHLLTTEQALADYANFRQFIAAKYNTGSSKWISFGGSYSG 187
Query: 189 ELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWS 248
L+ W RLKYP ++ G++A+SAP+ D +T Y VVS C +K
Sbjct: 188 SLSAWLRLKYPQLIDGAIATSAPVEAQLD---FTQYLEVVSASI---GPACSAIVKNVTQ 241
Query: 249 EIDNIASKPDGLAILSKKFKTCTPLKNTTELK---DGLDTVYSEAAQYDTPSN----IPV 301
+ + + + S F TC P+ + ++ + L + SE QY+ +N +
Sbjct: 242 IVTQMIANGQTSQVES-LFNTCDPISSELDIATFMESLTSAVSEIVQYNNDNNNYSFANI 300
Query: 302 KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTG----------DE 351
+C+ + G++ L A + +++G+N E G +
Sbjct: 301 TTMCDMLSK----GNNQLQAFAD--LNNKYNDFNGDNCTTSSYEKMIGQMQETQVNGPNA 354
Query: 352 SDEGWEWQSCSEMVVPMGKDKNSMYQP--EPWNLTKYIKNCKEQYG-----VSPRPSWVL 404
+ W WQ C+E + S QP + L +I+ C + +G P W++
Sbjct: 355 ATRLWTWQCCTEYA--YFQTGQSALQPFSDTLTLDYFIQQCTDTFGPPGYTYQPNIDWII 412
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDE 464
YGG +I+ TS IF NG+ DP+ G + T ++ ++ G+HC DL
Sbjct: 413 NEYGGKNIQ------TSQTIFPNGLVDPWHVLGVMNTTS--SSVYTITISTGAHCSDLYP 464
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWI 488
+D D LV R+ E+ ++ I
Sbjct: 465 PLPTDSDDLVLARRMEIDLISTVI 488
>gi|224058955|ref|XP_002299662.1| predicted protein [Populus trichocarpa]
gi|222846920|gb|EEE84467.1| predicted protein [Populus trichocarpa]
Length = 153
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 78/133 (58%), Positives = 97/133 (72%), Gaps = 3/133 (2%)
Query: 364 MVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNI 423
MV+P+G NSM++P+P++L YI+ CK YGV PRP WV TYYGGHDIKLIL+R SNI
Sbjct: 1 MVIPLGVGDNSMFEPDPFDLKDYIERCKSLYGVRPRPHWVTTYYGGHDIKLILQRFGSNI 60
Query: 424 IFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKS-DPDWLVQQRKTEVK 482
IFSNG+RDP+S GG + + D + GSHCLD+ A S DPDWLV QRK EV+
Sbjct: 61 IFSNGLRDPYSSGGVLN--NISDSIVAVTTVNGSHCLDIQRANPSTDPDWLVMQRKKEVE 118
Query: 483 IMQGWITQYYDDF 495
I++GWITQYY+D
Sbjct: 119 IIEGWITQYYEDL 131
>gi|323450637|gb|EGB06517.1| hypothetical protein AURANDRAFT_71973 [Aureococcus anophagefferens]
Length = 939
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 139/436 (31%), Positives = 219/436 (50%), Gaps = 45/436 (10%)
Query: 57 NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAH 116
+Q +DHF + + TF QR +++ W G PI + G E + + +G +E+A
Sbjct: 40 SQNLDHFDF--TTNATFEQRVFVHADHWSPGG-PIFVYCGNEDDVTLYVNATGLMWEHAA 96
Query: 117 QFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATH 176
F A++V +EHRYYG+++PFG+ A+ + R R Y + QALAD + L IK Y A +
Sbjct: 97 AFGAMLVFVEHRYYGETLPFGA--ASFEPGRLR-YLSHEQALADLVNALRRIKATYGAEN 153
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYY-GDITPWTTYHSVVSKD---F 232
A +A G SYGG LA W R+KYP V+G++A+SAPIL + GD Y VV++D
Sbjct: 154 AKTVAFGGSYGGMLAAWLRMKYPAAVVGAVAASAPILAFDGDGFDGEAYWEVVTRDATAA 213
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD----GLDTV-- 286
+ C +++++S + A + D LS+ F+TC P+ + + L DT+
Sbjct: 214 AGAAPACAANVREAFSALFR-ADRDD----LSRIFRTCGPVADRSRLALLALFAFDTMAM 268
Query: 287 --YSEAAQYDTPSNI-----PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS 339
Y + Y T + PV+ C + P GD+ + A + A + Y+ + +
Sbjct: 269 GNYPYESTYLTHGEVALPAFPVRAACEHLAG-PLDGDE---ALLAALAAAAGVFYNASGA 324
Query: 340 RCYINEDRTGDESDEGWEWQSCSEMV-----VPMGKDKNSMYQPEPWNLTKYIKNCKEQY 394
NE E D W+WQ C+E + P ++ M+ P P N + +C+ +Y
Sbjct: 325 -LACNELPADVEEDGIWDWQYCTETLPQETYFPRDGVRD-MFWPAPANDSWVDAHCEAKY 382
Query: 395 GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
GV+PR W+ YGG + +NI+FSNG DP+S G + + ++
Sbjct: 383 GVAPRRRWIADSYGGRAGVA----AATNIVFSNGALDPWSAG--GVADAAGGATETVRID 436
Query: 455 LGSHCLDLDEAKKSDP 470
LG+H LDL A DP
Sbjct: 437 LGAHHLDLMFAHPDDP 452
>gi|330799265|ref|XP_003287667.1| hypothetical protein DICPUDRAFT_151801 [Dictyostelium purpureum]
gi|325082345|gb|EGC35830.1| hypothetical protein DICPUDRAFT_151801 [Dictyostelium purpureum]
Length = 467
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 128/462 (27%), Positives = 210/462 (45%), Gaps = 65/462 (14%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF- 110
+TF+++Q IDH+ + + T+ Q+Y++ ++ G + PI +L EAP+ GF
Sbjct: 46 QTFWFDQKIDHYDFFNNN--TYKQQYIVVDDYFDG-SGPIFIYLAGEAPMG----FFGFQ 98
Query: 111 ---TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
E A QF AL +++EHR+YGKS P ++ + N + Y S QALAD A+ L
Sbjct: 99 EVQVVEWAKQFGALFIVIEHRFYGKSYP--TQDLSTNNLK---YLTSQQALADAANFLST 153
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
K + + P + G SY G L++WFRLKYP + I S+A S P+L + T + S
Sbjct: 154 YKAENDLVENPTVVFGCSYSGALSSWFRLKYPQLAIASIAPSGPVLAQLNFTGYYAQFS- 212
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
+ C Q + + ++I + G+ L K F +C L+N +L L T+
Sbjct: 213 -----NSAAPNCVQAAQTATNQIMQLTKSSAGIKQLEKTFNSCHSLENPRDLYYFLYTLT 267
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCG----DDILCKIAAGVVEADSLEYDGNNSRCYI 343
D +N P I N+ CG +D L A +V A + +I
Sbjct: 268 EALGSADQMNNPPT-WILNS-----TCGTFLQNDNLLTNWAQIVNAGQTGCNDYRLSTFI 321
Query: 344 NEDR----TGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY---GV 396
+ R + + W +Q+C E + + P N+ + +K C+E + G+
Sbjct: 322 EQMREIKISQQDGSRSWVYQTCVEFGYFSTTYEGTSVFPPTLNVEEQVKWCEEIFDVPGM 381
Query: 397 SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
+P W YYGG + + +N++F+NG+ DP +H LS + D G
Sbjct: 382 TPNIDWTNAYYGGQNTQ------ATNVMFTNGLLDP---------WHL--LSVNSDNEAG 424
Query: 457 S---------HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
+ HC L + DP LV R+ V ++G ++
Sbjct: 425 TVRAATYEAGHCASLIQETSEDPISLVNAREEVVSFLKGVLS 466
>gi|393909177|gb|EJD75348.1| serine carboxypeptidase S28 family protein [Loa loa]
Length = 542
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 134/479 (27%), Positives = 210/479 (43%), Gaps = 69/479 (14%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
IDHF+Y + TF +Y++N ++ + P+ + G E I+ Q++G ++ A F
Sbjct: 18 IDHFNY--RNLDTFGLKYLVNYSYFNC-DGPLFFYAGNEGDIETFAQMTGIMWDLAPLFN 74
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN------ 173
A IV EHRYYG+S PFG R + + GY N QALAD+A ++ +K
Sbjct: 75 AAIVFAEHRYYGESQPFGKR--SYMDVLRLGYLNEIQALADFAELISFLKTDQKELGFCP 132
Query: 174 -ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI-LYYGD-ITPWTTYHSVVSK 230
T PVI G SYGG LA W R+KYPH+V G+ ASSAP+ ++YG I P + ++ +
Sbjct: 133 MGTEIPVIVFGGSYGGMLAAWLRMKYPHIVDGAWASSAPLRIFYGTGINPESVSRTITTN 192
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKF--KTCTPLKNTTELKDGLDTVYS 288
+ + + I+ ++ +G L++ F K +K+ + +YS
Sbjct: 193 YLTSGCDR--KVFSDGFVAIEKMSKTEEGRMKLNRIFHAKPGFEMKSYNDFMSLYSYIYS 250
Query: 289 -----EAAQYDTPSNI-------PVKRICN----AIENAPNCGDDILCKIAAGVVEADSL 332
Y P++ PVK +C A N N + I I L
Sbjct: 251 AIFYMAMTDYPYPADFFEPLPGYPVKYVCQYAKKAATNEENLAEQIYSIINVYYNYTGQL 310
Query: 333 EYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPM---GKDKNSMYQP-----EPWNLT 384
+ S C D+ D W WQSC+ + + + G D + +P ++
Sbjct: 311 TDNCFTSNCTTPSPIQNDDEDIAWNWQSCTSLTIQICDRGGDNDFFLNTCDNSGDP--VS 368
Query: 385 KYIKNCKEQYGVSPRPSWVLTYYGGHDIKL---ILRRSTSNIIFSNGMRDPFSRGGWVK- 440
IK C E + + +Y HD+ + ++ +TSNIIFSNG DP+S GG +
Sbjct: 369 TNIKLCTELF---KDIGYNNNFYKLHDVTIRYGMIYNTTSNIIFSNGNLDPWSAGGVYEN 425
Query: 441 -----------TYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
Y F+ L +H LD DP + +R V I++ W+
Sbjct: 426 SPGIMEAMKNGVYIFYMLG-------AAHHLDFRTPNTCDPPSVTHERFQVVNIIKCWV 477
>gi|62089160|dbj|BAD93024.1| Dipeptidyl-peptidase II precursor variant [Homo sapiens]
Length = 377
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 132/236 (55%), Gaps = 4/236 (1%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
++ + GF+ F+ Q +DHF++ +TFPQR++++ +FW G PI + G E +
Sbjct: 23 RRAPDPGFQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGEGPIFFYTGNEGDVWA 82
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
SGF E A + AL+V EHRYYGKS+PFG++ H QALAD+A
Sbjct: 83 FANNSGFVAELAAERGALLVFAEHRYYGKSLPFGAQS---TQRGHTELLTVEQALADFAE 139
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+L ++ A AP IA G SYGG L+ + R+KYPH+V G+LA+SAP+L +
Sbjct: 140 LLRALRRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQ 199
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTEL 279
+ V+ DF S +C Q +++++ +I ++ + + +F TC PL + +L
Sbjct: 200 FFRDVTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDL 254
>gi|390365524|ref|XP_783661.3| PREDICTED: lysosomal Pro-X carboxypeptidase-like
[Strongylocentrotus purpuratus]
Length = 622
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 98/319 (30%), Positives = 168/319 (52%), Gaps = 28/319 (8%)
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
LA W R+KYP+ + G++A+SAP+ + +TP T + +SKDF+ ++ C+ ++ SW
Sbjct: 312 LAAWMRIKYPNAIAGAIAASAPVWQFTGLTPCNTQYLTISKDFQAANQLCYDSVHMSWDV 371
Query: 250 IDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEA------AQYDTPSN----- 298
I I G L++ K C PLK T ++ DGL + + + Y P+N
Sbjct: 372 ITRIGQTASGRTKLAQAMKLCNPLKTTADV-DGLISWLAGSWFNLAMVDYPYPANFLEPL 430
Query: 299 --IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS-RCY-INEDRTGDESDE 354
P+K +C+ + D +L ++ + + Y+ +S +C+ +++D T D
Sbjct: 431 PAFPIKEVCSYFKTPSPTDDQLLAELTGAL----GVYYNYTSSIQCFNLSQDATASLGDL 486
Query: 355 GWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIK 413
GW +Q+C+EMV+P D N M+ PWN + CK Q+ V+PRP+W+++ +GG +I
Sbjct: 487 GWSFQACTEMVMPFCADGVNDMFYSMPWNYDAQVAACKAQWNVTPRPNWIVSQFGGKNIT 546
Query: 414 LILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWL 473
++SNI FSNG+ DP+ GG + D + + G+H LDL K DP +
Sbjct: 547 -----ASSNIFFSNGLLDPWHLGGVLTDLS--DTLVAGIIPDGAHHLDLRGKNKLDPPSV 599
Query: 474 VQQRKTEVKIMQGWITQYY 492
+ R E + + WI +++
Sbjct: 600 IAVRNQERENINRWIAEWW 618
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 71/200 (35%), Positives = 112/200 (56%), Gaps = 9/200 (4%)
Query: 9 QWL--LLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYG 66
+WL LM + V + F ++ L R + + + +H + ++ Q +DHFS+
Sbjct: 6 RWLRCTLMCLVVCSGFSSSACVLFGRRINNKLEAKISSQGCSHPHKEEYFEQQVDHFSF- 64
Query: 67 PESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILE 126
+ TF RY+++ + W G PI + G E I Q +GF ++ A ++KA+++ E
Sbjct: 65 -TNSDTFQMRYLVSDELWTKG-GPIFFYTGNEGDITWFCQNTGFVWDLAVEYKAIVIFAE 122
Query: 127 HRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK--DKYNATHAPVIAIGA 184
HRYYGKS+P+G+ + K+A H GY + QALAD+A L K + A +PV+A G
Sbjct: 123 HRYYGKSLPYGND--SYKDAAHLGYLTAEQALADFAVFLDWYKANTRGGAAGSPVVAFGG 180
Query: 185 SYGGELATWFRLKYPHVVIG 204
SYGG LA W R+KYP+ + G
Sbjct: 181 SYGGMLAAWMRIKYPNAIAG 200
>gi|110749179|ref|XP_623676.2| PREDICTED: putative serine protease K12H4.7-like [Apis mellifera]
Length = 478
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 142/493 (28%), Positives = 214/493 (43%), Gaps = 65/493 (13%)
Query: 22 FHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINS 81
FH GL+ L + + I E + Q +DHF+ P +T+ RY+ NS
Sbjct: 25 FHFRGLEEPQSLDKAIQENITEA----------WIQQPLDHFN--PRDNRTWSMRYLENS 72
Query: 82 KFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKA 141
+F+ N PIL +G E I +G YE A A + EHRYYGKS P
Sbjct: 73 RFFKE-NGPILIMIGGEWAISKGFLRAGLMYELASNHSASMYYTEHRYYGKSKPTND--- 128
Query: 142 ALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHV 201
++R+ Y + QALAD A + K + ++ VI G SY G +A+W RLKYPH+
Sbjct: 129 --TSSRNLQYLSVDQALADLAYFIKTKKKDESRRNSTVIVFGGSYAGNVASWARLKYPHL 186
Query: 202 VIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLA 261
+ G+LASSAP+L D + Y+ VV++ R SE+C + IK ++ E++ + +G
Sbjct: 187 IQGALASSAPVLAKLD---FNEYYEVVTESLRRYSEKCVEEIKTAFDEVEELLYIENGPQ 243
Query: 262 ILSKKFKTC-----TPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIEN--APNC 314
L + F C + L ++ QYD N K I + EN A
Sbjct: 244 RLKQYFNLCDVPNIKSFNDLAHFGSLLAESFASVVQYDKVENGRTK-IASCCENMTATYL 302
Query: 315 GDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESD--EGWEWQSCSEMVVPMGKDK 372
G L ++A V D + + + + T ++SD W +Q+C+E D
Sbjct: 303 GSP-LQRLAHFVSSKDKCLKNNYDKFVTLYRNETWNQSDIMRQWYYQTCTEYGYYQTTDS 361
Query: 373 NSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIF 425
+ L + C++ YG ++ R YGG LR N+IF
Sbjct: 362 TRSIFGSLFPLPYFTNICQDLYGEYYNRDFLNNRIKRTNMMYGG------LRPDLRNVIF 415
Query: 426 SNGMRDPFSRGGWVKTYHFFDLSFSQDLNL---------GSHCLDLDEAKKSDPDWLVQQ 476
+NG DP+ LS QDLN SHC DL +D + L++
Sbjct: 416 TNGDVDPWHA-----------LSVLQDLNAFSPAVLIKGSSHCRDLYSDSNTDAEDLIRA 464
Query: 477 RKTEVKIMQGWIT 489
R +I+ WI+
Sbjct: 465 RVRIREIIGSWIS 477
>gi|332245708|ref|XP_003271995.1| PREDICTED: thymus-specific serine protease isoform 1 [Nomascus
leucogenys]
Length = 514
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 129/469 (27%), Positives = 207/469 (44%), Gaps = 62/469 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W G + PI LG E + + G
Sbjct: 61 WLEQLLDPFNV--SDRRSFLQRYWVNDQHWVGQDGPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L+ A+ R + +S ALAD S L + +N
Sbjct: 119 APAWGALVISLEHRFYGLSIPAG----GLEMAQLR-FLSSRHALADVVSARLALSRLFNV 173
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LA W RLK+PH++ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 174 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD---FSEYNDVVSRSLM 230
Query: 234 DT----SEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDT 285
T S EC + +++E++ + S A L + C PL +N EL L
Sbjct: 231 STEIGGSLECRAAVSVAFAEVERRLRSGGAAQAALRTELSVCGPLGRAENQAELLGALQA 290
Query: 286 VYSEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYD 335
+ QYD + +P V+++C N + P CG KI + L +
Sbjct: 291 LVGGVVQYDGQAGVPLSVRQLCGLLLGGGGNRSHSTPYCGLRRAVKIVLHSLGQKCLSFS 350
Query: 336 GNNSRCYI--NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
+ + E + D W +Q+C+E + + + L + C++
Sbjct: 351 RAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCENPRCPFSQLPALPSQLDLCEQV 410
Query: 394 YGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+G+S + +YYGG + ++F NG DP +H LS
Sbjct: 411 FGLSALSVAQAVAQTNSYYGGQT------PGANQVLFVNGDTDP---------WHV--LS 453
Query: 449 FSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+Q L GSHCLD+ + SD L Q R+ + +Q W+
Sbjct: 454 VTQALGSSESALLIRTGSHCLDMAPERPSDSPSLRQGRQNIFRQLQTWL 502
>gi|326930125|ref|XP_003211202.1| PREDICTED: dipeptidyl peptidase 2-like [Meleagris gallopavo]
Length = 432
Score = 159 bits (402), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 122/432 (28%), Positives = 196/432 (45%), Gaps = 60/432 (13%)
Query: 81 SKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRK 140
+KFW G PI + G E I + S F +E A Q +AL++ EHRYYGKS+PFG
Sbjct: 16 AKFWKKGFGPIFFYTGNEGDIWTFAENSDFIFELAEQQQALVIFAEHRYYGKSLPFGLES 75
Query: 141 AALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPH 200
+KN QALADYA ++ +K +Y A PVIA G SYGG L+ + R+KYP+
Sbjct: 76 MQIKNTH---LLTVEQALADYAVLITELKQQYGAAGCPVIAFGGSYGGMLSAYLRMKYPN 132
Query: 201 VVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI---------D 251
VV G+LA+SAP+L + T + V+ DF+ + C +++++ +I D
Sbjct: 133 VVDGALAASAPVLSVAGLGDPTQFFRDVTADFQKSIPGCVTAVRRAFQQIRDLFLSGAYD 192
Query: 252 NIASKPDGLAILSKK---FKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAI 308
I+SK +S K ++ +N + +D Y P+N PVK C I
Sbjct: 193 EISSKMATCNKISSKKDVYQLFGFARNAFTMIAMMDYPYKTDFMGHLPAN-PVKVGCEQI 251
Query: 309 ENAPNCGDDILCKIAAGVVEADSLEYDGNNS-RCY---------INEDRTGDESD-EGWE 357
D + +AA V + Y+ + S +CY + G +D E W+
Sbjct: 252 L----AHTDPIQGLAALV----GVFYNSSGSVQCYDVYQLYRPCADPTGCGTGADAEAWD 303
Query: 358 WQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLIL 416
+Q+C+E+ + + M+ P+ + C ++ V PR W+ + G D + +
Sbjct: 304 YQACTEINLTFNSNNVTDMFPEMPFTEAMREQYCWSRWRVRPRAQWLQINFWGGDQAIAI 363
Query: 417 RRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQ 476
S S + + ++ G+H LDL +DP +++
Sbjct: 364 NSSLSPSLTAVTIQG------------------------GAHHLDLRGHNPADPPSVIEA 399
Query: 477 RKTEVKIMQGWI 488
RK E I+ W+
Sbjct: 400 RKLEASIISSWV 411
>gi|17556861|ref|NP_498688.1| Protein PCP-1 [Caenorhabditis elegans]
gi|21431885|sp|P34610.2|PCP1_CAEEL RecName: Full=Putative serine protease pcp-1; Flags: Precursor
gi|351020794|emb|CCD62762.1| Protein PCP-1 [Caenorhabditis elegans]
Length = 565
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 149/518 (28%), Positives = 237/518 (45%), Gaps = 93/518 (17%)
Query: 37 RRSRI---LEQKDSNH-----GFETFFY-NQTIDHFSYGPESYQTFPQRYVINSKFWGGG 87
RRSR+ L QK SN+ +T +Y N +DHF++G +TF R + N+ F+ G
Sbjct: 19 RRSRLFKKLYQKASNYDAAPSNVQTVWYKNMKLDHFTWGDT--RTFDMRVMWNNTFYKPG 76
Query: 88 NSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRK-AALKNA 146
PI + G E ++ + +G ++ A F A I+ EHR+YG++ PFG++ A+L N
Sbjct: 77 -GPIFFYTGNEGGLESFVTATGMMFDLAPMFNASIIFAEHRFYGQTQPFGNQSYASLANV 135
Query: 147 RHRGYFNSAQALADYASILLHIKDKYN------ATHAPVIAIGASYGGELATWFRLKYPH 200
GY S QALADYA +L +K N VI+ G SYGG L+ WFR KYPH
Sbjct: 136 ---GYLTSEQALADYAELLTELKRDNNQFKMTFPAATQVISFGGSYGGMLSAWFRQKYPH 192
Query: 201 VVIGSLASSAPILYY--GDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPD 258
+V G+ A SAP++Y G + P + + S+ + D F + +W+ N++S
Sbjct: 193 IVKGAWAGSAPLIYMNGGGVDP-GAFDHITSRTYIDNGCNRF-ILANAWNATLNLSSTDA 250
Query: 259 GLAILSK----KFKTCTPLKNTTE-------LKDGLDTVYSEAAQYDTPSNI-------P 300
G L+ K T ++N T+ L++ ++ Y Y P+ P
Sbjct: 251 GRQWLNNNTVFKLDPRTKIRNQTDGWNLNAYLREAIE--YMAMVDYPYPTGFLEPLPAWP 308
Query: 301 VKRICNAIE-NAPNCGDDILCKIAAGVVEADSLEYDGNN----------SRCYINEDRTG 349
V C + N + D L K V A ++ Y+ N S C
Sbjct: 309 VTVACGYMNANGTSFSDKDLVK---AVANAANIYYNYNRDPNFTYCIDFSICGDQGTGGL 365
Query: 350 DESDEGWEWQSCSEMVVPM--------------GKDKNSMYQPEPWNLTKYIKNCKEQYG 395
+ GW WQ CSE+++ M GKD +YQ L + + + G
Sbjct: 366 GGDELGWPWQECSEIIMAMCASGGSNDVFWNECGKD---IYQ----TLQQGCVSIFKSMG 418
Query: 396 VSPRPSW----VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGW-VKTYHFFDLSFS 450
+P+ +W V T Y G+D+ +SN+I + G DP+S GG+ V + +
Sbjct: 419 WTPK-NWNIDAVKTLY-GYDLS-----GSSNLILTQGHLDPWSGGGYKVDQNNAARGIYV 471
Query: 451 QDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
++ +H LDL + DP+ + R ++I++ W+
Sbjct: 472 LEIPGSAHHLDLRQPNTCDPNTVTNARFQIIQILKCWV 509
>gi|268575348|ref|XP_002642653.1| C. briggsae CBR-PCP-1.2 protein [Caenorhabditis briggsae]
Length = 564
Score = 159 bits (401), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 141/510 (27%), Positives = 233/510 (45%), Gaps = 75/510 (14%)
Query: 37 RRSRILEQ---KDSNHGFE--------TFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG 85
RRSR+ ++ K N+ E ++ N +DHF++G +TF R + N+ F+
Sbjct: 15 RRSRLFQKFYKKPVNYDAEIIPKNIEVVWYKNMRLDHFTWGDT--RTFDMRIMWNNTFYQ 72
Query: 86 GGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKN 145
G PI + G E + +G ++ A F A I+ EHR+YG + PFG++ A N
Sbjct: 73 PG-GPIFFYTGNEGAVSTFEVATGMMFDLAPMFNASIIFAEHRFYGATQPFGNQSYA--N 129
Query: 146 ARHRGYFNSAQALADYASILLHIKDKYNA------THAPVIAIGASYGGELATWFRLKYP 199
+ GY S QALADYA +L +K N + VI+ G SYGG L+ WFR KYP
Sbjct: 130 LANVGYLTSEQALADYAELLTELKRDNNQFGKTFHRDSQVISFGGSYGGMLSAWFRQKYP 189
Query: 200 HVVIGSLASSAPILYY--GDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKP 257
H+V G+ A SAP++Y G + P + ++ S+ + + F + +W+ + N++S
Sbjct: 190 HIVKGAWAGSAPLIYMHDGGVDP-GAFDNITSRTYVENGCNRF-ILANAWNAVLNLSSTD 247
Query: 258 DGLAILSK----KFKTCTPLKNTTE-------LKDGLDTVYSEAAQYDTPSNI------- 299
G A L+ K TP+ N T+ +++ ++ Y Y P+
Sbjct: 248 SGRAWLNNNPVFKLDPRTPINNQTDGWNLNAYMREAIE--YMAMVDYPYPTGFLEPLPGW 305
Query: 300 PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY--------INEDRTGDE 351
PV C + D + V A ++ Y+ N + + + TG
Sbjct: 306 PVTVACGYMNATGESFSDQ--DLVTAVANAANVYYNYNQNANFTWCIDFNICGDQGTGGL 363
Query: 352 SDE--GWEWQSCSEMVVPM---GKDKNSMYQPEPWNLTKYIK-NCKEQYGVS--PRPSW- 402
D+ GW WQ CSE+++ M G + + N+ +K C +G +W
Sbjct: 364 GDDALGWPWQECSEIIMAMCASGGANDVFWSECGDNIYDTLKQGCVSIFGSMKWTTANWN 423
Query: 403 ---VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGW-VKTYHFFDLSFSQDLNLGSH 458
V T Y G+D+ +SN+I + G DP+S GG+ V + + ++ +H
Sbjct: 424 IDAVKTLY-GYDLS-----GSSNLILTQGHLDPWSGGGYKVDQTNTARGIYVMEIPGSAH 477
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
LDL + DP+ +V R V+I++ W+
Sbjct: 478 HLDLRQPNTCDPNTVVNARYQIVQILKCWV 507
>gi|440803021|gb|ELR23935.1| serine carboxypeptidase [Acanthamoeba castellanii str. Neff]
Length = 481
Score = 158 bits (400), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 150/519 (28%), Positives = 225/519 (43%), Gaps = 80/519 (15%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
++L+V F+ F A R + R EQ +Y Q +DHF+ G E+
Sbjct: 7 VVLLVAFLG-HFAAEAYIARSSASSVGRDLPKEQ----------WYTQRLDHFN-GQET- 53
Query: 71 QTFPQRYVINSKFWG-GGNSPILAFLGAEAPI--DDNIQLSGFTYENAHQFKALIVILEH 127
+T+ QRY IN FW PI +G E + +D + L Y H AL+V LEH
Sbjct: 54 RTWKQRYFINDTFWNPSAPGPIFFQMGGEGAVSGEDVVLLQMVQYGIKH--GALMVTLEH 111
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYG 187
R+YG S P ++++ R + +S QALAD A LL +KD+Y A +P+I G SY
Sbjct: 112 RFYGTSQPL--PDLSIESLR---FLSSEQALADAAEFLLWLKDQYQAPKSPIITFGCSYP 166
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR-DTSEECFQTIKKS 246
G LA WFRLKYPHV S+ASSAP+ D + Y VV + ++C IK++
Sbjct: 167 GALAAWFRLKYPHVTYASVASSAPVEATLD---FFEYLDVVDQSLEYFVGDKCVANIKQA 223
Query: 247 WSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD---GLDTVYSEAAQYDTPSNIPVKR 303
+ + + + P G A L F C P++N ++ + L + QY+ + P+
Sbjct: 224 TTAVSQLMASPGGRAKLQSLFNFCGPIQNELDIANFYSSLAGNWMGTVQYNDENGNPLDV 283
Query: 304 ICNAIENAPNCGDDILCKIAA--GV------VEADSLEYDGNNSRCY-------INEDRT 348
I LCKI GV V ++ + C I + R
Sbjct: 284 I-------------YLCKIMTQPGVDPLTAYVNISNIFLRSQDQSCLDVSYADAIAQLRD 330
Query: 349 GDESDEG-----WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGV--SPRPS 401
+ G W +Q+C+E D + + L + C++ +G+ PR +
Sbjct: 331 TSAAAAGVGIRQWVYQTCTEFGYFQTSDSDGQPFGDGMPLKFSLDQCRDAFGLIDPPRIN 390
Query: 402 WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL-----NLG 456
YGG ++ SNI+F NG DP+ K S S L N
Sbjct: 391 ATNHIYGGRNLP---AWGPSNILFVNGNIDPWHALSITK-------SISPSLTTVFINGT 440
Query: 457 SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
+HC ++ A ++DP LVQ RK + W+ Q F
Sbjct: 441 AHCANVLPAHENDPPSLVQARKDIQAQIDQWLAQAKRQF 479
>gi|341900697|gb|EGT56632.1| hypothetical protein CAEBREN_05149 [Caenorhabditis brenneri]
Length = 568
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 138/505 (27%), Positives = 229/505 (45%), Gaps = 67/505 (13%)
Query: 37 RRSRILEQ---KDSNH-----GFETFFY-NQTIDHFSYGPESYQTFPQRYVINSKFWGGG 87
RRSR+L + K +N+ ET +Y +DHF++G +TF R + N+ F+ G
Sbjct: 21 RRSRLLRKLYKKSTNYDTAPNNVETVWYKGMRLDHFTWGDT--RTFDLRVMWNNTFYKEG 78
Query: 88 NSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNAR 147
PI + G E ++ + +G ++ A F A I+ EHR+YG++ PFG K + KN
Sbjct: 79 -GPIFFYTGNEGGLESFEKATGMMFDLAPMFNAAIIFAEHRFYGQTQPFG--KDSYKNLA 135
Query: 148 HRGYFNSAQALADYASILLHIKDKYN------ATHAPVIAIGASYGGELATWFRLKYPHV 201
+ GY S QALADYA +L +K N + PVI+ G SYGG L+ WFR KYPH+
Sbjct: 136 NIGYLTSEQALADYAELLTELKRDNNRMGKTFSQDTPVISFGGSYGGMLSAWFRQKYPHL 195
Query: 202 VIGSLASSAPILYY--GDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDG 259
V G+ A SAP++Y G + P + ++ S+ + D + + +W+ ++S G
Sbjct: 196 VKGAWAGSAPLIYMHDGGVDP-GAFDNITSRTYVDRGCNRY-ILANAWNATIRLSSTDAG 253
Query: 260 LAILSK----KFKTCTPLKNTTE-------LKDGLDTVYSEAAQYDTPSNI-------PV 301
L+ K TP+K + L++ ++ Y Y P+ PV
Sbjct: 254 RQWLNNNNVFKLDPRTPIKTAADGWNLNSYLREAIE--YMAMVDYPYPTGFLEPLPAWPV 311
Query: 302 KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY--------INEDRTGDESD 353
C + D + + V A ++ Y+ N + + + TG D
Sbjct: 312 DAACGYMNATGTSFSDQ--DLVSAVANAANIYYNYNKNASFTYCIDYSICGDQGTGGLGD 369
Query: 354 E--GWEWQSCSEMVVPM---GKDKNSMYQPEPWNLTKYIK-NCKEQYGVSPRPSWVLTYY 407
+ GW WQ CSE+++ M G + + P N+ +K C +G +W +
Sbjct: 370 DQLGWPWQECSEIIMGMCARGGSNDVFWNECPDNIYDDLKQGCISIFGSM---NWTTANW 426
Query: 408 GGHDIKLIL---RRSTSNIIFSNGMRDPFSRGGWVK-TYHFFDLSFSQDLNLGSHCLDLD 463
+K + +SN+I + G DP+S GG+ + + ++ +H LDL
Sbjct: 427 NIDAVKTLYGYDLSGSSNLILTQGHLDPWSGGGYTADQTNAARGIYVMEIPGSAHHLDLR 486
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWI 488
DP+ + R V I++ W+
Sbjct: 487 TPNTCDPNTVTNARFQIVSILKCWV 511
>gi|241753990|ref|XP_002401169.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215508366|gb|EEC17820.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 261
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 85/234 (36%), Positives = 130/234 (55%), Gaps = 6/234 (2%)
Query: 81 SKFWGGGNS-PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSR 139
+ FW + PI + G E P++D I +G + A +FKA+I+ EHR+YG+S+P R
Sbjct: 27 THFWEPTKTGPIFFYCGHELPLEDYINNTGLLWNWAREFKAMIIFSEHRFYGQSLPDDVR 86
Query: 140 KAALKNARHRGYFNSAQALADYASILLHIKDK-YNATHAPVIAIGASYGGELATWFRLKY 198
+ L + YF++ QALADYA ++LHIK+ + A PVIA G YGG LA +FRLKY
Sbjct: 87 TSHLPTVPYLNYFSAVQALADYAHLILHIKETVHRADKVPVIAFGGFYGGMLAAYFRLKY 146
Query: 199 PHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPD 258
PH++ G+LASSAP+ + + P + + ++K FR S C + I+KSW + ++
Sbjct: 147 PHLIAGALASSAPVQMFPGLVPCSAFDHTLTKAFRRESAACAKAIRKSWPHLLSLTDSSK 206
Query: 259 GLAILSKKFKTCTPLKNTTE--LKDGLDTVYSEAAQYDTPSNIPVKRICNAIEN 310
S+K+K C L + L D + Y A ++ P P R+ + N
Sbjct: 207 KAHEFSRKYKMCDNLTPQAQRMLLDWMHDTYLHLAMFNYPE--PSSRLTSLPAN 258
>gi|417400793|gb|JAA47320.1| Putative dipeptidyl peptidase 2 [Desmodus rotundus]
Length = 429
Score = 158 bits (399), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 107/362 (29%), Positives = 165/362 (45%), Gaps = 37/362 (10%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F ++ Q +DHF++ QTFPQR++++ KFW G P+ + G E + SGF
Sbjct: 40 FRESYFEQFLDHFNFERFGNQTFPQRFLVSEKFWKRGEGPLFFYTGNEGDVWAFANNSGF 99
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
E A Q AL+V EHRYYGKS+PFG R H QALAD+A +L ++
Sbjct: 100 ILELAAQQGALVVFAEHRYYGKSLPFGERS---TQRGHVELLTVEQALADFARLLQALRR 156
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
A P +A G SYGG L+ + R+KYPH+V G+LA+SAP++ + + VS
Sbjct: 157 DLGAQDVPAVAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVLVAGLGDAYQFFRDVSL 216
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVY 287
DF +C Q ++ ++ +I ++ + ++S+ F C PL K+ +L +
Sbjct: 217 DFEGQGPKCAQGVRDAFRQIKDLFLQ-GAYDVVSQAFGLCRPLSGWKDLVQLFGFARNAF 275
Query: 288 SEAAQYDTP------SNIPVKRICNAIENAPNCGDDIL-CKIAAGVVEADSLEYDGNNSR 340
+ A D P ++P + A + N D I + AG G+
Sbjct: 276 TVLAMMDYPYPTDFLGHLPANPVQVACDRLLNESDRIEGLRALAGASAPPPRGRCGDWGS 335
Query: 341 CYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP 400
C+ +E R S +G SE P P PW + ++ P P
Sbjct: 336 CHWHEGRWAVGSAQG-----TSECRTP---------PPSPWVGSNHLL---------PGP 372
Query: 401 SW 402
+W
Sbjct: 373 AW 374
>gi|324508835|gb|ADY43728.1| Serine protease pcp-1 [Ascaris suum]
Length = 484
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 135/463 (29%), Positives = 209/463 (45%), Gaps = 65/463 (14%)
Query: 75 QRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSV 134
+Y+ N+ ++ G PI + G EA I+ + +G ++ A +F A IV EHRYYG+S
Sbjct: 1 MKYLYNNTYYKIG-GPIFFYAGNEAAIEGFAENTGIMFDLAPRFNASIVFAEHRYYGESK 59
Query: 135 PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY--NATHAPVIAIGASYGGELAT 192
PFG + + ++ G+ S QA+AD+A L H K ++ PVIA G SYGG LA
Sbjct: 60 PFGDL--SYSDVKNLGFLTSTQAMADFAKFLPHFKANVLNCSSDTPVIAFGGSYGGMLAA 117
Query: 193 WFRLKYPHVVIGSLASSAPILYY--GDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI 250
WFR+KYPH+V G+ ASSAP+L + ++ P + VV++DF + + + K+++ I
Sbjct: 118 WFRIKYPHIVTGAWASSAPVLLFKGANVDP-GAFDKVVTEDFIEAGCN-REAVYKAFNAI 175
Query: 251 DNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV--------YSEAAQYDTPSNI--- 299
+AS GL L++ F N T+ D V Y Y P++
Sbjct: 176 HELASTTAGLTFLNEMF-VIEAKSNLTQASDADYLVSFIREAFGYLAMVNYPYPTSFLLP 234
Query: 300 ----PVKRICN----AIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDE 351
PVK C A P D++ + ++ Y G + + GD+
Sbjct: 235 LPGWPVKEACKRAQAAFPQTPTTNRDLVNYLY--IISNLYYNYTGTVATNCVKTSVCGDQ 292
Query: 352 SDE-------GWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGV-SPRPSWV 403
+ GW WQSC+E+V+ M S + C++ GV + +
Sbjct: 293 ATAESGDDAFGWPWQSCTELVIEMCARGGSN--------DFFYDECQQAGGVLNLITDYC 344
Query: 404 LTYYG------------GHDIKLILRRS-TSNIIFSNGMRDPFSRGG-WVKTYHFFDLS- 448
LT + G I+ L S SNIIF+NG DP+S GG + T S
Sbjct: 345 LTTFSSIGYNKNFLFELGAPIQYGLEFSAASNIIFTNGNLDPWSVGGVFANTSGIQQASE 404
Query: 449 ---FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
++ + +H LDL + DP + R V I+ W+
Sbjct: 405 NGVYTYFIEGSAHHLDLRQPNTCDPAPVKNARFQIVNIIDCWV 447
>gi|308481892|ref|XP_003103150.1| CRE-PCP-1 protein [Caenorhabditis remanei]
gi|308260255|gb|EFP04208.1| CRE-PCP-1 protein [Caenorhabditis remanei]
Length = 564
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 144/518 (27%), Positives = 237/518 (45%), Gaps = 74/518 (14%)
Query: 37 RRSRILEQ--KDSNH-----GFETFFY-NQTIDHFSYGPESYQTFPQRYVINSKFWGGGN 88
RRSR+ ++ K SN+ ET +Y N +DHF++G +TF R + N+ F+ G
Sbjct: 18 RRSRLFQKLHKASNYDAAPSNVETVWYKNMRLDHFTWGDT--RTFDMRVMWNNTFYKPG- 74
Query: 89 SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRK-AALKNAR 147
PI + G E ++ + +G ++ A + A I+ EHR+YG++ PFG+ A L N
Sbjct: 75 GPIFFYTGNEGGLESFVTATGIMFDLAPMYNASIIFAEHRFYGQTQPFGNNSYATLANV- 133
Query: 148 HRGYFNSAQALADYASILLHIK---DKYNATH---APVIAIGASYGGELATWFRLKYPHV 201
GY S QALADYA +L +K +++N T +I+ G SYGG L+ WFR KYPH+
Sbjct: 134 --GYLTSEQALADYAELLTELKRQPNQFNLTFQKDTQIISFGGSYGGMLSAWFRQKYPHI 191
Query: 202 VIGSLASSAPILYY--GDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDG 259
V G+ A SAP++Y G + P + ++ S+ + D F + W+ + N+++ G
Sbjct: 192 VKGAWAGSAPLIYMHDGGVDP-GAFDNITSRTYVDNGCNRF-ILANVWNAVLNLSNTDAG 249
Query: 260 LAILSKK----FKTCTPLKNTTE-------LKDGLDTVYSEAAQYDTPSNI-------PV 301
L+ TP++N T+ L++ ++ Y Y P+ PV
Sbjct: 250 RQWLNNNAVFTLDPRTPIRNQTDGWNLNAYLREAIE--YMAMVDYPYPTGFLEPLPAWPV 307
Query: 302 KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNN----------SRCYINEDRTGDE 351
C + D ++ V A ++ Y+ N S C
Sbjct: 308 AVACGYMNATGTTFSDQ--QLVTMVANAANIYYNYNKDPNFKYCIDYSVCGDQGTGGLGG 365
Query: 352 SDEGWEWQSCSEMVVPM---GKDKNSMYQPEPWNLTKYIKN-CKEQYG-VSPRPS-W--- 402
GW WQ CSE+++ M G + + N+ +K C +G + PS W
Sbjct: 366 DQLGWPWQECSEIIMAMCARGGSNDVFWSECGANIYDVLKQECVSIFGSMGWTPSNWNID 425
Query: 403 -VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGW-VKTYHFFDLSFSQDLNLGSHCL 460
V T Y G+D+ +SN+I + G DP+S GG+ V + + ++ +H L
Sbjct: 426 AVKTLY-GYDLS-----GSSNLILTQGHLDPWSGGGYKVDQNNAARGIYVLEIPGSAHHL 479
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAI 498
DL + DP+ + R ++I+ W+ +D I
Sbjct: 480 DLRQPNTCDPNTVTNARFQIIQILNCWVNPNCNDVPTI 517
>gi|328868233|gb|EGG16611.1| hypothetical protein DFA_07589 [Dictyostelium fasciculatum]
Length = 479
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 134/472 (28%), Positives = 219/472 (46%), Gaps = 61/472 (12%)
Query: 37 RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG--NSPILAF 94
R+L + SN +TF+++Q DHF + T+ Q+Y + ++ N+P+ F
Sbjct: 41 EEQRLLFEVSSN---QTFWFDQQQDHFD--QTNNITWKQQYQVIDDWFDPSQPNAPVFIF 95
Query: 95 LGAEAPID----DNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRG 150
L EAP+ +Q+ + A +FKAL VILEHR+YG+S P + + N +
Sbjct: 96 LAGEAPMGFFNFQEVQIRAW----AQEFKALYVILEHRFYGQSYP--TNDLSTHNLK--- 146
Query: 151 YFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
Y S QALAD A+ L K + + G SY G L+ WFRLKYP +V+GS+A S
Sbjct: 147 YLTSQQALADAANFLTTFKSERGIADNQAVVFGCSYSGALSAWFRLKYPQLVVGSVAPSG 206
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPD-GLAILSKKFKT 269
P+L + +T Y++ + C +++ ++ + + D G+ L K F +
Sbjct: 207 PVLAQLN---YTGYYAQFTN---SAPTSCVNAAQQASDQVMQLIKQGDKGIKQLEKTFNS 260
Query: 270 CTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEA 329
C+ LKN +L + ++ D +N P + N+ N + D+L A +
Sbjct: 261 CSSLKNGRDLYYFVYSIVEALGGADQMNNPPTWTL-NSTCNTLSQNSDLLVNWAEIFNQG 319
Query: 330 DSLEYDGNNSRCYINEDRTGDESDE----GWEWQSCSEMVVPMGKDKNSMYQPEPWNLTK 385
+ + R +I++ R +D+ W +Q+C+E S P N+ +
Sbjct: 320 LDDKCNDFTLRSFIDQARKTRINDQDGTRSWVFQTCAEFGYFSTTYPGSSVFPGLLNVEE 379
Query: 386 YIKNCKEQY---GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
+K C+E Y G++P W +YYGG +IK SNI+FSNG+ DP +
Sbjct: 380 QVKWCQEIYDVPGMTPNIDWTNSYYGGQEIK------GSNIMFSNGLLDP---------W 424
Query: 443 HFFDLSFSQDLNLGS---------HCLDLDEAKKSDPDWLVQQRKTEVKIMQ 485
H LS +QD G+ HC L + DP LV R+ + ++
Sbjct: 425 HL--LSVNQDNIDGTVKAVTYEAGHCGTLIASTTIDPPSLVDARQGIIGFLK 474
>gi|195427485|ref|XP_002061807.1| GK16991 [Drosophila willistoni]
gi|194157892|gb|EDW72793.1| GK16991 [Drosophila willistoni]
Length = 512
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 124/467 (26%), Positives = 215/467 (46%), Gaps = 45/467 (9%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW-GGGNSPILAFLGAEAPIDDNIQLSGF 110
E ++ Q +DHF P + QT+ QRY +N ++ +PI +G E G
Sbjct: 59 EDLWFEQRLDHFQ--PSNTQTWQQRYFVNEDYYRNDSTAPIFLMIGGEGEASKKWMHEGA 116
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
A F AL + LEHR+YGKS P ++ + N Y S QALAD A+ + +K
Sbjct: 117 WIHYAEHFSALCIQLEHRFYGKSHP--TKDLSTSNLV---YLTSEQALADLANFVAAMKV 171
Query: 171 KYNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
KY+ + IA G SY G LA W R KYPH++ GS++SS P+L D + Y VV
Sbjct: 172 KYDLKDSQKWIAFGGSYPGSLAAWAREKYPHLIYGSISSSGPLLAEVD---FKEYFDVVK 228
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNT-------TELKDG 282
+C + + +S+ +++ + G+A L K FKTCTP+K++ + +
Sbjct: 229 ASLASYKPDCVEAVTRSFGQVEILLKHMIGMANLDKTFKTCTPIKDSIHNDLDISNFFEN 288
Query: 283 LDTVYSEAAQY---DTP-SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNN 338
L ++ QY ++P + I + IC+ + N G + ++AA + L +N
Sbjct: 289 LAGNFAGVVQYNKDNSPHATITIDDICDVMLNT-TMGPPV-TRLAA----VNDLLLQQSN 342
Query: 339 SRCY-------------INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTK 385
+ C I+ D + W +Q+C+E + + + + +
Sbjct: 343 ASCLDYKYDKMIADMKNISWDSETAQGARQWTYQTCNEFGFYQTSENKADTFGDRFGVDF 402
Query: 386 YIKNCKEQYGVSPRPSWVLTYYGGHDIKL-ILRRSTSNIIFSNGMRDPFSRGGWVKTYHF 444
+I+ C++ Y ++ G + L+ T+N+++ +G DP+ G +K +
Sbjct: 403 FIRQCQDVYSSDMDARYLEQVVGQTNKHYGALKPETTNVLYIHGSIDPWHALGLIKPINA 462
Query: 445 FDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+ + +HC ++ E KSDP LV+ R +K + +T+Y
Sbjct: 463 KTPTIY--IEGTAHCANMYEPVKSDPPQLVEARNKILKYLANLLTEY 507
>gi|156354267|ref|XP_001623320.1| predicted protein [Nematostella vectensis]
gi|156210006|gb|EDO31220.1| predicted protein [Nematostella vectensis]
Length = 502
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 132/475 (27%), Positives = 211/475 (44%), Gaps = 32/475 (6%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL 92
L IR ++L + + E + + Q IDHF + P +T+ QRY +N FW G + P+L
Sbjct: 38 LKNIRHGKLLSPLPNINEVEGYDFEQYIDHFEFTPRP-RTYLQRYWMNRAFWKGPDGPVL 96
Query: 93 AFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYF 152
++G E+ + G + A ++ AL+ +EHRYYGKS FG K KN R Y
Sbjct: 97 LYVGGESVLSGGYIAGGHIVDIAKEYGALLFAVEHRYYGKSNFFGCLKT--KNMR---YL 151
Query: 153 NSAQALADYASILLHIKDKYNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAP 211
+S ALAD A + H K+K+ T I G SY G L+ WFR+KYPH+VIG++ASSAP
Sbjct: 152 SSQLALADLAQFVAHAKNKFGLTDKNKWITYGGSYPGSLSAWFRIKYPHLVIGAVASSAP 211
Query: 212 ILYYGDITPW-TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
+ D + S +S S+ C I++++ +D + + L K F C
Sbjct: 212 VEAQTDFKDYNNVVASSLSSPLVGGSKLCMHNIEEAFKFVDRLLDTKN-FKTLEKDFIAC 270
Query: 271 ---TPLKNTTELKDGLDTVYSEAAQYDTP-SNIPVKRICNAIENA---PNCGDDILCKIA 323
+ L +T L + QY+ I + +C + NA P IL K
Sbjct: 271 NDISKLNDTWMFASNLAGFFMGLVQYNNQVPGINIAYVCKQMNNASRSPYKSLSILYKQQ 330
Query: 324 AGVVEADSLEYDGNNSRCYINEDRTGDESD--EGWEWQSCSEM-VVPMGKDKNSMYQPEP 380
+ S N + + R D D W +QSC++ + +
Sbjct: 331 IQKTASCSDFSYENFMKTVKTQKRDPDGFDMIRQWYYQSCTQFGYFQTCEPGTHCVFSKR 390
Query: 381 WNLTKYIKNCKEQYGVS-----PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSR 435
+ + C+E + ++ R ++ YYGG R S I+F NG DP+
Sbjct: 391 LGIINDMDLCQEVFEIALGQLKARINFTNEYYGGK------RPRGSKIVFVNGSIDPWHS 444
Query: 436 GGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
V +++ + SHC ++ + +DP LV+ R+ I+ W+ +
Sbjct: 445 LSVVTNQTSSEVAVF--IPGTSHCANMGANQPNDPPALVEARRRVTAIVGEWLKE 497
>gi|76156383|gb|AAX27594.2| SJCHGC06818 protein [Schistosoma japonicum]
Length = 271
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 88/249 (35%), Positives = 139/249 (55%), Gaps = 7/249 (2%)
Query: 37 RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLG 96
R + KDS +ET ++ IDHFS+ + F +Y+IN++ + G PIL + G
Sbjct: 20 RHFPVPLNKDSQFKYETKYFRTKIDHFSFVTDG--EFEIKYLINNESFSSG-GPILFYTG 76
Query: 97 AEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQ 156
E I+ + SGF ++ A + A +V EHRYYG S+PFG+ + K+ ++ GY + Q
Sbjct: 77 NEGAIETFAENSGFIWKLAEELNASVVFAEHRYYGTSLPFGND--SFKDRQYFGYLTAEQ 134
Query: 157 ALADYASILLHIKDKYNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYY 215
ALADY ++ +K Y+ +PVI+ G SYGG L+ W R KYP+ + G++ASSAP+ +
Sbjct: 135 ALADYVLLINQLKVNYSCFASSPVISFGGSYGGMLSAWIRQKYPNQIAGAIASSAPVWLF 194
Query: 216 GDITPWTTYHSVVSKDF-RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK 274
++ + V + F + + C + I+ SWS I +I DG +L+ F CTPL
Sbjct: 195 PGLSDCNGFSLVATNSFLKYGGDNCVKNIQHSWSNIVDIGQSFDGKELLTNMFNICTPLT 254
Query: 275 NTTELKDGL 283
+ + D L
Sbjct: 255 DVQNIIDYL 263
>gi|118487656|gb|ABK95653.1| unknown [Populus trichocarpa]
Length = 213
Score = 155 bits (391), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 79/202 (39%), Positives = 118/202 (58%), Gaps = 8/202 (3%)
Query: 300 PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS-RCYINEDRTGDESDEGWEW 358
PV+ +C I+ P+ I V A SL Y+ + + +C+ E GW W
Sbjct: 18 PVQAMCKIIDGFPSGASKI-----TRVFAAASLYYNYSRAEKCFKLEHGPDAHGLHGWNW 72
Query: 359 QSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRR 418
Q+C+EMV+PM + SM+ ++ ++ ++C + +GV PRP W+ T +GG I L+L+R
Sbjct: 73 QACTEMVMPMTCSEESMFPTSSFSYKEFAEDCMKTFGVKPRPHWITTEFGGKRIDLVLKR 132
Query: 419 STSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRK 478
S NIIFSNGM+DP+SRGG +K ++ + G+H +D A K DP+WL + R+
Sbjct: 133 SGGNIIFSNGMQDPWSRGGVLKNISSSIIALVTE--KGAHHVDFRSATKDDPEWLKELRR 190
Query: 479 TEVKIMQGWITQYYDDFKAINK 500
EV+I+QGWI QYY D K NK
Sbjct: 191 QEVEIIQGWIDQYYPDLKRTNK 212
>gi|54648564|gb|AAH85041.1| LOC495469 protein, partial [Xenopus laevis]
Length = 502
Score = 154 bits (390), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 135/500 (27%), Positives = 233/500 (46%), Gaps = 44/500 (8%)
Query: 25 NGLKLRPRLGRIRRSRILEQKDSNH-----------GFETFFYNQTIDHFSYGPESYQTF 73
NG ++R +L +++ + + +N E++ Q +DHF+ + T+
Sbjct: 20 NGRQIRKKLHQLKETEQFKSFYTNRWMPKGAFPNTPSVESYIV-QPLDHFNRLNNA--TY 76
Query: 74 PQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKS 133
QRY IN ++W + P+ ++G E+ + + LSG E A +AL+V LEHRYYG S
Sbjct: 77 NQRYWINEQYWNHPDGPVFLYIGGESSLSEFSVLSGEHIELAQTHRALLVSLEHRYYGSS 136
Query: 134 V-PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV-IAIGASYGGELA 191
+ P G L+N + + +S QALAD AS + I KYN T I G SY G L+
Sbjct: 137 INPDG---LTLENIK---FLSSQQALADLASFHMFISQKYNLTRQNTWICFGGSYPGSLS 190
Query: 192 TWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKKSW 247
WFRLK+PH+V ++ASSAP+ D +T Y+ VV+ D SE+C +KK +
Sbjct: 191 AWFRLKFPHLVYAAVASSAPVRAELD---FTGYNKVVALSLADPVIGGSEKCLDAVKKGF 247
Query: 248 SEIDNIASKPDGLAILSKKFKTCTPLKNT---TELKDGLDTVYSEAAQYDTPSNIP-VKR 303
+D++ K + + + K F +C L+ + +E L ++ A QY+ S + V+
Sbjct: 248 QAVDSLLQKGN-VTQMEKDFYSCGSLQGSDDYSEFVGNLADIFMGAVQYNGMSPVSNVQY 306
Query: 304 ICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDES-----DEGWEW 358
IC + N + L + + + L N+ + + + S + W +
Sbjct: 307 ICQLMSIKDNSAYEGLRSVNRMYMNSMGLSCISNSHAKSVADLSSTKLSLIGVGERQWYY 366
Query: 359 QSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILR 417
Q+C+E +D + + P L + C + + V P S + + ++ R
Sbjct: 367 QTCTEFGYYQTCEDPSCPFSP-LITLKSQLDLCSQIFQV-PTESVLQSVQFTNEFYGADR 424
Query: 418 RSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQR 477
+S IIF NG DP+ +K +++ +N SHC ++ ++ SDP L + R
Sbjct: 425 PKSSRIIFVNGDVDPWHALSVLKNQSRSEIAIL--INGTSHCANMKPSQTSDPLSLQEAR 482
Query: 478 KTEVKIMQGWITQYYDDFKA 497
K + W+ + +
Sbjct: 483 KEIATQVATWLKSAQQELRG 502
>gi|340715503|ref|XP_003396251.1| PREDICTED: putative serine protease K12H4.7-like [Bombus
terrestris]
Length = 479
Score = 154 bits (389), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 132/449 (29%), Positives = 201/449 (44%), Gaps = 38/449 (8%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF+ +T+ RY NS+++ N PIL +G E I +G YE A
Sbjct: 51 QPVDHFNI--RDNRTWLMRYYENSRYFKK-NGPILIMIGGEWAISKGFLEAGLMYELATT 107
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
+ A++ EHRYYGKS P ++R+ Y + QALAD A + K N ++
Sbjct: 108 YNAIMYYTEHRYYGKSKP-----TEDTSSRNLQYLSVDQALADLAYFIETRKRDENLRNS 162
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
VI G SY G +ATW RLKYPH+V G+LASSAP+L D + Y+ VV++ R S+
Sbjct: 163 KVIVFGGSYAGNVATWVRLKYPHLVQGALASSAPVLAKVD---FYEYYEVVTESLRRYSQ 219
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT-----PLKNTTELKDGLDTVYSEAAQ 292
+C +K ++ E++ + + G L+K F C K+ + L ++ Q
Sbjct: 220 KCVNEVKAAFDEVEELLAIKGGAQKLTKYFNLCDVPDVHSFKDLGHFGNLLAEEFAGIVQ 279
Query: 293 YDTPSNIPVKRICNAIEN--APNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGD 350
YD N K I EN A G L ++A V + D + + + T D
Sbjct: 280 YDKVENNRTK-IAACCENMTAMYLGSP-LQRLAHLVSDKDKCLKNNYKKFVEVYRNETWD 337
Query: 351 ESDE---GWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSPRP 400
+ W +Q+C+E + + L + C + YG + R
Sbjct: 338 SQPDITRLWFYQTCTEYGYYQTTNSRRSVFGTLFPLPYFTGLCTDLYGYYYGNRFLYTRI 397
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL 460
T YGG LR N+IF+NG DP+ +K + F + + SHC
Sbjct: 398 GRTNTMYGG------LRPDLQNVIFTNGDVDPWHTLSVLKDLNAFSPAIL--IKGSSHCR 449
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
DL +D + L++ R KI+ WI+
Sbjct: 450 DLYSDLDTDAEDLIRARARVRKIIGTWIS 478
>gi|149039380|gb|EDL93600.1| dipeptidylpeptidase 7, isoform CRA_b [Rattus norvegicus]
Length = 329
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 89/252 (35%), Positives = 135/252 (53%), Gaps = 13/252 (5%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F ++ Q +DHF++ S +TF QR++++ KFW G PI + G E I SGF
Sbjct: 41 FRENYFEQYMDHFNFESFSNKTFGQRFLVSDKFWKMGEGPIFFYTGNEGDIWSLANNSGF 100
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY---FNSAQALADYASILLH 167
E A Q +AL+V EHRYYGKS+PFG + + RGY QALAD+A +L
Sbjct: 101 IVELAAQQEALLVFAEHRYYGKSLPFGVQ------STQRGYTQLLTVEQALADFAVLLQA 154
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
++ AP IA G SYGG L+ + R+KYPH+V G+LA+SAP++ + +
Sbjct: 155 LRHNLGVQDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVIAVAGLGNPDQFFRD 214
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLD 284
V+ DF S +C Q ++ ++ +I ++ + +S+ F TC L K+ T+L
Sbjct: 215 VTADFYGQSPKCAQAVRDAFQQIKDLFLQ-GAYDTISQNFGTCQSLSSPKDLTQLFGFAR 273
Query: 285 TVYSEAAQYDTP 296
++ A D P
Sbjct: 274 NAFTVLAMMDYP 285
>gi|407038295|gb|EKE39045.1| serine carboxypeptidase (S28) family protein [Entamoeba nuttalli
P19]
Length = 466
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 135/502 (26%), Positives = 220/502 (43%), Gaps = 82/502 (16%)
Query: 22 FHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINS 81
F ++ R G + RS ++E+ + Q IDHF + +T RY IN
Sbjct: 13 FRFQNIRFR---GEMDRSDVIERVST----------QPIDHFDLTNK--KTINIRYFIND 57
Query: 82 KFWGGGNSPILAFLGAE-----APIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPF 136
+ +P+L LG E A + ++ + A ++ +L++ +EHR+YGKSVP
Sbjct: 58 TIYSK-EAPLLVDLGGEGTQRAAAVGGRFVINKY----AEKYNSLMLAIEHRFYGKSVPE 112
Query: 137 GSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRL 196
G + + GY ++AQAL DY I+ IK +Y T PVI G SY G LATW R
Sbjct: 113 GGL-----SQENLGYLSAAQALEDYVMIINQIKKEYQIT-GPVIVFGGSYSGNLATWIRQ 166
Query: 197 KYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
KYP+VV ++ASSAP+ +T++ + + D E+C K++ I+ +
Sbjct: 167 KYPNVVYAAVASSAPVY------ATSTFYEFLDVIYNDMGEKCGNAWKEATESIEELFKT 220
Query: 257 PDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA---QYDTPSNIPVKRICNAIENAPN 313
G A L FKTCT +K +L + + + QY+ ++ ++ +CN +
Sbjct: 221 DSGKAQLKNDFKTCTEIKEEDDLTILIQQIQATMVNYPQYNGSYSLTIEGVCNVLTTEGK 280
Query: 314 CGDDILCKIAAGVVE-----------ADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCS 362
+ + + + AD L D N++ +R S W WQ CS
Sbjct: 281 TAYENMVDLMSHAFNEFGFKCAPSSYADMLT-DMANTKTEEEGNRLA--STRSWAWQICS 337
Query: 363 EMVVPMGKDKNSMYQP--EPWNLTKYIKN------CKEQYGVSP-----RPSWVLTYYGG 409
E S +QP E +K + N CK+ + V R YGG
Sbjct: 338 EY---------SYFQPVNESLPFSKRLNNEFYYLLCKDIFNVDKQRLDRRVHHTNLMYGG 388
Query: 410 HDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSD 469
+ K +N+ +++G DP+S +T ++ + +HC DL K +D
Sbjct: 389 YKPK------ATNVAYTSGSTDPWSPLAKHETLPSDINCYASYIKGTAHCADLYAEKDTD 442
Query: 470 PDWLVQQRKTEVKIMQGWITQY 491
P+ L QQR + + I++Y
Sbjct: 443 PEQLKQQRMETAQFIDELISRY 464
>gi|297677423|ref|XP_002816600.1| PREDICTED: thymus-specific serine protease isoform 1 [Pongo abelii]
Length = 514
Score = 153 bits (387), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 127/469 (27%), Positives = 206/469 (43%), Gaps = 62/469 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ + ++F QRY +N + W G + PI LG E + + G
Sbjct: 61 WLEQLLDPFNV--SNRRSFLQRYWVNDQHWVGQDGPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L+ A+ R + +S ALAD S L + +N
Sbjct: 119 APAWGALVISLEHRFYGLSIPAG----GLEMAQLR-FLSSRHALADVVSARLALSRLFNI 173
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LA W RLK+PH++ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 174 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD---FSEYNDVVSRSLM 230
Query: 234 DT----SEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDT 285
T S EC + +++E++ + S A L + C PL +N EL L
Sbjct: 231 STAIGGSLECRAAVSVAFAEVERRLRSGGAAQAALRTELSACGPLGRAENQAELLGALQA 290
Query: 286 VYSEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYD 335
+ QYD + P V+++C N + P CG +I + L +
Sbjct: 291 LVGGVVQYDGQAGAPLSVRQLCGLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQKCLSFS 350
Query: 336 GNNSRCYI--NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
+ + E + D W +Q+C+E + + + L + C++
Sbjct: 351 RAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCENPRCPFSQLPALPSQLDLCEQV 410
Query: 394 YGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+G+S + +YYGG + ++F NG DP +H LS
Sbjct: 411 FGLSALSVAQAVAQTNSYYGGQT------PGANQVLFVNGDTDP---------WHV--LS 453
Query: 449 FSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+Q L GSHCLD+ + SD L R+ + +Q W+
Sbjct: 454 VTQALGSSESTLLIRTGSHCLDMAPERPSDSPSLRLGRQNIFQQLQTWL 502
>gi|281210278|gb|EFA84445.1| hypothetical protein PPL_02477 [Polysphondylium pallidum PN500]
Length = 761
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 134/457 (29%), Positives = 209/457 (45%), Gaps = 58/457 (12%)
Query: 15 VIFVSTSFHAN-GLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTF 73
VIF +F+ + L+ +P+ + + + S+ G E ++ Q +DHF ++ +TF
Sbjct: 20 VIFGGRAFNPHFDLRKKPKFQLSIDYALKQNRHSSKGVEYQWFTQELDHFDQ--QNTKTF 77
Query: 74 PQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEHRYYGK 132
QRY+IN ++W G P+ + E P+ L G Y + A QF AL+V LEHRYYG
Sbjct: 78 QQRYLINDQYWDG-KGPVFIMINGEGPMTIGTVL-GLKYIDWAKQFNALVVALEHRYYGA 135
Query: 133 S----------VPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV-IA 181
S + + S A KN + S LAD A I +YN T + ++
Sbjct: 136 SFATPDISTENLQYLSSDQASKNIQRLILIISFFRLADNAVFRQFIAKQYNVTSSSKWVS 195
Query: 182 IGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS--EEC 239
G SY G L +WFRLKYP++V +++SSAP+L D + Y VV TS +EC
Sbjct: 196 FGGSYSGALTSWFRLKYPNLVDFTISSSAPVLAEVD---FYQYLEVVQNSLLTTSKGQEC 252
Query: 240 FQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV---YSEAAQY--D 294
I + +I + DGL +S F C PL ++ + ++ + QY +
Sbjct: 253 VNNIASATQKIQTLLQTSDGLQKVSDLFDLCPPLATQDDISTFMQSLAGNFMGTVQYNLE 312
Query: 295 TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-----INEDRTG 349
P + +C + N N K+ G DG Y + ++ T
Sbjct: 313 APGAATITNLCEIMTNPDNDPITNYVKVWQGFT-------DGCTDTSYETMIDLMKNNTN 365
Query: 350 DESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPW-NL--TKYI-KNCKEQYG--VSPRP 400
D S EG W +Q+C+E G ++S +P+ NL +Y+ K C+E +G +P
Sbjct: 366 DASVEGGKMWFYQTCTE----FGYYQSSDSTKQPFGNLIPIEYLTKQCQEVFGFNFTPNV 421
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
W +T YGG + NI++ NG DP+ G
Sbjct: 422 EWTITKYGG------INPDADNILYVNGDIDPWHALG 452
>gi|268619142|gb|ACZ13347.1| prolyl carboxy peptidase-like protein [Bursaphelenchus xylophilus]
Length = 401
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 114/389 (29%), Positives = 199/389 (51%), Gaps = 50/389 (12%)
Query: 10 WLLLMVIFVSTSFHANGLK----LRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSY 65
WLLL + V NGL+ LR + I ++RI + ++ + + IDHF++
Sbjct: 5 WLLLAPLLV------NGLRPGAFLRDPVTLIDQNRI-PNGEPSYSWSEEHLDVPIDHFAF 57
Query: 66 GPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVIL 125
+ FP RY IN ++ G PI + G E ++ + +GF ++ A ++KA IV
Sbjct: 58 A--DTREFPLRYFINLTYYEPG-GPIFFYTGNEGKLEVFAENTGFIWDIAPEYKAAIVFT 114
Query: 126 EHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY--NATHAPVIAIG 183
EHR+YG S+PFG + + K+ ++ GY S QALAD+A ++ ++K + ATH+PVI G
Sbjct: 115 EHRFYGNSLPFG--EDSYKHIKNLGYLTSEQALADFADVITYLKTQRIPQATHSPVIVFG 172
Query: 184 ASYGGELATWFRLKYPHVVIGSLASSAPILYYGDI-TPWTTYHSVVSKDFRDTSEECFQT 242
SYGG LA WFR+KYPH+ G++A+SAP+L++ + Y ++ ++ F+ + +
Sbjct: 173 GSYGGMLAAWFRIKYPHLADGAIAASAPLLWFQNTGVRQDGYANITTRTFKLSGCD-LTH 231
Query: 243 IKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLD-----TVYSEAA------ 291
++ S+ + +A DG L+K K L ++E + D ++++
Sbjct: 232 LRASFDAMRTLAKTEDGRDHLNKVLK----LGKSSEFEHSHDYNILVNIFADVMGNVVMI 287
Query: 292 QYDTPSNI-------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSR---C 341
Y P+N PVK++C + DD + + + ++ Y+ + C
Sbjct: 288 DYPYPTNFFAQVPAWPVKKMCEKFNG--DIPDDPK-ETVKPLYDILNIFYNTSGKLEEFC 344
Query: 342 YINEDRTGDE--SDEGWEWQSCSEMVVPM 368
D D+ + +GW WQ C+EM++P+
Sbjct: 345 LRGPDCGNDQLGAMDGWNWQICTEMIMPI 373
>gi|91078858|ref|XP_972061.1| PREDICTED: similar to thymus-specific serine protease [Tribolium
castaneum]
Length = 501
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 137/527 (25%), Positives = 241/527 (45%), Gaps = 76/527 (14%)
Query: 10 WLLLMVIFVSTS------FHANGLKLRPRLGR----IRRSRILEQKDSNHGFETFFYNQT 59
WLLL+ +F +S NG + LG + S I E ++ ++ Q
Sbjct: 5 WLLLLTLFYISSEVSSWRIFRNGRMVGGNLGEPKCNCKESSIKEVQEE-------WFTQN 57
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNS-PILAFLGAEAPIDDNIQLSGFTYENAHQF 118
+DHF+ P T+ QR+ N +F+ N P+ +G E G A +F
Sbjct: 58 LDHFN--PTDETTWKQRFYSNDQFFDPKNGGPVFLMIGGEGEASIKWMTQGAWVNYAEKF 115
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHA 177
AL+ LEHRYYGKS P + + +N + Y S QALAD A+ + + +KY+
Sbjct: 116 GALMFQLEHRYYGKSHP--TDDLSTQNLK---YLTSQQALADLATFITAMNEKYSLPPDV 170
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
IA G SY G LA W R KYPH+V G++++S P+L D + Y V+ + S+
Sbjct: 171 KWIAFGGSYPGSLAAWLRFKYPHLVHGAMSASGPLLAQVD---FKDYFRVIKESLATHSD 227
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA------ 291
+C +++ +I + + G A L++ FK C P++N+ + + +Y A
Sbjct: 228 DCVTAVQQGVDQIGVLLKQEIGQANLNELFKLCDPVQNSINNEKDISNLYETIADDFAGV 287
Query: 292 -QYD--------TPSNIPVKRICNAIENAPNCGDDI--LCKIAAGVVEA---DSLEYDGN 337
QY+ +NI + +C+ + N G + L K+ ++ A L+Y+ +
Sbjct: 288 VQYNKDNRVGSPAGANITIDVVCDIMVNQ-TIGPPVNRLAKVNEVLLSAYDQKCLDYNYD 346
Query: 338 ---NSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPE----PWNLTKYIKNC 390
N+ ++ D E W +Q+C+E G + S Y+P+ +++ +I+ C
Sbjct: 347 KMINNLRNVSWDSEASEGGRQWTYQTCTE----FGFYQTSDYEPQIFGDQFSVDFFIQQC 402
Query: 391 KEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
+ +G ++ TYYGG DI++ SN++F +G DP+ G KT
Sbjct: 403 TDIFGSIYDEDFLNSATERTNTYYGGLDIEV------SNVVFVHGSIDPWHALGITKTID 456
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+ + + + +HC ++ +D L + R+ + ++ W+ Q
Sbjct: 457 --EEAPAIYIEGTAHCANMYPPADTDLPQLKEAREQILNLIGTWLAQ 501
>gi|449706553|gb|EMD46378.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
KU27]
Length = 466
Score = 153 bits (386), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 136/502 (27%), Positives = 220/502 (43%), Gaps = 82/502 (16%)
Query: 22 FHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINS 81
F ++ R G I RS ++E + Q IDHF + +T RY IN
Sbjct: 13 FRFQNIRFR---GEIDRSDVIEHVST----------QPIDHFDLTNK--KTINIRYFIND 57
Query: 82 KFWGGGNSPILAFLGAE-----APIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPF 136
+ +P+L LG E A + ++ + A ++ +L++ +EHR+YGKSVP
Sbjct: 58 TIYSK-EAPLLVDLGGEGTQRAAAVGGRFVINKY----AEKYNSLMLAIEHRFYGKSVPE 112
Query: 137 GSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRL 196
G + + GY ++AQAL DY I+ IK +Y T PVI G SY G LATW R
Sbjct: 113 GGL-----SQENLGYLSAAQALEDYIMIINQIKKEYQIT-GPVIVFGGSYSGNLATWIRQ 166
Query: 197 KYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
KYP+VV ++ASSAP+ +T++ + + D E+C K++ I+ +
Sbjct: 167 KYPNVVYAAVASSAPVY------ATSTFYEFLDVIYNDMGEKCGNAWKEATDSIEELFKT 220
Query: 257 PDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA---QYDTPSNIPVKRICNAIENAPN 313
G A L FKTCT +K +L + + + QY+ ++ ++ +CN +
Sbjct: 221 DSGKAQLKNDFKTCTEIKEEDDLTILIQQIQATMVNYPQYNGSYSLTIEGVCNILTTEGK 280
Query: 314 CGDDILCKIAAGVVE-----------ADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCS 362
+ + + + AD L D N++ +R S W WQ CS
Sbjct: 281 TAYENMVDLMSHAFNEFGFECAPSSYADMLT-DMANTKTEEEGNRLA--STRSWAWQICS 337
Query: 363 EMVVPMGKDKNSMYQP--EPWNLTKYIKN------CKEQYGVSPRPSWVLTY-----YGG 409
E S +QP E +K + N CK+ + V + Y YGG
Sbjct: 338 EY---------SYFQPVNESLPFSKRLNNEFYYLLCKDIFNVDKQRLDRRVYHTNLMYGG 388
Query: 410 HDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSD 469
+ K +N+ +++G DP+S +T ++ + +HC DL K +D
Sbjct: 389 YKPK------ATNVAYTSGSTDPWSPLAKHETLPSDINCYASYIKGTAHCADLYAEKDTD 442
Query: 470 PDWLVQQRKTEVKIMQGWITQY 491
P+ L QQR + + I++Y
Sbjct: 443 PEQLKQQRMETAQFIDELISRY 464
>gi|302758754|ref|XP_002962800.1| hypothetical protein SELMODRAFT_404105 [Selaginella moellendorffii]
gi|300169661|gb|EFJ36263.1| hypothetical protein SELMODRAFT_404105 [Selaginella moellendorffii]
Length = 674
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 74/161 (45%), Positives = 107/161 (66%), Gaps = 6/161 (3%)
Query: 75 QRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGK 132
Q Y+++S W GG S PIL + G + I + +GF ++ AH F+AL+V EHR+YGK
Sbjct: 18 QHYLLHSASWSGGASGAPILVYCGNKGDIVWFAENTGFMFDIAHLFRALLVFPEHRFYGK 77
Query: 133 SVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELAT 192
S PFG + + + ++ QALAD+A+++L +K +A +PV+ G SYGG LA
Sbjct: 78 SQPFGGQNGP----KELAFCSAEQALADFATLILDLKRNLSAQASPVVVFGGSYGGMLAA 133
Query: 193 WFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
WFRLKYPH+ IG+LASSAPIL + +I P+TTY+ +VS F+
Sbjct: 134 WFRLKYPHIAIGALASSAPILQFENIVPYTTYYDIVSNAFK 174
>gi|156717482|ref|NP_001096281.1| protease, serine, 16 (thymus) precursor [Xenopus (Silurana)
tropicalis]
gi|134025460|gb|AAI35535.1| LOC100124847 protein [Xenopus (Silurana) tropicalis]
Length = 506
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 139/489 (28%), Positives = 227/489 (46%), Gaps = 40/489 (8%)
Query: 25 NGLKLRPRLGRIRRSRILEQKDSNHGF-ETFFYN---------QTIDHFSYGPESYQTFP 74
NG ++R +L +++ + + ++ G + F N Q +DHF+ S TF
Sbjct: 24 NGRQIRKKLHQLKETENFKSFYTHWGMPKGAFANTPSVESYIAQPLDHFNRRNNS--TFN 81
Query: 75 QRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSV 134
QRY IN ++W N P+ ++G E+ + + LSG + A +AL+V LEHRYYG S+
Sbjct: 82 QRYWINEEYWNHPNGPVFLYIGGESSLSEFSVLSGEHVDLAQTHRALLVSLEHRYYGSSI 141
Query: 135 -PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV-IAIGASYGGELAT 192
P G L+N R + +S QALAD AS + I KYN T I G SY G L+
Sbjct: 142 NPDG---LTLENIR---FLSSQQALADLASFHMFISQKYNLTRQNTWICFGGSYPGSLSA 195
Query: 193 WFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKKSWS 248
WFRLK+PH+V ++ASSAP+ D +T Y+ VV+ D SE+C +K+ +
Sbjct: 196 WFRLKFPHLVYAAVASSAPVRAELD---FTGYNKVVAWSLADPVIGGSEKCLDAVKEGFQ 252
Query: 249 EIDNIASKPDGLAILSKKFKTCTPLKNT---TELKDGLDTVYSEAAQYDTPSNIP-VKRI 304
+D++ K + + L K F +C L+ + TE L ++ A QY+ S V++I
Sbjct: 253 AVDSLLQKGN-ITQLEKDFYSCGSLQGSDDYTEFVGNLADIFMGAVQYNGMSPASNVQQI 311
Query: 305 CNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEG-----WEWQ 359
C + N + L + ++ L N+ + + + S G W +Q
Sbjct: 312 CQLMTIKDNSAYEGLRSVNRMYMDFMGLSCVYNSHAKSVADLSSTKLSLVGVGERQWFYQ 371
Query: 360 SCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRS 419
+C+E + S L + C + + V P S + + ++
Sbjct: 372 TCTEFGYYQTCEDPSCPFSSLITLKSQLDLCSQIFQV-PTESVLQSVQFTNEFYGADHPK 430
Query: 420 TSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKT 479
+S IIF NG DP+ +K +++ +N SHC ++ + SDP L + RK
Sbjct: 431 SSRIIFVNGDVDPWHALSVLKNQSRSEIAIL--INGTSHCANMSPSHTSDPLSLQEARKE 488
Query: 480 EVKIMQGWI 488
+ W+
Sbjct: 489 IAAQVATWL 497
>gi|126631396|gb|AAI33756.1| LOC495469 protein [Xenopus laevis]
Length = 505
Score = 152 bits (385), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 134/500 (26%), Positives = 233/500 (46%), Gaps = 44/500 (8%)
Query: 25 NGLKLRPRLGRIRRSRILEQKDSNH-----------GFETFFYNQTIDHFSYGPESYQTF 73
NG ++R +L +++ + + +N E++ Q +DHF+ + T+
Sbjct: 23 NGRQIRKKLHQLKETEQFKSFYTNRWMPKGAFPNTPSVESYIV-QPLDHFNRLNNA--TY 79
Query: 74 PQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKS 133
QRY IN ++W + P+ ++G E+ + + LSG E A +AL+V LEHRYYG S
Sbjct: 80 NQRYWINEQYWNHPDGPVFLYIGGESSLSEFSVLSGEHIELAQTHRALLVSLEHRYYGSS 139
Query: 134 V-PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV-IAIGASYGGELA 191
+ P G L+N + + +S QALAD AS + I KYN T I G SY G L+
Sbjct: 140 INPDG---LTLENIK---FLSSQQALADLASFHMFISQKYNLTRQNTWICFGGSYPGSLS 193
Query: 192 TWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKKSW 247
WFRLK+PH+V ++ASSAP+ D +T Y+ VV+ D SE+C +KK +
Sbjct: 194 AWFRLKFPHLVYAAVASSAPVRAELD---FTGYNKVVALSLADPVIGGSEKCLDAVKKGF 250
Query: 248 SEIDNIASKPDGLAILSKKFKTCTPLKNT---TELKDGLDTVYSEAAQYDTPSNIP-VKR 303
+D++ K + + + K F +C L+ + +E L ++ A QY+ S + V+
Sbjct: 251 QAVDSLLQKGN-VTQMEKDFYSCGSLQGSDDYSEFVGNLADIFMGAVQYNGMSPVSNVQY 309
Query: 304 ICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDES-----DEGWEW 358
IC + N + L + + + L N+ + + + S + W +
Sbjct: 310 ICQLMSIKDNSAYEGLRSVNRMYMNSMGLSCISNSHAKSVADLSSTKLSLIGVGERQWYY 369
Query: 359 QSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILR 417
Q+C+E +D + + P L + C + + V P S + + ++ +
Sbjct: 370 QTCTEFGYYQTCEDPSCPFSP-LITLKSQLDLCSQIFQV-PTESVLQSVQFTNEFYGADQ 427
Query: 418 RSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQR 477
+S IIF NG DP+ +K +++ +N SHC ++ ++ SDP L + R
Sbjct: 428 PKSSRIIFVNGDVDPWHALSVLKNQSRSEIAIL--INGTSHCANMKPSQTSDPLSLQEAR 485
Query: 478 KTEVKIMQGWITQYYDDFKA 497
K + W+ + +
Sbjct: 486 KEIATQVATWLKSAQQELRG 505
>gi|323453591|gb|EGB09462.1| hypothetical protein AURANDRAFT_23831 [Aureococcus anophagefferens]
Length = 472
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 124/471 (26%), Positives = 206/471 (43%), Gaps = 57/471 (12%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP-----------ILAFLGAEA 99
+ET ++ Q++ H +YQ QRY++N FWG G++P +L + G E
Sbjct: 20 YETKWHTQSLTHAKGDDRTYQ---QRYLVNDTFWGKGSAPLWRDDDSCPGPVLFYSGNEG 76
Query: 100 PIDDNIQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL 158
P+D +GF + A ++ A +++ E RYYG S+PFG+ +N + Y ++ L
Sbjct: 77 PVDGFWPANGFMTDYLAPKWGAYVLMAEARYYGASLPFGNASWTPENVQ---YLSTELIL 133
Query: 159 ADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDI 218
ADYA +L +K PV++ G SYGG L T+FRL YP VV+G LA+SAPI YY D
Sbjct: 134 ADYARLLTELKSSLQGC--PVVSFGGSYGGTLTTFFRLTYPDVVVGGLAASAPIGYY-DP 190
Query: 219 TPWT-------TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
W T+ ++++D+ D + C I+ + ++ A+ P+ L L F C
Sbjct: 191 AHWKDHGVDAYTFSDIIARDYDDAAPGCLDAIRATTDALN--AASPEALVDL---FHLCD 245
Query: 272 PLKNTTELKDGLDTVYSEAAQYDTPSNI------PVKRICNAIENAPNCGDDILCKIAAG 325
Q D P + PV C+ + A + ++AA
Sbjct: 246 AAALGPTRAALWQYALESLPQLDYPRAVGSIPAWPVNHTCHLLARASTAAARL--RVAA- 302
Query: 326 VVEADSLEYDGNNSRCY------INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPE 379
E ++ C+ + W +QSC+E + S +
Sbjct: 303 --EVQAMVLGTGGETCFPALVEGPGGVPGDGPGPDSWGYQSCTENLHEF--SSKSKVRDY 358
Query: 380 PWNLTKYIKNCKEQY-GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGW 438
++ C + +P P + YGG++I +N+IFSNG+ DP+ GG+
Sbjct: 359 TFDFEAQASLCGSLFDDTTPDPRRLTALYGGYEIP----AKVTNVIFSNGLLDPWHGGGF 414
Query: 439 VKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
+ + + + G+H DL + + DP + R E + GW+
Sbjct: 415 YPSDNADASNVFCVMPKGAHHGDLRKPEADDPADIKACRALEEATIGGWLA 465
>gi|268575350|ref|XP_002642654.1| C. briggsae CBR-PCP-1.1 protein [Caenorhabditis briggsae]
Length = 512
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 129/472 (27%), Positives = 215/472 (45%), Gaps = 56/472 (11%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF++G +TF R + N+ ++ G PI + G E + +G ++ A F
Sbjct: 19 LDHFTWG--DTRTFDLRIMWNNTYYQPG-GPIFFYTGNEGAVSTFEVATGMMFDLAPMFN 75
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA----- 174
A I+ EHR+YG + PFG++ A N + GY S QALADYA +L +K N
Sbjct: 76 ASIIFAEHRFYGATQPFGNQSYA--NLANVGYLTSEQALADYAELLTELKRDNNQFGKTF 133
Query: 175 -THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYY--GDITPWTTYHSVVSKD 231
+ VI+ G SYGG L+ WFR KYPH+V G+ A SAP++Y G + P + ++ S+
Sbjct: 134 HRDSQVISFGGSYGGMLSAWFRQKYPHIVKGAWAGSAPLIYMHDGGVDP-GAFDNITSRT 192
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSK----KFKTCTPLKNTTELKDGLDTVY 287
+ + F + +W+ + N++S G A L+ K TP+ N T+ + L+
Sbjct: 193 YVENGCNRF-ILANAWNAVLNLSSTDSGRAWLNNNPVFKLDPRTPINNQTDGWN-LNAYM 250
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCGDDILC-------------KIAAGVVEADSLEY 334
EA +Y + P +E P + C + V A ++ Y
Sbjct: 251 REAIEYMAMVDYPYPT--GFLEPLPGWPVTVACGYMNATGESFSDQDLVTAVANAANVYY 308
Query: 335 DGNNSRCY--------INEDRTGDESDE--GWEWQSCSEMVVPM---GKDKNSMYQPEPW 381
+ N + + + TG D+ GW WQ CSE+++ M G + +
Sbjct: 309 NYNQNANFTWCIDFNICGDQGTGGLGDDALGWPWQECSEIIMAMCASGGANDVFWSECGD 368
Query: 382 NLTKYIK-NCKEQYGVSPRPSWVLTYYGGHDIKLIL---RRSTSNIIFSNGMRDPFSRGG 437
N+ +K C +G W + +K + +SN+I + G DP+S GG
Sbjct: 369 NIYDTLKQGCVSIFG---SMKWTTANWNIDAVKTLYGYDLSGSSNLILTQGHLDPWSGGG 425
Query: 438 W-VKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ V + + ++ +H LDL + DP+ +V R V+I++ W+
Sbjct: 426 YKVDQTNTARGIYVMEIPGSAHHLDLRQPNTCDPNTVVNARYQIVQILKCWV 477
>gi|125576268|gb|EAZ17490.1| hypothetical protein OsJ_33021 [Oryza sativa Japonica Group]
Length = 259
Score = 152 bits (384), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 90/255 (35%), Positives = 139/255 (54%), Gaps = 32/255 (12%)
Query: 259 GLAILSKKFKTCTPLKNTTELKDGL---DTVYSEAAQYDTPSNI-------PVKRICNAI 308
G A L++ F C K + GL +Y Y TPSN PV+ IC AI
Sbjct: 10 GRARLNQTFNMC---KGNVDDIPGLVEKALIYGSMMDYPTPSNFLTSLPAYPVREICRAI 66
Query: 309 ENAPNCGDDILCKIAAGVVEADSLEYDGNNS-RCYINEDRTGDESDE------GWEWQSC 361
+ P G+D + +I +A ++ Y+ C+ G E D+ GW WQ+C
Sbjct: 67 DK-PTSGNDTVSRIK----DAMTIYYNSTGGLACFPG---AGAEDDDPYGMFPGWTWQAC 118
Query: 362 SEMVVPM--GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRS 419
+E+++ M G +++ P+P+NLT Y+ C GV PRP W+ +Y+GG+DI+ +L+RS
Sbjct: 119 TEVIMTMSYGIGNATVFPPDPFNLTAYLAGCLATTGVPPRPHWIQSYFGGYDIRNVLKRS 178
Query: 420 TSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKT 479
SNIIF NG+RDP+S GG +K+ ++ + G H +DL + K DP+WL + R+
Sbjct: 179 GSNIIFFNGLRDPWSAGGILKSISNSIIALVEP--KGGHHVDLRFSTKEDPEWLKKVRRQ 236
Query: 480 EVKIMQGWITQYYDD 494
E++I+ W+ QYY D
Sbjct: 237 EMRIIADWLKQYYSD 251
>gi|5031993|ref|NP_005856.1| thymus-specific serine protease [Homo sapiens]
gi|13633990|sp|Q9NQE7.2|TSSP_HUMAN RecName: Full=Thymus-specific serine protease; AltName: Full=Serine
protease 16; Flags: Precursor
gi|3510663|gb|AAC33563.1| thymus specific serine peptidase [Homo sapiens]
gi|119623493|gb|EAX03088.1| protease, serine, 16 (thymus), isoform CRA_a [Homo sapiens]
gi|162318714|gb|AAI56929.1| Protease, serine, 16 (thymus) [synthetic construct]
gi|162318946|gb|AAI56192.1| Protease, serine, 16 (thymus) [synthetic construct]
gi|307685491|dbj|BAJ20676.1| protease, serine, 16 [synthetic construct]
Length = 514
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 127/469 (27%), Positives = 205/469 (43%), Gaps = 62/469 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W G + PI LG E + + G
Sbjct: 61 WLEQLLDPFNV--SDRRSFLQRYWVNDQHWVGQDGPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L+ A+ R + +S ALAD S L + +N
Sbjct: 119 APAWGALVISLEHRFYGLSIPAG----GLEMAQLR-FLSSRLALADVVSARLALSRLFNI 173
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LA W RLK+PH++ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 174 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD---FSEYNDVVSRSLM 230
Query: 234 DT----SEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDT 285
T S EC + +++E++ + S A L + C PL +N EL L
Sbjct: 231 STAIGGSLECRAAVSVAFAEVERRLRSGGAAQAALRTELSACGPLGRAENQAELLGALQA 290
Query: 286 VYSEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYD 335
+ QYD + P V+++C N + P CG +I + L +
Sbjct: 291 LVGGVVQYDGQTGAPLSVRQLCGLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQKCLSFS 350
Query: 336 GNNSRCYI--NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
+ + E + D W +Q+C+E + + + L + C++
Sbjct: 351 RAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCENPRCPFSQLPALPSQLDLCEQV 410
Query: 394 YGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+G+S + +YYGG + ++F NG DP +H LS
Sbjct: 411 FGLSALSVAQAVAQTNSYYGGQT------PGANKVLFVNGDTDP---------WHV--LS 453
Query: 449 FSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+Q L GSHCLD+ + SD L R+ + +Q W+
Sbjct: 454 VTQALGSSESTLLIRTGSHCLDMAPERPSDSPSLRLGRQNIFQQLQTWL 502
>gi|426351972|ref|XP_004043496.1| PREDICTED: thymus-specific serine protease [Gorilla gorilla
gorilla]
Length = 514
Score = 152 bits (383), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 127/469 (27%), Positives = 205/469 (43%), Gaps = 62/469 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W G + PI LG E + + G
Sbjct: 61 WLEQLLDPFNV--SDRRSFLQRYWVNDQHWVGQDGPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L+ A+ R + +S ALAD S L + +N
Sbjct: 119 APAWGALVISLEHRFYGLSIPAG----GLEMAQLR-FLSSRLALADVVSARLALSRLFNI 173
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LA W RLK+PH++ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 174 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD---FSEYNDVVSRSLM 230
Query: 234 DT----SEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDT 285
T S EC + +++E++ + S A L + C PL +N EL L
Sbjct: 231 STAIGGSLECRAAVSVAFAEVERRLRSGGAAQAALRTELSACGPLGRAENQAELLGALQA 290
Query: 286 VYSEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYD 335
+ QYD + P V+++C N + P CG +I + L +
Sbjct: 291 LVGGVVQYDGQAGAPLSVRQLCGLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQKCLSFS 350
Query: 336 GNNSRCYI--NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
+ + E + D W +Q+C+E + + + L + C++
Sbjct: 351 RAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCENPRCPFSQLPALPSQLDLCEQV 410
Query: 394 YGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+G+S + +YYGG + ++F NG DP +H LS
Sbjct: 411 FGLSALSVAQAVAQTNSYYGGQT------PGANQVLFVNGDTDP---------WHV--LS 453
Query: 449 FSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+Q L GSHCLD+ + SD L R+ + +Q W+
Sbjct: 454 VTQALGSSESTLLIRTGSHCLDMAPERPSDSPSLRLGRQNIFQQLQTWL 502
>gi|67472594|ref|XP_652089.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
gi|56468897|gb|EAL46703.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
Length = 466
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 135/502 (26%), Positives = 220/502 (43%), Gaps = 82/502 (16%)
Query: 22 FHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINS 81
F ++ R G + RS ++E + Q IDHF + +T RY IN
Sbjct: 13 FRFQNIRFR---GEMDRSDVIEHVST----------QPIDHFDLTNK--KTINIRYFIND 57
Query: 82 KFWGGGNSPILAFLGAE-----APIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPF 136
+ +P+L LG E A + ++ + A ++ +L++ +EHR+YGKSVP
Sbjct: 58 TIYSK-EAPLLVDLGGEGTQRAAAVGGRFVINKY----AEKYNSLMLAIEHRFYGKSVPE 112
Query: 137 GSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRL 196
G + + GY ++AQAL DY I+ IK +Y T PVI G SY G LATW R
Sbjct: 113 GGL-----SQENLGYLSAAQALEDYIMIINQIKKEYQIT-GPVIVFGGSYSGNLATWIRQ 166
Query: 197 KYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
KYP+VV ++ASSAP+ +T++ + + D E+C K++ I+ +
Sbjct: 167 KYPNVVYAAVASSAPVY------ATSTFYEFLDVIYNDMGEKCGNAWKEATDSIEELFKT 220
Query: 257 PDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA---QYDTPSNIPVKRICNAIENAPN 313
G A L FKTCT +K +L + + + QY+ ++ ++ +CN +
Sbjct: 221 DSGKAQLKNDFKTCTEIKEEDDLTILIQQIQATMVNYPQYNGSYSLTIEGVCNILTTEGK 280
Query: 314 CGDDILCKIAAGVVE-----------ADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCS 362
+ + + + AD L D N++ +R S W WQ CS
Sbjct: 281 TAYENMVDLMSHAFNEFGFECAPSSYADMLT-DMANTKTEEEGNRLA--STRSWAWQICS 337
Query: 363 EMVVPMGKDKNSMYQP--EPWNLTKYIKN------CKEQYGVSPRPSWVLTY-----YGG 409
E S +QP E +K + N CK+ + V + Y YGG
Sbjct: 338 EY---------SYFQPVNESLPFSKRLNNEFYYLLCKDIFNVDKQRLDRRVYHTNLMYGG 388
Query: 410 HDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSD 469
+ K +N+ +++G DP+S +T ++ + +HC DL K +D
Sbjct: 389 YKPK------ATNVAYTSGSTDPWSPLAKHETLPSDINCYASYIKGTAHCADLYAEKDTD 442
Query: 470 PDWLVQQRKTEVKIMQGWITQY 491
P+ L QQR + + I++Y
Sbjct: 443 PEQLKQQRMETAQFIDELISRY 464
>gi|125980321|ref|XP_001354185.1| GA22150 [Drosophila pseudoobscura pseudoobscura]
gi|195174498|ref|XP_002028010.1| GL15050 [Drosophila persimilis]
gi|54642489|gb|EAL31237.1| GA22150 [Drosophila pseudoobscura pseudoobscura]
gi|194115732|gb|EDW37775.1| GL15050 [Drosophila persimilis]
Length = 508
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 125/473 (26%), Positives = 221/473 (46%), Gaps = 57/473 (12%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW-GGGNSPILAFLGAEAPIDDNIQLSGF 110
E ++ Q +DHF + +T+ QRY +N+ ++ +PI +G E G
Sbjct: 57 EDLWFEQRLDHFQ--ARNTRTWQQRYFVNADYYRNDSTAPIFLMIGGEGEASAKWMREGA 114
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
A F AL + LEHR+YGKS P +R + N + +S QALAD A+ + +K
Sbjct: 115 WVHYAEHFDALCIQLEHRFYGKSHP--TRDLSTANL---AFLSSEQALADLANFVAAMKV 169
Query: 171 KYN-ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
KYN A +A G SY G LA W R KYPH++ GS+++S P+L D + Y VV
Sbjct: 170 KYNLAETQKWVAFGGSYPGSLAAWAREKYPHLIYGSISTSGPLLAEVD---FREYFEVVK 226
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-------LKDG 282
+ +C + + +S+++++ + G L +KFKTCTP+K++ E L +
Sbjct: 227 ASLATYNPDCVEAVTRSFTQVEILLKHMIGQRNLDEKFKTCTPIKDSIENQLDIASLFEN 286
Query: 283 LDTVYSEAAQYD---TP-SNIPVKRICNAIENAPN--------CGDDILCKIAAGVVEAD 330
L ++ QY+ +P + I + IC+ + N + +D+L K + A
Sbjct: 287 LAGNFAGVVQYNKDNSPHATITIDEICDVMLNMTSGPPVTRLGVVNDMLLKQS----NAT 342
Query: 331 SLEYDGNNSRCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYI 387
L+Y + + E+ +G W +Q+C+E + + + + + +I
Sbjct: 343 CLDYKYDKMVSDMRNASWDSEAAKGMRQWTYQTCNEFGFYQTSENKTDTFGDRFGVDFFI 402
Query: 388 KNCKEQYGVSPRPSWVLT-------YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
+ C + + S ++ +YG L+ T+N+++ +G DP+ G VK
Sbjct: 403 RQCMDVFSNSMDARYLQNVVSQTNKHYGA------LKPETTNVLYVHGSIDPWHALGLVK 456
Query: 441 TYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEV----KIMQGWIT 489
+ + + + +HC ++ E K+DP LV R V K+++G+ T
Sbjct: 457 SSNAATPTIF--IEGTAHCANMYEPTKTDPPQLVAARNKIVKYLAKLLEGYTT 507
>gi|405945266|gb|EKC17248.1| Thymus-specific serine protease [Crassostrea gigas]
Length = 519
Score = 151 bits (382), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 128/479 (26%), Positives = 224/479 (46%), Gaps = 50/479 (10%)
Query: 27 LKLRPRLGRIRRS-RILEQKD---SNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSK 82
LK+R + R + EQK S+ E Q IDHF + + + Q+ IN++
Sbjct: 25 LKIREIVESQREEDAVFEQKKLYGSSQIVEEVLLPQPIDHFD--ALNSEMYNQKVYINTE 82
Query: 83 FWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAA 142
W + PI F+G E + + SG E A ++ A++V EHR+YG S+
Sbjct: 83 NWIKPSGPIFLFIGGEGALSNRSAYSGHHVEMAKRYGAMVVAAEHRFYGSSI--NDNGLH 140
Query: 143 LKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVV 202
L H +S Q LAD + +I D+Y T I+ G SY G L+ WFRLKYPH+V
Sbjct: 141 LDQLEH---LSSQQGLADLTRVHKYITDRYELTSNKWISFGGSYPGALSAWFRLKYPHLV 197
Query: 203 IGSLASSAPILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKKSWSEIDNIASKPD 258
G++ASSAP+ T + Y+ VV++ D+ S++C + I +++ ID++ + +
Sbjct: 198 YGAVASSAPVQAQ---TNFEGYNEVVAQSLTDSTVGGSQQCIKQIVEAFQRIDSMI-QAN 253
Query: 259 GLAILSKKFKTCTPLKNTTE---LKDGLDTVYSEAAQYDTP-SNIPVKRICNAIENAPNC 314
L K F +C PL + + L +++ QY+ + ++ +C + + +
Sbjct: 254 QTVQLEKDFLSCGPLSEKNDQMVFVNNLAGIFAGVVQYNNEVPGLNIQHLCKQMTKSDDS 313
Query: 315 GDDI--LCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEG-----WEWQSCSEMVVP 367
++ + K+A ++ ++ +N N ++T D++ G W WQ+CS+
Sbjct: 314 YKNLQMVYKMAMKLLNQSCVDNSYSNFLSQFN-NQTVDQAASGVGIRQWTWQTCSQFGYY 372
Query: 368 MGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPR--PSWVL---TYYGGHDIKLILRRSTS 421
D+ S L + C++ Y +SP P++V YYG + R +
Sbjct: 373 QTCDEGTSCPFSRLLTLESNLVICRDIYKISPSSVPNFVEFTNEYYGAN------RPKGT 426
Query: 422 NIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD---LNLGSHCLDLDEAKKSDPDWLVQQR 477
++F NG DP+ +K+ D+S ++ +N +HC D+ + +DP L R
Sbjct: 427 RVLFVNGSIDPWHFLSVLKS----DVSLNETAVFINGTAHCADMASDRSTDPQSLKDAR 481
>gi|194752101|ref|XP_001958361.1| GF23556 [Drosophila ananassae]
gi|190625643|gb|EDV41167.1| GF23556 [Drosophila ananassae]
Length = 508
Score = 151 bits (381), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 133/501 (26%), Positives = 227/501 (45%), Gaps = 59/501 (11%)
Query: 24 ANGLKLRP-RLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSK 82
+NG P ++ ++RS LE +D ++ Q +DHF P +++ QRY +N+
Sbjct: 35 SNGFLGEPSKVATLQRS--LESED-------LWFEQRLDHFK--PSDTRSWKQRYYLNAD 83
Query: 83 FW-GGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKA 141
+ +PI +G E G A F AL LEHR+YGKS P G
Sbjct: 84 HYRNDSTAPIFLMIGGEGEATAKWMREGAWVHYAEHFGALCFQLEHRFYGKSHPTGDLST 143
Query: 142 ALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAPVIAIGASYGGELATWFRLKYPH 200
A + Y +S QALAD A+ + +K K+N A +A G SY G LA W R KYPH
Sbjct: 144 A-----NLAYLSSEQALADLANFVSAMKVKFNLAESQKWVAFGGSYPGSLAAWAREKYPH 198
Query: 201 VVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGL 260
++ GS++SS P+L D + Y VV +C + + +S+++++ + G
Sbjct: 199 LIYGSISSSGPLLAEVD---FKEYFEVVKASLASYKPDCVEAVTRSFAQVEILLKHMIGQ 255
Query: 261 AILSKKFKTCTPLKNTTE-------LKDGLDTVYSEAAQY---DTP-SNIPVKRICNAIE 309
L +KFKTCTP+K+ E + L ++ QY ++P + I + IC+ +
Sbjct: 256 RNLDEKFKTCTPIKDAIENPLDIANFFENLAGNFAGVVQYNKDNSPHATITIDDICDVML 315
Query: 310 NAPNCGDDILCKIAAGVVEADS----LEYDGNNSRCYINEDRTGDESDEG---WEWQSCS 362
N + G++ +S L+Y + + E+ +G W +Q+C+
Sbjct: 316 NTTIGPPVTRLGVVNGMLLKESNTTCLDYKYDKMVADMKNVSWDSETAKGMRQWTYQTCN 375
Query: 363 EMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT-------YYGGHDIKLI 415
E + S + + + +I+ C + + S ++ +YG
Sbjct: 376 EFGFYQTSENKSDTFGDRFGVDFFIRQCMDVFSESMDAKYLQQAVAQTNKHYGA------ 429
Query: 416 LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQ 475
L+ T+N+++ +G DP+ G VK+ + + D +HC ++ E KSDP LV
Sbjct: 430 LKPETTNVLYVHGSIDPWHALGLVKSANSATPTIYIDGT--AHCANMYEPAKSDPPQLVA 487
Query: 476 QR----KTEVKIMQGWITQYY 492
R K K+++G+ + Y
Sbjct: 488 ARNKITKYLAKLLEGYTSTPY 508
>gi|350422894|ref|XP_003493318.1| PREDICTED: putative serine protease K12H4.7-like [Bombus impatiens]
Length = 478
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 143/505 (28%), Positives = 214/505 (42%), Gaps = 58/505 (11%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
L+++ F H GL G + R L+ K + + Q +DHF+
Sbjct: 5 FLVLLCFTCRLGHGVGLHGFHFHG-LEEPRSLD-KSTCENITELWIRQPVDHFNVRNNC- 61
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
T+ RY NS+++ N PIL +G E I +G YE A + A++ EHRYY
Sbjct: 62 -TWLMRYYENSRYFKK-NGPILIMIGGEWAISKGFLEAGLMYELASAYNAIMYYTEHRYY 119
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGEL 190
GKS P ++R+ Y + QALAD A + K ++ VI G SY G +
Sbjct: 120 GKSKPTED-----TSSRNLQYLSVDQALADLAYFIETRKKDEKLRNSKVIVFGGSYAGNV 174
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI 250
ATW RLKYPH+V G+LASSAP+L D + Y+ VV++ R S++C +K ++ ++
Sbjct: 175 ATWVRLKYPHLVQGALASSAPVLAKVD---FYEYYEVVTESLRRHSQKCMDEVKAAFDDV 231
Query: 251 DNIASKPDGLAILSKKFKTCT-----PLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRIC 305
+ + + G L + F C K+ L + L ++ QYD N K I
Sbjct: 232 EELLAIQGGAQKLKEYFNLCDVPDVHSFKDLAHLGNLLAEEFASIVQYDKVENNRTK-IA 290
Query: 306 NAIEN--APNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDE---GWEWQS 360
EN A G L ++A V D + N + + D + W +Q+
Sbjct: 291 ACCENMTASYLGSP-LQRLAHLVSNKDKCLKNNYNKFVEVYRNEIWDSQPDIMRLWFYQT 349
Query: 361 CSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIK 413
C+E + + L + C + YG + R T YGG
Sbjct: 350 CTEYGYYQTTNSRKSVFGSLFPLPYFTGLCTDLYGYYYGNRFLYTRIGRTNTMYGG---- 405
Query: 414 LILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL---------GSHCLDLDE 464
LR N+IF+NG DP+ LS QDLN SHC DL
Sbjct: 406 --LRPDLQNVIFTNGDVDPWHA-----------LSVLQDLNEFSPAILIKGSSHCRDLYS 452
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWIT 489
+D + L++ R KI+ WI+
Sbjct: 453 DLDTDVEDLIRARARVRKIIGTWIS 477
>gi|109070008|ref|XP_001094051.1| PREDICTED: thymus-specific serine protease isoform 2 [Macaca
mulatta]
Length = 514
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 129/471 (27%), Positives = 205/471 (43%), Gaps = 66/471 (14%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W G + PI LG E + + G
Sbjct: 61 WLEQLLDPFNV--SDRRSFLQRYWVNEQHWVGEDGPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L+ A+ R + +S ALAD S L + +N
Sbjct: 119 APAWGALVISLEHRFYGLSIPAG----GLEMAQLR-FLSSRHALADVVSARLALSRLFNI 173
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LA W RLK+PH++ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 174 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD---FSEYNDVVSRSLM 230
Query: 234 DT----SEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDT 285
T S EC + +++E++ + A L + C L +N EL L
Sbjct: 231 STAIGGSLECRAAVSAAFAEVERRLRLGGAAQAALRSELSACGTLGSAENQAELLGALQA 290
Query: 286 VYSEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYD 335
+ A QYD + P V+++C N + P CG +I + L +
Sbjct: 291 LVGGAVQYDGQAGAPLSVRQLCGLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQKCLSFS 350
Query: 336 GNNSRCYI--NEDRTGDESDEGWEWQSCSE--MVVPMGKDKNSMYQPEPWNLTKYIKNCK 391
+ + E + D W +Q+C+E V G + Q L + C+
Sbjct: 351 RAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCGNPRCPFSQLP--ALPSQLDLCE 408
Query: 392 EQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
+ +G+S + +YYGG + ++F NG DP +H
Sbjct: 409 KVFGLSALSVAQAVAQTNSYYGGQT------PGANQVLFVNGDTDP---------WHV-- 451
Query: 447 LSFSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
LS +Q L GSHCLD+ + SD L R+ + +Q W+
Sbjct: 452 LSVTQALGSSESALLIRTGSHCLDMAPERPSDSPSLRLGRQNIFQQLQTWL 502
>gi|355684809|gb|AER97524.1| dipeptidyl-peptidase 7 [Mustela putorius furo]
Length = 316
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 95/272 (34%), Positives = 145/272 (53%), Gaps = 9/272 (3%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F+ ++ Q +DHF++ +TFPQR++++ KFW G PI + G E + SGF
Sbjct: 36 FQEGYFEQLLDHFNFERFGNKTFPQRFLVSEKFWKKGKGPIFFYTGNEGDVWSFANNSGF 95
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
E A Q +AL+V EHRYYGKS+PFG R H QALAD+A ++ ++
Sbjct: 96 IQELAAQQEALVVFAEHRYYGKSLPFGDRST---RRGHTELLTVEQALADFARLIRALQR 152
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
A +PVIA G SYGG L+ + R+KYPH+V G+LA+SAP++ + + VS
Sbjct: 153 DLGAYDSPVIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVAVAGLGDSYQFFRDVSA 212
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVY 287
DF S +C Q ++ ++ +I ++ S+ +S++F TC PL ++ T+L +
Sbjct: 213 DFEGQSPKCAQGVRDAFRQIKDLFSQ-GAYDTVSREFGTCQPLSSREDLTQLFGFARNAF 271
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCGDDIL 319
+ A D P P I + N G D L
Sbjct: 272 TVLAMMDYP--YPTDFIGHLPANPVKVGCDRL 301
>gi|323451280|gb|EGB07158.1| hypothetical protein AURANDRAFT_2013, partial [Aureococcus
anophagefferens]
Length = 451
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 130/475 (27%), Positives = 214/475 (45%), Gaps = 64/475 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSG-FTYE 113
+ Q +DHF + + +TF Q+ +++ W P+L + G E I+D SG +E
Sbjct: 1 YVTQQLDHFRF--DETRTFSQKLLVHDA-WHRPGGPLLMYFGNEGAIEDFYGNSGGLMFE 57
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A + A + LEHRYYG S+PFG+ A + + QALAD A +L +
Sbjct: 58 LAPKLNASVAFLEHRYYGSSLPFGN---ASYGSDELAFLTVEQALADMALVLATSSEILG 114
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI-LYYGDITPWTTYHSVVSKDF 232
A P + G SYGG LA WF LKYPH+ G++A+SAP+ LY G+ + + +
Sbjct: 115 AADGPAVLFGGSYGGMLAAWFMLKYPHLAAGAVAASAPVDLYPGEGKERPFFDAGLEVYG 174
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG---------- 282
S C ++ + + + A G L++ F+TC PL + DG
Sbjct: 175 TYGSAACEADLRAALAALAAAAKTAAGRDALARSFRTCEPLPDPV---DGDRLASYVNGA 231
Query: 283 ------LDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDG 336
LD Y+ A P+N PV+ C + AP+ + G V+ L + G
Sbjct: 232 LSTLAMLDYPYASAFVAPMPAN-PVRVACGRVAAAPSAASKL-----KGAVDV-FLNHTG 284
Query: 337 NNSRCYINED-----------RTGDESDEGWEWQSCSEMVV-PMGKDKNSMYQPE-PWNL 383
+ CY R D W +Q+C+E+ + P+ D + P+ P L
Sbjct: 285 ETA-CYDARRELLAAPGAPPLRALGAIDRPWNYQACTELPLEPLTSDGFGFFVPQSPKAL 343
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYG-GHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
+ C++++GV+PRP W+ +G G + LR N++F++G +DP+ GG
Sbjct: 344 AEVEAACRDRFGVAPRPDWLRQSFGDGAQLAASLR----NVVFTDGDKDPWRVGGVPGDA 399
Query: 443 HFFDLSFSQDLNL-------GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+ S+D ++ +H DL + +D +V R E + + W+ +
Sbjct: 400 R----ALSRDGSVVHVLIADAAHHQDLFASDPADSPGVVAARVLEFEHVSRWVAR 450
>gi|80476792|gb|AAI08760.1| MGC85068 protein [Xenopus laevis]
Length = 457
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 129/449 (28%), Positives = 211/449 (46%), Gaps = 30/449 (6%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
F Q +DHF+ + T+ QRY IN ++W + P+ ++G E + + LSG E
Sbjct: 15 FIVQPLDHFNR--RNNGTYNQRYWINEQYWNYPDGPVFLYIGGEGSLSEFSVLSGEHVEL 72
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A +AL+V LEHR+YG S+ L+N + + +S QALAD AS + I KYN
Sbjct: 73 AQTHRALLVSLEHRFYGSSINIDG--LTLENIK---FLSSQQALADLASFHMFISQKYNL 127
Query: 175 THAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
T I G SY G L+ WFRLK+PH+V ++ASSAP+ D +T Y+ VV+
Sbjct: 128 TRQNTWICFGGSYPGSLSAWFRLKFPHLVYAAVASSAPVRAELD---FTGYNKVVAWSLA 184
Query: 234 DT----SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNT---TELKDGLDTV 286
D SE+C +K+ + +D++ K + + L K F +C L+ + TE L +
Sbjct: 185 DPVIGGSEKCLDAVKEGFHAVDSLIQKGN-VTQLEKDFYSCGSLQGSDDYTEFVGNLADI 243
Query: 287 YSEAAQYDTPSNIP-VKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINE 345
+ A QY+ S I V+ IC + N + L + + + L N+ + +
Sbjct: 244 FMGAVQYNGMSPISNVQNICQLMTTKDNSAYEGLRSVNKMYMNSMGLSCISNSHAKSVAD 303
Query: 346 DRTGDESDEG-----WEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPR 399
+ S G W +Q+C+E +D + + P L + C + + V P
Sbjct: 304 LSSTKLSLIGVGERQWYYQTCTEFGYYQTCEDPSCPFSPL-ITLKSQLDLCFQIFQV-PT 361
Query: 400 PSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHC 459
S + + ++ +S IIF NG DP+ +K +++ +N SHC
Sbjct: 362 ESVLQSVQFTNEFYGADFPKSSRIIFVNGDVDPWHALSVLKNQSRSEIAIF--INGTSHC 419
Query: 460 LDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+++ + SDP L + RK + W+
Sbjct: 420 ANMNPSSTSDPLSLQEARKEIATQVATWL 448
>gi|296491747|tpg|DAA33780.1| TPA: dipeptidyl peptidase 7 [Bos taurus]
Length = 335
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 91/252 (36%), Positives = 138/252 (54%), Gaps = 13/252 (5%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F+ ++ Q +DHF++ +TF QR+++ KFW G PI + G E + SGF
Sbjct: 36 FQEAYFEQLLDHFNFERFGNKTFLQRFLMTEKFWNRGEGPIFFYTGNEGDVWSFANNSGF 95
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY---FNSAQALADYASILLH 167
E A Q AL+V EHRYYGKS+PFG R RGY QALAD+A +L
Sbjct: 96 ILELAEQQGALVVFAEHRYYGKSLPFGERST------WRGYTELLTVEQALADFAGLLRA 149
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
++ + A AP IA G SYGG L+ + R+KYPH+V G+LA+SAP++ + +
Sbjct: 150 LRQELEAPDAPAIAFGGSYGGMLSAYLRIKYPHLVAGALAASAPVVSAAGLGDPYQFFQD 209
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLD 284
VS DF+ S EC + ++ ++ +I ++ + ++S++F TC PL K+ T+L
Sbjct: 210 VSADFQGQSPECARAVQDAFRQIRDLFQQ-GAPHVVSQEFGTCQPLSGPKDLTQLFGFAR 268
Query: 285 TVYSEAAQYDTP 296
++ A D P
Sbjct: 269 NAFTVLAMMDYP 280
>gi|348537748|ref|XP_003456355.1| PREDICTED: putative serine protease K12H4.7-like [Oreochromis
niloticus]
Length = 510
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 135/486 (27%), Positives = 222/486 (45%), Gaps = 62/486 (12%)
Query: 40 RILEQKDSNHG--FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGA 97
R E +D+ G FE ++ Q +DHF+ G ++ + + Q Y IN F+ G P+ +G
Sbjct: 52 RFKEAQDTKQGSLFEEQWFTQKLDHFN-GADT-RVWKQMYFINEAFYRPGG-PVFLMIGG 108
Query: 98 EAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQA 157
E P + G A + AL ++LEHR+YGKS P + + N R + +S QA
Sbjct: 109 EGPANPAWMEHGTWLTYAEKLGALCLMLEHRFYGKSHP--TMDLSTDNLR---FLSSRQA 163
Query: 158 LADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGD 217
LAD A I T +A G SY G LA WFRLKYPH+V S+A+SAP+ Y
Sbjct: 164 LADLAHFRTMIAKARGLTDRKWVAFGGSYPGSLAAWFRLKYPHLVHASVATSAPV--YAT 221
Query: 218 ITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT 277
+ + Y VV + + EC +KK+ ++ P ++K F C+ L+ T
Sbjct: 222 VN-FPEYLEVVWRSLASENTECPLLVKKASDDLVERLKDPRTYDNITKDFNLCSKLQIQT 280
Query: 278 ELKDG--LDTV---YSEAAQYD---------TPSNIPVKRICNAIENAPNCGDDI--LCK 321
++ L+T+ + + QY+ T +NI +K +C+ + ++ + GD
Sbjct: 281 DMDSAYFLETLAGNFMDVVQYNEDNRGFEGLTGTNITIKVLCSVMTDS-SLGDPYARYAA 339
Query: 322 IAAGVVEADSLEYDGNNSRCYINE------DRTGDESDEGWEWQSCSEMVVPMGKDK-NS 374
+A ++E S++ + Y+++ D + W +Q+C+E D N
Sbjct: 340 VARLMMETFSMKCLDTSFSKYVSDMTNTSWDGPAAGAGRQWVYQTCAEFGFYQSTDSPNQ 399
Query: 375 MYQPEPWNLTKYIKNCKEQYGVSP-RPSWVLT----YYGGHDIKLILRRSTSNIIFSNGM 429
+ P L ++K C Y +S + + +T YYG ++I+ +S I+F NG
Sbjct: 400 PFAGFP--LVYFLKQCTNFYNISAEQVAEAVTQSNEYYGAYNIR------SSRIVFPNGS 451
Query: 430 RDPFSRGGWVKTYHFFDLSFSQDLNL-----GSHCLDLDEAKKSDPDWLVQQRKTEVKIM 484
DP+ G + S DL SHC ++ A+ DP L R ++
Sbjct: 452 IDPWHALGITQ-------DISADLPAIFIKGTSHCANMYPARSDDPPQLTLARDHIFLLL 504
Query: 485 QGWITQ 490
Q W+ Q
Sbjct: 505 QQWLKQ 510
>gi|148236037|ref|NP_001089218.1| uncharacterized protein LOC734265 precursor [Xenopus laevis]
gi|57920938|gb|AAH89148.1| MGC85068 protein [Xenopus laevis]
Length = 506
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 135/490 (27%), Positives = 227/490 (46%), Gaps = 42/490 (8%)
Query: 25 NGLKLRPRLGRIRRSRILEQKDSNH-----------GFETFFYNQTIDHFSYGPESYQTF 73
NG ++R +L +++ + + +N E+F Q +DHF+ + T+
Sbjct: 24 NGRQIRNKLHQLKETENFKSFYTNRWMPKGAFPNTPSVESFIV-QPLDHFNR--RNNGTY 80
Query: 74 PQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKS 133
QRY IN ++W + P+ ++G E + + LSG E A +AL+V LEHR+YG S
Sbjct: 81 NQRYWINEQYWNYPDGPVFLYIGGEGSLSEFSVLSGEHVELAQTHRALLVSLEHRFYGSS 140
Query: 134 VPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV-IAIGASYGGELAT 192
+ L+N + + +S QALAD AS + I KYN T I G SY G L+
Sbjct: 141 INIDG--LTLENIK---FLSSQQALADLASFHMFISQKYNLTRQNTWICFGGSYPGSLSA 195
Query: 193 WFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKKSWS 248
WFRLK+PH+V ++ASSAP+ D +T Y+ VV+ D SE+C +K+ +
Sbjct: 196 WFRLKFPHLVYAAVASSAPVRAELD---FTGYNKVVAWSLADPVIGGSEKCLDAVKEGFH 252
Query: 249 EIDNIASKPDGLAILSKKFKTCTPLKNT---TELKDGLDTVYSEAAQYDTPSNIP-VKRI 304
+D++ K + + L K F +C L+ + TE L ++ A QY+ S I V+ I
Sbjct: 253 AVDSLIQKGN-VTQLEKDFYSCGSLQGSDDYTEFVGNLADIFMGAVQYNGMSPISNVQNI 311
Query: 305 CNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEG-----WEWQ 359
C + N + L + + + L N+ + + + S G W +Q
Sbjct: 312 CQLMTTKDNSAYEGLRSVNKMYMNSMGLSCISNSHAKSVADLSSTKLSLIGVGERQWYYQ 371
Query: 360 SCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRR 418
+C+E +D + + P L + C + + V P S + + ++
Sbjct: 372 TCTEFGYYQTCEDPSCPFSPLI-TLKSQLDLCFQIFQV-PTESVLQSVQFTNEFYGADFP 429
Query: 419 STSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRK 478
+S IIF NG DP+ +K +++ +N SHC +++ + SDP L + RK
Sbjct: 430 KSSRIIFVNGDVDPWHALSVLKNQSRSEIAIF--INGTSHCANMNPSSTSDPLSLQEARK 487
Query: 479 TEVKIMQGWI 488
+ W+
Sbjct: 488 EIATQVATWL 497
>gi|291236175|ref|XP_002738015.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 515
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 139/527 (26%), Positives = 230/527 (43%), Gaps = 76/527 (14%)
Query: 11 LLLMVIFVSTSFHANGL------KLRPRLGRIRRSRILEQK--------DSNH----GFE 52
LLL++ F +N + K + ++G+ ++ R L+ K ++H FE
Sbjct: 4 LLLVLSFTCVFEISNSILGTHFWKFQEKVGQYKKKRFLKNKLRWQKVFSPNSHLMYSEFE 63
Query: 53 TFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY 112
+ Q +DHF P + QRY +N +W N P+ F+G E + G
Sbjct: 64 DLYLEQPLDHFD--PLVTDIYEQRYWVNPTYWNKENGPVFLFIGGEGALGAYDVEEGEHV 121
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
+ A ++ ALI +EHR+YG S+ K LK + Y +S QALAD AS KY
Sbjct: 122 DLAKKYGALIFAVEHRFYGASI----NKDGLK-LEYLQYLSSQQALADLASFHRFATSKY 176
Query: 173 NATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
N T + + I G SY G L+ WFRLKYPH+V G++ASSAP+ + + Y+ VV+
Sbjct: 177 NITQSNIWICFGGSYPGSLSAWFRLKYPHLVYGAIASSAPVRV---VKNFEGYNQVVAAS 233
Query: 232 FRDT----SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLD 284
D S +C I ++ ID K L FK+C + +T + L
Sbjct: 234 LADPVVKGSLKCSDNIAAAFKIIDQ-KIKDKQFDTLKADFKSCNNISSYNDTALFLNNLA 292
Query: 285 TVYSEAAQYDTPSNIP---VKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC 341
++ QY+ + +P V +C + + D++ V+ ++ DG C
Sbjct: 293 GIFMGIVQYN--NEMPDWNVAAVCQNMTQPASPYDNL--------VKFTTIYLDGMGQEC 342
Query: 342 YIN---------EDRTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPW-NLTKYIK 388
+ N ED T E G W +Q+CS+ D+N+ P +L ++
Sbjct: 343 FDNSYDNFIQELEDTTPTEEGVGVRQWTYQTCSQFGYYQTCDQNTTCLFSPLIDLKSSLE 402
Query: 389 NCKEQYG-----VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
C +G V + + +YYG + K + I+F NG DP+ ++
Sbjct: 403 VCTTVFGIHGKIVDKQVDFTNSYYGANHPK------GTRIVFVNGSIDPWHALSVLRNES 456
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+S +N +HC ++ + +DP LV+ R+ + W+ +
Sbjct: 457 PSQISIY--INGTAHCANMKSQQPTDPPSLVEARQKIDAQIGEWLNE 501
>gi|426363699|ref|XP_004048972.1| PREDICTED: dipeptidyl peptidase 2 [Gorilla gorilla gorilla]
Length = 495
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 119/470 (25%), Positives = 204/470 (43%), Gaps = 42/470 (8%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP--- 100
++ GF+ F+ Q +DHF++ +TFPQR++++ +FW G PI + G A
Sbjct: 24 RRAPEPGFQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGEGPIFFYTGTRASGSG 83
Query: 101 ----IDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQ 156
+ + S + + A Q + + P S G
Sbjct: 84 FRLSVASPRKPSHLSSQPAAQLGCAATL-------AAFPSESCTRTWPRVGICGSQKCRP 136
Query: 157 ALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYG 216
+ ++ A + SYGG L+ + R+KYPH+V G+LA+SAP+L
Sbjct: 137 PCGTCGPLRRKRGNEVLGAAALTLCPSPSYGGMLSAYLRMKYPHLVAGALAASAPVLAVA 196
Query: 217 DITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--- 273
+ + V+ DF S +C Q +++++ +I ++ + + ++F TC PL
Sbjct: 197 GLGDSNQFFRDVTADFEGQSPKCTQGVREAFRQIKDLFLQ-GAYDTVRREFGTCQPLSDE 255
Query: 274 KNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA---AGVVEAD 330
K+ T+L ++ A D P P + N G D L A G+
Sbjct: 256 KDLTQLFMFARNAFTVLAMMDYP--YPTDFLGPLPANPVKVGCDRLLSEAQRITGLRALA 313
Query: 331 SLEYDGNNSR-CY-------INEDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQP 378
L Y+ + S CY D TG W++Q+C+E+ + + M+
Sbjct: 314 GLVYNASGSEHCYDIYRLYHSCADPTGCGTGPDARAWDYQACTEINLTFASNNVTDMFPD 373
Query: 379 EPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGW 438
P+ + C + +GV PRP W+LT + G D+ R+ SNIIFSNG DP++ GG
Sbjct: 374 LPFTDELRQRYCLDTWGVWPRPDWLLTSFWGGDL-----RAASNIIFSNGNLDPWAGGGI 428
Query: 439 VKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ ++ + + G+H LDL + DP +V+ RK E ++ W+
Sbjct: 429 QRNLSASVIAVT--IQGGAHHLDLRASHPEDPASVVEARKLEATVIGEWV 476
>gi|384253205|gb|EIE26680.1| hypothetical protein COCSUDRAFT_11918 [Coccomyxa subellipsoidea
C-169]
Length = 395
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 114/392 (29%), Positives = 185/392 (47%), Gaps = 53/392 (13%)
Query: 12 LLMVIFVSTSFHANGLKLR------PRLGRIRRSRILEQKDSNHGF---ETFFYNQTIDH 62
L++ + + ++F ++R PR+ SR + + + + T+DH
Sbjct: 7 LVLCLIICSAFSGGQCQVRRSRKGPPRIPYKNSSRTQLTSNGREPITKCKVYTRDATLDH 66
Query: 63 FSYG--PESYQTFPQRYVINSKFW----GGGNSPILAFLGAEAPIDDNIQLSGFTYENAH 116
FS+ P++ TF QRY + + W G PI ++G EA + + +G +ENA
Sbjct: 67 FSWATPPDNRTTFKQRYFLCNDHWKSHKDGTRGPIFFYVGNEADVTLYLNATGLMWENAA 126
Query: 117 QFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATH 176
F AL+V EHRYYG+S PF K AL++ H Y S QA+AD+A +++ +K+ A
Sbjct: 127 AFGALLVFAEHRYYGESKPF---KKALRH--HMQYLTSEQAMADFAELIMELKEDLGAQS 181
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW---TTYHSVVSKDFR 233
+ VI G SYGG LATW R+KYPH++ G++A SAPI Y P +Y +V+ D
Sbjct: 182 SAVIGFGGSYGGMLATWMRIKYPHILDGAIAGSAPIWSYLGEEPAYDSGSYAKIVTADAS 241
Query: 234 D---TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK-DGLDTVYSE 289
+ ++ C +++ W++ K I + P ++ L +G+ Y
Sbjct: 242 EAGGSAPACASNVREVWNQGSWAVQK-----ICLDMRRGNYPYPSSYILNGNGILPAY-- 294
Query: 290 AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCYINEDRT 348
PV+ C ++ G D+L A +A + Y+ + CY
Sbjct: 295 ----------PVRVACESLRQEDLAGSDLLSAFA----DALGVFYNYTEDVECYDFGAGP 340
Query: 349 GDESDEG---WEWQSCSEMVVPMGKD-KNSMY 376
E+DE W++Q C+E P KD K+ MY
Sbjct: 341 NPETDEDGSFWDYQWCTEQFQPFSKDGKHDMY 372
>gi|156537791|ref|XP_001608051.1| PREDICTED: putative serine protease K12H4.7-like [Nasonia
vitripennis]
Length = 476
Score = 149 bits (376), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 144/515 (27%), Positives = 218/515 (42%), Gaps = 89/515 (17%)
Query: 12 LLMVIFVSTS------FHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSY 65
L++ +F S + FH GL+ +G R L+QK + Q +DHF++
Sbjct: 12 LVLALFTSVNAVGFHGFHFKGLEEPAWIG----PRKLDQK---------WIEQPLDHFNH 58
Query: 66 GPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVIL 125
+T+ RY K++ G PI LG E I+ +G ++ A Q AL+
Sbjct: 59 --RDNRTWQMRYYEEDKYFNG-IGPIFIMLGGEWTINPGFLQNGLMHDLAKQHGALMFYT 115
Query: 126 EHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGAS 185
EHRYYGKS P ++ + N + Y N QALAD A + + K +YN T + VI G S
Sbjct: 116 EHRYYGKSYP--TQNMSSDNMQ---YLNVDQALADVAYFIDNRKSEYNITDSKVIVFGGS 170
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
Y G +A W R+KYPH++ GS+ASSAP+ D + Y+ VV+ R +C ++
Sbjct: 171 YAGNMAAWIRIKYPHLIQGSVASSAPVYAKAD---FYEYYEVVANSLRRHDSQCALDVEN 227
Query: 246 SWSEIDNIASKPDGLAILSKKFKTC-TPLKNTTE----LKDGLDTVYSEAAQYDTP---- 296
++ E + + G + K F C TP N+ + L V++ A QY+
Sbjct: 228 AFDETEELLVTEGGPEKIQKIFNICKTPNVNSMTDVGYFMNFLSEVFASAVQYNKVVNGM 287
Query: 297 SNI--------------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY 342
SNI P++R+ I + P C D V+ + D S
Sbjct: 288 SNIGQLCDTMTSASIGKPIERLAYLIRSGPKCKD----------VDYKDMIKDLRMSSWS 337
Query: 343 INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG------- 395
+ R W +Q+C+E + + NL ++ CK+ YG
Sbjct: 338 TSAMRQ-------WYFQTCTEFGYYQTANSSKSAFGRLVNLDFFVNICKDVYGDYYEREL 390
Query: 396 VSPRPSWVLTYYGGH--DIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL 453
+ S YGG DIK N+IF NG DP+ +K + F + +
Sbjct: 391 LDSGISRTNIMYGGRLPDIK--------NVIFVNGDVDPWHALSVLKDVNEFSPAIL--I 440
Query: 454 NLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
SHC DL D L RK I+ GW+
Sbjct: 441 QGSSHCQDLQADSAGDVPELRTARKKIRNIVSGWL 475
>gi|118361433|ref|XP_001013945.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89295712|gb|EAR93700.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 873
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 126/464 (27%), Positives = 209/464 (45%), Gaps = 46/464 (9%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F T N T DHF+ +T+ QRY + +++ N +L ++ E + F
Sbjct: 425 FSTRIQNLT-DHFNITNN--RTWSQRYWVLDQYYNPQNGSVLLYICGEYTCPGIPEERQF 481
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYF-NSAQALADYASILLHIK 169
A +F +L+++LEHR+YG S+PFG + +H Y N QALAD A + ++K
Sbjct: 482 PILLAQKFSSLVLVLEHRFYGNSMPFGDQSMK----QHNLYLLNVDQALADLAYFITYVK 537
Query: 170 DKY---NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHS 226
D + H P + IG SY G ++ WFR KYPH+ +G+LASSA + D +
Sbjct: 538 DHHLHGVQNHIPWLTIGGSYPGAMSAWFRYKYPHLTVGALASSAVVNAILDY--YQMDQQ 595
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG---- 282
V+ R + E+C Q+I + N+ P + K+F N L +G
Sbjct: 596 VILSALR-SGEKCAQSIHDLNIYVQNLLQNPTSAYEIKKQF-------NAEHLNNGEFLY 647
Query: 283 -LDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC 341
+++ QY + + +CN N P + ++ Y G+
Sbjct: 648 FYTDIFTGMVQYGSRT-----VLCNQTLNYPTIEQQYQSILNYTKENNVTVNYYGS---Y 699
Query: 342 YINEDRTGDESD--EGWEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GV 396
Y+ D E+D W WQ C+E + + + NL + CK+ + +
Sbjct: 700 YLRNDTYDPENDGSRQWTWQYCTEFGFFQTCSNPQTGSRSTEVNLDMFTNFCKQSFTQDI 759
Query: 397 SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
P PS V YGG ++K +N+I +NG+ DP+ G ++ +S+ D +
Sbjct: 760 FPNPSRVNIQYGGVNLK------ATNLILTNGIEDPWRWAGLQQSSGDI-VSYLIDCDDC 812
Query: 457 SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
+HC+DL K++D L Q R+ V+ WI ++YD + N+
Sbjct: 813 AHCVDLYTPKETDALVLKQTREKIVEHFSQWIKEHYDSLEKSNQ 856
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 45/194 (23%), Positives = 81/194 (41%), Gaps = 31/194 (15%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVI--NSKFWGG--GNSPILAFLGAEAPIDDNIQL 107
E +F Q DHFS +++ + QRY I N K G G I+ + D +
Sbjct: 29 EHYFNEQRYDHFS---NNFELWDQRYFIAKNEKSQNGQLGKVNIIFVCDKDLTHDILSCI 85
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
F I +LE RYYG+S P+ SR + + Y + Q +AD A +
Sbjct: 86 PPFFDSQRRNSDVNIFLLEMRYYGESQPYSSRYLGID---YLSYQSIQQNIADIALFVSF 142
Query: 168 I-KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHS 226
+ KD ++ + ++KYPH++ G +A ++ ++ + Y+
Sbjct: 143 LKKDNMVSSDSK----------------KIKYPHLIDGVIAFNSQLVNIN----YEQYNQ 182
Query: 227 VVSKDFRDTSEECF 240
++ + T+ +C
Sbjct: 183 ILDQQLSQTNPQCL 196
>gi|115623598|ref|XP_783667.2| PREDICTED: putative serine protease K12H4.7-like
[Strongylocentrotus purpuratus]
Length = 492
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 124/465 (26%), Positives = 207/465 (44%), Gaps = 50/465 (10%)
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYE 113
++ Q +DHF+ +T+ QR+ +N F+ G P+ +G E + + G E
Sbjct: 50 MWFKQKLDHFN--DADLRTWNQRFFLNGTFYTPG-GPVFLMIGGEGEANPVWMVEGAWME 106
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A + KA +++EHR+YGKS P + ++ N + Y +S QALAD A I +
Sbjct: 107 YAKEMKAFCIMVEHRFYGKSHP--TENMSVDNLQ---YLSSEQALADLAHFRTVIGQQLK 161
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
I+ G SY G L+ WFRLKYPH+V+G++A+SAP+ D + T V +D
Sbjct: 162 FDTNKWISFGGSYPGSLSAWFRLKYPHLVVGAIATSAPVQAQLDFPEYLT----VVRDSL 217
Query: 234 DTSE---ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT-----ELKDGLDT 285
TS +C I+ + +I+++ G L+K F C+PL T+ + L
Sbjct: 218 ATSRVGSKCNDAIQAATHDIESLMMHMTGWEALAKIFNLCSPLNGTSMNDVANFYETLAG 277
Query: 286 VYSEAAQYDT--------PSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGN 337
+ E QY+ +N + +C+ + N + G ++ A + +D D +
Sbjct: 278 NFMETVQYNKDNKAFEGGSNNNAIDTLCDIMVNE-SIGTELSRYAAVSALLSDGECVDAS 336
Query: 338 NSRCYINEDRTG--DESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKE 392
+ + T E+ G W +Q+C+E D + + L+ I+ C++
Sbjct: 337 YDKMIADLRETSWKSEASAGGRQWTYQTCTEFGFYQTTDTDQQPFGRHFPLSLSIQMCED 396
Query: 393 QYGVSPRPS-------WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
YG + + T YGG DI NI+F NG DP+ G K+ +
Sbjct: 397 IYGKQFNKTTNQAGVNFTNTNYGGRDIVAF------NIVFPNGSIDPWHALGITKSTDMY 450
Query: 446 DLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
F + +HC ++ A DP L Q R +Q W++Q
Sbjct: 451 SAIFIKGT---AHCANMYPASPDDPAELTQARTQIRDTIQKWLSQ 492
>gi|66800673|ref|XP_629262.1| hypothetical protein DDB_G0293014 [Dictyostelium discoideum AX4]
gi|60462657|gb|EAL60859.1| hypothetical protein DDB_G0293014 [Dictyostelium discoideum AX4]
Length = 486
Score = 148 bits (374), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 138/505 (27%), Positives = 223/505 (44%), Gaps = 51/505 (10%)
Query: 11 LLLMVIFVSTSFHANGLKL---RPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGP 67
LLL+++ VS + K+ R L + R + ++ Q++DHF+ P
Sbjct: 6 LLLVLVIVSIIGSIDSKKVFNVRTDLKNQLKFRHFDDSLDYDAINYQWFTQSVDHFN--P 63
Query: 68 ESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNI--QLSGFTYENAHQFKALIVIL 125
+ TF QRY+IN ++W G P+ + E P+D N QL + A Q AL+V L
Sbjct: 64 ANPTTFQQRYLINDQYWDG-TGPVFIMINGEGPMDINTVTQLQFVVW--AKQVSALVVSL 120
Query: 126 EHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA-THAPVIAIGA 184
EHRYYG S F + +L+N + + NSAQALAD A + +YN + I+ G
Sbjct: 121 EHRYYGAS--FVTEDLSLENLQ---WLNSAQALADNAVFRNFVAQQYNVPKESKWISFGG 175
Query: 185 SYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIK 244
SY G L +WFR+KYPH+V ++ASSAP+ + + + C + I
Sbjct: 176 SYSGALTSWFRIKYPHLVDATIASSAPVNPEVNFYQYLETVQTALLASKSNGNLCVENIN 235
Query: 245 KSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV---YSEAAQY--DTPSNI 299
+ +I + S+ D + + F CTPL N ++ + ++ + QY + P I
Sbjct: 236 IATQKIQALLSQ-DNYGGVDQMFNLCTPLGNQNDVATFMQSLAGNFMGVVQYNDEEPGQI 294
Query: 300 PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGN-NSRCYINEDRTGDESDEG--- 355
+ +CN + N D + I AD D + S N++ T DE+ G
Sbjct: 295 DIDYLCNIMTN--QSSDPLTNYIQIWDQYADGECVDVSYASMIAQNQNVTNDENAIGGRM 352
Query: 356 WEWQSCSEMVVPMGKDKNSMYQP-------EPWNLTKYIKNCKEQYGVS---PRPSWVLT 405
W +Q+C E D S QP +P+ I+ C + +G+ P +W +T
Sbjct: 353 WFYQTCVEFGYYQSSDAPSANQPFGNLFPFQPYQ----IQQCADSFGIPNMYPNVNWTIT 408
Query: 406 YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL--GSHCLDLD 463
YGG + + S N ++ NG D + + + + L + SHC D+
Sbjct: 409 EYGGINPE---PSSVDNTLYVNGSNDEWHNLAILPG----NANAKNTLYIIGTSHCADMM 461
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWI 488
P L Q ++ + +Q I
Sbjct: 462 IPTSVSPPTLAQAQQIIFEFIQANI 486
>gi|410910676|ref|XP_003968816.1| PREDICTED: thymus-specific serine protease-like [Takifugu rubripes]
Length = 509
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 131/518 (25%), Positives = 237/518 (45%), Gaps = 54/518 (10%)
Query: 3 SSQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKD--SNHGFETFFY---- 56
+S FS L++ + VS G LR R+R +++ + K +N G ++F +
Sbjct: 2 TSTFSPVRALMVSLLVSVD---AGRSLRRIQERVRGAQLQDAKQLLTNAGRQSFQHVTQG 58
Query: 57 --NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+Q +DHF P+ +TFPQR+ +N F G + P+ ++G E PI + L+G +
Sbjct: 59 KIHQPLDHFH--PQDRRTFPQRFFVNEAFCRGPDGPVFLYIGGEGPIFEFDVLAGHHVDM 116
Query: 115 AHQFKALIVILEHRYYGKSV-PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A + AL++ LEHR+YG S+ P G + L+N +S QALAD + HI +N
Sbjct: 117 AREHGALLLALEHRFYGDSINPDGLKTENLEN------LSSKQALADLVAFHQHISQSFN 170
Query: 174 ATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT-TYHSVVSKD 231
+ I+ G SY G L+ WFR ++PH+V G++ASSAP+ D + ++ T ++ +
Sbjct: 171 LSQRNTWISFGGSYSGSLSAWFRGQFPHLVFGAVASSAPVKATLDFSAYSDTVGLSLANE 230
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC-TP--LKNTTELKDGLDTVYS 288
S +C +K++++ ++ A ++ ++ F C TP L + EL L ++
Sbjct: 231 AVGGSAKCLDAVKEAFAAVE-AALMMGNVSQVASDFGCCQTPKNLDDQIELMQELAGIFM 289
Query: 289 EAAQYDTPSN-IPVKRICNAI-------ENAPNCGDDI--LCKIAAGVVEADSLEYDGNN 338
A QY+ + + +C + + A + + + L ++ + E L+
Sbjct: 290 GAVQYNEEGVFMSISDLCEVMTSTNGTCQKAADAYNRLVKLAQVYRSITEEPCLDVSHEK 349
Query: 339 SRCYINED--RTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGV 396
+ ++ + G S W +Q+C+E + + L + C +GV
Sbjct: 350 TLKHLMDTSPHAGRRSVRQWTYQTCTEFGFFQTCEDTTCPFSGMVTLQFETEVCPTVFGV 409
Query: 397 SPRPS------WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
S RPS + TYYGG + T +++ NG DP+ + + F
Sbjct: 410 S-RPSLARQIAFTNTYYGGDSPR------THRVLYINGGIDPWKELSVTQDRSGDQVVFI 462
Query: 451 QDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+D +HC D+ + +D L R + + W+
Sbjct: 463 EDT---AHCADMRSQRVTDRSSLQTARAEIERHVTEWL 497
>gi|195126042|ref|XP_002007483.1| GI12975 [Drosophila mojavensis]
gi|193919092|gb|EDW17959.1| GI12975 [Drosophila mojavensis]
Length = 507
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 133/520 (25%), Positives = 231/520 (44%), Gaps = 71/520 (13%)
Query: 11 LLLMVIFVSTSF-HANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPES 69
+L+ +I S+S NG + LG+ L++ + E ++ Q ++HF P+
Sbjct: 13 VLVCLIAQSSSLGFRNGRHMNGFLGQPSSMPTLQR---SLDVEDLWFEQRLNHFK--PDD 67
Query: 70 YQTFPQRYVINSKFW-GGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHR 128
+T+ QRY +N F+ +P+ +G E N G A F AL + LEHR
Sbjct: 68 TRTWQQRYFVNDAFYRNDSQAPVFLMIGGEGEATKNWMREGAWIHYAEHFGALCIQLEHR 127
Query: 129 YYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAPVIAIGASYG 187
+YGKS P + L N+ + Y +S QALAD + + +K +YN A IA G SY
Sbjct: 128 FYGKSHP----TSDLSNS-NLAYLSSEQALADLGNFVSAMKRQYNMADSQKWIAFGGSYP 182
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSW 247
G LA W R KYPH++ G+++SS P+L D +T Y VV EC + + + +
Sbjct: 183 GSLAAWAREKYPHLIDGAISSSGPLLAQVD---FTQYFEVVKASLASYKPECVEAVSRGF 239
Query: 248 SEIDNIASKPDGLAILSKKFKTCTPLKNTTE-------LKDGLDTVYSEAAQY---DTP- 296
++++ + G L +KFKTCTPLK++ E L + + + ++ QY ++P
Sbjct: 240 AQVEILLKHMIGQRNLDEKFKTCTPLKDSIENKLDIANLFENIASNFAGVVQYNKDNSPH 299
Query: 297 SNIPVKRICNAIENAP--------NCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRT 348
+ I + +IC+ + N +D+L K + + L+Y + +
Sbjct: 300 AKITIDQICDVMLNTTMGPPVTRLGAVNDLLLKQS----KTKCLDYKYDKMIAEMKNVSW 355
Query: 349 GDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSP 398
E +G W +Q+C+E + S + + + +I+ C + + +
Sbjct: 356 DSEVAKGMRQWTYQTCTEFGFYQTSENKSDTFGDRFGIDFFIRQCMDIFSDRMNGKFLEQ 415
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL--- 455
+ YYG L+ T+N+++ +G DP +H L S + N
Sbjct: 416 AVAQTNKYYGA------LKPGTTNVLYVHGSIDP---------WHALGLYVSTNSNTPAI 460
Query: 456 ----GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+HC ++ E DP L + R +K + + Y
Sbjct: 461 YIEGTAHCANMYEPANCDPPQLKEARNKILKYLAKLLDGY 500
>gi|242020732|ref|XP_002430805.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212516008|gb|EEB18067.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 478
Score = 148 bits (373), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 132/499 (26%), Positives = 222/499 (44%), Gaps = 67/499 (13%)
Query: 32 RLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPI 91
R GRIR + + S E ++ Q +DHF P + +T+ QRY +N F+ N P
Sbjct: 7 RWGRIRHGNLGHPESSEITPEEKYFVQKLDHFD--PTNTKTWNQRYFVNDSFYQP-NGPF 63
Query: 92 LAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY 151
+G E ++G + A ++ A V++EHR+YGKS P + +KN + Y
Sbjct: 64 FLMIGGEGEASPKWMVNGTWLDYAKKYNAYCVMVEHRFYGKSHP--TEDLGVKNLK---Y 118
Query: 152 FNSAQALADYASILLHIKDKYNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSA 210
+S QAL D A + + +K N P I +G SY G LA W RLKYPH+V+G++++S
Sbjct: 119 LSSEQALGDLAYFISSLNNKLNIFPPPKWIVMGGSYPGSLAAWMRLKYPHLVLGAVSTSG 178
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
P+L + + Y VV + EC I+ +I ++ P G L K FK C
Sbjct: 179 PLL---ALINFEEYFDVVKDSLSSYNPECVTAIEAGTKQIMSLLIHPLGQRSLFKMFKLC 235
Query: 271 TPLK-----NTTELKDGLDTVYSEAAQYDTP---------SNIPVKRICNAIENAPNCGD 316
PL+ + + L + L ++ QY+ S++ + +C+ + N + G
Sbjct: 236 DPLELNNEDDNSNLFESLAGNFAGVVQYNKDNRHDQNSGGSDLTIDYLCDIMLNQ-SLGK 294
Query: 317 DI--LCKIAAGVVEADSLE------YDGNNSRCYINEDRTGDESDEG---WEWQSCSEMV 365
+I L + V+ + E YD + + + ++ E EG W +Q+C+E
Sbjct: 295 EINRLAVVNEVVLNKTTKEKCLDYKYDKMIKQMQLTDWKS--EVAEGGRQWTYQTCTEF- 351
Query: 366 VPMGKDKNSMYQPEPWNLTK-----------YIKNCKEQYGVSPRPSWVLTYYGGHDIKL 414
+Q N TK ++K C + +G+ + LT G +
Sbjct: 352 --------GFFQTSSLNTTKQMFGNKFPPEFFLKQCTDIFGIKYNAN--LTEEGIIRTNM 401
Query: 415 I---LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPD 471
I L NI+F +G DP+ G KT + + +HC ++ + + DP
Sbjct: 402 IYGGLNLVADNIVFVHGSIDPWHALGITKTLRPGAPAIY--IQGTAHCANMYPSSEKDPP 459
Query: 472 WLVQQRKTEVKIMQGWITQ 490
LV RK +++ W+ +
Sbjct: 460 QLVDARKQIEQLIGEWLKE 478
>gi|167383690|ref|XP_001736631.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165900889|gb|EDR27114.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 466
Score = 148 bits (373), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 131/487 (26%), Positives = 208/487 (42%), Gaps = 59/487 (12%)
Query: 32 RLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPI 91
R IR R ++ D T Q IDHF + +T RY IN + +P+
Sbjct: 14 RFQNIRFGRETDKSDVIEHVST----QPIDHFDLTNK--KTINIRYFINDTIYSK-EAPL 66
Query: 92 LAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRG 150
L LG E P F A ++ +L++ +EHR+YGKSVP G + + G
Sbjct: 67 LVDLGGEGPQKAAAVGGRFVINKYAEKYNSLMLAIEHRFYGKSVPEGGL-----SQENLG 121
Query: 151 YFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
Y ++AQAL DY I+ IK +Y T PVI G SY G LA W R KYP+VV ++ASSA
Sbjct: 122 YLSAAQALEDYIMIINQIKKEYQVT-GPVIVFGGSYSGNLAAWIRQKYPNVVYAAVASSA 180
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
P+ +T++ + + D E+C K++ I+ + G A L F C
Sbjct: 181 PVY------ATSTFYEFLDVIYNDMGEKCGNAWKEATESIEELFKTDSGKAQLKSDFNAC 234
Query: 271 TPLKNTTELKDGLDTVYSEA---AQYDTPSNIPVKRICNAIENAPNCGDDILCKIAA--- 324
T + +L + + + QY+ ++ ++ +C+ + + + ++
Sbjct: 235 TDINGEDDLTILIQQIQATMINYPQYNGSYSLTIEGVCSILTTEGKTAYENMVELMNHAF 294
Query: 325 ---GVVEADSLEYDGNNSRCYINEDRTGDE--SDEGWEWQSCSEMVVPMGKDKNSMYQP- 378
G A S D D G+ S W WQ CSE S +QP
Sbjct: 295 NEFGFKCAPSSYADMLTDMANTKTDEEGNRLASTRSWAWQICSEY---------SYFQPV 345
Query: 379 -EPWNLTKYIKN------CKEQYGVSP-----RPSWVLTYYGGHDIKLILRRSTSNIIFS 426
E +K + N CK+ + V R YGG+ K +N+ ++
Sbjct: 346 NESLPFSKRLNNEFYYLLCKDIFNVDKQRLDRRVHHTNLMYGGYQPK------ATNVAYT 399
Query: 427 NGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQG 486
+G DP+S +T ++ + +HC DL K +DP+ L QQR + +
Sbjct: 400 SGSTDPWSPLAKHETLPSDINCYASHIKGTAHCADLYAEKDTDPEQLKQQRIETAQFIDE 459
Query: 487 WITQYYD 493
I++Y++
Sbjct: 460 LISRYHN 466
>gi|327276291|ref|XP_003222903.1| PREDICTED: thymus-specific serine protease-like [Anolis
carolinensis]
Length = 511
Score = 148 bits (373), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 131/460 (28%), Positives = 211/460 (45%), Gaps = 49/460 (10%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
F Q +DH Y ++ TF QRY +N+ FW G P+ F+G E + + L G
Sbjct: 67 FIRQHLDH--YNKKNQATFNQRYWVNAGFWRHG-GPVFLFIGGEGRLSEYAVLKGHHVTL 123
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A ++ AL++ LEHR+YG GS K + + Y +S QAL+D S I KY
Sbjct: 124 AEKYGALLLALEHRFYG-----GSLKPEMLEDDNLQYLSSQQALSDLVSFHQFISKKYKL 178
Query: 175 T-HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
T + I G SY G LA WFRLK+PH+V G++ASSAP+ D + YH VV+
Sbjct: 179 TPNNTWICFGGSYPGSLAAWFRLKFPHLVFGAVASSAPVRAQLD---FKGYHKVVAASLS 235
Query: 234 DT----SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP---LKNTTELKDGLDTV 286
+ S++C + +++S ++ + + L L + F++C P LK++ L L ++
Sbjct: 236 NPVISGSKQCLDAVTEAFSAVEELV-RSGQLDKLDQDFRSCLPLEGLKDSLWLIKNLVSM 294
Query: 287 YSEAAQYDTPSN--IPVKRICNAIENAPNCGDDILCKIAAG--VVEADSLEYDGNNSRCY 342
+ QY+ V RIC + N + G +A V+ A L+ N+ +
Sbjct: 295 FMAIVQYNGERVEWANVGRICEIMTNH-SAGSTYQRLVATNNIVLSAMRLKCLDNSYAAF 353
Query: 343 INEDR-----TGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVS 397
I + + + W +Q+C+E + + NL + CK+ + +S
Sbjct: 354 IEKMTNPKFFSMNMVVRQWIFQTCTEFGYFQTCEDPACPFSRLVNLRFEMDVCKQVFNIS 413
Query: 398 PRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ- 451
R S+ YYG + K S ++F NG DP+ +K DLS S+
Sbjct: 414 DRSAQEAVSFTNEYYGANHPK------ASRVLFVNGDIDPWHVLSVLK-----DLSPSEL 462
Query: 452 --DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
+ SHC +++ +DP LV+ RK + W+
Sbjct: 463 AIVITGTSHCANMESPLPTDPLPLVEARKKITAQVGEWLV 502
>gi|167534702|ref|XP_001749026.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772450|gb|EDQ86101.1| predicted protein [Monosiga brevicollis MX1]
Length = 433
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 126/423 (29%), Positives = 196/423 (46%), Gaps = 54/423 (12%)
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
+G +ENA FKALIV EHR++G+S + A + F+ QA+ADY L
Sbjct: 14 TGLMWENAADFKALIVFAEHRFFGQSQV--TPGADGPSTSEYPLFSVEQAMADYNHFLFE 71
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP-W--TTY 224
K + +PVI G SYGG LA W R+KYP +G++A+SAPI + P W TY
Sbjct: 72 FKQNRSIEDSPVIVFGGSYGGMLAAWLRIKYPETFLGAVAASAPISGFAGQQPEWDSNTY 131
Query: 225 HSVVSKD---FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
VV++D C ++ S+ + + G A LS F+ C PL +T++++
Sbjct: 132 WQVVTRDATPAAGAPAACADNVRNSFVTLFKTGASESGRAHLSDLFRLCKPLGSTSDVQA 191
Query: 282 -------GLDTVYSEAAQYDTPSN-----------IPVKRICNAIENAPNCGDDILCKIA 323
DT+ + PSN PV C + A GD +L
Sbjct: 192 LAMWIAYAWDTM--AMGDFPYPSNYLTSNGPMLPAYPVTAACQHLATANLTGDALL---- 245
Query: 324 AGVVEADSLEYDGN-NSRCYINEDRTGD-ESDEGWEWQSCSEMVVPMG----KDKNSMYQ 377
GV+ A S+ + N +C N+ D E D W+W C+E + + M+
Sbjct: 246 QGVLAAASVFTNATANLQC--NDVPFDDVEQDGIWDWLFCTETMHQETYFSLDGQRDMFW 303
Query: 378 PEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHD-IKLILRRSTS-----------NIIF 425
+PWN T +C ++YGV+PR + V YG + I++ RST+ N++
Sbjct: 304 SQPWNTTFINDHCFKKYGVTPRYTKVAERYGNPEAIEVGAHRSTTLSLQAALAAAGNVVL 363
Query: 426 SNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQ 485
SNG+ DP+S G + + + + L G+H LDL + DP + R E++ ++
Sbjct: 364 SNGLLDPWSSAGIKR--NISNTVTAIILEHGAHHLDLMFSTPQDPVDVSFARWFEMQRVR 421
Query: 486 GWI 488
W+
Sbjct: 422 EWL 424
>gi|388515463|gb|AFK45793.1| unknown [Lotus japonicus]
Length = 212
Score = 147 bits (372), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 79/200 (39%), Positives = 121/200 (60%), Gaps = 9/200 (4%)
Query: 300 PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQ 359
P+K +C I+ P G IL +I GV Y G ++C+ +D S GW WQ
Sbjct: 18 PIKEVCRRIDQGP-AGTSILERIYEGV--NVYYNYTGE-AKCFELDDDPHGLS--GWNWQ 71
Query: 360 SCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRR 418
+C+EMV+PM + +SM+ P +N T Y+++C + +GV PRP W+ T +GGH+I L++
Sbjct: 72 ACTEMVMPMSSSQESSMFPPYEYNYTSYLEDCIKSFGVEPRPKWITTEFGGHNILAPLKK 131
Query: 419 STSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRK 478
SNIIFSN + DP+S G ++ + F+ G+H ++L + +DPDWLV+QR
Sbjct: 132 FGSNIIFSNSLLDPWSGGSVLQ--NIFESIVFLVTKEGAHHINLRASTGNDPDWLVEQRA 189
Query: 479 TEVKIMQGWITQYYDDFKAI 498
TE+K++QGWI+ YY KA+
Sbjct: 190 TEIKLIQGWISDYYRKSKAV 209
>gi|443725197|gb|ELU12877.1| hypothetical protein CAPTEDRAFT_173918 [Capitella teleta]
Length = 517
Score = 147 bits (372), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 137/525 (26%), Positives = 229/525 (43%), Gaps = 71/525 (13%)
Query: 10 WLLLMVIFVSTSFHANGL-----KLRPRLGRIRRSRILEQKDSNHGFE------------ 52
W + + +++ T HA+G K R ++ + R R E+ H F
Sbjct: 11 WAIAICVYLITIDHASGFGLNFWKFRSKVDQHRHHR--EKVKLFHHFSHEPTALLRGPFI 68
Query: 53 TFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY 112
++ Q +DHF P+ ++ QRY +N+ FW G P+ ++G E + +G
Sbjct: 69 DEYFEQPLDHFD--PQVSGSYKQRYWVNADFWSGKEGPVFLYIGGEGGLTSMTVQAGEHV 126
Query: 113 ENAHQFKALIVILEHRYYGKSV-PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+ A ++KALI +EHR+YG+S+ G + +L+ Y +S QALAD A + K
Sbjct: 127 DLAKKYKALIFAVEHRFYGESLNDDGLKLESLQ------YLSSQQALADLAKFHAVMSQK 180
Query: 172 YNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
YN T + G SY G L+ WFR+KYPH+V ++ASSAP+ D + Y+ VV+
Sbjct: 181 YNLTDDNHWVCFGGSYPGALSAWFRIKYPHLVHAAVASSAPVRALVD---FQGYNDVVAA 237
Query: 231 DFRDT----SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGL 283
T S++C +K+++S ID + K + L L F +C PL K+ + +
Sbjct: 238 SLSATIVNGSDKCLSQVKEAFSTIDQMLDKGN-LLQLENDFYSCAPLDGEKDIYQFTSNV 296
Query: 284 DTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI--LCKIAAGVVEADSLEYDGNNSRC 341
+ QY+ IP + I E D L K+ + + N+
Sbjct: 297 ADAFMGVVQYN--QEIPGQSIAGLCEQMTASADSYANLRKLFRRFLNESDQKCSDNSWSS 354
Query: 342 YINE--DRTGDESDEG-----WEWQSCSEMVVPMGKDKNSMYQPEPW-NLTKYIKNCKEQ 393
I + + T D G W +Q+C++ D N+ + L + C E
Sbjct: 355 AIAQMSNTTVDRGGFGVGLRQWIYQTCTQFGYYQSCDVNTTCPFSRYMGLVPNLDICTEV 414
Query: 394 YGVSPRPSW-----VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+G+ + ++ YYG K + I+F NG DP W DLS
Sbjct: 415 FGIGGKSTYGRVDFTNAYYGSDQPK------GTRIVFVNGSIDP-----WHALSVLKDLS 463
Query: 449 FSQD---LNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
Q + +HC +++ + DP L++ RK +++ W+ Q
Sbjct: 464 GGQHAIFIEGTAHCANMNSNQPWDPPQLLKARKQTDELIGSWLNQ 508
>gi|118346703|ref|XP_976866.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89288597|gb|EAR86585.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 485
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 129/492 (26%), Positives = 207/492 (42%), Gaps = 42/492 (8%)
Query: 9 QWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFE--TFFYNQTIDHFSYG 66
Q L+L+ +FV + LG I K N E T+++ Q +DHF
Sbjct: 5 QKLILVFLFVGVA-----------LGNISNEAPYSTKIGNQTIEQTTYYFTQKVDHFD-- 51
Query: 67 PESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILE 126
P S T+ QR+ + S+ + N + F+G E P SG+ A QF A+++ +E
Sbjct: 52 PSSTDTYNQRFTVYSEAFNPANGTVFIFIGGEGPQQGLTTGSGWYMLVAQQFSAMVICVE 111
Query: 127 HRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD----KYNATHAPVIAI 182
HR+YG S PFG + A H + Q+LAD A + +IK + N + P I +
Sbjct: 112 HRFYGVSQPFGQGQDAW-TVDHLKFLTVDQSLADLAYFISYIKANNFLRINDRN-PFITV 169
Query: 183 GASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQT 242
G SY G ++ WFR KYPH+ IG+ ASSA + I + Y + + EC
Sbjct: 170 GGSYPGAMSAWFRYKYPHLTIGAHASSAVV---NAIMDFQQYDYQIYTSTSLSGPECPIK 226
Query: 243 IKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVK 302
I+K ++ I ++ A K L+N L D +++ QY K
Sbjct: 227 IQKFNEIVEEILTQNGEAAQNLKTLFKAQNLQNDDFLSYFGD-LWAGMVQYG-------K 278
Query: 303 RICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCS 362
R AP+ + L + + + DG +++ N +ES W WQ C+
Sbjct: 279 RTVLCDLFAPDTFGEQLKLVVDYAITQGNQPVDGYDTQSLTNTTYVANESGRQWTWQVCT 338
Query: 363 EMVVPMGKDKNSMYQPEPWNLTKYIKNCK---EQYGVSPRPSWVLTYYGGHDIKLILRRS 419
++ + NL Y C + + P+ V T+YGG +++
Sbjct: 339 YFGWFQSANQVQPMRSRTVNLQFYQNQCNVAFQNFQNFPKSDLVNTFYGGANLQAF---- 394
Query: 420 TSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKT 479
NI+F+NG+ D + D S + G HC++ K D L Q R +
Sbjct: 395 --NIVFTNGVEDEWQWASIRYPQGNMDAIISNCTDCG-HCVEFRYPKPEDSPQLQQTRAS 451
Query: 480 EVKIMQGWITQY 491
++ WIT++
Sbjct: 452 LIQHYTKWITEF 463
>gi|270004132|gb|EFA00580.1| hypothetical protein TcasGA2_TC003450 [Tribolium castaneum]
Length = 473
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 124/472 (26%), Positives = 221/472 (46%), Gaps = 59/472 (12%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS-PILAFLGAEAPIDDNIQLSGFTYE 113
++ Q +DHF+ P T+ QR+ N +F+ N P+ +G E G
Sbjct: 25 WFTQNLDHFN--PTDETTWKQRFYSNDQFFDPKNGGPVFLMIGGEGEASIKWMTQGAWVN 82
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A +F AL+ LEHRYYGKS P + + +N + Y S QALAD A+ + + +KY+
Sbjct: 83 YAEKFGALMFQLEHRYYGKSHP--TDDLSTQNLK---YLTSQQALADLATFITAMNEKYS 137
Query: 174 -ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
IA G SY G LA W R KYPH+V G++++S P+L D + Y V+ +
Sbjct: 138 LPPDVKWIAFGGSYPGSLAAWLRFKYPHLVHGAMSASGPLLAQVD---FKDYFRVIKESL 194
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA- 291
S++C +++ +I + + G A L++ FK C P++N+ + + +Y A
Sbjct: 195 ATHSDDCVTAVQQGVDQIGVLLKQEIGQANLNELFKLCDPVQNSINNEKDISNLYETIAD 254
Query: 292 ------QYD--------TPSNIPVKRICNAIENAPNCGDDI--LCKIAAGVVEA---DSL 332
QY+ +NI + +C+ + N G + L K+ ++ A L
Sbjct: 255 DFAGVVQYNKDNRVGSPAGANITIDVVCDIMVNQ-TIGPPVNRLAKVNEVLLSAYDQKCL 313
Query: 333 EYDGN---NSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPE----PWNLTK 385
+Y+ + N+ ++ D E W +Q+C+E G + S Y+P+ +++
Sbjct: 314 DYNYDKMINNLRNVSWDSEASEGGRQWTYQTCTE----FGFYQTSDYEPQIFGDQFSVDF 369
Query: 386 YIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGW 438
+I+ C + +G ++ TYYGG DI++ SN++F +G DP+ G
Sbjct: 370 FIQQCTDIFGSIYDEDFLNSATERTNTYYGGLDIEV------SNVVFVHGSIDPWHALGI 423
Query: 439 VKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
KT + + + + +HC ++ +D L + R+ + ++ W+ Q
Sbjct: 424 TKTID--EEAPAIYIEGTAHCANMYPPADTDLPQLKEAREQILNLIGTWLAQ 473
>gi|260797277|ref|XP_002593630.1| hypothetical protein BRAFLDRAFT_284130 [Branchiostoma floridae]
gi|229278856|gb|EEN49641.1| hypothetical protein BRAFLDRAFT_284130 [Branchiostoma floridae]
Length = 464
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 124/467 (26%), Positives = 205/467 (43%), Gaps = 57/467 (12%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DH Y +T+ QRY +N F+ G P+ +G E D ++G E A +
Sbjct: 21 QRLDH--YNDADLRTWQQRYFVNDTFYKPG-GPVFLMIGGEGTADPIWMVTGSWIEYAKE 77
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
F AL ++LEHRYYGKS P + +++N + Y +S QALAD A ++ +K + T
Sbjct: 78 FHALCLMLEHRYYGKSHP--TEDTSVENLQ---YLSSEQALADLAYFRNYMAEKMSLTDN 132
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
I G SY G L+ WFRLKYPH+V G++A+S P+L D + Y VV T
Sbjct: 133 KWITFGGSYPGSLSAWFRLKYPHLVAGAVATSGPLLAELD---FVEYVEVVRDSLATTGP 189
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-----KNTTELKDGLDTVYSEAAQ 292
EC + I+++ + + +G+ L+K F C+PL + + + Q
Sbjct: 190 ECNKNIQEATDAVKQMLETQEGVEKLNKLFNLCSPLDVNVANDVANFWSTISGNFMGVVQ 249
Query: 293 YD---------TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC-- 341
Y+ +NI + +CN + + + + + L + +C
Sbjct: 250 YNKDNREFEGAVGTNITLDVLCNIMND-----NSKHMPVEERYAAVNYLMMETYKQKCLD 304
Query: 342 -----YINEDR------TGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNC 390
I E R E W +Q+C+E D + + + L+ ++ C
Sbjct: 305 VSYSAMIKELRETSWESAASEGGRQWIYQTCTEFGFYQTSDSPNQPFGDGFPLSFSLQQC 364
Query: 391 KEQYGVSPRPSWVL-------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
+ YG S ++ T YG L+ + + I+F NG DP+ G +
Sbjct: 365 SDIYGPQFNQSTLMEGIRRTNTNYGA------LKIAGTKIVFPNGSIDPWHALGITEDPK 418
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
DL + + +HC ++ A+ DP L Q R+T +Q W++Q
Sbjct: 419 GTDLH-AVYIKGTAHCANMYPARFGDPPELTQARETIKGHIQDWLSQ 464
>gi|332026663|gb|EGI66772.1| Putative serine protease K12H4.7 [Acromyrmex echinatior]
Length = 481
Score = 147 bits (371), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 137/501 (27%), Positives = 230/501 (45%), Gaps = 44/501 (8%)
Query: 11 LLLMVIFVSTSFH--ANGLKLRP-RLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGP 67
+ L ++FV S H NG+ R + L + + E++ Q +DHF++
Sbjct: 2 IALKILFVLCSVHPLVNGVGFRGFTFKGLEEPESLTKNVGTNIVESWI-TQPLDHFNH-- 58
Query: 68 ESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEH 127
+T+ RY NS F N PIL +G E I + G YE ++ L+ EH
Sbjct: 59 RDNRTWSMRYKENSAFLKK-NGPILIMIGGEWEITNGFLQGGLMYELGVKYHGLMYYTEH 117
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYG 187
R+YG+S P ++ + +N + Y N+ QALAD A + K + N + VI +G SY
Sbjct: 118 RFYGQSRP--TKDISTENLQ---YLNADQALADLAYFIDTKKKEKNLEKSIVIVVGGSYA 172
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSW 247
G +A W RLKYPH++ G+LASSAP+ D + Y+ VV+ S+ C +++K ++
Sbjct: 173 GNMAAWARLKYPHLIQGALASSAPVRAKAD---FYEYYEVVTDALGKYSKTCIESVKTAF 229
Query: 248 SEIDNIASKPDGLAILSKKFKTC-TP-LKNTTEL---KDGLDTVYSEAAQYDTPSN--IP 300
+ ++ + + G L FK C P ++++++L + L +++ QY+ N
Sbjct: 230 ASVEELLAMRAGPQKLKLLFKLCHVPDVRSSSDLGYFMNTLSEIFAGVVQYNKIENSETG 289
Query: 301 VKRICNAIENAPNCGDDI--LCKIAAGVVEADSLEYDGNNSRCY--INEDRTGDES-DEG 355
+ +CN + A + G + L +I + + + + Y+ N + Y I+ D S
Sbjct: 290 IAALCNKM-TAKHLGSPLQRLARIFSNQKKCNDVNYN-NFLKTYREISWDSPAATSIMRQ 347
Query: 356 WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYG 408
W Q+C+E + N + + L +I C + YG + YG
Sbjct: 348 WYHQTCTEYGYYQTTNSNKSIFGKLFPLNYFINLCTDLYGDYHNKKILDSHVRRTNIMYG 407
Query: 409 GHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKS 468
G L N+IF+NG DP+ ++ + F + +N SHC DL +
Sbjct: 408 GKLPDL------RNVIFTNGNSDPWHPLSVLQDLNAFSPAIV--INGSSHCRDLYSDVTT 459
Query: 469 DPDWLVQQRKTEVKIMQGWIT 489
DPD L R KI+ WI+
Sbjct: 460 DPDNLKAARAKIRKIIGKWIS 480
>gi|194865618|ref|XP_001971519.1| GG14395 [Drosophila erecta]
gi|190653302|gb|EDV50545.1| GG14395 [Drosophila erecta]
Length = 508
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 128/478 (26%), Positives = 215/478 (44%), Gaps = 42/478 (8%)
Query: 39 SRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS-PILAFLGA 97
S+I + S H E ++ Q +DHF +T+ QRY +N+ F+ +S P+ +G
Sbjct: 43 SKIPTLQGSQHS-EDLWFEQRLDHFK--SSDVRTWQQRYFVNADFYRNDSSAPVFLMIGG 99
Query: 98 EAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQA 157
E G A F AL + LEHR+YGKS P A + + Y +S QA
Sbjct: 100 EGEASAKWMREGAWVHYAEHFGALCLQLEHRFYGKSHP-----TADLSTENLHYLSSEQA 154
Query: 158 LADYASILLHIKDKYN-ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYG 216
L D AS + +K K+N A IA G SY G LA W R KYP ++ GS++SS P+L
Sbjct: 155 LEDLASFVTAMKVKFNLADGQKWIAFGGSYPGSLAAWAREKYPQLIYGSISSSGPLLAEV 214
Query: 217 DITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNT 276
D + Y VV EC + + +S+++++ + G L +KFKTCTP+K++
Sbjct: 215 D---FKEYFEVVKASLAAYKPECVEAVTRSFAQVEILLKHMIGQRSLDEKFKTCTPIKDS 271
Query: 277 TE-------LKDGLDTVYSEAAQY---DTP-SNIPVKRICNAIENAP--------NCGDD 317
E + L ++ QY ++P + I + IC+ + N +D
Sbjct: 272 IENDLDMANFFENLAGNFAGVVQYNKDNSPHATITIDDICDVMLNTTAGPPVTRLGLVND 331
Query: 318 ILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDKNS 374
+L K A L+Y + E+ +G W +Q+C E +
Sbjct: 332 MLLKEA----NTTCLDYKYEKMVADMKNVSWDSETAKGMRQWTYQTCHEFGFYQTSANPT 387
Query: 375 MYQPEPWNLTKYIKNCKEQYGVSPRPSWV-LTYYGGHDIKLILRRSTSNIIFSNGMRDPF 433
+ + + +I+ C + + + ++ L G +D L+ T+N+++ +G DP+
Sbjct: 388 DTFGDRFGVDFFIRQCMDVFSKNMDAKFLQLVVSGTNDNYGALKPKTTNVLYVHGSIDPW 447
Query: 434 SRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
G VK+ + + + +HC ++ E K+DP LV R +K + + Y
Sbjct: 448 HALGLVKSSNPALPTIY--IEGTAHCANMYEPVKTDPPQLVAARNNILKFLAKLLNGY 503
>gi|195376151|ref|XP_002046860.1| GJ13120 [Drosophila virilis]
gi|194154018|gb|EDW69202.1| GJ13120 [Drosophila virilis]
Length = 513
Score = 146 bits (369), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 131/507 (25%), Positives = 218/507 (42%), Gaps = 75/507 (14%)
Query: 23 HANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSK 82
H NG LG + L+Q + E ++ Q +DH P+ +T+ QRY +N
Sbjct: 37 HVNGF-----LGGPSKIATLQQ---SMDVEDLWFEQRLDHLQ--PDDTRTWQQRYFVNDA 86
Query: 83 FW-GGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKA 141
F+ ++P+ +G E G A F AL + LEHR+YGKS P
Sbjct: 87 FYRNDSHAPVFLMIGGEGEATKKWMHEGAWVRYAEHFGALCIQLEHRFYGKSHPTSDLST 146
Query: 142 ALKNARHRGYFNSAQALADYASILLHIKDKYNA-THAPVIAIGASYGGELATWFRLKYPH 200
+ + Y +S QALAD A+ + +K KYN IA G SY G LA W R KYPH
Sbjct: 147 S-----NLAYLSSEQALADLANFVTTMKTKYNMDAKQKWIAFGGSYPGSLAAWAREKYPH 201
Query: 201 VVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGL 260
++ GS++SS P+L D ++ Y VV EC + + + ++++ + G
Sbjct: 202 LIDGSISSSGPLLAQVD---FSQYFEVVKSSLASYKPECVEAVTRGIAQVEILLKHMIGQ 258
Query: 261 AILSKKFKTCTPLKNTTELKDGLDTVYSEAA-------QY---DTP-SNIPVKRICNAIE 309
L +KFKTCTPLK++ E + + ++ A QY ++P +NI + IC+ +
Sbjct: 259 RNLDEKFKTCTPLKDSIENQLDISNLFENIAGNFAGVVQYNKDNSPHANITIDEICDVML 318
Query: 310 NAP--------NCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEG---WEW 358
N +D+L K + E+ L+Y + E+ +G W +
Sbjct: 319 NTTMGPPVTRLAAVNDMLLKQS----ESKCLDYKYEKMIADMKNVSWDSEAAKGMRQWTY 374
Query: 359 QSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHD 411
Q+C+E + S + + + +I+ C + + + + YYG
Sbjct: 375 QTCTEFGFYQTSENKSDTFGDRFGVDFFIRQCMDIFSERMDGKFLEQAVAQTNKYYGA-- 432
Query: 412 IKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL-------GSHCLDLDE 464
L+ +T+ +++ +G DP +H L S + N +HC ++ E
Sbjct: 433 ----LKPATTQVLYVHGSIDP---------WHALGLYVSPNANTPTIYIEGTAHCANMYE 479
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWITQY 491
SDP L R +K + + Y
Sbjct: 480 PVNSDPPQLKAARNKILKYLAKLLDGY 506
>gi|195013134|ref|XP_001983807.1| GH15372 [Drosophila grimshawi]
gi|193897289|gb|EDV96155.1| GH15372 [Drosophila grimshawi]
Length = 505
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 136/530 (25%), Positives = 229/530 (43%), Gaps = 82/530 (15%)
Query: 7 SFQWLLLMVIFVSTSF-------HANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQT 59
+F LLL+ I S H NG P S++L + S E ++ Q
Sbjct: 6 TFYVLLLLAIIAQCSSLNFRRGRHVNGFLGEP-------SKVLSLQRS-LDVEELWFEQR 57
Query: 60 IDHFSYGPESYQTFPQRYVINSKFW-GGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQF 118
+DHF ++ QT+ QRY IN + + N+P+ +G E G A F
Sbjct: 58 LDHFK--ADNRQTWQQRYFINDQHYVNDSNAPVFIMIGGEGEATKKWMNEGAWIHYAEHF 115
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
AL + LEHR+YGKS P G + + Y +S QALAD A+ + +K KYN
Sbjct: 116 GALCIQLEHRFYGKSHPTGDLSTS-----NLAYLSSEQALADLANFVSAMKSKYNMKATQ 170
Query: 179 V-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
IA G SY G LA W R KYP ++ G+++SS P+L D + Y VV
Sbjct: 171 KWIAFGGSYPGSLAAWAREKYPDLIDGAISSSGPLLAEVD---FRQYFEVVKASLASYKP 227
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-------LKDGLDTVYSEA 290
+C + + +S+++++ + G L +KFKTCTPLK++ E L + L ++
Sbjct: 228 DCVEAVTRSFAQVEILLKHMIGQRNLDEKFKTCTPLKDSIETPLDIANLFENLAGNFAGV 287
Query: 291 AQY---DTP-SNIPVKRICNAIENAP--------NCGDDILCKIAAGVVEADSLEYDGNN 338
QY ++P +NI + +C+ + N +D+L K + ++ L+Y +
Sbjct: 288 VQYNKDNSPHANITIDEVCDVMLNTQLGPPVTRLAAVNDMLLKQS----KSKCLDYKYDK 343
Query: 339 SRCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG 395
+ E +G W +Q+C+E D S + + + +++ C + +
Sbjct: 344 MIADMKNVSWDSEVAKGMRQWTYQTCNEFGFYQTSDNKSDTFGDRFGVDFFVRQCADIFS 403
Query: 396 VSPRPSWVLT-------YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
++V +YG L+ T+ +++ +G DP +H L
Sbjct: 404 ERMDANFVEQAVLATNKFYGA------LKPDTTQVLYVHGSIDP---------WHALGLY 448
Query: 449 FSQD-------LNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
S + + +HC ++ E SDP+ L R +K + + Y
Sbjct: 449 VSPNAKTPTIYIEGTAHCANMYEPVDSDPEQLKAARNKILKFLAKLLDGY 498
>gi|330790734|ref|XP_003283451.1| hypothetical protein DICPUDRAFT_147119 [Dictyostelium purpureum]
gi|325086716|gb|EGC40102.1| hypothetical protein DICPUDRAFT_147119 [Dictyostelium purpureum]
Length = 487
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 125/459 (27%), Positives = 204/459 (44%), Gaps = 63/459 (13%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN- 114
+N +DH Y P + +TF QRY +N +W G P+ LG E PI + F
Sbjct: 60 FNNQVDH--YNPLNTETFKQRYYVNDTYWTPG-GPVFLVLGGEGPISPSYVTGHFVVNYY 116
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A F ALIV +EHR+YG S P G+ A +N + Y ++ QALADYA+ + K KYN
Sbjct: 117 APMFDALIVAVEHRFYGASTPKGN--LATENLK---YLSTQQALADYANFVQFFKQKYNT 171
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
+ ++ G SY G L+ W RLKYP+++ ++A+SAP+ + + Y VVS
Sbjct: 172 GDSKWVSFGGSYSGNLSAWLRLKYPNLIDAAIATSAPV---KPVVDFPEYFEVVSNSIGP 228
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV---YSEAA 291
+ I K+ +++ N ++K F C P+ + ++ ++++ SE
Sbjct: 229 SCSAFVANITKTVTDMINNGQNDQ----VAKLFNACDPIVSDLDIATFMESLSGGISEIV 284
Query: 292 QYDTPSN----IPVKRICNAIE-------------NAPNCGDDILCKIAAGVVEADSLEY 334
QY+ +N + +C E N N C +++ S+ Y
Sbjct: 285 QYNLDNNAYTFTNITAMCEEFEQGSDPMQTFIDFNNRYNTFSGSPCTLSS---YEKSVIY 341
Query: 335 DGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY 394
N +N S W WQ C+E + S L +I C + +
Sbjct: 342 QQNIDPANVN------ASSRSWNWQCCTEYGYYQTGESPSQPFSSTITLDYFINMCTDVF 395
Query: 395 G-----VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
G P+ +++T YG +I+ +SNI+ ++G DP+S G H L
Sbjct: 396 GPEGFVYKPQVDYIITDYGSTNIQ------SSNIVMASGTIDPWSFLG----VHQTPLKS 445
Query: 450 SQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQ 485
S L G+HC +L K+ D +V R E+++++
Sbjct: 446 SVQPILIQGGAHCSELYMPKEHDLPDVVTARLVEIQLIK 484
>gi|9506995|ref|NP_062302.1| thymus-specific serine protease precursor [Mus musculus]
gi|13633993|sp|Q9QXE5.1|TSSP_MOUSE RecName: Full=Thymus-specific serine protease; AltName: Full=Serine
protease 16; Flags: Precursor
gi|6706784|emb|CAB66137.1| thymus-specific serine peptidase [Mus musculus]
gi|26352940|dbj|BAC40100.1| unnamed protein product [Mus musculus]
gi|116138603|gb|AAI25482.1| Protease, serine, 16 (thymus) [Mus musculus]
gi|124297571|gb|AAI31956.1| Protease, serine, 16 (thymus) [Mus musculus]
gi|148700669|gb|EDL32616.1| protease, serine, 16 (thymus), isoform CRA_d [Mus musculus]
Length = 509
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 121/469 (25%), Positives = 204/469 (43%), Gaps = 62/469 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ +TF QRY +N + G + P+ +G E + ++G
Sbjct: 60 WLEQPLDPFNA--SDRRTFLQRYWVNDQHRTGQDVPVFLHIGGEGSLGPGSVMAGHPAAL 117
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G AL Y +S ALAD AS + N
Sbjct: 118 APAWGALVISLEHRFYGLSMPAGGLDLALLR-----YLSSRHALADVASARQALSGLLNV 172
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LATW RLK+PH+V ++ASSAP+ + ++ Y+ VV++
Sbjct: 173 SSSSPWICFGGSYAGSLATWARLKFPHLVFAAVASSAPL---SAVVDFSAYNQVVARSLT 229
Query: 234 DT----SEECFQTIKKSWSEIDNIA-SKPDGLAILSKKFKTCTPL---KNTTELKDGLDT 285
S EC +++E++ + + P A+L ++ C L ++ EL L
Sbjct: 230 QVAIGGSLECLAAASTAFTEVERLLRAGPAAQAVLREELGACGSLDLTEDQAELLGALQA 289
Query: 286 VYSEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYD 335
+ QYD + P V+++C N + P G +I + L +
Sbjct: 290 LVGGTVQYDGQAGAPLSVRQLCGLLLGKWGNRSRSTPYLGLRRAVQIVLRSMGQKCLSFS 349
Query: 336 GNNSRCYIN--EDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
+ ++ E + D W +Q+C+E + + + L ++ C++
Sbjct: 350 RAETVAQLSNTEPQVSGVGDRQWLYQTCTEFGFYVTCEGLQCPFSQLPALPFQLELCEQV 409
Query: 394 YGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+G+SP + +YYGG + ++F NG DP +H LS
Sbjct: 410 FGLSPASVAQAVAQTNSYYGGQS------PGATQVLFVNGDTDP---------WHV--LS 452
Query: 449 FSQDLNL---------GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+QDL L SHC D+ + SD L R+ + +Q W+
Sbjct: 453 VTQDLGLSEPALLIPSASHCFDMAPMRPSDSPSLRLGRQKISQQLQDWL 501
>gi|148700668|gb|EDL32615.1| protease, serine, 16 (thymus), isoform CRA_c [Mus musculus]
Length = 493
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 121/467 (25%), Positives = 203/467 (43%), Gaps = 62/467 (13%)
Query: 57 NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAH 116
Q +D F+ +TF QRY +N + G + P+ +G E + ++G A
Sbjct: 46 EQPLDPFNA--SDRRTFLQRYWVNDQHRTGQDVPVFLHIGGEGSLGPGSVMAGHPAALAP 103
Query: 117 QFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-AT 175
+ AL++ LEHR+YG S+P G AL Y +S ALAD AS + N ++
Sbjct: 104 AWGALVISLEHRFYGLSMPAGGLDLALLR-----YLSSRHALADVASARQALSGLLNVSS 158
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
+P I G SY G LATW RLK+PH+V ++ASSAP+ + ++ Y+ VV++
Sbjct: 159 SSPWICFGGSYAGSLATWARLKFPHLVFAAVASSAPL---SAVVDFSAYNQVVARSLTQV 215
Query: 236 ----SEECFQTIKKSWSEIDNIA-SKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVY 287
S EC +++E++ + + P A+L ++ C L ++ EL L +
Sbjct: 216 AIGGSLECLAAASTAFTEVERLLRAGPAAQAVLREELGACGSLDLTEDQAELLGALQALV 275
Query: 288 SEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYDGN 337
QYD + P V+++C N + P G +I + L +
Sbjct: 276 GGTVQYDGQAGAPLSVRQLCGLLLGKWGNRSRSTPYLGLRRAVQIVLRSMGQKCLSFSRA 335
Query: 338 NSRCYIN--EDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG 395
+ ++ E + D W +Q+C+E + + + L ++ C++ +G
Sbjct: 336 ETVAQLSNTEPQVSGVGDRQWLYQTCTEFGFYVTCEGLQCPFSQLPALPFQLELCEQVFG 395
Query: 396 VSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
+SP + +YYGG + ++F NG DP +H LS +
Sbjct: 396 LSPASVAQAVAQTNSYYGGQS------PGATQVLFVNGDTDP---------WHV--LSVT 438
Query: 451 QDLNL---------GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
QDL L SHC D+ + SD L R+ + +Q W+
Sbjct: 439 QDLGLSEPALLIPSASHCFDMAPMRPSDSPSLRLGRQKISQQLQDWL 485
>gi|281206134|gb|EFA80323.1| Putative serine protease [Polysphondylium pallidum PN500]
Length = 484
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 131/459 (28%), Positives = 208/459 (45%), Gaps = 65/459 (14%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN- 114
++Q +DH Y P + TF Q+Y +N +W G P+ LG E P F
Sbjct: 56 FDQQVDH--YDPLNTATFKQQYFVNDTYWTTG-GPVFLLLGGEGPASVTSVTGHFVINTY 112
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A QF ALIV +EHR+YGKS P S+ A + + QALAD+A+ I KYN
Sbjct: 113 AQQFGALIVSVEHRFYGKSSP--SKTLA---TEYLNLLTTQQALADFANFRQFIAAKYNV 167
Query: 175 -THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
+ ++ G SY G L+ W RLKYP ++ ++A+SAP+ D + Y VV+ R
Sbjct: 168 PSTTKWVSFGGSYSGSLSAWLRLKYPQLIDAAIATSAPVQPQLD---FPEYFEVVA---R 221
Query: 234 DTSEECFQTIKKSWSEIDNIASK--PDGLAILSKKFKTCTPLKNTTELK---DGLDTVYS 288
C I +E+ N+ ++ + K F TC P+ ++ ++ + L S
Sbjct: 222 SVGPACSARI----AEVTNLVTQMLQTDRKTVEKLFNTCDPIVSSDDVATFFESLSDGIS 277
Query: 289 EAAQYDTPSN----IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-- 342
E QY+ +N + +C+ ++ G D L E + + Y
Sbjct: 278 EIVQYNNDNNKYTMFNISHMCSLLD-----GGDPLQSFVNFNNEFNQFSGNKCTQSSYKS 332
Query: 343 ----INEDRTGDESDEG--WEWQSCSEMVVPMGKDKNSMYQPEPWN----LTKYIKNCKE 392
+ E E+ G W WQ+C+E G + S +P++ L +++ C +
Sbjct: 333 MIAQMRETEVNGENAAGRLWTWQTCTE----YGYFQTSESPNQPFSSSITLDWFLQQCAD 388
Query: 393 QYGVSP--RP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
+G P +P W+ T +GG +I+ TSN IF NG+ DP+ G V
Sbjct: 389 IFGPKPDGKPYLPAIEWIETDFGGRNIQ------TSNTIFPNGLIDPWHILG-VLNATTS 441
Query: 446 DLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIM 484
+S + + LG+HC DL +D + LV R+ EV ++
Sbjct: 442 SIS-TAIIPLGAHCSDLYPPLPTDNEALVLARQMEVNLI 479
>gi|195588332|ref|XP_002083912.1| GD13982 [Drosophila simulans]
gi|194195921|gb|EDX09497.1| GD13982 [Drosophila simulans]
Length = 508
Score = 145 bits (366), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 123/465 (26%), Positives = 210/465 (45%), Gaps = 41/465 (8%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS-PILAFLGAEAPIDDNIQLSGF 110
E ++ Q +DHF + +T+ QRY +N+ F+ +S P+ +G E G
Sbjct: 55 EDLWFEQRLDHFKSSDK--RTWQQRYFVNADFYRNDSSAPVFLMIGGEGEASAKWMREGA 112
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
A F AL + LEHR+YGKS P A + + Y +S QAL D AS + +K
Sbjct: 113 WVHYAEHFGALCLQLEHRFYGKSHP-----TADLSTENLHYLSSEQALEDLASFVTAMKV 167
Query: 171 KYNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
K+N IA G SY G LA W R KYP ++ GS++SS P+L D + Y VV
Sbjct: 168 KFNLGDGQKWIAFGGSYPGSLAAWAREKYPQLIYGSISSSGPLLAEVD---FKEYFEVVK 224
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-------LKDG 282
EC + +S+++++ + G L +KFKTCTP+K++ E +
Sbjct: 225 ASLVSYKPECLDAVTRSFAQVEILLKHMIGQRSLDEKFKTCTPIKDSIENDLDMSNFFEN 284
Query: 283 LDTVYSEAAQY---DTP-SNIPVKRICNAIENAP--------NCGDDILCKIAAGVV--- 327
L ++ QY ++P ++I + IC+ + N +D+L K +
Sbjct: 285 LAGNFAGVVQYNKDNSPHASITIDDICDVMLNTTAGPPVTRLGLVNDMLLKESNTTCLDY 344
Query: 328 EADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYI 387
+ D + D N ++ D + W +Q+C E DK + + + + +I
Sbjct: 345 KYDKMVADMKN----VSWDSETGKGMRQWTYQTCHEFGFYQTSDKPADTFGDRFGVDFFI 400
Query: 388 KNCKEQYGVSPRPSWV-LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
+ C + + + ++ L +D L+ T+N+++ +G DP+ G VK+
Sbjct: 401 RQCMDVFSKNMNAKFLKLVVSATNDNYGALKPKTTNVLYVHGSIDPWHALGLVKSTSPAL 460
Query: 447 LSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+ + +HC ++ E K+DP LV R +K + + Y
Sbjct: 461 PTIY--IEGTAHCANMYEPVKTDPPQLVAARNKILKFLAKLLDGY 503
>gi|297734877|emb|CBI17111.3| unnamed protein product [Vitis vinifera]
Length = 149
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 71/127 (55%), Positives = 87/127 (68%), Gaps = 11/127 (8%)
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
LA+W RLKYPHV +G+LASSAPILY+ DITP + SE C+ TI++SWSE
Sbjct: 33 LASWLRLKYPHVALGALASSAPILYFDDITPQN-----------EASEICYNTIRESWSE 81
Query: 250 IDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIE 309
ID +AS+PDGL+ILSKKF+TCT L + ELKD LD YS AAQY+ P PV +C I+
Sbjct: 82 IDKVASEPDGLSILSKKFRTCTELSTSDELKDYLDETYSVAAQYNHPPRYPVTVVCGGID 141
Query: 310 NAPNCGD 316
AP D
Sbjct: 142 GAPEGSD 148
>gi|195338045|ref|XP_002035636.1| GM14810 [Drosophila sechellia]
gi|194128729|gb|EDW50772.1| GM14810 [Drosophila sechellia]
Length = 508
Score = 145 bits (365), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 123/465 (26%), Positives = 209/465 (44%), Gaps = 41/465 (8%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS-PILAFLGAEAPIDDNIQLSGF 110
E ++ Q +DHF + +T+ QRY +N+ F+ +S P+ +G E G
Sbjct: 55 EDLWFEQRLDHFKSSDK--RTWQQRYFVNADFYRNDSSAPVFLMIGGEGEASAKWMREGA 112
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
A F AL + LEHR+YGKS P A + + Y +S QAL D AS + +K
Sbjct: 113 WVHYAEHFGALCLQLEHRFYGKSHP-----TADLSTENLHYLSSEQALEDLASFVTAMKV 167
Query: 171 KYNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
K+N IA G SY G LA W R KYP ++ GS++SS P+L D + Y VV
Sbjct: 168 KFNLGDGQKWIAFGGSYPGSLAAWAREKYPQLIYGSISSSGPLLAEVD---FKEYFEVVK 224
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-------LKDG 282
EC + +S+++++ + G L +KFKTCTP+K++ E +
Sbjct: 225 ASLASYKPECLDAVTRSFAQVEILLKHMIGQRSLDEKFKTCTPIKDSIENDLDMANFFEN 284
Query: 283 LDTVYSEAAQY---DTP-SNIPVKRICNAIENAP--------NCGDDILCKIAAGVV--- 327
L ++ QY ++P + I + IC+ + N +D+L K +
Sbjct: 285 LAGNFAGVVQYNKDNSPHAKITIDDICDVMLNTTAGPPVTRLGLVNDMLLKESNTTCLDY 344
Query: 328 EADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYI 387
+ D + D N ++ D + W +Q+C E DK + + + + +I
Sbjct: 345 KYDKMVADMKN----VSWDSETAKGMRQWTYQTCHEFGFYQTSDKPADTFGDRFGVDFFI 400
Query: 388 KNCKEQYGVSPRPSWV-LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
+ C + + + ++ L +D L+ T+N+++ +G DP+ G VK+
Sbjct: 401 RQCMDVFSKNMNAKFLKLVVSATNDNYGALKPRTTNVLYVHGSIDPWHALGLVKSTSPAL 460
Query: 447 LSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+ + +HC ++ E K+DP LV R +K + + Y
Sbjct: 461 PTIY--IEGTAHCANMYEPVKTDPPQLVAARNKILKFLAKLLDGY 503
>gi|405960530|gb|EKC26449.1| Putative serine protease F56F10.1 [Crassostrea gigas]
Length = 446
Score = 144 bits (363), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 131/474 (27%), Positives = 205/474 (43%), Gaps = 76/474 (16%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q + HF+Y +T+ QRY +N F+ N PI +G E + L G E A
Sbjct: 5 QKLTHFNYADT--RTWQQRYFVNDTFYKP-NGPIFLMIGGEGTANPAWMLQGAWIEYAKT 61
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
+ A+ +LEHRYYGKS P + ++ N + + +S QALAD A + ++K KYN
Sbjct: 62 YHAICFLLEHRYYGKSHP--TPDLSVDNLQ---FLSSEQALADLAYFIQYVKHKYNLMSK 116
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
I ++GG L+ WFR+KYPH+V G++A+SAPI + + Y VV T
Sbjct: 117 DQKLI--TFGGSLSAWFRVKYPHLVDGAVATSAPIFAQLN---FKEYLQVVVSSLATTGP 171
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPS 297
C + IK + I + S G K F C PL +T++ DG + + A ++
Sbjct: 172 GCNKNIKMATDTITKMISTDTGRKSAEKMFNLCDPL-DTSQYVDGSNLFSNLAGNFE--G 228
Query: 298 NIPVKRICNAIENA--PNCGDDILCKIAAG---------VVEADSLEYDGNNSRCYINE- 345
+ + A E A N D +C I + +SL + +C N
Sbjct: 229 VVQYNKDNRAFEGAIGTNITIDTICGIMTDESNGSPVERYAKVNSLMLSTYSQKCLDNSY 288
Query: 346 ------------DRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQP--EPWNLTKYIKNCK 391
+ + E W +Q+C+E D + QP +NL I+ C
Sbjct: 289 NKMIKGLQATAWNSSASEGGRQWMYQTCTEFGFFQSSDLGDV-QPFGNFFNLKFSIQQCM 347
Query: 392 EQYGVSPRPSWVL-------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHF 444
+ +G + T YGG+ ++ + I+F NG DP +HF
Sbjct: 348 DVFGAKFNQELIQMGINRTNTNYGGYGMR------ATKIVFPNGSIDP---------WHF 392
Query: 445 FDLSFSQDLNLGS---------HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
L F++DL++ S HC ++ A D LVQ R T K++ W++
Sbjct: 393 --LGFTKDLSMESPAIYIQGTAHCANMYPATSDDLPQLVQARATIEKLIGTWLS 444
>gi|24659664|ref|NP_648067.2| CG9953 [Drosophila melanogaster]
gi|7295307|gb|AAF50628.1| CG9953 [Drosophila melanogaster]
Length = 508
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 129/491 (26%), Positives = 220/491 (44%), Gaps = 48/491 (9%)
Query: 32 RLGRIRR------SRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG 85
R GR+ + S+I + S H E ++ Q +DHF + +T+ QRY +N+ F+
Sbjct: 30 RRGRLTKGFLGEPSKIPTLQRSLHS-EDLWFEQRLDHFKSSDK--RTWQQRYFVNADFYR 86
Query: 86 GGNS-PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALK 144
+S P+ +G E G A F AL + LEHR+YGKS P A
Sbjct: 87 NDSSAPVFLMIGGEGEASAKWMREGAWVHYAEHFGALCLQLEHRFYGKSHP-----TADL 141
Query: 145 NARHRGYFNSAQALADYASILLHIKDKYNATHAPV-IAIGASYGGELATWFRLKYPHVVI 203
+ + Y +S QAL D AS + +K K+N IA G SY G LA W R KYP ++
Sbjct: 142 STENLHYLSSEQALEDLASFVTAMKVKFNLGDGQKWIAFGGSYPGSLAAWAREKYPELIY 201
Query: 204 GSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAIL 263
GS++SS P+L D + Y VV EC + +S+++++ + G L
Sbjct: 202 GSISSSGPLLAEVD---FKEYFEVVKASLAAYKPECVDAVTRSFAQVEILLKHMIGQRSL 258
Query: 264 SKKFKTCTPLKNTTE-------LKDGLDTVYSEAAQY---DTP-SNIPVKRICNAIENAP 312
+KFKTCTP+K++ E + L ++ QY ++P + I + IC+ + N
Sbjct: 259 DEKFKTCTPIKDSIENDLDMANFFENLAGNFAGVVQYNKDNSPHATITIDDICDVMLNTT 318
Query: 313 --------NCGDDILCKIAAGVV---EADSLEYDGNNSRCYINEDRTGDESDEGWEWQSC 361
+D+L K + + D + D N ++ D + W +Q+C
Sbjct: 319 AGPPVTRLGLVNDMLLKESNTTCLDYKYDKMVADMKN----VSWDSETAKGMRQWTYQTC 374
Query: 362 SEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV-LTYYGGHDIKLILRRST 420
E D + + + + +I+ C + + + ++ L +D L+ T
Sbjct: 375 HEFGFYQTSDNPADTFGDRFGVDFFIRQCMDVFSKNMNAKFLKLVVSATNDNYGALKPKT 434
Query: 421 SNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTE 480
+N+++ +G DP+ G VK+ + + + +HC ++ E K+DP LV R
Sbjct: 435 TNVLYVHGSIDPWHALGLVKSTNAALPTIY--IEGTAHCANMYEPVKTDPPQLVAARNKI 492
Query: 481 VKIMQGWITQY 491
+K + + Y
Sbjct: 493 LKFLAKLLDGY 503
>gi|410910698|ref|XP_003968827.1| PREDICTED: thymus-specific serine protease-like [Takifugu rubripes]
Length = 493
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 126/473 (26%), Positives = 209/473 (44%), Gaps = 61/473 (12%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F+ +++Q +DHFS ++ QRY ++ F+ + P+ +G E P + G
Sbjct: 49 FDEQWFSQRLDHFSADSREWK---QRYFLSQAFYKP-DGPVFLMIGGEGPANPAWMQYGT 104
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
A + AL ++LEHR+YGKS P + + N R + +S QALAD A I +
Sbjct: 105 WLTYAEKLGALCLMLEHRFYGKSRP--TSDLSTDNLR---FLSSRQALADLAHFRTTIAE 159
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
T+A +A G SY G LA WFRLKYPH+V ++A+SAP+ + Y VV +
Sbjct: 160 ALGLTNAKWVAFGGSYPGSLAAWFRLKYPHMVHAAVATSAPV---RATVNFPEYLEVVWR 216
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG--LDTV-- 286
EC +KK+ + + +P ++K F C+ L+ TE+ L+T+
Sbjct: 217 SLASVDVECPLLVKKASDTLAELLKEPKTYDNITKDFNLCSKLQIQTEMDSAQFLETLAG 276
Query: 287 -YSEAAQYD---------TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDG 336
+ + QY+ +N+ +K +C + + + G+ A D+L
Sbjct: 277 NFMDVVQYNEDNRAFEGVVGTNVTIKVLCGMMRDG-SVGEPYARYAAVARFMLDTLSIKC 335
Query: 337 NNS------RCYINEDRTGDESDEG--WEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYI 387
+S R N G + G W +Q+C+E D N + P L +
Sbjct: 336 LDSSFDAYVRDMTNTSWDGPAAGGGRQWVYQTCAEFGFFQSSDSPNQPFTGFP--LMFQV 393
Query: 388 KNCKEQYGVSPR-----PSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
K C++ Y +S + YYGG+DI+ +S I+F+NG DP+ G +
Sbjct: 394 KQCEQFYNISAEMVAEAVAQTNEYYGGYDIR------SSKIVFANGDVDPWHALGITQ-- 445
Query: 443 HFFDLSFSQDLNL-----GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
++DL +HC ++ A+ D L R ++Q W+ Q
Sbjct: 446 -----DITRDLPAVFIQGTAHCANMYPARSEDLPQLTLARDHIFLLLQQWLKQ 493
>gi|383865215|ref|XP_003708070.1| PREDICTED: putative serine protease K12H4.7-like [Megachile
rotundata]
Length = 479
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 131/461 (28%), Positives = 214/461 (46%), Gaps = 64/461 (13%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF+ P +T+ RY+ NSK+ G PI+ +G E I +G YE A
Sbjct: 51 QPLDHFN--PRDNRTWSMRYLENSKYHKEG-GPIMIMIGGEWEISTGFLTTGLMYEIAST 107
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
A++ EHRYYG+S P + + KN + Y + QALAD A + K++ + ++
Sbjct: 108 HGAMMYYTEHRYYGQSKP--TEDISSKNLQ---YLSVDQALADLAYFIETKKEQDHLRNS 162
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
VI IG SY G +A W RLKYPH++ G+LASSAP+ D + Y+ VV++ R +E
Sbjct: 163 TVIVIGGSYAGSMAAWARLKYPHLIQGALASSAPVFAKAD---FYEYYEVVTESIRRQNE 219
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP--LKNTTELKDGLDTV---YSEAAQ 292
+C IK ++ ++ + +G L F C +K+ ++ ++T+ ++ Q
Sbjct: 220 KCADDIKAAFDAVEKLLFTKNGPKRLKTYFHLCDAPDVKSHNDIGHLMNTLAEGFAGIVQ 279
Query: 293 YDT--PSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTG- 349
YD + + C+ + A + G + + A V+ + + NN + ++ + G
Sbjct: 280 YDNVEKNQTKIAACCDKM-TATSLGSPL--QRLAHVITDGNKKCIENNYQKFVKQYSNGT 336
Query: 350 --DESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG---------VS 397
++ W +Q+CSE KNS++ + L + C + YG S
Sbjct: 337 WKNDISRQWYYQTCSEFGYYQTTNSKNSIF-GSLFPLRFFTDLCVDLYGDYYNENFLDTS 395
Query: 398 PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL-- 455
R + ++ YGG LR N+IF+NG DP+ + LS Q+LN
Sbjct: 396 IRRTNIM--YGG------LRPDLRNVIFTNGDIDPWHK-----------LSVLQNLNADS 436
Query: 456 -------GSHCLDL-DEAKKSDPDWLVQQRKTEVKIMQGWI 488
SHC DL + +D LV R KI+ W+
Sbjct: 437 PAILIKGSSHCRDLYSDNLDTDAKDLVNARANVRKIIGTWL 477
>gi|290562587|gb|ADD38689.1| serine protease K12H4.7 [Lepeophtheirus salmonis]
Length = 483
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 135/501 (26%), Positives = 232/501 (46%), Gaps = 49/501 (9%)
Query: 12 LLMVIFVSTSFHANGLKLRPRLGRIRRSRIL---EQKDSNHGFETFFYNQTIDHFSYGPE 68
LL V +S ++ NG + + + ++L E KD++ T FY+QT+DHF+ +
Sbjct: 7 LLFVCLMSIAYAQNG-RDSFLIKLMDMKKVLSPPELKDTSR-ISTSFYDQTLDHFN--TK 62
Query: 69 SYQTFPQRYVINSK-FWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEH 127
+ + + QRY +N + F N P+ +G E G YE A + AL++ LEH
Sbjct: 63 NKKAWKQRYFVNEENFKDKENGPVFLKIGGEGTASIGSMKYGSWYEYAQKVGALMIQLEH 122
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYG 187
R+YG+S P + + +N + Y S QA+ D + HIK+KY+ + I +G SY
Sbjct: 123 RFYGESRP--TENLSTENLK---YLTSQQAIEDIVEFIAHIKEKYDIPNNKWITLGGSYP 177
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSW 247
G L+ W R YP ++ G+L+SSAP+ D + Y +V+ D R +C + +
Sbjct: 178 GSLSLWMRSLYPELIAGALSSSAPVEAKVD---FEEYLGIVNNDMRIRDPDCPAAVIEGI 234
Query: 248 SEIDN-IASKPDGLAILSKKFKTCTPL-----KNTTELKDGLDTVYSEAAQYD-TPSNIP 300
E + I S +G ++K +K C K+ L + ++ A+QYD T S
Sbjct: 235 KETEALINSGKEGWQKVAKIYKLCPGWSGDNEKDVKTLFGSIVETFAGASQYDSTLSTND 294
Query: 301 VKRICNAIENAPNCGDDILCKIAAGVVEAD-----SLEYDGNNSRCYINEDRTGDESDEG 355
V ++C+ ++N+ N GD + K+A ++ + +++Y+ E D+
Sbjct: 295 VSQLCSHMKNS-NFGDTNMEKLAGTLIAVNGGSCINVKYEDFIDFMRNEEWSVDDDGYRQ 353
Query: 356 WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT-------YYG 408
W +Q+C+E + + P + +++ C + YG V + +YG
Sbjct: 354 WIFQTCNEFGWYQTGNLWGSFLP----VEFFVEQCTDVYGAEFTSEKVYSSAKYSNDFYG 409
Query: 409 GHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKS 468
+ L SN I ++G DP+ G + + +F +N SHC DL A
Sbjct: 410 AKNPSL------SNTIITHGSFDPWHPMGILNDMNDSVKAFV--INGTSHCFDLQPANPL 461
Query: 469 -DPDWLVQQRKTEVKIMQGWI 488
D D L RKT + ++ WI
Sbjct: 462 FDSDQLTHVRKTTFEYIKKWI 482
>gi|66803795|ref|XP_635725.1| hypothetical protein DDB_G0290409 [Dictyostelium discoideum AX4]
gi|60464058|gb|EAL62220.1| hypothetical protein DDB_G0290409 [Dictyostelium discoideum AX4]
Length = 469
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 122/462 (26%), Positives = 200/462 (43%), Gaps = 63/462 (13%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID----DNIQL 107
E+ + Q +DH+ Y + TF QRY++ ++ G + PI +L EAP+ +Q+
Sbjct: 48 ESLIFEQNVDHYDYFNNN--TFKQRYIVVDDYFTG-DGPIFFYLAGEAPMGFFGFQEVQV 104
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
+ A F AL ++LEHRYYG+S P + Y S QAL+D A+ L
Sbjct: 105 VNW----AQDFGALFIVLEHRYYGESYPVDDLST-----HNLKYLTSQQALSDAANFLST 155
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
K N V+ G SY G L+ WFRLKYP++V+ S+A S P+L + +T Y++
Sbjct: 156 YKQDNNLIDNQVVVFGCSYSGALSAWFRLKYPNLVVASVAPSGPVLAQLN---YTGYYAQ 212
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
S +C +++ +EI + + G L K F +C L + + L ++
Sbjct: 213 FSN---SAQPDCVAATQQATNEIMQLIANESGRKQLEKTFNSCHSLDDPRDQYYFLYSIT 269
Query: 288 SEAAQYDTPSNIP---VKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN 344
D +N P + C + N ++ + G + + + +I
Sbjct: 270 DALGGSDQMNNPPTWILNSTCQMLLQNTNYVNNWAQIVNVGQTQCNDFRL-----KSFIE 324
Query: 345 EDR---TGDESD-EGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY---GVS 397
+ R D SD W +Q+C E + P N+ + K C+E Y G++
Sbjct: 325 QLRDISINDNSDNRMWTYQTCVEFGYFSTAYPGTSVFPPVLNVEEQTKWCEEIYDIPGMT 384
Query: 398 PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGS 457
P YYGG +I+ SNI+F+NG+ DP +H LS ++D G+
Sbjct: 385 PNIDATNNYYGGQNIQ------GSNIMFTNGLLDP---------WHL--LSVNEDNQAGT 427
Query: 458 ---------HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
HC L DP L R+ + ++ ++Q
Sbjct: 428 VKAVTYEAGHCGSLIATTNDDPISLTNARQEVLSFLKLVLSQ 469
>gi|114605978|ref|XP_001136226.1| PREDICTED: thymus-specific serine protease isoform 2 [Pan
troglodytes]
Length = 514
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 136/507 (26%), Positives = 216/507 (42%), Gaps = 76/507 (14%)
Query: 17 FVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQR 76
F +S H GL L P + + LEQ +D F+ ++F QR
Sbjct: 37 FQESSAHGLGLSLGPGAAALPKVGWLEQ--------------LLDPFNV--SDRRSFLQR 80
Query: 77 YVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPF 136
Y +N + W G + PI LG E + + G A + AL++ LEHR+YG S+P
Sbjct: 81 YWVNDQHWVGQDGPIFLLLGGEGSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPA 140
Query: 137 GSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAPVIAIGASYGGELATWFR 195
G L+ A+ R + +S ALAD S L + +N ++ +P I G SY G LA W R
Sbjct: 141 G----GLEMAQLR-FLSSRLALADVVSAHLALSRLFNISSSSPWICFGGSYAGSLAAWAR 195
Query: 196 LKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKKSWSEID 251
LK+PH++ S+ASSAP+ D ++ Y+ VVS+ T S EC + +++E++
Sbjct: 196 LKFPHLIFASVASSAPVRAVLD---FSEYNDVVSRSLMSTAIGGSLECRAAVSVAFAEVE 252
Query: 252 -NIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVYSEAAQYDTPSNIP--VKRIC 305
+ S A L + C PL +N EL L + QYD + P V+++C
Sbjct: 253 RRLRSGGAAQAALQTELSACGPLGRAENQAELLGALQALVGGVVQYDGQTGAPLSVRQLC 312
Query: 306 --------NAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYI--NEDRTGDESDEG 355
N + P CG +I + L + + + E + D
Sbjct: 313 GLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQKCLSFSRAETVAQLRSTEPQLSGVGDRQ 372
Query: 356 WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP-----SWVLTYYGGH 410
W +Q+C+E + + + L + C++ +G+S + +YYGG
Sbjct: 373 WLYQTCTEFGFYVTCENPRCPFSQLPALPSQLDLCEQVFGLSALSVAQAVAQTNSYYGGQ 432
Query: 411 DIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN---------LGSHCLD 461
+ ++F NG DP +H LS +Q L GSHCLD
Sbjct: 433 T------PGANKVLFVNGDTDP---------WHV--LSVTQALGSSESTLLIRTGSHCLD 475
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ + SD L R+ + +Q W+
Sbjct: 476 MAPERPSDSPSLRLGRQNIFQQLQTWL 502
>gi|195492404|ref|XP_002093975.1| GE21585 [Drosophila yakuba]
gi|194180076|gb|EDW93687.1| GE21585 [Drosophila yakuba]
Length = 508
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 124/465 (26%), Positives = 213/465 (45%), Gaps = 41/465 (8%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS-PILAFLGAEAPIDDNIQLSGF 110
E ++ Q +DHF + +T+ QRY +N+ ++ +S P+ +G E G
Sbjct: 55 EDLWFEQRLDHFKSSDK--RTWQQRYFVNADYYRNDSSAPVFLMIGGEGEASAKWMREGA 112
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
A F AL + LEHR+YGKS P + + N R Y +S QAL D AS + +K
Sbjct: 113 WVHYAEHFGALCLQLEHRFYGKSHP--TADLSTDNLR---YLSSEQALEDLASFVTAMKV 167
Query: 171 KYN-ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
K+N A IA G SY G LA W R K+P ++ GS++SS P+L D + Y VV
Sbjct: 168 KFNLADGQKWIAFGGSYPGSLAAWAREKFPQLIHGSISSSGPLLAEVD---FKEYFEVVK 224
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-------LKDG 282
EC + + +S+++++ + G L +KFKTCTP+K++ E +
Sbjct: 225 ASLAAYKPECVEAVTRSFAQVEILLKHMIGQRSLDEKFKTCTPIKDSIENELDMANFFEN 284
Query: 283 LDTVYSEAAQY---DTP-SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNN 338
L ++ QY ++P + I + IC+ + N G + G+V L+
Sbjct: 285 LAGNFAGVVQYNKDNSPHATITIDDICDVMLNT-TAGPPV---TRLGLVNDMLLKESNTT 340
Query: 339 SRCYINEDRTGD--------ESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYI 387
Y E D E+ +G W +Q+C E + + + + + +I
Sbjct: 341 CLDYKYEKMVADMKNVSWDSETAKGMRQWTYQTCHEFGFYQTSENPADTFGDRFGVDFFI 400
Query: 388 KNCKEQYGVSPRPSWV-LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
+ C + + + ++ L G +D L+ +T+N+++ +G DP+ G VK+ +
Sbjct: 401 RQCMDVFSKNMDAKFLQLVVSGTNDFYGALKPNTTNVLYVHGSIDPWHALGLVKSSNPAL 460
Query: 447 LSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+ + +HC ++ E K+DP LV R +K + + Y
Sbjct: 461 PTIY--IEGTAHCANMYEPVKTDPPQLVAARNKILKFLAKLLDGY 503
>gi|66821261|ref|XP_644128.1| hypothetical protein DDB_G0274663 [Dictyostelium discoideum AX4]
gi|60472271|gb|EAL70224.1| hypothetical protein DDB_G0274663 [Dictyostelium discoideum AX4]
Length = 487
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 118/460 (25%), Positives = 208/460 (45%), Gaps = 60/460 (13%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN- 114
+ +DH Y P++ TF Q++ +N ++ G SP+ LG E P+ + F +
Sbjct: 58 FTNRVDH--YDPQNRNTFKQKFYVNDTYYTPG-SPVFYILGGEGPVGASYVTGHFVFNQY 114
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A +F AL+V +EHR+YG S+P GS +L+N + Y + QALADYA+ + + KYN
Sbjct: 115 AQKFNALLVAIEHRFYGDSIPMGS--LSLENLK---YLTTQQALADYAAFVPFLTQKYNT 169
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
+ I+ G SY G L+ W RLKYP ++ ++A+SAP+ D + Y VVS+
Sbjct: 170 GSSKWISFGGSYSGNLSGWLRLKYPQLISAAIATSAPVKAQLD---FPEYFEVVSQSIGP 226
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYS---EAA 291
T I ++ + + N + + F C P+ + ++ ++++ S E
Sbjct: 227 TCSAIVSNITQTVTTMLNNGQNDQ----VQQMFSACDPIVSKLDIATFMESLSSGITETV 282
Query: 292 QYDTPSN----IPVKRICNAIENAPNCGDDIL------CKIAAGVVEADSLEYDGNNSRC 341
QY+ +N + +C E + + + + + + S E S
Sbjct: 283 QYNLDNNNYTFTNITAMCERFEQSSDPMKEFIDFNNEYNQFSGSQCTLSSYE----KSIQ 338
Query: 342 YINED--RTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN----LTKYIKNCKEQYG 395
Y+ ++ + S W WQ C+E G + Q +P++ L + + C + +G
Sbjct: 339 YLQSSNYKSANASSRSWNWQCCTE----YGYWQTGSSQNQPFSSAITLEYFTQMCTDIFG 394
Query: 396 -----VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
P ++L YGG +I+ +N+I+ G DP+S V + S S
Sbjct: 395 PKGFVYQPAIQYILNDYGGTNIQ------ATNVIYERGTIDPWS----VLSVQSPPNSES 444
Query: 451 QDLNL--GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
Q + GSHC L K D + + R+ E+ ++ +
Sbjct: 445 QVFLIQGGSHCSALYPPKPDDLPGVTEAREMEIALISSIV 484
>gi|224006151|ref|XP_002292036.1| hypothetical protein THAPSDRAFT_29166 [Thalassiosira pseudonana
CCMP1335]
gi|220972555|gb|EED90887.1| hypothetical protein THAPSDRAFT_29166 [Thalassiosira pseudonana
CCMP1335]
Length = 553
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 124/474 (26%), Positives = 209/474 (44%), Gaps = 47/474 (9%)
Query: 54 FFYNQTIDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSG 109
F+ +Q ++HF S P+ + + RY ++K++ G SPI +G E +D I
Sbjct: 90 FYADQLVNHFHTDRSITPKDAK-WSNRYYQSTKYYKGPGSPIFLIVGGEGALDSGILYPF 148
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
+ A +F A ++ +EHR+YG P R+A + QALAD + H K
Sbjct: 149 VSEHLARRFGAAVIQIEHRFYGPFQPIVGREATVLELLELL--TPQQALADMVQLTKHFK 206
Query: 170 --------DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
D+++ + PVI++G SY G L+ FRL YP V S ASSAP+ Y
Sbjct: 207 ELLGCSEFDRHSKKYCPVISVGGSYPGFLSAMFRLVYPDFVDISYASSAPLKLYDQTANQ 266
Query: 222 TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
Y+ +V+K TS C ++++ + E + K + K C + + E D
Sbjct: 267 NVYYDIVTKAAEHTSPGCAKSVRDALEEASELILKAPSVIDAVKSMSMC--VDSIPEYID 324
Query: 282 GLDTVYSEA--------AQYDTPSNIPVK-----RICNAIENAPNCGDDILCKIAAGVVE 328
L T+ + A YD + P K + C ++ + + + K
Sbjct: 325 NLKTLKEDVMMAIGFSFADYDMDAYPPGKDLGLYKACRVFQHNKSSSMEKVAKF------ 378
Query: 329 ADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIK 388
E G ++ + E T +E ++Q C+ +V P+G SM+ W K
Sbjct: 379 ---FELLGTDTE-FEREYPTLVGEEEVPDFQLCTTLVDPIGFSSKSMFPKRKWTYEGLTK 434
Query: 389 NCKEQYG--VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
C+ +YG V+P+P ++ G D ++ + S I+F+NG++D +S +++T +
Sbjct: 435 YCQSRYGSEVTPQPYALVEDMGFDD---LVGKGASRILFTNGLQDMWSGASYLETVSEAN 491
Query: 447 LSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
S + G+H DL SD D R VKI + ++ D+ K NK
Sbjct: 492 EILSLNFENGAHHSDLSHVGPSDNDS-EDIRLGFVKITN-ILAKWLDEIKVENK 543
>gi|403308857|ref|XP_003944858.1| PREDICTED: thymus-specific serine protease isoform 1 [Saimiri
boliviensis boliviensis]
Length = 512
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 118/447 (26%), Positives = 192/447 (42%), Gaps = 48/447 (10%)
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
++F QRY +N + W G + PI LG E + + G A + AL++ LEHR+Y
Sbjct: 73 RSFLQRYWVNDQHWTGQDGPIFLHLGGEGSLGPGSVMKGHPAALAPAWGALVISLEHRFY 132
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAPVIAIGASYGGE 189
G S+P G L A+ R + +S ALAD S L + +N ++ +P I G SY G
Sbjct: 133 GLSIPAG----GLDMAQLR-FLSSRHALADVVSARLALSRLFNVSSSSPWICFGGSYAGS 187
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKK 245
LA W RLK+PH++ S+ASSAP+ D ++ Y+ VVS+ + S EC +
Sbjct: 188 LAAWARLKFPHLIFASVASSAPVRAVLD---FSEYNDVVSRSLKSAAIGGSLECQAAVST 244
Query: 246 SWSEIDNIASKPDGLAILSK-KFKTCTPL---KNTTELKDGLDTVYSEAAQYDTPSNIPV 301
++SE++ + + C L ++ EL L + A QYD + P+
Sbjct: 245 AFSEVERRLRAGGAARAALQAELNACGSLSRAEDQAELLGALQALVGGAVQYDGQAGAPL 304
Query: 302 K----------RICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYI--NEDRTG 349
N + P CG +I + L + + + E +
Sbjct: 305 SVRQLCGLLLGGGANRSRSTPYCGLRRAVQIVMHSLGQKCLSFSRAETVAQLRSTEPQVS 364
Query: 350 DESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP-----SWVL 404
D W +Q+C+E + + + L + C++ +G+SP +
Sbjct: 365 GVGDRQWLYQTCTEFGFYVTCENPRCPFSQLPALPSQLDLCEQVFGLSPLSVAQAVAQTN 424
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL---GSHCLD 461
+YYGG + ++F NG DP W L S+ + L GSHCLD
Sbjct: 425 SYYGGQT------PGANQVLFVNGDTDP-----WHVLSVTQALGSSESVLLIPSGSHCLD 473
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ + SD L R++ + +Q W+
Sbjct: 474 MAPERPSDSPSLRLGRQSIFQQLQTWL 500
>gi|242096522|ref|XP_002438751.1| hypothetical protein SORBIDRAFT_10g025530 [Sorghum bicolor]
gi|241916974|gb|EER90118.1| hypothetical protein SORBIDRAFT_10g025530 [Sorghum bicolor]
Length = 192
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 81/194 (41%), Positives = 112/194 (57%), Gaps = 10/194 (5%)
Query: 304 ICNAIENAPNCGDDILCKIAAGVVEADSLEYD-GNNSRCYINEDRTGDESDEG--WEWQS 360
+C I+ P DIL K+ A A SL Y+ + C ED+ S + W WQ+
Sbjct: 1 MCKIIDGFP-ANADILEKVFA----AASLFYNYTGDQTCNHIEDKDSPRSLDLSYWRWQA 55
Query: 361 CSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRST 420
C+EM++PM SM+ P ++ C +GV PRP W+ T YGG+ I +L+R
Sbjct: 56 CTEMIMPMSSSNESMFPPSTFSYEDRFNYCFRFFGVRPRPHWITTEYGGYKIDKVLKRFG 115
Query: 421 SNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTE 480
SNIIFSNGMRDP+S GG +K ++ + G+H LDL A K DPDW+++QR+ E
Sbjct: 116 SNIIFSNGMRDPWSGGGVLKNISSSIVALVTE--KGAHHLDLRGATKDDPDWVIEQRRQE 173
Query: 481 VKIMQGWITQYYDD 494
V+I+QGWI QYY D
Sbjct: 174 VEIIQGWIDQYYQD 187
>gi|344292454|ref|XP_003417942.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 505
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 143/512 (27%), Positives = 209/512 (40%), Gaps = 87/512 (16%)
Query: 37 RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLG 96
RR+R H + Q +DHFS S Q +PQRY IN F+ G L G
Sbjct: 23 RRTRGSYSSKGMHPSRAGSFPQKLDHFS--KNSSQLWPQRYFINDAFYKPGGPVFLMIGG 80
Query: 97 AEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQ 156
A + + +S A + AL ++LEHR+YG S P G A + H Y +S Q
Sbjct: 81 AWIACESWVSISKTWVTYAERLGALFLLLEHRFYGHSQPTGDLSTA---SLH--YLSSRQ 135
Query: 157 ALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYG 216
ALAD A+ I K T +A G SYGG LA W RLK+P + ++ SSAPI
Sbjct: 136 ALADIANFRTEIAKKMGLTKNNWVAYGCSYGGSLAVWSRLKHPDLFAAAVGSSAPIKAKA 195
Query: 217 DITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNT 276
+ + Y VV + + +CFQT+K+++ ++ + P L + F C LK
Sbjct: 196 N---FYEYLEVVQRSLATHNSKCFQTVKEAFDQVVKMLRFPKYYRKLERDFTLCKRLK-- 250
Query: 277 TELKDGLDTVY---------SEAAQYD------------TPSNIPVKRICN--AIENAPN 313
L +D Y A Q++ TP + + C+ A+ +
Sbjct: 251 --LYSAMDKAYFLERLIFPVELAVQHNKNEKNDKILSPSTPGTLFIPPTCSRIALGEQLS 308
Query: 314 CGDDILCKIAAGV---------VEADSLEYDGNNSRCYINEDRTG-----DES------- 352
D LC A V L Y S C+ RT D S
Sbjct: 309 FSMDELCDAMANTSLGSPYHRYVRIILLTYKDKYSPCFAAHYRTKLKILLDTSINHHDPN 368
Query: 353 -DEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT----- 405
D + +QSC+E D KN + P L+ +++ C + +G + T
Sbjct: 369 IDRQFFYQSCTEFGFFQTTDSKNQPFTGLP--LSYFLQQCSDFFGPKFNNDSLNTGVIST 426
Query: 406 --YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL-----GSH 458
YYGG ++ + S IIF NG DP+ G K S+DL H
Sbjct: 427 NAYYGGFNM------TGSKIIFPNGSFDPWHPLGITK-------DISKDLPAVFIKGAVH 473
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
C D+ E +D L+Q R+ +++Q W+ Q
Sbjct: 474 CADMFEQNDTDSAELIQAREKIFQLLQKWLKQ 505
>gi|440799806|gb|ELR20849.1| protease, serine, 16 (thymus), putative [Acanthamoeba castellanii
str. Neff]
Length = 478
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 129/449 (28%), Positives = 198/449 (44%), Gaps = 76/449 (16%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+ Q +DHF P + +T+ Q+Y++ + G PI FLG EAP++ +
Sbjct: 66 FTQKVDHFD--PLNGKTYQQKYIVTDDNYVPGG-PIFLFLGGEAPVEFFDFQTVLPRSLT 122
Query: 116 HQFKALIVILEHRYYGKSVP---FGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
QF AL + LEHR+YG S+P + + AL ++R QALAD A+ L+ +K
Sbjct: 123 KQFGALYIALEHRFYGVSMPAHDYSTASLALLSSR--------QALADAANFLVSF-NKT 173
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
P + G SY G L+ WFR KYP++V+GS+A S P+ Y + +T Y+ V S
Sbjct: 174 LTNPGPWVVWGCSYSGALSAWFRAKYPNLVVGSVAPSGPV--YASLN-FTQYYGVFST-- 228
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP------LKNTTELKDGLDTV 286
S +C +T+K++ + + S DG L++ + +P L TE G D
Sbjct: 229 -AASPQCVETVKRATAMLMAKLSTADGRKELTE--ISASPQEHYYFLLTLTEAIGGSD-- 283
Query: 287 YSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEY-----DGNNSRC 341
Q+ P P+ CN + + D+L A V +A+ + D N
Sbjct: 284 -----QFQNPPAWPLNTTCNTMMQS----GDLLANWAQVVNQANGPKAPNACNDFNEETS 334
Query: 342 YINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVS---P 398
Y+ R SD W +Q C+E M + P +L +K C+ +GVS P
Sbjct: 335 YLKPLRQPTSSDRSWLFQQCTEFGFFMPTYPGTSVFPL-MDLEHQVKWCQNVFGVSGMTP 393
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN---- 454
YYGG+D++ SNI+F+NG DP+ LS ++DL
Sbjct: 394 NTEGTNAYYGGYDLR------GSNILFTNGDADPWH-----------TLSITKDLPAPAG 436
Query: 455 ------LGSHCLDLDEAKKSDPDWLVQQR 477
HC + + DP L R
Sbjct: 437 VRAVTYAAGHCAPMTQPTSQDPVSLQHAR 465
>gi|432891041|ref|XP_004075519.1| PREDICTED: putative serine protease K12H4.7-like [Oryzias latipes]
Length = 489
Score = 142 bits (359), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 130/474 (27%), Positives = 207/474 (43%), Gaps = 68/474 (14%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E ++ Q +DHF+ G +S + + QRY +N F+ G P+ +G E P + +G
Sbjct: 45 EEQWFTQKLDHFN-GADS-RAWKQRYFLNEAFYKPG-GPVFLMIGGEGPANPAWMKNGTW 101
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
A + AL ++LEHRYYGKS P + + N R Y +S QALAD A + +
Sbjct: 102 LIYAEKLGALCLMLEHRYYGKSHP--TLDLSTNNLR---YLSSRQALADLAHFRTVMGEA 156
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
T+ +A G SY G LA WFRLKYPH+V S+A+SAP+ + Y VV +
Sbjct: 157 QGLTNNKWVAFGGSYPGSLAAWFRLKYPHLVHASVATSAPV---HATVNFPEYLEVVWRS 213
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG-----LDTV 286
+ +C +KK+ + P+ ++K F C+ L+ TE+ L
Sbjct: 214 LASENAKCPLLVKKASDSLVEKLKDPETFDNITKDFNLCSKLQIQTEMDSAYFLEMLAGN 273
Query: 287 YSEAAQYDTPS---------NIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD-- 335
+ + QY+ + NI +K +C+ + + P+ GD A + D+
Sbjct: 274 FMDVVQYNEDNREFEGVQGGNITIKVLCDVMSD-PSLGDAYARYAAVARLLMDTFSLKCL 332
Query: 336 ----GNNSRCYINEDRTGDESDEG--WEWQSCSEMVVPMGKDKNSMYQP-EPWNLTKYIK 388
N R N G ++ G W +Q+C+E D S QP + L +K
Sbjct: 333 DASFSNYLRDMTNTSWEGPSANGGRQWVYQTCTEFGFYQSTD--SPNQPFTGFPLDYQLK 390
Query: 389 NCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
C + Y +S + YYGG++IK +S I+F NG DP+
Sbjct: 391 QCADFYNISAEQVAQAVAQTNEYYGGYNIK------SSRIVFPNGSIDPWHA-------- 436
Query: 444 FFDLSFSQDLNLG---------SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L +QD+ +HC ++ A+ D L R + ++Q W+
Sbjct: 437 ---LGITQDITADLPAVFIKGTAHCANMYPARSQDLPQLSLARDHVLLLLQQWL 487
>gi|397479343|ref|XP_003810983.1| PREDICTED: thymus-specific serine protease [Pan paniscus]
Length = 514
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 127/469 (27%), Positives = 205/469 (43%), Gaps = 62/469 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W G + PI LG E + + G
Sbjct: 61 WLEQLLDPFNV--SDRRSFLQRYWVNDQHWVGQDGPIFLLLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L+ A+ R + +S ALAD S L + +N
Sbjct: 119 APAWGALVISLEHRFYGLSIPAG----GLEMAQLR-FLSSRLALADVVSAHLALSRLFNI 173
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LA W RLK+PH++ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 174 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD---FSEYNDVVSRSLM 230
Query: 234 DT----SEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDT 285
T S EC + +++E++ + S A L + C PL +N EL L
Sbjct: 231 STAIGGSLECRAAVSVAFAEVERRLRSGGAAQAALQTELSACGPLGRAENQAELLGALQA 290
Query: 286 VYSEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYD 335
+ QYD + P V+++C N + P CG +I + L +
Sbjct: 291 LVGGVVQYDGQTGAPLSVRQLCGLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQKCLSFS 350
Query: 336 GNNSRCYI--NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
+ + E + D W +Q+C+E + + + L + C++
Sbjct: 351 RAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCENPRCPFSQLPALPSQLDLCEQV 410
Query: 394 YGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+G+S + +YYGG + ++F NG DP +H LS
Sbjct: 411 FGLSALSVAQAVAQTNSYYGGQT------PGANKVLFVNGDTDP---------WHV--LS 453
Query: 449 FSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+Q L GSHCLD+ + SD L R+ + +Q W+
Sbjct: 454 VTQALGSSESTLLIRTGSHCLDMAPERPSDSPSLRLGRQNIFQQLQTWL 502
>gi|281202858|gb|EFA77060.1| hypothetical protein PPL_09813 [Polysphondylium pallidum PN500]
Length = 487
Score = 142 bits (358), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 130/481 (27%), Positives = 217/481 (45%), Gaps = 55/481 (11%)
Query: 36 IRRSRILEQKD------SNHGFETF---FYNQTIDHFSYGPESYQTFPQRYVINSKFWGG 86
+RR+ ++E+ +HG E ++ Q +DHF+ + QTF QRY++N +W G
Sbjct: 33 LRRNHLIEKAKEESYLLGHHGAENVQYQWFTQRVDHFNQANQ--QTFQQRYIVNDAYWNG 90
Query: 87 GNSPILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKN 145
N P+ L E P+ ++G + N A +F ALIV LEHRY+G S F + + N
Sbjct: 91 -NGPVFFMLNGEGPMSLGT-VTGLQFVNWAQEFGALIVTLEHRYFGAS--FTTEDLSTDN 146
Query: 146 ARHRGYFNSAQALADYASILLHIKDKYNA-THAPVIAIGASYGGELATWFRLKYPHVVIG 204
+ Y +S QALAD A+ I + N + ++ G SY G L +WFR+KYP +V
Sbjct: 147 LQ---YLSSQQALADNAAFRQFIAETLNVPASSQWVSFGGSYSGALTSWFRIKYPALVDY 203
Query: 205 SLASSAPILYYGDITPWTTYHSVVSKDFRDTS--EECFQTIKKSWSEIDNIASKPDGLAI 262
++ASSAP+ + Y VV TS ++C I + +I + DGLA
Sbjct: 204 TVASSAPV---NAEVNFYQYLEVVQNSLLATSNGQQCIDNIAAATGKIQAMLESADGLAS 260
Query: 263 LSKKFKTCTPLKNTTELKDGLDTV---YSEAAQYDTPSNIP-VKRICNAIENAPNCGDDI 318
+S F C PL + ++ + + ++ + QY+ ++ P + +C+ + GD +
Sbjct: 261 VSNMFNLCPPLASQNDVANFMQSLAGNFMGVVQYNLEASGPSTQNLCDMMTAK---GDPL 317
Query: 319 LCKI----AAGVVEADSLEYDGNNSRCYINE--DRTGDESDEG---WEWQSCSEMVVPMG 369
I A E + YD I E + T D + G W + +C+E
Sbjct: 318 TNYISVWNAFSGDECLDVSYD-----TVIEEMLNITNDATTIGGRMWFYMTCTEFGYFQS 372
Query: 370 KDKNSMYQPEPWNLTKYIKNCKEQYGVS--PRPSWVLTYYGGHDIKLILRRSTSNIIFSN 427
D + + + + C + +G P +W T YG L +NI++ N
Sbjct: 373 SDSPNQPFGNLFPIGFSTQQCNDVFGFDFLPNTNWTHTDYGA------LSPVATNILYVN 426
Query: 428 GMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGW 487
G DP+ G + T + S ++ +HC D+ + P LV ++ +Q W
Sbjct: 427 GDIDPWHSLG-ITTNPPTSPTPSLLIHGTAHCADMMIPNQYSPSTLVPAQQIIKSTLQKW 485
Query: 488 I 488
+
Sbjct: 486 L 486
>gi|17944482|gb|AAL48130.1| RH04336p [Drosophila melanogaster]
Length = 508
Score = 142 bits (357), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 128/491 (26%), Positives = 219/491 (44%), Gaps = 48/491 (9%)
Query: 32 RLGRIRR------SRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG 85
R GR+ + S+I + S H E ++ Q +DHF + +T+ QRY +N+ F+
Sbjct: 30 RRGRLTKGFLGEPSKIPTLQRSLHS-EDLWFEQRLDHFKSSDK--RTWQQRYFVNADFYR 86
Query: 86 GGNS-PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALK 144
+S P+ +G E G A F AL + LEHR+YGKS P A
Sbjct: 87 NDSSAPVFLMIGGEGEASAKWMREGAWVHYAEHFGALCLRLEHRFYGKSHP-----TADL 141
Query: 145 NARHRGYFNSAQALADYASILLHIKDKYNATHAPV-IAIGASYGGELATWFRLKYPHVVI 203
+ + Y +S QAL D AS + +K K+N IA G SY G LA W KYP ++
Sbjct: 142 STENLHYLSSEQALEDLASFVTAMKVKFNLGDGQKWIAFGGSYPGSLAAWAHEKYPELIY 201
Query: 204 GSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAIL 263
GS++SS P+L D + Y VV EC + +S+++++ + G L
Sbjct: 202 GSISSSGPLLAEVD---FKEYFEVVKASLAAYKPECVDAVTRSFAQVEILLKHMIGQRSL 258
Query: 264 SKKFKTCTPLKNTTE-------LKDGLDTVYSEAAQY---DTP-SNIPVKRICNAIENAP 312
+KFKTCTP+K++ E + L ++ QY ++P + I + IC+ + N
Sbjct: 259 DEKFKTCTPIKDSIENDLDMANFFENLAGNFAGVVQYNKDNSPHATITIDDICDVMLNTT 318
Query: 313 --------NCGDDILCKIAAGVV---EADSLEYDGNNSRCYINEDRTGDESDEGWEWQSC 361
+D+L K + + D + D N ++ D + W +Q+C
Sbjct: 319 AGPPVTRLGLVNDMLLKESNTTCLDYKYDKMVADMKN----VSWDSETAKGMRQWTYQTC 374
Query: 362 SEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV-LTYYGGHDIKLILRRST 420
E D + + + + +I+ C + + + ++ L +D L+ T
Sbjct: 375 HEFGFYQTSDNPADTFGDRFGVDFFIRQCMDVFSKNMNAKFLKLVVSATNDNYGALKPKT 434
Query: 421 SNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTE 480
+N+++ +G DP+ G VK+ + + + +HC ++ E K+DP LV R
Sbjct: 435 TNVLYVHGSIDPWHALGLVKSTNAALPTIY--IEGTAHCANMYEPVKTDPPQLVAARNKI 492
Query: 481 VKIMQGWITQY 491
+K + + Y
Sbjct: 493 LKFLAKLLDGY 503
>gi|167518842|ref|XP_001743761.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777723|gb|EDQ91339.1| predicted protein [Monosiga brevicollis MX1]
Length = 448
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 120/459 (26%), Positives = 210/459 (45%), Gaps = 50/459 (10%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN-IQLSGFTYEN 114
++QT+DHF+ P+ +TF Q+Y +N F+ G P+ LG E P ++ +
Sbjct: 10 FDQTLDHFN--PQDTRTFKQQYQVNRTFYKAG-GPLFLMLGGEGPASPEWLETNTAIMLY 66
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A Q A++ +EHR+YG+S PF ++ N R Y +S QALAD A+ + + N
Sbjct: 67 AQQLNAVVAQIEHRFYGESQPF--EDLSVDNLR---YLSSEQALADAANFIQSFLEM-NG 120
Query: 175 THAP--VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
A V++ G SY G L+ + R KYPHVV ++A+S+P+L D + YH VV +
Sbjct: 121 MPADTRVVSFGGSYSGALSAFLRTKYPHVVYAAVATSSPVLAKLD---YVEYHEVVGRSM 177
Query: 233 RDTS--EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC------------TPLKNTTE 278
++ + C IK + S++D + + L++ K C T L N
Sbjct: 178 GTSTHGQACVDQIKGALSKVDAMLADSSQWNQLAQDMKICSDTDLNVDLDKQTFLSNLIG 237
Query: 279 LKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGD-DILCKIAAGVVEADSLEYDGN 337
DG+ +A P+ V+ +C+ + AP D D + A ++ A +E +
Sbjct: 238 NIDGIVQYNLDAKFQGRPT---VQSMCDIM--APIEQDYDAFVALNAYLLNASDMECNDG 292
Query: 338 NSRCYINEDRTGDESDE------GWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCK 391
+ + ++ + R S + W +Q+C E D + L+ + + C
Sbjct: 293 SYQSFVTDLRNTSLSSDFAGGTRQWIYQTCMEFAYFQTTDASDQPFGVGVPLSYFEQQCV 352
Query: 392 EQYGVSPRP--SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
+ YG+ P P +W +YGG + + + II+ NG DP+ V + + +
Sbjct: 353 DGYGLPPVPNVNWTNEFYGGQQV------AGTRIIYPNGSIDPW-HALSVTSNTTIEDTL 405
Query: 450 SQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ +N +HC ++ SD L R + + +Q W+
Sbjct: 406 AIFINGTAHCANMYPPSSSDLPGLTAARTSILNTLQTWL 444
>gi|441593763|ref|XP_004087106.1| PREDICTED: thymus-specific serine protease isoform 2 [Nomascus
leucogenys]
Length = 541
Score = 141 bits (356), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 128/491 (26%), Positives = 206/491 (41%), Gaps = 79/491 (16%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W G + PI LG E + + G
Sbjct: 61 WLEQLLDPFNV--SDRRSFLQRYWVNDQHWVGQDGPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAA---LKNARHRGYFNSA-------------QAL 158
A + AL++ LEHR+YG S+P G + A ++RH +S L
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMAQLRFLSSRHAMGKSSGIPSDEDRPSPPFDPRL 178
Query: 159 ADYASILLHIKDKYN-ATHAPVIAIGASYGGELATWFRLK------YPHVVIGSLASSAP 211
AD S L + +N ++ +P I G SY G LA W RLK +PH++ S+ASSAP
Sbjct: 179 ADVVSARLALSRLFNVSSSSPWICFGGSYAGSLAAWARLKVLRLLRFPHLIFASVASSAP 238
Query: 212 ILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKKSWSEID-NIASKPDGLAILSKK 266
+ D ++ Y+ VVS+ T S EC + +++E++ + S A L +
Sbjct: 239 VRAVLD---FSEYNDVVSRSLMSTEIGGSLECRAAVSVAFAEVERRLRSGGAAQAALRTE 295
Query: 267 FKTCTPL---KNTTELKDGLDTVYSEAAQYDTPSNIP--VKRIC--------NAIENAPN 313
C PL +N EL L + QYD + +P V+++C N + P
Sbjct: 296 LSVCGPLGRAENQAELLGALQALVGGVVQYDGQAGVPLSVRQLCGLLLGGGGNRSHSTPY 355
Query: 314 CGDDILCKIAAGVVEADSLEYDGNNSRCYI--NEDRTGDESDEGWEWQSCSEMVVPMGKD 371
CG KI + L + + + E + D W +Q+C+E + +
Sbjct: 356 CGLRRAVKIVLHSLGQKCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCE 415
Query: 372 KNSMYQPEPWNLTKYIKNCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFS 426
+ L + C++ +G+S + +YYGG + ++F
Sbjct: 416 NPRCPFSQLPALPSQLDLCEQVFGLSALSVAQAVAQTNSYYGGQT------PGANQVLFV 469
Query: 427 NGMRDPFSRGGWVKTYHFFDLSFSQDLN---------LGSHCLDLDEAKKSDPDWLVQQR 477
NG DP +H LS +Q L GSHCLD+ + SD L Q R
Sbjct: 470 NGDTDP---------WHV--LSVTQALGSSESALLIRTGSHCLDMAPERPSDSPSLRQGR 518
Query: 478 KTEVKIMQGWI 488
+ + +Q W+
Sbjct: 519 QNIFRQLQTWL 529
>gi|328718795|ref|XP_001947661.2| PREDICTED: putative serine protease K12H4.7-like isoform 1
[Acyrthosiphon pisum]
gi|328718797|ref|XP_003246580.1| PREDICTED: putative serine protease K12H4.7-like isoform 2
[Acyrthosiphon pisum]
Length = 501
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 141/521 (27%), Positives = 232/521 (44%), Gaps = 71/521 (13%)
Query: 13 LMVIFVSTSFHANGLK-LRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQ 71
++ +FV S ANG+K + LG L ++ E ++ Q +DHF+ P + +
Sbjct: 7 ILWLFVWISLPANGIKNILIGLGEPNSPESLRSMNT----EDEWFIQKLDHFN--PTNNR 60
Query: 72 TFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYG 131
T+ QRY +N + + + P+ +G E I D SG + A +F AL LEHRYYG
Sbjct: 61 TWKQRYQVNLENYKN-DGPVFLMIGGEGKISDKWMHSGAWIDYAKEFNALCFQLEHRYYG 119
Query: 132 KSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAPVIAIGASYGGEL 190
+S P + + Y +S QALAD A +++IK KYN + A +A G SY G L
Sbjct: 120 ESHPTEDMSTS-----NLVYLSSDQALADLAEFIVNIKIKYNIPSTAKWVAFGGSYPGTL 174
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI 250
A W R+KYPH++ +++SS P+L D + Y VV + EC IK++ I
Sbjct: 175 AAWLRMKYPHLIHAAVSSSGPLLAKID---FKEYFMVVENALATYNPECVSQIKQANQMI 231
Query: 251 DNIASKPDGLAILSKKFKTCTPL-----KNTTELKDGLDTVYSEAAQYDTP--------- 296
+ G I+ KKFK C PL K+ + L + L ++ QY+
Sbjct: 232 NYYLKTDQGAKIIEKKFKLCDPLDRKNDKDVSYLFEILADNFAVIVQYNKDNRHYKDPER 291
Query: 297 SNIPVKRICNAIENA--PNCGDDILCKIAAGVVEADSLEYDGNNSRC-----------YI 343
S++ ++ +C+ + N P D + AA + SL N + C Y+
Sbjct: 292 SSVTLETLCDIMVNKSIPTPLD----RYAAVNNKLLSL----NKADCQDNIYSQLIDLYL 343
Query: 344 NE--DRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPS 401
N D + W +Q+C+E ++ + + +I C++ +G S
Sbjct: 344 NTSWDSSAAGGGRQWTYQTCTEFGFYQTSSQDDHAFGHNFPIDFFINMCQDIFGKSYNSE 403
Query: 402 WVL-------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
+ T +G +I+ S +IF +G DP+ G K +++ +N
Sbjct: 404 LLTAAVERTNTMFGELNIR------DSRVIFVHGSVDPWHALGITKARTKNNVAIF--IN 455
Query: 455 LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
SHC ++ SD L Q R T ++ W+ + +DF
Sbjct: 456 GTSHCANMYPPASSDLPELTQARTTIRSYLREWLAE--NDF 494
>gi|15233057|ref|NP_189509.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|332643953|gb|AEE77474.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 199
Score = 141 bits (356), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 107/173 (61%), Gaps = 8/173 (4%)
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
LA WF+LKYP++ +G+LASSAP+LY+ D P Y +V+K F++ S+EC I KSW E
Sbjct: 24 LAAWFKLKYPYIALGALASSAPLLYFEDTLPKHGYFYIVTKVFKEMSKECHNKIHKSWDE 83
Query: 250 IDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIE 309
ID IA+KP+ L+ILSK FK C PL + ELK + +Y+ AQY + + V R+C AI
Sbjct: 84 IDRIAAKPNSLSILSKNFKLCNPLNDIIELKSYVSYIYARTAQY-SDNQFSVARLCEAIN 142
Query: 310 NA-PNCGDDILCKIAAGVVEA-DSLEYDGNNSRCYINEDRTGDESDEGWEWQS 360
+ PN D+L +I AGVV + ++ G +S Y T D D W WQ+
Sbjct: 143 TSPPNTKSDLLDQIFAGVVASRGNISCYGMSSPSY---QMTND--DRAWGWQT 190
>gi|322785978|gb|EFZ12594.1| hypothetical protein SINV_07473 [Solenopsis invicta]
Length = 494
Score = 141 bits (355), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 124/478 (25%), Positives = 215/478 (44%), Gaps = 70/478 (14%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
++ Q +DHF+ P + QRY +NS F+ N PI +GAE + + G E
Sbjct: 49 WFTQYLDHFN--PTDVHVWKQRYFVNSDFYKP-NGPIFLMIGAEGIANPKWMIEGQWIEY 105
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A +F A+ LEHR+YGKS P + ++KN Y +S QALAD A + + Y
Sbjct: 106 AKEFGAMCFYLEHRFYGKSHP--TSDLSVKNLV---YLSSEQALADLAYFIQSVNIGYKF 160
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
+A I G SYGG LA W R KYPH+V G++++S P+L D + Y+ VV+ +
Sbjct: 161 PNNAKWIVFGGSYGGSLAAWMRAKYPHLVHGAVSASGPLLAQID---FEEYYIVVTNALK 217
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK-------NTTELKDGLDTV 286
SE+C I+ + S+I+ + G + KKFK C P+ + + L + L +
Sbjct: 218 GYSEKCVDVIQDANSQINMLLHHVTGQQQIQKKFKLCDPIDPGHTKKVDISNLYETLASN 277
Query: 287 YSEAAQYD-------TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS 339
++ QY+ +NI ++ +C+ + D+ + K + ++ D
Sbjct: 278 FAGIVQYNKDNRQSSQTANITIETVCDILV------DEKIGKSIDRLAYVSNMILDATKE 331
Query: 340 RC-------YINEDRT---GDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKY 386
+C I+E R E EG W +Q+C+E S + + +
Sbjct: 332 KCLDYRYDKMIHELRNVTWASEQAEGGRQWMYQTCTEFGFFQTSTGQSNIYGDTFPAAFF 391
Query: 387 IKNCKEQYGVSPRPSWVLTYYGGHDIKLI---LRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
++ C + +G PR + L G ++ L +N++F +G DP +H
Sbjct: 392 VQQCLDIFG--PRYNLNLLTSGVTRTNILYGALNLQVTNVVFVHGSIDP---------WH 440
Query: 444 FFDLSFSQD-------LNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDD 494
++ S + + +HC ++ + D + Q ++ ++I QG I Q+ +
Sbjct: 441 VLGITESANPQAPAIYIKGTAHCANMYPPSEHD---MPQLKEARIQI-QGLIKQWLQN 494
>gi|320168054|gb|EFW44953.1| thymus-specific serine protease [Capsaspora owczarzaki ATCC 30864]
Length = 489
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 130/478 (27%), Positives = 207/478 (43%), Gaps = 59/478 (12%)
Query: 32 RLGRIRRSRILEQKDSNHGFETF------FYNQTIDHFSYGPESYQTFPQRYVINSKFWG 85
+LG +R+L+ +DS+ G + ++ Q +DHF+ E+ T+ Q+Y +N FWG
Sbjct: 28 KLGTSVTNRLLDAQDSHAGRRSLGDVPKQWFTQKLDHFNTFDET--TWLQKYYVNQTFWG 85
Query: 86 GGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKN 145
G PI +G E PIDD + A +KAL+V LEHR+YG+SVP + ++ N
Sbjct: 86 GPGYPIFFMIGGEGPIDDRYVTAMDYVIYARTYKALMVTLEHRFYGESVP--TADYSVAN 143
Query: 146 ARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGS 205
R + S QALAD A+ +I ++NA + + G SY G L+ W RLKYP++ GS
Sbjct: 144 LR---FLTSQQALADAANFAANITLQFNAPTSSWVTFGGSYPGCLSAWARLKYPNLFQGS 200
Query: 206 LASSAPILYYGDITPWTTYHSVVSKDFR-DTSEECFQTIKKSWSEIDNIASKPDGLAILS 264
+++S P+ + ++ + Y VV C I ++ ++I + +P GL+ +
Sbjct: 201 ISTSGPV--HAELN-FVQYLEVVQASLEYFGGTTCSSLITQATNKIQTLLQQPGGLSSVG 257
Query: 265 KKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA- 323
K F C PL + ++ + + T+ + V + N P D ++
Sbjct: 258 KLFSVCVPLNSQDDVANFMSTLAGNV--------MGVVQYNNEGRGGPTITDVCATMLSN 309
Query: 324 ----AGVVEADSLEYDGNNSRCY----------INEDRTG-DESDEGWEWQSCSEMVVPM 368
V + L N C + + G D W WQ+C E
Sbjct: 310 SDPLQAYVNLNQLFLASGNVTCLDVAYKPMIQQLQDTAPGADVGGRSWLWQTCQEFGFYQ 369
Query: 369 GKDKN-SMYQPEPWNLTKYIKNCKEQYGVS-------PRPSWVLTYYGGHDIKLILRRST 420
D S + L+ ++ C + +G + R +W YYG D S
Sbjct: 370 TTDSTFSHVFGNLFPLSFSLQMCNDVFGTNYQQADFQKRLNWTNDYYGSLDF------SG 423
Query: 421 SNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGS-HCLDLDEAKKSDPDWLVQQR 477
SN F +G DP+ G Y S L G+ HC ++ SDP LV R
Sbjct: 424 SNTFFIHGSIDPWHALG---IYQITSPVNSATLITGTAHCANMRMPLPSDPPQLVAAR 478
>gi|307171940|gb|EFN63566.1| Putative serine protease K12H4.7 [Camponotus floridanus]
Length = 415
Score = 140 bits (354), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 127/437 (29%), Positives = 197/437 (45%), Gaps = 47/437 (10%)
Query: 76 RYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVP 135
RY NS F NS IL +G E I + G YE ++ AL+ EHRYYGKS P
Sbjct: 2 RYQENSAFLKK-NSSILIMIGGEWEISNGFLQGGLMYEIGSKYNALMYYTEHRYYGKSKP 60
Query: 136 FGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFR 195
++ + +N + Y N QALAD A + K + N ++ VI G SY G +A W R
Sbjct: 61 --TKDTSTENLQ---YLNVDQALADLAYFIETKKKEKNLENSTVIVFGGSYAGNMAAWAR 115
Query: 196 LKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIAS 255
LKYPH++ G+LASSAP+ D + Y+ VV+K +E+C + +K ++ I+ + +
Sbjct: 116 LKYPHLIQGALASSAPVYAKAD---FYEYYEVVTKSLGKYNEKCVEDVKIAFDSIEELLT 172
Query: 256 KPDGLAILSKKFKTCTP--LKNTTE---LKDGLDTVYSEAAQYDTPSN--IPVKRICNAI 308
G L F C +K++++ L + L V++ QYD N + +CN I
Sbjct: 173 AEGGPDKLKLYFNLCNVPNVKSSSDLGYLMNTLAEVFAATVQYDNIENGQTKIAALCN-I 231
Query: 309 ENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDR-------TGDESDEGWEWQSC 361
A + G + + A +V + + NS +I + R + W Q+C
Sbjct: 232 MTAAHLGSPL--QRLAHIVSKPDMCIENYNS--FIEKYRNISWDSAAAQDIMRQWYHQTC 287
Query: 362 SEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGH--DI 412
SE N+ + L YI C + YG ++ R YGG DI
Sbjct: 288 SEYGYYQTTSANNSIFGTLFPLNYYIDMCTDLYGDYSNDKILNSRVRRTNIMYGGQLPDI 347
Query: 413 KLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDW 472
+N+IF+NG DP+ +K + F + + SHC D+ +D +
Sbjct: 348 --------TNVIFTNGDVDPWHPLSVLKDLNAFSPAIV--IKGSSHCRDIYSDVDTDLED 397
Query: 473 LVQQRKTEVKIMQGWIT 489
L + R I+ WI+
Sbjct: 398 LKKARARIRDIISKWIS 414
>gi|330840912|ref|XP_003292451.1| hypothetical protein DICPUDRAFT_83068 [Dictyostelium purpureum]
gi|325077291|gb|EGC31013.1| hypothetical protein DICPUDRAFT_83068 [Dictyostelium purpureum]
Length = 457
Score = 140 bits (353), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 121/469 (25%), Positives = 221/469 (47%), Gaps = 59/469 (12%)
Query: 43 EQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS-PILAFLGAEAPI 101
EQ + + F+++QT+DHF E+ ++F QRY IN +F+ N PI+ ++ E P+
Sbjct: 26 EQVSNISPYTPFWFSQTLDHFD--DENTESFSQRYFINDQFYDYTNGGPIILYINGEGPV 83
Query: 102 D-------DNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNS 154
D + L G A+IV LEHRYYG+S PF + +N + Y +S
Sbjct: 84 SSAPCQTGDGVVLYG------QALNAMIVTLEHRYYGESTPF--QDLTTENLK---YLSS 132
Query: 155 AQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILY 214
QAL D A ++ + + + ++ IG SY G L+ WFR+KYPH+ GS+ASS +
Sbjct: 133 EQALNDLAIFVVWFQSQL-SNAGKIVTIGGSYSGALSAWFRIKYPHITSGSIASSGVV-- 189
Query: 215 YGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK 274
I +TT+ V+ + + + K+ E I + + + F+ L
Sbjct: 190 -NAILQFTTFDEYVAYAAGEDCSNALRLVTKAVEE--QILAGGSAEQKVKQIFQA-ESLT 245
Query: 275 NTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVE--ADSL 332
+ + L +E QY S ++C+ + A N D++ + + SL
Sbjct: 246 DNGDFFYWLADSMAEGIQYGFHS-----QLCSPLIEAMNNNGDMISTYSNYTINVWGQSL 300
Query: 333 EYDGNNSRCYINEDRTGD--ESDEGWEWQSCSEMV----VPM-GKDKNSMYQPEPWNLTK 385
S + ++ T D ++D W +Q+C+ + P+ G ++SM N+T
Sbjct: 301 GTPEEYSTVW-QQNTTADPAKADRQWWFQTCAALGYFQDAPLSGSIRSSMV-----NMTY 354
Query: 386 YIKNCKEQYGVS--PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
+ +C++ +G+ P + V +YGG++ + +NI+++NG +DP+SR ++T
Sbjct: 355 FKTHCQQVFGIPLWPNTAAVNIHYGGNN------TAGTNILYTNGSQDPWSRASVIQTIS 408
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKK---SDPDWLVQQRKTEVKIMQGWIT 489
S HC+D+ + + P+ + Q R +K+++ W++
Sbjct: 409 DSQQSVMVTCENCGHCVDIRASCPGGCAQPNNIAQVRALSIKLLESWLS 457
>gi|355561421|gb|EHH18053.1| Thymus-specific serine protease [Macaca mulatta]
Length = 514
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 126/471 (26%), Positives = 197/471 (41%), Gaps = 66/471 (14%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W G + PI LG E + + G
Sbjct: 61 WLEQLLDPFNV--SDRRSFLQRYWVNEQHWVGEDGPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L+ A+ R + +S ALAD S L + +N
Sbjct: 119 APAWGALVISLEHRFYGLSIPAG----GLEMAQLR-FLSSRHALADVVSARLALSRLFNV 173
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LA W RLK+PH++ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 174 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD---FSEYNDVVSRSLM 230
Query: 234 DTS-----EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDT 285
T+ E + A L + C L +N EL L
Sbjct: 231 STAIGGSLEXXXXXXXXXXXXXXXLRLGGAAQAALRSELSACGTLGSAENQAELLGSLQA 290
Query: 286 VYSEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYD 335
+ A QYD + P V+++C N + P CG +I + L +
Sbjct: 291 LVGGAVQYDGQAGAPLSVRQLCGLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQKCLSFS 350
Query: 336 GNNSRCYI--NEDRTGDESDEGWEWQSCSE--MVVPMGKDKNSMYQPEPWNLTKYIKNCK 391
+ + E + D W +Q+C+E V G + Q L + C+
Sbjct: 351 RAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCGNPRCPFSQLP--ALPSQLDLCE 408
Query: 392 EQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
+ +G+S + +YYGG + ++F NG DP +H
Sbjct: 409 KVFGLSALSVAQAVAQTNSYYGGQT------PGANQVLFVNGDTDP---------WHV-- 451
Query: 447 LSFSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
LS +Q L GSHCLD+ + SD L R+ + +Q W+
Sbjct: 452 LSVTQALGSSESALLIRTGSHCLDMAPERPSDSPSLRLGRQNIFQQLQTWL 502
>gi|355762458|gb|EHH61967.1| Thymus-specific serine protease [Macaca fascicularis]
Length = 514
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 126/471 (26%), Positives = 196/471 (41%), Gaps = 66/471 (14%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W G + PI LG E + + G
Sbjct: 61 WLEQLLDPFNV--SDRRSFLQRYWVNEQHWVGEDGPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L+ A+ R + +S ALAD S L + +N
Sbjct: 119 APAWGALVISLEHRFYGLSIPAG----GLEMAQLR-FLSSRHALADVVSARLALSRLFNI 173
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LA W RLK+PH++ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 174 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD---FSEYNDVVSRSLM 230
Query: 234 DTS-----EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDT 285
T+ E A L + C L +N EL L
Sbjct: 231 STAIGGSLEXXXXXXXXXXXXXXXXXXXXXXQAALRSELSACGTLGSAENQAELLGALQA 290
Query: 286 VYSEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYD 335
+ A QYD + P V+++C N + P CG +I + L +
Sbjct: 291 LVGGAVQYDGQAGAPLSVRQLCGLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQKCLSFS 350
Query: 336 GNNSRCYI--NEDRTGDESDEGWEWQSCSE--MVVPMGKDKNSMYQPEPWNLTKYIKNCK 391
+ + E + D W +Q+C+E V G + Q L + C+
Sbjct: 351 RAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCGNPRCPFSQLP--ALPSQLDLCE 408
Query: 392 EQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
+ +G+S + +YYGG + ++F NG DP +H
Sbjct: 409 KVFGLSALSVAQAVAQTNSYYGGQT------PGANQVLFVNGDTDP---------WHV-- 451
Query: 447 LSFSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
LS +Q L GSHCLD+ + SD L R+ + +Q W+
Sbjct: 452 LSVTQALGSSESALLIRTGSHCLDMAPERPSDSPSLRLGRQNIFQQLQTWL 502
>gi|145522514|ref|XP_001447101.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414601|emb|CAK79704.1| unnamed protein product [Paramecium tetraurelia]
Length = 461
Score = 139 bits (351), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 131/452 (28%), Positives = 204/452 (45%), Gaps = 43/452 (9%)
Query: 53 TFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP--ILAFLGAEAPIDDNIQLSGF 110
T ++ Q +DH P S + F QRY + + N P ++ ++ E D SG
Sbjct: 35 TLWFTQKLDH--NDPTSKEVFRQRYHVYDDY-VVRNQPESVILYICGEWTCDGI--GSGL 89
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
T++ A Q KAL+++LEHRY+G+S PFG + Y N QAL D A + +K
Sbjct: 90 TFDAAQQLKALVLVLEHRYFGQSQPFGDWSTP-----NLKYLNIHQALDDIAYFIQDVKA 144
Query: 171 K--YNAT-HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
K +N + P I +G SY G L+ WFR KYPH+ IG LASSA + + + Y
Sbjct: 145 KGLFNIKPNTPWIHLGGSYPGALSAWFRYKYPHLTIGGLASSAVV---KAVACYHDYDMQ 201
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
V ++S+EC I++ +I+ K K T ++ + + D +Y
Sbjct: 202 VYLSALESSQECVDRIQQVNEKIEADLIKSPNTIKAEFKASELTDIEFLSMIAD----IY 257
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDR 347
+ Q S ++C +E DD ++ +E E G+ I+ D
Sbjct: 258 AGMVQGRKRS-----KMCERLEGGATL-DDWFKQVKEMALETVDQESYGSEFLKDISIDF 311
Query: 348 TGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLT-KYIKN-CKEQYGVSPRPSWVLT 405
+ ++ W +Q+C E+ + N L + +N C+ YG+S P T
Sbjct: 312 S--KNSRQWTYQTCIEVGYFQTANPNVEQSTRSQQLKLDFFRNLCEYSYGISIFPDEERT 369
Query: 406 --YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH--FFDLSFSQDLNLGSHCLD 461
Y+GG DI + ++IFSNG DP+ K +D+ + + + SHC+D
Sbjct: 370 NAYFGGLDINV------DHLIFSNGSDDPWQHASITKWKQGKEYDVKYIKCKDC-SHCID 422
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
L K DP L Q RK + I Q WI +Y D
Sbjct: 423 LKATKADDPPELTQARKEILAIFQQWINEYND 454
>gi|322795209|gb|EFZ18031.1| hypothetical protein SINV_09833 [Solenopsis invicta]
Length = 584
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 140/510 (27%), Positives = 218/510 (42%), Gaps = 58/510 (11%)
Query: 11 LLLMVIFVSTSFH--ANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPE 68
++L V+FV S H NG+ R R KD+ + Q +DHF++
Sbjct: 2 IILKVLFVLCSIHPLVNGVGFRGFTFRGLEDPKPFTKDAGKDIVEGWITQPLDHFNH--R 59
Query: 69 SYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHR 128
+T+ RY NS F G PIL +G E I D G YE ++ L+ EHR
Sbjct: 60 DNRTWSMRYKENSLFLKNG-GPILIMIGGEWEITDGYLQGGLMYEIGVKYGGLMYYTEHR 118
Query: 129 YYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGG 188
+YG+S P ++ + +N + Y N+ QALAD A + K + N + VI +G SY G
Sbjct: 119 FYGQSKP--TKDISSENLQ---YLNADQALADLAYFIETKKKEKNLEKSTVIVVGGSYAG 173
Query: 189 ELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWS 248
+A W RLKYPH++ G+LASSAP+ D + Y+ VV+K SE+C + +K +++
Sbjct: 174 NMAAWARLKYPHLIQGALASSAPVQAKAD---FYEYYEVVTKSLGRHSEKCVENVKIAFA 230
Query: 249 EIDNIASKPDGLAILSKKFKTC-TPLKNTTE----LKDGLDTVYSEAAQYDT--PSNIPV 301
++ + +K G L F C P N+ + L +++ QYD +
Sbjct: 231 SVEELLAKQSGAIELKYLFNLCEVPDINSASDLGYFMNMLSEIFAGIVQYDKIEKGETNI 290
Query: 302 KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC----YINEDRT--------- 348
+C+ + + G L ++A L Y + +C Y N +T
Sbjct: 291 ATLCHNM-TGEHLGSP-LQRLAHVF-----LMYQKDQHKCVDVSYNNFVKTYRNVSWDSL 343
Query: 349 -GDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG---VSPRPSWVL 404
W Q+C+E + + + L Y+ C + + R
Sbjct: 344 AATSIMRQWYHQTCTEYGYFQTTNSDKSIFGTLFPLDYYVNLCIDFNNGKWLDSRVKRTN 403
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDL-SFSQD--LNLGSHCLD 461
YGG L N+IF+NG DP W DL +FS +N SHC D
Sbjct: 404 IMYGGQLPDL------RNVIFTNGDIDP-----WHSLSVLHDLNAFSPAIFINGSSHCRD 452
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
+ +DPD L + R I+ WI+ +
Sbjct: 453 MLSDVATDPDDLKKARAKIRSIIGKWISSH 482
>gi|444521185|gb|ELV13126.1| Dipeptidyl peptidase 2 [Tupaia chinensis]
Length = 471
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 124/457 (27%), Positives = 208/457 (45%), Gaps = 53/457 (11%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
F +++Q +DHF++ +TF QR +++ +FW G PI + G E I SGF
Sbjct: 43 FRERYFDQLLDHFNFERFGNRTFRQRVLVSDRFWRRGEGPIFFYTGNEGDIWTFANNSGF 102
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
E A Q AL+V EH RG AQ L +
Sbjct: 103 LAELAAQQAALLVFAEH--------------------LRG---DAQRLPEDEVPPPGGGG 139
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ V + SYGG L+ + R+KYPH+V G+LA+SAP++ + + V+
Sbjct: 140 ADSKKGPAVTRVTPSYGGMLSAYLRMKYPHLVAGALAASAPVIAVAGLGDSYQFFRDVTT 199
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVY 287
DF S EC Q +++++ ++ ++ + +S++F TC PL ++ T+L +
Sbjct: 200 DFTSQSPECAQGVREAFRQMKDLFLQ-GAHETVSREFGTCQPLSGPRDLTQLFAFARNAF 258
Query: 288 SEAAQYDTPSNI---------PVKRICNAI--ENAPNCGDDILCKIAAGVVEADSLEYDG 336
+ A D P PVK C + E P G L + V + E
Sbjct: 259 TVLAMMDYPYPTSFMGHFPAHPVKVGCARLLREAQPIPGLRALTGL---VYNSSGTEPCY 315
Query: 337 NNSRCYIN-EDRTG---DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCK 391
+ R Y + D TG + W++Q+C+E+ + + M+ P+ + + C
Sbjct: 316 DIYRQYRSCSDPTGCGTGPDAKAWDYQACTEINLTFSSNNVTDMFPALPFTDLQRRQYCL 375
Query: 392 EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+ +GV PR W+ T + G D+K + SNIIFSNG DP++ GG + ++ +
Sbjct: 376 DTWGVWPRHDWLRTSFWGGDLK-----AASNIIFSNGDLDPWAGGGIRRNVSASVIAVT- 429
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ G+H LDL ++ DP +VQ R+ E +++ W+
Sbjct: 430 -VRGGAHHLDLRASQPEDPASVVQVRQLEATLVREWV 465
>gi|291241533|ref|XP_002740665.1| PREDICTED: Prolyl Carboxy Peptidase like family member (pcp-2)-like
[Saccoglossus kowalevskii]
Length = 500
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 142/518 (27%), Positives = 236/518 (45%), Gaps = 71/518 (13%)
Query: 10 WLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPES 69
+L ++ S + NG RPR G + +L ++ E + +Q +DH Y
Sbjct: 14 FLTVLPSVFSLPYFMNG---RPRGGMVGVP-VLSERPHTEPQEQWI-SQRLDH--YNDAD 66
Query: 70 YQTFPQRYVIN-SKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY-ENAHQFKALIVILEH 127
+T+ QRY I+ S + GG P+ +G E P++ ++ T+ + A ++ AL +++EH
Sbjct: 67 LRTWQQRYYIDDSHYIAGG--PVFLNIGGEGPLNSKWLMAETTWIQYAMKYGALCLLVEH 124
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYG 187
RYYGKS P + Y +S QALAD A +I +K N T+ IA G SY
Sbjct: 125 RYYGKSHP-----TVDVSTDSLQYLSSEQALADLAYFRNYIGEKLNITNNKWIAFGGSYS 179
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT--SEECFQTIKK 245
G LA WFR+KYPH+V G++A+SAP+L + +T Y VV + E C + I+
Sbjct: 180 GNLAAWFRIKYPHLVDGAVATSAPVLAKLN---FTEYLEVVRDSLASSKAGEACNKNIQA 236
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG------LDTVYSEAAQYD----- 294
+ ++ +G +L F+ C P+ N+TELKD + + QY+
Sbjct: 237 AVIDMQKKLQTTEGEKLLQNIFQVCGPI-NSTELKDVQNFHSLVSGNFEGVVQYNRDNRE 295
Query: 295 ----TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADS-------LEYDGNNSRCYI 343
+NI + +C+ + + + GD + A + + + YD
Sbjct: 296 FEGAVGTNITLDTLCDIMVDE-SIGDPLHRYAAVNTLMLQTYQTKCLDISYDNMIQEMRQ 354
Query: 344 NE-DRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN--LTKYIKNCKEQYGVSPRP 400
N + + E + W +Q+C+E D ++ QP N L+ ++ C++ YG
Sbjct: 355 NSWNSSAAEGGKQWVYQTCTEFGYYQTSD--AINQPFGHNFPLSFSLQQCQDIYGKQFNQ 412
Query: 401 SWVL-------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD- 452
+ + T YGG +K T+N++F NG DP+ G + D+S S
Sbjct: 413 TTLTAGIKSTNTNYGGLGLK------TNNVVFPNGSIDPWHALGITQ-----DVSQSVTA 461
Query: 453 --LNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ +HC ++ K D L Q RKT ++ WI
Sbjct: 462 IYIKGTAHCANMYPEKADDLPQLKQARKTIEILIGKWI 499
>gi|123703648|ref|NP_001074031.1| uncharacterized protein LOC556307 precursor [Danio rerio]
gi|120538664|gb|AAI29321.1| Zgc:158605 [Danio rerio]
Length = 488
Score = 139 bits (350), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 132/486 (27%), Positives = 218/486 (44%), Gaps = 59/486 (12%)
Query: 37 RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLG 96
+ SR + G E ++ Q +DHF+ G +S + + QRY +N F+ G P+ +G
Sbjct: 30 KSSRYAARDQLKSGVEDQWFIQRLDHFN-GADS-RVWKQRYFVNDSFYRVGG-PVFLMIG 86
Query: 97 AEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQ 156
E P + G A + AL ++LEHR+YGKS P + + +N R + +S Q
Sbjct: 87 GEGPANPAWMQYGTWLTYAQKLGALCLLLEHRFYGKSHP--TEDLSTENLR---FLSSRQ 141
Query: 157 ALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYG 216
ALAD A T++ +A G SY G LA WFRLKYPH+V S+A+SAP+ +
Sbjct: 142 ALADLAHFRTVTAAARGLTNSKWVAFGGSYPGSLAAWFRLKYPHLVHASVATSAPV--HA 199
Query: 217 DITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK-- 274
+ + Y VV + + EC +KK+ + S P ++K F+ C+ L+
Sbjct: 200 SVN-FPEYLEVVWRSLAAENPECPLLVKKASDTLLERLSDPKTYDNITKDFRLCSKLQIQ 258
Query: 275 ---NTTELKDGLDTVYSEAAQYD---------TPSNIPVKRICNAIENAPNCGD--DILC 320
++ L + L + + QY+ +NI +K +C + ++ + GD D
Sbjct: 259 SKMDSAYLLESLAGNFMDVVQYNEDNRAFEGAVGTNITIKVLCGVMLDS-SLGDPYDRYA 317
Query: 321 KIAAGVVEADSLEYDGNNSRCYI----NEDRTGDESDEG--WEWQSCSEMVVPMGKDKNS 374
+A + + S + +I N +G E+ G W +Q+C+E D S
Sbjct: 318 AVARLMQKTFSQSCINTQYKSFIQDISNSSWSGPEAGGGRQWVYQTCTEFGFYQSTD--S 375
Query: 375 MYQP-EPWNLTKYIKNCKEQYGVSPRPSWVLTY----YGGHDIKLILRRSTSNIIFSNGM 429
QP + L +++ C + Y +S + YGG+DIK ++ I+F NG
Sbjct: 376 PNQPFSGFPLGYHLQQCADIYNLSTSLDEAIQQTNEEYGGYDIK------STRIVFPNGS 429
Query: 430 RDPFSRGGWVKTYHFFDLSFSQDLNL-----GSHCLDLDEAKKSDPDWLVQQRKTEVKIM 484
DP+ G K S DL +HC ++ A+ D L R ++
Sbjct: 430 IDPWHALGVTK-------DISGDLPAVFIKGTAHCANMYPARAEDLPQLGLARDRIFILL 482
Query: 485 QGWITQ 490
Q W+ +
Sbjct: 483 QKWLAE 488
>gi|307200054|gb|EFN80400.1| Putative serine protease K12H4.7 [Harpegnathos saltator]
Length = 495
Score = 138 bits (348), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 125/458 (27%), Positives = 195/458 (42%), Gaps = 51/458 (11%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF+ P +T+ RY NS N PIL +G E I G YE A
Sbjct: 50 QPLDHFN--PRENRTWSMRYYENSALLRA-NGPILITIGGEWTISTGFLQGGLMYEIASV 106
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
++ EHR+YGKS P ++ + N R Y + QALAD A+ + K + N ++
Sbjct: 107 HGGMMYYTEHRFYGKSRP--TKDTSASNLR---YLSVDQALADLANFIETKKKEKNLENS 161
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
PVI G SY G +ATW RLKYPH++ G+LASSAPI D + Y+ VV++ S
Sbjct: 162 PVIVFGGSYAGNMATWARLKYPHLIQGALASSAPIYAKAD---FYEYYEVVTRSLGRHSA 218
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT--PLKNTTE---LKDGLDTVYSEAAQ 292
+C +K ++ ++ + + G L F C +K+ ++ L + L V++E Q
Sbjct: 219 QCVADVKTAFESVEELLAAQGGPEKLKVYFDLCNVPDVKSPSDLGSLMNSLAEVFAEIVQ 278
Query: 293 YDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY---------- 342
YD N K E L ++A + +D S C+
Sbjct: 279 YDKVENGRTKIAALCAEMTATHLGSPLQRLARVIANSDP------GSACFDMSYKNVIKK 332
Query: 343 ---INEDRTGDESD-EGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG--- 395
I+ D S W Q+C+E + + L+ + C + YG
Sbjct: 333 YRDISWDSPAAASAMRQWYHQTCTEYGYYQTTSSDKSIFGTLFPLSYFTDMCIDLYGDYY 392
Query: 396 ----VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+ R YGG R +N+IF+NG DP+ ++ + + +
Sbjct: 393 NEKLLDSRVKRTNMMYGGQ------RPDLTNVIFTNGDIDPWHALSVLEDLNAYAPAIL- 445
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
+N SHC DL +D + L + R I+ W++
Sbjct: 446 -INGSSHCRDLYSDADTDVEDLKKARAKVRSIIGKWLS 482
>gi|344292452|ref|XP_003417941.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 482
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 132/496 (26%), Positives = 202/496 (40%), Gaps = 78/496 (15%)
Query: 37 RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLG 96
RR+R Q H T ++ Q +DHFS + +PQRY+IN F+ G L G
Sbjct: 23 RRTRGAYQTKGTH--PTRYFQQKLDHFS--KNCSRLWPQRYLINDAFYKRGGPVFLLIGG 78
Query: 97 AEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQ 156
E + I ++ A + AL ++LEHR+YG S P G A Y +S Q
Sbjct: 79 FETLSESWIAINKTWVTYAERLGALFLLLEHRFYGHSQPTGDLSTA-----SLQYLSSRQ 133
Query: 157 ALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYG 216
ALAD + I +K T + G SY G LA W R+K+P + ++ SSAPI
Sbjct: 134 ALADIVNFRTKIAEKMRLTKNKWVLFGCSYAGSLAVWSRIKHPDLFAAAVGSSAPIQAKA 193
Query: 217 DITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNT 276
+ + Y VV + + +CFQTIK+++ ++ + + L+ FK C P K
Sbjct: 194 N---FYEYLEVVQRSLATHNSKCFQTIKEAFDQVAKMMKLSVYYSKLANDFKLCKPAKTY 250
Query: 277 TELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGD-------DILCKIAAGV--- 326
+ + + A + PV+ N N D LC I A
Sbjct: 251 SAM---------DKAYFLERLIFPVEVAVQHNRNEKNHKGEQLSFTMDELCDIMANTSLG 301
Query: 327 ------VEADSLEYDGNNSRCYINEDRTGDES-------------DEGWEWQSCSEMVVP 367
V L + S C+ R ++ + +QSC+E
Sbjct: 302 SPYYRYVRIIHLIFKHKYSPCFAANYRQKLQTLLNSSINHHNPTKVRQYFYQSCTEFGFF 361
Query: 368 MGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT-------YYGGHDIKLILRRS 419
D KN + P L+ +++ C + +G + T YYGG ++ +
Sbjct: 362 FTTDSKNQPFTGLP--LSYFVQQCSDLFGPKFNNDSLNTGVMSTNAYYGGFNV------T 413
Query: 420 TSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL-----GSHCLDLDEAKKSDPDWLV 474
S IIF NG DP+ G K S+DL HC D+ + K +D L+
Sbjct: 414 GSKIIFPNGSFDPWHPLGITK-------DISKDLPAVFIKGAVHCADIYKQKDTDSAELI 466
Query: 475 QQRKTEVKIMQGWITQ 490
Q R+ +I+Q W+ Q
Sbjct: 467 QAREKIFRILQKWLKQ 482
>gi|148676287|gb|EDL08234.1| dipeptidylpeptidase 7, isoform CRA_a [Mus musculus]
Length = 329
Score = 138 bits (348), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 96/291 (32%), Positives = 146/291 (50%), Gaps = 27/291 (9%)
Query: 35 RIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAF 94
+ R R+L+ F ++ Q +DHF++ +TF QR++++ KFW G PI +
Sbjct: 30 QARADRVLDPD-----FHENYFEQYMDHFNFESFGNKTFGQRFLVSDKFWKMGEGPIFFY 84
Query: 95 LGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY--- 151
G E I SGF E A Q +AL+V EHRYYGKS+PFG + + RGY
Sbjct: 85 TGNEGDIWSFANNSGFMVELAAQQEALLVFAEHRYYGKSLPFGVQ------STQRGYTQL 138
Query: 152 FNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAP 211
QALAD+A +L ++ AP IA G SYGG L+ + R+KYPH+V G+LA+SAP
Sbjct: 139 LTVEQALADFAVLLQALRQDLGVHDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAP 198
Query: 212 ILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
++ + + V+ DF S +C Q ++ ++ +I ++ + +S+ F TC
Sbjct: 199 VVAVAGLGDSYQFFRDVTADFYGQSPKCAQAVRDAFQQIKDLFLQ-GAYDTISQNFGTCQ 257
Query: 272 PL---KNTTELKDGLDTVYSEAAQYDTPSNI---------PVKRICNAIEN 310
L K+ T+L ++ A D P PVK C + N
Sbjct: 258 SLSSPKDLTQLFGFARNAFTVLAMMDYPYPTDFLGPLPANPVKVGCQRLLN 308
>gi|440900328|gb|ELR51489.1| Thymus-specific serine protease [Bos grunniens mutus]
Length = 516
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 126/475 (26%), Positives = 206/475 (43%), Gaps = 70/475 (14%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W + P+ LG E + + G
Sbjct: 59 WLEQPLDPFNA--SDRRSFLQRYWVNDQHWTSQDGPVFLHLGGEGSLGPGSVMRGHPANL 116
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P L A+ R + +S ALAD AS L + +N
Sbjct: 117 APIWGALVISLEHRFYGLSIP----AEGLDMAQLR-FLSSRHALADAASARLTLSRLFNV 171
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LA W RLK+PH+ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 172 SSTSPWICFGGSYAGSLAAWARLKFPHLFFASIASSAPVRAILD---FSKYNDVVSRSLM 228
Query: 234 DT----SEECFQTIKKSWSEIDNIASKPDGL-AILSKKFKTCTPL---KNTTELKDGLDT 285
+T S EC + +++E++ G A LS + C L ++ EL L
Sbjct: 229 NTAIGGSLECREAASAAFAEVERRLRASRGAWATLSVELGACGSLERAEDQAELLGALQA 288
Query: 286 VYSEAAQYDTPSNIP--VKRICN-AIENAPNCGDDILCKIAA---GVVEADSLEYDGNNS 339
+ A QYD + P V+++C + + NC + C +A G+ A +
Sbjct: 289 LVGGAVQYDGQAGAPLSVRQLCRLLLGDRGNCKGN--CSGSAPYRGLRRAVQVVTHSLGQ 346
Query: 340 RCY------------INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYI 387
RC + E + D W +Q+C+E + + + L +
Sbjct: 347 RCLSIPRAETVAQLRVTELQVSSVGDRQWLYQTCTEFGYYVTCEVPGCPFSQLPALPSEL 406
Query: 388 KNCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
+ C++ +G+S + +YYGG + ++F NG DP +
Sbjct: 407 ELCEQVFGLSTSSVAQAVAQTNSYYGGQT------PGATQVLFINGDTDP---------W 451
Query: 443 HFFDLSFSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
H LS +Q L SHCLD+ + SD L R+ + +Q W+
Sbjct: 452 HV--LSVTQPLGSSEPALLIPSASHCLDMAPERPSDSPSLRLARQKVSQQLQTWL 504
>gi|330803162|ref|XP_003289578.1| hypothetical protein DICPUDRAFT_36006 [Dictyostelium purpureum]
gi|325080335|gb|EGC33895.1| hypothetical protein DICPUDRAFT_36006 [Dictyostelium purpureum]
Length = 485
Score = 138 bits (347), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 127/470 (27%), Positives = 213/470 (45%), Gaps = 50/470 (10%)
Query: 36 IRRSRILEQKDSNHGFETFFY---NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL 92
+RR ++ DS Y Q +DHF+ + TF QRY+IN +++ G P+
Sbjct: 31 LRRPLRTQKIDSGLDLSAINYQWFTQNVDHFNI--VNTDTFQQRYLINDQYYDG-TGPVF 87
Query: 93 AFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY 151
+ E P+ + ++G + A Q ALIV LEHRYYG S F + +L N + +
Sbjct: 88 IMINGEGPMGLDT-VTGLQFVVWAKQLNALIVSLEHRYYGAS--FVTSDLSLDNLQ---F 141
Query: 152 FNSAQALADYASILLHIKDKYN-ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
NS QALAD A I KYN + ++ G SY G L +WFR+KYPH+V ++ASS
Sbjct: 142 LNSQQALADNAVFREFIAQKYNIPSTTKWVSFGGSYSGALTSWFRIKYPHLVDITIASSG 201
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTS--EECFQTIKKSWSEIDNIASKPDGLAILSKKFK 268
P+ + Y VV + T+ EC Q I + ++ ++ + D + F
Sbjct: 202 PV---NPEVNFYQYLQVVQNSLQQTNGGAECVQNIAIATDKVQSLLQQ-DNYGGVETLFD 257
Query: 269 TCTPLKNTTELKDGLDTV---YSEAAQYDT--PSNIPVKRICNAIENAPNCGDDILCKI- 322
C+ L+N ++ + ++++ + QY+ P + + +C+ + N N D + I
Sbjct: 258 LCSQLENANDVANFMNSLAGNFMGVVQYNNEEPGQVNTQNLCDIMTN--NTQDPLTNYIQ 315
Query: 323 -----AAGVVEADSLEYDGNNSRCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDKNS 374
A G E + Y +S +++ T D + G W +Q+C+E D S
Sbjct: 316 LWNQFAGG--ECVDVSY---SSLVAESQNITNDATAIGGRMWMYQTCTEFGYYQSSDGAS 370
Query: 375 MYQP--EPWNLTKYIKNCKEQYGV---SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGM 429
QP + + ++ C + +GV +P +W LT YGG S + ++ NG+
Sbjct: 371 STQPFGDLFGFAFQLQQCADIFGVPNMAPNTNWTLTEYGGLSPA---PSSITTTLYVNGL 427
Query: 430 RDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKT 479
DP+ G + + S + +HC D+ P L ++T
Sbjct: 428 IDPWHALG-ITPVSVPSIKNSLLITGTAHCADMMIPTSVSPSTLAPAQQT 476
>gi|395736879|ref|XP_003776822.1| PREDICTED: thymus-specific serine protease isoform 2 [Pongo abelii]
Length = 541
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 127/491 (25%), Positives = 207/491 (42%), Gaps = 79/491 (16%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ + ++F QRY +N + W G + PI LG E + + G
Sbjct: 61 WLEQLLDPFNV--SNRRSFLQRYWVNDQHWVGQDGPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAA---LKNARH-RGYFNSAQA------------L 158
A + AL++ LEHR+YG S+P G + A ++RH G F+ + L
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMAQLRFLSSRHAMGKFSGIPSDEDRPSPPFDPRL 178
Query: 159 ADYASILLHIKDKYN-ATHAPVIAIGASYGGELATWFRLK------YPHVVIGSLASSAP 211
AD S L + +N ++ +P I G SY G LA W RLK +PH++ S+ASSAP
Sbjct: 179 ADVVSARLALSRLFNISSSSPWICFGGSYAGSLAAWARLKVLRLLRFPHLIFASVASSAP 238
Query: 212 ILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKKSWSEID-NIASKPDGLAILSKK 266
+ D ++ Y+ VVS+ T S EC + +++E++ + S A L +
Sbjct: 239 VRAVLD---FSEYNDVVSRSLMSTAIGGSLECRAAVSVAFAEVERRLRSGGAAQAALRTE 295
Query: 267 FKTCTPL---KNTTELKDGLDTVYSEAAQYDTPSNIP--VKRIC--------NAIENAPN 313
C PL +N EL L + QYD + P V+++C N + P
Sbjct: 296 LSACGPLGRAENQAELLGALQALVGGVVQYDGQAGAPLSVRQLCGLLLGGGGNRSHSTPY 355
Query: 314 CGDDILCKIAAGVVEADSLEYDGNNSRCYI--NEDRTGDESDEGWEWQSCSEMVVPMGKD 371
CG +I + L + + + E + D W +Q+C+E + +
Sbjct: 356 CGLRRAVQIVLHSLGQKCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCE 415
Query: 372 KNSMYQPEPWNLTKYIKNCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFS 426
+ L + C++ +G+S + +YYGG + ++F
Sbjct: 416 NPRCPFSQLPALPSQLDLCEQVFGLSALSVAQAVAQTNSYYGGQT------PGANQVLFV 469
Query: 427 NGMRDPFSRGGWVKTYHFFDLSFSQDLN---------LGSHCLDLDEAKKSDPDWLVQQR 477
NG DP +H LS +Q L GSHCLD+ + SD L R
Sbjct: 470 NGDTDP---------WHV--LSVTQALGSSESTLLIRTGSHCLDMAPERPSDSPSLRLGR 518
Query: 478 KTEVKIMQGWI 488
+ + +Q W+
Sbjct: 519 QNIFQQLQTWL 529
>gi|237700855|gb|ACR16009.1| carboxypeptidase 3 [Mamestra configurata]
Length = 484
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 117/448 (26%), Positives = 195/448 (43%), Gaps = 36/448 (8%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF P++ F R++ N +F+GG SPI +G E ID L+G +E A + K
Sbjct: 55 LDHFD--PQNPTEFLMRFMFNEQFFGGDGSPIFIMVGGEWDIDHRWLLAGNMFEMARENK 112
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV 179
V EHRYYG + F + A +N R + N QALAD A + +K + + V
Sbjct: 113 GYQVYTEHRYYGGTKIFANFTA--ENLR---FLNIDQALADLAYFITEMKKQPRFAESEV 167
Query: 180 IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR-DTSEE 238
+ G SY + WF+ +YPH+V+G++ASS PIL D + Y VV + F + EE
Sbjct: 168 VLYGGSYAANMVMWFKKRYPHLVVGTVASSGPILAKVD---FPEYLEVVHEAFMLEGGEE 224
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG-----LDTVYSEAAQY 293
C I++ E G +L + ++ C PL E + G + +S + Q
Sbjct: 225 CIGHIRRGVEETIAAMQTESGRRLLEQSYRLCAPLDYDNENELGVFAGLISWTFSTSVQQ 284
Query: 294 DTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-INEDRTGDES 352
P + + +C + + + G + K + L +++ C+ IN D
Sbjct: 285 ARPGTL--QAVCQSFTDHTSYGSTPMEKFGGYIAIIRRL----SDTSCWSINYDSFLTSY 338
Query: 353 DE-----GWEWQSCSEM-VVPMGKDKNSMYQPEPW-NLTKYIKNCKEQYGVSPRPSWVLT 405
+E W +Q+C+E + + W +L Y+ CK + R
Sbjct: 339 NETTNSRAWYYQTCTEYGFYQTAPRSGTAFDGLKWLSLDFYVDICKRIF--DERFDLAFV 396
Query: 406 YYGGHDIKLI---LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL 462
G + LI L +N I +G DP+ G K + S + +N SHC D+
Sbjct: 397 EDGAERVNLIFGGLEPVVNNTINIHGYIDPWRALGVYKN-DISETSPTYTVNRASHCFDM 455
Query: 463 DEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+SD + ++ + + W+++
Sbjct: 456 QGWLQSDTIEMTAVQQRARRTVASWLSK 483
>gi|350406141|ref|XP_003487670.1| PREDICTED: putative serine protease K12H4.7-like [Bombus impatiens]
Length = 493
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 126/469 (26%), Positives = 213/469 (45%), Gaps = 61/469 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
++ Q +DHF+ P + + QRY +N +++ G P+ + EA + + G E
Sbjct: 49 WFTQFLDHFN--PTDARVWQQRYFVNGEYYKKG-GPVFLMISGEAAANAKWMVEGQWIEY 105
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI--LLHIKDKY 172
A QF AL +EHR+YG+S P + +KN Y +S QALAD A L++I K
Sbjct: 106 AKQFGALCFQVEHRFYGQSHP--TSDLGVKNLM---YLSSQQALADLAYFIQLMNINYKL 160
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
A IA G SY G LA W R KYPH+V G++++S P+L D + Y VV
Sbjct: 161 PAG-TKWIAFGGSYAGSLAAWLRYKYPHLVHGAVSASGPLLAEID---FQEYFVVVENAL 216
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE------LKDGLDTV 286
++ SE C I ++ + + P G ++KKF C P+ T+ L + + ++
Sbjct: 217 KEHSEACVNAILEANKQFHIMLHHPIGQQGIAKKFILCDPINEHTKRNDISNLYETIASI 276
Query: 287 YSEAAQYD-------TPSNIPVKRICNAIENAP-NCGDDILCKIAAGVVEADS---LEYD 335
++ QY+ +N+ + C+ + N D L ++ +++A L+Y
Sbjct: 277 FAGIVQYNKDNRNNSAMANLTIDSACDILTNETLGIAIDRLAILSTKILQASEEKCLDYM 336
Query: 336 GNNSRCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQP----EPWNLTKYIK 388
N + E EG W +Q+C+E G + S +P E + + +++
Sbjct: 337 YNKMIHKLRNITWASEEAEGGRQWTYQTCTE----FGFFQTSTARPKLFSETFPVDFFVQ 392
Query: 389 NCKEQYGVSPRPSWVL---------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWV 439
C + +G PR + L YGG D+K T+N++F +G DP+ G
Sbjct: 393 QCIDIFG--PRYNIHLLNSAVNRTNILYGGLDLK------TTNVVFVHGSIDPWHVLGIT 444
Query: 440 KTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
K+ + + D +HC ++ K+DP L R ++ W+
Sbjct: 445 KSPNPQMPAIYIDGT--AHCANMYPPSKNDPLQLKAARVEVGHLIDEWL 491
>gi|334328701|ref|XP_001372314.2| PREDICTED: thymus-specific serine protease-like [Monodelphis
domestica]
Length = 503
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 120/446 (26%), Positives = 203/446 (45%), Gaps = 46/446 (10%)
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
++F QRY +N W +P+ +G E + + G A + AL++ LEHR+Y
Sbjct: 67 RSFRQRYWVNVGHWRPPRAPVFLHIGGEGSLGPSSVWKGHPGTLAASWGALVISLEHRFY 126
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAPVIAIGASYGGE 189
G+S+P L A+ R + +S ALAD AS L + YN + +P IA G SY G
Sbjct: 127 GQSIP----PRGLDGAQLR-FLSSRHALADVASARLRLSGIYNISASSPWIAFGGSYAGS 181
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKK 245
LA W RLKYPH++ ++ASSAP+ D + Y+ VVS+ D S +C + + +
Sbjct: 182 LAAWARLKYPHLIWAAVASSAPVQAQLDF---SGYNWVVSRSLADPQVGGSPKCQRAVAQ 238
Query: 246 SWSEIDN-IASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVYSEAAQYDTPSNIP- 300
++SE+D+ ++ + A++ + + C L ++ EL + L+ + QYD P
Sbjct: 239 AFSELDDRLSEGGETRAVVQAEVRACGSLEAPEDQAELLEQLEGLVEGVVQYDQQVGAPL 298
Query: 301 -VKRICNAI----ENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINE--DRTGDESD 353
V+ +C+ + P G ++ V++ L ++ + E D +
Sbjct: 299 DVRGLCHLVLANQSRGPLSGLQDAIQL---VLQTLGLPCLPSSKAAALAELKDTNPQAAS 355
Query: 354 EG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP-----SWVLT 405
G W +Q+C+E + S L+ ++ C + +G+SP +W T
Sbjct: 356 LGYRQWFYQTCTEFGYYITCKDPSCPFSRRKTLSDQLQLCAQVFGLSPTSVAQAVNWTNT 415
Query: 406 YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG-SHCLDLDE 464
YYGG + + F NG DP+ + S L G SHC D+
Sbjct: 416 YYGGWS------PGATRVFFVNGDIDPWHV---LSVLQALGPSEPAMLMRGTSHCSDMAP 466
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWITQ 490
+ SDP L R+ V+ ++ W+ +
Sbjct: 467 PQPSDPPSLYLGRQRIVQQLKIWLQE 492
>gi|413943524|gb|AFW76173.1| putative serine peptidase S28 family protein, partial [Zea mays]
Length = 266
Score = 137 bits (346), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 74/177 (41%), Positives = 105/177 (59%), Gaps = 10/177 (5%)
Query: 47 SNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW--------GGGNSPILAFLGAE 98
S F ++ Q +DHF++ P + F +Y++N FW G P+ + G E
Sbjct: 80 STKPFTAHYFPQLLDHFAFTPNASTVFRHKYLLNDTFWRRPGAGAGDDGPGPLFVYTGNE 139
Query: 99 APIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL 158
I+ +GF ++ A F AL+V +EHR+YG+S PFG+ + ++A GY S QAL
Sbjct: 140 GDIEWFATNTGFMFDIAPTFGALLVFIEHRFYGESKPFGND--SYRSAETLGYLTSTQAL 197
Query: 159 ADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYY 215
AD+A ++ +K A APV+ G SYGG LA+WFRLKYPHV IG+LASSAPIL +
Sbjct: 198 ADFAVVIRGLKRDLGAEAAPVVVFGGSYGGMLASWFRLKYPHVAIGALASSAPILQF 254
>gi|326674064|ref|XP_002664605.2| PREDICTED: thymus-specific serine protease-like [Danio rerio]
Length = 581
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 111/448 (24%), Positives = 201/448 (44%), Gaps = 46/448 (10%)
Query: 57 NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAH 116
+Q +DHF ++ +T PQ Y +N +W + P+ ++G E P+ L G E A
Sbjct: 157 HQPVDHFDR--QNDKTLPQTYFVNDVYWQRSDGPVFLYIGGEGPLSKFSVLFGHHVEMAE 214
Query: 117 QFKALIVILEHRYYGKSV-PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT 175
+ AL+V LEHR+YG+S+ P G L++ +S QALAD A+ +I +++ +
Sbjct: 215 RHGALLVALEHRFYGESINPDGLETDKLRD------LSSQQALADLAAFHHYISQRFSLS 268
Query: 176 HAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
H I+ G SY G L+ W R K+PH++ G++ASSAP+ D + +
Sbjct: 269 HRNTWISFGGSYAGALSAWLRGKFPHLIYGAVASSAPVYAVLDFSSYN------------ 316
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC-TPLK--NTTELKDGLDTVYSEAA 291
C +K +++ ++ + + K+F C TPLK + TEL L V+
Sbjct: 317 ---RCVAEVKGAFAAVEAALLMGNETEV-GKEFGCCETPLKLEDKTELLHSLADVFMGTV 372
Query: 292 QYDTPS-NIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLE----YDGNNSRCYIN-E 345
QY+ + +C+ + N + ++ ++ V+ + E D ++ + ++
Sbjct: 373 QYNEQGVAFSIAELCDIMTNKSDPREEAYDRLVKLVMMYRARENLPCLDVSHEKLFLELN 432
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT 405
+ T S W +Q+C+E + +S + L + C + +S V
Sbjct: 433 NTTATSSYRQWFYQTCTEFGFYQTCEDDSCPFSRRFTLQSQTELCSRLFNISQDSLLVSI 492
Query: 406 -----YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL 460
YYGG+ + T +++ NG DP++ V D +N +HC+
Sbjct: 493 DFTNQYYGGN------QPQTQRVLYVNGNIDPWAALSVVWNETMADNDRVIFINGTAHCM 546
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
D++ K + L Q RK + + W+
Sbjct: 547 DMNSEKSVNKPALHQARKRIEERVTTWL 574
>gi|118346661|ref|XP_976865.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89288576|gb|EAR86564.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 480
Score = 137 bits (344), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 122/459 (26%), Positives = 198/459 (43%), Gaps = 38/459 (8%)
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYE 113
++Y Q +DHF+ P +T+ QRY I S + N + ++G E G+ E
Sbjct: 41 YYYTQVLDHFN--PNDQRTWQQRYAIYSDEYNPVNGTVFVYIGGEGKQKGLSPGLGWMVE 98
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD--- 170
A +F AL +I+EHR+YG S PFG + + N ++ Y + QAL D A I+ + K
Sbjct: 99 LAKKFSALFLIVEHRFYGASQPFGKDENSYSN-QNLAYLSVEQALEDLAQIIANFKTLRL 157
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ + P I IG SY G ++ WFR KYPH+V+G+LASSA IL D + + +
Sbjct: 158 HGLSENVPFITIGGSYPGAVSAWFRSKYPHLVVGALASSAVILPVEDFQQYD--YQIYLS 215
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEA 290
R + + C Q I+ ++++I A + T L+ E +YS
Sbjct: 216 TLR-SGQWCPQNIQAFNKQLESILVNGGEQAEKIIQQFNATNLRQ-DEFLSFFGDLYSGL 273
Query: 291 AQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGD 350
QY S +CN D + +V+ + + ++ N D
Sbjct: 274 VQYGRRS-----LLCNFFAQNTTFYDQLNSIYQYAIVQGNQ-PIEAYDTYTLTNTTYDED 327
Query: 351 ESDEGWEWQSCSEM--------VVPMGKDKNSMYQPEPWNLTKYIKNCK-EQYGVSPRPS 401
+ W WQ+C+E V PM + + N +YI N + P +
Sbjct: 328 AAGRQWVWQTCTEFGWFQTANQVQPMRSKQVDL------NFYRYICNVAFDGEHDDPDIT 381
Query: 402 WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLD 461
+ +GG L+ +NI+F+NG+ D + ++ F+ N +HC +
Sbjct: 382 ANVNRFGG------LKIGATNIVFTNGIEDEWQWASLRQSTPQLTSIFNNCDNC-AHCQE 434
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAINK 500
K +DP L RK I WI Q+Y + ++ K
Sbjct: 435 FRTPKPTDPPGLQSTRKQVEAIFAQWIHQFYLERQSFGK 473
>gi|335291865|ref|XP_003356607.1| PREDICTED: thymus-specific serine protease [Sus scrofa]
Length = 514
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 124/462 (26%), Positives = 204/462 (44%), Gaps = 48/462 (10%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ Q+F QRY +N ++W + P+ LG E + + G
Sbjct: 61 WLEQPLDPFNA--SDRQSFLQRYWVNDQYWTSQDGPVFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P L A+ R + +S ALAD S L + +N
Sbjct: 119 APVWGALVIGLEHRFYGLSIP----AEGLGMAKLR-FLSSRHALADVVSARLALTRLFNV 173
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LA W RLK+PH++ S+ASSAP+ D ++ Y+ VVSK
Sbjct: 174 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD---FSEYNDVVSKSLM 230
Query: 234 DT----SEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPLKNTT---ELKDGLDT 285
+T S EC + +++E++ + + LA L + C L+ EL L
Sbjct: 231 NTAIGGSPECRAAVSSAFAEVERRLRAGGAALAALQAELGACGSLRRAADREELLGTLQA 290
Query: 286 VYSEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYD 335
+ A QYD P V+++C N+ AP G ++ + L +
Sbjct: 291 LVGGAVQYDGQVGAPLSVRQLCGLLLGDRDNSSSPAPYLGLHRAVQVVTHSLGQKCLSFS 350
Query: 336 GNNS--RCYINEDRTGDESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKE 392
+ + + E + + W +Q+C+E +D + P L +++ C++
Sbjct: 351 RAETVAQLRVTEPQVSSLGERQWLYQTCTEFGYFVTCEDPGCPFSQLP-ALPSHLELCEQ 409
Query: 393 QYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDL 447
+G+S + +YYGG + ++F NG DP+ +
Sbjct: 410 VFGLSTSSIAQAVARTNSYYGGQT------PGATQVLFVNGDMDPWHV---LSVTQALGP 460
Query: 448 SFSQDLNLG-SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
S S L G SHCLD+ K SD L R+ +Q W+
Sbjct: 461 SESAILIPGASHCLDMAPEKPSDSPSLRLARQNIFLQLQTWL 502
>gi|440796326|gb|ELR17435.1| Serine protease precursor, putative [Acanthamoeba castellanii str.
Neff]
Length = 509
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 138/473 (29%), Positives = 203/473 (42%), Gaps = 66/473 (13%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSG--FTYE 113
+ Q +DHF P + + QRY N F+ G P+ LG E P I + G E
Sbjct: 62 FTQALDHFD--PRNSAKWQQRYFTNDTFYRPG-GPVFLMLGGEGPASP-IDVGGHFILNE 117
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A +F AL++ +EHR+YGKSVP +R L NA R + NS QALAD+A +I +K
Sbjct: 118 YAQRFNALVLSIEHRFYGKSVP--TRD--LSNANLR-FLNSEQALADFAMFRQYISEKLA 172
Query: 174 ATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
+A G SY G L+ WFRLKYPH+V GSLA+SAP+ D ++ Y+ VV +
Sbjct: 173 LPKTTKWVAFGGSYSGALSAWFRLKYPHLVDGSLATSAPVKAQLD---FSEYNEVVQRSL 229
Query: 233 R-DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA 291
E C +++ N+ S DG L K F CTP++ ++ + + A
Sbjct: 230 EFFVGESCANRVREGTQAATNLLSSADGKKQLQKLFNLCTPIETDDDIALFFENMEGTVA 289
Query: 292 QYDTPS----------NIP---------VKRICNAIENAPNCGDDILCKIAA-GVVEADS 331
Q + NI +KR+C+ GDD L A V
Sbjct: 290 QIVQYNNDNNNYNHGMNINKVIIIMLHWLKRMCDIFLK----GDDALAAYAEFNTVYNKL 345
Query: 332 LEYDGNNSRC--YINE---DRTGDESDEG----WEWQSCSEMVVPMGKDKNSMYQPEPWN 382
D ++ Y+N+ RT E+D W +Q+C E G + +P++
Sbjct: 346 FGVDCTQTKYTEYVNQLKDVRTFPENDNAAGRSWTYQTCIE----FGFYQTGSAANQPFS 401
Query: 383 LTKY----IKNCKEQYGVSPRPS--WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRG 436
T I N P P+ W T+YG + + +I NG DP+
Sbjct: 402 KTVTLDWDIFNIDPFNKAEPLPNIEWTNTFYGSTGL------ADPKVILPNGSIDPWHIL 455
Query: 437 GWV-KTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
G + +T + +N +HC DL + DP L R V + +I
Sbjct: 456 GVLPETAVAHPGQLAVLINGTAHCADLYPSSADDPLSLKDARAKIVAAIASFI 508
>gi|354500289|ref|XP_003512233.1| PREDICTED: thymus-specific serine protease [Cricetulus griseus]
Length = 509
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 119/469 (25%), Positives = 203/469 (43%), Gaps = 62/469 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ +TF QRY +N + G ++P+ +G E + ++G
Sbjct: 60 WLEQPLDPFNT--SDRRTFLQRYWVNDRHRAGQDAPVFLHIGGEGSLGPGSVMAGHPVAL 117
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L A+ R Y +S ALAD AS + N
Sbjct: 118 APAWGALVISLEHRFYGLSMPSG----GLDMAQLR-YLSSRHALADVASARQALSRLLNV 172
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LATW RLK+PH+V ++ASSAP+ + + Y+ VV++
Sbjct: 173 SSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPL---SAVLDFYAYNEVVARSLS 229
Query: 234 DT----SEECFQTIKKSWSEIDNIA-SKPDGLAILSKKFKTCTPL---KNTTELKDGLDT 285
S+EC +++E++ + + P A+L ++ C L ++ EL L
Sbjct: 230 QVAIGGSQECLAAASAAFAEVERLLRAGPAAQAVLREELSACGSLDLTEDQGELLGALQA 289
Query: 286 VYSEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYD 335
+ QYD + P V+++C N + P G +I + L +
Sbjct: 290 LVGGTVQYDGQAGAPLSVRQLCGLLLEDSGNRTHSTPYLGLRRAVQIVLRTMGQRCLSFS 349
Query: 336 GNNSRCYI--NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
+ + E + D W +Q+C+E + + + L ++ C++
Sbjct: 350 RAETVAQLRSTEPQVSGVGDRQWLYQTCTEFGYYVTCEGPQCPFSQLPALPSHLDLCEQV 409
Query: 394 YGVSPRPSWVL-----TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+G+S +YYGG + +++ NG DP +H LS
Sbjct: 410 FGLSAASVAQAVAQTNSYYGGQT------PGATQVLYVNGDTDP---------WHV--LS 452
Query: 449 FSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+QDL SHC D+ + SD L R+ + +Q W+
Sbjct: 453 VTQDLGPSEPAILIPSASHCFDMAPMRPSDSPSLRLGRQRIFQQLQVWL 501
>gi|291410769|ref|XP_002721683.1| PREDICTED: protease, serine, 16 [Oryctolagus cuniculus]
Length = 505
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 124/469 (26%), Positives = 204/469 (43%), Gaps = 63/469 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W G + P+ LG E + ++G
Sbjct: 57 WLEQPLDPFNT--SDRRSFLQRYWVNDQHWAGQDGPVFLHLGGEGSLGPGSVMTGHPAAL 114
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L A+ R + +S AL D AS L + N
Sbjct: 115 APAWGALVIGLEHRFYGLSLPAG----GLDLAQLR-FLSSRHALTDAASARLALSRLLNV 169
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P + G SY G LA W RLK+PH+ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 170 SSSSPWVCFGGSYAGSLAAWARLKFPHLFFASVASSAPVRATLD---FSEYNDVVSRSLT 226
Query: 234 DT----SEECFQTIKKSWSEIDNIASKPDGLAI-LSKKFKTCTPLKNT---TELKDGLDT 285
+ S EC + ++++++ I L + C L + EL L
Sbjct: 227 NAAVGGSPECRAAVAAAFADVERRLRAGGRARIALRTELGACGRLGSAADRAELLGALQA 286
Query: 286 VYSEAAQYDTPSNIP--VKRICNAIEN-------APNCGDDILCKIAAGVVEADSLEYD- 335
V AAQYD + P V+ +C + AP G +I + L +
Sbjct: 287 VVGGAAQYDRQAGAPLGVRTLCGILLARDRRGVPAPYSGLRRAVQIVKHSLGQRCLSFSR 346
Query: 336 -GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMG-KDKNSMYQPEPWNLTKYIKNCKEQ 393
++ E + D W +Q+C+E + +D + P L ++ C++
Sbjct: 347 AATVAQLRATEPQVSGVGDRQWLYQTCTEFGFYVSCEDLRCPFSLLP-ALPSHLDLCEQV 405
Query: 394 YG-----VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+G V+ + +YYGG + ++++F NG DP +H LS
Sbjct: 406 FGLTASSVAQAVAQTNSYYGGQTPR------ATHVLFVNGDIDP---------WHV--LS 448
Query: 449 FSQDLNL---------GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+Q L SHCLD+ + SD L Q R+ + +Q W+
Sbjct: 449 VTQALGPSAPALLIPGASHCLDMAPERPSDSPSLCQGRQNISQQLQAWL 497
>gi|156391046|ref|XP_001635580.1| predicted protein [Nematostella vectensis]
gi|156222675|gb|EDO43517.1| predicted protein [Nematostella vectensis]
Length = 444
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 107/397 (26%), Positives = 183/397 (46%), Gaps = 45/397 (11%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF + +T+ QR+ N F +SP+ +G E I L G + A
Sbjct: 21 QRLDHFD--DSNTETWKQRFYYNDTFRKTKDSPVFLMVGGEGAISPVWVLIGNMMKYAEG 78
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
F A+ ILEHR+YG+S P ++ + +A + Y NS QALAD A+ + K+N T +
Sbjct: 79 FGAMAFILEHRFYGQSHP----RSDMSDANLK-YLNSEQALADLAAFRQAMSVKFNLTDS 133
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
I+ G SY G L+ W RLKYPH++ G++ASSAP+L + + Y VV+ T
Sbjct: 134 KWISFGGSYPGSLSAWLRLKYPHLIHGAVASSAPVLAQLN---FPEYLEVVTASLETTGP 190
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-----KNTTELKDGLDTVYSEAAQ 292
+C + I + + I+ + +G L+ F+ C PL + + L ++ Q
Sbjct: 191 DCTKNIANATAAIEELLDADEGTKKLTNLFRVCEPLNRRNDNDVSTFSSNLAGLFMGVVQ 250
Query: 293 YDTP---------SNIPVKRICNAIENAPNCGDDIL--CKIAAGVVEA------DSLEYD 335
Y+ +NI + +C I N + G ++ K+ + +++ D+ +
Sbjct: 251 YNKDNRAFEGVPGTNITIATVC-GIMNDKSLGPALMRYAKLNSLILDTYGEKCLDASYQN 309
Query: 336 GNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG 395
NS ++ D + E W +Q+C+E D ++ + + L I+ C + +G
Sbjct: 310 AINSLRNVSWDSSAAEGGRQWTYQTCTEFGFYQTTDSDNQPFGKRFPLKYSIQQCMDVFG 369
Query: 396 VSPRPSWVL-------TYYGGHDIKLILRRSTSNIIF 425
+ S + T YGG I S+ +I+F
Sbjct: 370 EAFNSSNLASGIRQTNTNYGGKGIA-----SSRDIVF 401
>gi|47224819|emb|CAG06389.1| unnamed protein product [Tetraodon nigroviridis]
Length = 493
Score = 136 bits (342), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 128/488 (26%), Positives = 206/488 (42%), Gaps = 65/488 (13%)
Query: 38 RSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGA 97
R ++ + D + ++ Q +DHFS ++ QRY +N F+ G P+ +G
Sbjct: 36 RFKVRRETDLKTVSDEQWFIQRLDHFSADSREWK---QRYFVNEAFYKPGG-PVFLMIGG 91
Query: 98 EAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQA 157
E G A + A+ +LEHR+YGKS P + + N R + +S QA
Sbjct: 92 EGAASPAWMQYGTWLTYAEKLGAICFMLEHRFYGKSHP--TSDLSTDNLR---FLSSRQA 146
Query: 158 LADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGD 217
LAD A I + T+A +A G SY G LA W RLKYPH+V ++A+SAPI
Sbjct: 147 LADLAHFRTVIAEARGLTNAKWVAFGGSYPGSLAAWLRLKYPHLVHAAVATSAPI---HA 203
Query: 218 ITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT 277
+ Y VV + C +K + + + P ++K F C+ L+ T
Sbjct: 204 TVNFPEYLEVVWRSLASEDAACPVLVKNASDTLAELLKDPKTYDNITKDFNLCSKLQIQT 263
Query: 278 ELKDG--LDTV---YSEAAQYDTP---------SNIPVKRICNAIEN----APNCGDDIL 319
E+ L+T+ + E QY+ +N+ +K +C + + AP +
Sbjct: 264 EMDSAQFLETLAGNFMEVVQYNEDNRAFEGALGTNVTIKVLCGVMRDSSLGAPYARYAAV 323
Query: 320 CKIAAGVVEADSLEYDGNN-SRCYINEDRTGDESDEG--WEWQSCSEMVVPMGKDK-NSM 375
++ V L+ + R N G + G W +Q+C+E D N
Sbjct: 324 ARLMLDTVSMKCLDSSFDAYVRDMTNSSWDGPAAGGGRQWVYQTCTEFGFFQSSDSPNQP 383
Query: 376 YQPEPWNLTKYIKNCKEQYGVSPRPSWVLT--------YYGGHDIKLILRRSTSNIIFSN 427
+ P L +K C++ Y +S +LT YYG +DI+ +S I+F N
Sbjct: 384 FTGFP--LRFQVKQCEQFYNISAE---LLTAAVAQTNEYYGSYDIR------SSRIVFPN 432
Query: 428 GMRDPFSRGGWVKTYHFFDLSFSQDLNL-----GSHCLDLDEAKKSDPDWLVQQRKTEVK 482
G DP+ G + +QDL +HC ++ A+ D LV R
Sbjct: 433 GAIDPWHALGITQ-------DITQDLPAVFIQGTAHCANMYPARNEDLPQLVLARDHIFF 485
Query: 483 IMQGWITQ 490
++Q W+ Q
Sbjct: 486 LLQQWLKQ 493
>gi|326429444|gb|EGD75014.1| hypothetical protein PTSG_07238 [Salpingoeca sp. ATCC 50818]
Length = 502
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 124/512 (24%), Positives = 207/512 (40%), Gaps = 57/512 (11%)
Query: 8 FQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFF---------YNQ 58
F L + V H+ + +G + + + + H F + Q
Sbjct: 12 FVALACGTLAVPQQQHSGSARRHKHVGAMNQGLLHVMRKQGHDTSDIFTPVSADVKWFTQ 71
Query: 59 TIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN-IQLSGFTYENAHQ 117
+DHF+ P+ +TF Q+Y +N+ + G P+ LG E P +++ A Q
Sbjct: 72 KVDHFN--PQDTRTFQQQYQVNATYHKQG-GPVFLMLGGEGPASPRWLEIDTAIMIYARQ 128
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
A++V LEHR+YGKS PF + H Y +S QALAD A+ L A
Sbjct: 129 HDAVVVQLEHRFYGKSQPFKDL-----STDHLQYLSSEQALADAANFLTSFMPG-----A 178
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-- 235
P + G SY G LA +FR KYPH+V G++++S+P+ D + YH VV
Sbjct: 179 PAVVFGGSYSGALAAFFRSKYPHLVNGAISTSSPVYALVD---FHQYHEVVRNSLATVPH 235
Query: 236 -SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA--- 291
C I + +I + +G L+K F C T + D ++T+++ A
Sbjct: 236 NGSHCSAAIATATEKIQTMLKTTNGRKQLAKDFNLCGDSDVTHD--DDIETLFTNLAGNI 293
Query: 292 ----QYDTPSN-------IP-VKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS 339
QY+ +N +P + +C + PN + + + +E N
Sbjct: 294 DGVVQYNLDNNHFEGRTKVPTITDVCAVMAATPNDPYAAYANLQKYLTGGECIETSYANM 353
Query: 340 RCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGV 396
+ + G W +Q+C E + N +L + C + YGV
Sbjct: 354 IAEMKNTSLSSDVAGGMRQWIYQTCVEFGFYQTSEGNDKPFLNTISLKYNLDQCSDIYGV 413
Query: 397 -SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
P +W YGG+D+ + +NI++ NG+ DP+ T D + +
Sbjct: 414 PGPNVNWTNANYGGYDV------AGTNIVYVNGLIDPWHALSRTDTA-LPDGCDAIVIPQ 466
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGW 487
+HC ++ DP L + R+T + W
Sbjct: 467 TAHCANMYPPSPDDPPALTRARETISSYLGVW 498
>gi|118787231|ref|XP_315944.3| AGAP005914-PA [Anopheles gambiae str. PEST]
gi|116126698|gb|EAA11647.3| AGAP005914-PA [Anopheles gambiae str. PEST]
Length = 502
Score = 135 bits (341), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 113/463 (24%), Positives = 203/463 (43%), Gaps = 67/463 (14%)
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSGFT 111
++ Q +DH P + T+ QRY +N +++ N+P+ +G E G
Sbjct: 55 LWFEQQLDH--NDPTNAATWQQRYYVNDQYFNASDPNAPVFLMIGGEGEATARWMHEGAW 112
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
A + AL LEHR+YGKS P + Y S QALAD A ++ + DK
Sbjct: 113 IRYAEKHGALCFQLEHRFYGKSRPTEDLSTS-----SLAYLTSEQALADLAYFIVAMNDK 167
Query: 172 YNAT--HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
Y IA G SY G LA W R KYP +V G+++SS P+L D + Y+ V+
Sbjct: 168 YQLEPHRHRWIAFGGSYPGSLAAWLREKYPSLVHGAISSSGPLLAKID---FVEYYDTVT 224
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-------LKDG 282
+ S +C + ++ ++ +++ + G L++KF+ C P++ + E L +
Sbjct: 225 RSLERYSADCVRAVRSAFQQVETLLKHMIGQRTLNEKFQLCDPVERSIENPLDIANLFEA 284
Query: 283 LDTVYSEAAQY---DTP-SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNN 338
+ + ++ QY ++P + + + +C+ + N + + + E + + +N
Sbjct: 285 IASNFAGVVQYNKDNSPHATVTIDEVCDVMVN------QTIGAPVSRLAEVNRILLKQSN 338
Query: 339 SRCY-------INEDRT---GDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTK 385
+ C I E R G G W +Q+C+E + S + + +
Sbjct: 339 TTCLDFVYDKSIEEMRNTSWGSSQASGARQWTYQTCNEFGFYQTSNNASSVFGDRFPVEF 398
Query: 386 YIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRS----------TSNIIFSNGMRDPFSR 435
+++ C + YG T +G + + R+ T+N+++ +G DP+ R
Sbjct: 399 FVRQCVDVYG---------TRFGAESLARAVYRTNTNYGALDPATTNVLYVHGNIDPWHR 449
Query: 436 GGWVKTYHFFDLSFSQDLNLGS-HCLDLDEAKKSDPDWLVQQR 477
G ++ D+ L G+ HC ++ E K SDP L Q R
Sbjct: 450 LGLTESN---DIHMPTILIDGTAHCANMYEPKDSDPPQLKQAR 489
>gi|145490518|ref|XP_001431259.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398363|emb|CAK63861.1| unnamed protein product [Paramecium tetraurelia]
Length = 446
Score = 135 bits (340), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 128/452 (28%), Positives = 202/452 (44%), Gaps = 45/452 (9%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E +F +Q IDH Y + QTF QRY +++ +L ++ E + F
Sbjct: 24 ERWFEHQLIDH--YDRTNTQTFRQRYWTVEEYFQPEGGAVLFWICGEYTCPGIRKERLFP 81
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI-KD 170
E A KALIV+LEHRYYGKS+PF L+N + Y QAL D A LHI +
Sbjct: 82 VELAQTHKALIVVLEHRYYGKSMPFDEDALRLENLK---YLGIRQALDDLAYFQLHIVQG 138
Query: 171 KYNATHA--PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
K+ P IAIG SY G +A W+R +YPH+V+G+LASSA + +T + + + V
Sbjct: 139 KFFGVRESHPWIAIGGSYPGAMAAWYRYQYPHLVVGALASSAVVQI---LTDFPKFDTQV 195
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYS 288
+ ++C +K ++ A D L ++ + N +LKD Y
Sbjct: 196 YLSALKSGQQCADDLKA----LNQYAE--DNLDVIRARL-------NAQKLKDDEFLFYF 242
Query: 289 EAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRT 348
A ++CN ++ + + I+ +VE + Y Y +D
Sbjct: 243 TDAIILKIQYGGRTKLCNDLK-GKTIEEQMDYFISRTLVEENPESYGS-----YYLKDDV 296
Query: 349 GDE----SDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG----VSPRP 400
DE S W++Q C+E+ + + + +L Y + CK+ +G + P
Sbjct: 297 YDEHNLRSSRQWKYQCCTEVGWWQTAPEQDSLRSDRLDLEFYRQYCKDIFGEELKLWPDE 356
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL 460
Y+GG D+++ NIIF+NG DP+ ++ F++ N G HC+
Sbjct: 357 DLGNAYFGGFDLQV------DNIIFTNGDEDPWKWVSIIEEKGKFNVYHINCANAG-HCV 409
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+L D D L Q R +I WI ++Y
Sbjct: 410 ELYTPTDQDCDQLKQARIEISQIFGNWIREHY 441
>gi|391335934|ref|XP_003742341.1| PREDICTED: putative serine protease K12H4.7-like [Metaseiulus
occidentalis]
Length = 486
Score = 135 bits (339), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 134/525 (25%), Positives = 222/525 (42%), Gaps = 91/525 (17%)
Query: 11 LLLMVIF---VSTSFH-ANGLKLRPRLGRIRRSRILEQKDSNHGF---ETFFYNQTIDHF 63
L+L+V+ ST H L LG R + S H F E ++ Q ++HF
Sbjct: 5 LVLIVVLGLAASTRLHRGRPLHKHGMLGEPRST-------SRHSFGEAEIRWHTQRMNHF 57
Query: 64 SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN-IQLSGFTYENAHQFKALI 122
P +T+ QRY++N +F+ G P+ LG E + ++ + A + AL+
Sbjct: 58 D--PADRRTWKQRYMVNDEFYREGG-PVFLLLGGEGEASISWVEKNTHVMLMAKKHNALV 114
Query: 123 VILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT-HAPVIA 181
LEHR+YG+S P + + + Y +S QALAD A I ++ N + A +
Sbjct: 115 FQLEHRFYGQSRP-----TSDLSTENLVYLSSEQALADAAHFRNVITNRRNLSPDAKWVV 169
Query: 182 IGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-SEECF 240
G SY G LA WF+LKYPH+ +G++ASSAP+L I + Y VV RD+ C
Sbjct: 170 FGGSYSGSLAAWFKLKYPHLAVGAVASSAPLLA---IIDFQDYVRVV----RDSLGSSCS 222
Query: 241 QTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK--NTTELKDGLDTV---------YSE 289
+K + + A++ + +FKTC P N+ L++ T+ Y++
Sbjct: 223 AKVKDGFQALQVKAARRSSWPDIDNEFKTCVPFDGYNSLNLQNFFQTIAGNFEGIVQYNK 282
Query: 290 AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC-------Y 342
+ + +NI + +C +ENAP + G+ + L + +S+C +
Sbjct: 283 DQRMEGRTNISIDDLCRLMENAPTPLE--------GLASVNDLLLESTDSKCLDYDYAKF 334
Query: 343 INEDR------TGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCK----- 391
+ E R E W +Q+C E D + + + +I+ C+
Sbjct: 335 VREMRNVSYSSVAAEGGRQWTYQTCVEFGFFQSSDAEDQPFGDLFPVELFIQQCRDIFDD 394
Query: 392 --EQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK------TYH 443
+Q + T YGG KL +N+ F NG DP+ +K T H
Sbjct: 395 FFDQSMLDNAIFRTNTEYGGQQPKL------TNVTFPNGSIDPWHALSILKNLSDSVTAH 448
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
F + + +HC D+ D + L R+ + W+
Sbjct: 449 FIEGT--------AHCADMYPPSAEDDETLTAGRQKIEAEVAKWL 485
>gi|170574568|ref|XP_001892871.1| Serine carboxypeptidase S28 family protein [Brugia malayi]
gi|158601365|gb|EDP38293.1| Serine carboxypeptidase S28 family protein [Brugia malayi]
Length = 567
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 129/474 (27%), Positives = 206/474 (43%), Gaps = 59/474 (12%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
IDHF+Y + + F +Y+ N ++ + P+ + G E I+ Q +G ++ A +F
Sbjct: 59 IDHFTY--RNNEFFSLKYLANYSYFLC-DGPLFFYAGNEGDIEAFAQNTGIIWDLAPRFH 115
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN------ 173
A IV EHRYYG S P+G R + + GY N Q LAD+A ++ +K
Sbjct: 116 AAIVFAEHRYYGNSKPYGKR--SYMDVLRLGYLNDIQVLADFAQLITFLKTDQEELGFCP 173
Query: 174 -ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL-YYGDITPWTTYHSVVSKD 231
T PVI G SYGG LA W R+KYPH+V G+ ASSAP+ +YG + +V + +
Sbjct: 174 PGTEIPVIVFGGSYGGMLAAWLRMKYPHIVDGAWASSAPLRNFYGTGINPESVSNVTTTN 233
Query: 232 FRDTSEEC-FQTIKKSWSEIDNIASKPDGLAILSKKF--KTCTPLKNTTELKDGLDTVYS 288
+ + C ++ + + I ++ +G L++ F K +KN+ + +YS
Sbjct: 234 Y--VTNGCDWKVFSEGFVAIQKLSKTEEGRMKLNQIFRSKPGFEMKNSDDFTSLYSYIYS 291
Query: 289 -----EAAQYDTPSN-------IPVKRICN-AIENAPNCGDDILCKIAAGVVEADSLEYD 335
Y P+N PVK C A + N D+ L + V+ Y
Sbjct: 292 AVFYMAVIDYPYPTNFLTPLPGFPVKYACQYATKXGIN--DEELAEQLYNVINV-YYNYT 348
Query: 336 GN------NSRCYINEDRTGDESDEGWEWQSCSEMVVP----MGKDK---NSMYQPEPWN 382
G S C I ++ W WQ C+ M + G++ N+ P+
Sbjct: 349 GQLTDHCFTSNCTILSPFQNNDEVIAWNWQCCTSMTMQNCDRSGENDFFLNTCDNPDGL- 407
Query: 383 LTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKL---ILRRSTSNIIFSNGMRDPFSRGGWV 439
+ IK C E + S +Y HD + ++ +TSN IFSNG DP+S G
Sbjct: 408 INISIKYCTELFKDIGYSS---NFYKLHDTMIRYGMIYNATSNTIFSNGNLDPWSASGVY 464
Query: 440 KTYHFFDLSFSQDLNL-----GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ + + + +H LDL DP + +R I++ W+
Sbjct: 465 ENSPGITNAMRNGVYIFYMSDAAHHLDLRTPNTCDPPSVTYERFQVTNIIKCWV 518
>gi|149029310|gb|EDL84577.1| protease, serine, 16 (thymus), isoform CRA_b [Rattus norvegicus]
Length = 496
Score = 135 bits (339), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 120/474 (25%), Positives = 204/474 (43%), Gaps = 72/474 (15%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ + F QRY +N + G ++P+ +G E + ++G
Sbjct: 44 WLEQPLDPFNA--TDRRIFLQRYWVNDQHRTGQDAPVFLHIGGEGSLGPGSVMTGHPAAL 101
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G AL + Y +S ALAD AS + N
Sbjct: 102 APAWGALVISLEHRFYGLSMPAGGLDVALLH-----YLSSRHALADVASARQALSGLLNV 156
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LATW RLK+PH+V ++ASSAP+ + ++ Y+ VV++
Sbjct: 157 SSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPL---NAVVDFSAYNQVVARSLT 213
Query: 234 DT----SEECFQTIKKSWSEIDNIASKPDGL---AILSKKFKTCTPL---KNTTELKDGL 283
S EC +++E++ + GL A+L ++ C L ++ EL L
Sbjct: 214 QVTIGGSLECLAAASTAFAEVERLLRA--GLASQAVLREELGACGSLDLIEDQAELLGAL 271
Query: 284 DTVYSEAAQYDTPSNIP--VKRICNAI--------ENAPNCG-----DDILCKIAAGVVE 328
+ QYD + P V+++C + + P G +L + +
Sbjct: 272 QALVGGTVQYDGQAGSPLSVRQLCGLLLRTWGNRNHSTPYLGLRRAVQRVLRSMGQKCLS 331
Query: 329 ADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIK 388
+ E ++ E + D W +Q+C+E + + + L +
Sbjct: 332 SSRAE---TVAQLRNTEPQVSGVGDRQWLYQTCTEFGFYVTCEGLQCPFSQLPALPFQLD 388
Query: 389 NCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
C++ +G+SP + +YYGG + ++F NG DP +H
Sbjct: 389 LCEQVFGLSPSSVAQAVAQTNSYYGGQS------PGATQVMFVNGDTDP---------WH 433
Query: 444 FFDLSFSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
LS +QDL SHCLD+ + SD L R+ + +Q W+
Sbjct: 434 V--LSVTQDLGPSEPALLIPSASHCLDMAPMRPSDSPSLRLGRQKIFQQLQVWL 485
>gi|357612135|gb|EHJ67826.1| putative thymus-specific serine protease [Danaus plexippus]
Length = 494
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 135/511 (26%), Positives = 229/511 (44%), Gaps = 76/511 (14%)
Query: 20 TSFHANGLKLRPRLGRIRRSR----ILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQ 75
TSF G+ R LGR + + KDS + F Q +DH S P + + Q
Sbjct: 15 TSFVQCGVNFR--LGRSKHGNLGAPVGADKDSLPPNKYFL--QKLDHSS--PTDQRYWEQ 68
Query: 76 RYVINSKFWGGGN-SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSV 134
RY +N F+ N P+ +G E D + G + A FKAL ++LEHRYYG+S
Sbjct: 69 RYFVNESFYDFNNPGPVFLMIGGEGTADPRWMVKGTWIDYAIHFKALCILLEHRYYGQSR 128
Query: 135 PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATH-APVIAIGASYGGELATW 193
P + ++KN + Y +S QALAD A + + +KY + G SY G LA W
Sbjct: 129 P--TMDLSVKNLQ---YLSSYQALADLAYFINAMNNKYKFNKDVKWVVFGGSYPGSLAAW 183
Query: 194 FRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS--EECFQTIKKSWSEID 251
RLKYPH+V +++SS P++ + Y VV R+ + EEC +K + +I
Sbjct: 184 MRLKYPHLVHAAVSSSGPLV---AKVNFMEYFQVVVNALREKTGGEECVGQVKLAHKQIQ 240
Query: 252 NIASKPDGLAILSKKFKTCTPLKNTTE--LKDGLDTV---YSEAAQYDTPS--------- 297
I K D A + ++F+ C P ++ +K+ +++ +++ QY+ +
Sbjct: 241 EII-KTDP-ATIEREFRVCEPFSKASQNDMKNFYNSIADDFADLVQYNEDNRISGDKMYK 298
Query: 298 NIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC-------YINEDRT-- 348
N+ + +C+ + G K+AA +S+ + +N C I E R
Sbjct: 299 NLTINSVCDMLTEPG--GKPAFKKLAA----YNSIVLNKSNQTCLDYGYDNMIKEMRNIS 352
Query: 349 -GDESDEGWEWQSCSEMVVPMGKDKNSMYQPE---PWNLTKYIKNCKEQYG-------VS 397
G E W +Q+C+E G + S + E ++L +I+ CK+ +G ++
Sbjct: 353 WGSEGGRQWMYQTCTE----FGFYQTSSSEIEVFGDFSLEFFIQQCKDVFGSKFNDAFIN 408
Query: 398 PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGS 457
W + YGG +I +++ +G DP+ G T + + + + +
Sbjct: 409 DAAKWTNSDYGGLNI------PAKRVVYVHGSIDPWHALGMTTTEE--NDAPAIFIRGTA 460
Query: 458 HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
HC ++ A K+D LV R ++ W+
Sbjct: 461 HCANMYPASKNDNPGLVSARMEVRSYLESWL 491
>gi|301109703|ref|XP_002903932.1| serine protease family S28, putative [Phytophthora infestans T30-4]
gi|262096935|gb|EEY54987.1| serine protease family S28, putative [Phytophthora infestans T30-4]
Length = 528
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 120/465 (25%), Positives = 200/465 (43%), Gaps = 51/465 (10%)
Query: 42 LEQKDSNHGFET----FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGA 97
L Q +S F+T ++NQT+DHF+ +S TF QRY +KFW + P++ ++G
Sbjct: 33 LVQAESQLLFKTEAQQLWFNQTVDHFA--SDSNATFQQRYYEVNKFWSKPDGPVILYIGG 90
Query: 98 EAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQA 157
E ++ +GF + A +F A I+ LEHR+YG+S+P G L +R Y QA
Sbjct: 91 EGAMEKAP--AGFVHVIAQKFDAKILALEHRFYGRSIPNGD----LSTENYR-YLTVQQA 143
Query: 158 LADYASILLHIKDKYNATHAPV-----IAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
LAD L H K+ Y + IAIG SY G L+ WFR+ YP + SL+SS
Sbjct: 144 LAD----LKHFKESYQSQLGAKDANQWIAIGGSYPGALSAWFRIAYPDATVASLSSS--- 196
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP 272
G + P +H + C ++ + + + + A+ KK
Sbjct: 197 ---GVVQPVYKFHQFDEQVALAAGPSCADVLRLTTEVFEKEVASANATAV--KKLFGAQD 251
Query: 273 LKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSL 332
L + D + A QY +CN++ A + ++ A+ ++
Sbjct: 252 LADADFFYMIADAA-AMAVQYGHKD-----IVCNSMVGAFERNNSLVDSFASFTIDMYGS 305
Query: 333 EYDGN---NSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKN 389
+ +++C + +DR+ W WQ CS++ K + +L ++K
Sbjct: 306 SFGSECFYDTKC-LADDRSRWGDGRSWRWQKCSQLAYFQVAPKEKSLRSAMLDLDYHLKQ 364
Query: 390 CKEQYGVSPRPS----WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
C+ +G PS + YGG + I FSNG DP+ R +
Sbjct: 365 CQTVFGDVVHPSEGVDEITKLYGGD------HPNGHKIFFSNGGDDPWQRASVLDKLSDD 418
Query: 446 DLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
++ L HC DL A + P+ L +QR+ ++ + W+ +
Sbjct: 419 QIANLAKCQLCGHCGDL-SANPNVPEPLKKQREQILEYLTKWLGE 462
>gi|76096362|ref|NP_001028882.1| thymus-specific serine protease precursor [Rattus norvegicus]
gi|75775148|gb|AAI04723.1| Protease, serine, 16 (thymus) [Rattus norvegicus]
gi|149029311|gb|EDL84578.1| protease, serine, 16 (thymus), isoform CRA_c [Rattus norvegicus]
Length = 512
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 120/474 (25%), Positives = 204/474 (43%), Gaps = 72/474 (15%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ + F QRY +N + G ++P+ +G E + ++G
Sbjct: 60 WLEQPLDPFNA--TDRRIFLQRYWVNDQHRTGQDAPVFLHIGGEGSLGPGSVMTGHPAAL 117
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G AL + Y +S ALAD AS + N
Sbjct: 118 APAWGALVISLEHRFYGLSMPAGGLDVALLH-----YLSSRHALADVASARQALSGLLNV 172
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LATW RLK+PH+V ++ASSAP+ + ++ Y+ VV++
Sbjct: 173 SSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPL---NAVVDFSAYNQVVARSLT 229
Query: 234 DT----SEECFQTIKKSWSEIDNIASKPDGL---AILSKKFKTCTPL---KNTTELKDGL 283
S EC +++E++ + GL A+L ++ C L ++ EL L
Sbjct: 230 QVTIGGSLECLAAASTAFAEVERLLRA--GLASQAVLREELGACGSLDLIEDQAELLGAL 287
Query: 284 DTVYSEAAQYDTPSNIP--VKRICNAI--------ENAPNCG-----DDILCKIAAGVVE 328
+ QYD + P V+++C + + P G +L + +
Sbjct: 288 QALVGGTVQYDGQAGSPLSVRQLCGLLLRTWGNRNHSTPYLGLRRAVQRVLRSMGQKCLS 347
Query: 329 ADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIK 388
+ E ++ E + D W +Q+C+E + + + L +
Sbjct: 348 SSRAE---TVAQLRNTEPQVSGVGDRQWLYQTCTEFGFYVTCEGLQCPFSQLPALPFQLD 404
Query: 389 NCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
C++ +G+SP + +YYGG + ++F NG DP +H
Sbjct: 405 LCEQVFGLSPSSVAQAVAQTNSYYGGQS------PGATQVMFVNGDTDP---------WH 449
Query: 444 FFDLSFSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
LS +QDL SHCLD+ + SD L R+ + +Q W+
Sbjct: 450 V--LSVTQDLGPSEPALLIPSASHCLDMAPMRPSDSPSLRLGRQKIFQQLQVWL 501
>gi|332021809|gb|EGI62155.1| Putative serine protease K12H4.7 [Acromyrmex echinatior]
Length = 493
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 118/465 (25%), Positives = 220/465 (47%), Gaps = 60/465 (12%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF P + + QR+ +N F+ N PI +GAE + + + G E A +
Sbjct: 53 QYLDHFD--PTNVNDWQQRFFVNVDFYKP-NGPIFLMIGAEGTANASWMVEGEWIEYAKE 109
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
F A+ LEHRYYGKS P + ++KN Y +S QALAD A + + +
Sbjct: 110 FGAMCFYLEHRYYGKSHP--TIDLSVKNLM---YLSSEQALADLAYFIASVNVDL-PRNT 163
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
I G SYGG LA W R KYPH+V G++++S P+L D ++ Y+ VV+ ++ S+
Sbjct: 164 KWIVFGGSYGGSLAAWMRAKYPHLVHGAVSTSGPLLAQID---FSEYYQVVTNALKEYSD 220
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP-------LKNTTELKDGLDTVYSEA 290
+C + I+++ S+++ + G + KKF+ C P L + + L + L + ++
Sbjct: 221 QCVRIIQEANSQLNIMLHHTVGQQQIQKKFRLCDPIDPGYTKLVDISNLYEALASNFASV 280
Query: 291 AQYDTPS-------NIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC-- 341
QY+ + NI ++ +C+ + D+ + K + +S+ + +C
Sbjct: 281 VQYNKDNRQSSSFVNITIETVCDILV------DEKIEKAIDRLAYVNSMILNATKEKCLD 334
Query: 342 -----YINEDR----TGDESDEGWEW--QSCSEMVVPMGKDKNSMYQPEPWN----LTKY 386
I+E R T ++++ G +W Q+CSE G + S Q + + + +
Sbjct: 335 YRYDTMIHELRNITWTSEQAEGGRQWIYQTCSE----FGFFQTSTGQIDVFGNKFPVEFF 390
Query: 387 IKNCKEQYGVSPRPSWVLTYYGGHDIKLI---LRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
+ C + +G P+ + L ++ L +N++F +G DP+ G V++ +
Sbjct: 391 AQQCVDIFG--PKYNMDLLKSAVTRTNILYGALNLQVTNVVFVHGSVDPWHVLGIVQSSN 448
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ + +N +HC ++ + D L + RK +++ W+
Sbjct: 449 --PQAPAIYINGTAHCANMYPPSERDMPQLKEARKVIRGLIKQWL 491
>gi|413954868|gb|AFW87517.1| putative serine peptidase S28 family protein [Zea mays]
Length = 528
Score = 134 bits (338), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 94/252 (37%), Positives = 126/252 (50%), Gaps = 31/252 (12%)
Query: 208 SSAPILYYGD-ITPWTTYHSVVSKDFRDTSEECFQTIKKSWS--EIDNIASKPDGLAILS 264
SSAPI+ D ITPW++++ VS+DF+ S CF IK +W +D A+ GL LS
Sbjct: 287 SSAPIILQLDHITPWSSFYDAVSQDFKSESMNCFSVIKATWGWDALDERAASDGGLLDLS 346
Query: 265 KKFKTCTPLKNTTELKDGLDTV--YSEAAQYDTPSNI-------PVKRICNAIENAPNCG 315
K F+ C LK +++ L T Y+ Y TP+N PVK +C I+ P G
Sbjct: 347 KLFRACKTLKYAYSIRNWLWTAFAYTAMVDYPTPANFLQNLPAYPVKEMCKIIDGFP-AG 405
Query: 316 DDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSM 375
D+L K A A SL Y+ + W SC PM SM
Sbjct: 406 ADVLDKAFA----AASLYYNYTGDQTCTASMAGSGRLARRW---SCG----PMTVSNESM 454
Query: 376 YQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNG 428
+ P ++ + C + +G V PRP WV T YGGH I+ +L+R SNIIFSNG
Sbjct: 455 FPPSTFSYEERSDECFQSWGGGGGGGGVRPRPHWVTTEYGGHKIEKVLKRFGSNIIFSNG 514
Query: 429 MRDPFSRGGWVK 440
MRDP+SRGG +K
Sbjct: 515 MRDPWSRGGVLK 526
>gi|281202572|gb|EFA76774.1| peptidase S28 family protein [Polysphondylium pallidum PN500]
Length = 463
Score = 134 bits (338), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 131/477 (27%), Positives = 208/477 (43%), Gaps = 71/477 (14%)
Query: 46 DSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG---GGNSPILAFLGAEAPID 102
++N +E +YNQT++HF E +TF QRY +N +++ GG PI+ ++ E P+
Sbjct: 24 EANVKWEYQWYNQTLNHFD--AEDTRTFLQRYYVNDQYYDYKKGG--PIILYINGEGPVS 79
Query: 103 D--NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALAD 160
N G T A ALIV LEHRYYG S PF A + + + +S QAL D
Sbjct: 80 SPPNKPTDG-TVIYAQALGALIVTLEHRYYGDSSPF-----ADLSTENLKFLSSRQALND 133
Query: 161 YASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA---PILYYGD 217
A +L + +I IG SY G L+ WFR+KYPHV +GS+ASS IL +
Sbjct: 134 LAIFILDYRSTIQ-NAGDIITIGGSYSGALSAWFRVKYPHVTVGSVASSGVVNAILDFTA 192
Query: 218 ITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT 277
W Y + E+C ++ S + + A+ K+ L +
Sbjct: 193 FDEWVAYAA---------GEQCADALRLVTSTAEQQIFGGNAAAV--KQLFQAETLTDDG 241
Query: 278 ELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVE--ADSLEYD 335
+ L +E QY ++C + NA N D+L + + +L
Sbjct: 242 DFFYWLADSMAEGIQYGYH-----DQLCTPLINAMNNKGDMLETYSNYTINVWGTTLGTP 296
Query: 336 GNNSRCYINEDRTGD--ESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
G + + ++ T D ++D W +Q+C+E + + N+T + +C
Sbjct: 297 GEYATLF-QQNTTHDINKADRQWWFQTCTEFGYFQNAPAQGSIRSQMVNMTYHRTHCANV 355
Query: 394 YGVSPRPSWVLT-----YYGGHDIKLILRRSTSNIIFSN---------------GMRDPF 433
+G +P W T YYGG+ + +NI+F+N G +DP+
Sbjct: 356 FG---KPLWPNTEATNDYYGGN------HTAGTNIVFTNVSRKLEIRENNQSPIGSQDPW 406
Query: 434 SRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKK--SDPDWLVQQRKTEVKIMQGWI 488
SR + + S N HC+DL E S P+ L Q R+ +KI++ W+
Sbjct: 407 SRASITVQEYPTEPSLMVTCNNCGHCVDLRECPGGCSTPNNLDQVRQQTLKIIETWL 463
>gi|339245689|ref|XP_003374478.1| putative serine protease [Trichinella spiralis]
gi|316972265|gb|EFV55948.1| putative serine protease [Trichinella spiralis]
Length = 490
Score = 134 bits (338), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 117/471 (24%), Positives = 204/471 (43%), Gaps = 54/471 (11%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP-----IDDNIQLSG 109
++ Q +DHF + + T+ QRY +N+K++ G P +G E P I ++ +
Sbjct: 39 YFQQNLDHFHH--QQNITWLQRYWVNTKYYKPG-GPAFLLIGGEGPAISSWIQESEKYPK 95
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
+ A F A+ +LEHRYYG+S P + K +N R + S QALAD A+ + +
Sbjct: 96 DWMKKAQTFGAICFMLEHRYYGESHPTDNMKT--ENLR---WLTSDQALADVANFISYAT 150
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
+YN + I G SY G L+ W RLKYPH++ G++ASSAP + + Y V
Sbjct: 151 TRYNLQGSRWITFGGSYAGLLSGWSRLKYPHLITGAVASSAP---FHIKVNFHEYLDSVF 207
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-----KNTTELKDGLD 284
++ + C I+ + ++ + G + +K C + K+ L +
Sbjct: 208 DSIKNENANCASEIENAVLQMRKLLRTEYGRREMKEKLSLCKKIKQAEGKDVQNLFATIA 267
Query: 285 TVYSEAAQYDTPSN-IPVKRICNAIENAPNCGDDILCKIAAGV------VEADSLEYDGN 337
++S QY+ P+ + +K +C + + G D + ++ + A L + +
Sbjct: 268 DMFSFIVQYNQPNTAMSMKNMCEKLTDL--SGGDPVTRLGVIIRWMLRFTSALCLNFRYS 325
Query: 338 NSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVS 397
+ +++ + S W++Q+C+E D ++L + CK+ +G
Sbjct: 326 DMIAELSDTKWTKSSTRQWQYQTCTEFGYFQTTDSLQHAFFGTYDLQFFFDICKDTFGKQ 385
Query: 398 PRPSWVL-------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
+ V YGG +K SNII NG DP+ R G V H +
Sbjct: 386 FTEATVHRGAFEKNVRYGGKQLK------KSNIILINGSVDPWHRLGLVNNPHPLSKAIL 439
Query: 451 QDLNL-----------GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+ + SHC D+ SDP+ L++ R+ + WI Q
Sbjct: 440 INGTILHKLLARTNISSSHCADMYPELTSDPETLIKARREITGHIASWILQ 490
>gi|426250769|ref|XP_004019106.1| PREDICTED: thymus-specific serine protease [Ovis aries]
Length = 516
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 144/532 (27%), Positives = 229/532 (43%), Gaps = 82/532 (15%)
Query: 7 SFQWL--LLMVIFVSTSFHANGLKLRPRLG----RIRRSRILE---QKDSNHGFETFFYN 57
S WL LL+V ++S A+ L+ RLG R + S L DS + +
Sbjct: 5 SVPWLGPLLLVSLWASSAPASLLR---RLGEHILRFQESSALGLGLGPDSVTLPKEGWLE 61
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +D F+ ++F QRY +N + W + P+ LG E + + G A
Sbjct: 62 QPLDPFNA--SDRRSFLQRYWVNDQHWTSQDGPVFLHLGGEGSLGPGSVMRGHPANLAPI 119
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATH 176
+ AL++ LEHR+YG S+P L A+ R + +S ALAD AS L + +N ++
Sbjct: 120 WGALVISLEHRFYGLSIP----AEGLDMAQLR-FLSSRHALADAASARLTLSRLFNVSST 174
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT- 235
+P I G SY G LA W RLK+PH+ S+ASSAP+ D ++ Y+ VVS+ +T
Sbjct: 175 SPWICFGGSYAGSLAAWARLKFPHLFFASIASSAPVRATLD---FSKYNDVVSRSLMNTA 231
Query: 236 ---SEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVYS 288
S EC +++E++ + A LS + C L ++ EL L +
Sbjct: 232 IGGSLECRAAASAAFAEVERRGRASRGARAALSVELGACGSLERAEDQAELLGALQALVG 291
Query: 289 EAAQYDTPSNIP--VKRICNAIENAP-NCGDDILCKIAA---GVVEADSLEYDGNNSRCY 342
A QYD + P V+++C + P NC + C A G+ A + G RC
Sbjct: 292 GAVQYDGQAGAPLSVRQLCRFLLGDPGNCRGN--CSGPAPYRGLRRAVQVVTHGLGQRCL 349
Query: 343 ------------INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNC 390
+ E + D W +Q+C+E + + + L ++ C
Sbjct: 350 SISRAETVAQLRVTELQVSSVGDRQWLYQTCTEFGYYVTCEVPGCPFSQLPALPSELELC 409
Query: 391 KEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
++ +G+S + +YYGG + ++F NG DP +H
Sbjct: 410 EQVFGLSTSSVAQAVAQTNSYYGGQT------PGATQVLFVNGDTDP---------WHV- 453
Query: 446 DLSFSQDLNL---------GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
LS +Q L SHCLD+ + SD L R+ + +Q W+
Sbjct: 454 -LSVTQPLGPSEPALLIPSASHCLDMAPERPSDSPGLRLARQKISQQLQTWL 504
>gi|221108130|ref|XP_002169972.1| PREDICTED: putative serine protease K12H4.7-like [Hydra
magnipapillata]
Length = 496
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 129/494 (26%), Positives = 210/494 (42%), Gaps = 64/494 (12%)
Query: 30 RPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS 89
RP+ G + ++ Q N+ E ++ Q ++HF +S T+ QRY +N +++ GG
Sbjct: 30 RPKGGFMGTPKVKNQFLLNN-IEPQWFTQKLNHFDDADDS--TWKQRYYVNDEYFDGG-- 84
Query: 90 PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHR 149
P+ +G E + G + A Q ALI+ LEHR+YG+S P + +
Sbjct: 85 PVFLMIGGEGSLSSLWVNVGAMVDYAKQHSALILGLEHRFYGESHPLSDM-----STENL 139
Query: 150 GYFNSAQALADYASILLHIKDKYNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLAS 208
Y +S QALAD A + KY+ IA G SY G LA W R KY H++ G++AS
Sbjct: 140 KYLSSEQALADLAHFRNEMALKYSLNDKNRWIAFGGSYPGALAAWLRYKYQHLIYGAIAS 199
Query: 209 SAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFK 268
SAPI Y + + Y V + +S C + + +++ + +GL LSK FK
Sbjct: 200 SAPI--YAQLN-FPQYLEVSTNSL--SSSRCRANVNAATKILESYLTTEEGLMKLSKDFK 254
Query: 269 TCTP----LKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAA 324
TC P L+N + + QY+ + + AI N D+LC I
Sbjct: 255 TCKPITNDLRNIQNFANNAANNFFGVIQYNKDN----REFEGAI--GTNITIDVLCGIMT 308
Query: 325 GV---------VEADSLEYDGNNSRCY----------INEDRTGDESDEG---WEWQSCS 362
V +SL + +C + E G + EG W +Q+C+
Sbjct: 309 DTQLGDPYNRYVAVNSLIMNTYQQKCLDVSYEDYVESMKETDWGSSAGEGGRQWLYQTCT 368
Query: 363 EMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLI 415
E D N + L ++K C + +G +S +W T YGG+
Sbjct: 369 EFGYYQTTDSNKQVFGNMFPLDFFLKQCVDIFGDKFNESSISQGINWSNTNYGGY----- 423
Query: 416 LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQ 475
+ + I+F NG DP+ + K F +N +HC ++ + D L++
Sbjct: 424 -KMNAKRIVFPNGSIDPWHALSFTKNEKDMISVF---INGTAHCANMYPSSPDDSAELIK 479
Query: 476 QRKTEVKIMQGWIT 489
R+ ++ W+
Sbjct: 480 ARQFIGDLITKWLV 493
>gi|162287351|ref|NP_001069798.2| thymus-specific serine protease precursor [Bos taurus]
gi|296474038|tpg|DAA16153.1| TPA: protease, serine, 16 [Bos taurus]
Length = 516
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 126/475 (26%), Positives = 205/475 (43%), Gaps = 70/475 (14%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W + P+ LG E + + G
Sbjct: 59 WLEQPLDPFNA--SDRRSFLQRYWVNDQHWTSQDGPVFLHLGGEGSLGPGSVMRGHPANL 116
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P L A+ R + +S ALAD AS L + +N
Sbjct: 117 APIWGALVISLEHRFYGLSIP----AEGLDMAQLR-FLSSRHALADAASARLTLSRLFNV 171
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LA W RLK+PH+ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 172 SSTSPWICFGGSYAGSLAAWARLKFPHLFFASIASSAPVRAILD---FSKYNDVVSRSLM 228
Query: 234 DT----SEECFQTIKKSWSEIDNIASKPDGL-AILSKKFKTCTPL---KNTTELKDGLDT 285
+T S EC +++E++ G A LS + C L ++ EL L
Sbjct: 229 NTAIGGSLECRAAASAAFAEVERRLRASRGAWATLSVELGACGSLERAEDQAELLGALQA 288
Query: 286 VYSEAAQYDTPSNIP--VKRICN-AIENAPNCGDDILCKIAA---GVVEADSLEYDGNNS 339
+ A QYD + P V+++C + + NC + C +A G+ A +
Sbjct: 289 LVGGAVQYDGQAGAPLSVRQLCRLLLGDRGNCKGN--CSGSAPYRGLRRAVQVVTHSLGQ 346
Query: 340 RCY------------INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYI 387
RC + E + D W +Q+C+E + + + L +
Sbjct: 347 RCLSIPRAETVAQLRVTELQVSSVGDRQWLYQTCTEFGYYVTCEVPGCPFSQLPALPSEL 406
Query: 388 KNCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
+ C++ +G+S + +YYGG + ++F NG DP +
Sbjct: 407 ELCEQVFGLSTSSVAQAVAQTNSYYGGQT------PGATQVLFINGDTDP---------W 451
Query: 443 HFFDLSFSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
H LS +Q L SHCLD+ + SD L R+ + +Q W+
Sbjct: 452 HV--LSVTQPLGSSEPALLIPSASHCLDMAPERPSDSPSLRLARQKVSQQLQTWL 504
>gi|328776766|ref|XP_395356.3| PREDICTED: putative serine protease K12H4.7-like [Apis mellifera]
Length = 494
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 119/470 (25%), Positives = 210/470 (44%), Gaps = 62/470 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
++ Q +DHF P + + QRY IN +++ G P+ + E+ + G E
Sbjct: 49 WFTQFLDHFD--PTDARVWQQRYFINGEYYKKG-GPVFLMISGESTATAKWMVKGQWIEY 105
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A QF AL +EHR+YGKS P + ++KN + Y +S QALAD A + + Y
Sbjct: 106 AKQFGALCFQVEHRFYGKSHP--TSDLSVKNLK---YLSSQQALADLAYFIEIMNIDYKL 160
Query: 175 TH-APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ IA G SY G LA W R KYPH++ G++++S P+L D + Y+ +V +
Sbjct: 161 SNDTKWIAFGGSYAGSLAAWLRSKYPHLLHGAVSASGPLLAEID---FQEYYIIVENALK 217
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-------KNTTELKDGLDTV 286
SE C TI ++ + + P G + KKF C P+ + + L + L +
Sbjct: 218 QYSEACVNTIVEANKQFHIMLRHPIGQQGIVKKFVLCDPIDSGYTKRNDISNLYETLASN 277
Query: 287 YSEAAQYD-------TPSNIPVKRICNAIEN-APNCGDDILCKIAAGVVEADS---LEYD 335
++ QY+ +N+ ++ C+ + N + D L ++ ++ A L+Y
Sbjct: 278 FAGIVQYNKDNRNNSAMANLTIESACDILTNDSLGIAIDRLAILSTKILNASKEKCLDYT 337
Query: 336 GNNSRCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQP----EPWNLTKYIK 388
N + E EG W +Q+C+E G + S +P E + + +++
Sbjct: 338 YNKMIHKLRNVTWASEEAEGGRQWMYQTCTE----FGFFQTSTARPKLFSETFPIDFFVQ 393
Query: 389 NCKEQYGVSPRPSWVLTYYGGHDIKLI---LRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
C + +G PR + L + ++ L +N++F +G DP +H
Sbjct: 394 QCIDVFG--PRYNIHLLNSAINRTNILYGALNLQVTNVVFIHGSIDP---------WHVL 442
Query: 446 DLSFSQD-------LNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L+ S + +N +HC ++ + K DP L R ++ W+
Sbjct: 443 GLTKSSNPQMPVIYINGTAHCANMYPSSKDDPPQLKTARVKIENLISQWL 492
>gi|166240237|ref|XP_635876.2| peptidase S28 family protein [Dictyostelium discoideum AX4]
gi|165988498|gb|EAL62376.2| peptidase S28 family protein [Dictyostelium discoideum AX4]
Length = 476
Score = 134 bits (337), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 125/475 (26%), Positives = 211/475 (44%), Gaps = 68/475 (14%)
Query: 46 DSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS-PILAFLGAEAPI--- 101
D + F ++ QT+DHF++ ++ QTF Q+Y +N +++ N PI+ ++ E P+
Sbjct: 34 DGSGSFPAQWFTQTLDHFNF--QNNQTFQQKYYVNDQYYNYKNGGPIILYINGEGPVSSP 91
Query: 102 ----DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQA 157
DD + + A +IV LEHR+YG+S PF + ++N + Y + QA
Sbjct: 92 PYSSDDGVVIY------AQALNCMIVTLEHRFYGESSPF--SELTIENLQ---YLSHQQA 140
Query: 158 LADYASILLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYG 216
L D A+ ++ + K A H ++ IG SY G L+ WFR+KYPH+ +GS+ASS +
Sbjct: 141 LEDLATFVVDFQSKLVGAGH--IVTIGGSYSGALSAWFRIKYPHITVGSIASSGVVHSIL 198
Query: 217 DITPWTTYHSVVSKDFRDTSEECFQTIKK--SWSEIDNIASKPDGLAILSKKFKTCTPLK 274
D T + Y S EC + ++ S +E + A I ++ K +
Sbjct: 199 DFTAFDAYVSYA------VGPECTKALQAVTSAAEDEYFAG-----GIREQQMKQILQAE 247
Query: 275 NTTELKD---GLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAA------- 324
+ ++ D L E QY + +C+ + +A N G + I
Sbjct: 248 SLVDIGDFFYWLADSMMEGDQYGY-----IDELCSPLVDAINSGTSGIDLITVYSNYTIN 302
Query: 325 --GVVEADSLEYDGNNSRCYINEDRTGDES---DEGWEWQSCSEMVVPMGKDKNSMYQPE 379
G V EY ++ T D S D W +Q+CS + + +
Sbjct: 303 TWGKVLGTPDEY-----STAWQQNVTYDPSKSADRAWWYQTCSSLGWMQAAPSENSIRSS 357
Query: 380 PWNLTKYIKNCKEQYGVS---PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRG 436
N+T + +C++ +G + P + V T YGG +L + +NI+F+NG DP+S+
Sbjct: 358 LVNMTYFQTHCQQLFGQAIWPPNVNAVNTQYGGDQSNPLLNAAGTNILFTNGHADPWSQA 417
Query: 437 GWVKT-YHFFDLSFSQDLNLGSHCLDLDEAKKSD--PDWLVQQRKTEVKIMQGWI 488
V + Y + S HC+DL P+ L Q R +K + W+
Sbjct: 418 SIVNSNYPNVEPSAMTTCRKCGHCVDLRGCPGGCDLPNNLDQVRSLSLKSIAQWL 472
>gi|170042651|ref|XP_001849031.1| thymus-specific serine protease [Culex quinquefasciatus]
gi|167866158|gb|EDS29541.1| thymus-specific serine protease [Culex quinquefasciatus]
Length = 499
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 122/460 (26%), Positives = 204/460 (44%), Gaps = 75/460 (16%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS-PILAFLGAEAPIDDNIQLSGF 110
+T +++Q +DH P + T+ QRY +N ++ S P+ +G E G
Sbjct: 49 KTLWFDQLLDHND--PTNAATWKQRYYVNDAYFDDRTSGPVFLMIGGEGEATARWMNEGA 106
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
A + AL LEHR+YGKS P G A + GY S QALAD A + + +
Sbjct: 107 WIRYAKEHGALCFQLEHRFYGKSHPTGDLSTA-----NLGYLTSEQALADLAYFVEAMNE 161
Query: 171 KYNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
KY T IA G SY G LA W R KYP++V GS++SS P+L D + Y VV
Sbjct: 162 KYQLTQQNRWIAFGGSYPGSLAAWLREKYPYLVHGSVSSSGPLLAKID---FKEYFQVVV 218
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK----NTTELKDGLDT 285
+ S +C + ++ + +++ + G +++KFK C P++ N+ ++ +
Sbjct: 219 NSLQSYSPQCVEDVRNAMGQVETLLKHMIGQRSINEKFKLCDPVEKSISNSLDIASLFEA 278
Query: 286 V---YSEAAQY---DTP-SNIPVKRICNAIEN----APNCGDDILCKIAAGVVEADSLEY 334
V ++ QY ++P + I + +IC+ + N AP + ++AA V +
Sbjct: 279 VAGNFAGVVQYNKDNSPHAKITIDQICDVMANQSLGAP------VSRLAA--VNEMVMTQ 330
Query: 335 DGNNSRCYINEDRTGDESDEGWE-----------WQSCSEMVVPMGKDKNSMYQPEPWNL 383
DG Y+ + + WE +Q+C+E DK + + + +
Sbjct: 331 DGVKCLDYVYDKTVKQMQNTSWESDVANGARQWTYQTCNEFGFYQTSDKPDLVFGDRFPV 390
Query: 384 TKYIKNCKEQYG------VSPRPSW-VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRG 436
+++ C + YG R + T YG L ST+N+++ +G DP+ R
Sbjct: 391 EFFVRQCTDIYGKKFDDRTLDRAVYRTNTNYGA------LNPSTTNVLYVHGSIDPWHRL 444
Query: 437 GWVKTYHFFDLSFSQDLNL-------GSHCLDLDEAKKSD 469
G L+ S D+N +HC ++ E K D
Sbjct: 445 G---------LTESNDINTPTIFIEGTAHCANMYEPKDDD 475
>gi|8574032|emb|CAB94769.1| protease, serine, 16 (thymus) [Homo sapiens]
gi|119623494|gb|EAX03089.1| protease, serine, 16 (thymus), isoform CRA_b [Homo sapiens]
Length = 541
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 125/491 (25%), Positives = 202/491 (41%), Gaps = 79/491 (16%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W G + PI LG E + + G
Sbjct: 61 WLEQLLDPFNV--SDRRSFLQRYWVNDQHWVGQDGPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAA-LKNARHRGYFNSAQA---------------L 158
A + AL++ LEHR+YG S+P G + A L+ R + L
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMAQLRFLSSRLAMGKSSGIPSDEDRPSPPFDPRL 178
Query: 159 ADYASILLHIKDKYN-ATHAPVIAIGASYGGELATWFRLK------YPHVVIGSLASSAP 211
AD S L + +N ++ +P I G SY G LA W RLK +PH++ S+ASSAP
Sbjct: 179 ADVVSARLALSRLFNISSSSPWICFGGSYAGSLAAWARLKVLRLLRFPHLIFASVASSAP 238
Query: 212 ILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKKSWSEID-NIASKPDGLAILSKK 266
+ D ++ Y+ VVS+ T S EC + +++E++ + S A L +
Sbjct: 239 VRAVLD---FSEYNDVVSRSLMSTAIGGSLECRAAVSVAFAEVERRLRSGGAAQAALRTE 295
Query: 267 FKTCTPL---KNTTELKDGLDTVYSEAAQYDTPSNIP--VKRIC--------NAIENAPN 313
C PL +N EL L + QYD + P V+++C N + P
Sbjct: 296 LSACGPLGRAENQAELLGALQALVGGVVQYDGQTGAPLSVRQLCGLLLGGGGNRSHSTPY 355
Query: 314 CGDDILCKIAAGVVEADSLEYDGNNSRCYI--NEDRTGDESDEGWEWQSCSEMVVPMGKD 371
CG +I + L + + + E + D W +Q+C+E + +
Sbjct: 356 CGLRRAVQIVLHSLGQKCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCE 415
Query: 372 KNSMYQPEPWNLTKYIKNCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFS 426
+ L + C++ +G+S + +YYGG + ++F
Sbjct: 416 NPRCPFSQLPALPSQLDLCEQVFGLSALSVAQAVAQTNSYYGGQT------PGANKVLFV 469
Query: 427 NGMRDPFSRGGWVKTYHFFDLSFSQDLN---------LGSHCLDLDEAKKSDPDWLVQQR 477
NG DP +H LS +Q L GSHCLD+ + SD L R
Sbjct: 470 NGDTDP---------WHV--LSVTQALGSSESTLLIRTGSHCLDMAPERPSDSPSLRLGR 518
Query: 478 KTEVKIMQGWI 488
+ + +Q W+
Sbjct: 519 QNIFQQLQTWL 529
>gi|183230234|ref|XP_656762.2| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
gi|169802986|gb|EAL51377.2| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
Length = 480
Score = 134 bits (336), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 130/504 (25%), Positives = 215/504 (42%), Gaps = 68/504 (13%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
+LL+ + V S+ L + + +I +S + + + E+ Y +DHF+ ++
Sbjct: 1 MLLLALCVICSYAKLSLN-QQVMQQISQSFMTLELNE---VESMTYTVPLDHFNANNQN- 55
Query: 71 QTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEH 127
F +Y +N KF N+P+ LG E P + + + ++ A + K L++ +EH
Sbjct: 56 -DFDIQYFVNKKFLDANDPNAPLFVLLGGEGPASPKVLQNNYVIDSLAKKHKGLMLSVEH 114
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYG 187
R+YG S P + Y + QAL DY ++ H++++ N PVI +G SY
Sbjct: 115 RFYGASTPSLEMDKLI-------YCTAEQALMDYVEVISHVQEENNLVGHPVIVLGGSYS 167
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSW 247
G LA W R KYP+VV G+ ASSAP+ + + Y VV + + + W
Sbjct: 168 GNLAAWMRQKYPNVVEGAWASSAPV---EAVVDFYQYLEVVQNALPKNTADLLSFAFEQW 224
Query: 248 SEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDT----PSNIPV 301
D + + +G L K F TCT K+ + + T S QY++ PS
Sbjct: 225 ---DKMTTTEEGRKELGKIFNTCTEFGEKDIQTFAESIGTALSGYVQYNSSNWKPSYEST 281
Query: 302 KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDG---------------NNSRCYI--N 344
IC I +DI+ K + E + E+ N+ Y N
Sbjct: 282 DSICAEI------NEDIVNKYPLFIKEKYNPEWADKECTPSSQEESYKTLQNTSTYAEGN 335
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN-LTKYIKNCKEQYGVSPRPSW- 402
ED +G W +Q+C + S + N L I CK+ YG+ +
Sbjct: 336 EDASG----RSWFFQTCIAYGYYQAVSEQSSVKWGKLNQLQGSIDMCKDIYGIDKDTLYN 391
Query: 403 ----VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
+ YGG + +N+ F+NG DP+ G ++ H + + Q ++ SH
Sbjct: 392 AVDHINVRYGGK------KPCVTNVAFTNGNTDPWHALGVTESDH-QEGNLVQLIDRTSH 444
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVK 482
C DL K++D L + R E+K
Sbjct: 445 CSDLYSEKENDVPELKKARHNELK 468
>gi|358253095|dbj|GAA51984.1| dipeptidyl peptidase 2, partial [Clonorchis sinensis]
Length = 593
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 75/223 (33%), Positives = 122/223 (54%), Gaps = 13/223 (5%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
++ Q +DH ++ P + T+ RY+ K++ G PI + G E I SGF +
Sbjct: 311 YFTQRVDHMNFQPANI-TYRMRYLYEDKWYKSG-GPIFFYCGNEGDIFGFWNNSGFIFHL 368
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY---FNSAQALADYASILLHIKDK 171
A + A++V EHRYYGKS+PF KN+ + Y + Q LADYA+++ H+K+K
Sbjct: 369 ASKMDAMVVFAEHRYYGKSLPF-------KNSFSQPYIQFLSIEQTLADYANLIQHLKEK 421
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
Y + VIA G SYGG LA + R YPH+V G++ASSAP+ + + + V+ D
Sbjct: 422 YGRDNTAVIAFGGSYGGMLAAYMRASYPHLVAGAIASSAPVNWVAGLGNIHQFFEHVTDD 481
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKP-DGLAILSKKFKTCTPL 273
+ + +C +K ++ ++ + + LA +SK+ K C P+
Sbjct: 482 YNQVNPQCVARVKNAYDLLERMVMEDIRALASISKQMKLCKPM 524
>gi|344292450|ref|XP_003417940.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 484
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 122/479 (25%), Positives = 199/479 (41%), Gaps = 80/479 (16%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+ Q +DHFS S + +PQRY N F+ G L G E + I + A
Sbjct: 42 FQQKLDHFS--KNSTELWPQRYFFNDAFYKPGGPVFLLIGGFETVCESWISTNNTWVSYA 99
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT 175
+ AL ++LEHR+YG S P G A + Y +S QALAD + + + +K T
Sbjct: 100 ERLGALFLLLEHRFYGHSQPKGDLSTASLH-----YLSSRQALADIVNFRIKVAEKVGLT 154
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
+A G SYGG LA W R+K+P + ++ SSAPI + + Y VV +
Sbjct: 155 KNKWVAFGCSYGGSLAVWSRIKHPDLFAAAVGSSAPIKAKAN---FYEYLEVVQRSLATH 211
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDT 295
+ +CFQ +++++ ++ + S L K F C P+K +E+ + A +
Sbjct: 212 NSKCFQAVREAFGQVVKMLSLRKYHRQLRKDFTLCKPMKRFSEM---------DKAFFIE 262
Query: 296 PSNIPVKRICNAIENAPN-------CGDDILCKIAAGV---------VEADSLEYDGNNS 339
P K +N + G D LC + V + +
Sbjct: 263 HLVFPFKSAVQDNDNESDYEGIQVSFGMDELCDMMTNTSLGSPYYRFVRLLHVLFKDEYL 322
Query: 340 RCYINE-------------DRTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTK 385
RC+ + + + +Q C+E D KN ++ P L+
Sbjct: 323 RCFPAQYEKKLEVYLDSSINHHNPTKARQYFYQCCTEFGFFHTTDSKNQLFTGLP--LSY 380
Query: 386 YIKNCKEQYGVSPRPSW------VLT---YYGGHDIKLILRRSTSNIIFSNGMRDPFSRG 436
+++ C + +G P ++ V++ YYGG + + S IIFSNG DP+
Sbjct: 381 FVQQCSDFFG--PEFNYDSLNMGVMSTNAYYGGFKV------TGSKIIFSNGSFDPWHPL 432
Query: 437 GWVKTYHFFDLSFSQDLNL-----GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
G K S+DL HC D+ + K +D L+Q R+ +I+Q W+ +
Sbjct: 433 GITK-------DISKDLPAVFIKGAVHCADMYQQKDTDSAELIQAREKIFQILQQWLKE 484
>gi|193662244|ref|XP_001949662.1| PREDICTED: putative serine protease K12H4.7-like [Acyrthosiphon
pisum]
Length = 509
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 123/468 (26%), Positives = 204/468 (43%), Gaps = 60/468 (12%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF+ P +T+ QRY +N K++ + P+ +G E PI SG + A +
Sbjct: 49 QKLDHFN--PTDNRTWKQRYQVNQKYYKK-DGPVFLMIGGEGPISAKWMYSGAWIDYAKE 105
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATH 176
F AL + LEHRYYGKS P + + KN Y +S QAL D A +++I+ Y+ T
Sbjct: 106 FNALCLQLEHRYYGKSHP--TEDMSTKNLV---YLSSEQALTDLAEFIVNIRTNYDIPTT 160
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
A +A G SY G LA W R+K+PH+V +++SS P+L D + Y VV S
Sbjct: 161 AKWVAFGGSYPGSLAAWLRMKFPHLVYAAVSSSGPLLAKID---FKEYFKVVENALATYS 217
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-----LKDGLDTVYSEAA 291
+C IK++ ID+ G ++ KFK C PL T+ L + L +++
Sbjct: 218 PDCVSQIKEANQMIDSQIKTIKGAKLIENKFKLCDPLDINTKNDVANLFETLAGNFADIV 277
Query: 292 QYDTP---------SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC- 341
QY+ S + ++ +C+ + N L + A + +S NN C
Sbjct: 278 QYNKDNRFYENFERSLVTLETLCDVMVNKSKTTP--LDRYA----DVNSKLLSINNLTCT 331
Query: 342 ----------YINEDRTGDESDEG--WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKN 389
Y+N D + G W +Q+C+E + + + +I
Sbjct: 332 QHVYTKMIDSYLNTSWNSDSAAGGRQWTYQTCTEFGFYQTSSQEDHAFGDKFPAKFFIDM 391
Query: 390 CKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
C + +G +S YG +IK + +I+ +G DP+ G T
Sbjct: 392 CSDIFGKLYNLDLLSNAIKRSNMMYGELNIK------ENRVIYVHGSVDPWHALGITHTK 445
Query: 443 HFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+++ + +HC ++ +D L R+T + W+T+
Sbjct: 446 TKNNVAIY--IEGTAHCANMYPPSPTDLPQLKNARETIRAFLSEWLTE 491
>gi|356559327|ref|XP_003547951.1| PREDICTED: probable serine protease EDA2-like [Glycine max]
Length = 490
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 133/467 (28%), Positives = 203/467 (43%), Gaps = 70/467 (14%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+NQT+DHFS P + F QRY ++ + PI +G E P N + + A
Sbjct: 54 FNQTLDHFS--PYDHHQFRQRYFEFLDYFRIPDGPIFLVIGGEGPC--NGITNDYIGVLA 109
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA- 174
+F A +V LEHRYYGKS PF S + + Y +S QAL D A + +D NA
Sbjct: 110 KKFGAAMVTLEHRYYGKSSPFNSLET-----ENLKYLSSKQALFDLAVFRQYYQDSLNAK 164
Query: 175 -----THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
T P G SY G L+ WFRLK+PH+ GSLASSA +L + +T Y +
Sbjct: 165 LNRTKTENPWFVFGGSYAGALSAWFRLKFPHLTCGSLASSAVVL---AVYNFTEYDQQIG 221
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY-- 287
+ EC ++++ I++ LA K+ K N +L+ D +Y
Sbjct: 222 E---SAGAECKAVLQETTQLIEH------KLATNGKELKASF---NADDLEKDGDFMYLI 269
Query: 288 ----SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYI 343
+ A QY P ++C + A N G+D++ A V E + G N + Y
Sbjct: 270 ADAAAVAFQYGNPD-----KVCKPMVEAKNAGEDLVDAYAKYVKEYYIGTF-GVNVQTYD 323
Query: 344 NE--DRTG---DESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GV 396
E +T D S W +Q C+E+ N + + ++ CK + G+
Sbjct: 324 QEYLKKTAINEDSSTRLWWFQVCTEVAFFQVAPSNDSIRSSEIDAKYHMDLCKNIFGEGI 383
Query: 397 SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDL-SFSQDLNL 455
P YYGG I + S I+F+NG +DP+ K DL S++ +
Sbjct: 384 FPDVDATNLYYGGTKI------AGSKIVFANGSQDPWRHAS--KQTSSPDLPSYTITCSN 435
Query: 456 GSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+HC D +E S PD + + R+ + M W+++
Sbjct: 436 CAHCTDFRGCPQSPLVLEGNEKNCSSPDAVHKVRQQITEHMDLWLSE 482
>gi|290998882|ref|XP_002682009.1| peptidase S28 [Naegleria gruberi]
gi|284095635|gb|EFC49265.1| peptidase S28 [Naegleria gruberi]
Length = 434
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 120/468 (25%), Positives = 193/468 (41%), Gaps = 74/468 (15%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY-ENAH 116
Q +DHF P++ +TF QR+ +N W G N + +G E P F E
Sbjct: 3 QRLDHFD--PQNTETFQQRFWVNDTMWQGKN--VFIIIGGEGPASSKYLTGHFVINEYGK 58
Query: 117 QFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATH 176
+ AL+ LEHR+YG+SVP + A N R Y S QAL D + KY
Sbjct: 59 KHGALLAALEHRFYGESVP--RKSLATDNLR---YLTSEQALQDLVEFRSLLVKKYRMDE 113
Query: 177 APV--IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
A V + G SY G L+ W + KYPH+ +G++ASS P+ + + Y V+
Sbjct: 114 ANVKFVCFGGSYSGNLSAWLKAKYPHLFVGAIASSGPVEAKLE---FNEYMMTVANSI-- 168
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNT---------TELKDGLDT 285
+C ++K+ I+ + + P G ++ F C P T + L DG+
Sbjct: 169 -GPKCTDRVRKANDLIEQLIATPAGRQRVASMFNVCNPESMTNNDDIALLFSSLSDGV-- 225
Query: 286 VYSEAAQYDTPSN----IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC 341
E QY+ +N V +C IE++ DD L A V +S
Sbjct: 226 --CEVVQYNLDNNGAQGFNVTSMCAIIESS----DDALEGFANWVKTWNSYSQSSCTQNL 279
Query: 342 Y--------------INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN----L 383
Y NE+ G W +Q+C E G +N++ +P++ L
Sbjct: 280 YSDFIKQMQDVRPWPANENAAG----RSWTYQTCVE----FGYYQNAVGPKQPFSPRITL 331
Query: 384 TKYIKNCKEQYGVS---PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
+++ C + +G++ P + YG +I+ T+N +FS G DP+S +
Sbjct: 332 EWFVQQCSQIFGINGMKPDIDFTNNMYGSKNIQ------TTNTVFSTGSVDPWSVLAVAQ 385
Query: 441 TYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
++ + +HC DL + D L R K++ W+
Sbjct: 386 PTRNIAQNYVYHMQGTAHCADLYSSSPKDLPTLTNTRVQTQKLLDQWL 433
>gi|340715980|ref|XP_003396483.1| PREDICTED: putative serine protease K12H4.7-like [Bombus
terrestris]
Length = 493
Score = 132 bits (333), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 120/475 (25%), Positives = 209/475 (44%), Gaps = 73/475 (15%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
++ Q +DHF+ P + + QRY +N +++ G P+ + EA + + G E
Sbjct: 49 WFTQFLDHFN--PTDARVWQQRYFVNGEYYKKG-GPVFLMISGEAAANAKWMVEGQWIEY 105
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A QF AL +EHR+YG+S P + +KN Y +S QALAD A + + Y
Sbjct: 106 AKQFGALCFQVEHRFYGQSHP--TSDLGVKNLM---YLSSQQALADLAYFIQSMNINYKL 160
Query: 175 THAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
IA G SY G LA W R KYPH+V G++++S P+L D + Y VV +
Sbjct: 161 PAGTKWIAFGGSYAGSLAAWLRYKYPHLVHGAVSASGPLLAEID---FQEYFVVVENALK 217
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE------LKDGLDTVY 287
+ SE C I ++ + + P G ++KKF C P+ T+ L + + +++
Sbjct: 218 EYSEACVNAILEANKQFHIMLHHPIGQQGIAKKFILCDPINEHTKRNDISNLYETIASIF 277
Query: 288 SEAAQYD-------TPSNIPVKRICNAIENAP-NCGDDILCKIAAGVVEADS---LEYDG 336
+ QY+ +N + C+ + N + L ++ +++A L+Y
Sbjct: 278 AGIVQYNKDNRNNSAMANFTIDSACDILTNETLGIAINRLAILSTKILQASGEKCLDYMY 337
Query: 337 NNSRCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQP----EPWNLTKYIKN 389
+ + E EG W +Q+C+E G + S +P E + + +++
Sbjct: 338 DKMIHKLRNITWASEEAEGGRQWTYQTCTE----FGFFQTSTARPKLFSETFPVDFFVQQ 393
Query: 390 CKEQYGVSPRPSWVL---------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
C + +G PR + L YGG D+K T+N++F +G DP
Sbjct: 394 CVDIFG--PRYNIHLLNSAVNRTNILYGGLDLK------TTNVVFVHGSIDP-------- 437
Query: 441 TYHFFDLSFSQDLNL-------GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+H ++ S + + +HC ++ K+DP L R ++ W+
Sbjct: 438 -WHVLGITKSPNPQMPVIYIDGTAHCANMYPPSKNDPLQLKTARVEVGHLIDEWL 491
>gi|383860201|ref|XP_003705579.1| PREDICTED: putative serine protease K12H4.7-like [Megachile
rotundata]
Length = 494
Score = 132 bits (332), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 125/472 (26%), Positives = 215/472 (45%), Gaps = 66/472 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
++ Q +DHF+ P + QRY +N ++ G P+ +G E + + G E
Sbjct: 49 WFPQFLDHFN--PTDAHVWQQRYFVNGDYYKVG-GPVFLMIGGEGAANAKWMVEGQWIEY 105
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A +F AL +EHR+YGKS P + ++KN Y +S QALAD A + + Y
Sbjct: 106 AKEFGALCFQVEHRFYGKSHP--TSDLSVKNLM---YLSSEQALADLAYFVQTVNTMYKL 160
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
+ IA G SY G LA W R KYPH+V G++++S P+L D + Y+ VV +
Sbjct: 161 PNNTKWIAFGGSYAGSLAAWLREKYPHLVHGAVSASGPLLAQID---FQEYYVVVENALK 217
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-------KNTTELKDGLDTV 286
+ SE C I ++ ++ + G L+KKF C P+ K+ L + + +
Sbjct: 218 EYSEACVNAIVEANTQFHIMLRHRIGQQGLAKKFILCNPIDPGYTKTKDIANLYETIASD 277
Query: 287 YSEAAQYD-------TPSNIPVKRICNAIEN-APNCGDDILCKIAAGVVEADS---LEYD 335
++ QY+ +N +K C+ + N + D L I+ ++ A + L+Y
Sbjct: 278 FAGIVQYNKNNRNNSAMANFTIKSACDILTNESLGLAIDRLALISNKILNATNKKCLDYV 337
Query: 336 GNNSRCYINEDRT---GDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWN----LTK 385
N INE R E EG W +Q+C+E G + S +P ++ +
Sbjct: 338 YNK---MINELRNISWASEDAEGGRQWMYQTCTE----FGFFQTSTARPNLFSNSFPVDF 390
Query: 386 YIKNCKEQYGVSPRPSWVL---------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRG 436
+++ C + +G PR + L +YG ++K+ +N++F +G DP+
Sbjct: 391 FVQQCLDIFG--PRYNIQLLKSAVNRTNIFYGALNLKV------TNVVFVHGSVDPWHVL 442
Query: 437 GWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
G K+ + + + +N +HC ++ KSD L R ++ W+
Sbjct: 443 GITKSSN--PQAPAIYINGTAHCANMYPPSKSDVPELKVARAKIGHLISQWL 492
>gi|157120503|ref|XP_001653636.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108874911|gb|EAT39136.1| AAEL009038-PA, partial [Aedes aegypti]
Length = 495
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 122/473 (25%), Positives = 206/473 (43%), Gaps = 55/473 (11%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG--GNSPILAFLGAEAPIDDNIQLSG 109
+T +++Q +DH P + T+ QRY +N +++ N P+ +G E G
Sbjct: 47 QTLWFDQLLDH--NDPTNPTTWKQRYNVNDEYFKNEEDNGPVFLMIGGEGEATARWMTEG 104
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
A ++ AL LEHR+YGKS P + + KN Y S QALAD A + +K
Sbjct: 105 AWIRYAQEYGALCFQLEHRFYGKSHP--TEDLSTKNL---AYLTSEQALADLAYFIEAMK 159
Query: 170 DKYNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
KY + IA G SY G LA W R KYP++V GS++SS P+L D + Y VV
Sbjct: 160 QKYQLGRSNRWIAFGGSYPGSLAAWLREKYPYLVYGSISSSGPLLAKID---FKEYFEVV 216
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK----NTTELKDGLD 284
+ S EC ++ + S+++ + G L+ KFK C P++ N +L +
Sbjct: 217 ADSLNRYSPECVANVRSAMSQVETLLKHMIGQRNLNDKFKLCDPVEKSIANELDLSSLFE 276
Query: 285 TVYSEAAQYDTP-SNIPVKRICNAIEN----APNCGDDILCKIAAGVVEADSLEYDGNNS 339
+ S A P + + + +C+ + N AP ++ ++ +A L+Y + +
Sbjct: 277 AIASNFAGVVHPHARLTIDEVCDVMMNQTIGAPVSRLAVVNEMVMKQDKAKCLDYVYDKT 336
Query: 340 RCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG- 395
+ + G W +Q+C+E D + + + + C + YG
Sbjct: 337 IKQMQNTSWDSDVASGARQWIYQTCNEFGFYQTSDNAESVFGDRFPAEFFTRQCADVYGR 396
Query: 396 ------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
+S T YG L ST+N+++ +G DP+ R G L+
Sbjct: 397 RFDQKALSRAVYRTNTNYGA------LNPSTTNVLYVHGSIDPWHRLG---------LTE 441
Query: 450 SQDLNL-------GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDF 495
S D+ + +HC ++ E K+ D L R ++ G + + DD+
Sbjct: 442 SNDIQMPTIFIDGTAHCANMYEPKEDDFPQLKAAR-VQISSFIGRLLKMDDDY 493
>gi|344268143|ref|XP_003405922.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 465
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 124/469 (26%), Positives = 193/469 (41%), Gaps = 63/469 (13%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN- 114
+ Q +DHFS + + QRY IN+ F+ G P+ +G I N + +T+
Sbjct: 26 FQQKLDHFSENGSPF--WEQRYFINNTFYKPG-GPVFLMIGGWMTIGTNWVSTDYTWITY 82
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A + A + LEHR+YG+S P G L A R Y S Q LAD A I K
Sbjct: 83 AERLGAFCLALEHRFYGQSQPTGD----LSTASLR-YLRSKQVLADIAYFRTEIAKKMGL 137
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
+ G SYGG LA W R+KYP++ +++SSAP+ + + Y V
Sbjct: 138 IKNKWVVFGGSYGGSLAVWSRIKYPNLFAAAVSSSAPVKVKVN---FYEYFEGVHSALAT 194
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYD 294
+ EC + +K + ++ + P L F C LK + + T + E
Sbjct: 195 HNSECSKAVKDALGQVIKLLKFPKYYRRLKNDFMLCEHLKVNSVMD---STYFVENLLIF 251
Query: 295 TPSNIPVKRICNAIENAPNCGD--DILCKIAAG-----VVEADSLEYDGNNSRC----YI 343
S I + I++ PN D D + + G ++ + N C Y
Sbjct: 252 LASIIQRNKDNETIKSTPNIDDFCDKMTNTSLGSPYYRYARIMTIRFKNGNPSCLDANYQ 311
Query: 344 N---------EDRTGDESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
N D+ E D W +QSC+E+ +N + P L ++ C
Sbjct: 312 NFLKSMSDSDLDKENIEEDRQWFYQSCTELGFFQTTHSRNHTFSGLP--LRYFLSQCLGV 369
Query: 394 YG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
+G ++ + YYGG ++ + S IIFSNG DP++ G K
Sbjct: 370 FGSEFNYNSLTQSVQAINMYYGGFNV------NGSKIIFSNGSLDPWNALGITK------ 417
Query: 447 LSFSQDLNL-----GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
S+DL G HC D+ + K +D L+Q R+ I+Q W+ +
Sbjct: 418 -DISEDLPAVFIKGGEHCADMYQQKDTDSAELIQAREKIFHILQKWLKE 465
>gi|359476844|ref|XP_003631898.1| PREDICTED: probable serine protease EDA2-like [Vitis vinifera]
Length = 477
Score = 132 bits (331), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 137/486 (28%), Positives = 203/486 (41%), Gaps = 71/486 (14%)
Query: 39 SRILEQKDSNHGFET---FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFL 95
SR+L++ + F T ++NQT+DHFS P + FPQRY + ++ + PI +
Sbjct: 23 SRLLQRNSHSSNFLTTDELWFNQTVDHFS--PLDHSKFPQRYYEFTDYFRLPDGPIFLKI 80
Query: 96 GAEAPIDD--NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGS-RKAALKNARHRGYF 152
EA D N +S A +F A +V LEHRYYGKS PF S R LK Y
Sbjct: 81 CGEASCDGIPNDYISVL----AKKFGAAVVSLEHRYYGKSSPFRSLRTENLK------YL 130
Query: 153 NSAQALADYASIL------LHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSL 206
+S QAL D A L++K + P G SY G L+ WFRLK+PH+ GSL
Sbjct: 131 SSKQALFDLAVFRQYYQESLNVKVNRSNVENPWFVFGVSYAGALSAWFRLKFPHLTCGSL 190
Query: 207 ASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK---PDGLAIL 263
ASSA +L + T + R E K E++ + + DG A+
Sbjct: 191 ASSAVVLAIYNFTEFD----------RQIGESAGAECKAVLQEVNGLVEQRLAVDGNAV- 239
Query: 264 SKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA 323
K LK + L A QY P +C+ + A G+D+ A
Sbjct: 240 -KSLFGAASLKIDADFLYLLADAAVTAFQYGNP-----DMLCSPLVQAKKDGEDLENAYA 293
Query: 324 AGVVE----ADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPE 379
V E + + N + + GD SD W +Q C+E+ + +
Sbjct: 294 TYVKEYYLGTFGVSIETYNQQHLKRTNSAGDTSDRLWWFQVCTEVAFFQVAPSDDSMRSS 353
Query: 380 PWNLTKYIKNCKEQY--GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
N ++ CK + G+ P YYGG I + S IIF+NG +DP+
Sbjct: 354 EINTKYHLDLCKNVFGNGIYPDVDATNIYYGGTKI------AGSKIIFTNGSQDPWRHAS 407
Query: 438 WVKTYHFFDL-SFSQDLNLGSHCLDL------------DEAKKSDPDWLVQQRKTEVKIM 484
+ + D+ SF + H DL D S PD + + R+ ++ +
Sbjct: 408 KLTSTP--DMPSFLISCHNCGHGTDLRGCPQSPLSPEGDAKNCSSPDAVHKVRQQIIEHI 465
Query: 485 QGWITQ 490
W++Q
Sbjct: 466 DLWLSQ 471
>gi|74004232|ref|XP_545414.2| PREDICTED: thymus-specific serine protease [Canis lupus familiaris]
Length = 521
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 114/473 (24%), Positives = 204/473 (43%), Gaps = 61/473 (12%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W P+ LG E+ + L G
Sbjct: 59 WLEQPLDPFNA--SDTRSFLQRYWVNDQHWTSQRGPVFLHLGGESSLRSGSVLRGHPTAL 116
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG SVP G L A+ R + +S ALAD AS L + +N
Sbjct: 117 APAWGALVIGLEHRFYGLSVPAG----GLDVAQLR-FLSSRHALADVASARLALARLFNV 171
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT-------TYHS 226
++ +P I G SY G LA W RLK+PH++ S+ASSAP+ D + + H+
Sbjct: 172 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRATLDFSEYNEVRGSGRQVHA 231
Query: 227 VVSKDFRD---TSEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPL---KNTTEL 279
V+ R + +C +++E++ + + + L + C+PL ++ EL
Sbjct: 232 VLGGGLRPRGVSLPQCRAAASAAFAEVERRLHAGGATRSALRAELGACSPLDRAEDQAEL 291
Query: 280 KDGLDTVYSEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEA 329
L + A QYD + +P V+ +C N +AP G ++ +
Sbjct: 292 LGALQALVGGAVQYDAQAGVPLSVRGLCGLLLEGRSNRSRSAPYHGLRRAVQVVMHSLGQ 351
Query: 330 DSLEYDGNN--SRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYI 387
L + ++ + + + D W +Q+C+E + + + L +
Sbjct: 352 RCLSFSRAETVAQLKVTDSQVSGVGDRQWLYQTCTEFGFYITCEGPRCPFSQVPALPSQL 411
Query: 388 KNCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
+ C++ +G+S + +YYGG + ++F NG DP +
Sbjct: 412 ELCEQVFGLSASSVVQAVAQTNSYYGGQTP------GATQVLFINGDTDP---------W 456
Query: 443 HFFDLSFSQD-------LNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
H ++ + + SHC+D+ + SD L R++ + +Q W+
Sbjct: 457 HVLSITQASGPLESALLIPSASHCMDMAPDRPSDSPSLRLGRQSIFQQLQTWL 509
>gi|357603759|gb|EHJ63915.1| carboxypeptidase 3 [Danaus plexippus]
Length = 445
Score = 132 bits (331), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 97/330 (29%), Positives = 159/330 (48%), Gaps = 31/330 (9%)
Query: 46 DSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNI 105
+S+H + T Y+ IDHF P++ +T+ RY+ N +F+GG N PI +G E I
Sbjct: 41 ESSHFWRT--YDMPIDHFD--PQNRETYQMRYMYNEEFFGGNNYPIFIMVGGEWNIQPGW 96
Query: 106 QLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASIL 165
L+G Y A + + + EHRYYG+S+P+ + +N R + N QALAD A +
Sbjct: 97 LLAGNMYLMAQENRGYLFYTEHRYYGESLPYTTFTT--ENLR---FLNVDQALADLAYFI 151
Query: 166 LHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
IK + ++ V+ G SY G + W + +YPH+V+G +ASS PI DI Y
Sbjct: 152 SEIKKIPSFVNSKVVLYGGSYAGNMVLWLKQRYPHLVVGVVASSGPIKAQVDI---PGYL 208
Query: 226 SVVSKDF-RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG-- 282
VV F + +EC TIK+ ++ DG + + ++ C PL ++ L G
Sbjct: 209 EVVHNAFLSEGGQECVDTIKQGIADTIAAMETEDGRRSIQRIYRLCVPLDYSSRLSMGYF 268
Query: 283 ---LDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLE------ 333
+ +S + Q P ++ IC N G + +I + ++ S+
Sbjct: 269 SGYITWTFSTSVQTARPGSL--TAICQNFTNNV-YGSTPMEQIGGYIADSRSISNCLNVT 325
Query: 334 YDGNNSRCYINEDRTGDESDEGWEWQSCSE 363
YD N Y ++T + + W +Q+C+E
Sbjct: 326 YD-NYVASY---NKTVPSNGKAWYYQTCTE 351
>gi|301102560|ref|XP_002900367.1| serine protease family S28, putative [Phytophthora infestans T30-4]
gi|262102108|gb|EEY60160.1| serine protease family S28, putative [Phytophthora infestans T30-4]
Length = 526
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 98/327 (29%), Positives = 157/327 (48%), Gaps = 25/327 (7%)
Query: 52 ETFFYNQTIDHFSYGPESYQT-FPQRYVINSKFWGGGNSPILAFLGAEAPIDDN-IQLSG 109
E +F + +DHF+ P S ++ + QRY N +FWGG P+ ++G E P+ I
Sbjct: 64 EDYFTDAILDHFA--PVSKRSKWKQRYQANEEFWGGRGFPVFLYIGGEGPLGPKAITNRT 121
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
F Y A Q +AL++ LEHR+YGKS P + +L N Y +S QALAD A +
Sbjct: 122 FVYYLAEQHRALLLALEHRFYGKSYP--TEDMSLPNL---AYLSSEQALADLAHFHSFVT 176
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
DKY T +A G SY G LA W +LKYP + G++ASSAP+ D + Y VV
Sbjct: 177 DKYGLTDEKWVAFGGSYPGNLAAWVKLKYPALFAGTVASSAPVQAKTD---FFEYMEVVG 233
Query: 230 KDFRD-TSEECFQTIKKSWSEIDNIASK-PDGLAILSKKFKTCTPLKNTTE---LKDGLD 284
R EC+ ++K+ +++ ++ + G +++ FK C P+ N + + +
Sbjct: 234 DGLRYFGGGECYHEVEKAITQLGHLMDEGQKGRDKVAELFKPCYPMTNEFDDSVFESSVM 293
Query: 285 TVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI------LCKIAAGVVEADSLEYDGNN 338
+ + AQY+ V + E+ GD + + K G + N
Sbjct: 294 GAFQDIAQYNGIHE-GVMTLSEVCEHFAKPGDAVEKLASFINKTRVGDCLDSKFQGAANG 352
Query: 339 SRCYINEDR-TGDESDEGWEWQSCSEM 364
+ ++ D+ G S W +Q+C+E
Sbjct: 353 TVEVLSRDQFDGKSSARQWVYQTCNEF 379
>gi|443699452|gb|ELT98942.1| hypothetical protein CAPTEDRAFT_229193 [Capitella teleta]
Length = 459
Score = 131 bits (330), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 117/472 (24%), Positives = 206/472 (43%), Gaps = 98/472 (20%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
++ Q +DHF+ ++T+ QR+ IN + + G P+ +G E + + G +N
Sbjct: 49 WFTQKLDHFT--SSDHRTWSQRFFINDEHYKPGG-PVFLMIGGEGAANPEWMVQGQWVQN 105
Query: 115 -AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A QF AL V+LEHR+YGKS P ++ +++ R Y +S QALAD A+ ++I +
Sbjct: 106 YAPQFNALCVMLEHRFYGKSHP--TKDLKVESLR---YLSSEQALADLAAFRVNISESRG 160
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
A IA G SY G L+ WFR KYPH+V S++SSAP+L +F+
Sbjct: 161 LADAKWIAFGGSYPGALSAWFRYKYPHLVYASVSSSAPML--------------AQLNFK 206
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-----KNTTELKDGLDTVYS 288
P +A L K F+ C P+ K+ + L D + +
Sbjct: 207 G----------------------PKKVAGLEKYFRLCEPIDASDAKDVSNLHDTIAQSIA 244
Query: 289 EAAQYD---------TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEA---DSLEYDG 336
QY+ +NI ++ IC+ + + + K+ + +++ L+
Sbjct: 245 GVIQYNRDNRAFEGAVGTNITIETICSIMTTKGSKPFESYAKVNSLLLDTYKEKCLDVSY 304
Query: 337 NNSRCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
N + + E+ E+ EG W +Q+C+E D + + + L + C +
Sbjct: 305 NKTVQELREESWKSEASEGGRQWTYQTCTEFGFYQTSDLTTQPFGQHFPLKFSTEQCADV 364
Query: 394 YG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
YG + W + YGG++I + + ++F NG DP+
Sbjct: 365 YGTEFTQTSIQSAVDWTNSEYGGYNI------TVTRVVFVNGDIDPWHA----------- 407
Query: 447 LSFSQDLNLGS---------HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
L ++DLN S HC ++ +D L++ R++ K++ W++
Sbjct: 408 LGITRDLNAHSPAILIKGTAHCANMYPDAPNDLPQLIRARESVKKLLTLWLS 459
>gi|328875430|gb|EGG23794.1| hypothetical protein DFA_05930 [Dictyostelium fasciculatum]
Length = 550
Score = 131 bits (330), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 109/402 (27%), Positives = 186/402 (46%), Gaps = 36/402 (8%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN- 114
+ Q +DHF+ + QTF QRYVIN ++W G P+ + E P+ ++G + N
Sbjct: 119 FTQRLDHFNTINQ--QTFQQRYVINDQYWNG-KGPVFIMINGEGPMSL-ATVTGLQFVNW 174
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A Q ALI+ LEHRYYG S F + + N Y QALAD A+ + +N
Sbjct: 175 AQQSNALIISLEHRYYGAS--FATDDLSTDNL---AYLTPQQALADNAAFREFVAVTFNV 229
Query: 175 -THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
+ ++ G SY G L +WFR+KYP++V ++ASS P+ + Y VV
Sbjct: 230 PATSKWVSFGGSYSGCLTSWFRIKYPNLVDYTVASSGPV---NAEVNFYQYLEVVQNSLL 286
Query: 234 DTS--EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV---YS 288
+ ++C I ++ +I + S+P+GL +S F L++ ++ + + ++ +
Sbjct: 287 TATNGQQCVSNIAQATQKIQALLSQPNGLETVSDMFNLFPALESQNDVANFMQSLAGNFM 346
Query: 289 EAAQYDTPSNIP-VKRICNAIENAPNCGDDILCKIAAGVVEADSLE--YDGNNSRCYINE 345
QY+ P V+ +C + ++ N I + ++L+ YD S
Sbjct: 347 GVVQYNLEEVGPSVETLCQTMTDSSNDALTNYIAIWNQYAQGETLDVSYDTMISEL---T 403
Query: 346 DRTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVS--PRP 400
+ T D++ G W +Q+C++ D + + L I+ C + +G P
Sbjct: 404 NVTNDQNIVGGRQWFFQTCAQFGFYQTSDSPNQPFGNLFPLEFQIQQCSDVFGFDFLPNV 463
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
+W L +GG L TSN+I+ NG DP+ G ++
Sbjct: 464 NWTLLDFGG------LNPVTSNVIYVNGDIDPWHSLGITASF 499
>gi|256574599|dbj|BAH98105.1| serine protease [Entamoeba invadens]
gi|440300845|gb|ELP93292.1| hypothetical protein EIN_056770 [Entamoeba invadens IP1]
Length = 468
Score = 131 bits (329), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 134/513 (26%), Positives = 213/513 (41%), Gaps = 78/513 (15%)
Query: 12 LLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQ 71
+ + F+ +F N R+R S+ E + N G + +Q +DHF + +
Sbjct: 1 MQIFFFIVVAFSLN-------FQRVRFSK-EELEADNTGL---YMDQPLDHFDLT--NTK 47
Query: 72 TFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEHRYY 130
+Y +N ++ P+ LG E F + A ++K +++ +EHR+Y
Sbjct: 48 KISIQYFLNDTYFTP-EGPLFVDLGGEGAASAGAIGGKFVIDKYAQKYKGMMLAIEHRFY 106
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGEL 190
G+S+P G + + GY + QAL DY I+ IK K N PVI G SY G L
Sbjct: 107 GRSLPVGGL-----SQENLGYLSGIQALEDYIHIISEIK-KQNQITGPVIVFGGSYSGNL 160
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI 250
A W R KYP+VV ++ASSAP+L +T + V+ KD +C K++ + I
Sbjct: 161 AVWIRQKYPNVVYAAVASSAPLL---ATNQFTQFMDVIEKDM---GPQCAAAWKQANANI 214
Query: 251 DNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVK-----RIC 305
+ + DG+ + FKTC +KN + L + + Y +N K +C
Sbjct: 215 EQLYKTADGIKQIQTDFKTCKDIKNDKDFTLFLQEIQANFISYPQYNNKKEKGKKCEDVC 274
Query: 306 NAI--ENAPNCGDDILCKIAAGVVE--ADSLEYDGNNSRCYINEDRTGDESD-------- 353
N + E+ P G L + ++ YD + I +T E D
Sbjct: 275 NILTGEDTPYNGMKKLVEFMLNDMKLTCSPSSYD----QMLIEMAKTEGEFDVTKPNSFA 330
Query: 354 --EGWEWQSCSEMVVPMGKDKNSMYQP----EPW----NLTKYIKNCKEQYGVS-----P 398
W WQ CSE S +QP +P+ N Y NCK+ +GVS
Sbjct: 331 STRSWAWQICSEY---------SFFQPITETQPFDQRLNNDFYYANCKDIFGVSKEKLDK 381
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
+ YG + +N+ F++G DP+S +T + ++ + SH
Sbjct: 382 KIKHTNMMYGA------MSPRVTNVAFTSGSFDPWSPLAKHETQYNDVDCYASYIEGTSH 435
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
C DL + DP L QR + I +Y
Sbjct: 436 CADLYSETEEDPVQLNTQRTETAAFIDELIKRY 468
>gi|13634059|sp|P90893.2|YM9I_CAEEL RecName: Full=Putative serine protease F56F10.1; Flags: Precursor
gi|351062821|emb|CCD70865.1| Protein F56F10.1 [Caenorhabditis elegans]
Length = 540
Score = 131 bits (329), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 127/492 (25%), Positives = 207/492 (42%), Gaps = 68/492 (13%)
Query: 29 LRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN 88
L L R+ S +++ + + Q +DHF P + +T+ Q+Y N F N
Sbjct: 32 LNHHLDRLTASDGASIQETYPNLQVHNFTQKLDHFD--PYNTKTWNQKYFYNPVF-SRNN 88
Query: 89 SPILAFLGAEAP------IDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAA 142
S I +G E P + N+Q + A +F A + LEHR++G S P + +
Sbjct: 89 SIIFLMIGGEGPENGKWAANPNVQY----LQWAKEFGADVFDLEHRFFGDSWPIPDMQTS 144
Query: 143 LKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVV 202
Y + QALAD A + + +Y + + G SY G LA WFR KYP +
Sbjct: 145 -----SLRYLTTQQALADLAFFIEFMNQQYGFKNPRWVTFGGSYPGSLAAWFRQKYPQLT 199
Query: 203 IGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAI 262
+GS+ASSAP+ D + Y VV D R T +C Q K ++ ++ +A +G
Sbjct: 200 VGSVASSAPVNLKLD---FYEYAMVVEDDLRITDPKCAQATKDAFVQMQKLALTAEGRNS 256
Query: 263 LSKKFKTCTPL-KNTTELK-----DGLDTVYSEAAQY-------DTPSNIPVKRICNAIE 309
L+ F P NTT+L + Y QY T S+ V+++C+ +
Sbjct: 257 LNNHFNLQPPFDANTTKLDINNFFGNIFNTYQGMTQYTYDGQSNSTHSDKTVRKMCDIMT 316
Query: 310 NAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-------INEDRTGD--------ESDE 354
NA D++ ++ + + +E N I++ +GD +
Sbjct: 317 NATET--DVVMRVENLFLWFNQMEPASANLTVMPNSYWDVISQVGSGDLNVLGPDGAAAR 374
Query: 355 GWEWQSCSEM--VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL-------T 405
GW W C+E+ + + N P NL +I C + +G S + S ++
Sbjct: 375 GWMWLCCNEIGFLQTTNQGNNVFGTGVPLNL--FIDMCTDMFGDSMKMSQIMGGNKKSQN 432
Query: 406 YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEA 465
YYGG D + +N++ NG DP+ G T L +N +HC D+ +
Sbjct: 433 YYGGADF-----YNATNVVLPNGSLDPWHALGTYGTIKSQSL-LPYLINGTAHCGDMYPS 486
Query: 466 KKSDPDWLVQQR 477
+P L+ R
Sbjct: 487 YDGEPGSLLAAR 498
>gi|255645752|gb|ACU23369.1| unknown [Glycine max]
Length = 490
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 132/467 (28%), Positives = 202/467 (43%), Gaps = 70/467 (14%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+NQT+DHFS P + F QRY ++ + PI +G E P N + + A
Sbjct: 54 FNQTLDHFS--PYDHHQFRQRYFEFLDYFRIPDGPIFLVIGGEGPC--NGITNDYIGVLA 109
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA- 174
+F A +V LEHRYYGKS PF S + + Y +S QAL D A + +D NA
Sbjct: 110 KKFGAAMVTLEHRYYGKSSPFNSLET-----ENLKYLSSKQALFDLAVFRQYYQDSLNAK 164
Query: 175 -----THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
P G SY G L+ WFRLK+PH+ GSLASSA +L + +T Y +
Sbjct: 165 LNRTKIENPWFVFGGSYAGALSAWFRLKFPHLTCGSLASSAVVL---AVYNFTEYDQQIG 221
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY-- 287
+ EC ++++ I++ LA K+ K N +L+ D +Y
Sbjct: 222 E---SAGAECKAVLQETTQLIEH------KLATNGKELKASF---NADDLEKDGDFMYLI 269
Query: 288 ----SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYI 343
+ A QY P ++C + A N G+D++ A V E + G N + Y
Sbjct: 270 ADAAAVAFQYGNPD-----KVCKPMVEAKNAGEDLVDAYAKYVKEYYIGTF-GVNVQTYD 323
Query: 344 NE--DRTG---DESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GV 396
E +T D S W +Q C+E+ N + + ++ CK + G+
Sbjct: 324 QEYLKKTAINEDSSTRLWWFQVCTEVAFFQVAPSNDSIRSSEIDAKYHMDLCKNIFGEGI 383
Query: 397 SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDL-SFSQDLNL 455
P YYGG I + S I+F+NG +DP+ K DL S++ +
Sbjct: 384 FPDVDATNLYYGGTKI------AGSKIVFANGSQDPWRHAS--KQTSSPDLPSYTITCSN 435
Query: 456 GSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+HC D +E S PD + + R+ + M W+++
Sbjct: 436 CAHCTDFRGCPQSPLVLEGNEKNCSSPDAVHKVRQQITEHMDLWLSE 482
>gi|167390072|ref|XP_001739195.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897172|gb|EDR24409.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 480
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 127/502 (25%), Positives = 213/502 (42%), Gaps = 60/502 (11%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
+LL+ + V S+ L + + RI +S + + + E+ Y +DHF+ ++
Sbjct: 1 MLLLALCVICSYAKLSLN-QKVMQRISQSFMTLELNE---VESMTYTVPLDHFNANNQN- 55
Query: 71 QTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEH 127
F Y +N +F N+P+ LG E P + + + ++ A + K L++ +EH
Sbjct: 56 -DFDIHYFVNKEFLDTNDPNAPLFVLLGGEGPASPKVLQNNYVIDSLAKKHKGLMLSVEH 114
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYG 187
R+YG S P + Y + QAL DY ++ H++++ N PVI +G SY
Sbjct: 115 RFYGASTPSLEMDKLI-------YCTAEQALMDYVEVISHVQEENNLVGHPVIVLGGSYS 167
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSW 247
G LA W R KYP+VV G+ ASSAP+ + + Y VV + + + W
Sbjct: 168 GNLAAWMRQKYPNVVEGAWASSAPV---EAVVDFYQYLEVVQNALPKNTADLLSFAFEKW 224
Query: 248 SEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDT----PSNIPV 301
D + + +G L K F TCT K+ + + T S QY++ S
Sbjct: 225 ---DEMTTTEEGRKELGKIFNTCTEFGEKDIQTFAESIGTALSGYVQYNSSNWKSSYEST 281
Query: 302 KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINED--------RTGDESD 353
IC I +D++ K + E + E+ E+ T E +
Sbjct: 282 DSICTEI------NEDVVNKYPLFIKEKYNPEWGDKECTSSSQEESYKTLQSTSTYAEGN 335
Query: 354 EG-----WEWQSCSEMVVPMGKDKNSMYQPEPWN-LTKYIKNCKEQYGVSPRPSW----- 402
EG W +Q+C + S + N L I CK+ YG+ +
Sbjct: 336 EGAAGRSWFFQTCIAYGYYQAVSEQSSVKWGKLNQLQGSIDMCKDIYGIDKDTLYNAVDH 395
Query: 403 VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL 462
+ YGG + +N+ F+NG DP+ G ++ H + + Q ++ SHC DL
Sbjct: 396 INVRYGGK------KPCVTNVAFTNGNTDPWHALGVTESDH-QEGNLVQLIDRTSHCSDL 448
Query: 463 DEAKKSDPDWLVQQRKTEVKIM 484
K++D L + R E+K +
Sbjct: 449 YSEKETDVPELKKARHNELKFI 470
>gi|380014101|ref|XP_003691081.1| PREDICTED: putative serine protease K12H4.7-like [Apis florea]
Length = 494
Score = 130 bits (328), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 118/470 (25%), Positives = 208/470 (44%), Gaps = 62/470 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
++ Q +DHF P + + QRY IN +++ G P+ + E + G E
Sbjct: 49 WFTQFLDHFD--PTEARVWQQRYFINGEYYKKG-GPVFLMISGEGTATAKWMVKGQWIEY 105
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A QF AL +EHR+YGKS P + ++KN + Y +S QALAD A + + Y
Sbjct: 106 AKQFGALCFQVEHRFYGKSHP--TSDLSVKNLK---YLSSQQALADLAYFIEIMNIDYKL 160
Query: 175 TH-APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
+ IA G SY G LA W R KYPH++ G++++S P+L D + Y+ VV +
Sbjct: 161 PNDTKWIAFGGSYAGSLAAWLRSKYPHLLYGAVSASGPLLAEID---FQEYYVVVENALK 217
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA-- 291
SE C TI ++ + + P G ++KKF C P+ + + + +Y A
Sbjct: 218 QYSETCVNTIIEANKQFHIMLRHPIGQQGIAKKFVLCDPIDSGHTKRSDISNLYETIASN 277
Query: 292 -----QYD-------TPSNIPVKRICNAIEN-APNCGDDILCKIAAGVVEADS---LEYD 335
QY+ +N+ ++ C+ + N + D L ++ ++ A L+Y
Sbjct: 278 FAGIVQYNKDNRNNSAMANLTIESACDILTNDSLGIAIDRLAILSTKILNASKEKCLDYM 337
Query: 336 GNNSRCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQP----EPWNLTKYIK 388
+ + +E EG W +Q+C+E G + S +P E + + +++
Sbjct: 338 YSKMIHKLRNVTWANEEAEGGRQWMYQTCTE----FGFFQTSTARPKLFSETFPIDFFVQ 393
Query: 389 NCKEQYGVSPRPSWVLTYYGGHDIKLI---LRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
C + +G PR + L + ++ L +N++F +G DP +H
Sbjct: 394 QCIDVFG--PRYNIHLLNSAINRTNILYGALNLKVTNVVFVHGSIDP---------WHVL 442
Query: 446 DLSFSQD-------LNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L+ S + ++ +HC ++ + K DP L R ++ W+
Sbjct: 443 GLTKSSNPQMPVIYIDGTAHCANMYPSSKDDPPQLKTARVKIGNLISQWL 492
>gi|186516786|ref|NP_567998.3| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|145651776|gb|ABP88113.1| At4g36190 [Arabidopsis thaliana]
gi|332661229|gb|AEE86629.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 482
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 141/521 (27%), Positives = 215/521 (41%), Gaps = 76/521 (14%)
Query: 3 SSQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDH 62
SS+ F L + +S +NGL L+PR R S L + + ++ QT+DH
Sbjct: 2 SSELGFALLTTFTVLLSYLSFSNGL-LQPR----RISHGLSKSSKYLTRDELWFTQTLDH 56
Query: 63 FSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALI 122
+S P ++ F QRY + PI + E P N + + A +F A I
Sbjct: 57 YS--PSDHRKFRQRYYEYLDHLRVPDGPIFLMICGEGPC--NGITNNYISVLAKKFDAGI 112
Query: 123 VILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA-------T 175
V LEHRYYGKS PF S A KN + Y +S QAL+D A+ + +D N
Sbjct: 113 VSLEHRYYGKSSPFKS--LATKNLK---YLSSKQALSDLATFRQYYQDSLNVKFNRSSNV 167
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
P G SY G L+ WFRLK+PH+ GSLASSA + + + + +++
Sbjct: 168 ENPWFFFGVSYSGALSAWFRLKFPHLTCGSLASSAVV---RAVYEFPEFDQQIAESAGPE 224
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY------SE 289
E Q K GL + ++ K L N TEL D +Y
Sbjct: 225 CETALQETNKLLEL---------GLKVNNRAVKA---LFNATELDVDADFLYLIADAGVM 272
Query: 290 AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-----IN 344
A QY P + V +E N GD L + A V + G +S+ Y ++
Sbjct: 273 AIQYGNPDKLCVP----LVEAQKNGGD--LVEAYAKYVREFCMGVFGQSSKTYSRKHLLD 326
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSW 402
T + +D W +Q C+E+ N + N ++ CK + GV P
Sbjct: 327 TAVTLESADRLWWFQVCTEVAYFQVAPANDSIRSHQINTEYHLDLCKSLFGKGVYPEVDA 386
Query: 403 VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRG----------GWVKTYHFFDLSFSQD 452
YYG I + + IIF+NG +DP+ ++ T H + D
Sbjct: 387 TNLYYGSDKI------AATKIIFTNGSQDPWRHASKQTSSPDLPSYIMTCH--NCGHGSD 438
Query: 453 LN---LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
L + ++ D S PD + + R+ +K + W+++
Sbjct: 439 LRGCPQSAMVIEGDAQNCSSPDAVNKVRQHMIKHIDLWLSE 479
>gi|321477664|gb|EFX88622.1| hypothetical protein DAPPUDRAFT_191142 [Daphnia pulex]
Length = 493
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 121/466 (25%), Positives = 199/466 (42%), Gaps = 55/466 (11%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+++Q +DHF + +T+ QRY N F+ G +SP+ +G E + G E
Sbjct: 50 WFDQKLDHFDV--VNSKTWKQRYHTNDTFFKG-DSPVFLMIGGEGEASPKWMVQGMWIEW 106
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A QF AL LEHRYYGKS P ++ + KN + + +S QALAD A + K +
Sbjct: 107 AKQFNALCFQLEHRYYGKSHP--TKNMSTKNLK---FLSSEQALADLAYFIEAKKKELKL 161
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
++ I G SY G LA WFRLKYPH+ G++ASSAP+ + I + Y VV+ +
Sbjct: 162 SNNKWIVFGGSYPGSLAAWFRLKYPHLAHGAVASSAPL--FAKIN-FKEYLGVVTNALQT 218
Query: 235 TSE--ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK-----DGLDTVY 287
TS+ +C I+++ +++ + K F C PL T +L + L +
Sbjct: 219 TSQSSKCTNAIEQATIALEDEIQSTSCCKTMDKLFSLCDPLDTTNKLDVANFFETLAGNF 278
Query: 288 SEAAQYDTP------SNIPVKRICNAIENAPNCGDDI-----LCKIAAGVVEADSLEYDG 336
QY+ +NI + +C+ + + P G + + + G L+Y
Sbjct: 279 EGVVQYNKDNRDFKGANITMDVLCDMMTD-PKIGSPLARYAAVNNVLLGTTGEKCLDYKY 337
Query: 337 NNSRCYINEDRTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
+ + + EG W +Q+C+E D + + + ++ C +
Sbjct: 338 DKFLIDMRSTDWNSSASEGGRQWTYQTCTEFGYYQSSDLEDQPFGKRFPIEFSVRQCSDI 397
Query: 394 YG----------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
+G R +++ YGG +KL +F NG DP+S G
Sbjct: 398 FGGKFNYKLLKNAVARTNFI---YGGLGLKL------DRTVFPNGSVDPWSALGITSNTT 448
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
F Q +HC D+ D L + R ++ WI+
Sbjct: 449 GNVAIFIQGT---AHCADMYPPSPKDSLELTEARNVIESHLRSWIS 491
>gi|67465637|ref|XP_648991.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
gi|56465317|gb|EAL43602.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
Length = 480
Score = 130 bits (327), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 132/506 (26%), Positives = 209/506 (41%), Gaps = 73/506 (14%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
LL + + S + H + RL +S I K T Y +DHF+ +
Sbjct: 6 LLFLSAYASLTLHQK--QFYKRLSHSLKSDIPNVK-------TLTYTVPLDHFNANNQI- 55
Query: 71 QTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEH 127
F +Y I++ + N+P+ LG E P D+ + F + A + K L++ +EH
Sbjct: 56 -DFDIQYFISTDYLDNNSPNAPLFVLLGGEGPEDETGLQNYFVVTDLAKKHKGLMLSVEH 114
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYG 187
R+YG S P + Y + QAL DY ++ H++++ N PVI +G SY
Sbjct: 115 RFYGASTPSLEMDKLI-------YCTAEQALMDYVEVISHVQEENNLVGHPVIVLGGSYS 167
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSW 247
G LA W R KYP+VV G+ ASSAP+ + + Y VV + + + W
Sbjct: 168 GNLAAWMRQKYPNVVEGAWASSAPV---EAVVDFYQYLEVVQNALPKNTADLLSFAFEQW 224
Query: 248 SEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDT----PSNIPV 301
D + + +G L K F TCT K+ + + T S QY++ PS
Sbjct: 225 ---DKMTTTEEGRKELGKIFNTCTEFGEKDIQTFAESIGTALSGYVQYNSSNWKPSYEST 281
Query: 302 KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDG---------------NNSRCYI--N 344
IC I +DI+ K + E + E+ N+ Y N
Sbjct: 282 DSICAEI------NEDIVNKYPLFIKEKYNPEWADKECTPSSQEESYKTLQNTSTYAEGN 335
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN-LTKYIKNCKEQYGVSPRPSW- 402
ED +G W +Q+C + S + N L I CK+ YG+ +
Sbjct: 336 EDASG----RSWFFQTCIAYGYYQAVSEQSSVKWGKLNQLQGSIDMCKDIYGIDKDTLYN 391
Query: 403 ----VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
+ YGG + +N+ F+NG DP+ G ++ H + + Q ++ SH
Sbjct: 392 AVDHINVRYGGK------KPCVTNVAFTNGNTDPWHALGVTESDH-QEGNLVQLIDRTSH 444
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIM 484
C DL K++D L + R E+K
Sbjct: 445 CSDLYSEKENDVPELKKARHNELKFF 470
>gi|328866714|gb|EGG15097.1| hypothetical protein DFA_09921 [Dictyostelium fasciculatum]
Length = 495
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 115/399 (28%), Positives = 186/399 (46%), Gaps = 39/399 (9%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN- 114
+ Q++DHF + + F QRY++N FW G P+ + E P+ ++G Y
Sbjct: 63 FTQSVDHFDSANQ--KKFQQRYLVNDHFWDG-KGPVFMMINGEGPMSLG-AVTGLQYVVW 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A + ALIV LEHRYYG S F + A +N Y QALAD A I + Y+
Sbjct: 119 AKEVHALIVSLEHRYYGAS--FVTDNLATENLI---YLTPQQALADNAVFRDFIANTYSV 173
Query: 175 THAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ G SY G L++WFR+KYP++V ++ASSAP+ + + Y VV
Sbjct: 174 PQTSKWVSFGGSYSGCLSSWFRIKYPNLVDYAIASSAPV---NPVIDFYQYLEVVQNALL 230
Query: 234 DTS--EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA 291
TS ++C IK+S +I ++ +P+GL +S+ F LK ++ + + ++ +
Sbjct: 231 TTSNGQQCVDRIKQSTQKIQDLLKQPNGLKTVSELFSLDPVLKTDDDISNFMQSL-AGTF 289
Query: 292 QYDTPSNI------PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINE 345
DT N+ V+ +C + N N D L A E + + I E
Sbjct: 290 MGDTQYNLIEGPFKSVEALCLIMNNYSN---DSLTNYIQIWNNAQKGELVDVSYQSMIQE 346
Query: 346 --DRTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVS--P 398
+ T D+++ G W +Q+C++ N+ + + IK C + +G + P
Sbjct: 347 YANITNDDTNVGGRQWFFQTCTQFGYYQSSTSNNHPFGHLFEIDFQIKQCTDIFGFAFLP 406
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
+W + YGG D S SNI++ NG DP+ G
Sbjct: 407 NVNWTILEYGGLD------PSASNIMYINGDIDPWHALG 439
>gi|224150431|ref|XP_002336956.1| predicted protein [Populus trichocarpa]
gi|222837219|gb|EEE75598.1| predicted protein [Populus trichocarpa]
Length = 114
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 61/110 (55%), Positives = 77/110 (70%), Gaps = 2/110 (1%)
Query: 389 NCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+CK +GV P+P W+ TYYGGHDIKLIL+R SNIIFSNG+RDP+S GG + + D
Sbjct: 1 DCKSLFGVLPQPHWITTYYGGHDIKLILQRFGSNIIFSNGLRDPYSSGGVLN--NISDSI 58
Query: 449 FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAI 498
+ GSHCLD+ A SDP WLV QRK EVKI++GWI++YY D +
Sbjct: 59 VAVSTVNGSHCLDIQRANPSDPHWLVMQRKIEVKIIEGWISKYYTDLLEV 108
>gi|302797895|ref|XP_002980708.1| hypothetical protein SELMODRAFT_444586 [Selaginella moellendorffii]
gi|300151714|gb|EFJ18359.1| hypothetical protein SELMODRAFT_444586 [Selaginella moellendorffii]
Length = 1028
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 136/473 (28%), Positives = 202/473 (42%), Gaps = 71/473 (15%)
Query: 12 LLMVIFV----STSFHANGLKLRPRLGRIRRSRILEQKDSNHGFET--FFYNQTIDHFSY 65
+L+V FV STSF ++ RLG + G ET FF Q +DHF+
Sbjct: 3 MLLVCFVLLATSTSF-SSAFVPSSRLGFKPEFLPEGSRSPPRGKETVNFFTRQKLDHFA- 60
Query: 66 GPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVIL 125
PE + F Q+Y+ F+ N PI + E+ + ++ + A F A IV +
Sbjct: 61 -PEDPRVFSQKYLELLDFFRPHNGPIFLVMCGESTCTGDY-VTTYVGTLAESFGAAIVTV 118
Query: 126 EHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD----KYNATHA-PVI 180
EHRYYG S PF N + Y S Q+L D+A + + +D KYN T P I
Sbjct: 119 EHRYYGHSSPFQHL-----NLHNLKYLTSKQSLFDHAVFIDYYQDLINQKYNKTEKNPWI 173
Query: 181 AIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECF 240
IG SY G L+ WFRLK+PH+V GS ASSA + D + + V +C
Sbjct: 174 VIGGSYAGALSAWFRLKFPHLVAGSWASSAVVEAILDYSAYDKQLGV------SVGPKC- 226
Query: 241 QTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIP 300
K++ EI + GL + + K + D Y A
Sbjct: 227 ---KQALQEITRLTEH--GLVENATEIKYLFGFSPKDNITDDTLLAYVANAAAGEIQYGK 281
Query: 301 VKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESD-EGWEWQ 359
+ +C+ + A ++L K A +++ IN D G+E D E W++Q
Sbjct: 282 IDALCDPLLKAEKSNRNLL-KTYAKILDR-------------INSDTNGNERDNESWDFQ 327
Query: 360 SCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT--YYGGHDIKLIL 416
C+E+ + D+ S + N +I C EQ+G P T YYGG +I
Sbjct: 328 YCTEVGYFQVASDRKSSIRSSRINTQFFINYCAEQFGNGTFPDVKTTNLYYGGWNI---- 383
Query: 417 RRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL-------NLGSHCLDL 462
+ S I+F NG +DP+ H + S+D+ + SHC+DL
Sbjct: 384 --AGSRIMFLNGSQDPWR--------HASKQTSSKDMPALVLRCHTCSHCVDL 426
>gi|225718928|gb|ACO15310.1| serine protease K12H4.7 precursor [Caligus clemensi]
Length = 485
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 134/506 (26%), Positives = 222/506 (43%), Gaps = 55/506 (10%)
Query: 11 LLLMVIFVSTSFHANGLK-LRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPES 69
L+ V F+ + NG K +L +++ G +YNQT+DHF+ E+
Sbjct: 6 LIAWVTFLDLALANNGEKSFFLKLLDMKKLSSPTALSDRSGIFISYYNQTLDHFN--EEN 63
Query: 70 YQTFPQRYVINSKFWGGG-NSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHR 128
+T+ QRY +N++F+ G +P+ +G E D+ G Y A + AL++ LEHR
Sbjct: 64 KKTWNQRYFVNTEFFNGTETAPVFLLIGGEGTASDSWMKYGAWYGYAKEVGALMIQLEHR 123
Query: 129 YYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA-PVIAIGASYG 187
+YG S P + + +N + + S QAL D + K +Y+ + G SY
Sbjct: 124 FYGSSRP--TENMSTENLK---FLTSQQALEDIVEFIRFAKQQYSLNETNKWVTFGGSYP 178
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSW 247
G L+ W R YP ++ G+L+SSAP+ D Y VV D +C +KK+
Sbjct: 179 GSLSLWMRSLYPELISGALSSSAPVEVKVDFEE---YLGVVENDMNIRDPKCVPEVKKAI 235
Query: 248 SEIDN-IASKPDGLAILSKKFKTCTPLK--NTTELKDGLDTV---YSEAAQYDTP-SNIP 300
+I I S PDG ++K F C N +L+ +V + +AQYD+ +N
Sbjct: 236 QQIQALIVSAPDGWKKVAKIFSLCDGWSGDNIQDLRSFYASVLGAFYSSAQYDSVLNNDD 295
Query: 301 VKRICNAIENAPNCGDDILCKIAA---GVVEADSLEYDGNNSRCYINEDRTGDESDEG-- 355
+ +C + N GD L ++A+ G L + + ++ + D G
Sbjct: 296 LAHMCPYMSNE-YFGDSSLERLASTLKGKYGGSCLNVNYKDLLDFMTTEEWAHGEDVGYR 354
Query: 356 -WEWQSCSEMV-VPMGKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTY 406
W +Q+C+E G S + E Y + C++ YG ++ ++
Sbjct: 355 QWVYQTCNEFGWYQTGNIWGSFFPVE-----FYTQQCRDVYGMDFTDEIIASNANYTNIM 409
Query: 407 YGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD---LNLGSHCLDLD 463
YG + L SN I ++G DP+ G ++ D+S S +N SHC DL
Sbjct: 410 YGSKNPPL------SNTIITHGSFDPWHPMGILE-----DMSESVKTFIINGTSHCYDLY 458
Query: 464 EAKK-SDPDWLVQQRKTEVKIMQGWI 488
SD + L + R + ++ WI
Sbjct: 459 PPNPLSDSEELTRARNITFEHIKRWI 484
>gi|348523600|ref|XP_003449311.1| PREDICTED: thymus-specific serine protease-like [Oreochromis
niloticus]
Length = 641
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 116/464 (25%), Positives = 202/464 (43%), Gaps = 67/464 (14%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF+ ++ TFPQR+ +N +W + P+ ++G E P+ + L+G + A +
Sbjct: 64 QPLDHFNQ--QNSNTFPQRFFVNEAYWQHHDGPVFLYIGGEGPLVEYDVLTGHHSDMAEE 121
Query: 118 FKALIVILEHRYYGKSV-PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATH 176
AL++ LEHR+YG S+ P G + L +S QALAD A+ +I +N TH
Sbjct: 122 HGALLLALEHRFYGDSINPDGLKTENLAG------LSSQQALADLATFHQYISQSFNLTH 175
Query: 177 APV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
I+ G SY G L+ WFR K+P++V G++ASSAP+ D ++ Y++VV +
Sbjct: 176 RNTWISFGGSYSGALSAWFRGKFPNLVHGAVASSAPVKAKLD---FSEYNNVVGLSLLNE 232
Query: 236 ----SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK---NTTELKDGLDTVYS 288
SE+C ++++++ + A + ++ F C K + EL L ++
Sbjct: 233 AVGGSEKCLSKVRQAFAAVKE-ALMSGNINQVASDFGCCQIPKDPYDQIELMQSLADIFM 291
Query: 289 EAAQYDTPSNI-PVKRICNAIENAPNCGDD---------ILCKIAAGVVEADSLE--YDG 336
A QY+ + + +C + N+ D L +I + L+ Y+
Sbjct: 292 GAVQYNEEGVLMSINELCGIMTNSSQEYQDEMEAYNRLVKLSQIYRFTSKEPCLDISYEK 351
Query: 337 NNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGV 396
+ G + W +Q+C+E + + L K C +G+
Sbjct: 352 SMKDLMDTSVHAGRRGERQWTYQTCTEFGFYQTCEDATCPFSGMLTLQDQTKLCTTLFGI 411
Query: 397 S-----PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
S R ++ TYYGG + T +I++ NG DP+ LS Q
Sbjct: 412 SQHSLPARIAFTNTYYGGDNPH------THSILYVNGGIDPWK-----------TLSVVQ 454
Query: 452 D------------LNLGSHCLDLDEAKKSDPDWLVQQRKTEVKI 483
D + +HC D+ + +D L + R+ + I
Sbjct: 455 DGTEEGEEAQTVFIKDTAHCADMSSRRVTDRSSLTKARQPVLLI 498
>gi|297735044|emb|CBI17406.3| unnamed protein product [Vitis vinifera]
Length = 487
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 137/493 (27%), Positives = 203/493 (41%), Gaps = 78/493 (15%)
Query: 39 SRILEQKDSNHGFET---FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFL 95
SR+L++ + F T ++NQT+DHFS P + FPQRY + ++ + PI +
Sbjct: 23 SRLLQRNSHSSNFLTTDELWFNQTVDHFS--PLDHSKFPQRYYEFTDYFRLPDGPIFLKI 80
Query: 96 GAEAPIDD--NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGS-RKAALKNARHRGYF 152
EA D N +S A +F A +V LEHRYYGKS PF S R LK Y
Sbjct: 81 CGEASCDGIPNDYISVL----AKKFGAAVVSLEHRYYGKSSPFRSLRTENLK------YL 130
Query: 153 NSAQALADYASIL-------------LHIKDKYNATHAPVIAIGASYGGELATWFRLKYP 199
+S QAL D A L++K + P G SY G L+ WFRLK+P
Sbjct: 131 SSKQALFDLAVFRQYYQAKVVPIGESLNVKVNRSNVENPWFVFGVSYAGALSAWFRLKFP 190
Query: 200 HVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK--- 256
H+ GSLASSA +L + T + R E K E++ + +
Sbjct: 191 HLTCGSLASSAVVLAIYNFTEFD----------RQIGESAGAECKAVLQEVNGLVEQRLA 240
Query: 257 PDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGD 316
DG A+ K LK + L A QY P +C+ + A G+
Sbjct: 241 VDGNAV--KSLFGAASLKIDADFLYLLADAAVTAFQYGNP-----DMLCSPLVQAKKDGE 293
Query: 317 DILCKIAAGVVE----ADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK 372
D+ A V E + + N + + GD SD W +Q C+E+
Sbjct: 294 DLENAYATYVKEYYLGTFGVSIETYNQQHLKRTNSAGDTSDRLWWFQVCTEVAFFQVAPS 353
Query: 373 NSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMR 430
+ + N ++ CK + G+ P YYGG I + S IIF+NG +
Sbjct: 354 DDSMRSSEINTKYHLDLCKNVFGNGIYPDVDATNIYYGGTKI------AGSKIIFTNGSQ 407
Query: 431 DPFSRGGWVKTYHFFDL-SFSQDLNLGSHCLDL------------DEAKKSDPDWLVQQR 477
DP+ + + D+ SF + H DL D S PD + + R
Sbjct: 408 DPWRHASKLTSTP--DMPSFLISCHNCGHGTDLRGCPQSPLSPEGDAKNCSSPDAVHKVR 465
Query: 478 KTEVKIMQGWITQ 490
+ ++ + W++Q
Sbjct: 466 QQIIEHIDLWLSQ 478
>gi|403308859|ref|XP_003944859.1| PREDICTED: thymus-specific serine protease isoform 2 [Saimiri
boliviensis boliviensis]
Length = 539
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 119/469 (25%), Positives = 192/469 (40%), Gaps = 65/469 (13%)
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
++F QRY +N + W G + PI LG E + + G A + AL++ LEHR+Y
Sbjct: 73 RSFLQRYWVNDQHWTGQDGPIFLHLGGEGSLGPGSVMKGHPAALAPAWGALVISLEHRFY 132
Query: 131 GKSVPFG-----------SRKAALKNA-----RHRGYFNSAQALADYASILLHIKDKYN- 173
G S+P G SR A K++ R S LAD S L + +N
Sbjct: 133 GLSIPAGGLDMAQLRFLSSRHAVGKSSGIPSDEDRPSLPSDPRLADVVSARLALSRLFNV 192
Query: 174 ATHAPVIAIGASYGGELATWFRLK------YPHVVIGSLASSAPILYYGDITPWTTYHSV 227
++ +P I G SY G LA W RLK +PH++ S+ASSAP+ D ++ Y+ V
Sbjct: 193 SSSSPWICFGGSYAGSLAAWARLKVLGFLRFPHLIFASVASSAPVRAVLD---FSEYNDV 249
Query: 228 VSKDFRDT----SEECFQTIKKSWSEIDNIASKPDGLAILSK-KFKTCTPL---KNTTEL 279
VS+ + S EC + ++SE++ + + C L ++ EL
Sbjct: 250 VSRSLKSAAIGGSLECQAAVSTAFSEVERRLRAGGAARAALQAELNACGSLSRAEDQAEL 309
Query: 280 KDGLDTVYSEAAQYDTPSNIPVK----------RICNAIENAPNCGDDILCKIAAGVVEA 329
L + A QYD + P+ N + P CG +I +
Sbjct: 310 LGALQALVGGAVQYDGQAGAPLSVRQLCGLLLGGGANRSRSTPYCGLRRAVQIVMHSLGQ 369
Query: 330 DSLEYDGNNSRCYI--NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYI 387
L + + + E + D W +Q+C+E + + + L +
Sbjct: 370 KCLSFSRAETVAQLRSTEPQVSGVGDRQWLYQTCTEFGFYVTCENPRCPFSQLPALPSQL 429
Query: 388 KNCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
C++ +G+SP + +YYGG + ++F NG DP W
Sbjct: 430 DLCEQVFGLSPLSVAQAVAQTNSYYGGQT------PGANQVLFVNGDTDP-----WHVLS 478
Query: 443 HFFDLSFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L S+ + L GSHCLD+ + SD L R++ + +Q W+
Sbjct: 479 VTQALGSSESVLLIPSGSHCLDMAPERPSDSPSLRLGRQSIFQQLQTWL 527
>gi|148700667|gb|EDL32614.1| protease, serine, 16 (thymus), isoform CRA_b [Mus musculus]
Length = 475
Score = 129 bits (325), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 102/398 (25%), Positives = 176/398 (44%), Gaps = 42/398 (10%)
Query: 57 NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAH 116
Q +D F+ +TF QRY +N + G + P+ +G E + ++G A
Sbjct: 62 EQPLDPFN--ASDRRTFLQRYWVNDQHRTGQDVPVFLHIGGEGSLGPGSVMAGHPAALAP 119
Query: 117 QFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-AT 175
+ AL++ LEHR+YG S+P G AL Y +S ALAD AS + N ++
Sbjct: 120 AWGALVISLEHRFYGLSMPAGGLDLALLR-----YLSSRHALADVASARQALSGLLNVSS 174
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
+P I G SY G LATW RLK+PH+V ++ASSAP+ + ++ Y+ VV++
Sbjct: 175 SSPWICFGGSYAGSLATWARLKFPHLVFAAVASSAPL---SAVVDFSAYNQVVARSLTQV 231
Query: 236 ----SEECFQTIKKSWSEIDNIA-SKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVY 287
S EC +++E++ + + P A+L ++ C L ++ EL L +
Sbjct: 232 AIGGSLECLAAASTAFTEVERLLRAGPAAQAVLREELGACGSLDLTEDQAELLGALQALV 291
Query: 288 SEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYDGN 337
QYD + P V+++C N + P G +I + L +
Sbjct: 292 GGTVQYDGQAGAPLSVRQLCGLLLGKWGNRSRSTPYLGLRRAVQIVLRSMGQKCLSFSRA 351
Query: 338 NSRCYIN--EDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG 395
+ ++ E + D W +Q+C+E + + + L ++ C++ +G
Sbjct: 352 ETVAQLSNTEPQVSGVGDRQWLYQTCTEFGFYVTCEGLQCPFSQLPALPFQLELCEQVFG 411
Query: 396 VSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNG 428
+SP + +YYGG + ++F NG
Sbjct: 412 LSPASVAQAVAQTNSYYGGQS------PGATQVLFVNG 443
>gi|348682257|gb|EGZ22073.1| hypothetical protein PHYSODRAFT_495993 [Phytophthora sojae]
Length = 527
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 126/466 (27%), Positives = 191/466 (40%), Gaps = 81/466 (17%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+ +++Q +DHFS ++ TF QRY +FW + P++ ++G E ++ +GF
Sbjct: 50 QQLWFSQQLDHFS--SDANATFKQRYYEVDEFWKAPSGPVILYIGGEGALEQAP--AGFV 105
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+ A +F A IV LEHR+YGKSVP G L A +R Y QALAD L H K+
Sbjct: 106 HVIAQKFGAKIVALEHRFYGKSVPNGD----LSTANYR-YLTVQQALAD----LKHFKES 156
Query: 172 YNATHAPV-----IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHS 226
Y IAIG SY G L+ WFR+ YP + SL+SS G + P +H
Sbjct: 157 YQRELGAEDANQWIAIGGSYPGALSAWFRVAYPDTTVASLSSS------GVVQPVYKFHQ 210
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
+ C ++ + +E + I S L EL D D
Sbjct: 211 FDEQVALAAGPSCADVLRLTTAEFEK--------EIASGNATKVKGLFGAQELADP-DFF 261
Query: 287 YSEAAQYDTPSNIPVKRICNAIENAPNCG-DDILCKIAAGVVEAD----------SLEYD 335
Y I +A A G DI+C+ G E + +++
Sbjct: 262 Y---------------MIADAAAMAVQYGHKDIVCESMVGAFERNVSLVESFANFTIDMY 306
Query: 336 GNN--SRCYI-------NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKY 386
G + S C+ ++ R GD W WQ CS++ + +L +
Sbjct: 307 GASFGSECFYDTKCLAHDQARWGD--GRSWRWQKCSQLAYFQVAPTEKSLRAAMVDLDYH 364
Query: 387 IKNCKEQYGVSPRPS----WVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
+K CK +G PS + YGG + I FSNG DP+ R + T
Sbjct: 365 LKQCKTVFGDVVNPSEGVEEISKLYGGD------HPTGHKIFFSNGGDDPWQRASVLDTL 418
Query: 443 HFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+++ L HC DL A P+ L +QR+ ++ + W+
Sbjct: 419 SDDEIANLAKCELCGHCGDL-RANPDVPEPLKKQREQILEYLTKWL 463
>gi|324503010|gb|ADY41315.1| Serine protease [Ascaris suum]
Length = 1068
Score = 129 bits (324), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 129/495 (26%), Positives = 208/495 (42%), Gaps = 58/495 (11%)
Query: 31 PRL--GRIRRSRILEQKDSN------HGFETFFYNQTIDHFSYGPESYQTFPQRYVINSK 82
PR GR R ++E S+ GF +DHF + TF QRY +N +
Sbjct: 565 PRFFRGRPLRGFLVEPPTSSLLVDYPPGFVAGNITMPVDHFDL--TNMNTFDQRYWVNPQ 622
Query: 83 FWGGGNSPILAFLG-AEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKA 141
+ G L G A + + + T A +F A + +LEHRYYG S P +
Sbjct: 623 YAQPGGPHFLVIGGEGRANVKWVTEPNLITMSMARKFNATVYMLEHRYYGDSFP--TPDQ 680
Query: 142 ALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHV 201
+ +N R + + QALAD A ++ + ++YN + + G SY G L+ WFR YP +
Sbjct: 681 STENLR---WLTATQALADLAQFIMTMNERYNLVNPKWVTFGGSYPGMLSAWFRQFYPQL 737
Query: 202 VIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLA 261
+G++ASSAPI D + Y VV R + C + +K ++ +I ++ G
Sbjct: 738 SVGAVASSAPIEAKVD---FYEYLIVVENALRVFNATCAENVKLAFDQIHQLSLTRTGRV 794
Query: 262 ILSKKFKTCTPLKNTTELKDGLDTVY---------SEAAQYDT------PSNIPVKRICN 306
LS F T P N T LD Y A QY+ + ++ +C
Sbjct: 795 TLSNLF-TLKPEWNLTTEVTNLDIQYFFSILYDKFQGAVQYNNDNTGSYATGGGIREVCG 853
Query: 307 -AIENAPNCGDDIL-CKIAAGVVEADSLEYDGNNSRCYIN--EDRTGDESDEGWEWQSCS 362
+ NA +++ I + + Y NN + YI+ +D S W +Q+C+
Sbjct: 854 YMLNNAKTPMENVADVNIYMTNFSSGTFSYTDNNYQNYIDYLKDVNAKSSSRSWTYQTCN 913
Query: 363 EMVVPMGKD--KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT-------YYGGHDIK 413
E D +N P P N+ +I C++ YG P +V +YGG D
Sbjct: 914 EFGFFQSTDVGENIFGGPIPVNI--FIDMCQDVYGSKFTPRFVYEAVDKSQRFYGGRDY- 970
Query: 414 LILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWL 473
+N++F+NG DP+ + Y + +N +HC D+ + D L
Sbjct: 971 ----FKGTNVLFTNGNIDPWHA---LSKYDGNGSVTTVLMNGTAHCADMYPPRDEDAADL 1023
Query: 474 VQQRKTEVKIMQGWI 488
R+ + + W+
Sbjct: 1024 APTRELIGEKIAEWL 1038
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 124/474 (26%), Positives = 201/474 (42%), Gaps = 72/474 (15%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP------IDDNIQLSGFT 111
Q +DHF+ +T+ QRY N ++ G PI LG E P +D+ +L
Sbjct: 65 QRLDHFN--ASDARTWAQRYHYNFNYYKSG-GPIFLMLGGEGPETGSWCVDE--KLPYIQ 119
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+ +H A I LEHR+YG+S PF ++ +++N + Y +S QA+ D A + +I ++
Sbjct: 120 WAMSH--NAAIYDLEHRFYGQSRPFPTQ--SIENLK---YLSSRQAIEDAAYFIRYINEQ 172
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
+ I G SY G LA W R K+P +VIG++ SS P+ D + Y VV
Sbjct: 173 QKYVNPKWIVFGGSYSGALAAWLREKHPELVIGAVGSSGPVEAKLD---FYEYLEVVENA 229
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK-----DGLDTV 286
R + EC +++ ++E+ + +G LS+ F P N T+L+ + T+
Sbjct: 230 LRSYAPECADAVQQGFTEMSKMIWTLEGRKNLSELF-VLNPKLNETKLRYKDIQNFFATI 288
Query: 287 YSE---AAQY--DTPSNIPV----KRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGN 337
Y A QY D + + IC + N D L +I + +V + +
Sbjct: 289 YGYFQWAVQYSGDNAGSYAIGGGISEICPLMMN---TSMDYLNRIKSVIVYLTEFDSSIS 345
Query: 338 NSRCYINEDRT----GDE---------SDEGWEWQSCSEMVVPMGKD--KNSMYQPEPWN 382
+ I+ D DE +D W WQ+C+E D +N P N
Sbjct: 346 FTSVGIDYDEMIEFLADERYDPSGYYSADRSWVWQTCTEFGYFQSTDLGRNIFGSVTPVN 405
Query: 383 LTKYIKNCKEQYGVSPRPSWV-------LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSR 435
L ++ C + +G + + + YYGG D +N++ NG DP+
Sbjct: 406 L--FVDMCTDTFGSAYKIQAIENSIHMTRKYYGGKD-----HFKGTNVVLPNGDIDPWHA 458
Query: 436 GGWVKTYHFFDLSFSQDLNLGS-HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
G Y + S L G+ HC D+ A+ D L R + W+
Sbjct: 459 LG---LYSNIEPSVVPILIHGTAHCADMYPARTQDLPALTNARNIIASNINKWL 509
>gi|307196629|gb|EFN78126.1| Putative serine protease K12H4.7 [Harpegnathos saltator]
Length = 433
Score = 129 bits (323), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 122/454 (26%), Positives = 203/454 (44%), Gaps = 52/454 (11%)
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+ + QRY +NS ++ N PI +GAE I + G E A + A+ +EHRYY
Sbjct: 2 RVWKQRYFVNSDYYKL-NGPIFLMIGAEGEIKPKWLVEGLWIEYAKELGAMCFYVEHRYY 60
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAPVIAIGASYGGE 189
GKS P + ++KN Y +S ALAD A + + Y I G SYGG
Sbjct: 61 GKSHP--TVDLSVKNLM---YLSSELALADLAYFIESVNIGYKFPNDTKWIVFGGSYGGS 115
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
LA W RLKYPH V G++++S P+L D + Y+ VV + S++C T+ + E
Sbjct: 116 LAAWMRLKYPHFVHGAVSASGPLLALID---FQEYYVVVEDALKQHSQQCVDTVADANKE 172
Query: 250 IDNIASKPDGLAILSKKFKTCTPLK-------NTTELKDGLDTVYSEAAQYD-------T 295
+ G + +KF+ C P+ + + L + L + +++ QY+
Sbjct: 173 FHIMLHHLIGQKQIEEKFRLCDPIDPGHTKTIDISNLYESLASNFADIVQYNKDNRQESK 232
Query: 296 PSNIPVKRICNAIENAPNCGD--DILCKIAAGVVEA---DSLEYDGNNSRCYINEDRTGD 350
+NI V IC+ + N G D L I + ++ A L+Y +N +
Sbjct: 233 TANINVDTICDVLTN-DELGRPVDRLAYINSMILNATKEKCLDYRYDNMIHSLRNITWAS 291
Query: 351 ESDEG---WEWQSCSEMVVPMGKDKNSMYQP----EPWNLTKYIKNCKEQYGVSPRPSWV 403
E EG W +Q+C+E G + S +P E + + Y++ C + +G PR +
Sbjct: 292 EQAEGGRQWMYQTCTE----FGFFQTSTARPKLFSETFPVDFYVQQCVDIFG--PRYNLD 345
Query: 404 LTYYGGHDIKLI---LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL 460
+ ++ L +N++ +G DP+ G K+ + + D +HC
Sbjct: 346 MLKSAVTRTNILYGALNLQVTNVVHVHGSVDPWHVLGITKSSNPQAPAIYIDGT--AHCA 403
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDD 494
L + + DP L Q R +++G I Q+ D+
Sbjct: 404 ILYPSSEKDPPQLKQAR----IVVKGLIKQWLDN 433
>gi|47224854|emb|CAG06424.1| unnamed protein product [Tetraodon nigroviridis]
Length = 418
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 115/438 (26%), Positives = 189/438 (43%), Gaps = 59/438 (13%)
Query: 75 QRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSV 134
QR+++N FW + P+ ++G E PI + L+G + A Q AL++ LEHR+YG SV
Sbjct: 4 QRFLVNEAFWRNPDGPVFLYIGGEGPIFEYDVLAGHHVDMAQQHSALLLALEHRFYGDSV 63
Query: 135 -PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV-IAIGASYGGELAT 192
P G + H + +S QALAD A +I +N +H I+ G SY G L+
Sbjct: 64 NPDGLK------TEHLAHLSSKQALADLAVFHQYISGSFNLSHGNTWISFGGSYAGALSA 117
Query: 193 WFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD--TSEECFQTIKKSWSEI 250
WFR K+PH+V G++ASSAP+ D + +T + S R + + ++K+++ +
Sbjct: 118 WFRGKFPHLVFGAVASSAPVRATLDFSAYTNVMLLSSMKTRVFLHHQNTGKAVQKAFTAV 177
Query: 251 DNIASKPDGLAILSKKFKTC-TP--LKNTTELKDGLDTVYSEAAQYDTPSN-IPVKRICN 306
+ + + S F C TP L + EL L V+ A QY+ + + +C
Sbjct: 178 EAQLMVGNASQVAS-DFGCCQTPKNLDDQIELMQNLADVFMGAVQYNEEGVYMSISDLCK 236
Query: 307 AIENAPNC---GDDI------LCKIAAGVVEADSLEYDGNNS-RCYINED-RTGDESDEG 355
+ G D L +I + E L+ + R ++ G S+
Sbjct: 237 VMTRQNGTYEKGRDAYNSLVKLAQIYRSITEEPCLDISHEKTLRDLMDTSPHAGRRSERQ 296
Query: 356 WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSP-----RPSWVLTYYGGH 410
W +Q+C+E ++N+ L + C +G+S R ++ TYYGG
Sbjct: 297 WTYQTCTEFGFFQTCEENTCPFSGMVTLQFQTEVCSSVFGISQHSLPRRVAFTNTYYGGD 356
Query: 411 DIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG-----------SHC 459
T +++ NG DP+ +LS QD G +HC
Sbjct: 357 SPH------THRVLYVNGGIDPWK-----------ELSVIQDRGEGDEDQVIFIEDTAHC 399
Query: 460 LDLDEAKKSDPDWLVQQR 477
D+ + +D L R
Sbjct: 400 ADMMSRRLTDRRSLKTAR 417
>gi|357518131|ref|XP_003629354.1| Thymus-specific serine protease [Medicago truncatula]
gi|355523376|gb|AET03830.1| Thymus-specific serine protease [Medicago truncatula]
Length = 455
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 132/477 (27%), Positives = 204/477 (42%), Gaps = 76/477 (15%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+ ++NQT+DHFS P ++ F QRY ++ + PI +G EA N ++ +
Sbjct: 14 DVIWFNQTLDHFS--PYDHRQFRQRYYEFLDYFRAPDGPIFLVIGGEATC--NGIVNDYI 69
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
A +F A +V LEHRYYG+S PF + + + Y +S QAL D A + +D
Sbjct: 70 GVLAKKFGAAVVSLEHRYYGESTPFDTF-----STENLKYLSSKQALFDLAVFRQYYQDS 124
Query: 172 YNA------THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
NA P G SY G L+ WFRLK+PH+ GSLASSA +L D +
Sbjct: 125 LNAKLNRSGVENPWFFFGGSYSGALSAWFRLKFPHLTCGSLASSAVVLAVQDFAEFD--- 181
Query: 226 SVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT 285
+ EC ++++ +++ DG A+ S + N +L+ D
Sbjct: 182 ---QQIGESAGPECKAVLQET-TQLVETKLADDGKALRS--------IFNADDLEIDGDF 229
Query: 286 VY--SEAA----QYDTPSNIPVKRICNAIENAPNCGDDILCKIA-------AGVVEADSL 332
+Y ++AA QY P ++C + +A N G+D++ A G
Sbjct: 230 LYYLADAAVIAFQYGNPD-----KLCKPLVDAKNAGEDLVDAYAKYVKEYYVGTFGITPK 284
Query: 333 EYDGNN-SRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCK 391
YD + INE D S W +Q C+E+ N + + ++ CK
Sbjct: 285 SYDQEYLKKTAINE----DSSTRLWWFQVCTEVAYFQVAPSNDSIRSSKIDTKYHLDLCK 340
Query: 392 EQY--GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDL-S 448
+ GV P YYGG + + S IIF+NG +DP+ K DL S
Sbjct: 341 NIFGDGVFPDVDATNLYYGGTKV------AGSKIIFTNGSQDPWRHAS--KQTSSPDLPS 392
Query: 449 FSQDLNLGSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
+ N HC DL +E S PD + + R+ + M W+++ D
Sbjct: 393 YLIKCNNCGHCTDLRGCPQSPLVIEGNEKNCSSPDAVHKVRQKVQEDMDLWLSECID 449
>gi|196014354|ref|XP_002117036.1| hypothetical protein TRIADDRAFT_63413 [Trichoplax adhaerens]
gi|190580258|gb|EDV20342.1| hypothetical protein TRIADDRAFT_63413 [Trichoplax adhaerens]
Length = 439
Score = 128 bits (322), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 125/463 (26%), Positives = 192/463 (41%), Gaps = 57/463 (12%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF + + + QRY +N F G PI LG E P ++G A +
Sbjct: 2 QKLDHFDH--TNTAVWSQRYFVNDTFHKKGG-PIFLMLGGEGPASPVWNVAGAWQIYAKK 58
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATH 176
A+ + +EHRYYG+S P A+ N + Y +S QALAD A + N +
Sbjct: 59 LNAITIQIEHRYYGQSHPVSD--ASTPNLK---YLSSEQALADAAYFREYFMTSKNMSAD 113
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
I G SY G L+ W R KYPH+ S+A+SAPIL D + Y VV+K +
Sbjct: 114 TKWIVFGGSYSGALSAWLRTKYPHLFHASVATSAPILAKVD---FEQYLQVVTKSLQTAG 170
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL----KNTTELKDGLDTVYSEAAQ 292
C + I + + I + G LS+ FKTC PL + + L + Q
Sbjct: 171 MACTKNIHNATTIIQGMIKTSAGRKKLSQMFKTCKPLSKDPNDISTFMQSLAGNFEGIVQ 230
Query: 293 YDTPSNIPVKRICNAIENAPNCGDDILCKIAA------GVVEADSLEYDGNNSRC----Y 342
Y+ N +R + P LCKI G V+ +SL N +C Y
Sbjct: 231 YNK-DNTGFER------HTPATTLTDLCKIMEKNKPLDGYVKVNSLILKQNGQKCNDVVY 283
Query: 343 INEDRTGDESD-------EGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG 395
+ + +S W +Q+C+E D + + +++ C + +G
Sbjct: 284 KDMIKQMQQSKFQPGIAGRLWFYQTCTEFGYYQTSDSKKQSFGNMFPIKYWVQQCADVFG 343
Query: 396 VSPRPSWV-----LT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
PS++ +T YYG IK + I+F NG DP+ G +K+ D +
Sbjct: 344 KKFSPSYINGEITMTNNYYGALAIK------GTRIVFPNGSIDPWHALGLLKST---DAT 394
Query: 449 FSQDLNLGS-HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
G+ HC ++ DP L Q R + + W+ +
Sbjct: 395 RPTIFIKGTAHCANMYPPTSKDPAGLRQARTKILGYLTKWLKE 437
>gi|340376307|ref|XP_003386675.1| PREDICTED: putative serine protease K12H4.7-like [Amphimedon
queenslandica]
Length = 486
Score = 128 bits (321), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 135/499 (27%), Positives = 213/499 (42%), Gaps = 38/499 (7%)
Query: 11 LLLMVIFVSTSFHANGLKLRPR-LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPES 69
L++++ F+S S H+ LR + L R + R L +D + +F Q+ DHF +
Sbjct: 6 LIILLFFLSFS-HSLPTFLRHKWLQREKYFRHLSSQDLKLPKDLWF-TQSRDHFREVDTT 63
Query: 70 YQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRY 129
T+ QRY +N FW N P+ +G E D + G A ++ AL LEHR+
Sbjct: 64 --TWQQRYWVNDSFWDKENGPVFLMIGGEGEADPKWVVEGEMMVLAEKYHALAFQLEHRF 121
Query: 130 YGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV-IAIGASYGG 188
YGKS P A + + NS QAL D A +++ KYN T A IA G SY G
Sbjct: 122 YGKSQP-----GADLSMDYITLLNSRQALEDLAYFRMNMTTKYNMTDANRWIAFGGSYPG 176
Query: 189 ELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE--ECFQTIKKS 246
LA W R+KYP +V GS+ASSAPI D + Y VVS C I +
Sbjct: 177 ALAAWLRMKYPDIVYGSIASSAPIQAKFD---FYEYFEVVSASLEQARNGPACVAAINAA 233
Query: 247 WSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGLDTVYSEAAQYDTPSN----I 299
+S ++ + A L K F L + L L +++ QY+ S
Sbjct: 234 FSLLNKLILDNSKWASLDKMFNLTVGLTTPEDGKMLAMSLAGIFAGIVQYNNDSRGSAVY 293
Query: 300 PVKRICNAIEN-APNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEW 358
+ +C+ + N + + D L ++ D+ + + N + D W +
Sbjct: 294 NMSLLCDVMTNDSISEPIDRLAHLSRSPALFDNDVSFQDYVKFLSNVTIDPAQGDRQWTY 353
Query: 359 QSCSEMVVPMGKDKNSMYQP--EPWNLTKYIKNCKEQYGVSPRPSWVLT-----YYGGHD 411
Q+C D S +QP ++ Y C + +G + + V +YGG +
Sbjct: 354 QTCDSFGYFQTAD--SPHQPFGTLFDTALYTLICNQVFGFTEKDIPVNVNNTNEFYGGLN 411
Query: 412 IKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPD 471
+ + +NI+F NG DP+ T + + + +N +HC ++ A D
Sbjct: 412 FN---KTAVTNIVFPNGSIDPWHSLS--VTANVSESVTAVFINGTAHCANMYPANPYDKP 466
Query: 472 WLVQQRKTEVKIMQGWITQ 490
LV RK I+ W+ +
Sbjct: 467 QLVAARKEIDAILGSWLAK 485
>gi|159489554|ref|XP_001702762.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280784|gb|EDP06541.1| predicted protein [Chlamydomonas reinhardtii]
Length = 419
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 109/402 (27%), Positives = 173/402 (43%), Gaps = 63/402 (15%)
Query: 2 KSSQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRR--SRILEQKDSNHGFETF----- 54
+S+ S + + + +SF + RP L R+ SR L++ S+ +
Sbjct: 5 RSTTLSLWSCAALAVVLLSSFTVQAILPRPGLVRLGNGLSRKLQRPFSSVATHSLRKDLL 64
Query: 55 ------FYNQTIDHFSYGPES--YQTFPQRYVINSKFW-----GGGNSPILAFLGAEAPI 101
N ++DHFS P + TFPQRY + + W G PI +LG EA +
Sbjct: 65 AQCKLQLRNASLDHFSRVPPAGDVTTFPQRYFVCASHWQRENPDGTPGPIFFYLGNEADV 124
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
+ +G +E+A F A++V EHRYYG+SVP+G A+K +H GY ++ Q
Sbjct: 125 TLYLNNTGLMWESAADFGAMLVFAEHRYYGESVPYGE---AVK--KHMGYLSAEQ----- 174
Query: 162 ASILLHIKDKYNATHA-PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP 220
+++ +K+++ VI G SYGG LA W RLKYPHV+ G++A+SAPI + P
Sbjct: 175 --LIMELKEQFQLPQGTAVIGFGGSYGGMLAAWMRLKYPHVLDGAIAASAPIWNFLGEVP 232
Query: 221 WTTYHSVVSKDFRDTSE------ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK 274
S D SE C +K +W + + + G F
Sbjct: 233 AFDAGSFAKGVTYDASELAGSAPACVDNVKATWDVMASYGNDDAGRGGAMGNFP------ 286
Query: 275 NTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDD-ILCKIAAGVVEADSLE 333
+ Y P+ PV+ C + GDD + + + A +
Sbjct: 287 --------YPSSYITNGHGQLPA-FPVRAACEPLAG----GDDWVDADLLDAMAAAVGVF 333
Query: 334 YD-GNNSRC---YINEDRTGDESDEGWEWQSCSEMVVPMGKD 371
Y+ + C + D D W++Q C+EM++P KD
Sbjct: 334 YNHTGDLECFDPFAGTDPDSDHDANWWDYQWCAEMLMPFSKD 375
>gi|79326354|ref|NP_001031795.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|51968458|dbj|BAD42921.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|332661231|gb|AEE86631.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 477
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 131/449 (29%), Positives = 190/449 (42%), Gaps = 61/449 (13%)
Query: 4 SQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHF 63
S F L + I +S S +NGL L+PR R S L + + ++NQT+DH+
Sbjct: 3 SALGFALLSIFAILLSLSTLSNGL-LQPR----RISHGLTESSKYLTRDELWFNQTLDHY 57
Query: 64 SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIV 123
S P ++ F QRY + PI + E P N + + A +F A IV
Sbjct: 58 S--PSDHREFKQRYYEYLDHLRVPDGPIFMMICGEGPC--NGIPNDYITVLAKKFDAGIV 113
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA-------TH 176
LEHRYYGKS PF S A +N + Y +S QAL D A+ + +D N
Sbjct: 114 SLEHRYYGKSSPFKS--LATENLK---YLSSKQALFDLAAFRQYYQDSLNVKFNRSGDVE 168
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
P GASY G L+ WFRLK+PH+ GSLASSA + + + +
Sbjct: 169 NPWFFFGASYSGALSAWFRLKFPHLTCGSLASSAVVRAVYEFPEFD----------QQIG 218
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY------SEA 290
E K + E + + GL + ++ K L N TEL D +Y A
Sbjct: 219 ESAGPECKAALQETNKLLEL--GLKVNNRAVKA---LFNATELDVDADFLYLIADAEVMA 273
Query: 291 AQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-----INE 345
QY P ++C + A DD L + A V + G +S+ Y ++
Sbjct: 274 IQYGNPD-----KLCVPLVEAQKNRDD-LVEAYAKYVREFCVGVFGLSSKTYSRKHLLDT 327
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWV 403
T + +D W +Q C+E+ N + N ++ CK + GV P
Sbjct: 328 AVTPESADRLWWFQVCTEVAYFQVAPANDSIRSHQINTEYHLDLCKSLFGKGVYPEVDAT 387
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
YYG R + + IIF+NG +DP
Sbjct: 388 NLYYGSD------RIAATKIIFTNGSQDP 410
>gi|18419800|ref|NP_567999.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|334187220|ref|NP_001190936.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|14334574|gb|AAK59466.1| unknown protein [Arabidopsis thaliana]
gi|22136934|gb|AAM91811.1| unknown protein [Arabidopsis thaliana]
gi|51970170|dbj|BAD43777.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|51970244|dbj|BAD43814.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|51970332|dbj|BAD43858.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|51970726|dbj|BAD44055.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|51970916|dbj|BAD44150.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|332661230|gb|AEE86630.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|332661232|gb|AEE86632.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 488
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 131/449 (29%), Positives = 190/449 (42%), Gaps = 61/449 (13%)
Query: 4 SQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHF 63
S F L + I +S S +NGL L+PR R S L + + ++NQT+DH+
Sbjct: 3 SALGFALLSIFAILLSLSTLSNGL-LQPR----RISHGLTESSKYLTRDELWFNQTLDHY 57
Query: 64 SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIV 123
S P ++ F QRY + PI + E P N + + A +F A IV
Sbjct: 58 S--PSDHREFKQRYYEYLDHLRVPDGPIFMMICGEGPC--NGIPNDYITVLAKKFDAGIV 113
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA-------TH 176
LEHRYYGKS PF S A +N + Y +S QAL D A+ + +D N
Sbjct: 114 SLEHRYYGKSSPFKS--LATENLK---YLSSKQALFDLAAFRQYYQDSLNVKFNRSGDVE 168
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
P GASY G L+ WFRLK+PH+ GSLASSA + + + +
Sbjct: 169 NPWFFFGASYSGALSAWFRLKFPHLTCGSLASSAVVRAVYEFPEFD----------QQIG 218
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY------SEA 290
E K + E + + GL + ++ K L N TEL D +Y A
Sbjct: 219 ESAGPECKAALQETNKLLEL--GLKVNNRAVKA---LFNATELDVDADFLYLIADAEVMA 273
Query: 291 AQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-----INE 345
QY P ++C + A DD L + A V + G +S+ Y ++
Sbjct: 274 IQYGNPD-----KLCVPLVEAQKNRDD-LVEAYAKYVREFCVGVFGLSSKTYSRKHLLDT 327
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWV 403
T + +D W +Q C+E+ N + N ++ CK + GV P
Sbjct: 328 AVTPESADRLWWFQVCTEVAYFQVAPANDSIRSHQINTEYHLDLCKSLFGKGVYPEVDAT 387
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
YYG R + + IIF+NG +DP
Sbjct: 388 NLYYGSD------RIAATKIIFTNGSQDP 410
>gi|308452876|ref|XP_003089215.1| hypothetical protein CRE_21658 [Caenorhabditis remanei]
gi|308241640|gb|EFO85592.1| hypothetical protein CRE_21658 [Caenorhabditis remanei]
Length = 971
Score = 127 bits (320), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 131/496 (26%), Positives = 204/496 (41%), Gaps = 55/496 (11%)
Query: 19 STSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYV 78
S FH G+K R R I + + G E + Q IDHF+ ++ + F Q+Y
Sbjct: 239 SEDFHFFGMKPIARRFRGDEDEIFDTE----GLEIGMFRQRIDHFN--NKNTKFFQQKYF 292
Query: 79 INSKFWGGGNSPILAFLGAEAPIDDN--IQLSGFTYENAHQFKALIVILEHRYYGKSVPF 136
NS+F G P +G E+P + LS A ++ A++ +LEHR+YG SV
Sbjct: 293 KNSRFARPG-GPNFLMIGGESPAHGSHVKNLSSAIMRRAKEYGAIVYLLEHRFYGDSV-- 349
Query: 137 GSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRL 196
++N +S Q L D A + + K + T P I G SY G L+ W R
Sbjct: 350 ------VENNTDLTTLSSLQMLYDIAEFIKSVNFK-SETSNPWITFGGSYPGALSAWMRE 402
Query: 197 KYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
+P +VIG++ASSAP+L D + Y VV F C+Q IK + EI +
Sbjct: 403 IFPDLVIGAIASSAPVLAKTD---FYEYMMVVENSFLIYDPACYQEIKNGFDEIHELFQT 459
Query: 257 PDGLAILSKKFKTCTPLKNTTELKDG----LDTV--YSEAAQYDT------PSNIPVKRI 304
G LS FK P ++ D D + + A QY N + +
Sbjct: 460 DSGREKLSDLFKLNPPFRDNISEIDKHFFFFDIIGPFQGAVQYAGGGSGAFEENSEIAML 519
Query: 305 CNAIENAPNCGDDILCKIAAGVVEADSLE---YDGNNSRCYINEDRTGDESDEGWEWQSC 361
C I N + + K+ SL YD N + ++R D W+WQ+C
Sbjct: 520 CRNITNGTQSSAENVAKVVLDDFGNKSLIHSFYDKNEWKKMKKKNR-----DYLWKWQTC 574
Query: 362 SEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKL 414
SE D + ++ ++ C + +G + YGG D
Sbjct: 575 SEFGYFQSADSGNSIFGAMNPVSFQVQQCMDMFGKEYTRGKIEENVEATNYRYGGVD--- 631
Query: 415 ILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLV 474
+N++F NG DP+ G + +S+ +N SHC+D+ + +D D +
Sbjct: 632 --GFRGTNVVFINGDVDPWHILGLYNSTEKSVVSYL--INGTSHCVDMYPPQDNDIDGVK 687
Query: 475 QQRKTEVKIMQGWITQ 490
RK ++ W+ Q
Sbjct: 688 IARKLVDDNIKVWLEQ 703
>gi|51970574|dbj|BAD43979.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
Length = 488
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 131/449 (29%), Positives = 190/449 (42%), Gaps = 61/449 (13%)
Query: 4 SQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHF 63
S F L + I +S S +NGL L+PR R S L + + ++NQT+DH+
Sbjct: 3 SALGFALLSIFAILLSLSTLSNGL-LQPR----RISHGLTESSKYLTRDELWFNQTLDHY 57
Query: 64 SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIV 123
S P ++ F QRY + PI + E P N + + A +F A IV
Sbjct: 58 S--PSDHREFKQRYYEYLDHLRVPDGPIFMMICGEGPC--NGIPNDYITVLAKKFDAGIV 113
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA-------TH 176
LEHRYYGKS PF S A +N + Y +S QAL D A+ + +D N
Sbjct: 114 SLEHRYYGKSSPFKS--LATENLK---YLSSKQALFDLAAFRQYYQDSLNVKFNRSGDVE 168
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
P GASY G L+ WFRLK+PH+ GSLASSA + + + +
Sbjct: 169 NPWFFFGASYSGALSAWFRLKFPHLTCGSLASSAVVRAAYEFPEFD----------QQIG 218
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY------SEA 290
E K + E + + GL + ++ K L N TEL D +Y A
Sbjct: 219 ESAGPECKAALQETNKLLEL--GLKVNNRAVKA---LFNATELDVDADFLYLIADAEVMA 273
Query: 291 AQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-----INE 345
QY P ++C + A DD L + A V + G +S+ Y ++
Sbjct: 274 IQYGNPD-----KLCVPLVEAQKNRDD-LVEAYAKYVREFCVGVFGLSSKTYSRKHLLDT 327
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWV 403
T + +D W +Q C+E+ N + N ++ CK + GV P
Sbjct: 328 AVTPESADRLWWFQVCTEVAYFQVAPANDSIRSHQINTEYHLDLCKSLFGKGVYPEVDAT 387
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
YYG R + + IIF+NG +DP
Sbjct: 388 NLYYGSD------RIAATKIIFTNGSQDP 410
>gi|363814290|ref|NP_001242784.1| uncharacterized protein LOC100805858 precursor [Glycine max]
gi|255635884|gb|ACU18289.1| unknown [Glycine max]
Length = 488
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 125/453 (27%), Positives = 197/453 (43%), Gaps = 64/453 (14%)
Query: 1 MKSSQFSFQWLLLMVIFVSTSFHANGLKLRPR--LGRIRRSRILEQKDSNHGFETFFYNQ 58
MK ++ LL ++FVS+ + + PR L ++ + L ++ +++Q
Sbjct: 1 MKQEEWMITTALLSLLFVSSFPPLSYGVVPPRTLLNKLSEGKYLNTQE-------LWFDQ 53
Query: 59 TIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQF 118
T+DHFS P ++ F QRY ++ + PI +G E + N + + A +F
Sbjct: 54 TLDHFS--PYDHRQFRQRYYEFLDYFRIPDGPIFLVIGGEGIL--NGVANDYLAVLAKKF 109
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA---- 174
A +V LEHRYYGKS PF S + + Y +S QAL+D A + +D NA
Sbjct: 110 GAAMVTLEHRYYGKSTPFNSLET-----ENLKYLSSKQALSDLAVFRQYYQDSINAKLNR 164
Query: 175 --THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
P G SY G L+ WFRLK+PH+ GSLASSA +L + +T + + +
Sbjct: 165 AKIENPWFIFGGSYSGALSAWFRLKFPHLTCGSLASSAVVL---AVYNYTEFDQQIGE-- 219
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKT---CTPLKNTTELKDGLDTVYSE 289
EC + ++++ I++ LA K+ K L+ + L +
Sbjct: 220 -SAGPECKEALQETTQLIEH------KLATSGKELKASFDAADLEIDGDFFYFLADATAI 272
Query: 290 AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVE-------ADSLEYDGNN-SRC 341
A QY P ++C + A G+D++ A V E D YD R
Sbjct: 273 AFQYGNPD-----KVCKPLVEAKKAGEDLVDAYAKYVKEYYIGTFGTDVQTYDQKYLKRT 327
Query: 342 YINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPR 399
+NED S W +Q C+E+ N + ++ + CK + G+ P
Sbjct: 328 AMNEDN----SARLWWFQVCTEVAYFQVAPSNDSIRSSKVDIKYHFDLCKNVFGEGIFPD 383
Query: 400 PSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
YYGG I + S IIF+NG +DP
Sbjct: 384 VDATNLYYGGTKI------AGSKIIFTNGSQDP 410
>gi|51972041|dbj|BAD44685.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
Length = 488
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 131/449 (29%), Positives = 190/449 (42%), Gaps = 61/449 (13%)
Query: 4 SQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHF 63
S F L + I +S S +NGL L+PR R S L + + ++NQT+DH+
Sbjct: 3 SALGFALLSIFAILLSLSTLSNGL-LQPR----RISHGLTESSKYLTRDELWFNQTLDHY 57
Query: 64 SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIV 123
S P ++ F QRY + PI + E P N + + A +F A IV
Sbjct: 58 S--PSDHREFKQRYYEYLDHLRVPDGPIFMMICGEGPC--NGIPNDYITVLAKKFDAGIV 113
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA-------TH 176
LEHRYYGKS PF S A +N + Y +S QAL D A+ + +D N
Sbjct: 114 SLEHRYYGKSSPFKS--LATENLK---YLSSKQALFDLAAFRQYYQDSLNVKFNRSGDVE 168
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
P GASY G L+ WFRLK+PH+ GSLASSA + + + +
Sbjct: 169 NPWFFFGASYSGALSAWFRLKFPHLTCGSLASSAVVRAVYEFPEFD----------QQIG 218
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY------SEA 290
E K + E + + GL + ++ K L N TEL D +Y A
Sbjct: 219 ESAGPECKAALQETNKLLEL--GLKVNNRAVKA---LFNATELDVDADFLYLIADAEVMA 273
Query: 291 AQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-----INE 345
QY P ++C + A DD L + A V + G +S+ Y ++
Sbjct: 274 IQYGNPD-----KLCVPLVEAQKNRDD-LVEAYAKYVREFCVGVFGLSSKTYSRKHLLDT 327
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWV 403
T + +D W +Q C+E+ N + N ++ CK + GV P
Sbjct: 328 AVTPESADRLWWFQICTEVAYFQVAPANDSIRSHQINTEYHLDLCKSLFGKGVYPEVDAT 387
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
YYG R + + IIF+NG +DP
Sbjct: 388 NLYYGSD------RIAATKIIFTNGSQDP 410
>gi|395859218|ref|XP_003801939.1| PREDICTED: thymus-specific serine protease [Otolemur garnettii]
Length = 771
Score = 127 bits (318), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 122/470 (25%), Positives = 194/470 (41%), Gaps = 52/470 (11%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W + P+ LG E + + G
Sbjct: 61 WLEQLLDPFNS--SDRRSFLQRYWVNDQHWTHQDGPVFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A F AL++ LEHR+YG S+P G L A+ R + +S ALAD S L + +N
Sbjct: 119 APAFGALVISLEHRFYGLSIPAG----GLDVAQLR-FLSSRHALADVVSARLSLSRLFNV 173
Query: 175 TH-APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
+ +P I G SY G LA W RLK+PH++ S+ASSAP+ D ++ Y+ VVS+ +
Sbjct: 174 SSLSPWICFGGSYAGSLAAWARLKFPHLIFASIASSAPVRAVLD---FSEYNDVVSRSLK 230
Query: 234 DT----SEECFQTIKKSWSEIDNIASKPDGL-AILSKKFKTCTPL---KNTTELKDGLDT 285
S EC + +++E++ S + A L + C L ++ EL L
Sbjct: 231 SIAIGGSLECRAAVSAAFAEVERRLSAGGAVRAALRAELGACGSLSRSEDQAELLGELQA 290
Query: 286 VYSEAAQYDTPSNIPVK----------RICNAIENAPNCGDDILCKIAAGVVEADSLEYD 335
+ A QYD P+ N P G +I + L +
Sbjct: 291 LVGGAVQYDGQIGAPLSVRQLCGLLLGGGDNRSHPTPYGGLRRAVQIVMQSLGQKCLSFS 350
Query: 336 GNNSRCYI--NEDRTGDESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKE 392
+ + E + D W +Q+C+E +D + P L ++ C++
Sbjct: 351 RAETVAQLKATEPQVSGIGDRQWLYQTCTEFGFYVTCEDPMCPFSQLP-ALPSHLDLCEQ 409
Query: 393 QYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDL 447
+G+S + +YYGG + ++F NG DP W L
Sbjct: 410 VFGLSASSVAQAVAQTNSYYGGQTP------GATQVLFVNGDTDP-----WHVLSVTQAL 458
Query: 448 SFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDD 494
SQ L SHCLD+ + D L R+ + + W+T D
Sbjct: 459 GSSQSALLIPNASHCLDMAPERPLDSPSLQLGRQNISQQLLIWLTLAKDS 508
>gi|312380581|gb|EFR26537.1| hypothetical protein AND_07341 [Anopheles darlingi]
Length = 519
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 116/459 (25%), Positives = 194/459 (42%), Gaps = 61/459 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTY 112
++ Q +DH P + T+ QRY +N +F+ N SP+ +G E G
Sbjct: 70 WFEQILDH--NDPTNEATWQQRYYVNDQFFDSSNPHSPVFLMIGGEGEATARWMHEGAWI 127
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
A AL LEHR+YGKS P + + KN Y S QALAD A + + +KY
Sbjct: 128 HYAETHGALCFQLEHRFYGKSHP--TTDLSTKNL---AYLTSEQALADLAYFIEAMNEKY 182
Query: 173 NATHAPV--IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
IA G SY G LA W R KYP +V GS++SS P+L D + Y+ V +
Sbjct: 183 QLQPQTNLWIAFGGSYPGSLAAWLREKYPSLVHGSISSSGPLLAKID---FIEYYDTVVR 239
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-------LKDGL 283
S C + ++ + + + + G L+ KFK C P++ + + L +GL
Sbjct: 240 SLASYSPGCVEAVRSAMKQAETLLKHMIGQRTLNDKFKLCDPIERSIDNPLDVASLFEGL 299
Query: 284 DTVYSEAAQY---DTP-SNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS 339
+ ++ QY ++P + I + +C+ + N + + + E + + + N
Sbjct: 300 ASNFAGVVQYNKDNSPHATITIDEVCDVMMNT------TIGAPVSRLAEVNRMLLEQGNQ 353
Query: 340 RCY-------------INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKY 386
C I+ D W +Q+C+E + S + + +
Sbjct: 354 SCLDYVYDKSVRQMQNISWDSEVASGARQWTFQTCNEFGFYQTSNNASAVFGDRFPAEFF 413
Query: 387 IKNCKEQYGVS-PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
++ C + YG + Y + L +T+N+++ +G DP+ R G
Sbjct: 414 VRQCADIYGARFGEAALARGIYRTNVNYGALNPATTNVLYVHGSIDPWHRLG-------- 465
Query: 446 DLSFSQDLNL-------GSHCLDLDEAKKSDPDWLVQQR 477
L+ S DL+ +HC ++ E K+SD L Q R
Sbjct: 466 -LTESNDLHTPVIFIDGTAHCANMYEPKESDFPQLKQAR 503
>gi|301624087|ref|XP_002941341.1| PREDICTED: putative serine protease K12H4.7 [Xenopus (Silurana)
tropicalis]
Length = 516
Score = 126 bits (317), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 120/492 (24%), Positives = 216/492 (43%), Gaps = 62/492 (12%)
Query: 31 PRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP 90
P G + ++ + S+ +++ Q ++HFS G ++ + + QRY +N + G P
Sbjct: 51 PCYGFLAKAHKRQPTASSPSVPEYWFEQRLNHFS-GADT-RVWQQRYFMNDTYRQSGG-P 107
Query: 91 ILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRG 150
+ +G E P + SG A + AL ++LEHR+YGKS P ++ + N
Sbjct: 108 VFLMIGGEGPANPAWMTSGAWLTYAEKLGALCLMLEHRFYGKSHP--TQDISTDNLH--- 162
Query: 151 YFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
Y +S QALAD A +K+K + + G SY G L+ W+R+KYPH+ ++ASSA
Sbjct: 163 YLSSQQALADIAHFQTVMKEKLGLADSKWVVFGGSYPGSLSAWYRMKYPHLAHMAVASSA 222
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
P+ + ++ Y VV +C + IK + E+ + L+ F+ C
Sbjct: 223 PVKVQVN---FSEYLEVVQLALSLNHSDCPEAIKMASEEVSKLLVL-SSFQKLTDDFRLC 278
Query: 271 TPLK-----NTTELKDGLDTVYSEAAQYD---------TPSNIPVKRICNAIENAPNCGD 316
L+ +T L D L QY+ ++I ++ +C+ + N + G
Sbjct: 279 ELLQINSWMDTAYLLDTLAENIMNVVQYNKDNREFEGVKDTSITIQVVCDIMANK-SLGT 337
Query: 317 DI--LCKIAAGVVEADSLEYDGNNSRCYINEDR-----TGDESDEGWEWQSCSEMVVPMG 369
+ ++E + + + + +I + R + W +Q+C+E
Sbjct: 338 PYYRYANVIRTILEVNGEKCMDASYKKFIQDMRNITWASVSLGGRQWMYQTCTEFGFFQS 397
Query: 370 KDKNSMYQPEPWN---LTKYIKNCKEQYGVSPRPSWVL-------TYYGGHDIKLILRRS 419
D + +P++ L+ +++ C + +G S V YYGG DIK
Sbjct: 398 TDSAA----QPFSGIPLSYHVQQCSDIFGPEYNLSMVTDSVQQTNEYYGGLDIK------ 447
Query: 420 TSNIIFSNGMRDPFSRGGWVKTYHFFDLS---FSQDLNLGSHCLDLDEAKKSDPDWLVQQ 476
S IIF NG+ DP+ R G DLS + + +HC ++ A+ DP +
Sbjct: 448 GSRIIFPNGLIDPWHRLGINA-----DLSGDLIAIQMEGAAHCANMYPARLEDPPSVPSA 502
Query: 477 RKTEVKIMQGWI 488
R+ +++ W+
Sbjct: 503 RQYIFQLLTKWL 514
>gi|323454022|gb|EGB09893.1| hypothetical protein AURANDRAFT_10784, partial [Aureococcus
anophagefferens]
Length = 477
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 123/459 (26%), Positives = 196/459 (42%), Gaps = 77/459 (16%)
Query: 54 FFYNQTIDHFSYGPES-YQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY 112
F+++ +DHF S + + QRY ++ FWGG P+ ++G E P + F Y
Sbjct: 7 FYHDALLDHFESDVASPTRKWSQRYYVDESFWGGAGFPVFLYIGGEGP-QGPMSPRMFIY 65
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK--- 169
A + +AL+V LEHR+YG+S+P A + +A R Y SAQALAD A +++
Sbjct: 66 AQAKEHRALLVTLEHRFYGESLP----TANMDDANLR-YLASAQALADLARFRVYVSSYS 120
Query: 170 -DKYNATHAPV-------------IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYY 215
D +A P IA G SY G+LA WF+ KYP + G +ASSAP+
Sbjct: 121 PDAPDAASTPPLELKASPGMDSKWIAFGGSYPGDLAAWFKEKYPFLTAGVVASSAPVFAE 180
Query: 216 GDITPWT-TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK 274
D ++ ++ S C +++ ++++ + + A K + C +
Sbjct: 181 YDFAQYSEVVGDALAYPLIGGSPSCADAVRRG---VEDLVAALEAGAAPPKALEPCGSIA 237
Query: 275 NTTELKDGLDTVYSE---AAQYDTPSNIP-VKRICNAIENAPNCGDDILCKIAAGVVEAD 330
+ + +++ QY+ + P V +C+A++ AP+ I A + A
Sbjct: 238 SGVDRAQYYSSIFGNFQGVVQYNLEAGPPYVSDVCDAVDGAPS-------PIEA-LAAAT 289
Query: 331 SLEYDGNNSRC--------YINEDRT----GDESDEGWEWQSCSEMVVPMGKDKNSMYQP 378
SL + N + C Y++ R G +D W WQSC+E S +
Sbjct: 290 SL-FSSNGTACLSSDFEKDYVSVLRNATFDGVSADRQWIWQSCNEFGFFQTISPKSPFAA 348
Query: 379 EPWNL---TKYIKNCKEQYGV----SPRPSWVL----TYYGGHDIKLILRRSTSNIIFSN 427
L T C +GV PR +YGG ++ I NI N
Sbjct: 349 FGAYLNVSTAGRAVCSGGFGVDEYDGPRADAAGLVANAFYGGRTLQGI------NITAVN 402
Query: 428 GMRDPFSRGGWVK---TYHFFDLSFSQDLNLGSHCLDLD 463
G DP+ G V YH + SQ + G H ++LD
Sbjct: 403 GNMDPWHSLGIVNDTDAYH----APSQRTSAGVHVVELD 437
>gi|171847157|gb|AAI61646.1| LOC100145763 protein [Xenopus (Silurana) tropicalis]
Length = 494
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 120/492 (24%), Positives = 216/492 (43%), Gaps = 62/492 (12%)
Query: 31 PRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP 90
P G + ++ + S+ +++ Q ++HFS G ++ + + QRY +N + G P
Sbjct: 29 PCYGFLAKAHKRQPTASSPSVPEYWFEQRLNHFS-GADT-RVWQQRYFMNDTYRQSGG-P 85
Query: 91 ILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRG 150
+ +G E P + SG A + AL ++LEHR+YGKS P ++ + N
Sbjct: 86 VFLMIGGEGPANPAWMTSGAWLTYAEKLGALCLMLEHRFYGKSHP--TQDISTDNLH--- 140
Query: 151 YFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
Y +S QALAD A +K+K + + G SY G L+ W+R+KYPH+ ++ASSA
Sbjct: 141 YLSSQQALADIAHFQTVMKEKLGLADSKWVVFGGSYPGSLSAWYRMKYPHLAHMAVASSA 200
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
P+ + ++ Y VV +C + IK + E+ + L+ F+ C
Sbjct: 201 PVKVQVN---FSEYLEVVQLALSLNHSDCPEAIKMASEEVSKLLVL-SSFQKLTDDFRLC 256
Query: 271 TPLK-----NTTELKDGLDTVYSEAAQYD---------TPSNIPVKRICNAIENAPNCGD 316
L+ +T L D L QY+ ++I ++ +C+ + N + G
Sbjct: 257 ELLQINSWMDTAYLLDTLAENIMNVVQYNKDNREFEGVKDTSITIQVVCDIMANK-SLGT 315
Query: 317 DI--LCKIAAGVVEADSLEYDGNNSRCYINEDR-----TGDESDEGWEWQSCSEMVVPMG 369
+ ++E + + + + +I + R + W +Q+C+E
Sbjct: 316 PYYRYANVIRKILEVNGEKCMDASYKKFIQDMRNITWASVSLGGRQWMYQTCTEFGFFQS 375
Query: 370 KDKNSMYQPEPWN---LTKYIKNCKEQYGVSPRPSWVL-------TYYGGHDIKLILRRS 419
D + +P++ L+ +++ C + +G S V YYGG DIK
Sbjct: 376 TDSAA----QPFSGIPLSYHVQQCSDIFGPEYNLSMVTDSVQQTNEYYGGLDIK------ 425
Query: 420 TSNIIFSNGMRDPFSRGGWVKTYHFFDLS---FSQDLNLGSHCLDLDEAKKSDPDWLVQQ 476
S IIF NG+ DP+ R G DLS + + +HC ++ A+ DP +
Sbjct: 426 GSRIIFPNGLIDPWHRLGINA-----DLSGDLIAIQMEGAAHCANMYPARLEDPPSVPSA 480
Query: 477 RKTEVKIMQGWI 488
R+ +++ W+
Sbjct: 481 RQYIFQLLTKWL 492
>gi|330846607|ref|XP_003295109.1| hypothetical protein DICPUDRAFT_160267 [Dictyostelium purpureum]
gi|325074265|gb|EGC28366.1| hypothetical protein DICPUDRAFT_160267 [Dictyostelium purpureum]
Length = 547
Score = 126 bits (316), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 120/448 (26%), Positives = 195/448 (43%), Gaps = 42/448 (9%)
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP--ILAFLGAEAPIDDNIQLSGFT 111
+ + QTI+H SY ++ TF QRY +N KF P + + E P+ I
Sbjct: 7 YIFQQTINHLSY--DTIGTFEQRYSVNKKFLASNGKPKAVFFLVSGEGPLSSEIVNHNPF 64
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
A++ ALIV LE RYYG+S+PF + N + Y + Q L D A+ ++ +K
Sbjct: 65 VNIANETNALIVALELRYYGESMPFPNM-----NNSNMAYLTTDQILEDLANFQVYFTNK 119
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
Y I +G SY G ++ W+RLKYPH+V ++ASS+P + +T Y V ++
Sbjct: 120 YQLGDIKWIIMGCSYAGTISAWYRLKYPHLVTAAIASSSP---FRAELRFTEYDVKVRQN 176
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA 291
C + K ++ I+++ K + + + KF TC + L + +
Sbjct: 177 L---GAPCSKAFKNLFAYIEHLMLKNN--SYVKSKF-TCERQLDDRMFLYLLSEALTYSV 230
Query: 292 QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEA---DSLEYDGNNSRCYINEDRT 348
QYD I + C N DD+L +A V ++ D N + + D
Sbjct: 231 QYDARFKI-ISSFCPKFVKFTNSSDDLLDMFSAYVKNMFLFQNVSCDAYNLYEFASND-I 288
Query: 349 GDESDEGWEWQSCSE---MVVPMGKDKNSMYQPEPWNLTKYIKN-CKEQYGVSPRPSW-- 402
W WQ C E +VP G + ++P+ + + CK YG + RP+
Sbjct: 289 DYSGTRSWTWQLCREYGWFMVPSGPES---FKPQQLGECWWQNDVCKTLYGRAMRPTVDR 345
Query: 403 VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL--GSHCL 460
+ YG + K I SN++F+N DP+S + L FSQ + + SHC
Sbjct: 346 INMVYGSTNFKYI-----SNVLFTNCGNDPWST---LSIDPNLVLPFSQQIYIPGESHCA 397
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ + +D L R ++ +I
Sbjct: 398 NWLSEQPNDSLELKNARSLANSFLRQFI 425
>gi|410040428|ref|XP_003950808.1| PREDICTED: thymus-specific serine protease [Pan troglodytes]
Length = 541
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 134/529 (25%), Positives = 213/529 (40%), Gaps = 93/529 (17%)
Query: 17 FVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQR 76
F +S H GL L P + + LEQ +D F+ ++F QR
Sbjct: 37 FQESSAHGLGLSLGPGAAALPKVGWLEQ--------------LLDPFNV--SDRRSFLQR 80
Query: 77 YVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPF 136
Y +N + W G + PI LG E + + G A + AL++ LEHR+YG S+P
Sbjct: 81 YWVNDQHWVGQDGPIFLLLGGEGSLGPGSVMRGHPAALAPAWGALVISLEHRFYGLSIPA 140
Query: 137 GSRKAA-LKNARHRGYFNSAQA---------------LADYASILLHIKDKYN-ATHAPV 179
G + A L+ R + LAD S L + +N ++ +P
Sbjct: 141 GGLEMAQLRFLSSRLAMGKSSGIPSDEDRPSPPFDPRLADVVSAHLALSRLFNISSSSPW 200
Query: 180 IAIGASYGGELATWFRLK------YPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
I G SY G LA W RLK +PH++ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 201 ICFGGSYAGSLAAWARLKVLRLLRFPHLIFASVASSAPVRAVLD---FSEYNDVVSRSLM 257
Query: 234 DT----SEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDT 285
T S EC + +++E++ + S A L + C PL +N EL L
Sbjct: 258 STAIGGSLECRAAVSVAFAEVERRLRSGGAAQAALQTELSACGPLGRAENQAELLGALQA 317
Query: 286 VYSEAAQYDTPSNIP--VKRIC--------NAIENAPNCGDDILCKIAAGVVEADSLEYD 335
+ QYD + P V+++C N + P CG +I + L +
Sbjct: 318 LVGGVVQYDGQTGAPLSVRQLCGLLLGGGGNRSHSTPYCGLRRAVQIVLHSLGQKCLSFS 377
Query: 336 GNNSRCYI--NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ 393
+ + E + D W +Q+C+E + + + L + C++
Sbjct: 378 RAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCENPRCPFSQLPALPSQLDLCEQV 437
Query: 394 YGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+G+S + +YYGG + ++F NG DP +H LS
Sbjct: 438 FGLSALSVAQAVAQTNSYYGGQT------PGANKVLFVNGDTDP---------WHV--LS 480
Query: 449 FSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+Q L GSHCLD+ + SD L R+ + +Q W+
Sbjct: 481 VTQALGSSESTLLIRTGSHCLDMAPERPSDSPSLRLGRQNIFQQLQTWL 529
>gi|118487876|gb|ABK95760.1| unknown [Populus trichocarpa]
Length = 485
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 139/519 (26%), Positives = 232/519 (44%), Gaps = 80/519 (15%)
Query: 7 SFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFET--FFYNQTIDHFS 64
+ + + + ++ +ST+ +A+ L LR ++ + E +S + T ++NQT+DHFS
Sbjct: 3 TMKLVFISLLLLSTAPYAS-LALRTQVHSLS-----ESSNSKNYLTTQELWFNQTLDHFS 56
Query: 65 YGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVI 124
P + FPQRY ++ + PI + E+ N ++ + A +F A +V
Sbjct: 57 --PFDHHKFPQRYYEFLDYFRISDGPIFLEICGESSC--NGIVNDYISVLAKKFGAAVVS 112
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASIL------LHIKDKYNATHAP 178
LEHRYYG+S+PF S +N R + +S QAL D A L++K + P
Sbjct: 113 LEHRYYGRSLPFKS--TTTENLR---FLSSKQALFDLAVFRQYYQESLNLKLNRTSVENP 167
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
G SY G L+ WFRLK+PH+ GSLASSA +L I +T + + + E
Sbjct: 168 WFVFGGSYAGALSAWFRLKFPHLTCGSLASSAVVL---AIHNFTEFDQQIGE---SAGAE 221
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY--SEAA----Q 292
C T++++ ++ L+ + L + EL+ D +Y ++AA Q
Sbjct: 222 CKATLQETTQLVEE---------RLASNKQAVKALFDAAELEIDGDFLYFLADAAVIAFQ 272
Query: 293 YDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDES 352
Y P +C+ + A N GDD L + A V+ L G++ + Y N+ D S
Sbjct: 273 YGNP-----DIVCSTLVKAKNNGDD-LVEAYAKYVKEYYLGTFGSSVQTY-NQKYLKDTS 325
Query: 353 ------DEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT- 405
D W +Q C+E+ N + + ++ CK +G P +T
Sbjct: 326 LNKHTGDRLWWFQVCTEVAYFQVAPSNDSIRSSKVDARYHLDLCKNVFGEGIYPEVDVTN 385
Query: 406 -YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDL-SFSQDLNLGSHCLDLD 463
YYGG +I S S I+F+NG +DP+ K D+ SF + HC D+
Sbjct: 386 IYYGGTNI------SGSKIVFANGSQDPWRHAS--KQTSSPDMPSFLISCHNCGHCTDIR 437
Query: 464 EAKK------------SDPDWLVQQRKTEVKIMQGWITQ 490
+ S P+ + + R ++ M W+++
Sbjct: 438 GCPQTPLSLEGNARNCSSPEAVEKVRHQIIEKMDLWLSE 476
>gi|308478046|ref|XP_003101235.1| hypothetical protein CRE_14134 [Caenorhabditis remanei]
gi|308263940|gb|EFP07893.1| hypothetical protein CRE_14134 [Caenorhabditis remanei]
Length = 1064
Score = 125 bits (315), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 123/492 (25%), Positives = 202/492 (41%), Gaps = 48/492 (9%)
Query: 19 STSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYV 78
S FH G+K R R + + + G E + Q IDHF+ ++ + F Q+Y
Sbjct: 547 SEDFHFFGMKPIARKFRGDEDEVFDTE----GMEIGMFRQRIDHFN--NKNTKFFQQKYF 600
Query: 79 INSKFWGGGNSPILAFLGAEAP-IDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFG 137
NS+F G P +G E P ++ L+ A ++ + +LEHR+YG SV
Sbjct: 601 KNSRFARPG-GPNFLMIGGEGPEYGHDVNLNSSIMRRAEEYGGTVYVLEHRFYGDSV--- 656
Query: 138 SRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLK 197
++N +S Q L D A + + K + T P I G SY G L+ W R
Sbjct: 657 -----VENNTDLSTLSSLQMLYDLAEFIKSVNFK-SETSNPWITFGGSYPGALSAWMREI 710
Query: 198 YPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKP 257
+P +VIG++ASSAP+L D + Y VV F C+Q IK + EI +
Sbjct: 711 FPDLVIGAIASSAPVLAKTD---FYEYMMVVENSFLRYDSACYQEIKNGFDEIHELFQTD 767
Query: 258 DGLAILSKKFKTCTPLKNTTELKDG----LDTV--YSEAAQY------DTPSNIPVKRIC 305
G LS FK P ++ D D + + A QY + + +C
Sbjct: 768 SGREKLSVLFKLNPPFRDNISESDKHFFFFDIIGPFQFAVQYAGRGSGGFVEDSKIAMLC 827
Query: 306 NAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMV 365
I N + + K+ + S+ + + + + +T + + W WQ+CSE
Sbjct: 828 RNITNGTQSSVENVAKVVLDDFKNKSIIHSFYDKNKWKHMKKTNE--NYLWRWQTCSEFG 885
Query: 366 VPMGKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLILRR 418
D + ++ ++ C E +G + YGG D
Sbjct: 886 YFQSADSGNSIFGAMKPVSFQVQRCMEMFGKEYTRGKIEENVEATNYRYGGVD-----GF 940
Query: 419 STSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRK 478
+N++F NG DP+ G + +S+ +N SHC+D+ + +D D + RK
Sbjct: 941 RGTNVVFINGDVDPWHILGLYNSTEKSVVSYL--INGTSHCVDMYPPQDNDIDGVKIARK 998
Query: 479 TEVKIMQGWITQ 490
++ W+ Q
Sbjct: 999 LVDDNIKVWLEQ 1010
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 69/270 (25%), Positives = 114/270 (42%), Gaps = 18/270 (6%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF-TYENAH 116
Q +D S PES + F QRY + F + L G + + ++ G + A
Sbjct: 51 QPLDPLS--PESTKKFQQRYRYSEHFTSNKKTAFLYVSGRDDFNEAVLKNDGSPLVKAAE 108
Query: 117 QFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA-T 175
+F A I LEHRYYG S P ++ +H +S A+ D + H ++ +
Sbjct: 109 RFGATIFALEHRYYGNSTP-NFENFTSESLQH---LDSYHAIQDVIFFIEHANTQFKMDS 164
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
+ G+ YGG +A R P V G +A SAPI D W Y++ V K
Sbjct: 165 DVRWVLFGSGYGGIIAAETRKWDPITVSGVVAISAPIEREMDF--W-KYNNKVEKTIMKY 221
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP-------LKNTTELKDGLDTVYS 288
C+ IKK + ++ ++ + +G LS F+ P L + +
Sbjct: 222 DSSCYNKIKKGFRQVQDLMNFSEGRNELSDLFELNPPWNERDLALNEIQMFYLSIIAPFQ 281
Query: 289 EAAQYDTPSNIPVKRICNAIENAPNCGDDI 318
+ QYD + +K +C+AI ++ + + I
Sbjct: 282 QVVQYDNQLELSIKGLCDAIHDSRDSVEAI 311
>gi|52789079|gb|AAT09104.1| serine peptidase [Bigelowiella natans]
Length = 546
Score = 125 bits (314), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 94/310 (30%), Positives = 150/310 (48%), Gaps = 40/310 (12%)
Query: 41 ILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP 100
+LE ++ F+ N +DHF G + + QRY ++S WGG P+ ++G E P
Sbjct: 45 LLESHAGSNSTTHFYKNALLDHFG-GLSDEKHWLQRYYVDSSQWGGEGYPVFLYIGGEGP 103
Query: 101 IDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRK-AALKNARHRGYFNSAQALA 159
+ S F YE A + KAL++ LEHR+YG+S P A LK + S QAL
Sbjct: 104 -QGPVSSSLFMYELAVEHKALVLALEHRFYGESRPVEDMSDANLK------FLTSHQALG 156
Query: 160 DYASILLHIKD----------------KYNATHAPVIAIGASYGGELATWFRLKYPHVVI 203
D A + +IK +A +P +A G SY G LA WF+LKYP VVI
Sbjct: 157 DLARFVEYIKAYDPNVNDAKSSPPLSLPASAQESPFVAFGGSYPGNLAAWFKLKYPSVVI 216
Query: 204 GSLASSAPILYYGDITPWTTYHSVVSKDFR----DTSEECFQTIKKSWSEIDNI--ASKP 257
GS+ASSAP+ D + Y VV + S++C+ ++K+ + + + ++ P
Sbjct: 217 GSVASSAPVFAEYD---FAEYGGVVGRALSYPLIGGSDQCYSAVEKAVTTLKTLLDSTTP 273
Query: 258 DGLA-ILSKKFKTCTPLKNTTEL---KDGLDTVYSEAAQYDTPSNIP-VKRICNAIENAP 312
G + + + C+P+ +L + + + QY+ + P V +C A+ +
Sbjct: 274 AGSSDKIPSYLRPCSPIGGPLDLATYEAQIFGAFQGVVQYNLENRPPYVSDLCTAMTDG- 332
Query: 313 NCGDDILCKI 322
N DDIL ++
Sbjct: 333 NDDDDILLRL 342
>gi|431892225|gb|ELK02665.1| Thymus-specific serine protease [Pteropus alecto]
Length = 515
Score = 125 bits (313), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 112/457 (24%), Positives = 193/457 (42%), Gaps = 63/457 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W + P+ LG E + + G
Sbjct: 59 WLEQALDPFNA--SDRRSFLQRYWVNEQHWASRDGPVFLHLGGEGSLGPGAVMRGHPAAL 116
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G A H + +S ALAD S L + N
Sbjct: 117 APAWGALVIGLEHRFYGLSIPAGGLDMA-----HLRFLSSRHALADVVSARLALSRLLNV 171
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P + G SY G LA W RLK+PH+V ++ASSAP+ D + + + S F
Sbjct: 172 SSSSPWVCFGGSYAGSLAAWARLKFPHLVFAAVASSAPVRAVLDFSAYNECRAAASSAF- 230
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQY 293
E + ++ + + ++ A L ++ ++ EL L + A QY
Sbjct: 231 ---AEVARRLRAGEAARAALRAELGACASLDRE-------EDRAELLGALQALVGGAVQY 280
Query: 294 DTPSNIP--VKRICNAIEN-------APNCGDDILCKIAAGVVEADSLEYDGNN--SRCY 342
D + P V+++C ++ AP G ++ + L + ++
Sbjct: 281 DGQAGAPLSVRQLCGLLQAGGNRSRPAPYRGLRQAVQVVLHSLGQKCLSFSRAETVAQLR 340
Query: 343 INEDRTGDESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP- 400
+ E + D W +Q+C+E +D + P L ++ C++ +G+S
Sbjct: 341 VTEPQVSSVGDRQWLYQTCTEFGFYVTCEDPGCPFSQLP-ALPSQLELCEQVFGLSTSSV 399
Query: 401 ----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL- 455
+ +YYGG + ++F NG DP +H LS +Q L
Sbjct: 400 AQAVAQTNSYYGGQT------PGATQVLFVNGDADP---------WHV--LSVTQALGPS 442
Query: 456 --------GSHCLDLDEAKKSDPDWLVQQRKTEVKIM 484
SHCLD+ + SD L R++ V+I+
Sbjct: 443 VSALLIPSASHCLDMAPERPSDSPSLRLGRQSPVQIL 479
>gi|167375432|ref|XP_001733641.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165905185|gb|EDR30254.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 480
Score = 125 bits (313), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 120/461 (26%), Positives = 194/461 (42%), Gaps = 56/461 (12%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPID-DNIQLS 108
+T Y +DHF+ + F +Y I++ + N+P+ LG E P D +Q
Sbjct: 38 KTLTYTVPLDHFNVNNQI--DFDIQYFISTDYLDNNSPNAPLFVLLGGEGPEDATGLQNY 95
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
+ A + K L++ +EHR+YG S P + Y + QAL DY ++ H+
Sbjct: 96 FVVTDLAKKHKGLMLSVEHRFYGASTPSLEMDKLI-------YCTAEQALMDYVEVISHV 148
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
+++ N PVI +G SY G LA W R KYP+VV G+ ASSAP+ + + Y VV
Sbjct: 149 QEENNLVGHPVIVLGGSYSGNLAAWMRQKYPNVVEGAWASSAPV---EAVVDFYQYLEVV 205
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTV 286
+ + + W D + + +G L K F TCT K+ + + T
Sbjct: 206 QNALPKNTADLLSFAFEKW---DEMTTTEEGRKELGKIFNTCTEFGEKDIQTFAESIGTA 262
Query: 287 YSEAAQYDT----PSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY 342
S QY++ S IC I +D++ K + E + E+
Sbjct: 263 LSGYVQYNSSNWKSSYESTDSICTEI------NEDVVNKYPLFIKEKYNPEWGDKECTSS 316
Query: 343 INED--------RTGDESDEG-----WEWQSCSEMVVPMGKDKNSMYQPEPWN-LTKYIK 388
E+ T E +EG W +Q+C + S + N L I
Sbjct: 317 SQEESYKTLQSTSTYAEGNEGAAGRSWFFQTCIAYGYYQAVSEQSSVKWGKLNQLQGSID 376
Query: 389 NCKEQYGVSPRPSW-----VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
CK+ YG+ + + YGG + +N+ F+NG DP+ G ++ H
Sbjct: 377 MCKDIYGIDKDTLYNAVDHINVRYGGK------KPCVTNVAFTNGNTDPWHALGVTESDH 430
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIM 484
+ + Q ++ SHC DL K++D L + R E+K +
Sbjct: 431 -QEGNLVQLIDRTSHCSDLYSEKETDVPELKKARHNELKFI 470
>gi|344268141|ref|XP_003405921.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 429
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 124/459 (27%), Positives = 185/459 (40%), Gaps = 78/459 (16%)
Query: 76 RYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVP 135
RY IN F+ G L G E + I ++ A + AL ++LEHR+YG S P
Sbjct: 5 RYFINDAFYKPGGPVFLNIGGPETACESWISMNNTWVTYAERLGALFLLLEHRFYGHSQP 64
Query: 136 FGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFR 195
G L A R Y +S QALAD + I K T +A G YGG LA W R
Sbjct: 65 TGD----LSTASLR-YLSSRQALADIVNFRTKIAKKMGLTENKWVAFGCWYGGFLAAWSR 119
Query: 196 LKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIAS 255
+KYP + ++ SSAP+ + + Y VV + + ECFQT+K+++ ++ +
Sbjct: 120 IKYPELFAAAVGSSAPMQAKAN---FYEYLEVVQRSLATHNSECFQTVKEAFKQVVKMMK 176
Query: 256 KPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPN-- 313
P+ + L F C PLK L +D + + PVK +N N
Sbjct: 177 LPEFYSKLEDDFTLCKPLK----LYSAMDKAF-----FLERLIFPVKTAVQLNKNKKNYK 227
Query: 314 -----CGDDILCKIAAGV---------VEADSLEYDGNNSRC----YINE---------D 346
D LC + A V L ++ S C Y N+ D
Sbjct: 228 GEQVFISMDDLCDMMANTSLGSPYHRYVRIIHLLFETEYSPCLAANYKNKLKAFLDPSID 287
Query: 347 RTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSW--- 402
+D +Q C+E D KN + P L+ +++ C + + P+ ++
Sbjct: 288 HYNPPTDRQQFYQFCTEFGFFQTTDSKNQPFTGLP--LSYFVQQCSDFF--DPKFNYDSL 343
Query: 403 ------VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL- 455
YY G + + S IIF NG DP+ G K ++DL
Sbjct: 344 KKGVKSTNAYYSGFKV------TGSKIIFPNGSFDPWHVLGIPK-------DITKDLPAV 390
Query: 456 ----GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
HC DL + K D L+Q R+ +I+Q W+ Q
Sbjct: 391 FIKGAGHCADLYKQKDIDSTELIQARERIFQILQKWLKQ 429
>gi|397492345|ref|XP_003817084.1| PREDICTED: dipeptidyl peptidase 2 [Pan paniscus]
Length = 406
Score = 124 bits (312), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 93/321 (28%), Positives = 152/321 (47%), Gaps = 28/321 (8%)
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
YGG L+ + R+KYPH+V G+LA+SAP+L + + V+ DF S +C Q +++
Sbjct: 77 YGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTADFEGQSPKCTQGVRE 136
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVYSEAAQYDTPSNIPVK 302
++ +I ++ + + +F TC PL K+ T+L ++ A D P P
Sbjct: 137 AFRQIKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFARNAFTVLAMMDYP--YPTD 193
Query: 303 RICNAIENAPNCGDDILCKIA---AGVVEADSLEYDGNNSR-CY-------INEDRTG-- 349
+ N G D L A G+ L Y+ + S CY D TG
Sbjct: 194 FLGPLPANPVKVGCDRLLSEAQRITGLRALAGLVYNASGSEHCYDIYRLYHSCADPTGCG 253
Query: 350 -DESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYY 407
W++Q+C+E+ + + M+ P+ + C + +GV PRP W+LT +
Sbjct: 254 TGPDARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQRYCLDTWGVWPRPDWLLTSF 313
Query: 408 GGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKK 467
G D+ R+ SNIIFSNG DP++ GG + ++ + + G+H LDL +
Sbjct: 314 WGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIAVT--IQGGAHHLDLRASHP 366
Query: 468 SDPDWLVQQRKTEVKIMQGWI 488
DP +V+ RK E I+ W+
Sbjct: 367 EDPASVVEARKLEATIIGEWV 387
>gi|307196628|gb|EFN78125.1| Putative serine protease K12H4.7 [Harpegnathos saltator]
Length = 429
Score = 124 bits (312), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 116/464 (25%), Positives = 205/464 (44%), Gaps = 76/464 (16%)
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+ + QRY +NS ++ N P+ +G E I + G + A + A+ +EHRYY
Sbjct: 2 RVWKQRYFVNSDYYKP-NGPVFLMIGTEK-IKPKWMVEGLWIDYAKELGAMCFYVEHRYY 59
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAPVIAIGASYGGE 189
GKS P + + + +S AL D+A + +I +Y I G SYGG
Sbjct: 60 GKSHP-----TVDLSTDNLTFLSSEIALQDFAYFIRNINIEYKFPNDTKWIVFGGSYGGS 114
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
LA W RLKYPH V G++++S P+L D + Y+ VV + S++C + + +E
Sbjct: 115 LAAWMRLKYPHFVHGAVSASGPLLALID---FQEYYVVVEDALKQHSQQCVDAVANANTE 171
Query: 250 IDNIASKPDGLAILSKKFKTCTPLK--NTTELKDGLDTVYSEAA----------QYDTPS 297
+ G +++KF+ C P+ +T ++ + ++ + A Q +
Sbjct: 172 FHTMLHHLTGQEQIAEKFRLCDPIDPGHTADISNLYQSLANNFAYIVQNNKNNRQESKTA 231
Query: 298 NIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC-------YINEDRTGD 350
NI V IC+ + N D L + + +S+ + +C I+ R+ +
Sbjct: 232 NINVDTICDVLTN------DELGRPVDRLAYMNSMILNATKEKCLDYKYDNMIHSLRSIN 285
Query: 351 -----ESDEGWEWQSCSEMVVPMGKDKNSMYQP----EPWNLTKYIKNCKEQYGVSPRPS 401
E + W +Q+CSE +G + S +P E + + Y++ C + +G PS
Sbjct: 286 WNEQVEGERQWMYQTCSE----VGFFQTSTARPKLFSETFPVDFYVQQCVDIFG----PS 337
Query: 402 WVL-----------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
+ L T YG L + SN++ +G DP+ G K+ + ++
Sbjct: 338 YNLDMLKSVVTRTNTLYGA------LNQKVSNVVHVHGSLDPWHTLGITKSSNHPQVAIY 391
Query: 451 QDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDD 494
+N +HC L + + DP L Q R +++G I Q+ D+
Sbjct: 392 --INDTAHCAILYPSSEKDPPQLKQAR----IVVKGLIKQWLDN 429
>gi|240254469|ref|NP_179399.5| serine carboxypeptidase S28-like protein [Arabidopsis thaliana]
gi|322518656|sp|Q1PF50.2|EDA2_ARATH RecName: Full=Probable serine protease EDA2; AltName: Full=Protein
EMBRYO SAC DEVELOPMENT ARREST 2; Flags: Precursor
gi|330251630|gb|AEC06724.1| serine carboxypeptidase S28-like protein [Arabidopsis thaliana]
Length = 489
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 130/473 (27%), Positives = 204/473 (43%), Gaps = 76/473 (16%)
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPI----DDNIQLSG 109
++NQT+DH S P ++ F QRY ++ + P+ + E P +D I +
Sbjct: 48 LWFNQTLDHES--PNDHRKFRQRYYEFMDYFRSPDGPMFMIICGEGPCSGIANDYINVL- 104
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
A +F+A +V LEHRYYGKS PF S A +N + Y +S QAL D AS + +
Sbjct: 105 -----AKKFQAGVVSLEHRYYGKSSPFNS--LATENLK---YLSSKQALYDLASFRQYYQ 154
Query: 170 DKYN--------ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
+ N + P G SY G L+ WFRLK+PH+ GSLASSA + + + +
Sbjct: 155 ESLNKKLNISSGGSDNPWFFFGISYSGALSAWFRLKFPHLTCGSLASSAVVRAIYEFSEF 214
Query: 222 TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
+ E Q K + E + + GL + +K K+ L N TEL
Sbjct: 215 D----------QQIGESAGQECKLALQETNKLLEL--GLKVKNKAVKS---LFNATELDV 259
Query: 282 GLDTVY--SEAA----QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEAD----S 331
D +Y ++AA QY P ++C + A G D++ + V E
Sbjct: 260 DADFLYLTADAAVMAFQYGNPD-----KLCVPLVEAKKNGSDLVVTYSTYVREYCMRIWG 314
Query: 332 LEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCK 391
L N + N T D + W +Q+C+E+ K + N T ++ CK
Sbjct: 315 LRVRTYNRKHLRNTVVTADSAYRLWWFQACTELGYFQVAPKYDSVRSHQINTTFHLDLCK 374
Query: 392 EQYG--VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
+G V P+ YYGG R + + IIF+NG DP+ + H
Sbjct: 375 SLFGKDVYPKVDATNLYYGGD------RLAATKIIFTNGSEDPWRHASKQNSTHEMPSYI 428
Query: 450 SQDLNLGSHCLDLD---------EAKKSD---PDWLVQQRKTEVKIMQGWITQ 490
+ N G H D+ E K ++ PD++ + R+ V+ + W+++
Sbjct: 429 IKCRNCG-HGSDIRGCPQSPMVIEGKSNNCSLPDYVNKVRQQMVEHIDLWLSE 480
>gi|66816525|ref|XP_642272.1| hypothetical protein DDB_G0278299 [Dictyostelium discoideum AX4]
gi|60470342|gb|EAL68322.1| hypothetical protein DDB_G0278299 [Dictyostelium discoideum AX4]
Length = 635
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 119/449 (26%), Positives = 199/449 (44%), Gaps = 44/449 (9%)
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP--ILAFLGAEAPIDDNIQLSGFT 111
+ + QTI+H SY ++ TF QR+ +N KF P + + E P+ I
Sbjct: 74 YIFQQTINHLSY--DTIGTFEQRFSVNKKFVPINGKPKAVFFLVSGEGPLSSEIVNHNPF 131
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+ A++ +ALIV LE RYYG+S+PF + N + Y + Q L D A+ + +K
Sbjct: 132 VQIANETQALIVALELRYYGESMPFLNM-----NNSNMAYLTTDQILEDLATFQVFFTNK 186
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
Y I +G SY G ++ W+RLKYPH+V ++ASS+P + ++ +T Y V ++
Sbjct: 187 YQLNDIKWIIMGCSYAGTISAWYRLKYPHLVTAAIASSSP--FRAELR-FTEYDVKVRQN 243
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA 291
C + K +S I+++ K + + + KF TC + L + +
Sbjct: 244 L---GAPCSKAFKNLFSYIEHLMIKNN--SYVKSKF-TCERQLDDRMFLYLLSEALTYSV 297
Query: 292 QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDE 351
QYD I + C N + +L ++ V + + + Y + +E
Sbjct: 298 QYDARFKI-ISGFCPKFVKLTNSSEALLDMFSSYV--KNMFLFQNVSCDAYNLYEFASNE 354
Query: 352 SD----EGWEWQSCSE---MVVPMGKDKNSMYQPEPWNLTKYIKN-CKEQYGVSPRPSW- 402
D W WQ C E +VP G D ++P+ + + CK YG + RP+
Sbjct: 355 IDYSGTRSWTWQLCREYGWFMVPSGPDS---FKPQSLGECWWQNDVCKTLYGRAMRPTVD 411
Query: 403 -VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL--GSHC 459
+ YG + K I SN++F+N DP+S + L FSQ +++ SHC
Sbjct: 412 RINMVYGSTNFKYI-----SNVLFTNCGNDPWSTLSIDPS---VSLPFSQQIHIPGESHC 463
Query: 460 LDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ + SD L R ++ +I
Sbjct: 464 ANWLSEQPSDSIELKNARALANSFLRQFI 492
>gi|345323658|ref|XP_001512518.2| PREDICTED: putative serine protease K12H4.7-like [Ornithorhynchus
anatinus]
Length = 489
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 114/452 (25%), Positives = 189/452 (41%), Gaps = 64/452 (14%)
Query: 76 RYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVP 135
+Y IN F+ G P+ +G E P + G A + AL ++LEHR+YGKS P
Sbjct: 65 QYFINDTFYKTG-GPVFLMIGGEGPANPAWMHHGTWLTYAEKLGALCLLLEHRFYGKSHP 123
Query: 136 FGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFR 195
++ + N Y +S QALAD A + +K +A G SY G LA WFR
Sbjct: 124 --TQDLSTSNLH---YLSSRQALADLAHFRTVMAEKLGLVDNKWVAFGGSYPGSLAAWFR 178
Query: 196 LKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIAS 255
LKYPH+V ++A+SAPI + + Y VV K R EC + ++++ + +
Sbjct: 179 LKYPHLVDVAVATSAPIY---AVMNFPEYLEVVQKSLRTFRRECPEIVREASDTVVEMLK 235
Query: 256 KPDGLAILSKKFKTCTPLKNTTELK-----DGLDTVYSEAAQYDTP---------SNIPV 301
++K F C L+ +++ + L + + + QY+ +NI +
Sbjct: 236 YKANYFKITKDFNLCEQLQIKSKMDSAYFLESLASNFMDVVQYNEDNRAFEGVKGTNITI 295
Query: 302 KRICNAIEN----APNCGDDILCKIAAGVVEADSLEYDGNNSRCYINE------DRTGDE 351
+C+ + + P + ++ E L+ +N Y+ E D
Sbjct: 296 NVLCDIMSDISLGTPYDRYATVVRLLLNTFEMTCLDASYSN---YVQEMTNFSWDGPAAT 352
Query: 352 SDEGWEWQSCSEMVVPMGKDKNSMYQP-EPWNLTKYIKNCKEQYGVSPRPSWVLTY---- 406
W +Q+C+E D S QP + L +++ C + YG + +
Sbjct: 353 GGRQWVYQTCTEFGFFQSTD--SKKQPFSGFPLHYHLQQCSDIYGQEFNNTLIANAIRDT 410
Query: 407 ---YGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG-----SH 458
YGG +I + S I+F NG DP+S G + S+DL +H
Sbjct: 411 NENYGGFNI------TGSRIVFPNGSIDPWSALGVIS-------DISKDLPAAFIKGTAH 457
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
C + + D LV R +++Q W+ Q
Sbjct: 458 CAIMYPERTEDSAELVNARVRVFRLLQKWLKQ 489
>gi|123418467|ref|XP_001305332.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121886845|gb|EAX92402.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 504
Score = 124 bits (311), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 114/460 (24%), Positives = 216/460 (46%), Gaps = 56/460 (12%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+ +++Q +DHFS + +TF QRY IN+ + + ++ ++G EAP+ + S
Sbjct: 26 DQMWFDQKLDHFS---DLAETFKQRYYINTNY-SKKSKNLVVYIGGEAPLLE----SSLK 77
Query: 112 YENAH---QFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
Y+ H K++I+ LEHRY+G+S+P G+ + L+N + Y QA+ D A+ + +
Sbjct: 78 YDVQHIASVTKSVILALEHRYFGESIPHGNLE--LENFK---YLTVDQAIEDLANFITQM 132
Query: 169 KDKY--NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHS 226
K Y +A+ + +G SY G L++ FR K+P + +GS ASSAPI + + + +
Sbjct: 133 KQNYCQDASKCKALMVGGSYPGALSSRFRQKHPELTLGSWASSAPIHSQNNFSEYDKHE- 191
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIA-SKPDGLAILSKKF---KTCTPLKNTTELKDG 282
++D++D C+ K++ I+ I K + + +KF K + N+ +
Sbjct: 192 --AEDYKDYG--CYDNALKAYKTIERITLLKNEKTEEMMEKFGVPKDAQFVNNSVDFLGM 247
Query: 283 LDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAA----GVVEADSLEYDGNN 338
VYS QY + ++ +C + DD + + A +V D+ + NN
Sbjct: 248 FSDVYSYGNQYKAYNKFLLE-MCEKFKKIDMSNDDEVINVMADTSNSIVGKDN--FFNNN 304
Query: 339 SRCYINEDRTGD-ESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVS 397
N D +S W + +C+E+ + + + E + + +CKE +G +
Sbjct: 305 IEFLKNTSIYSDSKSSRSWMYMTCNEL--GWFSSASGLLRSELLTIETSLDSCKELFGFT 362
Query: 398 --PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFS-----RGGWVKTYHFFDLSFS 450
P YGG++ + + ++++N DP+S R K+ +SF
Sbjct: 363 QFPDTEKFNEKYGGYN------PNVTKVVYTNSHYDPWSELTMKRNDTEKSI----ISF- 411
Query: 451 QDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
++ G HC DL + D ++L R+ +K + W+ +
Sbjct: 412 -NIKDGFHCDDLHDPSDGDSEYLKSVREETIKQLLAWMKE 450
>gi|308808360|ref|XP_003081490.1| Prolylcarboxypeptidase (angiotensinase C) (ISS) [Ostreococcus
tauri]
gi|116059953|emb|CAL56012.1| Prolylcarboxypeptidase (angiotensinase C) (ISS) [Ostreococcus
tauri]
Length = 542
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 129/492 (26%), Positives = 206/492 (41%), Gaps = 82/492 (16%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGG--GNSPILAFLGAEAP------IDDNIQ 106
+++QT+DHF + + + QRY +N F ++P+ +G E P + D
Sbjct: 60 WFDQTLDHFDH--VDRRRWSQRYFVNEGFVDKIEASTPVFVCVGGEGPALTARAVLDGGT 117
Query: 107 LSGFTYENAHQFKALIVILEHRYYGKSVPFGS-RKAALKNARHRGYFNSAQALADYASIL 165
G + A + + + + LEHR+YG S P G + +L+ Y SAQAL D + +
Sbjct: 118 HCGTMIDLAKKHRGIALALEHRFYGASQPTGDLSRESLR------YLTSAQALEDVVAFV 171
Query: 166 LHIKDKY-----------NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILY 214
++ D Y N +++ VIA G SY G LA W R+KYPH + ++ASSAPI
Sbjct: 172 KYVADAYGLRTTPSDDGRNGSYSRVIAFGGSYPGMLAAWSRVKYPHAIHAAVASSAPIRA 231
Query: 215 YGDITPWTTYHSVVSKDFRDT----SEECFQTIKKSW-SEIDNIASKPDGLAILSKKFKT 269
D+ Y+ VV K R+ S+ CF + +++ SE++ P+G L +F
Sbjct: 232 ELDM---RGYYDVVGKALREKDVGGSDACFDAVSETFESELNEALKTPEGRRALETRFNV 288
Query: 270 CTPLK-NTTELKDGLDTVYSE--AAQYDTP-------SNIPVKRICNAIENAPNCGD-DI 318
C + +DG V AQ + P S + + + C + A D
Sbjct: 289 CGDAALDQFGGRDGFADVLRAMFPAQNNDPSCEMEDTSCLNIAKACTMMTRAETGKRLDA 348
Query: 319 LCKIAAGVVEADSLEYDGNNSRCYINE--DRTGDESDEG---WEWQSCSEMVVPMGKDKN 373
L + V + + DG Y+ E T + EG W WQ+C+E +K+
Sbjct: 349 LASVVKVVFGSSCVSLDGA---AYMRELMSETPNPLGEGERQWTWQTCTEFAFFQTCEKD 405
Query: 374 S----MYQPEPWNLTKYIKNCKEQYGVSPRPSWVL-----TYYGGHDIKLILRRSTSNII 424
S P L+ Y C + +GVS + YGG + + I+
Sbjct: 406 SGCPFKLDPPTMPLSSYQWICAQVFGVSAEQTKNAVERSNARYGG------ITPGGTRIL 459
Query: 425 FSNGMRDPFSRGGWVKTYHFFDLSFSQD------LNLGSHCLDLDEAKKSDPDWLVQQRK 478
F +G DP W+ F +FS + SH K +D +VQ R
Sbjct: 460 FPSGSVDP-----WIAN-SFVSNTFSPKWEPAFVVPGASHHAWTHPPKDTDSAAVVQARA 513
Query: 479 TEVKIMQGWITQ 490
K ++ W+ Q
Sbjct: 514 RIEKQVEKWMNQ 525
>gi|116788543|gb|ABK24916.1| unknown [Picea sitchensis]
Length = 489
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 139/521 (26%), Positives = 216/521 (41%), Gaps = 78/521 (14%)
Query: 9 QWLLLMVIFVSTSF---HANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSY 65
+W+LL VI V S+ + L L + R S +D ++++Q +DH+S
Sbjct: 4 KWVLLAVILVIFSYVVLECSALPLPVGIKSGRSSSAYLTQDE------YWFSQRLDHYS- 56
Query: 66 GPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVIL 125
P ++ F QRY ++ + PI + E D + + A +F A IV L
Sbjct: 57 -PTDHRQFNQRYYEFLDYFQAHDGPIFLKVCGEYSCDG--IANDYMAVLAKKFGAAIVSL 113
Query: 126 EHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI------LLHIKDKYNATHAPV 179
EHRYYGKS PF +++ N + Y +S QAL D AS L + K +
Sbjct: 114 EHRYYGKSSPF--KESTTHNLQ---YLSSKQALFDLASFRNYYQELTNKKQNLSNYDNSW 168
Query: 180 IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF-RDTSEE 238
G SY G L+ WFRLK+PH+ GSLASSA +L + T DF R E
Sbjct: 169 FVFGVSYPGALSAWFRLKFPHLTCGSLASSAVVLAVYNFT-----------DFDRQIGES 217
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKT---CTPLKNTTELKDGLDTVYSEAAQYDT 295
KK+ E+ +A + GL+ + K+ LKN + L A QY
Sbjct: 218 AGPGCKKALQEVTRLAEQ--GLSTNANAVKSLFGAEKLKNDGDFLYLLADAAVTAFQYGY 275
Query: 296 PSNIPVKRICNAIENAPNCGDDILCKIA--------AGVVEADSLEYDGNNSRCYINEDR 347
P +C+ + A + G D++ A G VE+ ++ N + ++ D
Sbjct: 276 P-----DVLCSPLVEASSSGKDLMVAYADYVKNYSSDGGVESYGQQFLKNTT---VDADS 327
Query: 348 TGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT-- 405
G + W +Q C+E N + + ++ C+ +G P +T
Sbjct: 328 NGVSNSRSWWYQVCTEFAYFQVAPSNDSVRSSKVDTKYHLDLCENVFGKGIYPEVDITNL 387
Query: 406 YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL--- 462
YYGG I + S IIF NG +DP+ K+ N G H DL
Sbjct: 388 YYGGTSI------AGSKIIFMNGSQDPWRHASKQKSSDNMPSYIITCHNCG-HGTDLRGC 440
Query: 463 ---------DEAKKSDPDWLVQQRKTEVKIMQGWITQYYDD 494
D + + PD + + R+ V+ + W++Q D+
Sbjct: 441 PQSPSRIEGDASDCASPDVVHKARQQMVEHIDLWLSQCQDE 481
>gi|256574601|dbj|BAH98106.1| serine protease [Entamoeba invadens]
gi|440292588|gb|ELP85775.1| hypothetical protein EIN_281360 [Entamoeba invadens IP1]
Length = 479
Score = 124 bits (310), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 116/458 (25%), Positives = 195/458 (42%), Gaps = 49/458 (10%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSG 109
ETF Y +DHF+ + + F Y I+S++ + SPI LG E P + + +
Sbjct: 38 ETFKYTVPLDHFNANND--EEFEIVYFIDSQYLDSASETSPIFILLGGEGPETEKVLQNN 95
Query: 110 FTY-ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
+ E A + K L++ +EHR+YG S P L + Y + QA+ DY ++ ++
Sbjct: 96 YVIDELAKKHKGLMLSVEHRFYGTSTP----SLELNTLK---YCTAEQAMMDYVEVINYV 148
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
++ Y+ PVIA+G SY G LATW R KYP+++ GS ASSAP+ + + Y VV
Sbjct: 149 QEMYSLVGHPVIALGGSYSGNLATWIRQKYPNIIDGSWASSAPL---EAVVDFYEYLEVV 205
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTV 286
+ + + + W D + G L K F TCT K+ + + T
Sbjct: 206 QSNLPENTATLLTLAFEKW---DEMVVTESGRKQLGKIFHTCTEFGEKDIQTFSENIGTA 262
Query: 287 YSEAAQYDTP----SNIPVKRICNAIENAPNCGDDILCKIAAGVVEAD----SLEYDGNN 338
+ QY++ + IC + N + +D SLE +
Sbjct: 263 LAGYVQYNSSVWKKNYESTNSICYEFDEDINTKYPMFIDKTNTKSGSDCTGSSLETNYKE 322
Query: 339 SRCYINEDRTGD-ESDEGWEWQSCSEMVVPMG-----KDKNSMYQPEPWNLTKYIKNCKE 392
R ++ D S W +Q+C V G +K+++ L I CK+
Sbjct: 323 LRDTTTYEKGNDGASGRAWMFQTC----VAYGYYQAVSEKSNVMFGRMNKLQGSIDMCKD 378
Query: 393 QYGVSPRPSW-----VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDL 447
Y + + + + YG + ++ +N+ F+NG DP+ G +
Sbjct: 379 IYNIDNQTLYQAVEHINVRYGAKNPQV------TNVAFTNGGVDPWHALGITQQDAVDSS 432
Query: 448 SFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQ 485
+ Q + SHC DL K++D L + R E++ +
Sbjct: 433 NIVQYIQTTSHCSDLYSEKETDAPELKRARHKEMRFFE 470
>gi|301783465|ref|XP_002927151.1| PREDICTED: thymus-specific serine protease-like [Ailuropoda
melanoleuca]
Length = 476
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 113/454 (24%), Positives = 193/454 (42%), Gaps = 68/454 (14%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ + ++F QRY +N + W P+ LG E + + G
Sbjct: 59 WLEQPLDPFNTSDQ--RSFLQRYWVNDQHWASQRGPVFLHLGGEGSLRSGSVMRGHPAAL 116
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL--ADYASILLHIKDKY 172
A + AL++ LEHR+YG S+P L A+ R + +S AL AD AS + +
Sbjct: 117 APAWGALVIGLEHRFYGLSIP----AEGLDVAQLR-FLSSRHALPSADVASARRALARLF 171
Query: 173 N-ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
N +T +P I G SY G LA W RLK+PH+++ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 172 NVSTASPWICFGGSYAGSLAAWARLKFPHLILASVASSAPVRAVLD---FSEYNEVVSRS 228
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAA 291
T+ C + ++ EL L + A
Sbjct: 229 LTSTAPAC----------------------------SSLGGAEDQAELLGALQALVGGAV 260
Query: 292 QYDTPSNIP--VKRICNAI------ENAPNCGDDILCKIAAGVVEADSLEYDGNN--SRC 341
QYD + +P V+R+C + +A G ++ + L + ++
Sbjct: 261 QYDAQAGVPLSVRRLCGLLLGPSGSRSASYHGLRRAVQVVMRSLGQRCLSFSRAETVAQL 320
Query: 342 YINEDRTGDESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP 400
+ + D W +Q+C+E +D + P L ++ C++ +G+S
Sbjct: 321 RVTDPHVSGVGDRQWLYQTCTEFGFYVTCEDPRCPFSQLP-ALPSQLELCEQVFGLSTAS 379
Query: 401 -----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
S +YYGG +++++F NG DP+ + S S L
Sbjct: 380 IARAVSQTNSYYGGQT------PGSTHVLFVNGDTDPWHV---LSVTQALGPSESALLIP 430
Query: 456 G-SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
G SHC+D+ + SD L R++ ++ +Q W+
Sbjct: 431 GASHCMDMAPERPSDSPSLRLGRQSILQQLQTWL 464
>gi|330837832|ref|XP_003292053.1| hypothetical protein DICPUDRAFT_39962 [Dictyostelium purpureum]
gi|325077726|gb|EGC31420.1| hypothetical protein DICPUDRAFT_39962 [Dictyostelium purpureum]
Length = 481
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 109/396 (27%), Positives = 175/396 (44%), Gaps = 39/396 (9%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+NQ IDH+++ + TF QR+ S ++ G N PI G E + ++ ++ YE
Sbjct: 50 LFNQKIDHYNFQHGNL-TFKQRFFEYSNYYDG-NGPIFFVFGPEQELKEDY-INNRQYEE 106
Query: 115 -AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A A I+ LEHRYYGKS+ F Y NS QA+AD A + K +
Sbjct: 107 WAKTLNASIICLEHRYYGKSI-FTDHLTT----ESLQYLNSDQAIADVAYFITWYKKENK 161
Query: 174 ATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPI------LYYGDITPWTTYHS 226
+ GASYGG +A F++KYPH++ ++SS P+ Y +I T
Sbjct: 162 IDDGKRWVGFGASYGGTIAAQFKIKYPHLIDIIVSSSGPVSPELNFFQYLEIVQNTIISE 221
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE---LKDGL 283
V E C + I+ + EI+ I K +L KF+ C PL+N + L+
Sbjct: 222 V------QDGERCVENIRNATLEIEEII-KFGNHNLLKDKFRLCAPLENEKDFSLLEFTN 274
Query: 284 DTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDD----ILCKIAAGVVEADSLEYDGNNS 339
V+ + QY + +++ICN + N D I +++ V+ ++ Y +
Sbjct: 275 SLVFMDTVQYYDSNKDKLQKICNILNNEFKSSLDNYIQIWLEVSYPNVKCINVNYK---N 331
Query: 340 RCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVS-- 397
I ++R D + W +Q+C+E M + + NL Y C++ +G+
Sbjct: 332 HIEIWKERNVDHQSKAWLYQTCTEYGYFMTTESKNQPFGSLLNLQFYTDMCQDIFGIRNM 391
Query: 398 -PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
P W YGG I S +I++ N DP
Sbjct: 392 IPNTKWANDQYGGFKIN---SESIKSILYINSSLDP 424
>gi|160332814|emb|CAL69923.1| hypothetical protein [Plasmodiophora brassicae]
gi|162138601|emb|CAP58027.1| hypothetical protein [Plasmodiophora brassicae]
Length = 467
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 122/460 (26%), Positives = 186/460 (40%), Gaps = 57/460 (12%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAE---APIDDNIQLS 108
E F +Q +DH + G + F QRY +FW G + P++ L E A + D Q
Sbjct: 23 EHVFPDQLVDHTAIGGGGAR-FSQRYFRIDQFWSGPDGPVILQLCGEYTCAGVTDGRQ-- 79
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
F A ++ AL+++LEHRY+GKS PF ++ + R+ Y + QAL+D A
Sbjct: 80 -FPSALAERYGALVLVLEHRYFGKSSPF-----SVLSPRNLTYLTTFQALSDIACF---- 129
Query: 169 KDKYNATHA------PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
D Y H I IG SY G LA W+RLKYPH+ G+LASSA + P+
Sbjct: 130 TDWYQRVHIGRANANKWITIGGSYPGALAAWYRLKYPHLTAGALASSAV------VAPFA 183
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG 282
+ + EC ++ + ++ + LA +C+ L + L
Sbjct: 184 EFPEFDEQVALSAGPECTHALQDITAMVEGALQEGGRLADEMNALFSCSQLSDADFLYLI 243
Query: 283 LDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI--LCKIAAGVVEADSL----EYDG 336
D + +EA QY P +C+ I A + D + + G A +YDG
Sbjct: 244 ADAM-AEAIQYG-----PSVSLCDPIVQAESRDDRLAAFVEFVQGTFYASMSNSPGDYDG 297
Query: 337 NNSRCYINEDRTGDESDEGWEWQSCSEM----VVPMGKDKNSMYQPEPWNLTKYIKNCKE 392
+ + D S W W C+E+ + P S W+ + K +
Sbjct: 298 DT---MASHRWVPDSSGRQWWWMKCNEVGWFQIAPGTNSIRSKRVNMEWHRDRCEKLFGD 354
Query: 393 QYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD 452
P Y G D+ S SN++F+NG+ DP+ G L S
Sbjct: 355 VLAFPPPCHRASIEYSGFDM------SVSNVVFTNGVEDPWQWAGASAFSSSAHLRDSSV 408
Query: 453 LNLG----SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L + HC+DL D L R T + + W+
Sbjct: 409 LLINCSQCGHCVDLHTPSPDDAPALTTARSTIIAHIDRWL 448
>gi|241176364|ref|XP_002399549.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215495171|gb|EEC04812.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 317
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 62/150 (41%), Positives = 93/150 (62%), Gaps = 5/150 (3%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+ +DHF++ +T+ RY I +FW P+ + G E P + I+ +G +E A
Sbjct: 18 RRVDHFTF--RDNRTYQMRYAIADQFWDRKGGPVFFYTGNEDPYETFIKETGVIWEWAPD 75
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATH 176
FKALIV EHR+YGKS+PFG + ++ ++ GY S QALADYA +++++K A
Sbjct: 76 FKALIVFAEHRFYGKSLPFGDE--SYQSPKNLGYLTSEQALADYAYLVVYLKTTLAGAAK 133
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSL 206
+ +A G SYGG LATWFR+KYPH++ +L
Sbjct: 134 SQFVAFGGSYGGMLATWFRIKYPHLIAATL 163
>gi|297802314|ref|XP_002869041.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297314877|gb|EFH45300.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 491
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 142/525 (27%), Positives = 214/525 (40%), Gaps = 81/525 (15%)
Query: 3 SSQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDH 62
SS F L I +S S ++GL+ R R S L + + ++NQT+DH
Sbjct: 2 SSAICFALLTTFTILLSYSSLSDGLQPR------RISHGLTESSKYLTRDELWFNQTLDH 55
Query: 63 FSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALI 122
+S P ++ F QRY + PI + E P N + + A +F A I
Sbjct: 56 YS--PSDHRKFRQRYYEYLDHLRVPDGPIFMMICGEGPC--NGIPNDYITVLAKKFDAGI 111
Query: 123 VILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT------- 175
V LEHRYYGKS PF S A +N + Y +S QAL D A+ + + + N +
Sbjct: 112 VSLEHRYYGKSSPFKS--LATENLK---YLSSKQALFDLAAFRQYYQARSNDSLNVKFNR 166
Query: 176 ----HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
P GASY G L+ WFRLK+PH+ GSLASSA + + +
Sbjct: 167 SGNVENPWFFFGASYSGALSAWFRLKFPHLTCGSLASSAVVRAVYEFPEFD--------- 217
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY---- 287
+ E K + E + + GL + +K K L N TEL D +Y
Sbjct: 218 -QQIGESAGPECKAALQETNKLLEL--GLKVNNKAVKA---LFNATELDVDADFLYLIAD 271
Query: 288 --SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY--- 342
A QY P + V +E N GD L + A V + G +S+ Y
Sbjct: 272 AEVMAIQYGNPDKLCVP----LVEAHKNGGD--LVEAYAKYVREFCMGVFGLSSKTYSRK 325
Query: 343 --INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSP 398
++ T + +D W +Q C+E+ N + N ++ CK + GV P
Sbjct: 326 HLLDTAVTPESADRLWWFQVCTEVAYFQVAPANDSIRSHQINTEYHLDLCKSLFGKGVYP 385
Query: 399 RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRG----------GWVKTYHFFDLS 448
YYG R + + IIF+NG +DP+ ++ T H +
Sbjct: 386 EVDATNLYYGSD------RIAATKIIFTNGSQDPWRHASKQTSSPELPSYIVTCH--NCG 437
Query: 449 FSQDLN---LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
DL ++ D S PD + + R+ V+ M W+++
Sbjct: 438 HGSDLRGCPQSPMVIEGDSKNCSSPDAVNKVRQHIVEHMDLWLSE 482
>gi|328718793|ref|XP_001947565.2| PREDICTED: putative serine protease K12H4.7-like [Acyrthosiphon
pisum]
Length = 490
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 130/489 (26%), Positives = 211/489 (43%), Gaps = 58/489 (11%)
Query: 30 RPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS 89
RP RS I+E K ++ Q +DHF+ P +T+ QRY +N + + G
Sbjct: 31 RPNSVATARSLIIEDK---------WFKQKVDHFN--PSDTRTWKQRYHMNLQHYKHG-G 78
Query: 90 PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHR 149
PI +G E I N SG E A + A+ LEHRYYG+S P + K KN +
Sbjct: 79 PIFLSIGGEEEITHNWMTSGAWIEYAKKLNAMCFQLEHRYYGRSHPTDNLKT--KNLK-- 134
Query: 150 GYFNSAQALADYASILLHIKDKYNAT--HAPVIAIGASYGGELATWFRLKYPHVVIGSLA 207
Y Q LAD + + I + T +A I G SY G LA W R+KYPH+V +++
Sbjct: 135 -YLTVEQVLADLETFISTISNDNEETLRNAKWIVFGGSYSGSLAAWLRMKYPHLVYAAVS 193
Query: 208 SSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKF 267
SS+P++ D + ++ + + + +C I ++ S I ++ G + KKF
Sbjct: 194 SSSPLMAKID---YKDFYMAIQNTISNYNPKCASNIIEATSTISDLLETSYGAKYVDKKF 250
Query: 268 KTCTPL-----KNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKI 322
KTC+ + K+ T + L + QY+ + K + ++ D L
Sbjct: 251 KTCSDMNIKNKKDKTVFFNNLALPVALIIQYNNDNKKKNKLALSLVKLCDMMLDKSLGNP 310
Query: 323 AAGVVEADSLEYDGNNSRCY-IN-EDRTGDESDEGWEWQS------------CSEM-VVP 367
V NN C IN +D + W ++ C+++
Sbjct: 311 LERYVAVHKQLRSVNNQICTSINYQDAISALKETSWNAENVKSGSRQWLYLICTQIGNFV 370
Query: 368 MGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV-----LTYYGGHDIKLILRRSTSN 422
++N ++ L Y C++ +G S + + T HD L++ TS
Sbjct: 371 TSNNRNDLF-GNSIPLDYYTGMCRDVFGKSFNANSLNAAVRKTNMIHHD----LKKKTSR 425
Query: 423 IIFSNGMRDPFSRGGWVK--TYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTE 480
II+ +G D +S G ++ T H S S + GSHC DL ++ SD L + RKT
Sbjct: 426 IIYLHGTIDAWSTLGLIQPMTKH----SVSIVIEGGSHCSDLYPSRSSDSPQLKKARKTV 481
Query: 481 VKIMQGWIT 489
++ W++
Sbjct: 482 EFYLKKWLS 490
>gi|187447082|emb|CAO84855.1| ENSANGG00000016082 protein [Anopheles gambiae]
Length = 227
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 78/229 (34%), Positives = 122/229 (53%), Gaps = 19/229 (8%)
Query: 150 GYFNSAQALADYASILLHIKDKYNATHA-PVIAIGASYGGELATWFRLKYPHVVIGSLAS 208
GY S QALAD+A +L + T A PVIA G SYGG LA W R+KYPH+V G++A+
Sbjct: 2 GYLTSEQALADFALVLRTLNPPNGTTRARPVIAFGGSYGGMLAAWIRIKYPHLVAGAIAA 61
Query: 209 SAPILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKF 267
SAP+ + +T ++ +++ ++ + +C I++SW+ + N ++ DGL +L++KF
Sbjct: 62 SAPVRQFAGVTDCGIFNQILTSVYQVAYTADCADNIRRSWTTLQNYSTSADGLRLLNEKF 121
Query: 268 KTCTPLKNTTE----LKDGLDTVYSEAAQ--YDTPSNI-------PVKRICNAIENAPNC 314
K CT L T+ L D L VY A Y PS+ PV+ C +
Sbjct: 122 KFCTNLTKGTDVTETLFDYLTDVYGNXAMINYPYPSSFLAPVPAYPVREFCGRLAQNYT- 180
Query: 315 GDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDE-SDEGWEWQSCS 362
G ++L + + + + YDG + IN G SD GW++Q+C+
Sbjct: 181 GVELLDHLQSAL--SIYYNYDGKAACLNINSSYDGTGISDRGWDFQACT 227
>gi|187447074|emb|CAO84851.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447076|emb|CAO84852.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447078|emb|CAO84853.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447080|emb|CAO84854.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447084|emb|CAO84856.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447086|emb|CAO84857.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447088|emb|CAO84858.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447090|emb|CAO84859.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447092|emb|CAO84860.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447094|emb|CAO84861.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447096|emb|CAO84862.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447098|emb|CAO84863.1| ENSANGG00000016082 protein [Anopheles gambiae]
Length = 227
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 78/229 (34%), Positives = 122/229 (53%), Gaps = 19/229 (8%)
Query: 150 GYFNSAQALADYASILLHIKDKYNATHA-PVIAIGASYGGELATWFRLKYPHVVIGSLAS 208
GY S QALAD+A +L + T A PVIA G SYGG LA W R+KYPH+V G++A+
Sbjct: 2 GYLTSEQALADFALVLRTLNPPNGTTRARPVIAFGGSYGGMLAAWIRIKYPHLVAGAIAA 61
Query: 209 SAPILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKF 267
SAP+ + +T ++ +++ ++ + +C I++SW+ + N ++ DGL +L++KF
Sbjct: 62 SAPVRQFAGVTDCGIFNQILTSVYQVAYTADCADNIRRSWTTLQNYSTSADGLRLLNEKF 121
Query: 268 KTCTPLKNTTE----LKDGLDTVYSEAAQ--YDTPSNI-------PVKRICNAIENAPNC 314
K CT L T+ L D L VY A Y PS+ PV+ C +
Sbjct: 122 KFCTNLTKGTDVTETLFDYLTDVYGNLAMINYPYPSSFLAPVPAYPVREFCGRLAQNYT- 180
Query: 315 GDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDE-SDEGWEWQSCS 362
G ++L + + + + YDG + IN G SD GW++Q+C+
Sbjct: 181 GVELLDHLQSAL--SIYYNYDGKAACLNINSSYDGTGISDRGWDFQACT 227
>gi|255581402|ref|XP_002531509.1| catalytic, putative [Ricinus communis]
gi|223528862|gb|EEF30863.1| catalytic, putative [Ricinus communis]
Length = 482
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 116/397 (29%), Positives = 181/397 (45%), Gaps = 53/397 (13%)
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYE 113
++NQ +DH+S P ++ F QRY ++ PI + E+ N + +
Sbjct: 43 LWFNQILDHYS--PYDHRRFQQRYYEYLDYFRAPGGPIFLKICGESSC--NGIANDYISV 98
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD--- 170
A +F A +V LEHRYYGKS PF S + KN R Y +S QAL D A H ++
Sbjct: 99 LAKKFGAAVVSLEHRYYGKSTPFKSSET--KNLR---YLSSKQALFDLAVFRQHYQEALN 153
Query: 171 -KYNATHA--PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
K N T+ P I G SY G L+ W+RLK+PH+ GS+ASSA +L + +T +
Sbjct: 154 LKLNRTNVENPWIVFGISYSGALSAWYRLKFPHLTCGSVASSAVVL---AVYNFTEFDQQ 210
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
+ + EC ++++ +D + LA K KT L N EL+ D +Y
Sbjct: 211 IGE---SAGAECKAALQETTQLVD------ERLASNRKAVKT---LFNAAELEIDGDFLY 258
Query: 288 --SEAA----QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVE----ADSLEYDGN 337
++AA QY P ++C+ + A G+D++ A V E + + +
Sbjct: 259 FLADAAVIAFQYGNPD-----KLCSPLVEAKKAGEDLVEAYAKYVKEYYVGSFGVSVETY 313
Query: 338 NSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--G 395
N + + + SD W +Q C+E+ N + N ++ CK + G
Sbjct: 314 NQKHLKDTAINENSSDRLWWFQVCTEVAYFQVAPSNDSIRSSKVNTRYHLDLCKNVFGEG 373
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
+ P YYGG I + S I+F+NG +DP
Sbjct: 374 IYPEVDTTNIYYGGTKI------AGSKIVFTNGSQDP 404
>gi|307184675|gb|EFN71004.1| Putative serine protease F56F10.1 [Camponotus floridanus]
Length = 418
Score = 122 bits (306), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 117/447 (26%), Positives = 203/447 (45%), Gaps = 81/447 (18%)
Query: 94 FLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFN 153
+GAE + + G E A +F A+ LEHR+YG S P + ++KN Y N
Sbjct: 1 MIGAEGIANVKWMVEGQWIEYAKEFGAMCFYLEHRFYGNSHP--TPDLSVKNLI---YLN 55
Query: 154 SAQALADYASILLHIKDKYN-ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
S QALAD A + +I +Y + + I G SYGG LA W R+KYPH+V G++++S P+
Sbjct: 56 SQQALADLAYFIQNINIEYKFSNNTKWIVFGGSYGGSLAAWMRIKYPHLVHGAVSTSGPL 115
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP 272
L D + Y VV+ +D S++C TI +++ E+ + + KKFK C P
Sbjct: 116 LAQID---FQEYFVVVANALKDYSQKCVDTIAEAYRELGILLRHVGSQQKIEKKFKLCDP 172
Query: 273 LK-------NTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAG 325
+ + + L + L ++ QY+ + + + IEN C DIL G
Sbjct: 173 IDPGHTKKLDISNLYETLADNFASIVQYNKDNRQSSQTLNITIENV--C--DILVDEKIG 228
Query: 326 VVEADSLEYDGN------NSRC-------YINEDR----TGDESDEG--WEWQSCSEMVV 366
+ D L Y N +C I E R T ++++ G W +Q+C+E
Sbjct: 229 -IPIDRLAYVSNMILNATKEKCLDYRYDKMIRELRNTTWTNEQAEGGRQWMYQTCTE--- 284
Query: 367 PMGKDKNSMYQPEPWN----LTKYIKNCKEQYGVSPRPSWVL---------TYYGGHDIK 413
G + S QP ++ + +++ C + +G PR + L YGG ++K
Sbjct: 285 -FGFFQTSTAQPNLFSNNFPVNFFVQQCTDIFG--PRYNIDLLNSAVTRTNILYGGLNLK 341
Query: 414 LILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD-------LNLGSHCLDLDEAK 466
+ +N++F +G DP +H ++ S + ++ +HC ++ +
Sbjct: 342 V------TNVVFVHGSIDP---------WHVLGITTSSNPQAPAIYIDGTAHCANMYPSS 386
Query: 467 KSDPDWLVQQRKTEVKIMQGWITQYYD 493
++D L + R +++ W+ Y+
Sbjct: 387 ENDMPQLKKARIQIKNLIKEWLKNSYN 413
>gi|32564813|ref|NP_498758.2| Protein K12H4.7, isoform a [Caenorhabditis elegans]
gi|13638618|sp|P34528.2|YM67_CAEEL RecName: Full=Putative serine protease K12H4.7; Flags: Precursor
gi|351063156|emb|CCD71198.1| Protein K12H4.7, isoform a [Caenorhabditis elegans]
Length = 510
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 131/504 (25%), Positives = 217/504 (43%), Gaps = 66/504 (13%)
Query: 25 NGLKLRPRLGRIRRSRILEQKDSNHGFETFF---YNQTIDHF--SYGPESYQTFPQRYVI 79
N ++ RPR G ++++ + F+ + QT+DHF S G +TF QRY
Sbjct: 30 NMIRGRPR-GGMKKTPPMSSVSHMINFDNVVSSTFTQTLDHFDSSVG----KTFQQRYYH 84
Query: 80 NSKFWGGGNSPILAFLGAEAPIDDN-IQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFG 137
N++++ G P LG E P + G N A + A + +EHR+YG++ P
Sbjct: 85 NNQWYKAGG-PAFLMLGGEGPESSYWVSYPGLEITNLAAKQGAWVFDIEHRFYGETHP-- 141
Query: 138 SRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGELATWFRL 196
+ ++ N + Y +SAQA+ D A+ + + K+ +A + G SY G LA W R
Sbjct: 142 TSDMSVPNLK---YLSSAQAIEDAAAFIKAMTAKFPQLANAKWVTFGGSYSGALAAWTRA 198
Query: 197 KYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
K+P +V ++ SS P+ D + Y VV S EC ++ + ++ + ++
Sbjct: 199 KHPELVYAAVGSSGPVQAEVD---FKEYLEVVQNSITRNSTECAASVTQGFNLVASLLQT 255
Query: 257 PDGLAILSKKFKTCTPLK-NTTELKDGLDTVYS---EAAQY--DTPSNIPVK-----RIC 305
DG L F C ++ + LK +TVYS E QY D + + IC
Sbjct: 256 SDGRKQLKTAFHLCQDIQMDDKSLKYFWETVYSPYMEVVQYSGDAAGSFATQLTISHAIC 315
Query: 306 NAIENAPNCGDDILCKI------AAGVVEADSLEYDGNNSRCYINEDRTGD-ESDEGWEW 358
N + L ++ +G + ++Y+G S ++ ++ G+ +SD W W
Sbjct: 316 RYHINTKSTPLQKLKQVNDYFNQVSGYFGCNDIDYNGFIS--FMKDETFGEAQSDRAWVW 373
Query: 359 QSCSEMVVPMGKDKNSMYQPEPW-----NLTK--YIKNCKEQYGVSPRPSWVLT------ 405
Q+C+E S PW NL YI C YG + V T
Sbjct: 374 QTCTEFGY---YQSTSSATAGPWFGGVSNLPAQYYIDECTAIYGAAYNSQEVQTSVDYTN 430
Query: 406 -YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDE 464
YYGG D +T I+ NG DP+ G + + + + +N +HC D+
Sbjct: 431 QYYGGRD-----NLNTDRILLPNGDIDPWHALGKLTSSNSNIVPVV--INGTAHCADMYG 483
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWI 488
A D +L R+ ++ GW+
Sbjct: 484 ASSLDSMYLTNARQRISDVLDGWL 507
>gi|402866199|ref|XP_003919573.1| PREDICTED: LOW QUALITY PROTEIN: thymus-specific serine protease
[Papio anubis]
Length = 501
Score = 122 bits (305), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 112/429 (26%), Positives = 180/429 (41%), Gaps = 63/429 (14%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W G + PI LG E + + G
Sbjct: 61 WLEQLLDPFNV--SDRRSFLQRYWVNEQHWVGEDGPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAA---LKNARH-------------RGYFNSAQAL 158
A + AL++ LEHR+YG S+P G + A ++RH R S L
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMAQLRFLSSRHAMGKSSGIPSDEDRPSSPSDPRL 178
Query: 159 ADYASILLHIKDKYN-ATHAPVIAIGASYGGELATWFRLK------YPHVVIGSLASSAP 211
AD S L + +N ++ +P I G SY G LA W RLK +PH++ S+ASSAP
Sbjct: 179 ADVVSARLALSRLFNVSSSSPWICFGGSYAGSLAAWARLKVLXLLRFPHLIFASVASSAP 238
Query: 212 ILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKKSWSEID-NIASKPDGLAILSKK 266
+ D ++ Y+ VVS+ T S EC + +++E++ + A L +
Sbjct: 239 VRAVLD---FSEYNDVVSRSLMSTAIGGSLECRAAVSAAFAEVERRLRLGGAAQAALRSE 295
Query: 267 FKTCTPL---KNTTELKDGLDTVYSEAAQYDTPSNIP--VKRIC--------NAIENAPN 313
C L +N EL L + A QYD + P V+++C N + P
Sbjct: 296 LSACGTLGSAENQAELLGALQALVGGAVQYDGQAGAPLSVRQLCGLLLGGGGNRSHSTPY 355
Query: 314 CGDDILCKIAAGVVEADSLEYDGNNSRCYI--NEDRTGDESDEGWEWQSCSE--MVVPMG 369
CG +I + L + + + E + D W +Q+C+E V G
Sbjct: 356 CGLRRAVQIVLHSLGQKCLSFSRAETVAQLRSTEPQLSGVGDRQWLYQTCTEFGFYVTCG 415
Query: 370 KDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNII 424
+ Q L + C++ +G+S + +YYGG + ++
Sbjct: 416 NPRCPFSQLP--ALPSQLDLCEKVFGLSALSVAQAVAQTNSYYGGQT------PGANQVL 467
Query: 425 FSNGMRDPF 433
F NG DP+
Sbjct: 468 FVNGDTDPW 476
>gi|322795214|gb|EFZ18036.1| hypothetical protein SINV_11633 [Solenopsis invicta]
Length = 413
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 92/316 (29%), Positives = 149/316 (47%), Gaps = 35/316 (11%)
Query: 11 LLLMVIFVSTSFH--ANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPE 68
++++ +FV S H NG+ R R KD+ + Q +DHF++
Sbjct: 1 MIILKVFVLCSIHPLVNGVGFRGFTFRGLEDPEPFTKDAGKDIVEGWITQPLDHFNH--R 58
Query: 69 SYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHR 128
+T+ RY NS F G PIL +G E I D G YE ++ L+ EHR
Sbjct: 59 ENRTWSMRYKENSAFLKNG-GPILIMIGGEWQITDGYLQGGLMYEIGVKYGGLMYYTEHR 117
Query: 129 YYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGG 188
+YG+S P ++ + +N + Y N+ Q LAD A + K + N ++ VI +G SY G
Sbjct: 118 FYGQSKP--TKDISSENLQ---YLNADQGLADLAYFIETKKKEKNLENSTVIVVGGSYAG 172
Query: 189 ELATWFRLKYPHVV-----------------IGSLASSAPILYYGDITPWTTYHSVVSKD 231
+A W RLKYPH++ G+LASSAP+ D + Y+ V+K
Sbjct: 173 NMAAWARLKYPHLIQVTFALSLYQNWERYLKKGALASSAPVKAKAD---FYEYYEFVAKS 229
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP--LKNTTEL---KDGLDTV 286
S++C + +K +++ ++ + +K G L F C P + ++++L + L V
Sbjct: 230 LGRHSKKCVENVKIAFASVEELLAKQCGAIELKYLFNLCEPPDINSSSDLGYFTNMLSEV 289
Query: 287 YSEAAQYDTPSNIPVK 302
++E QYD N K
Sbjct: 290 FAETVQYDEIVNGETK 305
>gi|56757429|gb|AAW26883.1| SJCHGC06819 protein [Schistosoma japonicum]
Length = 331
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 89/318 (27%), Positives = 152/318 (47%), Gaps = 28/318 (8%)
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF-RDTSEECFQTIKKSWS 248
L+ W R KYP+ + G++ASSAP+ + ++ + V + F + + C + I+ SWS
Sbjct: 2 LSAWIRQKYPNQIAGAIASSAPVWLFPGLSDCNGFSLVATNSFLKYGGDNCVKNIQHSWS 61
Query: 249 EIDNIASKPDGLAILSKKFKTCTPLKNTTE----LKDGLDTVYSEAAQYDTPSNI----- 299
I +I DG +L+ F CTPL + L D L T+ Y P+N
Sbjct: 62 NIVDIGQSFDGKELLTNMFNICTPLTDVQNIIDYLSDYLGTI--SMVNYPYPANFLGTLP 119
Query: 300 --PVKRICNAIENAPNCGDDILCKIAAGVVEADSL-EYDGNNSRCYINEDRTGDESDEGW 356
PVK +C+ + + ++ +I+ +L Y GN + I+ G ++ + W
Sbjct: 120 AWPVKYLCSNL-TVYDPQQPVVTRISLLAKAILALTNYTGNQNCLDISGSLPGIDA-KAW 177
Query: 357 EWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLI 415
E Q+C EM PM ++ P W+L + C++QYG+SPR +W + + I
Sbjct: 178 EIQTCMEMTTPMCASGAVNIMPPVNWDLNSFSAYCQKQYGISPRVNWPKVEFWSKSVDTI 237
Query: 416 LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL--GSHCLDLDEAKKSDPDWL 473
+NI+FSNG DP+ + + + F+ +N+ +H LDL +DPD +
Sbjct: 238 -----TNIVFSNGEIDPWFALSITNSSY---VPFATVINIADAAHHLDLRTPNPADPDSV 289
Query: 474 VQQRKTEVKIMQGWITQY 491
V+ R E + + WI ++
Sbjct: 290 VKARTLEKQKIIQWIKEW 307
>gi|414870775|tpg|DAA49332.1| TPA: putative serine peptidase S28 family protein [Zea mays]
Length = 509
Score = 122 bits (305), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 123/467 (26%), Positives = 195/467 (41%), Gaps = 63/467 (13%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E + NQT+DHF+ P ++ F QRY ++ N PI ++ E+ N + +
Sbjct: 57 EEHWMNQTLDHFN--PTDHRQFKQRYYEFLDYYRAPNGPIFLYICGESTC--NGIGNNYL 112
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
A +F A +V EHRYYGKS PF S + + +S QAL D A + ++
Sbjct: 113 AVVAKKFGAALVSPEHRYYGKSSPFNSL-----TTENLQFLSSKQALFDLAVFRQYYQET 167
Query: 172 YNA------THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
NA ++ G SY G L+ WFRLK+PH+ GSLASS +L + T +
Sbjct: 168 LNAKYNRSGANSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVLAVYNFTDFDKQI 227
Query: 226 SVVSKDFRDTSEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPLKNTTELKDGLD 284
+ EC +++ +D + S + + K+ T L+N + L
Sbjct: 228 GI------SAGPECKAALQEITGLVDGQLQSGRNSV----KELFGATKLENDGDFLYLLA 277
Query: 285 TVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAA-------GVVEADSLEYDGN 337
+ A QY P +C+ + A G D++ A+ G A YD
Sbjct: 278 DAAAIAFQYGNPD-----VLCSPLAEAKKNGTDLVETFASYVKDYYIGKFGASVASYD-- 330
Query: 338 NSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVS 397
+ Y+ + S W +Q CSE+ KN + + ++ C+ +G
Sbjct: 331 --QQYLKNTTPAESSYRLWWYQVCSEVAYFQVAPKNDSVRSPKIDTRYHLDLCRNVFGEG 388
Query: 398 PRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
P +T YYGG I + S I+F+NG +DP+ K+ + N
Sbjct: 389 VYPDVFMTNLYYGGTGI------AGSKIVFANGSQDPWRHASKQKSSDELPSYLIECKNC 442
Query: 456 GSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
G HC DL D +K S P+ L + RK V + W+++
Sbjct: 443 G-HCSDLSGCPQAPSNIEGDSSKCSPPEALNKVRKEIVDHIDLWLSE 488
>gi|145525753|ref|XP_001448693.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416248|emb|CAK81296.1| unnamed protein product [Paramecium tetraurelia]
Length = 462
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 118/470 (25%), Positives = 204/470 (43%), Gaps = 50/470 (10%)
Query: 41 ILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAE-- 98
+L Q ETF + Q +DH P + QT+ QRY + S+++ ++ ++ E
Sbjct: 19 VLTQGPFTETKETFQFTQLLDHSD--PANTQTWQQRYHVYSQYFNPTKGGVILYICGEWN 76
Query: 99 -APIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQA 157
+ DN F+++ A A+++ LEHR+YG+S PFG+ +L+N Y N QA
Sbjct: 77 CQGVSDN----SFSFQLAKDLGAIVIALEHRFYGQSQPFGADSWSLENL---SYLNVHQA 129
Query: 158 LADYASILLHIKD-KYNATHA--PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILY 214
L D A +L +K K ++ + P AIG SY G L+ WFR KYPH+ +G+LASS I
Sbjct: 130 LDDLAYFILQMKRLKLHSIDSTLPWYAIGGSYPGALSAWFRYKYPHLTVGNLASSGVI-- 187
Query: 215 YGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK 274
+ + + + K + E+C ++ S +D + + ++ C +
Sbjct: 188 -NTVLDFWEFDDQIRKSTSKSGEQCPLYLQLLNSFVDKNLKNFNTKQAFKESYR-CGKMT 245
Query: 275 NTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEY 334
+ +DT+ Q + C +E+ + + + DS +
Sbjct: 246 DNEFRWFWVDTIVQMVQQGKR------SKFCQTLESLSSVERMAEYIREIALSQGDSYKQ 299
Query: 335 DGNNSRCYINEDRTGDESDEGWEW--QSCSEMVV----PMGKDKNSMYQPEPWNLTKYIK 388
G Y + T DE+ + +W Q C+E+ P KD Y+ L + +
Sbjct: 300 YG----AYYLRNETIDENSQHRQWYFQCCTEVAYLQTPPQNKDSLRSYEM---TLDWWRE 352
Query: 389 NCKEQYG----VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG--WVKTY 442
C + Y V P Y+GG L+ + ++I +NG DP+ R + +
Sbjct: 353 WCNDAYSQGEVVWPDVRVTEAYFGG------LKLNVDHLIMTNGGEDPWQRASLPFARKD 406
Query: 443 HFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+ ++ D + SHC+DL +DP L Q R + W Q++
Sbjct: 407 NSKVTTYLIDCDDCSHCVDLKAPTANDPAVLTQTRLDIKNKFKQWHAQFW 456
>gi|224097128|ref|XP_002310843.1| predicted protein [Populus trichocarpa]
gi|222853746|gb|EEE91293.1| predicted protein [Populus trichocarpa]
Length = 487
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 122/449 (27%), Positives = 210/449 (46%), Gaps = 65/449 (14%)
Query: 7 SFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFET--FFYNQTIDHFS 64
+ + + + ++ +ST+ +A+ L LR ++ + E +S + T ++NQT+DHFS
Sbjct: 3 TMKLVFISLLLLSTAPYAS-LALRTQVHSLS-----ESSNSKNYLTTQELWFNQTLDHFS 56
Query: 65 YGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVI 124
P + FPQRY ++ + PI + E+ N ++ + A +F A +V
Sbjct: 57 --PFDHHKFPQRYYEFLDYFRISDGPIFLEICGESSC--NGIVNDYISVLAKKFGAAVVS 112
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI--LLHIKDKYN------ATH 176
LEHRYYG+S+PF S +N R + +S QAL D A +H+++ N +
Sbjct: 113 LEHRYYGRSLPFKS--TTTENLR---FLSSKQALFDLAVFRHTIHMQESLNLKLNRTSVE 167
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
P G SY G L+ WFRLK+PH+ GSLASSA +L I +T + + +
Sbjct: 168 NPWFVFGGSYAGALSAWFRLKFPHLTCGSLASSAVVL---AIHNFTEFDQQIGE---SAG 221
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY--SEAA--- 291
EC T++++ ++ L+ + L + EL+ D +Y ++AA
Sbjct: 222 AECKATLQETTQLVEE---------RLASNKQAVKALFDAAELEIDGDFLYFLADAAVIA 272
Query: 292 -QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN---EDR 347
QY P ++C + A G+D L + A V+ D + G + + Y +D
Sbjct: 273 FQYGNPD-----KLCPPLVQAKKDGED-LVEAYAKYVKEDYVGSFGVSVQTYDQRHLKDT 326
Query: 348 TGDES--DEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWV 403
T +E+ D W +Q C+E+ N + + ++ CK+ + G+ P
Sbjct: 327 TLNENSGDRLWWFQVCTEVAYFQVAPANDSIRSSQVDTRYHLDLCKKVFGEGIYPEVDKT 386
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
YYGG ++ + S I+F+NG +DP
Sbjct: 387 NIYYGGTNM------AGSKIVFTNGSQDP 409
>gi|226506854|ref|NP_001149727.1| prolyl carboxypeptidase like protein precursor [Zea mays]
gi|195629836|gb|ACG36559.1| prolyl carboxypeptidase like protein [Zea mays]
Length = 509
Score = 121 bits (304), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 123/467 (26%), Positives = 195/467 (41%), Gaps = 63/467 (13%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E + NQT+DHF+ P ++ F QRY ++ N PI ++ E+ N + +
Sbjct: 57 EEHWMNQTLDHFN--PTDHRQFKQRYYEFLDYYRAPNGPIFLYICGESTC--NGIGNNYL 112
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
A +F A +V EHRYYGKS PF S + + +S QAL D A + ++
Sbjct: 113 AVVAKKFGAALVSPEHRYYGKSSPFNSL-----TTENLQFLSSKQALFDLAVFRQYYQET 167
Query: 172 YNATH------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
NA + + G SY G L+ WFRLK+PH+ GSLASS +L + T +
Sbjct: 168 LNAKYNRSGANSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVLAVYNFTDFDKQI 227
Query: 226 SVVSKDFRDTSEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPLKNTTELKDGLD 284
+ EC +++ +D + S + + K+ T L+N + L
Sbjct: 228 GI------SAGPECKAALQEITGLVDGQLQSGRNSV----KELFGATKLENDGDFLYLLA 277
Query: 285 TVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAA-------GVVEADSLEYDGN 337
+ A QY P +C+ + A G D++ A+ G A YD
Sbjct: 278 DAAAIAFQYGNPD-----VLCSPLAEAKKNGTDLVETFASYVKDYYIGKFGASVASYD-- 330
Query: 338 NSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVS 397
+ Y+ + S W +Q CSE+ KN + + ++ C+ +G
Sbjct: 331 --QQYLKNTTPAESSYRLWWYQVCSEVAYFQVAPKNDSVRSPKIDTRYHLDLCRNVFGEG 388
Query: 398 PRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
P +T YYGG I + S I+F+NG +DP+ K+ + N
Sbjct: 389 VYPDVFMTNLYYGGTGI------AGSKIVFANGSQDPWRHASKQKSSDELPSYLIECKNC 442
Query: 456 GSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
G HC DL D +K S P+ L + RK V + W+++
Sbjct: 443 G-HCSDLSGCPQAPSNIEGDSSKCSPPEALNKVRKEIVDHIDLWLSE 488
>gi|229594586|ref|XP_001032708.3| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|225566764|gb|EAR85045.3| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 475
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 118/449 (26%), Positives = 188/449 (41%), Gaps = 34/449 (7%)
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP-ILAFLGAEAPIDDNIQLSGFTY 112
++ Q +DH Y P +T+ QRY + ++ P ++ ++ E + S FT
Sbjct: 33 LYFQQKLDH--YAPLDNRTWAQRYFVMDHWFNKTAQPLVILYICGEGECNGVQYNSSFTS 90
Query: 113 ENAHQFKALIVILEHRYYGKSVPFG--SRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+ A +++ LEHR+YGKS PFG + AL N + Y + QAL D A + ++KD
Sbjct: 91 KIAEIHNGIVLSLEHRFYGKSQPFGFGNDSYALPNLK---YLTAQQALNDLAWFIQYVKD 147
Query: 171 K--YNAT-HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+ T + P I IG SY G L+ WFR K+PH+ IG+LASSA + Y D +
Sbjct: 148 NQLFGITPNMPWITIGGSYPGALSAWFRYKFPHLTIGALASSAVVNAYADF---YEFDQQ 204
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASK--PDGLAILSKKFKTCTPLKNTTELKDGLDT 285
+S S C Q + + NI K P L F N+T + DG
Sbjct: 205 ISDSLSKNSGNCRQIVHDINVNVTNILKKGTPQQKQQLKAYF-------NSTLITDGDFM 257
Query: 286 VY-SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN 344
Y S+ V + N G +L +A ++ + D + N
Sbjct: 258 FYFSDITVMGVQYGSRVAMCDLLMSNQTFAG--VLQNLATYALQV-GVTPDQYGAYYLRN 314
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY-GVSPRPSWV 403
+ + + W +Q CSE K+ + E ++ + + C Y G P
Sbjct: 315 TTYSHERNARQWYYQVCSEFGWLFTPAKHYPMRSEILTMSYWTEWCNSAYDGAFPNTEVT 374
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
Y+GG DI+ +N+IF+NG DP+ S+ D + +HC+DL
Sbjct: 375 NNYFGGLDIQ------ATNLIFTNGGEDPWQWASKRTPTLPGMQSYIADCDQCAHCVDLR 428
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+D L + R + W ++Y
Sbjct: 429 TPSPNDSPILKEIRNKTLSSFATWKNEFY 457
>gi|428165662|gb|EKX34652.1| hypothetical protein GUITHDRAFT_158798 [Guillardia theta CCMP2712]
Length = 490
Score = 121 bits (303), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 120/439 (27%), Positives = 196/439 (44%), Gaps = 62/439 (14%)
Query: 34 GRIRRSRILEQKDSNH-----GFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN 88
R R + Q+D +H G + +++Q +DHFS PE++ QRY +N F+ G+
Sbjct: 3 ARPYRELRVGQRDRHHATSMDGNNSRWFSQRVDHFSDSPETWM---QRYFVNETFFRMGS 59
Query: 89 SPILAFLGAEAP-IDDNIQLSG-----FTYENAHQFKALIVILEHRYYGKSVPFGSRKAA 142
P+ +G E P + + + ++G A ALI+ LEHRYYG+S P + +
Sbjct: 60 GPVFLCVGGEGPPMTEQVVVTGENHCALMVHLARIHGALILALEHRYYGESHP--RKDLS 117
Query: 143 LKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAPVIAIGASYGGELATWFRLKYPHV 201
++N R + +S QAL D AS HI+ + ++ I G SY G LA W K+PH+
Sbjct: 118 VENMR---FLSSRQALEDIASFHSHIRSAFAISSKQRWITFGGSYPGMLAAWSHAKFPHL 174
Query: 202 VIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKKSWSEIDNIASKP 257
+++SSAP+ I Y++VV+ DF D S C TIK +++++
Sbjct: 175 FHAAVSSSAPV---QAILNMKGYNNVVASDFADETLGGSMLCLNTIKGAFAQVGEYLLSY 231
Query: 258 DGLAILSKKFKTC------TPLKNTTELKDGL-DTVYSEAAQYDTPSNI-PVKRICNAIE 309
+G L +F C +KN + L D + ++ S + ++R C +
Sbjct: 232 EGRRYLKTRFSVCGGDDVLEDIKNRALFAETLSDPLIPQSNDPSCTSPLCDIRRQCKFLT 291
Query: 310 NAPNCGD--DILCKIAAGVVEADSLEYD-----GNNSRCYINEDRTGDESDEGWEWQSCS 362
+ + G D L + V + L+ D I+EDRT D W +Q+C+
Sbjct: 292 DQ-SLGKPLDRLVAMMDSVRDGSCLDTDYQMMLAGLQDIKISEDRT----DRTWFYQTCT 346
Query: 363 EMVVPMGKDKNS----MYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTY-----YGGHDIK 413
E D +S + P N+ CK + +S + YGG +++
Sbjct: 347 EFGFYQTCDPDSRCPFVSSPHLNNVYFSTDMCKVVFNMSFEKTAEFVRESNNEYGGLNLQ 406
Query: 414 LILRRSTSNIIFSNGMRDP 432
+ NIIF NG DP
Sbjct: 407 ------SYNIIFVNGGADP 419
>gi|340500169|gb|EGR27066.1| hypothetical protein IMG5_202470 [Ichthyophthirius multifiliis]
Length = 2048
Score = 120 bits (302), Expect = 1e-24, Method: Composition-based stats.
Identities = 117/479 (24%), Positives = 211/479 (44%), Gaps = 44/479 (9%)
Query: 36 IRRSRILEQKDSNHGFETFFY-----NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP 90
+ SR Q + GF F+ +++HF P + QRY I +++ N
Sbjct: 1580 LNNSRENNQSGGSVGFMNFYQGMQEKQNSLNHFD--PLGLIKWKQRYTIYDEYFNPENGT 1637
Query: 91 ILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFG--SRKAALKNARH 148
+ +G E + G+ + A +F A+++ +EHR+YG S PFG ++ +L+N +
Sbjct: 1638 VFISIGGEGQMAGITNGRGWLIQLAQEFSAIVISVEHRFYGVSQPFGYTNQSYSLENLQ- 1696
Query: 149 RGYFNSAQALADYASILLHIKDKYNATHA-----PVIAIGASYGGELATWFRLKYPHVVI 203
Y Q+LAD A+++ IK K H P I IG SY G ++ WFR KYPH+ +
Sbjct: 1697 --YLTVDQSLADLANLISKIKQK--KLHKISEINPFITIGGSYPGAMSAWFRYKYPHLTV 1752
Query: 204 GSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAIL 263
G+LASSA + I + Y V + + C I++ I+ + L
Sbjct: 1753 GALASSAVV---NAIEDFQMYDYQVYLSTKKSGNWCPLKIQQFNFYIEKTLNNTLYAIKL 1809
Query: 264 SKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA 323
+KF T L N +E + +YS QY + +C+ +N + + ++A
Sbjct: 1810 RQKFNA-TMLTN-SEFLNFFADLYSGLVQYGQRT-----FLCSIFQNTTI--EQQINRLA 1860
Query: 324 -AGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN 382
V ++ Y +++ N +++ W +Q+C+ ++ + + N
Sbjct: 1861 DYSAVNQTAINY---STKTLFNTTYDPNQAQRQWTFQTCTYFGFFQTANQINPMRSSKVN 1917
Query: 383 LTKYIKNCKEQYGVSPRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
L + C++ +G + P +T Y GG L +NI+F+NG D + +
Sbjct: 1918 LRFFEDQCRQVFGQNYVPDISITNSYLGG------LNLEATNIVFTNGSEDGWKWASLTQ 1971
Query: 441 TYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAIN 499
+ +S D + +H ++L K DPD L R+ + + WI Q++ ++ N
Sbjct: 1972 SKGSM-VSLISDCDNCAHGVELGVPKSEDPDNLKNTRRIVKILFKQWIDQHFQKYQKPN 2029
>gi|302790399|ref|XP_002976967.1| hypothetical protein SELMODRAFT_416857 [Selaginella moellendorffii]
gi|300155445|gb|EFJ22077.1| hypothetical protein SELMODRAFT_416857 [Selaginella moellendorffii]
Length = 982
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 132/437 (30%), Positives = 192/437 (43%), Gaps = 75/437 (17%)
Query: 12 LLMVIFV----STSFHANGLKLRPRLGRIRRSRILEQKDSNHGFET--FFYNQTIDHFSY 65
+L+V FV STSF ++ RLG + G ET FF Q +DHF+
Sbjct: 3 VLLVCFVLLATSTSF-SSAFVPASRLGFKPEFLPAGSRSPPRGKETVNFFTRQKLDHFA- 60
Query: 66 GPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVIL 125
PE + F Q+Y+ F+ N PI + E+ N ++ + A F A IV +
Sbjct: 61 -PEDPRVFSQKYLELLDFFRPRNGPIFLVMCGESTCTGNY-VTTYVGTLAESFGAAIVTV 118
Query: 126 EHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGAS 185
EHRYYG S PF + L N + Y S Q+L D+A + D Y A S
Sbjct: 119 EHRYYGHSSPF--QHLNLHNLK---YLTSKQSLFDHAVFI----DYYQAR--------GS 161
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
Y G L+ WFRLK+PH+V GS ASSA + D + + V +C K+
Sbjct: 162 YAGALSAWFRLKFPHLVAGSWASSAVVEAILDYSAYDKQLGV------SVGPKC----KQ 211
Query: 246 SWSEIDNIASKPDGLAILSKKFKTC---TPLKNTTELKDGLDTVYSEAA---QYDTPSNI 299
+ EI + + GL + + K +P N T+ LD V + AA QY
Sbjct: 212 ALQEITRLTEQ--GLVENATEIKYLFGFSPQDNITD-DTLLDYVANAAAGEIQYGK---- 264
Query: 300 PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESD-EGWEW 358
+ +C+ + A ++L K A ++E IN D G+E D E W++
Sbjct: 265 -IDGLCDPLLKAEKSNRNLL-KTYAKILER-------------INNDTNGNERDNESWDF 309
Query: 359 QSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT--YYGGHDIKLI 415
Q C+E+ + D+ S + N +I C EQ+G P T YYGG +I
Sbjct: 310 QYCTEVGYFQVASDRKSSVRSSRINTQFFINYCSEQFGNGTFPDVKTTNLYYGGRNI--- 366
Query: 416 LRRSTSNIIFSNGMRDP 432
+ S I+F NG +DP
Sbjct: 367 ---AGSRIMFLNGSQDP 380
>gi|432892297|ref|XP_004075751.1| PREDICTED: thymus-specific serine protease-like [Oryzias latipes]
Length = 460
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 115/447 (25%), Positives = 196/447 (43%), Gaps = 77/447 (17%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF+ +TF Q++ +N W + P+ F+G E PI L+G E A Q
Sbjct: 58 QPLDHFNR--LKGKTFSQKFFVNEAHWQRPDGPVFLFIGGEGPIFSFDVLAGHHVEMAQQ 115
Query: 118 FKALIVILEHRYYGKSV-PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-AT 175
AL++ +EHR+YG S+ P G + +L + +S QALAD A+ +I +N ++
Sbjct: 116 HGALLLAVEHRFYGDSINPDGLKTESLAD------LSSQQALADLATFHGYICRSFNLSS 169
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
P I+ G SY G L+ WFR K+P +V ++ASSAPI D ++ Y+ VV ++
Sbjct: 170 RNPWISFGGSYSGALSAWFRGKFPSLVYAAVASSAPIKAKLD---FSDYNHVVGLSLKNV 226
Query: 236 ----SEECFQTIKKSWSEIDNIASKPDG-LAILSKKFKTCTPLKN---TTELKDGLDTVY 287
SE+C+ ++++++ ++ A G L+ ++ F C KN EL L ++
Sbjct: 227 AVGGSEKCWAQVQQAFAAVE--AELLTGNLSQVAGDFNCCQIPKNLDDQIELMQNLADLF 284
Query: 288 SEAAQYDTPSN-IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINED 346
QY+ + VK +C+ + N G+D + + +++
Sbjct: 285 MGTVQYNEEGVLLSVKELCDLMTNVS--GEDKDMEAYSRLIKLT---------------- 326
Query: 347 RTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPR--PSWVL 404
Q+C + P NL K C +G+S PS +
Sbjct: 327 ------------QTCEDSTCPFSG---------LINLQAQTKLCTAVFGISQHSLPSNIA 365
Query: 405 ---TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLD 461
+YYGG + T +++ NG DP+ V+ F +D +HC D
Sbjct: 366 FTNSYYGGDE------PHTHRVLYINGGIDPWKELSVVQGGQEVQSVFIEDT---AHCAD 416
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ + D L + R+ K + W+
Sbjct: 417 MSSRRVVKRDSLRRARQEIEKQVSDWL 443
>gi|149068959|gb|EDM18511.1| prolylcarboxypeptidase (angiotensinase C) (predicted) [Rattus
norvegicus]
Length = 193
Score = 120 bits (301), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 67/173 (38%), Positives = 96/173 (55%), Gaps = 8/173 (4%)
Query: 29 LRPRL---GRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG 85
+RPRL G S + ++ Q +DHF + +TF QRY++ K W
Sbjct: 20 IRPRLKTLGSPHLSTNTPDPAVPSSYSVHYFQQKVDHFGF--SDTRTFKQRYLVADKHWQ 77
Query: 86 GGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKN 145
IL + G E I +GF ++ A + KA++V EHRYYGKS+PFG + + K+
Sbjct: 78 RNGGSILFYTGNEGDIVWFCNNTGFMWDVAEELKAMLVFAEHRYYGKSLPFG--RDSFKD 135
Query: 146 ARHRGYFNSAQALADYASILLHIKDKYNATHA-PVIAIGASYGGELATWFRLK 197
++H + S QALAD+A ++ H+K+ T PVIAIG SYGG LA WFR+K
Sbjct: 136 SQHLNFLTSEQALADFAELIRHLKETIPGTEGQPVIAIGGSYGGMLAAWFRMK 188
>gi|308485479|ref|XP_003104938.1| hypothetical protein CRE_24446 [Caenorhabditis remanei]
gi|308257259|gb|EFP01212.1| hypothetical protein CRE_24446 [Caenorhabditis remanei]
Length = 510
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 137/523 (26%), Positives = 221/523 (42%), Gaps = 67/523 (12%)
Query: 11 LLLMVIFVSTSFHA---------NGLKLRPRLGRIRRSRILEQKDSNHGFETFF---YNQ 58
LLL +S F A N ++ RPR G ++++ + + F+T + Q
Sbjct: 7 LLLAACVLSQVFAAPAEEKRIRRNMIRGRPR-GGMKKTPPMSEVSHKINFDTVVSGTFTQ 65
Query: 59 TIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN-IQLSGFTYEN-AH 116
T+DHF +TF QRY N++++ G P LG E P + G N A
Sbjct: 66 TLDHFD--SSVGKTFKQRYWHNNQWYKDGG-PAFLMLGGEGPESSYWVSYPGLEMTNLAA 122
Query: 117 QFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NAT 175
+ A + +EHR+YG++ P + ++ N + Y +SAQA+ D A+ + + +Y
Sbjct: 123 KQGAWVFDIEHRFYGETKP--TSDMSVSNLK---YLSSAQAIEDAAAFIKAMTAQYPQLA 177
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
+A + G SY G LA W R K+P +V ++ SS P+ D Y VV
Sbjct: 178 NARWVTFGGSYSGALAAWTRAKHPELVYAAVGSSGPVQAEVDFKE---YLEVVQNSISRN 234
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK-NTTELKDGLDTVYS---EAA 291
S +C ++ ++ + ++ DG L F C ++ + LK +TVYS E
Sbjct: 235 STDCATSVTAGFNLVASLLKTTDGRKQLKTAFHLCQDIQLDDKSLKYFWETVYSPYMEVV 294
Query: 292 QY--DTPSNIPVK-RICNAIENAPNCGDDILCKIAAGVVEADSL--EYDGNNSRCY---- 342
QY D + + I +AI N + V + +L EY G N Y
Sbjct: 295 QYSGDAAGSFATQLTISHAICRYHNNKATTTIQKMKQVNDYFNLVNEYFGCNDIDYNGFI 354
Query: 343 --INEDRTGD-ESDEGWEWQSCSEMVVPMGKDKNSMYQPEPW-----NLTK--YIKNCKE 392
+ + G+ +SD W WQ+C+E + PW NL YI C
Sbjct: 355 SFMKDTSFGEAQSDRAWVWQTCTEFGY---YQSTASATAGPWFGGVANLPAQYYIDECTA 411
Query: 393 QYGVSPRPSWVLT-------YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
YG + V T YYGG D + +T+ I+ NG DP+ G + + +
Sbjct: 412 IYGAAYNSQEVQTSVDYTNQYYGGRD-----KLNTARILLPNGDIDPWHALGKLTSANSD 466
Query: 446 DLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ +N +HC D+ A D +L R+ ++ GW+
Sbjct: 467 IVPVV--INGTAHCADMYGASSLDSVYLTNARQKISDVLDGWL 507
>gi|224108896|ref|XP_002333330.1| predicted protein [Populus trichocarpa]
gi|222836225|gb|EEE74646.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 124/451 (27%), Positives = 199/451 (44%), Gaps = 58/451 (12%)
Query: 1 MKSSQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTI 60
M+ + + L + ST+ HA L PR S+ + + + ++NQT+
Sbjct: 1 MEMMKLALMSTFLFFLLFSTASHA---YLAPRTLLYSFSKSSTNSNRSLTTKELWFNQTL 57
Query: 61 DHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKA 120
DH+S P + F QRY ++ + PI + E+ D + + A +F A
Sbjct: 58 DHYS--PFDHHKFQQRYYEFLDYFRVPDGPIFLKICGESSCDGIA--NDYIGVLAKKFGA 113
Query: 121 LIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASIL------LHIKDKYNA 174
+V LEHRYYGKS PF S +N R Y +S QAL D A L++K
Sbjct: 114 AVVSLEHRYYGKSSPFKS--TTTENLR---YLSSKQALFDLAVFRQYYQESLNLKLNRTG 168
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
P G SY G L+ WFRLK+PH+ GSLASSA +L + +T + + +
Sbjct: 169 VENPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSAVVL---AVYNYTEFDQQIGE---S 222
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY------S 288
EC ++++ ++ + LA K KT L + EL+ D +Y +
Sbjct: 223 AGAECKAALQETTQLVE------ERLASNKKAVKT---LFDAAELEIDGDFLYFLADAAA 273
Query: 289 EAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN---E 345
A QY P ++C + A G+D L + A V+ D + G + + Y +
Sbjct: 274 TAFQYGNPD-----KLCPPLVQAKKDGED-LVEAYAKYVKEDYVGSFGVSVQTYDQRHLK 327
Query: 346 DRTGDES--DEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPS 401
D T +E+ D W +Q C+E+ N + + ++ CK+ + G+ P
Sbjct: 328 DTTLNENSGDRLWWFQVCTEVAYFQVAPANDSIRSSQVDTRYHLDLCKKVFGEGIYPEVD 387
Query: 402 WVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
YYGG ++ + S I+F+NG +DP
Sbjct: 388 KTNIYYGGTNM------AGSKIVFTNGSQDP 412
>gi|194899984|ref|XP_001979537.1| GG23250 [Drosophila erecta]
gi|190651240|gb|EDV48495.1| GG23250 [Drosophila erecta]
Length = 480
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 119/477 (24%), Positives = 208/477 (43%), Gaps = 58/477 (12%)
Query: 31 PRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP 90
P +R ILE+ + Q +D+F + QT+ RY++N +F G SP
Sbjct: 44 PHQVMTKRVNILEK----------WITQKLDNFD--ASNTQTYQMRYLVNDEFQTQG-SP 90
Query: 91 ILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRG 150
I +LG E I+ ++ +G Y+ A + K ++V EHRYYG+SVP S + +
Sbjct: 91 IFIYLGGEWEIEKSMVSAGHWYDMAEEHKGVLVYTEHRYYGQSVPTSSM-----STDNLK 145
Query: 151 YFNSAQALADYASILLHIK-DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASS 209
Y N QALAD A+ + K + ++ VI G SY + WF+ YP +++G ASS
Sbjct: 146 YLNVKQALADVANFIETFKAENPQLANSKVILAGGSYSATMVVWFKRLYPDLIVGGWASS 205
Query: 210 APILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFK 268
AP+L D +T Y VV + F S++C+ IK +E++++ + G A +
Sbjct: 206 APLLAKVD---FTEYKEVVGQAFLQLGSQQCYDRIKNGIAELESMFANKRG-AEAKAMLR 261
Query: 269 TCTPLKNTTELK-----DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA 323
C + +L + ++S AQY + ++I C+ I + I +
Sbjct: 262 LCDSFDDQNDLDLWTLFSSISNIFSGIAQYQSNNDIVYN--CDYIMTFDDDATAIANFVY 319
Query: 324 AGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNL 383
G ++ S Y ++ W +Q+C+E G ++S +P+
Sbjct: 320 WGWGMGRCIDARYQGSVEYYLWGVENFDASRPWYYQTCNE----YGWYQSSGSSNQPFGT 375
Query: 384 ----TKYIKNCKE----QYG---VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
T YI C + +YG ++ + Y+GG + NI ++G DP
Sbjct: 376 KFPATLYINLCGDVFSSRYGNEQINVNAANTNAYFGG------MEPGVENIYMTHGALDP 429
Query: 433 FSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
++ G H + + N SHC D K +D + + ++ +++ W+
Sbjct: 430 WNPMG-----HGVEQGATIIAN-ASHCADFSSIKPTDSEEMRASKEKLAGLVRQWLA 480
>gi|397626302|gb|EJK68100.1| hypothetical protein THAOC_10754 [Thalassiosira oceanica]
Length = 629
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 115/474 (24%), Positives = 198/474 (41%), Gaps = 66/474 (13%)
Query: 46 DSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNI 105
D + E F+ +Q +DHF S T+ RY +S+++GG PI +G E ++
Sbjct: 142 DHDEDEELFYADQLVDHFD---GSTDTWDNRYYASSRYFGGPGHPIFMVVGGEGSLEK-- 196
Query: 106 QLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
L F E+ A F A +V +EHR+YG P A ++ QA+AD +
Sbjct: 197 MLYPFVNEHLAFHFGAAVVQIEHRFYGPYQPL--PNATVEELTE--LLTPQQAMADMVRL 252
Query: 165 LLHIKDK-----YNAT---HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYG 216
H KD+ Y+ T + PV+++G +Y G L+ FRL + V + ASSAP+ Y
Sbjct: 253 TKHFKDELGCGGYDRTSPEYCPVVSVGGAYPGFLSAMFRLAHGDFVDVAYASSAPLKLYD 312
Query: 217 DITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNT 276
P Y+ V++ S C ++ + E + + + C + +
Sbjct: 313 QSAPQEVYYDTVTRAAERLSPGCADAVRSALVEAEEAIMASPSVESAVRAMNMC--VDSI 370
Query: 277 TELKDGLDTVYSEA--------AQYDTPSNIP-----VKRICNAIENAPNCGDDILCKIA 323
E LDT+ + A +D + P + R C ++A + ++A
Sbjct: 371 PEYIHDLDTLRQDVMMAVAFSFADFDMGAYPPSEELGMHRACRVFQDAAGGHYTSVQRVA 430
Query: 324 ---AGVVEADSLEY---------------------DGNNSRCYINE--DRTGDESDEGWE 357
V E + E DG N+R ++ G + WE
Sbjct: 431 DFFTLVGEDEEFEKKYPQFVGEEGTPCFDLSIFLPDGPNARIATSDWSGSGGGNDGKMWE 490
Query: 358 WQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWVLTYYGGHDIKLI 415
+Q C+ +V P+G + SM+ W + C+ +Y G+ P+P ++ G D +
Sbjct: 491 FQLCTTLVEPIGISEESMFPYRTWTYDDLTEYCQLRYGEGIVPQPLALVRDLGFDD---L 547
Query: 416 LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSD 469
++ + S IIF+NG +D +S +V+ D + + G++ DL SD
Sbjct: 548 VKSNASRIIFTNGKQDMWSGASYVEDVS--DTILALNFENGAYHSDLSHMGPSD 599
>gi|145498935|ref|XP_001435454.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402586|emb|CAK68057.1| unnamed protein product [Paramecium tetraurelia]
Length = 452
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 120/457 (26%), Positives = 196/457 (42%), Gaps = 53/457 (11%)
Query: 54 FFYNQTIDHFSYGPESYQTFPQRY-VINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY 112
+F +Q +DH Y + F QRY V+ F + G I+D I+L F
Sbjct: 27 WFEHQLVDH--YDKLNKNVFHQRYWVVEENFVPETGVVLFQICGEYTCIND-IKLRLFII 83
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI--LLHIKD 170
+ A +F ALI+ILEHRYYGKS+P G K +LK+ R Y ++ QAL D A + +
Sbjct: 84 QLAKEFNALIIILEHRYYGKSMPLG--KESLKDENLR-YLSTRQALDDLAYFQRFMVLNK 140
Query: 171 KYN-ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
K+ + P IAIG SY G LA W+R +YPH+VIG+LASSA + IT + + + +
Sbjct: 141 KHGIKSQNPWIAIGGSYPGALAAWYRYQYPHLVIGALASSAVV---ESITDFKMFDTQIF 197
Query: 230 KDFRDTSEECFQTI----KKSWSEIDNIASKPDGLAILSKKF--KTCTPLKNTTELKDGL 283
+ +C + + K + +I N +K + + F + T L+ D
Sbjct: 198 LSAYKSGPQCAKDVQDMNKYAEQQILNQGTKEE----FKRSFGAEKLTDLEFLFFFADAQ 253
Query: 284 DTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYI 343
+ + + + K I I+ + + E +EY Y+
Sbjct: 254 LLIIQYGGRSELCKQLKDKSITEQID-----------YFRSVIEEGSYMEY----GSYYL 298
Query: 344 NEDRTGDES---DEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG----- 395
D+ + + W +Q CSE+ N+ + ++ Y C +G
Sbjct: 299 KNDKYDENNLTPSRQWMYQCCSELGWWQTSPLNNSVRSTLIDIQFYKDFCNSIFGGIRKN 358
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
+ P +GG+++ + N+I +NG DP+ V L++ +
Sbjct: 359 IFPDDQLANARFGGNEL------NVDNLIMTNGNEDPWKWSS-VLVNQGSILTYEINCEN 411
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
HC++L K D D L Q RK + + WI +Y
Sbjct: 412 SGHCVELYTPKDEDCDQLKQARKDIISQFRKWIQDHY 448
>gi|149029309|gb|EDL84576.1| protease, serine, 16 (thymus), isoform CRA_a [Rattus norvegicus]
Length = 454
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 98/389 (25%), Positives = 171/389 (43%), Gaps = 50/389 (12%)
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+ F QRY +N + G ++P+ +G E + ++G A + AL++ LEHR+Y
Sbjct: 74 RIFLQRYWVNDQHRTGQDAPVFLHIGGEGSLGPGSVMTGHPAALAPAWGALVISLEHRFY 133
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAPVIAIGASYGGE 189
G S+P G AL + Y +S ALAD AS + N ++ +P I G SY G
Sbjct: 134 GLSMPAGGLDVALLH-----YLSSRHALADVASARQALSGLLNVSSSSPWICFGGSYAGS 188
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT----SEECFQTIKK 245
LATW RLK+PH+V ++ASSAP+ + ++ Y+ VV++ S EC
Sbjct: 189 LATWTRLKFPHLVFAAVASSAPL---NAVVDFSAYNQVVARSLTQVTIGGSLECLAAAST 245
Query: 246 SWSEIDNIASKPDGL---AILSKKFKTCTPL---KNTTELKDGLDTVYSEAAQYDTPSNI 299
+++E++ + GL A+L ++ C L ++ EL L + QYD +
Sbjct: 246 AFAEVERLLRA--GLASQAVLREELGACGSLDLIEDQAELLGALQALVGGTVQYDGQAGS 303
Query: 300 P--VKRICNAI--------ENAPNCG-----DDILCKIAAGVVEADSLEYDGNNSRCYIN 344
P V+++C + + P G +L + + + E ++
Sbjct: 304 PLSVRQLCGLLLRTWGNRNHSTPYLGLRRAVQRVLRSMGQKCLSSSRAE---TVAQLRNT 360
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP---- 400
E + D W +Q+C+E + + + L + C++ +G+SP
Sbjct: 361 EPQVSGVGDRQWLYQTCTEFGFYVTCEGLQCPFSQLPALPFQLDLCEQVFGLSPSSVAQA 420
Query: 401 -SWVLTYYGGHDIKLILRRSTSNIIFSNG 428
+ +YYGG + ++F NG
Sbjct: 421 VAQTNSYYGGQS------PGATQVMFVNG 443
>gi|262176822|gb|ACY27467.1| serine protease Pro1 [Plasmodiophora brassicae]
Length = 467
Score = 119 bits (298), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 120/457 (26%), Positives = 184/457 (40%), Gaps = 51/457 (11%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAE---APIDDNIQLS 108
E F +Q +DH + G + F QRY +FW G + P++ L E A + D Q
Sbjct: 23 EHVFPDQLVDHTAIGGGGAR-FLQRYFRIDQFWSGPDGPVILQLCGEYTCAGVTDGRQ-- 79
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
F A ++ AL+++LEHRY+GKS PF ++ + R+ Y + QAL+D A
Sbjct: 80 -FPSALAERYGALVLVLEHRYFGKSSPF-----SVLSPRNLTYLTTFQALSDIACF---- 129
Query: 169 KDKYNATHA------PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
D Y H I IG SY G LA W+RLKYPH+ G+LASSA + P+
Sbjct: 130 TDWYQRVHIGRANANKWITIGGSYPGALAAWYRLKYPHLTAGALASSAV------VAPFA 183
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG 282
+ + C ++ + ++ + LA K +C+ L + L
Sbjct: 184 EFPEFDEQVASSAGPACTHALQDITAMVEGALQEGGRLADEMKALFSCSQLSDADFLYLI 243
Query: 283 LDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI--LCKIAAGVVEADSLEYDGN-NS 339
D + +EA QY +C+ I A + D + + G A G+ +S
Sbjct: 244 ADAM-AEAIQYGHR-----LALCDPIVQAESRDDRLAAFVEFVQGTFYASMSNSPGDYDS 297
Query: 340 RCYINEDRTGDESDEGWEWQSCSEM----VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG 395
+ D S W W C+E+ + P S W+ + K +
Sbjct: 298 DTMASHRWVPDSSGRQWWWMKCNEVGWFQIAPGTNSIRSKRVNMEWHRDRCEKLFGDVLA 357
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
P Y G D+ S SN++F+NG+ DP+ G L S L +
Sbjct: 358 FPPPCHRASIEYSGFDM------SVSNVVFTNGVEDPWQWAGASAFSSSAHLRDSSVLLI 411
Query: 456 G----SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
HC+DL D L R T + + W+
Sbjct: 412 NCSQCGHCVDLHTPSPDDAPALTTARSTIIAHIDRWL 448
>gi|195497936|ref|XP_002096312.1| GE25600 [Drosophila yakuba]
gi|194182413|gb|EDW96024.1| GE25600 [Drosophila yakuba]
Length = 480
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 115/481 (23%), Positives = 212/481 (44%), Gaps = 52/481 (10%)
Query: 29 LRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN 88
L L + R L+ + + Q +D+F + QT+P RY++N +F G
Sbjct: 32 LVKTLKNLNRGPPLQVMTKRVNVQEKWITQKLDNFD--ASNSQTYPMRYLVNDEFQTEG- 88
Query: 89 SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARH 148
SPI +LG E I++++ +G Y+ A + K ++V EHRYYG+SVP + + +
Sbjct: 89 SPIFIYLGGEWEIENSMVSAGHWYDMAEEHKGVLVYTEHRYYGQSVP-----TSTMSTDN 143
Query: 149 RGYFNSAQALADYASILLHIK-DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLA 207
Y + QALAD A + K + +++ VI G SY + WF+ YP +++G A
Sbjct: 144 LKYLDVKQALADVAVFIETFKAENPQLSNSKVILAGGSYSATMVVWFKRLYPELIVGGWA 203
Query: 208 SSAPILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKK 266
SSAP+L D T Y VV + F ++C+ I+ +E++++ + G A
Sbjct: 204 SSAPLLAKVDFTE---YKEVVGQAFLQLGGQKCYDRIENGIAELESMFANKRG-AEAKAM 259
Query: 267 FKTCTPLKNTTELK-----DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCK 321
+ C + +L + +++ AQY + ++I C+ + + + I
Sbjct: 260 LRLCNSFDDQNDLDLWTLFSSISNIFAGVAQYQSGNDISYN--CDYLLSFNDDATAIANY 317
Query: 322 I--AAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPE 379
+ A G+ Y+G+ D G + W +Q+C+E G ++S + +
Sbjct: 318 VYWAWGMGTCIDARYEGSVEYYLWGVDNFG--ASRPWYYQTCNE----YGWYQSSGSRNQ 371
Query: 380 PWNL----TKYIKNCKE----QYG---VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNG 428
P+ T Y C + QYG ++ + Y+GG + N+ ++G
Sbjct: 372 PFGTKFPATLYTNLCGDVFSSQYGNEQININAANTNEYFGG------MEPDVENVYMTHG 425
Query: 429 MRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
DP++ G H + + N SHC D K +D + + ++ +++ W+
Sbjct: 426 ALDPWNPMG-----HGVEQGATLIAN-ASHCADFGSIKSTDSEEMRASKEKLAGLVRQWL 479
Query: 489 T 489
Sbjct: 480 A 480
>gi|222615542|gb|EEE51674.1| hypothetical protein OsJ_33022 [Oryza sativa Japonica Group]
Length = 184
Score = 119 bits (297), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 59/130 (45%), Positives = 84/130 (64%), Gaps = 1/130 (0%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF+ P S TF QRY++N FWGG +P+ + G E + +GF +E A +F+
Sbjct: 41 LDHFNELPASNGTFRQRYLVNGTFWGGAAAPVFVYAGNEGDVALFASNTGFMWEAAPRFR 100
Query: 120 ALIVILEHRYYGKSVPF-GSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
A++V +EHRYYG+S+PF G+R AA +A GY +AQALAD+A ++L +K A AP
Sbjct: 101 AMLVFVEHRYYGESLPFGGTRAAAFADASAAGYLTTAQALADFAELILSLKSNLTACKAP 160
Query: 179 VIAIGASYGG 188
V+ G SYGG
Sbjct: 161 VVIFGGSYGG 170
>gi|268576170|ref|XP_002643065.1| Hypothetical protein CBG22982 [Caenorhabditis briggsae]
Length = 509
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 140/500 (28%), Positives = 213/500 (42%), Gaps = 58/500 (11%)
Query: 25 NGLKLRPRLGRIRRSRI--LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSK 82
N ++ RPR G + + +E + + + QT+DHF + +TF QRY N++
Sbjct: 30 NMIRGRPRGGMKKTPPMSSVEHLINKDTVVSSTFTQTLDHFD--SSNTKTFQQRYYHNNQ 87
Query: 83 FWGGGNSPILAFLGAEAPIDDN-IQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRK 140
++ G P LG E P + G N A + A + +EHR+YG++ P +
Sbjct: 88 WYKDGG-PAFLMLGGEGPESSYWVSYPGLEITNLAAKQGAWVFDIEHRFYGETKP--TSD 144
Query: 141 AALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYP 199
++ N + Y +SAQA+ D A+ + + KY +A + G SY G LA W R K+P
Sbjct: 145 MSVPNLK---YLSSAQAIEDAATFIKAMTLKYPQLKNAKWVTFGGSYSGALAAWTRAKHP 201
Query: 200 HVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDG 259
+V ++ SS P+ D + Y VV S C ++ ++ + + DG
Sbjct: 202 ELVYAAVGSSGPVQAEVD---FKEYLEVVQNSITRNSTACAASVTAGFNLVAQLLQTTDG 258
Query: 260 LAILSKKFKTCTPLK-NTTELKDGLDTVYS---EAAQY--DTPSNIPVK-RICNAI-ENA 311
L F TC ++ + LK +TVYS E QY D + I NAI +
Sbjct: 259 RKQLKTAFHTCQDIQLDDKNLKYFWETVYSPYMEIVQYSGDAAGAFATQLTISNAICKYH 318
Query: 312 PNCGDDILCKIAAGVVEADSL--EYDGNNSRCY------INEDRTG-DESDEGWEWQSCS 362
N D L K+ V + +L EY G N Y + + G +SD W WQ+C+
Sbjct: 319 LNAKTDTLTKMKQ-VNDYFNLVQEYYGCNDINYQAFIDFMADTSFGYAQSDRAWVWQTCT 377
Query: 363 EMVVPMGKDKNSMYQPEPW-----NLTK--YIKNCKEQYGVSPRPSWVLT-------YYG 408
E S PW NL YI C YG + V T YYG
Sbjct: 378 EFGY---YQSTSSATAGPWFGGNANLPAQYYIDECTAIYGGAYNSQEVQTAVDYTNQYYG 434
Query: 409 GHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKS 468
G D +TS I+ NG DP+ G + T D+ +N +HC D+ A
Sbjct: 435 GRD-----GLTTSRILLPNGDIDPWHALGKL-TSSTADI-VPVVINGTAHCADMYGASSH 487
Query: 469 DPDWLVQQRKTEVKIMQGWI 488
D +L R+ ++ GW+
Sbjct: 488 DSIYLTNARQKISDVLDGWL 507
>gi|145491267|ref|XP_001431633.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398738|emb|CAK64235.1| unnamed protein product [Paramecium tetraurelia]
Length = 464
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 124/476 (26%), Positives = 205/476 (43%), Gaps = 54/476 (11%)
Query: 33 LGRI-RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQR-YVINSKFWGGGNSP 90
LGR+ +R+ + ++ T ++ Q +DH P S + F QR ++ N
Sbjct: 14 LGRLTQRNDLFGKEQIVKAAITEWFTQKLDH--NDPTSQEVFKQRVHIYNEYVKDDQPEA 71
Query: 91 ILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRG 150
++ ++ E D + G T++ A Q A++++LEHRYYG+S PF +
Sbjct: 72 VILYICGEWTCDGIGK--GLTFDAAQQLNAVVLVLEHRYYGQSQPFEDWSTP-----NLK 124
Query: 151 YFNSAQALADYASILLHIKDKYN---ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLA 207
Y N QAL D A + IK N P I +G SY G L+ WFR KYPH+ IG LA
Sbjct: 125 YLNIHQALDDIAYFITSIKANGNYNIKPDTPWIHLGGSYPGALSAWFRYKYPHLTIGGLA 184
Query: 208 SSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI-DNIASKPDGLAILSKK 266
SSA + + + Y V ++S EC I++ +I D +A PD + K
Sbjct: 185 SSAVV---RAVACYHEYDMQVYLSALESSTECADRIQQVNQKIEDELARDPDAI----KA 237
Query: 267 FKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGV 326
+ L++ E + +Y+ Q S ++C+ + +
Sbjct: 238 AFGASELQD-IEFLSMIADIYAGMVQGRKRS-----KMCDRLAKGST--------VEEWF 283
Query: 327 VEADSLEYDGNNSRCYINE---DRTGD--ESDEGWEWQSCSEMVVPMGKDKNSMYQPEPW 381
+E + + + Y +E D T D +S W +Q+C E+ + N+
Sbjct: 284 LEVKDMARETVDQESYGSEFLRDITIDFSKSSRQWTYQTCIEVGYFQTANPNAEQSTRSQ 343
Query: 382 NLTK--YIKNCKEQYGVSPRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
L + + C+ Y + P T Y+GG DI + ++IFSNG DP+
Sbjct: 344 ELVLDFFRQLCEYSYDIPIFPDEDRTNAYFGGLDINV------DHLIFSNGSDDPWQHAS 397
Query: 438 WVKTYH--FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
K +D+ + + + SHC+DL + DP L + R+ + Q WI +Y
Sbjct: 398 ITKWKQGKEYDVKYIKCKDC-SHCIDLRASSPEDPPELTKARQEILATFQQWINEY 452
>gi|242039115|ref|XP_002466952.1| hypothetical protein SORBIDRAFT_01g017310 [Sorghum bicolor]
gi|241920806|gb|EER93950.1| hypothetical protein SORBIDRAFT_01g017310 [Sorghum bicolor]
Length = 510
Score = 118 bits (296), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 123/470 (26%), Positives = 194/470 (41%), Gaps = 69/470 (14%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP---IDDNIQLS 108
E + NQT+DHF+ P ++ F QRY ++ N PI ++ E+ I +N
Sbjct: 58 EEHWMNQTLDHFN--PTDHRQFKQRYYEFLDYYRAPNGPIFLYICGESSCSGIGNN---- 111
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
+ A +F A +V EHRYYGKS PF S + + +S QAL D A +
Sbjct: 112 -YLAVMAKKFGAALVSPEHRYYGKSSPFNSL-----TTENLQFLSSKQALFDLAVFRQYY 165
Query: 169 KDKYNATH------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
++ NA + + G SY G L+ WFRLK+PH+ GSLASS +L + T +
Sbjct: 166 QETLNAKYNRSGADSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVLAVYNFTDFD 225
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPLKNTTELKD 281
+ EC +++ +D + S + + K+ L+N +
Sbjct: 226 KQIGI------SAGPECKAALQEITGLVDGQLQSGRNSV----KELFGAPKLENDGDFLY 275
Query: 282 GLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAA-------GVVEADSLEY 334
L + A QY P +C+ + A G D++ A+ G A Y
Sbjct: 276 LLADAAAIAFQYGNPD-----VLCSPLAEAKKNGTDLVETFASYVKDYYIGKFGASVASY 330
Query: 335 DGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY 394
D Y+ + S W +Q CSE+ KN + + ++ C+ +
Sbjct: 331 DQE----YLKNTTPAESSYRLWWYQVCSEVAYFQVAPKNDSVRSPKIDTRYHLDLCRNVF 386
Query: 395 GVSPRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD 452
G P +T YYGG I + S I+F+NG +DP+ K+ +
Sbjct: 387 GEGVYPDVFMTNLYYGGTGI------AGSKIVFANGSQDPWRHASKQKSSDELPSYLIEC 440
Query: 453 LNLGSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
N G HC DL D +K S P+ L + RK V + W+++
Sbjct: 441 ENCG-HCSDLSGCPQAPSNIEGDSSKCSPPESLNKVRKEIVDHIDLWLSE 489
>gi|294881794|ref|XP_002769498.1| Dipeptidyl-peptidase 2 precursor, putative [Perkinsus marinus ATCC
50983]
gi|239872962|gb|EER02216.1| Dipeptidyl-peptidase 2 precursor, putative [Perkinsus marinus ATCC
50983]
Length = 455
Score = 118 bits (296), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 105/396 (26%), Positives = 174/396 (43%), Gaps = 39/396 (9%)
Query: 57 NQTIDHFSYG----PESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY 112
Q DHFS+G P +Q QRY F+ G P+ ++G E P++ + +G +
Sbjct: 34 TQDRDHFSFGEGGNPGKFQ---QRYFTFKDFYRPGG-PLFFYVGNEGPVEIYVNHTGLMW 89
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
E A +V EHRYYGK+ + Y QALADY S++++ +
Sbjct: 90 ELGSDLGAFLVFAEHRYYGKTQVYSDGTPDCLR-----YLTIEQALADY-SVMINTYTRI 143
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW---TTYHSVVS 229
++ IA G SYGG LA+ FR KYPH++ G++A+SAPI G +TP T ++ +++
Sbjct: 144 ASSLIATIAFGGSYGGMLASAFRYKYPHIIDGAIAASAPIFAIGGVTPEPSKTAFNEIIT 203
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
RD C Q + S + N + + + + + N + + +
Sbjct: 204 ---RDAGPVCAQRWCDNSSHL-NSSDLANQMVAWATAPWAYLAMGNFPFPSNYITAAMNV 259
Query: 290 AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS---RCYINED 346
D P++ PV+ C E + E+ S+ Y+ + + D
Sbjct: 260 GGGADLPAH-PVRVACEPFERLDTMPTKKEGAHIRALAESLSIYYNASGELACNSFAETD 318
Query: 347 RTGDESDEG-----WEWQSCSEMVVPMGKD----KNSMYQPEPWNLTKYIKNCKEQYGVS 397
G EG + +Q+C+EM P G+D ++ + P ++ Y C ++YGV+
Sbjct: 319 GGGAPIPEGSCKGDYGFQTCTEM--PWGQDSGTDRDMFWPPSEFDPDDYKAECLQKYGVT 376
Query: 398 PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPF 433
+ L + L S SN+ FSNG DP+
Sbjct: 377 TKAWAGLQFLRNMADAL---ASMSNVFFSNGKFDPW 409
>gi|449499944|ref|XP_004160961.1| PREDICTED: probable serine protease EDA2-like isoform 2 [Cucumis
sativus]
Length = 486
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 123/441 (27%), Positives = 186/441 (42%), Gaps = 59/441 (13%)
Query: 10 WLLLMVI-FVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPE 68
WL++M + F S +F + + R L R+ + + ++NQT+DHFS P
Sbjct: 11 WLVMMALAFSSPAFISAHVTPRTVLHRLSSTSSFLNRTE------LWFNQTLDHFS--PY 62
Query: 69 SYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHR 128
++ F QRY ++ + PI + E P N + + A +F A IV LEHR
Sbjct: 63 NHDKFQQRYYEFLDYFRIPDGPIFLKICGEGPC--NGISNDYLGVLAKKFGAAIVSLEHR 120
Query: 129 YYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT-----HAPVIAIG 183
YYGKS PF S N R Y +S QAL D A + +D N P G
Sbjct: 121 YYGKSSPFKSLTT--NNLR---YLSSKQALFDLAVFRQYYQDSLNLKLNKKGENPWFFFG 175
Query: 184 ASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTI 243
SY G L+ WFRLK+PH+ GSLASSA +L + +T + + + EC +
Sbjct: 176 VSYPGALSAWFRLKFPHLTCGSLASSAVVL---AVYNFTEFDQQIGE---SAGPECKAVL 229
Query: 244 KKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY--SEAA----QYDTPS 297
+++ I+ K L EL+ D Y ++AA QY P
Sbjct: 230 QETNRLIEQ---------RFETNKKEVKALFGAGELEIDGDFFYLLADAAVIAFQYGNPD 280
Query: 298 NIPVKRICNAIENAPNCGDDILCKIAAGV----VEADSLEYDGNNSRCYINEDRTGDESD 353
+C+ + A N G+D++ A V + + N + N D +D
Sbjct: 281 T-----LCSPLVQAKNAGNDLVDAYAKYVKDYYIGSFGSSVQTYNQKYLKNTTPGEDSAD 335
Query: 354 EGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWVLTYYGGHD 411
W +Q C+E+ N + + ++ CK + GV P YYGG
Sbjct: 336 RLWWFQVCTEVAYFQVAPANDSMRSSKVDAKYHLDLCKNVFGEGVYPDVDTTNIYYGGTG 395
Query: 412 IKLILRRSTSNIIFSNGMRDP 432
I + S I+F+NG +DP
Sbjct: 396 I------AGSKIVFTNGSQDP 410
>gi|324509651|gb|ADY44052.1| Serine protease, partial [Ascaris suum]
Length = 526
Score = 118 bits (295), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 127/484 (26%), Positives = 193/484 (39%), Gaps = 73/484 (15%)
Query: 45 KDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID-- 102
K S E F Q +DHF+ +T+ QR N F+ S + +G E+ I+
Sbjct: 55 KASYPNVEEFTIIQPLDHFN--KSDTRTWEQRVQYNPMFYNN-QSVVFVLIGGESMINQK 111
Query: 103 ----DNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL 158
+N+ + + A +F A LEHR++G S PF A Y + QAL
Sbjct: 112 WVGNENVSMMQW----AKEFGAAAFQLEHRFFGYSRPFPLVLTMTTEAL--VYCTTEQAL 165
Query: 159 ADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDI 218
AD A + + KY+ + + G SY G L+ WFR KYP + +G++ASSAP+ D
Sbjct: 166 ADLAEFIQQMNAKYSFVNPRWVTFGGSYPGSLSAWFRSKYPQLTVGAVASSAPLNLKLD- 224
Query: 219 TPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT- 277
+ Y VV R+T EC ++ + + I+ I G L++ F P +
Sbjct: 225 --FYEYSMVVENVLRETDPECHWRVENAIAYIEKIMLTSTGRQQLNQVFNLQPPFDEASV 282
Query: 278 ------ELKDGLDTVYSEAAQYD-------TPSNIPVKRICNAIENAPNCGDDILCKIAA 324
L T++ QY T + V+ +CN + AP D+ L ++ A
Sbjct: 283 TPLTLHNFMSNLYTMFQGIVQYTYDGRNEHTMGGMNVRNLCNTVTKAP--ADEPLQQMRA 340
Query: 325 GVVEADSL-----EYDGNNSRCYINE----------DRTGDESDE-----GWEWQSCSEM 364
+ +S GN S C + T DES + GW W C+E+
Sbjct: 341 VMDFVNSFYPQTGNCVGNVSACTFANSYEDIISLYGNVTYDESTDNAAYRGWMWLCCNEI 400
Query: 365 VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL-------TYYGGHDIKLILR 417
D+ E L YI C + +G S + YYG +
Sbjct: 401 GFLQTTDQGKNIFGEMLPLNFYIDMCTDLFGPSVNIETIAKGNAAAQKYYGRAE-----H 455
Query: 418 RSTSNIIFSNGMRDPFSRGGWV---KTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLV 474
+N+I NG DP+ G KT H L +N +HC D+ A +P L
Sbjct: 456 YKATNVILPNGSLDPWHALGTYTEDKTTHQIPLL----INGTAHCADMYPAYPDEPASLP 511
Query: 475 QQRK 478
R+
Sbjct: 512 AARE 515
>gi|449457546|ref|XP_004146509.1| PREDICTED: probable serine protease EDA2-like [Cucumis sativus]
gi|449499940|ref|XP_004160960.1| PREDICTED: probable serine protease EDA2-like isoform 1 [Cucumis
sativus]
Length = 489
Score = 118 bits (295), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 123/441 (27%), Positives = 186/441 (42%), Gaps = 59/441 (13%)
Query: 10 WLLLMVI-FVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPE 68
WL++M + F S +F + + R L R+ + + ++NQT+DHFS P
Sbjct: 11 WLVMMALAFSSPAFISAHVTPRTVLHRLSSTSSFLNRTE------LWFNQTLDHFS--PY 62
Query: 69 SYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHR 128
++ F QRY ++ + PI + E P N + + A +F A IV LEHR
Sbjct: 63 NHDKFQQRYYEFLDYFRIPDGPIFLKICGEGPC--NGISNDYLGVLAKKFGAAIVSLEHR 120
Query: 129 YYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT-----HAPVIAIG 183
YYGKS PF S N R Y +S QAL D A + +D N P G
Sbjct: 121 YYGKSSPFKSLTT--NNLR---YLSSKQALFDLAVFRQYYQDSLNLKLNKKGENPWFFFG 175
Query: 184 ASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTI 243
SY G L+ WFRLK+PH+ GSLASSA +L + +T + + + EC +
Sbjct: 176 VSYPGALSAWFRLKFPHLTCGSLASSAVVL---AVYNFTEFDQQIGE---SAGPECKAVL 229
Query: 244 KKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY--SEAA----QYDTPS 297
+++ I+ K L EL+ D Y ++AA QY P
Sbjct: 230 QETNRLIEQ---------RFETNKKEVKALFGAGELEIDGDFFYLLADAAVIAFQYGNPD 280
Query: 298 NIPVKRICNAIENAPNCGDDILCKIAAGV----VEADSLEYDGNNSRCYINEDRTGDESD 353
+C+ + A N G+D++ A V + + N + N D +D
Sbjct: 281 T-----LCSPLVQAKNAGNDLVDAYAKYVKDYYIGSFGSSVQTYNQKYLKNTTPGEDSAD 335
Query: 354 EGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWVLTYYGGHD 411
W +Q C+E+ N + + ++ CK + GV P YYGG
Sbjct: 336 RLWWFQVCTEVAYFQVAPANDSMRSSKVDAKYHLDLCKNVFGEGVYPDVDTTNIYYGGTG 395
Query: 412 IKLILRRSTSNIIFSNGMRDP 432
I + S I+F+NG +DP
Sbjct: 396 I------AGSKIVFTNGSQDP 410
>gi|51968542|dbj|BAD42963.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
Length = 462
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 122/424 (28%), Positives = 178/424 (41%), Gaps = 60/424 (14%)
Query: 29 LRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN 88
L+PR R S L + + ++NQT+DH+S P ++ F QRY +
Sbjct: 1 LQPR----RISHGLTESSKYLTRDELWFNQTLDHYS--PSDHREFKQRYYEYLDHLRVPD 54
Query: 89 SPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARH 148
PI + E P N + + A +F A IV LEHRYYGKS PF S A +N +
Sbjct: 55 GPIFMMICGEGPC--NGIPNDYITVLAKKFDAGIVSLEHRYYGKSSPFKS--LATENLK- 109
Query: 149 RGYFNSAQALADYASILLHIKDKYNA-------THAPVIAIGASYGGELATWFRLKYPHV 201
Y +S QAL D A+ + +D N P GASY G L+ WFRLK+PH+
Sbjct: 110 --YLSSKQALFDLAAFRQYYQDSLNVKFNRSGDVENPWFFFGASYSGALSAWFRLKFPHL 167
Query: 202 VIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLA 261
GSLASSA + + + + E K + E + + GL
Sbjct: 168 TCGSLASSAVVRAVYEFPEFD----------QQIGESAGPECKAALQETNKLLEL--GLK 215
Query: 262 ILSKKFKTCTPLKNTTELKDGLDTVY------SEAAQYDTPSNIPVKRICNAIENAPNCG 315
+ ++ K L N TEL D +Y A QY P ++C + A
Sbjct: 216 VNNRAVKA---LFNATELDVDADFLYLIADAEVMAIQYGNPD-----KLCVPLVEAQKNR 267
Query: 316 DDILCKIAAGVVEADSLEYDGNNSRCY-----INEDRTGDESDEGWEWQSCSEMVVPMGK 370
DD L + A V + G +S+ Y ++ T + +D W +Q C+E+
Sbjct: 268 DD-LVEAYAKYVREFCVGVFGLSSKTYSRKHLLDTAVTPESADRLWWFQVCTEVAYFQVA 326
Query: 371 DKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNG 428
N + N ++ CK + GV P YYG I + + IIF+NG
Sbjct: 327 PANDSIRSHQINTEYHLDLCKSLFGKGVYPEVDATNLYYGSDRI------AATKIIFTNG 380
Query: 429 MRDP 432
+DP
Sbjct: 381 SQDP 384
>gi|119608755|gb|EAW88349.1| dipeptidyl-peptidase 7 [Homo sapiens]
Length = 327
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 90/317 (28%), Positives = 149/317 (47%), Gaps = 28/317 (8%)
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
L+ + R+KYPH+V G+LA+SAP+L + + V+ DF S +C Q +++++ +
Sbjct: 2 LSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTADFEGQSPKCTQGVREAFRQ 61
Query: 250 IDNIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVYSEAAQYDTPSNIPVKRICN 306
I ++ + + +F TC PL K+ T+L ++ A D P P +
Sbjct: 62 IKDLFLQ-GAYDTVRWEFGTCQPLSDEKDLTQLFMFARNAFTVLAMMDYP--YPTDFLGP 118
Query: 307 AIENAPNCGDDILCKIA---AGVVEADSLEYDGNNSR-CY-------INEDRTG---DES 352
N G D L A G+ L Y+ + S CY D TG
Sbjct: 119 LPANPVKVGCDRLLSEAQRITGLRALAGLVYNASGSEHCYDIYRLYHSCADPTGCGTGPD 178
Query: 353 DEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHD 411
W++Q+C+E+ + + M+ P+ + C + +GV PRP W+LT + G D
Sbjct: 179 ARAWDYQACTEINLTFASNNVTDMFPDLPFTDELRQRYCLDTWGVWPRPDWLLTSFWGGD 238
Query: 412 IKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPD 471
+ R+ SNIIFSNG DP++ GG + ++ + + G+H LDL + DP
Sbjct: 239 L-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIAVT--IQGGAHHLDLRASHPEDPA 291
Query: 472 WLVQQRKTEVKIMQGWI 488
+V+ RK E I+ W+
Sbjct: 292 SVVEARKLEATIIGEWV 308
>gi|341893031|gb|EGT48966.1| hypothetical protein CAEBREN_19097 [Caenorhabditis brenneri]
Length = 511
Score = 117 bits (294), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 129/502 (25%), Positives = 214/502 (42%), Gaps = 62/502 (12%)
Query: 25 NGLKLRPRLGRIRRSRILEQKDSNHGFETFF---YNQTIDHFSYGPESYQTFPQRYVINS 81
N ++ RPR G ++++ + F++ + QT+DHF + +TF QRY N+
Sbjct: 32 NMIRGRPR-GGMKKTPPMSTVSHLINFDSVVSSTFTQTLDHFD--SSNGKTFQQRYYHNN 88
Query: 82 KFWGGGNSPILAFLGAEAPIDDN-IQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSR 139
+++ G P LG E P + G N A + A + +EHR+YG++ P +
Sbjct: 89 QWYKDG-GPAFLMLGGEGPESSYWVSYPGLEITNLAAKQNAWVFDIEHRFYGETKP--TS 145
Query: 140 KAALKNARHRGYFNSAQALADYASILLHIKDKYNA-THAPVIAIGASYGGELATWFRLKY 198
++ N + Y +SAQA+ D A+ + +K KY +A + G SY G LA W R K+
Sbjct: 146 DMSVSNLK---YLSSAQAIEDAAAFITAMKIKYPMLANAKWVTFGGSYSGALAAWTRAKH 202
Query: 199 PHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPD 258
P +V ++ SS P+ D + Y VV S C ++ + ++ + ++ D
Sbjct: 203 PELVYAAVGSSGPVQAEVD---FKEYLEVVQNSITRNSTACADSVTQGFNLVASLLKTAD 259
Query: 259 GLAILSKKFKTCTPLK-NTTELKDGLDTVYS---EAAQY--DTPSNIPVK-RICNAI-EN 310
G L F C +K + LK +TVYS E QY D + I +AI
Sbjct: 260 GRKQLKSAFHLCQDIKLDDKSLKYFWETVYSPYMEVVQYSGDAAGAFATQLTISHAICRY 319
Query: 311 APNCGDDILCK---------IAAGVVEADSLEYDGNNSRCYINEDRTGD-ESDEGWEWQS 360
N G + K + + ++Y+G + ++ + G+ +SD W WQ+
Sbjct: 320 HLNSGSTTIQKMKQVNDYFNLVNDYFGCNDIDYNGFIN--FMKDTSFGEAQSDRAWVWQT 377
Query: 361 CSEMVVPMGKDKNSMYQPEPW-----NLTK--YIKNCKEQYG-------VSPRPSWVLTY 406
C+E + PW NL YI C YG V + Y
Sbjct: 378 CTEFGY---YQSTASATAGPWFGGVANLPAQYYIDECTAIYGAAYNTQEVQNAVDYTNQY 434
Query: 407 YGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAK 466
YGG D +T I+ NG DP+ G + + + + +N +HC D+ A
Sbjct: 435 YGGRD-----NLNTDRILLPNGDIDPWHALGKLNSNNTNIVPVV--INGTAHCADMYGAS 487
Query: 467 KSDPDWLVQQRKTEVKIMQGWI 488
D +L R+ ++ GW+
Sbjct: 488 SLDSIYLTNARQKIANVLAGWL 509
>gi|344292448|ref|XP_003417939.1| PREDICTED: thymus-specific serine protease-like [Loxodonta
africana]
Length = 574
Score = 117 bits (293), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 75/247 (30%), Positives = 121/247 (48%), Gaps = 10/247 (4%)
Query: 37 RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLG 96
RR+R + H + + Q +DHFS S + +PQRY N F+ G L G
Sbjct: 23 RRTRGAYKSKGMHPSDDGSFQQKLDHFS--KNSSELWPQRYFFNDVFYKPGGPVFLLIGG 80
Query: 97 AEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQ 156
++ + I + A + AL+++LEHR+YG S P G+ A + H Y +S Q
Sbjct: 81 SDTICESWISTNNTWVSYAERLGALLILLEHRFYGHSQPTGNVSTA---SLH--YLSSRQ 135
Query: 157 ALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYG 216
ALAD + + I +K T +A G SYGG LA W R+K+P + ++ SSAP+
Sbjct: 136 ALADIVNFRIKIAEKVGLTKNKWVAFGCSYGGSLAVWSRIKHPDLFAAAVGSSAPMQAKA 195
Query: 217 DITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNT 276
+ + Y VV + + +CFQ +K+++ ++ + L K F C PLK+
Sbjct: 196 N---FYEYLEVVQRSLITHNRQCFQAVKEAFGQVMKMLRLRKYHRQLRKDFTLCKPLKHY 252
Query: 277 TELKDGL 283
+E+ +
Sbjct: 253 SEMDKAV 259
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 45/137 (32%), Positives = 68/137 (49%), Gaps = 7/137 (5%)
Query: 151 YFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
Y +S QALAD + + I +K T +A G SYGG LA W R+K+P + ++ SSA
Sbjct: 388 YLSSRQALADIVNFRIKIAEKMGLTKNKWVAFGCSYGGSLAVWSRIKHPDLFAAAVGSSA 447
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
PI + + Y VV + + +CFQ +K+++ EI + + L F+ C
Sbjct: 448 PIQAKAN---FYEYLEVVQRSLTTHNNKCFQAVKEAFDEITKMLRLRKYYSKLEYDFRLC 504
Query: 271 TPLKNTTELKDGLDTVY 287
PLK L +D Y
Sbjct: 505 KPLK----LYSAMDKAY 517
>gi|326532846|dbj|BAJ89268.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 522
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 133/479 (27%), Positives = 197/479 (41%), Gaps = 81/479 (16%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP---IDDNIQLS 108
E + +QT+DHFS P ++ F QRY + N P+ + EA I +N
Sbjct: 56 EEHWMSQTLDHFS--PTDHRQFKQRYYEFLDYHRAPNGPVFLNICGEASCSGISNN---- 109
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
+ A +F A +V EHRYYGKS PF +N R + +S QAL+D A +
Sbjct: 110 -YLAVMAKKFGAALVSPEHRYYGKSSPF--EDLTTENLR---FLSSKQALSDLAVFRQYY 163
Query: 169 KDKYNATHAPVIA------IGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
++ NA + A G SY G L+ WFRLK+PH+ GSLASS +L + T
Sbjct: 164 QETLNAKYNRSGADNSWFVFGGSYSGALSAWFRLKFPHLTCGSLASSGVVLAVYNFT--- 220
Query: 223 TYHSVVSKDF-RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
DF R E K + EI + DG L + L L++
Sbjct: 221 --------DFDRQIGESAGPECKAALQEITRLV---DGQ--LQSGNNSVKELFGAKMLEN 267
Query: 282 GLDTVY--SEAA----QYDTPSNIPVKRICNAIENAPNCGDDILCKIA-------AGVVE 328
D +Y ++AA QY P +C+ + A G D++ A G +
Sbjct: 268 DGDFLYLLADAAAIAFQYGNP-----DVLCSPLVEAKKNGTDLVEAFAHYVNSYYVGRFK 322
Query: 329 ADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIK 388
A YD Y+ + S W +Q CSE+ KN + + ++
Sbjct: 323 ASVASYDQK----YLKNTTPAESSYRLWWYQVCSEVSYFQVAPKNDSVRSTKIDTRYHLD 378
Query: 389 NCKEQY--GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
CK + GV P S YYGG R + S I+F+NG +DP+ K+
Sbjct: 379 LCKNVFGEGVYPDVSMTNLYYGG------TRIAGSKIVFANGSQDPWRHASKQKSSEELP 432
Query: 447 LSFSQDLNLGSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
+ N G HC D+ D +K S P+ + + RK V + W+++ D
Sbjct: 433 SYLIECSNCG-HCTDISGCPQAPSNIEGDSSKCSSPEAVNKVRKQIVDHIDLWLSECQD 490
>gi|344299008|ref|XP_003421180.1| PREDICTED: thymus-specific serine protease, partial [Loxodonta
africana]
Length = 471
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 103/407 (25%), Positives = 180/407 (44%), Gaps = 49/407 (12%)
Query: 122 IVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAPVI 180
+V LEHR+YG S+P L A+ R + +S ALAD AS L + +N ++ +P I
Sbjct: 12 VVGLEHRFYGLSIPV----RGLDMAQLR-FLSSRHALADVASAHLALSRLFNVSSSSPWI 66
Query: 181 AIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT----S 236
G SY G LA W RLK+PH++ S+ASSAP+ D ++ Y++VVS+ +T S
Sbjct: 67 CFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD---FSEYNNVVSRSLMNTAIGGS 123
Query: 237 EECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDTVYSEAAQ 292
EC+ +++E + + + + A L + C L ++ EL + L + Q
Sbjct: 124 PECWSAASAAFAETERRLRAGGEAQAALRAELGACGRLSRAEDQAELLEALQALVGGTVQ 183
Query: 293 YDTPSNIP--VKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYI------- 343
Y+ + P V+++C + + G + G+ A + RC
Sbjct: 184 YNGQAGAPLSVRQLCGLLVGGADRGRTAPYR---GLRRAAQIVMHSLGQRCLSTSRAETV 240
Query: 344 -----NEDRTGDESDEGWEWQSCSEMVVPMG-KDKNSMYQPEPWNLTKYIKNCKEQYGVS 397
E + D W +Q+C+E + +D + P L + C++ +G+S
Sbjct: 241 AQLKDTEPQGSGVGDRQWLYQTCTEFGFYITCEDPRCPFSQLP-ALPSQLGLCEQVFGLS 299
Query: 398 PRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD 452
+ +YYGG + + ++F NG DP+ ++ F+ +
Sbjct: 300 ASSVAQAIAQTNSYYGGQTPR------ATQVLFVNGDADPWHVLSVTQSLGPFESAVL-- 351
Query: 453 LNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAIN 499
+ SHCLD+ + SD L R+ + +Q W+T + K +
Sbjct: 352 IPNASHCLDMATERPSDSPSLRLARQKIFQQLQTWLTLTKESQKHVG 398
>gi|326520219|dbj|BAK04034.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 117 bits (292), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 133/479 (27%), Positives = 198/479 (41%), Gaps = 81/479 (16%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP---IDDNIQLS 108
E + +QT+DHFS P ++ F QRY + N P+ + EA I +N
Sbjct: 56 EEHWMSQTLDHFS--PTDHRQFKQRYYEFLDYHRAPNGPVFLNICGEASCSGISNN---- 109
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
+ A +F A +V EHRYYGKS PF +N R + +S QAL+D A +
Sbjct: 110 -YLAVMAKKFGAALVSPEHRYYGKSSPF--EDLTTENLR---FLSSKQALSDLAVFRQYY 163
Query: 169 KDKYNATHAPVIA------IGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
++ NA + A G SY G L+ WFRLK+PH+ GSLASS +L + T
Sbjct: 164 QETLNAKYNRSGADNSWFVFGGSYSGALSAWFRLKFPHLTCGSLASSGVVLAVYNFT--- 220
Query: 223 TYHSVVSKDF-RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
DF R E K + EI + DG L + L L++
Sbjct: 221 --------DFDRQIGESAGPECKAALQEITRLV---DGQ--LQSGNNSVKELFGAKMLEN 267
Query: 282 GLDTVY--SEAA----QYDTPSNIPVKRICNAIENAPNCGDDILCKIA-------AGVVE 328
D +Y ++AA QY P +C+ + A G D++ A G +
Sbjct: 268 DGDFLYLLADAAAIAFQYGNP-----DVLCSPLVEAKKNGTDLVEAFAHYVNSYYVGRFK 322
Query: 329 ADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIK 388
A YD + Y+ + S W +Q CSE+ KN + + ++
Sbjct: 323 ASVASYD----QKYLKNTTPAESSYRLWWYQVCSEVSYFQVAPKNDSVRSTKIDTRYHLD 378
Query: 389 NCKEQY--GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
CK + GV P S YYGG R + S I+F+NG +DP+ K+
Sbjct: 379 LCKNVFGEGVYPDVSMTNLYYGG------TRIAGSKIVFANGSQDPWRHASKQKSSEELP 432
Query: 447 LSFSQDLNLGSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
+ N G HC D+ D +K S P+ + + RK V + W+++ D
Sbjct: 433 SYLIECSNCG-HCTDISGCPQAPSNIEGDSSKCSSPEAVNKVRKQIVDHIDLWLSECQD 490
>gi|357483671|ref|XP_003612122.1| Thymus-specific serine protease [Medicago truncatula]
gi|355513457|gb|AES95080.1| Thymus-specific serine protease [Medicago truncatula]
Length = 478
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 130/447 (29%), Positives = 198/447 (44%), Gaps = 69/447 (15%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
LL+ + F+ST A LR RL +R L + E ++ QT+DH+S P +
Sbjct: 6 LLVFLFFIST-VSATPHLLRRRLSE--SARYLTK-------EELWFPQTLDHYS--PYDH 53
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+ F QRY + + P+ + E D I+ + + A +F A +V LEHRYY
Sbjct: 54 RKFQQRYYEFLDHFRIPDGPVFLVICGEYSCD-GIR-NDYIGVLAKKFGAAVVSLEHRYY 111
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA----THA--PVIAIGA 184
GKS PF S A KN R Y +S QAL D A + +D NA T+A P G
Sbjct: 112 GKSSPFKS--LATKNLR---YLSSKQALFDLAVFRQNYQDSLNAKLNRTNADNPWFVFGV 166
Query: 185 SYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIK 244
SY G L+ WFRLK+PH+ GSLASSA +L + +T + + + EC ++
Sbjct: 167 SYPGALSAWFRLKFPHLTCGSLASSAVVL---AVYNFTEFDQQIGE---SAGVECKAALQ 220
Query: 245 KSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY--SEAA----QYDTPSN 298
++ I+ L K K N +L+ D +Y ++AA QY P
Sbjct: 221 ETTRLIER------KLVTNGKALKASF---NAADLEIDGDFLYFLADAAVTAFQYGNPD- 270
Query: 299 IPVKRICNAIENAPNCGDDILCKIAAGVVE-------ADSLEYDGNNSRCYINEDRTGDE 351
+C + A G+D++ A + E + +Y+ NN + N T +
Sbjct: 271 ----ILCKPLVKAKKDGEDLVDAYAKFIKEFYLGTEGESTQDYNQNNLK---NAAITENS 323
Query: 352 SDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWVLTYYGG 409
S W +Q C+E+ N + + ++ CK + G+ P YYGG
Sbjct: 324 SGRLWWFQVCTEVAYFQVAPSNDSIRSSKVDTRYHLDLCKNVFGEGIFPDVDATNIYYGG 383
Query: 410 HDIKLILRRSTSNIIFSNGMRDPFSRG 436
I + S I+F+NG +DP+ R
Sbjct: 384 TKI------AGSKIVFTNGSQDPWRRA 404
>gi|256574605|dbj|BAH98108.1| serine protease [Entamoeba invadens]
Length = 465
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 123/475 (25%), Positives = 207/475 (43%), Gaps = 59/475 (12%)
Query: 28 KLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG 87
K + RL ++ RS +E S ET Y+ +DHF+ + + F +Y ++ K+ G
Sbjct: 17 KHQHRLNKLARS--VETNTSE--IETHTYSVPMDHFNANND--EEFEVKYFVSEKYLDGT 70
Query: 88 N--SPILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALK 144
+ SP+ LG E P + + + A + L++ +EHR+YG S P +LK
Sbjct: 71 DLHSPLFVMLGGEGPESSKTLDNHYIIDTLAARTNGLMLAIEHRFYGDSTP------SLK 124
Query: 145 NARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIG 204
+ Y + QA+ DY I+ +I++ N PVI IG SY G LA W R KYP+VV G
Sbjct: 125 MDK-LIYCTAEQAMMDYIEIITYIQETRNFIDHPVIVIGGSYSGNLAAWMRQKYPNVVDG 183
Query: 205 SLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILS 264
+ ASSAP+ D + Y VV + + + W D + G L
Sbjct: 184 AWASSAPVEAQVD---FYQYLEVVQAGLPANTADLLSIAFEKW---DQMTVTESGRKELK 237
Query: 265 KKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIEN-----------APN 313
K F TCT +++ +T+ + A D+ N ++ + + N
Sbjct: 238 KVFNTCTDF-GEDDIQTFAETIGTALAGTDSICNEIIEDVNSKYPNFVINRMDPEWAGST 296
Query: 314 CGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSC-SEMVVPMGKDK 372
C L + G+++ +L DGN+ + + W +Q+C + + +K
Sbjct: 297 CTPSSLDESYKGLMDT-TLYKDGND-----------EAAGRSWVFQTCIAYGYYQVVSEK 344
Query: 373 NSMYQPEPWNLTKYIKNCKEQYGVSPRPSW-----VLTYYGGHDIKLILRRSTSNIIFSN 427
+S+ + L IK C + Y + + + + YGG + K+ +N+ F+N
Sbjct: 345 SSVKFGKLNKLDGSIKMCHDIYNIDNQTLYNAVDHINVRYGGKNPKV------TNVAFTN 398
Query: 428 GMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVK 482
G DP+ G V D + ++ SHC DL K++D L R E++
Sbjct: 399 GGTDPWHALG-VTQQEGQDGNLVNLIDRTSHCSDLYIEKETDVPALKLARHKELR 452
>gi|328872043|gb|EGG20413.1| peptidase S28 family protein [Dictyostelium fasciculatum]
Length = 466
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 126/464 (27%), Positives = 210/464 (45%), Gaps = 53/464 (11%)
Query: 45 KDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG---GGNSPILAFLGAEAPI 101
KDS+ F ++YNQT++HF + +TF QRY N ++ GG PI+ ++ E P+
Sbjct: 38 KDSS-SFPIYWYNQTLNHFD--AQDSRTFMQRYYTNDAYYDYSKGG--PIILYINGEGPV 92
Query: 102 DDN-IQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALAD 160
Q A A IV LEHRYYG S PF + +N + + +S QAL D
Sbjct: 93 SSPPYQQGDGVVVYAQALGAYIVTLEHRYYGDSSPF--EDLSTENLK---FLSSRQALND 147
Query: 161 YASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA---PILYYGD 217
A + + + + V+ IG SY G L+ WFR+KYPH+ +GS++SS IL +
Sbjct: 148 LAVFISDFRKNLSLS-TEVVTIGGSYSGALSAWFRVKYPHITVGSVSSSGVVNAILDFTA 206
Query: 218 ITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT 277
W Y + +EC +++ E++ + G A ++ L
Sbjct: 207 FDEWVAYAA---------GDECATAMRQVTQEVEQ--AYFGGQADEIRQIFNAETLVEDG 255
Query: 278 ELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA--AGVVEADSLEYD 335
+ L +E QY S ++C+ + +A N G ++ A V SL
Sbjct: 256 DFFFWLADSNAEGIQYGYHS-----QLCDPLVDAMNNGTSLIKTYALYTANVWTGSLGTP 310
Query: 336 GNNSRCYINEDRTGD---ESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKE 392
+ + ++ T D +D W +Q+C+E + + N+T + +C++
Sbjct: 311 AEYATAW-QQNTTHDINNSADRLWLYQTCTEFGYWQNAPAENSIRSSIVNMTYWRDHCEQ 369
Query: 393 QYGVSPRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWV-KTYHFFDLSF 449
+G++ P T YYGG+ + + +NIIF N +DP+SR + + Y ++
Sbjct: 370 VFGIALWPDVEATNEYYGGN------QTAGTNIIFVNSSQDPWSRASIITQQYPSEPVAM 423
Query: 450 SQDLNLGSHCLDL--DEAKKSD-PDWLVQQRKTEVKIMQGWITQ 490
N G HC D+ D D P+ L Q R+ ++ +Q W+ Q
Sbjct: 424 VTCGNCG-HCSDIRVDCPGGCDTPNNLAQVRQVTLQAIQSWLAQ 466
>gi|170068625|ref|XP_001868940.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167864603|gb|EDS27986.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 366
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 79/261 (30%), Positives = 128/261 (49%), Gaps = 19/261 (7%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS-PILAFLGAEAPIDDNIQLSGF 110
+T +++Q +DH P + T+ QRY +N ++ S P+ +G E G
Sbjct: 49 KTLWFDQLLDHND--PTNAATWKQRYYVNDAYFDDRTSGPVFLMIGGEGEATARWMNEGA 106
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
A + AL LEHR+YGKS P G A + GY S QALAD A + + +
Sbjct: 107 WIRYAKEHGALCFQLEHRFYGKSHPTGDLSTA-----NLGYLTSEQALADLAYFVEAMNE 161
Query: 171 KYNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
KY T IA G SY G LA W R KYP++V GS++SS P+L D + Y VV
Sbjct: 162 KYQLTAQNRWIAFGGSYPGSLAAWLREKYPYLVHGSVSSSGPLLAKID---FKEYFQVVV 218
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK----NTTELKDGLDT 285
+ S +C + ++ + +++ + G +++KFK C P++ N+ ++ +
Sbjct: 219 NSLQSYSPQCVEDVRNAMGQVETLLKHMIGQRSINEKFKLCDPVEKSISNSLDIASLFEA 278
Query: 286 V---YSEAAQYDTPSNIPVKR 303
V ++ QY+ + +P +R
Sbjct: 279 VAGNFAGVVQYNKDNILPHRR 299
>gi|341898957|gb|EGT54892.1| hypothetical protein CAEBREN_15465 [Caenorhabditis brenneri]
Length = 825
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 117/469 (24%), Positives = 201/469 (42%), Gaps = 66/469 (14%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ + + Q +DHF P + +T+ Q+Y N + NS I +G E P +N + + +
Sbjct: 54 LQVYNFTQKLDHFD--PYNTKTWNQKYFYNP-IYSRNNSIIFLMIGGEGP--ENGKWAAY 108
Query: 111 T----YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILL 166
+ A +F A + LEHR++G S P + + Y + QALAD A +
Sbjct: 109 PQVQYLQWAKEFGADVFDLEHRFFGDSWPIPDMETS-----SLRYLTTQQALADLAFFIE 163
Query: 167 HIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHS 226
+ +Y + + G SY G L+ WFR KYP + +GS+ASSAP+ D + Y
Sbjct: 164 SMNQQYGFKNPRWVTFGGSYPGSLSAWFRQKYPQLTVGSVASSAPVNLKLD---FYEYAM 220
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-KNTTELK----- 280
VV D + T C ++ ++++I ++ DG L+ F P NTT+L
Sbjct: 221 VVQDDLQLTDANCAPAVRDAFTQIQQLSLTVDGRNKLNNYFNLQPPFDANTTKLDINNFF 280
Query: 281 DGLDTVYSEAAQY-------DTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLE 333
L + QY T ++ VK++C + N D + ++ + + +E
Sbjct: 281 GNLFNTFQGMTQYTYDGQSNSTHTDKTVKKMCEIMTNTTE--PDKVKRVENLFLWFNQME 338
Query: 334 YDG-------NNSRCYINEDRTGD--------ESDEGWEWQSCSEM--VVPMGKDKNSMY 376
G N+ I + +GD + GW W C+E+ + + N
Sbjct: 339 PAGPDLSVMPNSYWDVIKQVGSGDLKVLGEDGAAARGWMWLCCNEIGFLQTTNQGNNVFG 398
Query: 377 QPEPWNLTKYIKNCKEQYGVSPRPSWVLT-------YYGGHDIKLILRRSTSNIIFSNGM 429
P NL +I C + +G S + ++ YYGG D + +N++ NG
Sbjct: 399 TGVPLNL--FIDMCTDMFGDSMKIKNIMAGNKKSQNYYGGADF-----YNATNVVLPNGS 451
Query: 430 RDPF-SRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQR 477
DP+ + G + K L + +N +HC D+ + +P L+ R
Sbjct: 452 LDPWHALGTYNKKEAQSQLPYL--INGTAHCGDMYASYDGEPQSLLAAR 498
>gi|390461155|ref|XP_002746166.2| PREDICTED: thymus-specific serine protease [Callithrix jacchus]
Length = 521
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 120/485 (24%), Positives = 190/485 (39%), Gaps = 85/485 (17%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W + PI LG E + + G
Sbjct: 59 WLEQLLDPFNV--SDRRSFLQRYWVNDQHWASQDGPIFLHLGGEGSLGPGSVMKGHPAAL 116
Query: 115 AHQFKALIVILEHRYYGKSVPFG-----------SRKAALKNA-----RHRGYFNSAQAL 158
A AL++ LEHR+YG SVP G SR A K++ R S L
Sbjct: 117 APACGALVISLEHRFYGLSVPAGGLDMAQLRFLSSRHAVGKSSGIPSDEDRPSLPSDPRL 176
Query: 159 ADYASILLHIKDKYN-ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGD 217
AD S L + +N ++ +P I G SY G LA W RLK V+G +L +
Sbjct: 177 ADVVSARLALSRLFNVSSSSPWICFGGSYAGSLAAWARLK----VLG-------LLRF-- 223
Query: 218 ITPWTTYHSVVSKDFRDT----SEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTP 272
P + SVVS+ + S EC + ++SE++ + + A L + C
Sbjct: 224 --PHLIFASVVSRSLKSAAIGGSMECRAAVSAAFSEVERRLRTGGAAQAALRAELSACGS 281
Query: 273 L---KNTTELKDGLDTVYSEAAQYDTPSNIPVK----------RICNAIENAPNCGDDIL 319
L ++ EL L + A QYD + P+ N + P CG
Sbjct: 282 LSRAEDQAELLGALQALVGGAVQYDGQAGAPLSVRQLCGLLLGGGANRSRSTPYCGLRRA 341
Query: 320 CKIAAGVVEADSLEYDGNNSRCYI--NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQ 377
+I + L + + + E + D W +Q+C+E + +
Sbjct: 342 VQIVTHSLGQKCLSFSRAETVAQLRSTEPQVSGVGDRQWLYQTCTEFGFYVTCENPRCPF 401
Query: 378 PEPWNLTKYIKNCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
+ L + C++ +G+SP + +YYGG + ++F NG DP
Sbjct: 402 SQLPALPSQLDLCEQVFGLSPLSVAQAVAQTNSYYGGQT------PGANQVLFVNGDTDP 455
Query: 433 FSRGGWVKTYHFFDLSFSQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKI 483
+H LS +Q L GSHCLD+ + SD L R+ +
Sbjct: 456 ---------WHV--LSVTQALGSSESALLIPSGSHCLDMAPERPSDSPSLRLGRRNIFQQ 504
Query: 484 MQGWI 488
+Q W+
Sbjct: 505 LQTWL 509
>gi|219130123|ref|XP_002185222.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403401|gb|EEC43354.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 283
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 75/259 (28%), Positives = 123/259 (47%), Gaps = 23/259 (8%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS-PILAFLGAEAPIDDNIQLSGFTYE 113
F+ Q ++HF T+ QRY + + F S PI + G E+P++ I +G +E
Sbjct: 5 FFQQALNHFDLPRGQSGTYQQRYCVYNDFMVNETSAPIFLYTGNESPLEQYINHTGLIWE 64
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA-----SILLHI 168
+A F A +V +EHRY G+S+P + + Y ++ QALAD+A + +
Sbjct: 65 SAEAFGAQVVFIEHRYEGQSLPSPFISSCM------AYSSTIQALADFARFVELKLFVDT 118
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP--WTTYHS 226
D PVIA G SYGG L+ W R+KYP+ + G++A SAPI + P +
Sbjct: 119 GDFSRLRRRPVIAFGGSYGGMLSAWLRMKYPNTIAGAIAGSAPIWGFPRNFPSKIDAAYR 178
Query: 227 VVSKDFRDT---------SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT 277
V+ + + + C + +W I +A +GL +L+ F+ C LK+
Sbjct: 179 VIQHGLQQSYPPTLKPLDNNHCATNLLATWPLISALAQHHEGLQLLTSSFRLCEVLKDGD 238
Query: 278 ELKDGLDTVYSEAAQYDTP 296
L D + + + A+ P
Sbjct: 239 TLIDWAQSPWFDLAEGSFP 257
>gi|168067182|ref|XP_001785503.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662877|gb|EDQ49678.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 483
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 117/405 (28%), Positives = 177/405 (43%), Gaps = 65/405 (16%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEA-----PIDDNIQ 106
+ ++ Q +DHFS + + F QRY ++ N+PI + E+ P D +
Sbjct: 44 QPVWFRQRLDHFS--SQDRREFQQRYYEFLDYFKDPNAPIFLRICGESTCSGIPNDYLLV 101
Query: 107 LSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASIL- 165
L A +F A +V LEHRYYG+S PF + N + Y +S QAL D AS
Sbjct: 102 L-------AKKFGAAVVSLEHRYYGESSPF--EELTTDNLK---YLSSKQALFDLASYRN 149
Query: 166 ---LHIKDKYNATHA--PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP 220
I K+N T P I G SY G L+ WFRLK+PH+V GSL+SS +L +
Sbjct: 150 FYQESINKKFNTTEKENPWIVFGVSYPGALSAWFRLKFPHLVRGSLSSSGVVL---AVHN 206
Query: 221 WTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKT---CTPLKNTT 277
+T + V+ C ++ E+D L S K K LKN
Sbjct: 207 YTAFDQQVAA---SAGPACANALRDVTQEVDK------ALTSNSHKIKALFGVEQLKNDG 257
Query: 278 ELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAA-------GVVEAD 330
+ + L +EA QY P +C + A + G +++ A G+ +
Sbjct: 258 DFRYLLADAAAEAFQYGNPD-----ILCLPLVAAYSSGQNVVAAYAEFVKLFFFGIFGVN 312
Query: 331 SLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMV-VPMGKDKNSMYQPEPWNLTKYIKN 389
+ YD + + + TGD W +Q C+E+ + +NS+ P N ++
Sbjct: 313 PISYDQEHLKLTKSGPDTGDRQ---WWYQVCTEVAYFQVAPSQNSIRSPG-VNEKYHLDL 368
Query: 390 CKEQYGVSPRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDP 432
C +G P +T YYGG I + SNI+F+NG +DP
Sbjct: 369 CANVFGNGTYPEVDITNLYYGGSGI------TASNIVFTNGSQDP 407
>gi|308489478|ref|XP_003106932.1| hypothetical protein CRE_17219 [Caenorhabditis remanei]
gi|308252820|gb|EFO96772.1| hypothetical protein CRE_17219 [Caenorhabditis remanei]
Length = 541
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 121/495 (24%), Positives = 198/495 (40%), Gaps = 70/495 (14%)
Query: 23 HANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSK 82
H N L + G + + L+ K+ + Q +DHF P + +T+ Q+Y N K
Sbjct: 35 HLNQLTVTSNAGVLDQYPWLQVKN---------FTQKLDHFD--PYNTKTWNQKYFYNPK 83
Query: 83 FWGGGNSPILAFLGAEAP------IDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPF 136
F NS I +G E P + ++Q + A +F A + LEHR++G S P
Sbjct: 84 F-SRNNSIIFLMIGGEGPENGKWAANPDVQY----LQWAAEFGADVFDLEHRFFGDSWPI 138
Query: 137 GSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRL 196
Y + QALAD A + + Y + + G SY G L+ WFR
Sbjct: 139 PDMTT-----NSLRYLTTQQALADLAYFIESMNQLYGFKNPRWVTFGGSYPGSLSAWFRQ 193
Query: 197 KYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
KYP + +GS+ASSAP+ D + Y VV D + T C +K ++++I ++
Sbjct: 194 KYPQLTVGSVASSAPVNLKLD---FYEYAMVVEDDLKLTDVNCAPAVKDAFTKIQKLSLT 250
Query: 257 PDGLAILSKKFKTCTPLKNTTELKD------GLDTVYSEAAQY-------DTPSNIPVKR 303
+G L+ F P T D L + QY T S+ V++
Sbjct: 251 AEGRNQLNGYFNLQPPFDGKTTKLDINNFFGNLFNTFQGMTQYTYDGQSNSTHSDKTVRK 310
Query: 304 ICNAIENA--PNCGDDI----LCKIAAGVVEADSLEYDGNNSRCYINEDRTGD------- 350
+C + N P+ + L + D L N+ I + +GD
Sbjct: 311 MCQIMTNTSEPDTVKRVENLFLWFNVMEPADPDHLTVMPNSYWDVIKQVGSGDLKVLGPD 370
Query: 351 -ESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT---- 405
+ GW W C+E+ ++ + L +I C + +G + + ++
Sbjct: 371 GAAARGWMWLCCNEIGFLQTTNQGNNVFGSGVTLNLFIDMCTDMFGDTMKIKQIMAGNKK 430
Query: 406 ---YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL 462
YYGG D + +N++ NG DP+ G KT L +N +HC D+
Sbjct: 431 SQNYYGGADF-----YNATNVVLPNGSLDPWHALGTYKTVESQAL-LPYLINGTAHCGDM 484
Query: 463 DEAKKSDPDWLVQQR 477
+ +P+ L R
Sbjct: 485 YPSYDGEPNSLPAAR 499
>gi|159470599|ref|XP_001693444.1| predicted protein [Chlamydomonas reinhardtii]
gi|158282947|gb|EDP08698.1| predicted protein [Chlamydomonas reinhardtii]
Length = 264
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 114/209 (54%), Gaps = 16/209 (7%)
Query: 90 PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHR 149
PI + G EA ++ + ++G +ENA F ALI+ +EHRYYGK+ PFG + +
Sbjct: 2 PIFFYAGNEANVELYVNVTGLIWENAQAFGALIIFVEHRYYGKTQPFGPDSWQVDPS--- 58
Query: 150 GYFNSAQALADYASILLHIK---DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSL 206
Y QALADYA++L H+K A +PVIA G SYGG L+ W R+KYPH++ G++
Sbjct: 59 -YLTVEQALADYAALLWHLKADSPAGGAADSPVIAFGGSYGGMLSAWMRVKYPHIIAGAV 117
Query: 207 ASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKK 266
A+SAP+ + + TY + + ++ EC ++ ++ + ++ +G A LS+
Sbjct: 118 AASAPVAAFPGL---VTYDATPAA---GSAPECVTNVRLAFGNLRQLSRFAEGRAALSQL 171
Query: 267 FKTCTPLKNTTELKDGLDTVYSEAAQYDT 295
+ C PL + E LD Y +D
Sbjct: 172 LRLCKPLADEGE---ALDAAYWLQGAFDA 197
>gi|324510461|gb|ADY44374.1| Serine protease [Ascaris suum]
Length = 529
Score = 115 bits (289), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 125/465 (26%), Positives = 188/465 (40%), Gaps = 59/465 (12%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN-IQLSG 109
ETF+ Q IDH + T+ QR N +F+ S I +G E+P + +
Sbjct: 36 IETFYVTQPIDHSN---PPLGTWQQRVQYNPRFYRN-ESIIFLLIGGESPAAEKWVAQPN 91
Query: 110 FTYEN-AHQFKALIVILEHRYYGKSVPFGSRK-AALKNARHRGYFNSAQALADYASILLH 167
TY A ++ A + LEHR++GKS P+ K ++LK Y QAL D AS +
Sbjct: 92 ITYLRWAEKYGAAVFQLEHRFFGKSRPYNDLKTSSLK------YCTVDQALEDLASFIRQ 145
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+ KY + + G SY G L+ WF+++YP + +G++ASSAP+ + D + Y V
Sbjct: 146 MNAKYGYVNPRWVTFGGSYPGSLSAWFQVRYPDLTVGAVASSAPLTFLLD---YYGYAMV 202
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT---------- 277
+ R+TS EC + I + + I N A G LS K K TT
Sbjct: 203 MENVIRNTSAECHEKIGNAITVILNKALTVAGREELSTKLKLKPAFNETTLTVRDLHNMM 262
Query: 278 -ELKDGLDTVYS---EAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLE 333
L GL V +A T V+ +CNAI ++ + I + +
Sbjct: 263 AYLFGGLQNVVQYTYDARNSITMGGFNVRNMCNAITSSTSTDPVIQMRAIIDWIYTFYPS 322
Query: 334 YDGNNSRCYIN----------EDRTGDESD--EGWEWQSCSEMVVPMGKDKNSMYQPEPW 381
DG + Y + +D G E+ GW W C+E+ V D
Sbjct: 323 DDGTIANRYSDLIGLLSNTTFDDENGSENAAMRGWMWLCCNELGVLQTTDHGRNIFGNML 382
Query: 382 NLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRR--------STSNIIFSNGMRDPF 433
L +I C + +G + + D L R NI+ NG DP+
Sbjct: 383 PLNYFIDICIDAFGDT------VNIVSIRDNNLAFRNRYGDANNYKAKNIVLPNGSFDPW 436
Query: 434 SRGGWVKTYHFFDLSFSQDLNLGS-HCLDLDEAKKSDPDWLVQQR 477
G + Y +L L G+ HC D+ A +P L R
Sbjct: 437 HPLGTYENYP--ELHQKAILIEGTAHCADMYPAWSEEPSTLAPVR 479
>gi|388454936|ref|ZP_10137231.1| serine carboxypeptidase [Fluoribacter dumoffii Tex-KL]
Length = 467
Score = 115 bits (289), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 116/458 (25%), Positives = 204/458 (44%), Gaps = 62/458 (13%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
++ Q IDH + G + F QRY I+ + ++P+ ++ EA L+G
Sbjct: 49 YFKQLIDHNNPGTGN---FYQRYYIDESYGPEMDAPVFFYICGEAACSKR-ALNGAIRNY 104
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A +F A +V LEHRYYG S+PF + + H + + AL D A+ H+K++ N
Sbjct: 105 AQKFHAKLVALEHRYYGDSLPFNTL-----STEHLRFLTTEAALDDLAAFQRHLKNERN- 158
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
+ +A G SY G L+ ++RLK+P++V+G+LASSAP++ D + + + V+
Sbjct: 159 WNGKWVAFGGSYPGSLSAYYRLKFPYLVVGALASSAPVMAKEDFIEYDAHVTQVA----- 213
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY------S 288
+C ++++ +E++ AS D + K+K L + + D +D +Y +
Sbjct: 214 -GLKCAAQMREAVNEVE--ASLSD-----AAKWKEMKELFEASAVDDPVDFLYLIADTGA 265
Query: 289 EAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRT 348
A QY C + +P + A + +A + ++ ++E+
Sbjct: 266 AAVQYGMRD-----EFCTRLATSPTPLQGY-AEFAKNLYKAMHINAVEMTAQGAMSENPA 319
Query: 349 GDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEP--WNLTKYIKNCKEQYGVSPRPSWV 403
+ G W +QSC E N + NL + K C+ +G++ +P V
Sbjct: 320 AYKDGLGMRQWYYQSCKEYGYWQNAHPNPAFSTRSSLINLDYHHKICERLFGLT-QP--V 376
Query: 404 LTYYGGHDIKLILRRS-TSNIIFSNGMRDPFSRGGWVK----------TYHFFDLSFSQD 452
T + + + L + TSNI F+NG DP+S + TYH
Sbjct: 377 NTEEINNTLYIPLMDTLTSNIYFTNGENDPWSTLSLAEKNGNAINPKLTYHL-------- 428
Query: 453 LNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+ +HC DL D D L + RKT ++ W+ +
Sbjct: 429 IQGAAHCDDLHSPSAIDSDSLREARKTMEILLANWLKK 466
>gi|24648177|ref|NP_650803.1| CG18493 [Drosophila melanogaster]
gi|23171717|gb|AAF55663.2| CG18493 [Drosophila melanogaster]
Length = 480
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 111/452 (24%), Positives = 202/452 (44%), Gaps = 52/452 (11%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +D+F+ + QT+ RY++N +F G SPI +LG E I++++ +G Y+ A +
Sbjct: 61 QKLDNFN--ASNTQTYQMRYLLNDEFQTEG-SPIFIYLGGEWEIEESMVSAGHWYDMAQE 117
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATH 176
++V EHRYYG+S+P + + Y + QALAD A + K + +
Sbjct: 118 HNGVLVYTEHRYYGQSIP-----TSTMSTEDLKYLDVKQALADVAVFIETFKAENPQLAN 172
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT- 235
+ VI G SY + WF+ YP +++G ASSAPIL D T Y VV + F
Sbjct: 173 SKVILAGGSYSATMVVWFKRLYPDLIVGGWASSAPILAKVDFTE---YKEVVGQAFLQLG 229
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK-----DGLDTVYSEA 290
++C+ I+ +E++++ + G A + C + +L + +++
Sbjct: 230 GQKCYDRIENGIAELESMFANKRG-AEARAMLRLCNSFDDQNDLDLWTLFSSISNIFAGV 288
Query: 291 AQYDTPSNIPVKRICNAIENAPNCGDDILCKI--AAGVVEADSLEYDGNNSRCYINEDRT 348
AQY +I + C+ + + + I + A G+ Y+G+ D
Sbjct: 289 AQYQGTGDI--EYYCDYLLSFNDDATAIANFVYWAWGMGNCIDARYEGSVEYYLWGVDHF 346
Query: 349 GDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNL----TKYIKNCKE----QYG---VS 397
++ W +Q+C+E G ++S + +P+ T YI C + QYG ++
Sbjct: 347 --DASRPWYYQTCNE----YGWYQSSGSRNQPFGTKFPATLYINLCGDVFSSQYGNEQIN 400
Query: 398 PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGS 457
+ Y+GG + NI ++G DP++ G H + + N S
Sbjct: 401 NNAASTNEYFGG------MEPGVDNIYMTHGELDPWNPMG-----HGVEQGATVIAN-AS 448
Query: 458 HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
HC D K +D D + ++ ++++ W+
Sbjct: 449 HCSDFGSIKSTDSDEMRASKEKLAELVRQWLA 480
>gi|218184874|gb|EEC67301.1| hypothetical protein OsI_34292 [Oryza sativa Indica Group]
Length = 550
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 125/467 (26%), Positives = 192/467 (41%), Gaps = 63/467 (13%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E + +Q +DHFS P ++ F QRY + + GG L G + N + +
Sbjct: 50 EERWMDQRLDHFS--PTDHRQFKQRYYEFADYHAGGGPVFLRICGESSC---NGIPNDYL 104
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
A +F A +V EHRYYGKS PF S +N R + +S QAL D A+ H ++
Sbjct: 105 AVLAKKFGAAVVTPEHRYYGKSSPFES--LTTENLR---FLSSKQALFDLAAFRQHYQEI 159
Query: 172 YNATHA-------PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTY 224
NA + P G SY G L+ WFRLK+PH+ GSLASS +L + T +
Sbjct: 160 LNARYNRSSGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGVVLAVYNFTDF--- 216
Query: 225 HSVVSKDFRDTS-EECFQTIKKSWSEIDNIASKPDGLAILSKKFKT---CTPLKNTTELK 280
K D++ EC +++ +D + L + S+ K LKN +
Sbjct: 217 ----DKQVGDSAGPECKAALQEVTRLVD------EQLRLDSRSVKALFGAEKLKNDGDFL 266
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS- 339
L + QY +P +C+ + NA G ++ A V + + S
Sbjct: 267 FFLADAAAIGFQYGSPDA-----VCSPLINAKKTGRSLVETYAQYVQDFFIRRWGTTVSS 321
Query: 340 --RCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVS 397
+ Y+ D S W +Q CSE+ KN + N ++ C+ +G
Sbjct: 322 YDQEYLKNTTPDDTSSRLWWFQVCSEVAYFQVAPKNDSIRSTEINTGYHLDLCRNVFGEG 381
Query: 398 PRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
P +T YYGG R + S I+F+NG +DP+ K+ + + N
Sbjct: 382 VYPDVFMTNLYYGG------TRIAASKIVFTNGSQDPWRHASKQKSSKYMPSYIIKCRNC 435
Query: 456 GSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
G H DL D + S P + RK + W++Q
Sbjct: 436 G-HGTDLRGCPQLPFRIEGDPSNCSSPAAVSTVRKQIASHISLWLSQ 481
>gi|19528413|gb|AAL90321.1| RE11624p [Drosophila melanogaster]
Length = 480
Score = 115 bits (288), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 111/452 (24%), Positives = 202/452 (44%), Gaps = 52/452 (11%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +D+F+ + QT+ RY++N +F G SPI +LG E I++++ +G Y+ A +
Sbjct: 61 QKLDNFN--ASNTQTYQMRYLLNDEFQTEG-SPIFIYLGGEWEIEESMVSAGHWYDMAQE 117
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATH 176
++V EHRYYG+S+P + + Y + QALAD A + K + +
Sbjct: 118 HNGVLVYTEHRYYGQSIP-----TSTMSTEDLKYLDVKQALADVAVFIETFKAENPQLAN 172
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT- 235
+ VI G SY + WF+ YP +++G ASSAPIL D T Y VV + F
Sbjct: 173 SKVILAGGSYSATMVVWFKRLYPDLIVGGWASSAPILAKVDFTE---YKEVVGQAFLQLG 229
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK-----DGLDTVYSEA 290
++C+ I+ +E++++ + G A + C + +L + +++
Sbjct: 230 GQKCYDRIENGIAELESMFANKRG-AEARAMLRLCNSFDDQNDLDLWTLFSSISNIFAGV 288
Query: 291 AQYDTPSNIPVKRICNAIENAPNCGDDILCKI--AAGVVEADSLEYDGNNSRCYINEDRT 348
AQY +I + C+ + + + I + A G+ Y+G+ D
Sbjct: 289 AQYQGTGDI--EYYCDYLLSFNDDATAIANFVYWAWGMGNCIDARYEGSVEYYLWGVDHF 346
Query: 349 GDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNL----TKYIKNCKE----QYG---VS 397
++ W +Q+C+E G ++S + +P+ T YI C + QYG ++
Sbjct: 347 --DASRPWYYQTCNE----YGWCQSSGSRNQPFGTKFPATLYINLCGDVFSSQYGNEQIN 400
Query: 398 PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGS 457
+ Y+GG + NI ++G DP++ G H + + N S
Sbjct: 401 NNTASTNEYFGG------MEPGVDNIYMTHGELDPWNPMG-----HGVEQGATVIAN-AS 448
Query: 458 HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
HC D K +D D + ++ ++++ W+
Sbjct: 449 HCSDFGSIKSTDSDEMRASKEILAELVRQWLA 480
>gi|157132186|ref|XP_001662504.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108871255|gb|EAT35480.1| AAEL012351-PA [Aedes aegypti]
Length = 478
Score = 115 bits (288), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 109/451 (24%), Positives = 198/451 (43%), Gaps = 42/451 (9%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +D+F P++ T+ RY+ N + + G P+ FLG E I + G Y+ A +
Sbjct: 51 QYVDNFD--PQNPSTWSMRYIQNGEHYQPG-GPLFIFLGGEWEISPGYVMYGHFYDMAKE 107
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATH 176
A + EHRYYG+S P S ++ L + N QALAD A + ++ A +
Sbjct: 108 LGAHLFYTEHRYYGQSRPTASTRSDLLK-----FLNIDQALADLAHFVEEMRRAIPGAEN 162
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT- 235
+ VI G SY + WFR KYPH+V G ASSAP+L D + Y VVS+ R
Sbjct: 163 SKVIMAGGSYSATMVAWFRQKYPHLVDGGWASSAPLLAKLD---FVEYKEVVSESIRLVG 219
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL-----DTVYSEA 290
+ C I++++ +I++ ++ + + ++FK C + L + ++
Sbjct: 220 GDACADRIERAYEQIEDHLAREE-FDKVREEFKVCNNINFANSLDSAMFLSSISDYFAGV 278
Query: 291 AQYDTPSNIPVKRICNAI--ENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINED-- 346
QY +P +I + +C I ++ N + + GV + + YD + R Y + D
Sbjct: 279 VQYHSPGDI--EGVCEIIMDDSIENDMEALANWFIRGVNQCMDMTYD-STIRYYRSIDWN 335
Query: 347 -RTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV-- 403
+ W +Q+C+E + + + Y++ C + Y P+ +
Sbjct: 336 HGANRGAMRPWLYQTCAEYGWYQTSGSENQIFGSGFPVDLYVRMCYDLYDYIFYPARLDA 395
Query: 404 -----LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
T YG + +N+ F+ G DP+ G + + D S + + + SH
Sbjct: 396 NIKRTNTIYGH------MNPEVTNVFFTQGQLDPWRPMGLQEDLN--DQSPTVVIPMASH 447
Query: 459 CLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
D+ D ++ ++ ++++ WI+
Sbjct: 448 VADMGSISDRDSPEMLAAKERVFELIKQWIS 478
>gi|198455509|ref|XP_001360028.2| GA17653 [Drosophila pseudoobscura pseudoobscura]
gi|198133275|gb|EAL29180.2| GA17653 [Drosophila pseudoobscura pseudoobscura]
Length = 485
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 120/492 (24%), Positives = 206/492 (41%), Gaps = 57/492 (11%)
Query: 27 LKLRPR-------LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVI 79
L+L+P L +RR+ LE+ ET ++ +D+F+ + T+ R +I
Sbjct: 23 LELQPEPNAFVQSLRELRRAPPLEKSRKRANVETRWFTLKLDNFNAANNA--TWKDRVLI 80
Query: 80 NSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSR 139
N + G SPI +LG E I+ + SG + A + ++ EHR++GKS P
Sbjct: 81 NEDHFTDG-SPIFIYLGGEWEIEPSAITSGLWVDIAKEHNGSLIYTEHRFFGKSFPI--T 137
Query: 140 KAALKNARHRGYFNSAQALADYASIL--LHIKDKYNATHAPVIAIGASYGGELATWFRLK 197
+ KN +++ + QALAD I+ L ++DKY + VI G SY +ATW R
Sbjct: 138 PLSTKNLKYQ---SVQQALADVVHIIKTLKLEDKYK--DSKVIVSGCSYSATMATWIRKL 192
Query: 198 YPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE-CFQTIKKSWSEIDNIASK 256
YP +++GS ASSAP+ D Y VV + F + C+ I + S ++
Sbjct: 193 YPDIILGSWASSAPLEAKVDF---KDYMEVVGQAFEQLGGKYCYDLIDNATSYYQDLFEG 249
Query: 257 PDGLAILSKKFKTCTPL-----KNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENA 311
G A K+ C ++ ++ + V++ AQY P N + + C+ + +
Sbjct: 250 GQG-AKAKKELNLCANFNVNSKQDRWQIFSTIANVFAGLAQYQKPGNYDLPKYCSVLRSF 308
Query: 312 PNCGDDILCKIAA---GVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPM 368
+ + L K G S+ Y G + + D+S W +Q+CSE
Sbjct: 309 SDDDAEALSKFVQWRLGYPTCLSVTYKGTVNYYKWAKINYEDDSGLPWIYQTCSEFGWYQ 368
Query: 369 GKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTS 421
D + + T Y C + + + + + G DI +
Sbjct: 369 SSDSENQPFGSSFPATLYTDTCHDVFSKNYTLINIEANIAATNKDFQGIDI------AVK 422
Query: 422 NIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRK 478
N+ ++ G DP+S+ G +Q + SHC DLD +D L +
Sbjct: 423 NVYWTQGGLDPWSKVG---------AGIAQGATIIPQASHCSDLDSISANDSPELQASKL 473
Query: 479 TEVKIMQGWITQ 490
++++ W+ Q
Sbjct: 474 KLAQLVRDWLAQ 485
>gi|268576509|ref|XP_002643234.1| Hypothetical protein CBG08099 [Caenorhabditis briggsae]
Length = 540
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 118/469 (25%), Positives = 194/469 (41%), Gaps = 76/469 (16%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD------NIQLSG 109
+ Q +DHF + +T+ Q+Y N K+ NS I +G E P + +Q
Sbjct: 69 FTQKLDHFDR--YNTKTWNQKYFYNPKY-SRNNSIIFLMIGGEGPENGRWAAKPEVQY-- 123
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
+ A +F A + LEHR++G S P + + Y + QALAD A + +
Sbjct: 124 --LQWASEFGADVFDLEHRFFGDSWPISDMETS-----SLQYLTTQQALADLAYFIESMN 176
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
KY + + G SY G L+ WFR KYP + +GS+ASSAP+ D + Y VV
Sbjct: 177 QKYGFKNPRWVTFGGSYPGSLSAWFRQKYPELTVGSVASSAPVNLKLD---FYEYAMVVE 233
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD------GL 283
D + T +C ++ ++++I ++ +G L+ F P T D L
Sbjct: 234 DDLKLTDPQCAPAVRDAFTKIQQMSLTAEGRNSLNTYFNLQPPFDAKTTKLDINNFFGNL 293
Query: 284 DTVYSEAAQY-------DTPSNIPVKRICNAIENA--PN-----------------CGDD 317
+ QY T S+ V+++C + NA PN G D
Sbjct: 294 FNTFQGMTQYTYDGQSNSTHSDKTVRKMCQIMTNATEPNTVKRVENLFLWFNQMEPAGPD 353
Query: 318 ILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEM--VVPMGKDKNSM 375
+ + D + G+ + + E G + GW W C+E+ + + N+
Sbjct: 354 LSVMPNS---YWDVIAQVGSGNLTVLGE---GGAAARGWMWLCCNEIGFLQTTNQGNNAF 407
Query: 376 YQPEPWNLTKYIKNCKEQYGVSPRPSWVLT-------YYGGHDIKLILRRSTSNIIFSNG 428
P NL +I C + +G S + +++ YYGG D + +N++ NG
Sbjct: 408 GTGVPLNL--FIDMCTDMFGDSMKIKRIMSGNKLSQNYYGGADF-----YNATNVVLPNG 460
Query: 429 MRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQR 477
DP+ G KT +L +N +HC D+ + +P L R
Sbjct: 461 SLDPWHALGTYKTNDNQNL-LPYLINGTAHCGDMYASYDGEPASLPAAR 508
>gi|413933853|gb|AFW68404.1| putative serine peptidase S28 family protein [Zea mays]
Length = 478
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 112/390 (28%), Positives = 160/390 (41%), Gaps = 46/390 (11%)
Query: 57 NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD--NIQLSGFTYEN 114
NQ +DHFS P ++ F QRY + P+ + E+ D N L+
Sbjct: 43 NQRLDHFS--PTDHRQFKQRYFEFLDYHRAPGGPVFLRICGESACDGIPNDYLAVL---- 96
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A +F A +V EHRYYGKS PF ++ +N R + +S QAL D A + ++ NA
Sbjct: 97 AKKFGAAVVTPEHRYYGKSSPF--KQLTTENLR---FLSSKQALFDLAVFRQYYQESLNA 151
Query: 175 THA------PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
+ P IG SY G L+ WFRLK+PH+ GSLASS +L + T
Sbjct: 152 RYNRSGFDNPWFVIGVSYSGALSAWFRLKFPHLTCGSLASSGVVLAVYNYT--------- 202
Query: 229 SKDF-RDTSEECFQTIKKSWSEIDNIASKPDGLAILS-KKFKTCTPLKNTTELKDGLDTV 286
DF + E K EI + + L S K LKN + L
Sbjct: 203 --DFDKQVGESAGPECKAVLQEITELVDEQLRLESHSVKALFGAQTLKNDGDFLFFLADA 260
Query: 287 YSEAAQYDTPSNI--PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN 344
+ QY P + P+ + +N + +E YD R Y+
Sbjct: 261 AATTFQYGNPDALCPPLIKAKKNRKNLVEAYAQFVKDYYIKKMETPPSSYD----REYLK 316
Query: 345 EDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL 404
E D S W +Q CSE+ KN + N ++ C+ +G P +
Sbjct: 317 ETTPHDSSSRLWWFQVCSEVAYFQVAPKNDSVRSARINTRYHLDLCRHVFGEGVYPDVFM 376
Query: 405 T--YYGGHDIKLILRRSTSNIIFSNGMRDP 432
T YYGG I + S I+F+NG +DP
Sbjct: 377 TNLYYGGTRI------AASKIVFTNGSQDP 400
>gi|154413074|ref|XP_001579568.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121913776|gb|EAY18582.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 518
Score = 115 bits (287), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 115/444 (25%), Positives = 189/444 (42%), Gaps = 27/444 (6%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP-ILAFLGAEAPIDDNIQLSGF 110
+T++++Q +DHF+ +TF QRY N F + ++ F+G EA I + G
Sbjct: 20 DTYYFDQFLDHFNTSDN--RTFKQRYYYNDTFCQNTTTKKLIVFIGGEAAITERRVQKGA 77
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASIL-LHIK 169
+ A + + +V LEHRY+G+S PF + N + Y S QALAD A + IK
Sbjct: 78 YMKLAKETDSCVVALEHRYFGESQPF--EELITPNLK---YLTSDQALADLAYFIESFIK 132
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
KY + ++ +G SY G L+++FR+KYPH+ S ASS P+ D W Y + +
Sbjct: 133 IKYQS-RPTILVVGGSYPGTLSSYFRMKYPHIADFSWASSPPLYVKNDF--W-EYDAHCA 188
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
+ S +C K + D+ PD + + + + + L D + +
Sbjct: 189 EVLGKISPKCLTNTKLIY---DDFNDHPDHITNYIPFKPSVSHVSQLSILSDFIAGI--- 242
Query: 290 AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGN-NSRCYINEDRT 348
QYD + N ++PN D +E + +E + + N
Sbjct: 243 -VQYDNIYKLVTPYCENQNGDSPNY--DSFHDYFYKYLEVEGVEDPSDLDDFALTNHSIH 299
Query: 349 GDESDE-GWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYY 407
D +D W W +C+E + + +P +L C+ +GV P
Sbjct: 300 TDYADSLSWTWMTCNEF--GWFQTASGQLRPAKVDLNYSDLVCRTYFGVGISPDIDNNRS 357
Query: 408 GGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS-FSQDLNLGSHCLDLDEAK 466
DI +T+ I FSNG DP+S + + +S +N SHC DL +
Sbjct: 358 AKMDIYNAQNPATTMIYFSNGKTDPWSVLSVSENVQNPPVGRYSVQINNASHCSDLGDEA 417
Query: 467 KSDPDWLVQQRKTEVKIMQGWITQ 490
+P+ L RK + M W+
Sbjct: 418 AGEPEALTVARKQIMDTMARWLNH 441
>gi|110289392|gb|ABB47879.2| prolyl carboxypeptidase like protein, putative, expressed [Oryza
sativa Japonica Group]
Length = 490
Score = 114 bits (286), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 124/474 (26%), Positives = 191/474 (40%), Gaps = 63/474 (13%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEA----PIDDNIQL 107
E + +Q +DHFS P ++ F QRY + + GG L G + P D L
Sbjct: 51 EERWMDQRLDHFS--PTDHRQFKQRYYEFADYHAGGGPVFLRICGESSCNGIPNDYLAVL 108
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
S +F A +V EHRYYGKS PF S +N R + +S QAL D + H
Sbjct: 109 S-------KKFGAAVVTPEHRYYGKSSPFES--LTTENLR---FLSSKQALFDLVAFRQH 156
Query: 168 IKDKYNATHA-------PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP 220
++ NA + P G SY G L+ WFRLK+PH+ GSLASS +L +
Sbjct: 157 YQEILNARYNRSSGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGVVL---AVYN 213
Query: 221 WTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK 280
+T + V + Q + + E + S+ + ++K LKN +
Sbjct: 214 FTDFDKQVGDSAGPECKAALQEVTRLVDEQLRLDSRSVKVLFGAEK------LKNDGDFL 267
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS- 339
L + QY +P +C+ + NA G ++ A V + + S
Sbjct: 268 FFLADAAAIGFQYGSPDA-----VCSPLINAKKTGRSLVETYAQYVQDFFIRRWGTTVSS 322
Query: 340 --RCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVS 397
+ Y+ D S W +Q CSE+ KN + N ++ C+ +G
Sbjct: 323 YDQEYLKNTTPDDTSSRLWWFQVCSEVAYFQVAPKNDSIRSTEINTGYHLDLCRNVFGEG 382
Query: 398 PRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
P +T YYGG I + S I+F+NG +DP+ K+ + + N
Sbjct: 383 VYPDVFMTNLYYGGTRI------AASKIVFTNGSQDPWRHASKQKSSKYMPSYIIKCRNC 436
Query: 456 GSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKA 497
G H DL D + S P + RK + W++Q + +A
Sbjct: 437 G-HGTDLRGCPQLPFRIEGDPSNCSSPAAVSTVRKQIASHISLWLSQCQEPTRA 489
>gi|24648175|ref|NP_650802.1| CG3734 [Drosophila melanogaster]
gi|19527569|gb|AAL89899.1| RE36938p [Drosophila melanogaster]
gi|23171716|gb|AAF55662.2| CG3734 [Drosophila melanogaster]
gi|220948338|gb|ACL86712.1| CG3734-PA [synthetic construct]
Length = 473
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 111/466 (23%), Positives = 202/466 (43%), Gaps = 68/466 (14%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+T + Q +DHF PE +T+ RY++N + G +P+ +LG E I G
Sbjct: 46 QTLWIEQKLDHFD--PEETRTWQMRYMLNDALYQSG-APLFIYLGGEWEISSGRITGGHL 102
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
Y+ A + AL+ EHRYYG+S P L N + Y N Q+LAD A + IK
Sbjct: 103 YDMAKEHNALLAYTEHRYYGQSKPLPD----LSNENIK-YLNVNQSLADLAYFINTIKQN 157
Query: 172 YNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ + + VI +G SY + TWF+ YP +V G ASSAP+L + Y + +
Sbjct: 158 HEGLSDSKVIIVGGSYSATMVTWFKKLYPDLVAGGWASSAPLLAK---VNFVEYKEITGQ 214
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
C++ I+ +E++ + + G A + K C P ++L + T++SE
Sbjct: 215 SIEQMGGSACYKRIENGIAEMETMIATKRG-AEVKALLKLCEPFDVYSDLD--VWTLFSE 271
Query: 290 -----AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-- 342
A T + ++ +C I G + L +A +++ ++ + +CY
Sbjct: 272 ISDIFAGVVQTHNAGQIEGVCEKI----MAGSNDLIGVAGYLLDV----FEESGGKCYDL 323
Query: 343 ---------INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN----LTKYIKN 389
++ + G+ + W +Q+C+E G + S +P+ +T Y
Sbjct: 324 SYDAITALLLDTNYNGNIMRQ-WIFQTCNE----YGWYQTSGSSAQPFGTKFPVTYYTTM 378
Query: 390 CKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
C + YG +S + S ++GG L + N+ ++G DP+ G
Sbjct: 379 CADLYGSEYSNEFISNQVSITNQFFGG------LFPNVENVYLTHGQLDPWRAMG----- 427
Query: 443 HFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
D + + + +HC D + SD + ++ ++++ W+
Sbjct: 428 -IQDETQATIIPEHAHCKDFNSISSSDTAEMRASKERIAELVREWV 472
>gi|170066899|ref|XP_001868268.1| thymus-specific serine protease [Culex quinquefasciatus]
gi|167863076|gb|EDS26459.1| thymus-specific serine protease [Culex quinquefasciatus]
Length = 485
Score = 114 bits (286), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 112/454 (24%), Positives = 204/454 (44%), Gaps = 48/454 (10%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +D+F P++ T+ RY+ N +++ G + + ++G E I++ + G ++ A +
Sbjct: 58 QKVDNFD--PQNPSTWSMRYMDNGEYYNPGGA-LFIYVGGEWTINEGSLVRGHFHDMARE 114
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATH 176
A I EHRYYG S P A + + R + N QALAD A + ++ A +
Sbjct: 115 LGAYIFYTEHRYYGLSRP----TANTRTDQMR-FLNVDQALADLAHFVEEMRRTIPGAEN 169
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT- 235
A VI G SY + WFR KYPH++ G+ ASSAP+L D +T Y VVS R
Sbjct: 170 AKVIMAGGSYSATMVAWFRQKYPHLINGAWASSAPLLAKLD---FTEYKEVVSDSIRLVG 226
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC--TPLKNTTELKDGLDTV---YSEA 290
+ C +++ +E++++ K +++ F C T L T + ++ L ++ ++
Sbjct: 227 GDACADRVQRGVAEVEDLI-KQGSYDQVAQAFNLCASTDLTKTLDKQNFLSSISDYFAGV 285
Query: 291 AQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-------I 343
QY P +I + +C I N P+ D+ EA + + ++RCY I
Sbjct: 286 VQYHWPGDI--EGVCEVI-NDPSYTTDM---------EALAGWFTSGSTRCYDASYDSMI 333
Query: 344 NEDRTGDESDEG-------WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGV 396
+ R+ D + W +Q+C+E + + + YI+ C + Y
Sbjct: 334 SYYRSTDWTHGANTGAMRPWFYQTCAEYGWYQTSGSENQIFGSGFPVELYIRMCADLYDY 393
Query: 397 S-PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
P + + I + +N+ F+ G DP+ G + + + S + + L
Sbjct: 394 KFPERLLHVNVARTNTIYGHMNPEVTNVFFTQGQLDPWRPMGLQEDLN--EHSPAVVIPL 451
Query: 456 GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
SHC DL +D + ++ ++++ W+
Sbjct: 452 ASHCADLSSISAADSPEMRAAKERVFELIKMWLA 485
>gi|344292458|ref|XP_003417944.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 522
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 75/238 (31%), Positives = 112/238 (47%), Gaps = 12/238 (5%)
Query: 37 RRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLG 96
RR+R Q + + F + Q +DHFS S Q +PQRY IN F+ G L G
Sbjct: 23 RRTRGAHQ--TKVMYPNFSFQQKLDHFS--ENSSQFWPQRYFINDAFYKPGGPVFLMVGG 78
Query: 97 AEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQ 156
+ + ++ A + AL ++LEHR+YG S P G A Y +S Q
Sbjct: 79 VWTASESWLSINKTWVTYAQRLGALFLLLEHRFYGYSQPTGDLSTA-----SLQYLSSRQ 133
Query: 157 ALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYG 216
ALAD A+ I +K T +A G S G LA W R+K+P + ++ SSAPI
Sbjct: 134 ALADIANFRTQIAEKMGLTENKWVAFGCSSAGSLAVWSRIKHPELFAAAVGSSAPIQAKA 193
Query: 217 DITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK 274
+ + Y +V + + CFQ +K+++ +I + + P L FK C P+K
Sbjct: 194 N---FYEYLEIVQRSLATHNSNCFQAVKEAFGQIVKMLNLPRYYGKLENDFKLCKPMK 248
Score = 43.9 bits (102), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 75/163 (46%), Gaps = 34/163 (20%)
Query: 343 INEDRTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPS 401
IN D G D + +Q C+E D KN + P L +++ C + +G P+ +
Sbjct: 379 INYDNPG--IDRQFFYQCCTEFGFFHTTDSKNQPFTGMP--LRYFVQQCSDFFG--PQFN 432
Query: 402 W------VLT---YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD 452
+ VL+ +YGG ++ + S IIFS+G DP+ G K S+D
Sbjct: 433 YDSLNMGVLSTNAHYGGFNV------TGSKIIFSSGSFDPWHVLGITK-------DISKD 479
Query: 453 LNL-----GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
L G HC D+ E K +D L+Q R+ +I++ W+ +
Sbjct: 480 LPAVFIKGGVHCADVFEQKDTDSAELIQAREKIFRILRKWLKK 522
>gi|357626528|gb|EHJ76588.1| hypothetical protein KGM_15000 [Danaus plexippus]
Length = 413
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 112/431 (25%), Positives = 188/431 (43%), Gaps = 45/431 (10%)
Query: 76 RYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVP 135
RY N +W N PI FLG E+ G +E A + + + + EHRYYG+S P
Sbjct: 2 RYFENVLYWQE-NGPIFVFLGGESASSPQWTRFGIIHELAKESQGAMYVTEHRYYGESKP 60
Query: 136 FGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFR 195
L Y +S QALAD A ++ ++K ++ V+ IG SY G LA W +
Sbjct: 61 -----KNLTKEDQFKYLSSRQALADIAKLIHYLKLLPMYKNSKVVVIGGSYAGNLAAWMK 115
Query: 196 LKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD-TSEECFQTIKKSWSEIDNIA 254
+ YP +V ++ASSAP+L D + Y V++D+ + C IK + +
Sbjct: 116 VLYPDLVDAAVASSAPVLAKKD---FFEYLEKVTEDYETYGTHGCSDKIKNIFDRFHQLL 172
Query: 255 SKPDGLAILSKKFKTC-----TPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIE 309
+G+ L K+ C + ++N + +++ +QY + I K+ C +
Sbjct: 173 QSSEGIKQLKKEENICDSCDMSVIENQAVFFEVKTSIFMSNSQYGSTKTI--KQHCEKLS 230
Query: 310 NAPNCGDDILCKIAAGVVEADSLE-YDGNNSRCYINEDRTGDESDEGWEWQSCSEM-VVP 367
+ + ++ ++ L YD + +R I ++ D D W +Q+C+E
Sbjct: 231 DVSYDTKSLTDNSMLPIIYSEKLNCYDYDFNRM-IQVMKSND--DLFWIYQTCTEFGYYQ 287
Query: 368 MGKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLILRRST 420
K +++ P L YIK C E +G V YGG L +
Sbjct: 288 TTNSKAQIFKNIP--LEFYIKICTEMFGNDFNETRVDQAVKNTNKLYGG------LNPNV 339
Query: 421 SNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD---LNLGSHCLDLDEAKKSDPDWLVQQR 477
+ ++FSNG DP+S G ++ LS+ + +HC DL + D + L + R
Sbjct: 340 TKVVFSNGNLDPWSTIGVLE-----GLSYDAPAVVIPRSTHCADLLPIFEPDNEELKEAR 394
Query: 478 KTEVKIMQGWI 488
K +++ WI
Sbjct: 395 KHIKYLIKKWI 405
>gi|226502825|ref|NP_001146300.1| uncharacterized protein LOC100279875 precursor [Zea mays]
gi|219886553|gb|ACL53651.1| unknown [Zea mays]
Length = 478
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 112/389 (28%), Positives = 161/389 (41%), Gaps = 44/389 (11%)
Query: 57 NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD--NIQLSGFTYEN 114
NQ +DHFS P ++ F QRY + P+ + E+ D N L+
Sbjct: 43 NQRLDHFS--PTDHRQFKQRYFEFLDYHRAPGGPVFLRICGESACDGIPNDYLAVL---- 96
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A +F A +V EHRYYGKS PF ++ +N R + +S QAL D A + ++ NA
Sbjct: 97 AKKFGAAVVTPEHRYYGKSSPF--KQLTTENLR---FLSSKQALFDLAVFRQYYQESLNA 151
Query: 175 THA------PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
+ P IG SY G L WFRLK+PH+ GSLASS +L + +T + V
Sbjct: 152 RYNRSGFDNPWFVIGVSYSGALNAWFRLKFPHLTCGSLASSGVVL---AVYNYTDFDKQV 208
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILS-KKFKTCTPLKNTTELKDGLDTVY 287
+ EC K EI + + L S K LKN + L
Sbjct: 209 GE---SAGPEC----KAVLQEITELVDEQLRLESHSVKALFGAQTLKNDGDFLFFLADAA 261
Query: 288 SEAAQYDTPSNI--PVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINE 345
+ QY P + P+ + +N + +E YD R Y+ E
Sbjct: 262 ATTFQYGNPDALCPPLIKAKKNRKNLVEAYAQFVKDYYIKKMETPPSSYD----REYLKE 317
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT 405
D S W +Q CSE+ KN + N ++ C+ +G P +T
Sbjct: 318 TTPHDSSSRLWWFQVCSEVAYFQVAPKNDSVRSARINTRYHLDLCRHVFGEGVYPDVFMT 377
Query: 406 --YYGGHDIKLILRRSTSNIIFSNGMRDP 432
YYGG I + S I+F+NG +DP
Sbjct: 378 NLYYGGTRI------AASKIVFTNGSQDP 400
>gi|145492429|ref|XP_001432212.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124399322|emb|CAK64815.1| unnamed protein product [Paramecium tetraurelia]
Length = 462
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 117/474 (24%), Positives = 194/474 (40%), Gaps = 56/474 (11%)
Query: 40 RILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAE- 98
+L Q ETF + Q +DH P + QT+ QRY + S+++ ++ ++ E
Sbjct: 18 HVLTQGPFTETKETFQFTQLLDHSD--PANTQTWQQRYHVYSQYFNPTKGGVILYICGEW 75
Query: 99 --APIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQ 156
+ DN +++ A A+++ LEHR+YG+S PFG+ +L+N Y N Q
Sbjct: 76 NCQGVGDN----SLSFQLAKDLGAIVIALEHRFYGQSQPFGADSWSLENL---SYLNVHQ 128
Query: 157 ALADYASILLHIK--DKYNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL 213
AL D A +L +K +N + P AIG SY G L+ WFR KYPH+ +G+LASS I
Sbjct: 129 ALDDLAYFILQMKRLKLHNIDSTLPWYAIGGSYPGALSAWFRYKYPHLTVGNLASSGVI- 187
Query: 214 YYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL 273
I + + + K + E+C ++ +D K F T
Sbjct: 188 --NTILDYWQFDDQIRKSTSKSGEQCPLYLQLLNGYVDKKL----------KNFNTKQAF 235
Query: 274 K---NTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEAD 330
K N ++ D + + R C +E+ P+ + + D
Sbjct: 236 KESYNCEKMTDNEFRWFWADTIVQMIQSGQRTRFCQTLESLPSIEAMAEYIKEIALDQGD 295
Query: 331 SLEYDGNNSRCYINEDRTGDESD--EGWEWQSCSEMVV----PMGKDKNSMYQPEPWNLT 384
S + G Y D T D++ W +Q CSE+ P K+ Y+ L
Sbjct: 296 SYKQYG----AYYLRDETVDQNSVIRQWYFQCCSELAYLQTPPQNKESLRSYEL---TLD 348
Query: 385 KYIKNCKEQYG----VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
+ C + Y + P Y+GG L + ++I +NG DP+ K
Sbjct: 349 WWRVWCNDAYSQGEVIWPDVRATEAYFGG------LNLNVDHLIMTNGGEDPWQTASLTK 402
Query: 441 TYHFFD--LSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+++ D + +HC+DL +P L R+ + W Q++
Sbjct: 403 ATKANSKVITYLIDCDDCAHCVDLGAPSAKEPAVLTSTRQAIKNTFKQWHDQFW 456
>gi|195158022|ref|XP_002019893.1| GL12648 [Drosophila persimilis]
gi|194116484|gb|EDW38527.1| GL12648 [Drosophila persimilis]
Length = 485
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 120/492 (24%), Positives = 205/492 (41%), Gaps = 57/492 (11%)
Query: 27 LKLRPR-------LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVI 79
L+L+P L +RR+ LE ET ++ +D+F+ + T+ R +I
Sbjct: 23 LELQPEPNAFVQSLRELRRAPPLETSRKRANVETRWFTLKLDNFNAANNA--TWKDRVLI 80
Query: 80 NSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSR 139
N + G SPI +LG E I+ + SG + A + ++ EHR++GKS P
Sbjct: 81 NEDHFTDG-SPIFIYLGGEWEIEPSAITSGLWVDIAKEHNGSLIYTEHRFFGKSFPI--T 137
Query: 140 KAALKNARHRGYFNSAQALADYASIL--LHIKDKYNATHAPVIAIGASYGGELATWFRLK 197
+ KN +++ + QALAD I+ L ++DKY + VI G SY +ATW R
Sbjct: 138 PLSTKNLKYQ---SVQQALADVVHIIKTLKLEDKYK--DSKVIVSGCSYSATMATWIRKL 192
Query: 198 YPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE-CFQTIKKSWSEIDNIASK 256
YP +++GS ASSAP+ D Y VV + F + C+ I + S ++
Sbjct: 193 YPDIILGSWASSAPLEAKVDF---KDYMEVVGQAFEQLGGKYCYDLIDNATSYYQDLFEG 249
Query: 257 PDGLAILSKKFKTCTPL-----KNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENA 311
G A K+ C ++ ++ + V++ AQY P N + + C+ + +
Sbjct: 250 GQG-AKAKKELNLCANFNVNSKQDRWQIFSTIANVFAGLAQYQKPGNYDLPKYCSVLRSF 308
Query: 312 PNCGDDILCKIAA---GVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPM 368
+ + L K G S+ Y G + + D+S W +Q+CSE
Sbjct: 309 SDDDAEALSKFVQWRLGYPTCLSVTYKGTVNYYKWAKINYEDDSGLPWIYQTCSEFGWYQ 368
Query: 369 GKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTS 421
D + + T Y C + + + + + G DI +
Sbjct: 369 SSDSENQPFGSSFPATLYTDTCHDVFSKNYTLINIEANIAATNKDFQGIDI------AVK 422
Query: 422 NIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD---LNLGSHCLDLDEAKKSDPDWLVQQRK 478
N+ ++ G DP+S+ G +Q + SHC DLD +D L +
Sbjct: 423 NVYWTQGGLDPWSKVG---------AGIAQGAIIIPQASHCSDLDSISANDSPELQASKL 473
Query: 479 TEVKIMQGWITQ 490
++++ W+ Q
Sbjct: 474 KLAQLVRDWLAQ 485
>gi|281346012|gb|EFB21596.1| hypothetical protein PANDA_016894 [Ailuropoda melanoleuca]
Length = 509
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 116/467 (24%), Positives = 200/467 (42%), Gaps = 52/467 (11%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W P+ LG E + + G
Sbjct: 54 WLEQPLDPFNT--SDQRSFLQRYWVNDQHWASQRGPVFLHLGGEGSLRSGSVMRGHPAAL 111
Query: 115 AHQFKALIVILEHRYYGKSVPF-GSRKAALK--NARHRGYFNSA-----QAL--ADYASI 164
A + AL++ LEHR+YG S+P G A L+ ++RH + Q+L AD AS
Sbjct: 112 APAWGALVIGLEHRFYGLSIPAEGLDVAQLRFLSSRHAECAGTPSEEGPQSLPSADVASA 171
Query: 165 LLHIKDKYN-ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+ +N +T +P I G SY G LA W RLK+PH+++ S+ASSAP+ D ++
Sbjct: 172 RRALARLFNVSTASPWICFGGSYAGSLAAWARLKFPHLILASVASSAPVRAVLD---FSE 228
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIAS--KPDGLAILSKKFKTCTPL---KNTTE 278
Y+ V + S+S + KP+ + C+ L ++ E
Sbjct: 229 YNEVRTAGGGCGGGRGCGEPPDSFSRGPTVGGVQKPNEHGC----WPACSSLGGAEDQAE 284
Query: 279 LKDGLDTVYSEAAQYDTPSNIP--VKRICNAI------ENAPNCGDDILCKIAAGVVEAD 330
L L + A QYD + +P V+R+C + +A G ++ +
Sbjct: 285 LLGALQALVGGAVQYDAQAGVPLSVRRLCGLLLGPSGSRSASYHGLRRAVQVVMRSLGQR 344
Query: 331 SLEYDGNN--SRCYINEDRTGDESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYI 387
L + ++ + + D W +Q+C+E +D + P L +
Sbjct: 345 CLSFSRAETVAQLRVTDPHVSGVGDRQWLYQTCTEFGFYVTCEDPRCPFSQLP-ALPSQL 403
Query: 388 KNCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
+ C++ +G+S S +YYGG +++++F NG DP+ +
Sbjct: 404 ELCEQVFGLSTASIARAVSQTNSYYGGQT------PGSTHVLFVNGDTDPWHV---LSVT 454
Query: 443 HFFDLSFSQDLNLG-SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
S S L G SHC+D+ + SD L R++ ++ +Q W+
Sbjct: 455 QALGPSESALLIPGASHCMDMAPERPSDSPSLRLGRQSILQQLQTWL 501
>gi|146425169|emb|CAM84573.1| intestinal prolyl carboxypeptidase 1 [Haemonchus contortus]
Length = 1071
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 130/511 (25%), Positives = 212/511 (41%), Gaps = 78/511 (15%)
Query: 30 RPRLGRIRRSRI--LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG 87
R RLGR + L+ D+ +ET ++ Q +DHF+ ++ TF Q+Y N + W
Sbjct: 560 RVRLGRPPHGFVPNLDTVDTPSEYETGYFTQPVDHFN--NQNPATFDQKYYKNEQ-WARE 616
Query: 88 NSPILAFLGAEAPIDDNIQLS-GFTY-ENAHQFKALIVILEHRYYGKS----VPFGSRKA 141
PI +G E P L+ +T+ + A +F A +LEHRYYG S + F S
Sbjct: 617 GGPIFLMIGGEGPSSAKWILNENYTWLQWAKKFGATTYMLEHRYYGDSDLQRLLFDSTDT 676
Query: 142 ALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHV 201
LK + Y +S Q L D A+ + I D N I G SY G LA W R +P++
Sbjct: 677 KLKRT-YTTYLSSLQMLYDTANFIQAI-DADNGKKGTWIVFGGSYAGSLALWMRKLFPNL 734
Query: 202 VIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLA 261
V G++ SSAP+ D + Y+ VV R SE+C I + + +I G
Sbjct: 735 VHGAVGSSAPLEAKLD---YHEYYQVVEASIRXYSEDCAYAIGEGFEDIHEKMLSERGRE 791
Query: 262 ILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIE--------NAPN 313
+SK FK P D + V+ Q+ + P+++ A++ A
Sbjct: 792 EISKTFKLNPPW-------DDVSDVFEIDKQFFFWN--PMEQFTAAVQYXGDNSGGYADG 842
Query: 314 CGDDILCKI--------AAGVVEAD-----------SLEYDGNN-----SRCYINEDRTG 349
G LCKI A + E + + EY N+ S Y + T
Sbjct: 843 HGIPDLCKIMTNERRTPMARIAEFNEYMTRFFTGKPAFEYTFNSYKEFVSTAYKAQFATD 902
Query: 350 DESDEG--WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYY 407
++ G W WQ+C+E D P L + + C + +G W + Y
Sbjct: 903 KKAAAGTLWLWQTCTEFGFYGTTDSGYSLFGNPLPLNFFTQLCSDLFG------WKIDYS 956
Query: 408 GGHDIKLIL----------RRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGS 457
+ + L + +N++ + G DP++ G V+ + + +
Sbjct: 957 AEMNRRATLNVNNRYGGRYKYEKTNVVMTYGTLDPWTALGPVECKESENCLMIKGT---A 1013
Query: 458 HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
HC ++ A+++D L + R I++GW+
Sbjct: 1014 HCAEMYPAREADLPSLKEARSKIENIIEGWV 1044
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 103/456 (22%), Positives = 184/456 (40%), Gaps = 80/456 (17%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +DH + + + QRY N++++ G + LG +D + T E
Sbjct: 39 YLKQKLDH----TQEVKEWSQRYFYNNRYYRKGGNVAFLMLGGMGVLD----IGWVTNEK 90
Query: 115 ------AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
A + AL+ LEHR+YGKS P + ++KN + Y QA+ D + + +
Sbjct: 91 IPFVQMAKERGALMFALEHRFYGKSRP--TDDLSVKNLK---YLTIEQAIGDIKTFIEEM 145
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKY--PHVVIGSLASSAPILYYGDITPWTTYHS 226
K+ + I G SY G LA W R KY +++ G++ASS + D T +
Sbjct: 146 NKKHKLENPKWIVFGGSYAGSLALWARDKYKDENLIAGAVASSPIMRPKFDFWEATQF-- 203
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKT----CTP------LKNT 276
K+ + ++C ++I+ + ++ ++ G + LS+ FK TP L N+
Sbjct: 204 -AEKEIQKVDKKCGESIRIGFMQMIDMLGNQVGRSQLSELFKMRPRFLTPDLRNIQLLNS 262
Query: 277 TELKDGLDTVYSEAAQY--DTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEY 334
+L + + V Y + + +K++C I N + + V S
Sbjct: 263 IQLNNFISAVQFRGGPYMQNGTHSYNLKQLCE-IMNTETIDQLTALERVSNVRHLQSKYL 321
Query: 335 DGNNSRCYINEDR------TGDESDEGWE-------WQSCSEMVVPMGKDK--NSMYQPE 379
+ + ++ D D +EGW WQ C+++ D NS++
Sbjct: 322 NDMDKYTPVDFDALMKYLLKKDFDEEGWASVDRASLWQRCTQLGSFPTTDGAINSIFG-S 380
Query: 380 PWNLTKYIKNCK---EQYG---VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPF 433
++ Y C+ E++ + L +YGG D +N++ +NG DP
Sbjct: 381 LVSIDFYADLCQVFGEKFNAEHIEMTVEETLQHYGGAD-----NYKGTNVVIANGGSDP- 434
Query: 434 SRGGWVKTYHFFDLSFSQDLNL-------GSHCLDL 462
YH S+D + GSHC D+
Sbjct: 435 --------YHLLSKLSSRDPTVVTYLIEGGSHCGDM 462
>gi|328718791|ref|XP_001947511.2| PREDICTED: putative serine protease K12H4.7-like [Acyrthosiphon
pisum]
Length = 500
Score = 113 bits (282), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 119/468 (25%), Positives = 197/468 (42%), Gaps = 55/468 (11%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
F++Q +DHF+ P +T+ QRY +S G P+ LG E I + G E
Sbjct: 55 FFDQKLDHFN--PTDNRTWKQRYQSHSLHHKIG-GPVFMLLGGEEKISNAWLKDGSMMEY 111
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A +F A+ LEHRYYG S P + N + Y + QALAD A + +K +
Sbjct: 112 AEKFNAMCFQLEHRYYGDSYPTDNL-----NTTNLKYLSIKQALADVAE-FIKVKSQNPL 165
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
I G SY G LA W R YP++V +++SS+ I D Y V K D
Sbjct: 166 YKGKWILFGGSYPGSLAAWARKTYPNLVHAAVSSSSVIKTRIDNID---YFKVAEKALTD 222
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-----KNTTELKDGLDTVYSE 289
+ +C I+++ I ++ +G + KFK C + K+ +L L +E
Sbjct: 223 YNPKCVSNIRQATMMISDLLDSENGTKYVQSKFKVCYRINTNVKKDVRQLFQQLSIPIAE 282
Query: 290 AAQYD---------TPSNIPVKRICNAIEN----APNCGDDILCKIAAGVVEADSLEYDG 336
QY+ +++ + +C+ + N +P + K V++ Y
Sbjct: 283 TIQYNRDNRYYSNMEKADLSITALCDIMLNRALGSPFDRYVVYHKKVRHVLKRRCSSYSY 342
Query: 337 NN---SRCYIN-EDRTGDESDEGWEWQSCSEM--VVPMGKDKNSMYQPEPWNLTKYIKNC 390
+ S IN ++ D W +Q C+E+ V +D + P + +I C
Sbjct: 343 QSLLQSNSEINWHGQSVKSGDRQWYYQLCTEIGNFVTSNEDDHLFGNNIPIDF--FIDLC 400
Query: 391 KEQYG--------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
+ +G + Y+G L+ +TS +I+ +G DP++ G +
Sbjct: 401 TDVFGEHFDLNKLEKAVHKTTMMYHG-------LKNTTSRVIYLHGSFDPWNGLGLTEPE 453
Query: 443 HFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
D S S ++ SHC DL + DP L + R+T + W+T+
Sbjct: 454 S--DDSISINIEGVSHCADLYTSSPKDPPQLSKARETVTFYLNKWLTE 499
>gi|357146992|ref|XP_003574183.1| PREDICTED: probable serine protease EDA2-like [Brachypodium
distachyon]
Length = 489
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 127/467 (27%), Positives = 193/467 (41%), Gaps = 67/467 (14%)
Query: 57 NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD--NIQLSGFTYEN 114
NQ +DHFS P ++ F QR+ + G P+ + E+ D N L+
Sbjct: 55 NQRLDHFS--PTDHRQFKQRHFEFLDYHRAGG-PVFLRICGESSCDGIPNDYLAVL---- 107
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD---- 170
A +F A +V EHRYYGKS PF + +N R + +S QAL D A + +D
Sbjct: 108 AKKFGAAVVTPEHRYYGKSSPF--ERLTTENLR---FLSSKQALFDLAVFRQYYQDALNY 162
Query: 171 KYNATHA---PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+YN + P G SY G L+ WFRLK+PH+ GSLASS +L + +T +
Sbjct: 163 RYNRSSGFDNPWFVFGVSYAGALSAWFRLKFPHLTCGSLASSGVVL---AVYNFTDFDKQ 219
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
V K EC ++++ +E+ + D ++ K LKN + L
Sbjct: 220 VGK---SAGPECKAALQET-TELVEEQLQSDSHSV--KALFGAQTLKNDGDFLFLLADAA 273
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGV-------VEADSLEYDGNNSR 340
+ + QY P +C+ + A G ++L A V + YD
Sbjct: 274 ATSFQYGNPDA-----VCSPLTKAKKNGKNLLESYAQFVRDYYIKKLGTTVSSYDQE--- 325
Query: 341 CYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP 400
Y+ D S W +Q CSE+ KN + N + CK +G P
Sbjct: 326 -YLKNTTPDDSSSRLWWFQVCSEVAYFQVAPKNDSVRSAKVNTRYNLDLCKNVFGEGVYP 384
Query: 401 SWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSH 458
+T YYGG I + S I+F+NG +DP+ K+ + N G H
Sbjct: 385 DVFMTNLYYGGTSI------AASRIVFTNGSQDPWRHASKQKSSEDMPSYLIKCSNCG-H 437
Query: 459 CLDLD---------EAKKSD---PDWLVQQRKTEVKIMQGWITQYYD 493
DL E SD P+ + + RK K + W++Q ++
Sbjct: 438 GTDLRGCPQLPFRIEGNSSDCTSPEAVSKVRKQIAKHIDLWLSQCHN 484
>gi|344268145|ref|XP_003405923.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 504
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 120/462 (25%), Positives = 187/462 (40%), Gaps = 50/462 (10%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY-EN 114
+ Q +DHF ++ QRY IN F+ G P+ +G N +
Sbjct: 66 FMQKLDHFDQKEIFWR---QRYFINDAFYKPG-GPVFLMIGGMGSAKRNWTSRNLPFVAY 121
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A + AL ++LEHR+YG+S P G L A R Y + Q L D A+ + I
Sbjct: 122 AERLGALCLVLEHRFYGRSQPTGD----LSTASLR-YIRNHQVLGDIANFRIKIAKLMGL 176
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
T +A G YGG LA W R+KYP + ++ SSAP+ +I + Y V
Sbjct: 177 TKNKWVAFGEFYGGSLAVWSRIKYPDLFAAAVGSSAPV--KAEIN-FDEYFEEVQVSLDA 233
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-----KNTTELKDGLDTVYSE 289
+ EC ++ + E+ + L + F C PL ++ T + + L +
Sbjct: 234 HNSECSSSVYLALREVTKRLIHQKHYSKLKRDFMLCEPLQIDSKQHATFVLENLMSFLIP 293
Query: 290 AAQYDTPSN-----IPVKRICNAIENAPNCGD-DILCKIAAGVVEADSLE-YDGNNSRCY 342
QY+ + C + P +I + ++ +L D N +
Sbjct: 294 IVQYNKKRKSVMNILSTDDFCKKMTETPLSSPYHRYARIMSNRIKNANLSCLDANYNHHL 353
Query: 343 INEDRTGDESDEGWE-----WQSCSEMVVPMGKDKNSMYQP-EPWNLTKYIKNCKEQYGV 396
T + + +Q C+E D S YQ L ++K C + +G
Sbjct: 354 RRMSETSLNNGNILQVRQRLYQCCTEFGFFQTTD--SKYQSFSELPLRYFLKQCSDVFGS 411
Query: 397 S------PRPSWVLT-YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
R + L YYGG ++K S IIFSNG DP+S G K + +F
Sbjct: 412 EYSFSALNRSAQALNKYYGGFNVK------GSKIIFSNGSLDPWSALGITKD---INKNF 462
Query: 450 SQDLNLG-SHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
L G +HC D+DE SD L+Q R+ +I+Q W+ +
Sbjct: 463 RAVLIEGEAHCADMDEKMDSDSAELIQAREKIFQILQEWLKE 504
>gi|347965569|ref|XP_321914.5| AGAP001240-PA [Anopheles gambiae str. PEST]
gi|333470451|gb|EAA01781.5| AGAP001240-PA [Anopheles gambiae str. PEST]
Length = 496
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 110/461 (23%), Positives = 204/461 (44%), Gaps = 48/461 (10%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
ET Q +DHF P++ T+ RY+ N + + G P+ ++G E I + G
Sbjct: 61 ETKHIMQRLDHFD--PQNVNTWSMRYMANGEHYVEGG-PLFIYVGGEWEISEGSISRGHV 117
Query: 112 YENAHQFKALIVILEHRYYGKSVP-FGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
Y+ A + K + EHR+YG+S P R LK Y N QALAD A ++ ++
Sbjct: 118 YDMAAELKGYLFYTEHRFYGQSHPTVDLRTDKLK------YLNIDQALADLAHFVVEMRK 171
Query: 171 KY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
A + VI IG SY + +WFR KYPH++ G+ ASSAP+ +T Y +V+
Sbjct: 172 TIPGAEKSGVIMIGGSYSATMVSWFRQKYPHLINGAWASSAPVFAK---VEFTEYKEIVT 228
Query: 230 KDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK-----DGL 283
+ R + C I+++ + + + + + A ++++F+ C+ + + L L
Sbjct: 229 ESIRLVGGQSCADRIERAIRQTEELLDRGE-YASVAQEFQLCSDVDLSQPLDRMNFFSSL 287
Query: 284 DTVYSEAAQYDTPSNIPVKRICNAIENAPNCGD-DILCKIAAGVVEADSLEYDGNNSRCY 342
++ QY + +I + +C IE+A D L K+ + + + G +
Sbjct: 288 SDEFAGVVQYHSTGDI--EGVCQVIEDATITDDMQALAKLVTRGLTSTNCNSYGYKAMVD 345
Query: 343 INEDRTGDE-----SDEGWEWQSCSEM--VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG 395
++ +E S W +Q+C+E G K P +L ++K C + Y
Sbjct: 346 YYKNTAWNEGAAMSSMRQWLYQTCAEYGWYQISGSSKQIFGSSFPVDL--FVKLCGDLYD 403
Query: 396 VSPRPSWVL-------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+ ++ YGG + ++ +N+ F+ G DP+ G + + D S
Sbjct: 404 GFFDKTRMMNNADRTNVIYGGWNPEV------TNVFFTQGQLDPWRAMGIQQDLN--DQS 455
Query: 449 FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
+ + +HC DL D + ++ +++++ W+
Sbjct: 456 PAVVIPGAAHCADLSSITAQDSAEMRAAKEKILELVKKWLA 496
>gi|145351417|ref|XP_001420076.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580309|gb|ABO98369.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 538
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 133/513 (25%), Positives = 207/513 (40%), Gaps = 48/513 (9%)
Query: 15 VIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFP 74
I V+ + + L+ R I + S E +F Q +DHF + T
Sbjct: 20 AISVNARWRVDALRARAIRRGIGGGHDASARTSIGAHERWFAEQRLDHFDNALNASWT-- 77
Query: 75 QRYVINSKFWGGG-NSPILAFLGAEAP-IDDNIQLSG-----FTYENAHQFKALIVILEH 127
QRY +N + +P+ +G E P +D ++ + G A + + L LEH
Sbjct: 78 QRYFVNDAYASAERGAPVFVCVGGEGPALDVDVAVDGGEHCAIATALAKKHRGLFFALEH 137
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP--------- 178
R+YGKS P G ++++ R + +SAQAL D + Y P
Sbjct: 138 RFYGKSQPTGD--LSVESLR---FLSSAQALEDLVTFTRFAAAAYGLEIEPRNDGRKYSK 192
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT--- 235
VIA G SY G LA W R+K+PHV ++ASSAP+ D+ Y+ VV R+
Sbjct: 193 VIAFGGSYPGMLAAWSRVKFPHVFHAAVASSAPVRAQIDM---RGYYEVVGDALREKDVG 249
Query: 236 -SEECFQTIKKSWSEIDNIASK-PDGLAILSKKFKTC--TPLKNTTELKDGLDTVYSE-A 290
S+ C+ ++ +++ N A K G L K+F C L D D + +
Sbjct: 250 GSDACYTAVENAFTVRLNEALKTSSGRRALEKQFNVCGDEALDGVGARDDFADVLRAMFP 309
Query: 291 AQYDTPSNIPVKRICNAIENAPNC----GDDILCKIAAGVVEADSLEYDGNNSRCYINED 346
AQ + PS + C I A G+D L +AA V E +S Y+ E
Sbjct: 310 AQNNDPSCLADDDSCFNIAKACTIMTSHGEDKLAALAAHVAAVFRGECVSLDSEAYMREL 369
Query: 347 RT-----GDESDEGWEWQSCSEMVVPMGKDKNS----MYQPEPWNLTKYIKNCKEQYGVS 397
++ E + W WQ+C+E +K+S P L Y K C + +GVS
Sbjct: 370 KSEIPNPKGEGERQWTWQTCTEFAFFQTCEKSSKCPFKLDPPTMPLEAYHKICADVFGVS 429
Query: 398 PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGS 457
+ + I T I+F +G DP+ +V + + S
Sbjct: 430 AEQTRLAVERSNARYGSITPGGT-RIMFPSGSIDPWIANSFVSDTFAPRFEPALIVKGAS 488
Query: 458 HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
H K +D D LV+ R V ++ W+ +
Sbjct: 489 HHAWTHPPKDTDTDALVEARAIIVGQVEKWLNE 521
>gi|357140818|ref|XP_003571960.1| PREDICTED: probable serine protease EDA2-like [Brachypodium
distachyon]
Length = 503
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 124/470 (26%), Positives = 196/470 (41%), Gaps = 63/470 (13%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E + +QT+DHFS P ++ F QRY + N P+ + E+ N + +
Sbjct: 51 EEHWMSQTLDHFS--PTDHRQFKQRYYEFLDYHRVPNGPVFLNICGESSC--NGISNSYL 106
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
A +F A +V EHRYYGKS PF + +N R + +S QAL D A + ++
Sbjct: 107 AVIAKKFGAALVSPEHRYYGKSSPF--KSLTTENLR---FLSSKQALFDLAVFRQYYQET 161
Query: 172 YNATHAPVIA------IGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
NA + A G SY G L+ WFRLK+PH+ GS ASS +L + +T +
Sbjct: 162 LNAKYNRSGADNSWFVFGGSYSGALSAWFRLKFPHLTCGSHASSGVVL---AVYNFTDFD 218
Query: 226 SVVSKDFRDTSEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPLKNTTELKDGLD 284
+ + EC + ++++ +D + S + + K+ L+N + L
Sbjct: 219 KQIGE---SAGPECKEALQETTKLVDGQLQSGRNSV----KQLFGARMLQNDGDFLYLLA 271
Query: 285 TVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA-------AGVVEADSLEYDGN 337
+ A QY P +C+ + A G D++ A G A YD
Sbjct: 272 DAAAIAFQYGNP-----DILCSPLVEAKKNGTDLVEAFAHYVNKYYVGTFGASVASYDQQ 326
Query: 338 NSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--G 395
Y+ + S W +Q CSE+ KN + + ++ CK + G
Sbjct: 327 ----YLKNTTPAESSYRLWWYQVCSEVSYFQVAPKNDSVRSAKIDTRYHLDLCKNVFGEG 382
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL 455
V P S YYGG R + S I+F+NG +DP+ K+ + N
Sbjct: 383 VYPDVSMTNLYYGG------TRIAGSKIVFANGSQDPWRHASKQKSSKEMPSYLIECSNC 436
Query: 456 GSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQYYD 493
G HC DL D +K S P+ + + RK V + W+++ D
Sbjct: 437 G-HCSDLSGCPQAPSNIEGDSSKCSSPEAVNKVRKQIVDNIDLWLSECQD 485
>gi|29840883|gb|AAP05884.1| similar to NM_066318 prolylcarboxypeptidase (angiotensinase C) in
Homo sapiens [Schistosoma japonicum]
Length = 184
Score = 112 bits (281), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 63/160 (39%), Positives = 96/160 (60%), Gaps = 6/160 (3%)
Query: 45 KDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN 104
KDS +ET ++ IDHFS+ + F +Y+IN++ + G PIL + G E I+
Sbjct: 28 KDSQFKYETKYFRTKIDHFSFVTDG--EFEIKYLINNESFSSGG-PILFYTGNEGAIETF 84
Query: 105 IQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
+ SGF ++ A + A +V EHRYYG S+PFG+ + K+ +H GY + QALADY +
Sbjct: 85 AENSGFIWKLAEELNASVVFAEHRYYGTSLPFGND--SFKDRQHFGYLTAEQALADYVLL 142
Query: 165 LLHIKDKYNA-THAPVIAIGASYGGELATWFRLKYPHVVI 203
+ +K Y+ +PVI+ G SYGG L+ W R KYP+ ++
Sbjct: 143 INQLKINYSCFASSPVISFGGSYGGMLSAWIRQKYPNQIV 182
>gi|320165589|gb|EFW42488.1| serine carboxypeptidase S28 [Capsaspora owczarzaki ATCC 30864]
Length = 491
Score = 112 bits (280), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 128/497 (25%), Positives = 202/497 (40%), Gaps = 84/497 (16%)
Query: 33 LGRIRRSRILEQKDSNHGFET-FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPI 91
LGR S I + S G ++ +NQ IDHF+ P+ +TF QRY N+ + N PI
Sbjct: 32 LGRTPASLIQRRSASALGNDSQSVFNQLIDHFN--PQHRETFKQRYFENTDNFDPVNGPI 89
Query: 92 LAFLGAEAPI----DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNAR 147
++ EA +D I++ + QF A IV LEHRYYG+S PF A
Sbjct: 90 FLYICGEATCGGIPNDYIRVL------SKQFNAAIVTLEHRYYGESSPF-----AQLTTP 138
Query: 148 HRGYFNSAQALADYASILLHIKDKYNATHAPV--------------IAIGASYGGELATW 193
+ Y S QA+ D A+ +D Y V G SY G L+ W
Sbjct: 139 NLQYLTSRQAINDLAAF----RDFYQHNVVDVRYAQQRAGRGDNLWFTYGVSYSGALSAW 194
Query: 194 FRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDN- 252
FRLK+PH+ GSLASS G + ++ + R +C + + S ++
Sbjct: 195 FRLKFPHLTAGSLASS------GVVDVVLSFPEFDEQVTRSVGSDCANALHAAMSGVEAL 248
Query: 253 IASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAP 312
+A+ P +L K T L + + L + + QY + +C + A
Sbjct: 249 LAANPVATKVLFKA----TSLSSNLDFLSMLADSTALSVQYGHKDS-----MCPPLVQAF 299
Query: 313 NCGDDILCKIAAGVV-------EADSLEYDGNNSRCYINEDRTG-DESDEGWEWQSCSEM 364
G ++ A V E D Y S+ Y+ + + G D W +Q+C+EM
Sbjct: 300 QAGQNMTLAFAQYVTTSFYTIFEVDPFSY----SQEYLKQVQAGPDSGARQWTYQTCAEM 355
Query: 365 ----VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGV-SPRPSWVLTYYGGHDIKLILRRS 419
V P G S + Y C+ +GV P + YYG +I +
Sbjct: 356 GYFQVAPAGFSIRS----RQLTIDYYQSLCQNVFGVWPPVINATNEYYGARNI------A 405
Query: 420 TSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL-----DEAKKSDPDWLV 474
++ F+NG +DP+ + + + + + H +D+ A+ + L
Sbjct: 406 STQTFFTNGAQDPWQNVTLQVSNNPLRPTATAVCDNCGHGVDMRGCPQSPAQTNGDTSLC 465
Query: 475 QQRKTEVKIMQGWITQY 491
+ + VK +Q I QY
Sbjct: 466 KPDGSNVKAIQASIVQY 482
>gi|115482892|ref|NP_001065039.1| Os10g0511400 [Oryza sativa Japonica Group]
gi|110289389|gb|AAP54577.2| prolyl carboxypeptidase like protein, putative, expressed [Oryza
sativa Japonica Group]
gi|113639648|dbj|BAF26953.1| Os10g0511400 [Oryza sativa Japonica Group]
gi|215697793|dbj|BAG91986.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 507
Score = 112 bits (280), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 121/468 (25%), Positives = 193/468 (41%), Gaps = 63/468 (13%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E + +QT+DHF+ P ++ F QRY ++ PI ++ E+ N + +
Sbjct: 53 EERWMDQTLDHFN--PTDHRQFKQRYYEFLDYYRAPKGPIFLYICGESSC--NGIPNSYL 108
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
A +F A +V EHRYYGKS PF S +N R + +S QAL D A + ++
Sbjct: 109 AVMAKKFGAAVVSPEHRYYGKSSPFESLTT--ENLR---FLSSKQALFDLAVFRQYYQET 163
Query: 172 YNATH------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
NA + + G SY G L+ WFRLK+PH+ GSLASS +L + +T +
Sbjct: 164 LNAKYNRSGADSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL---SVYNYTDFD 220
Query: 226 SVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT 285
+ + EC ++++ +D G + + F T L N + L
Sbjct: 221 KQIGE---SAGPECKAALQETTKLVD--GQLQSGRNAVKQLFGAST-LANDGDFLFLLAD 274
Query: 286 VYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA-------AGVVEADSLEYDGNN 338
+ A QY P +C+ I A G D++ A G A YD
Sbjct: 275 AAAIAFQYGNPD-----ALCSPIVEAKKNGTDLVETFARYVKDYYIGTFGASVASYDQE- 328
Query: 339 SRCYINEDR--TGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGV 396
Y+ + + W +Q CSE+ KN + + ++ C+ +G
Sbjct: 329 ---YLKNTTPPPAESAYRLWWYQVCSEVAYFQVAPKNDSVRSAKIDTRYHLDLCRNVFGE 385
Query: 397 SPRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
P +T YYGG R + S I+F+NG +DP+ K+ + N
Sbjct: 386 GVYPDVFMTNLYYGG------TRIAGSKIVFANGSQDPWRHASKQKSSKELPSYLIECSN 439
Query: 455 LGSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
G HC DL D + S P+ + + RK V + W+++
Sbjct: 440 CG-HCSDLSGCPQAPSHIEGDSSNCSSPEAVNKVRKQIVDHIDLWLSE 486
>gi|146425171|emb|CAM84574.1| intestinal prolyl carboxypeptidase 2 [Haemonchus contortus]
Length = 1143
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 125/482 (25%), Positives = 200/482 (41%), Gaps = 65/482 (13%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLS-G 109
+E ++ Q +DHF+ ++ TF QRY N + W N PI +G E+ D + L+
Sbjct: 602 YEAGYFTQPVDHFNN--KNPYTFEQRYFKNDQ-WAKPNGPIFLMIGGESERDSSWVLNEN 658
Query: 110 FTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAAL-KNARHRGYFNSAQALADYASILLH 167
TY A +F A + LE RYYGKS F S A+ K + Y +S Q L D A+ +
Sbjct: 659 LTYLKWADEFGATVYALEXRYYGKSDLFDSLDPAVSKKNTYTTYLSSLQMLYDVANFIRA 718
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+ D H I G SY G LA W R +P +V G++ SSAP+ D + Y+ V
Sbjct: 719 V-DAERGQHGKWIMFGGSYAGSLALWMRRLFPDLVYGAIGSSAPLEAKLD---FYDYYQV 774
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD------ 281
V K R SE+C I + + +I G A L++ FK P + +++ +
Sbjct: 775 VEKSIRSHSEDCAYAIAEGFDDIRQRLLTEKGRAQLTEIFKLNPPWDDVSDVFEIDKQFF 834
Query: 282 --GLDTVYSEAAQYDTPS--------NIP-VKRICNAIENAPNCG----DDILCKIAAGV 326
L +++ A QY + IP + RI P ++ + + G
Sbjct: 835 ISNLVDMFASAVQYSGDNRGHYAHGYGIPDMCRIMTKQGRKPISSIAAFNEYMTNMFTGD 894
Query: 327 VEADSL--EYDGNNSRCYINEDRTGDESDEG--WEWQSCSEMVVPMGKDKNSMYQPEPWN 382
E +S+ YD Y + T + G W WQ+C+E D
Sbjct: 895 TEFESMFNSYDDLKRLLYKAQFSTNPKEAAGTLWLWQTCTEFGFYQTTDSGYSLFGNLLP 954
Query: 383 LTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSN----------------IIFS 426
L Y + C + +G+ T Y B RR+T + ++ +
Sbjct: 955 LNFYTQLCSDVFGLK-------TSYSAKBN----RRATLSANKRYGGRFNYGADPMVVMT 1003
Query: 427 NGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQG 486
+G DP++ G + T D F + +HC ++ A+ D L R+ I++
Sbjct: 1004 HGSLDPWNALGNI-TCDPADKCFM--IKGTAHCAEMYPARDKDEQDLKDTRERIRGILKS 1060
Query: 487 WI 488
WI
Sbjct: 1061 WI 1062
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 62/245 (25%), Positives = 108/245 (44%), Gaps = 30/245 (12%)
Query: 41 ILEQKDSNH--------GFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL 92
I+ ++NH G ++ Q +DH E T+PQRY + +++ G +
Sbjct: 31 IIPMDENNHTLMAQNGFGGREAYFKQKLDHTKDDGEG--TWPQRYFYSQRYYRKGGNVFF 88
Query: 93 AFLGAEAPID------DNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNA 146
LG +D + + + E Q AL EHR+YGKS P + +
Sbjct: 89 LMLGGMGVMDIGWVTNEKLPFVQWGKERGAQLYAL----EHRFYGKSRPTPNL-----SV 139
Query: 147 RHRGYFNSAQALADYASIL--LHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIG 204
R+ Y QA+ D A+ + ++ K + A I G SY LA W R KYP+++ G
Sbjct: 140 RNLAYLTIDQAIGDVANFIKEMNAKHRIXDEDAKWIVFGGSYAASLALWARQKYPNLIAG 199
Query: 205 SLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILS 264
++ASS + D T + + +R T C + I+ ++ ++ ++ G + +S
Sbjct: 200 AVASSPLMRPRFDFWEGTQFAEDI---YRKTDATCAENIEIAFQQLADMLGSERGRSQVS 256
Query: 265 KKFKT 269
+ KT
Sbjct: 257 ELLKT 261
>gi|308162690|gb|EFO65071.1| Thymus-specific serine protease precursor [Giardia lamblia P15]
Length = 522
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 120/466 (25%), Positives = 191/466 (40%), Gaps = 79/466 (16%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E +F Q +DHF P + + + QRY N ++ G P+ +G E P+ F+
Sbjct: 56 ELWFREQHVDHFD--PMNTKKWSQRYYYNDTYYKAGG-PVFLMIGGEGPVTPKYVEDYFS 112
Query: 112 YEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+ A L V LEHR+YG S P + A L R S QALAD A+ L ++K
Sbjct: 113 IDYFAKNMNGLKVALEHRFYGASFP-STDSADLSLLR------SDQALADIATFLAYLKR 165
Query: 171 KYNATHAP-VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
+YN + ++A+G SY G LA W R+++P ++ +++SS P L D + + + S
Sbjct: 166 EYNLPESTKIVAVGGSYSGNLAAWARIQFPFIISAAISSSGPYLAQTDYPEYLQH--IDS 223
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK--------- 280
+ + + C I + + + + S K + + NT K
Sbjct: 224 QIRKYGGDRCMDVISAAHKDAEYLLSHDKATLAAIFKLREESIYNNTGYDKASFMSAMGA 283
Query: 281 -DGLDTVYSEAAQYDTPSNIPVKRICNAIE---NAPNCGDDI---------LCKIAAGVV 327
G+ Y T + +K++C AIE + + G+ L G +
Sbjct: 284 PSGVVQYAKHDGYYTTTKDGDIKQMCKAIEAYYDGYDAGESYRQLKAYSLWLLDYYDGSM 343
Query: 328 EADSLEYDGNNSRCYIN--EDRTGDES---DEGWEWQSCSEMVVPMGKDKNSMYQPEPWN 382
E L +DG YI +D + D D W WQ+C E + +
Sbjct: 344 EEIDLSFDG-----YIKAIQDTSIDSEFAVDRSWLWQTCVEFGYYQTSSPTAGFGTM-IT 397
Query: 383 LTKYIKNCKEQY---GVSPRPSWVLT-----------------YYGGHDIKLILRRSTSN 422
L +++ C + Y G +P + T YYG +IK+ SN
Sbjct: 398 LDYFLELCYKAYFAPGATPPGTQSFTRSQSDDLVKKAVQFTNVYYGARNIKI------SN 451
Query: 423 IIFSNGMRDPFS----RGG--WVKTYHFFDLSFSQDLNLGSHCLDL 462
I +NG DP+S R G W +H + S + + GSHC DL
Sbjct: 452 IYITNGHVDPWSELSYREGETWSTGHHLHNDSTTSYIPNGSHCTDL 497
>gi|326502560|dbj|BAJ95343.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 490
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 130/477 (27%), Positives = 196/477 (41%), Gaps = 68/477 (14%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD--NIQLSG 109
E + +Q +DHFS ++ F QRY + P+ + E+ D N L+
Sbjct: 50 EERWMSQRLDHFS--SSDHRQFKQRYFEFLDYHDDPTGPVFLRICGESSCDGIPNDYLAV 107
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
A +F A +V EHRYYGKS PF S N R + +S QAL D A + +
Sbjct: 108 I----AKKFGAAVVTPEHRYYGKSSPFDS--LTTDNLR---FLSSKQALFDLAVFRQYYQ 158
Query: 170 DKYNATHA-------PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
+K N+ + P G SY G L+ WFRLK+PH+ GSLASS +L + T +
Sbjct: 159 EKLNSRYNRSAGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGVVLAVYNFTDF- 217
Query: 223 TYHSVVSKDFRDTS-EECFQTIKKSWSEIDNIASK---PDGLAILSKKFKTCTPLKNTTE 278
K D++ EC K + EI + K D ++ K LKN +
Sbjct: 218 ------DKQVGDSAGPEC----KAALQEITRLVDKQLLSDSHSV--KALFGADSLKNDGD 265
Query: 279 LKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNN 338
L + QY P +C+ + NA G+ L + A V+ ++ G
Sbjct: 266 FLFLLADAAATTFQYGNPDA-----LCSPLANAKKKGES-LVETYAHFVKDYYIKKLGTT 319
Query: 339 ----SRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY 394
+ Y+ E D S W +Q CSE+ KN + + + CK Y
Sbjct: 320 VSSYDQEYLKETTPDDSSSRLWWFQVCSEVAYFQVAPKNDSVRSAQIDTRYNLDLCKNVY 379
Query: 395 GVSPRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD 452
G P +T YYGG I + S I+F+NG +DP+ K+ +
Sbjct: 380 GEGVYPDVFMTNLYYGGTSI------AASKIVFTNGSQDPWRHASKQKSSEGMPSYIIKC 433
Query: 453 LNLGSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKA 497
N G H DL D + + P+ + RK VK + W++Q ++ +A
Sbjct: 434 SNCG-HGTDLRGCPQLPFRIEGDSSNCTSPEAVNTVRKQIVKHIDLWLSQCHEPARA 489
>gi|195497934|ref|XP_002096311.1| GE25599 [Drosophila yakuba]
gi|194182412|gb|EDW96023.1| GE25599 [Drosophila yakuba]
Length = 473
Score = 112 bits (279), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 110/459 (23%), Positives = 197/459 (42%), Gaps = 54/459 (11%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+T + Q +DHF P +T+ RY++N + G +P+ +LG E I G
Sbjct: 46 QTLWIEQKLDHFD--PAETRTWQMRYMLNDALYKSG-APLFIYLGGEWEISSGRITGGHL 102
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
Y+ A + AL+ EHRYYG+S P L N + Y + Q+LAD A + IK
Sbjct: 103 YDMAKEHNALLAYTEHRYYGQSKPL----PDLSNENIK-YLSVNQSLADLAYFINTIKQN 157
Query: 172 YNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ + + VI +G SY + TWF+ YP +V G ASSAP+ + Y V +
Sbjct: 158 HEGLSESKVIIVGGSYSATMVTWFKKLYPDLVAGGWASSAPLFAK---VNFVEYKEVTGQ 214
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
+ +C++ I+ +E++++ + G A + K C P ++L + T++SE
Sbjct: 215 SIQQMGGSDCYKRIENGIAEMESMIATKRG-AEVKAILKLCEPFDVYSDLD--VWTLFSE 271
Query: 290 -----AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADS----LEYDGNNSR 340
A T + ++ +C I N + + + E+ L YD
Sbjct: 272 ISDIFAGVVQTHNAGQIEGVCQKIMAGSNDLNGVAGYLLDVFAESGGKCYDLSYDAITG- 330
Query: 341 CYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN----LTKYIKNCKEQYG- 395
++ + G+ + W +Q+C+E G + S +P+ +T Y C + YG
Sbjct: 331 LLLDTNYNGNIMRQ-WMFQTCNE----YGWYQTSGSTAQPFGTKFPVTYYTTMCADLYGS 385
Query: 396 ------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
+S + S ++GG L N+ ++G DP+ G D S
Sbjct: 386 QYSNEFISNQVSITNQFFGG------LSPGVENVYLTHGQLDPWRAMG------IQDESQ 433
Query: 450 SQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ + +HC D + SD + ++ ++++ W+
Sbjct: 434 ATIIPEHAHCKDFNSISSSDTAEMKASKERIAELVREWV 472
>gi|338718275|ref|XP_003363793.1| PREDICTED: LOW QUALITY PROTEIN: thymus-specific serine
protease-like [Equus caballus]
Length = 620
Score = 111 bits (278), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 122/465 (26%), Positives = 194/465 (41%), Gaps = 69/465 (14%)
Query: 64 SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIV 123
S G + FP RY +N + W + P+ LG E + L G A + AL++
Sbjct: 173 STGEKDVLEFP-RYWVNDQHWTAQDGPVFLHLGGEGSLWPGSVLRGHPATLAPSWGALVI 231
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAPVIAI 182
LEHR+YG S+P G L A+ R + +S AL D S L + +N ++ +P I
Sbjct: 232 GLEHRFYGLSIPAG----GLDMAQLR-FLSSRHALTDVVSARLALSRLFNVSSSSPWICF 286
Query: 183 GASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT----SEE 238
G SY G LA+W RLK+PH++ S+ASSAP+ D ++ Y+ VVS+ + S E
Sbjct: 287 GGSYAGSLASWARLKFPHLIFASVASSAPVRAVLD---FSEYNDVVSRSLMNPAIGGSPE 343
Query: 239 CFQTIKKSWSEIDN--IASKP--DGLAILSKKFKTCTPLK-NTTELKDGLDTVYSEAAQY 293
C +++E++ A P L + + + P + N EL L + A QY
Sbjct: 344 CQAAAAAAFAEVERRLRAGGPARAALRGGAGRLRGAGPARSNQAELLGALQALLGGALQY 403
Query: 294 D---TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-------- 342
D P + V+R + P G G+ A + RC
Sbjct: 404 DGAGGPRRMSVRRPLRSPPRGP--GQRQRPAPXRGLRRAAQVVIQSLGQRCLSSSRAETV 461
Query: 343 ----INEDRTGDESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG-- 395
+ E + D W +Q+C+E +D + P L + C++ +G
Sbjct: 462 AQLKVTEPQVSGVGDRQWVYQTCTEFGFYVTCEDPGCPFSKLP-ALPSQLALCEQVFGLS 520
Query: 396 ---VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD 452
V+ + +YYGG + ++F NG DP ++ LS +Q
Sbjct: 521 ISSVAQAVARTNSYYGGQTP------GATRVLFVNGDTDP-----------WYVLSVTQA 563
Query: 453 LNL---------GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L SHCLD+ + SD L+ R+ + +Q W+
Sbjct: 564 LGPSESALLIPSASHCLDMAPERPSDSPSLLLGRQHISQQLQTWL 608
>gi|326427042|gb|EGD72612.1| hypothetical protein PTSG_04347 [Salpingoeca sp. ATCC 50818]
Length = 482
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 120/450 (26%), Positives = 191/450 (42%), Gaps = 44/450 (9%)
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYE 113
+++ +DHF + S TF RY I+ + G S ++G E P + I+ +Y
Sbjct: 61 LYFDFFLDHFDH---SSPTFRGRYYIDDSQFKNG-SVCFFYMGGEGP-NTGIRNDYVSYL 115
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A Q+KALIV +EHR+YG SVPF + + Y S QALAD A ++ H+
Sbjct: 116 -AKQYKALIVSIEHRFYGDSVPFDDF-----SVTNLEYLTSRQALADAAQLIKHVNSSDT 169
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
+ A G SY G L+ WFR+KYP V++GSL+SS + I +T + V
Sbjct: 170 YKCSAWFAFGGSYSGALSAWFRVKYPDVIVGSLSSSGVV---NAILDFTAFDVQVRNAIG 226
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQY 293
+ + Q + ++ N + K + A + P + + L + A QY
Sbjct: 227 FSCTKDLQRVTAAFETALNKSDKSNAHAKALFSVRADIPDGDFAYM---LADSAAMADQY 283
Query: 294 DTPSNIPVKRICNAIENAPNCGDD-ILCKIAA-------GVVEADSLEYDGNNSRCYINE 345
+ +++C+AI N DD I+ + A G S YD S C +
Sbjct: 284 GSK-----EKLCSAISGLRNEKDDEIVMQTFANFTIKFWGADFGPSCFYD---SECVRSN 335
Query: 346 DRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWV 403
+ W WQ C E+ + + ++ + + C+ + GV P
Sbjct: 336 PAAWQPTARSWWWQKCHELAYWQNAPVVNSLRMSLLSMNYHKQRCEFMFAKGVFPDTQGT 395
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ-DLNLGSHCLDL 462
YYGG + +NI FS+ DP+ + V+T L + N HC+DL
Sbjct: 396 NKYYGGK------HPNGTNIFFSDFSDDPWQQAS-VRTTLSPALPYELVTCNGCGHCMDL 448
Query: 463 DEA-KKSDPDWLVQQRKTEVKIMQGWITQY 491
++DP+ L Q R K + W+ Y
Sbjct: 449 HAPDDENDPNALKQGRVAFEKHLSTWLKPY 478
>gi|268535772|ref|XP_002633021.1| Hypothetical protein CBG21793 [Caenorhabditis briggsae]
Length = 1085
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 116/490 (23%), Positives = 197/490 (40%), Gaps = 48/490 (9%)
Query: 30 RPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHF-SYGPESYQTFPQRYVINSKFWGGGN 88
RP G + D GFE + Q +HF + P+ +Q Q++ NS+ W
Sbjct: 556 RPPHGFLPEPDYEMPADMPPGFEQGMFRQRENHFDNRNPDFFQ---QKFYKNSQ-WAQPG 611
Query: 89 SPILAFLGAEAPIDD----NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALK 144
P +G E P N L+ TY A ++ A + ILEHR+YG S+ +
Sbjct: 612 GPNFLMIGGEGPEGPRWVLNENLTWLTY--AKKYGATVFILEHRFYGDSL-------VGQ 662
Query: 145 NARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIG 204
N + S Q L D A + + + T AP I G SY G ++ W R +P +VIG
Sbjct: 663 NNDNFNVLTSLQMLYDLAEFIKAVNIR-TGTSAPWITFGGSYSGAMSAWMREVFPELVIG 721
Query: 205 SLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILS 264
++ASS P+ D + Y VV K R + C I+ +S + + +G LS
Sbjct: 722 AVASSGPVFAKTD---FYEYLMVVEKSIRTYDKTCADRIQSGFSTMQTMFQTKEGRQNLS 778
Query: 265 KKFKTCTPLKNTTELKDG---LDTVYSE---AAQYDTPSNIP------VKRICNAIENAP 312
F+ P + D +Y A QY + P + +C + N
Sbjct: 779 DIFQLQPPFGDNVTDTDQHYFFSNIYGNFQGAVQYSGDNTGPYANGYGIPDMCKFMTNDS 838
Query: 313 NCGDDILCKIA--AGVVEADSLEYDGNNSR------CYINEDRTGDESDEG--WEWQSCS 362
N + + + V + Y G +++ ++ +G ES W WQ+C+
Sbjct: 839 NTPLNNIVQFNEYMTVFYNNGRNYTGTDNKYQDMIDSLVHAQESGPESAASLLWTWQTCN 898
Query: 363 EMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRR--ST 420
E D + P + +++ C + +G + + ++ + K R+
Sbjct: 899 EFGYFQSADTGNGIFGSPTPVNMFVQMCMDVFGSTYQRIFIDNQIAQTNYKYGERQHYRG 958
Query: 421 SNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTE 480
+N++F NG DP+ G + D + +N +HC D+ A+ +D L R
Sbjct: 959 TNVVFPNGNVDPWHALGLYGSAD--DSVVAYLINGTAHCADMYPARAADVPGLKVVRDII 1016
Query: 481 VKIMQGWITQ 490
+ W++Q
Sbjct: 1017 DTNIGKWLSQ 1026
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 100/437 (22%), Positives = 175/437 (40%), Gaps = 42/437 (9%)
Query: 35 RIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAF 94
++ ++ L ++N T + Q +DH Y + TF QRY + + L +
Sbjct: 29 KLNKAARLSTGNANVPVTTAYMIQNLDH--YNGNASGTFIQRYYYTESYTLHQRTAFL-Y 85
Query: 95 LGAEAPIDDNIQLS--GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYF 152
+ + ++ ++A QF A + LEHRYYG+S P A ++ Y
Sbjct: 86 ISVSGDFETSVITDDRNPVVKSAKQFGATVFSLEHRYYGQSKP----NVANFDSNSLRYL 141
Query: 153 NSAQALADYASILLHIKDKYNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAP 211
NS QA+ D + + + ++N + GA YGG +A R P VV G +ASS+P
Sbjct: 142 NSFQAIQDIVAFIKYANKQFNMDPDVRWVLWGAGYGGVIAAEARKWNPDVVAGVIASSSP 201
Query: 212 ILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
+ + D W V ++ + C+Q I + +++I P+G + +S F+
Sbjct: 202 LTHVYDF--WQFNDHVQIAISQEGGQLCYQKIMQGFTDIRLAMRTPEGRSNISDLFQLNP 259
Query: 272 PLKNTTELKDGLDTVY-------SEAAQYDTPSNIPVKRICNAIENAPNCGDDI------ 318
L T + + T Y E Q++ NI + +C I+ + +
Sbjct: 260 RLDQTNLTYNDIQTFYLAIMSPFQEMIQFNNDFNIDIGALCTTIDQSTWTPMQVIWQAYV 319
Query: 319 -LCKIAAGVVEADSLEYD------GNNSRCYINEDRTGDESDEGWEWQSCSEMV-VPMGK 370
L G V+ Y GN S N D + W++Q C+E +P
Sbjct: 320 YLSTTVTGSVQPMITSYQTIVSDLGNQSASSPNID------NRMWQYQMCTEFGWIPTTN 373
Query: 371 DKNSMYQPEPWNLTKYIKNCKEQY-GVSPRPSWV--LTYYGGHDIKLILRRSTSNIIFSN 427
+ + ++ C + + G + + + LT H S +N++F+N
Sbjct: 374 NNEQGLFGAVIPTSLFLNMCFDIFPGANMDATTIRDLTIDYNHYYGSSYDYSGTNVVFTN 433
Query: 428 GMRDPFSRGGWVKTYHF 444
G DP+S G T F
Sbjct: 434 GWYDPWSTLGKETTADF 450
>gi|170066901|ref|XP_001868269.1| thymus-specific serine protease [Culex quinquefasciatus]
gi|167863077|gb|EDS26460.1| thymus-specific serine protease [Culex quinquefasciatus]
Length = 484
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 120/468 (25%), Positives = 200/468 (42%), Gaps = 68/468 (14%)
Query: 53 TFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY 112
T + Q +D+F P++ T+ RY+ N + + G SP+ F+G E I G Y
Sbjct: 52 TKWIKQKLDNFD--PQNPSTWSMRYMENGEHYVPG-SPLFIFVGGEWTISAGSIQQGHFY 108
Query: 113 ENAHQFKALIVILEHRYYGKSVP-FGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+ A + +A + EHRYYG+S P +R ++ + N QALAD A + ++
Sbjct: 109 DMAAEHRAYLFYTEHRYYGQSRPTVNTRTDQMR------FLNVDQALADLAHFVEEMRRT 162
Query: 172 Y-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
A ++ VI +G SY + WFR KYPH+V G ASSAP+L D T Y VVS+
Sbjct: 163 IPGAENSKVIMVGGSYSATMVVWFRQKYPHLVNGVWASSAPLLAKLDF---TEYKEVVSE 219
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC--TPLKNTTELKDGLDTV- 286
R + C +++ +E++++ K +++ F C T L NT + + L ++
Sbjct: 220 SIRLVGGDACADRVQRGVAEVEDLI-KQGSYDQVAQAFNLCADTDLSNTRDRQGFLSSIS 278
Query: 287 --YSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRC--- 341
++ QY +I + +C I N PN D+ EA + + ++RC
Sbjct: 279 DTFAGVVQYHWSGDI--EGVCKVI-NDPNYNTDM---------EALAGWFTYGSTRCLDA 326
Query: 342 --------YINEDRT---GDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNC 390
Y N D T S W +Q+C+E + + + YI+ C
Sbjct: 327 SYESMISYYRNTDWTHGANTGSMRPWLYQTCAEYGWYQTSGSENQIFGSGFPVDLYIQWC 386
Query: 391 KEQYGVS-PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSF 449
+ Y P S + I + +N++F+ G DP+ G
Sbjct: 387 ADLYDNKFPESSMHANVARTNTIYGHMNPEVTNVLFTQGQLDPWRPMG-----------V 435
Query: 450 SQDLN---------LGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
QDLN L SH DL+ D + ++ ++++ W+
Sbjct: 436 QQDLNERSPAVVIPLASHVADLNSISDWDSTEMRAAKEKISELVRLWL 483
>gi|219121656|ref|XP_002181178.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407164|gb|EEC47101.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 544
Score = 111 bits (277), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 112/462 (24%), Positives = 189/462 (40%), Gaps = 71/462 (15%)
Query: 52 ETFFY-NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
E F+Y QT+DHF+ + + QRY + K++ G PI +G E ++ I
Sbjct: 90 EPFYYKEQTLDHFTPNKDE-APWAQRYYQDDKYFAGPGHPIFVIMGGEDAVN-GILYPFV 147
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASIL----- 165
+ A +F+A + LEHR+YGKS P A R + AQALAD +
Sbjct: 148 SKHLAKRFRAHTLCLEHRFYGKSKPLKHPS----TADLRRLLSPAQALADAVQFIEYKRK 203
Query: 166 -LHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTY 224
L +K ++ PV+ +G SY G L+ R+ YP VV ASSAP+ Y Y
Sbjct: 204 QLGCGNKGTKSYCPVVTVGGSYPGFLSALLRIVYPDVVDIGYASSAPLHLYSHRVNKAAY 263
Query: 225 HSVVSKDFRDTSEECFQTIKKSWSEI-DNIASKPDGLAILSKKFKTCTP------LKNTT 277
V++ S C +K + ++ + + + +A ++ C + N
Sbjct: 264 FEKVTQVAEQASRGCAGAVKNALMDVTEKLLASKRSVAEVAFDLGVCVATVPDYIMDNEI 323
Query: 278 ELKDGLDTVYSEAAQYD-----------------------TPSNIPVKRICNAIENAPNC 314
++ + V + A+Y+ + S V E+ C
Sbjct: 324 FQQEIMMVVSTHFAEYNMGYYPPGPDQDLVQGCLIFQDTKSSSEQKVSNFLRLREDFDEC 383
Query: 315 GD--DILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDK 372
D L G + A G+ Y+ W++QSC+ ++ G
Sbjct: 384 FDMQTELPPGPNGTISASDWSGVGDGHSGYM------------WDFQSCT-LLPECGMSD 430
Query: 373 NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
SM+ P PW + ++C+ ++GV P ++ +G D+ + ++++F+NG+ D
Sbjct: 431 ASMFPPRPWTIEWETQHCQVRFGVEPNLRQLVDEFGFADLS-----NVTHLLFTNGIND- 484
Query: 433 FSRGGWVKTYHFFDLSFS-QDLNL--GSHCLDLDEAKKSDPD 471
GW DLS S + +N G+H DL S+ D
Sbjct: 485 ----GWSVASILTDLSESVKAINFVNGAHHSDLSHVDLSEHD 522
>gi|195353655|ref|XP_002043319.1| GM26836 [Drosophila sechellia]
gi|194127433|gb|EDW49476.1| GM26836 [Drosophila sechellia]
Length = 473
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 110/463 (23%), Positives = 201/463 (43%), Gaps = 62/463 (13%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+T + Q +DHF P +T+ RY++N + G +P+ +LG E I G
Sbjct: 46 QTLWIEQKLDHFD--PAETRTWQMRYMLNDALYKSG-APLFIYLGGEWEISSGRITGGHL 102
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
Y+ A + AL+ EHRYYG+S P L N + Y + Q+LAD A + IK
Sbjct: 103 YDMAKEHNALLAYTEHRYYGQSKPLPD----LSNENIK-YLSVNQSLADLAHFINTIKQN 157
Query: 172 YNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ + + VI +G SY + TWF+ YP +V G ASSAP+L + Y V +
Sbjct: 158 HEGLSESKVIIVGGSYSATMVTWFKKLYPDLVAGGWASSAPLLAK---VNFVEYKEVTGQ 214
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
C++ I+ +E++ + + G A + K C P ++L + T++SE
Sbjct: 215 SIEQMGGSACYKRIENGIAEMETMIATKRG-AEVKALLKLCEPFDVYSDLD--VWTLFSE 271
Query: 290 -----AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEA--------DSLEYDG 336
A T + ++ +C I + G + L +A +++ L YD
Sbjct: 272 ISDIFAGVVQTHNAGQIEGVCEKIMD----GSNDLIGVAGYLLDVFEESGGKCHDLSYDA 327
Query: 337 NNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN----LTKYIKNCKE 392
+ ++ + G+ + W +Q+C+E G + S + +P+ +T Y C +
Sbjct: 328 ITA-LLLDTNYNGNIMRQ-WIFQTCNE----YGWYQTSGSRAQPFGTKFPVTYYTTMCAD 381
Query: 393 QYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
YG +S + + ++GG L + N+ ++G DP+ G
Sbjct: 382 LYGSDYSNEFISNQVTITNQFFGG------LSPNVENVYLTHGQLDPWRPMG------IQ 429
Query: 446 DLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
D + + + +HC D + SD + ++ ++++ W+
Sbjct: 430 DETQATIIPEHAHCKDFNSISSSDTAEMRASKERIAELVREWV 472
>gi|194744689|ref|XP_001954825.1| GF16547 [Drosophila ananassae]
gi|190627862|gb|EDV43386.1| GF16547 [Drosophila ananassae]
Length = 480
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 73/249 (29%), Positives = 125/249 (50%), Gaps = 19/249 (7%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +D+F + +T+ RY+IN +F G SPI +LG E I+ ++ +G Y+ A +
Sbjct: 61 QKLDNFD--DSNTETYQMRYLINDEFQTDG-SPIFIYLGGEWTIEQSMVSAGHWYDMAQE 117
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATH 176
K ++V EHRYYG+S+P + + Y + QALAD A + +K + +
Sbjct: 118 HKGVLVYTEHRYYGESIP-----TTTMSTENLQYLHVKQALADVAHFITTLKSENAQLAN 172
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT- 235
+ V+ G SY + WF+ YP +V+G ASSAP+L D T Y VV + F +
Sbjct: 173 SKVVLAGGSYSATMVVWFKRLYPDLVVGGWASSAPLLAKVDFTE---YKEVVGRAFLELG 229
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK-----DGLDTVYSEA 290
++C+ I+ +E++++ + G A + C + +L + V+S
Sbjct: 230 GQQCYNRIQNGIAELESLFANKRG-AEARAMLRLCNSFDDQNDLDLWSLFGSISNVFSGI 288
Query: 291 AQYDTPSNI 299
AQY + ++I
Sbjct: 289 AQYQSGNDI 297
>gi|10140734|gb|AAG13567.1|AC073867_13 putative serine peptidase [Oryza sativa Japonica Group]
Length = 502
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 123/476 (25%), Positives = 189/476 (39%), Gaps = 70/476 (14%)
Query: 52 ETFFYNQTIDHFS---------YGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEA--- 99
E + +Q +DHFS G + ++ F QRY + + GG L G +
Sbjct: 51 EERWMDQRLDHFSPTRPRADVAGGVQDHRQFKQRYYEFADYHAGGGPVFLRICGESSCNG 110
Query: 100 -PIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL 158
P D LS +F A +V EHRYYGKS PF S +N R + +S QAL
Sbjct: 111 IPNDYLAVLS-------KKFGAAVVTPEHRYYGKSSPFES--LTTENLR---FLSSKQAL 158
Query: 159 ADYASILLHIKDKYNATHA-------PVIAIGASYGGELATWFRLKYPHVVIGSLASSAP 211
D + H ++ NA + P G SY G L+ WFRLK+PH+ GSLASS
Sbjct: 159 FDLVAFRQHYQEILNARYNRSSGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGV 218
Query: 212 ILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
+L + +T + V + Q + + E + S+ + ++K
Sbjct: 219 VL---AVYNFTDFDKQVGDSAGPECKAALQEVTRLVDEQLRLDSRSVKVLFGAEK----- 270
Query: 272 PLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADS 331
LKN + L + QY +P +C+ + NA G ++ A V +
Sbjct: 271 -LKNDGDFLFFLADAAAIGFQYGSPDA-----VCSPLINAKKTGRSLVETYAQYVQDFFI 324
Query: 332 LEYDGNNS---RCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIK 388
+ S + Y+ D S W +Q CSE+ KN + N ++
Sbjct: 325 RRWGTTVSSYDQEYLKNTTPDDTSSRLWWFQVCSEVAYFQVAPKNDSIRSTEINTGYHLD 384
Query: 389 NCKEQYGVSPRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
C+ +G P +T YYGG I + S I+F+NG +DP+ K+ +
Sbjct: 385 LCRNVFGEGVYPDVFMTNLYYGGTRI------AASKIVFTNGSQDPWRHASKQKSSKYMP 438
Query: 447 LSFSQDLNLGSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+ N G H DL D + S P + RK + W++Q
Sbjct: 439 SYIIKCRNCG-HGTDLRGCPQLPFRIEGDPSNCSSPAAVSTVRKQIASHISLWLSQ 493
>gi|323446372|gb|EGB02557.1| hypothetical protein AURANDRAFT_9133 [Aureococcus anophagefferens]
Length = 138
Score = 110 bits (274), Expect = 2e-21, Method: Composition-based stats.
Identities = 57/144 (39%), Positives = 80/144 (55%), Gaps = 6/144 (4%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF + + TF QR +++ W G PI + G E + + +G +E+A F
Sbjct: 1 LDHFDFTTNA--TFEQRVFVHADHWAPG-GPIFLYCGNEDDVTLYVNATGLMWEHAAAFG 57
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV 179
A++V +EHRYYG ++PFG AA H Y + QALAD + L IK Y A +A
Sbjct: 58 AMLVFVEHRYYGATLPFG---AASFEPEHLRYLSHEQALADLVNALRRIKATYGAENAKT 114
Query: 180 IAIGASYGGELATWFRLKYPHVVI 203
+A G SYGG LA W R+KYP V+
Sbjct: 115 VAFGGSYGGMLAAWLRMKYPAAVV 138
>gi|253743733|gb|EET00051.1| Thymus-specific serine protease precursor [Giardia intestinalis
ATCC 50581]
Length = 521
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 119/473 (25%), Positives = 194/473 (41%), Gaps = 93/473 (19%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP-----IDDNIQ 106
E +F++Q +DHF+ P + + + QRY N ++ G P+ +G E P + D
Sbjct: 56 ELWFHDQRVDHFN--PVNTKKWSQRYYYNDTYYKAGG-PVFLMIGGEGPATPRDVGDYFS 112
Query: 107 LSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILL 166
+ F A L V LEHR+YG S P ++ S QALAD A+ L
Sbjct: 113 IDYF----AKSMSGLKVALEHRFYGASFP-------STDSSDLSLLRSDQALADIATFLA 161
Query: 167 HIKDKYNATHAP-VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
++K +YN + V+A+G SY G LA W R+++P V+ +++SS P L D + Y
Sbjct: 162 YLKKEYNLPKSTKVVAVGGSYSGNLAAWARIQFPFVIDAAISSSGPYLAQTD---YPEYL 218
Query: 226 SVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKT------------CTP 272
+ R + + C I + ++ + + + A L+ FK
Sbjct: 219 QHIDAQIRKSGGDRCMDIISAAHTDAEYLLNHDK--ATLATIFKLKEESIYNSTAYDKAS 276
Query: 273 LKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIE---NAPNCGDDI---------LC 320
+T G+ Y + +K++C +IE +A + G+ L
Sbjct: 277 FMSTMGAPSGVVQYAKHDGYYTDTKDGDIKQMCKSIEAYYDAYDSGESYQQLKAYASWLL 336
Query: 321 KIAAGVVEADSLEYDGNNSRCYIN--EDRTGDES---DEGWEWQSCSEMVVPMGKDKNSM 375
G +E L +DG YI +D + D D W WQ+C E +
Sbjct: 337 DYYGGSMEEIDLSFDG-----YIKALQDTSIDSEFAVDRSWMWQTCVEFGYYQTSSSTAG 391
Query: 376 YQPEPWNLTKYIKNCKEQY---GVSPRPSWVLT-----------------YYGGHDIKLI 415
+ L +++ C + + GV+P + T YYG +IK+
Sbjct: 392 FGTMI-TLDYFLEMCYKAFFAPGVAPAGASSFTRSQSDDVVNKAVQFTNVYYGARNIKM- 449
Query: 416 LRRSTSNIIFSNGMRDPFS----RGG--WVKTYHFFDLSFSQDLNLGSHCLDL 462
SNI +NG DP+S R G W ++ + S + + GSHC DL
Sbjct: 450 -----SNIYIANGHVDPWSELSYREGETWSTGHYLHNGSTASYIPNGSHCTDL 497
>gi|195451235|ref|XP_002072826.1| GK13808 [Drosophila willistoni]
gi|194168911|gb|EDW83812.1| GK13808 [Drosophila willistoni]
Length = 445
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 77/243 (31%), Positives = 119/243 (48%), Gaps = 19/243 (7%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF + QT+ RY++N +F G SPI +LG E I+D++ +G Y+ A +
Sbjct: 26 QKLDHFD--ESNTQTYQMRYLVNDEFQEEG-SPIFIYLGGEWAIEDSMVSAGHWYDMAQE 82
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATH 176
K +++ EHRYYG+S+P + H Y + QALAD A + K + T+
Sbjct: 83 HKGVLIYTEHRYYGESIP-----TTTMSTEHLQYLHVKQALADVAHFIETYKSENSQLTN 137
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT- 235
+ V+ G SY + WF+ YP +V G ASSAP+L D T Y VV + F
Sbjct: 138 SKVLLAGGSYSATMVVWFKRLYPDLVEGGWASSAPLLAKVDFTE---YKEVVGRAFLQLG 194
Query: 236 SEECFQTIKKSWSEID-----NIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEA 290
++C+ I +E++ N A++ + L F L T L + ++S
Sbjct: 195 GQQCYNRINNGIAELESMFENNRAAEARAMLRLCSSFDDKNDLDLWT-LFGSISNIFSGV 253
Query: 291 AQY 293
AQY
Sbjct: 254 AQY 256
>gi|159117921|ref|XP_001709180.1| Thymus-specific serine protease precursor [Giardia lamblia ATCC
50803]
gi|157437295|gb|EDO81506.1| Thymus-specific serine protease precursor [Giardia lamblia ATCC
50803]
Length = 522
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 120/470 (25%), Positives = 192/470 (40%), Gaps = 87/470 (18%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP-----IDDNIQ 106
E +F Q +DHF + + + QRY N ++ G P+ +G E P + D
Sbjct: 56 ELWFREQHVDHFD--STNTKKWSQRYYYNDTYYKAGG-PVFLMIGGEGPATPRDVGDYFS 112
Query: 107 LSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILL 166
+ F A L V LEHR+YG S P N+ + S QALAD A+ L
Sbjct: 113 IDYF----AKNMNGLKVALEHRFYGASFP-------STNSANLSLLRSDQALADIATFLA 161
Query: 167 HIKDKYNATHAP-VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
++K +YN ++A+G SY G LA W R+++P ++ +++SS P L D + +
Sbjct: 162 YLKREYNLPEGTKIVAVGGSYSGNLAAWARIQFPFIIDAAISSSGPYLAQTDYPEYLQH- 220
Query: 226 SVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK----- 280
+ S+ + + C I + + + + S K K + +T K
Sbjct: 221 -IDSQVRKYGGDRCMDIISAAHKDAEYLLSHDKATLATIFKLKEESIYNSTGYDKASFMS 279
Query: 281 -----DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGD------DI------LCKIA 323
G+ Y+T + +K++C AIE + + D D+ L
Sbjct: 280 AMGAPSGVVQYAKHDGYYNTTKDGDIKQMCKAIEASYDSYDTGESYQDLKAYASWLLDYY 339
Query: 324 AGVVEADSLEYDGNNSRCYIN--EDRTGDES---DEGWEWQSCSEMVVPMGKDKNSMYQP 378
G +E L +DG YI +D + D D W WQ+C E + +
Sbjct: 340 GGSMEEIDLSFDG-----YIKAIQDTSIDSEFAVDRSWLWQTCVEFGYYQTSSPAAGFGT 394
Query: 379 EPWNLTKYIKNCKEQY---GVSP--RPSWVLT---------------YYGGHDIKLILRR 418
L +++ C + Y G +P PS+ + YYG +IK+
Sbjct: 395 M-ITLDYFLEMCYKAYFAPGATPPGAPSFTRSQSDDLVNKAVRFTNVYYGARNIKM---- 449
Query: 419 STSNIIFSNGMRDPFS----RGG--WVKTYHFFDLSFSQDLNLGSHCLDL 462
SNI +NG DP+S R G W +H + S + + GSHC DL
Sbjct: 450 --SNIYITNGHVDPWSELSYREGETWSTGHHLHNGSTTSYIPNGSHCTDL 497
>gi|323451276|gb|EGB07154.1| hypothetical protein AURANDRAFT_28266, partial [Aureococcus
anophagefferens]
Length = 311
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 84/279 (30%), Positives = 134/279 (48%), Gaps = 22/279 (7%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSG-FTYE 113
+ Q +DHF + + +TF Q+ +++ W P+L + G E I+D SG +E
Sbjct: 22 YVTQQLDHFRF--DETRTFSQKLLVHDA-WHRPGGPLLMYFGNEGAIEDFYGNSGGLMFE 78
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A + A + LEHRYYG S+PFG+ A + + QALAD A +L +
Sbjct: 79 LAPKLNASVAFLEHRYYGSSLPFGN---ASYGSDELAFLTVEQALADMALVLATSSEILG 135
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI-LYYGDITPWTTYHSVVSKDF 232
A P + G SYGG LA WF LKYPH+ G++A+SAP+ LY G+ + + +
Sbjct: 136 AADGPAVLFGGSYGGMLAAWFMLKYPHLAAGAVAASAPVDLYPGEGKERPFFDAGLEVYG 195
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKN-------TTELKDGLDT 285
S C ++ + + + A +G L++ F+TC PL + T+ + L T
Sbjct: 196 TYGSAACEADLRAALAALAAAAKTAEGRDALARSFRTCEPLPDPVDGDRLTSYVNGALST 255
Query: 286 V------YSEAAQYDTPSNIPVKRICNAIENAPNCGDDI 318
+ Y+ A P+N PV+ C + AP+ +
Sbjct: 256 LAMLDYPYASAFVAPMPAN-PVRVACGRVAAAPSAASKL 293
>gi|195109610|ref|XP_001999376.1| GI23100 [Drosophila mojavensis]
gi|193915970|gb|EDW14837.1| GI23100 [Drosophila mojavensis]
Length = 882
Score = 109 bits (273), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 141/291 (48%), Gaps = 31/291 (10%)
Query: 16 IFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQ 75
+FV T ++ P L ++R E+K + Q +D F + +T+
Sbjct: 29 VFVKT---LKDMQRGPPLKMVKRELKGEEK---------WITQPLDQFD--ETNKETYEM 74
Query: 76 RYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVP 135
RY IN +F G SPI FLG E N+ G+ Y+ A + K +++ EHRYYG SVP
Sbjct: 75 RYFINDEFQTEG-SPIFIFLGGEWEASLNMINDGYWYDLAKEHKGVLIYTEHRYYGASVP 133
Query: 136 FGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHAPVIAIGASYGGELATWF 194
++ +L++ + Y + QALAD A+ + K + +++ V+ G SY +A WF
Sbjct: 134 --TKTMSLEDLK---YLHVKQALADVANFIKTFKSENAQLSNSKVVLSGCSYSATMAVWF 188
Query: 195 RLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNI 253
+ YP +V+G+ ASSAP+ D Y VV K FR+ E+C+ I+K +E++++
Sbjct: 189 KRLYPDLVVGAWASSAPLFAKVDFYE---YKEVVGKAFRELGGEKCYNRIEKGIAELESM 245
Query: 254 ASKPDGLAILSKKFKTCTPLKNTTELK-----DGLDTVYSEAAQYDTPSNI 299
K A + C+ + +L + V+S AQY ++
Sbjct: 246 F-KNKRAAEARAMLRICSNFDHENDLDLWSLFGSISNVFSSLAQYQGAGDL 295
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 105/452 (23%), Positives = 196/452 (43%), Gaps = 40/452 (8%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
ET ++NQ++D+F + + QR +IN + G SPI +LG E ID + SG
Sbjct: 457 ETRWFNQSLDNFD--DTNKNVWDQRVLINEDNFVDG-SPIFIYLGGEWAIDPSAITSGLW 513
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+ A + +V EHR++G+S+P + KN +++ + QALAD +++ +K++
Sbjct: 514 VDIAKEHNGSLVYTEHRFFGESIPI--TPLSTKNLKYQ---SVEQALADVVNVIKVLKEE 568
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
+ V+ G SY +A WF+L YP V++GS ASSAP+ I ++ + +V +
Sbjct: 569 DKYKESKVVISGCSYSASMAVWFKLLYPDVIVGSWASSAPL---DAIVDFSDFMEIVGRA 625
Query: 232 FRDTSEE-CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-----LKDGLDT 285
+R + C+ I + S +++ G A + C E + +
Sbjct: 626 YRQLGGDYCYDLIDNATSYYEDLFQTGQG-ARAKELLNLCDSFDENNERDQWQIFSSIAN 684
Query: 286 VYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAA---GVVEADSLEYDGNNSRCY 342
+++ AQY P N + + C+ + + + L + G E + Y G Y
Sbjct: 685 IFAGIAQYQKPENYDLAQYCSVLRSFDDDDASALSQFVQWRLGKPECVNTRYQGTVDY-Y 743
Query: 343 INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN----LTKYIKNCKEQYGVSP 398
D S GW +Q+C + G ++S + P+ T Y C + +G
Sbjct: 744 KWSKNNYDGSGLGWFYQTCRQ----FGWFQSSANKNHPFGSTFPATLYTDTCHDVFGSQY 799
Query: 399 RPSWVLTYYGGHDIKLILRR-STSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGS 457
+ + Y + K + + N+ ++G D +SR G + + S
Sbjct: 800 TSAKIEEYIQATNKKYGGKHPAVENVYMTHGGLDGWSRVG---------SDSAIIIPQAS 850
Query: 458 HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
HC D +D L ++ +++++ W+
Sbjct: 851 HCSDSGSINPTDSAALRATKERLIELVREWLA 882
>gi|326520173|dbj|BAK04011.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 489
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 128/477 (26%), Positives = 196/477 (41%), Gaps = 68/477 (14%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD--NIQLSG 109
E + +Q +DHFS ++ F QRY + P+ + E+ D N L+
Sbjct: 49 EERWMSQRLDHFS--SSDHRQFKQRYFEFLDYHDDPTGPVFLRICGESSCDGIPNDYLAV 106
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
A +F A +V EHRYYGKS PF S N R + +S QAL D A + +
Sbjct: 107 I----AKKFGAAVVTPEHRYYGKSSPFDS--LTTDNLR---FLSSKQALFDLAVFRQYYQ 157
Query: 170 DKYNATHA-------PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
+K N+ + P G S G L+ WFRLK+PH+ GSLASS +L + T +
Sbjct: 158 EKLNSRYNRSAGFDNPWFVFGVSCSGALSAWFRLKFPHLTCGSLASSGVVLAVYNFTDF- 216
Query: 223 TYHSVVSKDFRDTS-EECFQTIKKSWSEIDNIASK---PDGLAILSKKFKTCTPLKNTTE 278
K D++ EC K + EI + K D ++ K LKN +
Sbjct: 217 ------DKQVGDSAGPEC----KAALQEITRLVDKQLLSDSHSV--KALFGADSLKNDGD 264
Query: 279 LKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNN 338
L + QY P +C+ + NA G+ ++ + A V+ ++ G
Sbjct: 265 FLFLLADAAATTFQYGNPDA-----LCSPLANAKKKGESLV-ETYAHFVKDYYIKKLGTT 318
Query: 339 ----SRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY 394
+ Y+ E D S W +Q CSE+ KN + + + CK Y
Sbjct: 319 VSSYDQEYLKETTPDDSSSRLWWFQVCSEVAYFQVAPKNDSVRSAQIDTRYNLDLCKNVY 378
Query: 395 GVSPRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD 452
G P +T YYGG I + S I+F+NG +DP+ K+ +
Sbjct: 379 GEGVYPDVFMTNLYYGGTSI------AASKIVFTNGSQDPWRHASKQKSSEGMPSYIIKC 432
Query: 453 LNLGSHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKA 497
N G H DL D + + P+ + RK VK + W++Q ++ +A
Sbjct: 433 SNCG-HGTDLRGCPQLPFRIEGDSSNCTSPEAVNTVRKQIVKHIDLWLSQCHEPARA 488
>gi|2961378|emb|CAA18125.1| putative protein [Arabidopsis thaliana]
gi|7270572|emb|CAB80290.1| putative protein [Arabidopsis thaliana]
Length = 852
Score = 108 bits (271), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 136/479 (28%), Positives = 192/479 (40%), Gaps = 83/479 (17%)
Query: 3 SSQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQ-TID 61
SS+ F L + +S +NGL L+PR R S L K SN F Q ++
Sbjct: 2 SSELGFALLTTFTVLLSYLSFSNGL-LQPR----RISHGLS-KSSNFCFSFVIEIQNSLQ 55
Query: 62 HFSYGPE-------SYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ SY + ++ F QRY + PI + E P N + +
Sbjct: 56 NPSYQFQLFLILFQDHRKFRQRYYEYLDHLRVPDGPIFLMICGEGPC--NGITNNYISVL 113
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS----------- 163
A +F A IV LEHRYYGKS PF S A KN + Y +S QAL+D A+
Sbjct: 114 AKKFDAGIVSLEHRYYGKSSPFKS--LATKNLK---YLSSKQALSDLATFRQYYQATCFG 168
Query: 164 ILLHIKDKYNA-------THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYG 216
I L ++D N P G SY G L+ WFRLK+PH+ GSLASSA +
Sbjct: 169 ICLWMQDSLNVKFNRSSNVENPWFFFGVSYSGALSAWFRLKFPHLTCGSLASSAVV---R 225
Query: 217 DITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNT 276
+ + + +++ E Q K GL + ++ K L N
Sbjct: 226 AVYEFPEFDQQIAESAGPECETALQETNKLLEL---------GLKVNNRAVKA---LFNA 273
Query: 277 TELKDGLDTVYSEA-----------AQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAG 325
TEL D +Y A QY P + V +E N GD L + A
Sbjct: 274 TELDVDADFLYLIADAGVMAMFIWQIQYGNPDKLCVP----LVEAQKNGGD--LVEAYAK 327
Query: 326 VVEADSLEYDGNNSRCY-----INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEP 380
V + G +S+ Y ++ T + +D W +Q C+E+ N +
Sbjct: 328 YVREFCMGVFGQSSKTYSRKHLLDTAVTLESADRLWWFQVCTEVAYFQVAPANDSIRSHQ 387
Query: 381 WNLTKYIKNCKEQY--GVSPRPSWVLTYYGGHDIK-----LILRRSTSNIIFSNGMRDP 432
N ++ CK + GV P YYG I LIL + IIF+NG +DP
Sbjct: 388 INTEYHLDLCKSLFGKGVYPEVDATNLYYGSDKIAVSSHLLILNFVATKIIFTNGSQDP 446
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 79/158 (50%), Gaps = 20/158 (12%)
Query: 68 ESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEH 127
+ ++ F QRY + PI + E P N + + A +F A IV LEH
Sbjct: 500 KDHREFKQRYYEYLDHLRVPDGPIFMMICGEGPC--NGIPNDYITVLAKKFDAGIVSLEH 557
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA----------SILLHIKDKYNAT-- 175
RYYGKS PF S A +N + Y +S QAL D A S++ + K+N +
Sbjct: 558 RYYGKSSPFKS--LATENLK---YLSSKQALFDLAAFRQYYQASTSLMDSLNVKFNRSGD 612
Query: 176 -HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
P GASY G L+ WFRLK+PH+ GSLASSA +
Sbjct: 613 VENPWFFFGASYSGALSAWFRLKFPHLTCGSLASSAVV 650
>gi|357628484|gb|EHJ77798.1| hypothetical protein KGM_21289 [Danaus plexippus]
Length = 499
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 132/509 (25%), Positives = 220/509 (43%), Gaps = 69/509 (13%)
Query: 10 WLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPES 69
WLLL + V+++F L+ P R S + + + ++HF +
Sbjct: 5 WLLLCL--VTSAFSLKTLEPPPPEASARSSTNITEG---------WLPVRLNHFD--ASN 51
Query: 70 YQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRY 129
TF RY NS+F G I+ F+G E I SG YE A + A + EHRY
Sbjct: 52 TDTFQMRYYYNSQFSRGPY--IVIFVGGEWSISPGWVRSGLAYELAERIGAGLFYTEHRY 109
Query: 130 YGKSVPF-GSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNA---THAPVIAIGA 184
YG + P G+ A ++ Y + QAL D A + +++ D + +A V G
Sbjct: 110 YGLTRPTNGTTVAEMR------YLSVDQALGDLAQFIEYVRSDDFEGGRFRNARVALFGC 163
Query: 185 SYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR-DTSEECFQTI 243
SY G +ATW +L YPH+V SL+ S P+ D + Y V++ R S++C I
Sbjct: 164 SYAGSMATWMKLGYPHLVRTSLSDSGPLHAQQD---FPEYLEVIATALRVQGSQQCVDDI 220
Query: 244 KKSWSEIDNIASKPDGLAILSKKFKTCTPLKN-----TTELKDGLDTVYSEAAQYDTPSN 298
+ + I+ + GL +S F TC+ L+ +T G+ ++ QY TP +
Sbjct: 221 ESAMKRINELIETEAGLDTVSTLFNTCSRLRRSHLDLSTFFWYGITETFAYLVQYATPGD 280
Query: 299 IPVKRICNAIENAPNCGDDILCKIAAGVVEA-------DSLEYDGNNSRCYINEDRTGDE 351
IP R C+ I N GD I ++++ V +S ++ S + D + D
Sbjct: 281 IP--RACDHITNK-TLGDPIE-RLSSWVTSQPYTQPCIESRYFEKVASHTNTSYD-SPDA 335
Query: 352 SDEGWEWQSCSEMVVPMGKDKNSMYQPEPW----NLTKYIKNCKEQYGVS-----PRPSW 402
+ W +Q+C+E G + + +P+ L + + CK+ + S R +
Sbjct: 336 TMRLWTYQTCTE----YGWYQTTTSSRQPFLNTVPLEYFHQMCKDFFNDSIDENLLRSAI 391
Query: 403 VLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL 460
V T + G L ++ G DP+S G KT H L+ ++ SHC
Sbjct: 392 VRTNRLFAG------LEHLPDGVLSVGGGHDPWSPVGPNKT-HETHLAPVYVVDGVSHCR 444
Query: 461 DLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
+ S+ + L +++ + M+G +T
Sbjct: 445 AIRPTGSSETEQLEITKQSSLLFMEGLMT 473
>gi|302796113|ref|XP_002979819.1| hypothetical protein SELMODRAFT_111339 [Selaginella moellendorffii]
gi|300152579|gb|EFJ19221.1| hypothetical protein SELMODRAFT_111339 [Selaginella moellendorffii]
Length = 472
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 119/442 (26%), Positives = 189/442 (42%), Gaps = 72/442 (16%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
LL+ SFHA + R L D++ +Y QT+DH Y +
Sbjct: 8 FLLVSACAGVSFHAT----------LHRGDALLTTDAH------WYTQTLDH--YATQDD 49
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+TF QRY + ++ N P+ + E P IQ + ++ A +F A IV LEHRYY
Sbjct: 50 RTFSQRYYEFTDYFDAPNGPVFLKICPEGPCV-GIQ-NDYSAVLAKRFGAAIVSLEHRYY 107
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-----ATHAPVIAIGAS 185
G+S PF + + Y +S QAL D A+ + +D N + P I IG S
Sbjct: 108 GQSSPFKTHAT-----ENLIYLSSKQALFDLAAFREYYQDLINHRTNSTSDNPWIVIGWS 162
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
Y G L+ WF+LK+PH+ +GS+ASS + D+ + +E T
Sbjct: 163 YAGALSAWFKLKFPHLAVGSVASSGIVQAIFDV-----------RIHLQVAESAGATCSA 211
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY------SEAAQYDTPSNI 299
+ + +A + L K + L N +L D +Y + A QY P
Sbjct: 212 ALRAVTRLAEQG-----LKKNSMSTKALFNAEQLDVDGDFLYLLADAATTAFQYGNP--- 263
Query: 300 PVKRICNAIENAPNCGDDILCKIAAGVVE--ADSLEYDGNN-SRCYINED-RTGDESDE- 354
+ +C+ + A +D+L A V + D+L+ N + ++ E+ GD S +
Sbjct: 264 --EILCSPLVAAYKRNEDLLAVYAKYVKDYYIDTLKSSINTYDQKHLKENLAAGDHSSDR 321
Query: 355 -GWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWVLTYYGGHD 411
GW +Q C+E+ NS + N+ ++ C + G P YYGG+
Sbjct: 322 LGW-YQMCTELGYFQVAPANSSIRSALINVKYHLDLCSNVFENGTFPEVDNTNLYYGGNK 380
Query: 412 IKLILRRSTSNIIFSNGMRDPF 433
I+ I+F NG +DP+
Sbjct: 381 IR------GDKILFMNGSQDPW 396
>gi|66801433|ref|XP_629642.1| hypothetical protein DDB_G0292476 [Dictyostelium discoideum AX4]
gi|60463021|gb|EAL61217.1| hypothetical protein DDB_G0292476 [Dictyostelium discoideum AX4]
Length = 485
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 118/445 (26%), Positives = 187/445 (42%), Gaps = 45/445 (10%)
Query: 41 ILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP 100
I+++ H + Q +DHF+ + +TF QR+V+NSK+W G P+ + E
Sbjct: 36 IVDESIQIHEIVYQLFVQKVDHFNLLDD--RTFFQRFVVNSKYWNG-TGPVFFIISGEQN 92
Query: 101 IDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALAD 160
++ + S A Q ALIV LEHRYYG GS + + Y + QALAD
Sbjct: 93 MEASSVNSCQYTIWAKQLNALIVSLEHRYYG-----GSYVTEDLSTDNLKYLTTQQALAD 147
Query: 161 YASILLHIKDKYNATHAP----VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYG 216
+ Y H P +I+ G SY G L+ + +KYP + S+ASSAP+
Sbjct: 148 CVVFIDWFTKVY--YHVPSSSKIISFGGSYAGTLSAYLAMKYPSKISFSVASSAPL---N 202
Query: 217 DITPWTTYHSVVSKD--FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK 274
+ + Y V+ K + E+C IK + ++I + P ++K F C+ +
Sbjct: 203 PVVNFYQYMEVIQKSILLLNNGEKCLNNIKLANNKIIEMIHDPILTYNITKLFGLCSNID 262
Query: 275 NTTELKD---GLDTVYSEAAQYD--TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEA 329
+L + V+ AAQY P I + +CN + + D I G+ +
Sbjct: 263 FDNDLSTFMFEIANVWGTAAQYGNLVPGYISLDSLCNIMVDDSKEPLDNYLYIWYGMKNS 322
Query: 330 DS---LEYD---GNNSRCYINEDRTGDE---SDEGWEWQSCSEMVVPMGKDKNSMYQP-E 379
D + Y N I+ T +E W +Q C+E + D S QP
Sbjct: 323 DECNDVTYQTMIANFKYSQIDHLNTRNELFNMTRQWLFQCCTEFGFFITSD--SYDQPFT 380
Query: 380 PWNLTKYIKNCKEQYGVSP--RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
+N + C + +G P SW L YGG S N++F + DP+S
Sbjct: 381 NFNFNFQRQICIDVFGKKPTLSTSWTLVEYGGISPN---YNSVRNVLFVSSTNDPWSSLS 437
Query: 438 WVKTYHFFDLSFSQDLNLGSHCLDL 462
K+ + + G+HC D+
Sbjct: 438 ISKSNQYKIVIVEN----GTHCSDM 458
>gi|125778536|ref|XP_001360026.1| GA14959 [Drosophila pseudoobscura pseudoobscura]
gi|195158020|ref|XP_002019892.1| GL12647 [Drosophila persimilis]
gi|54639776|gb|EAL29178.1| GA14959 [Drosophila pseudoobscura pseudoobscura]
gi|194116483|gb|EDW38526.1| GL12647 [Drosophila persimilis]
Length = 479
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 78/276 (28%), Positives = 133/276 (48%), Gaps = 29/276 (10%)
Query: 31 PRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP 90
P+ ++R+ + E+ + Q +D+F ++ +T+ RY++N +F G SP
Sbjct: 43 PKQTVLKRANVQEK----------WITQKLDNFD--DDNKETYEMRYLVNDEFQEEG-SP 89
Query: 91 ILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRG 150
I +LG E I+ ++ +G Y+ A Q K ++V EHRYYG+SVP + A +
Sbjct: 90 IFIYLGGEWEIEASMVSAGHWYDLAEQHKGVLVYTEHRYYGESVPTSTMSTA-----NLK 144
Query: 151 YFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASS 209
Y + QALAD A + K ++ ++ V+ G SY + WF+ YP +V G ASS
Sbjct: 145 YLHVKQALADVAEFIKSFKAEHPQLANSKVVLAGGSYSATMVVWFKRLYPDLVDGGWASS 204
Query: 210 APILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFK 268
APIL +T Y VV + F ++C+ I+ +E++++ G A +
Sbjct: 205 APIL---AKVAFTEYKEVVGQAFLQLGGQKCYDRIQNGIAELESMFDNKRG-AEARAMLR 260
Query: 269 TCTPLKNTTELK-----DGLDTVYSEAAQYDTPSNI 299
C + +L + V++ AQY P +I
Sbjct: 261 LCNSFDDKNDLDIWSLFGSISNVFAGTAQYQRPGDI 296
>gi|91806190|gb|ABE65823.1| serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 417
Score = 108 bits (270), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 113/408 (27%), Positives = 174/408 (42%), Gaps = 64/408 (15%)
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A +F+A +V LEHRYYGKS PF S A +N + Y +S QAL D AS + ++ N
Sbjct: 33 AKKFQAGVVSLEHRYYGKSSPFNS--LATENLK---YLSSKQALYDLASFRQYYQESLNK 87
Query: 174 -------ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHS 226
+ P G SY G L+ WFRLK+PH+ GSLASSA + + + +
Sbjct: 88 KLNISSGGSDNPWFFFGISYSGALSAWFRLKFPHLTCGSLASSAVVRAIYEFSEFD---- 143
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
+ E Q K + E + + GL + +K K+ L N TEL D +
Sbjct: 144 ------QQIGESAGQECKLALQETNKLLEL--GLKVKNKAVKS---LFNATELDVDADFL 192
Query: 287 Y--SEAA----QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEAD----SLEYDG 336
Y ++AA QY P ++C + A G D++ + V E L
Sbjct: 193 YLTADAAVMAFQYGNPD-----KLCVPLVEAKKNGSDLVVTYSTYVREYCMRIWGLRVRT 247
Query: 337 NNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG- 395
N + N T D + W +Q+C+E+ K + N T ++ CK +G
Sbjct: 248 YNRKHLRNTVVTADSAYRLWWFQACTELGYFQVAPKYDSVRSHQINTTFHLDLCKSLFGK 307
Query: 396 -VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLN 454
V P+ YYGG R + + IIF+NG DP+ + H + N
Sbjct: 308 DVYPKVDATNLYYGGD------RLAATKIIFTNGSEDPWRHASKQNSTHEMPSYIIKCRN 361
Query: 455 LGSHCLDLDEAKKSD------------PDWLVQQRKTEVKIMQGWITQ 490
G H D+ +S PD++ + R+ V+ + W+++
Sbjct: 362 CG-HGSDIRGCPQSPMVIEGKSNNCSLPDYVNKVRQQMVEHIDLWLSE 408
>gi|194899986|ref|XP_001979538.1| GG23239 [Drosophila erecta]
gi|190651241|gb|EDV48496.1| GG23239 [Drosophila erecta]
Length = 473
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 108/457 (23%), Positives = 191/457 (41%), Gaps = 50/457 (10%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+T + Q +DHF P +T+ RY++N + G +P+ +LG E I G
Sbjct: 46 QTLWIEQKLDHFD--PAETRTWQMRYMLNDALYKSG-APLFIYLGGEWEISAGRITGGHL 102
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
Y+ A + AL+ EHRYYG+S P L N + Y + Q+LAD A + IK
Sbjct: 103 YDMAKEHSALLAYTEHRYYGQSKPLPD----LSNENIK-YLSVNQSLADLAYFINTIKQN 157
Query: 172 YNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ + + VI +G SY + TWF+ YP +V G ASSAP+ + Y V +
Sbjct: 158 HEGLSESKVIIVGGSYSATMVTWFKKLYPDLVAGGWASSAPLF---AKVNFVEYKEVTGQ 214
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLD--TVY 287
C++ I+ +E++++ + G A + K C P ++ + LD T++
Sbjct: 215 SIEQMGGSACYKRIENGIAEMESMIATKRG-AEVKALLKLCEPF----DVYNDLDVWTLF 269
Query: 288 SE-----AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADS----LEYDGNN 338
SE A T + ++ +C I N + + + E+D+ L YD
Sbjct: 270 SEISDIFAGVVQTHNAGQIEGVCQVIMAGSNDLNGVARYLLDVFEESDAQCYDLSYDAIT 329
Query: 339 SRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG--- 395
+ + + + W +Q+C+E D + + + Y C + YG
Sbjct: 330 T--LLLDTSYSNNIMRQWIFQTCNEYGWYQTSDSAAQPFGTKFPVVYYTTMCADLYGSQY 387
Query: 396 ----VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
+S + Y+GG L N+ ++G DP+ G D + +
Sbjct: 388 SNEFISNQVVITNQYFGG------LSPGVENVYLTHGQLDPWRAMG------IQDEAQAT 435
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
L +HC D + SD + ++ ++++ W+
Sbjct: 436 ILPEYAHCKDFNSISSSDTAEMKASKERIAELVREWV 472
>gi|302790231|ref|XP_002976883.1| hypothetical protein SELMODRAFT_106030 [Selaginella moellendorffii]
gi|300155361|gb|EFJ21993.1| hypothetical protein SELMODRAFT_106030 [Selaginella moellendorffii]
Length = 393
Score = 108 bits (269), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 114/408 (27%), Positives = 169/408 (41%), Gaps = 38/408 (9%)
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+ F QRY ++ PI L E+ Q + A A++V +EHRYY
Sbjct: 3 RVFSQRYFEFLDYFQPQQGPIFLALCGESTCGGGYQRTAQAL--AKSLGAVVVTIEHRYY 60
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA-----PVIAIGAS 185
G+S PF + + KN + Y + QAL DYA + + ++ N + P I +G S
Sbjct: 61 GQSYPF--QNFSYKNLK---YLTTQQALYDYALFIDYYENLVNLQYNKQGKNPWIVVGGS 115
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
Y G L+ WFRLK+PH+V+ S ASS + + ++ Y V EC KK
Sbjct: 116 YAGALSAWFRLKFPHLVVASWASSGVV---EAVLEYSAYDEQVGI---SVGPEC----KK 165
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRIC 305
+ EI +A +GL + K+ + + D L V AA + I + +C
Sbjct: 166 ALQEITKLAE--EGLVTNATAVKSVFFAQKLRD-DDFLSLVADIAAGFVQYGAIDM--LC 220
Query: 306 NAIENAPNCGDDILCKIA--AGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSE 363
+ A D+L A GV + S YD R + + W +Q C+E
Sbjct: 221 PPLLEAIQNKTDLLMAYARIGGVDSSSSDSYDAYELRRQAEANDISAKDTMSWNYQICTE 280
Query: 364 MVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVS--PRPSWVLTYYGGHDIKLILRRSTS 421
+ N + NL YI C +G + P S YGG DI ++S
Sbjct: 281 LAYFQVAPTNDSIRSSRINLQYYIDICAVLFGPNTFPDVSAANWNYGGRDI------ASS 334
Query: 422 NIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSD 469
IIF NG +DP+ T+ + SF +HC DL SD
Sbjct: 335 RIIFLNGSQDPWQHAS-KTTFSPGEPSFVLTCESCAHCEDLSMDCSSD 381
>gi|194744691|ref|XP_001954826.1| GF16546 [Drosophila ananassae]
gi|190627863|gb|EDV43387.1| GF16546 [Drosophila ananassae]
Length = 473
Score = 107 bits (268), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 109/466 (23%), Positives = 195/466 (41%), Gaps = 68/466 (14%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
ET + Q +DHF +T+ RY++N + G P+ +LG E I G
Sbjct: 46 ETLWIEQKLDHFD--EAETRTWQMRYMLNDAVYQSGG-PLFIYLGGEWEISSGRITGGHM 102
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
Y+ A + L+ EHRYYG+S P L N + Y Q+LAD A + IK
Sbjct: 103 YDMAKEHNGLLAYTEHRYYGQSKPL----PDLSNENIK-YLTVNQSLADLAHFITSIKAT 157
Query: 172 YNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ + + VI +G SY + TWF+ YP +V G ASSAP+ + Y + +
Sbjct: 158 HEGLSESKVIIVGGSYSATMVTWFKKTYPDLVAGGWASSAPLFAK---VNFVEYKEITGQ 214
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
C++ I+ +E++++ + G + K C P +++L + T++SE
Sbjct: 215 SIELMGGSACYKRIENGIAEMEDMFATKRGGEV-KALLKLCEPFDVSSDLD--IWTLFSE 271
Query: 290 -----AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY-- 342
A T + ++ +C I G L +A+ +++ E+ + +CY
Sbjct: 272 ISDIFAGVVQTHNAGQIEGVCQQI----MAGSSDLIGVASYLLD----EFAESGGKCYDL 323
Query: 343 ---------INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN----LTKYIKN 389
++ + G+ + W +Q+C+E G + S +P+ +T Y
Sbjct: 324 SYDGITGVLLDTNYNGNIMRQ-WIFQTCNE----YGWYQTSGSADQPFGTKFPVTYYTTM 378
Query: 390 CKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
C + YG ++ + S YYGG L N+ ++G DP+ G
Sbjct: 379 CADLYGSKYSNEFITNQVSTTNAYYGG------LSPGVENVYLTHGQLDPWRAMG----- 427
Query: 443 HFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
D + L +HC D + SD + ++ ++++ W+
Sbjct: 428 -IQDADQATILPEYAHCKDFNSISSSDSAEMKASKERIAELVREWV 472
>gi|297832460|ref|XP_002884112.1| hypothetical protein ARALYDRAFT_480713 [Arabidopsis lyrata subsp.
lyrata]
gi|297329952|gb|EFH60371.1| hypothetical protein ARALYDRAFT_480713 [Arabidopsis lyrata subsp.
lyrata]
Length = 417
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 111/409 (27%), Positives = 171/409 (41%), Gaps = 66/409 (16%)
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A +F+A +V LEHRYYGKS PF S A +N + Y +S QAL D A+ + ++ N
Sbjct: 33 AKKFQAGVVSLEHRYYGKSSPFNS--LATENLK---YLSSKQALFDLAAFRQYYQESLNV 87
Query: 174 -------ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHS 226
P G SY G L+ WFRLK+PH+ GSLASSA + + T +
Sbjct: 88 KLNMSNGGNENPWFFFGISYSGALSAWFRLKFPHLTCGSLASSAVVRAVYEFTEFD---- 143
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
+ E Q K + E + + L + K L N TEL D +
Sbjct: 144 ------QQIGESAGQECKGALQETNKLLELG-----LKENRKAVKSLFNATELDVDADFL 192
Query: 287 Y--SEAA----QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSR 340
Y ++AA QY P ++C + A G D++ + + V + + G R
Sbjct: 193 YLTADAAVMAFQYGNPD-----KLCVPLVEAKKNGSDLV-ETYSKYVREYCMRFWGLRVR 246
Query: 341 CY-----INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY- 394
Y N T D + W +Q C+E+ KN + + N ++ CK +
Sbjct: 247 TYNRKHLRNTVVTADSAYRLWWFQVCTELGYFQVAPKNDSVRSQQINTMFHLDLCKSLFG 306
Query: 395 -GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL 453
GV P+ YYGG R + + IIF+NG DP+ + H +
Sbjct: 307 EGVYPKVDATNLYYGGD------RLTATKIIFTNGSEDPWRHASKQNSSHEMPSYIIKCR 360
Query: 454 NLGSHCLDLDEAKKSD------------PDWLVQQRKTEVKIMQGWITQ 490
N G H D+ +S PD++ + R+ V+ + W+++
Sbjct: 361 NCG-HGTDIRGCPQSPMVIEGKSNNCSLPDYVNKVRQQMVEHIDLWLSE 408
>gi|195391898|ref|XP_002054596.1| GJ22720 [Drosophila virilis]
gi|194152682|gb|EDW68116.1| GJ22720 [Drosophila virilis]
Length = 487
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 112/470 (23%), Positives = 197/470 (41%), Gaps = 73/470 (15%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
ET + +Q +D+F+ E + + R +IN ++ G SPI +LG E I+ + SG
Sbjct: 59 ETRWISQKLDNFNVSNE--EVWDDRVLINEDYFVDG-SPIFIYLGGEWKIEPSAITSGLW 115
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+ A + +V EHR++G+S+P A + Y N QALAD +++ +K +
Sbjct: 116 VDIAREHNGSLVYTEHRFFGESIPIKPLSTA-----NLKYQNVEQALADVVNVINVLKKE 170
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
+ V+ G SY +A W +L YP V++GS ASSAP+ D + Y VV K
Sbjct: 171 DKYKDSKVVISGCSYSATMAVWLKLLYPDVIVGSWASSAPLEAKVD---FKDYMKVVGKA 227
Query: 232 FRDTSEE-CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-----LKDGLDT 285
+R+ + C+ I + S+ + + + +G K C E + +
Sbjct: 228 YRELGGDYCYNIIDNATSQYEQLFASGNGTE-AKKILNLCDDFDENDEQDQWQIFSTIAN 286
Query: 286 VYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINE 345
V++ AQY P N + + C+ + + DD V + +++ N C +N
Sbjct: 287 VFAGIAQYQKPENYDLAQYCSVLRSFDP--DD-------AVAVSKFVQWRLNYPAC-VNT 336
Query: 346 DRTG------------DESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN----LTKYIKN 389
G D S W +Q+C E G ++S + +P+ T Y
Sbjct: 337 RYKGTVAYYKWSMDNYDGSGLAWFYQTCRE----FGWFQSSGSKSQPFGSSFPATLYTDT 392
Query: 390 CKEQYGVSPRPSWVLTY-------YGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTY 442
C + +G + + Y YGG + + N+ + G DP+S+ G
Sbjct: 393 CHDVFGSGYSSARIERYIRATNKKYGG------VNPAVENVYMTQGGLDPWSKVG----- 441
Query: 443 HFFDLSFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
+Q+ + SHC D +D L ++ K+++ W+
Sbjct: 442 ----AGLAQNATIIPQASHCSDSGSISATDSPGLRAAKERLAKLVREWLA 487
>gi|195451231|ref|XP_002072825.1| GK13807 [Drosophila willistoni]
gi|194168910|gb|EDW83811.1| GK13807 [Drosophila willistoni]
Length = 481
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 113/460 (24%), Positives = 191/460 (41%), Gaps = 53/460 (11%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
ET + Q +DHF+ E +T+ RY++N + G P+ FLG E I SG
Sbjct: 51 ETLWIEQKLDHFN--DEDTRTWQMRYMLNEALYESGG-PLFIFLGGEWEISTGRITSGHM 107
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
Y+ A + K L+ EHR+YG+S P + Y + QALAD A + K
Sbjct: 108 YDMAKEHKGLLAYTEHRFYGESKPLDDL-----SVESLEYLSVKQALADLAHFIRTQKAN 162
Query: 172 YNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
Y + VI +G SY + WF+ YP +V G +SSAP+ Y + + Y + +
Sbjct: 163 YAGLADSKVIIVGGSYSASMVVWFKRTYPDLVAGGWSSSAPL--YAKVN-FVEYKEITGQ 219
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
C+ I+K SE++ + + G A + K C P ++L + T++SE
Sbjct: 220 SIAQVGGSACYNRIEKGISELEQLLADKRG-AEVKALLKLCEPFDVNSDLD--VWTLFSE 276
Query: 290 -----AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA--AGVVEADSLEYDGNNSRCY 342
A T + ++ +C+ I + P DD++ + GV E + + + +
Sbjct: 277 ISDIFAGVVQTHNAGQIEGVCDKILSEP---DDLIGVTSYLLGVFEQGGGKCNDLSYKAI 333
Query: 343 INE----DRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN----LTKYIKNCKEQY 394
++E TG+ + W +Q+C+E G + S +P+ LT Y C + Y
Sbjct: 334 LSELLETKYTGNIMRQ-WIYQTCNE----YGWYQTSGSSNQPFGTKFPLTLYTTMCADIY 388
Query: 395 GVS------PRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
G W Y G L +I ++G DP+ G +
Sbjct: 389 GEKFSNEFITNQVWDTNEYFGR-----LEPGVYDIHITHGQLDPWRAMGIQDESLELVAT 443
Query: 449 FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
D +HC D SD + + + ++++ W+
Sbjct: 444 VIPDY---AHCKDFGSISDSDSEEMRDSKLAVAELVRKWV 480
>gi|195109614|ref|XP_001999378.1| GI23097 [Drosophila mojavensis]
gi|193915972|gb|EDW14839.1| GI23097 [Drosophila mojavensis]
Length = 691
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 111/460 (24%), Positives = 189/460 (41%), Gaps = 56/460 (12%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+T + Q +DHF+ +T+ RY++N F+ G P+ +LG E I G
Sbjct: 265 QTLWIEQKLDHFN--DSETRTWQMRYLLNDVFFKAGG-PMFIYLGGEWAISKGRISEGHM 321
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
Y+ A + L+ EHRYYG+S P L N + + + QALAD A + K
Sbjct: 322 YDMAKEHNGLLAYTEHRYYGESHPLPD----LSNDSLQ-FLHVKQALADLAHFIKTQKAS 376
Query: 172 YNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTY--HSVV 228
Y + + VI +G SY + TWF+ YP +V G ASSAP+ D + S+V
Sbjct: 377 YKGLSDSKVIIVGGSYSAAMVTWFKRTYPDLVAGGWASSAPLFAKVDFVEYKEIAGQSIV 436
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK-----DGL 283
D C+ I+K +E++ + + G + K C P ++L +
Sbjct: 437 LMGGSD----CYNRIQKGIAEMEAMFANKRGSEV-KALLKLCEPFDVYSDLDVWNLFSEI 491
Query: 284 DTVYSEAAQYDTPSNIPVKRICNAI--ENAPNCGDD--ILCKIAAGVVEADSLEYDGNNS 339
++S Q T + ++ +C I E + G +L + + + L Y N
Sbjct: 492 SDIFSGVVQ--THNTGQIEGVCQKIMAEGSDLVGLSKFLLSEFGESTSKCNDLSY--NAM 547
Query: 340 RCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN----LTKYIKNCKEQYG 395
+++ R W +Q+C+E G + S +P+ +T Y C + YG
Sbjct: 548 IDTLSDTRYSGSVRRQWLFQTCNE----YGWYQTSGSASQPFGTKFPVTFYTTMCADLYG 603
Query: 396 -------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+ R + Y+GG L N+ FS+G DP+ G D
Sbjct: 604 PQFSKSFIEARAAETNEYFGG------LTPKVENVYFSHGQLDPWRAMG------IQDEK 651
Query: 449 FSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ + +HC DL D + ++ ++++ W+
Sbjct: 652 QATIIPKHAHCADLGSISIDDTAEMRASKERVAELVREWL 691
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/163 (34%), Positives = 81/163 (49%), Gaps = 9/163 (5%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+T + Q +DHF+ +T+ RY++N F+ G P+ +LG E I G
Sbjct: 49 QTLWIEQKLDHFN--DSETRTWQMRYMLNDVFFKAGG-PMFIYLGGEWAISKGRISEGHM 105
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
Y+ A + L+ EHRYYG+S P L N R Y + QALAD A + K
Sbjct: 106 YDMAKEHNGLLAYTEHRYYGESHPLPD----LSNENLR-YLHVKQALADLAHFITTQKAS 160
Query: 172 YNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL 213
Y + + VI +G SY + TWF+ YP +V G ASSAP+
Sbjct: 161 YEGLSDSKVIIVGGSYSATMVTWFKRTYPDLVAGGWASSAPLF 203
>gi|341879672|gb|EGT35607.1| hypothetical protein CAEBREN_08421 [Caenorhabditis brenneri]
Length = 1088
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 122/502 (24%), Positives = 194/502 (38%), Gaps = 73/502 (14%)
Query: 30 RPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS 89
RP G + E GFE + Q +DHF+ ++ F Q++ N++ W
Sbjct: 558 RPPHGFLPEPDFNEAASMPSGFEEGTFRQRVDHFN--NQNANFFQQKFYKNAQ-WAQPGG 614
Query: 90 PILAFLGAEAPIDDNIQLS-GFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNAR 147
P +G E P L+ TY A ++ A + +LEHR+YG S+ + +
Sbjct: 615 PNFLMIGGEGPESSRWVLNENITYLTWAKKYGATVYLLEHRFYGDSL--------VGDNN 666
Query: 148 HRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLA 207
NS Q L D A + + K T P I G SY G ++ W R +P +V+G++A
Sbjct: 667 DLNTLNSLQMLYDLAEFIKSVNLK-TGTSNPWITFGGSYSGAMSAWMREVFPDMVVGAVA 725
Query: 208 SSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKF 267
SS P+ D + Y VV R C I+ ++ + + +G LS F
Sbjct: 726 SSGPVFAKTD---FYEYLMVVENSIRTYDRTCADRIQSGFNTMRTMFLTKEGRQNLSDIF 782
Query: 268 KTCTPL-KNTTELKDG--LDTVYSE---AAQYDTPSNIP------VKRICNAIENAPNCG 315
+ P N T++ VY A QY + P + +C + N N
Sbjct: 783 QLLPPFGDNVTDIDQHYFFSNVYGNFQGAVQYSGDNTGPYANGYGIPDMCKIMTNDTNTP 842
Query: 316 DDILCKIAA-----GVVEADSLEYDG--NNSRCYI----NEDRTGDESDEG--WEWQSCS 362
L I A V + Y G NN + I N + G E+ G W WQ+C+
Sbjct: 843 ---LNNIVAFNQFMTVFYNGGVPYSGMDNNYQDLIDYLQNAQQFGPEAGAGLLWMWQTCT 899
Query: 363 EMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLI 415
E D + P + Y++ C + + + S+ YG D
Sbjct: 900 EFGYFQSADTGNGIFGSPTPVNMYVQMCMDVFNSYYQRGTIDSSVSYTNYKYGSRD---- 955
Query: 416 LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD-------LNLGSHCLDLDEAKKS 468
+ +N++ NG DP +H L +QD +N +HC D+ A+ +
Sbjct: 956 -QYRGTNVVLPNGNVDP---------WHALGLYGAQDSSVVTYLINGTAHCADMYPARDA 1005
Query: 469 DPDWLVQQRKTEVKIMQGWITQ 490
D L R + + W+ Q
Sbjct: 1006 DAPGLKIVRDLIDQNIAIWLNQ 1027
Score = 63.2 bits (152), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 89/373 (23%), Positives = 149/373 (39%), Gaps = 68/373 (18%)
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSR--KAALKNARHRGYFNSAQALADYASILLHIKD 170
+ A QF A + LEHR+YG+S P + A+L + NS QA+ D + +
Sbjct: 105 KTARQFGATVFSLEHRFYGQSRPNFDKFDSASLT------HLNSFQAIQDILHFIRFANN 158
Query: 171 KYNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
K+ I GA YGG +A R P +V G +ASSAP+ + D W V
Sbjct: 159 KFQLDPDVRWILWGAGYGGIIAAEARKWDPKLVAGVVASSAPLTHKYDF--WEFNDQVAI 216
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLD----- 284
+ + C+Q + + +++I P G A +S F L T + +
Sbjct: 217 ILSQVGGQLCYQKVAQGFADIGQAMRTPQGRANVSDLFGLVPRLDQTNLNYNDVQMFWMS 276
Query: 285 --TVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDG------ 336
+ + AQY+ N+ + +C I+ + +++ + V + +L DG
Sbjct: 277 VISPFQTLAQYNNDFNLSIGDMCTNIDKSNWTNMEVVYQTY--VYLSRTLN-DGQVLPLV 333
Query: 337 NNSRCYINEDRTGDESDEG-------WEWQSCSEM----------------VVPMGKDKN 373
+ + IN+ G++S + W++Q C+E VVP N
Sbjct: 334 SGYQDVIND--LGNQSPDSPYLDNRMWQYQMCTEFGWFYTTNNNEQTLFGAVVPTSLFLN 391
Query: 374 SMYQ--PEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRD 431
+ P + I++ + Y YG D S +N++F+NG D
Sbjct: 392 LCFDLFPGAQLTSTSIRDIVDDYN---------RLYGAAD-----DYSGTNVVFTNGWYD 437
Query: 432 PFSRGGWVKTYHF 444
P+SR G T F
Sbjct: 438 PWSRLGKESTADF 450
>gi|384249050|gb|EIE22532.1| hypothetical protein COCSUDRAFT_16137, partial [Coccomyxa
subellipsoidea C-169]
Length = 291
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/289 (28%), Positives = 129/289 (44%), Gaps = 43/289 (14%)
Query: 96 GAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSA 155
G E ++ + +GF +E + AL++ EHRYYG S P G R Y +
Sbjct: 2 GNEDAVEVYVNFTGFMWELGREMNALLIFAEHRYYGDSQPLGPSSLD----RDPSYLSIE 57
Query: 156 QALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYY 215
QALAD+A+++ H+K+K+ A +PVIA G SYGG LA W R KYP+ V G++A SAP+ Y
Sbjct: 58 QALADFATLIYHVKEKHGARDSPVIAFGGSYGGMLAAWLRAKYPNAVQGAIAGSAPVGAY 117
Query: 216 GDITPWTTYHSVVSKDFR---DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP 272
VV+ D ++ C + + E+ + L+ F+ C
Sbjct: 118 -----------VVTYDASPEAGAAKHCRANVHSFFQELLADKERASFWQHLADVFRLCLA 166
Query: 273 LKNTTELKDGLDTVYSEAAQYDT--------PSNI--------PVKRICNAIENAPNCGD 316
++ KD + Y +D+ PS P++ C+ + + +
Sbjct: 167 PESG---KDVENVAYWVQGAFDSFAMGNYPYPSTYMGGALPAWPMRAACDHLADEKPSKE 223
Query: 317 DILCKIAAGVVEADSLEYDG-NNSRCYINEDRTGDESDEG-WEWQSCSE 363
D+L +AA V L Y+ ++ CY G W +Q C+E
Sbjct: 224 DLLQGMAAAV----GLLYNATGDAPCYNATQLVGPAGPGATWMFQWCTE 268
>gi|123435014|ref|XP_001308906.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121890608|gb|EAX95976.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 527
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 121/476 (25%), Positives = 208/476 (43%), Gaps = 46/476 (9%)
Query: 40 RILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEA 99
RILE ++ ++ F IDHF SY + +R++ N F L ++G E+
Sbjct: 12 RILEPPGNDDEYKIF--QNRIDHFDTHDSSY--YMERFLENLTFVNKTFKKALLYIGGES 67
Query: 100 PIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALA 159
+ +G E A + A + LEHR++GKS+PF L ++ Y QALA
Sbjct: 68 TLSPRYVQAGSYLELAARENAAVFALEHRFFGKSMPFDQ----LTKENYK-YLTIPQALA 122
Query: 160 DYASILLHIKDKYNATH-------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
D A + ++Y TH V +G SY G L++WFRLKYPH+ + S ASSAP+
Sbjct: 123 DLAEFI----ERYIYTHHLADQDGVTVAVVGGSYPGALSSWFRLKYPHLAVASWASSAPV 178
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP 272
D + Y V+K +++ C + +K + +I + A K G A FK
Sbjct: 179 NVKNDFPEYDEY---VAKRVNLSADGCLERTRKVF-DISHEAVK-SGDASKIAAFKDKYG 233
Query: 273 LKNTTELKDGL---DTVYSEAAQYDTPSNIPVKRICNAIENAPNCG--DDILCKIAAGVV 327
+K+ T L V S QY++ + + + C I + + ++I + +
Sbjct: 234 IKHETNDISALYIIADVLSAMVQYNSRYGV-LDQYCKKITESQSESEYENIYVQTFKDFL 292
Query: 328 EADSLEYDGNNSRCYINEDRTGDESD-EGWEWQSCSEM---VVPMGKDKNSMYQPEPWNL 383
+ + E + + + D T ++ W + +C+E+ GK ++S+ N+
Sbjct: 293 KNNGQEPEDYDLLQATSTDPTSATANSRSWSYMTCNEVGWFQTASGKLRSSLL-----NI 347
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
+ C+ +G+S + + Y G+ + + + FSNG DP+S G V+T
Sbjct: 348 DYFTTVCQNLFGISLADTNQVNYKFGN-----INPGQTQVYFSNGDVDPWSTLG-VETAS 401
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKAIN 499
++ + SHC DL + S L + + MQ W+ D I+
Sbjct: 402 PNIQRYAVVIPGESHCADLGKYNASLESNLTIAQAKIINQMQKWMNMRNDCVNGIS 457
>gi|302807385|ref|XP_002985387.1| hypothetical protein SELMODRAFT_122090 [Selaginella moellendorffii]
gi|300146850|gb|EFJ13517.1| hypothetical protein SELMODRAFT_122090 [Selaginella moellendorffii]
Length = 481
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 122/470 (25%), Positives = 201/470 (42%), Gaps = 65/470 (13%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
LLL+ SFHA + RS + D++ +Y QT+DH Y +
Sbjct: 11 LLLVSTCAGVSFHAT----------LHRSDAVLTTDAH------WYTQTLDH--YATQDD 52
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+TF QRY + ++ N P+ + E IQ + ++ A +F A IV LEHRYY
Sbjct: 53 RTFAQRYYEFTDYFDAPNGPVFLKICGEGTCV-GIQ-NDYSAVLAKRFGAAIVSLEHRYY 110
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-----ATHAPVIAIGAS 185
G+S PF S + Y +S QAL D A+ + +D N + P I +G S
Sbjct: 111 GQSSPFKSHAT-----ENLIYLSSKQALFDLAAFREYYQDLINHRTNSTSDNPWIVMGGS 165
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
Y G L+ WF+LK+PH+ +GS+ASS + T + + +E T
Sbjct: 166 YSGALSAWFKLKFPHLAVGSVASSGVVQAIFKFTKFDEQARLFL-----VAESAGATCSA 220
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY--SEAA----QYDTPSNI 299
+ + +A + L + + L N +L D +Y ++AA QY P
Sbjct: 221 ALRAVTRLAEQG-----LKENSVSTKALFNAEQLDVDGDFLYLLADAAAIAFQYGNPD-- 273
Query: 300 PVKRICNAIENAPNCGDDILCKIAAGVVE--ADSLEYDGNN-SRCYINED-RTGDE-SDE 354
+C+ + A +D+L A V + D+ + N + ++ E+ GD SD
Sbjct: 274 ---ILCSPLVAAYKRNEDLLAVYAKYVKDYYIDTFKSSINTYDQKHLKENLAAGDHSSDR 330
Query: 355 GWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY--GVSPRPSWVLTYYGGHDI 412
W +Q C+E+ N+ + N+ ++ C + G P YYGG+ I
Sbjct: 331 LWWYQVCTEVAYFQAAPANNSIRSALVNVKYHLDLCSNVFENGTFPEVDNTNLYYGGNKI 390
Query: 413 KLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL 462
+ I+F NG +DP+ R +T + ++ +HC+D+
Sbjct: 391 R------GDKILFMNGSQDPW-RHASKQTSSRNEPAYVIKCQNCAHCVDM 433
>gi|344268154|ref|XP_003405927.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 486
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 121/489 (24%), Positives = 192/489 (39%), Gaps = 91/489 (18%)
Query: 64 SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIV 123
+YG + T ++Y IN F+ G L G E + I+ + A + AL +
Sbjct: 27 AYGDVTQST--EKYYINYDFYKPGGPVFLKVQGNEPASIEWIRRNFTWITYAQRLGALCL 84
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIG 183
+LEHR+YG S P +R + +N R Y +S QA+AD A I N T + G
Sbjct: 85 LLEHRFYGDSQP--TRDMSTENFRR--YLSSRQAVADIAEFRTVIAQSMNLTENKWVLFG 140
Query: 184 ASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTI 243
SYGG LA W R+K+P++ ++ SSA + + Y V+ + + EC + +
Sbjct: 141 GSYGGSLAVWSRIKHPNLFAAAVTSSAMV---QAKVNFYEYFEVIHRALATHNRECLKAV 197
Query: 244 KKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY-SEAAQYDTPSNIPVK 302
K+++ + + P + L +K C PL +K +D +Y E S +
Sbjct: 198 KQAYGFVAAMLLLPGYHSRLILDYKFCEPLT----IKSEMDQLYIIEKLMLICSSIVLSN 253
Query: 303 RICNAIENA-------------------PNCGD------DILCKIAAGV---------VE 328
R N +++A PN G+ D C+
Sbjct: 254 RKSNTVKHAFETALINQANLTPEVTLFNPNLGEMGEMSIDEFCETMTNTSLGSPYHRYAR 313
Query: 329 ADSLEYDGNNSRCY-------------INEDRTGDESDEGWEWQSCSEMVVPMGKD-KNS 374
+ RCY + +R W +Q+CSE D KNS
Sbjct: 314 IINTMISNRRYRCYPASYKQFVEEYSNTSMERNKYRRGRQWLYQTCSEFGWFYTPDLKNS 373
Query: 375 MYQPEPWNLTKY-IKNCKEQYGVSPRPSWVLT-------YYGGHDIKLILRRSTSNIIFS 426
+ P T+Y +K C + +G V YYGG +++ S IIFS
Sbjct: 374 SFSGLP---TRYFVKRCSDVFGPKFNNHSVFQGVMSTNKYYGGLNVR------GSKIIFS 424
Query: 427 NGMRDPFSRGGWVKTYHFFDLSFSQDLNL-----GSHCLDLDEAKKSDPDWLVQQRKTEV 481
NG DP+ R G K S DL + C D+ E + +D L Q R+
Sbjct: 425 NGSNDPWHRLGITK-------DISADLPAVFIKGEAFCEDMAEPQDTDSAELKQAREKIF 477
Query: 482 KIMQGWITQ 490
+ ++ W+ +
Sbjct: 478 QTLKKWLRK 486
>gi|302766039|ref|XP_002966440.1| hypothetical protein SELMODRAFT_63867 [Selaginella moellendorffii]
gi|300165860|gb|EFJ32467.1| hypothetical protein SELMODRAFT_63867 [Selaginella moellendorffii]
Length = 393
Score = 105 bits (263), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 109/427 (25%), Positives = 175/427 (40%), Gaps = 60/427 (14%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF+ PE +TFPQ+Y ++ P+ + E + T + A +
Sbjct: 1 QKLDHFT--PEDTRTFPQKYFELLDYFEPQRGPMFLVMCGETSCPGG--YAQLTSDVAKE 56
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
F A +V LEHR+YG+S PF + N + Y Q+L D+A + + NA
Sbjct: 57 FGAAVVTLEHRFYGESSPF--HNLTVDNLK---YLTIQQSLLDHAEFIAFYQKVINAKFQ 111
Query: 178 -----PVIAIGASYGGELATWFRLKYPHVVIGSLASSA---PILYYG------DITPWTT 223
P + IG SY G L+ WFRLK+PH+VIGS ASSA PIL Y IT
Sbjct: 112 KDGDNPWLVIGGSYAGALSAWFRLKFPHLVIGSWASSAVVHPILSYSAYDRQMGITAGPE 171
Query: 224 YHSVVSKDFRDTSEECFQ--TIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
V+ + + T KS+ + + + D LA +++ EL+
Sbjct: 172 CKRVLQNVTSIVEKALLENGTAIKSFFDPNAVKVNVDFLAYVAEIIAVAV----RKELQR 227
Query: 282 GLDTVYSEAAQYDT--PSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS 339
+ + S+ ++ T + ++C ++ NA + + + S Y
Sbjct: 228 HVFVLSSDLFRFATIQAQSGRFNQLCTSVLNASATNNATKLLVTKFIFHVQSPNYQW--- 284
Query: 340 RCYINEDRTGDESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSP 398
W++Q C+EM + + + ++ + N Y+ C + +G
Sbjct: 285 ---------------AWKYQVCTEMGLFRVSSGPDGLFSLQ-INTQYYLDQCSQMFGQGI 328
Query: 399 RPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK-TYHFFDLSFSQDLNL 455
RP T +GG I + S I+F NG DP+ T F + SF D +
Sbjct: 329 RPDVTTTNLLFGGAKI------AGSKIMFLNGSEDPWRHASIQNITSSFSEPSFMLDCHS 382
Query: 456 GSHCLDL 462
SH D
Sbjct: 383 CSHVQDF 389
>gi|219127344|ref|XP_002183897.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404620|gb|EEC44566.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 526
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 108/467 (23%), Positives = 194/467 (41%), Gaps = 69/467 (14%)
Query: 54 FFYNQTIDHF-SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY 112
+F +Q +DH S ++ + QR+ ++ +++ G SPI +G E I+ + +GF Y
Sbjct: 32 YFDDQLVDHVASSHRHGHERWSQRFYLSHEYFKGPGSPIFVIMGGEGAIEPS---TGFMY 88
Query: 113 ----ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRG-------YFNSAQALADY 161
+ A F A+++ EHR+YG+S P A ++ AR G QAL D
Sbjct: 89 PFILQLAQTFGAMVLQPEHRFYGQSQPV--TPAEIERARDDGKPDPRLKLLTVEQALHD- 145
Query: 162 ASILLHI--------KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL 213
A L+H +D+++ + PVI +G SY G L+ RL++P VV + A+SAP+
Sbjct: 146 AVRLIHFVRDRVRCSRDRFSPRYCPVITVGGSYPGFLSAMARLRFPGVVDMAYAASAPMK 205
Query: 214 YYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNI----ASKPDGLAI------L 263
+Y Y++ + C Q ++++ + + SK + AI +
Sbjct: 206 FYAQQVDQYAYYNHIGTVAEQAFTGCSQDVRRALDDFRTVYESGQSKINETAIGICSGTV 265
Query: 264 SKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA 323
K + G A Y + + R C + D L
Sbjct: 266 PAYIKDPATFVQEVLMMVGYTFANHNMAFYPPSNQTHLGRACQTFASPSLSTLDQLKTFL 325
Query: 324 AGVVEADSLEYDGNNSRCYINEDR----------TGD-------ESDEGWEWQSCSEMVV 366
+ S E + C+ + GD S E W++Q+C+ +V
Sbjct: 326 VASLAPRSTENQPDEETCFDMRKQLPSGRNATISAGDWSGVGTGASGESWDFQTCTSLVE 385
Query: 367 PM---GKDKN------SMYQPEPWNLTKYIKNCKEQYG--VSPRPSWVLTYYGGHDIKLI 415
+ G D N SM+ WN++ +C++++G V+P P+ ++ + D +
Sbjct: 386 SIGFAGGDGNQDAYGISMFPRRDWNISWLTSHCQQRFGDAVTPMPNTLVNAWNFDD---L 442
Query: 416 LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL 462
+ + I+F+NG D +S G ++ D + G+H DL
Sbjct: 443 VAAGATRIVFTNGALDGWSVSGI--SHDLSDTLLALTFPNGAHHSDL 487
>gi|198437028|ref|XP_002124988.1| PREDICTED: similar to CG9953 CG9953-PA [Ciona intestinalis]
Length = 508
Score = 105 bits (262), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 104/396 (26%), Positives = 166/396 (41%), Gaps = 55/396 (13%)
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
T+ QRY N++F N P+ +G E P G A Q A++ ++EHR+Y
Sbjct: 58 HTWKQRYFANNQF-STPNGPVFLSIGGEGPAGSIWMTKGHWVTMAKQTGAMLFMIEHRFY 116
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGEL 190
GKS P + +S QALAD A+ + +I +Y I G SY G L
Sbjct: 117 GKSHPTPDASLESLSVL-----SSEQALADIANFITNITAEYKLAGRKWIVFGGSYSGSL 171
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSE 249
A W R KYPH++ G++++SAP+ I + Y VV + + S +C Q + + +E
Sbjct: 172 AIWARYKYPHLISGAVSASAPL---HPIVNFDGYQEVVQRSLQTLGSPKCVQNLANATTE 228
Query: 250 IDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY-SEAAQYDTPSNIPVKRICNAI 308
I ++ G IL+ +F C L + D LD Y E+ + R
Sbjct: 229 ITSLMKTTAGRKILTSEFNLCHALSD-----DVLDNQYFQESVAGSIQDVVQYNRDNMHF 283
Query: 309 E-NAPNCGDDILCKIAAGV---------VEADSLEYDGNNSRCY------INEDRTGDES 352
E + P +C I V E + L +N C D
Sbjct: 284 EGHGPAFNISYICHILDDVNLGSPLKRYAEINRLTLKESNVSCLDSSYQKFVSDTKATSW 343
Query: 353 DEG-----WEWQSCSEMVVPMGKDKNSMYQP-EPWNLTKYIKNCKEQYGVSPRPSWVL-- 404
D+ W +Q+C+E D S +QP + + L I+ C++ +G+ PS ++
Sbjct: 344 DKATGMRQWLYQTCTEFGWFQSSD--STHQPFKGFPLKFSIQQCQDIFGI---PSEIIYK 398
Query: 405 ------TYYGGHDIKLILRRSTSNIIFSNGMRDPFS 434
YGG + ++ +N+ NG+ DP+S
Sbjct: 399 GVQRSTENYGGLSVAGLV----TNVTLYNGLIDPWS 430
>gi|388499696|gb|AFK37914.1| unknown [Lotus japonicus]
Length = 390
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 108/367 (29%), Positives = 158/367 (43%), Gaps = 57/367 (15%)
Query: 13 LMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQT 72
++++F+ST R RL R L +++ +++QT+DH Y P ++
Sbjct: 12 MLLLFLST---VAATPHRHRLSETATDRYLTKQEQ-------WFSQTLDH--YSPYDHRK 59
Query: 73 FPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGK 132
F QRY ++ + P+ + E N + + A +F A +V LEHRYYGK
Sbjct: 60 FQQRYYEFLDYFRIPDGPVFLVICGEYSC--NGIRNDYIAVLAKKFGAAVVSLEHRYYGK 117
Query: 133 SVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA------THAPVIAIGASY 186
S PF S A KN R Y +S QAL D A + +D NA T P G SY
Sbjct: 118 SSPFKS--LATKNLR---YLSSKQALFDLAVFRQNYQDSLNAKLNRTKTENPWFVFGVSY 172
Query: 187 GGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKS 246
G L+ WFRLK+PH+ GSLASSA +L + T + + E K +
Sbjct: 173 PGALSAWFRLKFPHLTCGSLASSAVVLAVYNFTEFD----------QQIGESAGAECKAA 222
Query: 247 WSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY------SEAAQYDTPSNIP 300
E + K LA K K N +L+ D +Y A QY P
Sbjct: 223 LQETTQLIEKK--LATDGKALKASF---NAADLEIDGDFMYFLADAGITAFQYGNPD--- 274
Query: 301 VKRICNAIENAPNCGDDILCKIAAGVVE--ADSLE-YDGNNSRCYINEDRTGDESDEGWE 357
+C + A G+D++ V E +S E YD N + N + + SD W
Sbjct: 275 --ILCKPLVEAKKDGEDLVDAYVKYVKENYGESTESYDQENLK---NTSVSENSSDRLWW 329
Query: 358 WQSCSEM 364
+Q C+E+
Sbjct: 330 FQVCTEV 336
>gi|302762512|ref|XP_002964678.1| hypothetical protein SELMODRAFT_63866 [Selaginella moellendorffii]
gi|300168407|gb|EFJ35011.1| hypothetical protein SELMODRAFT_63866 [Selaginella moellendorffii]
Length = 393
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 100/396 (25%), Positives = 166/396 (41%), Gaps = 59/396 (14%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF+ PE +TFPQ+Y ++ P+ + E + T + A +
Sbjct: 1 QKLDHFT--PEDTRTFPQKYFELLDYFEPQRGPMFLVMCGETSCPGG--YAQLTSDVAKE 56
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
F A +V LEHR+YG+S PF + N + Y Q+L D+A + + NA
Sbjct: 57 FGAAVVTLEHRFYGESSPF--HNLTVDNLK---YLTIQQSLLDHAEFIAFYQKVINAKFQ 111
Query: 178 -----PVIAIGASYGGELATWFRLKYPHVVIGSLASSA---PILYYG------DITPWTT 223
P + IG SY G L+ WFRLK+PH+VIGS ASSA PIL Y IT
Sbjct: 112 KDGDNPWLVIGGSYAGALSAWFRLKFPHLVIGSWASSAVVHPILSYSAYDRQMGITAGPE 171
Query: 224 YHSVVSKDFRDTSEECFQ--TIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
V+ + + T KS+ + + + D LA +++ +L+
Sbjct: 172 CKRVLQNVTSIVEKALLENGTAIKSFFDPNAVKVNVDFLAYVAEIIAVAV----RKQLQR 227
Query: 282 GLDTVYSEAAQYDT--PSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS 339
+ ++S+ ++ T + ++C ++ NA + + V S Y
Sbjct: 228 HVFVLFSDLFRFATIQAQSGRFNQLCTSVLNASATNNATKLLVTKFVFHVQSPNYQW--- 284
Query: 340 RCYINEDRTGDESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSP 398
W++Q C+EM + + + ++ + N Y+ C + +G
Sbjct: 285 ---------------AWKYQVCTEMGLFRVSSGPDGLFSLQ-INTQYYLDQCSQMFGQGI 328
Query: 399 RPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDP 432
+P T +GG I + S I+F NG+ DP
Sbjct: 329 QPDVATTNLLFGGAKI------AGSKIMFLNGLEDP 358
>gi|110289391|gb|ABB47878.2| prolyl carboxypeptidase like protein, putative, expressed [Oryza
sativa Japonica Group]
gi|215706481|dbj|BAG93337.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 404
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 102/372 (27%), Positives = 153/372 (41%), Gaps = 44/372 (11%)
Query: 57 NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEA----PIDDNIQLSGFTY 112
+Q +DHFS P ++ F QRY + + GG L G + P D LS
Sbjct: 56 DQRLDHFS--PTDHRQFKQRYYEFADYHAGGGPVFLRICGESSCNGIPNDYLAVLS---- 109
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
+F A +V EHRYYGKS PF S +N R + +S QAL D + H ++
Sbjct: 110 ---KKFGAAVVTPEHRYYGKSSPFES--LTTENLR---FLSSKQALFDLVAFRQHYQEIL 161
Query: 173 NATHA-------PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
NA + P G SY G L+ WFRLK+PH+ GSLASS +L + +T +
Sbjct: 162 NARYNRSSGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGVVL---AVYNFTDFD 218
Query: 226 SVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT 285
V + Q + + E + S+ + ++K LKN + L
Sbjct: 219 KQVGDSAGPECKAALQEVTRLVDEQLRLDSRSVKVLFGAEK------LKNDGDFLFFLAD 272
Query: 286 VYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS---RCY 342
+ QY +P +C+ + NA G ++ A V + + S + Y
Sbjct: 273 AAAIGFQYGSPDA-----VCSPLINAKKTGRSLVETYAQYVQDFFIRRWGTTVSSYDQEY 327
Query: 343 INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSW 402
+ D S W +Q CSE+ KN + N ++ C+ +G P
Sbjct: 328 LKNTTPDDTSSRLWWFQVCSEVAYFQVAPKNDSIRSTEINTGYHLDLCRNVFGEGVYPDV 387
Query: 403 VLT--YYGGHDI 412
+T YYGG I
Sbjct: 388 FMTNLYYGGTRI 399
>gi|302807541|ref|XP_002985465.1| hypothetical protein SELMODRAFT_424499 [Selaginella moellendorffii]
gi|300146928|gb|EFJ13595.1| hypothetical protein SELMODRAFT_424499 [Selaginella moellendorffii]
Length = 458
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 119/435 (27%), Positives = 186/435 (42%), Gaps = 72/435 (16%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
LL+ SFHA + RS L D++ +Y QT+DH Y +
Sbjct: 8 FLLVSTCAGVSFHAT----------LHRSGALLTTDAH------WYTQTLDH--YATQDD 49
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+TF QRY + ++ N P+ + E P IQ + ++ A +F A IV LEHRYY
Sbjct: 50 RTFSQRYYEFTDYFDAPNGPVFLKICPEGPCV-GIQ-NDYSAVLAKRFGAAIVSLEHRYY 107
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD----KYNATH-APVIAIGAS 185
G+S PF + A +N Y +S QAL D A+ + +D + N+T P I IG S
Sbjct: 108 GQSSPF--KTHATENLI---YLSSKQALYDLAAFREYYQDLINHRTNSTRDNPWIVIGWS 162
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
Y G L+ WF+LK+PH+ +GS+ASS + T S R + Q +KK
Sbjct: 163 YAGALSAWFKLKFPHLAVGSVASSGIVAESAGAT--------CSAALRAVTRLAEQGLKK 214
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRIC 305
+ ++ +K L + L + A QY P + +C
Sbjct: 215 N--------------SVFTKALFNAEQLDVDGDFLYLLAEAATTAFQYGNP-----EILC 255
Query: 306 NAIENAPNCGDDILCKIAAGVVE--ADSLEYDGNN-SRCYINED-RTGDE-SDEGWEWQS 360
+ + A +D+L A V + D+ + N + ++ E+ GD SD W +Q
Sbjct: 256 SPLVAAYKRDEDLLAVYAKYVKDYYIDTFKSSINTYDQKHLKENLAAGDHSSDRLWWYQM 315
Query: 361 CSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT--YYGGHDIKLILRR 418
C+E+ N+ + N+ ++ C +G P T YYGG+
Sbjct: 316 CTELGYFQVAPANNSIRSALINVKYHLDLCSNVFGNGTFPEVDSTNLYYGGN-------- 367
Query: 419 STSNIIFSNGMRDPF 433
I+F NG +DP+
Sbjct: 368 RGDKILFMNGSQDPW 382
>gi|302807545|ref|XP_002985467.1| hypothetical protein SELMODRAFT_122204 [Selaginella moellendorffii]
gi|300146930|gb|EFJ13597.1| hypothetical protein SELMODRAFT_122204 [Selaginella moellendorffii]
Length = 472
Score = 103 bits (258), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 121/442 (27%), Positives = 193/442 (43%), Gaps = 72/442 (16%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
LL+ SFHA + RS L D++ +Y QT+DH Y +
Sbjct: 8 FLLVSTCAGVSFHAT----------LHRSDALLTTDAH------WYTQTLDH--YATQDD 49
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+TF QRY + ++ N P+ + E P IQ + ++ A +F A IV LEHRYY
Sbjct: 50 RTFSQRYYEFTDYFDAPNGPVFLKICPEGPCV-GIQ-NDYSAVLAKRFGAAIVSLEHRYY 107
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD----KYNATH-APVIAIGAS 185
G+S PF + + Y +S QAL D A+ + +D + N+T P I IG S
Sbjct: 108 GQSSPF-----KIHATENLIYLSSKQALFDLAAFREYYQDLINHRTNSTRDNPWIVIGWS 162
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
Y G L+ WF+LK+PH+ +GS+ASS + D+ + +E T
Sbjct: 163 YAGALSAWFKLKFPHLAVGSVASSGIVQAIFDV-----------RIHLQVAESAGATCSA 211
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY--SEAA----QYDTPSNI 299
+ + +A + L K + L N +L D +Y +EAA QY P
Sbjct: 212 ALRAVTRLAEQG-----LKKNSVSTKALFNAEQLDVDGDFLYLLAEAATTAFQYGNP--- 263
Query: 300 PVKRICNAIENAPNCGDDILCKIAAGVVE--ADSLEYDGNN-SRCYINED-RTGDESDE- 354
+ +C+ + A +D+L A V + D+ + N + ++ E+ GD S +
Sbjct: 264 --EILCSPLVAAYKRDEDLLAVYAKYVKDYYIDTFKSSINTYDQKHLKENLAAGDHSSDR 321
Query: 355 -GWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT--YYGGHD 411
GW +Q C+E+ N+ + N+ ++ C +G P T YGG+
Sbjct: 322 LGW-YQMCTELGYFQVAPANNSIRSALINVKYHLDLCSNVFGNGTFPEVDDTNLCYGGNK 380
Query: 412 IKLILRRSTSNIIFSNGMRDPF 433
I+ I+F NG +DP+
Sbjct: 381 IR------GDKILFMNGSQDPW 396
>gi|413933337|gb|AFW67888.1| hypothetical protein ZEAMMB73_712720 [Zea mays]
Length = 773
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 90/306 (29%), Positives = 127/306 (41%), Gaps = 79/306 (25%)
Query: 60 IDHFSYG-----PESYQTFPQRYVIN-SKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYE 113
+DHFS+ E+ F QRY++ W G PI + G E I SG ++
Sbjct: 482 LDHFSFPGVEDEDEAVVFFQQRYLVGRDSGWAGPGGPIFFYCGNEGDIAWFAANSGLIWD 541
Query: 114 NAHQFKA---------------LIVILE--------HRYYGKSVPFGSRKAALKNARHRG 150
A +F A L +L+ HRYY +S+PFGS+ A +++
Sbjct: 542 AAPRFAARGNRSSAASLVSYSYLFFVLKTFRLKRYIHRYYRESMPFGSKAKAYSDSKFPT 601
Query: 151 YFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
Y + QALAD+ +L +K +A +PV+ G SYGG LA W RLKYPH+ IG L
Sbjct: 602 YLTAEQALADFVVLLTDLKRNLSAEGSPVVLFGGSYGGMLAAWMRLKYPHIAIGVL---- 657
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
+H + S R + I L IL
Sbjct: 658 -------------HHQLRSCSLRTLFLLLYSMISY--------------LMILGT----- 685
Query: 271 TPLKNTTELKDGLDTVYSEAAQYD----------TPSNIPVKRICNAIENAPNCGDDILC 320
LK + +L D L + YS A D P+N P+K +C I++ P G L
Sbjct: 686 --LKTSGDLSDWLSSAYSYLAMVDYPLPSEFLRPLPAN-PIKEVCGNIDSQPK-GIGTLE 741
Query: 321 KIAAGV 326
+I AGV
Sbjct: 742 RIYAGV 747
>gi|194744685|ref|XP_001954823.1| GF16549 [Drosophila ananassae]
gi|190627860|gb|EDV43384.1| GF16549 [Drosophila ananassae]
Length = 489
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 112/457 (24%), Positives = 190/457 (41%), Gaps = 54/457 (11%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD----NIQLSGFTYE 113
Q +D+F ++ T+ R IN K + G SPI +LG E I N L+ T
Sbjct: 62 QKLDNFD--DDNNATWSDRIYINEKHFVDG-SPIFIYLGGEWEIQSWDISNTLLADIT-- 116
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
+ I+ EHR++GKS+P L Y N QALAD +++ +K++
Sbjct: 117 --KKHNGTIITTEHRFFGKSIPI----TPLSTENLEKYQNVNQALADVINVIQTLKEEGK 170
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
+ V+ G SY G +A W R YP +++GS ASSAPI+ D + Y VV + ++
Sbjct: 171 YKDSKVVISGCSYSGAMAAWIRKLYPDIIVGSWASSAPIVAKVD---FKDYFKVVGESYQ 227
Query: 234 DTSEE-CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-----KNTTELKDGLDTVY 287
+ C+ I + S +++ + G K+ C ++ ++ + ++
Sbjct: 228 TLGGQYCYDLIDNATSYYEDLFANGKGDQ-AKKELNLCDDFDPKNKRDRWQIFSTIANIF 286
Query: 288 SEAAQYDTPSNIPVKRICNAI-----ENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY 342
+ AQY P+N + + C+ + ++A I K+ E S +DG
Sbjct: 287 AGIAQYQIPANYDIPKQCSVLRSFSDDDAEALSKFINWKVHEHTGECISATFDGTTGYYE 346
Query: 343 INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQP--EPWNLTKYIKNCKEQY------ 394
+D D SD W +Q+CSE + S +QP + Y C+ +
Sbjct: 347 WAKDNYED-SDLPWFFQTCSEF--GWFQSSGSSHQPFGSSFPSKLYEDTCEAVFGSKYNT 403
Query: 395 -GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL 453
G+ +GG DI +N+ F G D GW K + + +
Sbjct: 404 TGIRANAKATNAEFGGLDIDY------TNVYFVQGGLD-----GWKKVGAGVEEG-ATII 451
Query: 454 NLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+HC DL SD LV ++ + ++ W+ +
Sbjct: 452 PSAAHCSDLGSISASDSPELVASKQKVIALVDKWLAE 488
>gi|10140733|gb|AAG13566.1|AC073867_12 putative serine peptidase [Oryza sativa Japonica Group]
Length = 628
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 117/466 (25%), Positives = 187/466 (40%), Gaps = 69/466 (14%)
Query: 62 HFSYGPES--------YQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYE 113
H ++GP ++ F QRY ++ PI ++ E+ N + +
Sbjct: 174 HETFGPHKKAIGTSADHRQFKQRYYEFLDYYRAPKGPIFLYICGESSC--NGIPNSYLAV 231
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A +F A +V EHRYYGKS PF S +N R + +S QAL D A + ++ N
Sbjct: 232 MAKKFGAAVVSPEHRYYGKSSPFESLTT--ENLR---FLSSKQALFDLAVFRQYYQETLN 286
Query: 174 ATH------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
A + + G SY G L+ WFRLK+PH+ GSLASS +L + +T +
Sbjct: 287 AKYNRSGADSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL---SVYNYTDFDKQ 343
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
+ + EC ++++ +D G + + F T L N + L
Sbjct: 344 IGE---SAGPECKAALQETTKLVD--GQLQSGRNAVKQLFGAST-LANDGDFLFLLADAA 397
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA-------AGVVEADSLEYDGNNSR 340
+ A QY P +C+ I A G D++ A G A YD
Sbjct: 398 AIAFQYGNPD-----ALCSPIVEAKKNGTDLVETFARYVKDYYIGTFGASVASYDQE--- 449
Query: 341 CYINEDR--TGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSP 398
Y+ + + W +Q CSE+ KN + + ++ C+ +G
Sbjct: 450 -YLKNTTPPPAESAYRLWWYQVCSEVAYFQVAPKNDSVRSAKIDTRYHLDLCRNVFGEGV 508
Query: 399 RPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
P +T YYGG R + S I+F+NG +DP+ K+ + N G
Sbjct: 509 YPDVFMTNLYYGG------TRIAGSKIVFANGSQDPWRHASKQKSSKELPSYLIECSNCG 562
Query: 457 SHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
HC DL D + S P+ + + RK V + W+++
Sbjct: 563 -HCSDLSGCPQAPSHIEGDSSNCSSPEAVNKVRKQIVDHIDLWLSE 607
>gi|195109600|ref|XP_001999371.1| GI24473 [Drosophila mojavensis]
gi|193915965|gb|EDW14832.1| GI24473 [Drosophila mojavensis]
Length = 483
Score = 103 bits (256), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 112/477 (23%), Positives = 202/477 (42%), Gaps = 51/477 (10%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL 92
L ++ R L ET ++NQ++D+F +S + QR +IN + + G SPI
Sbjct: 38 LNQLHRGPPLPPSTKRANVETRWFNQSLDNFDDTNKS--VWSQRVMINEENFVDG-SPIF 94
Query: 93 AFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYF 152
LG E ID N SG + A + +V EHR++G S+P + +N ++ G
Sbjct: 95 LLLGGEWTIDPNSITSGLWVDIAKEHNGSLVYTEHRFFGGSIPI--LPLSTENLKYHGV- 151
Query: 153 NSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
QALAD +++ +K++ ++ V+ G SY +A W +L YP V++G ASSA +
Sbjct: 152 --EQALADVVNVIKVLKEEDKYKNSKVVVSGCSYSASMAVWLKLLYPDVIVGGWASSAVL 209
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEE-CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
D ++ + VV + +R + C+ I + S +++ G A K C
Sbjct: 210 EAKVD---FSDFMEVVGRAYRQLGGDYCYNLINNATSYYEHLFQTGQG-AKAKKLLNLCD 265
Query: 272 PLKNTTE-----LKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGV 326
E + + +++ AQY P N + R C+ + N DI A
Sbjct: 266 SFDENNERDQWQIFSLIANIFAGIAQYQKPENYDLARSCSVLRNL-----DIDDASALSK 320
Query: 327 VEADSLEYDG-NNSRC--------YINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQ 377
SL G +N+R ++ + G+ W +Q+C + G ++S +
Sbjct: 321 FVQYSLRQQGCHNARYQETVDYYKWVKNNYNGN-LHLSWFYQTCRQ----FGWFQSSANK 375
Query: 378 PEPWN----LTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRR-STSNIIFSNGMRDP 432
P+ T Y C++ +G + + Y + K R + N+ ++G D
Sbjct: 376 NHPFGSTFPATLYTDMCRDVFGSQYTSAKIEEYIQATNKKYGGRNPAVENVYMTHGGLDG 435
Query: 433 FSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
+S G + + GSHC D +D L ++ +++++ W+
Sbjct: 436 WSAVG---------SDSATIIPQGSHCFDSGSINPTDSPALRAAKERVIELVREWLA 483
>gi|222613128|gb|EEE51260.1| hypothetical protein OsJ_32132 [Oryza sativa Japonica Group]
Length = 524
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 117/466 (25%), Positives = 187/466 (40%), Gaps = 69/466 (14%)
Query: 62 HFSYGPES--------YQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYE 113
H ++GP ++ F QRY ++ PI ++ E+ N + +
Sbjct: 70 HETFGPHKKAIGTSADHRQFKQRYYEFLDYYRAPKGPIFLYICGESSC--NGIPNSYLAV 127
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A +F A +V EHRYYGKS PF S +N R + +S QAL D A + ++ N
Sbjct: 128 MAKKFGAAVVSPEHRYYGKSSPFESLTT--ENLR---FLSSKQALFDLAVFRQYYQETLN 182
Query: 174 ATH------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
A + + G SY G L+ WFRLK+PH+ GSLASS +L + +T +
Sbjct: 183 AKYNRSGADSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL---SVYNYTDFDKQ 239
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
+ + EC ++++ +D G + + F T L N + L
Sbjct: 240 IGE---SAGPECKAALQETTKLVD--GQLQSGRNAVKQLFGAST-LANDGDFLFLLADAA 293
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA-------AGVVEADSLEYDGNNSR 340
+ A QY P +C+ I A G D++ A G A YD
Sbjct: 294 AIAFQYGNP-----DALCSPIVEAKKNGTDLVETFARYVKDYYIGTFGASVASYDQE--- 345
Query: 341 CYINEDR--TGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSP 398
Y+ + + W +Q CSE+ KN + + ++ C+ +G
Sbjct: 346 -YLKNTTPPPAESAYRLWWYQVCSEVAYFQVAPKNDSVRSAKIDTRYHLDLCRNVFGEGV 404
Query: 399 RPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
P +T YYGG R + S I+F+NG +DP+ K+ + N G
Sbjct: 405 YPDVFMTNLYYGG------TRIAGSKIVFANGSQDPWRHASKQKSSKELPSYLIECSNCG 458
Query: 457 SHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
HC DL D + S P+ + + RK V + W+++
Sbjct: 459 -HCSDLSGCPQAPSHIEGDSSNCSSPEAVNKVRKQIVDHIDLWLSE 503
>gi|195062815|ref|XP_001996259.1| Pro-X carboxypeptidase [Drosophila grimshawi]
gi|193899754|gb|EDV98620.1| Pro-X carboxypeptidase [Drosophila grimshawi]
Length = 478
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 73/250 (29%), Positives = 118/250 (47%), Gaps = 19/250 (7%)
Query: 57 NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAH 116
Q +DHF E +T+ RY+IN +F G SPI +LG E + + G Y+ A
Sbjct: 58 TQPLDHFDESNE--KTYQMRYLINDEFQTEG-SPIFIYLGGEWEVSPGMIEKGHWYDLAK 114
Query: 117 QFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNAT 175
+ K L++ EHRYYG SVP + K + + + Y + QALAD + +K +
Sbjct: 115 EHKGLLIYTEHRYYGNSVP--TEKMTVDDLQ---YLHVKQALADVKHFITTLKSENAQLA 169
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
++ V+ G SY + WF+ YP +V+G ASSAP+L D Y V K F +
Sbjct: 170 NSKVLLAGGSYSATMVVWFKRLYPDLVVGGWASSAPLLAKVDFFE---YKEVTGKAFAEL 226
Query: 236 -SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK-----DGLDTVYSE 289
++C+ I+K ++++ + S + C+ + +L + V++
Sbjct: 227 GGQKCYDRIQKGIADLEYMFDNKRSAEARS-MLRLCSSFDHENDLDMWNLFGSISNVFAS 285
Query: 290 AAQYDTPSNI 299
AQY P I
Sbjct: 286 LAQYQQPGEI 295
>gi|402591691|gb|EJW85620.1| serine carboxypeptidase S28 family protein [Wuchereria bancrofti]
Length = 183
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/132 (43%), Positives = 76/132 (57%), Gaps = 9/132 (6%)
Query: 88 NSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNAR 147
+ P+ + G E I+ Q +G ++ A +F A IV EHRYYG S P+G R + +
Sbjct: 47 DGPLFFYAGNEGDIEAFAQNTGIIWDLAPRFHAAIVFAEHRYYGDSKPYGKR--SYMDVL 104
Query: 148 HRGYFNSAQALADYASIL--LHIKDKY-----NATHAPVIAIGASYGGELATWFRLKYPH 200
GY N Q LAD+A ++ L I D+ T PVI G SYGG LATW R+KYPH
Sbjct: 105 RLGYLNDIQVLADFAQLITFLKIDDEELGFCPPGTEIPVIVFGGSYGGMLATWLRMKYPH 164
Query: 201 VVIGSLASSAPI 212
+V G+ ASSAP+
Sbjct: 165 IVDGAWASSAPL 176
>gi|26449329|dbj|BAC41792.1| putative prolyl carboxypeptidase [Arabidopsis thaliana]
Length = 281
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 78/217 (35%), Positives = 105/217 (48%), Gaps = 21/217 (9%)
Query: 3 SSQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDH 62
SS+ F L + +S +NGL L+PR R S L + + ++ QT+DH
Sbjct: 2 SSELGFALLTTFTVLLSYLSFSNGL-LQPR----RISHGLSKSSKYLTRDELWFTQTLDH 56
Query: 63 FSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALI 122
Y P ++ F QRY + PI + E P N + + A +F A I
Sbjct: 57 --YSPSDHRKFRQRYYEYLDHLRVPDGPIFLMICGEGPC--NGITNNYISVLAKKFDAGI 112
Query: 123 VILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA-------T 175
V LEHRYYGKS PF S A KN + Y +S QAL+D A+ + +D N
Sbjct: 113 VSLEHRYYGKSSPFKS--LATKNLK---YLSSKQALSDLATFRQYYQDSLNVKFNRSSNV 167
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
P G SY G L+ WFRLK+PH+ GSLASSA +
Sbjct: 168 ENPWFFFGVSYSGALSAWFRLKFPHLTCGSLASSAVV 204
>gi|218184873|gb|EEC67300.1| hypothetical protein OsI_34290 [Oryza sativa Indica Group]
Length = 524
Score = 102 bits (255), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 117/466 (25%), Positives = 187/466 (40%), Gaps = 69/466 (14%)
Query: 62 HFSYGPES--------YQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYE 113
H ++GP ++ F QRY ++ PI ++ E+ N + +
Sbjct: 70 HETFGPHKKAIGTSADHRQFNQRYYEFLDYYRAPKGPIFLYICGESSC--NGIPNSYLAV 127
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A +F A +V EHRYYGKS PF S +N R + +S QAL D A + ++ N
Sbjct: 128 MAKKFGAAVVSPEHRYYGKSSPFESLTT--ENLR---FLSSKQALFDLAVFRQYYQETLN 182
Query: 174 ATH------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
A + + G SY G L+ WFRLK+PH+ GSLASS +L + +T +
Sbjct: 183 AKYNRSGADSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL---SVYNYTDFDKQ 239
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
+ + EC ++++ +D G + + F T L N + L
Sbjct: 240 IGE---SAGPECKAALQETTKLVD--GQLQSGRNAVKQLFGAST-LANDGDFLFLLADAA 293
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIA-------AGVVEADSLEYDGNNSR 340
+ A QY P +C+ I A G D++ A G A YD
Sbjct: 294 AIAFQYGNP-----DALCSPIVEAKKNGTDLVETFARYVKDYYIGTFGASVASYDQE--- 345
Query: 341 CYINEDR--TGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSP 398
Y+ + + W +Q CSE+ KN + + ++ C+ +G
Sbjct: 346 -YLKNTTPPPAESAYRLWWYQVCSEVAYFQVAPKNDSVRSAKIDTRYHLDLCRNVFGEGV 404
Query: 399 RPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
P +T YYGG R + S I+F+NG +DP+ K+ + N G
Sbjct: 405 YPDVFMTNLYYGG------TRIAGSKIVFANGSQDPWRHASKQKSSKELPSYLIECSNCG 458
Query: 457 SHCLDL------------DEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
HC DL D + S P+ + + RK V + W+++
Sbjct: 459 -HCSDLSGCPQAPSHIEGDSSNCSSPEAVNKVRKQIVDHIDLWLSE 503
>gi|194899982|ref|XP_001979536.1| GG23262 [Drosophila erecta]
gi|190651239|gb|EDV48494.1| GG23262 [Drosophila erecta]
Length = 485
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 113/474 (23%), Positives = 197/474 (41%), Gaps = 42/474 (8%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL 92
L ++ R ++ + E + Q +D+F + T+ R IN+K++ G SPI
Sbjct: 34 LRQLHRGPPVQPMTTRAKVEERWITQKLDNFDASNNA--TWQNRIYINNKYFVDG-SPIF 90
Query: 93 AFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYF 152
+LG E ID + SG + A Q ++ EHR++G+S+P L Y
Sbjct: 91 IYLGGEWAIDPSGITSGLWKDIAKQHNGSLLYTEHRFFGQSIPI----TPLSTENLEKYQ 146
Query: 153 NSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
+ QALAD ++L +K + + V+ G SY +ATW R YP V+ GS ASSAP+
Sbjct: 147 SVEQALADVINVLATLKQEDKYKDSKVVVSGCSYSATMATWIRKLYPDVIRGSWASSAPL 206
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEE-CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
L + Y VV + + + C+ I + S +N+ +G +K+ C+
Sbjct: 207 LAK---VNFKDYMKVVGESYSILGGQYCYDLIDNATSYYENLFEIGNGTQA-AKELNLCS 262
Query: 272 PLKNTTE-----LKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGV 326
+E + + +++ AQY P + + C+ + + L K
Sbjct: 263 NFNVNSEQDRWQIFSTIANLFAGIAQYQKPEKYDIPKYCSILREFSDDDSVALSKFINWK 322
Query: 327 VEADS---LEYDGNNSRCYINEDRTG-DESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN 382
+ S L + Y + ++SD W +Q+CSE G ++S + +P+
Sbjct: 323 INEHSGACLSTTFKGAVGYYEWSKENYEDSDLPWIFQTCSE----FGWFQSSGSRSQPFG 378
Query: 383 ----LTKYIKNCKEQYGVSPRPSWV-LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
T Y C+ +G + + +D L + +NI F G D +S+ G
Sbjct: 379 STFPATLYEDTCEGVFGSKYDSAGIHANVRATNDDFGGLNVNATNIYFVQGALDGWSKVG 438
Query: 438 WVKTYHFFDLSFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+Q + SHC D +D LV +K +K++ W+
Sbjct: 439 ---------AGVAQGATIIPYASHCPDTGSISDTDSAELVASKKKLIKLVDQWL 483
>gi|66772019|gb|AAY55321.1| IP12634p [Drosophila melanogaster]
Length = 490
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 113/474 (23%), Positives = 194/474 (40%), Gaps = 42/474 (8%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL 92
L + R +E + E + Q +D+F + T+ R IN+K++ G SPI
Sbjct: 39 LRELHRGPPVEPMKTRAKVEERWITQKLDNFD--DSNNATWQDRIYINNKYFVDG-SPIF 95
Query: 93 AFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYF 152
+LG E ID + SG + A Q ++ EHR++G+S+P L Y
Sbjct: 96 IYLGGEWAIDPSGITSGLWKDIAKQHNGSLLYTEHRFFGQSIPI----TPLSTENLAKYQ 151
Query: 153 NSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
+ QALAD +++ +K + + V+ G SY +ATW R YP ++ GS ASSAP+
Sbjct: 152 SVEQALADVINVIATLKQEDKYKDSKVVVSGCSYSATMATWIRKLYPEIIRGSWASSAPL 211
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEE-CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
L + Y VV + + + C+ I + S +N+ +G + K+ C+
Sbjct: 212 LAK---VNFKDYMKVVGESYATLGGQYCYDLIDNATSYYENLFEIGNGTQAV-KELNLCS 267
Query: 272 PLKNTTE-----LKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGV 326
+E + + +++ AQY P + C+ + + L K
Sbjct: 268 NFNVNSEQDRWQIFSTIANIFAGIAQYQKPEKYDIPTYCSILREFSDDDSVALSKFINWK 327
Query: 327 VEADS---LEYDGNNSRCYINEDRTG-DESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN 382
+ S L + Y + +SD W +Q+CSE G ++S + +P+
Sbjct: 328 INEHSGACLSTTFKGAVGYYEWSKENYQDSDLPWIFQTCSE----FGWFQSSGSRSQPFG 383
Query: 383 ----LTKYIKNCKEQYGVSPRPSWV-LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
T Y C+ +G + + +D L + +NI F G D +S+ G
Sbjct: 384 STFPATLYEDTCEGVFGAKYDSAGIHANIRATNDDFGGLNVNATNIYFVQGALDGWSKVG 443
Query: 438 WVKTYHFFDLSFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+Q + SHC D SD LV +K +K++ W+
Sbjct: 444 ---------AGVAQGATIIPYASHCPDTGSISASDSAELVASKKKLIKLVAQWL 488
>gi|133930823|ref|NP_501598.2| Protein PCP-3 [Caenorhabditis elegans]
gi|119662054|emb|CAB05185.2| Protein PCP-3 [Caenorhabditis elegans]
Length = 1080
Score = 102 bits (254), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 118/473 (24%), Positives = 194/473 (41%), Gaps = 56/473 (11%)
Query: 50 GFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP------IDD 103
GFE + Q DHF ++ F Q++ N++ W P +G E P +++
Sbjct: 575 GFEQGTFRQRQDHFD--NQNADFFQQKFFKNAQ-WAKQGGPNFLMIGGEGPESARWVLNE 631
Query: 104 NIQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA 162
NI TY A ++ A + +LEHR+YG SV + + + NS Q L D A
Sbjct: 632 NI-----TYLTWAKKYGATVYLLEHRFYGDSV--------VGDNTNFKLLNSLQMLYDLA 678
Query: 163 SILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
+ + + T P I G SY G ++ W R +P +V+G++ASS P+ D +
Sbjct: 679 EFIKAVNIR-TGTSNPWITFGGSYSGAMSAWMREVFPDLVVGAVASSGPVYAKTD---FY 734
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG 282
Y VV R + +C I+ + I + +G LS F+ P ++ D
Sbjct: 735 EYLMVVENSVRRYNSKCADNIQSGFDAIRTLFLTKEGRQNLSSIFQLQPPFSDSVTDTDQ 794
Query: 283 ---LDTVYSE---AAQYDTPSNIP------VKRICNAIENAPNCG-DDILCKIAAGVVEA 329
VY A QY + P + +C + N N ++I+ ++
Sbjct: 795 HYFFSNVYGNFQGAVQYSGDNTGPYANGYGIPDMCKIMSNDSNTPLNNIVAFNQFMIIFY 854
Query: 330 DSLEYDG--NNSRCYINEDRT----GDESDEG--WEWQSCSEMVVPMGKDKNSMYQPEPW 381
+ +Y G NN + I +T G +S G W WQ+CSE D + P
Sbjct: 855 NGGQYTGMDNNYQNMITYLKTAQHYGPDSAAGLLWTWQTCSEFGYFQSADSGNGIFGSPT 914
Query: 382 NLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILR---RSTSNIIFSNGMRDPFSRGGW 438
+ Y++ C + + + + + + K R R T N++ NG DP+ G
Sbjct: 915 PVNMYVQMCMDVFNNQYQRTSIDYSIANTNYKYGERFHYRGT-NVVLPNGNVDPWHALG- 972
Query: 439 VKTYHFFDLSFSQDLNLGS-HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
Y+ D S L G+ HC D+ A+ +D L R + + W+ Q
Sbjct: 973 --LYYPTDSSVVSYLIDGTAHCADMYPARDADVPGLKVVRDLVDQNIAKWLNQ 1023
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 101/425 (23%), Positives = 172/425 (40%), Gaps = 64/425 (15%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLS---GFTYEN 114
Q++DHF G S TF QRY ++ + L ++ A+ +++ +S +
Sbjct: 50 QSLDHF-IGNAS-GTFSQRYFYTQQYTLHQRTAFL-YVSADG-VEEAAVISDERNPIVKT 105
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A QF A I LEHRYYG+S P K +N RH NS QA+ D S + + ++N
Sbjct: 106 AKQFGATIFSLEHRYYGQSRP-NFDKFDAQNLRH---LNSLQAILDIISFIKSVNVQFNM 161
Query: 175 -THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
+ GA YGG LA R P + G +ASS+P+ + D W V + +
Sbjct: 162 DPDVRWVLWGAGYGGILAAEARKWDPVTISGVIASSSPLTHLYDF--WQFNDQVATTFSQ 219
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY------ 287
C+ +++ +++I P+G +S F+ L T + + Y
Sbjct: 220 VGGGLCYNKVRQGFADIRQAMRTPEGRRNVSSLFQLNPRLDQTPLNYNDVQIFYLLIIAP 279
Query: 288 -SEAAQYDTPSNIPVKRICNAIENAPNCGDDI-------LCKIAAGVVEADSLEYD---- 335
+ Q++ NI + +C I+ + ++ L G V+ Y
Sbjct: 280 FQQIVQFNNDFNISISDMCTTIDKSTWTNMEVVRQAYVYLSTTITGSVQPMVTSYQTIVN 339
Query: 336 --GNNSRCYINEDRTGDESDEGWEWQSCSEM--VVPMGKDKNSMYQP-EPWNLTKYIKNC 390
GN S D+ W++Q+C+E ++N ++ P +L ++ C
Sbjct: 340 DLGNQSASSPYLDQ------RMWQYQTCTEFGWFYTTNNNENGLFGAVVPGSL--FLNQC 391
Query: 391 KEQYGVSPRPSWVLT-----------YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWV 439
+ + P + T YYG S +N +F+NG+ DP++ G
Sbjct: 392 ---FDIFPDANLTATSIRDLVIEYNNYYGS-----AFDYSGTNAVFTNGLLDPWTILGKK 443
Query: 440 KTYHF 444
T F
Sbjct: 444 STGDF 448
>gi|24648179|ref|NP_650804.1| CG3739 [Drosophila melanogaster]
gi|7300510|gb|AAF55664.1| CG3739 [Drosophila melanogaster]
Length = 547
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 113/474 (23%), Positives = 195/474 (41%), Gaps = 42/474 (8%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL 92
L + R +E + E + Q +D+F + T+ R IN+K++ G SPI
Sbjct: 96 LRELHRGPPVEPMKTRAKVEERWITQKLDNFD--DSNNATWQDRIYINNKYFVDG-SPIF 152
Query: 93 AFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYF 152
+LG E ID + SG + A Q ++ EHR++G+S+P L Y
Sbjct: 153 IYLGGEWAIDPSGITSGLWKDIAKQHNGSLLYTEHRFFGQSIPI----TPLSTENLAKYQ 208
Query: 153 NSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
+ QALAD +++ +K + + V+ G SY +ATW R YP ++ GS ASSAP+
Sbjct: 209 SVEQALADVINVIATLKQEDKYKDSKVVVSGCSYSATMATWIRKLYPEIIRGSWASSAPL 268
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEE-CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
L + + Y VV + + + C+ I + S +N+ +G + K+ C+
Sbjct: 269 LAKVN---FKDYMKVVGESYATLGGQYCYDLIDNATSYYENLFEIGNGTQAV-KELNLCS 324
Query: 272 PLKNTTE-----LKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGV 326
+E + + +++ AQY P + C+ + + L K
Sbjct: 325 NFNVNSEQDRWQIFSTIANIFAGIAQYQKPEKYDIPTYCSILREFSDDDSVALSKFINWK 384
Query: 327 VEADS---LEYDGNNSRCYINEDRTG-DESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN 382
+ S L + Y + +SD W +Q+CSE G ++S + +P+
Sbjct: 385 INEHSGACLSTTFKGAVGYYEWSKENYQDSDLPWIFQTCSE----FGWFQSSGSRSQPFG 440
Query: 383 ----LTKYIKNCKEQYGVSPRPSWV-LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
T Y C+ +G + + +D L + +NI F G D +S+ G
Sbjct: 441 STFPATLYEDTCEGVFGAKYDSAGIHANIRATNDDFGGLNVNATNIYFVQGALDGWSKVG 500
Query: 438 WVKTYHFFDLSFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+Q + SHC D SD LV +K +K++ W+
Sbjct: 501 ---------AGVAQGATIIPYASHCPDTGSISASDSAELVASKKKLIKLVAQWL 545
>gi|270158373|ref|ZP_06187030.1| putative serine carboxypeptidase [Legionella longbeachae D-4968]
gi|269990398|gb|EEZ96652.1| putative serine carboxypeptidase [Legionella longbeachae D-4968]
Length = 465
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 109/449 (24%), Positives = 187/449 (41%), Gaps = 48/449 (10%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
++ Q IDH + TF QRY I+ + +SP+ ++ E+ L+G
Sbjct: 49 YFKQLIDH---NNPATGTFSQRYYIDETYGPKDDSPVFFYICGESACSKRA-LNGAIRNY 104
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A +F A +V LEHRYYG+S+P S + Y + AL D A H+ + N
Sbjct: 105 AQKFNAKLVALEHRYYGESLPLNSL-----STNDLRYLTTEAALDDLAYFQRHLTSEKN- 158
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
H +A G SY G L+ ++RLKYP++V+G+LASSAP++ + + + + V+
Sbjct: 159 WHGKWVAFGGSYPGSLSAYYRLKYPYLVVGALASSAPVMAKENFIEYDAHVTQVA----- 213
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY------S 288
+C +++ ++++ S D + F L + ++D +D +Y +
Sbjct: 214 -GLQCANQMREVVTQVET--SLKDTVT-----FAQIKSLFDAAAVEDPVDFLYLIADTGA 265
Query: 289 EAAQYDTPSNIPVKRICNAIENAPNC--GDDILCKIAAGVVEADSLEYDGNNSRCYINED 346
A QY C+++ P G K + ++E + +D
Sbjct: 266 AAVQYGMRD-----AFCSSLSEHPTPLEGYAYFAKKLYKDMATTAVEMTAQGAMSENPQD 320
Query: 347 RTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEP--WNLTKYIKNCKEQYGVS-PRPSWV 403
W +QSC E + N NL + C+ +G++ P +
Sbjct: 321 YQNGLGMRQWYYQSCKEYGYWQNANPNPALSTRSSLINLDYHYNVCQRLFGLTEPAHTAE 380
Query: 404 L--TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWV-KTYHFFDLSFSQDLNLG-SHC 459
L T+Y ++ SNI F+NG DP+S + + + + L G +HC
Sbjct: 381 LNNTFYFP-----LMNILASNIYFTNGENDPWSTLSLAERNGNATNPRLTYQLIQGEAHC 435
Query: 460 LDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
DL D D L RK ++ W+
Sbjct: 436 NDLRTPSSVDSDSLKDARKKMESLLTEWL 464
>gi|395529078|ref|XP_003766647.1| PREDICTED: thymus-specific serine protease-like [Sarcophilus
harrisii]
Length = 323
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 105/195 (53%), Gaps = 17/195 (8%)
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG SVP G L+ R + +S ALAD AS +H+ YN
Sbjct: 94 APHWGALVISLEHRFYGHSVPPGG--LGLEQLR---FLSSRHALADVASARVHLSRIYNI 148
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
+ +P ++ G SY G LA W RLK+PH++ ++ASSAP+ D +++Y+ VS+
Sbjct: 149 SASSPWVSFGGSYAGSLAAWARLKFPHLIWAAVASSAPVQAQLD---FSSYNRGVSRSLA 205
Query: 234 DT----SEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPL---KNTTELKDGLDT 285
D S +C + + ++SE+D ++ + A L + C L ++ EL + L+
Sbjct: 206 DPTVGGSLKCRRAVALAFSELDRGLSEGTEARAALQSAVRACGALEAPEDQAELLEQLEG 265
Query: 286 VYSEAAQYDTPSNIP 300
+ A QYD + P
Sbjct: 266 LVGAAVQYDQQAGAP 280
>gi|289163387|ref|YP_003453525.1| serine carboxypeptidase [Legionella longbeachae NSW150]
gi|288856560|emb|CBJ10365.1| Similar to eukaryotic serine carboxypeptidase S28 family protein
[Legionella longbeachae NSW150]
Length = 466
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 109/449 (24%), Positives = 187/449 (41%), Gaps = 48/449 (10%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
++ Q IDH + TF QRY I+ + +SP+ ++ E+ L+G
Sbjct: 50 YFKQLIDH---NNPATGTFSQRYYIDETYGPKDDSPVFFYICGESACSKRA-LNGAIRNY 105
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A +F A +V LEHRYYG+S+P S + Y + AL D A H+ + N
Sbjct: 106 AQKFNAKLVALEHRYYGESLPLNSL-----STNDLRYLTTEAALDDLAYFQRHLTSEKN- 159
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
H +A G SY G L+ ++RLKYP++V+G+LASSAP++ + + + + V+
Sbjct: 160 WHGKWVAFGGSYPGSLSAYYRLKYPYLVVGALASSAPVMAKENFIEYDAHVTQVA----- 214
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY------S 288
+C +++ ++++ S D + F L + ++D +D +Y +
Sbjct: 215 -GLQCANQMREVVTQVET--SLKDTVT-----FAQIKSLFDAAAVEDPVDFLYLIADTGA 266
Query: 289 EAAQYDTPSNIPVKRICNAIENAPNC--GDDILCKIAAGVVEADSLEYDGNNSRCYINED 346
A QY C+++ P G K + ++E + +D
Sbjct: 267 AAVQYGMRD-----AFCSSLSEHPTPLEGYAYFAKKLYKDMATTAVEMTAQGAMSENPQD 321
Query: 347 RTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEP--WNLTKYIKNCKEQYGVS-PRPSWV 403
W +QSC E + N NL + C+ +G++ P +
Sbjct: 322 YQNGLGMRQWYYQSCKEYGYWQNANPNPALSTRSSLINLDYHYNVCQRLFGLTEPAHTAE 381
Query: 404 L--TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWV-KTYHFFDLSFSQDLNLG-SHC 459
L T+Y ++ SNI F+NG DP+S + + + + L G +HC
Sbjct: 382 LNNTFYFP-----LMNILASNIYFTNGENDPWSTLSLAERNGNATNPRLTYQLIQGEAHC 436
Query: 460 LDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
DL D D L RK ++ W+
Sbjct: 437 NDLRTPSSVDSDSLKDARKKMESLLTEWL 465
>gi|290997480|ref|XP_002681309.1| predicted protein [Naegleria gruberi]
gi|284094933|gb|EFC48565.1| predicted protein [Naegleria gruberi]
Length = 469
Score = 102 bits (253), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 118/447 (26%), Positives = 189/447 (42%), Gaps = 57/447 (12%)
Query: 50 GFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP-ILAFLGAEAPIDDNIQLS 108
G F+++Q IDHF+ + TFPQRY +SP L ++ EA +
Sbjct: 31 GEPEFWFHQRIDHFN--ALNTDTFPQRYYKFVPEGVSASSPNHLLYICPEATCGGTP--N 86
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
+ A + KA I LEHR+YGKSVP+ S +K Y + ALAD + + +I
Sbjct: 87 NYVKNYAMELKATIYTLEHRFYGKSVPYKS----MKTVNMANYLKTEMALADLSVFIEYI 142
Query: 169 KDKYNATHAP--VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHS 226
+ + P I +G SY G L+ +F +KYPH+V G+L+SS + I + T+
Sbjct: 143 ATLPSDNNTPHQFIIVGCSYPGALSAFFSMKYPHLVKGALSSSGVV---NSILDFYTFDM 199
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
V + EC + ++ S ++ + P L + F+ + + +L +
Sbjct: 200 HVQQ---AAGPECTALLTRATSIMEKM--NPTNLL---RDFQAPADM-DIRDLFLLFGDI 250
Query: 287 YSEAAQYDTPSNIPVKRICNAIENAPNCGDDIL--------CKIAAGVVEADSLEY---- 334
E+ QY +CNA+++ D+++ V E L+Y
Sbjct: 251 AGESVQYGYH-----YELCNAMKSGNTNLDEVIYQNFHNYSLNFFYKVFETSPLDYYNGA 305
Query: 335 DGNNSRCYINEDRTGDESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPW-NLTKYIKNCKE 392
GN++ + G W Q+CSE+ KN W +L + CK+
Sbjct: 306 IGNDNY----DPSQGANGGRSWWLQTCSELSYFNTAPPKNLPSIRSRWLDLDYFYDKCKK 361
Query: 393 QYG--VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFS 450
+G + P +V YG K +L +T +F NG +DP+ R G F SF
Sbjct: 362 IFGYPIKPNTDFVNNQYGA---KQLLNTTTGRTVFVNGSQDPWLRAGVDIDPKKF--SFL 416
Query: 451 QDLNLGSHCLDLDEAKKSDPDWLVQQR 477
+ N HC+DL + P W Q
Sbjct: 417 IECNNCGHCVDL----RGCPSWAQNQE 439
>gi|241161684|ref|XP_002408971.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215494440|gb|EEC04081.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 201
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 69/205 (33%), Positives = 101/205 (49%), Gaps = 18/205 (8%)
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
LA WFRLKYPHV +LASSAPIL + ITP + + V++K F S++C I+ S+
Sbjct: 2 LAAWFRLKYPHVTTAALASSAPILLFTGITPCSAFSEVLTKAFAKESDQCTNAIRTSFEV 61
Query: 250 IDNIASKPDGLAILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTP---------SN 298
A +G L ++F+ C PL N T L+D VY+ A ++ P
Sbjct: 62 TRKQAVTEEGAKALKEQFRLCKPLAPSNYTVLRDWFWDVYAYLAMFNHPYASKLPLLVPG 121
Query: 299 IPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINE-DRTGDESDEGWE 357
PVK C +E N DD + G+ +A S+ + + + N+ + GW
Sbjct: 122 HPVKEACKFLEK--NFADDQ--SLLDGIYQAISV-FTNYTGKTHCNDLPNSAVPLLGGWG 176
Query: 358 WQSCSEMVVPMGKD-KNSMYQPEPW 381
Q C+EMV+PM + K M+ PW
Sbjct: 177 IQLCNEMVMPMCNNGKTDMFFDNPW 201
>gi|195109598|ref|XP_001999370.1| GI24472 [Drosophila mojavensis]
gi|193915964|gb|EDW14831.1| GI24472 [Drosophila mojavensis]
Length = 499
Score = 101 bits (252), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 82/277 (29%), Positives = 128/277 (46%), Gaps = 15/277 (5%)
Query: 12 LLMVIFVSTSFHA-NGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESY 70
LL +FVS S A N + L S ++ + + Q +DHF G
Sbjct: 12 LLFALFVSGSLKALNPYRHSWELLLQEPSSGPYTREDAAAVQELWLTQNLDHFEAGDN-- 69
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+T+ RY N+K+ P+ FLG E I + +G T++ A + ++ EHRYY
Sbjct: 70 RTWQMRYFRNAKY-HKPQGPMYIFLGGEWTITPGLLSTGLTHDMAVENAGILFYTEHRYY 128
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK-YNATHAPVIAIGASYGGE 189
G+S PF + +KN + Y N QALAD A + + K + N TH+ VI IG SY G
Sbjct: 129 GQSWPFENNNLTVKNLK---YLNLHQALADVAHFIRYQKSQSANLTHSKVILIGGSYSGS 185
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
+A W YP +V ASSAP+L D + H V + C + +++ +
Sbjct: 186 MAAWMTHLYPELVAAVWASSAPLLAKADFYEYM--HRVENSLTLSYGSNCTRRLERGFEH 243
Query: 250 IDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
+ N+ ++ +L ++ C P N + D LD +
Sbjct: 244 LVNLFNENKASDLL-RRLNAC-PNYNAS---DSLDRI 275
>gi|66770655|gb|AAY54639.1| IP12434p [Drosophila melanogaster]
Length = 486
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 107/435 (24%), Positives = 181/435 (41%), Gaps = 40/435 (9%)
Query: 72 TFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYG 131
T+ R IN+K++ G SPI +LG E ID + SG + A Q ++ EHR++G
Sbjct: 72 TWQDRIYINNKYFVDG-SPIFIYLGGEWAIDPSGITSGLWKDIAKQHNGSLLYTEHRFFG 130
Query: 132 KSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELA 191
+S+P L Y + QALAD +++ +K + + V+A G SY +A
Sbjct: 131 QSIPI----TPLSTENLAKYQSVEQALADVINVIATLKQEDKYKDSKVVASGCSYSATMA 186
Query: 192 TWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE-CFQTIKKSWSEI 250
TW R YP ++ GS ASSAP+L + Y VV + + + C+ I + S
Sbjct: 187 TWIRKLYPEIIRGSWASSAPLLAK---VNFKDYMKVVGESYATLGGQYCYDLIDNATSYY 243
Query: 251 DNIASKPDGLAILSKKFKTCTPLKNTTE-----LKDGLDTVYSEAAQYDTPSNIPVKRIC 305
+N+ +G + K+ C+ +E + + +++ AQY P + C
Sbjct: 244 ENLFEIGNGTQAV-KELNLCSNFNVNSEQDRWQIFSTIANIFAGIAQYQKPEKYDIPTYC 302
Query: 306 NAIENAPNCGDDILCKIAAGVVEADS---LEYDGNNSRCYINEDRTG-DESDEGWEWQSC 361
+ + + L K + S L + Y + +SD W +Q+C
Sbjct: 303 SILREFSDDDSVALSKFINWKINEHSGACLSTTFKGAVGYYEWSKENYQDSDLPWIFQTC 362
Query: 362 SEMVVPMGKDKNSMYQPEPWN----LTKYIKNCKEQYGVSPRPSWV-LTYYGGHDIKLIL 416
SE G ++S + +P+ T Y C+ +G + + +D L
Sbjct: 363 SE----FGWFQSSGSRSQPFGSTFPATLYEDTCEGVFGAKYDSAGIHANIRATNDDFGGL 418
Query: 417 RRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL---GSHCLDLDEAKKSDPDWL 473
+ +NI F G D +S+ G +Q + SHC D SD L
Sbjct: 419 NVNATNIYFVQGALDGWSKVG---------AGVAQGATIIPYASHCPDTGSISASDSAEL 469
Query: 474 VQQRKTEVKIMQGWI 488
V +K +K++ W+
Sbjct: 470 VASKKKLIKLVAQWL 484
>gi|158298288|ref|XP_318471.4| AGAP004014-PA [Anopheles gambiae str. PEST]
gi|157014452|gb|EAA13580.4| AGAP004014-PA [Anopheles gambiae str. PEST]
Length = 469
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 82/241 (34%), Positives = 117/241 (48%), Gaps = 23/241 (9%)
Query: 46 DSNHGF--ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
D HG E++F + +DHF+ P + F RY IN + PI +GA PI
Sbjct: 17 DVAHGMLRESWFETR-VDHFN--PRNQDKFAMRYYINDEH-AYARGPIFIVVGAAEPIQT 72
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRG--YFNSAQALADY 161
G Y+ A+ A + E RY+G S P ++NA + N+ QALAD
Sbjct: 73 RWITEGLFYDIAYLEGAYLFANELRYFGYSRP-------VENAETENLDFLNADQALADL 125
Query: 162 ASILLHIKDKYNAT-HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP 220
A + ++K+ Y +A VI +G +YGG LATWFR KYPH+V G SS I D
Sbjct: 126 AEWITYLKETYTYNPNAKVILMGTAYGGALATWFRQKYPHLVDGVWVSSGAI--EADFA- 182
Query: 221 WTTYHSVVSKDFRD-TSEECFQTIKKSWSEIDNIASKPDGLA-ILSKKFKTCTPLKNTTE 278
+ Y+ + + R S+ C+ TI + N+A GLA +LS +F C PL +
Sbjct: 183 FAGYNEALGESIRQYGSDACYSTIWSGFRVAQNMAHL--GLAELLSTEFHLCEPLDTDND 240
Query: 279 L 279
L
Sbjct: 241 L 241
>gi|195391900|ref|XP_002054597.1| GJ22719 [Drosophila virilis]
gi|194152683|gb|EDW68117.1| GJ22719 [Drosophila virilis]
Length = 478
Score = 101 bits (251), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 81/299 (27%), Positives = 136/299 (45%), Gaps = 32/299 (10%)
Query: 18 VSTSFHANGLKLRP----------RLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGP 67
V+ + AN LKL+ L R R + + D+ E + Q +DHF
Sbjct: 12 VALASQANSLKLKKDVPVFVKTLKELYRGPPQRTVARADTA---EEKWITQPLDHFD--E 66
Query: 68 ESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEH 127
+ +T+ RY +N +F G SPI FLG E + G Y+ A + +++ EH
Sbjct: 67 SNTKTYEMRYFLNDEFQTDG-SPIFIFLGGEWEASPGMIQQGHWYDMAKEHNGVLIYTEH 125
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHAPVIAIGASY 186
RYYG+SVP + +L+N + Y + QALAD A + K + T++ V+ G SY
Sbjct: 126 RYYGESVP--TETMSLENLQ---YLHVKQALADVARFIETFKSENAQLTNSKVLLAGGSY 180
Query: 187 GGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKK 245
+ WF+ YP +V+G ASSAP+L D Y V + F + ++C+ I+
Sbjct: 181 SATMVVWFKRLYPDLVVGGWASSAPLLAKVDFYE---YKEVTGRAFLELGGQKCYDRIQN 237
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK-----DGLDTVYSEAAQYDTPSNI 299
+E++ + A + C+ + +L + +++ AQ +P +I
Sbjct: 238 GIAELEYMFDNKRA-AEARAMLRICSSFDHENDLDMWNLFGSISNIFASVAQTQSPGDI 295
>gi|410958405|ref|XP_003985809.1| PREDICTED: thymus-specific serine protease [Felis catus]
Length = 416
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 96/179 (53%), Gaps = 11/179 (6%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ + ++F QRY +N + W + P+ LG E + G
Sbjct: 60 WLEQPLDPFNTSDQ--RSFLQRYWVNDQHWASRHGPVFLHLGGEGSLRPGSVTRGHPAAL 117
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L A+ R + +S ALAD AS L + +N
Sbjct: 118 APAWGALVIGLEHRFYGLSIPAG----GLDVAQLR-FLSSRHALADVASARLALGRLFNV 172
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
++ +P I G SY G LA W RLK+PH++ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 173 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD---FSEYNEVVSRSL 228
>gi|195497939|ref|XP_002096313.1| GE25601 [Drosophila yakuba]
gi|195497943|ref|XP_002096315.1| GE25604 [Drosophila yakuba]
gi|194182414|gb|EDW96025.1| GE25601 [Drosophila yakuba]
gi|194182416|gb|EDW96027.1| GE25604 [Drosophila yakuba]
Length = 485
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 115/477 (24%), Positives = 198/477 (41%), Gaps = 48/477 (10%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL 92
L + R +E + E + Q +D+F + T+ R IN+K++ G SPI
Sbjct: 34 LRELHRGPPVEPMKTRAKVEERWITQKLDNFD--DSNNATWQDRIYINNKYFVDG-SPIF 90
Query: 93 AFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYF 152
+LG E ID + SG + A Q ++ EHR++G+S+P L Y
Sbjct: 91 IYLGGEWAIDPSGITSGLWKDIAKQHNGSLLYTEHRFFGESIPI----TPLSTENLAKYQ 146
Query: 153 NSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
+ QALAD +++ +K + + V+ G SY +ATW R YP ++ GS ASSAPI
Sbjct: 147 SVEQALADVINVIATLKQEDKYKDSKVVVSGCSYSATMATWIRKMYPEIIRGSWASSAPI 206
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEE-CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
L + Y VV + + + C+ I + S +N+ +G +K+ C+
Sbjct: 207 LAK---VNFKDYMKVVGESYATLGGQYCYDLIDNATSYYENLFEIGNGTQ-AAKELNLCS 262
Query: 272 PL-----KNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILC------ 320
++ ++ + +++ AQY P + C+ + + DD +
Sbjct: 263 NFDVNSDQDRWQIFSTIANIFAGIAQYQKPEKYDIPTYCSILREFSD--DDSVALSKFIN 320
Query: 321 -KIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPE 379
KI S + G+ ++D D SD W +Q+CSE G ++S + +
Sbjct: 321 WKINEHSGACLSTTFKGSVGYYEWSKDNYQD-SDLPWVFQTCSE----FGWFQSSGSRSQ 375
Query: 380 PWNLT----KYIKNCKEQYGVSPRPSWV-LTYYGGHDIKLILRRSTSNIIFSNGMRDPFS 434
P+ T Y C+ +G + +D L + +NI F G D +S
Sbjct: 376 PFGSTFPASLYEDTCEGVFGSKYDSDGIHANVRATNDDFGGLNVNATNIYFVQGALDGWS 435
Query: 435 RGGWVKTYHFFDLSFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ G +Q + SHC D +D LV +K +K++ W+
Sbjct: 436 KVG---------AGVAQGATIIPYASHCPDTGSISATDSAELVASKKKLIKLVGQWL 483
>gi|449701650|gb|EMD42428.1| serine carboxypeptidase (S28) family protein, partial [Entamoeba
histolytica KU27]
Length = 195
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 87/164 (53%), Gaps = 12/164 (7%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG--NSPILAFLGAEAPIDDNIQLSG 109
E+ Y +DHF+ ++ F +Y +N KF N+P+ LG E P + +
Sbjct: 38 ESMTYTVPLDHFNANNQN--DFDIQYFVNKKFLDANDPNAPLFVLLGGEGPASPKVLQNN 95
Query: 110 FTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
+ ++ A + K L++ +EHR+YG S P + Y + QAL DY ++ H+
Sbjct: 96 YVIDSLAKKHKGLMLSVEHRFYGASTPSLEMDKLI-------YCTAEQALMDYVEVISHV 148
Query: 169 KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
+++ N PVI +G SY G LA W R KYP+VV G+ ASSAP+
Sbjct: 149 QEENNLVGHPVIVLGGSYSGNLAAWMRQKYPNVVEGAWASSAPV 192
>gi|299473646|emb|CBN78040.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 559
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 111/230 (48%), Gaps = 18/230 (7%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS--PILAFLGAEAPIDDNIQLSG 109
E F Q +DHF ++ + F QRY IN K+W G +S P+ +G E P + LS
Sbjct: 68 EELFVEQRLDHFDR--QNSRKFLQRYFINKKYWAGASSGAPVFLCVGGEGPPLEANVLSE 125
Query: 110 FTYEN-----AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
+ N A + AL++ +EHRYYGKS P A + R + +S QALAD +S
Sbjct: 126 SVHCNDMLELAPEHNALVLAVEHRYYGKSNP--GDDWATDSLR---WLSSQQALADLSSF 180
Query: 165 LLHIKDKYNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+ DK T A + G SY G LA W RLKYPH+ +++SS+P+ D +
Sbjct: 181 HGFLSDKEGLTGAEKWVTWGGSYPGMLAGWARLKYPHLFHAAVSSSSPMKAQLDFPQYAE 240
Query: 224 Y--HSVVSK-DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
S+ S D SEEC ++ + I + +G L F+ C
Sbjct: 241 VMRDSLASGVDGVGGSEECASAVEAGHASIGELLLTEEGQLELVATFQLC 290
>gi|195062810|ref|XP_001996258.1| GH22292 [Drosophila grimshawi]
gi|193899753|gb|EDV98619.1| GH22292 [Drosophila grimshawi]
Length = 480
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 109/489 (22%), Positives = 201/489 (41%), Gaps = 73/489 (14%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL 92
L + R ++ ET + +Q +D+F G E + + R +IN ++ G SPI
Sbjct: 33 LNELHRGPPMQLISKRANVETRWISQKLDNFDEGNE--EVWDDRVLINEDYFVDG-SPIF 89
Query: 93 AFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYF 152
+LG E I+ + +G + A + +V EHR++G+SVP A + Y
Sbjct: 90 IYLGGEWEIEPSPITAGHWVDIASEHNGSLVYTEHRFFGQSVPIKPLTTA-----NLKYQ 144
Query: 153 NSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
N QALAD +++ +K++ ++ V+ G SY +A W + YP V++GS ASSAP+
Sbjct: 145 NVEQALADVVNVINVLKEEEKYKNSKVVVQGCSYSATMAVWIKKLYPDVIVGSWASSAPL 204
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEE-CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
D + Y VV + +R+ + C+ I + S +++ +G +KK
Sbjct: 205 QAKVD---FKAYMKVVGQAYRELGGDYCYNIIDNATSFYEDLFE--NGQNAEAKKLLNLC 259
Query: 272 PLKNTTELKD------GLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAG 325
N + D + + + AQY P+N + + C+ + + L K
Sbjct: 260 DNFNENDQHDQWQIFSTIANILAGLAQYQNPANYDLAKHCSVLRSFSTDDATALSKF--- 316
Query: 326 VVEADSLEYDGNNSRC-----------YINEDRTGDESDEGWEWQSCSEMVVPMGKDKNS 374
+++ +N C Y D S W +Q+C+E G ++S
Sbjct: 317 ------IQWRLDNPECVNTVYKATVKYYKWAMHNYDGSGLSWFFQTCNE----FGWYQSS 366
Query: 375 MYQPEPWN----LTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNI 423
+ +P+ T Y CK+ +G + S YGG + + N+
Sbjct: 367 GSKNQPFGSSFPATLYTDTCKDVFGSKYTAAKIEKHISEKNKVYGG------VNPNVENV 420
Query: 424 IFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTE 480
++G DP +H +Q + SHC D+ D ++ ++
Sbjct: 421 YMTHGGLDP---------WHPVGAGAAQGATIIPQASHCSDMGSISAKDSPAMIASKQRV 471
Query: 481 VKIMQGWIT 489
++++ W+
Sbjct: 472 AQLVREWLA 480
>gi|349805101|gb|AEQ18023.1| hypothetical protein [Hymenochirus curtipes]
Length = 294
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 103/195 (52%), Gaps = 18/195 (9%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
Q +DHF+ + T+ QRY +N + W + P+ ++G E + + LSG E A
Sbjct: 4 QPLDHFN--RRNNATYRQRYWVNEEHWRQPDGPVFLYIGGEGSLSEFSVLSGEHVELAQT 61
Query: 118 FKALIVILEHRYYGKSV-PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATH 176
+AL+V LE +YG S+ P G +LK + +S QALAD AS L I KYN T
Sbjct: 62 HRALLVSLE-CFYGSSINPDGMTLESLK------FLSSQQALADLASFHLFISHKYNLTR 114
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT- 235
I G SY G L+ WF LK+PH+V S+ASSAP+ + +T Y+ VV+ D
Sbjct: 115 NTWICFGGSYPGSLSAWF-LKFPHLVYASVASSAPVRAELN---FTGYNKVVAWSLADLV 170
Query: 236 ---SEECFQTIKKSW 247
SE+C + K +
Sbjct: 171 IGGSEKCLDAVIKGF 185
>gi|242039113|ref|XP_002466951.1| hypothetical protein SORBIDRAFT_01g017300 [Sorghum bicolor]
gi|241920805|gb|EER93949.1| hypothetical protein SORBIDRAFT_01g017300 [Sorghum bicolor]
Length = 401
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/331 (29%), Positives = 139/331 (41%), Gaps = 39/331 (11%)
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A +F A +V EHRYYGKS PF ++ +N R + +S QAL D A + ++ NA
Sbjct: 19 AKKFGAAVVTPEHRYYGKSSPF--KQLTTENLR---FLSSKQALFDLAVFRQYYQESLNA 73
Query: 175 THA-------PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+ P IG SY G L+ WFRLK+PH+ GSLASS +L + T
Sbjct: 74 RYNRSSGFDNPWFVIGISYAGALSAWFRLKFPHLTCGSLASSGVVLAVYNYT-------- 125
Query: 228 VSKDF-RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKT---CTPLKNTTELKDGL 283
DF + E K EI + + L + S KT LKN + L
Sbjct: 126 ---DFDKQVGESAGPQCKAILQEITELVD--EQLRLDSHSVKTLFGAQTLKNDGDFLFFL 180
Query: 284 DTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYI 343
+ QY P + ++ A +N N + + G + + R Y+
Sbjct: 181 ADAAAITFQYGNPDAL-CPQLIKAKKNRKNLV-EAYAQFVKGFYIKEMETPPSSYDREYL 238
Query: 344 NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
E D S W +Q CSE+ KN + N ++ C+ +G P
Sbjct: 239 KETTPEDSSTRLWWFQVCSEVAYFQVAPKNDSVRSARVNTKYHLDLCRYVFGEGVYPDVF 298
Query: 404 LT--YYGGHDIKLILRRSTSNIIFSNGMRDP 432
+T YYGG I + S I+F+NG +DP
Sbjct: 299 MTNLYYGGTGI------AASKIVFTNGSQDP 323
>gi|308491795|ref|XP_003108088.1| hypothetical protein CRE_10077 [Caenorhabditis remanei]
gi|308248936|gb|EFO92888.1| hypothetical protein CRE_10077 [Caenorhabditis remanei]
Length = 794
Score = 100 bits (248), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 129/506 (25%), Positives = 198/506 (39%), Gaps = 69/506 (13%)
Query: 30 RPRLGRIRRSRILEQKDSNHGFETFF-YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN 88
RPR G + + E S + + +Q +DHFS + T+ QRY NSKF+
Sbjct: 27 RPREGLVASASDPEGPVSEDKYMVYSEIDQVVDHFSNTTSA--TWRQRYQYNSKFYNKTV 84
Query: 89 SPILAFLGAEAPIDDNIQLSGFTYEN------AHQFKALIVILEHRYYGKS--VPFGSRK 140
+ LG E I+ +E A +F A +EHR+YG P G +
Sbjct: 85 GYVFLMLGGEGSINATNGDKWVRHEAETMMVWAAEFGAGAFQVEHRFYGSKGFSPIGDQT 144
Query: 141 A-ALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV-IAIGASYGGELATWFRLKY 198
+LK QALAD + + Y P+ I G SY G L+ WFR Y
Sbjct: 145 TESLK------LLTIDQALADIKEFINQMNALYFPLDKPIWITFGGSYPGSLSAWFRETY 198
Query: 199 PHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIA-SKP 257
P + G+++SS+ + + D + Y K +R S+ C IK ++ ++ A + P
Sbjct: 199 PEMTAGAVSSSSAVHVFVD---YYGYAINTEKTYRTVSDSCGDVIKTAFQQMQKKAYNGP 255
Query: 258 DGLAILSKKFKTCTP-------------LKNTTELKDGLDTVYSEAAQYDTPSNIPVKRI 304
D +L K F C +N G++ + T S + V
Sbjct: 256 DSRELLKKTFNLCDSFDENNLSKSIQFFFQNVYGYFQGINQYTGDNRNNATRSGLGVPGA 315
Query: 305 CNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN-----EDRTGDESD----EG 355
CN + NA GD+I ++ A + DS G Y + D T + D
Sbjct: 316 CNILNNA-TLGDEIT-RVVAVMDWYDSWGSPGCRPNSYTSFIKYYSDTTMPDDDRISTRS 373
Query: 356 WEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL---------- 404
W WQ+C+E+ D N L + C + +G P + L
Sbjct: 374 WIWQTCTELGYYQTTDGGNGGIFGSTVPLDFFADQCIDLFG----PEYTLDNTFKLIDQV 429
Query: 405 -TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLD 463
T YGG D R T N+ F NG DP+ G T ++ S ++ +HC D+
Sbjct: 430 RTKYGGADA----YRGT-NVCFPNGSFDPWQDLGHKATNTNNNVD-SWLIDGTAHCADMY 483
Query: 464 EAKKSDPDWLVQQRKTEVKIMQGWIT 489
A+ SD L R+ + W++
Sbjct: 484 PARDSDKQSLKDARRRIHDQLSRWLS 509
>gi|410931333|ref|XP_003979050.1| PREDICTED: dipeptidyl peptidase 2-like, partial [Takifugu rubripes]
Length = 317
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 147/323 (45%), Gaps = 38/323 (11%)
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD-TSEECFQTIK 244
YGG L+ + RL+YP++V G+LA+SAPIL T S+ FRD T+
Sbjct: 1 YGGMLSVYMRLRYPNMVAGALAASAPILS-------TAGLGDPSQFFRDVTAVSPVCLSA 53
Query: 245 KSWSEIDNIASKPDGLAILSKKFKTCTP-------------LKNTTELKDGLDTVYSEAA 291
PD I +K C P L+N L LD YS
Sbjct: 54 CLSLSALLSLLLPDYRRI-QEKLSLCKPPSSPEDIYQLNGLLRNAFTLMAMLDYPYSTHF 112
Query: 292 QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVV--EADSLEYDGNNSRCYINEDRTG 349
+ P+N PVK C + A +++ + AG+V +L S D TG
Sbjct: 113 MGNMPAN-PVKVACETMLRASGLLENL--RDTAGIVYNSTGALGCFDLYSLYVQCADPTG 169
Query: 350 ---DESDEGWEWQSCSEMVVPM-GKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT 405
+ W++Q+C+E+ + ++ M+ P + T+ C +++ V PRP W+ T
Sbjct: 170 CGLGSNSLAWDYQACTEINLCYDSNNETDMFPPMTFGETERNIYCSKRWAVLPRPRWLQT 229
Query: 406 YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEA 465
+ G + + SNIIFSNG DP++ GG K+ ++ ++ G+H LDL +
Sbjct: 230 QFWGDALS-----AASNIIFSNGDLDPWANGGVRKSLSSSLIAV--NIPGGAHHLDLRGS 282
Query: 466 KKSDPDWLVQQRKTEVKIMQGWI 488
+DP+ +++ RKTE ++ W+
Sbjct: 283 NDADPESVIKARKTEADLIAQWV 305
>gi|344268151|ref|XP_003405926.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 431
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 98/206 (47%), Gaps = 10/206 (4%)
Query: 75 QRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEHRYYGKS 133
QRY IN F+ G P+ F+ E + T+ A + AL ++LEHR+YG S
Sbjct: 84 QRYFINDIFYKPG-GPVFLFIKGENAASRQMVTGNNTWTTYAERLGALCLLLEHRFYGHS 142
Query: 134 VPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATW 193
P G L A R Y NS QALAD + I T + G SYGG LA W
Sbjct: 143 QPTGD----LSTASLR-YLNSRQALADIVNFRTQIAKTMGLTKNKWVIFGGSYGGSLAVW 197
Query: 194 FRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNI 253
R+K P + ++ SSAP+L + Y + + + EC + ++K+++ I +
Sbjct: 198 SRIKRPDLFFAAVGSSAPML---AKVNFYEYLEIFYRTLTTYNIECARAVEKAFTLIVAL 254
Query: 254 ASKPDGLAILSKKFKTCTPLKNTTEL 279
P+ ++L+ F C PLK +E+
Sbjct: 255 VKIPEYYSMLADDFMLCEPLKINSEM 280
>gi|312090033|ref|XP_003146464.1| hypothetical protein LOAG_10893 [Loa loa]
Length = 390
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 93/358 (25%), Positives = 147/358 (41%), Gaps = 57/358 (15%)
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI-LYYGD-ITPWTTYHSVVSKD 231
T PVI G SYGG LA W R+KYPH+V G+ ASSAP+ ++YG I P + ++ +
Sbjct: 2 GTEIPVIVFGGSYGGMLAAWLRMKYPHIVDGAWASSAPLRIFYGTGINPESVSRTITTNY 61
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKF--KTCTPLKNTTELKDGLDTVYS- 288
+ + + I+ ++ +G L++ F K +K+ + +YS
Sbjct: 62 LTSGCDR--KVFSDGFVAIEKMSKTEEGRMKLNRIFHAKPGFEMKSYNDFMSLYSYIYSA 119
Query: 289 ----EAAQYDTPSNI-------PVKRICN----AIENAPNCGDDILCKIAAGVVEADSLE 333
Y P++ PVK +C A N N + I I L
Sbjct: 120 IFYMAMTDYPYPADFFEPLPGYPVKYVCQYAKKAATNEENLAEQIYSIINVYYNYTGQLT 179
Query: 334 YDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPM---GKDKNSMYQP-----EPWNLTK 385
+ S C D+ D W WQSC+ + + + G D + +P ++
Sbjct: 180 DNCFTSNCTTPSPIQNDDEDIAWNWQSCTSLTIQICDRGGDNDFFLNTCDNSGDP--VST 237
Query: 386 YIKNCKEQYGVSPRPSWVLTYYGGHDIKL---ILRRSTSNIIFSNGMRDPFSRGGWVK-- 440
IK C E + + +Y HD+ + ++ +TSNIIFSNG DP+S GG +
Sbjct: 238 NIKLCTELFK---DIGYNNNFYKLHDVTIRYGMIYNTTSNIIFSNGNLDPWSAGGVYENS 294
Query: 441 ----------TYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
Y F+ L +H LD DP + +R V I++ W+
Sbjct: 295 PGIMEAMKNGVYIFYMLG-------AAHHLDFRTPNTCDPPSVTHERFQVVNIIKCWV 345
>gi|323453612|gb|EGB09483.1| hypothetical protein AURANDRAFT_24425 [Aureococcus anophagefferens]
Length = 266
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/229 (28%), Positives = 112/229 (48%), Gaps = 22/229 (9%)
Query: 94 FLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFN 153
+ G E+P+++ + +G +E + AL+V EHRY G SVP A L++ Y +
Sbjct: 5 YTGNESPVEEYVNNTGLMWETGAELGALLVWAEHRYEGGSVP---ACAGLRDCLA--YAS 59
Query: 154 SAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL 213
QALADYA ++ ++ + P +A+G SYGG L++WFR KYP V+G++A SAP+
Sbjct: 60 VEQALADYAVVIDALRAEVG--DVPFVAVGGSYGGMLSSWFRFKYPTAVVGAIAGSAPVW 117
Query: 214 YYG-DITPWTTYHSVVSKDFRDTS----EECFQTIKKSWSEIDNIA----SKPDG---LA 261
+ D P +S+ + C ++ +W +++ A DG L
Sbjct: 118 GFPLDAPPLDGSAVAISRGAGFAGGLPDDRCASNLRAAWPVLEDFARTASPAADGTTALE 177
Query: 262 ILSKKFKTCTPLK---NTTELKDGLDTVYSEAAQYDTPSNIPVKRICNA 307
++ F C PL + EL + V+ + A+ + P + V + A
Sbjct: 178 AVAAAFDLCEPLASPDDVAELVQSVQGVFFDLAEANYPFLVGVPTLSRA 226
>gi|392898873|ref|NP_500595.2| Protein F19C7.2 [Caenorhabditis elegans]
gi|373219710|emb|CCD69714.1| Protein F19C7.2 [Caenorhabditis elegans]
Length = 540
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 128/489 (26%), Positives = 194/489 (39%), Gaps = 74/489 (15%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPI-----DDNIQLSGFTY 112
Q +DHFS G + + QRY NSKF+ + LG E I D ++ G T
Sbjct: 55 QKVDHFSNGTNN-GVWQQRYQYNSKFYNKTTGYVFLMLGGEGSINVTNGDKWVRHEGETM 113
Query: 113 EN-AHQFKALIVILEHRYYGKS--VPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
+F+A +EHR+YG P G + A QALAD + +
Sbjct: 114 MKWVAEFQAAAFQVEHRFYGSKEYSPIGDQTTA-----SMKLLTIDQALADIKEFITQMN 168
Query: 170 DKYNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
Y P+ + G SY G L+ +FR YP + G+++SS+ + + D + Y
Sbjct: 169 ALYFKDDKPIWVTFGGSYPGSLSAFFRETYPEMTAGAVSSSSAVHVFVD---YYEYAINT 225
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIA-SKPDGLAILSKKFKTCTPLKNTTELK------- 280
K +R S+ C IK ++ + A S PD A+L ++F C T K
Sbjct: 226 EKTYRTVSDSCGDVIKVAFQNLITKAYSGPDSRALLKQRFNLCDNFDETKLSKSVQFFFQ 285
Query: 281 ------DGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEY 334
G++ + T S + V CN + N GD+I IA V+ Y
Sbjct: 286 NVYGYFQGINQYTGDNRNNATRSGLGVPAACNLL-NDKTIGDEIQRVIA--VMNLYDSWY 342
Query: 335 DGNNSRCYIN---------EDRTGDESD----EGWEWQSCSEMVVPMGKD-KNSMYQPEP 380
++S C N D T + D W WQ+C+E+ D N
Sbjct: 343 KPSDSGCRPNNYTAFIQAYSDTTMPDDDTISTRSWIWQTCTELGYYQTTDGGNGGIFGST 402
Query: 381 WNLTKYIKNCKEQYGVSPRPSWVL-----------TYYGGHDIKLILRRSTSNIIFSNGM 429
L + C + +G P + L T YGG D R T N+ F NG
Sbjct: 403 VPLDFFADQCIDLFG----PEYTLDNTFKLVDQVRTKYGGAD----AYRGT-NVCFPNGS 453
Query: 430 RDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
DP+ G ++ S ++ +HC D+ A+ SD L R V+I G ++
Sbjct: 454 FDPWQGLGHTANITNNNVD-SWLIDGTAHCADMYPARDSDKQSLKDAR---VRI-HGHLS 508
Query: 490 QYYDDFKAI 498
++ D +AI
Sbjct: 509 RWLSDAQAI 517
>gi|32564815|ref|NP_498759.2| Protein K12H4.7, isoform b [Caenorhabditis elegans]
gi|351063157|emb|CCD71199.1| Protein K12H4.7, isoform b [Caenorhabditis elegans]
Length = 473
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 126/495 (25%), Positives = 204/495 (41%), Gaps = 85/495 (17%)
Query: 25 NGLKLRPRLGRIRRSRILEQKDSNHGFETFF---YNQTIDHF--SYGPESYQTFPQRYVI 79
N ++ RPR G ++++ + F+ + QT+DHF S G +TF QRY
Sbjct: 30 NMIRGRPR-GGMKKTPPMSSVSHMINFDNVVSSTFTQTLDHFDSSVG----KTFQQRYYH 84
Query: 80 NSKFWGGGNSPILAFLGAEAPIDDN-IQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFG 137
N++++ G P LG E P + G N A + A + +EHR+YG++ P
Sbjct: 85 NNQWYKAGG-PAFLMLGGEGPESSYWVSYPGLEITNLAAKQGAWVFDIEHRFYGETHP-- 141
Query: 138 SRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGELATWFRL 196
+ ++ N + Y +SAQA+ D A+ + + K+ +A + G SY G LA W R
Sbjct: 142 TSDMSVPNLK---YLSSAQAIEDAAAFIKAMTAKFPQLANAKWVTFGGSYSGALAAWTRA 198
Query: 197 KYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
K+P +V ++ SS P+ D + Y VV S EC ++ + ++ + ++
Sbjct: 199 KHPELVYAAVGSSGPVQAEVD---FKEYLEVVQNSITRNSTECAASVTQGFNLVASLLQT 255
Query: 257 PDGLAILSKKFKTCTPLK-NTTELKDGLDTVYS---EAAQYDTPSNIPVKRICNAIENAP 312
DG L F C ++ + LK +TVYS E QY
Sbjct: 256 SDGRKQLKTAFHLCQDIQMDDKSLKYFWETVYSPYMEVVQYS------------------ 297
Query: 313 NCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRT--------GDESDEGWEWQSCSEM 364
GD AAG A L R +IN T D ++ + C+++
Sbjct: 298 --GD------AAGSF-ATQLTISHAICRYHINTKSTPLQKLKQVNDYFNQVSGYFGCNDI 348
Query: 365 VVPMGKDKN---SMYQPEPWNLTK-YIKNCKEQYGVSPRPSWVLT-------YYGGHDIK 413
D N S + E + + YI C YG + V T YYGG D
Sbjct: 349 ------DYNGFISFMKDETFGEAQYYIDECTAIYGAAYNSQEVQTSVDYTNQYYGGRD-- 400
Query: 414 LILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWL 473
+T I+ NG DP+ G + + + + +N +HC D+ A D +L
Sbjct: 401 ---NLNTDRILLPNGDIDPWHALGKLTSSNSNIVPVV--INGTAHCADMYGASSLDSMYL 455
Query: 474 VQQRKTEVKIMQGWI 488
R+ ++ GW+
Sbjct: 456 TNARQRISDVLDGWL 470
>gi|195391902|ref|XP_002054598.1| GJ22718 [Drosophila virilis]
gi|194152684|gb|EDW68118.1| GJ22718 [Drosophila virilis]
Length = 476
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 105/457 (22%), Positives = 183/457 (40%), Gaps = 50/457 (10%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+T + Q +DHF +T+ RY++N F+ G P+ FLG E I +G
Sbjct: 49 QTLWIEQKLDHFD--ESETRTWQMRYMLNDVFFKAGG-PLFIFLGGEWEISTGRITAGHM 105
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
Y+ A + L+ EHRYYG+S P L N + Y + QALAD A + K
Sbjct: 106 YDMAKEHNGLLAYTEHRYYGESHPLPD----LSNENIQ-YLHVKQALADLAHFITTQKAT 160
Query: 172 YNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
Y + VI +G SY + TWF+ YP +V+G ASSAP+ + + Y V +
Sbjct: 161 YEGLVDSKVIIVGGSYSATMVTWFKKTYPDLVVGGWASSAPLFAKMN---FVEYKEVTGQ 217
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAI-----LSKKFKTCTPLKNTTELKDGLD 284
C+ I+ +E++ + + G + L ++F + L T + +
Sbjct: 218 SIALMGGSACYNRIENGIAEMEAMIASKRGAEVKALLKLCERFDVYSDLDIWTLFSE-IS 276
Query: 285 TVYSEAAQYDTPSNIP--VKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY 342
+++ Q NI ++I + IL + L YD
Sbjct: 277 DIFAGLVQTHDAGNIEGVCQKIMAESSDLVGVSSYILSEFEKSGGNCHDLSYDAMIG--V 334
Query: 343 INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN----LTKYIKNCKEQYGVSP 398
++E R W +Q+C+E G + S +P+ +T Y C + YG
Sbjct: 335 LSESRYTGNIMRQWIYQTCNE----YGWYQTSGSSAQPFGTKFPVTFYTTMCADLYGAQF 390
Query: 399 RPSWVLTY-------YGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
S++ + +GG L + N+ ++G DP+ G D + +
Sbjct: 391 TNSYIESRVAETNENFGG------LSPNVQNVYLTHGHLDPWRAMG------IQDETQAT 438
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ +HC D D + ++ ++++ W+
Sbjct: 439 IIPEHAHCKDFGSISVDDTAEMRASKERIAELVREWL 475
>gi|344268156|ref|XP_003405928.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 521
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 99/399 (24%), Positives = 163/399 (40%), Gaps = 55/399 (13%)
Query: 76 RYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEHRYYGKSV 134
+Y +N F+ G P+ + P FT+ A + AL ++LEHR+YG S
Sbjct: 42 KYYVNYDFYKPG-GPVFLMIEGHEPASIQWLKRSFTWITYAQRLGALCILLEHRFYGDSQ 100
Query: 135 PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWF 194
P + + R Y +S QA+AD A I N T + G YGG LA W
Sbjct: 101 PIRN----MSTEHLRRYLSSRQAVADIAEFRTVIAQSMNFTENKWVVFGGGYGGALAVWS 156
Query: 195 RLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIA 254
R+K+P++ +++SSA I + Y V+ + + EC + +K+++ + +
Sbjct: 157 RIKHPNLFAAAVSSSAMI---QAKVNFNEYFEVIYRTVDTHNSECLEAVKQAYGFVMAML 213
Query: 255 SKPDGLAILSKKFKTCTPLKNTTELK-----DGLDTVYSEAAQYDTPSN--------IPV 301
PD + L +K C P K +E+ + L +++ Q + N + +
Sbjct: 214 LLPDYHSRLIFDYKLCEPFKIKSEMDQLFVIEKLMLIFAAIVQNNKKKNTTVGIMRKMSI 273
Query: 302 KRICNAIENA---------PNCGDDILCKIAAGVVEADSLEY--DGNNSRCYINEDRTGD 350
C + N + +L A +Y + ++ N+ R G
Sbjct: 274 DEFCETMTNTSLGSPYHRYARIMNTMLRNKGYPCYPASYKQYVEEYSDFSFETNKYRRGR 333
Query: 351 ESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKY-IKNCKEQYGVSPRPSWVL---- 404
+ W +Q C+E D KNS + P T+Y +K C + +G V
Sbjct: 334 Q----WLYQCCNEFGWFYTTDLKNSSFTGLP---TRYFVKKCSDVFGPKFNNDSVFQGVM 386
Query: 405 ---TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
YYGG ++ + S IIFSNG DP+ R G K
Sbjct: 387 STNMYYGGLNV------TGSKIIFSNGSNDPWHRLGITK 419
>gi|302797683|ref|XP_002980602.1| hypothetical protein SELMODRAFT_113084 [Selaginella moellendorffii]
gi|300151608|gb|EFJ18253.1| hypothetical protein SELMODRAFT_113084 [Selaginella moellendorffii]
Length = 401
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 114/416 (27%), Positives = 167/416 (40%), Gaps = 46/416 (11%)
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+ F QRY ++ PI L E+ Q + A A +V +EHRYY
Sbjct: 3 RVFSQRYFEFLDYFQPQQGPIFLALCGESTCRGGYQRTAQAL--AKSLGAAVVTIEHRYY 60
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA-----PVIAIGAS 185
G+S PF + + KN + Y + QAL DYA + + ++ N + P I +G S
Sbjct: 61 GQSYPF--QNFSYKNLK---YLTTQQALYDYALFIEYYQNLINLRYNKQGKNPWIVVGGS 115
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
Y G L+ WFRLK+PH+V+ S ASS G + Y + + EC KK
Sbjct: 116 YAGALSAWFRLKFPHLVVASWASS------GVVEAVLEYSAYDEQIGISVGPEC----KK 165
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRIC 305
+ EI +A +GL + K+ + + D L V AA + I + +C
Sbjct: 166 ALQEITKLAE--EGLVTNATAVKSVFFAQKLRD-DDFLSLVADIAAGFVQYGAIDM--LC 220
Query: 306 NAIENAPNCGDDILCKIA--AGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSE 363
+ A D+L A GV + S YD R + + W +Q C+E
Sbjct: 221 PPLLEAIQNKTDLLMAYARIGGVDSSSSDFYDAYKLRRQAEANDISAKDTMSWNYQICTE 280
Query: 364 MVVPMGKDKNSMYQPEPWNL--------TKYIKNCKEQYGVS--PRPSWVLTYYGGHDIK 413
+ N + NL YI C +G + P S YGG DI
Sbjct: 281 LAYFQVAPTNDSIRSSRINLHVTSCCYVLDYIDICAVLFGPNTFPDVSAANWNYGGRDI- 339
Query: 414 LILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSD 469
++S IIF NG +DP+ T+ + SF +HC DL SD
Sbjct: 340 -----ASSRIIFLNGSQDPWQHAS-KTTFSPGEPSFVLTCESCAHCEDLSMDCSSD 389
>gi|167533847|ref|XP_001748602.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772843|gb|EDQ86489.1| predicted protein [Monosiga brevicollis MX1]
Length = 459
Score = 99.0 bits (245), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 106/408 (25%), Positives = 181/408 (44%), Gaps = 50/408 (12%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSK-FWGGGNSPILAFLGAEAP-IDDNIQLSGFTY 112
F Q +DHF G ++ T+ Q Y +NS F G ++P+ +G E P +D ++ ++
Sbjct: 11 FVTQRLDHFD-GSDT-TTWQQAYYVNSTYFQAGSDAPVYLCVGGEGPPLDGSVVVASVHC 68
Query: 113 ENAHQF----KALIVILEHRYYGKSVPFGSRKAALKNARHRG---YFNSAQALADYASIL 165
A + A++ LEHRYYG ++N +G Y +S QAL D A+ +
Sbjct: 69 NVAVELLPKTGAIMFALEHRYYGC---HNMSACPVENPLAKGALRYLSSRQALGDLAAFI 125
Query: 166 LHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW---T 222
+I+ +YN + ++ G SY G LA W RLKYPH+V S+ASSAP+ D+ + T
Sbjct: 126 SYIRQQYNLPNNKIVTFGGSYPGMLAGWARLKYPHLVHASVASSAPVEAVLDMRGYYDVT 185
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG 282
+ VS + S+ C I + I + G L+ F P + +
Sbjct: 186 AFAYSVSDNNVGGSDACRAAIATGHATIGQYFNSSSGRNTLANIFG--LPASYFEKYDNQ 243
Query: 283 LDTVYSEAAQYDTPSNIP--------VKRICNAIENAPNCGDDILCKIAAGVVEADSLEY 334
A + + SN P + IC + N + GD++ ++A VV LE+
Sbjct: 244 ASFAGGGVAYFPSQSNDPSCTQAGCNINLICQVMTNT-SLGDEVH-RLA--VVRKQQLEW 299
Query: 335 DGNNSRCYINED-RTGDESDEGWEWQSCSEMV----VPMGKDKNSMYQPEPWNLTKYIKN 389
+ + R G E+D W +Q+C+E +G D + L
Sbjct: 300 LPAAFESFATKTLRVGAEADY-WGYQTCTEFAFYQTCEVGSD--CFFTQGYLTLNATEAA 356
Query: 390 CKEQYGV---SPRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDP 432
C+ ++G+ + + + + + +YGG + + S +++ NG DP
Sbjct: 357 CQAEFGIDFTTVQQNVIASNAWYGGRN------SAGSCLMYPNGEVDP 398
>gi|195109612|ref|XP_001999377.1| GI23098 [Drosophila mojavensis]
gi|193915971|gb|EDW14838.1| GI23098 [Drosophila mojavensis]
Length = 386
Score = 98.6 bits (244), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 98/426 (23%), Positives = 175/426 (41%), Gaps = 55/426 (12%)
Query: 76 RYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVP 135
RY++N F+ G P+ +LG E I + +G Y+ A + L+ EHRYYG+S P
Sbjct: 2 RYLLNDVFFKAGG-PMFIYLGGEWTISNGFITAGHMYDMAKEHNGLLAYTEHRYYGESHP 60
Query: 136 FGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA-THAPVIAIGASYGGELATWF 194
L N R + + QALAD A + K Y + + VI +G SY + TWF
Sbjct: 61 L----PDLSNENLR-FLHVKQALADLAHFIKTQKASYEGLSDSKVIIVGGSYSATMVTWF 115
Query: 195 RLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNI 253
+ YP +V+G ASSAP++ + Y V+ + C+ I+K +E++ +
Sbjct: 116 KRTYPDLVVGGWASSAPVVAK---VNFFEYKEVMGESITLMGGSACYDRIEKGIAELETM 172
Query: 254 ASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPN 313
+ G A + K C P +L + T+++E ++ N ++ +
Sbjct: 173 FANKRG-AEVKALLKLCEPFDVNNDLD--IWTLFNEISEI----------FANVVQTHNS 219
Query: 314 CGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKN 373
+ +C+ + S + G +S + + W +Q+C+E G +
Sbjct: 220 GRIEGVCQ----QIMTGSTDLIGVSSYLLSEFGKASGSTLRQWIYQTCNE----YGWYQT 271
Query: 374 SMYQPEPWN----LTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSN 422
S +P+ L + C + YG + R + Y+GG L N
Sbjct: 272 SSSSMQPFGTKFPLALFTTMCADLYGPQFGESFIEDRAAETNEYFGG------LTPEVEN 325
Query: 423 IIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVK 482
+ FS+G DP+ G D + + +HC D D + ++ +
Sbjct: 326 VYFSHGQLDPWRAMG------IQDEKQATIIPEHAHCKDFGSISNDDTPEMRASKERVAE 379
Query: 483 IMQGWI 488
+++ W+
Sbjct: 380 LVREWL 385
>gi|444523275|gb|ELV13498.1| Thymus-specific serine protease [Tupaia chinensis]
Length = 393
Score = 98.6 bits (244), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 109/204 (53%), Gaps = 15/204 (7%)
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
++F QRY +N++ W G ++P+ LG E + + G A + AL++ LEHR+Y
Sbjct: 88 RSFLQRYWVNAQHWAGQDAPVFLHLGGEGGLGPGSVMRGHPAALAPAWGALVISLEHRFY 147
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT-HAPVIAIGASYGGE 189
G SVP G L A+ R + +S ALAD AS L + N + +P I G SY G
Sbjct: 148 GLSVPAG----GLGLAQLR-FLSSRHALADAASARLELSRLLNVSASSPWICFGGSYAGS 202
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
LA W RLK+PH+V S+ASSAP+ D ++ Y+ +V R CF ++ ++ +
Sbjct: 203 LAAWARLKFPHLVSASVASSAPVRAVLD---FSAYNEMV---MRSLCLRCF-SVSRAETV 255
Query: 250 IDNIASKPDGLAILSKK--FKTCT 271
A++P + ++ ++TCT
Sbjct: 256 AQLRATEPQAAVVGDRQWLYQTCT 279
>gi|302795835|ref|XP_002979680.1| hypothetical protein SELMODRAFT_419387 [Selaginella moellendorffii]
gi|300152440|gb|EFJ19082.1| hypothetical protein SELMODRAFT_419387 [Selaginella moellendorffii]
Length = 462
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 108/410 (26%), Positives = 179/410 (43%), Gaps = 47/410 (11%)
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYY 130
+TF QRY + ++ N P+ + E P IQ + ++ A +F A IV LEHRYY
Sbjct: 34 RTFAQRYYEFTDYFDAPNGPVFLKICGEGPCV-GIQ-NDYSAVLAKRFGAAIVSLEHRYY 91
Query: 131 GKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-----ATHAPVIAIGAS 185
G+S PF S + Y +S QAL D A+ + +D N + P I +G S
Sbjct: 92 GQSSPFKSHAT-----ENLIYLSSKQALFDLAAFREYYQDLINHRTNSTSDNPWIVMGGS 146
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
Y G L+ WF+LK+PH+ +GS+ASS + T + + +E T
Sbjct: 147 YSGALSAWFKLKFPHLAVGSVASSGVVQAIFKFTKFDEQAKLFL-----VAESAGATCSA 201
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY--SEAA----QYDTPSNI 299
+ + +A + L + + L N +L D +Y ++AA QY P
Sbjct: 202 ALRAVTRLAEQG-----LKENSVSTKALFNAEQLDVDGDFLYFLADAAAIAFQYGNPD-- 254
Query: 300 PVKRICNAIENAPNCGDDILCKIAAGVVE--ADSLEYDGNN-SRCYINED-RTGDE-SDE 354
+C+ + A +D+L A V + D+ + N + ++ E+ GD SD
Sbjct: 255 ---ILCSPLVAAYKRNEDLLAVYAKYVKDYYIDTFKSSINTYDQKHLKENLAAGDHSSDR 311
Query: 355 GWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT--YYGGHDI 412
W +Q C+E+ N+ + N+ ++ C +G P T YYGG+ I
Sbjct: 312 LWWYQVCTEVAYFQAAPANNSIRSALVNVKYHLDLCSNVFGNGTFPEVDNTNLYYGGNKI 371
Query: 413 KLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL 462
+ I+F NG +DP+ R +T + ++ +HC+D+
Sbjct: 372 R------GDKILFMNGSQDPW-RHASKQTSSRNEPAYVIKCQNCAHCVDM 414
>gi|157133206|ref|XP_001662800.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108870922|gb|EAT35147.1| AAEL012663-PA [Aedes aegypti]
Length = 485
Score = 98.2 bits (243), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 78/323 (24%), Positives = 156/323 (48%), Gaps = 26/323 (8%)
Query: 67 PESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILE 126
P++ +F RYV N++ + G PI F+G P++ ++ G + A + +V E
Sbjct: 64 PQNRNSFSMRYVTNNRHYRRG-GPIFLFVGGPWPLEAHLVEQGHFVDMAAEMNGFLVANE 122
Query: 127 HRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHAPVIAIGAS 185
RYYG+S+P + N R Y ++ Q L++ A+ + H+K D +A VI G
Sbjct: 123 LRYYGESIPV--EDVSRNNFR---YLHNVQILSELATFIAHLKEDVVRDPNAKVILAGVG 177
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD-TSEECFQTIK 244
Y LA W R ++PH++ G +SS + T + + VV ++ R ++C+ TI
Sbjct: 178 YSASLAQWMRQRFPHLIHGVWSSSGMVRAS---TNYREFAEVVGENIRRFGGDDCYSTIW 234
Query: 245 KSWSEIDNIASKPDGLA-ILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTPSNIP- 300
+++ +N+ GL+ + + F TC P+ N E++ +++E ++ +++
Sbjct: 235 RAFRTAENLIDA--GLSTTVDELFHTCRPIDAANALEVEAFFYGIFNEISREVVDADLRG 292
Query: 301 -VKRICNAIENAPNCGDDILCKIAAGVV----EADSLEYDGNNSRCYINEDRTGDESDEG 355
+K++C + + + ++ L ++A+ + A+ L D + + + +
Sbjct: 293 NIKQMCEPLTASDD--ENSLLELASWLTGRFPNAECLAMDFQSIAQWSSNHEIVKSGERQ 350
Query: 356 WEWQSCSEMVVPMGKDKNSMYQP 378
W +Q C+E+ P+ S YQP
Sbjct: 351 WMFQRCTELGWPL--TAASQYQP 371
>gi|344247366|gb|EGW03470.1| Thymus-specific serine protease [Cricetulus griseus]
Length = 265
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 99/190 (52%), Gaps = 11/190 (5%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ +TF QRY +N + G ++P+ +G E + ++G
Sbjct: 60 WLEQPLDPFNT--SDRRTFLQRYWVNDRHRAGQDAPVFLHIGGEGSLGPGSVMAGHPVAL 117
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L A+ R Y +S ALAD AS + N
Sbjct: 118 APAWGALVISLEHRFYGLSMPSG----GLDMAQLR-YLSSRHALADVASARQALSRLLNV 172
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LATW RLK+PH+V ++ASSAP+ + + Y+ VV++
Sbjct: 173 SSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPL---SAVLDFYAYNEVVARSLS 229
Query: 234 DTSEECFQTI 243
+ Q +
Sbjct: 230 QVAIGGSQEV 239
>gi|195569669|ref|XP_002102831.1| GD19291 [Drosophila simulans]
gi|194198758|gb|EDX12334.1| GD19291 [Drosophila simulans]
Length = 487
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 160/382 (41%), Gaps = 39/382 (10%)
Query: 1 MKSSQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILE--QKDSNHGFETFFYNQ 58
MK FS Q +IF SF A R + + +L K+ E + +Q
Sbjct: 1 MKWLNFSVQ-----LIFCLPSFLATFNPYRRNVELLNHEPVLGICSKNELASVEELWLDQ 55
Query: 59 TIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQF 118
+DHF + +T+ RY N+K + PI F+G E I + +G T++ A +
Sbjct: 56 KVDHFD--KNNNRTWKMRYYRNAKHFKP-QGPIYIFVGGEWTISPGLLSTGLTHDMAVEN 112
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHA 177
++ EHRYYG S+PFG + L N + + Q+LAD A + H K + +
Sbjct: 113 SGMLFYTEHRYYGLSLPFGHERYQLNNLKQ---LSLHQSLADLAHFIRHQKSNGPEMEDS 169
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-S 236
VI +G SY G L TW YP ++ S ASSAP+L D + Y VV K + +
Sbjct: 170 KVILVGGSYSGSLVTWMTQLYPDLIAASWASSAPLLAKAD---FFEYMEVVGKSIQLSYG 226
Query: 237 EECFQTIKKSWS---------EIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
+ C I+K + EI + +G S K PL + +GL +
Sbjct: 227 KNCSLRIEKGFKFLAKLFDGDEIQELLYNLNGCEGYSPK----NPL-DRAAFFNGLGNYF 281
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKI-----AAGVVEADSLEYDGNNSRCY 342
+ Q + IP R+C + + + + + + G +D ++ ++
Sbjct: 282 ALVVQSYRSAYIP--RLCETLMSLDSSDELAFIEFLKLLYSEGRRSSDCQDFGYSSMLEL 339
Query: 343 INEDRTGDESDEGWEWQSCSEM 364
ED W +Q+C+E
Sbjct: 340 FTEDSVQSSETRAWFYQTCNEF 361
>gi|308478052|ref|XP_003101238.1| CRE-PCP-3 protein [Caenorhabditis remanei]
gi|308263943|gb|EFP07896.1| CRE-PCP-3 protein [Caenorhabditis remanei]
Length = 1095
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 125/487 (25%), Positives = 194/487 (39%), Gaps = 70/487 (14%)
Query: 45 KDSNH--GFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID 102
KD+ + GFE + Q +HFS + TF Q++ N++ W P +G E P
Sbjct: 568 KDATYPPGFEQGTFRQKQNHFS--NQDPNTFQQKFFKNAQ-WAKPGGPNFLMIGGEGPEG 624
Query: 103 DNIQLS-GFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALAD 160
L+ TY A ++ A + +LEHR+YG SV + L +S Q L D
Sbjct: 625 AGWVLNQDITYLTWAKKYGATVYLLEHRFYGDSVVGDNTDFQL--------LSSLQMLYD 676
Query: 161 YASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP 220
A + I + T P I G SY G L+ W R +P VV+G++ASS P+ D
Sbjct: 677 LAEFIREINYR-TGTSNPWITFGGSYSGALSAWMREVFPDVVVGAVASSGPVFAKTD--- 732
Query: 221 WTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK 280
+ Y VV R C I+ ++ + + +G LS F+ P +
Sbjct: 733 FYEYLMVVENSIRTYDPTCADRIQSGFNTMRTMFLTKEGRKSLSDLFQLDPPFGDNVTDT 792
Query: 281 DG---LDTVYSE---AAQYDT------PSNIPVKRICNAIENAPNCGDDILCKIAAGVVE 328
D VY A QY + + +C + N N L I A +
Sbjct: 793 DQHYFFSNVYGNFQGAVQYSGDNAGAYANGYGIPDMCKIMTNDDNTP---LNNIVA-FNQ 848
Query: 329 ADSLEYDGNNSRC------------YINEDRTGDESDEG--WEWQSCSEMVVPMGKDK-N 373
S+ Y+G + IN G ++ G W WQ+C+E D N
Sbjct: 849 FMSIFYNGGGNYTGMDNSYQDLVNYLINAQSFGPDAAAGLLWTWQTCTEFGYFQSADTGN 908
Query: 374 SMY-QPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIF 425
++ P P NL +++ C + + + R + YG SN++F
Sbjct: 909 GIFGSPTPVNL--FVQMCMDVFSSYYQRSTIDSRIDYTNYMYGER-----YHFRGSNVVF 961
Query: 426 SNGMRDPFSRGGWVKTYHFFDLS-FSQDLNLGSHCLDLDEAKKSD-PDWLVQQRKTEVKI 483
NG DP+ G T D S S +N +HC D+ A+ +D PD + + + + I
Sbjct: 962 PNGNVDPWHALGLYNTST--DPSVVSYLMNGTAHCADMYPARDADVPDLAIVRNRIDQNI 1019
Query: 484 MQGWITQ 490
W+ Q
Sbjct: 1020 AI-WLGQ 1025
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 83/346 (23%), Positives = 145/346 (41%), Gaps = 33/346 (9%)
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A QF A + LEHRYYG S P K RH NS QA+ D + + + ++N
Sbjct: 107 AKQFGATVFSLEHRYYGGSKP-NFDKFNGTTLRH---LNSYQAIMDLNAFIKYANVQFNM 162
Query: 175 -THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
I GA YGG +A R YP V G +ASSAP+ + D W V + +
Sbjct: 163 DPDCRWILWGAGYGGIIAAEARKWYPDTVAGVIASSAPLTHQYDF--WQFNDHVQTAIMQ 220
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY------ 287
+ C+Q + + +++I P+G + +S F+ L T + + Y
Sbjct: 221 EGGSLCYQKVAQGFADIRQAMRTPEGRSNVSDLFQLNPRLDQTNLNYNDIQQFYLAIIAP 280
Query: 288 -SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCK--IAAGVVEADSLEYDGNNSRCYIN 344
E Q++ NI + +C +I+ P ++ + + S++ + + IN
Sbjct: 281 FQEVIQFNNDFNISIIDLCTSIDKGPWTPMQVIWQAWVYFSTTVTGSVQPLVTSYQAVIN 340
Query: 345 EDRTGDESDEG-------WEWQSCSEMV--VPMGKDKNSMYQP-EPWNLTKYIKNCKEQY 394
+ GD+S W++Q C+E ++ M+ P ++ ++ C + +
Sbjct: 341 D--LGDQSPNAQFIDQRMWQYQMCTEFAWFYTTNNNEQGMFGAVVPSSI--FLNQCFDLF 396
Query: 395 ---GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
++P L + S +N++F+NG DP+S G
Sbjct: 397 PDSNLTPTSIRELVINYNNFYGSAYDYSGTNVVFTNGWYDPWSTLG 442
>gi|341874575|gb|EGT30510.1| hypothetical protein CAEBREN_03046 [Caenorhabditis brenneri]
Length = 517
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 116/480 (24%), Positives = 199/480 (41%), Gaps = 73/480 (15%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN--- 114
Q +D+F + T+ QRY NS + N+ I + E+P D + ++ TY+
Sbjct: 63 QKVDNFD--ASNAATYKQRYWYNSNY-TQNNNIIFLMIQGESPATD-LWITKPTYQYLQW 118
Query: 115 AHQFKALIVILEHRYYGKSVPF-GSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A +F A + LEHR +G S P+ + ++K QALAD S + + ++N
Sbjct: 119 AKEFGADVFQLEHRCFGNSRPYPDTSYPSIKVCTM------TQALADIHSFIQQMNLQHN 172
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
+ I G SY G L+ FR KYP +G++ASSAP+ + D + Y VV R
Sbjct: 173 FRNPKWITFGGSYPGTLSALFRQKYPQDTVGAVASSAPLDWTLD---FFEYAMVVEDVLR 229
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLD-------TV 286
TS +C++ + ++++ + ++ DG+ L+ F P + +D +
Sbjct: 230 QTSTDCWKNVNQAFTNMQQLSLTVDGIQKLNTYFNLDPPFVAGKYTQHDIDNFFANVYSF 289
Query: 287 YSEAAQYD-------TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYD--GN 337
Y QY T + + +C + +A D++ ++ + + L D G
Sbjct: 290 YQGVIQYTYDGRNAATLGGLNAQNLCAKMNDA--NVPDVITRVNNTINWINQLNGDPVGP 347
Query: 338 NSRCY------INEDRTGDES--------DEGWEWQSCSEMVVPMGKD--KNSMYQPEPW 381
Y +N+ + D S + GW W C+E+ D +N Q P
Sbjct: 348 FQNSYSDMMKVLNDPKYDDNSAVPGDIAANRGWMWLCCNELGALQTTDQGRNIFQQTVP- 406
Query: 382 NLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFS 434
+ +I C E +G + L YGG D +N++ NG DP+
Sbjct: 407 -MGYFIDMCTEMFGPDVGIKYIRDNNKQTLYNYGGAD-----NYQATNVVLPNGAFDPWH 460
Query: 435 RGGWVKTYHFFDLSFSQDLNL--GSHCLDLDEAKKSDPDWLVQQRKT---EVKIMQGWIT 489
G TY+ + L + +HC D+ ++P L + R+ E+K G T
Sbjct: 461 VLG---TYNNNTANHMTPLLIQGAAHCSDMYPTYANEPTDLAKNRQIIHDEIKYFLGLST 517
>gi|157133204|ref|XP_001662799.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108870921|gb|EAT35146.1| AAEL012664-PA [Aedes aegypti]
Length = 493
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 111/449 (24%), Positives = 187/449 (41%), Gaps = 75/449 (16%)
Query: 69 SYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHR 128
S FP RYV NSKF+ G PI F+G ++ + G + A + A +V E R
Sbjct: 69 SVDMFPMRYVSNSKFYRPG-GPIFLFVGGPWELEQHFVEQGHFVDLAEENNAFVVANEMR 127
Query: 129 YYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHAPVIAIGASYG 187
YYG+S+P + A+ N R + QA D A +++HI+ + +A VI G +
Sbjct: 128 YYGESLPVPN--ASRGNLR---LLHIVQACTDIARLIVHIRYEVLRDPNARVIVAGVGFS 182
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE--------- 238
G LA W RL+YPH++ G AS A + ++++R+ +EE
Sbjct: 183 GSLAHWTRLRYPHLIHGVWASGAMLQ--------------ANENYREFAEEVGEYIRRYG 228
Query: 239 ---CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTEL--KDGLDTVYSEAAQY 293
C+ + + + +N+ + K FK CTP+ T L + +++E
Sbjct: 229 GNDCYGALWRGFRTAENLIDAGQSQTV-DKLFKVCTPINGTNPLDVEAFFYGIFNEVVSN 287
Query: 294 DTPSNIP--VKRICNAIENAPNCGDDILCKIAAGVV----EADSLEYDGNN-SRCYINED 346
N+ ++ +C+ + + + D L +A+ + EA+ L D + + + D
Sbjct: 288 TLRPNLRQNIRNMCDTLTHEDH--DSSLTGLASWITGQFPEAECLAMDLESIVQLFQETD 345
Query: 347 RTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
D G W +Q C+E+ P+ D SM QP + N +Q W+
Sbjct: 346 WQHDVHKSGERQWFYQRCTELGWPLTAD--SMNQPFG---VRISSNLFQQLCQRVFDGWL 400
Query: 404 L------------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
T YGG+ R + +++G DP+ G T ++ ++
Sbjct: 401 TSDVFRSLVRQTNTLYGGN------RPEMRFVFYTHGSLDPWRFTG--VTTVLYNNNYVN 452
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTE 480
+ H DL A SD DW +R E
Sbjct: 453 VIRGAIHGEDL--ASISDLDWADLRRSKE 479
>gi|195062805|ref|XP_001996257.1| GH22293 [Drosophila grimshawi]
gi|193899752|gb|EDV98618.1| GH22293 [Drosophila grimshawi]
Length = 481
Score = 97.4 bits (241), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 108/489 (22%), Positives = 200/489 (40%), Gaps = 73/489 (14%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL 92
L + R ++ ET + +Q +D+F E + + R +IN + G SPI
Sbjct: 34 LKELHRGPPMQLISKRANVETRWISQKLDNFDEDNE--EVWDDRVLINEDHFVDG-SPIF 90
Query: 93 AFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYF 152
+LG E I+ + +G + A + +V EHR++G+SVP A + Y
Sbjct: 91 IYLGGEWEIEPSPITAGHWVDIASEHNGSLVYTEHRFFGQSVPIKPLTTA-----NLKYQ 145
Query: 153 NSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
N QALAD +++ +K++ ++ V+ G SY +A W + YP V++GS ASSAP+
Sbjct: 146 NVEQALADVVNVINVLKEEEKYKNSKVVVQGCSYSATMAVWIKKLYPDVIVGSWASSAPL 205
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEE-CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
D + Y VV + +R+ + C+ I + S +++ +G +KK
Sbjct: 206 QAKVD---FKAYMKVVGQAYRELGGDYCYNIIDNATSFYEDLFE--NGQNAEAKKLLNLC 260
Query: 272 PLKNTTELKD------GLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAG 325
N + D + + + AQY P+N + + C+ + + L K
Sbjct: 261 DNFNENDQHDQWQIFSTIANILAGLAQYQNPANYDLAKHCSVLRSFSTDDATALSKF--- 317
Query: 326 VVEADSLEYDGNNSRC-----------YINEDRTGDESDEGWEWQSCSEMVVPMGKDKNS 374
+++ +N C Y D S W +Q+C+E G ++S
Sbjct: 318 ------IQWRLDNPECVNTVYKATVKYYKWAMHNYDGSSLSWFFQTCNE----FGWYQSS 367
Query: 375 MYQPEPWN----LTKYIKNCKEQYGVSPRPSWVLTY-------YGGHDIKLILRRSTSNI 423
+ +P+ T Y CK+ +G + + Y YGG + + N+
Sbjct: 368 GSKNQPFGSSFPATLYTDTCKDVFGSKYTAAKIEKYISEKNKVYGG------VNPNVENV 421
Query: 424 IFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTE 480
++G DP +H +Q + SHC D+ D ++ ++
Sbjct: 422 YMTHGGLDP---------WHPVGAGAAQGATIIPQASHCSDMGSISAKDSPAMLASKQRV 472
Query: 481 VKIMQGWIT 489
++++ W+
Sbjct: 473 AQLVREWLA 481
>gi|428184604|gb|EKX53459.1| hypothetical protein GUITHDRAFT_133167 [Guillardia theta CCMP2712]
Length = 467
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/192 (28%), Positives = 94/192 (48%), Gaps = 16/192 (8%)
Query: 91 ILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPF-GSRKAALKNARHR 149
I + G E ++ + +G +E+A +F A++V LEHRYYGKSV F R+ ++
Sbjct: 36 IFFYFGNEDSVELYVNNTGLMWESASEFDAVMVFLEHRYYGKSVLFEPGREGCME----- 90
Query: 150 GYFNSAQALADYASILLHIKDK---------YNATHAPVIAIGASYGGELATWFRLKYPH 200
+ + QAL D + L +K P+I G SYGG +A+WFR+++PH
Sbjct: 91 -FLTTDQALLDASQFLSTLKANPKEILPKKISKKPVGPIIGFGGSYGGMIASWFRMRFPH 149
Query: 201 VVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGL 260
++ G +A SAPIL + + P + +D S C + ++ + +++ +G
Sbjct: 150 LIDGVIAGSAPILSFEGLRPAYDNGGYMRVVTQDASPLCARNFLRAQKAVYSVSQSEEGR 209
Query: 261 AILSKKFKTCTP 272
L + C P
Sbjct: 210 EFLKEALLLCVP 221
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 46/171 (26%), Positives = 67/171 (39%), Gaps = 34/171 (19%)
Query: 281 DGLDTVYSEAAQYDTPSNIPVKRICNAIENAP----NCGDDILCKIAAGVVEADSLEYDG 336
DG+ YS+ +I + + NA A NC D+ C + G
Sbjct: 309 DGI--TYSDVCCNRQAKSIDLPELLNATRYALGVFYNCSQDVQCFFNGNSKQMQIPRKMG 366
Query: 337 NNSRCYINEDRTGDES-------DEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKN 389
+ R + E R E D G++W + P WN I
Sbjct: 367 SIKRFHSKEGRKKAEGADLSCVGDWGYQW----------------FWPPHLWNAESAIAG 410
Query: 390 CKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
CKE +GV PR W + +G D+ + SNI+FSNG+ DP+ GG +K
Sbjct: 411 CKEAWGVKPRSHWAVVRFGDRDLS-----AASNILFSNGLLDPWYVGGVLK 456
>gi|195353641|ref|XP_002043312.1| GM26842 [Drosophila sechellia]
gi|194127426|gb|EDW49469.1| GM26842 [Drosophila sechellia]
Length = 487
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 99/382 (25%), Positives = 159/382 (41%), Gaps = 39/382 (10%)
Query: 1 MKSSQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILE--QKDSNHGFETFFYNQ 58
MK FS Q +IF SF A R + + +L K+ E + +Q
Sbjct: 1 MKWLNFSVQ-----LIFCLPSFLATFNPYRRNVELLNHEPVLGICSKNELASVEELWLDQ 55
Query: 59 TIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQF 118
+DHF + +T+ RY N+K + PI F+G E I + +G T++ A +
Sbjct: 56 KVDHFD--KNNNRTWKMRYYRNAKHFKP-QGPIYIFVGGEWTISPGLLSTGLTHDMAVEN 112
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHA 177
++ EHRYYG S+PFG L N + + Q+LAD A + H K + +
Sbjct: 113 SGMLFYTEHRYYGLSLPFGHESYQLNNLKQ---LSLHQSLADLAHFIRHQKSNGPEMEDS 169
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-S 236
VI +G SY G L TW YP ++ S ASSAP+L D + Y VV K + +
Sbjct: 170 KVILVGGSYSGSLVTWMTQLYPDLIAASWASSAPLLAKAD---FFEYMEVVGKSIQLSYG 226
Query: 237 EECFQTIKKSWS---------EIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
+ C I+K + EI + +G S K PL + +GL +
Sbjct: 227 KNCSLRIEKGFKFLAKLFDGDEIQELLYNLNGCEGYSPK----NPL-DRAAFFNGLGNYF 281
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKI-----AAGVVEADSLEYDGNNSRCY 342
+ Q + IP R+C + + + + + + G +D ++ ++
Sbjct: 282 ALVVQSYRSAYIP--RLCETLMSLDSSDELAFIEFLKLLYSEGRRSSDCQDFGYSSMLEL 339
Query: 343 INEDRTGDESDEGWEWQSCSEM 364
ED W +Q+C+E
Sbjct: 340 FTEDSVQSSETRAWFYQTCNEF 361
>gi|170045806|ref|XP_001850485.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868713|gb|EDS32096.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 485
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 117/450 (26%), Positives = 197/450 (43%), Gaps = 66/450 (14%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ +DHF+ P++ TF +Y N +++ G PI F+G P++ G ++
Sbjct: 59 MFRTRVDHFN--PQNRDTFEFQYYSNDEYYQPG-GPIFIFVGGNWPVEQYYIEHGHFHDI 115
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYN 173
A+ A + EHRYYG S+P + N R + QAL D ++ HI+ +
Sbjct: 116 AYYENAWLFANEHRYYGSSLPV--EDLSTPNLR---FLTVEQALVDLGELIYHIRHNVVR 170
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV-VSKDF 232
+A VI +G Y G +ATW R +YPH+V GS SS + D H+V V
Sbjct: 171 DDNARVILLGVGYAGAIATWMRQRYPHLVDGSWVSSGQVDARFDFGQ----HAVEVGGLI 226
Query: 233 RD-TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY---S 288
RD ++EC+ I +++ + + +S+ F TC+P+ L ++T +
Sbjct: 227 RDHGNDECYSQIWRAFRTAEALLDA-GRTETVSELFNTCSPIDEENMLD--VETFFFSIK 283
Query: 289 EAAQYDTPSN---IPVKRICNAIENAPNCGDDILCKIAAGVVE----ADSLEYDGNNS-- 339
EA Q + S + R+C + N+ D L +A V + D + +D +S
Sbjct: 284 EAIQTEVLSEQNVVYTDRLCQTLNNSTESTD--LQTLANWVHDHFYFLDCMPFDFESSTE 341
Query: 340 -RCYINEDRTGDE--SDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKN--CKEQY 394
++ D + +Q C+E + D S QP + +T Y+ + CK Y
Sbjct: 342 IHSIVDVDAIENRILGLRQRYYQFCTEFGWFLTAD--SADQPFGYRITMYLFSNFCKAVY 399
Query: 395 G-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDL 447
G V+ ++GG D ++ +N++F+NG DP D+
Sbjct: 400 GEWLTAEVVADGVHLTNMHFGGKDPRI------ANVLFTNGGLDPVR-----------DI 442
Query: 448 SFSQ-DLN--LGSHCLDLDEAKKSDPDWLV 474
S + DLN G + +L AKK D++V
Sbjct: 443 SITDADLNSISGYNSPELLAAKKQIHDYIV 472
>gi|308451153|ref|XP_003088564.1| hypothetical protein CRE_09925 [Caenorhabditis remanei]
gi|308246539|gb|EFO90491.1| hypothetical protein CRE_09925 [Caenorhabditis remanei]
Length = 947
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 125/487 (25%), Positives = 194/487 (39%), Gaps = 70/487 (14%)
Query: 45 KDSNH--GFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID 102
KD+ + GFE + Q +HFS + TF Q++ N++ W P +G E P
Sbjct: 422 KDATYPPGFEQGTFRQKQNHFS--NQDPNTFQQKFFKNAQ-WAKPGGPNFLMIGGEGPEG 478
Query: 103 DNIQLS-GFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALAD 160
L+ TY A ++ A + +LEHR+YG SV + L +S Q L D
Sbjct: 479 AGWVLNQDITYLTWAKKYGATVYLLEHRFYGDSVVGDNTDFQL--------LSSLQMLYD 530
Query: 161 YASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP 220
A + I + T P I G SY G L+ W R +P VV+G++ASS P+ D
Sbjct: 531 LAEFIREINYR-TGTSNPWITFGGSYSGALSAWMREVFPDVVVGAVASSGPVFAKTD--- 586
Query: 221 WTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK 280
+ Y VV R C I+ ++ + + +G LS F+ P +
Sbjct: 587 FYEYLMVVENSIRTYDPTCADRIQSGFNTMRTMFLTKEGRKSLSDLFQLDPPFGDNVTDT 646
Query: 281 DG---LDTVYSE---AAQYDT------PSNIPVKRICNAIENAPNCGDDILCKIAAGVVE 328
D VY A QY + + +C + N N L I A +
Sbjct: 647 DQHYFFSNVYGNFQGAVQYSGDNAGAYANGYGIPDMCKIMTNDDNTP---LNNIVA-FNQ 702
Query: 329 ADSLEYDGNNSRC------------YINEDRTGDESDEG--WEWQSCSEMVVPMGKDK-N 373
S+ Y+G + IN G ++ G W WQ+C+E D N
Sbjct: 703 FMSIFYNGGGNYTGMDNSYQDLVNYLINAQSFGPDAAAGLLWTWQTCTEFGYFQSADTGN 762
Query: 374 SMY-QPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIF 425
++ P P NL +++ C + + + R + YG SN++F
Sbjct: 763 GIFGSPTPVNL--FVQMCMDVFSSYYQRNTIDSRIDYTNYMYGER-----YHFRGSNVVF 815
Query: 426 SNGMRDPFSRGGWVKTYHFFDLS-FSQDLNLGSHCLDLDEAKKSD-PDWLVQQRKTEVKI 483
NG DP+ G T D S S +N +HC D+ A+ +D PD + + + + I
Sbjct: 816 PNGNVDPWHALGLYNTST--DPSVVSYLMNGTAHCADMYPARDADVPDLAIVRNRIDQNI 873
Query: 484 MQGWITQ 490
W+ Q
Sbjct: 874 AI-WLGQ 879
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/284 (21%), Positives = 115/284 (40%), Gaps = 28/284 (9%)
Query: 183 GASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQT 242
GA YGG +A R YP V G +ASSAP+ + D W V ++ C Q
Sbjct: 26 GAGYGGVIAAEARKWYPDTVAGVIASSAPLTHQYDF--WQFNSHVAMAIAQEGGSLCSQM 83
Query: 243 IKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY-------SEAAQYDT 295
+ + +++I P+G + +S F+ L T + + Y E Q++
Sbjct: 84 VTQGFADIRQAMRTPEGRSNVSDLFQLNPRLDQTNLNYNDIQQFYLAIIAPFQEVIQFNN 143
Query: 296 PSNIPVKRICNAIENAPNCGDDILCK--IAAGVVEADSLEYDGNNSRCYINEDRTGDESD 353
NI + +C +I+ + ++ + + S++ + + IN+ GD+S
Sbjct: 144 DFNISIIDLCTSIDKSGWTPMQVIWQAWVYFSTTVTGSVQPLVTSYQAVIND--LGDQSP 201
Query: 354 EG-------WEWQSCSEMV--VPMGKDKNSMYQP-EPWNLTKYIKNCKEQY---GVSPRP 400
W++Q C+E D+ M+ P ++ ++ C + + ++P
Sbjct: 202 NAQFIDQRMWQYQMCTEFAWFYTTNSDEQGMFGAVVPASI--FLNQCFDLFPDSNLTPTS 259
Query: 401 SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHF 444
L + S +N++F+NG DP++ G T F
Sbjct: 260 IRELVINYNNFYGSAYDYSGTNVVFTNGWYDPWNTLGKENTADF 303
>gi|440289971|gb|ELP83425.1| serine carboxypeptidase (S28) family protein, partial [Entamoeba
invadens IP1]
Length = 220
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 68/215 (31%), Positives = 102/215 (47%), Gaps = 18/215 (8%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYEN-AH 116
+DHF+ + + F +Y ++ K+ G + SP+ LG E P + + + A
Sbjct: 1 MDHFNANND--EEFEVKYFVSEKYLDGTDLHSPLFVMLGGEGPESSKTLDNHYIIDTLAA 58
Query: 117 QFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATH 176
+ L++ +EHR+YG S P +LK + Y + QA+ DY I+ +I++ N
Sbjct: 59 RTNGLMLAIEHRFYGDSTP------SLKMDKLI-YCTAEQAMMDYIEIITYIQETRNFID 111
Query: 177 APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
PVI IG SY G LA W R KYP+VV G+ ASSAP+ D + Y VV +
Sbjct: 112 HPVIVIGGSYSGNLAAWMRQKYPNVVDGAWASSAPVEAQVD---FYQYLEVVQAGLPANT 168
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
+ + W D + G L K F TCT
Sbjct: 169 ADLLSIAFEKW---DQMTVTESGRKELKKVFNTCT 200
>gi|123399993|ref|XP_001301579.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121882777|gb|EAX88649.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 496
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 99/392 (25%), Positives = 170/392 (43%), Gaps = 45/392 (11%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+ Q +DHF S +TF QRY +K S + ++G EAP+ LS + A
Sbjct: 8 FTQKLDHFD--ASSQETFNQRYYKITKNSTANVSALFFYIGGEAPLIGKRMLSLAPVDLA 65
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT 175
+ A++ LEHR++G S P LK Y Q LAD A + +K Y+ T
Sbjct: 66 EKNNAVLFGLEHRFFGNSAPTNLTIENLK------YLTIEQGLADLAHFINAMKQDYDHT 119
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT--TYHSVVSKDFR 233
+ IG SY G L++WFRL YPH+ S ASSAP+ + T + Y ++ S
Sbjct: 120 -VRIGVIGGSYPGALSSWFRLLYPHLADVSWASSAPVEAKNNFTEYDYHCYEAITSVG-- 176
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQY 293
++C + +K++ ++ + ++KKF + N T +D Y A
Sbjct: 177 --GDKCSENTRKAFQYLET-----EDYNEVAKKF-----IGNDTPPEDHATLYYMVADTI 224
Query: 294 DTP-----SNIPVKRICNAIENAPNCGD-----DILCKIAAGVVEADSLEYDGNNSRCYI 343
TP S+ + +C+ + P D+L K+ +++ +S+ +D + ++
Sbjct: 225 ATPVQYKRSSENLTYLCDLMNKLPEKATKTEYIDVLAKVTKEILQGESI-WDSDLTQYTD 283
Query: 344 NEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVS-PRPSW 402
+ W W +C++ V + + + + NL + + C++ + P
Sbjct: 284 VSIDAPTKDGRAWTWMTCNQ--VGWFQTASGKLRSDSINLEYFDRVCRKLFNRGIPNDKL 341
Query: 403 VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFS 434
+GG + R TS F NG DP+S
Sbjct: 342 TNQRFGGKNA-----RGTST-YFINGAVDPWS 367
>gi|195158018|ref|XP_002019891.1| GL12646 [Drosophila persimilis]
gi|194116482|gb|EDW38525.1| GL12646 [Drosophila persimilis]
Length = 473
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 103/455 (22%), Positives = 187/455 (41%), Gaps = 46/455 (10%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+T + Q +DHF + +T+ RY++N + G P+ +LG E I G
Sbjct: 46 QTLWIEQKLDHFD--EDEKRTWQMRYMLNDALYQSGG-PLFIYLGGEWEISAGRITGGHI 102
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
Y+ A + AL+ EHRYYG+S P L N + Y N QAL D A + +K
Sbjct: 103 YDMAKEHNALLAYTEHRYYGESKPLPD----LSNENIQ-YLNVRQALEDLAVFIRTLKAT 157
Query: 172 YNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ + + VI +G SY + TWF+ +P +V G ASSAP+ + Y + +
Sbjct: 158 HEGLSESKVIIVGGSYSATMVTWFKKVHPDLVAGGWASSAPLFAK---VNFVEYKEITGQ 214
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
C+ I+ +E++ + + G A + K C ++L + T++SE
Sbjct: 215 SIALMGGSACYNRIESGIAEMETMFATKRG-AEVKALLKLCERFDVYSDLD--VWTLFSE 271
Query: 290 -----AAQYDTPSNIPVKRICNAIENAPN----CGDDILCKIAAGVVEADSLEYDGNNSR 340
A T + ++ +C I + N +L + + + L YD S+
Sbjct: 272 ISDLFAGVVQTHNAGQIEAVCQKIMSGTNDLIGVASYLLDVFSESGGKCNELSYDAILSQ 331
Query: 341 CYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG----- 395
++ TG+ + W +Q+C+E D + + + Y C + YG
Sbjct: 332 L-LDTSYTGNIMRQ-WIFQTCNEYGWYQTSDSKAQPFGTKFPVALYTTMCGDIYGSQYSN 389
Query: 396 --VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL 453
+ R + Y+GG N+ ++G DP+ G D + + +
Sbjct: 390 EFIDSRVAATNDYFGG------WTPGVENVYLTHGHLDPWRAMG------IQDEAQATVI 437
Query: 454 NLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+HC D + +D + ++ ++++ WI
Sbjct: 438 PEYAHCKDFNSISDTDTAEMRASKERIAELVREWI 472
>gi|125778538|ref|XP_001360027.1| GA17650 [Drosophila pseudoobscura pseudoobscura]
gi|54639777|gb|EAL29179.1| GA17650 [Drosophila pseudoobscura pseudoobscura]
Length = 473
Score = 96.3 bits (238), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 103/455 (22%), Positives = 187/455 (41%), Gaps = 46/455 (10%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+T + Q +DHF + +T+ RY++N + G P+ +LG E I G
Sbjct: 46 QTLWIEQKLDHFD--EDEKRTWQMRYMLNDALYQSGG-PLFIYLGGEWEISAGRITGGHI 102
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
Y+ A + AL+ EHRYYG+S P L N + Y N QAL D A + +K
Sbjct: 103 YDMAKEHNALLAYTEHRYYGESKPLPD----LSNENIQ-YLNVRQALEDLAVFIRTLKAT 157
Query: 172 YNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ + + VI +G SY + TWF+ +P +V G ASSAP+ + Y + +
Sbjct: 158 HEGLSESKVIIVGGSYSATMVTWFKKVHPDLVAGGWASSAPLFAK---VNFVEYKEITGQ 214
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
C+ I+ +E++ + + G A + K C ++L + T++SE
Sbjct: 215 SIALMGGSACYNRIESGIAEMETMFATKRG-AEVKALLKLCERFDVYSDLD--VWTLFSE 271
Query: 290 -----AAQYDTPSNIPVKRICNAIENAPN----CGDDILCKIAAGVVEADSLEYDGNNSR 340
A T + ++ +C I + N +L + + + L YD S+
Sbjct: 272 ISDLFAGVVQTHNAGQIEAVCQKIMSGSNDLIGVASYLLDVFSESGGKCNELSYDAILSQ 331
Query: 341 CYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG----- 395
++ TG+ + W +Q+C+E D + + + Y C + YG
Sbjct: 332 L-LDTSYTGNIMRQ-WIFQTCNEYGWYQTSDSKAQPFGTKFPVALYTTMCGDIYGSQYSN 389
Query: 396 --VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL 453
+ R + Y+GG N+ ++G DP+ G D + + +
Sbjct: 390 EFIDSRVAATNDYFGG------WTPGVENVYLTHGHLDPWRAMG------IQDEAQATVI 437
Query: 454 NLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+HC D + +D + ++ ++++ WI
Sbjct: 438 PEYAHCKDFNSISDTDTAEMRASKERIAELVREWI 472
>gi|195569687|ref|XP_002102840.1| GD19284 [Drosophila simulans]
gi|194198767|gb|EDX12343.1| GD19284 [Drosophila simulans]
Length = 515
Score = 96.3 bits (238), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 70/240 (29%), Positives = 113/240 (47%), Gaps = 16/240 (6%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+T + Q +DHF +T+ RY++N + G +P+ +LG E I G
Sbjct: 46 QTLWIEQKLDHFD--AAETRTWQMRYMLNDALYKSG-APLFIYLGGEWEISSGRITGGHL 102
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
Y+ A + AL+ EHRYYG+S P L N + Y + Q+LAD A + IK
Sbjct: 103 YDMAKEHNALLAYTEHRYYGQSKPLPD----LSNENIK-YLSVNQSLADLAHFINTIKQN 157
Query: 172 YNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ + + VI +G SY + TWF+ YP +V G ASSAP+L + Y V +
Sbjct: 158 HEGLSESKVIIVGGSYSATMVTWFKKLYPDLVAGGWASSAPLLAK---VNFVEYKEVTGQ 214
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
C++ I+ +E++ + + G A + K C P ++L + T++SE
Sbjct: 215 SIEQMGGSACYKRIENGIAEMETMIATKRG-AEVKALLKLCEPFDVYSDLD--VWTLFSE 271
>gi|157167874|ref|XP_001656137.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108871003|gb|EAT35228.1| AAEL012589-PA, partial [Aedes aegypti]
Length = 459
Score = 95.9 bits (237), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 110/449 (24%), Positives = 186/449 (41%), Gaps = 75/449 (16%)
Query: 69 SYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHR 128
S FP RYV NSKF+ G PI F+G ++ + G + A + A +V E R
Sbjct: 35 SVDMFPMRYVSNSKFYRPG-GPIFLFVGGPWELEQHFVEQGHFVDLAEENNAFVVANEMR 93
Query: 129 YYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHAPVIAIGASYG 187
YYG+S+P + A+ N R + QA D A +++HI+ + +A VI G +
Sbjct: 94 YYGESLPVPN--ASRGNLR---LLHIVQACTDIARLIVHIRYEVLRDPNARVIVAGVGFS 148
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE--------- 238
G LA W RL+YPH++ G AS A + ++++R+ +EE
Sbjct: 149 GSLAHWTRLRYPHLIHGVWASGAMLQ--------------ANENYREFAEEVGEYIRRYG 194
Query: 239 ---CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTEL--KDGLDTVYSEAAQY 293
C+ + + + +N+ + FK CTP+ T L + +++E
Sbjct: 195 GNDCYGALWRGFRTAENLIDAGQSQTV-DTLFKVCTPINGTNPLDVEAFFYGIFNEVVTN 253
Query: 294 DTPSNIP--VKRICNAIENAPNCGDDILCKIAAGVV----EADSLEYDGNN-SRCYINED 346
N+ ++ +C+ + + + D L +A+ + EA+ L D + + + D
Sbjct: 254 TLRPNLRQNIRNMCDTLTHEDH--DSSLTGLASWITGQFPEAECLAMDLESIVQLFQETD 311
Query: 347 RTGDESDEG---WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWV 403
D G W +Q C+E+ P+ D SM QP + N +Q W+
Sbjct: 312 WQHDVHKSGERQWFYQRCTELGWPLTAD--SMNQPFG---VRISANLFQQLCQRVFDGWL 366
Query: 404 L------------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQ 451
T YGG+ R + +++G DP+ G T ++ ++
Sbjct: 367 TSDVFRSLVRQTNTLYGGN------RPEMRFVFYTHGSLDPWRFTG--VTTVLYNNNYVN 418
Query: 452 DLNLGSHCLDLDEAKKSDPDWLVQQRKTE 480
+ H DL A SD DW +R E
Sbjct: 419 VIRGAIHGEDL--ASISDLDWADLRRSKE 445
>gi|156547443|ref|XP_001605073.1| PREDICTED: putative serine protease K12H4.7-like [Nasonia
vitripennis]
Length = 378
Score = 95.5 bits (236), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 97/383 (25%), Positives = 162/383 (42%), Gaps = 70/383 (18%)
Query: 151 YFNSAQALADYASILLHIKDKYNATHAPV----IAIGASYGGELATWFRLKYPHVVIGSL 206
Y +S QALAD A I+ A P I G SY G LA W R KYPH+V G++
Sbjct: 22 YLSSEQALADVAYF---IQGMQAAQQLPDTSRWIMFGGSYSGSLAAWMRAKYPHLVHGAM 78
Query: 207 ASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKK 266
++S P+L D + Y +V + + S+ C TI + ++ + G L K
Sbjct: 79 SASGPLLAQID---FQQYFIIVEESLKTHSQACVDTIAAAIRQVHIMLRHRIGQQGLEKL 135
Query: 267 FKTCTPL-------KNTTELKDGLDTVYSEAAQYD----TPSNIPVKRICNAIENAPNCG 315
F C P+ K+ L + L +++ QY+ S+I + +C
Sbjct: 136 FNFCDPIDSGKTSQKDIANLYETLADNFADVVQYNKDNRNESSITIDMVC---------- 185
Query: 316 DDILCKIAAGV-----VEADSLEYDGNNSRC-------YINEDRT---GDESDEG---WE 357
D+L GV E ++ N +C I+E R ++ + G W
Sbjct: 186 -DVLVDEKKGVPVNRLAEVSNMLLAKNKEKCLDYKYDKMISELRNITWAEQKNAGGRQWT 244
Query: 358 WQSCSEMVVPMGKDKNSMYQP----EPWNLTKYIKNCKEQYG------VSPRPSWVLTYY 407
+Q+C+E G + S +P E + + +I+ C + +G + P T Y
Sbjct: 245 YQTCTE----FGFFQTSTARPNLFSETFPVEFFIQQCADIFGPRFLHTLQPGVIRTNTMY 300
Query: 408 GGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKK 467
G D+ I+ SN++F +G DP+ G K+ + + +N +HC ++ K
Sbjct: 301 GALDLPNIV----SNVVFVHGSIDPWHALGVTKSSNPNAPAIF--INGTAHCANVYPPSK 354
Query: 468 SDPDWLVQQRKTEVKIMQGWITQ 490
+D LV RK +++ W+ Q
Sbjct: 355 NDLPQLVDARKQVGQLIGQWLAQ 377
>gi|344268147|ref|XP_003405924.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 464
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 101/414 (24%), Positives = 170/414 (41%), Gaps = 51/414 (12%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP----IDDNIQLSGFT 111
+ Q +DHF S+ + QRY IN F+ G L G E I N+ +
Sbjct: 54 FTQKLDHFDQKDLSF--WRQRYFINDAFYKPGGPVFLMIGGMETAKRNWISRNLPFIAY- 110
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
A + AL ++LEHR+YG S P G L A R Y + Q L D + + I
Sbjct: 111 ---AERLHALCLVLEHRFYGHSQPTGD----LSTASLR-YIRNHQVLGDIVNFRIKIAKL 162
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
T +A G YGG LA W R+KYP + ++ SSAP+ + + Y V
Sbjct: 163 MGLTKNKWVAFGEFYGGSLAVWSRIKYPDLFAAAVGSSAPVKVKIN---FDEYFEGVQTS 219
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC-TPLKNTTE----LKDGLDTV 286
++ +C + ++++ E+ + P +IL F C TP +++ + + L +
Sbjct: 220 LDASNIKCSRAVQRALLEVIRMLKSPKSYSILKSDFMLCETPQNKSSQHIPFVLENLMSF 279
Query: 287 YSEAAQYDTPSN-----IPVKRICNAIENAPNCGD-DILCKIAAGVVEADSLE-YDGN-N 338
QY+ + + C+ + P +I + +L D N N
Sbjct: 280 LIPIVQYNKKRKSVMNILSIDDFCDKMTETPLSSPYHRYARIVRNNIRNRNLSCLDANYN 339
Query: 339 SRCYINEDRTGDES--DEGWE--WQSCSEMVVPMGKDKNSMYQP-EPWNLTKYIKNCKEQ 393
R + + D+ D+ + +Q C+E D S YQP + ++ C +
Sbjct: 340 HRLRRLSETSLDKGNVDQARQRLYQCCTEFGFFQSTD--SRYQPFSELPIRYFLDKCSDL 397
Query: 394 YG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
+G + + +YYGG ++ + S IIFS+G DP++ G +
Sbjct: 398 FGSEYSFSSLRQSAEALNSYYGGFNV------NGSKIIFSSGSLDPWNALGITR 445
>gi|398408866|ref|XP_003855898.1| serine carboxypeptidase [Zymoseptoria tritici IPO323]
gi|339475783|gb|EGP90874.1| serine carboxypeptidase [Zymoseptoria tritici IPO323]
Length = 538
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 95/333 (28%), Positives = 145/333 (43%), Gaps = 36/333 (10%)
Query: 10 WLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSN---HGFETFFYNQTIDHFSYG 66
WLL+ +I +++++ + P L + RS +L+ + +N F F + Q IDH
Sbjct: 5 WLLITLINLTSAYRRSTSNRHPHL--LHRS-LLQARSANDVASKFANFTFEQYIDHDQ-- 59
Query: 67 PESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNI----QLSGFTYENAHQFKALI 122
PE TFPQRYV+++ +W G SP++ ++ E PI+D + + G A + A
Sbjct: 60 PE-LGTFPQRYVVDTTYWNGTGSPVILWIWGEGPIEDGLIYFNKSLGTAGLLASEIGAAQ 118
Query: 123 VILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALAD----YASILLHIKDKYNATHAP 178
VILEHR++G+SV F ++ Y S A+ D S+ LH + P
Sbjct: 119 VILEHRFFGESVVFDEW-----TTQNLQYLTSDNAIRDAIRFAKSVQLHFSNVTGLGDVP 173
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD-FRDTSE 237
IA G SYGG L TW +P ASSA + ++ P V+ ++ FR
Sbjct: 174 WIATGESYGGALVTWLAQLHPDTFWAYYASSATV----EVVPDNFGFYVIGEEVFRQNCT 229
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPS 297
+ Q + EI + D + + F T LK+ + L + D P
Sbjct: 230 KDLQLVAAHIDEI-LVNGSADQIHDIKALFGMET-LKDDVDFVTALGRPTASYFPGDNPG 287
Query: 298 NIPVKRI---CNAIENAPNCG---DDILCKIAA 324
RI +AI AP G D L AA
Sbjct: 288 GF-CDRIEGGLDAIGGAPATGIGLDKALVNFAA 319
>gi|194744683|ref|XP_001954822.1| GF16550 [Drosophila ananassae]
gi|190627859|gb|EDV43383.1| GF16550 [Drosophila ananassae]
Length = 489
Score = 95.1 bits (235), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 106/474 (22%), Positives = 195/474 (41%), Gaps = 38/474 (8%)
Query: 33 LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL 92
L + R E + E + Q +D+F ++ T+ R +IN ++ G SPI
Sbjct: 37 LKDLHRGSPAEPTMTRANVEERWITQWLDNFD--GDNNATWEDRILINEDYFVDG-SPIF 93
Query: 93 AFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYF 152
+LG E I SG + A Q IV EHR++G+S+P Y
Sbjct: 94 IYLGGEWKIQPGDITSGLWVDIAKQHNGTIVTTEHRFFGESLPI----TPFSTENLEKYQ 149
Query: 153 NSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
N QALAD +++ ++K++ + ++ G SY +ATW R YP ++GS ASSAP+
Sbjct: 150 NVNQALADVINVIENLKEEDKYKDSKIVIHGCSYSASMATWIRKLYPETILGSWASSAPL 209
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEE-CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT 271
+ D Y V+ + ++ + C+ I + S +++ + +G K+ C
Sbjct: 210 VAKVDFKE---YFKVIGESYKVLGGQYCYDLIDNATSYYEDLFANGEGDQ-AKKELNLCD 265
Query: 272 PL-----KNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCK----- 321
++ ++ + +++ AQY P+N + + C+ + + + L K
Sbjct: 266 NFDADNKRDRWQIFSTIANIFAGIAQYQNPANYDIAQYCSVLRSFSDDDSVALSKFINWR 325
Query: 322 IAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPW 381
I + S + G +D D S W +Q+CSE G ++S + +P+
Sbjct: 326 IHEHSGQCISATFKGTTDYYEWAKDNYQD-SMLPWFFQTCSE----FGWFQSSGSRQQPF 380
Query: 382 NLT----KYIKNCKEQYGVSPRPSWVLTYYGGHDIKL-ILRRSTSNIIFSNGMRDPFSRG 436
+ Y C+ +G + + + + L +N+ F G D
Sbjct: 381 GSSFPSKLYEDTCETVFGSKYNTAGIRANAKATNAEFGGLDNDFTNVYFVQGQMD----- 435
Query: 437 GWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
GW K + + + SHC D SD LV ++ + ++ W+ +
Sbjct: 436 GWRKVGAGVEQG-ATIIPYASHCPDGGSISASDSPELVASKQKIIALVAQWLEE 488
>gi|157167876|ref|XP_001656138.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108871004|gb|EAT35229.1| AAEL012590-PA [Aedes aegypti]
Length = 489
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 82/327 (25%), Positives = 160/327 (48%), Gaps = 30/327 (9%)
Query: 67 PESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILE 126
P++ +F RYV N++ + G PI F+G P++ ++ G + A + +V E
Sbjct: 64 PQNRNSFSMRYVTNNRHYRRG-GPIFLFVGGPWPLEAHLVEQGHFVDMAAEMNGFLVANE 122
Query: 127 HRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHAPVIAIGAS 185
RYYG+S+P + N R Y ++ Q L++ A+ + H+K D +A VI G
Sbjct: 123 LRYYGESIPV--EDVSRNNFR---YLHNVQILSELATFIAHLKEDVVRDPNAKVILAGVG 177
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD-TSEECFQTIK 244
Y LA W R ++PH++ G +SS + T + + VV ++ R ++C+ TI
Sbjct: 178 YSASLAQWMRQRFPHLIHGVWSSSGMVRAS---TNYREFAEVVGENIRRFGGDDCYSTIW 234
Query: 245 KSWSEIDNIASKPDGLA-ILSKKFKTCTPL--KNTTELKDGLDTVYSEAAQYDTPSNIP- 300
+++ +N+ GL+ + + F TC P+ N E++ +++E ++ +++
Sbjct: 235 RAFRTAENLIDA--GLSTTVDELFHTCRPIDAANALEVEAFFYGIFNEISREVVDADLRG 292
Query: 301 -VKRICNAIENAPNCGDDILCKIAAGVV----EADSLEYDGN---NSRCYINEDRTGDES 352
+K++C + + + ++ L ++A+ + A+ L D ++ I+ D +S
Sbjct: 293 NIKQMCEPLTASDD--ENSLLELASWLTGRFPNAECLAMDFQSIVDTYNTIDVDSEIVKS 350
Query: 353 DE-GWEWQSCSEMVVPMGKDKNSMYQP 378
E W +Q C+E+ P+ S YQP
Sbjct: 351 GERQWMFQRCTELGWPL--TAASQYQP 375
>gi|384485422|gb|EIE77602.1| hypothetical protein RO3G_02306 [Rhizopus delemar RA 99-880]
Length = 242
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/167 (34%), Positives = 86/167 (51%), Gaps = 17/167 (10%)
Query: 54 FFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN--IQLSGFT 111
FFY+Q +DHF E+ TF RY N++++ G P+L + E D + + F
Sbjct: 17 FFYDQPVDHFL---ENSTTFKHRYWANTEWYQPGG-PVLIYNAGETAADQRSFLVIDSFM 72
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-- 169
E +I+++EHR+YG S+P ++ A+ N+AQAL D AS + ++K
Sbjct: 73 AELTKSLNGIIIVMEHRFYGLSLP-----SSNFTAKELATLNTAQALEDIASFIRYVKIP 127
Query: 170 ----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
D A I G SY G LA W RLKYP +V ++ SSAP+
Sbjct: 128 NFEVDLPPAPETRYIVYGGSYSGNLAAWMRLKYPDIVFAAVPSSAPV 174
>gi|17539994|ref|NP_501599.1| Protein PCP-2 [Caenorhabditis elegans]
gi|3876284|emb|CAB05187.1| Protein PCP-2 [Caenorhabditis elegans]
Length = 1080
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 118/498 (23%), Positives = 196/498 (39%), Gaps = 64/498 (12%)
Query: 30 RPRLGRIRRSRI-LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN 88
RP G + S + D GFET + Q DHF+ ++ F QR+ N++ W
Sbjct: 552 RPPHGFLPESDFNMSPDDYPAGFETGSFRQRQDHFN--NQNADFFQQRFFKNTQ-WAKPG 608
Query: 89 SPILAFLGAEAPIDDNIQLSGFTYEN------AHQFKALIVILEHRYYGKSVPFGSRKAA 142
P +G E P + L+ EN A ++ A + +LEHR+YG+S
Sbjct: 609 GPNFLMIGGEGPDKASWVLN----ENLPYLIWAKKYGATVYMLEHRFYGES--------R 656
Query: 143 LKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVV 202
+ + + +S Q + D A + + K + T P I G SY G ++ W R +P +V
Sbjct: 657 VGDNTNFNRLSSLQMIYDIADFIRSVNIK-SGTSNPWITFGGSYSGLISAWTREVFPELV 715
Query: 203 IGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAI 262
+G++ASSAP+ D + Y V R + C I++ ++ + + G
Sbjct: 716 VGAVASSAPVFAKTD---FYEYLMVAENSIRSYNSTCADRIQEGFNSMRALFLTKGGRQT 772
Query: 263 LSKKFKTCTPLK-NTTELKDGL-----------DTVYSEAAQYDTPSNIPVKRICNAIEN 310
LS FK P N T++ D YS ++ + +C + N
Sbjct: 773 LSSMFKLDPPFADNVTDIDQHYFFSNIYSNFQGDVQYSGDNMGSYANSYGIPDMCKIMTN 832
Query: 311 APNCG-DDILC--KIAAGVVEADSLEYDGNNS-----RCYINEDRTGDESDEG--WEWQS 360
N ++I+ + A + +NS IN G +++ W WQ+
Sbjct: 833 DSNTPLNNIVAFNEYMANFYNGGGPYFGLDNSYQDMINFLINAKDFGPDAEASLLWTWQT 892
Query: 361 CSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY-------GVSPRPSWVLTYYGGHDIK 413
CSE D + P + +I+ C + + + P YG
Sbjct: 893 CSEFGYFQSADSGNGIFGSPTPVNFFIQICMDVFNNYYQRSAIDPMVDNTNYMYGER--- 949
Query: 414 LILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGS-HCLDLDEAKKSDPDW 472
SN++F NG +DP+ G Y+ D S L G+ HC D+ A+ +D
Sbjct: 950 --FHFRGSNVVFPNGNKDPWHALG---LYYPTDSSVVSYLIDGTAHCADMYPARDADVPG 1004
Query: 473 LVQQRKTEVKIMQGWITQ 490
L R + + W+ Q
Sbjct: 1005 LKVVRDLIDQNIAIWLNQ 1022
Score = 82.0 bits (201), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 107/430 (24%), Positives = 176/430 (40%), Gaps = 52/430 (12%)
Query: 46 DSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP---ID 102
+S T + Q +DH G S TF QRY+ + ++ + L G E P +D
Sbjct: 38 NSETAMTTGYMAQNLDHL-IGNAS-GTFTQRYLYSQQYTLHQRTAFLYVSGVEGPNVVLD 95
Query: 103 DNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA 162
D + + A QF A I LEHRYYG+S P K N RH NS QA D
Sbjct: 96 DRTPI----VKTAKQFGATIFTLEHRYYGESKP-NVDKLDAYNLRH---LNSFQATQDVI 147
Query: 163 SILLHIKDKYNATH-APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
S + + ++N + G YGG +A R P+ V G +ASS P+ + D W
Sbjct: 148 SFIKYANVQFNMDQDVRWVVWGIGYGGIIAAEARKLDPNSVSGVIASSTPLTHEYDF--W 205
Query: 222 TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
H V C++ + +++I P+G +S F+ L T +
Sbjct: 206 RFNHRVAIVLAETGGSLCYRKVANGFADIREAMKTPEGRLNISDLFQLNPRLNETALNYN 265
Query: 282 GLDTVY-------SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLE- 333
+ Y E +++ ++ + +C I+ + +++ + A V + +L+
Sbjct: 266 DIQMFYLAIIAPFQEIVEFNDDFDLSIADLCTTIDKSNWTNMEVVYQ--AYVYLSTTLDG 323
Query: 334 YDGNNSRCYIN-EDRTGDES-DEG------WEWQSCSEM--VVPMGKDKNSMYQP-EPWN 382
+ G Y + D GD+S D G W++Q C+E ++ ++ P P +
Sbjct: 324 FAGPMDISYQDFVDSLGDQSVDSGWIDNRIWQYQVCTEFGWFYTTNDNEQGLFGPVVPAS 383
Query: 383 LTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRR--------STSNIIFSNGMRDPFS 434
L ++ C + + P LT G D + S +N +F+NGM DP+
Sbjct: 384 L--FLNQCFDIF-----PDANLTATGLRDSIINYNNFYGSSYDYSGTNAVFTNGMNDPWR 436
Query: 435 RGGWVKTYHF 444
G T F
Sbjct: 437 ELGKTSTGDF 446
>gi|194744673|ref|XP_001954817.1| GF16552 [Drosophila ananassae]
gi|190627854|gb|EDV43378.1| GF16552 [Drosophila ananassae]
Length = 597
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 98/416 (23%), Positives = 173/416 (41%), Gaps = 39/416 (9%)
Query: 45 KDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN 104
K+ E + +Q +DHF T+ RY N+K+ PI F+G E I
Sbjct: 42 KNEVASVEELWLDQRVDHFDENNNG--TWKMRYFRNAKY-HNPQGPIYIFVGGEWTISPG 98
Query: 105 IQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
+ +G T++ A + ++ EHRYYG S+P G++ + + N Q+LAD A
Sbjct: 99 LMSTGLTHDMAVENSGMLFYTEHRYYGLSLPHGNKSLKVHQLKQ---LNLQQSLADLAFF 155
Query: 165 LLHIK-DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+ H K + + VI +G SY G + TW +YP ++ S ASSAP+L D +
Sbjct: 156 IRHQKSNNPELKDSKVILVGGSYSGSMVTWMTQRYPDLIAASWASSAPLLAKAD---FYE 212
Query: 224 YHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC------TPLKNT 276
Y +VS + + + C Q I + + + + + + +LS KF C PL +
Sbjct: 213 YMDMVSSSVQLSYGQNCSQRISRGFEYLVKLFHENNIRTLLS-KFNGCKDYDPKNPL-DR 270
Query: 277 TELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKI-----AAGVVEADS 331
+GL ++ Q + IP R+C ++ + + K + G
Sbjct: 271 AAFFNGLGNYFALIVQ-SYSAYIP--RLCESLMSLDASDEVAFIKFLELLYSEGRRSTAC 327
Query: 332 LEYDGNNSRCYINEDRTGDESDEGWEWQSCSEM---VVPMGKDKNSMYQPEPWNLTKYIK 388
++ ++ +E+ W +Q+C+E K S + L+ + K
Sbjct: 328 QDFGYSSMLQLFSEEDDHGSGTRAWFYQTCNEFGWYTTTQSKSSLSAAFAKQVPLSYFEK 387
Query: 389 NCKEQYGVSPRPSWVL-------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
C++ +G P + +GG R + +IF++G DP+ G
Sbjct: 388 LCQDAFGPEQTPQKLARGIRQTNLEFGGFGFNHSER--YAQVIFTHGQLDPWRALG 441
>gi|268535774|ref|XP_002633022.1| Hypothetical protein CBG21794 [Caenorhabditis briggsae]
Length = 1277
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 123/451 (27%), Positives = 186/451 (41%), Gaps = 52/451 (11%)
Query: 50 GFETFFYNQTIDHFS-YGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD----N 104
GFET + Q DHF+ P +Q Q++ NS+ W P +G E P N
Sbjct: 759 GFETGTFYQRQDHFNNQNPVHFQ---QKFYKNSQ-WAQPGGPNFLMIGGEGPEGPGWVLN 814
Query: 105 IQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
QL+ Y A ++ A + ILEHR+YG S K + N+ +S Q L D A
Sbjct: 815 EQLTWIQY--AKKYGATVYILEHRFYGDS------KIDINNSNFY-LLHSLQMLYDLAEF 865
Query: 165 LLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTY 224
+ + A AP I G SY G L+ W R +P +VIG++ASS P+ D + Y
Sbjct: 866 IKAVNINSPAP-APWITFGGSYSGALSAWMREVFPELVIGAVASSGPVFAKTD---FYEY 921
Query: 225 HSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-KNTTELKDG- 282
VV K R + C I+ ++ + + +G LS F+ +N T+
Sbjct: 922 LMVVEKSIRTYDKTCADRIQSGFNTMRTMFLTKEGRQNLSDLFQLKPAFGENVTDTDQHY 981
Query: 283 -LDTVYSE---AAQYDT------PSNIPVKRICNAIENAPNCGDDILCKIA--AGVVEAD 330
+Y A QY + + +C + N N D + + V D
Sbjct: 982 FFSNIYGNFQGAVQYSGDNAGAYANGYGIADMCKIMTNDSNIPLDNIVQFNEFMTVFYND 1041
Query: 331 SLEYDG--NNSRCYI----NEDRTGDESDEG--WEWQSCSEMVVPMGKDKNSMYQPEPWN 382
Y G NN + YI N G E G W WQ+C+E D + P
Sbjct: 1042 GDAYSGLDNNYQDYIDYMQNAQMFGPEYGAGLLWTWQTCNEFGYFQSADSGNGIFGSPTP 1101
Query: 383 LTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILR---RSTSNIIFSNGMRDPFSRGGWV 439
+ Y++ C + + + + + T G + R R T N++F NG DP+ G
Sbjct: 1102 VNMYVQMCMDIFNAYEQRNTIDTAIGYTNYVYGERFHYRGT-NVVFPNGNVDPWHALG-- 1158
Query: 440 KTYHFFDLS-FSQDLNLGSHCLDLDEAKKSD 469
Y+ D S S +N +HC D+ A+ +D
Sbjct: 1159 -LYYPTDKSVVSYLINGTAHCADMYPARDAD 1188
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 107/448 (23%), Positives = 175/448 (39%), Gaps = 78/448 (17%)
Query: 35 RIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAF 94
++ ++ L ++N T + QT+DHF+ S +TF Q Y F + AF
Sbjct: 211 KLNKAARLSTGNANIPVNTGYMIQTLDHFN--SRSNETFVQTYYYTQHFALHQRT---AF 265
Query: 95 LGAEAPIDDNIQLSGFTYEN------AHQFKALIVILEHRYYGKSVPFGSRKAALKNARH 148
L + + + + + EN A QF A + LEHRYYG+S P K N R
Sbjct: 266 LYVS--VSGDFETTVISDENNPVVKSARQFGATLFSLEHRYYGQSKP-NVEKFDSFNLR- 321
Query: 149 RGYFNSAQALADYASILLHIKDKYNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLA 207
+ NS QA+ D + + + ++N + GA YGG +A R P VV G +A
Sbjct: 322 --FLNSFQAIQDIVAFIKYANKQFNLDPDVRWVLWGAGYGGVIAAEARKWNPDVVAGVIA 379
Query: 208 SSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKF 267
SS P+ + D W V ++ + C+Q + + +++I P+G + +S F
Sbjct: 380 SSTPLTHEYDF--WQFNDHVQMAISQEGGQLCYQKVAQGFADIRQAMRTPEGRSNISDLF 437
Query: 268 KTCTPLKNTTELKDGLDTVY-------SEAAQYDTPSNIPVKRICNAIENAPNCGDDIL- 319
+ L T + + Y E Q++ NI + +C I+ + +++
Sbjct: 438 QLNPRLDQTNLNYNDVQMFYLAIIAPFQEMIQFNNDFNIDIGALCTTIDQSTWTPMEVVW 497
Query: 320 ------CKIAAGVVEADSLEYD------GNNSRCYINEDRTGDESDEGWEWQSCSEM--- 364
G ++ Y GN S N D + W++Q C+E
Sbjct: 498 QAYVYFSNTVIGGLQPLVTSYQAVIDDLGNQSVNAANID------NRMWQYQMCTEFGWF 551
Query: 365 -------------VVPMGKDKNSMYQPEPW-NL-TKYIKNCKEQYGVSPRPSWVLTYYGG 409
VVP N + P NL I++ Y YG
Sbjct: 552 YTTNNNEQGLFGAVVPTSIFLNMCFDLFPGANLDATVIRDLTIDYN---------NLYGS 602
Query: 410 HDIKLILRRSTSNIIFSNGMRDPFSRGG 437
S +N++F+NG DP+SR G
Sbjct: 603 -----AFDYSGTNVVFTNGWYDPWSRLG 625
>gi|195391886|ref|XP_002054590.1| GJ24539 [Drosophila virilis]
gi|194152676|gb|EDW68110.1| GJ24539 [Drosophila virilis]
Length = 489
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 75/235 (31%), Positives = 109/235 (46%), Gaps = 28/235 (11%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E + Q +DHF + +T+ RY N+K+ PI FLG E I + +G T
Sbjct: 53 EELWLTQRLDHFD--GLNNKTWQMRYFRNAKY-HRPQGPIYIFLGGEWTITPGLLSTGLT 109
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD- 170
++ A + ++ EHRYYG+S P+ + +L++ +H + QALAD A + + K
Sbjct: 110 HDMAVENAGILFYTEHRYYGQSWPYENDSLSLEHLKH---LSLHQALADLAHFIRYQKSH 166
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
N TH+ VI +G SY G +A W YP +V S ASSAP+L D Y +V
Sbjct: 167 SSNLTHSKVILVGGSYSGSMAAWMTHLYPELVTASWASSAPLLAKADFYE---YMQLVEN 223
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLD 284
+ C Q I++ GL L K F T TTEL L+
Sbjct: 224 SLNLSYGNNCTQRIER-------------GLNHLVKLFNT----NETTELLTSLN 261
>gi|325186496|emb|CCA21036.1| lysosomal ProX carboxypeptidase putative [Albugo laibachii Nc14]
Length = 250
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 57/149 (38%), Positives = 82/149 (55%), Gaps = 13/149 (8%)
Query: 57 NQTIDHFSYGPESYQT------FPQRYVINSKFWGGGN--SPILAFLGAEAP-IDDNIQL 107
Q +D FS+ P SY + QRY+IN++ W + +P + G EA +
Sbjct: 106 TQRVDQFSW-PSSYSRSQPPLFYKQRYLINNETWDPNDPKAPTFFYTGNEASDVSLYANH 164
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
+G +E A FKALIV EHR+YG S PF S + + R+R + QA+ADYA +L
Sbjct: 165 TGLMWEYAAHFKALIVFAEHRFYGLSQPFNSSQLIPSHLRYRTH---EQAIADYALLLES 221
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRL 196
I+ +++ PVI G SYGG L+ WFR+
Sbjct: 222 IQKRFHGDRHPVITFGGSYGGMLSAWFRI 250
>gi|123479668|ref|XP_001322991.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121905847|gb|EAY10768.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 437
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 115/476 (24%), Positives = 209/476 (43%), Gaps = 78/476 (16%)
Query: 42 LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPI 101
L + D N F +F QT+DH + G E TF Q Y + G I+ +GAE+
Sbjct: 8 LVRSDENRTFYSF--KQTLDHENTGSE---TFDQYYYEVTDHVVGQPKAIIVKIGAES-- 60
Query: 102 DDNIQLSGFTYENA---HQFKALIVILEHRYYGKSVPF-GSRKAALKNARHRGYFNSAQA 157
D + SG + NA ++ A+++ ++HR++GKS+P G LK + QA
Sbjct: 61 -DKLVASGVSDFNAVLAKRYNAIVLTIQHRFFGKSIPQDGLTVDKLK------FLTVEQA 113
Query: 158 LADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGD 217
+ DY + +++ + P + +G SY G L+ R KYP +++SS +LY
Sbjct: 114 VQDYKVFHDYYQNE-KKLNLPWLVVGGSYPGLLSALIRDKYPDDFKAAISSSG-VLY--- 168
Query: 218 ITPWTTYHSVVSKDFRDT---SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCT--- 271
++ V D +D +EC +++ +I+ + P A + F T
Sbjct: 169 -----ATNNFVEFDLQDAISMGQECAAIARQTRYQIEKLLEDPSDKAYVMNLFGVDTEKY 223
Query: 272 PLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVV---- 327
PLK+ E + + +++ + QY+ S ++C+ + NA G D + +A
Sbjct: 224 PLKDG-EFMNFIGELFTLSLQYNNLS-----KVCSPLVNARRLGYDTVSALATYAKGWFY 277
Query: 328 --EADSLEYDGNNSRCYINEDRTGDESDE-GWEWQSCSEMVV-PMGKDKNSMYQPEPWNL 383
+A EY + R + TG +D+ W W +C+++ +GK + S+ + E
Sbjct: 278 ENQAKPQEYSTAHMR-----NITGPNNDQRCWFWMTCNQLAYWQIGKGRLSL-RGEKVTK 331
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFS--------- 434
+ CK+ + P + + + L R +I ++ +DP++
Sbjct: 332 EVFEDQCKDVFDQEMHPD--VDAFNAKYSGIPLNRD--HIFYTTASQDPWTWTCVTEDVK 387
Query: 435 --RGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
V+TY +L HC DLD A +DP+ LV+ R+ E+ ++ W+
Sbjct: 388 VNENSVVRTYAGPELG---------HCSDLDGATDNDPEDLVRIREQEILTIEHWL 434
>gi|195062821|ref|XP_001996260.1| GH22290 [Drosophila grimshawi]
gi|193899755|gb|EDV98621.1| GH22290 [Drosophila grimshawi]
Length = 633
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 82/163 (50%), Gaps = 9/163 (5%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+T + Q +DHF +T+ RY++N F+ G P+ F G E I G
Sbjct: 50 QTLWIEQKLDHFD--ESETRTWQMRYMLNDGFFKAGG-PMFIFFGGEWTISPGRITGGHM 106
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
Y+ A + L+V EHRYYG+S P L N + Y + QALAD A + K
Sbjct: 107 YDMAKEHNGLLVYTEHRYYGESHPLPD----LSNENIQ-YLHVTQALADLAHFITTQKTT 161
Query: 172 YNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL 213
Y + + VI +G SY + TWF+ YP +V+G ASSAP+L
Sbjct: 162 YEGLSDSKVIIVGGSYSATMVTWFKKIYPDLVVGGWASSAPLL 204
Score = 55.5 bits (132), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 68/303 (22%), Positives = 125/303 (41%), Gaps = 28/303 (9%)
Query: 151 YFNSAQALADYASILLHIKDKYNA-THAPVIAIGASYGGELATWFRLKYPHVVIGSLASS 209
Y + QALAD A + K Y + + VI +G SY + TWF+ YP +V+G ASS
Sbjct: 297 YLHVTQALADLAHFITTQKTTYEGLSDSKVIIVGGSYSATMVTWFKKIYPDLVVGGWASS 356
Query: 210 APILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAI-----L 263
AP+L + + Y ++ + +C + I+ +E++ + + G + L
Sbjct: 357 APLLAKLN---FLEYKEIMGQSITLMGGADCNKRIENGIAEMETMFATKRGAEVKALLKL 413
Query: 264 SKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAI--ENAPNCGDD--IL 319
+ F + L T L + + +++ Q I + +C I E++ G IL
Sbjct: 414 CEHFDVYSDLDVWT-LFNEISEIFAGVVQGHYAGRI--EGVCQKIMAESSDLIGVSKYIL 470
Query: 320 CKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPE 379
+ + + L YD + + E R W +Q+C+E G + S +
Sbjct: 471 DEFEKSGGKCNDLSYDAITT--VLLESRYSGNIMRQWIYQTCNE----YGWYQTSSSSAQ 524
Query: 380 PWN----LTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKL-ILRRSTSNIIFSNGMRDPFS 434
P+ L + C + YG S++ G + + + N+ ++G DP+
Sbjct: 525 PFGTKFPLALFTTMCADAYGSQYTNSFIEKQVGNTNADFGGMSPNVQNVYLTHGQLDPWR 584
Query: 435 RGG 437
G
Sbjct: 585 AMG 587
>gi|344309381|ref|XP_003423355.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 2-like
[Loxodonta africana]
Length = 579
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 49/138 (35%), Positives = 73/138 (52%), Gaps = 3/138 (2%)
Query: 47 SNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQ 106
++ GF ++ Q +DHF++ +TF QR++++ KFW PI + G E +
Sbjct: 161 ADPGFREGYFEQLLDHFNFERFGNKTFRQRFLVSEKFWKRNEGPIFFYTGNEGDVWSFAN 220
Query: 107 LSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILL 166
SGF E A + AL+V EHRYYGKS+PFG++ QALAD+A +L
Sbjct: 221 NSGFILELAAREAALVVFAEHRYYGKSLPFGAQSTQRGRTE---LLTVEQALADFAVLLQ 277
Query: 167 HIKDKYNATHAPVIAIGA 184
++ + A AP IA G
Sbjct: 278 ALRASFGAQDAPAIAFGG 295
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/137 (31%), Positives = 74/137 (54%), Gaps = 10/137 (7%)
Query: 354 EGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKN--CKEQYGVSPRPSWVLTYYGGHD 411
E W++Q+C+E+ + + S PE T+ ++ C + +GV PR W+ T + G +
Sbjct: 427 EAWDYQACTEINLTFASNNVSDMFPE-LLFTESLRQQYCLDTWGVWPRRDWLRTSFWGAE 485
Query: 412 IKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPD 471
+K + SNIIFSNG DP++ GG ++ + + G+H LDL E+ +DP
Sbjct: 486 LK-----AASNIIFSNGDLDPWAGGGIRSNLSASVIAVT--IQGGAHHLDLRESNPADPA 538
Query: 472 WLVQQRKTEVKIMQGWI 488
+V+ RK E ++ W+
Sbjct: 539 SVVEARKLEAALIHEWV 555
>gi|157119585|ref|XP_001659436.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108875273|gb|EAT39498.1| AAEL008698-PA [Aedes aegypti]
Length = 500
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 101/402 (25%), Positives = 174/402 (43%), Gaps = 46/402 (11%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF+ P++ TF Y N +F+ G PI F+G P+D G ++ A+
Sbjct: 62 VDHFN--PQNRDTFEFEYYSNDEFYRPG-GPIFIFVGGNWPLDQYYIEHGHFHDIANYEN 118
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHAP 178
A + EHRYYG S P + +++N + Y QA+ D A ++ H++ + A
Sbjct: 119 AWMFANEHRYYGHSFPVPN--LSVENLQ---YLTVEQAMVDLAELIYHVRHNVVRDDDAR 173
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VI +G Y G +ATW R +YPH+V G+ SS I + + + +D+ + E
Sbjct: 174 VILLGTGYAGAIATWMRQRYPHLVEGAWVSSGQIEARFNFKEYAMEIGELIRDY--GTNE 231
Query: 239 CFQTIKKSWSEIDNIASKPDGLA-ILSKKFKTCTPLKNTTELKDGLDTVY---SEAAQYD 294
C+ I +++ +N+ GLA ++ F TC + T L ++T + EA Q
Sbjct: 232 CYSQIWRAFRTAENLMDA--GLANTVTDLFNTCERVDTETMLD--VETFFYNVKEALQRA 287
Query: 295 TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVE-----ADSLEYDGNNSRCYINEDRTG 349
S + EN N + A +E D + +D + + + G
Sbjct: 288 ILSEQDTETTEELCENLNNSTEATDLHTIANWIEDFYYYLDCMPFDFDTTVAAHQFEEIG 347
Query: 350 --DESDEGWE---WQSCSEMVVPMGKDKNSMYQPEPWNLTKY--IKNCKEQYG------- 395
+ + G +Q C+E + D S QP + +T Y + C+ +G
Sbjct: 348 YPENAILGLRQRVYQFCTEFGWFLTAD--SPDQPFGYRVTMYFFLNFCRSVFGDWVTSEV 405
Query: 396 VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
V+ ++GG + ++ SN++F+NG DP G
Sbjct: 406 VADGVHLTNMHFGGKNPRI------SNVLFTNGALDPVRDVG 441
>gi|345314847|ref|XP_001512755.2| PREDICTED: dipeptidyl peptidase 2-like, partial [Ornithorhynchus
anatinus]
Length = 452
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/143 (35%), Positives = 78/143 (54%), Gaps = 9/143 (6%)
Query: 48 NHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL 107
+ GF+ ++ QT+DHF + +T+ QRY+I KFW G+ P+ + G E I + +
Sbjct: 28 DPGFQERYFEQTVDHFDFETYGNRTYLQRYLITEKFWKKGSGPLFFYTGNEGDIWNFAKN 87
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGS---RKAALKNARHRGYFNSAQALADYASI 164
S F E A AL++ EHRYYGKS+P G R+ ++ G QALADYA +
Sbjct: 88 SDFILELAAAESALVIFAEHRYYGKSLPLGPGSIRRGSM------GPLTVEQALADYAVL 141
Query: 165 LLHIKDKYNATHAPVIAIGASYG 187
+ ++ + A P++A G S G
Sbjct: 142 IGALQRQLGAAGLPLVAFGGSSG 164
>gi|157119581|ref|XP_001659434.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108875271|gb|EAT39496.1| AAEL008699-PA [Aedes aegypti]
Length = 512
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 108/222 (48%), Gaps = 15/222 (6%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPI-DDNIQLSGFTYE 113
F+ +DHF+ + T RY + ++ G PIL FLG PI + S Y+
Sbjct: 62 FFTTRVDHFNSQNTAEWTL--RYFAVTDYYMPG-GPILIFLGGNQPILTSMVDESTLIYD 118
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKY 172
A + + E R+YG+S F + A+ +N N+ Q LAD A + ++K D
Sbjct: 119 MAREMNGAVYAFESRFYGQS--FVTEDASTENL---SLLNTDQILADLAEFVQYLKRDVL 173
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
+APV+ G+ YGG LATWFR++YPH+ + +SS Y+ + + + +
Sbjct: 174 KNPNAPVMVSGSEYGGALATWFRVRYPHLAQAAWSSSG---YHHALMDFQEFSEAWGQTL 230
Query: 233 RD-TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL 273
D S+EC+ I ++ + N+ G IL KF C+P+
Sbjct: 231 IDHGSQECYNDIFVAFHVMQNLIDIGLG-DILYDKFNICSPI 271
>gi|390603317|gb|EIN12709.1| peptidase S28 [Punctularia strigosozonata HHB-11173 SS5]
Length = 575
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 70/238 (29%), Positives = 117/238 (49%), Gaps = 24/238 (10%)
Query: 42 LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS-PILAFLGAEAP 100
L+ + FE +++ Q +DHF+ E QT+ QRY IN++ + +S P++ G E
Sbjct: 75 LDPSSALPSFEAYWFEQPLDHFAR--EDTQTWRQRYWINTRHYKPNSSAPVIVLDGGETS 132
Query: 101 IDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKA-ALKNARHRGYFNSAQA 157
+ + +G A + V+LEHRYYG+S+P + AL+ + ++AQ+
Sbjct: 133 GANRLPFLDTGIVEILAKATGGVGVVLEHRYYGRSIPVDNLSTDALR------FLDNAQS 186
Query: 158 LADYASILLHIK-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
AD A + H+K + A HAP I G SY G A ++ YP +V G++ASS
Sbjct: 187 AADSARFMSHVKFEGIEEDLTAPHAPWIYYGGSYAGARAAHMKVLYPELVFGAIASSG-- 244
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLA-ILSKKFKT 269
+ + +T W Y ++ T C ++ + + IDNI + P IL + K+
Sbjct: 245 VTHAALTIW-EYMDIIRVAMDPT---CSSNLQSAIASIDNILTHPGPTGRILRRALKS 298
>gi|170045810|ref|XP_001850487.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868715|gb|EDS32098.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 726
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 65/226 (28%), Positives = 105/226 (46%), Gaps = 24/226 (10%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+ +DHF P++ TF Y N +++ G PI F+G + P++ G ++ A
Sbjct: 59 FRTRVDHFD--PQNRATFEFEYYSNDEYYERG-GPIFIFVGGDWPLEQYYIERGHFHDIA 115
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NA 174
+ A + EHRYYG S P +N R Y QA+ D A + H+++
Sbjct: 116 QRTNAWMFTNEHRYYGHSSPVSDYST--ENLR---YLTVEQAMVDLAEWIFHLRNNVVRD 170
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
T+A VI +G Y G +ATW R +YPH+V G ASS Y + + RD
Sbjct: 171 TNAKVILLGTGYAGAIATWARQRYPHLVDGVWASSE-------------YAEEIGELLRD 217
Query: 235 T-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTEL 279
++EC+ I +++ +N+ I+++ F TC P+ L
Sbjct: 218 LGNDECYSRIWRAFRTAENLMDA-GRTEIVTEMFNTCEPVDEENML 262
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 96/210 (45%), Gaps = 22/210 (10%)
Query: 72 TFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYG 131
TF Y N +++ G PI F+G + P++ G ++ A + A + EHRYYG
Sbjct: 400 TFEFEYYSNDEYYERG-GPIFIFVGGDWPLEQYYIERGHFHDIAQRTNAWMFTNEHRYYG 458
Query: 132 KSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGEL 190
S P +N R Y QA+ D A + H+++ T+A VI +G Y G +
Sbjct: 459 HSSPVSDYST--ENLR---YLTVEQAMVDLAEWIFHLRNNVVRDTNAKVILLGTGYAGAI 513
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSE 249
ATW R +YPH+V G ASS Y + + RD ++EC+ I +++
Sbjct: 514 ATWARQRYPHLVDGVWASSE-------------YAEEIGELLRDLGNDECYSRIWRAFRT 560
Query: 250 IDNIASKPDGLAILSKKFKTCTPLKNTTEL 279
+N+ + ++ F TC P+ L
Sbjct: 561 AENLMDAGRTETV-TEMFNTCEPVDEENML 589
>gi|328865129|gb|EGG13515.1| hypothetical protein DFA_11276 [Dictyostelium fasciculatum]
Length = 576
Score = 92.4 bits (228), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 110/400 (27%), Positives = 174/400 (43%), Gaps = 41/400 (10%)
Query: 47 SNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSP--ILAFLGAEAPIDDN 104
S GF TF NQ +DH + +TFPQRY IN F G +P ++ LG E PID
Sbjct: 31 SKTGFYTF--NQRVDH---NGVNVKTFPQRYCINKSFVHKGAAPKSVMLVLGGEGPIDPE 85
Query: 105 IQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
I A+ ++I+ LE RYYG+S+P + + + Y + Q L D A
Sbjct: 86 ITNHIPFIGVANNTNSIIIALEIRYYGESIPVPNM-----STDNMQYLTTDQILDDIAYF 140
Query: 165 LLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTY 224
+ Y + I +G SY G L+ W+R+KYP++ ++ASSAP I ++
Sbjct: 141 QTQFTNLYGLHNCKWIVMGCSYAGSLSAWYRMKYPNLAAAAIASSAP------IRAVVSF 194
Query: 225 HSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLD 284
H K +C + K+ + ++ + + +I +KF TC ++ D +
Sbjct: 195 HDYDRKVREALGLQCTKQFKQILNHVEQ-QLRVNNTSI-KRKF-TC-----DAKIDDKMF 246
Query: 285 T-VYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI--LCKIAAGVVEADSLEYDG--NNS 339
+ SEA Y N K I N G +I L I A + L +G N
Sbjct: 247 LFMLSEAISYSVQYNSRFKIISNICPPLIQSGSNIVKLLDIFADYITNMFLFKNGSCNEY 306
Query: 340 RCYINEDRTGDES-DEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKN--CKEQYGV 396
Y D S W +Q CSE + + + ++ +N CK +G
Sbjct: 307 NLYSFASTKVDYSGTRQWTYQLCSEYGWFLTASDSDLSLKSGQINEQWWENEVCKIMFGS 366
Query: 397 SPRP--SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFS 434
S +P + YG ++K++ +N++F+NG DP+S
Sbjct: 367 SMKPFVEKINLEYGIDNMKML-----TNVLFTNGGYDPWS 401
>gi|194899968|ref|XP_001979529.1| GG23317 [Drosophila erecta]
gi|190651232|gb|EDV48487.1| GG23317 [Drosophila erecta]
Length = 486
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 101/426 (23%), Positives = 177/426 (41%), Gaps = 47/426 (11%)
Query: 42 LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPI 101
+ K+ E + +Q +DHF + +T+ RY N+K++ PI F+G E I
Sbjct: 39 IRSKNELAAVEELWLDQKVDHFD--EHNNKTWRMRYYSNAKYFKP-QGPIYIFVGGEWTI 95
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
+ +G T++ A + ++ EHRYYG S+PFG+ L N + + Q+LAD
Sbjct: 96 SPGLLSTGLTHDMAVENSGMLFYTEHRYYGLSLPFGNESYRLNNLKQ---LSLHQSLADL 152
Query: 162 ASILLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP 220
A + H + + VI +G SY G L W YP ++ S ASSAP+L D
Sbjct: 153 AHFIRHQQSNTPEMEDSKVILVGGSYSGSLVAWMTQLYPDLIAASWASSAPLLAKAD--- 209
Query: 221 WTTYHSVVSKDFRDT-SEECFQTIKKSWS---------EIDNIASKPDGLAILSKKFKTC 270
+ Y +V K R + C I+K EI + +G + ++
Sbjct: 210 FFEYMEMVDKSIRLSYGHNCSLRIEKGLKFLVKLFDGDEIQELLYNLNG----CEGYRPK 265
Query: 271 TPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDI----LCKI--AA 324
PL + +GL ++ Q + IP R+C + + + GD++ K+ +
Sbjct: 266 NPL-DRAAFFNGLGNYFALVVQ-SYSAYIP--RLCETLMSL-DSGDELAFIEFLKLLYSE 320
Query: 325 GVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEM---VVPMGKDKNSMYQPEPW 381
G ++ ++ ++ +ED W +Q+C+E K S
Sbjct: 321 GRRSSECQDFGYSSMLELFSEDLDQSSETRAWFYQTCNEFGWYTTTKSKSSASQAFANQV 380
Query: 382 NLTKYIKNCKEQYGVSPRPSWVL-------TYYGGHDIKLILRRSTSNIIFSNGMRDPFS 434
L + + C++ +G + + +GG R + +IF++G DP+S
Sbjct: 381 PLGYFEQLCQDAFGAEQTAHQLAQGVEQTNSKFGGCGFNQSER--YAQVIFTHGALDPWS 438
Query: 435 RGGWVK 440
G K
Sbjct: 439 ALGQQK 444
>gi|307108517|gb|EFN56757.1| hypothetical protein CHLNCDRAFT_144219 [Chlorella variabilis]
Length = 303
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 95/186 (51%), Gaps = 13/186 (6%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSG 109
E F+ Q ++ S P T+ QRY + ++W + +PI + G E + + + +G
Sbjct: 63 ELFYEEQQTNNVSGKP----TWRQRYFLCDQYWDREDPYAPIFFYAGNEGNVANGVNNTG 118
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
+E A F AL+V EHRYYG S PFG ++ + + QA+ DY + L +K
Sbjct: 119 LMWERAQAFGALLVFAEHRYYGNSWPFGKEESLTLEGLQ--FLSMEQAIEDYVTFLNWLK 176
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAP----ILYYGDITPWTTYH 225
NAT APV+A G SYGG L R P V +++SSAP +L G P +Y
Sbjct: 177 ISLNATSAPVVAFGGSYGGVLVAIMRATRPSSVQAAVSSSAPMRGWLLQDGGYDP-GSYW 235
Query: 226 SVVSKD 231
VV++D
Sbjct: 236 EVVTRD 241
>gi|330803268|ref|XP_003289630.1| hypothetical protein DICPUDRAFT_154024 [Dictyostelium purpureum]
gi|325080278|gb|EGC33840.1| hypothetical protein DICPUDRAFT_154024 [Dictyostelium purpureum]
Length = 226
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 58/149 (38%), Positives = 83/149 (55%), Gaps = 11/149 (7%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN- 114
+ Q +DHF+ + TF QRY+IN +++ G P+ + E P+ + ++G +
Sbjct: 65 FTQNVDHFNI--VNTDTFQQRYLINDQYYDG-TGPVFIMINGEGPMGLDT-VTGLQFVVW 120
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A Q ALIV LEHRYYG S F + +L N + + NS QALAD A I KYN
Sbjct: 121 AKQLNALIVSLEHRYYGAS--FVTSDLSLDNLQ---FLNSQQALADNAVFREFIAQKYNI 175
Query: 175 -THAPVIAIGASYGGELATWFRLKYPHVV 202
+ ++ G SY G L +WFR+KYPH+V
Sbjct: 176 PSTTKWVSFGGSYSGALTSWFRIKYPHLV 204
>gi|193786915|dbj|BAG52238.1| unnamed protein product [Homo sapiens]
Length = 128
Score = 92.0 bits (227), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 69/121 (57%), Gaps = 7/121 (5%)
Query: 373 NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
+ M++P WNL + +C +Q+GV PRPSW+ T YGG +I S +NI+FSNG DP
Sbjct: 9 DDMFEPHSWNLKELSDDCFQQWGVRPRPSWITTMYGGKNIS-----SHTNIVFSNGELDP 63
Query: 433 FSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
+S GG K D + ++ G+H LDL DP ++ R EV+ M+ WI +Y
Sbjct: 64 WSGGGVTKD--ITDTLVAVTISEGAHHLDLRTKNALDPMSVLLARSLEVRHMKNWIRDFY 121
Query: 493 D 493
D
Sbjct: 122 D 122
>gi|291240192|ref|XP_002740004.1| PREDICTED: predicted protein-like, partial [Saccoglossus
kowalevskii]
Length = 381
Score = 91.7 bits (226), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 75/270 (27%), Positives = 125/270 (46%), Gaps = 49/270 (18%)
Query: 259 GLAILSKKFKTCTPLKNTTELKDGLDTV-------YSEAAQYDTP---------SNIPVK 302
GL L+K + C P+ KD L+ V + A +D P PVK
Sbjct: 54 GLQKLTKILRLCKPMT-----KDRLNHVNGWIRNSLTNLAMFDYPYPTTFLVPVPAYPVK 108
Query: 303 RICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNS--RCYINE-------DRTG---D 350
C+ I N+ D ++ G+V+A L Y+G +C+ + D TG
Sbjct: 109 VACSYIMNS---SDPLV-----GLVQAAGLYYNGTKGSLKCFDVDTEFVECADPTGCGVG 160
Query: 351 ESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGG 409
W++Q+C+E ++P G + K M+ P+ L + C++++ + PR W+ + G
Sbjct: 161 PDSMAWDYQACTESMMPAGSNGKTDMFPDLPFTLKMRDEYCEKKWNIVPRNDWLNVHLWG 220
Query: 410 HDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSD 469
DI + SNI+F+NG DP+ RGG + + D + ++ G+H LDL A D
Sbjct: 221 KDI-----LTASNIVFANGSLDPWRRGGVLSS--LSDSLIAVLIDGGAHHLDLRGANPLD 273
Query: 470 PDWLVQQRKTEVKIMQGWITQYYDDFKAIN 499
P ++ R+ EVK +Q WI Q + ++
Sbjct: 274 PQSVLNARQEEVKYIQKWIEQEQEALALVD 303
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 3/57 (5%)
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGA 184
RYYGKS+PFG L N G+ + QA+ADYA ++ ++K K NA PVIA G
Sbjct: 1 RYYGKSLPFGDSSFILGNI---GFLSIEQAMADYAVLIHYLKIKLNAAKCPVIAFGG 54
>gi|268579387|ref|XP_002644676.1| Hypothetical protein CBG14660 [Caenorhabditis briggsae]
Length = 512
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 100/428 (23%), Positives = 181/428 (42%), Gaps = 63/428 (14%)
Query: 91 ILAFLGAEAPIDDN-IQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARH 148
+ + EAP D IQ + Y A +F A + LEHR +G+S P+ + + N +
Sbjct: 94 VFLMIQGEAPATDLWIQNPNYQYLKWAKEFGADVFQLEHRCFGQSRPY--KDLSYPNIK- 150
Query: 149 RGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLAS 208
+QA+AD + + + +YN + I G SY G L+ FR ++P +G++AS
Sbjct: 151 --VCTMSQAIADIHNFIGQMNIQYNFRNPKWITFGGSYPGTLSALFRQQHPEDTVGAVAS 208
Query: 209 SAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFK 268
SAP+ + D + Y VV TS +C+Q +K ++ ++ ++ G+ L+ F
Sbjct: 209 SAPLDWTLD---FFEYAMVVEDVLNQTSTDCWQNVKDAFYKMQQLSLTKQGIQQLNAYFD 265
Query: 269 TCTPLKNTTELKDGLDTVYSE-------AAQYD-------TPSNIPVKRICNAI--ENAP 312
P + + +D ++ QY T + + V+++CN + +N P
Sbjct: 266 LSPPFVDGQYTQHDIDNFFANLYGFFQGVVQYTYDGRNAATLNGLNVQQLCNKMNDKNVP 325
Query: 313 NCGDDILCKIAAGVVEADSLEYD--GNNSRCY---------INEDRTGD-ESDEGWEWQS 360
D++ ++ V + + D G Y ++ D +G+ ++ GW W
Sbjct: 326 ----DVINRVNNTVNWINQMNGDTVGPFQNSYKDMMAVLANVSYDNSGELAANRGWMWLC 381
Query: 361 CSEMVVPMGKD--KNSMYQPEPWNLTKYIKNCKEQYG-------VSPRPSWVLTYYGGHD 411
C+E+ D +N Q P + +I C + +G + L YGG D
Sbjct: 382 CNELGALQTTDQGRNIFQQTVP--MGYFIDMCTDMFGPDVGIKFIRDNNKQTLYKYGGAD 439
Query: 412 IKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL--GSHCLDLDEAKKSD 469
+N++ NG DP+ G TY+ + L + +HC D+ +
Sbjct: 440 -----NYQATNVVLPNGAFDPWHVLG---TYNNNTANHMTPLLIQGAAHCSDMYPTYAGE 491
Query: 470 PDWLVQQR 477
P L + R
Sbjct: 492 PADLQRNR 499
>gi|195062790|ref|XP_001996254.1| GH22390 [Drosophila grimshawi]
gi|193899749|gb|EDV98615.1| GH22390 [Drosophila grimshawi]
Length = 497
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/160 (36%), Positives = 85/160 (53%), Gaps = 9/160 (5%)
Query: 57 NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAH 116
+Q +DHF + +T+ RY+ N K+ PI F+G E I + +G T++ A
Sbjct: 53 SQKLDHFD--ELNNKTWQMRYLRNDKY-HKPQGPIYIFVGGEWTITPGLLSTGLTHDMAV 109
Query: 117 QFKALIVILEHRYYGKSVPFGS--RKAALKNARHRGYFNSAQALADYASILLHIKD-KYN 173
+ ++ EHRYYG+S+P S +L+N +H N QALAD A + + K N
Sbjct: 110 ENAGILFYTEHRYYGQSLPHNSSHNSMSLENLKH---LNLHQALADLACFIRYQKSHSAN 166
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL 213
TH+ VI IG SY G + W YP +V S ASSAP+L
Sbjct: 167 LTHSKVILIGGSYSGSMVAWMTQLYPELVTASWASSAPLL 206
>gi|403416809|emb|CCM03509.1| predicted protein [Fibroporia radiculosa]
Length = 562
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 72/222 (32%), Positives = 105/222 (47%), Gaps = 22/222 (9%)
Query: 42 LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG-GGNSPILAFLGAEAP 100
L+ D F ++ Q +DHFS P TF QRY IN++ + G N+P++ G E
Sbjct: 61 LQSLDEPSEFPAHWFTQPLDHFSKTP---HTFNQRYWINTRHYKPGSNAPVIVLDGGETS 117
Query: 101 IDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL 158
+D I +G A + V+LEHRYYG+SVP A L R + N+ Q+
Sbjct: 118 GEDRIPFLDTGIVEILAKATGGVGVVLEHRYYGESVPV----ANLTTDSLR-WLNNDQSA 172
Query: 159 ADYASILLHIK-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL 213
AD A+ + ++K + A P I G SY G A R+ YP +V G++ASS +
Sbjct: 173 ADSANFMANVKFPGIHEDLTAPGTPWIYYGGSYAGARAAHMRILYPDLVYGAIASSG--V 230
Query: 214 YYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIAS 255
+ I W Y ++ R EC I+ + ID I S
Sbjct: 231 THAAIEHW-EYMDIIR---RAADPECSANIQSAIEAIDTILS 268
>gi|326436306|gb|EGD81876.1| thymus specific serine peptidase [Salpingoeca sp. ATCC 50818]
Length = 500
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 96/337 (28%), Positives = 142/337 (42%), Gaps = 39/337 (11%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFW-GGGNSPILAFLGAEAP-IDDNIQLSGFTY 112
++ Q DHF + T+ Q Y +N FW G N+P+ +G E P ID ++ +S
Sbjct: 47 YFTQWQDHFD--GTNVNTWQQAYYVNDTFWKGDANAPVFLCVGGEGPPIDGSVVVSSVHC 104
Query: 113 ENAHQF----KALIVILEHRYYG----KSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
A + A++ +EHRYYG + P S LK + +S QALAD A
Sbjct: 105 NGAVEMLPETGAIMFAVEHRYYGCHNMSACPVTSF---LKPKDALRFLSSRQALADLAGF 161
Query: 165 LLHIKDKYNATHAPV---IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
+ Y P ++ G SY G LA WFRLK+PH+V S+ASSAP+ D+
Sbjct: 162 HAYATATYGLK--PTNKWVSFGGSYPGMLAGWFRLKFPHLVHASVASSAPVQAIVDM--- 216
Query: 222 TTYHSVVSKDFRDT------SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-- 273
Y+ VV++ + + S C + I + I + S G L+ F
Sbjct: 217 VGYNDVVAEAYAVSNNNVGGSPACRKAIADGHAMIGQMFSSDSGRTRLANLFGHNAKWYE 276
Query: 274 -KNTTELKDGLDTVYSEAAQYDTPSNIP---VKRICNAIENAPNCGDDI--LCKIAAGVV 327
K G Y A D P + RIC + N + GD++ L I
Sbjct: 277 NKLNQASFAGFGVAYFPAQGNDPACTDPACNIGRICAVMTNT-SLGDEVSRLAAIRNMQD 335
Query: 328 EADSLEYDGNNSRCYINEDRTGD-ESDEGWEWQSCSE 363
E S ++ N + + D E + W WQ C+E
Sbjct: 336 EWLSQPFETVNRKHSLMHAAGNDAELPDFWSWQVCTE 372
>gi|219130565|ref|XP_002185433.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403147|gb|EEC43102.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 538
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/346 (25%), Positives = 147/346 (42%), Gaps = 37/346 (10%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG--GNSPILAFL--GAEAP------I 101
+T + NQ ++HF ++F QR+ + ++ N AFL G E P +
Sbjct: 60 QTRYVNQQLNHFD--ASDTRSFAQRFFYSDRYARAREENRNTYAFLCVGGEGPALDESVL 117
Query: 102 DDNIQLSGFTYENAHQF-----KALIVILEHRYYGKSVPFGSRKAALKNA-------RHR 149
D++ +G E AH K + LEHRYYG+S P KN +H
Sbjct: 118 VDSVHCTGDMLELAHILFEDGHKVHLYALEHRYYGESYPVFREGGCSKNRTTSPVTNQHL 177
Query: 150 GYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASS 209
Y +S QALAD A ++ + T+ + G SY G +A W R KYPH++ +++SS
Sbjct: 178 VYLSSTQALADLAH-FVNSRSLDGGTNIKWVTFGGSYPGMMAAWARSKYPHLIHAAVSSS 236
Query: 210 APILYYGDITPWTTYHS-VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFK 268
AP+ D + + + S V++ S EC + + E+ + A L+ F
Sbjct: 237 APVQAVLDFSAYNNHVSKVLASANVGGSSECLAVFQAAHGEVTRMVHDATQHAGLADMFG 296
Query: 269 TCTP---LKNTTE---LKDGLDTVYSEAAQYDTPSNI-PVKRICNAIENAPNCGDDILCK 321
C L++ + L DGL ++++ ++ + +IC + N +
Sbjct: 297 LCNATSLLEDRNKELFLGDGLVDLHTQGNNPSCDRDLCNIGKICRTLLNDAKSFKPVTAL 356
Query: 322 IAAGVVEAD---SLEYDGNNSRCYINEDRTGDESD-EGWEWQSCSE 363
+ D + D + YI++ G E W WQ+C+E
Sbjct: 357 ANLAQRQRDRGACINIDWTGTLDYISDPVRGVEGGLRSWLWQTCTE 402
>gi|104531986|gb|ABF72901.1| CG3734-like [Belgica antarctica]
Length = 184
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 84/162 (51%), Gaps = 9/162 (5%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
ET + Q ++HF P+ ++ + RY+ N +F G PI ++G E I + S
Sbjct: 24 ETKWIEQPLNHFD--PQDHRVWQMRYMENREFLQDG-GPIFIYVGGEWTISEGWLRSSHF 80
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
+ A Q + EHRYYG+S P + + N R + N QALAD A + HIK
Sbjct: 81 HYMAEQLNGTLYYTEHRYYGESHP--TEDLTVDNLR---FLNIDQALADLAHFITHIKQT 135
Query: 172 Y-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
++ VI IGASY + TWF KYPH+ G+ +SSAP+
Sbjct: 136 TPELQNSGVILIGASYSATMVTWFMQKYPHLARGAWSSSAPL 177
>gi|198477695|ref|XP_002136487.1| GA23306 [Drosophila pseudoobscura pseudoobscura]
gi|198145255|gb|EDY71959.1| GA23306 [Drosophila pseudoobscura pseudoobscura]
Length = 292
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 73/240 (30%), Positives = 116/240 (48%), Gaps = 32/240 (13%)
Query: 27 LKLRPR-------LGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVI 79
L+L+P L +RR+ LE+ ET ++ +D+F+ + T+ R +I
Sbjct: 23 LELQPEPNAFVQSLRELRRAPPLEKSRKRANVETRWFTLKLDNFNAANNA--TWKDRVLI 80
Query: 80 NSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSR 139
N + G SPI +LG E I+ + SG + A + ++ EHR++GKS P
Sbjct: 81 NEDHFTDG-SPIFIYLGGEWEIEPSAITSGLWVDIAKEHNGSLIYTEHRFFGKSFPITP- 138
Query: 140 KAALKNARHRGYFNSAQALADYASIL--LHIKDKYNATHAPVIAIGASYGGELATWFRLK 197
+ KN +++ + QALAD I+ L ++DKY + VI G SY +ATW R
Sbjct: 139 -LSTKNLKYQ---SVQQALADVVHIIKTLKLEDKYK--DSKVIVSGCSYSATMATWIRKL 192
Query: 198 YPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTI--KKSWSEIDNIAS 255
YP +++GS ASSAP+ D KD+ + + F+ + K + IDN S
Sbjct: 193 YPDIILGSWASSAPLEAKVDF-----------KDYMEVVGQAFEQLGGKYCYDLIDNATS 241
>gi|157119587|ref|XP_001659437.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108875274|gb|EAT39499.1| AAEL008702-PA [Aedes aegypti]
Length = 499
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 98/404 (24%), Positives = 170/404 (42%), Gaps = 48/404 (11%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ +DHF+ P+ TF Y N +++ G PI +G P+ G ++
Sbjct: 57 LFRTRVDHFN--PQKRDTFQFEYFSNDQYYRPG-GPIFIVVGGNFPVSPYFLEHGHFHDI 113
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-N 173
A A + EHR+YG S+P + +++N R Y Q + D A + H++
Sbjct: 114 AFYENAWMFTNEHRFYGNSMP--TEDLSVENLR---YLTVEQTMVDLAEWIFHLRQNVVR 168
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
+A VI +G Y G +ATW R +YPH+V G+ SS I + + Y V + R
Sbjct: 169 DQNARVILLGTGYSGAIATWMRQRYPHLVEGAWVSSGQIEARFN---FKEYAEEVGELIR 225
Query: 234 D-TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY----- 287
D S EC+ I +++ +N+ G + +S+ F TC P+ ++T +
Sbjct: 226 DYGSNECYSQIWRAFRTAENLIDAGLG-STVSELFNTCEPIVTDDITMLDVETFFWHVKT 284
Query: 288 -SEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVE----ADSLEYDGN---NS 339
+ D +C + N+ D L IA V E D + +D + ++
Sbjct: 285 ALQRGVLDEQDTDTTNELCERLNNSTEATD--LQTIANWVHEFYDFLDCMPFDFDAAIDA 342
Query: 340 RCYINEDRTGDESDEGWE---WQSCSEMVVPMGKDKNSMYQPEPWNLTKY--IKNCKEQY 394
Y++ + + + G +Q C+E + D + QP + ++ Y + C+ Y
Sbjct: 343 HQYVDP-KVPENAVYGLRQRTYQLCTEFGWFLTADSHD--QPFGYRVSMYFFLNVCRSVY 399
Query: 395 GVSPRPSWV---LTYYGGHDIKLIL---RRSTSNIIFSNGMRDP 432
G W+ + Y G H + SN+ F+NG DP
Sbjct: 400 G-----DWLNSQVVYDGVHLTNMHFGGQNPRISNVFFTNGGLDP 438
>gi|403343430|gb|EJY71042.1| Serine carboxypeptidase S28 family protein [Oxytricha trifallax]
Length = 494
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 119/519 (22%), Positives = 196/519 (37%), Gaps = 88/519 (16%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPES- 69
L+ V FV+T A + R + + +++ + F + Q +DHF P +
Sbjct: 6 LISTVAFVATQVLAKDNYVADAKIRAKYTFHMDETE----FVEMNFTQIVDHFPPTPTND 61
Query: 70 YQTFPQRYVINSKFWGGGNSPILAFLGAEA---PIDDNIQLSGFTYENAHQFKALIVILE 126
T+ QRY IN K + N P+ ++ E P D G+ + A +F A+ +E
Sbjct: 62 AATYQQRYFINDKHFDKDNGPVFLYICGEGTCKPPSDR----GYPMQLAIEFGAMFYAVE 117
Query: 127 HRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASIL----LHIKDKYNATHAPVIAI 182
HR+YG S P A + + Y + QALAD A + I +Y + I
Sbjct: 118 HRFYGTSQP-----TADWSTDNLKYLTAEQALADLAGFIDAQNAAIIKQYGGAARKWVTI 172
Query: 183 GASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQT 242
G SY G L+ WF+ YP + +SS IL +DF D + FQ
Sbjct: 173 GGSYPGALSAWFKQAYPDKAAAAWSSSGVIL--------------PIRDFTDFDMDIFQA 218
Query: 243 IKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY-SEAAQYDTPSN--- 298
+S E P + L+ + +T LKN + K G D Y E +N
Sbjct: 219 TSRSGPEC------PAFIQSLTTQIETI--LKNQEQGKGGADFDYLCEVFGITKDANFGD 270
Query: 299 ----------IPVK-----RICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYI 343
I V+ +C +E+ + + + + +
Sbjct: 271 FMFYVADIFTIGVQYGGRIELCQMLESIQFSTFKAQLPVLQQYAKGKGMFFGQYDRVALA 330
Query: 344 NEDRTGDESDEGWEWQSCSE---MVVPMGKD--KNSMYQPEPWNLTKYIKNCKEQYGV-- 396
D + W WQ C+E +P + ++S + W + C+ +G
Sbjct: 331 QTAYVTDNNMRQWTWQYCTEFGWFQIPAATNPMRSSYIGHDYW-----VPYCEAVFGAKI 385
Query: 397 -SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD-----LSFS 450
P+ + + YGG DIK NI F+N + DP+ G ++T + ++
Sbjct: 386 GEPKVDYYIQKYGGLDIK------ADNIFFANSIEDPWQYAG-MRTVKNPETQKNLVANL 438
Query: 451 QDLNLGSHCLDLDEAKKSD-PDWLVQQRKTEVKIMQGWI 488
D N HC DL D P + + +I+ W+
Sbjct: 439 IDCNDCGHCQDLKTPSTGDAPALTIARANLRQQILITWL 477
>gi|170574564|ref|XP_001892869.1| Serine protease Z688.6 precursor [Brugia malayi]
gi|158601363|gb|EDP38291.1| Serine protease Z688.6 precursor, putative [Brugia malayi]
Length = 108
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 47/94 (50%), Positives = 62/94 (65%), Gaps = 4/94 (4%)
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK--Y 172
A +F A I+ EHR+YGKS PFG+ A R+ GY +S QAL D+A ++ H+K+K
Sbjct: 5 APEFNAAIIFAEHRFYGKSQPFGNESYA--TIRNLGYLSSEQALGDFALLIYHLKNKRLL 62
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSL 206
A ++ VIA G SYGG LA W R+KYPH+V GS
Sbjct: 63 VAQNSSVIAFGGSYGGMLAAWMRIKYPHLVEGSF 96
>gi|123457340|ref|XP_001316398.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121899103|gb|EAY04175.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 440
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 105/473 (22%), Positives = 198/473 (41%), Gaps = 48/473 (10%)
Query: 29 LRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN 88
+ P + IRR +Q TF +Q IDH TF Q++VIN+++ GG +
Sbjct: 1 MLPLILAIRRPITRQQLRGTETNATF--DQLIDH---NHSETGTFKQKFVINNQY-GGPD 54
Query: 89 SPILAFLGAEAPIDDNIQLSG---FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKN 145
SPI+ + E+ D + G F A +F +V L+HR+YG+S PF ++ +N
Sbjct: 55 SPIILEISGES---DGYYVGGVGDFEETLAKEFNCTVVTLQHRFYGESYPF--EESTTEN 109
Query: 146 ARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGS 205
+ Y + QA+ D + + + K Y A + G SY G L+ + + K+ G+
Sbjct: 110 LQ---YLSVEQAVEDISYFVDYYKKTYKADKNKWLLYGGSYPGLLSAYTKSKFDSKFAGA 166
Query: 206 LASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSK 265
++SS +L + T + + +C + + ID + +G +
Sbjct: 167 ISSSGVVLAQKEFTDFDKQIEI------SLGHQCAAACRTARRHIDTLLETEEGTQYVLN 220
Query: 266 KFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAG 325
F + G ++S A QY + +C +E + G D + +A
Sbjct: 221 LFNANGVEPDIFRFVVG--ELFSIAPQYGHR-----EALCGPMEGSLITGKDPMLVLAEF 273
Query: 326 VVEADSLEYDGN----NSRCYINEDRTGDESDEGWEWQSCSEMVV-PMGKDKNSMYQPEP 380
+ G N + T +++ W WQ+CS++ +G K S+ P
Sbjct: 274 NNNFFIPNFIGKSTIANEYSTASLKDTKNKAARSWLWQTCSQLGWWQVGAGKTSLRSPLL 333
Query: 381 WNLTKYIKNCKEQYGVSPRPSW--VLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGW 438
T + K C + +G++ P +GG D ++ +NI++ G +DP++
Sbjct: 334 TTET-FAKQCNDVFGLTDEPDTDAFNAKWGGLD------QTATNIVYLTGSQDPWTPVCI 386
Query: 439 VKTYHFFDLSFSQDL---NLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ + + + N+G HC D +DP + + R+ + +++ W+
Sbjct: 387 TDEKVPNENAAAHTMTGPNVG-HCTDYHLPSNNDPADVKRTRQMVISLVKKWL 438
>gi|414870774|tpg|DAA49331.1| TPA: putative serine peptidase S28 family protein [Zea mays]
Length = 357
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 79/282 (28%), Positives = 125/282 (44%), Gaps = 31/282 (10%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E + NQT+DHF+ P ++ F QRY ++ N PI ++ E+ N + +
Sbjct: 57 EEHWMNQTLDHFN--PTDHRQFKQRYYEFLDYYRAPNGPIFLYICGESTC--NGIGNNYL 112
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
A +F A +V EHRYYGKS PF S + + +S QAL D A + ++
Sbjct: 113 AVVAKKFGAALVSPEHRYYGKSSPFNSL-----TTENLQFLSSKQALFDLAVFRQYYQET 167
Query: 172 YNATH------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
NA + + G SY G L+ WFRLK+PH+ GSLASS +L + T +
Sbjct: 168 LNAKYNRSGANSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVLAVYNFTDFDKQI 227
Query: 226 SVVSKDFRDTSEECFQTIKKSWSEID-NIASKPDGLAILSKKFKTCTPLKNTTELKDGLD 284
+ EC +++ +D + S + + K+ T L+N + L
Sbjct: 228 GI------SAGPECKAALQEITGLVDGQLQSGRNSV----KELFGATKLENDGDFLYLLA 277
Query: 285 TVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGV 326
+ A QY P +C+ + A G D++ A+ V
Sbjct: 278 DAAAIAFQYGNP-----DVLCSPLAEAKKNGTDLVETFASYV 314
>gi|218198654|gb|EEC81081.1| hypothetical protein OsI_23904 [Oryza sativa Indica Group]
Length = 232
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 81/153 (52%), Gaps = 13/153 (8%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW----GGGNSP---ILAFLGAEAPIDD 103
F ++ Q +DHF++ P + F Q+Y++N FW G +P I + G E I+
Sbjct: 82 FTAHYFPQELDHFTFTPNASALFYQKYLVNDTFWRRSAAAGETPAGPIFVYTGNEGDIEW 141
Query: 104 NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS 163
+GF ++ A F AL+ HR+YG+S PFG+ + + GY S QALAD+A
Sbjct: 142 FATNTGFMFDIAPSFGALL----HRFYGESKPFGNESNS--SPEKLGYLTSTQALADFAV 195
Query: 164 ILLHIKDKYNATHAPVIAIGASYGGELATWFRL 196
++ +K +A +PV+ G SYGG + +L
Sbjct: 196 LITSLKHNLSAVSSPVVVFGGSYGGSKSPIIKL 228
>gi|167536011|ref|XP_001749678.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771826|gb|EDQ85487.1| predicted protein [Monosiga brevicollis MX1]
Length = 495
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 107/432 (24%), Positives = 178/432 (41%), Gaps = 51/432 (11%)
Query: 72 TFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYG 131
TF RY + + +G EAP +N + + + A ++KAL V +EHR+YG
Sbjct: 99 TFSCRYYTSDLHYDREQGVCFFEMGGEAP--NNGIGNDYIADLAKRYKALQVSIEHRFYG 156
Query: 132 KSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELA 191
+SVP ++ N Y S QALAD A+++ H+ Y+ +A G SY G L+
Sbjct: 157 ESVP--GDDFSVDNLH---YLTSRQALADAAALIDHVNRTYHCR--KWMAFGGSYSGALS 209
Query: 192 TWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEID 251
WFR KYPH++ G+L+SS + + + + V++ D + + + ++
Sbjct: 210 AWFRTKYPHIIDGALSSSGVV---NAVLNFVEFDEQVAEAIGDQCADNVRYVTAAF---- 262
Query: 252 NIASKPDGLA-ILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIEN 310
+AS+ + L +K P + + L + AAQY+ + +C+A+ N
Sbjct: 263 -VASQQNDLINAATKALFGLDPGMDDGDFAYMLADSAAMAAQYNNKA-----ALCDALAN 316
Query: 311 APNCGDDILCKIAAGVVE-------ADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSE 363
+ + A S YD RC + + W WQ C E
Sbjct: 317 VQKDSHEAARQRFANFTNFFWGSDFGSSCFYD---RRCVYAQPTRWQPTARSWWWQKCYE 373
Query: 364 MVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT--YYGGHDIKLILRRSTS 421
+ ++ + + + + K C +G+S P T YYGG +T+
Sbjct: 374 LAYWQNHPQSGSLRMDLITMDYHQKRCNAMFGLSSLPDTNGTNAYYGG------AAPNTT 427
Query: 422 NIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL-----GSHCLDLDEAKKSDPDWLVQQ 476
+ FSN DP+ + K +LS S L HC DL +SDP L
Sbjct: 428 KVYFSNFSDDPWQQASVRK-----ELSPSLPFELVHCDGCGHCDDLHLPSRSDPPQLTAS 482
Query: 477 RKTEVKIMQGWI 488
R+ + W+
Sbjct: 483 RQRFETYLTTWL 494
>gi|17566516|ref|NP_507841.1| Protein PCP-4 [Caenorhabditis elegans]
gi|11064521|emb|CAC14390.1| Protein PCP-4 [Caenorhabditis elegans]
Length = 1042
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 123/489 (25%), Positives = 192/489 (39%), Gaps = 69/489 (14%)
Query: 19 STSFHANGLKLRPRLGRIRRSRILEQKDSNH--GFETFFYNQTIDHFSYGPESYQTFPQR 76
S+ + L P G R R LE S++ GFE + Q DHF + F Q+
Sbjct: 512 SSKVFSRTLPRTPSRGIFHR-RHLELLASSYPAGFEQGTFRQRQDHFDNLNVDF--FQQK 568
Query: 77 YVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYE-NAHQFKALIVILEHRYYGKSVP 135
+ NS++ G L G EA + + + +A ++ A + +LEHR+YG S+
Sbjct: 569 FYKNSQWARPGGPNFLMIGGQEAEGESWVLNEKLPWLISAQKYGATVYLLEHRFYGDSL- 627
Query: 136 FGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFR 195
+ N + +S Q L D A + I K ++ P I G S+ L+ W R
Sbjct: 628 -------VGNNTNLNLLSSLQVLYDSAEFIKAINYKTQSS-TPWITFGRSF--PLSAWTR 677
Query: 196 LKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIAS 255
+P +V G+++SS IL D + Y V+ R C IK + EI +
Sbjct: 678 AIFPDLVTGAVSSSGAILAKTD---FFEYLMVMETSIRKYDNSCADRIKSGFDEIRGLFL 734
Query: 256 KPDGLAILSKKFKTCTPL-KNTTELKD-----GLDTVYSEAAQYDTPSNIP------VKR 303
+G LSK F+ +N TE L + + A Q+ ++ P +
Sbjct: 735 TSEGRQDLSKIFQLLPGFSENVTETDQHFFFSNLYSNFQLAVQFSGDNSGPWADGYGIPE 794
Query: 304 ICNAIENAPNCGDDILCKIAAGVV---EADSLEYDGNNSRCYINEDRTGDESDEG----- 355
+C + A D+I+ A + GNN I + + EG
Sbjct: 795 MCRFMTGAGTPLDNIVAFNAYMTSFNNGGGTYTGMGNNYTAMIYNLKNSKDYGEGVDPTL 854
Query: 356 -WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTY-------Y 407
W WQ+C+E D S P ++ I+ C + +G + S + + Y
Sbjct: 855 LWTWQTCTEYGGFQSADSGSGLFGSPVPVSFLIQMCMDLFGNTYDRSKIDSLIDFTNYKY 914
Query: 408 GGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD-------LNLGSHCL 460
GG D SN++F NG DP YH L S D ++ SHC
Sbjct: 915 GGRD-----NFKGSNVVFINGNIDP---------YHVLGLFNSPDSSVVSYLIDGSSHCA 960
Query: 461 DLDEAKKSD 469
D+ A+ SD
Sbjct: 961 DMFPARDSD 969
Score = 65.9 bits (159), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 91/414 (21%), Positives = 163/414 (39%), Gaps = 50/414 (12%)
Query: 53 TFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTY 112
T + +Q +DHFS + + F Q+Y + ++ E N +++ T
Sbjct: 41 TGYLSQKLDHFSNDSQVF--FTQQYFYTERLSVSNQKVAFLYVNTEG----NEEIAVMTD 94
Query: 113 EN------AHQFKALIVILEHRYYGKSVP-FGSRKA-ALKNARHRGYFNSAQALADYASI 164
E A +F A + L+HRYYG S P F + A AL+ Y S QA+ D S
Sbjct: 95 ERSPVVKAAKRFGAQLFALKHRYYGASKPNFQNFDASALR------YLTSRQAIQDILSF 148
Query: 165 LLHIKDKYNAT-HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+ + ++N + G YGG LA R P V G+++SSAP+ D W
Sbjct: 149 IKYANTQFNMNPDVRWVLWGTGYGGILAAEARKTDPVAVSGAISSSAPLRRLYDF--WQF 206
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL 283
V + + C+ +++ +++I G + +S F+ L T + +
Sbjct: 207 NDFVGNTLMQIGGSNCYGRVQQGFADIRQAMKTTAGRSQISDLFQLNPRLDQTQLGYNDI 266
Query: 284 DTVYS-------EAAQYDTPSNIPVKRICNAIENAPNCGDDI-------LCKIAAGVVEA 329
Y+ E Q++ NI + +C I N+ ++ L G V+
Sbjct: 267 QMFYTAIIGPFQEIVQFNNDFNISITDMCTIIANSSWTNMEVVRQAYVYLSTTLTGSVQP 326
Query: 330 DSLEYDGNNSRCYINEDRTGDES-----DEGWEWQSCSEMV-VPMGKDKNSMYQPEPWNL 383
++ + + +N+ + S + W +Q C+E+ P +
Sbjct: 327 MTIA----SYQKVVNDLKNDSVSSPFVENRMWTYQICTELGWFPTTNNNEQGLFGAVVPT 382
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
+ YI C + + P S T + + +N++F+NG DP+S G
Sbjct: 383 SIYINQCSD---IFPDASLTATSIRDSIVSSDSVYTGTNVVFTNGFYDPWSVLG 433
>gi|71986744|ref|NP_500596.2| Protein F19C7.4 [Caenorhabditis elegans]
gi|373219711|emb|CCD69715.1| Protein F19C7.4 [Caenorhabditis elegans]
Length = 542
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 123/495 (24%), Positives = 195/495 (39%), Gaps = 82/495 (16%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN-----IQLSGFTY 112
Q +DHFS G + + Q Y N KF+ + +G E+ I+ I+ G T
Sbjct: 55 QKVDHFSNG-TNIGVWQQHYQYNWKFYNKTTGYVFLMIGGESSINKTNGDRWIRHEGETM 113
Query: 113 EN-AHQFKALIVILEHRYYGKS--VPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
+F+A +EHR+YG P G + A QALAD + I
Sbjct: 114 MKWVAEFQAAAFQVEHRFYGSKEYSPIGDQTTA-----SMKLLTIDQALADIKEFITQIN 168
Query: 170 DKYNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
Y P+ + G SY G L+ +FR YP + G+++SS+ + + D + Y
Sbjct: 169 ALYFKDDKPIWVTFGGSYPGSLSAFFRETYPEMTAGAVSSSSAVHVFVD---YYGYAINT 225
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIA-SKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
K +R S+ C IK ++ ++ A + D A+L ++F C + T L + +
Sbjct: 226 EKTYRTVSDSCGDVIKVAFQKLITKAYNGSDSRALLKQQFNLCDSF-DETNLSKAVQFFF 284
Query: 288 SEAAQY--------------DTPSNIPVKRICNAIENAPNCGDDILCKIAAGVV-----E 328
Y T S + V C+ + NA GD++ IA + +
Sbjct: 285 QNVYGYFQIINQYTGDNKSNATRSGLGVPAACDLLNNA-TIGDEVQRVIAVMNLYDSWFK 343
Query: 329 ADSLEYDGNNSRCYI---------NEDRTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQP 378
+ NN +I NE+ G S W WQ+C+E+ D N
Sbjct: 344 PSASGCRPNNYTAFIQAYSDTTMPNENVIGTRS---WIWQTCTELGYYQTTDGGNGGIFG 400
Query: 379 EPWNLTKYIKNCKEQYGVSPRPSWVL-----------TYYGGHDIKLILRRSTSNIIFSN 427
L + C + +G P + L T YGG R T N++F N
Sbjct: 401 STVPLDFFADQCIDLFG----PEYTLDNTFKLVDQVRTKYGGAGT----YRGT-NVVFPN 451
Query: 428 GMRDPFSRGG--WVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQ 485
G DP++ G W T + D + SHC D+ A SD L R ++I
Sbjct: 452 GSFDPWNGLGYKWNNTNNNVDAWLIEGT---SHCADMYPASDSDKQSLKDAR---IRI-H 504
Query: 486 GWITQYYDDFKAINK 500
G ++++ D +AI +
Sbjct: 505 GHLSRWLSDAQAIRQ 519
>gi|227202540|dbj|BAH56743.1| AT5G22860 [Arabidopsis thaliana]
Length = 171
Score = 88.2 bits (217), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 70/116 (60%), Gaps = 4/116 (3%)
Query: 44 QKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG--GNSPILAFLGAEAPI 101
QK + +++NQT+DHF++ PESY TF QRY I+S WGG N+PILAFLG E+ +
Sbjct: 47 QKVDESNLKMYYFNQTLDHFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEESSL 106
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQA 157
D ++ GF +N + AL+V +E +G + SR++ K ++ G F + +
Sbjct: 107 DSDLAAIGFLRDNGPRLNALLVYIE-VLWGDDAIWISRRST-KERKYVGVFERSTS 160
>gi|32351098|gb|AAP74974.1| thymus specific serine peptidase [Homo sapiens]
Length = 155
Score = 87.8 bits (216), Expect = 1e-14, Method: Composition-based stats.
Identities = 53/142 (37%), Positives = 82/142 (57%), Gaps = 13/142 (9%)
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L+ A+ R + +S ALAD S L + +N
Sbjct: 10 APAWGALVISLEHRFYGLSIPAG----GLEMAQLR-FLSSRLALADVVSARLALSRLFNI 64
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ +P I G SY G LA W RLK+PH++ S+ASSAP+ D ++ Y+ VVS+
Sbjct: 65 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD---FSEYNDVVSRSLM 121
Query: 234 DT----SEECFQTIKKSWSEID 251
T S EC + +++E++
Sbjct: 122 STAIGGSLECRAAVSVAFAEVE 143
>gi|157119583|ref|XP_001659435.1| lysosomal pro-X carboxypeptidase, putative [Aedes aegypti]
gi|108875272|gb|EAT39497.1| AAEL008708-PA [Aedes aegypti]
Length = 467
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 65/227 (28%), Positives = 103/227 (45%), Gaps = 13/227 (5%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+ +DHF+ P + TF RY N + PI +G+ PI+ G Y+ A
Sbjct: 25 FETKVDHFN--PRNVDTFSMRYYSNDEH-SYPKGPIFVIVGSNGPIETRYLSEGLFYDVA 81
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NA 174
+ A + EHRY+G S+P + + + QALAD A+ + HIK +
Sbjct: 82 YLEGAFLFANEHRYFGHSLPVDD-----ASTNNLDFLTIDQALADLAAFVHHIKHEVVRN 136
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
A VI +G YGG LATWF ++PH+ G SS + D+T + +F
Sbjct: 137 PEAKVILMGYGYGGSLATWFHQQFPHLTNGVWVSSGTVEADFDLTGYMESLGETIGEF-- 194
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
C+ TI + N+ + D +L+++F C L +T ++ D
Sbjct: 195 GGRGCYGTIFSGFRVAQNLIAM-DRADVLNEQFNLCEAL-DTDDVMD 239
>gi|392591874|gb|EIW81201.1| peptidase S28 [Coniophora puteana RWD-64-598 SS2]
Length = 570
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/215 (30%), Positives = 104/215 (48%), Gaps = 22/215 (10%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG-GGNSPILAFLGAEAPIDDNIQL-- 107
F ++NQ +DHFS TF QRY +N++ + G P++ G E +D +
Sbjct: 86 FPDQWFNQPVDHFS---NDSATFAQRYWVNARHYTPGAGGPVIVLDGGETSGEDRLPFLD 142
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
+G + V+LEHRYYGKS+P + + N R + N+AQA AD A+ +
Sbjct: 143 TGIVEILTRVTGGVGVVLEHRYYGKSIPVPN--FSTDNLR---WLNNAQAAADSANFMAT 197
Query: 168 IK-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
+K + A P I G SY G A R+ YP +V G++ASSA + + ++ W
Sbjct: 198 VKFDGIDEDLTAPEVPWIYYGGSYAGARAAHMRVMYPELVFGAIASSA--VTHASLSNW- 254
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKP 257
Y ++ T C + ++ S ID++ S P
Sbjct: 255 EYMEIIRTAADPT---CSRHLENSIRTIDHLLSMP 286
>gi|170045808|ref|XP_001850486.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868714|gb|EDS32097.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 501
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 100/428 (23%), Positives = 170/428 (39%), Gaps = 77/428 (17%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ +DHF+ P++ TF Y N +F+ G PI F+G ++ G +
Sbjct: 57 LFRTRVDHFN--PQNRDTFELAYYSNDEFYRPG-GPIFIFVGGNWAVNPYFIERGHFPDI 113
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYN 173
A+ A + EHRYYG S P + N R + QA+ D A ++ H++ +
Sbjct: 114 AYMEGAWMFTNEHRYYGTSFPV--EDLSTPNLR---FLTVEQAMVDLAELIYHLRHNVVR 168
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
+A V+ +G YGG +ATW R +YPH+V GS SS + + + +D
Sbjct: 169 DDNARVVLLGMGYGGAIATWMRQRYPHLVDGSWVSSGQVEARFNFKEHAVEVGELIRDHG 228
Query: 234 DTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK------DGLDTVY 287
D +EC+ I +++ + + I++ F+TC + L + + +
Sbjct: 229 D--DECYSRIWRAFRTAEALMDA-GRTEIVTDMFRTCDAVDEENMLDVETFFYNVKEVIQ 285
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCGD--------------------DILCKIAA-GV 326
+E Y + R+C + ++ D D + A V
Sbjct: 286 AEILLYQNVES--TTRLCETLNDSDESTDLQTLASWVNATFSYFECLPFDFESTVEAHSV 343
Query: 327 VEADSLE--YDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLT 384
++ DS+E Y G R Y Q C+E + D + QP + +T
Sbjct: 344 LDIDSIENRYLGLRQRVY----------------QFCTEFGWFLTADSDD--QPFGYRVT 385
Query: 385 KY--IKNCKEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSR 435
Y + CK YG V+ ++GG D ++ +N++F NG DP R
Sbjct: 386 MYFFLNFCKATYGDWVTAEVVADGVHLTNMHFGGQDPRI------ANVLFINGGLDPV-R 438
Query: 436 GGWVKTYH 443
+ YH
Sbjct: 439 DISITEYH 446
>gi|432101950|gb|ELK29783.1| Thymus-specific serine protease [Myotis davidii]
Length = 399
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/205 (27%), Positives = 94/205 (45%), Gaps = 10/205 (4%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY I+ + W + P+ LG E + + G
Sbjct: 59 WLEQPLDPFNA--SDRRSFLQRYWISDQHWASRDGPVFLHLGGEGSLGPGSVMRGHPAAL 116
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
A AL++ LEHR+YG S+P G N + +S ALAD S L + N
Sbjct: 117 APALGALVIGLEHRFYGLSIPAGG-----LNTTQLRFLSSRHALADVVSARLELSRLLNV 171
Query: 175 THAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW--TTYHSVVSKD 231
+ + + G SY G LA W RLK+PH++ ++ASSAP+ D + + HS+ +
Sbjct: 172 SSSSRWVCFGGSYAGSLAAWARLKFPHLLFAAVASSAPVRAVLDFSAYNEVVTHSLGQRC 231
Query: 232 FRDTSEECFQTIKKSWSEIDNIASK 256
+ E ++ + + + +
Sbjct: 232 LSFSRAETLAQLRVTEPPVSGVGDR 256
>gi|134077081|emb|CAK45422.1| endoprotease Endo-Pro-Aspergillus niger
gi|350629684|gb|EHA18057.1| S28 peptidase [Aspergillus niger ATCC 1015]
Length = 526
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 129/530 (24%), Positives = 204/530 (38%), Gaps = 95/530 (17%)
Query: 30 RPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS 89
RPRL SR K + E +F Q +DH + PE TF QRY ++++WGG S
Sbjct: 24 RPRLVPKPISRPASSKSAATTGEAYF-EQLLDH--HNPEK-GTFSQRYWWSTEYWGGPGS 79
Query: 90 PILAFLGAEAPIDD------NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAAL 143
P++ F E D N L+G A + + ++++EHRY+G S P+ +
Sbjct: 80 PVVLFNPGEVSADGYEGYLTNDTLTGVY---AQEIQGAVILIEHRYWGDSSPY-----EV 131
Query: 144 KNARHRGYFNSAQALADYASILLHIKDKY------NATHAPVIAIGASYGGELATWFRLK 197
NA Y Q++ D +K ++ NA +AP + +G SY G L W
Sbjct: 132 LNAETLQYLTLDQSILDMTYFAETVKLQFDNSSRSNAQNAPWVMVGGSYSGALTAWTESI 191
Query: 198 YPHVVIGSLASSAPILYYGDITPWTTYHSV-------VSKDFRDTSEECFQTIK----KS 246
P A+SAP+ D W ++ + SKD +E + K K
Sbjct: 192 APGTFWAYHATSAPVEAIYDF--WQYFYPIQQGMAQNCSKDVSLVAEYVDKIGKNGTAKE 249
Query: 247 WSEI-------------DNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQY 293
E+ D A P+G L + T + + D ++ V + AA
Sbjct: 250 QQELKELFGLGAVEHYDDFAAVLPNG-PYLWQDNDFVTGYSSFFQFCDAVEGVEAGAAVT 308
Query: 294 DTPSNIPVKRICNAIENAPNCGDDILCKIAAG----VVEADSLEYDGNNSRCYINEDRT- 348
P + +++ N N IL A E +D N+ I D +
Sbjct: 309 PGPEGVGLEKALANYANWFN--STILPNYCASYGYWTDEWSVACFDSYNASSPIFTDTSV 366
Query: 349 GDESDEGWEWQSCSE--------------MVVP------MGKDKNSMYQPEPWNLTKYIK 388
G+ D WEW C+E +VP + + +Y PE + Y
Sbjct: 367 GNPVDRQWEWFLCNEPFFWWQDGAPEGTSTIVPRLVSASYWQRQCPLYFPE---VNGYTY 423
Query: 389 NCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS 448
+ + SW GG D + R+T+ +I++NG DP+ G T+
Sbjct: 424 GSAKGKNSATVNSWT----GGWD----MTRNTTRLIWTNGQYDPWRDSGVSSTFRPGGPL 475
Query: 449 FS------QDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
S Q + G HC DL + + + EVK ++ W+ +YY
Sbjct: 476 VSTANEPVQIIPGGFHCSDLYMEDYYANEGVRKVVDNEVKQIKEWVEEYY 525
>gi|123463102|ref|XP_001316920.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121899640|gb|EAY04697.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 491
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 96/207 (46%), Gaps = 13/207 (6%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
+T Y Q +DHF+ P F QRY +NS + + IL +LG +D N G
Sbjct: 14 QTLSYTQMVDHFARKPTY---FTQRYFVNSDYANKSRNIIL-YLGGANELDPNEITPGPI 69
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
E A Q K++I+ LEHRY+GKSVP + + Y + QA+ D S +L K +
Sbjct: 70 LEIASQTKSVIIGLEHRYFGKSVP-----TVNMSQFNMQYCSVPQAILDIKSFVLQGKIR 124
Query: 172 YNATHAP----VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+ P +G YGG LATW + +G+ ASSAP++ T + +
Sbjct: 125 NDYCTEPDFCKFFLMGKGYGGGLATWASTGFKRFYLGAWASSAPLVSINTFTQYDQKEAY 184
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIA 254
+ + C++ + ++ I+ +
Sbjct: 185 FLGNITIEATNCYKVMHDVYNTIETVV 211
>gi|194744687|ref|XP_001954824.1| GF16548 [Drosophila ananassae]
gi|190627861|gb|EDV43385.1| GF16548 [Drosophila ananassae]
Length = 390
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 65/253 (25%), Positives = 119/253 (47%), Gaps = 15/253 (5%)
Query: 8 FQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGP 67
F+ L ++ + + + A+ + P++ R+ + ++ + + + NQ +D+F
Sbjct: 5 FEALAILAVLSAPTVGASFKEPMPKVNRLPKEPMITRATVHERW----INQKLDNFD--E 58
Query: 68 ESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEH 127
++ T+ R IN + + G SPI +LG E+ + SG + A Q IV EH
Sbjct: 59 DNNATWSNRIFINEQDFVDG-SPIFIYLGGESEQLPSRISSGLWVDIAKQHNGTIVATEH 117
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYG 187
R+YGKS P Y + QALAD +++ +K++ + V+ G SY
Sbjct: 118 RFYGKSTPI----TPYSTENLEKYQSINQALADVINVIQTLKEEDKYKDSKVVIHGCSYS 173
Query: 188 GELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE-CFQTIKKS 246
+A W R YP +++GS ASSAP++ + Y V+ + FR + C+ I +
Sbjct: 174 ATMAAWIRKLYPDIIVGSWASSAPLVAK---VEFKEYFKVIGESFRILGGQYCYDLIDNA 230
Query: 247 WSEIDNIASKPDG 259
+ +N+ + G
Sbjct: 231 TNYYENLFANGKG 243
>gi|198455513|ref|XP_001360029.2| GA11106, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198133277|gb|EAL29181.2| GA11106, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 482
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 134/332 (40%), Gaps = 42/332 (12%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E + Q +D+F + QT+ RY+ N K+ PI F+G E I +G T
Sbjct: 49 EELWLEQKVDNFD--ALNNQTWKMRYLRNGKY-HRNQGPIFIFVGGEWSISPGFLSTGLT 105
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-D 170
++ A + ++ EHRYYG+S+P G + +H + Q+LAD A + K +
Sbjct: 106 HDMAVENSGMLFYTEHRYYGQSLPHGKESFRVDKLQHLSIY---QSLADLAHFIRFQKSE 162
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ VI +G SY G + W YP ++ S ASSAP+L D + Y V S
Sbjct: 163 NPRMKQSEVILVGGSYSGSMVAWMTQLYPDLIAASWASSAPLLAKAD---FHEYMEVASN 219
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
R + + C I+K + + + + + + L +K C E D LD
Sbjct: 220 SIRLSYGQNCTTRIQKGFQHLTKLFEE-NQIPELLQKLNGC----EDYEPNDPLD----R 270
Query: 290 AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTG 349
AA ++ N + + P LC + D + ++G + R+
Sbjct: 271 AAFFNGLGNYFALIVQSYSSYIPQ-----LCDTLMSLNSNDEVAFEGFLELLFAEGRRST 325
Query: 350 DESDEG-----------------WEWQSCSEM 364
D D G W +Q+C+E
Sbjct: 326 DCQDFGYGAMLQLFSDPISGIRAWFYQTCNEF 357
>gi|195158030|ref|XP_002019897.1| GL12651 [Drosophila persimilis]
gi|194116488|gb|EDW38531.1| GL12651 [Drosophila persimilis]
Length = 482
Score = 85.5 bits (210), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 82/332 (24%), Positives = 134/332 (40%), Gaps = 42/332 (12%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E + Q +D+F + QT+ RY+ N K+ PI F+G E I +G T
Sbjct: 49 EELWLEQKVDNFD--ALNNQTWKMRYLRNGKY-HRNQGPIFIFVGGEWSISPGFLSTGLT 105
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-D 170
++ A + ++ EHRYYG+S+P G + +H + Q+LAD A + K +
Sbjct: 106 HDMAVENSGMLFYTEHRYYGQSLPHGKESFRVDKLQHLSIY---QSLADLAHFIRFQKSE 162
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+ VI +G SY G + W YP ++ S ASSAP+L D + Y V S
Sbjct: 163 NPRMKQSEVILVGGSYSGSMVAWMTQLYPDLIAASWASSAPLLAKAD---FHEYMEVASN 219
Query: 231 DFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE 289
R + + C I+K + + + + + + L +K C E D LD
Sbjct: 220 SIRLSYGQNCTTRIQKGFQHLTKLFEE-NQIPELLQKLNGC----EDYEPNDPLD----R 270
Query: 290 AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTG 349
AA ++ N + + P LC + D + ++G + R+
Sbjct: 271 AAFFNGLGNYFALIVQSYSSYIPQ-----LCDTLMSLNSNDEVAFEGFLELLFAEGRRST 325
Query: 350 DESDEG-----------------WEWQSCSEM 364
D D G W +Q+C+E
Sbjct: 326 DCQDFGYGAMLQLFSDPISGIRAWFYQTCNEF 357
>gi|336370214|gb|EGN98555.1| hypothetical protein SERLA73DRAFT_183618 [Serpula lacrymans var.
lacrymans S7.3]
gi|336382973|gb|EGO24123.1| hypothetical protein SERLADRAFT_470898 [Serpula lacrymans var.
lacrymans S7.9]
Length = 555
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 107/224 (47%), Gaps = 25/224 (11%)
Query: 45 KDSNHGFETF---FYNQTIDHFSYGPESYQTFPQRYVINSKFWG-GGNSPILAFLGAEAP 100
+ S FE F ++ Q +DHFS + +TF QRY IN++ + G P++ G E
Sbjct: 65 QHSTEKFEEFPEQYFRQPLDHFS---NTSETFGQRYWINTRHYTPGAGGPVIVLDGGETS 121
Query: 101 IDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL 158
+D I +G A + V+LEHRYYG S+P + + N R + N+ Q+
Sbjct: 122 GEDRIPFLDTGIVEILARATGGVGVVLEHRYYGSSIPVSN--FSTDNLR---WLNNEQSA 176
Query: 159 ADYASILLHIK-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL 213
AD A+ + ++K + A + P I G SY G A R+ YP +V G++ASS +
Sbjct: 177 ADSANFMANVKFPGIEEDLTAPNTPWIYYGGSYAGARAAHMRVLYPELVYGAIASSG--V 234
Query: 214 YYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKP 257
+ + W Y ++ K T C ++KS ID + P
Sbjct: 235 THAQLAMW-EYSEIIRKAADPT---CAGHLEKSIETIDFLLGVP 274
>gi|402220386|gb|EJU00458.1| peptidase S28 [Dacryopinax sp. DJM-731 SS1]
Length = 504
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 73/242 (30%), Positives = 117/242 (48%), Gaps = 23/242 (9%)
Query: 42 LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPI 101
L++ F + Q +DHFS + QRY +N++++ G P+ F E P
Sbjct: 27 LDEDTDKPFFPPHTFLQPLDHFS---SQSPQWAQRYWLNARYYTPGG-PVFLFDTGEGPG 82
Query: 102 DDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALA 159
+D + +G A + + V+LEHRYYG+S+P + L R + N+AQA A
Sbjct: 83 EDRFGVLDTGIVAILARETGGMAVVLEHRYYGQSMPVSN----LSTDSLR-FLNNAQAAA 137
Query: 160 DYASIL--LH---IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILY 214
D A+ + +H + + +A + P I G SYGG A R+ YP +V G++ASSA +
Sbjct: 138 DSANFMRSVHFPGVDEDVSALNRPWIYYGGSYGGARAAHMRVLYPELVWGAIASSA--VT 195
Query: 215 YGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK 274
+I + + + D+ S C ++ S S ID S P LA K + +PL
Sbjct: 196 NAEINNYEYFEIIA--DY--ASPHCISALRASISLIDTHLSSP-FLAPFMKSLFSLSPLS 250
Query: 275 NT 276
N+
Sbjct: 251 NS 252
>gi|340387216|ref|XP_003392103.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like, partial
[Amphimedon queenslandica]
Length = 159
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/134 (35%), Positives = 77/134 (57%), Gaps = 8/134 (5%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGF 110
+ T +Y Q +D+F+ + +TF QR + ++W N +L + G EAPID+ +GF
Sbjct: 34 YTTSYYTQQLDNFNSNDK--RTFNQRILTAKQYWK--NDVLLFYPGNEAPIDEFYNNTGF 89
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
+E A +F+AL+V EHRYYG ++PFG + + Y + QA+AD++ ++ I+D
Sbjct: 90 LFELAERFQALVVFAEHRYYGDTLPFGPQDTF--TPANMAYLSVGQAMADFSRLVQDIRD 147
Query: 171 KYNATHAPVIAIGA 184
K N VI IG
Sbjct: 148 KMNIKK--VIVIGG 159
>gi|17550650|ref|NP_508903.1| Protein C26B9.5 [Caenorhabditis elegans]
gi|351058287|emb|CCD65711.1| Protein C26B9.5 [Caenorhabditis elegans]
Length = 516
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 106/464 (22%), Positives = 186/464 (40%), Gaps = 68/464 (14%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN-IQLSGFTY-ENA 115
Q +D+F + + QRY N F N L G EAP D I + Y + A
Sbjct: 63 QKVDNFDANNNA--MYNQRYWYNPTFTQNKNIVFLMIQG-EAPATDTWISNPNYQYLQWA 119
Query: 116 HQFKALIVILEHRYYGKSVPF-GSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
+F A + LEHR +G+S P+ + +K QALAD + + + ++N
Sbjct: 120 KEFGADVFQLEHRCFGQSRPYPDTSMPGIKVCT------MTQALADIHNFIQQMNRRFNF 173
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
+ I G SY G L+ FR +YP +G++ASSAP+ + D + Y VV +
Sbjct: 174 QNPKWITFGGSYPGTLSALFRQQYPADTVGAVASSAPLDWTLD---FFEYAMVVEDVLKK 230
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSE----- 289
TS +C++ + +++ + ++ G+ L+ F + + +D ++
Sbjct: 231 TSVDCWRNVNQAFLNMQQLSLTKAGIQQLNTYFNLVPAFVDGQYTQHDIDNFFANVYSFF 290
Query: 290 --AAQYD-------TPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDG---- 336
QY T + + +++CN + +A D++ ++ + + + D
Sbjct: 291 QGVVQYTYDGRNNATLNGLNAQQLCNKMNDA--TVPDVITRVNNTINWINQMNGDPVGPF 348
Query: 337 -----------NNSRCYINEDRTGD-ESDEGWEWQSCSEMVVPMGKD--KNSMYQPEPWN 382
N+ N GD ++ GW W C+E+ D +N Q P
Sbjct: 349 QNSYSDMMTVLANASYDDNSAVPGDIAANRGWMWLCCNELGALQTTDQGRNIFQQTVP-- 406
Query: 383 LTKYIKNCKEQYGVSPRPSWV-------LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSR 435
L +I C + +G +V L Y G D +N++ NG DP+
Sbjct: 407 LGYFIDMCTDMFGADIGIKYVRDNNKQTLYKYKGAD-----NYQATNVVLPNGAFDPWHV 461
Query: 436 GGWVKTYHFFDLSFSQDLNL--GSHCLDLDEAKKSDPDWLVQQR 477
G TY+ + L + +HC D+ +P L + R
Sbjct: 462 LG---TYNNDTANHMTPLLIQGAAHCSDMYPTYPGEPTDLAKNR 502
>gi|344292456|ref|XP_003417943.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 440
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 82/320 (25%), Positives = 129/320 (40%), Gaps = 50/320 (15%)
Query: 151 YFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
Y +S QALAD A+ I +K T +A G SYGG LA W R+K+P + ++ SSA
Sbjct: 125 YLSSRQALADIANFRTQIAEKMGLTRNKWVAFGCSYGGSLAVWSRIKHPDLFAAAVGSSA 184
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
PI I + Y VV + + +CFQ +K+++ ++ + P + L F
Sbjct: 185 PI---QAIANFYEYLEVVQRSLATHNSKCFQAVKEAFDQVVKMLKLPKYYSKLEDDF--- 238
Query: 271 TPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGV---- 326
+ + E+K Y + T + P+K I N + D LC + A
Sbjct: 239 --MYSIFEVKK----CYCSLKNFSTQTFSPLKMIINKNGEQLSFSMDELCDMMANTSLGS 292
Query: 327 -----VEADSLEYDGNNSRCYINEDRTGDES-------------DEGWEWQSCSEMVVPM 368
+ L Y+ C+ R E + +QSC+E
Sbjct: 293 PYYRYIRIIQLLYNHEYLHCFPAHYRKKLEVYLDSYINHQNPAIGRQFFYQSCTEFGFFQ 352
Query: 369 GKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT-------YYGGHDIKLILRRST 420
D KN + P L+ +++ C + +G + T YYGG ++ +
Sbjct: 353 TTDSKNLTFTGLP--LSYFVEQCADFFGPEFNYDSLHTGVMSTNAYYGGFNV------TG 404
Query: 421 SNIIFSNGMRDPFSRGGWVK 440
S IIF NG DP+ G K
Sbjct: 405 SKIIFPNGSFDPWHPLGITK 424
>gi|395324539|gb|EJF56977.1| hypothetical protein DICSQDRAFT_174353 [Dichomitus squalens
LYAD-421 SS1]
Length = 467
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 109/458 (23%), Positives = 182/458 (39%), Gaps = 69/458 (15%)
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPI---DDNIQLSGFTYENAHQFKALIVILEH 127
QTF Q+Y++N+ ++ G PIL + EA D + L+ + A + L LEH
Sbjct: 10 QTFKQQYILNATYFKEGG-PILFYQSNEATTITCPDTLILADW----AKEIGGLTATLEH 64
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAPVIAIGASY 186
RY+G+S+PFG+ +N + Y + D + + IK A+++ I +G SY
Sbjct: 65 RYFGQSLPFGNDSYTQENFK---YLTLENVMQDAVNFIDFIKSNVTGASNSKAIVVGRSY 121
Query: 187 GGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKS 246
GG L+ FR YP V G+ A S P +GD T V + + S F IK++
Sbjct: 122 GGTLSAIFRQNYPDVFYGAWAVSGPFYAFGDS---TEIGQEVQQTYLRQSYTAFSRIKQA 178
Query: 247 WSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICN 306
+S + ++ + D L+K+ C N T++ + Y YD I
Sbjct: 179 FSNVKSLVASGDE-PTLAKELSLCQ-APNVTDVAGAVTFNYWLVGAYD---------ILT 227
Query: 307 AIENAP----NCGDDILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCS 362
P N +L + + A S N + + G ++ + C
Sbjct: 228 QFSFMPSYFHNVSGPVLPVVINDTLSAPSPLAALNQTLWH----AYGGDAVAVPSGKPCL 283
Query: 363 EMVVPMGKDKNSMYQPEPWNLTKYI--KNCKEQYGV----SPRPS------------WVL 404
+ + + N P W ++ N + G+ +P P W +
Sbjct: 284 DHTIALPSSINIAAVPFSWVRCNWVPLNNALDPRGIWQIGAPLPPSASDPSAGCTALWNV 343
Query: 405 TYYGGHDIKLILR------RSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNL--- 455
T G IK R R+++ +IFS G DP + Q + +
Sbjct: 344 TTPPGAAIKAKYRIADADLRASTRVIFSVGELDPTTSVAQNGLGDAVGTDPHQAVRVFVA 403
Query: 456 -GSHCLDLDEAKKSDP--DW--LVQQRKTEVKIMQGWI 488
G H DL++ DP DW +V R E+ +++GW+
Sbjct: 404 GGGHGQDLEQ---YDPGADWQSVVDARNIELNVIKGWL 438
>gi|392572044|gb|EIW65216.1| peptidase S28 [Trametes versicolor FP-101664 SS1]
Length = 528
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 73/278 (26%), Positives = 131/278 (47%), Gaps = 28/278 (10%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRI--RRSRILEQKDSNHG--FETFFYNQTIDHFSYG 66
L +++ ++ + A L P+ ++ RR++ K + F F + Q +DHF
Sbjct: 14 LAVLLFQLAGTSEARHLPANPQWTKLAARRAQNSPAKRAADATTFPVFNFTQPLDHFV-- 71
Query: 67 PESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVI 124
++ TF QRY ++ + + G P++ F E P D+ + + +G A+ L ++
Sbjct: 72 -DTGFTFQQRYWLSDRHYKPGG-PVIVFEAGEGPGDERMPILDTGILNILANATDGLAIV 129
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH-----IKDKYNATHAPV 179
LEHRYYG+SVP + + R + N+ QA AD A+ + + I A P
Sbjct: 130 LEHRYYGESVPV--QNFTTDSLR---WLNNEQAAADSANFIDNVTFPGIPGDLTAPGTPW 184
Query: 180 IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEEC 239
I G SYGG A R+ YP +V G++ASS + + + W Y ++ + C
Sbjct: 185 IYYGGSYGGARAAHMRVLYPDLVFGAIASSG--VVHATLDDW-RYFDIIRQ---SAPAAC 238
Query: 240 FQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT 277
++++ E+D + + P+ L+ K+ L+N T
Sbjct: 239 ITQVERTIDEVDRLITSPNAKTRLA--IKSVFGLQNVT 274
>gi|156058698|ref|XP_001595272.1| hypothetical protein SS1G_03361 [Sclerotinia sclerotiorum 1980]
gi|154701148|gb|EDO00887.1| hypothetical protein SS1G_03361 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 535
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 118/502 (23%), Positives = 196/502 (39%), Gaps = 90/502 (17%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD------NIQLS 108
F+ Q +DH + S TF Q++ NS+ W G SPI+ F E + N+ ++
Sbjct: 53 FFTQLLDHEN---PSKGTFQQKFWWNSENWAGPGSPIVFFTPGEIAAAEYGAYLTNVTVT 109
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
G A + K +V++EHRY+G+S P+ + + Y N QA+AD+ +
Sbjct: 110 GLF---AQEVKGAVVMVEHRYWGESSPYDNLTTT-----NLQYLNLKQAIADFVHFAKTV 161
Query: 169 KDKY------NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
+ NA AP I G SY G LA W P ASSAP+ + W
Sbjct: 162 DLPFDTNHSSNAAAAPWILSGGSYSGALAAWTESTSPGTFWAYHASSAPVQAINNY--WQ 219
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG 282
++ V ++ C + I +DN+ + + A+ + KT L++ T D
Sbjct: 220 YFYPVQD----GMAKNCSKDISLVIDYMDNVLTHGNKSAVTA--LKTKFGLESVTHNDDF 273
Query: 283 LDTVYSEAAQYDTPSNIP----VKRICNAIENAPNCGDDILCKIAAGVVEADSLE----- 333
+ + S + + S + C+AIEN + A GV +LE
Sbjct: 274 MAVLESGPWLWQSNSFTTGYSGFFQFCDAIENV--TAGAAVTPDANGVGLTTALEGFAKW 331
Query: 334 -------------YDGNNSRCYINED---------RTGDESDEGWEWQSCSEMVVPMGKD 371
YD ++ C D G+ +D W W C+E P G
Sbjct: 332 TKSLIPGICEDYGYDADDLSCLNTYDFNNFMFRDYSVGNAADRQWNWMLCNE---PFGYW 388
Query: 372 KNSMYQPEPWNLTKYI--KNCKEQYGVSPRPSWVLTYYG--GHDIKLILR-------RST 420
++ +P +++ I K + Q + TY G +K + + +T
Sbjct: 389 QDGAPSNKPTLVSRLINAKYWQRQCALYFPAEGKYTYASAKGATVKQVNQYTQGWNLENT 448
Query: 421 SNIIFSNGMRDP---------FSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPD 471
+ +I++NG DP F GG +++ L Q + G HC DL +
Sbjct: 449 TRLIWTNGQYDPWRTSGVSSQFRPGGELQSTAQHPL---QIIPGGFHCSDLRLSNGKANA 505
Query: 472 WLVQQRKTEVKIMQGWITQYYD 493
+ + EV + W +YY+
Sbjct: 506 GVQKVIDNEVAQIVAWTAEYYN 527
>gi|395326580|gb|EJF58988.1| hypothetical protein DICSQDRAFT_128566 [Dichomitus squalens
LYAD-421 SS1]
Length = 490
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 74/256 (28%), Positives = 114/256 (44%), Gaps = 26/256 (10%)
Query: 39 SRILEQKDSNHG---------FETFFYNQTIDHFSYGPESYQTFPQRYVIN-SKFWGGGN 88
S IL + S+ G +T +++Q IDH S S QTF QRY I+ S F GG
Sbjct: 24 SHILAARQSSAGDPGGAVQCPADTQWFDQPIDHAST--NSSQTFKQRYQIDTSNFKEGG- 80
Query: 89 SPILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNAR 147
PIL + EA D +S + + A + ++ LEHRY+G+S+PFG+ L N +
Sbjct: 81 -PILFYQSPEA--TDIACISELLFMDWAKELGGIVATLEHRYFGQSLPFGNNSYTLDNLK 137
Query: 148 HRGYFNSAQALADYASILLHIKDKYN-ATHAPVIAIGASYGGELATWFRLKYPHVVIGSL 206
F + D L +K A + I G SYGG LA FR YP G+
Sbjct: 138 P---FTLDNVMQDAVHFLDFVKKNVTGAAKSKTIVAGGSYGGFLAPVFRQNYPDTFFGAW 194
Query: 207 ASSAPILYYGDI----TPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAI 262
+ P G + +++ V + S E F I+ ++++ + A
Sbjct: 195 GIAGPFRSLGTVDEVGAELHNWYNYVQSTYAHRSLEAFDRIRNGFAQVKQLIDTGHN-AT 253
Query: 263 LSKKFKTCTPLKNTTE 278
L+K+ C P N+++
Sbjct: 254 LTKELSLCHPPSNSSD 269
>gi|313212780|emb|CBY36704.1| unnamed protein product [Oikopleura dioica]
Length = 395
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 84/349 (24%), Positives = 151/349 (43%), Gaps = 51/349 (14%)
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
Y P+ + A+ G + L YP++V G+LA+S+P+ + + + ++D
Sbjct: 65 YTGNEGPIDSFAANTG----KFNELTYPNLVNGALAASSPVYWISAMGDSHGFWVKTTED 120
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL------------KNTTEL 279
F ++C TI+ ++ +D + + D A ++K +TC + +N
Sbjct: 121 FSTALDKCEDTIRAGFAALDKMKNDKD-WAGITKTMRTCQNITEDNYMHMLGWARNAMAT 179
Query: 280 KDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNN- 338
+D Y + P N PVK C + G D + EA L Y+G +
Sbjct: 180 MAMMDYPYPTNFEAALPGN-PVKASC--VRAMAETGAD-------SIREAAGLVYNGTDP 229
Query: 339 ---SRCY-INED----------RTGDESDEGWEWQSCSEMVVPMGKD-KNSMYQPEPWNL 383
+C+ I E+ TG E+ W++Q C++ V+P G D K M+ +++
Sbjct: 230 SKYKQCFDIMEEYVYCSDPTGCGTGPEA-LAWDYQCCTQQVLPGGTDGKTDMFPVIKFDV 288
Query: 384 TKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
C + +GV P W+ Y +++ +TSN IFSNG DP+ GG
Sbjct: 289 DDRAAYCNKTWGVVPDRDWLRIKYWADNLE-----ATSNTIFSNGDLDPWGPGGVTHDLR 343
Query: 444 FFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYY 492
DL + ++ G+H DL + D ++ R+ ++ W+ Q+Y
Sbjct: 344 -HDLP-APLVHGGAHHYDLRGSNSGDTQDVLNVRQFHRDTIRDWMAQFY 390
>gi|302795496|ref|XP_002979511.1| hypothetical protein SELMODRAFT_419277 [Selaginella moellendorffii]
gi|300152759|gb|EFJ19400.1| hypothetical protein SELMODRAFT_419277 [Selaginella moellendorffii]
Length = 905
Score = 84.0 bits (206), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/136 (39%), Positives = 73/136 (53%), Gaps = 21/136 (15%)
Query: 82 KFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRK 140
+FW ++ IL LG + + GF ++ A QFKA +V LEHRYYG S P
Sbjct: 592 RFWRNEDTGILLGLGVK------VGYPGFPSKDLARQFKAGVVTLEHRYYGYSFP----- 640
Query: 141 AALKNARHRGYFNSAQALADYASILLH----IKDKYNATHAPVIAIGASYGGELATWFRL 196
++ Y Q+LAD+A+ + + I K N I IG SY G L+ WFRL
Sbjct: 641 -----SKDFKYLTVEQSLADHAAFIEYYQTFINKKCNKHANKWIVIGGSYSGALSAWFRL 695
Query: 197 KYPHVVIGSLASSAPI 212
KYPH+V+GS ASSA +
Sbjct: 696 KYPHLVVGSWASSAVV 711
>gi|407849017|gb|EKG03883.1| serine carboxypeptidase S28, putative [Trypanosoma cruzi]
Length = 483
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 112/504 (22%), Positives = 191/504 (37%), Gaps = 65/504 (12%)
Query: 12 LLMVIFVS--TSFHANGLKLRPRLGRIRRSRIL-----------EQKDSNHGFETFFYNQ 58
LL V F T+ H + + R R G + R+ + +Q+ H +YNQ
Sbjct: 16 LLAVAFSGPVTALHPSVIPSRARHGEVYRAALRRSLKEGSGGQEKQRHDVHNNAARYYNQ 75
Query: 59 TIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQF 118
+DH + TF QR+ ++ W + P + + E + GF E
Sbjct: 76 RVDH---ADATLGTFRQRWWVDRSSWDVNSGPAILLVNGEG-TAHGLPDGGFVGEYGKSV 131
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
KA++ LEHRYYG+S+P A L N Y ALAD + + + K
Sbjct: 132 KAIVFSLEHRYYGESMP-----APLTNRSMLKYLTVENALADLQAFKKYAEKKVVKKKVK 186
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
+ +G SY G L+ W R KYP + +SS G + Y +
Sbjct: 187 WLIVGGSYAGALSAWARAKYPGDFDAAWSSS------GVVNAIFDYEAFDGHLLEVLPSS 240
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKT---CTPLKNTTELKDGLDTVYSEAAQYDT 295
C ++ +++ P+ A + K F T T L DG + A QY
Sbjct: 241 CAAAVRTVFNKFSKAYDNPNRRAKMMKIFGTPNYFTKSDMAWMLADGA----AMAIQYGY 296
Query: 296 PSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSR-CYINEDRTGDES-- 352
++C++IE + L K A +++ L + +R CY + + + S
Sbjct: 297 KD-----KLCSSIEFTE---ERALFKRYAEIMK---LLWGEEFTRSCYYSTECLSNPSYS 345
Query: 353 ---DEG--WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP---SWVL 404
EG W +Q CS++ S+ +P N + ++ C+ +G + P ++
Sbjct: 346 ESWKEGYAWAYQCCSQLAYWQTGFPGSL-RPREVNTSYFMYQCRAAFGEAILPDTYAFNK 404
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDE 464
+ G H + ++ + + DP+ G K + N HC +L
Sbjct: 405 KHGGAHP-------DATRVVATQALDDPWLTAGAKKAIDEDYPVITAQCNGCGHCGELAA 457
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWI 488
D L QR+ ++ W+
Sbjct: 458 TNPLDHPSLKAQRRAVKFYLKQWL 481
>gi|294876612|ref|XP_002767728.1| Lysosomal Pro-X carboxypeptidase, putative [Perkinsus marinus ATCC
50983]
gi|239869546|gb|EER00446.1| Lysosomal Pro-X carboxypeptidase, putative [Perkinsus marinus ATCC
50983]
Length = 300
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 79/285 (27%), Positives = 127/285 (44%), Gaps = 32/285 (11%)
Query: 50 GFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW--GGGNSPILAFLGAEAPIDDNIQL 107
G E + +DHFS + T P Y ++++ + I +G E+P+ ++ +
Sbjct: 20 GQELLTMDVPLDHFSLVAKQ-PTIPLHYWLDTEHYDPAKDQCAIFYIMGGESPLPESGVI 78
Query: 108 SGFTYEN-AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILL 166
F E A + L++ EHR+YG S+P K+ Y + Q+L D+A++L
Sbjct: 79 YPFISERLAREHNGLVIESEHRFYGSSIPQSYEKSL-------PYLSVEQSLMDHATVLR 131
Query: 167 H-IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAP-ILYYGDITPWT-T 223
H ++ NA VIA+G SY G LA FRL+YP +V + ASS+P LY + + +
Sbjct: 132 HTLETVENANRCRVIAVGGSYSGFLALAFRLRYPKLVYAAYASSSPGRLYSQEASRFDGR 191
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSK---KFKTCTPLKNTTELK 280
Y+S V+ C ++ K++ D+ + G + + K C P E
Sbjct: 192 YYSRVTDAADSIRSNCSNSVIKAF---DDFVHRYAGRVTFEQAKNELKICNPEVFGRE-- 246
Query: 281 DGL-----DTVYSE-----AAQYDTPSNIPVKRICNAIENAPNCG 315
DGL V E A Y SN P +C +E + G
Sbjct: 247 DGLFEELVQMVRMEFSGANMASYPPSSNSPTYMLCTMVEQSGIAG 291
>gi|281212417|gb|EFA86577.1| hypothetical protein PPL_00378 [Polysphondylium pallidum PN500]
Length = 518
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 97/380 (25%), Positives = 157/380 (41%), Gaps = 36/380 (9%)
Query: 91 ILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRG 150
+ LG E PI + A++ KAL++ LE RYYGKS+P + +
Sbjct: 15 VFLILGGEGPIVPEMTRRMPFISVANESKALVIALELRYYGKSIPVPDL-----STDNLM 69
Query: 151 YFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
Y ++ Q L D A + ++ T A I +G SY G LA W+R+KYPH+V +++SSA
Sbjct: 70 YLSTDQILEDIAEFQIEFSRQFGLTEAKWIVMGCSYAGTLAAWYRMKYPHMVGAAISSSA 129
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
P+ +T + Y V +C K + I+ + +I K +C
Sbjct: 130 PL---KAVTRFDAYDKKVRAAL---GPKCSSAFKSLFDHIEYELMELKNQSI--KDVFSC 181
Query: 271 TPLKNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNA-IENAPNCGD--DILCKIAAGVV 327
+ L S + QY++ + + IC I+++ N D+ +
Sbjct: 182 NRSIDDRMFLFMLSESLSYSVQYNSKFKL-LANICPLFIKHSNNMSALLDMFIGYIKNMF 240
Query: 328 EADSLEYDGNNSRCYINE--DRTGDESDEGWEWQSCSEMV-VPMGKDKNSMYQPEPWNLT 384
D N Y N D +G W WQ CSE + +KN + N T
Sbjct: 241 LFQGTTCDDYNIFTYANTEIDYSGTRQ---WTWQMCSEYGWFLVASEKNVTLKSSLLNET 297
Query: 385 KYIKN-CKEQYGVSPRP--SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKT 441
+ C+ +G +P + YG +IK + S+++++NG DP+S +
Sbjct: 298 WWQNEVCRILFGRPMKPFVEKINLLYGPDNIKQL-----SSVLYTNGDLDPWST---LSV 349
Query: 442 YHFFDLSFSQDLNLG--SHC 459
D S LN+ SHC
Sbjct: 350 STSCDAPISNILNIAGESHC 369
>gi|440636097|gb|ELR06016.1| hypothetical protein GMDG_07727 [Geomyces destructans 20631-21]
Length = 546
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 114/494 (23%), Positives = 188/494 (38%), Gaps = 72/494 (14%)
Query: 54 FFYNQTIDH-FSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD------NIQ 106
F + Q +DH +G TF Q NS+ WGG SPI+ F E D+ N
Sbjct: 47 FVFTQLLDHDLPHG----DTFGQHVWWNSEHWGGPGSPIILFTPGETAADEYEGYLTNAT 102
Query: 107 LSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAA-LKNARHRGYFNSAQALADYASIL 165
L+G + A + +V++EHRY+G+S P+ LK R +A A +
Sbjct: 103 LTG---KFAQEVNGAVVMVEHRYWGESSPYADLTGHNLKQLTLRNSIADFVRIAATAQLP 159
Query: 166 LHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
K +A HAP I +G SY G L+ W P +SSAP+ D W +
Sbjct: 160 FDPSHKSDAAHAPWIMMGGSYAGSLSAWTESVSPGTFWAYHSSSAPVEAIDDY--WQYFV 217
Query: 226 SVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDT 285
V R+ C + K+ ID + +K ++ K K + L+
Sbjct: 218 PVEKAMPRN----CSSDVSKAVEYIDKVFAKGSQKEQVALKEKFGLGKLRNDDFSSVLEY 273
Query: 286 VYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLE------------ 333
S+ + P+ C+ +E + AGV A +LE
Sbjct: 274 GPSQEQSNTFYTPQPLTIFCDYVEGTHSNSTGASTNGVAGVGVAKALEGYAQWVKQVYLP 333
Query: 334 ----------------YDGNNSRCYINEDRT-GDESDEGWEWQSCSEMVVPMG------- 369
YD N + D T G+ D W+W C+E P G
Sbjct: 334 ESCQVYGYEDPASIECYDTYNPDNKLFTDHTVGNAIDRQWQWMLCNE---PFGWWQGGAP 390
Query: 370 KDKNSMYQPEPWNLTKYIKNCKEQYGV---SPRPSWVLTY-YGGHDIKLILRRSTSNIIF 425
K+ ++ N + + C+ + SP ++ T H R + +++
Sbjct: 391 KNHKTIVSRNI-NTAYWQRQCELFFPPSQGSPNSAFGRTVDVPNHYTSGWTPRKSKRLLY 449
Query: 426 SNGMRDPFSRGGWVKTYH-FFDLSFSQDLNL-----GSHCLDLDEAKKSDPDWLVQQRKT 479
NG DP+ G + L+ ++++ + G HC DL DP + +
Sbjct: 450 VNGELDPWRTAGVSSEFRPGGPLTSTEEIPVIIIPGGFHCSDLILKNYIDPG-VKKVVDK 508
Query: 480 EVKIMQGWITQYYD 493
E++I++GW+ ++Y+
Sbjct: 509 EIEILKGWVGEWYE 522
>gi|367041275|ref|XP_003651018.1| hypothetical protein THITE_2110971 [Thielavia terrestris NRRL 8126]
gi|346998279|gb|AEO64682.1| hypothetical protein THITE_2110971 [Thielavia terrestris NRRL 8126]
Length = 549
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 139/342 (40%), Gaps = 45/342 (13%)
Query: 12 LLMVIFVSTSF---HANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPE 68
L + V+ +F + + P + + ++ ++ +G+ TF +Q IDH +
Sbjct: 12 LCLASHVAATFTFLRGTAMDIGPDDDTVGAEALHKRAETFNGWGTF--DQLIDHAN---P 66
Query: 69 SYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID-------DNIQLSGFTYENAHQFKAL 121
+ TF QRY ++FW G SPI E N +L+G A+Q
Sbjct: 67 ALGTFKQRYWYGTEFWKGPGSPIYLVTPGEQDGTGFNRTWLSNARLTGVM---ANQTGGA 123
Query: 122 IVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASIL---LHIKDKYNATHAP 178
++ILEHRY+G+S P+ + ++N ++ NS Q L +A K T AP
Sbjct: 124 VIILEHRYWGESSPY--QNLTVQNLKYLTLDNSLQDLVYFAKTFAPPFDTSGKSAPTEAP 181
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
+ +G SY G L+ W KYP +SS + GD W+ + V + T +
Sbjct: 182 WVVVGGSYSGALSAWLAAKYPGTFWAYYSSSGVVEAVGDF--WSYFAPVQAA----TPKN 235
Query: 239 CFQTIKKSWSEIDNIAS--KPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYD-- 294
C + + ID + + P A L KF + E D + S Q+
Sbjct: 236 CSADVNAVINYIDFVLTFGSPQQKADLKNKF-----MLGDLEDADFASAIESGPWQWQSG 290
Query: 295 ------TPSNIPVKRICNAIENA-PNCGDDILCKIAAGVVEA 329
T P R C+ IEN PN + + G+ +A
Sbjct: 291 QFYSITTLGYTPYYRFCDYIENVWPNSTNKVPGPWGVGLSKA 332
>gi|302689763|ref|XP_003034561.1| hypothetical protein SCHCODRAFT_15001 [Schizophyllum commune H4-8]
gi|300108256|gb|EFI99658.1| hypothetical protein SCHCODRAFT_15001 [Schizophyllum commune H4-8]
Length = 555
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 104/214 (48%), Gaps = 21/214 (9%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFW-GGGNSPILAFLGAEAPIDDNIQL-- 107
F ++ Q +DHF+ E+ TF QRY +N + + G N+P++ G E +D +
Sbjct: 74 FPAQWFEQPLDHFNN--ETGDTFGQRYWVNKRHYVPGTNAPVIVLDGGETSGEDRLPFLD 131
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
+G + V+LEHRYYG++ P + L R + + Q+ AD A+ + +
Sbjct: 132 TGIVEILTRATGGVGVVLEHRYYGETKPVQN----LTTDSLR-FLTNEQSAADSANFMAN 186
Query: 168 IK-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
+K + A P I G SY G A R+ YP +V G++ASSA + + DIT W
Sbjct: 187 VKFDGIDEDLTAPGTPWIYYGGSYAGARAAHMRVLYPDLVFGAIASSA--VTHADITNW- 243
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
Y V+ R E C + ++ S ID + ++
Sbjct: 244 QYMEVIR---RSAPEACARHLENSIQTIDAVLAR 274
>gi|170045812|ref|XP_001850488.1| lysosomal pro-X carboxypeptidase [Culex quinquefasciatus]
gi|167868716|gb|EDS32099.1| lysosomal pro-X carboxypeptidase [Culex quinquefasciatus]
Length = 466
Score = 83.2 bits (204), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 64/220 (29%), Positives = 99/220 (45%), Gaps = 12/220 (5%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+ +DHFS P + TF RY N + PI +G+ PI+ G Y+ A
Sbjct: 26 FETRVDHFS--PRNMDTFSMRYYSNDEH-AYAKGPIFVIVGSNGPIETRYLREGLFYDTA 82
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT 175
+ A + EHRY+G S+P A+ +N + QALAD A+ + H++ +
Sbjct: 83 YLEGAYLFANEHRYFGHSLPVD--DASTENL---DFLTVDQALADLAAWIHHLRHEVVGN 137
Query: 176 -HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
A VI +G YGG LATWF ++PH+ G SS D+ S+ +
Sbjct: 138 PQAKVILMGWGYGGSLATWFHTQFPHLSDGVWVSSGN--NNADLNLPEYMESLGNTIGEF 195
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK 274
+C+ TI S+ N+ + D +L++ F C L
Sbjct: 196 GGRDCYSTIFSSFLVAQNLI-ELDRSELLTEMFHLCDDLN 234
>gi|294873112|ref|XP_002766517.1| prolylcarboxypeptidase, putative [Perkinsus marinus ATCC 50983]
gi|239867444|gb|EEQ99234.1| prolylcarboxypeptidase, putative [Perkinsus marinus ATCC 50983]
Length = 133
Score = 83.2 bits (204), Expect = 3e-13, Method: Composition-based stats.
Identities = 46/116 (39%), Positives = 67/116 (57%), Gaps = 9/116 (7%)
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN-ATHAP 178
A +V EHRYYGK+ + Y QALADY+ ++ +I DK++
Sbjct: 16 AFLVFAEHRYYGKTQVYSDGTPDCLR-----YLTIEQALADYSVLIDYIFDKHDLPPSTA 70
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW---TTYHSVVSKD 231
IA G SYGG LA+ FR KYPH+V G++A+SAPI G +TP T ++ ++++D
Sbjct: 71 TIAFGGSYGGMLASAFRYKYPHIVDGAIAASAPIFAIGGVTPEPSKTAFNEIITRD 126
>gi|71663192|ref|XP_818592.1| serine carboxypeptidase S28 [Trypanosoma cruzi strain CL Brener]
gi|70883852|gb|EAN96741.1| serine carboxypeptidase S28, putative [Trypanosoma cruzi]
Length = 483
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 112/504 (22%), Positives = 189/504 (37%), Gaps = 65/504 (12%)
Query: 12 LLMVIFVS--TSFHANGLKLRPRLGRIRRSRILEQ-KDSNHGFETF----------FYNQ 58
LL V F T+ H + + R R G + R+ + K+ + G E +YNQ
Sbjct: 16 LLAVAFSGPVTALHPSVIPSRARHGEVYRAALRRSLKEGSGGQEKLRHDVHNNAARYYNQ 75
Query: 59 TIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQF 118
+DH + TF QR+ ++ W + P + + E + GF E
Sbjct: 76 RVDH---ADVTLGTFRQRWWVDRSSWDANSGPAILLVNGEG-TAPGLPDGGFVGEYGKSV 131
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
KA+I LEHRYYG+S+P A L N Y ALAD + + + K
Sbjct: 132 KAIIFSLEHRYYGESMP-----APLTNRSMLKYLTVENALADLQAFKKYAEKKVVKKKVK 186
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
+ +G SY G L+ W R KYP + +SS G + Y + +
Sbjct: 187 WLIVGGSYAGALSAWARAKYPGDFDAAWSSS------GVVNAIFDYEAFDGHLLKVLPSS 240
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKT---CTPLKNTTELKDGLDTVYSEAAQYDT 295
C ++ + + P+ A + K F T T L DG + A QY
Sbjct: 241 CAAAVRTVFGKFSKAYDNPNRRAKMMKTFGTPNYFTKPDMAWMLADGA----AMAIQYGY 296
Query: 296 PSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSR-CYINEDRTGDES-- 352
++C++IE ++ + A E L + +R CY + + + S
Sbjct: 297 KD-----KLCSSIEFTEE--RELFRRYA----ELMKLLWGEEFTRSCYYSTECLSNPSYS 345
Query: 353 ---DEG--WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP---SWVL 404
EG W +Q CS++ + +P N + ++ C+ +G + P ++
Sbjct: 346 ESWKEGYAWAYQCCSQLAYWQTGFPGGL-RPREVNTSYFMYQCRAAFGEAILPDTYAFNK 404
Query: 405 TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDE 464
+ G H + ++ + + DP+ G K + N HC DL
Sbjct: 405 KHGGAHP-------DATRVVATQALDDPWLTAGVKKALSEDYPVITAQCNGCGHCGDLAA 457
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWI 488
+ L QR+ ++ W+
Sbjct: 458 TNPLNHPSLKAQRRAVKFYLKQWL 481
>gi|401398762|ref|XP_003880396.1| protein F23B2.12, partially confirmed by transcript evidence,
related [Neospora caninum Liverpool]
gi|325114806|emb|CBZ50362.1| protein F23B2.12, partially confirmed by transcript evidence,
related [Neospora caninum Liverpool]
Length = 684
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 111/460 (24%), Positives = 185/460 (40%), Gaps = 75/460 (16%)
Query: 90 PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHR 149
P+ ++G E P+ G E F A + LEHRYYG S P +++ N +
Sbjct: 224 PVFVYIGGEGPLSSMEVKQGLLAEMGEAFGASVYALEHRYYGDSHP--RPDSSVPNLQ-- 279
Query: 150 GYFNSAQALADYASILLHIKDKYNATHA--------PVIAIGASYGGELATWFRLKYPHV 201
+ S QAL D A+ + H+K + H PVI G SY G LA + R KYP
Sbjct: 280 -WLTSHQALGDLAAFVAHVKREQAEQHPQNVSPEDIPVIVFGCSYPGSLAAYARSKYPAS 338
Query: 202 VIGSLASSAPI----LYYG-DITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
++G+++SS+P+ L+ D T S + + + + I S E +A+K
Sbjct: 339 ILGAISSSSPVEASALFQAFDQTVQRVLPSACTSQIKAATAIVERRIFTSEEEAVKVAAK 398
Query: 257 PDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPSNIP-VKRICNAIENAPNC- 314
A +S K T + L D V +++ QY+ P + +CN AP
Sbjct: 399 FGCGAEVSMK----THDQRVALLYVIADAV-AQSVQYNRNPKRPWIDEVCNCFAEAPPAR 453
Query: 315 ---------------GDDILCKIAAGV--VEADSLEYDGNNSRCYINEDRTGDESDEG-- 355
D++L +A V + A+ +++ + + R G ++
Sbjct: 454 TEKVGNEDTKARRSEEDELLDALAKAVQLMLAELKMTCKDSNMLQLADTRLGPQASASAR 513
Query: 356 -WEWQSCSE----MVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVL------ 404
W WQSC+E V G ++ + +L +++ C + P PS
Sbjct: 514 LWVWQSCAEYGYWQVAYKGSVRSRLI-----DLNWHLRMCDALF---PLPSGSKFSTDVV 565
Query: 405 --TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD-----LSFSQDLNLGS 457
T D ++ + +NI F+NG DP++ + F SF+ + GS
Sbjct: 566 DETNVWSGDKHVVGVGAATNIHFTNGENDPWAPLSVTEISPFVTERQGLSSFT--IKNGS 623
Query: 458 HCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQYYDDFKA 497
HC D + V + K + Q I + +DF+A
Sbjct: 624 HCNDFYAYEDGTEPLPVTEAKARI---QRAIRLWLEDFQA 660
>gi|170092076|ref|XP_001877260.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164648753|gb|EDR12996.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 565
Score = 82.8 bits (203), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 71/253 (28%), Positives = 115/253 (45%), Gaps = 27/253 (10%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG-GGNSPILAFLGAEAPIDDNIQL-- 107
F ++ Q +DHFS + TF QRY +N++ + N+P++ G E +D +
Sbjct: 68 FRPLWFKQPLDHFSTS--NKHTFHQRYWVNTRHYKPSKNAPVIVLDGGETSGEDRLPFLD 125
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKA-ALKNARHRGYFNSAQALADYASILL 166
+G A + V+LEHRYYGKS+P + +L+ + N+AQ+ AD A+ +
Sbjct: 126 TGIVEILARATGGVGVVLEHRYYGKSIPVSNFSTDSLR------WLNNAQSAADSANFMR 179
Query: 167 HIK-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
+ K + A H P I G SY G A R+ YP +V G+++SS + + + W
Sbjct: 180 NFKIDGIDEDLRAPHTPWIYYGGSYAGARAAHMRVLYPDLVYGAISSSG--VTHATLQNW 237
Query: 222 TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNI-ASKPDGLAILSKKFKTCTPLKNTTELK 280
Y V+ +C + S ID I A PD + ++ K L+N
Sbjct: 238 -QYMEVIRT---AADAKCSSNLVNSIERIDAILAHSPD---FIKRQLKGLFGLQNLQNDD 290
Query: 281 DGLDTVYSEAAQY 293
D + S A +
Sbjct: 291 DFASMIESPLAAW 303
>gi|308512235|ref|XP_003118300.1| hypothetical protein CRE_00736 [Caenorhabditis remanei]
gi|308238946|gb|EFO82898.1| hypothetical protein CRE_00736 [Caenorhabditis remanei]
Length = 526
Score = 82.4 bits (202), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 109/454 (24%), Positives = 177/454 (38%), Gaps = 68/454 (14%)
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN-AHQFKALIVILEHRY 129
Q QRY NS F N L G D I + Y A +F A + LEHR
Sbjct: 79 QMLLQRYWYNSNFTQNKNIVFLMIQGESPATDTWISNPNYQYLQWAKEFGADVFQLEHRC 138
Query: 130 YGKSVPF-GSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGG 188
+GKS P+ + +K +QALAD S + + DKYN + I G SY G
Sbjct: 139 FGKSRPYPDTSMPGIKVC------TMSQALADIHSFIGKMNDKYNFRNPKWITFGGSYPG 192
Query: 189 ELATWFRLKYPHVVIGSLASSAPILYYGDITPWT-TYHSVVSKDFRDTS-EECFQTIKKS 246
L+ FR +YP +G++ASSAP+ + D ++ T S+V+ FR + +C ++ +
Sbjct: 193 TLSALFRQQYPQDTVGAVASSAPLDWTLDFFGFSITKKSIVTVFFRHSMLAKCEPSLYQH 252
Query: 247 WSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQY------------- 293
+ + P IL F+ + + +D ++ Y
Sbjct: 253 ATTQSHKNRNPTTEQILF-NFRLTPAFVDGQYTQLDIDNFFANVYSYFQGVIQYTYDGRN 311
Query: 294 -DTPSNIPVKRICNAIE--NAPNCGDDILCKIAAGVVEADSLEYD--GNNSRCYINE--- 345
T + + +C + N P D++ ++ + ++L D G Y +
Sbjct: 312 AATLGGLNAQNLCKKMNDVNTP----DVITRVKNTIDWVNTLNGDPVGGLDNSYSDMIAV 367
Query: 346 ------DRTGD-----ESDEGWEWQSCSEMVVPMGKD--KNSMYQPEPWNLTKYIKNCKE 392
D +GD ++ GW W C+E+ D +N Q P + YI C
Sbjct: 368 LANATYDDSGDIGMFSAANRGWMWLCCNELGALQTTDQGRNIFQQTVP--MGYYIDMCTA 425
Query: 393 QYGVSPRPSWV-------LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFF 445
+G ++ L YGG D +N++ NG DP+ G TY+
Sbjct: 426 MFGADVGIKFIRDNNKQTLYKYGGAD-----NYQATNVVLPNGAFDPWHVLG---TYNNN 477
Query: 446 DLSFSQDLNL--GSHCLDLDEAKKSDPDWLVQQR 477
+ L + +HC D+ +P L + R
Sbjct: 478 TANHMTPLLIQGAAHCSDMYPTYAGEPADLAKNR 511
>gi|224138414|ref|XP_002322808.1| predicted protein [Populus trichocarpa]
gi|222867438|gb|EEF04569.1| predicted protein [Populus trichocarpa]
Length = 124
Score = 82.4 bits (202), Expect = 5e-13, Method: Composition-based stats.
Identities = 43/116 (37%), Positives = 57/116 (49%), Gaps = 21/116 (18%)
Query: 373 NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
+S++ WN CK +GV PRP+W+ +GGHDIK +LRR SN IF NG+RDP
Sbjct: 26 DSIFPASEWNYNDTATFCKGFFGVEPRPNWITAEFGGHDIKRVLRRFGSNFIFFNGLRDP 85
Query: 433 FSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+S G+H +DL A DP WL RK I+ W+
Sbjct: 86 WS---------------------GAHHVDLRFAIGEDPKWLQDVRKGGASIIAEWL 120
>gi|268552333|ref|XP_002634149.1| Hypothetical protein CBG01710 [Caenorhabditis briggsae]
Length = 507
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 67/223 (30%), Positives = 100/223 (44%), Gaps = 21/223 (9%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN----IQLSGFTYE 113
Q +DHF+ + T+ QRY NSKF+ + LG E ID ++ G T
Sbjct: 34 QYVDHFANNTSA--TWLQRYQYNSKFYNKTVGYVFLMLGGEGAIDPPGDKWVRHEGETMM 91
Query: 114 N-AHQFKALIVILEHRYYGKS--VPFGSRKA-ALKNARHRGYFNSAQALADYASILLHIK 169
A +F A +EHR+YG P G + +LK QALAD + +
Sbjct: 92 VWAKEFGAAAFQVEHRFYGSKEFSPLGDQTTESLK------LLTIDQALADIKEFINQMN 145
Query: 170 DKYNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
Y P+ I G SY G L+ WFR YP + G+++SS+ + + D + Y
Sbjct: 146 KMYFPNDKPIWITFGGSYPGSLSAWFRETYPEMTAGAVSSSSAVHVFVD---YYGYAINT 202
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIA-SKPDGLAILSKKFKTC 270
K +R S C + I+ +++EI A + D A+L F C
Sbjct: 203 EKTYRTVSAPCAEVIRTAFTEIQKKAYNGSDARALLKGTFNLC 245
>gi|308448682|ref|XP_003087721.1| hypothetical protein CRE_28002 [Caenorhabditis remanei]
gi|308253266|gb|EFO97218.1| hypothetical protein CRE_28002 [Caenorhabditis remanei]
Length = 513
Score = 82.0 bits (201), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 114/463 (24%), Positives = 178/463 (38%), Gaps = 71/463 (15%)
Query: 76 RYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN------AHQFKALIVILEHRY 129
RY NSKF+ + LG E I+ +E A +F A +EHR+
Sbjct: 43 RYQYNSKFYNKTVGYVFLMLGGEGSINGTNGDKWVRHEAETMMVWAAEFGAGAFQVEHRF 102
Query: 130 YGKS--VPFGSRKA-ALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV-IAIGAS 185
YG P G + +LK QALAD + + Y P+ I G S
Sbjct: 103 YGSKGFCPIGDQTTESLK------LLTIDQALADIKEFINQMNALYFPLDKPIWITFGGS 156
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
Y G L+ +FR YP + G+++SS+ + + D + Y K +R S+ C IK
Sbjct: 157 YPGSLSAFFRETYPEMTAGAVSSSSAVHVFVD---YYGYAINTEKTYRTVSDSCGDVIKT 213
Query: 246 SWSEIDNIA-SKPDGLAILSKKFKTCTP-------------LKNTTELKDGLDTVYSEAA 291
++ ++ A + PD +L F C +N G++ +
Sbjct: 214 AFQQMQKKAYNGPDSRELLKTTFNLCDSFDENNLSKSIQFFFQNVYGYFQGINQYTGDNR 273
Query: 292 QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCYIN------- 344
T S + V C+ + N P GD+I ++ A + DS Y + + C N
Sbjct: 274 NNATRSGLGVPGACDILNN-PTLGDEI-TRVVAVMSWYDSW-YSPSATGCRPNSYTSFIK 330
Query: 345 --EDRTGDESD----EGWEWQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVS 397
D T + D W WQ+C+E+ D N L + C + +G
Sbjct: 331 YYSDTTMPDDDRISTRSWIWQTCTELGYYQTTDGGNGGIFGSTVPLDFFADQCIDLFG-- 388
Query: 398 PRPSWVL-----------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFD 446
P + L T YGG D R T N+ F NG DP+ G K + +
Sbjct: 389 --PEYTLDNTFKLIDQVRTKYGGADA----YRGT-NVCFPNGSFDPWQDLGH-KANNTNN 440
Query: 447 LSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
S ++ +HC D+ A+ SD L R+ + W++
Sbjct: 441 NVDSWLIDGTAHCADMYPARDSDKQSLKDARRRIHDHLSRWLS 483
>gi|393216141|gb|EJD01632.1| hypothetical protein FOMMEDRAFT_111593 [Fomitiporia mediterranea
MF3/22]
Length = 564
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 115/494 (23%), Positives = 195/494 (39%), Gaps = 81/494 (16%)
Query: 53 TFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAE--APIDDNIQLSGF 110
T+ ++Q IDH + S TF QR+ +F+ G IL G AP +
Sbjct: 67 TYLFDQLIDHNN---PSLGTFKQRFWFTYEFYEPGGPIILMTPGEANAAPYTGYLTNRTI 123
Query: 111 TYENAHQFKALIVILEHRYYGKSVPFGSRK-AALKNARHRGYFNSAQALADYASILLHIK 169
A Q ++LEHRYYG S PF A+LK Y QA+ D ++K
Sbjct: 124 NGLIAQQQNGSTIVLEHRYYGLSNPFDDLSVASLK------YHTIQQAIDDLVYFAQNVK 177
Query: 170 ------DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA---PILYYGDITP 220
DK AP + IG SY G L W + P + ASSA I+Y+
Sbjct: 178 LPMPGGDKVTPDKAPWVLIGGSYSGALTGWTMVNKPDIFFAGYASSAVEESIIYF----- 232
Query: 221 WTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK 280
W + + + F C ++ + + ID + + I S K+ L + T L
Sbjct: 233 WQYFDPI--RQF--MPANCSADVQAAIAHIDQVFTHGSTAEIAS--IKSNFGLGDMTHLD 286
Query: 281 DGLDTVYSEAAQYD--TPSNIPVK---RICNAIE--NAPNCG-----------------D 316
D + + + +P++ P C+A+E N + G +
Sbjct: 287 DVAGALRNNLWDWQSLSPTSGPGALFFTFCDALEVKNGVSAGPRGWGVDHALQAWGKFWN 346
Query: 317 DILCKIAAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMY 376
+ ++ G ++A++ + ++ + + T D S W W C++ +G ++
Sbjct: 347 ETYYELLCGDMDAETCLGTYDPTQDFWT-NTTIDNSGRSWTWIVCNQ----VGFFQDGAP 401
Query: 377 QPEPWNLTKYI------KNCK----EQYGVSPRPSWVLT--YYGGHDIKLILRRSTSNII 424
+ P +T+ + + C E + P P+ T YGG D+ + +
Sbjct: 402 EGTPSLVTRLVQPAYDERQCTYWFPEAFSKPPTPNVASTNRAYGGWDL------TADRLF 455
Query: 425 FSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGS--HCLDLDEAKKSDPDWLVQQRKTEVK 482
F+NG RDP+ +H + SQ + +G HC DL A D + +K +
Sbjct: 456 FANGKRDPWRDATVSSDFHTRQSTSSQPIAVGDGFHCSDLSAAAGMVDDTIASVQKEALA 515
Query: 483 IMQGWITQYYDDFK 496
M+GW+ + + K
Sbjct: 516 KMRGWLATWNSNTK 529
>gi|350636991|gb|EHA25349.1| hypothetical protein ASPNIDRAFT_56689 [Aspergillus niger ATCC 1015]
Length = 569
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 151/360 (41%), Gaps = 40/360 (11%)
Query: 11 LLLMVIFVSTSFHANGLKLR-PRLGRIRRSRILEQKDSNHGFETF-FYNQT--IDHF--- 63
LL +I + + ++ L+LR P L ++ + Q N +F YN + IDHF
Sbjct: 7 LLGTLIASTVALDSSPLELRVPHLHQLSQLSKSHQASLNQDSASFPVYNLSVPIDHFHNE 66
Query: 64 -SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKA 120
Y P + TF RY +++ + G P+ E D I G + A +
Sbjct: 67 SRYEPHTNATFGLRYWLDTSHYQPGG-PVFVIAAGETDGSDRIPFLSQGVVTQLAAAYHG 125
Query: 121 LIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA-----SILLHIKD-KYNA 174
+ +ILEHRYYG+S PF +N R + + QALADYA + ++D A
Sbjct: 126 IGLILEHRYYGESYPF--TNLTTENIR---FLTTEQALADYAYFASNIVFPGLEDLDLTA 180
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
P IA G SY G + R YP + G+++SS + I W Y +
Sbjct: 181 ATTPWIAYGGSYAGAFVAFLRKLYPELYWGAVSSSG--VTEAIIDYWEYYEPIRLYG--- 235
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFK-----TCTPLKN---TTELKDGLDTV 286
+C TI+ S +D I L+++ K + T L N + L GLD+
Sbjct: 236 -PSQCISTIQTSIDIVDRILIDHADNKTLAQQLKSAFGGSATNLSNQDFVSSLSYGLDSF 294
Query: 287 YSEAAQYDTPSNIPVKR-ICNAIENAPNCGDDILCKIAAGVVEADSLE-YDGNNSRCYIN 344
S +D P+ R CN I N+ + I+A V + + YD +N+ N
Sbjct: 295 QSR--NWDKSIGTPLFRAFCNNITNSDLLYPETAEPISASVKQLVQIAGYDPSNTTLVTN 352
>gi|115385535|ref|XP_001209314.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114187761|gb|EAU29461.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 556
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 60/205 (29%), Positives = 99/205 (48%), Gaps = 20/205 (9%)
Query: 18 VSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHF----SYGPESYQTF 73
V + H + P L +I+RS ++ +D + + + IDHF Y P + + F
Sbjct: 14 VQATIHPLDVLSNPHLRQIKRSHAIQPRDVT--YPAHYLSVPIDHFHNESRYEPHTDKHF 71
Query: 74 PQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVILEHRYYG 131
P RY +++++ G P+ E +D G + A ++ L VILEHRYYG
Sbjct: 72 PLRYWFDAQYYQPGG-PVFVIAAGETSGEDRFPFLSQGIVTQLAEKYHGLGVILEHRYYG 130
Query: 132 KSVPFGSRKAALKNARHRGYFNSAQALADYA----SILLHIKDKYNAT--HAPVIAIGAS 185
S PF + + N R + ++ QA+ADYA +++ D + + + P IA G S
Sbjct: 131 DSYPFDNLTTS--NIR---FLSTEQAVADYAYFASNVVFPGLDHVDLSPENTPWIAYGGS 185
Query: 186 YGGELATWFRLKYPHVVIGSLASSA 210
Y G ++ R YP V G+++SS
Sbjct: 186 YAGAFVSFLRKLYPDVYWGAVSSSG 210
>gi|242794965|ref|XP_002482483.1| extracelular serine carboxypeptidase, putative [Talaromyces
stipitatus ATCC 10500]
gi|218719071|gb|EED18491.1| extracelular serine carboxypeptidase, putative [Talaromyces
stipitatus ATCC 10500]
Length = 561
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 64/234 (27%), Positives = 102/234 (43%), Gaps = 28/234 (11%)
Query: 60 IDHFS----YGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYE 113
IDHF Y P S TFP RY ++ + G P+ E +D + G +
Sbjct: 58 IDHFHNESRYEPHSNGTFPLRYWFDASHYKEG-GPVFVLESGETSGEDRLPYLQKGLVAQ 116
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI----- 168
A + V+LEHRYYG S+P ++ + ++ R + + Q LAD A +I
Sbjct: 117 LAQLTNGIAVVLEHRYYGASIP--TKDFSTESLR---FLTTEQGLADVAYFAQNIVYPGF 171
Query: 169 -KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+ H P IA G SY G + + R+ YP V G++ASSA D W + +
Sbjct: 172 EDQNLTSRHVPYIAYGGSYAGAMVAFLRVTYPDVFFGAIASSAVTEAIVDY--WQYWEPI 229
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
R+ + C T++ +DN+A + K +T L++ + + D
Sbjct: 230 R----RNAPQNCIHTVENLTGVLDNLAHNTSAV----KDLETLFGLQDLSHVDD 275
>gi|67900488|ref|XP_680500.1| hypothetical protein AN7231.2 [Aspergillus nidulans FGSC A4]
gi|40742088|gb|EAA61278.1| hypothetical protein AN7231.2 [Aspergillus nidulans FGSC A4]
gi|259483425|tpe|CBF78805.1| TPA: hypothetical serine carboxypeptidase (Eurofung) [Aspergillus
nidulans FGSC A4]
Length = 519
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 68/260 (26%), Positives = 115/260 (44%), Gaps = 28/260 (10%)
Query: 51 FETFFYNQTIDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQ 106
++ + ++ IDHF Y P S TF RY +S ++ G P+ E +D +
Sbjct: 18 YQAYNFSVPIDHFHNETRYAPHSNGTFNLRYWFDSTYYQPG-GPVFVIAAGETDGEDRFE 76
Query: 107 L--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
G + A + L VILEHRYYG+S PF + R + ++ Q+LADYA
Sbjct: 77 FLSQGIVTQLAEAYNGLGVILEHRYYGESYPFPGADVTVDELR---FLSTEQSLADYAYF 133
Query: 165 LLHI----KDKYNAT--HAPVIAIGASYGGELATWFRLKYPHVVIGSLASS---APILYY 215
H+ + Y+ T + P IA G SY G + R YP + G+++SS A I+ Y
Sbjct: 134 AKHVIFPGLEAYDLTAPNTPWIAYGGSYAGAQVAFMRKLYPSIFHGAVSSSGVTAAIIDY 193
Query: 216 GDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKN 275
W + + + RD C ++I+ ID I G L + ++ +
Sbjct: 194 -----WKYFEPIRNYGPRD----CIESIQTLTDLIDRILIDHPGNRTLHAQLQSAFGVNP 244
Query: 276 TTELKDGLDTVYSEAAQYDT 295
+ +D ++ + + + +
Sbjct: 245 AIDNRDFVNMLSTPLGSFQS 264
>gi|449540369|gb|EMD31362.1| hypothetical protein CERSUDRAFT_89238 [Ceriporiopsis subvermispora
B]
Length = 520
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/303 (24%), Positives = 137/303 (45%), Gaps = 44/303 (14%)
Query: 1 MKSSQFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHG---------- 50
MKS+ +F + + ++ +G P ++R L+ + G
Sbjct: 1 MKSALVAFAAIAVALL--------DGASAIPHSPVVQRLLALKANSAKAGKQLETRASPS 52
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--S 108
F + + Q +DHF ++ TF QRY ++++ + G PI+ F G EA ++ + + +
Sbjct: 53 FPQYNFTQPLDHFE---DTGVTFEQRYWVSTRHYVPGG-PIVVFDGGEASAEERLPILDT 108
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
G + L +ILEHRYYG SV G N R + N+ QAL D A + ++
Sbjct: 109 GIVDILTNATGGLGIILEHRYYGASV--GVTNFTTDNLR---WLNNDQALEDSAVFMTNV 163
Query: 169 K-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+ + A P I G SY G + +++YP +V G++ASS + + + W
Sbjct: 164 QIPGISENITAPGRPWIYYGGSYAGARSAIMKVRYPDIVYGAIASSG--VAHATLRDWRY 221
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL 283
Y + + F + C ++++ E+DN+ ++ A + F L N T ++D +
Sbjct: 222 YD--IIRQFAPAA--CMAQVEQAIVEVDNLVAEDATRAKVQALFG----LPNVTHVQDFV 273
Query: 284 DTV 286
T+
Sbjct: 274 ATL 276
>gi|389645554|ref|XP_003720409.1| hypothetical protein MGG_15423 [Magnaporthe oryzae 70-15]
gi|351640178|gb|EHA48042.1| hypothetical protein MGG_15423 [Magnaporthe oryzae 70-15]
Length = 564
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 72/234 (30%), Positives = 99/234 (42%), Gaps = 26/234 (11%)
Query: 60 IDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYE 113
+DHF SY P S TF RY ++ + G P++ LG E + + G Y
Sbjct: 59 VDHFHNDTSYEPHSNDTFELRYWFDASHYVNG-GPVIVLLGGETSGAERLPFMEKGILYR 117
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA----SILLHIK 169
A + + V+LEHRYYG S P + +N R + + QALAD A +++ H
Sbjct: 118 LARATRGMAVVLEHRYYGASFP--TPNLTTENLR---FLTTDQALADTAYFAKNVVFHGY 172
Query: 170 DKYNATH--APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+ N T P A G SY G A + R YP V G+++SS L D Y
Sbjct: 173 ENRNLTSHTTPYFAYGGSYAGAFAAFVRKLYPDVFWGAISSSGVPLAVID------YWEY 226
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
+ EC +K + +D IA DG KK K L N T D
Sbjct: 227 CEAQRKFAPSECVDVTQKLTNVLDTIAQ--DGKFEDMKKLKEVFGLSNLTNRHD 278
>gi|320593738|gb|EFX06147.1| serine-type peptidase [Grosmannia clavigera kw1407]
Length = 515
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 101/220 (45%), Gaps = 23/220 (10%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN- 114
+ Q +DH S TF QR+ +++ FW G SP+ F+ E +D G+ E
Sbjct: 55 FQQLLDH---SDASKGTFTQRFWLDTHFWDGPGSPVFLFMAGE---EDASGYLGYLREGI 108
Query: 115 ----AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA-SILLHIK 169
A F L+V++EHRY+GKS PF + A + R NS + + +A ++ + +
Sbjct: 109 PGLYAENFGGLVVVIEHRYFGKSQPFDTLTA--ETLRFLDLPNSMKDMTYFAQNVDIEVA 166
Query: 170 DKY---NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHS 226
+ + AP + IG SY G LA W + K P V ASSA + I+ +++Y
Sbjct: 167 NGTVLDKPSEAPWVLIGGSYSGALAAWIQQKEPGVFFAYHASSAVV---ETISDFSSYFK 223
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKK 266
+ + C ++ IDN + D +A+ K
Sbjct: 224 PIEEGL---PRNCSADVRAVVQYIDNTLTDGDSIAVEDLK 260
>gi|440471426|gb|ELQ40436.1| hypothetical protein OOU_Y34scaffold00435g4 [Magnaporthe oryzae
Y34]
gi|440488251|gb|ELQ67984.1| hypothetical protein OOW_P131scaffold00272g5 [Magnaporthe oryzae
P131]
Length = 400
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 72/234 (30%), Positives = 99/234 (42%), Gaps = 26/234 (11%)
Query: 60 IDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYE 113
+DHF SY P S TF RY ++ + G P++ LG E + + G Y
Sbjct: 55 VDHFHNDTSYEPHSNDTFELRYWFDASHYVNG-GPVIVLLGGETSGAERLPFMEKGILYR 113
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA----SILLHIK 169
A + + V+LEHRYYG S P + +N R + + QALAD A +++ H
Sbjct: 114 LARATRGMAVVLEHRYYGASFP--TPNLTTENLR---FLTTDQALADTAYFAKNVVFHGY 168
Query: 170 DKYNATH--APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+ N T P A G SY G A + R YP V G+++SS L D Y
Sbjct: 169 ENRNLTSHTTPYFAYGGSYAGAFAAFVRKLYPDVFWGAISSSGVPLAVID------YWEY 222
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
+ EC +K + +D IA DG KK K L N T D
Sbjct: 223 CEAQRKFAPSECVDVTQKLTNVLDTIAQ--DGKFEDMKKLKEVFGLSNLTNRHD 274
>gi|170040233|ref|XP_001847911.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167863799|gb|EDS27182.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 512
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 64/230 (27%), Positives = 109/230 (47%), Gaps = 17/230 (7%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD-NIQLSGFTYE 113
F+ IDHF+ ++ + RY+ + ++ G PIL +LG PI + S Y+
Sbjct: 66 FFTTRIDHFN--AQNTDEWTLRYLAVTDWYQPG-GPILIWLGGYMPIQPYMVDESSLIYD 122
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKY 172
A + + E RY+G+S + + + +N R + N+ Q LAD A + ++K D
Sbjct: 123 MAREMHGAVYAFETRYFGQS--WITEDVSTENLR---FLNADQVLADLAEFVAYLKRDVL 177
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
+A V+ G YGG LATWFR++YPH+ + +SS D + DF
Sbjct: 178 RNEYAHVLVSGVGYGGSLATWFRVRYPHLSDAAWSSSGVHNALVDFQEFAEDWGQTLIDF 237
Query: 233 RDTSEECFQTIKKSWSEIDNI--ASKPDGLAILSKKFKTCTPLKNTTELK 280
S+EC+ I +++ + N+ A + D IL ++ C + L+
Sbjct: 238 --GSQECYNEIFVAFNVMQNLIDAGRED---ILYERLDLCDEIDTEDRLE 282
>gi|336466859|gb|EGO55023.1| hypothetical protein NEUTE1DRAFT_117639 [Neurospora tetrasperma
FGSC 2508]
gi|350288537|gb|EGZ69773.1| hypothetical protein NEUTE2DRAFT_145652 [Neurospora tetrasperma
FGSC 2509]
Length = 547
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 85/332 (25%), Positives = 137/332 (41%), Gaps = 48/332 (14%)
Query: 22 FHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINS 81
F G+++ P I Q + G+ TF +Q IDH + PE TF QR+
Sbjct: 22 FKTIGMEIGPIDDHIESLNKRAQVEGTSGYGTF--DQLIDHNT--PE-LGTFKQRFWYGF 76
Query: 82 KFWGGGNSPILAFLGAEAPID-------DNIQLSGFTYENAHQFKALIVILEHRYYGKSV 134
++W G SPI+ E D + +L+G+ A A +VI+EHRY+G S
Sbjct: 77 QYWKGPGSPIILVNPGEQAADGFNKSYLSDQRLAGWM---AKDMGAAVVIMEHRYWGNSS 133
Query: 135 PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT---HAPVIAIGASYGGELA 191
PF + +KN ++ NS + + +A + DK N + +AP I G SY G LA
Sbjct: 134 PF--DELTVKNLQYLTLENSLKDINYFAEHIELPFDKTNGSKPANAPWIFSGGSYSGALA 191
Query: 192 TWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEID 251
W YP +S + G WT + V+ T + C + + +D
Sbjct: 192 GWLEALYPGTFWAYHGTSGVVETVGHF--WTYFVPVLEA----TPQNCTKDLTAVIDFVD 245
Query: 252 NIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQY-------------DTPSN 298
++ +L K LK+ +L+D D ++ A + T
Sbjct: 246 SV--------LLHGTPKAKRELKSKFKLQDLTDADFASAIESGPWSWQSTQFYSEKTTGY 297
Query: 299 IPVKRICNAIENA-PNCGDDILCKIAAGVVEA 329
P R C+ +EN PN + + + G+ +A
Sbjct: 298 TPYYRFCDYVENVWPNSTNKVPGPLGVGIKKA 329
>gi|312380006|gb|EFR26124.1| hypothetical protein AND_08001 [Anopheles darlingi]
Length = 506
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 72/264 (27%), Positives = 122/264 (46%), Gaps = 28/264 (10%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+ I+HF P+ TF Y+ N +++ G P+ +G +D + + A
Sbjct: 62 FTSRINHFD--PQDRSTFEFNYLTNDQYYREG-GPLFVVVGGHHRLDPYFLENSHFRDVA 118
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT 175
A + EHR++G SVP + + +N R + + QAL D + +K +
Sbjct: 119 ALNGAFLANNEHRFFGTSVP--TEDLSSENLR---FLRTEQALFDLIEWIDFLKREVMGD 173
Query: 176 -HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
+A VI G SYGG LATW R ++P+++ G+ SSA + T S ++DF +
Sbjct: 174 PNARVIVHGFSYGGALATWARQRFPNIIDGAWGSSATV-------RATVEFSEFTEDFGN 226
Query: 235 T-----SEECFQTIKKSWSEIDNIASKPDGLA-ILSKKFKTCTPLKN----TTELKDGLD 284
T S+EC+ I +++ I+N+ GL +S F TC P+ EL L
Sbjct: 227 TIRVKGSDECYSAIFRAFHTIENLLDA--GLTERVSSTFNTCDPIDADNALQVELFLHLM 284
Query: 285 TVYSEAAQYDTPSNIPVKRICNAI 308
T+ E + ++ V+ +CN +
Sbjct: 285 TLSLEISMFEDFDIENVQNVCNQL 308
>gi|312383569|gb|EFR28610.1| hypothetical protein AND_03264 [Anopheles darlingi]
Length = 490
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 64/223 (28%), Positives = 103/223 (46%), Gaps = 19/223 (8%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFK 119
+DHF P++ TF Y N +F+ G PI F+G + G Y+ A +
Sbjct: 71 VDHFD--PQNRDTFEFNYYSNDEFYQPG-GPIYIFVGGNFQLTTYYIEHGLLYDTAARDH 127
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
A + EHRYYG S P + +N R + ++ Q L D + H++++ +A
Sbjct: 128 AWLFTNEHRYYGTSTPV--ENYSTENLR---FLHTEQVLTDLIEWIDHLRNEVVRDPNAK 182
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
VI +G Y G LATW R ++P++V G+ S A +L D + + F +E
Sbjct: 183 VILMGVGYAGALATWARQRFPNIVDGAWGSGATVLASFDFQEHANDIGSIIRRF--GGDE 240
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
C+ T+ ++ N+ I S++ +T T L NT E D
Sbjct: 241 CYSTLWVAFRTAQNL--------IDSERDETVTTLLNTCEPID 275
>gi|258566732|ref|XP_002584110.1| predicted protein [Uncinocarpus reesii 1704]
gi|237905556|gb|EEP79957.1| predicted protein [Uncinocarpus reesii 1704]
Length = 552
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 83/342 (24%), Positives = 134/342 (39%), Gaps = 52/342 (15%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDN-----IQLSGFTYEN 114
IDH + G + + R+ IN + G P+ F G EA + + F
Sbjct: 74 IDHKNPGAK----YKNRFWINDSHYKSG-GPVFVFDGGEANAQRYADYYLVNETSFFVHL 128
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK----- 169
+FK + ++ EHRYYG+S+PF A H Y N+ QALAD K
Sbjct: 129 LEEFKGMGIVWEHRYYGESLPFPVNLDT--PAEHFQYLNNEQALADIPYFAERFKRENFP 186
Query: 170 -DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW--TTYHS 226
D P + +G SY G A + R KYP + + A+SAP+ D++ + Y
Sbjct: 187 NDDLTPKSTPWVMLGGSYPGMRAAFTRDKYPETIFAAFAASAPVQAQIDMSVYYEQVYRG 246
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT-ELKDGLDT 285
+V+ +++ C + ++ ++ ID + A + K F T +N + L
Sbjct: 247 LVAYGYKN----CTKDVRAAYKYIDAQLRHRESAARIKKLFFGETAEQNNNGDFTQAL-- 300
Query: 286 VYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSL------------- 332
+++ A N PV + C+ +E P G + A ++
Sbjct: 301 IWNWATWQSAGPNGPVGQFCDWLETDPETGKTAPAEGWAPTKGGKAMAERFAAWPPLVPS 360
Query: 333 --EYDGNNSR---------CYINEDRTGDESDEGWEWQSCSE 363
Y G N + C + R D + W WQ CSE
Sbjct: 361 INPYFGTNCKGQNPNKPTSCNLGM-RNADPTLISWTWQYCSE 401
>gi|158298286|ref|XP_001689125.1| AGAP004013-PA [Anopheles gambiae str. PEST]
gi|157014451|gb|EDO63478.1| AGAP004013-PA [Anopheles gambiae str. PEST]
Length = 496
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 76/275 (27%), Positives = 130/275 (47%), Gaps = 22/275 (8%)
Query: 9 QWLLLMVIFVSTSFHANGLKLRPR------LGRIRRSRI-LEQKDSNHGFETFFYNQTID 61
++L+++++ + A L P+ L R+R + + L+ N F+ +D
Sbjct: 2 KFLVVLLLAACGTALAAKLSTPPKALTPSFLNRLRSATVGLKPSQRNANITEEFFTTEVD 61
Query: 62 HFSYGPESYQTFPQRYV-INSKFWGGGNSPILAFLGAEAPIDDNIQLSG-FTYENAHQFK 119
HF+ + T+ RY+ + F GG P+L FL +AP+D ++ G E A
Sbjct: 62 HFN--NQDLTTWSNRYLALMDHFVEGG--PMLIFLTGDAPLDPSMIDDGTLINEMARDLG 117
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-NATHAP 178
+ LE R+YGKS P G +++ R N+ Q LAD A ++H++ N A
Sbjct: 118 GAVFALETRFYGKSQPVG--DLTVESLR---LLNTDQILADVADFVVHLRRTVINNPFAH 172
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
+ G GG LATWFR++YPH+V + +SS I D +++ + + S E
Sbjct: 173 PLVTGTGLGGGLATWFRVRYPHLVDATWSSSGYIQAVFDFQEFSSGWAETAITV--GSNE 230
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL 273
C+ I ++ N+ G +L +KF C+P+
Sbjct: 231 CYNRIFIAFHVAQNLIDAGFG-EVLYEKFNLCSPI 264
>gi|123446346|ref|XP_001311925.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121893752|gb|EAX98995.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 436
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 108/474 (22%), Positives = 187/474 (39%), Gaps = 59/474 (12%)
Query: 34 GRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILA 93
GR+RR RIL D N+ F TF +Q IDH P+ TF QRY + + IL
Sbjct: 4 GRLRR-RIL--GDDNYTFLTF--SQNIDHSD--PQK-GTFKQRYEALFDYTTDNKTAIL- 54
Query: 94 FLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFN 153
F+G E+ + + +F A +LEHRY+G+S P +K Y
Sbjct: 55 FIGGESDTFRPRAFNDYMATLCKEFNAAFFMLEHRYFGESFPTDLSYPNIK------YLT 108
Query: 154 SAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL 213
A+ D + + + ++Y T + I +G SY G L+ + R KYP S+ASS ++
Sbjct: 109 VDNAIDDLYNFKVKMVEQYKMTDSKWILVGGSYPGLLSAYTRAKYPKEFHASIASSGVVI 168
Query: 214 YYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL 273
+ + + ++ + + I++ E+ + + PD L F
Sbjct: 169 ASNN---YEDFDRQIAISLGQSCASVAREIRRRTDEL--LETDPDWLL---ATFNMTGLE 220
Query: 274 KNTTELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLE 333
K L L ++S AQY +++C +E+ G D + IA E +
Sbjct: 221 KENFPLV--LGEIFSLGAQYGR-----RQQLCGPLEDTLITGADPVMAIAKYTREIFTPN 273
Query: 334 YDGN-------NSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKY 386
Y + NSR + G + W W +C+E+ + + +
Sbjct: 274 YADDDIIGTYSNSRLSVTSTPNGPRA---WLWMTCNELAYWQVNSGRLTLRSKKVTQDFF 330
Query: 387 IKNCKEQYG---VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGW----- 438
+ CK + +P +G L + TS I + G +DP++ +
Sbjct: 331 LNQCKTVFSDEMKTPDTDAWNQKWGD------LLKKTSRIYYLTGSQDPWTPVCYTAEDS 384
Query: 439 --VKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQ 490
+ + Q++ HC DL + SDP L + R+ ++ W+ +
Sbjct: 385 DKIGPNCYVHTIVGQEI---GHCRDLSSPQPSDPTDLTRTREHVKAVIHRWLAE 435
>gi|145246822|ref|XP_001395660.1| extracellular serine carboxypeptidase [Aspergillus niger CBS
513.88]
gi|134080382|emb|CAK46303.1| unnamed protein product [Aspergillus niger]
Length = 569
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 98/360 (27%), Positives = 149/360 (41%), Gaps = 40/360 (11%)
Query: 11 LLLMVIFVSTSFHANGLKLR-PRLGRIRRSRILEQKDSNHGFETF-FYNQT--IDHF--- 63
LL +I + + ++ L+LR P L ++ + Q N +F YN + IDHF
Sbjct: 7 LLGTLIASTVALDSSPLELRVPHLHQLSQLSKSHQASLNQDSASFPVYNLSVPIDHFHDE 66
Query: 64 -SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKA 120
Y P + TF RY +++ + G P+ E D I G + A +
Sbjct: 67 SRYEPHTNATFGLRYWLDTSHYQPGG-PVFVIAAGETDGSDRIPFLSQGVVTQLAAAYHG 125
Query: 121 LIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI------KDKYNA 174
+ +ILEHRYYG+S PF +N R + + QALADYA +I A
Sbjct: 126 IGLILEHRYYGESYPF--TNLTTENIR---FLTTEQALADYAYFASNIVFPGLEDLDLTA 180
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
P IA G SY G + R YP + G+++SS + I W Y +
Sbjct: 181 ATTPWIAYGGSYAGAFVAFLRKLYPELYWGAVSSSG--VTEAIIDYWEYYEPIRLYG--- 235
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFK-----TCTPLKN---TTELKDGLDTV 286
+C TI+ S +D I L+++ K + T L N + L GLD+
Sbjct: 236 -PSQCISTIQTSIDIVDRILIDHADNKTLAQQLKSAFGGSATNLSNQDFVSSLSYGLDSF 294
Query: 287 YSEAAQYDTPSNIPVKR-ICNAIENAPNCGDDILCKIAAGVVEADSLE-YDGNNSRCYIN 344
S +D P+ R CN I N+ + I+A V + + YD +N+ N
Sbjct: 295 QSR--NWDKTIGTPLFRAFCNNITNSDLLYPEAAEPISASVKQLVQIAGYDPSNTTLVNN 352
>gi|32351096|gb|AAP74971.1| thymus specific serine peptidase [Homo sapiens]
Length = 164
Score = 80.1 bits (196), Expect = 2e-12, Method: Composition-based stats.
Identities = 48/145 (33%), Positives = 78/145 (53%), Gaps = 8/145 (5%)
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P G L+ A+ R + +S ALAD S L + +N
Sbjct: 10 APAWGALVISLEHRFYGLSIPAG----GLEMAQLR-FLSSRLALADVVSARLALSRLFNI 64
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT--TYHSVVSKD 231
++ +P I G SY G LA W RLK+PH++ S+ASSAP+ D + + HS+ K
Sbjct: 65 SSSSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDFSEYNDIVLHSLGQKC 124
Query: 232 FRDTSEECFQTIKKSWSEIDNIASK 256
+ E ++ + ++ + +
Sbjct: 125 LSFSRAETVAQLRSTEPQLSGVGDR 149
>gi|148676288|gb|EDL08235.1| dipeptidylpeptidase 7, isoform CRA_b [Mus musculus]
Length = 212
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/180 (31%), Positives = 80/180 (44%), Gaps = 39/180 (21%)
Query: 35 RIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINS------------- 81
+ R R+L+ F ++ Q +DHF++ +TF QR++++
Sbjct: 38 QARADRVLDPD-----FHENYFEQYMDHFNFESFGNKTFGQRFLVSGECWLDRMGSYLRF 92
Query: 82 ------------KFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRY 129
KFW G PI + G E I SGF E A Q +AL+V EHRY
Sbjct: 93 PGLMETHLLPTDKFWKMGEGPIFFYTGNEGDIWSFANNSGFMVELAAQQEALLVFAEHRY 152
Query: 130 YGKSVPFGSRKAALKNARHRGY---FNSAQALADYASILLHIKDKYNATHAPVIAIGASY 186
YGKS+PFG + + RGY QALAD+A +L ++ AP IA G +
Sbjct: 153 YGKSLPFGVQ------STQRGYTQLLTVEQALADFAVLLQALRQDLGVHDAPTIAFGGRW 206
>gi|317030966|ref|XP_001392567.2| serine peptidase [Aspergillus niger CBS 513.88]
Length = 508
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 116/475 (24%), Positives = 183/475 (38%), Gaps = 89/475 (18%)
Query: 30 RPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS 89
RPRL SR K + E +F Q +DH + PE TF QRY ++++WGG S
Sbjct: 24 RPRLVPKPISRPASSKSAATTGEAYF-EQLLDH--HNPEK-GTFSQRYWWSTEYWGGPGS 79
Query: 90 PILAFLGAEAPIDD------NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAAL 143
P++ F E D N L+G A + + ++++EHRY+G S P+ +
Sbjct: 80 PVVLFNPGEVSADGYEGYLTNDTLTGVY---AQEIQGAVILIEHRYWGDSSPY-----EV 131
Query: 144 KNARHRGYFNSAQALADYASILLHIKDKY------NATHAPVIAIGASYGGELATWFRLK 197
NA Y Q++ D +K ++ NA +AP + +G SY G L W
Sbjct: 132 LNAETLQYLTLDQSILDMTYFAETVKLQFDNSSRSNAQNAPWVMVGGSYSGALTAWTESI 191
Query: 198 YPHVVIGSLASSAPILYYGDITPWTTYHSV-------VSKDFRDTSEECFQTIK----KS 246
P A+SAP+ D W ++ + SKD +E + K K
Sbjct: 192 APGTFWAYHATSAPVEAIYDF--WQYFYPIQQGMAQNCSKDVSLVAEYVDKIGKNGTAKE 249
Query: 247 WSEI-------------DNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQY 293
E+ D A P+G L + T + + D ++ V + AA
Sbjct: 250 QQELKELFGLGAVEHYDDFAAVLPNG-PYLWQDNDFVTGYSSFFQFCDAVEGVEAGAAVT 308
Query: 294 DTPSNIPVKRICNAIENAPNCGDDILCKIAAG----VVEADSLEYDGNNSRCYINEDRT- 348
P + +++ N N IL A E +D N+ I D +
Sbjct: 309 PGPEGVGLEKALANYANWFN--STILPNYCASYGYWTDEWSVACFDSYNASSPIFTDTSV 366
Query: 349 GDESDEGWEWQSCSE--------------MVVP------MGKDKNSMYQPEPWNLTKYIK 388
G+ D WEW C+E +VP + + +Y PE + Y
Sbjct: 367 GNPVDRQWEWFLCNEPFFWWQDGAPEGTSTIVPRLVSASYWQRQCPLYFPE---VNGYTY 423
Query: 389 NCKEQYGVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
+ + SW GG D + R+T+ +I++NG DP+ G T+
Sbjct: 424 GSAKGKNSATVNSWT----GGWD----MTRNTTRLIWTNGQYDPWRDSGVSSTFR 470
>gi|407407956|gb|EKF31559.1| serine carboxypeptidase S28, putative [Trypanosoma cruzi
marinkellei]
Length = 483
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 111/506 (21%), Positives = 187/506 (36%), Gaps = 69/506 (13%)
Query: 12 LLMVIFVS--TSFHANGLKLRPRLGRIRRSRILEQ-KDSNHGFE----------TFFYNQ 58
LL V F T+ H + + R R G + R+ + K+ G E +YNQ
Sbjct: 16 LLAVAFSGPVTALHPSVIPSRARHGEVYRAALRRNVKEGPGGHEKQRHDVRNNPAQYYNQ 75
Query: 59 TIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQF 118
+DH + TF QR+ ++ W + P + + E P + GF E
Sbjct: 76 RVDHADV---TLGTFRQRWWVDRSSWDANSGPAILLVNGEGPAP-GLPDGGFVGEYGKSV 131
Query: 119 KALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAP 178
KA+I LEHRYYG+S+P A L N Y ALAD + + +
Sbjct: 132 KAIIFSLEHRYYGESMP-----APLTNRSMLKYLTVENALADLQAFKKYAEKNVVKKKVK 186
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEE 238
+ +G SY G L+ W R KYP + +SS G + Y + +
Sbjct: 187 WLIVGGSYAGALSAWARAKYPGDFDAAWSSS------GVVNAIFDYEAFDGHLLKVLPPS 240
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKKFKT---CTPLKNTTELKDGLDTVYSEAAQYDT 295
C ++ + + P A + K F T T L DG + A QY
Sbjct: 241 CAAAVRAIFGKFSKAYDDPKRRAKMMKTFGTPNYFTKPDMAWMLADGA----AMAVQYGY 296
Query: 296 PSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSR-CYINEDRTGDES-- 352
++C++IE K+ + L + +R CY + + D+S
Sbjct: 297 KD-----KLCSSIEFTRK------GKLFKRYAKLMKLLWGEEFTRSCYYSTECLSDKSYS 345
Query: 353 -----DEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP---SWVL 404
W +Q CS++ + E N + ++ C+ +G + P ++
Sbjct: 346 ESWKQGYAWAYQCCSQLAYWQTGFPGGLRLMEV-NTSYFMYQCRAAFGEAILPDTYAFNK 404
Query: 405 TYYGGHD--IKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDL 462
+ G H +++ +++ + + G +D S + L +Q N HC DL
Sbjct: 405 RHGGAHPNATRVVATQASDDPWLTAGAKDALSAE--------YPLVVAQ-CNGCGHCGDL 455
Query: 463 DEAKKSDPDWLVQQRKTEVKIMQGWI 488
+ L QR ++ W+
Sbjct: 456 AATNPFEHPSLKAQRSAVKSYLKEWL 481
>gi|358369966|dbj|GAA86579.1| extracelular serine carboxypeptidase [Aspergillus kawachii IFO
4308]
Length = 569
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 92/360 (25%), Positives = 147/360 (40%), Gaps = 40/360 (11%)
Query: 11 LLLMVIFVSTSFHANGLKLRP----RLGRIRRSRILEQKDSNHGFETFFYNQTIDHF--- 63
LL +I + + ++ L+LR +L ++ +S + F + + IDHF
Sbjct: 7 LLGTLIASTVALDSSPLELRTPHLHQLSQLSKSHQASPNQESASFPVYNLSVPIDHFHNE 66
Query: 64 -SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKA 120
Y P + TF RY +++ + G P+ E D I G + A +
Sbjct: 67 SRYEPHTNATFGLRYWLDTSHYQPGG-PVFVIAAGETDGSDRIPFLSQGVVTQLAAAYNG 125
Query: 121 LIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI------KDKYNA 174
+ +ILEHRYYG+S PF + +N R + + QALADYA +I A
Sbjct: 126 VALILEHRYYGESYPFANLTT--ENIR---FLTTEQALADYAYFASNIVFPGLEHLDLTA 180
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
P IA G SY G + R YP + G+++SS + I W Y +
Sbjct: 181 ATTPWIAYGGSYAGAFVAFLRKLYPDLYWGAVSSSG--VTEAIIDYWEYYEPIRLYG--- 235
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFK-----TCTPLKN---TTELKDGLDTV 286
+C T++ S +D I L+++ K + L N + L GLD+
Sbjct: 236 -PSQCISTLQTSIDIVDRILIDHANNKTLAQQLKSAFGGSAANLGNQDFVSSLSYGLDSF 294
Query: 287 YSEAAQYDTPSNIPVKR-ICNAIENAPNCGDDILCKIAAGVVEADSLE-YDGNNSRCYIN 344
S +D P+ R CN I N+ + I+A V + + YD +N+ N
Sbjct: 295 QSR--NWDKSIGTPLFRAFCNNITNSNLLYPETAEPISASVKQLVQIAGYDSSNTTLVNN 352
>gi|406866881|gb|EKD19920.1| putative endoprotease [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 549
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 75/165 (45%), Gaps = 13/165 (7%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID------DNIQLS 108
F+ Q +DH S TF Q+Y NS+FW G SP++ F EA N+ +S
Sbjct: 54 FFTQLLDH---DDPSKGTFQQKYWWNSEFWAGPGSPVVFFTPGEAAAAPYGSYLTNVTVS 110
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKA-ALKNARHRGYFNSAQALADYASILLH 167
G A + + +++ EHRYYG S P+ + A L+ A+ A++
Sbjct: 111 GLF---AQEVQGAVILFEHRYYGDSSPYDTLDAETLQLLTLHQSMQDFTYFANTAALPFD 167
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
I NA AP + G SY G LA W +P ASSAP+
Sbjct: 168 INHSSNANKAPWVFTGGSYSGALAAWTEKLFPGTFWAYHASSAPV 212
>gi|389740212|gb|EIM81403.1| peptidase S28 [Stereum hirsutum FP-91666 SS1]
Length = 545
Score = 78.6 bits (192), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 65/211 (30%), Positives = 105/211 (49%), Gaps = 23/211 (10%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNS---PILAFLGAEAPIDDNIQL 107
F + + Q +DHF YG + TFPQRY ++++ + G++ P++ G E +D +
Sbjct: 65 FPQYNFTQPLDHF-YG-STNGTFPQRYWVSTRHYTPGSNATVPVIVLDGGETSGEDRLPY 122
Query: 108 --SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASIL 165
+G A + V+LEHRYYG SV G + N R + N+ QAL D A+ +
Sbjct: 123 LDTGIVDILAEATGGVGVVLEHRYYGDSV--GVPDFSTDNLR---WLNNEQALEDSANFM 177
Query: 166 LHIK-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITP 220
++K + A P I G SY G A ++ YP +V G++ASS + + IT
Sbjct: 178 RNVKFEGIDEDLTAPGTPWIYFGGSYAGARAAHMKVLYPDIVYGAIASSG--VTHAAITN 235
Query: 221 WTTYHSVVSKDFRDTSEECFQTIKKSWSEID 251
W Y V+ + F + EC + ++ S +D
Sbjct: 236 W-EYMDVI-RQF--ATVECSDNLVQTVSTVD 262
>gi|302410211|ref|XP_003002939.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
gi|261357963|gb|EEY20391.1| conserved hypothetical protein [Verticillium albo-atrum VaMs.102]
Length = 445
Score = 78.2 bits (191), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 93/212 (43%), Gaps = 20/212 (9%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHF----SYG 66
+ L VIF + A + + + RI + D+ G F + +DHF Y
Sbjct: 7 IALPVIFATAPSAATFILGTEHVSLTSQPRITLKDDTVLG--PFNLSVPVDHFHNETRYE 64
Query: 67 PESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVI 124
P S TFP RY IN K + G P+ E +D + G A L ++
Sbjct: 65 PHSNGTFPLRYWINKKHYRPGG-PVFLLASGETTGEDRLGYLDHGIIAMFAEATHGLGLV 123
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK------DKYNATHAP 178
LEHRYYG S P + ++ N R + ++ QALAD A H+ ++ T P
Sbjct: 124 LEHRYYGTSFPVAN--VSIPNLR---FLSTEQALADTAFFAEHVTFPDLEHEELGPTDVP 178
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
IA G SY G A + R YP V G+++SS
Sbjct: 179 WIAFGGSYAGAFAAFLRKLYPDVFWGAVSSSG 210
>gi|15144318|gb|AAK84459.1|AC087192_20 putative serine peptidase [Oryza sativa Japonica Group]
Length = 267
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 117/272 (43%), Gaps = 35/272 (12%)
Query: 62 HFSYGPES--------YQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYE 113
H ++GP ++ F QRY ++ PI ++ E+ N + +
Sbjct: 10 HETFGPHKKAIGTSADHRQFKQRYYEFLDYYRAPKGPIFLYICGESSC--NGIPNSYLAV 67
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A +F A +V EHRYYGKS PF S +N R + +S QAL D A + ++ N
Sbjct: 68 MAKKFGAAVVSPEHRYYGKSSPFESLTT--ENLR---FLSSKQALFDLAVFRQYYQETLN 122
Query: 174 ATH------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
A + + G SY G L+ WFRLK+PH+ GSLASS +L + +T +
Sbjct: 123 AKYNRSGADSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL---SVYNYTDFDKQ 179
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
+ + EC ++++ +D G + + F T L N + L
Sbjct: 180 IGE---SAGPECKAALQETTKLVD--GQLQSGRNAVKQLFGAST-LANDGDFLFLLADAA 233
Query: 288 SEAAQYDTPSNIPVKRICNAIENAPNCGDDIL 319
+ A QY P +C+ I A G D++
Sbjct: 234 AIAFQYGNPD-----ALCSPIVEAKKNGTDLV 260
>gi|347970283|ref|XP_313407.5| AGAP003642-PA [Anopheles gambiae str. PEST]
gi|333468860|gb|EAA08815.5| AGAP003642-PA [Anopheles gambiae str. PEST]
Length = 476
Score = 78.2 bits (191), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 70/269 (26%), Positives = 121/269 (44%), Gaps = 22/269 (8%)
Query: 52 ETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFT 111
E F + +DHF ++ TF YV N +++ G PI +G ++ +G
Sbjct: 39 EAFRFRTRVDHFDV--QNRATFEFNYVSNGEYYRPG-GPIFIVVGGNNALNAYFIENGLF 95
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-D 170
++ A + + EHRYYG+S P A N R + + QAL D + H++ +
Sbjct: 96 HDIARRQGGWLFSNEHRYYGRSSPVEDYSA--PNMR---FLSVEQALIDLIEWIDHLRRE 150
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
+A VI G YGG +A W R ++P ++ G+ S+A ++ D + Y + +
Sbjct: 151 VVRDPNAKVILHGLGYGGAVAIWARQRFPSLIDGAYGSTASVIARVD---FAEYGEDMGE 207
Query: 231 DFRDTS-EECFQTIKKSWSEIDNIASKPDGL-AILSKKFKTCTPLKNTTELKD-----GL 283
R ++C+ + + + +N+ GL LS+ F+TC PL+ L GL
Sbjct: 208 TIRTLGHDDCYGIVWRGFRTAENLIDA--GLYGRLSEMFRTCVPLRADDPLTIETFFYGL 265
Query: 284 DTVYSEAAQYDTPSNIPVKRICNAIENAP 312
+ + EA + S V R+C + P
Sbjct: 266 KSSF-EAEMFGQASPDSVTRMCAELLADP 293
>gi|195569681|ref|XP_002102837.1| GD19287 [Drosophila simulans]
gi|194198764|gb|EDX12340.1| GD19287 [Drosophila simulans]
Length = 530
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 89/189 (47%), Gaps = 16/189 (8%)
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYN 173
A + K ++V EHRYYG+SVP + + Y + QALAD A + K +
Sbjct: 2 AQEHKGVLVYTEHRYYGQSVP-----TSTMSTDDLKYLDVKQALADVAVFIETFKAENPQ 56
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
++ VI G SY + WF+ YP +++G ASSAP+L D +T Y VV + F
Sbjct: 57 LANSKVILAGGSYSATMVVWFKRLYPDLIVGGWASSAPLLAKVD---FTEYKEVVGQAFL 113
Query: 234 DT-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELK-----DGLDTVY 287
++C+ I+ +E++++ + G A + C + +L + ++
Sbjct: 114 QLGGQKCYDRIENGIAELESMFANKRG-AEARAMLRLCNSFDDQNDLDLWTLFSSISNIF 172
Query: 288 SEAAQYDTP 296
+ AQY P
Sbjct: 173 AGVAQYQGP 181
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 80/351 (22%), Positives = 140/351 (39%), Gaps = 35/351 (9%)
Query: 156 QALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYY 215
QALAD +++ +K + + V+ G SY +ATW R YP ++ GS ASSAP+L
Sbjct: 195 QALADVINVIATLKQEDKYKDSKVVVSGCSYSATMATWIRKLYPEIIRGSWASSAPLLAK 254
Query: 216 GDITPWTTYHSVVSKDFRDTSEE-CFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLK 274
+ Y VV + + + C+ I + S +N+ +G + K+ C+
Sbjct: 255 ---VNFKDYMKVVGESYATLGGQYCYDLIDNATSYYENLFEIGNGTQAV-KELNLCSNFN 310
Query: 275 NTTE-----LKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEA 329
+E + + +++ AQY P + C+ + + L K +
Sbjct: 311 VNSEQDRWQIFSTIANIFAGIAQYQKPEKYDIPTYCSILREFSDDDSVALSKFINWKINE 370
Query: 330 DS---LEYDGNNSRCYINEDRTG-DESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN--- 382
S L + Y + +SD W +Q+CSE G ++S + +P+
Sbjct: 371 HSGACLSTTFKGAVGYYEWSKENYQDSDLPWIFQTCSE----FGWFQSSGSRSQPFGSTF 426
Query: 383 -LTKYIKNCKEQYGVSPRPSWV-LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
T Y C+ +G + + +D L + +NI F G D +S+ G
Sbjct: 427 PATLYEDTCEGVFGSKYDSAGIHANIRATNDDFGGLNVNATNIYFVQGALDGWSKVG--- 483
Query: 441 TYHFFDLSFSQDLNL---GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
+Q + SHC D SD LV +K +K++ W+
Sbjct: 484 ------AGVAQGATIIPYASHCPDTGSISASDSAELVASKKKLIKLVAQWL 528
>gi|347970281|ref|XP_001230929.2| AGAP003641-PA [Anopheles gambiae str. PEST]
gi|333468859|gb|EAU76915.2| AGAP003641-PA [Anopheles gambiae str. PEST]
Length = 328
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 65/259 (25%), Positives = 121/259 (46%), Gaps = 18/259 (6%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+ ++HF P++ TF Y+ N +++ G P+ +G P++ + + A
Sbjct: 65 FTSRVNHFD--PQNRDTFEFNYLHNDQYYRQG-GPLFIVVGGHYPVNPYFMENSHFRDVA 121
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS-ILLHIKDKYNA 174
A + EHRY+G+S P + + +N R + + Q L D I ++
Sbjct: 122 ALEGAWLATNEHRYFGESYP--TEDLSTENLR---FMRTEQVLFDLIEWIDFLRREVMGD 176
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
+A VI G YGG LATW R ++P+++ G+ SSAP+ T + + V R+
Sbjct: 177 PNARVILHGVGYGGSLATWARQRFPNIIDGAWGSSAPVRA---TTNFEEFAVEVGNIIRE 233
Query: 235 T-SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL---DTVYS-E 289
S++C+ I +++ +N+ ++S+ F TC P+ L+ L ++S E
Sbjct: 234 RGSDQCYNRIFQAFHTAENLIDA-GRTEMISEMFNTCDPVDTDNPLEVELFFFAMMFSLE 292
Query: 290 AAQYDTPSNIPVKRICNAI 308
AA + + R+C+A+
Sbjct: 293 AAMVEDYDIENIGRVCDAL 311
>gi|393238402|gb|EJD45939.1| peptidase S28 [Auricularia delicata TFB-10046 SS5]
Length = 535
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 109/432 (25%), Positives = 174/432 (40%), Gaps = 81/432 (18%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--S 108
F ++ Q +DHF TF QRY ++ + + G P++ E +D + +
Sbjct: 46 FPARWFRQPLDHFDRA--KRDTFLQRYWVSDRHYLPGG-PVIVLDCGETNGEDRLPFLDT 102
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
G A L V+LEHRYYG SVP + L R + N+ QA AD A+ + ++
Sbjct: 103 GIVDILAKATHGLGVVLEHRYYGSSVPVLN----LTTDSLR-WLNNKQAAADSATFMANV 157
Query: 169 K-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+ D A P I G SY G A R+ YP +V G++ASSA + + I W
Sbjct: 158 RFEGIDDDLTAPGTPWIYYGGSYAGGRAAHMRILYPDLVFGAIASSAAV--HASIVYW-E 214
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG- 282
Y V+ + + C + ++ S ID + P +L + FK L++ D
Sbjct: 215 YFEVIRQ---NAPAGCMRRLEGSIDIIDRVLQVP----VLRRPFKRLFGLEDLEHDDDFA 267
Query: 283 ---LDTVYSEAAQYDTP--SNIPVKRICNAIE----NAP-------------NCGDDILC 320
LD + A+ P S+ C AIE NAP N + I
Sbjct: 268 SVLLDPLRGWQARNWDPATSSTQFDEFCAAIEGDNKNAPIRLGPFELDVGVLNYAEYIKT 327
Query: 321 KIAAGVVEADSLE-----YDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSM 375
+AA E ++E YD + ++ + +E W +Q C+E + + N
Sbjct: 328 LVAALCPEGKTVEECFGTYDDSQY-----QNVSLEEYWRAWIFQVCTEWGLFLTSPPNPA 382
Query: 376 YQPEPWNLTKYI------KNCK------EQYGVSPRP--SWVLTYYGGHDIKLILRRSTS 421
P +++ I + C+ E + V P P + GG DI +
Sbjct: 383 ---RPRIISRLITLPYATRICRQSYPPGEHFTVPPLPDVDGAVNSLGGFDI------AAD 433
Query: 422 NIIFSNGMRDPF 433
+ F +G DP+
Sbjct: 434 RLAFVDGTADPW 445
>gi|95768136|gb|ABF57331.1| protease, serine, 16 [Bos taurus]
Length = 230
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 77/153 (50%), Gaps = 12/153 (7%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN 114
+ Q +D F+ ++F QRY +N + W + P+ LG E + + G
Sbjct: 59 WLEQPLDPFNA--SDRRSFLQRYWVNDQHWTSQDGPVFLHLGGEGSLGPGSVMRGHPANL 116
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN- 173
A + AL++ LEHR+YG S+P L A+ R + +S ALAD AS L + +N
Sbjct: 117 APIWGALVISLEHRFYGLSIP----AEGLDMAQLR-FLSSRHALADAASARLTLSRLFNV 171
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSL 206
++ +P I G SY G LA W RLK V+G L
Sbjct: 172 SSTSPWICFGGSYAGSLAAWARLK----VLGPL 200
>gi|158291876|ref|XP_313404.4| AGAP003639-PA [Anopheles gambiae str. PEST]
gi|157017498|gb|EAA08831.4| AGAP003639-PA [Anopheles gambiae str. PEST]
Length = 507
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 71/287 (24%), Positives = 133/287 (46%), Gaps = 29/287 (10%)
Query: 29 LRPRLGRIRRS-RILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGG 87
+RP G + +S R +E + + ++HF P++ TF Y+ N +++ G
Sbjct: 47 IRPPKGYVSQSPRTVEGR----------FTSRVNHFD--PQNRDTFEFNYLHNDQYYRQG 94
Query: 88 NSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNAR 147
P+ +G P++ + + A A + EHRY+G+S P + + +N R
Sbjct: 95 -GPLFIVVGGHYPVNPYFMENSHFRDVAALEGAWLATNEHRYFGESYP--TEDLSTENLR 151
Query: 148 HRGYFNSAQALADYASILLHIKDKYNAT-HAPVIAIGASYGGELATWFRLKYPHVVIGSL 206
+ + Q L D + +K + +A VI G YGG LATW R ++P+++ G+
Sbjct: 152 ---FMRTEQVLFDLIEWIDFLKREVMGDPNARVILHGVGYGGSLATWARQRFPNIIDGAW 208
Query: 207 ASSAPILYYGDITPWTTYHSVVSKDFRDT-SEECFQTIKKSWSEIDNIASKPDGLAILSK 265
SSAP+ T + + V R+ S++C+ I +++ +N+ ++S+
Sbjct: 209 GSSAPVRA---TTNFEEFAVEVGNIIRERGSDQCYNRIFQAFHTAENLIDA-GRTEMISE 264
Query: 266 KFKTCTPLKNTTELKDGL---DTVYS-EAAQYDTPSNIPVKRICNAI 308
F TC P+ L+ L ++S EAA + + R+C+A+
Sbjct: 265 MFNTCDPVDTDNPLEVELFFFAMMFSLEAAMVEDYDIENIGRVCDAL 311
>gi|346326225|gb|EGX95821.1| serine peptidase, putative [Cordyceps militaris CM01]
Length = 565
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 77/266 (28%), Positives = 117/266 (43%), Gaps = 26/266 (9%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNI-QLSGFTYEN 114
++Q IDH S TF QRY N +F+ G +PI+ EA D + + +T +
Sbjct: 54 FDQLIDHDD---PSLGTFKQRYWANPEFYAGPGAPIVLSGPGEAAADPYVWYTTNYTLDG 110
Query: 115 --AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
A + + ++LEHRY+G+S PF K KN R+ N+ + +A + D
Sbjct: 111 VFAQEIRGASIVLEHRYWGESSPFD--KMTTKNMRYLTLDNAMRDTIYFARNVKLPFDPA 168
Query: 173 NATH---APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVS 229
++H AP + G SY G LA W YP A S + D W Y + +
Sbjct: 169 QSSHPDRAPWVFTGGSYSGALAAWIEKLYPGTFWAYHAVSGVVQPIADY--WQYYVPIEA 226
Query: 230 KDFRDTSEECFQTIKKSWSEIDNIASKPD--GLAILSKKFKTCTPLKNTT---ELKDGLD 284
R+ C + +K + ID D G L KF L++ L DGL
Sbjct: 227 AMPRN----CSRDMKLAIRHIDRRLGSSDAAGKEALKAKFGLAG-LEDDDFGWALTDGLQ 281
Query: 285 TVYSEAAQYDT-PSNIPVKRICNAIE 309
+ + Q++T P NIP+ +C+ IE
Sbjct: 282 SW--QGTQFNTGPGNIPLYEMCDYIE 305
>gi|170045817|ref|XP_001850490.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868718|gb|EDS32101.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 487
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 92/404 (22%), Positives = 174/404 (43%), Gaps = 47/404 (11%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYV-INSKFWGGGNSPILAFLGAEAPID-DNIQLSGFTY 112
++ ++HF++ + + + RY+ + + GG PIL L P+ D I S
Sbjct: 29 YFTTRLNHFNH--QQREDWTLRYLSVTEHYRPGG--PILIRLSGNGPVRRDMINESSLIT 84
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DK 171
E A + + E R+YG S P N + + Q +AD ++++K D
Sbjct: 85 ELAREMGGAVYAFETRFYGMSKPTND-----VNTEIMRFLKTDQIMADLVEFIIYLKRDV 139
Query: 172 YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKD 231
+ + PV+ GA YGG LATWFR++YPH+ + +S Y+ + ++ + S+
Sbjct: 140 FRDENMPVLVSGAGYGGALATWFRVRYPHMGDAAWSSGG---YHEAVLDFSDFAESWSET 196
Query: 232 FRD-TSEECFQTIKKSWSEIDNIASKPDGLA-ILSKKFKTCTPLKNTTELKDGLDTVYSE 289
D S++C+ + +++ + N+ G+ +L +K CT + + +D L Y
Sbjct: 197 LIDYGSQQCYNELFVAFNVMQNLIDI--GMTDMLYEKLNICTEI----DPEDSLQVQYFF 250
Query: 290 AAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDG---NNSRCYINED 346
+ T + R N + + C D++L AA VEA + ++ + C + +
Sbjct: 251 STLM-TAVEMHTLRNRNLTDFSTFC-DELLSTEAATAVEAFTTWFNSRFTEDEGCIVTDL 308
Query: 347 RTGDES--------------DEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTK--YIKNC 390
+T ++ W +Q C+E + NS YQP +T + + C
Sbjct: 309 QTVVDAFRQTDWDSPFVANGARQWMYQQCAEF--GWFQTTNSAYQPFGNRVTVELHAEIC 366
Query: 391 KEQYGVSPRPSWVLTYYGGHDIKL-ILRRSTSNIIFSNGMRDPF 433
+ +G +L + + L +T I F++G DP+
Sbjct: 367 RMVFGEWATEETILQAAERTNNRFAALTPNTRRIHFTSGAEDPW 410
>gi|367030635|ref|XP_003664601.1| hypothetical protein MYCTH_2307580 [Myceliophthora thermophila ATCC
42464]
gi|347011871|gb|AEO59356.1| hypothetical protein MYCTH_2307580 [Myceliophthora thermophila ATCC
42464]
Length = 544
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 126/545 (23%), Positives = 211/545 (38%), Gaps = 113/545 (20%)
Query: 27 LKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG 86
+ + P + + + ++ ++ +G+ TF +Q IDH + S TF QRY +++W G
Sbjct: 27 MDIGPDDDTVSETELQKRAETVNGWGTF--DQLIDHAN---PSLGTFKQRYWYGTEYWKG 81
Query: 87 GNSPILAFLGAEAPIDD-------NIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSR 139
SPI E + +LSG A+Q +++LEHRY+G+S P+
Sbjct: 82 PGSPIYLVSPGEQSGEGFNRTWLTTARLSGVM---ANQTGGAVIVLEHRYWGQSSPY--D 136
Query: 140 KAALKNARHRGYFNSAQALADYASILLHIKD---KYNATHAPVIAIGASYGGELATWFRL 196
+ ++N R+ NS + L +A D +A AP I G SY G LA W
Sbjct: 137 ELTVENLRYLTLDNSLKDLVYFAKNFAPPFDPSGSSSADKAPWIFAGGSYSGALAGWLAA 196
Query: 197 KYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
+ P +SS + GD W + V T + C + + +D +
Sbjct: 197 REPGTFWAYYSSSGVVEAIGDF--WQYFVPVQEA----TPKNCSADVGAVINYVDLV--- 247
Query: 257 PDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQ-------------YDTPSNIPVKR 303
L+ SKK K LK+ L + D ++ A + ++T P R
Sbjct: 248 ---LSFGSKKSKQA--LKDKFGLGELEDADFAAALEWGPWEWQSGQFYSFNTTGYNPFYR 302
Query: 304 ICNAIENA-PNCGDDILCKIAAGVVE--------------------ADSLEYDG-NNSRC 341
C+ +EN PN + + G+ + A E++G NN C
Sbjct: 303 FCDYVENVWPNSTNKVPGPRGVGLTKALEGYAKYVKEQVIPGFCESAGYPEWEGENNIAC 362
Query: 342 YINEDRTG---------DESDEGWEWQSCSEMVVPMGKDKNSMYQP----EPWNLTKYIK 388
+ N D + + W W C+E +D + +P N + K
Sbjct: 363 FKNLDPNNAAYKDLSLDNWINRQWNWMLCNE-PFEWWQDGAPLTRPTLVSRLVNADYWRK 421
Query: 389 NC-----KEQYGVSP--RPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP--------- 432
C + YG++ R V + GG + +T+ + +NG DP
Sbjct: 422 QCPLHFPEGGYGIAAGKRAKDVNRWTGGWSVT-----NTTRAMHTNGQYDPWRDATLSSK 476
Query: 433 FSRGGWVKTYHFFDLSFSQDLNLGSHCLDL---DEAKKSDPDWLVQQRKTEVKIMQGWIT 489
F GG V++ + + G+HC DL + A D L Q E M GW+
Sbjct: 477 FRPGGPVQSTERLPVRLVKG---GTHCSDLYGQNWAVNEDVKKLAQDAADE---MTGWVG 530
Query: 490 QYYDD 494
++Y++
Sbjct: 531 EWYEE 535
>gi|346980149|gb|EGY23601.1| hypothetical protein VDAG_05039 [Verticillium dahliae VdLs.17]
Length = 560
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 64/210 (30%), Positives = 92/210 (43%), Gaps = 20/210 (9%)
Query: 13 LMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHF----SYGPE 68
L VI + A + + + RI ++D+ G F + +DHF Y P
Sbjct: 9 LTVISATAPLAAAFISGTEHVSLTSQPRITLKEDAVLG--PFNLSVPVDHFHNETRYEPH 66
Query: 69 SYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVILE 126
S TFP RY IN K + G P+ E +D + G A L ++LE
Sbjct: 67 SNDTFPLRYWINKKHYRPGG-PVFLLASGEMTGEDRLDYLDHGIIAMFAKATHGLGLVLE 125
Query: 127 HRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK------DKYNATHAPVI 180
HRYYG S P + ++ N R + ++ QALAD A H+ ++ T P I
Sbjct: 126 HRYYGTSFPVAN--VSIPNLR---FLSTEQALADTAFFAEHVTFPDLEHEELGPTDVPWI 180
Query: 181 AIGASYGGELATWFRLKYPHVVIGSLASSA 210
A G SY G A + R YP V G+++SS
Sbjct: 181 AFGGSYAGAFAAFLRKLYPDVFWGAISSSG 210
>gi|348672163|gb|EGZ11983.1| hypothetical protein PHYSODRAFT_355172 [Phytophthora sojae]
Length = 375
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 94/379 (24%), Positives = 149/379 (39%), Gaps = 65/379 (17%)
Query: 151 YFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
Y +S QALAD A + DKY T +A G SY G LA W RLKYP + G++ASSA
Sbjct: 8 YLSSEQALADLAHFHAFVTDKYGLTDEKWVAFGGSYPGNLAAWVRLKYPALFAGTVASSA 67
Query: 211 PILYYGDITPWTTYHSVVSKDFRD-TSEECFQTIKKSWSEIDNI--ASKPDGLAILSKKF 267
P+ D + Y VV R EC+ ++++ +++ + K D + + F
Sbjct: 68 PVHAKTD---FYEYMEVVGDGLRYFGGGECYHEVEQAITQLGQLMDGGKED-RGKVDELF 123
Query: 268 KTCTPLKNTTE--------------------LKDGLDTVYSEAAQYDTPSNIPVKRICNA 307
K C P+ N + + +G+ T+ E Q+ V+++ +
Sbjct: 124 KPCYPMANEFDDSVFETSVMGALQDIAQYNAIHEGVMTL-DEVCQHFAKPGDAVEKLASF 182
Query: 308 IEN--APNCGDDILCKIAAGVVEADSL-EYDGNNSRCYINEDRTGDESDEGWEWQSCSEM 364
IE +C D A G VE S ++DG +S W +Q+C+E
Sbjct: 183 IEKTRVGDCLDSKFEGAANGTVEVLSRDQFDGKSSA-------------RQWVYQTCNE- 228
Query: 365 VVPMGKDKNSMYQPEPWNLTKYIKN-------CKEQYG--VSPRPSWVLTYYGGHDIKLI 415
G + + P++ K + CK YG V+P + YG I++
Sbjct: 229 ---FGYFQTTTSVRSPFHALKAVTEANVGTEICKRVYGMNVAPDVAGANLDYGSLGIEV- 284
Query: 416 LRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGS-HCLDLDEAKKSDPDWLV 474
+ F +G DP+ + S G+ HC D+ + D L
Sbjct: 285 -----ERVTFPSGTIDPWHALAVQNSTKLHSFSAEAVFIEGTAHCADMYYPSERDSPQL- 338
Query: 475 QQRKTEVKIMQGWITQYYD 493
Q ++ GW Q D
Sbjct: 339 QWAHDKIAARVGWYLQDVD 357
>gi|347971076|ref|XP_318472.5| AGAP004015-PA [Anopheles gambiae str. PEST]
gi|333469620|gb|EAA43688.5| AGAP004015-PA [Anopheles gambiae str. PEST]
Length = 497
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 90/406 (22%), Positives = 168/406 (41%), Gaps = 56/406 (13%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+ IDHF+ P++ TF Y N++F+ G PI F+G + G Y+ A
Sbjct: 58 FRTKIDHFN--PQNRDTFEFSYFSNNEFYRPG-GPIFIFVGGNFAMTTYYIEHGLLYDTA 114
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNA- 174
+ A + EHRYYG S P +N R + S QAL D + ++++
Sbjct: 115 ARDGAWLFTNEHRYYGASTPVPDYST--ENLR---FLKSEQALMDLIEWIDYLRNTVVGD 169
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
+A V+ +G Y G LATW R ++P ++ G+ + A +L D + + F
Sbjct: 170 PNAKVVLMGTGYAGALATWARQRFPSIIDGAWGAGATVLASFDFQEHAGDIGEMIRRF-- 227
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGL-AILSKKFKTCTPLKNTTELKDGLDTVYS----- 288
EC+ I ++ + GL ++ TC P++ L ++T++
Sbjct: 228 GGNECYSMIWVAFRTAQYLIDA--GLDQTVTSLLNTCEPIEPGKLLD--VETLFYHLKLA 283
Query: 289 -EAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGV------VEADSLEYDGNNSRC 341
+ A S ++ +C A+ N+ + L +A + + + ++D N
Sbjct: 284 IQEAMLGQQSTAKIRDVCEAMMNSTE--ETALHDLAGWLNVYYANLPCNPFDFDTNMEAA 341
Query: 342 YINEDRTGDESDEGW---EWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSP 398
+ + + + G ++Q+C+E D + + + ++ C+ +G
Sbjct: 342 QVLQPGAPENALLGLRQTQYQACTEFGWFRTTDLDEQPFGDRVTMHFFLSACRALFG--- 398
Query: 399 RPSWV----------LT--YYGGHDIKLILRRSTSNIIFSNGMRDP 432
WV LT +YGG D + ++N++F+NG DP
Sbjct: 399 --EWVTDAVIYEGVRLTNLHYGGQDPR------STNVLFTNGEFDP 436
>gi|400599740|gb|EJP67431.1| serine peptidase, putative [Beauveria bassiana ARSEF 2860]
Length = 549
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 99/389 (25%), Positives = 150/389 (38%), Gaps = 67/389 (17%)
Query: 26 GLKLRPRLGRIRRSRI---LEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSK 82
G +LRP + + L K + E ++Q IDH S TF QRY +N +
Sbjct: 25 GQQLRPSPKVLHDEKYCYGLYPKGTPCPIEAATFDQLIDHDD---PSLGTFKQRYWVNPE 81
Query: 83 FWGGGNSPILAFLGAEAPID------DNIQLSGFTYENAHQFKALIVILEHRYYGKSVPF 136
F+ G +PI+ E D N+ L G T+ A + K ++LEHRY+G+S PF
Sbjct: 82 FYAGPGAPIVLSGPGEQAADRMVWYTTNLTLDG-TF--AQEIKGAAIVLEHRYWGESSPF 138
Query: 137 GSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATH---APVIAIGASYGGELATW 193
K KN RH N+ + +A + D +H AP + G SY G LA W
Sbjct: 139 D--KMTTKNMRHLTLDNAMRDTTYFARNVKLSFDLDGTSHPDKAPWVFTGGSYPGALAAW 196
Query: 194 FRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNI 253
+P A SA + D W Y + + R+ S++ IK +D
Sbjct: 197 INKLHPGTFWAYHAVSAVVQPIADF--WQYYVPIEAAMPRNCSKDMKLAIKY----VDKQ 250
Query: 254 ASKPDGLA--ILSKKFKTCTPLKNTT--ELKDGLDTVYSEAAQYDTPSN-IPVKRICNAI 308
S D + KF + L DGL + + Q+ TP N IP+ +C+ I
Sbjct: 251 LSSGDQARKDAIKAKFGLSGLADDDFGWALADGLQSW--QGTQFTTPPNKIPLFDMCDYI 308
Query: 309 ENA----------PNCGDDILCKIAAGVVE---------------ADSLEYDGNNSRC-- 341
E P+ C AG+ + D + D + C
Sbjct: 309 EGVGGKNGSAIHIPDENGVGRCAAVAGLAKWFRTVRIPGSCQRNWGDYWKNDSQTTECWD 368
Query: 342 -------YINEDRTGDESDEGWEWQSCSE 363
+ + +E + W+W SC+E
Sbjct: 369 THNPNSPFFTDRSVRNEWNLQWQWMSCNE 397
>gi|449704577|gb|EMD44793.1| serine carboxypeptidase (S28) family protein, partial [Entamoeba
histolytica KU27]
Length = 87
Score = 77.0 bits (188), Expect = 2e-11, Method: Composition-based stats.
Identities = 37/88 (42%), Positives = 51/88 (57%), Gaps = 7/88 (7%)
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGA 184
+EHR+YG S P + Y + QAL DY ++ H++++ N PVI +G
Sbjct: 4 VEHRFYGASTPSLEMDKLI-------YCTAEQALMDYVEVISHVQEENNLVGHPVIVLGG 56
Query: 185 SYGGELATWFRLKYPHVVIGSLASSAPI 212
SY G LA W R KYP+VV G+ ASSAP+
Sbjct: 57 SYSGNLAAWMRQKYPNVVEGAWASSAPV 84
>gi|237840317|ref|XP_002369456.1| serine carboxypeptidase S28, putative [Toxoplasma gondii ME49]
gi|211967120|gb|EEB02316.1| serine carboxypeptidase S28, putative [Toxoplasma gondii ME49]
Length = 738
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 66/131 (50%), Gaps = 13/131 (9%)
Query: 90 PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHR 149
PI ++G E P+ G E F A + LEHRYYG S P +++ N +
Sbjct: 274 PIFVYIGGEGPLSSLEVRQGLLAEMGDIFGASLYALEHRYYGDSHP--RPDSSVVNLQ-- 329
Query: 150 GYFNSAQALADYASILLHIKDKYNATH--------APVIAIGASYGGELATWFRLKYPHV 201
+ S QAL D A+ + H+K + H PV+ G SY G LA + R KYP
Sbjct: 330 -WLTSHQALGDLAAFVAHVKQQEAEEHPQDLAPEDVPVVVFGCSYPGSLAAYARAKYPAS 388
Query: 202 VIGSLASSAPI 212
++G+++SS+P+
Sbjct: 389 ILGAVSSSSPV 399
>gi|303314813|ref|XP_003067415.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240107083|gb|EER25270.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320037761|gb|EFW19698.1| serine peptidase, family S28 [Coccidioides posadasii str. Silveira]
Length = 555
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 80/325 (24%), Positives = 133/325 (40%), Gaps = 49/325 (15%)
Query: 77 YVINSKFWGGGNSPILAFLGAEAPIDDN-----IQLSGFTYENAHQFKALIVILEHRYYG 131
+V +SK+ GG P+ F G EA + + F + +F + ++ EHRYYG
Sbjct: 91 WVNDSKYKSGG--PVFLFDGGEANAQRYADFYLVNETSFFVQLLEEFHGMGIVWEHRYYG 148
Query: 132 KSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK------DKYNATHAPVIAIGAS 185
+S PF A H Y N+ QALAD + K D P + IG S
Sbjct: 149 ESNPFPVNLDT--PAEHFQYLNNEQALADIPYFAKNFKRENFPDDDLTPKSTPWVMIGGS 206
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW--TTYHSVVSKDFRDTSEECFQTI 243
Y G A + R +YP + S A+ AP+ D++ + Y +V+ + + C + +
Sbjct: 207 YPGMRAAFTRDQYPETIFASFAACAPVQAQIDMSVYYEQVYRGLVAYGYGN----CTKDV 262
Query: 244 KKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT-ELKDGLDTVYSEAAQYDTPSNIPVK 302
+ ++ ID+ + + A + K F T NT + L +++ A + V
Sbjct: 263 RAAYKYIDSKLRRGESAAEIKKLFLGDTAQNNTNGDFTQAL--IWTWATWQSQGPDGGVG 320
Query: 303 RICNAIENAPNC--------------GDDILCKIAA--GVVEADSLEYDGN--------N 338
+ CN +E P ++ + AA G+V + ++ N
Sbjct: 321 QFCNWLETDPKTNKTAPAEGWAPTKGAKAVVERFAAWPGLVPRVNAAFETNCKGENPDEP 380
Query: 339 SRCYINEDRTGDESDEGWEWQSCSE 363
+ C + + R D S W WQ CSE
Sbjct: 381 TMCNLGK-RVADPSGIAWTWQYCSE 404
>gi|90077758|dbj|BAE88559.1| unnamed protein product [Macaca fascicularis]
Length = 229
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 77/142 (54%), Gaps = 9/142 (6%)
Query: 348 TGDESDEGWEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTY 406
TG ++ W++Q+C+E+ + + M+ P+ + + C + +GV PRP W+LT
Sbjct: 77 TGPDA-RAWDYQACTEINLTFASNNVTDMFPDLPFTDERRQQYCLDTWGVWPRPDWLLTS 135
Query: 407 YGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAK 466
+ G D+ R+ SNIIFSNG DP++ GG + ++ + + G+H LDL +
Sbjct: 136 FWGGDL-----RAASNIIFSNGNLDPWAGGGIRRNLSASVIAIT--IQGGAHHLDLRASH 188
Query: 467 KSDPDWLVQQRKTEVKIMQGWI 488
DP +V+ RK E ++ W+
Sbjct: 189 PEDPVSVVEARKLEATVIGEWV 210
>gi|221483143|gb|EEE21467.1| serine carboxypeptidase, putative [Toxoplasma gondii GT1]
Length = 738
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 66/131 (50%), Gaps = 13/131 (9%)
Query: 90 PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHR 149
PI ++G E P+ G E F A + LEHRYYG S P +++ N +
Sbjct: 274 PIFVYIGGEGPLSSLEVKQGLLAEMGDIFGASLYALEHRYYGDSHP--RPDSSVVNLQ-- 329
Query: 150 GYFNSAQALADYASILLHIKDKYNATH--------APVIAIGASYGGELATWFRLKYPHV 201
+ S QAL D A+ + H+K + H PV+ G SY G LA + R KYP
Sbjct: 330 -WLTSHQALGDLAAFVAHVKQQEAEEHPQDLAPEDVPVVVFGCSYPGSLAAYARAKYPAS 388
Query: 202 VIGSLASSAPI 212
++G+++SS+P+
Sbjct: 389 ILGAVSSSSPV 399
>gi|221504075|gb|EEE29752.1| serine carboxypeptidase, putative [Toxoplasma gondii VEG]
Length = 738
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/131 (33%), Positives = 66/131 (50%), Gaps = 13/131 (9%)
Query: 90 PILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHR 149
PI ++G E P+ G E F A + LEHRYYG S P +++ N +
Sbjct: 274 PIFVYIGGEGPLSSLEVKQGLLAEMGDIFGASLYALEHRYYGDSHP--RPDSSVVNLQ-- 329
Query: 150 GYFNSAQALADYASILLHIKDKYNATH--------APVIAIGASYGGELATWFRLKYPHV 201
+ S QAL D A+ + H+K + H PV+ G SY G LA + R KYP
Sbjct: 330 -WLTSHQALGDLAAFVAHVKQQEAEEHPQDLAPEDVPVVVFGCSYPGSLAAYARAKYPAS 388
Query: 202 VIGSLASSAPI 212
++G+++SS+P+
Sbjct: 389 ILGAVSSSSPV 399
>gi|299743114|ref|XP_001835550.2| hypothetical protein CC1G_13075 [Coprinopsis cinerea okayama7#130]
gi|298405506|gb|EAU86268.2| hypothetical protein CC1G_13075 [Coprinopsis cinerea okayama7#130]
Length = 564
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 65/268 (24%), Positives = 124/268 (46%), Gaps = 29/268 (10%)
Query: 43 EQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG-GNSPILAFLGAEAPI 101
E K+ N+ F ++ Q +DHF S + QR+ +NS+ + +P++ G E
Sbjct: 66 ESKEKNYDFRAQWFEQPLDHFDN--TSDHRWHQRFWVNSRHYKPRPGAPVIVLDGGETSG 123
Query: 102 DDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKA-ALKNARHRGYFNSAQAL 158
++ + +G A + ++LEHRYYG S+P + +L+ + N+AQ+
Sbjct: 124 EERLPFLDTGIVNILAKATGGIGIVLEHRYYGDSIPVANFSTDSLR------WLNNAQSA 177
Query: 159 ADYASILLHIK-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL 213
AD A+ + ++K + A P I G SY G A ++ YP +V G++ASSA +
Sbjct: 178 ADSANFMRNVKLDSIQEDITAPGTPWIYYGGSYAGARAAHMKIIYPDIVYGAIASSA--V 235
Query: 214 YYGDITPWTTYHSVVSKDFRDTSE-ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP 272
+ + W Y +++ RD ++ +C I S + ID I + + ++ K+
Sbjct: 236 THATLQAW-EYMTII----RDAADPKCSANIVNSIATIDTILQR----GVFKRQLKSLFG 286
Query: 273 LKNTTELKDGLDTVYSEAAQYDTPSNIP 300
L +D + + + + + +P
Sbjct: 287 LGELEHDEDFVSVLETPLGSWQSKCWVP 314
>gi|361132051|gb|EHL03666.1| putative serine protease K12H4.7 [Glarea lozoyensis 74030]
Length = 577
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 116/485 (23%), Positives = 186/485 (38%), Gaps = 90/485 (18%)
Query: 41 ILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAE-A 99
I+ Q D+ + F++Q +DH + S TF Q++ N +FW G SPI+ F E A
Sbjct: 88 IIAQNDA---MGSGFFDQLLDHKN---PSKGTFKQKFWWNIEFWNGPGSPIVMFTPGEIA 141
Query: 100 PIDDNIQLSGFTYEN--AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQA 157
+ L+ T A + K ++++EHR++G+S P+ + N+ Q+
Sbjct: 142 AANYGAYLTNATVIGLYAQEIKGAVIMVEHRFWGESSPYQTL-----NSETLQLLTLEQS 196
Query: 158 LADYA-----SILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
+AD+ + L K NA AP + G SY G LA W P ASSAP+
Sbjct: 197 IADFVYFAKVAPLPFDTKKSNADKAPWVFSGGSYSGALAAWIESTSPGTFWAYHASSAPV 256
Query: 213 LYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTP 272
D + Y S + + + + ++ N S + LA+ KT
Sbjct: 257 QAIDD---YWQYFSPIQQGMPKNCSKDLSLVIDYMDKVWNTGSAAEKLAL-----KTKFG 308
Query: 273 LKNTTELKDGLDTVYSEAAQYDTPSNIPVK------RICNAIENAPNCGDDILCKIAAGV 326
L+ + D + T+ E + SN + C+AIEN + A GV
Sbjct: 309 LQGLAQPADVMATL--EYGPWLWQSNSFTTGYSGFFQFCDAIENV--TAGAAVTPDANGV 364
Query: 327 VEADSLE--------------------YDGNNSRCY--------INEDRT-GDESDEGWE 357
+LE D N C I DR+ G+ D W+
Sbjct: 365 GVQTALEGYAKWTKAKLLPGFCQSYGYTDANTVECLDTYNPSNKIFTDRSVGNAIDLQWQ 424
Query: 358 WQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQ--------------YGVSPRPSWV 403
W C+E P G +N + +P +++ + + Q YG + P
Sbjct: 425 WMLCNE---PFGYWQNGAPRGKPSIVSRLVNSAYWQRQCALFFPTVNGFTYGSAISPDNN 481
Query: 404 LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH---FFDLSFSQDLNL---GS 457
+ H L + T+ +I++NG DP+ G Y + LN+ G
Sbjct: 482 IHQVNKHTQGWRLEK-TTRLIWTNGEFDPWKTSGMSSEYRPGGPLQSTPEHPLNVIPGGF 540
Query: 458 HCLDL 462
HC DL
Sbjct: 541 HCSDL 545
>gi|402083145|gb|EJT78163.1| hypothetical protein GGTG_03265 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 564
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 78/267 (29%), Positives = 116/267 (43%), Gaps = 31/267 (11%)
Query: 60 IDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYE 113
IDHF SY P S + F RY ++ ++ G PI+ LG E D + G +
Sbjct: 65 IDHFHNESSYAPHSDEFFQLRYWFDASYYRDGG-PIIVLLGGETSGADRLPFMEKGILAK 123
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA-----SILLHI 168
A + VILEHRYYG+S P ++ N R + + QALAD A I
Sbjct: 124 LAEATGGVSVILEHRYYGESFPVPD--LSISNLR---FLTTDQALADTAFFARNVIFEGF 178
Query: 169 KDK-YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
+D+ + + P G SY G A + R YP G+++SS L D W Y +
Sbjct: 179 EDRDLTSGNTPYFTYGGSYAGAFAAFLRKLYPEDYWGAISSSGVTLAVADY--WQYYEA- 235
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY 287
+ F +C TI+K +DNI + + S + + L N T L D ++ +
Sbjct: 236 -QRLF--APPDCVVTIQKLTHVVDNIVTGSNKFT-GSARLRDAFGLPNVTYLPDFVNVLA 291
Query: 288 S-----EAAQYDTP-SNIPVKRICNAI 308
+ + +D SN V + CN I
Sbjct: 292 NGIKGWQELNWDLAVSNPDVYQFCNNI 318
>gi|347840247|emb|CCD54819.1| similar to extracelular serine carboxypeptidase (secreted protein)
[Botryotinia fuckeliana]
Length = 530
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 75/287 (26%), Positives = 127/287 (44%), Gaps = 27/287 (9%)
Query: 12 LLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHF----SYGP 67
LL+ V+ + + ++G +R S I + + +Q IDHF +Y P
Sbjct: 5 LLLAGVVTGLISSTSASILTKVGTVRESSISIDRRDTLAYTAHTIDQPIDHFPNDPAYAP 64
Query: 68 ESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD---NIQLSGFTYENAHQFKALIVI 124
+ TF QRY ++K++ G P+ ++G E + N+Q +G L +I
Sbjct: 65 HTNATFKQRYWYDAKYYKPG-GPVYLYIGGETNGQNRFSNLQ-TGIIQILMEATNGLGII 122
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHAPVIAIG 183
LE+RYYG+S PF + + Y + Q +AD A H+ NA+
Sbjct: 123 LENRYYGQSWPFNT-----STTDNLAYLTNQQTVADNAYFAQHVSLPGLNASITAPDTKW 177
Query: 184 ASYGGELA---TWFRLK-YPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEEC 239
YGG LA T +K YP V+ G +A+SAP+ W Y+ + R ++C
Sbjct: 178 ILYGGSLAGGQTALSVKIYPDVLFGGIAASAPVKTVVGYPEW--YNPIQ----RLAPQDC 231
Query: 240 FQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
+I + D + + + AI ++FK+ L+ T+ +D T+
Sbjct: 232 ISSINGIIDKFDALVAVNNTRAI--REFKSLFGLEALTDNRDFAMTI 276
>gi|85090193|ref|XP_958301.1| hypothetical protein NCU09992 [Neurospora crassa OR74A]
gi|28919648|gb|EAA29065.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 547
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 86/329 (26%), Positives = 139/329 (42%), Gaps = 42/329 (12%)
Query: 22 FHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINS 81
F G+++ P I + + G+ TF +Q IDH + PE TF QR+
Sbjct: 22 FKTLGMEIGPIDDHIESLNKRAEVEGTSGYGTF--DQLIDHNT--PE-LGTFKQRFWYGF 76
Query: 82 KFWGGGNSPILAFLGAEAPID-------DNIQLSGFTYENAHQFKALIVILEHRYYGKSV 134
++W G SPI+ E D + +L+G+ A A +VI+EHRY+G S
Sbjct: 77 QYWKGPGSPIILVNPGEQAADGFNKSYLSDQRLAGWM---AKDMGAAVVIMEHRYWGNSS 133
Query: 135 PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT---HAPVIAIGASYGGELA 191
PF + +KN ++ NS + + +A + DK N + +AP I G SY G LA
Sbjct: 134 PF--DELTVKNLQYLTLENSLKDINYFAEHIDLPFDKTNGSKPANAPWIFSGGSYSGALA 191
Query: 192 TWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEID 251
W YP +S + G WT + V+ T + C + + +D
Sbjct: 192 GWLEALYPGTFWAYHGTSGVVETLGHF--WTYFVPVLEA----TPQNCTKDLTAVIDYVD 245
Query: 252 NIA--SKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYS-----EAAQYDTPSNI---PV 301
++ P L KFK L+N T+ D + S ++ Q+ + P
Sbjct: 246 SVLLHGTPKAKRELKSKFK----LQNLTD-ADFASAIESGPWSWQSTQFYSEKITGYNPY 300
Query: 302 KRICNAIENA-PNCGDDILCKIAAGVVEA 329
R C+ +EN PN + + + G+ +A
Sbjct: 301 YRFCDYVENVWPNSTNKVPGPLGVGIKKA 329
>gi|392585005|gb|EIW74346.1| peptidase S28 [Coniophora puteana RWD-64-598 SS2]
Length = 537
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 75/304 (24%), Positives = 129/304 (42%), Gaps = 39/304 (12%)
Query: 24 ANGLKLRPRLGRIRRSRILEQKDSNHGFETFF-YNQTIDHFSYGPESYQTFPQRYVINSK 82
A G + PRL +R +I++Q+ + T + ++Q IDH + + TF QRY +++
Sbjct: 30 AKGRPVIPRLAPPQR-QIVDQQGAPVNLSTVYTFDQLIDHAN---PALGTFKQRYWTSNE 85
Query: 83 FWGGGNSPILAFLGAEAPIDD------NIQLSGFTYENAHQFKALIVILEHRYYGKSVPF 136
++ G P++ E D N+ ++G A Q +V++EHR++G+S P+
Sbjct: 86 YYKTG-GPVVLMTPGETNADGYESMLTNVSVNGLI---AQQNNGAVVVIEHRFFGQSNPY 141
Query: 137 GSRKAALKNARHRGYFNSAQALADYASILLHIK------DKYNATHAPVIAIGASYGGEL 190
G+ A+ Y AQA+ D A + D P + G SY G L
Sbjct: 142 GNL-----TAQSLRYLTIAQAIDDLAHFAQTVDLPWAGGDAVKPDKTPWVLTGGSYAGAL 196
Query: 191 ATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEI 250
+W +K P V +SS G + T Y++ + + C ++ +
Sbjct: 197 TSWTMVKKPDVFYAGWSSS------GVVEAITDYYAYFTPILEHMPKNCSADVQAVVGYL 250
Query: 251 DNIASKPDGLAILSKKFKTCTPLKNTTELKD-GLDTVYSEA----AQYDTPSNIPVKRIC 305
D + S + I + + L N T D L Y+ A Q D+ +N R C
Sbjct: 251 DQLNSTSNATGI--QTMQDTFGLGNLTHADDFALALTYNLADWQELQPDSGANQTFYRFC 308
Query: 306 NAIE 309
+A+E
Sbjct: 309 DALE 312
>gi|119175285|ref|XP_001239900.1| hypothetical protein CIMG_09521 [Coccidioides immitis RS]
gi|392870094|gb|EAS28654.2| serine peptidase, family S28 [Coccidioides immitis RS]
Length = 555
Score = 76.3 bits (186), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 79/325 (24%), Positives = 133/325 (40%), Gaps = 49/325 (15%)
Query: 77 YVINSKFWGGGNSPILAFLGAEAPIDDN-----IQLSGFTYENAHQFKALIVILEHRYYG 131
+V +SK+ GG P+ F G EA + + F + +F + ++ EHRYYG
Sbjct: 91 WVNDSKYKSGG--PVFLFDGGEANAQRYADFYLVNETSFFVQLLEEFHGMGIVWEHRYYG 148
Query: 132 KSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK------DKYNATHAPVIAIGAS 185
+S PF A H Y N+ QALAD + K D P + IG S
Sbjct: 149 ESNPFPVNLDT--PAEHFQYLNNEQALADIPYFAKNFKRENFPDDDLTPKSTPWVMIGGS 206
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW--TTYHSVVSKDFRDTSEECFQTI 243
Y G A + R +YP + S A+ AP+ D++ + Y +V+ + + C + +
Sbjct: 207 YPGMRAAFTRDQYPETIFASFAACAPVQAQVDMSVYYEQVYRGLVAYGYGN----CTKDV 262
Query: 244 KKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT-ELKDGLDTVYSEAAQYDTPSNIPVK 302
+ ++ +D+ + + A + K F T NT + L +++ A + V
Sbjct: 263 RAAYKYMDSKLRRGESAAEIKKLFLGDTAQNNTNGDFTQAL--IWTWATWQSQGPDGGVG 320
Query: 303 RICNAIENAPNC--------------GDDILCKIAA--GVVEADSLEYDGN--------N 338
+ CN +E P ++ + AA G+V + ++ N
Sbjct: 321 QFCNWLETDPKTNKTAPAEGWAPTKGAKAVVERFAAWPGLVPRVNAAFETNCKGENPDEP 380
Query: 339 SRCYINEDRTGDESDEGWEWQSCSE 363
+ C + + R D S W WQ CSE
Sbjct: 381 TMCNLGK-RVADPSGIAWTWQYCSE 404
>gi|358054024|dbj|GAA99823.1| hypothetical protein E5Q_06526 [Mixia osmundae IAM 14324]
Length = 1068
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 120/503 (23%), Positives = 197/503 (39%), Gaps = 105/503 (20%)
Query: 57 NQTIDHFSYGPE---SYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL----SG 109
+ I+HF P+ + +TF RY +N+ + G + +L + E DD
Sbjct: 598 DMPINHFPGDPKYQPTNETFKLRYFVNADHYKPGGA-VLIWNAGEGSADDQTAAIFSNRT 656
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA-----SI 164
F Y ++ ++LEHRYYGKS+P S + Y QALAD+ +
Sbjct: 657 FIYNLTQSTNSVGIVLEHRYYGKSIPMPSF-----STDDLQYLTVEQALADWEYFAKNAE 711
Query: 165 LLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTY 224
L + AP+I +GASY G LA W + YP G +ASSA + I + Y
Sbjct: 712 LPTLPQLITQNKAPLIYLGASYSGALAAWQSVVYPTTFWGYIASSAVTV---SILDFAAY 768
Query: 225 HSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPL-KNTTELKDGL 283
+ V +DF +C + + ID + + K ++ L ++ E D +
Sbjct: 769 VNPV-RDF--APRDCVANLSAALDLIDTTSES------MRKPLQSIFGLPQDQLEFVDFV 819
Query: 284 DTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCK--------IAAGVVEADS--LE 333
+ + ++ + S +R +A +A + GD + +AA A S L
Sbjct: 820 NVLAEGPLEWQSSSG--NQREFSAFCDAISTGDGVQASDTLLEESGLAAASEPARSAVLR 877
Query: 334 Y-------------DGNNSRCYINEDRTG------DESDEGWEWQSCSEMV------VPM 368
Y DGN ++C+ + T D+ W WQ+CS+M VP
Sbjct: 878 YASYITNRTLTDCEDGNLNKCFGTNNATDLQLADLDQDWRLWAWQTCSQMAFFMTGNVPT 937
Query: 369 GKDKNSMYQPEPWNLTKYIKNCKEQYGVS--------PRPSWVLTYYGGHDIKLILRRST 420
G+ + + +L Y + C+ + P + VL YGG+ I+
Sbjct: 938 GE---AAIMSKHIDLAYYKRVCQTTFPPGTFNKLPEVPDMASVLK-YGGYGIR------H 987
Query: 421 SNIIFSNGMRDPFSRGGWVKTYHFFD-----LSFSQDLNL----GSHCLDLDEAKKSDPD 471
+ F +G DP W D +S + + L G H D + P+
Sbjct: 988 PRLAFVDGTEDP-----WRPCTPHADAAPPRISTTDEPYLLVQGGVHGADF--GGRYGPN 1040
Query: 472 WLV---QQRKTEVKIMQGWITQY 491
+ R TE+ I+Q WI +
Sbjct: 1041 QTATNNEARATELSILQSWIADW 1063
>gi|212536138|ref|XP_002148225.1| extracelular serine carboxypeptidase, putative [Talaromyces
marneffei ATCC 18224]
gi|210070624|gb|EEA24714.1| extracelular serine carboxypeptidase, putative [Talaromyces
marneffei ATCC 18224]
Length = 561
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 61/229 (26%), Positives = 99/229 (43%), Gaps = 24/229 (10%)
Query: 51 FETFFYNQTIDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQ 106
F + + IDHF Y P S TF RY ++ + G P+ E +D +
Sbjct: 49 FHAYNLSVPIDHFHNESRYEPHSSDTFALRYWFDASHYKEGG-PVFVLESGETSGEDRLP 107
Query: 107 L--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
G + A + V+LEHRYYG S+P ++ + ++ R + + QALAD A
Sbjct: 108 YLQKGLISQLAQLTNGIAVVLEHRYYGTSIP--TKDFSTESLR---FLTTEQALADVAYF 162
Query: 165 LLHI------KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDI 218
+I + P IA G SY G + + R+ YP V G++ASSA D
Sbjct: 163 AQNIIYPGLEDQSLTSNFVPYIAYGGSYAGAMVAFLRVTYPDVFFGAIASSAVTEAIVDY 222
Query: 219 TPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKF 267
W + + R+ ++C +T++ +D++ S + L F
Sbjct: 223 --WQYWEPIR----RNAPQDCVRTVENLTGVLDSLTSNTAAIQDLETLF 265
>gi|158291878|ref|XP_001688441.1| AGAP003640-PA [Anopheles gambiae str. PEST]
gi|157017499|gb|EDO64163.1| AGAP003640-PA [Anopheles gambiae str. PEST]
Length = 505
Score = 75.9 bits (185), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 74/272 (27%), Positives = 129/272 (47%), Gaps = 44/272 (16%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+ I+HF P++ TF ++ N +++ G P+ +G + + F +
Sbjct: 63 FTSRINHFD--PQNRDTFEFNFLWNDEYYRPG-GPLFIVVGGHH------RTNPFFIDET 113
Query: 116 HQFK-------ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYAS-ILLH 167
H FK A + EHRY+G SVP + + N R + + Q L D I
Sbjct: 114 H-FKDIAALQGAFLATNEHRYFGTSVP--TEDLSSDNLR---FLRTEQTLFDLIEWIDFL 167
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
++ +A VI G SYGG LA+W R ++P+++ G+ SSA + + +T
Sbjct: 168 RREVMRDPNAKVILHGFSYGGALASWARQRFPNIIDGAWVSSATVRATVNFEEFT----- 222
Query: 228 VSKDFRDT-----SEECFQTIKKSWSEIDNI--ASKPDGLAILSKKFKTCTPL--KNT-- 276
+DF +T S+EC+ +I +++ +N+ A + D I+S F TC P+ +N+
Sbjct: 223 --EDFGNTIRIKGSDECYNSIFRAFHTAENLLDAGRTD---IVSSMFNTCDPIDAENSLQ 277
Query: 277 TELKDGLDTVYSEAAQYDTPSNIPVKRICNAI 308
EL L T+ E + +D + V+R+CN +
Sbjct: 278 VELFLHLMTLSLELSMFDDFNIENVQRVCNVL 309
>gi|154303088|ref|XP_001551952.1| hypothetical protein BC1G_09564 [Botryotinia fuckeliana B05.10]
gi|347839320|emb|CCD53892.1| similar to serine peptidase (secreted protein) [Botryotinia
fuckeliana]
Length = 544
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 110/502 (21%), Positives = 189/502 (37%), Gaps = 90/502 (17%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD------NIQLS 108
F+ Q +DH + S TF Q++ NS+FW G SPI+ F E + N+ ++
Sbjct: 54 FFTQLLDHDN---PSKGTFQQKFWWNSEFWAGPGSPIVFFTPGEIAAANYGAYLTNVTVT 110
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
G A + K +V++EHR++G+S P+ + QA+AD+ +
Sbjct: 111 GLF---AQEIKGAVVMVEHRFWGESSPYDNLTTTNLQLLT-----LKQAIADFVHFAKTV 162
Query: 169 KDKYNATH------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
+++ H AP I G SY G L+ W P ASSAP+ D W
Sbjct: 163 DLPFDSNHSSNAASAPWINSGGSYSGALSAWTESTSPGTFWAYHASSAPVQAIDDY--WQ 220
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG 282
++ V ++ S++ I +DN+ + + A+ + K K + E D
Sbjct: 221 YFYPVQDGMPKNCSKDVSLVIDY----MDNVLTHGNKSAVTALKTKFGL---ESVEHNDD 273
Query: 283 LDTVYSEAAQYDTPSNIPVK-----RICNAIENA-------PNCGDDILCKIAAGVVE-- 328
V ++ + C+AIEN P+ L G +
Sbjct: 274 FMAVLENGPWLWQSNSFSTGYSGFYQFCDAIENVTAGAAVTPDANGVGLTTALEGYAKWT 333
Query: 329 -------ADSLEYDGNNSRCYINED---------RTGDESDEGWEWQSCSEMVVPMGKDK 372
+ YD ++ C D G+ D W W C+E P G +
Sbjct: 334 KSYIPGYCEGFGYDADDLSCLNTHDFNNLMFRDYSVGNAIDRQWNWMLCNE---PFGYWQ 390
Query: 373 NSMYQPEPWNLTKYIKNCKEQYGVS----PRPSWVLTYYGGHDIKLILR-------RSTS 421
+ + P +++ + Q + ++ G +K + + +T+
Sbjct: 391 DGAPKNRPTIVSRLVDANYWQRQCALFFPTEGNYTYASAKGATVKRVNKVTKGWDLENTT 450
Query: 422 NIIFSNGMRDP---------FSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDW 472
+I++NG DP F GG +K+ + Q + G HC DL K +
Sbjct: 451 RLIWTNGQYDPWRTSGVSSQFRPGGELKSTAKHPV---QIIPGGFHCSDL-RLKNGQVNA 506
Query: 473 LVQQR-KTEVKIMQGWITQYYD 493
VQ+ EV + W +YY+
Sbjct: 507 GVQKVIDNEVAQIVAWTAEYYN 528
>gi|4406810|gb|AAD20118.1| unknown protein [Arabidopsis thaliana]
Length = 365
Score = 75.5 bits (184), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 86/337 (25%), Positives = 137/337 (40%), Gaps = 51/337 (15%)
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
P G SY G L+ WFRLK+PH+ GSLASSA + + + + + E
Sbjct: 47 PWFFFGISYSGALSAWFRLKFPHLTCGSLASSAVVRAIYEFSEFD----------QQIGE 96
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY--SEAA---- 291
Q K + E + + GL + +K K+ L N TEL D +Y ++AA
Sbjct: 97 SAGQECKLALQETNKLLEL--GLKVKNKAVKS---LFNATELDVDADFLYLTADAAVMAF 151
Query: 292 QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEAD----SLEYDGNNSRCYINEDR 347
QY P ++C + A G D++ + V E L N + N
Sbjct: 152 QYGNPD-----KLCVPLVEAKKNGSDLVVTYSTYVREYCMRIWGLRVRTYNRKHLRNTVV 206
Query: 348 TGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG--VSPRPSWVLT 405
T D + W +Q+C+E+ K + N T ++ CK +G V P+
Sbjct: 207 TADSAYRLWWFQACTELGYFQVAPKYDSVRSHQINTTFHLDLCKSLFGKDVYPKVDATNL 266
Query: 406 YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEA 465
YYGG R + + IIF+NG DP+ + H + N G H D+
Sbjct: 267 YYGGD------RLAATKIIFTNGSEDPWRHASKQNSTHEMPSYIIKCRNCG-HGSDIRGC 319
Query: 466 KKSD------------PDWLVQQRKTEVKIMQGWITQ 490
+S PD++ + R+ V+ + W+++
Sbjct: 320 PQSPMVIEGKSNNCSLPDYVNKVRQQMVEHIDLWLSE 356
>gi|401882439|gb|EJT46697.1| hypothetical protein A1Q1_04662 [Trichosporon asahii var. asahii
CBS 2479]
Length = 674
Score = 75.1 bits (183), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 107/439 (24%), Positives = 173/439 (39%), Gaps = 75/439 (17%)
Query: 43 EQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLG-AEAPI 101
E++D G F ++Q I HF + TF QRY I++K W G + IL G +EA
Sbjct: 174 EEEDPVKG--PFCFDQKISHFDNSQQG--TFCQRYWISTKEWKAGGAVILHDAGESEASG 229
Query: 102 DDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
G + L ++LEHRYYGKS P S + N R + N ++L D
Sbjct: 230 STYYMKKGLLHHLMAATHGLGIVLEHRYYGKSTPLDS--FSTDNMR---FLNLKESLEDS 284
Query: 162 ASILLHIK------------DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASS 209
A+ + + K D + + P I G SY G A + R +YP +V G++A S
Sbjct: 285 ANFIRNFKLPEGVTVEGANADTFKPNNVPWIYQGCSYPGAKAAFMRQQYPDLVFGAVAGS 344
Query: 210 APILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKT 269
A I + Y+ K + + ++C + I+ + ID + F
Sbjct: 345 AVTQA---IDEFPQYYDAFQKYYYN--QDCVKGIQGAIKVIDEWLDDEKKAPAMKSLFGA 399
Query: 270 CT------------PL---KNTTEL-KDGLDTVYSEAAQYDT-PSNIPVKRICNAIENAP 312
T PL K + L K + + Q DT P + + A+ N
Sbjct: 400 ATLKNDDFAYFLRWPLQFQKRSVPLGKASIPDAFCRTLQEDTKPIQVAGAEVPGAVVNFA 459
Query: 313 NCGDDILCKIAAGVVEADSLE--YDGNNSRCYINEDRTGDESDEGWE---WQSCSEMVVP 367
N + + +L YD ++ E R D + W WQ C+E
Sbjct: 460 NATRVWIRNTGSCGRNESTLAACYDTSDGSP-AAEKRKSDTLRDTWRPWIWQQCTEFGYF 518
Query: 368 MGKDKNSMYQPEPWNLTKYI------KNCKE------QYGVSPRP-SWVLTYYGGHDIKL 414
G K E L+KY+ + C++ +Y + RP + + +GG +I +
Sbjct: 519 FGPAK------EGGVLSKYLTYDVHHRVCRQSFPAGSKYQIPERPDTEKVNCHGGREINV 572
Query: 415 ILRRSTSNIIFSNGMRDPF 433
++F+ G +DP+
Sbjct: 573 ------ERVLFTAGEQDPW 585
>gi|393233661|gb|EJD41230.1| peptidase S28 [Auricularia delicata TFB-10046 SS5]
Length = 497
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 102/224 (45%), Gaps = 21/224 (9%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--S 108
F ++ Q +DHF + TF QRY +N + + G P++ G E ++ + +
Sbjct: 8 FPARWFRQPLDHFDR--KRRDTFLQRYWVNDRHYRSGG-PVIVLDGGETSGENRLPFLDT 64
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
G A L V+LEHRYYG+S+P + L R + ++ Q+ AD A+ + ++
Sbjct: 65 GIVDILAKATHGLGVVLEHRYYGRSIPVLN----LTTDSLR-WLDNKQSAADSATFMANV 119
Query: 169 K-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
K + A P I G SY G A R+ YP + G++ASSA + + I W
Sbjct: 120 KFEGISEDLTAPGTPWIYYGGSYAGARAAHMRVLYPDLTFGAIASSA--VTHAAIVYW-E 176
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKF 267
Y+ V+ + C +++S ID + P L K F
Sbjct: 177 YYEVIRQ---SAPTGCIARLERSIDIIDRVLQVPVLRRPLKKLF 217
>gi|310790227|gb|EFQ25760.1| serine carboxypeptidase S28 [Glomerella graminicola M1.001]
Length = 565
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 70/250 (28%), Positives = 104/250 (41%), Gaps = 26/250 (10%)
Query: 51 FETFFYNQTIDHFS----YGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQ 106
++ + ++ IDHF Y P S TFP RY + +F+ G P++A E + +
Sbjct: 70 YQAYNFSVPIDHFHNDSIYEPHSNGTFPLRYWYDDRFYKPG-GPVIALAAGETSGNGRLP 128
Query: 107 L--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
G A + +ILEHRYYG+S P + + KN R + + QALAD A
Sbjct: 129 FLQKGIVAILAEATNGVGIILEHRYYGRSYP--TPDFSTKNLR---FLTTDQALADTAYF 183
Query: 165 LLHI-------KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGD 217
H+ A P I G SY G + R YP V G+++SS D
Sbjct: 184 AQHVVFPGKLATLNLTAPGTPWIMYGGSYAGAFVAFLRKVYPDVFWGAISSSGVTAAVVD 243
Query: 218 ITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNI-ASKPDGLAILSKKFKTCTPLKNT 276
W Y + E C +T +K +DNI K + + +T L+N
Sbjct: 244 F--WEYYEAARLY----APEGCAETTQKLTHVVDNILLQKGNATEDDVRTLQTAFGLQNV 297
Query: 277 TELKDGLDTV 286
T+ D T+
Sbjct: 298 TKKVDFASTI 307
>gi|409040075|gb|EKM49563.1| hypothetical protein PHACADRAFT_265104 [Phanerochaete carnosa
HHB-10118-sp]
Length = 561
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/217 (28%), Positives = 111/217 (51%), Gaps = 24/217 (11%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWG-GGNSPILAFLGAEAPIDDNIQL-- 107
FE ++ Q ++HFS E T+ QRY IN++ + G ++P++ G E ++ +
Sbjct: 64 FEPHWFRQPLNHFSNNSE---TWLQRYWINTRHYKPGTHAPVIVIDGGETSGENRLPFLD 120
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
+G + + VILEHRY+G+S+P + + R + N+ QA AD A+ + +
Sbjct: 121 TGIADILPKEIGGIGVILEHRYHGESLPV--QNFTTDSLR---FLNNDQAAADSANFMAN 175
Query: 168 IK-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
+K + + P I G SY G + ++ YP +V G++ASSA + + + W
Sbjct: 176 VKFPGVDEDITSPGNPWIYYGGSYAGARSAHMKMLYPELVYGAIASSA--VTHASLENW- 232
Query: 223 TYHSVVSKDFRDTSE-ECFQTIKKSWSEIDNIASKPD 258
Y +V RD ++ +C Q ++ + ID+I ++ D
Sbjct: 233 EYMEIV----RDAADPKCSQHLENAIETIDSILTRGD 265
>gi|426192499|gb|EKV42435.1| hypothetical protein AGABI2DRAFT_195776 [Agaricus bisporus var.
bisporus H97]
Length = 561
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 130/546 (23%), Positives = 207/546 (37%), Gaps = 98/546 (17%)
Query: 11 LLLMVIFVSTSF-HANGLKLRPRLGRIRR--------SRILEQKDSNHGF------ETFF 55
+LL+ F++ A+G RP L IR S+ L S +G +T++
Sbjct: 5 VLLVASFLAIDVAQADG---RPHLNMIRPPGMPLMSDSQPLGPVTSRNGTVLPPFNQTYW 61
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLG---AEAPIDDNIQLSGFTY 112
++Q IDH + S TF QR+ +F+ G +L G A+ D + +
Sbjct: 62 FDQLIDHNN---PSRGTFKQRFWHTWQFYEPGGPVLLMTPGEVNADGYADSYLSTKAISG 118
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKA-ALKNARHRGYFNSAQALADYASILLHI--- 168
+ A Q +VI+EHR+YG S P KA +LK Y QA+ D + ++
Sbjct: 119 QIAQQQNGSVVIIEHRFYGLSNPINDLKAESLK------YHTIQQAIEDLEYFIKNVILP 172
Query: 169 ---KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
DK AP + G SY G L +W + P + ASSA + + W +
Sbjct: 173 QPDGDKLTPDKAPWVLFGGSYSGALTSWTMVNKPDLFAAGYASSA--VVEAILNFWRYFE 230
Query: 226 SVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL-- 283
+ C ++ + S +D + S + AI ++ K L T L D
Sbjct: 231 PIREH----MPANCSADVQVAISHVDEVFSGDNKTAI--QEMKDLFGLGEMTHLDDVAGA 284
Query: 284 --DTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGN---- 337
+ ++ + T N C+A+E D + A G +L+ GN
Sbjct: 285 LRNNLWDWQSLQPTSQNSLFYEFCDALEV-----KDGVSASAEGWGLEHALQAWGNFWTE 339
Query: 338 ----NSRCYINE----------------DRTGDESDEGWEWQSCSEM-----VVPMGKDK 372
C N D + D + W W C+E+ P G
Sbjct: 340 GGYLQRLCRNNSPEACLGTYDPNQPFYTDTSIDNNLRSWMWIVCNEVGYLQEGAPKGHSS 399
Query: 373 --NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT--YYGGHDIKLILRRSTSNIIFSNG 428
+ QP P++L + E + P+ T Y G D+++ NI F+NG
Sbjct: 400 LVTRLVQP-PYDLRQCQLMFPEAFEKPPKVDIKGTNKAYKGWDVRI------ENIFFANG 452
Query: 429 MRDPFSRGGWVKTYHFFDLSFSQDLNLGS--HCLDLDEAKK-SDPDWLVQQRKTEVKIMQ 485
MRDP+ + Q + LG HC DL A +DP L Q+ + M+
Sbjct: 453 MRDPWREATMSAEGLDVKSTPEQPIGLGDGFHCSDLSTASGLADPTILAVQQAA-LASMK 511
Query: 486 GWITQY 491
W+ +
Sbjct: 512 IWLADW 517
>gi|344268139|ref|XP_003405920.1| PREDICTED: hypothetical protein LOC100665381 [Loxodonta africana]
Length = 431
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/124 (31%), Positives = 61/124 (49%), Gaps = 3/124 (2%)
Query: 151 YFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
Y +S QALAD + I +K T +A G SYGG LA W R+K+P + ++ SSA
Sbjct: 203 YLSSRQALADIVNFRTQIAEKMGLTRNKWVAFGCSYGGSLAVWSRIKHPDLFAAAVGSSA 262
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTC 270
PI + + Y VV + + CFQ +K+ + ++ + P + L F C
Sbjct: 263 PIQAKAN---FYEYLEVVQRSLATHNSTCFQAVKEVFGQVVKMLKLPKYYSKLENDFTLC 319
Query: 271 TPLK 274
P++
Sbjct: 320 KPMR 323
Score = 43.5 bits (101), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 40/144 (27%), Positives = 64/144 (44%), Gaps = 28/144 (19%)
Query: 358 WQSCSEMVVPMGKD-KNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT-------YYGG 409
+Q C+E D KN + P L+ +++ C + +G + + T YYGG
Sbjct: 65 YQCCTEFGFFHTTDSKNQPFTGLP--LSYFVQQCSDFFGPNFNYDSLNTGVMSTNEYYGG 122
Query: 410 HDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDL-----NLGSHCLDLDE 464
++ + S IIF NG DP+ G K + S+DL HC D+ E
Sbjct: 123 FNV------TGSKIIFPNGSFDPWHPLGITK-------NISEDLLAVFIEGAVHCADVYE 169
Query: 465 AKKSDPDWLVQQRKTEVKIMQGWI 488
K +D L+Q R+ +I+Q W+
Sbjct: 170 QKDTDSAELIQAREKIFQILQKWL 193
>gi|409079550|gb|EKM79911.1| hypothetical protein AGABI1DRAFT_113157 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 561
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 130/546 (23%), Positives = 207/546 (37%), Gaps = 98/546 (17%)
Query: 11 LLLMVIFVSTSF-HANGLKLRPRLGRIRR--------SRILEQKDSNHGF------ETFF 55
+LL+ F++ A+G RP L IR S+ L S +G +T++
Sbjct: 5 VLLVASFLAIDVAQADG---RPHLNMIRPPGMPLMSDSQPLGPVTSRNGTVLPPFNQTYW 61
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLG---AEAPIDDNIQLSGFTY 112
++Q IDH + S TF QR+ +F+ G +L G A+ D + +
Sbjct: 62 FDQLIDHNN---PSRGTFKQRFWHTWQFYEPGGPVLLMTPGEVNADGYADSYLSTKAISG 118
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKA-ALKNARHRGYFNSAQALADYASILLHI--- 168
+ A Q +VI+EHR+YG S P KA +LK Y QA+ D + ++
Sbjct: 119 QIAQQQNGSVVIIEHRFYGLSNPINDLKAESLK------YHTIQQAIEDLEYFIKNVILP 172
Query: 169 ---KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYH 225
D+ AP + G SY G L +W + P + ASSA + + W +
Sbjct: 173 QPDGDRLTPDKAPWVLFGGSYSGALTSWTMVNKPDLFAAGYASSA--VVEAILNFWRYFE 230
Query: 226 SVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL-- 283
+ C ++ + S ID + S + AI ++ K L T L D
Sbjct: 231 PIREH----MPANCSADVQVAISHIDEVFSGDNKTAI--QEMKDLFGLGEMTHLDDVAGA 284
Query: 284 --DTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGN---- 337
+ ++ + T N C+A+E D + A G +L+ GN
Sbjct: 285 LRNNLWDWQSLQPTSQNSLFYEFCDALEV-----KDGVSASAEGWGLEHALQAWGNFWTE 339
Query: 338 ----NSRCYINE----------------DRTGDESDEGWEWQSCSEM-----VVPMGKDK 372
C N D + D + W W C+E+ P G
Sbjct: 340 GGYLQRLCRNNSPEACLGTYDPNQPFYTDTSIDNNLRSWMWIVCNEVGYLQEGAPKGHPS 399
Query: 373 --NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLT--YYGGHDIKLILRRSTSNIIFSNG 428
+ QP P++L + E + P+ T Y G D+++ NI F+NG
Sbjct: 400 LVTRLVQP-PYDLRQCQLMFPEAFEKPPKVDIKGTNKAYKGWDVRI------ENIFFANG 452
Query: 429 MRDPFSRGGWVKTYHFFDLSFSQDLNLGS--HCLDLDEAKK-SDPDWLVQQRKTEVKIMQ 485
MRDP+ + Q + LG HC DL A +DP L Q+ + M+
Sbjct: 453 MRDPWREATMSAEGLDVKSTPEQPIGLGDGFHCSDLSTASGLADPTILAVQQAA-LASMK 511
Query: 486 GWITQY 491
W+ +
Sbjct: 512 RWLADW 517
>gi|395331113|gb|EJF63495.1| peptidase S28 [Dichomitus squalens LYAD-421 SS1]
Length = 574
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 63/227 (27%), Positives = 110/227 (48%), Gaps = 24/227 (10%)
Query: 44 QKDSNHG--FETFFYNQTIDHFSYGPESY---QTFPQRYVINSKFW-GGGNSPILAFLGA 97
Q +S G F ++ Q +DHF+ G + +T+ QRY +N++ + G ++P+ G
Sbjct: 74 QAESKWGGVFPERWFEQPLDHFAEGKGAQAETETWRQRYWVNTRHYVPGPDAPVFVIDGG 133
Query: 98 EAPIDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSA 155
E +D + +G A + V+LEHRYYG+S P + + R + N+A
Sbjct: 134 ETSGEDRLGFLDTGIADILARATGGVGVVLEHRYYGESRPV--KNLTTDSLR---FLNNA 188
Query: 156 QALADYASILLHIK-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
Q+ AD A+ + ++K + A + P I G SY G A ++ YP +V G++ASS
Sbjct: 189 QSAADSANFMANVKFPGIDEDLTAPNHPWIYYGGSYAGARAAHMKVLYPDLVWGAVASSG 248
Query: 211 PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKP 257
+ + I W Y +V R +C I+ + +D++ + P
Sbjct: 249 --VTHASIENW-EYMEIVR---RAADLKCSGNIQLTVEIVDSLLANP 289
>gi|336258109|ref|XP_003343875.1| hypothetical protein SMAC_09286 [Sordaria macrospora k-hell]
gi|380087075|emb|CCC05489.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 547
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 128/557 (22%), Positives = 212/557 (38%), Gaps = 111/557 (19%)
Query: 22 FHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINS 81
F G+++ P +I + + G+ TF +Q IDH + PE TF QR+
Sbjct: 22 FKTVGMEIGPIDDQIETLNKRAEVEGTSGYGTF--DQLIDHDT--PE-LGTFKQRFWYGF 76
Query: 82 KFWGGGNSPILAFLGAEAPIDD-------NIQLSGFTYENAHQFKALIVILEHRYYGKSV 134
+ W G SPI+ E D + +L+G+ A A +VI+EHRY+G+S
Sbjct: 77 QHWKGPGSPIILVNPGEQAADGFNKSYLTDQRLAGWM---AKDIGAAVVIMEHRYWGESS 133
Query: 135 PFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT---HAPVIAIGASYGGELA 191
P+ + + N ++ NS + + +A + D+ N + +AP I G SY G LA
Sbjct: 134 PY--DQLTVNNLQYLTLENSLKDINYFAEHIELPFDETNGSKPANAPWIFTGGSYSGALA 191
Query: 192 TWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEID 251
W YP +S + G WT + V T + C + + +D
Sbjct: 192 GWLEALYPGTFWAYHGTSGVVESVGHF--WTYFVPVQEA----TPQNCSKDLSAVIDYVD 245
Query: 252 NIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQY-------------DTPSN 298
+I L +KK K LK +L+D D ++ A + T
Sbjct: 246 SI------LLFGTKKAK--HDLKAKFKLQDLTDQDFASAIESGPWTWQSTQFYSEKTTGY 297
Query: 299 IPVKRICNAIENA-PNCGDDILCKIAAGVVEA-----------------DSLEYDG---- 336
P R C+ +EN PN + + + G+ +A +S+ Y G
Sbjct: 298 TPYYRFCDYVENVWPNSTNKVPGPLGVGIKKALDGYAKWFTEESLPGSCESIGYAGFEGT 357
Query: 337 NNSRCYINEDRT---------GDESDEGWEWQSCSEMV------VPMGKDK--NSMYQPE 379
+N C+ N++ + G+ + W W C+E P+G+ + + +
Sbjct: 358 DNVLCFQNQNASNPIFHDLSVGNPYNRQWNWFLCNEPFEWWQDGAPLGRPSIVSRLVDAD 417
Query: 380 PWNLTKYI-----KNCKEQYGV--SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
W + K K YG+ R V + GG + + I+ +NG DP
Sbjct: 418 YWRKQCPLWFPVEKGSKVSYGIKQGKRAQDVNKWTGGWK-----HTNGTRIMQANGSLDP 472
Query: 433 ---------FSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKI 483
F GG K + + G+HC D S + + + + EV
Sbjct: 473 WRDATLSSKFRPGGPFKGNKNHQVRV---IEGGTHCSDFYGPNWSANEGVKKIAEEEVAQ 529
Query: 484 MQGWITQYYDDFKAINK 500
M W+ +Y K I K
Sbjct: 530 MGEWVKDFY-KVKGITK 545
>gi|242805460|ref|XP_002484534.1| serine peptidase, putative [Talaromyces stipitatus ATCC 10500]
gi|218715159|gb|EED14581.1| serine peptidase, putative [Talaromyces stipitatus ATCC 10500]
Length = 535
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 119/544 (21%), Positives = 209/544 (38%), Gaps = 96/544 (17%)
Query: 12 LLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQ 71
+++ I + S+ + + +RP + R S + + T +++Q +DH PE
Sbjct: 13 IILFIAICASWISLTVAVRPNIPRPPASAAPAAFE----YSTGWFDQLLDHDK--PE-LG 65
Query: 72 TFPQRYVINSKFWGGGNSPILAFLGAEAPID------DNIQLSGFTYENAHQFKALIVIL 125
TF QRY ++++W G SP++ F E D N+ +SG A +F +IL
Sbjct: 66 TFRQRYFYSTQYWKGSGSPVILFQPGEQTADGFQGYLTNVTISGVY---AQEFGGAGIIL 122
Query: 126 EHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT------HAPV 179
EHRY+G+S P + K +H + N ALAD ++K ++ + +AP
Sbjct: 123 EHRYWGESSPVNTLTP--KTMQHLTFKN---ALADAVHFAKNVKLPFDNSTRSSPKNAPW 177
Query: 180 IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEEC 239
I +G SY G A W P ASSAP+ + W + + + ++ S +
Sbjct: 178 ILVGGSYSGAQAGWTAATLPGTFWAYHASSAPVEAIWNY--WQYFVPIQQRLPKNCSTDL 235
Query: 240 FQTIKKSWSEIDNIASKPDGLAI--LSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPS 297
I ID+I + + A L +KF L++ + Y PS
Sbjct: 236 VNVID----HIDSILTGSNESAKDDLKRKF-MLGDLRDDDFAAAIVGGPYLGQTTSWGPS 290
Query: 298 NIPVKRICNAIENAPNCGDDILCKIAAGVVEA---------------------------D 330
+ + C+ IEN + GV +A +
Sbjct: 291 GV-IYEFCDYIENVHATPPANVSSSGVGVTKALEGYAQWWTTTFFPGTCASYGYWTDQYE 349
Query: 331 SLEYDGNNSRCYINEDRT-GDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTK---- 385
+ YD NS + DR+ + +D W W C+E P G ++ P +++
Sbjct: 350 TACYDTYNSSNPLYADRSINNVADRQWIWFCCNE---PFGAWQDGAPDGVPSIVSRLVTA 406
Query: 386 --YIKNCKEQY----------GVSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPF 433
Y++ C + RP ++ ++ G +T +I++ G DP+
Sbjct: 407 DYYLRTCGTYFQPDDGYTYASADGKRPDFLNSWTQG------WTGTTERVIWAQGQYDPW 460
Query: 434 SRGGWVKTYHFFDLSFSQDLNL------GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGW 487
+ + S +N SHC DL + +K EV+ M+ W
Sbjct: 461 REETISSDFRPGGPAISSQVNPIYIMPEASHCYDLLWRNGEGNSGVRDVQKKEVEQMKAW 520
Query: 488 ITQY 491
I ++
Sbjct: 521 IDEW 524
>gi|170045815|ref|XP_001850489.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868717|gb|EDS32100.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 490
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/225 (27%), Positives = 101/225 (44%), Gaps = 23/225 (10%)
Query: 67 PESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD-NIQLSGFTYENAHQFKALIVIL 125
P+ + + RY ++F+ G PIL +LG API + I S Y+ A Q I
Sbjct: 44 PQRNERWTMRYFSVTEFYEAGG-PILIWLGGNAPIQEYMIDESSLLYDLARQMNGAIFAF 102
Query: 126 EHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATH---APVIAI 182
E R+YG++ R ++ Q + D A + +++ N H APV+
Sbjct: 103 ESRFYGQN-----RATEDVTVESLYLLSTYQIMGDLAEFVTYLRR--NVVHDEDAPVLVS 155
Query: 183 GASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD-TSEECFQ 241
GA YGG LATWFR+ YPH+ + +S + + ++ Y + + + C+
Sbjct: 156 GAGYGGALATWFRVHYPHLADAAWSSGGT---HKAVMSFSEYAEAWGQTLINYGGQACYN 212
Query: 242 TIKKSWSEIDNIASKPDGL-AILSKKFKTCTPLKNTTELKDGLDT 285
I ++ + + GL IL +KF CT + L+D L+
Sbjct: 213 EIFVAFHVMQYLIDM--GLEEILFEKFNLCTEIN----LQDNLEV 251
>gi|343960995|dbj|BAK62087.1| dipeptidyl-peptidase 2 precursor [Pan troglodytes]
Length = 486
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 72/134 (53%), Gaps = 8/134 (5%)
Query: 356 WEWQSCSEMVVPMGKDK-NSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKL 414
W++Q+C+E+ + + M+ P+ + C + +GV PRP W+LT + G D+
Sbjct: 341 WDYQACTEINLTFASNNVTDMFPDLPFTDELRQRYCLDTWGVWPRPDWLLTSFWGGDL-- 398
Query: 415 ILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLV 474
R+ SNIIFSNG DP++ GG + ++ + + G+H LDL + DP +V
Sbjct: 399 ---RAASNIIFSNGNLDPWAGGGIRRNLSASVIAVT--IQGGAHHLDLRASHPEDPASVV 453
Query: 475 QQRKTEVKIMQGWI 488
+ RK E I+ W+
Sbjct: 454 EARKLEATIIGEWV 467
Score = 56.2 bits (134), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/90 (30%), Positives = 51/90 (56%), Gaps = 1/90 (1%)
Query: 190 LATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSE 249
L+ + R+KYPH+V G+LA+SAP+L + + V+ DF S +C Q +++++ +
Sbjct: 2 LSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTADFEGQSPKCTQGVREAFRQ 61
Query: 250 IDNIASKPDGLAILSKKFKTCTPLKNTTEL 279
I ++ + + +F TC PL + +L
Sbjct: 62 IKDLFLQ-GAYDTVRWEFGTCQPLSDEKDL 90
>gi|409079210|gb|EKM79572.1| hypothetical protein AGABI1DRAFT_114101 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 552
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 87/315 (27%), Positives = 135/315 (42%), Gaps = 47/315 (14%)
Query: 30 RPRLGRIRRSRILEQKDSNHGFETF---FYNQTIDHFSYGPESYQTFPQRYVINSKFWGG 86
R R+ RR LE S FE F ++ Q +DHF + TF QRY ++ + +
Sbjct: 42 RERVASSRRDLPLEI--SARAFEEFEPQWFEQPLDHFDES--NPHTFKQRYWVSKRHYKA 97
Query: 87 -GNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAAL 143
+P+ G E + + +G A + L VILEHRYYG+S+ A+
Sbjct: 98 RQGAPVFVLDGGETSGANRLPFLDTGIVDILARATEGLGVILEHRYYGESI-------AV 150
Query: 144 KNARHRG--YFNSAQALADYASILLHIK-----DKYNATHAPVIAIGASYGGELATWFRL 196
+N + N+ QA AD A+ + +K + A P I G SY G + ++
Sbjct: 151 ENLTTDALRWLNNEQAAADSANFMAKVKFEGIEEDLTAPSTPWIYYGGSYAGARSAHMKI 210
Query: 197 KYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
YP +V G++ASSA + + + W Y ++ K +C ++ S ID+I
Sbjct: 211 LYPDLVYGAIASSA--VTHAAVENW-QYMEIIRK---AADPKCSHHLENSIKIIDSI--- 261
Query: 257 PDGLAILSKKFKTCTP----LKNTTELKDGLDTVYSEAAQYDT------PSNIPVKRICN 306
L SK FK+ L N T D T+ S + + S++ + C
Sbjct: 262 ---LLGHSKFFKSHLKGLFGLANLTHDDDFASTISSPLGSWQSKNWDPEESSVRFDQFCE 318
Query: 307 AIENAPNCGDDILCK 321
A+ NAP G I K
Sbjct: 319 AL-NAPVHGHGISAK 332
>gi|449294816|gb|EMC90840.1| hypothetical protein BAUCODRAFT_99497 [Baudoinia compniacensis UAMH
10762]
Length = 497
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 79/174 (45%), Gaps = 14/174 (8%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPI---DDNIQLSGFTY 112
+ Q IDH S TF Q Y +++FWGG SP++ F E + + ++ T
Sbjct: 22 FTQLIDHTD---PSVGTFEQFYYYSTEFWGGPGSPVILFTPGEINVTGYQSYLTINRTTG 78
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI--LLHIKD 170
A + A +++LEHRY+G S PF +A N + S L +A+ L +
Sbjct: 79 VLAEKIGAAVIVLEHRYWGTSTPFTVLSSA--NLTYLTLNQSIYDLTHFANTVRLPFAQH 136
Query: 171 KYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL----YYGDITP 220
NA P + +G SY G LA W P LASSAP+ YYG P
Sbjct: 137 GSNAKQVPWVFMGGSYSGALAAWTESVAPGTFWAYLASSAPVEAISDYYGYFLP 190
>gi|146322738|ref|XP_749261.2| serine peptidase [Aspergillus fumigatus Af293]
gi|129556778|gb|EAL87223.2| serine peptidase, putative [Aspergillus fumigatus Af293]
Length = 525
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 84/201 (41%), Gaps = 25/201 (12%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN- 114
+ Q IDH + G TFPQRY N +FW G SP+L F E+ D GF
Sbjct: 45 FEQYIDHNNPG---LGTFPQRYWYNPEFWAGPGSPVLLFTPGES---DAADYDGFLTNKT 98
Query: 115 -----AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
A + +++LEHRY+G S P+ Y Q++AD +
Sbjct: 99 IVGRFAEEIGGAVILLEHRYWGASSPYPELTTETLQ-----YLTLEQSIADLVHFAKTVN 153
Query: 170 DKY------NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+ NA +AP + G SY G LA W P ASSAP+ D W
Sbjct: 154 LPFDEIHSSNADNAPWVMTGGSYSGALAAWTASIAPGTFWAYHASSAPVQAIYDF--WQY 211
Query: 224 YHSVVSKDFRDTSEECFQTIK 244
+ VV ++ S++ + ++
Sbjct: 212 FVPVVEGMPKNCSKDLNRVVE 232
>gi|159128675|gb|EDP53789.1| serine peptidase, putative [Aspergillus fumigatus A1163]
Length = 525
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/201 (28%), Positives = 84/201 (41%), Gaps = 25/201 (12%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN- 114
+ Q IDH + G TFPQRY N +FW G SP+L F E+ D GF
Sbjct: 45 FEQYIDHNNPG---LGTFPQRYWYNPEFWAGPGSPVLLFTPGES---DAADYDGFLTNKT 98
Query: 115 -----AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
A + +++LEHRY+G S P+ Y Q++AD +
Sbjct: 99 IVGRFAEEIGGAVILLEHRYWGASSPYPELTTETLQ-----YLTLEQSIADLVHFAKTVN 153
Query: 170 DKY------NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+ NA +AP + G SY G LA W P ASSAP+ D W
Sbjct: 154 LPFDENHSSNADNAPWVMTGGSYSGALAAWTASIAPGTFWAYHASSAPVQAIYDF--WQY 211
Query: 224 YHSVVSKDFRDTSEECFQTIK 244
+ VV ++ S++ + ++
Sbjct: 212 FVPVVEGMPKNCSKDLNRVVE 232
>gi|429862127|gb|ELA36786.1| serine peptidase, putative [Colletotrichum gloeosporioides Nara
gc5]
Length = 335
Score = 73.9 bits (180), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 68/234 (29%), Positives = 95/234 (40%), Gaps = 25/234 (10%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD---NIQLSGFTY 112
+ Q IDH + F QRY N+K WGG PI EA ++ ++ Y
Sbjct: 49 FTQLIDHTD---PTKGCFEQRYWHNAKIWGGPGYPIFMVNVGEADAEEYAWHLINQRLAY 105
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY 172
+ + +V+ EHRYYGK P + A Y Q+L D +K +
Sbjct: 106 LYGEKLQGAVVLFEHRYYGKPQPLKTMTAETLQ-----YHTVPQSLQDNKRFAQTVKFGF 160
Query: 173 ------NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHS 226
N +P + IG SY G L W + P V ASSA I GD W +
Sbjct: 161 DNCGSANVDKSPWVLIGGSYAGALPAWQSVITPGVFAAHHASSAVIHAIGDF--WQFWTP 218
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPD--GLAILSKKFKTCTPLKNTTE 278
V R+ SE IKK +D+I S+ D +A + K+F + N E
Sbjct: 219 VEQAMPRNFSEGVKLVIKK----VDSILSRGDMQEIAAIKKEFGYVSEFYNHME 268
>gi|342878892|gb|EGU80177.1| hypothetical protein FOXB_09306 [Fusarium oxysporum Fo5176]
Length = 537
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 57/163 (34%), Positives = 79/163 (48%), Gaps = 18/163 (11%)
Query: 60 IDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYE 113
+DHF Y P S + FP RY +++ + G P++ E +D I G
Sbjct: 46 VDHFHNETKYEPHSDKKFPLRYWFDAQHYREGG-PVIILASGETSGEDRIPFLEHGILKM 104
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK---- 169
A+ + VILEHRYYG S P K KN R + ++ QALAD A H+K
Sbjct: 105 LANATGGVGVILEHRYYGTSFPVPDLKT--KNLR---FLSTEQALADTAYFAEHVKFPGL 159
Query: 170 DKYN--ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
+K+N A++ P I G SY G A + R YP V G ++SS
Sbjct: 160 EKHNLTASNTPYIIYGGSYAGAFAAFARKIYPEVFWGGISSSG 202
>gi|341889106|gb|EGT45041.1| hypothetical protein CAEBREN_04354 [Caenorhabditis brenneri]
Length = 668
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 108/435 (24%), Positives = 174/435 (40%), Gaps = 81/435 (18%)
Query: 115 AHQFKALIVILEHRYYGKS--VPFGSRK-AALKNARHRGYFNSAQALADYASILLHIKDK 171
A +F A +EHR+YG P G + +++K QALAD + +
Sbjct: 28 AAEFGAAAFQVEHRFYGSKDYSPIGDQTPSSMK------LLTIDQALADIKEFITQMNAL 81
Query: 172 YNATHAPV-IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSK 230
Y P+ + G SY G L+ WFR YP + G+++SS+ + + D + Y K
Sbjct: 82 YFKDDKPIWVTFGGSYPGSLSAWFRETYPEMTAGAVSSSSAVHVFVD---YYGYAINTEK 138
Query: 231 DFRDTSEECFQTIKKSW-SEIDNIASKPDGLAILSKKFKTCTP-------------LKNT 276
+R S+ C TI ++ S I + D +L ++F C +N
Sbjct: 139 TYRTVSDSCANTIGVAFQSMIQKAYAGKDSRILLKQQFNLCDDFDENNLSKSLQFFFQNV 198
Query: 277 TELKDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDG 336
G++ + T S + V C+ + N N GD+I VV+ +L YD
Sbjct: 199 YGYFQGINQYTGDNRNNATRSGLGVPAACDILNNNKN-GDEI-----NRVVQVMNL-YDS 251
Query: 337 -----------NNSRCYIN--EDRTGDESD----EGWEWQSCSEM---VVPMGKDKNSMY 376
NN +I D T + D W WQ+C+E+ G ++
Sbjct: 252 WYPPSATGCRPNNYTSFIQYYSDTTMPDDDRISTRSWIWQTCTELGYYQTTDGGNRGIFG 311
Query: 377 QPEPWNLTKYIKNCKEQYGVSPRPSWVL-----------TYYGGHDIKLILRRSTSNIIF 425
P L + C + +G P + L T YGG D R T N+ F
Sbjct: 312 STVP--LDFFADQCIDLFG----PEYTLDNTFKLIDQVRTKYGGADA----YRGT-NVCF 360
Query: 426 SNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQ 485
NG DP+ G K + + S ++ +HC D+ A+ SD L R V+I
Sbjct: 361 PNGSFDPWQDLG-HKANNTDNNVDSWLIDGTAHCADMYPARDSDKQSLKDAR---VRI-H 415
Query: 486 GWITQYYDDFKAINK 500
G ++++ D +A+ +
Sbjct: 416 GHLSKWLSDAQAVRQ 430
>gi|312066657|ref|XP_003136374.1| hypothetical protein LOAG_00786 [Loa loa]
gi|307768457|gb|EFO27691.1| hypothetical protein LOAG_00786 [Loa loa]
Length = 512
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 78/354 (22%), Positives = 151/354 (42%), Gaps = 49/354 (13%)
Query: 115 AHQFKALIVILEHRYYGKSVP-FGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A Q A I +LEHR+YG S P + +LK Y ++ QA+ D + I +
Sbjct: 126 ADQINASIFMLEHRFYGSSRPTIDTSIQSLK------YLDAKQAVEDIDRFVQEINQREK 179
Query: 174 ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI---LYYGDITPWTTYHSVVSK 230
T+ I G SY G LA W R K+P + ++ASSAP+ L + D + + K
Sbjct: 180 LTNPKWITFGGSYSGNLAAWAREKHPRSIRAAVASSAPLQAKLNFKD------FERQIEK 233
Query: 231 DFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFK----TCTPLKNTTELKDGLDTV 286
+C I+K + ++ +++ +G L K F+ P + ++ + +
Sbjct: 234 IIEKKDTKCVAVIRKLFQKMRQMSTTHEGRRKLVKIFRLDDSLIRPAVSDKDVANFFLVI 293
Query: 287 YSEAAQYDTPSNIPVKR---------ICNAIENAPNCGD--DILCKIAA--GVVEADSLE 333
+ + S I VK +C+ + ++P+ +++ + A G +++
Sbjct: 294 SNYISFIVMHSGINVKDHRDLLTLDVMCSKLTHSPSLESIRELIGMVMASQGKSSHSAID 353
Query: 334 YDGNNSRCYINEDR--TGDESDEGWEWQSCSEM-VVPMGKDKNSMYQPEPWNLTKYIKNC 390
N+ ++ ++R T + W +Q+C E ++ N ++ L+ ++ C
Sbjct: 354 IGYNSFLDFMRDERWNTRNAQPRAWLYQNCHEFGHFRTSEEANGLFGG-TLPLSFFLARC 412
Query: 391 KEQYG-------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGG 437
K+ +G R + Y+GG+ +++I SNG DP++ G
Sbjct: 413 KDVFGGHFSLENTEKRIAETNEYFGGNQ-----HFQATDVILSNGSDDPWTLLG 461
>gi|353242348|emb|CCA74000.1| related to serine protease [Piriformospora indica DSM 11827]
Length = 548
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 67/230 (29%), Positives = 100/230 (43%), Gaps = 32/230 (13%)
Query: 41 ILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP 100
IL + F + Q +DH S TF QRY +N++ + G P++ G E
Sbjct: 54 ILGNQPQYPEFPPQMFIQPLDH---DDPSSPTFEQRYWVNTRHYKKGG-PVIVIDGGETS 109
Query: 101 IDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL 158
+D + +G A L VILEHRYYG+SVP + + R + N+ QAL
Sbjct: 110 GEDRLPFLDTGIADILAKATHGLGVILEHRYYGESVPV--KNLTTDSLR---WLNNYQAL 164
Query: 159 ADYASILLHIKDKYN---------------ATHAPVIAIGASYGGELATWFRLKYPHVVI 203
+D A + H+ N A ++P I G SY G A R+ YP +V
Sbjct: 165 SDSARFMKHVNFSSNLFPSSVSSETISNLKAPNSPWIYYGGSYAGARAAHMRVLYPEIVF 224
Query: 204 GSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNI 253
G++ASSA + + I W Y ++ C I+ S S +D +
Sbjct: 225 GAIASSA--VTHATINNW-EYMDIIRT---AAPPACSALIQASMSHVDAM 268
>gi|171676507|ref|XP_001903206.1| hypothetical protein [Podospora anserina S mat+]
gi|170936320|emb|CAP60978.1| unnamed protein product [Podospora anserina S mat+]
Length = 598
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/206 (29%), Positives = 88/206 (42%), Gaps = 28/206 (13%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN- 114
++Q IDH + S TF QRY ++ +W G SP++ F EA + SGF
Sbjct: 92 FSQLIDHSN---PSLGTFSQRYWWDTTYWDGPGSPVVVFSPGEASAE---YYSGFLTNQT 145
Query: 115 -----AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
A A I+++EHRY+G S PF + Y ++AD+A ++
Sbjct: 146 IVGLYAQAIGAAILLIEHRYWGDSSPFSHLSTV-----NLTYLTLNDSVADFAHFARQVQ 200
Query: 170 ------DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL----YYGDIT 219
+ NA AP I +G SY G LA W P ASSAP+ ++ T
Sbjct: 201 LPFDESGRSNAPKAPWIFVGGSYSGNLAAWLDHLSPGTFWAYHASSAPVQAIRHFWEYFT 260
Query: 220 P-WTTYHSVVSKDFRDTSEECFQTIK 244
P W SKDF + Q ++
Sbjct: 261 PIWEGMPRNCSKDFEKITAHIDQVLE 286
>gi|310795241|gb|EFQ30702.1| serine carboxypeptidase S28 [Glomerella graminicola M1.001]
Length = 488
Score = 73.6 bits (179), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 140/357 (39%), Gaps = 79/357 (22%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN- 114
++Q +DH + S TF QRY N++ WGG P+ G E + +L+G+ EN
Sbjct: 28 FDQFLDHTN---TSKGTFKQRYWWNAEHWGGPGFPVFMVNGGET---NAGRLTGY-LENG 80
Query: 115 ------AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALAD------YA 162
A K I+++EHRYYG+S PF + A Y + QA+ D A
Sbjct: 81 TLASLYAETHKGAIILIEHRYYGESWPFKTSTADTLQ-----YLDVPQAIRDNIHFAQTA 135
Query: 163 SILLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
+ NA +P + +G SY G LA W + P ASSA + D W
Sbjct: 136 DLPFDTNKGANANTSPWVLVGGSYAGALAAWTSVVAPGTFAAYHASSAVVQAIEDF--WQ 193
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDG 282
+ + + + C IK ++D + + + K K G
Sbjct: 194 FFTPIE----QAMPQNCSTDIKLVIQQVDRVLDNGTAQEVDAMK------------QKFG 237
Query: 283 LDTVYSEAA------------QYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGV-VEA 329
LDT+ A YD PS V C+ IE + N G+ I + +GV +EA
Sbjct: 238 LDTLADHADFAWYLQKPIIMWAYD-PSK--VLEFCDWIETSTNDGEVIEGREESGVGLEA 294
Query: 330 -----DSLEYDGNNSRCYINED----------RTGDESD--EGWEWQSCSEMVVPMG 369
S + N +C E T D+ D W WQ C+E P+G
Sbjct: 295 AWTGYTSWMHKRYNEQCREEESCNLYGNAVGYNTPDDLDWSRSWVWQLCNE---PLG 348
>gi|440482790|gb|ELQ63249.1| hypothetical protein OOW_P131scaffold00998g31 [Magnaporthe oryzae
P131]
Length = 497
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 93/225 (41%), Gaps = 31/225 (13%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID------DNIQLSG 109
++Q +DH + S TF QRY ++ W G SP+ F E D DN L G
Sbjct: 21 FDQLLDHHN---PSKGTFKQRYFWDASSWAGPGSPVFLFNPGEDAADGYVGYLDNHTLPG 77
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
A F+ ++++EHRY+GKS+PF + A Y + Q++ D ++
Sbjct: 78 LY---ADTFQGAVIVIEHRYWGKSIPFD-----ILTAETLQYLDVPQSIMDMTHFAKTVQ 129
Query: 170 DKY--------NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
+ NA AP + IG SY G LA W + P V A+SA I D
Sbjct: 130 LSFDSSGDGGANAEKAPWVLIGGSYSGALAAWTQKLSPGVFWAYHATSAVIEAVHD---- 185
Query: 222 TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKK 266
+H+ + C ++ + +D + + A+ K
Sbjct: 186 --FHTYFAPIEAALPRNCSADVRAVVAHVDRVLDSRNSTAVRRLK 228
>gi|328854272|gb|EGG03405.1| extracellular serine carboxypeptidase [Melampsora larici-populina
98AG31]
Length = 542
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 116/489 (23%), Positives = 184/489 (37%), Gaps = 96/489 (19%)
Query: 68 ESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVIL 125
ES TF QRY ++ ++ G P+ G E D + G + + +IL
Sbjct: 84 ESQSTFAQRYWFDTTYYQKG-GPVFLLDGGETNGQDRLPYLQDGILSILSKATHGIGIIL 142
Query: 126 EHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI------KDKYNATHAPV 179
EHRYYG+S PF L N R Y N+ ++L D A HI A P
Sbjct: 143 EHRYYGQSFPFKD----LSNESLR-YLNTRESLDDSAYFSQHIVLPGHEDLDITAPGTPW 197
Query: 180 IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEEC 239
I G SY G A + YP ++ GS+ASSA I + + W Y + E C
Sbjct: 198 IYYGGSYAGAKAAFMMKLYPDLIWGSIASSAVI--HAQVDFWQYYEPIRIH----APETC 251
Query: 240 FQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYD----- 294
+ + ID I D +AI+S K L N T+ +D ++ + S +
Sbjct: 252 IEPLIIITRSIDRILLSNDSMAIMS--LKDLFGLANVTDHRDFVNVLASPIGTWQERNWD 309
Query: 295 -----------------TPSNIPVKRICNAI----------ENAP-----NCGDDILCKI 322
PS P+K ++ EN P + I I
Sbjct: 310 PRISNHEFETYCDSLKRNPSPEPIKTFSTSLSLLQRFFEVEENFPLDSLLGYSNYIKNSI 369
Query: 323 AAGVVEADSLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWN 382
++ + D E G + D D++ W WQ C+E G +NS +
Sbjct: 370 SSKCEKLDQDECFGTGNLTAHQIDSL-DQTWRSWMWQVCTE----WGYFQNSSPGLQDSL 424
Query: 383 LTKYI------KNCKEQYGV----SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDP 432
++K I + C+ +G +P + V YG +D+ ++ + F +G DP
Sbjct: 425 VSKLITLEYNSRPCQLAFGSNIPKTPNTTQV-NQYGDYDLD------SNRLAFIDGSHDP 477
Query: 433 FSRGGWVKTYHFFDLSFSQDLNL--------GSHCLDLDEAKKSDPDWLVQQRKTEVKIM 484
W+ Y Q+ + G H D + +P+ + + EV +
Sbjct: 478 -----WI--YMTVHSPLLQNRTIRDGFVIDGGIHHWDENGNGDGEPEGIRNVHQREVGWV 530
Query: 485 QGWITQYYD 493
+ W+ Q+YD
Sbjct: 531 REWVRQFYD 539
>gi|391865279|gb|EIT74569.1| hydrolytic enzymes of the alpha/beta hydrolase fold protein
[Aspergillus oryzae 3.042]
Length = 573
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 84/181 (46%), Gaps = 18/181 (9%)
Query: 42 LEQKDSNHGFETFFYNQTIDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGA 97
++Q+ + + ++ + +DHF Y P S F RY +++++ G P+
Sbjct: 45 VKQQSTTAEYPEYYLSVPVDHFHNDSRYAPHSDDHFDLRYWFDAQYYKEGG-PVFVIAAG 103
Query: 98 EAPIDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSA 155
E D G E A + + VILEHRYYGKS P + +N R + ++
Sbjct: 104 ETDATDRFPFLSQGIVTELASAYNGIGVILEHRYYGKSYPVAN--LTTENIR---FLSTD 158
Query: 156 QALADYA----SILLHIKDKYNATH--APVIAIGASYGGELATWFRLKYPHVVIGSLASS 209
QALADYA +++ + N T P IA G SY G + R YP V G+++SS
Sbjct: 159 QALADYAYFASNVVFPGLEHVNLTSKTTPWIAYGGSYAGAFVAFLRKLYPDVYWGAVSSS 218
Query: 210 A 210
Sbjct: 219 G 219
>gi|169775711|ref|XP_001822322.1| extracellular serine carboxypeptidase [Aspergillus oryzae RIB40]
gi|238502353|ref|XP_002382410.1| extracellular serine carboxypeptidase, putative [Aspergillus flavus
NRRL3357]
gi|83771057|dbj|BAE61189.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220691220|gb|EED47568.1| extracellular serine carboxypeptidase, putative [Aspergillus flavus
NRRL3357]
Length = 573
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/181 (29%), Positives = 84/181 (46%), Gaps = 18/181 (9%)
Query: 42 LEQKDSNHGFETFFYNQTIDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGA 97
++Q+ + + ++ + +DHF Y P S F RY +++++ G P+
Sbjct: 45 VKQQSTTAEYPEYYLSVPVDHFHNDSRYAPHSDDHFDLRYWFDAQYYKEGG-PVFVIAAG 103
Query: 98 EAPIDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSA 155
E D G E A + + VILEHRYYGKS P + +N R + ++
Sbjct: 104 ETDATDRFPFLSQGIVTELASAYNGIGVILEHRYYGKSYPVAN--LTTENIR---FLSTD 158
Query: 156 QALADYA----SILLHIKDKYNATH--APVIAIGASYGGELATWFRLKYPHVVIGSLASS 209
QALADYA +++ + N T P IA G SY G + R YP V G+++SS
Sbjct: 159 QALADYAYFASNVVFPGLEHVNLTSKTTPWIAYGGSYAGAFVAFLRKLYPDVYWGAVSSS 218
Query: 210 A 210
Sbjct: 219 G 219
>gi|440468398|gb|ELQ37563.1| hypothetical protein OOU_Y34scaffold00590g77 [Magnaporthe oryzae
Y34]
Length = 487
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 93/225 (41%), Gaps = 31/225 (13%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID------DNIQLSG 109
++Q +DH + S TF QRY ++ W G SP+ F E D DN L G
Sbjct: 21 FDQLLDHHN---PSKGTFKQRYFWDASSWAGPGSPVFLFNPGEDAADGYVGYLDNHTLPG 77
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
A F+ ++++EHRY+GKS+PF + A Y + Q++ D ++
Sbjct: 78 LY---ADTFQGAVIVIEHRYWGKSIPFD-----ILTAETLQYLDVPQSIMDMTHFAKTVQ 129
Query: 170 DKY--------NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
+ NA AP + IG SY G LA W + P V A+SA I D
Sbjct: 130 LSFDSSGDGGANAEKAPWVLIGGSYSGALAAWTQKLSPGVFWAYHATSAVIEAVHD---- 185
Query: 222 TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKK 266
+H+ + C ++ + +D + + A+ K
Sbjct: 186 --FHTYFAPIEAALPRNCSADVRAVVAHVDRVLDSRNSTAVRRLK 228
>gi|389633507|ref|XP_003714406.1| hypothetical protein MGG_01453 [Magnaporthe oryzae 70-15]
gi|351646739|gb|EHA54599.1| hypothetical protein MGG_01453 [Magnaporthe oryzae 70-15]
Length = 528
Score = 73.2 bits (178), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 59/225 (26%), Positives = 93/225 (41%), Gaps = 31/225 (13%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID------DNIQLSG 109
++Q +DH + S TF QRY ++ W G SP+ F E D DN L G
Sbjct: 62 FDQLLDHHN---PSKGTFKQRYFWDASSWAGPGSPVFLFNPGEDAADGYVGYLDNHTLPG 118
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
A F+ ++++EHRY+GKS+PF + A Y + Q++ D ++
Sbjct: 119 LY---ADTFQGAVIVIEHRYWGKSIPFD-----ILTAETLQYLDVPQSIMDMTHFAKTVQ 170
Query: 170 DKY--------NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW 221
+ NA AP + IG SY G LA W + P V A+SA I D
Sbjct: 171 LSFDSSGDGGANAEKAPWVLIGGSYSGALAAWTQKLSPGVFWAYHATSAVIEAVHD---- 226
Query: 222 TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKK 266
+H+ + C ++ + +D + + A+ K
Sbjct: 227 --FHTYFAPIEAALPRNCSADVRAVVAHVDRVLDSRNSTAVRRLK 269
>gi|115387739|ref|XP_001211375.1| predicted protein [Aspergillus terreus NIH2624]
gi|114195459|gb|EAU37159.1| predicted protein [Aspergillus terreus NIH2624]
Length = 564
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 71/262 (27%), Positives = 106/262 (40%), Gaps = 25/262 (9%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID----DNIQLSGFTYENA 115
IDH S T+ R+ + +F+ G SPI F G E+ + F +
Sbjct: 76 IDH---NDPSVGTYQNRFWVTEQFYVPG-SPIFVFDGGESNAEVYKSHLTSNQSFFWHIL 131
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA------SILLHIK 169
FK + ++ EHRYYG+S PF + H Y + QAL D S H
Sbjct: 132 KDFKGMGILWEHRYYGESRPFPVSLST--PPEHMEYLTTRQALEDIPYFAKNFSRPNHPD 189
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW--TTYHSV 227
P I +G SY G A R +YP + + ASSAP+ D++ + Y +
Sbjct: 190 VDLRPQSTPWIMVGGSYAGIRAALARSEYPETIYAAYASSAPVEAQVDMSIYFNNVYRGM 249
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-LKDGLDTV 286
+ D C Q+I+ ++ ID+ K D A + + F NT E L +
Sbjct: 250 EANGLGD----CIQSIQAAYQYIDDQLGKEDTAAAVKRLFFGPGAENNTNEGFTAALSHL 305
Query: 287 YSEAAQYDT--PSNIPVKRICN 306
Y Y P+N ++ CN
Sbjct: 306 YDSFQGYGLTGPANSSLQDFCN 327
>gi|320588605|gb|EFX01073.1| serine-type peptidase [Grosmannia clavigera kw1407]
Length = 551
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 76/294 (25%), Positives = 117/294 (39%), Gaps = 49/294 (16%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD------NIQLSG 109
+ Q +DH S TF QR+ ++++WGG SP++ F E P + N ++G
Sbjct: 60 FQQLLDHKD---PSQGTFSQRFWWSTQYWGGPGSPVVFFTPGEEPATNYTGYLTNRTITG 116
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASIL---L 166
+ A +V+LEHRY+G+S PF KN R NS + +A +
Sbjct: 117 ---QFAQAIGGAVVMLEHRYWGESSPF--DDLTTKNMRFLTLANSIADVTHFARTVELPF 171
Query: 167 HIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHS 226
NA AP + G SYGG LA + P ASSAP+ D W +
Sbjct: 172 DTNGTSNAPTAPWVMSGGSYGGALAAYIEHVDPGTFWAYHASSAPVQVIEDF--WQYFEP 229
Query: 227 VVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAI--LSKKFKTCTPLKNTTEL----- 279
V+ + + C ++ ID++ + I L KF T ++++ +
Sbjct: 230 VI----QGLPQNCSADLQLVIPHIDDVLANGTTAEIQALKDKFGLGT-IEHSDDFVVALE 284
Query: 280 -------KDGLDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGV 326
+ G D YS PV C+ +EN+ D A GV
Sbjct: 285 NGPWDWQESGFDADYS-----------PVYEFCDYVENSVASVFDSATAGAEGV 327
>gi|296424960|ref|XP_002842012.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295638268|emb|CAZ86203.1| unnamed protein product [Tuber melanosporum]
Length = 544
Score = 72.8 bits (177), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 108/491 (21%), Positives = 185/491 (37%), Gaps = 99/491 (20%)
Query: 77 YVINSKFWGGGNSPILAFLGAEAPIDDNIQ-----LSGFTYENAHQFKALIVILEHRYYG 131
++++SKF+ GN P+ + E + ++Q + F + +F + ++ EHRYYG
Sbjct: 72 FIVHSKFYQPGN-PVFIYDAGEGNAEPSVQDRLVDGASFFRQIVQEFNGIGIVWEHRYYG 130
Query: 132 KSVPFG-SRKAALKNARHRGYFNSAQALADYASILLHIKDK------YNATHAPVIAIGA 184
S S ++ R + N+AQALAD + + + P + IG
Sbjct: 131 ASSHVNISLATPVEEFR---WLNTAQALADIPAFANNFTLRSFSNIDLTPASTPWVFIGG 187
Query: 185 SYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW--TTYHSVVSKDFRDTSEECFQT 242
SY G A + R KYP + S +SSAP+ D++ + Y + F++ + +
Sbjct: 188 SYPGSRAAFARHKYPETIFASFSSSAPVQAQIDMSAYFEQVYRGLRGLGFKNCTNDIVAA 247
Query: 243 IKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-LKDGLDTVYSEAAQYDTPSNIP- 300
IK ID+ S+ D A + K++ N+ E D L +Y T N
Sbjct: 248 IKY----IDDQLSRADTSAAIKKQYLGRGAENNSNEGFADALTYIYYSWQSNLTEGNPAN 303
Query: 301 VKRICNAIENAPNCGD-----------------------DILCKIAAGVVEADSLEYDGN 337
++ C+ + P G+ + + Y GN
Sbjct: 304 MRSFCDWVSTNPETGEVSGEEGWAKSKGAKFASDRWASWEPFGTVVNRFAPTSCAGYYGN 363
Query: 338 NS----RCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKN-CKE 392
++ +C + +D D + W +Q CSE ++ +N K+ ++ C
Sbjct: 364 STVGPPKCRL-DDPFPDPAAVSWTYQYCSEWGYFQSSNRGLNPIVSSYNSPKHQQDICFR 422
Query: 393 QY------GVSPRP---SWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPF------SRGG 437
Q+ GV P+ + YGG D++ SN+ FS G DP+ S
Sbjct: 423 QFPQGLSSGVLPQHPDIDKINREYGGWDLR------PSNVFFSGGEYDPWRPVSLLSEES 476
Query: 438 WVKTYHFFDLSFSQD-----------------LNLGSHCLDLDEAKKSDPDWLVQQRKTE 480
W + +QD L SHC DL + S PD RK
Sbjct: 477 WAPK----GVKLTQDIPEEGVSTPRSEVFGYLLKGASHCFDL---RTSWPD-GANSRKIF 528
Query: 481 VKIMQGWITQY 491
++ W+ +
Sbjct: 529 TTALRKWLGNF 539
>gi|345568628|gb|EGX51521.1| hypothetical protein AOL_s00054g220 [Arthrobotrys oligospora ATCC
24927]
Length = 547
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 84/169 (49%), Gaps = 17/169 (10%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--S 108
+ ++ Q +DHF P + TF QRY I++ F+ G PI G E ++ +
Sbjct: 65 YTELWFPQKVDHFD--PSNNNTFQQRYWISTHFYKPG-GPIFVLDGGETSGAGRVEYMQT 121
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
G + ++LEHRYYGKS + + ++N + + N+AQ+L D A ++
Sbjct: 122 GIGRYITEYLGGIGIVLEHRYYGKS--YVTPNLTVENLK---WLNTAQSLKDNAYFAENL 176
Query: 169 KDKYNAT-------HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
+ A +AP I+ G SY G + + +++YP + GSLASSA
Sbjct: 177 WKELPANLSHIRPDNAPFISYGGSYAGAKSAFLQIEYPEIYYGSLASSA 225
>gi|402084362|gb|EJT79380.1| endoprotease [Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 554
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 101/429 (23%), Positives = 167/429 (38%), Gaps = 80/429 (18%)
Query: 19 STSFHANGLKLRPRLGRIRRSRILE----QKDSNHGFETFFYNQTIDHFSYGPESYQTFP 74
+ SFH +G+++ P + L + D +G+ F +Q IDH + TF
Sbjct: 23 AVSFHRSGMEIGPIEDEVELFSALATSRGRADPMNGWGEF--DQLIDHAN---PQLGTFK 77
Query: 75 QRYVINSKFWGGGNSPILAF-LGAEAPIDDNI------QLSGFTYENAHQFKALIVILEH 127
QR+ ++FW G SPI+ G ++ N+ +L+G A + A +V++EH
Sbjct: 78 QRFWYGTEFWNGTGSPIIVVNPGEQSATGFNVTYTTKGRLAGLM---AEKIGAAVVVVEH 134
Query: 128 RYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA----PVIAIG 183
RY+G+S P+ + +N ++ NS Q + +A D A P + G
Sbjct: 135 RYWGESTPY--KVLTSENLKYLDLKNSIQDMVYFAHNFKAPFDPAEGATAPGRVPWLYSG 192
Query: 184 ASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTI 243
SY G LA W K P SS + D+ WT + V R+ S + I
Sbjct: 193 GSYSGALAGWIAAKAPGTFWAIHGSSGVVEAVADM--WTMFVPVQEAMPRNCSRDVSAAI 250
Query: 244 --------KKSWSEIDNIAS--KPDGL--AILSKKFKTCTPLKNTTELKDGLDTVYSEAA 291
S ++ D + + K DGL A + + L +T+ YS
Sbjct: 251 DYMDGILMSGSQAQKDKLKAKFKLDGLSDADFGQAIENGPWLWQSTQF-------YSANK 303
Query: 292 QYDTPSNIPVKRICNAIENA------PNCGDDIL--CKIAAGVVE-------------AD 330
P R C+ +E A P G+ + CK G + +
Sbjct: 304 DLGGPGFNSFHRFCDYVEGAWPGSTTPVPGEKGVGSCKAVEGYAKWFTELWLPGQCESSG 363
Query: 331 SLEYDG-NNSRCYINED---------RTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEP 380
E+ G NN+ CY N++ R + + W W C+E P G +N + P
Sbjct: 364 YPEWQGQNNTACYQNQNASNPQYHDLRYNNPVNRQWNWMLCNE---PFGWWQNGAPKDRP 420
Query: 381 WNLTKYIKN 389
+++ + N
Sbjct: 421 SIVSRLVNN 429
>gi|407850455|gb|EKG04846.1| serine carboxypeptidase S28, putative [Trypanosoma cruzi]
Length = 631
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 103/448 (22%), Positives = 167/448 (37%), Gaps = 52/448 (11%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+ Q +DH G TF QRY ++ W N+ LA L D G+
Sbjct: 60 FRQLVDHSKNGG---STFDQRYWVDYSAW---NNSELAMLYIRIGSGDFTSPRGYPGIYG 113
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT 175
H+ L+ LE RYYGKS+PF + + + Y N AL D +++K
Sbjct: 114 HERNMLLFTLEGRYYGKSLPF----PLTETEKLKKYLNVDIALEDIRGFQKFVEEKLLQK 169
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
+ +G SY G LA WF+ KYP + +SSA + D ++ +
Sbjct: 170 KLRWLIVGGSYAGALAVWFKAKYPTAALAVWSSSAVVEAQFD------FYGFDGRVKSAI 223
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDT 295
S EC + I S + +F + L +D V + A QY
Sbjct: 224 SPECVREIYAVQSLFSELWENETARVSFLNRFNIPHYFDKSGILYMMVDAV-AGAVQYGK 282
Query: 296 PSNIPVKRICNAIENAPNCGDDILCKI--AAGVVEADSLEYDG-NNSRCYINEDRTGDES 352
++C+ I + DI+ + ++ S ++ C N +
Sbjct: 283 K-----WKMCDLITQKNDM--DIMGRFFNMINLIYGQSFTTSCIYSTECLSNSTMSNQWV 335
Query: 353 DEGWEW--QSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTY---Y 407
G+ W QSCSE+ N + E N ++ C+ +G S P V + +
Sbjct: 336 GTGYAWFYQSCSELAFFQVGYYNGLRSLE-LNTEYFVNQCRSAFGDSVFPD-VFRFNAKW 393
Query: 408 GGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCL------D 461
GG + SN++ ++G DP+ G T S L ++C D
Sbjct: 394 GGK------YPNASNVVATHGSSDPWIDSGVTTT------SGPGYRVLIANCADCGRSGD 441
Query: 462 LDEAKKSDPDWLVQQRKTEVKIMQGWIT 489
L + +D + L QR ++ W+T
Sbjct: 442 LATPRPTDSEALQLQRDELALLLDNWMT 469
>gi|426196115|gb|EKV46044.1| hypothetical protein AGABI2DRAFT_193941 [Agaricus bisporus var.
bisporus H97]
Length = 552
Score = 72.4 bits (176), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 87/315 (27%), Positives = 134/315 (42%), Gaps = 47/315 (14%)
Query: 30 RPRLGRIRRSRILEQKDSNHGFETF---FYNQTIDHFSYGPESYQTFPQRYVINSKFWGG 86
R R RR LE S FE F ++ Q +DHF + TF QRY ++ + +
Sbjct: 42 RERGASSRRDLPLEI--SARAFEEFEPQWFEQPLDHFDES--NPHTFKQRYWVSKRHYKA 97
Query: 87 -GNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAAL 143
+P+ G E + + +G A + L VILEHRYYG+S+ A+
Sbjct: 98 RQGAPVFVLDGGETSGANRLPFLDTGIVDILARATEGLGVILEHRYYGESI-------AV 150
Query: 144 KNARHRG--YFNSAQALADYASILLHIK-----DKYNATHAPVIAIGASYGGELATWFRL 196
+N + N+ QA AD A+ + +K + A P I G SY G + ++
Sbjct: 151 ENLTTDALRWLNNEQAAADSANFMAKVKFNGIEEDLTAPSTPWIYYGGSYAGARSAHMKI 210
Query: 197 KYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
YP +V G++ASSA + + + W Y ++ K +C ++ S ID+I
Sbjct: 211 LYPDLVYGAIASSA--VTHAAVENW-QYMEIIRK---AADPKCSHHLENSIKIIDSI--- 261
Query: 257 PDGLAILSKKFKTCTP----LKNTTELKDGLDTVYSEAAQYDT------PSNIPVKRICN 306
L SK FK+ L N T D T+ S + + S++ + C
Sbjct: 262 ---LLGHSKFFKSHLKGLFGLANLTHDDDFASTISSPLGSWQSKNWDPEESSVRFDQFCE 318
Query: 307 AIENAPNCGDDILCK 321
A+ NAP G I K
Sbjct: 319 AL-NAPVHGHGISAK 332
>gi|339232872|ref|XP_003381553.1| serine carboxypeptidase S28 family protein [Trichinella spiralis]
gi|316979630|gb|EFV62393.1| serine carboxypeptidase S28 family protein [Trichinella spiralis]
Length = 484
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 38/105 (36%), Positives = 63/105 (60%), Gaps = 6/105 (5%)
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
+G E + Q A++V R++G + P GS A++ N + Y + + LAD A ++
Sbjct: 109 AGLLSEISKQLNAVVVTFVPRFFGINKPTGS--ASVDNLK---YLSVEEVLADLAHLVHS 163
Query: 168 IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
++ KY + V+ +G ++GG LA WFRLKYPH+ G++AS AP+
Sbjct: 164 LRSKYPDSGKTVV-VGTAHGGNLAIWFRLKYPHLCDGAIASGAPL 207
>gi|71653617|ref|XP_815443.1| serine carboxypeptidase S28 [Trypanosoma cruzi strain CL Brener]
gi|70880498|gb|EAN93592.1| serine carboxypeptidase S28, putative [Trypanosoma cruzi]
Length = 631
Score = 72.0 bits (175), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 112/501 (22%), Positives = 190/501 (37%), Gaps = 52/501 (10%)
Query: 8 FQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQ---KDSNHGFETFFYNQT----- 59
F +L ++++F+++ + L L R+ R +D + F+ ++N+T
Sbjct: 2 FFFLSIVLLFLASCGASGLLPLDVIPERLNRGEFFCSPVGEDGGNLFDEEYFNKTPATFR 61
Query: 60 --IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DH G TF QRY ++ W N LA L D G+ H+
Sbjct: 62 QLVDHSKNGG---STFDQRYWVDYSAW---NKSELAMLYIRIGSGDFTSPRGYPGIYGHE 115
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
L+ LE RYYGKS+PF + + + Y N AL D +++K
Sbjct: 116 RNMLLFTLEGRYYGKSLPF----PLTETEKLKKYLNVDIALEDIRGFQKFVEEKLLQKKL 171
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
+ +G SY G LA WF+ KYP + +SSA + D ++ + S
Sbjct: 172 RWLIVGGSYAGALAVWFKAKYPTAALAVWSSSAIVEAQFD------FYGFDGRVKSAISP 225
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDTPS 297
EC + I S + +F + L D V + A QY
Sbjct: 226 ECVREIYAVQSLFSELWENETARVSFLNRFNIPHYFDKSGILYMMADAV-AGAVQYGKK- 283
Query: 298 NIPVKRICNAIENAPNCGDDILCKI--AAGVVEADSLEYDG-NNSRCYINEDRTGDESDE 354
++C+ I + DI+ + ++ S ++ C N +
Sbjct: 284 ----WKMCDLITQKNDM--DIMGRFFYMINLIYGQSFTTSCIYSTECLSNSTMSNQWVGT 337
Query: 355 GWEW--QSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTY---YGG 409
G+ W QSCSE+ N + E N ++ C+ +G S P V + +GG
Sbjct: 338 GYAWFYQSCSELAFFQVGYYNGLRSLE-LNTEYFVNQCRSAFGDSVFPD-VFRFNVKWGG 395
Query: 410 HDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH-FFDLSFSQDLNLGSHCLDLDEAKKS 468
K SN++ ++G DP+ G T + + + + G DL + +
Sbjct: 396 KYPK------ASNVVATHGSSDPWIDSGVTTTNGPGYRVLIANCADCGRSG-DLATPRPT 448
Query: 469 DPDWLVQQRKTEVKIMQGWIT 489
D + L QR ++ W+T
Sbjct: 449 DSEALQLQRDELALLLDTWMT 469
>gi|452003099|gb|EMD95556.1| hypothetical protein COCHEDRAFT_1061531, partial [Cochliobolus
heterostrophus C5]
Length = 497
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 87/345 (25%), Positives = 140/345 (40%), Gaps = 56/345 (16%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEA---PIDDNIQLSGFTYEN-A 115
+DHF TFP RY + S+ + G P+ E+ P+ I F +
Sbjct: 5 LDHFG---SDAGTFPNRYWVYSENYKPG-GPVFILDQGESNAEPVSRWIPDPRFFFNQIV 60
Query: 116 HQFKALIVILEHRYYGKSVPFG-SRKAALKNARHRGYFNSAQALADYASI-----LLHIK 169
+F + + EHR YG+SVP G +L + Y N QALAD + L +I
Sbjct: 61 KEFNGIGIAWEHRMYGESVPAGFHNDTSLDRFK---YLNVPQALADIDAFAKQFSLPYIN 117
Query: 170 DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPIL------YYGDITPWTT 223
+A H P + IGASY G A W R KYP + S ASSA + YY D
Sbjct: 118 ATLDADHTPWVFIGASYSGGRAAWVRNKYPDSIYASWASSAVVEAMVDMGYYAD----AI 173
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIA-SKPDGLAILSKKFKTCTPLK-NTTELKD 281
+ + +K F + C + I+ + D+I + P A L ++F + +
Sbjct: 174 WAGMNAKGFGN----CTRDIQAAIRYADHIMDTDPQAAAKLKEQFLGANNANISNVDFGG 229
Query: 282 GLDTVYSEAAQYDTPS-NIPVKRICNAIENAPNC--------------GDDILCKIA--A 324
L V+ + Y ++ +++ C+ +E P +L K A
Sbjct: 230 TLQVVFGDWQTYAMDGYSVSLRQFCDELETDPKTNQTAPKEGWAPTRGAKSVLDKWALYP 289
Query: 325 GVVEADSLEY------DGNNSRCYINEDRTGDESDEGWEWQSCSE 363
G + + L+ +G SR + + D + W WQ+C++
Sbjct: 290 GYLNVELLDSNAPCAKNGTVSRNCSSNAQPVDPNVLSWRWQACTQ 334
>gi|322709089|gb|EFZ00665.1| serine peptidase, putative [Metarhizium anisopliae ARSEF 23]
Length = 506
Score = 72.0 bits (175), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 100/220 (45%), Gaps = 27/220 (12%)
Query: 5 QFSFQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGF---ETFFYNQTID 61
+ SF +L + V +S N K RRS I + SN+G + ++Q ID
Sbjct: 2 RISFLAAVLAELLVGSSAAGNSHKYD------RRSHIPAR--SNNGTAAPKAGLFDQLID 53
Query: 62 HFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNI------QLSGFTYENA 115
H + S TF QR+ ++ F+GG SPI+ EA +D+N+ L+G A
Sbjct: 54 HNN---PSLGTFKQRFWWSADFYGGPGSPIILEAPGEAAVDENLVNHSNETLTGLF---A 107
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT 175
KA + LEHRY+G S P G A + +H NS Q L +A+ + D + +
Sbjct: 108 QTNKAAAITLEHRYWGGSSPVGKNWTA-QTLQHLNLDNSIQDLIYFANNVYLPFDSHGDS 166
Query: 176 ---HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
AP + G SY G L W + P + ASSA +
Sbjct: 167 KPDKAPWVLTGCSYPGALTAWTNVLAPGIFWAYHASSAVV 206
>gi|336463818|gb|EGO52058.1| hypothetical protein NEUTE1DRAFT_118619 [Neurospora tetrasperma
FGSC 2508]
Length = 569
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 69/239 (28%), Positives = 99/239 (41%), Gaps = 25/239 (10%)
Query: 60 IDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYE 113
+DHF Y P + TF RY ++ ++ G P++ E +Q G Y+
Sbjct: 66 VDHFHNDTKYEPHTNDTFDLRYWFDATYYKKG-GPVIVLAAGETSGVGRLQFLQKGIVYQ 124
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK---- 169
A + VILEHRYYGKS+P + + KN R + + QALAD ++K
Sbjct: 125 LAKATGGVGVILEHRYYGKSLP--TSDFSTKNLR---FLTTDQALADTVYFAKNVKFAGL 179
Query: 170 --DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
A + P IA G SY G + R YP V G+++SS D W Y +
Sbjct: 180 EHLDLTAPNTPYIAYGGSYAGAFVAFLRKLYPDVYWGAISSSGVTEAIYDY--WQYYEAA 237
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
+D C +K +DNI A +K K L N T D +T+
Sbjct: 238 RIYGPKD----CVTATQKLTHVVDNIILNKAN-AKYVQKLKDAFGLGNLTHTDDFANTI 291
>gi|156048248|ref|XP_001590091.1| hypothetical protein SS1G_08855 [Sclerotinia sclerotiorum 1980]
gi|154693252|gb|EDN92990.1| hypothetical protein SS1G_08855 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 588
Score = 71.6 bits (174), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 114/265 (43%), Gaps = 27/265 (10%)
Query: 37 RRSRILEQKDS---NHGFETFFYNQTIDHF----SYGPESYQTFPQRYVINSKFWGGGNS 89
RR L ++D+ + + T+ + ID+F Y P S TFP RY ++ ++ G
Sbjct: 114 RRELALSKRDNIDPSLLYPTYNLSVPIDYFHNESRYEPHSNGTFPLRYWFDATYYKPG-G 172
Query: 90 PILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNAR 147
P++ E + +G ++ A + V+LEHRYYG+S+P + + KN R
Sbjct: 173 PVIVLQSGETDATGRLPFLQNGLLHQLAVATNGIGVVLEHRYYGESIP--TPDFSTKNLR 230
Query: 148 HRGYFNSAQALADYASILLHI------KDKYNATHAPVIAIGASYGGELATWFRLKYPHV 201
+ + QAL D +I A + I G SY G + R YP
Sbjct: 231 ---FLTTEQALMDEVYFARNIVFPGLEDQNLTAPNVAYIGYGGSYAGAFNAFLRKLYPDT 287
Query: 202 VIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLA 261
G+++SS + D WT + + + F D ++C + + + +DNI +
Sbjct: 288 FWGTISSSGVVEAIYDY--WTYFEPI--RVFAD--QKCIKNTQLITNSMDNIVIGQENNT 341
Query: 262 ILSKKFKTCTPLKNTTELKDGLDTV 286
L ++ KT L N T D + V
Sbjct: 342 ALIQELKTVWGLPNVTYSNDFMSVV 366
>gi|70982075|ref|XP_746566.1| serine peptidase, family S28 [Aspergillus fumigatus Af293]
gi|66844189|gb|EAL84528.1| serine peptidase, family S28, putative [Aspergillus fumigatus
Af293]
Length = 560
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 104/417 (24%), Positives = 158/417 (37%), Gaps = 70/417 (16%)
Query: 69 SYQTFPQRYVINSKFWGGGNSPILAF----LGAEAPIDDNIQLSGFTYENAHQFKALIVI 124
S T+ R+ +N F+ G SPI+ + AE + S + +F A+ ++
Sbjct: 81 SVGTYRNRFWVNEDFYISG-SPIMVYDIGEATAEYSVSLLTNSSSWLSLLLQEFNAMGIV 139
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA------SILLHIKDKYNATHAP 178
EHRYYG S+P+ + H Y + QALAD S L H P
Sbjct: 140 WEHRYYGGSLPYPVSQD--MPVEHLKYLTTEQALADIPYFAANFSRLNHPDFDLTPRGTP 197
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPI-------LYYGDITPWTTYHSVVSKD 231
I IG SY G A R KYP + + ASSAP+ +YY I Y ++V
Sbjct: 198 WIMIGGSYPGIRAAITRNKYPDTIFAAYASSAPVQAQLNMSVYYEQI-----YRAIVGNG 252
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-LKDGLDTVYSEA 290
+ + C + I+ + ID S A++ + F KN+ E L T+Y
Sbjct: 253 YSN----CTKDIQAALKYIDGQLSNKRTFALIKRLFLGPDAEKNSNEDFTTALVTLYGPF 308
Query: 291 AQYDTPS-NIPVKRICNAIENAP----NCGDDILCKIAAGVVEAD------------SLE 333
+ S N + CN +E P + G + L I A+ +
Sbjct: 309 QAHGLWSGNQSLPDFCNYLELDPATNQSAGPEGLSPIHGSKYVAERWASFPYFISLVNRM 368
Query: 334 YDGNNSRCYINEDRTGDESDE-------GWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKY 386
Y N + +E + D S W WQ C+E + S + +Y
Sbjct: 369 YGTNCNGLNASEPLSCDFSQTNTIPELISWTWQYCTEWGFFQSNNFGSHALLSSYQTLEY 428
Query: 387 IKN-CKEQYG-------VSPRPSW--VLTYYGGHDIKLILRRSTSNIIFSNGMRDPF 433
+ C Q+ + PRP + +GG I+ SN+ FS G DP+
Sbjct: 429 QQELCYRQFPNAVQAGVLPPRPQTESLNEKFGGWTIR------PSNVYFSGGQFDPW 479
>gi|159122200|gb|EDP47322.1| serine peptidase, family S28, putative [Aspergillus fumigatus
A1163]
Length = 560
Score = 71.6 bits (174), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 104/417 (24%), Positives = 158/417 (37%), Gaps = 70/417 (16%)
Query: 69 SYQTFPQRYVINSKFWGGGNSPILAF----LGAEAPIDDNIQLSGFTYENAHQFKALIVI 124
S T+ R+ +N F+ G SPI+ + AE + S + +F A+ ++
Sbjct: 81 SVGTYRNRFWVNEDFYISG-SPIMVYDIGEATAEYSVSLLTNSSSWLSLLLQEFNAMGIV 139
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA------SILLHIKDKYNATHAP 178
EHRYYG S+P+ + H Y + QALAD S L H P
Sbjct: 140 WEHRYYGGSLPYPVSQD--MPVEHLKYLTTEQALADIPYFAANFSRLNHPDFDLTPRGTP 197
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPI-------LYYGDITPWTTYHSVVSKD 231
I IG SY G A R KYP + + ASSAP+ +YY I Y ++V
Sbjct: 198 WIMIGGSYPGIRAAITRNKYPDTIFAAYASSAPVQAQLNMSVYYEQI-----YRAIVGNG 252
Query: 232 FRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-LKDGLDTVYSEA 290
+ + C + I+ + ID S A++ + F KN+ E L T+Y
Sbjct: 253 YSN----CTKDIQAALKYIDGQLSNKRTFALIKRLFLGPDAEKNSNEDFTTALVTLYGPF 308
Query: 291 AQYDTPS-NIPVKRICNAIENAP----NCGDDILCKIAAGVVEAD------------SLE 333
+ S N + CN +E P + G + L I A+ +
Sbjct: 309 QAHGLWSGNQSLPDFCNYLELDPATNQSAGPEGLSPIHGSKYVAERWASFPYFISLVNRM 368
Query: 334 YDGNNSRCYINEDRTGDESDE-------GWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKY 386
Y N + +E + D S W WQ C+E + S + +Y
Sbjct: 369 YGTNCNGLNASEPLSCDFSQTNTIPELISWTWQYCTEWGFFQSNNFGSHALLSSYQTLEY 428
Query: 387 IKN-CKEQYG-------VSPRPSW--VLTYYGGHDIKLILRRSTSNIIFSNGMRDPF 433
+ C Q+ + PRP + +GG I+ SN+ FS G DP+
Sbjct: 429 QQELCYRQFPNAVQAGVLPPRPQTESLNEKFGGWTIR------PSNVYFSGGQFDPW 479
>gi|161078413|ref|NP_001097835.1| CG11626 [Drosophila melanogaster]
gi|158030303|gb|AAF55668.2| CG11626 [Drosophila melanogaster]
gi|189181944|gb|ACD81748.1| IP20428p [Drosophila melanogaster]
Length = 379
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 137/347 (39%), Gaps = 42/347 (12%)
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHAP 178
++ EHRYYG S+PFG+ L N + + Q+LAD A + H K + +
Sbjct: 7 GMLFYTEHRYYGLSLPFGNESYRLSNLKQ---LSLHQSLADLAHFIRHQKSNDPEMEDSK 63
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-SE 237
VI +G SY G L W YP ++ S ASSAP+L D + Y +V K + +
Sbjct: 64 VILVGGSYSGSLVAWMTQLYPDLIAASWASSAPLLAKAD---FFEYMEMVGKSIQLSYGN 120
Query: 238 ECFQTIKKSWS---------EIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYS 288
C I+K + EI + +G S K PL + +GL ++
Sbjct: 121 NCSLRIEKGFKFLVKLFDGDEIQELLYNLNGCVGYSPK----NPL-DRAAFFNGLGNYFA 175
Query: 289 EAAQYDTPSNIPVKRICNAIENAPNCGDDILCKI-----AAGVVEADSLEYDGNNSRCYI 343
Q + S IP R+C + + + + + + G +D ++ ++
Sbjct: 176 LVVQSYSAS-IP--RLCETLMSLDSSDELAFIEFLKLLYSEGRRSSDCQDFGYSSMLELF 232
Query: 344 NEDRTGDESDEGWEWQSCSEM---VVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP 400
ED W +Q+C+E K S L + + C++ +G
Sbjct: 233 TEDSVQSSETRAWFYQTCNEFGWYTTTKSKSSASQAFANQVPLGYFEQLCQDAFGAEQTA 292
Query: 401 SWVL-------TYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVK 440
+ + +GG R + +IF++G DP+S G K
Sbjct: 293 HQLAHGVEQTNSKFGGFGFNQSERY--AQVIFTHGELDPWSALGQQK 337
>gi|350295890|gb|EGZ76867.1| peptidase S28 [Neurospora tetrasperma FGSC 2509]
Length = 562
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 69/239 (28%), Positives = 99/239 (41%), Gaps = 25/239 (10%)
Query: 60 IDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYE 113
+DHF Y P + TF RY ++ ++ G P++ E +Q G Y+
Sbjct: 69 VDHFHNDTKYEPHTNDTFNLRYWFDATYYKKG-GPVIVLAAGETSGVGRLQFLQKGIVYQ 127
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK---- 169
A + VILEHRYYGKS+P + + KN R + + QALAD ++K
Sbjct: 128 LAKATGGVGVILEHRYYGKSLP--TSDFSTKNLR---FLTTDQALADTVYFAKNVKFAGL 182
Query: 170 --DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
A + P IA G SY G + R YP V G+++SS D W Y +
Sbjct: 183 EHLDLTAPNTPYIAYGGSYAGAFVAFLRKLYPDVYWGAISSSGVTEAIYDY--WQYYEAA 240
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
+D C +K +DNI A +K K L N T D +T+
Sbjct: 241 RIYGPKD----CVTATQKLTHVVDNIILNKAN-AKYVQKLKDTFGLGNLTHTDDFANTI 294
>gi|255565517|ref|XP_002523749.1| hypothetical protein RCOM_0476160 [Ricinus communis]
gi|223537053|gb|EEF38689.1| hypothetical protein RCOM_0476160 [Ricinus communis]
Length = 151
Score = 71.2 bits (173), Expect = 1e-09, Method: Composition-based stats.
Identities = 53/179 (29%), Positives = 70/179 (39%), Gaps = 65/179 (36%)
Query: 58 QTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
QT DHF+Y PES++ E I D++ F E AH+
Sbjct: 2 QTFDHFNYNPESFEE-------------------------EGDIIDDVLFVNFIAELAHR 36
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
FK L++ Y ++ +K K +A
Sbjct: 37 FKGLLL-------------------------------------YIEVITDVKRKLSAVRN 59
Query: 178 PVIAIGASYGGELATWFRLK---YPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFR 233
PVI +G S GG + LK + IG+LA SAPILY D+TP Y VVSKDFR
Sbjct: 60 PVIVVGGSSGGNNSLVINLKNFDLTLLAIGALAPSAPILYSEDLTPHDGYQVVVSKDFR 118
>gi|85112594|ref|XP_964374.1| hypothetical protein NCU00831 [Neurospora crassa OR74A]
gi|28926153|gb|EAA35138.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 561
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 69/239 (28%), Positives = 99/239 (41%), Gaps = 25/239 (10%)
Query: 60 IDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYE 113
+DHF Y P + TF RY ++ ++ G P++ E +Q G Y+
Sbjct: 70 VDHFHNDTKYEPHTNDTFDLRYWFDATYYKKG-GPVIVLAAGETSGVGRLQFLQKGIVYQ 128
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK---- 169
A + VILEHRYYGKS+P + + KN R + + QALAD ++K
Sbjct: 129 LAKATGGVGVILEHRYYGKSLP--TSDFSTKNLR---FLTTDQALADTVYFAKNVKFAGL 183
Query: 170 --DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
A + P IA G SY G + R YP V G+++SS D W Y +
Sbjct: 184 EHLDLTAPNTPYIAYGGSYAGAFVAFLRKLYPDVYWGAISSSGVTEAIYDY--WQYYEAA 241
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
+D C +K +DNI A +K K L N T D +T+
Sbjct: 242 RIYGPKD----CVTATQKLTHVVDNIILNKAN-ARYVQKLKDTFGLGNLTHTDDFANTI 295
>gi|116192591|ref|XP_001222108.1| hypothetical protein CHGG_06013 [Chaetomium globosum CBS 148.51]
gi|88181926|gb|EAQ89394.1| hypothetical protein CHGG_06013 [Chaetomium globosum CBS 148.51]
Length = 553
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/191 (30%), Positives = 85/191 (44%), Gaps = 18/191 (9%)
Query: 32 RLGRIRRSRILEQKDSNHGFETFFYNQTIDHFS----YGPESYQTFPQRYVINSKFWGGG 87
+L R +L +KD + +DHF Y P S +TFP RY ++ + G
Sbjct: 31 QLDRRGEESVLVKKDLQDLYPARTLKVPVDHFHNDTLYEPHSNETFPLRYWFDASHYKKG 90
Query: 88 NSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKN 145
P++ G E + G + A L VILEHRYYG+S P + + +N
Sbjct: 91 -GPVIVLQGGETNGAGRLPFLQKGIVAKLAQATHGLGVILEHRYYGESFP--TPDFSTEN 147
Query: 146 ARHRGYFNSAQALADYASILLHIK----DKYNATHA--PVIAIGASYGGELATWFRLKYP 199
R + + QALAD A H+ + + T A P IA G SY G + R YP
Sbjct: 148 LR---FLTTDQALADMAFFAEHVVFEGLEHLDLTSAKNPYIAYGGSYAGAFVAFLRKLYP 204
Query: 200 HVVIGSLASSA 210
V G+++SS
Sbjct: 205 DVYWGAISSSG 215
>gi|407411104|gb|EKF33307.1| serine carboxypeptidase S28, putative [Trypanosoma cruzi
marinkellei]
Length = 629
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 100/444 (22%), Positives = 164/444 (36%), Gaps = 44/444 (9%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
+ Q +DH G TF QRY ++ W G+ LA L D G+
Sbjct: 60 FRQLVDHSQNGG---STFDQRYWVDYSAWNNGD---LAMLYIRIGSGDFTSPRGYPGIYG 113
Query: 116 HQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT 175
H+ L+ LE RYYGKS+PF + + + Y N AL D +++K
Sbjct: 114 HERNMLLFTLEGRYYGKSLPF----PLTETEKLKKYLNVDIALEDIRGFQKFVEEKLLRK 169
Query: 176 HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT 235
+ +G SY G LA WF+ KYP + +SSA + D ++ +
Sbjct: 170 KLRWLIVGGSYAGALAVWFKAKYPTAALAVWSSSAVVEAQFD------FYGFDGRVKSAI 223
Query: 236 SEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEAAQYDT 295
S EC I S + +F + L D V + A QY
Sbjct: 224 SIECAHEIYAVQSLFSELWENETVRMSFLNRFNIPHYFDKSGILYMMADAV-AGAVQYGK 282
Query: 296 PSNIPVKRICNAIENAPNCGD-----DILCKIAAGVVEADSLEYDGNNSRCYINEDRTGD 350
++C+ I +++ +I D + ++ C N +
Sbjct: 283 K-----WKMCDLITQNNEMDKMARFFNMINRIYGPSFTTDCIY----STECLSNSTMSNQ 333
Query: 351 ESDEGWEW--QSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYG--VSPRPSWVLTY 406
+ G+ W QSCSE+ N + E N ++ C+ +G V P
Sbjct: 334 WVETGYAWFYQSCSELAFFQVGYYNGLRSLEV-NTEYFVNQCRSAFGEPVFPDVFRFNAK 392
Query: 407 YGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH-FFDLSFSQDLNLGSHCLDLDEA 465
+GG + SN++ ++G DP+ G T + + + + G DL
Sbjct: 393 WGGKYPR------ASNVVATHGSSDPWIDSGVTTTNGPGYQVLIANCADCGRSG-DLATP 445
Query: 466 KKSDPDWLVQQRKTEVKIMQGWIT 489
+ +D + L QRK ++ W+T
Sbjct: 446 RPTDSEALELQRKELALLLDTWMT 469
>gi|328723991|ref|XP_003248003.1| PREDICTED: putative serine protease K12H4.7-like [Acyrthosiphon
pisum]
Length = 253
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 59/203 (29%), Positives = 89/203 (43%), Gaps = 12/203 (5%)
Query: 79 INSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGS 138
+N K++ G P+ +G I + +SG E A F A LEHRYYG S P
Sbjct: 1 MNLKYYKMG-GPVFLLVGGSEKILHSWMISGAWIEYAQIFNAACFQLEHRYYGMSHPTDD 59
Query: 139 RKAALKNARHRGYFNSAQALADYASILLHIKDKYNAT--HAPVIAIGASYGGELATWFRL 196
N + Y ++ Q LAD A + I + N A + G+SY G L W L
Sbjct: 60 L-----NTSNLVYLSTEQVLADLAIFINTISIEKNQLLGSAKWVGFGSSYSGSLVAWLIL 114
Query: 197 KYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
KYPH+V +++SS+P+ + Y V K +++ I+++ I +
Sbjct: 115 KYPHLVYAAVSSSSPL---TAKIHFEEYFMAVQKTLSVYNQKYELNIRQANKIISDQLQT 171
Query: 257 PDGLAILSKKFKTCTP-LKNTTE 278
G + KF TC L N T+
Sbjct: 172 DYGAKYIQTKFNTCAHNLNNATK 194
>gi|392565931|gb|EIW59107.1| peptidase S28 [Trametes versicolor FP-101664 SS1]
Length = 548
Score = 70.9 bits (172), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 100/215 (46%), Gaps = 21/215 (9%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGG-GNSPILAFLGAEAPIDDNIQL-- 107
F ++ Q +DHF+ S TF QRY I+++ + ++P++ G E D +
Sbjct: 66 FPAHWFTQPLDHFTNA--SGHTFEQRYWISTRHYRPRPDAPVIVLDGGETSGRDRLPFLD 123
Query: 108 SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLH 167
+G + VILEHRYYG+++P + + R + N+AQ+ AD A+ + +
Sbjct: 124 TGIVEILTKATGGVGVILEHRYYGRTIPV--QNFTTDSLR---WLNNAQSAADSANFMAN 178
Query: 168 IK-----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
+K + A + P I G SY G A ++ YP +V G++ASS + + + W
Sbjct: 179 VKFPGIDEDLAAPNHPWIYYGGSYAGARAAHMKILYPDLVYGAIASSG--VTHAALELWE 236
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKP 257
++ R C Q ++ S ID + P
Sbjct: 237 YAETI----RRAADATCAQHLENSIKIIDALLDVP 267
>gi|326475271|gb|EGD99280.1| hypothetical protein TESG_06549 [Trichophyton tonsurans CBS 112818]
gi|326480381|gb|EGE04391.1| serine peptidase [Trichophyton equinum CBS 127.97]
Length = 551
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 86/347 (24%), Positives = 133/347 (38%), Gaps = 55/347 (15%)
Query: 57 NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN-- 114
+ IDH S T+ R+ IN + + G P+ F EA F N
Sbjct: 71 DMPIDHKS---NKTGTYKHRFWINEQDYKPGG-PVFVFDCGEAAGQRYADKYLFNEANFF 126
Query: 115 ---AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-- 169
+F + +I EHRYYG+S PF H Y N+ QALAD K
Sbjct: 127 RQLTKKFHGIGIIFEHRYYGESTPFPV--TVETPPEHFQYLNNDQALADLPFFAKEFKRK 184
Query: 170 ----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW--TT 223
D P + +G SY G A + R +YP + S ASSAP+ D+ +
Sbjct: 185 AFPNDDLRPNATPWVMVGGSYPGMRAAFTRDRYPETIYASWASSAPVQAQIDMAVYYEQV 244
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL 283
Y +V+ + + C + I+ ++ ID S+ D A + K F L + E
Sbjct: 245 YRGLVAYGWGN----CTKDIRAAYRYIDRQLSRNDTAAAIKKLF-----LGDGAEQASNG 295
Query: 284 D----TVYSEAAQYDTPSNIPVKRICNAIEN-------------APNCGDDILCK----- 321
D + + A + ++ V + CN +E AP GD + +
Sbjct: 296 DFTAALIVAYAGWQSSGADGQVGKFCNWLEVDPKTNKTAPAEGWAPTLGDKAMAERFAAW 355
Query: 322 -IAAGVVEADSLEYDGNNSRCYINEDRTGDESDE----GWEWQSCSE 363
A +V A+++ + E + S++ W WQ CSE
Sbjct: 356 PTLAEMVNANAMTNCKQTDKSKPLECKLDKPSEDPDFISWIWQYCSE 402
>gi|407919835|gb|EKG13057.1| Peptidase S28 [Macrophomina phaseolina MS6]
Length = 563
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 103/435 (23%), Positives = 158/435 (36%), Gaps = 74/435 (17%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL-----SG 109
F + +DHFS G + F R+ + + G P+ + E N +
Sbjct: 64 FVSLPLDHFS-GDAGF--FENRFWVAESGYKGAGHPVFIYDAGETDASQNALFRLQNSTS 120
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRG-YFNSAQALADYASILLHI 168
F + F + ++ EHRYYG+SVP L A Y S QALAD +
Sbjct: 121 FFKQIVDAFGGIGIVWEHRYYGESVPV---NINLDTAPEDFIYLTSEQALADVPVFAANF 177
Query: 169 KDK------YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW- 221
+ P I IG SY G A + R YP + S ASSAP+ D++ +
Sbjct: 178 SRANFPDIDLTPSSTPWIFIGGSYPGMRAAFMREYYPETIFASFASSAPVQAQNDMSVYF 237
Query: 222 -TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE-L 279
Y + + F + C + I + + +D + P A L KF KN+
Sbjct: 238 EPVYRGMNAYGFGN----CSKDIHAAINYMDELMENPAAAATLKTKFLGRNADKNSNAGF 293
Query: 280 KDGLDTVYSEAAQYDTPSNIPVKRICNAIEN------------------APNCGDDILCK 321
D L T++ Y I ++ C+ IE AP+ G + + +
Sbjct: 294 GDALSTIFWYWQSYGV--EISLRPFCDWIETDHGSSNATNSTVAGADGWAPSRGAEYVVE 351
Query: 322 IAA---GVVEADSLEYD----------GNNSRCYIN-EDRTGDESDEGWEWQSCSEMVVP 367
A G E S D N++ N E R D S W WQ C++
Sbjct: 352 QWASYPGFAETVSSSLDTICEGSTPPTDNSTAPECNLEKRFEDPSSISWTWQYCTQWGFL 411
Query: 368 MGKDKNSMYQPEPWNLTKYIKN-CKEQYGVS--------PRPSWVLTYYGGHDIKLILRR 418
+ + WN ++ + C Q+ + PR + GG I+
Sbjct: 412 QSANLGATQIVSRWNSLQHQADICHRQFPTANETLLPAWPRDNATNESLGGWRIR----- 466
Query: 419 STSNIIFSNGMRDPF 433
SN +S G DP+
Sbjct: 467 -PSNTYWSGGEFDPW 480
>gi|296817873|ref|XP_002849273.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238839726|gb|EEQ29388.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 544
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 121/294 (41%), Gaps = 35/294 (11%)
Query: 11 LLLMVIFVSTSFHANGL-KLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHF----SY 65
LLL+V V + +LR + R + +D N F IDHF Y
Sbjct: 8 LLLLVAAVQAKLPVTPISQLRAQAQR-NNELVARSEDVNAQFPAHQIKIPIDHFPKSQRY 66
Query: 66 GPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIV 123
P + + F RY ++ + G P++ G E + I G + A + V
Sbjct: 67 EPHTMEKFNLRYWFDASHYKEG-GPVIILHGGETDGEGRIPFLQKGILAQLAQATNGIGV 125
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK----DKYNATHAPV 179
++EHRYYG S+P +R + K+ R + + QALAD A +IK +KYN T AP
Sbjct: 126 VMEHRYYGGSLP--TRDFSNKSLR---FLTTEQALADTAYFSQNIKFPGLEKYNLT-APG 179
Query: 180 ---IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
I G SY G + R +YP V G+++SS D W + + ++
Sbjct: 180 TAHIVYGGSYAGGQVAFLRTQYPDVFWGAISSSGVTKAIYDY--WQYFEPIR----QEAP 233
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEA 290
++C + +DNI I K T LKN L D ++ A
Sbjct: 234 QDCVHVTQNFVDIVDNI-------IIHGKNANTTQELKNLFGLGRLRDADFANA 280
>gi|119487142|ref|XP_001262426.1| serine peptidase, family S28, putative [Neosartorya fischeri NRRL
181]
gi|119410583|gb|EAW20529.1| serine peptidase, family S28, putative [Neosartorya fischeri NRRL
181]
Length = 550
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 109/448 (24%), Positives = 169/448 (37%), Gaps = 75/448 (16%)
Query: 38 RSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAF--- 94
RS I + DS E + IDH S T+ R+ +N F+ G SPI+ +
Sbjct: 45 RSIIADSVDSTSAAE--YTEMLIDHED---PSVGTYRNRFWVNEDFYVSG-SPIMVYDIG 98
Query: 95 -LGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFN 153
AE + S + +F A+ ++ EHRYYG S+PF + H Y
Sbjct: 99 EATAEYSVSLLTNSSSWLSLLLQEFHAMGIVWEHRYYGDSLPFPVSQD--MPVEHLKYLT 156
Query: 154 SAQALADYASILL------HIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLA 207
+ QALAD H P + IG SY G A + R KYP + + A
Sbjct: 157 TEQALADIPYFAANFSRPNHPDIDLTPRGTPWVMIGGSYPGIRAAFTRNKYPDTIFAAYA 216
Query: 208 SSAPI-------LYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGL 260
SSAP+ +YY I Y ++V+ + + C + I+ + ID+ S +
Sbjct: 217 SSAPVQAQLNMSVYYEQI-----YRAMVANGYSN----CTKDIQAALKYIDDQLSNKETS 267
Query: 261 AILSKKFKTCTPLKNT-TELKDGLDTVYSEAAQYDTPS-NIPVKRICNAIENAP----NC 314
A + + F KN+ + L +Y + S N + CN +E P +
Sbjct: 268 ASIKRLFLGRDAEKNSNADFTTALVALYGPFQAHGLWSGNQSLHDFCNYLELDPATNQSA 327
Query: 315 GDDILCKIAAGVVEAD------------SLEYDGNNSRCYINEDRTGDESDE-------G 355
G + L I A+ + Y N + +E + D S
Sbjct: 328 GPEGLAPIHGSKYVAERWASFPYFISLVNRMYGTNCNGLNASEPLSCDFSQTNTIPELIS 387
Query: 356 WEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKN-CKEQY------GV---SPRPSWVLT 405
W WQ C+E + S + +Y + C Q+ G+ PR +
Sbjct: 388 WTWQYCTEWGFFQSNNFGSHALLSSYQTLEYQQELCNRQFPNAVQAGILPPRPRTESLNE 447
Query: 406 YYGGHDIKLILRRSTSNIIFSNGMRDPF 433
+GG I+ SN+ FS G DP+
Sbjct: 448 EFGGWTIR------PSNVYFSGGQFDPW 469
>gi|378728864|gb|EHY55323.1| extracelular serine carboxypeptidase [Exophiala dermatitidis
NIH/UT8656]
Length = 583
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 68/239 (28%), Positives = 98/239 (41%), Gaps = 26/239 (10%)
Query: 60 IDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYE 113
+DHF Y P S F RY ++ ++ G P++ EA + + G
Sbjct: 96 VDHFHNDSKYEPHSNDYFNLRYFFDATYYEPG-GPVIVLQSGEADATERLPFLQKGIAAI 154
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI----- 168
A + V+LEHRYYG S P + + N R + ++ QALAD A H+
Sbjct: 155 LASATNGIGVVLEHRYYGTSFP--TPDLSTDNLR---FLSTEQALADQAYFASHVVFPGL 209
Query: 169 -KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSV 227
A P IA G SY G L + R+ YP + GS++SS D W Y +
Sbjct: 210 EHLNLTAPGTPYIAYGGSYAGGLVAFLRVLYPDLTWGSISSSGVTEAIYDY--WQYYEPI 267
Query: 228 VSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
EC T +K + +DNI + D + + KT L N T D + V
Sbjct: 268 RQYG----PPECIATQEKLINVVDNILTHNDSTTV--TELKTGFGLGNLTYSDDFANVV 320
>gi|389626267|ref|XP_003710787.1| hypothetical protein MGG_13765 [Magnaporthe oryzae 70-15]
gi|351650316|gb|EHA58175.1| hypothetical protein MGG_13765 [Magnaporthe oryzae 70-15]
Length = 582
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 69/233 (29%), Positives = 101/233 (43%), Gaps = 22/233 (9%)
Query: 60 IDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLS-GFTYEN 114
IDH+ Y P + TFP RY +++F+ G I+ G + + L G Y
Sbjct: 97 IDHYHNETRYEPHADGTFPLRYWFDAQFYKPGGPVIVLSAGETSGVGRLPFLQKGIVYIM 156
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK--- 171
A + VILEHRYYG SVP + + +N R + + QALAD A ++K K
Sbjct: 157 AKALGGVGVILEHRYYGTSVP--TPDFSTENLR---FLTTEQALADTAYFAQNVKFKGLE 211
Query: 172 -YNATHA--PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
Y+ + A P IA G SY G + R YP V G+++SS + D W + +
Sbjct: 212 DYDLSPAATPWIAYGGSYAGAFVAFLRKVYPDVFWGAISSSGVPVAIWDY--WKYFEAAA 269
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
C +T +K +D + G S+K K L+ E D
Sbjct: 270 VYG----PAACVETTQKLTHVVDTLLLGKAGDDPTSQKVKAAFGLEFVKENAD 318
>gi|402588746|gb|EJW82679.1| hypothetical protein WUBG_06411 [Wuchereria bancrofti]
Length = 396
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 89/379 (23%), Positives = 137/379 (36%), Gaps = 51/379 (13%)
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKY-- 172
A ++ A LEHR++G S PF + Y QALAD + ++ + + +
Sbjct: 8 AKKYGAACFYLEHRFFGASQPFEDH-----SMESYKYLTVNQALADIKNFIVQMNEMFFL 62
Query: 173 NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
+ I G SYGG LA WFR + I ++ SSA + D +T V K
Sbjct: 63 DIEKPRWILFGGSYGGALAAWFREMNEELTIAAIVSSAVVQAEVDYYDYTKNLEYVLK-- 120
Query: 233 RDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD------GLDTV 286
+ + C +TI+ S + DG A L K F C P KD +
Sbjct: 121 -EENAPCAETIRLSIKALIEKTYTVDGRAELGKVFNMCEPFTEPPIAKDIQFFLANILYT 179
Query: 287 YSEAAQYDTPSNIP-VKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDG--NNSRCY- 342
+ QY +P V C+ I + GV+ YD + C+
Sbjct: 180 FGGYIQYAGGCRLPDVSYFCDLITDGSETD-------YIGVIWNAWKIYDQIFQSEECFD 232
Query: 343 -INEDRTGDESD-----------EGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNC 390
E D SD W W C+E+ + D +L + C
Sbjct: 233 PSYERHLEDLSDITFMDNEFASYRSWLWLCCTELGFFITTDNGKSIFGSSVSLDYHADKC 292
Query: 391 KE----QYGVSPRPSWV---LTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYH 443
+ QY + V L +GG+D +N +F +G DP+ +
Sbjct: 293 MDVFDVQYDTERARTGVRNTLRTFGGYD-----NYKGTNTVFVSGSYDPWKSACCLNCTD 347
Query: 444 FFDLSFSQDLNLGSHCLDL 462
+S + GSHC+D+
Sbjct: 348 ITRNVYSVIIEGGSHCVDV 366
>gi|336275849|ref|XP_003352678.1| hypothetical protein SMAC_01511 [Sordaria macrospora k-hell]
gi|380094568|emb|CCC07948.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 570
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/163 (31%), Positives = 75/163 (46%), Gaps = 18/163 (11%)
Query: 60 IDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYE 113
+DHF Y P + TF RY ++ ++ G P++ E +Q G Y+
Sbjct: 66 VDHFHNDTKYEPHTNDTFDLRYWFDATYYKKG-GPVIVLAAGETSGVGRLQFLQKGIVYQ 124
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK---- 169
A + VILEHRYYGKS+P + + KN R + + QALAD ++K
Sbjct: 125 LAKATGGVGVILEHRYYGKSLP--TSDFSTKNLR---FLTTDQALADTVYFAKNVKFAGL 179
Query: 170 --DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
A + P IA G SY G + R YP V G+++SS
Sbjct: 180 EHLDLTAPNTPYIAYGGSYAGAFVAFLRKLYPDVYWGAISSSG 222
>gi|380490188|emb|CCF36185.1| serine carboxypeptidase S28 [Colletotrichum higginsianum]
Length = 565
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 64/216 (29%), Positives = 90/216 (41%), Gaps = 25/216 (11%)
Query: 51 FETFFYNQTIDHFS----YGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQ 106
++ + + +DHF Y P S TFP RY + +F+ G P++A E +
Sbjct: 71 YQAYNLSVPVDHFHNESMYAPHSNGTFPLRYWYDDRFYKPGG-PVIALAAGETNGAGRLP 129
Query: 107 L--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
G A L VILEHRYYG S P + + KN R + + QALAD A
Sbjct: 130 FLQKGIVAILAEATNGLGVILEHRYYGSSYP--TPDLSTKNLR---FLTTDQALADTAYF 184
Query: 165 LLHI-------KDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGD 217
+I + A P I G SY G + R YP V G+++SS D
Sbjct: 185 AKNIVFPGKLAELDLTAPGTPWIMYGGSYAGAFVAFLRKVYPEVFWGAISSSGVTAAVVD 244
Query: 218 ITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNI 253
W Y + + E C +T +K +DNI
Sbjct: 245 F--WEYYEAARLY----SPEGCAETTQKLTHVVDNI 274
>gi|408400451|gb|EKJ79531.1| hypothetical protein FPSE_00216 [Fusarium pseudograminearum CS3096]
Length = 537
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 81/164 (49%), Gaps = 18/164 (10%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTY 112
F+N+TI Y P S + FP RY +++++ G P++ E +D I G
Sbjct: 49 FHNETI----YEPHSDKKFPLRYWFDAQYYRKGG-PVIILASGETSGEDRIPFLEHGILQ 103
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK--- 169
A+ + VILEHRYYG S P K +N R + ++ QALAD A H++
Sbjct: 104 MLANATGGIGVILEHRYYGTSFPVPDLKP--ENMR---FLSTEQALADTAYFAQHVEFPG 158
Query: 170 -DKYN--ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
+++N A+ P I G SY G A + R YP + G ++SS
Sbjct: 159 MEEHNLTASTTPYIIYGGSYAGAFAAFARKIYPDLFWGGISSSG 202
>gi|326482280|gb|EGE06290.1| extracelular serine carboxypeptidase [Trichophyton equinum CBS
127.97]
Length = 545
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 121/294 (41%), Gaps = 35/294 (11%)
Query: 11 LLLMVIFVSTSFHANGL-KLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFS----Y 65
LLL+V V + +LR R ++ + +D N F IDHF Y
Sbjct: 8 LLLLVAAVQAKLPVTPISQLRAESHR-NKALVARSQDVNAAFPAHTIQIPIDHFPKSSRY 66
Query: 66 GPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIV 123
P + F RY ++ + G P++ G E + I G + A + V
Sbjct: 67 EPHTTAKFNLRYWFDASHYKEG-GPVIILHGGETSGEGRIPFLQKGILAQLAQATNGIGV 125
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK----DKYNATHAPV 179
I+EHRYYG S+P +R + K+ R + + QALAD A +IK +KYN T AP
Sbjct: 126 IMEHRYYGGSLP--TRDFSNKSLR---FLTTEQALADTAYFSKNIKFPGLEKYNLT-APG 179
Query: 180 IA---IGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
A G SY G + R +YP + G+++SS D W + + ++
Sbjct: 180 TAHILYGGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDY--WQYFEPIR----QEAP 233
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEA 290
++C + +DNI I K T LKN L D ++ A
Sbjct: 234 QDCVHVTQNFVDIVDNI-------IIHGKNANTTRELKNLFGLGRLRDADFANA 280
>gi|46110645|ref|XP_382380.1| hypothetical protein FG02204.1 [Gibberella zeae PH-1]
Length = 537
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 81/164 (49%), Gaps = 18/164 (10%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTY 112
F+N+TI Y P S + FP RY +++++ G P++ E +D I G
Sbjct: 49 FHNETI----YEPHSDKKFPLRYWFDAQYYRKGG-PVIILASGETSGEDRIPFLEHGILQ 103
Query: 113 ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK--- 169
A+ + VILEHRYYG S P K +N R + ++ QALAD A H++
Sbjct: 104 MLANATGGIGVILEHRYYGTSFPVPDLKP--ENMR---FLSTEQALADTAYFAQHVEFPG 158
Query: 170 -DKYN--ATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
+++N A+ P I G SY G A + R YP + G ++SS
Sbjct: 159 MEEHNLTASTTPYIIYGGSYAGAFAAFARKIYPDLFWGGISSSG 202
>gi|259483536|tpe|CBF79006.1| TPA: serine peptidase, family S28, putative (AFU_orthologue;
AFUA_4G03790) [Aspergillus nidulans FGSC A4]
Length = 557
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 117/292 (40%), Gaps = 51/292 (17%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLG---AEAPIDDNIQLSGFTYENAH 116
IDH + + TF RY +N ++ G +L +G E + S F
Sbjct: 87 IDHHN---PAIGTFRNRYWVNDAYYVPGGPVVLYDVGEADGEPSVAHLRSNSSFLPPILQ 143
Query: 117 QFKALIVILEHR-----------------YYGKSVPFGSRKAALKNARHRGYFNSAQALA 159
+F A+ ++ EHR YYG S+P+ L H Y + QALA
Sbjct: 144 EFGAIGIVWEHRSVVSTKVETRIEIDRNRYYGDSLPYPVNNNTLP--EHLIYLTTRQALA 201
Query: 160 DYASILLHIK----DKYNATHA--PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI- 212
D + + +Y+ T A P + IG SY G A + R +YP + S A+SAP+
Sbjct: 202 DIPAFAENFTRPGLQEYDLTPASTPWVMIGGSYPGARAAFARNEYPDTIFASFAASAPVQ 261
Query: 213 ------LYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKK 266
+YY + Y S+V+ FR C I + IDN S+ D A + +
Sbjct: 262 AQINMSIYYEQV-----YRSMVANGFRS----CASNIHAALEYIDNQLSQNDTAASIKQL 312
Query: 267 FKTCTPLKNTTE-LKDGLDTVYSEAAQY--DTPSNIPVKRICNAIENAPNCG 315
F KN+ E L +Y Y D P+ + C +E+ P G
Sbjct: 313 FFGPGAEKNSNEDFTHALAVIYGSFQAYGMDGPAG-SLGEFCRYLESDPVTG 363
>gi|409047038|gb|EKM56517.1| hypothetical protein PHACADRAFT_183166 [Phanerochaete carnosa
HHB-10118-sp]
Length = 522
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 122/537 (22%), Positives = 209/537 (38%), Gaps = 97/537 (18%)
Query: 11 LLLMVIFVSTSF------HANGLK---LRPRLG------RIRRSRILEQKDSNHGFETFF 55
LL+++ ++TSF +AN + LRP G + +L ++ T++
Sbjct: 7 LLVLLPVLATSFAQIRRPNANIPRPPALRPLTGLDGPVYHVSTGTVLPPLNT-----TYY 61
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENA 115
++Q IDH + S TF QRY +++ G++ +LG N ++G + A
Sbjct: 62 FDQLIDHTN---PSLGTFKQRYWHTWEWYEEGDT---GYLG-------NRSING---QLA 105
Query: 116 HQFKALIVILEHRYYGKSVPFGSRK-AALKNARHRGYFNSAQALADYASILLHIKDKYNA 174
Q ++LEHRYYG S PF +LK + + + AD + + D
Sbjct: 106 QQEHGATIVLEHRYYGLSNPFSDMSDRSLKYHTIQQAIDDLEYFADNVKLPMPGGDNVGP 165
Query: 175 THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
T AP + IG SY G L +W + P V ASSA + D W + V ++
Sbjct: 166 TEAPWVLIGGSYSGALTSWTMVNKPGVFRAGYASSAVVEAMVDF--WQYFEPVR----QN 219
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYS-----E 289
+ C ++ + IDN+ + + I K L T L D + +
Sbjct: 220 MPQNCSADVEAVIAHIDNVFTSGNQSQI--NHIKELFGLSELTHLDDVAGALRYNLWDWQ 277
Query: 290 AAQYDTPSNIPVKRICNAIE-----NAPNCGDDILCKIAA-------GVVE-------AD 330
+ DT N C+A+E +AP G + + A G ++ AD
Sbjct: 278 SLAPDTGPNSTFFEFCDALEVKNGTSAPVSGWGVDHALPAWGAFWTNGYLDNLCSDSIAD 337
Query: 331 SLEYDGNNSRCYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNC 390
L +++ Y N D + W W C+ +G + S P +T+ +K
Sbjct: 338 CLGTYNMSNQLYTNISIGND--NRAWTWIVCNN----VGWYQESPPAGYPSVVTQLVKPS 391
Query: 391 KEQY------------GVSPRPSWVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRG 436
++ G P P+ T Y G +++ + + F+NG RDP+
Sbjct: 392 YDERQCAYFFPETFPNGSLPVPNVAQTNVAYDGWGVQV------NQLFFANGRRDPWKYA 445
Query: 437 GWVKTYHFFDLSFSQDLNL--GSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWITQY 491
F + Q + L G HC DL A + ++ ++ + GW+ +
Sbjct: 446 TVSSPDTFVPSTAEQPIALSDGFHCSDLVTANGEADATVKAVQQQGLQAIHGWLQGF 502
>gi|440486121|gb|ELQ66017.1| hypothetical protein OOW_P131scaffold00435g7 [Magnaporthe oryzae
P131]
Length = 548
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 69/233 (29%), Positives = 101/233 (43%), Gaps = 22/233 (9%)
Query: 60 IDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLS-GFTYEN 114
IDH+ Y P + TFP RY +++F+ G I+ G + + L G Y
Sbjct: 63 IDHYHNETRYEPHADGTFPLRYWFDAQFYKPGGPVIVLSAGETSGVGRLPFLQKGIVYIM 122
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK--- 171
A + VILEHRYYG SVP + + +N R + + QALAD A ++K K
Sbjct: 123 AKALGGVGVILEHRYYGTSVP--TPDFSTENLR---FLTTEQALADTAYFAQNVKFKGLE 177
Query: 172 -YNATHA--PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVV 228
Y+ + A P IA G SY G + R YP V G+++SS + D W + +
Sbjct: 178 DYDLSPAATPWIAYGGSYAGAFVAFLRKVYPDVFWGAISSSGVPVAIWDY--WKYFEAAA 235
Query: 229 SKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
C +T +K +D + G S+K K L+ E D
Sbjct: 236 VYG----PAACVETTQKLTHVVDTLLLGKAGDDPTSQKVKAAFGLEFVKENAD 284
>gi|326475086|gb|EGD99095.1| hypothetical protein TESG_06450 [Trichophyton tonsurans CBS 112818]
Length = 565
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 121/294 (41%), Gaps = 35/294 (11%)
Query: 11 LLLMVIFVSTSFHANGL-KLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFS----Y 65
LLL+V V + +LR R ++ + +D N F IDHF Y
Sbjct: 8 LLLLVAAVQAKLPVTPISQLRAESHR-NKALVARSQDVNAAFPAHTIQIPIDHFPKSSRY 66
Query: 66 GPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIV 123
P + F RY ++ + G P++ G E + I G + A + V
Sbjct: 67 EPHTTAKFNLRYWFDASHYKEG-GPVIILHGGETSGEGRIPFLQKGILAQLAQATNGIGV 125
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK----DKYNATHAPV 179
I+EHRYYG S+P +R + K+ R + + QALAD A +IK +KYN T AP
Sbjct: 126 IMEHRYYGGSLP--TRDFSNKSLR---FLTTEQALADTAYFSKNIKFPGLEKYNLT-APG 179
Query: 180 IA---IGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
A G SY G + R +YP + G+++SS D W + + ++
Sbjct: 180 TAHILYGGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDY--WQYFEPIR----QEAP 233
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEA 290
++C + +DNI I K T LKN L D ++ A
Sbjct: 234 QDCVHVTQNFVDIVDNI-------IIHGKNANTTRELKNLFGLGRLRDADFANA 280
>gi|302890357|ref|XP_003044063.1| hypothetical protein NECHADRAFT_66929 [Nectria haematococca mpVI
77-13-4]
gi|256724982|gb|EEU38350.1| hypothetical protein NECHADRAFT_66929 [Nectria haematococca mpVI
77-13-4]
Length = 542
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 96/223 (43%), Gaps = 32/223 (14%)
Query: 6 FSFQWLL------LMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQT 59
+ +WLL L V H + + R G R SR ++ G +
Sbjct: 1 MALRWLLGSLAAGLFVFSAEAKIHGSSTEAR---GLHRLSR-----RADDGPRAVNISVP 52
Query: 60 IDHFS----YGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYE 113
+DHF Y P S + FP RY +++++ G P++ E +D + G
Sbjct: 53 VDHFHNDTLYEPHSDEKFPLRYWFDAQYYKEGG-PVIILASGETSGEDRLPFLDHGILKM 111
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK---- 169
A + VILEHRYYG S P K KN R + ++ QALAD A ++K
Sbjct: 112 LAEATNGIGVILEHRYYGTSFPVPDLKT--KNLR---FLSTEQALADTAYFAENVKFPGL 166
Query: 170 DKYNATHA--PVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
++ N T A P I G SY G A + R YP + G ++SS
Sbjct: 167 EERNLTAASTPYIIYGGSYAGAFAAFARKIYPDIFWGGISSSG 209
>gi|119497857|ref|XP_001265686.1| serine peptidase, putative [Neosartorya fischeri NRRL 181]
gi|119413850|gb|EAW23789.1| serine peptidase, putative [Neosartorya fischeri NRRL 181]
Length = 525
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 59/215 (27%), Positives = 89/215 (41%), Gaps = 29/215 (13%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN- 114
++Q IDH + G TF QRY N ++W G SP++ F E+ D GF
Sbjct: 45 FDQYIDHNNPG---LGTFSQRYWYNPEYWAGPGSPVVLFTPGES---DAADYDGFLTNKT 98
Query: 115 -----AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
A + +++LEHRY+G S P+ + A Y Q++AD +
Sbjct: 99 IVGRFAEEIGGAVILLEHRYWGASSPYPNLTAETLQ-----YLTLEQSIADLVHFAKTVN 153
Query: 170 DKY------NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTT 223
+ NA +AP + G SY G LA W P ASSAP+ D W
Sbjct: 154 LPFDEHHSSNADNAPWVMTGGSYSGALAAWTASIAPGTFWAYHASSAPVQAIYDF--WQY 211
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPD 258
+ VV + C + + + ID++ + D
Sbjct: 212 FVPVV----EGMPKNCSKDVNRVVEYIDHVYASGD 242
>gi|226443172|ref|NP_001140069.1| Thymus-specific serine protease precursor [Salmo salar]
gi|221221650|gb|ACM09486.1| Thymus-specific serine protease precursor [Salmo salar]
Length = 148
Score = 69.7 bits (169), Expect = 3e-09, Method: Composition-based stats.
Identities = 30/78 (38%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 57 NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAH 116
+Q +DHF ++ +TFPQR+ +N +W + P+ F+G E PI + L+G + A
Sbjct: 63 HQPLDHFD--SQNDETFPQRFFVNEAYWERPHGPVFLFIGGEGPISEFNVLAGHHVDMAE 120
Query: 117 QFKALIVILEHRYYGKSV 134
+ AL+V LEHR+YG+S+
Sbjct: 121 EHGALLVALEHRFYGESI 138
>gi|358060630|dbj|GAA93671.1| hypothetical protein E5Q_00316 [Mixia osmundae IAM 14324]
Length = 544
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 64/223 (28%), Positives = 101/223 (45%), Gaps = 36/223 (16%)
Query: 28 KLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGP-ESYQTFPQRYVINSKFWGG 86
K R +++R+ L K+ ++ + IDHF P E+ +F RY + K++
Sbjct: 23 KASSRNAQVKRAAPLAHKE-------YYIDTPIDHFPTNPSENVGSFKLRYFFSDKYFDR 75
Query: 87 GN-SPILAFLGAEAPIDDNIQLSGFTY----ENAHQFKALIVILEHRYYGKSVP------ 135
PI F GAE +D + + Y + A ++VILE RYYGKS P
Sbjct: 76 AKPGPIYLFDGAE--VDAEVMIDYLDYSWMMDAAKLTGGMVVILEQRYYGKSQPFSDYST 133
Query: 136 ----FGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPVIAIGASYGGELA 191
F S ++++A+H F + A Y ++ L K+ A +I +G SYGG A
Sbjct: 134 DSMRFSSTLQSIEDAKH---FATFATYAGYENLDLTYKN------AMIIYVGVSYGGAKA 184
Query: 192 TWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRD 234
R KY + G++A SA + + W Y +V+ RD
Sbjct: 185 AIARNKYGDIFAGAVAVSAVTESF--LNYWQFYGAVIEHGIRD 225
>gi|156058175|ref|XP_001595011.1| hypothetical protein SS1G_04819 [Sclerotinia sclerotiorum 1980]
gi|154702604|gb|EDO02343.1| hypothetical protein SS1G_04819 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 440
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 74/257 (28%), Positives = 111/257 (43%), Gaps = 32/257 (12%)
Query: 57 NQTIDHFS----YGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPID---DNIQLSG 109
+Q IDHF Y P + TF QRY ++ ++ G PI ++G E N+Q +G
Sbjct: 22 DQPIDHFPNDPMYAPHTNATFKQRYWFDATYYKPG-GPIYLYIGGETNGQYRFSNLQ-TG 79
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK 169
L +ILE+RYYG+S PF + Y + Q +AD A H+
Sbjct: 80 IIQILMEATNGLGIILENRYYGESFPFNT-----STTDQLAYLTNQQTVADNAYFAQHV- 133
Query: 170 DKYNATHAPVIAIGAS---YGGELA---TWFRLK-YPHVVIGSLASSAPILYYGDITPWT 222
+A + A YGG LA T +K YP V G +ASSAPI W
Sbjct: 134 -SLPGVNASITAPNTKWILYGGSLAGGQTALSVKIYPEVFFGGIASSAPIKAVVGYPEW- 191
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD- 281
Y+ + R ++C +I + D + S + AI K+FK+ L+ T+ +D
Sbjct: 192 -YNPIQ----RLGPQDCISSINGIIDKFDALISANNTQAI--KQFKSLFGLEALTDNRDF 244
Query: 282 GLDTVYSEAAQYDTPSN 298
+ + D P+N
Sbjct: 245 AMTIAFPLGGPMDYPTN 261
>gi|327298966|ref|XP_003234176.1| hypothetical protein TERG_04769 [Trichophyton rubrum CBS 118892]
gi|326463070|gb|EGD88523.1| hypothetical protein TERG_04769 [Trichophyton rubrum CBS 118892]
Length = 553
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 124/495 (25%), Positives = 183/495 (36%), Gaps = 98/495 (19%)
Query: 11 LLLMVIFVSTSFHANGL-KLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFS----Y 65
LLL+V V + +LR R ++ + +D N F IDHF Y
Sbjct: 25 LLLLVAAVQAKLPVTPISQLRAESHR-NKALVARSQDVNAAFPAHTIQIPIDHFPKSSRY 83
Query: 66 GPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIV 123
P + + F RY ++ + G P++ G E + I G + A + V
Sbjct: 84 EPHTTEKFNLRYWFDASHYKEGG-PVIILHGGETSGEGRIPFLQKGILAQLAQATNGIGV 142
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK----DKYNATHAPV 179
I+EHRYYG S+P N R + + QALAD A +IK +KYN T AP
Sbjct: 143 IMEHRYYGGSLP----TPDFSNKSLR-FLTTEQALADTAYFSKNIKFPGLEKYNLT-APG 196
Query: 180 ---IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
I G SY G + R +YP + G+++SS D W + + ++
Sbjct: 197 TAHIVYGGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDY--WQYFEPIR----QEAP 250
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKN------------TTELKDGL- 283
++C + +DNI I K T LKN L G+
Sbjct: 251 QDCVHVTQNFVDIVDNI-------IIHGKNANTTKELKNLFGLGRLRDADFANALSSGIT 303
Query: 284 -------DTVYSEAAQYDTPSNI-------PV--------KRICNA---IENAPNCGDDI 318
D S + Y S I PV KRI A AP +
Sbjct: 304 GWQSTNWDPAISGKSFYQYCSEITSDRYLYPVTAQQKASAKRIIEAGGHGREAPEILPQL 363
Query: 319 LCKIAAGVVEADSLEY---DGNNSRCYIN-------EDRTGDESDEGWEWQSCSE---MV 365
L + G + +LE G + +N + D+S W WQ C+E +
Sbjct: 364 LNFV--GWLNKSTLESCSGQGQTAEECLNSYDEAFYKQDNADQSWRAWPWQYCNEWGYLQ 421
Query: 366 VPMGKDKNSMYQPEPWNLTKYIKN-CKEQYGVSPRPSWV--LTYYGGHDIKLILRRSTSN 422
G KN Y N CK+ +G++ +PS V + YG DI+
Sbjct: 422 TGSGTPKNIRPVISRLIDLPYTSNICKQAFGIT-KPSNVDLVNKYGAFDIEY------DR 474
Query: 423 IIFSNGMRDPFSRGG 437
+ F +G DP+ G
Sbjct: 475 LAFIDGGSDPWKEAG 489
>gi|401886138|gb|EJT50197.1| hypothetical protein A1Q1_00498 [Trichosporon asahii var. asahii
CBS 2479]
Length = 643
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 111/267 (41%), Gaps = 49/267 (18%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP----IDDNIQ 106
F F ++Q + HF G + +TF QRY ++S+++ G P+ G E DD+I
Sbjct: 110 FGPFCFDQKVSHFDEGEK--RTFCQRYWVDSQYYVEG-GPVFILDGGETDGANRADDSIP 166
Query: 107 L--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
G ++ + VILEHRYYG S + + + N R + N+ ++L D A
Sbjct: 167 FMEQGILQILSNATGGMSVILEHRYYGAS--YVTEDLSTDNLR---WLNNKESLEDSAEF 221
Query: 165 L------------LHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASS--- 209
+ L KD + P I G SY G A + R +YP +V G + SS
Sbjct: 222 IRKFPVPKDVQKKLKNKDVFKPDCTPYIYYGGSYAGARAAFMRKEYPDLVFGGIGSSGVT 281
Query: 210 -APILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFK 268
A + Y P TY C + +++S + +D +L K +
Sbjct: 282 HAQVYYPEYFDPIITYGEPA----------CIKAMEESIASVDE---------LLDKSAE 322
Query: 269 TCTPLKNTTELKDGLDTVYSEAAQYDT 295
T LK L+ D ++ QY T
Sbjct: 323 TNAALKKLFGLEALNDVDFAGVLQYPT 349
>gi|391873795|gb|EIT82803.1| hydrolytic enzymes of the alpha/beta hydrolase fold protein
[Aspergillus oryzae 3.042]
Length = 592
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 76/277 (27%), Positives = 111/277 (40%), Gaps = 43/277 (15%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEA---PIDDNIQLSGFTY---- 112
IDH S T+ R+ +N ++ G PI+ + E I N S ++
Sbjct: 76 IDH---NDTSVGTYQNRFWVNDDYYKAGR-PIIMYDAGETNAESIAKNHLTSSLSFFRKI 131
Query: 113 -ENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDK 171
E+ H A+ +I EHRYYG S PF + H Y + QAL D +
Sbjct: 132 LEDTH---AMGIIWEHRYYGNSTPFPISRDT--PPEHFKYLTTKQALEDIPYFARNFSRP 186
Query: 172 YNATH------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI-------LYYGDI 218
A H P + +G SY G A + R KYP V+ + +SSAP+ +YY +
Sbjct: 187 KFAEHDLTPSSTPWVLVGGSYAGIRAAFARNKYPDVIFAAYSSSAPVQAQLNMSIYYDQV 246
Query: 219 TPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE 278
Y +V F E C + I + ID S A + K F +N+ E
Sbjct: 247 -----YRGLVGHGF----ENCAKDIHAALGYIDQQLSNNHTAAAIKKLFFGPGADQNSNE 297
Query: 279 -LKDGLDTVYSEAAQY--DTPSNIPVKRICNAIENAP 312
L T+YS Y D P ++ +C +E P
Sbjct: 298 GFTAALATIYSYFQNYGLDGPEGT-LRELCEHLEVDP 333
>gi|170040235|ref|XP_001847912.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167863800|gb|EDS27183.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 492
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 66/264 (25%), Positives = 114/264 (43%), Gaps = 43/264 (16%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD-NIQLSGFTYE 113
F+ IDHF P++ + RY+ + ++ G PIL +LG API + S Y+
Sbjct: 64 FFTTRIDHFD--PQNTAEWTLRYLAVTDYYQPG-GPILIWLGGNAPIQPYMVDESSLIYD 120
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYN 173
A + +G S + + + +N R + N+ Q LAD A + +++ +
Sbjct: 121 MAREM-----------HGAS--WVTSDTSTENLR---FLNTDQILADLAEFVTYLRREVT 164
Query: 174 AT-HAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDF 232
+A V+ G YGG LATWFR++YPH+ + +S D + DF
Sbjct: 165 RNENAHVLVSGVGYGGSLATWFRVRYPHLADAAWSSGGLHNALMDFQEFAEAWGQTLIDF 224
Query: 233 RDTSEECFQTIKKSWSEIDNI--ASKPDGLAILSKKFKTCT---------------PLKN 275
S+EC+ I ++ + N+ A + + IL ++ CT +
Sbjct: 225 --GSQECYNEIFVAFHVMQNLIDAGREE---ILHERLNLCTEIDTEDRLQVQFFFITMMT 279
Query: 276 TTELKDGLDTVYSEAAQYDTPSNI 299
+ EL D V ++ DTP+ +
Sbjct: 280 SIELLDSFSEVCNDLTGVDTPTAL 303
>gi|406697786|gb|EKD01038.1| hypothetical protein A1Q2_04659 [Trichosporon asahii var. asahii
CBS 8904]
Length = 643
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 71/267 (26%), Positives = 111/267 (41%), Gaps = 49/267 (18%)
Query: 51 FETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAP----IDDNIQ 106
F F ++Q + HF G + +TF QRY ++S+++ G P+ G E DD+I
Sbjct: 110 FGPFCFDQKVSHFDEGEK--RTFCQRYWVDSQYYVEG-GPVFILDGGETDGANRADDSIP 166
Query: 107 L--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI 164
G ++ + VILEHRYYG S + + + N R + N+ ++L D A
Sbjct: 167 FMEQGILQILSNATGGMSVILEHRYYGAS--YVTEDLSTDNLR---WLNNKESLEDSAEF 221
Query: 165 L------------LHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASS--- 209
+ L KD + P I G SY G A + R +YP +V G + SS
Sbjct: 222 IRKFPVPKDVQKKLKNKDVFKPDCTPYIYYGGSYAGARAAFMRKEYPDLVFGGIGSSGVT 281
Query: 210 -APILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFK 268
A + Y P TY C + +++S + +D +L K +
Sbjct: 282 HAQVYYPEYFDPIITYGEPA----------CIKAMEESIASVDE---------LLDKSAE 322
Query: 269 TCTPLKNTTELKDGLDTVYSEAAQYDT 295
T LK L+ D ++ QY T
Sbjct: 323 TNAALKKLFGLEALNDVDFAGVLQYPT 349
>gi|312383570|gb|EFR28611.1| hypothetical protein AND_03266 [Anopheles darlingi]
Length = 359
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 70/132 (53%), Gaps = 8/132 (6%)
Query: 151 YFNSAQALADYASILLHIKDKY-NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASS 209
+ N+ QA+AD A + ++K + +A VI +G YGG LATWFR KYPH+ G SS
Sbjct: 3 FLNADQAMADLAEWITYLKQTFVRNPNAKVILMGTGYGGALATWFRQKYPHLADGVWVSS 62
Query: 210 APILYYGDITPWTTYHSVVSKDFRD-TSEECFQTIKKSWSEIDNIASKPDGLA-ILSKKF 267
I ++ Y+ + + R+ S+ C+ TI + N+ S G + +LS++F
Sbjct: 63 GAIEAN---FAFSGYNEALGESIREYGSDACYSTIWTGFRVAQNMVSL--GFSDLLSEEF 117
Query: 268 KTCTPLKNTTEL 279
C PL +EL
Sbjct: 118 HLCDPLDTDSEL 129
>gi|238492811|ref|XP_002377642.1| serine peptidase, family S28, putative [Aspergillus flavus
NRRL3357]
gi|220696136|gb|EED52478.1| serine peptidase, family S28, putative [Aspergillus flavus
NRRL3357]
Length = 592
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 112/278 (40%), Gaps = 45/278 (16%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEA---PIDDNIQLSGFTY---- 112
IDH S T+ R+ +N ++ G PI+ + E I N S ++
Sbjct: 76 IDH---NDTSVGTYQNRFWVNDDYYEAGR-PIIMYDAGETNAESIAKNHLTSSLSFFRKI 131
Query: 113 -ENAHQFKALIVILEHRYYGKSVPFG-SRKAALKNARHRGYFNSAQALADYASILLHIKD 170
E+ H A+ +I EHRYYG S PF SR H Y + QAL D +
Sbjct: 132 LEDTH---AMGIIWEHRYYGNSTPFPISRDTP---PEHFKYLTTKQALEDIPYFARNFSR 185
Query: 171 KYNATH------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI-------LYYGD 217
A H P + +G SY G A + R KYP V+ + +SSAP+ +YY
Sbjct: 186 PKFAEHDLTPSSTPWVLVGGSYAGIRAAFARNKYPDVIFAAYSSSAPVQAQLNMSIYYDQ 245
Query: 218 ITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT 277
+ Y +V F E C + I + ID S A + K F +N+
Sbjct: 246 V-----YRGLVGHGF----ENCAKDIHAALGYIDQQLSNNHTAAAIKKLFFGPGADQNSN 296
Query: 278 E-LKDGLDTVYSEAAQY--DTPSNIPVKRICNAIENAP 312
E L T+YS Y D P ++ +C +E P
Sbjct: 297 EGFTAALATIYSYFQNYGLDGPEGT-LRELCEHLEVDP 333
>gi|67541941|ref|XP_664738.1| hypothetical protein AN7134.2 [Aspergillus nidulans FGSC A4]
gi|40742196|gb|EAA61386.1| hypothetical protein AN7134.2 [Aspergillus nidulans FGSC A4]
Length = 1157
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 75/280 (26%), Positives = 112/280 (40%), Gaps = 48/280 (17%)
Query: 72 TFPQRYVINSKFWGGGNSPILAFLG---AEAPIDDNIQLSGFTYENAHQFKALIVILEHR 128
TF RY +N ++ G +L +G E + S F +F A+ ++ EHR
Sbjct: 96 TFRNRYWVNDAYYVPGGPVVLYDVGEADGEPSVAHLRSNSSFLPPILQEFGAIGIVWEHR 155
Query: 129 -----------------YYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-- 169
YYG S+P+ L H Y + QALAD + +
Sbjct: 156 SVVSTKVETRIEIDRNRYYGDSLPYPVNNNTLPE--HLIYLTTRQALADIPAFAENFTRP 213
Query: 170 --DKYNATHA--PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI-------LYYGDI 218
+Y+ T A P + IG SY G A + R +YP + S A+SAP+ +YY +
Sbjct: 214 GLQEYDLTPASTPWVMIGGSYPGARAAFARNEYPDTIFASFAASAPVQAQINMSIYYEQV 273
Query: 219 TPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTE 278
Y S+V+ FR C I + IDN S+ D A + + F KN+ E
Sbjct: 274 -----YRSMVANGFRS----CASNIHAALEYIDNQLSQNDTAASIKQLFFGPGAEKNSNE 324
Query: 279 -LKDGLDTVYSEAAQY--DTPSNIPVKRICNAIENAPNCG 315
L +Y Y D P+ + C +E+ P G
Sbjct: 325 DFTHALAVIYGSFQAYGMDGPAG-SLGEFCRYLESDPVTG 363
>gi|327294299|ref|XP_003231845.1| serine peptidase, family S28 [Trichophyton rubrum CBS 118892]
gi|326465790|gb|EGD91243.1| serine peptidase, family S28 [Trichophyton rubrum CBS 118892]
Length = 551
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 89/347 (25%), Positives = 136/347 (39%), Gaps = 61/347 (17%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN----- 114
IDH S T+ R+ IN + + G P+ F EA F N
Sbjct: 74 IDHKS---NKTGTYKHRFWINEQDYKPG-GPVFVFDCGEAAGQRYADRYLFNETNFFRQL 129
Query: 115 AHQFKALIVILEHRYYGKSVPFGSRKAALKNA-RHRGYFNSAQALADYASILLHIKDKY- 172
+F + +I EHRYYG+S PF +K H Y N+ QALAD K K
Sbjct: 130 TKKFHGIGIIFEHRYYGESTPF---PVTVKTPPEHFQYLNNDQALADLPYFAKEFKRKAF 186
Query: 173 -------NATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW--TT 223
NAT P + +G SY G + + R +YP + S ASSAP+ D+ +
Sbjct: 187 PNNDLRPNAT--PWVMVGGSYPGMRSAFTRDRYPETIYASWASSAPVQAQIDMAVYYEQV 244
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL 283
Y +V+ + + C + I+ ++ ID S+ D A + K F L E
Sbjct: 245 YRGLVAYGWGN----CTKDIRAAYRYIDRQLSRNDTAAAIKKLF-----LGEGAEQASNG 295
Query: 284 D----TVYSEAAQYDTPSNIPVKRICNAIEN-------------APNCGDDILCK----- 321
D + + A + ++ V + CN +E AP GD + +
Sbjct: 296 DFTAALIVAYAGWQSSGADGQVGKFCNWLEVDPKTNKTAPAEGWAPTLGDKAMAERFAAW 355
Query: 322 -IAAGVVEADSLEYDGNNSRCYINEDRTGDESDE----GWEWQSCSE 363
A +V A+++ + E + S++ W WQ CSE
Sbjct: 356 PTLAEMVNANAMTNCKQTDKSKPLECKLDKPSEDPDFISWIWQYCSE 402
>gi|317157591|ref|XP_001825996.2| serine peptidase, family S28 [Aspergillus oryzae RIB40]
Length = 580
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 78/278 (28%), Positives = 112/278 (40%), Gaps = 45/278 (16%)
Query: 60 IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEA---PIDDNIQLSGFTY---- 112
IDH S T+ R+ +N ++ G PI+ + E I N S ++
Sbjct: 76 IDH---NDTSVGTYQNRFWVNDDYYEAGR-PIIMYDAGETNAESIAKNHLTSSLSFFRKI 131
Query: 113 -ENAHQFKALIVILEHRYYGKSVPFG-SRKAALKNARHRGYFNSAQALADYASILLHIKD 170
E+ H A+ +I EHRYYG S PF SR H Y + QAL D +
Sbjct: 132 LEDTH---AMGIIWEHRYYGNSTPFPISRDTP---PEHFKYLTTKQALEDIPYFARNFSR 185
Query: 171 KYNATH------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI-------LYYGD 217
A H P + +G SY G A + R KYP V+ + +SSAP+ +YY
Sbjct: 186 PKFAEHDLTPSSTPWVLVGGSYAGIRAAFARNKYPDVIFAAYSSSAPVQAQLNMSIYYDQ 245
Query: 218 ITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT 277
+ Y +V F E C + I + ID S A + K F +N+
Sbjct: 246 V-----YRGLVGHGF----ENCAKDIHAALGYIDQQLSNNHTAAAIKKLFFGPGADQNSN 296
Query: 278 E-LKDGLDTVYSEAAQY--DTPSNIPVKRICNAIENAP 312
E L T+YS Y D P ++ +C +E P
Sbjct: 297 EGFTAALATIYSYFQNYGLDGPEGT-LRELCEHLEVDP 333
>gi|401883336|gb|EJT47549.1| hypothetical protein A1Q1_03570 [Trichosporon asahii var. asahii
CBS 2479]
Length = 603
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 65/261 (24%), Positives = 105/261 (40%), Gaps = 36/261 (13%)
Query: 39 SRILEQKDSNHG-FETFFYNQTIDHFSYGPESY-----------QTFPQRYVINSKFWGG 86
+R LEQ+ N G F+ Q +GP + +TF Q+Y++N + +
Sbjct: 30 TRDLEQRLQNVGVFKKPLNKQMRVGSPWGPHCFTQKRSHFDGSTETFCQQYLVNKEHYKP 89
Query: 87 GNSPILAFLGAEAPIDDNIQLSGFTYEN--AHQFKALIVILEHRYYGKSVPFGSRKAALK 144
G P+ E+ + + + + A + + V LEHRYYG GS
Sbjct: 90 G-GPVFILDNGESEVLSTTTMDKGSLASLLARETNGIYVKLEHRYYG-----GSNVTEDL 143
Query: 145 NARHRGYFNSAQALADYASIL----------LHIKDKYNATHAPVIAIGASYGGELATWF 194
+ + + N ++L D A + L D P I IG SY G A W
Sbjct: 144 STDNLRWLNVRESLEDSAEFIRNFPVPEGLELPEPDLLTPAKTPFIYIGGSYPGGKANWM 203
Query: 195 RLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIA 254
R YP +V GS+ SSA + + ++ W + +VV + EC I +S + +D +
Sbjct: 204 RKHYPDIVWGSIGSSA--VVHAEVDFWQYFDTVV----KHGEPECVSAITESIAAVDKLL 257
Query: 255 SKPDGLAILSKKFKTCTPLKN 275
P A + F + N
Sbjct: 258 DDPKTNAAIKAVFGYTGEISN 278
>gi|296822298|ref|XP_002850262.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238837816|gb|EEQ27478.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 548
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 101/409 (24%), Positives = 159/409 (38%), Gaps = 62/409 (15%)
Query: 72 TFPQRYVINSKFWGGGNSPILAFLGAEAPIDD--NIQL---SGFTYENAHQFKALIVILE 126
T+ R+ IN + + G P+ F EA N L + F + +F + ++ E
Sbjct: 80 TYKHRFWINEEDYKPG-GPVFVFDCGEAAGQRYANNYLYNETNFFRQFTKKFNGVGIVFE 138
Query: 127 HRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKD--------KYNATHAP 178
HRYYG+S PF + H Y N+ QALAD K + NAT P
Sbjct: 139 HRYYGESTPFPI--SVKTPPEHFQYLNNDQALADLPYFAKSFKRAAFPNNDLRPNAT--P 194
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW--TTYHSVVSKDFRDTS 236
I +G SY G A + R +YP + S ASSAP+ D+ + Y +V+ + +
Sbjct: 195 WIMVGGSYPGMRAAFTRDRYPETIYASWASSAPVQAQIDMAVYYEQVYRGLVAYGWGN-- 252
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT-ELKDGLDTVYSEAAQYDT 295
C + I ++ ID S+ D A + K F K + + L Y A +
Sbjct: 253 --CTKDIHAAYRYIDRQLSRGDTAAAIKKLFLGEGADKASNGDFTAALIVAY--AGWQSS 308
Query: 296 PSNIPVKRICNAIEN-------------APNCGDDILCK------IAAGVVEADSLEYDG 336
++ V + CN +E AP GD + + A +V A+++
Sbjct: 309 GADGQVGKFCNWLEVDPHTNKTAPAEGWAPTLGDKAMAERFAAWPTLAEMVNANAMTNCK 368
Query: 337 NNSRCYINEDRTGDESDE----GWEWQSCSEMVVPMGKDKN-----SMYQPEPWNLTKYI 387
+ E + S++ W WQ CSE G + S YQ WN
Sbjct: 369 QTDKSKPLECKLDKPSEDPDFISWIWQYCSEWGYYQGVNFPQHAILSRYQTNQWNQ---- 424
Query: 388 KNCKEQYGVSPRPSWVLTYYGGHDIKLILR---RSTSNIIFSNGMRDPF 433
+ C Q+ R ++ + L + SN+ +S G DP+
Sbjct: 425 EFCYRQFPTGVRSGYLPLFPQTVKTNLATKGWHMRPSNVYWSGGQFDPW 473
>gi|159116781|ref|XP_001708611.1| Serine peptidase, putative [Giardia lamblia ATCC 50803]
gi|157436724|gb|EDO80937.1| Serine peptidase, putative [Giardia lamblia ATCC 50803]
Length = 469
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 111/505 (21%), Positives = 198/505 (39%), Gaps = 94/505 (18%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGR-IRRSRILEQKDSNHGFETFFYNQTIDHFSYGPES 69
LL + I V+T LGR ++R R+ F +DHF+ P +
Sbjct: 2 LLALCILVAT------------LGRFVQRERVGPST-------QLFIENRVDHFN--PFN 40
Query: 70 YQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN-------AHQFKALI 122
F QRY NS+F G+ + +G E I+ G T N A + A I
Sbjct: 41 QDVFRQRYYYNSEFVRDGSHVAILEIGGEGEIN---SAPGGTKSNPDILGRIADNYGAHI 97
Query: 123 VILEHRYYGKSVPF--GSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHAPV- 179
+LEHR+YG S PF S K + + R Y +S QA +D + + D+ ++
Sbjct: 98 FVLEHRFYGISHPFQHTSEKYDVGTDKLR-YLSSKQAQSDLLYFISVMDDRLCPANSKDG 156
Query: 180 -------------IAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHS 226
+ +G SY G + W ++P++ L+SS + +I + T+
Sbjct: 157 SFKRIEGRTCFQWVIVGGSYPGAVTGWIYQRHPNLFAAGLSSSGVVNARYEIPEFDTHTL 216
Query: 227 VVSKDFRDTSEECFQTIKKSWSEI----DNIASKPDGLAILSKKFKTCTPLKNTTELKDG 282
+V S+ +Q ++ ++ DNI + G+ + K + +T +
Sbjct: 217 MVPG--APCSDALYQAQHEATRQVEAGEDNIVYERLGIRTDADKSDIHYFIADTMLM--- 271
Query: 283 LDTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGVVEADSLEYDGNNSRCY 342
QY + R+ A E + D ++ ++ +D +S
Sbjct: 272 -------CFQYGKSKSCCDSRLSKAWEGHGDILDALVDYLSTS-------SFDSYDSINL 317
Query: 343 INEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQY-GVSPRPS 401
++ ++ W WQ+C+E+ + + E ++ CK+ + G+
Sbjct: 318 ASDTAKHSDAFRQWWWQTCTEVAYYQPAPLINSLRSEKITTQWHLDMCKKIFDGLDLGNP 377
Query: 402 WVLT--YYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG--- 456
+ T +YGG +K ++ FSN +DP W D+ +D N+G
Sbjct: 378 TIKTNEFYGGEHVK------ADDVFFSNFWQDP-----WHMCSMTDDMGGQKD-NIGFIR 425
Query: 457 ----SHCLDLDEAKKSDPDWLVQQR 477
HC+DL +++DP LV+ R
Sbjct: 426 CKDCGHCVDLHLPQETDPIELVELR 450
>gi|406698047|gb|EKD01293.1| hypothetical protein A1Q2_04371 [Trichosporon asahii var. asahii
CBS 8904]
Length = 605
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 65/261 (24%), Positives = 105/261 (40%), Gaps = 36/261 (13%)
Query: 39 SRILEQKDSNHG-FETFFYNQTIDHFSYGPESY-----------QTFPQRYVINSKFWGG 86
+R LEQ+ N G F+ Q +GP + +TF Q+Y++N + +
Sbjct: 30 TRNLEQRLQNVGVFKKPLNKQMRVGSPWGPHCFTQKRSHFDGSTETFCQQYLVNKEHYKP 89
Query: 87 GNSPILAFLGAEAPIDDNIQLSGFTYEN--AHQFKALIVILEHRYYGKSVPFGSRKAALK 144
G P+ E+ + + + + A + + V LEHRYYG GS
Sbjct: 90 G-GPVFILDNGESEVLSTTTMDKGSLASLLARETNGIYVKLEHRYYG-----GSNVTEDL 143
Query: 145 NARHRGYFNSAQALADYASIL----------LHIKDKYNATHAPVIAIGASYGGELATWF 194
+ + + N ++L D A + L D P I IG SY G A W
Sbjct: 144 STDNLRWLNVRESLEDSAEFIRNFPVPDGLELPEPDLLTPAKTPFIYIGGSYPGGKANWM 203
Query: 195 RLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIA 254
R YP +V GS+ SSA + + ++ W + +VV + EC I +S + +D +
Sbjct: 204 RKHYPDIVWGSIGSSA--VVHAEVDFWQYFDTVV----KHGEPECVSAITESIAAVDKLL 257
Query: 255 SKPDGLAILSKKFKTCTPLKN 275
P A + F + N
Sbjct: 258 DDPKTNAAIKAVFGYTGEISN 278
>gi|58268250|ref|XP_571281.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|57227516|gb|AAW43974.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 561
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 68/236 (28%), Positives = 105/236 (44%), Gaps = 43/236 (18%)
Query: 45 KDSNHGFETFFYNQTIDHFSYGPESYQ-TFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
K S+ FE + + Q I HF ES TF QRY +++ + G PI G E +
Sbjct: 70 KPSSSIFEPYCFPQFISHFD---ESVNGTFCQRYWVDASSYRPGG-PIYLLDGGETSGEY 125
Query: 104 NIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
+ G ++ L V+LEHRYYG+SVP S + + R + N+A+AL D
Sbjct: 126 RLPFLEKGILDILSNATGGLSVVLEHRYYGESVPVSS--FSTDDLR---FLNNAEALEDS 180
Query: 162 ASILLHIK---------------DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSL 206
A + + K ++ + P I G SY G A R++YP++V G++
Sbjct: 181 AYFIENFKLPASLSNALPFELEETAFHPNNTPWIYYGGSYAGARAAHMRVQYPNLVWGAI 240
Query: 207 ASSAPILYYGDITPWTTYHSVVSKDFRDTSE-----ECFQTIKKSWSEIDNIASKP 257
ASSA T+ + + D + EC T++++ IDNI P
Sbjct: 241 ASSA-----------VTHAQIDFPQYYDPIQEYGPPECISTLRRAIIFIDNILDHP 285
>gi|134113512|ref|XP_774572.1| hypothetical protein CNBF4600 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257214|gb|EAL19925.1| hypothetical protein CNBF4600 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 561
Score = 69.3 bits (168), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 68/236 (28%), Positives = 105/236 (44%), Gaps = 43/236 (18%)
Query: 45 KDSNHGFETFFYNQTIDHFSYGPESYQ-TFPQRYVINSKFWGGGNSPILAFLGAEAPIDD 103
K S+ FE + + Q I HF ES TF QRY +++ + G PI G E +
Sbjct: 70 KPSSSIFEPYCFPQFISHFD---ESVNGTFCQRYWVDASSYRPGG-PIYLLDGGETSGEY 125
Query: 104 NIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADY 161
+ G ++ L V+LEHRYYG+SVP S + + R + N+A+AL D
Sbjct: 126 RLPFLEKGILDILSNATGGLSVVLEHRYYGESVPVSS--FSTDDLR---FLNNAEALEDS 180
Query: 162 ASILLHIK---------------DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSL 206
A + + K ++ + P I G SY G A R++YP++V G++
Sbjct: 181 AYFIENFKLPASLSNALPFELEETAFHPNNTPWIYYGGSYAGARAAHMRVQYPNLVWGAI 240
Query: 207 ASSAPILYYGDITPWTTYHSVVSKDFRDTSE-----ECFQTIKKSWSEIDNIASKP 257
ASSA T+ + + D + EC T++++ IDNI P
Sbjct: 241 ASSA-----------VTHAQIDFPQYYDPIQEYGPPECISTLRRAIIFIDNILDHP 285
>gi|327290302|ref|XP_003229862.1| PREDICTED: dipeptidyl peptidase 2-like [Anolis carolinensis]
Length = 380
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 36/86 (41%), Positives = 49/86 (56%)
Query: 50 GFETFFYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSG 109
GFE +++Q +DHF++ S +TFPQRY+I KFW GN PI + G E + + + G
Sbjct: 29 GFEEKYFDQWLDHFNFESYSNRTFPQRYLITDKFWKRGNRPIFFYTGNEGDVWNFGENCG 88
Query: 110 FTYENAHQFKALIVILEHRYYGKSVP 135
F E A Q AL+V EH VP
Sbjct: 89 FILELAGQQGALVVFAEHVSVFIGVP 114
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 42/142 (29%), Positives = 76/142 (53%), Gaps = 9/142 (6%)
Query: 348 TGDESDEGWEWQSCSEMVVPMGKDKNSMYQPE-PWNLTKYIKNCKEQYGVSPRPSWVLTY 406
TG ++D W++Q+C+E+ + + + PE P+ C ++GV PRPSW+ T
Sbjct: 234 TGSDAD-AWDYQACTEINLTFDSNNVTDMFPEIPFTEADRDSYCFRRWGVHPRPSWLDTS 292
Query: 407 YGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAK 466
+ G ++ + SNI+FSNG DP++ GG K ++ + ++ G+H LDL +
Sbjct: 293 FWGSNLA-----AASNIVFSNGDLDPWAGGGIRKNLSSSLIAIT--ISGGAHHLDLRASN 345
Query: 467 KSDPDWLVQQRKTEVKIMQGWI 488
+DP + + R E ++ W+
Sbjct: 346 PADPPSVREARLQEASLILDWV 367
>gi|303322745|ref|XP_003071364.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240111066|gb|EER29219.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320032884|gb|EFW14834.1| extracelular serine carboxypeptidase [Coccidioides posadasii str.
Silveira]
Length = 543
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 101/237 (42%), Gaps = 30/237 (12%)
Query: 35 RIRRSRILEQKDSNHGFETFFYNQTIDHF----SYGPESYQTFPQRYVINSKFWGGGNSP 90
R++ L D + + + IDHF Y P S TF RY ++ + G P
Sbjct: 31 RVKAPSALAGDDLTSLYPSHTISIPIDHFHTDDRYAPHSNGTFELRYWFDASHYKDGG-P 89
Query: 91 ILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARH 148
++ G E + + G + A + V+LEHRYYG S+P + + KN R
Sbjct: 90 VIVLHGGETDGEGRLPFLQKGILGQLAQATNGVGVVLEHRYYGTSIP--TEDFSTKNLR- 146
Query: 149 RGYFNSAQALADYA-----SILLHIKDK-YNATHAPVIAIGASYGGELATWFRLKYPHVV 202
+ + QA+AD A + ++DK A + P I G SY G + R++YP +
Sbjct: 147 --FLTTEQAMADSAYFAKNVVFEGLEDKDLTAPNTPYILYGGSYAGAQVAFLRVEYPDIF 204
Query: 203 IGSLASSA---PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
G+++SS I +Y W Y + + C + + +DNIA +
Sbjct: 205 WGAISSSGVTKAIWHY-----WQYYEPTR----KHAPQHCVKQTQTFVDLVDNIALR 252
>gi|392868441|gb|EAS34227.2| extracelular serine carboxypeptidase [Coccidioides immitis RS]
Length = 541
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 62/237 (26%), Positives = 101/237 (42%), Gaps = 30/237 (12%)
Query: 35 RIRRSRILEQKDSNHGFETFFYNQTIDHF----SYGPESYQTFPQRYVINSKFWGGGNSP 90
R++ L D + + + IDHF Y P S TF RY ++ + G P
Sbjct: 31 RVKAPSALAGDDLTSLYPSHTISIPIDHFHTDDRYAPHSNGTFELRYWFDASHYKDGG-P 89
Query: 91 ILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARH 148
++ G E + + G + A + V+LEHRYYG S+P + + KN R
Sbjct: 90 VIVLHGGETDGEGRLPFLQKGILGQLAQATNGVGVVLEHRYYGTSIP--TEDFSTKNLR- 146
Query: 149 RGYFNSAQALADYA-----SILLHIKDK-YNATHAPVIAIGASYGGELATWFRLKYPHVV 202
+ + QA+AD A + ++DK A + P I G SY G + R++YP +
Sbjct: 147 --FLTTEQAMADSAYFAKNVVFEGLEDKDLTAPNTPYILYGGSYAGAQVAFLRVEYPDIF 204
Query: 203 IGSLASSA---PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
G+++SS I +Y W Y + + C + + +DNIA +
Sbjct: 205 WGAISSSGVTKAIWHY-----WQYYEPTR----KHAPQHCVKQTQTFVDLVDNIALR 252
>gi|302664611|ref|XP_003023934.1| extracellular serine carboxypeptidase, putative [Trichophyton
verrucosum HKI 0517]
gi|291187955|gb|EFE43316.1| extracellular serine carboxypeptidase, putative [Trichophyton
verrucosum HKI 0517]
Length = 536
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 118/294 (40%), Gaps = 35/294 (11%)
Query: 11 LLLMVIFVSTSFHANGL-KLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFS----Y 65
LLL+V V + +LR R ++ + +D N F IDHF Y
Sbjct: 8 LLLLVAAVQAKLPVTPISQLRAESHR-NKALVARSQDVNAAFPAHTIQIPIDHFPKSSRY 66
Query: 66 GPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIV 123
P + + F RY ++ + G P++ G E + I G + A + V
Sbjct: 67 EPHTTEKFNLRYWFDASHYKEG-GPVIILHGGETSGEGRIPFLQKGILAQLAQATNGIGV 125
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK----DKYNATHAPV 179
I+EHRYYG S+P N R + + QALAD A +IK +KYN T AP
Sbjct: 126 IMEHRYYGGSLP----TPDFSNKSLR-FLTTEQALADTAYFSKNIKFPGLEKYNLT-APG 179
Query: 180 IA---IGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTS 236
A G SY G + R +YP + G+++SS D W + + ++
Sbjct: 180 TAHILYGGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDY--WQYFEPIR----QEAP 233
Query: 237 EECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEA 290
++C + +DNI I K T LKN L D ++ A
Sbjct: 234 QDCVHVTQNFVDIVDNI-------IIHGKNANTTRELKNLFGLGRLRDADFANA 280
>gi|154302877|ref|XP_001551847.1| hypothetical protein BC1G_09182 [Botryotinia fuckeliana B05.10]
gi|347832253|emb|CCD47950.1| similar to extracelular serine carboxypeptidase (secreted protein)
[Botryotinia fuckeliana]
Length = 563
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 69/284 (24%), Positives = 119/284 (41%), Gaps = 28/284 (9%)
Query: 19 STSFHANGL-KLRPRLGRIRRSRILEQKDS---NHGFETFFYNQTIDHF----SYGPESY 70
S + NG+ + R R RR L ++D + + T+ ID+F Y P S
Sbjct: 20 SHQLYPNGVDRKRSRTTISRRELALAKRDDVDPSLLYPTYNLTVPIDYFHNESRYEPHSN 79
Query: 71 QTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVILEHR 128
TFP RY ++ ++ G P++ E + + G ++ A + V+LEHR
Sbjct: 80 GTFPLRYWFDATYYKPG-GPVIVLQSGETDAEGRLPFLQKGILHQLAVATNGIGVVLEHR 138
Query: 129 YYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI------KDKYNATHAPVIAI 182
YYG+S+P + + +N R + + QAL D +I A + I
Sbjct: 139 YYGQSIP--TPDFSTENLR---FLTTEQALMDEVYFARNIVFPGLEDQNLTAPNVAYIGY 193
Query: 183 GASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQT 242
G SY G + R YP G+++SS + D W + + + + D ++C +
Sbjct: 194 GGSYAGAFNAFLRKLYPDTFWGTISSSGVVEAIYDY--WDYFEPI--RVYAD--QKCIKN 247
Query: 243 IKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTV 286
+ + +DNI + L ++ K+ L N T D + V
Sbjct: 248 TQLITNSMDNIVIGQENNTALVQELKSVWGLPNITYTNDFMSVV 291
>gi|302503352|ref|XP_003013636.1| serine peptidase, family S28, putative [Arthroderma benhamiae CBS
112371]
gi|302666638|ref|XP_003024916.1| serine peptidase, family S28, putative [Trichophyton verrucosum HKI
0517]
gi|291177201|gb|EFE32996.1| serine peptidase, family S28, putative [Arthroderma benhamiae CBS
112371]
gi|291188993|gb|EFE44305.1| serine peptidase, family S28, putative [Trichophyton verrucosum HKI
0517]
Length = 551
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 85/347 (24%), Positives = 132/347 (38%), Gaps = 55/347 (15%)
Query: 57 NQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYEN-- 114
+ IDH S T+ R+ IN + + G P+ F EA F N
Sbjct: 71 DMPIDHKS---NKTGTYKHRFWINEQDYKPGG-PVFVFDCGEAAGQRYADRYLFNETNFF 126
Query: 115 ---AHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-- 169
+F + +I EHRYYG+S PF H Y N+ QALAD K
Sbjct: 127 RQLTKKFHGIGIIFEHRYYGESTPFPV--TVETPPEHFQYLNNDQALADLPYFAKDFKRK 184
Query: 170 ----DKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW--TT 223
D P + +G SY G + + R +YP + S ASSAP+ D+ +
Sbjct: 185 SFPNDDLRPNATPWVMVGGSYPGMRSAFTRDRYPETIYASWASSAPVQAQIDMAVYYEQV 244
Query: 224 YHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGL 283
Y +V+ + + C + I+ ++ ID S+ D A + K F L E
Sbjct: 245 YRGLVAYGWGN----CTKDIRAAYRYIDRQLSRNDTAAAIKKLF-----LGEGAEQASNG 295
Query: 284 D----TVYSEAAQYDTPSNIPVKRICNAIEN-------------APNCGDDILCK----- 321
D + + A + ++ V + CN +E AP GD + +
Sbjct: 296 DFTAALIVAYAGWQSSGADGQVGKFCNWLEVDPKTNKTAPAEGWAPTLGDKAMAERFAAW 355
Query: 322 -IAAGVVEADSLEYDGNNSRCYINEDRTGDESDE----GWEWQSCSE 363
A +V A+++ + E + S++ W WQ CSE
Sbjct: 356 PTLAEMVNANAMTNCKQTDKSKPLECKLDKPSEDPDFISWIWQYCSE 402
>gi|154303090|ref|XP_001551953.1| hypothetical protein BC1G_09565 [Botryotinia fuckeliana B05.10]
Length = 411
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 59/226 (26%), Positives = 99/226 (43%), Gaps = 29/226 (12%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD------NIQLS 108
F+ Q +DH + S TF Q++ NS+FW G SPI+ F E + N+ ++
Sbjct: 54 FFTQLLDHDN---PSKGTFQQKFWWNSEFWAGPGSPIVFFTPGEIAAANYGAYLTNVTVT 110
Query: 109 GFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHI 168
G A + K +V++EHR++G+S P+ + QA+AD+ +
Sbjct: 111 GLF---AQEIKGAVVMVEHRFWGESSPYDNLTTTNLQLLTL-----KQAIADFVHFAKTV 162
Query: 169 KDKYNATH------APVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWT 222
+++ H AP I G SY G L+ W P ASSAP+ D W
Sbjct: 163 DLPFDSNHSSNAASAPWINSGGSYSGALSAWTESTSPGTFWAYHASSAPVQAIDDY--WQ 220
Query: 223 TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFK 268
++ V ++ S++ I +DN+ + + A+ + K K
Sbjct: 221 YFYPVQDGMPKNCSKDVSLVIDY----MDNVLTHGNKSAVTALKTK 262
>gi|325186495|emb|CCA21035.1| lysosomal ProX carboxypeptidase putative [Albugo laibachii Nc14]
Length = 263
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 64/258 (24%), Positives = 106/258 (41%), Gaps = 23/258 (8%)
Query: 235 TSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTT---ELKDGLDTVYSEAA 291
++ C + I+++W + ++A G L+K F C PL+N T L L +S A
Sbjct: 5 SAPNCAEKIRQAWPALFSMAENEPGRLQLAKIFHLCRPLQNETGIHHLALWLLNAFSVLA 64
Query: 292 QYDTPSNIPVKRICNAIENAPNCGDDILCKIAA-----------GVVEADSLEYDGNNSR 340
+ P P + N P C A + EA S+ Y+
Sbjct: 65 MRNYP--YPSSYLSNGEAQLPAWPMQSACSFLADQRPDSIALISSLFEAVSVLYNATKKM 122
Query: 341 CYINEDRTGDESDEGWEWQSCSEMVVPMGKDKNSMYQPEPWNLT---KYI-KNCKEQYGV 396
++ R D W + C+EM++ ++ WN T K++ ++C+ +G
Sbjct: 123 DCVDLPRDMTSIDGIWGFHYCTEMLLQETYFSSNGISDMFWNRTISAKFVQQHCQRVWGT 182
Query: 397 SPRPSWVLTYYGGHDIKLILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQDLNLG 456
P P W+ YG D L + SNI+F+NGM DP+ G K+ + +
Sbjct: 183 KPDPEWIRIMYGDAD---TLLSAASNIVFTNGMLDPWRCCGVKKSQVRNNRIKVLKIENA 239
Query: 457 SHCLDLDEAKKSDPDWLV 474
+H LDL DP+ L+
Sbjct: 240 AHHLDLFFHHVDDPNPLL 257
>gi|119189871|ref|XP_001245542.1| hypothetical protein CIMG_04983 [Coccidioides immitis RS]
Length = 533
Score = 68.6 bits (166), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 93/212 (43%), Gaps = 30/212 (14%)
Query: 60 IDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYE 113
IDHF Y P S TF RY ++ + G P++ G E + + G +
Sbjct: 48 IDHFHTDDRYAPHSNGTFELRYWFDASHYKDGG-PVIVLHGGETDGEGRLPFLQKGILGQ 106
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA-----SILLHI 168
A + V+LEHRYYG S+P + + KN R + + QA+AD A + +
Sbjct: 107 LAQATNGVGVVLEHRYYGTSIP--TEDFSTKNLR---FLTTEQAMADSAYFAKNVVFEGL 161
Query: 169 KDK-YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA---PILYYGDITPWTTY 224
+DK A + P I G SY G + R++YP + G+++SS I +Y W Y
Sbjct: 162 EDKDLTAPNTPYILYGGSYAGAQVAFLRVEYPDIFWGAISSSGVTKAIWHY-----WQYY 216
Query: 225 HSVVSKDFRDTSEECFQTIKKSWSEIDNIASK 256
+ + C + + +DNIA +
Sbjct: 217 EPTR----KHAPQHCVKQTQTFVDLVDNIALR 244
>gi|380487520|emb|CCF37988.1| serine carboxypeptidase S28 [Colletotrichum higginsianum]
Length = 516
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 101/377 (26%), Positives = 143/377 (37%), Gaps = 59/377 (15%)
Query: 26 GLKL--RPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYVINSKF 83
GLK+ PRLG +R + S + ++Q +DH TF QRY N++
Sbjct: 24 GLKMADTPRLGGLR-----PRAHSGNQIHNGTFDQLLDHTQ---PWRGTFKQRYWWNAEH 75
Query: 84 WGGGNSPILAFLGAEAPIDDNIQLSGFTYEN-------AHQFKALIVILEHRYYGKSVPF 136
WGG P+ G E+ D +G+ EN A K ++++EHRYYG+S P+
Sbjct: 76 WGGPGYPVFLINGGES---DAAGFTGY-LENGTVTGLYAETHKGAVILIEHRYYGESWPY 131
Query: 137 GSRKA---ALKNARHRGYFNSAQALADYASILLHIKDK-YNATHAPVIAIGASYGGELAT 192
+ A L Y N A DK NA +P + IG SY G LA
Sbjct: 132 KTSTADTLQLLEVPQAIYDNIYFAETAALPFDQGTTDKGANADKSPWVLIGGSYAGALAA 191
Query: 193 WFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDN 252
W + P ASSA + D W + + R+ S + IK E+D
Sbjct: 192 WTSVIAPGTFAAYHASSAVVQAIEDF--WQFFTPIEQALPRNCSADIKLVIK----EVDA 245
Query: 253 IASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVY--SEAAQYDTPSNIPVKRICNAIEN 310
+ + IL+ K + L+D D Y + T S V C+ IE
Sbjct: 246 VLDRGSDAEILAMKEEF-----GLETLEDHGDFAYYLQKPVIAWTDSEKAVLDFCDWIET 300
Query: 311 APNCGDDIL-CKIAAGVVEADSLEYDG-----NNSRCYINE--DRTGD----------ES 352
+ G C+ + +EA Y N C E D GD E
Sbjct: 301 STTNGQVAAGCEQSGVGLEAAWAGYTSWMHRRYNETCEAEEACDLYGDAVGYNRPTDLEW 360
Query: 353 DEGWEWQSCSEMVVPMG 369
W WQ C+E P+G
Sbjct: 361 GRSWVWQLCNE---PLG 374
>gi|171684281|ref|XP_001907082.1| hypothetical protein [Podospora anserina S mat+]
gi|170942101|emb|CAP67753.1| unnamed protein product [Podospora anserina S mat+]
Length = 528
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 69/236 (29%), Positives = 105/236 (44%), Gaps = 32/236 (13%)
Query: 35 RIRRSRILEQKDSNHGFETFFYNQTI----DHFS----YGPESYQTFPQRYVINSKFWGG 86
R R+ ++ +D++ E + QTI DHF Y P + +TF RY ++ +
Sbjct: 8 REMRTALIHPRDTD--LEKLYPAQTIQMPIDHFHNDSIYEPHTNETFLLRYWFDASHYQP 65
Query: 87 GNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALK 144
G P++ G E ++ G + + L VI EHRYYG+S P + + K
Sbjct: 66 G-GPVIVLQGGETDGAGRLRYLQKGIVAQLSQATNGLGVIFEHRYYGESHP--TDDFSTK 122
Query: 145 NARHRGYFNSAQALADYA----SILLHIKDKYNAT--HAPVIAIGASYGGELATWFRLKY 198
N R + + QALAD A +++ + N T + P IA G SY G + + R Y
Sbjct: 123 NLR---FLTTDQALADQAYFAQNVVFPGLEHLNLTSHNVPYIAYGGSYAGSVVAFLRKLY 179
Query: 199 PHVVIGSLASSA-PILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKKSWSEIDNI 253
P V G++ASS P Y W Y + R EC + +K IDN+
Sbjct: 180 PDVYWGAIASSGVPEAIY---DYWEYYEAARIYGPR----ECIEATQKLTHAIDNV 228
>gi|255938516|ref|XP_002560028.1| Pc14g00330 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584649|emb|CAP74174.1| Pc14g00330 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 557
Score = 68.6 bits (166), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 78/317 (24%), Positives = 129/317 (40%), Gaps = 30/317 (9%)
Query: 18 VSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDH------FSYGPESYQ 71
+ TS LKL LG I +L QK + H ++TI+ + S
Sbjct: 23 IVTSQFKRDLKLSAELG-IHPDTLLGQKTTVHDVTNSQLDETIEAEYVSIPIDHSNSSVG 81
Query: 72 TFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL-----SGFTYENAHQFKALIVILE 126
+ RY ++ + + + PI F E+ + Q S F Y+ +F + ++ E
Sbjct: 82 HYRNRYWVSEEHYKE-DGPIFVFDVGESTAEPAGQTYLSNSSTFFYQLVKEFGGIGIVWE 140
Query: 127 HRYYGKSVPFGSRKAALKNARHRGYFNSAQALAD--YASILLHIKDKYNATHAPV----I 180
HRYYG S+P+ + H Y N+ QALAD Y + +D + P +
Sbjct: 141 HRYYGDSLPY--NVSLDMEPEHLQYLNNKQALADIPYFAAQFTRQDYSDVDLTPAGTPWV 198
Query: 181 AIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW--TTYHSVVSKDFRDTSEE 238
+G SY G A + R YP + + ASSAP+ D++ + Y +V+ +
Sbjct: 199 MVGGSYAGMRAAFTRQSYPDTIYAAFASSAPVEARIDMSVYFDQVYDGMVTYGHLN---- 254
Query: 239 CFQTIKKSWSEIDNIASKPDGLAILSKK--FKTCTPLKNTTELKDGLDTVYSEAAQYDTP 296
C + IK + ID SK + A K+ F + + L T+Y+ Y
Sbjct: 255 CTRDIKAALEYIDEQLSKSESAAAAIKREFFGEGAEKNSNGDFTAALVTIYNFFQSYGMG 314
Query: 297 SNI-PVKRICNAIENAP 312
+ ++ C +E P
Sbjct: 315 GGVGSLESFCEHMETDP 331
>gi|123501342|ref|XP_001328052.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121910990|gb|EAY15829.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 369
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 70/317 (22%), Positives = 137/317 (43%), Gaps = 29/317 (9%)
Query: 182 IGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQ 241
IG SY G L++WFR K+P + +GS ASSAPI + + + + ++DF + C++
Sbjct: 2 IGGSYAGSLSSWFRQKHPELALGSWASSAPIFAKLNFSEYDKHE---AEDFMEYG--CYE 56
Query: 242 TIKKSWSEIDNIA-SKPDGLAILSKKFKTCTP---LKNTTELKDGLDTVYSEAAQYDTPS 297
+ ++ I+ +A + D L F P + ++ E D Y+ QY +
Sbjct: 57 NVLNAYKTIEKVALLQNDKTEELMMMFGVPNPEEFVSHSLEFLDMFSYAYTYGNQYKAWN 116
Query: 298 NIPVKRICNAIENAPNCGDDILCKIAAG---VVEADSLEYDGNNSRCYINEDRTGDESDE 354
I + +C++++ D + + A ++ D N + D + +++
Sbjct: 117 QI-ILDMCDSLKEIDTSDSDEVIGVMATTSYLLGMDKFLELYPNGLKNTSVD-SPNKASR 174
Query: 355 GWEWQSCSEMVVPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRPSWVLTYYGGHDIKL 414
GW + C+E + + + + Y C+ +G P P YGG++
Sbjct: 175 GWAYMMCNEF--GWFYSASGLLKSNLLTIQYYSDFCQNIFGKQPDPDKFNEKYGGYN--- 229
Query: 415 ILRRSTSNIIFSNGMRDPFSRGGWVKTYHFFDLS---FSQDLNLGSHCLDLDEAKKSDPD 471
+ + ++++N D +S T D S +S +N G HC D+ + SD
Sbjct: 230 ---PNVTRVVYTNSHYDSWSE----LTMKRNDTSKSIYSFSINDGFHCDDIHDPNDSDSI 282
Query: 472 WLVQQRKTEVKIMQGWI 488
L+ R+ +K++ W+
Sbjct: 283 SLMSVREESIKLLLKWM 299
>gi|71407906|ref|XP_806390.1| prolyl carboxypeptidase [Trypanosoma cruzi strain CL Brener]
gi|70870123|gb|EAN84539.1| putative prolyl carboxypeptidase, putative [Trypanosoma cruzi]
Length = 279
Score = 68.2 bits (165), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 93/215 (43%), Gaps = 20/215 (9%)
Query: 8 FQWLLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHG---FETFFYNQT----- 59
F +L ++++F+++ + L L R+ R + G F+ ++N+T
Sbjct: 2 FFFLSIVLLFLASCGASGLLPLDVIPERLNRGELFCSPVGEGGGNLFDEEYFNKTPATFR 61
Query: 60 --IDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQ 117
+DH G TF QRY ++ W N+ LA L D G+ H+
Sbjct: 62 QLVDHSKNGS---STFDQRYWVDYSAW---NNSELAMLYIRIGSGDFTSPRGYPGMYGHE 115
Query: 118 FKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIKDKYNATHA 177
L+ LE RYYGKS+PF + + + Y N AL D +++K
Sbjct: 116 RNMLLFTLEGRYYGKSLPF----PLTETEKLKKYLNVDIALEDIRGFQKFVEEKLLQKKL 171
Query: 178 PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
+ +G SY G LA WF+ KYP + +SSA +
Sbjct: 172 RWLIVGGSYAGALAVWFKAKYPTAALAVWSSSAVV 206
>gi|302500786|ref|XP_003012386.1| extracelular serine carboxypeptidase, putative [Arthroderma
benhamiae CBS 112371]
gi|291175944|gb|EFE31746.1| extracelular serine carboxypeptidase, putative [Arthroderma
benhamiae CBS 112371]
Length = 716
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 93/214 (43%), Gaps = 22/214 (10%)
Query: 11 LLLMVIFVSTSFHANGL-KLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFS----Y 65
LLL+V V + +LR R ++ + +D N F IDHF Y
Sbjct: 8 LLLLVAAVQAKLPVTPISQLRAESHR-NKALVARSQDVNAAFPAHTIQIPIDHFPKSSRY 66
Query: 66 GPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIV 123
P + + F RY ++ + G P++ G E + I G + A + V
Sbjct: 67 EPHTTEKFNLRYWFDASHYKEGG-PVIILHGGETSGEGRIPFLQKGILAQLAQATNGIGV 125
Query: 124 ILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK----DKYNATHAPV 179
I+EHRYYG S+P N R + + QALAD A +IK +KYN T AP
Sbjct: 126 IMEHRYYGGSLP----TPDFSNKSLR-FLTTEQALADTAYFSKNIKFPGLEKYNLT-APG 179
Query: 180 IA---IGASYGGELATWFRLKYPHVVIGSLASSA 210
A G SY G + R +YP + G+++SS
Sbjct: 180 TAHILYGGSYAGGQVAFLRTQYPDIFWGAISSSG 213
>gi|340514339|gb|EGR44603.1| predicted protein [Trichoderma reesei QM6a]
Length = 533
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 74/149 (49%), Gaps = 17/149 (11%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDD------NIQLSG 109
+ Q IDH PE TF QRYV N +F+ G SPI+ E+ +D N+ L G
Sbjct: 52 FQQLIDHDH--PE-LGTFSQRYVWNDEFYAGPGSPIILMGPNESALDGYERYTTNLTLPG 108
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYA-SILLHI 168
A + A +I+EHRY+G+S PF S +N R+ S Q L +A +++L
Sbjct: 109 VM---AQELGAGALIIEHRYWGQSSPFDSLTT--ENMRYLTLEQSVQDLVYFAQNVVLPF 163
Query: 169 KDKYNAT--HAPVIAIGASYGGELATWFR 195
+T AP + +G SY G LA W +
Sbjct: 164 DQNRTSTPDKAPWVLVGCSYSGALAAWVQ 192
>gi|170574566|ref|XP_001892870.1| Serine protease Z688.6 precursor [Brugia malayi]
gi|158601364|gb|EDP38292.1| Serine protease Z688.6 precursor, putative [Brugia malayi]
Length = 307
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 75/302 (24%), Positives = 120/302 (39%), Gaps = 39/302 (12%)
Query: 219 TPWTTYHSVVSKDFRDTSEECFQ-TIKKSWSEIDNIASKPDGLAILSKKFK--TCTPLKN 275
P Y+ +V + F S C + I W + N++ G A L+ F + LK
Sbjct: 5 VPQDAYNRIVKRSF--LSSGCIEKNILDGWIALKNLSLTTTGRAYLNGLFHLDKKSHLKI 62
Query: 276 TTE---LKDGLDTVYSEAAQ--YDTPSNI-------PVKRICNAIENAPNCGDDILCKIA 323
+T+ LK+ L+ ++ A Y P+N PVK C + + D+ L +
Sbjct: 63 STDWIMLKEYLEDIFESMAMVNYPYPTNYLAELPGWPVKVACQFFNSNKSKNDEELAQSM 122
Query: 324 AGVVEADSLEYDGNNSRCYINEDRTGDES------DEGWEWQSCSEMVVPM---GKDKNS 374
G++ Y G I D D + GW WQSC+EMV+ G +
Sbjct: 123 YGIMNL-YYNYTGQKKTFCIKPDVCNDSAYGALGDPFGWPWQSCTEMVMQQCSSGPPNDF 181
Query: 375 MYQPEPWNLTKYIKNCKEQYG------VSPRPSWVLTYYGGHDIKLILRRSTSNIIFSNG 428
+ P++L C +G RP W + YG + +NI+FSNG
Sbjct: 182 FIKNCPFSLKGQELYCINTFGKLGYTKALMRPHWSILNYGNR------YPTATNIVFSNG 235
Query: 429 MRDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
DP+S GGW S + G+H DL + D + + R+ E ++ W+
Sbjct: 236 YLDPWSAGGWSLKSRVMGSLISIIIKDGAHHYDLRGKHQLDTNSVKDARRLEKFYIKYWL 295
Query: 489 TQ 490
+
Sbjct: 296 KE 297
>gi|367020898|ref|XP_003659734.1| hypothetical protein MYCTH_2297119 [Myceliophthora thermophila ATCC
42464]
gi|347007001|gb|AEO54489.1| hypothetical protein MYCTH_2297119 [Myceliophthora thermophila ATCC
42464]
Length = 561
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 72/166 (43%), Gaps = 24/166 (14%)
Query: 60 IDHFS----YGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYE 113
+DHF Y P S +TFP RY ++ + G PI+ E + G +
Sbjct: 69 VDHFHNDSLYEPHSSETFPLRYWFDASHYKKG-GPIIVLQSGETDGVGRLPFLQKGIVAQ 127
Query: 114 NAHQFKALIVILEHRYYGKSVP---FGSRKAALKNARHRGYFNSAQALADYASILLHIKD 170
A L VILEHRYYG+S+P F + K + + QALAD A H+
Sbjct: 128 LARATNGLGVILEHRYYGESIPTPDFSTEKLR--------FLTTDQALADMAYFARHVVF 179
Query: 171 K------YNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSA 210
K + P IA G SY G + R YP V G+++SS
Sbjct: 180 KGLEHLDLTSAKNPYIAYGGSYAGAFVAFLRKLYPDVYWGAISSSG 225
>gi|398398187|ref|XP_003852551.1| serine carboxypeptidase, partial [Zymoseptoria tritici IPO323]
gi|339472432|gb|EGP87527.1| serine carboxypeptidase [Zymoseptoria tritici IPO323]
Length = 487
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 77/166 (46%), Gaps = 15/166 (9%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPIL----AFLGAEAPIDDNIQLSGFT 111
+ Q +DH + S TFP RY ++ FW G SP++ GAE +D S
Sbjct: 1 FAQLVDHDN---PSLGTFPARYWFDNTFWKGPGSPVVFETPGEFGAEIYLDTVFNYS-MV 56
Query: 112 YENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASI--LLHIK 169
A + A +VILEHRY+GKS+P + +N ++ NS + L +A L +K
Sbjct: 57 RVIAEKIGASMVILEHRYFGKSIPVDNLTT--ENMKYLTLENSLKDLTYFARTVELPFVK 114
Query: 170 DKYNATHA---PVIAIGASYGGELATWFRLKYPHVVIGSLASSAPI 212
D + A P + +G SY G LA W + ASSA +
Sbjct: 115 DASRGSTAVDVPWVMVGGSYAGALAAWTATLFSGTFWAYYASSATV 160
>gi|322712275|gb|EFZ03848.1| serine peptidase, putative [Metarhizium anisopliae ARSEF 23]
Length = 550
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 123/313 (39%), Gaps = 41/313 (13%)
Query: 13 LMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQT 72
L +S H N L R +RS I Q+ + + Q IDH + T
Sbjct: 18 LSAARISRRHHLNRLPSRSPKEYSKRSNIPTQEGT--------FQQLIDHNN---PDLGT 66
Query: 73 FPQRYVINSKFWGGGNSPILAFLGAEAPID----DNIQLSGFTYENAHQFKALIVILEHR 128
F Q Y+ N++++ G SPI+ E +D N L G A A +++LEHR
Sbjct: 67 FTQHYLYNAEYYAGPGSPIVLNTPGEDVVDGFYATNNTLPGLF---AQTNNAAVIVLEHR 123
Query: 129 YYGKSVPFGSRKAALKNARHRGYFNSAQALADYA---SILLHIKDKYNATHAPVIAIGAS 185
Y+GKS P+ S + N ++ N+ Q L +A + + T AP I G S
Sbjct: 124 YWGKSSPYDS--LSTTNLQYLTLENAIQDLIYFAHNVQLPFDVDGASKPTKAPWILTGCS 181
Query: 186 YGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSEECFQTIKK 245
Y G LA W P +SA I D W Y + + C +++
Sbjct: 182 YPGALAAWTHHLAPGTFWAYHCTSAVIEIISDF--WQYYEPIKAA----MPTNCSTDMQR 235
Query: 246 SWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEA--------AQYDTPS 297
+ +ID + S G K L+ D ++++ A+ D P+
Sbjct: 236 ATEQIDKVLSH--GTKTQKHALKKTFGLEALAHDNDFVESILGGLQEWQGMTFARSDKPN 293
Query: 298 NIPVKRICNAIEN 310
P+ + C+ +EN
Sbjct: 294 --PLYQFCDYLEN 304
>gi|348554926|ref|XP_003463275.1| PREDICTED: thymus-specific serine protease-like [Cavia porcellus]
Length = 629
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 59/119 (49%), Gaps = 4/119 (3%)
Query: 19 STSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFSYGPESYQTFPQRYV 78
S H + L LR RLG R R+ + G + Q +D F+ +TF QRY
Sbjct: 223 SAPAHPSTLLLR-RLGE-RVQRLRGSAGPDPGPRAGWLQQPLDPFNS--SDDRTFLQRYW 278
Query: 79 INSKFWGGGNSPILAFLGAEAPIDDNIQLSGFTYENAHQFKALIVILEHRYYGKSVPFG 137
+N + W GG++P+ LG E + ++G A AL++ LEHR+YG SVP G
Sbjct: 279 VNDRHWAGGDAPVFLHLGGEGSLGPGSVMTGHPEALAPALGALVISLEHRFYGLSVPAG 337
Score = 45.1 bits (105), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 51/261 (19%), Positives = 100/261 (38%), Gaps = 42/261 (16%)
Query: 261 AILSKKFKTCTPL---KNTTELKDGLDTVYSEAAQYDTPSNIP--VKRICNAIENAPNCG 315
+ L + C L ++ EL + L + A QYD + P V+++C + G
Sbjct: 377 STLRTELGACGSLSRAEDRAELLEALQALVGGAVQYDGQTGAPLSVRQLCGLLLEGARNG 436
Query: 316 DD----ILCKIAAGVVEADSLEYDGNNSRCYI------NEDRTGDESDEGWEWQSCSEMV 365
++A +V + + SR E + D W +Q+C+E
Sbjct: 437 SHPEAYFGLRLAVQIVLHSLGQRCLSASRAQTVAQLKDTEPQASGLGDRQWLYQTCTEFG 496
Query: 366 VPMGKDKNSMYQPEPWNLTKYIKNCKEQYGVSPRP-----SWVLTYYGGHDIKLILRRST 420
+ + P+ L +++ C++ +G+S + +YYGG
Sbjct: 497 FYVTCENAECPFPQFPALPSHLELCEQVFGLSASSVARAVTQTNSYYGGQAP------GA 550
Query: 421 SNIIFSNGMRDPFSRGGWVKTYHFFDLSFSQD-------LNLGSHCLDLDEAKKSDPDWL 473
+ ++F NG DP +H ++ + + SHC D+ + SDP L
Sbjct: 551 TRVLFVNGDIDP---------WHMLSVTQASGGSKAALLIPGASHCADMAPERPSDPTSL 601
Query: 474 VQQRKTEVKIMQGWITQYYDD 494
R++ + +Q W+ Q ++
Sbjct: 602 RSGRQSILHQLQTWLGQVKEE 622
>gi|315052106|ref|XP_003175427.1| hypothetical protein MGYG_02952 [Arthroderma gypseum CBS 118893]
gi|311340742|gb|EFQ99944.1| hypothetical protein MGYG_02952 [Arthroderma gypseum CBS 118893]
Length = 545
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/293 (26%), Positives = 116/293 (39%), Gaps = 33/293 (11%)
Query: 11 LLLMVIFVSTSFHANGLKLRPRLGRIRRSRILEQKDSNHGFETFFYNQTIDHFS----YG 66
LLL+V V + ++ + +D N F IDHF Y
Sbjct: 8 LLLLVAAVQAKLPVTPISQLKAESHRTKALLARSEDVNAAFPAHTIKIPIDHFPKSSRYE 67
Query: 67 PESYQTFPQRYVINSKFWGGGNSPILAFLGAEAPIDDNIQL--SGFTYENAHQFKALIVI 124
P + F RY ++ + G P++ G E I G + A + VI
Sbjct: 68 PHTTDKFDLRYWFDASHYKEGG-PVIILHGGETDGAGRIPFLQKGILAQLAQATNGIGVI 126
Query: 125 LEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK----DKYNATHAPVI 180
+EHRYYG S+P +R + K+ R + + QALAD A +IK +KYN T AP
Sbjct: 127 MEHRYYGGSLP--TRDFSNKSLR---FLTTEQALADTAYFSKNIKFPGLEKYNLT-APGT 180
Query: 181 A---IGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDTSE 237
A G SY G + R +YP + G+++SS D W + + ++ +
Sbjct: 181 AHILYGGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDY--WQYFEPIR----QEAPQ 234
Query: 238 ECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYSEA 290
+C + +DNI I K T LKN L D ++ A
Sbjct: 235 DCVHVTQNFVDIVDNI-------IIHGKNANTIKELKNLFGLGRLRDADFANA 280
>gi|303319727|ref|XP_003069863.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240109549|gb|EER27718.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320034152|gb|EFW16097.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 554
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 103/428 (24%), Positives = 174/428 (40%), Gaps = 71/428 (16%)
Query: 55 FYNQTIDHFSYGPESYQTFPQRYVINSKFWGGGNSPILAFLGAEA---PIDDN--IQLSG 109
+ N IDH + G + + RY +N +++ G P++ F E D + +
Sbjct: 71 YANIPIDHRNPG----RMYRNRYWVNDQYYQPG-GPVVIFDTGETNGQAFADYYLVDPTS 125
Query: 110 FTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALAD--YASILLH 167
+ + +F + ++ EHRYYG+S+P+ +A Y QAL D Y +
Sbjct: 126 YIVQLLREFHGVGLVWEHRYYGESLPYPVN--GQTSAAQFQYLTLEQALQDLPYFARTFR 183
Query: 168 IKDKYNA----THAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPW-- 221
NA P I +G SY G A + RLKYP + + +SSAP D++ +
Sbjct: 184 RPRLPNADLTPRSTPWIMVGGSYPGMRAAFSRLKYPDTIFAAFSSSAPAQARIDMSVYYE 243
Query: 222 TTYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNTTELKD 281
Y +V+ + + C + + ++ ID + P A + ++F +N+
Sbjct: 244 QVYRGLVAYGYGN----CTRDVNAAYRYIDAQLANPSTAAQIKRQFLGPGAEQNSNGDFT 299
Query: 282 GLDTVYSEAAQYDTPSNIPVKRICNAIEN------------APNCG-DDILCKIAA--GV 326
+ +Y+ A +N P + CN +E AP+ G ++ + AA G+
Sbjct: 300 AV-LLYNWATWQSFGANGPAGQFCNWLETDQYGRVAPAEGWAPSRGARSVVDRWAAWPGL 358
Query: 327 VEADS--LEYDGNNSRCYINEDRTGDESDE---GWEWQSCSEMVV-----PMGKDKNSMY 376
A + E + N + D + +D W WQ CS+ P + S Y
Sbjct: 359 SRAINSIFETNCNCPEETCSCDLSAPPADPLAISWSWQFCSQFGYFQYQNPRPHEIASRY 418
Query: 377 QPEPWNLTKYIK-NCKEQY--GVS-------PRPSWVLTYYGGHDIKLILRRSTSNIIFS 426
Q E YI+ NC Q+ GVS PR Y GG +++ SN+
Sbjct: 419 QTE-----AYIQDNCYRQFPDGVSSGHLPRRPRADATNNYTGGWNMR------PSNVFHG 467
Query: 427 NGMRDPFS 434
G DP++
Sbjct: 468 AGQYDPWT 475
>gi|294884993|ref|XP_002771171.1| Dipeptidyl-peptidase 2 precursor, putative [Perkinsus marinus ATCC
50983]
gi|239874541|gb|EER02987.1| Dipeptidyl-peptidase 2 precursor, putative [Perkinsus marinus ATCC
50983]
Length = 371
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 81/359 (22%), Positives = 153/359 (42%), Gaps = 37/359 (10%)
Query: 160 DYASILLH-IKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVIGSLASSAP-ILYYGD 217
D+A++L + ++ NA VIA+G SY G LA FRL+YP +V + ASS+P LY +
Sbjct: 2 DHATVLRYTLETVENAKRCRVIAVGGSYSGFLALAFRLRYPKLVYAAYASSSPGRLYSQE 61
Query: 218 ITPWT-TYHSVVSKDFRDTSEECFQTIKKSWSEIDNIASKPDGLAILSKKFKTCTPLKNT 276
+ + Y+S V+ C ++ K++ + + + + K C P
Sbjct: 62 ASRFDGRYYSRVTDAADSIRSNCSNSVIKAFDDFVHRYAGRVTFEQAKNELKICNPEVFG 121
Query: 277 TELKDGL----------DTVYSEAAQYDTPSNIPVKRICNAIENAPNCGDDILCKIAAGV 326
E DGL + + A Y SN P +C +E + G I + G
Sbjct: 122 RE--DGLFEELVQMVRMEFSGANMASYPPSSNSPTYMLCTMVEQSGIAG--IFEAMTQGR 177
Query: 327 VEADSLEYDGNNSRCYINEDRTGD-------ESDEGWEWQSCSEMVVPMGK--DKNSMYQ 377
D ++ + ++ + GD + W+WQ+C+++V + + M+
Sbjct: 178 RCVDVTQHLPSPNKHGVYSASCGDWTGCGVGRAGRSWDWQTCTQLVEHISSYGPPSDMFP 237
Query: 378 PEPWNLTKYIKN-CKEQYGVSPRPSWV-------LTYYGGHDIKLILRRSTSNIIFSNGM 429
P +++ +++ C+E +G + ++ L G D + L TS ++F NG
Sbjct: 238 PRRFSVDQWLNAYCEESFGNNVFHNFSDTTREHRLNDLWGFD-EATLPDITSRVLFVNGG 296
Query: 430 RDPFSRGGWVKTYHFFDLSFSQDLNLGSHCLDLDEAKKSDPDWLVQQRKTEVKIMQGWI 488
D ++ G T + D S + G+H ++ + +D ++ R I+ W+
Sbjct: 297 MDGWTAGA--VTRNLSDTIISLMIPSGAHHSEMKDPSDNDTSDMIAARDQIDDILSLWL 353
>gi|121717092|ref|XP_001276006.1| extracelular serine carboxypeptidase, putative [Aspergillus
clavatus NRRL 1]
gi|119404163|gb|EAW14580.1| extracelular serine carboxypeptidase, putative [Aspergillus
clavatus NRRL 1]
Length = 582
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 78/187 (41%), Gaps = 27/187 (14%)
Query: 38 RSRILEQKDSNHGFETFFYNQTIDHF----SYGPESYQTFPQRYVINSKFWGGGNSPILA 93
RS+ L + H F +DHF Y P S +F RY ++ + G P+
Sbjct: 47 RSQALSVSFAEHNFSV-----PVDHFHNESRYEPHSDDSFNLRYWFDASHYKEGG-PVFL 100
Query: 94 FLGAEAPIDDNIQL--SGFTYENAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGY 151
E D G + A + L VILEHRYYG+S PF ++N R +
Sbjct: 101 IAAGETDATDRFPFLSQGIVAQLAKTYNGLGVILEHRYYGESYPF--VNLTVENIR---F 155
Query: 152 FNSAQALADYASI--------LLHIKDKYNATHAPVIAIGASYGGELATWFRLKYPHVVI 203
++ QALADYA L H+ A P I G SY G + R YP +
Sbjct: 156 LSTEQALADYAHFASNVAFPGLEHL--NLTAGAVPWIGYGGSYAGAFVAFLRKVYPDIFF 213
Query: 204 GSLASSA 210
G ++SS
Sbjct: 214 GVVSSSG 220
>gi|157816660|gb|ABV82323.1| IP19978p [Drosophila melanogaster]
Length = 198
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 55/105 (52%), Gaps = 4/105 (3%)
Query: 56 YNQTIDHFSYGPESYQTFPQRYVINSKFWGGGN--SPILAFLGAEAPIDDNIQLSGFTYE 113
+ +DHFS+ + TF RY+ N F N +PI + G E I+ Q +GF +E
Sbjct: 39 FQVPLDHFSFLINA--TFNIRYLYNDSFVDKSNARTPIFFYTGNEGDIELFAQNTGFLWE 96
Query: 114 NAHQFKALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQAL 158
A + +AL++ EHRYYGKS+PFGS H YF Q +
Sbjct: 97 QAERQRALVIFAEHRYYGKSLPFGSSTFNTSLPEHLAYFTVEQPI 141
>gi|195497954|ref|XP_002096319.1| GE25606 [Drosophila yakuba]
gi|194182420|gb|EDW96031.1| GE25606 [Drosophila yakuba]
Length = 380
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 66/263 (25%), Positives = 107/263 (40%), Gaps = 33/263 (12%)
Query: 120 ALIVILEHRYYGKSVPFGSRKAALKNARHRGYFNSAQALADYASILLHIK-DKYNATHAP 178
++ EHRYYG S+PFG+ N + G Q+ AD A + H K + +
Sbjct: 7 GMLFYTEHRYYGLSLPFGNESYRPNNLKKLGL---HQSFADLAHFIRHQKLNSPEMKDSK 63
Query: 179 VIAIGASYGGELATWFRLKYPHVVIGSLASSAPILYYGDITPWTTYHSVVSKDFRDT-SE 237
VI +G SY G L W YP ++ S ASSAP+L D + Y +V K +
Sbjct: 64 VILVGGSYSGSLVAWMTQLYPDLIAASWASSAPLLAKAD---FFEYMEMVGKSINLSYGH 120
Query: 238 ECFQTIKKSWS---------EIDNIASKPDGLAILSKKFKTCTPLKNTTELKDGLDTVYS 288
C I++ + EI + +G K +++ PL + +GL ++
Sbjct: 121 NCSLRIERGFKFLVKLFDGDEIQELLYNLNG----CKGYRSKNPL-DRAAFFNGLGNYFA 175
Query: 289 EAAQYDTPSNIPVKRICNAIENAPNCGDDILCKI-------AAGVVEADSLEYDGNNSRC 341
Q + IP R+C + N + DD L I + G D ++ ++
Sbjct: 176 LVVQSYRSAYIP--RLCETLMNLGS--DDELAFIEFLKLLYSEGRRSIDCQDFGYSSMLE 231
Query: 342 YINEDRTGDESDEGWEWQSCSEM 364
+ D W +Q+C+E
Sbjct: 232 LFSGDSDESSETRAWFYQTCNEF 254
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.135 0.422
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 8,495,422,651
Number of Sequences: 23463169
Number of extensions: 377824859
Number of successful extensions: 752132
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 859
Number of HSP's successfully gapped in prelim test: 429
Number of HSP's that attempted gapping in prelim test: 747299
Number of HSP's gapped (non-prelim): 1612
length of query: 500
length of database: 8,064,228,071
effective HSP length: 147
effective length of query: 353
effective length of database: 8,910,109,524
effective search space: 3145268661972
effective search space used: 3145268661972
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)