BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 016937
(380 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224095007|ref|XP_002310325.1| predicted protein [Populus trichocarpa]
gi|222853228|gb|EEE90775.1| predicted protein [Populus trichocarpa]
Length = 515
Score = 593 bits (1530), Expect = e-167, Method: Compositional matrix adjust.
Identities = 279/356 (78%), Positives = 313/356 (87%), Gaps = 7/356 (1%)
Query: 28 PLSLAAQPSKF---RRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFAD 84
P +LA+QP +RAPRF+ K + P + Q Q+QQQYRYE++YF Q+LDHFSF +
Sbjct: 24 PPALASQPLNHLSSKRAPRFLSKHSY----PIKTQLQEQQQYRYESKYFYQQLDHFSFLN 79
Query: 85 LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
LP F QRYLINTDHW GP R GPIFLYCGNEGDIEWFAVN+GFVW+IAP FGAM++FPEH
Sbjct: 80 LPKFPQRYLINTDHWAGPERRGPIFLYCGNEGDIEWFAVNTGFVWEIAPLFGAMVLFPEH 139
Query: 145 RYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYG 204
RYYGESMPYG+ E AY+NA+TLSYLTAEQALADFAV IT+LK+NLSA+A PVVLFGGSYG
Sbjct: 140 RYYGESMPYGNREEAYKNASTLSYLTAEQALADFAVLITDLKRNLSAQACPVVLFGGSYG 199
Query: 205 GMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESW 264
GMLAAWMRLKYPH+AIGALASSAPILQFEDIVPPETFYNIVS+DFKRES SCFNTIKESW
Sbjct: 200 GMLAAWMRLKYPHVAIGALASSAPILQFEDIVPPETFYNIVSNDFKRESTSCFNTIKESW 259
Query: 265 GELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPG 324
L+S G K+NGL++LTKTFHLCREL STEDLA+WL+SAYSYLAMVDYPYPS FMMPLPG
Sbjct: 260 DALLSEGLKKNGLVQLTKTFHLCRELKSTEDLANWLDSAYSYLAMVDYPYPSSFMMPLPG 319
Query: 325 YPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
YPI EVCK+ID PD TSILERIFEG+S+YYNYTG + CF+LDDDPHGLDGWNWQ
Sbjct: 320 YPIGEVCKRIDGCPDGTSILERIFEGISIYYNYTGELHCFELDDDPHGLDGWNWQA 375
>gi|22328106|ref|NP_201377.2| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|95147306|gb|ABF57288.1| At5g65760 [Arabidopsis thaliana]
gi|110736177|dbj|BAF00060.1| lysosomal Pro-X carboxypeptidase [Arabidopsis thaliana]
gi|332010719|gb|AED98102.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 515
Score = 581 bits (1497), Expect = e-163, Method: Compositional matrix adjust.
Identities = 267/339 (78%), Positives = 302/339 (89%)
Query: 42 PRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVG 101
PRF E +Q R + +YRYET++F Q+LDHFSFADLP FSQRYLIN+DHW+G
Sbjct: 32 PRFPRYTFQNREARIQQFRGDRNEYRYETKFFSQQLDHFSFADLPKFSQRYLINSDHWLG 91
Query: 102 PNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQ 161
+ LGPIFLYCGNEGDIEWFA NSGF+WDIAP+FGA+LVFPEHRYYGESMPYGS E AY+
Sbjct: 92 ASALGPIFLYCGNEGDIEWFATNSGFIWDIAPKFGALLVFPEHRYYGESMPYGSREEAYK 151
Query: 162 NATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIG 221
NATTLSYLT EQALADFAVF+T+LK+NLSAEA PVVLFGGSYGGMLAAWMRLKYPHIAIG
Sbjct: 152 NATTLSYLTTEQALADFAVFVTDLKRNLSAEACPVVLFGGSYGGMLAAWMRLKYPHIAIG 211
Query: 222 ALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELT 281
ALASSAPILQFED+VPPETFY+I S+DFKRES+SCFNTIK+SW +++ GQKENGLL+LT
Sbjct: 212 ALASSAPILQFEDVVPPETFYDIASNDFKRESSSCFNTIKDSWDAIIAEGQKENGLLQLT 271
Query: 282 KTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDAT 341
KTFH CR LNST+DL+DWL+SAYSYLAMVDYPYP+DFMMPLPG+PIREVC+KID A
Sbjct: 272 KTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPYPADFMMPLPGHPIREVCRKIDGAGSNA 331
Query: 342 SILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
SIL+RI+ G+SVYYNYTGNVDCF+LDDDPHGLDGWNWQ
Sbjct: 332 SILDRIYAGISVYYNYTGNVDCFKLDDDPHGLDGWNWQA 370
>gi|18700101|gb|AAL77662.1| AT5g65760/MPA24_11 [Arabidopsis thaliana]
Length = 491
Score = 580 bits (1495), Expect = e-163, Method: Compositional matrix adjust.
Identities = 267/339 (78%), Positives = 302/339 (89%)
Query: 42 PRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVG 101
PRF E +Q R + +YRYET++F Q+LDHFSFADLP FSQRYLIN+DHW+G
Sbjct: 32 PRFPRYTFQNREARIQQFRGDRNEYRYETKFFSQQLDHFSFADLPKFSQRYLINSDHWLG 91
Query: 102 PNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQ 161
+ LGPIFLYCGNEGDIEWFA NSGF+WDIAP+FGA+LVFPEHRYYGESMPYGS E AY+
Sbjct: 92 ASALGPIFLYCGNEGDIEWFATNSGFIWDIAPKFGALLVFPEHRYYGESMPYGSREEAYK 151
Query: 162 NATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIG 221
NATTLSYLT EQALADFAVF+T+LK+NLSAEA PVVLFGGSYGGMLAAWMRLKYPHIAIG
Sbjct: 152 NATTLSYLTTEQALADFAVFVTDLKRNLSAEACPVVLFGGSYGGMLAAWMRLKYPHIAIG 211
Query: 222 ALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELT 281
ALASSAPILQFED+VPPETFY+I S+DFKRES+SCFNTIK+SW +++ GQKENGLL+LT
Sbjct: 212 ALASSAPILQFEDVVPPETFYDIASNDFKRESSSCFNTIKDSWDAIIAEGQKENGLLQLT 271
Query: 282 KTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDAT 341
KTFH CR LNST+DL+DWL+SAYSYLAMVDYPYP+DFMMPLPG+PIREVC+KID A
Sbjct: 272 KTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPYPADFMMPLPGHPIREVCRKIDGAGSNA 331
Query: 342 SILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
SIL+RI+ G+SVYYNYTGNVDCF+LDDDPHGLDGWNWQ
Sbjct: 332 SILDRIYAGISVYYNYTGNVDCFKLDDDPHGLDGWNWQA 370
>gi|2827710|emb|CAA16683.1| lysosomal Pro-X carboxypeptidase - like protein [Arabidopsis
thaliana]
Length = 499
Score = 573 bits (1477), Expect = e-161, Method: Compositional matrix adjust.
Identities = 266/352 (75%), Positives = 302/352 (85%), Gaps = 14/352 (3%)
Query: 42 PRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVG 101
PRF E +Q R + +YRYET++F Q+LDHFSFADLP FSQRYLIN+DHW+G
Sbjct: 32 PRFPRYTFQNREARIQQFRGDRNEYRYETKFFSQQLDHFSFADLPKFSQRYLINSDHWLG 91
Query: 102 PNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQ 161
+ LGPIFLYCGNEGDIEWFA NSGF+WDIAP+FGA+LVFPEHRYYGESMPYGS E AY+
Sbjct: 92 ASALGPIFLYCGNEGDIEWFATNSGFIWDIAPKFGALLVFPEHRYYGESMPYGSREEAYK 151
Query: 162 NATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGG--------------ML 207
NATTLSYLT EQALADFAVF+T+LK+NLSAEA PVVLFGGSYGG +L
Sbjct: 152 NATTLSYLTTEQALADFAVFVTDLKRNLSAEACPVVLFGGSYGGSNNCVFVFVVIDATVL 211
Query: 208 AAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGEL 267
AAWMRLKYPHIAIGALASSAPILQFED+VPPETFY+I S+DFKRES+SCFNTIK+SW +
Sbjct: 212 AAWMRLKYPHIAIGALASSAPILQFEDVVPPETFYDIASNDFKRESSSCFNTIKDSWDAI 271
Query: 268 VSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPI 327
++ GQKENGLL+LTKTFH CR LNST+DL+DWL+SAYSYLAMVDYPYP+DFMMPLPG+PI
Sbjct: 272 IAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPYPADFMMPLPGHPI 331
Query: 328 REVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQ 379
REVC+KID A SIL+RI+ G+SVYYNYTGNVDCF+LDDDPHGLDGWNWQ
Sbjct: 332 REVCRKIDGAGSNASILDRIYAGISVYYNYTGNVDCFKLDDDPHGLDGWNWQ 383
>gi|10177334|dbj|BAB10683.1| lysosomal Pro-X carboxypeptidase [Arabidopsis thaliana]
Length = 529
Score = 570 bits (1468), Expect = e-160, Method: Compositional matrix adjust.
Identities = 266/353 (75%), Positives = 302/353 (85%), Gaps = 14/353 (3%)
Query: 42 PRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVG 101
PRF E +Q R + +YRYET++F Q+LDHFSFADLP FSQRYLIN+DHW+G
Sbjct: 32 PRFPRYTFQNREARIQQFRGDRNEYRYETKFFSQQLDHFSFADLPKFSQRYLINSDHWLG 91
Query: 102 PNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQ 161
+ LGPIFLYCGNEGDIEWFA NSGF+WDIAP+FGA+LVFPEHRYYGESMPYGS E AY+
Sbjct: 92 ASALGPIFLYCGNEGDIEWFATNSGFIWDIAPKFGALLVFPEHRYYGESMPYGSREEAYK 151
Query: 162 NATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGG--------------ML 207
NATTLSYLT EQALADFAVF+T+LK+NLSAEA PVVLFGGSYGG +L
Sbjct: 152 NATTLSYLTTEQALADFAVFVTDLKRNLSAEACPVVLFGGSYGGSNNCVFVFVVIDATVL 211
Query: 208 AAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGEL 267
AAWMRLKYPHIAIGALASSAPILQFED+VPPETFY+I S+DFKRES+SCFNTIK+SW +
Sbjct: 212 AAWMRLKYPHIAIGALASSAPILQFEDVVPPETFYDIASNDFKRESSSCFNTIKDSWDAI 271
Query: 268 VSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPI 327
++ GQKENGLL+LTKTFH CR LNST+DL+DWL+SAYSYLAMVDYPYP+DFMMPLPG+PI
Sbjct: 272 IAEGQKENGLLQLTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPYPADFMMPLPGHPI 331
Query: 328 REVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
REVC+KID A SIL+RI+ G+SVYYNYTGNVDCF+LDDDPHGLDGWNWQ
Sbjct: 332 REVCRKIDGAGSNASILDRIYAGISVYYNYTGNVDCFKLDDDPHGLDGWNWQA 384
>gi|359497044|ref|XP_002263389.2| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
gi|296085719|emb|CBI29519.3| unnamed protein product [Vitis vinifera]
Length = 510
Score = 565 bits (1455), Expect = e-158, Method: Compositional matrix adjust.
Identities = 263/364 (72%), Positives = 309/364 (84%), Gaps = 12/364 (3%)
Query: 20 IVIISILSPLSLAAQPSK---FRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQR 76
I I+ +P ++A++ + + PRF+GK + P R + ++YETRYFEQR
Sbjct: 16 IFIVLTSAPPTIASEAATKGYSKSIPRFLGKFAY----PNRGK-----PFQYETRYFEQR 66
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
LDHFS ADLP F QRYLI+T HW GP+R+GPIFLYCGNEGDIEWFA N+GFVWD+APRFG
Sbjct: 67 LDHFSIADLPKFRQRYLISTRHWTGPDRMGPIFLYCGNEGDIEWFAANTGFVWDMAPRFG 126
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPV 196
AM++FPEHRYYGESMPYGS + AY NA +LSYLTAEQALADFAV +TNLK+NLSAE PV
Sbjct: 127 AMVLFPEHRYYGESMPYGSRDKAYANAASLSYLTAEQALADFAVLVTNLKRNLSAEGCPV 186
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASC 256
VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY+IVS++FKRES SC
Sbjct: 187 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYDIVSNNFKRESISC 246
Query: 257 FNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPS 316
F+TIK+SW L+S GQK +GL +LTK F LCR+L TEDL DWL+SAYS+LAMV+YPYPS
Sbjct: 247 FDTIKKSWDVLISEGQKNDGLKQLTKAFRLCRDLKRTEDLYDWLDSAYSFLAMVNYPYPS 306
Query: 317 DFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGW 376
DF+MPLPG+PI+EVC+K+D+ P+ TS+LERIFEGVSVYYNYTG V+CFQLDDDPHG+DGW
Sbjct: 307 DFLMPLPGHPIKEVCRKMDSCPEGTSVLERIFEGVSVYYNYTGKVECFQLDDDPHGMDGW 366
Query: 377 NWQV 380
NWQ
Sbjct: 367 NWQA 370
>gi|297794189|ref|XP_002864979.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297310814|gb|EFH41238.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 514
Score = 558 bits (1438), Expect = e-156, Method: Compositional matrix adjust.
Identities = 260/341 (76%), Positives = 296/341 (86%), Gaps = 4/341 (1%)
Query: 40 RAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHW 99
R PR+ + +Q R + +YRYET++F Q+LDHFSFADLP F QRYLIN+D+W
Sbjct: 33 RFPRYTSR----NRGRIQQFRGDRNEYRYETKFFSQQLDHFSFADLPKFPQRYLINSDYW 88
Query: 100 VGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVA 159
+G + LGPIFLYCGNEGDIEWFA NSGF+WDIAP+FGA+LVFPE R MPYGS E A
Sbjct: 89 LGASALGPIFLYCGNEGDIEWFATNSGFIWDIAPKFGALLVFPEVRSCLFCMPYGSMEEA 148
Query: 160 YQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIA 219
Y+NATTLSYLT EQALADFAVF+T+LK+NLSAEA PVVLFGGSYGGMLAAWMRLKYPHIA
Sbjct: 149 YKNATTLSYLTTEQALADFAVFVTDLKRNLSAEACPVVLFGGSYGGMLAAWMRLKYPHIA 208
Query: 220 IGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLE 279
IGALASSAPILQFEDIVPPETFY+I S+DFKRES+SCFNTIK+SW +++ GQKENGLL+
Sbjct: 209 IGALASSAPILQFEDIVPPETFYDIASNDFKRESSSCFNTIKDSWDAIIAEGQKENGLLQ 268
Query: 280 LTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPD 339
LTKTFH CR LNST+DL+DWL+SAYSYLAMVDYPYP+DFMMPLPG+PIREVC+KID A
Sbjct: 269 LTKTFHFCRVLNSTDDLSDWLDSAYSYLAMVDYPYPADFMMPLPGHPIREVCRKIDGAHS 328
Query: 340 ATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
SIL+RIF G+SVYYNYTGNVDCF+LDDDPHGLDGWNWQ
Sbjct: 329 DASILDRIFAGISVYYNYTGNVDCFKLDDDPHGLDGWNWQA 369
>gi|356563482|ref|XP_003549991.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 513
Score = 545 bits (1404), Expect = e-152, Method: Compositional matrix adjust.
Identities = 258/376 (68%), Positives = 305/376 (81%), Gaps = 5/376 (1%)
Query: 6 QQNQNSLYLSP-VITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQ 64
++++ SL +S V T+ II I+ LS AQP +P+F+GK +
Sbjct: 2 EEHRLSLRMSTLVFTLSIIIIV--LSYPAQPLALNHSPKFLGKFAATAR--THSNSEPPP 57
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
Q+ YE RYF+QRLDHFSF++LPTF QRYLI+T+HWVGP+RLGPIF YCGNEGDIEWFA N
Sbjct: 58 QFHYEKRYFQQRLDHFSFSELPTFPQRYLISTEHWVGPHRLGPIFFYCGNEGDIEWFAQN 117
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+GFVW+IAPRFGAM+VFPEHRYYGES+PYGS E AY+NATTLSYLTAEQALADF+V IT
Sbjct: 118 TGFVWEIAPRFGAMVVFPEHRYYGESVPYGSAEEAYKNATTLSYLTAEQALADFSVLITY 177
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
LK N SA+ PVVLFGGSYGGMLAAWMRLKYPHIA+GALASSAPILQFEDIVPPETFY++
Sbjct: 178 LKHNYSAKDCPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPPETFYDL 237
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY 304
VS+ FKRES +CFN IK+SW E+ S GQ NGL LTKTF+LC++L T+DL DW E+AY
Sbjct: 238 VSNAFKRESFTCFNYIKQSWNEIASTGQTNNGLELLTKTFNLCQKLKRTKDLYDWAEAAY 297
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF 364
SYLAMV+YPYP++FMM LP +PIREVC++ID P TSILERI+EGV+VYYNYTG CF
Sbjct: 298 SYLAMVNYPYPAEFMMTLPEHPIREVCRRIDGGPAGTSILERIYEGVNVYYNYTGEAKCF 357
Query: 365 QLDDDPHGLDGWNWQV 380
+LDDDPHG+ GW WQ
Sbjct: 358 ELDDDPHGMSGWEWQA 373
>gi|356514372|ref|XP_003525880.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Glycine max]
Length = 597
Score = 500 bits (1287), Expect = e-139, Method: Compositional matrix adjust.
Identities = 234/360 (65%), Positives = 284/360 (78%), Gaps = 9/360 (2%)
Query: 20 IVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDH 79
+ +S++ LS +AQP + P+F+GK R + Q+ YETR +Q LDH
Sbjct: 85 VFTLSVIIVLSYSAQPLALKHWPKFLGKFA----ATARTHSEPPPQFHYETRCIQQSLDH 140
Query: 80 FSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAML 139
FSF++LPTF QRYLI+T+HWVGP RLGP+F Y GNE DIEWFA N+G VW+IAPRFGAM+
Sbjct: 141 FSFSELPTFPQRYLISTEHWVGPRRLGPVFFYSGNEDDIEWFAQNTGVVWEIAPRFGAMV 200
Query: 140 VFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLF 199
VFPEH+YYGES+PYGS E AY+N TTLSYLT+EQAL DF+V I +LK N S + PV LF
Sbjct: 201 VFPEHQYYGESVPYGSAEEAYKNVTTLSYLTSEQALVDFSVVIADLKHNFSTKDCPVFLF 260
Query: 200 GGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNT 259
GGSYGGMLAAWMRLKYPH+A+GALASSAPILQFEDIVPPETFY++VS+ FKRES CFN
Sbjct: 261 GGSYGGMLAAWMRLKYPHVAVGALASSAPILQFEDIVPPETFYDLVSNAFKRESFICFNY 320
Query: 260 IKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFM 319
IK+SW E+ S GQ NGL LTKTF+LC++LN T+DL DW+E+AYSYLAMV+YPYP++FM
Sbjct: 321 IKQSWNEMASAGQTNNGLELLTKTFNLCQKLNRTKDLYDWVEAAYSYLAMVNYPYPAEFM 380
Query: 320 MPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQ 379
M LP +PIREV + N+ ILERI+EGV+VYYNYTG CF+LDDDPHG+ GW+WQ
Sbjct: 381 MTLPEHPIREV-SMVSNS----YILERIYEGVNVYYNYTGEAKCFELDDDPHGMSGWDWQ 435
>gi|148909204|gb|ABR17702.1| unknown [Picea sitchensis]
Length = 509
Score = 497 bits (1280), Expect = e-138, Method: Compositional matrix adjust.
Identities = 233/358 (65%), Positives = 284/358 (79%), Gaps = 7/358 (1%)
Query: 25 ILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFAD 84
++S ++ A +P R PR +G+ L + + Q QY YET+YF QRLDHFSF +
Sbjct: 16 MMSCVADAGRPYFLR--PRTLGEFSTLN---REKSFLQSSQYEYETKYFTQRLDHFSFKN 70
Query: 85 LP--TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFP 142
TF QRYLIN +W+G R+GPIF YCGNEG I+WFAVN+GF+WDIAP+FGA+LVFP
Sbjct: 71 HKNSTFQQRYLINDKYWLGAERMGPIFYYCGNEGYIDWFAVNTGFMWDIAPQFGALLVFP 130
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGS 202
EHRYYGESMPYGS +AY++ +LSYLTAEQALADFA I +LK+NLSAEA PVVLFGGS
Sbjct: 131 EHRYYGESMPYGSQSMAYKDGDSLSYLTAEQALADFATLIVDLKKNLSAEACPVVLFGGS 190
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKE 262
YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP +TFYN+VS+DFKRES +CF I++
Sbjct: 191 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPSDTFYNLVSNDFKRESENCFKVIQQ 250
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPL 322
SW L + G+++ GL L+K F +CR+LNST+++ DWL SAYS LAMVDYPYP+ F+MPL
Sbjct: 251 SWKALETYGERDEGLQNLSKKFRMCRDLNSTDEIEDWLNSAYSNLAMVDYPYPASFLMPL 310
Query: 323 PGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
P YPI+EVCK ID+ + T +L+RIF GVSVYYNYTG CF ++DDPHG +GWNWQ
Sbjct: 311 PAYPIKEVCKVIDSFSNGTDVLDRIFAGVSVYYNYTGEEKCFDVNDDPHGENGWNWQA 368
>gi|115440161|ref|NP_001044360.1| Os01g0767100 [Oryza sativa Japonica Group]
gi|53792444|dbj|BAD53352.1| putative prolylcarboxypeptidase, isoform 1 [Oryza sativa Japonica
Group]
gi|53793554|dbj|BAD53324.1| putative prolylcarboxypeptidase, isoform 1 [Oryza sativa Japonica
Group]
gi|113533891|dbj|BAF06274.1| Os01g0767100 [Oryza sativa Japonica Group]
Length = 517
Score = 468 bits (1205), Expect = e-129, Method: Compositional matrix adjust.
Identities = 220/323 (68%), Positives = 260/323 (80%), Gaps = 7/323 (2%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPT------FSQRYLINTDHWVGPNRLGPIFLYCGNEGD 117
+ Y YETRYF QRLDHFSF + F QRYL+ GPIF YCGNEGD
Sbjct: 44 RGYDYETRYFRQRLDHFSFLEEEGEEGDGFFQQRYLVGRGG-GWAGAGGPIFFYCGNEGD 102
Query: 118 IEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALAD 177
I WFA NSG VW+ A RF A++VF EHRYYGESMP+GS + AY N+ +L+YLTAEQALAD
Sbjct: 103 IAWFAANSGLVWEAATRFAALVVFAEHRYYGESMPFGSKDKAYNNSKSLAYLTAEQALAD 162
Query: 178 FAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP 237
+AV +T+LK+NLS+E SPVVLFGGSYGGMLAAWMRLKYPHIA+GALASSAPILQFED+VP
Sbjct: 163 YAVLLTDLKKNLSSEGSPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDVVP 222
Query: 238 PETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA 297
FY++VS+DFKRES SCF TIK+SW L + G ++GLL+L+KTFHLC+ + +T +L+
Sbjct: 223 STIFYDLVSNDFKRESLSCFQTIKDSWKALDAQGNGQDGLLKLSKTFHLCKTIKNTGELS 282
Query: 298 DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNY 357
DWL SAYSYLAMVDYP P+DFMMPLPG PI+E+C KIDN PD TSILERI+ GV+VYYNY
Sbjct: 283 DWLSSAYSYLAMVDYPMPADFMMPLPGNPIKELCTKIDNQPDGTSILERIYAGVNVYYNY 342
Query: 358 TGNVDCFQLDDDPHGLDGWNWQV 380
TG VDCF L+DDPHG+DGW+WQ
Sbjct: 343 TGTVDCFDLNDDPHGMDGWDWQA 365
>gi|326489169|dbj|BAK01568.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 514
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 216/316 (68%), Positives = 258/316 (81%), Gaps = 2/316 (0%)
Query: 66 YRYETRYFEQRLDHFSFA-DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
Y +ETRYF QRLDHFSF+ + F QRYL+ GPIF YCGNEGDI WFA N
Sbjct: 50 YEFETRYFRQRLDHFSFSGEEEFFQQRYLVGRAG-GWAGPGGPIFFYCGNEGDIAWFAAN 108
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
SG VW+ APRF A++VF EHRYYGESMP+GS E AY ++ +++YLTAEQALAD+AV +T+
Sbjct: 109 SGLVWEAAPRFAALVVFAEHRYYGESMPFGSKEKAYNSSRSMAYLTAEQALADYAVLLTD 168
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
LK+NLS+E+SPVVLFGGSYGGMLAAWMRLKYPHIA+GALASSAPILQFEDIVP FY++
Sbjct: 169 LKRNLSSESSPVVLFGGSYGGMLAAWMRLKYPHIAVGALASSAPILQFEDIVPDTIFYDL 228
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY 304
VS+DFKRES SCF TIK+SW EL G ++GLL+L+KTFHLC+ LN+T L+DWL SAY
Sbjct: 229 VSNDFKRESLSCFQTIKDSWKELDEQGNGQDGLLKLSKTFHLCQTLNTTGALSDWLSSAY 288
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF 364
SYLAMVDYP PS+F+MPLP PI+EVC+ ID P+ +SILERI+ GV++YYNYTG V CF
Sbjct: 289 SYLAMVDYPMPSEFLMPLPANPIKEVCRNIDKQPERSSILERIYAGVNIYYNYTGTVHCF 348
Query: 365 QLDDDPHGLDGWNWQV 380
LDDDPHG+ GW+WQ
Sbjct: 349 DLDDDPHGMGGWDWQA 364
>gi|357136615|ref|XP_003569899.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Brachypodium
distachyon]
Length = 512
Score = 456 bits (1173), Expect = e-126, Method: Compositional matrix adjust.
Identities = 222/340 (65%), Positives = 261/340 (76%), Gaps = 7/340 (2%)
Query: 42 PRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFAD-LPTFSQRYLINTDHWV 100
PRF G PH + R Y ++TRYF QRLDHFSF+ +F QRYL+
Sbjct: 28 PRFPG--PHAR---RAGVRDGVGGYEFDTRYFRQRLDHFSFSGGEESFQQRYLVGRAG-G 81
Query: 101 GPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAY 160
GPIF YCGNEGDI WFA NSG VW+ APRF A++VF EHRYYGESMP+GS E AY
Sbjct: 82 WAGPGGPIFFYCGNEGDIAWFAANSGLVWEAAPRFAALVVFAEHRYYGESMPFGSKEKAY 141
Query: 161 QNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAI 220
N+ +L+YLTAEQA+AD+AV +T+LK+NLS+E+SPVVLFGGSYGGMLAAWMRLKYPHIA+
Sbjct: 142 NNSRSLAYLTAEQAIADYAVLLTDLKRNLSSESSPVVLFGGSYGGMLAAWMRLKYPHIAV 201
Query: 221 GALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLEL 280
GALASSAPILQFEDIVP FY+IVS+DFKRES SCF TIK+SW EL G ++GLL+L
Sbjct: 202 GALASSAPILQFEDIVPQTIFYDIVSNDFKRESLSCFQTIKDSWKELDEQGNGQDGLLKL 261
Query: 281 TKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDA 340
+KTFHLC+ L +T L+DWL SAY +LAMVDYP SDF+MPLP PI+EVC+ ID D
Sbjct: 262 SKTFHLCQTLKTTGGLSDWLNSAYGFLAMVDYPMSSDFLMPLPSNPIKEVCRNIDKQLDG 321
Query: 341 TSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
TS LERI+ GV++YYNYTG VDCF LDDDPHG+ GW WQ
Sbjct: 322 TSTLERIYAGVNIYYNYTGTVDCFDLDDDPHGMGGWQWQA 361
>gi|413952315|gb|AFW84964.1| putative serine peptidase S28 family protein [Zea mays]
Length = 517
Score = 456 bits (1172), Expect = e-125, Method: Compositional matrix adjust.
Identities = 217/325 (66%), Positives = 253/325 (77%), Gaps = 10/325 (3%)
Query: 65 QYRYETRYFEQRLDHFSFADLPT---------FSQRYLINTDHWVGPNRLGPIFLYCGNE 115
Y YETRYF QRLDHFSF + F QRYL+ GPIF YCGNE
Sbjct: 45 DYEYETRYFRQRLDHFSFPGVGDEDEDEAAAFFQQRYLVGRGG-GWAGPGGPIFFYCGNE 103
Query: 116 GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQAL 175
GDI WFA NSG VW+ APRF A++VF EHRYYGESMP+GS AY ++ +L+YLTAEQAL
Sbjct: 104 GDIAWFAANSGLVWEAAPRFAALVVFAEHRYYGESMPFGSKAKAYSDSKSLAYLTAEQAL 163
Query: 176 ADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
ADFAV +T+LK+NLSAE SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI
Sbjct: 164 ADFAVLLTDLKRNLSAEGSPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 223
Query: 236 VPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED 295
VP FY++VS DF+RES SCF TIK+SW EL +++GLL+L+KTFHLC+ L ++ D
Sbjct: 224 VPSTIFYDLVSDDFRRESLSCFLTIKDSWKELDDQANEQDGLLKLSKTFHLCQTLKTSGD 283
Query: 296 LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYY 355
L+DWL SAYSYLAMVDYP PS+F+MPLP PI+EVC+ ID+ P T LERI+ GV+VYY
Sbjct: 284 LSDWLSSAYSYLAMVDYPLPSEFLMPLPANPIKEVCRNIDSQPKGTGTLERIYAGVNVYY 343
Query: 356 NYTGNVDCFQLDDDPHGLDGWNWQV 380
NYTG VDCF L+DDPHG+ GW+WQ
Sbjct: 344 NYTGTVDCFDLNDDPHGMGGWDWQA 368
>gi|242058825|ref|XP_002458558.1| hypothetical protein SORBIDRAFT_03g035680 [Sorghum bicolor]
gi|241930533|gb|EES03678.1| hypothetical protein SORBIDRAFT_03g035680 [Sorghum bicolor]
Length = 518
Score = 456 bits (1172), Expect = e-125, Method: Compositional matrix adjust.
Identities = 218/338 (64%), Positives = 259/338 (76%), Gaps = 13/338 (3%)
Query: 55 PQRQQRQQQQ-----QYRYETRYFEQRLDHFSFADLPT-------FSQRYLINTDHWVGP 102
P+ Q R + Y YETRYF QRLDHFSF + F QRYL+
Sbjct: 33 PRHQHRSRPGVNGVGDYEYETRYFRQRLDHFSFPGVADEDEAAAFFQQRYLVGRGG-GWA 91
Query: 103 NRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQN 162
GPIF YCGNEGDI WFA NSG VW+ APRF A++VF EHRYYGESMP+GS AY +
Sbjct: 92 GPGGPIFFYCGNEGDIAWFASNSGLVWEAAPRFAALVVFAEHRYYGESMPFGSKAKAYND 151
Query: 163 ATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGA 222
+ +L+YLTAEQALADFAV +T+LK+NLSAE SPVVLFGGSYGGMLAAWMRLKYPHI++GA
Sbjct: 152 SKSLAYLTAEQALADFAVLLTDLKRNLSAEGSPVVLFGGSYGGMLAAWMRLKYPHISVGA 211
Query: 223 LASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK 282
L+SSAPILQFEDIVP FY++VS DF+RES SCF TIK+SW EL G ++ GLL+L+K
Sbjct: 212 LSSSAPILQFEDIVPSTIFYDLVSDDFRRESLSCFLTIKDSWKELDDQGNEQGGLLKLSK 271
Query: 283 TFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS 342
TFHLC+ L ++ DL+DWL SAYSYLAMVDYP PS+F+MPLP PI+EVC+ ID+ P+ TS
Sbjct: 272 TFHLCQTLKTSGDLSDWLSSAYSYLAMVDYPLPSEFLMPLPANPIKEVCRNIDSQPEGTS 331
Query: 343 ILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
LERI+ GV+VYYNYTG V CF L+DDPHG+ GW+WQ
Sbjct: 332 TLERIYAGVNVYYNYTGTVGCFDLNDDPHGMGGWDWQA 369
>gi|125572151|gb|EAZ13666.1| hypothetical protein OsJ_03583 [Oryza sativa Japonica Group]
Length = 549
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 220/355 (61%), Positives = 260/355 (73%), Gaps = 39/355 (10%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPT------FSQRYLINTDHWVGPNRLGPIFLYCGNEGD 117
+ Y YETRYF QRLDHFSF + F QRYL+ GPIF YCGNEGD
Sbjct: 44 RGYDYETRYFRQRLDHFSFLEEEGEEGDGFFQQRYLVGRGG-GWAGAGGPIFFYCGNEGD 102
Query: 118 IEWFAVNSGFVWDIAPRFGAMLVFPE--------------------------------HR 145
I WFA NSG VW+ A RF A++VF E HR
Sbjct: 103 IAWFAANSGLVWEAATRFAALVVFAEANLSSHLAPVSVFFLGCALIVRDPIPFTCSLQHR 162
Query: 146 YYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGG 205
YYGESMP+GS + AY N+ +L+YLTAEQALAD+AV +T+LK+NLS+E SPVVLFGGSYGG
Sbjct: 163 YYGESMPFGSKDKAYNNSKSLAYLTAEQALADYAVLLTDLKKNLSSEGSPVVLFGGSYGG 222
Query: 206 MLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWG 265
MLAAWMRLKYPHIA+GALASSAPILQFED+VP FY++VS+DFKRES SCF TIK+SW
Sbjct: 223 MLAAWMRLKYPHIAVGALASSAPILQFEDVVPSTIFYDLVSNDFKRESLSCFQTIKDSWK 282
Query: 266 ELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGY 325
L + G ++GLL+L+KTFHLC+ + +T +L+DWL SAYSYLAMVDYP P+DFMMPLPG
Sbjct: 283 ALDAQGNGQDGLLKLSKTFHLCKTIKNTGELSDWLSSAYSYLAMVDYPMPADFMMPLPGN 342
Query: 326 PIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
PI+E+C KIDN PD TSILERI+ GV+VYYNYTG VDCF L+DDPHG+DGW+WQ
Sbjct: 343 PIKELCTKIDNQPDGTSILERIYAGVNVYYNYTGTVDCFDLNDDPHGMDGWDWQA 397
>gi|125527844|gb|EAY75958.1| hypothetical protein OsI_03875 [Oryza sativa Indica Group]
Length = 549
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 219/355 (61%), Positives = 259/355 (72%), Gaps = 39/355 (10%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPT------FSQRYLINTDHWVGPNRLGPIFLYCGNEGD 117
+ Y YETRYF QRLDHFSF + F QRYL+ GPIF YCGNEGD
Sbjct: 44 RGYDYETRYFRQRLDHFSFLEEEGEEGDGFFQQRYLVGRGG-GWAGAGGPIFFYCGNEGD 102
Query: 118 IEWFAVNSGFVWDIAPRFGAMLVFPE--------------------------------HR 145
I WFA NSG VW+ A RF A++VF E HR
Sbjct: 103 IAWFAANSGLVWEAATRFAALVVFAEANLSSHLAPVSVFFLGCALIVRDPIPFTCSLQHR 162
Query: 146 YYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGG 205
YYGESMP+GS + AY N+ +L+YLTAEQALAD+AV +T+LK+NLS+E SPVVLFGGSYGG
Sbjct: 163 YYGESMPFGSKDKAYNNSKSLAYLTAEQALADYAVLLTDLKKNLSSEGSPVVLFGGSYGG 222
Query: 206 MLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWG 265
MLAAWMRLKYPHIA+GALASSAPILQFED+VP FY++VS+DFKRES CF TIK+SW
Sbjct: 223 MLAAWMRLKYPHIAVGALASSAPILQFEDVVPSTIFYDLVSNDFKRESLICFQTIKDSWK 282
Query: 266 ELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGY 325
L + G ++GLL+L+KTFHLC+ + +T +L+DWL SAYSYLAMVDYP P+DFMMPLPG
Sbjct: 283 ALDAQGNGQDGLLKLSKTFHLCKTIKNTGELSDWLSSAYSYLAMVDYPMPADFMMPLPGN 342
Query: 326 PIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
PI+E+C KIDN PD TSILERI+ GV+VYYNYTG VDCF L+DDPHG+DGW+WQ
Sbjct: 343 PIKELCTKIDNQPDGTSILERIYAGVNVYYNYTGTVDCFDLNDDPHGMDGWDWQA 397
>gi|168047270|ref|XP_001776094.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672604|gb|EDQ59139.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 505
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 200/320 (62%), Positives = 252/320 (78%), Gaps = 3/320 (0%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
Q Y YF Q +DHFSF +F QRYLI +W G GPIF+YCGNEGD+EWFA
Sbjct: 34 QSSVTYAVDYFTQVIDHFSFRREASFQQRYLIEKRYWKGAADRGPIFMYCGNEGDVEWFA 93
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
N+GF+W+IAP FGA+++FPEHRYYG+SMPYG+ E +Y++A +LS LT+EQALADFA +
Sbjct: 94 KNTGFLWEIAPSFGALILFPEHRYYGKSMPYGTMEASYKDADSLSTLTSEQALADFATLV 153
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
+LK+NL+A ASPVVLFGGSYGGMLA+WMRLKYPHIAIGA+A+SAPILQFEDIVP +TFY
Sbjct: 154 IDLKKNLTAAASPVVLFGGSYGGMLASWMRLKYPHIAIGAVAASAPILQFEDIVPSDTFY 213
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES 302
IVS+DFKRESASCFN I+ESWG + + K GL +L+ FH+CR+LN++ +L +WL S
Sbjct: 214 KIVSADFKRESASCFNYIRESWGVIDKIASKNGGLHDLSTQFHMCRDLNASWELENWLSS 273
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVD 362
AYSY+AMVDYP P++F+ PLP YP+RE+C+ ID+ P+ + IL RIF G SVYYNY+G +
Sbjct: 274 AYSYVAMVDYPIPTNFITPLPAYPVREICRVIDSLPEGSDILSRIFAGASVYYNYSGQAE 333
Query: 363 CFQLDD---DPHGLDGWNWQ 379
CFQ D D G+ GW+WQ
Sbjct: 334 CFQPSDPGNDDLGVTGWDWQ 353
>gi|413952316|gb|AFW84965.1| putative serine peptidase S28 family protein [Zea mays]
Length = 515
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 214/325 (65%), Positives = 250/325 (76%), Gaps = 12/325 (3%)
Query: 65 QYRYETRYFEQRLDHFSFADLPT---------FSQRYLINTDHWVGPNRLGPIFLYCGNE 115
Y YETRYF QRLDHFSF + F QRYL+ GPIF YCGNE
Sbjct: 45 DYEYETRYFRQRLDHFSFPGVGDEDEDEAAAFFQQRYLVGRGG-GWAGPGGPIFFYCGNE 103
Query: 116 GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQAL 175
GDI WFA NSG VW+ APRF A++VF EHRYYGESMP+GS AY ++ +L+YLTAEQAL
Sbjct: 104 GDIAWFAANSGLVWEAAPRFAALVVFAEHRYYGESMPFGSKAKAYSDSKSLAYLTAEQAL 163
Query: 176 ADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
ADFAV +T+LK+NLSAE SPVVLFGGSYGG AWMRLKYPHIAIGALASSAPILQFEDI
Sbjct: 164 ADFAVLLTDLKRNLSAEGSPVVLFGGSYGG--TAWMRLKYPHIAIGALASSAPILQFEDI 221
Query: 236 VPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED 295
VP FY++VS DF+RES SCF TIK+SW EL +++GLL+L+KTFHLC+ L ++ D
Sbjct: 222 VPSTIFYDLVSDDFRRESLSCFLTIKDSWKELDDQANEQDGLLKLSKTFHLCQTLKTSGD 281
Query: 296 LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYY 355
L+DWL SAYSYLAMVDYP PS+F+MPLP PI+EVC+ ID+ P T LERI+ GV+VYY
Sbjct: 282 LSDWLSSAYSYLAMVDYPLPSEFLMPLPANPIKEVCRNIDSQPKGTGTLERIYAGVNVYY 341
Query: 356 NYTGNVDCFQLDDDPHGLDGWNWQV 380
NYTG VDCF L+DDPHG+ GW+WQ
Sbjct: 342 NYTGTVDCFDLNDDPHGMGGWDWQA 366
>gi|302758120|ref|XP_002962483.1| hypothetical protein SELMODRAFT_78660 [Selaginella moellendorffii]
gi|300169344|gb|EFJ35946.1| hypothetical protein SELMODRAFT_78660 [Selaginella moellendorffii]
Length = 474
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 189/318 (59%), Positives = 243/318 (76%), Gaps = 6/318 (1%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
++ + Y YF+Q LDHF+ ++ F QRYL++ W G PIF+YCGNEGDI WFA
Sbjct: 15 KEGFAYTEHYFQQTLDHFNVGNITLFPQRYLLHNASWSGGASGAPIFVYCGNEGDIVWFA 74
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA-TTLSYLTAEQALADFAVF 181
N+GF++DIAP FGA+LVFPEHR+YG+S P+G QN L++ +AEQALADFA
Sbjct: 75 ENTGFMFDIAPLFGALLVFPEHRFYGKSQPFGG-----QNGPKELAFCSAEQALADFATL 129
Query: 182 ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
I +LK+NLSA+ASPVV+FGGSYGGMLAAW RLKYPHIAIGALASSAPILQFE+IVP TF
Sbjct: 130 ILDLKRNLSAQASPVVVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENIVPYTTF 189
Query: 242 YNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLE 301
Y+IVS+ FKRE CF I+ SW + +++NGL L++ FH+C + + ++L +WLE
Sbjct: 190 YDIVSNAFKREGEKCFEIIRNSWTAITEAAEQQNGLRNLSQDFHMCSDFKNADELINWLE 249
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
SAYSYLAM +YPY ++F MPLP +P+R+VC+ + N+P A+SIL+RI+ GV+VYYN+TG
Sbjct: 250 SAYSYLAMANYPYAANFTMPLPAHPVRKVCQAMVNSPVASSILQRIYAGVNVYYNFTGAA 309
Query: 362 DCFQLDDDPHGLDGWNWQ 379
CF LDDDPHGL GWNWQ
Sbjct: 310 KCFDLDDDPHGLSGWNWQ 327
>gi|302758752|ref|XP_002962799.1| hypothetical protein SELMODRAFT_78251 [Selaginella moellendorffii]
gi|300169660|gb|EFJ36262.1| hypothetical protein SELMODRAFT_78251 [Selaginella moellendorffii]
Length = 470
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 189/318 (59%), Positives = 243/318 (76%), Gaps = 6/318 (1%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
++ + Y YF+Q LDHF+ ++ F QRYL++ W G PIF+YCGNEGDI WFA
Sbjct: 15 KEGFAYTEHYFQQTLDHFNVGNITLFPQRYLLHNASWSGGASGAPIFVYCGNEGDIVWFA 74
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA-TTLSYLTAEQALADFAVF 181
N+GF++DIAP FGA+LVFPEHR+YG+S P+G QN L++ +AEQALADFA
Sbjct: 75 ENTGFMFDIAPLFGALLVFPEHRFYGKSQPFGG-----QNGPKELAFCSAEQALADFATL 129
Query: 182 ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
I +LK+NLSA+ASPVV+FGGSYGGMLAAW RLKYPHIAIGALASSAPILQFE+IVP T+
Sbjct: 130 ILDLKRNLSAQASPVVVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFENIVPYTTY 189
Query: 242 YNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLE 301
Y+IVS+ FKRE CF I+ SW + +++NGL L++ FH+C + + ++L +WLE
Sbjct: 190 YDIVSNAFKREGEECFEIIRNSWTAITEAAEQQNGLRNLSQDFHMCSDFKNADELINWLE 249
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
SAYSYLAM +YPY ++F MPLP YP+R+VC+ + N+P A+SIL+RI+ GV+VYYN+TG
Sbjct: 250 SAYSYLAMANYPYAANFTMPLPAYPVRKVCQAMVNSPVASSILQRIYAGVNVYYNFTGAA 309
Query: 362 DCFQLDDDPHGLDGWNWQ 379
CF LDDDPHGL GWNWQ
Sbjct: 310 KCFDLDDDPHGLSGWNWQ 327
>gi|326513984|dbj|BAJ92142.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 535
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 184/317 (58%), Positives = 236/317 (74%), Gaps = 6/317 (1%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ YF+Q LDHF+F + FSQ+YL+N W GP+F+Y GNEGDIEWFA N+
Sbjct: 85 FTAHYFQQELDHFTFTPNASNLFSQKYLLNDTFWRRKPAAGPLFVYTGNEGDIEWFATNT 144
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF++DIAP FGA+LVF EHR+YGES P+G+ +Y++A TL YLT+ QALADFAV IT+L
Sbjct: 145 GFLFDIAPDFGALLVFIEHRFYGESKPFGND--SYKSADTLGYLTSTQALADFAVLITSL 202
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K NLS +PVV+FGGSYGGMLA+W RLKYPH+A+GA+ASSAPILQF+DI P +FY+ V
Sbjct: 203 KHNLSTVDAPVVVFGGSYGGMLASWFRLKYPHVAMGAVASSAPILQFDDITPWSSFYDTV 262
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
S DFK ES +CF+ IK +W L G GLLEL+KTF C+ + S + L DWL +A++
Sbjct: 263 SQDFKSESLNCFSVIKAAWDVLDDRGSNHTGLLELSKTFRACKTVKSADSLGDWLSTAFT 322
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQ 365
Y AMVDYP P++FMM LP YP++E+CK ID+ P I+++ F S+YYNYTG+ CFQ
Sbjct: 323 YTAMVDYPTPANFMMNLPAYPVKEMCKIIDSFPTGADIIDKAFAAASLYYNYTGDQKCFQ 382
Query: 366 L--DDDPHGLDGWNWQV 380
+ DDDPHGLDGW+WQ
Sbjct: 383 VEGDDDPHGLDGWDWQA 399
>gi|326510399|dbj|BAJ87416.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 524
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 191/361 (52%), Positives = 246/361 (68%), Gaps = 21/361 (5%)
Query: 39 RRAPRFVGKLPHLTEPPQRQQRQQQQQYR---------------YETRYFEQRLDHFSFA 83
RR P P +QQ+ +Y + YF+Q LDHF+F
Sbjct: 30 RRPTHHTPPFPGAGLAPLQQQQHSNGRYAAAAAAVTASAAAAKPFTAHYFQQELDHFTFT 89
Query: 84 DLPT--FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVF 141
+ FSQ+YL+N W GP+F+Y GNEGDIEWFA N+GF++DIAP FGA+LVF
Sbjct: 90 PNASNLFSQKYLLNDTFWRRKPAAGPLFVYTGNEGDIEWFATNTGFLFDIAPDFGALLVF 149
Query: 142 PEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGG 201
EHR+YGES P+G+ +Y++A TL YLT+ QALADFAV IT+LK NLS +PVV+FGG
Sbjct: 150 IEHRFYGESKPFGND--SYKSADTLGYLTSTQALADFAVLITSLKHNLSTVDAPVVVFGG 207
Query: 202 SYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIK 261
SYGGMLA+W RLKYPH+A+GA+ASSAPILQF+DI P +FY+ VS DFK ES +CF+ IK
Sbjct: 208 SYGGMLASWFRLKYPHVAMGAVASSAPILQFDDITPWSSFYDTVSQDFKSESLNCFSVIK 267
Query: 262 ESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMP 321
+W L G GLLEL+KTF C+ + S + L DWL +A++Y AMVDYP P++FMM
Sbjct: 268 AAWDVLDDRGSNHTGLLELSKTFRACKTVKSADSLGDWLSTAFTYTAMVDYPTPANFMMN 327
Query: 322 LPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQL--DDDPHGLDGWNWQ 379
LP YP++E+CK ID+ P I+++ F S+YYNYTG+ CFQ+ DDDPHGLDGW+WQ
Sbjct: 328 LPAYPVKEMCKIIDSFPTGADIIDKAFAAASLYYNYTGDQKCFQVEGDDDPHGLDGWDWQ 387
Query: 380 V 380
Sbjct: 388 A 388
>gi|326499536|dbj|BAJ86079.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 526
Score = 389 bits (1000), Expect = e-105, Method: Compositional matrix adjust.
Identities = 184/317 (58%), Positives = 232/317 (73%), Gaps = 6/317 (1%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ YF+Q LDHF+F + FSQ+YL+N W GP+F+Y GNEGDIEWFA N+
Sbjct: 76 FTAHYFQQELDHFTFTPNASNLFSQKYLLNDTFWRRKPAAGPLFVYTGNEGDIEWFATNT 135
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF++DIAP FGA+LVF EHR+YGES P+G+ +Y++A TL YLT+ QALADFAV IT+L
Sbjct: 136 GFMFDIAPDFGALLVFIEHRFYGESKPFGND--SYKSADTLGYLTSTQALADFAVLITSL 193
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
KQNLSA +PVV+FGGSYGGMLA+W RLKYPH+AIGALASSAPILQFEDI P +FY V
Sbjct: 194 KQNLSAVDAPVVVFGGSYGGMLASWFRLKYPHVAIGALASSAPILQFEDITPWSSFYEAV 253
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
S DFK ES +CF+ IK W L G + GLLEL+KTF C+ + L DWL +A++
Sbjct: 254 SEDFKSESLNCFSVIKAVWDVLTVRGSNDTGLLELSKTFRACKTVLLPNSLLDWLSTAFT 313
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQ 365
Y AMVDYP P++FM LP YP++E+CK ID+ P ++E+ F S+YYNYTG+ CF+
Sbjct: 314 YTAMVDYPTPANFMQNLPAYPVKEMCKIIDSFPAGADVVEKAFAAASLYYNYTGDQKCFE 373
Query: 366 LD--DDPHGLDGWNWQV 380
++ DDPHGL GW WQ
Sbjct: 374 VEGGDDPHGLSGWGWQA 390
>gi|255565027|ref|XP_002523506.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537213|gb|EEF38845.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
Length = 501
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 185/328 (56%), Positives = 242/328 (73%), Gaps = 9/328 (2%)
Query: 57 RQQRQQQQQYRYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGN 114
+Q + + + Y+TRYF Q LDHF+F F Q+YLI++ +W ++ PIF+Y GN
Sbjct: 39 KQPKVTKPKIPYKTRYFPQLLDHFTFQPNGYKIFYQKYLISSQYW---HKEAPIFVYTGN 95
Query: 115 EGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA 174
EGDIEWFA N+GF+ DIAP+F A+LVF EHR+YGESMP+G +Y++A TL YL ++QA
Sbjct: 96 EGDIEWFAANTGFLLDIAPKFRALLVFIEHRFYGESMPFGKD--SYKSAETLGYLNSQQA 153
Query: 175 LADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
LADFAV I +LKQNLS+EASPVV+FGGSYGGMLA W RLKYPHIAIGALASSAPILQF+D
Sbjct: 154 LADFAVLIRSLKQNLSSEASPVVVFGGSYGGMLATWFRLKYPHIAIGALASSAPILQFDD 213
Query: 235 IVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
I P +FY+ VS DF+ S +C+ IK SW EL ++ ++ GL+EL++TF C++L+S +
Sbjct: 214 ITPWSSFYDAVSQDFREASLNCYEVIKGSWAELETLSSQKEGLIELSRTFRTCKDLHSLD 273
Query: 295 DLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVY 354
+ DWL SAY Y AMV+YP ++FM PLP YP++E+CK ID P S + R+F S+Y
Sbjct: 274 SVWDWLWSAYVYSAMVNYPTEANFMKPLPAYPVKEMCKIIDGFPAGASKVSRVFAAASLY 333
Query: 355 YNYTGNVDCFQLDD--DPHGLDGWNWQV 380
YNY+ CFQL++ D HGL GWNWQ
Sbjct: 334 YNYSRGEKCFQLENVPDAHGLHGWNWQA 361
>gi|297821719|ref|XP_002878742.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297324581|gb|EFH55001.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 495
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 184/340 (54%), Positives = 241/340 (70%), Gaps = 10/340 (2%)
Query: 46 GKLPHLTE-PPQRQQRQQQQQYRYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGP 102
G HL+ Q++ + +Q+ +ETRYF Q LDHF F F Q+YLIN+ W
Sbjct: 23 GGFHHLSSLRQQKKASKSKQELPFETRYFPQNLDHFGFTPESYTVFHQKYLINSRFW--- 79
Query: 103 NRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQN 162
+ GPIF+Y GNEGDI+WFA N+GF+ DIAP+F A+LVF EHR+YGES P+G ++++
Sbjct: 80 RKGGPIFVYTGNEGDIDWFASNTGFMSDIAPKFQALLVFIEHRFYGESTPFGKK--SHKS 137
Query: 163 ATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGA 222
A TL YL+++QALAD+A+ I +LKQNLS+EASPVV+FGGSYGGMLAAW RLKYPHI IGA
Sbjct: 138 AETLGYLSSQQALADYAILIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGA 197
Query: 223 LASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK 282
LASSAPIL F++IVP +FY+ +S DFK S +CF IK SW EL +V +NGL EL+K
Sbjct: 198 LASSAPILHFDNIVPLTSFYDAISQDFKDASINCFTVIKRSWEELEAVSTMKNGLQELSK 257
Query: 283 TFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS 342
F C+ L S DWL A+ Y AMV+YP ++FM PLPGYP+ ++CK ID P +S
Sbjct: 258 KFRTCKGLQSKYSARDWLSGAFVYTAMVNYPTAANFMAPLPGYPVEQMCKIIDGFPRGSS 317
Query: 343 ILERIFEGVSVYYNYTGNVDCFQLDD--DPHGLDGWNWQV 380
L+R F S+YYNY+G+ CF+++ D HGL+GW +Q
Sbjct: 318 NLDRAFAAASLYYNYSGSEKCFEMEQQTDDHGLNGWQYQA 357
>gi|118487801|gb|ABK95724.1| unknown [Populus trichocarpa]
Length = 500
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 182/328 (55%), Positives = 239/328 (72%), Gaps = 9/328 (2%)
Query: 57 RQQRQQQQQYRYETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGN 114
+Q + + + Y+T YF Q LDHF+F F Q+YL+N+ +W +R PIF+Y GN
Sbjct: 39 KQPKASKPKIPYKTHYFPQVLDHFTFQPKSSKIFYQKYLVNSHYW---HRGAPIFVYTGN 95
Query: 115 EGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA 174
EGDIEWFA N+GF+ DIAP+F A+LVF EHR+YGESMP+G+ +Y++A TL YL ++QA
Sbjct: 96 EGDIEWFAANTGFLLDIAPKFRALLVFIEHRFYGESMPFGNK--SYKSAETLGYLNSQQA 153
Query: 175 LADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
LADFA+ I +LK NLS+EASPVV+FGGSYGGMLAAW RLKYPHIAIGALASSAPILQF+D
Sbjct: 154 LADFALLIRSLKHNLSSEASPVVVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFDD 213
Query: 235 IVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
I P +FY+ VS DFK S +C+ IK SW EL ++ + GL EL++TF C++L+S +
Sbjct: 214 ITPWSSFYDAVSQDFKEASLNCYEVIKGSWAELEALSAQNEGLAELSRTFRACQDLHSLD 273
Query: 295 DLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVY 354
+ +WL SA+ Y AMV+YP ++FMMPLP YP++ +CK ID P S + R+F S+Y
Sbjct: 274 SVWEWLWSAFVYTAMVNYPTEANFMMPLPAYPVQAMCKIIDGFPSGASKITRVFAAASLY 333
Query: 355 YNYTGNVDCFQLDDDP--HGLDGWNWQV 380
YNY+ CF+L+ P HGL GWNWQ
Sbjct: 334 YNYSRAEKCFKLEHGPDAHGLHGWNWQA 361
>gi|224142419|ref|XP_002324555.1| predicted protein [Populus trichocarpa]
gi|222865989|gb|EEF03120.1| predicted protein [Populus trichocarpa]
Length = 459
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 182/328 (55%), Positives = 239/328 (72%), Gaps = 9/328 (2%)
Query: 57 RQQRQQQQQYRYETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGN 114
+Q + + + Y+T YF Q LDHF+F F Q+YL+N+ +W +R PIF+Y GN
Sbjct: 3 KQPKASKPKIPYKTHYFPQVLDHFTFQPKSSKIFYQKYLVNSHYW---HRGAPIFVYTGN 59
Query: 115 EGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA 174
EGDIEWFA N+GF+ DIAP+F A+LVF EHR+YGESMP+G+ +Y++A TL YL ++QA
Sbjct: 60 EGDIEWFAANTGFLLDIAPKFRALLVFIEHRFYGESMPFGNK--SYKSAETLGYLNSQQA 117
Query: 175 LADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
LADFA+ I +LK NLS+EASPVV+FGGSYGGMLAAW RLKYPHIAIGALASSAPILQF+D
Sbjct: 118 LADFALLIRSLKHNLSSEASPVVVFGGSYGGMLAAWFRLKYPHIAIGALASSAPILQFDD 177
Query: 235 IVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
I P +FY+ VS DFK S +C+ IK SW EL ++ + GL EL++TF C++L+S +
Sbjct: 178 ITPWSSFYDAVSQDFKEASLNCYEVIKGSWAELEALSAQNEGLAELSRTFRACQDLHSLD 237
Query: 295 DLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVY 354
+ +WL SA+ Y AMV+YP ++FMMPLP YP++ +CK ID P S + R+F S+Y
Sbjct: 238 SVWEWLWSAFVYTAMVNYPTEANFMMPLPAYPVQAMCKIIDGFPSGASKITRVFAAASLY 297
Query: 355 YNYTGNVDCFQLDDDP--HGLDGWNWQV 380
YNY+ CF+L+ P HGL GWNWQ
Sbjct: 298 YNYSRAEKCFKLEHGPDAHGLHGWNWQA 325
>gi|359484787|ref|XP_003633162.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Vitis vinifera]
gi|297735899|emb|CBI18675.3| unnamed protein product [Vitis vinifera]
Length = 502
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 183/315 (58%), Positives = 230/315 (73%), Gaps = 9/315 (2%)
Query: 70 TRYFEQRLDHFSFADLPT--FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
T+YF Q LDHF+F + F Q+YLINT +W PIF+Y GNEGDI+WFA N+GF
Sbjct: 58 TQYFPQLLDHFTFTPKSSTIFYQKYLINTQYWT---HGAPIFVYTGNEGDIDWFASNTGF 114
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+ DIAP F AMLVF EHR+YGESMP+G + +Y++ TL YL ++QALADFAV I +LKQ
Sbjct: 115 LLDIAPSFRAMLVFIEHRFYGESMPFG--KESYKSPETLGYLNSQQALADFAVLIRSLKQ 172
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
NLS+EASPVV+FGGSYGGMLAAW RLKYPH+AIGALASSAPILQF+DI P +FY+ VS
Sbjct: 173 NLSSEASPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILQFDDITPWSSFYDAVSQ 232
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYL 307
DFK S +C+ IK SW EL ++ KE GL E+++TF C+++NS DWL SA+ Y
Sbjct: 233 DFKEASLNCYEVIKGSWAELDAMSAKEGGLAEVSRTFRTCKDINSVYSARDWLWSAFVYT 292
Query: 308 AMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLD 367
AMV+YP ++FMMPLP YP+ E+CK ID P + + R F S+YYNY+G CF L+
Sbjct: 293 AMVNYPTKANFMMPLPAYPVEEMCKIIDRFPHGATNVSRAFAAASLYYNYSGTEKCFDLE 352
Query: 368 D--DPHGLDGWNWQV 380
+ D HGL GW+WQ
Sbjct: 353 NGKDAHGLHGWDWQA 367
>gi|20197495|gb|AAM15096.1| putative prolylcarboxypeptidase [Arabidopsis thaliana]
Length = 476
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 184/340 (54%), Positives = 240/340 (70%), Gaps = 10/340 (2%)
Query: 46 GKLPHLTEPP-QRQQRQQQQQYRYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGP 102
G HL+ +++ + + + +ETRYF Q LDHFSF F Q+YLIN W
Sbjct: 23 GGFHHLSSLRLKKKVSKSKHELPFETRYFPQNLDHFSFTPDSYKVFHQKYLINNRFW--- 79
Query: 103 NRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQN 162
+ GPIF+Y GNEGDI+WFA N+GF+ DIAP+F A+LVF EHR+YGES P+G ++++
Sbjct: 80 RKGGPIFVYTGNEGDIDWFASNTGFMLDIAPKFRALLVFIEHRFYGESTPFGKK--SHKS 137
Query: 163 ATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGA 222
A TL YL ++QALAD+A+ I +LKQNLS+EASPVV+FGGSYGGMLAAW RLKYPHI IGA
Sbjct: 138 AETLGYLNSQQALADYAILIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGA 197
Query: 223 LASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK 282
LASSAPIL F++IVP +FY+ +S DFK S +CF IK SW EL +V +NGL EL+K
Sbjct: 198 LASSAPILHFDNIVPLTSFYDAISQDFKDASINCFKVIKRSWEELEAVSTMKNGLQELSK 257
Query: 283 TFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS 342
F C+ L+S DWL A+ Y AMV+YP ++FM PLPGYP+ ++CK ID P +S
Sbjct: 258 KFRTCKGLHSQYSARDWLSGAFVYTAMVNYPTAANFMAPLPGYPVEQMCKIIDGFPRGSS 317
Query: 343 ILERIFEGVSVYYNYTGNVDCFQLDD--DPHGLDGWNWQV 380
L+R F S+YYNY+G+ CF+++ D HGLDGW +Q
Sbjct: 318 NLDRAFAAASLYYNYSGSEKCFEMEQQTDDHGLDGWQYQA 357
>gi|30682358|ref|NP_850050.1| alpha/beta-hydrolase domain-containing protein [Arabidopsis
thaliana]
gi|16648801|gb|AAL25591.1| At2g24280/F27D4.19 [Arabidopsis thaliana]
gi|22655366|gb|AAM98275.1| At2g24280/F27D4.19 [Arabidopsis thaliana]
gi|330252462|gb|AEC07556.1| alpha/beta-hydrolase domain-containing protein [Arabidopsis
thaliana]
Length = 494
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 184/340 (54%), Positives = 240/340 (70%), Gaps = 10/340 (2%)
Query: 46 GKLPHLTEPP-QRQQRQQQQQYRYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGP 102
G HL+ +++ + + + +ETRYF Q LDHFSF F Q+YLIN W
Sbjct: 23 GGFHHLSSLRLKKKVSKSKHELPFETRYFPQNLDHFSFTPDSYKVFHQKYLINNRFW--- 79
Query: 103 NRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQN 162
+ GPIF+Y GNEGDI+WFA N+GF+ DIAP+F A+LVF EHR+YGES P+G ++++
Sbjct: 80 RKGGPIFVYTGNEGDIDWFASNTGFMLDIAPKFRALLVFIEHRFYGESTPFGKK--SHKS 137
Query: 163 ATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGA 222
A TL YL ++QALAD+A+ I +LKQNLS+EASPVV+FGGSYGGMLAAW RLKYPHI IGA
Sbjct: 138 AETLGYLNSQQALADYAILIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGA 197
Query: 223 LASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK 282
LASSAPIL F++IVP +FY+ +S DFK S +CF IK SW EL +V +NGL EL+K
Sbjct: 198 LASSAPILHFDNIVPLTSFYDAISQDFKDASINCFKVIKRSWEELEAVSTMKNGLQELSK 257
Query: 283 TFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS 342
F C+ L+S DWL A+ Y AMV+YP ++FM PLPGYP+ ++CK ID P +S
Sbjct: 258 KFRTCKGLHSQYSARDWLSGAFVYTAMVNYPTAANFMAPLPGYPVEQMCKIIDGFPRGSS 317
Query: 343 ILERIFEGVSVYYNYTGNVDCFQLDD--DPHGLDGWNWQV 380
L+R F S+YYNY+G+ CF+++ D HGLDGW +Q
Sbjct: 318 NLDRAFAAASLYYNYSGSEKCFEMEQQTDDHGLDGWQYQA 357
>gi|312282209|dbj|BAJ33970.1| unnamed protein product [Thellungiella halophila]
Length = 494
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/340 (54%), Positives = 241/340 (70%), Gaps = 10/340 (2%)
Query: 46 GKLPHLTEPPQ-RQQRQQQQQYRYETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGP 102
G H + Q ++ + + + +ET YF Q LDHFSF F Q+YLIN+ W
Sbjct: 23 GGFHHFSSLRQNKKASKSKSELPFETLYFPQNLDHFSFRPESYKVFHQKYLINSRFW--- 79
Query: 103 NRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQN 162
+ GPIF+Y GNEGDI+WFA N+GF+ DIAP+FGA+LVF EHR+YGES P+G ++++
Sbjct: 80 RKGGPIFVYTGNEGDIDWFASNTGFMSDIAPKFGALLVFIEHRFYGESTPFGKK--SHKS 137
Query: 163 ATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGA 222
A TL YL ++QALAD+A+ I +LKQNLS+EASPVV+FGGSYGGMLAAW RLKYPHIAIGA
Sbjct: 138 AETLGYLNSQQALADYAILIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHIAIGA 197
Query: 223 LASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK 282
LASSAPIL F++IVP +FY+ +S DFK S +CF IK SW EL +V ++GL EL+K
Sbjct: 198 LASSAPILHFDNIVPLTSFYDAISQDFKDASVNCFEVIKRSWQELEAVSNMKHGLPELSK 257
Query: 283 TFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS 342
F C+ L+S DWL SA+ Y AMV+Y ++FM PLPGYP+ ++CK ID P +S
Sbjct: 258 KFRTCKGLHSQYAARDWLMSAFIYTAMVNYATAANFMAPLPGYPVEQMCKIIDGFPRGSS 317
Query: 343 ILERIFEGVSVYYNYTGNVDCFQLDD--DPHGLDGWNWQV 380
L+R F S+YYNY+G+ CF+L+ D HGLDGW WQ
Sbjct: 318 NLDRAFAAASLYYNYSGSEKCFELEQPTDDHGLDGWGWQA 357
>gi|225440787|ref|XP_002281618.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Vitis vinifera]
Length = 507
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 181/347 (52%), Positives = 233/347 (67%), Gaps = 10/347 (2%)
Query: 39 RRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA--DLPTFSQRYLINT 96
+ PRF + P Q Q + YE +YF Q LDHF++ TF QRYLIN
Sbjct: 27 KITPRFPSSI---VRPEQLSVSSQTE--LYEAKYFTQLLDHFNYQPQSYRTFQQRYLIND 81
Query: 97 DHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGS- 155
+W G ++L PIF+Y GNEGDIEWFA N+GF++D AP F A+LVF EHR+YG+S+P+G
Sbjct: 82 KYWGGADKLAPIFVYTGNEGDIEWFAQNTGFMFDTAPHFQALLVFIEHRFYGKSIPFGGD 141
Query: 156 TEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKY 215
T VAY NA+TL YL++ QALAD+A I +LK+NLSA SPVV+FGGSYGGMLAAW RLKY
Sbjct: 142 TNVAYSNASTLGYLSSTQALADYATLIIDLKKNLSATNSPVVVFGGSYGGMLAAWFRLKY 201
Query: 216 PHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKEN 275
PH+AIGALASSAPIL FE+I P TF NI++ DF+ ES +C+ IK SW ++ K
Sbjct: 202 PHVAIGALASSAPILNFENITSPYTFNNIITQDFRSESENCYKVIKGSWEQIEETAMKNG 261
Query: 276 GLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID 335
GL L K+F +C+ S + +WL +AY Y AM DYP PS+F+ P+P YP++++CK ID
Sbjct: 262 GLEVLRKSFRICKNYISGGAIENWLSTAYVYTAMTDYPTPSNFLSPMPAYPVKQMCKAID 321
Query: 336 NAPDATSILERIFEGVSVYYNYTGNVDCFQL--DDDPHGLDGWNWQV 380
+ +++ +VYYNYTG CF L D DPHGL W WQ
Sbjct: 322 DPAQGNDTFAKLYGAANVYYNYTGTAACFDLADDSDPHGLGEWTWQA 368
>gi|357123237|ref|XP_003563318.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Brachypodium
distachyon]
Length = 536
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 185/364 (50%), Positives = 241/364 (66%), Gaps = 19/364 (5%)
Query: 29 LSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPT- 87
L A+ S R A V P+++ + + YF Q LDHF+F +
Sbjct: 42 LRAASSSSGGRYAASAVASGGKAAPEPEKKTKNP-----FRAHYFPQELDHFTFTPNASR 96
Query: 88 -FSQRYLINTDHWVGPNRLG--------PIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAM 138
F Q+YL+N W P G P+F+Y GNEGDIEWFA NSGF++DIAP+F A+
Sbjct: 97 IFYQKYLVNDTFWRKPTGKGRGAAAAPGPVFVYTGNEGDIEWFATNSGFLFDIAPKFSAL 156
Query: 139 LVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVL 198
LVF EHR+YGES P+G+ +Y +A TL YLT+ QALADFAV IT+LK NLSA +PVV+
Sbjct: 157 LVFIEHRFYGESKPFGNE--SYGSAATLGYLTSTQALADFAVLITSLKHNLSAPVAPVVV 214
Query: 199 FGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFN 258
FGGSYGGMLA+W RLKYPH+A+GALASSAPILQF+DI P +FY+ S DFK ES +CF+
Sbjct: 215 FGGSYGGMLASWFRLKYPHVAMGALASSAPILQFDDITPWSSFYDAASQDFKSESKNCFS 274
Query: 259 TIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDF 318
IK W L G + GLL+L+KTF C+ + + L++WL +A+ Y AMVDYP P++F
Sbjct: 275 VIKAVWDVLDERGSNDKGLLQLSKTFRACKTVKYVDSLSNWLWTAFVYTAMVDYPTPANF 334
Query: 319 MMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLD--DDPHGLDGW 376
+M LP YP++E+CK ID P I+++ F S+YYNYTG+ CF ++ DDPHGL GW
Sbjct: 335 LMNLPAYPVKEMCKIIDAFPPGADIVDKAFSAASLYYNYTGDQKCFDVEGGDDPHGLSGW 394
Query: 377 NWQV 380
WQ
Sbjct: 395 GWQA 398
>gi|224088466|ref|XP_002308456.1| predicted protein [Populus trichocarpa]
gi|222854432|gb|EEE91979.1| predicted protein [Populus trichocarpa]
Length = 461
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 176/318 (55%), Positives = 224/318 (70%), Gaps = 5/318 (1%)
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y ++F Q LDH++F TF QRYLIN +W G + PIFLY GNEGDIEWFA N+
Sbjct: 49 YHEKFFTQVLDHYTFRPQSYKTFQQRYLINDKYWGGAEKNAPIFLYTGNEGDIEWFAQNT 108
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYG-STEVAYQNATTLSYLTAEQALADFAVFITN 184
GF++DIAP F +LVF EHR+YG+SMP+G + EVAY N++TL YLT+ QALAD+A I +
Sbjct: 109 GFIFDIAPHFKPLLVFIEHRFYGKSMPFGGNKEVAYSNSSTLGYLTSTQALADYATLIID 168
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
LK+NLSA SPVV+FGGSYGGMLAAW RLKYPH+AIGALASS+PIL FE+I P +F NI
Sbjct: 169 LKKNLSATDSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSSPILNFENITSPYSFNNI 228
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY 304
++ DF+ ES +C+ IK SW E+ + GL L +F +CR S L WL +A
Sbjct: 229 ITQDFRGESENCYKVIKRSWQEIEDTASQPGGLEILRSSFRICRNSISASSLQSWLYTAL 288
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF 364
Y AM DYP PS+F+ P+P YP++E+CK ID+ + +++ SVYYNY+GN CF
Sbjct: 289 VYTAMTDYPTPSNFLNPMPAYPVKEMCKAIDDPKTGNNTFAKLYGAASVYYNYSGNATCF 348
Query: 365 QLDD--DPHGLDGWNWQV 380
LDD DPHGL GW+WQ
Sbjct: 349 NLDDDSDPHGLGGWSWQA 366
>gi|297740163|emb|CBI30345.3| unnamed protein product [Vitis vinifera]
Length = 485
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 177/329 (53%), Positives = 226/329 (68%), Gaps = 5/329 (1%)
Query: 57 RQQRQQQQQYRYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGN 114
Q Q YE +YF Q LDHF++ TF QRYLIN +W G ++L PIF+Y GN
Sbjct: 18 EQLSVSSQTELYEAKYFTQLLDHFNYQPQSYRTFQQRYLINDKYWGGADKLAPIFVYTGN 77
Query: 115 EGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGS-TEVAYQNATTLSYLTAEQ 173
EGDIEWFA N+GF++D AP F A+LVF EHR+YG+S+P+G T VAY NA+TL YL++ Q
Sbjct: 78 EGDIEWFAQNTGFMFDTAPHFQALLVFIEHRFYGKSIPFGGDTNVAYSNASTLGYLSSTQ 137
Query: 174 ALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
ALAD+A I +LK+NLSA SPVV+FGGSYGGMLAAW RLKYPH+AIGALASSAPIL FE
Sbjct: 138 ALADYATLIIDLKKNLSATNSPVVVFGGSYGGMLAAWFRLKYPHVAIGALASSAPILNFE 197
Query: 234 DIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNST 293
+I P TF NI++ DF+ ES +C+ IK SW ++ K GL L K+F +C+ S
Sbjct: 198 NITSPYTFNNIITQDFRSESENCYKVIKGSWEQIEETAMKNGGLEVLRKSFRICKNYISG 257
Query: 294 EDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSV 353
+ +WL +AY Y AM DYP PS+F+ P+P YP++++CK ID+ +++ +V
Sbjct: 258 GAIENWLSTAYVYTAMTDYPTPSNFLSPMPAYPVKQMCKAIDDPAQGNDTFAKLYGAANV 317
Query: 354 YYNYTGNVDCFQL--DDDPHGLDGWNWQV 380
YYNYTG CF L D DPHGL W WQ
Sbjct: 318 YYNYTGTAACFDLADDSDPHGLGEWTWQA 346
>gi|255579351|ref|XP_002530520.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
gi|223529924|gb|EEF31852.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
Length = 508
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 175/344 (50%), Positives = 230/344 (66%), Gaps = 5/344 (1%)
Query: 42 PRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSF--ADLPTFSQRYLINTDHW 99
PRF + ++ Y+ ++F Q LDHF+F TF QRYLIN +W
Sbjct: 26 PRFSSSINTQPADQKKTSLSTTPNKLYKEKFFTQTLDHFNFNPKSYQTFQQRYLINDTYW 85
Query: 100 VGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYG-STEV 158
GP PIF+Y GNEG+IEWFA N+GF++D AP+F A+LVF EHR+YG+S+P+G + EV
Sbjct: 86 AGPKNNAPIFMYTGNEGEIEWFAQNTGFMFDNAPKFNALLVFVEHRFYGKSIPFGGNKEV 145
Query: 159 AYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHI 218
AY NA+TL YLT+ Q+LAD+A IT+LK NLSA SPVV+FGGSYGGMLAAW RLKYPH+
Sbjct: 146 AYSNASTLGYLTSTQSLADYATLITDLKNNLSATDSPVVVFGGSYGGMLAAWFRLKYPHV 205
Query: 219 AIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLL 278
IGALASSAPIL F +I P +F NI++ DF+ ES +C+ IK SW ++ + GL
Sbjct: 206 TIGALASSAPILGFVNITSPYSFNNIITHDFRSESENCYKVIKGSWQQIEDTANQHGGLE 265
Query: 279 ELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP 338
+L K+F +C+ S L +WL +A+ Y AM DYP PS+F+ PLP +P++++CK ID+
Sbjct: 266 KLRKSFKICKNYISAGSLENWLSTAWVYSAMTDYPTPSNFLNPLPAFPVKQMCKAIDDPT 325
Query: 339 DATSILERIFEGVSVYYNYTGNVDCFQLDD--DPHGLDGWNWQV 380
+ SVYYNY+G CF LDD DPHGL GW+WQ
Sbjct: 326 AGNDTFAKFHAAASVYYNYSGTATCFDLDDDSDPHGLGGWDWQA 369
>gi|242096520|ref|XP_002438750.1| hypothetical protein SORBIDRAFT_10g025520 [Sorghum bicolor]
gi|241916973|gb|EER90117.1| hypothetical protein SORBIDRAFT_10g025520 [Sorghum bicolor]
Length = 558
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 177/322 (54%), Positives = 232/322 (72%), Gaps = 11/322 (3%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHWVGP-----NRLGPIFLYCGNEGDIEW 120
+ YF Q LDHF+F + F Q+YL+N W P + GP+F+Y GNEGDIEW
Sbjct: 101 FTVHYFAQELDHFTFTPNASMVFRQKYLLNDTFWRRPSAGDGDGAGPLFVYTGNEGDIEW 160
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
FA N+GF++DIAP+FGA+LVF EHR+YGES P+G+ +Y++A TL YLT+ QALADFA+
Sbjct: 161 FATNTGFMFDIAPKFGALLVFIEHRFYGESKPFGND--SYKSAETLGYLTSTQALADFAI 218
Query: 181 FITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET 240
I +LK+NLSAEA+PVV+FGGSYGGMLA+W RLKYPH+ IGA+ASSAPILQF+ I P +
Sbjct: 219 LIRSLKKNLSAEAAPVVVFGGSYGGMLASWFRLKYPHVTIGAVASSAPILQFDYITPWSS 278
Query: 241 FYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL 300
FY+ VS DFK ES +CF+ IK +W L G + GLL+L+K F C+ + + +WL
Sbjct: 279 FYDGVSQDFKSESLNCFSVIKGTWDVLDERGSTDKGLLDLSKLFRACKTVKYAYSIRNWL 338
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGN 360
+A+SY AMVDYP P++F+ LP YP++E+CK ID P ILE+ F S+YYNYTG+
Sbjct: 339 WTAFSYTAMVDYPTPANFLENLPAYPVKEMCKIIDGFPTGADILEKAFAAASLYYNYTGD 398
Query: 361 VDCFQLD--DDPHGLDGWNWQV 380
C +++ DDPHGLDGW WQ
Sbjct: 399 QTCNKIEDGDDPHGLDGWQWQA 420
>gi|297802936|ref|XP_002869352.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297315188|gb|EFH45611.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 497
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 181/334 (54%), Positives = 237/334 (70%), Gaps = 14/334 (4%)
Query: 51 LTEPPQRQQRQQQQQYRYETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPI 108
L + + + + + + ++TRYF Q LDHFSF F Q+YLI++ W + GPI
Sbjct: 34 LRQNSKTSKSKAELPFHFQTRYFPQNLDHFSFQPESYRIFHQKYLISSHFW---RKGGPI 90
Query: 109 FLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSY 168
F+Y GNEGDIEWFA N+GF+ DIAP+F A+LVF EHR+YGES P+ A TL Y
Sbjct: 91 FVYTGNEGDIEWFASNTGFMLDIAPKFQALLVFIEHRFYGESKPHNL-------AKTLGY 143
Query: 169 LTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAP 228
L ++QALAD+A+ I +LKQNLS+EASPVV+FGGSYGGMLAAW RLKYPHI IGALASSAP
Sbjct: 144 LNSQQALADYAILIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAP 203
Query: 229 ILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR 288
ILQF+ IVP +FYN+VS DFK S +CF IK+SW EL ++GL EL+K F C+
Sbjct: 204 ILQFDKIVPSSSFYNVVSQDFKDASLNCFEVIKKSWRELEVFSTMKDGLQELSKKFRTCK 263
Query: 289 ELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIF 348
+L++ + WLE+A++ AMV+YP P++FM PLP YP+ E+CK ID P S L+R F
Sbjct: 264 DLHAVYLASRWLETAFTDTAMVNYPTPANFMAPLPAYPVEEMCKIIDWFPLEASNLDRAF 323
Query: 349 EGVSVYYNYTGNVDCFQLDD--DPHGLDGWNWQV 380
S+YYNY+G+ +CF +++ DPHGL+GW WQ
Sbjct: 324 AAASLYYNYSGSENCFDIENQTDPHGLNGWYWQA 357
>gi|449443023|ref|XP_004139280.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
gi|449493651|ref|XP_004159394.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 499
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 182/328 (55%), Positives = 232/328 (70%), Gaps = 10/328 (3%)
Query: 57 RQQRQQQQQYRYETRYFEQRLDHFSFADLPT--FSQRYLINTDHWVGPNRLGPIFLYCGN 114
+Q + + +ETR++ Q LDHF+F + F Q+YLIN ++W PIF+Y GN
Sbjct: 38 KQATSLKPKIHFETRFYPQLLDHFTFTPKSSKIFYQKYLINEEYW---RNGAPIFVYTGN 94
Query: 115 EGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA 174
EGDIEWFA N+GF+ DIAP F A+LVF EHR+YGES P+G+ +Y +A TL YLT++QA
Sbjct: 95 EGDIEWFAANTGFLPDIAPEFHALLVFIEHRFYGESTPFGND--SYNSAETLGYLTSQQA 152
Query: 175 LADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
LAD+AV I +LKQNLS+EASPVV+FGGSYGGMLAAW RLKYPHI IGALASSAPIL F++
Sbjct: 153 LADYAVLIRSLKQNLSSEASPVVVFGGSYGGMLAAWFRLKYPHITIGALASSAPILHFDN 212
Query: 235 IVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
IVP +FY+ VS DFK S +CF IK SW EL +E GL EL+KTF C+ L+S
Sbjct: 213 IVPWSSFYDAVSQDFKDASLNCFEVIKGSWTELQQEFSEE-GLAELSKTFRTCKNLHSVS 271
Query: 295 DLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVY 354
+ DWL SA+ Y +MV+YP ++FM PLP YP++E+CK ID TS L + F S+Y
Sbjct: 272 SVQDWLWSAFVYTSMVNYPTEANFMRPLPAYPVQEMCKIIDAFAPETSKLNKAFAAASLY 331
Query: 355 YNYTGNVDCFQLDDDP--HGLDGWNWQV 380
YNY+ CF +++ P HGL GWNWQ
Sbjct: 332 YNYSHGEKCFNVENGPDLHGLSGWNWQA 359
>gi|218198653|gb|EEC81080.1| hypothetical protein OsI_23902 [Oryza sativa Indica Group]
Length = 539
Score = 364 bits (934), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 176/322 (54%), Positives = 225/322 (69%), Gaps = 11/322 (3%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHW-----VGPNRLGPIFLYCGNEGDIEW 120
+ YF Q LDHF+F + F Q+YL+N W G GPIF+Y GNEGDIEW
Sbjct: 82 FTAHYFPQELDHFTFTPNASALFYQKYLVNDTFWRRSAAAGETPAGPIFVYTGNEGDIEW 141
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
FA N+GF++ IAP FGA+LVF EHR+YGES P+G+ + + L YLT+ QALADFAV
Sbjct: 142 FATNTGFMFHIAPSFGALLVFIEHRFYGESKPFGNE--SNSSPEKLGYLTSTQALADFAV 199
Query: 181 FITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET 240
IT+LK NLSA +SPVV+FGGSYGGMLA+W RLKYPH+ IGA+ASSAPILQF+ I P +
Sbjct: 200 LITSLKHNLSAVSSPVVVFGGSYGGMLASWFRLKYPHVTIGAVASSAPILQFDYITPWSS 259
Query: 241 FYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL 300
FY+ VS D+K ES +CF+ IK +W + G + GLL+L+KTF C+ + S +WL
Sbjct: 260 FYDAVSQDYKSESFNCFSVIKAAWDLIDEKGSTDAGLLQLSKTFRACKTVKSVYSFRNWL 319
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGN 360
+A+ Y AMVDYP P++F+M LP YPI+E+CK ID P I+++ F S+YYNYTG+
Sbjct: 320 WTAFVYTAMVDYPTPANFLMNLPAYPIKEMCKIIDGFPAGADIVDKAFAAASLYYNYTGD 379
Query: 361 VDCFQLDD--DPHGLDGWNWQV 380
CFQL+D DPHGL GW WQ
Sbjct: 380 QTCFQLEDGEDPHGLSGWGWQA 401
>gi|115469214|ref|NP_001058206.1| Os06g0647400 [Oryza sativa Japonica Group]
gi|51535425|dbj|BAD37324.1| putative prolylcarboxypeptidase isoform 1 [Oryza sativa Japonica
Group]
gi|113596246|dbj|BAF20120.1| Os06g0647400 [Oryza sativa Japonica Group]
gi|215737061|dbj|BAG95990.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 539
Score = 363 bits (933), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 176/322 (54%), Positives = 224/322 (69%), Gaps = 11/322 (3%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHW-----VGPNRLGPIFLYCGNEGDIEW 120
+ YF Q LDHF+F + F Q+YL+N W G GPIF+Y GNEGDIEW
Sbjct: 82 FTAHYFPQELDHFTFTPNASAVFYQKYLVNDTFWRRSAAAGETPAGPIFVYTGNEGDIEW 141
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
FA N+GF++DIAP FGA+LVF EHR+YGES P+G+ + + L YLT+ QALADFAV
Sbjct: 142 FATNTGFMFDIAPSFGALLVFIEHRFYGESKPFGNE--SNSSPEKLGYLTSTQALADFAV 199
Query: 181 FITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET 240
IT+LK NLSA +SPVV+FGGSYGGMLA+W RLKYPH+ IGA+ASSAPILQF+ I P +
Sbjct: 200 LITSLKHNLSAVSSPVVVFGGSYGGMLASWFRLKYPHVTIGAVASSAPILQFDYITPWSS 259
Query: 241 FYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL 300
FY VS D+K ES +CF+ IK +W + G + GLL+L+KTF C+ + S +WL
Sbjct: 260 FYEAVSQDYKSESFNCFSVIKAAWDLIDERGSTDAGLLQLSKTFRACKTVKSVYSFRNWL 319
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGN 360
+A+ Y AMVDYP P++F+M LP YPI+E+CK I P I+++ F S+YYNYTG+
Sbjct: 320 WTAFVYTAMVDYPTPANFLMNLPAYPIKEMCKIIHGFPAGADIVDKAFAAASLYYNYTGD 379
Query: 361 VDCFQLDD--DPHGLDGWNWQV 380
CFQL+D DPHGL GW WQ
Sbjct: 380 QTCFQLEDGEDPHGLSGWGWQA 401
>gi|356570875|ref|XP_003553609.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 507
Score = 358 bits (920), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 179/363 (49%), Positives = 238/363 (65%), Gaps = 8/363 (2%)
Query: 25 ILSPLSLAAQPS-KFRRAPRFVGKLPHLTEPPQRQQRQQQQQYR-YETRYFEQRLDHFSF 82
IL+ SL + PS F AP + + P + +QR Q Y T++F Q LDHF+F
Sbjct: 8 ILTLFSLFSVPSLTFAFAP-ILPRFPSSAVSAELKQRSHSSQNGLYRTKFFTQILDHFNF 66
Query: 83 --ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLV 140
TF QRYLIN W G PIF+Y GNEG+IEWF N+GF+++ AP F A+LV
Sbjct: 67 NPQSNHTFQQRYLINDTFWGGAKNNAPIFVYTGNEGNIEWFTQNTGFMFENAPSFQALLV 126
Query: 141 FPEHRYYGESMPYGSTE-VAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLF 199
F EHR+YG+S+P+G + VAY N +TL YL++ QALAD+A I +LK+NLSA SPVV+F
Sbjct: 127 FIEHRFYGKSIPFGGNKTVAYANTSTLGYLSSTQALADYAALIIDLKKNLSATDSPVVVF 186
Query: 200 GGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNT 259
GGSYGGMLAAW R+KYPH+AIGALASSAPIL F +V P+ F +I++ DF+ ES +C+
Sbjct: 187 GGSYGGMLAAWFRMKYPHVAIGALASSAPILHFMGMVSPDIFISIITQDFRSESENCYKV 246
Query: 260 IKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFM 319
IK SW + K G+ L KTF +C + + L WL +A+ Y AM DYP PS+F+
Sbjct: 247 IKGSWDLIDDTANKPGGMELLRKTFRICNDDFGPDSLEGWLRAAWIYTAMTDYPTPSNFL 306
Query: 320 MPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDD--DPHGLDGWN 377
PLP YP++++C+ ID++ + L +++ +VYYNYTG CF LDD DPH L GW
Sbjct: 307 NPLPAYPVKKMCEAIDSSVTGNNRLAKLYAAANVYYNYTGKATCFDLDDNSDPHDLGGWQ 366
Query: 378 WQV 380
WQ
Sbjct: 367 WQA 369
>gi|147788546|emb|CAN61013.1| hypothetical protein VITISV_036738 [Vitis vinifera]
Length = 554
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 178/361 (49%), Positives = 232/361 (64%), Gaps = 17/361 (4%)
Query: 25 ILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA- 83
IL + L + S A + + P P++ Q + YE +YF Q LDHF++
Sbjct: 9 ILPCIFLFSISSSGFSAAKITPRFPSSIVRPEQLSVSSQTEL-YEAKYFTQILDHFNYQP 67
Query: 84 -DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFP 142
TF QRYLIN +W G ++L PIF+Y GNEGDIEWFA N+GF++D AP F A+LVF
Sbjct: 68 QSYRTFQQRYLINDKYWGGADKLAPIFVYTGNEGDIEWFAQNTGFMFDTAPHFQALLVFI 127
Query: 143 EHRYYGESMPYGS-TEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGG 201
EHR+YG+S+P+G T VAY NA+TL YL++ QALAD+A I +LK+NLSA SPVV+FGG
Sbjct: 128 EHRFYGKSIPFGGDTNVAYSNASTLGYLSSTQALADYATLIIDLKKNLSATNSPVVVFGG 187
Query: 202 SYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIK 261
SYGGMLAAW RLKYPH+AIGALASSAPIL FE+I P TF NI++ DF+
Sbjct: 188 SYGGMLAAWFRLKYPHVAIGALASSAPILNFENITSPYTFNNIITQDFQG---------- 237
Query: 262 ESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMP 321
SW ++ K GL L K+F +C+ S + +WL +AY Y AM DYP PS+F+ P
Sbjct: 238 -SWEQIEETAMKNGGLEVLRKSFRICKNYISGGAIENWLSTAYVYTAMTDYPTPSNFLSP 296
Query: 322 LPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQL--DDDPHGLDGWNWQ 379
+P YP++++CK ID+ +++ +VYYNYTG CF L D DPHGL W WQ
Sbjct: 297 MPAYPVKQMCKAIDDPAXGNDTFAKLYGAANVYYNYTGTAACFDLADDSDPHGLGEWTWQ 356
Query: 380 V 380
Sbjct: 357 A 357
>gi|356503833|ref|XP_003520707.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 508
Score = 357 bits (917), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 182/375 (48%), Positives = 235/375 (62%), Gaps = 23/375 (6%)
Query: 11 SLYLSPVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYET 70
SL+ +P +T + IL PRF E QR Q YR T
Sbjct: 14 SLFSAPSLTFAFVPIL---------------PRFPSSAVS-AELKQRSHSSQNGLYR--T 55
Query: 71 RYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
++F Q LDHF+F TF QRYLIN W G PIF+Y GNEG+IEWF N+GF+
Sbjct: 56 KFFTQILDHFNFNPQSNHTFQQRYLINDTFWGGAKNNAPIFVYTGNEGNIEWFTQNTGFM 115
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTE-VAYQNATTLSYLTAEQALADFAVFITNLKQ 187
++ AP F A+LVF EHR+YG+S+P+G + VAY N +TL YL++ QALAD+A I +LK+
Sbjct: 116 FENAPSFQALLVFIEHRFYGKSIPFGGNKTVAYANTSTLGYLSSTQALADYATLIIDLKK 175
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
NLSA SPVV+FGGSYGGMLAAW R+KYPH+AIGALASSAPIL F +V P+ F NI++
Sbjct: 176 NLSATDSPVVVFGGSYGGMLAAWFRMKYPHVAIGALASSAPILHFMGLVSPDIFNNIITQ 235
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYL 307
DF+ ES +C+ IK SW + K G+ L KTF +C + L WL +A+ Y
Sbjct: 236 DFRSESENCYKVIKGSWDLIDDTANKPGGMELLRKTFRICNDDFGPGSLEGWLRAAWIYT 295
Query: 308 AMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLD 367
AM DYP PS+F+ PLP YP++++C+ ID++ + L ++ SVYYNYTG CF LD
Sbjct: 296 AMTDYPTPSNFLNPLPAYPVKKMCEAIDSSVTGNNRLAKLHAAASVYYNYTGKARCFDLD 355
Query: 368 D--DPHGLDGWNWQV 380
D DPH L GW WQ
Sbjct: 356 DNSDPHDLGGWQWQA 370
>gi|226503900|ref|NP_001142279.1| uncharacterized protein LOC100274448 precursor [Zea mays]
gi|194689380|gb|ACF78774.1| unknown [Zea mays]
gi|194707984|gb|ACF88076.1| unknown [Zea mays]
gi|413943525|gb|AFW76174.1| putative serine peptidase S28 family protein [Zea mays]
Length = 542
Score = 357 bits (916), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 174/323 (53%), Positives = 229/323 (70%), Gaps = 12/323 (3%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHW------VGPNRLGPIFLYCGNEGDIE 119
+ YF Q LDHF+F + F +YL+N W G + GP+F+Y GNEGDIE
Sbjct: 84 FTAHYFPQLLDHFAFTPNASTVFRHKYLLNDTFWRRPGAGAGDDGPGPLFVYTGNEGDIE 143
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
WFA N+GF++DIAP FGA+LVF EHR+YGES P+G+ +Y++A TL YLT+ QALADFA
Sbjct: 144 WFATNTGFMFDIAPTFGALLVFIEHRFYGESKPFGND--SYRSAETLGYLTSTQALADFA 201
Query: 180 VFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPE 239
V I LK++L AEA+PVV+FGGSYGGMLA+W RLKYPH+AIGALASSAPILQF+ I P
Sbjct: 202 VVIRGLKRDLGAEAAPVVVFGGSYGGMLASWFRLKYPHVAIGALASSAPILQFDHITPWS 261
Query: 240 TFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
+FY+ VS DFK ES++CF I+ +W L G + GLL+L+K F C+ + + +W
Sbjct: 262 SFYDAVSQDFKSESSNCFGVIRAAWDVLDERGATDKGLLDLSKLFRACKTVKYAYSIRNW 321
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTG 359
L +A++Y AMVDYP P++F+ LP YP++E+CK ID P +LE+ F S+YYNYTG
Sbjct: 322 LWTAFTYTAMVDYPTPANFLENLPAYPVKEMCKTIDAFPAGADVLEKAFAAASLYYNYTG 381
Query: 360 NVDCFQLD--DDPHGLDGWNWQV 380
+ C +++ DDPHGLDGW WQ
Sbjct: 382 DQACNKIEDGDDPHGLDGWQWQA 404
>gi|326491047|dbj|BAK05623.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 511
Score = 356 bits (914), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 168/319 (52%), Positives = 226/319 (70%), Gaps = 8/319 (2%)
Query: 67 RYETRYFEQRLDHFSFADLP--TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
RYETRY+ QRLDHF+ A + TF QRYL+N +W G + P+F+Y GNEG IE F N
Sbjct: 58 RYETRYYTQRLDHFNAAPVSYSTFPQRYLVNGTYWGG--KTAPVFVYAGNEGSIELFTNN 115
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+GF+W++APRF AMLVF EHRYYG S+P+GS E A++NA+T+ YL+ QA+ADFA + +
Sbjct: 116 TGFMWELAPRFRAMLVFIEHRYYGRSVPFGSEEAAFKNASTMGYLSTTQAVADFATLVQS 175
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
LK NLSA A+PVV+FGGSYGGMLAAWMR+KYPH+ IGA+ASSAPIL F + P FY+I
Sbjct: 176 LKANLSAPAAPVVVFGGSYGGMLAAWMRMKYPHVVIGAVASSAPILGFYGMADPYAFYDI 235
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY 304
+S+DFK ES +C + + SW EL + + G +L +TF +CR ++ E + D L++A
Sbjct: 236 ISNDFKSESKNCHDVLMNSWKELDNALSNDAGRAQLNRTFKMCRG-STVEAIPDMLDTAI 294
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF 364
Y AM DYP S F+ LP YP++E+C+ ID+ RI + ++VYYNYTGN CF
Sbjct: 295 VYSAMTDYPTESGFLTHLPAYPVKEICRAIDHPKSGKDTFSRIKDALTVYYNYTGNAHCF 354
Query: 365 --QLDDDPHGL-DGWNWQV 380
+DDP+G+ +GW+WQ
Sbjct: 355 GDASEDDPYGMFNGWDWQA 373
>gi|195643982|gb|ACG41459.1| lysosomal Pro-X carboxypeptidase precursor [Zea mays]
Length = 542
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 173/323 (53%), Positives = 229/323 (70%), Gaps = 12/323 (3%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHW------VGPNRLGPIFLYCGNEGDIE 119
+ YF Q LDHF+F + F +YL+N W G + GP+F+Y GNEGDIE
Sbjct: 84 FTAHYFPQLLDHFAFTPNASTVFRHKYLLNDTFWRRPGAGAGDDGPGPLFVYTGNEGDIE 143
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
WFA N+GF++DIAP FGA+LVF EH++YGES P+G+ +Y++A TL YLT+ QALADFA
Sbjct: 144 WFATNTGFMFDIAPTFGALLVFIEHQFYGESKPFGND--SYRSAETLGYLTSTQALADFA 201
Query: 180 VFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPE 239
V I LK++L AEA+PVV+FGGSYGGMLA+W RLKYPH+AIGALASSAPILQF+ I P
Sbjct: 202 VVIRGLKRDLGAEAAPVVVFGGSYGGMLASWFRLKYPHVAIGALASSAPILQFDHITPWS 261
Query: 240 TFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
+FY+ VS DFK ES++CF I+ +W L G + GLL+L+K F C+ + + +W
Sbjct: 262 SFYDAVSQDFKSESSNCFGVIRAAWDVLDERGATDKGLLDLSKLFRACKTVKYAYSIRNW 321
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTG 359
L +A++Y AMVDYP P++F+ LP YP++E+CK ID P +LE+ F S+YYNYTG
Sbjct: 322 LWTAFTYTAMVDYPTPANFLENLPAYPVKEMCKTIDAFPAGADVLEKAFAAASLYYNYTG 381
Query: 360 NVDCFQLD--DDPHGLDGWNWQV 380
+ C +++ DDPHGLDGW WQ
Sbjct: 382 DQACNKIEDGDDPHGLDGWQWQA 404
>gi|413924933|gb|AFW64865.1| putative serine peptidase S28 family protein [Zea mays]
Length = 534
Score = 348 bits (893), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 191/405 (47%), Positives = 248/405 (61%), Gaps = 51/405 (12%)
Query: 17 VITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQ------------- 63
V+ V+I+ + L + QPS+ A R V + P PP +RQ+
Sbjct: 2 VMERVVIAAVFLLLFSCQPSE---AGRVVVRRP----PPTLARRQRHYTSPRAGGDGGGG 54
Query: 64 ----QQYRYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGD 117
+YETR++ QRLDHF+ A TF QRYL+N W GP PIFLY GNEGD
Sbjct: 55 VSVPPAVQYETRWYTQRLDHFNSAPASYATFQQRYLVNDTFWGGPT--APIFLYAGNEGD 112
Query: 118 IEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVA-YQNATTLSYLTAEQALA 176
++ F N+GF+W+ APRF A+LVF EHRYYGESMP+G T A +++A T YLT QALA
Sbjct: 113 VDLFTNNTGFMWESAPRFRALLVFVEHRYYGESMPFGGTRAAAFRDARTKGYLTVTQALA 172
Query: 177 DFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
D+A F+ +LK NLSA A+PVV+FGGSYGGMLAAWMRLKYPHI +GA+ASSAPIL F IV
Sbjct: 173 DYASFVLSLKANLSAPAAPVVVFGGSYGGMLAAWMRLKYPHIVMGAVASSAPILSFYGIV 232
Query: 237 PPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
P FY+ ++ DFK ES C++ +++SW L + G +L +TF +C +S +D+
Sbjct: 233 DPYAFYDRITDDFKSESKHCYDVLRKSWDVLDDALATKEGQAQLRRTFTMCNG-SSVQDI 291
Query: 297 ADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID------------NAPDATS-- 342
LESA Y AM DYP PS F+ PLP YP+R +C+ ID A D S
Sbjct: 292 PSLLESAVVYAAMTDYPTPSGFLTPLPAYPVRAMCRAIDASRAESAEASSGAANDGNSNS 351
Query: 343 ----ILERIFEGVSVYYNYTGNVDCFQL--DDDPHGL-DGWNWQV 380
L ++ + + VYYN+TG CF+ DDDPHG+ DGWNWQ
Sbjct: 352 TAQLTLSQVRDAMDVYYNHTGAAACFRAEEDDDPHGMYDGWNWQA 396
>gi|413954870|gb|AFW87519.1| putative serine peptidase S28 family protein [Zea mays]
Length = 552
Score = 345 bits (886), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 168/310 (54%), Positives = 223/310 (71%), Gaps = 12/310 (3%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHWVGPNRLGP-----IFLYCGNEGDIEW 120
+ T YF Q LDHF+F + F Q+YL+N W P R G +F+Y GNEGDIEW
Sbjct: 73 FTTHYFPQELDHFTFTPNASMLFRQKYLVNDTFWRRPRRGGGGGAGPLFVYTGNEGDIEW 132
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
FA N+GF++DIAP+FGA+LVF EHR+YGES+P+G + +Y +A T YLT+ QALADFA+
Sbjct: 133 FATNTGFMFDIAPKFGALLVFIEHRFYGESLPFG--DDSYSSAETEGYLTSTQALADFAI 190
Query: 181 FITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET 240
IT LK+NLSAE +PVV+FGGSYGGMLA+W RLKYPH+AIGALASSAPILQF+ I P +
Sbjct: 191 LITGLKRNLSAETAPVVVFGGSYGGMLASWFRLKYPHVAIGALASSAPILQFDHITPWNS 250
Query: 241 FYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL 300
F + VS D+K ES +CF+ IK +W L G + GLLEL+K F C+ + + +A WL
Sbjct: 251 FSDAVSQDYKSESLNCFSVIKAAWDVLDERGSTDTGLLELSKLFRACKSVKHADSIAGWL 310
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIRE---VCKKIDNAPDATSILERIFEGVSVYYNY 357
++A++Y AMVDYP P++F+M LP YP++E +CK ID P ILE++F ++YYNY
Sbjct: 311 QTAFTYTAMVDYPTPANFLMDLPAYPVKEPIQMCKIIDGFPAGADILEKVFAAANLYYNY 370
Query: 358 TGNVDCFQLD 367
TG+ C Q++
Sbjct: 371 TGDQACNQIE 380
>gi|357152979|ref|XP_003576298.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Brachypodium
distachyon]
Length = 517
Score = 343 bits (880), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 160/321 (49%), Positives = 222/321 (69%), Gaps = 8/321 (2%)
Query: 65 QYRYETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
Q RYET+Y+EQRLDHF A TF QRYL+N +W G + P+FLY GNEG++E F
Sbjct: 61 QVRYETKYYEQRLDHFDALPASYRTFPQRYLVNGTYWGG--KTSPVFLYAGNEGNVELFT 118
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
N+GF+W++APRF A+L+F EHRYYG+S P+GS E A++N +T+ YLT QA+AD A +
Sbjct: 119 NNTGFMWELAPRFRALLLFVEHRYYGKSFPFGSEEAAFRNTSTVGYLTTTQAVADLATLV 178
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
+LK NLSA A+PV++FGGSYGGMLAAW+R+KYPH+ +GA+ASSAPIL F + P FY
Sbjct: 179 QSLKSNLSAHAAPVIVFGGSYGGMLAAWVRMKYPHVVMGAVASSAPILGFYGLADPYAFY 238
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES 302
+++S+DFK ES +C + + +SWGEL + G +L TF +CR ++ + + D L++
Sbjct: 239 DVISNDFKSESKNCHDVLMKSWGELDKALSNDAGRADLNSTFKMCRA-STVDAIPDLLDT 297
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVD 362
A +Y AM DYP S F+ PLP YP++E+C+ ID+ RI + VYYN+TG
Sbjct: 298 ALTYSAMTDYPTSSGFLTPLPPYPVKEMCRAIDHPKSGNDTFARIKGALDVYYNHTGAEP 357
Query: 363 CF--QLDDDPHGL-DGWNWQV 380
C + DP+G+ DGW+WQ
Sbjct: 358 CLGDATESDPYGMFDGWDWQA 378
>gi|242070053|ref|XP_002450303.1| hypothetical protein SORBIDRAFT_05g003460 [Sorghum bicolor]
gi|241936146|gb|EES09291.1| hypothetical protein SORBIDRAFT_05g003460 [Sorghum bicolor]
Length = 553
Score = 336 bits (862), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 173/344 (50%), Positives = 225/344 (65%), Gaps = 31/344 (9%)
Query: 67 RYETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
+YETR + QRLDHF+ + TF QRYLIN W G +R PIFLY GNEGDI+ F N
Sbjct: 73 QYETRRYTQRLDHFNSLPSSYATFQQRYLINDTFWGGRSRTAPIFLYAGNEGDIDLFTNN 132
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGST-EVAYQNATTLSYLTAEQALADFAVFIT 183
+GF+W+ APRF AMLVF EHRYYGESMP+G T E A+++A T YLT QALAD+A F+
Sbjct: 133 TGFMWEAAPRFRAMLVFVEHRYYGESMPFGGTREAAFRDAATKGYLTVTQALADYASFVL 192
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
+LK NLS A+PVV+FGGSYGGMLAAWMRLKYPH+ +GA+ASSAPIL F IV P FY+
Sbjct: 193 SLKANLSVPAAPVVVFGGSYGGMLAAWMRLKYPHVVMGAVASSAPILSFYGIVDPYAFYD 252
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
++ D+K ES +C++ +++SW L + G +L +TF++C +S D+ LE+A
Sbjct: 253 RINDDYKSESKNCYDVLRKSWDVLDDALATKEGQAQLRRTFNMCNG-SSVWDIPSLLENA 311
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKID----NAPDATSI---------------- 343
AM DYP S F+ PLP YP+R +C+ ID A +TS
Sbjct: 312 MVEAAMTDYPTTSGFLTPLPAYPVRAMCRAIDARHQQAAASTSTSGSGNDDGNNSSSSTA 371
Query: 344 ----LERIFEGVSVYYNYTGNVDCFQL--DDDPHGL-DGWNWQV 380
E++ + ++VYYN+TG CF+ DDDP+GL DGWNWQ
Sbjct: 372 LLLLSEQVRDAMNVYYNHTGGAACFRAEEDDDPYGLYDGWNWQA 415
>gi|115484263|ref|NP_001065793.1| Os11g0156200 [Oryza sativa Japonica Group]
gi|62701876|gb|AAX92949.1| At2g24280/F27D4.19 [Oryza sativa Japonica Group]
gi|77548743|gb|ABA91540.1| Serine carboxypeptidase S28 family protein, expressed [Oryza sativa
Japonica Group]
gi|113644497|dbj|BAF27638.1| Os11g0156200 [Oryza sativa Japonica Group]
Length = 511
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 164/327 (50%), Positives = 217/327 (66%), Gaps = 16/327 (4%)
Query: 64 QQYRYETRYFEQRLDHFSFADLP----TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE 119
Q +YETRYF QRLDHF+ +LP TF QRYL+N W G P+F+Y GNEGD+
Sbjct: 49 QVVQYETRYFTQRLDHFN--ELPASNGTFRQRYLVNGTFWGGA--AAPVFVYAGNEGDVA 104
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL-SYLTAEQALADF 178
FA N+GF+W+ APRF AMLVF EHRYYGES+P+G T A + YLT QALADF
Sbjct: 105 LFASNTGFMWEAAPRFRAMLVFVEHRYYGESLPFGGTRAAAFADASAAGYLTTAQALADF 164
Query: 179 AVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP 238
A I +LK NL+A +PVV+FGGSYGGMLAAWMR+KYPHI +GA+ASSAPIL + P
Sbjct: 165 AELILSLKSNLTACKAPVVIFGGSYGGMLAAWMRMKYPHIVMGAVASSAPILGLNGLSDP 224
Query: 239 ETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD 298
+FYN+VS+DFK ES C++ ++ SW E+ + G L +TF++C+ + +D+
Sbjct: 225 YSFYNVVSNDFKSESKHCYDVLRNSWSEMYKALATDAGRARLNQTFNMCK--GNVDDIPG 282
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYT 358
+E A Y +M+DYP PS+F+ LP YP+RE+C+ ID + RI + +++YYN T
Sbjct: 283 LVEKALIYGSMMDYPTPSNFLTSLPAYPVREICRAIDKPTSGNDTVSRIKDAMTIYYNST 342
Query: 359 GNVDCF----QLDDDPHGL-DGWNWQV 380
G + CF DDDP+G+ GW WQ
Sbjct: 343 GGLACFPGAGAEDDDPYGMFPGWTWQA 369
>gi|321468146|gb|EFX79132.1| hypothetical protein DAPPUDRAFT_225197 [Daphnia pulex]
Length = 477
Score = 331 bits (848), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 163/325 (50%), Positives = 224/325 (68%), Gaps = 12/325 (3%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
Q Y ++T +F+Q++DHFSFA+ T+ QRYL+N+ +W GPIF Y GNEGDIEWFA
Sbjct: 18 QCYEWKTFFFKQQVDHFSFANQDTYPQRYLVNSTYW--KRGGGPIFFYTGNEGDIEWFAQ 75
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
N+GF+WDIA FGAMLVF EHRYYG+S+PYG+ +Y +A L YLT+EQALADFA +
Sbjct: 76 NTGFMWDIAEEFGAMLVFAEHRYYGQSLPYGNK--SYSDAKYLGYLTSEQALADFAELVA 133
Query: 184 NLKQ-NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
+K N A SPV+ FGGSYGGML+AWMR+KYPHI G++A+SAPILQF + P + F
Sbjct: 134 YIKSTNSGAVDSPVIAFGGSYGGMLSAWMRIKYPHIITGSIAASAPILQFTGLTPCDAFN 193
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADW 299
+V++DF S C TI++SW L S+ +++G L ++LC LN T+D L DW
Sbjct: 194 RVVTADFASASTECSETIRKSWKSLSSILSQDDGKEWLRTHWNLCVPLNGTDDVVNLKDW 253
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP-DATSILERIFEGVSVYYNYT 358
L + ++ LAMV+YPY ++F+ PLP YP++ VC+ + N+ D S+L+ +F+G+SVY N+T
Sbjct: 254 LTNVWTNLAMVNYPYAANFLAPLPAYPVKAVCEHLTNSSLDDHSLLDELFKGLSVYANFT 313
Query: 359 GNVDCF---QLDDDPHGLDGWNWQV 380
G C Q D G GW++Q
Sbjct: 314 GQTKCLDVSQQADQSLGDMGWDFQA 338
>gi|125533461|gb|EAY80009.1| hypothetical protein OsI_35177 [Oryza sativa Indica Group]
Length = 511
Score = 330 bits (847), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 164/327 (50%), Positives = 217/327 (66%), Gaps = 16/327 (4%)
Query: 64 QQYRYETRYFEQRLDHFSFADLP----TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE 119
Q +YETRYF QRLDHF+ +LP TF QRYL+N W G P+F+Y GNEGD+
Sbjct: 49 QVVQYETRYFTQRLDHFN--ELPASNGTFRQRYLVNGTFWGGA--AAPVFVYAGNEGDVA 104
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL-SYLTAEQALADF 178
FA N+GF+W+ APRF AMLVF EHRYYGES+P+G T A + YLT QALADF
Sbjct: 105 LFASNTGFMWEAAPRFRAMLVFVEHRYYGESLPFGGTRAAAFADASAAGYLTPAQALADF 164
Query: 179 AVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP 238
A I +LK NL+A +PVV+FGGSYGGMLAAWMR+KYPHI +GA+ASSAPIL + P
Sbjct: 165 AELILSLKSNLTACKAPVVIFGGSYGGMLAAWMRMKYPHIVMGAVASSAPILGLNGLSDP 224
Query: 239 ETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD 298
+FYN+VS+DFK ES C++ ++ SW E+ + G L +TF++C+ + +D+
Sbjct: 225 YSFYNVVSNDFKSESKHCYDVLRNSWSEMYKALATDAGRARLNQTFNMCK--GNVDDIPG 282
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYT 358
+E A Y +M+DYP PS+F+ LP YP+RE+C+ ID + RI + +++YYN T
Sbjct: 283 LVEKALIYGSMMDYPTPSNFLTSLPAYPVREICRAIDKPTSGNDTVSRIKDAMTIYYNST 342
Query: 359 GNVDCF----QLDDDPHGL-DGWNWQV 380
G + CF DDDP+G+ GW WQ
Sbjct: 343 GGLACFPGAGAEDDDPYGMFPGWTWQA 369
>gi|356505400|ref|XP_003521479.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 504
Score = 328 bits (841), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 169/342 (49%), Positives = 220/342 (64%), Gaps = 6/342 (1%)
Query: 42 PRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSF--ADLPTFSQRYLINTDHW 99
PRF + H P Q Y ++F Q LDHF++ TF QRYLIN +W
Sbjct: 29 PRFPSSMLH---PALDVNLLSAQNGLYTAKFFTQTLDHFNYNPQSYQTFQQRYLINDTYW 85
Query: 100 VGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE-V 158
G PIF+Y GNEGDIEWFA N+GF+++ AP F A+LVF EHRYYG+S P+G E V
Sbjct: 86 GGAKNNAPIFVYMGNEGDIEWFAQNTGFMFETAPYFKALLVFIEHRYYGKSFPFGGNEEV 145
Query: 159 AYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHI 218
A N TT+ Y+++ QALAD+A I +LK NLSA SPVV+ GGSYGGMLAAW R+KYPH+
Sbjct: 146 ADANTTTVGYMSSTQALADYATLIIDLKNNLSATDSPVVVVGGSYGGMLAAWFRMKYPHV 205
Query: 219 AIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLL 278
AIGALASSAPILQF D+V P T+ +I++ D+K ES +C+ IK SW ++ QK GL
Sbjct: 206 AIGALASSAPILQFLDLVSPYTYTDIITQDYKSESENCYKVIKGSWKQIEDTAQKPGGLE 265
Query: 279 ELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP 338
+L K+F +C+ S L WL+ A AM DYP PS F+ PLP YP+R++C+ IDN
Sbjct: 266 QLRKSFRICKHYISAGALVYWLQMALGSAAMTDYPTPSVFLAPLPAYPVRKMCEAIDNLS 325
Query: 339 DATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
+++ +++YNYTG CF LD+ L GW+WQ
Sbjct: 326 AVNETFTKLYAAANIFYNYTGTATCFFLDNTTAPLGGWDWQA 367
>gi|449437430|ref|XP_004136495.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 502
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 165/318 (51%), Positives = 216/318 (67%), Gaps = 6/318 (1%)
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y+T +F Q LDHF+F +F QRYLIN +W G PIF+Y GNEG+IEWFA N+
Sbjct: 49 YQTSFFTQILDHFNFNPQSYQSFQQRYLINDTYWGGAAHNSPIFVYTGNEGNIEWFAQNT 108
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE-VAYQNATTLSYLTAEQALADFAVFITN 184
GF+ AP F A++VF EHR+YG+S+P+G E VA N++ L YL++ QALAD+A IT+
Sbjct: 109 GFLLQYAPHFRALVVFIEHRFYGKSIPFGGDEDVANSNSSMLGYLSSTQALADYATLITD 168
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
LK+NLSA SPV++FGGSYGGMLAAW RLKYPHIA+GALASSAPILQ E+I P F NI
Sbjct: 169 LKKNLSAVDSPVLVFGGSYGGMLAAWFRLKYPHIALGALASSAPILQLENITSPYAFNNI 228
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY 304
V+ DFK ES +C++ IKESW + G L K+F C+E + E + +WL +A
Sbjct: 229 VTQDFKSESQNCYSVIKESWHLIDITSTHPQGPQLLRKSFKFCKEAEA-ESIKNWLSTAI 287
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF 364
Y AM DYP PS+F+ PLP YP++++CK ID+ +++ ++YYN+TG V CF
Sbjct: 288 IYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDPRSGNDSFTKLYGAANIYYNFTGTVTCF 347
Query: 365 --QLDDDPHGLDGWNWQV 380
D DPH L W+WQ
Sbjct: 348 DLDDDSDPHDLGDWSWQA 365
>gi|260814189|ref|XP_002601798.1| hypothetical protein BRAFLDRAFT_279167 [Branchiostoma floridae]
gi|229287100|gb|EEN57810.1| hypothetical protein BRAFLDRAFT_279167 [Branchiostoma floridae]
Length = 481
Score = 326 bits (836), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 175/365 (47%), Positives = 231/365 (63%), Gaps = 24/365 (6%)
Query: 22 IISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFS 81
+ +IL LS+A PRF P+ + + QY Y+T+YF Q +DHFS
Sbjct: 4 VAAILICLSVAHTGGTHLLNPRF----------PRPKGPALKPQYSYDTKYFTQPVDHFS 53
Query: 82 FADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVF 141
F TF QRYLIN ++ G GPIFLY GNEGDI F N+GF+WDIAP+F A++VF
Sbjct: 54 FTRTDTFDQRYLINMKYFEGTG--GPIFLYTGNEGDITMFCDNTGFMWDIAPKFKALVVF 111
Query: 142 PEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA-SPVVLFG 200
EHRYYGESMPYG +Y++ L YLTAEQALADFA IT+LK ++ A SPVV FG
Sbjct: 112 AEHRYYGESMPYGKD--SYKDPEHLGYLTAEQALADFARLITHLKASIPGAADSPVVAFG 169
Query: 201 GSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTI 260
GSYGGMLAAW R+KYP IG+LA+SAP+ QFE + P + Y+I++ DF++ S C I
Sbjct: 170 GSYGGMLAAWFRMKYPSSVIGSLAASAPVWQFEGLTPCGSSYSIITEDFQKGSPGCETYI 229
Query: 261 KESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DWLESAYSYLAMVDYPYPSD 317
+SW L +GQ G +L+ F LC LN+T D+A WL S + LAMV+YPYP+
Sbjct: 230 HKSWDLLTQMGQTAAGREKLSSMFSLCSPLNTTTDVATMSSWLLSTWFNLAMVNYPYPAS 289
Query: 318 FMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDP---HGLD 374
F+ PLPG+P++EVC I ++++LE I G +YYNYTG C +++ G
Sbjct: 290 FLEPLPGWPVKEVCSLITA---SSNVLEGIAAGAMLYYNYTGQAPCLNIEESAVSSLGDL 346
Query: 375 GWNWQ 379
GW++Q
Sbjct: 347 GWSFQ 351
>gi|363729398|ref|XP_425654.3| PREDICTED: lysosomal Pro-X carboxypeptidase [Gallus gallus]
Length = 482
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 160/333 (48%), Positives = 214/333 (64%), Gaps = 11/333 (3%)
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGN 114
P+R++ Y TRY Q++DHF F + TF QRYLI HW N GPI Y GN
Sbjct: 25 PRRRRGVAPLPAPYLTRYLTQQIDHFGFDENLTFQQRYLIADQHWKKDN--GPILFYTGN 82
Query: 115 EGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA 174
EGDI WF N+GF+WD+A AMLVF EHRYYGES+P+G+ ++ ++ L+YLT+EQA
Sbjct: 83 EGDITWFCNNTGFMWDVAEELNAMLVFAEHRYYGESLPFGNE--SFSDSKHLNYLTSEQA 140
Query: 175 LADFAVFITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
LADFAV I LK+ ++ A SPV+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QF
Sbjct: 141 LADFAVLIEYLKETIAGARHSPVIAIGGSYGGMLAAWFRMKYPHVVVGALAASAPIWQFG 200
Query: 234 DIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNST 293
D+VP ++ IV++DFK+ C +I+ SW + + + GL L++TFHLC L +
Sbjct: 201 DLVPCGAYFTIVTNDFKKSGTGCSESIRNSWNAINHLSSTDVGLQWLSRTFHLCSPLKNL 260
Query: 294 ED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEG 350
+D L WL + LAMVDYPY +DF+ PLP +PIREVCK + T + F+
Sbjct: 261 QDAAMLKTWLSETWIDLAMVDYPYKADFLQPLPAWPIREVCKFLRTPVSLTIVAAEYFQA 320
Query: 351 VSVYYNYTGNVDCFQLDDDPH---GLDGWNWQV 380
+++YYNYTG CF + + G GW +QV
Sbjct: 321 INIYYNYTGEASCFDMSETATKNLGQLGWYYQV 353
>gi|345322922|ref|XP_001513964.2| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Ornithorhynchus
anatinus]
Length = 742
Score = 324 bits (831), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 158/321 (49%), Positives = 213/321 (66%), Gaps = 12/321 (3%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
YETR+F Q++DHF F TF QRYL++ HW G I Y GNEGDI WF N+GF
Sbjct: 46 YETRFFPQKVDHFGFDLDLTFKQRYLVSDQHWREDG--GSILFYTGNEGDITWFCNNTGF 103
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+W++A AMLVF EHRYYGES+P+G + ++ ++ L+YLT+EQALADFAV I +LK
Sbjct: 104 MWEVAEELQAMLVFAEHRYYGESLPFG--DQSFSDSKHLNYLTSEQALADFAVLIEHLKA 161
Query: 188 NL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ A+ SPV+ GGSYGGMLAAW+R+KYPH+ +GALA+SAPI QF D+VP F+ IV+
Sbjct: 162 TIPGAQNSPVISIGGSYGGMLAAWIRMKYPHLVVGALAASAPIWQFGDLVPCGRFFEIVT 221
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESA 303
+DFK+ A C TI+ SW + + E GL L+ TFHLC L S +D L W+ S
Sbjct: 222 NDFKKSGAGCSETIRASWDAVGRISSTEEGLQWLSHTFHLCSPLRSQQDAVALKTWMSST 281
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSI-LERIFEGVSVYYNYTGNVD 362
+ LAMVDYPY +DF+ PLP +P++ VCK + + A + L+ IF+ ++VYYNYTG
Sbjct: 282 WVDLAMVDYPYETDFLQPLPAWPVQAVCKYLKDPKSADDVLLQNIFQALNVYYNYTGKTS 341
Query: 363 CFQLDDDPH---GLDGWNWQV 380
C + G+ GW++Q
Sbjct: 342 CLNTSETVTGNLGMQGWSYQA 362
>gi|326914542|ref|XP_003203584.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Meleagris
gallopavo]
Length = 483
Score = 323 bits (829), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 160/334 (47%), Positives = 218/334 (65%), Gaps = 12/334 (3%)
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGN 114
P+R++ Y TRY Q++DHF F + TF QRYLI HW N GPI Y GN
Sbjct: 25 PRRRRGIAPLSGPYVTRYLTQQIDHFGFDENLTFQQRYLIADQHWQKDN--GPILFYTGN 82
Query: 115 EGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA 174
EGDI WF N+GF+WD+A AMLVF EHRYYGES+P+G+ ++ ++ L+YLT+EQA
Sbjct: 83 EGDITWFCNNTGFMWDVAEELNAMLVFAEHRYYGESLPFGNE--SFSDSKHLNYLTSEQA 140
Query: 175 LADFAVFITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
LADFAV I LK+ ++ A SPV+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QF
Sbjct: 141 LADFAVLIEYLKETIAGARHSPVIAIGGSYGGMLAAWFRMKYPHVVVGALAASAPIWQFG 200
Query: 234 DIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNST 293
D+VP ++ IV++DFK+ C +I+ SW + + + GL L++TFHLC L +
Sbjct: 201 DLVPCGAYFTIVTNDFKKSGTGCSESIRNSWNAINHLSSTDAGLQWLSRTFHLCSPLKNL 260
Query: 294 ED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFE 349
+D L WL + LAMVDYPY +DF+ PLP +PI+EVCK + D + +L+ +F+
Sbjct: 261 QDALMLKSWLSETWIDLAMVDYPYKADFLEPLPAWPIQEVCKFLKDPSLSDKLLLQNVFQ 320
Query: 350 GVSVYYNYTGNVDCFQLDDDPH---GLDGWNWQV 380
+++YYNYTG CF + + G GW +QV
Sbjct: 321 AINIYYNYTGEASCFDMSETATKNLGQLGWYYQV 354
>gi|224043680|ref|XP_002188561.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Taeniopygia guttata]
Length = 479
Score = 323 bits (829), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 160/333 (48%), Positives = 217/333 (65%), Gaps = 12/333 (3%)
Query: 56 QRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE 115
+R +R Y TRY Q++DHF F + TF QRYL+ HW N GPI Y GNE
Sbjct: 22 RRHRRASLPVGPYLTRYLSQQIDHFGFDENRTFQQRYLLADQHWKKDN--GPILFYTGNE 79
Query: 116 GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQAL 175
GDIEWF N+GF+WD+A AMLVF EHRYYGES+P+G+ ++ ++ L+YLT+EQAL
Sbjct: 80 GDIEWFCNNTGFMWDVAEELNAMLVFAEHRYYGESLPFGNE--SFSDSKHLNYLTSEQAL 137
Query: 176 ADFAVFITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
ADFAV + LK ++ A+ SPV+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QF D
Sbjct: 138 ADFAVLVEYLKTTIAGAQHSPVIAIGGSYGGMLAAWFRMKYPHVVVGALAASAPIWQFAD 197
Query: 235 IVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
+VP TF++IV++DFK+ C +I+ SW + + + GL L+ TFHLC L + +
Sbjct: 198 LVPCGTFFSIVTNDFKKSGKGCSESIRNSWNAINHLSSTDAGLQWLSNTFHLCSPLKTLQ 257
Query: 295 D---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEG 350
D L +WL + LAMV+YPY +DF+ PLP +PI+EVCK + D + +L+ +F+
Sbjct: 258 DAAVLKNWLSETWVNLAMVNYPYKADFLQPLPAWPIQEVCKFLKDPSLSDKLLLQNVFQA 317
Query: 351 VSVYYNYTGNVDCF---QLDDDPHGLDGWNWQV 380
V++YYNYTG C Q G GW +Q
Sbjct: 318 VNLYYNYTGEASCLDVSQTATKSLGEMGWYYQA 350
>gi|413954869|gb|AFW87518.1| putative serine peptidase S28 family protein [Zea mays]
Length = 1052
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 163/349 (46%), Positives = 218/349 (62%), Gaps = 41/349 (11%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHWVGPN---RLGPIFLYCGNEGDIEWFA 122
+ YF+Q LDHFSF + F Q+YL+N W P GP+ +Y G E DIE A
Sbjct: 79 FTVHYFQQELDHFSFTPNASTVFYQKYLVNDTFWRRPGGGGTAGPLLVYVGGEADIECIA 138
Query: 123 VNSGFVWDIAPRFGAMLVFPEH---------------------------RYYGESMPYGS 155
N GF++DIAP FGA+LVF EH R+YGES+P+G+
Sbjct: 139 HNVGFMFDIAPTFGALLVFVEHIEYIFGDLNIGPQKDMARVVWWSKRKHRFYGESLPFGN 198
Query: 156 TEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKY 215
+A L YLT+ QALAD A+ IT+LK+NLSAE SPVV+FGGSYGGMLA+W RLKY
Sbjct: 199 -----NSAQALGYLTSTQALADLAILITDLKRNLSAETSPVVIFGGSYGGMLASWFRLKY 253
Query: 216 PHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKEN 275
PH+ IGALASSAPILQF+ I P +FY++VS D+K ES +CF+ IK +W L G N
Sbjct: 254 PHVTIGALASSAPILQFDYITPWSSFYDVVSQDYKSESLNCFSVIKAAWDVLEERGSNGN 313
Query: 276 GLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID 335
GL+EL+K F C+ + + + WL +A+ +AM+DYP P+ F+ LP YP++E+CK +D
Sbjct: 314 GLVELSKLFRACKTVKYADSIRRWLRTAFVSIAMMDYPTPASFLENLPAYPVKEMCKIVD 373
Query: 336 NAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPH----GLDGWNWQV 380
P ILE++F S+YYNYTG+ C Q++D+ + L+ W WQ
Sbjct: 374 GFPADADILEKVFAAASLYYNYTGDQTCNQIEDEGNPRCLNLNYWGWQA 422
>gi|449521545|ref|XP_004167790.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 501
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 163/318 (51%), Positives = 213/318 (66%), Gaps = 7/318 (2%)
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y+T +F Q LDHF+F +F QRYLIN +W G PIF+Y GNEG+IEWFA N+
Sbjct: 49 YQTSFFTQILDHFNFNPQSYQSFQQRYLINDTYWGGAAHNSPIFVYTGNEGNIEWFAQNT 108
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE-VAYQNATTLSYLTAEQALADFAVFITN 184
GF+ AP F A++VF EHR+YG+S+P+G E VA N++ L YL++ QALAD+A IT+
Sbjct: 109 GFLLQYAPHFRALVVFIEHRFYGKSIPFGGDEDVANSNSSMLGYLSSTQALADYATLITD 168
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
LK+NLSA SPV++FGGSYGGMLAAW RLKYPHIA+GALASSAPILQ E+I P F NI
Sbjct: 169 LKKNLSAVDSPVLVFGGSYGGMLAAWFRLKYPHIALGALASSAPILQLENITSPYAFNNI 228
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY 304
V+ DFK ES +C++ IKESW + G L K+F + E + +WL +A
Sbjct: 229 VTQDFKSESQNCYSVIKESWHLIDITSTHPQGPQLLRKSFQILEA--EAESIKNWLSTAI 286
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF 364
Y AM DYP PS+F+ PLP YP++++CK ID+ +++ ++YYN+TG V CF
Sbjct: 287 IYTAMTDYPTPSNFLNPLPAYPVKQMCKAIDDPRSGNDSFTKLYGAANIYYNFTGTVTCF 346
Query: 365 --QLDDDPHGLDGWNWQV 380
D DPH L W+WQ
Sbjct: 347 DLDDDSDPHDLGDWSWQA 364
>gi|291234129|ref|XP_002737003.1| PREDICTED: prolylcarboxypeptidase isoform 1 preproprotein-like
[Saccoglossus kowalevskii]
Length = 501
Score = 319 bits (817), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 164/371 (44%), Positives = 227/371 (61%), Gaps = 14/371 (3%)
Query: 17 VITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQR 76
++ + I I + L SK RF L + P R + +Y Y+T YF+Q
Sbjct: 7 IVALFAIIITASTRLIDTESKSHILSRFRPGLG-VQIKPDHGSRLRLSKYNYQTLYFKQT 65
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
LDHF+FA+ TFSQRYL++ D+W N GPIF Y GNEGDI WF N+GF+WDIAP+F
Sbjct: 66 LDHFNFANNGTFSQRYLLSDDYW---NSSGPIFFYTGNEGDITWFCNNTGFIWDIAPQFK 122
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASP 195
AM++F EHRYYGES+P+G+ ++ + + YLT+EQALADFA I +K + A+ SP
Sbjct: 123 AMVIFAEHRYYGESLPFGNE--SFSDLEHVGYLTSEQALADFATLIKYIKSSRPGADKSP 180
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V++FGGSYGGM+AAW R+KYP+I GALA+SAPI QF + P T + I++ DF +
Sbjct: 181 VIVFGGSYGGMMAAWFRMKYPNIVDGALAASAPIWQFPGLTPCNTLFTIITQDFVKAGRD 240
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS---TEDLADWLESAYSYLAMVDY 312
C TI +SW + + +E+G LT FHLC L + DL+ WL + + LAMVDY
Sbjct: 241 CAETIHKSWNAINRMKDQESGRQWLTMAFHLCTPLKTPADVNDLSSWLSNTWFNLAMVDY 300
Query: 313 PYPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDP- 370
PYP+ F+ PLP +PI++ C + N+ +L + + VYYN TG CF + D
Sbjct: 301 PYPASFLEPLPAWPIKKTCSYMTNSSLLDKPLLNSVASSLQVYYNTTGKTQCFNISQDAV 360
Query: 371 --HGLDGWNWQ 379
G GW++Q
Sbjct: 361 SSLGELGWSYQ 371
>gi|118404640|ref|NP_001072639.1| prolylcarboxypeptidase precursor [Xenopus (Silurana) tropicalis]
gi|115313632|gb|AAI23913.1| prolylcarboxypeptidase (angiotensinase C) [Xenopus (Silurana)
tropicalis]
Length = 499
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 158/321 (49%), Positives = 209/321 (65%), Gaps = 12/321 (3%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
YET YF Q++DHF F + TF QRYL++ +W P GPI Y GNEGDI F N+GF
Sbjct: 53 YETYYFTQQVDHFGFYEDATFKQRYLVSDTYWRKPG--GPILFYTGNEGDITLFCNNTGF 110
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+WD+A GAMLVF EHRYYGESMP+G ++A+ + L+YLT+EQALADFAV + K
Sbjct: 111 MWDVAEEMGAMLVFAEHRYYGESMPFG--DLAFSDPKHLNYLTSEQALADFAVLLRYFKA 168
Query: 188 NL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
A+ SPV+ GGSYGGMLAAW R+KYP + +GA+ASSAPI QFED+VP +Y +V+
Sbjct: 169 TTEGAQNSPVIALGGSYGGMLAAWFRMKYPDVVVGAIASSAPIWQFEDLVPCNEYYQVVT 228
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DWLESA 303
+DFK+ C +++ SW + + + +GL L+ FHLC L S ED+A WL
Sbjct: 229 NDFKKSGPGCSESVRNSWAAINRMAETTDGLQWLSNAFHLCCPLKSKEDVAVFKGWLSET 288
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEGVSVYYNYTGNVD 362
+ LAMVDYPYP+ F+ PLP +PI+ VCK + D +L+ IF+ V+VYYNYTG+
Sbjct: 289 WVSLAMVDYPYPASFLEPLPAWPIQVVCKFLKDPKSGDKDLLQDIFQAVNVYYNYTGDTP 348
Query: 363 CF---QLDDDPHGLDGWNWQV 380
C Q G GW++Q
Sbjct: 349 CLNTSQTASGSLGDLGWSYQA 369
>gi|47209068|emb|CAF90249.1| unnamed protein product [Tetraodon nigroviridis]
Length = 487
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 163/323 (50%), Positives = 208/323 (64%), Gaps = 15/323 (4%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
Y+T YF+Q++DHF F + TF QRYL+N HW P GPIF Y GNEGDI WF N+GF
Sbjct: 46 YKTLYFDQKIDHFGFLEDGTFKQRYLVNDKHWQQPG--GPIFFYTGNEGDITWFCNNTGF 103
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+W+IA FGAMLVF EHRYYGES+P+G +Y + L+YLT+EQALADFAV I NLK
Sbjct: 104 MWEIAEEFGAMLVFAEHRYYGESLPFGHD--SYSDNKHLNYLTSEQALADFAVLIQNLKS 161
Query: 188 NL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
L A+ SPV+ GGSYGGMLAAW R+KYPH+ +GALASSAPI QF +VP FY IV+
Sbjct: 162 TLPGAQNSPVIAIGGSYGGMLAAWFRMKYPHMVVGALASSAPIWQFPGMVPCGDFYKIVT 221
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DWLESA 303
DF R ++C I SW + +V +GL L++ F LC L + D+A WL+
Sbjct: 222 QDFARSGSNCDKNIGMSWKAIENVSSTASGLQWLSEEFGLCTPLKNRNDVAGFKSWLQET 281
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKID---NAPDATSILERIFEGVSVYYNYTGN 360
+ LAMVDYPY ++F+ PLP +PI+ VCK + + PD +L + + VYYNYTG+
Sbjct: 282 WVNLAMVDYPYEANFLQPLPAWPIQVVCKYLSFSHSVPD-DHLLHGVSQATKVYYNYTGS 340
Query: 361 VDCF---QLDDDPHGLDGWNWQV 380
C Q G GW +Q
Sbjct: 341 SPCLNTSQTATGSLGFIGWFYQA 363
>gi|410910456|ref|XP_003968706.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Takifugu
rubripes]
Length = 500
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 163/323 (50%), Positives = 208/323 (64%), Gaps = 15/323 (4%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
Y+T YFEQ++DHF F + TF QRYLI HW P GPIF Y GNEGDI WF N+GF
Sbjct: 46 YKTLYFEQKIDHFGFLEDGTFKQRYLIADKHWQQPG--GPIFFYTGNEGDITWFCNNTGF 103
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+W+IA FGAMLVF EHRYYGES+P+G+ +Y + L+YLT+EQALADFAV + NLK
Sbjct: 104 MWEIAEEFGAMLVFAEHRYYGESLPFGAD--SYSDNKHLNYLTSEQALADFAVLVQNLKS 161
Query: 188 NL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
A+ SPV+ GGSYGGMLAAW R+KYPHI +GALASSAPI QF +VP FY IV+
Sbjct: 162 TFPGAQNSPVIAVGGSYGGMLAAWFRMKYPHIVVGALASSAPIWQFPGMVPCGDFYKIVT 221
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DWLESA 303
DF + + C I+ SW + +V +GL L++ F LC L S D+A +WL+
Sbjct: 222 QDFAKSGSDCDKNIRMSWKAIENVSSTASGLQWLSEEFGLCAPLKSKSDIAGFKNWLQET 281
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKID---NAPDATSILERIFEGVSVYYNYTGN 360
+ LAMVDYPY ++F+ PLP +PI+ VCK + + PD +L + + VYYNYTG+
Sbjct: 282 WVNLAMVDYPYEANFLQPLPPWPIQAVCKYLSFSGSVPD-YHLLHGVSQATKVYYNYTGS 340
Query: 361 VDCF---QLDDDPHGLDGWNWQV 380
C Q G GW +Q
Sbjct: 341 SPCLNTSQTATGSLGFLGWFYQA 363
>gi|386781752|ref|NP_001248178.1| lysosomal Pro-X carboxypeptidase precursor [Macaca mulatta]
gi|380789243|gb|AFE66497.1| lysosomal Pro-X carboxypeptidase isoform 1 preproprotein [Macaca
mulatta]
gi|384941696|gb|AFI34453.1| lysosomal Pro-X carboxypeptidase isoform 1 preproprotein [Macaca
mulatta]
Length = 496
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 163/364 (44%), Positives = 224/364 (61%), Gaps = 19/364 (5%)
Query: 24 SILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA 83
S L+P + A R A R +G L T P + Y YF+Q++DHF F
Sbjct: 12 SFLAPWATKA----LRPALRALGSLHLPTNPTSLPAVAKN----YSVLYFQQKVDHFGFN 63
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPE 143
+ TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A AMLVF E
Sbjct: 64 TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAE 121
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASPVVLFGGS 202
HRYYGES+P+G +++++ L++LT+EQALADFA I +LK+ + AE PV+ GGS
Sbjct: 122 HRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGS 179
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKE 262
YGGMLAAW R+KYPH+A+GALA+SAPI QFED+VP F IV++DFK+ C +I+
Sbjct: 180 YGGMLAAWFRMKYPHMAVGALAASAPIWQFEDLVPCGVFMKIVTTDFKKSGPHCSESIRR 239
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYPYPSDFMM 320
SW + + +GL LT HLC L S + L DW+ + LAMVDYPY S+F+
Sbjct: 240 SWDAINRLSSTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQ 299
Query: 321 PLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGW 376
PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + + G GW
Sbjct: 300 PLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGW 359
Query: 377 NWQV 380
++Q
Sbjct: 360 SYQA 363
>gi|327269259|ref|XP_003219412.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Anolis
carolinensis]
Length = 500
Score = 313 bits (802), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 211/340 (62%), Gaps = 14/340 (4%)
Query: 50 HLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIF 109
H P R Q Y T + Q++DHF F + TF QRYLI HW +G I
Sbjct: 32 HAASRPDPANRIFQAPREYHTCFIGQKIDHFGFYENRTFKQRYLIAEQHW--KRDVGSIL 89
Query: 110 LYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYL 169
Y GNEGDI WFA N+GF+W++A A+LVF EHRYYG S+P+G+ ++ +A L+YL
Sbjct: 90 FYTGNEGDITWFANNTGFMWNVAEELDAILVFAEHRYYGVSLPFGNK--SFSDAKHLNYL 147
Query: 170 TAEQALADFAVFITNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAP 228
++ QALADFAV + +LK + A+ +PV+ GGSYGGMLAAW R+KYPHI IGALA+SAP
Sbjct: 148 SSAQALADFAVLVQHLKATIPGAQDTPVIAIGGSYGGMLAAWFRMKYPHIVIGALAASAP 207
Query: 229 ILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR 288
I QF+ +VP TFY+IV+ DFK+ C +I+ SW + + E GL L+ TF LC
Sbjct: 208 IWQFDSLVPCGTFYSIVTQDFKKSGNCCSESIRNSWAAINRLASTEEGLRWLSSTFRLCT 267
Query: 289 ELNSTEDLA---DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN--APDATSI 343
L + D A WL + LAMVDYPY +DF+ PLP +PI+ VCK + N PD +
Sbjct: 268 PLKTETDAATFKGWLGETWVNLAMVDYPYKADFLQPLPAWPIQVVCKYLKNPKLPDKL-L 326
Query: 344 LERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGWNWQV 380
L+ IF+ V+VYYNYTG+ C L G+ GW +Q
Sbjct: 327 LQNIFQAVNVYYNYTGHASCLNLTQTATKSLGIQGWYYQA 366
>gi|449271455|gb|EMC81816.1| Lysosomal Pro-X carboxypeptidase, partial [Columba livia]
Length = 437
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 156/313 (49%), Positives = 208/313 (66%), Gaps = 12/313 (3%)
Query: 76 RLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRF 135
++DHF F D TF QRYLI HW N GPI Y GNEGDI WF+ N+GF+WD+A
Sbjct: 1 QVDHFGFDDNLTFQQRYLIADQHWKKNN--GPILFYTGNEGDITWFSNNTGFMWDVAQEL 58
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS-AEAS 194
AMLVF EHRYYGES+P+G+ +Y ++ L+YLT+EQALADFAV I +LK ++ A S
Sbjct: 59 NAMLVFAEHRYYGESLPFGNE--SYSDSKRLNYLTSEQALADFAVLIEHLKSTIAGARYS 116
Query: 195 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESA 254
PV+ GGSY GMLAAW R+KYPH+ +GALA+SAPI QF D+VP F++IV+ DFKR
Sbjct: 117 PVIAIGGSYRGMLAAWFRMKYPHLVVGALAASAPIWQFGDLVPCGAFFSIVTKDFKRSGT 176
Query: 255 SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DWLESAYSYLAMVD 311
C +I+ SW + + + GLL L+KTFHLC L + ED+A L + LAM++
Sbjct: 177 GCAESIRNSWSAIDRLSSTDEGLLWLSKTFHLCSPLKNLEDVAVLKRGLSDTWINLAMMN 236
Query: 312 YPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDP 370
YPY SDF+ PLP +PI EVCK + D + +L+ +F +++YYNY+G + CF + +
Sbjct: 237 YPYKSDFLEPLPAWPIEEVCKFLKDPSLSDKLLLQNVFRAINIYYNYSGQILCFDMAETA 296
Query: 371 H---GLDGWNWQV 380
G GWN+Q
Sbjct: 297 TKSLGHLGWNYQT 309
>gi|383408887|gb|AFH27657.1| lysosomal Pro-X carboxypeptidase isoform 1 preproprotein [Macaca
mulatta]
Length = 496
Score = 312 bits (800), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 162/364 (44%), Positives = 224/364 (61%), Gaps = 19/364 (5%)
Query: 24 SILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA 83
S L+P + A R A R +G L T P + Y YF+Q++DHF F
Sbjct: 12 SFLAPWATKA----LRPALRALGSLHLPTNPTSLPAVAKN----YSVLYFQQKVDHFGFN 63
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPE 143
+ TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A AMLVF E
Sbjct: 64 TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAE 121
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASPVVLFGGS 202
HRYYGES+P+G +++++ L++LT+EQALADFA I +LK+ + AE PV+ GGS
Sbjct: 122 HRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGS 179
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKE 262
YGGMLAAW R+KYPH+A+GALA+SAPI QFED+VP F IV++DF++ C +I+
Sbjct: 180 YGGMLAAWFRMKYPHMAVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIRR 239
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYPYPSDFMM 320
SW + + +GL LT HLC L S + L DW+ + LAMVDYPY S+F+
Sbjct: 240 SWDAINRLSSTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQ 299
Query: 321 PLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGW 376
PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + + G GW
Sbjct: 300 PLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGW 359
Query: 377 NWQV 380
++Q
Sbjct: 360 SYQA 363
>gi|402894798|ref|XP_003910532.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Papio
anubis]
Length = 496
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 162/364 (44%), Positives = 224/364 (61%), Gaps = 19/364 (5%)
Query: 24 SILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA 83
S L+P + A R A R +G L T P + Y YF+Q++DHF F
Sbjct: 12 SFLAPWATKA----LRPALRALGSLHLPTNPTSLPAVAKN----YSVLYFQQKVDHFGFN 63
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPE 143
+ TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A AMLVF E
Sbjct: 64 TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAE 121
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASPVVLFGGS 202
HRYYGES+P+G +++++ L++LT+EQALADFA I +LK+ + AE PV+ GGS
Sbjct: 122 HRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGS 179
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKE 262
YGGMLAAW R+KYPH+A+GALA+SAPI QFED+VP F IV++DF++ C +I+
Sbjct: 180 YGGMLAAWFRMKYPHMAVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPYCSESIRR 239
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYPYPSDFMM 320
SW + + +GL LT HLC L S + L DW+ + LAMVDYPY S+F+
Sbjct: 240 SWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQ 299
Query: 321 PLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGW 376
PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + + G GW
Sbjct: 300 PLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGW 359
Query: 377 NWQV 380
++Q
Sbjct: 360 SYQA 363
>gi|426369948|ref|XP_004051942.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Gorilla
gorilla gorilla]
Length = 496
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 161/364 (44%), Positives = 223/364 (61%), Gaps = 19/364 (5%)
Query: 24 SILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA 83
S L+P + A R A R +G L T P + Y YF+Q++DHF F
Sbjct: 12 SFLAPWATIA----LRPALRALGSLHLPTNPTSLPAVAKN----YSVLYFQQKVDHFGFN 63
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPE 143
+ TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A AMLVF E
Sbjct: 64 TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAE 121
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASPVVLFGGS 202
HRYYGES+P+G +++++ L++LT+EQALADFA I +LK+ + AE PV+ GGS
Sbjct: 122 HRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGS 179
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKE 262
YGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV++DF++ C +I+
Sbjct: 180 YGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIRR 239
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYPYPSDFMM 320
SW + + +GL LT HLC L S + L DW+ + LAMVDYPY S+F+
Sbjct: 240 SWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQ 299
Query: 321 PLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGW 376
PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + + G GW
Sbjct: 300 PLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGW 359
Query: 377 NWQV 380
++Q
Sbjct: 360 SYQA 363
>gi|197098906|ref|NP_001125428.1| lysosomal Pro-X carboxypeptidase precursor [Pongo abelii]
gi|68565632|sp|Q5RBU7.1|PCP_PONAB RecName: Full=Lysosomal Pro-X carboxypeptidase; AltName:
Full=Proline carboxypeptidase; AltName:
Full=Prolylcarboxypeptidase; Short=PRCP; Flags:
Precursor
gi|55728021|emb|CAH90763.1| hypothetical protein [Pongo abelii]
Length = 496
Score = 310 bits (795), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 161/364 (44%), Positives = 222/364 (60%), Gaps = 19/364 (5%)
Query: 24 SILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA 83
S L+P + A R A R +G L T P + Y YF+Q++DHF F
Sbjct: 12 SFLAPWTTIA----LRPALRALGSLHLPTNPTSLPAVAKN----YSVLYFQQKVDHFGFN 63
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPE 143
+ TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A AMLVF E
Sbjct: 64 TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAE 121
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASPVVLFGGS 202
HRYYGES+P+G ++++ L++LT+EQALADFA I +LK+ + AE PV+ GGS
Sbjct: 122 HRYYGESLPFGDN--TFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGS 179
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKE 262
YGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV++DF++ C +I+
Sbjct: 180 YGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIRR 239
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYPYPSDFMM 320
SW + + +GL LT HLC L S + L DW+ + LAMVDYPY S+F+
Sbjct: 240 SWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQ 299
Query: 321 PLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGW 376
PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + + G GW
Sbjct: 300 PLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGW 359
Query: 377 NWQV 380
++Q
Sbjct: 360 SYQA 363
>gi|114639541|ref|XP_001175153.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 7 [Pan
troglodytes]
gi|397502802|ref|XP_003822032.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Pan
paniscus]
gi|410223852|gb|JAA09145.1| prolylcarboxypeptidase (angiotensinase C) [Pan troglodytes]
gi|410254528|gb|JAA15231.1| prolylcarboxypeptidase (angiotensinase C) [Pan troglodytes]
gi|410308824|gb|JAA33012.1| prolylcarboxypeptidase (angiotensinase C) [Pan troglodytes]
gi|410334959|gb|JAA36426.1| prolylcarboxypeptidase (angiotensinase C) [Pan troglodytes]
Length = 496
Score = 310 bits (794), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 161/364 (44%), Positives = 223/364 (61%), Gaps = 19/364 (5%)
Query: 24 SILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA 83
S L+P + A R A R +G L T P + Y YF+Q++DHF F
Sbjct: 12 SFLAPWATIA----LRPALRALGSLHLPTNPTSLPAVAKN----YSVLYFQQKVDHFGFN 63
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPE 143
+ TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A AMLVF E
Sbjct: 64 TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAE 121
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASPVVLFGGS 202
HRYYGES+P+G +++++ L++LT+EQALADFA I +LK+ + AE PV+ GGS
Sbjct: 122 HRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGS 179
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKE 262
YGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV++DF++ C +I+
Sbjct: 180 YGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIRR 239
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYPYPSDFMM 320
SW + + +GL LT HLC L S + L DW+ + LAMVDYPY S+F+
Sbjct: 240 SWEAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQ 299
Query: 321 PLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGW 376
PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + + G GW
Sbjct: 300 PLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGW 359
Query: 377 NWQV 380
++Q
Sbjct: 360 SYQA 363
>gi|242006450|ref|XP_002424063.1| Lysosomal Pro-X carboxypeptidase, putative [Pediculus humanus
corporis]
gi|212507369|gb|EEB11325.1| Lysosomal Pro-X carboxypeptidase, putative [Pediculus humanus
corporis]
Length = 457
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 156/314 (49%), Positives = 212/314 (67%), Gaps = 11/314 (3%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
F +DHFSF TF+ RYLIN +W N+ GPIF Y GNEGDIE FA N+GF+W+IA
Sbjct: 13 FINYVDHFSFVTNETFNIRYLINDTYW--NNKTGPIFFYTGNEGDIEVFAQNTGFMWEIA 70
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
P+F A+L+F EHRYYG+S+PYG+ + + L YLT+EQALAD+ I +L N +
Sbjct: 71 PKFNALLIFAEHRYYGKSLPYGNK--TFSDPKYLGYLTSEQALADYVDLIAHLTWNDNKS 128
Query: 193 -ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR 251
+PV+ FGGSYGGMLAA++R+KYPH+ GA+ASSAPI QF + P + F IV+SDF+
Sbjct: 129 YKNPVIAFGGSYGGMLAAYIRMKYPHMVAGAIASSAPIWQFTGLTPCDVFSRIVTSDFEI 188
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL-NSTE--DLADWLESAYSYLA 308
E+ SC I+ SW + ++ E+GL +T + LCR L NST+ DL +WL Y+ LA
Sbjct: 189 ENKSCSFNIRRSWNIINNITSNEDGLKWITNEWKLCRPLKNSTDVRDLKNWLSDVYNNLA 248
Query: 309 MVDYPYPSDFMMPLPGYPIREVCKKIDN-APDATSILERIFEGVSVYYNYTGNVDCFQLD 367
MV+YPYP++F+ PLPGYPIR+ CKK+ N +L +++ V+VY+NYTG+ C D
Sbjct: 249 MVNYPYPTNFLTPLPGYPIRKFCKKLQNTTATGKDLLNLLYKSVTVYFNYTGSSKCLNFD 308
Query: 368 DD--PHGLDGWNWQ 379
D G D W++Q
Sbjct: 309 DSTPSLGADLWDYQ 322
>gi|4826940|ref|NP_005031.1| lysosomal Pro-X carboxypeptidase isoform 1 preproprotein [Homo
sapiens]
gi|1172047|sp|P42785.1|PCP_HUMAN RecName: Full=Lysosomal Pro-X carboxypeptidase; AltName:
Full=Angiotensinase C; AltName: Full=Lysosomal
carboxypeptidase C; AltName: Full=Proline
carboxypeptidase; AltName: Full=Prolylcarboxypeptidase;
Short=PRCP; Flags: Precursor
gi|431321|gb|AAA99891.1| prolylcarboxypeptidase [Homo sapiens]
gi|16306648|gb|AAH01500.1| Prolylcarboxypeptidase (angiotensinase C) [Homo sapiens]
gi|119595480|gb|EAW75074.1| prolylcarboxypeptidase (angiotensinase C), isoform CRA_a [Homo
sapiens]
gi|119595482|gb|EAW75076.1| prolylcarboxypeptidase (angiotensinase C), isoform CRA_a [Homo
sapiens]
gi|123992784|gb|ABM83994.1| prolylcarboxypeptidase (angiotensinase C) [synthetic construct]
gi|123999550|gb|ABM87320.1| prolylcarboxypeptidase (angiotensinase C) [synthetic construct]
gi|197692241|dbj|BAG70084.1| prolylcarboxypeptidase isoform 1 preproprotein [Homo sapiens]
gi|197692495|dbj|BAG70211.1| prolylcarboxypeptidase isoform 1 preproprotein [Homo sapiens]
Length = 496
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 161/364 (44%), Positives = 222/364 (60%), Gaps = 19/364 (5%)
Query: 24 SILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA 83
S L+P + A R A R +G L T P + Y YF+Q++DHF F
Sbjct: 12 SFLAPWATIA----LRPALRALGSLHLPTNPTSLPAVAKN----YSVLYFQQKVDHFGFN 63
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPE 143
+ TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A AMLVF E
Sbjct: 64 TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAE 121
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASPVVLFGGS 202
HRYYGES+P+G +++++ L++LT+EQALADFA I +LK+ + AE PV+ GGS
Sbjct: 122 HRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGS 179
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKE 262
YGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV++DF++ C +I
Sbjct: 180 YGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIHR 239
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYPYPSDFMM 320
SW + + +GL LT HLC L S + L DW+ + LAMVDYPY S+F+
Sbjct: 240 SWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQ 299
Query: 321 PLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGW 376
PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + + G GW
Sbjct: 300 PLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGW 359
Query: 377 NWQV 380
++Q
Sbjct: 360 SYQA 363
>gi|225708852|gb|ACO10272.1| Lysosomal Pro-X carboxypeptidase precursor [Caligus rogercresseyi]
Length = 492
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 154/325 (47%), Positives = 210/325 (64%), Gaps = 9/325 (2%)
Query: 60 RQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE 119
+ Q Y Y+T ++E +DHF FA TF QRYLIN HW GPIFLY GNEGDIE
Sbjct: 33 EELQGYYTYKTEWYENLVDHFGFAINSTFKQRYLINDTHW-DSQHGGPIFLYAGNEGDIE 91
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
FA N+GF+WDIAP F A+++F EHRYYG+S+P+G + + YLT+EQALAD+A
Sbjct: 92 AFAQNTGFMWDIAPEFNALIIFIEHRYYGKSLPFGKDSLK-PDPKMNGYLTSEQALADYA 150
Query: 180 VFITNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP 238
F+T K A+ SPV++FGGSYGGMLAAWMR+KYPHI GA+A SAP+ QF+ P
Sbjct: 151 RFVTEFKSTRKGAKDSPVIVFGGSYGGMLAAWMRIKYPHIVNGAIAGSAPVAQFD--TPC 208
Query: 239 ETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD 298
F IV+SD+ S SC I +SW + VG+ + GL L LC + S E L
Sbjct: 209 LNFGRIVTSDYSFYSKSCSGVISKSWAAIDQVGKNDKGLQRLQSLLKLCSKPKSVEPLKS 268
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNA---PDATSILERIFEGVSVYY 355
+L ++ +AM++YPYP++F+MPLPG P++ +C K+ D +IL+ ++EG++VY
Sbjct: 269 FLTDVWTSVAMMNYPYPTEFLMPLPGNPVKYICGKMSPTTVPTDPVAILKYVYEGLNVYA 328
Query: 356 NYTGNVDCFQLDD-DPHGLDGWNWQ 379
NY+G C +D+ D G D W++Q
Sbjct: 329 NYSGKAKCIDMDNADQIGADMWDYQ 353
>gi|189066514|dbj|BAG35764.1| unnamed protein product [Homo sapiens]
Length = 496
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 161/364 (44%), Positives = 222/364 (60%), Gaps = 19/364 (5%)
Query: 24 SILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA 83
S L+P + A R A R +G L T P + Y YF+Q++DHF F
Sbjct: 12 SFLAPWATIA----LRPALRALGSLHLPTNPTSLPAVAKN----YSVLYFQQKVDHFGFN 63
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPE 143
+ TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A AMLVF E
Sbjct: 64 TVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEDLKAMLVFAE 121
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASPVVLFGGS 202
HRYYGES+P+G +++++ L++LT+EQALADFA I +LK+ + AE PV+ GGS
Sbjct: 122 HRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGS 179
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKE 262
YGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV++DF++ C +I
Sbjct: 180 YGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIHR 239
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYPYPSDFMM 320
SW + + +GL LT HLC L S + L DW+ + LAMVDYPY S+F+
Sbjct: 240 SWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYPYASNFLQ 299
Query: 321 PLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGW 376
PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + + G GW
Sbjct: 300 PLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATSSLGTLGW 359
Query: 377 NWQV 380
++Q
Sbjct: 360 SYQA 363
>gi|410972567|ref|XP_003992730.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Felis catus]
Length = 497
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 209/322 (64%), Gaps = 12/322 (3%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+Y Y +Q++DHF F TF QRYL+ +HW + G I Y GNEGDI WF N+G
Sbjct: 47 KYTVHYLQQKVDHFGFTTDKTFKQRYLLADEHWKKDD--GSILFYTGNEGDIVWFCNNTG 104
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+WD+A AMLVF EHRYYGES+P+G+ +++++ L+YLT+EQALADFAV I LK
Sbjct: 105 FMWDVAEELKAMLVFAEHRYYGESLPFGND--SFKDSRYLNYLTSEQALADFAVLIKYLK 162
Query: 187 QNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ + A+ PV+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI F ++VP F IV
Sbjct: 163 RTIPGAKNQPVIALGGSYGGMLAAWFRMKYPHMVVGALAASAPIWHFGNLVPCGVFMKIV 222
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
+ DF++ +C TI SWG + + + +GL L++ HLC L +++D L W+
Sbjct: 223 TEDFRKSGPNCSETIHNSWGAITRLARTGSGLHWLSEALHLCTPLRNSQDVQHLKGWISE 282
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN-APDATSILERIFEGVSVYYNYTGNV 361
+ LAMVDYPY S+F+ PLP +PI+ VCK ++N + +LE IF+ +++YYNY+G
Sbjct: 283 TWINLAMVDYPYESNFLQPLPAWPIKVVCKYLNNPSLSDAQLLENIFQALNIYYNYSGQA 342
Query: 362 DCFQLDDDPH---GLDGWNWQV 380
C + + G GW++Q
Sbjct: 343 RCLNISETTTSNLGTQGWSYQA 364
>gi|395814759|ref|XP_003780909.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Otolemur
garnettii]
Length = 495
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 150/319 (47%), Positives = 206/319 (64%), Gaps = 11/319 (3%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
Y YF+Q++DHF F + TF+QRYLI ++W G I Y GNEGDI WF N+GF
Sbjct: 48 YSVLYFQQKIDHFGFNSVKTFNQRYLIANEYWKKDG--GSILFYTGNEGDIVWFCNNTGF 105
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+WD+A AMLVF EHRYYGES+P+G +++++ L+YLT+EQALADFA I +L++
Sbjct: 106 MWDVAQELKAMLVFAEHRYYGESLPFGKN--SFKDSRHLNYLTSEQALADFAELIRHLQE 163
Query: 188 NL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ + PV+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV+
Sbjct: 164 TIPGVKNQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVT 223
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN--STEDLADWLESAY 304
SDFKR C TI++SW + + GL LT+ HLC L + L DW+ +
Sbjct: 224 SDFKRSGPYCSETIRKSWNVINQLSTTSRGLQYLTEVLHLCTPLTPRDVQHLKDWIAETW 283
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSIL-ERIFEGVSVYYNYTGNVDC 363
LAMVDYPYPS F+ PLP +PI+ VC+ + N + S++ + IF+ +++YYNY+G C
Sbjct: 284 VNLAMVDYPYPSSFLQPLPAWPIKVVCQYLKNPNVSDSLMIQNIFQALNIYYNYSGEAQC 343
Query: 364 FQLDDDPH---GLDGWNWQ 379
+ G GW++Q
Sbjct: 344 LNVSTTTTSNLGSLGWSYQ 362
>gi|403287756|ref|XP_003935096.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Saimiri boliviensis
boliviensis]
Length = 496
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 157/370 (42%), Positives = 229/370 (61%), Gaps = 21/370 (5%)
Query: 18 ITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRL 77
+ +++S L+P + A R A R +G L T P + Y YF+Q++
Sbjct: 6 LRFLLLSFLAPWATIA----LRPALRALGSLHLPTNPTSLPAVAKN----YSVLYFQQKV 57
Query: 78 DHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGA 137
DHF F + TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A A
Sbjct: 58 DHFGFNTVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKA 115
Query: 138 MLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASPV 196
MLVF EHRYYGES+P+G + +++++ L++LT+EQALADFA I ++K+ + AE PV
Sbjct: 116 MLVFAEHRYYGESLPFGDS--SFKDSRHLNFLTSEQALADFAELIKHMKRTIPGAENQPV 173
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASC 256
+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV++DF++ C
Sbjct: 174 IAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPHC 233
Query: 257 FNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE--DLADWLESAYSYLAMVDYPY 314
+I+ SW + + +GL L++ HLC L S + L DW+ + LAMVDYPY
Sbjct: 234 SESIRRSWDAINRLSSTGSGLQWLSEALHLCSPLTSEDVRHLKDWISETWVNLAMVDYPY 293
Query: 315 PSDFMMPLPGYPIREVCKKIDNAPDATS--ILERIFEGVSVYYNYTGNVDCFQLDDDPH- 371
S+F+ PLP +PI+ VC+ + +P+ + +L+ IF+ ++VYYNY+G C + +
Sbjct: 294 ASNFLQPLPAWPIKVVCQYL-KSPNVSDLLLLQNIFQALNVYYNYSGQAKCLNISETATG 352
Query: 372 --GLDGWNWQ 379
G GW++Q
Sbjct: 353 SLGTLGWSYQ 362
>gi|195997817|ref|XP_002108777.1| hypothetical protein TRIADDRAFT_19181 [Trichoplax adhaerens]
gi|190589553|gb|EDV29575.1| hypothetical protein TRIADDRAFT_19181 [Trichoplax adhaerens]
Length = 463
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 152/325 (46%), Positives = 211/325 (64%), Gaps = 13/325 (4%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
Q +Y+T+YF+QRLDHFS AD F QRYLI+ +W + PIF Y GNEGDI WFA
Sbjct: 15 QSNVKYQTKYFKQRLDHFSPADDRKFQQRYLISQKYW---KKGSPIFFYTGNEGDITWFA 71
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
N+GF+WDIAP F AML+F EHRYYG+++P+G ++++ L+YL++EQALADFA FI
Sbjct: 72 KNTGFMWDIAPEFNAMLIFVEHRYYGKTLPFGKD--SFKDKEHLAYLSSEQALADFAQFI 129
Query: 183 TNLK-QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
+ K + + S V+ FGGSYGGML AW+R+KYP+I GA+A+SAPI Q E + P + F
Sbjct: 130 VDFKAETHGTQNSSVIAFGGSYGGMLTAWLRIKYPNIVDGAIAASAPIWQLEGLTPCDRF 189
Query: 242 YNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LAD 298
+IV++ FK C I+ SW + +G ++G L+ T LC L S D L +
Sbjct: 190 SSIVTNTFKLAYPECPKNIRNSWKVIRKLGSTKSGRHTLSTTLKLCNPLKSPSDVDALVN 249
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN-APDATSILERIFEGVSVYYNY 357
WL S + LA VDYPYP++F+ PLP P++ VC + + T ++ I +G++VY+N+
Sbjct: 250 WLSSIWVNLAEVDYPYPANFLEPLPAKPVKAVCASLQKPLANDTLLIRGIAKGLNVYFNH 309
Query: 358 TGNVDCFQLDDDPH---GLDGWNWQ 379
TGN CF D D G+ GWN+Q
Sbjct: 310 TGNSKCFNTDQDATSHLGIAGWNFQ 334
>gi|300193275|pdb|3N2Z|B Chain B, The Structure Of Human Prolylcarboxypeptidase At 2.80
Angstroms Resolution
Length = 446
Score = 307 bits (787), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 150/320 (46%), Positives = 207/320 (64%), Gaps = 11/320 (3%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
Y YF+Q++DHF F + TF+QRYL+ +W G I Y GNEGDI WF N+G
Sbjct: 2 NYSVLYFQQKVDHFGFNTVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTG 59
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+WD+A AMLVF EHRYYGES+P+G +++++ L++LT+EQALADFA I +LK
Sbjct: 60 FMWDVAEELKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLK 117
Query: 187 QNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ + AE PV+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV
Sbjct: 118 RTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIV 177
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESA 303
++DF++ C +I SW + + +GL LT HLC L S + L DW+
Sbjct: 178 TTDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISET 237
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVD 362
+ LAMVDYPY S+F+ PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V
Sbjct: 238 WVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVK 297
Query: 363 CFQLDDDPH---GLDGWNWQ 379
C + + G GW++Q
Sbjct: 298 CLNISETATSSLGTLGWSYQ 317
>gi|395521107|ref|XP_003764661.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Sarcophilus harrisii]
Length = 450
Score = 307 bits (787), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 154/312 (49%), Positives = 203/312 (65%), Gaps = 12/312 (3%)
Query: 76 RLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRF 135
++DHF F TF QRYLI +W N G I Y GNEGDI WF N+GF+WD+A
Sbjct: 13 QVDHFGFDVNLTFKQRYLIADQYW--KNNNGVILFYTGNEGDITWFCKNTGFMWDVAEEL 70
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEAS 194
AMLVF EHRYYGES+P+G+ ++ ++ L+YLTAEQALADFAV I LK+ + A+
Sbjct: 71 KAMLVFAEHRYYGESLPFGNQ--SFSDSKHLNYLTAEQALADFAVLIEYLKKTIPGAKNR 128
Query: 195 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESA 254
PV+ GGSYGGMLAAW R+KYPHI IGALA+SAPI QF D+V F+ IV++DFK+ +
Sbjct: 129 PVIAIGGSYGGMLAAWFRMKYPHIVIGALAASAPIWQFTDLVSCGKFFEIVTNDFKKSGS 188
Query: 255 SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESAYSYLAMVD 311
C TI+ SW + + GL +++ FHLC L S D L WL + LAMVD
Sbjct: 189 KCSETIQGSWKAIDRLASTGEGLQWISEAFHLCNPLRSEMDVITLKSWLAETWVNLAMVD 248
Query: 312 YPYPSDFMMPLPGYPIREVCKKIDNAPDATSIL-ERIFEGVSVYYNYTGNVDCF---QLD 367
YPY S+F+MPLP +PI+EVCK + ++ + +L + IF+ V++YYNY+G C Q
Sbjct: 249 YPYSSNFLMPLPAWPIKEVCKYLTDSHASDKVLVQNIFKAVNIYYNYSGEASCLNTSQTA 308
Query: 368 DDPHGLDGWNWQ 379
G+ GWN+Q
Sbjct: 309 TSSLGIQGWNYQ 320
>gi|348565561|ref|XP_003468571.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cavia porcellus]
Length = 494
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 148/319 (46%), Positives = 208/319 (65%), Gaps = 11/319 (3%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
Y YF+Q++DHF F L TF QRYLI+ HW G I Y GNEGDI WF N+GF
Sbjct: 46 YSVLYFQQKVDHFGFNYLQTFKQRYLISAKHWEKDG--GSILFYTGNEGDIVWFCNNTGF 103
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+WD+A AMLVF EHRYYGES+P+G+ +++++ L++LT+EQALADFA I +LK
Sbjct: 104 MWDVAEELKAMLVFAEHRYYGESLPFGAD--SFKDSKRLNFLTSEQALADFAELIRHLKT 161
Query: 188 NL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ A PV+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QFE++VP +TF IV+
Sbjct: 162 TIPGAGDQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFENLVPCDTFMKIVT 221
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELNSTEDLADWLESAY 304
+DF++ +C +I+ SW + + +GL LTKT LC L+ + W+ +
Sbjct: 222 TDFRKSDPNCPKSIRRSWDAINRLATTRSGLQWLTKTLRLCNPLTLDDVQHFKQWILETW 281
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEGVSVYYNYTGNVDC 363
LAM DYPY S+FM PLP +PI+ VC+ + D + +L+ IF+ ++VYYNY+G +C
Sbjct: 282 VNLAMTDYPYESNFMQPLPAWPIKVVCQYLTDPNVSDSQLLQNIFQALNVYYNYSGQAEC 341
Query: 364 FQLDDDPH---GLDGWNWQ 379
+ + G+ GW++Q
Sbjct: 342 LNISETASSSLGISGWSYQ 360
>gi|417401904|gb|JAA47816.1| Putative lysosomal pro-x carboxypeptidase [Desmodus rotundus]
Length = 497
Score = 307 bits (786), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 150/322 (46%), Positives = 208/322 (64%), Gaps = 12/322 (3%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+Y Y EQ++DHF F TF QRYLI HW G I Y GNEGDI WF N+G
Sbjct: 47 KYSIHYTEQKVDHFGFKTDKTFKQRYLIADQHWKKDG--GSILFYTGNEGDIIWFCNNTG 104
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+WD+A AMLVF EHRYYG+S+P+G+ +++++ L++LT+EQALADF I +LK
Sbjct: 105 FMWDVAEELKAMLVFAEHRYYGKSLPFGAN--SFKDSRHLNFLTSEQALADFGELIRHLK 162
Query: 187 QNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ + AE PV+ GGSYGGMLAAW R+KYPH+ GALA+SAPI QFEDIVP F IV
Sbjct: 163 RTIPGAENQPVIAVGGSYGGMLAAWFRMKYPHMVAGALAASAPIWQFEDIVPCGVFMEIV 222
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
++DF++ +C +I+ SW + + + GL L++ HLC L +++D L DW+
Sbjct: 223 TTDFRKSGPNCSESIRRSWDAINRLTRTGTGLHWLSEALHLCTPLTNSQDVQHLKDWISE 282
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDA-TSILERIFEGVSVYYNYTGNV 361
+ LAMVDYPY SDF+ PLP +PI+ VC+ + N + T +L+ IF+ ++VYYNY+G
Sbjct: 283 TWVNLAMVDYPYESDFLQPLPAWPIKVVCQYLKNPHVSDTQLLQNIFQALNVYYNYSGQA 342
Query: 362 DCFQLDDDPH---GLDGWNWQV 380
C + + G GW++Q
Sbjct: 343 RCLNMSETATGSLGTLGWSYQA 364
>gi|348525386|ref|XP_003450203.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Oreochromis
niloticus]
Length = 502
Score = 306 bits (785), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 160/345 (46%), Positives = 215/345 (62%), Gaps = 23/345 (6%)
Query: 45 VGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNR 104
+G LP+ TEPP Y+T YF+Q++DHF F + TF QRYL++ +W P
Sbjct: 37 LGGLPYSTEPP----------VSYKTFYFDQKIDHFGFLEDGTFKQRYLLSDKYWQQPG- 85
Query: 105 LGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNAT 164
GPI Y GNEGDI WF N+GF+W+IA AMLVF EHRYYGES+P+G + +Y+++
Sbjct: 86 -GPILFYTGNEGDITWFCNNTGFMWEIAEELDAMLVFAEHRYYGESLPFG--QDSYRDSK 142
Query: 165 TLSYLTAEQALADFAVFITNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGAL 223
L+YLT+EQALADFAV I NLK L A+ SPV+ GGSYGGML+AW R+KYPH+ +GAL
Sbjct: 143 HLNYLTSEQALADFAVLIQNLKGTLPGAQHSPVIAVGGSYGGMLSAWFRMKYPHVVVGAL 202
Query: 224 ASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKT 283
ASSAPI QF +VP FY V+ DF + +C I++SW + +V +GL L++
Sbjct: 203 ASSAPIWQFPGMVPCGDFYKTVTQDFAKSGINCDANIRKSWKAVNNVSSSASGLQWLSEE 262
Query: 284 FHLCRELNSTEDL---ADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCK--KIDNAP 338
F LC L + D+ WL+ + LAMVDYPY ++F+ PLP +PI+ VCK D+
Sbjct: 263 FSLCAPLKNKNDVLTFKSWLQETWVNLAMVDYPYEANFLQPLPRWPIQVVCKYLSFDSTV 322
Query: 339 DATSILERIFEGVSVYYNYTGNVDCF---QLDDDPHGLDGWNWQV 380
+L+ + + VYYNYTG+ C Q G GW +Q
Sbjct: 323 SDYQLLQGVAQAAKVYYNYTGSSPCLNTSQTATSSLGYLGWFYQA 367
>gi|354489833|ref|XP_003507065.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cricetulus
griseus]
gi|344240433|gb|EGV96536.1| Lysosomal Pro-X carboxypeptidase [Cricetulus griseus]
Length = 492
Score = 306 bits (784), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 153/321 (47%), Positives = 203/321 (63%), Gaps = 13/321 (4%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
Y YF+Q++DHF F + TF QRYLI HW G I Y GNEGDI WF N+G
Sbjct: 45 NYSVLYFQQKVDHFGFYNSRTFKQRYLIADKHWKANG--GTILFYTGNEGDIVWFCNNTG 102
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+WD+A AMLVF EHRYYGES+P+G E ++++A L +LT+EQALADFA I +LK
Sbjct: 103 FMWDVAKELKAMLVFAEHRYYGESLPFG--EESFKDAQHLDFLTSEQALADFAELIKHLK 160
Query: 187 QNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ + A+ PV+ GGSYGGMLAAW R+KYPH+ +GALASSAPI QF+D+VP F IV
Sbjct: 161 RTVPGAKNQPVIAIGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFDDLVPCGVFMKIV 220
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN--STEDLADWLESA 303
+ DF++ S C +I+ SW + + L LTKTFHLC LN L +WL
Sbjct: 221 TEDFRKSSPQCSESIRRSWKAINRHSTTGSDLQWLTKTFHLCSPLNFEDITTLKEWLSET 280
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDA--TSILERIFEGVSVYYNYTGNV 361
+ LAMVDYPY + F+ PLP +P+ VC+ + N P+ T +L+ IF+ +++YYNY+G
Sbjct: 281 WVNLAMVDYPYEASFLQPLPAWPVTVVCQYLKN-PNVSDTVLLQNIFQALNIYYNYSGQA 339
Query: 362 DCFQLDDDPH---GLDGWNWQ 379
C + G GW +Q
Sbjct: 340 QCLNISQTATSSLGATGWGYQ 360
>gi|426251523|ref|XP_004019471.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 1 [Ovis aries]
Length = 497
Score = 306 bits (784), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 152/323 (47%), Positives = 207/323 (64%), Gaps = 14/323 (4%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+Y RY +Q++DHF F TF QRYLI ++W G I Y GNEGDI WF N+G
Sbjct: 47 KYSIRYIQQKVDHFGFNIDRTFKQRYLIADNYWKEDG--GSILFYTGNEGDIIWFCNNTG 104
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+WDIA AMLVF EHRYYGES+P+G+ ++ ++ L++LT EQALADFA I LK
Sbjct: 105 FMWDIAEEMKAMLVFAEHRYYGESLPFGAD--SFSDSRHLNFLTTEQALADFAKLIKYLK 162
Query: 187 QNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ + A PV+ GGSYGGMLAAW R+KYPH+ +GALASSAPI QF D+VP + F IV
Sbjct: 163 RTIPGARNQPVIALGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFNDLVPCDVFMKIV 222
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
++DF + +C +I+ SW + + +K GL L++ HLC L +D L DW+
Sbjct: 223 TTDFSQSGPNCSESIRRSWDAINRLAKKGPGLRWLSEALHLCTPLTKLQDVQHLKDWIAE 282
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCK--KIDNAPDATSILERIFEGVSVYYNYTGN 360
+ LAMVDYPY S+F+ PLP +P++ VC+ K N PD T +L IF+ ++VYYNY+G
Sbjct: 283 TWVNLAMVDYPYESNFLQPLPAWPVKVVCQYFKYSNVPD-TLLLRNIFQALNVYYNYSGQ 341
Query: 361 VDCFQLDDDPH---GLDGWNWQV 380
C + + G+ GW++Q
Sbjct: 342 AKCLNVSETATSSLGVQGWSYQA 364
>gi|350588336|ref|XP_003129755.3| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Sus scrofa]
Length = 493
Score = 306 bits (783), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 149/322 (46%), Positives = 206/322 (63%), Gaps = 12/322 (3%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+Y Y +Q++DHF F TF QRYLI HW G I Y GNEGDI WF N+G
Sbjct: 43 KYSIHYIQQKVDHFGFNTDKTFKQRYLIADTHWRKDG--GSILFYTGNEGDIIWFCNNTG 100
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+WD+A AMLVF EHRYYGES+P+G+ +++++ L++LT+EQALADFA I +LK
Sbjct: 101 FMWDVAEELKAMLVFAEHRYYGESLPFGAH--SFKDSRHLNFLTSEQALADFAELIRHLK 158
Query: 187 QNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ + E PV+ GGSYGGMLAAW R+KYPH+ +GALASSAPI FE++VP F V
Sbjct: 159 RKIPGTENQPVIALGGSYGGMLAAWFRMKYPHMVVGALASSAPIWHFENLVPCGVFMKTV 218
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
+ DF+ +C TI+ SW + + +K GL L++ HLC L +++D L DW+
Sbjct: 219 TKDFRESGPNCSETIRRSWDAINRLARKGTGLHWLSEALHLCTPLTNSQDVQRLKDWISE 278
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSIL-ERIFEGVSVYYNYTGNV 361
+ LAMVDYPY SDF+ PLP +PI+ VC+ + N + S+L + IF+ ++VYYNY+G
Sbjct: 279 TWVNLAMVDYPYESDFLQPLPAWPIKVVCQYLKNPNVSDSLLVQNIFQALNVYYNYSGQA 338
Query: 362 DCFQLDDDPH---GLDGWNWQV 380
C + + G GW++Q
Sbjct: 339 RCLNISETATSSLGALGWSYQA 360
>gi|32967631|gb|AAH55022.1| Prcp protein [Mus musculus]
gi|74147590|dbj|BAE38679.1| unnamed protein product [Mus musculus]
Length = 491
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 214/340 (62%), Gaps = 16/340 (4%)
Query: 49 PHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPI 108
PHL+ P + +Y YFEQ++DHF FAD+ TF QRYL+ HW G I
Sbjct: 30 PHLSASPTPDPAVAR---KYSVLYFEQKVDHFGFADMRTFKQRYLVADKHW--QRNGGSI 84
Query: 109 FLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSY 168
Y GNEGDI WF N+GF+WD+A AMLVF EHRYYGES+P+G + +++++ L++
Sbjct: 85 LFYTGNEGDIVWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFG--QDSFKDSQHLNF 142
Query: 169 LTAEQALADFAVFITNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
LT+EQALADFA I +L++ + A+ PV+ GGSYGGMLAAW R+KYPHI +GALA+SA
Sbjct: 143 LTSEQALADFAELIRHLEKTIPGAQGQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASA 202
Query: 228 PILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC 287
PI Q + +VP F IV++DF++ C +I++SW + + +GL LT HLC
Sbjct: 203 PIWQLDGMVPCGEFMKIVTNDFRKSGPYCSESIRKSWNVIDKLSGSGSGLQSLTNILHLC 262
Query: 288 RELNSTE--DLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDA--TSI 343
L S + L W+ + LAMV+YPY +F+ PLP +PI+EVC+ + N P+ T +
Sbjct: 263 SPLTSEKIPTLKGWIAETWVNLAMVNYPYACNFLQPLPAWPIKEVCQYLKN-PNVSDTVL 321
Query: 344 LERIFEGVSVYYNYTGNVDCF---QLDDDPHGLDGWNWQV 380
L+ IF+ +SVYYNY+G C Q G GW++Q
Sbjct: 322 LQNIFQALSVYYNYSGQAACLNISQTTTSSLGSMGWSFQA 361
>gi|33469015|ref|NP_082519.1| lysosomal Pro-X carboxypeptidase precursor [Mus musculus]
gi|68565642|sp|Q7TMR0.2|PCP_MOUSE RecName: Full=Lysosomal Pro-X carboxypeptidase; AltName:
Full=Proline carboxypeptidase; AltName:
Full=Prolylcarboxypeptidase; Short=PRCP; Flags:
Precursor
gi|26342108|dbj|BAC34716.1| unnamed protein product [Mus musculus]
gi|74193542|dbj|BAE20697.1| unnamed protein product [Mus musculus]
Length = 491
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 214/340 (62%), Gaps = 16/340 (4%)
Query: 49 PHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPI 108
PHL+ P + +Y YFEQ++DHF FAD+ TF QRYL+ HW G I
Sbjct: 30 PHLSASPTPDPAVAR---KYSVLYFEQKVDHFGFADMRTFKQRYLVADKHW--QRNGGSI 84
Query: 109 FLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSY 168
Y GNEGDI WF N+GF+WD+A AMLVF EHRYYGES+P+G + +++++ L++
Sbjct: 85 LFYTGNEGDIVWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFG--QDSFKDSQHLNF 142
Query: 169 LTAEQALADFAVFITNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
LT+EQALADFA I +L++ + A+ PV+ GGSYGGMLAAW R+KYPHI +GALA+SA
Sbjct: 143 LTSEQALADFAELIRHLEKTIPGAQGQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASA 202
Query: 228 PILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC 287
PI Q + +VP F IV++DF++ C +I++SW + + +GL LT HLC
Sbjct: 203 PIWQLDGMVPCGEFMKIVTNDFRKSGPYCSESIRKSWNVIDKLSGSGSGLQSLTNILHLC 262
Query: 288 RELNSTE--DLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDA--TSI 343
L S + L W+ + LAMV+YPY +F+ PLP +PI+EVC+ + N P+ T +
Sbjct: 263 SPLTSEKIPTLKGWIAETWVNLAMVNYPYACNFLQPLPAWPIKEVCQYLKN-PNVSDTVL 321
Query: 344 LERIFEGVSVYYNYTGNVDCF---QLDDDPHGLDGWNWQV 380
L+ IF+ +SVYYNY+G C Q G GW++Q
Sbjct: 322 LQNIFQALSVYYNYSGQAACLNISQTTTSSLGSMGWSFQA 361
>gi|344293754|ref|XP_003418585.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Loxodonta africana]
Length = 503
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 155/348 (44%), Positives = 216/348 (62%), Gaps = 16/348 (4%)
Query: 41 APRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWV 100
A R++G+L T P R + Y YF+Q++DHF F TF QRYLI +W
Sbjct: 27 ALRYLGRLHLPTRPTSRPSVARN----YSILYFKQKVDHFGFDINKTFKQRYLIADKYWK 82
Query: 101 GPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAY 160
G I Y GNEGDI WF N+GF+WD+A AMLVF EHRYYGES+P+G+ ++
Sbjct: 83 KDG--GSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGAN--SF 138
Query: 161 QNATTLSYLTAEQALADFAVFITNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIA 219
+++ L++LT+EQALADFA I +LKQ + AE PV+ GGSYGGMLAAW R+KYPH+
Sbjct: 139 KDSRHLNFLTSEQALADFAKLIKHLKQTIPGAENQPVIAVGGSYGGMLAAWFRMKYPHLV 198
Query: 220 IGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLE 279
+GALA+SAPI QFED+VP F IV+ DFK+ +C +I+ SW + + +GL
Sbjct: 199 VGALAASAPIWQFEDLVPCGIFMEIVTRDFKKSGPNCSESIRRSWNVINRLATTSSGLQW 258
Query: 280 LTKTFHLCRELNSTED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
L++ HLC L +D L DW+ + LAMVDYPY +F+ PLP +PI+ VC+ +
Sbjct: 259 LSEALHLCSPLTDFQDFRMLKDWISETWVNLAMVDYPYECNFLQPLPAWPIKVVCQYFKD 318
Query: 337 APDATSIL-ERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGWNWQV 380
+ +L + +F+ ++VYYNY+G V C + + G GW++Q
Sbjct: 319 PKVSDQLLVQNLFQALNVYYNYSGQVKCLNISETATSSLGSLGWSYQA 366
>gi|296216949|ref|XP_002754795.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Callithrix
jacchus]
Length = 496
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 155/351 (44%), Positives = 218/351 (62%), Gaps = 17/351 (4%)
Query: 38 FRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTD 97
R A R +G L T P + Y YF+Q++DHF F + TF+QRYL+
Sbjct: 22 LRPALRALGSLHLPTNPTSLPPLAKN----YSVLYFQQKVDHFGFNTVKTFNQRYLVADK 77
Query: 98 HWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE 157
+W G I Y GNEGDI WF N+GF+WD+A AMLVF EHRYYGES+P+G +
Sbjct: 78 YW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFGDS- 134
Query: 158 VAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYP 216
+++++ L++LT+EQALADFA I +LK+ + AE PV+ GGSYGGMLAAW R+KYP
Sbjct: 135 -SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYP 193
Query: 217 HIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENG 276
H+ +GALA+SAPI QFED+VP F IV++DF++ C +I SW + + +G
Sbjct: 194 HMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPHCSESIHRSWDAINRLSSTGSG 253
Query: 277 LLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI 334
L L++ HLC L S + L DW+ + LAMVDYPY S+F+ PLP +PI+ VC+ +
Sbjct: 254 LQWLSEALHLCSPLTSEDIQHLKDWISETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYL 313
Query: 335 DNAPDATS--ILERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGWNWQV 380
N P+ + +L+ IF+ ++VYYNY+G C + + G GW++Q
Sbjct: 314 KN-PNVSDLLLLQNIFQALNVYYNYSGQAKCLNISETATGSLGTLGWSYQA 363
>gi|296471920|tpg|DAA14035.1| TPA: prolylcarboxypeptidase precursor [Bos taurus]
Length = 499
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 150/322 (46%), Positives = 209/322 (64%), Gaps = 14/322 (4%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+Y RY +Q++DHF F TF QRYLI ++W G I Y GNEGDI WF N+G
Sbjct: 49 KYSIRYIQQKVDHFGFNIDRTFKQRYLIADNYWKEDG--GSILFYTGNEGDIIWFCNNTG 106
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+WDIA AMLVF EHRYYGES+P+G+ ++ ++ L++LT EQALADFA I LK
Sbjct: 107 FMWDIAEEMKAMLVFAEHRYYGESLPFGAD--SFSDSRHLNFLTTEQALADFAKLIRYLK 164
Query: 187 QNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ + A PV+ GGSYGGMLAAW R+KYPH+ +GALASSAPI QF D+VP + F IV
Sbjct: 165 RTIPGARNQPVIALGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFNDLVPCDIFMKIV 224
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
++DF + +C +I+ SW + + +K GL L++ HLC L ++D L DW+
Sbjct: 225 TTDFSQSGPNCSESIRRSWDAINRLAKKGPGLRWLSEALHLCTPLTKSQDVQRLKDWISE 284
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCK--KIDNAPDATSILERIFEGVSVYYNYTGN 360
+ +AMVDYPY S+F+ PLP +P++ VC+ K N PD T +++ IF+ ++VYYNY+G
Sbjct: 285 TWVNVAMVDYPYESNFLQPLPAWPVKVVCQYFKYSNVPD-TVMVQNIFQALNVYYNYSGQ 343
Query: 361 VDCFQLDDDPH---GLDGWNWQ 379
C + + G+ GW++Q
Sbjct: 344 AKCLNVSETATSSLGVLGWSYQ 365
>gi|242096526|ref|XP_002438753.1| hypothetical protein SORBIDRAFT_10g025550 [Sorghum bicolor]
gi|241916976|gb|EER90120.1| hypothetical protein SORBIDRAFT_10g025550 [Sorghum bicolor]
Length = 569
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 157/339 (46%), Positives = 208/339 (61%), Gaps = 39/339 (11%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHWVGPNR------LGPIFLYCGNEGDIE 119
+ T YF Q LDHF+F + F Q+YL+N W P+ GP+F+Y GNEGDIE
Sbjct: 76 FTTHYFPQELDHFTFTPNASMVFRQKYLVNDTFWRRPSGGNGTAGAGPLFVYTGNEGDIE 135
Query: 120 WFAVNSGFVWDIAPRFGAMLVF---------------------------PEHRYYGESMP 152
WFA N+GF++DIAP+FGA+LVF PE
Sbjct: 136 WFATNTGFMFDIAPKFGALLVFIECLNAENMIQSKIFSFCFLKSRGPRCPEPGPRPPLAL 195
Query: 153 YGSTEVAYQNATTLSYL----TAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLA 208
++A L L A ALADFA+ IT+LKQNLSA+ +PVV+FGGSYGGMLA
Sbjct: 196 PLPLARQARSAAGLRGLPTAGRAAVALADFAILITSLKQNLSAKTAPVVVFGGSYGGMLA 255
Query: 209 AWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELV 268
+W RLKYPH+AIGALASSAPILQF+ I P +F + +S D+K ES +CF+ IK +W L
Sbjct: 256 SWFRLKYPHVAIGALASSAPILQFDYITPWSSFSDAISQDYKSESLNCFSVIKATWDVLD 315
Query: 269 SVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIR 328
G + GLLEL+K F C+ L + +A WL++A+ Y AMVDYP P++F+ LP YP++
Sbjct: 316 ERGANDRGLLELSKLFRACKTLKYADSIAGWLQTAFVYTAMVDYPTPANFLKNLPAYPVK 375
Query: 329 EVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLD 367
E+CK ID P ILE++F S+YYN+TG+ C Q++
Sbjct: 376 EMCKIIDGFPANADILEKVFAAASLYYNFTGDQTCNQIE 414
>gi|20072291|gb|AAH26424.1| Prcp protein, partial [Mus musculus]
Length = 451
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 150/321 (46%), Positives = 208/321 (64%), Gaps = 13/321 (4%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+Y YFEQ++DHF FAD+ TF QRYL+ HW G I Y GNEGDI WF N+G
Sbjct: 5 KYSVLYFEQKVDHFGFADMRTFKQRYLVADKHW--QRNGGSILFYTGNEGDIVWFCNNTG 62
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+WD+A AMLVF EHRYYGES+P+G + +++++ L++LT+EQALADFA I +L+
Sbjct: 63 FMWDVAEELKAMLVFAEHRYYGESLPFG--QDSFKDSQHLNFLTSEQALADFAELIRHLE 120
Query: 187 QNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ + A+ PV+ GGSYGGMLAAW R+KYPHI +GALA+SAPI Q + +VP F IV
Sbjct: 121 KTIPGAQGQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWQLDGMVPCGEFMKIV 180
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE--DLADWLESA 303
++DF++ C +I++SW + + +GL LT HLC L S + L W+
Sbjct: 181 TNDFRKSGPYCSESIRKSWNVIDKLSGSGSGLQSLTNILHLCSPLTSEKIPTLKGWIAET 240
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDA--TSILERIFEGVSVYYNYTGNV 361
+ LAMV+YPY +F+ PLP +PI+EVC+ + N P+ T +L+ IF+ +SVYYNY+G
Sbjct: 241 WVNLAMVNYPYACNFLQPLPAWPIKEVCQYLKN-PNVSDTVLLQNIFQALSVYYNYSGQA 299
Query: 362 DCF---QLDDDPHGLDGWNWQ 379
C Q G GW++Q
Sbjct: 300 ACLNISQTTTSSLGSMGWSFQ 320
>gi|225709542|gb|ACO10617.1| Lysosomal Pro-X carboxypeptidase precursor [Caligus rogercresseyi]
Length = 492
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 152/325 (46%), Positives = 208/325 (64%), Gaps = 9/325 (2%)
Query: 60 RQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE 119
+ Q Y Y+T ++E +DHF FA TF QRYLIN HW GPIFLY GNEGD E
Sbjct: 33 EELQDYYTYKTEWYENLVDHFGFAINSTFKQRYLINDTHW-DSQHGGPIFLYAGNEGDSE 91
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
FA N+GF+WDIAP F A+++F EHRYYG+S+P+G + + YLT+EQALAD+A
Sbjct: 92 AFAQNTGFMWDIAPEFNALIIFIEHRYYGKSLPFGKDSLK-PDPKMNGYLTSEQALADYA 150
Query: 180 VFITNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP 238
F+T K A+ SPV++FGGSYGGMLAAWMR+KYPHI GA+A SAP+ QF+ P
Sbjct: 151 RFVTEFKSTRKGAKDSPVIVFGGSYGGMLAAWMRIKYPHIVNGAIAGSAPVAQFD--TPC 208
Query: 239 ETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD 298
F IV+SD+ S SC I SW + VG+ + GL L LC + S E L
Sbjct: 209 LNFGRIVTSDYSFYSKSCSGVISMSWAAIDQVGKDDKGLQRLQSLLKLCSKPKSVEPLKS 268
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNA---PDATSILERIFEGVSVYY 355
+L ++ +AM++YPYP++F+MPLPG P++ +C+K+ D +IL+ ++EG++VY
Sbjct: 269 FLTDVWTNVAMMNYPYPTEFLMPLPGNPVKYICRKMSPTTVPTDPVAILKYVYEGLNVYA 328
Query: 356 NYTGNVDCFQLDD-DPHGLDGWNWQ 379
NY+G C + + D G D W++Q
Sbjct: 329 NYSGKAKCIDMGNADQIGADMWDYQ 353
>gi|356570877|ref|XP_003553610.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Glycine max]
Length = 349
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 156/345 (45%), Positives = 210/345 (60%), Gaps = 20/345 (5%)
Query: 42 PRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSF--ADLPTFSQRYLINTDHW 99
PRF + H P + Q Y ++F Q LDHF++ TF QRYLIN +W
Sbjct: 12 PRFPSSVVH---PAIDVTSRSAQNGVYTKKFFTQILDHFNYNPQSYQTFQQRYLINDTYW 68
Query: 100 VGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPE--HRYYGESMPYGSTE 157
G PIF Y GNEGDIEWFA N GF+++ AP F A+LVF E HRYYG+S P+G E
Sbjct: 69 GGDKSNAPIFFYTGNEGDIEWFAQNPGFMFETAPYFKALLVFIEEXHRYYGKSFPFGGNE 128
Query: 158 V-AYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYP 216
A N++TL YL++ + I +LK+NLSA SPVV+FGGSYGG++ AW R+KYP
Sbjct: 129 EDANANSSTLGYLSS-------TLLIIDLKKNLSATYSPVVVFGGSYGGIILAWFRMKYP 181
Query: 217 HIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENG 276
H+AIGALASSAPILQF D+V P T+ +I++ D+K ES +C+ IK SW ++ +K G
Sbjct: 182 HVAIGALASSAPILQFLDLVSPNTYTDIITQDYKSESENCYKVIKGSWKQIEDTARKPGG 241
Query: 277 LLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
L +L K+F +C+ S L WL+ A AM+DYP PS+F+ LP P+R+V IDN
Sbjct: 242 LEQLWKSFRICKNYISAGALVSWLQMALGIAAMIDYPAPSNFLAHLPASPVRKVMALIDN 301
Query: 337 APDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLD-----GW 376
+++ S++YNY+G CF LD+ L+ GW
Sbjct: 302 LSVGNEAFTKLYAAASIFYNYSGTAICFGLDNTTDTLERISKAGW 346
>gi|340374641|ref|XP_003385846.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Amphimedon
queenslandica]
Length = 490
Score = 303 bits (776), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 152/345 (44%), Positives = 209/345 (60%), Gaps = 21/345 (6%)
Query: 42 PRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVG 101
PR + LP + + Y Y+T YF+Q +DHF+F TFSQRYL+N W
Sbjct: 25 PRSLKNLP---------RGNSESSYEYQTLYFKQPIDHFNFESNVTFSQRYLLNDAFWDK 75
Query: 102 PNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQ 161
N GPIF YCGNEGDI WFA N+GFVWDIAP F A++VF EHRYYG ++P+G+ +Y
Sbjct: 76 DNG-GPIFFYCGNEGDITWFANNTGFVWDIAPEFKALVVFAEHRYYGNTLPFGAE--SYA 132
Query: 162 NATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIG 221
N +TL YLT+EQALADF + I +LK PVV FGGSYGGML+AW+R+KYP + +G
Sbjct: 133 NLSTLGYLTSEQALADFVLLINDLKGKYGD--VPVVAFGGSYGGMLSAWIRMKYPSVVVG 190
Query: 222 ALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELT 281
++A+SAPI QF + I+SS + S +C+N + SW + G GL L+
Sbjct: 191 SIAASAPIWQFPGLCDCGKANEIISSTMSQYSTNCYNNVLSSWDIINKTGTTSGGLSLLS 250
Query: 282 KTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNA---- 337
TF LC+ L + L WL+ + LAM++YPYP++FM PLP +P+ C+K+
Sbjct: 251 TTFSLCQPLKDSLTLMSWLQDVWFNLAMMNYPYPANFMEPLPAWPLNVTCQKLSQTFTTE 310
Query: 338 PDATSILERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGWNWQ 379
D +L+ + + VYYNY+G C L+ + G GW++Q
Sbjct: 311 DDDIKLLQALSTAMQVYYNYSGQSSCLDLNKESSTDLGAKGWSYQ 355
>gi|84000301|ref|NP_001033253.1| lysosomal Pro-X carboxypeptidase precursor [Bos taurus]
gi|122137021|sp|Q2TA14.1|PCP_BOVIN RecName: Full=Lysosomal Pro-X carboxypeptidase; AltName:
Full=Proline carboxypeptidase; AltName:
Full=Prolylcarboxypeptidase; Short=PRCP; Flags:
Precursor
gi|83405390|gb|AAI11172.1| Prolylcarboxypeptidase (angiotensinase C) [Bos taurus]
Length = 499
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 149/322 (46%), Positives = 208/322 (64%), Gaps = 14/322 (4%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+Y RY +Q++DHF F TF QRYLI ++W G I Y GNEGDI WF N+G
Sbjct: 49 KYSIRYIQQKVDHFGFNIDRTFKQRYLIADNYWKEDG--GSILFYTGNEGDIIWFCNNTG 106
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+WDIA AMLVF EHRYYGES+P+G+ ++ ++ L++LT EQALADFA I LK
Sbjct: 107 FMWDIAEEMKAMLVFAEHRYYGESLPFGAD--SFSDSRHLNFLTTEQALADFAKLIRYLK 164
Query: 187 QNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ + A V+ GGSYGGMLAAW R+KYPH+ +GALASSAPI QF D+VP + F IV
Sbjct: 165 RTIPGARNQHVIALGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFNDLVPCDIFMKIV 224
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
++DF + +C +I+ SW + + +K GL L++ HLC L ++D L DW+
Sbjct: 225 TTDFSQSGPNCSESIRRSWDAINRLAKKGTGLRWLSEALHLCTPLTKSQDVQRLKDWISE 284
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCK--KIDNAPDATSILERIFEGVSVYYNYTGN 360
+ +AMVDYPY S+F+ PLP +P++ VC+ K N PD T +++ IF+ ++VYYNY+G
Sbjct: 285 TWVNVAMVDYPYESNFLQPLPAWPVKVVCQYFKYSNVPD-TVMVQNIFQALNVYYNYSGQ 343
Query: 361 VDCFQLDDDPH---GLDGWNWQ 379
C + + G+ GW++Q
Sbjct: 344 AKCLNVSETATSSLGVLGWSYQ 365
>gi|73988045|ref|XP_533994.2| PREDICTED: lysosomal Pro-X carboxypeptidase [Canis lupus
familiaris]
Length = 497
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 150/323 (46%), Positives = 209/323 (64%), Gaps = 14/323 (4%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+Y Y Q++DHF FA TF QRYLI +HW G I Y GNEGDI WF N+G
Sbjct: 47 KYSVHYILQKVDHFGFAVDKTFKQRYLIADEHWKKDG--GSILFYTGNEGDIIWFCNNTG 104
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+WD+A AMLVF EHRYYGES+P+G+ +++++ L+YLT+EQALADFA+ I +LK
Sbjct: 105 FMWDVAEEMKAMLVFAEHRYYGESLPFGNN--SFKDSRHLNYLTSEQALADFAMLIKHLK 162
Query: 187 QNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ + A+ PV+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QF ++V F IV
Sbjct: 163 RTIPGAKNQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFGNLVSCGVFMEIV 222
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
++DFK+ +C +I+ SW + + GL L++ HLC L +T+D L W+
Sbjct: 223 TTDFKKSGPNCSESIRRSWDAINRYSRTGAGLGWLSEALHLCTPLTNTQDVQHLKIWISE 282
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID--NAPDATSILERIFEGVSVYYNYTGN 360
+ +AMVDYPY SDF+ PLP +PI+ VC+ + N PDA +L+ IF+ ++VYYNY+G
Sbjct: 283 TWINMAMVDYPYESDFLQPLPPWPIKVVCQYLRNPNVPDA-QLLQNIFQALNVYYNYSGQ 341
Query: 361 VDCFQLDDDPH---GLDGWNWQV 380
C + + G GW++Q
Sbjct: 342 ASCLNVSETTTSSLGTQGWSYQA 364
>gi|328778095|ref|XP_623670.3| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Apis mellifera]
Length = 492
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 153/328 (46%), Positives = 208/328 (63%), Gaps = 10/328 (3%)
Query: 58 QQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGD 117
Q +YRYE + + +DHFSF+ L TF RYLIN W N PIF Y GNEG+
Sbjct: 38 QNELHSAKYRYEIKTIDMPVDHFSFSVLNTFKLRYLINGT-WQKTNN-APIFFYTGNEGN 95
Query: 118 IEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALAD 177
IE FA N+GF+WDIAP FGA+LVF EHRYYGESMPY + +Y + L YLT++QALAD
Sbjct: 96 IETFAQNTGFMWDIAPEFGALLVFAEHRYYGESMPYNNK--SYADLNHLGYLTSQQALAD 153
Query: 178 FAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP 237
+ I LK + SP+++FGGSYGGML+AW+R+KYPHI GA+ASSAPILQF I
Sbjct: 154 YVDLIQYLKSKPKYKNSPIIVFGGSYGGMLSAWIRIKYPHIVQGAIASSAPILQFTGITE 213
Query: 238 PETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL---NSTE 294
E+F IV+SDFK+ ++C I++SW ++++ G L+ + LC+ L N E
Sbjct: 214 CESFLRIVTSDFKKAHSNCPKLIRKSWNIIMNITSTNEGKKWLSDNWKLCQPLKNENDIE 273
Query: 295 DLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN-APDATSILERIFEGVSV 353
L +L+ Y+ LAMV+YPY ++F+ PLP YPI VCK + N + +L I +++
Sbjct: 274 QLISYLQDIYTNLAMVNYPYKANFLAPLPAYPINAVCKHLTNESLTGIELLIAIKNAINI 333
Query: 354 YYNYTGNVDCFQLDDDPHGLD--GWNWQ 379
+ NYT C L++ LD GW++Q
Sbjct: 334 FTNYTSETKCLNLNNSTPQLDAIGWSFQ 361
>gi|354489839|ref|XP_003507068.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cricetulus
griseus]
Length = 494
Score = 301 bits (770), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 147/327 (44%), Positives = 215/327 (65%), Gaps = 11/327 (3%)
Query: 61 QQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW 120
Q + Y YFEQ++DHF F + TF QRYLI +W + G I Y GNEGDI W
Sbjct: 39 QPDEANNYSVFYFEQKVDHFGFYNTKTFKQRYLIADRYWKTYD--GVILFYTGNEGDITW 96
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
F+ ++GF+WD+A + A+LVF EHRYYGES+P+G+ +++++ L++LT+EQALADFA
Sbjct: 97 FSNHTGFMWDVAEKLKALLVFAEHRYYGESLPFGAE--SFKDSKHLNFLTSEQALADFAE 154
Query: 181 FITNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPE 239
I +L++ + A++ PV+ GGSYGG+LAAW R+KYPH+ IGALA+SAPI QFE+++P
Sbjct: 155 LIRHLRRTVPGADSQPVIAIGGSYGGVLAAWFRMKYPHLVIGALAASAPIWQFEELIPCG 214
Query: 240 TFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLA 297
F IV+ DF++ C +I SWG + + +GL LTK FHLC LNS + L
Sbjct: 215 MFMKIVTEDFRKSGKKCSESILRSWGAINRLSNTGSGLPWLTKAFHLCSSLNSEDVQRLK 274
Query: 298 DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSIL-ERIFEGVSVYYN 356
DW+ + LAMV+YPYP++F+ PLP +P++ VC+ + N + S+L + I++ ++VYYN
Sbjct: 275 DWISETWVNLAMVNYPYPANFLKPLPSWPLKVVCQYLTNPFMSDSLLVQNIYQALNVYYN 334
Query: 357 YTGNVDCFQLDDDPHGLDG---WNWQV 380
Y+G C + + G G W +Q
Sbjct: 335 YSGQSRCNNISETTIGSLGSRVWGYQT 361
>gi|431838478|gb|ELK00410.1| Lysosomal Pro-X carboxypeptidase [Pteropus alecto]
Length = 497
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 209/321 (65%), Gaps = 12/321 (3%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
Y Y +Q++DHF F TF+QRYLI HW G I Y GNEGDI WF N+GF
Sbjct: 48 YSIHYIQQKVDHFGFNTDKTFNQRYLIADKHWKKDG--GSILFYTGNEGDIIWFCNNTGF 105
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+WD+A AMLVF EHRYYG+S+P+GS+ +++++ L++LT+EQALADFA IT+L++
Sbjct: 106 MWDVAEELKAMLVFAEHRYYGKSLPFGSS--SFKDSRHLNFLTSEQALADFAELITHLRK 163
Query: 188 NL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ A+ PV+ GGSYGGMLAAW R+KYPH+ GALA+SAPI QF++IVP F IV+
Sbjct: 164 TIPGAKNQPVIALGGSYGGMLAAWFRMKYPHMVAGALAASAPIWQFDNIVPCGVFMEIVT 223
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESA 303
+D+++ +C I+ SW + + + GL L++T HLC L T+D L DW+
Sbjct: 224 TDYRKSGPNCAECIRRSWDAINRIAENGTGLPWLSETLHLCTPLKHTQDIQNLKDWISET 283
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEGVSVYYNYTGNVD 362
+ LA+VDYPY S+F+ PLP +P++ VC+ + D + +L+ IF+ +++YYNYTG
Sbjct: 284 WINLAVVDYPYESNFLQPLPAWPVKVVCQYLKDPSVSDPQLLQNIFQALNIYYNYTGQAR 343
Query: 363 CFQLDDDPH---GLDGWNWQV 380
C + + G GW++Q
Sbjct: 344 CLNMSETSTSSLGSLGWSYQA 364
>gi|380027987|ref|XP_003697693.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Apis florea]
Length = 491
Score = 300 bits (769), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 153/334 (45%), Positives = 212/334 (63%), Gaps = 10/334 (2%)
Query: 52 TEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
TE ++ + Q +Y+YE + + +DHFSF+ TF RYLIN W N PIF Y
Sbjct: 31 TEKIEQMRLTQNTKYKYEIKTIDMPVDHFSFSVSDTFKLRYLINGT-WQKTNN-APIFFY 88
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
GNEG+IE FA N+GF+WDIAP F A+LVF EHRYYGESMPY + +Y + L YLT+
Sbjct: 89 TGNEGNIEIFAQNTGFMWDIAPEFEALLVFAEHRYYGESMPYSNK--SYTDLNHLGYLTS 146
Query: 172 EQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ 231
+QALAD+ I LK + SP+++FGGSYGGML+AW+R+KYPHI GA+ASSAPILQ
Sbjct: 147 QQALADYIDLIQYLKSKPKYKNSPIIVFGGSYGGMLSAWIRMKYPHIVQGAIASSAPILQ 206
Query: 232 FEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL- 290
F I E+F IV+SDFK+ ++C I++SW + ++ G L+ + LC+ L
Sbjct: 207 FTGITECESFLRIVTSDFKKAHSNCPKLIRKSWNIITNITSTNEGKKWLSDNWKLCQPLK 266
Query: 291 --NSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN-APDATSILERI 347
N+ E L +L+ Y+ LAMV+YPY ++F+ PLP YPI VCK + N + T +L I
Sbjct: 267 NENNIEQLISYLQDIYTNLAMVNYPYKANFLAPLPAYPINAVCKHLTNESLTGTDLLIAI 326
Query: 348 FEGVSVYYNYTGNVDCFQLDDDPHGLD--GWNWQ 379
++++ NYT C L++ LD GW++Q
Sbjct: 327 KNAINIFTNYTSETKCLNLNNSSPQLDAIGWSFQ 360
>gi|355752501|gb|EHH56621.1| hypothetical protein EGM_06072 [Macaca fascicularis]
Length = 516
Score = 300 bits (769), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 147/311 (47%), Positives = 203/311 (65%), Gaps = 11/311 (3%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
+DHF F + TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A
Sbjct: 77 VDHFGFNTVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELK 134
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASP 195
AMLVF EHRYYGES+P+G +++++ L++LT+EQALADFA I +LK+ + AE P
Sbjct: 135 AMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQP 192
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V+ GGSYGGMLAAW R+KYPH+A+GALA+SAPI QFED+VP F IV++DF++
Sbjct: 193 VIAIGGSYGGMLAAWFRMKYPHMAVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPH 252
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYP 313
C +I+ SW + + +GL LT HLC L S + L DW+ + LAMVDYP
Sbjct: 253 CSESIRRSWDAINRLSSTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYP 312
Query: 314 YPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH- 371
Y S+F+ PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + +
Sbjct: 313 YASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATS 372
Query: 372 --GLDGWNWQV 380
G GW++Q
Sbjct: 373 SLGTLGWSYQA 383
>gi|355713391|gb|AES04658.1| prolylcarboxypeptidase [Mustela putorius furo]
Length = 496
Score = 300 bits (768), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 147/321 (45%), Positives = 203/321 (63%), Gaps = 12/321 (3%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
Y Y +Q++DHF F+ TF QRYLI +W G I Y GNEGDI WF N+GF
Sbjct: 48 YSVHYIQQKVDHFGFSADKTFKQRYLIADAYW--KKNGGSILFYTGNEGDITWFCNNTGF 105
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+WD+A + AMLVF EHRYYGES+P+G+ +++++ L++LT+EQALADFAV I +LK+
Sbjct: 106 MWDVADQLKAMLVFAEHRYYGESLPFGNK--SFRDSRHLNFLTSEQALADFAVLIKHLKK 163
Query: 188 NL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ A+ PV+ GGSYGGMLAAW R+KYPHI +GALA+SAPI F ++VP F IV+
Sbjct: 164 TIPGAKNQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWHFGNLVPCGVFMEIVT 223
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESA 303
DFKR +C TI+ SW + + GL L++ LC L + +D L WL
Sbjct: 224 KDFKRGGPNCSETIRSSWDAINRFSRTGAGLRWLSEALDLCTALTNAQDVQHLKAWLSET 283
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN-APDATSILERIFEGVSVYYNYTGNVD 362
+ LAMVDYPY DF+ PLP +PI+ VC+ + N + +L+ IF+ ++VYYNY+G
Sbjct: 284 WINLAMVDYPYECDFLQPLPAWPIKVVCQYLKNPSVSDAQLLQNIFQALNVYYNYSGQAS 343
Query: 363 CFQLDDDPH---GLDGWNWQV 380
C + + G GW++Q
Sbjct: 344 CLNISETTTSNLGTQGWSYQA 364
>gi|402894800|ref|XP_003910533.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Papio
anubis]
Length = 516
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 162/384 (42%), Positives = 224/384 (58%), Gaps = 39/384 (10%)
Query: 24 SILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQR------- 76
S L+P + A R A R +G L T P + Y YF+Q+
Sbjct: 12 SFLAPWATKA----LRPALRALGSLHLPTNPTSLPAVAKN----YSVLYFQQKALAGHLH 63
Query: 77 -------------LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
+DHF F + TF+QRYL+ +W G I Y GNEGDI WF
Sbjct: 64 ICIFQLNHYKTPLVDHFGFNTVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCN 121
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
N+GF+WD+A AMLVF EHRYYGES+P+G +++++ L++LT+EQALADFA I
Sbjct: 122 NTGFMWDVAEELKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIK 179
Query: 184 NLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
+LK+ + AE PV+ GGSYGGMLAAW R+KYPH+A+GALA+SAPI QFED+VP F
Sbjct: 180 HLKRTIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMAVGALAASAPIWQFEDLVPCGVFM 239
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWL 300
IV++DF++ C +I+ SW + + +GL LT HLC L S + L DW+
Sbjct: 240 KIVTTDFRKSGPYCSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWI 299
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTG 359
+ LAMVDYPY S+F+ PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G
Sbjct: 300 SETWVNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSG 359
Query: 360 NVDCFQLDDDPH---GLDGWNWQV 380
V C + + G GW++Q
Sbjct: 360 QVKCLNISETATSSLGTLGWSYQA 383
>gi|432889721|ref|XP_004075329.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Oryzias latipes]
Length = 504
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 164/375 (43%), Positives = 221/375 (58%), Gaps = 29/375 (7%)
Query: 15 SPVITIVIISILSPLSLAAQPSK-FRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYF 73
S +T I L+ L + A S+ F R +G +EPP Y+T YF
Sbjct: 12 SSAVTCFITLSLACLHVTAFKSQLFTR----LGARSFSSEPP----------IIYKTFYF 57
Query: 74 EQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAP 133
QR++HF F + TF QRYL+ HW P+ GPI Y GNEGDI WF N+GF+W+IA
Sbjct: 58 NQRINHFGFLEDGTFKQRYLVADKHWQEPD--GPILFYTGNEGDITWFCNNTGFMWEIAE 115
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA 193
GAMLVF EHRYYGES+P+G + +Y ++ L+YLT+EQALADFAV I NLK +
Sbjct: 116 ELGAMLVFAEHRYYGESLPFG--QDSYSDSKHLNYLTSEQALADFAVLIQNLKSKM--PE 171
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRES 253
SPV+ GGSYGGML+AW+R+KYP++ +GALA+SAPI QF +V FY IV+ DF +
Sbjct: 172 SPVISVGGSYGGMLSAWLRMKYPNVVVGALAASAPIWQFPRMVGCGDFYKIVTQDFAKSG 231
Query: 254 ASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESAYSYLAMV 310
+C TI+ SW + ++ +GL L++ F LC L + D WL+ + LAMV
Sbjct: 232 QNCDVTIRRSWKAINNLSSTASGLQWLSEEFSLCSPLKTKTDAVIFKGWLQETWVNLAMV 291
Query: 311 DYPYPSDFMMPLPGYPIREVCKKI--DNAPDATSILERIFEGVSVYYNYTGNVDCF---Q 365
DYPY + F+ PLP +PI+ VCK + D+ +L + + VYYNYTG+ C Q
Sbjct: 292 DYPYEASFLQPLPAWPIQVVCKYLAFDSKASDYDLLHGVSQAAKVYYNYTGSSSCLNTSQ 351
Query: 366 LDDDPHGLDGWNWQV 380
G GW +Q
Sbjct: 352 TATSSLGALGWTYQA 366
>gi|426369950|ref|XP_004051943.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Gorilla
gorilla gorilla]
Length = 517
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 146/310 (47%), Positives = 202/310 (65%), Gaps = 11/310 (3%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
+DHF F + TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A
Sbjct: 78 VDHFGFNTVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELK 135
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASP 195
AMLVF EHRYYGES+P+G +++++ L++LT+EQALADFA I +LK+ + AE P
Sbjct: 136 AMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQP 193
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV++DF++
Sbjct: 194 VIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPH 253
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYP 313
C +I+ SW + + +GL LT HLC L S + L DW+ + LAMVDYP
Sbjct: 254 CSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYP 313
Query: 314 YPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH- 371
Y S+F+ PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + +
Sbjct: 314 YASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATS 373
Query: 372 --GLDGWNWQ 379
G GW++Q
Sbjct: 374 SLGTLGWSYQ 383
>gi|238006194|gb|ACR34132.1| unknown [Zea mays]
gi|413943526|gb|AFW76175.1| putative serine peptidase S28 family protein [Zea mays]
Length = 372
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 148/272 (54%), Positives = 195/272 (71%), Gaps = 10/272 (3%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHW------VGPNRLGPIFLYCGNEGDIE 119
+ YF Q LDHF+F + F +YL+N W G + GP+F+Y GNEGDIE
Sbjct: 84 FTAHYFPQLLDHFAFTPNASTVFRHKYLLNDTFWRRPGAGAGDDGPGPLFVYTGNEGDIE 143
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
WFA N+GF++DIAP FGA+LVF EHR+YGES P+G+ +Y++A TL YLT+ QALADFA
Sbjct: 144 WFATNTGFMFDIAPTFGALLVFIEHRFYGESKPFGND--SYRSAETLGYLTSTQALADFA 201
Query: 180 VFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPE 239
V I LK++L AEA+PVV+FGGSYGGMLA+W RLKYPH+AIGALASSAPILQF+ I P
Sbjct: 202 VVIRGLKRDLGAEAAPVVVFGGSYGGMLASWFRLKYPHVAIGALASSAPILQFDHITPWS 261
Query: 240 TFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
+FY+ VS DFK ES++CF I+ +W L G + GLL+L+K F C+ + + +W
Sbjct: 262 SFYDAVSQDFKSESSNCFGVIRAAWDVLDERGATDKGLLDLSKLFRACKTVKYAYSIRNW 321
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L +A++Y AMVDYP P++F+ LP YP++EV
Sbjct: 322 LWTAFTYTAMVDYPTPANFLENLPAYPVKEVV 353
>gi|332025885|gb|EGI66041.1| Lysosomal Pro-X carboxypeptidase [Acromyrmex echinatior]
Length = 484
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 151/321 (47%), Positives = 207/321 (64%), Gaps = 10/321 (3%)
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
+Y+YE + F+ R+DHFSFA TF RYLIN D W PIF Y GNEG+IE FA N
Sbjct: 42 RYKYEIKTFDVRVDHFSFAVQDTFKLRYLIN-DTWRKQQN-APIFFYTGNEGNIEVFAQN 99
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+GF+W+IAP+F A+++F EHRYYGES+PYG+ A N YLT++QALAD+ I +
Sbjct: 100 TGFLWEIAPKFDALVIFAEHRYYGESLPYGNQSFA--NLQHRGYLTSQQALADYVELIAH 157
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
LK E SPV++FGGSYGGML+AWMR+KYPH+ GA+ASSAP+LQF D+V E F I
Sbjct: 158 LKSQPRYEHSPVIVFGGSYGGMLSAWMRMKYPHVVQGAIASSAPLLQFTDVVDCEVFARI 217
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLE 301
+SD+K + +C I++SW + +V + G L+ + LC L + ED L ++L+
Sbjct: 218 TTSDYKAANPTCSKLIQKSWNTITNVTSNDEGKKWLSDNWKLCEPLKTAEDVKTLKNFLQ 277
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP-DATSILERIFEGVSVYYNYTGN 360
Y LAMV+YPY ++F+ PLPG PI C+ + N +L + +SVY NYTG
Sbjct: 278 EVYIDLAMVNYPYETNFLAPLPGNPINVFCQHLTNVSLTGKPLLLALHGAISVYTNYTGK 337
Query: 361 VDCFQLDDDPHGLD--GWNWQ 379
+C + + GLD GW++Q
Sbjct: 338 TNCISMKNAEPGLDDQGWDYQ 358
>gi|301780038|ref|XP_002925438.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Ailuropoda
melanoleuca]
Length = 520
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 146/316 (46%), Positives = 202/316 (63%), Gaps = 12/316 (3%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
+ ++DHF F TF QRYLI +HW G I Y GNEGDI WF N+GF+WD+A
Sbjct: 76 LDLKVDHFGFTADETFKQRYLIADEHW--KKNGGSILFYTGNEGDITWFCNNTGFMWDVA 133
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SA 191
+ AMLVF EHRYYGES+P+G+ +++++ L++LT+EQALADFAV I +LK+ + A
Sbjct: 134 DQLKAMLVFAEHRYYGESLPFGNK--SFKDSRHLNFLTSEQALADFAVLIKHLKRTIPGA 191
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR 251
+ PV+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI F ++VP F IV+ DFKR
Sbjct: 192 KNQPVIAVGGSYGGMLAAWFRMKYPHMVVGALAASAPIWHFGNLVPCGVFMEIVTKDFKR 251
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESAYSYLA 308
+C TI+ SW + + GL L++ LC L +TED L W+ + LA
Sbjct: 252 SGPNCSETIRSSWDAINRFARTGAGLRWLSEALGLCTALTNTEDVQRLKAWISETWINLA 311
Query: 309 MVDYPYPSDFMMPLPGYPIREVCKKIDN-APDATSILERIFEGVSVYYNYTGNVDCFQLD 367
MVDYPY SDF+ PLP +PI+ VC+ + N + +L+ IF+ ++VYYNY+G C +
Sbjct: 312 MVDYPYESDFLQPLPAWPIKVVCQYLKNPSVSDAQLLQNIFQALNVYYNYSGQASCLNIS 371
Query: 368 DDPH---GLDGWNWQV 380
+ G GW++Q
Sbjct: 372 ETTTSSLGTQGWSYQA 387
>gi|114639539|ref|XP_001175149.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 5 [Pan
troglodytes]
gi|397502804|ref|XP_003822033.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Pan
paniscus]
Length = 517
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 146/310 (47%), Positives = 202/310 (65%), Gaps = 11/310 (3%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
+DHF F + TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A
Sbjct: 78 VDHFGFNTVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELK 135
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASP 195
AMLVF EHRYYGES+P+G +++++ L++LT+EQALADFA I +LK+ + AE P
Sbjct: 136 AMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQP 193
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV++DF++
Sbjct: 194 VIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPH 253
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYP 313
C +I+ SW + + +GL LT HLC L S + L DW+ + LAMVDYP
Sbjct: 254 CSESIRRSWEAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYP 313
Query: 314 YPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH- 371
Y S+F+ PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + +
Sbjct: 314 YASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATS 373
Query: 372 --GLDGWNWQ 379
G GW++Q
Sbjct: 374 SLGTLGWSYQ 383
>gi|148909163|gb|ABR17682.1| unknown [Picea sitchensis]
Length = 501
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 155/320 (48%), Positives = 206/320 (64%), Gaps = 25/320 (7%)
Query: 67 RYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
+Y+ RY+ Q LDHFSF TF Q+YLIN+DHW G + PIF+Y GNEG IEWF N
Sbjct: 59 KYDVRYYTQILDHFSFVPESYQTFQQKYLINSDHWGGASAKSPIFVYTGNEGFIEWFTEN 118
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+GF++DIAP+F AMLVF EHR+YG SMP+GS + AY N++TL +L++ QALADFA IT+
Sbjct: 119 TGFMFDIAPQFKAMLVFIEHRFYGHSMPFGSQKAAYSNSSTLGFLSSAQALADFATLITD 178
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
LK+NLSAE SPVV+FG + G ++SSAPIL F++I P +F +
Sbjct: 179 LKKNLSAEDSPVVVFG-----------------GSYGGISSSAPILYFDNITPIGSFDDT 221
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE----DLADWL 300
VS DF+ ES +CF IK SW + + GL L K +C+ NS +LA WL
Sbjct: 222 VSEDFRSESENCFKVIKGSWNVIDEMTSTPEGLKSLRKALRICKS-NSDNYIAGNLAGWL 280
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGN 360
AY AM DYP ++F+ PLP YP++++CK IDN +T +L +++ +VYYNYTG
Sbjct: 281 YDAYYTAAMTDYPVAANFVQPLPAYPVKQMCKAIDNPSGSTDLLSQLYGVANVYYNYTGR 340
Query: 361 VDCFQLD-DDPHGLDGWNWQ 379
CF + DPHG DGW +Q
Sbjct: 341 SSCFDIRPSDPHGEDGWQFQ 360
>gi|349604443|gb|AEP99992.1| Lysosomal Pro-X carboxypeptidase-like protein, partial [Equus
caballus]
Length = 441
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 148/313 (47%), Positives = 205/313 (65%), Gaps = 14/313 (4%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
+DHF F TF QRYLI HW G I Y GNEGDI WF N+GF+WD+A
Sbjct: 1 VDHFGFNTDKTFKQRYLIADKHWKKDG--GSILFYTGNEGDITWFCNNTGFMWDMAEELK 58
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASP 195
AMLVF EHRYYGES+P+G+ +++++T L++LT+EQALADFA I +LK+ + A+ P
Sbjct: 59 AMLVFAEHRYYGESLPFGAN--SFKDSTHLNFLTSEQALADFAKLIKHLKRTVPGAKNQP 116
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V+ GGSYGGMLAAW R+KYPH+ +GALASSAPI QFED+VP F IV++DF++ +
Sbjct: 117 VIALGGSYGGMLAAWFRMKYPHMVVGALASSAPIWQFEDLVPCGVFMEIVTADFRKSGPN 176
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESAYSYLAMVDY 312
C +I+ SW + + GL L++ LC L +++D L DW+ + LAMVDY
Sbjct: 177 CSESIRRSWDAIRRLTSTGPGLHWLSEALRLCSPLTNSQDVEHLKDWITETWVNLAMVDY 236
Query: 313 PYPSDFMMPLPGYPIREVCKKIDNAPDATS--ILERIFEGVSVYYNYTGNVDCFQLDDDP 370
PY S+F+ PLP +PI+ VC+ + N P+ + +LE IF+ ++VYYNY+G V C + +
Sbjct: 237 PYESNFLQPLPAWPIKVVCQYLKN-PNVSDPQLLENIFQALNVYYNYSGQVKCLNMSETA 295
Query: 371 H---GLDGWNWQV 380
G GW++Q
Sbjct: 296 TRSLGSLGWSYQA 308
>gi|117306169|ref|NP_955450.2| lysosomal Pro-X carboxypeptidase isoform 2 preproprotein [Homo
sapiens]
gi|119595481|gb|EAW75075.1| prolylcarboxypeptidase (angiotensinase C), isoform CRA_b [Homo
sapiens]
Length = 517
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 146/310 (47%), Positives = 201/310 (64%), Gaps = 11/310 (3%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
+DHF F + TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A
Sbjct: 78 VDHFGFNTVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEELK 135
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASP 195
AMLVF EHRYYGES+P+G +++++ L++LT+EQALADFA I +LK+ + AE P
Sbjct: 136 AMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQP 193
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV++DF++
Sbjct: 194 VIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPH 253
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYP 313
C +I SW + + +GL LT HLC L S + L DW+ + LAMVDYP
Sbjct: 254 CSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYP 313
Query: 314 YPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH- 371
Y S+F+ PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + +
Sbjct: 314 YASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATS 373
Query: 372 --GLDGWNWQ 379
G GW++Q
Sbjct: 374 SLGTLGWSYQ 383
>gi|395814761|ref|XP_003780910.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Otolemur
garnettii]
Length = 515
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 146/310 (47%), Positives = 200/310 (64%), Gaps = 11/310 (3%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
+DHF F + TF+QRYLI ++W G I Y GNEGDI WF N+GF+WD+A
Sbjct: 77 VDHFGFNSVKTFNQRYLIANEYWKKDG--GSILFYTGNEGDIVWFCNNTGFMWDVAQELK 134
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASP 195
AMLVF EHRYYGES+P+G +++++ L+YLT+EQALADFA I +L++ + + P
Sbjct: 135 AMLVFAEHRYYGESLPFGKN--SFKDSRHLNYLTSEQALADFAELIRHLQETIPGVKNQP 192
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV+SDFKR
Sbjct: 193 VIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTSDFKRSGPY 252
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN--STEDLADWLESAYSYLAMVDYP 313
C TI++SW + + GL LT+ HLC L + L DW+ + LAMVDYP
Sbjct: 253 CSETIRKSWNVINQLSTTSRGLQYLTEVLHLCTPLTPRDVQHLKDWIAETWVNLAMVDYP 312
Query: 314 YPSDFMMPLPGYPIREVCKKIDNAPDATSIL-ERIFEGVSVYYNYTGNVDCFQLDDDPH- 371
YPS F+ PLP +PI+ VC+ + N + S++ + IF+ +++YYNY+G C +
Sbjct: 313 YPSSFLQPLPAWPIKVVCQYLKNPNVSDSLMIQNIFQALNIYYNYSGEAQCLNVSTTTTS 372
Query: 372 --GLDGWNWQ 379
G GW++Q
Sbjct: 373 NLGSLGWSYQ 382
>gi|281339633|gb|EFB15217.1| hypothetical protein PANDA_014943 [Ailuropoda melanoleuca]
Length = 441
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 146/312 (46%), Positives = 200/312 (64%), Gaps = 12/312 (3%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
+DHF F TF QRYLI +HW G I Y GNEGDI WF N+GF+WD+A +
Sbjct: 1 VDHFGFTADETFKQRYLIADEHW--KKNGGSILFYTGNEGDITWFCNNTGFMWDVADQLK 58
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASP 195
AMLVF EHRYYGES+P+G+ +++++ L++LT+EQALADFAV I +LK+ + A+ P
Sbjct: 59 AMLVFAEHRYYGESLPFGNK--SFKDSRHLNFLTSEQALADFAVLIKHLKRTIPGAKNQP 116
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI F ++VP F IV+ DFKR +
Sbjct: 117 VIAVGGSYGGMLAAWFRMKYPHMVVGALAASAPIWHFGNLVPCGVFMEIVTKDFKRSGPN 176
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESAYSYLAMVDY 312
C TI+ SW + + GL L++ LC L +TED L W+ + LAMVDY
Sbjct: 177 CSETIRSSWDAINRFARTGAGLRWLSEALGLCTALTNTEDVQRLKAWISETWINLAMVDY 236
Query: 313 PYPSDFMMPLPGYPIREVCKKIDN-APDATSILERIFEGVSVYYNYTGNVDCFQLDDDPH 371
PY SDF+ PLP +PI+ VC+ + N + +L+ IF+ ++VYYNY+G C + +
Sbjct: 237 PYESDFLQPLPAWPIKVVCQYLKNPSVSDAQLLQNIFQALNVYYNYSGQASCLNISETTT 296
Query: 372 ---GLDGWNWQV 380
G GW++Q
Sbjct: 297 SSLGTQGWSYQA 308
>gi|338726996|ref|XP_001917253.2| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X carboxypeptidase
[Equus caballus]
Length = 519
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 147/315 (46%), Positives = 205/315 (65%), Gaps = 14/315 (4%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPR 134
+ +DHF F TF QRYLI HW G I Y GNEGDI WF N+GF+WD+A
Sbjct: 77 RXVDHFGFNTDKTFKQRYLIADKHWKKDG--GSILFYTGNEGDITWFCNNTGFMWDMAEE 134
Query: 135 FGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEA 193
AMLVF EHRYYGES+P+G+ +++++T L++LT+EQALADFA I +LK+ + A+
Sbjct: 135 LKAMLVFAEHRYYGESLPFGAN--SFKDSTHLNFLTSEQALADFAKLIKHLKRTVPGAKN 192
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRES 253
PV+ GGSYGGMLAAW R+KYPH+ +GALASSAPI QFED+VP F IV++DF++
Sbjct: 193 QPVIALGGSYGGMLAAWFRMKYPHMVVGALASSAPIWQFEDLVPCGVFMEIVTADFRKSG 252
Query: 254 ASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESAYSYLAMV 310
+C +I+ SW + + GL L++ LC L +++D L DW+ + LAMV
Sbjct: 253 PNCSESIRRSWDAIRRLTSTGPGLHWLSEALRLCSPLTNSQDVEHLKDWITETWVNLAMV 312
Query: 311 DYPYPSDFMMPLPGYPIREVCKKIDNAPDATS--ILERIFEGVSVYYNYTGNVDCFQLDD 368
DYPY S+F+ PLP +PI+ VC+ + N P+ + +LE IF+ ++VYYNY+ V C + +
Sbjct: 313 DYPYESNFLQPLPAWPIKVVCQYLKN-PNVSDPQLLENIFQALNVYYNYSDQVKCLNMSE 371
Query: 369 DPH---GLDGWNWQV 380
G GW++Q
Sbjct: 372 TATRSLGSLGWSYQA 386
>gi|50540452|ref|NP_001002694.1| lysosomal Pro-X carboxypeptidase precursor [Danio rerio]
gi|49903247|gb|AAH76507.1| Prolylcarboxypeptidase (angiotensinase C) [Danio rerio]
Length = 490
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 165/369 (44%), Positives = 221/369 (59%), Gaps = 29/369 (7%)
Query: 20 IVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDH 79
+++I I S +LA +P RF+G+ Q Q + Y T Y +Q++DH
Sbjct: 13 LILIFIFSCKALALKP-------RFLGR-------NTDQSFSQHRSVSYNTLYIDQQIDH 58
Query: 80 FSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAML 139
F F + TF QRYL+N HW GPI Y GNEGDI WF N+GF+WD+A GA+L
Sbjct: 59 FGFLENGTFKQRYLLNDQHWHKDG--GPILFYTGNEGDITWFCNNTGFMWDVAEELGALL 116
Query: 140 VFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASPVVL 198
VF EHRYYGES+P+G E +Y NA L+YLT+EQ LADFAV I LK++ AE S V+
Sbjct: 117 VFAEHRYYGESLPFG--EESYSNAKYLNYLTSEQVLADFAVLIKALKKSQPGAEKSSVIA 174
Query: 199 FGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFN 258
GGSYGGMLAAW+R+KYP+ +GALA+SAPI QF IVP FY +V+ DF ++C +
Sbjct: 175 IGGSYGGMLAAWLRMKYPNAVVGALAASAPIWQF--IVPCGEFYRVVTRDFTISGSNCSS 232
Query: 259 TIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL---ADWLESAYSYLAMVDYPYP 315
+I+ SW + + GL L++TF LC L S ED+ WL+ + LAMVDYPY
Sbjct: 233 SIRSSWAAIDRLSATGEGLQWLSQTFGLCGPLKSQEDVFGFKAWLQETWVNLAMVDYPYE 292
Query: 316 SDFMMPLPGYPIREVCKKI--DNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPH-- 371
+DF+ PLP +P++ VCK + + +L + + V VYYNYTG+ C
Sbjct: 293 ADFLQPLPAWPVKVVCKNLQFNKGVSDKQLLNGVSQAVRVYYNYTGDAVCLNTSKTATGN 352
Query: 372 -GLDGWNWQ 379
G GW +Q
Sbjct: 353 LGFLGWFYQ 361
>gi|166064014|ref|NP_001099751.2| lysosomal Pro-X carboxypeptidase precursor [Rattus norvegicus]
Length = 493
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 146/321 (45%), Positives = 200/321 (62%), Gaps = 13/321 (4%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
Y YF+Q++DHF F+D TF QRYL+ HW G I Y GNEGDI WF N+GF
Sbjct: 45 YSVHYFQQKVDHFGFSDTRTFKQRYLVADKHW--QRNGGSILFYTGNEGDIVWFCNNTGF 102
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+WD+A AMLVF EHRYYG+S+P+G +++++ L++LT+EQALADFA I +LK+
Sbjct: 103 MWDVAEELKAMLVFAEHRYYGKSLPFGRD--SFKDSQHLNFLTSEQALADFAELIRHLKE 160
Query: 188 NL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ E PV+ GGSYGGMLAAW R+KYPHI +GALA+SAPI Q + + P F IV+
Sbjct: 161 TIPGTEGQPVIAIGGSYGGMLAAWFRMKYPHIVVGALAASAPIWQLDGMAPCGEFMKIVT 220
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE--DLADWLESAY 304
DF + C +I+ SW + + +GL L T HLC L S + L W+ +
Sbjct: 221 KDFSQSGPHCSESIRRSWSVINRLSDTGSGLQWLRDTLHLCSPLTSEKMPTLKGWIAETW 280
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDA--TSILERIFEGVSVYYNYTGNVD 362
LAMV+YPY +F+ PLP +PI+EVC+ + N P+ T +L+ IF+ ++VYYNY+G
Sbjct: 281 VNLAMVNYPYACNFLQPLPAWPIKEVCRYLRN-PNVSDTVLLQNIFQALNVYYNYSGQTK 339
Query: 363 CF---QLDDDPHGLDGWNWQV 380
C Q G GW++Q
Sbjct: 340 CLNISQTTTSSLGSTGWSFQA 360
>gi|426251525|ref|XP_004019472.1| PREDICTED: lysosomal Pro-X carboxypeptidase isoform 2 [Ovis aries]
Length = 517
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 148/313 (47%), Positives = 200/313 (63%), Gaps = 14/313 (4%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
+DHF F TF QRYLI ++W G I Y GNEGDI WF N+GF+WDIA
Sbjct: 77 VDHFGFNIDRTFKQRYLIADNYWKEDG--GSILFYTGNEGDIIWFCNNTGFMWDIAEEMK 134
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASP 195
AMLVF EHRYYGES+P+G+ ++ ++ L++LT EQALADFA I LK+ + A P
Sbjct: 135 AMLVFAEHRYYGESLPFGAD--SFSDSRHLNFLTTEQALADFAKLIKYLKRTIPGARNQP 192
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V+ GGSYGGMLAAW R+KYPH+ +GALASSAPI QF D+VP + F IV++DF + +
Sbjct: 193 VIALGGSYGGMLAAWFRMKYPHLVVGALASSAPIWQFNDLVPCDVFMKIVTTDFSQSGPN 252
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESAYSYLAMVDY 312
C +I+ SW + + +K GL L++ HLC L +D L DW+ + LAMVDY
Sbjct: 253 CSESIRRSWDAINRLAKKGPGLRWLSEALHLCTPLTKLQDVQHLKDWIAETWVNLAMVDY 312
Query: 313 PYPSDFMMPLPGYPIREVCK--KIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDP 370
PY S+F+ PLP +P++ VC+ K N PD T +L IF+ ++VYYNY+G C + +
Sbjct: 313 PYESNFLQPLPAWPVKVVCQYFKYSNVPD-TLLLRNIFQALNVYYNYSGQAKCLNVSETA 371
Query: 371 H---GLDGWNWQV 380
G+ GW++Q
Sbjct: 372 TSSLGVQGWSYQA 384
>gi|441645350|ref|XP_004093164.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X carboxypeptidase
[Nomascus leucogenys]
Length = 517
Score = 295 bits (754), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 145/310 (46%), Positives = 201/310 (64%), Gaps = 11/310 (3%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
+DHF F + TF+QRYL+ +W G I Y GNEGDI WF N+GF+WD+A
Sbjct: 78 VDHFGFNTVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTGFMWDVAEDXE 135
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASP 195
AMLVF EHRYYGES+P+G +++++ L++LT+EQALAD A I +LK+ + AE P
Sbjct: 136 AMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADLAELIKHLKRTIPGAENQP 193
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV++DF++
Sbjct: 194 VIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPH 253
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYP 313
C +I+ SW + + +GL LT HLC L S + L DW+ + LAMVDYP
Sbjct: 254 CSESIRRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYP 313
Query: 314 YPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH- 371
Y S+F+ PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + +
Sbjct: 314 YASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATS 373
Query: 372 --GLDGWNWQ 379
G GW++Q
Sbjct: 374 SLGTLGWSYQ 383
>gi|190702395|gb|ACE75287.1| prolylcarboxypeptidase [Glyptapanteles flavicoxis]
Length = 497
Score = 294 bits (753), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 206/322 (63%), Gaps = 12/322 (3%)
Query: 66 YRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y+Y T+ F +DHFSF+ TF RY +N G N PIF Y GNEG +E FA N+
Sbjct: 42 YKYVTKKFIVPVDHFSFSLNNTFEMRYFVNDTWKSGKN--APIFFYTGNEGVLETFAANT 99
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+WDIAP FGA++VF EHRYYGESMP+G+ ++ N L YLT++QALAD+ I +L
Sbjct: 100 GFMWDIAPTFGALIVFAEHRYYGESMPFGNK--SFDNVKNLGYLTSQQALADYVDLIVHL 157
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K + S + SPV+ FGGSYGGML+AW R+KYPHI GA+A+SAP+LQF + F IV
Sbjct: 158 KSDPSLKHSPVIAFGGSYGGMLSAWFRMKYPHIINGAIAASAPVLQFTGYTDCQAFSRIV 217
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
+SDF+ +C I++SW + ++ ++G ++ F +C L + E+ D+L S
Sbjct: 218 TSDFRAVHPNCEKVIRKSWESIKNLTSTDDGKKWISTKFKVCGSLTTEENFKTFKDFLLS 277
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID--NAPDATSILERIFEGVSVYYNYTGN 360
YS LAMV+YPY +DF+ PLP +PIRE CK ++ N ++ + +G+++Y NYTG
Sbjct: 278 VYSNLAMVNYPYATDFLSPLPAFPIREFCKFVNDTNLSSDKDVMTNLQKGINLYSNYTGR 337
Query: 361 VDCFQLDD---DPHGLDGWNWQ 379
+ C L D D +GWN+Q
Sbjct: 338 LKCLDLSDPEPDLGAANGWNYQ 359
>gi|320166581|gb|EFW43480.1| prolylcarboxypeptidase [Capsaspora owczarzaki ATCC 30864]
Length = 496
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 151/350 (43%), Positives = 208/350 (59%), Gaps = 13/350 (3%)
Query: 36 SKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLIN 95
S RR P V T P + + + QY Y+ ++F Q LDHF F TF QRYLI+
Sbjct: 29 SHGRRLPLSVAAGLRST-PAEATSERLKGQYNYDVKWFTQTLDHFRFDTNATFQQRYLIS 87
Query: 96 TDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGS 155
T +W N GP+F Y GNEGDIEWFA N+GFVW+IA + A++VF EHRYYG++MP+G
Sbjct: 88 TANW---NGYGPMFFYTGNEGDIEWFADNTGFVWEIAAEYNALVVFAEHRYYGQTMPFGD 144
Query: 156 TEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKY 215
+ + YLT EQALADFA+ I LK L+ PVV FGGSYGGMLA W RLKY
Sbjct: 145 KSF---DLDKVGYLTTEQALADFAILIPALKAQLNVPNLPVVAFGGSYGGMLAGWFRLKY 201
Query: 216 PHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKEN 275
P++ GA+A+SAPI+ F+D+ E F I ++DF A C N I++ + ++ ++ +
Sbjct: 202 PNVVDGAIAASAPIVYFQDLTSTEIFNEIATNDFALTDARCPNIIRDGFSKVDALSKTAA 261
Query: 276 GLLELTKTFHLCRELNSTE--DLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKK 333
GL ++K F LC L + WLE+ +Y+AM DYPY S+F+ P+P +P+ CK
Sbjct: 262 GLQSISKAFKLCGTLQPADYATFIGWLEAGLTYMAMTDYPYASNFLQPMPAWPVDASCKA 321
Query: 334 IDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGWNWQ 379
+ + TS + + V VYYNYTG C + G+ GW++Q
Sbjct: 322 LLSTFAGTSDSVAALNTAVGVYYNYTGQTACNNISSQATSDLGVLGWDYQ 371
>gi|359479405|ref|XP_002272041.2| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Vitis vinifera]
Length = 491
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 147/316 (46%), Positives = 200/316 (63%), Gaps = 16/316 (5%)
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ T Y+ Q LDHF++ F QRYLIN+ +W G N PIF+Y G+EG I A +
Sbjct: 50 FVTYYYNQTLDHFNYRPESYTNFQQRYLINSAYWGGANSSSPIFVYTGDEGSITGAAAFA 109
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ D+A RF +L++ EHRYYG+S+P+ S ++A+ N +TL Y ++ QALAD+A ITNL
Sbjct: 110 GFMVDLASRFKGLLLYIEHRYYGDSVPFRSKDIAFNNTSTLGYFSSTQALADYAELITNL 169
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K+NLSAE PV+ GGSYGGMLA+W RLKYPH+ IGALASSAPIL F+DI P + +IV
Sbjct: 170 KKNLSAENCPVIAIGGSYGGMLASWFRLKYPHVVIGALASSAPILYFDDITPHNGYDSIV 229
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
+ DF+ S +C+ TIK+SW E+ V + NGL L++ F C LNS++ L +LE Y
Sbjct: 230 TKDFRETSETCYTTIKQSWSEIDEVAGQPNGLANLSQIFRTCEPLNSSQQLKLYLEYTYE 289
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQ 365
A D P P + + ++C ID AP+ TSIL R+ EGV N + C +
Sbjct: 290 ASAQYDNP---------PAHYVSDICNAIDGAPEGTSILGRVAEGV----NASAGPPCHR 336
Query: 366 LDD-DPHGLDGWNWQV 380
+ D P + GW WQ
Sbjct: 337 IYDFQPSNMSGWLWQT 352
>gi|413933337|gb|AFW67888.1| hypothetical protein ZEAMMB73_712720 [Zea mays]
Length = 773
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 163/332 (49%), Positives = 189/332 (56%), Gaps = 71/332 (21%)
Query: 77 LDHFSFADLPT-------FSQRYLINTDH-WVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
LDHFSF + F QRYL+ D W GP GPIF YCGNEGDI WFA NSG +
Sbjct: 482 LDHFSFPGVEDEDEAVVFFQQRYLVGRDSGWAGPG--GPIFFYCGNEGDIAWFAANSGLI 539
Query: 129 WDIAPRFGA----------------MLVFPE-------HRYYGESMPYGSTEVAYQNATT 165
WD APRF A V HRYY ESMP+GS AY ++
Sbjct: 540 WDAAPRFAARGNRSSAASLVSYSYLFFVLKTFRLKRYIHRYYRESMPFGSKAKAYSDSKF 599
Query: 166 LSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALAS 225
+YLTAEQALADF V +T+LK+NLSAE SPVVLFGGSYGGMLAAWMRLKYPHIAIG L
Sbjct: 600 PTYLTAEQALADFVVLLTDLKRNLSAEGSPVVLFGGSYGGMLAAWMRLKYPHIAIGVL-- 657
Query: 226 SAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFH 285
+ SC S L + L +
Sbjct: 658 ------------------------HHQLRSC------SLRTLFLL------LYSMISYLM 681
Query: 286 LCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILE 345
+ L ++ DL+DWL SAYSYLAMVDYP PS+F+ PLP PI+EVC ID+ P LE
Sbjct: 682 ILGTLKTSGDLSDWLSSAYSYLAMVDYPLPSEFLRPLPANPIKEVCGNIDSQPKGIGTLE 741
Query: 346 RIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWN 377
RI+ GV+VYYNYT VDCF L+DDPHG+ GW+
Sbjct: 742 RIYAGVNVYYNYTDIVDCFDLNDDPHGMGGWD 773
>gi|225436410|ref|XP_002271961.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
Length = 505
Score = 293 bits (751), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 157/371 (42%), Positives = 224/371 (60%), Gaps = 25/371 (6%)
Query: 16 PVITIVIISILSPLSLAA-QPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFE 74
P+ + ++ S ++A QP R +G+ L EP + ++ ++T Y+
Sbjct: 13 PISILALVIFFSTTCVSATQPHLLRLG--VLGR-TFLREPATFSESVSEE---FQTFYYN 66
Query: 75 QRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
Q LDHF++ TF QRY++N +W G N PIF Y G E D+++ GF+ D A
Sbjct: 67 QTLDHFNYRPESYITFQQRYVVNFKYWGGANASAPIFAYLGAEADLDYDLSGIGFLTDNA 126
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
+F A+LV+ EHRYYG+S+PYGS E A++NA+TL Y + QA+AD+A + +K+ L AE
Sbjct: 127 HQFKALLVYIEHRYYGKSIPYGSREEAFKNASTLGYFNSAQAIADYAEVLIYIKKKLLAE 186
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRE 252
SPV++ G SYGGMLA+W RLKYPH+A+GALASSAPIL F+DI P +++IV+ DF+
Sbjct: 187 NSPVIVVGASYGGMLASWFRLKYPHVALGALASSAPILYFDDITPHNGYFSIVTKDFREA 246
Query: 253 SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDY 312
S SC+ TI ESW E+ V + NGL L+K F +C +LN++++L ++LE YS A D
Sbjct: 247 SESCYKTISESWSEIDRVASEPNGLSILSKKFRICGQLNNSDELKEFLERTYSIAAQYDA 306
Query: 313 PYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF---QLDDD 369
P P YP+ VC ID AP+ + IL RIF GV + GN+ C+ D
Sbjct: 307 P---------PSYPVTVVCGGIDGAPEGSDILSRIFAGVVA---FRGNMSCYYTSNTTDY 354
Query: 370 P-HGLDGWNWQ 379
P + GW WQ
Sbjct: 355 PIETIQGWGWQ 365
>gi|297734878|emb|CBI17112.3| unnamed protein product [Vitis vinifera]
Length = 624
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 157/371 (42%), Positives = 224/371 (60%), Gaps = 25/371 (6%)
Query: 16 PVITIVIISILSPLSLAA-QPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFE 74
P+ + ++ S ++A QP R +G+ L EP + ++ ++T Y+
Sbjct: 132 PISILALVIFFSTTCVSATQPHLLRLG--VLGR-TFLREPATFSESVSEE---FQTFYYN 185
Query: 75 QRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
Q LDHF++ TF QRY++N +W G N PIF Y G E D+++ GF+ D A
Sbjct: 186 QTLDHFNYRPESYITFQQRYVVNFKYWGGANASAPIFAYLGAEADLDYDLSGIGFLTDNA 245
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
+F A+LV+ EHRYYG+S+PYGS E A++NA+TL Y + QA+AD+A + +K+ L AE
Sbjct: 246 HQFKALLVYIEHRYYGKSIPYGSREEAFKNASTLGYFNSAQAIADYAEVLIYIKKKLLAE 305
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRE 252
SPV++ G SYGGMLA+W RLKYPH+A+GALASSAPIL F+DI P +++IV+ DF+
Sbjct: 306 NSPVIVVGASYGGMLASWFRLKYPHVALGALASSAPILYFDDITPHNGYFSIVTKDFREA 365
Query: 253 SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDY 312
S SC+ TI ESW E+ V + NGL L+K F +C +LN++++L ++LE YS A D
Sbjct: 366 SESCYKTISESWSEIDRVASEPNGLSILSKKFRICGQLNNSDELKEFLERTYSIAAQYDA 425
Query: 313 PYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF---QLDDD 369
P P YP+ VC ID AP+ + IL RIF GV + GN+ C+ D
Sbjct: 426 P---------PSYPVTVVCGGIDGAPEGSDILSRIFAGVVA---FRGNMSCYYTSNTTDY 473
Query: 370 P-HGLDGWNWQ 379
P + GW WQ
Sbjct: 474 PIETIQGWGWQ 484
>gi|340710015|ref|XP_003393594.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Bombus
terrestris]
Length = 494
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 149/334 (44%), Positives = 210/334 (62%), Gaps = 14/334 (4%)
Query: 56 QRQQRQQQQ----QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
Q Q R Q + +Y+YE + +DHFSF+ TF RYL+N + + PIF Y
Sbjct: 32 QNQLRTQNELYSGKYKYEIKTINMPVDHFSFSVPDTFKLRYLVNNTWQIRKD--APIFFY 89
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
GNEG+IE FA N+GF+WDIAP FGA+L+F EHRYYGESMPY + +Y + L YLT+
Sbjct: 90 TGNEGNIENFAQNTGFMWDIAPEFGALLIFAEHRYYGESMPYNNK--SYMDLNHLGYLTS 147
Query: 172 EQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ 231
QALAD+ I +K + SPV++FGGSYGGML+AW+R+KYPH+ GA+ASSAPILQ
Sbjct: 148 RQALADYVDLIQYVKSKPEYKYSPVIVFGGSYGGMLSAWIRMKYPHVVQGAIASSAPILQ 207
Query: 232 FEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN 291
F + E F IV+SDFK +C I+ SW ++++ + G L ++ LC+ L
Sbjct: 208 FSGVTECEAFVRIVTSDFKTAHTNCPKLIRRSWSTIINITSSDKGREWLADSWKLCQPLK 267
Query: 292 STED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN-APDATSILERI 347
++ D L +LE Y+ LA+V+YPY ++F+ PLP YP++ VCK + N + T +L I
Sbjct: 268 NSSDVQQLISYLEDIYTNLAVVNYPYEANFLTPLPAYPVKAVCKHLTNQSLVGTELLIAI 327
Query: 348 FEGVSVYYNYTGNVDCFQLDDD-PH-GLDGWNWQ 379
+ ++++ NYTG C L+D P G GW +Q
Sbjct: 328 HKAINIFTNYTGETKCLNLNDSVPQLGAVGWPFQ 361
>gi|225448871|ref|XP_002270231.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
gi|296083471|emb|CBI23429.3| unnamed protein product [Vitis vinifera]
Length = 503
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 156/368 (42%), Positives = 215/368 (58%), Gaps = 23/368 (6%)
Query: 22 IISILSPLSLAAQPSKFRRAPRFV----GKLPHLTEPPQRQQRQQQQQYRYETRYFEQRL 77
+I + S++A PSK + PR L L + ++T ++ Q L
Sbjct: 12 LILLFITASVSATPSK--KIPRLGVLRGSSLSVLEGSSSLRTVSVNLSENFQTFFYPQTL 69
Query: 78 DHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRF 135
DHF++ TF RY++N ++W G IF+Y G E D++ + GF+ D RF
Sbjct: 70 DHFNYRPESYTTFQHRYMVNFNYWGGARSAAQIFVYLGEESDLDKDINSIGFLVDNGARF 129
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
GA+LV+ EHRYYG+S P+GS + + QNA+ Y + QALAD+A I NLK+NLSA++SP
Sbjct: 130 GALLVYIEHRYYGKSNPFGSMQKSLQNASRRGYFNSGQALADYAEVIINLKKNLSADSSP 189
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V++ GGSYGG+LAAW RLKYPH+A+GALASSAPIL F+DI P + +Y++V+ DF+ S S
Sbjct: 190 VIVVGGSYGGLLAAWFRLKYPHVALGALASSAPILYFDDITPQDGYYSLVTKDFRDFSES 249
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYP 315
C+NTIK+SW E+ + NGLL L+K F C+ L S L D+LE+ YS A D P
Sbjct: 250 CYNTIKDSWAEIDKAAAEANGLLNLSKKFRTCKPLESASQLKDYLETMYSIAAQYDRP-- 307
Query: 316 SDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDD---PHG 372
P YP+ VC ID T IL+RIF G+ GN C+ +
Sbjct: 308 -------PMYPVTVVCNGIDGGLQGTDILDRIFSGIVA---SRGNKSCYDMGQSSFPSET 357
Query: 373 LDGWNWQV 380
+GWNWQV
Sbjct: 358 EEGWNWQV 365
>gi|193787211|dbj|BAG52417.1| unnamed protein product [Homo sapiens]
Length = 517
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 145/310 (46%), Positives = 199/310 (64%), Gaps = 11/310 (3%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
+DHF F + TF+QRYL+ +W G I Y GNEGDI WF N+ F+WD+A
Sbjct: 78 VDHFGFNTVKTFNQRYLVADKYW--KKNGGSILFYTGNEGDIIWFCNNTEFMWDVAEELK 135
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASP 195
AMLVF EHRYYGES+P+G +++++ L++LT+EQALADFA I +LK+ + AE P
Sbjct: 136 AMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKRTIPGAENQP 193
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV++DF++
Sbjct: 194 VIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVTTDFRKSGPH 253
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAYSYLAMVDYP 313
C +I SW + + +GL LT HLC L S + L DW+ + LAMVDYP
Sbjct: 254 CSESIHRSWDAINRLLNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETWVNLAMVDYP 313
Query: 314 YPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPH- 371
Y S+F+ PLP PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C + +
Sbjct: 314 YASNFLQPLPARPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKCLNISETATS 373
Query: 372 --GLDGWNWQ 379
G GW++Q
Sbjct: 374 SLGTLGWSYQ 383
>gi|147820345|emb|CAN63232.1| hypothetical protein VITISV_004830 [Vitis vinifera]
Length = 468
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 156/372 (41%), Positives = 223/372 (59%), Gaps = 25/372 (6%)
Query: 16 PVITIVIISILSPLSLAA-QPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFE 74
P+ + ++ S ++A QP R +G+ L EP + ++ ++T Y+
Sbjct: 13 PISILALVIFFSTTCVSATQPHLLRLG--VLGR-TFLREPATFSESVSEE---FQTFYYN 66
Query: 75 QRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
Q LDHF++ TF QRY++N +W G N PIF Y G E D+++ GF+ D A
Sbjct: 67 QTLDHFNYRPESYITFQQRYVVNFKYWGGANASAPIFAYLGXEADLDYDLSGIGFLTDNA 126
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
+F A+LV+ EHRYYG+S+P GS E A++NA+TL Y + QA+AD+A + +K+ L AE
Sbjct: 127 HQFKALLVYIEHRYYGKSIPXGSREEAFKNASTLGYFNSAQAIADYAEVLIYIKKKLLAE 186
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRE 252
SPV++ G SYGGMLA+W RLKYPH+A+GALASSAPIL F+DI P +++IV+ DF+
Sbjct: 187 NSPVIVVGASYGGMLASWFRLKYPHVALGALASSAPILYFDDITPHNGYFSIVTKDFREA 246
Query: 253 SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDY 312
S SC+ TI ESW E+ V + NGL L+K F +C +LN++++L ++LE YS A D
Sbjct: 247 SESCYKTISESWSEIDRVASEPNGLSILSKKFRICGQLNNSDELKEFLERTYSIAAQYDA 306
Query: 313 PYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF---QLDDD 369
P P YP+ VC ID AP+ + IL RIF GV + GN+ C+ D
Sbjct: 307 P---------PSYPVTVVCGGIDGAPEGSDILSRIFAGVVA---FRGNMSCYYTSNTTDY 354
Query: 370 P-HGLDGWNWQV 380
P + GW WQ
Sbjct: 355 PIETIQGWGWQT 366
>gi|225436405|ref|XP_002271797.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
Length = 510
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 158/383 (41%), Positives = 226/383 (59%), Gaps = 27/383 (7%)
Query: 3 KTRQQNQNSLYLSP--VITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQR 60
K+ N +SL +S +++++ + I+ P A PSK R + + +E
Sbjct: 5 KSTAMNSSSLMISLQWLLSLMFL-IIFPTCATATPSKLPRLSTILRESEIFSELISDD-- 61
Query: 61 QQQQQYRYETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI 118
++T ++ Q LDHF++ TF QRY++N +W G N PIF Y G E +
Sbjct: 62 -------FQTFFYNQTLDHFNYRPESYYTFQQRYVMNFKYWGGANASAPIFAYLGAEAAL 114
Query: 119 EWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADF 178
++ GF D A +F A+LV+ EHRYYG+S+P+GS E A +NA+T Y + QA+AD+
Sbjct: 115 DFDLTGVGFPVDNALQFKALLVYIEHRYYGQSIPFGSREEALKNASTRGYFNSAQAIADY 174
Query: 179 AVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP 238
A + +K+ L AE SPV++ GGSYGGMLA+W RLKYPH+A+GALASSAPIL F+DI P
Sbjct: 175 AEVLEYIKKKLLAENSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQ 234
Query: 239 ETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD 298
+Y+IV+ DF+ S SC++TI+ESW E+ V + NGL L+K F C ELN + +L D
Sbjct: 235 NGYYSIVTKDFREASESCYSTIRESWSEIDRVASEPNGLSILSKKFRTCAELNKSNELKD 294
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYT 358
+LE+ Y+ A ++P P YP+ VC ID AP+ + IL RIF GV Y
Sbjct: 295 YLETMYAVAAQYNHP---------PRYPVTVVCGGIDGAPEGSDILSRIFAGVVA---YR 342
Query: 359 GNVDCFQLDDDP-HGLDGWNWQV 380
GN C+ +P +GW WQ
Sbjct: 343 GNSSCYNTSVNPTETSEGWRWQT 365
>gi|222635987|gb|EEE66119.1| hypothetical protein OsJ_22160 [Oryza sativa Japonica Group]
Length = 393
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 140/257 (54%), Positives = 183/257 (71%), Gaps = 6/257 (2%)
Query: 128 VWDIAPRFGAML--VFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
++DIAP FGA+L + +HR+YGES P+G+ + + L YLT+ QALADFAV IT+L
Sbjct: 1 MFDIAPSFGALLHDLSKQHRFYGESKPFGNE--SNSSPEKLGYLTSTQALADFAVLITSL 58
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K NLSA +SPVV+FGGSYGGMLA+W RLKYPH+ IGA+ASSAPILQF+ I P +FY V
Sbjct: 59 KHNLSAVSSPVVVFGGSYGGMLASWFRLKYPHVTIGAVASSAPILQFDYITPWSSFYEAV 118
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
S D+K ES +CF+ IK +W + G + GLL+L+KTF C+ + S +WL +A+
Sbjct: 119 SQDYKSESFNCFSVIKAAWDLIDERGSTDAGLLQLSKTFRACKTVKSVYSFRNWLWTAFV 178
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQ 365
Y AMVDYP P++F+M LP YPI+E+CK I P I+++ F S+YYNYTG+ CFQ
Sbjct: 179 YTAMVDYPTPANFLMNLPAYPIKEMCKIIHGFPAGADIVDKAFAAASLYYNYTGDQTCFQ 238
Query: 366 LDD--DPHGLDGWNWQV 380
L+D DPHGL GW WQ
Sbjct: 239 LEDGEDPHGLSGWGWQA 255
>gi|350415944|ref|XP_003490799.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Bombus impatiens]
Length = 494
Score = 290 bits (743), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 148/334 (44%), Positives = 210/334 (62%), Gaps = 14/334 (4%)
Query: 56 QRQQRQQQQ----QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
Q Q R Q + +Y+YE + + +DHFSF+ TF RYL+N + + PIF Y
Sbjct: 32 QNQLRTQNELYSGKYKYEIKTIDMPVDHFSFSVPDTFKLRYLVNNTWQIKKD--APIFFY 89
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
GNEG+IE FA N+GF+WDIAP FGA+L+F EHRYYGESMPYG+ +Y + L YLT+
Sbjct: 90 TGNEGNIENFAQNTGFMWDIAPEFGALLIFAEHRYYGESMPYGNK--SYMDLNHLGYLTS 147
Query: 172 EQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ 231
QALAD+ I +K + SPV++FGGSYGGML+AW+R+KYPH+ GA+ASSAPILQ
Sbjct: 148 RQALADYVDLIQYVKSKPEYKYSPVIVFGGSYGGMLSAWIRMKYPHVVQGAIASSAPILQ 207
Query: 232 FEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN 291
F + E F IV+SDFK +C I+ SW ++++ + G L ++ LC+ L
Sbjct: 208 FSGVTECEAFVRIVTSDFKTAHTNCPKLIRRSWNTIINITSSDKGREWLADSWKLCQPLK 267
Query: 292 STED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN-APDATSILERI 347
++ D L +LE Y LA+V+YPY ++F+ PLP YP++ VCK + N + T +L I
Sbjct: 268 NSSDVQQLMSYLEDIYINLAVVNYPYEANFLTPLPAYPVKAVCKHLTNQSLVGTELLIAI 327
Query: 348 FEGVSVYYNYTGNVDCFQLDDD-PH-GLDGWNWQ 379
+ ++++ NY+ C L+D P G GW +Q
Sbjct: 328 HKAINIFTNYSSETKCLNLNDSVPQLGAVGWPFQ 361
>gi|255565527|ref|XP_002523754.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537058|gb|EEF38694.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
Length = 517
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 147/318 (46%), Positives = 200/318 (62%), Gaps = 19/318 (5%)
Query: 70 TRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPNRLG-PIFLYCGNEGDIEWFAVNSG 126
T ++ Q+LDHF+F TF QRY+IN +W GPNR P+ ++ G E +IE G
Sbjct: 69 TFFYPQKLDHFNFKPESYATFKQRYVINFKYWSGPNRTSVPLLVFFGAEENIEDDVDGIG 128
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+ DIA F +LV+ EHRYYG+S+P+GS++ ++NA++L Y ++ QA+AD+A I ++K
Sbjct: 129 FLNDIASHFKGLLVYIEHRYYGKSIPFGSSKEVFRNASSLGYFSSAQAIADYAAIIMHVK 188
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ SA+ SPV++ GGSYGGMLA+W RLKYPH+A+GALASSAPIL FEDI P +Y+IV+
Sbjct: 189 KKYSAKTSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFEDIAPRNGYYSIVT 248
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSY 306
DFK S SC+ TI++SW E+ V K NGL L+K F C L T +L D+L+S YS
Sbjct: 249 KDFKETSESCYQTIRKSWAEIEKVASKRNGLSILSKKFKTCNPLKRTFELKDYLDSIYSE 308
Query: 307 LAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQL 366
A + P P YP+ VC ID AP T +L RIF GV Y G+ C+ +
Sbjct: 309 AAQYNDP---------PRYPVTIVCGGIDGAPKGTDVLGRIFAGVVA---YMGDRSCYDV 356
Query: 367 DDDPHGLD----GWNWQV 380
+ H D W WQ
Sbjct: 357 NGYNHPTDATSLAWRWQT 374
>gi|156369614|ref|XP_001628070.1| predicted protein [Nematostella vectensis]
gi|156215037|gb|EDO36007.1| predicted protein [Nematostella vectensis]
Length = 436
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 146/314 (46%), Positives = 202/314 (64%), Gaps = 16/314 (5%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
LDHF+F TFSQRYL+N +W + GPIF Y GNEGDI WFA N+GF+WD A FG
Sbjct: 1 LDHFNFRTSATFSQRYLVNIANW---RKGGPIFFYTGNEGDITWFANNTGFMWDNAKEFG 57
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASP 195
AMLVF EHRYYGE++P+G + +Y++ L YL++EQALADFA I ++K A SP
Sbjct: 58 AMLVFAEHRYYGETLPFG--KRSYESPKYLGYLSSEQALADFATLIRHIKLTTPGATGSP 115
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESA- 254
V+ GGSYGGML++W+R+KYP++ ALA+SAPIL F+ + P E F IV+ DF R+
Sbjct: 116 VIAIGGSYGGMLSSWIRMKYPNLVTAALAASAPILYFQGLTPCEGFNEIVTKDFHRDGGD 175
Query: 255 SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS---TEDLADWLESAYSYLAMVD 311
SC N+I++SW + +G ++G LT F+ C + + L DWL ++ LAMV+
Sbjct: 176 SCVNSIRKSWSVIEKLGATQSGRKTLTSVFNTCSPIKTKYNVTQLQDWLSETWANLAMVN 235
Query: 312 YPYPSDFMMPLPGYPIREVCKKIDNA--PDATSILERIFEGVSVYYNYTGNVDCFQLDD- 368
YPY + F+ PLP +PI++VC + +A PD ++L+ + V VYYN +G CF L
Sbjct: 236 YPYAATFLEPLPAWPIKKVCSHLTDADLPD-VALLKAVAGAVGVYYNSSGQAKCFNLSQQ 294
Query: 369 --DPHGLDGWNWQV 380
G GW++Q
Sbjct: 295 AVSSLGDKGWDFQA 308
>gi|147792728|emb|CAN64374.1| hypothetical protein VITISV_018664 [Vitis vinifera]
Length = 502
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 153/359 (42%), Positives = 212/359 (59%), Gaps = 24/359 (6%)
Query: 25 ILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSF-- 82
I+ P A PSK R + + +E ++T ++ Q LDHF++
Sbjct: 20 IIFPTCATATPSKLPRLSTILRESEIFSELISDD---------FQTFFYNQTLDHFNYRP 70
Query: 83 ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFP 142
TF QRY++N +W G N PIF Y G E +++ GF D A +F A+LV+
Sbjct: 71 ESYYTFQQRYVMNFKYWGGANASAPIFAYLGAEAALDFDLTGVGFPVDNALQFKALLVYI 130
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGS 202
EHRYYG+S+P+GS E A +NA+T Y + QA+AD+A + +K+ L AE SPV++ GGS
Sbjct: 131 EHRYYGQSIPFGSREEALKNASTRGYFNSAQAIADYAEVLEYIKKKLLAENSPVIVIGGS 190
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKE 262
YGGMLA+W RLKYPH+A+GALASSAPIL F+DI P +Y+IV+ DF+ S SC++TI+E
Sbjct: 191 YGGMLASWFRLKYPHVALGALASSAPILYFDDITPQNGYYSIVTKDFREASESCYSTIRE 250
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPL 322
SW E+ V + NGL L+K F C ELN + +L D+LE+ Y+ A ++P
Sbjct: 251 SWSEIDRVASEPNGLSILSKKFRTCAELNKSNELKDYLETMYAVAAQYNHP--------- 301
Query: 323 PGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDP-HGLDGWNWQV 380
P YP+ VC ID AP+ + IL RIF GV Y GN C+ +P +GW WQ
Sbjct: 302 PRYPVTVVCGGIDGAPEGSDILSRIFAGVVA---YRGNSSCYNTSVNPTETSEGWRWQT 357
>gi|297734875|emb|CBI17109.3| unnamed protein product [Vitis vinifera]
Length = 975
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 145/316 (45%), Positives = 200/316 (63%), Gaps = 15/316 (4%)
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++T ++ Q LDHF++ TF QRY++N +W G N PIF Y G E +++
Sbjct: 34 FQTFFYNQTLDHFNYRPESYYTFQQRYVMNFKYWGGANASAPIFAYLGAEAALDFDLTGV 93
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF D A +F A+LV+ EHRYYG+S+P+GS E A +NA+T Y + QA+AD+A + +
Sbjct: 94 GFPVDNALQFKALLVYIEHRYYGQSIPFGSREEALKNASTRGYFNSAQAIADYAEVLEYI 153
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K+ L AE SPV++ GGSYGGMLA+W RLKYPH+A+GALASSAPIL F+DI P +Y+IV
Sbjct: 154 KKKLLAENSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQNGYYSIV 213
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
+ DF+ S SC++TI+ESW E+ V + NGL L+K F C ELN + +L D+LE+ Y+
Sbjct: 214 TKDFREASESCYSTIRESWSEIDRVASEPNGLSILSKKFRTCAELNKSNELKDYLETMYA 273
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQ 365
A ++P P YP+ VC ID AP+ + IL RIF GV Y GN C+
Sbjct: 274 VAAQYNHP---------PRYPVTVVCGGIDGAPEGSDILSRIFAGVVA---YRGNSSCYN 321
Query: 366 LDDDP-HGLDGWNWQV 380
+P +GW WQ
Sbjct: 322 TSVNPTETSEGWRWQT 337
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 157/381 (41%), Positives = 218/381 (57%), Gaps = 17/381 (4%)
Query: 1 MPKTRQQNQNSLYLSPVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQR 60
+ TR++ SL P + +I LS AAQ + R P G L + P+
Sbjct: 471 LDATRKRTLYSLQWLPFLIPTLI--LSCCVSAAQFNVPRLGPLSRGILRN----PEPAAV 524
Query: 61 QQQQQYRYETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI 118
+ +T ++ Q LDHF++ TF QRY++N HW G PIF Y G E +
Sbjct: 525 SESFYKDLKTFFYAQTLDHFNYRPESYKTFRQRYVMNFKHWGGAKAGAPIFAYLGAEAPL 584
Query: 119 EWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADF 178
+ VN GFV D A RF A+L++ EHRYYG+S+P+GST+VA +NA+TL Y + QA+AD+
Sbjct: 585 DGDLVNIGFVNDNAARFNALLIYIEHRYYGKSIPFGSTKVALKNASTLGYFNSAQAIADY 644
Query: 179 AVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP 238
A + ++K+ L A+ SPV++ GGSYGGMLA+W RLKYPHIA+GALASSAPIL F++I P
Sbjct: 645 AAVLMHVKKRLHAQNSPVIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDEIAPE 704
Query: 239 ETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD 298
+Y+IV+ DF+ S SC+ TI+ SW E+ + K NGL L+K F C L S+ +L D
Sbjct: 705 IGYYSIVTKDFREASESCYRTIRRSWSEIDRIASKPNGLSILSKRFKTCAHLESSFELKD 764
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYT 358
+L+S Y+ A + P P YP+ VCK I+ A T L RIF G+
Sbjct: 765 YLDSIYAEAAQYNEP---------PTYPVTVVCKGINGASKRTDTLGRIFHGLVAIAGKR 815
Query: 359 GNVDCFQLDDDPHGLDGWNWQ 379
D + + GW WQ
Sbjct: 816 SCYDTKEFNYPTETYLGWRWQ 836
>gi|225436414|ref|XP_002272152.1| PREDICTED: lysosomal Pro-X carboxypeptidase [Vitis vinifera]
Length = 493
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 199/317 (62%), Gaps = 16/317 (5%)
Query: 67 RYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
+ T ++ Q LDHF++ TF QRY++N+++W G N PIF+Y G+E I A
Sbjct: 47 EFVTYFYNQTLDHFNYKPESYRTFQQRYIMNSEYWGGANSSSPIFVYTGDEASITAVAAF 106
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+GF+ ++A RF +L++ EHRYYG+S+P+GS + A+ N +TL Y T+ QALAD+A ITN
Sbjct: 107 AGFIVELASRFNGLLLYIEHRYYGDSVPFGSKDEAFSNTSTLGYFTSTQALADYAELITN 166
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
LK+NLSAE PV+ GGSYGGMLA+W RLKYPHI IGALASSAPIL F+DI P ++ I
Sbjct: 167 LKKNLSAENCPVIAIGGSYGGMLASWFRLKYPHIVIGALASSAPILYFDDITPGNAYHVI 226
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY 304
V+ DF+ S SC++TI++SW E+ V + NGL L++ F C LNS+++L +L Y
Sbjct: 227 VTKDFRETSESCYSTIRDSWSEIDKVAAEPNGLANLSQIFMTCEPLNSSQELKYYLALCY 286
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC- 363
A D P P YP+++VC ID AP+ T I+ R+ G+ N + C
Sbjct: 287 VVSAQNDNP---------PAYPVKKVCDAIDGAPEGTDIIGRVAAGL----NASVGPPCH 333
Query: 364 FQLDDDPHGLDGWNWQV 380
F D P W WQ
Sbjct: 334 FVYDFKPSNRSEWTWQT 350
>gi|190702489|gb|ACE75375.1| prolylcarboxypeptidase [Glyptapanteles indiensis]
Length = 497
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 207/322 (64%), Gaps = 12/322 (3%)
Query: 66 YRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y+Y T+ F +DHFSF+ TF RY +N D W + PIF Y GNEG +E FA N+
Sbjct: 42 YKYVTKKFIVPVDHFSFSLNNTFEMRYFVN-DTWKN-GKNAPIFFYTGNEGVLETFAANT 99
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+W+IAP FGA++VF EHRYYGESMP+G+ ++ N L YLT++QALAD+ I +L
Sbjct: 100 GFMWEIAPTFGALIVFAEHRYYGESMPFGNK--SFDNVKNLGYLTSQQALADYVDLIVHL 157
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K + S + SPV+ FGGSYGGML+AW R+KYPHI GA+A+SAP+LQF + F IV
Sbjct: 158 KSDPSLKHSPVIAFGGSYGGMLSAWFRMKYPHIINGAIAASAPVLQFTGYTDCQAFSRIV 217
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WLES 302
+SDF+ +C I++SW + ++ ++G ++ F +C L + E+ +L S
Sbjct: 218 TSDFRAVHPNCEKVIRKSWESIKNLTSTDDGKKWISSEFKVCGSLTTEENFETFKYFLLS 277
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDAT--SILERIFEGVSVYYNYTGN 360
YS LAMV+YPY +DF+ PLP YPIRE CK ++N ++ ++ I +G+++Y NYTG
Sbjct: 278 VYSNLAMVNYPYATDFLSPLPAYPIREFCKFVNNTNLSSDKDVMTNIQKGINLYSNYTGK 337
Query: 361 VDCFQLDD---DPHGLDGWNWQ 379
+ C L + D GW++Q
Sbjct: 338 LKCLDLSNPEPDLGAASGWDYQ 359
>gi|297734880|emb|CBI17114.3| unnamed protein product [Vitis vinifera]
Length = 428
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 199/317 (62%), Gaps = 16/317 (5%)
Query: 67 RYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
+ T ++ Q LDHF++ TF QRY++N+++W G N PIF+Y G+E I A
Sbjct: 47 EFVTYFYNQTLDHFNYKPESYRTFQQRYIMNSEYWGGANSSSPIFVYTGDEASITAVAAF 106
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+GF+ ++A RF +L++ EHRYYG+S+P+GS + A+ N +TL Y T+ QALAD+A ITN
Sbjct: 107 AGFIVELASRFNGLLLYIEHRYYGDSVPFGSKDEAFSNTSTLGYFTSTQALADYAELITN 166
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
LK+NLSAE PV+ GGSYGGMLA+W RLKYPHI IGALASSAPIL F+DI P ++ I
Sbjct: 167 LKKNLSAENCPVIAIGGSYGGMLASWFRLKYPHIVIGALASSAPILYFDDITPGNAYHVI 226
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY 304
V+ DF+ S SC++TI++SW E+ V + NGL L++ F C LNS+++L +L Y
Sbjct: 227 VTKDFRETSESCYSTIRDSWSEIDKVAAEPNGLANLSQIFMTCEPLNSSQELKYYLALCY 286
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC- 363
A D P P YP+++VC ID AP+ T I+ R+ G+ N + C
Sbjct: 287 VVSAQNDNP---------PAYPVKKVCDAIDGAPEGTDIIGRVAAGL----NASVGPPCH 333
Query: 364 FQLDDDPHGLDGWNWQV 380
F D P W WQ
Sbjct: 334 FVYDFKPSNRSEWTWQT 350
>gi|297734879|emb|CBI17113.3| unnamed protein product [Vitis vinifera]
Length = 623
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 147/319 (46%), Positives = 202/319 (63%), Gaps = 19/319 (5%)
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ T Y+ Q LDHF++ F QRYLIN+ +W G N PIF+Y G+EG I A +
Sbjct: 50 FVTYYYNQTLDHFNYRPESYTNFQQRYLINSAYWGGANSSSPIFVYTGDEGSITGAAAFA 109
Query: 126 GFVWDIAPRFGAMLVF---PEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
GF+ D+A RF +L++ +HRYYG+S+P+ S ++A+ N +TL Y ++ QALAD+A I
Sbjct: 110 GFMVDLASRFKGLLLYYLILQHRYYGDSVPFRSKDIAFNNTSTLGYFSSTQALADYAELI 169
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
TNLK+NLSAE PV+ GGSYGGMLA+W RLKYPH+ IGALASSAPIL F+DI P +
Sbjct: 170 TNLKKNLSAENCPVIAIGGSYGGMLASWFRLKYPHVVIGALASSAPILYFDDITPHNGYD 229
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES 302
+IV+ DF+ S +C+ TIK+SW E+ V + NGL L++ F C LNS++ L +LE
Sbjct: 230 SIVTKDFRETSETCYTTIKQSWSEIDEVAGQPNGLANLSQIFRTCEPLNSSQQLKLYLE- 288
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVD 362
Y+Y A Y P P + + ++C ID AP+ TSIL R+ EGV N +
Sbjct: 289 -YTYEASAQYDNP-------PAHYVSDICNAIDGAPEGTSILGRVAEGV----NASAGPP 336
Query: 363 CFQLDD-DPHGLDGWNWQV 380
C ++ D P + GW WQ
Sbjct: 337 CHRIYDFQPSNMSGWLWQT 355
Score = 43.1 bits (100), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 28/45 (62%)
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED 295
+ S SC+ TI+ESW E+ V + NGLL L+K F C+ T++
Sbjct: 530 QASESCYKTIQESWPEINKVASEPNGLLILSKKFGTCKGEQKTQE 574
>gi|356541970|ref|XP_003539445.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 509
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 146/314 (46%), Positives = 200/314 (63%), Gaps = 11/314 (3%)
Query: 69 ETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+T Y+ QRLDHF++ TF QRY++N +W G PIF + G EG ++ A G
Sbjct: 56 KTFYYTQRLDHFNYRPDSYHTFQQRYMVNFKYWGGAKSSAPIFAFFGAEGPVDEDAKYIG 115
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+ D AP+F A++VF EHRYYG+S+P+GS+E A +NA+T Y + QA+AD+A + ++K
Sbjct: 116 FLRDNAPQFNALIVFIEHRYYGKSIPFGSSEEAMRNASTRGYFNSAQAIADYAAVLLHIK 175
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ LSA+ SP+++ GGSYGGMLA+W RLKYPHIA+GALASSAPIL F I P +Y IV+
Sbjct: 176 KTLSAQNSPIIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFNGIAPQAGYYYIVT 235
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSY 306
DFK S SC+ TI++SW E+ V +K NGL L+K F C +LN + DL D+L+S Y+
Sbjct: 236 KDFKETSESCYQTIRKSWSEIDRVAKKPNGLSILSKRFKTCDKLNKSFDLKDYLDSLYTD 295
Query: 307 LAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQL 366
A Y YPS+ +P++ VC ID A T IL +IFEGV Y + D +
Sbjct: 296 AAQ--YNYPSE-------HPVKIVCGAIDAAAKKTDILGQIFEGVVAYKQHRSCYDMNEY 346
Query: 367 DDDPHGLDGWNWQV 380
+ GW WQ
Sbjct: 347 NHPTESFLGWRWQT 360
>gi|383861707|ref|XP_003706326.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Megachile
rotundata]
Length = 493
Score = 287 bits (735), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 152/335 (45%), Positives = 207/335 (61%), Gaps = 16/335 (4%)
Query: 56 QRQQRQQ----QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
+RQ R + +Y+YE + + +DHF FA + F RYL+N D WV N PIF Y
Sbjct: 31 KRQFRSRIELVNAKYKYEYKTIDMPVDHFDFASVDKFKLRYLMN-DTWVKTNN-APIFFY 88
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
GNEGDIE FA NSGF+WDIAP FGA+L+F EHRYYGESMPYG+ +Y + L YL++
Sbjct: 89 TGNEGDIEGFAQNSGFMWDIAPEFGALLIFAEHRYYGESMPYGNK--SYTDIKYLGYLSS 146
Query: 172 EQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ 231
EQALAD+ I L+ + + SPV++FGGSYGGML+AWMR+KYPHI GA+A SAPILQ
Sbjct: 147 EQALADYVDLIQYLRSDSKHKHSPVIVFGGSYGGMLSAWMRMKYPHIVQGAIACSAPILQ 206
Query: 232 FEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL- 290
F E F IV+SDFK +C I++SW + ++ + G L++ + LC+ L
Sbjct: 207 F--TTECEVFSRIVTSDFKMAHRNCPKLIRKSWNAINNITSNDEGKKWLSENWKLCQPLK 264
Query: 291 --NSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN-APDATSILERI 347
N E +L Y AMV+YPY SDF+ PLP +PI+ C+ + N + +L +
Sbjct: 265 NENDVEVFKSYLSDIYGNFAMVNYPYASDFLAPLPPFPIKVACEYLTNESLVGKELLTAL 324
Query: 348 FEGVSVYYNYTGNVDCFQLDDDPHGLD--GWNWQV 380
+ V+V+ NYTG C L+ L+ GW++Q
Sbjct: 325 HDAVNVFTNYTGETKCLNLNTSTPQLNDRGWHFQA 359
>gi|147820652|emb|CAN65360.1| hypothetical protein VITISV_036071 [Vitis vinifera]
Length = 580
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 155/368 (42%), Positives = 212/368 (57%), Gaps = 23/368 (6%)
Query: 22 IISILSPLSLAAQPSKFRRAPRFV----GKLPHLTEPPQRQQRQQQQQYRYETRYFEQRL 77
+I + S++A PSK + PR L L + ++T ++ Q L
Sbjct: 12 LILLFITASVSATPSK--KIPRLGVLRGSSLSVLEGSSSLRTVSVNLSENFQTFFYPQTL 69
Query: 78 DHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRF 135
DHF++ TF RY++N ++W G IF+Y G E D++ + GF+ + RF
Sbjct: 70 DHFNYRPESYTTFQHRYMVNFNYWGGARSAAXIFVYLGEESDLDKDINSIGFLVENGARF 129
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
GA+LV+ EHRYYG+S P+GS + + QNA Y + QALADFA I NLK+NLSA++SP
Sbjct: 130 GALLVYIEHRYYGKSNPFGSMQKSLQNAGQRGYFNSGQALADFAEVIINLKKNLSADSSP 189
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V++ GGS GG+LAAW RLKYPH+A+GALASSAPIL F+DI P + +Y++V+ DF+ S S
Sbjct: 190 VIVVGGSDGGLLAAWFRLKYPHVALGALASSAPILYFDDITPQDGYYSLVTKDFRDFSES 249
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYP 315
C+NTIK+SW E+ + NGLL L+K F C+ L S L D+LE+ YS A D P
Sbjct: 250 CYNTIKDSWAEIDKAAAEANGLLNLSKKFRTCKPLESASQLKDYLETMYSIAAQYDRP-- 307
Query: 316 SDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDD---PHG 372
P YP+ VC ID T IL RIF G+ GN C+ +
Sbjct: 308 -------PMYPVTIVCNGIDGGLQGTDILGRIFSGIVA---SRGNKSCYDMGQSSFPSET 357
Query: 373 LDGWNWQV 380
+GWNWQV
Sbjct: 358 EEGWNWQV 365
>gi|224109890|ref|XP_002315345.1| predicted protein [Populus trichocarpa]
gi|222864385|gb|EEF01516.1| predicted protein [Populus trichocarpa]
Length = 513
Score = 285 bits (728), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 151/359 (42%), Positives = 213/359 (59%), Gaps = 26/359 (7%)
Query: 31 LAAQPSKFRRAPRF-----VGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSF--A 83
A PSK PR G HL + + Q++R T Y+ Q LDHF++
Sbjct: 27 FCASPSK---VPRLGVHGPYGARNHLGKVKVQSLAPSDQEFR--TFYYNQTLDHFNYRPE 81
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPE 143
TF RY+++ HW GP+ + PIF+Y G E + G + D A RFGA+ V+ E
Sbjct: 82 SYKTFQHRYVVSFKHWRGPDTMAPIFVYLGEESSLNDDLGYIGILSDNAARFGALQVYIE 141
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSY 203
HR+YGES+P+ S E A ++A Y ++ Q LAD+A I ++K+ SA++SPV++FGGSY
Sbjct: 142 HRFYGESIPFVSREEALKDANLRGYFSSAQTLADYAEVILHIKKKHSADSSPVIVFGGSY 201
Query: 204 GGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKES 263
GGMLAAW RLKYPH+A+GALASSAP+L F++I P +Y +V+ DFK S SC+ TIK+S
Sbjct: 202 GGMLAAWFRLKYPHVALGALASSAPVLYFDNITPSNGYYTVVTKDFKESSESCYKTIKQS 261
Query: 264 WGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLP 323
W E+ V K +GL L K F+ C+ L + +L ++L+S +S A D P P
Sbjct: 262 WFEIDKVAAKADGLSILQKKFNTCKPLEAATELKNFLDSLFSVAAQYDRP---------P 312
Query: 324 GYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDD--DPHGLDGWNWQV 380
YP+ VCK ID+AP+ + +L+RIF G+ Y+ G C+ LD L+GW WQ
Sbjct: 313 RYPVDLVCKGIDSAPEGSDVLDRIFSGIVAYF---GKKPCYNLDAFFSSETLEGWTWQT 368
>gi|307189813|gb|EFN74085.1| Lysosomal Pro-X carboxypeptidase [Camponotus floridanus]
Length = 453
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 145/313 (46%), Positives = 201/313 (64%), Gaps = 9/313 (2%)
Query: 74 EQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAP 133
E ++DHFSFA TF+ RYLIN D W + PIF Y GNEG IE FA N+GF+W+IAP
Sbjct: 15 EYQVDHFSFAVQDTFNLRYLIN-DTWCKTVKNAPIFFYTGNEGRIELFAENTGFLWEIAP 73
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA 193
+FGA+++F EHRYYGES+PYG+ A N L YLT++QALAD+ I L+ E
Sbjct: 74 KFGALVIFAEHRYYGESLPYGNQSFA--NPRYLGYLTSQQALADYVELIGYLRSKEGFEF 131
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRES 253
SPV++FGGSYGGML+AWMR+KYPHI GA+A+SAPILQF DIV + F I +SD+ +
Sbjct: 132 SPVIVFGGSYGGMLSAWMRIKYPHIVQGAIAASAPILQFTDIVECDAFARIATSDYYVSN 191
Query: 254 ASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESAYSYLAMV 310
+C I+++W + V + G L+ + LC L + ED L D+L+ Y +AMV
Sbjct: 192 PTCPQLIRKAWKTITEVTSNDEGKKWLSSNWKLCEPLKTEEDVQVLKDFLQEIYINMAMV 251
Query: 311 DYPYPSDFMMPLPGYPIREVCKKIDNAP-DATSILERIFEGVSVYYNYTGNVDC-FQLDD 368
+YPY ++F+ PLP PI C+ + N+ +L ++ ++VY NYTG +C F +
Sbjct: 252 NYPYETNFLAPLPANPINVFCQHLTNSSLTGKPLLSTLYRAINVYTNYTGKANCTFTKNT 311
Query: 369 DPH-GLDGWNWQV 380
P G +GW++Q
Sbjct: 312 TPTLGDEGWDYQA 324
>gi|195385893|ref|XP_002051639.1| GJ16686 [Drosophila virilis]
gi|194148096|gb|EDW63794.1| GJ16686 [Drosophila virilis]
Length = 469
Score = 284 bits (726), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 148/327 (45%), Positives = 197/327 (60%), Gaps = 12/327 (3%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
+Q+++Y + F LDHFSF +FS RYL N + N PIF Y GNEGDIEWFA
Sbjct: 21 EQKFQYAIKEFSVPLDHFSFLSNESFSIRYLYNDSYVDKANSQSPIFFYTGNEGDIEWFA 80
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
NSGF+W++A + A++VF EHRYYG+SMP+G L+Y T EQ L D+A+ I
Sbjct: 81 QNSGFIWELAAKLRALVVFAEHRYYGKSMPFGGDTFNTSKPEHLAYFTVEQTLEDYALLI 140
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
T L+ + + PVV FGGSYGGMLAAW R+KYPHI IGALA+SAPILQF + P + F
Sbjct: 141 TFLR---NGQQLPVVAFGGSYGGMLAAWFRMKYPHIVIGALAASAPILQFPGLTPCDIFE 197
Query: 243 NIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL---AD 298
I +S F+ +A+C I +SW + S+ + G +L FHLC + + +DL D
Sbjct: 198 KITTSVFETAYNANCSANIGKSWKAIESLAATDAGKKQLGDMFHLCDPVKNGDDLNKFMD 257
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEGVSVYYNY 357
+LE Y LAMV+YPY S F+ PLP YP+R+ C + D P+ T +L + + VY NY
Sbjct: 258 YLEEVYGNLAMVNYPYNSTFLAPLPAYPVRQFCYYLKDLQPNDTDLLASLASALGVYTNY 317
Query: 358 TGNVDCFQLDDDPHGLD----GWNWQV 380
TG C D G + GW+ Q
Sbjct: 318 TGASKCVDYKDSNSGANYDSRGWDIQT 344
>gi|359479403|ref|XP_003632268.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Vitis vinifera]
Length = 503
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 152/366 (41%), Positives = 212/366 (57%), Gaps = 20/366 (5%)
Query: 18 ITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRL 77
I +++I + A QP K +G+ L EP + Q ++T Y+ Q L
Sbjct: 16 ILVLVIFFCATCVSATQP-KILPKLSVLGRT-FLREPATFSESNSQD---FQTFYYNQTL 70
Query: 78 DHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRF 135
DHF++ TF RY++N +W G N PIF Y G E D++ GF+ D A RF
Sbjct: 71 DHFNYRPESYTTFQHRYVMNFKYWGGANASAPIFAYLGAEEDLDXILSGLGFLTDNAHRF 130
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
A+LV+ EHRY G+S+P+GS E A +NA+ Y + QA+AD+A + +K+ L AE SP
Sbjct: 131 KALLVYIEHRYCGKSIPFGSREEALKNASIRGYFNSAQAIADYAEVLIYIKKKLLAENSP 190
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V++ GGSYGGMLA+W RLKYPH+A+G LASSAPIL FEDI P +Y+IV+ DF+ S S
Sbjct: 191 VIVVGGSYGGMLASWFRLKYPHVALGVLASSAPILYFEDITPQNGYYSIVTKDFREASES 250
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYP 315
C+ TI+ESW E+ V + NG+ L+K F C LN++++L D+L++ Y A + P
Sbjct: 251 CYKTIRESWSEIDRVASEPNGISILSKKFRTCDRLNNSDELKDYLDTIYCTAAQYNDP-- 308
Query: 316 SDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDP-HGLD 374
P YP+ C ID AP+ + IL RIF GV Y GN C+ +P +
Sbjct: 309 -------PMYPVTMACSGIDGAPEGSDILSRIFAGVVA---YGGNSSCYTTSHNPTETSE 358
Query: 375 GWNWQV 380
GW WQ
Sbjct: 359 GWAWQT 364
>gi|432102770|gb|ELK30246.1| Lysosomal Pro-X carboxypeptidase [Myotis davidii]
Length = 353
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 136/265 (51%), Positives = 179/265 (67%), Gaps = 8/265 (3%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+Y YFEQ++DHF F + TF+QRYLI HW G I Y GNEGDI WF N+G
Sbjct: 46 KYSIHYFEQKIDHFGFKNDKTFNQRYLIADQHWRKEG--GSILFYTGNEGDIIWFCNNTG 103
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+WD+A AMLVF EHRYYG+S+P+G+ ++Q++ L++LT+EQALADFA I +LK
Sbjct: 104 FMWDVAEELKAMLVFAEHRYYGQSLPFGAD--SFQDSRHLNFLTSEQALADFAELIKHLK 161
Query: 187 QNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ + AE P + GGSYGGMLAAW R+KYPHI +GALA+SAPI QFEDIVP F IV
Sbjct: 162 RTIPGAENQPFIALGGSYGGMLAAWFRMKYPHIVVGALAASAPIWQFEDIVPCGVFMKIV 221
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
++DFKR +C +I+ SW + + + GL L++ HLC L + +D L DW+
Sbjct: 222 TTDFKRSGPNCSESIQSSWDAINRLTRNGTGLDWLSEALHLCTPLKNAQDVQHLKDWISE 281
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPI 327
+ LAMVDYPY S+F+ PLP +PI
Sbjct: 282 TWVNLAMVDYPYESNFLQPLPAWPI 306
>gi|147792727|emb|CAN64373.1| hypothetical protein VITISV_018663 [Vitis vinifera]
Length = 702
Score = 280 bits (717), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 193/313 (61%), Gaps = 11/313 (3%)
Query: 69 ETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+T ++ Q LDHF++ TF QRY++N HW G PIF Y G E ++ VN G
Sbjct: 63 KTFFYAQTLDHFNYRPESYKTFRQRYVMNFKHWGGAKAGAPIFAYLGAEAPLDGDLVNIG 122
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
FV D A RF A+L++ EHRYYG+S+P+GST+VA +NA+TL Y + QA+AD+A + ++K
Sbjct: 123 FVNDNAARFNALLIYIEHRYYGKSIPFGSTKVALKNASTLGYFNSAQAIADYAAVLMHVK 182
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ L A+ SPV++ GGSYGGMLA+W RLKYPHIA+GALASSAPIL F++I P +Y+IV+
Sbjct: 183 KRLHAQNSPVIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDEIAPEIGYYSIVT 242
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSY 306
DF+ S SC+ TI+ SW E+ + K NGL L+K F C L S+ +L D+L+S Y+
Sbjct: 243 KDFREASESCYRTIRRSWSEIDRIASKPNGLSILSKRFKTCAHLESSFELKDYLDSIYAE 302
Query: 307 LAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQL 366
A + P P YP+ VCK I+ A T L RIF G+ D +
Sbjct: 303 AAQYNEP---------PTYPVTVVCKGINGASKRTDTLGRIFHGLVAIAGKRSCYDTKEF 353
Query: 367 DDDPHGLDGWNWQ 379
+ GW WQ
Sbjct: 354 NYPTETYLGWRWQ 366
>gi|443709977|gb|ELU04397.1| hypothetical protein CAPTEDRAFT_174591 [Capitella teleta]
Length = 487
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 139/329 (42%), Positives = 198/329 (60%), Gaps = 12/329 (3%)
Query: 57 RQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLG-PIFLYCGNE 115
R+ + Q Y T Y Q++DHF FA+ T+ QRYL+N HW R G PIF Y GNE
Sbjct: 26 RKGPETNQDISYTTHYITQKVDHFGFANDNTYKQRYLLNDQHW----RPGSPIFFYTGNE 81
Query: 116 GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQAL 175
G I+WF N+G +W+ AP F AML+F EHRYYGES+PYG+ ++ + L+YLT+EQAL
Sbjct: 82 GAIDWFCNNTGIMWEWAPSFNAMLIFAEHRYYGESLPYGNK--SFDSPNHLNYLTSEQAL 139
Query: 176 ADFAVFITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
ADF I ++KQ + A + SPVV FGGSYGGMLAAW+R+KYP +GA A+SAPI +F D
Sbjct: 140 ADFVSLIADVKQRMPATSKSPVVAFGGSYGGMLAAWLRMKYPSAVVGAFAASAPIWEFGD 199
Query: 235 IVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
+VP F + + + + +C I+ SW + + G L LC + S+
Sbjct: 200 LVPLGGFAVVTTKSYASANPNCPIIIRRSWSVMDQLASSVEGREFLASALGLCNPVKSSA 259
Query: 295 DLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEGVSV 353
++ WL S + LAM +YPY ++F+ PLP +P++ +C + D D ++ + V V
Sbjct: 260 EVKSWLSSTWINLAMANYPYKANFLEPLPAWPVKAICSHLSDINLDHKELVHAVRHAVDV 319
Query: 354 YYNYTGNVDCFQLDDDPHGL---DGWNWQ 379
YYNYTG+ C++ + G GW+ Q
Sbjct: 320 YYNYTGSASCYKTSESATGNLGDQGWDIQ 348
>gi|195437264|ref|XP_002066561.1| GK24560 [Drosophila willistoni]
gi|194162646|gb|EDW77547.1| GK24560 [Drosophila willistoni]
Length = 480
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 145/326 (44%), Positives = 197/326 (60%), Gaps = 11/326 (3%)
Query: 61 QQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW 120
Q ++YE + F+ LDHFSF +F+ RYL N N+ PIF Y GNEGDIEW
Sbjct: 27 SDNQPFKYEIKEFQVPLDHFSFLKNASFNIRYLYNNSFADKGNKRSPIFFYTGNEGDIEW 86
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
FA N+GF+W++A + GA++VF EHRYYG+S+P+G L+Y T EQ L DFA+
Sbjct: 87 FAQNTGFLWELAEKQGAVVVFAEHRYYGKSLPFGPNTFNKTMPENLAYFTVEQTLEDFAL 146
Query: 181 FITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET 240
IT LK PVV FGGSYGGMLAAW R+KYPHI IG+LA+SAPILQF I P +
Sbjct: 147 LITYLKNGADL---PVVAFGGSYGGMLAAWFRMKYPHIVIGSLAASAPILQFPGITPCDI 203
Query: 241 FYNIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA-- 297
F I +S F + +C I +SW + +V + G +++ FHLC L +DL
Sbjct: 204 FNKITTSVFHTAYNGNCTVNIGKSWKAIENVASTDAGKKQISDAFHLCNPLKGPDDLTKL 263
Query: 298 -DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID--NAPDATSILERIFEGVSVY 354
D++E Y LAM +YPY + F+ PLP YP+R++C + + DA +L+ + ++VY
Sbjct: 264 LDYIELVYGNLAMANYPYNNSFLAPLPAYPVRQMCFYLKELHKTDA-DLLQAMANALAVY 322
Query: 355 YNYTGNVDCFQLDDDPHGLD-GWNWQ 379
NYTG V C + + + D GWN Q
Sbjct: 323 TNYTGTVKCLDISSNSNADDSGWNIQ 348
>gi|449456064|ref|XP_004145770.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
gi|449496213|ref|XP_004160074.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 499
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 195/316 (61%), Gaps = 16/316 (5%)
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++T Y+ Q LDHF++ F RY+IN +W G N PI Y G EG +E
Sbjct: 55 FKTFYYNQTLDHFNYRPESYTCFPHRYIINFKYWGGANSSAPILAYLGAEGPLEGDLNAI 114
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ D A RF A+LV+ EHRYYG+SMP+GS E A +NA+TL Y ++ QA+AD+A + +L
Sbjct: 115 GFMTDNAARFDALLVYIEHRYYGKSMPFGSREEALKNASTLGYFSSAQAIADYAAVLIHL 174
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
KQ A+ SPV++ GGSYGGMLAAW RLKYPH+A+GALASSAPIL FEDI P +Y+I
Sbjct: 175 KQKYHAKDSPVIVLGGSYGGMLAAWFRLKYPHVALGALASSAPILYFEDITPHNGYYSIA 234
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
+ DF+ S +C+ TI++SW ++ +G K NGL L+K F C LNS+ L D+L S Y+
Sbjct: 235 TKDFREVSETCYETIRDSWSKIEIIGSKPNGLSILSKEFKTCSPLNSSSQLEDYLWSMYA 294
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQ 365
A ++P P YP+ +C ID A + I+ ++ GV + Y GN+ C+
Sbjct: 295 GAAQYNHP---------PRYPVTRICGGIDGASPGSGIISKVAAGV---FAYKGNLSCYN 342
Query: 366 LD--DDPHGLDGWNWQ 379
+ + GW WQ
Sbjct: 343 IGPRSETETDVGWRWQ 358
>gi|225436403|ref|XP_002271733.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Vitis vinifera]
Length = 503
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 193/313 (61%), Gaps = 11/313 (3%)
Query: 69 ETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+T ++ Q LDHF++ TF QRY++N HW G PIF Y G E ++ VN G
Sbjct: 61 KTFFYAQTLDHFNYRPESYKTFRQRYVMNFKHWGGAKAGAPIFAYLGAEAPLDGDLVNIG 120
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
FV D A RF A+L++ EHRYYG+S+P+GST+VA +NA+TL Y + QA+AD+A + ++K
Sbjct: 121 FVNDNAARFNALLIYIEHRYYGKSIPFGSTKVALKNASTLGYFNSAQAIADYAAVLMHVK 180
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ L A+ SPV++ GGSYGGMLA+W RLKYPHIA+GALASSAPIL F++I P +Y+IV+
Sbjct: 181 KRLHAQNSPVIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDEIAPEIGYYSIVT 240
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSY 306
DF+ S SC+ TI+ SW E+ + K NGL L+K F C L S+ +L D+L+S Y+
Sbjct: 241 KDFREASESCYRTIRRSWSEIDRIASKPNGLSILSKRFKTCAHLESSFELKDYLDSIYAE 300
Query: 307 LAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQL 366
A + P P YP+ VCK I+ A T L RIF G+ D +
Sbjct: 301 AAQYNEP---------PTYPVTVVCKGINGASKRTDTLGRIFHGLVAIAGKRSCYDTKEF 351
Query: 367 DDDPHGLDGWNWQ 379
+ GW WQ
Sbjct: 352 NYPTETYLGWRWQ 364
>gi|357606522|gb|EHJ65099.1| putative Lysosomal Pro-X carboxypeptidase [Danaus plexippus]
Length = 467
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 143/320 (44%), Positives = 202/320 (63%), Gaps = 11/320 (3%)
Query: 66 YRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y++ET++F LDHF F TF+ +YLIN ++W GPIF Y GNEG IE FA ++
Sbjct: 18 YKFETKWFNVPLDHFGFQRNETFNIKYLINEEYWDKGG--GPIFFYTGNEGQIEVFAKHT 75
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+WDIA F A LVF EHRYYG+SMP+G+ + ++ + YLT+EQALAD+A I L
Sbjct: 76 GFMWDIAEEFKAKLVFAEHRYYGQSMPFGNKSLDNEH---IGYLTSEQALADYADLINYL 132
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ N PV+ FGGSYGGML+A++R+KYPH+ GA+A+SAPI + +VP E F+ IV
Sbjct: 133 QGNKQRPTYPVIAFGGSYGGMLSAYIRIKYPHLVTGAIAASAPIHMYPGMVPCEVFHRIV 192
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE--DLADWLESA 303
+S FK C I+ SWG L + +N L K ++LC + + L ++L+S
Sbjct: 193 TSSFKIADEKCVKNIRSSWGVLRKFLESQNNTDWLHKNWNLCEPVKPADVNTLMEFLQSM 252
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC 363
Y LAMV+YP+PSDF++PLP P+R VC+ ++ ++E I + + VY NY G C
Sbjct: 253 YETLAMVNYPFPSDFLLPLPAQPVRVVCQYLNETLSGQKLIEAIGKVIKVYSNYDGKAPC 312
Query: 364 --FQLDDDPHGLD--GWNWQ 379
++ DD LD GW++Q
Sbjct: 313 VDYKKGDDFGNLDASGWDYQ 332
>gi|328702869|ref|XP_001949478.2| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Acyrthosiphon
pisum]
Length = 507
Score = 278 bits (712), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 149/344 (43%), Positives = 204/344 (59%), Gaps = 34/344 (9%)
Query: 66 YRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y Y+T YF +DHFSF + TF +YLIN W N GPIF Y GNEG IE F N+
Sbjct: 21 YVYKTEYFTVPVDHFSFTNNDTFRMKYLINDTFWEREN--GPIFFYAGNEGAIEMFCENT 78
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+W+IA F A++VF EHRYYG SMPYG+ ++ + + YLT++QALAD+ IT L
Sbjct: 79 GFMWEIAEEFRALVVFAEHRYYGVSMPYGNR--SFDDIGRVGYLTSQQALADYVDLITYL 136
Query: 186 KQNLS-----------------------AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGA 222
+ N S + ++PV+ FGGSYGGMLAAW R+KYP I GA
Sbjct: 137 RHNGSYSNRELYQTGDIYDTAVGDTPAPSASNPVIAFGGSYGGMLAAWFRIKYPAIIEGA 196
Query: 223 LASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK 282
+ASSAPI QF + P FY + SS + SA C TI SW + +V + +G L++
Sbjct: 197 IASSAPIWQFTGMTPCNAFYRVTSSVYTDTSAECGLTILASWKAIDNVTKTVDGKAWLSQ 256
Query: 283 TFHLCRELNSTED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN-AP 338
++LC L + D L +W+ Y LAM++YPYP++F+ PLPG+P++EVCK + N
Sbjct: 257 KWNLCSPLTNDNDVSILKEWVSELYVNLAMINYPYPANFLTPLPGHPVKEVCKPMKNHKE 316
Query: 339 DATSILERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGWNWQ 379
D ++L +F G+SVY+NYTG C + G GW++Q
Sbjct: 317 DDFTLLGSVFRGLSVYFNYTGTSKCLDILTSSAPTLGEKGWSYQ 360
>gi|427785257|gb|JAA58080.1| Putative lysosomal pro-x carboxypeptidase-like protein
[Rhipicephalus pulchellus]
Length = 467
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 144/320 (45%), Positives = 204/320 (63%), Gaps = 10/320 (3%)
Query: 66 YRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y YE R+FE ++DHF +A+ T+ RYL +W ++ GPIF Y GNEG I FA NS
Sbjct: 23 YAYEVRFFETKVDHFGYANNDTYKMRYLFADQYW--DHQGGPIFFYTGNEGSITTFANNS 80
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G +WD AP F A+L+F EHRYYG+SMPYG+ ++++ L YLT EQALAD+A + +
Sbjct: 81 GLMWDWAPEFRALLIFAEHRYYGKSMPYGND--SFKSPAHLGYLTVEQALADYADLLQYI 138
Query: 186 KQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
++ L A S VV FGGSYGGMLAAW R+KYPH+ ALA+SAPILQF+DI P +
Sbjct: 139 RKTLPGARDSQVVSFGGSYGGMLAAWFRMKYPHVTAAALAASAPILQFQDITPCGAQSAV 198
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL--NSTEDLADWLES 302
++ F+++S C I++SWG + ++ G + + FHLC + +L DWL
Sbjct: 199 ITGAFRKDSELCVEAIRKSWGIIKNMTSTVEGAQAVRERFHLCGNYTPKNYTNLRDWLTD 258
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDA-TSILERIFEGVSVYYNYTGNV 361
Y+ LAMV+YPY + F+ P+PG+P+RE CK ++ D ++L+ +++ +SV+ NYTG
Sbjct: 259 LYANLAMVNYPYDNSFLAPVPGHPVREACKFLNRTFDNDDALLDGLYQAISVFQNYTGQT 318
Query: 362 DCFQLDDDPHGL--DGWNWQ 379
C L L DGWN+Q
Sbjct: 319 QCNDLSKSSGTLDADGWNYQ 338
>gi|193718325|ref|XP_001949511.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Acyrthosiphon
pisum]
Length = 469
Score = 278 bits (711), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 145/324 (44%), Positives = 200/324 (61%), Gaps = 13/324 (4%)
Query: 66 YRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y Y T Y +DHFS+ + TF +YLIN +W N+ GPIF Y GNEG IE F N+
Sbjct: 20 YNYTTNYITVPVDHFSYTNDDTFELKYLINDKYW-DVNK-GPIFFYTGNEGRIEDFCDNT 77
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+W+I+ F A++VF EHRYYGESMPYG ++ + L YLT++QA+ADF I L
Sbjct: 78 GFMWEISREFKALVVFAEHRYYGESMPYGVN--SFDDKEKLGYLTSQQAIADFVDLIKYL 135
Query: 186 KQN---LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
+ + + +PV FGGSYGGMLAAW R+KYP GA+ASSAPI QF + P FY
Sbjct: 136 RDDALSVGRRPNPVFAFGGSYGGMLAAWFRIKYPAYVEGAIASSAPIWQFTGMTPCNDFY 195
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---W 299
+ SS ++ SA C TI SW + +V + ++G LT + LC+ L ++D+A W
Sbjct: 196 KVTSSVYRNASAECGLTISASWKAINNVTESDSGKTWLTDNWKLCKPLKDSDDVARLKYW 255
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN-APDATSILERIFEGVSVYYNYT 358
Y LAMV+YPY ++F+ PLP PI+EVCK + N D ++L +F G+SVY+NYT
Sbjct: 256 ATDLYVALAMVNYPYEANFLGPLPANPIKEVCKSMTNHTEDDKTLLMSVFHGLSVYFNYT 315
Query: 359 GNVDCFQLDD--DPHGLDGWNWQV 380
G+ C L + ++GW++Q
Sbjct: 316 GSASCLNLSSAFSDNTMNGWDYQA 339
>gi|195484754|ref|XP_002090813.1| GE12590 [Drosophila yakuba]
gi|194176914|gb|EDW90525.1| GE12590 [Drosophila yakuba]
Length = 470
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 197/324 (60%), Gaps = 11/324 (3%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
Q+Y+YE + F+ LDHFSF TF+ RYL N N PIF Y GNEGDIE FA
Sbjct: 25 NQKYKYEIKEFQVPLDHFSFLINATFNIRYLYNDSFVDKSNARTPIFFYTGNEGDIELFA 84
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
N+GF+W+ A R A+++F EHRYYG+S+P+GS+ L+Y T EQ L D+A+ I
Sbjct: 85 QNTGFLWEQAERQRALVIFAEHRYYGKSLPFGSSTFNTSLPEHLAYFTVEQTLEDYAMLI 144
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
T L + PVV FGGSYGGMLAAW R+KYPH+ GALA+SAPILQF I + FY
Sbjct: 145 TFLTND---RQMPVVAFGGSYGGMLAAWFRMKYPHLVTGALAASAPILQFSGITDCDIFY 201
Query: 243 NIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---D 298
IV+S F+ + +C I +SW ++G E G +++ FHLC L + EDL D
Sbjct: 202 RIVTSVFQNAYNTNCTANIAKSWKLFETLGASEAGKKQISDAFHLCNALKTDEDLKKFLD 261
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID--NAPDATSILERIFEGVSVYYN 356
++E YS LAMV+YPY S F+ PLP YP+R+VC + DA +L + ++VY N
Sbjct: 262 YVEEVYSNLAMVNYPYNSSFLAPLPAYPVRQVCYYLRELQTTDA-DLLHAMSSALAVYTN 320
Query: 357 YTGNVDCFQLDDDPHGLD-GWNWQ 379
YT + C + + + D GWN Q
Sbjct: 321 YTQSAKCLDISVNSNADDSGWNIQ 344
>gi|260808833|ref|XP_002599211.1| hypothetical protein BRAFLDRAFT_200045 [Branchiostoma floridae]
gi|229284488|gb|EEN55223.1| hypothetical protein BRAFLDRAFT_200045 [Branchiostoma floridae]
Length = 436
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 138/312 (44%), Positives = 196/312 (62%), Gaps = 11/312 (3%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
++ ++DHFSFA+ TF RYL+N ++ GPIF Y GNEGDIE F N+G + ++
Sbjct: 3 FYPTQVDHFSFANPDTFLLRYLVNDTYF---KDGGPIFFYTGNEGDIEGFVKNTGLLMEM 59
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
APRFGAM++F EHRYYG+SMPYG E ++++ L YLT+ QALADFAV IT L++ S
Sbjct: 60 APRFGAMVIFAEHRYYGQSMPYG--EESFKDPAHLGYLTSTQALADFAVLITRLRKTASG 117
Query: 192 EA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
A SPV FGGSYGGMLAAW+R+KYPH+ G+LAS+A I Q+ I E + ++ + F+
Sbjct: 118 AANSPVFAFGGSYGGMLAAWIRMKYPHLVAGSLASAASIFQYPGITDCEAYSHVATRTFQ 177
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE--DLADWLESAYSYLA 308
R +A C I+ SW + +G+ GL LT F LC L + + L DWL + + A
Sbjct: 178 RSAAGCPGRIRSSWDIIDDLGKTGAGLQNLTSMFRLCERLTADDMPALVDWLVNLWMIYA 237
Query: 309 MVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDD 368
++DYPYP++F+ PLP +P++E C I + D +L + +Y+NYTG C +
Sbjct: 238 LIDYPYPANFLTPLPAWPVKEACHLIVSNED---VLSGVAAAAKLYFNYTGQTACLDISK 294
Query: 369 DPHGLDGWNWQV 380
+G W +Q
Sbjct: 295 PYYGGLAWQYQA 306
>gi|224104755|ref|XP_002313554.1| predicted protein [Populus trichocarpa]
gi|222849962|gb|EEE87509.1| predicted protein [Populus trichocarpa]
Length = 449
Score = 276 bits (707), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 194/318 (61%), Gaps = 17/318 (5%)
Query: 67 RYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
+E + Q LDHF+F TF QRY++N +W G N PIF+Y G E D+
Sbjct: 10 EFEVHNYTQSLDHFNFKPESYATFQQRYILNYKYWGGANTSSPIFVYLGAEIDVTQNLDL 69
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
S + D+A RF +L++ EHRYYG SMP+GS + A+QN++T YL++EQALAD+A I +
Sbjct: 70 S--IVDLAARFKGLLLYVEHRYYGVSMPFGSEDEAFQNSSTFGYLSSEQALADYAQVIVD 127
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
+K++LSAE P + GGSYGGMLA+W RLKYPHI IG+LASSAPIL F+DI P ++ I
Sbjct: 128 VKKDLSAENCPAIAIGGSYGGMLASWFRLKYPHIVIGSLASSAPILYFDDITPQNGYHVI 187
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY 304
VS DF+ S SC+NTI++SW E+ V + NGLL L+ F C LNS++DL + E AY
Sbjct: 188 VSKDFRETSESCYNTIQQSWAEIDRVASETNGLLNLSNIFTTCSPLNSSKDLKVYTEIAY 247
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGV--SVYYNYTGNVD 362
+ A +D P P YP+ ++C ID AP T IL R+ GV SV+ N +
Sbjct: 248 MWAAQLDNP---------PSYPVNKICSAIDGAPSGTDILGRVAAGVNASVFGNSCHSAS 298
Query: 363 CFQLDDDPHGLDGWNWQV 380
L W WQ
Sbjct: 299 GSGLSR--KSASAWEWQT 314
>gi|195580410|ref|XP_002080033.1| GD21700 [Drosophila simulans]
gi|194192042|gb|EDX05618.1| GD21700 [Drosophila simulans]
Length = 475
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 146/323 (45%), Positives = 200/323 (61%), Gaps = 11/323 (3%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
Q+++YE + F+ LDHFSF TFS RYL N N PIF Y GNEGDIE FA
Sbjct: 31 QRFKYEIKEFQVPLDHFSFLINATFSIRYLYNDSFVDKSNARTPIFFYTGNEGDIELFAQ 90
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
N+GF+W+ A R A+++F EHRYYG+S+P+GS+ L+Y T EQ L D+A+ IT
Sbjct: 91 NTGFLWEQAERQRALVIFAEHRYYGKSLPFGSSTFNTSLPEHLAYFTVEQTLEDYAMLIT 150
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
L+ + PVV FGGSYGGMLAAW R+KYPH+ GALA+SAP+LQF I + FY
Sbjct: 151 FLRND---RQMPVVAFGGSYGGMLAAWFRMKYPHLVTGALAASAPVLQFPGITDCDIFYR 207
Query: 244 IVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DW 299
IV+S F+ + +C I +SW ++G E G +++ FHLC L + +DL D+
Sbjct: 208 IVTSVFQNAYNENCTLNIAKSWKLFETLGASEAGKKQISDAFHLCNALKTDDDLKKFLDY 267
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID--NAPDATSILERIFEGVSVYYNY 357
+E YS LAMV+YPY S F+ PLP YP+R+VC + ++ DA +L + ++VY NY
Sbjct: 268 VEEVYSNLAMVNYPYNSSFLAPLPAYPVRQVCYYLKELHSTDA-DLLHAMSSALAVYTNY 326
Query: 358 TGNVDCFQLDDDPHGLD-GWNWQ 379
T + C + + + D GWN Q
Sbjct: 327 TQSAKCLDISVNSNADDSGWNIQ 349
>gi|241161686|ref|XP_002408972.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215494441|gb|EEC04082.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 360
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 141/300 (47%), Positives = 189/300 (63%), Gaps = 10/300 (3%)
Query: 76 RLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS--GFVWDIAP 133
++DHF +A+ TF RYL+ +W + GPIF Y GNE DIE FA S G +W+ AP
Sbjct: 19 QVDHFGYANNDTFKMRYLVADQYW--DHDGGPIFFYTGNEADIEVFANKSYSGLMWEWAP 76
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAE 192
F A+L+F EHRYYG+SMPYG+ +++ + YLTAEQALAD+A +T+ K ++ A
Sbjct: 77 EFKALLIFAEHRYYGKSMPYGNE--SFKGPSRHGYLTAEQALADYADLLTHFKADVPGAG 134
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRE 252
S VV FGGSYGGMLAAW RLKYPH+ ALASSAPILQF + P F +V+ F +E
Sbjct: 135 DSKVVSFGGSYGGMLAAWFRLKYPHVTTAALASSAPILQFTGMTPCNAFSEVVTKAFAKE 194
Query: 253 SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED--LADWLESAYSYLAMV 310
S C N I+ S+ + E G L K F LC+ L + D L DW+ + ++YLAMV
Sbjct: 195 SNQCTNAIRTSFELIRKQAATEEGAKALKKQFRLCKPLAPSNDTVLRDWIRNVFAYLAMV 254
Query: 311 DYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDD 369
+YPY S +P PG+P++E CK + N D S+L+ I+ +SV NYTG + C L D+
Sbjct: 255 NYPYASKLTLPAPGHPVKEACKFLKKNFTDVQSLLDGIYRAISVLTNYTGKIHCNDLSDN 314
>gi|224109994|ref|XP_002333168.1| predicted protein [Populus trichocarpa]
gi|222835051|gb|EEE73500.1| predicted protein [Populus trichocarpa]
Length = 446
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 199/314 (63%), Gaps = 17/314 (5%)
Query: 72 YFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVW 129
Y+ Q LDHF++ TF QRY+I+ +W G N PIF++ G E D++ GF+
Sbjct: 3 YYTQTLDHFNYKPESYTTFRQRYVIDFRYWGGANTSAPIFVFFGAEEDLDDDLDAIGFLS 62
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
D AP F A+L++ EHRYYG S+P+GS + A +NA TL YL + QA+AD+A I +LK+
Sbjct: 63 DNAPHFKALLIYIEHRYYGRSIPFGSRKEALKNAETLGYLNSAQAMADYAAVIMHLKKKY 122
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
SA+ SPV++ GGSYGGML +W RLKYPHIA+GALASSAPIL F+DI P E +Y+IV+ DF
Sbjct: 123 SAKNSPVIVIGGSYGGMLTSWFRLKYPHIALGALASSAPILYFDDISPQEGYYSIVTKDF 182
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAM 309
K S SC+NTI++SWGE+ + K NGL L+K F C LN T +L D+L+S Y A
Sbjct: 183 KETSESCYNTIRKSWGEIEKIASKPNGLSILSKKFKTCFPLNRTFELEDFLDSIYFEAAQ 242
Query: 310 VDYPYPSDFMMPLPGYPIREVCKKIDNAPDA-TSILERIFEGVSVYYNYTGNVDCFQLD- 367
DYP P +P+ VC I+ A A T IL+RI ++V Y GN C ++
Sbjct: 243 YDYP---------PEFPVSIVCGGINKASAARTDILDRI---LAVVVAYMGNRSCHDMNA 290
Query: 368 -DDPHGLDGWNWQV 380
+ P + W WQ
Sbjct: 291 FNYPDAIYEWRWQT 304
>gi|198429605|ref|XP_002127825.1| PREDICTED: similar to prolylcarboxypeptidase [Ciona intestinalis]
Length = 494
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 133/310 (42%), Positives = 194/310 (62%), Gaps = 7/310 (2%)
Query: 62 QQQQYRYETR-YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW 120
+ + Y Y T YF+Q LDHF F TF+QRY ++ +W + GPIF Y GNEGDIE
Sbjct: 25 KNEGYIYPTELYFKQNLDHFDFTINATFTQRYFVSEQYWTKMD--GPIFFYTGNEGDIEL 82
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
F N+G +WDIAP F AM+VF EHRYYG+S P+G+ + + + SYLTAEQALADFA+
Sbjct: 83 FIKNTGLMWDIAPMFKAMVVFAEHRYYGKSKPFGNLKPSTKTIKEFSYLTAEQALADFAI 142
Query: 181 FITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPE 239
+ ++K S A+ SPVV+FGGSYGGML+AW RLKYPHI GA+A+SAP+L F V
Sbjct: 143 LVKHIKSTDSKAKNSPVVVFGGSYGGMLSAWFRLKYPHIVTGAIAASAPVLYFPSTVKCS 202
Query: 240 TFYNIVSSDF--KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA 297
+ V+++F + +C I+ W + +K GL L + FHLC +NS+ +
Sbjct: 203 QYNEAVTNNFLSVQNGETCVANIRNVWKTMNETAKKPGGLKLLGEIFHLCSAINSSTAVE 262
Query: 298 DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP-DATSILERIFEGVSVYYN 356
+++ + +AMVDYPY ++F+ +P +P+ + C+ + +L+ + + VY N
Sbjct: 263 SFIKDIFGNMAMVDYPYANNFLSNIPAWPVNKTCQHLSEPNLQGLDLLQAMHSAIGVYQN 322
Query: 357 YTGNVDCFQL 366
YTG+V C+ +
Sbjct: 323 YTGSVKCYNV 332
>gi|195118432|ref|XP_002003741.1| GI18077 [Drosophila mojavensis]
gi|193914316|gb|EDW13183.1| GI18077 [Drosophila mojavensis]
Length = 469
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 141/325 (43%), Positives = 197/325 (60%), Gaps = 10/325 (3%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
++++YE + F LDHFSF +F+ RYL N + N PIF Y GNEGDIEWFA
Sbjct: 21 NEKFQYEIKEFSVPLDHFSFLSNASFNIRYLANDSYVDKKNPQPPIFFYTGNEGDIEWFA 80
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
NSGFVW++A + A+++F EHRYYG+S+PYG+ L+Y T EQ L D+A I
Sbjct: 81 QNSGFVWELAAQQRALVIFAEHRYYGKSLPYGADTFNTSKPEHLAYFTVEQTLEDYAQLI 140
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
T L+ + + PVV FGGSYGGMLAAW R+KYPHI +GALA+SAPILQF + P + F
Sbjct: 141 TYLR---NGKQLPVVAFGGSYGGMLAAWFRMKYPHIVVGALAASAPILQFSGLTPCDIFN 197
Query: 243 NIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL---AD 298
I ++ F+ +A+C I SW S+ + G +L+ FH+C + + +DL D
Sbjct: 198 KITTAVFENAYNANCTANIGRSWKVFESMAATDAGKKQLSDIFHVCDAIKNADDLNNFMD 257
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEGVSVYYNY 357
+LE Y LAM +YPY S F+ PLP P+R++C + D + T ++ + ++VY NY
Sbjct: 258 YLEEVYGNLAMANYPYNSTFLSPLPANPVRQLCFYLKDLHQNDTDLVASMASALAVYTNY 317
Query: 358 TGNVDCFQLDDDPHGL--DGWNWQV 380
TG+V C D G DG W++
Sbjct: 318 TGSVKCLDYKDSNAGASYDGTGWEI 342
>gi|194878910|ref|XP_001974147.1| GG21569 [Drosophila erecta]
gi|190657334|gb|EDV54547.1| GG21569 [Drosophila erecta]
Length = 470
Score = 274 bits (701), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 198/323 (61%), Gaps = 11/323 (3%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
Q+Y YE + F+ LDHFSF TF+ RYL N N PIF Y GNEGDIE FA
Sbjct: 26 QRYTYEIKEFQVPLDHFSFLINETFNIRYLYNDSFVDKSNDRTPIFFYTGNEGDIELFAQ 85
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
N+GF+W+ A R A+++F EHRYYG+S+P+GS+ L+Y T EQAL D+A+ IT
Sbjct: 86 NTGFLWEQAERQRALVIFAEHRYYGKSLPFGSSTFNTSMPEHLAYFTVEQALEDYAMLIT 145
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
L+ + PVV FGGSYGGMLAAW R+KYPH+ GALA+SAPILQF I + FY
Sbjct: 146 FLRND---RQLPVVAFGGSYGGMLAAWFRMKYPHLVAGALAASAPILQFSGITDCDIFYK 202
Query: 244 IVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL-- 300
IV+S F+ + +C I +SW ++G E G +++ FHLC L + EDL +L
Sbjct: 203 IVTSVFENAYNKNCSANIAKSWKLFETLGASEAGKKQISDAFHLCNALKTDEDLKKFLNY 262
Query: 301 -ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID--NAPDATSILERIFEGVSVYYNY 357
E Y+ LAMV+YPY S F+ PLP YP+R+VC + DA S+L + ++VY NY
Sbjct: 263 VEEVYTNLAMVNYPYNSSFLAPLPAYPVRQVCYYLKELQTTDA-SLLHAMSSALAVYTNY 321
Query: 358 TGNVDCFQLDDDPHGLD-GWNWQ 379
T + C + + + + GW+ Q
Sbjct: 322 TQSAKCLDISVNSNADESGWDIQ 344
>gi|198473985|ref|XP_001356509.2| GA15377 [Drosophila pseudoobscura pseudoobscura]
gi|198138198|gb|EAL33573.2| GA15377 [Drosophila pseudoobscura pseudoobscura]
Length = 474
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 142/324 (43%), Positives = 200/324 (61%), Gaps = 9/324 (2%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
+ +++YE + F+ LDHFSF TF+ RYL N N PIF Y GNEGDIE FA
Sbjct: 29 EVKFKYEIKEFQVPLDHFSFLSNATFNIRYLYNDSFVDKKNAHTPIFFYTGNEGDIELFA 88
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
N+GF+W++A + A+L+F EHRYYG+S+P+G++ L+Y T EQ L D+A+ I
Sbjct: 89 QNTGFMWELAEKQRALLIFAEHRYYGKSLPFGASTFNASMPDHLAYFTVEQTLEDYAMLI 148
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
T L+ +L PVV FGGSYGGMLAAW R+KYPH+ GALA+SAPILQF I + FY
Sbjct: 149 TFLRNDLPL---PVVAFGGSYGGMLAAWFRMKYPHLVAGALAASAPILQFPGITDCDIFY 205
Query: 243 NIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---D 298
IV+S F+ +++C I SW ++G E G +++ F+LC + + DL D
Sbjct: 206 RIVTSVFQNAYNSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKNFLD 265
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNY 357
++E Y LAMV+YPY S F+ PLP YP+R+VC + + + + +L + ++VY NY
Sbjct: 266 YIEEVYGNLAMVNYPYNSSFLAPLPAYPVRQVCFYLKDLHQSDADLLHAMASALAVYTNY 325
Query: 358 TGNVDCFQLDDDPHGLD-GWNWQV 380
TG+V C + + D GWN Q
Sbjct: 326 TGSVKCLDTSVNSNADDSGWNVQT 349
>gi|255565519|ref|XP_002523750.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537054|gb|EEF38690.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
Length = 491
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 148/366 (40%), Positives = 211/366 (57%), Gaps = 22/366 (6%)
Query: 18 ITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRL 77
+ V++ + + + A P + R RF G + + Q YE Y+ Q L
Sbjct: 8 LCFVLLLLAAVCASAVHPRELTRLTRFGG-----VKRFAASEFSYQLPPEYEIHYYTQTL 62
Query: 78 DHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRF 135
DHF++ TF QRY++N +W G N PIF+Y G E D+ + + + +A RF
Sbjct: 63 DHFNYNPQSYATFQQRYILNFKYWGGANTSSPIFVYTGEEVDVTY---DVDTILHLAARF 119
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
A+L++ EHRYYGESMP+GS + A+QN++TL YL++EQALAD+A +T++K+ LSAE P
Sbjct: 120 KALLLYIEHRYYGESMPFGSEDQAFQNSSTLGYLSSEQALADYAQVVTDVKKKLSAENCP 179
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
+ G SYGGMLAAW RLKYPHI IG+LASS+PIL F+DI P ++ +V+ D++ S S
Sbjct: 180 AIAVGASYGGMLAAWFRLKYPHIVIGSLASSSPILYFDDITPQNGYHVVVTKDYRDTSES 239
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYP 315
C+NTIK+SW E+ V + NGLL L+ F+ C LNS+ +L ++LE Y A D P
Sbjct: 240 CYNTIKQSWSEIDRVAAQPNGLLTLSNMFNTCSPLNSSTELREYLEILYVSAAQYDNP-- 297
Query: 316 SDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDD-DPHGLD 374
P P++ C+ ID AP T IL RI G+ C+ + D
Sbjct: 298 -------PYNPVQNTCRGIDGAPPGTDILGRIVAGLKS--RIPSWSSCYDVPTWDLSNKS 348
Query: 375 GWNWQV 380
W+WQ
Sbjct: 349 AWDWQT 354
>gi|414880844|tpg|DAA57975.1| TPA: hypothetical protein ZEAMMB73_592594 [Zea mays]
Length = 761
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 131/196 (66%), Positives = 158/196 (80%)
Query: 138 MLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVV 197
M +HRYY ESMP+GS AY ++ +L+YLTA+QALADFAV +T+LK+NLSAE SPVV
Sbjct: 527 MEKLSKHRYYRESMPFGSKAKAYIDSKSLAYLTAKQALADFAVQLTDLKRNLSAEGSPVV 586
Query: 198 LFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCF 257
LFG SYGGMLAAW+RLKYPHIAIGALASSAPILQFEDIVP FY++VS DF+RES SCF
Sbjct: 587 LFGDSYGGMLAAWIRLKYPHIAIGALASSAPILQFEDIVPSTIFYDLVSDDFRRESLSCF 646
Query: 258 NTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSD 317
IK+SW EL K++GLL+L+KTFHLC+ L ++ DL+DWL SAYSYLAMVDYP S
Sbjct: 647 LKIKDSWKELDDQANKQDGLLKLSKTFHLCQTLKTSGDLSDWLSSAYSYLAMVDYPLSSK 706
Query: 318 FMMPLPGYPIREVCKK 333
F+ PLP PI+++ K
Sbjct: 707 FLRPLPANPIKKLVLK 722
>gi|195351854|ref|XP_002042435.1| GM23324 [Drosophila sechellia]
gi|194124304|gb|EDW46347.1| GM23324 [Drosophila sechellia]
Length = 475
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 200/323 (61%), Gaps = 11/323 (3%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
Q+++YE + F+ LDHFSF TF+ RYL N N PIF Y GNEGDIE FA
Sbjct: 31 QRFKYEIKEFQVPLDHFSFLINATFNIRYLYNDSFVDKSNARTPIFFYTGNEGDIELFAQ 90
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
N+GF+W+ A R A+++F EHRYYG+S+P+GS+ L+Y T EQ L D+A+ IT
Sbjct: 91 NTGFLWEQAERQRALVIFAEHRYYGKSLPFGSSTFNTSLPEHLAYFTVEQTLEDYAMLIT 150
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
L+ + PVV FGGSYGGMLAAW R+KYPH+ GALA+SAP+LQF I + FY
Sbjct: 151 FLRND---RQMPVVAFGGSYGGMLAAWFRMKYPHLVTGALAASAPVLQFPGITDCDIFYR 207
Query: 244 IVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DW 299
IV+S F+ + +C I +SW ++G E G +++ FHLC L + +DL D+
Sbjct: 208 IVTSVFQNAYNENCTLNIGKSWKLFETLGASEAGKKQISDAFHLCNVLKTDDDLKKFLDY 267
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID--NAPDATSILERIFEGVSVYYNY 357
+E YS LAMV+YPY S F+ PLP YP+R+VC + ++ DA +L + ++VY NY
Sbjct: 268 VEEVYSNLAMVNYPYNSSFLAPLPAYPVRQVCYYLKELHSTDA-DLLHAMSSALAVYTNY 326
Query: 358 TGNVDCFQLDDDPHGLD-GWNWQ 379
T + C + + + D GWN Q
Sbjct: 327 TQSAKCLDISVNSNADDSGWNIQ 349
>gi|20129649|ref|NP_610037.1| CG2493, isoform A [Drosophila melanogaster]
gi|442628593|ref|NP_001260630.1| CG2493, isoform B [Drosophila melanogaster]
gi|7298683|gb|AAF53897.1| CG2493, isoform A [Drosophila melanogaster]
gi|18447221|gb|AAL68201.1| GH14278p [Drosophila melanogaster]
gi|220945264|gb|ACL85175.1| CG2493-PA [synthetic construct]
gi|220954996|gb|ACL90041.1| CG2493-PA [synthetic construct]
gi|440213994|gb|AGB93165.1| CG2493, isoform B [Drosophila melanogaster]
Length = 475
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 200/323 (61%), Gaps = 11/323 (3%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
Q+++YE + F+ LDHFSF TF+ RYL N N PIF Y GNEGDIE FA
Sbjct: 31 QRFKYEIKEFQVPLDHFSFLINATFNIRYLYNDSFVDKSNARTPIFFYTGNEGDIELFAQ 90
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
N+GF+W+ A R A+++F EHRYYG+S+P+GS+ L+Y T EQ L D+A+ IT
Sbjct: 91 NTGFLWEQAERQRALVIFAEHRYYGKSLPFGSSTFNTSLPEHLAYFTVEQTLEDYAMLIT 150
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
L+ + PVV FGGSYGGMLAAW R+KYPH+ GALA+SAP+LQF I + FY
Sbjct: 151 FLRND---RQMPVVAFGGSYGGMLAAWFRMKYPHLVNGALAASAPVLQFPGITDCDIFYR 207
Query: 244 IVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DW 299
IV+S F+ + +C I +SW ++G E G +++ FHLC L + +DL D+
Sbjct: 208 IVTSVFQNAYNENCTLNIAKSWKLFETLGASEAGKKQISDAFHLCNALKNDDDLKKFLDY 267
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID--NAPDATSILERIFEGVSVYYNY 357
+E YS LAMV+YPY S F+ PLP YP+R+VC + ++ DA +L + ++VY NY
Sbjct: 268 VEEVYSNLAMVNYPYNSSFLAPLPAYPVRQVCYYLKELHSTDA-DLLHAMSSALAVYTNY 326
Query: 358 TGNVDCFQLDDDPHGLD-GWNWQ 379
T + C + + + D GWN Q
Sbjct: 327 TQSAKCLDISVNSNADDSGWNIQ 349
>gi|332373508|gb|AEE61895.1| unknown [Dendroctonus ponderosae]
Length = 487
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 139/332 (41%), Positives = 205/332 (61%), Gaps = 19/332 (5%)
Query: 62 QQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWF 121
Q ++Y + T+Y + LDHFSF TF RYL+N ++ PIF Y GNEGDI F
Sbjct: 16 QGEEYGFVTKYIDMPLDHFSFTTNTTFKLRYLVNDSYFSNDQ---PIFFYTGNEGDISMF 72
Query: 122 AVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF 181
A N+GF++++A + GA+++F EHR+YGE++P+G+ +Y + TL YL+++QALAD+
Sbjct: 73 AQNTGFLFELAEKMGALIIFAEHRFYGETLPFGNE--SYSSPKTLGYLSSQQALADYVYL 130
Query: 182 ITNLKQNLSAEAS-----PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
I NL++ E S PV+ FGGSYGGMLAAW+R+KYP+ +GA+ASSAPI QF+ I
Sbjct: 131 IDNLQKKYVNELSSLRKLPVISFGGSYGGMLAAWLRIKYPYSVLGAIASSAPIWQFQGIT 190
Query: 237 PPETFYNIVSSDFKR-ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED 295
P + F IV+ SA+C TIK+SW + + G ++KTF LC L S +D
Sbjct: 191 PCQNFNKIVTDVIGALGSATCVETIKKSWSIFRKIASTDIGKQNISKTFDLCTSLKSDDD 250
Query: 296 L---ADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID--NAPDATSILERIFEG 350
L +WL Y+ + V+YPYP+ F++PLPG P+RE C ++D N + +++ + G
Sbjct: 251 LNTFLNWLSEMYTMIVEVNYPYPNSFLVPLPGNPVREFCSRMDSVNYNNDDGLIKALSTG 310
Query: 351 VSVYYNYTGNVDCFQLDDDPH---GLDGWNWQ 379
V ++ NYTG C + G GW++Q
Sbjct: 311 VQLFTNYTGTTKCNNIGQTASPSLGELGWDFQ 342
>gi|255579501|ref|XP_002530593.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
gi|223529841|gb|EEF31773.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
Length = 327
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 126/162 (77%), Positives = 148/162 (91%)
Query: 169 LTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAP 228
+ EQALADFAV IT+LK+NL+AE PVVLFGGSYGGMLAAWMRLKYPHIAIGALA+SAP
Sbjct: 1 MKPEQALADFAVLITDLKRNLTAEDCPVVLFGGSYGGMLAAWMRLKYPHIAIGALAASAP 60
Query: 229 ILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR 288
ILQFE++VP E FY+IVS+DFKRES+ CFNTIKESW + S G KENGL++L++TFH+C
Sbjct: 61 ILQFENVVPHEIFYDIVSNDFKRESSRCFNTIKESWNAIASEGLKENGLVKLSRTFHMCS 120
Query: 289 ELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREV 330
+LNST++LADWLESAYSYLAMVDYPYP++FMMPLPG+PIREV
Sbjct: 121 DLNSTDELADWLESAYSYLAMVDYPYPAEFMMPLPGHPIREV 162
>gi|195147472|ref|XP_002014704.1| GL19317 [Drosophila persimilis]
gi|194106657|gb|EDW28700.1| GL19317 [Drosophila persimilis]
Length = 474
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 142/324 (43%), Positives = 200/324 (61%), Gaps = 9/324 (2%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
+ +++YE + F+ LDHFSF TF+ RYL N N PIF Y GNEGDIE FA
Sbjct: 29 EVKFKYEIKEFQVPLDHFSFLSNATFNIRYLYNDSFVDKKNAHTPIFFYTGNEGDIELFA 88
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
N+GF+W++A + A+L+F EHRYYG+S+P+G++ L+Y T EQ L D+A+ I
Sbjct: 89 QNTGFMWELAEKQRALLIFAEHRYYGKSLPFGASTFNASMPDHLAYFTVEQTLEDYAMLI 148
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
T L+ +L PVV FGGSYGGMLAAW R+KYPH+ GALA+SAPILQF I + FY
Sbjct: 149 TFLRNDLPL---PVVAFGGSYGGMLAAWFRMKYPHLVAGALAASAPILQFPGITDCDIFY 205
Query: 243 NIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---D 298
IV+S F+ +++C I SW ++G E G +++ F+LC + + DL D
Sbjct: 206 RIVTSVFQNAYNSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKNFLD 265
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNY 357
++E Y LAMV+YPY S F+ PLP YP+R+VC + + + + +L + ++VY NY
Sbjct: 266 YIEEVYGNLAMVNYPYNSSFLAPLPPYPVRQVCFYLKDLHQSDADLLHAMASALAVYTNY 325
Query: 358 TGNVDCFQLDDDPHGLD-GWNWQV 380
TG+V C + + D GWN Q
Sbjct: 326 TGSVKCLDTSVNSNADDSGWNVQT 349
>gi|241161688|ref|XP_002408973.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215494442|gb|EEC04083.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 409
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 197/319 (61%), Gaps = 10/319 (3%)
Query: 66 YRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y Y+T YF ++DHF +A+ TF RYL+ +W + GPIF Y GNEGDIE FA N+
Sbjct: 2 YSYDTYYFTTKVDHFGYANNDTFKMRYLVADQYW--DHDGGPIFFYTGNEGDIEVFANNT 59
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G +WD AP F A+L+F EHRYYG+SMPYG + +++ + YLT EQALAD+A +T+
Sbjct: 60 GLMWDWAPEFKALLIFAEHRYYGKSMPYG--KESFEGPSRHGYLTVEQALADYADLLTHF 117
Query: 186 KQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
K ++ A S VV FGGSYGGMLAAW RLKYP G +SAPILQF I P +
Sbjct: 118 KAHVPAAGDSKVVSFGGSYGGMLAAWFRLKYPPRDHG--VTSAPILQFTGITPCNALNEV 175
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL--NSTEDLADWLES 302
V+ F +ES C + I+ S+ + + E G L + F LC+ L ++ L DW
Sbjct: 176 VTKAFAKESDQCTSAIRTSFEVMRNQAATEEGAKALEEKFRLCQALAPSNYTVLRDWFVD 235
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID-NAPDATSILERIFEGVSVYYNYTGNV 361
Y+ LAMV+YPY ++F+ P+PG+P++E CK ++ N D S+L+ I++ +SV+ NYTG
Sbjct: 236 VYTNLAMVNYPYANEFLGPVPGHPVKEACKFLEKNFTDDQSLLDGIYQAISVFQNYTGQT 295
Query: 362 DCFQLDDDPHGLDGWNWQV 380
C L + LD W +
Sbjct: 296 HCNDLTNSAGTLDAGGWDI 314
>gi|255565521|ref|XP_002523751.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537055|gb|EEF38691.1| lysosomal pro-X carboxypeptidase, putative [Ricinus communis]
Length = 489
Score = 270 bits (691), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 154/366 (42%), Positives = 215/366 (58%), Gaps = 24/366 (6%)
Query: 18 ITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRL 77
+ +V++ ++ + A P K R RF G+ + + Q YE Y+ Q L
Sbjct: 8 LCMVLVLLVPACASALHPRKLTRTTRFGGE-----KRFAASEFSYQLPSDYEIHYYTQTL 62
Query: 78 DHFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRF 135
DHF++ TF QRY++N +W G N PIFLY G E ++ + S + ++A RF
Sbjct: 63 DHFNYKPESYATFQQRYILNFKYWGGANTSSPIFLYTGAEENLIYHVDRS--IVELAARF 120
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
+L++ EHRYYGESMP+GS E A QN++TL YL++EQALAD+A IT++K+NLSAE P
Sbjct: 121 RGLLLYIEHRYYGESMPFGSEEQALQNSSTLGYLSSEQALADYAQVITDVKKNLSAENCP 180
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
+ G SYGGMLAAW RLKYPHI IG+LASS+PIL F+DI P+ Y+++S E S
Sbjct: 181 AIAVGASYGGMLAAWFRLKYPHIVIGSLASSSPILYFDDIT-PQNGYHVLSRRILDE--S 237
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYP 315
C NTIK+SW E+ V + NGLL L+ F+ CR L S+ + ++LE Y A D P
Sbjct: 238 CHNTIKQSWSEIDRVAAQPNGLLTLSNMFNTCRPLVSSAEFKEYLELLYITAAQYDNP-- 295
Query: 316 SDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYY-NYTGNVDCFQLDDDPHGLD 374
P P++ C+ ID AP T IL RI EG++ ++ D F L+ +G
Sbjct: 296 -------PDNPVQSTCRGIDGAPPGTDILGRIVEGLNGRIPGWSSCHDIFTLELSNNG-- 346
Query: 375 GWNWQV 380
W+WQ
Sbjct: 347 SWDWQT 352
>gi|339245559|ref|XP_003378705.1| putative serine protease pcp-1 [Trichinella spiralis]
gi|316972372|gb|EFV56050.1| putative serine protease pcp-1 [Trichinella spiralis]
Length = 826
Score = 270 bits (691), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 146/355 (41%), Positives = 207/355 (58%), Gaps = 49/355 (13%)
Query: 68 YETRYFEQR-LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
Y +FE +DHFSF + TF RYLINT++W GPIF YCGNEG +E FA N+G
Sbjct: 25 YNESWFENMPVDHFSFENSDTFRLRYLINTENWNSDG--GPIFFYCGNEGSVEGFAENTG 82
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+W+ A FGAM+VF EHRYYG+S+P+G+ ++++ L L +EQA+AD+AV I LK
Sbjct: 83 FMWENAKDFGAMVVFAEHRYYGKSLPFGN-----ESSSNLGKLNSEQAMADYAVLINWLK 137
Query: 187 QNLS-AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
N++ A++S V+ FGGSYGGMLAAWMR KYPH+ GA+A+SAP+ QF + +F +I
Sbjct: 138 TNITGAKSSAVIAFGGSYGGMLAAWMRTKYPHLVDGAIAASAPVAQFSGMTVCSSFSDIT 197
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED-----LADWL 300
+ ++ S SC +IK SW + G+ G L+L K F LC E T L +WL
Sbjct: 198 TEVYRNASPSCALSIKRSWPIIRKWGKTAEGRLDLAKMFRLCNETQFTSKKNVTQLVNWL 257
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIRE------------------------VCKKI-- 334
Y LAMV+YPY ++F+ P+P +P++ +CK+I
Sbjct: 258 TDIYGTLAMVNYPYATEFLKPVPAWPVKASDEDSVCTVVVGLFFFLIIHPRACICKRICV 317
Query: 335 ------DNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDD-PHGLDG--WNWQV 380
D T +L RI+ +S+Y N+TG C L++D +DG W++Q
Sbjct: 318 ACQFLNDTEVGETELLHRIYSTISIYTNFTGKKPCNLLENDYGDSVDGKLWDYQA 372
>gi|347964904|ref|XP_309189.4| AGAP000994-PA [Anopheles gambiae str. PEST]
gi|333466532|gb|EAA04920.4| AGAP000994-PA [Anopheles gambiae str. PEST]
Length = 481
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 138/333 (41%), Positives = 197/333 (59%), Gaps = 14/333 (4%)
Query: 60 RQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRL--GPIFLYCGNEGD 117
R Y Y+T+ + +DHF++ TF RYLIN + G L PI Y GNEGD
Sbjct: 18 RDGTATYVYQTKTIDVPIDHFTYTGEATFKLRYLINDTYAPGGADLPASPILFYAGNEGD 77
Query: 118 IEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALAD 177
IE FA N+GF+W++AP+ A L+F EHR+YG S+P+G+ +Y + L YLT+EQALAD
Sbjct: 78 IELFAQNTGFMWELAPKLKATLLFVEHRFYGHSLPFGN--ASYDSPKNLGYLTSEQALAD 135
Query: 178 FAVFITNLKQ-NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
FA+ + L N + A PV+ FGGSYGGMLAAW+R+KYPH+ GA+A+SAP+ QF +
Sbjct: 136 FALVLRTLNPPNGTTRARPVIAFGGSYGGMLAAWIRIKYPHLVAGAIAASAPVRQFAGVT 195
Query: 237 PPETFYNIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS--- 292
F I++S ++ +A C + I+ SW L + +GL L + F C L
Sbjct: 196 DCGIFNQILTSVYQVAYTADCADNIRRSWTTLQNYSTSADGLRLLNEKFKFCTNLTKGTD 255
Query: 293 -TEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGV 351
TE L D+L Y LAM++YPYPS F+ P+P YP+RE C ++ +L+ + +
Sbjct: 256 VTETLFDYLTDVYGNLAMINYPYPSSFLAPVPAYPVREFCGRLAQNYTGVELLDHLQSAL 315
Query: 352 SVYYNYTGNVDCFQLDD--DPHGLD--GWNWQV 380
S+YYNY G C ++ D G+ GW++Q
Sbjct: 316 SIYYNYDGKAACLNINSSYDGTGISDRGWDFQA 348
>gi|157110322|ref|XP_001651053.1| prolylcarboxypeptidase [Aedes aegypti]
gi|108878777|gb|EAT43002.1| AAEL005521-PA [Aedes aegypti]
Length = 476
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/330 (42%), Positives = 194/330 (58%), Gaps = 18/330 (5%)
Query: 61 QQQQQYRYETRYFEQRLDHFSFADLP-TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE 119
Q +Y Y+ +Y + LDHF++ + TF RYL+N + P+ GPI Y GNEGDIE
Sbjct: 22 QISAKYEYQIKYLDVPLDHFTYVNESVTFKMRYLVNDTY--NPDGKGPILFYTGNEGDIE 79
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
FA N+GF+WDIAP+ A LVF EHR+YG+++P+G+ +Y++ L YL++EQALADFA
Sbjct: 80 NFAQNTGFMWDIAPKLKASLVFAEHRFYGKTLPFGN--ASYESPKHLGYLSSEQALADFA 137
Query: 180 VFITNLK-QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI--V 236
+ + N S PV+ FGGSYGGMLAAW R+KYPH+ GA+ASSAPI QF+ V
Sbjct: 138 DLLAEINPSNRSTRGRPVIAFGGSYGGMLAAWFRMKYPHLVTGAIASSAPIRQFDTDCGV 197
Query: 237 PPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
+ ++ S + RE C I SW L + +GL L F C + ED+
Sbjct: 198 FSQILTSVFSVAYTRE---CSLNIARSWDTLKNYSSTADGLKTLKDKFKFCGNITKAEDI 254
Query: 297 A----DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVS 352
D+L Y LAM++YPY S F+ P+P YP+RE C ++ ++ R+ E +S
Sbjct: 255 TETFFDYLTDVYGNLAMINYPYNSSFLAPVPAYPVREFCGRLAQNFTGLELIGRLQEALS 314
Query: 353 VYYNYTGNVDCFQLD---DDPHGLDGWNWQ 379
+Y NYTG C + DD G GW++Q
Sbjct: 315 IYSNYTGKTKCLNIANAYDDNMGTSGWDFQ 344
>gi|66804725|ref|XP_636095.1| peptidase S28 family protein [Dictyostelium discoideum AX4]
gi|60464439|gb|EAL62586.1| peptidase S28 family protein [Dictyostelium discoideum AX4]
Length = 513
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 149/363 (41%), Positives = 214/363 (58%), Gaps = 31/363 (8%)
Query: 43 RFVGKLPHLTEPPQRQQRQQQQQY-----------RYETRYFEQRLDHFSFADLPTFSQR 91
R+ P+ ++ +++Q+Q+Q+ Y+ +F Q LDHF+F F+QR
Sbjct: 26 RYRSPFPYYSKKHVDRKQQEQEQHFVSNDPPPPPPPYQELFFLQTLDHFNFQSKGEFAQR 85
Query: 92 YLINTDHWVGPN-----RLGPIFLYCGNEGDIEWFAVNSGFVWDI-APRFGAMLVFPEHR 145
YL++ +W P+ GPI Y GNEGDI F NS FV ++ A A+L+F EHR
Sbjct: 86 YLVSDVYWKKPSPNDKVCQGPILFYTGNEGDITLFYDNSQFVTNVLAQEMNALLIFAEHR 145
Query: 146 YYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGG 205
YYGES+P+G+ N + YLT+EQALAD+A I + + AE PV+ GGSYGG
Sbjct: 146 YYGESLPFGNDSWTSDN---IGYLTSEQALADYAQLIPAVLSEMGAEHCPVLSVGGSYGG 202
Query: 206 MLAAWMRLKYPHIAIGALASSAPILQFEDI-VPPETFYNIVSSDFKRESA--SCFNTIKE 262
ML AW R+KYP+I GALA+SAPIL F + V PETF I + DFK S+ +C + I+
Sbjct: 203 MLTAWFRMKYPNIVDGALAASAPILSFLNTGVNPETFNKIATDDFKDTSSEGTCASRIRS 262
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRE-LNSTEDLADWLESAYSYLAMVDYPYPSDFMMP 321
+ ++V++ + NGL +L+KTF +C L DL +W+ESA +Y+AM DYPYP++F+ P
Sbjct: 263 ALNDIVTISTQSNGLAQLSKTFSVCGAPLTDVNDLINWIESALTYMAMADYPYPANFLEP 322
Query: 322 LPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVD-CFQLDDDPHGLDG---WN 377
+PGYPI C + D ++ + E + VYYNYTG C+ + G G WN
Sbjct: 323 MPGYPINVSCSALAQQEDD---IQGLLEVLHVYYNYTGQAGTCYNMSVFTTGALGDASWN 379
Query: 378 WQV 380
+Q
Sbjct: 380 YQA 382
>gi|330843655|ref|XP_003293764.1| hypothetical protein DICPUDRAFT_158674 [Dictyostelium purpureum]
gi|325075873|gb|EGC29712.1| hypothetical protein DICPUDRAFT_158674 [Dictyostelium purpureum]
Length = 503
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 139/325 (42%), Positives = 194/325 (59%), Gaps = 18/325 (5%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPN-----RLGPIFLYCGNEGDIEWFA 122
Y+ ++ Q LDHF+F F+QRYLI+ +W P+ GPI Y GNEGDI WF
Sbjct: 54 YKEYWYMQTLDHFNFQTKGQFAQRYLISDTYWNKPSPSSKVCSGPIIFYTGNEGDIVWFY 113
Query: 123 VNSGFVWDI-APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF 181
NS F+ ++ A GA+L F EHRYYGE++P+G+ + +N YLT+EQALAD+A
Sbjct: 114 ENSQFITNVLAKEMGALLFFAEHRYYGETLPFGNESLTPENT---GYLTSEQALADYAEL 170
Query: 182 ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF-EDIVPPET 240
I ++ +L AE PV+ GGSYGGML AW R+KYP+I LA+SAPIL F + E
Sbjct: 171 IPSVLADLGAEHCPVISVGGSYGGMLTAWFRMKYPNIVDAGLAASAPILMFYKTGASQEG 230
Query: 241 FYNIVSSDFKR--ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD 298
F I + DFK+ E +C + I+ ++ ++ + Q+ GL +LT TF LC LN DL +
Sbjct: 231 FNQIATDDFKQTSEEGTCASRIRNAFNSIMEISQQTGGLQQLTNTFSLCDSLNQVGDLVN 290
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYT 358
W+ES +Y+AM DYPYP+ F+ P+PGYPI C+ + D L ++ ++VYYNYT
Sbjct: 291 WIESGLTYMAMADYPYPAGFLEPMPGYPINVSCEAMATTTDDIQGLLKV---LNVYYNYT 347
Query: 359 GNVDCFQ---LDDDPHGLDGWNWQV 380
G C+ G D W+ Q
Sbjct: 348 GTTQCYNTSVFTTSALGSDAWDVQA 372
>gi|170055552|ref|XP_001863632.1| lysosomal Pro-X carboxypeptidase [Culex quinquefasciatus]
gi|167875507|gb|EDS38890.1| lysosomal Pro-X carboxypeptidase [Culex quinquefasciatus]
Length = 475
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 140/329 (42%), Positives = 194/329 (58%), Gaps = 16/329 (4%)
Query: 61 QQQQQYRYETRYFEQRLDHFSFADLP-TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE 119
Q +Y Y+T+Y + LDHFS+ + TF RYL N + P+ GPI Y GNEGDIE
Sbjct: 19 QSSAKYEYQTKYLDVPLDHFSYVNESVTFKLRYLTNDTY--NPDGSGPILFYTGNEGDIE 76
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
FA N+GF+W++AP+ A LVF EHR+YG+++P+G+ +Y++ L YLT+EQALADFA
Sbjct: 77 LFAQNTGFMWELAPKLKASLVFAEHRFYGKTLPFGN--ASYESPRHLGYLTSEQALADFA 134
Query: 180 VFITNLK-QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP 238
+ + N + A PVV FGGSYGGMLAAW R+KYPH+ GA+A+SAPI QF+
Sbjct: 135 YLLAQINPSNRTVRARPVVAFGGSYGGMLAAWFRMKYPHMVAGAIAASAPIRQFD--TDC 192
Query: 239 ETFYNIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA 297
F I++S F + C I SW L + +GL L + F C L ED+
Sbjct: 193 GVFNQILTSVFSVAYTKECSLNIGRSWDVLKNYSSSADGLKTLNEKFKFCSNLTKAEDVT 252
Query: 298 ----DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSV 353
D+L Y LAM++YPY S F+ P+P YP+RE C ++ +L ++ E +S+
Sbjct: 253 GTFFDYLIDVYGNLAMINYPYNSSFLAPVPAYPVREFCGRLAQNYTGLELLTKLQEALSI 312
Query: 354 YYNYTGNVDCFQLD---DDPHGLDGWNWQ 379
YYNY G C ++ D G GW +Q
Sbjct: 313 YYNYNGQAKCLNINSSYDSSMGDLGWGFQ 341
>gi|356499942|ref|XP_003518794.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Glycine max]
Length = 494
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 193/318 (60%), Gaps = 20/318 (6%)
Query: 69 ETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+T YF+Q LDHF++ TF QRYL+N +W G N PIF Y G E I+ G
Sbjct: 56 KTFYFKQVLDHFNYRPESYTTFQQRYLVNFKYWGGANSSAPIFAYFGAESPIDNSPNGIG 115
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+ D A F A+LV+ EHRYYG+S+P+GS E A +NA+T+ Y + QALAD+A + ++K
Sbjct: 116 FLTDNAASFNALLVYIEHRYYGKSVPFGSREEALKNASTIGYFNSAQALADYAAILEHIK 175
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ L A+ SPV++ GGSYGGMLA+W RLKYPH+ +GALAS+APIL F+ I P +Y++V+
Sbjct: 176 KTLHAQNSPVIVIGGSYGGMLASWFRLKYPHLTVGALASAAPILYFDKITPQNGYYSVVT 235
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSY 306
D++ S +C+ TI +SW E+ V + NGL+ L+ F+ C +N + +L D+L S Y Y
Sbjct: 236 RDYRDASETCYETILKSWSEIHRVASQPNGLVTLSHRFNTCHTVNQSYELIDYLRSTYVY 295
Query: 307 LAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQL 366
A + P P YP+ E+C ID A + IL +I+ GV + GN C
Sbjct: 296 AAQYNQP---------PRYPVSEICGGIDGASLGSDILSKIYAGVVALW---GNNTCKV- 342
Query: 367 DDDPHGLD----GWNWQV 380
+ P + GW WQ
Sbjct: 343 -NGPTNVSETSVGWRWQT 359
>gi|255565523|ref|XP_002523752.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
gi|223537056|gb|EEF38692.1| Lysosomal Pro-X carboxypeptidase, putative [Ricinus communis]
Length = 501
Score = 264 bits (674), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 142/356 (39%), Positives = 207/356 (58%), Gaps = 21/356 (5%)
Query: 32 AAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSF--ADLPTFS 89
A+ +K PR P + + P+ + ET ++ Q LDHF++ TF
Sbjct: 20 ASSTAKPFDIPRLSPTGPRIVQDPEEIFISELVSDDLETFFYNQTLDHFNYNPESYETFQ 79
Query: 90 QRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGE 149
QRY+I++ +W G N PIF+Y G E ++ GF+ D A +F A+L++ EHRYYG+
Sbjct: 80 QRYIISSKYWGGANSSSPIFVYFGAEAPLDGDLTVIGFLADNAAQFNALLLYIEHRYYGK 139
Query: 150 SMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAA 209
S+P+GS A +N + Y + QA+AD+A I ++K+NL AE SPV++ GGSYGGMLA+
Sbjct: 140 SVPFGSQGEALKNGSIRGYFNSAQAIADYAEIIIHVKKNLQAENSPVIVIGGSYGGMLAS 199
Query: 210 WMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVS 269
W RLKYPH+A+GALASSAP+L F+DI P + +Y+I S DF+ S +C+ TI++SW E+
Sbjct: 200 WFRLKYPHLALGALASSAPVLYFDDITPQDGYYSIASRDFREASENCYKTIQKSWAEIDG 259
Query: 270 VGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIRE 329
V GL L+K F C+ L +++L D L+S YS A + P P YP+
Sbjct: 260 VASMPKGLDVLSKKFKTCKPLTDSDELKDRLDSMYSGAAQYNKP---------PTYPVNI 310
Query: 330 VCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDDDPHGLD----GWNWQV 380
+C ID A +++ L++IF GV Y GN C+ + P L GW WQ
Sbjct: 311 ICSGIDGAASSSNDTLDKIFAGVVA---YRGNRSCYI--NPPTNLSETSVGWRWQT 361
>gi|194759919|ref|XP_001962194.1| GF14561 [Drosophila ananassae]
gi|190615891|gb|EDV31415.1| GF14561 [Drosophila ananassae]
Length = 471
Score = 263 bits (673), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 143/328 (43%), Positives = 197/328 (60%), Gaps = 12/328 (3%)
Query: 61 QQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWV--GPNRLGPIFLYCGNEGDI 118
+ +++Y Y + F LDHFSF TFS RYL N D +V N PI Y GNEGDI
Sbjct: 23 KSEEKYHYLIKEFVVPLDHFSFLSNATFSIRYLYN-DSFVRDQSNARTPILFYTGNEGDI 81
Query: 119 EWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADF 178
E FA N+GF+W+ A R A++VF EHRYYG+S+P+G + L+Y T EQ L D+
Sbjct: 82 ELFAQNTGFLWEEAERKHALVVFAEHRYYGKSLPFGKSTFNTSMPEHLAYFTVEQTLEDY 141
Query: 179 AVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP 238
A+ IT L+ + PVV FGGSYGGMLAAW R+KYPH+A+GALA+SAPILQF I
Sbjct: 142 AMLITYLR---NGTQRPVVTFGGSYGGMLAAWFRMKYPHLAVGALAASAPILQFPGITDC 198
Query: 239 ETFYNIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA 297
+ FY IV+S F +++C I +SW ++G + G +++ F+LC + S DL
Sbjct: 199 DIFYRIVTSVFANAYNSNCTVNIAKSWRVFETLGGTDAGKKQISDAFNLCSPVKSDADLK 258
Query: 298 ---DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEGVSV 353
D++E YS LAMV+YPY S F+ PLP +P+R+VC + D +L + ++V
Sbjct: 259 KFLDYVEEVYSNLAMVNYPYSSSFLAPLPAFPVRQVCFYLKDLHTTDADLLHTMASALAV 318
Query: 354 YYNYTGNVDCFQLDDDPHGLD-GWNWQV 380
Y N+TG C + + + GWN Q
Sbjct: 319 YTNFTGTTKCLDISATSNADESGWNIQT 346
>gi|195091774|ref|XP_001997565.1| GH13951 [Drosophila grimshawi]
gi|193906104|gb|EDW04971.1| GH13951 [Drosophila grimshawi]
Length = 472
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 136/306 (44%), Positives = 189/306 (61%), Gaps = 9/306 (2%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
Q ++ YE + F+ LDHFSF +FS RYL N + + + IF Y GNEGDIEWFA
Sbjct: 20 QSKFEYEIKEFQVPLDHFSFLSNASFSIRYLYN-ESYADKSNPKSIFFYTGNEGDIEWFA 78
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
NSGFVW++A + A++VF EHRYYG+S+P+GS L+Y T EQ L D+A+ I
Sbjct: 79 KNSGFVWELAEKERAIVVFAEHRYYGKSLPFGSDTFNASKPEHLAYFTVEQTLEDYAMLI 138
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
T L+ + PVV FGGSYGGMLAAW R+KYPHI +GALA+SAPILQFE + P + F
Sbjct: 139 TFLR---NGRQLPVVAFGGSYGGMLAAWFRIKYPHIVVGALAASAPILQFEGLTPCDIFN 195
Query: 243 NIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL---AD 298
I +S F +A+C I +SW ++ + G +L FHLC + + +DL D
Sbjct: 196 EITTSVFNTAYNANCSANIGKSWHAFETMAATDAGKKQLGDKFHLCEPIKNGDDLNQFMD 255
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEGVSVYYNY 357
+LE Y+ LAM +YPY ++F+ PLP P+R+ C + D D ++ + ++VY NY
Sbjct: 256 YLEDVYANLAMSNYPYNTNFLAPLPANPVRQFCLYLKDLHADEADLVSSMASALAVYTNY 315
Query: 358 TGNVDC 363
+G+ C
Sbjct: 316 SGSDPC 321
>gi|195034070|ref|XP_001988820.1| GH11372 [Drosophila grimshawi]
gi|193904820|gb|EDW03687.1| GH11372 [Drosophila grimshawi]
Length = 472
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 136/306 (44%), Positives = 189/306 (61%), Gaps = 9/306 (2%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
Q ++ YE + F+ LDHFSF +FS RYL N + + + IF Y GNEGDIEWFA
Sbjct: 20 QSKFEYEIKEFQVPLDHFSFLSNASFSIRYLYN-ESYADKSNPKSIFFYTGNEGDIEWFA 78
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
NSGFVW++A + A++VF EHRYYG+S+P+GS L+Y T EQ L D+A+ I
Sbjct: 79 KNSGFVWELAEKERAIVVFAEHRYYGKSLPFGSDTFNASKPEHLAYFTVEQTLEDYALLI 138
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
T L+ + PVV FGGSYGGMLAAW R+KYPHI +GALA+SAPILQFE + P + F
Sbjct: 139 TFLR---NGRQLPVVAFGGSYGGMLAAWFRIKYPHIVVGALAASAPILQFEGLTPCDIFN 195
Query: 243 NIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL---AD 298
I +S F +A+C I +SW ++ + G +L FHLC + + +DL D
Sbjct: 196 EITTSVFNTAYNANCSANIGKSWHAFETMAATDAGKKQLGDKFHLCEPIKNGDDLNQFMD 255
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEGVSVYYNY 357
+LE Y+ LAM +YPY ++F+ PLP P+R+ C + D D ++ + ++VY NY
Sbjct: 256 YLEDVYANLAMSNYPYNTNFLAPLPANPVRQFCLYLKDLHADEADLVSSMASALAVYTNY 315
Query: 358 TGNVDC 363
+G+ C
Sbjct: 316 SGSDPC 321
>gi|290997696|ref|XP_002681417.1| lysosomal carboxypeptidase [Naegleria gruberi]
gi|284095041|gb|EFC48673.1| lysosomal carboxypeptidase [Naegleria gruberi]
Length = 528
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 143/334 (42%), Positives = 193/334 (57%), Gaps = 24/334 (7%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGP-NRLGPIFLYCGNEGDIEWFAVNSG 126
Y+T YF+Q+LDHF F + TF QRYL+ D +VG PIF Y GNEGDI F N+G
Sbjct: 60 YKTLYFDQKLDHFDFTNDKTFKQRYLV-CDSFVGKMTPSTPIFFYTGNEGDIVTFYENTG 118
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
++D AP+F A+++F EHRYYG S P+G V + +L++EQALAD++ FIT +
Sbjct: 119 LMFDTAPQFNALIIFVEHRYYGVSNPFG--PVNSFTPENIKWLSSEQALADYSYFITEMF 176
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+PV+ FGGSYGGML++W R+KYPHI GA+A+SAPI QF + P + I +
Sbjct: 177 GLDEKRTNPVIAFGGSYGGMLSSWWRMKYPHIVDGAIAASAPIFQFTGLTAPNVYNQICT 236
Query: 247 SDFKRES------ASCFNTIKESWGELVSVGQKENG--LLELTKTFHLC----RELNSTE 294
DFK+ S +C IK L Q N L +L+ F +C R
Sbjct: 237 EDFKKSSNLAKYHETCDAVIKNGLSILNQYYQNNNPQILQKLSSQFRICSPGIRTSADVN 296
Query: 295 DLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSI-----LERIFE 349
L WL AY+ L M+DYPYP++F+MPLPGYPI+ +C +I + + L+ + E
Sbjct: 297 QLISWLTQAYNTLPMIDYPYPANFLMPLPGYPIQVICDRISTIMASEKVTTDTYLKAVLE 356
Query: 350 GVSVYYNYTGNVDCFQL---DDDPHGLDGWNWQV 380
G +VYYNYTG C L D G DGW +Q
Sbjct: 357 GANVYYNYTGTSTCNNLTQPDSPSLGDDGWEYQT 390
>gi|356498058|ref|XP_003517871.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Glycine max]
Length = 471
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 135/318 (42%), Positives = 190/318 (59%), Gaps = 20/318 (6%)
Query: 69 ETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+T YF+Q LDHF++ TF QRYLIN +W G N PIF Y G E I+ G
Sbjct: 23 KTFYFKQVLDHFNYRPESYTTFQQRYLINFKYWGGANSSAPIFAYFGAESPIDNSPNGVG 82
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+ D A F A+LV+ EHRYYG+S+ +GS E A +NA+T+ Y + QALAD+A + ++K
Sbjct: 83 FLTDNAASFNALLVYIEHRYYGKSVQFGSREEALKNASTIGYFNSAQALADYASILKHVK 142
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ L A+ SPV++ GGSYGGMLA+W RLKYPH+AIGALASSAPIL F+ I P +Y+ V+
Sbjct: 143 KTLHAKNSPVIVIGGSYGGMLASWFRLKYPHLAIGALASSAPILYFDKITPQNGYYSTVT 202
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSY 306
D++ S +C+ T+ +SW E+ + + NGL+ L++ F+ C LN + +L D+L S Y Y
Sbjct: 203 RDYREASETCYETVLKSWSEIRRIASQPNGLVTLSQRFNTCHTLNQSYELIDYLRSTYVY 262
Query: 307 LAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQL 366
A + P P YP+ +C ID + IL +I+ G+ GN C
Sbjct: 263 AAQYNQP---------PRYPVSMICGGIDGESLGSDILSKIYAGIVA---LRGNSTC--K 308
Query: 367 DDDPHGLD----GWNWQV 380
+ P + GW WQ
Sbjct: 309 VNGPTNVSETTVGWRWQT 326
>gi|148906489|gb|ABR16397.1| unknown [Picea sitchensis]
Length = 508
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 141/315 (44%), Positives = 188/315 (59%), Gaps = 20/315 (6%)
Query: 68 YETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y T + Q LDHF+F TF QRYL+N +W GP PIF+ GNE DI
Sbjct: 78 YTTNQYMQTLDHFTFRPDGYRTFPQRYLVNKTYWGGPQNNSPIFVCLGNEEDIITQLPYF 137
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G + + A F A++VF EHRYYG SMP+GS + +Y NA+TL Y +A QALAD+A+ IT+L
Sbjct: 138 GIMTEHAADFRALIVFIEHRYYGTSMPFGSQDESYANASTLGYYSASQALADYAIVITDL 197
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K+NLSA+ PVV+FGGSYGGMLAAW+RLKYPHI IGALASS+PIL FED+ P + + +V
Sbjct: 198 KKNLSADDCPVVVFGGSYGGMLAAWLRLKYPHITIGALASSSPILYFEDMTPHDAYDRVV 257
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
+ DF+ S C+ IKESW E+ V + GL +L K F+ C+ L+S DL W+ A
Sbjct: 258 TKDFRDASDICYRRIKESWAEMEKVASQPGGLQKLAKLFNTCQSLSSWTDLFYWIYPALQ 317
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQ 365
A ++ ++ +C+ I++ T IL R+ G Y N + C
Sbjct: 318 VSAQYNFA------------EVKAICRVINSQARGTDILTRLAAGAE-YAN--EGLGCLN 362
Query: 366 LDDDPHGLDGWNWQV 380
L L GW+WQ
Sbjct: 363 LSTT---LSGWDWQT 374
>gi|167888448|gb|ACA09612.1| prolyl carboxypeptidase [Tenebrio molitor]
Length = 488
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 140/330 (42%), Positives = 192/330 (58%), Gaps = 17/330 (5%)
Query: 62 QQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWF 121
Q Y Y T+YFE +DHFSF + TF +YLIN WV GPIF Y GNEG IE F
Sbjct: 12 QVLSYNYTTKYFEVPVDHFSFTNNATFKLKYLINNSFWVDD---GPIFFYTGNEGTIENF 68
Query: 122 AVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF 181
A N GF++DIAP+F A+LVF EHRYYGES+P+G + +Y + L YLT+ QALAD+
Sbjct: 69 AENMGFMFDIAPQFNALLVFAEHRYYGESLPFG--DESYADPARLGYLTSNQALADYVDL 126
Query: 182 ITNLKQNLS----AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP 237
I L+ S + PVV FGGSYGGMLA+W+R+K+P +GA+ASSAPI QF+ + P
Sbjct: 127 INYLQTTRSRSSYSNKVPVVAFGGSYGGMLASWLRMKFPASVVGAIASSAPIWQFQGLTP 186
Query: 238 PETFYNIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED- 295
E F IV+ +K C + SW + ++ + G L+ + LC L + D
Sbjct: 187 CENFNRIVTDVYKTALDDDCSVPLSRSWKVIRNITSNDAGKAWLSAQWKLCTPLKTQSDV 246
Query: 296 --LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP--DATSILERIFEGV 351
L +W +AMV+YPYP+ F+ PLP YP+R C K+ + D ++L I +
Sbjct: 247 DTLVNWFSEIVVNMAMVNYPYPTSFLAPLPAYPVRSFCYKMTESQTVDDETLLTTIGAAL 306
Query: 352 SVYYNYTGNVDCFQLDDDPHGL--DGWNWQ 379
+Y NYT C +++ L D W++Q
Sbjct: 307 EIYTNYTQTTKCNVINETAAALGEDAWDFQ 336
>gi|219362661|ref|NP_001136852.1| uncharacterized protein LOC100217003 [Zea mays]
gi|194697358|gb|ACF82763.1| unknown [Zea mays]
Length = 319
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 117/170 (68%), Positives = 140/170 (82%)
Query: 211 MRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSV 270
MRLKYPHIAIGALASSAPILQFEDIVP FY++VS DF+RES SCF TIK+SW EL
Sbjct: 1 MRLKYPHIAIGALASSAPILQFEDIVPSTIFYDLVSDDFRRESLSCFLTIKDSWKELDDQ 60
Query: 271 GQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREV 330
+++GLL+L+KTFHLC+ L ++ DL+DWL SAYSYLAMVDYP PS+F+MPLP PI+EV
Sbjct: 61 ANEQDGLLKLSKTFHLCQTLKTSGDLSDWLSSAYSYLAMVDYPLPSEFLMPLPANPIKEV 120
Query: 331 CKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
C+ ID+ P T LERI+ GV+VYYNYTG VDCF L+DDPHG+ GW+WQ
Sbjct: 121 CRNIDSQPKGTGTLERIYAGVNVYYNYTGTVDCFDLNDDPHGMGGWDWQA 170
>gi|91092240|ref|XP_971305.1| PREDICTED: similar to prolylcarboxypeptidase [Tribolium castaneum]
gi|270014428|gb|EFA10876.1| hypothetical protein TcasGA2_TC001698 [Tribolium castaneum]
Length = 488
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 134/325 (41%), Positives = 197/325 (60%), Gaps = 16/325 (4%)
Query: 66 YRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y Y T++ + LDHFSF + TF +YLIN W+ GPIF Y GNEG +E FA N+
Sbjct: 17 YNYTTKFIDVPLDHFSFTNNATFKLKYLINDSFWIDD---GPIFFYTGNEGAVETFAENT 73
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF++DIAP F A++VF EHRYYG ++P+G+ ++ N L +LT+ QALAD+ I +L
Sbjct: 74 GFIFDIAPTFNALIVFAEHRYYGATLPFGN--ASFSNPGHLGFLTSSQALADYVYLINHL 131
Query: 186 K----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
+ ++ PVV FGGSYGGMLAAW+R+KYP +GA+A+SAPI QF+ + P E F
Sbjct: 132 QTTHQRSEYLSKVPVVAFGGSYGGMLAAWLRMKYPASVVGAIAASAPIWQFQGLTPCENF 191
Query: 242 YNIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNST--EDLAD 298
IVS+ +K C I++SW + ++ ++G LTK + LC L S+ +DL +
Sbjct: 192 NRIVSNVYKTAVDDDCSAPIQKSWKIIRNITANDDGKAWLTKAWKLCSPLKSSDIDDLLE 251
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI--DNAPDATSILERIFEGVSVYYN 356
W +AMV+YPYP+ F+ PLP +P+R C K+ + D S++ I + +Y N
Sbjct: 252 WYSEILVNMAMVNYPYPTKFLAPLPAFPVRNFCYKLTGEKITDDKSLVTAIGNALEIYTN 311
Query: 357 YTGNVDCFQLDDDPHGL--DGWNWQ 379
+T C ++ L +GW++Q
Sbjct: 312 FTKATKCNNINQTAASLGEEGWDFQ 336
>gi|356543520|ref|XP_003540208.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Glycine max]
Length = 511
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 194/314 (61%), Gaps = 14/314 (4%)
Query: 70 TRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
T Y++Q LDHF TF QRYLIN +W G N PIF Y G E I+ GF
Sbjct: 62 TFYYKQVLDHFKXRPESYKTFQQRYLINFKYWGGANSSAPIFAYLGAEEPIDGSPELIGF 121
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+ D A F A++V+ EHRYYG+S+P+GS E A +NA+T+ Y + QA+AD+A + ++K+
Sbjct: 122 LTDNAASFNALIVYIEHRYYGKSVPFGSREEALKNASTIGYFNSAQAIADYASVLIHIKK 181
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
L A+ SPV++ GGSYGGMLA+W RLKYPH+AIGALASSAPIL F++I P + +Y++VS
Sbjct: 182 TLHAQKSPVIVIGGSYGGMLASWFRLKYPHLAIGALASSAPILYFDNITPQDGYYSVVSR 241
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYL 307
DF+ S +C+ TI +SW E+ V + GL L++ F+ CR LN + +L D+L + Y+
Sbjct: 242 DFREASETCYQTILKSWSEIDRVASQPKGLSILSQRFNTCRPLNESSELKDYLINMYASS 301
Query: 308 AMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEG-VSVYYNYTGNVDCFQL 366
A ++P P YP+ +C ID A + IL +I+ G V++ N T V+ +
Sbjct: 302 AQYNHP---------PRYPVTVICGGIDRASFGSDILSKIYAGLVALRGNTTCKVNGPII 352
Query: 367 DDDPHGLDGWNWQV 380
+ GW WQ
Sbjct: 353 VSET--TLGWRWQT 364
>gi|224058949|ref|XP_002299659.1| predicted protein [Populus trichocarpa]
gi|222846917|gb|EEE84464.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 136/319 (42%), Positives = 190/319 (59%), Gaps = 27/319 (8%)
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ET + Q LDHF++ F QRYLIN+ +W G N PI +Y G E I+ + +
Sbjct: 1 FETFFHNQTLDHFNYRPESYDKFPQRYLINSKYWGGANVSAPILVYLGAEASIDGYRDAA 60
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ D A +F ++LVF EHRYYG S P G+ Y ++ QALAD+A I ++
Sbjct: 61 GFLDDNAVQFKSLLVFIEHRYYGHSFPPGAW-------GKRGYFSSAQALADYAAIIIDI 113
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K+N SA+ SPV++ GGSYGGMLA+W RLKYPHIA+GALASSAPIL F+DI P + +Y++V
Sbjct: 114 KENRSAQYSPVIVIGGSYGGMLASWFRLKYPHIALGALASSAPILYFDDITPQDAYYSVV 173
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
+ +F+ S +C+ TIK SW E+ + K +GL L+ F+ C+ L +L D+L Y+
Sbjct: 174 TKEFREASETCYQTIKTSWSEIDELASKPDGLSMLSTKFNTCKNLTDASELKDYLRLMYA 233
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQ 365
Y A + P P YP+ EVCK ID+ IL RIF GV YY GN C+
Sbjct: 234 YAAQNNSP---------PTYPVNEVCKGIDDDASGDDILSRIFRGVVAYY---GNRTCY- 280
Query: 366 LDDDPHGLDG-----WNWQ 379
+++ + W+WQ
Sbjct: 281 FNNNAYAYQSEATLDWSWQ 299
>gi|224143318|ref|XP_002324914.1| predicted protein [Populus trichocarpa]
gi|222866348|gb|EEF03479.1| predicted protein [Populus trichocarpa]
Length = 474
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 134/319 (42%), Positives = 189/319 (59%), Gaps = 16/319 (5%)
Query: 66 YRYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
+ +T Y+ Q LDHF++ F QRY+IN+ +W G N PIF+Y G E +E
Sbjct: 26 HNLQTFYYNQTLDHFNYRPDSFDMFQQRYVINSKYWGGANSNAPIFVYFGEEAPLENDFG 85
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
+ G + + A RF A+ V+ EHRYYG+S+P+GS A++NA+TL Y + QALAD+A I
Sbjct: 86 DIGILAENAHRFKALQVYIEHRYYGKSIPFGSRNEAFKNASTLGYFNSAQALADYAEIII 145
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
++ + + SPV++ G SYGGMLA+W RLKYPHIA+GALASSAPIL F DI P + +
Sbjct: 146 HVNEKFHVQRSPVIVVGASYGGMLASWFRLKYPHIALGALASSAPILYFTDITPAHAYVS 205
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
IV+ DF+ +S SC +TIK+SW + + + +GL L+K F C+ LN++ +L D+L
Sbjct: 206 IVTKDFREDSQSCHDTIKKSWTVIDKIASEPDGLSILSKKFETCKPLNNSSELTDYLAGI 265
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC 363
Y A D P P YP+ VCK ID IL RIF G+ Y G + C
Sbjct: 266 YMAAAQYDAP---------PSYPVTMVCKSIDEPSFGNDILGRIFAGMVA---YQGELPC 313
Query: 364 F--QLDDDPHGLDGWNWQV 380
+ + + GW+WQ
Sbjct: 314 YVNEPTKETETDVGWSWQT 332
>gi|449496302|ref|XP_004160098.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 514
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 192/316 (60%), Gaps = 16/316 (5%)
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++T Y+ Q LDHF++ TF QRY+IN +W GPN PIF Y G E I+
Sbjct: 73 FKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWGGPNSSAPIFAYLGAEAPIDDDLDFI 132
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ D A +F A+L++ EHRYYG+S+P+ S + A NA+TL Y + QA+AD+A + ++
Sbjct: 133 GFMTDNAIQFNALLIYIEHRYYGKSIPFRSRDEALGNASTLGYFNSAQAIADYAAILIHV 192
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K+ A SPV++ GGSYGGMLA+W RLKYPH+A+GALASSAPIL F+DI P + +Y++V
Sbjct: 193 KKEFHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYSVV 252
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
+ DF+ S +C+ TIK+SW E+ +V + NGL L + F CR L +L D+L S Y+
Sbjct: 253 TKDFRGLSETCYETIKKSWSEIETVAYQPNGLSILDQEFKTCRPLRGYFELEDYLWSMYA 312
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF- 364
A ++P P YP+ +C ID L +I GV + + G+V C+
Sbjct: 313 SAAQYNHP---------PKYPVTRICDAIDGTYSVNGTLSKIAAGV---FAFRGSVSCYI 360
Query: 365 -QLDDDPHGLDGWNWQ 379
+ ++ GW WQ
Sbjct: 361 NEPRNETETDVGWRWQ 376
>gi|449456172|ref|XP_004145824.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 507
Score = 258 bits (658), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 192/316 (60%), Gaps = 16/316 (5%)
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++T Y+ Q LDHF++ TF QRY+IN +W GPN PIF Y G E I+
Sbjct: 66 FKTYYYNQTLDHFNYRPESYTTFPQRYIINFKYWGGPNSSAPIFAYLGAEAPIDDDLDFI 125
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ D A +F A+L++ EHRYYG+S+P+ S + A NA+TL Y + QA+AD+A + ++
Sbjct: 126 GFMTDNAIQFNALLIYIEHRYYGKSIPFRSRDEALGNASTLGYFNSAQAIADYAAILIHV 185
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K+ A SPV++ GGSYGGMLA+W RLKYPH+A+GALASSAPIL F+DI P + +Y++V
Sbjct: 186 KKEFHANYSPVIVIGGSYGGMLASWFRLKYPHVALGALASSAPILYFDDITPQDGYYSVV 245
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
+ DF+ S +C+ TIK+SW E+ +V + NGL L + F CR L +L D+L S Y+
Sbjct: 246 TKDFRGLSETCYETIKKSWSEIETVAYQPNGLSILDQEFKTCRPLRGYFELEDYLWSMYA 305
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF- 364
A ++P P YP+ +C ID L +I GV + + G+V C+
Sbjct: 306 SAAQYNHP---------PKYPVTRICDAIDGTYSVNGTLSKIAAGV---FAFRGSVSCYI 353
Query: 365 -QLDDDPHGLDGWNWQ 379
+ ++ GW WQ
Sbjct: 354 NEPRNETETDVGWRWQ 369
>gi|224058953|ref|XP_002299661.1| predicted protein [Populus trichocarpa]
gi|222846919|gb|EEE84466.1| predicted protein [Populus trichocarpa]
Length = 437
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 140/358 (39%), Positives = 203/358 (56%), Gaps = 30/358 (8%)
Query: 30 SLAAQPSKFRRAPRFVGKLPHL-TEPPQRQQRQQQQQYRYETRYFEQRLDHFSF--ADLP 86
+ A + PR P + + P + + + +ET ++ Q LDHF++
Sbjct: 87 TATATAKRLNTIPRHSPIGPRVWRDQPDKTTSCEVDEEGFETCFYNQTLDHFNYRPESYD 146
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRY 146
TF QRY+IN+ +W G N I +Y G E I+ + +GF+ D A +F ++LVF EHRY
Sbjct: 147 TFPQRYVINSKYWGGAN--ASILVYLGAEASIDGYRDAAGFLDDNAVQFKSLLVFIEHRY 204
Query: 147 YGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGM 206
YG S P G+ Y ++ QALAD+A I ++K+N SA+ SPV++ GGSYGGM
Sbjct: 205 YGHSFPPGAW-------GKRGYFSSAQALADYAAIIIDIKENRSAQYSPVIVIGGSYGGM 257
Query: 207 LAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGE 266
LA+W RLKYPHIA+GALASSAPIL F+DI P + +Y++V+ +F+ S +C+ TIK SW E
Sbjct: 258 LASWFRLKYPHIALGALASSAPILYFDDITPQDAYYSVVTKEFREASETCYQTIKTSWSE 317
Query: 267 LVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYP 326
+ + K +GL L+ F+ C+ L +L D+L Y+Y A + P P YP
Sbjct: 318 IDELASKPDGLSMLSTKFNTCKNLTDASELKDYLRLMYAYAAQNNSP---------PTYP 368
Query: 327 IREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDG-----WNWQ 379
+ EVCK ID+ IL RIF GV YY GN C+ +++ + W+WQ
Sbjct: 369 VNEVCKGIDDDASGDDILSRIFRGVVAYY---GNRTCY-FNNNAYAYQSEATLDWSWQ 422
>gi|281206058|gb|EFA80247.1| peptidase S28 family protein [Polysphondylium pallidum PN500]
Length = 481
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 140/326 (42%), Positives = 195/326 (59%), Gaps = 21/326 (6%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWV---GPNRLGPIFLYCGNEGDIEWFAVN 124
YE Y+ Q LDHF+F + +F QRYL++ +W GP P+ Y GNEGDI WF N
Sbjct: 29 YEEFYYMQTLDHFNFYNKGSFPQRYLVSDTYWTRPTGPICESPVLFYTGNEGDIVWFYEN 88
Query: 125 SGFVWDI-APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
S FV ++ A GA+LVF EHR+YGE+MP+G++ +N + YLT+EQALAD+A I
Sbjct: 89 SQFVTNVLAKEMGALLVFAEHRFYGETMPFGNSSSLPEN---IGYLTSEQALADYAQLIP 145
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI-VPPETFY 242
+ +L PV+ GGSYGGMLA+W R+KYP+I GALA+SAPIL F E F
Sbjct: 146 AVLSDLGGSHCPVLAVGGSYGGMLASWFRMKYPNIIDGALAASAPILYFLGTNANSEGFN 205
Query: 243 NIVSSDFKRESA--SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL 300
I + DF S+ +C I ++ E+ + +NGL LTKTF LC EL DL +WL
Sbjct: 206 EIATIDFAETSSEGTCATRIHSAFNEITQMSNTQNGLNVLTKTFSLCEELTELSDLINWL 265
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTG- 359
E+A +Y+AM DYPYP++F+ P+PG PI C + D ++ + + ++VY+N +G
Sbjct: 266 EAAITYMAMADYPYPANFLEPMPGNPINVSCSLLAKETDN---IQGLVQVMNVYFNSSGQ 322
Query: 360 -----NVDCFQLDDDPHGLDGWNWQV 380
NV + G DGW++Q
Sbjct: 323 AGQCNNVSVYT--TGALGSDGWDYQA 346
>gi|242096524|ref|XP_002438752.1| hypothetical protein SORBIDRAFT_10g025540 [Sorghum bicolor]
gi|241916975|gb|EER90119.1| hypothetical protein SORBIDRAFT_10g025540 [Sorghum bicolor]
Length = 306
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 126/228 (55%), Positives = 162/228 (71%), Gaps = 12/228 (5%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHWVGPN-----RLGPIFLYCGNEGDIEW 120
+ YF Q LDHF+F + F Q+YL+N W + GP+F++ G E DIE
Sbjct: 84 FTEHYFPQELDHFTFRPNASTVFYQKYLVNDTFWRRSSGRKGGSTGPLFVFTGGETDIES 143
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
A+N+GF++DIAP+FGA+LVF EHR+YGESMP+ S + L YLT+ QALADFA+
Sbjct: 144 IAINAGFMFDIAPKFGALLVFIEHRFYGESMPFRS-----NSTEALGYLTSTQALADFAI 198
Query: 181 FITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET 240
IT+LKQNLSAE +PVV+FGGSYGGMLA+W RLKYPH+ IGALASSAPILQF+ I P +
Sbjct: 199 LITSLKQNLSAETAPVVVFGGSYGGMLASWFRLKYPHVTIGALASSAPILQFDYITPWSS 258
Query: 241 FYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR 288
FY++VS D+K ES +CF+ IK +W L G +NGLLEL+K F C+
Sbjct: 259 FYDVVSQDYKSESLNCFSVIKAAWDVLEERGSNDNGLLELSKLFRACK 306
>gi|91088397|ref|XP_972807.1| PREDICTED: similar to prolylcarboxypeptidase [Tribolium castaneum]
gi|270012192|gb|EFA08640.1| hypothetical protein TcasGA2_TC006303 [Tribolium castaneum]
Length = 478
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 136/322 (42%), Positives = 195/322 (60%), Gaps = 16/322 (4%)
Query: 66 YRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y YET+YFE LDHFSF + TF +YLIN W GPIF Y GNEG +E FA N+
Sbjct: 18 YDYETKYFEVLLDHFSFTNNATFKLKYLINDTFWTND---GPIFFYTGNEGTVENFAENT 74
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF++DIAP F A++VF EHRYYGES+P+G+ ++ + + + YLT+ QALADF I N
Sbjct: 75 GFMFDIAPSFNALVVFAEHRYYGESLPFGND--SFVSPSHIGYLTSSQALADFVDLI-NY 131
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
Q +S E PV+ FGGSYGGMLA+W+R+KYP +GA+A+SAPI QFE P E FY +V
Sbjct: 132 LQTMSLEKVPVIAFGGSYGGMLASWLRMKYPASVVGAIAASAPIWQFE--TPCEDFYKVV 189
Query: 246 SSDFKRESAS-CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLE 301
+ ++ A C I +SW L ++ + G L+ + LC L ++ D L W
Sbjct: 190 TRVYQEAVAKDCPLLITKSWTALRNISESPEGKAWLSDAWQLCSPLETSADVETLIGWYS 249
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID--NAPDATSILERIFEGVSVYYNYTG 359
+AMV+YPY + F+ PLP +P++ C ++ N D S++ + + + +Y N+T
Sbjct: 250 EILVNMAMVNYPYSTSFLAPLPPFPVKTFCSQLTQANIVDDKSLVMALGDALQIYTNFTE 309
Query: 360 NVDCFQLDDDPHGL--DGWNWQ 379
C +++ L +GW +Q
Sbjct: 310 TTTCNKINQTAEALGEEGWYFQ 331
>gi|391328440|ref|XP_003738697.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Metaseiulus
occidentalis]
Length = 459
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 132/324 (40%), Positives = 199/324 (61%), Gaps = 12/324 (3%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
Q Y Y+T +FE ++DHFSFA +F R L +D + + GP+F Y GNEGDIE F
Sbjct: 16 QGYDYQTFWFETKIDHFSFARNDSFKMRVLY-SDKYFDSSEPGPVFFYTGNEGDIETFTN 74
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
N+G +WD A F A+L+F EHR+YG+SMP+G + +Y YLTAEQALADFA I
Sbjct: 75 NTGLMWDWAADFKALLIFAEHRFYGKSMPFG--DKSYDTYKQYGYLTAEQALADFADLIQ 132
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
++K N + VV FGGSYGGML+AWMR+KYP + A+A+SAPILQF+D+ F
Sbjct: 133 HVKNNWPVK--KVVAFGGSYGGMLSAWMRIKYPWLIDAAIAASAPILQFQDVTACGVFDK 190
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQK-ENGLLELTKTFHLCREL--NSTEDLADWL 300
IV+ F + S C + I+ SW L +G+ ENG + + F +C+ + ++ + DWL
Sbjct: 191 IVTKAFAKASERCADNIRRSWIALEKLGKDGENGSALIRENFRICQNVLPSNYTAVRDWL 250
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTG 359
Y LAM++YPY ++F+ +PG+P++ C +D + +L+ +++ ++V++N++G
Sbjct: 251 HDTYGNLAMINYPYGTNFLKKVPGHPVQVSCSFLDKDFHCDAELLKGVYQAINVFHNFSG 310
Query: 360 NVDCFQLDD---DPHGLDGWNWQV 380
+ C + + D GWN Q
Sbjct: 311 DTQCNDVGNSGGDNISDAGWNIQT 334
>gi|224109990|ref|XP_002333167.1| predicted protein [Populus trichocarpa]
gi|222835050|gb|EEE73499.1| predicted protein [Populus trichocarpa]
Length = 514
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 139/358 (38%), Positives = 200/358 (55%), Gaps = 30/358 (8%)
Query: 30 SLAAQPSKFRRAPRFVGKLPHL-TEPPQRQQRQQQQQYRYETRYFEQRLDHFSF--ADLP 86
+ A + PR P + + P + + + +ET ++ Q LDHF++
Sbjct: 19 TATATAKRLNTIPRHSPIGPRVWRDQPDKTTSCEVDEEGFETCFYNQTLDHFNYRPESYD 78
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRY 146
TF QRY+IN+ +W G N I +Y G E I+ + GF+ D A +F ++LV EHRY
Sbjct: 79 TFPQRYVINSKYWGGAN--ASILVYLGAEASIDRYLDAGGFLVDNAVQFKSLLVVIEHRY 136
Query: 147 YGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGM 206
YG+S+P GS Y + QALAD+A I ++K+ L A+ SPV++ GGSYGGM
Sbjct: 137 YGQSIPPGS-------WGKRGYFNSAQALADYAAIIIHIKKTLRAQYSPVIVIGGSYGGM 189
Query: 207 LAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGE 266
LA+W RLKYPHIA+GALASSAPIL F+DI P + +Y++V+ F+ S +C+ TIK SW E
Sbjct: 190 LASWFRLKYPHIALGALASSAPILYFDDITPQDAYYSVVTKAFREASETCYQTIKTSWSE 249
Query: 267 LVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYP 326
+ + K +GL L+ F+ C+ L +L D+L Y+Y A + P P YP
Sbjct: 250 IDELASKPDGLSMLSTKFNTCKNLTDASELKDYLRLMYAYAAQNNSP---------PTYP 300
Query: 327 IREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDG-----WNWQ 379
+ EVCK ID+ IL RIF GV YY GN C+ +++ + W+WQ
Sbjct: 301 VNEVCKGIDDDASGDDILSRIFRGVVAYY---GNRTCY-FNNNAYAYQSEATLDWSWQ 354
>gi|449456174|ref|XP_004145825.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 491
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 135/316 (42%), Positives = 188/316 (59%), Gaps = 17/316 (5%)
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++T YF Q LDHF++ TF QRY+IN +W G N PI Y G E I+
Sbjct: 50 FKTFYFNQTLDHFNYRPESYTTFPQRYIINFKYWGGANSSAPILAYLGPEAPIDSAMNVI 109
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ D A +F A+LV+ EHRYYG+S+P+GS + A +NA+TL Y + QALAD+A + ++
Sbjct: 110 GFMTDNAVKFNALLVYIEHRYYGKSIPFGSRKEALRNASTLGYFNSAQALADYAAILIHV 169
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K+ SA+ SPV++ GGSYGGMLA W RLKYPH+A+GALASSAPIL F DI P +Y IV
Sbjct: 170 KKEFSAKYSPVIVIGGSYGGMLATWFRLKYPHVALGALASSAPILYFNDITPENGYYVIV 229
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
+ DF+ S +C+ +I+ESW E+ +V + NGL L K F C L S+ L ++L +
Sbjct: 230 TKDFREVSQTCYESIRESWSEIETVASQSNGLSVLDKVFKTCSPLRSSTQLENYL--WFM 287
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQ 365
Y + Y +PS YP+ +C ID + L +I GV + Y G + C+
Sbjct: 288 YASAAQYNHPSR-------YPVNRICDAIDQTY-SNGTLGKIAAGV---FAYRGELSCYI 336
Query: 366 LD--DDPHGLDGWNWQ 379
+ + GW WQ
Sbjct: 337 NEPINTTETTVGWQWQ 352
>gi|312080065|ref|XP_003142442.1| hypothetical protein LOAG_06858 [Loa loa]
gi|307762394|gb|EFO21628.1| hypothetical protein LOAG_06858 [Loa loa]
Length = 509
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 144/360 (40%), Positives = 210/360 (58%), Gaps = 35/360 (9%)
Query: 46 GKLPHLTEPPQR---QQRQQQQQYRYETRYFEQR-LDHFSFADLPTFSQRYLINTDHWVG 101
G L +P R QQ++ + Y++ + E +DHFSF D TF RYLINTD++
Sbjct: 17 GNALRLYDPVTRIELQQKKGDKNYKWSEEWLENVPIDHFSFHDNRTFRLRYLINTDYFA- 75
Query: 102 PNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQ 161
GPIF Y GNEG++E FA N+G +WD+AP+ AM+VF EHR+YG+S P+G+ +Y
Sbjct: 76 --HNGPIFFYTGNEGNVELFAQNTGLMWDLAPQLNAMVVFAEHRFYGKSQPFGNK--SYI 131
Query: 162 NATTLSYLTAEQALADFAVFITNLKQNL--SAEASPVVLFGGSYGGMLAAWMRLKYPHIA 219
YL++EQAL DFA+ I +LK A+ S V+ FGGSYGGMLAAWMR+KYPH+
Sbjct: 132 TIQNFGYLSSEQALGDFALLINHLKNKYLSMAQNSSVIAFGGSYGGMLAAWMRIKYPHLV 191
Query: 220 IGALASSAPILQFEDI-VPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLL 278
G++ASSAP+ F D+ VP + + +IV F S I W L ++ +G
Sbjct: 192 EGSIASSAPVFWFIDMSVPDDAYSHIVKRSF-VNSGCIERNIINGWIALKNLSSTASGRD 250
Query: 279 ELTKTFHLCRE--LNSTED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKK 333
L + FHL ++ L S D L ++LE + +AMV+YPYPS+++ LPG+P++ C+
Sbjct: 251 YLNRLFHLDKKSYLKSNTDWIMLKEYLEDIFQSMAMVNYPYPSNYLAKLPGWPVKVACQF 310
Query: 334 IDNAPDATS--ILERIFEGVSVYYNYTGNVDCFQLD------------DDPHGLDGWNWQ 379
+N T + + ++ +++YYNYTG + F +D DP GW+WQ
Sbjct: 311 FNNTNKQTDKELAQSMYGIMNLYYNYTGQKEQFCIDPKVCKDTAYEALGDP---IGWSWQ 367
>gi|328868406|gb|EGG16784.1| peptidase S28 family protein [Dictyostelium fasciculatum]
Length = 505
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 144/382 (37%), Positives = 217/382 (56%), Gaps = 33/382 (8%)
Query: 17 VITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYR-----YETR 71
V+ +++SI+S SK + + P+ +E + ++ + ++ +
Sbjct: 7 VLLTIVVSIVS--------SKEQLYQKIKSPYPYYSEKNKNREFKGVEENDPSPPPFSEY 58
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRL-----GPIFLYCGNEGDIEWFAVNSG 126
Y+ Q+LDHF+F TF QRYLI+ +W P+ GPI Y GNEGDI WF NS
Sbjct: 59 YYIQKLDHFNFQTQQTFPQRYLISDTYWNKPSSNDSQCNGPILFYTGNEGDIVWFYQNSQ 118
Query: 127 FVWDI-APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
F+ ++ A GA+L F EHRYYG+++P+G+ +N L Y T+EQALAD+A I +
Sbjct: 119 FITNVLAQELGALLFFAEHRYYGQTLPFGNESTVPEN---LQYCTSEQALADYATIIPQV 175
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI-VPPETFYNI 244
++L PV+ GGSYGGMLA+WMR+KYP+I GALA+SAPIL F PE F I
Sbjct: 176 LEDLGGLNCPVISVGGSYGGMLASWMRMKYPNIVDGALAASAPILYFLGTGADPEGFNEI 235
Query: 245 VSSDFKRESA--SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES 302
++DF + SA SC I+ ++ E+ + +K NG L++ F LC + S +DL +W+ES
Sbjct: 236 ATNDFAQTSADGSCATRIRGAFTEISEIAEKPNGDELLSEMFSLCG-VQSVDDLVNWIES 294
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV- 361
+Y+AM DYPYP+ F+ P+PG+PI C ++ D L ++ +YYN +G
Sbjct: 295 GLTYMAMADYPYPAAFLEPMPGWPINASCAAMEPVQDDIQALLQVLH---IYYNSSGQAG 351
Query: 362 DCFQLD---DDPHGLDGWNWQV 380
C+ + G D W++Q
Sbjct: 352 SCYNVSVFTTGALGSDVWDYQA 373
>gi|195998996|ref|XP_002109366.1| hypothetical protein TRIADDRAFT_21030 [Trichoplax adhaerens]
gi|190587490|gb|EDV27532.1| hypothetical protein TRIADDRAFT_21030 [Trichoplax adhaerens]
Length = 469
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 122/278 (43%), Positives = 177/278 (63%), Gaps = 9/278 (3%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
Q +Y+T+YF+Q +DHF + T+ QRYL+N DHW GPIF Y GNEG I F
Sbjct: 25 QAAVKYKTKYFDQIIDHFDWKSNATYRQRYLMNDDHW--DKGTGPIFFYTGNEGGIVGFW 82
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
NSG ++D+AP+F A++VF EHRYYG+S+P+G +N L LT+EQALAD+AV +
Sbjct: 83 QNSGLLFDLAPQFRALIVFGEHRYYGKSLPFGKDSFKPKN---LGLLTSEQALADYAVLL 139
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
T+LK++L+A VV FGGSYGGML AWMRLKYP+I LA+SAP+ +V P F+
Sbjct: 140 TSLKKSLNANKCKVVAFGGSYGGMLTAWMRLKYPNIIDAGLAASAPLYMAGGVVSPNFFF 199
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADW 299
V+ D++ + C I++++ ++ + K ++ K F++C +L ++ D L W
Sbjct: 200 PAVTKDYQDANPKCVPNIRKAFSAVLEMA-KSKSKQKVAKIFNVCNKLKTSADVKQLIGW 258
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNA 337
+ + + +AM DYPYP+ F PLP +P+ CK I NA
Sbjct: 259 IRNGFVSMAMGDYPYPASFFGPLPAFPVNASCKYIVNA 296
>gi|224104759|ref|XP_002313555.1| predicted protein [Populus trichocarpa]
gi|222849963|gb|EEE87510.1| predicted protein [Populus trichocarpa]
Length = 499
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 145/355 (40%), Positives = 202/355 (56%), Gaps = 20/355 (5%)
Query: 32 AAQPSKFRRAPRF--VGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSF--ADLPT 87
A + PR +G L P Q ++ + ET ++ Q LDHF++ T
Sbjct: 23 TATAKRLHTIPRLSPIGPRVWLDHPDQILGESVREDF--ETFFYNQTLDHFNYRPESYDT 80
Query: 88 FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYY 147
F QRYLIN+ +W G N PI +Y G E I+ GF+ D A F ++LV+ EHRYY
Sbjct: 81 FLQRYLINSKYWGGANASAPILVYLGAEAPIDGDLDAVGFLVDTAVEFNSLLVYVEHRYY 140
Query: 148 GESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGML 207
G+S+P+GS E A +NA+TL Y + QA+AD+A I ++K+ L A+ SPV++ GGSYGGML
Sbjct: 141 GKSIPFGSREEALKNASTLGYFNSAQAIADYAAIIIHIKKTLQAKDSPVIVIGGSYGGML 200
Query: 208 AAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGEL 267
A+W RLKYPHIA+GALASSAP+L F+DI P +Y +VS DF+ S +C+ TI+ESW E+
Sbjct: 201 ASWFRLKYPHIALGALASSAPVLYFDDITPQYGYYALVSKDFRGASETCYQTIRESWEEI 260
Query: 268 VSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPI 327
V K +GL L+K F C L +L + L+S Y+ A + P P YP+
Sbjct: 261 DEVASKPDGLSILSKKFKTCNPLTDASELKNHLDSMYANAAQYNKP---------PTYPV 311
Query: 328 REVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF--QLDDDPHGLDGWNWQV 380
+VC ID +L R+F G+ Y GN C+ + + GW WQ
Sbjct: 312 NKVCGGIDGCGFGDDLLGRVFGGLVA---YKGNRSCYVNEPTNQSETSVGWRWQT 363
>gi|449496306|ref|XP_004160099.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 491
Score = 248 bits (634), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 187/316 (59%), Gaps = 17/316 (5%)
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++T YF Q LDHF++ TF QRY+IN +W G PI Y G E I+
Sbjct: 50 FKTFYFNQTLDHFNYRPESYTTFPQRYIINFKYWGGAYSSAPILAYLGPEAPIDSAMNVI 109
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ D A +F A+LV+ EHRYYG+S+P+GS + A +NA+TL Y + QA+AD+A + ++
Sbjct: 110 GFMTDNAVKFNALLVYIEHRYYGKSIPFGSRKEALRNASTLGYFNSAQAIADYAAILIHV 169
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K+ SA+ SPV++ GGSYGGMLA W RLKYPH+A+GALASSAPIL F DI P +Y IV
Sbjct: 170 KKEFSAKYSPVIVIGGSYGGMLATWFRLKYPHVALGALASSAPILYFNDITPENGYYVIV 229
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
+ DF+ S +C+ +I+ESW E+ +V + NGL L K F C L S+ L ++L +
Sbjct: 230 TKDFREVSQTCYESIRESWSEIETVASQSNGLSVLDKEFKTCSPLRSSTQLENYL--WFM 287
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQ 365
Y + Y +PS YP+ +C ID + L +I GV + Y G + C+
Sbjct: 288 YASAAQYNHPSR-------YPVNRICDAIDQT-YSNGTLGKIAAGV---FAYRGELSCYI 336
Query: 366 LD--DDPHGLDGWNWQ 379
+ + GW WQ
Sbjct: 337 NEPINTTETTVGWQWQ 352
>gi|224109998|ref|XP_002333169.1| predicted protein [Populus trichocarpa]
gi|222835052|gb|EEE73501.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 145/353 (41%), Positives = 202/353 (57%), Gaps = 17/353 (4%)
Query: 32 AAQPSKFRRAPRFVGKLPHL-TEPPQRQQRQQQQQYRYETRYFEQRLDHFSF--ADLPTF 88
A + PR P + + P + + +ET ++ Q LDHF++ F
Sbjct: 19 TATAKRLNTIPRLSPIGPRVWRDQPDKTTLGEFDGEDFETFFYNQTLDHFNYRPESYDKF 78
Query: 89 SQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYG 148
QRYLIN+ +W G N PI ++ G E I+ GF+ D A +F ++LVF EHRYYG
Sbjct: 79 PQRYLINSKYWGGANVSAPILVFLGAEEPIDEDLAAVGFLVDNAVQFNSLLVFIEHRYYG 138
Query: 149 ESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLA 208
+S+P+GS E A ++A+ L Y + QA+AD+A I ++K+ L A+ SPV++ GGSYGGMLA
Sbjct: 139 KSIPFGSREEALKDASKLGYFNSAQAIADYAAIIIHIKETLRAQYSPVIVIGGSYGGMLA 198
Query: 209 AWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELV 268
+W RLKYPHIA+GALASSAPIL F+DI P + +Y+IV+ DF+ S +C+ TIK SW E+
Sbjct: 199 SWFRLKYPHIALGALASSAPILYFDDITPQDGYYSIVTKDFREASETCYQTIKTSWSEID 258
Query: 269 SVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIR 328
+ K +GL L+K F C L +L D L++ Y+ A + P P YP+
Sbjct: 259 ELASKPDGLSMLSKKFKTCNPLTDASELKDHLDTMYASAAQYNRP---------PTYPVN 309
Query: 329 EVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF-QLDDDPHGLD-GWNWQ 379
EVCK ID L RIF G+ Y GN+ C+ DP GW WQ
Sbjct: 310 EVCKGIDGGGFGDDTLSRIFGGLVA---YNGNLSCYVNAHTDPSETTVGWQWQ 359
>gi|440799092|gb|ELR20153.1| dipeptidyl-peptidase family protein [Acanthamoeba castellanii str.
Neff]
Length = 498
Score = 248 bits (632), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 191/317 (60%), Gaps = 20/317 (6%)
Query: 71 RYFEQRLDHFSFADLP-TFSQRYLINTDHWVGPN-----RLGPIFLYCGNEGDIEWFAVN 124
+F+Q +DHF+ P TF QRYL +++W N R GPIF Y GNEGDI F N
Sbjct: 34 NWFDQTIDHFNIETQPATFRQRYLTFSNYWSSANHGGELRRGPIFFYTGNEGDITAFWDN 93
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
SGFV+++A +GA++VF EHRYYG++ P+GS + + YL+ EQALAD+A I +
Sbjct: 94 SGFVFELAKSYGALVVFGEHRYYGKTYPFGSGGPDSYSKEHIGYLSVEQALADYATLIEH 153
Query: 185 LKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP------ 237
LK L A SPV+ FGGSYGGML+AW R+KYP + GALA+SAPIL ++
Sbjct: 154 LKSTLPGASHSPVIAFGGSYGGMLSAWFRMKYPQVVDGALAASAPILWSTNVSSATTGPD 213
Query: 238 ---PETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL--NS 292
P ++ V++DF+ C +++++ +++ + Q +GL + K F LC+ + +
Sbjct: 214 SKRPPGYFETVTNDFRAADERCPGLVQQAFAKMLQLAQTPSGLAAIAKQFSLCKNVLPHE 273
Query: 293 TEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVS 352
E L W+ +A+ +AM+DYPYP+ FM PLP YPI+ C + N D +L + +
Sbjct: 274 VEHLILWVVNAFGNMAMMDYPYPTGFMAPLPAYPIKVACHLMLNNTD--DVLRGLAQAAG 331
Query: 353 VYYNYTGNVDCFQLDDD 369
+YYN + + CF + D+
Sbjct: 332 LYYNSSSPLQCFNIWDE 348
>gi|61651780|ref|NP_001013333.1| dipeptidyl peptidase 2 precursor [Danio rerio]
gi|60416102|gb|AAH90719.1| Dipeptidyl-peptidase 7 [Danio rerio]
Length = 500
Score = 247 bits (631), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 125/304 (41%), Positives = 193/304 (63%), Gaps = 14/304 (4%)
Query: 68 YETRYFEQRLDHFSFADLP--TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ +YF+Q LDHF++ L T+ QRYLI +W GPIF Y GNEGDI FA NS
Sbjct: 50 FKEKYFKQILDHFNYNSLGNGTYDQRYLITDKYW--KKGYGPIFFYTGNEGDISEFARNS 107
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ ++A GA+L+F EHRYYG+S+P+G + LT EQALAD+AV IT L
Sbjct: 108 GFMVELAAAQGALLIFAEHRYYGKSLPFGKNSF---KIPEVGLLTVEQALADYAVMITEL 164
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K+ L + PV++FGGSYGGML+ +MR++YP+I GALA+SAPIL + P F+ V
Sbjct: 165 KEELGGQTCPVIVFGGSYGGMLSVYMRIRYPNIVAGALAASAPILSTAGLGDPRQFFQDV 224
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
++DF++ + +C N ++ ++ +L ++ Q+++ + + F LC+ +S +D L +L +
Sbjct: 225 TADFEKFNPACRNAVQGAFQKLNTLAQQKD-YIRIQSAFSLCKTPSSPKDIHQLNGFLRN 283
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVD 362
A++ +AM+DYPY + FM +P +P++ C+ + N D S L + V + YN TG +
Sbjct: 284 AFTMMAMLDYPYSTHFMGSMPAFPVKVACEIMLNGTDLMSALR---DTVGIVYNNTGELT 340
Query: 363 CFQL 366
C+ L
Sbjct: 341 CYDL 344
>gi|221044030|dbj|BAH13692.1| unnamed protein product [Homo sapiens]
Length = 391
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 121/259 (46%), Positives = 170/259 (65%), Gaps = 9/259 (3%)
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+WD+A AMLVF EHRYYGES+P+G +++++ L++LT+EQALADFA I +LK+
Sbjct: 1 MWDVAEELKAMLVFAEHRYYGESLPFGDN--SFKDSRHLNFLTSEQALADFAELIKHLKR 58
Query: 188 NL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ AE PV+ GGSYGGMLAAW R+KYPH+ +GALA+SAPI QFED+VP F IV+
Sbjct: 59 TIPGAENQPVIAIGGSYGGMLAAWFRMKYPHMVVGALAASAPIWQFEDLVPCGVFMKIVT 118
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS--TEDLADWLESAY 304
+DF++ C +I SW + + +GL LT HLC L S + L DW+ +
Sbjct: 119 TDFRKSGPHCSESIHRSWDAINRLSNTGSGLQWLTGALHLCSPLTSQDIQHLKDWISETW 178
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDC 363
LAMVDYPY S+F+ PLP +PI+ VC+ + N + S +L+ IF+ ++VYYNY+G V C
Sbjct: 179 VNLAMVDYPYASNFLQPLPAWPIKVVCQYLKNPNVSDSLLLQNIFQALNVYYNYSGQVKC 238
Query: 364 FQLDDDPH---GLDGWNWQ 379
+ + G GW++Q
Sbjct: 239 LNISETATSSLGTLGWSYQ 257
>gi|224058951|ref|XP_002299660.1| predicted protein [Populus trichocarpa]
gi|222846918|gb|EEE84465.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 147/354 (41%), Positives = 202/354 (57%), Gaps = 17/354 (4%)
Query: 32 AAQPSKFRRAPRFVGKLPHL-TEPPQRQQRQQQQQYRYETRYFEQRLDHFSF--ADLPTF 88
A + PR P + + P + + +ET + Q LDHF++ F
Sbjct: 19 TATAKRLNTIPRLSPIGPRVWRDQPDKTTLGEFDGEDFETFFHNQTLDHFNYRPESYDKF 78
Query: 89 SQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYG 148
QRYLIN+ +W G N PI +Y G E I+ GF+ D A +F ++LVF EHRYYG
Sbjct: 79 PQRYLINSKYWGGANVSAPILVYLGAEEPIDEDLAAVGFLVDNAVQFNSLLVFIEHRYYG 138
Query: 149 ESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLA 208
+S+P+GS E A ++A+ L Y + QA+AD+A I ++K+ L A+ SPV++ GGSYGGMLA
Sbjct: 139 KSIPFGSREEALKDASKLGYFNSAQAIADYAAIIIHIKETLRAQYSPVIVIGGSYGGMLA 198
Query: 209 AWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELV 268
+W RLKYPHIA+GALASSAPIL F+DI P + +Y+IV+ DF+ S +C+ TIK SW E+
Sbjct: 199 SWFRLKYPHIALGALASSAPILYFDDITPQDGYYSIVTKDFREASETCYQTIKTSWSEID 258
Query: 269 SVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIR 328
+ K +GL L+K F C L +L D L++ Y+ A + P P YP+
Sbjct: 259 ELASKPDGLSMLSKKFKTCTPLADASELKDHLDTMYASAAQYNRP---------PTYPVN 309
Query: 329 EVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF-QLDDDPHGLD-GWNWQV 380
EVCK ID IL RIF G+ Y GN+ C+ DP GW WQ
Sbjct: 310 EVCKGIDGGGFGDDILSRIFGGLVA---YKGNLSCYVNAHTDPSETTVGWRWQT 360
>gi|66267347|gb|AAH95721.1| Dpp7 protein [Danio rerio]
Length = 487
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 129/330 (39%), Positives = 201/330 (60%), Gaps = 26/330 (7%)
Query: 68 YETRYFEQRLDHFSFADLP--TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ +YF+Q LDHF++ L T+ QRYLI +W GPIF Y GNEGDI FA NS
Sbjct: 41 FKEKYFKQILDHFNYNSLGNGTYDQRYLITDKYW--KKGYGPIFFYTGNEGDISEFARNS 98
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ ++A GA+L+F EHRYYG+S+P+G + LT EQALAD+AV IT L
Sbjct: 99 GFMVELAAAQGALLIFAEHRYYGKSLPFGKNSF---KIPEVGLLTVEQALADYAVMITEL 155
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K+ L + PV++FGGSYGGML+ +MR++YP+I GALA+SAPIL + P F+ V
Sbjct: 156 KEELGGQTCPVIVFGGSYGGMLSVYMRIRYPNIVAGALAASAPILSTAGLGDPRQFFQDV 215
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
++DF++ + +C + ++ ++ +L ++ Q+++ + + F LC+ +S +D L +L +
Sbjct: 216 TADFEKFNPACRDAVQGAFQKLNTLAQQKD-YIRIQSAFSLCKTPSSPKDIHQLNGFLRN 274
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVD 362
A++ +AM+DYPY + FM +P +P++ C+ + N D S L + V + YN TG +
Sbjct: 275 AFTMMAMLDYPYSTHFMGSMPAFPVKVACEIMLNGTDLMSALR---DTVGIVYNNTGELT 331
Query: 363 CFQLDD------DPHGLD------GWNWQV 380
C+ L DP G W++Q+
Sbjct: 332 CYDLYSLYVECADPTGCGLGFNSYAWDYQI 361
>gi|346467697|gb|AEO33693.1| hypothetical protein [Amblyomma maculatum]
Length = 348
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 129/280 (46%), Positives = 176/280 (62%), Gaps = 8/280 (2%)
Query: 106 GPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT 165
GPIF Y GNEG I FA N+G +WD AP F A+L+F EHRYYG+SMPYG + ++++ +
Sbjct: 5 GPIFFYTGNEGSITTFANNTGLMWDWAPEFRALLIFAEHRYYGKSMPYG--DRSFESPSH 62
Query: 166 LSYLTAEQALADFAVFITNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
L YLT EQ LAD+A + ++ L A S VV FGGSYGGMLAAW R+KYPH+ ALA
Sbjct: 63 LGYLTVEQTLADYADLLLYIRSTLPGAGNSQVVSFGGSYGGMLAAWFRMKYPHVTAAALA 122
Query: 225 SSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF 284
+SAPILQF+ I P +V+ F+ ES C I+ SW + + E G ++ + F
Sbjct: 123 ASAPILQFQGITPCGALNAVVTRAFRMESELCAEAIRSSWELIENRSSTEEGAADIAERF 182
Query: 285 HLCRELNSTE--DLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS 342
H+C L DWL Y+ LAMV+YPYP++F+ P+PG+P+RE CK ++ D++
Sbjct: 183 HICGNYTHKNYTKLRDWLTDQYTNLAMVNYPYPNEFLNPVPGHPVREACKFLNRTFDSSD 242
Query: 343 I-LERIFEGVSVYYNYTGNVDCFQLDDDPHGL--DGWNWQ 379
+ LE I++ SV+ NYTG C L L DGW++Q
Sbjct: 243 VLLESIYQAFSVFTNYTGQTQCNDLCKGSGTLDADGWDYQ 282
>gi|391347711|ref|XP_003748099.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Metaseiulus
occidentalis]
Length = 468
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 126/313 (40%), Positives = 186/313 (59%), Gaps = 9/313 (2%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
+YFEQR+DHF F TF Q+YL++ + GPIF YCG E ++E A +G ++
Sbjct: 29 QYFEQRVDHFGFHKRDTFRQKYLMSDKTFQAG---GPIFFYCGGEMNVELHARQTGLMFT 85
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN-L 189
A F A++VF EHRYYGES+PYG + ++ + YL+ EQALAD+A +++LK N
Sbjct: 86 WAREFRALVVFAEHRYYGESLPYG--DASFYGSERRGYLSTEQALADYAAILSHLKANHT 143
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
A S +V++G Y GMLA WMR+KYPHIA A ASSAPI + VP F V+S F
Sbjct: 144 GATKSEIVVWGAGYSGMLAVWMRVKYPHIAKLAYASSAPIGFYSGEVPCGKFLKAVTSVF 203
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL--NSTEDLADWLESAYSYL 307
+ ES +C +I+ W L ++ +G+ L F+ C+ + ++ +L WL+ ++ +
Sbjct: 204 RSESETCVQSIRRIWNVLQTMATSRDGMAHLADAFNTCQPVRGDNISNLFRWLKESFRTI 263
Query: 308 AMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILER-IFEGVSVYYNYTGNVDCFQL 366
+M+D+PY +D LP YP++E C K+ N + L R + VSV YN+TG V C+ L
Sbjct: 264 SMLDFPYETDLFGKLPAYPVKEACSKLWNHQQSDRDLMRAAHKAVSVLYNHTGEVVCYSL 323
Query: 367 DDDPHGLDGWNWQ 379
D+ GW +Q
Sbjct: 324 DNTLRNNAGWGFQ 336
>gi|268619142|gb|ACZ13347.1| prolyl carboxy peptidase-like protein [Bursaphelenchus xylophilus]
Length = 401
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 140/373 (37%), Positives = 216/373 (57%), Gaps = 31/373 (8%)
Query: 25 ILSPLSL-AAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA 83
+L+PL + +P F R P +T Q + + Y + + + +DHF+FA
Sbjct: 7 LLAPLLVNGLRPGAFLRDP--------VTLIDQNRIPNGEPSYSWSEEHLDVPIDHFAFA 58
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPE 143
D F RY IN ++ GPIF Y GNEG +E FA N+GF+WDIAP + A +VF E
Sbjct: 59 DTREFPLRYFINLTYY---EPGGPIFFYTGNEGKLEVFAENTGFIWDIAPEYKAAIVFTE 115
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA--SPVVLFGG 201
HR+YG S+P+G E +Y++ L YLT+EQALADFA IT LK +A SPV++FGG
Sbjct: 116 HRFYGNSLPFG--EDSYKHIKNLGYLTSEQALADFADVITYLKTQRIPQATHSPVIVFGG 173
Query: 202 SYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI-VPPETFYNIVSSDFKRESASCFNTI 260
SYGGMLAAW R+KYPH+A GA+A+SAP+L F++ V + + NI + FK S +
Sbjct: 174 SYGGMLAAWFRIKYPHLADGAIAASAPLLWFQNTGVRQDGYANITTRTFKL-SGCDLTHL 232
Query: 261 KESWGELVSVGQKENGLLELTKTFHLCR--ELNSTED---LADWLESAYSYLAMVDYPYP 315
+ S+ + ++ + E+G L K L + E + D L + + M+DYPYP
Sbjct: 233 RASFDAMRTLAKTEDGRDHLNKVLKLGKSSEFEHSHDYNILVNIFADVMGNVVMIDYPYP 292
Query: 316 SDFMMPLPGYPIREVCKKI--DNAPDATSILERIFEGVSVYYNYTGNVDCFQL------D 367
++F +P +P++++C+K D D ++ +++ ++++YN +G ++ F L +
Sbjct: 293 TNFFAQVPAWPVKKMCEKFNGDIPDDPKETVKPLYDILNIFYNTSGKLEEFCLRGPDCGN 352
Query: 368 DDPHGLDGWNWQV 380
D +DGWNWQ+
Sbjct: 353 DQLGAMDGWNWQI 365
>gi|241753992|ref|XP_002401170.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215508367|gb|EEC17821.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 493
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 124/303 (40%), Positives = 186/303 (61%), Gaps = 8/303 (2%)
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
Q Y+ R F ++DHF+F TF RY + +W GPIF Y GNE IE F +
Sbjct: 25 QVTYKMRTFRTKIDHFTFHSSDTFVMRYAVADQYWDFDG--GPIFFYTGNENAIENFINH 82
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+G +W+ AP F AMLVF EHR+YGESMP+G+ + ++ L YL+ +Q LAD+A I +
Sbjct: 83 TGLMWEWAPEFKAMLVFAEHRFYGESMPFGNRSL--ESPHHLGYLSTDQVLADYADLIIH 140
Query: 185 LKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
LK+++ A SPV+ FGGSYGGML+AW+R++YPH+ +LASSAP+ F +VP +
Sbjct: 141 LKESVRGASESPVISFGGSYGGMLSAWIRIRYPHLVSASLASSAPVHMFPGLVPCSSLNR 200
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE--DLADWLE 301
+++ F+RES C TI++SW L + G + FHLC+ L + D+L
Sbjct: 201 VLTETFRRESPVCSATIRKSWPILEAKFSTVEGRKSIQDKFHLCQSLQEEDYVTFRDFLH 260
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNA-PDATSILERIFEGVSVYYNYTGN 360
YS +A+V+Y PS F+ PLPGYP++E CK + + + +I++ + + V++++N TG
Sbjct: 261 DVYSNMALVNYADPSVFLTPLPGYPVKESCKFLTRSFVNDEAIVDAVSQVVNIFFNTTGT 320
Query: 361 VDC 363
C
Sbjct: 321 RQC 323
>gi|324507364|gb|ADY43126.1| Prolyl carboxy peptidase like protein 5 [Ascaris suum]
Length = 534
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 198/336 (58%), Gaps = 26/336 (7%)
Query: 65 QYRYETRYFEQR-LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
Y++ +F+ +DHFSFAD TF RYLINTD+++ + GPIF Y GNEG+IE FA
Sbjct: 42 NYKWTEEWFDNMPIDHFSFADNRTFHLRYLINTDYFI---KYGPIFFYTGNEGNIEGFAS 98
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
N+GF+WDIA FGA +VF EHRYYG++ P+G+ +Y + + L YL++EQALAD+A I
Sbjct: 99 NTGFMWDIAAEFGAAIVFAEHRYYGKTHPFGNE--SYASVSNLGYLSSEQALADYAHLIQ 156
Query: 184 NLKQNLSAEA--SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF-EDIVPPET 240
L+ A S V+ FGGSYGGMLAAW+R+KYPH+ GA+A+SAP+ F + VP +
Sbjct: 157 YLRNERLKNAINSTVIAFGGSYGGMLAAWIRIKYPHLVEGAIAASAPVFWFPQTNVPEDI 216
Query: 241 FYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRE--LNSTED--- 295
F NIV F + I +W + + E G L F L + L +ED
Sbjct: 217 FDNIVKRSFVNSGCKA-DAIIAAWSAIEELANSEQGRTYLNSLFKLEEKSFLQKSEDVNF 275
Query: 296 LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDAT--SILERIFEGVSV 353
L ++ ++ +AMV+YPYPS+F+ PLPG+P++ C ++ T E ++ V++
Sbjct: 276 LKAFIRESFESMAMVNYPYPSEFLAPLPGWPVKVACGFFNSTEMKTREHHAESLYSMVNL 335
Query: 354 YYNYTGNVD--CFQLD---DDPHGL----DGWNWQV 380
YYN+TG C D D +G GW WQ
Sbjct: 336 YYNFTGEKKTLCVNPDVCSDSAYGALGDPLGWPWQA 371
>gi|297812433|ref|XP_002874100.1| hypothetical protein ARALYDRAFT_910290 [Arabidopsis lyrata subsp.
lyrata]
gi|297319937|gb|EFH50359.1| hypothetical protein ARALYDRAFT_910290 [Arabidopsis lyrata subsp.
lyrata]
Length = 495
Score = 243 bits (621), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 143/373 (38%), Positives = 211/373 (56%), Gaps = 29/373 (7%)
Query: 16 PVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRY-----ET 70
PV +I+++ ++ +++ SK R F KL + + ++ Y++ E
Sbjct: 4 PVPSIILLLLVIFSTISCTHSKEARLSVFPKKLRYTFD-------GEKLHYKFADLGIEI 56
Query: 71 RYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+FEQ LDHF++ F QRY +N+ +W G PI Y G E ++ GF+
Sbjct: 57 FFFEQTLDHFTYTPGSYKKFRQRYAVNSKYWEGGKTNAPILAYLGAESSLDSELSVLGFL 116
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
D AP F A++V+ EHR+YGE+MP+GS E +NA TL YL A QALAD+A + ++K+
Sbjct: 117 KDNAPHFKALMVYIEHRFYGETMPFGSAEETLKNAKTLGYLNAAQALADYAAILLHIKET 176
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
SA+ SPV++ GGSYGGMLAAW +LKYPHIA+GALASSAP+L FED +P ++ IV+
Sbjct: 177 YSAKHSPVIVIGGSYGGMLAAWFKLKYPHIALGALASSAPLLYFEDTLPKHGYFYIVTKV 236
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLA 308
FK S C N I++SW E+ + K NGL L+K F LC LN T +L +L + Y+ A
Sbjct: 237 FKETSQKCHNKIRKSWDEIDRIAAKPNGLSILSKKFKLCNPLNDTIELKSYLSNIYAGTA 296
Query: 309 MVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS--ILERIFEGVSVYYNYTGNVDCFQL 366
+ Y + +C+ I+ +P T +L++IF GV GN+ C+ +
Sbjct: 297 QYNN----------NPYSVASLCEAINTSPPNTKSDLLDQIFAGVVA---SGGNISCYGM 343
Query: 367 DDDPHGLDGWNWQ 379
D + W WQ
Sbjct: 344 DQITNDARAWTWQ 356
>gi|390335012|ref|XP_797434.3| PREDICTED: dipeptidyl peptidase 2-like [Strongylocentrotus
purpuratus]
Length = 487
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 127/309 (41%), Positives = 190/309 (61%), Gaps = 15/309 (4%)
Query: 65 QYRYETRYFEQRLDHFSFAD--LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
+Y Y+ ++F+Q +DHF+ TF QRY + D+W + GPIF Y GNEG IE F
Sbjct: 29 RYPYKEKFFDQFIDHFNSESHGKQTFRQRYFVTDDYWQKGS--GPIFFYTGNEGAIESFF 86
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
N+G+++DIAP FGA+++F EHRYYG+S+P G+ N L LT EQALAD+A I
Sbjct: 87 DNTGYIFDIAPEFGALVIFAEHRYYGKSLPLGNQSFTPAN---LGLLTVEQALADYATLI 143
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
T+LK+ + SP+V+FGGSYGGML+A+MR+KYPH+ GALA+SAP+ + F+
Sbjct: 144 TSLKEEPGLQDSPLVVFGGSYGGMLSAYMRMKYPHLVAGALAASAPVYSTANESSQSYFF 203
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSV-GQKENGLLELTKTFHLCRELNSTED---LAD 298
V+ DF +C + ++ + L + GQ +GL +++ FHLC+ L S D L
Sbjct: 204 QTVTKDFYDVDTNCPDLVRAGFATLDQLAGQGTSGLNKISTAFHLCKSLVSKADYYHLLG 263
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYT 358
W+ ++++Y AM+DYPY + FM LP P+ C + N A++ LE + + +YYN +
Sbjct: 264 WIRNSFTYQAMLDYPYSASFMGSLPANPVNVSCHMLLN---ASTPLEGLAQASGLYYNTS 320
Query: 359 GN-VDCFQL 366
+ CF +
Sbjct: 321 STPLQCFDI 329
>gi|402591692|gb|EJW85621.1| hypothetical protein WUBG_03470, partial [Wuchereria bancrofti]
Length = 478
Score = 241 bits (615), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 134/325 (41%), Positives = 195/325 (60%), Gaps = 29/325 (8%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
+DHFSF D F RYLINT+H+V GPIF Y GNEG++E FA N+G +WD+AP F
Sbjct: 17 IDHFSFHDNRVFRLRYLINTEHFVSN---GPIFFYTGNEGNVELFAQNTGLMWDLAPEFN 73
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN--LSAEAS 194
A+++F EHR+YG+S P+G+ +Y L YL++EQAL DFA+ I +LK L A+ S
Sbjct: 74 AVIIFAEHRFYGKSQPFGNK--SYATIRNLGYLSSEQALGDFALLIYHLKNKRLLVAQNS 131
Query: 195 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI---VPPETFYNIVSSDFKR 251
V+ FGGSYGGMLAAWMR+KYPH+ G++ASSAP+ F D+ VP + + IV F
Sbjct: 132 SVIAFGGSYGGMLAAWMRIKYPHLVEGSIASSAPVFWFIDMSRSVPQDAYNRIVKRSFL- 190
Query: 252 ESASCF-NTIKESWGELVSVGQKENGLLELTKTFHLCRE--LNSTED---LADWLESAYS 305
S+ C I + W L ++ G L FHL ++ L + D L ++LE +
Sbjct: 191 -SSGCIEKNILDGWIALKNLSLTTTGRAYLNGLFHLDKKSYLKISTDWIMLKEYLEDIFG 249
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCK--KIDNAPDATSILERIFEGVSVYYNYTGNVDC 363
+AMV+YPYP++++ LPG+P++ C+ + A + + ++ +++YYNYTG
Sbjct: 250 SMAMVNYPYPTNYLAQLPGWPVKVACQFFNSNKAKSDEELAQSMYGIMNLYYNYTGQKKT 309
Query: 364 FQL-----DDDPHGLD----GWNWQ 379
F + +D +G GW WQ
Sbjct: 310 FCIKPNVCNDSAYGALGDPFGWTWQ 334
>gi|256070800|ref|XP_002571730.1| lysosomal Pro-Xaa carboxypeptidase (S28 family) [Schistosoma
mansoni]
gi|360043145|emb|CCD78557.1| lysosomal Pro-Xaa carboxypeptidase (S28 family) [Schistosoma
mansoni]
Length = 498
Score = 240 bits (612), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 128/325 (39%), Positives = 187/325 (57%), Gaps = 12/325 (3%)
Query: 62 QQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWF 121
+ + YET+YF R+DHFSF + F RYLIN + + GPI Y GNEG IE F
Sbjct: 28 KNPDFTYETKYFWTRVDHFSFVNDEKFLIRYLINNESFTPG---GPILFYTGNEGPIETF 84
Query: 122 AVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF 181
A NSGF+W ++ A +VF EHRYYG S+P+G+ ++++ YLTAEQ LAD+ +
Sbjct: 85 AENSGFIWKLSRELNASVVFAEHRYYGTSLPFGNN--SFKDRRHFGYLTAEQTLADYVLL 142
Query: 182 ITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET 240
I LK N S A SPV+ FGGSYGGML+AW+R KYP+ GA+ASSAP+ F +
Sbjct: 143 INQLKANYSCFASSPVIAFGGSYGGMLSAWIRQKYPNQIAGAIASSAPVWLFPGLSDCNG 202
Query: 241 FYNIVSSDF-KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
F +++ F K +C I+ SW +V +GQ +G LT F++C L +++ D+
Sbjct: 203 FSMTITNSFLKYGGENCVKNIQLSWSNIVDIGQSVDGKELLTHMFNICTPLTDVQNIIDY 262
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERI---FEGVSVYYN 356
L ++MV+YPYP+ ++ LP +P++ +C + ++ RI + V N
Sbjct: 263 LSDFLGIISMVNYPYPASLILALPEWPVKYLCTNLSEYDPQQPVVTRISLLAKAVLSLTN 322
Query: 357 YTGNVDCFQLDDDPHGLD--GWNWQ 379
YTGN C + + +D GW+ Q
Sbjct: 323 YTGNQTCLDISMNLPNVDVNGWDLQ 347
>gi|156403949|ref|XP_001640170.1| predicted protein [Nematostella vectensis]
gi|156227303|gb|EDO48107.1| predicted protein [Nematostella vectensis]
Length = 446
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 130/304 (42%), Positives = 181/304 (59%), Gaps = 14/304 (4%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+++T FEQ +DHF+F TF QRYL +W G GPIF Y GNEG I F NSG
Sbjct: 2 QFKTGTFEQTVDHFNFIQSGTFKQRYLYTEKYWDGK---GPIFFYSGNEGGITGFWENSG 58
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
FV++ A F A+++F EHRYYGES+P+G +N + YL+ EQALADFA I LK
Sbjct: 59 FVFEAAKNFSALVIFGEHRYYGESLPFGQDSFKIEN---IGYLSIEQALADFATLIPALK 115
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ AE PVV FGGSYGGML+A++R KYP++ ALA+SAPI D+ + F+ V+
Sbjct: 116 KQFKAEEKPVVSFGGSYGGMLSAYLRFKYPNVIQAALAASAPIYFIADLSIRDFFFPAVT 175
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKE-NGLLELTKTFHLCRELNSTED---LADWLES 302
DFK C + ++ + EL ++ ++ GL ++K F LC+ L S + L W+ +
Sbjct: 176 RDFKNADPKCPDLVRAGFIELDNLKKEGLKGLDAISKAFKLCKPLKSADQINHLIGWIRN 235
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYT-GNV 361
A++ +AM DYPY +DF+ PLP P+ CK + A D S L + + YN T G +
Sbjct: 236 AFTIIAMCDYPYATDFLAPLPANPVNYACKLLATASDRLSGLA---DAAGLAYNGTSGTL 292
Query: 362 DCFQ 365
CF
Sbjct: 293 KCFD 296
>gi|297734876|emb|CBI17110.3| unnamed protein product [Vitis vinifera]
Length = 405
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 119/255 (46%), Positives = 162/255 (63%), Gaps = 13/255 (5%)
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ D A RF A+LV+ EHRY G+S+P+GS E A +NA+ Y + QA+AD+A + +
Sbjct: 23 GFLTDNAHRFKALLVYIEHRYCGKSIPFGSREEALKNASIRGYFNSAQAIADYAEVLIYI 82
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K+ L AE SPV++ GGSYGGMLA+W RLKYPH+A+G LASSAPIL FEDI P +Y+IV
Sbjct: 83 KKKLLAENSPVIVVGGSYGGMLASWFRLKYPHVALGVLASSAPILYFEDITPQNGYYSIV 142
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
+ DF+ S SC+ TI+ESW E+ V + NG+ L+K F C LN++++L D+L++ Y
Sbjct: 143 TKDFREASESCYKTIRESWSEIDRVASEPNGISILSKKFRTCDRLNNSDELKDYLDTIYC 202
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQ 365
A + P P YP+ C ID AP+ + IL RIF GV Y GN C+
Sbjct: 203 TAAQYNDP---------PMYPVTMACSGIDGAPEGSDILSRIFAGVVA---YGGNSSCYT 250
Query: 366 LDDDP-HGLDGWNWQ 379
+P +GW WQ
Sbjct: 251 TSHNPTETSEGWAWQ 265
>gi|30688698|ref|NP_197677.2| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|332005706|gb|AED93089.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 439
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 149/368 (40%), Positives = 203/368 (55%), Gaps = 20/368 (5%)
Query: 19 TIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLD 78
TI+I+ I S S P + R L P Q+ + + YF Q LD
Sbjct: 6 TILILFIFSTSSSYLIPLAHSKIARLGISSKTLKNEPD-GSTQKVDESNLKMYYFNQTLD 64
Query: 79 HFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
HF+F TF QRY I++ HW G PI + G E ++ GF+ D PR
Sbjct: 65 HFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEESSLDSDLAAIGFLRDNGPRLN 124
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPV 196
A+LV+ EHRYYGE+MP+GS E A +NA+TL YL A QALAD+A + ++K+ S SP+
Sbjct: 125 ALLVYIEHRYYGETMPFGSAEEALKNASTLGYLNAAQALADYAAILLHVKEKYSTNHSPI 184
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASC 256
++ GGSYGGMLAAW RLKYPHIA+GALASSAP+L FED P +Y IV+ FK S C
Sbjct: 185 IVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDTRPKFGYYYIVTKVFKEASERC 244
Query: 257 FNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPS 316
+NTI+ SW E+ V K NGL L+K F C LN + D+ D+L++ Y+ +
Sbjct: 245 YNTIRNSWIEIDRVAGKPNGLSILSKQFKTCAPLNGSFDIKDFLDTIYAEAVQYNRG--- 301
Query: 317 DFMMPLPGYPIREVCKKID-NAPDAT-SILERIFEGVSVYYNYTGNVDCF--QLDDDPHG 372
P + + +VC I+ N P+ ++L+RIF GV GN C+ ++ P
Sbjct: 302 ------PNFWVAKVCNAINANPPNRRYNLLDRIFAGVVA---LVGNRTCYDTKMFAQPTN 352
Query: 373 LD-GWNWQ 379
+ W WQ
Sbjct: 353 NNIAWRWQ 360
>gi|21537161|gb|AAM61502.1| prolylcarboxypeptidase-like protein [Arabidopsis thaliana]
Length = 502
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 149/368 (40%), Positives = 203/368 (55%), Gaps = 20/368 (5%)
Query: 19 TIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLD 78
TI+I+ I S S P + R L P Q+ + + YF Q LD
Sbjct: 6 TILILFIFSTSSSYLIPLAHSKIARLGISSKTLKNEPD-GSTQKVDESDLKMYYFNQTLD 64
Query: 79 HFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
HF+F TF QRY I++ HW G PI + G E ++ GF+ D PR
Sbjct: 65 HFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEESSLDSDLAAIGFLRDNGPRLN 124
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPV 196
A+LV+ EHRYYGE+MP+GS E A +NA+TL YL A QALAD+A + ++K+ S SP+
Sbjct: 125 ALLVYIEHRYYGETMPFGSAEEALKNASTLGYLNAAQALADYAAILLHVKEKYSTNHSPI 184
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASC 256
++ GGSYGGMLAAW RLKYPHIA+GALASSAP+L FED P +Y IV+ FK S C
Sbjct: 185 IVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDTRPKFGYYYIVTKVFKEASERC 244
Query: 257 FNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPS 316
+NTI+ SW E+ V K NGL L+K F C LN + D+ D+L++ Y+ +
Sbjct: 245 YNTIRNSWKEIDRVAGKPNGLSILSKQFKTCDPLNGSFDIKDFLDTIYAEAVQYNRG--- 301
Query: 317 DFMMPLPGYPIREVCKKID-NAPDAT-SILERIFEGVSVYYNYTGNVDCF--QLDDDPHG 372
P + + +VC I+ N P+ ++L+RIF GV GN C+ ++ P
Sbjct: 302 ------PNFWVAKVCNAINANPPNRRYNLLDRIFAGVVA---LVGNRTCYDTKMFAQPTN 352
Query: 373 LD-GWNWQ 379
+ W WQ
Sbjct: 353 NNIAWRWQ 360
>gi|30688688|ref|NP_851059.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|10177233|dbj|BAB10607.1| prolylcarboxypeptidase-like protein [Arabidopsis thaliana]
gi|332005705|gb|AED93088.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 502
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 149/368 (40%), Positives = 203/368 (55%), Gaps = 20/368 (5%)
Query: 19 TIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLD 78
TI+I+ I S S P + R L P Q+ + + YF Q LD
Sbjct: 6 TILILFIFSTSSSYLIPLAHSKIARLGISSKTLKNEPD-GSTQKVDESNLKMYYFNQTLD 64
Query: 79 HFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
HF+F TF QRY I++ HW G PI + G E ++ GF+ D PR
Sbjct: 65 HFTFTPESYMTFQQRYAIDSTHWGGAKANAPILAFLGEESSLDSDLAAIGFLRDNGPRLN 124
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPV 196
A+LV+ EHRYYGE+MP+GS E A +NA+TL YL A QALAD+A + ++K+ S SP+
Sbjct: 125 ALLVYIEHRYYGETMPFGSAEEALKNASTLGYLNAAQALADYAAILLHVKEKYSTNHSPI 184
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASC 256
++ GGSYGGMLAAW RLKYPHIA+GALASSAP+L FED P +Y IV+ FK S C
Sbjct: 185 IVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDTRPKFGYYYIVTKVFKEASERC 244
Query: 257 FNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPS 316
+NTI+ SW E+ V K NGL L+K F C LN + D+ D+L++ Y+ +
Sbjct: 245 YNTIRNSWIEIDRVAGKPNGLSILSKQFKTCAPLNGSFDIKDFLDTIYAEAVQYNRG--- 301
Query: 317 DFMMPLPGYPIREVCKKID-NAPDAT-SILERIFEGVSVYYNYTGNVDCF--QLDDDPHG 372
P + + +VC I+ N P+ ++L+RIF GV GN C+ ++ P
Sbjct: 302 ------PNFWVAKVCNAINANPPNRRYNLLDRIFAGVVA---LVGNRTCYDTKMFAQPTN 352
Query: 373 LD-GWNWQ 379
+ W WQ
Sbjct: 353 NNIAWRWQ 360
>gi|348535188|ref|XP_003455083.1| PREDICTED: dipeptidyl peptidase 2-like [Oreochromis niloticus]
Length = 492
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 125/305 (40%), Positives = 189/305 (61%), Gaps = 14/305 (4%)
Query: 67 RYETRYFEQRLDHFSFADLP--TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
R+ ++F Q LDHF+F + TF+QRYLI +W GPIF Y GNEG+I FA+N
Sbjct: 38 RFTEKFFTQTLDHFNFNSMGNGTFNQRYLITDQYW--EKGFGPIFFYTGNEGNIWEFALN 95
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
SGF+ ++A + A+++F EHRYYG+S+P+ E N +S LT EQALAD+A+ IT
Sbjct: 96 SGFITELAAQQRALVIFAEHRYYGKSLPF---EKDSFNIPQVSLLTVEQALADYAIMITE 152
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
LKQ L A PV++FGGSYGGML+ +MR+KYP+I GALA+SAPIL + F+
Sbjct: 153 LKQQLGATDCPVIVFGGSYGGMLSVYMRIKYPNIVAGALAASAPILSTAGLGDSREFFRD 212
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLE 301
V+SDF+ S+ C + ++ ++ +L + Q + + F LC+ L+S +D L +L
Sbjct: 213 VTSDFESVSSDCTDAVRGAFHQLKELAQSQE-YHHIQSAFALCKPLSSAQDIHQLNGFLR 271
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
+A++ +AM+DYPY + F+ +P P++ C+ + D +L + + + YN TG +
Sbjct: 272 NAFTLMAMLDYPYSTHFIGNMPANPVKVACETMLRGSD---LLGNLRDTAGIVYNATGVL 328
Query: 362 DCFQL 366
CF L
Sbjct: 329 TCFDL 333
>gi|449467104|ref|XP_004151265.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like, partial [Cucumis
sativus]
Length = 359
Score = 238 bits (608), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 129/340 (37%), Positives = 199/340 (58%), Gaps = 17/340 (5%)
Query: 46 GKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPN 103
G +P L + Q QQ T Y++Q LDHF++ TF QRY+I+ +W G N
Sbjct: 31 GHIPVLGVQRRAFQSTPQQSDGLATFYYKQPLDHFNYQPQSYVTFDQRYIIDFKYWEGIN 90
Query: 104 RLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA 163
PIF Y G E DI+ GF A ++ AM V+ EHR+YG+S+P+GS E A +N
Sbjct: 91 PKTPIFAYLGAESDIDNDVPYVGFPLRFASQYKAMSVYLEHRFYGKSIPFGSLEKAMKNG 150
Query: 164 TTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGAL 223
+ Y + QALAD+A + ++K+ + + SP+++ G SYGGMLA+W RLKYPHIA+GAL
Sbjct: 151 SIRGYFNSAQALADYAELLLHIKKMFAYDTSPIIVMGASYGGMLASWFRLKYPHIALGAL 210
Query: 224 ASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSV-GQKENGLLELTK 282
ASSAPIL F++I P + +Y+IVS FK S +C +TI+ SWGE+ + G+ GL L+K
Sbjct: 211 ASSAPILYFDNITPQDGYYSIVSKSFKETSKTCHDTIRRSWGEIDRIAGKTRGGLSILSK 270
Query: 283 TFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID-NAPDAT 341
F C +L ++ ++ + ++S ++ A + PY + P+R +C ID A +
Sbjct: 271 QFKTCGKLKTSSEIKNLMDSVFTMAAQYNDPYEN---------PVRGICVAIDEEAKKKS 321
Query: 342 SILERIFEGVSVYYNYTGNVDCFQLD--DDPHGLDGWNWQ 379
++++++ GV Y D ++ +DP L+ + WQ
Sbjct: 322 NVIKQVVAGVIAYLGERPCYDVYEFGYPNDP--LNQYGWQ 359
>gi|449530679|ref|XP_004172321.1| PREDICTED: LOW QUALITY PROTEIN: lysosomal Pro-X
carboxypeptidase-like [Cucumis sativus]
Length = 440
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 129/341 (37%), Positives = 200/341 (58%), Gaps = 17/341 (4%)
Query: 46 GKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPN 103
G +P L + Q QQ T ++Q LDHF++ TF QRY+I+ +W G N
Sbjct: 31 GHIPVLGVQRRAFQSTPQQSDGLATFXYKQPLDHFNYQPQSYVTFDQRYIIDFKYWEGIN 90
Query: 104 RLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA 163
PIF Y G E DI+ GF A ++ AM V+ EHR+YG+S+P+GS E A +N
Sbjct: 91 PKTPIFAYLGAESDIDNDVPYVGFPLRFASQYKAMSVYLEHRFYGKSIPFGSLEKAMKNG 150
Query: 164 TTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGAL 223
+ Y + QALAD+A + ++K+ + + SP+++ G SYGGMLA+W RLKYPHIA+GAL
Sbjct: 151 SIRGYFNSAQALADYAELLLHIKKMFAYDTSPIIVMGASYGGMLASWFRLKYPHIALGAL 210
Query: 224 ASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSV-GQKENGLLELTK 282
ASSAPIL F++I P + +Y+IVS FK S +C +TI+ SWGE+ + G+ + GL L+K
Sbjct: 211 ASSAPILYFDNITPQDGYYSIVSKSFKETSKTCHDTIRRSWGEIDRIAGKTQGGLSILSK 270
Query: 283 TFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID-NAPDAT 341
F C +L ++ ++ + ++S ++ A + PY + P+R +C ID A +
Sbjct: 271 QFKTCGKLKTSSEIKNLMDSVFTMAAQYNDPYEN---------PVRGICVAIDEEAKKKS 321
Query: 342 SILERIFEGVSVYYNYTGNVDCFQLD--DDPHGLDGWNWQV 380
++++++ GV Y D ++ +DP L+ + WQV
Sbjct: 322 NVIKQVVAGVIAYLGERPCYDVYEFGYPNDP--LNQYGWQV 360
>gi|156348420|ref|XP_001621842.1| hypothetical protein NEMVEDRAFT_v1g990 [Nematostella vectensis]
gi|156208137|gb|EDO29742.1| predicted protein [Nematostella vectensis]
Length = 283
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 124/281 (44%), Positives = 171/281 (60%), Gaps = 10/281 (3%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
++T FEQ +DHF+F TF QRYL +W G GPIF Y GNEG I F NSGF
Sbjct: 1 FKTGTFEQTVDHFNFIQSGTFKQRYLYTEKYWDGK---GPIFFYSGNEGGITGFWENSGF 57
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
V++ A F A+++F EHRYYGES+P+G +N + YL+ EQALADFA I LK+
Sbjct: 58 VFEAAKNFSALVIFGEHRYYGESLPFGQDSFKIEN---IGYLSIEQALADFATLIPALKK 114
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
AE PVV FGGSYGGML+A++R KYP++ ALA+SAPI D+ + F+ V+
Sbjct: 115 QFKAEEKPVVSFGGSYGGMLSAYLRFKYPNVIQAALAASAPIYFIADLSIRDFFFPAVTR 174
Query: 248 DFKRESASCFNTIKESWGELVSVGQKE-NGLLELTKTFHLCRELNSTED---LADWLESA 303
DFK C + ++ + EL ++ ++ GL ++K F LC+ L S + L W+ +A
Sbjct: 175 DFKNADPKCPDLVRAGFIELDNLKKEGLKGLDAISKAFKLCKPLKSADQINHLIGWIRNA 234
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSIL 344
++ +AM DYPY +DF+ PLP P+ CK + A D S L
Sbjct: 235 FTIIAMCDYPYATDFLAPLPANPVNYACKLLATASDRLSGL 275
>gi|198425804|ref|XP_002127477.1| PREDICTED: similar to Dipeptidyl-peptidase 2 precursor
(Dipeptidyl-peptidase II) (DPP II) (Dipeptidyl
aminopeptidase II) (Quiescent cell proline dipeptidase)
(Dipeptidyl peptidase 7) [Ciona intestinalis]
Length = 494
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 123/303 (40%), Positives = 185/303 (61%), Gaps = 14/303 (4%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y T+YFEQ +DHF+F T+ QRYLI+ +HWV GP+ Y GNEGDI F S
Sbjct: 21 YHTKYFEQFVDHFNFQSNGNATYMQRYLISDEHWVAGK--GPMLFYAGNEGDIVGFKDAS 78
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G + + AP+ GAM+VF EHR+YG S+P+G+ +N + L+ EQA+AD+A + +L
Sbjct: 79 GLLTETAPKLGAMVVFAEHRFYGTSLPFGNDSFIDKN---IGLLSIEQAMADYAYLLKHL 135
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K + +A+ P++ FGGSYGG+LAA+MR+KYP++ GALA+SAPI P F+ V
Sbjct: 136 KSSYNADDIPIIAFGGSYGGILAAYMRIKYPNLITGALAASAPIYWTSGEGNPHGFWKSV 195
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE--DLADWLESA 303
++ F C N +KE + E Q+ + ++K F C ++ S+ L W+ ++
Sbjct: 196 TTIFGHNEG-CVNRVKEGFAETAKYAQQGKYDV-ISKGFKTCSQVKSSSLMHLYGWVRNS 253
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC 363
++ LAM +YPYP++F PLP +P+ C+K+ A A +E + E S+ YN TG+ DC
Sbjct: 254 FTQLAMANYPYPANFFGPLPAFPVNVACEKMLKANTA---IEGMLEATSLLYNGTGDKDC 310
Query: 364 FQL 366
F +
Sbjct: 311 FDI 313
>gi|307213053|gb|EFN88584.1| Lysosomal Pro-X carboxypeptidase [Harpegnathos saltator]
Length = 404
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 118/262 (45%), Positives = 164/262 (62%), Gaps = 9/262 (3%)
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+W+ A +FGA++VF EHRYYGES+P+G+ A + L YLT++Q LAD+ I L
Sbjct: 19 GFLWETAQQFGALVVFAEHRYYGESLPFGNKSFA--DPQHLGYLTSQQVLADYVELIQYL 76
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ + SPV+LFGGSYGGML+AWMR+KYPHI GA+A+SAPILQF IV E F IV
Sbjct: 77 RSKPGYKRSPVILFGGSYGGMLSAWMRMKYPHIVQGAIAASAPILQFTGIVKCEAFARIV 136
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
+SDF+ + +C I++SW + V + G L+ + LC L + E L D+L+
Sbjct: 137 TSDFRASNPTCAKLIRQSWNTITEVTSNDEGKKWLSDNWKLCEPLKTAEHVKTLKDFLQE 196
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP-DATSILERIFEGVSVYYNYTGNV 361
Y+ LAMVDYPY ++F+ PLPG PI C+ + N+ +L + VS+Y NYTG
Sbjct: 197 VYTDLAMVDYPYETNFLTPLPGNPINVFCRHLTNSSLTGKPLLLALHGAVSIYTNYTGKA 256
Query: 362 DCFQLDDDPHGLD---GWNWQV 380
C ++ GLD GW++Q
Sbjct: 257 TCISTENAEPGLDAAQGWDYQA 278
>gi|297812427|ref|XP_002874097.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297319934|gb|EFH50356.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 501
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 138/326 (42%), Positives = 188/326 (57%), Gaps = 19/326 (5%)
Query: 61 QQQQQYRYETRYFEQRLDHFSFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI 118
Q+ + + YF Q LDHF+F TF QRY IN+ HW G PI + G E +
Sbjct: 47 QKIDESDLKMYYFNQTLDHFTFTPKSYMTFQQRYAINSSHWGGAKANAPILAFLGEESSL 106
Query: 119 EWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADF 178
+ GF+ D P A+LV+ EHRYYG++MP+GS E A +NA+TL YL A QALAD+
Sbjct: 107 DSDLSGIGFLRDNGPHLKALLVYIEHRYYGKTMPFGSAEEALKNASTLGYLNAAQALADY 166
Query: 179 AVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP 238
A + ++K+ S + SP+++ GGSYGGMLAAW RLKYPHIA+GALASSAP+L FED P
Sbjct: 167 AAILLHVKEKYSTKHSPIIVIGGSYGGMLAAWFRLKYPHIALGALASSAPLLYFEDTRPK 226
Query: 239 ETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD 298
+Y IV+ FK S C+ I++SW E+ V K NGL L+KTF C LN + ++ D
Sbjct: 227 IGYYYIVTKVFKETSERCYTRIRKSWKEIDRVAVKPNGLSILSKTFKTCAPLNGSFNIKD 286
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID-NAPD-ATSILERIFEGVSVYYN 356
+L++ Y+ V Y P Y + VC I+ N P+ ++L+RIF GV
Sbjct: 287 FLDTIYA--EAVQYNRG-------PTYWVANVCNAINANTPNRKDTLLDRIFAGVVA--- 334
Query: 357 YTGNVDCFQLD---DDPHGLDGWNWQ 379
GN C+ + + W WQ
Sbjct: 335 LIGNRTCYDTNMFSQPTNNHIAWRWQ 360
>gi|297812429|ref|XP_002874098.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297319935|gb|EFH50357.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 427
Score = 233 bits (595), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 128/299 (42%), Positives = 174/299 (58%), Gaps = 20/299 (6%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRY 146
TF QRY I+ HW G PI + G E +E GF+ D AP F A+ V+ EHRY
Sbjct: 2 TFQQRYAIDAKHWAGAKANAPILAFLGLEASLETDLAAFGFLSDNAPHFKALKVYIEHRY 61
Query: 147 YGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGM 206
YG+++P+GS + A +NA+TL YL + QALAD+A + ++K+ SA SP+++ GGSYGGM
Sbjct: 62 YGKTIPFGSAKEAMKNASTLGYLNSAQALADYAAILLHIKEKYSATHSPIIVVGGSYGGM 121
Query: 207 LAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGE 266
LAAW RLKYPHIA+GALASSAP+L FED P +Y I++ FK + C+NTI++SW E
Sbjct: 122 LAAWFRLKYPHIALGALASSAPLLYFEDTRPKFGYYYIITKVFKETNKRCYNTIRKSWEE 181
Query: 267 LVSVGQKENGLLELTKTFHLCRE-LNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGY 325
+ V K NGLL L+K F C L+ + DL D+L++ Y+ V Y G
Sbjct: 182 IDRVASKPNGLLILSKKFKTCASPLSRSFDLKDFLDTVYA--ETVQYN---------DGV 230
Query: 326 PIREVCKKIDNAP--DATSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLD---GWNWQ 379
+ VC I+ P IL+RIF GV TG+ C+ + + W WQ
Sbjct: 231 WVTNVCNAINANPPNRKIDILDRIFAGVVA---LTGSQSCYNTNYSVQVTNNDMAWRWQ 286
>gi|322792044|gb|EFZ16143.1| hypothetical protein SINV_09817 [Solenopsis invicta]
Length = 446
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 131/326 (40%), Positives = 181/326 (55%), Gaps = 30/326 (9%)
Query: 76 RLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRF 135
++DHFSFA TF RYLIN PIF Y GNEG+IE FA N+GF+W+IAP F
Sbjct: 1 KVDHFSFAVQNTFKLRYLINDTFTRKTEDNSPIFFYTGNEGNIEVFAENTGFIWEIAPSF 60
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
A++VF EHRYYGES+PYG+ A + L YLT+EQALAD+ I +LK + SP
Sbjct: 61 DALVVFAEHRYYGESLPYGNRSFA--DPQHLGYLTSEQALADYVDLIHHLKSQPEYKLSP 118
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V++FGGSYGGML+AWMRLKYPH+ G +L +D ++ SD +
Sbjct: 119 VIVFGGSYGGMLSAWMRLKYPHVVQGY------VLSLDDSNTLSCLMRVIRSDNNFGLSR 172
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESAYSYLAMVDY 312
C + + ++ E+V+ + G L+ + LC L + E+ L D+L+ LAMVDY
Sbjct: 173 CESNVSAAYPEVVAHNNEHEGKKWLSDNWKLCEPLKTGENVTTLKDFLQEVLINLAMVDY 232
Query: 313 PYPSDFMMPLPGYPI---------------REVCKKIDNAP-DATSILERIFEGVSVYYN 356
PY ++F+ PLP PI + C+ + NA +L ++ +SVY N
Sbjct: 233 PYEANFLAPLPANPINLQILKSMISIFTIVQAFCQHLTNASLMGKPLLSALYRALSVYTN 292
Query: 357 YTGNVDCFQLDDDPHGLD---GWNWQ 379
YTG C + GLD GW++Q
Sbjct: 293 YTGKASCTSIKSAQSGLDADQGWDYQ 318
>gi|357451981|ref|XP_003596267.1| Lysosomal Pro-X carboxypeptidase [Medicago truncatula]
gi|355485315|gb|AES66518.1| Lysosomal Pro-X carboxypeptidase [Medicago truncatula]
Length = 479
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 135/315 (42%), Positives = 179/315 (56%), Gaps = 33/315 (10%)
Query: 72 YFEQRLDHFSFAD--LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS-GFV 128
++EQ LDHF++ TF QRY+IN ++W G N PIF Y G E DI VN+ GF+
Sbjct: 57 FYEQTLDHFNYLSDSYKTFKQRYIINFNYWGGANSSAPIFAYLGGEDDI----VNTLGFM 112
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
D A F A+LV+ EHRYYG+S+P S +Y YL + QALAD+A + LK +
Sbjct: 113 TDNATSFKALLVYIEHRYYGKSVP--SFNASY------GYLNSAQALADYAEVLLYLKDS 164
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
L A+ SPV++ GGSY GMLAAW RLKYPHIAIGALASSAP+L F++I P + ++++ D
Sbjct: 165 LHAQKSPVIVVGGSYAGMLAAWFRLKYPHIAIGALASSAPLLYFDNITPASGYNDVITRD 224
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLA 308
F+ S +C+ TI+ SW E+ V K NGL L+K F+ C L +L D+L + Y A
Sbjct: 225 FQETSKTCYETIRNSWFEIDEVASKPNGLNFLSKKFNTCYPLEQPGELKDYLGNMYQKAA 284
Query: 309 MVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDD 368
+ P P +C+ ID A IL RI+ G+ Y GN C D
Sbjct: 285 QYNDP------------PTTTICEAIDRASYGDDILSRIYGGMVASY---GNKKCNVNPD 329
Query: 369 DPHG---LDGWNWQV 380
G D W WQ
Sbjct: 330 KYTGAKPFDRWRWQT 344
>gi|393909177|gb|EJD75348.1| serine carboxypeptidase S28 family protein [Loa loa]
Length = 542
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 132/343 (38%), Positives = 198/343 (57%), Gaps = 32/343 (9%)
Query: 62 QQQQYRYETRYFEQR-LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW 120
+ Y ++ ++++ +DHF++ +L TF +YL+N ++ N GP+F Y GNEGDIE
Sbjct: 2 EAMNYEWDIKWYQSMPIDHFNYRNLDTFGLKYLVNYSYF---NCDGPLFFYAGNEGDIET 58
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
FA +G +WD+AP F A +VF EHRYYGES P+G + +Y + L YL QALADFA
Sbjct: 59 FAQMTGIMWDLAPLFNAAIVFAEHRYYGESQPFG--KRSYMDVLRLGYLNEIQALADFAE 116
Query: 181 FITNLKQNLS-------AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
I+ LK + PV++FGGSYGGMLAAW+R+KYPHI GA ASSAP+ F
Sbjct: 117 LISFLKTDQKELGFCPMGTEIPVIVFGGSYGGMLAAWLRMKYPHIVDGAWASSAPLRIFY 176
Query: 234 DI-VPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHL-----C 287
+ PE+ +++++ S + + + + + E G ++L + FH
Sbjct: 177 GTGINPESVSRTITTNY-LTSGCDRKVFSDGFVAIEKMSKTEEGRMKLNRIFHAKPGFEM 235
Query: 288 RELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP-DATSILER 346
+ N L ++ SA Y+AM DYPYP+DF PLPGYP++ VC+ A + ++ E+
Sbjct: 236 KSYNDFMSLYSYIYSAIFYMAMTDYPYPADFFEPLPGYPVKYVCQYAKKAATNEENLAEQ 295
Query: 347 IFEGVSVYYNYTGNV--DCFQ--------LDDDPHGLDGWNWQ 379
I+ ++VYYNYTG + +CF + +D + WNWQ
Sbjct: 296 IYSIINVYYNYTGQLTDNCFTSNCTTPSPIQNDDEDI-AWNWQ 337
>gi|260829575|ref|XP_002609737.1| hypothetical protein BRAFLDRAFT_78548 [Branchiostoma floridae]
gi|229295099|gb|EEN65747.1| hypothetical protein BRAFLDRAFT_78548 [Branchiostoma floridae]
Length = 422
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 178/310 (57%), Gaps = 17/310 (5%)
Query: 68 YETRYFEQRLDHFSFAD--LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ETRYF Q LDHF+FA TF +R L++ W + GPIF Y GNEG I
Sbjct: 35 HETRYFNQYLDHFNFASHGAETFQERVLVSDAFW---RKEGPIFFYTGNEGPITSIWNEV 91
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ D+A +F A++VF EHRYYGES+P+G T +N + LT EQALAD+AV ITNL
Sbjct: 92 GFIKDLAEKFEALIVFVEHRYYGESLPFGETTFNKEN---MGLLTVEQALADYAVLITNL 148
Query: 186 KQNLSAE--ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
+ + PV+ FGGSYGG+L+A+MRLKYP++ GALASSA + + P +
Sbjct: 149 TASYCEDPDVCPVIAFGGSYGGVLSAFMRLKYPNLVAGALASSANVYMSAGLTPGNELFQ 208
Query: 244 IVSSDFKRESASCFNTIKESWGELVSV-GQKENGLLELTKTFHLCRELNSTEDLAD---W 299
V+ DF+R + C ++E + E+ + GQ + GL E++ LC L DL + W
Sbjct: 209 DVTEDFRRYNPRCPERVREGFAEMERLAGQGKQGLHEISSRMRLCSPLQHHADLVNMYRW 268
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTG 359
+ A++ LAM D PY LP YP+ C + A D +E I + V + YN+T
Sbjct: 269 VREAFTVLAMEDLPYSISNGPSLPAYPVNASCDLLLKASDG---IEGILQAVGMLYNFTS 325
Query: 360 NVDCFQLDDD 369
N+ CF L D
Sbjct: 326 NLTCFDLHRD 335
>gi|380790295|gb|AFE67023.1| dipeptidyl peptidase 2 preproprotein [Macaca mulatta]
gi|383414091|gb|AFH30259.1| dipeptidyl peptidase 2 preproprotein [Macaca mulatta]
gi|384940796|gb|AFI34003.1| dipeptidyl peptidase 2 preproprotein [Macaca mulatta]
Length = 492
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 120/304 (39%), Positives = 189/304 (62%), Gaps = 14/304 (4%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ R+F+QRLDHF+F TF QR+L++ W+ GPIF Y GNEGD+ FA NS
Sbjct: 31 FQERFFQQRLDHFNFERFGNQTFPQRFLVSDRFWIRGE--GPIFFYTGNEGDVWAFANNS 88
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ ++A GA+LVF EHRYYG+S+P+G E + Q T LT EQALADFA + L
Sbjct: 89 GFIAELAAEQGALLVFAEHRYYGKSLPFG--ERSTQRGHT-ELLTVEQALADFAELLRAL 145
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A+ +P + FGGSYGGML+A++R+KYPH+ GALA+SAP+L + F+ V
Sbjct: 146 RRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 205
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WLES 302
++DF+ +S C ++E++G++ + + + F C+ L+ +DL + +
Sbjct: 206 TADFEGQSPKCTQGVREAFGQIKDL-FLQGAYDRVRWEFGTCQPLSDEKDLTQLFMFARN 264
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVD 362
A++ LAM+DYPYP+DF+ PLP P++ C ++ + + L + + YN +G+
Sbjct: 265 AFTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRAL---AGLVYNASGSEH 321
Query: 363 CFQL 366
C+ +
Sbjct: 322 CYDI 325
>gi|402895977|ref|XP_003911085.1| PREDICTED: dipeptidyl peptidase 2 [Papio anubis]
Length = 492
Score = 232 bits (591), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 120/304 (39%), Positives = 189/304 (62%), Gaps = 14/304 (4%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ R+F+QRLDHF+F TF QR+L++ W+ GPIF Y GNEGD+ FA NS
Sbjct: 31 FQERFFQQRLDHFNFERFGNQTFPQRFLVSDRFWIRGE--GPIFFYTGNEGDVWAFANNS 88
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ ++A GA+LVF EHRYYG+S+P+G E + Q T LT EQALADFA + L
Sbjct: 89 GFIAELAAEQGALLVFAEHRYYGKSLPFG--ERSTQRGHT-ELLTVEQALADFAELLRAL 145
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A+ +P + FGGSYGGML+A++R+KYPH+ GALA+SAP+L + F+ V
Sbjct: 146 RRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 205
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WLES 302
++DF+ +S C ++E++G++ + + + F C+ L+ +DL + +
Sbjct: 206 TADFEGQSPKCTQGVREAFGQIKDL-FLQGAYDRVRWEFGTCQPLSDKKDLTQLFMFARN 264
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVD 362
A++ LAM+DYPYP+DF+ PLP P++ C ++ + + L + + YN +G+
Sbjct: 265 AFTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRAL---AGLVYNASGSEH 321
Query: 363 CFQL 366
C+ +
Sbjct: 322 CYDI 325
>gi|443733182|gb|ELU17643.1| hypothetical protein CAPTEDRAFT_152094 [Capitella teleta]
Length = 472
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 129/331 (38%), Positives = 203/331 (61%), Gaps = 26/331 (7%)
Query: 65 QYRYETRYFEQRLDHFS--FADLP--TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW 120
Q Y+ ++FEQ +DHF+ +A T+ QRYLI D W P + GPIF Y GNEGDI
Sbjct: 27 QTPYKEQFFEQTIDHFNSYWAQYGKRTYKQRYLIQ-DKWWTPGK-GPIFFYTGNEGDIAT 84
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
F N+GF+++IAP+F A++VF EHRYYG+S+P+G E +++ +S L+++QALADFAV
Sbjct: 85 FWNNTGFMFEIAPKFNALIVFAEHRYYGKSLPFG--ERSFKQPY-ISLLSSQQALADFAV 141
Query: 181 FITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET 240
+ +LK +L+A V+ FGGSYGGML+A+MR+KYP++ G++A+SAP+ +
Sbjct: 142 LLNHLKPSLNATDCKVIAFGGSYGGMLSAYMRIKYPNLIDGSIAASAPVYLIGGDSSRDF 201
Query: 241 FYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA--- 297
F+ V++DF ++A C I++ + ++ S+ +GL +++ F LC+ + +T D
Sbjct: 202 FFEDVTADF--QAAGCDKLIRDGFSKMASMSSTTDGLKKISSHFMLCKYMKTTSDFTHFL 259
Query: 298 DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNY 357
WL +A++ +AM+DYPYP+DFM +P +P+ CK + N T ++ + + S+ Y Y
Sbjct: 260 GWLRNAFTLMAMMDYPYPTDFMSKMPAWPVNAGCKAMKN---VTCPVKGLADLASIVYPY 316
Query: 358 ---------TGNVDCFQLDDDPHGLDGWNWQ 379
T VDC G D + W
Sbjct: 317 KPDGCHDIWTDFVDCADPTGCGTGPDSYAWD 347
>gi|66805245|ref|XP_636355.1| peptidase S28 family protein [Dictyostelium discoideum AX4]
gi|60464723|gb|EAL62849.1| peptidase S28 family protein [Dictyostelium discoideum AX4]
Length = 513
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 129/330 (39%), Positives = 194/330 (58%), Gaps = 24/330 (7%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNR-------LGPIFLYCGNEGDIEW 120
Y +F Q LDHF+F F+QR LI ++ ++ P+ +CGNEGD+ +
Sbjct: 56 YTLLWFNQTLDHFNFETSGYFNQRVLIIDQYFNEKSKNEIDQICTKPLIFFCGNEGDVTF 115
Query: 121 FAVNSGFVWD-IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
F NS F+ + +A A+++F EHRYYGES+P+G+ +N YL++EQALAD++
Sbjct: 116 FYENSLFITNTLAQEMNALVIFAEHRYYGESLPFGNQSYTNEN---FQYLSSEQALADYS 172
Query: 180 VFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI-VPP 238
I ++ + +A PV GSYGG LAAWMRLKYP I GALASSAP+L + VP
Sbjct: 173 KIIPSILKQYNALNCPVFTTSGSYGGDLAAWMRLKYPFIVDGALASSAPLLSYMGTGVPY 232
Query: 239 ETFYNIVSSDFKRES--ASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
+ F V++DFK S SC I+ ++ +L ++ + +NG E++ +F LC +NS +D
Sbjct: 233 DVFPVGVTNDFKETSQDGSCAIKIRNAFNDLETIAKADNGFNEISTSFKLCTPINSNDDF 292
Query: 297 AD---WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSV 353
W+ES +SY++M DYPYP+ F+ P+ G P+ E C I+ ++ I I G+ +
Sbjct: 293 QSFLGWVESGFSYMSMADYPYPASFLEPMMGNPVNETCNLINQLDNSIDI---IMSGLQI 349
Query: 354 YYNYTGN-VDCFQLD---DDPHGLDGWNWQ 379
YYNYTG + CF + +D L W++Q
Sbjct: 350 YYNYTGQMMQCFNTNIFIEDQGMLIPWSYQ 379
>gi|148674771|gb|EDL06718.1| prolylcarboxypeptidase (angiotensinase C) [Mus musculus]
Length = 437
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 132/338 (39%), Positives = 179/338 (52%), Gaps = 66/338 (19%)
Query: 49 PHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPI 108
PHL+ P + +Y YFEQ++DHF FAD+ TF QRYL+ HW G I
Sbjct: 30 PHLSASPTPDPAVAR---KYSVLYFEQKVDHFGFADMRTFKQRYLVADKHW--QRNGGSI 84
Query: 109 FLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSY 168
Y GNEGDI WF N+GF+WD+A AMLVF EHRYYGES+P+G + +++++ L++
Sbjct: 85 LFYTGNEGDIVWFCNNTGFMWDVAEELKAMLVFAEHRYYGESLPFG--QDSFKDSQHLNF 142
Query: 169 LTAEQALADFAVFITNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
LT+EQALADFA I +L++ + A+ PV+ GGSYGGMLAAW R+KYPHI +G L S
Sbjct: 143 LTSEQALADFAELIRHLEKTIPGAQGQPVIAIGGSYGGMLAAWFRMKYPHIVVGGLQSLT 202
Query: 228 PILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC 287
IL + E + I E+W L
Sbjct: 203 NILHLCSPLTSEKIPTLKG------------WIAETWVNL-------------------- 230
Query: 288 RELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDA--TSILE 345
AMV+YPY +F+ PLP +PI+EVC+ + N P+ T +L+
Sbjct: 231 --------------------AMVNYPYACNFLQPLPAWPIKEVCQYLKN-PNVSDTVLLQ 269
Query: 346 RIFEGVSVYYNYTGNVDCF---QLDDDPHGLDGWNWQV 380
IF+ +SVYYNY+G C Q G GW++Q
Sbjct: 270 NIFQALSVYYNYSGQAACLNISQTTTSSLGSMGWSFQA 307
>gi|410224782|gb|JAA09610.1| dipeptidyl-peptidase 7 [Pan troglodytes]
gi|410251824|gb|JAA13879.1| dipeptidyl-peptidase 7 [Pan troglodytes]
Length = 492
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 121/305 (39%), Positives = 189/305 (61%), Gaps = 16/305 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ R+F+QRLDHF+F TF QR+L++ WV GPIF Y GNEGD+ FA NS
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGE--GPIFFYTGNEGDVWAFANNS 88
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GFV ++A GA+LVF EHRYYG+S+P+G+ Q T LT EQALADFA + L
Sbjct: 89 GFVAELAAEQGALLVFAEHRYYGKSLPFGAQST--QRGHT-ELLTVEQALADFAELLRAL 145
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A+ +P + FGGSYGGML+A++R+KYPH+ GALA+SAP+L + F+ V
Sbjct: 146 RRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 205
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLE 301
++DF+ +S C ++E++ ++ + + G + + F C+ L+ +DL +
Sbjct: 206 TADFEGQSPKCTQGVREAFRQIKDLFLQ--GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 263
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
+A++ LAM+DYPYP+DF+ PLP P++ C ++ + + L + + YN +G+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRAL---AGLVYNASGSE 320
Query: 362 DCFQL 366
C+ +
Sbjct: 321 HCYDI 325
>gi|15080291|gb|AAH11907.1| Dipeptidyl-peptidase 7 [Homo sapiens]
gi|16877430|gb|AAH16961.1| Dipeptidyl-peptidase 7 [Homo sapiens]
gi|123981114|gb|ABM82386.1| dipeptidyl-peptidase 7 [synthetic construct]
gi|123995923|gb|ABM85563.1| dipeptidyl-peptidase 7 [synthetic construct]
gi|158257656|dbj|BAF84801.1| unnamed protein product [Homo sapiens]
Length = 492
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 121/305 (39%), Positives = 189/305 (61%), Gaps = 16/305 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ R+F+QRLDHF+F TF QR+L++ WV GPIF Y GNEGD+ FA NS
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGE--GPIFFYTGNEGDVWAFANNS 88
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GFV ++A GA+LVF EHRYYG+S+P+G+ Q T LT EQALADFA + L
Sbjct: 89 GFVAELAAERGALLVFAEHRYYGKSLPFGAQST--QRGHT-ELLTVEQALADFAELLRAL 145
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A+ +P + FGGSYGGML+A++R+KYPH+ GALA+SAP+L + F+ V
Sbjct: 146 RRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 205
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLE 301
++DF+ +S C ++E++ ++ + + G + + F C+ L+ +DL +
Sbjct: 206 TADFEGQSPKCTQGVREAFRQIKDLFLQ--GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 263
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
+A++ LAM+DYPYP+DF+ PLP P++ C ++ + + L + + YN +G+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRAL---AGLVYNASGSE 320
Query: 362 DCFQL 366
C+ +
Sbjct: 321 HCYDI 325
>gi|114627680|ref|XP_528471.2| PREDICTED: dipeptidyl peptidase 2 isoform 5 [Pan troglodytes]
Length = 492
Score = 229 bits (584), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 121/305 (39%), Positives = 190/305 (62%), Gaps = 16/305 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ R+F+QRLDHF+F TF QR+L++ WV GPIF Y GNEGD+ FA NS
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGE--GPIFFYTGNEGDVWAFANNS 88
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GFV ++A GA+LVF EHRYYG+S+P+G+ + Q T LT EQALADFA + L
Sbjct: 89 GFVAELAAEQGALLVFAEHRYYGKSLPFGAQ--SRQRGHT-ELLTVEQALADFAELLRAL 145
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A+ +P + FGGSYGGML+A++R+KYPH+ GALA+SAP+L + F+ V
Sbjct: 146 RRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 205
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLE 301
++DF+ +S C ++E++ ++ + + G + + F C+ L+ +DL +
Sbjct: 206 TADFEGQSPKCTQGVREAFRQIKDLFLQ--GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 263
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
+A++ LAM+DYPYP+DF+ PLP P++ C ++ + + L + + YN +G+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRAL---AGLVYNASGSE 320
Query: 362 DCFQL 366
C+ +
Sbjct: 321 HCYDI 325
>gi|225706950|gb|ACO09321.1| Dipeptidyl-peptidase 2 precursor [Osmerus mordax]
Length = 490
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 123/315 (39%), Positives = 187/315 (59%), Gaps = 14/315 (4%)
Query: 57 RQQRQQQQQYRYETRYFEQRLDHFSFADLP--TFSQRYLINTDHWVGPNRLGPIFLYCGN 114
+ + Q +++ +YF Q LDHF+F + T+ QRYLI +W GPIF Y GN
Sbjct: 29 KHHEPNKDQPQFKEKYFTQILDHFNFNSMGNGTYDQRYLITDQYW--KRGYGPIFFYTGN 86
Query: 115 EGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA 174
EGDI FA+NSGF+ ++A A+++F EHRYYG+S+P+G + + LT EQA
Sbjct: 87 EGDIWEFALNSGFITELAADQKALVIFAEHRYYGKSLPFGQDSFSIPE---VGLLTVEQA 143
Query: 175 LADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
LADFAV IT LK L A PV++FGGSYGGML+ +MRL+YP+I GALA+SAPIL
Sbjct: 144 LADFAVMITALKPQLGASECPVIVFGGSYGGMLSVYMRLRYPNIVAGALAASAPILSTAG 203
Query: 235 IVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
+ F+ V+ DF+ + C + ++ ++ +L + + E+ + F LC+ +S +
Sbjct: 204 MGDSRQFFQDVTHDFESYAPECRDAVRGAFQKLQDLSEVED-YSRIQAAFSLCKPPSSQK 262
Query: 295 D---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGV 351
D L L +A++ +AM+DYPY + FM +P P++ C + + D +L+ + +
Sbjct: 263 DIHQLNGLLRNAFTLMAMLDYPYSTHFMGNMPANPVKVGCDIMLSGAD---LLQALRDTA 319
Query: 352 SVYYNYTGNVDCFQL 366
+ YN TG + CF L
Sbjct: 320 GIVYNSTGILKCFDL 334
>gi|197099620|ref|NP_001127163.1| dipeptidyl peptidase 2 precursor [Pongo abelii]
gi|55725338|emb|CAH89533.1| hypothetical protein [Pongo abelii]
Length = 492
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 120/305 (39%), Positives = 191/305 (62%), Gaps = 16/305 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ R+F+QRLDHF+F TF QR+L++ W+ GPIF Y GNEGD+ FA NS
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWIRGE--GPIFFYTGNEGDVWAFANNS 88
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GFV ++A GA+LVF EHRYYG+S+P+G+ + + T L LT EQALADFA + L
Sbjct: 89 GFVAELAAEQGALLVFAEHRYYGKSLPFGA-QSTRRGHTEL--LTVEQALADFAELLRAL 145
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A+ +P + FGGSYGGML+A++R+KYPH+ GALA+SAP+L + F+ V
Sbjct: 146 RRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 205
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLE 301
++DF+ +S C ++E++ ++ + + G + + F C+ L+ +DL +
Sbjct: 206 TADFEGQSPKCTQGVREAFRQIKDLFLR--GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 263
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
+A++ LAM+DYPYP+DF+ PLP P++ C ++ + + L + + YN +G+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRAL---AGLVYNASGSE 320
Query: 362 DCFQL 366
C+ +
Sbjct: 321 HCYDI 325
>gi|351704787|gb|EHB07706.1| Dipeptidyl-peptidase 2 [Heterocephalus glaber]
Length = 515
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 119/302 (39%), Positives = 185/302 (61%), Gaps = 18/302 (5%)
Query: 72 YFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVW 129
YFEQ LDHF+F TFSQR+L++ W GPIF Y GNEGD+ FA NSGF+
Sbjct: 43 YFEQLLDHFNFESFGNKTFSQRFLVSDKFWR--RSEGPIFFYTGNEGDVWGFANNSGFLV 100
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
++A + +LVF EHRYYG+S+P+G ST+ Y L LT EQALADFAV + L++
Sbjct: 101 ELAQQQEGLLVFAEHRYYGKSLPFGAQSTQRGY-----LKLLTVEQALADFAVLLQALRR 155
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+L + +P + FGGSYGGML+A+MR+KYPH+ GALA+SAP++ + F+ V++
Sbjct: 156 DLGTQDAPAIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVIAVSGLGDSYQFFRDVTA 215
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WLESAY 304
DF +S C +++++ ++ + + +++ F C+ L+ EDL + +A+
Sbjct: 216 DFYGQSPKCAQGVRDAFQQIRDL-FLQGAYDRISREFGTCQSLSGLEDLTQLFVFARNAF 274
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF 364
+ LAM+DYPYP+DF++PLP P++ C ++ + + L + + YN +G C+
Sbjct: 275 TVLAMMDYPYPTDFLVPLPANPVKAGCDRLLSEAQRIAGLRSL---AGLVYNASGTERCY 331
Query: 365 QL 366
+
Sbjct: 332 DI 333
>gi|6465985|gb|AAF12747.1|AF154502_1 quiescent cell proline dipeptidase [Homo sapiens]
Length = 492
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 120/305 (39%), Positives = 188/305 (61%), Gaps = 16/305 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ R+F+QRLDHF+F TF QR+L++ WV GP F Y GNEGD+ FA NS
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGE--GPTFFYTGNEGDVWAFANNS 88
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GFV ++A GA+LVF EHRYYG+S+P+G+ Q T LT EQALADFA + L
Sbjct: 89 GFVAELAAERGALLVFAEHRYYGKSLPFGAQST--QRGHT-ELLTVEQALADFAELLRAL 145
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A+ +P + FGGSYGGML+A++R+KYPH+ GALA+SAP+L + F+ V
Sbjct: 146 RRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 205
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLE 301
++DF+ +S C ++E++ ++ + + G + + F C+ L+ +DL +
Sbjct: 206 TADFEGQSPKCTQGVREAFRQIKDLFLQ--GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 263
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
+A++ LAM+DYPYP+DF+ PLP P++ C ++ + + L + + YN +G+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRAL---AGLVYNASGSE 320
Query: 362 DCFQL 366
C+ +
Sbjct: 321 HCYDI 325
>gi|260828789|ref|XP_002609345.1| hypothetical protein BRAFLDRAFT_128878 [Branchiostoma floridae]
gi|229294701|gb|EEN65355.1| hypothetical protein BRAFLDRAFT_128878 [Branchiostoma floridae]
Length = 489
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 128/334 (38%), Positives = 194/334 (58%), Gaps = 31/334 (9%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y+ RY EQ +D+F+F T+ Q+ L++ +W R GPIF Y GNEG I F S
Sbjct: 31 YKVRYVEQYVDNFNFPSYGQQTYMQKVLVSDAYW--EKREGPIFFYTGNEGPITAFWEAS 88
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GFV ++A +F A+LVF EHRYYGES+P+G+ +N + L+ EQA+AD+A +T L
Sbjct: 89 GFVKELAAKFKALLVFAEHRYYGESLPFGNQSFTKEN---IGLLSVEQAMADYARLMTAL 145
Query: 186 KQNL---SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
+ +L S + P++ FGGSYGGML+A+MR KYP++ GALA+SAPI + F+
Sbjct: 146 RTHLDCKSPDVCPIITFGGSYGGMLSAYMRFKYPNLVAGALAASAPIYLVAGLTEGHQFF 205
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DW 299
V+ DF++ A C ++ ++ E+ +G GL E++ F LC L +DL+ W
Sbjct: 206 QDVTEDFRKSDARCPLKVQSAYFEMEELGA--GGLKEISDRFQLCSPLTDKKDLSHLYGW 263
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYT- 358
+ ++++ LAM+DYPYP+DF LP P+ C I N+ D +L+ + + + YN T
Sbjct: 264 VRNSFTTLAMLDYPYPTDFEAKLPANPVNVACGLILNSSD---LLKGLSQAAGLAYNGTD 320
Query: 359 GNVDCFQLDD------DPHGLD------GWNWQV 380
G ++CF + D DP G W++Q
Sbjct: 321 GTLECFDIFDEFVACADPTGCGLGDDSTAWDYQA 354
>gi|62420888|ref|NP_037511.2| dipeptidyl peptidase 2 preproprotein [Homo sapiens]
gi|212276510|sp|Q9UHL4.3|DPP2_HUMAN RecName: Full=Dipeptidyl peptidase 2; AltName: Full=Dipeptidyl
aminopeptidase II; AltName: Full=Dipeptidyl peptidase 7;
AltName: Full=Dipeptidyl peptidase II; Short=DPP II;
AltName: Full=Quiescent cell proline dipeptidase; Flags:
Precursor
Length = 492
Score = 227 bits (578), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 120/305 (39%), Positives = 188/305 (61%), Gaps = 16/305 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ R+F+QRLDHF+F TF QR+L++ WV GPIF Y GNEGD+ FA NS
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGE--GPIFFYTGNEGDVWAFANNS 88
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
FV ++A GA+LVF EHRYYG+S+P+G+ Q T LT EQALADFA + L
Sbjct: 89 AFVAELAAERGALLVFAEHRYYGKSLPFGAQST--QRGHT-ELLTVEQALADFAELLRAL 145
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A+ +P + FGGSYGGML+A++R+KYPH+ GALA+SAP+L + F+ V
Sbjct: 146 RRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 205
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLE 301
++DF+ +S C ++E++ ++ + + G + + F C+ L+ +DL +
Sbjct: 206 TADFEGQSPKCTQGVREAFRQIKDLFLQ--GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 263
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
+A++ LAM+DYPYP+DF+ PLP P++ C ++ + + L + + YN +G+
Sbjct: 264 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRAL---AGLVYNASGSE 320
Query: 362 DCFQL 366
C+ +
Sbjct: 321 HCYDI 325
>gi|324509048|gb|ADY43812.1| Serine protease pcp-1 [Ascaris suum]
Length = 593
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 135/335 (40%), Positives = 192/335 (57%), Gaps = 27/335 (8%)
Query: 66 YRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y + T Y++ +D+F+F T+ +YL N ++ GPIF Y GNEG IE FA N+
Sbjct: 46 YGWSTAYYDVPIDNFAFTSAQTYRMKYLYNLTYY---ELGGPIFFYTGNEGSIEEFAKNT 102
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G ++D+A +F A + F EHRYYG SMP+G+ ++Y NA L YL++ QALADFA IT +
Sbjct: 103 GIMFDLAEKFKAAVFFAEHRYYGASMPFGN--ISYTNANYLGYLSSTQALADFAKLITFI 160
Query: 186 KQNL--SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE-DIVPPETFY 242
K ++ +PV+ FGGSYGGMLAAW+R+KYPHI GA +SSAP+L FE V P F
Sbjct: 161 KTDVLKCPPDTPVIAFGGSYGGMLAAWLRMKYPHIVSGAWSSSAPLLYFEGGNVSPSAFE 220
Query: 243 NIVSSDFKRESASCF-NTIKESWGELVSVGQKENGLLELTKTFHL-----CRELNSTEDL 296
V F +A C NTI + ++ G L F + E ++ L
Sbjct: 221 KAVKEVFI--NAGCNENTIANGLEAIKNLMNTAEGRQFLNDLFRIDPTSTLTESTDSDFL 278
Query: 297 ADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNA--PDATSILERIFEGVSVY 354
+W+ +A Y+AMV+YPYPS+F+ PLPG+P++ C+ + DA +++ +VY
Sbjct: 279 VEWIWAAMDYMAMVNYPYPSNFLQPLPGWPVKYSCENFARSEITDARQAATALYQISNVY 338
Query: 355 YNYTGNVD--------CFQLDDDPHGLD-GWNWQV 380
YN+TGNV C + G D GW+WQ
Sbjct: 339 YNFTGNVATNCVNWNVCGESAIANLGADAGWSWQT 373
>gi|268575392|ref|XP_002642675.1| C. briggsae CBR-TAG-282 protein [Caenorhabditis briggsae]
Length = 505
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 127/333 (38%), Positives = 193/333 (57%), Gaps = 17/333 (5%)
Query: 38 FRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTD 97
F FV +T + Q ++ +Y+Y+ Y + +DHFSF + F RY +NTD
Sbjct: 11 FANTSCFVRLRDPITRKSKNQYFEKLGKYKYDEGYLKVPIDHFSFTNDYEFDLRYFLNTD 70
Query: 98 HWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE 157
H+ GPI Y GNEG +E FA N+G +WD+AP A +VF EHR+YG+S P+ +
Sbjct: 71 HY---ESGGPILFYTGNEGSLESFAENTGLMWDLAPELKAAVVFVEHRFYGKSQPFKNQ- 126
Query: 158 VAYQNATTLSYLTAEQALADFAV---FITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLK 214
+Y + L YL+++QALADFA+ F N K A+ S V+ FGGSYGGML+AW R+K
Sbjct: 127 -SYTDIRHLGYLSSQQALADFALSAQFFRNEKIK-GAQTSAVIAFGGSYGGMLSAWFRIK 184
Query: 215 YPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKE 274
YPHI GA+A+SAP+ F D PE Y+ + + ++ +++ W L + + +
Sbjct: 185 YPHIVDGAIAASAPVFWFTDSNIPEDVYDFIVTRAFLDAGCNRKAVEKGWIALDELAKSD 244
Query: 275 NGLLELTKTFHL--CRELNSTED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIRE 329
+G L + L +L S +D L ++ A +AMV+YPYP+ F+ LPG+P++E
Sbjct: 245 SGRQYLNVLYKLDPKSKLESKDDIGFLKQYIREAMEAMAMVNYPYPTSFLSSLPGWPVKE 304
Query: 330 VCKKIDNAPDAT--SILERIFEGVSVYYNYTGN 360
C K NAP + E+++ V++YYN+TG+
Sbjct: 305 AC-KYANAPGKSQEESAEQLYNIVNLYYNFTGD 336
>gi|261278872|pdb|3JYH|A Chain A, Human Dipeptidyl Peptidase Dpp7
gi|261278873|pdb|3JYH|B Chain B, Human Dipeptidyl Peptidase Dpp7
gi|261278874|pdb|3JYH|C Chain C, Human Dipeptidyl Peptidase Dpp7
gi|261278875|pdb|3JYH|D Chain D, Human Dipeptidyl Peptidase Dpp7
gi|301015979|pdb|3N0T|A Chain A, Human Dipeptidil Peptidase Dpp7 Complexed With Inhibitor
Gsk237826a
gi|301015980|pdb|3N0T|B Chain B, Human Dipeptidil Peptidase Dpp7 Complexed With Inhibitor
Gsk237826a
gi|301015981|pdb|3N0T|C Chain C, Human Dipeptidil Peptidase Dpp7 Complexed With Inhibitor
Gsk237826a
gi|301015982|pdb|3N0T|D Chain D, Human Dipeptidil Peptidase Dpp7 Complexed With Inhibitor
Gsk237826a
Length = 469
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 120/305 (39%), Positives = 188/305 (61%), Gaps = 16/305 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ R+F+QRLDHF+F TF QR+L++ WV GPIF Y GNEGD+ FA NS
Sbjct: 8 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGE--GPIFFYTGNEGDVWAFANNS 65
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
FV ++A GA+LVF EHRYYG+S+P+G+ Q T LT EQALADFA + L
Sbjct: 66 AFVAELAAERGALLVFAEHRYYGKSLPFGAQST--QRGHT-ELLTVEQALADFAELLRAL 122
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A+ +P + FGGSYGGML+A++R+KYPH+ GALA+SAP+L + F+ V
Sbjct: 123 RRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 182
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLE 301
++DF+ +S C ++E++ ++ + + G + + F C+ L+ +DL +
Sbjct: 183 TADFEGQSPKCTQGVREAFRQIKDLFLQ--GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 240
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
+A++ LAM+DYPYP+DF+ PLP P++ C ++ + + L + + YN +G+
Sbjct: 241 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRAL---AGLVYNASGSE 297
Query: 362 DCFQL 366
C+ +
Sbjct: 298 HCYDI 302
>gi|301103554|ref|XP_002900863.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
gi|262101618|gb|EEY59670.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
Length = 542
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 127/317 (40%), Positives = 185/317 (58%), Gaps = 28/317 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+ ++ Q LDHF PT+ QRY + D P G +F Y GNE D+E + ++G +
Sbjct: 83 DEKFLTQTLDHFDVG-APTYQQRYFV-CDKQFRPG--GVMFFYVGNEADVELYLNHTGLM 138
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
W+ A FGAMLVF EHRY+G+S+P+G + + YL+ EQALADFAV IT LK
Sbjct: 139 WENADEFGAMLVFAEHRYFGKSVPFGKDVTKH-----MKYLSTEQALADFAVLITYLKTE 193
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF-EDIVPPE--TFYNIV 245
+ PV+ FGGSYGGML +W+R+KYPHI G +A SAPIL F D VP + +F IV
Sbjct: 194 WKLDI-PVIGFGGSYGGMLGSWLRMKYPHIIDGVIAGSAPILSFLGDEVPLDKGSFERIV 252
Query: 246 SSDFKRESAS---CFNTIKESWGELVSVGQKENGLLELTKTFHLCREL-----NSTEDLA 297
+ D E+ S C I+ +W + +G E+G +L + LC + +++
Sbjct: 253 TFDASEEAGSAPNCVPNIRRTWPAMKKLGDTEDGRKQLKRALSLCDSVKLESRKDVDEVM 312
Query: 298 DWLESAYSYLAMVDYPYPSDFMMP----LPGYPIREVCKKIDN--AP-DATSILERIFEG 350
DW +SA+ Y+AM +YPYPS ++M LP YP+R C + + AP D ++L +
Sbjct: 313 DWAKSAFDYMAMGNYPYPSSYIMNGVSVLPAYPVRVACSFVADEFAPDDEVALLSAFAKS 372
Query: 351 VSVYYNYTGNVDCFQLD 367
+ VYYN T + +C++L+
Sbjct: 373 LGVYYNSTKHQECYELN 389
>gi|428182777|gb|EKX51637.1| hypothetical protein GUITHDRAFT_157202 [Guillardia theta CCMP2712]
Length = 481
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 127/326 (38%), Positives = 193/326 (59%), Gaps = 23/326 (7%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLI--NTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVW 129
+FEQ LDHFS+ + + QRY + T+ + N IF YCGNEG++E + N+G ++
Sbjct: 7 WFEQVLDHFSWRNDSRWQQRYYVCQETEQQLA-NPAATIFFYCGNEGNVEMYIRNTGLMF 65
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
+ A F AML+F EHRYYG+S+P+G+ +A +L YL+ EQALAD+AV + + K+
Sbjct: 66 ENAKSFSAMLIFAEHRYYGKSLPFGND----FSAASLRYLSHEQALADYAVLLDDFKRKH 121
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE--DIVP--PETFYNIV 245
+ V+ FGGSYGGML+AW R+KYPHI GA+A+SAP+L F D P E ++ IV
Sbjct: 122 KMVRAKVIAFGGSYGGMLSAWFRMKYPHIVEGAVAASAPVLSFHSSDKGPWRSEKYWEIV 181
Query: 246 SSDFKRESAS---CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS---TEDLADW 299
+ D + S C +++SW + S+G E+G L F LC L S DL +
Sbjct: 182 TRDASGAAGSDERCVPLVRQSWPIIDSMGASESGRESLAALFRLCEPLASPGEVNDLKLF 241
Query: 300 LESAYSYLAMVDYPYPSDFMM----PLPGYPIREVCKKIDNAPD--ATSILERIFEGVSV 353
+ A+ +AM +YP+PSD++ LP +P+RE CK + + D A ++L+ + +S+
Sbjct: 242 IAMAFDTMAMGNYPFPSDYLTGGIGKLPPWPVREACKLLSSCGDCKAENLLDSLCSAISL 301
Query: 354 YYNYTGNVDCFQLDDDPHGLDGWNWQ 379
YN +G+ C L +D W++Q
Sbjct: 302 LYNASGDQACLHLPEDSSYAGIWDFQ 327
>gi|170574568|ref|XP_001892871.1| Serine carboxypeptidase S28 family protein [Brugia malayi]
gi|158601365|gb|EDP38293.1| Serine carboxypeptidase S28 family protein [Brugia malayi]
Length = 567
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 128/346 (36%), Positives = 199/346 (57%), Gaps = 36/346 (10%)
Query: 62 QQQQYRYETRYFEQR-LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW 120
+ Y + ++++ +DHF++ + FS +YL N +++ GP+F Y GNEGDIE
Sbjct: 43 ESMNYEWNVKWYQSMPIDHFTYRNNEFFSLKYLANYSYFLCD---GPLFFYAGNEGDIEA 99
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
FA N+G +WD+APRF A +VF EHRYYG S PYG + +Y + L YL Q LADFA
Sbjct: 100 FAQNTGIIWDLAPRFHAAIVFAEHRYYGNSKPYG--KRSYMDVLRLGYLNDIQVLADFAQ 157
Query: 181 FITNLKQNLS-------AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
IT LK + PV++FGGSYGGMLAAW+R+KYPHI GA ASSAP+ F
Sbjct: 158 LITFLKTDQEELGFCPPGTEIPVIVFGGSYGGMLAAWLRMKYPHIVDGAWASSAPLRNFY 217
Query: 234 DI-VPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHL--CREL 290
+ PE+ N+ ++++ + E + + + + E G ++L + F E+
Sbjct: 218 GTGINPESVSNVTTTNYVTNGCD-WKVFSEGFVAIQKLSKTEEGRMKLNQIFRSKPGFEM 276
Query: 291 NSTED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCK-KIDNAPDATSILER 346
+++D L ++ SA Y+A++DYPYP++F+ PLPG+P++ C+ + + E+
Sbjct: 277 KNSDDFTSLYSYIYSAVFYMAVIDYPYPTNFLTPLPGFPVKYACQYATKXGINDEELAEQ 336
Query: 347 IFEGVSVYYNYTGNV-------DC-----FQLDDDPHGLDGWNWQV 380
++ ++VYYNYTG + +C FQ +D+ + WNWQ
Sbjct: 337 LYNVINVYYNYTGQLTDHCFTSNCTILSPFQNNDE---VIAWNWQC 379
>gi|47206852|emb|CAF90612.1| unnamed protein product [Tetraodon nigroviridis]
Length = 463
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 121/326 (37%), Positives = 194/326 (59%), Gaps = 24/326 (7%)
Query: 68 YETRYFEQRLDHFSFADLP--TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ YF Q LDHF+F TF QRYL+ +W + GP+F Y GNEGDI FA+NS
Sbjct: 41 FTESYFTQTLDHFNFNSYGNGTFRQRYLVADRYWRRGH--GPLFFYTGNEGDIWDFALNS 98
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ ++A + GA++VF EHRYYG+S+P+G + ++Q + LT EQALAD+A+ I+ L
Sbjct: 99 GFITELAAQQGALVVFAEHRYYGKSLPFG--DASFQ-VPEVGLLTVEQALADYALLISQL 155
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
++ L+A PV++FGGSYGGML+ +MRL+YP++ GALA+SAP+L + P F+ V
Sbjct: 156 REQLAATRCPVIVFGGSYGGMLSVYMRLRYPNLVAGALAASAPVLSTAGLGEPTQFFRDV 215
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WLES 302
++DF+ C ++ ++ +L + + ++ + K LC+ +S +D++ L +
Sbjct: 216 TADFQSVEPQCTGAVRGAFQQLRELAEDQD-YGAIQKKLSLCQRPSSPQDVSQLYGLLRN 274
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVD 362
A++ +AM+DYPY + FM LP P++ C+ + A+ +L + + + YN +G +
Sbjct: 275 AFTLMAMLDYPYSTHFMGSLPANPVKVACQTMLR---ASELLTNLRDAAGLVYNASGQLG 331
Query: 363 CFQL------DDDPHGL----DGWNW 378
C L DP G + W W
Sbjct: 332 CLDLYRLYVQCADPTGCGLGPNSWAW 357
>gi|391339066|ref|XP_003743874.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Metaseiulus
occidentalis]
Length = 476
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 129/323 (39%), Positives = 186/323 (57%), Gaps = 9/323 (2%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
++++ ET +F +DHF + + TF R L N +++ + GPIFLY GNEGDI F
Sbjct: 33 RRRFVTETTWFNVPIDHFGYYNNNTFPLRVLYNNEYF-NHTKPGPIFLYAGNEGDIALFV 91
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
N+G +WD A FGA+LVF EHRYYG+SMPYG + ++ + YLT +QALADFA I
Sbjct: 92 YNTGLLWDWAEEFGALLVFAEHRYYGKSMPYGRDSL--KDVSYYGYLTVDQALADFAHVI 149
Query: 183 TNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
+ +K+ + S VV FGGSY GMLAAW+R+KYP + AL+S API ++ +V F
Sbjct: 150 SEIKETWPGVQKSKVVAFGGSYAGMLAAWLRMKYPWLVEAALSSGAPIRLYQGLVGCNAF 209
Query: 242 YNIVSSDFKRESA-SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRE--LNSTEDLAD 298
+ V+ F E C N I++SW L + E G + K FH+C++ + + D
Sbjct: 210 NDGVARAFLAEGGKKCVNNIRKSWKALERFKESEEGTNFIFKKFHVCQKNIASGFAQVRD 269
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEGVSVYYNY 357
W+ +Y LAM +YPY + L YPIR C + + +L I++ V+VY+N+
Sbjct: 270 WIYGSYVNLAMHNYPYGLE-TRRLSTYPIRLACAFLQKDFQKDEDLLSAIYDAVNVYHNH 328
Query: 358 TGNVDCFQLDDDPHGLDGWNWQV 380
+G V C +DD G WQV
Sbjct: 329 SGVVHCNNVDDVYGEHIGSAWQV 351
>gi|302854868|ref|XP_002958938.1| hypothetical protein VOLCADRAFT_70104 [Volvox carteri f.
nagariensis]
gi|300255730|gb|EFJ40018.1| hypothetical protein VOLCADRAFT_70104 [Volvox carteri f.
nagariensis]
Length = 557
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 131/317 (41%), Positives = 179/317 (56%), Gaps = 29/317 (9%)
Query: 87 TFSQRYLINTDHWV--GPN-RLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPE 143
TF QR+ + +HW GP+ GPIF Y GNE D+ + N+G +W+ AP F AMLVF E
Sbjct: 1 TFLQRFFVCANHWRRRGPDGSSGPIFFYLGNEADVTLYLNNTGLMWEGAPDFEAMLVFAE 60
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSY 203
HRYYGES+PYG + + YL AEQA+AD+A I +K+ AE S V+ FGGSY
Sbjct: 61 HRYYGESVPYGKNVRKH-----MGYLMAEQAMADYAELIMEIKEEYDAEGSAVIGFGGSY 115
Query: 204 GGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP---ETFYNIVSSDFKRESAS---CF 257
GGMLAAWMRLKYPH GA+A+SAPI F PP +F V+ D + S C
Sbjct: 116 GGMLAAWMRLKYPHALDGAIAASAPIWNFLGETPPFDSGSFAKGVTYDASELAGSAPACI 175
Query: 258 NTIKESWGELVSVGQKENGLLELTKTFHLCRELN-STED----LADWLESAYSYLAMVDY 312
+ ++ +WG L G E+G L LC ED L +WL +A+ +AM ++
Sbjct: 176 DNVRATWGLLRIYGGDEHGRAFLADALQLCSGAGPQNEDDVILLREWLAAAWDMMAMGNF 235
Query: 313 PYPSDFMM----PLPGYPIREVCKKID-NAPDATSILERIFEGVSVYYNYTGNVDCFQ-- 365
PYPS ++ LP +P+R C +D ++ + +LE + V V+YN+TG+V CF
Sbjct: 236 PYPSSYITNGHGRLPAFPVRVACSYLDSDSFNDEDLLEAMSLAVGVFYNHTGDVSCFDPL 295
Query: 366 ---LDDDPHGLDGWNWQ 379
D H D W++Q
Sbjct: 296 SGTDPDSDHDADFWDFQ 312
>gi|14010871|ref|NP_114179.1| dipeptidyl peptidase 2 precursor [Rattus norvegicus]
gi|13626317|sp|Q9EPB1.1|DPP2_RAT RecName: Full=Dipeptidyl peptidase 2; AltName: Full=Dipeptidyl
aminopeptidase II; AltName: Full=Dipeptidyl peptidase 7;
AltName: Full=Dipeptidyl peptidase II; Short=DPP II;
AltName: Full=Quiescent cell proline dipeptidase; Flags:
Precursor
gi|9796394|dbj|BAB11691.1| dipeptidyl peptidase II [Rattus norvegicus]
gi|10119950|dbj|BAB13500.1| dipeptidyl peptidase II [Rattus norvegicus]
gi|51260653|gb|AAH78783.1| Dipeptidylpeptidase 7 [Rattus norvegicus]
gi|149039379|gb|EDL93599.1| dipeptidylpeptidase 7, isoform CRA_a [Rattus norvegicus]
Length = 500
Score = 224 bits (572), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 119/306 (38%), Positives = 185/306 (60%), Gaps = 18/306 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ YFEQ +DHF+F TF QR+L++ W GPIF Y GNEGDI A NS
Sbjct: 41 FRENYFEQYMDHFNFESFSNKTFGQRFLVSDKFWKMGE--GPIFFYTGNEGDIWSLANNS 98
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVFIT 183
GF+ ++A + A+LVF EHRYYG+S+P+G ST+ Y LT EQALADFAV +
Sbjct: 99 GFIVELAAQQEALLVFAEHRYYGKSLPFGVQSTQRGYTQ-----LLTVEQALADFAVLLQ 153
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
L+ NL + +P + FGGSYGGML+A+MR+KYPH+ GALA+SAP++ + P+ F+
Sbjct: 154 ALRHNLGVQDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVIAVAGLGNPDQFFR 213
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WL 300
V++DF +S C +++++ ++ + + +++ F C+ L+S +DL +
Sbjct: 214 DVTADFYGQSPKCAQAVRDAFQQIKDL-FLQGAYDTISQNFGTCQSLSSPKDLTQLFGFA 272
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGN 360
+A++ LAM+DYPYP++F+ PLP P++ C+++ + L + + YN +G
Sbjct: 273 RNAFTVLAMMDYPYPTNFLGPLPANPVKVGCERLLSEGQRIMGLRAL---AGLVYNSSGM 329
Query: 361 VDCFQL 366
CF +
Sbjct: 330 EPCFDI 335
>gi|62089160|dbj|BAD93024.1| Dipeptidyl-peptidase II precursor variant [Homo sapiens]
Length = 377
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 116/273 (42%), Positives = 176/273 (64%), Gaps = 13/273 (4%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ R+F+QRLDHF+F TF QR+L++ WV GPIF Y GNEGD+ FA NS
Sbjct: 30 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWV--RGEGPIFFYTGNEGDVWAFANNS 87
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GFV ++A GA+LVF EHRYYG+S+P+G+ Q T LT EQALADFA + L
Sbjct: 88 GFVAELAAERGALLVFAEHRYYGKSLPFGAQST--QRGHT-ELLTVEQALADFAELLRAL 144
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A+ +P + FGGSYGGML+A++R+KYPH+ GALA+SAP+L + F+ V
Sbjct: 145 RRDLGAQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 204
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLE 301
++DF+ +S C ++E++ ++ + + G + + F C+ L+ +DL +
Sbjct: 205 TADFEGQSPKCTQGVREAFRQIKDLFLQ--GAYDTVRWEFGTCQPLSDEKDLTQLFMFAR 262
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI 334
+A++ LAM+DYPYP+DF+ PLP P++ C ++
Sbjct: 263 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCDRL 295
>gi|149039380|gb|EDL93600.1| dipeptidylpeptidase 7, isoform CRA_b [Rattus norvegicus]
Length = 329
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 113/274 (41%), Positives = 174/274 (63%), Gaps = 15/274 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ YFEQ +DHF+F TF QR+L++ W GPIF Y GNEGDI A NS
Sbjct: 41 FRENYFEQYMDHFNFESFSNKTFGQRFLVSDKFWKMGE--GPIFFYTGNEGDIWSLANNS 98
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVFIT 183
GF+ ++A + A+LVF EHRYYG+S+P+G ST+ Y LT EQALADFAV +
Sbjct: 99 GFIVELAAQQEALLVFAEHRYYGKSLPFGVQSTQRGYTQ-----LLTVEQALADFAVLLQ 153
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
L+ NL + +P + FGGSYGGML+A+MR+KYPH+ GALA+SAP++ + P+ F+
Sbjct: 154 ALRHNLGVQDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVIAVAGLGNPDQFFR 213
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WL 300
V++DF +S C +++++ ++ + + +++ F C+ L+S +DL +
Sbjct: 214 DVTADFYGQSPKCAQAVRDAFQQIKDL-FLQGAYDTISQNFGTCQSLSSPKDLTQLFGFA 272
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI 334
+A++ LAM+DYPYP++F+ PLP P++ C+++
Sbjct: 273 RNAFTVLAMMDYPYPTNFLGPLPANPVKVGCERL 306
>gi|341900594|gb|EGT56529.1| CBN-PCP-5 protein [Caenorhabditis brenneri]
Length = 507
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 118/304 (38%), Positives = 181/304 (59%), Gaps = 13/304 (4%)
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
+Y+YE Y + +DHFSF + F+ RY +NTD++ GPI Y GNEG +E FA N
Sbjct: 38 KYKYEVGYLKVPIDHFSFTNDMEFNLRYFLNTDNY---ESGGPILFYTGNEGSLEAFAEN 94
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+GF+WD+AP A +VF EHR+YG+S P+G+ +Y + L YL+++QALADFA+ +
Sbjct: 95 TGFMWDLAPELKAAVVFVEHRFYGKSQPFGNQ--SYTDIRRLGYLSSQQALADFALSVQF 152
Query: 185 LKQNL--SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
K A+ S V+ FGGSYGGML+AW R+KYPHI GA+A+SAP+ F D PE Y
Sbjct: 153 FKNEKIKGAQKSAVIAFGGSYGGMLSAWFRIKYPHIVDGAIAASAPVFWFTDSNIPEDVY 212
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHL--CRELNSTED---LA 297
+ + + ++ I ++W L + + ++G L + L +L + +D L
Sbjct: 213 DFIVTRAFLDAGCNRKAIDKAWLALDELSKSDSGRRYLNILYKLDPKSKLENKDDIGFLK 272
Query: 298 DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN-APDATSILERIFEGVSVYYN 356
++ + +AMV+YPYP+ F+ LP +P++E CK E+++ V++YYN
Sbjct: 273 QYIRESMEAMAMVNYPYPTSFLSSLPSWPVKEACKFASQPGKSQEESAEQLYNIVNLYYN 332
Query: 357 YTGN 360
YTG+
Sbjct: 333 YTGD 336
>gi|403301430|ref|XP_003941393.1| PREDICTED: dipeptidyl peptidase 2 [Saimiri boliviensis boliviensis]
Length = 513
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 184/303 (60%), Gaps = 13/303 (4%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ RYF+QRLDHF+F L TF QR L++ W+ GPIF Y GNEGD+ FA NS
Sbjct: 53 FQERYFQQRLDHFNFERLGNKTFPQRVLVSDKFWIRGE--GPIFFYTGNEGDVWNFANNS 110
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ ++A GA+LVF EHRYYG+S+P+G E + Q T LT EQALADFA + L
Sbjct: 111 GFIAELAAEQGALLVFAEHRYYGKSLPFG--ERSTQRGHT-GLLTVEQALADFAELLRAL 167
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A+ P + FGGSYGGML+A++R+KYPH+ GALA+SAPIL + F+ V
Sbjct: 168 RRDLGAQDVPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPILAVAGLGDSTQFFRDV 227
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL--NSTEDLADWLESA 303
++DF+ + C ++E++ ++ + + +++ F C+ L L + +A
Sbjct: 228 TADFEDQGPKCTQAVREAFRQIKDL-FLQGAYDKVSWEFGTCQPLLEKDLTQLFVFARNA 286
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC 363
++ LAM+DYPYP+DF+ PLP P++ C ++ + L + + YN +G+ C
Sbjct: 287 FTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRIVGLRAL---TGLVYNASGSEHC 343
Query: 364 FQL 366
+ +
Sbjct: 344 YDI 346
>gi|307108156|gb|EFN56397.1| hypothetical protein CHLNCDRAFT_144962 [Chlorella variabilis]
Length = 496
Score = 223 bits (567), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 122/328 (37%), Positives = 186/328 (56%), Gaps = 30/328 (9%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
R + R+ E RLDHF++ + F QR+ + ++W G +FLY GNE D+ + N+G
Sbjct: 16 RCKERWRETRLDHFTWVNPTYFKQRFFVCDEYW---RPGGSVFLYIGNEADVTLYLNNTG 72
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
+W++AP++ AMLVF EHRYYG+S P+ ++ + +++LT+EQA+ D+A + LK
Sbjct: 73 LMWELAPKYDAMLVFAEHRYYGQSKPFPASVLRKH----MAWLTSEQAMGDYATLLWELK 128
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALA-SSAPILQFEDIVP---PETFY 242
+ L PV+ FGGSYGGML W R+KYPH+ G +A S+API ++ P P +F
Sbjct: 129 RELGDPDVPVIGFGGSYGGMLGTWFRMKYPHLVDGVIAGSAAPIWTYKGENPPYDPGSFA 188
Query: 243 NIVSSDFKRESAS---CFNTIKESWGELVSVGQKENGLLELTKTFHLCRE--LNSTED-- 295
IV+ D E S C + ++ +W L G E G ++ LC E + S ED
Sbjct: 189 KIVTQDASPEGGSAEACADNVRAAWKLLDRWGSSEEGRQSISAAMRLCPESAVESGEDAT 248
Query: 296 -LADWLESAYSYL-----AMVDYPYPSDFMM-----PLPGYPIREVCKKI-DNAPDATSI 343
L DW SA+ YL AM +YPYPS +++ PLP +P+R C + + D ++
Sbjct: 249 ALCDWASSAWDYLASAARAMGNYPYPSVYIVNGAQPPLPAFPVRVACGHLAEPGLDGEAL 308
Query: 344 LERIFEGVSVYYNYTGNVDCFQLDDDPH 371
LE + V+YN+TG++ CF P+
Sbjct: 309 LEGLARAAGVFYNHTGDLPCFSFKQGPN 336
>gi|255584372|ref|XP_002532920.1| catalytic, putative [Ricinus communis]
gi|223527313|gb|EEF29462.1| catalytic, putative [Ricinus communis]
Length = 245
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 105/201 (52%), Positives = 139/201 (69%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRY 146
TF Q+Y+I+ HW G PIF Y G E + GF++D A +FGA+ VF EHR+
Sbjct: 44 TFQQKYVISFKHWTGAQASAPIFAYLGEESPLNADIHGIGFLFDNAAKFGALTVFIEHRF 103
Query: 147 YGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGM 206
YG+S+P+ S + A NAT Y + QALAD+A + N+K LSAE SP+++ GGSYGGM
Sbjct: 104 YGDSIPFVSRQEALANATLRGYFNSAQALADYAEILLNIKLILSAETSPIIVIGGSYGGM 163
Query: 207 LAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGE 266
LA+W RLKYPHIA+GALASSAPIL F++I P + +Y++V+ D++ S SC NTIKESW E
Sbjct: 164 LASWFRLKYPHIALGALASSAPILYFDNITPSDAYYSLVTKDYRDASESCSNTIKESWLE 223
Query: 267 LVSVGQKENGLLELTKTFHLC 287
L V +ENGL L++ FH C
Sbjct: 224 LARVASQENGLSILSEKFHTC 244
>gi|296191232|ref|XP_002743536.1| PREDICTED: dipeptidyl peptidase 2 [Callithrix jacchus]
Length = 492
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 118/306 (38%), Positives = 185/306 (60%), Gaps = 18/306 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ RYF+Q LDHF+F TF QR+L++ W+ GPIF Y GNEGD+ FA NS
Sbjct: 31 FQERYFQQLLDHFNFESFGNKTFPQRFLVSDRFWI--RGKGPIFFYTGNEGDVWVFANNS 88
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVFIT 183
GF+ ++A GA+LVF EHRYYG+S+P+G ST+ Y LT EQALADFA +
Sbjct: 89 GFIAELAAEQGALLVFAEHRYYGKSLPFGERSTQRGYTG-----LLTVEQALADFAELLR 143
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
L+++L A+ P + FGGSYGGML+A++R+KYPH+ GALA+SAP+L + F+
Sbjct: 144 ALRRDLGAQDVPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSTQFFR 203
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WL 300
V++DF+ + C ++E++ ++ + + +++ F C+ L +DL +
Sbjct: 204 DVTADFEGQGPKCTQAVREAFWQIRDL-FLQGAYDKVSWEFGTCQPLLDEKDLTQLFMFA 262
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGN 360
+A++ LAM+DYPYP+DF+ PLP P++ C + + + L + + YN +G+
Sbjct: 263 RNAFTVLAMMDYPYPTDFLGPLPANPVKVGCDLLLSEAQRIAGLRAL---AGLVYNASGS 319
Query: 361 VDCFQL 366
C+ +
Sbjct: 320 EHCYDI 325
>gi|440795690|gb|ELR16807.1| lysosomal ProX carboxypeptidase [Acanthamoeba castellanii str.
Neff]
Length = 489
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 121/314 (38%), Positives = 179/314 (57%), Gaps = 15/314 (4%)
Query: 68 YETRYFEQRLDHFSFADLP-TFSQRYLINTDHWVG--PNRL-GPIFLYCGNEGDIEWFAV 123
Y T YF+Q LDHF+FA P T+ QR+L+ D+W G P GPIF Y GNE + +
Sbjct: 39 YRTLYFDQTLDHFNFATKPATYKQRFLMADDYWRGSYPGGCPGPIFFYTGNESPVTDYYA 98
Query: 124 NSGFVWDI-APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
+GF + AP+ A+LVF EHRY+GESMP+GS + +SYL+ EQALAD+AV I
Sbjct: 99 GAGFFTQVLAPKHNALLVFAEHRYFGESMPFGSKSF---DPEKISYLSPEQALADYAVLI 155
Query: 183 TNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
T+LK+ L A+ PV FGGSYGG+L AW R KYP I +G L++SAP+ + + P F
Sbjct: 156 THLKETLPHAKNCPVFAFGGSYGGILTAWFRSKYPDIVMGGLSASAPLAFYGTGISPYAF 215
Query: 242 YNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LAD 298
+ S F + C + ++ L + +G + F LC LNS D + +
Sbjct: 216 TDSASDTFAQARPGCAPLVSRAFDVLQKLSATSDGRARFSAAFKLCSPLNSQADAEAVIN 275
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI---DNAPDATSILERIFEGVSVYY 355
W++S +AM+DYP+ +++ + LPG+P+ C ++ + D + E + V+Y
Sbjct: 276 WVDSGLIGMAMLDYPFATNYGISLPGWPVNRTCDRLLEKATSNDDDVLAEAFAYAIGVFY 335
Query: 356 NYTGNVDCFQLDDD 369
N TG C+ ++ D
Sbjct: 336 NNTGAHTCYDINRD 349
>gi|417411236|gb|JAA52063.1| Putative dipeptidyl peptidase 2, partial [Desmodus rotundus]
Length = 502
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 122/310 (39%), Positives = 186/310 (60%), Gaps = 26/310 (8%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ YFEQ LDHF+F TF QR+L++ W GP+F Y GNEGD+ FA NS
Sbjct: 28 FRESYFEQFLDHFNFERFGNQTFPQRFLVSEKFW--KRGEGPLFFYTGNEGDVWAFANNS 85
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVFIT 183
GF+ ++A + GA++VF EHRYYG+S+P+G ST+ + + LT EQALADFA +
Sbjct: 86 GFILELAAQQGALVVFAEHRYYGKSLPFGERSTQRGH-----VELLTVEQALADFARLLQ 140
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
L+++L A+ P V FGGSYGGML+A+MR+KYPH+ GALA+SAP++ + F+
Sbjct: 141 ALRRDLGAQDVPAVAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVLVAGLGDAYQFFR 200
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLE----LTKTFHLCRELNSTEDLAD- 298
VS DF+ + C +++++ ++ K+ LL +++ F LCR L+ +DL
Sbjct: 201 DVSLDFEGQGPKCAQGVRDAFRQI-----KDLFLLGAYDVVSQAFGLCRPLSGWKDLVQL 255
Query: 299 --WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYN 356
+ +A++ LAM+DYPYP+DF+ LP P++ C ++ N D L + + YN
Sbjct: 256 FGFARNAFTVLAMMDYPYPTDFLGHLPANPVQVACDRLLNESDRIEGLRAL---AGLVYN 312
Query: 357 YTGNVDCFQL 366
+G C+ +
Sbjct: 313 SSGEQSCYDV 322
>gi|226472758|emb|CAX71065.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 472
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 127/301 (42%), Positives = 176/301 (58%), Gaps = 15/301 (4%)
Query: 69 ETRYFEQRLDHFSF-ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
+ YF+Q LDHFSF A TF QRYL D W PN GPIF YCGNEG+I F N+G
Sbjct: 29 KENYFDQTLDHFSFQARNLTFKQRYLYE-DKWFKPN--GPIFFYCGNEGEIGGFWNNTGL 85
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
V+++AP F A ++F EHRYYG+S+P+ + ++Q + YL+ QALAD+A I +K
Sbjct: 86 VFELAPSFNAFILFAEHRYYGKSLPF---DKSFQQPY-IQYLSIGQALADYAYLIEGIKS 141
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ SPVV FGGSYGGMLAA+MR KYPHI GALA+SAP+ F+ V+
Sbjct: 142 KFNMTRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAVTK 201
Query: 248 DFKRESASCFNTIKESWGELVSVGQKEN-GLLELTKTFHLCRELNSTED---LADWLESA 303
D+ C IK ++ V + QK + G +L++ LC+ + + + + W +A
Sbjct: 202 DYHDADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWILKWARNA 261
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC 363
+ +AM+DYPY + FM LP P+ CK +A D S L E V V+YN + ++ C
Sbjct: 262 FVMMAMLDYPYKASFMASLPPNPVNVSCKNALSAIDPISTLR---EAVGVFYNSSQSLMC 318
Query: 364 F 364
F
Sbjct: 319 F 319
>gi|308482020|ref|XP_003103214.1| CRE-PCP-5 protein [Caenorhabditis remanei]
gi|308260319|gb|EFP04272.1| CRE-PCP-5 protein [Caenorhabditis remanei]
Length = 507
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 122/321 (38%), Positives = 187/321 (58%), Gaps = 16/321 (4%)
Query: 49 PHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPI 108
P + PQ+ ++ + Y+YE Y + +DHFSF + F RY +NTD++ GPI
Sbjct: 23 PVTQKGPQKFEKADGK-YKYEEGYLKAPIDHFSFTNDYEFDLRYFLNTDNY---ESGGPI 78
Query: 109 FLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSY 168
Y GNEG +E FA N+GF+WD+AP A +VF EHR+YG+S P+ + +Y + L Y
Sbjct: 79 LFYTGNEGSLEAFAENTGFMWDLAPELKAAVVFVEHRFYGKSQPFKNQ--SYTDIRNLGY 136
Query: 169 LTAEQALADFAV---FITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALAS 225
L+++QALADFA+ F N K A+ S V+ FGGSYGGML+AW R+KYPHI GA+A+
Sbjct: 137 LSSQQALADFALSVQFFRNEKIK-GAKNSAVIAFGGSYGGMLSAWFRIKYPHIVDGAIAA 195
Query: 226 SAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFH 285
SAP+ F D PE Y+ + + +S +++ W L + + ++G L +
Sbjct: 196 SAPVFWFTDSNIPEDVYDFIVTRAFLDSGCNRKAVEKGWIALDELAKTDSGRQYLNVLYK 255
Query: 286 L--CRELNSTED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN-APD 339
L +L + +D L ++ + +AMV+YPYP+ F+ LP +P++E CK
Sbjct: 256 LDPKSKLENKDDVSFLKQYIRESMEAMAMVNYPYPTSFLSSLPSWPVKEACKFASQPGKS 315
Query: 340 ATSILERIFEGVSVYYNYTGN 360
E+++ V++YYNYTG+
Sbjct: 316 QEESAEQLYNIVNLYYNYTGD 336
>gi|256084265|ref|XP_002578351.1| family S28 unassigned peptidase (S28 family) [Schistosoma mansoni]
gi|360045230|emb|CCD82778.1| family S28 unassigned peptidase (S28 family) [Schistosoma mansoni]
Length = 472
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 124/301 (41%), Positives = 177/301 (58%), Gaps = 15/301 (4%)
Query: 69 ETRYFEQRLDHFSF-ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
+ YF+Q LDHF+F A TF QRYL D W PN GPIF YCGNEG I+ F N+G
Sbjct: 29 KENYFDQTLDHFNFQARNLTFKQRYLYE-DKWFKPN--GPIFFYCGNEGGIDGFWNNTGL 85
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
++++AP F A ++F EHRYYG+S+P+ ++ +Q + YL+ +QALAD+A I +K
Sbjct: 86 IFELAPSFNAFVLFAEHRYYGKSLPFNTS---FQQPY-IQYLSIDQALADYAYLIEGIKS 141
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ S VV FGGSYGGMLAA+MR KYPHI GALASSAP+ F+ V+
Sbjct: 142 TFNMTRSLVVAFGGSYGGMLAAYMRAKYPHIIKGALASSAPVRWVAGEGNFHDFFESVTK 201
Query: 248 DFKRESASCFNTIKESWGELVSVGQKEN-GLLELTKTFHLCRELNSTED---LADWLESA 303
D++ C IK ++ V + QK + G +L+ LC+ + + + + W +A
Sbjct: 202 DYRDADPKCSEKIKNAFNLAVQLSQKPDIGYKQLSNDLRLCKPIQNDFEFYWVLKWARNA 261
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC 363
+ +AM+DYPY + FM LP P+ CK NA T ++ + E V V+YN + ++ C
Sbjct: 262 FVMMAMLDYPYKASFMASLPANPVNVSCK---NALAVTDLIPTLREAVGVFYNSSQSLPC 318
Query: 364 F 364
F
Sbjct: 319 F 319
>gi|118099097|ref|XP_415570.2| PREDICTED: dipeptidyl peptidase 2 [Gallus gallus]
Length = 495
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 124/329 (37%), Positives = 187/329 (56%), Gaps = 26/329 (7%)
Query: 69 ETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
E + F Q DHF F TF QRYL++ W GPIF Y GNEG+I FA NS
Sbjct: 33 EEQLFPQVRDHFRFEAGGNETFPQRYLLSAKFW--KKGFGPIFFYTGNEGNIWTFAENSD 90
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F++++A + A+++F EHRYYG+S+P+G +N LT EQALAD+AV IT LK
Sbjct: 91 FIFELAEQQQALVIFAEHRYYGKSLPFGLESTQLKNT---HLLTVEQALADYAVLITELK 147
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
Q A PV+ FGGSYGGML+A+MR+KYP++ GALA+SAP+L + P F+ V+
Sbjct: 148 QQYGAAGCPVIAFGGSYGGMLSAYMRMKYPNVVDGALAASAPVLSVAGLGDPTQFFRDVT 207
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WLESA 303
+DF++ C ++ ++ ++ + E++ C +++S EDL + +A
Sbjct: 208 ADFQKSIPGCVPAVQRAFQQIRDL-FLSGAYDEISSKMATCSKISSKEDLYQLFGFARNA 266
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC 363
++ +AM+DYPY +DFM LP P++ C++I D L + V V+YN +G+ C
Sbjct: 267 FTMIAMMDYPYKTDFMGHLPANPVKVGCEQILAHTDPIQGLAAL---VGVFYNSSGSAQC 323
Query: 364 FQL------DDDPHGL------DGWNWQV 380
+ + DP G + W++QV
Sbjct: 324 YDVYRLYRPCADPTGCGTGADAEAWDYQV 352
>gi|56756895|gb|AAW26619.1| SJCHGC02147 protein [Schistosoma japonicum]
Length = 472
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 176/301 (58%), Gaps = 15/301 (4%)
Query: 69 ETRYFEQRLDHFSF-ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
+ YF+Q LDHFSF A TF QRYL D W PN GPIF YCGNEG+I F N+G
Sbjct: 29 KENYFDQTLDHFSFQARNLTFKQRYLYE-DKWFKPN--GPIFFYCGNEGEIGGFWNNTGL 85
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
V+++AP F A ++F EHRYYG+S+P+ + ++Q + YL+ QALAD+A I +K
Sbjct: 86 VFELAPSFNAFILFAEHRYYGKSLPF---DKSFQQPY-IQYLSIGQALADYAYLIEGIKS 141
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ SPVV FGGSYGGMLAA+MR KYPHI GALA+SAP+ F+ V+
Sbjct: 142 KFNMTRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAVTK 201
Query: 248 DFKRESASCFNTIKESWGELVSVGQKEN-GLLELTKTFHLCRELNSTED---LADWLESA 303
D+ C IK ++ V + QK + G +L++ LC+ + + + + W +A
Sbjct: 202 DYHDADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIQNDFEFYWMLKWARNA 261
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC 363
+ +AM+DYPY + FM LP P+ CK NA A + + E V V+YN + ++ C
Sbjct: 262 FVMMAMLDYPYKASFMASLPPNPVNVSCK---NALSAIDPIPTLREAVGVFYNSSQSLMC 318
Query: 364 F 364
F
Sbjct: 319 F 319
>gi|226472746|emb|CAX71059.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
gi|226472748|emb|CAX71060.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
gi|226472760|emb|CAX71066.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 480
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 176/301 (58%), Gaps = 15/301 (4%)
Query: 69 ETRYFEQRLDHFSF-ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
+ YF+Q LDHFSF A TF QRYL D W PN GPIF YCGNEG+I F N+G
Sbjct: 37 KENYFDQTLDHFSFQARNLTFKQRYLYE-DKWFKPN--GPIFFYCGNEGEIGGFWNNTGL 93
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
V+++AP F A ++F EHRYYG+S+P+ + ++Q + YL+ QALAD+A I +K
Sbjct: 94 VFELAPSFNAFILFAEHRYYGKSLPF---DKSFQQPY-IQYLSIGQALADYAYLIEGIKS 149
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ SPVV FGGSYGGMLAA+MR KYPHI GALA+SAP+ F+ V+
Sbjct: 150 KFNMTRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAVTK 209
Query: 248 DFKRESASCFNTIKESWGELVSVGQKEN-GLLELTKTFHLCRELNSTED---LADWLESA 303
D+ C IK ++ V + QK + G +L++ LC+ + + + + W +A
Sbjct: 210 DYHDADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWMLKWARNA 269
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC 363
+ +AM+DYPY + FM LP P+ CK NA A + + E V V+YN + ++ C
Sbjct: 270 FVMMAMLDYPYKASFMASLPPNPVNVSCK---NALSAIDPIPTLREAVGVFYNSSQSLMC 326
Query: 364 F 364
F
Sbjct: 327 F 327
>gi|226472750|emb|CAX71061.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 480
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 176/301 (58%), Gaps = 15/301 (4%)
Query: 69 ETRYFEQRLDHFSF-ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
+ YF+Q LDHFSF A TF QRYL D W PN GPIF YCGNEG+I F N+G
Sbjct: 37 KENYFDQTLDHFSFQARNLTFKQRYLYE-DKWFKPN--GPIFFYCGNEGEIGGFWNNTGL 93
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
V+++AP F A ++F EHRYYG+S+P+ + ++Q + YL+ QALAD+A I +K
Sbjct: 94 VFELAPSFNAFILFAEHRYYGKSLPF---DKSFQQPY-IQYLSIGQALADYAYLIEGIKS 149
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ SPVV FGGSYGGMLAA+MR KYPHI GALA+SAP+ F+ V+
Sbjct: 150 KFNMTRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAVTK 209
Query: 248 DFKRESASCFNTIKESWGELVSVGQKEN-GLLELTKTFHLCRELNSTED---LADWLESA 303
D+ C IK ++ V + QK + G +L++ LC+ + + + + W +A
Sbjct: 210 DYHDADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWMLKWARNA 269
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC 363
+ +AM+DYPY + FM LP P+ CK NA A + + E V V+YN + ++ C
Sbjct: 270 FVMMAMLDYPYKASFMASLPPNPVNVSCK---NALSAIDPIPTLREAVGVFYNSSQSLMC 326
Query: 364 F 364
F
Sbjct: 327 F 327
>gi|405944913|pdb|4EBB|A Chain A, Structure Of Dpp2
gi|405944914|pdb|4EBB|B Chain B, Structure Of Dpp2
Length = 472
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 118/305 (38%), Positives = 184/305 (60%), Gaps = 16/305 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ R+F+QRLDHF+F TF QR+L++ WV GPIF Y GNEGD+ FA NS
Sbjct: 5 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWVRGE--GPIFFYTGNEGDVWAFANNS 62
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
FV ++A GA+LVF EHRYYG+S+P+G+ Q T LT EQALADFA + L
Sbjct: 63 AFVAELAAERGALLVFAEHRYYGKSLPFGAQST--QRGHT-ELLTVEQALADFAELLRAL 119
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A+ +P + FGGSYGG L+A++R KYPH+ GALA+SAP+L + F+ V
Sbjct: 120 RRDLGAQDAPAIAFGGSYGGXLSAYLRXKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 179
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLE 301
++DF+ +S C ++E++ ++ + + G + + F C+ L+ +DL +
Sbjct: 180 TADFEGQSPKCTQGVREAFRQIKDLFLQ--GAYDTVRWEFGTCQPLSDEKDLTQLFXFAR 237
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
+A++ LA DYPYP+DF+ PLP P++ C ++ + + L + + YN +G+
Sbjct: 238 NAFTVLAXXDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRAL---AGLVYNASGSE 294
Query: 362 DCFQL 366
C+ +
Sbjct: 295 HCYDI 299
>gi|403352351|gb|EJY75686.1| Lysosomal Pro-X carboxypeptidase [Oxytricha trifallax]
Length = 503
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 132/336 (39%), Positives = 192/336 (57%), Gaps = 35/336 (10%)
Query: 73 FEQRLDHF-SFADLPTFSQRYLINTDHWVGPNRL-GPIFLYCGNEGDIEWFAVNSGFVWD 130
FE LDHF S + PTF+ +YL + +W N + GPIF Y GNEG +E F NSGF+ D
Sbjct: 34 FEVPLDHFASGGNSPTFNIKYLADAQYW---NPMEGPIFFYAGNEGKVEGFWDNSGFLTD 90
Query: 131 I-APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
+ AP+ A+++F EHRY+G+S P+ +VA +LT EQA+ D+ + I ++
Sbjct: 91 VLAPQHQALIIFGEHRYFGDSFPF-DKKVALDKDHN-KWLTVEQAMMDYVLLIKEIRYIY 148
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI-VPPETFYNIVSSD 248
A PVV+FGGSYGGMLA+W+R+KYP GA ASSAPIL F+D VP F +I++ D
Sbjct: 149 GASDKPVVVFGGSYGGMLASWLRMKYPATFQGAYASSAPILYFKDSGVPQSAFGDIITQD 208
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLE---LTKTFHLCRELNSTEDLAD---WLES 302
F + +C + IKE+WG L+ + KEN + L F+ C ++ D+ + +L +
Sbjct: 209 FYAANQNCPSIIKEAWGYLMDI--KENRPTDYPALKTIFNTCTDITVKADVDNLYTYLMN 266
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP----------------DATSILER 346
+SY+AM DYPY + F+ P+P P+ C K+ + P T +L+
Sbjct: 267 GFSYMAMTDYPYETSFLNPMPANPVNAACTKLKDIPYPAPKSEKSTVGALSARETLVLQG 326
Query: 347 IFEGVSVYYNYTGNVDCFQL-DDDPHG-LDGWNWQV 380
+ + SVY+NY G C + + D G LDG W V
Sbjct: 327 VQDASSVYFNYKGQTPCNDISNSDATGQLDGAGWDV 362
>gi|226472754|emb|CAX71063.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 472
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 175/301 (58%), Gaps = 15/301 (4%)
Query: 69 ETRYFEQRLDHFSF-ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
+ YF Q LDHFSF A TF QRYL D W PN GPIF YCGNEG+I F N+G
Sbjct: 29 KENYFNQTLDHFSFQARNLTFKQRYLYE-DKWFKPN--GPIFFYCGNEGEIGGFWNNTGL 85
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
V+++AP F A ++F EHRYYG+S+P+ + ++Q + YL+ QALAD+A I +K
Sbjct: 86 VFELAPSFNAFILFAEHRYYGKSLPF---DKSFQQPY-IQYLSIGQALADYAYLIEGIKN 141
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ SPVV FGGSYGGMLAA+MR KYPHI GALA+SAP+ F+ V+
Sbjct: 142 KFNMTRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAVTK 201
Query: 248 DFKRESASCFNTIKESWGELVSVGQKEN-GLLELTKTFHLCRELNSTED---LADWLESA 303
D+ C IK ++ V + QK + G +L++ LC+ + + + + W +A
Sbjct: 202 DYHDADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWMLKWARNA 261
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC 363
+ +AM+DYPY + FM LP P+ CK NA A + + E V V+YN + ++ C
Sbjct: 262 FVMMAMLDYPYKASFMASLPPNPVNVSCK---NALSAIDPIPTLREAVGVFYNSSQSLMC 318
Query: 364 F 364
F
Sbjct: 319 F 319
>gi|302832479|ref|XP_002947804.1| hypothetical protein VOLCADRAFT_41600 [Volvox carteri f.
nagariensis]
gi|300267152|gb|EFJ51337.1| hypothetical protein VOLCADRAFT_41600 [Volvox carteri f.
nagariensis]
Length = 451
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 118/329 (35%), Positives = 184/329 (55%), Gaps = 23/329 (6%)
Query: 69 ETRYFEQRLDHFSF---ADLPTFSQRYLINTDHWV-GPN-RLGPIFLYCGNEGDIEWFAV 123
+ +F QRLDHF + TF QRY + W GP GPIF Y GNE D+ +
Sbjct: 1 QESWFTQRLDHFHHHPGGNDTTFQQRYFLCDKFWSRGPGGSRGPIFFYAGNEADVTLYVN 60
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
+G +W+ A FGA+++F EHRYYG++ P G + + +YL+ EQALAD++V I
Sbjct: 61 ATGLIWEHAEEFGALVLFAEHRYYGKTQPLGPDSWS----SDPTYLSVEQALADYSVLIW 116
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI--VPPETF 241
N+ + E SPV+ FGGSYGGMLAAW+RLKYPH+ GA+A+SAP+ F + P F
Sbjct: 117 NITRTTGGEDSPVIAFGGSYGGMLAAWLRLKYPHLVTGAVAASAPVGAFPGVPGWQPSKF 176
Query: 242 YNIVSSDFKRESAS---CFNTIKESWGELVSVGQKENGLLELTKTFHLCREL---NSTED 295
+ +V+ D + + C + ++ ++G ++++G+ G L + LC+ + ++
Sbjct: 177 WEVVTYDATASAGAVPECSSNVRAAFGHVMALGRTATGRAALGRLLRLCKPIEDEDAAVG 236
Query: 296 LADWLESAYSYLAMVDYPYPSDFM-----MPLPGYPIREVCKKIDNAPDATSILERIFEG 350
+A WL+ A+ AM +YPYPS ++ PLP +P+R C ++ S L + +
Sbjct: 237 VAYWLQGAFDAFAMGNYPYPSSYISDDPAHPLPAWPMRAACTRLAGRGLRASDLA-LRDA 295
Query: 351 VSVYYNYTGNVDCFQLDDDPHGLDGWNWQ 379
V YN TG V C+ ++ W++Q
Sbjct: 296 AGVLYNVTGRVQCYTVETSGPAAGPWDYQ 324
>gi|226472752|emb|CAX71062.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
gi|226472756|emb|CAX71064.1| putative Lysosomal Pro-X carboxypeptidase precursor [Schistosoma
japonicum]
Length = 480
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/301 (41%), Positives = 175/301 (58%), Gaps = 15/301 (4%)
Query: 69 ETRYFEQRLDHFSF-ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
+ YF Q LDHFSF A TF QRYL D W PN GPIF YCGNEG+I F N+G
Sbjct: 37 KENYFNQTLDHFSFQARNLTFKQRYLYE-DKWFKPN--GPIFFYCGNEGEIGGFWNNTGL 93
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
V+++AP F A ++F EHRYYG+S+P+ + ++Q + YL+ QALAD+A I +K
Sbjct: 94 VFELAPSFNAFILFAEHRYYGKSLPF---DKSFQQPY-IQYLSIGQALADYAYLIEGIKN 149
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ SPVV FGGSYGGMLAA+MR KYPHI GALA+SAP+ F+ V+
Sbjct: 150 KFNMTRSPVVAFGGSYGGMLAAYMRAKYPHIVKGALAASAPVRWVAGEGNFHDFFEAVTK 209
Query: 248 DFKRESASCFNTIKESWGELVSVGQKEN-GLLELTKTFHLCRELNSTED---LADWLESA 303
D+ C IK ++ V + QK + G +L++ LC+ + + + + W +A
Sbjct: 210 DYHDADPKCSEKIKNAFTVAVQLSQKPDVGYKQLSEQLRLCQPIENDFEFYWMLKWARNA 269
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDC 363
+ +AM+DYPY + FM LP P+ CK NA A + + E V V+YN + ++ C
Sbjct: 270 FVMMAMLDYPYKASFMASLPPNPVNVSCK---NALSAIDPIPTLREAVGVFYNSSQSLMC 326
Query: 364 F 364
F
Sbjct: 327 F 327
>gi|348574464|ref|XP_003473010.1| PREDICTED: dipeptidyl peptidase 2-like [Cavia porcellus]
Length = 507
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/308 (40%), Positives = 190/308 (61%), Gaps = 20/308 (6%)
Query: 67 RYETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
Y YFEQ LDHF+F TF QR+L++ W P GPIF Y GNEGD+ FA N
Sbjct: 41 EYREYYFEQLLDHFNFESYGNKTFHQRFLMSDKFWKQPK--GPIFFYTGNEGDVWVFANN 98
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
SGF+ ++A + A+L+F EHRYYG+S+P+G+ + + LT EQALADFAV +
Sbjct: 99 SGFLVELAQQQEALLIFAEHRYYGKSLPFGAQSTQH---GFMQLLTVEQALADFAVLLQV 155
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
L+Q+L A+ SP + FGGSYGGML+A+MR+KYPH+ GALA+SAP++ +V F+
Sbjct: 156 LRQDLCAQDSPTITFGGSYGGMLSAYMRIKYPHLVAGALAASAPVVAVAGLVDSYQFFRD 215
Query: 245 VSSDFKRESASCFNTIKESWGELVSV---GQKENGLLELTKTFHLCRELNSTEDLAD--- 298
V++DF +S C ++E++ E+ ++ G E +++ F C+ L+ +EDL
Sbjct: 216 VTADFYSQSPKCVQAVREAFQEIRNLYLQGAHE----RISREFGTCQLLSGSEDLTQLFM 271
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYT 358
+ +A++ LAM+DYPY +DF++PLP P++ C + N + L R+ G + YN +
Sbjct: 272 FARNAFTVLAMMDYPYHTDFLVPLPANPVKVGCDILLNEAQPITGL-RMLAG--MIYNTS 328
Query: 359 GNVDCFQL 366
G C+ +
Sbjct: 329 GMEHCYDI 336
>gi|324508835|gb|ADY43728.1| Serine protease pcp-1 [Ascaris suum]
Length = 484
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 128/313 (40%), Positives = 182/313 (58%), Gaps = 31/313 (9%)
Query: 91 RYLINTDHW-VGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGE 149
+YL N ++ +G GPIF Y GNE IE FA N+G ++D+APRF A +VF EHRYYGE
Sbjct: 2 KYLYNNTYYKIG----GPIFFYAGNEAAIEGFAENTGIMFDLAPRFNASIVFAEHRYYGE 57
Query: 150 SMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL--SAEASPVVLFGGSYGGML 207
S P+G +++Y + L +LT+ QA+ADFA F+ + K N+ + +PV+ FGGSYGGML
Sbjct: 58 SKPFG--DLSYSDVKNLGFLTSTQAMADFAKFLPHFKANVLNCSSDTPVIAFGGSYGGML 115
Query: 208 AAWMRLKYPHIAIGALASSAPILQFEDI-VPPETFYNIVSSDFKRESASCFNTIKESWGE 266
AAW R+KYPHI GA ASSAP+L F+ V P F +V+ DF E+ + +++
Sbjct: 116 AAWFRIKYPHIVTGAWASSAPVLLFKGANVDPGAFDKVVTEDF-IEAGCNREAVYKAFNA 174
Query: 267 LVSVGQKENGLLELTKTFHLCRELNSTED-----LADWLESAYSYLAMVDYPYPSDFMMP 321
+ + GL L + F + + N T+ L ++ A+ YLAMV+YPYP+ F++P
Sbjct: 175 IHELASTTAGLTFLNEMFVIEAKSNLTQASDADYLVSFIREAFGYLAMVNYPYPTSFLLP 234
Query: 322 LPGYPIREVCKKIDNAPDATSILER-----IFEGVSVYYNYTGNVD--------CFQLDD 368
LPG+P++E CK+ A T R ++ ++YYNYTG V C
Sbjct: 235 LPGWPVKEACKRAQAAFPQTPTTNRDLVNYLYIISNLYYNYTGTVATNCVKTSVCGDQAT 294
Query: 369 DPHGLD--GWNWQ 379
G D GW WQ
Sbjct: 295 AESGDDAFGWPWQ 307
>gi|302758754|ref|XP_002962800.1| hypothetical protein SELMODRAFT_404105 [Selaginella moellendorffii]
gi|300169661|gb|EFJ36263.1| hypothetical protein SELMODRAFT_404105 [Selaginella moellendorffii]
Length = 674
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 105/162 (64%), Positives = 128/162 (79%), Gaps = 6/162 (3%)
Query: 90 QRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGE 149
Q YL+++ W G PI +YCGN+GDI WFA N+GF++DIA F A+LVFPEHR+YG+
Sbjct: 18 QHYLLHSASWSGGASGAPILVYCGNKGDIVWFAENTGFMFDIAHLFRALLVFPEHRFYGK 77
Query: 150 SMPYGSTEVAYQNA-TTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLA 208
S P+G QN L++ +AEQALADFA I +LK+NLSA+ASPVV+FGGSYGGMLA
Sbjct: 78 SQPFGG-----QNGPKELAFCSAEQALADFATLILDLKRNLSAQASPVVVFGGSYGGMLA 132
Query: 209 AWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
AW RLKYPHIAIGALASSAPILQFE+IVP T+Y+IVS+ FK
Sbjct: 133 AWFRLKYPHIAIGALASSAPILQFENIVPYTTYYDIVSNAFK 174
>gi|216296557|gb|ACJ72063.1| prolylcarboxypeptidase [Trichoplax adhaerens]
Length = 254
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 105/248 (42%), Positives = 160/248 (64%), Gaps = 8/248 (3%)
Query: 60 RQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE 119
+ Q +Y+T+YF+Q +DHF + T+ QRYL+N DHW GPIF Y GNEG I
Sbjct: 5 QSTQAAVKYKTKYFDQIIDHFDWKSNATYRQRYLMNDDHW--DKGTGPIFFYTGNEGGIV 62
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
F NSG ++D+AP+F A++VF EHRYYG+S+P+G +N L LT+EQALAD+A
Sbjct: 63 GFWQNSGLLFDLAPQFRALIVFGEHRYYGKSLPFGKDSFKPKN---LELLTSEQALADYA 119
Query: 180 VFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPE 239
V +T+LK++L+A VV FGGSYGGML AWMRLKYP+I LA+SAP+ +V P
Sbjct: 120 VLLTSLKKSLNANKCKVVAFGGSYGGMLTAWMRLKYPNIIDAGLAASAPLYMAGGVVSPN 179
Query: 240 TFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---L 296
F+ V+ D++ + C I++++ ++ + + ++G ++ K F++C +L ++ D L
Sbjct: 180 FFFPAVTKDYQDANPKCVPNIRKAFSAVLEMAKSKSGKQKVAKIFNVCNKLKTSADVKQL 239
Query: 297 ADWLESAY 304
W+ + +
Sbjct: 240 IGWIRNGF 247
>gi|395506514|ref|XP_003757577.1| PREDICTED: dipeptidyl peptidase 2 [Sarcophilus harrisii]
Length = 465
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 117/307 (38%), Positives = 176/307 (57%), Gaps = 14/307 (4%)
Query: 65 QYRYETRYFEQRLDHFSFADLP--TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
Q ++ RYFEQ LDHF+F TF QR+L+ W GPIF Y GNE D+ FA
Sbjct: 15 QVDFQERYFEQILDHFNFESYGNNTFLQRFLVTEKFW--KKGTGPIFFYTGNEADVWAFA 72
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
N F+ ++A A+++F EHRYYG+S+P+G N S LT EQALADFAV I
Sbjct: 73 SNCDFILELASAEEALVIFAEHRYYGKSLPFGVQSTRKGNT---SLLTVEQALADFAVLI 129
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
L++ AE PV+ FGGSYGGML+A+MR+KYP++ GALA+SAP+L I F+
Sbjct: 130 QALQKEYKAENVPVITFGGSYGGMLSAYMRMKYPNLVAGALAASAPVLSIAGIGDSSQFF 189
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLE- 301
V++DF+ S C ++E++ L+ + ++++ C + +S + E
Sbjct: 190 RDVTADFENYSPKCVQGVREAF-RLIKDLYLQRAFDKISQEMGTCTQPSSDSAITQLFEF 248
Query: 302 --SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTG 359
+A++ + M+DYPYP+DFM P P++ C ++ +A + L + V ++YN +G
Sbjct: 249 ARNAFTMITMMDYPYPTDFMGHFPANPVKVGCDRLLSAKNQIQGLRDL---VGLFYNTSG 305
Query: 360 NVDCFQL 366
CF +
Sbjct: 306 TEPCFDI 312
>gi|17557065|ref|NP_498718.1| Protein PCP-5, isoform b [Caenorhabditis elegans]
gi|351020563|emb|CCD62539.1| Protein PCP-5, isoform b [Caenorhabditis elegans]
Length = 568
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 117/305 (38%), Positives = 180/305 (59%), Gaps = 15/305 (4%)
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
+Y+YE Y + +D F+F + F RY +N DH+ GPI Y GNEG +E FA N
Sbjct: 99 KYKYEEGYLKAPIDPFAFTNDLEFDLRYFLNIDHY---ETGGPILFYTGNEGSLEAFAEN 155
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+GF+WD+AP A +VF EHR+YG+S P+ + +Y + L YL+++QALADFA+ +
Sbjct: 156 TGFMWDLAPELKAAVVFVEHRFYGKSQPFKNE--SYTDIRHLGYLSSQQALADFALSVQF 213
Query: 185 LKQNL--SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
K A+ S V+ FGGSYGGML+AW R+KYPHI GA+A+SAP+ F D PE Y
Sbjct: 214 FKNEKIKGAQKSAVIAFGGSYGGMLSAWFRIKYPHIVDGAIAASAPVFWFTDSNIPEDVY 273
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHL--CRELNSTED---LA 297
+ + + ++ I++ W L + + ++G L + L +L + +D L
Sbjct: 274 DFIVTRAFLDAGCNRKAIEKGWIALDELAKSDSGRQYLNVLYKLDPKSKLENKDDIGFLK 333
Query: 298 DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDAT--SILERIFEGVSVYY 355
++ + +AMV+YPYP+ F+ LP +P++E CK P T E++++ V++YY
Sbjct: 334 QYIRESMEAMAMVNYPYPTSFLSSLPAWPVKEACKSASQ-PGKTQEESAEQLYKIVNLYY 392
Query: 356 NYTGN 360
NYTG+
Sbjct: 393 NYTGD 397
>gi|432876424|ref|XP_004073042.1| PREDICTED: dipeptidyl peptidase 2-like [Oryzias latipes]
Length = 480
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 124/313 (39%), Positives = 182/313 (58%), Gaps = 22/313 (7%)
Query: 59 QRQQQQQYRYETRYFEQRLDHFSFADLP--TFSQRYLINTDHWVGPNRLGPIFLYCGNEG 116
Q + + ++ +YF Q +DHF+F L TF+QRYLI W + GP+F Y GNEG
Sbjct: 26 QSRTDGEPQFTEKYFSQVVDHFNFNSLGNRTFNQRYLITDRFWRRSS--GPVFFYTGNEG 83
Query: 117 DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALA 176
DI FA+NSGF+ ++A + A+++F EHRYYG S+P+G+ + + LT EQALA
Sbjct: 84 DIWEFALNSGFIMELAAQQEALVIFAEHRYYGRSLPFGNNSFSIPE---VGLLTVEQALA 140
Query: 177 DFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
D+A+ IT LK L A SPV+ FGGSYGGML+ +MRLKYP+I GALA+SAPIL +
Sbjct: 141 DYALMITELKLQLGAAQSPVIAFGGSYGGMLSVYMRLKYPNIVAGALAASAPILSTAGLG 200
Query: 237 PPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED- 295
P F+ V++DF+R S +C + ++ +L ++ + + LC+ +S +D
Sbjct: 201 DPRQFFRDVTADFERVSPACRGAVTAAFQQLREAAERRD-YSHIQAELSLCQPPSSAQDV 259
Query: 296 --LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSV 353
L L +A++ +AM+DYPY + FM LP P+ K + S E +
Sbjct: 260 HQLYGLLRNAFTLMAMLDYPYSTHFMGSLPANPV-----KTGSGLCVMSTKEWMV----- 309
Query: 354 YYNYTGNVDCFQL 366
YN +G + CF L
Sbjct: 310 -YNSSGLLPCFDL 321
>gi|17557063|ref|NP_498719.1| Protein PCP-5, isoform a [Caenorhabditis elegans]
gi|466050|sp|P34676.1|PCP5_CAEEL RecName: Full=Prolyl carboxy peptidase like protein 5; Flags:
Precursor
gi|351020562|emb|CCD62538.1| Protein PCP-5, isoform a [Caenorhabditis elegans]
Length = 507
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 117/305 (38%), Positives = 181/305 (59%), Gaps = 15/305 (4%)
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
+Y+YE Y + +D F+F + F RY +N DH+ GPI Y GNEG +E FA N
Sbjct: 38 KYKYEEGYLKAPIDPFAFTNDLEFDLRYFLNIDHY---ETGGPILFYTGNEGSLEAFAEN 94
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+GF+WD+AP A +VF EHR+YG+S P+ + +Y + L YL+++QALADFA+ +
Sbjct: 95 TGFMWDLAPELKAAVVFVEHRFYGKSQPFKNE--SYTDIRHLGYLSSQQALADFALSVQF 152
Query: 185 LKQNL--SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
K A+ S V+ FGGSYGGML+AW R+KYPHI GA+A+SAP+ F D PE Y
Sbjct: 153 FKNEKIKGAQKSAVIAFGGSYGGMLSAWFRIKYPHIVDGAIAASAPVFWFTDSNIPEDVY 212
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHL--CRELNSTED---LA 297
+ + + ++ I++ W L + + ++G L + L +L + +D L
Sbjct: 213 DFIVTRAFLDAGCNRKAIEKGWIALDELAKSDSGRQYLNVLYKLDPKSKLENKDDIGFLK 272
Query: 298 DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDAT--SILERIFEGVSVYY 355
++ + +AMV+YPYP+ F+ LP +P++E CK + P T E++++ V++YY
Sbjct: 273 QYIRESMEAMAMVNYPYPTSFLSSLPAWPVKEACKSA-SQPGKTQEESAEQLYKIVNLYY 331
Query: 356 NYTGN 360
NYTG+
Sbjct: 332 NYTGD 336
>gi|417400793|gb|JAA47320.1| Putative dipeptidyl peptidase 2 [Desmodus rotundus]
Length = 429
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 116/291 (39%), Positives = 175/291 (60%), Gaps = 12/291 (4%)
Query: 68 YETRYFEQRLDHFSFADLP--TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ YFEQ LDHF+F TF QR+L++ W GP+F Y GNEGD+ FA NS
Sbjct: 40 FRESYFEQFLDHFNFERFGNQTFPQRFLVSEKFW--KRGEGPLFFYTGNEGDVWAFANNS 97
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ ++A + GA++VF EHRYYG+S+P+G + + LT EQALADFA + L
Sbjct: 98 GFILELAAQQGALVVFAEHRYYGKSLPFGERSTQRGH---VELLTVEQALADFARLLQAL 154
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A+ P V FGGSYGGML+A+MR+KYPH+ GALA+SAP++ + F+ V
Sbjct: 155 RRDLGAQDVPAVAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVLVAGLGDAYQFFRDV 214
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DWLES 302
S DF+ + C +++++ ++ + + +++ F LCR L+ +DL + +
Sbjct: 215 SLDFEGQGPKCAQGVRDAFRQIKDL-FLQGAYDVVSQAFGLCRPLSGWKDLVQLFGFARN 273
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSV 353
A++ LAM+DYPYP+DF+ LP P++ C ++ N D L R G S
Sbjct: 274 AFTVLAMMDYPYPTDFLGHLPANPVQVACDRLLNESDRIEGL-RALAGASA 323
>gi|313242000|emb|CBY34184.1| unnamed protein product [Oikopleura dioica]
Length = 402
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 119/332 (35%), Positives = 178/332 (53%), Gaps = 19/332 (5%)
Query: 53 EPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYC 112
+P + + + E FE DHFS + R +I D + GP+ Y
Sbjct: 15 KPLKYSSLESYSDFCSEISTFEAEYDHFSTRNTQKIEIR-VITDDRFYQAG--GPVLFYT 71
Query: 113 GNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAE 172
GNEGD++ F N+GF+ A LVF EHRYYG+S+P + L YL+AE
Sbjct: 72 GNEGDVQLFCENTGFMRKAGKELNAKLVFMEHRYYGKSIP---------DDKNL-YLSAE 121
Query: 173 QALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF 232
QALAD+A ++ +LK S PV+ GGSYGGMLAA+ R+KYP++ GA+A SAP+
Sbjct: 122 QALADYAEYLVHLKS--SGVTGPVIAMGGSYGGMLAAYFRIKYPNLVAGAIAGSAPVKFL 179
Query: 233 EDIVPPETFYNIVSSDFKRESAS--CFNTIKESWGELVSVGQKENGLLELTKTFHLCREL 290
+ FY + + F + C + I++SW + +G G L++ F C +
Sbjct: 180 PGLFDCRGFYRVTTRTFTNTPSGHFCSDNIRKSWETIKLIGAHMVGKRTLSEVFRTCDPI 239
Query: 291 NSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEG 350
E L D+LE+ + LAM+DYPYP++F+ +PG+P+ C +D+ + +LE + +
Sbjct: 240 TDVEPLLDFLENVWGTLAMMDYPYPTNFVGDVPGWPVNVACSHLDHDINQEELLEPLRDA 299
Query: 351 VSVYYNYTGNVDCFQLDDDPH--GLDGWNWQV 380
SVYYNYTG++ C L D+ G D W +Q
Sbjct: 300 ASVYYNYTGDLACLDLGDEGGDLGYDNWYFQT 331
>gi|313235771|emb|CBY11221.1| unnamed protein product [Oikopleura dioica]
Length = 484
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 117/328 (35%), Positives = 173/328 (52%), Gaps = 17/328 (5%)
Query: 53 EPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYC 112
+P + + + E FE DHFS + R +I D + GP+ Y
Sbjct: 15 KPLKYSSLESYSDFCSEISTFEAEYDHFSTRNTQKIEIR-VITDDRFYQAG--GPVLFYT 71
Query: 113 GNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAE 172
GNEGD++ F N+GF+ A LVF EHRYYG+S+P YL+AE
Sbjct: 72 GNEGDVQLFCENTGFMRKAGKELNAKLVFMEHRYYGKSIPDDKN----------LYLSAE 121
Query: 173 QALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF 232
QALAD+A ++ +LK S PV+ GGSYGGMLAA+ R+KYP++ GA+A SAP+
Sbjct: 122 QALADYAEYLVHLKS--SGVTGPVIAMGGSYGGMLAAYFRIKYPNLVAGAIAGSAPVKFL 179
Query: 233 EDIVPPETFYNIVSSDFKRESAS--CFNTIKESWGELVSVGQKENGLLELTKTFHLCREL 290
+ FY + + F + C + I++SW + +G G L++ F C +
Sbjct: 180 PGLFDCRGFYRVTTRTFTNTPSGHFCSDNIRKSWETIKLIGAHMVGKRTLSEVFRTCDPI 239
Query: 291 NSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEG 350
E L D+LE + LAM+DYPYP++F+ +PG+P+ C +D+ + +LE + +
Sbjct: 240 TDVEPLLDFLEDVWGTLAMMDYPYPTNFVGDVPGWPVNVACSHLDHDINQEELLEPLRDA 299
Query: 351 VSVYYNYTGNVDCFQLDDDPHGLDGWNW 378
SVYYNYTG++ C L D+ L NW
Sbjct: 300 ASVYYNYTGDLACLDLGDEGGDLGYNNW 327
>gi|313246245|emb|CBY35176.1| unnamed protein product [Oikopleura dioica]
Length = 484
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 117/328 (35%), Positives = 173/328 (52%), Gaps = 17/328 (5%)
Query: 53 EPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYC 112
+P + + + E FE DHFS + R +I D + GP+ Y
Sbjct: 15 KPLKYSSLESYSDFCSEISTFEAEYDHFSTRNTQKIEIR-VITDDRFYQAG--GPVLFYT 71
Query: 113 GNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAE 172
GNEGD++ F N+GF+ A LVF EHRYYG+S+P YL+AE
Sbjct: 72 GNEGDVQLFCENTGFMRKAGKELNAKLVFMEHRYYGKSIPDDKN----------LYLSAE 121
Query: 173 QALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF 232
QALAD+A ++ +LK S PV+ GGSYGGMLAA+ R+KYP++ GA+A SAP+
Sbjct: 122 QALADYAEYLVHLKS--SGVTGPVIAMGGSYGGMLAAYFRIKYPNLVAGAIAGSAPVKFL 179
Query: 233 EDIVPPETFYNIVSSDFKRESAS--CFNTIKESWGELVSVGQKENGLLELTKTFHLCREL 290
+ FY + + F + C + I++SW + +G G L++ F C +
Sbjct: 180 PGLFDCRGFYRVTTRTFTNTPSEHFCSDNIRKSWETIKLIGAHMVGKRTLSEVFRTCEPI 239
Query: 291 NSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEG 350
E L D+LE + LAM+DYPYP++F+ +PG+P+ C +D+ + +LE + +
Sbjct: 240 TDVEPLLDFLEDVWGTLAMMDYPYPTNFVGDVPGWPVNVACSHLDHDINQEELLEPLRDA 299
Query: 351 VSVYYNYTGNVDCFQLDDDPHGLDGWNW 378
SVYYNYTG++ C L D+ L NW
Sbjct: 300 ASVYYNYTGDLACLDLGDEGGDLGYNNW 327
>gi|326428644|gb|EGD74214.1| hypothetical protein PTSG_06225 [Salpingoeca sp. ATCC 50818]
Length = 501
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 127/324 (39%), Positives = 187/324 (57%), Gaps = 31/324 (9%)
Query: 72 YFEQRLDHFSF-----ADLP-TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ Q DHF + A+ P TF QR I +W N GPIF Y GNEGD+E + ++
Sbjct: 46 WYNQTTDHFQWRPSGTAEEPLTFQQRVFICDQYWDKTNP-GPIFFYAGNEGDVELYVNHT 104
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGES--MPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
G +W+ AP F A+LVF EHR+YG++ P S +Q YLT +QA+AD+A +
Sbjct: 105 GLMWESAPMFRALLVFAEHRFYGKTQLTPGASGPSEHQ----YKYLTHDQAMADYAHLLY 160
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP---ET 240
+LK++ + E+S ++FGGSYGGMLAAW+R+KYP GA+A+SAPIL F + PP
Sbjct: 161 HLKRDRNCESSKTIVFGGSYGGMLAAWLRMKYPQTFDGAIAASAPILAFPGMTPPFDSNG 220
Query: 241 FYNIVSSD---FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED-- 295
++ +V+ D + +C N ++ +W EL S G+ E+G L+ F C + S +D
Sbjct: 221 YWQVVTRDATPAAGAAPACENNMRNAWKELFSRGKTESGRKSLSTLFRTCSPVTSEDDTW 280
Query: 296 -LADWLESAYSYLAMVDYPYPSDFMM----PLPGYPIREVCK---KIDNAPDATSILERI 347
LA +L + LAM +YPYPS+++ LP YP+ CK K D DA +L +
Sbjct: 281 RLAMFLLLSIDTLAMGNYPYPSNYLTGGGPKLPAYPVVAACKPLAKKDLKGDA--LLSAL 338
Query: 348 FEGVSVYYNYTGNVDCFQLDDDPH 371
+G +VY N T ++ CF + D H
Sbjct: 339 RDGAAVYANATQDLTCFDIPDQKH 362
>gi|390365524|ref|XP_783661.3| PREDICTED: lysosomal Pro-X carboxypeptidase-like
[Strongylocentrotus purpuratus]
Length = 622
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 120/279 (43%), Positives = 160/279 (57%), Gaps = 24/279 (8%)
Query: 56 QRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE 115
+ + Q + ++ YFEQ++DHFSF + TF RYL++ + W + GPIF Y GNE
Sbjct: 38 EAKISSQGCSHPHKEEYFEQQVDHFSFTNSDTFQMRYLVSDELWT---KGGPIFFYTGNE 94
Query: 116 GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQAL 175
GDI WF N+GFVWD+A + A+++F EHRYYG+S+PYG+ +Y++A L YLTAEQAL
Sbjct: 95 GDITWFCQNTGFVWDLAVEYKAIVIFAEHRYYGKSLPYGND--SYKDAAHLGYLTAEQAL 152
Query: 176 ADFAVFITNLKQNL--SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
ADFAVF+ K N A SPVV FGGSYGGMLAAWMR+KYP+ G
Sbjct: 153 ADFAVFLDWYKANTRGGAAGSPVVAFGGSYGGMLAAWMRIKYPNAIAGDDKDCDDDDDDN 212
Query: 234 DIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNST 293
+ + N IK VS +G +L + LC L +T
Sbjct: 213 NNSSDDKDIND-------------KIIKPLHVNHVSP-LTASGRTKLAQAMKLCNPLKTT 258
Query: 294 ED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIRE 329
D L WL ++ LAMVDYPYP++F+ PLP +PI+
Sbjct: 259 ADVDGLISWLAGSWFNLAMVDYPYPANFLEPLPAFPIKS 297
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 72/181 (39%), Positives = 106/181 (58%), Gaps = 7/181 (3%)
Query: 207 LAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGE 266
LAAWMR+KYP+ GA+A+SAP+ QF + P T Y +S DF+ + C++++ SW
Sbjct: 312 LAAWMRIKYPNAIAGAIAASAPVWQFTGLTPCNTQYLTISKDFQAANQLCYDSVHMSWDV 371
Query: 267 LVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESAYSYLAMVDYPYPSDFMMPLP 323
+ +GQ +G +L + LC L +T D L WL ++ LAMVDYPYP++F+ PLP
Sbjct: 372 ITRIGQTASGRTKLAQAMKLCNPLKTTADVDGLISWLAGSWFNLAMVDYPYPANFLEPLP 431
Query: 324 GYPIREVCKKIDN-APDATSILERIFEGVSVYYNYTGNVDCFQLDDDPH---GLDGWNWQ 379
+PI+EVC +P +L + + VYYNYT ++ CF L D G GW++Q
Sbjct: 432 AFPIKEVCSYFKTPSPTDDQLLAELTGALGVYYNYTSSIQCFNLSQDATASLGDLGWSFQ 491
Query: 380 V 380
Sbjct: 492 A 492
>gi|449666912|ref|XP_004206448.1| PREDICTED: dipeptidyl peptidase 2-like [Hydra magnipapillata]
Length = 478
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 109/273 (39%), Positives = 164/273 (60%), Gaps = 14/273 (5%)
Query: 68 YETRYFEQRLDHFSFADLP----TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
Y +YF Q +DHF+F F QRYLI+ +W GP+ Y GNEG IE F
Sbjct: 22 YVEKYFVQFIDHFNFLGQAGANGQFKQRYLISDKYW--SKGKGPVLFYTGNEGSIENFWE 79
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
N+GFV+++A + +++F EHRYYG+S+P+G+ N + +LT +QALADFA I
Sbjct: 80 NTGFVFELAQKLKGLVIFGEHRYYGKSLPFGNDSFTPAN---IGFLTIDQALADFAALIQ 136
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
+LK+++ A+ V FGGSYGGML A+MR KYPHI G +ASSAP L P F+
Sbjct: 137 HLKKSMGADNCSVFAFGGSYGGMLTAYMRYKYPHIVDGGVASSAPFLTIAGKRPRSEFFQ 196
Query: 244 IVSSDFKRESASCFNTIKESWGELVSV-GQKENGLLELTKTFHLCRELNS----TEDLAD 298
V+ F++ ++C ++++ ++ +L+ + + GL +L K F LC + + +
Sbjct: 197 TVTETFRKADSNCPSSVQIAFTQLMDLFNSGKEGLQQLEKVFSLCEGQMTRPFLEKQMIA 256
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
W +A++ L+MVDYPYP+ FM LPG+P+ C
Sbjct: 257 WARNAFTLLSMVDYPYPAKFMADLPGHPVELAC 289
>gi|412993304|emb|CCO16837.1| predicted protein [Bathycoccus prasinos]
Length = 563
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/360 (35%), Positives = 196/360 (54%), Gaps = 45/360 (12%)
Query: 57 RQQRQQQQQYRYETRYFEQRLDHFSF----------ADLPTFSQRYLINTDHWVGPNRLG 106
Q+ + + R E R+ +Q L HFS+ F RY + ++ + +
Sbjct: 66 HQEEENKYVSRCEERWIQQPLSHFSWNSEEEEERGGEGGGEFKTRYFVCSEFY---RKDS 122
Query: 107 PIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL 166
PIFLY GNE ++E + N+G +W+ A F A+LVF EHRYYG+S P + N TL
Sbjct: 123 PIFLYTGNEANVESYLENTGLMWENAEHFNALLVFAEHRYYGKSSPMSDDDEEDTNKNTL 182
Query: 167 SYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASS 226
+L + +ALAD+A + L++ +A V+ FGGSYGGMLA+WMR+KYPH+ GA+A+S
Sbjct: 183 KHLNSMEALADYASLVRELREEYE-DAVAVIAFGGSYGGMLASWMRMKYPHVVDGAIAAS 241
Query: 227 APILQFEDIVPP---------ETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGL 277
API F+ PP T+ +VS A C I++++ L+ G + + +
Sbjct: 242 APIYAFDGEDPPVDPNAFARGSTYTAMVSG----HGAECPKRIQDAFTLLIDSGDESDKI 297
Query: 278 -LELTK-TFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMP----LPGYPIREVC 331
L++ K TF C ++ S ++A+W +SA Y+AM DYP S +M+ LP +P++ VC
Sbjct: 298 YLDVLKHTFRACDDIESPYEVAEWAQSALDYIAMGDYPVESGYMLSGKGTLPAWPMKVVC 357
Query: 332 KKI----DNAPDATS--ILERIFEGVSVYYNYTGNVDCFQL------DDDPHGLDGWNWQ 379
++ NA ++TS +LE + E VS+YYN T CF + DD D W +Q
Sbjct: 358 NEMMVDDPNASNSTSLALLENLREAVSIYYNATKTEQCFTIGDPSPNDDTKATEDLWGYQ 417
>gi|449671875|ref|XP_002154818.2| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Hydra
magnipapillata]
Length = 460
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 120/334 (35%), Positives = 181/334 (54%), Gaps = 37/334 (11%)
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSF-ADLPTFSQRYLINTDHWVGPNRLGPIFLYCG 113
P++ ++ ++ Y Y T +F Q LDHFSF ++ F+QRYLIN D++ P P+F Y G
Sbjct: 23 PKQNRKAKESGYYYTTHWFPQTLDHFSFRSEDYQFAQRYLINDDYF-KPG--APVFFYTG 79
Query: 114 NEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQ 173
NEGDI WF N+GF+WDIA F AMLVF EHRYYGESMP+GS + +++
Sbjct: 80 NEGDITWFCNNTGFMWDIAEEFSAMLVFAEHRYYGESMPFGSDSYKVK-----AFVDGGG 134
Query: 174 ALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
+ ++ S + + + GMLAAW R+KYP +GA++SSAPIL F
Sbjct: 135 GFIKLGIGTIDVASYFSDDITTRSNYS---EGMLAAWFRMKYPASVVGAISSSAPILAFV 191
Query: 234 DIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNST 293
D+ E +Y + +CF + G L++++ F LC+ L +
Sbjct: 192 DMNDCELYY--------LKFYNCFGYAVLDY----------TGRLKISELFKLCKPLKTF 233
Query: 294 ED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNA-PDATSILERIFE 349
+D L +W + LAMV+YPYP++F+ LP +PI EVCK + N+ ++ +
Sbjct: 234 DDVYNLNNWFSEVWVNLAMVNYPYPANFLEDLPAWPINEVCKHLQNSNATGDELIRNLVN 293
Query: 350 GVSVYYNYTGNVDCFQLDDDPHGL---DGWNWQV 380
V+VY+N+TG C ++ G GW++Q
Sbjct: 294 AVNVYFNFTGQSSCLNIEQQASGSLGDQGWDFQA 327
>gi|326438061|gb|EGD83631.1| hypothetical protein PTSG_04239 [Salpingoeca sp. ATCC 50818]
Length = 528
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 118/315 (37%), Positives = 178/315 (56%), Gaps = 25/315 (7%)
Query: 72 YFEQRLDHFSFA----DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
+F Q +DHF++A D T+ QRY I + N PIF Y GNE D+ + N+G
Sbjct: 51 FFTQNIDHFNWAKPLNDKFTYRQRYFICDQYADLSNPKTPIFFYFGNEDDVTLYVNNTGL 110
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+W+ A + A+LVF EHRYYG+S P+ + N +LT EQA+AD+A I +LKQ
Sbjct: 111 MWENAASYKALLVFAEHRYYGKSKPFPAGTPGCMN-----WLTTEQAMADYATLIRDLKQ 165
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY---NI 244
+L+ +PV+ FGGSYGGMLAA+ R KYP I G +A SAPI F + P +Y NI
Sbjct: 166 DLNLTPAPVIGFGGSYGGMLAAYFRRKYPDIVDGVIAGSAPIWAFSGLTPAYDYYGFNNI 225
Query: 245 VSSDFKRESAS---CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD--- 298
++ D + + C N K ++++ ++G L++ LC+ L S +D +
Sbjct: 226 IADDASSKGGASDHCRNNFKAIQPRIMAIASTQHGRHMLSQQLRLCKPLASDQDAYNILL 285
Query: 299 WLESAYSYLAMVDYPYPSDFMM----PLPGYPIREVCKKIDN---APDATSILERIFEGV 351
W ++A++Y+AM D+PY S +++ LP YP+RE CK + + + T + + + +
Sbjct: 286 WAQNAWAYMAMGDFPYASGYIVHGRGKLPPYPVREACKPLSDPQLPANDTKFISALRDAM 345
Query: 352 SVYYNYTGNVDCFQL 366
VYYNYT CF L
Sbjct: 346 DVYYNYTHTEPCFDL 360
>gi|241753998|ref|XP_002401172.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215508369|gb|EEC17823.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 329
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 107/259 (41%), Positives = 162/259 (62%), Gaps = 6/259 (2%)
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G +W+ AP F A+LVF EHRYYG+SMP+G+ ++++ + L YLT+EQALAD+A + +L
Sbjct: 1 GLMWEWAPEFNALLVFAEHRYYGKSMPFGNR--SFESPSKLGYLTSEQALADYADLLLHL 58
Query: 186 KQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
K L AE SPVV FGGSYGG+L+AW R+KYPH+ ALASSAP+ F +VP T+
Sbjct: 59 KAKLPGAEKSPVVAFGGSYGGLLSAWFRIKYPHLITAALASSAPVNMFPGLVPCSTYSIA 118
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN--STEDLADWLES 302
++ F+R S C I++SW L ++G E G L + F+LC+ LN + DW+
Sbjct: 119 ITEAFRRVSELCTQAIRQSWSPLEAMGATEKGTKTLQEKFNLCQGLNPGNYTVFRDWIRD 178
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNA-PDATSILERIFEGVSVYYNYTGNV 361
Y+ LA+V+YP P + PLPG P++ VC + A + +++++ + V++++N TG
Sbjct: 179 TYAVLALVNYPEPGSLITPLPGSPVKAVCDALTKAIGNRSAMVDAVAAAVNLFFNSTGTR 238
Query: 362 DCFQLDDDPHGLDGWNWQV 380
C + + W +QV
Sbjct: 239 KCHDVSIFQSAVPSWRFQV 257
>gi|325188924|emb|CCA23453.1| lysosomal ProX carboxypeptidase putative [Albugo laibachii Nc14]
Length = 544
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 131/375 (34%), Positives = 208/375 (55%), Gaps = 29/375 (7%)
Query: 14 LSPVITIVIISILSPLSLAAQPSKF--RRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETR 71
LS ++ I +S+ +A+Q K R + + +PH T P + +QY E
Sbjct: 27 LSTLLVITDLSLKLDAPIASQSIKLTARNSLLKLEPIPH-TLPNFTLDKNNLRQYCKELS 85
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
F+QRLDHF+ A +F QRY D + + +F Y GNE ++ + ++G++W+
Sbjct: 86 -FKQRLDHFNVAQNASFPQRYFF-CDPYELNAAIDAVFFYLGNEAEVTLYLNHTGWMWEN 143
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
A F A L+F EHRY+G S+P+ E QN + +L++EQALAD+A IT++KQN +
Sbjct: 144 AWEFKAALIFAEHRYFGRSIPF-PKESIRQN---MGFLSSEQALADYAALITSIKQNRTH 199
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF---EDIVPPETFYNIVSS 247
+ +P + FGGSYGGMLAAW R+KYPHI G +A+SAP+L F + V E F + +
Sbjct: 200 LQRAPFIGFGGSYGGMLAAWFRVKYPHIIDGVIAASAPVLAFMGDQRPVDMEGFARVSTF 259
Query: 248 DFKR---ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRE--LNSTED---LADW 299
D S++C + I++SW + + + G +L+K F LC + L+S +D + W
Sbjct: 260 DATMGAGASSNCASNIRQSWQSMWKLSKTLQGREKLSKIFQLCNDAILHSEKDAEAMIMW 319
Query: 300 LESAYSYLAMVDYPYPSDFMM----PLPGYPIREVCKKIDNA----PDATSILERIFEGV 351
+ A+ Y++M +YPYP+ ++M LP YP+R C + +A + ++LE +
Sbjct: 320 AKEAFDYMSMGNYPYPTSYIMNGESTLPSYPVRVACGFLSDAFVVPKEEDTLLEAFVRSI 379
Query: 352 SVYYNYTGNVDCFQL 366
VYYN T C +
Sbjct: 380 GVYYNSTKQKSCHDM 394
>gi|9858825|gb|AAG01154.1|AF285235_1 quiescent cell proline dipeptidase precursor [Mus musculus]
Length = 506
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 120/306 (39%), Positives = 184/306 (60%), Gaps = 18/306 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ YFEQ +DHF+F TF QR+L++ W GPIF Y GNEGDI FA NS
Sbjct: 41 FHENYFEQYMDHFNFESFGNKTFGQRFLVSDKFWKMGE--GPIFFYTGNEGDIWSFANNS 98
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVFIT 183
GF+ ++A + A+LVF EHRYYG+S+P+G ST+ Y LT EQALADFAV +
Sbjct: 99 GFMVELAAQQEALLVFAEHRYYGKSLPFGVQSTQRGYTQ-----LLTVEQALADFAVLLQ 153
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
L+Q+L +P + FGGSYGGML+A+MR+KYPH+ GALA+SAP++ + F+
Sbjct: 154 ALRQDLGVHDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVAVAGLGDSYQFFR 213
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WL 300
V++DF +S C +++++ ++ + + +++ F C+ L+S +DL +
Sbjct: 214 DVTADFYGQSPKCAQAVRDAFQQIKDL-FLQGAYDTISQNFGTCQSLSSPKDLTQLFGFA 272
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGN 360
+A++ LAM+DYPYP+DF+ PLP P++ C+++ N L + + YN +G
Sbjct: 273 RNAFTVLAMMDYPYPTDFLGPLPANPVKVGCQRLLNEGQRIMGLRAL---AGLVYNSSGT 329
Query: 361 VDCFQL 366
C+ +
Sbjct: 330 EPCYDI 335
>gi|31981425|ref|NP_114031.2| dipeptidyl peptidase 2 precursor [Mus musculus]
gi|341940461|sp|Q9ET22.2|DPP2_MOUSE RecName: Full=Dipeptidyl peptidase 2; AltName: Full=Dipeptidyl
aminopeptidase II; AltName: Full=Dipeptidyl peptidase 7;
AltName: Full=Dipeptidyl peptidase II; Short=DPP II;
AltName: Full=Quiescent cell proline dipeptidase; Flags:
Precursor
gi|20072621|gb|AAH27205.1| Dipeptidylpeptidase 7 [Mus musculus]
gi|148676290|gb|EDL08237.1| dipeptidylpeptidase 7, isoform CRA_d [Mus musculus]
Length = 506
Score = 208 bits (529), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 120/306 (39%), Positives = 184/306 (60%), Gaps = 18/306 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ YFEQ +DHF+F TF QR+L++ W GPIF Y GNEGDI FA NS
Sbjct: 41 FHENYFEQYMDHFNFESFGNKTFGQRFLVSDKFWKMGE--GPIFFYTGNEGDIWSFANNS 98
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVFIT 183
GF+ ++A + A+LVF EHRYYG+S+P+G ST+ Y LT EQALADFAV +
Sbjct: 99 GFMVELAAQQEALLVFAEHRYYGKSLPFGVQSTQRGYTQ-----LLTVEQALADFAVLLQ 153
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
L+Q+L +P + FGGSYGGML+A+MR+KYPH+ GALA+SAP++ + F+
Sbjct: 154 ALRQDLGVHDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVAVAGLGDSYQFFR 213
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WL 300
V++DF +S C +++++ ++ + + +++ F C+ L+S +DL +
Sbjct: 214 DVTADFYGQSPKCAQAVRDAFQQIKDL-FLQGAYDTISQNFGTCQSLSSPKDLTQLFGFA 272
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGN 360
+A++ LAM+DYPYP+DF+ PLP P++ C+++ N L + + YN +G
Sbjct: 273 RNAFTVLAMMDYPYPTDFLGPLPANPVKVGCQRLLNEGQRIMGLRAL---AGLVYNSSGT 329
Query: 361 VDCFQL 366
C+ +
Sbjct: 330 EPCYDI 335
>gi|334312261|ref|XP_001374504.2| PREDICTED: dipeptidyl peptidase 2-like [Monodelphis domestica]
Length = 513
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 121/333 (36%), Positives = 179/333 (53%), Gaps = 28/333 (8%)
Query: 65 QYRYETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
Q ++ RYFEQ LDHF+F TF QR+L+ W GPIF Y GNE DI FA
Sbjct: 65 QVDFQERYFEQILDHFNFESYGSSTFLQRFLVTEKFW--KKGTGPIFFYTGNEADIWAFA 122
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
NS F+ ++A A+++F EHRYYG+S+P+G N LT EQALADFAV I
Sbjct: 123 NNSNFILELAAVEEALVIFAEHRYYGKSLPFGDQSTRKGNT---GLLTVEQALADFAVLI 179
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
LK+ E PV+ FGGSYGGML+A+MR+KYP++ GALA+SAP++ I F+
Sbjct: 180 QTLKKEY--EDVPVITFGGSYGGMLSAYMRMKYPNLVAGALAASAPVVSIAGIGNSSQFF 237
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLE- 301
V++DF+ S C ++E++ ++ E + + C + ++ + E
Sbjct: 238 RDVTTDFENHSPKCAQRVREAF-RMIRDLYLEQAFDRIHQDMGTCTQPSNDSAITQLFEF 296
Query: 302 --SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTG 359
+A++ ++M+DYPYP+DFM P P++ C ++ A + L + + YN +G
Sbjct: 297 ARNAFTMISMMDYPYPTDFMGHFPANPVKVGCDRLLRAENPIQGLRAL---AGLLYNASG 353
Query: 360 NVDCF------QLDDDPHGL------DGWNWQV 380
CF Q DP G + W++Q
Sbjct: 354 TEPCFDIYQLYQKCADPTGCGTGPSAEAWDYQA 386
>gi|148676287|gb|EDL08234.1| dipeptidylpeptidase 7, isoform CRA_a [Mus musculus]
Length = 329
Score = 207 bits (526), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 117/276 (42%), Positives = 176/276 (63%), Gaps = 15/276 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ YFEQ +DHF+F TF QR+L++ W GPIF Y GNEGDI FA NS
Sbjct: 41 FHENYFEQYMDHFNFESFGNKTFGQRFLVSDKFWKMGE--GPIFFYTGNEGDIWSFANNS 98
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVFIT 183
GF+ ++A + A+LVF EHRYYG+S+P+G ST+ Y T L LT EQALADFAV +
Sbjct: 99 GFMVELAAQQEALLVFAEHRYYGKSLPFGVQSTQRGY---TQL--LTVEQALADFAVLLQ 153
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
L+Q+L +P + FGGSYGGML+A+MR+KYPH+ GALA+SAP++ + F+
Sbjct: 154 ALRQDLGVHDAPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVAVAGLGDSYQFFR 213
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WL 300
V++DF +S C +++++ ++ + + +++ F C+ L+S +DL +
Sbjct: 214 DVTADFYGQSPKCAQAVRDAFQQIKDL-FLQGAYDTISQNFGTCQSLSSPKDLTQLFGFA 272
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
+A++ LAM+DYPYP+DF+ PLP P++ C+++ N
Sbjct: 273 RNAFTVLAMMDYPYPTDFLGPLPANPVKVGCQRLLN 308
>gi|296491747|tpg|DAA33780.1| TPA: dipeptidyl peptidase 7 [Bos taurus]
Length = 335
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 119/305 (39%), Positives = 188/305 (61%), Gaps = 16/305 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNR-LGPIFLYCGNEGDIEWFAVN 124
++ YFEQ LDHF+F TF QR+L+ W NR GPIF Y GNEGD+ FA N
Sbjct: 36 FQEAYFEQLLDHFNFERFGNKTFLQRFLMTEKFW---NRGEGPIFFYTGNEGDVWSFANN 92
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
SGF+ ++A + GA++VF EHRYYG+S+P+G ++ T L LT EQALADFA +
Sbjct: 93 SGFILELAEQQGALVVFAEHRYYGKSLPFGERST-WRGYTEL--LTVEQALADFAGLLRA 149
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
L+Q L A +P + FGGSYGGML+A++R+KYPH+ GALA+SAP++ + P F+
Sbjct: 150 LRQELEAPDAPAIAFGGSYGGMLSAYLRIKYPHLVAGALAASAPVVSAAGLGDPYQFFQD 209
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WLE 301
VS+DF+ +S C +++++ ++ + Q+ + +++ F C+ L+ +DL +
Sbjct: 210 VSADFQGQSPECARAVQDAFRQIRDLFQQGAPHV-VSQEFGTCQPLSGPKDLTQLFGFAR 268
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
+A++ LAM+DYPY +DF+ LP +P++ C ++ + + L + + YN +G
Sbjct: 269 NAFTVLAMMDYPYATDFVGHLPAHPVQVGCSRLLSESSRIAGLRAL---AGLVYNSSGIE 325
Query: 362 DCFQL 366
C+ +
Sbjct: 326 PCYDI 330
>gi|224072899|ref|XP_002190034.1| PREDICTED: dipeptidyl peptidase 2 [Taeniopygia guttata]
Length = 468
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 113/291 (38%), Positives = 172/291 (59%), Gaps = 22/291 (7%)
Query: 105 LGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNAT 164
GPIF Y GNEGDI FA NS F++++A A+++F EHRYYG+S+P+G E T
Sbjct: 42 FGPIFFYTGNEGDIWTFAQNSDFIFELAEEQQALVIFAEHRYYGKSLPFG-LESTQLKKT 100
Query: 165 TLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
L LT EQALAD+AV IT LKQ A PV+ FGGSYGGML+A++R+KYP++ GALA
Sbjct: 101 AL--LTVEQALADYAVLITELKQQFGAADCPVIAFGGSYGGMLSAYLRMKYPNVVAGALA 158
Query: 225 SSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF 284
+SAP+L + P F+ V++DF++ S C +++++ ++ + E++
Sbjct: 159 ASAPLLSVAGLGDPTQFFRDVTADFQKSSLGCVTAVRKAFQQIKDLCL-SGAYDEISSKM 217
Query: 285 HLCRELNSTED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDAT 341
C ++++ ED L + +A++ +AM+DYPY +DFM LP P++ C++I D
Sbjct: 218 ATCNKISNKEDVYQLFGFARNAFTMMAMMDYPYKTDFMGDLPANPVKVGCEQIIAHKDP- 276
Query: 342 SILERIFEGVSVYYNYTGNVDC------FQLDDDPHGL------DGWNWQV 380
+E + V V+YN +G C +Q DP G + W++QV
Sbjct: 277 --IEGLTALVGVFYNSSGLAQCYNIYQLYQSCADPTGCGTGSDAEAWDYQV 325
>gi|157279995|ref|NP_001098513.1| dipeptidyl peptidase 2 precursor [Bos taurus]
gi|151556163|gb|AAI49046.1| DPP7 protein [Bos taurus]
Length = 488
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 119/305 (39%), Positives = 188/305 (61%), Gaps = 16/305 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNR-LGPIFLYCGNEGDIEWFAVN 124
++ YFEQ LDHF+F TF QR+L+ W NR GPIF Y GNEGD+ FA N
Sbjct: 36 FQEAYFEQLLDHFNFERFGNKTFLQRFLMTEKFW---NRGEGPIFFYTGNEGDVWSFANN 92
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
SGF+ ++A + GA++VF EHRYYG+S+P+G ++ T L LT EQALADFA +
Sbjct: 93 SGFILELAEQQGALVVFAEHRYYGKSLPFGERST-WRGYTEL--LTVEQALADFAGLLRA 149
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
L+Q L A +P + FGGSYGGML+A++R+KYPH+ GALA+SAP++ + P F+
Sbjct: 150 LRQELEAPDAPAIAFGGSYGGMLSAYLRIKYPHLVAGALAASAPVVSAAGLGDPYQFFQD 209
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WLE 301
VS+DF+ +S C +++++ ++ + Q+ + +++ F C+ L+ +DL +
Sbjct: 210 VSADFQGQSPECARAVQDAFRQIRDLFQQGAPHV-VSQEFGTCQPLSGPKDLTQLFGFAR 268
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
+A++ LAM+DYPY +DF+ LP +P++ C ++ + + L + + YN +G
Sbjct: 269 NAFTVLAMMDYPYATDFVGHLPAHPVQVGCSRLLSESSRIAGLRAL---AGLVYNSSGIE 325
Query: 362 DCFQL 366
C+ +
Sbjct: 326 PCYDI 330
>gi|354507428|ref|XP_003515758.1| PREDICTED: dipeptidyl peptidase 2-like [Cricetulus griseus]
gi|344258871|gb|EGW14975.1| Dipeptidyl-peptidase 2 [Cricetulus griseus]
Length = 506
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 120/306 (39%), Positives = 184/306 (60%), Gaps = 18/306 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ YFEQ +DHF+F TF+QR+L++ W GPIF Y GNEGDI FA NS
Sbjct: 41 FRENYFEQYMDHFNFESFGNKTFAQRFLVSDKFWKMGK--GPIFFYTGNEGDIWTFANNS 98
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVFIT 183
GF+ ++A + A+LVF EHRYYG+S+P+G ST+ Y LT EQALADFAV +
Sbjct: 99 GFMVELAAQQEALLVFAEHRYYGKSLPFGLQSTQRGYTQ-----LLTVEQALADFAVLLQ 153
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
L+Q+L + P + FGGSYGGML+A+MR+KYPH+ GALA+SAP++ + F+
Sbjct: 154 ALRQDLKVQDIPTIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVAVAGLGESYQFFR 213
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WL 300
V++DF +S C +++++ ++ + + + K F C+ L+S++DL +
Sbjct: 214 DVTADFYGQSPKCAQAVRDAFQQIRDL-FLQGAHDTIRKNFGTCQSLSSSKDLTQLFVFA 272
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGN 360
+A++ LAM DYPYP++F+ LP P++ C+++ + L + V + YN +G
Sbjct: 273 RNAFTVLAMTDYPYPTEFLGNLPANPVKVACERMLSKGQRIMGLRAL---VGLVYNSSGM 329
Query: 361 VDCFQL 366
CF +
Sbjct: 330 EPCFDI 335
>gi|76156383|gb|AAX27594.2| SJCHGC06818 protein [Schistosoma japonicum]
Length = 271
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 106/243 (43%), Positives = 149/243 (61%), Gaps = 7/243 (2%)
Query: 61 QQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW 120
+ Q++YET+YF ++DHFSF F +YLIN + + + GPI Y GNEG IE
Sbjct: 27 NKDSQFKYETKYFRTKIDHFSFVTDGEFEIKYLINNESF---SSGGPILFYTGNEGAIET 83
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
FA NSGF+W +A A +VF EHRYYG S+P+G+ ++++ YLTAEQALAD+ +
Sbjct: 84 FAENSGFIWKLAEELNASVVFAEHRYYGTSLPFGND--SFKDRQYFGYLTAEQALADYVL 141
Query: 181 FITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPE 239
I LK N S A SPV+ FGGSYGGML+AW+R KYP+ GA+ASSAP+ F +
Sbjct: 142 LINQLKVNYSCFASSPVISFGGSYGGMLSAWIRQKYPNQIAGAIASSAPVWLFPGLSDCN 201
Query: 240 TFYNIVSSDF-KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD 298
F + ++ F K +C I+ SW +V +GQ +G LT F++C L +++ D
Sbjct: 202 GFSLVATNSFLKYGGDNCVKNIQHSWSNIVDIGQSFDGKELLTNMFNICTPLTDVQNIID 261
Query: 299 WLE 301
+L
Sbjct: 262 YLS 264
>gi|348686510|gb|EGZ26325.1| hypothetical protein PHYSODRAFT_327243 [Phytophthora sojae]
Length = 543
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 121/319 (37%), Positives = 182/319 (57%), Gaps = 32/319 (10%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+ ++F Q LDHF PT+ QRY + H+ P G +F Y GNE D+E + ++G +
Sbjct: 79 DEKFFTQTLDHFDVG-APTYLQRYFVCDRHFR-PG--GVMFFYVGNEADVELYLNHTGLM 134
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT-LSYLTAEQALADFAVFITNLKQ 187
W+ A FGAMLVF EHRY+G+S+P+G +N T + YL+ EQALAD+AV IT LK+
Sbjct: 135 WENADEFGAMLVFAEHRYFGKSVPFG------RNVTKHMRYLSTEQALADYAVLITRLKE 188
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ-FEDIVPPE--TFYNI 244
+ PV+ FGGSYGGML +W R+KYPHI G +A+SAPIL F D V + + +
Sbjct: 189 EWQRDI-PVIGFGGSYGGMLGSWFRMKYPHIIDGVIAASAPILSYFGDEVAHDLRGYSQV 247
Query: 245 VSSDFKRESAS---CFNTIKESWGELVSVGQKENGLLELTKTFHLCRE--LNSTEDL--- 296
+ D + S C ++ +W + + G+ +G +L + LC + L++ E +
Sbjct: 248 TTFDASPAAGSAQNCVPNVRRAWPTMRAFGKTTSGRRKLKEALVLCEDTPLDTDEAIDAV 307
Query: 297 ADWLESAYSYLAMVDYPYPSDFMM----PLPGYPIREVCKKIDNAPDAT-----SILERI 347
W + ++ +AM +YPY S ++M LP YP+R C + +A D T +L
Sbjct: 308 MQWAKDSFDSMAMGNYPYASSYIMNGVSELPAYPMRVACSHLQDAFDETEDGDFKLLRAF 367
Query: 348 FEGVSVYYNYTGNVDCFQL 366
+ + VYYN T + +CFQL
Sbjct: 368 AKTIGVYYNSTKDKECFQL 386
>gi|413943524|gb|AFW76173.1| putative serine peptidase S28 family protein, partial [Zea mays]
Length = 266
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 106/174 (60%), Positives = 132/174 (75%), Gaps = 10/174 (5%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHW------VGPNRLGPIFLYCGNEGDIE 119
+ YF Q LDHF+F + F +YL+N W G + GP+F+Y GNEGDIE
Sbjct: 84 FTAHYFPQLLDHFAFTPNASTVFRHKYLLNDTFWRRPGAGAGDDGPGPLFVYTGNEGDIE 143
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
WFA N+GF++DIAP FGA+LVF EHR+YGES P+G+ +Y++A TL YLT+ QALADFA
Sbjct: 144 WFATNTGFMFDIAPTFGALLVFIEHRFYGESKPFGND--SYRSAETLGYLTSTQALADFA 201
Query: 180 VFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
V I LK++L AEA+PVV+FGGSYGGMLA+W RLKYPH+AIGALASSAPILQF+
Sbjct: 202 VVIRGLKRDLGAEAAPVVVFGGSYGGMLASWFRLKYPHVAIGALASSAPILQFD 255
>gi|440790409|gb|ELR11692.1| prolylcarboxypeptidase isoform 2 preproprotein [Acanthamoeba
castellanii str. Neff]
Length = 506
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 122/317 (38%), Positives = 174/317 (54%), Gaps = 26/317 (8%)
Query: 68 YETRYFEQRLDHFSFADLP-TFSQRYLINTDHWVG--PNRL-GPIFLYCGNEGDIEWFAV 123
Y T YF+Q LDHF+FA P T+ QR+L+ ++W G P GPIF Y GNE + +
Sbjct: 60 YRTLYFDQTLDHFNFATQPATYKQRFLLADEYWRGSYPGGCPGPIFFYTGNEAPVTDYYS 119
Query: 124 NSGFVWDI-APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
SGF + AP+ A+LVF ESMP+GS + +SYL+ EQALAD+AV I
Sbjct: 120 ASGFFTQVLAPKHNALLVF------AESMPFGSKSF---DPEKISYLSPEQALADYAVLI 170
Query: 183 TNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
T+LK+ L A PV FGGSYGG+L AW R+KYP I +G LA+SAP+ + + P F
Sbjct: 171 THLKETLPHARNCPVFAFGGSYGGILTAWFRMKYPDIVMGGLAASAPLSFYGTGISPYAF 230
Query: 242 YNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD--- 298
N S F + C I +++ L G K F LC LNS + +
Sbjct: 231 TNSASDTFAQARLGCAPLIAQAFETLQRFSATPEGCERFAKAFKLCGPLNSQAEASAVVY 290
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI------DNAPDATSILERIFEGVS 352
W+E + +AM+DYP+ S++ + LP +P+ + C +I +N PD + E + +
Sbjct: 291 WVEMGLASMAMLDYPFASNYGVSLPAWPVNKTCDRILEKAANNNDPDI--LAEALGYAIG 348
Query: 353 VYYNYTGNVDCFQLDDD 369
V+YN TG+ C+ + D
Sbjct: 349 VFYNNTGDHSCYDIKRD 365
>gi|358253095|dbj|GAA51984.1| dipeptidyl peptidase 2, partial [Clonorchis sinensis]
Length = 593
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 124/355 (34%), Positives = 189/355 (53%), Gaps = 43/355 (12%)
Query: 8 NQNSLYLSPVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYR 67
N + YL +I + I+++ L +A R PP+ Q
Sbjct: 273 NLQATYLFSMIYLTIVAVFCLLDCSANQFDIPR-------------PPKEQ--------- 310
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
YF QR+DH +F A++ T+ RYL D W GPIF YCGNEGDI F NS
Sbjct: 311 ----YFTQRVDHMNFQPANI-TYRMRYLYE-DKWYKSG--GPIFFYCGNEGDIFGFWNNS 362
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF++ +A + AM+VF EHRYYG+S+P+ ++ + + +L+ EQ LAD+A I +L
Sbjct: 363 GFIFHLASKMDAMVVFAEHRYYGKSLPFKNSF----SQPYIQFLSIEQTLADYANLIQHL 418
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K+ + + V+ FGGSYGGMLAA+MR YPH+ GA+ASSAP+ + F+ V
Sbjct: 419 KEKYGRDNTAVIAFGGSYGGMLAAYMRASYPHLVAGAIASSAPVNWVAGLGNIHQFFEHV 478
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKE-NGLLELTKTFHLCRELNSTED---LADWLE 301
+ D+ + + C +K ++ L + ++ L ++K LC+ +++ D + W
Sbjct: 479 TDDYNQVNPQCVARVKNAYDLLERMVMEDIRALASISKQMKLCKPMHTIFDFVWMLKWSR 538
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYN 356
+A+ + M+DYP + F+ LP YP+ C KI APD S L + VSV+YN
Sbjct: 539 NAFVMMTMLDYPTDNTFISQLPAYPVNVSCAKILAAPDVISALR---DAVSVWYN 590
>gi|410990145|ref|XP_004001310.1| PREDICTED: dipeptidyl peptidase 2 [Felis catus]
Length = 549
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 116/307 (37%), Positives = 187/307 (60%), Gaps = 14/307 (4%)
Query: 65 QYRYETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
Q ++ YFEQ LDHF+F TF QR+L++ W GPIF Y GNEG++ FA
Sbjct: 33 QCDFQEGYFEQLLDHFNFERFGNKTFLQRFLVSEKFW--KRGEGPIFFYTGNEGNVWSFA 90
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
NSGF+ ++A + GA+++F EHRYYG+S+P+G E + Q T LT EQALADFA +
Sbjct: 91 NNSGFILELAAQQGALVIFAEHRYYGKSLPFG--ERSTQRGHT-ELLTVEQALADFARLL 147
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
L+++L A+ +P ++FGGSYGGML+A+MR+KYPH+ GALA+SAP++ + F+
Sbjct: 148 NALRRDLGAQDTPAIVFGGSYGGMLSAYMRIKYPHLVAGALAASAPVVAVAGLGDSYQFF 207
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---W 299
VS+DF+ +S C +++++ ++ + + +++ F C+ L+ +DL +
Sbjct: 208 RDVSADFEGQSPKCAQGVRDAFRQIKDL-FIQGAYDTVSQEFGTCQPLSGQKDLTQLFGF 266
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTG 359
+A++ LAM+DYPYP+DF+ LP P++ C ++ + L + + YN +G
Sbjct: 267 ARNAFTVLAMMDYPYPTDFIGHLPANPVKVGCDRLLSETQRIKGLRAL---AGLVYNSSG 323
Query: 360 NVDCFQL 366
C+ +
Sbjct: 324 TEPCYDI 330
>gi|313236608|emb|CBY11867.1| unnamed protein product [Oikopleura dioica]
Length = 471
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 109/283 (38%), Positives = 168/283 (59%), Gaps = 14/283 (4%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+YET++ +QR+D+F++ T+ RYL+NTD +V + PIF Y GNEG I+ FA N+G
Sbjct: 22 KYETKWIDQRVDNFNYYLDKTYKMRYLVNTD-FVKDEKTAPIFFYTGNEGPIDSFAANTG 80
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F+ + A A +V+ EHRYYG+S+PYG++ +N ++YL+ E ALADFA I LK
Sbjct: 81 FMNEFAEEENAFIVYAEHRYYGQSLPYGNSSFTPEN---MAYLSVENALADFAQLIVELK 137
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ P++ FGGSYGG+L+ +MR+ YP++ GALA+S+P+ + F+ +
Sbjct: 138 KTYKG---PLICFGGSYGGLLSMYMRMTYPNLVNGALAASSPVYWISAMGDSHGFWVKTT 194
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED----LADWLES 302
DF C +TI+ + L + + + +TKT C+ N TED + W +
Sbjct: 195 EDFSTALDKCEDTIRAGFAALDKM-KNDKDWAGITKTMRTCQ--NITEDNYMHMLGWARN 251
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILE 345
A + +AM+DYPYP++F LPG P++E C + A SI E
Sbjct: 252 AMATMAMMDYPYPTNFEAALPGNPVKESCVRAVAETGADSIRE 294
>gi|326930125|ref|XP_003211202.1| PREDICTED: dipeptidyl peptidase 2-like [Meleagris gallopavo]
Length = 432
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 104/265 (39%), Positives = 163/265 (61%), Gaps = 10/265 (3%)
Query: 105 LGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNAT 164
GPIF Y GNEGDI FA NS F++++A + A+++F EHRYYG+S+P+G + +N
Sbjct: 23 FGPIFFYTGNEGDIWTFAENSDFIFELAEQQQALVIFAEHRYYGKSLPFGLESMQIKNT- 81
Query: 165 TLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
LT EQALAD+AV IT LKQ A PV+ FGGSYGGML+A++R+KYP++ GALA
Sbjct: 82 --HLLTVEQALADYAVLITELKQQYGAAGCPVIAFGGSYGGMLSAYLRMKYPNVVDGALA 139
Query: 225 SSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF 284
+SAP+L + P F+ V++DF++ C ++ ++ ++ + E++
Sbjct: 140 ASAPVLSVAGLGDPTQFFRDVTADFQKSIPGCVTAVRRAFQQIRDL-FLSGAYDEISSKM 198
Query: 285 HLCRELNSTED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDAT 341
C +++S +D L + +A++ +AM+DYPY +DFM LP P++ C++I D
Sbjct: 199 ATCNKISSKKDVYQLFGFARNAFTMIAMMDYPYKTDFMGHLPANPVKVGCEQILAHTDPI 258
Query: 342 SILERIFEGVSVYYNYTGNVDCFQL 366
L + V V+YN +G+V C+ +
Sbjct: 259 QGLAAL---VGVFYNSSGSVQCYDV 280
>gi|395844464|ref|XP_003794980.1| PREDICTED: dipeptidyl peptidase 2 [Otolemur garnettii]
Length = 558
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 115/306 (37%), Positives = 188/306 (61%), Gaps = 18/306 (5%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ RYFEQ LDHF+F TF QR+L++ W GPIF Y GNEGD+ FA +S
Sbjct: 99 FGERYFEQLLDHFNFERFGNKTFPQRFLVSDKFW--DRAEGPIFFYTGNEGDVWSFANHS 156
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVFIT 183
GF+ ++A + A+LVF EHRYYG+S+P+G ST+ Y + LT EQALADFAV +
Sbjct: 157 GFIVELAAQEAALLVFAEHRYYGKSLPFGKRSTQRGY-----MELLTVEQALADFAVLLQ 211
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
L+++L A+ +P + FGGSYGGML+A++R+KYPH+ +GALA+SAP++ + P+ F+
Sbjct: 212 ALQRDLGAQDAPTIAFGGSYGGMLSAYLRMKYPHLVVGALAASAPVIAAAGLGDPDQFFR 271
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWL 300
V++ F+ +S C +++++ ++ + + + +++ F C+ L+ +D L +
Sbjct: 272 DVTAAFESQSPKCAQGVRDAFQQIRDLFLQGD-YDTVSREFSTCQPLSGPKDQTQLFVFT 330
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGN 360
+A++ LAM++YPY +DFM LP P++ C + + + L + + YN +G
Sbjct: 331 RNAFTVLAMMNYPYATDFMGHLPANPVKVGCNLLLSEAQRITGLRAL---AGLVYNASGT 387
Query: 361 VDCFQL 366
C+ +
Sbjct: 388 EPCYDI 393
>gi|355684809|gb|AER97524.1| dipeptidyl-peptidase 7 [Mustela putorius furo]
Length = 316
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 118/275 (42%), Positives = 174/275 (63%), Gaps = 17/275 (6%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ YFEQ LDHF+F TF QR+L++ W GPIF Y GNEGD+ FA NS
Sbjct: 36 FQEGYFEQLLDHFNFERFGNKTFPQRFLVSEKFW--KKGKGPIFFYTGNEGDVWSFANNS 93
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ ++A + A++VF EHRYYG+S+P+G + T L LT EQALADFA I L
Sbjct: 94 GFIQELAAQQEALVVFAEHRYYGKSLPFGDRSTR-RGHTEL--LTVEQALADFARLIRAL 150
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L A SPV+ FGGSYGGML+A+MR+KYPH+ GALA+SAP++ + F+ V
Sbjct: 151 QRDLGAYDSPVIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVAVAGLGDSYQFFRDV 210
Query: 246 SSDFKRESASCFNTIKESW---GELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---W 299
S+DF+ +S C +++++ +L S G + +++ F C+ L+S EDL +
Sbjct: 211 SADFEGQSPKCAQGVRDAFRQIKDLFSQGAYDT----VSREFGTCQPLSSREDLTQLFGF 266
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI 334
+A++ LAM+DYPYP+DF+ LP P++ C ++
Sbjct: 267 ARNAFTVLAMMDYPYPTDFIGHLPANPVKVGCDRL 301
>gi|449281466|gb|EMC88535.1| Dipeptidyl-peptidase 2, partial [Columba livia]
Length = 437
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 112/291 (38%), Positives = 169/291 (58%), Gaps = 22/291 (7%)
Query: 105 LGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNAT 164
GPIF Y GNEGDI FA NS F++++A A+++F EHRYYG+S+P+G + +N
Sbjct: 11 FGPIFFYTGNEGDIWNFAQNSDFIFELAEEQQALVIFAEHRYYGKSLPFGLESMQPKNT- 69
Query: 165 TLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
LT EQALAD+AV IT LKQ A PV+ FGGSYGGML+A+MR+KYP+I GALA
Sbjct: 70 --GLLTVEQALADYAVLITELKQQYGAADCPVIAFGGSYGGMLSAYMRMKYPNIVSGALA 127
Query: 225 SSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF 284
+SAP+L + P F+ V++DF + S C +++++ ++ + + E++
Sbjct: 128 ASAPLLSVAGLGDPTQFFRDVTADFDKSSPGCVPAVRKAFQQIKDLFLR-GAYDEISSKM 186
Query: 285 HLCRELNSTED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDAT 341
C ++++ ED L + +A++ +AM+DYPY +DFM LP P++ C +I D
Sbjct: 187 ATCNKISTKEDVYQLFGFARNAFTMMAMMDYPYKTDFMGHLPANPVKVGCDQILTHADPI 246
Query: 342 SILERIFEGVSVYYNYTGNVDC------FQLDDDPHGL------DGWNWQV 380
L + V V YN + + C +Q DP G + W++QV
Sbjct: 247 RGLAAL---VGVLYNSSSSAQCYDIYQLYQSCADPTGCGIGSDAEAWDYQV 294
>gi|268575350|ref|XP_002642654.1| C. briggsae CBR-PCP-1.1 protein [Caenorhabditis briggsae]
Length = 512
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 125/340 (36%), Positives = 184/340 (54%), Gaps = 35/340 (10%)
Query: 69 ETRYFE-QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
E R+++ RLDHF++ D TF R + N ++ GPIF Y GNEG + F V +G
Sbjct: 10 EVRWYKGMRLDHFTWGDTRTFDLRIMWNNTYY---QPGGPIFFYTGNEGAVSTFEVATGM 66
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
++D+AP F A ++F EHR+YG + P+G+ +Y N + YLT+EQALAD+A +T LK+
Sbjct: 67 MFDLAPMFNASIIFAEHRFYGATQPFGNQ--SYANLANVGYLTSEQALADYAELLTELKR 124
Query: 188 NLSA------EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED-IVPPET 240
+ + S V+ FGGSYGGML+AW R KYPHI GA A SAP++ D V P
Sbjct: 125 DNNQFGKTFHRDSQVISFGGSYGGMLSAWFRQKYPHIVKGAWAGSAPLIYMHDGGVDPGA 184
Query: 241 FYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKT--FHLC--RELNSTED- 295
F NI S + + F + +W ++++ ++G L F L +N+ D
Sbjct: 185 FDNITSRTYVENGCNRF-ILANAWNAVLNLSSTDSGRAWLNNNPVFKLDPRTPINNQTDG 243
Query: 296 --LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS---ILERIFEG 350
L ++ A Y+AMVDYPYP+ F+ PLPG+P+ C ++ ++ S ++ +
Sbjct: 244 WNLNAYMREAIEYMAMVDYPYPTGFLEPLPGWPVTVACGYMNATGESFSDQDLVTAVANA 303
Query: 351 VSVYYNYTGNVD---CFQL----DDDPHGLD----GWNWQ 379
+VYYNY N + C D GL GW WQ
Sbjct: 304 ANVYYNYNQNANFTWCIDFNICGDQGTGGLGDDALGWPWQ 343
>gi|268575348|ref|XP_002642653.1| C. briggsae CBR-PCP-1.2 protein [Caenorhabditis briggsae]
Length = 564
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 124/336 (36%), Positives = 179/336 (53%), Gaps = 34/336 (10%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
Y RLDHF++ D TF R + N + GPIF Y GNEG + F V +G ++D+
Sbjct: 44 YKNMRLDHFTWGDTRTFDMRIMWNNTFY---QPGGPIFFYTGNEGAVSTFEVATGMMFDL 100
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
AP F A ++F EHR+YG + P+G+ +Y N + YLT+EQALAD+A +T LK++ +
Sbjct: 101 APMFNASIIFAEHRFYGATQPFGNQ--SYANLANVGYLTSEQALADYAELLTELKRDNNQ 158
Query: 192 ------EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED-IVPPETFYNI 244
S V+ FGGSYGGML+AW R KYPHI GA A SAP++ D V P F NI
Sbjct: 159 FGKTFHRDSQVISFGGSYGGMLSAWFRQKYPHIVKGAWAGSAPLIYMHDGGVDPGAFDNI 218
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK--TFHLC--RELNSTED---LA 297
S + + F + +W ++++ ++G L F L +N+ D L
Sbjct: 219 TSRTYVENGCNRF-ILANAWNAVLNLSSTDSGRAWLNNNPVFKLDPRTPINNQTDGWNLN 277
Query: 298 DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS---ILERIFEGVSVY 354
++ A Y+AMVDYPYP+ F+ PLPG+P+ C ++ ++ S ++ + +VY
Sbjct: 278 AYMREAIEYMAMVDYPYPTGFLEPLPGWPVTVACGYMNATGESFSDQDLVTAVANAANVY 337
Query: 355 YNYTGNVD---CFQL----DDDPHGLD----GWNWQ 379
YNY N + C D GL GW WQ
Sbjct: 338 YNYNQNANFTWCIDFNICGDQGTGGLGDDALGWPWQ 373
>gi|348676451|gb|EGZ16269.1| hypothetical protein PHYSODRAFT_560767 [Phytophthora sojae]
Length = 574
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 122/333 (36%), Positives = 191/333 (57%), Gaps = 44/333 (13%)
Query: 72 YFEQRLDHFSF---------------ADLP-TFSQRYLINTDHWVGPNRLGPIFLYCGNE 115
+FEQR+DHFS+ + LP T+ QRYL+NT W ++ P+F Y GNE
Sbjct: 101 WFEQRIDHFSWLAAEALDPSNAGAAPSGLPATYKQRYLLNTQFWDPKDKKAPVFFYTGNE 160
Query: 116 GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQAL 175
GD+ +A ++G +W+ A F A++VF EHRYYG+S P+G + + L+Y+T +QAL
Sbjct: 161 GDVTLYANHTGLIWENAKAFKALVVFAEHRYYGKSFPFGDKYMDH-----LAYVTHDQAL 215
Query: 176 ADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
AD+ I +L++ A PV+ FGGSYGGML+AW R+KYP+I GA+A+SAPI F
Sbjct: 216 ADYTELIYHLQKKYDAFNHPVIAFGGSYGGMLSAWFRMKYPNIIAGAIAASAPIYGFGGF 275
Query: 236 --VPPETFYNIVSSDFK---RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL 290
+ ++ +V+ D +A+C K+SW ++ + + E+G L+ F LC L
Sbjct: 276 PAFDGQKYWQVVTRDASPAAGSAANCVPNAKKSWAQIFELAKTEDGRATLSSLFRLCTPL 335
Query: 291 NST---EDLADWLESAYSYLAMVDYPYPSDFM----MPLPGYPIREVCKKI--------- 334
S EDLA + A+ LAM D+PYPS ++ + LP +P+R+ C +
Sbjct: 336 ASEEQGEDLAMSVLFAFDTLAMGDFPYPSSYLTGGAVDLPAWPVRQACSHLAGEFPTPSL 395
Query: 335 -DNAPDATSILERIFEGVSVYYNYTGNVDCFQL 366
+ D T++LE + +V++N T ++ CF++
Sbjct: 396 RKDGVD-TTLLEALRNAANVFHNATKDLACFKI 427
>gi|384253205|gb|EIE26680.1| hypothetical protein COCSUDRAFT_11918 [Coccomyxa subellipsoidea
C-169]
Length = 395
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 126/374 (33%), Positives = 190/374 (50%), Gaps = 56/374 (14%)
Query: 18 ITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQR---QQRQQQQQYRYETRYFE 74
IT +++ ++ + + + RR+ + ++P+ + R+ + + TR +
Sbjct: 4 ITPLVLCLIICSAFSGGQCQVRRSRKGPPRIPYKNSSRTQLTSNGREPITKCKVYTR--D 61
Query: 75 QRLDHFSFADLP----TFSQRYLINTDHWVG--PNRLGPIFLYCGNEGDIEWFAVNSGFV 128
LDHFS+A P TF QRY + DHW GPIF Y GNE D+ + +G +
Sbjct: 62 ATLDHFSWATPPDNRTTFKQRYFLCNDHWKSHKDGTRGPIFFYVGNEADVTLYLNATGLM 121
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
W+ A FGA+LVF EHRYYGES P+ + + YLT+EQA+ADFA I LK++
Sbjct: 122 WENAAAFGALLVFAEHRYYGESKPFKKALRHH-----MQYLTSEQAMADFAELIMELKED 176
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF---EDIVPPETFYNIV 245
L A++S V+ FGGSYGGMLA WMR+KYPHI GA+A SAPI + E ++ IV
Sbjct: 177 LGAQSSAVIGFGGSYGGMLATWMRIKYPHILDGAIAGSAPIWSYLGEEPAYDSGSYAKIV 236
Query: 246 SSDFKRESAS---CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES 302
++D S C + ++E W + QK +C ++
Sbjct: 237 TADASEAGGSAPACASNVREVWNQGSWAVQK------------ICLDMRRG--------- 275
Query: 303 AYSYLAMVDYPYPSDFMMP----LPGYPIREVCKKIDNAPDATSILERIF-EGVSVYYNY 357
+YPYPS +++ LP YP+R C+ + A S L F + + V+YNY
Sbjct: 276 --------NYPYPSSYILNGNGILPAYPVRVACESLRQEDLAGSDLLSAFADALGVFYNY 327
Query: 358 TGNVDCFQLDDDPH 371
T +V+C+ P+
Sbjct: 328 TEDVECYDFGAGPN 341
>gi|290981802|ref|XP_002673620.1| predicted protein [Naegleria gruberi]
gi|284087205|gb|EFC40876.1| predicted protein [Naegleria gruberi]
Length = 503
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 119/349 (34%), Positives = 184/349 (52%), Gaps = 45/349 (12%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRY---------------LINTDHWVGPNRLGPIFLYC 112
Y+T +F QRLDHF F ++ TF QRY +++ ++ + + PI Y
Sbjct: 29 YQTGFFTQRLDHFDFTNIATFPQRYFVCDLYVKHSTRSSVIVDDNNLIQIDPFIPIIAYP 88
Query: 113 GNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAE 172
GNEG +E F N+G V+++A +GA+++F EHR+YG+++P N YLT E
Sbjct: 89 GNEGALEEFYENTGLVFELAKYYGALVIFIEHRFYGKTIP--------PNQDPQRYLTIE 140
Query: 173 QALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF 232
QA D AVF T +P++L GGSYGG LAAWMR K+PH+ G++A+SAPIL F
Sbjct: 141 QATHDLAVFFTENFGLDEKRKNPIILVGGSYGGDLAAWMRFKFPHLIDGSIAASAPILFF 200
Query: 233 EDIVPPETFYNIVSSDFKRES-------ASCFNTIKESWGELV----SVGQKENGLLELT 281
I PP I + ++ + +C + +K+ + L S KE L L+
Sbjct: 201 NGITPPYLAAQIATEAYRNLTNFQIYPKMTCDSAVKKGFEFLSKYFESTTSKEQ-LQMLS 259
Query: 282 KTFHLCRELNSTED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP 338
+ F LC E+ S + LA ++ ++ LA +YPYP++F LP +P+ +C I
Sbjct: 260 RKFRLCNEMKSNLEVKVLASYIAFSFEVLAQANYPYPTNFFNNLPAWPVNGLCTSIAKHL 319
Query: 339 DATSILER-------IFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
+ LE +F+GV+++ NYTG+ CF + GL +W +
Sbjct: 320 ATSPNLESEDLYFTILFDGVNLFQNYTGDKSCFNTSNLGGGLQWNSWSL 368
>gi|341900697|gb|EGT56632.1| hypothetical protein CAEBREN_05149 [Caenorhabditis brenneri]
Length = 568
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 119/336 (35%), Positives = 176/336 (52%), Gaps = 34/336 (10%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
Y RLDHF++ D TF R + N + GPIF Y GNEG +E F +G ++D+
Sbjct: 48 YKGMRLDHFTWGDTRTFDLRVMWNNTFY---KEGGPIFFYTGNEGGLESFEKATGMMFDL 104
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
AP F A ++F EHR+YG++ P+G + +Y+N + YLT+EQALAD+A +T LK++ +
Sbjct: 105 APMFNAAIIFAEHRFYGQTQPFG--KDSYKNLANIGYLTSEQALADYAELLTELKRDNNR 162
Query: 191 -----AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED-IVPPETFYNI 244
++ +PV+ FGGSYGGML+AW R KYPH+ GA A SAP++ D V P F NI
Sbjct: 163 MGKTFSQDTPVISFGGSYGGMLSAWFRQKYPHLVKGAWAGSAPLIYMHDGGVDPGAFDNI 222
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENG--LLELTKTFHL-----CRELNSTEDLA 297
S + + + + +W + + + G L F L + +L
Sbjct: 223 TSRTYVDRGCNRY-ILANAWNATIRLSSTDAGRQWLNNNNVFKLDPRTPIKTAADGWNLN 281
Query: 298 DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS---ILERIFEGVSVY 354
+L A Y+AMVDYPYP+ F+ PLP +P+ C ++ + S ++ + ++Y
Sbjct: 282 SYLREAIEYMAMVDYPYPTGFLEPLPAWPVDAACGYMNATGTSFSDQDLVSAVANAANIY 341
Query: 355 YNYTGNVD---CFQL----DDDPHGLD----GWNWQ 379
YNY N C D GL GW WQ
Sbjct: 342 YNYNKNASFTYCIDYSICGDQGTGGLGDDQLGWPWQ 377
>gi|431899040|gb|ELK07410.1| Dipeptidyl-peptidase 2 [Pteropus alecto]
Length = 521
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 107/266 (40%), Positives = 163/266 (61%), Gaps = 11/266 (4%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ YFEQ LDHF+F TF QR+LI+ W GP+F Y GNEGD+ +FA NS
Sbjct: 36 FREDYFEQLLDHFNFERFGNKTFLQRFLISDKFW--KRGEGPLFFYTGNEGDVWFFANNS 93
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
F+ ++A + A++VF EHRYYG+S+P+G E + Q T LT EQALADFA + +L
Sbjct: 94 RFILELAMQQEALVVFAEHRYYGKSLPFG--EQSTQRGHT-ELLTVEQALADFARLLRSL 150
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+Q+ A P + FGGSYGGML+A+MR+KYPH+ GALA+SAP++ + F+ +
Sbjct: 151 RQDFKARDVPAIAFGGSYGGMLSAYMRMKYPHLVAGALAASAPVVSVAGLSDSHQFFRDL 210
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WLES 302
S F+ +S C +++++ ++ + + EL++ F C+ + + LA + +
Sbjct: 211 SVIFENQSPECAQGVRDAFRQIKDL-FLQGAYEELSREFGTCQLVTDWKSLAQLFGFARN 269
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIR 328
A+ LAM++YPYP+DF LP P++
Sbjct: 270 AFIMLAMLNYPYPTDFTGHLPANPVQ 295
>gi|405950118|gb|EKC18122.1| Lysosomal Pro-X carboxypeptidase [Crassostrea gigas]
Length = 576
Score = 198 bits (503), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 96/232 (41%), Positives = 150/232 (64%), Gaps = 7/232 (3%)
Query: 155 STEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRL 213
S V+ + +++LT+EQALAD+AV I ++K ++ + S V+ FGGSYGGMLAAW R+
Sbjct: 4 SFNVSSHDPKKMNFLTSEQALADYAVLIKHIKSSIPGSSQSKVIAFGGSYGGMLAAWFRM 63
Query: 214 KYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQK 273
KYP++ G+LA+SAPI F + F V+ F++ S++C + IK W L + +
Sbjct: 64 KYPNVVQGSLAASAPIWTFRKDADCDAFDRTVTGTFRKSSSTCVDNIKALWKTLNTTASQ 123
Query: 274 ENGLLELTKTFHLCRELNSTED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREV 330
GL +L++ FHLC+ L S +D L +W+ SA YLAMVDYPYPS F+ PLP +P++E
Sbjct: 124 TGGLAKLSEMFHLCKPLKSADDVTTLKNWIVSALVYLAMVDYPYPSKFLAPLPAWPVKET 183
Query: 331 CKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQL--DDDPH-GLDGWNWQ 379
C+ I + +++ + + ++V+YNYTG+ CF + D P+ G+ GW++Q
Sbjct: 184 CRPILTPLNGDNLIIGMAKAMNVFYNYTGSTSCFDIGSGDIPNLGISGWDYQ 235
>gi|348680585|gb|EGZ20401.1| hypothetical protein PHYSODRAFT_558788 [Phytophthora sojae]
Length = 566
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 109/315 (34%), Positives = 173/315 (54%), Gaps = 26/315 (8%)
Query: 69 ETRYFEQRLDHF---SFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ ++ Q LDHF + +F QRY + + P+ G IF Y GNE D+ + ++
Sbjct: 87 QEKFITQELDHFRANGGSSEGSFEQRYFVCSPESFDPSN-GSIFFYVGNEADVTLYLNHT 145
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G +W+ A F A++VF EHRY+G+S+P+G + + + +L+ +QALAD+AV I L
Sbjct: 146 GLMWENAVAFNALIVFAEHRYFGKSVPFGLDVLEH-----MEFLSTQQALADYAVLIEAL 200
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF----EDIVPPETF 241
K+ L + PV+ FGGSYGGML W R+KYPHI G +A+SAP++ F + E F
Sbjct: 201 KKQLGVDV-PVIGFGGSYGGMLGTWFRMKYPHIIDGVIAASAPVVNFLGDPDHPADTEAF 259
Query: 242 YNIVSSDFKRESAS---CFNTIKESWGELVSVGQKENGLLELTKTFHLC--RELNSTEDL 296
+V+ D E+ + C ++ + + Q ++G +L + HLC L++++D+
Sbjct: 260 NRVVTFDMSEEAGAAPNCIPNLRRALTAAIESVQTQDGRKQLAELLHLCDTDSLSTSDDV 319
Query: 297 ADWLESAYSYLAMVDYPYPSDFMM----PLPGYPIREVCKKIDN---APDATSILERIFE 349
AY LAM +YPYP+ ++M LPGYP+R C+ + D ++ E
Sbjct: 320 VSIAAEAYGDLAMGNYPYPTSYIMDGNVDLPGYPMRAACEPLAGDFAEDDDLGLINAFRE 379
Query: 350 GVSVYYNYTGNVDCF 364
+ VYYN T + CF
Sbjct: 380 SIDVYYNATKSESCF 394
>gi|441623431|ref|XP_004088909.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 2 [Nomascus
leucogenys]
Length = 485
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 108/305 (35%), Positives = 176/305 (57%), Gaps = 23/305 (7%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ R+F+QRLDHF+F TF QR+L++ W GPIF Y GNEGD+ FA NS
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWT--QGKGPIFFYTGNEGDVWAFANNS 88
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GFV ++A GA+L+F EH G+ + + + EQALADFA + L
Sbjct: 89 GFVAELAAEQGALLIFAEH--------VGARQGRGRGXXXXXXV--EQALADFAELLRAL 138
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+++L + +P + FGGSYGGML+A++R+KYPH+ GALA+SAP+L + F+ V
Sbjct: 139 RRDLGXQDAPAIAFGGSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDV 198
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLE 301
++DF+ +S C ++E++ + + + G + + F C+ L+ +DL +
Sbjct: 199 TADFEGQSPKCTQGVREAFRRIKDLFLQ--GAYDTVRWEFGTCQPLSDEKDLTQLFRFAR 256
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
+A++ LAM+DYPYP+DF+ PLP P++ C ++ + + L + + YN +G+
Sbjct: 257 NAFTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRAL---AGLVYNASGSQ 313
Query: 362 DCFQL 366
C+ +
Sbjct: 314 HCYDI 318
>gi|241753990|ref|XP_002401169.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215508366|gb|EEC17820.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 261
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 96/234 (41%), Positives = 142/234 (60%), Gaps = 3/234 (1%)
Query: 98 HWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE 157
H+ P + GPIF YCG+E +E + N+G +W+ A F AM++F EHR+YG+S+P
Sbjct: 28 HFWEPTKTGPIFFYCGHELPLEDYINNTGLLWNWAREFKAMIIFSEHRFYGQSLPDDVRT 87
Query: 158 VAYQNATTLSYLTAEQALADFAVFITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYP 216
L+Y +A QALAD+A I ++K+ + A+ PV+ FGG YGGMLAA+ RLKYP
Sbjct: 88 SHLPTVPYLNYFSAVQALADYAHLILHIKETVHRADKVPVIAFGGFYGGMLAAYFRLKYP 147
Query: 217 HIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENG 276
H+ GALASSAP+ F +VP F + ++ F+RESA+C I++SW L+S+
Sbjct: 148 HLIAGALASSAPVQMFPGLVPCSAFDHTLTKAFRRESAACAKAIRKSWPHLLSLTDSSKK 207
Query: 277 LLELTKTFHLCRELN--STEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIR 328
E ++ + +C L + L DW+ Y +LAM +YP PS + LP P++
Sbjct: 208 AHEFSRKYKMCDNLTPQAQRMLLDWMHDTYLHLAMFNYPEPSSRLTSLPANPVK 261
>gi|116242320|gb|ABJ89817.1| lysosomal pro-X carboxypeptidase [Clonorchis sinensis]
Length = 434
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 108/280 (38%), Positives = 157/280 (56%), Gaps = 7/280 (2%)
Query: 106 GPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT 165
GPI Y GNEG IE FA N+GF+W+IA A ++F EHR+YG S+P+ ++++
Sbjct: 1 GPILFYTGNEGAIETFAENTGFMWEIAEELKAAVLFAEHRFYGSSLPF--VNDSFKDPQH 58
Query: 166 LSYLTAEQALADFAVFITNLKQNLSA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
YLTAEQALAD+A + LK ++ E SPV+ FGGSYGGML+AW R KYP++ GA+A
Sbjct: 59 FGYLTAEQALADYASLVQYLKSSVKDFENSPVIAFGGSYGGMLSAWFRYKYPNLIAGAIA 118
Query: 225 SSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKESWGELVSVGQKENGLLELTKT 283
+SAPI F ++ FY+ + F ++ C + W + +V ++ +G L
Sbjct: 119 ASAPIWLFPNMSNCAGFYDTTTRAFSTSGSTVCTKNVALVWDSIRTVAKQHSGHELLRLM 178
Query: 284 FHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID---NAPDA 340
F LC L + L D+L LAMV+YPY + F+ PG P++ CK + +
Sbjct: 179 FQLCDPLPDEQKLIDYLIDFLGTLAMVNYPYEASFIGTFPGEPVKYFCKGLSDAVHRDVD 238
Query: 341 TSILERIFEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
+++R+ V NYT N C L+ D GLD W +
Sbjct: 239 VDVVQRVATAVRSLTNYTKNQSCISLEGDLPGLDAKAWTL 278
>gi|167533602|ref|XP_001748480.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772999|gb|EDQ86644.1| predicted protein [Monosiga brevicollis MX1]
Length = 508
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 117/322 (36%), Positives = 174/322 (54%), Gaps = 34/322 (10%)
Query: 73 FEQRLDHFSFAD-----LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
FEQ +DHF++ TF QRY + D + P G +F+Y GNE DI + ++G
Sbjct: 32 FEQTIDHFNWGAPLGQAQTTFQQRYFV-YDKYYKPGS-GALFVYFGNEDDITLYINHTGL 89
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+W+ A FGA L+F EHRYYG+S P+ A +++LT+EQA+AD+AV + K
Sbjct: 90 MWENAKDFGAYLIFIEHRYYGKSQPFSPG-----TAGCMNWLTSEQAMADYAVLLRWFKA 144
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP---ETFYNI 244
E P + FGGSYGGMLAAW R K+P + G +++SAPI F ++ P + F I
Sbjct: 145 THQMEDVPTIGFGGSYGGMLAAWFRRKFPDVVDGVISASAPIWAFANLTPAYDDDGFAQI 204
Query: 245 VSSD---FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LAD 298
V++D +A+C K+ ++ GL LT F LC L S D L
Sbjct: 205 VTNDATPASGAAAACAANFKQGQKLIIDTASSAAGLANLTSIFRLCNPLQSLNDAYSLLY 264
Query: 299 WLESAYSYLAMVDYPYPSDFMMP----LPGYPIREVCKKI------DNAPDATSILERIF 348
W++ +SY+AM ++PYPS +++ LP +P+R C+ + D PD +L+ +
Sbjct: 265 WVQEPWSYMAMGNFPYPSSYLLHGLGMLPAWPVRVACESLADSSLPDQPPD---LLDAMR 321
Query: 349 EGVSVYYNYTGNVDCFQLDDDP 370
+ +YYNYT C+ L D P
Sbjct: 322 AALDIYYNYTHAETCYDLSDAP 343
>gi|301096651|ref|XP_002897422.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
gi|262107113|gb|EEY65165.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
Length = 568
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 120/332 (36%), Positives = 188/332 (56%), Gaps = 42/332 (12%)
Query: 72 YFEQRLDHFSF---------------ADLP-TFSQRYLINTDHWVGPNRLGPIFLYCGNE 115
+ QR+DHFS+ + LP T+ QRYL+NT W ++ P+F Y GNE
Sbjct: 95 WIAQRIDHFSWLPAEAVDAADPNAAPSGLPATYKQRYLLNTQFWDPSDKKAPVFFYTGNE 154
Query: 116 GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQAL 175
GD+ +A ++G +W+ A F A++VF EHRYYG+S P+G + + L YLT +QAL
Sbjct: 155 GDVTLYANHTGLIWENAQTFKALVVFAEHRYYGKSFPFGDKYMDH-----LGYLTHDQAL 209
Query: 176 ADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
AD+A I ++++ A PV+ FGGSYGGML+AW R+KYP I GA+A+SAPI F
Sbjct: 210 ADYAELIYHVQKKYDALNHPVIAFGGSYGGMLSAWFRMKYPSIIAGAIAASAPIYGFGGF 269
Query: 236 --VPPETFYNIVSSDFKRESASCFNTI---KESWGELVSVGQKENGLLELTKTFHLCREL 290
+ ++ +V+ D + + N + +++W ++ + Q ENG L+ F LC L
Sbjct: 270 PAFDGQKYWQVVTRDASPAAGAAKNCVPNARKAWPQIFELAQTENGRSTLSSIFRLCEPL 329
Query: 291 NST---EDLADWLESAYSYLAMVDYPYPSDFM----MPLPGYPIREVCKKIDNAPDA--- 340
+ EDLA + A+ LAM ++PYPS ++ + LP +P+RE C + A
Sbjct: 330 TTEQQGEDLAMSVLFAFDTLAMGNFPYPSSYLTGGAVDLPAWPVREACSHLAGDFPASTL 389
Query: 341 ------TSILERIFEGVSVYYNYTGNVDCFQL 366
T +LE + + +V++N TG++ CF++
Sbjct: 390 RQENVDTKLLEALRDAANVFHNATGDLTCFKI 421
>gi|301097471|ref|XP_002897830.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
gi|262106578|gb|EEY64630.1| lysosomal Pro-X carboxypeptidase, putative [Phytophthora infestans
T30-4]
Length = 569
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 171/315 (54%), Gaps = 26/315 (8%)
Query: 69 ETRYFEQRLDHF---SFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
E ++ Q LDHF + TF RY + + P G IF Y GNE D+ + ++
Sbjct: 90 EEKFITQSLDHFRADGKSSEGTFDMRYFVCSPDNFSPTN-GSIFFYVGNEADVTLYLNHT 148
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G +W+ A F A++VF EHRY+G+S+P+G + + + +L+ +QA+AD+AV I L
Sbjct: 149 GLMWENAAAFNALIVFAEHRYFGKSVPFGLDVLDH-----MEFLSTQQAMADYAVLIEML 203
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF----EDIVPPETF 241
K++L + PV+ FGGSYGGML W R+KYPHI G +A SAP+ F + PE F
Sbjct: 204 KRDLKVDV-PVIGFGGSYGGMLGTWFRMKYPHIIDGIIAGSAPVANFFGDPDHPADPEAF 262
Query: 242 YNIVSSDFKRESAS---CFNTIKESWGELVSVGQKENGLLELTKTFHLC--RELNSTEDL 296
+V+ D ++ + C I+ + V++ ++G EL + HLC L S++ +
Sbjct: 263 NRVVTFDMSEDAGAATNCIPNIRRALNTAVAMSGTKSGRKELAELLHLCDADSLQSSDKV 322
Query: 297 ADWLESAYSYLAMVDYPYPSDFMM----PLPGYPIREVCK---KIDNAPDATSILERIFE 349
AY LA+ +YPYP+ ++M LPGYP+R C+ + D + ++ E
Sbjct: 323 ISIASEAYGDLAVGNYPYPTSYIMDGKVDLPGYPMRTACEPFAGVFTEDDKSGLIRAFRE 382
Query: 350 GVSVYYNYTGNVDCF 364
++VYYN + + C
Sbjct: 383 SIAVYYNASKSESCL 397
>gi|308481892|ref|XP_003103150.1| CRE-PCP-1 protein [Caenorhabditis remanei]
gi|308260255|gb|EFP04208.1| CRE-PCP-1 protein [Caenorhabditis remanei]
Length = 564
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 113/304 (37%), Positives = 165/304 (54%), Gaps = 25/304 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
Y RLDHF++ D TF R + N + GPIF Y GNEG +E F +G ++D+
Sbjct: 44 YKNMRLDHFTWGDTRTFDMRVMWNNTFY---KPGGPIFFYTGNEGGLESFVTATGIMFDL 100
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ---- 187
AP + A ++F EHR+YG++ P+G+ +Y + YLT+EQALAD+A +T LK+
Sbjct: 101 APMYNASIIFAEHRFYGQTQPFGNN--SYATLANVGYLTSEQALADYAELLTELKRQPNQ 158
Query: 188 -NLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED-IVPPETFYNI 244
NL+ + + ++ FGGSYGGML+AW R KYPHI GA A SAP++ D V P F NI
Sbjct: 159 FNLTFQKDTQIISFGGSYGGMLSAWFRQKYPHIVKGAWAGSAPLIYMHDGGVDPGAFDNI 218
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENG--LLELTKTFHL-----CRELNSTEDLA 297
S + + F + W ++++ + G L F L R +L
Sbjct: 219 TSRTYVDNGCNRF-ILANVWNAVLNLSNTDAGRQWLNNNAVFTLDPRTPIRNQTDGWNLN 277
Query: 298 DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDAT----SILERIFEGVSV 353
+L A Y+AMVDYPYP+ F+ PLP +P+ C + NA T ++ + ++
Sbjct: 278 AYLREAIEYMAMVDYPYPTGFLEPLPAWPVAVACGYM-NATGTTFSDQQLVTMVANAANI 336
Query: 354 YYNY 357
YYNY
Sbjct: 337 YYNY 340
>gi|73967473|ref|XP_848703.1| PREDICTED: dipeptidyl peptidase 2 isoform 1 [Canis lupus
familiaris]
Length = 497
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 116/302 (38%), Positives = 183/302 (60%), Gaps = 18/302 (5%)
Query: 72 YFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVW 129
YFEQ LDHF+F TF QR+L++ W GPIF Y GNEG++ FA NSGF+
Sbjct: 40 YFEQLLDHFNFERFGNKTFQQRFLVSEKFW--KRGKGPIFFYTGNEGNVWSFANNSGFIL 97
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
++A + A+++F EHRYYG+S+P+G ST Y LT EQALADFA + L++
Sbjct: 98 ELAAQQEALVIFAEHRYYGKSLPFGEQSTRRGYTE-----LLTVEQALADFARLLLALRR 152
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+L A+ SP + FGGSYGGML+A+MR+KYPH+ GALA+SAP++ + F+ VS+
Sbjct: 153 DLGAQDSPAIAFGGSYGGMLSAYMRIKYPHLVAGALAASAPVVAVAGLGDSYQFFRDVSA 212
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD---WLESAY 304
DF+ +S C +++++ ++ + + +++ F C+ L+S +DL + +A+
Sbjct: 213 DFEGQSPKCAQGVRDAFQQIQDL-CFQGACDVVSREFGTCQPLSSRKDLTQLFGFARNAF 271
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF 364
+ LAM+DYPYP+ F+ LP P++ C ++ + + L + + YN +G V C+
Sbjct: 272 TVLAMMDYPYPTHFIAHLPANPVKVGCDRLLSESQSIKGLRAL---AGLVYNSSGTVPCY 328
Query: 365 QL 366
+
Sbjct: 329 DI 330
>gi|323451280|gb|EGB07158.1| hypothetical protein AURANDRAFT_2013, partial [Aureococcus
anophagefferens]
Length = 451
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 124/303 (40%), Positives = 174/303 (57%), Gaps = 22/303 (7%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG-FVWD 130
Y Q+LDHF F + TFSQ+ L++ D W P GP+ +Y GNEG IE F NSG +++
Sbjct: 1 YVTQQLDHFRFDETRTFSQKLLVH-DAWHRPG--GPLLMYFGNEGAIEDFYGNSGGLMFE 57
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
+AP+ A + F EHRYYG S+P+G+ A + L++LT EQALAD A+ + + L
Sbjct: 58 LAPKLNASVAFLEHRYYGSSLPFGN---ASYGSDELAFLTVEQALADMALVLATSSEILG 114
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET----FYNIVS 246
A P VLFGGSYGGMLAAW LKYPH+A GA+A+SAP+ D+ P E F++
Sbjct: 115 AADGPAVLFGGSYGGMLAAWFMLKYPHLAAGAVAASAPV----DLYPGEGKERPFFDAGL 170
Query: 247 SDFKR-ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
+ SA+C ++ + L + + G L ++F C L D LA ++
Sbjct: 171 EVYGTYGSAACEADLRAALAALAAAAKTAAGRDALARSFRTCEPLPDPVDGDRLASYVNG 230
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVD 362
A S LAM+DYPY S F+ P+P P+R C ++ AP A S L+ V V+ N+TG
Sbjct: 231 ALSTLAMLDYPYASAFVAPMPANPVRVACGRVAAAPSAASKLK---GAVDVFLNHTGETA 287
Query: 363 CFQ 365
C+
Sbjct: 288 CYD 290
>gi|323451276|gb|EGB07154.1| hypothetical protein AURANDRAFT_28266, partial [Aureococcus
anophagefferens]
Length = 311
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 123/303 (40%), Positives = 173/303 (57%), Gaps = 22/303 (7%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG-FVWD 130
Y Q+LDHF F + TFSQ+ L++ D W P GP+ +Y GNEG IE F NSG +++
Sbjct: 22 YVTQQLDHFRFDETRTFSQKLLVH-DAWHRPG--GPLLMYFGNEGAIEDFYGNSGGLMFE 78
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
+AP+ A + F EHRYYG S+P+G+ A + L++LT EQALAD A+ + + L
Sbjct: 79 LAPKLNASVAFLEHRYYGSSLPFGN---ASYGSDELAFLTVEQALADMALVLATSSEILG 135
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET----FYNIVS 246
A P VLFGGSYGGMLAAW LKYPH+A GA+A+SAP+ D+ P E F++
Sbjct: 136 AADGPAVLFGGSYGGMLAAWFMLKYPHLAAGAVAASAPV----DLYPGEGKERPFFDAGL 191
Query: 247 SDFKR-ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLES 302
+ SA+C ++ + L + + G L ++F C L D L ++
Sbjct: 192 EVYGTYGSAACEADLRAALAALAAAAKTAEGRDALARSFRTCEPLPDPVDGDRLTSYVNG 251
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVD 362
A S LAM+DYPY S F+ P+P P+R C ++ AP A S L+ V V+ N+TG
Sbjct: 252 ALSTLAMLDYPYASAFVAPMPANPVRVACGRVAAAPSAASKLKG---AVDVFLNHTGETA 308
Query: 363 CFQ 365
C+
Sbjct: 309 CYD 311
>gi|17556861|ref|NP_498688.1| Protein PCP-1 [Caenorhabditis elegans]
gi|21431885|sp|P34610.2|PCP1_CAEEL RecName: Full=Putative serine protease pcp-1; Flags: Precursor
gi|351020794|emb|CCD62762.1| Protein PCP-1 [Caenorhabditis elegans]
Length = 565
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 109/303 (35%), Positives = 165/303 (54%), Gaps = 23/303 (7%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
Y +LDHF++ D TF R + N + GPIF Y GNEG +E F +G ++D+
Sbjct: 46 YKNMKLDHFTWGDTRTFDMRVMWNNTFY---KPGGPIFFYTGNEGGLESFVTATGMMFDL 102
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
AP F A ++F EHR+YG++ P+G+ +Y + + YLT+EQALAD+A +T LK++ +
Sbjct: 103 APMFNASIIFAEHRFYGQTQPFGNQ--SYASLANVGYLTSEQALADYAELLTELKRDNNQ 160
Query: 192 ------EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE-DIVPPETFYNI 244
A+ V+ FGGSYGGML+AW R KYPHI GA A SAP++ V P F +I
Sbjct: 161 FKMTFPAATQVISFGGSYGGMLSAWFRQKYPHIVKGAWAGSAPLIYMNGGGVDPGAFDHI 220
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENG--LLELTKTFHL-----CRELNSTEDLA 297
S + + F + +W +++ + G L F L R +L
Sbjct: 221 TSRTYIDNGCNRF-ILANAWNATLNLSSTDAGRQWLNNNTVFKLDPRTKIRNQTDGWNLN 279
Query: 298 DWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS---ILERIFEGVSVY 354
+L A Y+AMVDYPYP+ F+ PLP +P+ C ++ + S +++ + ++Y
Sbjct: 280 AYLREAIEYMAMVDYPYPTGFLEPLPAWPVTVACGYMNANGTSFSDKDLVKAVANAANIY 339
Query: 355 YNY 357
YNY
Sbjct: 340 YNY 342
>gi|449443602|ref|XP_004139566.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like [Cucumis sativus]
Length = 325
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 93/241 (38%), Positives = 150/241 (62%), Gaps = 15/241 (6%)
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSY 203
HR+YG+S+P+GS E A +N + Y + QALAD+A + ++K+ + + SP+++ G SY
Sbjct: 16 HRFYGKSIPFGSLEKAMKNGSIRGYFNSAQALADYAELLLHIKKMFAYDTSPIIVMGASY 75
Query: 204 GGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKES 263
GGMLA+W RLKYPHIA+GALASSAPIL F++I P + +Y+IVS FK S +C +TI+ S
Sbjct: 76 GGMLASWFRLKYPHIALGALASSAPILYFDNITPQDGYYSIVSKSFKETSKTCHDTIRRS 135
Query: 264 WGELVSV-GQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPL 322
WGE+ + G+ GL L+K F C +L ++ ++ + ++S ++ A + PY +
Sbjct: 136 WGEIDRIAGKTRGGLSILSKQFKTCGKLKTSSEIKNLMDSVFTMAAQYNDPYEN------ 189
Query: 323 PGYPIREVCKKID-NAPDATSILERIFEGVSVYYNYTGNVDCFQLD--DDPHGLDGWNWQ 379
P+R +C ID A +++++++ GV Y D ++ +DP L+ + WQ
Sbjct: 190 ---PVRGICVAIDEEAKKKSNVIKQVVAGVIAYLGERPCYDVYEFGYPNDP--LNQYGWQ 244
Query: 380 V 380
V
Sbjct: 245 V 245
>gi|323450637|gb|EGB06517.1| hypothetical protein AURANDRAFT_71973 [Aureococcus anophagefferens]
Length = 939
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 171/314 (54%), Gaps = 20/314 (6%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPR 134
Q LDHF F TF QR ++ DHW P GPIF+YCGNE D+ + +G +W+ A
Sbjct: 41 QNLDHFDFTTNATFEQRVFVHADHW-SPG--GPIFVYCGNEDDVTLYVNATGLMWEHAAA 97
Query: 135 FGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEAS 194
FGAMLVF EHRYYGE++P+G+ L YL+ EQALAD + +K AE +
Sbjct: 98 FGAMLVFVEHRYYGETLPFGAASF---EPGRLRYLSHEQALADLVNALRRIKATYGAENA 154
Query: 195 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE-DIVPPETFYNIVSSD---FK 250
V FGGSYGGMLAAW+R+KYP +GA+A+SAPIL F+ D E ++ +V+ D
Sbjct: 155 KTVAFGGSYGGMLAAWLRMKYPAAVVGAVAASAPILAFDGDGFDGEAYWEVVTRDATAAA 214
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMV 310
+ +C ++E++ L + +L++ F C + LA A+ +AM
Sbjct: 215 GAAPACAANVREAFSALFRADRD-----DLSRIFRTCGPVADRSRLALLALFAFDTMAMG 269
Query: 311 DYPYPSDFM----MPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQ 365
+YPY S ++ + LP +P+R C+ + D +L + V+YN +G + C +
Sbjct: 270 NYPYESTYLTHGEVALPAFPVRAACEHLAGPLDGDEALLAALAAAAGVFYNASGALACNE 329
Query: 366 LDDDPHGLDGWNWQ 379
L D W+WQ
Sbjct: 330 LPADVEEDGIWDWQ 343
>gi|298710358|emb|CBJ31975.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 596
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 114/299 (38%), Positives = 157/299 (52%), Gaps = 47/299 (15%)
Query: 102 PNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQ 161
P GPIF Y GNE D+ + SG +W+ AP F A+LVF EHR+YGES+P+G+ + +
Sbjct: 137 PGAPGPIFFYTGNEADVSLYLEASGLMWENAPAFNALLVFAEHRFYGESLPFGAPD---K 193
Query: 162 NATTLSYLTAE--QALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIA 219
L TA QALAD+A +T LKQ L AE +PV+ FGGSYGGMLA+W+RLKYPHI
Sbjct: 194 RREFLRQATAGTPQALADYARLVTALKQELGAEGAPVIAFGGSYGGMLASWLRLKYPHIV 253
Query: 220 IGALASSAPILQFEDI---VP-PETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKEN 275
GA+A+SAP+L E + P PE F V++ + + G+ V
Sbjct: 254 HGAIAASAPVLALEGLHRPTPNPEAFAETVTAAAGPAGGAADSCAANPRGDGALV----- 308
Query: 276 GLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMM-----PLPGYPIREV 330
+LA W +A+ YLAM ++PY + +M+ LP +P+RE
Sbjct: 309 -------------------ELAWWARAAFDYLAMGNFPYATGYMLNSGEVELPPWPLREA 349
Query: 331 CKKIDN----APDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHG-----LDGWNWQV 380
C + + A D +L + + + VYYN TG V CF + D WNWQ
Sbjct: 350 CSYLADPTLQAEDDDVLLGALADAIGVYYNATGEVGCFTPAAGANNASSVDADNWNWQA 408
>gi|118396082|ref|XP_001030384.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89284685|gb|EAR82721.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 495
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 114/340 (33%), Positives = 188/340 (55%), Gaps = 37/340 (10%)
Query: 68 YETRYFEQRLDH--FSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y+T YF+Q+++H F DL TF Q+YL+ D + GPI YCGNEG IE F N+
Sbjct: 20 YKTYYFDQKVNHEGFEMNDL-TFKQKYLVKDDFYRYDK--GPILFYCGNEGPIEMFYNNT 76
Query: 126 GF-VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
GF +A ++VF EHRY+GES P+G+ E + + YLT+ QAL D+ VF+
Sbjct: 77 GFQTHTLAKELNGLVVFMEHRYFGESWPFGNEEESLKKGNN-KYLTSLQALNDYVVFLNW 135
Query: 185 LKQNLSA--EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF--EDIVPPET 240
K++L + PV+ GGSYGGMLAAW+R+K+P++ +LA+SAPI QF + +
Sbjct: 136 FKKSLGCADDECPVIAIGGSYGGMLAAWIRMKFPNVVDASLAASAPIYQFLNREGLNQTL 195
Query: 241 FYNIVSSDFKRESASCFNTIKESWGELVSV-----------GQKENGLLELTKTFHLCRE 289
FY+I++ ++ + C + I +++ L ++ Q ++ +++ + C
Sbjct: 196 FYSIITRNYAQN--GCSDKIHQAYQYLTNIIDSPMSTKYFKYQYDSIFANISQAMNTCEP 253
Query: 290 L-NST--EDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILER 346
+ NST L ++++AYSY+AM +YP S F+ +P +P C ++ A ++ S +
Sbjct: 254 ITNSTGLTQLRTYMDTAYSYMAMTNYPQASSFLRSMPAWPANASCIPME-AVNSNSTVFE 312
Query: 347 IFEGVSV----YYNYTGNVDCFQLDDDPHG-----LDGWN 377
+F + + +YNY + +C + G + GWN
Sbjct: 313 LFSAIKLSTDTFYNYDQSANCSDISQGDDGASDNDMSGWN 352
>gi|159489554|ref|XP_001702762.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280784|gb|EDP06541.1| predicted protein [Chlamydomonas reinhardtii]
Length = 419
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 129/396 (32%), Positives = 190/396 (47%), Gaps = 69/396 (17%)
Query: 11 SLYLSPVITIVIISILSPLSLAAQPSKFRRAPRFVGKL--PHLTEPPQRQQRQQQQQYRY 68
SL+ + +V++S + ++ +P R KL P + ++ Q +
Sbjct: 10 SLWSCAALAVVLLSSFTVQAILPRPGLVRLGNGLSRKLQRPFSSVATHSLRKDLLAQCKL 69
Query: 69 ETRYFEQRLDHFS----FADLPTFSQRYLINTDHWVGPN---RLGPIFLYCGNEGDIEWF 121
+ R LDHFS D+ TF QRY + HW N GPIF Y GNE D+ +
Sbjct: 70 QLR--NASLDHFSRVPPAGDVTTFPQRYFVCASHWQRENPDGTPGPIFFYLGNEADVTLY 127
Query: 122 AVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF 181
N+G +W+ A FGAMLVF EHRYYGES+PYG + + YL+AEQ
Sbjct: 128 LNNTGLMWESAADFGAMLVFAEHRYYGESVPYGEAVKKH-----MGYLSAEQ-------L 175
Query: 182 ITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP--- 237
I LK+ + + V+ FGGSYGGMLAAWMRLKYPH+ GA+A+SAPI F VP
Sbjct: 176 IMELKEQFQLPQGTAVIGFGGSYGGMLAAWMRLKYPHVLDGAIAASAPIWNFLGEVPAFD 235
Query: 238 PETFYNIVS---SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
+F V+ S+ + +C + +K +W + S G + G
Sbjct: 236 AGSFAKGVTYDASELAGSAPACVDNVKATWDVMASYGNDDAGRGG--------------- 280
Query: 295 DLADWLESAYSYLAMVDYPYPSDFMM----PLPGYPIREVCKKIDNAPD--ATSILERIF 348
AM ++PYPS ++ LP +P+R C+ + D +L+ +
Sbjct: 281 -------------AMGNFPYPSSYITNGHGQLPAFPVRAACEPLAGGDDWVDADLLDAMA 327
Query: 349 EGVSVYYNYTGNVDCFQ--LDDDP---HGLDGWNWQ 379
V V+YN+TG+++CF DP H + W++Q
Sbjct: 328 AAVGVFYNHTGDLECFDPFAGTDPDSDHDANWWDYQ 363
>gi|187447074|emb|CAO84851.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447076|emb|CAO84852.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447078|emb|CAO84853.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447080|emb|CAO84854.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447084|emb|CAO84856.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447086|emb|CAO84857.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447088|emb|CAO84858.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447090|emb|CAO84859.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447092|emb|CAO84860.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447094|emb|CAO84861.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447096|emb|CAO84862.1| ENSANGG00000016082 protein [Anopheles gambiae]
gi|187447098|emb|CAO84863.1| ENSANGG00000016082 protein [Anopheles gambiae]
Length = 227
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 92/225 (40%), Positives = 132/225 (58%), Gaps = 10/225 (4%)
Query: 166 LSYLTAEQALADFAVFITNLKQ-NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
L YLT+EQALADFA+ + L N + A PV+ FGGSYGGMLAAW+R+KYPH+ GA+A
Sbjct: 1 LGYLTSEQALADFALVLRTLNPPNGTTRARPVIAFGGSYGGMLAAWIRIKYPHLVAGAIA 60
Query: 225 SSAPILQFEDIVPPETFYNIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKT 283
+SAP+ QF + F I++S ++ +A C + I+ SW L + +GL L +
Sbjct: 61 ASAPVRQFAGVTDCGIFNQILTSVYQVAYTADCADNIRRSWTTLQNYSTSADGLRLLNEK 120
Query: 284 FHLCRELNS----TEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPD 339
F C L TE L D+L Y LAM++YPYPS F+ P+P YP+RE C ++
Sbjct: 121 FKFCTNLTKGTDVTETLFDYLTDVYGNLAMINYPYPSSFLAPVPAYPVREFCGRLAQNYT 180
Query: 340 ATSILERIFEGVSVYYNYTGNVDCFQLDD--DPHGLD--GWNWQV 380
+L+ + +S+YYNY G C ++ D G+ GW++Q
Sbjct: 181 GVELLDHLQSALSIYYNYDGKAACLNINSSYDGTGISDRGWDFQA 225
>gi|440913231|gb|ELR62707.1| Dipeptidyl peptidase 2, partial [Bos grunniens mutus]
Length = 413
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 109/290 (37%), Positives = 169/290 (58%), Gaps = 31/290 (10%)
Query: 106 GPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT 165
GPIF Y GNEGD+ FA NSGF+ ++A + GA++VF EHRYYG+S+P+G ++ T
Sbjct: 14 GPIFFYTGNEGDVWSFANNSGFILELAEQQGALVVFAEHRYYGKSLPFGERST-WRGYTE 72
Query: 166 LSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALAS 225
L LT EQALADFA + L+Q L A +P + FGGSYGGML+A++R+KYPH+ GALA+
Sbjct: 73 L--LTVEQALADFAGLLRALRQELEAPDAPAIAFGGSYGGMLSAYLRIKYPHLVAGALAA 130
Query: 226 SAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFH 285
SAP++ + P F+ VS+DF+ +S C +++++ ++ + Q+ F
Sbjct: 131 SAPVVSAAGLGDPYQFFQDVSADFQGQSPECARAVQDAFRQIRDLFQQ--------GEFG 182
Query: 286 LCRELNSTEDLAD---WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS 342
C+ L+ +DL + +A++ LAM+DYPY +DF+ LP +P+ C ++ + +
Sbjct: 183 TCQPLSGPKDLTQLFGFARNAFTVLAMMDYPYATDFVGHLPAHPVG--CSRLLSESSRIA 240
Query: 343 ILERIFEGVSVYYNYTGNVDC------FQLDDDPHGL------DGWNWQV 380
L + + YN +G C +Q DP G W++QV
Sbjct: 241 GLRAL---AGLVYNSSGIEPCYDIYLQYQACADPTGCGLGSDAKAWDYQV 287
>gi|187447082|emb|CAO84855.1| ENSANGG00000016082 protein [Anopheles gambiae]
Length = 227
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 91/225 (40%), Positives = 131/225 (58%), Gaps = 10/225 (4%)
Query: 166 LSYLTAEQALADFAVFITNLKQ-NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
L YLT+EQALADFA+ + L N + A PV+ FGGSYGGMLAAW+R+KYPH+ GA+A
Sbjct: 1 LGYLTSEQALADFALVLRTLNPPNGTTRARPVIAFGGSYGGMLAAWIRIKYPHLVAGAIA 60
Query: 225 SSAPILQFEDIVPPETFYNIVSSDFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKT 283
+SAP+ QF + F I++S ++ +A C + I+ SW L + +GL L +
Sbjct: 61 ASAPVRQFAGVTDCGIFNQILTSVYQVAYTADCADNIRRSWTTLQNYSTSADGLRLLNEK 120
Query: 284 FHLCRELNS----TEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPD 339
F C L TE L D+L Y AM++YPYPS F+ P+P YP+RE C ++
Sbjct: 121 FKFCTNLTKGTDVTETLFDYLTDVYGNXAMINYPYPSSFLAPVPAYPVREFCGRLAQNYT 180
Query: 340 ATSILERIFEGVSVYYNYTGNVDCFQLDD--DPHGLD--GWNWQV 380
+L+ + +S+YYNY G C ++ D G+ GW++Q
Sbjct: 181 GVELLDHLQSALSIYYNYDGKAACLNINSSYDGTGISDRGWDFQA 225
>gi|159470599|ref|XP_001693444.1| predicted protein [Chlamydomonas reinhardtii]
gi|158282947|gb|EDP08698.1| predicted protein [Chlamydomonas reinhardtii]
Length = 264
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 103/274 (37%), Positives = 159/274 (58%), Gaps = 23/274 (8%)
Query: 106 GPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT 165
GPIF Y GNE ++E + +G +W+ A FGA+++F EHRYYG++ P+G ++Q
Sbjct: 1 GPIFFYAGNEANVELYVNVTGLIWENAQAFGALIIFVEHRYYGKTQPFGPD--SWQ--VD 56
Query: 166 LSYLTAEQALADFAVFITNLKQNLSAEA---SPVVLFGGSYGGMLAAWMRLKYPHIAIGA 222
SYLT EQALAD+A + +LK + A SPV+ FGGSYGGML+AWMR+KYPHI GA
Sbjct: 57 PSYLTVEQALADYAALLWHLKADSPAGGAADSPVIAFGGSYGGMLSAWMRVKYPHIIAGA 116
Query: 223 LASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK 282
+A+SAP+ F +V + ++ + C ++ ++G L + + G L++
Sbjct: 117 VAASAPVAAFPGLVTYD------ATPAAGSAPECVTNVRLAFGNLRQLSRFAEGRAALSQ 170
Query: 283 TFHLCREL---NSTEDLADWLESAYSYLAMVDYPYPSDFM-----MPLPGYPIREVCKK- 333
LC+ L D A WL+ A+ AM +YP+PS ++ PLP +P+R C
Sbjct: 171 LLRLCKPLADEGEALDAAYWLQGAFDAFAMGNYPFPSSYISDNPDRPLPAWPMRAACTHM 230
Query: 334 -IDNAPDATSILERIFEGVSVYYNYTGNVDCFQL 366
+ + ++++E + + +V YN TG+V CF L
Sbjct: 231 VVGRSTKPSTLIEALRDAAAVLYNVTGDVQCFDL 264
>gi|255081901|ref|XP_002508169.1| predicted protein [Micromonas sp. RCC299]
gi|226523445|gb|ACO69427.1| predicted protein [Micromonas sp. RCC299]
Length = 590
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 116/315 (36%), Positives = 162/315 (51%), Gaps = 46/315 (14%)
Query: 107 PIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQ-NATT 165
PIF Y GNE ++E + +G +W+ A FGA+LVF EHRYYGES P E +A+
Sbjct: 140 PIFFYTGNEANVELYLNATGLMWEHAESFGAVLVFAEHRYYGESKPKPKEEDGNALDASN 199
Query: 166 LS--------------YLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWM 211
L YLT+EQA+AD+A I LK + A +PV FGGSYGGMLA WM
Sbjct: 200 LGGIIPGHLKKKGQYPYLTSEQAMADYATLIRELKAEIRAPDAPVFAFGGSYGGMLATWM 259
Query: 212 RLKYPHIAIGALASSAPILQF--ED-IVPPETFYNIVSSDFKRESAS---CFNTIKESWG 265
RLKY ++ GA+A SAP+ F ED V P F + V+ D S C ++ ++
Sbjct: 260 RLKYANVVDGAVAGSAPVWSFVGEDPPVDPGAFADGVTMDATAAGGSPPACAPNVRAAFA 319
Query: 266 ELVSVGQKENGLLELTKTFHLCRE--LNSTEDLAD---WLESAYSYLAMVDYPYPSDFMM 320
EL+ + E + LC + L S D+ D W + A+ YLAM ++PY S +++
Sbjct: 320 ELLR--RSETDPKSIKAPMRLCDDTPLGSPTDVLDVALWAQGAFDYLAMGNFPYASSYIL 377
Query: 321 ----PLPGYPIREVC-------KKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQL--- 366
LP YP R C K ++ DA +L + + V VYYNY+ +CF
Sbjct: 378 NGDGTLPPYPFRVACGGAMADPKLLNKGGDA--LLSALADAVGVYYNYSKTQECFDTRHG 435
Query: 367 --DDDPHGLDGWNWQ 379
DD + W++Q
Sbjct: 436 SNDDSDEDGELWDYQ 450
>gi|403340721|gb|EJY69653.1| Lysosomal carboxypeptidase [Oxytricha trifallax]
Length = 477
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 113/325 (34%), Positives = 163/325 (50%), Gaps = 20/325 (6%)
Query: 68 YETRYFEQRLDHFSF--ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y+T+ F+ +DHF+ + TF RYLIN + GP PI YCGNEG I F NS
Sbjct: 15 YQTKQFDADIDHFTTQGSTTNTFKLRYLINDKYVTGPGPW-PILFYCGNEGIITDFYDNS 73
Query: 126 GFVWDIAPRFGAM-LVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
GFV +VF EHRYYG+SMP+G N +++LT +QA+ D+ +
Sbjct: 74 GFVTTTLATATNALVVFAEHRYYGQSMPFGKDSFKPGN---VNFLTIDQAMMDYVKLLQF 130
Query: 185 LKQNLSA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
+K + + SPV+ FGGSYGGM+AAW+R++YP I GA ASSAPIL F V P F
Sbjct: 131 IKASDNRFTNSPVIAFGGSYGGMIAAWIRMRYPQIIYGAHASSAPILFFPGTVSPYAFNE 190
Query: 244 IVSSDFKR--ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---LAD 298
+ + ++ + C I+ + L ++ F+ C ST+D L
Sbjct: 191 LATRSYQSATQDGRCAANIQYGFKILNQWAADNTTYAKIKDYFNACVAPASTDDIQFLLG 250
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKK--IDNAPDATSILERIFEGVSVYYN 356
+ A +A V+YPY ++F LP P++ C K +D D ++ + + VY+
Sbjct: 251 EISDALGTMAQVNYPYDTNFTRFLPANPVQTACTKGAVDQKSDDDGYVKGLAQAFMVYHG 310
Query: 357 YTGNVDCFQLDDDP-HGLDGWNWQV 380
C DP G GW +QV
Sbjct: 311 DK----CVSFKPDPSDGTSGWGYQV 331
>gi|167534702|ref|XP_001749026.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772450|gb|EDQ86101.1| predicted protein [Monosiga brevicollis MX1]
Length = 433
Score = 169 bits (427), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 100/272 (36%), Positives = 152/272 (55%), Gaps = 21/272 (7%)
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSY--LTAEQALADFAVF 181
++G +W+ A F A++VF EHR++G+S + +T Y + EQA+AD+ F
Sbjct: 13 HTGLMWENAADFKALIVFAEHRFFGQSQVTPGAD----GPSTSEYPLFSVEQAMADYNHF 68
Query: 182 ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP---P 238
+ KQN S E SPV++FGGSYGGMLAAW+R+KYP +GA+A+SAPI F P
Sbjct: 69 LFEFKQNRSIEDSPVIVFGGSYGGMLAAWLRIKYPETFLGAVAASAPISGFAGQQPEWDS 128
Query: 239 ETFYNIVSSD---FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED 295
T++ +V+ D A+C + ++ S+ L G E+G L+ F LC+ L ST D
Sbjct: 129 NTYWQVVTRDATPAAGAPAACADNVRNSFVTLFKTGASESGRAHLSDLFRLCKPLGSTSD 188
Query: 296 ---LADWLESAYSYLAMVDYPYPSDFMMP----LPGYPIREVCKKIDNAP-DATSILERI 347
LA W+ A+ +AM D+PYPS+++ LP YP+ C+ + A ++L+ +
Sbjct: 189 VQALAMWIAYAWDTMAMGDFPYPSNYLTSNGPMLPAYPVTAACQHLATANLTGDALLQGV 248
Query: 348 FEGVSVYYNYTGNVDCFQLDDDPHGLDG-WNW 378
SV+ N T N+ C + D DG W+W
Sbjct: 249 LAAASVFTNATANLQCNDVPFDDVEQDGIWDW 280
>gi|384249050|gb|EIE22532.1| hypothetical protein COCSUDRAFT_16137, partial [Coccomyxa
subellipsoidea C-169]
Length = 291
Score = 167 bits (423), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 95/261 (36%), Positives = 143/261 (54%), Gaps = 23/261 (8%)
Query: 113 GNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAE 172
GNE +E + +GF+W++ A+L+F EHRYYG+S P G + + SYL+ E
Sbjct: 2 GNEDAVEVYVNFTGFMWELGREMNALLIFAEHRYYGDSQPLGPSSLDRDP----SYLSIE 57
Query: 173 QALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF 232
QALADFA I ++K+ A SPV+ FGGSYGGMLAAW+R KYP+ GA+A SAP+ +
Sbjct: 58 QALADFATLIYHVKEKHGARDSPVIAFGGSYGGMLAAWLRAKYPNAVQGAIAGSAPVGAY 117
Query: 233 EDIVPPETFYNIVSSDFKRESAS---CFNTIKESWGELVSVGQKENGLLELTKTFHLCRE 289
+V+ D E+ + C + + EL++ ++ + L F LC
Sbjct: 118 -----------VVTYDASPEAGAAKHCRANVHSFFQELLADKERASFWQHLADVFRLCLA 166
Query: 290 LNS---TEDLADWLESAYSYLAMVDYPYPSDFM-MPLPGYPIREVCKKI-DNAPDATSIL 344
S E++A W++ A+ AM +YPYPS +M LP +P+R C + D P +L
Sbjct: 167 PESGKDVENVAYWVQGAFDSFAMGNYPYPSTYMGGALPAWPMRAACDHLADEKPSKEDLL 226
Query: 345 ERIFEGVSVYYNYTGNVDCFQ 365
+ + V + YN TG+ C+
Sbjct: 227 QGMAAAVGLLYNATGDAPCYN 247
>gi|149068959|gb|EDM18511.1| prolylcarboxypeptidase (angiotensinase C) (predicted) [Rattus
norvegicus]
Length = 193
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 78/148 (52%), Positives = 102/148 (68%), Gaps = 5/148 (3%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
Y YF+Q++DHF F+D TF QRYL+ HW G I Y GNEGDI WF N+GF
Sbjct: 45 YSVHYFQQKVDHFGFSDTRTFKQRYLVADKHW--QRNGGSILFYTGNEGDIVWFCNNTGF 102
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+WD+A AMLVF EHRYYG+S+P+G +++++ L++LT+EQALADFA I +LK+
Sbjct: 103 MWDVAEELKAMLVFAEHRYYGKSLPFGRD--SFKDSQHLNFLTSEQALADFAELIRHLKE 160
Query: 188 NL-SAEASPVVLFGGSYGGMLAAWMRLK 214
+ E PV+ GGSYGGMLAAW R+K
Sbjct: 161 TIPGTEGQPVIAIGGSYGGMLAAWFRMK 188
>gi|118346549|ref|XP_977067.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89288520|gb|EAR86508.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 502
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 105/339 (30%), Positives = 173/339 (51%), Gaps = 38/339 (11%)
Query: 67 RYETRYFEQRLDHFSF-ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+Y+T+YF+Q +DH F TF Q+YLI D++ GPI YCGNE +++ +
Sbjct: 18 QYQTKYFDQLVDHIGFETGDKTFKQKYLIKDDYYRYDK--GPILFYCGNEAPVDFSFGGA 75
Query: 126 GFVWD-IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
GF+ +A A++VF EHRY+GES P+G+ + +++ YLT+ QA+ D+A F+
Sbjct: 76 GFMHTTLAQELNALVVFMEHRYFGESQPFGTEKESFKKGNN-KYLTSFQAINDYAKFLVW 134
Query: 185 LKQNLSA--EASPVVLFGG----------SYGGMLAAWMRLKYPHIAIGALASSAPILQF 232
K++L + PVV FG SYGGML+AW+R+K+P I +LASSAPI +
Sbjct: 135 FKKSLGCGDDECPVVAFGALSNIFINYKASYGGMLSAWIRMKFPEIIDVSLASSAPIFLY 194
Query: 233 EDI--VPPETFYNIVSSDFKRESASCFNTIKESWGELVSV-----------GQKENGLLE 279
E+ + FY IV+ + E C I + L + Q + L E
Sbjct: 195 ENREGIDETLFYKIVTDTY--EQNGCNTQIHRAMNILTDLINSPVPSFLFKIQNKKILNE 252
Query: 280 LTKTFHLCRELNSTED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
+ + C+ + ++ L +++ AYSY++M +YP F+ +P +P C +
Sbjct: 253 INEGMKTCKPITDQDNLDVLRSYIDQAYSYMSMFNYPQEGHFVSKMPAWPANYSCTPFEA 312
Query: 337 APDATSI---LERIFEGVSVYYNYTGNVDCFQLDDDPHG 372
D ++I + + + V VYY++ +C + G
Sbjct: 313 INDKSTISQLFQAVKKSVDVYYDFEEQKECTNFNTGSTG 351
>gi|358333026|dbj|GAA51627.1| lysosomal Pro-X carboxypeptidase [Clonorchis sinensis]
Length = 412
Score = 164 bits (414), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 93/258 (36%), Positives = 141/258 (54%), Gaps = 7/258 (2%)
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+W+IA A ++F EHR+YG S+P+ ++++ YLTAEQALAD+A + LK
Sbjct: 1 MWEIAEELKAAVLFAEHRFYGSSLPF--VNDSFKDPQHFGYLTAEQALADYASLVQYLKS 58
Query: 188 NLSA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
++ E SPV+ FGGSYGGML+AW R KYP++ GA+A+SAPI F ++ FY+ +
Sbjct: 59 SVKDFENSPVIAFGGSYGGMLSAWFRYKYPNLIAGAIAASAPIWLFPNMSNCAGFYDTTT 118
Query: 247 SDFKRESAS-CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
F ++ C + W + +V ++ +G L F LC L + L D+L
Sbjct: 119 RAFSTSGSTVCTKNVALVWDSIRTVAKQHSGHELLRLMFQLCDPLPDEQKLIDYLIDFLG 178
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKID---NAPDATSILERIFEGVSVYYNYTGNVD 362
LAMV+YPY + F+ PG P++ CK + + +++R+ V NYT N
Sbjct: 179 TLAMVNYPYEASFIGTFPGEPVKYFCKGLSDAVHRDVDVDVVQRVATAVRSLTNYTKNQS 238
Query: 363 CFQLDDDPHGLDGWNWQV 380
C L+ D GLD W +
Sbjct: 239 CISLEGDLPGLDAKAWTL 256
>gi|340508129|gb|EGR33905.1| serine carboxypeptidase s28 family protein, putative
[Ichthyophthirius multifiliis]
Length = 429
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 90/259 (34%), Positives = 150/259 (57%), Gaps = 22/259 (8%)
Query: 108 IFLYCGNEGDIEWFAVNSGFVWDI-APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL 166
I YCGNEG IE F N+GFV I + A++++ EHRY+GES P+G + + Q
Sbjct: 1 IIFYCGNEGPIEMFYKNTGFVTQILSKELKALVLYMEHRYFGESQPFGDEKTSLQKGNN- 59
Query: 167 SYLTAEQALADFAVFITNLKQNLSAEAS--PVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
YLT+ QAL+D+ F+ +K++L + P++ GGSYGGMLAAW+R+K+P++ +LA
Sbjct: 60 QYLTSIQALSDYVEFLIYIKKSLQCQEKECPIIAVGGSYGGMLAAWIRMKFPNLVDASLA 119
Query: 225 SSAPILQF--EDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK 282
+SAPI QF + + ++ I+++++ C + IK ++ L ++ ++N +LE
Sbjct: 120 ASAPIFQFLNRENLDQTKYFQIITNNY-----PCRDKIKTAYQILQNLLNEKNKILEQNN 174
Query: 283 TFH-------LCRELNSTED---LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCK 332
F LC+ L + D L +++++AYSY+AM +YP + F+ LP +P C
Sbjct: 175 IFQQISQAMGLCQPLKNNTDVLNLRNYMDNAYSYMAMTNYPQETTFLKHLPPWPANFSCI 234
Query: 333 KIDNAPDATSILERIFEGV 351
N +S+ + +F V
Sbjct: 235 FFQNITQQSSVFD-LFSAV 252
>gi|29840883|gb|AAP05884.1| similar to NM_066318 prolylcarboxypeptidase (angiotensinase C) in
Homo sapiens [Schistosoma japonicum]
Length = 184
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 80/158 (50%), Positives = 105/158 (66%), Gaps = 6/158 (3%)
Query: 61 QQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW 120
+ Q++YET+YF ++DHFSF F +YLIN + + GPI Y GNEG IE
Sbjct: 27 NKDSQFKYETKYFRTKIDHFSFVTDGEFEIKYLINNESFSSG---GPILFYTGNEGAIET 83
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
FA NSGF+W +A A +VF EHRYYG S+P+G+ ++++ YLTAEQALAD+ +
Sbjct: 84 FAENSGFIWKLAEELNASVVFAEHRYYGTSLPFGND--SFKDRQHFGYLTAEQALADYVL 141
Query: 181 FITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPH 217
I LK N S A SPV+ FGGSYGGML+AW+R KYP+
Sbjct: 142 LINQLKINYSCFASSPVISFGGSYGGMLSAWIRQKYPN 179
>gi|219130123|ref|XP_002185222.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403401|gb|EEC43354.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 283
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 100/290 (34%), Positives = 149/290 (51%), Gaps = 41/290 (14%)
Query: 72 YFEQRLDHFSFADLP-----TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+F+Q L+HF DLP T+ QRY + D V PIFLY GNE +E + ++G
Sbjct: 5 FFQQALNHF---DLPRGQSGTYQQRYCVYNDFMVNETS-APIFLYTGNESPLEQYINHTG 60
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
+W+ A FGA +VF EHRY G+S+P ++ ++Y + QALADFA F+ LK
Sbjct: 61 LIWESAEAFGAQVVFIEHRYEGQSLPSPFI------SSCMAYSSTIQALADFARFV-ELK 113
Query: 187 QNLSA------EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP-- 238
+ PV+ FGGSYGGML+AW+R+KYP+ GA+A SAPI F P
Sbjct: 114 LFVDTGDFSRLRRRPVIAFGGSYGGMLSAWLRMKYPNTIAGAIAGSAPIWGFPRNFPSKI 173
Query: 239 ETFYNIVSSDFKR---------ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRE 289
+ Y ++ ++ ++ C + +W + ++ Q GL LT +F LC
Sbjct: 174 DAAYRVIQHGLQQSYPPTLKPLDNNHCATNLLATWPLISALAQHHEGLQLLTSSFRLCEV 233
Query: 290 LNSTEDLADWLESAYSYLAMVDYPYPSDFM--------MPLPGYPIREVC 331
L + L DW +S + LA +PY S ++ LP +P++ C
Sbjct: 234 LKDGDTLIDWAQSPWFDLAEGSFPYSSSYIPFALTHKDAKLPAWPLQAAC 283
>gi|218198654|gb|EEC81081.1| hypothetical protein OsI_23904 [Oryza sativa Indica Group]
Length = 232
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 81/153 (52%), Positives = 103/153 (67%), Gaps = 13/153 (8%)
Query: 68 YETRYFEQRLDHFSFADLPT--FSQRYLINTDHW-----VGPNRLGPIFLYCGNEGDIEW 120
+ YF Q LDHF+F + F Q+YL+N W G GPIF+Y GNEGDIEW
Sbjct: 82 FTAHYFPQELDHFTFTPNASALFYQKYLVNDTFWRRSAAAGETPAGPIFVYTGNEGDIEW 141
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
FA N+GF++DIAP FGA+L HR+YGES P+G+ + + L YLT+ QALADFAV
Sbjct: 142 FATNTGFMFDIAPSFGALL----HRFYGESKPFGNE--SNSSPEKLGYLTSTQALADFAV 195
Query: 181 FITNLKQNLSAEASPVVLFGGSYGGMLAAWMRL 213
IT+LK NLSA +SPVV+FGGSYGG + ++L
Sbjct: 196 LITSLKHNLSAVSSPVVVFGGSYGGSKSPIIKL 228
>gi|294881794|ref|XP_002769498.1| Dipeptidyl-peptidase 2 precursor, putative [Perkinsus marinus ATCC
50983]
gi|239872962|gb|EER02216.1| Dipeptidyl-peptidase 2 precursor, putative [Perkinsus marinus ATCC
50983]
Length = 455
Score = 154 bits (389), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 104/314 (33%), Positives = 155/314 (49%), Gaps = 54/314 (17%)
Query: 71 RYFEQRLDHFSFADLPT---FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
+ Q DHFSF + F QRY D + GP+F Y GNEG +E + ++G
Sbjct: 31 NWITQDRDHFSFGEGGNPGKFQQRYFTFKDFY---RPGGPLFFYVGNEGPVEIYVNHTGL 87
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+W++ GA LVF EHRYYG++ Y L YLT EQALAD++V I N
Sbjct: 88 MWELGSDLGAFLVFAEHRYYGKTQVYSD-----GTPDCLRYLTIEQALADYSVMI-NTYT 141
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+++ + FGGSYGGMLA+ R KYPHI GA+A+SAPI + P
Sbjct: 142 RIASSLIATIAFGGSYGGMLASAFRYKYPHIIDGAIAASAPIFAIGGVTP---------- 191
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD----WLESA 303
S + FN I ++ G + + LNS+ DLA+ W +
Sbjct: 192 ---EPSKTAFNEII----------TRDAGPVCAQRWCDNSSHLNSS-DLANQMVAWATAP 237
Query: 304 YSYLAMVDYPYPSDFMMP---------LPGYPIREVC---KKIDNAPDAT--SILERIFE 349
++YLAM ++P+PS+++ LP +P+R C +++D P + + + E
Sbjct: 238 WAYLAMGNFPFPSNYITAAMNVGGGADLPAHPVRVACEPFERLDTMPTKKEGAHIRALAE 297
Query: 350 GVSVYYNYTGNVDC 363
+S+YYN +G + C
Sbjct: 298 SLSIYYNASGELAC 311
>gi|323453591|gb|EGB09462.1| hypothetical protein AURANDRAFT_23831 [Aureococcus anophagefferens]
Length = 472
Score = 150 bits (380), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 104/313 (33%), Positives = 154/313 (49%), Gaps = 31/313 (9%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHW-VGPNRL--------GPIFLYCGNEGDI 118
YET++ Q L H + D T+ QRYL+N W G L GP+ Y GNEG +
Sbjct: 20 YETKWHTQSLTH-AKGDDRTYQQRYLVNDTFWGKGSAPLWRDDDSCPGPVLFYSGNEGPV 78
Query: 119 EWFAVNSGFVWD-IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALAD 177
+ F +GF+ D +AP++GA ++ E RYYG S+P+G+ +N + YL+ E LAD
Sbjct: 79 DGFWPANGFMTDYLAPKWGAYVLMAEARYYGASLPFGNASWTPEN---VQYLSTELILAD 135
Query: 178 FAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI-- 235
+A +T LK +L + PVV FGGSYGG L + RL YP + +G LA+SAPI ++
Sbjct: 136 YARLLTELKSSL--QGCPVVSFGGSYGGTLTTFFRLTYPDVVVGGLAASAPIGYYDPAHW 193
Query: 236 ----VPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN 291
V TF +I++ D+ + C + I+ + L + + L FHLC
Sbjct: 194 KDHGVDAYTFSDIIARDYDDAAPGCLDAIRATTDALNAASPE-----ALVDLFHLCDAAA 248
Query: 292 STEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGV 351
A + A L +DYP + +P +P+ C + A A + L R+ V
Sbjct: 249 LGPTRAALWQYALESLPQLDYPRA---VGSIPAWPVNHTCHLLARASTAAARL-RVAAEV 304
Query: 352 SVYYNYTGNVDCF 364
TG CF
Sbjct: 305 QAMVLGTGGETCF 317
>gi|443725197|gb|ELU12877.1| hypothetical protein CAPTEDRAFT_173918 [Capitella teleta]
Length = 517
Score = 150 bits (379), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 96/287 (33%), Positives = 156/287 (54%), Gaps = 29/287 (10%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
YFEQ LDHF ++ QRY +N D W G + GP+FLY G EG + V +G D+
Sbjct: 71 YFEQPLDHFDPQVSGSYKQRYWVNADFWSG--KEGPVFLYIGGEGGLTSMTVQAGEHVDL 128
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF--ITNLKQNL 189
A ++ A++ EHR+YGES+ ++ +L YL+++QALAD A F + + K NL
Sbjct: 129 AKKYKALIFAVEHRFYGESLNDDGLKL-----ESLQYLSSQQALADLAKFHAVMSQKYNL 183
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ------FEDIVPPETFYN 243
+ + + V FGGSY G L+AW R+KYPH+ A+ASSAP+ + D+V
Sbjct: 184 TDD-NHWVCFGGSYPGALSAWFRIKYPHLVHAAVASSAPVRALVDFQGYNDVVAASLSAT 242
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES- 302
IV+ S C + +KE++ + + K N LL+L F+ C L+ +D+ + +
Sbjct: 243 IVNG-----SDKCLSQVKEAFSTIDQMLDKGN-LLQLENDFYSCAPLDGEKDIYQFTSNV 296
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFE 349
A +++ +V Y +PG I +C+++ + D+ + L ++F
Sbjct: 297 ADAFMGVVQYN------QEIPGQSIAGLCEQMTASADSYANLRKLFR 337
>gi|383865215|ref|XP_003708070.1| PREDICTED: putative serine protease K12H4.7-like [Megachile
rotundata]
Length = 479
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/245 (36%), Positives = 137/245 (55%), Gaps = 16/245 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTD-HWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
+ +Q LDHF+ D T+S RYL N+ H G GPI + G E +I + +G +++
Sbjct: 48 WIQQPLDHFNPRDNRTWSMRYLENSKYHKEG----GPIMIMIGGEWEISTGFLTTGLMYE 103
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
IA GAM+ + EHRYYG+S P + +++ +N L YL+ +QALAD A FI K+
Sbjct: 104 IASTHGAMMYYTEHRYYGQSKP--TEDISSKN---LQYLSVDQALADLAYFIETKKEQDH 158
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
S V++ GGSY G +AAW RLKYPH+ GALASSAP+ D +Y +V+ +
Sbjct: 159 LRNSTVIVIGGSYAGSMAAWARLKYPHLIQGALASSAPVFAKADFYE---YYEVVTESIR 215
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELNSTEDLADWLES-AYSYL 307
R++ C + IK ++ + + +NG L FHLC ++ S D+ + + A +
Sbjct: 216 RQNEKCADDIKAAFDAVEKLLFTKNGPKRLKTYFHLCDAPDVKSHNDIGHLMNTLAEGFA 275
Query: 308 AMVDY 312
+V Y
Sbjct: 276 GIVQY 280
>gi|322795209|gb|EFZ18031.1| hypothetical protein SINV_09833 [Solenopsis invicta]
Length = 584
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 84/228 (36%), Positives = 129/228 (56%), Gaps = 13/228 (5%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ Q LDHF+ D T+S RY N+ + GPI + G E +I + G +++I
Sbjct: 48 WITQPLDHFNHRDNRTWSMRYKENS---LFLKNGGPILIMIGGEWEITDGYLQGGLMYEI 104
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
++G ++ + EHR+YG+S P + +++ +N L YL A+QALAD A FI K+ +
Sbjct: 105 GVKYGGLMYYTEHRFYGQSKP--TKDISSEN---LQYLNADQALADLAYFIETKKKEKNL 159
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR 251
E S V++ GGSY G +AAW RLKYPH+ GALASSAP+ D +Y +V+ R
Sbjct: 160 EKSTVIVVGGSYAGNMAAWARLKYPHLIQGALASSAPVQAKADFYE---YYEVVTKSLGR 216
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELNSTEDLA 297
S C +K ++ + + K++G +EL F+LC ++NS DL
Sbjct: 217 HSEKCVENVKIAFASVEELLAKQSGAIELKYLFNLCEVPDINSASDLG 264
>gi|402591691|gb|EJW85620.1| serine carboxypeptidase S28 family protein [Wuchereria bancrofti]
Length = 183
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 78/149 (52%), Positives = 96/149 (64%), Gaps = 12/149 (8%)
Query: 91 RYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGES 150
+YL N +++ GP+F Y GNEGDIE FA N+G +WD+APRF A +VF EHRYYG+S
Sbjct: 36 KYLANYSYFLCD---GPLFFYAGNEGDIEAFAQNTGIIWDLAPRFHAAIVFAEHRYYGDS 92
Query: 151 MPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS-------AEASPVVLFGGSY 203
PYG +Y + L YL Q LADFA IT LK + PV++FGGSY
Sbjct: 93 KPYGKR--SYMDVLRLGYLNDIQVLADFAQLITFLKIDDEELGFCPPGTEIPVIVFGGSY 150
Query: 204 GGMLAAWMRLKYPHIAIGALASSAPILQF 232
GGMLA W+R+KYPHI GA ASSAP+ F
Sbjct: 151 GGMLATWLRMKYPHIVDGAWASSAPLRNF 179
>gi|340715503|ref|XP_003396251.1| PREDICTED: putative serine protease K12H4.7-like [Bombus
terrestris]
Length = 479
Score = 147 bits (372), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 83/228 (36%), Positives = 132/228 (57%), Gaps = 13/228 (5%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ Q +DHF+ D T+ RY N+ ++ + GPI + G E I + +G ++++
Sbjct: 48 WIRQPVDHFNIRDNRTWLMRYYENSRYF---KKNGPILIMIGGEWAISKGFLEAGLMYEL 104
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A + A++ + EHRYYG+S P T ++ L YL+ +QALAD A FI K++ +
Sbjct: 105 ATTYNAIMYYTEHRYYGKSKPTEDT-----SSRNLQYLSVDQALADLAYFIETRKRDENL 159
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR 251
S V++FGGSY G +A W+RLKYPH+ GALASSAP+L D +Y +V+ +R
Sbjct: 160 RNSKVIVFGGSYAGNVATWVRLKYPHLVQGALASSAPVLAKVDFYE---YYEVVTESLRR 216
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELNSTEDLA 297
S C N +K ++ E+ + + G +LTK F+LC +++S +DL
Sbjct: 217 YSQKCVNEVKAAFDEVEELLAIKGGAQKLTKYFNLCDVPDVHSFKDLG 264
>gi|110749179|ref|XP_623676.2| PREDICTED: putative serine protease K12H4.7-like [Apis mellifera]
Length = 478
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 92/247 (37%), Positives = 133/247 (53%), Gaps = 15/247 (6%)
Query: 53 EPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYC 112
E PQ + Q+ + +Q LDHF+ D T+S RYL N+ + GPI +
Sbjct: 31 EEPQSLDKAIQENI--TEAWIQQPLDHFNPRDNRTWSMRYLENSRFF---KENGPILIMI 85
Query: 113 GNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAE 172
G E I + +G ++++A A + + EHRYYG+S P T ++ L YL+ +
Sbjct: 86 GGEWAISKGFLRAGLMYELASNHSASMYYTEHRYYGKSKPTNDT-----SSRNLQYLSVD 140
Query: 173 QALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQF 232
QALAD A FI K++ S S V++FGGSY G +A+W RLKYPH+ GALASSAP+L
Sbjct: 141 QALADLAYFIKTKKKDESRRNSTVIVFGGSYAGNVASWARLKYPHLIQGALASSAPVLAK 200
Query: 233 EDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--EL 290
D +Y +V+ +R S C IK ++ E+ + ENG L + F+LC +
Sbjct: 201 LDF---NEYYEVVTESLRRYSEKCVEEIKTAFDEVEELLYIENGPQRLKQYFNLCDVPNI 257
Query: 291 NSTEDLA 297
S DLA
Sbjct: 258 KSFNDLA 264
>gi|222615542|gb|EEE51674.1| hypothetical protein OsJ_33022 [Oryza sativa Japonica Group]
Length = 184
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 78/136 (57%), Positives = 97/136 (71%), Gaps = 9/136 (6%)
Query: 76 RLDHFSFADLP----TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
RLDHF+ +LP TF QRYL+N W G P+F+Y GNEGD+ FA N+GF+W+
Sbjct: 40 RLDHFN--ELPASNGTFRQRYLVNGTFWGGAA--APVFVYAGNEGDVALFASNTGFMWEA 95
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTE-VAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
APRF AMLVF EHRYYGES+P+G T A+ +A+ YLT QALADFA I +LK NL+
Sbjct: 96 APRFRAMLVFVEHRYYGESLPFGGTRAAAFADASAAGYLTTAQALADFAELILSLKSNLT 155
Query: 191 AEASPVVLFGGSYGGM 206
A +PVV+FGGSYGG+
Sbjct: 156 ACKAPVVIFGGSYGGI 171
>gi|242020732|ref|XP_002430805.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212516008|gb|EEB18067.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 478
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 91/248 (36%), Positives = 141/248 (56%), Gaps = 15/248 (6%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E +YF Q+LDHF + T++QRY +N D + PN GP FL G EG+ + +G
Sbjct: 27 EEKYFVQKLDHFDPTNTKTWNQRYFVN-DSFYQPN--GPFFLMIGGEGEASPKWMVNGTW 83
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
D A ++ A V EHR+YG+S P + ++ +N L YL++EQAL D A FI++L
Sbjct: 84 LDYAKKYNAYCVMVEHRFYGKSHP--TEDLGVKN---LKYLSSEQALGDLAYFISSLNNK 138
Query: 189 LSAEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
L+ P ++ GGSY G LAAWMRLKYPH+ +GA+++S P+L ++ E ++++V
Sbjct: 139 LNIFPPPKWIVMGGSYPGSLAAWMRLKYPHLVLGAVSTSGPLLA---LINFEEYFDVVKD 195
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELNSTEDLADWLES-AY 304
+ C I+ +++S+ G L K F LC ELN+ +D ++ ES A
Sbjct: 196 SLSSYNPECVTAIEAGTKQIMSLLIHPLGQRSLFKMFKLCDPLELNNEDDNSNLFESLAG 255
Query: 305 SYLAMVDY 312
++ +V Y
Sbjct: 256 NFAGVVQY 263
>gi|156537791|ref|XP_001608051.1| PREDICTED: putative serine protease K12H4.7-like [Nasonia
vitripennis]
Length = 476
Score = 145 bits (365), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 79/233 (33%), Positives = 130/233 (55%), Gaps = 13/233 (5%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+ + ++ EQ LDHF+ D T+ RY ++ N +GPIF+ G E I + +G
Sbjct: 43 KLDQKWIEQPLDHFNHRDNRTWQMRYYEEDKYF---NGIGPIFIMLGGEWTINPGFLQNG 99
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
+ D+A + GA++ + EHRYYG+S P ++ + YL +QALAD A FI N K
Sbjct: 100 LMHDLAKQHGALMFYTEHRYYGKSYP-----TQNMSSDNMQYLNVDQALADVAYFIDNRK 154
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ S V++FGGSY G +AAW+R+KYPH+ G++ASSAP+ D +Y +V+
Sbjct: 155 SEYNITDSKVIVFGGSYAGNMAAWIRIKYPHLIQGSVASSAPVYAKADFYE---YYEVVA 211
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELNSTEDLA 297
+ +R + C ++ ++ E + E G ++ K F++C+ +NS D+
Sbjct: 212 NSLRRHDSQCALDVENAFDETEELLVTEGGPEKIQKIFNICKTPNVNSMTDVG 264
>gi|241176364|ref|XP_002399549.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215495171|gb|EEC04812.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 317
Score = 144 bits (364), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 76/167 (45%), Positives = 102/167 (61%), Gaps = 6/167 (3%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPR 134
+R+DHF+F D T+ RY I W + GP+F Y GNE E F +G +W+ AP
Sbjct: 18 RRVDHFTFRDNRTYQMRYAIADQFW--DRKGGPVFFYTGNEDPYETFIKETGVIWEWAPD 75
Query: 135 FGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA- 193
F A++VF EHR+YG+S+P+G + +YQ+ L YLT+EQALAD+A + LK L+ A
Sbjct: 76 FKALIVFAEHRFYGKSLPFG--DESYQSPKNLGYLTSEQALADYAYLVVYLKTTLAGAAK 133
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ-FEDIVPPE 239
S V FGGSYGGMLA W R+KYPH+ L + Q F +V P
Sbjct: 134 SQFVAFGGSYGGMLATWFRIKYPHLIAATLRERPEVRQIFVSLVLPR 180
>gi|260797277|ref|XP_002593630.1| hypothetical protein BRAFLDRAFT_284130 [Branchiostoma floridae]
gi|229278856|gb|EEN49641.1| hypothetical protein BRAFLDRAFT_284130 [Branchiostoma floridae]
Length = 464
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 87/245 (35%), Positives = 134/245 (54%), Gaps = 14/245 (5%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
++ QRLDH++ ADL T+ QRY +N D + P GP+FL G EG + + +G +
Sbjct: 17 QWVTQRLDHYNDADLRTWQQRYFVN-DTFYKPG--GPVFLMIGGEGTADPIWMVTGSWIE 73
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A F A+ + EHRYYG+S P T V L YL++EQALAD A F + + +S
Sbjct: 74 YAKEFHALCLMLEHRYYGKSHPTEDTSVE-----NLQYLSSEQALADLAYFRNYMAEKMS 128
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ + FGGSY G L+AW RLKYPH+ GA+A+S P+L D V + +V
Sbjct: 129 LTDNKWITFGGSYPGSLSAWFRLKYPHLVAGAVATSGPLLAELDFV---EYVEVVRDSLA 185
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELNSTEDLAD-WLESAYSYL 307
C I+E+ + + + + G+ +L K F+LC ++N D+A+ W + +++
Sbjct: 186 TTGPECNKNIQEATDAVKQMLETQEGVEKLNKLFNLCSPLDVNVANDVANFWSTISGNFM 245
Query: 308 AMVDY 312
+V Y
Sbjct: 246 GVVQY 250
>gi|323446372|gb|EGB02557.1| hypothetical protein AURANDRAFT_9133 [Aureococcus anophagefferens]
Length = 138
Score = 144 bits (362), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 71/140 (50%), Positives = 90/140 (64%), Gaps = 6/140 (4%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
LDHF F TF QR ++ DHW P GPIFLYCGNE D+ + +G +W+ A FG
Sbjct: 1 LDHFDFTTNATFEQRVFVHADHWA-PG--GPIFLYCGNEDDVTLYVNATGLMWEHAAAFG 57
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPV 196
AMLVF EHRYYG ++P+G+ ++ L YL+ EQALAD + +K AE +
Sbjct: 58 AMLVFVEHRYYGATLPFGAASFEPEH---LRYLSHEQALADLVNALRRIKATYGAENAKT 114
Query: 197 VLFGGSYGGMLAAWMRLKYP 216
V FGGSYGGMLAAW+R+KYP
Sbjct: 115 VAFGGSYGGMLAAWLRMKYP 134
>gi|91078858|ref|XP_972061.1| PREDICTED: similar to thymus-specific serine protease [Tribolium
castaneum]
Length = 501
Score = 144 bits (362), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 95/296 (32%), Positives = 158/296 (53%), Gaps = 21/296 (7%)
Query: 25 ILSPLSLAAQPSKFR--RAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSF 82
+L+ ++++ S +R R R VG +L EP + ++ + E +F Q LDHF+
Sbjct: 8 LLTLFYISSEVSSWRIFRNGRMVGG--NLGEPKCNCKESSIKEVQEE--WFTQNLDHFNP 63
Query: 83 ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFP 142
D T+ QR+ N D + P GP+FL G EG+ + G + A +FGA++
Sbjct: 64 TDETTWKQRFYSN-DQFFDPKNGGPVFLMIGGEGEASIKWMTQGAWVNYAEKFGALMFQL 122
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEAS-PVVLFGG 201
EHRYYG+S P + +++ QN L YLT++QALAD A FIT + + S + FGG
Sbjct: 123 EHRYYGKSHP--TDDLSTQN---LKYLTSQQALADLATFITAMNEKYSLPPDVKWIAFGG 177
Query: 202 SYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIK 261
SY G LAAW+R KYPH+ GA+++S P+L D + ++ ++ S C ++
Sbjct: 178 SYPGSLAAWLRFKYPHLVHGAMSASGPLLAQVDF---KDYFRVIKESLATHSDDCVTAVQ 234
Query: 262 ESWGELVSVGQKENGLLELTKTFHLC----RELNSTEDLADWLES-AYSYLAMVDY 312
+ ++ + ++E G L + F LC +N+ +D+++ E+ A + +V Y
Sbjct: 235 QGVDQIGVLLKQEIGQANLNELFKLCDPVQNSINNEKDISNLYETIADDFAGVVQY 290
>gi|344309381|ref|XP_003423355.1| PREDICTED: LOW QUALITY PROTEIN: dipeptidyl peptidase 2-like
[Loxodonta africana]
Length = 579
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 100/315 (31%), Positives = 145/315 (46%), Gaps = 62/315 (19%)
Query: 54 PPQRQQRQQQQQYRYETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLY 111
P R R +R YFEQ LDHF+F TF QR+L++ W GPIF Y
Sbjct: 153 PAARAHRVADPGFREG--YFEQLLDHFNFERFGNKTFRQRFLVSEKFWK--RNEGPIFFY 208
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
GNEGD+ FA NSGF+ ++A R A++VF EHRYYG+S+P+G+ Q T LT
Sbjct: 209 TGNEGDVWSFANNSGFILELAAREAALVVFAEHRYYGKSLPFGAQST--QRGRT-ELLTV 265
Query: 172 EQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ 231
EQALADFAV + L+ + A+ +P + FGG + A + ++ S P
Sbjct: 266 EQALADFAVLLQALRASFGAQDAPAIAFGGRSANLSAPGGIVSPSSPHSPSVRSGPPP-- 323
Query: 232 FEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN 291
ESAS + +LT+ F R
Sbjct: 324 --------------------ESASRMTPLSSQ---------------DLTQLFAFAR--- 345
Query: 292 STEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGV 351
+ ++ LAM+DYPYP++F+ PLP P+R C ++ + + L + V
Sbjct: 346 ----------NXFTVLAMMDYPYPTNFLGPLPANPVRAGCDRLLSEKQSIVGLRAL---V 392
Query: 352 SVYYNYTGNVDCFQL 366
+ YN +G C+ +
Sbjct: 393 GLLYNSSGTEPCYSI 407
>gi|156354267|ref|XP_001623320.1| predicted protein [Nematostella vectensis]
gi|156210006|gb|EDO31220.1| predicted protein [Nematostella vectensis]
Length = 502
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 93/274 (33%), Positives = 144/274 (52%), Gaps = 28/274 (10%)
Query: 73 FEQRLDHFSFADLP-TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
FEQ +DHF F P T+ QRY +N W GP+ GP+ LY G E + + G + DI
Sbjct: 61 FEQYIDHFEFTPRPRTYLQRYWMNRAFWKGPD--GPVLLYVGGESVLSGGYIAGGHIVDI 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
A +GA+L EHRYYG+S +G + + YL+++ ALAD A F+ + K
Sbjct: 119 AKEYGALLFAVEHRYYGKSNFFGCLKT-----KNMRYLSSQLALADLAQFVAHAKNKFGL 173
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ + + +GGSY G L+AW R+KYPH+ IGA+ASSAP+ D + + N+V+S
Sbjct: 174 TDKNKWITYGGSYPGSLSAWFRIKYPHLVIGAVASSAPVEAQTDF---KDYNNVVASSLS 230
Query: 251 R----ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES---A 303
S C + I+E++ + + +N L K F C +++ D W+ + A
Sbjct: 231 SPLVGGSKLCMHNIEEAFKFVDRLLDTKN-FKTLEKDFIACNDISKLND--TWMFASNLA 287
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNA 337
++ +V Y +PG I VCK+++NA
Sbjct: 288 GFFMGLVQYN------NQVPGINIAYVCKQMNNA 315
>gi|350422894|ref|XP_003493318.1| PREDICTED: putative serine protease K12H4.7-like [Bombus impatiens]
Length = 478
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 88/275 (32%), Positives = 144/275 (52%), Gaps = 15/275 (5%)
Query: 25 ILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFAD 84
ILS L L + G H E P+ + + + Q +DHF+ +
Sbjct: 2 ILSFLVLLCFTCRLGHGVGLHGFHFHGLEEPRSLDKSTCENI--TELWIRQPVDHFNVRN 59
Query: 85 LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
T+ RY N+ ++ + GPI + G E I + +G ++++A + A++ + EH
Sbjct: 60 NCTWLMRYYENSRYF---KKNGPILIMIGGEWAISKGFLEAGLMYELASAYNAIMYYTEH 116
Query: 145 RYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYG 204
RYYG+S P T ++ L YL+ +QALAD A FI K++ S V++FGGSY
Sbjct: 117 RYYGKSKPTEDT-----SSRNLQYLSVDQALADLAYFIETRKKDEKLRNSKVIVFGGSYA 171
Query: 205 GMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESW 264
G +A W+RLKYPH+ GALASSAP+L D +Y +V+ +R S C + +K ++
Sbjct: 172 GNVATWVRLKYPHLVQGALASSAPVLAKVDFYE---YYEVVTESLRRHSQKCMDEVKAAF 228
Query: 265 GELVSVGQKENGLLELTKTFHLCR--ELNSTEDLA 297
++ + + G +L + F+LC +++S +DLA
Sbjct: 229 DDVEELLAIQGGAQKLKEYFNLCDVPDVHSFKDLA 263
>gi|195013134|ref|XP_001983807.1| GH15372 [Drosophila grimshawi]
gi|193897289|gb|EDV96155.1| GH15372 [Drosophila grimshawi]
Length = 505
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 105/320 (32%), Positives = 159/320 (49%), Gaps = 25/320 (7%)
Query: 25 ILSPLSLAAQPS--KFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSF 82
+L L++ AQ S FRR G L EP + Q+ E +FEQRLDHF
Sbjct: 9 VLLLLAIIAQCSSLNFRRGRHVNG---FLGEPSKVLSLQRS--LDVEELWFEQRLDHFKA 63
Query: 83 ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFP 142
+ T+ QRY IN H+V + P+F+ G EG+ +N G A FGA+ +
Sbjct: 64 DNRQTWQQRYFINDQHYVNDSN-APVFIMIGGEGEATKKWMNEGAWIHYAEHFGALCIQL 122
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPV-VLFGG 201
EHR+YG+S P G + + L+YL++EQALAD A F++ +K + +A+ + FGG
Sbjct: 123 EHRFYGKSHPTGDL-----STSNLAYLSSEQALADLANFVSAMKSKYNMKATQKWIAFGG 177
Query: 202 SYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIK 261
SY G LAAW R KYP + GA++SS P+L D ++ +V + C +
Sbjct: 178 SYPGSLAAWAREKYPDLIDGAISSSGPLLAEVDF---RQYFEVVKASLASYKPDCVEAVT 234
Query: 262 ESWGELVSVGQKENGLLELTKTFHLCREL-NSTE---DLADWLES-AYSYLAMVDYPYPS 316
S+ ++ + + G L + F C L +S E D+A+ E+ A ++ +V Y +
Sbjct: 235 RSFAQVEILLKHMIGQRNLDEKFKTCTPLKDSIETPLDIANLFENLAGNFAGVVQYNKDN 294
Query: 317 DFMMPLPGYPIREVCKKIDN 336
P I EVC + N
Sbjct: 295 S---PHANITIDEVCDVMLN 311
>gi|307200054|gb|EFN80400.1| Putative serine protease K12H4.7 [Harpegnathos saltator]
Length = 495
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 87/233 (37%), Positives = 125/233 (53%), Gaps = 13/233 (5%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ Q LDHF+ + T+S RY N+ + N GPI + G E I + G +++I
Sbjct: 47 WITQPLDHFNPRENRTWSMRYYENS-ALLRAN--GPILITIGGEWTISTGFLQGGLMYEI 103
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A G M+ + EHR+YG+S P T +A+ L YL+ +QALAD A FI K+ +
Sbjct: 104 ASVHGGMMYYTEHRFYGKSRPTKDT-----SASNLRYLSVDQALADLANFIETKKKEKNL 158
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR 251
E SPV++FGGSY G +A W RLKYPH+ GALASSAPI D +Y +V+ R
Sbjct: 159 ENSPVIVFGGSYAGNMATWARLKYPHLIQGALASSAPIYAKADFYE---YYEVVTRSLGR 215
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELNSTEDLADWLES 302
SA C +K ++ + + + G +L F LC ++ S DL + S
Sbjct: 216 HSAQCVADVKTAFESVEELLAAQGGPEKLKVYFDLCNVPDVKSPSDLGSLMNS 268
>gi|332021809|gb|EGI62155.1| Putative serine protease K12H4.7 [Acromyrmex echinatior]
Length = 493
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 97/279 (34%), Positives = 148/279 (53%), Gaps = 22/279 (7%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
++F Q LDHF ++ + QR+ +N D + PN GPIFL G EG + G +
Sbjct: 49 QWFLQYLDHFDPTNVNDWQQRFFVNVDFY-KPN--GPIFLMIGAEGTANASWMVEGEWIE 105
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A FGAM + EHRYYG+S P + +++ +N L YL++EQALAD A FI ++ +L
Sbjct: 106 YAKEFGAMCFYLEHRYYGKSHP--TIDLSVKN---LMYLSSEQALADLAYFIASVNVDLP 160
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+V FGGSYGG LAAWMR KYPH+ GA+++S P+L D +Y +V++ K
Sbjct: 161 RNTKWIV-FGGSYGGSLAAWMRAKYPHLVHGAVSTSGPLLAQIDF---SEYYQVVTNALK 216
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN----STEDLADWLES-AYS 305
S C I+E+ +L + G ++ K F LC ++ D+++ E+ A +
Sbjct: 217 EYSDQCVRIIQEANSQLNIMLHHTVGQQQIQKKFRLCDPIDPGYTKLVDISNLYEALASN 276
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVC-----KKIDNAPD 339
+ ++V Y + I VC +KI+ A D
Sbjct: 277 FASVVQYNKDNRQSSSFVNITIETVCDILVDEKIEKAID 315
>gi|156717482|ref|NP_001096281.1| protease, serine, 16 (thymus) precursor [Xenopus (Silurana)
tropicalis]
gi|134025460|gb|AAI35535.1| LOC100124847 protein [Xenopus (Silurana) tropicalis]
Length = 506
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 106/301 (35%), Positives = 160/301 (53%), Gaps = 36/301 (11%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
Y Q LDHF+ + TF+QRY IN ++W PN GP+FLY G E + F+V SG D+
Sbjct: 64 YIAQPLDHFNRRNNSTFNQRYWINEEYWNHPN--GPVFLYIGGESSLSEFSVLSGEHVDL 121
Query: 132 APRFGAMLVFPEHRYYGESM-PYGSTEVAYQNATTLSYLTAEQALADFAVF--ITNLKQN 188
A A+LV EHRYYG S+ P G T + +L+++QALAD A F + K N
Sbjct: 122 AQTHRALLVSLEHRYYGSSINPDGLT------LENIRFLSSQQALADLASFHMFISQKYN 175
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV--- 245
L+ + + + FGGSY G L+AW RLK+PH+ A+ASSAP+ D T YN V
Sbjct: 176 LTRQ-NTWICFGGSYPGSLSAWFRLKFPHLVYAAVASSAPVRAELDF----TGYNKVVAW 230
Query: 246 --SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES- 302
+ S C + +KE + + S+ QK N + +L K F+ C L ++D +++ +
Sbjct: 231 SLADPVIGGSEKCLDAVKEGFQAVDSLLQKGN-ITQLEKDFYSCGSLQGSDDYTEFVGNL 289
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCK----KIDNAPDATSILERI---FEGVSVYY 355
A ++ V Y S P ++++C+ K ++A + + R+ F G+S Y
Sbjct: 290 ADIFMGAVQYNGMS------PASNVQQICQLMTIKDNSAYEGLRSVNRMYMDFMGLSCVY 343
Query: 356 N 356
N
Sbjct: 344 N 344
>gi|195427485|ref|XP_002061807.1| GK16991 [Drosophila willistoni]
gi|194157892|gb|EDW72793.1| GK16991 [Drosophila willistoni]
Length = 512
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 103/331 (31%), Positives = 162/331 (48%), Gaps = 34/331 (10%)
Query: 20 IVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDH 79
++++ I SLA FR F G L ++ P Q + E +FEQRLDH
Sbjct: 19 LIVVGIAGTNSLA----HFRHGRLFNGFLGKPSKTPGLQGSLET-----EDLWFEQRLDH 69
Query: 80 FSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAML 139
F ++ T+ QRY +N D++ + PIFL G EG+ ++ G A F A+
Sbjct: 70 FQPSNTQTWQQRYFVNEDYYRN-DSTAPIFLMIGGEGEASKKWMHEGAWIHYAEHFSALC 128
Query: 140 VFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPV-VL 198
+ EHR+YG+S P + +++ N L YLT+EQALAD A F+ +K + S +
Sbjct: 129 IQLEHRFYGKSHP--TKDLSTSN---LVYLTSEQALADLANFVAAMKVKYDLKDSQKWIA 183
Query: 199 FGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFN 258
FGGSY G LAAW R KYPH+ G+++SS P+L D + ++++V + C
Sbjct: 184 FGGSYPGSLAAWAREKYPHLIYGSISSSGPLLAEVDF---KEYFDVVKASLASYKPDCVE 240
Query: 259 TIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDF 318
+ S+G++ + + G+ L KTF C + +S ++ L + S+F
Sbjct: 241 AVTRSFGQVEILLKHMIGMANLDKTFKTCTPIK---------DSIHNDLDI------SNF 285
Query: 319 MMPLPGYPIREVCKKIDNAPDATSILERIFE 349
L G V DN+P AT ++ I +
Sbjct: 286 FENLAGNFAGVVQYNKDNSPHATITIDDICD 316
>gi|270004132|gb|EFA00580.1| hypothetical protein TcasGA2_TC003450 [Tribolium castaneum]
Length = 473
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 88/269 (32%), Positives = 144/269 (53%), Gaps = 17/269 (6%)
Query: 50 HLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIF 109
+L EP + ++ + E +F Q LDHF+ D T+ QR+ N D + P GP+F
Sbjct: 5 NLGEPKCNCKESSIKEVQEE--WFTQNLDHFNPTDETTWKQRFYSN-DQFFDPKNGGPVF 61
Query: 110 LYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYL 169
L G EG+ + G + A +FGA++ EHRYYG+S P + +++ QN L YL
Sbjct: 62 LMIGGEGEASIKWMTQGAWVNYAEKFGALMFQLEHRYYGKSHP--TDDLSTQN---LKYL 116
Query: 170 TAEQALADFAVFITNLKQNLSAEAS-PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAP 228
T++QALAD A FIT + + S + FGGSY G LAAW+R KYPH+ GA+++S P
Sbjct: 117 TSQQALADLATFITAMNEKYSLPPDVKWIAFGGSYPGSLAAWLRFKYPHLVHGAMSASGP 176
Query: 229 ILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC- 287
+L D + ++ ++ S C +++ ++ + ++E G L + F LC
Sbjct: 177 LLAQVDF---KDYFRVIKESLATHSDDCVTAVQQGVDQIGVLLKQEIGQANLNELFKLCD 233
Query: 288 ---RELNSTEDLADWLES-AYSYLAMVDY 312
+N+ +D+++ E+ A + +V Y
Sbjct: 234 PVQNSINNEKDISNLYETIADDFAGVVQY 262
>gi|444521185|gb|ELV13126.1| Dipeptidyl peptidase 2 [Tupaia chinensis]
Length = 471
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 98/307 (31%), Positives = 152/307 (49%), Gaps = 40/307 (13%)
Query: 68 YETRYFEQRLDHFSFADLP--TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ RYF+Q LDHF+F TF QR L++ W GPIF Y GNEGDI FA NS
Sbjct: 43 FRERYFDQLLDHFNFERFGNRTFRQRVLVSDRFWR--RGEGPIFFYTGNEGDIWTFANNS 100
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
GF+ ++A + A+LVF EH G++ EV AD
Sbjct: 101 GFLAELAAQQAALLVFAEH-LRGDAQRLPEDEVPPPGGGG----------AD-------- 141
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
S + V SYGGML+A++R+KYPH+ GALA+SAP++ + F+ V
Sbjct: 142 ----SKKGPAVTRVTPSYGGMLSAYLRMKYPHLVAGALAASAPVIAVAGLGDSYQFFRDV 197
Query: 246 SSDFKRESASCFNTIKESWGELVSV---GQKENGLLELTKTFHLCRELNSTEDLAD---W 299
++DF +S C ++E++ ++ + G E +++ F C+ L+ DL +
Sbjct: 198 TTDFTSQSPECAQGVREAFRQMKDLFLQGAHET----VSREFGTCQPLSGPRDLTQLFAF 253
Query: 300 LESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTG 359
+A++ LAM+DYPYP+ FM P +P++ C ++ L + + YN +G
Sbjct: 254 ARNAFTVLAMMDYPYPTSFMGHFPAHPVKVGCARLLREAQPIPGLRAL---TGLVYNSSG 310
Query: 360 NVDCFQL 366
C+ +
Sbjct: 311 TEPCYDI 317
>gi|322785978|gb|EFZ12594.1| hypothetical protein SINV_07473 [Solenopsis invicta]
Length = 494
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 96/282 (34%), Positives = 150/282 (53%), Gaps = 19/282 (6%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+ ++F Q LDHF+ D+ + QRY +N+D + PN GPIFL G EG + G
Sbjct: 46 KEQWFTQYLDHFNPTDVHVWKQRYFVNSDFY-KPN--GPIFLMIGAEGIANPKWMIEGQW 102
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI--TNLK 186
+ A FGAM + EHR+YG+S P +++++ +N L YL++EQALAD A FI N+
Sbjct: 103 IEYAKEFGAMCFYLEHRFYGKSHP--TSDLSVKN---LVYLSSEQALADLAYFIQSVNIG 157
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
A +V FGGSYGG LAAWMR KYPH+ GA+++S P+L D E +Y +V+
Sbjct: 158 YKFPNNAKWIV-FGGSYGGSLAAWMRAKYPHLVHGAVSASGPLLAQIDF---EEYYIVVT 213
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN----STEDLADWLES 302
+ K S C + I+++ ++ + G ++ K F LC ++ D+++ E+
Sbjct: 214 NALKGYSEKCVDVIQDANSQINMLLHHVTGQQQIQKKFKLCDPIDPGHTKKVDISNLYET 273
Query: 303 -AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSI 343
A ++ +V Y + I VC + + SI
Sbjct: 274 LASNFAGIVQYNKDNRQSSQTANITIETVCDILVDEKIGKSI 315
>gi|291241533|ref|XP_002740665.1| PREDICTED: Prolyl Carboxy Peptidase like family member (pcp-2)-like
[Saccoglossus kowalevskii]
Length = 500
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 96/287 (33%), Positives = 149/287 (51%), Gaps = 32/287 (11%)
Query: 13 YLSPVITIVIISILS-PLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETR 71
YL V V+ S+ S P + +P VG +P L+E P + ++Q
Sbjct: 9 YLLCVFLTVLPSVFSLPYFMNGRPRG-----GMVG-VPVLSERPHTEPQEQ--------- 53
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI--EWFAVNSGFVW 129
+ QRLDH++ ADL T+ QRY I+ H++ GP+FL G EG + +W + ++
Sbjct: 54 WISQRLDHYNDADLRTWQQRYYIDDSHYIAG---GPVFLNIGGEGPLNSKWLMAETTWI- 109
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
A ++GA+ + EHRYYG+S P + +V+ +L YL++EQALAD A F + + L
Sbjct: 110 QYAMKYGALCLLVEHRYYGKSHP--TVDVS---TDSLQYLSSEQALADLAYFRNYIGEKL 164
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
+ + + FGGSY G LAAW R+KYPH+ GA+A+SAP+L + + +V
Sbjct: 165 NITNNKWIAFGGSYSGNLAAWFRIKYPHLVDGAVATSAPVLAKLNFTE---YLEVVRDSL 221
Query: 250 KRESA--SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
A +C I+ + ++ Q G L F +C +NSTE
Sbjct: 222 ASSKAGEACNKNIQAAVIDMQKKLQTTEGEKLLQNIFQVCGPINSTE 268
>gi|115623598|ref|XP_783667.2| PREDICTED: putative serine protease K12H4.7-like
[Strongylocentrotus purpuratus]
Length = 492
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 90/246 (36%), Positives = 136/246 (55%), Gaps = 16/246 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F+Q+LDHF+ ADL T++QR+ +N + P GP+FL G EG+ + G +
Sbjct: 51 WFKQKLDHFNDADLRTWNQRFFLNGTFYT-PG--GPVFLMIGGEGEANPVWMVEGAWMEY 107
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A A + EHR+YG+S P + V L YL++EQALAD A F T + Q L
Sbjct: 108 AKEMKAFCIMVEHRFYGKSHPTENMSV-----DNLQYLSSEQALADLAHFRTVIGQQLKF 162
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF-- 249
+ + + FGGSY G L+AW RLKYPH+ +GA+A+SAP+ D PE + +V
Sbjct: 163 DTNKWISFGGSYPGSLSAWFRLKYPHLVVGAIATSAPVQAQLDF--PE-YLTVVRDSLAT 219
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNST--EDLADWLES-AYSY 306
R + C + I+ + ++ S+ G L K F+LC LN T D+A++ E+ A ++
Sbjct: 220 SRVGSKCNDAIQAATHDIESLMMHMTGWEALAKIFNLCSPLNGTSMNDVANFYETLAGNF 279
Query: 307 LAMVDY 312
+ V Y
Sbjct: 280 METVQY 285
>gi|330790734|ref|XP_003283451.1| hypothetical protein DICPUDRAFT_147119 [Dictyostelium purpureum]
gi|325086716|gb|EGC40102.1| hypothetical protein DICPUDRAFT_147119 [Dictyostelium purpureum]
Length = 487
Score = 139 bits (350), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 86/233 (36%), Positives = 135/233 (57%), Gaps = 16/233 (6%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
++F ++DH++ + TF QRY +N +W P GP+FL G EG I V FV +
Sbjct: 58 QWFNNQVDHYNPLNTETFKQRYYVNDTYWT-PG--GPVFLVLGGEGPISPSYVTGHFVVN 114
Query: 131 I-APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
AP F A++V EHR+YG S P G+ +A +N L YL+ +QALAD+A F+ KQ
Sbjct: 115 YYAPMFDALIVAVEHRFYGASTPKGN--LATEN---LKYLSTQQALADYANFVQFFKQKY 169
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
+ S V FGGSY G L+AW+RLKYP++ A+A+SAP+ D PE ++ +VS+
Sbjct: 170 NTGDSKWVSFGGSYSGNLSAWLRLKYPNLIDAAIATSAPVKPVVDF--PE-YFEVVSNSI 226
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES 302
++ I ++ ++++ GQ + ++ K F+ C + S D+A ++ES
Sbjct: 227 GPSCSAFVANITKTVTDMINNGQND----QVAKLFNACDPIVSDLDIATFMES 275
>gi|194744689|ref|XP_001954825.1| GF16547 [Drosophila ananassae]
gi|190627862|gb|EDV43386.1| GF16547 [Drosophila ananassae]
Length = 480
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 97/288 (33%), Positives = 144/288 (50%), Gaps = 18/288 (6%)
Query: 16 PVITIVIISILSPLSL--AAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYF 73
I +V ++I+ L + A SK + P V L +L P +Q ++ + ++
Sbjct: 2 AAIRLVCLAIVLVLGVGHALDFSKAKDVPVLVKTLKNLNRGPPQQVVTKRANV--QEKWI 59
Query: 74 EQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAP 133
Q+LD+F ++ T+ RYLIN + + PIF+Y G E IE V++G +D+A
Sbjct: 60 TQKLDNFDDSNTETYQMRYLINDEFQTDGS---PIFIYLGGEWTIEQSMVSAGHWYDMAQ 116
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-QNLSAE 192
+LV+ EHRYYGES+P +T ++ +N L YL +QALAD A FIT LK +N
Sbjct: 117 EHKGVLVYTEHRYYGESIP--TTTMSTEN---LQYLHVKQALADVAHFITTLKSENAQLA 171
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF-KR 251
S VVL GGSY + W + YP + +G ASSAP+L D + +V F +
Sbjct: 172 NSKVVLAGGSYSATMVVWFKRLYPDLVVGGWASSAPLLAKVDFTE---YKEVVGRAFLEL 228
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
C+N I+ EL S+ + G E LC + DL W
Sbjct: 229 GGQQCYNRIQNGIAELESLFANKRG-AEARAMLRLCNSFDDQNDLDLW 275
>gi|281212302|gb|EFA86462.1| hypothetical protein PPL_00255 [Polysphondylium pallidum PN500]
Length = 482
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 104/369 (28%), Positives = 169/369 (45%), Gaps = 38/369 (10%)
Query: 18 ITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRL 77
+ ++ I L+ +S ++R RF + L +PP ET +F Q +
Sbjct: 6 VLVLFIVYLAAVSAVINTPLYKR--RFAKERQLLPQPPLMSN---------ETFWFTQLV 54
Query: 78 DHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGA 137
DHF + TF Q+Y + D++ G GPIF + E + +F +W+ A +F A
Sbjct: 55 DHFDPNNDETFQQQYQVIDDYFDG---TGPIFFFLAGEAPMGFFNFQEVQIWNWADKFNA 111
Query: 138 MLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVV 197
+ V EHR+YG S P + + + N L YLT++QALAD A F+T+ K E++PVV
Sbjct: 112 LYVVLEHRFYGASNP--TNDFSTPN---LRYLTSQQALADAANFLTSFKAERGLESAPVV 166
Query: 198 LFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESA-SC 256
+FG SY G L+AW RLKYP + + ++A S P+L + Y S F +A C
Sbjct: 167 VFGCSYSGALSAWFRLKYPQLVVASVAPSGPVL-------AQLNYTGYYSQFSNSAAPDC 219
Query: 257 FNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPS 316
+ + +++++ ++G +L KTF+ C L + DL +L + L D
Sbjct: 220 VAAAQTATNQIMALTTSKSGRDQLAKTFNSCSNLENPRDLYYFLYTLTEALGSADQ---- 275
Query: 317 DFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVY---YNYTGNVDCFQLDD--DPH 371
M P + + C+ + +I G Y+ +D + + D
Sbjct: 276 --MNNPPTWGLNTTCQTLTQTSSLLDNWAQIVAGGQTGCQDYSLKSFIDSMRKTNSKDQD 333
Query: 372 GLDGWNWQV 380
G W WQ
Sbjct: 334 GSRSWLWQT 342
>gi|194752101|ref|XP_001958361.1| GF23556 [Drosophila ananassae]
gi|190625643|gb|EDV41167.1| GF23556 [Drosophila ananassae]
Length = 508
Score = 138 bits (348), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 91/280 (32%), Positives = 141/280 (50%), Gaps = 18/280 (6%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
Q+ E +FEQRLDHF +D ++ QRY +N DH+ + PIFL G EG+
Sbjct: 49 QRSLESEDLWFEQRLDHFKPSDTRSWKQRYYLNADHYRN-DSTAPIFLMIGGEGEATAKW 107
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
+ G A FGA+ EHR+YG+S P G A L+YL++EQALAD A F+
Sbjct: 108 MREGAWVHYAEHFGALCFQLEHRFYGKSHPTGDLSTA-----NLAYLSSEQALADLANFV 162
Query: 183 TNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
+ +K + AE+ V FGGSY G LAAW R KYPH+ G+++SS P+L D + +
Sbjct: 163 SAMKVKFNLAESQKWVAFGGSYPGSLAAWAREKYPHLIYGSISSSGPLLAEVDF---KEY 219
Query: 242 YNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNST----EDLA 297
+ +V + C + S+ ++ + + G L + F C + D+A
Sbjct: 220 FEVVKASLASYKPDCVEAVTRSFAQVEILLKHMIGQRNLDEKFKTCTPIKDAIENPLDIA 279
Query: 298 DWLES-AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
++ E+ A ++ +V Y + P I ++C + N
Sbjct: 280 NFFENLAGNFAGVVQYNKDNS---PHATITIDDICDVMLN 316
>gi|413954868|gb|AFW87517.1| putative serine peptidase S28 family protein [Zea mays]
Length = 528
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 65/142 (45%), Positives = 91/142 (64%), Gaps = 3/142 (2%)
Query: 225 SSAPI-LQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWG--ELVSVGQKENGLLELT 281
SSAPI LQ + I P +FY+ VS DFK ES +CF+ IK +WG L + GLL+L+
Sbjct: 287 SSAPIILQLDHITPWSSFYDAVSQDFKSESMNCFSVIKATWGWDALDERAASDGGLLDLS 346
Query: 282 KTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDAT 341
K F C+ L + +WL +A++Y AMVDYP P++F+ LP YP++E+CK ID P
Sbjct: 347 KLFRACKTLKYAYSIRNWLWTAFAYTAMVDYPTPANFLQNLPAYPVKEMCKIIDGFPAGA 406
Query: 342 SILERIFEGVSVYYNYTGNVDC 363
+L++ F S+YYNYTG+ C
Sbjct: 407 DVLDKAFAAASLYYNYTGDQTC 428
>gi|66803795|ref|XP_635725.1| hypothetical protein DDB_G0290409 [Dictyostelium discoideum AX4]
gi|60464058|gb|EAL62220.1| hypothetical protein DDB_G0290409 [Dictyostelium discoideum AX4]
Length = 469
Score = 138 bits (347), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 86/267 (32%), Positives = 136/267 (50%), Gaps = 22/267 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E+ FEQ +DH+ + + TF QRY++ D++ G GPIF Y E + +F V
Sbjct: 48 ESLIFEQNVDHYDYFNNNTFKQRYIVVDDYFTGD---GPIFFYLAGEAPMGFFGFQEVQV 104
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
+ A FGA+ + EHRYYGES P V + L YLT++QAL+D A F++ KQ+
Sbjct: 105 VNWAQDFGALFIVLEHRYYGESYP-----VDDLSTHNLKYLTSQQALSDAANFLSTYKQD 159
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
+ + VV+FG SY G L+AW RLKYP++ + ++A S P+L + Y +
Sbjct: 160 NNLIDNQVVVFGCSYSGALSAWFRLKYPNLVVASVAPSGPVL-------AQLNYTGYYAQ 212
Query: 249 FKRESA-SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYL 307
F + C +++ E++ + E+G +L KTF+ C L+ D Y
Sbjct: 213 FSNSAQPDCVAATQQATNEIMQLIANESGRKQLEKTFNSCHSLDDPRD------QYYFLY 266
Query: 308 AMVDYPYPSDFMMPLPGYPIREVCKKI 334
++ D SD M P + + C+ +
Sbjct: 267 SITDALGGSDQMNNPPTWILNSTCQML 293
>gi|391335934|ref|XP_003742341.1| PREDICTED: putative serine protease K12H4.7-like [Metaseiulus
occidentalis]
Length = 486
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 108/330 (32%), Positives = 177/330 (53%), Gaps = 34/330 (10%)
Query: 17 VITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQR 76
+ ++V+I +L LAA R P + K L EP + + E R+ QR
Sbjct: 2 IRSLVLIVVLG---LAASTRLHRGRP--LHKHGMLGEPRSTSRHSFGEA---EIRWHTQR 53
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGD--IEWFAVNSGFVWDIAPR 134
++HF AD T+ QRY++N + + GP+FL G EG+ I W N+ V +A +
Sbjct: 54 MNHFDPADRRTWKQRYMVNDEFY---REGGPVFLLLGGEGEASISWVEKNT-HVMLMAKK 109
Query: 135 FGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF---ITNLKQNLSA 191
A++ EHR+YG+S P +++++ +N L YL++EQALAD A F ITN ++NLS
Sbjct: 110 HNALVFQLEHRFYGQSRP--TSDLSTEN---LVYLSSEQALADAAHFRNVITN-RRNLSP 163
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR 251
+A VV FGGSY G LAAW +LKYPH+A+GA+ASSAP+L I+ + + +V
Sbjct: 164 DAKWVV-FGGSYSGSLAAWFKLKYPHLAVGAVASSAPLLA---IIDFQDYVRVVRDSL-- 217
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN--STEDLADWLES-AYSYLA 308
+SC +K+ + L + + ++ F C + ++ +L ++ ++ A ++
Sbjct: 218 -GSSCSAKVKDGFQALQVKAARRSSWPDIDNEFKTCVPFDGYNSLNLQNFFQTIAGNFEG 276
Query: 309 MVDYPYPSDFMMPLPGYPIREVCKKIDNAP 338
+V Y M I ++C+ ++NAP
Sbjct: 277 IVQYN-KDQRMEGRTNISIDDLCRLMENAP 305
>gi|193662244|ref|XP_001949662.1| PREDICTED: putative serine protease K12H4.7-like [Acyrthosiphon
pisum]
Length = 509
Score = 137 bits (345), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 89/254 (35%), Positives = 140/254 (55%), Gaps = 15/254 (5%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E ++F Q+LDHF+ D T+ QRY +N ++ + GP+FL G EG I + SG
Sbjct: 43 EDKWFLQKLDHFNPTDNRTWKQRYQVNQKYY---KKDGPVFLMIGGEGPISAKWMYSGAW 99
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
D A F A+ + EHRYYG+S P + +++ +N L YL++EQAL D A FI N++ N
Sbjct: 100 IDYAKEFNALCLQLEHRYYGKSHP--TEDMSTKN---LVYLSSEQALTDLAEFIVNIRTN 154
Query: 189 LSAEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ V FGGSY G LAAW+R+K+PH+ A++SS P+L D + ++ +V +
Sbjct: 155 YDIPTTAKWVAFGGSYPGSLAAWLRMKFPHLVYAAVSSSGPLLAKIDF---KEYFKVVEN 211
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELNSTEDLADWLES-AY 304
S C + IKE+ + S + G + F LC ++N+ D+A+ E+ A
Sbjct: 212 ALATYSPDCVSQIKEANQMIDSQIKTIKGAKLIENKFKLCDPLDINTKNDVANLFETLAG 271
Query: 305 SYLAMVDYPYPSDF 318
++ +V Y + F
Sbjct: 272 NFADIVQYNKDNRF 285
>gi|330799265|ref|XP_003287667.1| hypothetical protein DICPUDRAFT_151801 [Dictyostelium purpureum]
gi|325082345|gb|EGC35830.1| hypothetical protein DICPUDRAFT_151801 [Dictyostelium purpureum]
Length = 467
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 80/256 (31%), Positives = 139/256 (54%), Gaps = 16/256 (6%)
Query: 57 RQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG 116
R+Q + + +T +F+Q++DH+ F + T+ Q+Y++ D++ G GPIF+Y E
Sbjct: 34 REQLGEVPVFSNQTFWFDQKIDHYDFFNNNTYKQQYIVVDDYFDGS---GPIFIYLAGEA 90
Query: 117 DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALA 176
+ +F V + A +FGA+ + EHR+YG+S P + +++ N L YLT++QALA
Sbjct: 91 PMGFFGFQEVQVVEWAKQFGALFIVIEHRFYGKSYP--TQDLSTNN---LKYLTSQQALA 145
Query: 177 DFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
D A F++ K +P V+FG SY G L++W RLKYP +AI ++A S P+L
Sbjct: 146 DAANFLSTYKAENDLVENPTVVFGCSYSGALSSWFRLKYPQLAIASIAPSGPVL------ 199
Query: 237 PPETFYNIVSSDFKRESA-SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED 295
+ + + F +A +C + + +++ + + G+ +L KTF+ C L + D
Sbjct: 200 -AQLNFTGYYAQFSNSAAPNCVQAAQTATNQIMQLTKSSAGIKQLEKTFNSCHSLENPRD 258
Query: 296 LADWLESAYSYLAMVD 311
L +L + L D
Sbjct: 259 LYYFLYTLTEALGSAD 274
>gi|345314847|ref|XP_001512755.2| PREDICTED: dipeptidyl peptidase 2-like, partial [Ornithorhynchus
anatinus]
Length = 452
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 94/281 (33%), Positives = 132/281 (46%), Gaps = 33/281 (11%)
Query: 68 YETRYFEQRLDHFSFADLP--TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ RYFEQ +DHF F T+ QRYLI W GP+F Y GNEGDI FA NS
Sbjct: 31 FQERYFEQTVDHFDFETYGNRTYLQRYLITEKFWK--KGSGPLFFYTGNEGDIWNFAKNS 88
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
F+ ++A A+++F EHRYYG+S+P G + ++ LT EQALAD+AV I L
Sbjct: 89 DFILELAAAESALVIFAEHRYYGKSLPLGPGSI---RRGSMGPLTVEQALADYAVLIGAL 145
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAI-------------GALASSAPILQF 232
++ L A P+V FGGS G A + H I G AP
Sbjct: 146 QRQLGAAGLPLVAFGGSSGPKAEAKDPISALHTRISICQPVQEEGPRTGKCTGVAP---- 201
Query: 233 EDIVPPETFYNIVSSDF---KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRE 289
PP+T + + R A S+ +++ C
Sbjct: 202 ---RPPQTLPSESPTTLGAGHRPPAWTGTPSDPGDPGRASLSPPPAAYDPISRGMATCHR 258
Query: 290 LNSTEDLADWLE---SAYSYLAMVDYPYPSDFMMPLPGYPI 327
L+ D+ LE +A++ +AM+DYPYP+DFM P +P+
Sbjct: 259 LSDGADVDQLLEFARNAFAMIAMMDYPYPTDFMGHFPAHPV 299
>gi|118787231|ref|XP_315944.3| AGAP005914-PA [Anopheles gambiae str. PEST]
gi|116126698|gb|EAA11647.3| AGAP005914-PA [Anopheles gambiae str. PEST]
Length = 502
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 96/290 (33%), Positives = 146/290 (50%), Gaps = 17/290 (5%)
Query: 53 EPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYC 112
+ P R +R Q + +FEQ+LDH + T+ QRY +N ++ + P+FL
Sbjct: 38 DAPSRSERFHTQAAVPDL-WFEQQLDHNDPTNAATWQQRYYVNDQYFNASDPNAPVFLMI 96
Query: 113 GNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAE 172
G EG+ ++ G A + GA+ EHR+YG+S P + ++L+YLT+E
Sbjct: 97 GGEGEATARWMHEGAWIRYAEKHGALCFQLEHRFYGKSRPTEDL-----STSSLAYLTSE 151
Query: 173 QALADFAVFIT--NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
QALAD A FI N K L + FGGSY G LAAW+R KYP + GA++SS P+L
Sbjct: 152 QALADLAYFIVAMNDKYQLEPHRHRWIAFGGSYPGSLAAWLREKYPSLVHGAISSSGPLL 211
Query: 231 QFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC--- 287
D V +Y+ V+ +R SA C ++ ++ ++ ++ + G L + F LC
Sbjct: 212 AKIDFVE---YYDTVTRSLERYSADCVRAVRSAFQQVETLLKHMIGQRTLNEKFQLCDPV 268
Query: 288 -RELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
R + + D+A+ E+ S A V Y D P I EVC + N
Sbjct: 269 ERSIENPLDIANLFEAIASNFAGV-VQYNKD-NSPHATVTIDEVCDVMVN 316
>gi|347965569|ref|XP_321914.5| AGAP001240-PA [Anopheles gambiae str. PEST]
gi|333470451|gb|EAA01781.5| AGAP001240-PA [Anopheles gambiae str. PEST]
Length = 496
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 105/371 (28%), Positives = 179/371 (48%), Gaps = 45/371 (12%)
Query: 15 SPVITIVIISILSPLSLAAQPSKFRRAPRF-----VGKLPHLTEPPQRQQRQQQQQYRYE 69
SP+ + ++ + L QP + R V + H EPP + ++ E
Sbjct: 3 SPLALVCLLGFVGAFVLEPQPQQKDGVQRVLRNQRVWERMH-REPPIQGGSPSRKAGPVE 61
Query: 70 TRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVW 129
T++ QRLDHF ++ T+S RY+ N +H+V GP+F+Y G E +I +++ G V+
Sbjct: 62 TKHIMQRLDHFDPQNVNTWSMRYMANGEHYV---EGGPLFIYVGGEWEISEGSISRGHVY 118
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
D+A L + EHR+YG+S P L YL +QALAD A F+ +++ +
Sbjct: 119 DMAAELKGYLFYTEHRFYGQSHP-----TVDLRTDKLKYLNIDQALADLAHFVVEMRKTI 173
Query: 190 -SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
AE S V++ GGSY + +W R KYPH+ GA ASSAP+ F + E + IV+
Sbjct: 174 PGAEKSGVIMIGGSYSATMVSWFRQKYPHLINGAWASSAPV--FAKVEFTE-YKEIVTES 230
Query: 249 FKRESA-SCFNTIKES---WGELVSVGQKENGLLELTKTFHLCRELNSTE--DLADWLES 302
+ SC + I+ + EL+ G+ + + + F LC +++ ++ D ++ S
Sbjct: 231 IRLVGGQSCADRIERAIRQTEELLDRGEYAS----VAQEFQLCSDVDLSQPLDRMNFFSS 286
Query: 303 -AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP---DATSILERIFEGVSVY---- 354
+ + +V Y D I VC+ I++A D ++ + + G++
Sbjct: 287 LSDEFAGVVQYHSTGD---------IEGVCQVIEDATITDDMQALAKLVTRGLTSTNCNS 337
Query: 355 YNYTGNVDCFQ 365
Y Y VD ++
Sbjct: 338 YGYKAMVDYYK 348
>gi|332026663|gb|EGI66772.1| Putative serine protease K12H4.7 [Acromyrmex echinatior]
Length = 481
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 80/241 (33%), Positives = 130/241 (53%), Gaps = 13/241 (5%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ Q LDHF+ D T+S RY N+ + GPI + G E +I + G ++++
Sbjct: 48 WITQPLDHFNHRDNRTWSMRYKENSAFL---KKNGPILIMIGGEWEITNGFLQGGLMYEL 104
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
++ ++ + EHR+YG+S P + +++ +N L YL A+QALAD A FI K+ +
Sbjct: 105 GVKYHGLMYYTEHRFYGQSRP--TKDISTEN---LQYLNADQALADLAYFIDTKKKEKNL 159
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR 251
E S V++ GGSY G +AAW RLKYPH+ GALASSAP+ D +Y +V+ +
Sbjct: 160 EKSIVIVVGGSYAGNMAAWARLKYPHLIQGALASSAPVRAKADFYE---YYEVVTDALGK 216
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELNSTEDLADWLESAYSYLAM 309
S +C ++K ++ + + G +L F LC ++ S+ DL ++ + A
Sbjct: 217 YSKTCIESVKTAFASVEELLAMRAGPQKLKLLFKLCHVPDVRSSSDLGYFMNTLSEIFAG 276
Query: 310 V 310
V
Sbjct: 277 V 277
>gi|241161684|ref|XP_002408971.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
gi|215494440|gb|EEC04081.1| prolylcarboxypeptidase, putative [Ixodes scapularis]
Length = 201
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 73/182 (40%), Positives = 102/182 (56%), Gaps = 10/182 (5%)
Query: 206 MLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWG 265
MLAAW RLKYPH+ ALASSAPIL F I P F +++ F +ES C N I+ S+
Sbjct: 1 MLAAWFRLKYPHVTTAALASSAPILLFTGITPCSAFSEVLTKAFAKESDQCTNAIRTSFE 60
Query: 266 ELVSVGQKENGLLELTKTFHLCRELNSTE--DLADWLESAYSYLAMVDYPYPSDFMMPLP 323
E G L + F LC+ L + L DW Y+YLAM ++PY S + +P
Sbjct: 61 VTRKQAVTEEGAKALKEQFRLCKPLAPSNYTVLRDWFWDVYAYLAMFNHPYASKLPLLVP 120
Query: 324 GYPIREVCKKID-NAPDATSILERIFEGVSVYYNYTGNVDCFQLDDDPHG----LDGWNW 378
G+P++E CK ++ N D S+L+ I++ +SV+ NYTG C +D P+ L GW
Sbjct: 121 GHPVKEACKFLEKNFADDQSLLDGIYQAISVFTNYTGKTHC---NDLPNSAVPLLGGWGI 177
Query: 379 QV 380
Q+
Sbjct: 178 QL 179
>gi|357603759|gb|EHJ63915.1| carboxypeptidase 3 [Danaus plexippus]
Length = 445
Score = 134 bits (338), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 91/303 (30%), Positives = 155/303 (51%), Gaps = 22/303 (7%)
Query: 51 LTEPPQRQQRQQQQQYR-YETRYF----EQRLDHFSFADLPTFSQRYLINTDHWVGPNRL 105
+ EPP R + +Y +E+ +F + +DHF + T+ RY+ N + + G N
Sbjct: 22 ILEPPLRVDLKPPVEYNGFESSHFWRTYDMPIDHFDPQNRETYQMRYMYNEEFFGGNNY- 80
Query: 106 GPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT 165
PIF+ G E +I+ + +G ++ +A L + EHRYYGES+PY T +N
Sbjct: 81 -PIFIMVGGEWNIQPGWLLAGNMYLMAQENRGYLFYTEHRYYGESLPY--TTFTTEN--- 134
Query: 166 LSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALAS 225
L +L +QALAD A FI+ +K+ S S VVL+GGSY G + W++ +YPH+ +G +AS
Sbjct: 135 LRFLNVDQALADLAYFISEIKKIPSFVNSKVVLYGGSYAGNMVLWLKQRYPHLVVGVVAS 194
Query: 226 SAPILQFEDIVPPETFYNIVSSDFKRESA-SCFNTIKESWGELVSVGQKENGLLELTKTF 284
S PI DI + +V + F E C +TIK+ + ++ + E+G + + +
Sbjct: 195 SGPIKAQVDI---PGYLEVVHNAFLSEGGQECVDTIKQGIADTIAAMETEDGRRSIQRIY 251
Query: 285 HLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSIL 344
LC L+ + L+ S Y + + + + PG + +C+ N ++ +
Sbjct: 252 RLCVPLDYSSRLSMGYFSGY-----ITWTFSTSVQTARPG-SLTAICQNFTNNVYGSTPM 305
Query: 345 ERI 347
E+I
Sbjct: 306 EQI 308
>gi|221108130|ref|XP_002169972.1| PREDICTED: putative serine protease K12H4.7-like [Hydra
magnipapillata]
Length = 496
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 91/232 (39%), Positives = 133/232 (57%), Gaps = 23/232 (9%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E ++F Q+L+HF AD T+ QRY +N +++ G GP+FL G EG + VN G +
Sbjct: 51 EPQWFTQKLNHFDDADDSTWKQRYYVNDEYFDG----GPVFLMIGGEGSLSSLWVNVGAM 106
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
D A + A+++ EHR+YGES P ++++ +N L YL++EQALAD A F +
Sbjct: 107 VDYAKQHSALILGLEHRFYGESHPL--SDMSTEN---LKYLSSEQALADLAHFRNEMALK 161
Query: 189 LS-AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI---LQFEDIVPPETFYNI 244
S + + + FGGSY G LAAW+R KY H+ GA+ASSAPI L F + T N
Sbjct: 162 YSLNDKNRWIAFGGSYPGALAAWLRYKYQHLIYGAIASSAPIYAQLNFPQYLEVST--NS 219
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
+SS R + + I ES+ L + E GL++L+K F C+ + T DL
Sbjct: 220 LSSSRCRANVNAATKILESY--LTT----EEGLMKLSKDFKTCKPI--TNDL 263
>gi|383860201|ref|XP_003705579.1| PREDICTED: putative serine protease K12H4.7-like [Megachile
rotundata]
Length = 494
Score = 134 bits (336), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 92/284 (32%), Positives = 148/284 (52%), Gaps = 23/284 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHW-VGPNRLGPIFLYCGNEG--DIEWFAVNS 125
+ ++F Q LDHF+ D + QRY +N D++ VG GP+FL G EG + +W
Sbjct: 46 KEQWFPQFLDHFNPTDAHVWQQRYFVNGDYYKVG----GPVFLMIGGEGAANAKWMV--E 99
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G + A FGA+ EHR+YG+S P +++++ +N L YL++EQALAD A F+ +
Sbjct: 100 GQWIEYAKEFGALCFQVEHRFYGKSHP--TSDLSVKN---LMYLSSEQALADLAYFVQTV 154
Query: 186 KQNLSA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
+ + FGGSY G LAAW+R KYPH+ GA+++S P+L D + +Y +
Sbjct: 155 NTMYKLPNNTKWIAFGGSYAGSLAAWLREKYPHLVHGAVSASGPLLAQIDF---QEYYVV 211
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN----STEDLADWL 300
V + K S +C N I E+ + + + G L K F LC ++ T+D+A+
Sbjct: 212 VENALKEYSEACVNAIVEANTQFHIMLRHRIGQQGLAKKFILCNPIDPGYTKTKDIANLY 271
Query: 301 ES-AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSI 343
E+ A + +V Y + + + I+ C + N +I
Sbjct: 272 ETIASDFAGIVQYNKNNRNNSAMANFTIKSACDILTNESLGLAI 315
>gi|321477664|gb|EFX88622.1| hypothetical protein DAPPUDRAFT_191142 [Daphnia pulex]
Length = 493
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 91/249 (36%), Positives = 140/249 (56%), Gaps = 20/249 (8%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI--EWFAVNSGFV 128
++F+Q+LDHF + T+ QRY N + G + P+FL G EG+ +W
Sbjct: 49 QWFDQKLDHFDVVNSKTWKQRYHTNDTFFKGDS---PVFLMIGGEGEASPKWMVQGMWIE 105
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
W A +F A+ EHRYYG+S P + ++ +N L +L++EQALAD A FI K+
Sbjct: 106 W--AKQFNALCFQLEHRYYGKSHP--TKNMSTKN---LKFLSSEQALADLAYFIEAKKKE 158
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
L + ++FGGSY G LAAW RLKYPH+A GA+ASSAP+ F I E + +V++
Sbjct: 159 LKLSNNKWIVFGGSYPGSLAAWFRLKYPHLAHGAVASSAPL--FAKINFKE-YLGVVTNA 215
Query: 249 FK--RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE--DLADWLES-A 303
+ +S+ C N I+++ L Q + + K F LC L++T D+A++ E+ A
Sbjct: 216 LQTTSQSSKCTNAIEQATIALEDEIQSTSCCKTMDKLFSLCDPLDTTNKLDVANFFETLA 275
Query: 304 YSYLAMVDY 312
++ +V Y
Sbjct: 276 GNFEGVVQY 284
>gi|195497936|ref|XP_002096312.1| GE25600 [Drosophila yakuba]
gi|194182413|gb|EDW96024.1| GE25600 [Drosophila yakuba]
Length = 480
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 90/291 (30%), Positives = 139/291 (47%), Gaps = 20/291 (6%)
Query: 11 SLYLSPVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYET 70
+L L +I +V I ++ L + K + P V L +L P Q ++ +
Sbjct: 3 ALRLVCLIVVVTIGLVHSLDIP----KIKDVPLLVKTLKNLNRGPPLQV--MTKRVNVQE 56
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
++ Q+LD+F ++ T+ RYL+N + PIF+Y G E +IE V++G +D
Sbjct: 57 KWITQKLDNFDASNSQTYPMRYLVNDEF---QTEGSPIFIYLGGEWEIENSMVSAGHWYD 113
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-QNL 189
+A +LV+ EHRYYG+S+P + + L YL +QALAD AVFI K +N
Sbjct: 114 MAEEHKGVLVYTEHRYYGQSVP-----TSTMSTDNLKYLDVKQALADVAVFIETFKAENP 168
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
S V+L GGSY + W + YP + +G ASSAP+L D + +V F
Sbjct: 169 QLSNSKVILAGGSYSATMVVWFKRLYPELIVGGWASSAPLLAKVDFTE---YKEVVGQAF 225
Query: 250 -KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
+ C++ I+ EL S+ + G E LC + DL W
Sbjct: 226 LQLGGQKCYDRIENGIAELESMFANKRG-AEAKAMLRLCNSFDDQNDLDLW 275
>gi|428184604|gb|EKX53459.1| hypothetical protein GUITHDRAFT_133167 [Guillardia theta CCMP2712]
Length = 467
Score = 134 bits (336), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 105/367 (28%), Positives = 162/367 (44%), Gaps = 89/367 (24%)
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
Q Y RYF + D S + TD +L IF Y GNE +E + N
Sbjct: 7 QATYRQRYFVCKCDQAS-----------VRITD---ATKKLQTIFFYFGNEDSVELYVNN 52
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+G +W+ A F A++VF EHRYYG+S+ + + +LT +QAL D + F++
Sbjct: 53 TGLMWESASEFDAVMVFLEHRYYGKSVLFEPGREG-----CMEFLTTDQALLDASQFLST 107
Query: 185 LKQN--------LSAE-ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
LK N +S + P++ FGGSYGGM+A+W R+++PH+ G +A SAPIL FE +
Sbjct: 108 LKANPKEILPKKISKKPVGPIIGFGGSYGGMIASWFRMRFPHLIDGVIAGSAPILSFEGL 167
Query: 236 VPP---ETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC----- 287
P + +V+ D S C + + SV Q E G L + LC
Sbjct: 168 RPAYDNGGYMRVVTQD---ASPLCARNFLRAQKAVYSVSQSEEGREFLKEALLLCVPPAA 224
Query: 288 ----------------RE---------LNST---EDLADWLESAYSYLAMVDYPYPSDFM 319
RE L S+ ++L W++ +S+LAM +YPY S ++
Sbjct: 225 WRQAVLTGAVRCEDFAREEILVLAPPMLPSSYALQNLISWMQDPWSFLAMGEYPYSSSYL 284
Query: 320 ----MPLPGYPIREVCKKID------------------NAPDATSILERIFEGVSVYYNY 357
LP P+ C+ ++ + D +L + V+YN
Sbjct: 285 THGKCNLPPKPLSFACQYLEMYSDDGITYSDVCCNRQAKSIDLPELLNATRYALGVFYNC 344
Query: 358 TGNVDCF 364
+ +V CF
Sbjct: 345 SQDVQCF 351
>gi|426363699|ref|XP_004048972.1| PREDICTED: dipeptidyl peptidase 2 [Gorilla gorilla gorilla]
Length = 495
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 87/310 (28%), Positives = 147/310 (47%), Gaps = 23/310 (7%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
++ R+F+QRLDHF+F TF QR+L++ WV GPIF Y G A S
Sbjct: 31 FQERFFQQRLDHFNFERFGNKTFPQRFLVSDRFWV--RGEGPIFFYTGTR------ASGS 82
Query: 126 GFVWDIA-PRFGAMLVFPEHRYYG-----ESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
GF +A PR + L G + P S +
Sbjct: 83 GFRLSVASPRKPSHLSSQPAAQLGCAATLAAFPSESCTRTWPRVGICGSQKCRPPCGTCG 142
Query: 180 VFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPE 239
+ A+ + SYGGML+A++R+KYPH+ GALA+SAP+L +
Sbjct: 143 PLRRKRGNEVLGAAALTLCPSPSYGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSN 202
Query: 240 TFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD- 298
F+ V++DF+ +S C ++E++ ++ + + + + F C+ L+ +DL
Sbjct: 203 QFFRDVTADFEGQSPKCTQGVREAFRQIKDL-FLQGAYDTVRREFGTCQPLSDEKDLTQL 261
Query: 299 --WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYN 356
+ +A++ LAM+DYPYP+DF+ PLP P++ C ++ + + L + + YN
Sbjct: 262 FMFARNAFTVLAMMDYPYPTDFLGPLPANPVKVGCDRLLSEAQRITGLRAL---AGLVYN 318
Query: 357 YTGNVDCFQL 366
+G+ C+ +
Sbjct: 319 ASGSEHCYDI 328
>gi|195391898|ref|XP_002054596.1| GJ22720 [Drosophila virilis]
gi|194152682|gb|EDW68116.1| GJ22720 [Drosophila virilis]
Length = 487
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 91/286 (31%), Positives = 138/286 (48%), Gaps = 19/286 (6%)
Query: 20 IVIISILSPLSLAA-----QPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFE 74
++++ + PL+ AA Q S P FV L L P Q + ETR+
Sbjct: 6 LIVLPVFIPLAFAASLGDSQGSDDVEIPAFVQTLRELYRGPPPGQVSTTRA-NVETRWIS 64
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPR 134
Q+LD+F+ ++ + R LIN D++V + PIF+Y G E IE A+ SG DIA
Sbjct: 65 QKLDNFNVSNEEVWDDRVLINEDYFVDGS---PIFIYLGGEWKIEPSAITSGLWVDIARE 121
Query: 135 FGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEAS 194
LV+ EHR++GES+P A L Y EQALAD I LK+ + S
Sbjct: 122 HNGSLVYTEHRFFGESIPIKPLSTA-----NLKYQNVEQALADVVNVINVLKKEDKYKDS 176
Query: 195 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESA 254
VV+ G SY +A W++L YP + +G+ ASSAP+ E V + + +V ++
Sbjct: 177 KVVISGCSYSATMAVWLKLLYPDVIVGSWASSAPL---EAKVDFKDYMKVVGKAYRELGG 233
Query: 255 S-CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
C+N I + + + NG E K +LC + + ++ W
Sbjct: 234 DYCYNIIDNATSQYEQLFASGNG-TEAKKILNLCDDFDENDEQDQW 278
>gi|291236175|ref|XP_002738015.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
Length = 515
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 93/305 (30%), Positives = 155/305 (50%), Gaps = 29/305 (9%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+E Y EQ LDHF + QRY +N +W N GP+FL+ G EG + + V G
Sbjct: 61 EFEDLYLEQPLDHFDPLVTDIYEQRYWVNPTYWNKEN--GPVFLFIGGEGALGAYDVEEG 118
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF--ITN 184
D+A ++GA++ EHR+YG S+ ++ Y L YL+++QALAD A F
Sbjct: 119 EHVDLAKKYGALIFAVEHRFYGASINKDGLKLEY-----LQYLSSQQALADLASFHRFAT 173
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
K N++ +++ + FGGSY G L+AW RLKYPH+ GA+ASSAP+ ++ E + +
Sbjct: 174 SKYNIT-QSNIWICFGGSYPGSLSAWFRLKYPHLVYGAIASSAPVRVVKNF---EGYNQV 229
Query: 245 VSSDFK----RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL 300
V++ + S C + I ++ +++ K+ L F C ++S D A +L
Sbjct: 230 VAASLADPVVKGSLKCSDNIAAAF-KIIDQKIKDKQFDTLKADFKSCNNISSYNDTALFL 288
Query: 301 ES-AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTG 359
+ A ++ +V Y +P + + VC+ N S + + + ++Y + G
Sbjct: 289 NNLAGIFMGIVQYN------NEMPDWNVAAVCQ---NMTQPASPYDNLVKFTTIYLDGMG 339
Query: 360 NVDCF 364
+CF
Sbjct: 340 Q-ECF 343
>gi|281210278|gb|EFA84445.1| hypothetical protein PPL_02477 [Polysphondylium pallidum PN500]
Length = 761
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 104/347 (29%), Positives = 169/347 (48%), Gaps = 38/347 (10%)
Query: 12 LYLSPVITIVIISILSPLSLAAQPSK----FRRAPRFVGKLPHLTEPPQRQQRQQQQQYR 67
+ LS ++T+ I++IL + + R+ P+F + + +Q R +
Sbjct: 3 IILSLIVTLCILAILPDVIFGGRAFNPHFDLRKKPKFQLSIDY----ALKQNRHSSKGVE 58
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
Y+ +F Q LDHF + TF QRYLIN +W G GP+F+ EG + V
Sbjct: 59 YQ--WFTQELDHFDQQNTKTFQQRYLINDQYWDGK---GPVFIMINGEGPMTIGTVLGLK 113
Query: 128 VWDIAPRFGAMLVFPEHRYYG----------ESMPYGSTEVAYQNATTLSYLTAEQALAD 177
D A +F A++V EHRYYG E++ Y S++ A +N L + + LAD
Sbjct: 114 YIDWAKQFNALVVALEHRYYGASFATPDISTENLQYLSSDQASKNIQRLILIISFFRLAD 173
Query: 178 FAVFITNL-KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
AVF + KQ +S V FGGSY G L +W RLKYP++ ++SSAP+L D
Sbjct: 174 NAVFRQFIAKQYNVTSSSKWVSFGGSYSGALTSWFRLKYPNLVDFTISSSAPVLAEVDFY 233
Query: 237 PPETFYNIVSSDFKRESAS--CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
+ +V + S C N I + ++ ++ Q +GL +++ F LC L + +
Sbjct: 234 ---QYLEVVQNSLLTTSKGQECVNNIASATQKIQTLLQTSDGLQKVSDLFDLCPPLATQD 290
Query: 295 DLADWLES-AYSYLAMVDYPYPSDFMMPLPGYP-IREVCKKIDNAPD 339
D++ +++S A +++ V Y + PG I +C+ + N PD
Sbjct: 291 DISTFMQSLAGNFMGTVQYN------LEAPGAATITNLCEIMTN-PD 330
>gi|312090033|ref|XP_003146464.1| hypothetical protein LOAG_10893 [Loa loa]
Length = 390
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 75/202 (37%), Positives = 115/202 (56%), Gaps = 19/202 (9%)
Query: 195 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI-VPPETFYNIVSSDFKRES 253
PV++FGGSYGGMLAAW+R+KYPHI GA ASSAP+ F + PE+ +++++ S
Sbjct: 6 PVIVFGGSYGGMLAAWLRMKYPHIVDGAWASSAPLRIFYGTGINPESVSRTITTNY-LTS 64
Query: 254 ASCFNTIKESWGELVSVGQKENGLLELTKTFHL-----CRELNSTEDLADWLESAYSYLA 308
+ + + + + E G ++L + FH + N L ++ SA Y+A
Sbjct: 65 GCDRKVFSDGFVAIEKMSKTEEGRMKLNRIFHAKPGFEMKSYNDFMSLYSYIYSAIFYMA 124
Query: 309 MVDYPYPSDFMMPLPGYPIREVCKKIDNAP-DATSILERIFEGVSVYYNYTGNV--DCFQ 365
M DYPYP+DF PLPGYP++ VC+ A + ++ E+I+ ++VYYNYTG + +CF
Sbjct: 125 MTDYPYPADFFEPLPGYPVKYVCQYAKKAATNEENLAEQIYSIINVYYNYTGQLTDNCFT 184
Query: 366 --------LDDDPHGLDGWNWQ 379
+ +D + WNWQ
Sbjct: 185 SNCTTPSPIQNDDEDI-AWNWQ 205
>gi|194865618|ref|XP_001971519.1| GG14395 [Drosophila erecta]
gi|190653302|gb|EDV50545.1| GG14395 [Drosophila erecta]
Length = 508
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/305 (31%), Positives = 153/305 (50%), Gaps = 23/305 (7%)
Query: 38 FRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTD 97
FRR G L ++ P Q Q E +FEQRLDHF +D+ T+ QRY +N D
Sbjct: 29 FRRGRLSNGFLGEPSKIPTLQGSQHS-----EDLWFEQRLDHFKSSDVRTWQQRYFVNAD 83
Query: 98 HWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE 157
+ + P+FL G EG+ + G A FGA+ + EHR+YG+S P + +
Sbjct: 84 FYRNDSS-APVFLMIGGEGEASAKWMREGAWVHYAEHFGALCLQLEHRFYGKSHP--TAD 140
Query: 158 VAYQNATTLSYLTAEQALADFAVFITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYP 216
++ +N L YL++EQAL D A F+T +K + A+ + FGGSY G LAAW R KYP
Sbjct: 141 LSTEN---LHYLSSEQALEDLASFVTAMKVKFNLADGQKWIAFGGSYPGSLAAWAREKYP 197
Query: 217 HIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENG 276
+ G+++SS P+L D + ++ +V + C + S+ ++ + + G
Sbjct: 198 QLIYGSISSSGPLLAEVDF---KEYFEVVKASLAAYKPECVEAVTRSFAQVEILLKHMIG 254
Query: 277 LLELTKTFHLCREL-NSTE---DLADWLES-AYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L + F C + +S E D+A++ E+ A ++ +V Y + P I ++C
Sbjct: 255 QRSLDEKFKTCTPIKDSIENDLDMANFFENLAGNFAGVVQYNKDNS---PHATITIDDIC 311
Query: 332 KKIDN 336
+ N
Sbjct: 312 DVMLN 316
>gi|195492404|ref|XP_002093975.1| GE21585 [Drosophila yakuba]
gi|194180076|gb|EDW93687.1| GE21585 [Drosophila yakuba]
Length = 508
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 95/305 (31%), Positives = 151/305 (49%), Gaps = 23/305 (7%)
Query: 38 FRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTD 97
FRR G L ++ P Q+ QQ E +FEQRLDHF +D T+ QRY +N D
Sbjct: 29 FRRGRLGNGFLGEPSKIPTLQRSQQS-----EDLWFEQRLDHFKSSDKRTWQQRYFVNAD 83
Query: 98 HWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE 157
++ + P+FL G EG+ + G A FGA+ + EHR+YG+S P
Sbjct: 84 YYRNDSS-APVFLMIGGEGEASAKWMREGAWVHYAEHFGALCLQLEHRFYGKSHP----- 137
Query: 158 VAYQNATTLSYLTAEQALADFAVFITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYP 216
A + L YL++EQAL D A F+T +K + A+ + FGGSY G LAAW R K+P
Sbjct: 138 TADLSTDNLRYLSSEQALEDLASFVTAMKVKFNLADGQKWIAFGGSYPGSLAAWAREKFP 197
Query: 217 HIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENG 276
+ G+++SS P+L D + ++ +V + C + S+ ++ + + G
Sbjct: 198 QLIHGSISSSGPLLAEVDF---KEYFEVVKASLAAYKPECVEAVTRSFAQVEILLKHMIG 254
Query: 277 LLELTKTFHLCREL-NSTE---DLADWLES-AYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L + F C + +S E D+A++ E+ A ++ +V Y + P I ++C
Sbjct: 255 QRSLDEKFKTCTPIKDSIENELDMANFFENLAGNFAGVVQYNKDNS---PHATITIDDIC 311
Query: 332 KKIDN 336
+ N
Sbjct: 312 DVMLN 316
>gi|19528413|gb|AAL90321.1| RE11624p [Drosophila melanogaster]
Length = 480
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 89/291 (30%), Positives = 139/291 (47%), Gaps = 20/291 (6%)
Query: 11 SLYLSPVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYET 70
+L L + ++ I ++ L + K + P V L +L P Q ++ +
Sbjct: 3 ALRLVCLFVVLAIGLVQSLDIP----KIKDVPLLVKTLKNLNRGPPHQVVTKRA--NVQE 56
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
++ Q+LD+F+ ++ T+ RYL+N + PIF+Y G E +IE V++G +D
Sbjct: 57 KWITQKLDNFNASNTQTYQMRYLLNDEF---QTEGSPIFIYLGGEWEIEESMVSAGHWYD 113
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-QNL 189
+A +LV+ EHRYYG+S+P + + L YL +QALAD AVFI K +N
Sbjct: 114 MAQEHNGVLVYTEHRYYGQSIP-----TSTMSTEDLKYLDVKQALADVAVFIETFKAENP 168
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
S V+L GGSY + W + YP + +G ASSAPIL D + +V F
Sbjct: 169 QLANSKVILAGGSYSATMVVWFKRLYPDLIVGGWASSAPILAKVDFTE---YKEVVGQAF 225
Query: 250 -KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
+ C++ I+ EL S+ + G E LC + DL W
Sbjct: 226 LQLGGQKCYDRIENGIAELESMFANKRG-AEARAMLRLCNSFDDQNDLDLW 275
>gi|344292452|ref|XP_003417941.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 482
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 86/250 (34%), Positives = 127/250 (50%), Gaps = 16/250 (6%)
Query: 56 QRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE 115
+R + Q + + TRYF+Q+LDHFS + QRYLIN + R GP+FL G
Sbjct: 23 RRTRGAYQTKGTHPTRYFQQKLDHFSKNCSRLWPQRYLINDAFY---KRGGPVFLLIGGF 79
Query: 116 GDIE--WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQ 173
+ W A+N +V A R GA+ + EHR+YG S P G A +L YL++ Q
Sbjct: 80 ETLSESWIAINKTWV-TYAERLGALFLLLEHRFYGHSQPTGDLSTA-----SLQYLSSRQ 133
Query: 174 ALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
ALAD F T + + + + VLFG SY G LA W R+K+P + A+ SSAPI
Sbjct: 134 ALADIVNFRTKIAEKMRLTKNKWVLFGCSYAGSLAVWSRIKHPDLFAAAVGSSAPIQAKA 193
Query: 234 DIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELN 291
+ + +V ++ CF TIKE++ ++ + + +L F LC+ +
Sbjct: 194 NFYE---YLEVVQRSLATHNSKCFQTIKEAFDQVAKMMKLSVYYSKLANDFKLCKPAKTY 250
Query: 292 STEDLADWLE 301
S D A +LE
Sbjct: 251 SAMDKAYFLE 260
>gi|24648177|ref|NP_650803.1| CG18493 [Drosophila melanogaster]
gi|23171717|gb|AAF55663.2| CG18493 [Drosophila melanogaster]
Length = 480
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 89/291 (30%), Positives = 139/291 (47%), Gaps = 20/291 (6%)
Query: 11 SLYLSPVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYET 70
+L L + ++ I ++ L + K + P V L +L P Q ++ +
Sbjct: 3 ALRLVCLFVVLAIGLVQSLDIP----KIKDVPLLVKTLKNLNRGPPHQVVTKRA--NVQE 56
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
++ Q+LD+F+ ++ T+ RYL+N + PIF+Y G E +IE V++G +D
Sbjct: 57 KWITQKLDNFNASNTQTYQMRYLLNDEF---QTEGSPIFIYLGGEWEIEESMVSAGHWYD 113
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-QNL 189
+A +LV+ EHRYYG+S+P + + L YL +QALAD AVFI K +N
Sbjct: 114 MAQEHNGVLVYTEHRYYGQSIP-----TSTMSTEDLKYLDVKQALADVAVFIETFKAENP 168
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
S V+L GGSY + W + YP + +G ASSAPIL D + +V F
Sbjct: 169 QLANSKVILAGGSYSATMVVWFKRLYPDLIVGGWASSAPILAKVDFTE---YKEVVGQAF 225
Query: 250 -KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
+ C++ I+ EL S+ + G E LC + DL W
Sbjct: 226 LQLGGQKCYDRIENGIAELESMFANKRG-AEARAMLRLCNSFDDQNDLDLW 275
>gi|157132186|ref|XP_001662504.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108871255|gb|EAT35480.1| AAEL012351-PA [Aedes aegypti]
Length = 478
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 104/345 (30%), Positives = 167/345 (48%), Gaps = 41/345 (11%)
Query: 17 VITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQR 76
+ +V + ++SP L A P + + + P L P+ RQ E ++ Q
Sbjct: 2 IFLVVALLLVSPALLEASPQTYWEQ---LHREPPLRGHPRNVTRQA------EVKWIMQY 52
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
+D+F + T+S RY+ N +H+ P GP+F++ G E +I V G +D+A G
Sbjct: 53 VDNFDPQNPSTWSMRYIQNGEHY-QPG--GPLFIFLGGEWEISPGYVMYGHFYDMAKELG 109
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEASP 195
A L + EHRYYG+S P ST + L +L +QALAD A F+ +++ + AE S
Sbjct: 110 AHLFYTEHRYYGQSRPTASTR-----SDLLKFLNIDQALADLAHFVEEMRRAIPGAENSK 164
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESA- 254
V++ GGSY + AW R KYPH+ G ASSAP+L D V + +VS +
Sbjct: 165 VIMAGGSYSATMVAWFRQKYPHLVDGGWASSAPLLAKLDFV---EYKEVVSESIRLVGGD 221
Query: 255 SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN--STEDLADWLESAYSYLA-MVD 311
+C + I+ ++ ++ +E ++ + F +C +N ++ D A +L S Y A +V
Sbjct: 222 ACADRIERAYEQIEDHLAREE-FDKVREEFKVCNNINFANSLDSAMFLSSISDYFAGVVQ 280
Query: 312 YPYPSDF---------------MMPLPGYPIREVCKKIDNAPDAT 341
Y P D M L + IR V + +D D+T
Sbjct: 281 YHSPGDIEGVCEIIMDDSIENDMEALANWFIRGVNQCMDMTYDST 325
>gi|194899984|ref|XP_001979537.1| GG23250 [Drosophila erecta]
gi|190651240|gb|EDV48495.1| GG23250 [Drosophila erecta]
Length = 480
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 96/321 (29%), Positives = 150/321 (46%), Gaps = 29/321 (9%)
Query: 11 SLYLSPVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYET 70
+L L + ++ I ++ L + K + P V L +L P Q ++
Sbjct: 3 ALRLVCLFVVLTIGLVHSLDIP----KIKDVPLLVKTLKNLNRGPPHQV--MTKRVNILE 56
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
++ Q+LD+F ++ T+ RYL+N + + PIF+Y G E +IE V++G +D
Sbjct: 57 KWITQKLDNFDASNTQTYQMRYLVNDEF---QTQGSPIFIYLGGEWEIEKSMVSAGHWYD 113
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-QNL 189
+A +LV+ EHRYYG+S+P S + L YL +QALAD A FI K +N
Sbjct: 114 MAEEHKGVLVYTEHRYYGQSVPTSS-----MSTDNLKYLNVKQALADVANFIETFKAENP 168
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
S V+L GGSY + W + YP + +G ASSAP+L D + +V F
Sbjct: 169 QLANSKVILAGGSYSATMVVWFKRLYPDLIVGGWASSAPLLAKVDFTE---YKEVVGQAF 225
Query: 250 -KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW-----LESA 303
+ S C++ IK EL S+ + G E LC + DL W + +
Sbjct: 226 LQLGSQQCYDRIKNGIAELESMFANKRG-AEAKAMLRLCDSFDDQNDLDLWTLFSSISNI 284
Query: 304 YSYLAMV----DYPYPSDFMM 320
+S +A D Y D++M
Sbjct: 285 FSGIAQYQSNNDIVYNCDYIM 305
>gi|339245689|ref|XP_003374478.1| putative serine protease [Trichinella spiralis]
gi|316972265|gb|EFV55948.1| putative serine protease [Trichinella spiralis]
Length = 490
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 87/270 (32%), Positives = 140/270 (51%), Gaps = 19/270 (7%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI--EWFAVNSGFVW 129
YF+Q LDHF T+ QRY +NT ++ P GP FL G EG W + +
Sbjct: 39 YFQQNLDHFHHQQNITWLQRYWVNTKYY-KPG--GPAFLLIGGEGPAISSWIQESEKYPK 95
Query: 130 D---IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
D A FGA+ EHRYYGES P + + L +LT++QALAD A FI+
Sbjct: 96 DWMKKAQTFGAICFMLEHRYYGESHPTDNMKTE-----NLRWLTSDQALADVANFISYAT 150
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ + S + FGGSY G+L+ W RLKYPH+ GA+ASSAP F V + + V
Sbjct: 151 TRYNLQGSRWITFGGSYAGLLSGWSRLKYPHLITGAVASSAP---FHIKVNFHEYLDSVF 207
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSY 306
K E+A+C + I+ + ++ + + E G E+ + LC+++ E +++ ++
Sbjct: 208 DSIKNENANCASEIENAVLQMRKLLRTEYGRREMKEKLSLCKKIKQAEGKD--VQNLFAT 265
Query: 307 LAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
+A + + + + P ++ +C+K+ +
Sbjct: 266 IADM-FSFIVQYNQPNTAMSMKNMCEKLTD 294
>gi|322795214|gb|EFZ18036.1| hypothetical protein SINV_11633 [Solenopsis invicta]
Length = 413
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 82/242 (33%), Positives = 124/242 (51%), Gaps = 30/242 (12%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPR 134
Q LDHF+ + T+S RY N+ GPI + G E I + G +++I +
Sbjct: 50 QPLDHFNHRENRTWSMRYKENSAFL---KNGGPILIMIGGEWQITDGYLQGGLMYEIGVK 106
Query: 135 FGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEAS 194
+G ++ + EHR+YG+S P + +++ +N L YL A+Q LAD A FI K+ + E S
Sbjct: 107 YGGLMYYTEHRFYGQSKP--TKDISSEN---LQYLNADQGLADLAYFIETKKKEKNLENS 161
Query: 195 PVVLFGGSYGGMLAAWMRLKYPH-----------------IAIGALASSAPILQFEDIVP 237
V++ GGSY G +AAW RLKYPH + GALASSAP+ D
Sbjct: 162 TVIVVGGSYAGNMAAWARLKYPHLIQVTFALSLYQNWERYLKKGALASSAPVKAKADFYE 221
Query: 238 PETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELNSTED 295
+Y V+ R S C +K ++ + + K+ G +EL F+LC ++NS+ D
Sbjct: 222 ---YYEFVAKSLGRHSKKCVENVKIAFASVEELLAKQCGAIELKYLFNLCEPPDINSSSD 278
Query: 296 LA 297
L
Sbjct: 279 LG 280
>gi|157120503|ref|XP_001653636.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108874911|gb|EAT39136.1| AAEL009038-PA, partial [Aedes aegypti]
Length = 495
Score = 131 bits (330), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 86/273 (31%), Positives = 141/273 (51%), Gaps = 21/273 (7%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+T +F+Q LDH + T+ QRY +N +++ GP+FL G EG+ + G
Sbjct: 47 QTLWFDQLLDHNDPTNPTTWKQRYNVNDEYFKNEEDNGPVFLMIGGEGEATARWMTEGAW 106
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
A +GA+ EHR+YG+S P + +++ +N L+YLT+EQALAD A FI +KQ
Sbjct: 107 IRYAQEYGALCFQLEHRFYGKSHP--TEDLSTKN---LAYLTSEQALADLAYFIEAMKQK 161
Query: 189 LS-AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
++ + FGGSY G LAAW+R KYP++ G+++SS P+L D + ++ +V+
Sbjct: 162 YQLGRSNRWIAFGGSYPGSLAAWLREKYPYLVYGSISSSGPLLAKIDF---KEYFEVVAD 218
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC----RELNSTEDLADWLESA 303
R S C ++ + ++ ++ + G L F LC + + + DL+ E+
Sbjct: 219 SLNRYSPECVANVRSAMSQVETLLKHMIGQRNLNDKFKLCDPVEKSIANELDLSSLFEAI 278
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
S A V +P+ I EVC + N
Sbjct: 279 ASNFAGVVHPHAR--------LTIDEVCDVMMN 303
>gi|405945266|gb|EKC17248.1| Thymus-specific serine protease [Crassostrea gigas]
Length = 519
Score = 131 bits (329), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 91/321 (28%), Positives = 153/321 (47%), Gaps = 32/321 (9%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPR 134
Q +DHF + ++Q+ INT++W+ P+ GPIFL+ G EG + + SG ++A R
Sbjct: 60 QPIDHFDALNSEMYNQKVYINTENWIKPS--GPIFLFIGGEGALSNRSAYSGHHVEMAKR 117
Query: 135 FGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEAS 194
+GAM+V EHR+YG S+ + L +L+++Q LAD + ++
Sbjct: 118 YGAMVVAAEHRFYGSSINDNGLHL-----DQLEHLSSQQGLADLTRVHKYITDRYELTSN 172
Query: 195 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRE-- 252
+ FGGSY G L+AW RLKYPH+ GA+ASSAP+ + E + +V+
Sbjct: 173 KWISFGGSYPGALSAWFRLKYPHLVYGAVASSAPVQAQTNF---EGYNEVVAQSLTDSTV 229
Query: 253 --SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMV 310
S C I E++ + S+ Q N ++L K F C L+ D ++ + A V
Sbjct: 230 GGSQQCIKQIVEAFQRIDSMIQA-NQTVQLEKDFLSCGPLSEKNDQMVFVNNLAGIFAGV 288
Query: 311 DYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGV----------SVYYNYTGN 360
+ +PG I+ +CK++ + D+ L+ +++ + Y N+
Sbjct: 289 -----VQYNNEVPGLNIQHLCKQMTKSDDSYKNLQMVYKMAMKLLNQSCVDNSYSNFLSQ 343
Query: 361 VDCFQLDDDPH--GLDGWNWQ 379
+ +D G+ W WQ
Sbjct: 344 FNNQTVDQAASGVGIRQWTWQ 364
>gi|118346661|ref|XP_976865.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89288576|gb|EAR86564.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 480
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 74/167 (44%), Positives = 100/167 (59%), Gaps = 8/167 (4%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
Y+ Q LDHF+ D T+ QRY I +D + N G +F+Y G EG + + G++ ++
Sbjct: 42 YYTQVLDHFNPNDQRTWQQRYAIYSDEYNPVN--GTVFVYIGGEGKQKGLSPGLGWMVEL 99
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK----Q 187
A +F A+ + EHR+YG S P+G E +Y N L+YL+ EQAL D A I N K
Sbjct: 100 AKKFSALFLIVEHRFYGASQPFGKDENSYSNQN-LAYLSVEQALEDLAQIIANFKTLRLH 158
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
LS E P + GGSY G ++AW R KYPH+ +GALASSA IL ED
Sbjct: 159 GLS-ENVPFITIGGSYPGAVSAWFRSKYPHLVVGALASSAVILPVED 204
>gi|328718795|ref|XP_001947661.2| PREDICTED: putative serine protease K12H4.7-like isoform 1
[Acyrthosiphon pisum]
gi|328718797|ref|XP_003246580.1| PREDICTED: putative serine protease K12H4.7-like isoform 2
[Acyrthosiphon pisum]
Length = 501
Score = 130 bits (328), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 123/228 (53%), Gaps = 12/228 (5%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E +F Q+LDHF+ + T+ QRY +N +++ GP+FL G EG I ++SG
Sbjct: 43 EDEWFIQKLDHFNPTNNRTWKQRYQVNLENYKND---GPVFLMIGGEGKISDKWMHSGAW 99
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
D A F A+ EHRYYGES P + + L YL+++QALAD A FI N+K
Sbjct: 100 IDYAKEFNALCFQLEHRYYGESHP-----TEDMSTSNLVYLSSDQALADLAEFIVNIKIK 154
Query: 189 LSAEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ ++ V FGGSY G LAAW+R+KYPH+ A++SS P+L D + ++ +V +
Sbjct: 155 YNIPSTAKWVAFGGSYPGTLAAWLRMKYPHLIHAAVSSSGPLLAKIDF---KEYFMVVEN 211
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED 295
+ C + IK++ + + + G + K F LC L+ D
Sbjct: 212 ALATYNPECVSQIKQANQMINYYLKTDQGAKIIEKKFKLCDPLDRKND 259
>gi|301624087|ref|XP_002941341.1| PREDICTED: putative serine protease K12H4.7 [Xenopus (Silurana)
tropicalis]
Length = 516
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 87/233 (37%), Positives = 125/233 (53%), Gaps = 16/233 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+FEQRL+HFS AD + QRY +N + GP+FL G EG + SG
Sbjct: 75 WFEQRLNHFSGADTRVWQQRYFMNDTY---RQSGGPVFLMIGGEGPANPAWMTSGAWLTY 131
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A + GA+ + EHR+YG+S P + +++ N L YL+++QALAD A F T +K+ L
Sbjct: 132 AEKLGALCLMLEHRFYGKSHP--TQDISTDN---LHYLSSQQALADIAHFQTVMKEKLGL 186
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR 251
S V+FGGSY G L+AW R+KYPH+A A+ASSAP+ + V + +V
Sbjct: 187 ADSKWVVFGGSYPGSLSAWYRMKYPHLAHMAVASSAPV---KVQVNFSEYLEVVQLALSL 243
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY 304
+ C IK + E VS + +LT F LC L + W+++AY
Sbjct: 244 NHSDCPEAIKMA-SEEVSKLLVLSSFQKLTDDFRLCELLQ----INSWMDTAY 291
>gi|171847157|gb|AAI61646.1| LOC100145763 protein [Xenopus (Silurana) tropicalis]
Length = 494
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 87/233 (37%), Positives = 125/233 (53%), Gaps = 16/233 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+FEQRL+HFS AD + QRY +N + GP+FL G EG + SG
Sbjct: 53 WFEQRLNHFSGADTRVWQQRYFMNDTY---RQSGGPVFLMIGGEGPANPAWMTSGAWLTY 109
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A + GA+ + EHR+YG+S P + +++ N L YL+++QALAD A F T +K+ L
Sbjct: 110 AEKLGALCLMLEHRFYGKSHP--TQDISTDN---LHYLSSQQALADIAHFQTVMKEKLGL 164
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR 251
S V+FGGSY G L+AW R+KYPH+A A+ASSAP+ + V + +V
Sbjct: 165 ADSKWVVFGGSYPGSLSAWYRMKYPHLAHMAVASSAPV---KVQVNFSEYLEVVQLALSL 221
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY 304
+ C IK + E VS + +LT F LC L + W+++AY
Sbjct: 222 NHSDCPEAIKMA-SEEVSKLLVLSSFQKLTDDFRLCELLQ----INSWMDTAY 269
>gi|195376151|ref|XP_002046860.1| GJ13120 [Drosophila virilis]
gi|194154018|gb|EDW69202.1| GJ13120 [Drosophila virilis]
Length = 513
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 91/280 (32%), Positives = 142/280 (50%), Gaps = 18/280 (6%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
QQ E +FEQRLDH D T+ QRY +N D + + P+FL G EG+
Sbjct: 52 QQSMDVEDLWFEQRLDHLQPDDTRTWQQRYFVN-DAFYRNDSHAPVFLMIGGEGEATKKW 110
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
++ G A FGA+ + EHR+YG+S P +++++ N L+YL++EQALAD A F+
Sbjct: 111 MHEGAWVRYAEHFGALCIQLEHRFYGKSHP--TSDLSTSN---LAYLSSEQALADLANFV 165
Query: 183 TNLKQNLSAEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
T +K + +A + FGGSY G LAAW R KYPH+ G+++SS P+L D +
Sbjct: 166 TTMKTKYNMDAKQKWIAFGGSYPGSLAAWAREKYPHLIDGSISSSGPLLAQVDF---SQY 222
Query: 242 YNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL-NSTE---DLA 297
+ +V S C + ++ + + G L + F C L +S E D++
Sbjct: 223 FEVVKSSLASYKPECVEAVTRGIAQVEILLKHMIGQRNLDEKFKTCTPLKDSIENQLDIS 282
Query: 298 DWLES-AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
+ E+ A ++ +V Y + P I E+C + N
Sbjct: 283 NLFENIAGNFAGVVQYNKDNS---PHANITIDEICDVMLN 319
>gi|195338045|ref|XP_002035636.1| GM14810 [Drosophila sechellia]
gi|194128729|gb|EDW50772.1| GM14810 [Drosophila sechellia]
Length = 508
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 93/305 (30%), Positives = 152/305 (49%), Gaps = 23/305 (7%)
Query: 38 FRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTD 97
FRR G L ++ P Q+ E +FEQRLDHF +D T+ QRY +N D
Sbjct: 29 FRRGRLTKGFLGEPSKIPTLQRNLHS-----EDLWFEQRLDHFKSSDKRTWQQRYFVNAD 83
Query: 98 HWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE 157
+ + P+FL G EG+ + G A FGA+ + EHR+YG+S P + +
Sbjct: 84 FYRNDSS-APVFLMIGGEGEASAKWMREGAWVHYAEHFGALCLQLEHRFYGKSHP--TAD 140
Query: 158 VAYQNATTLSYLTAEQALADFAVFITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYP 216
++ +N L YL++EQAL D A F+T +K + + + FGGSY G LAAW R KYP
Sbjct: 141 LSTEN---LHYLSSEQALEDLASFVTAMKVKFNLGDGQKWIAFGGSYPGSLAAWAREKYP 197
Query: 217 HIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENG 276
+ G+++SS P+L D + ++ +V + C + + S+ ++ + + G
Sbjct: 198 QLIYGSISSSGPLLAEVDF---KEYFEVVKASLASYKPECLDAVTRSFAQVEILLKHMIG 254
Query: 277 LLELTKTFHLCREL-NSTE---DLADWLES-AYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L + F C + +S E D+A++ E+ A ++ +V Y + P I ++C
Sbjct: 255 QRSLDEKFKTCTPIKDSIENDLDMANFFENLAGNFAGVVQYNKDNS---PHAKITIDDIC 311
Query: 332 KKIDN 336
+ N
Sbjct: 312 DVMLN 316
>gi|440803021|gb|ELR23935.1| serine carboxypeptidase [Acanthamoeba castellanii str. Neff]
Length = 481
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 83/251 (33%), Positives = 127/251 (50%), Gaps = 21/251 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+ +++ QRLDHF+ + T+ QRY IN W P+ GPIF G EG + V +
Sbjct: 37 KEQWYTQRLDHFNGQETRTWKQRYFINDTFW-NPSAPGPIFFQMGGEGAVSGEDVVLLQM 95
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
+ GA++V EHR+YG S P + +L +L++EQALAD A F+ LK
Sbjct: 96 VQYGIKHGALMVTLEHRFYGTSQPLPDLSI-----ESLRFLSSEQALADAAEFLLWLKDQ 150
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI------LQFEDIVPPETFY 242
A SP++ FG SY G LAAW RLKYPH+ ++ASSAP+ ++ D+V Y
Sbjct: 151 YQAPKSPIITFGCSYPGALAAWFRLKYPHVTYASVASSAPVEATLDFFEYLDVVDQSLEY 210
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES 302
+ C IK++ + + G +L F+ C + + D+A++ S
Sbjct: 211 FV--------GDKCVANIKQATTAVSQLMASPGGRAKLQSLFNFCGPIQNELDIANFYSS 262
Query: 303 -AYSYLAMVDY 312
A +++ V Y
Sbjct: 263 LAGNWMGTVQY 273
>gi|66821261|ref|XP_644128.1| hypothetical protein DDB_G0274663 [Dictyostelium discoideum AX4]
gi|60472271|gb|EAL70224.1| hypothetical protein DDB_G0274663 [Dictyostelium discoideum AX4]
Length = 487
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 80/236 (33%), Positives = 130/236 (55%), Gaps = 16/236 (6%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
++F R+DH+ + TF Q++ +N D + P P+F G EG + V FV++
Sbjct: 56 QWFTNRVDHYDPQNRNTFKQKFYVN-DTYYTPG--SPVFYILGGEGPVGASYVTGHFVFN 112
Query: 131 -IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
A +F A+LV EHR+YG+S+P GS + L YLT +QALAD+A F+ L Q
Sbjct: 113 QYAQKFNALLVAIEHRFYGDSIPMGSLSLE-----NLKYLTTQQALADYAAFVPFLTQKY 167
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
+ +S + FGGSY G L+ W+RLKYP + A+A+SAP+ D PE ++ +VS
Sbjct: 168 NTGSSKWISFGGSYSGNLSGWLRLKYPQLISAAIATSAPVKAQLDF--PE-YFEVVSQSI 224
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
++ + I ++ +++ GQ + ++ + F C + S D+A ++ES S
Sbjct: 225 GPTCSAIVSNITQTVTTMLNNGQND----QVQQMFSACDPIVSKLDIATFMESLSS 276
>gi|109070008|ref|XP_001094051.1| PREDICTED: thymus-specific serine protease isoform 2 [Macaca
mulatta]
Length = 514
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 89/271 (32%), Positives = 135/271 (49%), Gaps = 30/271 (11%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HWVG + GPIFL+ G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSDRRSFLQRYWVNEQHWVGED--GPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P G E+A L +L++ ALAD L + N+
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRHALADVVSARLALSRLFNI 173
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL------QFEDIVPPETFYN 243
S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ ++ D+V
Sbjct: 174 SS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDFSEYNDVVSRSLMST 232
Query: 244 IVSSDFKRESASCFNTIKESWGEL---VSVGQKENGLLELTKTFHLCRELNSTEDLADWL 300
+ + C + ++ E+ + +G L C L S E+ A+ L
Sbjct: 233 AIGGSLE-----CRAAVSAAFAEVERRLRLGGAAQAALR--SELSACGTLGSAENQAELL 285
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
A L Y PL +R++C
Sbjct: 286 -GALQALVGGAVQYDGQAGAPL---SVRQLC 312
>gi|395859218|ref|XP_003801939.1| PREDICTED: thymus-specific serine protease [Otolemur garnettii]
Length = 771
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 82/237 (34%), Positives = 124/237 (52%), Gaps = 20/237 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HW ++ GP+FL+ G EG + +V G +
Sbjct: 61 WLEQLLDPFNSSDRRSFLQRYWVNDQHWT--HQDGPVFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
AP FGA+++ EHR+YG S+P G +VA L +L++ ALAD +L + +
Sbjct: 119 APAFGALVISLEHRFYGLSIPAGGLDVAQ-----LRFLSSRHALADVVSARLSLSRLFNV 173
Query: 192 EA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ D + ++VS K
Sbjct: 174 SSLSPWICFGGSYAGSLAAWARLKFPHLIFASIASSAPVRAVLDF---SEYNDVVSRSLK 230
Query: 251 R----ESASCFNTIKESWGEL---VSVGQKENGLLELTKTFHLCRELNSTEDLADWL 300
S C + ++ E+ +S G L C L+ +ED A+ L
Sbjct: 231 SIAIGGSLECRAAVSAAFAEVERRLSAGGAVRAALR--AELGACGSLSRSEDQAELL 285
>gi|54648564|gb|AAH85041.1| LOC495469 protein, partial [Xenopus laevis]
Length = 502
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 90/249 (36%), Positives = 132/249 (53%), Gaps = 21/249 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
Y Q LDHF+ + T++QRY IN +W P+ GP+FLY G E + F+V SG ++
Sbjct: 60 YIVQPLDHFNRLNNATYNQRYWINEQYWNHPD--GPVFLYIGGESSLSEFSVLSGEHIEL 117
Query: 132 APRFGAMLVFPEHRYYGESM-PYGSTEVAYQNATTLSYLTAEQALADFAVF--ITNLKQN 188
A A+LV EHRYYG S+ P G T + +L+++QALAD A F + K N
Sbjct: 118 AQTHRALLVSLEHRYYGSSINPDGLT------LENIKFLSSQQALADLASFHMFISQKYN 171
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
L+ + + + FGGSY G L+AW RLK+PH+ A+ASSAP+ D + +D
Sbjct: 172 LTRQ-NTWICFGGSYPGSLSAWFRLKFPHLVYAAVASSAPVRAELDFTGYNKVVALSLAD 230
Query: 249 -FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED-------LADWL 300
S C + +K+ + + S+ QK N + ++ K F+ C L ++D LAD
Sbjct: 231 PVIGGSEKCLDAVKKGFQAVDSLLQKGN-VTQMEKDFYSCGSLQGSDDYSEFVGNLADIF 289
Query: 301 ESAYSYLAM 309
A Y M
Sbjct: 290 MGAVQYNGM 298
>gi|126631396|gb|AAI33756.1| LOC495469 protein [Xenopus laevis]
Length = 505
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 90/249 (36%), Positives = 132/249 (53%), Gaps = 21/249 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
Y Q LDHF+ + T++QRY IN +W P+ GP+FLY G E + F+V SG ++
Sbjct: 63 YIVQPLDHFNRLNNATYNQRYWINEQYWNHPD--GPVFLYIGGESSLSEFSVLSGEHIEL 120
Query: 132 APRFGAMLVFPEHRYYGESM-PYGSTEVAYQNATTLSYLTAEQALADFAVF--ITNLKQN 188
A A+LV EHRYYG S+ P G T + +L+++QALAD A F + K N
Sbjct: 121 AQTHRALLVSLEHRYYGSSINPDGLT------LENIKFLSSQQALADLASFHMFISQKYN 174
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
L+ + + + FGGSY G L+AW RLK+PH+ A+ASSAP+ D + +D
Sbjct: 175 LTRQ-NTWICFGGSYPGSLSAWFRLKFPHLVYAAVASSAPVRAELDFTGYNKVVALSLAD 233
Query: 249 -FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED-------LADWL 300
S C + +K+ + + S+ QK N + ++ K F+ C L ++D LAD
Sbjct: 234 PVIGGSEKCLDAVKKGFQAVDSLLQKGN-VTQMEKDFYSCGSLQGSDDYSEFVGNLADIF 292
Query: 301 ESAYSYLAM 309
A Y M
Sbjct: 293 MGAVQYNGM 301
>gi|170068625|ref|XP_001868940.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167864603|gb|EDS27986.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 366
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 87/275 (31%), Positives = 141/275 (51%), Gaps = 17/275 (6%)
Query: 53 EPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYC 112
+ P R ++ +T +F+Q LDH + T+ QRY +N D + GP+FL
Sbjct: 33 DGPSRNALLREVSGDVKTLWFDQLLDHNDPTNAATWKQRYYVN-DAYFDDRTSGPVFLMI 91
Query: 113 GNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAE 172
G EG+ +N G A GA+ EHR+YG+S P G A L YLT+E
Sbjct: 92 GGEGEATARWMNEGAWIRYAKEHGALCFQLEHRFYGKSHPTGDLSTA-----NLGYLTSE 146
Query: 173 QALADFAVFI--TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
QALAD A F+ N K L+A+ + + FGGSY G LAAW+R KYP++ G+++SS P+L
Sbjct: 147 QALADLAYFVEAMNEKYQLTAQ-NRWIAFGGSYPGSLAAWLREKYPYLVHGSVSSSGPLL 205
Query: 231 QFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC--- 287
D + ++ +V + + S C ++ + G++ ++ + G + + F LC
Sbjct: 206 AKIDF---KEYFQVVVNSLQSYSPQCVEDVRNAMGQVETLLKHMIGQRSINEKFKLCDPV 262
Query: 288 -RELNSTEDLADWLESAYSYLAMVDYPYPSDFMMP 321
+ ++++ D+A E+ A V Y D ++P
Sbjct: 263 EKSISNSLDIASLFEAVAGNFAGV-VQYNKDNILP 296
>gi|332245708|ref|XP_003271995.1| PREDICTED: thymus-specific serine protease isoform 1 [Nomascus
leucogenys]
Length = 514
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 91/270 (33%), Positives = 138/270 (51%), Gaps = 28/270 (10%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HWVG + GPIFL+ G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSDRRSFLQRYWVNDQHWVGQD--GPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P G E+A L +L++ ALAD L + N+
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRHALADVVSARLALSRLFNV 173
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ D + ++VS
Sbjct: 174 SS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDF---SEYNDVVSRSL 229
Query: 250 KRE----SASCFNTIKESWGE----LVSVGQKENGLLELTKTFHLCRELNSTEDLADWLE 301
S C + ++ E L S G + L +C L E+ A+ L
Sbjct: 230 MSTEIGGSLECRAAVSVAFAEVERRLRSGGAAQAA---LRTELSVCGPLGRAENQAELLG 286
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
+ + + V Y +PL +R++C
Sbjct: 287 ALQALVGGV-VQYDGQAGVPL---SVRQLC 312
>gi|24659664|ref|NP_648067.2| CG9953 [Drosophila melanogaster]
gi|7295307|gb|AAF50628.1| CG9953 [Drosophila melanogaster]
Length = 508
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 93/305 (30%), Positives = 152/305 (49%), Gaps = 23/305 (7%)
Query: 38 FRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTD 97
FRR G L ++ P Q+ E +FEQRLDHF +D T+ QRY +N D
Sbjct: 29 FRRGRLTKGFLGEPSKIPTLQRSLHS-----EDLWFEQRLDHFKSSDKRTWQQRYFVNAD 83
Query: 98 HWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE 157
+ + P+FL G EG+ + G A FGA+ + EHR+YG+S P + +
Sbjct: 84 FYRNDSS-APVFLMIGGEGEASAKWMREGAWVHYAEHFGALCLQLEHRFYGKSHP--TAD 140
Query: 158 VAYQNATTLSYLTAEQALADFAVFITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYP 216
++ +N L YL++EQAL D A F+T +K + + + FGGSY G LAAW R KYP
Sbjct: 141 LSTEN---LHYLSSEQALEDLASFVTAMKVKFNLGDGQKWIAFGGSYPGSLAAWAREKYP 197
Query: 217 HIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENG 276
+ G+++SS P+L D + ++ +V + C + + S+ ++ + + G
Sbjct: 198 ELIYGSISSSGPLLAEVDF---KEYFEVVKASLAAYKPECVDAVTRSFAQVEILLKHMIG 254
Query: 277 LLELTKTFHLCREL-NSTE---DLADWLES-AYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L + F C + +S E D+A++ E+ A ++ +V Y + P I ++C
Sbjct: 255 QRSLDEKFKTCTPIKDSIENDLDMANFFENLAGNFAGVVQYNKDNS---PHATITIDDIC 311
Query: 332 KKIDN 336
+ N
Sbjct: 312 DVMLN 316
>gi|195588332|ref|XP_002083912.1| GD13982 [Drosophila simulans]
gi|194195921|gb|EDX09497.1| GD13982 [Drosophila simulans]
Length = 508
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 92/305 (30%), Positives = 152/305 (49%), Gaps = 23/305 (7%)
Query: 38 FRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTD 97
FRR G L ++ P Q+ E +FEQRLDHF +D T+ QRY +N D
Sbjct: 29 FRRGRLTKGFLGEPSKIPTLQRNLHS-----EDLWFEQRLDHFKSSDKRTWQQRYFVNAD 83
Query: 98 HWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE 157
+ + P+FL G EG+ + G A FGA+ + EHR+YG+S P + +
Sbjct: 84 FYRNDSS-APVFLMIGGEGEASAKWMREGAWVHYAEHFGALCLQLEHRFYGKSHP--TAD 140
Query: 158 VAYQNATTLSYLTAEQALADFAVFITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYP 216
++ +N L YL++EQAL D A F+T +K + + + FGGSY G LAAW R KYP
Sbjct: 141 LSTEN---LHYLSSEQALEDLASFVTAMKVKFNLGDGQKWIAFGGSYPGSLAAWAREKYP 197
Query: 217 HIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENG 276
+ G+++SS P+L D + ++ +V + C + + S+ ++ + + G
Sbjct: 198 QLIYGSISSSGPLLAEVDF---KEYFEVVKASLVSYKPECLDAVTRSFAQVEILLKHMIG 254
Query: 277 LLELTKTFHLCREL-NSTE---DLADWLES-AYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L + F C + +S E D++++ E+ A ++ +V Y + P I ++C
Sbjct: 255 QRSLDEKFKTCTPIKDSIENDLDMSNFFENLAGNFAGVVQYNKDNS---PHASITIDDIC 311
Query: 332 KKIDN 336
+ N
Sbjct: 312 DVMLN 316
>gi|156391046|ref|XP_001635580.1| predicted protein [Nematostella vectensis]
gi|156222675|gb|EDO43517.1| predicted protein [Nematostella vectensis]
Length = 444
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 81/224 (36%), Positives = 112/224 (50%), Gaps = 10/224 (4%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F QRLDHF ++ T+ QR+ N + P+FL G EG I V G +
Sbjct: 18 WFIQRLDHFDDSNTETWKQRFYYNDT--FRKTKDSPVFLMVGGEGAISPVWVLIGNMMKY 75
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A FGAM EHR+YG+S P A L YL +EQALAD A F + +
Sbjct: 76 AEGFGAMAFILEHRFYGQSHPRSDMSDA-----NLKYLNSEQALADLAAFRQAMSVKFNL 130
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR 251
S + FGGSY G L+AW+RLKYPH+ GA+ASSAP+L + PE + +V++ +
Sbjct: 131 TDSKWISFGGSYPGSLSAWLRLKYPHLIHGAVASSAPVL--AQLNFPE-YLEVVTASLET 187
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED 295
C I + + + + G +LT F +C LN D
Sbjct: 188 TGPDCTKNIANATAAIEELLDADEGTKKLTNLFRVCEPLNRRND 231
>gi|195451231|ref|XP_002072825.1| GK13807 [Drosophila willistoni]
gi|194168910|gb|EDW83811.1| GK13807 [Drosophila willistoni]
Length = 481
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 94/304 (30%), Positives = 145/304 (47%), Gaps = 26/304 (8%)
Query: 52 TEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
+EPP +Q+ Q ET + EQ+LDHF+ D T+ RY++N + GP+F++
Sbjct: 36 SEPPVPVNQQRVDQV--ETLWIEQKLDHFNDEDTRTWQMRYMLNEALY---ESGGPLFIF 90
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
G E +I + SG ++D+A +L + EHR+YGES P V +L YL+
Sbjct: 91 LGGEWEISTGRITSGHMYDMAKEHKGLLAYTEHRFYGESKPLDDLSVE-----SLEYLSV 145
Query: 172 EQALADFAVFITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
+QALAD A FI K N + A S V++ GGSY + W + YP + G +SSAP+
Sbjct: 146 KQALADLAHFIRTQKANYAGLADSKVIIVGGSYSASMVVWFKRTYPDLVAGGWSSSAPLY 205
Query: 231 QFEDIVPPETFYNIVSSDFKRESAS-CFNTIKESWGELVSVGQKENGLLELTKTFHLCRE 289
+ V + I + S C+N I++ EL + + G E+ LC
Sbjct: 206 AKVNFVE---YKEITGQSIAQVGGSACYNRIEKGISELEQLLADKRG-AEVKALLKLCEP 261
Query: 290 LNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPD----ATSILE 345
+ DL W + +S ++ + + + I VC KI + PD TS L
Sbjct: 262 FDVNSDLDVW--TLFSEISDI----FAGVVQTHNAGQIEGVCDKILSEPDDLIGVTSYLL 315
Query: 346 RIFE 349
+FE
Sbjct: 316 GVFE 319
>gi|326429444|gb|EGD75014.1| hypothetical protein PTSG_07238 [Salpingoeca sp. ATCC 50818]
Length = 502
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 99/347 (28%), Positives = 168/347 (48%), Gaps = 39/347 (11%)
Query: 12 LYLSPVITIVIISILSPLSLAA---QPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQ--- 65
+ L+ ++ + + L+ +LA Q S R + VG + R+Q
Sbjct: 1 MTLATLVGVAVFVALACGTLAVPQQQHSGSARRHKHVGAMNQGLLHVMRKQGHDTSDIFT 60
Query: 66 -YRYETRYFEQRLDHFSFADLPTFSQRYLIN-TDHWVGPNRLGPIFLYCGNEGDI--EWF 121
+ ++F Q++DHF+ D TF Q+Y +N T H G GP+FL G EG W
Sbjct: 61 PVSADVKWFTQKVDHFNPQDTRTFQQQYQVNATYHKQG----GPVFLMLGGEGPASPRWL 116
Query: 122 AVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF 181
+++ + A + A++V EHR+YG+S P+ + L YL++EQALAD A F
Sbjct: 117 EIDTAIMI-YARQHDAVVVQLEHRFYGKSQPFKDLSTDH-----LQYLSSEQALADAANF 170
Query: 182 ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
+T+ +P V+FGGSY G LAA+ R KYPH+ GA+++S+P+ D +
Sbjct: 171 LTSFM-----PGAPAVVFGGSYSGALAAFFRSKYPHLVNGAISTSSPVYALVDF---HQY 222
Query: 242 YNIVSSDFK---RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD 298
+ +V + + C I + ++ ++ + NG +L K F+LC + + T D D
Sbjct: 223 HEVVRNSLATVPHNGSHCSAAIATATEKIQTMLKTTNGRKQLAKDFNLCGDSDVTHD--D 280
Query: 299 WLESAYSYLA-----MVDYPYPSDFMMPLPGYP-IREVCKKIDNAPD 339
+E+ ++ LA +V Y ++ P I +VC + P+
Sbjct: 281 DIETLFTNLAGNIDGVVQYNLDNNHFEGRTKVPTITDVCAVMAATPN 327
>gi|281202858|gb|EFA77060.1| hypothetical protein PPL_09813 [Polysphondylium pallidum PN500]
Length = 487
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 83/250 (33%), Positives = 141/250 (56%), Gaps = 23/250 (9%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
++F QR+DHF+ A+ TF QRY++N +W G GP+F EG + V +
Sbjct: 60 QWFTQRVDHFNQANQQTFQQRYIVNDAYWNGN---GPVFFMLNGEGPMSLGTVTGLQFVN 116
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A FGA++V EHRY+G S + + +++ N L YL+++QALAD A F + + L+
Sbjct: 117 WAQEFGALIVTLEHRYFGAS--FTTEDLSTDN---LQYLSSQQALADNAAFRQFIAETLN 171
Query: 191 AEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI------LQFEDIVPPETFYN 243
AS V FGGSY G L +W R+KYP + +ASSAP+ Q+ ++V
Sbjct: 172 VPASSQWVSFGGSYSGALTSWFRIKYPALVDYTVASSAPVNAEVNFYQYLEVVQNSL--- 228
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES- 302
+ +S+ ++ C + I + G++ ++ + +GL ++ F+LC L S D+A++++S
Sbjct: 229 LATSNGQQ----CIDNIAAATGKIQAMLESADGLASVSNMFNLCPPLASQNDVANFMQSL 284
Query: 303 AYSYLAMVDY 312
A +++ +V Y
Sbjct: 285 AGNFMGVVQY 294
>gi|328718793|ref|XP_001947565.2| PREDICTED: putative serine protease K12H4.7-like [Acyrthosiphon
pisum]
Length = 490
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 75/225 (33%), Positives = 118/225 (52%), Gaps = 13/225 (5%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E ++F+Q++DHF+ +D T+ QRY +N H+ GPIFL G E +I + SG
Sbjct: 44 EDKWFKQKVDHFNPSDTRTWKQRYHMNLQHY---KHGGPIFLSIGGEEEITHNWMTSGAW 100
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
+ A + AM EHRYYG S P + + L YLT EQ LAD FI+ + +
Sbjct: 101 IEYAKKLNAMCFQLEHRYYGRSHPTDNLKTK-----NLKYLTVEQVLADLETFISTISND 155
Query: 189 --LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+ + ++FGGSY G LAAW+R+KYPH+ A++SS+P++ D + FY +
Sbjct: 156 NEETLRNAKWIVFGGSYSGSLAAWLRMKYPHLVYAAVSSSSPLMAKIDY---KDFYMAIQ 212
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN 291
+ + C + I E+ + + + G + K F C ++N
Sbjct: 213 NTISNYNPKCASNIIEATSTISDLLETSYGAKYVDKKFKTCSDMN 257
>gi|170066901|ref|XP_001868269.1| thymus-specific serine protease [Culex quinquefasciatus]
gi|167863077|gb|EDS26460.1| thymus-specific serine protease [Culex quinquefasciatus]
Length = 484
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 90/304 (29%), Positives = 153/304 (50%), Gaps = 32/304 (10%)
Query: 49 PHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPI 108
P +P +R R Q T++ +Q+LD+F + T+S RY+ N +H+V + P+
Sbjct: 36 PVRGDPAKRVTRAQ-----VSTKWIKQKLDNFDPQNPSTWSMRYMENGEHYVPGS---PL 87
Query: 109 FLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSY 168
F++ G E I ++ G +D+A A L + EHRYYG+S P +T + +
Sbjct: 88 FIFVGGEWTISAGSIQQGHFYDMAAEHRAYLFYTEHRYYGQSRPTVNTRT-----DQMRF 142
Query: 169 LTAEQALADFAVFITNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
L +QALAD A F+ +++ + AE S V++ GGSY + W R KYPH+ G ASSA
Sbjct: 143 LNVDQALADLAHFVEEMRRTIPGAENSKVIMVGGSYSATMVVWFRQKYPHLVNGVWASSA 202
Query: 228 PILQFEDIVPPETFYNIVSSDFKRESA-SCFNTIKESWGELVSVGQKENGLLELTKTFHL 286
P+L D + +VS + +C + ++ E+ + K+ ++ + F+L
Sbjct: 203 PLLAKLDFTE---YKEVVSESIRLVGGDACADRVQRGVAEVEDL-IKQGSYDQVAQAFNL 258
Query: 287 C--RELNSTEDLADWLES-AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSI 343
C +L++T D +L S + ++ +V Y + D I VCK I N P+ +
Sbjct: 259 CADTDLSNTRDRQGFLSSISDTFAGVVQYHWSGD---------IEGVCKVI-NDPNYNTD 308
Query: 344 LERI 347
+E +
Sbjct: 309 MEAL 312
>gi|170066899|ref|XP_001868268.1| thymus-specific serine protease [Culex quinquefasciatus]
gi|167863076|gb|EDS26459.1| thymus-specific serine protease [Culex quinquefasciatus]
Length = 485
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 104/345 (30%), Positives = 167/345 (48%), Gaps = 43/345 (12%)
Query: 12 LYLSPVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQR---QQRQQQQQYRY 68
L S + + +++ L S + P+ F R R EPP R ++ +
Sbjct: 3 LLFSIGLAVGLVTFLEATS--SGPAAFERLHR---------EPPIRGDPAKKDSGPRAPV 51
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
T++ Q++D+F + T+S RY+ N +++ N G +F+Y G E I ++ G
Sbjct: 52 LTKWIMQKVDNFDPQNPSTWSMRYMDNGEYY---NPGGALFIYVGGEWTINEGSLVRGHF 108
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
D+A GA + + EHRYYG S P +T + +L +QALAD A F+ +++
Sbjct: 109 HDMARELGAYIFYTEHRYYGLSRPTANTRT-----DQMRFLNVDQALADLAHFVEEMRRT 163
Query: 189 L-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ AE + V++ GGSY + AW R KYPH+ GA ASSAP+L D T Y V S
Sbjct: 164 IPGAENAKVIMAGGSYSATMVAWFRQKYPHLINGAWASSAPLLAKLDF----TEYKEVVS 219
Query: 248 DFKR--ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC--RELNSTEDLADWLESA 303
D R +C + ++ E+ + K+ ++ + F+LC +L T D ++L S
Sbjct: 220 DSIRLVGGDACADRVQRGVAEVEDL-IKQGSYDQVAQAFNLCASTDLTKTLDKQNFLSSI 278
Query: 304 YSYLA-MVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERI 347
Y A +V Y +P D I VC+ I N P T+ +E +
Sbjct: 279 SDYFAGVVQYHWPGD---------IEGVCEVI-NDPSYTTDMEAL 313
>gi|443699452|gb|ELT98942.1| hypothetical protein CAPTEDRAFT_229193 [Capitella teleta]
Length = 459
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 69/162 (42%), Positives = 101/162 (62%), Gaps = 11/162 (6%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFV 128
++F Q+LDHF+ +D T+SQR+ IN +H+ GP+FL G EG + EW V +V
Sbjct: 48 KWFTQKLDHFTSSDHRTWSQRFFINDEHY---KPGGPVFLMIGGEGAANPEWM-VQGQWV 103
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
+ AP+F A+ V EHR+YG+S P +V +L YL++EQALAD A F N+ ++
Sbjct: 104 QNYAPQFNALCVMLEHRFYGKSHPTKDLKVE-----SLRYLSSEQALADLAAFRVNISES 158
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
+ + FGGSY G L+AW R KYPH+ +++SSAP+L
Sbjct: 159 RGLADAKWIAFGGSYPGALSAWFRYKYPHLVYASVSSSAPML 200
>gi|170042651|ref|XP_001849031.1| thymus-specific serine protease [Culex quinquefasciatus]
gi|167866158|gb|EDS29541.1| thymus-specific serine protease [Culex quinquefasciatus]
Length = 499
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 145/290 (50%), Gaps = 18/290 (6%)
Query: 53 EPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYC 112
+ P R ++ +T +F+Q LDH + T+ QRY +N D + GP+FL
Sbjct: 33 DGPSRNALLREVSGDVKTLWFDQLLDHNDPTNAATWKQRYYVN-DAYFDDRTSGPVFLMI 91
Query: 113 GNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAE 172
G EG+ +N G A GA+ EHR+YG+S P G A L YLT+E
Sbjct: 92 GGEGEATARWMNEGAWIRYAKEHGALCFQLEHRFYGKSHPTGDLSTA-----NLGYLTSE 146
Query: 173 QALADFAVFITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ 231
QALAD A F+ + + + + + FGGSY G LAAW+R KYP++ G+++SS P+L
Sbjct: 147 QALADLAYFVEAMNEKYQLTQQNRWIAFGGSYPGSLAAWLREKYPYLVHGSVSSSGPLLA 206
Query: 232 FEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC---- 287
D + ++ +V + + S C ++ + G++ ++ + G + + F LC
Sbjct: 207 KIDF---KEYFQVVVNSLQSYSPQCVEDVRNAMGQVETLLKHMIGQRSINEKFKLCDPVE 263
Query: 288 RELNSTEDLADWLES-AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
+ ++++ D+A E+ A ++ +V Y + P I ++C + N
Sbjct: 264 KSISNSLDIASLFEAVAGNFAGVVQYNKDNS---PHAKITIDQICDVMAN 310
>gi|195126042|ref|XP_002007483.1| GI12975 [Drosophila mojavensis]
gi|193919092|gb|EDW17959.1| GI12975 [Drosophila mojavensis]
Length = 507
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 101/330 (30%), Positives = 158/330 (47%), Gaps = 28/330 (8%)
Query: 12 LYLSPVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETR 71
L L V+ + +I+ S L FR G L + P Q+ E
Sbjct: 7 LCLPLVVLVCLIAQSSSLG-------FRNGRHMNGFLGQPSSMPTLQR-----SLDVEDL 54
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+FEQRL+HF D T+ QRY +N D + + P+FL G EG+ + G
Sbjct: 55 WFEQRLNHFKPDDTRTWQQRYFVN-DAFYRNDSQAPVFLMIGGEGEATKNWMREGAWIHY 113
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
A FGA+ + EHR+YG+S P +++++ N L+YL++EQALAD F++ +K+ +
Sbjct: 114 AEHFGALCIQLEHRFYGKSHP--TSDLSNSN---LAYLSSEQALADLGNFVSAMKRQYNM 168
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
A++ + FGGSY G LAAW R KYPH+ GA++SS P+L D ++ +V +
Sbjct: 169 ADSQKWIAFGGSYPGSLAAWAREKYPHLIDGAISSSGPLLAQVDFT---QYFEVVKASLA 225
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL-NSTE---DLADWLESAYSY 306
C + + ++ + + G L + F C L +S E D+A+ E+ S
Sbjct: 226 SYKPECVEAVSRGFAQVEILLKHMIGQRNLDEKFKTCTPLKDSIENKLDIANLFENIASN 285
Query: 307 LAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
A V Y D P I ++C + N
Sbjct: 286 FAGV-VQYNKD-NSPHAKITIDQICDVMLN 313
>gi|80476792|gb|AAI08760.1| MGC85068 protein [Xenopus laevis]
Length = 457
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 91/252 (36%), Positives = 133/252 (52%), Gaps = 27/252 (10%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ Q LDHF+ + T++QRY IN +W P+ GP+FLY G EG + F+V SG ++
Sbjct: 15 FIVQPLDHFNRRNNGTYNQRYWINEQYWNYPD--GPVFLYIGGEGSLSEFSVLSGEHVEL 72
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF--ITNLKQNL 189
A A+LV EHR+YG S+ + + +L+++QALAD A F + K NL
Sbjct: 73 AQTHRALLVSLEHRFYGSSINIDGLTLE-----NIKFLSSQQALADLASFHMFISQKYNL 127
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV---- 245
+ + + + FGGSY G L+AW RLK+PH+ A+ASSAP+ D T YN V
Sbjct: 128 TRQ-NTWICFGGSYPGSLSAWFRLKFPHLVYAAVASSAPVRAELDF----TGYNKVVAWS 182
Query: 246 -SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED-------LA 297
+ S C + +KE + + S+ QK N + +L K F+ C L ++D LA
Sbjct: 183 LADPVIGGSEKCLDAVKEGFHAVDSLIQKGN-VTQLEKDFYSCGSLQGSDDYTEFVGNLA 241
Query: 298 DWLESAYSYLAM 309
D A Y M
Sbjct: 242 DIFMGAVQYNGM 253
>gi|148236037|ref|NP_001089218.1| uncharacterized protein LOC734265 precursor [Xenopus laevis]
gi|57920938|gb|AAH89148.1| MGC85068 protein [Xenopus laevis]
Length = 506
Score = 128 bits (321), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 91/269 (33%), Positives = 145/269 (53%), Gaps = 27/269 (10%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ Q LDHF+ + T++QRY IN +W P+ GP+FLY G EG + F+V SG ++
Sbjct: 64 FIVQPLDHFNRRNNGTYNQRYWINEQYWNYPD--GPVFLYIGGEGSLSEFSVLSGEHVEL 121
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF--ITNLKQNL 189
A A+LV EHR+YG S+ + + +L+++QALAD A F + K NL
Sbjct: 122 AQTHRALLVSLEHRFYGSSINIDGLTLE-----NIKFLSSQQALADLASFHMFISQKYNL 176
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV---- 245
+ + + + FGGSY G L+AW RLK+PH+ A+ASSAP+ D T YN V
Sbjct: 177 TRQ-NTWICFGGSYPGSLSAWFRLKFPHLVYAAVASSAPVRAELDF----TGYNKVVAWS 231
Query: 246 -SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES-A 303
+ S C + +KE + + S+ QK N + +L K F+ C L ++D +++ + A
Sbjct: 232 LADPVIGGSEKCLDAVKEGFHAVDSLIQKGN-VTQLEKDFYSCGSLQGSDDYTEFVGNLA 290
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCK 332
++ V Y + M P+ ++ +C+
Sbjct: 291 DIFMGAVQY----NGMSPISN--VQNICQ 313
>gi|307196629|gb|EFN78126.1| Putative serine protease K12H4.7 [Harpegnathos saltator]
Length = 433
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 80/229 (34%), Positives = 125/229 (54%), Gaps = 16/229 (6%)
Query: 85 LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
+ + QRY +N+D++ GPIFL G EG+I+ + G + A GAM + EH
Sbjct: 1 MRVWKQRYFVNSDYY---KLNGPIFLMIGAEGEIKPKWLVEGLWIEYAKELGAMCFYVEH 57
Query: 145 RYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS-AEASPVVLFGGSY 203
RYYG+S P + +++ +N L YL++E ALAD A FI ++ + ++FGGSY
Sbjct: 58 RYYGKSHP--TVDLSVKN---LMYLSSELALADLAYFIESVNIGYKFPNDTKWIVFGGSY 112
Query: 204 GGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKES 263
GG LAAWMRLKYPH GA+++S P+L D + +Y +V K+ S C +T+ ++
Sbjct: 113 GGSLAAWMRLKYPHFVHGAVSASGPLLALIDF---QEYYVVVEDALKQHSQQCVDTVADA 169
Query: 264 WGELVSVGQKENGLLELTKTFHLCRELN----STEDLADWLESAYSYLA 308
E + G ++ + F LC ++ T D+++ ES S A
Sbjct: 170 NKEFHIMLHHLIGQKQIEEKFRLCDPIDPGHTKTIDISNLYESLASNFA 218
>gi|125980321|ref|XP_001354185.1| GA22150 [Drosophila pseudoobscura pseudoobscura]
gi|195174498|ref|XP_002028010.1| GL15050 [Drosophila persimilis]
gi|54642489|gb|EAL31237.1| GA22150 [Drosophila pseudoobscura pseudoobscura]
gi|194115732|gb|EDW37775.1| GL15050 [Drosophila persimilis]
Length = 508
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 89/280 (31%), Positives = 138/280 (49%), Gaps = 18/280 (6%)
Query: 63 QQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
Q+ E +FEQRLDHF + T+ QRY +N D++ + PIFL G EG+
Sbjct: 51 QRSLDSEDLWFEQRLDHFQARNTRTWQQRYFVNADYYRN-DSTAPIFLMIGGEGEASAKW 109
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
+ G A F A+ + EHR+YG+S P A L++L++EQALAD A F+
Sbjct: 110 MREGAWVHYAEHFDALCIQLEHRFYGKSHPTRDLSTA-----NLAFLSSEQALADLANFV 164
Query: 183 TNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
+K + AE V FGGSY G LAAW R KYPH+ G++++S P+L D +
Sbjct: 165 AAMKVKYNLAETQKWVAFGGSYPGSLAAWAREKYPHLIYGSISTSGPLLAEVDF---REY 221
Query: 242 YNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL-NSTE---DLA 297
+ +V + + C + S+ ++ + + G L + F C + +S E D+A
Sbjct: 222 FEVVKASLATYNPDCVEAVTRSFTQVEILLKHMIGQRNLDEKFKTCTPIKDSIENQLDIA 281
Query: 298 DWLES-AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
E+ A ++ +V Y + P I E+C + N
Sbjct: 282 SLFENLAGNFAGVVQYNKDNS---PHATITIDEICDVMLN 318
>gi|328776766|ref|XP_395356.3| PREDICTED: putative serine protease K12H4.7-like [Apis mellifera]
Length = 494
Score = 127 bits (319), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 89/273 (32%), Positives = 144/273 (52%), Gaps = 19/273 (6%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
++F Q LDHF D + QRY IN +++ + GP+FL E + G +
Sbjct: 48 QWFTQFLDHFDPTDARVWQQRYFINGEYY---KKGGPVFLMISGESTATAKWMVKGQWIE 104
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI--TNLKQN 188
A +FGA+ EHR+YG+S P +++++ +N L YL+++QALAD A FI N+
Sbjct: 105 YAKQFGALCFQVEHRFYGKSHP--TSDLSVKN---LKYLSSQQALADLAYFIEIMNIDYK 159
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
LS + + + FGGSY G LAAW+R KYPH+ GA+++S P+L D + +Y IV +
Sbjct: 160 LSND-TKWIAFGGSYAGSLAAWLRSKYPHLLHGAVSASGPLLAEIDF---QEYYIIVENA 215
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS----TEDLADWLES-A 303
K+ S +C NTI E+ + + + G + K F LC ++S D+++ E+ A
Sbjct: 216 LKQYSEACVNTIVEANKQFHIMLRHPIGQQGIVKKFVLCDPIDSGYTKRNDISNLYETLA 275
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
++ +V Y + + I C + N
Sbjct: 276 SNFAGIVQYNKDNRNNSAMANLTIESACDILTN 308
>gi|431892225|gb|ELK02665.1| Thymus-specific serine protease [Pteropus alecto]
Length = 515
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 73/179 (40%), Positives = 103/179 (57%), Gaps = 8/179 (4%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HW +R GP+FL+ G EG + AV G +
Sbjct: 59 WLEQALDPFNASDRRSFLQRYWVNEQHWA--SRDGPVFLHLGGEGSLGPGAVMRGHPAAL 116
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
AP +GA+++ EHR+YG S+P G ++A+ L +L++ ALAD L + L+
Sbjct: 117 APAWGALVIGLEHRFYGLSIPAGGLDMAH-----LRFLSSRHALADVVSARLALSRLLNV 171
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
+ +SP V FGGSY G LAAW RLK+PH+ A+ASSAP+ D SS F
Sbjct: 172 SSSSPWVCFGGSYAGSLAAWARLKFPHLVFAAVASSAPVRAVLDFSAYNECRAAASSAF 230
>gi|17944482|gb|AAL48130.1| RH04336p [Drosophila melanogaster]
Length = 508
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 92/305 (30%), Positives = 151/305 (49%), Gaps = 23/305 (7%)
Query: 38 FRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTD 97
FRR G L ++ P Q+ E +FEQRLDHF +D T+ QRY +N D
Sbjct: 29 FRRGRLTKGFLGEPSKIPTLQRSLHS-----EDLWFEQRLDHFKSSDKRTWQQRYFVNAD 83
Query: 98 HWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE 157
+ + P+FL G EG+ + G A FGA+ + EHR+YG+S P + +
Sbjct: 84 FYRNDSS-APVFLMIGGEGEASAKWMREGAWVHYAEHFGALCLRLEHRFYGKSHP--TAD 140
Query: 158 VAYQNATTLSYLTAEQALADFAVFITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYP 216
++ +N L YL++EQAL D A F+T +K + + + FGGSY G LAAW KYP
Sbjct: 141 LSTEN---LHYLSSEQALEDLASFVTAMKVKFNLGDGQKWIAFGGSYPGSLAAWAHEKYP 197
Query: 217 HIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENG 276
+ G+++SS P+L D + ++ +V + C + + S+ ++ + + G
Sbjct: 198 ELIYGSISSSGPLLAEVDF---KEYFEVVKASLAAYKPECVDAVTRSFAQVEILLKHMIG 254
Query: 277 LLELTKTFHLCREL-NSTE---DLADWLES-AYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L + F C + +S E D+A++ E+ A ++ +V Y + P I ++C
Sbjct: 255 QRSLDEKFKTCTPIKDSIENDLDMANFFENLAGNFAGVVQYNKDNS---PHATITIDDIC 311
Query: 332 KKIDN 336
+ N
Sbjct: 312 DVMLN 316
>gi|167383690|ref|XP_001736631.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165900889|gb|EDR27114.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 466
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 133/252 (52%), Gaps = 19/252 (7%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD-IAP 133
Q +DHF + T + RY IN + ++ P+ + G EG + AV FV + A
Sbjct: 36 QPIDHFDLTNKKTINIRYFINDTIY---SKEAPLLVDLGGEGPQKAAAVGGRFVINKYAE 92
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA 193
++ ++++ EHR+YG+S+P G ++ +N L YL+A QAL D+ + I +K+
Sbjct: 93 KYNSLMLAIEHRFYGKSVPEGG--LSQEN---LGYLSAAQALEDYIMIINQIKKEYQV-T 146
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRES 253
PV++FGGSY G LAAW+R KYP++ A+ASSAP+ TFY + +
Sbjct: 147 GPVIVFGGSYSGNLAAWIRQKYPNVVYAAVASSAPVY------ATSTFYEFLDVIYNDMG 200
Query: 254 ASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYP 313
C N KE+ + + + ++G +L F+ C ++N +DL ++ + M++YP
Sbjct: 201 EKCGNAWKEATESIEELFKTDSGKAQLKSDFNACTDINGEDDLTILIQQIQA--TMINYP 258
Query: 314 -YPSDFMMPLPG 324
Y + + + G
Sbjct: 259 QYNGSYSLTIEG 270
>gi|388515463|gb|AFK45793.1| unknown [Lotus japonicus]
Length = 212
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 52/72 (72%), Positives = 62/72 (86%)
Query: 309 MVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDD 368
MV+YPYPS+F+M LPG+PI+EVC++ID P TSILERI+EGV+VYYNYTG CF+LDD
Sbjct: 1 MVNYPYPSEFLMTLPGHPIKEVCRRIDQGPAGTSILERIYEGVNVYYNYTGEAKCFELDD 60
Query: 369 DPHGLDGWNWQV 380
DPHGL GWNWQ
Sbjct: 61 DPHGLSGWNWQA 72
>gi|5031993|ref|NP_005856.1| thymus-specific serine protease [Homo sapiens]
gi|13633990|sp|Q9NQE7.2|TSSP_HUMAN RecName: Full=Thymus-specific serine protease; AltName: Full=Serine
protease 16; Flags: Precursor
gi|3510663|gb|AAC33563.1| thymus specific serine peptidase [Homo sapiens]
gi|119623493|gb|EAX03088.1| protease, serine, 16 (thymus), isoform CRA_a [Homo sapiens]
gi|162318714|gb|AAI56929.1| Protease, serine, 16 (thymus) [synthetic construct]
gi|162318946|gb|AAI56192.1| Protease, serine, 16 (thymus) [synthetic construct]
gi|307685491|dbj|BAJ20676.1| protease, serine, 16 [synthetic construct]
Length = 514
Score = 127 bits (319), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 89/268 (33%), Positives = 134/268 (50%), Gaps = 24/268 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HWVG + GPIFL+ G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSDRRSFLQRYWVNDQHWVGQD--GPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P G E+A L +L++ ALAD L + N+
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRLALADVVSARLALSRLFNI 173
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL------QFEDIVPPETFYN 243
S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ ++ D+V
Sbjct: 174 SS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDFSEYNDVVSRSLMST 232
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ + +A E L S G + L C L E+ A+ L +
Sbjct: 233 AIGGSLECRAAVSV-AFAEVERRLRSGGAAQAA---LRTELSACGPLGRAENQAELLGAL 288
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVC 331
+ + V Y PL +R++C
Sbjct: 289 QALVGGV-VQYDGQTGAPL---SVRQLC 312
>gi|426351972|ref|XP_004043496.1| PREDICTED: thymus-specific serine protease [Gorilla gorilla
gorilla]
Length = 514
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 89/268 (33%), Positives = 134/268 (50%), Gaps = 24/268 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HWVG + GPIFL+ G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSDRRSFLQRYWVNDQHWVGQD--GPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P G E+A L +L++ ALAD L + N+
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRLALADVVSARLALSRLFNI 173
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL------QFEDIVPPETFYN 243
S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ ++ D+V
Sbjct: 174 SS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDFSEYNDVVSRSLMST 232
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ + +A E L S G + L C L E+ A+ L +
Sbjct: 233 AIGGSLECRAAVSV-AFAEVERRLRSGGAAQAA---LRTELSACGPLGRAENQAELLGAL 288
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVC 331
+ + V Y PL +R++C
Sbjct: 289 QALVGGV-VQYDGQAGAPL---SVRQLC 312
>gi|320168054|gb|EFW44953.1| thymus-specific serine protease [Capsaspora owczarzaki ATCC 30864]
Length = 489
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 101/325 (31%), Positives = 162/325 (49%), Gaps = 34/325 (10%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
++F Q+LDHF+ D T+ Q+Y +N W GP PIF G EG I+ V +
Sbjct: 56 QWFTQKLDHFNTFDETTWLQKYYVNQTFWGGPGY--PIFFMIGGEGPIDDRYVTAMDYVI 113
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A + A++V EHR+YGES+P VA L +LT++QALAD A F N+ +
Sbjct: 114 YARTYKALMVTLEHRFYGESVPTADYSVA-----NLRFLTSQQALADAANFAANITLQFN 168
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
A S V FGGSY G L+AW RLKYP++ G++++S P+ + V + +V + +
Sbjct: 169 APTSSWVTFGGSYPGCLSAWARLKYPNLFQGSISTSGPVHAELNFV---QYLEVVQASLE 225
Query: 251 R-ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES-AYSYLA 308
+C + I ++ ++ ++ Q+ GL + K F +C LNS +D+A+++ + A + +
Sbjct: 226 YFGGTTCSSLITQATNKIQTLLQQPGGLSSVGKLFSVCVPLNSQDDVANFMSTLAGNVMG 285
Query: 309 MVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF---- 364
+V Y G I +VC + + D + + ++ + +GNV C
Sbjct: 286 VVQYNNEGR-----GGPTITDVCATMLSNSDPL----QAYVNLNQLFLASGNVTCLDVAY 336
Query: 365 -----QLDDDPHGLD----GWNWQV 380
QL D G D W WQ
Sbjct: 337 KPMIQQLQDTAPGADVGGRSWLWQT 361
>gi|328874647|gb|EGG23012.1| Putative serine protease [Dictyostelium fasciculatum]
Length = 490
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/289 (30%), Positives = 143/289 (49%), Gaps = 21/289 (7%)
Query: 23 ISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSF 82
+ I S LA + R V L H + +Q++ +F+Q++DHF
Sbjct: 17 VVICSSAFLAVESMPLRNEQSLVRGLRH-----RNVMIEQRRSPSPLALWFDQQVDHFDP 71
Query: 83 ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI-APRFGAMLVF 141
+ TF Q+Y IN +W GP+F G EG I VN FV + A F A++V
Sbjct: 72 LNQDTFKQQYFINDTYW---RPGGPVFFVLGGEGPISPGYVNGHFVVNTYAQLFDALIVA 128
Query: 142 PEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGG 201
EHR+YG S P+ + + + L LT EQALAD+A F + + +S + FGG
Sbjct: 129 CEHRFYGYSSPHPTLDTKH-----LHLLTTEQALADYANFRQFIAAKYNTGSSKWISFGG 183
Query: 202 SYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIK 261
SY G L+AW+RLKYP + GA+A+SAP+ D + +VS+ ++ +
Sbjct: 184 SYSGSLSAWLRLKYPQLIDGAIATSAPVEAQLDFT---QYLEVVSASIGPACSAIVKNVT 240
Query: 262 ESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMV 310
+ ++++ GQ ++ F+ C ++S D+A ++ES S ++ +
Sbjct: 241 QIVTQMIANGQTS----QVESLFNTCDPISSELDIATFMESLTSAVSEI 285
>gi|195062805|ref|XP_001996257.1| GH22293 [Drosophila grimshawi]
gi|193899752|gb|EDV98618.1| GH22293 [Drosophila grimshawi]
Length = 481
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 88/285 (30%), Positives = 137/285 (48%), Gaps = 22/285 (7%)
Query: 20 IVIISILSPLSLAAQ-PSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLD 78
+++I++L+PL+ +A+ S + P V L L P Q ++ ETR+ Q+LD
Sbjct: 5 LLVIALLAPLASSAKLSSDVSKIPASVRTLKELHRGPPMQLISKRA--NVETRWISQKLD 62
Query: 79 HFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAM 138
+F + + R LIN DH+V + PIF+Y G E +IE + +G DIA
Sbjct: 63 NFDEDNEEVWDDRVLINEDHFVDGS---PIFIYLGGEWEIEPSPITAGHWVDIASEHNGS 119
Query: 139 LVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVL 198
LV+ EHR++G+S+P A L Y EQALAD I LK+ + S VV+
Sbjct: 120 LVYTEHRFFGQSVPIKPLTTA-----NLKYQNVEQALADVVNVINVLKEEEKYKNSKVVV 174
Query: 199 FGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CF 257
G SY +A W++ YP + +G+ ASSAP+ D + + +V ++ C+
Sbjct: 175 QGCSYSATMAVWIKKLYPDVIVGSWASSAPLQAKVDF---KAYMKVVGQAYRELGGDYCY 231
Query: 258 NTIKES---WGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
N I + + +L GQ E K +LC N + W
Sbjct: 232 NIIDNATSFYEDLFENGQNA----EAKKLLNLCDNFNENDQHDQW 272
>gi|196014354|ref|XP_002117036.1| hypothetical protein TRIADDRAFT_63413 [Trichoplax adhaerens]
gi|190580258|gb|EDV20342.1| hypothetical protein TRIADDRAFT_63413 [Trichoplax adhaerens]
Length = 439
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 92/274 (33%), Positives = 142/274 (51%), Gaps = 26/274 (9%)
Query: 75 QRLDHFSFADLPTFSQRYLIN-TDHWVGPNRLGPIFLYCGNEGDIE--WFAVNSGFVWDI 131
Q+LDHF + +SQRY +N T H G GPIFL G EG W N W I
Sbjct: 2 QKLDHFDHTNTAVWSQRYFVNDTFHKKG----GPIFLMLGGEGPASPVW---NVAGAWQI 54
Query: 132 -APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL--KQN 188
A + A+ + EHRYYG+S P V+ + L YL++EQALAD A F +N
Sbjct: 55 YAKKLNAITIQIEHRYYGQSHP-----VSDASTPNLKYLSSEQALADAAYFREYFMTSKN 109
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
+SA+ +V FGGSY G L+AW+R KYPH+ ++A+SAPIL D E + +V+
Sbjct: 110 MSADTKWIV-FGGSYSGALSAWLRTKYPHLFHASVATSAPILAKVDF---EQYLQVVTKS 165
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNST-EDLADWLES-AYSY 306
+ +C I + + + + G +L++ F C+ L+ D++ +++S A ++
Sbjct: 166 LQTAGMACTKNIHNATTIIQGMIKTSAGRKKLSQMFKTCKPLSKDPNDISTFMQSLAGNF 225
Query: 307 LAMVDYPYP-SDFMMPLPGYPIREVCKKID-NAP 338
+V Y + F P + ++CK ++ N P
Sbjct: 226 EGIVQYNKDNTGFERHTPATTLTDLCKIMEKNKP 259
>gi|297677423|ref|XP_002816600.1| PREDICTED: thymus-specific serine protease isoform 1 [Pongo abelii]
Length = 514
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 88/268 (32%), Positives = 134/268 (50%), Gaps = 24/268 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ ++ +F QRY +N HWVG + GPIFL+ G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSNRRSFLQRYWVNDQHWVGQD--GPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P G E+A L +L++ ALAD L + N+
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRHALADVVSARLALSRLFNI 173
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL------QFEDIVPPETFYN 243
S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ ++ D+V
Sbjct: 174 SS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDFSEYNDVVSRSLMST 232
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ + +A E L S G + L C L E+ A+ L +
Sbjct: 233 AIGGSLECRAAVSV-AFAEVERRLRSGGAAQAA---LRTELSACGPLGRAENQAELLGAL 288
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVC 331
+ + V Y PL +R++C
Sbjct: 289 QALVGGV-VQYDGQAGAPL---SVRQLC 312
>gi|355762458|gb|EHH61967.1| Thymus-specific serine protease [Macaca fascicularis]
Length = 514
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/165 (43%), Positives = 101/165 (61%), Gaps = 10/165 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HWVG + GPIFL+ G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSDRRSFLQRYWVNEQHWVGED--GPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P G E+A L +L++ ALAD L + N+
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRHALADVVSARLALSRLFNI 173
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ D
Sbjct: 174 SS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD 217
>gi|281206134|gb|EFA80323.1| Putative serine protease [Polysphondylium pallidum PN500]
Length = 484
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 83/240 (34%), Positives = 133/240 (55%), Gaps = 26/240 (10%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
++++F+Q++DH+ + TF Q+Y +N +W GP+FL G EG +V FV
Sbjct: 52 QSQWFDQQVDHYDPLNTATFKQQYFVNDTYWTTG---GPVFLLLGGEGPASVTSVTGHFV 108
Query: 129 WDI-APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+ A +FGA++V EHR+YG+S P + Y L+ LT +QALADFA N +Q
Sbjct: 109 INTYAQQFGALIVSVEHRFYGKSSPSKTLATEY-----LNLLTTQQALADFA----NFRQ 159
Query: 188 NLSAE-----ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
++A+ + V FGGSY G L+AW+RLKYP + A+A+SAP+ D PE ++
Sbjct: 160 FIAAKYNVPSTTKWVSFGGSYSGSLSAWLRLKYPQLIDAAIATSAPVQPQLDF--PE-YF 216
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES 302
+V+ + +C I E + + Q + +E K F+ C + S++D+A + ES
Sbjct: 217 EVVA---RSVGPACSARIAEVTNLVTQMLQTDRKTVE--KLFNTCDPIVSSDDVATFFES 271
>gi|410910676|ref|XP_003968816.1| PREDICTED: thymus-specific serine protease-like [Takifugu rubripes]
Length = 509
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 107/355 (30%), Positives = 176/355 (49%), Gaps = 31/355 (8%)
Query: 14 LSPVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETR-Y 72
SPV + ++S+L +S+ A S R R G L + Q +Q +++ T+
Sbjct: 5 FSPVRAL-MVSLL--VSVDAGRSLRRIQERVRG--AQLQDAKQLLTNAGRQSFQHVTQGK 59
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
Q LDHF D TF QR+ +N GP+ GP+FLY G EG I F V +G D+A
Sbjct: 60 IHQPLDHFHPQDRRTFPQRFFVNEAFCRGPD--GPVFLYIGGEGPIFEFDVLAGHHVDMA 117
Query: 133 PRFGAMLVFPEHRYYGESM-PYG-STEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
GA+L+ EHR+YG+S+ P G TE L L+++QALAD F ++ Q+ +
Sbjct: 118 REHGALLLALEHRFYGDSINPDGLKTE-------NLENLSSKQALADLVAFHQHISQSFN 170
Query: 191 -AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP-PETFYNIVSSD 248
++ + + FGGSY G L+AW R ++PH+ GA+ASSAP+ D +T ++++
Sbjct: 171 LSQRNTWISFGGSYSGSLSAWFRGQFPHLVFGAVASSAPVKATLDFSAYSDTVGLSLANE 230
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL-ESAYSYL 307
SA C + +KE++ + + N + ++ F C+ + +D + + E A ++
Sbjct: 231 AVGGSAKCLDAVKEAFAAVEAALMMGN-VSQVASDFGCCQTPKNLDDQIELMQELAGIFM 289
Query: 308 AMVDYPYPSDFMMPLPGYPIREVCKKIDN----APDATSILERIFEGVSVYYNYT 358
V Y FM I ++C+ + + A R+ + VY + T
Sbjct: 290 GAVQYNEEGVFM------SISDLCEVMTSTNGTCQKAADAYNRLVKLAQVYRSIT 338
>gi|328868233|gb|EGG16611.1| hypothetical protein DFA_07589 [Dictyostelium fasciculatum]
Length = 479
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 84/283 (29%), Positives = 140/283 (49%), Gaps = 22/283 (7%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNR-LGPIFLYCGNEGDIEWFAVNSGF 127
+T +F+Q+ DHF + T+ Q+Y + D W P++ P+F++ E + +F
Sbjct: 53 QTFWFDQQQDHFDQTNNITWKQQYQV-IDDWFDPSQPNAPVFIFLAGEAPMGFFNFQEVQ 111
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+ A F A+ V EHR+YG+S P + +++ N L YLT++QALAD A F+T K
Sbjct: 112 IRAWAQEFKALYVILEHRFYGQSYP--TNDLSTHN---LKYLTSQQALADAANFLTTFKS 166
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ V+FG SY G L+AW RLKYP + +G++A S P+L + Y +
Sbjct: 167 ERGIADNQAVVFGCSYSGALSAWFRLKYPQLVVGSVAPSGPVL-------AQLNYTGYYA 219
Query: 248 DFKRES-ASCFNTIKESWGELVS-VGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
F + SC N +++ +++ + Q + G+ +L KTF+ C L + DL Y
Sbjct: 220 QFTNSAPTSCVNAAQQASDQVMQLIKQGDKGIKQLEKTFNSCSSLKNGRDL------YYF 273
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIF 348
++V+ +D M P + + C + D IF
Sbjct: 274 VYSIVEALGGADQMNNPPTWTLNSTCNTLSQNSDLLVNWAEIF 316
>gi|440796326|gb|ELR17435.1| Serine protease precursor, putative [Acanthamoeba castellanii str.
Neff]
Length = 509
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 84/242 (34%), Positives = 130/242 (53%), Gaps = 12/242 (4%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVW- 129
++F Q LDHF + + QRY N D + P GP+FL G EG V F+
Sbjct: 60 QWFTQALDHFDPRNSAKWQQRYFTN-DTFYRPG--GPVFLMLGGEGPASPIDVGGHFILN 116
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
+ A RF A+++ EHR+YG+S+P + +++ N L +L +EQALADFA+F + + L
Sbjct: 117 EYAQRFNALVLSIEHRFYGKSVP--TRDLSNAN---LRFLNSEQALADFAMFRQYISEKL 171
Query: 190 SA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
+ + + V FGGSY G L+AW RLKYPH+ G+LA+SAP+ D
Sbjct: 172 ALPKTTKWVAFGGSYSGALSAWFRLKYPHLVDGSLATSAPVKAQLDFSEYNEVVQRSLEF 231
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLA 308
F E SC N ++E ++ +G +L K F+LC + + +D+A + E+ +A
Sbjct: 232 FVGE--SCANRVREGTQAATNLLSSADGKKQLQKLFNLCTPIETDDDIALFFENMEGTVA 289
Query: 309 MV 310
+
Sbjct: 290 QI 291
>gi|355561421|gb|EHH18053.1| Thymus-specific serine protease [Macaca mulatta]
Length = 514
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/165 (43%), Positives = 101/165 (61%), Gaps = 10/165 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HWVG + GPIFL+ G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSDRRSFLQRYWVNEQHWVGED--GPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P G E+A L +L++ ALAD L + N+
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRHALADVVSARLALSRLFNV 173
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ D
Sbjct: 174 SS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD 217
>gi|344268143|ref|XP_003405922.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 465
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 87/270 (32%), Positives = 137/270 (50%), Gaps = 17/270 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVW-D 130
+F+Q+LDHFS P + QRY IN + P GP+FL G I V++ + W
Sbjct: 25 WFQQKLDHFSENGSPFWEQRYFINNTFY-KPG--GPVFLMIGGWMTIGTNWVSTDYTWIT 81
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A R GA + EHR+YG+S P G A +L YL ++Q LAD A F T + + +
Sbjct: 82 YAERLGAFCLALEHRFYGQSQPTGDLSTA-----SLRYLRSKQVLADIAYFRTEIAKKMG 136
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ V+FGGSYGG LA W R+KYP++ A++SSAP+ + V ++ V S
Sbjct: 137 LIKNKWVVFGGSYGGSLAVWSRIKYPNLFAAAVSSSAPV---KVKVNFYEYFEGVHSALA 193
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL--NSTEDLADWLESAYSYLA 308
++ C +K++ G+++ + + L F LC L NS D ++E+ +LA
Sbjct: 194 THNSECSKAVKDALGQVIKLLKFPKYYRRLKNDFMLCEHLKVNSVMDSTYFVENLLIFLA 253
Query: 309 -MVDYPYPSDFMMPLPGYPIREVCKKIDNA 337
++ ++ + P I + C K+ N
Sbjct: 254 SIIQRNKDNETIKSTPN--IDDFCDKMTNT 281
>gi|148700668|gb|EDL32615.1| protease, serine, 16 (thymus), isoform CRA_c [Mus musculus]
Length = 493
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 90/268 (33%), Positives = 132/268 (49%), Gaps = 24/268 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D TF QRY +N H G + P+FL+ G EG + +V +G +
Sbjct: 44 WLEQPLDPFNASDRRTFLQRYWVNDQHRTGQDV--PVFLHIGGEGSLGPGSVMAGHPAAL 101
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
AP +GA+++ EHR+YG SMP G ++A L YL++ ALAD A L L+
Sbjct: 102 APAWGALVISLEHRFYGLSMPAGGLDLAL-----LRYLSSRHALADVASARQALSGLLNV 156
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ +SP + FGGSY G LA W RLK+PH+ A+ASSAP+ D + +V+
Sbjct: 157 SSSSPWICFGGSYAGSLATWARLKFPHLVFAAVASSAPLSAVVDF---SAYNQVVARSLT 213
Query: 251 R----ESASCFNTIKESWGE---LVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ S C ++ E L+ G +L + C L+ TED A+ L A
Sbjct: 214 QVAIGGSLECLAAASTAFTEVERLLRAGPAAQAVLR--EELGACGSLDLTEDQAELL-GA 270
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L Y PL +R++C
Sbjct: 271 LQALVGGTVQYDGQAGAPL---SVRQLC 295
>gi|340715980|ref|XP_003396483.1| PREDICTED: putative serine protease K12H4.7-like [Bombus
terrestris]
Length = 493
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 140/278 (50%), Gaps = 16/278 (5%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
+F Q LDHF+ D + QRY +N +++ + GP+FL E + G +
Sbjct: 48 EWFTQFLDHFNPTDARVWQQRYFVNGEYY---KKGGPVFLMISGEAAANAKWMVEGQWIE 104
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A +FGA+ EHR+YG+S P ++++ +N L YL+++QALAD A FI ++ N
Sbjct: 105 YAKQFGALCFQVEHRFYGQSHP--TSDLGVKN---LMYLSSQQALADLAYFIQSMNINYK 159
Query: 191 AEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
A + + FGGSY G LAAW+R KYPH+ GA+++S P+L D + ++ +V +
Sbjct: 160 LPAGTKWIAFGGSYAGSLAAWLRYKYPHLVHGAVSASGPLLAEIDF---QEYFVVVENAL 216
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN---STEDLADWLESAYSY 306
K S +C N I E+ + + G + K F LC +N D+++ E+ S
Sbjct: 217 KEYSEACVNAILEANKQFHIMLHHPIGQQGIAKKFILCDPINEHTKRNDISNLYETIASI 276
Query: 307 LA-MVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSI 343
A +V Y + + + I C + N +I
Sbjct: 277 FAGIVQYNKDNRNNSAMANFTIDSACDILTNETLGIAI 314
>gi|9506995|ref|NP_062302.1| thymus-specific serine protease precursor [Mus musculus]
gi|13633993|sp|Q9QXE5.1|TSSP_MOUSE RecName: Full=Thymus-specific serine protease; AltName: Full=Serine
protease 16; Flags: Precursor
gi|6706784|emb|CAB66137.1| thymus-specific serine peptidase [Mus musculus]
gi|26352940|dbj|BAC40100.1| unnamed protein product [Mus musculus]
gi|116138603|gb|AAI25482.1| Protease, serine, 16 (thymus) [Mus musculus]
gi|124297571|gb|AAI31956.1| Protease, serine, 16 (thymus) [Mus musculus]
gi|148700669|gb|EDL32616.1| protease, serine, 16 (thymus), isoform CRA_d [Mus musculus]
Length = 509
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 90/268 (33%), Positives = 132/268 (49%), Gaps = 24/268 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D TF QRY +N H G + P+FL+ G EG + +V +G +
Sbjct: 60 WLEQPLDPFNASDRRTFLQRYWVNDQHRTGQDV--PVFLHIGGEGSLGPGSVMAGHPAAL 117
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
AP +GA+++ EHR+YG SMP G ++A L YL++ ALAD A L L+
Sbjct: 118 APAWGALVISLEHRFYGLSMPAGGLDLAL-----LRYLSSRHALADVASARQALSGLLNV 172
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ +SP + FGGSY G LA W RLK+PH+ A+ASSAP+ D + +V+
Sbjct: 173 SSSSPWICFGGSYAGSLATWARLKFPHLVFAAVASSAPLSAVVDF---SAYNQVVARSLT 229
Query: 251 R----ESASCFNTIKESWGE---LVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ S C ++ E L+ G +L + C L+ TED A+ L A
Sbjct: 230 QVAIGGSLECLAAASTAFTEVERLLRAGPAAQAVLR--EELGACGSLDLTEDQAELL-GA 286
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L Y PL +R++C
Sbjct: 287 LQALVGGTVQYDGQAGAPL---SVRQLC 311
>gi|312380581|gb|EFR26537.1| hypothetical protein AND_07341 [Anopheles darlingi]
Length = 519
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 95/282 (33%), Positives = 140/282 (49%), Gaps = 18/282 (6%)
Query: 63 QQQYRYET--RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW 120
QQ + ET ++FEQ LDH + T+ QRY +N + N P+FL G EG+
Sbjct: 59 QQLHIEETPDQWFEQILDHNDPTNEATWQQRYYVNDQFFDSSNPHSPVFLMIGGEGEATA 118
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
++ G A GA+ EHR+YG+S P +T+++ +N L+YLT+EQALAD A
Sbjct: 119 RWMHEGAWIHYAETHGALCFQLEHRFYGKSHP--TTDLSTKN---LAYLTSEQALADLAY 173
Query: 181 FI--TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP 238
FI N K L + + + FGGSY G LAAW+R KYP + G+++SS P+L D +
Sbjct: 174 FIEAMNEKYQLQPQTNLWIAFGGSYPGSLAAWLREKYPSLVHGSISSSGPLLAKIDFIE- 232
Query: 239 ETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC----RELNSTE 294
+Y+ V S C ++ + + ++ + G L F LC R +++
Sbjct: 233 --YYDTVVRSLASYSPGCVEAVRSAMKQAETLLKHMIGQRTLNDKFKLCDPIERSIDNPL 290
Query: 295 DLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
D+A E S A V Y D P I EVC + N
Sbjct: 291 DVASLFEGLASNFAGV-VQYNKD-NSPHATITIDEVCDVMMN 330
>gi|380014101|ref|XP_003691081.1| PREDICTED: putative serine protease K12H4.7-like [Apis florea]
Length = 494
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 87/273 (31%), Positives = 144/273 (52%), Gaps = 19/273 (6%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
++F Q LDHF + + QRY IN +++ + GP+FL EG + G +
Sbjct: 48 QWFTQFLDHFDPTEARVWQQRYFINGEYY---KKGGPVFLMISGEGTATAKWMVKGQWIE 104
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI--TNLKQN 188
A +FGA+ EHR+YG+S P +++++ +N L YL+++QALAD A FI N+
Sbjct: 105 YAKQFGALCFQVEHRFYGKSHP--TSDLSVKN---LKYLSSQQALADLAYFIEIMNIDYK 159
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
L + + + FGGSY G LAAW+R KYPH+ GA+++S P+L D + +Y +V +
Sbjct: 160 LPND-TKWIAFGGSYAGSLAAWLRSKYPHLLYGAVSASGPLLAEIDF---QEYYVVVENA 215
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE----DLADWLES-A 303
K+ S +C NTI E+ + + + G + K F LC ++S D+++ E+ A
Sbjct: 216 LKQYSETCVNTIIEANKQFHIMLRHPIGQQGIAKKFVLCDPIDSGHTKRSDISNLYETIA 275
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
++ +V Y + + I C + N
Sbjct: 276 SNFAGIVQYNKDNRNNSAMANLTIESACDILTN 308
>gi|407038295|gb|EKE39045.1| serine carboxypeptidase (S28) family protein [Entamoeba nuttalli
P19]
Length = 466
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 79/256 (30%), Positives = 130/256 (50%), Gaps = 19/256 (7%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
R Q +DHF + T + RY IN + ++ P+ + G EG AV FV +
Sbjct: 32 RVSTQPIDHFDLTNKKTINIRYFINDTIY---SKEAPLLVDLGGEGTQRAAAVGGRFVIN 88
Query: 131 -IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
A ++ ++++ EHR+YG+S+P G ++ +N L YL+A QAL D+ + I +K+
Sbjct: 89 KYAEKYNSLMLAIEHRFYGKSVPEGG--LSQEN---LGYLSAAQALEDYVMIINQIKKEY 143
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
PV++FGGSY G LA W+R KYP++ A+ASSAP+ TFY + +
Sbjct: 144 QI-TGPVIVFGGSYSGNLATWIRQKYPNVVYAAVASSAPVY------ATSTFYEFLDVIY 196
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAM 309
C N KE+ + + + ++G +L F C E+ +DL ++ + M
Sbjct: 197 NDMGEKCGNAWKEATESIEELFKTDSGKAQLKNDFKTCTEIKEEDDLTILIQQIQA--TM 254
Query: 310 VDYP-YPSDFMMPLPG 324
V+YP Y + + + G
Sbjct: 255 VNYPQYNGSYSLTIEG 270
>gi|432891041|ref|XP_004075519.1| PREDICTED: putative serine protease K12H4.7-like [Oryzias latipes]
Length = 489
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 89/259 (34%), Positives = 138/259 (53%), Gaps = 14/259 (5%)
Query: 57 RQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG 116
R +++ E ++F Q+LDHF+ AD + QRY +N + + P GP+FL G EG
Sbjct: 33 RSHAREEGSREAEEQWFTQKLDHFNGADSRAWKQRYFLN-EAFYKPG--GPVFLMIGGEG 89
Query: 117 DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALA 176
+ +G A + GA+ + EHRYYG+S P + +++ N L YL++ QALA
Sbjct: 90 PANPAWMKNGTWLIYAEKLGALCLMLEHRYYGKSHP--TLDLSTNN---LRYLSSRQALA 144
Query: 177 DFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
D A F T + + + V FGGSY G LAAW RLKYPH+ ++A+SAP+ +
Sbjct: 145 DLAHFRTVMGEAQGLTNNKWVAFGGSYPGSLAAWFRLKYPHLVHASVATSAPV--HATVN 202
Query: 237 PPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL--NSTE 294
PE + +V E+A C +K++ LV + +TK F+LC +L +
Sbjct: 203 FPE-YLEVVWRSLASENAKCPLLVKKASDSLVEKLKDPETFDNITKDFNLCSKLQIQTEM 261
Query: 295 DLADWLES-AYSYLAMVDY 312
D A +LE A +++ +V Y
Sbjct: 262 DSAYFLEMLAGNFMDVVQY 280
>gi|307196628|gb|EFN78125.1| Putative serine protease K12H4.7 [Harpegnathos saltator]
Length = 429
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 124/230 (53%), Gaps = 17/230 (7%)
Query: 85 LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG-DIEWFAVNSGFVWDIAPRFGAMLVFPE 143
+ + QRY +N+D++ PN GP+FL G E +W G D A GAM + E
Sbjct: 1 MRVWKQRYFVNSDYY-KPN--GPVFLMIGTEKIKPKWMV--EGLWIDYAKELGAMCFYVE 55
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS-AEASPVVLFGGS 202
HRYYG+S P + +++ N L++L++E AL DFA FI N+ + ++FGGS
Sbjct: 56 HRYYGKSHP--TVDLSTDN---LTFLSSEIALQDFAYFIRNINIEYKFPNDTKWIVFGGS 110
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKE 262
YGG LAAWMRLKYPH GA+++S P+L D + +Y +V K+ S C + +
Sbjct: 111 YGGSLAAWMRLKYPHFVHGAVSASGPLLALIDF---QEYYVVVEDALKQHSQQCVDAVAN 167
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELN--STEDLADWLESAYSYLAMV 310
+ E ++ G ++ + F LC ++ T D+++ +S + A +
Sbjct: 168 ANTEFHTMLHHLTGQEQIAEKFRLCDPIDPGHTADISNLYQSLANNFAYI 217
>gi|356559327|ref|XP_003547951.1| PREDICTED: probable serine protease EDA2-like [Glycine max]
Length = 490
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 81/198 (40%), Positives = 108/198 (54%), Gaps = 20/198 (10%)
Query: 44 FVGKLPHLTEP--PQRQQRQQQQQYRYET---RYFEQRLDHFSFADLPTFSQRYLINTDH 98
FV P L+ P R + Q Y T ++F Q LDHFS D F QRY D+
Sbjct: 20 FVSSFPALSYGVVPPRTLLNKLSQGSYLTTQEQWFNQTLDHFSPYDHHQFRQRYFEFLDY 79
Query: 99 WVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEV 158
+ P+ GPIFL G EG + + ++ +A +FGA +V EHRYYG+S P+ S E
Sbjct: 80 FRIPD--GPIFLVIGGEGPCN--GITNDYIGVLAKKFGAAMVTLEHRYYGKSSPFNSLET 135
Query: 159 AYQNATTLSYLTAEQALADFAVF------ITNLKQNLSAEASPVVLFGGSYGGMLAAWMR 212
L YL+++QAL D AVF N K N + +P +FGGSY G L+AW R
Sbjct: 136 E-----NLKYLSSKQALFDLAVFRQYYQDSLNAKLNRTKTENPWFVFGGSYAGALSAWFR 190
Query: 213 LKYPHIAIGALASSAPIL 230
LK+PH+ G+LASSA +L
Sbjct: 191 LKFPHLTCGSLASSAVVL 208
>gi|125778536|ref|XP_001360026.1| GA14959 [Drosophila pseudoobscura pseudoobscura]
gi|195158020|ref|XP_002019892.1| GL12647 [Drosophila persimilis]
gi|54639776|gb|EAL29178.1| GA14959 [Drosophila pseudoobscura pseudoobscura]
gi|194116483|gb|EDW38526.1| GL12647 [Drosophila persimilis]
Length = 479
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 92/288 (31%), Positives = 137/288 (47%), Gaps = 21/288 (7%)
Query: 14 LSPVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYF 73
L+ V+ + +++ L L P+K P V L +L P +Q ++ + ++
Sbjct: 6 LAVVLLALFVAVSQGLDL---PNK--DVPLLVKTLQNLHRGPPKQTVLKRANV--QEKWI 58
Query: 74 EQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAP 133
Q+LD+F + T+ RYL+N + PIF+Y G E +IE V++G +D+A
Sbjct: 59 TQKLDNFDDDNKETYEMRYLVNDEF---QEEGSPIFIYLGGEWEIEASMVSAGHWYDLAE 115
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA 193
+ +LV+ EHRYYGES+P + A L YL +QALAD A FI + K A
Sbjct: 116 QHKGVLVYTEHRYYGESVPTSTMSTA-----NLKYLHVKQALADVAEFIKSFKAEHPQLA 170
Query: 194 -SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF-KR 251
S VVL GGSY + W + YP + G ASSAPIL V + +V F +
Sbjct: 171 NSKVVLAGGSYSATMVVWFKRLYPDLVDGGWASSAPILA---KVAFTEYKEVVGQAFLQL 227
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
C++ I+ EL S+ + G E LC + DL W
Sbjct: 228 GGQKCYDRIQNGIAELESMFDNKRG-AEARAMLRLCNSFDDKNDLDIW 274
>gi|410958405|ref|XP_003985809.1| PREDICTED: thymus-specific serine protease [Felis catus]
Length = 416
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 70/165 (42%), Positives = 101/165 (61%), Gaps = 10/165 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HW +R GP+FL+ G EG + +V G +
Sbjct: 60 WLEQPLDPFNTSDQRSFLQRYWVNDQHWA--SRHGPVFLHLGGEGSLRPGSVTRGHPAAL 117
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P G +VA L +L++ ALAD A L + N+
Sbjct: 118 APAWGALVIGLEHRFYGLSIPAGGLDVA-----QLRFLSSRHALADVASARLALGRLFNV 172
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ D
Sbjct: 173 SS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD 216
>gi|148700667|gb|EDL32614.1| protease, serine, 16 (thymus), isoform CRA_b [Mus musculus]
Length = 475
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 90/268 (33%), Positives = 133/268 (49%), Gaps = 24/268 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D TF QRY +N H G + P+FL+ G EG + +V +G +
Sbjct: 60 WLEQPLDPFNASDRRTFLQRYWVNDQHRTGQDV--PVFLHIGGEGSLGPGSVMAGHPAAL 117
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
AP +GA+++ EHR+YG SMP G ++A L YL++ ALAD A L L+
Sbjct: 118 APAWGALVISLEHRFYGLSMPAGGLDLAL-----LRYLSSRHALADVASARQALSGLLNV 172
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ +SP + FGGSY G LA W RLK+PH+ A+ASSAP+ +V + +V+
Sbjct: 173 SSSSPWICFGGSYAGSLATWARLKFPHLVFAAVASSAPL---SAVVDFSAYNQVVARSLT 229
Query: 251 R----ESASCFNTIKESWGE---LVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ S C ++ E L+ G +L + C L+ TED A+ L A
Sbjct: 230 QVAIGGSLECLAAASTAFTEVERLLRAGPAAQAVLR--EELGACGSLDLTEDQAELL-GA 286
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L Y PL +R++C
Sbjct: 287 LQALVGGTVQYDGQAGAPL---SVRQLC 311
>gi|449706553|gb|EMD46378.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
KU27]
Length = 466
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 129/252 (51%), Gaps = 19/252 (7%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD-IAP 133
Q +DHF + T + RY IN + ++ P+ + G EG AV FV + A
Sbjct: 36 QPIDHFDLTNKKTINIRYFINDTIY---SKEAPLLVDLGGEGTQRAAAVGGRFVINKYAE 92
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA 193
++ ++++ EHR+YG+S+P G ++ +N L YL+A QAL D+ + I +K+
Sbjct: 93 KYNSLMLAIEHRFYGKSVPEGG--LSQEN---LGYLSAAQALEDYIMIINQIKKEYQI-T 146
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRES 253
PV++FGGSY G LA W+R KYP++ A+ASSAP+ TFY + +
Sbjct: 147 GPVIVFGGSYSGNLATWIRQKYPNVVYAAVASSAPVY------ATSTFYEFLDVIYNDMG 200
Query: 254 ASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYP 313
C N KE+ + + + ++G +L F C E+ +DL ++ + MV+YP
Sbjct: 201 EKCGNAWKEATDSIEELFKTDSGKAQLKNDFKTCTEIKEEDDLTILIQQIQA--TMVNYP 258
Query: 314 -YPSDFMMPLPG 324
Y + + + G
Sbjct: 259 QYNGSYSLTIEG 270
>gi|67472594|ref|XP_652089.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
gi|56468897|gb|EAL46703.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
Length = 466
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 129/252 (51%), Gaps = 19/252 (7%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD-IAP 133
Q +DHF + T + RY IN + ++ P+ + G EG AV FV + A
Sbjct: 36 QPIDHFDLTNKKTINIRYFINDTIY---SKEAPLLVDLGGEGTQRAAAVGGRFVINKYAE 92
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA 193
++ ++++ EHR+YG+S+P G ++ +N L YL+A QAL D+ + I +K+
Sbjct: 93 KYNSLMLAIEHRFYGKSVPEGG--LSQEN---LGYLSAAQALEDYIMIINQIKKEYQI-T 146
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRES 253
PV++FGGSY G LA W+R KYP++ A+ASSAP+ TFY + +
Sbjct: 147 GPVIVFGGSYSGNLATWIRQKYPNVVYAAVASSAPVY------ATSTFYEFLDVIYNDMG 200
Query: 254 ASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYP 313
C N KE+ + + + ++G +L F C E+ +DL ++ + MV+YP
Sbjct: 201 EKCGNAWKEATDSIEELFKTDSGKAQLKNDFKTCTEIKEEDDLTILIQQIQA--TMVNYP 258
Query: 314 -YPSDFMMPLPG 324
Y + + + G
Sbjct: 259 QYNGSYSLTIEG 270
>gi|198455509|ref|XP_001360028.2| GA17653 [Drosophila pseudoobscura pseudoobscura]
gi|198133275|gb|EAL29180.2| GA17653 [Drosophila pseudoobscura pseudoobscura]
Length = 485
Score = 124 bits (312), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 145/300 (48%), Gaps = 30/300 (10%)
Query: 29 LSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTF 88
L L +P+ F ++ R + + P L + +R ETR+F +LD+F+ A+ T+
Sbjct: 23 LELQPEPNAFVQSLRELRRAPPLEKSRKRA--------NVETRWFTLKLDNFNAANNATW 74
Query: 89 SQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYG 148
R LIN DH+ + PIF+Y G E +IE A+ SG DIA L++ EHR++G
Sbjct: 75 KDRVLINEDHFTDGS---PIFIYLGGEWEIEPSAITSGLWVDIAKEHNGSLIYTEHRFFG 131
Query: 149 ESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLA 208
+S P T ++ +N L Y + +QALAD I LK + S V++ G SY +A
Sbjct: 132 KSFPI--TPLSTKN---LKYQSVQQALADVVHIIKTLKLEDKYKDSKVIVSGCSYSATMA 186
Query: 209 AWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKES---W 264
W+R YP I +G+ ASSAP+ E V + + +V F++ C++ I + +
Sbjct: 187 TWIRKLYPDIILGSWASSAPL---EAKVDFKDYMEVVGQAFEQLGGKYCYDLIDNATSYY 243
Query: 265 GELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL---ESAYSYLAMVDYPYPSDFMMP 321
+L GQ EL +LC N W A + + Y P ++ +P
Sbjct: 244 QDLFEGGQGAKAKKEL----NLCANFNVNSKQDRWQIFSTIANVFAGLAQYQKPGNYDLP 299
>gi|255645752|gb|ACU23369.1| unknown [Glycine max]
Length = 490
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 81/198 (40%), Positives = 108/198 (54%), Gaps = 20/198 (10%)
Query: 44 FVGKLPHLTEP--PQRQQRQQQQQYRYET---RYFEQRLDHFSFADLPTFSQRYLINTDH 98
FV P L+ P R + Q Y T ++F Q LDHFS D F QRY D+
Sbjct: 20 FVSSFPALSYGVVPPRTLLNKLSQGSYLTTQEQWFNQTLDHFSPYDHHQFRQRYFEFLDY 79
Query: 99 WVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEV 158
+ P+ GPIFL G EG + + ++ +A +FGA +V EHRYYG+S P+ S E
Sbjct: 80 FRIPD--GPIFLVIGGEGPCN--GITNDYIGVLAKKFGAAMVTLEHRYYGKSSPFNSLET 135
Query: 159 AYQNATTLSYLTAEQALADFAVF------ITNLKQNLSAEASPVVLFGGSYGGMLAAWMR 212
L YL+++QAL D AVF N K N + +P +FGGSY G L+AW R
Sbjct: 136 -----ENLKYLSSKQALFDLAVFRQYYQDSLNAKLNRTKIENPWFVFGGSYAGALSAWFR 190
Query: 213 LKYPHIAIGALASSAPIL 230
LK+PH+ G+LASSA +L
Sbjct: 191 LKFPHLTCGSLASSAVVL 208
>gi|363814290|ref|NP_001242784.1| uncharacterized protein LOC100805858 precursor [Glycine max]
gi|255635884|gb|ACU18289.1| unknown [Glycine max]
Length = 488
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 83/222 (37%), Positives = 120/222 (54%), Gaps = 32/222 (14%)
Query: 15 SPVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFE 74
+ +++++ +S PLS P PR + L L+E ++ +F+
Sbjct: 10 TALLSLLFVSSFPPLSYGVVP------PRTL--LNKLSEGKYLNTQE---------LWFD 52
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPR 134
Q LDHFS D F QRY D++ P+ GPIFL G EG + V + ++ +A +
Sbjct: 53 QTLDHFSPYDHRQFRQRYYEFLDYFRIPD--GPIFLVIGGEGILN--GVANDYLAVLAKK 108
Query: 135 FGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ITNLKQN 188
FGA +V EHRYYG+S P+ S E L YL+++QAL+D AVF N K N
Sbjct: 109 FGAAMVTLEHRYYGKSTPFNSLETE-----NLKYLSSKQALSDLAVFRQYYQDSINAKLN 163
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
+ +P +FGGSY G L+AW RLK+PH+ G+LASSA +L
Sbjct: 164 RAKIENPWFIFGGSYSGALSAWFRLKFPHLTCGSLASSAVVL 205
>gi|149029310|gb|EDL84577.1| protease, serine, 16 (thymus), isoform CRA_b [Rattus norvegicus]
Length = 496
Score = 124 bits (312), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 89/268 (33%), Positives = 130/268 (48%), Gaps = 24/268 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ D F QRY +N H G + P+FL+ G EG + +V +G +
Sbjct: 44 WLEQPLDPFNATDRRIFLQRYWVNDQHRTGQD--APVFLHIGGEGSLGPGSVMTGHPAAL 101
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
AP +GA+++ EHR+YG SMP G +VA L YL++ ALAD A L L+
Sbjct: 102 APAWGALVISLEHRFYGLSMPAGGLDVAL-----LHYLSSRHALADVASARQALSGLLNV 156
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ +SP + FGGSY G LA W RLK+PH+ A+ASSAP+ +V + +V+
Sbjct: 157 SSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPL---NAVVDFSAYNQVVARSLT 213
Query: 251 R----ESASCFNTIKESWGE---LVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ S C ++ E L+ G +L + C L+ ED A+ L A
Sbjct: 214 QVTIGGSLECLAAASTAFAEVERLLRAGLASQAVLR--EELGACGSLDLIEDQAELL-GA 270
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L Y PL +R++C
Sbjct: 271 LQALVGGTVQYDGQAGSPL---SVRQLC 295
>gi|76096362|ref|NP_001028882.1| thymus-specific serine protease precursor [Rattus norvegicus]
gi|75775148|gb|AAI04723.1| Protease, serine, 16 (thymus) [Rattus norvegicus]
gi|149029311|gb|EDL84578.1| protease, serine, 16 (thymus), isoform CRA_c [Rattus norvegicus]
Length = 512
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 89/268 (33%), Positives = 130/268 (48%), Gaps = 24/268 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ D F QRY +N H G + P+FL+ G EG + +V +G +
Sbjct: 60 WLEQPLDPFNATDRRIFLQRYWVNDQHRTGQD--APVFLHIGGEGSLGPGSVMTGHPAAL 117
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
AP +GA+++ EHR+YG SMP G +VA L YL++ ALAD A L L+
Sbjct: 118 APAWGALVISLEHRFYGLSMPAGGLDVAL-----LHYLSSRHALADVASARQALSGLLNV 172
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ +SP + FGGSY G LA W RLK+PH+ A+ASSAP+ +V + +V+
Sbjct: 173 SSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPL---NAVVDFSAYNQVVARSLT 229
Query: 251 R----ESASCFNTIKESWGE---LVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ S C ++ E L+ G +L + C L+ ED A+ L A
Sbjct: 230 QVTIGGSLECLAAASTAFAEVERLLRAGLASQAVLR--EELGACGSLDLIEDQAELL-GA 286
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L Y PL +R++C
Sbjct: 287 LQALVGGTVQYDGQAGSPL---SVRQLC 311
>gi|344292448|ref|XP_003417939.1| PREDICTED: thymus-specific serine protease-like [Loxodonta
africana]
Length = 574
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 75/226 (33%), Positives = 115/226 (50%), Gaps = 14/226 (6%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI--EWFAVNSGFVWD 130
F+Q+LDHFS + QRY N D + P GP+FL G I W + N+ +V
Sbjct: 42 FQQKLDHFSKNSSELWPQRYFFN-DVFYKPG--GPVFLLIGGSDTICESWISTNNTWV-S 97
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A R GA+L+ EHR+YG S P G+ A +L YL++ QALAD F + + +
Sbjct: 98 YAERLGALLILLEHRFYGHSQPTGNVSTA-----SLHYLSSRQALADIVNFRIKIAEKVG 152
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ V FG SYGG LA W R+K+P + A+ SSAP+ + + +V
Sbjct: 153 LTKNKWVAFGCSYGGSLAVWSRIKHPDLFAAAVGSSAPMQAKANFYE---YLEVVQRSLI 209
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
+ CF +KE++G+++ + + +L K F LC+ L ++
Sbjct: 210 THNRQCFQAVKEAFGQVMKMLRLRKYHRQLRKDFTLCKPLKHYSEM 255
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/130 (30%), Positives = 63/130 (48%), Gaps = 3/130 (2%)
Query: 162 NATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIG 221
+ +L YL++ QALAD F + + + + V FG SYGG LA W R+K+P +
Sbjct: 382 STASLRYLSSRQALADIVNFRIKIAEKMGLTKNKWVAFGCSYGGSLAVWSRIKHPDLFAA 441
Query: 222 ALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELT 281
A+ SSAPI + + +V + CF +KE++ E+ + + +L
Sbjct: 442 AVGSSAPIQAKANFYE---YLEVVQRSLTTHNNKCFQAVKEAFDEITKMLRLRKYYSKLE 498
Query: 282 KTFHLCRELN 291
F LC+ L
Sbjct: 499 YDFRLCKPLK 508
>gi|350406141|ref|XP_003487670.1| PREDICTED: putative serine protease K12H4.7-like [Bombus impatiens]
Length = 493
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 86/279 (30%), Positives = 138/279 (49%), Gaps = 16/279 (5%)
Query: 70 TRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVW 129
+F Q LDHF+ D + QRY +N +++ + GP+FL E + G
Sbjct: 47 AEWFTQFLDHFNPTDARVWQQRYFVNGEYY---KKGGPVFLMISGEAAANAKWMVEGQWI 103
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
+ A +FGA+ EHR+YG+S P ++++ +N L YL+++QALAD A FI + N
Sbjct: 104 EYAKQFGALCFQVEHRFYGQSHP--TSDLGVKN---LMYLSSQQALADLAYFIQLMNINY 158
Query: 190 SAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
A + + FGGSY G LAAW+R KYPH+ GA+++S P+L D + ++ +V +
Sbjct: 159 KLPAGTKWIAFGGSYAGSLAAWLRYKYPHLVHGAVSASGPLLAEIDF---QEYFVVVENA 215
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN---STEDLADWLESAYS 305
K S +C N I E+ + + G + K F LC +N D+++ E+ S
Sbjct: 216 LKEHSEACVNAILEANKQFHIMLHHPIGQQGIAKKFILCDPINEHTKRNDISNLYETIAS 275
Query: 306 YLA-MVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSI 343
A +V Y + + I C + N +I
Sbjct: 276 IFAGIVQYNKDNRNNSAMANLTIDSACDILTNETLGIAI 314
>gi|299473646|emb|CBN78040.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 559
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 119/232 (51%), Gaps = 22/232 (9%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG-----DIEWFAV 123
E + EQRLDHF + F QRY IN +W G + P+FL G EG ++ +V
Sbjct: 68 EELFVEQRLDHFDRQNSRKFLQRYFINKKYWAGASSGAPVFLCVGGEGPPLEANVLSESV 127
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
+ + ++AP A+++ EHRYYG+S P +L +L+++QALAD + F
Sbjct: 128 HCNDMLELAPEHNALVLAVEHRYYGKSNPGDDWAT-----DSLRWLSSQQALADLSSFHG 182
Query: 184 NL--KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL------QFEDI 235
L K+ L+ A V +GGSY GMLA W RLKYPH+ A++SS+P+ Q+ ++
Sbjct: 183 FLSDKEGLTG-AEKWVTWGGSYPGMLAGWARLKYPHLFHAAVSSSSPMKAQLDFPQYAEV 241
Query: 236 VPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC 287
+ + E AS S GEL+ E G LEL TF LC
Sbjct: 242 MRDSLASGVDGVGGSEECASAVEAGHASIGELLLT---EEGQLELVATFQLC 290
>gi|403308857|ref|XP_003944858.1| PREDICTED: thymus-specific serine protease isoform 1 [Saimiri
boliviensis boliviensis]
Length = 512
Score = 124 bits (311), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 80/236 (33%), Positives = 124/236 (52%), Gaps = 18/236 (7%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ L+ F+ +D +F QRY +N HW G + GPIFL+ G EG + +V G +
Sbjct: 59 WLEQLLNPFNVSDRRSFLQRYWVNDQHWTGQD--GPIFLHLGGEGSLGPGSVMKGHPAAL 116
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P G ++A L +L++ ALAD L + N+
Sbjct: 117 APAWGALVISLEHRFYGLSIPAGGLDMA-----QLRFLSSRHALADVVSARLALSRLFNV 171
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ D + ++VS
Sbjct: 172 SS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDF---SEYNDVVSRSL 227
Query: 250 KRE----SASCFNTIKESWGELVSVGQKENGLLELTKT-FHLCRELNSTEDLADWL 300
K S C + ++ E+ + + + C L+ ED A+ L
Sbjct: 228 KSAAIGGSLECQAAVSTAFSEVERRLRAGGAARAALQAELNACGSLSRAEDQAELL 283
>gi|195158022|ref|XP_002019893.1| GL12648 [Drosophila persimilis]
gi|194116484|gb|EDW38527.1| GL12648 [Drosophila persimilis]
Length = 485
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 144/300 (48%), Gaps = 30/300 (10%)
Query: 29 LSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTF 88
L L +P+ F ++ R + + P L +R ETR+F +LD+F+ A+ T+
Sbjct: 23 LELQPEPNAFVQSLRELRRAPPLETSRKRA--------NVETRWFTLKLDNFNAANNATW 74
Query: 89 SQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYG 148
R LIN DH+ + PIF+Y G E +IE A+ SG DIA L++ EHR++G
Sbjct: 75 KDRVLINEDHFTDGS---PIFIYLGGEWEIEPSAITSGLWVDIAKEHNGSLIYTEHRFFG 131
Query: 149 ESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLA 208
+S P T ++ +N L Y + +QALAD I LK + S V++ G SY +A
Sbjct: 132 KSFPI--TPLSTKN---LKYQSVQQALADVVHIIKTLKLEDKYKDSKVIVSGCSYSATMA 186
Query: 209 AWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKES---W 264
W+R YP I +G+ ASSAP+ E V + + +V F++ C++ I + +
Sbjct: 187 TWIRKLYPDIILGSWASSAPL---EAKVDFKDYMEVVGQAFEQLGGKYCYDLIDNATSYY 243
Query: 265 GELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL---ESAYSYLAMVDYPYPSDFMMP 321
+L GQ EL +LC N W A + + Y P ++ +P
Sbjct: 244 QDLFEGGQGAKAKKEL----NLCANFNVNSKQDRWQIFSTIANVFAGLAQYQKPGNYDLP 299
>gi|348537748|ref|XP_003456355.1| PREDICTED: putative serine protease K12H4.7-like [Oreochromis
niloticus]
Length = 510
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 86/259 (33%), Positives = 139/259 (53%), Gaps = 14/259 (5%)
Query: 57 RQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG 116
++ + +Q +E ++F Q+LDHF+ AD + Q Y IN + + P GP+FL G EG
Sbjct: 54 KEAQDTKQGSLFEEQWFTQKLDHFNGADTRVWKQMYFIN-EAFYRPG--GPVFLMIGGEG 110
Query: 117 DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALA 176
+ G A + GA+ + EHR+YG+S P + +++ N L +L++ QALA
Sbjct: 111 PANPAWMEHGTWLTYAEKLGALCLMLEHRFYGKSHP--TMDLSTDN---LRFLSSRQALA 165
Query: 177 DFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
D A F T + + V FGGSY G LAAW RLKYPH+ ++A+SAP+ + +
Sbjct: 166 DLAHFRTMIAKARGLTDRKWVAFGGSYPGSLAAWFRLKYPHLVHASVATSAPV--YATVN 223
Query: 237 PPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
PE + +V E+ C +K++ +LV + +TK F+LC +L D+
Sbjct: 224 FPE-YLEVVWRSLASENTECPLLVKKASDDLVERLKDPRTYDNITKDFNLCSKLQIQTDM 282
Query: 297 --ADWLES-AYSYLAMVDY 312
A +LE+ A +++ +V Y
Sbjct: 283 DSAYFLETLAGNFMDVVQY 301
>gi|149029309|gb|EDL84576.1| protease, serine, 16 (thymus), isoform CRA_a [Rattus norvegicus]
Length = 454
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 89/268 (33%), Positives = 130/268 (48%), Gaps = 24/268 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ D F QRY +N H G + P+FL+ G EG + +V +G +
Sbjct: 60 WLEQPLDPFNATDRRIFLQRYWVNDQHRTGQD--APVFLHIGGEGSLGPGSVMTGHPAAL 117
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
AP +GA+++ EHR+YG SMP G +VA L YL++ ALAD A L L+
Sbjct: 118 APAWGALVISLEHRFYGLSMPAGGLDVAL-----LHYLSSRHALADVASARQALSGLLNV 172
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ +SP + FGGSY G LA W RLK+PH+ A+ASSAP+ +V + +V+
Sbjct: 173 SSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPL---NAVVDFSAYNQVVARSLT 229
Query: 251 R----ESASCFNTIKESWGE---LVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ S C ++ E L+ G +L + C L+ ED A+ L A
Sbjct: 230 QVTIGGSLECLAAASTAFAEVERLLRAGLASQAVLR--EELGACGSLDLIEDQAELL-GA 286
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L Y PL +R++C
Sbjct: 287 LQALVGGTVQYDGQAGSPL---SVRQLC 311
>gi|330840912|ref|XP_003292451.1| hypothetical protein DICPUDRAFT_83068 [Dictyostelium purpureum]
gi|325077291|gb|EGC31013.1| hypothetical protein DICPUDRAFT_83068 [Dictyostelium purpureum]
Length = 457
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 104/324 (32%), Positives = 141/324 (43%), Gaps = 44/324 (13%)
Query: 44 FVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPN 103
+G + L P + Q Y +F Q LDHF + +FSQRY IN D +
Sbjct: 10 IIGLINGLAYPKNFNEEQVSNISPYTPFWFSQTLDHFDDENTESFSQRYFIN-DQFYDYT 68
Query: 104 RLGPIFLYCGNEGDIEWFAVNSG-FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQN 162
GPI LY EG + +G V AM+V EHRYYGES P+ Q+
Sbjct: 69 NGGPIILYINGEGPVSSAPCQTGDGVVLYGQALNAMIVTLEHRYYGESTPF-------QD 121
Query: 163 ATT--LSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAI 220
TT L YL++EQAL D A+F+ + LS A +V GGSY G L+AW R+KYPHI
Sbjct: 122 LTTENLKYLSSEQALNDLAIFVVWFQSQLS-NAGKIVTIGGSYSGALSAWFRIKYPHITS 180
Query: 221 GALASSA---PILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGL 277
G++ASS ILQF TF V+ + ++ + ++ E + G
Sbjct: 181 GSIASSGVVNAILQF------TTFDEYVAYAAGEDCSNALRLVTKAVEEQILAGGSAE-- 232
Query: 278 LELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNA 337
+ K L D WL A S + Y + S PL + ++N
Sbjct: 233 -QKVKQIFQAESLTDNGDFFYWL--ADSMAEGIQYGFHSQLCSPL--------IEAMNNN 281
Query: 338 PDATSILERIFEGVSVYYNYTGNV 361
D +S Y NYT NV
Sbjct: 282 GDM----------ISTYSNYTINV 295
>gi|344292458|ref|XP_003417944.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 522
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 83/250 (33%), Positives = 124/250 (49%), Gaps = 16/250 (6%)
Query: 56 QRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGN- 114
+R + Q + Y F+Q+LDHFS + QRY IN D + P GP+FL G
Sbjct: 23 RRTRGAHQTKVMYPNFSFQQKLDHFSENSSQFWPQRYFIN-DAFYKPG--GPVFLMVGGV 79
Query: 115 -EGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQ 173
W ++N +V A R GA+ + EHR+YG S P G A +L YL++ Q
Sbjct: 80 WTASESWLSINKTWV-TYAQRLGALFLLLEHRFYGYSQPTGDLSTA-----SLQYLSSRQ 133
Query: 174 ALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
ALAD A F T + + + + V FG S G LA W R+K+P + A+ SSAPI
Sbjct: 134 ALADIANFRTQIAEKMGLTENKWVAFGCSSAGSLAVWSRIKHPELFAAAVGSSAPIQAKA 193
Query: 234 DIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELN 291
+ + IV +++CF +KE++G++V + +L F LC+ +L
Sbjct: 194 NFYE---YLEIVQRSLATHNSNCFQAVKEAFGQIVKMLNLPRYYGKLENDFKLCKPMKLY 250
Query: 292 STEDLADWLE 301
S D A +L+
Sbjct: 251 SAMDKATFLD 260
>gi|327276291|ref|XP_003222903.1| PREDICTED: thymus-specific serine protease-like [Anolis
carolinensis]
Length = 511
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 82/250 (32%), Positives = 130/250 (52%), Gaps = 24/250 (9%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ Q LDH++ + TF+QRY +N W GP+FL+ G EG + +AV G +
Sbjct: 67 FIRQHLDHYNKKNQATFNQRYWVNAGFW---RHGGPVFLFIGGEGRLSEYAVLKGHHVTL 123
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A ++GA+L+ EHR+YG GS + L YL+++QAL+D F + +
Sbjct: 124 AEKYGALLLALEHRFYG-----GSLKPEMLEDDNLQYLSSQQALSDLVSFHQFISKKYKL 178
Query: 192 EASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ + FGGSY G LAAW RLK+PH+ GA+ASSAP+ D + ++ +V++
Sbjct: 179 TPNNTWICFGGSYPGSLAAWFRLKFPHLVFGAVASSAPVRAQLDF---KGYHKVVAASLS 235
Query: 251 R----ESASCFNTIKESWG---ELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
S C + + E++ ELV GQ L +L + F C L +D +++
Sbjct: 236 NPVISGSKQCLDAVTEAFSAVEELVRSGQ----LDKLDQDFRSCLPLEGLKDSLWLIKNL 291
Query: 304 YS-YLAMVDY 312
S ++A+V Y
Sbjct: 292 VSMFMAIVQY 301
>gi|354500289|ref|XP_003512233.1| PREDICTED: thymus-specific serine protease [Cricetulus griseus]
Length = 509
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 90/265 (33%), Positives = 130/265 (49%), Gaps = 18/265 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D TF QRY +N H G + P+FL+ G EG + +V +G +
Sbjct: 60 WLEQPLDPFNTSDRRTFLQRYWVNDRHRAGQD--APVFLHIGGEGSLGPGSVMAGHPVAL 117
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
AP +GA+++ EHR+YG SMP G ++A L YL++ ALAD A L + L+
Sbjct: 118 APAWGALVISLEHRFYGLSMPSGGLDMA-----QLRYLSSRHALADVASARQALSRLLNV 172
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP-PETFYNIVSSDF 249
+ +SP + FGGSY G LA W RLK+PH+ A+ASSAP+ D E +S
Sbjct: 173 SSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPLSAVLDFYAYNEVVARSLSQVA 232
Query: 250 KRESASCFNTIKESWGE---LVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSY 306
S C ++ E L+ G +L + C L+ TED + L A
Sbjct: 233 IGGSQECLAAASAAFAEVERLLRAGPAAQAVLR--EELSACGSLDLTEDQGELL-GALQA 289
Query: 307 LAMVDYPYPSDFMMPLPGYPIREVC 331
L Y PL +R++C
Sbjct: 290 LVGGTVQYDGQAGAPL---SVRQLC 311
>gi|324503010|gb|ADY41315.1| Serine protease [Ascaris suum]
Length = 1068
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 93/286 (32%), Positives = 145/286 (50%), Gaps = 23/286 (8%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVWDIAPR 134
+DHF ++ TF QRY +N + P GP FL G EG +++W + +A +
Sbjct: 602 VDHFDLTNMNTFDQRYWVNP-QYAQPG--GPHFLVIGGEGRANVKWVTEPNLITMSMARK 658
Query: 135 FGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEAS 194
F A + EHRYYG+S P Q+ L +LTA QALAD A FI + + +
Sbjct: 659 FNATVYMLEHRYYGDSFP-----TPDQSTENLRWLTATQALADLAQFIMTMNERYNLVNP 713
Query: 195 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESA 254
V FGGSY GML+AW R YP +++GA+ASSAPI E V + +V + + +A
Sbjct: 714 KWVTFGGSYPGMLSAWFRQFYPQLSVGAVASSAPI---EAKVDFYEYLIVVENALRVFNA 770
Query: 255 SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVD--- 311
+C +K ++ ++ + G + L+ F L E N T ++ + L+ Y + + D
Sbjct: 771 TCAENVKLAFDQIHQLSLTRTGRVTLSNLFTLKPEWNLTTEVTN-LDIQYFFSILYDKFQ 829
Query: 312 --YPYPSDFMMPLP-GYPIREVCKKIDNAPDATSILERIFEGVSVY 354
Y +D G IREVC + N +A + +E + + V++Y
Sbjct: 830 GAVQYNNDNTGSYATGGGIREVCGYMLN--NAKTPMENVAD-VNIY 872
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 83/288 (28%), Positives = 139/288 (48%), Gaps = 41/288 (14%)
Query: 39 RRAPRFV-GKLPHLTEP-PQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINT 96
R P F+ G+ PH P P + + + Y +T QRLDHF+ +D T++QRY N
Sbjct: 27 RNLPYFIMGRPPHGLRPDPIKPEELNRAGYVIQTATLPQRLDHFNASDARTWAQRYHYNF 86
Query: 97 DHWVGPNRLGPIFLYCGNEGD-------------IEWFAVNSGFVWDIAPRFGAMLVFPE 143
+++ GPIFL G EG I+W ++ ++D+ E
Sbjct: 87 NYY---KSGGPIFLMLGGEGPETGSWCVDEKLPYIQWAMSHNAAIYDL-----------E 132
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSY 203
HR+YG+S P+ + Q+ L YL++ QA+ D A FI + + ++FGGSY
Sbjct: 133 HRFYGQSRPFPT-----QSIENLKYLSSRQAIEDAAYFIRYINEQQKYVNPKWIVFGGSY 187
Query: 204 GGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKES 263
G LAAW+R K+P + IGA+ SS P+ D + +V + + + C + +++
Sbjct: 188 SGALAAWLREKHPELVIGAVGSSGPVEAKLDFY---EYLEVVENALRSYAPECADAVQQG 244
Query: 264 WGELVSVGQKENGLLELTKTFHLCRELNST----EDLADWLESAYSYL 307
+ E+ + G L++ F L +LN T +D+ ++ + Y Y
Sbjct: 245 FTEMSKMIWTLEGRKNLSELFVLNPKLNETKLRYKDIQNFFATIYGYF 292
>gi|256574601|dbj|BAH98106.1| serine protease [Entamoeba invadens]
gi|440292588|gb|ELP85775.1| hypothetical protein EIN_281360 [Entamoeba invadens IP1]
Length = 479
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 86/289 (29%), Positives = 144/289 (49%), Gaps = 26/289 (8%)
Query: 21 VIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHF 80
+I+ L+ L ++++ ++ +R + K Q ++ ++ ET + LDHF
Sbjct: 1 MILFFLTLLLVSSKLTEHQRTLHSISK-----------QAKESKKSTVETFKYTVPLDHF 49
Query: 81 SFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG-DIEWFAVNSGFVWDIAPRFGAML 139
+ + F Y I++ + + PIF+ G EG + E N+ + ++A + ++
Sbjct: 50 NANNDEEFEIVYFIDSQYLDSASETSPIFILLGGEGPETEKVLQNNYVIDELAKKHKGLM 109
Query: 140 VFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLF 199
+ EHR+YG S P S E+ TL Y TAEQA+ D+ I +++ S PV+
Sbjct: 110 LSVEHRFYGTSTP--SLEL-----NTLKYCTAEQAMMDYVEVINYVQEMYSLVGHPVIAL 162
Query: 200 GGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNT 259
GGSY G LA W+R KYP+I G+ ASSAP+ E +V + +V S+ +A+
Sbjct: 163 GGSYSGNLATWIRQKYPNIIDGSWASSAPL---EAVVDFYEYLEVVQSNLPENTATLLTL 219
Query: 260 IKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLA 308
E W E+V E+G +L K FH C E +D+ + E+ + LA
Sbjct: 220 AFEKWDEMVVT---ESGRKQLGKIFHTCTEFGE-KDIQTFSENIGTALA 264
>gi|414870775|tpg|DAA49332.1| TPA: putative serine peptidase S28 family protein [Zea mays]
Length = 509
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 96/173 (55%), Gaps = 18/173 (10%)
Query: 67 RYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
RY TR + Q LDHF+ D F QRY D++ PN GPIFLY E +
Sbjct: 52 RYLTREEHWMNQTLDHFNPTDHRQFKQRYYEFLDYYRAPN--GPIFLYICGESTCN--GI 107
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF-- 181
+ ++ +A +FGA LV PEHRYYG+S P+ S L +L+++QAL D AVF
Sbjct: 108 GNNYLAVVAKKFGAALVSPEHRYYGKSSPFNSLTT-----ENLQFLSSKQALFDLAVFRQ 162
Query: 182 ----ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N K N S S +FGGSY G L+AW RLK+PH+ G+LASS +L
Sbjct: 163 YYQETLNAKYNRSGANSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL 215
>gi|414870774|tpg|DAA49331.1| TPA: putative serine peptidase S28 family protein [Zea mays]
Length = 357
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 96/173 (55%), Gaps = 18/173 (10%)
Query: 67 RYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
RY TR + Q LDHF+ D F QRY D++ PN GPIFLY E +
Sbjct: 52 RYLTREEHWMNQTLDHFNPTDHRQFKQRYYEFLDYYRAPN--GPIFLYICGESTCN--GI 107
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF-- 181
+ ++ +A +FGA LV PEHRYYG+S P+ S L +L+++QAL D AVF
Sbjct: 108 GNNYLAVVAKKFGAALVSPEHRYYGKSSPFNSLTT-----ENLQFLSSKQALFDLAVFRQ 162
Query: 182 ----ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N K N S S +FGGSY G L+AW RLK+PH+ G+LASS +L
Sbjct: 163 YYQETLNAKYNRSGANSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL 215
>gi|290562587|gb|ADD38689.1| serine protease K12H4.7 [Lepeophtheirus salmonis]
Length = 483
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 76/243 (31%), Positives = 128/243 (52%), Gaps = 19/243 (7%)
Query: 48 LPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGP 107
+ + PP+ + R T +++Q LDHF+ + + QRY +N +++ GP
Sbjct: 32 MKKVLSPPELKDTS-----RISTSFYDQTLDHFNTKNKKAWKQRYFVNEENFKDKEN-GP 85
Query: 108 IFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLS 167
+FL G EG ++ G ++ A + GA+++ EHR+YGES P + ++ +N L
Sbjct: 86 VFLKIGGEGTASIGSMKYGSWYEYAQKVGALMIQLEHRFYGESRP--TENLSTEN---LK 140
Query: 168 YLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
YLT++QA+ D FI ++K+ + + GGSY G L+ WMR YP + GAL+SSA
Sbjct: 141 YLTSQQAIEDIVEFIAHIKEKYDIPNNKWITLGGSYPGSLSLWMRSLYPELIAGALSSSA 200
Query: 228 PILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGE---LVSVGQKENGLLELTKTF 284
P+ E V E + IV++D + C + E E L++ G++ G ++ K +
Sbjct: 201 PV---EAKVDFEEYLGIVNNDMRIRDPDCPAAVIEGIKETEALINSGKE--GWQKVAKIY 255
Query: 285 HLC 287
LC
Sbjct: 256 KLC 258
>gi|226506854|ref|NP_001149727.1| prolyl carboxypeptidase like protein precursor [Zea mays]
gi|195629836|gb|ACG36559.1| prolyl carboxypeptidase like protein [Zea mays]
Length = 509
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 96/173 (55%), Gaps = 18/173 (10%)
Query: 67 RYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
RY TR + Q LDHF+ D F QRY D++ PN GPIFLY E +
Sbjct: 52 RYLTREEHWMNQTLDHFNPTDHRQFKQRYYEFLDYYRAPN--GPIFLYICGESTCN--GI 107
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF-- 181
+ ++ +A +FGA LV PEHRYYG+S P+ S L +L+++QAL D AVF
Sbjct: 108 GNNYLAVVAKKFGAALVSPEHRYYGKSSPFNSL-----TTENLQFLSSKQALFDLAVFRQ 162
Query: 182 ----ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N K N S S +FGGSY G L+AW RLK+PH+ G+LASS +L
Sbjct: 163 YYQETLNAKYNRSGANSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL 215
>gi|47224819|emb|CAG06389.1| unnamed protein product [Tetraodon nigroviridis]
Length = 493
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 91/259 (35%), Positives = 140/259 (54%), Gaps = 15/259 (5%)
Query: 57 RQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG 116
+ +R+ + + ++F QRLDHFS AD + QRY +N + + P GP+FL G EG
Sbjct: 38 KVRRETDLKTVSDEQWFIQRLDHFS-ADSREWKQRYFVN-EAFYKPG--GPVFLMIGGEG 93
Query: 117 DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALA 176
+ G A + GA+ EHR+YG+S P +++++ N L +L++ QALA
Sbjct: 94 AASPAWMQYGTWLTYAEKLGAICFMLEHRFYGKSHP--TSDLSTDN---LRFLSSRQALA 148
Query: 177 DFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
D A F T + + + V FGGSY G LAAW+RLKYPH+ A+A+SAPI +
Sbjct: 149 DLAHFRTVIAEARGLTNAKWVAFGGSYPGSLAAWLRLKYPHLVHAAVATSAPI--HATVN 206
Query: 237 PPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN-STE- 294
PE + +V E A+C +K + L + + +TK F+LC +L TE
Sbjct: 207 FPE-YLEVVWRSLASEDAACPVLVKNASDTLAELLKDPKTYDNITKDFNLCSKLQIQTEM 265
Query: 295 DLADWLES-AYSYLAMVDY 312
D A +LE+ A +++ +V Y
Sbjct: 266 DSAQFLETLAGNFMEVVQY 284
>gi|198477695|ref|XP_002136487.1| GA23306 [Drosophila pseudoobscura pseudoobscura]
gi|198145255|gb|EDY71959.1| GA23306 [Drosophila pseudoobscura pseudoobscura]
Length = 292
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 88/275 (32%), Positives = 136/275 (49%), Gaps = 27/275 (9%)
Query: 29 LSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTF 88
L L +P+ F ++ R + + P L + +R ETR+F +LD+F+ A+ T+
Sbjct: 23 LELQPEPNAFVQSLRELRRAPPLEKSRKRAN--------VETRWFTLKLDNFNAANNATW 74
Query: 89 SQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYG 148
R LIN DH+ + PIF+Y G E +IE A+ SG DIA L++ EHR++G
Sbjct: 75 KDRVLINEDHFTDGS---PIFIYLGGEWEIEPSAITSGLWVDIAKEHNGSLIYTEHRFFG 131
Query: 149 ESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLA 208
+S P T ++ +N L Y + +QALAD I LK + S V++ G SY +A
Sbjct: 132 KSFPI--TPLSTKN---LKYQSVQQALADVVHIIKTLKLEDKYKDSKVIVSGCSYSATMA 186
Query: 209 AWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKES---W 264
W+R YP I +G+ ASSAP+ E V + + +V F++ C++ I + +
Sbjct: 187 TWIRKLYPDIILGSWASSAPL---EAKVDFKDYMEVVGQAFEQLGGKYCYDLIDNATSYY 243
Query: 265 GELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
+L GQ EL +LC N W
Sbjct: 244 QDLFEGGQGAKAKKEL----NLCANFNVNSKQDRW 274
>gi|242039115|ref|XP_002466952.1| hypothetical protein SORBIDRAFT_01g017310 [Sorghum bicolor]
gi|241920806|gb|EER93950.1| hypothetical protein SORBIDRAFT_01g017310 [Sorghum bicolor]
Length = 510
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/173 (42%), Positives = 96/173 (55%), Gaps = 18/173 (10%)
Query: 67 RYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
RY TR + Q LDHF+ D F QRY D++ PN GPIFLY E +
Sbjct: 53 RYLTREEHWMNQTLDHFNPTDHRQFKQRYYEFLDYYRAPN--GPIFLYICGESSCS--GI 108
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF-- 181
+ ++ +A +FGA LV PEHRYYG+S P+ S L +L+++QAL D AVF
Sbjct: 109 GNNYLAVMAKKFGAALVSPEHRYYGKSSPFNSL-----TTENLQFLSSKQALFDLAVFRQ 163
Query: 182 ----ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N K N S S +FGGSY G L+AW RLK+PH+ G+LASS +L
Sbjct: 164 YYQETLNAKYNRSGADSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL 216
>gi|307171940|gb|EFN63566.1| Putative serine protease K12H4.7 [Camponotus floridanus]
Length = 415
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/192 (35%), Positives = 103/192 (53%), Gaps = 10/192 (5%)
Query: 108 IFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLS 167
I + G E +I + G +++I ++ A++ + EHRYYG+S P T + L
Sbjct: 16 ILIMIGGEWEISNGFLQGGLMYEIGSKYNALMYYTEHRYYGKSKPTKDT-----STENLQ 70
Query: 168 YLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
YL +QALAD A FI K+ + E S V++FGGSY G +AAW RLKYPH+ GALASSA
Sbjct: 71 YLNVDQALADLAYFIETKKKEKNLENSTVIVFGGSYAGNMAAWARLKYPHLIQGALASSA 130
Query: 228 PILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC 287
P+ D +Y +V+ + + C +K ++ + + E G +L F+LC
Sbjct: 131 PVYAKADFYE---YYEVVTKSLGKYNEKCVEDVKIAFDSIEELLTAEGGPDKLKLYFNLC 187
Query: 288 R--ELNSTEDLA 297
+ S+ DL
Sbjct: 188 NVPNVKSSSDLG 199
>gi|291410769|ref|XP_002721683.1| PREDICTED: protease, serine, 16 [Oryctolagus cuniculus]
Length = 505
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 67/159 (42%), Positives = 99/159 (62%), Gaps = 8/159 (5%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HW G + GP+FL+ G EG + +V +G +
Sbjct: 57 WLEQPLDPFNTSDRRSFLQRYWVNDQHWAGQD--GPVFLHLGGEGSLGPGSVMTGHPAAL 114
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
AP +GA+++ EHR+YG S+P G ++A L +L++ AL D A L + L+
Sbjct: 115 APAWGALVIGLEHRFYGLSLPAGGLDLA-----QLRFLSSRHALTDAASARLALSRLLNV 169
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+ +SP V FGGSY G LAAW RLK+PH+ ++ASSAP+
Sbjct: 170 SSSSPWVCFGGSYAGSLAAWARLKFPHLFFASVASSAPV 208
>gi|344292454|ref|XP_003417942.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 505
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 81/233 (34%), Positives = 119/233 (51%), Gaps = 16/233 (6%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWD 130
F Q+LDHFS + QRY IN D + P GP+FL G W +++ +V
Sbjct: 42 FPQKLDHFSKNSSQLWPQRYFIN-DAFYKPG--GPVFLMIGGAWIACESWVSISKTWV-T 97
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A R GA+ + EHR+YG S P G A +L YL++ QALAD A F T + + +
Sbjct: 98 YAERLGALFLLLEHRFYGHSQPTGDLSTA-----SLHYLSSRQALADIANFRTEIAKKMG 152
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ V +G SYGG LA W RLK+P + A+ SSAPI + + +V
Sbjct: 153 LTKNNWVAYGCSYGGSLAVWSRLKHPDLFAAAVGSSAPIKAKANFY---EYLEVVQRSLA 209
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN--STEDLADWLE 301
++ CF T+KE++ ++V + + +L + F LC+ L S D A +LE
Sbjct: 210 THNSKCFQTVKEAFDQVVKMLRFPKYYRKLERDFTLCKRLKLYSAMDKAYFLE 262
>gi|440900328|gb|ELR51489.1| Thymus-specific serine protease [Bos grunniens mutus]
Length = 516
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 87/270 (32%), Positives = 133/270 (49%), Gaps = 22/270 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HW + GP+FL+ G EG + +V G ++
Sbjct: 59 WLEQPLDPFNASDRRSFLQRYWVNDQHWTSQD--GPVFLHLGGEGSLGPGSVMRGHPANL 116
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P ++A L +L++ ALAD A L + N+
Sbjct: 117 APIWGALVISLEHRFYGLSIPAEGLDMA-----QLRFLSSRHALADAASARLTLSRLFNV 171
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
S+ SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ D + ++VS
Sbjct: 172 SS-TSPWICFGGSYAGSLAAWARLKFPHLFFASIASSAPVRAILDF---SKYNDVVSRSL 227
Query: 250 KRE----SASCFNTIKESWGELVSVGQKENGL-LELTKTFHLCRELNSTEDLADWLESAY 304
S C ++ E+ + G L+ C L ED A+ L A
Sbjct: 228 MNTAIGGSLECREAASAAFAEVERRLRASRGAWATLSVELGACGSLERAEDQAELL-GAL 286
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKI 334
L Y PL +R++C+ +
Sbjct: 287 QALVGGAVQYDGQAGAPL---SVRQLCRLL 313
>gi|357518131|ref|XP_003629354.1| Thymus-specific serine protease [Medicago truncatula]
gi|355523376|gb|AET03830.1| Thymus-specific serine protease [Medicago truncatula]
Length = 455
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/173 (43%), Positives = 97/173 (56%), Gaps = 19/173 (10%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F Q LDHFS D F QRY D++ P+ GPIFL G E + + ++ +
Sbjct: 17 WFNQTLDHFSPYDHRQFRQRYYEFLDYFRAPD--GPIFLVIGGEATCN--GIVNDYIGVL 72
Query: 132 APRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVF------IT 183
A +FGA +V EHRYYGES P+ STE L YL+++QAL D AVF
Sbjct: 73 AKKFGAAVVSLEHRYYGESTPFDTFSTE-------NLKYLSSKQALFDLAVFRQYYQDSL 125
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
N K N S +P FGGSY G L+AW RLK+PH+ G+LASSA +L +D
Sbjct: 126 NAKLNRSGVENPWFFFGGSYSGALSAWFRLKFPHLTCGSLASSAVVLAVQDFA 178
>gi|229594586|ref|XP_001032708.3| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|225566764|gb|EAR85045.3| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 475
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 72/194 (37%), Positives = 102/194 (52%), Gaps = 8/194 (4%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
YF+Q+LDH++ D T++QRY + DHW + LY EG+ NS F I
Sbjct: 34 YFQQKLDHYAPLDNRTWAQRYFV-MDHWFNKTAQPLVILYICGEGECNGVQYNSSFTSKI 92
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A +++ EHR+YG+S P+G +Y L YLTA+QAL D A FI +K N
Sbjct: 93 AEIHNGIVLSLEHRFYGKSQPFGFGNDSYA-LPNLKYLTAQQALNDLAWFIQYVKDNQLF 151
Query: 192 EASPV---VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
+P + GGSY G L+AW R K+PH+ IGALASSA + + D F +S
Sbjct: 152 GITPNMPWITIGGSYPGALSAWFRYKFPHLTIGALASSAVVNAYADFYE---FDQQISDS 208
Query: 249 FKRESASCFNTIKE 262
+ S +C + +
Sbjct: 209 LSKNSGNCRQIVHD 222
>gi|307108517|gb|EFN56757.1| hypothetical protein CHLNCDRAFT_144219 [Chlorella variabilis]
Length = 303
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/183 (40%), Positives = 107/183 (58%), Gaps = 7/183 (3%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E Y EQ+ ++ S PT+ QRY + +W + PIF Y GNEG++ N+G +
Sbjct: 63 ELFYEEQQTNNVS--GKPTWRQRYFLCDQYWDREDPYAPIFFYAGNEGNVANGVNNTGLM 120
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
W+ A FGA+LVF EHRYYG S P+G E L +L+ EQA+ D+ F+ LK +
Sbjct: 121 WERAQAFGALLVFAEHRYYGNSWPFGKEESL--TLEGLQFLSMEQAIEDYVTFLNWLKIS 178
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI---LQFEDIVPPETFYNIV 245
L+A ++PVV FGGSYGG+L A MR P A++SSAP+ L + P +++ +V
Sbjct: 179 LNATSAPVVAFGGSYGGVLVAIMRATRPSSVQAAVSSSAPMRGWLLQDGGYDPGSYWEVV 238
Query: 246 SSD 248
+ D
Sbjct: 239 TRD 241
>gi|167375432|ref|XP_001733641.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165905185|gb|EDR30254.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 480
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 79/233 (33%), Positives = 124/233 (53%), Gaps = 15/233 (6%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV-WDIAPRF 135
LDHF+ + F +Y I+TD+ + P+F+ G EG + + + FV D+A +
Sbjct: 46 LDHFNVNNQIDFDIQYFISTDYLDNNSPNAPLFVLLGGEGPEDATGLQNYFVVTDLAKKH 105
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
+++ EHR+YG S P S E+ L Y TAEQAL D+ I+++++ + P
Sbjct: 106 KGLMLSVEHRFYGASTP--SLEM-----DKLIYCTAEQALMDYVEVISHVQEENNLVGHP 158
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V++ GGSY G LAAWMR KYP++ GA ASSAP+ E +V + +V + + +A
Sbjct: 159 VIVLGGSYSGNLAAWMRQKYPNVVEGAWASSAPV---EAVVDFYQYLEVVQNALPKNTAD 215
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLA 308
+ E W E+ + E G EL K F+ C E +D+ + ES + L+
Sbjct: 216 LLSFAFEKWDEMTTT---EEGRKELGKIFNTCTEFGE-KDIQTFAESIGTALS 264
>gi|195109610|ref|XP_001999376.1| GI23100 [Drosophila mojavensis]
gi|193915970|gb|EDW14837.1| GI23100 [Drosophila mojavensis]
Length = 882
Score = 121 bits (304), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 139/308 (45%), Gaps = 28/308 (9%)
Query: 29 LSLAAQPSKF---RRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADL 85
++LA+Q + P FV L + P + +++ + E ++ Q LD F +
Sbjct: 12 VALASQAISLELKKDVPVFVKTLKDMQRGPPL--KMVKRELKGEEKWITQPLDQFDETNK 69
Query: 86 PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHR 145
T+ RY IN + PIF++ G E + +N G+ +D+A +L++ EHR
Sbjct: 70 ETYEMRYFINDEF---QTEGSPIFIFLGGEWEASLNMINDGYWYDLAKEHKGVLIYTEHR 126
Query: 146 YYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-QNLSAEASPVVLFGGSYG 204
YYG S+P + + L YL +QALAD A FI K +N S VVL G SY
Sbjct: 127 YYGASVPTKTMSL-----EDLKYLHVKQALADVANFIKTFKSENAQLSNSKVVLSGCSYS 181
Query: 205 GMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESA-SCFNTIKES 263
+A W + YP + +GA ASSAP+ D + +V F+ C+N I++
Sbjct: 182 ATMAVWFKRLYPDLVVGAWASSAPLFAKVDFYE---YKEVVGKAFRELGGEKCYNRIEKG 238
Query: 264 WGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW-----LESAYSYLAMV----DYPY 314
EL S+ + + E +C + DL W + + +S LA D Y
Sbjct: 239 IAELESMFKNKRA-AEARAMLRICSNFDHENDLDLWSLFGSISNVFSSLAQYQGAGDLEY 297
Query: 315 PSDFMMPL 322
DF+M
Sbjct: 298 YCDFIMSF 305
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 97/317 (30%), Positives = 153/317 (48%), Gaps = 23/317 (7%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
ETR+F Q LD+F + + QR LIN D++V + PIF+Y G E I+ A+ SG
Sbjct: 457 ETRWFNQSLDNFDDTNKNVWDQRVLINEDNFVDGS---PIFIYLGGEWAIDPSAITSGLW 513
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
DIA LV+ EHR++GES+P T ++ +N L Y + EQALAD I LK+
Sbjct: 514 VDIAKEHNGSLVYTEHRFFGESIPI--TPLSTKN---LKYQSVEQALADVVNVIKVLKEE 568
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
+ S VV+ G SY +A W +L YP + +G+ ASSAP+ + IV F IV
Sbjct: 569 DKYKESKVVISGCSYSASMAVWFKLLYPDVIVGSWASSAPL---DAIVDFSDFMEIVGRA 625
Query: 249 FKRESAS-CFNTIKES---WGELVSVGQ--KENGLLELTKTFHLCRELNSTEDLADWLES 302
+++ C++ I + + +L GQ + LL L +F E N + +
Sbjct: 626 YRQLGGDYCYDLIDNATSYYEDLFQTGQGARAKELLNLCDSFD---ENNERDQWQIFSSI 682
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILE-RIFEGVSVYYNYTGNV 361
A + + Y P ++ + +R D+A + ++ R+ + V Y G V
Sbjct: 683 ANIFAGIAQYQKPENYDLAQYCSVLRSF--DDDDASALSQFVQWRLGKPECVNTRYQGTV 740
Query: 362 DCFQLDDDPHGLDGWNW 378
D ++ + + G W
Sbjct: 741 DYYKWSKNNYDGSGLGW 757
>gi|344292450|ref|XP_003417940.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 484
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 84/270 (31%), Positives = 132/270 (48%), Gaps = 18/270 (6%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI--EWFAVNSGFVWD 130
F+Q+LDHFS + QRY N D + P GP+FL G + W + N+ +V
Sbjct: 42 FQQKLDHFSKNSTELWPQRYFFN-DAFYKPG--GPVFLLIGGFETVCESWISTNNTWV-S 97
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A R GA+ + EHR+YG S P G A +L YL++ QALAD F + + +
Sbjct: 98 YAERLGALFLLLEHRFYGHSQPKGDLSTA-----SLHYLSSRQALADIVNFRIKVAEKVG 152
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ V FG SYGG LA W R+K+P + A+ SSAPI + + +V
Sbjct: 153 LTKNKWVAFGCSYGGSLAVWSRIKHPDLFAAAVGSSAPIKAKANFYE---YLEVVQRSLA 209
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN--STEDLADWLES-AYSYL 307
++ CF ++E++G++V + +L K F LC+ + S D A ++E + +
Sbjct: 210 THNSKCFQAVREAFGQVVKMLSLRKYHRQLRKDFTLCKPMKRFSEMDKAFFIEHLVFPFK 269
Query: 308 AMV-DYPYPSDFMMPLPGYPIREVCKKIDN 336
+ V D SD+ + + E+C + N
Sbjct: 270 SAVQDNDNESDYEGIQVSFGMDELCDMMTN 299
>gi|194899986|ref|XP_001979538.1| GG23239 [Drosophila erecta]
gi|190651241|gb|EDV48496.1| GG23239 [Drosophila erecta]
Length = 473
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 133/277 (48%), Gaps = 22/277 (7%)
Query: 26 LSPLSLA-AQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFAD 84
L+ L+L A S F R + + + P L P Q R Q T + EQ+LDHF A+
Sbjct: 9 LALLALGQAHGSIFERTFKRIHEEPPL---PTTQNRADAVQ----TLWIEQKLDHFDPAE 61
Query: 85 LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
T+ RY++N + P+F+Y G E +I + G ++D+A A+L + EH
Sbjct: 62 TRTWQMRYMLNDALYKSG---APLFIYLGGEWEISAGRITGGHLYDMAKEHSALLAYTEH 118
Query: 145 RYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA-EASPVVLFGGSY 203
RYYG+S P +++ +N + YL+ Q+LAD A FI +KQN S V++ GGSY
Sbjct: 119 RYYGQSKPL--PDLSNEN---IKYLSVNQSLADLAYFINTIKQNHEGLSESKVIIVGGSY 173
Query: 204 GGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKE 262
+ W + YP + G ASSAP+ + V + + ++ S C+ I+
Sbjct: 174 SATMVTWFKKLYPDLVAGGWASSAPLFAKVNFVE---YKEVTGQSIEQMGGSACYKRIEN 230
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
E+ S+ + G E+ LC + DL W
Sbjct: 231 GIAEMESMIATKRG-AEVKALLKLCEPFDVYNDLDVW 266
>gi|198437028|ref|XP_002124988.1| PREDICTED: similar to CG9953 CG9953-PA [Ciona intestinalis]
Length = 508
Score = 121 bits (303), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 95/282 (33%), Positives = 142/282 (50%), Gaps = 19/282 (6%)
Query: 61 QQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW 120
+ +Q R+ EQRL++F A++ T+ QRY N + + PN GP+FL G EG
Sbjct: 33 ESSKQDSANDRWIEQRLNNFDSANVHTWKQRYFAN-NQFSTPN--GPVFLSIGGEGPAGS 89
Query: 121 FAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
+ G +A + GAML EHR+YG+S P +LS L++EQALAD A
Sbjct: 90 IWMTKGHWVTMAKQTGAMLFMIEHRFYGKSHPTPDAS-----LESLSVLSSEQALADIAN 144
Query: 181 FITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET 240
FITN+ ++FGGSY G LA W R KYPH+ GA+++SAP+ IV +
Sbjct: 145 FITNITAEYKLAGRKWIVFGGSYSGSLAIWARYKYPHLISGAVSASAPL---HPIVNFDG 201
Query: 241 FYNIVSSDFKR-ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD- 298
+ +V + S C + + E+ S+ + G LT F+LC L ++D+ D
Sbjct: 202 YQEVVQRSLQTLGSPKCVQNLANATTEITSLMKTTAGRKILTSEFNLCHAL--SDDVLDN 259
Query: 299 --WLES-AYSYLAMVDYPYPS-DFMMPLPGYPIREVCKKIDN 336
+ ES A S +V Y + F P + I +C +D+
Sbjct: 260 QYFQESVAGSIQDVVQYNRDNMHFEGHGPAFNISYICHILDD 301
>gi|123703648|ref|NP_001074031.1| uncharacterized protein LOC556307 precursor [Danio rerio]
gi|120538664|gb|AAI29321.1| Zgc:158605 [Danio rerio]
Length = 488
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 91/260 (35%), Positives = 133/260 (51%), Gaps = 16/260 (6%)
Query: 57 RQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHW-VGPNRLGPIFLYCGNE 115
R + Q + E ++F QRLDHF+ AD + QRY +N + VG GP+FL G E
Sbjct: 33 RYAARDQLKSGVEDQWFIQRLDHFNGADSRVWKQRYFVNDSFYRVG----GPVFLMIGGE 88
Query: 116 GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQAL 175
G + G A + GA+ + EHR+YG+S P + +++ +N L +L++ QAL
Sbjct: 89 GPANPAWMQYGTWLTYAQKLGALCLLLEHRFYGKSHP--TEDLSTEN---LRFLSSRQAL 143
Query: 176 ADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
AD A F T S V FGGSY G LAAW RLKYPH+ ++A+SAP+ +
Sbjct: 144 ADLAHFRTVTAAARGLTNSKWVAFGGSYPGSLAAWFRLKYPHLVHASVATSAPV--HASV 201
Query: 236 VPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL--NST 293
PE + +V E+ C +K++ L+ +TK F LC +L S
Sbjct: 202 NFPE-YLEVVWRSLAAENPECPLLVKKASDTLLERLSDPKTYDNITKDFRLCSKLQIQSK 260
Query: 294 EDLADWLES-AYSYLAMVDY 312
D A LES A +++ +V Y
Sbjct: 261 MDSAYLLESLAGNFMDVVQY 280
>gi|405960530|gb|EKC26449.1| Putative serine protease F56F10.1 [Crassostrea gigas]
Length = 446
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 80/228 (35%), Positives = 123/228 (53%), Gaps = 23/228 (10%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ +Q+L HF++AD T+ QRY +N D + PN GPIFL G EG + G +
Sbjct: 2 WLKQKLTHFNYADTRTWQQRYFVN-DTFYKPN--GPIFLMIGGEGTANPAWMLQGAWIEY 58
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
A + A+ EHRYYG+S P + +++ N L +L++EQALAD A FI +K NL
Sbjct: 59 AKTYHAICFLLEHRYYGKSHP--TPDLSVDN---LQFLSSEQALADLAYFIQYVKHKYNL 113
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI---LQFEDIVPPETFYNIVS 246
++ ++ FGGS L+AW R+KYPH+ GA+A+SAPI L F++ + +V
Sbjct: 114 MSKDQKLITFGGS----LSAWFRVKYPHLVDGAVATSAPIFAQLNFKE------YLQVVV 163
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
S C IK + + + + G K F+LC L++++
Sbjct: 164 SSLATTGPGCNKNIKMATDTITKMISTDTGRKSAEKMFNLCDPLDTSQ 211
>gi|74004232|ref|XP_545414.2| PREDICTED: thymus-specific serine protease [Canis lupus familiaris]
Length = 521
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 67/160 (41%), Positives = 99/160 (61%), Gaps = 10/160 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HW ++ GP+FL+ G E + +V G +
Sbjct: 59 WLEQPLDPFNASDTRSFLQRYWVNDQHWT--SQRGPVFLHLGGESSLRSGSVLRGHPTAL 116
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P G +VA L +L++ ALAD A L + N+
Sbjct: 117 APAWGALVIGLEHRFYGLSVPAGGLDVA-----QLRFLSSRHALADVASARLALARLFNV 171
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+
Sbjct: 172 SS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPV 210
>gi|410910698|ref|XP_003968827.1| PREDICTED: thymus-specific serine protease-like [Takifugu rubripes]
Length = 493
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 85/248 (34%), Positives = 135/248 (54%), Gaps = 15/248 (6%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
++ ++F QRLDHFS AD + QRY + + + P+ GP+FL G EG + G
Sbjct: 49 FDEQWFSQRLDHFS-ADSREWKQRYFL-SQAFYKPD--GPVFLMIGGEGPANPAWMQYGT 104
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
A + GA+ + EHR+YG+S P +++++ N L +L++ QALAD A F T + +
Sbjct: 105 WLTYAEKLGALCLMLEHRFYGKSRP--TSDLSTDN---LRFLSSRQALADLAHFRTTIAE 159
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
L + V FGGSY G LAAW RLKYPH+ A+A+SAP+ + PE + +V
Sbjct: 160 ALGLTNAKWVAFGGSYPGSLAAWFRLKYPHMVHAAVATSAPVRATVNF--PE-YLEVVWR 216
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL--NSTEDLADWLES-AY 304
C +K++ L + ++ +TK F+LC +L + D A +LE+ A
Sbjct: 217 SLASVDVECPLLVKKASDTLAELLKEPKTYDNITKDFNLCSKLQIQTEMDSAQFLETLAG 276
Query: 305 SYLAMVDY 312
+++ +V Y
Sbjct: 277 NFMDVVQY 284
>gi|357628484|gb|EHJ77798.1| hypothetical protein KGM_21289 [Danaus plexippus]
Length = 499
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 137/296 (46%), Gaps = 26/296 (8%)
Query: 76 RLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRF 135
RL+HF ++ TF RY N+ GP I ++ G E I V SG +++A R
Sbjct: 43 RLNHFDASNTDTFQMRYYYNSQFSRGPY----IVIFVGGEWSISPGWVRSGLAYELAERI 98
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK----QNLSA 191
GA L + EHRYYG + P T VA + YL+ +QAL D A FI ++ +
Sbjct: 99 GAGLFYTEHRYYGLTRPTNGTTVA-----EMRYLSVDQALGDLAQFIEYVRSDDFEGGRF 153
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR 251
+ V LFG SY G +A WM+L YPH+ +L+ S P+ +D PE I ++ +
Sbjct: 154 RNARVALFGCSYAGSMATWMKLGYPHLVRTSLSDSGPLHAQQDF--PEYLEVIATALRVQ 211
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE-DLAD--WLESAYSYLA 308
S C + I+ + + + + E GL ++ F+ C L + DL+ W ++
Sbjct: 212 GSQQCVDDIESAMKRINELIETEAGLDTVSTLFNTCSRLRRSHLDLSTFFWYGITETFAY 271
Query: 309 MVDYPYPSDFMMPLP-------GYPIREVCKKIDNAPDATSILE-RIFEGVSVYYN 356
+V Y P D G PI + + + P +E R FE V+ + N
Sbjct: 272 LVQYATPGDIPRACDHITNKTLGDPIERLSSWVTSQPYTQPCIESRYFEKVASHTN 327
>gi|195451235|ref|XP_002072826.1| GK13808 [Drosophila willistoni]
gi|194168911|gb|EDW83812.1| GK13808 [Drosophila willistoni]
Length = 445
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 79/233 (33%), Positives = 112/233 (48%), Gaps = 14/233 (6%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E ++ Q+LDHF ++ T+ RYL+N + PIF+Y G E IE V++G
Sbjct: 20 EEKWIAQKLDHFDESNTQTYQMRYLVNDEF---QEEGSPIFIYLGGEWAIEDSMVSAGHW 76
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-Q 187
+D+A +L++ EHRYYGES+P + + L YL +QALAD A FI K +
Sbjct: 77 YDMAQEHKGVLIYTEHRYYGESIPTTTMSTEH-----LQYLHVKQALADVAHFIETYKSE 131
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
N S V+L GGSY + W + YP + G ASSAP+L D + +V
Sbjct: 132 NSQLTNSKVLLAGGSYSATMVVWFKRLYPDLVEGGWASSAPLLAKVDFTE---YKEVVGR 188
Query: 248 DF-KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
F + C+N I EL S+ + N E LC + DL W
Sbjct: 189 AFLQLGGQQCYNRINNGIAELESMFEN-NRAAEARAMLRLCSSFDDKNDLDLW 240
>gi|195353655|ref|XP_002043319.1| GM26836 [Drosophila sechellia]
gi|194127433|gb|EDW49476.1| GM26836 [Drosophila sechellia]
Length = 473
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 85/277 (30%), Positives = 134/277 (48%), Gaps = 22/277 (7%)
Query: 26 LSPLSLA-AQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFAD 84
L+ L+L A S F R + + + P L P Q R Q T + EQ+LDHF A+
Sbjct: 9 LALLALGQAHGSIFERTFKRIHEEPPL---PTIQNRADVVQ----TLWIEQKLDHFDPAE 61
Query: 85 LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
T+ RY++N + P+F+Y G E +I + G ++D+A A+L + EH
Sbjct: 62 TRTWQMRYMLNDALYKSG---APLFIYLGGEWEISSGRITGGHLYDMAKEHNALLAYTEH 118
Query: 145 RYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA-EASPVVLFGGSY 203
RYYG+S P +++ +N + YL+ Q+LAD A FI +KQN S V++ GGSY
Sbjct: 119 RYYGQSKPL--PDLSNEN---IKYLSVNQSLADLAHFINTIKQNHEGLSESKVIIVGGSY 173
Query: 204 GGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKE 262
+ W + YP + G ASSAP+L + V + + ++ S C+ I+
Sbjct: 174 SATMVTWFKKLYPDLVAGGWASSAPLLAKVNFVE---YKEVTGQSIEQMGGSACYKRIEN 230
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
E+ ++ + G E+ LC + DL W
Sbjct: 231 GIAEMETMIATKRG-AEVKALLKLCEPFDVYSDLDVW 266
>gi|195109614|ref|XP_001999378.1| GI23097 [Drosophila mojavensis]
gi|193915972|gb|EDW14839.1| GI23097 [Drosophila mojavensis]
Length = 691
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 85/280 (30%), Positives = 135/280 (48%), Gaps = 19/280 (6%)
Query: 21 VIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHF 80
VII++L+ A + F R + + + P++ P Q R + Q T + EQ+LDHF
Sbjct: 224 VIITLLALGFSQANGNIFERTFKKLHEEPYV---PTNQNRADEVQ----TLWIEQKLDHF 276
Query: 81 SFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLV 140
+ ++ T+ RYL+N V GP+F+Y G E I ++ G ++D+A +L
Sbjct: 277 NDSETRTWQMRYLLND---VFFKAGGPMFIYLGGEWAISKGRISEGHMYDMAKEHNGLLA 333
Query: 141 FPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA-SPVVLF 199
+ EHRYYGES P + + +L +L +QALAD A FI K + + S V++
Sbjct: 334 YTEHRYYGESHP-----LPDLSNDSLQFLHVKQALADLAHFIKTQKASYKGLSDSKVIIV 388
Query: 200 GGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNT 259
GGSY + W + YP + G ASSAP+ D V + S + C+N
Sbjct: 389 GGSYSAAMVTWFKRTYPDLVAGGWASSAPLFAKVDFVEYKEIAG--QSIVLMGGSDCYNR 446
Query: 260 IKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
I++ E+ ++ + G E+ LC + DL W
Sbjct: 447 IQKGIAEMEAMFANKRG-SEVKALLKLCEPFDVYSDLDVW 485
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 74/217 (34%), Positives = 115/217 (52%), Gaps = 16/217 (7%)
Query: 21 VIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHF 80
VI+++L+ A + F R + + + P++ P Q R + Q T + EQ+LDHF
Sbjct: 8 VILTLLAIGFSQANGNIFERTFKKLHEEPYV---PTNQNRADEVQ----TLWIEQKLDHF 60
Query: 81 SFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLV 140
+ ++ T+ RY++N V GP+F+Y G E I ++ G ++D+A +L
Sbjct: 61 NDSETRTWQMRYMLND---VFFKAGGPMFIYLGGEWAISKGRISEGHMYDMAKEHNGLLA 117
Query: 141 FPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA-SPVVLF 199
+ EHRYYGES P +++ +N L YL +QALAD A FIT K + + S V++
Sbjct: 118 YTEHRYYGESHPL--PDLSNEN---LRYLHVKQALADLAHFITTQKASYEGLSDSKVIIV 172
Query: 200 GGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
GGSY + W + YP + G ASSAP+ D V
Sbjct: 173 GGSYSATMVTWFKRTYPDLVAGGWASSAPLFTKVDFV 209
>gi|225718928|gb|ACO15310.1| serine protease K12H4.7 precursor [Caligus clemensi]
Length = 485
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 96/312 (30%), Positives = 152/312 (48%), Gaps = 21/312 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
Y+ Q LDHF+ + T++QRY +NT+ + G P+FL G EG + G +
Sbjct: 51 YYNQTLDHFNEENKKTWNQRYFVNTEFFNG-TETAPVFLLIGGEGTASDSWMKYGAWYGY 109
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
A GA+++ EHR+YG S P + ++ +N L +LT++QAL D FI KQ S
Sbjct: 110 AKEVGALMIQLEHRFYGSSRP--TENMSTEN---LKFLTSQQALEDIVEFIRFAKQQYSL 164
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
E + V FGGSY G L+ WMR YP + GAL+SSAP+ E V E + +V +D
Sbjct: 165 NETNKWVTFGGSYPGSLSLWMRSLYPELISGALSSSAPV---EVKVDFEEYLGVVENDMN 221
Query: 251 RESASCFNTIKESWGELVS-VGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAM 309
C +K++ ++ + + +G ++ K F LC S +++ D L S Y+ +
Sbjct: 222 IRDPKCVPEVKKAIQQIQALIVSAPDGWKKVAKIFSLCDGW-SGDNIQD-LRSFYASVLG 279
Query: 310 VDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERI-------FEGVSVYYNYTGNVD 362
Y + + L + +C + N S LER+ + G + NY +D
Sbjct: 280 AFYS-SAQYDSVLNNDDLAHMCPYMSNEYFGDSSLERLASTLKGKYGGSCLNVNYKDLLD 338
Query: 363 CFQLDDDPHGLD 374
++ HG D
Sbjct: 339 FMTTEEWAHGED 350
>gi|115482892|ref|NP_001065039.1| Os10g0511400 [Oryza sativa Japonica Group]
gi|110289389|gb|AAP54577.2| prolyl carboxypeptidase like protein, putative, expressed [Oryza
sativa Japonica Group]
gi|113639648|dbj|BAF26953.1| Os10g0511400 [Oryza sativa Japonica Group]
gi|215697793|dbj|BAG91986.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 507
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 70/168 (41%), Positives = 94/168 (55%), Gaps = 15/168 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E R+ +Q LDHF+ D F QRY D++ P GPIFLY E + + ++
Sbjct: 53 EERWMDQTLDHFNPTDHRQFKQRYYEFLDYYRAPK--GPIFLYICGESSCN--GIPNSYL 108
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------I 182
+A +FGA +V PEHRYYG+S P+ S L +L+++QAL D AVF
Sbjct: 109 AVMAKKFGAAVVSPEHRYYGKSSPFESL-----TTENLRFLSSKQALFDLAVFRQYYQET 163
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N K N S S +FGGSY G L+AW RLK+PH+ G+LASS +L
Sbjct: 164 LNAKYNRSGADSSWFVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL 211
>gi|432892297|ref|XP_004075751.1| PREDICTED: thymus-specific serine protease-like [Oryzias latipes]
Length = 460
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 99/341 (29%), Positives = 168/341 (49%), Gaps = 45/341 (13%)
Query: 17 VITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQR 76
V + + S+ S +L + ++ + KL +PPQ+ + + EQ
Sbjct: 10 VFLLFLSSVDSGKTLWKIKEQLQKNKQVNLKLNVEGQPPQQ----------VKESWLEQP 59
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
LDHF+ TFSQ++ +N HW P+ GP+FL+ G EG I F V +G ++A + G
Sbjct: 60 LDHFNRLKGKTFSQKFFVNEAHWQRPD--GPVFLFIGGEGPIFSFDVLAGHHVEMAQQHG 117
Query: 137 AMLVFPEHRYYGESM-PYG-STEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NLSAE 192
A+L+ EHR+YG+S+ P G TE +L+ L+++QALAD A F + + NLS+
Sbjct: 118 ALLLAVEHRFYGDSINPDGLKTE-------SLADLSSQQALADLATFHGYICRSFNLSSR 170
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI---LQFEDIVPPETFYNIVSSDF 249
+P + FGGSY G L+AW R K+P + A+ASSAPI L F D + ++V
Sbjct: 171 -NPWISFGGSYSGALSAWFRGKFPSLVYAAVASSAPIKAKLDFSD------YNHVVGLSL 223
Query: 250 KR----ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES-AY 304
K S C+ +++++ V L ++ F+ C+ + +D + +++ A
Sbjct: 224 KNVAVGGSEKCWAQVQQAFAA-VEAELLTGNLSQVAGDFNCCQIPKNLDDQIELMQNLAD 282
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILE 345
++ V Y + ++E+C + N +E
Sbjct: 283 LFMGTVQYNEEGVLL------SVKELCDLMTNVSGEDKDME 317
>gi|440799806|gb|ELR20849.1| protease, serine, 16 (thymus), putative [Acanthamoeba castellanii
str. Neff]
Length = 478
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 85/290 (29%), Positives = 138/290 (47%), Gaps = 32/290 (11%)
Query: 50 HLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIF 109
H P + Q+ Y+ F Q++DHF + T+ Q+Y++ D++V P GPIF
Sbjct: 48 HAGSAPTELKGLSQKVYQ-----FTQKVDHFDPLNGKTYQQKYIVTDDNYV-PG--GPIF 99
Query: 110 LYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYL 169
L+ G E +E+F + + +FGA+ + EHR+YG SMP A +L+ L
Sbjct: 100 LFLGGEAPVEFFDFQTVLPRSLTKQFGALYIALEHRFYGVSMPAHDYSTA-----SLALL 154
Query: 170 TAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
++ QALAD A F+ + + L+ P V++G SY G L+AW R KYP++ +G++A S P+
Sbjct: 155 SSRQALADAANFLVSFNKTLT-NPGPWVVWGCSYSGALSAWFRAKYPNLVVGSVAPSGPV 213
Query: 230 LQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRE 289
+ +Y + S+ S C T+K + L++ +G EL
Sbjct: 214 YASLNFT---QYYGVFST---AASPQCVETVKRATAMLMAKLSTADGRKEL--------- 258
Query: 290 LNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPD 339
TE A E Y L + + SD P +P+ C + + D
Sbjct: 259 ---TEISASPQEHYYFLLTLTEAIGGSDQFQNPPAWPLNTTCNTMMQSGD 305
>gi|328866714|gb|EGG15097.1| hypothetical protein DFA_09921 [Dictyostelium fasciculatum]
Length = 495
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 83/242 (34%), Positives = 132/242 (54%), Gaps = 20/242 (8%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+ + +F Q +DHF A+ F QRYL+N W G GP+F+ EG + AV +G
Sbjct: 57 KIDYEWFTQSVDHFDSANQKKFQQRYLVNDHFWDGK---GPVFMMINGEGPMSLGAV-TG 112
Query: 127 F---VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
VW A A++V EHRYYG S + + +A +N L YLT +QALAD AVF
Sbjct: 113 LQYVVW--AKEVHALIVSLEHRYYGAS--FVTDNLATEN---LIYLTPQQALADNAVFRD 165
Query: 184 NLKQNLSA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
+ S + S V FGGSY G L++W R+KYP++ A+ASSAP+ ++ +
Sbjct: 166 FIANTYSVPQTSKWVSFGGSYSGCLSSWFRIKYPNLVDYAIASSAPV---NPVIDFYQYL 222
Query: 243 NIVSSDFKRES--ASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL 300
+V + S C + IK+S ++ + ++ NGL +++ F L L + +D+++++
Sbjct: 223 EVVQNALLTTSNGQQCVDRIKQSTQKIQDLLKQPNGLKTVSELFSLDPVLKTDDDISNFM 282
Query: 301 ES 302
+S
Sbjct: 283 QS 284
>gi|341893031|gb|EGT48966.1| hypothetical protein CAEBREN_19097 [Caenorhabditis brenneri]
Length = 511
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 80/245 (32%), Positives = 124/245 (50%), Gaps = 16/245 (6%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVWD 130
F Q LDHF ++ TF QRY N + W GP FL G EG W + + +
Sbjct: 65 FTQTLDHFDSSNGKTFQQRYYHN-NQWYKDG--GPAFLMLGGEGPESSYWVSYPGLEITN 121
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
+A + A + EHR+YGE+ P V + L YL++ QA+ D A FIT +K
Sbjct: 122 LAAKQNAWVFDIEHRFYGETKPTSDMSV-----SNLKYLSSAQAIEDAAAFITAMKIKYP 176
Query: 191 AEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
A+ V FGGSY G LAAW R K+P + A+ SS P+ D + + +V +
Sbjct: 177 MLANAKWVTFGGSYSGALAAWTRAKHPELVYAAVGSSGPVQAEVDF---KEYLEVVQNSI 233
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN-STEDLADWLESAYS-YL 307
R S +C +++ + + + S+ + +G +L FHLC+++ + L + E+ YS Y+
Sbjct: 234 TRNSTACADSVTQGFNLVASLLKTADGRKQLKSAFHLCQDIKLDDKSLKYFWETVYSPYM 293
Query: 308 AMVDY 312
+V Y
Sbjct: 294 EVVQY 298
>gi|118346703|ref|XP_976866.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89288597|gb|EAR86585.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 485
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 110/201 (54%), Gaps = 7/201 (3%)
Query: 33 AQPSKFRRAPRFVG-KLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQR 91
+Q K FVG L +++ + Q T YF Q++DHF + T++QR
Sbjct: 2 SQAQKLILVFLFVGVALGNISNEAPYSTKIGNQTIEQTTYYFTQKVDHFDPSSTDTYNQR 61
Query: 92 YLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESM 151
+ + ++ + N G +F++ G EG + SG+ +A +F AM++ EHR+YG S
Sbjct: 62 FTVYSEAFNPAN--GTVFIFIGGEGPQQGLTTGSGWYMLVAQQFSAMVICVEHRFYGVSQ 119
Query: 152 PYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN---LSAEASPVVLFGGSYGGMLA 208
P+G + A+ L +LT +Q+LAD A FI+ +K N + +P + GGSY G ++
Sbjct: 120 PFGQGQDAW-TVDHLKFLTVDQSLADLAYFISYIKANNFLRINDRNPFITVGGSYPGAMS 178
Query: 209 AWMRLKYPHIAIGALASSAPI 229
AW R KYPH+ IGA ASSA +
Sbjct: 179 AWFRYKYPHLTIGAHASSAVV 199
>gi|344247366|gb|EGW03470.1| Thymus-specific serine protease [Cricetulus griseus]
Length = 265
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 69/164 (42%), Positives = 98/164 (59%), Gaps = 8/164 (4%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D TF QRY +N H G + P+FL+ G EG + +V +G +
Sbjct: 60 WLEQPLDPFNTSDRRTFLQRYWVNDRHRAGQD--APVFLHIGGEGSLGPGSVMAGHPVAL 117
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS- 190
AP +GA+++ EHR+YG SMP G ++A L YL++ ALAD A L + L+
Sbjct: 118 APAWGALVISLEHRFYGLSMPSGGLDMA-----QLRYLSSRHALADVASARQALSRLLNV 172
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+ +SP + FGGSY G LA W RLK+PH+ A+ASSAP+ D
Sbjct: 173 SSSSPWICFGGSYAGSLATWTRLKFPHLVFAAVASSAPLSAVLD 216
>gi|67465637|ref|XP_648991.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
gi|56465317|gb|EAL43602.1| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
Length = 480
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 78/233 (33%), Positives = 124/233 (53%), Gaps = 15/233 (6%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV-WDIAPRF 135
LDHF+ + F +Y I+TD+ + P+F+ G EG + + + FV D+A +
Sbjct: 46 LDHFNANNQIDFDIQYFISTDYLDNNSPNAPLFVLLGGEGPEDETGLQNYFVVTDLAKKH 105
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
+++ EHR+YG S P S E+ L Y TAEQAL D+ I+++++ + P
Sbjct: 106 KGLMLSVEHRFYGASTP--SLEM-----DKLIYCTAEQALMDYVEVISHVQEENNLVGHP 158
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V++ GGSY G LAAWMR KYP++ GA ASSAP+ E +V + +V + + +A
Sbjct: 159 VIVLGGSYSGNLAAWMRQKYPNVVEGAWASSAPV---EAVVDFYQYLEVVQNALPKNTAD 215
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLA 308
+ E W ++ + E G EL K F+ C E +D+ + ES + L+
Sbjct: 216 LLSFAFEQWDKMTTT---EEGRKELGKIFNTCTEFGE-KDIQTFAESIGTALS 264
>gi|195391902|ref|XP_002054598.1| GJ22718 [Drosophila virilis]
gi|194152684|gb|EDW68118.1| GJ22718 [Drosophila virilis]
Length = 476
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 84/282 (29%), Positives = 135/282 (47%), Gaps = 23/282 (8%)
Query: 21 VIISILSPLSLAAQPSKFRRA-PRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDH 79
VI+ +L+ + A + F+R + G+ P P Q R + Q T + EQ+LDH
Sbjct: 8 VILVLLALGLVQADGNIFQRTFNKLHGEPP----VPANQNRADEVQ----TLWIEQKLDH 59
Query: 80 FSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAML 139
F ++ T+ RY++N V GP+F++ G E +I + +G ++D+A +L
Sbjct: 60 FDESETRTWQMRYMLND---VFFKAGGPLFIFLGGEWEISTGRITAGHMYDMAKEHNGLL 116
Query: 140 VFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA-SPVVL 198
+ EHRYYGES P +++ +N + YL +QALAD A FIT K S V++
Sbjct: 117 AYTEHRYYGESHPL--PDLSNEN---IQYLHVKQALADLAHFITTQKATYEGLVDSKVII 171
Query: 199 FGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CF 257
GGSY + W + YP + +G ASSAP+ + V + + S C+
Sbjct: 172 VGGSYSATMVTWFKKTYPDLVVGGWASSAPLFAKMNFVE---YKEVTGQSIALMGGSACY 228
Query: 258 NTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
N I+ E+ ++ + G E+ LC + DL W
Sbjct: 229 NRIENGIAEMEAMIASKRG-AEVKALLKLCERFDVYSDLDIW 269
>gi|194744691|ref|XP_001954826.1| GF16546 [Drosophila ananassae]
gi|190627863|gb|EDV43387.1| GF16546 [Drosophila ananassae]
Length = 473
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 80/262 (30%), Positives = 123/262 (46%), Gaps = 15/262 (5%)
Query: 40 RAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHW 99
RA F L E P Q + ET + EQ+LDHF A+ T+ RY++N +
Sbjct: 18 RATIFQRTFKRLHEEPPLPSNQNRADI-VETLWIEQKLDHFDEAETRTWQMRYMLNDAVY 76
Query: 100 VGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVA 159
GP+F+Y G E +I + G ++D+A +L + EHRYYG+S P +++
Sbjct: 77 ---QSGGPLFIYLGGEWEISSGRITGGHMYDMAKEHNGLLAYTEHRYYGQSKPL--PDLS 131
Query: 160 YQNATTLSYLTAEQALADFAVFITNLKQNLSA-EASPVVLFGGSYGGMLAAWMRLKYPHI 218
+N + YLT Q+LAD A FIT++K S V++ GGSY + W + YP +
Sbjct: 132 NEN---IKYLTVNQSLADLAHFITSIKATHEGLSESKVIIVGGSYSATMVTWFKKTYPDL 188
Query: 219 AIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKESWGELVSVGQKENGL 277
G ASSAP+ + V + I + S C+ I+ E+ + + G
Sbjct: 189 VAGGWASSAPLFAKVNFVE---YKEITGQSIELMGGSACYKRIENGIAEMEDMFATKRG- 244
Query: 278 LELTKTFHLCRELNSTEDLADW 299
E+ LC + + DL W
Sbjct: 245 GEVKALLKLCEPFDVSSDLDIW 266
>gi|170045812|ref|XP_001850488.1| lysosomal pro-X carboxypeptidase [Culex quinquefasciatus]
gi|167868716|gb|EDS32099.1| lysosomal pro-X carboxypeptidase [Culex quinquefasciatus]
Length = 466
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 78/225 (34%), Positives = 117/225 (52%), Gaps = 18/225 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+FE R+DHFS ++ TFS RY N +H GPIF+ G+ G IE + G +D
Sbjct: 25 WFETRVDHFSPRNMDTFSMRYYSNDEHAYAK---GPIFVIVGSNGPIETRYLREGLFYDT 81
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A GA L EHRY+G S+P V + L +LT +QALAD A +I +L+ +
Sbjct: 82 AYLEGAYLFANEHRYFGHSLP-----VDDASTENLDFLTVDQALADLAAWIHHLRHEVVG 136
Query: 192 E-ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ V+L G YGG LA W ++PH++ G SS D+ PE ++ ++ +
Sbjct: 137 NPQAKVILMGWGYGGSLATWFHTQFPHLSDGVWVSSGN--NNADLNLPEYMESLGNTIGE 194
Query: 251 RESASCFNTIKESW---GELVSVGQKENGLLELTKTFHLCRELNS 292
C++TI S+ L+ + + E LT+ FHLC +LN+
Sbjct: 195 FGGRDCYSTIFSSFLVAQNLIELDRSE----LLTEMFHLCDDLNT 235
>gi|301783465|ref|XP_002927151.1| PREDICTED: thymus-specific serine protease-like [Ailuropoda
melanoleuca]
Length = 476
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 72/189 (38%), Positives = 107/189 (56%), Gaps = 15/189 (7%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HW ++ GP+FL+ G EG + +V G +
Sbjct: 59 WLEQPLDPFNTSDQRSFLQRYWVNDQHWA--SQRGPVFLHLGGEGSLRSGSVMRGHPAAL 116
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQAL--ADFAVFITNLKQ-- 187
AP +GA+++ EHR+YG S+P +VA L +L++ AL AD A L +
Sbjct: 117 APAWGALVIGLEHRFYGLSIPAEGLDVA-----QLRFLSSRHALPSADVASARRALARLF 171
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
N+S ASP + FGGSY G LAAW RLK+PH+ + ++ASSAP+ D + +VS
Sbjct: 172 NVST-ASPWICFGGSYAGSLAAWARLKFPHLILASVASSAPVRAVLDF---SEYNEVVSR 227
Query: 248 DFKRESASC 256
+ +C
Sbjct: 228 SLTSTAPAC 236
>gi|162287351|ref|NP_001069798.2| thymus-specific serine protease precursor [Bos taurus]
gi|296474038|tpg|DAA16153.1| TPA: protease, serine, 16 [Bos taurus]
Length = 516
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 87/270 (32%), Positives = 133/270 (49%), Gaps = 22/270 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HW + GP+FL+ G EG + +V G ++
Sbjct: 59 WLEQPLDPFNASDRRSFLQRYWVNDQHWTSQD--GPVFLHLGGEGSLGPGSVMRGHPANL 116
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P ++A L +L++ ALAD A L + N+
Sbjct: 117 APIWGALVISLEHRFYGLSIPAEGLDMA-----QLRFLSSRHALADAASARLTLSRLFNV 171
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
S+ SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ D + ++VS
Sbjct: 172 SS-TSPWICFGGSYAGSLAAWARLKFPHLFFASIASSAPVRAILDF---SKYNDVVSRSL 227
Query: 250 KRE----SASCFNTIKESWGELVSVGQKENGL-LELTKTFHLCRELNSTEDLADWLESAY 304
S C ++ E+ + G L+ C L ED A+ L A
Sbjct: 228 MNTAIGGSLECRAAASAAFAEVERRLRASRGAWATLSVELGACGSLERAEDQAELL-GAL 286
Query: 305 SYLAMVDYPYPSDFMMPLPGYPIREVCKKI 334
L Y PL +R++C+ +
Sbjct: 287 QALVGGAVQYDGQAGAPL---SVRQLCRLL 313
>gi|334328701|ref|XP_001372314.2| PREDICTED: thymus-specific serine protease-like [Monodelphis
domestica]
Length = 503
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 85/240 (35%), Positives = 125/240 (52%), Gaps = 24/240 (10%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ Q LD F+ +D +F QRY +N HW P P+FL+ G EG + +V G +
Sbjct: 53 WLWQPLDPFNSSDGRSFRQRYWVNVGHWRPPR--APVFLHIGGEGSLGPSSVWKGHPGTL 110
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
A +GA+++ EHR+YG+S+P + A L +L++ ALAD A L N+
Sbjct: 111 AASWGALVISLEHRFYGQSIPPRGLDGA-----QLRFLSSRHALADVASARLRLSGIYNI 165
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
SA +SP + FGGSY G LAAW RLKYPH+ A+ASSAP+ D + YN V S
Sbjct: 166 SA-SSPWIAFGGSYAGSLAAWARLKYPHLIWAAVASSAPVQAQLDF----SGYNWVVSRS 220
Query: 250 KRE-----SASCFNTIKESWGEL---VSVGQKENGLLELTKTFHLCRELNSTEDLADWLE 301
+ S C + +++ EL +S G + +++ C L + ED A+ LE
Sbjct: 221 LADPQVGGSPKCQRAVAQAFSELDDRLSEGGETRAVVQ--AEVRACGSLEAPEDQAELLE 278
>gi|195569687|ref|XP_002102840.1| GD19284 [Drosophila simulans]
gi|194198767|gb|EDX12343.1| GD19284 [Drosophila simulans]
Length = 515
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 133/277 (48%), Gaps = 22/277 (7%)
Query: 26 LSPLSLA-AQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFAD 84
L+ L+L S F R + + + P L P Q R Q T + EQ+LDHF A+
Sbjct: 9 LALLALGQTHGSIFERTFKRIHEEPPL---PTIQNRADVVQ----TLWIEQKLDHFDAAE 61
Query: 85 LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
T+ RY++N + P+F+Y G E +I + G ++D+A A+L + EH
Sbjct: 62 TRTWQMRYMLNDALYKSG---APLFIYLGGEWEISSGRITGGHLYDMAKEHNALLAYTEH 118
Query: 145 RYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA-EASPVVLFGGSY 203
RYYG+S P +++ +N + YL+ Q+LAD A FI +KQN S V++ GGSY
Sbjct: 119 RYYGQSKPL--PDLSNEN---IKYLSVNQSLADLAHFINTIKQNHEGLSESKVIIVGGSY 173
Query: 204 GGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKE 262
+ W + YP + G ASSAP+L + V + + ++ S C+ I+
Sbjct: 174 SATMVTWFKKLYPDLVAGGWASSAPLLAKVNFVE---YKEVTGQSIEQMGGSACYKRIEN 230
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
E+ ++ + G E+ LC + DL W
Sbjct: 231 GIAEMETMIATKRG-AEVKALLKLCEPFDVYSDLDVW 266
>gi|326502560|dbj|BAJ95343.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 490
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 76/206 (36%), Positives = 108/206 (52%), Gaps = 25/206 (12%)
Query: 32 AAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQR 91
+A+P F PR G L+ P R + E R+ QRLDHFS +D F QR
Sbjct: 22 SAEPLGFSHRPRTAGG--ELSAAPSRYLAR-------EERWMSQRLDHFSSSDHRQFKQR 72
Query: 92 YLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESM 151
Y D+ P GP+FL E + + + ++ IA +FGA +V PEHRYYG+S
Sbjct: 73 YFEFLDYHDDPT--GPVFLRICGESSCD--GIPNDYLAVIAKKFGAAVVTPEHRYYGKSS 128
Query: 152 PYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE-------ASPVVLFGGSYG 204
P+ S L +L+++QAL D AVF ++ L++ +P +FG SY
Sbjct: 129 PFDSL-----TTDNLRFLSSKQALFDLAVFRQYYQEKLNSRYNRSAGFDNPWFVFGVSYS 183
Query: 205 GMLAAWMRLKYPHIAIGALASSAPIL 230
G L+AW RLK+PH+ G+LASS +L
Sbjct: 184 GALSAWFRLKFPHLTCGSLASSGVVL 209
>gi|325186496|emb|CCA21036.1| lysosomal ProX carboxypeptidase putative [Albugo laibachii Nc14]
Length = 250
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 95/159 (59%), Gaps = 11/159 (6%)
Query: 63 QQQYRYETRYFEQRLDHFSF------ADLPTF-SQRYLINTDHWVGPNRLGPIFLYCGNE 115
+ + + + QR+D FS+ + P F QRYLIN + W + P F Y GNE
Sbjct: 95 KTKLNWSEEWITQRVDQFSWPSSYSRSQPPLFYKQRYLINNETWDPNDPKAPTFFYTGNE 154
Query: 116 G-DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA 174
D+ +A ++G +W+ A F A++VF EHR+YG S P+ S+++ + L Y T EQA
Sbjct: 155 ASDVSLYANHTGLMWEYAAHFKALIVFAEHRFYGLSQPFNSSQLI---PSHLRYRTHEQA 211
Query: 175 LADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRL 213
+AD+A+ + ++++ + PV+ FGGSYGGML+AW R+
Sbjct: 212 IADYALLLESIQKRFHGDRHPVITFGGSYGGMLSAWFRI 250
>gi|195497934|ref|XP_002096311.1| GE25599 [Drosophila yakuba]
gi|194182412|gb|EDW96023.1| GE25599 [Drosophila yakuba]
Length = 473
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 82/277 (29%), Positives = 131/277 (47%), Gaps = 22/277 (7%)
Query: 26 LSPLSLA-AQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFAD 84
L+ L+L S F R + + + P L+ R Q T + EQ+LDHF A+
Sbjct: 9 LALLALGQTHGSIFERTFKRIHEEPPLSTIQNRADVVQ-------TLWIEQKLDHFDPAE 61
Query: 85 LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
T+ RY++N + P+F+Y G E +I + G ++D+A A+L + EH
Sbjct: 62 TRTWQMRYMLNDALYKSG---APLFIYLGGEWEISSGRITGGHLYDMAKEHNALLAYTEH 118
Query: 145 RYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA-EASPVVLFGGSY 203
RYYG+S P +++ +N + YL+ Q+LAD A FI +KQN S V++ GGSY
Sbjct: 119 RYYGQSKPL--PDLSNEN---IKYLSVNQSLADLAYFINTIKQNHEGLSESKVIIVGGSY 173
Query: 204 GGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKE 262
+ W + YP + G ASSAP+ + V + + ++ S C+ I+
Sbjct: 174 SATMVTWFKKLYPDLVAGGWASSAPLFAKVNFVE---YKEVTGQSIQQMGGSDCYKRIEN 230
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
E+ S+ + G E+ LC + DL W
Sbjct: 231 GIAEMESMIATKRG-AEVKAILKLCEPFDVYSDLDVW 266
>gi|330803162|ref|XP_003289578.1| hypothetical protein DICPUDRAFT_36006 [Dictyostelium purpureum]
gi|325080335|gb|EGC33895.1| hypothetical protein DICPUDRAFT_36006 [Dictyostelium purpureum]
Length = 485
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 85/253 (33%), Positives = 139/253 (54%), Gaps = 30/253 (11%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF--- 127
++F Q +DHF+ + TF QRYLIN ++ G GP+F+ EG + V +G
Sbjct: 52 QWFTQNVDHFNIVNTDTFQQRYLINDQYYDGT---GPVFIMINGEGPMGLDTV-TGLQFV 107
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
VW A + A++V EHRYYG S + +++++ N L +L ++QALAD AVF + Q
Sbjct: 108 VW--AKQLNALIVSLEHRYYGAS--FVTSDLSLDN---LQFLNSQQALADNAVFREFIAQ 160
Query: 188 NLSAEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY---N 243
+ ++ V FGGSY G L +W R+KYPH+ +ASS P + P FY
Sbjct: 161 KYNIPSTTKWVSFGGSYSGALTSWFRIKYPHLVDITIASSGP------VNPEVNFYQYLQ 214
Query: 244 IVSSDFKRES--ASCFNTIKESWGELVSVGQKEN-GLLELTKTFHLCRELNSTEDLADWL 300
+V + ++ + A C I + ++ S+ Q++N G +E F LC +L + D+A+++
Sbjct: 215 VVQNSLQQTNGGAECVQNIAIATDKVQSLLQQDNYGGVE--TLFDLCSQLENANDVANFM 272
Query: 301 ES-AYSYLAMVDY 312
S A +++ +V Y
Sbjct: 273 NSLAGNFMGVVQY 285
>gi|359476844|ref|XP_003631898.1| PREDICTED: probable serine protease EDA2-like [Vitis vinifera]
Length = 477
Score = 118 bits (295), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 84/215 (39%), Positives = 116/215 (53%), Gaps = 34/215 (15%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F Q +DHFS D F QRY TD++ P+ GPIFL E + + + ++ +
Sbjct: 42 WFNQTVDHFSPLDHSKFPQRYYEFTDYFRLPD--GPIFLKICGEASCD--GIPNDYISVL 97
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ITNL 185
A +FGA +V EHRYYG+S P+ S L YL+++QAL D AVF N+
Sbjct: 98 AKKFGAAVVSLEHRYYGKSSPFRSLRTE-----NLKYLSSKQALFDLAVFRQYYQESLNV 152
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K N S +P +FG SY G L+AW RLK+PH+ G+LASSA +L YN
Sbjct: 153 KVNRSNVENPWFVFGVSYAGALSAWFRLKFPHLTCGSLASSAVVL---------AIYNF- 202
Query: 246 SSDFKRESASCFNTIKESWG-ELVSVGQKENGLLE 279
++F R+ I ES G E +V Q+ NGL+E
Sbjct: 203 -TEFDRQ-------IGESAGAECKAVLQEVNGLVE 229
>gi|146425169|emb|CAM84573.1| intestinal prolyl carboxypeptidase 1 [Haemonchus contortus]
Length = 1071
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 118/226 (52%), Gaps = 16/226 (7%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNS 125
YET YF Q +DHF+ + TF Q+Y N + W GPIFL G EG +W +N
Sbjct: 583 YETGYFTQPVDHFNNQNPATFDQKYYKN-EQWAREG--GPIFLMIGGEGPSSAKWI-LNE 638
Query: 126 GFVW-DIAPRFGAMLVFPEHRYYGES----MPYGSTEVAYQNATTLSYLTAEQALADFAV 180
+ W A +FGA EHRYYG+S + + ST+ + T +YL++ Q L D A
Sbjct: 639 NYTWLQWAKKFGATTYMLEHRYYGDSDLQRLLFDSTDTKLKRTYT-TYLSSLQMLYDTAN 697
Query: 181 FITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET 240
FI + + + + +V FGGSY G LA WMR +P++ GA+ SSAP+ E +
Sbjct: 698 FIQAIDADNGKKGTWIV-FGGSYAGSLALWMRKLFPNLVHGAVGSSAPL---EAKLDYHE 753
Query: 241 FYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHL 286
+Y +V + + S C I E + ++ E G E++KTF L
Sbjct: 754 YYQVVEASIRXYSEDCAYAIGEGFEDIHEKMLSERGREEISKTFKL 799
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 80/305 (26%), Positives = 143/305 (46%), Gaps = 28/305 (9%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVW 129
Y +Q+LDH ++ +SQRY N ++ + FL G G DI W
Sbjct: 39 YLKQKLDHTQ--EVKEWSQRYFYNNRYYRKGGNVA--FLMLGGMGVLDIGWVTNEKIPFV 94
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
+A GA++ EHR+YG+S P + +++ +N L YLT EQA+ D FI + +
Sbjct: 95 QMAKERGALMFALEHRFYGKSRP--TDDLSVKN---LKYLTIEQAIGDIKTFIEEMNKKH 149
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYP--HIAIGALASSAPILQFEDIVPPETFY---NI 244
E ++FGGSY G LA W R KY ++ GA+ASS PI++ P F+
Sbjct: 150 KLENPKWIVFGGSYAGSLALWARDKYKDENLIAGAVASS-PIMR-----PKFDFWEATQF 203
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLAD--WLES 302
+ ++ C +I+ + +++ + + G +L++ F + R T DL + L S
Sbjct: 204 AEKEIQKVDKKCGESIRIGFMQMIDMLGNQVGRSQLSELFKM-RPRFLTPDLRNIQLLNS 262
Query: 303 AY--SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP-DATSILERIFEGVSVYYNYTG 359
++++ V + Y ++++C+ ++ D + LER+ + Y
Sbjct: 263 IQLNNFISAVQFRGGPYMQNGTHSYNLKQLCEIMNTETIDQLTALERVSNVRHLQSKYLN 322
Query: 360 NVDCF 364
++D +
Sbjct: 323 DMDKY 327
>gi|328875430|gb|EGG23794.1| hypothetical protein DFA_05930 [Dictyostelium fasciculatum]
Length = 550
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 84/279 (30%), Positives = 143/279 (51%), Gaps = 30/279 (10%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
++F QRLDHF+ + TF QRY+IN +W G GP+F+ EG + V +
Sbjct: 117 QWFTQRLDHFNTINQQTFQQRYVINDQYWNGK---GPVFIMINGEGPMSLATVTGLQFVN 173
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A + A+++ EHRYYG S + + +++ N L+YLT +QALAD A F + +
Sbjct: 174 WAQQSNALIISLEHRYYGAS--FATDDLSTDN---LAYLTPQQALADNAAFREFVAVTFN 228
Query: 191 AEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI------LQFEDIVPPETFYN 243
A S V FGGSY G L +W R+KYP++ +ASS P+ Q+ ++V
Sbjct: 229 VPATSKWVSFGGSYSGCLTSWFRIKYPNLVDYTVASSGPVNAEVNFYQYLEVVQNSLLT- 287
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES- 302
C + I ++ ++ ++ + NGL ++ F+L L S D+A++++S
Sbjct: 288 ------ATNGQQCVSNIAQATQKIQALLSQPNGLETVSDMFNLFPALESQNDVANFMQSL 341
Query: 303 AYSYLAMVDYPYPSDFMMPLPGYPIREVCKKI-DNAPDA 340
A +++ +V Y + G + +C+ + D++ DA
Sbjct: 342 AGNFMGVVQYN------LEEVGPSVETLCQTMTDSSNDA 374
>gi|340376307|ref|XP_003386675.1| PREDICTED: putative serine protease K12H4.7-like [Amphimedon
queenslandica]
Length = 486
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 79/228 (34%), Positives = 118/228 (51%), Gaps = 15/228 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F Q DHF D T+ QRY +N W N GP+FL G EG+ + V G + +
Sbjct: 49 WFTQSRDHFREVDTTTWQQRYWVNDSFWDKEN--GPVFLMIGGEGEADPKWVVEGEMMVL 106
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL--KQNL 189
A ++ A+ EHR+YG+S P + Y ++ L + QAL D A F N+ K N+
Sbjct: 107 AEKYHALAFQLEHRFYGKSQPGADLSMDY-----ITLLNSRQALEDLAYFRMNMTTKYNM 161
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
+ +A+ + FGGSY G LAAW+R+KYP I G++ASSAPI D ++ +VS+
Sbjct: 162 T-DANRWIAFGGSYPGALAAWLRMKYPDIVYGSIASSAPIQAKFDFYE---YFEVVSASL 217
Query: 250 K--RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED 295
+ R +C I ++ L + + L K F+L L + ED
Sbjct: 218 EQARNGPACVAAINAAFSLLNKLILDNSKWASLDKMFNLTVGLTTPED 265
>gi|237700855|gb|ACR16009.1| carboxypeptidase 3 [Mamestra configurata]
Length = 484
Score = 117 bits (293), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 83/294 (28%), Positives = 133/294 (45%), Gaps = 17/294 (5%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
LDHF + F R++ N + G PIF+ G E DI+ + +G ++++A
Sbjct: 55 LDHFDPQNPTEFLMRFMFNEQFFGGDG--SPIFIMVGGEWDIDHRWLLAGNMFEMARENK 112
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPV 196
V+ EHRYYG G+ A A L +L +QALAD A FIT +K+ S V
Sbjct: 113 GYQVYTEHRYYG-----GTKIFANFTAENLRFLNIDQALADLAYFITEMKKQPRFAESEV 167
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESA-S 255
VL+GGSY + W + +YPH+ +G +ASS PIL D PE + +V F E
Sbjct: 168 VLYGGSYAANMVMWFKKRYPHLVVGTVASSGPILAKVDF--PE-YLEVVHEAFMLEGGEE 224
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYP 315
C I+ E ++ Q E+G L +++ LC L+ + + ++ + +
Sbjct: 225 CIGHIRRGVEETIAAMQTESGRRLLEQSYRLCAPLDYDNE-----NELGVFAGLISWTFS 279
Query: 316 SDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDDD 369
+ PG ++ VC+ + S F G + C+ ++ D
Sbjct: 280 TSVQQARPG-TLQAVCQSFTDHTSYGSTPMEKFGGYIAIIRRLSDTSCWSINYD 332
>gi|326520219|dbj|BAK04034.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 508
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 69/168 (41%), Positives = 96/168 (57%), Gaps = 15/168 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E + Q LDHFS D F QRY D+ PN GP+FL E +++ ++
Sbjct: 56 EEHWMSQTLDHFSPTDHRQFKQRYYEFLDYHRAPN--GPVFLNICGEASCS--GISNNYL 111
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------I 182
+A +FGA LV PEHRYYG+S P+ ++ +N L +L+++QAL+D AVF
Sbjct: 112 AVMAKKFGAALVSPEHRYYGKSSPF--EDLTTEN---LRFLSSKQALSDLAVFRQYYQET 166
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N K N S + +FGGSY G L+AW RLK+PH+ G+LASS +L
Sbjct: 167 LNAKYNRSGADNSWFVFGGSYSGALSAWFRLKFPHLTCGSLASSGVVL 214
>gi|428165662|gb|EKX34652.1| hypothetical protein GUITHDRAFT_158798 [Guillardia theta CCMP2712]
Length = 490
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 96/297 (32%), Positives = 145/297 (48%), Gaps = 32/297 (10%)
Query: 55 PQRQQRQQQQQYRYET-------RYFEQRLDHFSFADLP-TFSQRYLINTDHWVGPNRLG 106
P R+ R Q+ + T R+F QR+DHFS D P T+ QRY +N + + G
Sbjct: 5 PYRELRVGQRDRHHATSMDGNNSRWFSQRVDHFS--DSPETWMQRYFVNETFFRMGS--G 60
Query: 107 PIFLYCGNEGD--IEWFAV----NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAY 160
P+FL G EG E V + + +A GA+++ EHRYYGES P V
Sbjct: 61 PVFLCVGGEGPPMTEQVVVTGENHCALMVHLARIHGALILALEHRYYGESHPRKDLSVE- 119
Query: 161 QNATTLSYLTAEQALADFAVFITNLKQNLSAEASPV-VLFGGSYGGMLAAWMRLKYPHIA 219
+ +L++ QAL D A F ++++ + + + FGGSY GMLAAW K+PH+
Sbjct: 120 ----NMRFLSSRQALEDIASFHSHIRSAFAISSKQRWITFGGSYPGMLAAWSHAKFPHLF 175
Query: 220 IGALASSAPILQFEDIVPPETFYNIVSSDFKRE----SASCFNTIKESWGELVSVGQKEN 275
A++SSAP+ + I+ + + N+V+SDF E S C NTIK ++ ++
Sbjct: 176 HAAVSSSAPV---QAILNMKGYNNVVASDFADETLGGSMLCLNTIKGAFAQVGEYLLSYE 232
Query: 276 GLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCK 332
G L F +C + ED+ + A + L+ P +D P IR CK
Sbjct: 233 GRRYLKTRFSVCGGDDVLEDIKNRALFAET-LSDPLIPQSNDPSCTSPLCDIRRQCK 288
>gi|24648175|ref|NP_650802.1| CG3734 [Drosophila melanogaster]
gi|19527569|gb|AAL89899.1| RE36938p [Drosophila melanogaster]
gi|23171716|gb|AAF55662.2| CG3734 [Drosophila melanogaster]
gi|220948338|gb|ACL86712.1| CG3734-PA [synthetic construct]
Length = 473
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 132/277 (47%), Gaps = 22/277 (7%)
Query: 26 LSPLSLA-AQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFAD 84
L+ L+L S F R + + + P L P Q R Q T + EQ+LDHF +
Sbjct: 9 LALLALGQTHGSIFERTFKRIHEEPPL---PTTQNRADVVQ----TLWIEQKLDHFDPEE 61
Query: 85 LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
T+ RY++N + P+F+Y G E +I + G ++D+A A+L + EH
Sbjct: 62 TRTWQMRYMLNDALYQSG---APLFIYLGGEWEISSGRITGGHLYDMAKEHNALLAYTEH 118
Query: 145 RYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA-SPVVLFGGSY 203
RYYG+S P +++ +N + YL Q+LAD A FI +KQN + S V++ GGSY
Sbjct: 119 RYYGQSKPL--PDLSNEN---IKYLNVNQSLADLAYFINTIKQNHEGLSDSKVIIVGGSY 173
Query: 204 GGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKE 262
+ W + YP + G ASSAP+L + V + I ++ S C+ I+
Sbjct: 174 SATMVTWFKKLYPDLVAGGWASSAPLLAKVNFVE---YKEITGQSIEQMGGSACYKRIEN 230
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
E+ ++ + G E+ LC + DL W
Sbjct: 231 GIAEMETMIATKRG-AEVKALLKLCEPFDVYSDLDVW 266
>gi|326532846|dbj|BAJ89268.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 522
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/168 (41%), Positives = 96/168 (57%), Gaps = 15/168 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E + Q LDHFS D F QRY D+ PN GP+FL E +++ ++
Sbjct: 56 EEHWMSQTLDHFSPTDHRQFKQRYYEFLDYHRAPN--GPVFLNICGEASCS--GISNNYL 111
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------I 182
+A +FGA LV PEHRYYG+S P+ ++ +N L +L+++QAL+D AVF
Sbjct: 112 AVMAKKFGAALVSPEHRYYGKSSPF--EDLTTEN---LRFLSSKQALSDLAVFRQYYQET 166
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N K N S + +FGGSY G L+AW RLK+PH+ G+LASS +L
Sbjct: 167 LNAKYNRSGADNSWFVFGGSYSGALSAWFRLKFPHLTCGSLASSGVVL 214
>gi|340500169|gb|EGR27066.1| hypothetical protein IMG5_202470 [Ichthyophthirius multifiliis]
Length = 2048
Score = 117 bits (292), Expect = 1e-23, Method: Composition-based stats.
Identities = 64/164 (39%), Positives = 94/164 (57%), Gaps = 6/164 (3%)
Query: 74 EQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAP 133
+ L+HF L + QRY I D + P G +F+ G EG + G++ +A
Sbjct: 1606 QNSLNHFDPLGLIKWKQRYTI-YDEYFNPEN-GTVFISIGGEGQMAGITNGRGWLIQLAQ 1663
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ---NLS 190
F A+++ EHR+YG S P+G T +Y + L YLT +Q+LAD A I+ +KQ +
Sbjct: 1664 EFSAIVISVEHRFYGVSQPFGYTNQSY-SLENLQYLTVDQSLADLANLISKIKQKKLHKI 1722
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+E +P + GGSY G ++AW R KYPH+ +GALASSA + ED
Sbjct: 1723 SEINPFITIGGSYPGAMSAWFRYKYPHLTVGALASSAVVNAIED 1766
>gi|357483671|ref|XP_003612122.1| Thymus-specific serine protease [Medicago truncatula]
gi|355513457|gb|AES95080.1| Thymus-specific serine protease [Medicago truncatula]
Length = 478
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 74/189 (39%), Positives = 111/189 (58%), Gaps = 18/189 (9%)
Query: 51 LTEPPQRQQRQQQQQYRYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGP 107
++ P +R+ + RY T+ +F Q LDH+S D F QRY DH+ P+ GP
Sbjct: 16 VSATPHLLRRRLSESARYLTKEELWFPQTLDHYSPYDHRKFQQRYYEFLDHFRIPD--GP 73
Query: 108 IFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLS 167
+FL E + + + ++ +A +FGA +V EHRYYG+S P+ S +A +N L
Sbjct: 74 VFLVICGEYSCD--GIRNDYIGVLAKKFGAAVVSLEHRYYGKSSPFKS--LATKN---LR 126
Query: 168 YLTAEQALADFAVFITNLKQNLSAEAS------PVVLFGGSYGGMLAAWMRLKYPHIAIG 221
YL+++QAL D AVF N + +L+A+ + P +FG SY G L+AW RLK+PH+ G
Sbjct: 127 YLSSKQALFDLAVFRQNYQDSLNAKLNRTNADNPWFVFGVSYPGALSAWFRLKFPHLTCG 186
Query: 222 ALASSAPIL 230
+LASSA +L
Sbjct: 187 SLASSAVVL 195
>gi|348682257|gb|EGZ22073.1| hypothetical protein PHYSODRAFT_495993 [Phytophthora sojae]
Length = 527
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 74/183 (40%), Positives = 102/183 (55%), Gaps = 15/183 (8%)
Query: 56 QRQQRQQQQQYRYETR--YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCG 113
Q Q + QQ +R + +F Q+LDHFS TF QRY + W P+ GP+ LY G
Sbjct: 35 QLLQAEDQQLFRAPAQQLWFSQQLDHFSSDANATFKQRYYEVDEFWKAPS--GPVILYIG 92
Query: 114 NEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQ 173
EG +E +GFV IA +FGA +V EHR+YG+S+P G A YLT +Q
Sbjct: 93 GEGALE--QAPAGFVHVIAQKFGAKIVALEHRFYGKSVPNGDLSTA-----NYRYLTVQQ 145
Query: 174 ALADFAVFITNLKQNLSAE-ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA---PI 229
ALAD F + ++ L AE A+ + GGSY G L+AW R+ YP + +L+SS P+
Sbjct: 146 ALADLKHFKESYQRELGAEDANQWIAIGGSYPGALSAWFRVAYPDTTVASLSSSGVVQPV 205
Query: 230 LQF 232
+F
Sbjct: 206 YKF 208
>gi|32564815|ref|NP_498759.2| Protein K12H4.7, isoform b [Caenorhabditis elegans]
gi|351063157|emb|CCD71199.1| Protein K12H4.7, isoform b [Caenorhabditis elegans]
Length = 473
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 80/245 (32%), Positives = 119/245 (48%), Gaps = 16/245 (6%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVWD 130
F Q LDHF + TF QRY N + W GP FL G EG W + + +
Sbjct: 63 FTQTLDHFDSSVGKTFQQRYYHN-NQWYKAG--GPAFLMLGGEGPESSYWVSYPGLEITN 119
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
+A + GA + EHR+YGE+ P V L YL++ QA+ D A FI +
Sbjct: 120 LAAKQGAWVFDIEHRFYGETHPTSDMSVP-----NLKYLSSAQAIEDAAAFIKAMTAKFP 174
Query: 191 AEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
A+ V FGGSY G LAAW R K+P + A+ SS P+ D + + +V +
Sbjct: 175 QLANAKWVTFGGSYSGALAAWTRAKHPELVYAAVGSSGPVQAEVDF---KEYLEVVQNSI 231
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN-STEDLADWLESAYS-YL 307
R S C ++ + + + S+ Q +G +L FHLC+++ + L + E+ YS Y+
Sbjct: 232 TRNSTECAASVTQGFNLVASLLQTSDGRKQLKTAFHLCQDIQMDDKSLKYFWETVYSPYM 291
Query: 308 AMVDY 312
+V Y
Sbjct: 292 EVVQY 296
>gi|167518842|ref|XP_001743761.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777723|gb|EDQ91339.1| predicted protein [Monosiga brevicollis MX1]
Length = 448
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 78/229 (34%), Positives = 120/229 (52%), Gaps = 17/229 (7%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI--EWFAVNSGFVWD 130
F+Q LDHF+ D TF Q+Y +N + GP+FL G EG EW N+ +
Sbjct: 10 FDQTLDHFNPQDTRTFKQQYQVNRTFYKAG---GPLFLMLGGEGPASPEWLETNTAIML- 65
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN-LKQNL 189
A + A++ EHR+YGES P+ V L YL++EQALAD A FI + L+ N
Sbjct: 66 YAQQLNAVVAQIEHRFYGESQPFEDLSV-----DNLRYLSSEQALADAANFIQSFLEMNG 120
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV--SS 247
+ VV FGGSY G L+A++R KYPH+ A+A+S+P+L D V ++ +V S
Sbjct: 121 MPADTRVVSFGGSYSGALSAFLRTKYPHVVYAAVATSSPVLAKLDYVE---YHEVVGRSM 177
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
+C + IK + ++ ++ + +L + +C + + DL
Sbjct: 178 GTSTHGQACVDQIKGALSKVDAMLADSSQWNQLAQDMKICSDTDLNVDL 226
>gi|341898957|gb|EGT54892.1| hypothetical protein CAEBREN_15465 [Caenorhabditis brenneri]
Length = 825
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/325 (29%), Positives = 154/325 (47%), Gaps = 26/325 (8%)
Query: 39 RRAPRFVGK------LPHLTEPPQRQQRQQQQQYRYETRY-FEQRLDHFSFADLPTFSQR 91
R P F+GK L H +R Q Y Y Y F Q+LDHF + T++Q+
Sbjct: 18 RLPPFFLGKPNGQTLLNHHLNRFDVTKRGVLQDYPYLQVYNFTQKLDHFDPYNTKTWNQK 77
Query: 92 YLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGE 149
Y N + IFL G EG + +W A A FGA + EHR++G+
Sbjct: 78 YFYNP---IYSRNNSIIFLMIGGEGPENGKWAAYPQVQYLQWAKEFGADVFDLEHRFFGD 134
Query: 150 SMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAA 209
S P E ++L YLT +QALAD A FI ++ Q + V FGGSY G L+A
Sbjct: 135 SWPIPDME-----TSSLRYLTTQQALADLAFFIESMNQQYGFKNPRWVTFGGSYPGSLSA 189
Query: 210 WMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVS 269
W R KYP + +G++ASSAP+ D + +V D + A+C +++++ ++
Sbjct: 190 WFRQKYPQLTVGSVASSAPVNLKLDFYE---YAMVVQDDLQLTDANCAPAVRDAFTQIQQ 246
Query: 270 VGQKENGLLELTKTFHLC--RELNSTE-DLADWLESAY-SYLAMVDYPYPSDFMMPLPGY 325
+ +G +L F+L + N+T+ D+ ++ + + ++ M Y Y
Sbjct: 247 LSLTVDGRNKLNNYFNLQPPFDANTTKLDINNFFGNLFNTFQGMTQYTYDGQSNSTHTDK 306
Query: 326 PIREVCKKIDNA--PDATSILERIF 348
++++C+ + N PD +E +F
Sbjct: 307 TVKKMCEIMTNTTEPDKVKRVENLF 331
>gi|426250769|ref|XP_004019106.1| PREDICTED: thymus-specific serine protease [Ovis aries]
Length = 516
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/160 (41%), Positives = 98/160 (61%), Gaps = 10/160 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HW + GP+FL+ G EG + +V G ++
Sbjct: 59 WLEQPLDPFNASDRRSFLQRYWVNDQHWTSQD--GPVFLHLGGEGSLGPGSVMRGHPANL 116
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P ++A L +L++ ALAD A L + N+
Sbjct: 117 APIWGALVISLEHRFYGLSIPAEGLDMA-----QLRFLSSRHALADAASARLTLSRLFNV 171
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
S+ SP + FGGSY G LAAW RLK+PH+ ++ASSAP+
Sbjct: 172 SS-TSPWICFGGSYAGSLAAWARLKFPHLFFASIASSAPV 210
>gi|326674064|ref|XP_002664605.2| PREDICTED: thymus-specific serine protease-like [Danio rerio]
Length = 581
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 87/269 (32%), Positives = 126/269 (46%), Gaps = 31/269 (11%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
Q +DHF + T Q Y +N +W + GP+FLY G EG + F+V G ++A
Sbjct: 156 IHQPVDHFDRQNDKTLPQTYFVNDVYWQRSD--GPVFLYIGGEGPLSKFSVLFGHHVEMA 213
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS-A 191
R GA+LV EHR+YGES+ E L L+++QALAD A F + Q S +
Sbjct: 214 ERHGALLVALEHRFYGESINPDGLET-----DKLRDLSSQQALADLAAFHHYISQRFSLS 268
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR 251
+ + FGGSY G L+AW+R K+PH+ GA+ASSAP+ D S + R
Sbjct: 269 HRNTWISFGGSYAGALSAWLRGKFPHLIYGAVASSAPVYAVLDF-----------SSYNR 317
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES-AYSYLAMV 310
C +K ++ + + N E+ K F C ED + L S A ++ V
Sbjct: 318 ----CVAEVKGAFAAVEAALLMGNE-TEVGKEFGCCETPLKLEDKTELLHSLADVFMGTV 372
Query: 311 DYPYPSDFMMPLPGYPIREVCKKIDNAPD 339
Y + I E+C + N D
Sbjct: 373 QYNEQG------VAFSIAELCDIMTNKSD 395
>gi|32564813|ref|NP_498758.2| Protein K12H4.7, isoform a [Caenorhabditis elegans]
gi|13638618|sp|P34528.2|YM67_CAEEL RecName: Full=Putative serine protease K12H4.7; Flags: Precursor
gi|351063156|emb|CCD71198.1| Protein K12H4.7, isoform a [Caenorhabditis elegans]
Length = 510
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 80/245 (32%), Positives = 119/245 (48%), Gaps = 16/245 (6%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVWD 130
F Q LDHF + TF QRY N + W GP FL G EG W + + +
Sbjct: 63 FTQTLDHFDSSVGKTFQQRYYHN-NQWYKAG--GPAFLMLGGEGPESSYWVSYPGLEITN 119
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
+A + GA + EHR+YGE+ P V L YL++ QA+ D A FI +
Sbjct: 120 LAAKQGAWVFDIEHRFYGETHPTSDMSVP-----NLKYLSSAQAIEDAAAFIKAMTAKFP 174
Query: 191 AEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
A+ V FGGSY G LAAW R K+P + A+ SS P+ D + + +V +
Sbjct: 175 QLANAKWVTFGGSYSGALAAWTRAKHPELVYAAVGSSGPVQAEVDF---KEYLEVVQNSI 231
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN-STEDLADWLESAYS-YL 307
R S C ++ + + + S+ Q +G +L FHLC+++ + L + E+ YS Y+
Sbjct: 232 TRNSTECAASVTQGFNLVASLLQTSDGRKQLKTAFHLCQDIQMDDKSLKYFWETVYSPYM 291
Query: 308 AMVDY 312
+V Y
Sbjct: 292 EVVQY 296
>gi|146425171|emb|CAM84574.1| intestinal prolyl carboxypeptidase 2 [Haemonchus contortus]
Length = 1143
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 92/281 (32%), Positives = 124/281 (44%), Gaps = 30/281 (10%)
Query: 26 LSPLSLAAQPSKFRRAPRFVGKLP----------HLTEPPQ----RQQRQQQQQYRYETR 71
L L A +P+K +R +L HL PP +YE
Sbjct: 546 LEGLKNAFKPTKQQRQRNSANELTPEESRILRRVHLGRPPHGLFPDPDPLPDMPVQYEAG 605
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVW 129
YF Q +DHF+ + TF QRY N D W PN GPIFL G E D W +
Sbjct: 606 YFTQPVDHFNNKNPYTFEQRYFKN-DQWAKPN--GPIFLMIGGESERDSSWVLNENLTYL 662
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT-LSYLTAEQALADFAVFITNLKQN 188
A FGA + E RYYG+S + S + A T +YL++ Q L D A FI +
Sbjct: 663 KWADEFGATVYALEXRYYGKSDLFDSLDPAVSKKNTYTTYLSSLQMLYDVANFIRAVDAE 722
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI---LQFEDIVPPETFYNIV 245
+ ++FGGSY G LA WMR +P + GA+ SSAP+ L F D +Y +V
Sbjct: 723 -RGQHGKWIMFGGSYAGSLALWMRRLFPDLVYGAIGSSAPLEAKLDFYD------YYQVV 775
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHL 286
+ S C I E + ++ E G +LT+ F L
Sbjct: 776 EKSIRSHSEDCAYAIAEGFDDIRQRLLTEKGRAQLTEIFKL 816
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 70/219 (31%), Positives = 102/219 (46%), Gaps = 17/219 (7%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVW 129
YF+Q+LDH T+ QRY + ++ + FL G G DI W
Sbjct: 53 YFKQKLDHTKDDGEGTWPQRYFYSQRYYRKGGNV--FFLMLGGMGVMDIGWVTNEKLPFV 110
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT--NLKQ 187
GA L EHR+YG+S P + V L+YLT +QA+ D A FI N K
Sbjct: 111 QWGKERGAQLYALEHRFYGKSRPTPNLSV-----RNLAYLTIDQAIGDVANFIKEMNAKH 165
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ E + ++FGGSY LA W R KYP++ GA+ASS + D F +
Sbjct: 166 RIXDEDAKWIVFGGSYAASLALWARQKYPNLIAGAVASSPLMRPRFDFWEGTQFAEDI-- 223
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENG---LLELTKT 283
+++ A+C I+ ++ +L + E G + EL KT
Sbjct: 224 -YRKTDATCAENIEIAFQQLADMLGSERGRSQVSELLKT 261
>gi|388499696|gb|AFK37914.1| unknown [Lotus japonicus]
Length = 390
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/192 (38%), Positives = 110/192 (57%), Gaps = 18/192 (9%)
Query: 48 LPHLTEPPQRQQRQQQQQYRYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNR 104
L + P R + + RY T+ +F Q LDH+S D F QRY D++ P+
Sbjct: 17 LSTVAATPHRHRLSETATDRYLTKQEQWFSQTLDHYSPYDHRKFQQRYYEFLDYFRIPD- 75
Query: 105 LGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNAT 164
GP+FL G+ + + ++ +A +FGA +V EHRYYG+S P+ S +A +N
Sbjct: 76 -GPVFLVIC--GEYSCNGIRNDYIAVLAKKFGAAVVSLEHRYYGKSSPFKS--LATKN-- 128
Query: 165 TLSYLTAEQALADFAVFITNLKQNLSAE------ASPVVLFGGSYGGMLAAWMRLKYPHI 218
L YL+++QAL D AVF N + +L+A+ +P +FG SY G L+AW RLK+PH+
Sbjct: 129 -LRYLSSKQALFDLAVFRQNYQDSLNAKLNRTKTENPWFVFGVSYPGALSAWFRLKFPHL 187
Query: 219 AIGALASSAPIL 230
G+LASSA +L
Sbjct: 188 TCGSLASSAVVL 199
>gi|195158018|ref|XP_002019891.1| GL12646 [Drosophila persimilis]
gi|194116482|gb|EDW38525.1| GL12646 [Drosophila persimilis]
Length = 473
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 77/247 (31%), Positives = 115/247 (46%), Gaps = 18/247 (7%)
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGN 114
P Q R Q T + EQ+LDHF + T+ RY++N + GP+F+Y G
Sbjct: 36 PSNQNRADVVQ----TLWIEQKLDHFDEDEKRTWQMRYMLNDALYQSG---GPLFIYLGG 88
Query: 115 EGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA 174
E +I + G ++D+A A+L + EHRYYGES P +++ +N + YL QA
Sbjct: 89 EWEISAGRITGGHIYDMAKEHNALLAYTEHRYYGESKPL--PDLSNEN---IQYLNVRQA 143
Query: 175 LADFAVFITNLKQNLSA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
L D AVFI LK S V++ GGSY + W + +P + G ASSAP+
Sbjct: 144 LEDLAVFIRTLKATHEGLSESKVIIVGGSYSATMVTWFKKVHPDLVAGGWASSAPLFAKV 203
Query: 234 DIVPPETFYNIVSSDFKRESAS-CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS 292
+ V + I S C+N I+ E+ ++ + G E+ LC +
Sbjct: 204 NFVE---YKEITGQSIALMGGSACYNRIESGIAEMETMFATKRG-AEVKALLKLCERFDV 259
Query: 293 TEDLADW 299
DL W
Sbjct: 260 YSDLDVW 266
>gi|125778538|ref|XP_001360027.1| GA17650 [Drosophila pseudoobscura pseudoobscura]
gi|54639777|gb|EAL29179.1| GA17650 [Drosophila pseudoobscura pseudoobscura]
Length = 473
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 77/247 (31%), Positives = 115/247 (46%), Gaps = 18/247 (7%)
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGN 114
P Q R Q T + EQ+LDHF + T+ RY++N + GP+F+Y G
Sbjct: 36 PSNQNRADVVQ----TLWIEQKLDHFDEDEKRTWQMRYMLNDALYQSG---GPLFIYLGG 88
Query: 115 EGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA 174
E +I + G ++D+A A+L + EHRYYGES P +++ +N + YL QA
Sbjct: 89 EWEISAGRITGGHIYDMAKEHNALLAYTEHRYYGESKPL--PDLSNEN---IQYLNVRQA 143
Query: 175 LADFAVFITNLKQNLSA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
L D AVFI LK S V++ GGSY + W + +P + G ASSAP+
Sbjct: 144 LEDLAVFIRTLKATHEGLSESKVIIVGGSYSATMVTWFKKVHPDLVAGGWASSAPLFAKV 203
Query: 234 DIVPPETFYNIVSSDFKRESAS-CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS 292
+ V + I S C+N I+ E+ ++ + G E+ LC +
Sbjct: 204 NFVE---YKEITGQSIALMGGSACYNRIESGIAEMETMFATKRG-AEVKALLKLCERFDV 259
Query: 293 TEDLADW 299
DL W
Sbjct: 260 YSDLDVW 266
>gi|194744685|ref|XP_001954823.1| GF16549 [Drosophila ananassae]
gi|190627860|gb|EDV43384.1| GF16549 [Drosophila ananassae]
Length = 489
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 154/310 (49%), Gaps = 24/310 (7%)
Query: 20 IVIISILSPLSLAAQPSKFR-RAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLD 78
+ I+++L+PL+ AA+ + + + F+ L L R + + + R+ Q+LD
Sbjct: 8 LAILAVLAPLTAAARLGEPKPKVNAFIKTLRDLHRG--RPEEHIMTRAKAHERWITQKLD 65
Query: 79 HFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAM 138
+F + T+S R IN H+V + PIF+Y G E +I+ + +++ + DI +
Sbjct: 66 NFDDDNNATWSDRIYINEKHFVDGS---PIFIYLGGEWEIQSWDISNTLLADITKKHNGT 122
Query: 139 LVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVL 198
++ EHR++G+S+P T ++ +N Y QALAD I LK+ + S VV+
Sbjct: 123 IITTEHRFFGKSIPI--TPLSTENLE--KYQNVNQALADVINVIQTLKEEGKYKDSKVVI 178
Query: 199 FGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CF 257
G SY G +AAW+R YP I +G+ ASSAPI+ D + ++ +V ++ C+
Sbjct: 179 SGCSYSGAMAAWIRKLYPDIIVGSWASSAPIVAKVDF---KDYFKVVGESYQTLGGQYCY 235
Query: 258 NTIKES---WGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL---ESAYSYLAMVD 311
+ I + + +L + G+ + EL +LC + + W A + +
Sbjct: 236 DLIDNATSYYEDLFANGKGDQAKKEL----NLCDDFDPKNKRDRWQIFSTIANIFAGIAQ 291
Query: 312 YPYPSDFMMP 321
Y P+++ +P
Sbjct: 292 YQIPANYDIP 301
>gi|166240237|ref|XP_635876.2| peptidase S28 family protein [Dictyostelium discoideum AX4]
gi|165988498|gb|EAL62376.2| peptidase S28 family protein [Dictyostelium discoideum AX4]
Length = 476
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/296 (30%), Positives = 138/296 (46%), Gaps = 33/296 (11%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW--FAVNS 125
+ ++F Q LDHF+F + TF Q+Y +N D + GPI LY EG + ++ +
Sbjct: 39 FPAQWFTQTLDHFNFQNNQTFQQKYYVN-DQYYNYKNGGPIILYINGEGPVSSPPYSSDD 97
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G V A M+V EHR+YGES P+ +E+ +N L YL+ +QAL D A F+ +
Sbjct: 98 GVVI-YAQALNCMIVTLEHRFYGESSPF--SELTIEN---LQYLSHQQALEDLATFVVDF 151
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ L A +V GGSY G L+AW R+KYPHI +G++ASS + D + + +
Sbjct: 152 QSKLVG-AGHIVTIGGSYSGALSAWFRIKYPHITVGSIASSGVVHSILDFTAFDAYVSYA 210
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
++ + E E + G +E + ++ + L D WL A S
Sbjct: 211 VGPECTKALQAVTSAAED--EYFAGGIREQQMKQILQA----ESLVDIGDFFYWL--ADS 262
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNV 361
+ Y Y + PL +D TS ++ I +VY NYT N
Sbjct: 263 MMEGDQYGYIDELCSPL-----------VDAINSGTSGIDLI----TVYSNYTINT 303
>gi|302807385|ref|XP_002985387.1| hypothetical protein SELMODRAFT_122090 [Selaginella moellendorffii]
gi|300146850|gb|EFJ13517.1| hypothetical protein SELMODRAFT_122090 [Selaginella moellendorffii]
Length = 481
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 83/251 (33%), Positives = 128/251 (50%), Gaps = 30/251 (11%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+ ++ Q LDH++ D TF+QRY TD++ PN GP+FL EG + + +
Sbjct: 36 DAHWYTQTLDHYATQDDRTFAQRYYEFTDYFDAPN--GPVFLKICGEGTC--VGIQNDYS 91
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------I 182
+A RFGA +V EHRYYG+S P+ S L YL+++QAL D A F +
Sbjct: 92 AVLAKRFGAAIVSLEHRYYGQSSPFKS-----HATENLIYLSSKQALFDLAAFREYYQDL 146
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA---PILQFEDIVPPE 239
N + N +++ +P ++ GGSY G L+AW +LK+PH+A+G++ASS I +F
Sbjct: 147 INHRTNSTSD-NPWIVMGGSYSGALSAWFKLKFPHLAVGSVASSGVVQAIFKFTKFDEQA 205
Query: 240 TFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED---- 295
+ + S A+C ++ + L G KEN + TK +L+ D
Sbjct: 206 RLFLVAES----AGATCSAALR-AVTRLAEQGLKENSV--STKALFNAEQLDVDGDFLYL 258
Query: 296 LADWLESAYSY 306
LAD A+ Y
Sbjct: 259 LADAAAIAFQY 269
>gi|256574605|dbj|BAH98108.1| serine protease [Entamoeba invadens]
Length = 465
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 123/244 (50%), Gaps = 15/244 (6%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
ET + +DHF+ + F +Y ++ + G + P+F+ G EG +++ ++
Sbjct: 37 ETHTYSVPMDHFNANNDEEFEVKYFVSEKYLDGTDLHSPLFVMLGGEGPESSKTLDNHYI 96
Query: 129 WD-IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
D +A R +++ EHR+YG+S P + L Y TAEQA+ D+ IT +++
Sbjct: 97 IDTLAARTNGLMLAIEHRFYGDSTPSLKMD-------KLIYCTAEQAMMDYIEIITYIQE 149
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ PV++ GGSY G LAAWMR KYP++ GA ASSAP+ E V + +V +
Sbjct: 150 TRNFIDHPVIVIGGSYSGNLAAWMRQKYPNVVDGAWASSAPV---EAQVDFYQYLEVVQA 206
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYL 307
+A + E W ++ E+G EL K F+ C + +D+ + E+ + L
Sbjct: 207 GLPANTADLLSIAFEKWDQMTVT---ESGRKELKKVFNTCTDFGE-DDIQTFAETIGTAL 262
Query: 308 AMVD 311
A D
Sbjct: 263 AGTD 266
>gi|195062810|ref|XP_001996258.1| GH22292 [Drosophila grimshawi]
gi|193899753|gb|EDV98619.1| GH22292 [Drosophila grimshawi]
Length = 480
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 82/264 (31%), Positives = 122/264 (46%), Gaps = 21/264 (7%)
Query: 40 RAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHW 99
R P V L L P Q ++ ETR+ Q+LD+F + + R LIN D++
Sbjct: 25 RIPASVRTLNELHRGPPMQLISKRA--NVETRWISQKLDNFDEGNEEVWDDRVLINEDYF 82
Query: 100 VGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVA 159
V + PIF+Y G E +IE + +G DIA LV+ EHR++G+S+P A
Sbjct: 83 VDGS---PIFIYLGGEWEIEPSPITAGHWVDIASEHNGSLVYTEHRFFGQSVPIKPLTTA 139
Query: 160 YQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIA 219
L Y EQALAD I LK+ + S VV+ G SY +A W++ YP +
Sbjct: 140 -----NLKYQNVEQALADVVNVINVLKEEEKYKNSKVVVQGCSYSATMAVWIKKLYPDVI 194
Query: 220 IGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKES---WGELVSVGQKEN 275
+G+ ASSAP+ D + + +V ++ C+N I + + +L GQ
Sbjct: 195 VGSWASSAPLQAKVDF---KAYMKVVGQAYRELGGDYCYNIIDNATSFYEDLFENGQN-- 249
Query: 276 GLLELTKTFHLCRELNSTEDLADW 299
E K +LC N + W
Sbjct: 250 --AEAKKLLNLCDNFNENDQHDQW 271
>gi|167390072|ref|XP_001739195.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165897172|gb|EDR24409.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 480
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 76/233 (32%), Positives = 120/233 (51%), Gaps = 15/233 (6%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD-IAPRF 135
LDHF+ + F Y +N + + P+F+ G EG + + +V D +A +
Sbjct: 46 LDHFNANNQNDFDIHYFVNKEFLDTNDPNAPLFVLLGGEGPASPKVLQNNYVIDSLAKKH 105
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
+++ EHR+YG S P S E+ L Y TAEQAL D+ I+++++ + P
Sbjct: 106 KGLMLSVEHRFYGASTP--SLEM-----DKLIYCTAEQALMDYVEVISHVQEENNLVGHP 158
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V++ GGSY G LAAWMR KYP++ GA ASSAP+ E +V + +V + + +A
Sbjct: 159 VIVLGGSYSGNLAAWMRQKYPNVVEGAWASSAPV---EAVVDFYQYLEVVQNALPKNTAD 215
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLA 308
+ E W E+ + E G EL K F+ C E +D+ + ES + L+
Sbjct: 216 LLSFAFEKWDEMTTT---EEGRKELGKIFNTCTEFGE-KDIQTFAESIGTALS 264
>gi|66800673|ref|XP_629262.1| hypothetical protein DDB_G0293014 [Dictyostelium discoideum AX4]
gi|60462657|gb|EAL60859.1| hypothetical protein DDB_G0293014 [Dictyostelium discoideum AX4]
Length = 486
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 83/252 (32%), Positives = 132/252 (52%), Gaps = 27/252 (10%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFV 128
++F Q +DHF+ A+ TF QRYLIN +W G GP+F+ EG DI V
Sbjct: 52 QWFTQSVDHFNPANPTTFQQRYLINDQYWDGT---GPVFIMINGEGPMDINTVTQLQFVV 108
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
W A + A++V EHRYYG S + + +++ +N L +L + QALAD AVF + Q
Sbjct: 109 W--AKQVSALVVSLEHRYYGAS--FVTEDLSLEN---LQWLNSAQALADNAVFRNFVAQQ 161
Query: 189 LSA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ + S + FGGSY G L +W R+KYPH+ +ASSAP + P FY + +
Sbjct: 162 YNVPKESKWISFGGSYSGALTSWFRIKYPHLVDATIASSAP------VNPEVNFYQYLET 215
Query: 248 ------DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLE 301
K C I + ++ ++ ++N + + F+LC L + D+A +++
Sbjct: 216 VQTALLASKSNGNLCVENINIATQKIQALLSQDN-YGGVDQMFNLCTPLGNQNDVATFMQ 274
Query: 302 S-AYSYLAMVDY 312
S A +++ +V Y
Sbjct: 275 SLAGNFMGVVQY 286
>gi|15233057|ref|NP_189509.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|332643953|gb|AEE77474.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 199
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 68/198 (34%), Positives = 100/198 (50%), Gaps = 23/198 (11%)
Query: 193 ASPVVLF----GGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
AS ++LF G+ +LAAW +LKYP+IA+GALASSAP+L FED +P ++ IV+
Sbjct: 6 ASKIILFYFQFHGAVHKVLAAWFKLKYPYIALGALASSAPLLYFEDTLPKHGYFYIVTKV 65
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLA 308
FK S C N I +SW E+ + K N L L+K F LC LN +L +
Sbjct: 66 FKEMSKECHNKIHKSWDEIDRIAAKPNSLSILSKNFKLCNPLNDIIELKSY--------- 116
Query: 309 MVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS--ILERIFEGVSVYYNYTGNVDCFQL 366
V Y Y + + +C+ I+ +P T +L++IF GV GN+ C+ +
Sbjct: 117 -VSYIYARTAQYSDNQFSVARLCEAINTSPPNTKSDLLDQIFAGVVA---SRGNISCYGM 172
Query: 367 DDDPHGLD----GWNWQV 380
+ + W WQ
Sbjct: 173 SSPSYQMTNDDRAWGWQT 190
>gi|326520173|dbj|BAK04011.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 489
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 107/206 (51%), Gaps = 25/206 (12%)
Query: 32 AAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQR 91
+A+P F PR G L+ P R + E R+ QRLDHFS +D F QR
Sbjct: 21 SAEPLGFSHRPRTAGG--ELSAAPSRYLAR-------EERWMSQRLDHFSSSDHRQFKQR 71
Query: 92 YLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESM 151
Y D+ P GP+FL E + + + ++ IA +FGA +V PEHRYYG+S
Sbjct: 72 YFEFLDYHDDPT--GPVFLRICGESSCD--GIPNDYLAVIAKKFGAAVVTPEHRYYGKSS 127
Query: 152 PYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE-------ASPVVLFGGSYG 204
P+ S L +L+++QAL D AVF ++ L++ +P +FG S
Sbjct: 128 PFDSL-----TTDNLRFLSSKQALFDLAVFRQYYQEKLNSRYNRSAGFDNPWFVFGVSCS 182
Query: 205 GMLAAWMRLKYPHIAIGALASSAPIL 230
G L+AW RLK+PH+ G+LASS +L
Sbjct: 183 GALSAWFRLKFPHLTCGSLASSGVVL 208
>gi|297734877|emb|CBI17111.3| unnamed protein product [Vitis vinifera]
Length = 149
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 58/136 (42%), Positives = 82/136 (60%), Gaps = 20/136 (14%)
Query: 206 MLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWG 265
+LA+W+RLKYPH+A+GALASSAPIL F+DI P I C+NTI+ESW
Sbjct: 32 LLASWLRLKYPHVALGALASSAPILYFDDITPQNEASEI-----------CYNTIRESWS 80
Query: 266 ELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGY 325
E+ V + +GL L+K F C EL+++++L D+L+ YS A ++P P Y
Sbjct: 81 EIDKVASEPDGLSILSKKFRTCTELSTSDELKDYLDETYSVAAQYNHP---------PRY 131
Query: 326 PIREVCKKIDNAPDAT 341
P+ VC ID AP+ +
Sbjct: 132 PVTVVCGGIDGAPEGS 147
>gi|297735044|emb|CBI17406.3| unnamed protein product [Vitis vinifera]
Length = 487
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 84/222 (37%), Positives = 116/222 (52%), Gaps = 41/222 (18%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F Q +DHFS D F QRY TD++ P+ GPIFL E + + + ++ +
Sbjct: 42 WFNQTVDHFSPLDHSKFPQRYYEFTDYFRLPD--GPIFLKICGEASCD--GIPNDYISVL 97
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF---------- 181
A +FGA +V EHRYYG+S P+ S L YL+++QAL D AVF
Sbjct: 98 AKKFGAAVVSLEHRYYGKSSPFRSLRT-----ENLKYLSSKQALFDLAVFRQYYQAKVVP 152
Query: 182 ---ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPP 238
N+K N S +P +FG SY G L+AW RLK+PH+ G+LASSA +L
Sbjct: 153 IGESLNVKVNRSNVENPWFVFGVSYAGALSAWFRLKFPHLTCGSLASSAVVL-------- 204
Query: 239 ETFYNIVSSDFKRESASCFNTIKESWG-ELVSVGQKENGLLE 279
YN ++F R+ I ES G E +V Q+ NGL+E
Sbjct: 205 -AIYNF--TEFDRQ-------IGESAGAECKAVLQEVNGLVE 236
>gi|308485479|ref|XP_003104938.1| hypothetical protein CRE_24446 [Caenorhabditis remanei]
gi|308257259|gb|EFP01212.1| hypothetical protein CRE_24446 [Caenorhabditis remanei]
Length = 510
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 86/289 (29%), Positives = 138/289 (47%), Gaps = 19/289 (6%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVWD 130
F Q LDHF + TF QRY N + W GP FL G EG W + + +
Sbjct: 63 FTQTLDHFDSSVGKTFKQRYWHN-NQWYKDG--GPAFLMLGGEGPESSYWVSYPGLEMTN 119
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
+A + GA + EHR+YGE+ P V + L YL++ QA+ D A FI +
Sbjct: 120 LAAKQGAWVFDIEHRFYGETKPTSDMSV-----SNLKYLSSAQAIEDAAAFIKAMTAQYP 174
Query: 191 AEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
A+ V FGGSY G LAAW R K+P + A+ SS P+ D + + +V +
Sbjct: 175 QLANARWVTFGGSYSGALAAWTRAKHPELVYAAVGSSGPVQAEVDF---KEYLEVVQNSI 231
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN-STEDLADWLESAYS-YL 307
R S C ++ + + S+ + +G +L FHLC+++ + L + E+ YS Y+
Sbjct: 232 SRNSTDCATSVTAGFNLVASLLKTTDGRKQLKTAFHLCQDIQLDDKSLKYFWETVYSPYM 291
Query: 308 AMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYN 356
+V Y + +C+ +N AT+ ++++ + V+ Y+N
Sbjct: 292 EVVQYSGDAAGSFATQLTISHAICRYHNN--KATTTIQKMKQ-VNDYFN 337
>gi|323453612|gb|EGB09483.1| hypothetical protein AURANDRAFT_24425 [Aureococcus anophagefferens]
Length = 266
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 72/222 (32%), Positives = 108/222 (48%), Gaps = 22/222 (9%)
Query: 108 IFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLS 167
++ Y GNE +E + N+G +W+ GA+LV+ EHRY G S+P L+
Sbjct: 2 VWFYTGNESPVEEYVNNTGLMWETGAELGALLVWAEHRYEGGSVP-----ACAGLRDCLA 56
Query: 168 YLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
Y + EQALAD+AV I L+ + P V GGSYGGML++W R KYP +GA+A SA
Sbjct: 57 YASVEQALADYAVVIDALRAEVGD--VPFVAVGGSYGGMLSSWFRFKYPTAVVGAIAGSA 114
Query: 228 PILQFEDIVPPETFYNIVSSDFKRESAS-----CFNTIKESWGELVSVGQKEN------- 275
P+ F PP + S + C + ++ +W L + +
Sbjct: 115 PVWGFPLDAPPLDGSAVAISRGAGFAGGLPDDRCASNLRAAWPVLEDFARTASPAADGTT 174
Query: 276 GLLELTKTFHLCRELNSTEDLADWLESAYSY---LAMVDYPY 314
L + F LC L S +D+A+ ++S LA +YP+
Sbjct: 175 ALEAVAAAFDLCEPLASPDDVAELVQSVQGVFFDLAEANYPF 216
>gi|118487876|gb|ABK95760.1| unknown [Populus trichocarpa]
Length = 485
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 69/171 (40%), Positives = 96/171 (56%), Gaps = 15/171 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F Q LDHFS D F QRY D++ + GPIFL E + + ++ +
Sbjct: 47 WFNQTLDHFSPFDHHKFPQRYYEFLDYFRISD--GPIFLEICGESSCN--GIVNDYISVL 102
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ITNL 185
A +FGA +V EHRYYG S+P+ ST L +L+++QAL D AVF NL
Sbjct: 103 AKKFGAAVVSLEHRYYGRSLPFKST-----TTENLRFLSSKQALFDLAVFRQYYQESLNL 157
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
K N ++ +P +FGGSY G L+AW RLK+PH+ G+LASSA +L +
Sbjct: 158 KLNRTSVENPWFVFGGSYAGALSAWFRLKFPHLTCGSLASSAVVLAIHNFT 208
>gi|255581402|ref|XP_002531509.1| catalytic, putative [Ricinus communis]
gi|223528862|gb|EEF30863.1| catalytic, putative [Ricinus communis]
Length = 482
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 69/165 (41%), Positives = 95/165 (57%), Gaps = 15/165 (9%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F Q LDH+S D F QRY D++ P GPIFL E A + ++ +
Sbjct: 44 WFNQILDHYSPYDHRRFQQRYYEYLDYFRAPG--GPIFLKICGESSCNGIA--NDYISVL 99
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ITNL 185
A +FGA +V EHRYYG+S P+ S+E L YL+++QAL D AVF NL
Sbjct: 100 AKKFGAAVVSLEHRYYGKSTPFKSSET-----KNLRYLSSKQALFDLAVFRQHYQEALNL 154
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
K N + +P ++FG SY G L+AW RLK+PH+ G++ASSA +L
Sbjct: 155 KLNRTNVENPWIVFGISYSGALSAWYRLKFPHLTCGSVASSAVVL 199
>gi|183230234|ref|XP_656762.2| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
gi|169802986|gb|EAL51377.2| serine carboxypeptidase (S28) family protein [Entamoeba histolytica
HM-1:IMSS]
Length = 480
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 75/233 (32%), Positives = 120/233 (51%), Gaps = 15/233 (6%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD-IAPRF 135
LDHF+ + F +Y +N + P+F+ G EG + + +V D +A +
Sbjct: 46 LDHFNANNQNDFDIQYFVNKKFLDANDPNAPLFVLLGGEGPASPKVLQNNYVIDSLAKKH 105
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
+++ EHR+YG S P S E+ L Y TAEQAL D+ I+++++ + P
Sbjct: 106 KGLMLSVEHRFYGASTP--SLEM-----DKLIYCTAEQALMDYVEVISHVQEENNLVGHP 158
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V++ GGSY G LAAWMR KYP++ GA ASSAP+ E +V + +V + + +A
Sbjct: 159 VIVLGGSYSGNLAAWMRQKYPNVVEGAWASSAPV---EAVVDFYQYLEVVQNALPKNTAD 215
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLA 308
+ E W ++ + E G EL K F+ C E +D+ + ES + L+
Sbjct: 216 LLSFAFEQWDKMTTT---EEGRKELGKIFNTCTEFGE-KDIQTFAESIGTALS 264
>gi|357612135|gb|EHJ67826.1| putative thymus-specific serine protease [Danaus plexippus]
Length = 494
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 84/252 (33%), Positives = 133/252 (52%), Gaps = 23/252 (9%)
Query: 70 TRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGF 127
+YF Q+LDH S D + QRY +N + + N GP+FL G EG D W G
Sbjct: 48 NKYFLQKLDHSSPTDQRYWEQRYFVN-ESFYDFNNPGPVFLMIGGEGTADPRWMV--KGT 104
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT--NL 185
D A F A+ + EHRYYG+S P + +++ +N L YL++ QALAD A FI N
Sbjct: 105 WIDYAIHFKALCILLEHRYYGQSRP--TMDLSVKN---LQYLSSYQALADLAYFINAMNN 159
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K + + VV FGGSY G LAAWMRLKYPH+ A++SS P++ + + ++ +V
Sbjct: 160 KYKFNKDVKWVV-FGGSYPGSLAAWMRLKYPHLVHAAVSSSGPLVAKVNFM---EYFQVV 215
Query: 246 SSDFKRESAS--CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN--STEDLADWLE 301
+ + ++ C +K + ++ + + + +E + F +C + S D+ ++
Sbjct: 216 VNALREKTGGEECVGQVKLAHKQIQEIIKTDPATIE--REFRVCEPFSKASQNDMKNFYN 273
Query: 302 S-AYSYLAMVDY 312
S A + +V Y
Sbjct: 274 SIADDFADLVQY 285
>gi|224108896|ref|XP_002333330.1| predicted protein [Populus trichocarpa]
gi|222836225|gb|EEE74646.1| predicted protein [Populus trichocarpa]
Length = 490
Score = 114 bits (286), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 69/165 (41%), Positives = 95/165 (57%), Gaps = 15/165 (9%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F Q LDH+S D F QRY D++ P+ GPIFL E + + + ++ +
Sbjct: 52 WFNQTLDHYSPFDHHKFQQRYYEFLDYFRVPD--GPIFLKICGESSCD--GIANDYIGVL 107
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ITNL 185
A +FGA +V EHRYYG+S P+ ST L YL+++QAL D AVF NL
Sbjct: 108 AKKFGAAVVSLEHRYYGKSSPFKST-----TTENLRYLSSKQALFDLAVFRQYYQESLNL 162
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
K N + +P +FG SY G L+AW RLK+PH+ G+LASSA +L
Sbjct: 163 KLNRTGVENPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSAVVL 207
>gi|308478046|ref|XP_003101235.1| hypothetical protein CRE_14134 [Caenorhabditis remanei]
gi|308263940|gb|EFP07893.1| hypothetical protein CRE_14134 [Caenorhabditis remanei]
Length = 1064
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 78/220 (35%), Positives = 116/220 (52%), Gaps = 18/220 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN-SGF 127
E F QR+DHF+ + F Q+Y N+ + P GP FL G EG VN +
Sbjct: 576 EIGMFRQRIDHFNNKNTKFFQQKYFKNS-RFARPG--GPNFLMIGGEGPEYGHDVNLNSS 632
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+ A +G + EHR+YG+S+ +N T LS L++ Q L D A FI ++
Sbjct: 633 IMRRAEEYGGTVYVLEHRFYGDSV--------VENNTDLSTLSSLQMLYDLAEFIKSV-- 682
Query: 188 NLSAEAS-PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
N +E S P + FGGSY G L+AWMR +P + IGA+ASSAP+L D + +V
Sbjct: 683 NFKSETSNPWITFGGSYPGALSAWMREIFPDLVIGAIASSAPVLAKTDFYE---YMMVVE 739
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHL 286
+ F R ++C+ IK + E+ + Q ++G +L+ F L
Sbjct: 740 NSFLRYDSACYQEIKNGFDEIHELFQTDSGREKLSVLFKL 779
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 97/220 (44%), Gaps = 17/220 (7%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI-EWFAVNSGF-VW 129
Y Q LD S F QRY ++H+ + FLY D E N G +
Sbjct: 48 YLIQPLDPLSPESTKKFQQRYRY-SEHFTSNKKTA--FLYVSGRDDFNEAVLKNDGSPLV 104
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT--LSYLTAEQALADFAVFITNLKQ 187
A RFGA + EHRYYG S P ++N T+ L +L + A+ D FI +
Sbjct: 105 KAAERFGATIFALEHRYYGNSTP------NFENFTSESLQHLDSYHAIQDVIFFIEHANT 158
Query: 188 NLSAEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
++ VLFG YGG++AA R P G +A SAPI + D + N V
Sbjct: 159 QFKMDSDVRWVLFGSGYGGIIAAETRKWDPITVSGVVAISAPIEREMDFWK---YNNKVE 215
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHL 286
+ +SC+N IK+ + ++ + G EL+ F L
Sbjct: 216 KTIMKYDSSCYNKIKKGFRQVQDLMNFSEGRNELSDLFEL 255
>gi|328718791|ref|XP_001947511.2| PREDICTED: putative serine protease K12H4.7-like [Acyrthosiphon
pisum]
Length = 500
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 74/222 (33%), Positives = 113/222 (50%), Gaps = 14/222 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINT-DHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
+F+Q+LDHF+ D T+ QRY ++ H +G GP+F+ G E I + G + +
Sbjct: 55 FFDQKLDHFNPTDNRTWKQRYQSHSLHHKIG----GPVFMLLGGEEKISNAWLKDGSMME 110
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A +F AM EHRYYG+S P + N T L YL+ +QALAD A FI QN
Sbjct: 111 YAEKFNAMCFQLEHRYYGDSYPTDNL-----NTTNLKYLSIKQALADVAEFIKVKSQN-P 164
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+LFGGSY G LAAW R YP++ A++SS+ I D + ++ +
Sbjct: 165 LYKGKWILFGGSYPGSLAAWARKTYPNLVHAAVSSSSVIKTRIDNID---YFKVAEKALT 221
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS 292
+ C + I+++ + + ENG + F +C +N+
Sbjct: 222 DYNPKCVSNIRQATMMISDLLDSENGTKYVQSKFKVCYRINT 263
>gi|348523600|ref|XP_003449311.1| PREDICTED: thymus-specific serine protease-like [Oreochromis
niloticus]
Length = 641
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 97/354 (27%), Positives = 165/354 (46%), Gaps = 41/354 (11%)
Query: 25 ILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFAD 84
IL + A+ ++ ++ + + KL P Q ++ + Q LDHF+ +
Sbjct: 23 ILRTIKRRARDNQLQKNKQLLLKLTVSLHQPLHQVKEG---------WIVQPLDHFNQQN 73
Query: 85 LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
TF QR+ +N +W + GP+FLY G EG + + V +G D+A GA+L+ EH
Sbjct: 74 SNTFPQRFFVNEAYW--QHHDGPVFLYIGGEGPLVEYDVLTGHHSDMAEEHGALLLALEH 131
Query: 145 RYYGESM-PYG-STEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS-AEASPVVLFGG 201
R+YG+S+ P G TE L+ L+++QALAD A F + Q+ + + + FGG
Sbjct: 132 RFYGDSINPDGLKTE-------NLAGLSSQQALADLATFHQYISQSFNLTHRNTWISFGG 184
Query: 202 SYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRE----SASCF 257
SY G L+AW R K+P++ GA+ASSAP+ D + N+V E S C
Sbjct: 185 SYSGALSAWFRGKFPNLVHGAVASSAPVKAKLDF---SEYNNVVGLSLLNEAVGGSEKCL 241
Query: 258 NTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES-AYSYLAMVDYPYPS 316
+ +++++ + N + ++ F C+ D + ++S A ++ V Y
Sbjct: 242 SKVRQAFAAVKEALMSGN-INQVASDFGCCQIPKDPYDQIELMQSLADIFMGAVQYNEEG 300
Query: 317 DFMMPLPGYPIREVCKKIDNAP----DATSILERIFEGVSVYYNYTGNVDCFQL 366
M I E+C + N+ D R+ + +S Y +T C +
Sbjct: 301 VLM------SINELCGIMTNSSQEYQDEMEAYNRLVK-LSQIYRFTSKEPCLDI 347
>gi|158298288|ref|XP_318471.4| AGAP004014-PA [Anopheles gambiae str. PEST]
gi|157014452|gb|EAA13580.4| AGAP004014-PA [Anopheles gambiae str. PEST]
Length = 469
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 91/275 (33%), Positives = 132/275 (48%), Gaps = 25/275 (9%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+FE R+DHF+ + F+ RY IN +H GPIF+ G I+ + G +DI
Sbjct: 27 WFETRVDHFNPRNQDKFAMRYYINDEHAYAR---GPIFIVVGAAEPIQTRWITEGLFYDI 83
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A GA L E RY+G S P + E L +L A+QALAD A +IT LK+ +
Sbjct: 84 AYLEGAYLFANELRYFGYSRPVENAET-----ENLDFLNADQALADLAEWITYLKETYTY 138
Query: 192 E-ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ V+L G +YGG LA W R KYPH+ G SS I + YN +
Sbjct: 139 NPNAKVILMGTAYGGALATWFRQKYPHLVDGVWVSSGAI----EADFAFAGYNEALGESI 194
Query: 251 RE--SASCFNTIKESW-GELVSVGQKENGLLELTKT-FHLCRELNSTEDLADWLESAYSY 306
R+ S +C++TI W G V+ GL EL T FHLC L++ DL ++
Sbjct: 195 RQYGSDACYSTI---WSGFRVAQNMAHLGLAELLSTEFHLCEPLDTDNDL-----DVRAF 246
Query: 307 LAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDAT 341
L + + + IRE+C +++ D++
Sbjct: 247 LLGLRDDIEFEMLHLRNINSIREMCAEMEQQRDSS 281
>gi|308808360|ref|XP_003081490.1| Prolylcarboxypeptidase (angiotensinase C) (ISS) [Ostreococcus
tauri]
gi|116059953|emb|CAL56012.1| Prolylcarboxypeptidase (angiotensinase C) (ISS) [Ostreococcus
tauri]
Length = 542
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 76/239 (31%), Positives = 113/239 (47%), Gaps = 30/239 (12%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG------DIEWFAVN 124
R+F+Q LDHF D +SQRY +N P+F+ G EG + +
Sbjct: 59 RWFDQTLDHFDHVDRRRWSQRYFVNEGFVDKIEASTPVFVCVGGEGPALTARAVLDGGTH 118
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
G + D+A + + + EHR+YG S P G + +L YLT+ QAL D F+
Sbjct: 119 CGTMIDLAKKHRGIALALEHRFYGASQPTGDL-----SRESLRYLTSAQALEDVVAFVKY 173
Query: 185 LKQNLSAEASP-----------VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
+ +P V+ FGGSY GMLAAW R+KYPH A+ASSAPI
Sbjct: 174 VADAYGLRTTPSDDGRNGSYSRVIAFGGSYPGMLAAWSRVKYPHAIHAAVASSAPIRAEL 233
Query: 234 DIVPPETFYNIVSSDFKRE----SASCFNTIKESW-GELVSVGQKENGLLELTKTFHLC 287
D+ +Y++V + + S +CF+ + E++ EL + G L F++C
Sbjct: 234 DM---RGYYDVVGKALREKDVGGSDACFDAVSETFESELNEALKTPEGRRALETRFNVC 289
>gi|268576170|ref|XP_002643065.1| Hypothetical protein CBG22982 [Caenorhabditis briggsae]
Length = 509
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 77/245 (31%), Positives = 119/245 (48%), Gaps = 16/245 (6%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVWD 130
F Q LDHF ++ TF QRY N + W GP FL G EG W + + +
Sbjct: 63 FTQTLDHFDSSNTKTFQQRYYHN-NQWYKDG--GPAFLMLGGEGPESSYWVSYPGLEITN 119
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL- 189
+A + GA + EHR+YGE+ P V L YL++ QA+ D A FI +
Sbjct: 120 LAAKQGAWVFDIEHRFYGETKPTSDMSVP-----NLKYLSSAQAIEDAATFIKAMTLKYP 174
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
+ + V FGGSY G LAAW R K+P + A+ SS P+ D + + +V +
Sbjct: 175 QLKNAKWVTFGGSYSGALAAWTRAKHPELVYAAVGSSGPVQAEVDF---KEYLEVVQNSI 231
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELN-STEDLADWLESAYS-YL 307
R S +C ++ + + + Q +G +L FH C+++ ++L + E+ YS Y+
Sbjct: 232 TRNSTACAASVTAGFNLVAQLLQTTDGRKQLKTAFHTCQDIQLDDKNLKYFWETVYSPYM 291
Query: 308 AMVDY 312
+V Y
Sbjct: 292 EIVQY 296
>gi|13634059|sp|P90893.2|YM9I_CAEEL RecName: Full=Putative serine protease F56F10.1; Flags: Precursor
gi|351062821|emb|CCD70865.1| Protein F56F10.1 [Caenorhabditis elegans]
Length = 540
Score = 114 bits (284), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 95/329 (28%), Positives = 153/329 (46%), Gaps = 27/329 (8%)
Query: 42 PRFVGKLP-------HLTEPPQRQQRQQQQQY-RYETRYFEQRLDHFSFADLPTFSQRYL 93
P F+G+L HL Q+ Y + F Q+LDHF + T++Q+Y
Sbjct: 20 PFFLGRLNGKTLLNHHLDRLTASDGASIQETYPNLQVHNFTQKLDHFDPYNTKTWNQKYF 79
Query: 94 INTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESM 151
N V IFL G EG + +W A + A FGA + EHR++G+S
Sbjct: 80 YNP---VFSRNNSIIFLMIGGEGPENGKWAANPNVQYLQWAKEFGADVFDLEHRFFGDSW 136
Query: 152 PYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWM 211
P + ++L YLT +QALAD A FI + Q + V FGGSY G LAAW
Sbjct: 137 P-----IPDMQTSSLRYLTTQQALADLAFFIEFMNQQYGFKNPRWVTFGGSYPGSLAAWF 191
Query: 212 RLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVG 271
R KYP + +G++ASSAP+ D + +V D + C K+++ ++ +
Sbjct: 192 RQKYPQLTVGSVASSAPVNLKLDFY---EYAMVVEDDLRITDPKCAQATKDAFVQMQKLA 248
Query: 272 QKENGLLELTKTFHLC--RELNSTE-DLADWLESAY-SYLAMVDYPYPSDFMMPLPGYPI 327
G L F+L + N+T+ D+ ++ + + +Y M Y Y +
Sbjct: 249 LTAEGRNSLNNHFNLQPPFDANTTKLDINNFFGNIFNTYQGMTQYTYDGQSNSTHSDKTV 308
Query: 328 REVCKKIDNAPDATSILERIFEGVSVYYN 356
R++C + NA + T ++ R+ E + +++N
Sbjct: 309 RKMCDIMTNATE-TDVVMRV-ENLFLWFN 335
>gi|66801433|ref|XP_629642.1| hypothetical protein DDB_G0292476 [Dictyostelium discoideum AX4]
gi|60463021|gb|EAL61217.1| hypothetical protein DDB_G0292476 [Dictyostelium discoideum AX4]
Length = 485
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 79/238 (33%), Positives = 126/238 (52%), Gaps = 27/238 (11%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG--FVWD 130
F Q++DHF+ D TF QR+++N+ +W G GP+F E ++E +VNS +W
Sbjct: 51 FVQKVDHFNLLDDRTFFQRFVVNSKYWNGT---GPVFFIISGEQNMEASSVNSCQYTIW- 106
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI---TNLKQ 187
A + A++V EHRYYG S Y + +++ N L YLT +QALAD VFI T +
Sbjct: 107 -AKQLNALIVSLEHRYYGGS--YVTEDLSTDN---LKYLTTQQALADCVVFIDWFTKVYY 160
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
++ + +S ++ FGGSY G L+A++ +KYP ++ASSAP + P FY +
Sbjct: 161 HVPS-SSKIISFGGSYAGTLSAYLAMKYPSKISFSVASSAP------LNPVVNFYQYMEV 213
Query: 248 DFK-----RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL 300
K C N IK + +++ + +TK F LC ++ DL+ ++
Sbjct: 214 IQKSILLLNNGEKCLNNIKLANNKIIEMIHDPILTYNITKLFGLCSNIDFDNDLSTFM 271
>gi|289163387|ref|YP_003453525.1| serine carboxypeptidase [Legionella longbeachae NSW150]
gi|288856560|emb|CBJ10365.1| Similar to eukaryotic serine carboxypeptidase S28 family protein
[Legionella longbeachae NSW150]
Length = 466
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 89/235 (37%), Positives = 121/235 (51%), Gaps = 32/235 (13%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
YF+Q +DH + A TFSQRY I D GP P+F Y E A+N G + +
Sbjct: 50 YFKQLIDHNNPAT-GTFSQRYYI--DETYGPKDDSPVFFYICGESACSKRALN-GAIRNY 105
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A +F A LV EHRYYGES+P S L YLT E AL D A F +++L++
Sbjct: 106 AQKFNAKLVALEHRYYGESLPLNSLST-----NDLRYLTTEAALDDLAYF----QRHLTS 156
Query: 192 EAS---PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
E + V FGGSY G L+A+ RLKYP++ +GALASSAP++ E+ + + V+
Sbjct: 157 EKNWHGKWVAFGGSYPGSLSAYYRLKYPYLVVGALASSAPVMAKENFIEYDAHVTQVA-- 214
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL---NSTEDLADWL 300
C N ++E V Q E L + T TF + L + ED D+L
Sbjct: 215 ----GLQCANQMRE------VVTQVETSLKD-TVTFAQIKSLFDAAAVEDPVDFL 258
>gi|270158373|ref|ZP_06187030.1| putative serine carboxypeptidase [Legionella longbeachae D-4968]
gi|269990398|gb|EEZ96652.1| putative serine carboxypeptidase [Legionella longbeachae D-4968]
Length = 465
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 89/235 (37%), Positives = 121/235 (51%), Gaps = 32/235 (13%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
YF+Q +DH + A TFSQRY I D GP P+F Y E A+N G + +
Sbjct: 49 YFKQLIDHNNPAT-GTFSQRYYI--DETYGPKDDSPVFFYICGESACSKRALN-GAIRNY 104
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A +F A LV EHRYYGES+P S L YLT E AL D A F +++L++
Sbjct: 105 AQKFNAKLVALEHRYYGESLPLNSLST-----NDLRYLTTEAALDDLAYF----QRHLTS 155
Query: 192 EAS---PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
E + V FGGSY G L+A+ RLKYP++ +GALASSAP++ E+ + + V+
Sbjct: 156 EKNWHGKWVAFGGSYPGSLSAYYRLKYPYLVVGALASSAPVMAKENFIEYDAHVTQVA-- 213
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL---NSTEDLADWL 300
C N ++E V Q E L + T TF + L + ED D+L
Sbjct: 214 ----GLQCANQMRE------VVTQVETSLKD-TVTFAQIKSLFDAAAVEDPVDFL 257
>gi|224097128|ref|XP_002310843.1| predicted protein [Populus trichocarpa]
gi|222853746|gb|EEE91293.1| predicted protein [Populus trichocarpa]
Length = 487
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 69/173 (39%), Positives = 96/173 (55%), Gaps = 17/173 (9%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F Q LDHFS D F QRY D++ + GPIFL E + + ++ +
Sbjct: 47 WFNQTLDHFSPFDHHKFPQRYYEFLDYFRISD--GPIFLEICGESSCN--GIVNDYISVL 102
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF--------IT 183
A +FGA +V EHRYYG S+P+ ST L +L+++QAL D AVF
Sbjct: 103 AKKFGAAVVSLEHRYYGRSLPFKST-----TTENLRFLSSKQALFDLAVFRHTIHMQESL 157
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
NLK N ++ +P +FGGSY G L+AW RLK+PH+ G+LASSA +L +
Sbjct: 158 NLKLNRTSVENPWFVFGGSYAGALSAWFRLKFPHLTCGSLASSAVVLAIHNFT 210
>gi|397479343|ref|XP_003810983.1| PREDICTED: thymus-specific serine protease [Pan paniscus]
Length = 514
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 89/268 (33%), Positives = 133/268 (49%), Gaps = 24/268 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HWVG + GPIFL G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSDRRSFLQRYWVNDQHWVGQD--GPIFLLLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P G E+A L +L++ ALAD L + N+
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRLALADVVSAHLALSRLFNI 173
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL------QFEDIVPPETFYN 243
S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ ++ D+V
Sbjct: 174 SS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDFSEYNDVVSRSLMST 232
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ + +A E L S G + L C L E+ A+ L +
Sbjct: 233 AIGGSLECRAAVSV-AFAEVERRLRSGGAAQAA---LQTELSACGPLGRAENQAELLGAL 288
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVC 331
+ + V Y PL +R++C
Sbjct: 289 QALVGGV-VQYDGQTGAPL---SVRQLC 312
>gi|114605978|ref|XP_001136226.1| PREDICTED: thymus-specific serine protease isoform 2 [Pan
troglodytes]
Length = 514
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 89/268 (33%), Positives = 133/268 (49%), Gaps = 24/268 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HWVG + GPIFL G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSDRRSFLQRYWVNDQHWVGQD--GPIFLLLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P G E+A L +L++ ALAD L + N+
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRLALADVVSAHLALSRLFNI 173
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL------QFEDIVPPETFYN 243
S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ ++ D+V
Sbjct: 174 SS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDFSEYNDVVSRSLMST 232
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ + +A E L S G + L C L E+ A+ L +
Sbjct: 233 AIGGSLECRAAVSV-AFAEVERRLRSGGAAQAA---LQTELSACGPLGRAENQAELLGAL 288
Query: 304 YSYLAMVDYPYPSDFMMPLPGYPIREVC 331
+ + V Y PL +R++C
Sbjct: 289 QALVGGV-VQYDGQTGAPL---SVRQLC 312
>gi|357140818|ref|XP_003571960.1| PREDICTED: probable serine protease EDA2-like [Brachypodium
distachyon]
Length = 503
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 69/168 (41%), Positives = 91/168 (54%), Gaps = 15/168 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E + Q LDHFS D F QRY D+ PN GP+FL E +++ ++
Sbjct: 51 EEHWMSQTLDHFSPTDHRQFKQRYYEFLDYHRVPN--GPVFLNICGESSCN--GISNSYL 106
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------I 182
IA +FGA LV PEHRYYG+S P+ S L +L+++QAL D AVF
Sbjct: 107 AVIAKKFGAALVSPEHRYYGKSSPFKSLTT-----ENLRFLSSKQALFDLAVFRQYYQET 161
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N K N S + +FGGSY G L+AW RLK+PH+ G+ ASS +L
Sbjct: 162 LNAKYNRSGADNSWFVFGGSYSGALSAWFRLKFPHLTCGSHASSGVVL 209
>gi|186516786|ref|NP_567998.3| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|145651776|gb|ABP88113.1| At4g36190 [Arabidopsis thaliana]
gi|332661229|gb|AEE86629.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 482
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 93/278 (33%), Positives = 135/278 (48%), Gaps = 30/278 (10%)
Query: 55 PQRQQRQQQQQYRYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
P+R + +Y TR +F Q LDH+S +D F QRY DH P+ GPIFL
Sbjct: 29 PRRISHGLSKSSKYLTRDELWFTQTLDHYSPSDHRKFRQRYYEYLDHLRVPD--GPIFLM 86
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
EG + + ++ +A +F A +V EHRYYG+S P+ S +A +N L YL++
Sbjct: 87 ICGEGPCN--GITNNYISVLAKKFDAGIVSLEHRYYGKSSPFKS--LATKN---LKYLSS 139
Query: 172 EQALADFAVF------ITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
+QAL+D A F N+K N S+ +P FG SY G L+AW RLK+PH+ G+LA
Sbjct: 140 KQALSDLATFRQYYQDSLNVKFNRSSNVENPWFFFGVSYSGALSAWFRLKFPHLTCGSLA 199
Query: 225 SSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF 284
SSA + + PE I S C ++E+ +L+ +G K N K
Sbjct: 200 SSAVVRAVYEF--PEFDQQIAES----AGPECETALQET-NKLLELGLKVNN--RAVKAL 250
Query: 285 HLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPL 322
EL+ D + A + + Y P +PL
Sbjct: 251 FNATELDVDADFLYLIADA--GVMAIQYGNPDKLCVPL 286
>gi|195109600|ref|XP_001999371.1| GI24473 [Drosophila mojavensis]
gi|193915965|gb|EDW14832.1| GI24473 [Drosophila mojavensis]
Length = 483
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 89/275 (32%), Positives = 131/275 (47%), Gaps = 24/275 (8%)
Query: 21 VIISILSPLSLAA-----QPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQ 75
V++++L PL+ AA S P FV L L P ++ ETR+F Q
Sbjct: 6 VVLALLVPLAPAAILEDIPGSDEMEIPAFVQTLNQLHRGPPLPPSTKRANV--ETRWFNQ 63
Query: 76 RLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRF 135
LD+F + +SQR +IN +++V + PIFL G E I+ ++ SG DIA
Sbjct: 64 SLDNFDDTNKSVWSQRVMINEENFVDGS---PIFLLLGGEWTIDPNSITSGLWVDIAKEH 120
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
LV+ EHR++G S+P + + L Y EQALAD I LK+ + S
Sbjct: 121 NGSLVYTEHRFFGGSIP-----ILPLSTENLKYHGVEQALADVVNVIKVLKEEDKYKNSK 175
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
VV+ G SY +A W++L YP + +G ASSA + E V F +V +++
Sbjct: 176 VVVSGCSYSASMAVWLKLLYPDVIVGGWASSAVL---EAKVDFSDFMEVVGRAYRQLGGD 232
Query: 256 -CFNTIKES---WGELVSVGQ--KENGLLELTKTF 284
C+N I + + L GQ K LL L +F
Sbjct: 233 YCYNLINNATSYYEHLFQTGQGAKAKKLLNLCDSF 267
>gi|56757429|gb|AAW26883.1| SJCHGC06819 protein [Schistosoma japonicum]
Length = 331
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 56/179 (31%), Positives = 96/179 (53%), Gaps = 4/179 (2%)
Query: 206 MLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF-KRESASCFNTIKESW 264
ML+AW+R KYP+ GA+ASSAP+ F + F + ++ F K +C I+ SW
Sbjct: 1 MLSAWIRQKYPNQIAGAIASSAPVWLFPGLSDCNGFSLVATNSFLKYGGDNCVKNIQHSW 60
Query: 265 GELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPG 324
+V +GQ +G LT F++C L +++ D+L ++MV+YPYP++F+ LP
Sbjct: 61 SNIVDIGQSFDGKELLTNMFNICTPLTDVQNIIDYLSDYLGTISMVNYPYPANFLGTLPA 120
Query: 325 YPIREVCKKIDNAPDATSILERI---FEGVSVYYNYTGNVDCFQLDDDPHGLDGWNWQV 380
+P++ +C + ++ RI + + NYTGN +C + G+D W++
Sbjct: 121 WPVKYLCSNLTVYDPQQPVVTRISLLAKAILALTNYTGNQNCLDISGSLPGIDAKAWEI 179
>gi|26449329|dbj|BAC41792.1| putative prolyl carboxypeptidase [Arabidopsis thaliana]
Length = 281
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 74/185 (40%), Positives = 104/185 (56%), Gaps = 19/185 (10%)
Query: 55 PQRQQRQQQQQYRYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
P+R + +Y TR +F Q LDH+S +D F QRY DH P+ GPIFL
Sbjct: 29 PRRISHGLSKSSKYLTRDELWFTQTLDHYSPSDHRKFRQRYYEYLDHLRVPD--GPIFLM 86
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
EG + + ++ +A +F A +V EHRYYG+S P+ S +A +N L YL++
Sbjct: 87 ICGEGPCN--GITNNYISVLAKKFDAGIVSLEHRYYGKSSPFKS--LATKN---LKYLSS 139
Query: 172 EQALADFAVF------ITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
+QAL+D A F N+K N S+ +P FG SY G L+AW RLK+PH+ G+LA
Sbjct: 140 KQALSDLATFRQYYQDSLNVKFNRSSNVENPWFFFGVSYSGALSAWFRLKFPHLTCGSLA 199
Query: 225 SSAPI 229
SSA +
Sbjct: 200 SSAVV 204
>gi|268576509|ref|XP_002643234.1| Hypothetical protein CBG08099 [Caenorhabditis briggsae]
Length = 540
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 82/284 (28%), Positives = 133/284 (46%), Gaps = 19/284 (6%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVWD 130
F Q+LDHF + T++Q+Y N + IFL G EG + W A
Sbjct: 69 FTQKLDHFDRYNTKTWNQKYFYNPKY---SRNNSIIFLMIGGEGPENGRWAAKPEVQYLQ 125
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A FGA + EHR++G+S P E ++L YLT +QALAD A FI ++ Q
Sbjct: 126 WASEFGADVFDLEHRFFGDSWPISDMET-----SSLQYLTTQQALADLAYFIESMNQKYG 180
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ V FGGSY G L+AW R KYP + +G++ASSAP+ D + +V D K
Sbjct: 181 FKNPRWVTFGGSYPGSLSAWFRQKYPELTVGSVASSAPVNLKLDFY---EYAMVVEDDLK 237
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE---DLADWLESAY-SY 306
C +++++ ++ + G L F+L ++ D+ ++ + + ++
Sbjct: 238 LTDPQCAPAVRDAFTKIQQMSLTAEGRNSLNTYFNLQPPFDAKTTKLDINNFFGNLFNTF 297
Query: 307 LAMVDYPYPSDFMMPLPGYPIREVCKKIDNA--PDATSILERIF 348
M Y Y +R++C+ + NA P+ +E +F
Sbjct: 298 QGMTQYTYDGQSNSTHSDKTVRKMCQIMTNATEPNTVKRVENLF 341
>gi|148676288|gb|EDL08235.1| dipeptidylpeptidase 7, isoform CRA_b [Mus musculus]
Length = 212
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/163 (41%), Positives = 91/163 (55%), Gaps = 32/163 (19%)
Query: 68 YETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVG--------PNRL------------ 105
+ YFEQ +DHF+F TF QR+L++ + W+ P +
Sbjct: 49 FHENYFEQYMDHFNFESFGNKTFGQRFLVSGECWLDRMGSYLRFPGLMETHLLPTDKFWK 108
Query: 106 ---GPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYG--STEVAY 160
GPIF Y GNEGDI FA NSGF+ ++A + A+LVF EHRYYG+S+P+G ST+ Y
Sbjct: 109 MGEGPIFFYTGNEGDIWSFANNSGFMVELAAQQEALLVFAEHRYYGKSLPFGVQSTQRGY 168
Query: 161 QNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSY 203
LT EQALADFAV + L+Q+L +P + FGG +
Sbjct: 169 TQ-----LLTVEQALADFAVLLQALRQDLGVHDAPTIAFGGRW 206
>gi|195391900|ref|XP_002054597.1| GJ22719 [Drosophila virilis]
gi|194152683|gb|EDW68117.1| GJ22719 [Drosophila virilis]
Length = 478
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 87/309 (28%), Positives = 140/309 (45%), Gaps = 30/309 (9%)
Query: 29 LSLAAQPSKFR---RAPRFVGKLPHLTE-PPQRQQRQQQQQYRYETRYFEQRLDHFSFAD 84
++LA+Q + + P FV L L PPQR + E ++ Q LDHF ++
Sbjct: 12 VALASQANSLKLKKDVPVFVKTLKELYRGPPQRTVARADTA---EEKWITQPLDHFDESN 68
Query: 85 LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
T+ RY +N + + PIF++ G E + + G +D+A +L++ EH
Sbjct: 69 TKTYEMRYFLNDEFQTDGS---PIFIFLGGEWEASPGMIQQGHWYDMAKEHNGVLIYTEH 125
Query: 145 RYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-QNLSAEASPVVLFGGSY 203
RYYGES+P + ++ +N L YL +QALAD A FI K +N S V+L GGSY
Sbjct: 126 RYYGESVP--TETMSLEN---LQYLHVKQALADVARFIETFKSENAQLTNSKVLLAGGSY 180
Query: 204 GGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF-KRESASCFNTIKE 262
+ W + YP + +G ASSAP+L D + + F + C++ I+
Sbjct: 181 SATMVVWFKRLYPDLVVGGWASSAPLLAKVDFYE---YKEVTGRAFLELGGQKCYDRIQN 237
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW-----LESAYSYLAMVDYP---- 313
EL + + E +C + DL W + + ++ +A P
Sbjct: 238 GIAELEYMFDNKRA-AEARAMLRICSSFDHENDLDMWNLFGSISNIFASVAQTQSPGDIE 296
Query: 314 YPSDFMMPL 322
Y DF++
Sbjct: 297 YYCDFLLSF 305
>gi|195062815|ref|XP_001996259.1| Pro-X carboxypeptidase [Drosophila grimshawi]
gi|193899754|gb|EDV98620.1| Pro-X carboxypeptidase [Drosophila grimshawi]
Length = 478
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 126/287 (43%), Gaps = 21/287 (7%)
Query: 39 RRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDH 98
+ P FV L L P +Q+ + E + Q LDHF ++ T+ RYLIN +
Sbjct: 25 KDVPVFVKTLWDLHRGPPQQKGISLDKATEE--WITQPLDHFDESNEKTYQMRYLINDEF 82
Query: 99 WVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEV 158
PIF+Y G E ++ + G +D+A +L++ EHRYYG S+P V
Sbjct: 83 ---QTEGSPIFIYLGGEWEVSPGMIEKGHWYDLAKEHKGLLIYTEHRYYGNSVPTEKMTV 139
Query: 159 AYQNATTLSYLTAEQALADFAVFITNLK-QNLSAEASPVVLFGGSYGGMLAAWMRLKYPH 217
L YL +QALAD FIT LK +N S V+L GGSY + W + YP
Sbjct: 140 -----DDLQYLHVKQALADVKHFITTLKSENAQLANSKVLLAGGSYSATMVVWFKRLYPD 194
Query: 218 IAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESA-SCFNTIKESWGELVSVGQKENG 276
+ +G ASSAP+L D + + F C++ I++ +L + +
Sbjct: 195 LVVGGWASSAPLLAKVDFFE---YKEVTGKAFAELGGQKCYDRIQKGIADLEYMFDNKRS 251
Query: 277 LLELTKTFHLCRELNSTEDLADW-----LESAYSYLAMVDYPYPSDF 318
E LC + DL W + + ++ LA P D+
Sbjct: 252 -AEARSMLRLCSSFDHENDLDMWNLFGSISNVFASLAQYQQPGEIDY 297
>gi|145351417|ref|XP_001420076.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580309|gb|ABO98369.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 538
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/263 (30%), Positives = 128/263 (48%), Gaps = 36/263 (13%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG---DIEWFAVN 124
+E + EQRLDHF A +++QRY +N D + R P+F+ G EG D++ AV+
Sbjct: 56 HERWFAEQRLDHFDNALNASWTQRYFVN-DAYASAERGAPVFVCVGGEGPALDVD-VAVD 113
Query: 125 SG----FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
G +A + + EHR+YG+S P G V +L +L++ QAL D
Sbjct: 114 GGEHCAIATALAKKHRGLFFALEHRFYGKSQPTGDLSV-----ESLRFLSSAQALEDLVT 168
Query: 181 FITNLKQNLSAEASP---------VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ 231
F E P V+ FGGSY GMLAAW R+K+PH+ A+ASSAP+
Sbjct: 169 FTRFAAAAYGLEIEPRNDGRKYSKVIAFGGSYPGMLAAWSRVKFPHVFHAAVASSAPVRA 228
Query: 232 FEDIVPPETFYNIVSSDFKRE----SASCFNTIKESWG-ELVSVGQKENGLLELTKTFHL 286
D+ +Y +V + + S +C+ ++ ++ L + +G L K F++
Sbjct: 229 QIDM---RGYYEVVGDALREKDVGGSDACYTAVENAFTVRLNEALKTSSGRRALEKQFNV 285
Query: 287 CRE-----LNSTEDLADWLESAY 304
C + + + +D AD L + +
Sbjct: 286 CGDEALDGVGARDDFADVLRAMF 308
>gi|308489478|ref|XP_003106932.1| hypothetical protein CRE_17219 [Caenorhabditis remanei]
gi|308252820|gb|EFO96772.1| hypothetical protein CRE_17219 [Caenorhabditis remanei]
Length = 541
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 149/323 (46%), Gaps = 27/323 (8%)
Query: 42 PRFVGKLP-------HLTEPPQRQQRQQQQQYRY-ETRYFEQRLDHFSFADLPTFSQRYL 93
P F+GK+ HL + QY + + + F Q+LDHF + T++Q+Y
Sbjct: 20 PFFLGKINGQTLLNHHLNQLTVTSNAGVLDQYPWLQVKNFTQKLDHFDPYNTKTWNQKYF 79
Query: 94 INTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESM 151
N + N + IFL G EG + +W A A FGA + EHR++G+S
Sbjct: 80 YNP-KFSRNNSI--IFLMIGGEGPENGKWAANPDVQYLQWAAEFGADVFDLEHRFFGDSW 136
Query: 152 PYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWM 211
P + +L YLT +QALAD A FI ++ Q + V FGGSY G L+AW
Sbjct: 137 P-----IPDMTTNSLRYLTTQQALADLAYFIESMNQLYGFKNPRWVTFGGSYPGSLSAWF 191
Query: 212 RLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVG 271
R KYP + +G++ASSAP+ D + +V D K +C +K+++ ++ +
Sbjct: 192 RQKYPQLTVGSVASSAPVNLKLDFY---EYAMVVEDDLKLTDVNCAPAVKDAFTKIQKLS 248
Query: 272 QKENGLLELTKTFHLCRELNSTE---DLADWLESAY-SYLAMVDYPYPSDFMMPLPGYPI 327
G +L F+L + D+ ++ + + ++ M Y Y +
Sbjct: 249 LTAEGRNQLNGYFNLQPPFDGKTTKLDINNFFGNLFNTFQGMTQYTYDGQSNSTHSDKTV 308
Query: 328 REVCKKIDNA--PDATSILERIF 348
R++C+ + N PD +E +F
Sbjct: 309 RKMCQIMTNTSEPDTVKRVENLF 331
>gi|194744673|ref|XP_001954817.1| GF16552 [Drosophila ananassae]
gi|190627854|gb|EDV43378.1| GF16552 [Drosophila ananassae]
Length = 597
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 123/252 (48%), Gaps = 14/252 (5%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E + +QR+DHF + T+ RY N + N GPI+++ G E I +++G
Sbjct: 49 EELWLDQRVDHFDENNNGTWKMRYFRNAKY---HNPQGPIYIFVGGEWTISPGLMSTGLT 105
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
D+A ML + EHRYYG S+P+G+ + L L +Q+LAD A FI + K N
Sbjct: 106 HDMAVENSGMLFYTEHRYYGLSLPHGNKSLKVHQ---LKQLNLQQSLADLAFFIRHQKSN 162
Query: 189 L-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
+ S V+L GGSY G + WM +YP + + ASSAP+L D + ++VSS
Sbjct: 163 NPELKDSKVILVGGSYSGSMVTWMTQRYPDLIAASWASSAPLLAKADFYE---YMDMVSS 219
Query: 248 DFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE--DLADWLESAY 304
+ +C I + LV + EN + L F+ C++ + D A +
Sbjct: 220 SVQLSYGQNCSQRISRGFEYLVKLFH-ENNIRTLLSKFNGCKDYDPKNPLDRAAFFNGLG 278
Query: 305 SYLAMVDYPYPS 316
+Y A++ Y +
Sbjct: 279 NYFALIVQSYSA 290
>gi|194744683|ref|XP_001954822.1| GF16550 [Drosophila ananassae]
gi|190627859|gb|EDV43383.1| GF16550 [Drosophila ananassae]
Length = 489
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 97/365 (26%), Positives = 164/365 (44%), Gaps = 33/365 (9%)
Query: 20 IVIISILSPLSLAAQPSKFR-RAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLD 78
+ I++ILSPL+ AA + + A F+ L L + + E R+ Q LD
Sbjct: 8 LAILAILSPLTTAASLGESKPEANDFIRTLKDLHRGSPAEPTMTRA--NVEERWITQWLD 65
Query: 79 HFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAM 138
+F + T+ R LIN D++V + PIF+Y G E I+ + SG DIA +
Sbjct: 66 NFDGDNNATWEDRILINEDYFVDGS---PIFIYLGGEWKIQPGDITSGLWVDIAKQHNGT 122
Query: 139 LVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVL 198
+V EHR++GES+P + Y QALAD I NLK+ + S +V+
Sbjct: 123 IVTTEHRFFGESLPI----TPFSTENLEKYQNVNQALADVINVIENLKEEDKYKDSKIVI 178
Query: 199 FGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CF 257
G SY +A W+R YP +G+ ASSAP++ D + ++ ++ +K C+
Sbjct: 179 HGCSYSASMATWIRKLYPETILGSWASSAPLVAKVDF---KEYFKVIGESYKVLGGQYCY 235
Query: 258 NTIKES---WGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPY 314
+ I + + +L + G+ + EL +LC ++ W +S +A + +
Sbjct: 236 DLIDNATSYYEDLFANGEGDQAKKEL----NLCDNFDADNKRDRW--QIFSTIANI-FAG 288
Query: 315 PSDFMMPLPGYPIREVCKKIDNAPDATSILE------RIFE--GVSVYYNYTGNVDCFQL 366
+ + P Y I + C + + D S+ RI E G + + G D ++
Sbjct: 289 IAQYQNP-ANYDIAQYCSVLRSFSDDDSVALSKFINWRIHEHSGQCISATFKGTTDYYEW 347
Query: 367 DDDPH 371
D +
Sbjct: 348 AKDNY 352
>gi|268535772|ref|XP_002633021.1| Hypothetical protein CBG21793 [Caenorhabditis briggsae]
Length = 1085
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 104/369 (28%), Positives = 162/369 (43%), Gaps = 45/369 (12%)
Query: 37 KFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINT 96
+F++ F+G+ PH P + +E F QR +HF + F Q++ N+
Sbjct: 548 RFKKV--FLGRPPHGFLPEPDYEMPADMPPGFEQGMFRQRENHFDNRNPDFFQQKFYKNS 605
Query: 97 DHWVGPNRLGPIFLYCGNEG-DIEWFAVNSGFVW-DIAPRFGAMLVFPEHRYYGESMPYG 154
W P GP FL G EG + + +N W A ++GA + EHR+YG+S+
Sbjct: 606 -QWAQPG--GPNFLMIGGEGPEGPRWVLNENLTWLTYAKKYGATVFILEHRFYGDSL--- 659
Query: 155 STEVAYQNATTLSYLTAEQALADFAVFI--TNLKQNLSAEASPVVLFGGSYGGMLAAWMR 212
QN + LT+ Q L D A FI N++ SA P + FGGSY G ++AWMR
Sbjct: 660 ----VGQNNDNFNVLTSLQMLYDLAEFIKAVNIRTGTSA---PWITFGGSYSGAMSAWMR 712
Query: 213 LKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQ 272
+P + IGA+ASS P+ D + +V + +C + I+ + + ++ Q
Sbjct: 713 EVFPELVIGAVASSGPVFAKTDFYE---YLMVVEKSIRTYDKTCADRIQSGFSTMQTMFQ 769
Query: 273 KENGLLELTKTFHLCREL--NSTEDLADWLESAYSYLAMVDYPYPSDFMMPLP-GYPIRE 329
+ G L+ F L N T+ + S Y D P GY I +
Sbjct: 770 TKEGRQNLSDIFQLQPPFGDNVTDTDQHYFFSNIYGNFQGAVQYSGDNTGPYANGYGIPD 829
Query: 330 VCKKIDNAPDATSILERIF---EGVSVYY----NYTGNVDCFQ--LDDDPHGLDG----- 375
+CK + N D+ + L I E ++V+Y NYTG + +Q +D H +
Sbjct: 830 MCKFMTN--DSNTPLNNIVQFNEYMTVFYNNGRNYTGTDNKYQDMIDSLVHAQESGPESA 887
Query: 376 ----WNWQV 380
W WQ
Sbjct: 888 ASLLWTWQT 896
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 70/230 (30%), Positives = 105/230 (45%), Gaps = 16/230 (6%)
Query: 70 TRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV--NSGF 127
T Y Q LDH++ TF QRY T+ + R FLY GD E + +
Sbjct: 47 TAYMIQNLDHYNGNASGTFIQRYYY-TESYTLHQRTA--FLYISVSGDFETSVITDDRNP 103
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI--TNL 185
V A +FGA + EHRYYG+S P VA ++ +L YL + QA+ D FI N
Sbjct: 104 VVKSAKQFGATVFSLEHRYYGQSKP----NVANFDSNSLRYLNSFQAIQDIVAFIKYANK 159
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ N+ + VL+G YGG++AA R P + G +ASS+P+ D F + V
Sbjct: 160 QFNMDPDVR-WVLWGAGYGGVIAAEARKWNPDVVAGVIASSSPLTHVYDF---WQFNDHV 215
Query: 246 SSDFKRESAS-CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
+E C+ I + + ++ + G ++ F L L+ T
Sbjct: 216 QIAISQEGGQLCYQKIMQGFTDIRLAMRTPEGRSNISDLFQLNPRLDQTN 265
>gi|195062821|ref|XP_001996260.1| GH22290 [Drosophila grimshawi]
gi|193899755|gb|EDV98621.1| GH22290 [Drosophila grimshawi]
Length = 633
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 130/283 (45%), Gaps = 21/283 (7%)
Query: 20 IVIISILSPLSLAAQPSK--FRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRL 77
+ I +L L AQ F+R + + P L P Q R Q T + EQ+L
Sbjct: 6 VGITLVLLALGQCAQAGDNIFQRTFKQLHGEPPL---PANQNRADVVQ----TLWIEQKL 58
Query: 78 DHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGA 137
DHF ++ T+ RY++N + GP+F++ G E I + G ++D+A
Sbjct: 59 DHFDESETRTWQMRYMLNDGFFKAG---GPMFIFFGGEWTISPGRITGGHMYDMAKEHNG 115
Query: 138 MLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA-SPV 196
+LV+ EHRYYGES P +++ +N + YL QALAD A FIT K + S V
Sbjct: 116 LLVYTEHRYYGESHPL--PDLSNEN---IQYLHVTQALADLAHFITTQKTTYEGLSDSKV 170
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASC 256
++ GGSY + W + YP + +G ASSAP+L + + + S A C
Sbjct: 171 IIVGGSYSATMVTWFKKIYPDLVVGGWASSAPLLAKLNFLEYKEIMG--QSITLMGGADC 228
Query: 257 FNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
I+ E+ ++ + G E+ LC + DL W
Sbjct: 229 NKRIENGIAEMETMFATKRG-AEVKALLKLCEHFDVYSDLDVW 270
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 40/135 (29%), Positives = 59/135 (43%), Gaps = 4/135 (2%)
Query: 166 LSYLTAEQALADFAVFITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
+ YL QALAD A FIT K + S V++ GGSY + W + YP + +G A
Sbjct: 295 IQYLHVTQALADLAHFITTQKTTYEGLSDSKVIIVGGSYSATMVTWFKKIYPDLVVGGWA 354
Query: 225 SSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF 284
SSAP+L + + + S A C I+ E+ ++ + G E+
Sbjct: 355 SSAPLLAKLNFLEYKEIMG--QSITLMGGADCNKRIENGIAEMETMFATKRG-AEVKALL 411
Query: 285 HLCRELNSTEDLADW 299
LC + DL W
Sbjct: 412 KLCEHFDVYSDLDVW 426
>gi|195497939|ref|XP_002096313.1| GE25601 [Drosophila yakuba]
gi|195497943|ref|XP_002096315.1| GE25604 [Drosophila yakuba]
gi|194182414|gb|EDW96025.1| GE25601 [Drosophila yakuba]
gi|194182416|gb|EDW96027.1| GE25604 [Drosophila yakuba]
Length = 485
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 87/294 (29%), Positives = 136/294 (46%), Gaps = 17/294 (5%)
Query: 40 RAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHW 99
A FV L L P + + + + E R+ Q+LD+F ++ T+ R IN ++
Sbjct: 26 EATAFVKSLRELHRGPPVEPMKTRA--KVEERWITQKLDNFDDSNNATWQDRIYINNKYF 83
Query: 100 VGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVA 159
V + PIF+Y G E I+ + SG DIA + L++ EHR++GES+P T ++
Sbjct: 84 VDGS---PIFIYLGGEWAIDPSGITSGLWKDIAKQHNGSLLYTEHRFFGESIPI--TPLS 138
Query: 160 YQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIA 219
+N Y + EQALAD I LKQ + S VV+ G SY +A W+R YP I
Sbjct: 139 TENLA--KYQSVEQALADVINVIATLKQEDKYKDSKVVVSGCSYSATMATWIRKMYPEII 196
Query: 220 IGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKESWGELVSVGQKENGLL 278
G+ ASSAPIL V + + +V + C++ I + ++ + NG
Sbjct: 197 RGSWASSAPILA---KVNFKDYMKVVGESYATLGGQYCYDLIDNATSYYENLFEIGNG-T 252
Query: 279 ELTKTFHLCRELNSTEDLADWL---ESAYSYLAMVDYPYPSDFMMPLPGYPIRE 329
+ K +LC + D W A + + Y P + +P +RE
Sbjct: 253 QAAKELNLCSNFDVNSDQDRWQIFSTIANIFAGIAQYQKPEKYDIPTYCSILRE 306
>gi|440289971|gb|ELP83425.1| serine carboxypeptidase (S28) family protein, partial [Entamoeba
invadens IP1]
Length = 220
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 71/233 (30%), Positives = 119/233 (51%), Gaps = 15/233 (6%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD-IAPRF 135
+DHF+ + F +Y ++ + G + P+F+ G EG +++ ++ D +A R
Sbjct: 1 MDHFNANNDEEFEVKYFVSEKYLDGTDLHSPLFVMLGGEGPESSKTLDNHYIIDTLAART 60
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
+++ EHR+YG+S P + L Y TAEQA+ D+ IT +++ + P
Sbjct: 61 NGLMLAIEHRFYGDSTPSLKMD-------KLIYCTAEQAMMDYIEIITYIQETRNFIDHP 113
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS 255
V++ GGSY G LAAWMR KYP++ GA ASSAP+ E V + +V + +A
Sbjct: 114 VIVIGGSYSGNLAAWMRQKYPNVVDGAWASSAPV---EAQVDFYQYLEVVQAGLPANTAD 170
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLA 308
+ E W ++ E+G EL K F+ C + +D+ + E+ + LA
Sbjct: 171 LLSIAFEKWDQMTVT---ESGRKELKKVFNTCTDFGE-DDIQTFAETIGTALA 219
>gi|116788543|gb|ABK24916.1| unknown [Picea sitchensis]
Length = 489
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 69/165 (41%), Positives = 96/165 (58%), Gaps = 15/165 (9%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F QRLDH+S D F+QRY D++ + GPIFL E + A + ++ +
Sbjct: 47 WFSQRLDHYSPTDHRQFNQRYYEFLDYFQAHD--GPIFLKVCGEYSCDGIA--NDYMAVL 102
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ITNL 185
A +FGA +V EHRYYG+S P+ + L YL+++QAL D A F +TN
Sbjct: 103 AKKFGAAIVSLEHRYYGKSSPFKES-----TTHNLQYLSSKQALFDLASFRNYYQELTNK 157
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
KQNLS + +FG SY G L+AW RLK+PH+ G+LASSA +L
Sbjct: 158 KQNLSNYDNSWFVFGVSYPGALSAWFRLKFPHLTCGSLASSAVVL 202
>gi|345323658|ref|XP_001512518.2| PREDICTED: putative serine protease K12H4.7-like [Ornithorhynchus
anatinus]
Length = 489
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 75/210 (35%), Positives = 115/210 (54%), Gaps = 11/210 (5%)
Query: 106 GPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT 165
GP+FL G EG ++ G A + GA+ + EHR+YG+S P + +++ N
Sbjct: 77 GPVFLMIGGEGPANPAWMHHGTWLTYAEKLGALCLLLEHRFYGKSHP--TQDLSTSN--- 131
Query: 166 LSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALAS 225
L YL++ QALAD A F T + + L + V FGGSY G LAAW RLKYPH+ A+A+
Sbjct: 132 LHYLSSRQALADLAHFRTVMAEKLGLVDNKWVAFGGSYPGSLAAWFRLKYPHLVDVAVAT 191
Query: 226 SAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFH 285
SAPI + PE + +V + C ++E+ +V + + + ++TK F+
Sbjct: 192 SAPIYAVMNF--PE-YLEVVQKSLRTFRRECPEIVREASDTVVEMLKYKANYFKITKDFN 248
Query: 286 LCREL--NSTEDLADWLES-AYSYLAMVDY 312
LC +L S D A +LES A +++ +V Y
Sbjct: 249 LCEQLQIKSKMDSAYFLESLASNFMDVVQY 278
>gi|335291865|ref|XP_003356607.1| PREDICTED: thymus-specific serine protease [Sus scrofa]
Length = 514
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 109/202 (53%), Gaps = 17/202 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N +W + GP+FL+ G EG + +V G +
Sbjct: 61 WLEQPLDPFNASDRQSFLQRYWVNDQYWTSQD--GPVFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P +A L +L++ ALAD L + N+
Sbjct: 119 APVWGALVIGLEHRFYGLSIPAEGLGMA-----KLRFLSSRHALADVVSARLALTRLFNV 173
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ D + ++VS
Sbjct: 174 SS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDF---SEYNDVVSKSL 229
Query: 250 KRE----SASCFNTIKESWGEL 267
S C + ++ E+
Sbjct: 230 MNTAIGGSPECRAAVSSAFAEV 251
>gi|290998882|ref|XP_002682009.1| peptidase S28 [Naegleria gruberi]
gi|284095635|gb|EFC49265.1| peptidase S28 [Naegleria gruberi]
Length = 434
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 88/293 (30%), Positives = 143/293 (48%), Gaps = 33/293 (11%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD-IAP 133
QRLDHF + TF QR+ +N W G N +F+ G EG + FV +
Sbjct: 3 QRLDHFDPQNTETFQQRFWVNDTMWQGKN----VFIIIGGEGPASSKYLTGHFVINEYGK 58
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA 193
+ GA+L EHR+YGES+P S +A N L YLT+EQAL D F + L + +
Sbjct: 59 KHGALLAALEHRFYGESVPRKS--LATDN---LRYLTSEQALQDLVEFRSLLVKKYRMDE 113
Query: 194 SPV--VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI---LQFEDIVPPETFYNIVSSD 248
+ V V FGGSY G L+AW++ KYPH+ +GA+ASS P+ L+F + + T N +
Sbjct: 114 ANVKFVCFGGSYSGNLSAWLKAKYPHLFVGAIASSGPVEAKLEFNEYM--MTVANSI--- 168
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC--RELNSTEDLADWLES-AYS 305
C + ++++ + + G + F++C + + +D+A S +
Sbjct: 169 ----GPKCTDRVRKANDLIEQLIATPAGRQRVASMFNVCNPESMTNNDDIALLFSSLSDG 224
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYT 358
+V Y ++ G+ + +C I+++ DA LE V + +Y+
Sbjct: 225 VCEVVQYNLDNN---GAQGFNVTSMCAIIESSDDA---LEGFANWVKTWNSYS 271
>gi|218184874|gb|EEC67301.1| hypothetical protein OsI_34292 [Oryza sativa Indica Group]
Length = 550
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 68/169 (40%), Positives = 94/169 (55%), Gaps = 17/169 (10%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E R+ +QRLDHFS D F QRY D+ G GP+FL E + + ++
Sbjct: 50 EERWMDQRLDHFSPTDHRQFKQRYYEFADYHAGG---GPVFLRICGESSCN--GIPNDYL 104
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------I 182
+A +FGA +V PEHRYYG+S P+ S L +L+++QAL D A F I
Sbjct: 105 AVLAKKFGAAVVTPEHRYYGKSSPFESL-----TTENLRFLSSKQALFDLAAFRQHYQEI 159
Query: 183 TNLKQNLSAE-ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N + N S+ +P +FG SY G L+AW RLK+PH+ G+LASS +L
Sbjct: 160 LNARYNRSSGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGVVL 208
>gi|167533847|ref|XP_001748602.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772843|gb|EDQ86489.1| predicted protein [Monosiga brevicollis MX1]
Length = 459
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/182 (35%), Positives = 101/182 (55%), Gaps = 11/182 (6%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG-----DIEWFAVNS 125
R+ QRLDHF +D T+ Q Y +N+ ++ + P++L G EG + +V+
Sbjct: 10 RFVTQRLDHFDGSDTTTWQQAYYVNSTYFQAGSD-APVYLCVGGEGPPLDGSVVVASVHC 68
Query: 126 GFVWDIAPRFGAMLVFPEHRYYG-ESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
++ P+ GA++ EHRYYG +M E L YL++ QAL D A FI+
Sbjct: 69 NVAVELLPKTGAIMFALEHRYYGCHNMSACPVENPLAKGA-LRYLSSRQALGDLAAFISY 127
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
++Q + + +V FGGSY GMLA W RLKYPH+ ++ASSAP+ E ++ +Y++
Sbjct: 128 IRQQYNLPNNKIVTFGGSYPGMLAGWARLKYPHLVHASVASSAPV---EAVLDMRGYYDV 184
Query: 245 VS 246
+
Sbjct: 185 TA 186
>gi|51972041|dbj|BAD44685.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
Length = 488
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 73/185 (39%), Positives = 103/185 (55%), Gaps = 19/185 (10%)
Query: 55 PQRQQRQQQQQYRYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
P+R + +Y TR +F Q LDH+S +D F QRY DH P+ GPIF+
Sbjct: 29 PRRISHGLTESSKYLTRDELWFNQTLDHYSPSDHREFKQRYYEYLDHLRVPD--GPIFMM 86
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
EG + + ++ +A +F A +V EHRYYG+S P+ S +A +N L YL++
Sbjct: 87 ICGEGPCN--GIPNDYITVLAKKFDAGIVSLEHRYYGKSSPFKS--LATEN---LKYLSS 139
Query: 172 EQALADFAVF------ITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
+QAL D A F N+K N S + +P FG SY G L+AW RLK+PH+ G+LA
Sbjct: 140 KQALFDLAAFRQYYQDSLNVKFNRSGDVENPWFFFGASYSGALSAWFRLKFPHLTCGSLA 199
Query: 225 SSAPI 229
SSA +
Sbjct: 200 SSAVV 204
>gi|79326354|ref|NP_001031795.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|51968458|dbj|BAD42921.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|332661231|gb|AEE86631.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 477
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 73/185 (39%), Positives = 103/185 (55%), Gaps = 19/185 (10%)
Query: 55 PQRQQRQQQQQYRYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
P+R + +Y TR +F Q LDH+S +D F QRY DH P+ GPIF+
Sbjct: 29 PRRISHGLTESSKYLTRDELWFNQTLDHYSPSDHREFKQRYYEYLDHLRVPD--GPIFMM 86
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
EG + + ++ +A +F A +V EHRYYG+S P+ S +A +N L YL++
Sbjct: 87 ICGEGPCN--GIPNDYITVLAKKFDAGIVSLEHRYYGKSSPFKS--LATEN---LKYLSS 139
Query: 172 EQALADFAVF------ITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
+QAL D A F N+K N S + +P FG SY G L+AW RLK+PH+ G+LA
Sbjct: 140 KQALFDLAAFRQYYQDSLNVKFNRSGDVENPWFFFGASYSGALSAWFRLKFPHLTCGSLA 199
Query: 225 SSAPI 229
SSA +
Sbjct: 200 SSAVV 204
>gi|18419800|ref|NP_567999.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|334187220|ref|NP_001190936.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|14334574|gb|AAK59466.1| unknown protein [Arabidopsis thaliana]
gi|22136934|gb|AAM91811.1| unknown protein [Arabidopsis thaliana]
gi|51970170|dbj|BAD43777.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|51970244|dbj|BAD43814.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|51970332|dbj|BAD43858.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|51970726|dbj|BAD44055.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|51970916|dbj|BAD44150.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
gi|332661230|gb|AEE86630.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
gi|332661232|gb|AEE86632.1| Serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 488
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 73/185 (39%), Positives = 103/185 (55%), Gaps = 19/185 (10%)
Query: 55 PQRQQRQQQQQYRYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
P+R + +Y TR +F Q LDH+S +D F QRY DH P+ GPIF+
Sbjct: 29 PRRISHGLTESSKYLTRDELWFNQTLDHYSPSDHREFKQRYYEYLDHLRVPD--GPIFMM 86
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
EG + + ++ +A +F A +V EHRYYG+S P+ S +A +N L YL++
Sbjct: 87 ICGEGPCN--GIPNDYITVLAKKFDAGIVSLEHRYYGKSSPFKS--LATEN---LKYLSS 139
Query: 172 EQALADFAVF------ITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
+QAL D A F N+K N S + +P FG SY G L+AW RLK+PH+ G+LA
Sbjct: 140 KQALFDLAAFRQYYQDSLNVKFNRSGDVENPWFFFGASYSGALSAWFRLKFPHLTCGSLA 199
Query: 225 SSAPI 229
SSA +
Sbjct: 200 SSAVV 204
>gi|51968542|dbj|BAD42963.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
Length = 462
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 73/185 (39%), Positives = 103/185 (55%), Gaps = 19/185 (10%)
Query: 55 PQRQQRQQQQQYRYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
P+R + +Y TR +F Q LDH+S +D F QRY DH P+ GPIF+
Sbjct: 3 PRRISHGLTESSKYLTRDELWFNQTLDHYSPSDHREFKQRYYEYLDHLRVPD--GPIFMM 60
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
EG + + ++ +A +F A +V EHRYYG+S P+ S +A +N L YL++
Sbjct: 61 ICGEGPCN--GIPNDYITVLAKKFDAGIVSLEHRYYGKSSPFKS--LATEN---LKYLSS 113
Query: 172 EQALADFAVF------ITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
+QAL D A F N+K N S + +P FG SY G L+AW RLK+PH+ G+LA
Sbjct: 114 KQALFDLAAFRQYYQDSLNVKFNRSGDVENPWFFFGASYSGALSAWFRLKFPHLTCGSLA 173
Query: 225 SSAPI 229
SSA +
Sbjct: 174 SSAVV 178
>gi|51970574|dbj|BAD43979.1| prolyl carboxypeptidase like protein [Arabidopsis thaliana]
Length = 488
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 73/185 (39%), Positives = 103/185 (55%), Gaps = 19/185 (10%)
Query: 55 PQRQQRQQQQQYRYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
P+R + +Y TR +F Q LDH+S +D F QRY DH P+ GPIF+
Sbjct: 29 PRRISHGLTESSKYLTRDELWFNQTLDHYSPSDHREFKQRYYEYLDHLRVPD--GPIFMM 86
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
EG + + ++ +A +F A +V EHRYYG+S P+ S +A +N L YL++
Sbjct: 87 ICGEGPCN--GIPNDYITVLAKKFDAGIVSLEHRYYGKSSPFKS--LATEN---LKYLSS 139
Query: 172 EQALADFAVF------ITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
+QAL D A F N+K N S + +P FG SY G L+AW RLK+PH+ G+LA
Sbjct: 140 KQALFDLAAFRQYYQDSLNVKFNRSGDVENPWFFFGASYSGALSAWFRLKFPHLTCGSLA 199
Query: 225 SSAPI 229
SSA +
Sbjct: 200 SSAVV 204
>gi|170574564|ref|XP_001892869.1| Serine protease Z688.6 precursor [Brugia malayi]
gi|158601363|gb|EDP38291.1| Serine protease Z688.6 precursor, putative [Brugia malayi]
Length = 108
Score = 110 bits (276), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 52/98 (53%), Positives = 70/98 (71%), Gaps = 4/98 (4%)
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+WD+AP F A ++F EHR+YG+S P+G+ +Y L YL++EQAL DFA+ I +LK
Sbjct: 1 MWDLAPEFNAAIIFAEHRFYGKSQPFGNE--SYATIRNLGYLSSEQALGDFALLIYHLKN 58
Query: 188 N--LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGAL 223
L A+ S V+ FGGSYGGMLAAWMR+KYPH+ G+
Sbjct: 59 KRLLVAQNSSVIAFGGSYGGMLAAWMRIKYPHLVEGSF 96
>gi|441593763|ref|XP_004087106.1| PREDICTED: thymus-specific serine protease isoform 2 [Nomascus
leucogenys]
Length = 541
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 137/296 (46%), Gaps = 53/296 (17%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HWVG + GPIFL+ G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSDRRSFLQRYWVNDQHWVGQD--GPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF---------- 181
AP +GA+++ EHR+YG S+P G E+A L +L++ A+ +
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRHAMGKSSGIPSDEDRPSPP 173
Query: 182 -------ITNLKQNLS-----AEASPVVLFGGSYGGMLAAWMRLK------YPHIAIGAL 223
+ + + LS + +SP + FGGSY G LAAW RLK +PH+ ++
Sbjct: 174 FDPRLADVVSARLALSRLFNVSSSSPWICFGGSYAGSLAAWARLKVLRLLRFPHLIFASV 233
Query: 224 ASSAPILQFEDIVPPETFYNIVSSDFKRE----SASCFNTIKESWGE----LVSVGQKEN 275
ASSAP+ D + ++VS S C + ++ E L S G +
Sbjct: 234 ASSAPVRAVLDF---SEYNDVVSRSLMSTEIGGSLECRAAVSVAFAEVERRLRSGGAAQA 290
Query: 276 GLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L +C L E+ A+ L + + + V Y +PL +R++C
Sbjct: 291 A---LRTELSVCGPLGRAENQAELLGALQALVGGV-VQYDGQAGVPL---SVRQLC 339
>gi|395736879|ref|XP_003776822.1| PREDICTED: thymus-specific serine protease isoform 2 [Pongo abelii]
Length = 541
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 85/294 (28%), Positives = 134/294 (45%), Gaps = 49/294 (16%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ ++ +F QRY +N HWVG + GPIFL+ G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSNRRSFLQRYWVNDQHWVGQD--GPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF---------- 181
AP +GA+++ EHR+YG S+P G E+A L +L++ A+ F+
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRHAMGKFSGIPSDEDRPSPP 173
Query: 182 -------ITNLKQNLS-----AEASPVVLFGGSYGGMLAAWMRLK------YPHIAIGAL 223
+ + + LS + +SP + FGGSY G LAAW RLK +PH+ ++
Sbjct: 174 FDPRLADVVSARLALSRLFNISSSSPWICFGGSYAGSLAAWARLKVLRLLRFPHLIFASV 233
Query: 224 ASSAPIL------QFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGL 277
ASSAP+ ++ D+V + + +A E L S G +
Sbjct: 234 ASSAPVRAVLDFSEYNDVVSRSLMSTAIGGSLECRAAVSV-AFAEVERRLRSGGAAQAA- 291
Query: 278 LELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L C L E+ A+ L + + + V Y PL +R++C
Sbjct: 292 --LRTELSACGPLGRAENQAELLGALQALVGGV-VQYDGQAGAPL---SVRQLC 339
>gi|344268141|ref|XP_003405921.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 429
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 75/215 (34%), Positives = 111/215 (51%), Gaps = 16/215 (7%)
Query: 91 RYLINTDHWVGPNRLGPIFLYCGN-EGDIE-WFAVNSGFVWDIAPRFGAMLVFPEHRYYG 148
RY IN D + P GP+FL G E E W ++N+ +V A R GA+ + EHR+YG
Sbjct: 5 RYFIN-DAFYKPG--GPVFLNIGGPETACESWISMNNTWV-TYAERLGALFLLLEHRFYG 60
Query: 149 ESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLA 208
S P G A +L YL++ QALAD F T + + + + V FG YGG LA
Sbjct: 61 HSQPTGDLSTA-----SLRYLSSRQALADIVNFRTKIAKKMGLTENKWVAFGCWYGGFLA 115
Query: 209 AWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELV 268
AW R+KYP + A+ SSAP+ + + +V ++ CF T+KE++ ++V
Sbjct: 116 AWSRIKYPELFAAAVGSSAPMQAKANFY---EYLEVVQRSLATHNSECFQTVKEAFKQVV 172
Query: 269 SVGQKENGLLELTKTFHLCR--ELNSTEDLADWLE 301
+ + +L F LC+ +L S D A +LE
Sbjct: 173 KMMKLPEFYSKLEDDFTLCKPLKLYSAMDKAFFLE 207
>gi|449499944|ref|XP_004160961.1| PREDICTED: probable serine protease EDA2-like isoform 2 [Cucumis
sativus]
Length = 486
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/215 (38%), Positives = 110/215 (51%), Gaps = 35/215 (16%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F Q LDHFS + F QRY D++ P+ GPIFL EG +++ ++ +
Sbjct: 51 WFNQTLDHFSPYNHDKFQQRYYEFLDYFRIPD--GPIFLKICGEGPCN--GISNDYLGVL 106
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ITNL 185
A +FGA +V EHRYYG+S P+ S L YL+++QAL D AVF NL
Sbjct: 107 AKKFGAAIVSLEHRYYGKSSPFKSL-----TTNNLRYLSSKQALFDLAVFRQYYQDSLNL 161
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K N E +P FG SY G L+AW RLK+PH+ G+LASSA +L YN
Sbjct: 162 KLNKKGE-NPWFFFGVSYPGALSAWFRLKFPHLTCGSLASSAVVL---------AVYNFT 211
Query: 246 SSDFKRESASCFNTIKESWG-ELVSVGQKENGLLE 279
D I ES G E +V Q+ N L+E
Sbjct: 212 EFD---------QQIGESAGPECKAVLQETNRLIE 237
>gi|402866199|ref|XP_003919573.1| PREDICTED: LOW QUALITY PROTEIN: thymus-specific serine protease
[Papio anubis]
Length = 501
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 134/297 (45%), Gaps = 55/297 (18%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HWVG + GPIFL+ G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSDRRSFLQRYWVNEQHWVGED--GPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF---------- 181
AP +GA+++ EHR+YG S+P G E+A L +L++ A+ +
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRHAMGKSSGIPSDEDRPSSP 173
Query: 182 -------ITNLKQNLS-----AEASPVVLFGGSYGGMLAAWMRLK------YPHIAIGAL 223
+ + + LS + +SP + FGGSY G LAAW RLK +PH+ ++
Sbjct: 174 SDPRLADVVSARLALSRLFNVSSSSPWICFGGSYAGSLAAWARLKVLXLLRFPHLIFASV 233
Query: 224 ASSAPIL------QFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGEL---VSVGQKE 274
ASSAP+ ++ D+V + + C + ++ E+ + +G
Sbjct: 234 ASSAPVRAVLDFSEYNDVVSRSLMSTAIGGSLE-----CRAAVSAAFAEVERRLRLGGAA 288
Query: 275 NGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L C L S E+ A+ L A L Y PL +R++C
Sbjct: 289 QAALR--SELSACGTLGSAENQAELL-GALQALVGGAVQYDGQAGAPL---SVRQLC 339
>gi|307184675|gb|EFN71004.1| Putative serine protease F56F10.1 [Camponotus floridanus]
Length = 418
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/178 (37%), Positives = 103/178 (57%), Gaps = 13/178 (7%)
Query: 113 GNEG--DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLT 170
G EG +++W G + A FGAM + EHR+YG S P + +++ +N L YL
Sbjct: 3 GAEGIANVKWMV--EGQWIEYAKEFGAMCFYLEHRFYGNSHP--TPDLSVKN---LIYLN 55
Query: 171 AEQALADFAVFITNLKQNLS-AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
++QALAD A FI N+ + + ++FGGSYGG LAAWMR+KYPH+ GA+++S P+
Sbjct: 56 SQQALADLAYFIQNINIEYKFSNNTKWIVFGGSYGGSLAAWMRIKYPHLVHGAVSTSGPL 115
Query: 230 LQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC 287
L D + ++ +V++ K S C +TI E++ EL + + ++ K F LC
Sbjct: 116 LAQIDF---QEYFVVVANALKDYSQKCVDTIAEAYRELGILLRHVGSQQKIEKKFKLC 170
>gi|449457546|ref|XP_004146509.1| PREDICTED: probable serine protease EDA2-like [Cucumis sativus]
gi|449499940|ref|XP_004160960.1| PREDICTED: probable serine protease EDA2-like isoform 1 [Cucumis
sativus]
Length = 489
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 83/215 (38%), Positives = 110/215 (51%), Gaps = 35/215 (16%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F Q LDHFS + F QRY D++ P+ GPIFL EG +++ ++ +
Sbjct: 51 WFNQTLDHFSPYNHDKFQQRYYEFLDYFRIPD--GPIFLKICGEGPCN--GISNDYLGVL 106
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ITNL 185
A +FGA +V EHRYYG+S P+ S L YL+++QAL D AVF NL
Sbjct: 107 AKKFGAAIVSLEHRYYGKSSPFKSL-----TTNNLRYLSSKQALFDLAVFRQYYQDSLNL 161
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K N E +P FG SY G L+AW RLK+PH+ G+LASSA +L YN
Sbjct: 162 KLNKKGE-NPWFFFGVSYPGALSAWFRLKFPHLTCGSLASSAVVL---------AVYNFT 211
Query: 246 SSDFKRESASCFNTIKESWG-ELVSVGQKENGLLE 279
D I ES G E +V Q+ N L+E
Sbjct: 212 EFD---------QQIGESAGPECKAVLQETNRLIE 237
>gi|413933853|gb|AFW68404.1| putative serine peptidase S28 family protein [Zea mays]
Length = 478
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/168 (39%), Positives = 95/168 (56%), Gaps = 15/168 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+ R+ QRLDHFS D F QRY D+ P GP+FL E + + + ++
Sbjct: 38 DERWMNQRLDHFSPTDHRQFKQRYFEFLDYHRAPG--GPVFLRICGESACD--GIPNDYL 93
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT----- 183
+A +FGA +V PEHRYYG+S P+ ++ +N L +L+++QAL D AVF
Sbjct: 94 AVLAKKFGAAVVTPEHRYYGKSSPF--KQLTTEN---LRFLSSKQALFDLAVFRQYYQES 148
Query: 184 -NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N + N S +P + G SY G L+AW RLK+PH+ G+LASS +L
Sbjct: 149 LNARYNRSGFDNPWFVIGVSYSGALSAWFRLKFPHLTCGSLASSGVVL 196
>gi|195158030|ref|XP_002019897.1| GL12651 [Drosophila persimilis]
gi|194116488|gb|EDW38531.1| GL12651 [Drosophila persimilis]
Length = 482
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 77/234 (32%), Positives = 114/234 (48%), Gaps = 28/234 (11%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E + EQ++D+F + T+ RYL N + GPIF++ G E I +++G
Sbjct: 49 EELWLEQKVDNFDALNNQTWKMRYLRNGKY---HRNQGPIFIFVGGEWSISPGFLSTGLT 105
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-Q 187
D+A ML + EHRYYG+S+P+G L +L+ Q+LAD A FI K +
Sbjct: 106 HDMAVENSGMLFYTEHRYYGQSLPHGKESF---RVDKLQHLSIYQSLADLAHFIRFQKSE 162
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
N + S V+L GGSY G + AWM YP + + ASSAP+L +
Sbjct: 163 NPRMKQSEVILVGGSYSGSMVAWMTQLYPDLIAASWASSAPLL--------------AKA 208
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF------HLCRELNSTED 295
DF N+I+ S+G+ + + + G LTK F L ++LN ED
Sbjct: 209 DFHEYMEVASNSIRLSYGQNCTT-RIQKGFQHLTKLFEENQIPELLQKLNGCED 261
>gi|302796113|ref|XP_002979819.1| hypothetical protein SELMODRAFT_111339 [Selaginella moellendorffii]
gi|300152579|gb|EFJ19221.1| hypothetical protein SELMODRAFT_111339 [Selaginella moellendorffii]
Length = 472
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 64/173 (36%), Positives = 100/173 (57%), Gaps = 16/173 (9%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+ ++ Q LDH++ D TFSQRY TD++ PN GP+FL EG + + +
Sbjct: 33 DAHWYTQTLDHYATQDDRTFSQRYYEFTDYFDAPN--GPVFLKICPEGPC--VGIQNDYS 88
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------I 182
+A RFGA +V EHRYYG+S P+ + L YL+++QAL D A F +
Sbjct: 89 AVLAKRFGAAIVSLEHRYYGQSSPFKT-----HATENLIYLSSKQALFDLAAFREYYQDL 143
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
N + N +++ +P ++ G SY G L+AW +LK+PH+A+G++ASS + D+
Sbjct: 144 INHRTNSTSD-NPWIVIGWSYAGALSAWFKLKFPHLAVGSVASSGIVQAIFDV 195
>gi|281202572|gb|EFA76774.1| peptidase S28 family protein [Polysphondylium pallidum PN500]
Length = 463
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 60/166 (36%), Positives = 97/166 (58%), Gaps = 12/166 (7%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW---FAV 123
++E +++ Q L+HF D TF QRY +N D + + GPI LY EG +
Sbjct: 28 KWEYQWYNQTLNHFDAEDTRTFLQRYYVN-DQYYDYKKGGPIILYINGEGPVSSPPNKPT 86
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
+ ++ A GA++V EHRYYG+S P+ +++ +N L +L++ QAL D A+FI
Sbjct: 87 DGTVIY--AQALGALIVTLEHRYYGDSSPFA--DLSTEN---LKFLSSRQALNDLAIFIL 139
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+ + + A ++ GGSY G L+AW R+KYPH+ +G++ASS +
Sbjct: 140 DYRSTIQ-NAGDIITIGGSYSGALSAWFRVKYPHVTVGSVASSGVV 184
>gi|198455513|ref|XP_001360029.2| GA11106, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198133277|gb|EAL29181.2| GA11106, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 482
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 77/234 (32%), Positives = 114/234 (48%), Gaps = 28/234 (11%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E + EQ++D+F + T+ RYL N + GPIF++ G E I +++G
Sbjct: 49 EELWLEQKVDNFDALNNQTWKMRYLRNGKY---HRNQGPIFIFVGGEWSISPGFLSTGLT 105
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-Q 187
D+A ML + EHRYYG+S+P+G L +L+ Q+LAD A FI K +
Sbjct: 106 HDMAVENSGMLFYTEHRYYGQSLPHGKESF---RVDKLQHLSIYQSLADLAHFIRFQKSE 162
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
N + S V+L GGSY G + AWM YP + + ASSAP+L +
Sbjct: 163 NPRMKQSEVILVGGSYSGSMVAWMTQLYPDLIAASWASSAPLL--------------AKA 208
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF------HLCRELNSTED 295
DF N+I+ S+G+ + + + G LTK F L ++LN ED
Sbjct: 209 DFHEYMEVASNSIRLSYGQNCTT-RIQKGFQHLTKLFEENQIPELLQKLNGCED 261
>gi|308452876|ref|XP_003089215.1| hypothetical protein CRE_21658 [Caenorhabditis remanei]
gi|308241640|gb|EFO85592.1| hypothetical protein CRE_21658 [Caenorhabditis remanei]
Length = 971
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 114/221 (51%), Gaps = 19/221 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN--SG 126
E F QR+DHF+ + F Q+Y N+ + P GP FL G E V S
Sbjct: 268 EIGMFRQRIDHFNNKNTKFFQQKYFKNS-RFARPG--GPNFLMIGGESPAHGSHVKNLSS 324
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
+ A +GA++ EHR+YG+S+ +N T L+ L++ Q L D A FI ++
Sbjct: 325 AIMRRAKEYGAIVYLLEHRFYGDSV--------VENNTDLTTLSSLQMLYDIAEFIKSV- 375
Query: 187 QNLSAEAS-PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
N +E S P + FGGSY G L+AWMR +P + IGA+ASSAP+L D + +V
Sbjct: 376 -NFKSETSNPWITFGGSYPGALSAWMREIFPDLVIGAIASSAPVLAKTDFYE---YMMVV 431
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHL 286
+ F +C+ IK + E+ + Q ++G +L+ F L
Sbjct: 432 ENSFLIYDPACYQEIKNGFDEIHELFQTDSGREKLSDLFKL 472
>gi|302797895|ref|XP_002980708.1| hypothetical protein SELMODRAFT_444586 [Selaginella moellendorffii]
gi|300151714|gb|EFJ18359.1| hypothetical protein SELMODRAFT_444586 [Selaginella moellendorffii]
Length = 1028
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 82/220 (37%), Positives = 114/220 (51%), Gaps = 26/220 (11%)
Query: 17 VITIVIISILSPLSLAAQPS-KFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQ 75
++ V+++ + S A PS + P F LP + P R + + + TR Q
Sbjct: 5 LVCFVLLATSTSFSSAFVPSSRLGFKPEF---LPEGSRSPPRGK----ETVNFFTR---Q 54
Query: 76 RLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRF 135
+LDHF+ D FSQ+YL D + N GPIFL E V + +V +A F
Sbjct: 55 KLDHFAPEDPRVFSQKYLELLDFFRPHN--GPIFLVMCGESTCTGDYVTT-YVGTLAESF 111
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI------TNLKQNL 189
GA +V EHRYYG S P+ + N L YLT++Q+L D AVFI N K N
Sbjct: 112 GAAIVTVEHRYYGHSSPF-----QHLNLHNLKYLTSKQSLFDHAVFIDYYQDLINQKYN- 165
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
E +P ++ GGSY G L+AW RLK+PH+ G+ ASSA +
Sbjct: 166 KTEKNPWIVIGGSYAGALSAWFRLKFPHLVAGSWASSAVV 205
>gi|281346012|gb|EFB21596.1| hypothetical protein PANDA_016894 [Ailuropoda melanoleuca]
Length = 509
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/180 (37%), Positives = 99/180 (55%), Gaps = 25/180 (13%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HW ++ GP+FL+ G EG + +V G +
Sbjct: 54 WLEQPLDPFNTSDQRSFLQRYWVNDQHWA--SQRGPVFLHLGGEGSLRSGSVMRGHPAAL 111
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA---------------LA 176
AP +GA+++ EHR+YG S+P +VA L +L++ A A
Sbjct: 112 APAWGALVIGLEHRFYGLSIPAEGLDVA-----QLRFLSSRHAECAGTPSEEGPQSLPSA 166
Query: 177 DFAVFITNLKQ--NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
D A L + N+S ASP + FGGSY G LAAW RLK+PH+ + ++ASSAP+ D
Sbjct: 167 DVASARRALARLFNVST-ASPWICFGGSYAGSLAAWARLKFPHLILASVASSAPVRAVLD 225
>gi|302807545|ref|XP_002985467.1| hypothetical protein SELMODRAFT_122204 [Selaginella moellendorffii]
gi|300146930|gb|EFJ13597.1| hypothetical protein SELMODRAFT_122204 [Selaginella moellendorffii]
Length = 472
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/173 (37%), Positives = 97/173 (56%), Gaps = 16/173 (9%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+ ++ Q LDH++ D TFSQRY TD++ PN GP+FL EG + + +
Sbjct: 33 DAHWYTQTLDHYATQDDRTFSQRYYEFTDYFDAPN--GPVFLKICPEGPC--VGIQNDYS 88
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------I 182
+A RFGA +V EHRYYG+S P+ L YL+++QAL D A F +
Sbjct: 89 AVLAKRFGAAIVSLEHRYYGQSSPF-----KIHATENLIYLSSKQALFDLAAFREYYQDL 143
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
N + N S +P ++ G SY G L+AW +LK+PH+A+G++ASS + D+
Sbjct: 144 INHRTN-STRDNPWIVIGWSYAGALSAWFKLKFPHLAVGSVASSGIVQAIFDV 195
>gi|195109598|ref|XP_001999370.1| GI24472 [Drosophila mojavensis]
gi|193915964|gb|EDW14831.1| GI24472 [Drosophila mojavensis]
Length = 499
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 73/226 (32%), Positives = 113/226 (50%), Gaps = 10/226 (4%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ Q LDHF D T+ RY N + GP++++ G E I +++G D+
Sbjct: 56 WLTQNLDHFEAGDNRTWQMRYFRNAKY---HKPQGPMYIFLGGEWTITPGLLSTGLTHDM 112
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-QNLS 190
A +L + EHRYYG+S P+ + + +N L YL QALAD A FI K Q+ +
Sbjct: 113 AVENAGILFYTEHRYYGQSWPFENNNLTVKN---LKYLNLHQALADVAHFIRYQKSQSAN 169
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
S V+L GGSY G +AAWM YP + ASSAP+L D E + + +S
Sbjct: 170 LTHSKVILIGGSYSGSMAAWMTHLYPELVAAVWASSAPLLAKADFY--EYMHRVENSLTL 227
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
++C ++ + LV++ EN +L + + C N+++ L
Sbjct: 228 SYGSNCTRRLERGFEHLVNLF-NENKASDLLRRLNACPNYNASDSL 272
>gi|226502825|ref|NP_001146300.1| uncharacterized protein LOC100279875 precursor [Zea mays]
gi|219886553|gb|ACL53651.1| unknown [Zea mays]
Length = 478
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 66/168 (39%), Positives = 94/168 (55%), Gaps = 15/168 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+ R+ QRLDHFS D F QRY D+ P GP+FL E + + + ++
Sbjct: 38 DERWMNQRLDHFSPTDHRQFKQRYFEFLDYHRAPG--GPVFLRICGESACD--GIPNDYL 93
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT----- 183
+A +FGA +V PEHRYYG+S P+ ++ +N L +L+++QAL D AVF
Sbjct: 94 AVLAKKFGAAVVTPEHRYYGKSSPF--KQLTTEN---LRFLSSKQALFDLAVFRQYYQES 148
Query: 184 -NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N + N S +P + G SY G L AW RLK+PH+ G+LASS +L
Sbjct: 149 LNARYNRSGFDNPWFVIGVSYSGALNAWFRLKFPHLTCGSLASSGVVL 196
>gi|302807541|ref|XP_002985465.1| hypothetical protein SELMODRAFT_424499 [Selaginella moellendorffii]
gi|300146928|gb|EFJ13595.1| hypothetical protein SELMODRAFT_424499 [Selaginella moellendorffii]
Length = 458
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/169 (37%), Positives = 97/169 (57%), Gaps = 16/169 (9%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+ ++ Q LDH++ D TFSQRY TD++ PN GP+FL EG + + +
Sbjct: 33 DAHWYTQTLDHYATQDDRTFSQRYYEFTDYFDAPN--GPVFLKICPEGPC--VGIQNDYS 88
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------I 182
+A RFGA +V EHRYYG+S P+ + L YL+++QAL D A F +
Sbjct: 89 AVLAKRFGAAIVSLEHRYYGQSSPFKT-----HATENLIYLSSKQALYDLAAFREYYQDL 143
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ 231
N + N S +P ++ G SY G L+AW +LK+PH+A+G++ASS + +
Sbjct: 144 INHRTN-STRDNPWIVIGWSYAGALSAWFKLKFPHLAVGSVASSGIVAE 191
>gi|157119583|ref|XP_001659435.1| lysosomal pro-X carboxypeptidase, putative [Aedes aegypti]
gi|108875272|gb|EAT39497.1| AAEL008708-PA [Aedes aegypti]
Length = 467
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 80/273 (29%), Positives = 129/273 (47%), Gaps = 23/273 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+FE ++DHF+ ++ TFS RY N +H GPIF+ G+ G IE ++ G +D+
Sbjct: 24 WFETKVDHFNPRNVDTFSMRYYSNDEHSYPK---GPIFVIVGSNGPIETRYLSEGLFYDV 80
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A GA L EHRY+G S+P V + L +LT +QALAD A F+ ++K +
Sbjct: 81 AYLEGAFLFANEHRYFGHSLP-----VDDASTNNLDFLTIDQALADLAAFVHHIKHEVVR 135
Query: 192 E-ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
+ V+L G YGG LA W ++PH+ G SS + D+ +F
Sbjct: 136 NPEAKVILMGYGYGGSLATWFHQQFPHLTNGVWVSSGTVEADFDLTGYMESLGETIGEFG 195
Query: 251 RESASCFNTIKESW---GELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYL 307
C+ TI + L+++ + + L + F+LC L+ T+D+ D + ++L
Sbjct: 196 --GRGCYGTIFSGFRVAQNLIAMDRAD----VLNEQFNLCEALD-TDDVMD----STAFL 244
Query: 308 AMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDA 340
+ + M ++C IDN D
Sbjct: 245 LGLQRAIEDEIMHLRNTQSTTDMCGIIDNEEDT 277
>gi|297802314|ref|XP_002869041.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
gi|297314877|gb|EFH45300.1| serine carboxypeptidase S28 family protein [Arabidopsis lyrata
subsp. lyrata]
Length = 491
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 102/189 (53%), Gaps = 23/189 (12%)
Query: 55 PQRQQRQQQQQYRYETR---YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
P+R + +Y TR +F Q LDH+S +D F QRY DH P+ GPIF+
Sbjct: 28 PRRISHGLTESSKYLTRDELWFNQTLDHYSPSDHRKFRQRYYEYLDHLRVPD--GPIFMM 85
Query: 112 CGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
EG + + ++ +A +F A +V EHRYYG+S P+ S +A +N L YL++
Sbjct: 86 ICGEGPCN--GIPNDYITVLAKKFDAGIVSLEHRYYGKSSPFKS--LATEN---LKYLSS 138
Query: 172 EQALADFAVF----------ITNLKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAI 220
+QAL D A F N+K N S +P FG SY G L+AW RLK+PH+
Sbjct: 139 KQALFDLAAFRQYYQARSNDSLNVKFNRSGNVENPWFFFGASYSGALSAWFRLKFPHLTC 198
Query: 221 GALASSAPI 229
G+LASSA +
Sbjct: 199 GSLASSAVV 207
>gi|432101950|gb|ELK29783.1| Thymus-specific serine protease [Myotis davidii]
Length = 399
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 69/164 (42%), Positives = 95/164 (57%), Gaps = 8/164 (4%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY I+ HW +R GP+FL+ G EG + +V G +
Sbjct: 59 WLEQPLDPFNASDRRSFLQRYWISDQHWA--SRDGPVFLHLGGEGSLGPGSVMRGHPAAL 116
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
AP GA+++ EHR+YG S+P G N T L +L++ ALAD L + L+
Sbjct: 117 APALGALVIGLEHRFYGLSIPAGGL-----NTTQLRFLSSRHALADVVSARLELSRLLNV 171
Query: 192 EASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+S V FGGSY G LAAW RLK+PH+ A+ASSAP+ D
Sbjct: 172 SSSSRWVCFGGSYAGSLAAWARLKFPHLLFAAVASSAPVRAVLD 215
>gi|256574599|dbj|BAH98105.1| serine protease [Entamoeba invadens]
gi|440300845|gb|ELP93292.1| hypothetical protein EIN_056770 [Entamoeba invadens IP1]
Length = 468
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 71/231 (30%), Positives = 121/231 (52%), Gaps = 16/231 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD- 130
Y +Q LDHF + S +Y +N D + P GP+F+ G EG A+ FV D
Sbjct: 33 YMDQPLDHFDLTNTKKISIQYFLN-DTYFTPE--GPLFVDLGGEGAASAGAIGGKFVIDK 89
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A ++ M++ EHR+YG S+P G + L YL+ QAL D+ I+ +K+ +
Sbjct: 90 YAQKYKGMMLAIEHRFYGRSLPVGGL-----SQENLGYLSGIQALEDYIHIISEIKKQ-N 143
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
PV++FGGSY G LA W+R KYP++ A+ASSAP+L F +++ D
Sbjct: 144 QITGPVIVFGGSYSGNLAVWIRQKYPNVVYAAVASSAPLLATNQFT---QFMDVIEKDMG 200
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLE 301
+ A+ + K++ + + + +G+ ++ F C+++ + +D +L+
Sbjct: 201 PQCAAAW---KQANANIEQLYKTADGIKQIQTDFKTCKDIKNDKDFTLFLQ 248
>gi|24648179|ref|NP_650804.1| CG3739 [Drosophila melanogaster]
gi|7300510|gb|AAF55664.1| CG3739 [Drosophila melanogaster]
Length = 547
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 137/297 (46%), Gaps = 23/297 (7%)
Query: 40 RAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHW 99
A FV L L P + + + + E R+ Q+LD+F ++ T+ R IN ++
Sbjct: 88 EANAFVKSLRELHRGPPVEPMKTRA--KVEERWITQKLDNFDDSNNATWQDRIYINNKYF 145
Query: 100 VGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVA 159
V + PIF+Y G E I+ + SG DIA + L++ EHR++G+S+P T ++
Sbjct: 146 VDGS---PIFIYLGGEWAIDPSGITSGLWKDIAKQHNGSLLYTEHRFFGQSIPI--TPLS 200
Query: 160 YQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIA 219
+N Y + EQALAD I LKQ + S VV+ G SY +A W+R YP I
Sbjct: 201 TENLA--KYQSVEQALADVINVIATLKQEDKYKDSKVVVSGCSYSATMATWIRKLYPEII 258
Query: 220 IGALASSAPIL---QFEDIVPPETFYNIVSSDFKRESAS-CFNTIKESWGELVSVGQKEN 275
G+ ASSAP+L F+D + +V + C++ I + ++ + N
Sbjct: 259 RGSWASSAPLLAKVNFKD------YMKVVGESYATLGGQYCYDLIDNATSYYENLFEIGN 312
Query: 276 GLLELTKTFHLCRELNSTEDLADWL---ESAYSYLAMVDYPYPSDFMMPLPGYPIRE 329
G + K +LC N + W A + + Y P + +P +RE
Sbjct: 313 G-TQAVKELNLCSNFNVNSEQDRWQIFSTIANIFAGIAQYQKPEKYDIPTYCSILRE 368
>gi|66772019|gb|AAY55321.1| IP12634p [Drosophila melanogaster]
Length = 490
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 137/297 (46%), Gaps = 23/297 (7%)
Query: 40 RAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHW 99
A FV L L P + + + + E R+ Q+LD+F ++ T+ R IN ++
Sbjct: 31 EANAFVKSLRELHRGPPVEPMKTRA--KVEERWITQKLDNFDDSNNATWQDRIYINNKYF 88
Query: 100 VGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVA 159
V + PIF+Y G E I+ + SG DIA + L++ EHR++G+S+P T ++
Sbjct: 89 VDGS---PIFIYLGGEWAIDPSGITSGLWKDIAKQHNGSLLYTEHRFFGQSIPI--TPLS 143
Query: 160 YQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIA 219
+N Y + EQALAD I LKQ + S VV+ G SY +A W+R YP I
Sbjct: 144 TENLA--KYQSVEQALADVINVIATLKQEDKYKDSKVVVSGCSYSATMATWIRKLYPEII 201
Query: 220 IGALASSAPIL---QFEDIVPPETFYNIVSSDFKRESAS-CFNTIKESWGELVSVGQKEN 275
G+ ASSAP+L F+D + +V + C++ I + ++ + N
Sbjct: 202 RGSWASSAPLLAKVNFKD------YMKVVGESYATLGGQYCYDLIDNATSYYENLFEIGN 255
Query: 276 GLLELTKTFHLCRELNSTEDLADWL---ESAYSYLAMVDYPYPSDFMMPLPGYPIRE 329
G + K +LC N + W A + + Y P + +P +RE
Sbjct: 256 G-TQAVKELNLCSNFNVNSEQDRWQIFSTIANIFAGIAQYQKPEKYDIPTYCSILRE 311
>gi|52789079|gb|AAT09104.1| serine peptidase [Bigelowiella natans]
Length = 546
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 158/351 (45%), Gaps = 50/351 (14%)
Query: 43 RFVGKLPHLTEPPQRQQR--QQQQQYRYETRYFEQRL-DHFS-FADLPTFSQRYLINTDH 98
RF G H++ Q Q + T +++ L DHF +D + QRY +++
Sbjct: 27 RFGGDDQHVSFTLQGNQSLLESHAGSNSTTHFYKNALLDHFGGLSDEKHWLQRYYVDSSQ 86
Query: 99 WVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEV 158
W G P+FLY G EG + +S F++++A A+++ EHR+YGES P V
Sbjct: 87 WGGEGY--PVFLYIGGEGP-QGPVSSSLFMYELAVEHKALVLALEHRFYGESRP-----V 138
Query: 159 AYQNATTLSYLTAEQALADFAVFITNLKQ---NL-------------SAEASPVVLFGGS 202
+ L +LT+ QAL D A F+ +K N+ SA+ SP V FGGS
Sbjct: 139 EDMSDANLKFLTSHQALGDLARFVEYIKAYDPNVNDAKSSPPLSLPASAQESPFVAFGGS 198
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPIL------QFEDIVPPETFYNIVSSDFKRESA-- 254
Y G LAAW +LKYP + IG++ASSAP+ ++ +V Y ++ + SA
Sbjct: 199 YPGNLAAWFKLKYPSVVIGSVASSAPVFAEYDFAEYGGVVGRALSYPLIGGSDQCYSAVE 258
Query: 255 SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY-SYLAMVDYP 313
T+K G + ++ C + DLA + + ++ +V Y
Sbjct: 259 KAVTTLKTLLDSTTPAGSSD----KIPSYLRPCSPIGGPLDLATYEAQIFGAFQGVVQYN 314
Query: 314 YPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF 364
+ P Y + ++C + + D IL R+ + + + Y G V C
Sbjct: 315 -----LENRPPY-VSDLCTAMTDGNDDDDILLRLVKTLKLVY---GGVTCM 356
>gi|110289392|gb|ABB47879.2| prolyl carboxypeptidase like protein, putative, expressed [Oryza
sativa Japonica Group]
Length = 490
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 93/169 (55%), Gaps = 17/169 (10%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E R+ +QRLDHFS D F QRY D+ G GP+FL E + + ++
Sbjct: 51 EERWMDQRLDHFSPTDHRQFKQRYYEFADYHAGG---GPVFLRICGESSCN--GIPNDYL 105
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------I 182
++ +FGA +V PEHRYYG+S P+ S L +L+++QAL D F I
Sbjct: 106 AVLSKKFGAAVVTPEHRYYGKSSPFESL-----TTENLRFLSSKQALFDLVAFRQHYQEI 160
Query: 183 TNLKQNLSAE-ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N + N S+ +P +FG SY G L+AW RLK+PH+ G+LASS +L
Sbjct: 161 LNARYNRSSGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGVVL 209
>gi|8574032|emb|CAB94769.1| protease, serine, 16 (thymus) [Homo sapiens]
gi|119623494|gb|EAX03089.1| protease, serine, 16 (thymus), isoform CRA_b [Homo sapiens]
Length = 541
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 89/295 (30%), Positives = 134/295 (45%), Gaps = 51/295 (17%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HWVG + GPIFL+ G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSDRRSFLQRYWVNDQHWVGQD--GPIFLHLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA----------------- 174
AP +GA+++ EHR+YG S+P G E+A L +L++ A
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRLAMGKSSGIPSDEDRPSPP 173
Query: 175 ----LADFAVFITNLKQ--NLSAEASPVVLFGGSYGGMLAAWMRLK------YPHIAIGA 222
LAD L + N+S+ +SP + FGGSY G LAAW RLK +PH+ +
Sbjct: 174 FDPRLADVVSARLALSRLFNISS-SSPWICFGGSYAGSLAAWARLKVLRLLRFPHLIFAS 232
Query: 223 LASSAPIL------QFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENG 276
+ASSAP+ ++ D+V + + +A E L S G +
Sbjct: 233 VASSAPVRAVLDFSEYNDVVSRSLMSTAIGGSLECRAAVSV-AFAEVERRLRSGGAAQAA 291
Query: 277 LLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L C L E+ A+ L + + + V Y PL +R++C
Sbjct: 292 ---LRTELSACGPLGRAENQAELLGALQALVGGV-VQYDGQTGAPL---SVRQLC 339
>gi|17539994|ref|NP_501599.1| Protein PCP-2 [Caenorhabditis elegans]
gi|3876284|emb|CAB05187.1| Protein PCP-2 [Caenorhabditis elegans]
Length = 1080
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 100/336 (29%), Positives = 159/336 (47%), Gaps = 36/336 (10%)
Query: 37 KFRRAPRFVGKLPHLTEPPQRQQRQQQQQY--RYETRYFEQRLDHFSFADLPTFSQRYLI 94
KF++ F+G+ PH P+ Y +ET F QR DHF+ + F QR+
Sbjct: 544 KFKKV--FLGRPPH-GFLPESDFNMSPDDYPAGFETGSFRQRQDHFNNQNADFFQQRFFK 600
Query: 95 NTDHWVGPNRLGPIFLYCGNEG-DIEWFAVNSGFVWDI-APRFGAMLVFPEHRYYGESMP 152
NT W P GP FL G EG D + +N + I A ++GA + EHR+YGES
Sbjct: 601 NT-QWAKPG--GPNFLMIGGEGPDKASWVLNENLPYLIWAKKYGATVYMLEHRFYGESR- 656
Query: 153 YGSTEVAYQNATTLSYLTAEQALADFAVFI--TNLKQNLSAEASPVVLFGGSYGGMLAAW 210
+ T + L++ Q + D A FI N+K S ++P + FGGSY G+++AW
Sbjct: 657 -------VGDNTNFNRLSSLQMIYDIADFIRSVNIK---SGTSNPWITFGGSYSGLISAW 706
Query: 211 MRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSV 270
R +P + +GA+ASSAP+ D + + + + +++C + I+E + + ++
Sbjct: 707 TREVFPELVVGAVASSAPVFAKTDFYE---YLMVAENSIRSYNSTCADRIQEGFNSMRAL 763
Query: 271 GQKENGLLELTKTFHLCREL--NSTE-DLADWLESAYSYLAMVDYPYPSDFMMPLPG-YP 326
+ G L+ F L N T+ D + + YS D Y D M Y
Sbjct: 764 FLTKGGRQTLSSMFKLDPPFADNVTDIDQHYFFSNIYSNF-QGDVQYSGDNMGSYANSYG 822
Query: 327 IREVCKKIDNAPDATSILERIF---EGVSVYYNYTG 359
I ++CK + N D+ + L I E ++ +YN G
Sbjct: 823 IPDMCKIMTN--DSNTPLNNIVAFNEYMANFYNGGG 856
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 126/297 (42%), Gaps = 27/297 (9%)
Query: 50 HLTEPPQRQQRQ-QQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPI 108
H + +R R+ + T Y Q LDH TF+QRYL + + +
Sbjct: 24 HFKKHLKRGSRKYGNSETAMTTGYMAQNLDHLIGNASGTFTQRYLYSQQYTL---HQRTA 80
Query: 109 FLYC-GNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLS 167
FLY G EG + V A +FGA + EHRYYGES P V +A L
Sbjct: 81 FLYVSGVEGPNVVLDDRTPIV-KTAKQFGATIFTLEHRYYGESKP----NVDKLDAYNLR 135
Query: 168 YLTAEQALADFAVFI--TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALAS 225
+L + QA D FI N++ N+ + VV +G YGG++AA R P+ G +AS
Sbjct: 136 HLNSFQATQDVISFIKYANVQFNMDQDVRWVV-WGIGYGGIIAAEARKLDPNSVSGVIAS 194
Query: 226 SAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFH 285
S P+ D IV ++ + C+ + + ++ + G L ++ F
Sbjct: 195 STPLTHEYDFWRFNHRVAIVLAE--TGGSLCYRKVANGFADIREAMKTPEGRLNISDLFQ 252
Query: 286 LCRELNSTEDLADWLESAYSYLAMVD-----YPYPSDFMMPLPGYPIREVCKKIDNA 337
L LN T ++ + YLA++ + DF + I ++C ID +
Sbjct: 253 LNPRLNETA--LNYNDIQMFYLAIIAPFQEIVEFNDDFDL-----SIADLCTTIDKS 302
>gi|302795835|ref|XP_002979680.1| hypothetical protein SELMODRAFT_419387 [Selaginella moellendorffii]
gi|300152440|gb|EFJ19082.1| hypothetical protein SELMODRAFT_419387 [Selaginella moellendorffii]
Length = 462
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 82/241 (34%), Positives = 122/241 (50%), Gaps = 30/241 (12%)
Query: 79 HFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAM 138
H S AD TF+QRY TD++ PN GP+FL EG + + + +A RFGA
Sbjct: 27 HRSDADDRTFAQRYYEFTDYFDAPN--GPVFLKICGEGPC--VGIQNDYSAVLAKRFGAA 82
Query: 139 LVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ITNLKQNLSAE 192
+V EHRYYG+S P+ S L YL+++QAL D A F + N + N +++
Sbjct: 83 IVSLEHRYYGQSSPFKS-----HATENLIYLSSKQALFDLAAFREYYQDLINHRTNSTSD 137
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA---PILQFEDIVPPETFYNIVSSDF 249
+P ++ GGSY G L+AW +LK+PH+A+G++ASS I +F + + S
Sbjct: 138 -NPWIVMGGSYSGALSAWFKLKFPHLAVGSVASSGVVQAIFKFTKFDEQAKLFLVAES-- 194
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED----LADWLESAYS 305
A+C ++ + L G KEN + TK +L+ D LAD A+
Sbjct: 195 --AGATCSAALR-AVTRLAEQGLKENSV--STKALFNAEQLDVDGDFLYFLADAAAIAFQ 249
Query: 306 Y 306
Y
Sbjct: 250 Y 250
>gi|168067182|ref|XP_001785503.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162662877|gb|EDQ49678.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 483
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 67/165 (40%), Positives = 94/165 (56%), Gaps = 15/165 (9%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F QRLDHFS D F QRY D++ PN PIFL E + + ++ +
Sbjct: 47 WFRQRLDHFSSQDRREFQQRYYEFLDYFKDPN--APIFLRICGESTCS--GIPNDYLLVL 102
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT------NL 185
A +FGA +V EHRYYGES P+ E+ N L YL+++QAL D A + N
Sbjct: 103 AKKFGAAVVSLEHRYYGESSPF--EELTTDN---LKYLSSKQALFDLASYRNFYQESINK 157
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
K N + + +P ++FG SY G L+AW RLK+PH+ G+L+SS +L
Sbjct: 158 KFNTTEKENPWIVFGVSYPGALSAWFRLKFPHLVRGSLSSSGVVL 202
>gi|110289391|gb|ABB47878.2| prolyl carboxypeptidase like protein, putative, expressed [Oryza
sativa Japonica Group]
gi|215706481|dbj|BAG93337.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 404
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 93/169 (55%), Gaps = 17/169 (10%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E R+ +QRLDHFS D F QRY D+ G GP+FL E + + ++
Sbjct: 51 EERWMDQRLDHFSPTDHRQFKQRYYEFADYHAGG---GPVFLRICGESSCN--GIPNDYL 105
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------I 182
++ +FGA +V PEHRYYG+S P+ S L +L+++QAL D F I
Sbjct: 106 AVLSKKFGAAVVTPEHRYYGKSSPFESL-----TTENLRFLSSKQALFDLVAFRQHYQEI 160
Query: 183 TNLKQNLSAE-ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N + N S+ +P +FG SY G L+AW RLK+PH+ G+LASS +L
Sbjct: 161 LNARYNRSSGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGVVL 209
>gi|194899982|ref|XP_001979536.1| GG23262 [Drosophila erecta]
gi|190651239|gb|EDV48494.1| GG23262 [Drosophila erecta]
Length = 485
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 88/298 (29%), Positives = 140/298 (46%), Gaps = 25/298 (8%)
Query: 40 RAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHW 99
A FV L L P Q + + E R+ Q+LD+F ++ T+ R IN ++
Sbjct: 26 EANAFVKSLRQLHRGPPVQPMTTRA--KVEERWITQKLDNFDASNNATWQNRIYINNKYF 83
Query: 100 VGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVA 159
V + PIF+Y G E I+ + SG DIA + L++ EHR++G+S+P T ++
Sbjct: 84 VDGS---PIFIYLGGEWAIDPSGITSGLWKDIAKQHNGSLLYTEHRFFGQSIPI--TPLS 138
Query: 160 YQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIA 219
+N Y + EQALAD + LKQ + S VV+ G SY +A W+R YP +
Sbjct: 139 TENLE--KYQSVEQALADVINVLATLKQEDKYKDSKVVVSGCSYSATMATWIRKLYPDVI 196
Query: 220 IGALASSAPIL---QFEDI--VPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKE 274
G+ ASSAP+L F+D V E+ Y+I+ + C++ I + ++ +
Sbjct: 197 RGSWASSAPLLAKVNFKDYMKVVGES-YSILGGQY------CYDLIDNATSYYENLFEIG 249
Query: 275 NGLLELTKTFHLCRELNSTEDLADWL---ESAYSYLAMVDYPYPSDFMMPLPGYPIRE 329
NG + K +LC N + W A + + Y P + +P +RE
Sbjct: 250 NG-TQAAKELNLCSNFNVNSEQDRWQIFSTIANLFAGIAQYQKPEKYDIPKYCSILRE 306
>gi|157816660|gb|ABV82323.1| IP19978p [Drosophila melanogaster]
Length = 198
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 53/110 (48%), Positives = 70/110 (63%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
Q+++YE + F+ LDHFSF TF+ RYL N N PIF Y GNEGDIE FA
Sbjct: 30 QRFKYEIKEFQVPLDHFSFLINATFNIRYLYNDSFVDKSNARTPIFFYTGNEGDIELFAQ 89
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQ 173
N+GF+W+ A R A+++F EHRYYG+S+P+GS+ L+Y T EQ
Sbjct: 90 NTGFLWEQAERQRALVIFAEHRYYGKSLPFGSSTFNTSLPEHLAYFTVEQ 139
>gi|194899968|ref|XP_001979529.1| GG23317 [Drosophila erecta]
gi|190651232|gb|EDV48487.1| GG23317 [Drosophila erecta]
Length = 486
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 127/281 (45%), Gaps = 36/281 (12%)
Query: 51 LTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFL 110
L P Q R + + E + +Q++DHF + T+ RY N ++ P GPI++
Sbjct: 31 LNHEPVLQIRSKNELAAVEELWLDQKVDHFDEHNNKTWRMRYYSNAKYF-KPQ--GPIYI 87
Query: 111 YCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLT 170
+ G E I +++G D+A ML + EHRYYG S+P+G+ N L L+
Sbjct: 88 FVGGEWTISPGLLSTGLTHDMAVENSGMLFYTEHRYYGLSLPFGNESYRLNN---LKQLS 144
Query: 171 AEQALADFAVFITNLKQNL-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
Q+LAD A FI + + N E S V+L GGSY G L AWM YP + + ASSAP+
Sbjct: 145 LHQSLADLAHFIRHQQSNTPEMEDSKVILVGGSYSGSLVAWMTQLYPDLIAASWASSAPL 204
Query: 230 LQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF----- 284
L +DF +I+ S+G S+ + E GL L K F
Sbjct: 205 L--------------AKADFFEYMEMVDKSIRLSYGHNCSL-RIEKGLKFLVKLFDGDEI 249
Query: 285 -HLCRELNSTE--------DLADWLESAYSYLAMVDYPYPS 316
L LN E D A + +Y A+V Y +
Sbjct: 250 QELLYNLNGCEGYRPKNPLDRAAFFNGLGNYFALVVQSYSA 290
>gi|388454936|ref|ZP_10137231.1| serine carboxypeptidase [Fluoribacter dumoffii Tex-KL]
Length = 467
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 76/220 (34%), Positives = 112/220 (50%), Gaps = 21/220 (9%)
Query: 17 VITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQR 76
IT+ +S+ + S P++ R++ K+ P +Q + YF+Q
Sbjct: 5 CITLFFLSLYAFFS----PAQAGMIERYLQKINEEKTPLLAEQT-------IKLGYFKQL 53
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFG 136
+DH + F QRY I D GP P+F Y E A+N G + + A +F
Sbjct: 54 IDHNN-PGTGNFYQRYYI--DESYGPEMDAPVFFYICGEAACSKRALN-GAIRNYAQKFH 109
Query: 137 AMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPV 196
A LV EHRYYG+S+P+ + + L +LT E AL D A F +LK + V
Sbjct: 110 AKLVALEHRYYGDSLPFNTLSTEH-----LRFLTTEAALDDLAAFQRHLKNERNWNGKWV 164
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
FGGSY G L+A+ RLK+P++ +GALASSAP++ ED +
Sbjct: 165 A-FGGSYPGSLSAYYRLKFPYLVVGALASSAPVMAKEDFI 203
>gi|10140733|gb|AAG13566.1|AC073867_12 putative serine peptidase [Oryza sativa Japonica Group]
Length = 628
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 85/154 (55%), Gaps = 15/154 (9%)
Query: 83 ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFP 142
AD F QRY D++ P GPIFLY E + + ++ +A +FGA +V P
Sbjct: 188 ADHRQFKQRYYEFLDYYRAPK--GPIFLYICGESSCN--GIPNSYLAVMAKKFGAAVVSP 243
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ITNLKQNLSAEASPV 196
EHRYYG+S P+ S L +L+++QAL D AVF N K N S S
Sbjct: 244 EHRYYGKSSPFESL-----TTENLRFLSSKQALFDLAVFRQYYQETLNAKYNRSGADSSW 298
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
+FGGSY G L+AW RLK+PH+ G+LASS +L
Sbjct: 299 FVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL 332
>gi|301102560|ref|XP_002900367.1| serine protease family S28, putative [Phytophthora infestans T30-4]
gi|262102108|gb|EEY60160.1| serine protease family S28, putative [Phytophthora infestans T30-4]
Length = 526
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 145/296 (48%), Gaps = 33/296 (11%)
Query: 77 LDHFS-FADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV-NSGFVWDIAPR 134
LDHF+ + + QRY N + W G R P+FLY G EG + A+ N FV+ +A +
Sbjct: 72 LDHFAPVSKRSKWKQRYQANEEFWGG--RGFPVFLYIGGEGPLGPKAITNRTFVYYLAEQ 129
Query: 135 FGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEAS 194
A+L+ EHR+YG+S P + +++ N L+YL++EQALAD A F + +
Sbjct: 130 HRALLLALEHRFYGKSYP--TEDMSLPN---LAYLSSEQALADLAHFHSFVTDKYGLTDE 184
Query: 195 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR--E 252
V FGGSY G LAAW++LKYP + G +ASSAP+ D Y V D R
Sbjct: 185 KWVAFGGSYPGNLAAWVKLKYPALFAGTVASSAPVQAKTDFFE----YMEVVGDGLRYFG 240
Query: 253 SASCFNTIKES---WGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY-SYLA 308
C++ ++++ G L+ GQK G ++ + F C + + D + + S ++
Sbjct: 241 GGECYHEVEKAITQLGHLMDEGQK--GRDKVAELFKPCYPMTNEFDDSVFESSVMGAFQD 298
Query: 309 MVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCF 364
+ Y + +M L EVC+ DA E ++ + N T DC
Sbjct: 299 IAQYNGIHEGVMTLS-----EVCEHFAKPGDAV-------EKLASFINKTRVGDCL 342
>gi|218184873|gb|EEC67300.1| hypothetical protein OsI_34290 [Oryza sativa Indica Group]
Length = 524
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 86/154 (55%), Gaps = 15/154 (9%)
Query: 83 ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFP 142
AD F+QRY D++ P GPIFLY E + + ++ +A +FGA +V P
Sbjct: 84 ADHRQFNQRYYEFLDYYRAPK--GPIFLYICGESSCN--GIPNSYLAVMAKKFGAAVVSP 139
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ITNLKQNLSAEASPV 196
EHRYYG+S P+ S L +L+++QAL D AVF N K N S S
Sbjct: 140 EHRYYGKSSPFESL-----TTENLRFLSSKQALFDLAVFRQYYQETLNAKYNRSGADSSW 194
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
+FGGSY G L+AW RLK+PH+ G+LASS +L
Sbjct: 195 FVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL 228
>gi|15144318|gb|AAK84459.1|AC087192_20 putative serine peptidase [Oryza sativa Japonica Group]
Length = 267
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 85/154 (55%), Gaps = 15/154 (9%)
Query: 83 ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFP 142
AD F QRY D++ P GPIFLY E + + ++ +A +FGA +V P
Sbjct: 24 ADHRQFKQRYYEFLDYYRAPK--GPIFLYICGESSCN--GIPNSYLAVMAKKFGAAVVSP 79
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ITNLKQNLSAEASPV 196
EHRYYG+S P+ S L +L+++QAL D AVF N K N S S
Sbjct: 80 EHRYYGKSSPFESLTT-----ENLRFLSSKQALFDLAVFRQYYQETLNAKYNRSGADSSW 134
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
+FGGSY G L+AW RLK+PH+ G+LASS +L
Sbjct: 135 FVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL 168
>gi|324509651|gb|ADY44052.1| Serine protease, partial [Ascaris suum]
Length = 526
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 98/365 (26%), Positives = 159/365 (43%), Gaps = 32/365 (8%)
Query: 17 VITIVIISILSPLSLAAQPSKFRRAPRFVGKLP--HLTEPPQRQQRQQQQQYRYETRYFE 74
++ + + ++ + LS A + RAP F+ P E R + + + E
Sbjct: 3 ILALSVFAVSTLLSDAYIHLRHPRAPHFIMGRPPGGFLEAGYRSALGENENVKASYPNVE 62
Query: 75 -----QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI--EWFAVNSGF 127
Q LDHF+ +D T+ QR N + N +F+ G E I +W +
Sbjct: 63 EFTIIQPLDHFNKSDTRTWEQRVQYNPMFY---NNQSVVFVLIGGESMINQKWVGNENVS 119
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+ A FGA EHR++G S P+ V L Y T EQALAD A FI +
Sbjct: 120 MMQWAKEFGAAAFQLEHRFFGYSRPF--PLVLTMTTEALVYCTTEQALADLAEFIQQMNA 177
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI---LQFEDIVPPETFYNI 244
S V FGGSY G L+AW R KYP + +GA+ASSAP+ L F + Y++
Sbjct: 178 KYSFVNPRWVTFGGSYPGSLSAWFRSKYPQLTVGAVASSAPLNLKLDFYE-------YSM 230
Query: 245 VSSDFKRES-ASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE----DLADW 299
V + RE+ C ++ + + + G +L + F+L + L ++
Sbjct: 231 VVENVLRETDPECHWRVENAIAYIEKIMLTSTGRQQLNQVFNLQPPFDEASVTPLTLHNF 290
Query: 300 LESAYS-YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSI--LERIFEGVSVYYN 356
+ + Y+ + +V Y Y + G +R +C + AP + + + + V+ +Y
Sbjct: 291 MSNLYTMFQGIVQYTYDGRNEHTMGGMNVRNLCNTVTKAPADEPLQQMRAVMDFVNSFYP 350
Query: 357 YTGNV 361
TGN
Sbjct: 351 QTGNC 355
>gi|222613128|gb|EEE51260.1| hypothetical protein OsJ_32132 [Oryza sativa Japonica Group]
Length = 524
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/154 (41%), Positives = 85/154 (55%), Gaps = 15/154 (9%)
Query: 83 ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFP 142
AD F QRY D++ P GPIFLY E + + ++ +A +FGA +V P
Sbjct: 84 ADHRQFKQRYYEFLDYYRAPK--GPIFLYICGESSCN--GIPNSYLAVMAKKFGAAVVSP 139
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ITNLKQNLSAEASPV 196
EHRYYG+S P+ S L +L+++QAL D AVF N K N S S
Sbjct: 140 EHRYYGKSSPFESL-----TTENLRFLSSKQALFDLAVFRQYYQETLNAKYNRSGADSSW 194
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
+FGGSY G L+AW RLK+PH+ G+LASS +L
Sbjct: 195 FVFGGSYAGALSAWFRLKFPHLTCGSLASSGVVL 228
>gi|344268147|ref|XP_003405924.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 464
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 79/273 (28%), Positives = 124/273 (45%), Gaps = 19/273 (6%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGN--EGDIEWFAVNSGFV 128
R+F Q+LDHF DL + QRY IN D + P GP+FL G W + N F+
Sbjct: 52 RWFTQKLDHFDQKDLSFWRQRYFIN-DAFYKPG--GPVFLMIGGMETAKRNWISRNLPFI 108
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
A R A+ + EHR+YG S P G A +L Y+ Q L D F + +
Sbjct: 109 -AYAERLHALCLVLEHRFYGHSQPTGDLSTA-----SLRYIRNHQVLGDIVNFRIKIAKL 162
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
+ + V FG YGG LA W R+KYP + A+ SSAP+ + + + ++ V +
Sbjct: 163 MGLTKNKWVAFGEFYGGSLAVWSRIKYPDLFAAAVGSSAPV---KVKINFDEYFEGVQTS 219
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELNSTEDLADWLESAYSY 306
+ C ++ + E++ + + L F LC + S++ + LE+ S+
Sbjct: 220 LDASNIKCSRAVQRALLEVIRMLKSPKSYSILKSDFMLCETPQNKSSQHIPFVLENLMSF 279
Query: 307 LA-MVDYPYPSDFMMPLPGYPIREVCKKIDNAP 338
L +V Y +M + I + C K+ P
Sbjct: 280 LIPIVQYNKKRKSVMNI--LSIDDFCDKMTETP 310
>gi|47224854|emb|CAG06424.1| unnamed protein product [Tetraodon nigroviridis]
Length = 418
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/141 (40%), Positives = 88/141 (62%), Gaps = 8/141 (5%)
Query: 90 QRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGE 149
QR+L+N W P+ GP+FLY G EG I + V +G D+A + A+L+ EHR+YG+
Sbjct: 4 QRFLVNEAFWRNPD--GPVFLYIGGEGPIFEYDVLAGHHVDMAQQHSALLLALEHRFYGD 61
Query: 150 SMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS-AEASPVVLFGGSYGGMLA 208
S+ + + L++L+++QALAD AVF + + + + + + FGGSY G L+
Sbjct: 62 SVNPDGLKTEH-----LAHLSSKQALADLAVFHQYISGSFNLSHGNTWISFGGSYAGALS 116
Query: 209 AWMRLKYPHIAIGALASSAPI 229
AW R K+PH+ GA+ASSAP+
Sbjct: 117 AWFRGKFPHLVFGAVASSAPV 137
>gi|344268145|ref|XP_003405923.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 504
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 81/273 (29%), Positives = 129/273 (47%), Gaps = 20/273 (7%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE--WFAVNSGFV 128
R+F Q+LDHF ++ + QRY IN D + P GP+FL G G + W + N FV
Sbjct: 64 RWFMQKLDHFDQKEI-FWRQRYFIN-DAFYKPG--GPVFLMIGGMGSAKRNWTSRNLPFV 119
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
A R GA+ + EHR+YG S P G A +L Y+ Q L D A F + +
Sbjct: 120 -AYAERLGALCLVLEHRFYGRSQPTGDLSTA-----SLRYIRNHQVLGDIANFRIKIAKL 173
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
+ + V FG YGG LA W R+KYP + A+ SSAP+ + + + ++ V
Sbjct: 174 MGLTKNKWVAFGEFYGGSLAVWSRIKYPDLFAAAVGSSAPV---KAEINFDEYFEEVQVS 230
Query: 249 FKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR--ELNSTEDLADWLESAYSY 306
++ C +++ + E+ + +L + F LC +++S + LE+ S+
Sbjct: 231 LDAHNSECSSSVYLALREVTKRLIHQKHYSKLKRDFMLCEPLQIDSKQHATFVLENLMSF 290
Query: 307 LA-MVDYPYPSDFMMPLPGYPIREVCKKIDNAP 338
L +V Y +M + + CKK+ P
Sbjct: 291 LIPIVQYNKKRKSVMNI--LSTDDFCKKMTETP 321
>gi|444523275|gb|ELV13498.1| Thymus-specific serine protease [Tupaia chinensis]
Length = 393
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 67/162 (41%), Positives = 95/162 (58%), Gaps = 8/162 (4%)
Query: 74 EQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAP 133
+Q LD F+ D +F QRY +N HW G + P+FL+ G EG + +V G +AP
Sbjct: 76 QQPLDPFNATDGRSFLQRYWVNAQHWAGQD--APVFLHLGGEGGLGPGSVMRGHPAALAP 133
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA 193
+GA+++ EHR+YG S+P G +A L +L++ ALAD A L + L+ A
Sbjct: 134 AWGALVISLEHRFYGLSVPAGGLGLA-----QLRFLSSRHALADAASARLELSRLLNVSA 188
Query: 194 S-PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
S P + FGGSY G LAAW RLK+PH+ ++ASSAP+ D
Sbjct: 189 SSPWICFGGSYAGSLAAWARLKFPHLVSASVASSAPVRAVLD 230
>gi|357146992|ref|XP_003574183.1| PREDICTED: probable serine protease EDA2-like [Brachypodium
distachyon]
Length = 489
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 68/170 (40%), Positives = 96/170 (56%), Gaps = 19/170 (11%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTD-HWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
E R+ QRLDHFS D F QR+ D H G GP+FL E + + + +
Sbjct: 50 EERWTNQRLDHFSPTDHRQFKQRHFEFLDYHRAG----GPVFLRICGESSCD--GIPNDY 103
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------ 181
+ +A +FGA +V PEHRYYG+S P+ + +N L +L+++QAL D AVF
Sbjct: 104 LAVLAKKFGAAVVTPEHRYYGKSSPF--ERLTTEN---LRFLSSKQALFDLAVFRQYYQD 158
Query: 182 ITNLKQNLSAE-ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
N + N S+ +P +FG SY G L+AW RLK+PH+ G+LASS +L
Sbjct: 159 ALNYRYNRSSGFDNPWFVFGVSYAGALSAWFRLKFPHLTCGSLASSGVVL 208
>gi|123418467|ref|XP_001305332.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121886845|gb|EAX92402.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 504
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/164 (39%), Positives = 97/164 (59%), Gaps = 20/164 (12%)
Query: 72 YFEQRLDHFSFADLP-TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
+F+Q+LDHFS DL TF QRY INT++ + + +Y G E + + S +D
Sbjct: 29 WFDQKLDHFS--DLAETFKQRYYINTNY---SKKSKNLVVYIGGEAPL----LESSLKYD 79
Query: 131 ---IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
IA ++++ EHRY+GES+P+G+ E+ YLT +QA+ D A FIT +KQ
Sbjct: 80 VQHIASVTKSVILALEHRYFGESIPHGNLELE-----NFKYLTVDQAIEDLANFITQMKQ 134
Query: 188 NLSAEASPV--VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
N +AS ++ GGSY G L++ R K+P + +G+ ASSAPI
Sbjct: 135 NYCQDASKCKALMVGGSYPGALSSRFRQKHPELTLGSWASSAPI 178
>gi|159117921|ref|XP_001709180.1| Thymus-specific serine protease precursor [Giardia lamblia ATCC
50803]
gi|157437295|gb|EDO81506.1| Thymus-specific serine protease precursor [Giardia lamblia ATCC
50803]
Length = 522
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 90/282 (31%), Positives = 130/282 (46%), Gaps = 19/282 (6%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
Q Y E + EQ +DHF + +SQRY N ++ GP+FL G EG V
Sbjct: 51 QHYPGELWFREQHVDHFDSTNTKKWSQRYYYNDTYYKAG---GPVFLMIGGEGPATPRDV 107
Query: 124 NSGFVWD-IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
F D A + V EHR+YG S P ST N+ LS L ++QALAD A F+
Sbjct: 108 GDYFSIDYFAKNMNGLKVALEHRFYGASFP--ST-----NSANLSLLRSDQALADIATFL 160
Query: 183 TNLKQNLSA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
LK+ + E + +V GGSY G LAAW R+++P I A++SS P L D PE
Sbjct: 161 AYLKREYNLPEGTKIVAVGGSYSGNLAAWARIQFPFIIDAAISSSGPYLAQTDY--PEYL 218
Query: 242 YNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRE--LNSTE-DLAD 298
+I S K C + I + + + + L F L E NST D A
Sbjct: 219 QHIDSQVRKYGGDRCMDIISAAHKDAEYLLSHDKA--TLATIFKLKEESIYNSTGYDKAS 276
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDA 340
++ + + +V Y + I+++CK I+ + D+
Sbjct: 277 FMSAMGAPSGVVQYAKHDGYYNTTKDGDIKQMCKAIEASYDS 318
>gi|308162690|gb|EFO65071.1| Thymus-specific serine protease precursor [Giardia lamblia P15]
Length = 522
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 129/279 (46%), Gaps = 19/279 (6%)
Query: 62 QQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWF 121
+ Q Y E + EQ +DHF + +SQRY N ++ GP+FL G EG +
Sbjct: 49 RTQHYSGELWFREQHVDHFDPMNTKKWSQRYYYNDTYYKAG---GPVFLMIGGEGPVTPK 105
Query: 122 AVNSGFVWD-IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
V F D A + V EHR+YG S P ST+ A LS L ++QALAD A
Sbjct: 106 YVEDYFSIDYFAKNMNGLKVALEHRFYGASFP--STDSA-----DLSLLRSDQALADIAT 158
Query: 181 FITNLKQNLSA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPE 239
F+ LK+ + E++ +V GGSY G LAAW R+++P I A++SS P L D PE
Sbjct: 159 FLAYLKREYNLPESTKIVAVGGSYSGNLAAWARIQFPFIISAAISSSGPYLAQTDY--PE 216
Query: 240 TFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL---NSTEDL 296
+I S K C + I + + + + L F L E N+ D
Sbjct: 217 YLQHIDSQIRKYGGDRCMDVISAAHKDAEYLLSHDKA--TLAAIFKLREESIYNNTGYDK 274
Query: 297 ADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID 335
A ++ + + +V Y + I+++CK I+
Sbjct: 275 ASFMSAMGAPSGVVQYAKHDGYYTTTKDGDIKQMCKAIE 313
>gi|195062790|ref|XP_001996254.1| GH22390 [Drosophila grimshawi]
gi|193899749|gb|EDV98615.1| GH22390 [Drosophila grimshawi]
Length = 497
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 75/247 (30%), Positives = 120/247 (48%), Gaps = 10/247 (4%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ Q+LDHF + T+ RYL N D + P GPI+++ G E I +++G D+
Sbjct: 51 WLSQKLDHFDELNNKTWQMRYLRN-DKYHKPQ--GPIYIFVGGEWTITPGLLSTGLTHDM 107
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-QNLS 190
A +L + EHRYYG+S+P+ S+ + + L +L QALAD A FI K + +
Sbjct: 108 AVENAGILFYTEHRYYGQSLPHNSSHNS-MSLENLKHLNLHQALADLACFIRYQKSHSAN 166
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK 250
S V+L GGSY G + AWM YP + + ASSAP+L + F + +S
Sbjct: 167 LTHSKVILIGGSYSGSMVAWMTQLYPELVTASWASSAPLLAKANFYEYMQF--VGNSINL 224
Query: 251 RESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE--DLADWLESAYSYLA 308
+C ++ + LV + N + +L + C N+++ D + +Y A
Sbjct: 225 TYGHNCTQRLENGFNHLVKLFNT-NKISKLLERLDACASFNASDLLDRISFFNGIGNYFA 283
Query: 309 MVDYPYP 315
+V Y
Sbjct: 284 LVVQSYS 290
>gi|301109703|ref|XP_002903932.1| serine protease family S28, putative [Phytophthora infestans T30-4]
gi|262096935|gb|EEY54987.1| serine protease family S28, putative [Phytophthora infestans T30-4]
Length = 528
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 70/186 (37%), Positives = 99/186 (53%), Gaps = 19/186 (10%)
Query: 56 QRQQRQQQQQYRYETR--YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCG 113
Q Q + Q ++ E + +F Q +DHF+ TF QRY W P+ GP+ LY G
Sbjct: 32 QLVQAESQLLFKTEAQQLWFNQTVDHFASDSNATFQQRYYEVNKFWSKPD--GPVILYIG 89
Query: 114 NEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTA 171
EG +E +GFV IA +F A ++ EHR+YG S+P G STE YLT
Sbjct: 90 GEGAME--KAPAGFVHVIAQKFDAKILALEHRFYGRSIPNGDLSTE-------NYRYLTV 140
Query: 172 EQALADFAVFITNLKQNLSA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA--- 227
+QALAD F + + L A +A+ + GGSY G L+AW R+ YP + +L+SS
Sbjct: 141 QQALADLKHFKESYQSQLGAKDANQWIAIGGSYPGALSAWFRIAYPDATVASLSSSGVVQ 200
Query: 228 PILQFE 233
P+ +F
Sbjct: 201 PVYKFH 206
>gi|195391886|ref|XP_002054590.1| GJ24539 [Drosophila virilis]
gi|194152676|gb|EDW68110.1| GJ24539 [Drosophila virilis]
Length = 489
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 120/252 (47%), Gaps = 14/252 (5%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E + QRLDHF + T+ RY N + GPI+++ G E I +++G
Sbjct: 53 EELWLTQRLDHFDGLNNKTWQMRYFRNAKY---HRPQGPIYIFLGGEWTITPGLLSTGLT 109
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
D+A +L + EHRYYG+S PY + ++ ++ L +L+ QALAD A FI K +
Sbjct: 110 HDMAVENAGILFYTEHRYYGQSWPYENDSLSLEH---LKHLSLHQALADLAHFIRYQKSH 166
Query: 189 LS-AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
S S V+L GGSY G +AAWM YP + + ASSAP+L D + +V +
Sbjct: 167 SSNLTHSKVILVGGSYSGSMAAWMTHLYPELVTASWASSAPLLAKADFYE---YMQLVEN 223
Query: 248 DFKRE-SASCFNTIKESWGELVSVGQKENGLLELTKTFHLC--RELNSTEDLADWLESAY 304
+C I+ LV + N EL + + C + N++ D +
Sbjct: 224 SLNLSYGNNCTQRIERGLNHLVKLFNT-NETTELLTSLNACPSYDANNSLDRMTFFSGIG 282
Query: 305 SYLAMVDYPYPS 316
+Y A+V Y +
Sbjct: 283 NYFALVVQSYSA 294
>gi|403308859|ref|XP_003944859.1| PREDICTED: thymus-specific serine protease isoform 2 [Saimiri
boliviensis boliviensis]
Length = 539
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 80/263 (30%), Positives = 124/263 (47%), Gaps = 45/263 (17%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ L+ F+ +D +F QRY +N HW G + GPIFL+ G EG + +V G +
Sbjct: 59 WLEQLLNPFNVSDRRSFLQRYWVNDQHWTGQD--GPIFLHLGGEGSLGPGSVMKGHPAAL 116
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA----------------- 174
AP +GA+++ EHR+YG S+P G ++A L +L++ A
Sbjct: 117 APAWGALVISLEHRFYGLSIPAGGLDMA-----QLRFLSSRHAVGKSSGIPSDEDRPSLP 171
Query: 175 ----LADFAVFITNLKQ--NLSAEASPVVLFGGSYGGMLAAWMRLK------YPHIAIGA 222
LAD L + N+S+ +SP + FGGSY G LAAW RLK +PH+ +
Sbjct: 172 SDPRLADVVSARLALSRLFNVSS-SSPWICFGGSYAGSLAAWARLKVLGFLRFPHLIFAS 230
Query: 223 LASSAPILQFEDIVPPETFYNIVSSDFKRE----SASCFNTIKESWGELVSVGQKENGLL 278
+ASSAP+ D + ++VS K S C + ++ E+ +
Sbjct: 231 VASSAPVRAVLDF---SEYNDVVSRSLKSAAIGGSLECQAAVSTAFSEVERRLRAGGAAR 287
Query: 279 ELTKT-FHLCRELNSTEDLADWL 300
+ + C L+ ED A+ L
Sbjct: 288 AALQAELNACGSLSRAEDQAELL 310
>gi|253743733|gb|EET00051.1| Thymus-specific serine protease precursor [Giardia intestinalis
ATCC 50581]
Length = 521
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 88/282 (31%), Positives = 133/282 (47%), Gaps = 19/282 (6%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV 123
Q Y E + +QR+DHF+ + +SQRY N ++ GP+FL G EG V
Sbjct: 51 QHYSGELWFHDQRVDHFNPVNTKKWSQRYYYNDTYYKAG---GPVFLMIGGEGPATPRDV 107
Query: 124 NSGFVWD-IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
F D A + V EHR+YG S P ST+ ++ LS L ++QALAD A F+
Sbjct: 108 GDYFSIDYFAKSMSGLKVALEHRFYGASFP--STD-----SSDLSLLRSDQALADIATFL 160
Query: 183 TNLKQNLSA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
LK+ + +++ VV GGSY G LAAW R+++P + A++SS P L D PE
Sbjct: 161 AYLKKEYNLPKSTKVVAVGGSYSGNLAAWARIQFPFVIDAAISSSGPYLAQTDY--PEYL 218
Query: 242 YNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRE--LNSTE-DLAD 298
+I + K C + I + + + + L F L E NST D A
Sbjct: 219 QHIDAQIRKSGGDRCMDIISAAHTDAEYLLNHDKA--TLATIFKLKEESIYNSTAYDKAS 276
Query: 299 WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDA 340
++ + + +V Y + I+++CK I+ DA
Sbjct: 277 FMSTMGAPSGVVQYAKHDGYYTDTKDGDIKQMCKSIEAYYDA 318
>gi|195569669|ref|XP_002102831.1| GD19291 [Drosophila simulans]
gi|194198758|gb|EDX12334.1| GD19291 [Drosophila simulans]
Length = 487
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 121/266 (45%), Gaps = 36/266 (13%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E + +Q++DHF + T+ RY N H+ P GPI+++ G E I +++G
Sbjct: 49 EELWLDQKVDHFDKNNNRTWKMRYYRNAKHF-KPQ--GPIYIFVGGEWTISPGLLSTGLT 105
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
D+A ML + EHRYYG S+P+G N L L+ Q+LAD A FI + K N
Sbjct: 106 HDMAVENSGMLFYTEHRYYGLSLPFGHERYQLNN---LKQLSLHQSLADLAHFIRHQKSN 162
Query: 189 L-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
E S V+L GGSY G L WM YP + + ASSAP+L +
Sbjct: 163 GPEMEDSKVILVGGSYSGSLVTWMTQLYPDLIAASWASSAPLL--------------AKA 208
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF------HLCRELNSTE------- 294
DF +I+ S+G+ S+ + E G L K F L LN E
Sbjct: 209 DFFEYMEVVGKSIQLSYGKNCSL-RIEKGFKFLAKLFDGDEIQELLYNLNGCEGYSPKNP 267
Query: 295 -DLADWLESAYSYLAMVDYPYPSDFM 319
D A + +Y A+V Y S ++
Sbjct: 268 LDRAAFFNGLGNYFALVVQSYRSAYI 293
>gi|345317349|ref|XP_003429868.1| PREDICTED: dipeptidyl peptidase 2-like, partial [Ornithorhynchus
anatinus]
Length = 209
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 49/129 (37%), Positives = 81/129 (62%), Gaps = 4/129 (3%)
Query: 202 SYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIK 261
SYGGML+A+MRLKYPH+ GALA+SAP+L + P F+ V++DF+ S C ++
Sbjct: 81 SYGGMLSAYMRLKYPHLVTGALAASAPVLSVAGLGDPRQFFRDVTADFENFSPKCSGAVR 140
Query: 262 ESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLE---SAYSYLAMVDYPYPSDF 318
E++G++ + ++ +++ C L+ D+ LE +A++ +AM+DYPYP+DF
Sbjct: 141 EAFGQIWDLALRQ-AYDPISRGMATCHRLSDGADVDQLLEFARNAFAMIAMMDYPYPTDF 199
Query: 319 MMPLPGYPI 327
M P +P+
Sbjct: 200 MGHFPAHPV 208
>gi|195353641|ref|XP_002043312.1| GM26842 [Drosophila sechellia]
gi|194127426|gb|EDW49469.1| GM26842 [Drosophila sechellia]
Length = 487
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 85/266 (31%), Positives = 121/266 (45%), Gaps = 36/266 (13%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E + +Q++DHF + T+ RY N H+ P GPI+++ G E I +++G
Sbjct: 49 EELWLDQKVDHFDKNNNRTWKMRYYRNAKHF-KPQ--GPIYIFVGGEWTISPGLLSTGLT 105
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
D+A ML + EHRYYG S+P+G N L L+ Q+LAD A FI + K N
Sbjct: 106 HDMAVENSGMLFYTEHRYYGLSLPFGHESYQLNN---LKQLSLHQSLADLAHFIRHQKSN 162
Query: 189 L-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
E S V+L GGSY G L WM YP + + ASSAP+L +
Sbjct: 163 GPEMEDSKVILVGGSYSGSLVTWMTQLYPDLIAASWASSAPLL--------------AKA 208
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF------HLCRELNSTE------- 294
DF +I+ S+G+ S+ + E G L K F L LN E
Sbjct: 209 DFFEYMEVVGKSIQLSYGKNCSL-RIEKGFKFLAKLFDGDEIQELLYNLNGCEGYSPKNP 267
Query: 295 -DLADWLESAYSYLAMVDYPYPSDFM 319
D A + +Y A+V Y S ++
Sbjct: 268 LDRAAFFNGLGNYFALVVQSYRSAYI 293
>gi|145525753|ref|XP_001448693.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124416248|emb|CAK81296.1| unnamed protein product [Paramecium tetraurelia]
Length = 462
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 70/181 (38%), Positives = 99/181 (54%), Gaps = 15/181 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
ET F Q LDH A+ T+ QRY + + + P + G I CG E + + + NS F
Sbjct: 30 ETFQFTQLLDHSDPANTQTWQQRYHVYSQY-FNPTKGGVILYICG-EWNCQGVSDNS-FS 86
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ- 187
+ +A GA+++ EHR+YG+S P+G+ + +N LSYL QAL D A FI +K+
Sbjct: 87 FQLAKDLGAIVIALEHRFYGQSQPFGADSWSLEN---LSYLNVHQALDDLAYFILQMKRL 143
Query: 188 NLSAEAS--PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI------LQFEDIVPPE 239
L + S P GGSY G L+AW R KYPH+ +G LASS I +F+D +
Sbjct: 144 KLHSIDSTLPWYAIGGSYPGALSAWFRYKYPHLTVGNLASSGVINTVLDFWEFDDQIRKS 203
Query: 240 T 240
T
Sbjct: 204 T 204
>gi|145498935|ref|XP_001435454.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124402586|emb|CAK68057.1| unnamed protein product [Paramecium tetraurelia]
Length = 452
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 67/184 (36%), Positives = 98/184 (53%), Gaps = 19/184 (10%)
Query: 66 YRYETRYFEQRL-DHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
Y + +FE +L DH+ + F QRY + +++V P +F CG E+ +N
Sbjct: 21 YNQKMYWFEHQLVDHYDKLNKNVFHQRYWVVEENFV-PETGVVLFQICG-----EYTCIN 74
Query: 125 SG----FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAV 180
F+ +A F A+++ EHRYYG+SMP G + +N L YL+ QAL D A
Sbjct: 75 DIKLRLFIIQLAKEFNALIIILEHRYYGKSMPLGKESLKDEN---LRYLSTRQALDDLAY 131
Query: 181 F----ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
F + N K + ++ +P + GGSY G LAAW R +YPH+ IGALASSA + D
Sbjct: 132 FQRFMVLNKKHGIKSQ-NPWIAIGGSYPGALAAWYRYQYPHLVIGALASSAVVESITDFK 190
Query: 237 PPET 240
+T
Sbjct: 191 MFDT 194
>gi|397492345|ref|XP_003817084.1| PREDICTED: dipeptidyl peptidase 2 [Pan paniscus]
Length = 406
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 98/168 (58%), Gaps = 9/168 (5%)
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKE 262
YGGML+A++R+KYPH+ GALA+SAP+L + F+ V++DF+ +S C ++E
Sbjct: 77 YGGMLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTADFEGQSPKCTQGVRE 136
Query: 263 SWGELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLESAYSYLAMVDYPYPSDF 318
++ ++ + + G + + F C+ L+ +DL + +A++ LAM+DYPYP+DF
Sbjct: 137 AFRQIKDLFLQ--GAYDTVRWEFGTCQPLSDEKDLTQLFMFARNAFTVLAMMDYPYPTDF 194
Query: 319 MMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQL 366
+ PLP P++ C ++ + + L + + YN +G+ C+ +
Sbjct: 195 LGPLPANPVKVGCDRLLSEAQRITGLRAL---AGLVYNASGSEHCYDI 239
>gi|66770655|gb|AAY54639.1| IP12434p [Drosophila melanogaster]
Length = 486
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 135/297 (45%), Gaps = 23/297 (7%)
Query: 40 RAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHW 99
A FV L L P + + + + E R+ Q+LD+ ++ T+ R IN ++
Sbjct: 27 EANAFVKSLRELHRGPPVEPMKTRA--KVEERWITQKLDNSDDSNNATWQDRIYINNKYF 84
Query: 100 VGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVA 159
V + PIF+Y G E I+ + SG DIA + L++ EHR++G+S+P T ++
Sbjct: 85 VDGS---PIFIYLGGEWAIDPSGITSGLWKDIAKQHNGSLLYTEHRFFGQSIPI--TPLS 139
Query: 160 YQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIA 219
+N Y + EQALAD I LKQ + S VV G SY +A W+R YP I
Sbjct: 140 TENLA--KYQSVEQALADVINVIATLKQEDKYKDSKVVASGCSYSATMATWIRKLYPEII 197
Query: 220 IGALASSAPIL---QFEDIVPPETFYNIVSSDFKRESAS-CFNTIKESWGELVSVGQKEN 275
G+ ASSAP+L F+D + +V + C++ I + ++ + N
Sbjct: 198 RGSWASSAPLLAKVNFKD------YMKVVGESYATLGGQYCYDLIDNATSYYENLFEIGN 251
Query: 276 GLLELTKTFHLCRELNSTEDLADWL---ESAYSYLAMVDYPYPSDFMMPLPGYPIRE 329
G + K +LC N + W A + + Y P + +P +RE
Sbjct: 252 G-TQAVKELNLCSNFNVNSEQDRWQIFSTIANIFAGIAQYQKPEKYDIPTYCSILRE 307
>gi|118361433|ref|XP_001013945.1| Serine carboxypeptidase S28 family protein [Tetrahymena
thermophila]
gi|89295712|gb|EAR93700.1| Serine carboxypeptidase S28 family protein [Tetrahymena thermophila
SB210]
Length = 873
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 98/180 (54%), Gaps = 9/180 (5%)
Query: 53 EPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYC 112
+ + + ++ +++Y + TR + DHF+ + T+SQRY + D + P G + LY
Sbjct: 410 DQNEHKDQKNKKKYLFSTR-IQNLTDHFNITNNRTWSQRYWV-LDQYYNPQN-GSVLLYI 466
Query: 113 GNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAE 172
E F +A +F ++++ EHR+YG SMP+G + N L L +
Sbjct: 467 CGEYTCPGIPEERQFPILLAQKFSSLVLVLEHRFYGNSMPFGDQSMKQHN---LYLLNVD 523
Query: 173 QALADFAVFITNLK-QNLSAEAS--PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
QALAD A FIT +K +L + P + GGSY G ++AW R KYPH+ +GALASSA +
Sbjct: 524 QALADLAYFITYVKDHHLHGVQNHIPWLTIGGSYPGAMSAWFRYKYPHLTVGALASSAVV 583
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 75/324 (23%), Positives = 130/324 (40%), Gaps = 65/324 (20%)
Query: 69 ETRYF-EQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGP--IFLYCGNEGDIEWFAVNS 125
E YF EQR DHFS + + QRY I + +LG I C + + +
Sbjct: 28 EEHYFNEQRYDHFS-NNFELWDQRYFIAKNEKSQNGQLGKVNIIFVCDKDLTHDILSCIP 86
Query: 126 GFVWDIAPRFGAMLVFP-EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
F +D R + +F E RYYGES PY S Y LSY + +Q +AD A+F++
Sbjct: 87 PF-FDSQRRNSDVNIFLLEMRYYGESQPYSS---RYLGIDYLSYQSIQQNIADIALFVSF 142
Query: 185 LKQ-NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
LK+ N+ + S ++KYPH+ G +A ++ ++ + E +
Sbjct: 143 LKKDNMVSSDSK----------------KIKYPHLIDGVIAFNSQLVN----INYEQYNQ 182
Query: 244 IVSSDFKRESASCF--------------------NTIKESWGELVSVGQKE-NGLLELTK 282
I+ + + C TI + E+VS +E L ++ +
Sbjct: 183 ILDQQLSQTNPQCLLQQELENKIKILEFCSQIQNKTISQIVNEIVSKSSQEYKNLFDIQE 242
Query: 283 TFHLCRELNSTEDLADWLESAYSYLAMVDYPYP------------SDFMMPLPGYPIREV 330
L N+T+ + L D + S +P Y +++
Sbjct: 243 FASLSDSNNATQTKRKNQYLPFIRLQCSDLGFSKFFSNQVTENSQSTIFKKIPQYIFQKI 302
Query: 331 CKKIDNAPDATSI--LERIFEGVS 352
C+ + N P ++ L+++F G++
Sbjct: 303 CQTLFNQPFHKNVDQLQQLFGGIN 326
>gi|302766039|ref|XP_002966440.1| hypothetical protein SELMODRAFT_63867 [Selaginella moellendorffii]
gi|300165860|gb|EFJ32467.1| hypothetical protein SELMODRAFT_63867 [Selaginella moellendorffii]
Length = 393
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 69/167 (41%), Positives = 96/167 (57%), Gaps = 19/167 (11%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFL-YCGNEGDIEWFAVNSGFVWDIAP 133
Q+LDHF+ D TF Q+Y D++ P R GP+FL CG +A D+A
Sbjct: 1 QKLDHFTPEDTRTFPQKYFELLDYF-EPQR-GPMFLVMCGETSCPGGYA---QLTSDVAK 55
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE- 192
FGA +V EHR+YGES P+ + V L YLT +Q+L D A FI ++ ++A+
Sbjct: 56 EFGAAVVTLEHRFYGESSPFHNLTV-----DNLKYLTIQQSLLDHAEFIAFYQKVINAKF 110
Query: 193 ----ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA---PILQF 232
+P ++ GGSY G L+AW RLK+PH+ IG+ ASSA PIL +
Sbjct: 111 QKDGDNPWLVIGGSYAGALSAWFRLKFPHLVIGSWASSAVVHPILSY 157
>gi|145492429|ref|XP_001432212.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124399322|emb|CAK64815.1| unnamed protein product [Paramecium tetraurelia]
Length = 462
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 69/182 (37%), Positives = 97/182 (53%), Gaps = 17/182 (9%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
ET F Q LDH A+ T+ QRY + + + P + G I CG E + + NS
Sbjct: 30 ETFQFTQLLDHSDPANTQTWQQRYHVYSQY-FNPTKGGVILYICG-EWNCQGVGDNS-LS 86
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ- 187
+ +A GA+++ EHR+YG+S P+G+ + +N LSYL QAL D A FI +K+
Sbjct: 87 FQLAKDLGAIVIALEHRFYGQSQPFGADSWSLEN---LSYLNVHQALDDLAYFILQMKRL 143
Query: 188 ---NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI------LQFEDIVPP 238
N+ + P GGSY G L+AW R KYPH+ +G LASS I QF+D +
Sbjct: 144 KLHNIDSTL-PWYAIGGSYPGALSAWFRYKYPHLTVGNLASSGVINTILDYWQFDDQIRK 202
Query: 239 ET 240
T
Sbjct: 203 ST 204
>gi|302689763|ref|XP_003034561.1| hypothetical protein SCHCODRAFT_15001 [Schizophyllum commune H4-8]
gi|300108256|gb|EFI99658.1| hypothetical protein SCHCODRAFT_15001 [Schizophyllum commune H4-8]
Length = 555
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 67/170 (39%), Positives = 95/170 (55%), Gaps = 17/170 (10%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVN 124
+ ++FEQ LDHF+ TF QRY +N H+V P P+ + G E G+ ++
Sbjct: 73 EFPAQWFEQPLDHFNNETGDTFGQRYWVNKRHYV-PGTNAPVIVLDGGETSGEDRLPFLD 131
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT--LSYLTAEQALADFAVFI 182
+G V + G + V EHRYYGE+ P QN TT L +LT EQ+ AD A F+
Sbjct: 132 TGIVEILTRATGGVGVVLEHRYYGETKPV-------QNLTTDSLRFLTNEQSAADSANFM 184
Query: 183 TNLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
N+K ++L+A +P + +GGSY G AA MR+ YP + GA+ASSA
Sbjct: 185 ANVKFDGIDEDLTAPGTPWIYYGGSYAGARAAHMRVLYPDLVFGAIASSA 234
>gi|302762512|ref|XP_002964678.1| hypothetical protein SELMODRAFT_63866 [Selaginella moellendorffii]
gi|300168407|gb|EFJ35011.1| hypothetical protein SELMODRAFT_63866 [Selaginella moellendorffii]
Length = 393
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 69/167 (41%), Positives = 96/167 (57%), Gaps = 19/167 (11%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFL-YCGNEGDIEWFAVNSGFVWDIAP 133
Q+LDHF+ D TF Q+Y D++ P R GP+FL CG +A D+A
Sbjct: 1 QKLDHFTPEDTRTFPQKYFELLDYF-EPQR-GPMFLVMCGETSCPGGYA---QLTSDVAK 55
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE- 192
FGA +V EHR+YGES P+ + V L YLT +Q+L D A FI ++ ++A+
Sbjct: 56 EFGAAVVTLEHRFYGESSPFHNLTV-----DNLKYLTIQQSLLDHAEFIAFYQKVINAKF 110
Query: 193 ----ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA---PILQF 232
+P ++ GGSY G L+AW RLK+PH+ IG+ ASSA PIL +
Sbjct: 111 QKDGDNPWLVIGGSYAGALSAWFRLKFPHLVIGSWASSAVVHPILSY 157
>gi|328872043|gb|EGG20413.1| peptidase S28 family protein [Dictyostelium fasciculatum]
Length = 466
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/149 (42%), Positives = 89/149 (59%), Gaps = 10/149 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW--FAVNSGFVW 129
++ Q L+HF D TF QRY N D + ++ GPI LY EG + + G V
Sbjct: 47 WYNQTLNHFDAQDSRTFMQRYYTN-DAYYDYSKGGPIILYINGEGPVSSPPYQQGDGVV- 104
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
A GA +V EHRYYG+S P+ +++ +N L +L++ QAL D AVFI++ ++NL
Sbjct: 105 VYAQALGAYIVTLEHRYYGDSSPF--EDLSTEN---LKFLSSRQALNDLAVFISDFRKNL 159
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHI 218
S ++ VV GGSY G L+AW R+KYPHI
Sbjct: 160 SL-STEVVTIGGSYSGALSAWFRVKYPHI 187
>gi|133930823|ref|NP_501598.2| Protein PCP-3 [Caenorhabditis elegans]
gi|119662054|emb|CAB05185.2| Protein PCP-3 [Caenorhabditis elegans]
Length = 1080
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 103/385 (26%), Positives = 169/385 (43%), Gaps = 40/385 (10%)
Query: 1 MPKTRQQNQNSLYL-SPVITIVIISILSPLSLAAQPSKFRRAPRF----VGKLPHLTEP- 54
+P TR Q+ S+ L V+ I S S L K P+F +G+ PH P
Sbjct: 503 IPWTRPQSAQSVNLRESVLKQEIESRFSKLGDDVPSKKTFPEPKFKKVFLGRPPHGFLPE 562
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGN 114
P + ++ +E F QR DHF + F Q++ N W + GP FL G
Sbjct: 563 PDYELKEDAYPPGFEQGTFRQRQDHFDNQNADFFQQKFFKNA-QWA--KQGGPNFLMIGG 619
Query: 115 EG--DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAE 172
EG W + A ++GA + EHR+YG+S+ + T L +
Sbjct: 620 EGPESARWVLNENITYLTWAKKYGATVYLLEHRFYGDSV--------VGDNTNFKLLNSL 671
Query: 173 QALADFAVFI--TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
Q L D A FI N++ + ++P + FGGSY G ++AWMR +P + +GA+ASS P+
Sbjct: 672 QMLYDLAEFIKAVNIR---TGTSNPWITFGGSYSGAMSAWMREVFPDLVVGAVASSGPVY 728
Query: 231 QFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL 290
D + +V + +R ++ C + I+ + + ++ + G L+ F L
Sbjct: 729 AKTDFYE---YLMVVENSVRRYNSKCADNIQSGFDAIRTLFLTKEGRQNLSSIFQLQPPF 785
Query: 291 NSTEDLAD---WLESAYSYLAMVDYPYPSDFMMPLP-GYPIREVCKKIDNAPDATSILER 346
+ + D + + Y Y D P GY I ++CK + N D+ + L
Sbjct: 786 SDSVTDTDQHYFFSNVYGNFQGA-VQYSGDNTGPYANGYGIPDMCKIMSN--DSNTPLNN 842
Query: 347 IF---EGVSVYYN---YTGNVDCFQ 365
I + + ++YN YTG + +Q
Sbjct: 843 IVAFNQFMIIFYNGGQYTGMDNNYQ 867
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 71/228 (31%), Positives = 106/228 (46%), Gaps = 18/228 (7%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF---V 128
Y Q LDHF TFSQRY T + R FLY +G +E AV S +
Sbjct: 47 YMLQSLDHFIGNASGTFSQRYFY-TQQYTLHQRTA--FLYVSADG-VEEAAVISDERNPI 102
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI--TNLK 186
A +FGA + EHRYYG+S P +A L +L + QA+ D FI N++
Sbjct: 103 VKTAKQFGATIFSLEHRYYGQSRP----NFDKFDAQNLRHLNSLQAILDIISFIKSVNVQ 158
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
N+ + VL+G YGG+LAA R P G +ASS+P+ D F + V+
Sbjct: 159 FNMDPDVR-WVLWGAGYGGILAAEARKWDPVTISGVIASSSPLTHLYDF---WQFNDQVA 214
Query: 247 SDFKRESAS-CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNST 293
+ F + C+N +++ + ++ + G ++ F L L+ T
Sbjct: 215 TTFSQVGGGLCYNKVRQGFADIRQAMRTPEGRRNVSSLFQLNPRLDQT 262
>gi|349805101|gb|AEQ18023.1| hypothetical protein [Hymenochirus curtipes]
Length = 294
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 64/159 (40%), Positives = 93/159 (58%), Gaps = 11/159 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
Y Q LDHF+ + T+ QRY +N +HW P+ GP+FLY G EG + F+V SG ++
Sbjct: 1 YIAQPLDHFNRRNNATYRQRYWVNEEHWRQPD--GPVFLYIGGEGSLSEFSVLSGEHVEL 58
Query: 132 APRFGAMLVFPEHRYYGESM-PYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
A A+LV E +YG S+ P G T +L +L+++QALAD A F + +
Sbjct: 59 AQTHRALLVSLE-CFYGSSINPDGMT------LESLKFLSSQQALADLASFHLFISHKYN 111
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+ + FGGSY G L+AW LK+PH+ ++ASSAP+
Sbjct: 112 LTRNTWICFGGSYPGSLSAWF-LKFPHLVYASVASSAPV 149
>gi|145522514|ref|XP_001447101.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124414601|emb|CAK79704.1| unnamed protein product [Paramecium tetraurelia]
Length = 461
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 85/166 (51%), Gaps = 17/166 (10%)
Query: 70 TRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA--VNSGF 127
T +F Q+LDH F QRY + D+ V I CG EW + SG
Sbjct: 35 TLWFTQKLDHNDPTSKEVFRQRYHVYDDYVVRNQPESVILYICG-----EWTCDGIGSGL 89
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+D A + A+++ EHRY+G+S P+G + L YL QAL D A FI ++K
Sbjct: 90 TFDAAQQLKALVLVLEHRYFGQSQPFGDW-----STPNLKYLNIHQALDDIAYFIQDVKA 144
Query: 188 ----NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
N+ +P + GGSY G L+AW R KYPH+ IG LASSA +
Sbjct: 145 KGLFNIKPN-TPWIHLGGSYPGALSAWFRYKYPHLTIGGLASSAVV 189
>gi|338718275|ref|XP_003363793.1| PREDICTED: LOW QUALITY PROTEIN: thymus-specific serine
protease-like [Equus caballus]
Length = 620
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 57/146 (39%), Positives = 86/146 (58%), Gaps = 10/146 (6%)
Query: 91 RYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGES 150
RY +N HW + GP+FL+ G EG + +V G +AP +GA+++ EHR+YG S
Sbjct: 184 RYWVNDQHWTAQD--GPVFLHLGGEGSLWPGSVLRGHPATLAPSWGALVIGLEHRFYGLS 241
Query: 151 MPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NLSAEASPVVLFGGSYGGMLA 208
+P G ++A L +L++ AL D L + N+S+ +SP + FGGSY G LA
Sbjct: 242 IPAGGLDMAQ-----LRFLSSRHALTDVVSARLALSRLFNVSS-SSPWICFGGSYAGSLA 295
Query: 209 AWMRLKYPHIAIGALASSAPILQFED 234
+W RLK+PH+ ++ASSAP+ D
Sbjct: 296 SWARLKFPHLIFASVASSAPVRAVLD 321
>gi|170092076|ref|XP_001877260.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164648753|gb|EDR12996.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 565
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 73/231 (31%), Positives = 118/231 (51%), Gaps = 24/231 (10%)
Query: 13 YLSPVITIV---------IISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQ 63
+L+P++ ++ +I IL P + + R A + K P L Q
Sbjct: 6 FLTPLVLLLKVAATSKSDLIRILGPQGVNLHKLEARAARQKFAKDPGLL--VQEAGGSSP 63
Query: 64 QQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWF 121
+ + +F+Q LDHFS ++ TF QRY +NT H+ P++ P+ + G E G+
Sbjct: 64 TSFEFRPLWFKQPLDHFSTSNKHTFHQRYWVNTRHY-KPSKNAPVIVLDGGETSGEDRLP 122
Query: 122 AVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF 181
+++G V +A G + V EHRYYG+S+P V+ + +L +L Q+ AD A F
Sbjct: 123 FLDTGIVEILARATGGVGVVLEHRYYGKSIP-----VSNFSTDSLRWLNNAQSAADSANF 177
Query: 182 ITNLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
+ N K ++L A +P + +GGSY G AA MR+ YP + GA++SS
Sbjct: 178 MRNFKIDGIDEDLRAPHTPWIYYGGSYAGARAAHMRVLYPDLVYGAISSSG 228
>gi|104531986|gb|ABF72901.1| CG3734-like [Belgica antarctica]
Length = 184
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/169 (38%), Positives = 86/169 (50%), Gaps = 9/169 (5%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
ET++ EQ L+HF D + RY+ N + GPIF+Y G E I + S
Sbjct: 24 ETKWIEQPLNHFDPQDHRVWQMRYMENREF---LQDGGPIFIYVGGEWTISEGWLRSSHF 80
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
+A + L + EHRYYGES P V L +L +QALAD A FIT++KQ
Sbjct: 81 HYMAEQLNGTLYYTEHRYYGESHPTEDLTV-----DNLRFLNIDQALADLAHFITHIKQT 135
Query: 189 L-SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
+ S V+L G SY + W KYPH+A GA +SSAP+ D V
Sbjct: 136 TPELQNSGVILIGASYSATMVTWFMQKYPHLARGAWSSSAPLEAKVDFV 184
>gi|240254469|ref|NP_179399.5| serine carboxypeptidase S28-like protein [Arabidopsis thaliana]
gi|322518656|sp|Q1PF50.2|EDA2_ARATH RecName: Full=Probable serine protease EDA2; AltName: Full=Protein
EMBRYO SAC DEVELOPMENT ARREST 2; Flags: Precursor
gi|330251630|gb|AEC06724.1| serine carboxypeptidase S28-like protein [Arabidopsis thaliana]
Length = 489
Score = 102 bits (255), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 97/289 (33%), Positives = 140/289 (48%), Gaps = 44/289 (15%)
Query: 51 LTEPPQRQQRQQQQQYRYETR--YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPI 108
L P + +Y T +F Q LDH S D F QRY D++ P+ GP+
Sbjct: 26 LLHPSSVSHNVSRSRYYMTTNELWFNQTLDHESPNDHRKFRQRYYEFMDYFRSPD--GPM 83
Query: 109 FLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSY 168
F+ EG A + ++ +A +F A +V EHRYYG+S P+ S +A +N L Y
Sbjct: 84 FMIICGEGPCSGIA--NDYINVLAKKFQAGVVSLEHRYYGKSSPFNS--LATEN---LKY 136
Query: 169 LTAEQALADFAVF------ITNLKQNLSAEAS--PVVLFGGSYGGMLAAWMRLKYPHIAI 220
L+++QAL D A F N K N+S+ S P FG SY G L+AW RLK+PH+
Sbjct: 137 LSSKQALYDLASFRQYYQESLNKKLNISSGGSDNPWFFFGISYSGALSAWFRLKFPHLTC 196
Query: 221 GALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVG-QKENGLLE 279
G+LASSA + + S+F ++ I ES G+ + Q+ N LLE
Sbjct: 197 GSLASSAVVRAIYEF-----------SEFDQQ-------IGESAGQECKLALQETNKLLE 238
Query: 280 L---TKTFHLCRELNSTE-DL-ADWLE-SAYSYLAMVDYPYPSDFMMPL 322
L K + N+TE D+ AD+L +A + + Y P +PL
Sbjct: 239 LGLKVKNKAVKSLFNATELDVDADFLYLTADAAVMAFQYGNPDKLCVPL 287
>gi|194744687|ref|XP_001954824.1| GF16548 [Drosophila ananassae]
gi|190627861|gb|EDV43385.1| GF16548 [Drosophila ananassae]
Length = 390
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 80/281 (28%), Positives = 130/281 (46%), Gaps = 27/281 (9%)
Query: 23 ISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSF 82
++IL+ LS + F+ V +LP EP + + R+ Q+LD+F
Sbjct: 8 LAILAVLSAPTVGASFKEPMPKVNRLPK--EPMITRATVHE-------RWINQKLDNFDE 58
Query: 83 ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFP 142
+ T+S R IN +V + PIF+Y G E + ++SG DIA + +V
Sbjct: 59 DNNATWSNRIFINEQDFVDGS---PIFIYLGGESEQLPSRISSGLWVDIAKQHNGTIVAT 115
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGS 202
EHR+YG+S P Y Y + QALAD I LK+ + S VV+ G S
Sbjct: 116 EHRFYGKSTPI----TPYSTENLEKYQSINQALADVINVIQTLKEEDKYKDSKVVIHGCS 171
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIK 261
Y +AAW+R YP I +G+ ASSAP++ V + ++ ++ F+ C++ I
Sbjct: 172 YSATMAAWIRKLYPDIIVGSWASSAPLVA---KVEFKEYFKVIGESFRILGGQYCYDLID 228
Query: 262 ES---WGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
+ + L + G+ + EL +LC + + + W
Sbjct: 229 NATNYYENLFANGKGDQAKKEL----NLCDDFDPKNEWDRW 265
>gi|170045810|ref|XP_001850487.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868715|gb|EDS32098.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 726
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 109/217 (50%), Gaps = 17/217 (7%)
Query: 14 LSPVITIVIISILSPLSLAAQPSKFRRAP---RFVGKLPHLTEPPQRQQRQQQQQYRYET 70
++ ++ + ++ +++ +++A Q K + P R + P E P+ Y+
Sbjct: 2 VNKLVLVALLGMVAAVAIAEQ-RKVQITPVVQRMLNTFPK-PEVPEGYVSTNPHTIGYK- 58
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
F R+DHF + TF Y N +++ R GPIF++ G + +E + + G D
Sbjct: 59 --FRTRVDHFDPQNRATFEFEYYSNDEYY---ERGGPIFIFVGGDWPLEQYYIERGHFHD 113
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
IA R A + EHRYYG S P V+ + L YLT EQA+ D A +I +L+ N+
Sbjct: 114 IAQRTNAWMFTNEHRYYGHSSP-----VSDYSTENLRYLTVEQAMVDLAEWIFHLRNNVV 168
Query: 191 AEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASS 226
+ + V+L G Y G +A W R +YPH+ G ASS
Sbjct: 169 RDTNAKVILLGTGYAGAIATWARQRYPHLVDGVWASS 205
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 73/276 (26%), Positives = 116/276 (42%), Gaps = 44/276 (15%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRY 146
TF Y N +++ R GPIF++ G + +E + + G DIA R A + EHRY
Sbjct: 400 TFEFEYYSNDEYY---ERGGPIFIFVGGDWPLEQYYIERGHFHDIAQRTNAWMFTNEHRY 456
Query: 147 YGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA-SPVVLFGGSYGG 205
YG S P V+ + L YLT EQA+ D A +I +L+ N+ + + V+L G Y G
Sbjct: 457 YGHSSP-----VSDYSTENLRYLTVEQAMVDLAEWIFHLRNNVVRDTNAKVILLGTGYAG 511
Query: 206 MLAAWMRLKYPHIAIGALASSAPILQFEDIVPP--------------ETFYNIVSSDFKR 251
+A W R +YPH+ G ASS + +++ T N++ +
Sbjct: 512 AIATWARQRYPHLVDGVWASSEYAEEIGELLRDLGNDECYSRIWRAFRTAENLMDAGRTE 571
Query: 252 ESASCFNT------------------IKESWGELVSVGQKENGLLELTKTFHLCRELNST 293
FNT +K + + V GQ L + E
Sbjct: 572 TVTEMFNTCEPVDEENMLDVETFFFNVKAALQDAVLRGQNVEATEALCEALEESTEETDI 631
Query: 294 EDLADWLESAYSYLAMVDYPYPSD---FMMPLPGYP 326
+ +A WL+ Y +L + + + + F + GYP
Sbjct: 632 QVIAQWLQEFYFFLDCMPFDFEAHTDAFRLTEIGYP 667
>gi|344268154|ref|XP_003405927.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 486
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 105/188 (55%), Gaps = 15/188 (7%)
Query: 81 SFADLPTFSQRYLINTDHWVGPNRLGPIFLYC-GNE-GDIEWFAVNSGFVW-DIAPRFGA 137
++ D+ +++Y IN D + P GP+FL GNE IEW N F W A R GA
Sbjct: 27 AYGDVTQSTEKYYINYDFY-KPG--GPVFLKVQGNEPASIEWIRRN--FTWITYAQRLGA 81
Query: 138 MLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVV 197
+ + EHR+YG+S P + +++ +N YL++ QA+AD A F T + Q+++ + V
Sbjct: 82 LCLLLEHRFYGDSQP--TRDMSTENFR--RYLSSRQAVADIAEFRTVIAQSMNLTENKWV 137
Query: 198 LFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCF 257
LFGGSYGG LA W R+K+P++ A+ SSA + + V ++ ++ + C
Sbjct: 138 LFGGSYGGSLAVWSRIKHPNLFAAAVTSSAMV---QAKVNFYEYFEVIHRALATHNRECL 194
Query: 258 NTIKESWG 265
+K+++G
Sbjct: 195 KAVKQAYG 202
>gi|341879672|gb|EGT35607.1| hypothetical protein CAEBREN_08421 [Caenorhabditis brenneri]
Length = 1088
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 148/334 (44%), Gaps = 34/334 (10%)
Query: 34 QPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYL 93
QP KF++ G+ PH P +E F QR+DHF+ + F Q++
Sbjct: 548 QP-KFKKV--LFGRPPHGFLPEPDFNEAASMPSGFEEGTFRQRVDHFNNQNANFFQQKFY 604
Query: 94 INTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESM 151
N W P GP FL G EG W + A ++GA + EHR+YG+S+
Sbjct: 605 KNA-QWAQPG--GPNFLMIGGEGPESSRWVLNENITYLTWAKKYGATVYLLEHRFYGDSL 661
Query: 152 PYGSTEVAYQNATTLSYLTAEQALADFAVFI--TNLKQNLSAEASPVVLFGGSYGGMLAA 209
+ + L+ L + Q L D A FI NLK + ++P + FGGSY G ++A
Sbjct: 662 VGDNND--------LNTLNSLQMLYDLAEFIKSVNLK---TGTSNPWITFGGSYSGAMSA 710
Query: 210 WMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVS 269
WMR +P + +GA+ASS P+ D + +V + + +C + I+ + + +
Sbjct: 711 WMREVFPDMVVGAVASSGPVFAKTDFY---EYLMVVENSIRTYDRTCADRIQSGFNTMRT 767
Query: 270 VGQKENGLLELTKTFHLCREL--NSTE-DLADWLESAYSYLAMVDYPYPSDFMMPLP-GY 325
+ + G L+ F L N T+ D + + Y Y D P GY
Sbjct: 768 MFLTKEGRQNLSDIFQLLPPFGDNVTDIDQHYFFSNVYGNFQGA-VQYSGDNTGPYANGY 826
Query: 326 PIREVCKKIDNAPDATSILERIF---EGVSVYYN 356
I ++CK + N D + L I + ++V+YN
Sbjct: 827 GIPDMCKIMTN--DTNTPLNNIVAFNQFMTVFYN 858
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 73/279 (26%), Positives = 122/279 (43%), Gaps = 30/279 (10%)
Query: 70 TRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS--GF 127
T + + ++H + TFSQRY D+ + R+ FLY GD E +
Sbjct: 46 TGWLDTWINHNTGNQSGTFSQRYFYTQDYAL-HQRVA--FLYISVSGDFETSVITDERNP 102
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI--TNL 185
+ A +FGA + EHR+YG+S P ++ +L++L + QA+ D FI N
Sbjct: 103 IVKTARQFGATVFSLEHRFYGQSRP----NFDKFDSASLTHLNSFQAIQDILHFIRFANN 158
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
K L + +L+G YGG++AA R P + G +ASSAP+ D I+
Sbjct: 159 KFQLDPDVR-WILWGAGYGGIIAAEARKWDPKLVAGVVASSAPLTHKYDFWEFNDQVAII 217
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE-DLAD----WL 300
S + C+ + + + ++ + G ++ F L L+ T + D W+
Sbjct: 218 LSQVGGQ--LCYQKVAQGFADIGQAMRTPQGRANVSDLFGLVPRLDQTNLNYNDVQMFWM 275
Query: 301 E--SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNA 337
S + LA Y +DF + I ++C ID +
Sbjct: 276 SVISPFQTLAQ----YNNDFNL-----SIGDMCTNIDKS 305
>gi|344268151|ref|XP_003405926.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 431
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 78/257 (30%), Positives = 117/257 (45%), Gaps = 38/257 (14%)
Query: 71 RYFEQRLDHFSFA----------------------DLPTFSQRYLINTDHWVGPNRLGPI 108
R+F+Q L+HFS + + QRY IN D + P GP+
Sbjct: 43 RWFQQMLNHFSKNGSSFWSQAVAIEAVTWGQLAKDQIASEEQRYFIN-DIFYKPG--GPV 99
Query: 109 FLYCGNEGDIEWFAVNSGFVWDI-APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLS 167
FL+ E V W A R GA+ + EHR+YG S P G A +L
Sbjct: 100 FLFIKGENAASRQMVTGNNTWTTYAERLGALCLLLEHRFYGHSQPTGDLSTA-----SLR 154
Query: 168 YLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
YL + QALAD F T + + + + V+FGGSYGG LA W R+K P + A+ SSA
Sbjct: 155 YLNSRQALADIVNFRTQIAKTMGLTKNKWVIFGGSYGGSLAVWSRIKRPDLFFAAVGSSA 214
Query: 228 PILQFEDIVP-PETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHL 286
P+L + E FY +++ + C +++++ +V++ + L F L
Sbjct: 215 PMLAKVNFYEYLEIFYRTLTT----YNIECARAVEKAFTLIVALVKIPEYYSMLADDFML 270
Query: 287 CR--ELNSTEDLADWLE 301
C ++NS D A +L
Sbjct: 271 CEPLKINSEMDAAFFLH 287
>gi|336370214|gb|EGN98555.1| hypothetical protein SERLA73DRAFT_183618 [Serpula lacrymans var.
lacrymans S7.3]
gi|336382973|gb|EGO24123.1| hypothetical protein SERLADRAFT_470898 [Serpula lacrymans var.
lacrymans S7.9]
Length = 555
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 96/176 (54%), Gaps = 14/176 (7%)
Query: 59 QRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--G 116
Q ++ + +YF Q LDHFS TF QRY INT H+ P GP+ + G E G
Sbjct: 65 QHSTEKFEEFPEQYFRQPLDHFSNTS-ETFGQRYWINTRHYT-PGAGGPVIVLDGGETSG 122
Query: 117 DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALA 176
+ +++G V +A G + V EHRYYG S+P V+ + L +L EQ+ A
Sbjct: 123 EDRIPFLDTGIVEILARATGGVGVVLEHRYYGSSIP-----VSNFSTDNLRWLNNEQSAA 177
Query: 177 DFAVFITNLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
D A F+ N+K ++L+A +P + +GGSY G AA MR+ YP + GA+ASS
Sbjct: 178 DSANFMANVKFPGIEEDLTAPNTPWIYYGGSYAGARAAHMRVLYPELVYGAIASSG 233
>gi|10140734|gb|AAG13567.1|AC073867_13 putative serine peptidase [Oryza sativa Japonica Group]
Length = 502
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 66/180 (36%), Positives = 93/180 (51%), Gaps = 28/180 (15%)
Query: 69 ETRYFEQRLDHFS-----------FADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGD 117
E R+ +QRLDHFS D F QRY D+ G GP+FL E
Sbjct: 51 EERWMDQRLDHFSPTRPRADVAGGVQDHRQFKQRYYEFADYHAGG---GPVFLRICGESS 107
Query: 118 IEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALAD 177
+ + ++ ++ +FGA +V PEHRYYG+S P+ S L +L+++QAL D
Sbjct: 108 CN--GIPNDYLAVLSKKFGAAVVTPEHRYYGKSSPFESL-----TTENLRFLSSKQALFD 160
Query: 178 FAVF------ITNLKQNLSAE-ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
F I N + N S+ +P +FG SY G L+AW RLK+PH+ G+LASS +L
Sbjct: 161 LVAFRQHYQEILNARYNRSSGFDNPWFVFGVSYSGALSAWFRLKFPHLTCGSLASSGVVL 220
>gi|308451153|ref|XP_003088564.1| hypothetical protein CRE_09925 [Caenorhabditis remanei]
gi|308246539|gb|EFO90491.1| hypothetical protein CRE_09925 [Caenorhabditis remanei]
Length = 947
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 87/324 (26%), Positives = 141/324 (43%), Gaps = 27/324 (8%)
Query: 46 GKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRL 105
G+ PH P + +E F Q+ +HFS D TF Q++ N W P
Sbjct: 408 GRPPHGFLPEPDYNKDATYPPGFEQGTFRQKQNHFSNQDPNTFQQKFFKNA-QWAKPG-- 464
Query: 106 GPIFLYCGNEG--DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA 163
GP FL G EG W A ++GA + EHR+YG+S+ +T+
Sbjct: 465 GPNFLMIGGEGPEGAGWVLNQDITYLTWAKKYGATVYLLEHRFYGDSVVGDNTD------ 518
Query: 164 TTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGAL 223
L++ Q L D A FI + + ++P + FGGSY G L+AWMR +P + +GA+
Sbjct: 519 --FQLLSSLQMLYDLAEFIREINYR-TGTSNPWITFGGSYSGALSAWMREVFPDVVVGAV 575
Query: 224 ASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKT 283
ASS P+ D + +V + + +C + I+ + + ++ + G L+
Sbjct: 576 ASSGPVFAKTDFY---EYLMVVENSIRTYDPTCADRIQSGFNTMRTMFLTKEGRKSLSDL 632
Query: 284 FHLCRELNSTEDLAD---WLESAY-SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPD 339
F L D + + Y ++ V Y + GY I ++CK + N D
Sbjct: 633 FQLDPPFGDNVTDTDQHYFFSNVYGNFQGAVQYS-GDNAGAYANGYGIPDMCKIMTN--D 689
Query: 340 ATSILERIF---EGVSVYYNYTGN 360
+ L I + +S++YN GN
Sbjct: 690 DNTPLNNIVAFNQFMSIFYNGGGN 713
Score = 41.6 bits (96), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 45/99 (45%), Gaps = 4/99 (4%)
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS- 255
VL+G YGG++AA R YP G +ASSAP+ D F + V+ +E S
Sbjct: 23 VLWGAGYGGVIAAEARKWYPDTVAGVIASSAPLTHQYDF---WQFNSHVAMAIAQEGGSL 79
Query: 256 CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
C + + + ++ + G ++ F L L+ T
Sbjct: 80 CSQMVTQGFADIRQAMRTPEGRSNVSDLFQLNPRLDQTN 118
>gi|154413074|ref|XP_001579568.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121913776|gb|EAY18582.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 518
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 62/161 (38%), Positives = 89/161 (55%), Gaps = 6/161 (3%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+T YF+Q LDHF+ +D TF QRY N D + + ++ G E I V G
Sbjct: 20 DTYYFDQFLDHFNTSDNRTFKQRYYYN-DTFCQNTTTKKLIVFIGGEAAITERRVQKGAY 78
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
+A + +V EHRY+GES P+ E+ N L YLT++QALAD A FI + +
Sbjct: 79 MKLAKETDSCVVALEHRYFGESQPF--EELITPN---LKYLTSDQALADLAYFIESFIKI 133
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+++ GGSY G L+++ R+KYPHIA + ASS P+
Sbjct: 134 KYQSRPTILVVGGSYPGTLSSYFRMKYPHIADFSWASSPPL 174
>gi|308478052|ref|XP_003101238.1| CRE-PCP-3 protein [Caenorhabditis remanei]
gi|308263943|gb|EFP07896.1| CRE-PCP-3 protein [Caenorhabditis remanei]
Length = 1095
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 87/324 (26%), Positives = 141/324 (43%), Gaps = 27/324 (8%)
Query: 46 GKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRL 105
G+ PH P + +E F Q+ +HFS D TF Q++ N W P
Sbjct: 554 GRPPHGFLPEPDYNKDATYPPGFEQGTFRQKQNHFSNQDPNTFQQKFFKNA-QWAKPG-- 610
Query: 106 GPIFLYCGNEG--DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA 163
GP FL G EG W A ++GA + EHR+YG+S+ +T+
Sbjct: 611 GPNFLMIGGEGPEGAGWVLNQDITYLTWAKKYGATVYLLEHRFYGDSVVGDNTD------ 664
Query: 164 TTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGAL 223
L++ Q L D A FI + + ++P + FGGSY G L+AWMR +P + +GA+
Sbjct: 665 --FQLLSSLQMLYDLAEFIREINYR-TGTSNPWITFGGSYSGALSAWMREVFPDVVVGAV 721
Query: 224 ASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKT 283
ASS P+ D + +V + + +C + I+ + + ++ + G L+
Sbjct: 722 ASSGPVFAKTDFY---EYLMVVENSIRTYDPTCADRIQSGFNTMRTMFLTKEGRKSLSDL 778
Query: 284 FHLCRELNSTEDLAD---WLESAY-SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPD 339
F L D + + Y ++ V Y + GY I ++CK + N D
Sbjct: 779 FQLDPPFGDNVTDTDQHYFFSNVYGNFQGAVQYS-GDNAGAYANGYGIPDMCKIMTN--D 835
Query: 340 ATSILERIF---EGVSVYYNYTGN 360
+ L I + +S++YN GN
Sbjct: 836 DNTPLNNIVAFNQFMSIFYNGGGN 859
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 79/279 (28%), Positives = 122/279 (43%), Gaps = 28/279 (10%)
Query: 70 TRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS--GF 127
T Y LD F TFSQRY T + ++ FLY GD E +
Sbjct: 46 TGYLNTPLDQFVGNASGTFSQRYFY-TRQYALHQKVA--FLYVSVSGDFETSVITDERNP 102
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI--TNL 185
+ A +FGA + EHRYYG S P N TTL +L + QA+ D FI N+
Sbjct: 103 IVITAKQFGATVFSLEHRYYGGSKP----NFDKFNGTTLRHLNSYQAIMDLNAFIKYANV 158
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ N+ + +L+G YGG++AA R YP G +ASSAP+ D F + V
Sbjct: 159 QFNMDPDCR-WILWGAGYGGIIAAEARKWYPDTVAGVIASSAPLTHQYDF---WQFNDHV 214
Query: 246 SSDFKRESAS-CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAY 304
+ +E S C+ + + + ++ + G ++ F L L+ T ++ +
Sbjct: 215 QTAIMQEGGSLCYQKVAQGFADIRQAMRTPEGRSNVSDLFQLNPRLDQTN--LNYNDIQQ 272
Query: 305 SYLAMVD-----YPYPSDFMMPLPGYPIREVCKKIDNAP 338
YLA++ + +DF + I ++C ID P
Sbjct: 273 FYLAIIAPFQEVIQFNNDFNI-----SIIDLCTSIDKGP 306
>gi|403416809|emb|CCM03509.1| predicted protein [Fibroporia radiculosa]
Length = 562
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 85/261 (32%), Positives = 127/261 (48%), Gaps = 29/261 (11%)
Query: 56 QRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE 115
Q+ + + + +F Q LDHFS TF+QRY INT H+ P P+ + G E
Sbjct: 58 QQPLQSLDEPSEFPAHWFTQPLDHFSKTPH-TFNQRYWINTRHY-KPGSNAPVIVLDGGE 115
Query: 116 --GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQ 173
G+ +++G V +A G + V EHRYYGES+P VA +L +L +Q
Sbjct: 116 TSGEDRIPFLDTGIVEILAKATGGVGVVLEHRYYGESVP-----VANLTTDSLRWLNNDQ 170
Query: 174 ALADFAVFITNLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASS-- 226
+ AD A F+ N+K ++L+A +P + +GGSY G AA MR+ YP + GA+ASS
Sbjct: 171 SAADSANFMANVKFPGIHEDLTAPGTPWIYYGGSYAGARAAHMRILYPDLVYGAIASSGV 230
Query: 227 --APILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF 284
A I +E + +I+ E ++ + E+ ++S+ Q L L F
Sbjct: 231 THAAIEHWE-------YMDIIRRAADPECSANIQSAIEAIDTILSISQLRTPLKGL---F 280
Query: 285 HLCRELNSTEDLADWLESAYS 305
L L ED A L S S
Sbjct: 281 GLAG-LEHDEDFASLLTSPMS 300
>gi|302688585|ref|XP_003033972.1| hypothetical protein SCHCODRAFT_256599 [Schizophyllum commune H4-8]
gi|300107667|gb|EFI99069.1| hypothetical protein SCHCODRAFT_256599 [Schizophyllum commune H4-8]
Length = 546
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 105/201 (52%), Gaps = 24/201 (11%)
Query: 38 FRRAP--RFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLIN 95
FR A + + + EPP R+Q + + ++FEQ LDHF+ TF QRY +
Sbjct: 36 FREAHLRKLNAAVKGIHEPPARRQYDE-----FPEQWFEQPLDHFNNETGDTFLQRYWFS 90
Query: 96 TDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPY 153
H+ P GP+ + G E G+ +++G V + G + V EHRYYGE+ P
Sbjct: 91 KRHYT-PGSGGPVIVLDGGETSGEGRLPFLDTGIVEILTRATGGVGVILEHRYYGETQPV 149
Query: 154 GSTEVAYQNATT--LSYLTAEQALADFAVFITNLK-----QNLSAEASPVVLFGGSYGGM 206
QN TT L +L +Q+ AD A F+ N+K ++L+A +P + +GGSY G
Sbjct: 150 -------QNLTTDSLRFLNNDQSAADSAYFMANVKFDGIDEDLTAPGTPWIYYGGSYAGA 202
Query: 207 LAAWMRLKYPHIAIGALASSA 227
+A MR+ YP + GA+ASS
Sbjct: 203 RSAHMRVLYPDLVYGAIASSG 223
>gi|145491267|ref|XP_001431633.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398738|emb|CAK64235.1| unnamed protein product [Paramecium tetraurelia]
Length = 464
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 87/166 (52%), Gaps = 17/166 (10%)
Query: 70 TRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY-CGNEGDIEWFA--VNSG 126
T +F Q+LDH F QR I + +V ++ + LY CG EW + G
Sbjct: 35 TEWFTQKLDHNDPTSQEVFKQRVHI-YNEYVKDDQPEAVILYICG-----EWTCDGIGKG 88
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
+D A + A+++ EHRYYG+S P+ + L YL QAL D A FIT++K
Sbjct: 89 LTFDAAQQLNAVVLVLEHRYYGQSQPFEDW-----STPNLKYLNIHQALDDIAYFITSIK 143
Query: 187 QNLSAEA---SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
N + +P + GGSY G L+AW R KYPH+ IG LASSA +
Sbjct: 144 ANGNYNIKPDTPWIHLGGSYPGALSAWFRYKYPHLTIGGLASSAVV 189
>gi|390603317|gb|EIN12709.1| peptidase S28 [Punctularia strigosozonata HHB-11173 SS5]
Length = 575
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/182 (36%), Positives = 99/182 (54%), Gaps = 19/182 (10%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNS 125
+E +FEQ LDHF+ D T+ QRY INT H+ PN P+ + G E G +++
Sbjct: 84 FEAYWFEQPLDHFAREDTQTWRQRYWINTRHY-KPNSSAPVIVLDGGETSGANRLPFLDT 142
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G V +A G + V EHRYYG S+P V + L +L Q+ AD A F++++
Sbjct: 143 GIVEILAKATGGVGVVLEHRYYGRSIP-----VDNLSTDALRFLDNAQSAADSARFMSHV 197
Query: 186 K-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA------PILQFED 234
K ++L+A +P + +GGSY G AA M++ YP + GA+ASS I ++ D
Sbjct: 198 KFEGIEEDLTAPHAPWIYYGGSYAGARAAHMKVLYPELVFGAIASSGVTHAALTIWEYMD 257
Query: 235 IV 236
I+
Sbjct: 258 II 259
>gi|395326580|gb|EJF58988.1| hypothetical protein DICSQDRAFT_128566 [Dichomitus squalens
LYAD-421 SS1]
Length = 490
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 81/266 (30%), Positives = 125/266 (46%), Gaps = 23/266 (8%)
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
Q +T++F+Q +DH S TF QRY I+T ++ GPI Y E + ++
Sbjct: 42 QCPADTQWFDQPIDHASTNSSQTFKQRYQIDTSNF---KEGGPILFYQSPEA-TDIACIS 97
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
D A G ++ EHRY+G+S+P+G+ N L T + + D F+
Sbjct: 98 ELLFMDWAKELGGIVATLEHRYFGQSLPFGNNSYTLDN---LKPFTLDNVMQDAVHFLDF 154
Query: 185 LKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI--LQFEDIVPPE-- 239
+K+N++ A S ++ GGSYGG LA R YP GA + P L D V E
Sbjct: 155 VKKNVTGAAKSKTIVAGGSYGGFLAPVFRQNYPDTFFGAWGIAGPFRSLGTVDEVGAELH 214
Query: 240 TFYNIVSSDFKRESASCFNTIKESWG---ELVSVGQKENGLLELTKTFHLCR-ELNSTED 295
+YN V S + S F+ I+ + +L+ G LTK LC NS++D
Sbjct: 215 NWYNYVQSTYAHRSLEAFDRIRNGFAQVKQLIDTGHNAT----LTKELSLCHPPSNSSDD 270
Query: 296 L---ADWLESAYSYLAMVDYPYPSDF 318
L A +L S+Y+ ++ + P+ F
Sbjct: 271 LATFASFLVSSYTTMSQFNGLPPAVF 296
>gi|401398762|ref|XP_003880396.1| protein F23B2.12, partially confirmed by transcript evidence,
related [Neospora caninum Liverpool]
gi|325114806|emb|CBZ50362.1| protein F23B2.12, partially confirmed by transcript evidence,
related [Neospora caninum Liverpool]
Length = 684
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 95/194 (48%), Gaps = 18/194 (9%)
Query: 105 LGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNAT 164
+ P+F+Y G EG + V G + ++ FGA + EHRYYG+S P + V
Sbjct: 222 IRPVFVYIGGEGPLSSMEVKQGLLAEMGEAFGASVYALEHRYYGDSHPRPDSSV-----P 276
Query: 165 TLSYLTAEQALADFAVFITNLK--------QNLSAEASPVVLFGGSYGGMLAAWMRLKYP 216
L +LT+ QAL D A F+ ++K QN+S E PV++FG SY G LAA+ R KYP
Sbjct: 277 NLQWLTSHQALGDLAAFVAHVKREQAEQHPQNVSPEDIPVIVFGCSYPGSLAAYARSKYP 336
Query: 217 HIAIGALASSAPI-----LQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVG 271
+GA++SS+P+ Q D + +S K +A I S E V V
Sbjct: 337 ASILGAISSSSPVEASALFQAFDQTVQRVLPSACTSQIKAATAIVERRIFTSEEEAVKVA 396
Query: 272 QKENGLLELTKTFH 285
K E++ H
Sbjct: 397 AKFGCGAEVSMKTH 410
>gi|221504075|gb|EEE29752.1| serine carboxypeptidase, putative [Toxoplasma gondii VEG]
Length = 738
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/131 (41%), Positives = 76/131 (58%), Gaps = 13/131 (9%)
Query: 107 PIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL 166
PIF+Y G EG + V G + ++ FGA L EHRYYG+S P + V L
Sbjct: 274 PIFVYIGGEGPLSSLEVKQGLLAEMGDIFGASLYALEHRYYGDSHPRPDSSV-----VNL 328
Query: 167 SYLTAEQALADFAVFITNLKQ--------NLSAEASPVVLFGGSYGGMLAAWMRLKYPHI 218
+LT+ QAL D A F+ ++KQ +L+ E PVV+FG SY G LAA+ R KYP
Sbjct: 329 QWLTSHQALGDLAAFVAHVKQQEAEEHPQDLAPEDVPVVVFGCSYPGSLAAYARAKYPAS 388
Query: 219 AIGALASSAPI 229
+GA++SS+P+
Sbjct: 389 ILGAVSSSSPV 399
>gi|221483143|gb|EEE21467.1| serine carboxypeptidase, putative [Toxoplasma gondii GT1]
Length = 738
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/131 (41%), Positives = 76/131 (58%), Gaps = 13/131 (9%)
Query: 107 PIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL 166
PIF+Y G EG + V G + ++ FGA L EHRYYG+S P + V L
Sbjct: 274 PIFVYIGGEGPLSSLEVKQGLLAEMGDIFGASLYALEHRYYGDSHPRPDSSV-----VNL 328
Query: 167 SYLTAEQALADFAVFITNLKQ--------NLSAEASPVVLFGGSYGGMLAAWMRLKYPHI 218
+LT+ QAL D A F+ ++KQ +L+ E PVV+FG SY G LAA+ R KYP
Sbjct: 329 QWLTSHQALGDLAAFVAHVKQQEAEEHPQDLAPEDVPVVVFGCSYPGSLAAYARAKYPAS 388
Query: 219 AIGALASSAPI 229
+GA++SS+P+
Sbjct: 389 ILGAVSSSSPV 399
>gi|237840317|ref|XP_002369456.1| serine carboxypeptidase S28, putative [Toxoplasma gondii ME49]
gi|211967120|gb|EEB02316.1| serine carboxypeptidase S28, putative [Toxoplasma gondii ME49]
Length = 738
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/131 (41%), Positives = 76/131 (58%), Gaps = 13/131 (9%)
Query: 107 PIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL 166
PIF+Y G EG + V G + ++ FGA L EHRYYG+S P + V L
Sbjct: 274 PIFVYIGGEGPLSSLEVRQGLLAEMGDIFGASLYALEHRYYGDSHPRPDSSV-----VNL 328
Query: 167 SYLTAEQALADFAVFITNLKQ--------NLSAEASPVVLFGGSYGGMLAAWMRLKYPHI 218
+LT+ QAL D A F+ ++KQ +L+ E PVV+FG SY G LAA+ R KYP
Sbjct: 329 QWLTSHQALGDLAAFVAHVKQQEAEEHPQDLAPEDVPVVVFGCSYPGSLAAYARAKYPAS 388
Query: 219 AIGALASSAPI 229
+GA++SS+P+
Sbjct: 389 ILGAVSSSSPV 399
>gi|330846607|ref|XP_003295109.1| hypothetical protein DICPUDRAFT_160267 [Dictyostelium purpureum]
gi|325074265|gb|EGC28366.1| hypothetical protein DICPUDRAFT_160267 [Dictyostelium purpureum]
Length = 547
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 79/156 (50%), Gaps = 5/156 (3%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
F+Q ++H S+ + TF QRY +N + +F EG + VN +IA
Sbjct: 9 FQQTINHLSYDTIGTFEQRYSVNKKFLASNGKPKAVFFLVSGEGPLSSEIVNHNPFVNIA 68
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
A++V E RYYGESMP+ + N + ++YLT +Q L D A F
Sbjct: 69 NETNALIVALELRYYGESMPFPN-----MNNSNMAYLTTDQILEDLANFQVYFTNKYQLG 123
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAP 228
++ G SY G ++AW RLKYPH+ A+ASS+P
Sbjct: 124 DIKWIIMGCSYAGTISAWYRLKYPHLVTAAIASSSP 159
>gi|449701650|gb|EMD42428.1| serine carboxypeptidase (S28) family protein, partial [Entamoeba
histolytica KU27]
Length = 195
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 89/162 (54%), Gaps = 8/162 (4%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E+ + LDHF+ + F +Y +N + P+F+ G EG + + +V
Sbjct: 38 ESMTYTVPLDHFNANNQNDFDIQYFVNKKFLDANDPNAPLFVLLGGEGPASPKVLQNNYV 97
Query: 129 WD-IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
D +A + +++ EHR+YG S P S E+ L Y TAEQAL D+ I+++++
Sbjct: 98 IDSLAKKHKGLMLSVEHRFYGASTP--SLEM-----DKLIYCTAEQALMDYVEVISHVQE 150
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+ PV++ GGSY G LAAWMR KYP++ GA ASSAP+
Sbjct: 151 ENNLVGHPVIVLGGSYSGNLAAWMRQKYPNVVEGAWASSAPV 192
>gi|323454022|gb|EGB09893.1| hypothetical protein AURANDRAFT_10784, partial [Aureococcus
anophagefferens]
Length = 477
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 96/310 (30%), Positives = 144/310 (46%), Gaps = 52/310 (16%)
Query: 72 YFEQRLDHF-SFADLPT--FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
Y + LDHF S PT +SQRY ++ W G P+FLY G EG + F+
Sbjct: 8 YHDALLDHFESDVASPTRKWSQRYYVDESFWGGAGF--PVFLYIGGEGPQGPMSPRM-FI 64
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA---VFITNL 185
+ A A+LV EHR+YGES+P A + L YL + QALAD A V++++
Sbjct: 65 YAQAKEHRALLVTLEHRFYGESLP-----TANMDDANLRYLASAQALADLARFRVYVSSY 119
Query: 186 KQN---------LSAEASPV-----VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL- 230
+ L +ASP + FGGSY G LAAW + KYP + G +ASSAP+
Sbjct: 120 SPDAPDAASTPPLELKASPGMDSKWIAFGGSYPGDLAAWFKEKYPFLTAGVVASSAPVFA 179
Query: 231 -----QFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFH 285
Q+ ++V Y ++ S SC + ++ +LV+ E G K
Sbjct: 180 EYDFAQYSEVVGDALAYPLIGG-----SPSCADAVRRGVEDLVAA--LEAGAAP-PKALE 231
Query: 286 LCRELNSTEDLADWLESAY-SYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSIL 344
C + S D A + S + ++ +V Y + P Y + +VC +D AP S +
Sbjct: 232 PCGSIASGVDRAQYYSSIFGNFQGVVQYN-----LEAGPPY-VSDVCDAVDGAP---SPI 282
Query: 345 ERIFEGVSVY 354
E + S++
Sbjct: 283 EALAAATSLF 292
>gi|326427042|gb|EGD72612.1| hypothetical protein PTSG_04347 [Salpingoeca sp. ATCC 50818]
Length = 482
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/159 (37%), Positives = 91/159 (57%), Gaps = 13/159 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPI-FLYCGNEGDIEWFAVNSGFVWD 130
YF+ LDHF + PTF RY I+ + + G + F Y G EG + + +V
Sbjct: 62 YFDFFLDHFDHSS-PTFRGRYYIDDSQF----KNGSVCFFYMGGEGPNT--GIRNDYVSY 114
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
+A ++ A++V EHR+YG+S+P+ V T L YLT+ QALAD A I ++ + +
Sbjct: 115 LAKQYKALIVSIEHRFYGDSVPFDDFSV-----TNLEYLTSRQALADAAQLIKHVNSSDT 169
Query: 191 AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+ S FGGSY G L+AW R+KYP + +G+L+SS +
Sbjct: 170 YKCSAWFAFGGSYSGALSAWFRVKYPDVIVGSLSSSGVV 208
>gi|157119585|ref|XP_001659436.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108875273|gb|EAT39498.1| AAEL008698-PA [Aedes aegypti]
Length = 500
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 85/304 (27%), Positives = 138/304 (45%), Gaps = 33/304 (10%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
F R+DHF+ + TF Y N D + P GPIF++ G ++ + + G DIA
Sbjct: 58 FRTRVDHFNPQNRDTFEFEYYSN-DEFYRPG--GPIFIFVGGNWPLDQYYIEHGHFHDIA 114
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN-LSA 191
A + EHRYYG S P + V L YLT EQA+ D A I +++ N +
Sbjct: 115 NYENAWMFANEHRYYGHSFPVPNLSVE-----NLQYLTVEQAMVDLAELIYHVRHNVVRD 169
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI---LQFEDIVPPETFYNIVSSD 248
+ + V+L G Y G +A WMR +YPH+ GA SS I F++ Y + +
Sbjct: 170 DDARVILLGTGYAGAIATWMRQRYPHLVEGAWVSSGQIEARFNFKE-------YAMEIGE 222
Query: 249 FKRE--SASCFNTIKESWGELVSVGQKENGLLE-LTKTFHLCRELNSTEDLADWLESAYS 305
R+ + C++ I ++ ++ + GL +T F+ C ++ TE + D +
Sbjct: 223 LIRDYGTNECYSQIWRAFRTAENL--MDAGLANTVTDLFNTCERVD-TETMLD----VET 275
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQ 365
+ V + E+C+ ++N+ +AT L I + +Y Y +DC
Sbjct: 276 FFYNVKEALQRAILSEQDTETTEELCENLNNSTEATD-LHTIANWIEDFYYY---LDCMP 331
Query: 366 LDDD 369
D D
Sbjct: 332 FDFD 335
>gi|357626528|gb|EHJ76588.1| hypothetical protein KGM_15000 [Danaus plexippus]
Length = 413
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 131/281 (46%), Gaps = 27/281 (9%)
Query: 91 RYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGES 150
RY N +W GPIF++ G E G + ++A + EHRYYGES
Sbjct: 2 RYFENVLYW---QENGPIFVFLGGESASSPQWTRFGIIHELAKESQGAMYVTEHRYYGES 58
Query: 151 MPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAW 210
P T+ YL++ QALAD A I LK + S VV+ GGSY G LAAW
Sbjct: 59 KPKNLTK-----EDQFKYLSSRQALADIAKLIHYLKLLPMYKNSKVVVIGGSYAGNLAAW 113
Query: 211 MRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR-ESASCFNTIKESWGELVS 269
M++ YP + A+ASSAP+L +D + V+ D++ + C + IK +
Sbjct: 114 MKVLYPDLVDAAVASSAPVLAKKDFFE---YLEKVTEDYETYGTHGCSDKIKNIFDRFHQ 170
Query: 270 VGQKENGLLELTKTFHLCR--ELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPI 327
+ Q G+ +L K ++C +++ E+ A + E S + M + Y S I
Sbjct: 171 LLQSSEGIKQLKKEENICDSCDMSVIENQAVFFEVKTS-IFMSNSQYGS-------TKTI 222
Query: 328 REVCKKI-DNAPDATSILERIFEGVSVYYNYTGNVDCFQLD 367
++ C+K+ D + D S+ + + Y+ ++C+ D
Sbjct: 223 KQHCEKLSDVSYDTKSLTDNSMLPII----YSEKLNCYDYD 259
>gi|325186495|emb|CCA21035.1| lysosomal ProX carboxypeptidase putative [Albugo laibachii Nc14]
Length = 263
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 51/129 (39%), Positives = 75/129 (58%), Gaps = 8/129 (6%)
Query: 255 SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNS---TEDLADWLESAYSYLAMVD 311
+C I+++W L S+ + E G L+L K FHLCR L + LA WL +A+S LAM +
Sbjct: 8 NCAEKIRQAWPALFSMAENEPGRLQLAKIFHLCRPLQNETGIHHLALWLLNAFSVLAMRN 67
Query: 312 YPYPSDFM----MPLPGYPIREVCKKI-DNAPDATSILERIFEGVSVYYNYTGNVDCFQL 366
YPYPS ++ LP +P++ C + D PD+ +++ +FE VSV YN T +DC L
Sbjct: 68 YPYPSSYLSNGEAQLPAWPMQSACSFLADQRPDSIALISSLFEAVSVLYNATKKMDCVDL 127
Query: 367 DDDPHGLDG 375
D +DG
Sbjct: 128 PRDMTSIDG 136
>gi|219130565|ref|XP_002185433.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403147|gb|EEC43102.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 538
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 152/351 (43%), Gaps = 50/351 (14%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGP---NRLGPIFLYCGNEGDIEWFAVNS 125
+TRY Q+L+HF +D +F+QR+ +D + NR FL G EG A++
Sbjct: 60 QTRYVNQQLNHFDASDTRSFAQRFFY-SDRYARAREENRNTYAFLCVGGEGP----ALDE 114
Query: 126 GFVWDIAPRFGAMLVFP---------------EHRYYGESMPYGSTEVAYQNATT----- 165
+ D G ML EHRYYGES P +N TT
Sbjct: 115 SVLVDSVHCTGDMLELAHILFEDGHKVHLYALEHRYYGESYPVFREGGCSKNRTTSPVTN 174
Query: 166 --LSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGAL 223
L YL++ QALAD A F+ N + V FGGSY GM+AAW R KYPH+ A+
Sbjct: 175 QHLVYLSSTQALADLAHFV-NSRSLDGGTNIKWVTFGGSYPGMMAAWARSKYPHLIHAAV 233
Query: 224 ASSAPILQFEDIVPPETFYN-IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK 282
+SSAP+ D + +++S S+ C + + GE+ + L
Sbjct: 234 SSSAPVQAVLDFSAYNNHVSKVLASANVGGSSECLAVFQAAHGEVTRMVHDATQHAGLAD 293
Query: 283 TFHLCRELNSTEDLADWLESAYSYLAMVD-YPYPSDFMMPLPGYPIREVCKKIDNAPDA- 340
F LC N+T L D + + +VD + ++ I ++C+ + N +
Sbjct: 294 MFGLC---NATSLLEDRNKELFLGDGLVDLHTQGNNPSCDRDLCNIGKICRTLLNDAKSF 350
Query: 341 ------TSILERIFE-GVSVYYNYTGNVDCFQLDDDPHGLDG----WNWQV 380
++ +R + G + ++TG +D + D G++G W WQ
Sbjct: 351 KPVTALANLAQRQRDRGACINIDWTGTLD--YISDPVRGVEGGLRSWLWQT 399
>gi|262176822|gb|ACY27467.1| serine protease Pro1 [Plasmodiophora brassicae]
Length = 467
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 83/261 (31%), Positives = 123/261 (47%), Gaps = 24/261 (9%)
Query: 61 QQQQQYRYETRYFEQRLDHFSFADLPT-FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE 119
Q + E + +Q +DH + F QRY W GP+ GP+ L E
Sbjct: 15 QGHAYVQTEHVFPDQLVDHTAIGGGGARFLQRYFRIDQFWSGPD--GPVILQLCGEYTCA 72
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
F +A R+GA+++ EHRY+G+S P+ + + L+YLT QAL+D A
Sbjct: 73 GVTDGRQFPSALAERYGALVLVLEHRYFGKSSPF-----SVLSPRNLTYLTTFQALSDIA 127
Query: 180 VFITNLKQNL---SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
F T+ Q + A A+ + GGSY G LAAW RLKYPH+ GALASSA + F +
Sbjct: 128 CF-TDWYQRVHIGRANANKWITIGGSYPGALAAWYRLKYPHLTAGALASSAVVAPFAEF- 185
Query: 237 PPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
PE + SS +C + +++ + Q+ L + K C +L+ + L
Sbjct: 186 -PEFDEQVASS----AGPACTHALQDITAMVEGALQEGGRLADEMKALFSCSQLSDADFL 240
Query: 297 ---ADWLESAYSY---LAMVD 311
AD + A Y LA+ D
Sbjct: 241 YLIADAMAEAIQYGHRLALCD 261
>gi|95768136|gb|ABF57331.1| protease, serine, 16 [Bos taurus]
Length = 230
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/145 (40%), Positives = 86/145 (59%), Gaps = 10/145 (6%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HW + GP+FL+ G EG + +V G ++
Sbjct: 59 WLEQPLDPFNASDRRSFLQRYWVNDQHWTSQD--GPVFLHLGGEGSLGPGSVMRGHPANL 116
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NL 189
AP +GA+++ EHR+YG S+P ++A L +L++ ALAD A L + N+
Sbjct: 117 APIWGALVISLEHRFYGLSIPAEGLDMA-----QLRFLSSRHALADAASARLTLSRLFNV 171
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLK 214
S+ SP + FGGSY G LAAW RLK
Sbjct: 172 SS-TSPWICFGGSYAGSLAAWARLK 195
>gi|299743114|ref|XP_001835550.2| hypothetical protein CC1G_13075 [Coprinopsis cinerea okayama7#130]
gi|298405506|gb|EAU86268.2| hypothetical protein CC1G_13075 [Coprinopsis cinerea okayama7#130]
Length = 564
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 71/231 (30%), Positives = 116/231 (50%), Gaps = 23/231 (9%)
Query: 62 QQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIE 119
+++ Y + ++FEQ LDHF + QR+ +N+ H+ P P+ + G E G+
Sbjct: 68 KEKNYDFRAQWFEQPLDHFDNTSDHRWHQRFWVNSRHY-KPRPGAPVIVLDGGETSGEER 126
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
+++G V +A G + + EHRYYG+S+P VA + +L +L Q+ AD A
Sbjct: 127 LPFLDTGIVNILAKATGGIGIVLEHRYYGDSIP-----VANFSTDSLRWLNNAQSAADSA 181
Query: 180 VFITNLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA---PILQ 231
F+ N+K ++++A +P + +GGSY G AA M++ YP I GA+ASSA LQ
Sbjct: 182 NFMRNVKLDSIQEDITAPGTPWIYYGGSYAGARAAHMKIIYPDIVYGAIASSAVTHATLQ 241
Query: 232 -------FEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKEN 275
D P+ NIV+S ++ K L +G+ E+
Sbjct: 242 AWEYMTIIRDAADPKCSANIVNSIATIDTILQRGVFKRQLKSLFGLGELEH 292
>gi|393233661|gb|EJD41230.1| peptidase S28 [Auricularia delicata TFB-10046 SS5]
Length = 497
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 82/247 (33%), Positives = 122/247 (49%), Gaps = 26/247 (10%)
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFA 122
+ ++ R+F Q LDHF TF QRY +N H+ GP+ + G E G+
Sbjct: 5 EAKFPARWFRQPLDHFDRKRRDTFLQRYWVNDRHY---RSGGPVIVLDGGETSGENRLPF 61
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
+++G V +A + V EHRYYG S+P V +L +L +Q+ AD A F+
Sbjct: 62 LDTGIVDILAKATHGLGVVLEHRYYGRSIP-----VLNLTTDSLRWLDNKQSAADSATFM 116
Query: 183 TNLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP 237
N+K ++L+A +P + +GGSY G AA MR+ YP + GA+ASSA + IV
Sbjct: 117 ANVKFEGISEDLTAPGTPWIYYGGSYAGARAAHMRVLYPDLTFGAIASSA--VTHAAIVY 174
Query: 238 PETFYNIVSSDFKRESA--SCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED 295
E +Y ++ R+SA C ++ S + V Q L K F L +L +D
Sbjct: 175 WE-YYEVI-----RQSAPTGCIARLERSIDIIDRVLQVPVLRRPLKKLFGL-EDLEHDDD 227
Query: 296 LADWLES 302
A LES
Sbjct: 228 FASVLES 234
>gi|403343430|gb|EJY71042.1| Serine carboxypeptidase S28 family protein [Oxytricha trifallax]
Length = 494
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 68/171 (39%), Positives = 83/171 (48%), Gaps = 15/171 (8%)
Query: 73 FEQRLDHFS---FADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVW 129
F Q +DHF D T+ QRY IN H+ N GP+FLY EG + + G+
Sbjct: 47 FTQIVDHFPPTPTNDAATYQQRYFINDKHFDKDN--GPVFLYICGEGTCK-PPSDRGYPM 103
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI----TNL 185
+A FGAM EHR+YG S P A + L YLTAEQALAD A FI +
Sbjct: 104 QLAIEFGAMFYAVEHRFYGTSQP-----TADWSTDNLKYLTAEQALADLAGFIDAQNAAI 158
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
+ A V GGSY G L+AW + YP A A +SS IL D
Sbjct: 159 IKQYGGAARKWVTIGGSYPGALSAWFKQAYPDKAAAAWSSSGVILPIRDFT 209
>gi|312066657|ref|XP_003136374.1| hypothetical protein LOAG_00786 [Loa loa]
gi|307768457|gb|EFO27691.1| hypothetical protein LOAG_00786 [Loa loa]
Length = 512
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 80/287 (27%), Positives = 126/287 (43%), Gaps = 28/287 (9%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRL--GPIFLYCGNE--GDIEWFAVNSGFV 128
+Q+LDHF D + Q Y H P + G +FL G E D W N G
Sbjct: 67 LKQKLDHFDNNDGRKWRQFYT----HRKSPYQRSDGAVFLIVGGEDGADRAWL-TNQGLP 121
Query: 129 W-DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+ +A + A + EHR+YG S P T + +L YL A+QA+ D F+ + Q
Sbjct: 122 YVQLADQINASIFMLEHRFYGSSRPTIDTSI-----QSLKYLDAKQAVEDIDRFVQEINQ 176
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI---LQFEDIVPPETFYNI 244
+ FGGSY G LAAW R K+P A+ASSAP+ L F+D F
Sbjct: 177 REKLTNPKWITFGGSYSGNLAAWAREKHPRSIRAAVASSAPLQAKLNFKD------FERQ 230
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFH----LCRELNSTEDLADWL 300
+ +++ C I++ + ++ + G +L K F L R S +D+A++
Sbjct: 231 IEKIIEKKDTKCVAVIRKLFQKMRQMSTTHEGRRKLVKIFRLDDSLIRPAVSDKDVANFF 290
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERI 347
+Y++ + + + +C K+ ++P SI E I
Sbjct: 291 LVISNYISFIVMHSGINVKDHRDLLTLDVMCSKLTHSPSLESIRELI 337
>gi|324510461|gb|ADY44374.1| Serine protease [Ascaris suum]
Length = 529
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 128/284 (45%), Gaps = 19/284 (6%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI--EWFAVNSG 126
ET Y Q +DH S L T+ QR N + + IFL G E +W A +
Sbjct: 37 ETFYVTQPIDH-SNPPLGTWQQRVQYNPRFYRNES---IIFLLIGGESPAAEKWVAQPNI 92
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
A ++GA + EHR++G+S PY + ++L Y T +QAL D A FI +
Sbjct: 93 TYLRWAEKYGAAVFQLEHRFFGKSRPYNDLKT-----SSLKYCTVDQALEDLASFIRQMN 147
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
V FGGSY G L+AW +++YP + +GA+ASSAP+ D + ++
Sbjct: 148 AKYGYVNPRWVTFGGSYPGSLSAWFQVRYPDLTVGAVASSAPLTFLLDYY---GYAMVME 204
Query: 247 SDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSY 306
+ + SA C I + +++ G EL+ L N T L + +Y
Sbjct: 205 NVIRNTSAECHEKIGNAITVILNKALTVAGREELSTKLKLKPAFNETTLTVRDLHNMMAY 264
Query: 307 L-----AMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILE 345
L +V Y Y + + + G+ +R +C I ++ +++
Sbjct: 265 LFGGLQNVVQYTYDARNSITMGGFNVRNMCNAITSSTSTDPVIQ 308
>gi|326436306|gb|EGD81876.1| thymus specific serine peptidase [Salpingoeca sp. ATCC 50818]
Length = 500
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 72/228 (31%), Positives = 106/228 (46%), Gaps = 19/228 (8%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG-----DIEWFAVNS 125
RYF Q DHF ++ T+ Q Y +N W G + P+FL G EG + +V+
Sbjct: 46 RYFTQWQDHFDGTNVNTWQQAYYVNDTFWKG-DANAPVFLCVGGEGPPIDGSVVVSSVHC 104
Query: 126 GFVWDIAPRFGAMLVFPEHRYYG-ESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
++ P GA++ EHRYYG +M + L +L++ QALAD A F
Sbjct: 105 NGAVEMLPETGAIMFAVEHRYYGCHNMSACPVTSFLKPKDALRFLSSRQALADLAGFHAY 164
Query: 185 LKQNLSAE-ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
+ + V FGGSY GMLA W RLK+PH+ ++ASSAP+ D+V YN
Sbjct: 165 ATATYGLKPTNKWVSFGGSYPGMLAGWFRLKFPHLVHASVASSAPVQAIVDMVG----YN 220
Query: 244 -------IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF 284
VS++ S +C I + + + ++G L F
Sbjct: 221 DVVAEAYAVSNNNVGGSPACRKAIADGHAMIGQMFSSDSGRTRLANLF 268
>gi|302790399|ref|XP_002976967.1| hypothetical protein SELMODRAFT_416857 [Selaginella moellendorffii]
gi|300155445|gb|EFJ22077.1| hypothetical protein SELMODRAFT_416857 [Selaginella moellendorffii]
Length = 982
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 76/214 (35%), Positives = 106/214 (49%), Gaps = 31/214 (14%)
Query: 17 VITIVIISILSPLSLAAQP-SKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQ 75
++ V+++ + S A P S+ P F LP + P R + + + TR Q
Sbjct: 5 LVCFVLLATSTSFSSAFVPASRLGFKPEF---LPAGSRSPPRGK----ETVNFFTR---Q 54
Query: 76 RLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRF 135
+LDHF+ D FSQ+YL D + N GPIFL E V + +V +A F
Sbjct: 55 KLDHFAPEDPRVFSQKYLELLDFFRPRN--GPIFLVMCGESTCTGNYVTT-YVGTLAESF 111
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
GA +V EHRYYG S P+ + N L YLT++Q+L D AVFI +
Sbjct: 112 GAAIVTVEHRYYGHSSPF-----QHLNLHNLKYLTSKQSLFDHAVFIDYYQAR------- 159
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
GSY G L+AW RLK+PH+ G+ ASSA +
Sbjct: 160 -----GSYAGALSAWFRLKFPHLVAGSWASSAVV 188
>gi|119608755|gb|EAW88349.1| dipeptidyl-peptidase 7 [Homo sapiens]
Length = 327
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 95/165 (57%), Gaps = 9/165 (5%)
Query: 206 MLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWG 265
ML+A++R+KYPH+ GALA+SAP+L + F+ V++DF+ +S C ++E++
Sbjct: 1 MLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTADFEGQSPKCTQGVREAFR 60
Query: 266 ELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLESAYSYLAMVDYPYPSDFMMP 321
++ + + G + + F C+ L+ +DL + +A++ LAM+DYPYP+DF+ P
Sbjct: 61 QIKDLFLQ--GAYDTVRWEFGTCQPLSDEKDLTQLFMFARNAFTVLAMMDYPYPTDFLGP 118
Query: 322 LPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQL 366
LP P++ C ++ + + L + + YN +G+ C+ +
Sbjct: 119 LPANPVKVGCDRLLSEAQRITGLRAL---AGLVYNASGSEHCYDI 160
>gi|157119587|ref|XP_001659437.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108875274|gb|EAT39499.1| AAEL008702-PA [Aedes aegypti]
Length = 499
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 85/314 (27%), Positives = 143/314 (45%), Gaps = 29/314 (9%)
Query: 62 QQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWF 121
+ + R F R+DHF+ TF Y N D + P GPIF+ G + +
Sbjct: 47 KSENSRTTGHLFRTRVDHFNPQKRDTFQFEYFSN-DQYYRPG--GPIFIVVGGNFPVSPY 103
Query: 122 AVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF 181
+ G DIA A + EHR+YG SMP + +++ +N L YLT EQ + D A +
Sbjct: 104 FLEHGHFHDIAFYENAWMFTNEHRFYGNSMP--TEDLSVEN---LRYLTVEQTMVDLAEW 158
Query: 182 ITNLKQNLSAEASP-VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI---LQFEDIVP 237
I +L+QN+ + + V+L G Y G +A WMR +YPH+ GA SS I F++
Sbjct: 159 IFHLRQNVVRDQNARVILLGTGYSGAIATWMRQRYPHLVEGAWVSSGQIEARFNFKEYAE 218
Query: 238 P--ETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED 295
E + S++ + F T + L+ G +++ F+ C + T+D
Sbjct: 219 EVGELIRDYGSNECYSQIWRAFRTAE----NLIDAGLGST----VSELFNTCEPI-VTDD 269
Query: 296 LADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYY 355
+ + ++ V + E+C++++N+ +AT L+ I V +Y
Sbjct: 270 IT--MLDVETFFWHVKTALQRGVLDEQDTDTTNELCERLNNSTEATD-LQTIANWVHEFY 326
Query: 356 NYTGNVDCFQLDDD 369
++ +DC D D
Sbjct: 327 DF---LDCMPFDFD 337
>gi|268535774|ref|XP_002633022.1| Hypothetical protein CBG21794 [Caenorhabditis briggsae]
Length = 1277
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 96/330 (29%), Positives = 151/330 (45%), Gaps = 27/330 (8%)
Query: 36 SKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLIN 95
SKF++ F+G+ PH P + +ET F QR DHF+ + F Q++ N
Sbjct: 729 SKFKKK-MFLGRPPHGFLPEPDYNKDGVYPPGFETGTFYQRQDHFNNQNPVHFQQKFYKN 787
Query: 96 TDHWVGPNRLGPIFLYCGNEGDIEW-FAVNSGFVW-DIAPRFGAMLVFPEHRYYGESMPY 153
+ W P GP FL G EG + +N W A ++GA + EHR+YG+S
Sbjct: 788 S-QWAQPG--GPNFLMIGGEGPEGPGWVLNEQLTWIQYAKKYGATVYILEHRFYGDS--- 841
Query: 154 GSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRL 213
++ N + L + Q L D A FI + N A A P + FGGSY G L+AWMR
Sbjct: 842 -KIDI---NNSNFYLLHSLQMLYDLAEFIKAVNINSPAPA-PWITFGGSYSGALSAWMRE 896
Query: 214 KYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQK 273
+P + IGA+ASS P+ D + +V + +C + I+ + + ++
Sbjct: 897 VFPELVIGAVASSGPVFAKTDFY---EYLMVVEKSIRTYDKTCADRIQSGFNTMRTMFLT 953
Query: 274 ENGLLELTKTFHL---CRELNSTEDLADWLESAY-SYLAMVDYPYPSDFMMPLPGYPIRE 329
+ G L+ F L E + D + + Y ++ V Y + GY I +
Sbjct: 954 KEGRQNLSDLFQLKPAFGENVTDTDQHYFFSNIYGNFQGAVQYS-GDNAGAYANGYGIAD 1012
Query: 330 VCKKIDNAPDATSILERIF---EGVSVYYN 356
+CK + N D+ L+ I E ++V+YN
Sbjct: 1013 MCKIMTN--DSNIPLDNIVQFNEFMTVFYN 1040
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 73/247 (29%), Positives = 110/247 (44%), Gaps = 18/247 (7%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN--SG 126
T Y Q LDHF+ TF Q Y T H+ R FLY GD E ++ +
Sbjct: 228 NTGYMIQTLDHFNSRSNETFVQTYYY-TQHFALHQRTA--FLYVSVSGDFETTVISDENN 284
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI--TN 184
V A +FGA L EHRYYG+S P V ++ L +L + QA+ D FI N
Sbjct: 285 PVVKSARQFGATLFSLEHRYYGQSKP----NVEKFDSFNLRFLNSFQAIQDIVAFIKYAN 340
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNI 244
+ NL + VL+G YGG++AA R P + G +ASS P+ D F +
Sbjct: 341 KQFNLDPDVR-WVLWGAGYGGVIAAEARKWNPDVVAGVIASSTPLTHEYDF---WQFNDH 396
Query: 245 VSSDFKRESAS-CFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
V +E C+ + + + ++ + G ++ F L L+ T ++ +
Sbjct: 397 VQMAISQEGGQLCYQKVAQGFADIRQAMRTPEGRSNISDLFQLNPRLDQTN--LNYNDVQ 454
Query: 304 YSYLAMV 310
YLA++
Sbjct: 455 MFYLAII 461
>gi|407849017|gb|EKG03883.1| serine carboxypeptidase S28, putative [Trypanosoma cruzi]
Length = 483
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 68/214 (31%), Positives = 100/214 (46%), Gaps = 10/214 (4%)
Query: 17 VITIVIISILSPLSLAAQPSKFRRAPRFVGKLPH-LTEPPQRQQRQQQQQYRYETRYFEQ 75
++ + ++ L + PS+ R + L L E Q++Q+ + RY+ Q
Sbjct: 16 LLAVAFSGPVTALHPSVIPSRARHGEVYRAALRRSLKEGSGGQEKQRHDVHNNAARYYNQ 75
Query: 76 RLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRF 135
R+DH + A L TF QR+ ++ W GP L EG + GFV +
Sbjct: 76 RVDH-ADATLGTFRQRWWVDRSSW--DVNSGPAILLVNGEGTAHGLP-DGGFVGEYGKSV 131
Query: 136 GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP 195
A++ EHRYYGESMP T N + L YLT E ALAD F ++ + +
Sbjct: 132 KAIVFSLEHRYYGESMPAPLT-----NRSMLKYLTVENALADLQAFKKYAEKKVVKKKVK 186
Query: 196 VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
++ GGSY G L+AW R KYP A +SS +
Sbjct: 187 WLIVGGSYAGALSAWARAKYPGDFDAAWSSSGVV 220
>gi|384485422|gb|EIE77602.1| hypothetical protein RO3G_02306 [Rhizopus delemar RA 99-880]
Length = 242
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/171 (36%), Positives = 88/171 (51%), Gaps = 17/171 (9%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVN 124
+Y +++Q +DHF + TF RY NT+ W P GP+ +Y E D F V
Sbjct: 13 KYGPFFYDQPVDHF-LENSTTFKHRYWANTE-WYQPG--GPVLIYNAGETAADQRSFLVI 68
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
F+ ++ +++ EHR+YG S+P + A L+ L QAL D A FI
Sbjct: 69 DSFMAELTKSLNGIIIVMEHRFYGLSLPSSNF-----TAKELATLNTAQALEDIASFIRY 123
Query: 185 LK-QNLSAEASPV-----VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+K N + P +++GGSY G LAAWMRLKYP I A+ SSAP+
Sbjct: 124 VKIPNFEVDLPPAPETRYIVYGGSYSGNLAAWMRLKYPDIVFAAVPSSAPV 174
>gi|66816525|ref|XP_642272.1| hypothetical protein DDB_G0278299 [Dictyostelium discoideum AX4]
gi|60470342|gb|EAL68322.1| hypothetical protein DDB_G0278299 [Dictyostelium discoideum AX4]
Length = 635
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 78/156 (50%), Gaps = 5/156 (3%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
F+Q ++H S+ + TF QR+ +N + +F EG + VN IA
Sbjct: 76 FQQTINHLSYDTIGTFEQRFSVNKKFVPINGKPKAVFFLVSGEGPLSSEIVNHNPFVQIA 135
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
A++V E RYYGESMP+ + N + ++YLT +Q L D A F
Sbjct: 136 NETQALIVALELRYYGESMPFLN-----MNNSNMAYLTTDQILEDLATFQVFFTNKYQLN 190
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAP 228
++ G SY G ++AW RLKYPH+ A+ASS+P
Sbjct: 191 DIKWIIMGCSYAGTISAWYRLKYPHLVTAAIASSSP 226
>gi|170045806|ref|XP_001850485.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868713|gb|EDS32096.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 485
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/176 (34%), Positives = 88/176 (50%), Gaps = 9/176 (5%)
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGN 114
P+ + + Q R F R+DHF+ + TF +Y N D + P GPIF++ G
Sbjct: 42 PEVPKDYDRSQSRSIGTMFRTRVDHFNPQNRDTFEFQYYSN-DEYYQPG--GPIFIFVGG 98
Query: 115 EGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA 174
+E + + G DIA A L EHRYYG S+P V + L +LT EQA
Sbjct: 99 NWPVEQYYIEHGHFHDIAYYENAWLFANEHRYYGSSLP-----VEDLSTPNLRFLTVEQA 153
Query: 175 LADFAVFITNLKQNLSAEASP-VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
L D I +++ N+ + + V+L G Y G +A WMR +YPH+ G+ SS +
Sbjct: 154 LVDLGELIYHIRHNVVRDDNARVILLGVGYAGAIATWMRQRYPHLVDGSWVSSGQV 209
>gi|71663192|ref|XP_818592.1| serine carboxypeptidase S28 [Trypanosoma cruzi strain CL Brener]
gi|70883852|gb|EAN96741.1| serine carboxypeptidase S28, putative [Trypanosoma cruzi]
Length = 483
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 79/275 (28%), Positives = 121/275 (44%), Gaps = 19/275 (6%)
Query: 14 LSPVITIVIISILSPLSL---AAQPSKFRRAPRFVGKLPH-LTEPPQRQQRQQQQQYRYE 69
L V+ ++ ++ P++ + PS+ R + L L E Q++ + +
Sbjct: 10 LMVVLGLLAVAFSGPVTALHPSVIPSRARHGEVYRAALRRSLKEGSGGQEKLRHDVHNNA 69
Query: 70 TRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVW 129
RY+ QR+DH L TF QR+ ++ W + GP L EG + GFV
Sbjct: 70 ARYYNQRVDHADVT-LGTFRQRWWVDRSSWDANS--GPAILLVNGEGTAPGLP-DGGFVG 125
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
+ A++ EHRYYGESMP T N + L YLT E ALAD F ++ +
Sbjct: 126 EYGKSVKAIIFSLEHRYYGESMPAPLT-----NRSMLKYLTVENALADLQAFKKYAEKKV 180
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
+ ++ GGSY G L+AW R KYP A +SS + D E F +
Sbjct: 181 VKKKVKWLIVGGSYAGALSAWARAKYPGDFDAAWSSSGVVNAIFDY---EAFDGHL---L 234
Query: 250 KRESASCFNTIKESWGELVSVGQKENGLLELTKTF 284
K +SC ++ +G+ N ++ KTF
Sbjct: 235 KVLPSSCAAAVRTVFGKFSKAYDNPNRRAKMMKTF 269
>gi|157167876|ref|XP_001656138.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108871004|gb|EAT35229.1| AAEL012590-PA [Aedes aegypti]
Length = 489
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/263 (29%), Positives = 122/263 (46%), Gaps = 23/263 (8%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRY 146
+FS RY+ N H+ R GPIFL+ G +E V G D+A LV E RY
Sbjct: 69 SFSMRYVTNNRHY---RRGGPIFLFVGGPWPLEAHLVEQGHFVDMAAEMNGFLVANELRY 125
Query: 147 YGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE-ASPVVLFGGSYGG 205
YGES+P +V+ N YL Q L++ A FI +LK+++ + + V+L G Y
Sbjct: 126 YGESIPV--EDVSRNN---FRYLHNVQILSELATFIAHLKEDVVRDPNAKVILAGVGYSA 180
Query: 206 MLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESA-SCFNTIKESW 264
LA WMR ++PH+ G +SS + + F +V + +R C++TI W
Sbjct: 181 SLAQWMRQRFPHLIHGVWSSSGMVRASTNY---REFAEVVGENIRRFGGDDCYSTI---W 234
Query: 265 GELVSVGQKENGLLELT--KTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPL 322
+ + L T + FH CR + D A+ LE + + + L
Sbjct: 235 RAFRTAENLIDAGLSTTVDELFHTCRPI----DAANALEVEAFFYGIFNEISREVVDADL 290
Query: 323 PGYPIREVCKKIDNAPDATSILE 345
G I+++C+ + + D S+LE
Sbjct: 291 RG-NIKQMCEPLTASDDENSLLE 312
>gi|157133206|ref|XP_001662800.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108870922|gb|EAT35147.1| AAEL012663-PA [Aedes aegypti]
Length = 485
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 78/263 (29%), Positives = 122/263 (46%), Gaps = 23/263 (8%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRY 146
+FS RY+ N H+ R GPIFL+ G +E V G D+A LV E RY
Sbjct: 69 SFSMRYVTNNRHY---RRGGPIFLFVGGPWPLEAHLVEQGHFVDMAAEMNGFLVANELRY 125
Query: 147 YGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE-ASPVVLFGGSYGG 205
YGES+P +V+ N YL Q L++ A FI +LK+++ + + V+L G Y
Sbjct: 126 YGESIPV--EDVSRNN---FRYLHNVQILSELATFIAHLKEDVVRDPNAKVILAGVGYSA 180
Query: 206 MLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESA-SCFNTIKESW 264
LA WMR ++PH+ G +SS + + F +V + +R C++TI W
Sbjct: 181 SLAQWMRQRFPHLIHGVWSSSGMVRASTNY---REFAEVVGENIRRFGGDDCYSTI---W 234
Query: 265 GELVSVGQKENGLLELT--KTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPL 322
+ + L T + FH CR + D A+ LE + + + L
Sbjct: 235 RAFRTAENLIDAGLSTTVDELFHTCRPI----DAANALEVEAFFYGIFNEISREVVDADL 290
Query: 323 PGYPIREVCKKIDNAPDATSILE 345
G I+++C+ + + D S+LE
Sbjct: 291 RG-NIKQMCEPLTASDDENSLLE 312
>gi|308491795|ref|XP_003108088.1| hypothetical protein CRE_10077 [Caenorhabditis remanei]
gi|308248936|gb|EFO92888.1| hypothetical protein CRE_10077 [Caenorhabditis remanei]
Length = 794
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 89/309 (28%), Positives = 135/309 (43%), Gaps = 34/309 (11%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI------EWF--AVN 124
+Q +DHFS T+ QRY N+ + +G +FL G EG I +W
Sbjct: 54 IDQVVDHFSNTTSATWRQRYQYNSKFY--NKTVGYVFLMLGGEGSINATNGDKWVRHEAE 111
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+ VW A FGA EHR+YG G + + Q +L LT +QALAD FI
Sbjct: 112 TMMVW--AAEFGAGAFQVEHRFYGSK---GFSPIGDQTTESLKLLTIDQALADIKEFINQ 166
Query: 185 LKQNLSAEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED----IVPPE 239
+ P+ + FGGSY G L+AW R YP + GA++SS+ + F D + E
Sbjct: 167 MNALYFPLDKPIWITFGGSYPGSLSAWFRETYPEMTAGAVSSSSAVHVFVDYYGYAINTE 226
Query: 240 TFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLE-LTKTFHLC---RELNSTED 295
Y VS SC + IK ++ ++ E L KTF+LC E N ++
Sbjct: 227 KTYRTVSD-------SCGDVIKTAFQQMQKKAYNGPDSRELLKKTFNLCDSFDENNLSKS 279
Query: 296 LADWLESAYSYLAMVD-YPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVY 354
+ + ++ Y Y ++ Y + G + C ++NA I + V +
Sbjct: 280 IQFFFQNVYGYFQGINQYTGDNRNNATRSGLGVPGACNILNNATLGDEITRVV--AVMDW 337
Query: 355 YNYTGNVDC 363
Y+ G+ C
Sbjct: 338 YDSWGSPGC 346
>gi|393238402|gb|EJD45939.1| peptidase S28 [Auricularia delicata TFB-10046 SS5]
Length = 535
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/170 (37%), Positives = 93/170 (54%), Gaps = 15/170 (8%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGP-IFLYCG-NEGDIEWFAVN 124
++ R+F Q LDHF A TF QRY ++ H++ P GP I L CG G+ ++
Sbjct: 45 KFPARWFRQPLDHFDRAKRDTFLQRYWVSDRHYL-PG--GPVIVLDCGETNGEDRLPFLD 101
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+G V +A + V EHRYYG S+P V +L +L +QA AD A F+ N
Sbjct: 102 TGIVDILAKATHGLGVVLEHRYYGSSVP-----VLNLTTDSLRWLNNKQAAADSATFMAN 156
Query: 185 LK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
++ +L+A +P + +GGSY G AA MR+ YP + GA+ASSA +
Sbjct: 157 VRFEGIDDDLTAPGTPWIYYGGSYAGGRAAHMRILYPDLVFGAIASSAAV 206
>gi|330837832|ref|XP_003292053.1| hypothetical protein DICPUDRAFT_39962 [Dictyostelium purpureum]
gi|325077726|gb|EGC31420.1| hypothetical protein DICPUDRAFT_39962 [Dictyostelium purpureum]
Length = 481
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 74/241 (30%), Positives = 124/241 (51%), Gaps = 27/241 (11%)
Query: 67 RYETRYFEQRLDHFSFADLP-TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ E + F Q++DH++F TF QR+ ++++ G GPIF G E +++ +N+
Sbjct: 45 KLEYKLFNQKIDHYNFQHGNLTFKQRFFEYSNYYDGN---GPIFFVFGPEQELKEDYINN 101
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESM--PYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
+ A A ++ EHRYYG+S+ + +TE +L YL ++QA+AD A FIT
Sbjct: 102 RQYEEWAKTLNASIICLEHRYYGKSIFTDHLTTE-------SLQYLNSDQAIADVAYFIT 154
Query: 184 -NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI------LQFEDIV 236
K+N + V FG SYGG +AA ++KYPH+ ++SS P+ Q+ +IV
Sbjct: 155 WYKKENKIDDGKRWVGFGASYGGTIAAQFKIKYPHLIDIIVSSSGPVSPELNFFQYLEIV 214
Query: 237 PPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
N + S+ ++ C I+ + E+ + + N L L F LC L + +D
Sbjct: 215 Q-----NTIISEV-QDGERCVENIRNATLEIEEIIKFGNHNL-LKDKFRLCAPLENEKDF 267
Query: 297 A 297
+
Sbjct: 268 S 268
>gi|409079210|gb|EKM79572.1| hypothetical protein AGABI1DRAFT_114101 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 552
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 63/169 (37%), Positives = 96/169 (56%), Gaps = 17/169 (10%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNS 125
+E ++FEQ LDHF ++ TF QRY ++ H+ + P+F+ G E G +++
Sbjct: 64 FEPQWFEQPLDHFDESNPHTFKQRYWVSKRHYKA-RQGAPVFVLDGGETSGANRLPFLDT 122
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT--LSYLTAEQALADFAVFIT 183
G V +A + V EHRYYGES +A +N TT L +L EQA AD A F+
Sbjct: 123 GIVDILARATEGLGVILEHRYYGES-------IAVENLTTDALRWLNNEQAAADSANFMA 175
Query: 184 NLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
+K ++L+A ++P + +GGSY G +A M++ YP + GA+ASSA
Sbjct: 176 KVKFEGIEEDLTAPSTPWIYYGGSYAGARSAHMKILYPDLVYGAIASSA 224
>gi|2961378|emb|CAA18125.1| putative protein [Arabidopsis thaliana]
gi|7270572|emb|CAB80290.1| putative protein [Arabidopsis thaliana]
Length = 852
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 85/263 (32%), Positives = 122/263 (46%), Gaps = 41/263 (15%)
Query: 82 FADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVF 141
F D F QRY DH P+ GPIFL EG + + ++ +A +F A +V
Sbjct: 68 FQDHRKFRQRYYEYLDHLRVPD--GPIFLMICGEGPCN--GITNNYISVLAKKFDAGIVS 123
Query: 142 PEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF-----------------ITN 184
EHRYYG+S P+ S +A +N L YL+++QAL+D A F N
Sbjct: 124 LEHRYYGKSSPFKS--LATKN---LKYLSSKQALSDLATFRQYYQATCFGICLWMQDSLN 178
Query: 185 LKQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
+K N S+ +P FG SY G L+AW RLK+PH+ G+LASSA + + PE
Sbjct: 179 VKFNRSSNVENPWFFFGVSYSGALSAWFRLKFPHLTCGSLASSAVVRAVYEF--PEFDQQ 236
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
I S C ++E+ +L+ +G K N K EL+ D +L +
Sbjct: 237 IAES----AGPECETALQET-NKLLELGLKVNN--RAVKALFNATELDVDADFL-YLIAD 288
Query: 304 YSYLAM----VDYPYPSDFMMPL 322
+AM + Y P +PL
Sbjct: 289 AGVMAMFIWQIQYGNPDKLCVPL 311
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/159 (38%), Positives = 86/159 (54%), Gaps = 22/159 (13%)
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPE 143
D F QRY DH P+ GPIF+ EG + + ++ +A +F A +V E
Sbjct: 501 DHREFKQRYYEYLDHLRVPD--GPIFMMICGEGPCN--GIPNDYITVLAKKFDAGIVSLE 556
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF------------ITNLKQNLSA 191
HRYYG+S P+ S +A +N L YL+++QAL D A F N+K N S
Sbjct: 557 HRYYGKSSPFKS--LATEN---LKYLSSKQALFDLAAFRQYYQASTSLMDSLNVKFNRSG 611
Query: 192 EA-SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+ +P FG SY G L+AW RLK+PH+ G+LASSA +
Sbjct: 612 DVENPWFFFGASYSGALSAWFRLKFPHLTCGSLASSAVV 650
>gi|160332814|emb|CAL69923.1| hypothetical protein [Plasmodiophora brassicae]
gi|162138601|emb|CAP58027.1| hypothetical protein [Plasmodiophora brassicae]
Length = 467
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 66/178 (37%), Positives = 92/178 (51%), Gaps = 12/178 (6%)
Query: 61 QQQQQYRYETRYFEQRLDHFSFADLPT-FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE 119
Q + E + +Q +DH + FSQRY W GP+ GP+ L E
Sbjct: 15 QGHAYVQTEHVFPDQLVDHTAIGGGGARFSQRYFRIDQFWSGPD--GPVILQLCGEYTCA 72
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
F +A R+GA+++ EHRY+G+S P+ + + L+YLT QAL+D A
Sbjct: 73 GVTDGRQFPSALAERYGALVLVLEHRYFGKSSPF-----SVLSPRNLTYLTTFQALSDIA 127
Query: 180 VFITNLKQNL---SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
F T+ Q + A A+ + GGSY G LAAW RLKYPH+ GALASSA + F +
Sbjct: 128 CF-TDWYQRVHIGRANANKWITIGGSYPGALAAWYRLKYPHLTAGALASSAVVAPFAE 184
>gi|395331113|gb|EJF63495.1| peptidase S28 [Dichomitus squalens LYAD-421 SS1]
Length = 574
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 67/187 (35%), Positives = 99/187 (52%), Gaps = 20/187 (10%)
Query: 54 PPQRQQRQQQQQYRYETRYFEQRLDHFSF-----ADLPTFSQRYLINTDHWV-GPNRLGP 107
P Q + + + R+FEQ LDHF+ A+ T+ QRY +NT H+V GP+ P
Sbjct: 69 PVSNAQAESKWGGVFPERWFEQPLDHFAEGKGAQAETETWRQRYWVNTRHYVPGPD--AP 126
Query: 108 IFLYCGNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT 165
+F+ G E G+ +++G +A G + V EHRYYGES P V +
Sbjct: 127 VFVIDGGETSGEDRLGFLDTGIADILARATGGVGVVLEHRYYGESRP-----VKNLTTDS 181
Query: 166 LSYLTAEQALADFAVFITNLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAI 220
L +L Q+ AD A F+ N+K ++L+A P + +GGSY G AA M++ YP +
Sbjct: 182 LRFLNNAQSAADSANFMANVKFPGIDEDLTAPNHPWIYYGGSYAGARAAHMKVLYPDLVW 241
Query: 221 GALASSA 227
GA+ASS
Sbjct: 242 GAVASSG 248
>gi|145490518|ref|XP_001431259.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124398363|emb|CAK63861.1| unnamed protein product [Paramecium tetraurelia]
Length = 446
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/163 (37%), Positives = 85/163 (52%), Gaps = 9/163 (5%)
Query: 71 RYFEQRL-DHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVW 129
R+FE +L DH+ + TF QRY + + P +F CG E F
Sbjct: 25 RWFEHQLIDHYDRTNTQTFRQRYW-TVEEYFQPEGGAVLFWICG-EYTCPGIRKERLFPV 82
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
++A A++V EHRYYG+SMP+ + +N L YL QAL D A F ++ Q
Sbjct: 83 ELAQTHKALIVVLEHRYYGKSMPFDEDALRLEN---LKYLGIRQALDDLAYFQLHIVQGK 139
Query: 190 ---SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
E+ P + GGSY G +AAW R +YPH+ +GALASSA +
Sbjct: 140 FFGVRESHPWIAIGGSYPGAMAAWYRYQYPHLVVGALASSAVV 182
>gi|170045808|ref|XP_001850486.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868714|gb|EDS32097.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 501
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 82/158 (51%), Gaps = 9/158 (5%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
F R+DHF+ + TF Y N D + P GPIF++ G + + + G DIA
Sbjct: 58 FRTRVDHFNPQNRDTFELAYYSN-DEFYRPG--GPIFIFVGGNWAVNPYFIERGHFPDIA 114
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
GA + EHRYYG S P V + L +LT EQA+ D A I +L+ N+ +
Sbjct: 115 YMEGAWMFTNEHRYYGTSFP-----VEDLSTPNLRFLTVEQAMVDLAELIYHLRHNVVRD 169
Query: 193 ASP-VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+ VVL G YGG +A WMR +YPH+ G+ SS +
Sbjct: 170 DNARVVLLGMGYGGAIATWMRQRYPHLVDGSWVSSGQV 207
>gi|426196115|gb|EKV46044.1| hypothetical protein AGABI2DRAFT_193941 [Agaricus bisporus var.
bisporus H97]
Length = 552
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/169 (37%), Positives = 96/169 (56%), Gaps = 17/169 (10%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNS 125
+E ++FEQ LDHF ++ TF QRY ++ H+ + P+F+ G E G +++
Sbjct: 64 FEPQWFEQPLDHFDESNPHTFKQRYWVSKRHYKA-RQGAPVFVLDGGETSGANRLPFLDT 122
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT--LSYLTAEQALADFAVFIT 183
G V +A + V EHRYYGES +A +N TT L +L EQA AD A F+
Sbjct: 123 GIVDILARATEGLGVILEHRYYGES-------IAVENLTTDALRWLNNEQAAADSANFMA 175
Query: 184 NLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
+K ++L+A ++P + +GGSY G +A M++ YP + GA+ASSA
Sbjct: 176 KVKFNGIEEDLTAPSTPWIYYGGSYAGARSAHMKILYPDLVYGAIASSA 224
>gi|123399993|ref|XP_001301579.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121882777|gb|EAX88649.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 496
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/157 (38%), Positives = 85/157 (54%), Gaps = 9/157 (5%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
F Q+LDHF + TF+QRY T + + +F Y G E + + S D+A
Sbjct: 8 FTQKLDHFDASSQETFNQRYYKITKNSTA--NVSALFFYIGGEAPLIGKRMLSLAPVDLA 65
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
+ A+L EHR++G S P T + +N L YLT EQ LAD A FI +KQ+
Sbjct: 66 EKNNAVLFGLEHRFFGNSAP---TNLTIEN---LKYLTIEQGLADLAHFINAMKQDYDHT 119
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+ + GGSY G L++W RL YPH+A + ASSAP+
Sbjct: 120 VR-IGVIGGSYPGALSSWFRLLYPHLADVSWASSAPV 155
>gi|389740212|gb|EIM81403.1| peptidase S28 [Stereum hirsutum FP-91666 SS1]
Length = 545
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 72/215 (33%), Positives = 111/215 (51%), Gaps = 17/215 (7%)
Query: 56 QRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWV-GPNRLGPIFLYCGN 114
+R + + ++ + F Q LDHF + TF QRY ++T H+ G N P+ + G
Sbjct: 53 KRTKGEVGKRATFPQYNFTQPLDHFYGSTNGTFPQRYWVSTRHYTPGSNATVPVIVLDGG 112
Query: 115 E--GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAE 172
E G+ +++G V +A G + V EHRYYG+S+ G + + N L +L E
Sbjct: 113 ETSGEDRLPYLDTGIVDILAEATGGVGVVLEHRYYGDSV--GVPDFSTDN---LRWLNNE 167
Query: 173 QALADFAVFITNLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASS- 226
QAL D A F+ N+K ++L+A +P + FGGSY G AA M++ YP I GA+ASS
Sbjct: 168 QALEDSANFMRNVKFEGIDEDLTAPGTPWIYFGGSYAGARAAHMKVLYPDIVYGAIASSG 227
Query: 227 ---APILQFEDIVPPETFYNIVSSDFKRESASCFN 258
A I +E + F + SD ++ S +
Sbjct: 228 VTHAAITNWEYMDVIRQFATVECSDNLVQTVSTVD 262
>gi|170040233|ref|XP_001847911.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167863799|gb|EDS27182.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 512
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 77/247 (31%), Positives = 131/247 (53%), Gaps = 29/247 (11%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN-SGFVWD 130
+F R+DHF+ + ++ RYL TD W P GPI ++ G I+ + V+ S ++D
Sbjct: 66 FFTTRIDHFNAQNTDEWTLRYLAVTD-WYQPG--GPILIWLGGYMPIQPYMVDESSLIYD 122
Query: 131 IAPRF-GAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN- 188
+A GA+ F E RY+G+S + + +V+ +N L +L A+Q LAD A F+ LK++
Sbjct: 123 MAREMHGAVYAF-ETRYFGQS--WITEDVSTEN---LRFLNADQVLADLAEFVAYLKRDV 176
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA---PILQFEDIVPP--ETFYN 243
L E + V++ G YGG LA W R++YPH++ A +SS ++ F++ +T +
Sbjct: 177 LRNEYAHVLVSGVGYGGSLATWFRVRYPHLSDAAWSSSGVHNALVDFQEFAEDWGQTLID 236
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
S + E FN ++ L+ G+++ L + LC E+ D D LE +
Sbjct: 237 FGSQECYNEIFVAFNVMQ----NLIDAGRED----ILYERLDLCDEI----DTEDRLEVS 284
Query: 304 YSYLAMV 310
+ +L M+
Sbjct: 285 FFFLTMM 291
>gi|410040428|ref|XP_003950808.1| PREDICTED: thymus-specific serine protease [Pan troglodytes]
Length = 541
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 89/295 (30%), Positives = 133/295 (45%), Gaps = 51/295 (17%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HWVG + GPIFL G EG + +V G +
Sbjct: 61 WLEQLLDPFNVSDRRSFLQRYWVNDQHWVGQD--GPIFLLLGGEGSLGPGSVMRGHPAAL 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA----------------- 174
AP +GA+++ EHR+YG S+P G E+A L +L++ A
Sbjct: 119 APAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRLAMGKSSGIPSDEDRPSPP 173
Query: 175 ----LADFAVFITNLKQ--NLSAEASPVVLFGGSYGGMLAAWMRLK------YPHIAIGA 222
LAD L + N+S+ +SP + FGGSY G LAAW RLK +PH+ +
Sbjct: 174 FDPRLADVVSAHLALSRLFNISS-SSPWICFGGSYAGSLAAWARLKVLRLLRFPHLIFAS 232
Query: 223 LASSAPIL------QFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENG 276
+ASSAP+ ++ D+V + + +A E L S G +
Sbjct: 233 VASSAPVRAVLDFSEYNDVVSRSLMSTAIGGSLECRAAVSV-AFAEVERRLRSGGAAQAA 291
Query: 277 LLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVC 331
L C L E+ A+ L + + + V Y PL +R++C
Sbjct: 292 ---LQTELSACGPLGRAENQAELLGALQALVGGV-VQYDGQTGAPL---SVRQLC 339
>gi|330803268|ref|XP_003289630.1| hypothetical protein DICPUDRAFT_154024 [Dictyostelium purpureum]
gi|325080278|gb|EGC33840.1| hypothetical protein DICPUDRAFT_154024 [Dictyostelium purpureum]
Length = 226
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 87/152 (57%), Gaps = 13/152 (8%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS--GFV 128
++F Q +DHF+ + TF QRYLIN ++ G GP+F+ EG + V V
Sbjct: 63 QWFTQNVDHFNIVNTDTFQQRYLINDQYYDGT---GPVFIMINGEGPMGLDTVTGLQFVV 119
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
W A + A++V EHRYYG S + +++++ N L +L ++QALAD AVF + Q
Sbjct: 120 W--AKQLNALIVSLEHRYYGAS--FVTSDLSLDN---LQFLNSQQALADNAVFREFIAQK 172
Query: 189 LSAEASPV-VLFGGSYGGMLAAWMRLKYPHIA 219
+ ++ V FGGSY G L +W R+KYPH+
Sbjct: 173 YNIPSTTKWVSFGGSYSGALTSWFRIKYPHLV 204
>gi|195109612|ref|XP_001999377.1| GI23098 [Drosophila mojavensis]
gi|193915971|gb|EDW14838.1| GI23098 [Drosophila mojavensis]
Length = 386
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 103/211 (48%), Gaps = 14/211 (6%)
Query: 91 RYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGES 150
RYL+N V GP+F+Y G E I + +G ++D+A +L + EHRYYGES
Sbjct: 2 RYLLND---VFFKAGGPMFIYLGGEWTISNGFITAGHMYDMAKEHNGLLAYTEHRYYGES 58
Query: 151 MPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEA-SPVVLFGGSYGGMLAA 209
P +++ +N L +L +QALAD A FI K + + S V++ GGSY +
Sbjct: 59 HPL--PDLSNEN---LRFLHVKQALADLAHFIKTQKASYEGLSDSKVIIVGGSYSATMVT 113
Query: 210 WMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKESWGELV 268
W + YP + +G ASSAP++ + + ++ S C++ I++ EL
Sbjct: 114 WFKRTYPDLVVGGWASSAPVVAKVNFFE---YKEVMGESITLMGGSACYDRIEKGIAELE 170
Query: 269 SVGQKENGLLELTKTFHLCRELNSTEDLADW 299
++ + G E+ LC + DL W
Sbjct: 171 TMFANKRG-AEVKALLKLCEPFDVNNDLDIW 200
>gi|268552333|ref|XP_002634149.1| Hypothetical protein CBG01710 [Caenorhabditis briggsae]
Length = 507
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 88/319 (27%), Positives = 141/319 (44%), Gaps = 37/319 (11%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE-----WFAVNSGFVW 129
Q +DHF+ T+ QRY N+ + +G +FL G EG I+ W +
Sbjct: 34 QYVDHFANNTSATWLQRYQYNSKFY--NKTVGYVFLMLGGEGAIDPPGDKWVRHEGETMM 91
Query: 130 DIAPRFGAMLVFPEHRYYG--ESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
A FGA EHR+YG E P G Q +L LT +QALAD FI + +
Sbjct: 92 VWAKEFGAAAFQVEHRFYGSKEFSPLGD-----QTTESLKLLTIDQALADIKEFINQMNK 146
Query: 188 NLSAEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED----IVPPETFY 242
P+ + FGGSY G L+AW R YP + GA++SS+ + F D + E Y
Sbjct: 147 MYFPNDKPIWITFGGSYPGSLSAWFRETYPEMTAGAVSSSSAVHVFVDYYGYAINTEKTY 206
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK-TFHLCREL---NSTEDLAD 298
V SA C I+ ++ E+ + L K TF+LC + N ++ +
Sbjct: 207 RTV-------SAPCAEVIRTAFTEIQKKAYNGSDARALLKGTFNLCDDFDENNLSKSVQF 259
Query: 299 WLESAYSYLAMVD-YPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNY 357
+ ++ Y Y ++ Y + G + C ++NA + +R+ ++ Y ++
Sbjct: 260 FFQNIYGYFQGINQYTGDNRNTATRNGLGVPGACAILNNATIGEEV-DRVVAVLNWYDSF 318
Query: 358 TGNVDCFQLDDDPHGLDGW 376
+G + C P+ DG+
Sbjct: 319 SGPLTC-----RPNSYDGF 332
>gi|340387216|ref|XP_003392103.1| PREDICTED: lysosomal Pro-X carboxypeptidase-like, partial
[Amphimedon queenslandica]
Length = 159
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 47/134 (35%), Positives = 82/134 (61%), Gaps = 8/134 (5%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
Y T Y+ Q+LD+F+ D TF+QR L +W + + Y GNE I+ F N+GF
Sbjct: 34 YTTSYYTQQLDNFNSNDKRTFNQRILTAKQYW----KNDVLLFYPGNEAPIDEFYNNTGF 89
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
++++A RF A++VF EHRYYG+++P+G + ++YL+ QA+ADF+ + +++
Sbjct: 90 LFELAERFQALVVFAEHRYYGDTLPFGPQDTF--TPANMAYLSVGQAMADFSRLVQDIRD 147
Query: 188 NLSAEASPVVLFGG 201
++ + V++ GG
Sbjct: 148 KMNIKK--VIVIGG 159
>gi|320165589|gb|EFW42488.1| serine carboxypeptidase S28 [Capsaspora owczarzaki ATCC 30864]
Length = 491
Score = 95.1 bits (235), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 61/167 (36%), Positives = 85/167 (50%), Gaps = 19/167 (11%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
F Q +DHF+ TF QRY NTD++ N GPIFLY E + + ++ ++
Sbjct: 56 FNQLIDHFNPQHRETFKQRYFENTDNFDPVN--GPIFLYICGEATCG--GIPNDYIRVLS 111
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF----------I 182
+F A +V EHRYYGES P+ A L YLT+ QA+ D A F +
Sbjct: 112 KQFNAAIVTLEHRYYGESSPF-----AQLTTPNLQYLTSRQAINDLAAFRDFYQHNVVDV 166
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+Q + +G SY G L+AW RLK+PH+ G+LASS +
Sbjct: 167 RYAQQRAGRGDNLWFTYGVSYSGALSAWFRLKFPHLTAGSLASSGVV 213
>gi|344268156|ref|XP_003405928.1| PREDICTED: putative serine protease K12H4.7-like [Loxodonta
africana]
Length = 521
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 61/201 (30%), Positives = 105/201 (52%), Gaps = 15/201 (7%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYC-GNE-GDIEWFAVNSGFVW- 129
F QR P + +Y +N D + P GP+FL G+E I+W + F W
Sbjct: 24 FLQRKTRGEVMGKPGGASKYYVNYDFY-KPG--GPVFLMIEGHEPASIQW--LKRSFTWI 78
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
A R GA+ + EHR+YG+S P + + YL++ QA+AD A F T + Q++
Sbjct: 79 TYAQRLGALCILLEHRFYGDSQPIRNMSTEHLR----RYLSSRQAVADIAEFRTVIAQSM 134
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
+ + V+FGG YGG LA W R+K+P++ A++SSA I + V ++ ++
Sbjct: 135 NFTENKWVVFGGGYGGALAVWSRIKHPNLFAAAVSSSAMI---QAKVNFNEYFEVIYRTV 191
Query: 250 KRESASCFNTIKESWGELVSV 270
++ C +K+++G ++++
Sbjct: 192 DTHNSECLEAVKQAYGFVMAM 212
>gi|402220386|gb|EJU00458.1| peptidase S28 [Dacryopinax sp. DJM-731 SS1]
Length = 504
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 65/162 (40%), Positives = 94/162 (58%), Gaps = 16/162 (9%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE--WFAVNSGFVWD 130
F Q LDHFS + P ++QRY +N ++ P GP+FL+ EG E + +++G V
Sbjct: 41 FLQPLDHFS-SQSPQWAQRYWLNARYYT-PG--GPVFLFDTGEGPGEDRFGVLDTGIVAI 96
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK---- 186
+A G M V EHRYYG+SMP V+ + +L +L QA AD A F+ ++
Sbjct: 97 LARETGGMAVVLEHRYYGQSMP-----VSNLSTDSLRFLNNAQAAADSANFMRSVHFPGV 151
Query: 187 -QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
+++SA P + +GGSYGG AA MR+ YP + GA+ASSA
Sbjct: 152 DEDVSALNRPWIYYGGSYGGARAAHMRVLYPELVWGAIASSA 193
>gi|125576268|gb|EAZ17490.1| hypothetical protein OsJ_33021 [Oryza sativa Japonica Group]
Length = 259
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 43/112 (38%), Positives = 67/112 (59%), Gaps = 7/112 (6%)
Query: 274 ENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKK 333
+ G L +TF++C+ + +D+ +E A Y +M+DYP PS+F+ LP YP+RE+C+
Sbjct: 8 DAGRARLNQTFNMCK--GNVDDIPGLVEKALIYGSMMDYPTPSNFLTSLPAYPVREICRA 65
Query: 334 IDNAPDATSILERIFEGVSVYYNYTGNVDCF----QLDDDPHGL-DGWNWQV 380
ID + RI + +++YYN TG + CF DDDP+G+ GW WQ
Sbjct: 66 IDKPTSGNDTVSRIKDAMTIYYNSTGGLACFPGAGAEDDDPYGMFPGWTWQA 117
>gi|299119157|gb|ADJ11381.1| GA15377 [Drosophila affinis]
Length = 167
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 47/133 (35%), Positives = 75/133 (56%), Gaps = 5/133 (3%)
Query: 253 SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DWLESAYSYLAM 309
+++C I SW ++G E G +++ FHLC+ + + DL D++E Y+ LAM
Sbjct: 9 NSNCTANIGRSWKTFETLGGTEAGKKQISDAFHLCQPIKNDADLKKFLDYIEEVYANLAM 68
Query: 310 VDYPYPSDFMMPLPGYPIREVCKKI-DNAPDATSILERIFEGVSVYYNYTGNVDCFQLDD 368
V+YPY S F+ PLP YP+R+VC + D + +L + ++VY NYTG+ C + D
Sbjct: 69 VNYPYNSSFLAPLPAYPVRQVCFYLKDLHQNDADLLHAMASALAVYTNYTGSAKCLDISD 128
Query: 369 DPHGLD-GWNWQV 380
+ + GWN Q
Sbjct: 129 TSNADEAGWNVQT 141
>gi|449540369|gb|EMD31362.1| hypothetical protein CERSUDRAFT_89238 [Ceriporiopsis subvermispora
B]
Length = 520
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 63/162 (38%), Positives = 91/162 (56%), Gaps = 16/162 (9%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWF--AVNSGFVWD 130
F Q LDHF + TF QRY ++T H+V P GPI ++ G E E +++G V
Sbjct: 58 FTQPLDHFEDTGV-TFEQRYWVSTRHYV-PG--GPIVVFDGGEASAEERLPILDTGIVDI 113
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK---- 186
+ G + + EHRYYG S+ G T N L +L +QAL D AVF+TN++
Sbjct: 114 LTNATGGLGIILEHRYYGASV--GVTNFTTDN---LRWLNNDQALEDSAVFMTNVQIPGI 168
Query: 187 -QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
+N++A P + +GGSY G +A M+++YP I GA+ASS
Sbjct: 169 SENITAPGRPWIYYGGSYAGARSAIMKVRYPDIVYGAIASSG 210
>gi|409040075|gb|EKM49563.1| hypothetical protein PHACADRAFT_265104 [Phanerochaete carnosa
HHB-10118-sp]
Length = 561
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 98/174 (56%), Gaps = 20/174 (11%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWF 121
Q +E +F Q L+HFS + T+ QRY INT H+ P P+ + G E G+
Sbjct: 60 QPAEFEPHWFRQPLNHFS-NNSETWLQRYWINTRHY-KPGTHAPVIVIDGGETSGENRLP 117
Query: 122 AVNSGFVWDIAPR-FGAMLVFPEHRYYGESMPYGSTEVAYQNATT--LSYLTAEQALADF 178
+++G DI P+ G + V EHRY+GES+P QN TT L +L +QA AD
Sbjct: 118 FLDTGIA-DILPKEIGGIGVILEHRYHGESLPV-------QNFTTDSLRFLNNDQAAADS 169
Query: 179 AVFITNLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
A F+ N+K +++++ +P + +GGSY G +A M++ YP + GA+ASSA
Sbjct: 170 ANFMANVKFPGVDEDITSPGNPWIYYGGSYAGARSAHMKMLYPELVYGAIASSA 223
>gi|392591874|gb|EIW81201.1| peptidase S28 [Coniophora puteana RWD-64-598 SS2]
Length = 570
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/168 (36%), Positives = 91/168 (54%), Gaps = 14/168 (8%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVN 124
+ ++F Q +DHFS D TF+QRY +N H+ P GP+ + G E G+ ++
Sbjct: 85 EFPDQWFNQPVDHFS-NDSATFAQRYWVNARHYT-PGAGGPVIVLDGGETSGEDRLPFLD 142
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+G V + G + V EHRYYG+S+P V + L +L QA AD A F+
Sbjct: 143 TGIVEILTRVTGGVGVVLEHRYYGKSIP-----VPNFSTDNLRWLNNAQAAADSANFMAT 197
Query: 185 LK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
+K ++L+A P + +GGSY G AA MR+ YP + GA+ASSA
Sbjct: 198 VKFDGIDEDLTAPEVPWIYYGGSYAGARAAHMRVMYPELVFGAIASSA 245
>gi|119497857|ref|XP_001265686.1| serine peptidase, putative [Neosartorya fischeri NRRL 181]
gi|119413850|gb|EAW23789.1| serine peptidase, putative [Neosartorya fischeri NRRL 181]
Length = 525
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 91/323 (28%), Positives = 140/323 (43%), Gaps = 43/323 (13%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE---GDIEWFAVNSGFVW 129
F+Q +DH + L TFSQRY N ++W GP P+ L+ E D + F N V
Sbjct: 45 FDQYIDHNN-PGLGTFSQRYWYNPEYWAGPGS--PVVLFTPGESDAADYDGFLTNKTIVG 101
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK--- 186
A G ++ EHRY+G S PY + A TL YLT EQ++AD F +
Sbjct: 102 RFAEEIGGAVILLEHRYWGASSPYPNL-----TAETLQYLTLEQSIADLVHFAKTVNLPF 156
Query: 187 ---QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
+ +A+ +P V+ GGSY G LAAW P ASSAP+ D ++
Sbjct: 157 DEHHSSNADNAPWVMTGGSYSGALAAWTASIAPGTFWAYHASSAPVQAIYDFW---QYFV 213
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
V + + N + E + + G E E+ + F L L +D A
Sbjct: 214 PVVEGMPKNCSKDVNRVVEYIDHVYASGDIER-QQEIKEMFGL-GALKHFDDFA------ 265
Query: 304 YSYLAMVDYPYPSDFMMPLPGYP-IREVCKKIDN-APDATSI-------LERIFEGVSVY 354
A+ + P+ M + GY + C ++N P A S+ LE+ +G + +
Sbjct: 266 ---AAITNGPWLWQDMNFVSGYSRFYKFCDAVENVTPGAKSVPGPEGVGLEKALQGYASW 322
Query: 355 YNYT---GNVDCFQLDDDPHGLD 374
+N T G+ ++ D + +D
Sbjct: 323 FNSTYLPGSCAKYKYWTDKYAVD 345
>gi|390461155|ref|XP_002746166.2| PREDICTED: thymus-specific serine protease [Callithrix jacchus]
Length = 521
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 79/258 (30%), Positives = 118/258 (45%), Gaps = 53/258 (20%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+ EQ LD F+ +D +F QRY +N HW + GPIFL+ G EG + +V G +
Sbjct: 59 WLEQLLDPFNVSDRRSFLQRYWVNDQHWASQD--GPIFLHLGGEGSLGPGSVMKGHPAAL 116
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA----------------- 174
AP GA+++ EHR+YG S+P G ++A L +L++ A
Sbjct: 117 APACGALVISLEHRFYGLSVPAGGLDMA-----QLRFLSSRHAVGKSSGIPSDEDRPSLP 171
Query: 175 ----LADFAVFITNLKQ--NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAP 228
LAD L + N+S+ +SP + FGGSY G LAAW RLK +G
Sbjct: 172 SDPRLADVVSARLALSRLFNVSS-SSPWICFGGSYAGSLAAWARLKV----LG------- 219
Query: 229 ILQFEDIVPPETFYNIVSSDFKRE----SASCFNTIKESWGELVSVGQKENGLLE--LTK 282
+L+F ++ F ++VS K S C + ++ E V + G + L
Sbjct: 220 LLRFPHLI----FASVVSRSLKSAAIGGSMECRAAVSAAFSE-VERRLRTGGAAQAALRA 274
Query: 283 TFHLCRELNSTEDLADWL 300
C L+ ED A+ L
Sbjct: 275 ELSACGSLSRAEDQAELL 292
>gi|224006151|ref|XP_002292036.1| hypothetical protein THAPSDRAFT_29166 [Thalassiosira pseudonana
CCMP1335]
gi|220972555|gb|EED90887.1| hypothetical protein THAPSDRAFT_29166 [Thalassiosira pseudonana
CCMP1335]
Length = 553
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 70/234 (29%), Positives = 110/234 (47%), Gaps = 27/234 (11%)
Query: 72 YFEQRLDHFSFADLPT-----FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
Y +Q ++HF T +S RY +T ++ GP PIFL G EG A++SG
Sbjct: 91 YADQLVNHFHTDRSITPKDAKWSNRYYQSTKYYKGPG--SPIFLIVGGEG-----ALDSG 143
Query: 127 FVW-----DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF 181
++ +A RFGA ++ EHR+YG P E L T +QALAD
Sbjct: 144 ILYPFVSEHLARRFGAAVIQIEHRFYGPFQPIVGREATVLELLELL--TPQQALADMVQL 201
Query: 182 ITNLKQNL--------SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
+ K+ L S + PV+ GGSY G L+A RL YP + ASSAP+ ++
Sbjct: 202 TKHFKELLGCSEFDRHSKKYCPVISVGGSYPGFLSAMFRLVYPDFVDISYASSAPLKLYD 261
Query: 234 DIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC 287
+Y+IV+ + S C +++++ E + K +++ K+ +C
Sbjct: 262 QTANQNVYYDIVTKAAEHTSPGCAKSVRDALEEASELILKAPSVIDAVKSMSMC 315
>gi|341874575|gb|EGT30510.1| hypothetical protein CAEBREN_03046 [Caenorhabditis brenneri]
Length = 517
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 81/308 (26%), Positives = 134/308 (43%), Gaps = 34/308 (11%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE--WFAVNSGFVWDIA 132
Q++D+F ++ T+ QRY N+++ N IFL E W + A
Sbjct: 63 QKVDNFDASNAATYKQRYWYNSNYTQNNNI---IFLMIQGESPATDLWITKPTYQYLQWA 119
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
FGA + EHR +G S PY T ++ T QALAD FI + +
Sbjct: 120 KEFGADVFQLEHRCFGNSRPYPDTSYP-----SIKVCTMTQALADIHSFIQQMNLQHNFR 174
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRE 252
+ FGGSY G L+A R KYP +GA+ASSAP+ D Y +V D R+
Sbjct: 175 NPKWITFGGSYPGTLSALFRQKYPQDTVGAVASSAPLDWTLDFFE----YAMVVEDVLRQ 230
Query: 253 SAS-CFNTIKESWGELVSVGQKENGLLELTKTFHL----CRELNSTEDLADWLESAYS-Y 306
+++ C+ + +++ + + +G+ +L F+L + D+ ++ + YS Y
Sbjct: 231 TSTDCWKNVNQAFTNMQQLSLTVDGIQKLNTYFNLDPPFVAGKYTQHDIDNFFANVYSFY 290
Query: 307 LAMVDYPYPSDFMMPLPGYPIREVCKKID--NAPDATSILERIFEGVSVYYNYTGNVDCF 364
++ Y Y L G + +C K++ N PD + + ++
Sbjct: 291 QGVIQYTYDGRNAATLGGLNAQNLCAKMNDANVPDVITRVNNTINWIN------------ 338
Query: 365 QLDDDPHG 372
QL+ DP G
Sbjct: 339 QLNGDPVG 346
>gi|123435014|ref|XP_001308906.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121890608|gb|EAX95976.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 527
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 84/165 (50%), Gaps = 12/165 (7%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E + F+ R+DHF D + +R+L N LY G E + V +G
Sbjct: 22 EYKIFQNRIDHFDTHDSSYYMERFLENLT--FVNKTFKKALLYIGGESTLSPRYVQAGSY 79
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGS-TEVAYQNATTLSYLTAEQALADFAVFITN--- 184
++A R A + EHR++G+SMP+ T+ Y+ YLT QALAD A FI
Sbjct: 80 LELAARENAAVFALEHRFFGKSMPFDQLTKENYK------YLTIPQALADLAEFIERYIY 133
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+ V + GGSY G L++W RLKYPH+A+ + ASSAP+
Sbjct: 134 THHLADQDGVTVAVVGGSYPGALSSWFRLKYPHLAVASWASSAPV 178
>gi|312080063|ref|XP_003142441.1| hypothetical protein LOAG_06857 [Loa loa]
Length = 109
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 43/96 (44%), Positives = 64/96 (66%), Gaps = 4/96 (4%)
Query: 61 QQQQQYRYETRYFEQR-LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE 119
+ Y ++ ++++ +DHF++ +L TF +YL+N ++ N GP+F Y GNEGDIE
Sbjct: 1 MEAMNYEWDIKWYQSMPIDHFNYRNLDTFGLKYLVNYSYF---NCDGPLFFYAGNEGDIE 57
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGS 155
FA +G +WD+AP F A +VF EHRYYGES P+G
Sbjct: 58 TFAQMTGIMWDLAPLFNAAIVFAEHRYYGESQPFGK 93
>gi|17566516|ref|NP_507841.1| Protein PCP-4 [Caenorhabditis elegans]
gi|11064521|emb|CAC14390.1| Protein PCP-4 [Caenorhabditis elegans]
Length = 1042
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 81/276 (29%), Positives = 129/276 (46%), Gaps = 24/276 (8%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGN-EGDIEWFAVNSG 126
+E F QR DHF ++ F Q++ N+ W P GP FL G E + E + +N
Sbjct: 545 FEQGTFRQRQDHFDNLNVDFFQQKFYKNS-QWARPG--GPNFLMIGGQEAEGESWVLNEK 601
Query: 127 FVWDI-APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
W I A ++GA + EHR+YG+S+ N T L+ L++ Q L D A FI +
Sbjct: 602 LPWLISAQKYGATVYLLEHRFYGDSL--------VGNNTNLNLLSSLQVLYDSAEFIKAI 653
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ ++P + FG S+ L+AW R +P + GA++SS IL D + ++
Sbjct: 654 NYK-TQSSTPWITFGRSFP--LSAWTRAIFPDLVTGAVSSSGAILAKTDFFE---YLMVM 707
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL--NSTE-DLADWLES 302
+ ++ SC + IK + E+ + G +L+K F L N TE D + +
Sbjct: 708 ETSIRKYDNSCADRIKSGFDEIRGLFLTSEGRQDLSKIFQLLPGFSENVTETDQHFFFSN 767
Query: 303 AYSYLAMVDYPYPSDFMMP-LPGYPIREVCKKIDNA 337
YS + + D P GY I E+C+ + A
Sbjct: 768 LYSNFQLA-VQFSGDNSGPWADGYGIPEMCRFMTGA 802
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 83/277 (29%), Positives = 126/277 (45%), Gaps = 24/277 (8%)
Query: 25 ILSPLSLA-AQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA 83
+L PL LA S F P HLT+ P T Y Q+LDHFS
Sbjct: 1 MLQPLLLAYLTMSAFSIIPTHFPFKEHLTKQPASPATS------VTTGYLSQKLDHFSND 54
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS--GFVWDIAPRFGAMLVF 141
F+Q+Y V ++ FLY EG+ E + V A RFGA L
Sbjct: 55 SQVFFTQQYFYTERLSVSNQKVA--FLYVNTEGNEEIAVMTDERSPVVKAAKRFGAQLFA 112
Query: 142 PEHRYYGESMPYGSTEVAYQN--ATTLSYLTAEQALADFAVFI--TNLKQNLSAEASPVV 197
+HRYYG S P +QN A+ L YLT+ QA+ D FI N + N++ + V
Sbjct: 113 LKHRYYGASKP------NFQNFDASALRYLTSRQAIQDILSFIKYANTQFNMNPDVR-WV 165
Query: 198 LFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCF 257
L+G YGG+LAA R P GA++SSAP+ + D F + ++ + ++C+
Sbjct: 166 LWGTGYGGILAAEARKTDPVAVSGAISSSAPLRRLYDFWQFNDF--VGNTLMQIGGSNCY 223
Query: 258 NTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
+++ + ++ + G +++ F L L+ T+
Sbjct: 224 GRVQQGFADIRQAMKTTAGRSQISDLFQLNPRLDQTQ 260
>gi|194390064|dbj|BAG60548.1| unnamed protein product [Homo sapiens]
Length = 213
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 82/133 (61%), Gaps = 6/133 (4%)
Query: 206 MLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWG 265
ML+A++R+KYPH+ GALA+SAP+L + F+ V++DF+ +S C ++E++
Sbjct: 1 MLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTADFEGQSPKCTQGVREAFR 60
Query: 266 ELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLESAYSYLAMVDYPYPSDFMMP 321
++ + + G + + F C+ L+ +DL + +A++ LAM+DYPYP+DF+ P
Sbjct: 61 QIKDLFLQ--GAYDTVRWEFGTCQPLSDEKDLTQLFMFARNAFTVLAMMDYPYPTDFLGP 118
Query: 322 LPGYPIREVCKKI 334
LP P++ C ++
Sbjct: 119 LPANPVKVGCDRL 131
>gi|343960995|dbj|BAK62087.1| dipeptidyl-peptidase 2 precursor [Pan troglodytes]
Length = 486
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 82/133 (61%), Gaps = 6/133 (4%)
Query: 206 MLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWG 265
ML+A++R+KYPH+ GALA+SAP+L + F+ V++DF+ +S C ++E++
Sbjct: 1 MLSAYLRMKYPHLVAGALAASAPVLAVAGLGDSNQFFRDVTADFEGQSPKCTQGVREAFR 60
Query: 266 ELVSVGQKENGLLELTK-TFHLCRELNSTEDLAD---WLESAYSYLAMVDYPYPSDFMMP 321
++ + + G + + F C+ L+ +DL + +A++ LAM+DYPYP+DF+ P
Sbjct: 61 QIKDLFLQ--GAYDTVRWEFGTCQPLSDEKDLTQLFMFARNAFTVLAMMDYPYPTDFLGP 118
Query: 322 LPGYPIREVCKKI 334
LP P++ C ++
Sbjct: 119 LPANPVKVGCDRL 131
>gi|299119169|gb|ADJ11387.1| GA15377 [Drosophila miranda]
Length = 167
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 75/133 (56%), Gaps = 5/133 (3%)
Query: 253 SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DWLESAYSYLAM 309
+++C I SW ++G E G +++ F+LC + + DL D++E Y LAM
Sbjct: 9 NSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKKFLDYIEEVYGNLAM 68
Query: 310 VDYPYPSDFMMPLPGYPIREVCKKIDNAPDA-TSILERIFEGVSVYYNYTGNVDCFQLDD 368
V+YPY S F+ PLP YP+R+VC + + + ++L + ++VY NYTG+V C +
Sbjct: 69 VNYPYNSSFLAPLPAYPVRQVCFYLKDLHQSDANLLHAMASALAVYTNYTGSVKCLDISV 128
Query: 369 DPHGLD-GWNWQV 380
+ + D GWN Q
Sbjct: 129 NSNADDSGWNVQT 141
>gi|392565931|gb|EIW59107.1| peptidase S28 [Trametes versicolor FP-101664 SS1]
Length = 548
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 91/172 (52%), Gaps = 17/172 (9%)
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFA 122
+ ++ +F Q LDHF+ A TF QRY I+T H+ P P+ + G E G
Sbjct: 63 ERKFPAHWFTQPLDHFTNASGHTFEQRYWISTRHY-RPRPDAPVIVLDGGETSGRDRLPF 121
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT--LSYLTAEQALADFAV 180
+++G V + G + V EHRYYG ++P QN TT L +L Q+ AD A
Sbjct: 122 LDTGIVEILTKATGGVGVILEHRYYGRTIPV-------QNFTTDSLRWLNNAQSAADSAN 174
Query: 181 FITNLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
F+ N+K ++L+A P + +GGSY G AA M++ YP + GA+ASS
Sbjct: 175 FMANVKFPGIDEDLAAPNHPWIYYGGSYAGARAAHMKILYPDLVYGAIASSG 226
>gi|407407956|gb|EKF31559.1| serine carboxypeptidase S28, putative [Trypanosoma cruzi
marinkellei]
Length = 483
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 103/217 (47%), Gaps = 13/217 (5%)
Query: 17 VITIVIISILSPLSL---AAQPSKFRRAPRFVGKLP-HLTEPPQRQQRQQQQQYRYETRY 72
V+ ++ ++ P++ + PS+ R + L ++ E P ++Q+ +Y
Sbjct: 13 VLGLLAVAFSGPVTALHPSVIPSRARHGEVYRAALRRNVKEGPGGHEKQRHDVRNNPAQY 72
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
+ QR+DH L TF QR+ ++ W + GP L EG + GFV +
Sbjct: 73 YNQRVDHADVT-LGTFRQRWWVDRSSWDANS--GPAILLVNGEGPAPGLP-DGGFVGEYG 128
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
A++ EHRYYGESMP T N + L YLT E ALAD F ++N+ +
Sbjct: 129 KSVKAIIFSLEHRYYGESMPAPLT-----NRSMLKYLTVENALADLQAFKKYAEKNVVKK 183
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
++ GGSY G L+AW R KYP A +SS +
Sbjct: 184 KVKWLIVGGSYAGALSAWARAKYPGDFDAAWSSSGVV 220
>gi|299119159|gb|ADJ11382.1| GA15377 [Drosophila miranda]
gi|299119161|gb|ADJ11383.1| GA15377 [Drosophila miranda]
gi|299119163|gb|ADJ11384.1| GA15377 [Drosophila miranda]
gi|299119165|gb|ADJ11385.1| GA15377 [Drosophila miranda]
gi|299119167|gb|ADJ11386.1| GA15377 [Drosophila miranda]
gi|299119171|gb|ADJ11388.1| GA15377 [Drosophila miranda]
gi|299119173|gb|ADJ11389.1| GA15377 [Drosophila miranda]
gi|299119175|gb|ADJ11390.1| GA15377 [Drosophila miranda]
gi|299119177|gb|ADJ11391.1| GA15377 [Drosophila miranda]
gi|299119179|gb|ADJ11392.1| GA15377 [Drosophila miranda]
gi|299119181|gb|ADJ11393.1| GA15377 [Drosophila miranda]
gi|299119183|gb|ADJ11394.1| GA15377 [Drosophila miranda]
gi|299119185|gb|ADJ11395.1| GA15377 [Drosophila miranda]
gi|299119187|gb|ADJ11396.1| GA15377 [Drosophila miranda]
gi|299119189|gb|ADJ11397.1| GA15377 [Drosophila miranda]
Length = 167
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 75/133 (56%), Gaps = 5/133 (3%)
Query: 253 SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DWLESAYSYLAM 309
+++C I SW ++G E G +++ F+LC + + DL D++E Y LAM
Sbjct: 9 NSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKKFLDYIEEVYGNLAM 68
Query: 310 VDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDD 368
V+YPY S F+ PLP YP+R+VC + + + + +L + ++VY NYTG+V C +
Sbjct: 69 VNYPYNSSFLAPLPAYPVRQVCFYLKDLHQSDADLLHAMASALAVYTNYTGSVKCLDISV 128
Query: 369 DPHGLD-GWNWQV 380
+ + D GWN Q
Sbjct: 129 NSNADDSGWNVQT 141
>gi|347970283|ref|XP_313407.5| AGAP003642-PA [Anopheles gambiae str. PEST]
gi|333468860|gb|EAA08815.5| AGAP003642-PA [Anopheles gambiae str. PEST]
Length = 476
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 57/169 (33%), Positives = 87/169 (51%), Gaps = 9/169 (5%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
R E F R+DHF + TF Y+ N +++ P GPIF+ G + + + +G
Sbjct: 37 RSEAFRFRTRVDHFDVQNRATFEFNYVSNGEYYR-PG--GPIFIVVGGNNALNAYFIENG 93
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
DIA R G L EHRYYG S P V +A + +L+ EQAL D +I +L+
Sbjct: 94 LFHDIARRQGGWLFSNEHRYYGRSSP-----VEDYSAPNMRFLSVEQALIDLIEWIDHLR 148
Query: 187 QNLSAE-ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+ + + + V+L G YGG +A W R ++P + GA S+A ++ D
Sbjct: 149 REVVRDPNAKVILHGLGYGGAVAIWARQRFPSLIDGAYGSTASVIARVD 197
>gi|395324539|gb|EJF56977.1| hypothetical protein DICSQDRAFT_174353 [Dichomitus squalens
LYAD-421 SS1]
Length = 467
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 73/236 (30%), Positives = 108/236 (45%), Gaps = 25/236 (10%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRY 146
TF Q+Y++N ++ GPI Y NE ++ + D A G + EHRY
Sbjct: 11 TFKQQYILNATYF---KEGGPILFYQSNEA-TTITCPDTLILADWAKEIGGLTATLEHRY 66
Query: 147 YGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS-AEASPVVLFGGSYGG 205
+G+S+P+G+ +N YLT E + D FI +K N++ A S ++ G SYGG
Sbjct: 67 FGQSLPFGNDSYTQEN---FKYLTLENVMQDAVNFIDFIKSNVTGASNSKAIVVGRSYGG 123
Query: 206 MLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWG 265
L+A R YP + GA A S P F D V + R+S + F+ IK+++
Sbjct: 124 TLSAIFRQNYPDVFYGAWAVSGPFYAFGD---STEIGQEVQQTYLRQSYTAFSRIKQAFS 180
Query: 266 ---ELVSVGQKENGLLELTKTFHLCRELNSTEDLAD------WLESAYSYLAMVDY 312
LV+ G + L K LC+ N T D+A WL AY L +
Sbjct: 181 NVKSLVASGDEPT----LAKELSLCQAPNVT-DVAGAVTFNYWLVGAYDILTQFSF 231
>gi|299119191|gb|ADJ11398.1| GA15377 [Drosophila pseudoobscura]
Length = 167
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 74/133 (55%), Gaps = 5/133 (3%)
Query: 253 SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DWLESAYSYLAM 309
+++C I SW ++G E G +++ F+LC + + DL D++E Y LAM
Sbjct: 9 NSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKNFLDYIEEVYGNLAM 68
Query: 310 VDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDD 368
V+YPY S F+ PLP YP+R+VC + + + + +L + ++VY NYTG+V C
Sbjct: 69 VNYPYNSSFLAPLPAYPVRQVCFYLKDLHQSDADLLHAMASALAVYTNYTGSVKCLDTSV 128
Query: 369 DPHGLD-GWNWQV 380
+ + D GWN Q
Sbjct: 129 NSNADDSGWNVQT 141
>gi|299119193|gb|ADJ11399.1| GA15377 [Drosophila pseudoobscura]
gi|299119195|gb|ADJ11400.1| GA15377 [Drosophila pseudoobscura]
gi|299119197|gb|ADJ11401.1| GA15377 [Drosophila pseudoobscura]
gi|299119199|gb|ADJ11402.1| GA15377 [Drosophila pseudoobscura]
gi|299119201|gb|ADJ11403.1| GA15377 [Drosophila pseudoobscura]
gi|299119203|gb|ADJ11404.1| GA15377 [Drosophila pseudoobscura]
gi|299119205|gb|ADJ11405.1| GA15377 [Drosophila pseudoobscura]
gi|299119207|gb|ADJ11406.1| GA15377 [Drosophila pseudoobscura]
gi|299119209|gb|ADJ11407.1| GA15377 [Drosophila pseudoobscura]
gi|299119211|gb|ADJ11408.1| GA15377 [Drosophila pseudoobscura]
gi|299119213|gb|ADJ11409.1| GA15377 [Drosophila pseudoobscura]
gi|299119215|gb|ADJ11410.1| GA15377 [Drosophila pseudoobscura]
gi|299119217|gb|ADJ11411.1| GA15377 [Drosophila pseudoobscura]
gi|299119219|gb|ADJ11412.1| GA15377 [Drosophila pseudoobscura]
Length = 167
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 74/133 (55%), Gaps = 5/133 (3%)
Query: 253 SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DWLESAYSYLAM 309
+++C I SW ++G E G +++ F+LC + + DL D++E Y LAM
Sbjct: 9 NSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKNFLDYIEEVYGNLAM 68
Query: 310 VDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDD 368
V+YPY S F+ PLP YP+R+VC + + + + +L + ++VY NYTG+V C
Sbjct: 69 VNYPYNSSFLAPLPAYPVRQVCFYLKDLHQSDADLLHAMASALAVYTNYTGSVKCLDTSV 128
Query: 369 DPHGLD-GWNWQV 380
+ + D GWN Q
Sbjct: 129 NSNADDSGWNVQT 141
>gi|146322738|ref|XP_749261.2| serine peptidase [Aspergillus fumigatus Af293]
gi|129556778|gb|EAL87223.2| serine peptidase, putative [Aspergillus fumigatus Af293]
Length = 525
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/304 (28%), Positives = 127/304 (41%), Gaps = 40/304 (13%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE---GDIEWFAVNSGFVW 129
FEQ +DH + L TF QRY N + W GP P+ L+ E D + F N V
Sbjct: 45 FEQYIDHNN-PGLGTFPQRYWYNPEFWAGPGS--PVLLFTPGESDAADYDGFLTNKTIVG 101
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK--- 186
A G ++ EHRY+G S PY TL YLT EQ++AD F +
Sbjct: 102 RFAEEIGGAVILLEHRYWGASSPYPEL-----TTETLQYLTLEQSIADLVHFAKTVNLPF 156
Query: 187 ---QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
+ +A+ +P V+ GGSY G LAAW P ASSAP+ D ++
Sbjct: 157 DEIHSSNADNAPWVMTGGSYSGALAAWTASIAPGTFWAYHASSAPVQAIYDFW---QYFV 213
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
V + + N + E + G E E+ + F L L +D A
Sbjct: 214 PVVEGMPKNCSKDLNRVVEYIDHVYESGDIER-QQEIKEMFGL-GALKHFDDFA------ 265
Query: 304 YSYLAMVDYPYPSDFMMPLPGYP-IREVCKKIDN-APDATSI-------LERIFEGVSVY 354
A+ + P+ M + GY + C ++N P A S+ LE+ +G + +
Sbjct: 266 ---AAITNGPWLWQDMNFVSGYSRFYKFCDAVENVTPGAKSVPGPEGVGLEKALQGYASW 322
Query: 355 YNYT 358
+N T
Sbjct: 323 FNST 326
>gi|219127344|ref|XP_002183897.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217404620|gb|EEC44566.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 526
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/232 (29%), Positives = 116/232 (50%), Gaps = 27/232 (11%)
Query: 72 YFEQRL-DHFSFADL---PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
YF+ +L DH + + +SQR+ ++ +++ GP PIF+ G EG IE ++GF
Sbjct: 32 YFDDQLVDHVASSHRHGHERWSQRFYLSHEYFKGPGS--PIFVIMGGEGAIE---PSTGF 86
Query: 128 VW----DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNAT-----TLSYLTAEQALADF 178
++ +A FGAM++ PEHR+YG+S P E+ L LT EQAL D
Sbjct: 87 MYPFILQLAQTFGAMVLQPEHRFYGQSQPVTPAEIERARDDGKPDPRLKLLTVEQALHDA 146
Query: 179 AVFITNLKQNL-------SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ 231
I ++ + S PV+ GGSY G L+A RL++P + A A+SAP+
Sbjct: 147 VRLIHFVRDRVRCSRDRFSPRYCPVITVGGSYPGFLSAMARLRFPGVVDMAYAASAPMKF 206
Query: 232 FEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKT 283
+ V +YN + + ++ C ++ + + +V E+G ++ +T
Sbjct: 207 YAQQVDQYAYYNHIGTVAEQAFTGCSQDVRRALDDFRTV--YESGQSKINET 256
>gi|312380006|gb|EFR26124.1| hypothetical protein AND_08001 [Anopheles darlingi]
Length = 506
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/308 (26%), Positives = 145/308 (47%), Gaps = 33/308 (10%)
Query: 66 YRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
Y E R F R++HF D TF YL N ++ GP+F+ G ++ + + +
Sbjct: 56 YTTEGR-FTSRINHFDPQDRSTFEFNYLTNDQYY---REGGPLFVVVGGHHRLDPYFLEN 111
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
D+A GA L EHR++G S+P + +++ +N L +L EQAL D +I L
Sbjct: 112 SHFRDVAALNGAFLANNEHRFFGTSVP--TEDLSSEN---LRFLRTEQALFDLIEWIDFL 166
Query: 186 KQNLSAEASP-VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI---LQFEDIVPPETF 241
K+ + + + V++ G SYGG LA W R ++P+I GA SSA + ++F + E F
Sbjct: 167 KREVMGDPNARVIVHGFSYGGALATWARQRFPNIIDGAWGSSATVRATVEFSEFT--EDF 224
Query: 242 YNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLE-LTKTFHLCRELNSTEDLADWL 300
N + + S C++ I ++ + ++ + GL E ++ TF+ C +++ L L
Sbjct: 225 GNTIRV---KGSDECYSAIFRAFHTIENL--LDAGLTERVSSTFNTCDPIDADNALQVEL 279
Query: 301 ESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYN--YT 358
+L ++ ++ VC ++ + TS+ E ++ Y Y
Sbjct: 280 -----FLHLMTLSLEISMFEDFDIENVQNVCNQLTDDSFDTSM-----EALAEYLKNRYA 329
Query: 359 GNVDCFQL 366
DCF L
Sbjct: 330 DVRDCFDL 337
>gi|299119221|gb|ADJ11413.1| GA15377 [Drosophila pseudoobscura]
Length = 167
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 74/133 (55%), Gaps = 5/133 (3%)
Query: 253 SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLA---DWLESAYSYLAM 309
+++C I SW ++G E G +++ F+LC + + DL D++E Y LAM
Sbjct: 9 NSNCTTNIGRSWKTFETLGGTEAGKKQISDAFNLCHPIKNDADLKNFLDYIEEVYGNLAM 68
Query: 310 VDYPYPSDFMMPLPGYPIREVCKKIDNAPDATS-ILERIFEGVSVYYNYTGNVDCFQLDD 368
V+YPY S F+ PLP YP+R+VC + + + + +L + ++VY NYTG+V C
Sbjct: 69 VNYPYNSSFLAPLPAYPVRQVCFYLKDLHQSDADLLHGMASALAVYTNYTGSVKCLDTSV 128
Query: 369 DPHGLD-GWNWQV 380
+ + D GWN Q
Sbjct: 129 NSNADDSGWNVQT 141
>gi|397626302|gb|EJK68100.1| hypothetical protein THAOC_10754 [Thalassiosira oceanica]
Length = 629
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 70/230 (30%), Positives = 105/230 (45%), Gaps = 22/230 (9%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA---VNS 125
E Y +Q +DHF T+ RY ++ ++ GP PIF+ G EG +E VN
Sbjct: 148 ELFYADQLVDHFD-GSTDTWDNRYYASSRYFGGPGH--PIFMVVGGEGSLEKMLYPFVNE 204
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
+A FGA +V EHR+YG P + V + T L LT +QA+AD +
Sbjct: 205 ----HLAFHFGAAVVQIEHRFYGPYQPLPNATV--EELTEL--LTPQQAMADMVRLTKHF 256
Query: 186 KQNL--------SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP 237
K L S E PVV GG+Y G L+A RL + A ASSAP+ ++ P
Sbjct: 257 KDELGCGGYDRTSPEYCPVVSVGGAYPGFLSAMFRLAHGDFVDVAYASSAPLKLYDQSAP 316
Query: 238 PETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC 287
E +Y+ V+ +R S C + ++ + E + + ++C
Sbjct: 317 QEVYYDTVTRAAERLSPGCADAVRSALVEAEEAIMASPSVESAVRAMNMC 366
>gi|159128675|gb|EDP53789.1| serine peptidase, putative [Aspergillus fumigatus A1163]
Length = 525
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 87/304 (28%), Positives = 127/304 (41%), Gaps = 40/304 (13%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE---GDIEWFAVNSGFVW 129
FEQ +DH + L TF QRY N + W GP P+ L+ E D + F N V
Sbjct: 45 FEQYIDHNN-PGLGTFPQRYWYNPEFWAGPGS--PVLLFTPGESDAADYDGFLTNKTIVG 101
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK--- 186
A G ++ EHRY+G S PY TL YLT EQ++AD F +
Sbjct: 102 RFAEEIGGAVILLEHRYWGASSPYPEL-----TTETLQYLTLEQSIADLVHFAKTVNLPF 156
Query: 187 ---QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
+ +A+ +P V+ GGSY G LAAW P ASSAP+ D ++
Sbjct: 157 DENHSSNADNAPWVMTGGSYSGALAAWTASIAPGTFWAYHASSAPVQAIYDFW---QYFV 213
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
V + + N + E + G E E+ + F L L +D A
Sbjct: 214 PVVEGMPKNCSKDLNRVVEYIDHVYESGDIER-QQEIKEMFGL-GALKHFDDFA------ 265
Query: 304 YSYLAMVDYPYPSDFMMPLPGYP-IREVCKKIDN-APDATSI-------LERIFEGVSVY 354
A+ + P+ M + GY + C ++N P A S+ LE+ +G + +
Sbjct: 266 ---AAITNGPWLWQDMNFVSGYSRFYKFCDAVENVTPGAKSVPGPEGVGLEKALQGYASW 322
Query: 355 YNYT 358
+N T
Sbjct: 323 FNST 326
>gi|294873112|ref|XP_002766517.1| prolylcarboxypeptidase, putative [Perkinsus marinus ATCC 50983]
gi|239867444|gb|EEQ99234.1| prolylcarboxypeptidase, putative [Perkinsus marinus ATCC 50983]
Length = 133
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/119 (44%), Positives = 68/119 (57%), Gaps = 11/119 (9%)
Query: 135 FGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL--KQNLSAE 192
GA LVF EHRYYG++ Y L YLT EQALAD++V I + K +L
Sbjct: 14 LGAFLVFAEHRYYGKTQVYSD-----GTPDCLRYLTIEQALADYSVLIDYIFDKHDLP-P 67
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP---PETFYNIVSSD 248
++ + FGGSYGGMLA+ R KYPHI GA+A+SAPI + P F I++ D
Sbjct: 68 STATIAFGGSYGGMLASAFRYKYPHIVDGAIAASAPIFAIGGVTPEPSKTAFNEIITRD 126
>gi|347971076|ref|XP_318472.5| AGAP004015-PA [Anopheles gambiae str. PEST]
gi|333469620|gb|EAA43688.5| AGAP004015-PA [Anopheles gambiae str. PEST]
Length = 497
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 93/193 (48%), Gaps = 11/193 (5%)
Query: 40 RAPRFVGKLP-HLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDH 98
R+ R +G L HL P + + + R F ++DHF+ + TF Y N +
Sbjct: 25 RSTRLLGTLQRHLARPAIPEGFVPRNENTTGGR-FRTKIDHFNPQNRDTFEFSYFSNNEF 83
Query: 99 WVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEV 158
+ P GPIF++ G + + + G ++D A R GA L EHRYYG S P V
Sbjct: 84 YR-PG--GPIFIFVGGNFAMTTYYIEHGLLYDTAARDGAWLFTNEHRYYGASTP-----V 135
Query: 159 AYQNATTLSYLTAEQALADFAVFITNLKQNLSAE-ASPVVLFGGSYGGMLAAWMRLKYPH 217
+ L +L +EQAL D +I L+ + + + VVL G Y G LA W R ++P
Sbjct: 136 PDYSTENLRFLKSEQALMDLIEWIDYLRNTVVGDPNAKVVLMGTGYAGALATWARQRFPS 195
Query: 218 IAIGALASSAPIL 230
I GA + A +L
Sbjct: 196 IIDGAWGAGATVL 208
>gi|392572044|gb|EIW65216.1| peptidase S28 [Trametes versicolor FP-101664 SS1]
Length = 528
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 63/164 (38%), Positives = 87/164 (53%), Gaps = 20/164 (12%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAVNSGFVWD 130
F Q LDHF TF QRY ++ H+ GP+ ++ EG D +++G +
Sbjct: 63 FTQPLDHFVDTGF-TFQQRYWLSDRHY---KPGGPVIVFEAGEGPGDERMPILDTGILNI 118
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT--LSYLTAEQALADFAVFITN---- 184
+A + + EHRYYGES+P QN TT L +L EQA AD A FI N
Sbjct: 119 LANATDGLAIVLEHRYYGESVPV-------QNFTTDSLRWLNNEQAAADSANFIDNVTFP 171
Query: 185 -LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
+ +L+A +P + +GGSYGG AA MR+ YP + GA+ASS
Sbjct: 172 GIPGDLTAPGTPWIYYGGSYGGARAAHMRVLYPDLVFGAIASSG 215
>gi|17550650|ref|NP_508903.1| Protein C26B9.5 [Caenorhabditis elegans]
gi|351058287|emb|CCD65711.1| Protein C26B9.5 [Caenorhabditis elegans]
Length = 516
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 76/308 (24%), Positives = 133/308 (43%), Gaps = 32/308 (10%)
Query: 74 EQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE--WFAVNSGFVWDI 131
Q++D+F + ++QRY N N +FL E W + +
Sbjct: 62 SQKVDNFDANNNAMYNQRYWYNPTFTQNKNI---VFLMIQGEAPATDTWISNPNYQYLQW 118
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSA 191
A FGA + EHR +G+S PY T + + T QALAD FI + + +
Sbjct: 119 AKEFGADVFQLEHRCFGQSRPYPDTSMP-----GIKVCTMTQALADIHNFIQQMNRRFNF 173
Query: 192 EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKR 251
+ + FGGSY G L+A R +YP +GA+ASSAP+ D + +V K+
Sbjct: 174 QNPKWITFGGSYPGTLSALFRQQYPADTVGAVASSAPLDWTLDFF---EYAMVVEDVLKK 230
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL----NSTEDLADWLESAYSYL 307
S C+ + +++ + + + G+ +L F+L + D+ ++ + YS+
Sbjct: 231 TSVDCWRNVNQAFLNMQQLSLTKAGIQQLNTYFNLVPAFVDGQYTQHDIDNFFANVYSFF 290
Query: 308 -AMVDYPYPSDFMMPLPGYPIREVCKKIDNA--PDATSILERIFEGVSVYYNYTGNVDCF 364
+V Y Y L G +++C K+++A PD + + ++
Sbjct: 291 QGVVQYTYDGRNNATLNGLNAQQLCNKMNDATVPDVITRVNNTINWIN------------ 338
Query: 365 QLDDDPHG 372
Q++ DP G
Sbjct: 339 QMNGDPVG 346
>gi|358060145|dbj|GAA94204.1| hypothetical protein E5Q_00852 [Mixia osmundae IAM 14324]
Length = 1014
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 70/239 (29%), Positives = 111/239 (46%), Gaps = 29/239 (12%)
Query: 73 FEQRLDHFSFADL---PT------FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE--WF 121
F Q+++HF + P F Q Y +NT + R GP+FL+ E + W
Sbjct: 530 FRQKVNHFPDEEARYGPKELSKGYFDQVYTVNTTAY---KRGGPVFLHISGETSAQSRWR 586
Query: 122 AVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF 181
+G++ +A G +++ E RYYG S P+ S + +LT EQ +AD A F
Sbjct: 587 YATTGYLAKLAQATGGIVIIHECRYYGTSWPFESL-----TTDNMRFLTTEQMIADVAYF 641
Query: 182 ITNLK------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
++ +L+A +P +L GGS G L A+ +++YP I GA+ASSA + F
Sbjct: 642 AQTVRIPGLEHLDLTAPKTPWILTGGSLAGALTAFAKVQYPDIFFGAIASSATVQSFFGF 701
Query: 236 VPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
P+ +Y RE + E +L + G K N + E K F L L++ +
Sbjct: 702 --PQWYY-APREHAPRECMAIVEKTVERMDKLFATGNK-NRIREFKKLFGLASVLDNRD 756
>gi|302790231|ref|XP_002976883.1| hypothetical protein SELMODRAFT_106030 [Selaginella moellendorffii]
gi|300155361|gb|EFJ21993.1| hypothetical protein SELMODRAFT_106030 [Selaginella moellendorffii]
Length = 393
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 61/156 (39%), Positives = 86/156 (55%), Gaps = 24/156 (15%)
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFL-YCGNEGDIEWFAVNSGF---VWDIAPRFGAML 139
D FSQRY D++ P + GPIFL CG G+ +A GA++
Sbjct: 1 DSRVFSQRYFEFLDYF-QPQQ-GPIFLALCGES------TCGGGYQRTAQALAKSLGAVV 52
Query: 140 VFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI------TNLKQNLSAEA 193
V EHRYYG+S P+ +Y+N L YLT +QAL D+A+FI NL+ N +
Sbjct: 53 VTIEHRYYGQSYPF--QNFSYKN---LKYLTTQQALYDYALFIDYYENLVNLQYNKQGK- 106
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+P ++ GGSY G L+AW RLK+PH+ + + ASS +
Sbjct: 107 NPWIVVGGSYAGALSAWFRLKFPHLVVASWASSGVV 142
>gi|157119581|ref|XP_001659434.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108875271|gb|EAT39496.1| AAEL008699-PA [Aedes aegypti]
Length = 512
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 66/225 (29%), Positives = 109/225 (48%), Gaps = 34/225 (15%)
Query: 20 IVIISILSPLSLAAQPSKFRRAP---------------RFVGKLPHLTEPPQRQQRQQQQ 64
IV++++ + A S RR P RF P + +P +R +
Sbjct: 4 IVLVTLALVVMTATAASVSRRVPNKELSRTLMQQVMTKRFSSTKPAVDDPAKRIVIEN-- 61
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
+F R+DHF+ + ++ RY TD+++ P GPI ++ G I V+
Sbjct: 62 -------FFTTRVDHFNSQNTAEWTLRYFAVTDYYM-PG--GPILIFLGGNQPILTSMVD 111
Query: 125 -SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
S ++D+A + E R+YG+S + + + + +N LS L +Q LAD A F+
Sbjct: 112 ESTLIYDMAREMNGAVYAFESRFYGQS--FVTEDASTEN---LSLLNTDQILADLAEFVQ 166
Query: 184 NLKQN-LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
LK++ L +PV++ G YGG LA W R++YPH+A A +SS
Sbjct: 167 YLKRDVLKNPNAPVMVSGSEYGGALATWFRVRYPHLAQAAWSSSG 211
>gi|312383569|gb|EFR28610.1| hypothetical protein AND_03264 [Anopheles darlingi]
Length = 490
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 72/248 (29%), Positives = 113/248 (45%), Gaps = 18/248 (7%)
Query: 17 VITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEP--PQRQQRQQQQQYRYETRYFE 74
++T+++ + AA SK R + + + HL +P P Q + R F
Sbjct: 12 LVTLIVCGASGRSASAAIDSKNVRLLKDLQR--HLAQPVIPDTYQGSEGNPRLIGAR-FR 68
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPR 134
R+DHF + TF Y N D + P GPI+++ G + + + G ++D A R
Sbjct: 69 TRVDHFDPQNRDTFEFNYYSN-DEFYQPG--GPIYIFVGGNFQLTTYYIEHGLLYDTAAR 125
Query: 135 FGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE-A 193
A L EHRYYG S P V + L +L EQ L D +I +L+ + +
Sbjct: 126 DHAWLFTNEHRYYGTSTP-----VENYSTENLRFLHTEQVLTDLIEWIDHLRNEVVRDPN 180
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRES 253
+ V+L G Y G LA W R ++P+I GA S A +L D + N + S +R
Sbjct: 181 AKVILMGVGYAGALATWARQRFPNIVDGAWGSGATVLASFDF---QEHANDIGSIIRRFG 237
Query: 254 AS-CFNTI 260
C++T+
Sbjct: 238 GDECYSTL 245
>gi|170045817|ref|XP_001850490.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868718|gb|EDS32101.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 487
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 66/232 (28%), Positives = 106/232 (45%), Gaps = 23/232 (9%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN-SGFVWD 130
YF RL+HF+ ++ RYL T+H+ P GPI + G + +N S + +
Sbjct: 29 YFTTRLNHFNHQQREDWTLRYLSVTEHY-RPG--GPILIRLSGNGPVRRDMINESSLITE 85
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL- 189
+A G + E R+YG S P N + +L +Q +AD FI LK+++
Sbjct: 86 LAREMGGAVYAFETRFYGMSKPTNDV-----NTEIMRFLKTDQIMADLVEFIIYLKRDVF 140
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA---PILQFEDIVP--PETFYNI 244
E PV++ G YGG LA W R++YPH+ A +S +L F D ET +
Sbjct: 141 RDENMPVLVSGAGYGGALATWFRVRYPHMGDAAWSSGGYHEAVLDFSDFAESWSETLIDY 200
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
S E FN ++ L+ +G + L + ++C E++ + L
Sbjct: 201 GSQQCYNELFVAFNVMQ----NLIDIGMTDM----LYEKLNICTEIDPEDSL 244
>gi|297832460|ref|XP_002884112.1| hypothetical protein ARALYDRAFT_480713 [Arabidopsis lyrata subsp.
lyrata]
gi|297329952|gb|EFH60371.1| hypothetical protein ARALYDRAFT_480713 [Arabidopsis lyrata subsp.
lyrata]
Length = 417
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 78/237 (32%), Positives = 119/237 (50%), Gaps = 34/237 (14%)
Query: 97 DHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGST 156
D++ P+ GP+F+ EG A + ++ +A +F A +V EHRYYG+S P+ S
Sbjct: 2 DYFRAPD--GPLFMIICGEGPCSGIA--NDYINVLAKKFQAGVVSLEHRYYGKSSPFNS- 56
Query: 157 EVAYQNATTLSYLTAEQALADFAVF------ITNLKQNLS--AEASPVVLFGGSYGGMLA 208
+A +N L YL+++QAL D A F N+K N+S +P FG SY G L+
Sbjct: 57 -LATEN---LKYLSSKQALFDLAAFRQYYQESLNVKLNMSNGGNENPWFFFGISYSGALS 112
Query: 209 AWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFK-RESAS--CFNTIKESWG 265
AW RLK+PH+ G+LASSA + Y D + ESA C ++E+
Sbjct: 113 AWFRLKFPHLTCGSLASSAVV---------RAVYEFTEFDQQIGESAGQECKGALQET-N 162
Query: 266 ELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPL 322
+L+ +G KEN + K+ EL+ D +L + + +A Y P +PL
Sbjct: 163 KLLELGLKEN--RKAVKSLFNATELDVDADFL-YLTADAAVMAF-QYGNPDKLCVPL 215
>gi|121717092|ref|XP_001276006.1| extracelular serine carboxypeptidase, putative [Aspergillus
clavatus NRRL 1]
gi|119404163|gb|EAW14580.1| extracelular serine carboxypeptidase, putative [Aspergillus
clavatus NRRL 1]
Length = 582
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 68/231 (29%), Positives = 105/231 (45%), Gaps = 22/231 (9%)
Query: 18 ITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRL 77
+ +I + LSPL+ A S + +L + + + R Q + F +
Sbjct: 5 VPFLIGTCLSPLTQAISLSPLELQFPLLHELRNAKSIEKGEFRSQALSVSFAEHNFSVPV 64
Query: 78 DHFSFADL------PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI--EWFAVNSGFVW 129
DHF +F+ RY + H+ GP+FL E D + ++ G V
Sbjct: 65 DHFHNESRYEPHSDDSFNLRYWFDASHY---KEGGPVFLIAAGETDATDRFPFLSQGIVA 121
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL---- 185
+A + + V EHRYYGES P+ + V + +L+ EQALAD+A F +N+
Sbjct: 122 QLAKTYNGLGVILEHRYYGESYPFVNLTV-----ENIRFLSTEQALADYAHFASNVAFPG 176
Query: 186 --KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
NL+A A P + +GGSY G A++R YP I G ++SS ED
Sbjct: 177 LEHLNLTAGAVPWIGYGGSYAGAFVAFLRKVYPDIFFGVVSSSGVTAAIED 227
>gi|388852407|emb|CCF54022.1| uncharacterized protein [Ustilago hordei]
Length = 656
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 66/189 (34%), Positives = 94/189 (49%), Gaps = 15/189 (7%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHW--VGPNRLG---PIFLYCGNEGDIEWFA- 122
E Y +Q LDHF F QR+ +T H+ R G PI++ E D
Sbjct: 139 EPLYHKQPLDHFDNTTQAQFEQRFFYSTRHYKPASARRNGEAVPIYILDSGEADATARIP 198
Query: 123 -VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA---TTLSYLTAEQALADF 178
+++G + + G + + EHRYYG S+P TE+ +A L +LT +QAL D
Sbjct: 199 FLDTGILDIFSKATGGIGIVLEHRYYGTSLP-NRTELGSGDAWGVDQLRWLTNKQALQDS 257
Query: 179 AVFITNLK--QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
A FI NL ++E ++ +GGSY G AA MR YP + GA+ASSA + ++
Sbjct: 258 ADFIRNLDIPGTDNSEKRKIIYYGGSYPGARAAHMRFLYPELVHGAIASSAVVTAVDEF- 316
Query: 237 PPETFYNIV 245
PE FY I
Sbjct: 317 -PEYFYPIA 324
>gi|302797683|ref|XP_002980602.1| hypothetical protein SELMODRAFT_113084 [Selaginella moellendorffii]
gi|300151608|gb|EFJ18253.1| hypothetical protein SELMODRAFT_113084 [Selaginella moellendorffii]
Length = 401
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 61/156 (39%), Positives = 85/156 (54%), Gaps = 24/156 (15%)
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFL-YCGNEGDIEWFAVNSGF---VWDIAPRFGAML 139
D FSQRY D++ P + GPIFL CG G+ +A GA +
Sbjct: 1 DSRVFSQRYFEFLDYF-QPQQ-GPIFLALCGES------TCRGGYQRTAQALAKSLGAAV 52
Query: 140 VFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI------TNLKQNLSAEA 193
V EHRYYG+S P+ +Y+N L YLT +QAL D+A+FI NL+ N +
Sbjct: 53 VTIEHRYYGQSYPF--QNFSYKN---LKYLTTQQALYDYALFIEYYQNLINLRYNKQGK- 106
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+P ++ GGSY G L+AW RLK+PH+ + + ASS +
Sbjct: 107 NPWIVVGGSYAGALSAWFRLKFPHLVVASWASSGVV 142
>gi|158291876|ref|XP_313404.4| AGAP003639-PA [Anopheles gambiae str. PEST]
gi|157017498|gb|EAA08831.4| AGAP003639-PA [Anopheles gambiae str. PEST]
Length = 507
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 77/264 (29%), Positives = 128/264 (48%), Gaps = 26/264 (9%)
Query: 37 KFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINT 96
K RR P + ++ + + + Q R F R++HF + TF YL N
Sbjct: 29 KIRRLPPGIERMFTRKDGIRPPKGYVSQSPRTVEGRFTSRVNHFDPQNRDTFEFNYLHND 88
Query: 97 DHWVGPNRLGPIFLYCGNEGDIE-WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGS 155
++ + GP+F+ G + +F NS F D+A GA L EHRY+GES P +
Sbjct: 89 QYY---RQGGPLFIVVGGHYPVNPYFMENSHF-RDVAALEGAWLATNEHRYFGESYP--T 142
Query: 156 TEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP-VVLFGGSYGGMLAAWMRLK 214
+++ +N L ++ EQ L D +I LK+ + + + V+L G YGG LA W R +
Sbjct: 143 EDLSTEN---LRFMRTEQVLFDLIEWIDFLKREVMGDPNARVILHGVGYGGSLATWARQR 199
Query: 215 YPHIAIGALASSAPI---LQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWG---ELV 268
+P+I GA SSAP+ FE+ NI+ +R S C+N I +++ L+
Sbjct: 200 FPNIIDGAWGSSAPVRATTNFEEFAVE--VGNIIR---ERGSDQCYNRIFQAFHTAENLI 254
Query: 269 SVGQKENGLLELTKTFHLCRELNS 292
G+ E +++ F+ C +++
Sbjct: 255 DAGRTE----MISEMFNTCDPVDT 274
>gi|451852466|gb|EMD65761.1| hypothetical protein COCSADRAFT_140099 [Cochliobolus sativus
ND90Pr]
Length = 531
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 97/190 (51%), Gaps = 22/190 (11%)
Query: 58 QQRQQQQQYRYETRYFEQRLDHFSFAD--LP----TFSQRYLINTDHWVGPNRLGPIFLY 111
+ R +++Q+ Y+ +Q +DHF +D +P TF QRY + ++ GP+FLY
Sbjct: 28 KGRLEKRQWAYDAHTIDQPIDHFPHSDRYVPHTNGTFKQRYFFDKSYY---KPGGPVFLY 84
Query: 112 CGNEGDIEWFAVN--SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYL 169
E EW N +G + + +F + V E+RYYG S PY +T L +L
Sbjct: 85 LAGETWGEWTLDNLQTGIIQILMKKFNGLGVILENRYYGYSFPYNTT-----TTDELRFL 139
Query: 170 TAEQALADFAVFITNLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
T EQ +AD F ++K +LS+ +P V++GGS G A+ Y I G +A
Sbjct: 140 TTEQTIADNEYFRQHVKFPGVDADLSSPDTPWVMYGGSLAGSQVAFTMKMYNEIFAGGIA 199
Query: 225 SSAPI-LQFE 233
SSA I +Q+E
Sbjct: 200 SSATITVQYE 209
>gi|212545673|ref|XP_002152990.1| serine peptidase, putative [Talaromyces marneffei ATCC 18224]
gi|210064510|gb|EEA18605.1| serine peptidase, putative [Talaromyces marneffei ATCC 18224]
Length = 520
Score = 87.8 bits (216), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 75/220 (34%), Positives = 97/220 (44%), Gaps = 24/220 (10%)
Query: 24 SILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFA 83
SILS ++L + F A R P++ P Q QQY T +F+Q LDH A
Sbjct: 3 SILSTIALGVSWASFVVASR-----PNVPRAPAITAEAQIQQY--ATGWFDQLLDHDKPA 55
Query: 84 DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA---VNSGFVWDIAPRFGAMLV 140
L TF QRY +T++W GP P+ L+ E E F N A FG +
Sbjct: 56 -LGTFKQRYFWSTEYWKGPGS--PVILFQPGEQTAEGFQGYLFNKTITGVYAQEFGGAGL 112
Query: 141 FPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK------QNLSAEAS 194
EHRY+GES P V T+ LT + ALAD F N++ S + +
Sbjct: 113 ILEHRYWGESSP-----VDTLTPKTMQQLTFKNALADAVYFAKNVELPFDNSTKSSPQNA 167
Query: 195 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
P +L GGSY G A W P ASSAP+ D
Sbjct: 168 PWILAGGSYSGAQAGWTAATLPGTFWAYHASSAPVEAIWD 207
>gi|407919835|gb|EKG13057.1| Peptidase S28 [Macrophomina phaseolina MS6]
Length = 563
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 87/171 (50%), Gaps = 18/171 (10%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV-----NSGFVWDI 131
LDHFS D F R+ + + G P+F+Y E D A+ ++ F I
Sbjct: 69 LDHFS-GDAGFFENRFWVAESGYKGAGH--PVFIYDAGETDASQNALFRLQNSTSFFKQI 125
Query: 132 APRFGAMLVFPEHRYYGESMPYG-STEVAYQNATTLSYLTAEQALADFAVFITNLKQ--- 187
FG + + EHRYYGES+P + + A ++ YLT+EQALAD VF N +
Sbjct: 126 VDAFGGIGIVWEHRYYGESVPVNINLDTAPED---FIYLTSEQALADVPVFAANFSRANF 182
Query: 188 ---NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
+L+ ++P + GGSY GM AA+MR YP + ASSAP+ D+
Sbjct: 183 PDIDLTPSSTPWIFIGGSYPGMRAAFMREYYPETIFASFASSAPVQAQNDM 233
>gi|345568628|gb|EGX51521.1| hypothetical protein AOL_s00054g220 [Arthrobotrys oligospora ATCC
24927]
Length = 547
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 63/171 (36%), Positives = 91/171 (53%), Gaps = 21/171 (12%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE----GDIEWFAV 123
Y +F Q++DHF ++ TF QRY I+T H+ P GPIF+ G E G +E+
Sbjct: 65 YTELWFPQKVDHFDPSNNNTFQQRYWIST-HFYKPG--GPIFVLDGGETSGAGRVEYMQT 121
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
G I G + + EHRYYG+S Y + + +N L +L Q+L D A F
Sbjct: 122 GIGRY--ITEYLGGIGIVLEHRYYGKS--YVTPNLTVEN---LKWLNTAQSLKDNAYFAE 174
Query: 184 NLKQNLSAEAS-------PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
NL + L A S P + +GGSY G +A+++++YP I G+LASSA
Sbjct: 175 NLWKELPANLSHIRPDNAPFISYGGSYAGAKSAFLQIEYPEIYYGSLASSA 225
>gi|118487656|gb|ABK95653.1| unknown [Populus trichocarpa]
Length = 213
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 37/74 (50%), Positives = 49/74 (66%), Gaps = 2/74 (2%)
Query: 309 MVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLDD 368
MV+YP ++FMMPLP YP++ +CK ID P S + R+F S+YYNY+ CF+L+
Sbjct: 1 MVNYPTEANFMMPLPAYPVQAMCKIIDGFPSGASKITRVFAAASLYYNYSRAEKCFKLEH 60
Query: 369 DP--HGLDGWNWQV 380
P HGL GWNWQ
Sbjct: 61 GPDAHGLHGWNWQA 74
>gi|353242348|emb|CCA74000.1| related to serine protease [Piriformospora indica DSM 11827]
Length = 548
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 90/183 (49%), Gaps = 26/183 (14%)
Query: 62 QQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIE 119
Q Q + + F Q LDH + PTF QRY +NT H+ + GP+ + G E G+
Sbjct: 58 QPQYPEFPPQMFIQPLDHDDPSS-PTFEQRYWVNTRHY---KKGGPVIVIDGGETSGEDR 113
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
+++G +A + V EHRYYGES+P V +L +L QAL+D A
Sbjct: 114 LPFLDTGIADILAKATHGLGVILEHRYYGESVP-----VKNLTTDSLRWLNNYQALSDSA 168
Query: 180 VFITNLK---------------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
F+ ++ NL A SP + +GGSY G AA MR+ YP I GA+A
Sbjct: 169 RFMKHVNFSSNLFPSSVSSETISNLKAPNSPWIYYGGSYAGARAAHMRVLYPEIVFGAIA 228
Query: 225 SSA 227
SSA
Sbjct: 229 SSA 231
>gi|195569681|ref|XP_002102837.1| GD19287 [Drosophila simulans]
gi|194198764|gb|EDX12340.1| GD19287 [Drosophila simulans]
Length = 530
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 81/171 (47%), Gaps = 11/171 (6%)
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-QNL 189
+A +LV+ EHRYYG+S+P + + L YL +QALAD AVFI K +N
Sbjct: 1 MAQEHKGVLVYTEHRYYGQSVP-----TSTMSTDDLKYLDVKQALADVAVFIETFKAENP 55
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDF 249
S V+L GGSY + W + YP + +G ASSAP+L D + +V F
Sbjct: 56 QLANSKVILAGGSYSATMVVWFKRLYPDLIVGGWASSAPLLAKVDFTE---YKEVVGQAF 112
Query: 250 -KRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADW 299
+ C++ I+ EL S+ + G E LC + DL W
Sbjct: 113 LQLGGQKCYDRIENGIAELESMFANKRG-AEARAMLRLCNSFDDQNDLDLW 162
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 46/166 (27%), Positives = 72/166 (43%), Gaps = 8/166 (4%)
Query: 168 YLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
Y + +QALAD I LKQ + S VV+ G SY +A W+R YP I G+ ASSA
Sbjct: 190 YQSVDQALADVINVIATLKQEDKYKDSKVVVSGCSYSATMATWIRKLYPEIIRGSWASSA 249
Query: 228 PILQFEDIVPPETFYNIVSSDFKRESAS-CFNTIKESWGELVSVGQKENGLLELTKTFHL 286
P+L V + + +V + C++ I + ++ + NG + K +L
Sbjct: 250 PLLA---KVNFKDYMKVVGESYATLGGQYCYDLIDNATSYYENLFEIGNG-TQAVKELNL 305
Query: 287 CRELNSTEDLADWL---ESAYSYLAMVDYPYPSDFMMPLPGYPIRE 329
C N + W A + + Y P + +P +RE
Sbjct: 306 CSNFNVNSEQDRWQIFSTIANIFAGIAQYQKPEKYDIPTYCSILRE 351
>gi|91806190|gb|ABE65823.1| serine carboxypeptidase S28 family protein [Arabidopsis thaliana]
Length = 417
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 82/241 (34%), Positives = 122/241 (50%), Gaps = 42/241 (17%)
Query: 97 DHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGST 156
D++ P+ GP+F+ EG A + ++ +A +F A +V EHRYYG+S P+ S
Sbjct: 2 DYFRSPD--GPMFMIICGEGPCSGIA--NDYINVLAKKFQAGVVSLEHRYYGKSSPFNS- 56
Query: 157 EVAYQNATTLSYLTAEQALADFAVF------ITNLKQNLSAEAS--PVVLFGGSYGGMLA 208
+A +N L YL+++QAL D A F N K N+S+ S P FG SY G L+
Sbjct: 57 -LATEN---LKYLSSKQALYDLASFRQYYQESLNKKLNISSGGSDNPWFFFGISYSGALS 112
Query: 209 AWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELV 268
AW RLK+PH+ G+LASSA + + S+F ++ I ES G+
Sbjct: 113 AWFRLKFPHLTCGSLASSAVVRAIYEF-----------SEFDQQ-------IGESAGQEC 154
Query: 269 SVG-QKENGLLEL---TKTFHLCRELNSTE-DL-ADWLE-SAYSYLAMVDYPYPSDFMMP 321
+ Q+ N LLEL K + N+TE D+ AD+L +A + + Y P +P
Sbjct: 155 KLALQETNKLLELGLKVKNKAVKSLFNATELDVDADFLYLTADAAVMAFQYGNPDKLCVP 214
Query: 322 L 322
L
Sbjct: 215 L 215
>gi|71002774|ref|XP_756068.1| extracelular serine carboxypeptidase [Aspergillus fumigatus Af293]
gi|66853706|gb|EAL94030.1| extracelular serine carboxypeptidase, putative [Aspergillus
fumigatus Af293]
Length = 572
Score = 87.4 bits (215), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 97/207 (46%), Gaps = 40/207 (19%)
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFA 122
+ E+RY DHF+ RY + H+ GP+FL E G +
Sbjct: 66 HFHNESRYEPHTRDHFNL--------RYWFDASHY---KEGGPVFLIAAGETNGRDRFPF 114
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
++ G V +A + + V EHRYYGES P+ A + +L+ EQA+AD+A F
Sbjct: 115 LSHGIVTQLAKTYNGLGVILEHRYYGESYPF-----ADLTTKNIRFLSTEQAMADYAYFA 169
Query: 183 TNL------KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA--------- 227
+N+ NL+A+A P + +GGSY G A++R YP + G ++SS
Sbjct: 170 SNVVFPGLEHLNLTADAVPWIGYGGSYAGAFVAFLRKVYPEVFFGVVSSSGVTEAIVDYW 229
Query: 228 ----PILQF---EDIVPPETFYNIVSS 247
PI QF + I ETF +IV +
Sbjct: 230 QYYEPIRQFAPSDCIWSIETFMDIVDT 256
>gi|268579387|ref|XP_002644676.1| Hypothetical protein CBG14660 [Caenorhabditis briggsae]
Length = 512
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 71/287 (24%), Positives = 130/287 (45%), Gaps = 23/287 (8%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE--WFAVNSGFVWDIA 132
Q++D+F + + Q Y N+++ + G +FL E W + A
Sbjct: 64 QKVDNFDNTNNAMYDQHYWYNSNY---TQKKGIVFLMIQGEAPATDLWIQNPNYQYLKWA 120
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
FGA + EHR +G+S PY +++Y N + T QA+AD FI + +
Sbjct: 121 KEFGADVFQLEHRCFGQSRPY--KDLSYPN---IKVCTMSQAIADIHNFIGQMNIQYNFR 175
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRE 252
+ FGGSY G L+A R ++P +GA+ASSAP+ D + +V +
Sbjct: 176 NPKWITFGGSYPGTLSALFRQQHPEDTVGAVASSAPLDWTLDFF---EYAMVVEDVLNQT 232
Query: 253 SASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL----NSTEDLADWLESAYSYL- 307
S C+ +K+++ ++ + + G+ +L F L + D+ ++ + Y +
Sbjct: 233 STDCWQNVKDAFYKMQQLSLTKQGIQQLNAYFDLSPPFVDGQYTQHDIDNFFANLYGFFQ 292
Query: 308 AMVDYPYPSDFMMPLPGYPIREVCKKID--NAPDATSILERIFEGVS 352
+V Y Y L G ++++C K++ N PD ++ R+ V+
Sbjct: 293 GVVQYTYDGRNAATLNGLNVQQLCNKMNDKNVPD---VINRVNNTVN 336
>gi|242794965|ref|XP_002482483.1| extracelular serine carboxypeptidase, putative [Talaromyces
stipitatus ATCC 10500]
gi|218719071|gb|EED18491.1| extracelular serine carboxypeptidase, putative [Talaromyces
stipitatus ATCC 10500]
Length = 561
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/224 (30%), Positives = 109/224 (48%), Gaps = 27/224 (12%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
TF RY + H+ GP+F+ E G+ + G V +A + V EH
Sbjct: 74 TFPLRYWFDASHY---KEGGPVFVLESGETSGEDRLPYLQKGLVAQLAQLTNGIAVVLEH 130
Query: 145 RYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVFITNL------KQNLSAEASPV 196
RYYG S+P STE +L +LT EQ LAD A F N+ QNL++ P
Sbjct: 131 RYYGASIPTKDFSTE-------SLRFLTTEQGLADVAYFAQNIVYPGFEDQNLTSRHVPY 183
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASC 256
+ +GGSY G + A++R+ YP + GA+ASSA E IV ++ + + +C
Sbjct: 184 IAYGGSYAGAMVAFLRVTYPDVFFGAIASSAVT---EAIVDYWQYWEPIR---RNAPQNC 237
Query: 257 FNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL 300
+T++ G L ++ + + +L F L ++L+ +D A+ L
Sbjct: 238 IHTVENLTGVLDNLAHNTSAVKDLETLFGL-QDLSHVDDFANVL 280
>gi|347970281|ref|XP_001230929.2| AGAP003641-PA [Anopheles gambiae str. PEST]
gi|333468859|gb|EAU76915.2| AGAP003641-PA [Anopheles gambiae str. PEST]
Length = 328
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/282 (28%), Positives = 134/282 (47%), Gaps = 30/282 (10%)
Query: 37 KFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINT 96
K RR P + ++ + + + Q R F R++HF + TF YL N
Sbjct: 29 KIRRLPPGIERMFTRKDGIRPPKGYVSQSPRTVEGRFTSRVNHFDPQNRDTFEFNYLHND 88
Query: 97 DHWVGPNRLGPIFLYCGNEGDIE-WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGS 155
++ + GP+F+ G + +F NS F D+A GA L EHRY+GES P +
Sbjct: 89 QYY---RQGGPLFIVVGGHYPVNPYFMENSHF-RDVAALEGAWLATNEHRYFGESYP--T 142
Query: 156 TEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASP-VVLFGGSYGGMLAAWMRLK 214
+++ +N L ++ EQ L D +I L++ + + + V+L G YGG LA W R +
Sbjct: 143 EDLSTEN---LRFMRTEQVLFDLIEWIDFLRREVMGDPNARVILHGVGYGGSLATWARQR 199
Query: 215 YPHIAIGALASSAPI---LQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWG---ELV 268
+P+I GA SSAP+ FE+ NI+ +R S C+N I +++ L+
Sbjct: 200 FPNIIDGAWGSSAPVRATTNFEEFAV--EVGNIIR---ERGSDQCYNRIFQAFHTAENLI 254
Query: 269 SVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMV 310
G+ E +++ F+ C + D + LE + AM+
Sbjct: 255 DAGRTE----MISEMFNTCDPV----DTDNPLEVELFFFAMM 288
>gi|258569361|ref|XP_002543484.1| predicted protein [Uncinocarpus reesii 1704]
gi|237903754|gb|EEP78155.1| predicted protein [Uncinocarpus reesii 1704]
Length = 546
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 83/153 (54%), Gaps = 15/153 (9%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV-----NSGFVWDIAPRFGAMLVF 141
T+ RY +N D + P GP+ ++ G EGD + A + ++ + FG + +
Sbjct: 85 TYRNRYWVN-DAYYRPG--GPVIIFDGGEGDAQGLANYYLEDQTSYIVQLLQEFGGVGLV 141
Query: 142 PEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ-----NLSAEASPV 196
EHRYYG+S PY + A+ L YL+ EQAL D F ++ +L+ ++P
Sbjct: 142 WEHRYYGQSNPYPVNDNT--PASQLQYLSNEQALNDLPYFARTFRRRSISYDLTPRSTPW 199
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
V+ GGSY GM AA+ RLK+P AL+SSAP+
Sbjct: 200 VMIGGSYPGMRAAFSRLKHPDTIFAALSSSAPV 232
>gi|156547443|ref|XP_001605073.1| PREDICTED: putative serine protease K12H4.7-like [Nasonia
vitripennis]
Length = 378
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 81/146 (55%), Gaps = 6/146 (4%)
Query: 165 TLSYLTAEQALADFAVFITNLK--QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGA 222
L YL++EQALAD A FI ++ Q L + S ++FGGSY G LAAWMR KYPH+ GA
Sbjct: 19 NLVYLSSEQALADVAYFIQGMQAAQQLP-DTSRWIMFGGSYSGSLAAWMRAKYPHLVHGA 77
Query: 223 LASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK 282
+++S P+L D + ++ IV K S +C +TI + ++ + + G L K
Sbjct: 78 MSASGPLLAQIDF---QQYFIIVEESLKTHSQACVDTIAAAIRQVHIMLRHRIGQQGLEK 134
Query: 283 TFHLCRELNSTEDLADWLESAYSYLA 308
F+ C ++S + + + Y LA
Sbjct: 135 LFNFCDPIDSGKTSQKDIANLYETLA 160
>gi|123457340|ref|XP_001316398.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121899103|gb|EAY04175.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 440
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/175 (32%), Positives = 94/175 (53%), Gaps = 9/175 (5%)
Query: 56 QRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE 115
+R +QQ + F+Q +DH + ++ TF Q+++IN + + GP+ PI L E
Sbjct: 9 RRPITRQQLRGTETNATFDQLIDH-NHSETGTFKQKFVIN-NQYGGPDS--PIILEISGE 64
Query: 116 GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQAL 175
D + F +A F +V +HR+YGES P+ E +N L YL+ EQA+
Sbjct: 65 SDGYYVGGVGDFEETLAKEFNCTVVTLQHRFYGESYPF--EESTTEN---LQYLSVEQAV 119
Query: 176 ADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
D + F+ K+ A+ + +L+GGSY G+L+A+ + K+ GA++SS +L
Sbjct: 120 EDISYFVDYYKKTYKADKNKWLLYGGSYPGLLSAYTKSKFDSKFAGAISSSGVVL 174
>gi|395529078|ref|XP_003766647.1| PREDICTED: thymus-specific serine protease-like [Sarcophilus
harrisii]
Length = 323
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/178 (34%), Positives = 90/178 (50%), Gaps = 16/178 (8%)
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--N 188
+AP +GA+++ EHR+YG S+P G + L +L++ ALAD A +L + N
Sbjct: 93 LAPHWGALVISLEHRFYGHSVPPGGLGL-----EQLRFLSSRHALADVASARVHLSRIYN 147
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSD 248
+SA +SP V FGGSY G LAAW RLK+PH+ A+ASSAP+ D ++ VS
Sbjct: 148 ISA-SSPWVSFGGSYAGSLAAWARLKFPHLIWAAVASSAPVQAQLDF---SSYNRGVSRS 203
Query: 249 FK----RESASCFNTIKESWGEL-VSVGQKENGLLELTKTFHLCRELNSTEDLADWLE 301
S C + ++ EL + + L C L + ED A+ LE
Sbjct: 204 LADPTVGGSLKCRRAVALAFSELDRGLSEGTEARAALQSAVRACGALEAPEDQAELLE 261
>gi|302795496|ref|XP_002979511.1| hypothetical protein SELMODRAFT_419277 [Selaginella moellendorffii]
gi|300152759|gb|EFJ19400.1| hypothetical protein SELMODRAFT_419277 [Selaginella moellendorffii]
Length = 905
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 78/151 (51%), Gaps = 21/151 (13%)
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI----TNL 185
D+A +F A +V EHRYYG S P + YLT EQ+LAD A FI T +
Sbjct: 618 DLARQFKAGVVTLEHRYYGYSFP----------SKDFKYLTVEQSLADHAAFIEYYQTFI 667
Query: 186 KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ + A+ ++ GGSY G L+AW RLKYPH+ +G+ ASSA + + I+ + F +
Sbjct: 668 NKKCNKHANKWIVIGGSYSGALSAWFRLKYPHLVVGSWASSAVV---DAILDYKAFDKQI 724
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENG 276
E A I +LV +G +EN
Sbjct: 725 GISAGLECAQALRKIT----DLVEIGLRENA 751
>gi|320593738|gb|EFX06147.1| serine-type peptidase [Grosmannia clavigera kw1407]
Length = 515
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/204 (31%), Positives = 95/204 (46%), Gaps = 23/204 (11%)
Query: 40 RAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHW 99
+A + KLP + P ++ Q F+Q LDH S A TF+QR+ ++T W
Sbjct: 28 QAEKIFRKLPGI---PAKEAETADGQIE---GTFQQLLDH-SDASKGTFTQRFWLDTHFW 80
Query: 100 VGPNRLGPIFLYCGNEGDIEWFA--VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE 157
GP P+FL+ E D + + G A FG ++V EHRY+G+S P+ +
Sbjct: 81 DGPGS--PVFLFMAGEEDASGYLGYLREGIPGLYAENFGGLVVVIEHRYFGKSQPFDTL- 137
Query: 158 VAYQNATTLSYLTAEQALADFAVFITNLKQNLSA-------EASPVVLFGGSYGGMLAAW 210
A TL +L ++ D F N+ ++ +P VL GGSY G LAAW
Sbjct: 138 ----TAETLRFLDLPNSMKDMTYFAQNVDIEVANGTVLDKPSEAPWVLIGGSYSGALAAW 193
Query: 211 MRLKYPHIAIGALASSAPILQFED 234
++ K P + ASSA + D
Sbjct: 194 IQQKEPGVFFAYHASSAVVETISD 217
>gi|158291878|ref|XP_001688441.1| AGAP003640-PA [Anopheles gambiae str. PEST]
gi|157017499|gb|EDO64163.1| AGAP003640-PA [Anopheles gambiae str. PEST]
Length = 505
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 98/192 (51%), Gaps = 17/192 (8%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
F R++HF + TF +L N D + P GP+F+ G F ++ DIA
Sbjct: 63 FTSRINHFDPQNRDTFEFNFLWN-DEYYRPG--GPLFIVVGGHHRTNPFFIDETHFKDIA 119
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
GA L EHRY+G S+P + +++ N L +L EQ L D +I L++ + +
Sbjct: 120 ALQGAFLATNEHRYFGTSVP--TEDLSSDN---LRFLRTEQTLFDLIEWIDFLRREVMRD 174
Query: 193 -ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI---LQFEDIVPPETFYNIVSSD 248
+ V+L G SYGG LA+W R ++P+I GA SSA + + FE+ E F N +
Sbjct: 175 PNAKVILHGFSYGGALASWARQRFPNIIDGAWVSSATVRATVNFEEFT--EDFGNTIR-- 230
Query: 249 FKRESASCFNTI 260
+ S C+N+I
Sbjct: 231 -IKGSDECYNSI 241
>gi|410931333|ref|XP_003979050.1| PREDICTED: dipeptidyl peptidase 2-like, partial [Takifugu rubripes]
Length = 317
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 52/167 (31%), Positives = 84/167 (50%), Gaps = 13/167 (7%)
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKE 262
YGGML+ +MRL+YP++ GALA+SAPIL + P F F+ +A +
Sbjct: 1 YGGMLSVYMRLRYPNMVAGALAASAPILSTAGLGDPSQF-------FRDVTAVSPVCLSA 53
Query: 263 SWGELVSVGQKENGLLELTKTFHLCRELNSTED---LADWLESAYSYLAMVDYPYPSDFM 319
+ + + LC+ +S ED L L +A++ +AM+DYPY + FM
Sbjct: 54 CLSLSALLSLLLPDYRRIQEKLSLCKPPSSPEDIYQLNGLLRNAFTLMAMLDYPYSTHFM 113
Query: 320 MPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQL 366
+P P++ C+ + A+ +LE + + + YN TG + CF L
Sbjct: 114 GNMPANPVKVACETMLR---ASGLLENLRDTAGIVYNSTGALGCFDL 157
>gi|255938516|ref|XP_002560028.1| Pc14g00330 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211584649|emb|CAP74174.1| Pc14g00330 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 557
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 68/237 (28%), Positives = 111/237 (46%), Gaps = 26/237 (10%)
Query: 12 LYLSPVIT-IVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQR------QQQQ 64
++ S V+ I ++ LS + S+F+R + +L + Q+ Q
Sbjct: 1 MHFSTVVKGIATLTALSGTAAGIVTSQFKRDLKLSAELGIHPDTLLGQKTTVHDVTNSQL 60
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE----- 119
E Y +DH S + + + RY ++ +H+ GPIF++ E E
Sbjct: 61 DETIEAEYVSIPIDH-SNSSVGHYRNRYWVSEEHY---KEDGPIFVFDVGESTAEPAGQT 116
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYG-STEVAYQNATTLSYLTAEQALADF 178
+ + +S F + + FG + + EHRYYG+S+PY S ++ ++ L YL +QALAD
Sbjct: 117 YLSNSSTFFYQLVKEFGGIGIVWEHRYYGDSLPYNVSLDMEPEH---LQYLNNKQALADI 173
Query: 179 AVFITNLKQ------NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
F + +L+ +P V+ GGSY GM AA+ R YP A ASSAP+
Sbjct: 174 PYFAAQFTRQDYSDVDLTPAGTPWVMVGGSYAGMRAAFTRQSYPDTIYAAFASSAPV 230
>gi|317030966|ref|XP_001392567.2| serine peptidase [Aspergillus niger CBS 513.88]
Length = 508
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 91/198 (45%), Gaps = 21/198 (10%)
Query: 49 PHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPI 108
P L P + + YFEQ LDH + + TFSQRY +T++W GP P+
Sbjct: 25 PRLVPKPISRPASSKSAATTGEAYFEQLLDHHN-PEKGTFSQRYWWSTEYWGGPGS--PV 81
Query: 109 FLYCGNEGDI-----EWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA 163
L+ N G++ E + N A ++ EHRY+G+S PY EV NA
Sbjct: 82 VLF--NPGEVSADGYEGYLTNDTLTGVYAQEIQGAVILIEHRYWGDSSPY---EVL--NA 134
Query: 164 TTLSYLTAEQALADFAVFITNLKQNL------SAEASPVVLFGGSYGGMLAAWMRLKYPH 217
TL YLT +Q++ D F +K +A+ +P V+ GGSY G L AW P
Sbjct: 135 ETLQYLTLDQSILDMTYFAETVKLQFDNSSRSNAQNAPWVMVGGSYSGALTAWTESIAPG 194
Query: 218 IAIGALASSAPILQFEDI 235
A+SAP+ D
Sbjct: 195 TFWAYHATSAPVEAIYDF 212
>gi|134077081|emb|CAK45422.1| endoprotease Endo-Pro-Aspergillus niger
gi|350629684|gb|EHA18057.1| S28 peptidase [Aspergillus niger ATCC 1015]
Length = 526
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 91/198 (45%), Gaps = 21/198 (10%)
Query: 49 PHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPI 108
P L P + + YFEQ LDH + + TFSQRY +T++W GP P+
Sbjct: 25 PRLVPKPISRPASSKSAATTGEAYFEQLLDHHN-PEKGTFSQRYWWSTEYWGGPGS--PV 81
Query: 109 FLYCGNEGDI-----EWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA 163
L+ N G++ E + N A ++ EHRY+G+S PY EV NA
Sbjct: 82 VLF--NPGEVSADGYEGYLTNDTLTGVYAQEIQGAVILIEHRYWGDSSPY---EVL--NA 134
Query: 164 TTLSYLTAEQALADFAVFITNLKQNL------SAEASPVVLFGGSYGGMLAAWMRLKYPH 217
TL YLT +Q++ D F +K +A+ +P V+ GGSY G L AW P
Sbjct: 135 ETLQYLTLDQSILDMTYFAETVKLQFDNSSRSNAQNAPWVMVGGSYSGALTAWTESIAPG 194
Query: 218 IAIGALASSAPILQFEDI 235
A+SAP+ D
Sbjct: 195 TFWAYHATSAPVEAIYDF 212
>gi|310795241|gb|EFQ30702.1| serine carboxypeptidase S28 [Glomerella graminicola M1.001]
Length = 488
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 80/171 (46%), Gaps = 17/171 (9%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE---GDIEWFAVNSGFVW 129
F+Q LDH + + TF QRY N +HW GP P+F+ G E G + + N
Sbjct: 28 FDQFLDHTNTSK-GTFKQRYWWNAEHWGGPGF--PVFMVNGGETNAGRLTGYLENGTLAS 84
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT------ 183
A ++ EHRYYGES P+ ++ A TL YL QA+ D F
Sbjct: 85 LYAETHKGAIILIEHRYYGESWPFKTS-----TADTLQYLDVPQAIRDNIHFAQTADLPF 139
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+ + +A SP VL GGSY G LAAW + P ASSA + ED
Sbjct: 140 DTNKGANANTSPWVLVGGSYAGALAAWTSVVAPGTFAAYHASSAVVQAIED 190
>gi|297838667|ref|XP_002887215.1| hypothetical protein ARALYDRAFT_894677 [Arabidopsis lyrata subsp.
lyrata]
gi|297333056|gb|EFH63474.1| hypothetical protein ARALYDRAFT_894677 [Arabidopsis lyrata subsp.
lyrata]
Length = 274
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 88/191 (46%), Gaps = 54/191 (28%)
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
++S + P + GG+ AW RLKYP+IAIGALA SA LQF+ I+ E
Sbjct: 55 SVSEKLEPTKVHGGN-----PAWFRLKYPYIAIGALAPSAHTLQFDKIIRLEASMR---- 105
Query: 248 DFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYL 307
F+R + R+L+S + WLESA+
Sbjct: 106 PFRR----------------------------------ISRDLHSVTSASRWLESAFIDT 131
Query: 308 AMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSVYYNYTGNVDCFQLD 367
AMV+ P PS+ M PLP YP+ E+ + + S L+R F NY+ + +CF ++
Sbjct: 132 AMVNCPTPSNSMAPLPAYPVEEIIEDGE-----ASKLDRAFAAA----NYSDSENCFDIE 182
Query: 368 D--DPHGLDGW 376
+ DPHG W
Sbjct: 183 NQTDPHGFIQW 193
>gi|119482365|ref|XP_001261211.1| extracelular serine carboxypeptidase, putative [Neosartorya
fischeri NRRL 181]
gi|119409365|gb|EAW19314.1| extracelular serine carboxypeptidase, putative [Neosartorya
fischeri NRRL 181]
Length = 572
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 56/178 (31%), Positives = 85/178 (47%), Gaps = 24/178 (13%)
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFA 122
+ E+RY DHF+ RY + H+ GP+FL E G +
Sbjct: 66 HFHNESRYEPHTGDHFNL--------RYWFDASHY---KEGGPVFLIAAGETNGRNRFPF 114
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
++ G V +A + + V EHRYYGES P+ A + +L+ EQA+AD+A F
Sbjct: 115 LSHGIVTQLAKTYNGLGVILEHRYYGESYPF-----ANLTTKNIRFLSTEQAMADYAYFA 169
Query: 183 TNL------KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+N+ NL+A+ P + +GGSY G A++R YP + G ++SS ED
Sbjct: 170 SNVVFPGLEHLNLTADTVPWIGYGGSYAGAFVAFLRKVYPEVFFGVVSSSGVTEAIED 227
>gi|402588746|gb|EJW82679.1| hypothetical protein WUBG_06411 [Wuchereria bancrofti]
Length = 396
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 57/163 (34%), Positives = 81/163 (49%), Gaps = 12/163 (7%)
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+W A ++GA + EHR++G S P+ + + YLT QALAD FI + +
Sbjct: 6 IW--AKKYGAACFYLEHRFFGASQPFED-----HSMESYKYLTVNQALADIKNFIVQMNE 58
Query: 188 N--LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
L E +LFGGSYGG LAAW R + I A+ SSA ++Q E V + +
Sbjct: 59 MFFLDIEKPRWILFGGSYGGALAAWFREMNEELTIAAIVSSA-VVQAE--VDYYDYTKNL 115
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR 288
K E+A C TI+ S L+ +G EL K F++C
Sbjct: 116 EYVLKEENAPCAETIRLSIKALIEKTYTVDGRAELGKVFNMCE 158
>gi|71986744|ref|NP_500596.2| Protein F19C7.4 [Caenorhabditis elegans]
gi|373219711|emb|CCD69715.1| Protein F19C7.4 [Caenorhabditis elegans]
Length = 542
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 97/363 (26%), Positives = 154/363 (42%), Gaps = 47/363 (12%)
Query: 14 LSPVITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYF 73
+S V+ +V ISI + + P R PR LT P + Y
Sbjct: 1 MSRVLCLVAISIGLTICHVSPPMVVGR-PR----EGLLTGDPAEGPTTDAKYMIYSN--I 53
Query: 74 EQRLDHFSFA-DLPTFSQRYLINTDHWVGPNRL-GPIFLYCGNEGDI------EWFAVNS 125
Q++DHFS ++ + Q Y N W N+ G +FL G E I W
Sbjct: 54 TQKVDHFSNGTNIGVWQQHYQYN---WKFYNKTTGYVFLMIGGESSINKTNGDRWIRHEG 110
Query: 126 GFVWDIAPRFGAMLVFPEHRYYG--ESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
+ F A EHR+YG E P G Q ++ LT +QALAD FIT
Sbjct: 111 ETMMKWVAEFQAAAFQVEHRFYGSKEYSPIGD-----QTTASMKLLTIDQALADIKEFIT 165
Query: 184 NLKQNLSAEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED----IVPP 238
+ + P+ V FGGSY G L+A+ R YP + GA++SS+ + F D +
Sbjct: 166 QINALYFKDDKPIWVTFGGSYPGSLSAFFRETYPEMTAGAVSSSSAVHVFVDYYGYAINT 225
Query: 239 ETFYNIVSSDFKRESASCFNTIKESWGELVSV---GQKENGLLELTKTFHLC---RELNS 292
E Y VS SC + IK ++ +L++ G LL+ + F+LC E N
Sbjct: 226 EKTYRTVSD-------SCGDVIKVAFQKLITKAYNGSDSRALLK--QQFNLCDSFDETNL 276
Query: 293 TEDLADWLESAYSYLAMVD-YPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGV 351
++ + + ++ Y Y +++ Y + G + C ++NA + +R+ +
Sbjct: 277 SKAVQFFFQNVYGYFQIINQYTGDNKSNATRSGLGVPAACDLLNNATIGDEV-QRVIAVM 335
Query: 352 SVY 354
++Y
Sbjct: 336 NLY 338
>gi|157133204|ref|XP_001662799.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108870921|gb|EAT35146.1| AAEL012664-PA [Aedes aegypti]
Length = 493
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 70/233 (30%), Positives = 105/233 (45%), Gaps = 20/233 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E R F+QR H + + F RY+ N+ + P GPIFL+ G ++E V G
Sbjct: 54 EWRVFDQRQSHSNAHSVDMFPMRYVSNSKFYR-PG--GPIFLFVGGPWELEQHFVEQGHF 110
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-Q 187
D+A A +V E RYYGES+P V + L L QA D A I +++ +
Sbjct: 111 VDLAEENNAFVVANEMRYYGESLP-----VPNASRGNLRLLHIVQACTDIARLIVHIRYE 165
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
L + V++ G + G LA W RL+YPH+ G AS A + E+ F V
Sbjct: 166 VLRDPNARVIVAGVGFSGSLAHWTRLRYPHLIHGVWASGAMLQANENY---REFAEEVGE 222
Query: 248 DFKRESAS-CFNTIKESWG---ELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
+R + C+ + + L+ GQ + + K F +C +N T L
Sbjct: 223 YIRRYGGNDCYGALWRGFRTAENLIDAGQSQT----VDKLFKVCTPINGTNPL 271
>gi|358369966|dbj|GAA86579.1| extracelular serine carboxypeptidase [Aspergillus kawachii IFO
4308]
Length = 569
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 67/223 (30%), Positives = 103/223 (46%), Gaps = 38/223 (17%)
Query: 25 ILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQ----QQQYRYETRYFEQRLDHF 80
I S ++L + P + R PHL + Q + Q Q+ + +DHF
Sbjct: 12 IASTVALDSSPLELRT--------PHLHQLSQLSKSHQASPNQESASFPVYNLSVPIDHF 63
Query: 81 SFADL------PTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWDIA 132
TF RY ++T H+ GP+F+ E G ++ G V +A
Sbjct: 64 HNESRYEPHTNATFGLRYWLDTSHY---QPGGPVFVIAAGETDGSDRIPFLSQGVVTQLA 120
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT--LSYLTAEQALADFAVFITNL----- 185
+ + + EHRYYGES P+ N TT + +LT EQALAD+A F +N+
Sbjct: 121 AAYNGVALILEHRYYGESYPFA-------NLTTENIRFLTTEQALADYAYFASNIVFPGL 173
Query: 186 -KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
+L+A +P + +GGSY G A++R YP + GA++SS
Sbjct: 174 EHLDLTAATTPWIAYGGSYAGAFVAFLRKLYPDLYWGAVSSSG 216
>gi|212536138|ref|XP_002148225.1| extracelular serine carboxypeptidase, putative [Talaromyces
marneffei ATCC 18224]
gi|210070624|gb|EEA24714.1| extracelular serine carboxypeptidase, putative [Talaromyces
marneffei ATCC 18224]
Length = 561
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 69/224 (30%), Positives = 109/224 (48%), Gaps = 27/224 (12%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
TF+ RY + H+ GP+F+ E G+ + G + +A + V EH
Sbjct: 74 TFALRYWFDASHY---KEGGPVFVLESGETSGEDRLPYLQKGLISQLAQLTNGIAVVLEH 130
Query: 145 RYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAVFITNL------KQNLSAEASPV 196
RYYG S+P STE +L +LT EQALAD A F N+ Q+L++ P
Sbjct: 131 RYYGTSIPTKDFSTE-------SLRFLTTEQALADVAYFAQNIIYPGLEDQSLTSNFVPY 183
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASC 256
+ +GGSY G + A++R+ YP + GA+ASSA E IV ++ + + ++ C
Sbjct: 184 IAYGGSYAGAMVAFLRVTYPDVFFGAIASSAVT---EAIVDYWQYWEPIRRNAPQD---C 237
Query: 257 FNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL 300
T++ G L S+ + +L F L +L+ +D A+ L
Sbjct: 238 VRTVENLTGVLDSLTSNTAAIQDLETLFGL-EDLSYVDDFANVL 280
>gi|350636991|gb|EHA25349.1| hypothetical protein ASPNIDRAFT_56689 [Aspergillus niger ATCC 1015]
Length = 569
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 66/221 (29%), Positives = 104/221 (47%), Gaps = 34/221 (15%)
Query: 25 ILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQ----QQQYRYETRYFEQRLDHF 80
I S ++L + P + R +PHL + Q + Q Q + +DHF
Sbjct: 12 IASTVALDSSPLELR--------VPHLHQLSQLSKSHQASLNQDSASFPVYNLSVPIDHF 63
Query: 81 SFADL------PTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWDIA 132
TF RY ++T H+ GP+F+ E G ++ G V +A
Sbjct: 64 HNESRYEPHTNATFGLRYWLDTSHY---QPGGPVFVIAAGETDGSDRIPFLSQGVVTQLA 120
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL------K 186
+ + + EHRYYGES P+ T + +N + +LT EQALAD+A F +N+
Sbjct: 121 AAYHGIGLILEHRYYGESYPF--TNLTTEN---IRFLTTEQALADYAYFASNIVFPGLED 175
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
+L+A +P + +GGSY G A++R YP + GA++SS
Sbjct: 176 LDLTAATTPWIAYGGSYAGAFVAFLRKLYPELYWGAVSSSG 216
>gi|145246822|ref|XP_001395660.1| extracellular serine carboxypeptidase [Aspergillus niger CBS
513.88]
gi|134080382|emb|CAK46303.1| unnamed protein product [Aspergillus niger]
Length = 569
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 66/221 (29%), Positives = 104/221 (47%), Gaps = 34/221 (15%)
Query: 25 ILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQ----QQQYRYETRYFEQRLDHF 80
I S ++L + P + R +PHL + Q + Q Q + +DHF
Sbjct: 12 IASTVALDSSPLELR--------VPHLHQLSQLSKSHQASLNQDSASFPVYNLSVPIDHF 63
Query: 81 SFADL------PTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWDIA 132
TF RY ++T H+ GP+F+ E G ++ G V +A
Sbjct: 64 HDESRYEPHTNATFGLRYWLDTSHY---QPGGPVFVIAAGETDGSDRIPFLSQGVVTQLA 120
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL------K 186
+ + + EHRYYGES P+ T + +N + +LT EQALAD+A F +N+
Sbjct: 121 AAYHGIGLILEHRYYGESYPF--TNLTTEN---IRFLTTEQALADYAYFASNIVFPGLED 175
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
+L+A +P + +GGSY G A++R YP + GA++SS
Sbjct: 176 LDLTAATTPWIAYGGSYAGAFVAFLRKLYPELYWGAVSSSG 216
>gi|341889106|gb|EGT45041.1| hypothetical protein CAEBREN_04354 [Caenorhabditis brenneri]
Length = 668
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 82/296 (27%), Positives = 131/296 (44%), Gaps = 41/296 (13%)
Query: 113 GNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAE 172
G GD +W + + A FGA EHR+YG S Y + + Q +++ LT +
Sbjct: 10 GTNGD-KWVRHEAETMMTWAAEFGAAAFQVEHRFYG-SKDY--SPIGDQTPSSMKLLTID 65
Query: 173 QALADFAVFITNLKQNLSAEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ 231
QALAD FIT + + P+ V FGGSY G L+AW R YP + GA++SS+ +
Sbjct: 66 QALADIKEFITQMNALYFKDDKPIWVTFGGSYPGSLSAWFRETYPEMTAGAVSSSSAVHV 125
Query: 232 FED----IVPPETFYNIVSSDFKRESASCFNTIKESWGELV--SVGQKENGLLELTKTFH 285
F D + E Y VS SC NTI ++ ++ + K++ +L L + F+
Sbjct: 126 FVDYYGYAINTEKTYRTVSD-------SCANTIGVAFQSMIQKAYAGKDSRIL-LKQQFN 177
Query: 286 LCREL---NSTEDLADWLESAYSYLAMVD-YPYPSDFMMPLPGYPIREVCKKIDNAPDAT 341
LC + N ++ L + ++ Y Y ++ Y + G + C ++N +
Sbjct: 178 LCDDFDENNLSKSLQFFFQNVYGYFQGINQYTGDNRNNATRSGLGVPAACDILNNNKNGD 237
Query: 342 SILERIFEGVSVY-------------YNYTGNV----DCFQLDDDPHGLDGWNWQV 380
I R+ + +++Y NYT + D DDD W WQ
Sbjct: 238 EI-NRVVQVMNLYDSWYPPSATGCRPNNYTSFIQYYSDTTMPDDDRISTRSWIWQT 292
>gi|389645554|ref|XP_003720409.1| hypothetical protein MGG_15423 [Magnaporthe oryzae 70-15]
gi|351640178|gb|EHA48042.1| hypothetical protein MGG_15423 [Magnaporthe oryzae 70-15]
Length = 564
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 81/299 (27%), Positives = 129/299 (43%), Gaps = 40/299 (13%)
Query: 21 VIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHF 80
+++++ + AA+PS PRF Q + + RY R +DHF
Sbjct: 12 MVVAVAGQVCSAARPSFM--TPRFYAH-------QQANEVVAAAEGRYPARNISVPVDHF 62
Query: 81 SF------ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWDIA 132
TF RY + H+V GP+ + G E G + G ++ +A
Sbjct: 63 HNDTSYEPHSNDTFELRYWFDASHYVNG---GPVIVLLGGETSGAERLPFMEKGILYRLA 119
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL------K 186
M V EHRYYG S P + + +N L +LT +QALAD A F N+
Sbjct: 120 RATRGMAVVLEHRYYGASFP--TPNLTTEN---LRFLTTDQALADTAYFAKNVVFHGYEN 174
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+NL++ +P +GGSY G AA++R YP + GA++SS L D ++
Sbjct: 175 RNLTSHTTPYFAYGGSYAGAFAAFVRKLYPDVFWGAISSSGVPLAVID------YWEYCE 228
Query: 247 SDFKRESASCFNTIKESWGELVSVGQ--KENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ K + C + ++ L ++ Q K + +L + F L L + D A+ L S
Sbjct: 229 AQRKFAPSECVDVTQKLTNVLDTIAQDGKFEDMKKLKEVFGLS-NLTNRHDFANVLSSG 286
>gi|440471426|gb|ELQ40436.1| hypothetical protein OOU_Y34scaffold00435g4 [Magnaporthe oryzae
Y34]
gi|440488251|gb|ELQ67984.1| hypothetical protein OOW_P131scaffold00272g5 [Magnaporthe oryzae
P131]
Length = 400
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 81/299 (27%), Positives = 129/299 (43%), Gaps = 40/299 (13%)
Query: 21 VIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHF 80
+++++ + AA+PS PRF Q + + RY R +DHF
Sbjct: 8 MVVAVAGQVCSAARPSFM--TPRFYAH-------QQANEVVAAAEGRYPARNISVPVDHF 58
Query: 81 SF------ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWDIA 132
TF RY + H+V GP+ + G E G + G ++ +A
Sbjct: 59 HNDTSYEPHSNDTFELRYWFDASHYVNG---GPVIVLLGGETSGAERLPFMEKGILYRLA 115
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL------K 186
M V EHRYYG S P + + +N L +LT +QALAD A F N+
Sbjct: 116 RATRGMAVVLEHRYYGASFP--TPNLTTEN---LRFLTTDQALADTAYFAKNVVFHGYEN 170
Query: 187 QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVS 246
+NL++ +P +GGSY G AA++R YP + GA++SS L D ++
Sbjct: 171 RNLTSHTTPYFAYGGSYAGAFAAFVRKLYPDVFWGAISSSGVPLAVID------YWEYCE 224
Query: 247 SDFKRESASCFNTIKESWGELVSVGQ--KENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ K + C + ++ L ++ Q K + +L + F L L + D A+ L S
Sbjct: 225 AQRKFAPSECVDVTQKLTNVLDTIAQDGKFEDMKKLKEVFGL-SNLTNRHDFANVLSSG 282
>gi|171676507|ref|XP_001903206.1| hypothetical protein [Podospora anserina S mat+]
gi|170936320|emb|CAP60978.1| unnamed protein product [Podospora anserina S mat+]
Length = 598
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 72/228 (31%), Positives = 108/228 (47%), Gaps = 23/228 (10%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW---FAVNS 125
E+R F Q +DH S L TFSQRY +T +W GP P+ ++ E E+ F N
Sbjct: 89 ESR-FSQLIDH-SNPSLGTFSQRYWWDTTYWDGPGS--PVVVFSPGEASAEYYSGFLTNQ 144
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
V A GA ++ EHRY+G+S P+ ++ + L+YLT ++ADFA F +
Sbjct: 145 TIVGLYAQAIGAAILLIEHRYWGDSSPF-----SHLSTVNLTYLTLNDSVADFAHFARQV 199
Query: 186 K------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAP---ILQFEDIV 236
+ +A +P + GGSY G LAAW+ P ASSAP I F +
Sbjct: 200 QLPFDESGRSNAPKAPWIFVGGSYSGNLAAWLDHLSPGTFWAYHASSAPVQAIRHFWEYF 259
Query: 237 PP--ETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK 282
P E S DF++ +A ++ E V ++ GL ++ +
Sbjct: 260 TPIWEGMPRNCSKDFEKITAHIDQVLEHGSEEEVKSLKEGFGLGDIKE 307
>gi|258566732|ref|XP_002584110.1| predicted protein [Uncinocarpus reesii 1704]
gi|237905556|gb|EEP79957.1| predicted protein [Uncinocarpus reesii 1704]
Length = 552
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 79/153 (51%), Gaps = 16/153 (10%)
Query: 88 FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA----VN-SGFVWDIAPRFGAMLVFP 142
+ R+ IN H+ GP+F++ G E + + +A VN + F + F M +
Sbjct: 83 YKNRFWINDSHY---KSGGPVFVFDGGEANAQRYADYYLVNETSFFVHLLEEFKGMGIVW 139
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ------NLSAEASPV 196
EHRYYGES+P+ A YL EQALAD F K+ +L+ +++P
Sbjct: 140 EHRYYGESLPFPVNLDT--PAEHFQYLNNEQALADIPYFAERFKRENFPNDDLTPKSTPW 197
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
V+ GGSY GM AA+ R KYP A A+SAP+
Sbjct: 198 VMLGGSYPGMRAAFTRDKYPETIFAAFAASAPV 230
>gi|392898873|ref|NP_500595.2| Protein F19C7.2 [Caenorhabditis elegans]
gi|373219710|emb|CCD69714.1| Protein F19C7.2 [Caenorhabditis elegans]
Length = 540
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 91/344 (26%), Positives = 142/344 (41%), Gaps = 55/344 (15%)
Query: 75 QRLDHFSFA-DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI------EWFAVNSGF 127
Q++DHFS + + QRY N+ + G +FL G EG I +W
Sbjct: 55 QKVDHFSNGTNNGVWQQRYQYNSKFY--NKTTGYVFLMLGGEGSINVTNGDKWVRHEGET 112
Query: 128 VWDIAPRFGAMLVFPEHRYYG--ESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
+ F A EHR+YG E P G Q ++ LT +QALAD FIT +
Sbjct: 113 MMKWVAEFQAAAFQVEHRFYGSKEYSPIGD-----QTTASMKLLTIDQALADIKEFITQM 167
Query: 186 KQNLSAEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED----IVPPET 240
+ P+ V FGGSY G L+A+ R YP + GA++SS+ + F D + E
Sbjct: 168 NALYFKDDKPIWVTFGGSYPGSLSAFFRETYPEMTAGAVSSSSAVHVFVDYYEYAINTEK 227
Query: 241 FYNIVSSDFKRESASCFNTIKESWGELVS---VGQKENGLLELTKTFHLCRELNST---E 294
Y VS SC + IK ++ L++ G LL+ + F+LC + T +
Sbjct: 228 TYRTVSD-------SCGDVIKVAFQNLITKAYSGPDSRALLK--QRFNLCDNFDETKLSK 278
Query: 295 DLADWLESAYSYLAMVD-YPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEGVSV 353
+ + ++ Y Y ++ Y + G + C +++ I +R+ +++
Sbjct: 279 SVQFFFQNVYGYFQGINQYTGDNRNNATRSGLGVPAACNLLNDKTIGDEI-QRVIAVMNL 337
Query: 354 Y-------------YNYTGNV----DCFQLDDDPHGLDGWNWQV 380
Y NYT + D DDD W WQ
Sbjct: 338 YDSWYKPSDSGCRPNNYTAFIQAYSDTTMPDDDTISTRSWIWQT 381
>gi|406697786|gb|EKD01038.1| hypothetical protein A1Q2_04659 [Trichosporon asahii var. asahii
CBS 8904]
Length = 643
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 114/243 (46%), Gaps = 39/243 (16%)
Query: 46 GKLPHLTEPPQRQQRQQQQQYRYE---TRY----FEQRLDHFSFADLPTFSQRYLINTDH 98
GK + + + ++++ QY + TR+ F+Q++ HF + TF QRY +++ +
Sbjct: 81 GKEGDDKDAAEEKPKKEKDQYGADITKTRFGPFCFDQKVSHFDEGEKRTFCQRYWVDSQY 140
Query: 99 WVGPNRLGPIFLYCGNEGDIEWFAVNS------GFVWDIAPRFGAMLVFPEHRYYGESMP 152
+V GP+F+ G E D A +S G + ++ G M V EHRYYG S
Sbjct: 141 YV---EGGPVFILDGGETDGANRADDSIPFMEQGILQILSNATGGMSVILEHRYYGAS-- 195
Query: 153 YGSTEVAYQNATTLSYLTAEQALADFAVFITNL------------KQNLSAEASPVVLFG 200
Y + +++ N L +L +++L D A FI K + +P + +G
Sbjct: 196 YVTEDLSTDN---LRWLNNKESLEDSAEFIRKFPVPKDVQKKLKNKDVFKPDCTPYIYYG 252
Query: 201 GSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTI 260
GSY G AA+MR +YP + G + SS + + PE F I++ +C +
Sbjct: 253 GSYAGARAAFMRKEYPDLVFGGIGSSG--VTHAQVYYPEYFDPIITYG----EPACIKAM 306
Query: 261 KES 263
+ES
Sbjct: 307 EES 309
>gi|401886138|gb|EJT50197.1| hypothetical protein A1Q1_00498 [Trichosporon asahii var. asahii
CBS 2479]
Length = 643
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 67/243 (27%), Positives = 114/243 (46%), Gaps = 39/243 (16%)
Query: 46 GKLPHLTEPPQRQQRQQQQQYRYE---TRY----FEQRLDHFSFADLPTFSQRYLINTDH 98
GK + + + ++++ QY + TR+ F+Q++ HF + TF QRY +++ +
Sbjct: 81 GKEGDDKDAAEEKPKKEKDQYGADITKTRFGPFCFDQKVSHFDEGEKRTFCQRYWVDSQY 140
Query: 99 WVGPNRLGPIFLYCGNEGDIEWFAVNS------GFVWDIAPRFGAMLVFPEHRYYGESMP 152
+V GP+F+ G E D A +S G + ++ G M V EHRYYG S
Sbjct: 141 YV---EGGPVFILDGGETDGANRADDSIPFMEQGILQILSNATGGMSVILEHRYYGAS-- 195
Query: 153 YGSTEVAYQNATTLSYLTAEQALADFAVFITNL------------KQNLSAEASPVVLFG 200
Y + +++ N L +L +++L D A FI K + +P + +G
Sbjct: 196 YVTEDLSTDN---LRWLNNKESLEDSAEFIRKFPVPKDVQKKLKNKDVFKPDCTPYIYYG 252
Query: 201 GSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTI 260
GSY G AA+MR +YP + G + SS + + PE F I++ +C +
Sbjct: 253 GSYAGARAAFMRKEYPDLVFGGIGSSG--VTHAQVYYPEYFDPIITYG----EPACIKAM 306
Query: 261 KES 263
+ES
Sbjct: 307 EES 309
>gi|242814920|ref|XP_002486468.1| serine peptidase, family S28, putative [Talaromyces stipitatus ATCC
10500]
gi|218714807|gb|EED14230.1| serine peptidase, family S28, putative [Talaromyces stipitatus ATCC
10500]
Length = 608
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 59/178 (33%), Positives = 88/178 (49%), Gaps = 19/178 (10%)
Query: 70 TRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI----EWFAVNS 125
T Y + +DH + + T+ RY + T ++ GP+FLY E + S
Sbjct: 68 TEYADIPIDHDNHT-VGTYRNRYWVTTKYY---RSGGPVFLYDVGESSAYSSAQHMLGES 123
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQN--ATTLSYLTAEQALADFAVFIT 183
F+ + FG + + EHRYYGES+P G + +N A +LT EQA+AD F
Sbjct: 124 SFLREFLQEFGGVGIVWEHRYYGESLPMG---LVNENTPAENFKFLTHEQAIADIPYFAQ 180
Query: 184 NLK------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
+ Q+LS + +P ++ GGSY GM A+ R +YP A ASSAP+ D+
Sbjct: 181 DFHRPELPFQDLSPKGTPWIMMGGSYSGMRTAFTRNEYPDTIYAAYASSAPVQARADM 238
>gi|90018240|gb|ABD83933.1| hypothetical protein [Ictalurus punctatus]
Length = 138
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 78/133 (58%), Gaps = 4/133 (3%)
Query: 216 PHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKEN 275
P+I GALA+SAPI+ + F+ V++DF+ + +C + +K ++ +L ++ Q+++
Sbjct: 1 PNIVAGALAASAPIVSTAGLGDSGQFFRDVTADFQNYNPACKDAVKAAFQKLQTLAQQQD 60
Query: 276 GLLELTKTFHLCRELNSTEDLAD---WLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCK 332
+ F LC+ +S +DL +L +A++ LAM+DYPY + FM +P +P++ C+
Sbjct: 61 -YARIQSAFSLCKTPSSNKDLHQLNGFLRNAFTLLAMMDYPYATIFMSKMPAFPVKVACE 119
Query: 333 KIDNAPDATSILE 345
+ N + S L
Sbjct: 120 VMLNGTEVLSALR 132
>gi|242039113|ref|XP_002466951.1| hypothetical protein SORBIDRAFT_01g017300 [Sorghum bicolor]
gi|241920805|gb|EER93949.1| hypothetical protein SORBIDRAFT_01g017300 [Sorghum bicolor]
Length = 401
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 45/107 (42%), Positives = 68/107 (63%), Gaps = 12/107 (11%)
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLS 190
+A +FGA +V PEHRYYG+S P+ ++ +N L +L+++QAL D AVF +++L+
Sbjct: 18 LAKKFGAAVVTPEHRYYGKSSPF--KQLTTEN---LRFLSSKQALFDLAVFRQYYQESLN 72
Query: 191 AE-------ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPIL 230
A +P + G SY G L+AW RLK+PH+ G+LASS +L
Sbjct: 73 ARYNRSSGFDNPWFVIGISYAGALSAWFRLKFPHLTCGSLASSGVVL 119
>gi|380487520|emb|CCF37988.1| serine carboxypeptidase S28 [Colletotrichum higginsianum]
Length = 516
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/175 (36%), Positives = 81/175 (46%), Gaps = 20/175 (11%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA--VNSGFVWD 130
F+Q LDH + TF QRY N +HW GP P+FL G E D F + +G V
Sbjct: 51 FDQLLDH-TQPWRGTFKQRYWWNAEHWGGPGY--PVFLINGGESDAAGFTGYLENGTVTG 107
Query: 131 I-APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN----- 184
+ A ++ EHRYYGES PY ++ A TL L QA+ D F
Sbjct: 108 LYAETHKGAVILIEHRYYGESWPYKTS-----TADTLQLLEVPQAIYDNIYFAETAALPF 162
Query: 185 ----LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
+ +A+ SP VL GGSY G LAAW + P ASSA + ED
Sbjct: 163 DQGTTDKGANADKSPWVLIGGSYAGALAAWTSVIAPGTFAAYHASSAVVQAIEDF 217
>gi|212545206|ref|XP_002152757.1| serine peptidase, family S28, putative [Talaromyces marneffei ATCC
18224]
gi|210065726|gb|EEA19820.1| serine peptidase, family S28, putative [Talaromyces marneffei ATCC
18224]
Length = 608
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 58/173 (33%), Positives = 82/173 (47%), Gaps = 23/173 (13%)
Query: 82 FADLP---------TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI----EWFAVNSGFV 128
+AD+P T+ RY + T ++ GP+FLY E + + F
Sbjct: 70 YADIPIDHDNHTIGTYKNRYWVTTKYY---KPGGPVFLYDVGESSAYNSAQHMLGEAAFF 126
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-- 186
+ FG + + EHRYYGES+P G A A YLT QA+AD F +
Sbjct: 127 KEFLEEFGGLGIVWEHRYYGESLPMGPIN-ADTPAENFKYLTHTQAIADIPYFAQDFSRP 185
Query: 187 ----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
Q+LS + +P ++ GGSY GM AA+ R +YP A ASSAP+ D+
Sbjct: 186 ELPSQDLSPKGTPWIMIGGSYSGMRAAFTRDEYPQSIYAAYASSAPVQARADM 238
>gi|328723991|ref|XP_003248003.1| PREDICTED: putative serine protease K12H4.7-like [Acyrthosiphon
pisum]
Length = 253
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 72/134 (53%), Gaps = 10/134 (7%)
Query: 106 GPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT 165
GP+FL G I + SG + A F A EHRYYG S P N +
Sbjct: 10 GPVFLLVGGSEKILHSWMISGAWIEYAQIFNAACFQLEHRYYGMSHPTDDL-----NTSN 64
Query: 166 LSYLTAEQALADFAVFIT--NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGAL 223
L YL+ EQ LAD A+FI ++++N ++ V FG SY G L AW+ LKYPH+ A+
Sbjct: 65 LVYLSTEQVLADLAIFINTISIEKNQLLGSAKWVGFGSSYSGSLVAWLILKYPHLVYAAV 124
Query: 224 ASSAPI---LQFED 234
+SS+P+ + FE+
Sbjct: 125 SSSSPLTAKIHFEE 138
>gi|398408866|ref|XP_003855898.1| serine carboxypeptidase [Zymoseptoria tritici IPO323]
gi|339475783|gb|EGP90874.1| serine carboxypeptidase [Zymoseptoria tritici IPO323]
Length = 538
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 87/322 (27%), Positives = 136/322 (42%), Gaps = 40/322 (12%)
Query: 29 LSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQ-QYRYETRYFEQRLDHFSFADLPT 87
++L S +RR+ + PHL Q R ++ FEQ +DH +L T
Sbjct: 8 ITLINLTSAYRRSTS--NRHPHLLHRSLLQARSANDVASKFANFTFEQYIDH-DQPELGT 64
Query: 88 FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE----WFAVNSGFVWDIAPRFGAMLVFPE 143
F QRY+++T +W G P+ L+ EG IE +F + G +A GA V E
Sbjct: 65 FPQRYVVDTTYWNGTGS--PVILWIWGEGPIEDGLIYFNKSLGTAGLLASEIGAAQVILE 122
Query: 144 HRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEAS----PVVLF 199
HR++GES+ + E QN L YLT++ A+ D F +++ + S P +
Sbjct: 123 HRFFGESVVF--DEWTTQN---LQYLTSDNAIRDAIRFAKSVQLHFSNVTGLGDVPWIAT 177
Query: 200 GGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPE-TFYNIVSSDFKRESASCFN 258
G SYGG L W+ +P ASSA + ++VP FY I F++
Sbjct: 178 GESYGGALVTWLAQLHPDTFWAYYASSATV----EVVPDNFGFYVIGEEVFRQNCTKDLQ 233
Query: 259 TIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLAMVDYPYPSDF 318
+ E++ G + H + L E L D ++ ++ + P S F
Sbjct: 234 LVAAHIDEILVNGSADQ--------IHDIKALFGMETLKDDVD----FVTALGRPTASYF 281
Query: 319 MMPLPGYPIREVCKKIDNAPDA 340
PG C +I+ DA
Sbjct: 282 PGDNPG----GFCDRIEGGLDA 299
>gi|219121656|ref|XP_002181178.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407164|gb|EEC47101.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 544
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 69/214 (32%), Positives = 101/214 (47%), Gaps = 27/214 (12%)
Query: 72 YFEQRLDHFS-FADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVW- 129
Y EQ LDHF+ D ++QRY + ++ GP PIF+ G E AVN G ++
Sbjct: 94 YKEQTLDHFTPNKDEAPWAQRYYQDDKYFAGPGH--PIFVIMGGED-----AVN-GILYP 145
Query: 130 ----DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
+A RF A + EHR+YG+S P A L+ QALAD FI
Sbjct: 146 FVSKHLAKRFRAHTLCLEHRFYGKSKPLKHP----STADLRRLLSPAQALADAVQFIEYK 201
Query: 186 KQNL------SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPE 239
++ L + PVV GGSY G L+A +R+ YP + ASSAP+ + V
Sbjct: 202 RKQLGCGNKGTKSYCPVVTVGGSYPGFLSALLRIVYPDVVDIGYASSAPLHLYSHRVNKA 261
Query: 240 TFYNIVSSDFKRESASCFNTIKESWGELVSVGQK 273
++ V+ ++ S C +K + L+ V +K
Sbjct: 262 AYFEKVTQVAEQASRGCAGAVKNA---LMDVTEK 292
>gi|71003550|ref|XP_756441.1| hypothetical protein UM00294.1 [Ustilago maydis 521]
gi|46096046|gb|EAK81279.1| hypothetical protein UM00294.1 [Ustilago maydis 521]
Length = 583
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 60/189 (31%), Positives = 96/189 (50%), Gaps = 15/189 (7%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHW---VGPNRLGPIFLYCGNEGDIEWFA--- 122
E Y Q LDHF F QR+ +T H+ N+ + +Y + G+ + A
Sbjct: 136 EPAYHRQPLDHFDNTTQAQFDQRFFYSTRHYKPASARNKGEAVPIYILDSGEADATARIP 195
Query: 123 -VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA---TTLSYLTAEQALADF 178
+++G + ++ G + + EHRYYG S+P T++ + L +LT +QAL D
Sbjct: 196 FLDTGILDILSKATGGIGIVLEHRYYGTSLP-NRTDLGPGDTWGVDQLRWLTNKQALEDS 254
Query: 179 AVFITNLK--QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
A FI +L ++E ++ +GGSY G +A MRL YP + GA+ASSA + ++
Sbjct: 255 ADFIRHLSIPGTDNSEKRKIIYYGGSYPGARSAHMRLLYPELVHGAIASSAVVTAVDEF- 313
Query: 237 PPETFYNIV 245
PE FY +
Sbjct: 314 -PEYFYPVA 321
>gi|358370554|dbj|GAA87165.1| serine peptidase, family S28 [Aspergillus kawachii IFO 4308]
Length = 562
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 77/253 (30%), Positives = 113/253 (44%), Gaps = 24/253 (9%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE-----WFAV 123
ET Y LDH + A T+ R+ + +D + P PIF+Y E D E +
Sbjct: 68 ETEYVTIPLDHDN-ASAGTYQNRFWV-SDEFYEPGS--PIFVYDTGEADGESIASAYLTS 123
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA-TTLSYLTAEQALADFAVFI 182
F + F AM + EHRYYG S P ++Y++ T YLT +QALAD F
Sbjct: 124 TLSFFREFLIEFNAMGIAWEHRYYGNSTP---APISYESPPETYQYLTTKQALADLPYFA 180
Query: 183 TNLKQ------NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
+N + +L+ + +P V+ GGSY G+ AA R +YP A +SSAP+ E V
Sbjct: 181 SNFSREKYPDVDLTPQGTPWVMVGGSYAGIRAALTRNEYPETIFAAYSSSAPV---EARV 237
Query: 237 PPETFYNIVSSDFKRES-ASCFNTIKESWGELVSVGQKENGLLELTKTFHL-CRELNSTE 294
+Y+ V + A+C I + + E+ + + F E NS
Sbjct: 238 NMSVYYDQVYRGMVADGWANCSADIHAALEYIDDQLSNEDTATSIKQLFFGPGAEANSNG 297
Query: 295 DLADWLESAYSYL 307
D L + YSY
Sbjct: 298 DFTGALTAIYSYF 310
>gi|195451237|ref|XP_002072827.1| GK13809 [Drosophila willistoni]
gi|194168912|gb|EDW83813.1| GK13809 [Drosophila willistoni]
Length = 340
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 79/143 (55%), Gaps = 15/143 (10%)
Query: 20 IVIISILSPLSLAA------QPSKFRRAP----RFVGKLPHLTEPPQRQQRQQQQQYRYE 69
++ ++IL+PL+ AA + +AP FV L L PP R+ ++ E
Sbjct: 9 LIALTILAPLTAAASVGTDLKADGQSKAPPIKSAFVESLEQLHRPPPREPALKRANV--E 66
Query: 70 TRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVW 129
TR+F Q LD+F ++ T+ RY+IN D++V PIF+Y G E I+ ++SG
Sbjct: 67 TRWFTQYLDNFDASNNATWQNRYMINEDYYV---EGSPIFVYLGGEWAIDASGISSGLWV 123
Query: 130 DIAPRFGAMLVFPEHRYYGESMP 152
DIA + L++ EHR++GES+P
Sbjct: 124 DIAKQHNGSLLYTEHRFFGESIP 146
>gi|169775711|ref|XP_001822322.1| extracellular serine carboxypeptidase [Aspergillus oryzae RIB40]
gi|238502353|ref|XP_002382410.1| extracellular serine carboxypeptidase, putative [Aspergillus flavus
NRRL3357]
gi|83771057|dbj|BAE61189.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220691220|gb|EED47568.1| extracellular serine carboxypeptidase, putative [Aspergillus flavus
NRRL3357]
Length = 573
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 89/187 (47%), Gaps = 22/187 (11%)
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSFADL------PTFSQRYLINTDHWVGPNRLGPI 108
P+ +QQ Y Y +DHF F RY + ++ GP+
Sbjct: 41 PESAVKQQSTTAEYPEYYLSVPVDHFHNDSRYAPHSDDHFDLRYWFDAQYY---KEGGPV 97
Query: 109 FLYCGNEGDI--EWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL 166
F+ E D + ++ G V ++A + + V EHRYYG+S P VA +
Sbjct: 98 FVIAAGETDATDRFPFLSQGIVTELASAYNGIGVILEHRYYGKSYP-----VANLTTENI 152
Query: 167 SYLTAEQALADFAVFITNL------KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAI 220
+L+ +QALAD+A F +N+ NL+++ +P + +GGSY G A++R YP +
Sbjct: 153 RFLSTDQALADYAYFASNVVFPGLEHVNLTSKTTPWIAYGGSYAGAFVAFLRKLYPDVYW 212
Query: 221 GALASSA 227
GA++SS
Sbjct: 213 GAVSSSG 219
>gi|391865279|gb|EIT74569.1| hydrolytic enzymes of the alpha/beta hydrolase fold protein
[Aspergillus oryzae 3.042]
Length = 573
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 89/187 (47%), Gaps = 22/187 (11%)
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSFADL------PTFSQRYLINTDHWVGPNRLGPI 108
P+ +QQ Y Y +DHF F RY + ++ GP+
Sbjct: 41 PESAVKQQSTTAEYPEYYLSVPVDHFHNDSRYAPHSDDHFDLRYWFDAQYY---KEGGPV 97
Query: 109 FLYCGNEGDI--EWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL 166
F+ E D + ++ G V ++A + + V EHRYYG+S P VA +
Sbjct: 98 FVIAAGETDATDRFPFLSQGIVTELASAYNGIGVILEHRYYGKSYP-----VANLTTENI 152
Query: 167 SYLTAEQALADFAVFITNL------KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAI 220
+L+ +QALAD+A F +N+ NL+++ +P + +GGSY G A++R YP +
Sbjct: 153 RFLSTDQALADYAYFASNVVFPGLEHVNLTSKTTPWIAYGGSYAGAFVAFLRKLYPDVYW 212
Query: 221 GALASSA 227
GA++SS
Sbjct: 213 GAVSSSG 219
>gi|358054024|dbj|GAA99823.1| hypothetical protein E5Q_06526 [Mixia osmundae IAM 14324]
Length = 1068
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 58/164 (35%), Positives = 83/164 (50%), Gaps = 20/164 (12%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNEG--DIEWFAV--NSGFVWDIAPRFGAMLVFP 142
TF RY +N DH+ G + ++ EG D + A+ N F++++ ++ +
Sbjct: 616 TFKLRYFVNADHY---KPGGAVLIWNAGEGSADDQTAAIFSNRTFIYNLTQSTNSVGIVL 672
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN-----LKQNLSAEASPVV 197
EHRYYG+S+P S + L YLT EQALAD+ F N L Q ++ +P++
Sbjct: 673 EHRYYGKSIPMPSF-----STDDLQYLTVEQALADWEYFAKNAELPTLPQLITQNKAPLI 727
Query: 198 LFGGSYGGMLAAWMRLKYPHIAIGALASSA---PILQFEDIVPP 238
G SY G LAAW + YP G +ASSA IL F V P
Sbjct: 728 YLGASYSGALAAWQSVVYPTTFWGYIASSAVTVSILDFAAYVNP 771
>gi|121711132|ref|XP_001273182.1| serine peptidase, putative [Aspergillus clavatus NRRL 1]
gi|119401332|gb|EAW11756.1| serine peptidase, putative [Aspergillus clavatus NRRL 1]
Length = 531
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 93/202 (46%), Gaps = 26/202 (12%)
Query: 45 VGKLPHLTEPPQRQQRQQQ--------QQYRYETRYFEQRLDHFSFADLPTFSQRYLINT 96
V +P L PP + ++ +Q R T F+Q +DH + +L TF QR+ ++
Sbjct: 19 VSAIPQLPVPPPFEIQESSLLSGNARFRQVRGNTT-FDQLIDHDN-PELGTFQQRFWWSS 76
Query: 97 DHWVGPNRLGPIFLYCGNEGDIEWFA---VNSGFVWDIAPRFGAMLVFPEHRYYGESMPY 153
+ W GP P+ L+ E D + N A G ++ EHRY+G S PY
Sbjct: 77 EFWKGPGS--PVVLFTPGEADAPGYTGYLTNQTLPGRFAQEIGGAVILLEHRYWGTSSPY 134
Query: 154 GSTEVAYQNATTLSYLTAEQALADFAVFITNL------KQNLSAEASPVVLFGGSYGGML 207
+ N TL YLT EQ++AD F + + +A+ +P VL GGSY G L
Sbjct: 135 TNL-----NTETLQYLTLEQSIADLTHFAKTVDLAFDSNHSSNADKAPWVLTGGSYSGAL 189
Query: 208 AAWMRLKYPHIAIGALASSAPI 229
+AW P +SSAP+
Sbjct: 190 SAWTASTAPGTFWAYHSSSAPV 211
>gi|425770000|gb|EKV08475.1| Serine peptidase, family S28, putative [Penicillium digitatum Pd1]
gi|425771690|gb|EKV10127.1| Serine peptidase, family S28, putative [Penicillium digitatum
PHI26]
Length = 553
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 85/172 (49%), Gaps = 17/172 (9%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE-----WFAV 123
+ Y +DH S + + + RY ++ D++ GP+F+Y E E +
Sbjct: 60 KAEYVSLPIDH-SNSSVGYYQNRYWVSEDNY---KEGGPVFVYDVGEASAESSAQAYLGN 115
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT 183
++ F + + FG + + EHRYYG+S+PY + + L YL EQALAD F
Sbjct: 116 STTFFYQMVQEFGGIGIVWEHRYYGDSLPYNVS--LHMQPEHLLYLNNEQALADIPFFAA 173
Query: 184 NLKQ------NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
N + +L+ +P V+ GGSY GM +A+ R YP + ASSAP+
Sbjct: 174 NFTRRNYSDVDLTPGGTPWVMVGGSYSGMRSAFTRHLYPETIYASYASSAPV 225
>gi|392572923|gb|EIW66066.1| hypothetical protein TREMEDRAFT_45908 [Tremella mesenterica DSM
1558]
Length = 558
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 60/177 (33%), Positives = 81/177 (45%), Gaps = 21/177 (11%)
Query: 64 QQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWF 121
Q Y+ F Q + HF TF QRY I+ + GP+F+ G E G+
Sbjct: 78 QLTHYKAYCFTQPVSHFDDTITDTFCQRYWIDASSY---EEGGPVFVLDGGETSGEDRLP 134
Query: 122 AVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF 181
+ G + ++ + + EHRYYGES P S L +L E+AL D A F
Sbjct: 135 FLKQGILQILSNATNGLSIVLEHRYYGESQPVSSL-----TTDNLRFLNNEEALEDSAEF 189
Query: 182 ITNLK-----QNLSAEA------SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
I N + LS E +P + +GGSY G AA MR+ YP I GA+ SSA
Sbjct: 190 IRNFRIPSDVLKLSDEGILQPDRTPWIYYGGSYAGARAAHMRVGYPDIVYGAIGSSA 246
>gi|32351096|gb|AAP74971.1| thymus specific serine peptidase [Homo sapiens]
Length = 164
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 67/106 (63%), Gaps = 8/106 (7%)
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--N 188
+AP +GA+++ EHR+YG S+P G E+A L +L++ ALAD L + N
Sbjct: 9 LAPAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRLALADVVSARLALSRLFN 63
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+S+ +SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ D
Sbjct: 64 ISS-SSPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD 108
>gi|242794970|ref|XP_002482484.1| extracelular serine carboxypeptidase, putative [Talaromyces
stipitatus ATCC 10500]
gi|218719072|gb|EED18492.1| extracelular serine carboxypeptidase, putative [Talaromyces
stipitatus ATCC 10500]
Length = 460
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 94/186 (50%), Gaps = 22/186 (11%)
Query: 123 VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYG--STEVAYQNATTLSYLTAEQALADFAV 180
+ G V +A + V EHRYYG S+P STE +L +LT EQ LAD A
Sbjct: 8 LQKGLVAQLAQLTNGIAVVLEHRYYGASIPTKDFSTE-------SLRFLTTEQGLADVAY 60
Query: 181 FITNL------KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
F N+ QNL++ P + +GGSY G + A++R+ YP + GA+ASSA E
Sbjct: 61 FAQNIVYPGFEDQNLTSRHVPYIAYGGSYAGAMVAFLRVTYPDVFFGAIASSAVT---EA 117
Query: 235 IVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
IV ++ + + +C +T++ G L ++ + + +L F L ++L+ +
Sbjct: 118 IVDYWQYWEPIR---RNAPQNCIHTVENLTGVLDNLAHNTSAVKDLETLFGL-QDLSHVD 173
Query: 295 DLADWL 300
D A+ L
Sbjct: 174 DFANVL 179
>gi|290997480|ref|XP_002681309.1| predicted protein [Naegleria gruberi]
gi|284094933|gb|EFC48565.1| predicted protein [Naegleria gruberi]
Length = 469
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 58/169 (34%), Positives = 82/169 (48%), Gaps = 15/169 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVG---PNRLGPIFLYCGNEGDIEWFAVNSGFV 128
+F QR+DHF+ + TF QRY V PN L LY E + +V
Sbjct: 36 WFHQRIDHFNALNTDTFPQRYYKFVPEGVSASSPNHL----LYICPEATCG--GTPNNYV 89
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQN 188
+ A A + EHR+YG+S+PY S + +YL E ALAD +VFI +
Sbjct: 90 KNYAMELKATIYTLEHRFYGKSVPYKSMKTV----NMANYLKTEMALADLSVFIEYIATL 145
Query: 189 LSAEASP--VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
S +P ++ G SY G L+A+ +KYPH+ GAL+SS + D
Sbjct: 146 PSDNNTPHQFIIVGCSYPGALSAFFSMKYPHLVKGALSSSGVVNSILDF 194
>gi|391873795|gb|EIT82803.1| hydrolytic enzymes of the alpha/beta hydrolase fold protein
[Aspergillus oryzae 3.042]
Length = 592
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 126/294 (42%), Gaps = 34/294 (11%)
Query: 60 RQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE 119
+ Q + T Y +DH + + T+ R+ +N D++ PI +Y E + E
Sbjct: 59 KANTQIEKVTTEYITIPIDH-NDTSVGTYQNRFWVNDDYYKAGR---PIIMYDAGETNAE 114
Query: 120 WFAVNS-----GFVWDIAPRFGAMLVFPEHRYYGESMPYG-STEVAYQNATTLSYLTAEQ 173
A N F I AM + EHRYYG S P+ S + ++ YLT +Q
Sbjct: 115 SIAKNHLTSSLSFFRKILEDTHAMGIIWEHRYYGNSTPFPISRDTPPEH---FKYLTTKQ 171
Query: 174 ALADFAVFITNL------KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
AL D F N + +L+ ++P VL GGSY G+ AA+ R KYP + A +SSA
Sbjct: 172 ALEDIPYFARNFSRPKFAEHDLTPSSTPWVLVGGSYAGIRAAFARNKYPDVIFAAYSSSA 231
Query: 228 PI-LQFE-DIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFH 285
P+ Q I + + +V F+ +C I + G + + + K F
Sbjct: 232 PVQAQLNMSIYYDQVYRGLVGHGFE----NCAKDIHAALGYIDQQLSNNHTAAAIKKLFF 287
Query: 286 L-CRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP 338
+ NS E L + YSY ++ + P +RE+C+ ++ P
Sbjct: 288 GPGADQNSNEGFTAALATIYSYF--------QNYGLDGPEGTLRELCEHLEVDP 333
>gi|317157591|ref|XP_001825996.2| serine peptidase, family S28 [Aspergillus oryzae RIB40]
Length = 580
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 126/294 (42%), Gaps = 34/294 (11%)
Query: 60 RQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE 119
+ Q + T Y +DH + + T+ R+ +N D++ PI +Y E + E
Sbjct: 59 KANTQIEKVTTEYITIPIDH-NDTSVGTYQNRFWVNDDYYEAGR---PIIMYDAGETNAE 114
Query: 120 WFAVNS-----GFVWDIAPRFGAMLVFPEHRYYGESMPYG-STEVAYQNATTLSYLTAEQ 173
A N F I AM + EHRYYG S P+ S + ++ YLT +Q
Sbjct: 115 SIAKNHLTSSLSFFRKILEDTHAMGIIWEHRYYGNSTPFPISRDTPPEH---FKYLTTKQ 171
Query: 174 ALADFAVFITNL------KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
AL D F N + +L+ ++P VL GGSY G+ AA+ R KYP + A +SSA
Sbjct: 172 ALEDIPYFARNFSRPKFAEHDLTPSSTPWVLVGGSYAGIRAAFARNKYPDVIFAAYSSSA 231
Query: 228 PI-LQFE-DIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFH 285
P+ Q I + + +V F+ +C I + G + + + K F
Sbjct: 232 PVQAQLNMSIYYDQVYRGLVGHGFE----NCAKDIHAALGYIDQQLSNNHTAAAIKKLFF 287
Query: 286 L-CRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP 338
+ NS E L + YSY ++ + P +RE+C+ ++ P
Sbjct: 288 GPGADQNSNEGFTAALATIYSYF--------QNYGLDGPEGTLRELCEHLEVDP 333
>gi|344299008|ref|XP_003421180.1| PREDICTED: thymus-specific serine protease, partial [Loxodonta
africana]
Length = 471
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 66/211 (31%), Positives = 99/211 (46%), Gaps = 26/211 (12%)
Query: 139 LVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--NLSAEASPV 196
+V EHR+YG S+P ++A L +L++ ALAD A L + N+S+ +SP
Sbjct: 12 VVGLEHRFYGLSIPVRGLDMAQ-----LRFLSSRHALADVASAHLALSRLFNVSS-SSPW 65
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRE---- 252
+ FGGSY G LAAW RLK+PH+ ++ASSAP+ D + N+VS
Sbjct: 66 ICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLDF---SEYNNVVSRSLMNTAIGG 122
Query: 253 SASCFNTIKESWGE----LVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYSYLA 308
S C++ ++ E L + G+ + L C L+ ED A+ LE A L
Sbjct: 123 SPECWSAASAAFAETERRLRAGGEAQAA---LRAELGACGRLSRAEDQAELLE-ALQALV 178
Query: 309 MVDYPYPSDFMMPLPGYPIREVCKKIDNAPD 339
Y PL +R++C + D
Sbjct: 179 GGTVQYNGQAGAPL---SVRQLCGLLVGGAD 206
>gi|449548946|gb|EMD39912.1| hypothetical protein CERSUDRAFT_81235 [Ceriporiopsis subvermispora
B]
Length = 555
Score = 80.9 bits (198), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 61/182 (33%), Positives = 92/182 (50%), Gaps = 20/182 (10%)
Query: 54 PPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHW-VGPNRLGPIFLYC 112
P + ++ + +F Q LDHFS F QRY INT H+ G N P+ +
Sbjct: 71 PLNEEDNPERSVAEFPAHWFTQPLDHFSNT-TSKFRQRYWINTRHYKSGTN--APVIVLD 127
Query: 113 GNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLT 170
G E G+ +++G V +A G + V EHR S+P V + +L +L
Sbjct: 128 GGETSGEDRLPFLDTGIVEILAKATGGVGVVLEHR----SLP-----VTELSTDSLRWLN 178
Query: 171 AEQALADFAVFITNLK-----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALAS 225
+QA AD A F+ N+K ++++A P + +GGSY G AA M++ YP + GA+AS
Sbjct: 179 NDQAAADSANFMANVKFPGIDEDITAPGHPWIYYGGSYAGARAAHMKILYPELVYGAIAS 238
Query: 226 SA 227
SA
Sbjct: 239 SA 240
>gi|67541941|ref|XP_664738.1| hypothetical protein AN7134.2 [Aspergillus nidulans FGSC A4]
gi|40742196|gb|EAA61386.1| hypothetical protein AN7134.2 [Aspergillus nidulans FGSC A4]
Length = 1157
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 88/312 (28%), Positives = 130/312 (41%), Gaps = 52/312 (16%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGD----IEWFAVNSGFVWDIA 132
+DH + A + TF RY +N ++V P GP+ LY E D + NS F+ I
Sbjct: 87 IDHHNPA-IGTFRNRYWVNDAYYV-PG--GPVVLYDVGEADGEPSVAHLRSNSSFLPPIL 142
Query: 133 PRFGAMLVFPEHR-----------------YYGESMPYGSTEVAYQNATT---LSYLTAE 172
FGA+ + EHR YYG+S+PY N T L YLT
Sbjct: 143 QEFGAIGIVWEHRSVVSTKVETRIEIDRNRYYGDSLPY-----PVNNNTLPEHLIYLTTR 197
Query: 173 QALADFAVFITNLKQ------NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASS 226
QALAD F N + +L+ ++P V+ GGSY G AA+ R +YP + A+S
Sbjct: 198 QALADIPAFAENFTRPGLQEYDLTPASTPWVMIGGSYPGARAAFARNEYPDTIFASFAAS 257
Query: 227 API-LQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFH 285
AP+ Q + E Y + ++ R AS + E +S + +L F
Sbjct: 258 APVQAQINMSIYYEQVYRSMVANGFRSCASNIHAALEYIDNQLSQNDTAASIKQL--FFG 315
Query: 286 LCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILE 345
E NS ED L Y + M P + E C+ +++ P E
Sbjct: 316 PGAEKNSNEDFTHALAVIYGSFQA--------YGMDGPAGSLGEFCRYLESDPVTGQPAE 367
Query: 346 RIFEGVSVYYNY 357
EG+++ + Y
Sbjct: 368 T--EGLALRHGY 377
>gi|238492811|ref|XP_002377642.1| serine peptidase, family S28, putative [Aspergillus flavus
NRRL3357]
gi|220696136|gb|EED52478.1| serine peptidase, family S28, putative [Aspergillus flavus
NRRL3357]
Length = 592
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 126/294 (42%), Gaps = 34/294 (11%)
Query: 60 RQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE 119
+ Q + T Y +DH + + T+ R+ +N D++ PI +Y E + E
Sbjct: 59 KANTQIEKVTTEYITIPIDH-NDTSVGTYQNRFWVNDDYYEAGR---PIIMYDAGETNAE 114
Query: 120 WFAVNS-----GFVWDIAPRFGAMLVFPEHRYYGESMPYG-STEVAYQNATTLSYLTAEQ 173
A N F I AM + EHRYYG S P+ S + ++ YLT +Q
Sbjct: 115 SIAKNHLTSSLSFFRKILEDTHAMGIIWEHRYYGNSTPFPISRDTPPEH---FKYLTTKQ 171
Query: 174 ALADFAVFITNL------KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
AL D F N + +L+ ++P VL GGSY G+ AA+ R KYP + A +SSA
Sbjct: 172 ALEDIPYFARNFSRPKFAEHDLTPSSTPWVLVGGSYAGIRAAFARNKYPDVIFAAYSSSA 231
Query: 228 PI-LQFE-DIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFH 285
P+ Q I + + +V F+ +C I + G + + + K F
Sbjct: 232 PVQAQLNMSIYYDQVYRGLVGHGFE----NCAKDIHAALGYIDQQLSNNHTAAAIKKLFF 287
Query: 286 L-CRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAP 338
+ NS E L + YSY ++ + P +RE+C+ ++ P
Sbjct: 288 GPGADQNSNEGFTAALATIYSYF--------QNYGLDGPEGTLRELCEHLEVDP 333
>gi|32351098|gb|AAP74974.1| thymus specific serine peptidase [Homo sapiens]
Length = 155
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 66/106 (62%), Gaps = 8/106 (7%)
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ--N 188
+AP +GA+++ EHR+YG S+P G E+A L +L++ ALAD L + N
Sbjct: 9 LAPAWGALVISLEHRFYGLSIPAGGLEMA-----QLRFLSSRLALADVVSARLALSRLFN 63
Query: 189 LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+S+ SP + FGGSY G LAAW RLK+PH+ ++ASSAP+ D
Sbjct: 64 ISSS-SPWICFGGSYAGSLAAWARLKFPHLIFASVASSAPVRAVLD 108
>gi|392585005|gb|EIW74346.1| peptidase S28 [Coniophora puteana RWD-64-598 SS2]
Length = 537
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/274 (28%), Positives = 118/274 (43%), Gaps = 36/274 (13%)
Query: 48 LPHLTEPPQRQQRQQQQQ-YRYETRY-FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRL 105
+P L PPQRQ QQ T Y F+Q +DH + A L TF QRY + +++
Sbjct: 36 IPRLA-PPQRQIVDQQGAPVNLSTVYTFDQLIDHANPA-LGTFKQRYWTSNEYY---KTG 90
Query: 106 GPIFLYCGNEGDIEWFA---VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQN 162
GP+ L E + + + N IA + +V EHR++G+S PYG+
Sbjct: 91 GPVVLMTPGETNADGYESMLTNVSVNGLIAQQNNGAVVVIEHRFFGQSNPYGNL-----T 145
Query: 163 ATTLSYLTAEQALADFAVFITNLK------QNLSAEASPVVLFGGSYGGMLAAWMRLKYP 216
A +L YLT QA+ D A F + + + +P VL GGSY G L +W +K P
Sbjct: 146 AQSLRYLTIAQAIDDLAHFAQTVDLPWAGGDAVKPDKTPWVLTGGSYAGALTSWTMVKKP 205
Query: 217 HIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKEN- 275
+ +SS + D +Y + + +C ++ G L + N
Sbjct: 206 DVFYAGWSSSGVVEAITD------YYAYFTPILEHMPKNCSADVQAVVGYLDQLNSTSNA 259
Query: 276 -GLLELTKTFHLCR-------ELNSTEDLADWLE 301
G+ + TF L L T +LADW E
Sbjct: 260 TGIQTMQDTFGLGNLTHADDFALALTYNLADWQE 293
>gi|169621811|ref|XP_001804315.1| hypothetical protein SNOG_14118 [Phaeosphaeria nodorum SN15]
gi|111057235|gb|EAT78355.1| hypothetical protein SNOG_14118 [Phaeosphaeria nodorum SN15]
Length = 582
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/248 (30%), Positives = 110/248 (44%), Gaps = 28/248 (11%)
Query: 77 LDHFS-FADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV-----NSGFVWD 130
LDHF + TF+ RY + + GP+F+Y EG+ A+ ++ F
Sbjct: 88 LDHFDPSKNHGTFNNRYWAASSSY---KPGGPVFIYDVGEGNASTNALFRIQNSTSFFKQ 144
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFIT------- 183
I ++ + + EHR+YG S P G + A +L EQ+LAD A F +
Sbjct: 145 IVDKYNGIGIVWEHRFYGNSSPGGPVNIDTP-AEQFRFLNTEQSLADVAAFASQFSLKNR 203
Query: 184 NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY- 242
+ L+ E +P V GGSY GM AA+MR KYP + ASSAP+ D ++FY
Sbjct: 204 GINYTLTPETTPWVFVGGSYPGMRAAFMREKYPDTIYASYASSAPVQASVD----QSFYF 259
Query: 243 -NIVSSDFKRESASCFNTIKESWGELVSV---GQKENGLLELTKTFHLCR--ELNSTEDL 296
I K +C I+ + + V G K N + K L + E NS
Sbjct: 260 EPIWRGMQKYGFGNCSRDIQAATRYIDGVFDRGSKNNAAADQLKIMFLGKGAEKNSHATF 319
Query: 297 ADWLESAY 304
AD L + +
Sbjct: 320 ADALTTVF 327
>gi|159130121|gb|EDP55235.1| extracelular serine carboxypeptidase, putative [Aspergillus
fumigatus A1163]
Length = 497
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/166 (33%), Positives = 83/166 (50%), Gaps = 29/166 (17%)
Query: 106 GPIFLYCGNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA 163
GP+FL E G + ++ G V +A + + V EHRYYGES P+ A
Sbjct: 21 GPVFLIAAGETNGRDRFPFLSHGIVTQLAKTYNGLGVILEHRYYGESYPF-----ADLTT 75
Query: 164 TTLSYLTAEQALADFAVFITNL------KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPH 217
+ +L+ EQA+AD+A F +N+ NL+A+A P + +GGSY G A++R YP
Sbjct: 76 KNIRFLSTEQAMADYAYFASNVVFPGLEHLNLTADAVPWIGYGGSYAGAFVAFLRKVYPE 135
Query: 218 IAIGALASSA-------------PILQF---EDIVPPETFYNIVSS 247
+ G ++SS PI QF + I ETF +IV +
Sbjct: 136 VFFGVVSSSGVTEAIVDYWQYYEPIRQFAPSDCIWSIETFMDIVDT 181
>gi|294876612|ref|XP_002767728.1| Lysosomal Pro-X carboxypeptidase, putative [Perkinsus marinus ATCC
50983]
gi|239869546|gb|EER00446.1| Lysosomal Pro-X carboxypeptidase, putative [Perkinsus marinus ATCC
50983]
Length = 300
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 74/285 (25%), Positives = 124/285 (43%), Gaps = 26/285 (9%)
Query: 77 LDHFSF-ADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD-IAPR 134
LDHFS A PT Y ++T+H+ IF G E + V F+ + +A
Sbjct: 30 LDHFSLVAKQPTIPLHYWLDTEHYDPAKDQCAIFYIMGGESPLPESGVIYPFISERLARE 89
Query: 135 FGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL-SAEA 193
+++ EHR+YG S+P +Y+ +L YL+ EQ+L D A + + + + +A
Sbjct: 90 HNGLVIESEHRFYGSSIPQ-----SYEK--SLPYLSVEQSLMDHATVLRHTLETVENANR 142
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAP--ILQFEDIVPPETFYNIVSSDFKR 251
V+ GGSY G LA RL+YP + A ASS+P + E +Y+ V+
Sbjct: 143 CRVIAVGGSYSGFLALAFRLRYPKLVYAAYASSSPGRLYSQEASRFDGRYYSRVTDAADS 202
Query: 252 ESASCFNTIKESWGELVSVGQKENGLLELTKTFHLC------RELNSTEDLADWLESAYS 305
++C N++ +++ + V + +C RE E+L + +S
Sbjct: 203 IRSNCSNSVIKAFDDFVHRYAGRVTFEQAKNELKICNPEVFGREDGLFEELVQMVRMEFS 262
Query: 306 YLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILERIFEG 350
M YP P P +C ++ + A I E + +G
Sbjct: 263 GANMASYP-------PSSNSPTYMLCTMVEQSGIA-GIFEAMTQG 299
>gi|169603686|ref|XP_001795264.1| hypothetical protein SNOG_04851 [Phaeosphaeria nodorum SN15]
gi|111066122|gb|EAT87242.1| hypothetical protein SNOG_04851 [Phaeosphaeria nodorum SN15]
Length = 353
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 57/189 (30%), Positives = 94/189 (49%), Gaps = 21/189 (11%)
Query: 52 TEPPQRQQRQQQQQYRYETRYFEQRLDHFSFAD--LP----TFSQRYLINTDHWVGPNRL 105
T P ++Q ++ Y + +Q +DHF +D +P TF QRY+ ++ ++
Sbjct: 17 TPSPIQRQDKRAATPAYTSYTIDQPIDHFPESDRYVPHTNDTFKQRYVFDSSYY---KPG 73
Query: 106 GPIFLYCGNEGDIE--WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA 163
GP+FLY G E +E + + +G + + +F + V E+RYYG+S PY ++
Sbjct: 74 GPVFLYIGGETSVESRFSNLQTGIIQILMEKFNGIGVILENRYYGKSYPYKTS-----TT 128
Query: 164 TTLSYLTAEQALADFAVF-----ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHI 218
L +LT EQ +AD A F + ++LS P +++GGS G A+ Y I
Sbjct: 129 DELRFLTTEQTIADNAYFRQHATFPGVNESLSGPDVPWIMYGGSLAGAHTAFTMKTYNSI 188
Query: 219 AIGALASSA 227
G + SSA
Sbjct: 189 FAGGIGSSA 197
>gi|340939438|gb|EGS20060.1| serine-type peptidase-like protein [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 551
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 79/173 (45%), Gaps = 18/173 (10%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA----VNSGFV 128
F+Q +DH + L TF QRY T++W GP PI+L E F +S
Sbjct: 54 FDQLIDHNN-PSLGTFKQRYWYGTEYWRGPGS--PIYLTTPGEQSATGFNKTWLTSSRLS 110
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL--- 185
+A R G ++ EHRY+GES PY E+ N L YLT EQ+L D F N
Sbjct: 111 GLMANRTGGAVIILEHRYWGESSPY--EELTVDN---LKYLTLEQSLKDLTYFAKNFVPP 165
Query: 186 ---KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
N SA+ +P + GGSY G LA W+ P +SS + D
Sbjct: 166 FDPSGNSSADKAPWIFSGGSYSGALAGWLAALEPGTFWAYYSSSGVVEAIGDF 218
>gi|258565711|ref|XP_002583600.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237907301|gb|EEP81702.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 533
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 84/165 (50%), Gaps = 22/165 (13%)
Query: 77 LDHFSFADL------PTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFV 128
LDHF D TF RY + H+ + GP+ + G E G + G +
Sbjct: 58 LDHFQNEDRYEPHSNATFKLRYWYDASHY---KKGGPVIILHGGETSGQGRLPFLQKGML 114
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL--- 185
+A + V EHRYYG S+P + + + +N L +LT EQA+AD A F N+
Sbjct: 115 AQLAKATNGVGVVLEHRYYGTSIP--TKDFSTKN---LRFLTTEQAMADSAYFSKNVVFK 169
Query: 186 ---KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
++L+A +P +L+GGSY G A +R++YP I GA++SS
Sbjct: 170 GLEDKDLTAPKTPHILYGGSYAGAQVALLRVEYPEIFWGAISSSG 214
>gi|378726082|gb|EHY52541.1| extracelular serine carboxypeptidase [Exophiala dermatitidis
NIH/UT8656]
Length = 537
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/239 (28%), Positives = 113/239 (47%), Gaps = 35/239 (14%)
Query: 67 RYETRYFEQRLDHFSFADLP--------TFSQRYLINTDHWVGPNRLGPIFLYCGNE--G 116
Y YF+Q +DHF D P TF+QRY + ++ GP+FLY G E G
Sbjct: 34 NYTAYYFDQLIDHFQ--DSPRYAPNTNATFTQRYYFDNTYY---KPGGPVFLYIGGETSG 88
Query: 117 DIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT--LSYLTAEQA 174
+ + +G V + + V E+RYYGES P+ +N TT L +LT EQ
Sbjct: 89 PSRFSNLQTGIVQILMNATNGLGVILENRYYGESYPF-------ENTTTDNLRFLTTEQT 141
Query: 175 LADFAVFITNL-------KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
+AD A F + NL+A+ +P +L+GGS G A+ ++Y + G +A+SA
Sbjct: 142 IADNAYFAQHAVFPNVTGGDNLTADTTPWILYGGSLAGAQTAFSLVEYSGLLWGGIAASA 201
Query: 228 PILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHL 286
+ ++ +YN + + ++ + N I + L+ + + + +L + F L
Sbjct: 202 VV---HAVLGYPEWYNPIQRNGPQDCITRINNIIDKIDYLIQNNETQ-AIQQLKEIFGL 256
>gi|119189871|ref|XP_001245542.1| hypothetical protein CIMG_04983 [Coccidioides immitis RS]
Length = 533
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 85/165 (51%), Gaps = 22/165 (13%)
Query: 77 LDHFSFADL------PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA--VNSGFV 128
+DHF D TF RY + H+ GP+ + G E D E + G +
Sbjct: 48 IDHFHTDDRYAPHSNGTFELRYWFDASHY---KDGGPVIVLHGGETDGEGRLPFLQKGIL 104
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL--- 185
+A + V EHRYYG S+P + + + +N L +LT EQA+AD A F N+
Sbjct: 105 GQLAQATNGVGVVLEHRYYGTSIP--TEDFSTKN---LRFLTTEQAMADSAYFAKNVVFE 159
Query: 186 ---KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
++L+A +P +L+GGSY G A++R++YP I GA++SS
Sbjct: 160 GLEDKDLTAPNTPYILYGGSYAGAQVAFLRVEYPDIFWGAISSSG 204
>gi|296414211|ref|XP_002836796.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295631635|emb|CAZ80987.1| unnamed protein product [Tuber melanosporum]
Length = 657
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 93/201 (46%), Gaps = 16/201 (7%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA--VNSGFV 128
RY E L+H P F RY ++ D + P GPIF E D + + G
Sbjct: 57 RYVELPLNHGD-PKSPKFKNRYWVD-DTYYSPG--GPIFFVDNGEADADGMEEYLRKGAT 112
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQN---ATTLSYLTAEQALADFAVFITNL 185
+A F +L+ EHR+YG SMP + + + + L Y T EQAL D VF
Sbjct: 113 GSLAKEFNGLLILWEHRFYGTSMPDMTNAMRFTSDNFGAYLKYHTIEQALEDVVVFAKQF 172
Query: 186 KQN---LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
N +S P V GGSY G +AWMR++ P I +LASSA ++Q + + +Y
Sbjct: 173 TFNNKTVSPGEVPWVYLGGSYPGARSAWMRIRNPDIFHVSLASSA-VVQLQKNM--WQYY 229
Query: 243 NIVSSDFKRES-ASCFNTIKE 262
++ + A+C I+
Sbjct: 230 RVIEETLDKTGYANCSRDIRN 250
>gi|156058175|ref|XP_001595011.1| hypothetical protein SS1G_04819 [Sclerotinia sclerotiorum 1980]
gi|154702604|gb|EDO02343.1| hypothetical protein SS1G_04819 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 440
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/265 (25%), Positives = 119/265 (44%), Gaps = 28/265 (10%)
Query: 62 QQQQYRYETRYFEQRLDHFSFADL------PTFSQRYLINTDHWVGPNRLGPIFLYCGNE 115
+++ Y+ +Q +DHF + TF QRY + ++ GPI+LY G E
Sbjct: 10 RREALAYKAHTIDQPIDHFPNDPMYAPHTNATFKQRYWFDATYY---KPGGPIYLYIGGE 66
Query: 116 --GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQ 173
G + + +G + + + + E+RYYGES P+ ++ L+YLT +Q
Sbjct: 67 TNGQYRFSNLQTGIIQILMEATNGLGIILENRYYGESFPFNTSTT-----DQLAYLTNQQ 121
Query: 174 ALADFAVF-----ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAP 228
+AD A F + + +++A + +L+GGS G A YP + G +ASSAP
Sbjct: 122 TVADNAYFAQHVSLPGVNASITAPNTKWILYGGSLAGGQTALSVKIYPEVFFGGIASSAP 181
Query: 229 ILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCR 288
I + +V +YN + ++ S N I + + L+S + ++ K+
Sbjct: 182 I---KAVVGYPEWYNPIQRLGPQDCISSINGIIDKFDALISANNTQ--AIKQFKSLFGLE 236
Query: 289 ELNSTEDLADWLESAYSYLAMVDYP 313
L D A + A+ +DYP
Sbjct: 237 ALTDNRDFA--MTIAFPLGGPMDYP 259
>gi|157167874|ref|XP_001656137.1| prolylcarboxypeptidase, putative [Aedes aegypti]
gi|108871003|gb|EAT35228.1| AAEL012589-PA, partial [Aedes aegypti]
Length = 459
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 68/233 (29%), Positives = 103/233 (44%), Gaps = 20/233 (8%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFV 128
E R F+Q H + + F RY+ N+ + P GPIFL+ G ++E V G
Sbjct: 20 EWRVFDQLQSHSNAHSVDMFPMRYVSNSKFYR-PG--GPIFLFVGGPWELEQHFVEQGHF 76
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-Q 187
D+A A +V E RYYGES+P V + L L QA D A I +++ +
Sbjct: 77 VDLAEENNAFVVANEMRYYGESLP-----VPNASRGNLRLLHIVQACTDIARLIVHIRYE 131
Query: 188 NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSS 247
L + V++ G + G LA W RL+YPH+ G AS A + E+ F V
Sbjct: 132 VLRDPNARVIVAGVGFSGSLAHWTRLRYPHLIHGVWASGAMLQANENY---REFAEEVGE 188
Query: 248 DFKRESAS-CFNTIKESWG---ELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
+R + C+ + + L+ GQ + + F +C +N T L
Sbjct: 189 YIRRYGGNDCYGALWRGFRTAENLIDAGQSQT----VDTLFKVCTPINGTNPL 237
>gi|358401652|gb|EHK50953.1| hypothetical protein TRIATDRAFT_296922 [Trichoderma atroviride IMI
206040]
Length = 533
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 88/170 (51%), Gaps = 18/170 (10%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI---EWFAVNSGF 127
R F+Q +DH +L TFSQRY+ N D++ GP PI L NE + E + N+
Sbjct: 49 RTFQQLIDH-DHPELGTFSQRYVYNGDYYKGPGS--PIILVGPNESALDGYEGYTTNATL 105
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL-- 185
A G + EHRY+G+S PY S + ++YLT +Q++ D F N+
Sbjct: 106 PGTYAQAVGGGAIIIEHRYWGQSSPYQSLDTE-----NMTYLTLDQSIKDLTYFAKNVVL 160
Query: 186 --KQNLSA--EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ 231
QN ++ + +P VL G SY G L AW++ P A ++S+P+++
Sbjct: 161 PFDQNRTSTPDKAPWVLSGCSYSGALTAWVQDIAPGT-FWAYSASSPVVE 209
>gi|303322745|ref|XP_003071364.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240111066|gb|EER29219.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320032884|gb|EFW14834.1| extracelular serine carboxypeptidase [Coccidioides posadasii str.
Silveira]
Length = 543
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 85/165 (51%), Gaps = 22/165 (13%)
Query: 77 LDHFSFADL------PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA--VNSGFV 128
+DHF D TF RY + H+ GP+ + G E D E + G +
Sbjct: 56 IDHFHTDDRYAPHSNGTFELRYWFDASHY---KDGGPVIVLHGGETDGEGRLPFLQKGIL 112
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL--- 185
+A + V EHRYYG S+P + + + +N L +LT EQA+AD A F N+
Sbjct: 113 GQLAQATNGVGVVLEHRYYGTSIP--TEDFSTKN---LRFLTTEQAMADSAYFAKNVVFE 167
Query: 186 ---KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
++L+A +P +L+GGSY G A++R++YP I GA++SS
Sbjct: 168 GLEDKDLTAPNTPYILYGGSYAGAQVAFLRVEYPDIFWGAISSSG 212
>gi|392868441|gb|EAS34227.2| extracelular serine carboxypeptidase [Coccidioides immitis RS]
Length = 541
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 85/165 (51%), Gaps = 22/165 (13%)
Query: 77 LDHFSFADL------PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA--VNSGFV 128
+DHF D TF RY + H+ GP+ + G E D E + G +
Sbjct: 56 IDHFHTDDRYAPHSNGTFELRYWFDASHY---KDGGPVIVLHGGETDGEGRLPFLQKGIL 112
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL--- 185
+A + V EHRYYG S+P + + + +N L +LT EQA+AD A F N+
Sbjct: 113 GQLAQATNGVGVVLEHRYYGTSIP--TEDFSTKN---LRFLTTEQAMADSAYFAKNVVFE 167
Query: 186 ---KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
++L+A +P +L+GGSY G A++R++YP I GA++SS
Sbjct: 168 GLEDKDLTAPNTPYILYGGSYAGAQVAFLRVEYPDIFWGAISSSG 212
>gi|321260488|ref|XP_003194964.1| hypothetical protein CGB_F6500W [Cryptococcus gattii WM276]
gi|317461436|gb|ADV23177.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 563
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 91/193 (47%), Gaps = 33/193 (17%)
Query: 60 RQQQQQYR--------YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
RQ QQ ++ +E F Q + HF + TF QRY ++ +V P GP++L
Sbjct: 62 RQLQQPFQIITPSPSIFEPHCFPQLISHFDDSVNGTFCQRYWVDASSYV-PG--GPVYLL 118
Query: 112 CGNEGDIEWFA--VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYL 169
G E E+ + G + ++ G + + EHRYYGES+P V+ + L +L
Sbjct: 119 DGGEISAEYRLPFLEKGILDILSNATGGLSIVLEHRYYGESVP-----VSSFSTDDLRFL 173
Query: 170 TAEQALADFAVFITNLKQNLS---------------AEASPVVLFGGSYGGMLAAWMRLK 214
+AL D A FI N K S +P + +GGSY G AA MR +
Sbjct: 174 NNAEALEDSANFIGNFKLPFSLVNALPFELEETAFHPNNTPWIYYGGSYAGARAAHMRTQ 233
Query: 215 YPHIAIGALASSA 227
YP + GA+ASSA
Sbjct: 234 YPDLVWGAIASSA 246
>gi|452980750|gb|EME80511.1| hypothetical protein MYCFIDRAFT_31871 [Pseudocercospora fijiensis
CIRAD86]
Length = 560
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 80/176 (45%), Gaps = 17/176 (9%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWF---AVN 124
Y YF+Q +DH S L TF Q Y +T +W GP P+ L+ E + +
Sbjct: 65 YSHSYFDQLIDH-SNPKLGTFKQFYYYDTTYWRGPGS--PVILFTPGEVNATRYYSYLTT 121
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+ +A GA + EHRY+G S PY A L YLT E A+AD F N
Sbjct: 122 NRTTGVLASEIGAATIVLEHRYWGTSTPYTDLTTA-----NLKYLTLENAIADMNYFAEN 176
Query: 185 LK------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+K +A+ P V+ GGSY G L+AW+ P +SSAP+ D
Sbjct: 177 VKLPFDTHGASNADDVPWVVMGGSYSGALSAWIASTAPGTIWAYHSSSAPVEAIGD 232
>gi|259483536|tpe|CBF79006.1| TPA: serine peptidase, family S28, putative (AFU_orthologue;
AFUA_4G03790) [Aspergillus nidulans FGSC A4]
Length = 557
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/312 (28%), Positives = 130/312 (41%), Gaps = 52/312 (16%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGD----IEWFAVNSGFVWDIA 132
+DH + A + TF RY +N ++V P GP+ LY E D + NS F+ I
Sbjct: 87 IDHHNPA-IGTFRNRYWVNDAYYV-PG--GPVVLYDVGEADGEPSVAHLRSNSSFLPPIL 142
Query: 133 PRFGAMLVFPEHR-----------------YYGESMPYGSTEVAYQNATT---LSYLTAE 172
FGA+ + EHR YYG+S+PY N T L YLT
Sbjct: 143 QEFGAIGIVWEHRSVVSTKVETRIEIDRNRYYGDSLPY-----PVNNNTLPEHLIYLTTR 197
Query: 173 QALADFAVFITNLKQ------NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASS 226
QALAD F N + +L+ ++P V+ GGSY G AA+ R +YP + A+S
Sbjct: 198 QALADIPAFAENFTRPGLQEYDLTPASTPWVMIGGSYPGARAAFARNEYPDTIFASFAAS 257
Query: 227 API-LQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFH 285
AP+ Q + E Y + ++ R AS + E +S + +L F
Sbjct: 258 APVQAQINMSIYYEQVYRSMVANGFRSCASNIHAALEYIDNQLSQNDTAASIKQL--FFG 315
Query: 286 LCRELNSTEDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILE 345
E NS ED L Y + M P + E C+ +++ P E
Sbjct: 316 PGAEKNSNEDFTHALAVIYGSFQA--------YGMDGPAGSLGEFCRYLESDPVTGQPAE 367
Query: 346 RIFEGVSVYYNY 357
EG+++ + Y
Sbjct: 368 T--EGLALRHGY 377
>gi|242805460|ref|XP_002484534.1| serine peptidase, putative [Talaromyces stipitatus ATCC 10500]
gi|218715159|gb|EED14581.1| serine peptidase, putative [Talaromyces stipitatus ATCC 10500]
Length = 535
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 116/283 (40%), Gaps = 38/283 (13%)
Query: 66 YRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNS 125
+ Y T +F+Q LDH +L TF QRY +T +W G P+ L+ E + F
Sbjct: 46 FEYSTGWFDQLLDH-DKPELGTFRQRYFYSTQYWKGSGS--PVILFQPGEQTADGF---Q 99
Query: 126 GFVWDI------APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
G++ ++ A FG + EHRY+GES P V T+ +LT + ALAD
Sbjct: 100 GYLTNVTISGVYAQEFGGAGIILEHRYWGESSP-----VNTLTPKTMQHLTFKNALADAV 154
Query: 180 VFITNLK------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
F N+K S + +P +L GGSY G A W P ASSAP+
Sbjct: 155 HFAKNVKLPFDNSTRSSPKNAPWILVGGSYSGAQAGWTAATLPGTFWAYHASSAPVEAIW 214
Query: 234 DIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNST 293
+ + F I K S N I + + G E+ +L + F L +
Sbjct: 215 NYW--QYFVPIQQRLPKNCSTDLVNVIDHI--DSILTGSNESAKDDLKRKFMLGDLRD-- 268
Query: 294 EDLADWLESAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDN 336
+D A A+V PY P I E C I+N
Sbjct: 269 DDFA---------AAIVGGPYLGQTTSWGPSGVIYEFCDYIEN 302
>gi|115385535|ref|XP_001209314.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114187761|gb|EAU29461.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 556
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 57/196 (29%), Positives = 94/196 (47%), Gaps = 29/196 (14%)
Query: 49 PHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTD------HWVGP 102
PHL + +R Q + Y Y +DHF RY +TD +W
Sbjct: 27 PHLRQI-KRSHAIQPRDVTYPAHYLSVPIDHFHN------ESRYEPHTDKHFPLRYWFDA 79
Query: 103 NRL---GPIFLYCGNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTE 157
GP+F+ E G+ + ++ G V +A ++ + V EHRYYG+S P+ +
Sbjct: 80 QYYQPGGPVFVIAAGETSGEDRFPFLSQGIVTQLAEKYHGLGVILEHRYYGDSYPFDNL- 138
Query: 158 VAYQNATTLSYLTAEQALADFAVFITNL------KQNLSAEASPVVLFGGSYGGMLAAWM 211
+ + +L+ EQA+AD+A F +N+ +LS E +P + +GGSY G +++
Sbjct: 139 ----TTSNIRFLSTEQAVADYAYFASNVVFPGLDHVDLSPENTPWIAYGGSYAGAFVSFL 194
Query: 212 RLKYPHIAIGALASSA 227
R YP + GA++SS
Sbjct: 195 RKLYPDVYWGAVSSSG 210
>gi|115387739|ref|XP_001211375.1| predicted protein [Aspergillus terreus NIH2624]
gi|114195459|gb|EAU37159.1| predicted protein [Aspergillus terreus NIH2624]
Length = 564
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 62/212 (29%), Positives = 96/212 (45%), Gaps = 25/212 (11%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWF----AVNSGFVWDIAPRFGAMLVFP 142
T+ R+ + +V + PIF++ G E + E + N F W I F M +
Sbjct: 85 TYQNRFWVTEQFYVPGS---PIFVFDGGESNAEVYKSHLTSNQSFFWHILKDFKGMGILW 141
Query: 143 EHRYYGESMPYG---STEVAYQNATTLSYLTAEQALADFAVFITNLKQ------NLSAEA 193
EHRYYGES P+ ST + + YLT QAL D F N + +L ++
Sbjct: 142 EHRYYGESRPFPVSLSTPPEH-----MEYLTTRQALEDIPYFAKNFSRPNHPDVDLRPQS 196
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRES 253
+P ++ GGSY G+ AA R +YP A ASSAP+ E V ++N V +
Sbjct: 197 TPWIMVGGSYAGIRAALARSEYPETIYAAYASSAPV---EAQVDMSIYFNNVYRGMEANG 253
Query: 254 -ASCFNTIKESWGELVSVGQKENGLLELTKTF 284
C +I+ ++ + KE+ + + F
Sbjct: 254 LGDCIQSIQAAYQYIDDQLGKEDTAAAVKRLF 285
>gi|322712275|gb|EFZ03848.1| serine peptidase, putative [Metarhizium anisopliae ARSEF 23]
Length = 550
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 72/251 (28%), Positives = 108/251 (43%), Gaps = 21/251 (8%)
Query: 45 VGKLPHLTEPPQRQQRQQQQQYRYETR--YFEQRLDHFSFADLPTFSQRYLINTDHWVGP 102
+ + HL P R ++ ++ T+ F+Q +DH + DL TF+Q YL N +++ GP
Sbjct: 23 ISRRHHLNRLPSRSPKEYSKRSNIPTQEGTFQQLIDHNN-PDLGTFTQHYLYNAEYYAGP 81
Query: 103 NRLGPIFLYCGNEGDIEWF-AVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQ 161
PI L E ++ F A N+ A A ++ EHRY+G+S PY S
Sbjct: 82 G--SPIVLNTPGEDVVDGFYATNNTLPGLFAQTNNAAVIVLEHRYWGKSSPYDSLST--- 136
Query: 162 NATTLSYLTAEQALADFAVFITNLKQNLSAEA------SPVVLFGGSYGGMLAAWMRLKY 215
T L YLT E A+ D F N++ + +P +L G SY G LAAW
Sbjct: 137 --TNLQYLTLENAIQDLIYFAHNVQLPFDVDGASKPTKAPWILTGCSYPGALAAWTHHLA 194
Query: 216 PHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKEN 275
P +SA I D +Y + + ++ E +++S G K
Sbjct: 195 PGTFWAYHCTSAVIEIISDFW---QYYEPIKAAMPTNCSTDMQRATEQIDKVLSHGTKTQ 251
Query: 276 GLLELTKTFHL 286
L KTF L
Sbjct: 252 K-HALKKTFGL 261
>gi|123463102|ref|XP_001316920.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121899640|gb|EAY04697.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 491
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 97/212 (45%), Gaps = 15/212 (7%)
Query: 69 ETRYFEQRLDHFSFADLPT-FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGF 127
+T + Q +DHF A PT F+QRY +N+D+ N+ I LY G +++ + G
Sbjct: 14 QTLSYTQMVDHF--ARKPTYFTQRYFVNSDY---ANKSRNIILYLGGANELDPNEITPGP 68
Query: 128 VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
+ +IA + ++++ EHRY+G+S+P + + Y + QA+ D F+ K
Sbjct: 69 ILEIASQTKSVIIGLEHRYFGKSVP-----TVNMSQFNMQYCSVPQAILDIKSFVLQGKI 123
Query: 188 NLSAEASP----VVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
P L G YGG LA W + +GA ASSAP++ +
Sbjct: 124 RNDYCTEPDFCKFFLMGKGYGGGLATWASTGFKRFYLGAWASSAPLVSINTFTQYDQKEA 183
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKEN 275
+ E+ +C+ + + + + +V +N
Sbjct: 184 YFLGNITIEATNCYKVMHDVYNTIETVVVAKN 215
>gi|189188228|ref|XP_001930453.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187972059|gb|EDU39558.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 567
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 84/176 (47%), Gaps = 22/176 (12%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE-WFAVNSGF--- 127
YFEQ +DH + L TF RYL + +HW GP PI ++ + +I ++A+ F
Sbjct: 62 YFEQPIDHNN-PSLGTFKIRYLWSNEHWKGPGS--PIVVFTPGQANISSYYAIFDKFNET 118
Query: 128 -----VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
+A GA LV E+RYYGES PY A L YL +Q + D F
Sbjct: 119 LMYQNTAQLAYEVGAALVLVENRYYGESSPYEELTTA-----NLQYLNQDQVMHDLVNFA 173
Query: 183 TNLK----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
N K N +A P +L GGSY G +A ++ K P SSA ++Q +D
Sbjct: 174 KNAKLPFSPNSTASNVPWILTGGSYSGAVATYVADKLPGTYWAYYTSSA-VVQSQD 228
>gi|58268250|ref|XP_571281.1| hypothetical protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|57227516|gb|AAW43974.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 561
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 85/177 (48%), Gaps = 25/177 (14%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNS 125
+E F Q + HF + TF QRY ++ + GPI+L G E G+ +
Sbjct: 76 FEPYCFPQFISHFDESVNGTFCQRYWVDASSY---RPGGPIYLLDGGETSGEYRLPFLEK 132
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G + ++ G + V EHRYYGES+P V+ + L +L +AL D A FI N
Sbjct: 133 GILDILSNATGGLSVVLEHRYYGESVP-----VSSFSTDDLRFLNNAEALEDSAYFIENF 187
Query: 186 KQNLS---------------AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
K S +P + +GGSY G AA MR++YP++ GA+ASSA
Sbjct: 188 KLPASLSNALPFELEETAFHPNNTPWIYYGGSYAGARAAHMRVQYPNLVWGAIASSA 244
>gi|367041275|ref|XP_003651018.1| hypothetical protein THITE_2110971 [Thielavia terrestris NRRL 8126]
gi|346998279|gb|AEO64682.1| hypothetical protein THITE_2110971 [Thielavia terrestris NRRL 8126]
Length = 549
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 83/173 (47%), Gaps = 18/173 (10%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA----VNSGFV 128
F+Q +DH + A L TF QRY T+ W GP PI+L E D F N+
Sbjct: 57 FDQLIDHANPA-LGTFKQRYWYGTEFWKGPGS--PIYLVTPGEQDGTGFNRTWLSNARLT 113
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ- 187
+A + G ++ EHRY+GES PY + V QN L YLT + +L D F
Sbjct: 114 GVMANQTGGAVIILEHRYWGESSPYQNLTV--QN---LKYLTLDNSLQDLVYFAKTFAPP 168
Query: 188 -NLSAEASPV----VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
+ S +++P V+ GGSY G L+AW+ KYP +SS + D
Sbjct: 169 FDTSGKSAPTEAPWVVVGGSYSGALSAWLAAKYPGTFWAYYSSSGVVEAVGDF 221
>gi|170040235|ref|XP_001847912.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167863800|gb|EDS27183.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 492
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 73/308 (23%), Positives = 143/308 (46%), Gaps = 48/308 (15%)
Query: 19 TIVIISILSPLSLAAQP------SKFRRAPRFVGKLPHLTEP--PQRQQRQQQQQYRYET 70
++ ++++L + AA P + + + R + +L P P++ + +
Sbjct: 3 SLFVVALLVASAFAAVPRTNPSKANLQTSRRLLEQLVTRGMPQTPRKPASEAPSKRIVIE 62
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN-SGFVW 129
+F R+DHF + ++ RYL TD++ P GPI ++ G I+ + V+ S ++
Sbjct: 63 NFFTTRIDHFDPQNTAEWTLRYLAVTDYY-QPG--GPILIWLGGNAPIQPYMVDESSLIY 119
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNAT-TLSYLTAEQALADFAVFITNLKQN 188
D+A +G++ V +T L +L +Q LAD A F+T L++
Sbjct: 120 DMAREM-----------------HGASWVTSDTSTENLRFLNTDQILADLAEFVTYLRRE 162
Query: 189 LSA-EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA---PILQFEDIVPP--ETFY 242
++ E + V++ G YGG LA W R++YPH+A A +S ++ F++ +T
Sbjct: 163 VTRNENAHVLVSGVGYGGSLATWFRVRYPHLADAAWSSGGLHNALMDFQEFAEAWGQTLI 222
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES 302
+ S + E F+ ++ L+ G++E L + +LC E+ D D L+
Sbjct: 223 DFGSQECYNEIFVAFHVMQ----NLIDAGREE----ILHERLNLCTEI----DTEDRLQV 270
Query: 303 AYSYLAMV 310
+ ++ M+
Sbjct: 271 QFFFITMM 278
>gi|134113512|ref|XP_774572.1| hypothetical protein CNBF4600 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257214|gb|EAL19925.1| hypothetical protein CNBF4600 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 561
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 59/177 (33%), Positives = 85/177 (48%), Gaps = 25/177 (14%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNS 125
+E F Q + HF + TF QRY ++ + GPI+L G E G+ +
Sbjct: 76 FEPYCFPQFISHFDESVNGTFCQRYWVDASSY---RPGGPIYLLDGGETSGEYRLPFLEK 132
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G + ++ G + V EHRYYGES+P V+ + L +L +AL D A FI N
Sbjct: 133 GILDILSNATGGLSVVLEHRYYGESVP-----VSSFSTDDLRFLNNAEALEDSAYFIENF 187
Query: 186 KQNLS---------------AEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
K S +P + +GGSY G AA MR++YP++ GA+ASSA
Sbjct: 188 KLPASLSNALPFELEETAFHPNNTPWIYYGGSYAGARAAHMRVQYPNLVWGAIASSA 244
>gi|358059566|dbj|GAA94723.1| hypothetical protein E5Q_01376 [Mixia osmundae IAM 14324]
Length = 982
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 73/245 (29%), Positives = 114/245 (46%), Gaps = 33/245 (13%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI--EWFA------VN 124
F+Q +DH + + PTF QRY I D++ PN GP+ + G E DI E +
Sbjct: 28 FDQPIDH-TDSKSPTFKQRYHILGDYY-KPN--GPVIILDGAESDITRETYGGSKISYYR 83
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
+ F +A G +LV E R YG+S P+ + L Y +QA+AD F +
Sbjct: 84 TQFSKHLAQATGGLLVVFEQRCYGKSHPFSRSTT-----DNLRYCLIDQAIADAPYFAQH 138
Query: 185 LK----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET 240
+K + L+A +P +L+GGS GG A+ LKY + +ASSA + D+ P+
Sbjct: 139 VKIPGFEGLNAPKTPYILYGGSLGGAKTAFSMLKYNDVLYAGIASSATVK--SDVTYPK- 195
Query: 241 FYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWL 300
+Y + + N + + EL Q + + +L K F L E L D+
Sbjct: 196 WYTAAQTYAPHACVATINNLVDRMDELHK--QSPHAIPQLQKLFGL-------ESLTDFR 246
Query: 301 ESAYS 305
+ AY+
Sbjct: 247 DFAYA 251
>gi|308448682|ref|XP_003087721.1| hypothetical protein CRE_28002 [Caenorhabditis remanei]
gi|308253266|gb|EFO97218.1| hypothetical protein CRE_28002 [Caenorhabditis remanei]
Length = 513
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 101/224 (45%), Gaps = 29/224 (12%)
Query: 105 LGPIFLYCGNEGDI------EWF--AVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGST 156
+G +FL G EG I +W + VW A FGA EHR+YG G
Sbjct: 55 VGYVFLMLGGEGSINGTNGDKWVRHEAETMMVW--AAEFGAGAFQVEHRFYGSK---GFC 109
Query: 157 EVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPV-VLFGGSYGGMLAAWMRLKY 215
+ Q +L LT +QALAD FI + P+ + FGGSY G L+A+ R Y
Sbjct: 110 PIGDQTTESLKLLTIDQALADIKEFINQMNALYFPLDKPIWITFGGSYPGSLSAFFRETY 169
Query: 216 PHIAIGALASSAPILQFED----IVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVG 271
P + GA++SS+ + F D + E Y VS SC + IK ++ ++
Sbjct: 170 PEMTAGAVSSSSAVHVFVDYYGYAINTEKTYRTVSD-------SCGDVIKTAFQQMQKKA 222
Query: 272 QKENGLLELTK-TFHLC---RELNSTEDLADWLESAYSYLAMVD 311
EL K TF+LC E N ++ + + ++ Y Y ++
Sbjct: 223 YNGPDSRELLKTTFNLCDSFDENNLSKSIQFFFQNVYGYFQGIN 266
>gi|361132051|gb|EHL03666.1| putative serine protease K12H4.7 [Glarea lozoyensis 74030]
Length = 577
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 55/171 (32%), Positives = 83/171 (48%), Gaps = 16/171 (9%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE---GDIEWFAVNSGFV 128
+F+Q LDH + + TF Q++ N + W GP PI ++ E + + N+ +
Sbjct: 99 FFDQLLDHKNPSK-GTFKQKFWWNIEFWNGPGS--PIVMFTPGEIAAANYGAYLTNATVI 155
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL--- 185
A ++ EHR++GES PY + N+ TL LT EQ++ADF F
Sbjct: 156 GLYAQEIKGAVIMVEHRFWGESSPYQTL-----NSETLQLLTLEQSIADFVYFAKVAPLP 210
Query: 186 --KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+ +A+ +P V GGSY G LAAW+ P ASSAP+ +D
Sbjct: 211 FDTKKSNADKAPWVFSGGSYSGALAAWIESTSPGTFWAYHASSAPVQAIDD 261
>gi|398391815|ref|XP_003849367.1| serine carboxypeptidase, partial [Zymoseptoria tritici IPO323]
gi|339469244|gb|EGP84343.1| serine carboxypeptidase [Zymoseptoria tritici IPO323]
Length = 486
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 65/233 (27%), Positives = 102/233 (43%), Gaps = 23/233 (9%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA---VNSGFVW 129
F+Q LDH + TF+ RY +T +W GP PI L E D ++ N V
Sbjct: 3 FQQPLDH-NDPSKGTFAIRYWFDTTYWKGPGS--PIVLTTYGETDATYYINGLTNRSMVG 59
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL 189
A GA + EHRY+G+S+P S A + YLT E +L D + F ++
Sbjct: 60 VTAKAIGAAAILVEHRYFGQSIPVDSLTTA-----NMKYLTLEDSLKDLSYFARTVELPF 114
Query: 190 SAEAS--------PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETF 241
+ +A P VL GGSY G LAAW +P ASS+ + Q F
Sbjct: 115 ARDAGCASNAADVPWVLMGGSYAGSLAAWTAKLFPDTFWAYYASSSVVHQTAAFW---QF 171
Query: 242 YNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
+ V + ++ + + E++ G ++N + +L F +++S +
Sbjct: 172 FEPVKQNMPQDCRDDVTAVIDHMDEILIHGSEQN-IADLKARFDFPAKIDSVD 223
>gi|156058698|ref|XP_001595272.1| hypothetical protein SS1G_03361 [Sclerotinia sclerotiorum 1980]
gi|154701148|gb|EDO00887.1| hypothetical protein SS1G_03361 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 535
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 56/169 (33%), Positives = 78/169 (46%), Gaps = 17/169 (10%)
Query: 70 TRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE---GDIEWFAVNSG 126
+ +F Q LDH + TF Q++ N+++W GP PI + E + + N
Sbjct: 51 SAFFTQLLDHEN-PSKGTFQQKFWWNSENWAGPGS--PIVFFTPGEIAAAEYGAYLTNVT 107
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL- 185
A +V EHRY+GES PY + T L YL +QA+ADF F +
Sbjct: 108 VTGLFAQEVKGAVVMVEHRYWGESSPYDNLTT-----TNLQYLNLKQAIADFVHFAKTVD 162
Query: 186 -----KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+ +A A+P +L GGSY G LAAW P ASSAP+
Sbjct: 163 LPFDTNHSSNAAAAPWILSGGSYSGALAAWTESTSPGTFWAYHASSAPV 211
>gi|161078413|ref|NP_001097835.1| CG11626 [Drosophila melanogaster]
gi|158030303|gb|AAF55668.2| CG11626 [Drosophila melanogaster]
gi|189181944|gb|ACD81748.1| IP20428p [Drosophila melanogaster]
Length = 379
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 58/106 (54%), Gaps = 4/106 (3%)
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL- 189
+A ML + EHRYYG S+P+G+ N L L+ Q+LAD A FI + K N
Sbjct: 1 MAVENSGMLFYTEHRYYGLSLPFGNESYRLSN---LKQLSLHQSLADLAHFIRHQKSNDP 57
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
E S V+L GGSY G L AWM YP + + ASSAP+L D
Sbjct: 58 EMEDSKVILVGGSYSGSLVAWMTQLYPDLIAASWASSAPLLAKADF 103
>gi|336275849|ref|XP_003352678.1| hypothetical protein SMAC_01511 [Sordaria macrospora k-hell]
gi|380094568|emb|CCC07948.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 570
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 89/176 (50%), Gaps = 26/176 (14%)
Query: 68 YETRYFEQRLDHFSFADL------PTFSQRYLINTDHWVGPNRLGPIFLYCGNE----GD 117
Y R E +DHF TF RY + ++ + GP+ + E G
Sbjct: 57 YPARTIEVPVDHFHNDTKYEPHTNDTFDLRYWFDATYY---KKGGPVIVLAAGETSGVGR 113
Query: 118 IEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALAD 177
+++ + G V+ +A G + V EHRYYG+S+P +++ + +N L +LT +QALAD
Sbjct: 114 LQF--LQKGIVYQLAKATGGVGVILEHRYYGKSLP--TSDFSTKN---LRFLTTDQALAD 166
Query: 178 FAVFITNLK------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
F N+K +L+A +P + +GGSY G A++R YP + GA++SS
Sbjct: 167 TVYFAKNVKFAGLEHLDLTAPNTPYIAYGGSYAGAFVAFLRKLYPDVYWGAISSSG 222
>gi|449704577|gb|EMD44793.1| serine carboxypeptidase (S28) family protein, partial [Entamoeba
histolytica KU27]
Length = 87
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 42/87 (48%), Positives = 56/87 (64%), Gaps = 7/87 (8%)
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGS 202
EHR+YG S P S E+ L Y TAEQAL D+ I+++++ + PV++ GGS
Sbjct: 5 EHRFYGASTP--SLEM-----DKLIYCTAEQALMDYVEVISHVQEENNLVGHPVIVLGGS 57
Query: 203 YGGMLAAWMRLKYPHIAIGALASSAPI 229
Y G LAAWMR KYP++ GA ASSAP+
Sbjct: 58 YSGNLAAWMRQKYPNVVEGAWASSAPV 84
>gi|119487142|ref|XP_001262426.1| serine peptidase, family S28, putative [Neosartorya fischeri NRRL
181]
gi|119410583|gb|EAW20529.1| serine peptidase, family S28, putative [Neosartorya fischeri NRRL
181]
Length = 550
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 58/169 (34%), Positives = 85/169 (50%), Gaps = 16/169 (9%)
Query: 71 RYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW----FAVNSG 126
Y E +DH + T+ R+ +N D +V + PI +Y E E+ +S
Sbjct: 59 EYTEMLIDHED-PSVGTYRNRFWVNEDFYVSGS---PIMVYDIGEATAEYSVSLLTNSSS 114
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
++ + F AM + EHRYYG+S+P+ ++ L YLT EQALAD F N
Sbjct: 115 WLSLLLQEFHAMGIVWEHRYYGDSLPFPVSQD--MPVEHLKYLTTEQALADIPYFAANFS 172
Query: 187 Q------NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+ +L+ +P V+ GGSY G+ AA+ R KYP A ASSAP+
Sbjct: 173 RPNHPDIDLTPRGTPWVMIGGSYPGIRAAFTRNKYPDTIFAAYASSAPV 221
>gi|158298286|ref|XP_001689125.1| AGAP004013-PA [Anopheles gambiae str. PEST]
gi|157014451|gb|EDO63478.1| AGAP004013-PA [Anopheles gambiae str. PEST]
Length = 496
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 138/294 (46%), Gaps = 43/294 (14%)
Query: 20 IVIISILSPLSLAAQPSKFRRA--PRFVGKLPHLT---EPPQRQQRQQQQQYRYETRYFE 74
+V++ +LAA+ S +A P F+ +L T +P QR ++ +F
Sbjct: 5 VVLLLAACGTALAAKLSTPPKALTPSFLNRLRSATVGLKPSQRNANITEE-------FFT 57
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG-FVWDIAP 133
+DHF+ DL T+S RYL DH+V GP+ ++ + ++ ++ G + ++A
Sbjct: 58 TEVDHFNNQDLTTWSNRYLALMDHFVEG---GPMLIFLTGDAPLDPSMIDDGTLINEMAR 114
Query: 134 RFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE- 192
G + E R+YG+S P G V +L L +Q LAD A F+ +L++ +
Sbjct: 115 DLGGAVFALETRFYGKSQPVGDLTVE-----SLRLLNTDQILADVADFVVHLRRTVINNP 169
Query: 193 -ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA---PILQFEDIVP--PETFYNIVS 246
A P+V G GG LA W R++YPH+ +SS + F++ ET + S
Sbjct: 170 FAHPLVT-GTGLGGGLATWFRVRYPHLVDATWSSSGYIQAVFDFQEFSSGWAETAITVGS 228
Query: 247 SDFKRESASCFNTIKESWG---ELVSVGQKENGLLELTKTFHLCRELNSTEDLA 297
++ C+N I ++ L+ G E L + F+LC ++S + +A
Sbjct: 229 NE-------CYNRIFIAFHVAQNLIDAGFGE----VLYEKFNLCSPIDSEDRMA 271
>gi|67900488|ref|XP_680500.1| hypothetical protein AN7231.2 [Aspergillus nidulans FGSC A4]
gi|40742088|gb|EAA61278.1| hypothetical protein AN7231.2 [Aspergillus nidulans FGSC A4]
gi|259483425|tpe|CBF78805.1| TPA: hypothetical serine carboxypeptidase (Eurofung) [Aspergillus
nidulans FGSC A4]
Length = 519
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 87/174 (50%), Gaps = 20/174 (11%)
Query: 68 YETRYFEQRLDHFSFADL------PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE-- 119
Y+ F +DHF TF+ RY ++ ++ GP+F+ E D E
Sbjct: 18 YQAYNFSVPIDHFHNETRYAPHSNGTFNLRYWFDSTYY---QPGGPVFVIAAGETDGEDR 74
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
+ ++ G V +A + + V EHRYYGES P+ +V L +L+ EQ+LAD+A
Sbjct: 75 FEFLSQGIVTQLAEAYNGLGVILEHRYYGESYPFPGADVTVDE---LRFLSTEQSLADYA 131
Query: 180 VFITNL------KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
F ++ +L+A +P + +GGSY G A+MR YP I GA++SS
Sbjct: 132 YFAKHVIFPGLEAYDLTAPNTPWIAYGGSYAGAQVAFMRKLYPSIFHGAVSSSG 185
>gi|303314813|ref|XP_003067415.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240107083|gb|EER25270.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320037761|gb|EFW19698.1| serine peptidase, family S28 [Coccidioides posadasii str. Silveira]
Length = 555
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 77/153 (50%), Gaps = 16/153 (10%)
Query: 88 FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA----VN-SGFVWDIAPRFGAMLVFP 142
+ R+ +N + GP+FL+ G E + + +A VN + F + F M +
Sbjct: 86 YKNRFWVNDSKY---KSGGPVFLFDGGEANAQRYADFYLVNETSFFVQLLEEFHGMGIVW 142
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ------NLSAEASPV 196
EHRYYGES P+ A YL EQALAD F N K+ +L+ +++P
Sbjct: 143 EHRYYGESNPFPVNLDT--PAEHFQYLNNEQALADIPYFAKNFKRENFPDDDLTPKSTPW 200
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
V+ GGSY GM AA+ R +YP + A+ AP+
Sbjct: 201 VMIGGSYPGMRAAFTRDQYPETIFASFAACAPV 233
>gi|380483529|emb|CCF40564.1| serine carboxypeptidase S28 [Colletotrichum higginsianum]
Length = 537
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 86/186 (46%), Gaps = 17/186 (9%)
Query: 53 EPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYC 112
+PP R + + + E +FEQ +DH + +L TF QRY N +W GP PI L+
Sbjct: 24 QPPLRAEDDEGLLSKREAGFFEQLIDHDA-PELGTFQQRYWWNATYWKGPGS--PIVLFT 80
Query: 113 GNEGDIEWFA---VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYL 169
E E ++ + +IA G +V EHR +G S+PY Q+ L
Sbjct: 81 PGEVAAEAYSGYLTDRALTGNIAKAIGGAVVMVEHRNWGTSLPY-----TLQDTKNLQQH 135
Query: 170 TAEQALADFAVFITNLK----QNLSAEA--SPVVLFGGSYGGMLAAWMRLKYPHIAIGAL 223
T A+ DF NL+ N S+ A +P V GGSY G+LAA + P
Sbjct: 136 TMTNAVLDFVNLARNLELPFDTNSSSNAPQAPWVYTGGSYSGVLAAAIAKLAPGTLWAYH 195
Query: 224 ASSAPI 229
+SS P+
Sbjct: 196 SSSGPV 201
>gi|119175285|ref|XP_001239900.1| hypothetical protein CIMG_09521 [Coccidioides immitis RS]
gi|392870094|gb|EAS28654.2| serine peptidase, family S28 [Coccidioides immitis RS]
Length = 555
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 77/153 (50%), Gaps = 16/153 (10%)
Query: 88 FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA----VN-SGFVWDIAPRFGAMLVFP 142
+ R+ +N + GP+FL+ G E + + +A VN + F + F M +
Sbjct: 86 YKNRFWVNDSKY---KSGGPVFLFDGGEANAQRYADFYLVNETSFFVQLLEEFHGMGIVW 142
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ------NLSAEASPV 196
EHRYYGES P+ A YL EQALAD F N K+ +L+ +++P
Sbjct: 143 EHRYYGESNPFPVNLDT--PAEHFQYLNNEQALADIPYFAKNFKRENFPDDDLTPKSTPW 200
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
V+ GGSY GM AA+ R +YP + A+ AP+
Sbjct: 201 VMIGGSYPGMRAAFTRDQYPETIFASFAACAPV 233
>gi|405121525|gb|AFR96294.1| serine carboxypeptidase [Cryptococcus neoformans var. grubii H99]
Length = 520
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/178 (32%), Positives = 85/178 (47%), Gaps = 26/178 (14%)
Query: 68 YETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNS 125
+E F Q + HF + TF QRY ++ + P GP++L G E G+ +
Sbjct: 78 FEPYCFPQFISHFDDSVNGTFCQRYWVDASSYT-PG--GPVYLLDGGEISGEYRLPFLEK 134
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G + ++ G + + EHRYYGES+P V+ + L +L +AL D A FI N
Sbjct: 135 GILDILSNATGGLSIVLEHRYYGESVP-----VSSFSTDDLRFLNNAEALEDSAYFIENF 189
Query: 186 K----------------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
K +P + +GGSY G AA MR++YP++ GA+ASSA
Sbjct: 190 KLPSSLSNNILPFELEETAFHPNNTPWIYYGGSYAGARAAHMRVQYPNLVWGAIASSA 247
>gi|328865129|gb|EGG13515.1| hypothetical protein DFA_11276 [Dictyostelium fasciculatum]
Length = 576
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 78/157 (49%), Gaps = 6/157 (3%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
F QR+DH + ++ TF QRY IN + L G EG I+ N +A
Sbjct: 38 FNQRVDH-NGVNVKTFPQRYCINKSFVHKGAAPKSVMLVLGGEGPIDPEITNHIPFIGVA 96
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
++++ E RYYGES+P V + + YLT +Q L D A F T
Sbjct: 97 NNTNSIIIALEIRYYGESIP-----VPNMSTDNMQYLTTDQILDDIAYFQTQFTNLYGLH 151
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
++ G SY G L+AW R+KYP++A A+ASSAPI
Sbjct: 152 NCKWIVMGCSYAGSLSAWYRMKYPNLAAAAIASSAPI 188
>gi|396460264|ref|XP_003834744.1| hypothetical protein LEMA_P068870.1 [Leptosphaeria maculans JN3]
gi|312211294|emb|CBX91379.1| hypothetical protein LEMA_P068870.1 [Leptosphaeria maculans JN3]
Length = 594
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 69/215 (32%), Positives = 92/215 (42%), Gaps = 28/215 (13%)
Query: 36 SKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYET---RYFEQRLDHFSFAD------LP 86
S F+ AP L L + + + R E Y LD+F +
Sbjct: 51 SSFKNAPPDTSSLSALLKSKTSESGAASRLSRTEIIVPEYVTLPLDNFHASKGQDAEYEG 110
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV-----NSGFVWDIAPRFGAMLVF 141
+F+ RY + G GP+F+Y E D E A+ + F I FG + V
Sbjct: 111 SFANRYWVAES---GYRPGGPVFVYDVGEADAEPNALFRLQNETSFFKQIVDEFGGIGVV 167
Query: 142 PEHRYYGESMPYGSTEVAYQNA--TTLSYLTAEQALADFAVFIT-----NLKQNLSAEAS 194
EHR+YG S P E N YLT EQ+LAD F N+ L+ +A+
Sbjct: 168 WEHRFYGNSTP----EPININTPPEVFKYLTTEQSLADVERFAKQFSRPNINHTLTPDAT 223
Query: 195 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
P + GGSY GM AA+MR YP A ASSAP+
Sbjct: 224 PWIFIGGSYPGMRAAFMRNMYPDTIYAAYASSAPV 258
>gi|123479668|ref|XP_001322991.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121905847|gb|EAY10768.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 437
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/157 (35%), Positives = 82/157 (52%), Gaps = 9/157 (5%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIA 132
F+Q LDH + TF Q Y TDH VG + I + G E D + S F +A
Sbjct: 20 FKQTLDHENTGS-ETFDQYYYEVTDHVVGQPK--AIIVKIGAESDKLVASGVSDFNAVLA 76
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
R+ A+++ +HR++G+S+P V L +LT EQA+ D+ VF + QN
Sbjct: 77 KRYNAIVLTIQHRFFGKSIPQDGLTV-----DKLKFLTVEQAVQDYKVF-HDYYQNEKKL 130
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
P ++ GGSY G+L+A +R KYP A++SS +
Sbjct: 131 NLPWLVVGGSYPGLLSALIRDKYPDDFKAAISSSGVL 167
>gi|429848945|gb|ELA24373.1| serine peptidase, putative [Colletotrichum gloeosporioides Nara
gc5]
Length = 526
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/268 (27%), Positives = 119/268 (44%), Gaps = 32/268 (11%)
Query: 53 EPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYC 112
+PP R + + + + YFEQ +DH + +L TF QRY +N+ +W GP PI ++
Sbjct: 24 QPPFRAEDDEGGLVKRDALYFEQLIDHDA-PELGTFQQRYWVNSTYWKGPGS--PIIVFT 80
Query: 113 GNEGDIEWFAVNSGFVWD------IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL 166
E E + SG++ D IA G +V EHR +G S+PY A Q+ L
Sbjct: 81 PGEVAAEAY---SGYLTDRALTGSIAKAVGGAVVMVEHRNWGTSLPY-----ALQDTKNL 132
Query: 167 SYLTAEQALADFAV--------FITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHI 218
T A+ D F TN N A +P + GGSY G+LAA + P
Sbjct: 133 QQHTMTNAVFDLTNLARTVDLPFDTNHSSN--APQAPWIYTGGSYSGILAAAISKYAPGT 190
Query: 219 AIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLL 278
ASS P+ D +++ + + + + FN + + +++ G E+ +
Sbjct: 191 IWAYHASSGPVEATYDYW---SYFLPIQNGMPQNCSRDFNLMVDHIDTVLTSG-TEDEIY 246
Query: 279 ELTKTFHLCRELNSTEDLADWLESAYSY 306
L + F L ++L +D A L Y
Sbjct: 247 SLKEKFGL-QDLAHKDDFASALVGPLGY 273
>gi|123446346|ref|XP_001311925.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
gi|121893752|gb|EAX98995.1| Clan SC, family S28, unassigned serine peptidase [Trichomonas
vaginalis G3]
Length = 436
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/262 (27%), Positives = 110/262 (41%), Gaps = 26/262 (9%)
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGN 114
P R +R+ Y F Q +DH S TF QRY D+ N+ +F+ G
Sbjct: 3 PGRLRRRILGDDNYTFLTFSQNIDH-SDPQKGTFKQRYEALFDY-TTDNKTAILFI--GG 58
Query: 115 EGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQA 174
E D + ++ + F A EHRY+GES P T+++Y N + YLT + A
Sbjct: 59 ESDTFRPRAFNDYMATLCKEFNAAFFMLEHRYFGESFP---TDLSYPN---IKYLTVDNA 112
Query: 175 LADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+ D F + + S +L GGSY G+L+A+ R KYP ++ASS ++ +
Sbjct: 113 IDDLYNFKVKMVEQYKMTDSKWILVGGSYPGLLSAYTRAKYPKEFHASIASSGVVIASNN 172
Query: 235 IVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVG-------------QKENGLLELT 281
E F ++ + AS I+ EL+ +KEN L L
Sbjct: 173 Y---EDFDRQIAISLGQSCASVAREIRRRTDELLETDPDWLLATFNMTGLEKENFPLVLG 229
Query: 282 KTFHLCRELNSTEDLADWLESA 303
+ F L + + L LE
Sbjct: 230 EIFSLGAQYGRRQQLCGPLEDT 251
>gi|296817873|ref|XP_002849273.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238839726|gb|EEQ29388.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 544
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/241 (27%), Positives = 107/241 (44%), Gaps = 27/241 (11%)
Query: 77 LDHFSFAD------LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA--VNSGFV 128
+DHF + + F+ RY + H+ GP+ + G E D E + G +
Sbjct: 57 IDHFPKSQRYEPHTMEKFNLRYWFDASHY---KEGGPVIILHGGETDGEGRIPFLQKGIL 113
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-- 186
+A + V EHRYYG S+P + N +L +LT EQALAD A F N+K
Sbjct: 114 AQLAQATNGIGVVMEHRYYGGSLPTRD----FSN-KSLRFLTTEQALADTAYFSQNIKFP 168
Query: 187 ----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
NL+A + +++GGSY G A++R +YP + GA++SS D ++
Sbjct: 169 GLEKYNLTAPGTAHIVYGGSYAGGQVAFLRTQYPDVFWGAISSSGVTKAIYDYW---QYF 225
Query: 243 NIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLES 302
+ + ++ + ++ G+ N EL F L R ++ D A+ L S
Sbjct: 226 EPIRQEAPQDCVHVTQNFVDIVDNIIIHGKNANTTQELKNLFGLGRLRDA--DFANALSS 283
Query: 303 A 303
Sbjct: 284 G 284
>gi|350295890|gb|EGZ76867.1| peptidase S28 [Neurospora tetrasperma FGSC 2509]
Length = 562
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 83/151 (54%), Gaps = 20/151 (13%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNE----GDIEWFAVNSGFVWDIAPRFGAMLVFP 142
TF+ RY + ++ + GP+ + E G +++ + G V+ +A G + V
Sbjct: 85 TFNLRYWFDATYY---KKGGPVIVLAAGETSGVGRLQF--LQKGIVYQLAKATGGVGVIL 139
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK------QNLSAEASPV 196
EHRYYG+S+P +++ + +N L +LT +QALAD F N+K +L+A +P
Sbjct: 140 EHRYYGKSLP--TSDFSTKN---LRFLTTDQALADTVYFAKNVKFAGLEHLDLTAPNTPY 194
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
+ +GGSY G A++R YP + GA++SS
Sbjct: 195 IAYGGSYAGAFVAFLRKLYPDVYWGAISSSG 225
>gi|345570251|gb|EGX53075.1| hypothetical protein AOL_s00007g24 [Arthrobotrys oligospora ATCC
24927]
Length = 687
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/182 (30%), Positives = 80/182 (43%), Gaps = 33/182 (18%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTD----------HWVGPNRLGPIFLYCGNEGDIEWF 121
+ E +DHFS + T S RY + H +G + +GP Y D + F
Sbjct: 69 FVELPMDHFSADNKDTISCRYFVQESYYKPGGPVIFHDIGESSIGP---YAKGLVDEDEF 125
Query: 122 AVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATT------LSYLTAEQAL 175
+V +A RF +L+ EHR+YG+S P T + A+ + T EQAL
Sbjct: 126 SVA------MAKRFNGLLILFEHRFYGQSAPTTKTSQSLAQASRRQLTEFFKFHTIEQAL 179
Query: 176 ADFAVFITNL--------KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
D F N KQ L+ + +P + G SY G AWM + P + LASSA
Sbjct: 180 EDVVYFAKNFTYDLEEYPKQVLTPDKTPWIWIGVSYSGARGAWMAKRNPGLFKATLASSA 239
Query: 228 PI 229
P+
Sbjct: 240 PV 241
>gi|452003099|gb|EMD95556.1| hypothetical protein COCHEDRAFT_1061531, partial [Cochliobolus
heterostrophus C5]
Length = 497
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 79/171 (46%), Gaps = 21/171 (12%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE----WFAVNSGFVWDIA 132
LDHF +D TF RY + ++++ GP+F+ E + E W F I
Sbjct: 5 LDHFG-SDAGTFPNRYWVYSENY---KPGGPVFILDQGESNAEPVSRWIPDPRFFFNQIV 60
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL---SYLTAEQALADFAVFITN----- 184
F + + EHR YGES+P G + N T+L YL QALAD F
Sbjct: 61 KEFNGIGIAWEHRMYGESVPAG-----FHNDTSLDRFKYLNVPQALADIDAFAKQFSLPY 115
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
+ L A+ +P V G SY G AAW+R KYP + ASSA + D+
Sbjct: 116 INATLDADHTPWVFIGASYSGGRAAWVRNKYPDSIYASWASSAVVEAMVDM 166
>gi|336463818|gb|EGO52058.1| hypothetical protein NEUTE1DRAFT_118619 [Neurospora tetrasperma
FGSC 2508]
Length = 569
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 82/151 (54%), Gaps = 20/151 (13%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNE----GDIEWFAVNSGFVWDIAPRFGAMLVFP 142
TF RY + ++ + GP+ + E G +++ + G V+ +A G + V
Sbjct: 82 TFDLRYWFDATYY---KKGGPVIVLAAGETSGVGRLQF--LQKGIVYQLAKATGGVGVIL 136
Query: 143 EHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK------QNLSAEASPV 196
EHRYYG+S+P +++ + +N L +LT +QALAD F N+K +L+A +P
Sbjct: 137 EHRYYGKSLP--TSDFSTKN---LRFLTTDQALADTVYFAKNVKFAGLEHLDLTAPNTPY 191
Query: 197 VLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
+ +GGSY G A++R YP + GA++SS
Sbjct: 192 IAYGGSYAGAFVAFLRKLYPDVYWGAISSSG 222
>gi|344268139|ref|XP_003405920.1| PREDICTED: hypothetical protein LOC100665381 [Loxodonta africana]
Length = 431
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 73/142 (51%), Gaps = 5/142 (3%)
Query: 162 NATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIG 221
+ +L YL++ QALAD F T + + + + V FG SYGG LA W R+K+P +
Sbjct: 197 STASLHYLSSRQALADIVNFRTQIAEKMGLTRNKWVAFGCSYGGSLAVWSRIKHPDLFAA 256
Query: 222 ALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELT 281
A+ SSAPI + + +V +++CF +KE +G++V + + +L
Sbjct: 257 AVGSSAPIQAKANFYE---YLEVVQRSLATHNSTCFQAVKEVFGQVVKMLKLPKYYSKLE 313
Query: 282 KTFHLCR--ELNSTEDLADWLE 301
F LC+ L S D A +L+
Sbjct: 314 NDFTLCKPMRLYSAMDKAFFLD 335
>gi|330914104|ref|XP_003296495.1| hypothetical protein PTT_06614 [Pyrenophora teres f. teres 0-1]
gi|311331312|gb|EFQ95405.1| hypothetical protein PTT_06614 [Pyrenophora teres f. teres 0-1]
Length = 574
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 74/241 (30%), Positives = 102/241 (42%), Gaps = 38/241 (15%)
Query: 18 ITIVIISILSPLSLAAQPSKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETR------ 71
I ++ ++L+ L +P R +G P PQ +RQ
Sbjct: 8 ILVLASAVLAGRPLLHKPGFSLHTGRSIGN-PVTGNSPQLGKRQDVTMKSLTGNNSNPGV 66
Query: 72 -YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD 130
YFEQ +DH + L TF +YL +T+ W GP PI L+ E D A N FVW+
Sbjct: 67 AYFEQPIDHNN-PSLGTFKMKYLWSTEAWKGPGS--PIILFPSGEYD----ADNWKFVWN 119
Query: 131 -------------IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALAD 177
+A GA + E RYYGES P+ N L YLT +Q + D
Sbjct: 120 RYNETLMYQNVAQLAVEIGAAFLILESRYYGESSPFEEL-----NTANLQYLTHDQMIYD 174
Query: 178 FAVFITNLK----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFE 233
+ F N+K N +A P VL G S LA ++ K P ASSA +LQ +
Sbjct: 175 WVNFAANVKLPFSHNSTASNVPWVLAGSSLNANLATYIAHKLPGTFWTYYASSA-VLQTQ 233
Query: 234 D 234
+
Sbjct: 234 N 234
>gi|340514339|gb|EGR44603.1| predicted protein [Trichoderma reesei QM6a]
Length = 533
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 68/223 (30%), Positives = 106/223 (47%), Gaps = 21/223 (9%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI---EWFAVNSGFVW 129
F+Q +DH +L TFSQRY+ N + + GP PI L NE + E + N
Sbjct: 52 FQQLIDH-DHPELGTFSQRYVWNDEFYAGPGS--PIILMGPNESALDGYERYTTNLTLPG 108
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL---- 185
+A GA + EHRY+G+S P+ S + YLT EQ++ D F N+
Sbjct: 109 VMAQELGAGALIIEHRYWGQSSPFDSLTT-----ENMRYLTLEQSVQDLVYFAQNVVLPF 163
Query: 186 KQNLSA--EASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYN 243
QN ++ + +P VL G SY G LAAW++ P A S+P++ E I P ++
Sbjct: 164 DQNRTSTPDKAPWVLVGCSYSGALAAWVQDLAPGT-FWAYQCSSPVV--EAIGPLWKYFE 220
Query: 244 IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHL 286
V + + + I + + + V +G + EL +F+L
Sbjct: 221 QVKLAMPQNCTADYARIVQ-YIDGVLLGHNQKAKDELKASFNL 262
>gi|123501346|ref|XP_001328053.1| Serine protease K12H4.7 precursor-related protein [Trichomonas
vaginalis G3]
gi|121910991|gb|EAY15830.1| Serine protease K12H4.7 precursor-related protein [Trichomonas
vaginalis G3]
Length = 136
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/116 (37%), Positives = 70/116 (60%), Gaps = 10/116 (8%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDI 131
+F+Q+LDHFS D TF Q+Y INT++ + + +Y G E + ++ G V I
Sbjct: 23 WFDQKLDHFSNLD-ETFKQKYYINTNY---SKKSENLVVYIGGEAPLSESSIKHG-VLHI 77
Query: 132 APRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ 187
A + ++++ EHRY+G+S+P+G+ E+ L YLT +QA+ D A FI+ +KQ
Sbjct: 78 ANQSKSVILALEHRYFGDSIPHGNLELE-----NLKYLTVDQAIEDLANFISQMKQ 128
>gi|85112594|ref|XP_964374.1| hypothetical protein NCU00831 [Neurospora crassa OR74A]
gi|28926153|gb|EAA35138.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 561
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 91/183 (49%), Gaps = 28/183 (15%)
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGN 114
P R + + +T+Y D TF RY + ++ + GP+ +
Sbjct: 62 PARTIKVPVDHFHNDTKYEPHTND--------TFDLRYWFDATYY---KKGGPVIVLAAG 110
Query: 115 E----GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLT 170
E G +++ + G V+ +A G + V EHRYYG+S+P +++ + +N L +LT
Sbjct: 111 ETSGVGRLQF--LQKGIVYQLAKATGGVGVILEHRYYGKSLP--TSDFSTKN---LRFLT 163
Query: 171 AEQALADFAVFITNLK------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALA 224
+QALAD F N+K +L+A +P + +GGSY G A++R YP + GA++
Sbjct: 164 TDQALADTVYFAKNVKFAGLEHLDLTAPNTPYIAYGGSYAGAFVAFLRKLYPDVYWGAIS 223
Query: 225 SSA 227
SS
Sbjct: 224 SSG 226
>gi|70982075|ref|XP_746566.1| serine peptidase, family S28 [Aspergillus fumigatus Af293]
gi|66844189|gb|EAL84528.1| serine peptidase, family S28, putative [Aspergillus fumigatus
Af293]
Length = 560
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 76/251 (30%), Positives = 113/251 (45%), Gaps = 37/251 (14%)
Query: 74 EQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW----FAVNSGFVW 129
E +DH + T+ R+ +N D ++ + PI +Y E E+ +S ++
Sbjct: 72 EMLIDHED-PSVGTYRNRFWVNEDFYISGS---PIMVYDIGEATAEYSVSLLTNSSSWLS 127
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ-- 187
+ F AM + EHRYYG S+PY ++ L YLT EQALAD F N +
Sbjct: 128 LLLQEFNAMGIVWEHRYYGGSLPYPVSQD--MPVEHLKYLTTEQALADIPYFAANFSRLN 185
Query: 188 ----NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI-LQFEDIVPPETFY 242
+L+ +P ++ GGSY G+ AA R KYP A ASSAP+ Q V E Y
Sbjct: 186 HPDFDLTPRGTPWIMIGGSYPGIRAAITRNKYPDTIFAAYASSAPVQAQLNMSVYYEQIY 245
Query: 243 N-IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL--------NST 293
IV + + ++C I+ + + +G L +TF L + L NS
Sbjct: 246 RAIVGNGY----SNCTKDIQAALKYI-------DGQLSNKRTFALIKRLFLGPDAEKNSN 294
Query: 294 EDLADWLESAY 304
ED L + Y
Sbjct: 295 EDFTTALVTLY 305
>gi|159122200|gb|EDP47322.1| serine peptidase, family S28, putative [Aspergillus fumigatus
A1163]
Length = 560
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 76/251 (30%), Positives = 113/251 (45%), Gaps = 37/251 (14%)
Query: 74 EQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEW----FAVNSGFVW 129
E +DH + T+ R+ +N D ++ + PI +Y E E+ +S ++
Sbjct: 72 EMLIDHED-PSVGTYRNRFWVNEDFYISGS---PIMVYDIGEATAEYSVSLLTNSSSWLS 127
Query: 130 DIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ-- 187
+ F AM + EHRYYG S+PY ++ L YLT EQALAD F N +
Sbjct: 128 LLLQEFNAMGIVWEHRYYGGSLPYPVSQD--MPVEHLKYLTTEQALADIPYFAANFSRLN 185
Query: 188 ----NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI-LQFEDIVPPETFY 242
+L+ +P ++ GGSY G+ AA R KYP A ASSAP+ Q V E Y
Sbjct: 186 HPDFDLTPRGTPWIMIGGSYPGIRAAITRNKYPDTIFAAYASSAPVQAQLNMSVYYEQIY 245
Query: 243 N-IVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCREL--------NST 293
IV + + ++C I+ + + +G L +TF L + L NS
Sbjct: 246 RAIVGNGY----SNCTKDIQAALKYI-------DGQLSNKRTFALIKRLFLGPDAEKNSN 294
Query: 294 EDLADWLESAY 304
ED L + Y
Sbjct: 295 EDFTTALVTLY 305
>gi|261199212|ref|XP_002626007.1| extracelular serine carboxypeptidase [Ajellomyces dermatitidis
SLH14081]
gi|239594215|gb|EEQ76796.1| extracelular serine carboxypeptidase [Ajellomyces dermatitidis
SLH14081]
Length = 574
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 24/189 (12%)
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSF------ADLPTFSQRYLINTDHWVGPNRLGPI 108
P + + RY + +DHF F RY + H+ + GPI
Sbjct: 39 PSTESDKDDLTARYPVHKIKIPIDHFRSDPRYEPHTEEKFDVRYWFDASHY---KKGGPI 95
Query: 109 FLYCGNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL 166
+ G E G+ + G V ++ + V EHRYYGES P A + +L
Sbjct: 96 IVLHGGETNGEGRLPFLQKGIVKILSEATNGLGVILEHRYYGESFP-----TANLSTESL 150
Query: 167 SYLTAEQALADFAVFITNL------KQNLSAEA--SPVVLFGGSYGGMLAAWMRLKYPHI 218
+LT EQALAD A F N+ +L+A+ +P +++GGSY G A++R++YP I
Sbjct: 151 RFLTTEQALADSAYFAQNVVFEGFEDVDLTAKGGNAPWIIYGGSYAGAQVAFLRVEYPDI 210
Query: 219 AIGALASSA 227
GA++SS
Sbjct: 211 FWGAISSSG 219
>gi|310796030|gb|EFQ31491.1| serine carboxypeptidase S28 [Glomerella graminicola M1.001]
Length = 541
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 76/260 (29%), Positives = 113/260 (43%), Gaps = 22/260 (8%)
Query: 52 TEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLY 111
T+PP + + FEQ +DH + +L TF QR+ N+ W GP PI L+
Sbjct: 24 TQPPFKADDVDGLLSKRALGSFEQLIDH-NDPELGTFQQRFWWNSTFWKGPGS--PIVLF 80
Query: 112 CGNEGDIEWFA---VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSY 168
E D E + + IA G ++ EHR +G S+PY A Q+ L
Sbjct: 81 TPGEEDAEEYTGYLTDRALTGAIAKEIGGAVIMVEHRNWGTSLPY-----ALQDTKNLQQ 135
Query: 169 LTAEQALADFAVFITNLK----QNLSAEA--SPVVLFGGSYGGMLAAWMRLKYPHIAIGA 222
T A+ D F N++ N S+ A +P V GGSY G LAA + P
Sbjct: 136 HTVANAVQDLVYFARNVELPFDTNSSSNAPQAPWVYTGGSYSGYLAAAIAKLAPGTFWAY 195
Query: 223 LASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK 282
ASSAP+ E I +++ + + + F I E +++ G K+ + L K
Sbjct: 196 HASSAPV---EAINYYWSYFLPIQEGMPKNCSRDFERIIEHVDTVLNNGTKDE-IYALKK 251
Query: 283 TFHLCRELNSTEDLADWLES 302
F L ++L +D A L S
Sbjct: 252 KF-LMQDLPHDDDFASALSS 270
>gi|167536011|ref|XP_001749678.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163771826|gb|EDQ85487.1| predicted protein [Monosiga brevicollis MX1]
Length = 495
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/145 (38%), Positives = 76/145 (52%), Gaps = 11/145 (7%)
Query: 85 LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
+ TFS RY + H+ G F G E + + ++ D+A R+ A+ V EH
Sbjct: 97 MGTFSCRYYTSDLHY--DREQGVCFFEMGGEAPNN--GIGNDYIADLAKRYKALQVSIEH 152
Query: 145 RYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAEASPVVLFGGSYG 204
R+YGES+P V L YLT+ QALAD A I ++ N + + FGGSY
Sbjct: 153 RFYGESVPGDDFSV-----DNLHYLTSRQALADAAALIDHV--NRTYHCRKWMAFGGSYS 205
Query: 205 GMLAAWMRLKYPHIAIGALASSAPI 229
G L+AW R KYPHI GAL+SS +
Sbjct: 206 GALSAWFRTKYPHIIDGALSSSGVV 230
>gi|389739084|gb|EIM80278.1| peptidase S28 [Stereum hirsutum FP-91666 SS1]
Length = 550
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/193 (26%), Positives = 88/193 (45%), Gaps = 16/193 (8%)
Query: 71 RYFEQRLDHFSFAD-LPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE-----WFAVN 124
++ + +DHF + +F+ R+ +N ++ GP+FL+ E D E +
Sbjct: 49 QFIQIPIDHFGTTNNTDSFANRFWVNDTYY---ESGGPVFLFDSGEQDAEPLLPYYLQEY 105
Query: 125 SGF--VWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFI 182
G +A R+ + + EHR+YG+S+P+ A A+ +LT EQAL D F
Sbjct: 106 HGLSATMRLAKRYNGLAILWEHRFYGDSLPFPVN--ANTTASQWQFLTTEQALEDVIFFA 163
Query: 183 TNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
N +L +P + GGSY G+ + +R + P ASSAP+ D+ ++Y
Sbjct: 164 NNFNSSLHPSTTPWIFLGGSYPGIRGSLLRQRNPSTIFATWASSAPVQAQVDMA---SYY 220
Query: 243 NIVSSDFKRESAS 255
R ++
Sbjct: 221 KAAERSLTRNCSA 233
>gi|239615378|gb|EEQ92365.1| extracelular serine carboxypeptidase [Ajellomyces dermatitidis
ER-3]
Length = 552
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 24/189 (12%)
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSF------ADLPTFSQRYLINTDHWVGPNRLGPI 108
P + + RY + +DHF F RY + H+ + GPI
Sbjct: 39 PSTESDKDDLTARYPVHKIKIPIDHFRSDPRYEPHTEEKFDVRYWFDASHY---KKGGPI 95
Query: 109 FLYCGNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL 166
+ G E G+ + G V ++ + V EHRYYGES P A + +L
Sbjct: 96 IVLHGGETNGEGRLPFLQKGIVKILSEATNGLGVILEHRYYGESFP-----TANLSTESL 150
Query: 167 SYLTAEQALADFAVFITNL------KQNLSAEA--SPVVLFGGSYGGMLAAWMRLKYPHI 218
+LT EQALAD A F N+ +L+A+ +P +++GGSY G A++R++YP I
Sbjct: 151 RFLTTEQALADSAYFAQNVVFEGFEDVDLTAKGGNAPWIIYGGSYAGAQVAFLRVEYPDI 210
Query: 219 AIGALASSA 227
GA++SS
Sbjct: 211 FWGAISSSG 219
>gi|327290302|ref|XP_003229862.1| PREDICTED: dipeptidyl peptidase 2-like [Anolis carolinensis]
Length = 380
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/82 (48%), Positives = 51/82 (62%), Gaps = 4/82 (4%)
Query: 65 QYRYETRYFEQRLDHFSFADLP--TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA 122
Q +E +YF+Q LDHF+F TF QRYLI W NR PIF Y GNEGD+ F
Sbjct: 27 QTGFEEKYFDQWLDHFNFESYSNRTFPQRYLITDKFWKRGNR--PIFFYTGNEGDVWNFG 84
Query: 123 VNSGFVWDIAPRFGAMLVFPEH 144
N GF+ ++A + GA++VF EH
Sbjct: 85 ENCGFILELAGQQGALVVFAEH 106
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 35/117 (29%), Positives = 57/117 (48%), Gaps = 18/117 (15%)
Query: 279 ELTKTFHLCRELNSTEDLADWLE---SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKID 335
E+++ C +L+S ED+ E +A++ +AM+DYPY +DFM P P++ C+ +
Sbjct: 132 EISRRMSTCEKLSSKEDIYQLFEFSRNAFTMIAMMDYPYKTDFMGHFPANPVKVGCELML 191
Query: 336 NAPDATSILERIFEGVSVYYNYTGNVDCFQLDD------DPHGL------DGWNWQV 380
D L + ++YN TG V CF + DP G D W++Q
Sbjct: 192 ANKDPVRGLAAL---CGLFYNSTGTVPCFDIYKEYRKCADPTGCGTGSDADAWDYQA 245
>gi|312372479|gb|EFR20431.1| hypothetical protein AND_20088 [Anopheles darlingi]
Length = 216
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 38/100 (38%), Positives = 58/100 (58%), Gaps = 3/100 (3%)
Query: 53 EPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYC 112
EPP R ++ T+Y QRLDHF ++ T+S RY+ N +H+ GP+F+Y
Sbjct: 118 EPPIRGGAPTKKAAPVTTKYIMQRLDHFDPQNVNTWSMRYMENGEHYQAG---GPLFIYV 174
Query: 113 GNEGDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMP 152
G E +I +++ G V+D+A + L + EHRYYG+S P
Sbjct: 175 GGEWEISEGSISRGHVYDMAQQLNGYLFYTEHRYYGKSHP 214
>gi|327356595|gb|EGE85452.1| extracelular serine carboxypeptidase [Ajellomyces dermatitidis ATCC
18188]
Length = 567
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 24/189 (12%)
Query: 55 PQRQQRQQQQQYRYETRYFEQRLDHFSF------ADLPTFSQRYLINTDHWVGPNRLGPI 108
P + + RY + +DHF F RY + H+ + GPI
Sbjct: 32 PSTESDKDDLTARYPVHKIKIPIDHFRSDPRYEPHTEEKFDVRYWFDASHY---KKGGPI 88
Query: 109 FLYCGNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTL 166
+ G E G+ + G V ++ + V EHRYYGES P A + +L
Sbjct: 89 IVLHGGETNGEGRLPFLQKGIVKILSEATNGLGVILEHRYYGESFP-----TANLSTESL 143
Query: 167 SYLTAEQALADFAVFITNL------KQNLSAEA--SPVVLFGGSYGGMLAAWMRLKYPHI 218
+LT EQALAD A F N+ +L+A+ +P +++GGSY G A++R++YP I
Sbjct: 144 RFLTTEQALADSAYFAQNVVFEGFEDVDLTAKGGNAPWIIYGGSYAGAQVAFLRVEYPDI 203
Query: 219 AIGALASSA 227
GA++SS
Sbjct: 204 FWGAISSSG 212
>gi|367030635|ref|XP_003664601.1| hypothetical protein MYCTH_2307580 [Myceliophthora thermophila ATCC
42464]
gi|347011871|gb|AEO59356.1| hypothetical protein MYCTH_2307580 [Myceliophthora thermophila ATCC
42464]
Length = 544
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 82/176 (46%), Gaps = 24/176 (13%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWF-------AVNS 125
F+Q +DH + L TF QRY T++W GP PI+L E E F A S
Sbjct: 54 FDQLIDHAN-PSLGTFKQRYWYGTEYWKGPGS--PIYLVSPGEQSGEGFNRTWLTTARLS 110
Query: 126 GFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
G +A + G ++ EHRY+G+S PY E+ +N L YLT + +L D F N
Sbjct: 111 GV---MANQTGGAVIVLEHRYWGQSSPY--DELTVEN---LRYLTLDNSLKDLVYFAKNF 162
Query: 186 ------KQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
+ SA+ +P + GGSY G LA W+ + P +SS + D
Sbjct: 163 APPFDPSGSSSADKAPWIFAGGSYSGALAGWLAAREPGTFWAYYSSSGVVEAIGDF 218
>gi|405962352|gb|EKC28042.1| Dipeptidyl-peptidase 2 [Crassostrea gigas]
Length = 342
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 53/85 (62%), Gaps = 4/85 (4%)
Query: 64 QQYRYETRYFEQRLDHFSFADL--PTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWF 121
Q ++ Y +Q +DHF+F F +RYL+ D W P +GPIF Y GNEG IE F
Sbjct: 20 QTVPFKELYIDQYVDHFNFVSYGETIFKERYLLQ-DQWWKPG-VGPIFFYTGNEGSIEEF 77
Query: 122 AVNSGFVWDIAPRFGAMLVFPEHRY 146
N+GFV+DIAP F A++VF EH +
Sbjct: 78 WDNTGFVFDIAPEFNALVVFAEHMH 102
>gi|452836222|gb|EME38167.1| hypothetical protein DOTSEDRAFT_75982 [Dothistroma septosporum
NZE10]
Length = 554
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 60/174 (34%), Positives = 84/174 (48%), Gaps = 23/174 (13%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI----EWFAVN--SG 126
FEQ +DH S DL TF Q Y +T +W GP P+ L+ E + + + N +G
Sbjct: 69 FEQLIDH-SNPDLGTFEQFYFYDTRYWKGPGF--PVILFTPGEVNATRYYSYLSTNRTTG 125
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
+A GA + EHRY+G S P+ ++ +N + YLT + AL D F N+K
Sbjct: 126 L---LAEEIGAATIVLEHRYWGVSSPFA--DLTTEN---MRYLTLDNALKDMTYFANNVK 177
Query: 187 ------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+ SAE P V+ GGSY G L+AW+ ASSAP+ D
Sbjct: 178 LPFAEHASSSAEDVPWVVMGGSYSGALSAWLASVESGTFWAYHASSAPVEAVSD 231
>gi|403414842|emb|CCM01542.1| predicted protein [Fibroporia radiculosa]
Length = 512
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 93/193 (48%), Gaps = 21/193 (10%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGD---IEWFAVNSGFV 128
YF+Q +DH + L TF QRY +++ GPI + E D E F N+
Sbjct: 52 YFDQLIDHNN-PSLGTFQQRYWQTWEYY---EPGGPIIITTPGEQDADGFEGFLTNATID 107
Query: 129 WDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-- 186
IA + G + EHRYYG S PY + VA +L Y T +QA+ DF F N++
Sbjct: 108 GLIAQQQGGATIVLEHRYYGLSNPYNNLSVA-----SLQYHTIQQAIDDFDYFAYNVELA 162
Query: 187 ----QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFY 242
+++ +P VL GGSY G L ++ ++ P + A ASS + E IV ++
Sbjct: 163 MPGGDHVTPNEAPWVLIGGSYAGALTSFTKVNKPDLFWAAWASSGVV---ESIVNYWGYF 219
Query: 243 NIVSSDFKRESAS 255
+I+ + +S
Sbjct: 220 DIIRKHMPQNCSS 232
>gi|402080438|gb|EJT75583.1| hypothetical protein GGTG_05516 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 545
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 67/227 (29%), Positives = 96/227 (42%), Gaps = 22/227 (9%)
Query: 67 RYETR--YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE---WF 121
RY+ R YF+Q +DH S L TF QRY +T + GP P+ ++ E E +
Sbjct: 52 RYDIRLGYFDQLIDH-SNPSLGTFKQRYWWDTTFYQGPGS--PVSMFSMGESTAEPYLAY 108
Query: 122 AVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF 181
N V IA G + EHRY+GES P+ V L Y T A+AD F
Sbjct: 109 LTNRSMVGMIAQAVGGATIMLEHRYWGESSPFSDLTV-----HNLQYHTLNNAIADHTYF 163
Query: 182 ITNLKQNLSAEA------SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDI 235
N+K E +P V+ GGSY G L+AW+ P ASS + D
Sbjct: 164 ARNVKLPFDPEGKSAPSKAPWVMTGGSYAGALSAWIERLDPGTFWAYYASSPVVEAVGDF 223
Query: 236 VPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTK 282
+++ V R ++ + E +++ G + L TK
Sbjct: 224 W---RYFSPVQEGMPRNCSADVRKVIEHVDKVLLTGSPQEKLAIKTK 267
>gi|315052106|ref|XP_003175427.1| hypothetical protein MGYG_02952 [Arthroderma gypseum CBS 118893]
gi|311340742|gb|EFQ99944.1| hypothetical protein MGYG_02952 [Arthroderma gypseum CBS 118893]
Length = 545
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 64/226 (28%), Positives = 101/226 (44%), Gaps = 21/226 (9%)
Query: 88 FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA--VNSGFVWDIAPRFGAMLVFPEHR 145
F RY + H+ GP+ + G E D + G + +A + V EHR
Sbjct: 74 FDLRYWFDASHY---KEGGPVIILHGGETDGAGRIPFLQKGILAQLAQATNGIGVIMEHR 130
Query: 146 YYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK------QNLSAEASPVVLF 199
YYG S+P + N +L +LT EQALAD A F N+K NL+A + +L+
Sbjct: 131 YYGGSLPTRD----FSN-KSLRFLTTEQALADTAYFSKNIKFPGLEKYNLTAPGTAHILY 185
Query: 200 GGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNT 259
GGSY G A++R +YP I GA++SS D ++ + + ++
Sbjct: 186 GGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDYW---QYFEPIRQEAPQDCVHVTQN 242
Query: 260 IKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
+ ++ G+ N + EL F L R ++ D A+ L S +
Sbjct: 243 FVDIVDNIIIHGKNANTIKELKNLFGLGRLRDA--DFANALSSGIT 286
>gi|303319727|ref|XP_003069863.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240109549|gb|EER27718.1| Serine carboxypeptidase S28 family protein [Coccidioides posadasii
C735 delta SOWgp]
gi|320034152|gb|EFW16097.1| conserved hypothetical protein [Coccidioides posadasii str.
Silveira]
Length = 554
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 53/155 (34%), Positives = 78/155 (50%), Gaps = 22/155 (14%)
Query: 88 FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV-----NSGFVWDIAPRFGAMLVFP 142
+ RY +N D + P GP+ ++ E + + FA + ++ + F + +
Sbjct: 85 YRNRYWVN-DQYYQPG--GPVVIFDTGETNGQAFADYYLVDPTSYIVQLLREFHGVGLVW 141
Query: 143 EHRYYGESMPY---GSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ------NLSAEA 193
EHRYYGES+PY G T A YLT EQAL D F ++ +L+ +
Sbjct: 142 EHRYYGESLPYPVNGQTSAA-----QFQYLTLEQALQDLPYFARTFRRPRLPNADLTPRS 196
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAP 228
+P ++ GGSY GM AA+ RLKYP A +SSAP
Sbjct: 197 TPWIMVGGSYPGMRAAFSRLKYPDTIFAAFSSSAP 231
>gi|406866881|gb|EKD19920.1| putative endoprotease [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 549
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 58/174 (33%), Positives = 83/174 (47%), Gaps = 31/174 (17%)
Query: 72 YFEQRLDHFSFADLP---TFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA------ 122
+F Q LDH D P TF Q+Y N++ W GP P+ + E +
Sbjct: 54 FFTQLLDH----DDPSKGTFQQKYWWNSEFWAGPGS--PVVFFTPGEAAAAPYGSYLTNV 107
Query: 123 -VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVF 181
V+ F ++ GA+++F EHRYYG+S PY + + A TL LT Q++ DF F
Sbjct: 108 TVSGLFAQEVQ---GAVILF-EHRYYGDSSPYDTLD-----AETLQLLTLHQSMQDFTYF 158
Query: 182 IT------NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
++ + +A +P V GGSY G LAAW +P ASSAP+
Sbjct: 159 ANTAALPFDINHSSNANKAPWVFTGGSYSGALAAWTEKLFPGTFWAYHASSAPV 212
>gi|390179599|ref|XP_003736935.1| GA11106, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859911|gb|EIM53008.1| GA11106, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 396
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 60/190 (31%), Positives = 87/190 (45%), Gaps = 40/190 (21%)
Query: 131 IAPRFGAMLVFPEHRYYGESMPYG------STEVAYQNAT------------TLSYLTAE 172
+A ML + EHRYYG+S+P+G T+ + N L +L+
Sbjct: 1 MAVENSGMLFYTEHRYYGQSLPHGWVCLLILTDRSSFNGVFLWNRKESFRVDKLQHLSIY 60
Query: 173 QALADFAVFITNLK-QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQ 231
Q+LAD A FI K +N + S V+L GGSY G + AWM YP + + ASSAP+L
Sbjct: 61 QSLADLAHFIRFQKSENPRMKQSEVILVGGSYSGSMVAWMTQLYPDLIAASWASSAPLL- 119
Query: 232 FEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTF------H 285
+DF N+I+ S+G+ + + + G LTK F
Sbjct: 120 -------------AKADFHEYMEVASNSIRLSYGQNCTT-RIQKGFQHLTKLFEENQIPE 165
Query: 286 LCRELNSTED 295
L ++LN ED
Sbjct: 166 LLQKLNGCED 175
>gi|326482280|gb|EGE06290.1| extracelular serine carboxypeptidase [Trichophyton equinum CBS
127.97]
Length = 545
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 64/224 (28%), Positives = 101/224 (45%), Gaps = 21/224 (9%)
Query: 88 FSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEHR 145
F+ RY + H+ GP+ + G E G+ + G + +A + V EHR
Sbjct: 74 FNLRYWFDASHY---KEGGPVIILHGGETSGEGRIPFLQKGILAQLAQATNGIGVIMEHR 130
Query: 146 YYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK------QNLSAEASPVVLF 199
YYG S+P + N +L +LT EQALAD A F N+K NL+A + +L+
Sbjct: 131 YYGGSLPTRD----FSN-KSLRFLTTEQALADTAYFSKNIKFPGLEKYNLTAPGTAHILY 185
Query: 200 GGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNT 259
GGSY G A++R +YP I GA++SS D ++ + + ++
Sbjct: 186 GGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDYW---QYFEPIRQEAPQDCVHVTQN 242
Query: 260 IKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESA 303
+ ++ G+ N EL F L R ++ D A+ L S
Sbjct: 243 FVDIVDNIIIHGKNANTTRELKNLFGLGRLRDA--DFANALSSG 284
>gi|326475086|gb|EGD99095.1| hypothetical protein TESG_06450 [Trichophyton tonsurans CBS 112818]
Length = 565
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 64/226 (28%), Positives = 102/226 (45%), Gaps = 21/226 (9%)
Query: 88 FSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEHR 145
F+ RY + H+ GP+ + G E G+ + G + +A + V EHR
Sbjct: 74 FNLRYWFDASHY---KEGGPVIILHGGETSGEGRIPFLQKGILAQLAQATNGIGVIMEHR 130
Query: 146 YYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK------QNLSAEASPVVLF 199
YYG S+P + N +L +LT EQALAD A F N+K NL+A + +L+
Sbjct: 131 YYGGSLPTRD----FSN-KSLRFLTTEQALADTAYFSKNIKFPGLEKYNLTAPGTAHILY 185
Query: 200 GGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNT 259
GGSY G A++R +YP I GA++SS D ++ + + ++
Sbjct: 186 GGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDYW---QYFEPIRQEAPQDCVHVTQN 242
Query: 260 IKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
+ ++ G+ N EL F L R ++ D A+ L S +
Sbjct: 243 FVDIVDNIIIHGKNANTTRELKNLFGLGRLRDA--DFANALSSGIT 286
>gi|71987309|ref|NP_001023156.1| Protein F28E10.5 [Caenorhabditis elegans]
gi|373219726|emb|CCD69731.1| Protein F28E10.5 [Caenorhabditis elegans]
Length = 229
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 80/170 (47%), Gaps = 17/170 (10%)
Query: 75 QRLDHFSFA-DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDI------EWFAVNSGF 127
Q++DHFS + + QRY N+ + G +FL G EG I +W
Sbjct: 58 QKVDHFSNGTNNGVWRQRYQYNSKFY--NKTTGYVFLMLGGEGSINVTNGDKWVRHEGET 115
Query: 128 VWDIAPRFGAMLVFPEHRYYG--ESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL 185
+ F A EHR+YG E P G Q ++ LT +QALAD FIT +
Sbjct: 116 MMKWVAEFQAAAFQVEHRFYGSKEYSPIGD-----QTTASMKLLTIDQALADIKEFITQM 170
Query: 186 KQNLSAEASPV-VLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+ P+ V FGGSY G L+A+ R YP + GA++SS+ + F D
Sbjct: 171 NALYFKDDKPIWVTFGGSYPGSLSAFFRETYPEMTAGAVSSSSAVHVFVD 220
>gi|302500786|ref|XP_003012386.1| extracelular serine carboxypeptidase, putative [Arthroderma
benhamiae CBS 112371]
gi|291175944|gb|EFE31746.1| extracelular serine carboxypeptidase, putative [Arthroderma
benhamiae CBS 112371]
Length = 716
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 69/260 (26%), Positives = 111/260 (42%), Gaps = 27/260 (10%)
Query: 60 RQQQQQYRYETRYFEQRLDHFSFAD------LPTFSQRYLINTDHWVGPNRLGPIFLYCG 113
R Q + + +DHF + F+ RY + H+ GP+ + G
Sbjct: 40 RSQDVNAAFPAHTIQIPIDHFPKSSRYEPHTTEKFNLRYWFDASHY---KEGGPVIILHG 96
Query: 114 NE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTA 171
E G+ + G + +A + V EHRYYG S+P + N +L +LT
Sbjct: 97 GETSGEGRIPFLQKGILAQLAQATNGIGVIMEHRYYGGSLPTPD----FSN-KSLRFLTT 151
Query: 172 EQALADFAVFITNLK------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALAS 225
EQALAD A F N+K NL+A + +L+GGSY G A++R +YP I GA++S
Sbjct: 152 EQALADTAYFSKNIKFPGLEKYNLTAPGTAHILYGGSYAGGQVAFLRTQYPDIFWGAISS 211
Query: 226 SAPILQFEDIVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFH 285
S D ++ + + ++ + ++ G+ N EL F
Sbjct: 212 SGVTKAIYDYW---QYFEPIRQEAPQDCVHVTQNFVDIVDNIIINGKNANTTRELKNLFG 268
Query: 286 LCRELNSTEDLADWLESAYS 305
L R ++ D A+ L S +
Sbjct: 269 LGRLRDA--DFANALSSGIT 286
>gi|451992925|gb|EMD85401.1| hypothetical protein COCHEDRAFT_1198878 [Cochliobolus
heterostrophus C5]
Length = 542
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 78/168 (46%), Gaps = 16/168 (9%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV-----NSG 126
Y E LDHF + F RY +NT + GP+F+Y E + + +
Sbjct: 46 YIELPLDHFG-SSAGNFRNRYWVNTKSY---KLGGPVFIYDVGEANANTSSQFRLRDETS 101
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
F I F + + EHR+YG+S P T A +LT+EQALAD F
Sbjct: 102 FFKQIVDEFNGIGIVWEHRFYGDSSPV--TISIDTPAEAFRFLTSEQALADVDRFAKQFS 159
Query: 187 QN-----LSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+ L+ + +P V GGSY GM AA+MR YP + A+SAP+
Sbjct: 160 RKEINATLTPDRTPWVFIGGSYPGMRAAFMRNLYPDTIYASWAASAPV 207
>gi|443896181|dbj|GAC73525.1| hydrolytic enzymes of the alpha/beta hydrolase fold [Pseudozyma
antarctica T-34]
Length = 655
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 66/189 (34%), Positives = 94/189 (49%), Gaps = 15/189 (7%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHW--VGPNRLG---PIFLYCGNEGDIEWFA- 122
E Y Q LDHF F QR+ +T H+ + G PI++ E D
Sbjct: 138 EPAYHTQPLDHFDNTTQAQFQQRFFYSTRHYKPASARKHGEAVPIYILDSGEADARARIP 197
Query: 123 -VNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA---TTLSYLTAEQALADF 178
+++G + ++ G + + EHRYYG S+P TE+ A L +LT +QAL D
Sbjct: 198 FLDTGILDILSEATGGIGIVLEHRYYGTSLP-NRTELGPGEAWGVDQLRWLTNKQALEDS 256
Query: 179 AVFIT--NLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIV 236
A FI N+ +AE V+ +GGSY G +A MRL YP + GA+ASSA + ++
Sbjct: 257 ADFIRHLNIPGTDNAEKRKVIYYGGSYPGARSAHMRLLYPELVHGAIASSAVVAAVDEF- 315
Query: 237 PPETFYNIV 245
PE FY I
Sbjct: 316 -PEYFYPIA 323
>gi|453089984|gb|EMF18024.1| peptidase S28 [Mycosphaerella populorum SO2202]
Length = 555
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 77/157 (49%), Gaps = 17/157 (10%)
Query: 87 TFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEH 144
TF+ RY + H+ GP+ + E G + G + +A G + V EH
Sbjct: 77 TFNLRYWFDASHYKAG---GPVIILQSGETSGKARLPFLQKGLLAQMAEATGGIAVVLEH 133
Query: 145 RYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-------QNLSAEASPVV 197
RYYG S P V + +LT EQA+AD A F N++ +L+A+ +P +
Sbjct: 134 RYYGTSYP-----VPDLSTENFRFLTTEQAMADEAYFAANIQFPGLEEHGDLTAKTTPYI 188
Query: 198 LFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+GGSY G A++R++YP I GA++SS +D
Sbjct: 189 GYGGSYAGAFNAFLRVQYPDIFWGAISSSGVTKAIDD 225
>gi|189209519|ref|XP_001941092.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187977185|gb|EDU43811.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 543
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 67/213 (31%), Positives = 99/213 (46%), Gaps = 30/213 (14%)
Query: 26 LSPLSLAAQPSKFRRAPRFVGKLPHLTEP----PQRQQRQQQQQYRYETRYFEQRLDHFS 81
LSP SLA + + +R R + K T+P P+R + ETRY + F+
Sbjct: 22 LSPRSLAERDEQVQR--RSLAKRAD-TDPTLLYPERNISVPVDFFHNETRYEPHSNESFN 78
Query: 82 FADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWDIAPRFGAML 139
RY + D + P GP+F+ G E G + G V + +
Sbjct: 79 L--------RYWFD-DTYYKPG--GPVFVLLGGETNGAGRLPFLQKGIVHQVIKATNGLG 127
Query: 140 VFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK-----QNLSAEAS 194
V EHRYYG+S P V + +LT EQ+LA+ F N+K +L+A +
Sbjct: 128 VILEHRYYGKSFP-----VPDLTTKNMRFLTTEQSLAEIDYFAKNVKFEGIDADLTAPNT 182
Query: 195 PVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA 227
P V++GGSY G AA++R+ YP GA++SS
Sbjct: 183 PWVVYGGSYAGAQAAFLRVVYPETFWGAISSSG 215
>gi|134081410|emb|CAK46451.1| unnamed protein product [Aspergillus niger]
Length = 542
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 83/173 (47%), Gaps = 19/173 (10%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGD-----IEWFAV 123
ET Y +DH + A T+ R+ + +D + P PIF+Y E D +
Sbjct: 48 ETEYATIPIDHNN-ASAGTYQNRFWV-SDEFYQPGN--PIFVYDTGESDGGSIAQSYLTS 103
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA-TTLSYLTAEQALADFAVFI 182
F + F AM + EHRYYG S P V+Y+ YLT +QALAD F
Sbjct: 104 TLSFFREFLIEFNAMGIAWEHRYYGNSTP---APVSYETPPEAWQYLTTKQALADLPYFA 160
Query: 183 TNLKQ------NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+N + +L+ + +P ++ GGSY G+ AA R +YP A +SS+P+
Sbjct: 161 SNFSREKYPDMDLTPQGTPWIMVGGSYAGIRAALTRKEYPETIFAAFSSSSPV 213
>gi|302664611|ref|XP_003023934.1| extracellular serine carboxypeptidase, putative [Trichophyton
verrucosum HKI 0517]
gi|291187955|gb|EFE43316.1| extracellular serine carboxypeptidase, putative [Trichophyton
verrucosum HKI 0517]
Length = 536
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 64/226 (28%), Positives = 102/226 (45%), Gaps = 21/226 (9%)
Query: 88 FSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIEWFAVNSGFVWDIAPRFGAMLVFPEHR 145
F+ RY + H+ GP+ + G E G+ + G + +A + V EHR
Sbjct: 74 FNLRYWFDASHY---KEGGPVIILHGGETSGEGRIPFLQKGILAQLAQATNGIGVIMEHR 130
Query: 146 YYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK------QNLSAEASPVVLF 199
YYG S+P + N +L +LT EQALAD A F N+K NL+A + +L+
Sbjct: 131 YYGGSLPTPD----FSN-KSLRFLTTEQALADTAYFSKNIKFPGLEKYNLTAPGTAHILY 185
Query: 200 GGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIVSSDFKRESASCFNT 259
GGSY G A++R +YP I GA++SS D ++ + + ++
Sbjct: 186 GGSYAGGQVAFLRTQYPDIFWGAISSSGVTKAIYDYW---QYFEPIRQEAPQDCVHVTQN 242
Query: 260 IKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDLADWLESAYS 305
+ ++ G+ N EL F L R ++ D A+ L S +
Sbjct: 243 FVDIVDNIIIHGKNANTTRELKNLFGLGRLRDA--DFANALSSGIT 286
>gi|317034919|ref|XP_001400740.2| serine peptidase, family S28 [Aspergillus niger CBS 513.88]
Length = 562
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 83/173 (47%), Gaps = 19/173 (10%)
Query: 69 ETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGD-----IEWFAV 123
ET Y +DH + A T+ R+ + +D + P PIF+Y E D +
Sbjct: 68 ETEYATIPIDHNN-ASAGTYQNRFWV-SDEFYQPGN--PIFVYDTGESDGGSIAQSYLTS 123
Query: 124 NSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNA-TTLSYLTAEQALADFAVFI 182
F + F AM + EHRYYG S P V+Y+ YLT +QALAD F
Sbjct: 124 TLSFFREFLIEFNAMGIAWEHRYYGNSTP---APVSYETPPEAWQYLTTKQALADLPYFA 180
Query: 183 TNLKQ------NLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
+N + +L+ + +P ++ GGSY G+ AA R +YP A +SS+P+
Sbjct: 181 SNFSREKYPDMDLTPQGTPWIMVGGSYAGIRAALTRKEYPETIFAAFSSSSPV 233
>gi|170045815|ref|XP_001850489.1| prolylcarboxypeptidase [Culex quinquefasciatus]
gi|167868717|gb|EDS32100.1| prolylcarboxypeptidase [Culex quinquefasciatus]
Length = 490
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 59/232 (25%), Positives = 110/232 (47%), Gaps = 23/232 (9%)
Query: 72 YFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN-SGFVWD 130
+F R+++ + ++ RY T+ + GPI ++ G I+ + ++ S ++D
Sbjct: 34 FFTTRVNNLNPQRNERWTMRYFSVTEFYEAG---GPILIWLGGNAPIQEYMIDESSLLYD 90
Query: 131 IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNL- 189
+A + + E R+YG++ V +L L+ Q + D A F+T L++N+
Sbjct: 91 LARQMNGAIFAFESRFYGQNRATEDVTVE-----SLYLLSTYQIMGDLAEFVTYLRRNVV 145
Query: 190 SAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSA---PILQFEDIVPP--ETFYNI 244
E +PV++ G YGG LA W R+ YPH+A A +S ++ F + +T N
Sbjct: 146 HDEDAPVLVSGAGYGGALATWFRVHYPHLADAAWSSGGTHKAVMSFSEYAEAWGQTLINY 205
Query: 245 VSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTEDL 296
E F+ ++ L+ +G +E L + F+LC E+N ++L
Sbjct: 206 GGQACYNEIFVAFHVMQ----YLIDMGLEEI----LFEKFNLCTEINLQDNL 249
>gi|313212780|emb|CBY36704.1| unnamed protein product [Oikopleura dioica]
Length = 395
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 46/164 (28%), Positives = 78/164 (47%), Gaps = 13/164 (7%)
Query: 192 EASPVVLFGG------SYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPETFYNIV 245
+ +P+ + G S+ + L YP++ GALA+S+P+ + F+
Sbjct: 58 KTAPIFFYTGNEGPIDSFAANTGKFNELTYPNLVNGALAASSPVYWISAMGDSHGFWVKT 117
Query: 246 SSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTED----LADWLE 301
+ DF C +TI+ + L + + + +TKT C+ N TED + W
Sbjct: 118 TEDFSTALDKCEDTIRAGFAALDKM-KNDKDWAGITKTMRTCQ--NITEDNYMHMLGWAR 174
Query: 302 SAYSYLAMVDYPYPSDFMMPLPGYPIREVCKKIDNAPDATSILE 345
+A + +AM+DYPYP++F LPG P++ C + A SI E
Sbjct: 175 NAMATMAMMDYPYPTNFEAALPGNPVKASCVRAMAETGADSIRE 218
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 28/60 (46%), Positives = 42/60 (70%), Gaps = 1/60 (1%)
Query: 67 RYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSG 126
+YET++ +QR+D+F++ T+ RYL+NTD +V + PIF Y GNEG I+ FA N+G
Sbjct: 22 KYETKWIDQRVDNFNYYLNKTYKMRYLVNTD-FVKDEKTAPIFFYTGNEGPIDSFAANTG 80
>gi|347840247|emb|CCD54819.1| similar to extracelular serine carboxypeptidase (secreted protein)
[Botryotinia fuckeliana]
Length = 530
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 65/243 (26%), Positives = 110/243 (45%), Gaps = 26/243 (10%)
Query: 68 YETRYFEQRLDHF----SFA--DLPTFSQRYLINTDHWVGPNRLGPIFLYCGNE--GDIE 119
Y +Q +DHF ++A TF QRY + ++ GP++LY G E G
Sbjct: 44 YTAHTIDQPIDHFPNDPAYAPHTNATFKQRYWYDAKYY---KPGGPVYLYIGGETNGQNR 100
Query: 120 WFAVNSGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFA 179
+ + +G + + + + E+RYYG+S P+ ++ L+YLT +Q +AD A
Sbjct: 101 FSNLQTGIIQILMEATNGLGIILENRYYGQSWPFNTSTT-----DNLAYLTNQQTVADNA 155
Query: 180 VF-----ITNLKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
F + L +++A + +L+GGS G A YP + G +A+SAP+ +
Sbjct: 156 YFAQHVSLPGLNASITAPDTKWILYGGSLAGGQTALSVKIYPDVLFGGIAASAPV---KT 212
Query: 235 IVPPETFYNIVSSDFKRESASCFNTIKESWGELVSVGQKENGLLELTKTFHLCRELNSTE 294
+V +YN + ++ S N I + + LV+V + E F L L
Sbjct: 213 VVGYPEWYNPIQRLAPQDCISSINGIIDKFDALVAVNNTR-AIREFKSLFGL-EALTDNR 270
Query: 295 DLA 297
D A
Sbjct: 271 DFA 273
>gi|345560973|gb|EGX44090.1| hypothetical protein AOL_s00210g251 [Arthrobotrys oligospora ATCC
24927]
Length = 635
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 55/206 (26%), Positives = 92/206 (44%), Gaps = 25/206 (12%)
Query: 77 LDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFA----VNSGFVWDIA 132
+DH F+ Y + H+ + GPIFLY E + + +N V+D+
Sbjct: 51 VDHQGPRTGEQFNLTYWVYDKHY---KKGGPIFLYLSGETTLSDYVAGTFLNGSRVYDLQ 107
Query: 133 PRFGAMLVFPEHRYYGESMP-----YGSTEVAYQN-ATTLSYLTAEQALADFAVFITNLK 186
+FG + + +HRYYG+S P G++ + A L YL + AL D N
Sbjct: 108 EKFGGLGIALQHRYYGDSTPQSAWGTGASGITIDTPAEKLRYLRTDLALQDVKFLADNFN 167
Query: 187 ---------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVP 237
+L + SP V+ GGSY G +A+++R YP A AS AP+ + ++P
Sbjct: 168 YTSERVPAGTDLRGKGSPWVVLGGSYAGNMASYLRKLYPDTFFAAYASGAPV-EARTMMP 226
Query: 238 PETFYNIVSSDFKRESASCFNTIKES 263
+++IV+ +C + +
Sbjct: 227 --MYWDIVAKSIGSTEPACVKNMNSA 250
>gi|440636097|gb|ELR06016.1| hypothetical protein GMDG_07727 [Geomyces destructans 20631-21]
Length = 546
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 74/174 (42%), Gaps = 23/174 (13%)
Query: 73 FEQRLDHFSFADLP---TFSQRYLINTDHWVGPNRLGPIFLYCGNE---GDIEWFAVNSG 126
F Q LDH DLP TF Q N++HW GP PI L+ E + E + N+
Sbjct: 49 FTQLLDH----DLPHGDTFGQHVWWNSEHWGGPGS--PIILFTPGETAADEYEGYLTNAT 102
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
A +V EHRY+GES PY A L LT ++ADF +
Sbjct: 103 LTGKFAQEVNGAVVMVEHRYWGESSPY-----ADLTGHNLKQLTLRNSIADFVRIAATAQ 157
Query: 187 ------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
A +P ++ GGSY G L+AW P +SSAP+ +D
Sbjct: 158 LPFDPSHKSDAAHAPWIMMGGSYAGSLSAWTESVSPGTFWAYHSSSAPVEAIDD 211
>gi|119183270|ref|XP_001242695.1| hypothetical protein CIMG_06591 [Coccidioides immitis RS]
gi|392865604|gb|EAS31403.2| serine peptidase, family S28 [Coccidioides immitis RS]
Length = 554
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 53/155 (34%), Positives = 78/155 (50%), Gaps = 22/155 (14%)
Query: 88 FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAV-----NSGFVWDIAPRFGAMLVFP 142
+ RY +N D + P GP+ ++ E + + FA + ++ + F + +
Sbjct: 85 YRNRYWVN-DEYYQPG--GPVVIFDTGETNGQAFADYYLVDPTSYIVQLLREFHGVGLVW 141
Query: 143 EHRYYGESMPY---GSTEVAYQNATTLSYLTAEQALADFAVFITNLKQ------NLSAEA 193
EHRYYGES+PY G T A YLT EQAL D F ++ +L+ +
Sbjct: 142 EHRYYGESLPYPVNGQTSAA-----QFQYLTLEQALQDLPYFARTFRRPRLPNADLTPRS 196
Query: 194 SPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAP 228
+P ++ GGSY GM AA+ RLKYP A +SSAP
Sbjct: 197 TPWIMVGGSYPGMRAAFSRLKYPDTIFAAFSSSAP 231
>gi|380489886|emb|CCF36399.1| serine carboxypeptidase S28, partial [Colletotrichum higginsianum]
Length = 553
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 55/200 (27%), Positives = 89/200 (44%), Gaps = 24/200 (12%)
Query: 36 SKFRRAPRFVGKLPHLTEPPQRQQRQQQQQYRYETRYFEQRLDHFSFADLPTFSQRYLIN 95
S FR P V E P+ + ++RY +HF+ R+ +
Sbjct: 38 STFRSQPESVSAYSEYPEYPEYNISVPVDHFHNDSRYEPHSDEHFNL--------RFWFD 89
Query: 96 TDHWVGPNRLGPIFLYCGNEGDIEWFA--VNSGFVWDIAPRFGAMLVFPEHRYYGESMPY 153
H+ + GP+ + E D + ++ G + + G + V EHRYYG+S P
Sbjct: 90 AKHY---RKGGPVIILAAGETDAKERLPFLDHGILSILTEATGGVGVVLEHRYYGKSFP- 145
Query: 154 GSTEVAYQNATTLSYLTAEQALADFAVFITNLK------QNLSAEASPVVLFGGSYGGML 207
V + L +L+ +QALAD A F ++ NL+A +P + +GGSY G
Sbjct: 146 ----VPDLSTENLRFLSTDQALADTAYFAKHISFPGHEDLNLTAPGTPYLAYGGSYAGAF 201
Query: 208 AAWMRLKYPHIAIGALASSA 227
AA++R YP + G ++SS
Sbjct: 202 AAFLRKLYPEVFWGGISSSG 221
>gi|206598107|gb|ACI15917.1| serine carboxylase [Bodo saltans]
Length = 461
Score = 74.3 bits (181), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 76/165 (46%), Gaps = 10/165 (6%)
Query: 65 QYRYETRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVN 124
Q YFEQ +DH S A L TF QR+ + + N+ LY EG+
Sbjct: 26 QRNMTVNYFEQLIDH-SNAALGTFQQRWWGDLSAFT--NQSEYAMLYINGEGEAH--GSP 80
Query: 125 SGFVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITN 184
G+ A + EHRYYGESMP T N + L+YLT E ALAD F
Sbjct: 81 DGYPAVYGRNISAAMFGLEHRYYGESMPAPLT-----NRSMLNYLTVENALADLEAFRLY 135
Query: 185 LKQNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPI 229
L+ + + + GGSY G L+AW + YP + A +SS +
Sbjct: 136 LQATVLKKEVKWFICGGSYSGALSAWSKATYPASYLAAWSSSGVV 180
>gi|242213798|ref|XP_002472725.1| predicted protein [Postia placenta Mad-698-R]
gi|220728128|gb|EED82028.1| predicted protein [Postia placenta Mad-698-R]
Length = 528
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 61/185 (32%), Positives = 91/185 (49%), Gaps = 21/185 (11%)
Query: 70 TRYFEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGD---IEWFAVNSG 126
T YFEQ +DH + L TFSQRY +T + P GPI + E D E + N
Sbjct: 63 TYYFEQLIDHNN-PSLGTFSQRYW-HTWEFYEPG--GPIIITTPGEQDADGFEGYLTNLT 118
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
+ IA + EHRYYG S PY + VA +L Y T +QA+ DF F N+K
Sbjct: 119 IMGQIAQEQNGATIVLEHRYYGYSNPYNNLSVA-----SLKYHTIQQAIDDFDYFAYNVK 173
Query: 187 ------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET 240
+++ +P +L GGSY G L ++ ++ P + A +SS + E I+
Sbjct: 174 LAMPRGDHVTPAKAPWILVGGSYAGALTSFTKVNKPDLFWAAWSSSGVV---ESIINYWG 230
Query: 241 FYNIV 245
+++I+
Sbjct: 231 YFDII 235
>gi|429862127|gb|ELA36786.1| serine peptidase, putative [Colletotrichum gloeosporioides Nara
gc5]
Length = 335
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 59/174 (33%), Positives = 74/174 (42%), Gaps = 23/174 (13%)
Query: 73 FEQRLDHFSFADLPT---FSQRYLINTDHWVGPNRLGPIFLYCGNEGDIE---WFAVNSG 126
F Q +DH PT F QRY N W GP PIF+ E D E W +N
Sbjct: 49 FTQLIDHTD----PTKGCFEQRYWHNAKIWGGPGY--PIFMVNVGEADAEEYAWHLINQR 102
Query: 127 FVWDIAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLK 186
+ + +V EHRYYG+ P + A TL Y T Q+L D F +K
Sbjct: 103 LAYLYGEKLQGAVVLFEHRYYGKPQPLKT-----MTAETLQYHTVPQSLQDNKRFAQTVK 157
Query: 187 ------QNLSAEASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFED 234
+ + + SP VL GGSY G L AW + P + ASSA I D
Sbjct: 158 FGFDNCGSANVDKSPWVLIGGSYAGALPAWQSVITPGVFAAHHASSAVIHAIGD 211
>gi|339232872|ref|XP_003381553.1| serine carboxypeptidase S28 family protein [Trichinella spiralis]
gi|316979630|gb|EFV62393.1| serine carboxypeptidase S28 family protein [Trichinella spiralis]
Length = 484
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 100/209 (47%), Gaps = 26/209 (12%)
Query: 75 QRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGD--IEWFAVNSGFVWDIA 132
QRL HF T++Q + H + G + +Y + + ++G + +I+
Sbjct: 60 QRLSHFDSTINQTWNQSSTVCDLH---HQKGGAVVVYIQSRDSPSVPSCTYSAGLLSEIS 116
Query: 133 PRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNLKQNLSAE 192
+ A++V R++G + P GS V L YL+ E+ LAD A + +L+
Sbjct: 117 KQLNAVVVTFVPRFFGINKPTGSASV-----DNLKYLSVEEVLADLAHLVHSLRSKYPDS 171
Query: 193 ASPVVLFGGSYGGMLAAWMRLKYPHIAIGALASSAPILQFEDIVPPET---FYNIVSSDF 249
VV+ G ++GG LA W RLKYPH+ GA+AS AP+ ET F +V F
Sbjct: 172 GKTVVV-GTAHGGNLAIWFRLKYPHLCDGAIASGAPL---------ETTLGFGRLVDGIF 221
Query: 250 KRESA---SCFNTIKESWGELVSVGQKEN 275
+R +C +++S+ +L + ++ N
Sbjct: 222 ERLDNIRPNCARALRDSFAQLSLLFEERN 250
>gi|336258109|ref|XP_003343875.1| hypothetical protein SMAC_09286 [Sordaria macrospora k-hell]
gi|380087075|emb|CCC05489.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 547
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 56/156 (35%), Positives = 74/156 (47%), Gaps = 22/156 (14%)
Query: 73 FEQRLDHFSFADLPTFSQRYLINTDHWVGPNRLGPIFLYCGNEGDIEWFAVNSGFVWD-- 130
F+Q +DH + +L TF QR+ HW GP PI L N G+ N ++ D
Sbjct: 54 FDQLIDHDT-PELGTFKQRFWYGFQHWKGPGS--PIILV--NPGEQAADGFNKSYLTDQR 108
Query: 131 ----IAPRFGAMLVFPEHRYYGESMPYGSTEVAYQNATTLSYLTAEQALADFAVFITNL- 185
+A GA +V EHRY+GES PY V L YLT E +L D F ++
Sbjct: 109 LAGWMAKDIGAAVVIMEHRYWGESSPYDQLTV-----NNLQYLTLENSLKDINYFAEHIE 163
Query: 186 ----KQNLSAEA-SPVVLFGGSYGGMLAAWMRLKYP 216
+ N S A +P + GGSY G LA W+ YP
Sbjct: 164 LPFDETNGSKPANAPWIFTGGSYSGALAGWLEALYP 199
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.137 0.427
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,511,062,621
Number of Sequences: 23463169
Number of extensions: 281705016
Number of successful extensions: 1075456
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1055
Number of HSP's successfully gapped in prelim test: 256
Number of HSP's that attempted gapping in prelim test: 1070967
Number of HSP's gapped (non-prelim): 1435
length of query: 380
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 236
effective length of database: 8,980,499,031
effective search space: 2119397771316
effective search space used: 2119397771316
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 78 (34.7 bits)