BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy3964
(65 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
SV=1
Length = 321
Score = 72.4 bits (176), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 34/60 (56%), Positives = 42/60 (70%), Gaps = 1/60 (1%)
Query: 7 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 65
SP LDH VLAVGYG K YW VKNSW + WG+ GY+ MS +DNNCG+ + P+Y T+
Sbjct: 262 SPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNNCGIASEPSYPTV 321
>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
Length = 360
Score = 71.6 bits (174), Expect = 1e-12, Method: Composition-based stats.
Identities = 30/56 (53%), Positives = 38/56 (67%)
Query: 7 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 62
+P ++HAVLAVGYG DG PYW +KNSW WG++GY M + N CGV T +Y
Sbjct: 301 TPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCASY 356
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 70.9 bits (172), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 33/57 (57%), Positives = 42/57 (73%), Gaps = 1/57 (1%)
Query: 7 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 62
SP LDHAVLAVGYG G+ +W VKNSW+T WG+ GY+ MS ++NNCG+ T +Y
Sbjct: 264 SPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIATVASY 320
>sp|Q01958|CPP2_ENTHI Histolysain OS=Entamoeba histolytica GN=CPP2 PE=1 SV=1
Length = 315
Score = 70.5 bits (171), Expect = 3e-12, Method: Composition-based stats.
Identities = 33/62 (53%), Positives = 41/62 (66%)
Query: 3 KGHNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 62
K N+ L+H V AVGYG +DGK W V+NSW T WG++GY+ M I+ N CGV T P Y
Sbjct: 249 KCKNNYFALNHEVCAVGYGVVDGKECWIVRNSWGTGWGDKGYINMVIEGNTCGVATDPLY 308
Query: 63 VT 64
T
Sbjct: 309 PT 310
>sp|Q06964|CPP3_ENTHI Cysteine proteinase 3 (Fragment) OS=Entamoeba histolytica GN=CPNP
PE=1 SV=1
Length = 308
Score = 69.7 bits (169), Expect = 4e-12, Method: Composition-based stats.
Identities = 33/62 (53%), Positives = 41/62 (66%)
Query: 3 KGHNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 62
K N+ L+H V AVGYG +DGK W V+NSW T WG++GY+ M I+ N CGV T P Y
Sbjct: 242 KCKNNFFALNHEVCAVGYGVVDGKECWIVRNSWGTGWGDKGYINMVIEGNTCGVATDPLY 301
Query: 63 VT 64
T
Sbjct: 302 PT 303
>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
Length = 333
Score = 69.7 bits (169), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 31/58 (53%), Positives = 39/58 (67%)
Query: 5 HNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 62
H +PD ++HAVLAVGYGE +G YW VKNSW + WGN GY L+ N CG+ +Y
Sbjct: 271 HKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAACASY 328
>sp|P09668|CATH_HUMAN Pro-cathepsin H OS=Homo sapiens GN=CTSH PE=1 SV=4
Length = 335
Score = 69.7 bits (169), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 31/58 (53%), Positives = 38/58 (65%)
Query: 5 HNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 62
H +PD ++HAVLAVGYGE +G PYW VKNSW WG GY L+ N CG+ +Y
Sbjct: 273 HKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 330
>sp|O46427|CATH_PIG Pro-cathepsin H OS=Sus scrofa GN=CTSH PE=1 SV=1
Length = 335
Score = 69.3 bits (168), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 31/58 (53%), Positives = 38/58 (65%)
Query: 5 HNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 62
H +PD ++HAVLAVGYGE +G PYW VKNSW WG GY L+ N CG+ +Y
Sbjct: 273 HKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWGMNGYFLIERGKNMCGLAACASY 330
>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
Length = 356
Score = 69.3 bits (168), Expect = 7e-12, Method: Composition-based stats.
Identities = 29/57 (50%), Positives = 37/57 (64%)
Query: 6 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 62
++P ++HAVLAVGYG +G PYW +KNSW WG GY M + N CGV T +Y
Sbjct: 296 DTPMDVNHAVLAVGYGVENGTPYWLIKNSWGADWGEDGYFKMEMGKNMCGVATCASY 352
>sp|Q01957|CPP1_ENTHI Cysteine proteinase 1 OS=Entamoeba histolytica GN=CPP1 PE=1 SV=1
Length = 315
Score = 68.9 bits (167), Expect = 8e-12, Method: Composition-based stats.
Identities = 31/55 (56%), Positives = 37/55 (67%)
Query: 10 GLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTYVT 64
L+H V AVGYG +DGK W V+NSW T WG +GY+ M I+ N CGV T P Y T
Sbjct: 256 ALNHEVCAVGYGVVDGKECWIVRNSWGTGWGEKGYINMVIEGNTCGVATDPLYPT 310
>sp|P36185|ACP2_ENTHI Cysteine proteinase ACP2 (Fragment) OS=Entamoeba histolytica
GN=ACP2 PE=1 SV=1
Length = 310
Score = 68.9 bits (167), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 33/62 (53%), Positives = 41/62 (66%)
Query: 3 KGHNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 62
K N+ L+H V AVGYG +DGK W V+NSW T WG++GY+ M I+ N CGV T P Y
Sbjct: 244 KCKNNYFALNHEVCAVGYGVVDGKECWIVRNSWGTSWGDKGYINMVIEGNTCGVATDPLY 303
Query: 63 VT 64
T
Sbjct: 304 PT 305
>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
Length = 333
Score = 68.6 bits (166), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 30/61 (49%), Positives = 39/61 (63%)
Query: 2 RKGHNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPT 61
+ H +PD ++HAVLAVGYGE +G YW VKNSW + WG GY L+ N CG+ +
Sbjct: 268 KSCHKTPDKVNHAVLAVGYGEQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGLAACAS 327
Query: 62 Y 62
Y
Sbjct: 328 Y 328
>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
Length = 362
Score = 68.6 bits (166), Expect = 1e-11, Method: Composition-based stats.
Identities = 28/56 (50%), Positives = 37/56 (66%)
Query: 7 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 62
+PD ++HAVLAVGYG +G PYW +KNSW WG+ GY M + N C + T +Y
Sbjct: 302 TPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCAIATCASY 357
>sp|Q3T0I2|CATH_BOVIN Pro-cathepsin H OS=Bos taurus GN=CTSH PE=2 SV=1
Length = 335
Score = 68.6 bits (166), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 30/58 (51%), Positives = 38/58 (65%)
Query: 5 HNSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 62
H +PD ++HAVLAVGYGE G PYW VKNSW WG +GY L+ N CG+ ++
Sbjct: 273 HKTPDKVNHAVLAVGYGEEKGIPYWIVKNSWGPNWGMKGYFLIERGKNMCGLAACASF 330
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
SV=2
Length = 322
Score = 67.8 bits (164), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/60 (53%), Positives = 41/60 (68%), Gaps = 1/60 (1%)
Query: 7 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTYVTM 65
S LDHAVLAVGYG G+ +W VKNSW+T WG GY+ M+ ++NNCG+ T Y T+
Sbjct: 263 SSSQLDHAVLAVGYGSEGGQDFWLVKNSWATSWGESGYIKMARNRNNNCGIATDACYPTV 322
>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
Length = 358
Score = 67.4 bits (163), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 29/57 (50%), Positives = 39/57 (68%)
Query: 6 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 62
++P ++HAVLAVGYG DG PYW +KNSW WG++GY M + N CG+ T +Y
Sbjct: 298 STPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFKMEMGKNMCGIATCASY 354
>sp|Q54TR1|CFAD_DICDI Counting factor associated protein D OS=Dictyostelium discoideum
GN=cfaD PE=1 SV=1
Length = 531
Score = 66.6 bits (161), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 34/58 (58%), Positives = 39/58 (67%), Gaps = 1/58 (1%)
Query: 6 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTY 62
N D LDH VLA+GYG G+ Y+ VKNSWST WG GYV M+ DNN CGV + TY
Sbjct: 468 NGLDDLDHEVLAIGYGTYQGQDYFLVKNSWSTNWGMDGYVYMARNDNNLCGVSSQATY 525
>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
PE=2 SV=1
Length = 358
Score = 66.2 bits (160), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 37/57 (64%)
Query: 6 NSPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 62
N+P ++HAVLAVGYG D PYW +KNSW WG+ GY M + N CGV T +Y
Sbjct: 298 NTPMDVNHAVLAVGYGVEDDVPYWLIKNSWGGEWGDNGYFKMEMGKNMCGVATCSSY 354
>sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. japonica GN=Os09g0442300
PE=2 SV=2
Length = 362
Score = 65.5 bits (158), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 29/56 (51%), Positives = 37/56 (66%)
Query: 7 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAPTY 62
SP ++HAVLAVGYG +G PYW +KNSW WG+ GY M + N CG+ T +Y
Sbjct: 303 SPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCASY 358
>sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1
Length = 330
Score = 64.3 bits (155), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 30/58 (51%), Positives = 39/58 (67%), Gaps = 1/58 (1%)
Query: 9 DGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 65
D L+HAVLAVGYG GK +W +KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 273 DNLNHAVLAVGYGIQKGKKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 330
>sp|P09648|CATL1_CHICK Cathepsin L1 (Fragments) OS=Gallus gallus GN=CTSL1 PE=1 SV=1
Length = 218
Score = 63.5 bits (153), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 39/57 (68%), Gaps = 1/57 (1%)
Query: 7 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 62
S + LDH VL VGYG GK YW VKNSW WG++GY+ M+ + N+CG+ TA +Y
Sbjct: 159 SSEDLDHGVLVVGYGFEGGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASY 215
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 62.8 bits (151), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 34/67 (50%), Positives = 42/67 (62%), Gaps = 6/67 (8%)
Query: 5 HNSPD----GLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMT 58
+N P+ LDH VL VGYG + G YW VKNSW T WG QGY+ M+ NN CG+ T
Sbjct: 273 YNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARNQNNQCGIAT 332
Query: 59 APTYVTM 65
A +Y T+
Sbjct: 333 ASSYPTV 339
>sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1
Length = 329
Score = 62.4 bits (150), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 38/58 (65%), Gaps = 1/58 (1%)
Query: 9 DGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 65
D L+HAVLAVGYG G +W +KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 272 DNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>sp|Q5E968|CATK_BOVIN Cathepsin K OS=Bos taurus GN=CTSK PE=2 SV=2
Length = 329
Score = 62.4 bits (150), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 38/58 (65%), Gaps = 1/58 (1%)
Query: 9 DGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 65
D L+HAVLAVGYG G +W +KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 272 DNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>sp|P43236|CATK_RABIT Cathepsin K OS=Oryctolagus cuniculus GN=CTSK PE=1 SV=1
Length = 329
Score = 62.4 bits (150), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 29/60 (48%), Positives = 39/60 (65%), Gaps = 1/60 (1%)
Query: 7 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 65
S D ++HAVLAVGYG G +W +KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 270 SSDNVNHAVLAVGYGIQKGNKHWIIKNSWGESWGNKGYILMARNKNNACGIANLASFPKM 329
>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
Length = 329
Score = 62.4 bits (150), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 38/58 (65%), Gaps = 1/58 (1%)
Query: 9 DGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 65
D L+HAVLAVGYG G +W +KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 272 DNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
Length = 329
Score = 62.4 bits (150), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 38/58 (65%), Gaps = 1/58 (1%)
Query: 9 DGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 65
D L+HAVLAVGYG G +W +KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 272 DNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 329
>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
Length = 330
Score = 62.4 bits (150), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 29/58 (50%), Positives = 38/58 (65%), Gaps = 1/58 (1%)
Query: 9 DGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 65
D L+HAVLAVGYG G +W +KNSW WGN+GY+LM+ NN CG+ ++ M
Sbjct: 273 DNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNACGIANLASFPKM 330
>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
Length = 330
Score = 62.4 bits (150), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 40/53 (75%), Gaps = 1/53 (1%)
Query: 11 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 62
++H VL +GYG+L+GK YW VKNSW + +G QGY+ M+ K N+CG+ + P+Y
Sbjct: 275 VNHGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSY 327
>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
Length = 331
Score = 62.4 bits (150), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 28/53 (52%), Positives = 38/53 (71%), Gaps = 1/53 (1%)
Query: 11 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 62
++H VL VGYG LDGK YW VKNSW ++G+QGY+ M+ N+CG+ P+Y
Sbjct: 276 VNHGVLVVGYGNLDGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSY 328
>sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2 SV=1
Length = 329
Score = 61.6 bits (148), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 28/58 (48%), Positives = 37/58 (63%), Gaps = 1/58 (1%)
Query: 9 DGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 65
D ++HAVL VGYG G YW +KNSW WGN+GYVL++ NN CG+ ++ M
Sbjct: 272 DNVNHAVLVVGYGTQKGNKYWIIKNSWGESWGNKGYVLLARNKNNACGITNLASFPKM 329
>sp|Q24940|CATLL_FASHE Cathepsin L-like proteinase OS=Fasciola hepatica GN=Cat-1 PE=1 SV=1
Length = 326
Score = 61.2 bits (147), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 29/56 (51%), Positives = 38/56 (67%), Gaps = 1/56 (1%)
Query: 7 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPT 61
SP ++HAVLAVGYG G YW VKNSW TYWG +GY+ M+ + N CG+ + +
Sbjct: 263 SPLRVNHAVLAVGYGTQGGTDYWIVKNSWGTYWGERGYIRMARNRGNMCGIASLAS 318
>sp|Q90686|CATK_CHICK Cathepsin K OS=Gallus gallus GN=CTSK PE=2 SV=1
Length = 334
Score = 61.2 bits (147), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 39/60 (65%), Gaps = 1/60 (1%)
Query: 7 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIK-DNNCGVMTAPTYVTM 65
+P+ ++HAVLAVGYG G +W +KNSW T WGN+GYVL++ CG+ ++ M
Sbjct: 275 NPENINHAVLAVGYGAQKGTKHWIIKNSWGTEWGNKGYVLLARNMKQTCGIANLASFPKM 334
>sp|Q02765|CATS_RAT Cathepsin S OS=Rattus norvegicus GN=Ctss PE=2 SV=1
Length = 330
Score = 60.8 bits (146), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 27/55 (49%), Positives = 40/55 (72%), Gaps = 1/55 (1%)
Query: 9 DGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 62
+ ++H VL VGYG LDGK YW VKNSW ++G+QGY+ M+ + N+CG+ + +Y
Sbjct: 273 ENMNHGVLVVGYGTLDGKDYWLVKNSWGLHFGDQGYIRMARNNKNHCGIASYCSY 327
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 60.8 bits (146), Expect = 3e-09, Method: Composition-based stats.
Identities = 26/54 (48%), Positives = 39/54 (72%), Gaps = 2/54 (3%)
Query: 11 LDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYV-LMSIKDNNCGVMTAPTY 62
LDH VL VG+G + G+ YW VKNSW T WG++G++ ++ K+N CG+ +A +Y
Sbjct: 315 LDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSY 368
>sp|P25774|CATS_HUMAN Cathepsin S OS=Homo sapiens GN=CTSS PE=1 SV=3
Length = 331
Score = 60.5 bits (145), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 39/53 (73%), Gaps = 1/53 (1%)
Query: 11 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 62
++H VL VGYG+L+GK YW VKNSW +G +GY+ M+ K N+CG+ + P+Y
Sbjct: 276 VNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSY 328
>sp|O91466|CATV_GVCPM Viral cathepsin OS=Cydia pomonella granulosis virus (isolate
Mexico/1963) GN=VCATH PE=3 SV=1
Length = 333
Score = 60.1 bits (144), Expect = 4e-09, Method: Composition-based stats.
Identities = 25/49 (51%), Positives = 33/49 (67%)
Query: 9 DGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVM 57
+GL+HAVL VGYG + PYW +KNSW WG +GY + N+CG+M
Sbjct: 276 EGLNHAVLLVGYGVKNDVPYWILKNSWGAEWGEEGYFRVQRDKNSCGMM 324
>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
Length = 331
Score = 59.7 bits (143), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 38/53 (71%), Gaps = 1/53 (1%)
Query: 11 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 62
++H VL VGYG L+GK YW VKNSW +G+QGY+ M+ N+CG+ + P+Y
Sbjct: 276 VNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSY 328
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 59.7 bits (143), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 30/57 (52%), Positives = 38/57 (66%), Gaps = 1/57 (1%)
Query: 7 SPDGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSI-KDNNCGVMTAPTY 62
S + LDH VLAVG G +G+ Y+ VKNSW WG GY+ M+ KDNNCG+ T +Y
Sbjct: 278 SSEDLDHGVLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHMARNKDNNCGISTMASY 334
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 59.3 bits (142), Expect = 6e-09, Method: Composition-based stats.
Identities = 29/57 (50%), Positives = 36/57 (63%), Gaps = 4/57 (7%)
Query: 10 GLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM--SIKDNN--CGVMTAPTY 62
LDH V AVGYG +GK YW V+NSW WG GYV M +IK ++ CG+ P+Y
Sbjct: 286 ALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCGIAVEPSY 342
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 59.3 bits (142), Expect = 7e-09, Method: Composition-based stats.
Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 4/56 (7%)
Query: 11 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM----SIKDNNCGVMTAPTY 62
LDH V+AVGYG +GK YW V+NSW WG GY+ M + CG+ P+Y
Sbjct: 295 LDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSY 350
>sp|Q91GE3|CATV_NPVEP Viral cathepsin OS=Epiphyas postvittana nucleopolyhedrovirus
GN=VCATH PE=3 SV=1
Length = 323
Score = 58.9 bits (141), Expect = 8e-09, Method: Composition-based stats.
Identities = 25/48 (52%), Positives = 32/48 (66%)
Query: 9 DGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGV 56
+GL+HAVL VGYG + PYW +KNSW T WG QG+ + N CG+
Sbjct: 265 NGLNHAVLLVGYGVENNVPYWILKNSWGTDWGEQGFFKIQQNVNACGI 312
>sp|Q10991|CATL1_SHEEP Cathepsin L1 OS=Ovis aries GN=CTSL PE=1 SV=1
Length = 217
Score = 58.5 bits (140), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/61 (52%), Positives = 38/61 (62%), Gaps = 2/61 (3%)
Query: 7 SPDGLDHAVLAVGYG-ELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVT 64
S LDH VL VGYG E +W VKNSW WGN+GYV M+ NN CG+ TA +Y T
Sbjct: 157 SSKDLDHGVLVVGYGFEGTNNKFWIVKNSWGPEWGNKGYVKMAKDQNNHCGIATAASYPT 216
Query: 65 M 65
+
Sbjct: 217 V 217
>sp|Q91CL9|CATV_NPVAP Viral cathepsin OS=Antheraea pernyi nuclear polyhedrosis virus
GN=VCATH PE=3 SV=1
Length = 324
Score = 58.5 bits (140), Expect = 1e-08, Method: Composition-based stats.
Identities = 24/47 (51%), Positives = 31/47 (65%)
Query: 10 GLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGV 56
GL+HAVL VGYG +G P+W +KN+W WG QGY + N CG+
Sbjct: 267 GLNHAVLLVGYGVENGIPFWILKNTWGADWGEQGYFRVQQNINACGI 313
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 58.5 bits (140), Expect = 1e-08, Method: Composition-based stats.
Identities = 27/58 (46%), Positives = 34/58 (58%), Gaps = 4/58 (6%)
Query: 11 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLM----SIKDNNCGVMTAPTYVT 64
LDH V+AVGYG +GK YW V+NSW WG GYV M ++ CG+ +Y T
Sbjct: 300 LDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPT 357
>sp|P55097|CATK_MOUSE Cathepsin K OS=Mus musculus GN=Ctsk PE=2 SV=2
Length = 329
Score = 58.5 bits (140), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 26/58 (44%), Positives = 36/58 (62%), Gaps = 1/58 (1%)
Query: 9 DGLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNN-CGVMTAPTYVTM 65
D ++HAVL VGYG G +W +KNSW WGN+GY L++ NN CG+ ++ M
Sbjct: 272 DNVNHAVLVVGYGTQKGSKHWIIKNSWGESWGNKGYALLARNKNNACGITNMASFPKM 329
>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
Length = 340
Score = 58.2 bits (139), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 38/53 (71%), Gaps = 1/53 (1%)
Query: 11 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKD-NNCGVMTAPTY 62
++H VL VGYG LDGK YW VKNSW +G+QGY+ M+ + N+CG+ + +Y
Sbjct: 285 VNHGVLVVGYGTLDGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIASYCSY 337
>sp|Q6VTL7|CATV_NPVCD Viral cathepsin OS=Choristoneura fumiferana defective polyhedrosis
virus GN=Vcath PE=3 SV=1
Length = 324
Score = 58.2 bits (139), Expect = 2e-08, Method: Composition-based stats.
Identities = 24/47 (51%), Positives = 31/47 (65%)
Query: 10 GLDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGV 56
GL+HAVL VGY +G P+W +KN+W T WG QGY + N CG+
Sbjct: 267 GLNHAVLLVGYAVENGVPFWILKNTWGTDWGEQGYFRVQQNINACGI 313
>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 OS=Drosophila melanogaster
GN=CG12163 PE=2 SV=2
Length = 614
Score = 57.4 bits (137), Expect = 3e-08, Method: Composition-based stats.
Identities = 30/65 (46%), Positives = 32/65 (49%), Gaps = 6/65 (9%)
Query: 7 SPDGLDHAVLAVGYGELD------GKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGVMTAP 60
S LDH VL VGYG D PYW VKNSW WG QGY + DN CGV
Sbjct: 549 SKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVYRGDNTCGVSEMA 608
Query: 61 TYVTM 65
T +
Sbjct: 609 TSAVL 613
>sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1
Length = 467
Score = 57.4 bits (137), Expect = 3e-08, Method: Composition-based stats.
Identities = 23/46 (50%), Positives = 30/46 (65%)
Query: 11 LDHAVLAVGYGELDGKPYWQVKNSWSTYWGNQGYVLMSIKDNNCGV 56
LDH VL VGY + PYW +KNSW+T WG +GY+ ++ N C V
Sbjct: 282 LDHGVLLVGYNDSAAVPYWIIKNSWTTQWGEEGYIRIAKGSNQCLV 327
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.134 0.449
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 28,911,822
Number of Sequences: 539616
Number of extensions: 974104
Number of successful extensions: 2115
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 205
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 1827
Number of HSP's gapped (non-prelim): 228
length of query: 65
length of database: 191,569,459
effective HSP length: 37
effective length of query: 28
effective length of database: 171,603,667
effective search space: 4804902676
effective search space used: 4804902676
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 55 (25.8 bits)