RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy8713
(309 letters)
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 246 bits (630), Expect = 1e-80
Identities = 94/330 (28%), Positives = 121/330 (36%), Gaps = 105/330 (31%)
Query: 40 QAEKNS-LSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCP 98
+A+ + + NI K GV N + E LP++FDS WPNCP
Sbjct: 27 KAKYDGVMQNITLREAKRLNGVIKKNNNASILPKRRFTEEEARAPLPSSFDSAEAWPNCP 86
Query: 99 TIREIRDQGSCGSCW-------------------------------------GC------ 115
TI +I DQ +CGSCW GC
Sbjct: 87 TIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPD 146
Query: 116 ----------------RPYEIAPCEHHVNGTR--PSCDASKGHTPKCVRECQENYDVPYK 157
+PY C HH P C TPKC C +
Sbjct: 147 RAWAYFSSTGLVSDYCQPYPFPHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPT---IP 203
Query: 158 KDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLI 217
SY++ E M+E++ GP E AF V++D I Y SG
Sbjct: 204 VVNYRSWTSYAL-QGEDDYMRELFFRGPFEVAFDVYEDFIAYNSG--------------- 247
Query: 218 KWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWN 277
V+ SG+ LGGHA+R++GWG YW IANSWN
Sbjct: 248 ------------------VYHH----VSGQYLGGHAVRLVGWGTSN--GVPYWKIANSWN 283
Query: 278 TDWGDNGLFKILRGKDECGIESSITAGVPK 307
T+WG +G F I RG ECGIE +AG+P
Sbjct: 284 TEWGMDGYFLIRRGSSECGIEDGGSAGIPL 313
Score = 54.7 bits (132), Expect = 1e-08
Identities = 14/28 (50%), Positives = 17/28 (60%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
CG GCNGG P AW Y+ +G+VS
Sbjct: 136 CGDGCNGGDPDRAWAYFSSTGLVSDYCQ 163
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 223 bits (571), Expect = 7e-72
Identities = 100/203 (49%), Positives = 124/203 (61%), Gaps = 40/203 (19%)
Query: 107 GSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKS 166
G S GCRPY I PCEHHVNG+RP C +G TPKC + C+ Y YK+D ++G S
Sbjct: 155 GLYESHVGCRPYSIPPCEHHVNGSRPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGYNS 213
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTS 226
YSVS++EK IM EIY++GPVEGAF+V+ D +LYKSG
Sbjct: 214 YSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSG------------------------ 249
Query: 227 QLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLF 286
V+ +G+ +GGHAIRILGWG + YWL+ANSWNTDWGDNG F
Sbjct: 250 ---------VYQH----VTGEMMGGHAIRILGWGVENG--TPYWLVANSWNTDWGDNGFF 294
Query: 287 KILRGKDECGIESSITAGVPKLD 309
KILRG+D CGIES + AG+P+ D
Sbjct: 295 KILRGQDHCGIESEVVAGIPRTD 317
Score = 77.8 bits (192), Expect = 1e-16
Identities = 35/74 (47%), Positives = 47/74 (63%), Gaps = 6/74 (8%)
Query: 40 QAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPT 99
QA N N+ ++LK G L + P+ + ++E D LPA+FD+R +WP CPT
Sbjct: 26 QAGHNF-YNVDMSYLKRLCGTF----LGGPKPPQRVMFTE-DLKLPASFDAREQWPQCPT 79
Query: 100 IREIRDQGSCGSCW 113
I+EIRDQGSCGSCW
Sbjct: 80 IKEIRDQGSCGSCW 93
Score = 57.8 bits (140), Expect = 1e-09
Identities = 18/32 (56%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 130 CGDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 161
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 219 bits (561), Expect = 4e-71
Identities = 99/205 (48%), Positives = 122/205 (59%), Gaps = 40/205 (19%)
Query: 105 DQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGA 164
G S GCRPY I PCE HVNG RP C +G TPKC + C+ Y YK+D ++G
Sbjct: 96 SGGLYESHVGCRPYSIPPCEAHVNGARPPCT-GEGDTPKCSKICEPGYSPTYKQDKHYGY 154
Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
SYSVS++EK IM EIY++GPVEGAF+V+ D +LYKSG
Sbjct: 155 NSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDFLLYKSG---------------------- 192
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
V+ +G+ +GGHAIRILGWG + YWL+ANSWNTDWGDNG
Sbjct: 193 -----------VYQH----VTGEMMGGHAIRILGWGVENG--TPYWLVANSWNTDWGDNG 235
Query: 285 LFKILRGKDECGIESSITAGVPKLD 309
FKILRG+D CGIES + AG+P+ D
Sbjct: 236 FFKILRGQDHCGIESEVVAGIPRTD 260
Score = 72.7 bits (179), Expect = 4e-15
Identities = 25/37 (67%), Positives = 31/37 (83%), Gaps = 1/37 (2%)
Query: 77 YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
++E D LPA+FD+R +WP CPTI+EIRDQGSCGS W
Sbjct: 1 FTE-DLKLPASFDAREQWPQCPTIKEIRDQGSCGSAW 36
Score = 57.3 bits (139), Expect = 9e-10
Identities = 18/32 (56%), Positives = 22/32 (68%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQ 40
CG GCNGG+P AW +W + G+VSGG Y S
Sbjct: 73 CGDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 104
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 216 bits (553), Expect = 6e-70
Identities = 83/202 (41%), Positives = 111/202 (54%), Gaps = 39/202 (19%)
Query: 105 DQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGA 164
S + GC PY CEHH G P C + TP+C + CQ+ Y PY +D + G
Sbjct: 91 TGSSKENHAGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRGK 150
Query: 165 KSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDN 224
SY+V ++EK+I KEI ++GPVE FTV++D + YKSG
Sbjct: 151 SSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDFLNYKSG---------------------- 188
Query: 225 TSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNG 284
++ +G+ LGGHAIRI+GWG + +K YWLIANSWN DWG+NG
Sbjct: 189 -----------IYKH----ITGETLGGHAIRIIGWGVE--NKAPYWLIANSWNEDWGENG 231
Query: 285 LFKILRGKDECGIESSITAGVP 306
F+I+RG+DEC IES +TAG
Sbjct: 232 YFRIVRGRDECSIESEVTAGRI 253
Score = 71.9 bits (177), Expect = 6e-15
Identities = 19/31 (61%), Positives = 24/31 (77%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
++P++FDSR KWP C +I IRDQ CGSCW
Sbjct: 2 EIPSSFDSRKKWPRCKSIATIRDQSRCGSCW 32
Score = 58.0 bits (141), Expect = 5e-10
Identities = 17/35 (48%), Positives = 21/35 (60%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAYGSKQAEK 43
CG GC GG G AW YWVK GIV+G + + +
Sbjct: 68 CGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCE 102
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 186 bits (475), Expect = 5e-58
Identities = 48/298 (16%), Positives = 75/298 (25%), Gaps = 123/298 (41%)
Query: 81 DEDLPANFDSRTKWPNCPTIREIRDQ---GSCGSCW------------------------ 113
DLP ++D R R+Q CGSCW
Sbjct: 33 PADLPKSWDWRNVDG-VNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTL 91
Query: 114 -------------GC----------------------RPYEIAPCEHHVNGTRPSCDASK 138
C Y+ E +C
Sbjct: 92 LSVQNVIDCGNAGSCEGGNDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQCGTC---- 147
Query: 139 GHTPKCVRECQENYDVPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLIL 198
+EC + + Y S + +M EIY +GP+ + L
Sbjct: 148 ----NEFKECHAIRNYTLWRV-----GDYGSLSGREKMMAEIYANGPISCGIMATERLAN 198
Query: 199 YKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILG 258
Y G ++ + H + + G
Sbjct: 199 YTGG---------------------------------IYAE----YQDTTYINHVVSVAG 221
Query: 259 WGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECG--------IESSITAGVPKL 308
WG + +YW++ NSW WG+ G +I+ + G IE T G P +
Sbjct: 222 WGISD--GTEYWIVRNSWGEPWGERGWLRIVTSTYKDGKGARYNLAIEEHCTFGDPIV 277
Score = 50.8 bits (122), Expect = 1e-07
Identities = 7/28 (25%), Positives = 8/28 (28%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
C GG W Y + GI
Sbjct: 102 NAGSCEGGNDLSVWDYAHQHGIPDETCN 129
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 183 bits (467), Expect = 5e-55
Identities = 69/335 (20%), Positives = 96/335 (28%), Gaps = 118/335 (35%)
Query: 34 GAYGSKQAEKNSLSNIPRAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTK 93
+ + G H P + LP ++D R
Sbjct: 157 IQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILFLPTSWDWRNV 216
Query: 94 WPNCPTIREIRDQGSCGSCW-------------------------------------GC- 115
+ +R+Q SCGSC+ GC
Sbjct: 217 HG-INFVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCE 275
Query: 116 ----------------------RPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYD 153
PY GT C K +C Y
Sbjct: 276 GGFPYLIAGKYAQDFGLVEEACFPYT---------GTDSPC--------KMKEDCFRYYS 318
Query: 154 VPYKKDLNFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTA 213
Y + NE + E+ HGP+ AF V+DD + YK G
Sbjct: 319 SEYHYV-----GGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKG----------- 362
Query: 214 MSLIKWTIRDNTSQLGAEGAFTVFDD--LILYKSGKALGGHAIRILGWGEDEKSKEKYWL 271
++ L + L HA+ ++G+G D S YW+
Sbjct: 363 ----------------------IYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWI 400
Query: 272 IANSWNTDWGDNGLFKILRGKDECGIESSITAGVP 306
+ NSW T WG+NG F+I RG DEC IES A P
Sbjct: 401 VKNSWGTGWGENGYFRIRRGTDECAIESIAVAATP 435
Score = 44.0 bits (104), Expect = 3e-05
Identities = 9/29 (31%), Positives = 13/29 (44%), Gaps = 1/29 (3%)
Query: 9 CGFGCNGGFPGMA-WRYWVKSGIVSGGAY 36
GC GGFP + +Y G+V +
Sbjct: 270 YAQGCEGGFPYLIAGKYAQDFGLVEEACF 298
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 112 bits (282), Expect = 2e-29
Identities = 45/309 (14%), Positives = 76/309 (24%), Gaps = 121/309 (39%)
Query: 55 KSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQ---GS--- 108
KS G PD + R V LP D + + DQ GS
Sbjct: 30 KSGYGYIPD--IADIRDFSYTPEKSVIAALPPKVDLTPPFQ-------VYDQGRIGSCTA 80
Query: 109 ------------------------------------CGSCWG------------------ 114
+
Sbjct: 81 NALAAAIQFERIHDKQSPEFIPSRLFIYYNERKIEGHVNYDSGAMIRDGIKVLHKLGVCP 140
Query: 115 --CRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYS-VSS 171
PY P + P ASK + +C YK N+ YS V+
Sbjct: 141 EKEWPYGDTPADPRTEEFPPGAPASKKPSDQC-----------YKDAQNYKITEYSRVAQ 189
Query: 172 NEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAE 231
+ + + P F+V++ + S +P
Sbjct: 190 DIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRIPLPTK-------------------- 229
Query: 232 GAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRG 291
+ GGHA+ +G+ ++ + ++ I NSW + G++G F +
Sbjct: 230 -------------NDTLEGGHAVLCVGYDDEIR----HFRIRNSWGNNVGEDGYFWMPYE 272
Query: 292 KDE-CGIES 299
+
Sbjct: 273 YISNTQLAD 281
Score = 35.2 bits (81), Expect = 0.016
Identities = 3/28 (10%), Positives = 7/28 (25%)
Query: 9 CGFGCNGGFPGMAWRYWVKSGIVSGGAY 36
+G + K G+ +
Sbjct: 117 HVNYDSGAMIRDGIKVLHKLGVCPEKEW 144
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 73.9 bits (182), Expect = 2e-15
Identities = 34/199 (17%), Positives = 57/199 (28%), Gaps = 43/199 (21%)
Query: 117 PYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDVPYKKDLNFGAKSYS--VSSNEK 174
PY + K + E + K + ++ + + + K
Sbjct: 100 PYNYVKVGEQCPKVEDHWMNLWDNG-KILHNKNEPNSLDGKGYTAYESERFHDNMDAFVK 158
Query: 175 SIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAF 234
I E+ G V + + SG+
Sbjct: 159 IIKTEVMNKGSVIAYIKAENVMGYEFSGKKVKN--------------------------- 191
Query: 235 TVFDDLILYKSGKALGGHAIRILGWG---EDEKSKEKYWLIANSWNTDWGDNGLFKILR- 290
G HA+ I+G+G E K+ YW++ NSW WGD G FK+
Sbjct: 192 ---------LCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVRNSWGPYWGDEGYFKVDMY 242
Query: 291 GKDECGIESSITAGVPKLD 309
G C + + +D
Sbjct: 243 GPTHCHFNFIHSVVIFNVD 261
Score = 37.3 bits (87), Expect = 0.003
Identities = 9/27 (33%), Positives = 16/27 (59%), Gaps = 1/27 (3%)
Query: 88 FDSRTK-WPNCPTIREIRDQGSCGSCW 113
+ +R K NC + ++ DQG+C + W
Sbjct: 9 YCNRLKDENNCISNLQVEDQGNCDTSW 35
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 68.6 bits (169), Expect = 7e-14
Identities = 31/128 (24%), Positives = 57/128 (44%), Gaps = 36/128 (28%)
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
++E+++++ + + PV AF V +D ++Y+ G
Sbjct: 118 MNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKG--------------------------- 150
Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
++ +K+ + HA+ +G+GE+ YW++ NSW WG NG F I
Sbjct: 151 ------IYSSTSCHKTPDKVN-HAVLAVGYGEENG--IPYWIVKNSWGPQWGMNGYFLIE 201
Query: 290 RGKDECGI 297
RGK+ CG+
Sbjct: 202 RGKNMCGL 209
Score = 34.0 bits (79), Expect = 0.032
Identities = 13/29 (44%), Positives = 18/29 (62%), Gaps = 3/29 (10%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
P + D R K N + +++QGSCGSCW
Sbjct: 2 PPSMDWRKK-GNFVS--PVKNQGSCGSCW 27
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 64.8 bits (159), Expect = 1e-12
Identities = 30/132 (22%), Positives = 50/132 (37%), Gaps = 37/132 (28%)
Query: 166 SYSVSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
S +S NE+ + + + GP+ A + Y+ G
Sbjct: 110 SVELSQNEQKLAAWLAKRGPISVAINA-FGMQFYRHG----------------------- 145
Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGL 285
+ L S + HA+ ++G+G+ S +W I NSW TDWG+ G
Sbjct: 146 ----------ISRPLRPLCSPWLID-HAVLLVGYGQR--SDVPFWAIKNSWGTDWGEKGY 192
Query: 286 FKILRGKDECGI 297
+ + RG CG+
Sbjct: 193 YYLHRGSGACGV 204
Score = 34.8 bits (81), Expect = 0.021
Identities = 13/29 (44%), Positives = 18/29 (62%), Gaps = 4/29 (13%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
P +D R+K T +++DQG CGSCW
Sbjct: 2 PPEWDWRSK--GAVT--KVKDQGMCGSCW 26
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 63.7 bits (156), Expect = 3e-12
Identities = 25/129 (19%), Positives = 44/129 (34%), Gaps = 41/129 (31%)
Query: 169 VSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQL 228
+ +E I + +GPV A + Y G
Sbjct: 118 LPQDEAQIAAWLAVNGPVAVAVDA-SSWMTYTGG-------------------------- 150
Query: 229 GAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKI 288
V + L H + ++G+ + + YW+I NSW T WG+ G +I
Sbjct: 151 -------VMTS----CVSEQLD-HGVLLVGYNDS--AAVPYWIIKNSWTTQWGEEGYIRI 196
Query: 289 LRGKDECGI 297
+G ++C +
Sbjct: 197 AKGSNQCLV 205
Score = 34.4 bits (80), Expect = 0.026
Identities = 13/29 (44%), Positives = 16/29 (55%), Gaps = 4/29 (13%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
PA D R + T ++DQG CGSCW
Sbjct: 2 PAAVDWRAR--GAVT--AVKDQGQCGSCW 26
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 63.8 bits (156), Expect = 7e-12
Identities = 18/48 (37%), Positives = 25/48 (52%), Gaps = 2/48 (4%)
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
HA+ I+G+ YW++ NSW+T+WGDNG D IE
Sbjct: 250 HAVNIVGYSNA--QGVDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEE 295
Score = 39.9 bits (94), Expect = 7e-04
Identities = 13/37 (35%), Positives = 16/37 (43%), Gaps = 4/37 (10%)
Query: 77 YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
++ + PA D R T IR QG CGS W
Sbjct: 83 ACSINGNAPAEIDLRQM--RTVT--PIRMQGGCGSAW 115
Score = 29.5 bits (67), Expect = 1.3
Identities = 7/26 (26%), Positives = 10/26 (38%)
Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
GC+G Y +G+V Y
Sbjct: 149 HGCHGDTIPRGIEYIQHNGVVQESYY 174
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 62.5 bits (153), Expect = 8e-12
Identities = 18/48 (37%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDECGIES 299
HA+ I+G+ + YW++ NSW+T+WGDNG D IE
Sbjct: 170 HAVNIVGYSNAQG--VDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEE 215
Score = 40.6 bits (96), Expect = 2e-04
Identities = 13/37 (35%), Positives = 16/37 (43%), Gaps = 4/37 (10%)
Query: 77 YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
++ + PA D R T IR QG CGS W
Sbjct: 3 ACSINGNAPAEIDLRQM--RTVT--PIRMQGGCGSAW 35
Score = 30.6 bits (70), Expect = 0.55
Identities = 7/26 (26%), Positives = 10/26 (38%)
Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
GC+G Y +G+V Y
Sbjct: 69 HGCHGDTIPRGIEYIQHNGVVQESYY 94
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 61.5 bits (150), Expect = 6e-11
Identities = 33/129 (25%), Positives = 48/129 (37%), Gaps = 39/129 (30%)
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
+E + + GPV AF D Y G ++ P ET +
Sbjct: 232 GPDENMLADMVATKGPVAVAFDADDPFGSYSGGVYYNPTCETNKFT-------------- 277
Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
HA+ I+G+G + + + YWL+ NSW WG +G FKI
Sbjct: 278 ----------------------HAVLIVGYGNE--NGQDYWLVKNSWGDGWGLDGYFKIA 313
Query: 290 RGKD-ECGI 297
R + CGI
Sbjct: 314 RNANNHCGI 322
Score = 39.2 bits (92), Expect = 0.001
Identities = 13/63 (20%), Positives = 23/63 (36%), Gaps = 4/63 (6%)
Query: 51 RAHLKSWMGVHPDYNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCG 110
+A+ + + PA+FD R + + +++QGSCG
Sbjct: 83 KAYTHGLIMPADLHKNGIPIKTREDLGLNASVRYPASFDWRDQ--GMVS--PVKNQGSCG 138
Query: 111 SCW 113
S W
Sbjct: 139 SSW 141
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 58.3 bits (142), Expect = 2e-10
Identities = 18/49 (36%), Positives = 31/49 (63%), Gaps = 3/49 (6%)
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE---CGI 297
H + ++G+G D++SK+ +W++ NSW T WG +G + K E CG+
Sbjct: 167 HGVLLVGYGTDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGL 215
Score = 37.5 bits (88), Expect = 0.002
Identities = 14/31 (45%), Positives = 19/31 (61%), Gaps = 4/31 (12%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+LPA D R++ C T ++DQ CGSCW
Sbjct: 6 ELPAGVDWRSR--GCVT--PVKDQRDCGSCW 32
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 57.6 bits (139), Expect = 4e-10
Identities = 27/138 (19%), Positives = 51/138 (36%), Gaps = 11/138 (7%)
Query: 169 VSSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSL-----IKWTIRD 223
V +N Y + V+G + + G VP + + + + I
Sbjct: 75 VITNGGIASDANYPYTGVDGTCDLNKPIAARIDGYTNVPNSSSALLDAVAKQPVSVNIYT 134
Query: 224 NTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDN 283
+++ +F + H + I+G+G + + YW++ NSW T+WG +
Sbjct: 135 SSTSFQLYTGPGIFAGSSCSDDPATVD-HTVLIVGYGSNGTNA-DYWIVKNSWGTEWGID 192
Query: 284 GLFKILRGKDE----CGI 297
G I R + C I
Sbjct: 193 GYILIRRNTNRPDGVCAI 210
Score = 34.8 bits (80), Expect = 0.018
Identities = 13/29 (44%), Positives = 17/29 (58%), Gaps = 4/29 (13%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
PA+ D R K T ++DQG+CG CW
Sbjct: 2 PASIDWRKK--GAVT--SVKDQGACGMCW 26
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 56.8 bits (138), Expect = 8e-10
Identities = 31/136 (22%), Positives = 53/136 (38%), Gaps = 40/136 (29%)
Query: 169 VSSNEKSIMKEIYEHGPVEGAFTV-FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQ 227
+ EK++MK + GP+ A + + YK G +F P ++
Sbjct: 115 IPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEP--------------DCSSED 160
Query: 228 LGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK--EKYWLIANSWNTDWGDNGL 285
+ D H + ++G+G + KYWL+ NSW +WG G
Sbjct: 161 M----------D------------HGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGY 198
Query: 286 FKILRGKD-ECGIESS 300
K+ + + CGI S+
Sbjct: 199 VKMAKDRRNHCGIASA 214
Score = 35.2 bits (82), Expect = 0.015
Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 4/29 (13%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
P + D R K T +++QG CGSCW
Sbjct: 2 PRSVDWREK--GYVT--PVKNQGQCGSCW 26
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 56.8 bits (138), Expect = 1e-09
Identities = 17/50 (34%), Positives = 25/50 (50%), Gaps = 6/50 (12%)
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE----CGI 297
H + I+G+G + YW+ NSW DWG +G +I R CG+
Sbjct: 168 HFVLIVGYGSE--DGVDYWIAKNSWGEDWGIDGYIRIQRNTGNLLGVCGM 215
Score = 39.1 bits (92), Expect = 0.001
Identities = 11/31 (35%), Positives = 16/31 (51%), Gaps = 4/31 (12%)
Query: 83 DLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
D P ++D K T +++ QG CGS W
Sbjct: 1 DAPESWDWSKK--GVIT--KVKFQGQCGSGW 27
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 56.3 bits (137), Expect = 1e-09
Identities = 18/50 (36%), Positives = 26/50 (52%), Gaps = 6/50 (12%)
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD----ECGI 297
HA+ +G+G+ + Y LI NSW T WG+ G +I R CG+
Sbjct: 159 HAVTAVGYGKS--GGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGL 206
Score = 35.5 bits (83), Expect = 0.011
Identities = 16/30 (53%), Positives = 17/30 (56%), Gaps = 4/30 (13%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
LP N D R K T +R QGSCGSCW
Sbjct: 1 LPENVDWRKK--GAVT--PVRHQGSCGSCW 26
Score = 30.1 bits (69), Expect = 0.74
Identities = 11/26 (42%), Positives = 13/26 (50%)
Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
GC GG+P A Y K+GI Y
Sbjct: 61 HGCKGGYPPYALEYVAKNGIHLRSKY 86
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 57.3 bits (139), Expect = 1e-09
Identities = 28/132 (21%), Positives = 52/132 (39%), Gaps = 39/132 (29%)
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
S +E S+ + + GPV A D+L Y G F+ + ++
Sbjct: 230 SGDENSLADAVGQAGPVAVAIDATDELQFYSGGLFYDQTCNQSDLN-------------- 275
Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
H + ++G+G D + YW++ NSW + WG++G ++ +
Sbjct: 276 ----------------------HGVLVVGYGSDNG--QDYWILKNSWGSGWGESGYWRQV 311
Query: 290 RGKD-ECGIESS 300
R CGI ++
Sbjct: 312 RNYGNNCGIATA 323
Score = 38.4 bits (90), Expect = 0.002
Identities = 13/37 (35%), Positives = 19/37 (51%), Gaps = 5/37 (13%)
Query: 77 YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
Y + L A+ D R+ + E++DQG CGS W
Sbjct: 109 YVSSKKPLAASVDWRSN---AVS--EVKDQGQCGSSW 140
Score = 28.4 bits (64), Expect = 3.0
Identities = 11/26 (42%), Positives = 15/26 (57%)
Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
GC+GG+ A+ Y GI+S AY
Sbjct: 177 AGCDGGWMDSAFSYIHDYGIMSESAY 202
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 56.9 bits (138), Expect = 2e-09
Identities = 31/138 (22%), Positives = 51/138 (36%), Gaps = 40/138 (28%)
Query: 167 YSVSSNEKSIMKEIYEHGPVEGAFTV-FDDLILYKSGRFFVPGNETTAMSLIKWTIRDNT 225
+ EK++MK + GP+ A + + YK G +F P + M
Sbjct: 209 VDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD---------- 258
Query: 226 SQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSK--EKYWLIANSWNTDWGDN 283
H + ++G+G + KYWL+ NSW +WG
Sbjct: 259 --------------------------HGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMG 292
Query: 284 GLFKILRGKD-ECGIESS 300
G K+ + + CGI S+
Sbjct: 293 GYVKMAKDRRNHCGIASA 310
Score = 38.8 bits (91), Expect = 0.001
Identities = 12/37 (32%), Positives = 18/37 (48%), Gaps = 4/37 (10%)
Query: 77 YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+ + P + D R K T +++QG CGSCW
Sbjct: 90 QEPLFYEAPRSVDWREK--GYVT--PVKNQGQCGSCW 122
Score = 27.2 bits (61), Expect = 7.7
Identities = 10/27 (37%), Positives = 15/27 (55%), Gaps = 1/27 (3%)
Query: 11 FGCNGGFPGMAWRYWVKS-GIVSGGAY 36
GCNGG A++Y + G+ S +Y
Sbjct: 159 EGCNGGLMDYAFQYVQDNGGLDSEESY 185
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 55.7 bits (135), Expect = 2e-09
Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 20/84 (23%)
Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDE--------KSKEKYWLIANSWN 277
+ DD Y+ G A HA+ ++G+G + K Y++I NSW
Sbjct: 151 SIAASDDFAFYRGGFYDGECGAAPN-HAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWG 209
Query: 278 TDWGDNGLFKILRGKDE----CGI 297
+DWG+ G + ++ C I
Sbjct: 210 SDWGEGGYINLETDENGYKKTCSI 233
Score = 37.2 bits (87), Expect = 0.004
Identities = 10/35 (28%), Positives = 14/35 (40%), Gaps = 4/35 (11%)
Query: 79 EVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+ +D R T ++DQ CGSCW
Sbjct: 15 ADAKLDRIAYDWRLH--GGVT--PVKDQALCGSCW 45
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 56.1 bits (136), Expect = 3e-09
Identities = 26/129 (20%), Positives = 45/129 (34%), Gaps = 39/129 (30%)
Query: 170 SSNEKSIMKEIYEHGPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLG 229
S +E + + GP A V D ++Y+SG +
Sbjct: 207 SGSEVELKNLVGAEGPAAVAVDVESDFMMYRSGIYQSQ---------------------- 244
Query: 230 AEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKIL 289
+ + HA+ +G+G YW++ NSW WG+ G +++
Sbjct: 245 -TCSPLRVN-------------HAVLAVGYGTQ--GGTDYWIVKNSWGLSWGERGYIRMV 288
Query: 290 RGKD-ECGI 297
R + CGI
Sbjct: 289 RNRGNMCGI 297
Score = 40.7 bits (96), Expect = 3e-04
Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 4/47 (8%)
Query: 67 PANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
++ L + Y + +P D R T E++DQG+CGS W
Sbjct: 75 ASDILSHGVPYEANNRAVPDKIDWRES--GYVT--EVKDQGNCGSGW 117
Score = 28.0 bits (63), Expect = 4.3
Identities = 8/26 (30%), Positives = 14/26 (53%)
Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
GC GG A++Y + G+ + +Y
Sbjct: 154 NGCGGGLMENAYQYLKQFGLETESSY 179
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 54.8 bits (133), Expect = 4e-09
Identities = 20/50 (40%), Positives = 27/50 (54%), Gaps = 10/50 (20%)
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD----ECGI 297
HA+ +G+G + Y LI NSW T WG+NG +I RG CG+
Sbjct: 159 HAVAAVGYGPN------YILIKNSWGTGWGENGYIRIKRGTGNSYGVCGL 202
Score = 36.3 bits (85), Expect = 0.007
Identities = 13/30 (43%), Positives = 17/30 (56%), Gaps = 4/30 (13%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+P D R K T +++QGSCGSCW
Sbjct: 1 IPEYVDWRQK--GAVT--PVKNQGSCGSCW 26
Score = 29.0 bits (66), Expect = 1.5
Identities = 10/26 (38%), Positives = 14/26 (53%)
Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
+GCNGG+P A + + GI Y
Sbjct: 61 YGCNGGYPWSALQLVAQYGIHYRNTY 86
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 55.3 bits (134), Expect = 4e-09
Identities = 24/84 (28%), Positives = 34/84 (40%), Gaps = 20/84 (23%)
Query: 233 AFTVFDDLILYKSG-------KALGGHAIRILGWGEDE--------KSKEKYWLIANSWN 277
+ V DD YK G L HA+ ++G+G E K Y++I NSW
Sbjct: 149 SVAVSDDFAFYKEGIFDGECGDQLN-HAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWG 207
Query: 278 TDWGDNGLFKILRGKDE----CGI 297
WG+ G I + CG+
Sbjct: 208 QQWGERGFINIETDESGLMRKCGL 231
Score = 39.5 bits (93), Expect = 7e-04
Identities = 12/37 (32%), Positives = 18/37 (48%), Gaps = 4/37 (10%)
Query: 77 YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
Y + A +D R + T ++DQ +CGSCW
Sbjct: 11 YRGEENFDHAAYDWRLH--SGVT--PVKDQKNCGSCW 43
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 55.0 bits (133), Expect = 5e-09
Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 13/67 (19%)
Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
Y G L H + ++G+G E K YW + NSW WG+ G ++ +
Sbjct: 151 FYSEGVFTGECGTELD-HGVAVVGYGVAEDGK-AYWTVKNSWGPSWGEQGYIRVEKDSGA 208
Query: 295 ----CGI 297
CGI
Sbjct: 209 SGGLCGI 215
Score = 38.4 bits (90), Expect = 0.001
Identities = 15/32 (46%), Positives = 18/32 (56%), Gaps = 4/32 (12%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
DLP + D R K T ++DQG CGSCW
Sbjct: 2 SDLPPSVDWRQK--GAVT--GVKDQGKCGSCW 29
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 54.4 bits (132), Expect = 6e-09
Identities = 22/67 (32%), Positives = 31/67 (46%), Gaps = 18/67 (26%)
Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
Y+ G ++ HA+ +G+G D Y LI NSW T WG+ G +I RG
Sbjct: 143 NYRGGIFAGPCGTSID-HAVAAVGYGND------YILIKNSWGTGWGEGGYIRIKRGSGN 195
Query: 295 ----CGI 297
CG+
Sbjct: 196 PQGACGV 202
Score = 35.9 bits (84), Expect = 0.008
Identities = 13/30 (43%), Positives = 17/30 (56%), Gaps = 4/30 (13%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+P + D R K T +R+QG CGSCW
Sbjct: 1 IPTSIDWRQK--GAVT--PVRNQGGCGSCW 26
Score = 28.6 bits (65), Expect = 1.9
Identities = 12/26 (46%), Positives = 14/26 (53%)
Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
+GC GGFP A +Y SGI Y
Sbjct: 61 YGCRGGFPLYALQYVANSGIHLRQYY 86
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 55.3 bits (134), Expect = 6e-09
Identities = 18/50 (36%), Positives = 29/50 (58%), Gaps = 3/50 (6%)
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD-ECGIESS 300
HA+ +G+G + K+W+I NSW +WG+ G + R K+ CGI +
Sbjct: 261 HAVLAVGYGIQKG--NKHWIIKNSWGENWGNKGYILMARNKNNACGIANL 308
Score = 38.8 bits (91), Expect = 0.002
Identities = 15/49 (30%), Positives = 23/49 (46%), Gaps = 4/49 (8%)
Query: 65 NLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
L +R + + E + P + D R K T +++QG CGSCW
Sbjct: 81 PLSHSRSNDTLYIPEWEGRAPDSVDYRKK--GYVT--PVKNQGQCGSCW 125
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 53.7 bits (130), Expect = 9e-09
Identities = 16/47 (34%), Positives = 29/47 (61%), Gaps = 3/47 (6%)
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD-ECGI 297
H + ++G+G+ ++YWL+ NSW ++G+ G ++ R K CGI
Sbjct: 165 HGVLVVGYGDLNG--KEYWLVKNSWGHNFGEEGYIRMARNKGNHCGI 209
Score = 36.7 bits (86), Expect = 0.005
Identities = 15/30 (50%), Positives = 19/30 (63%), Gaps = 4/30 (13%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
LP + D R K C T E++ QGSCG+CW
Sbjct: 2 LPDSVDWREK--GCVT--EVKYQGSCGACW 27
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 53.7 bits (130), Expect = 9e-09
Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 5/49 (10%)
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE---CGI 297
HA+ I+G+G + YW++ NSW T WG+ G +I R CGI
Sbjct: 162 HAVTIVGYGTE--GGIDYWIVKNSWGTTWGEEGYMRIQRNVGGVGQCGI 208
Score = 35.6 bits (83), Expect = 0.010
Identities = 12/30 (40%), Positives = 15/30 (50%), Gaps = 4/30 (13%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
LP D R+ +I+DQG CGS W
Sbjct: 1 LPDYVDWRSS--GAVV--DIKDQGQCGSAW 26
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 54.2 bits (131), Expect = 1e-08
Identities = 20/64 (31%), Positives = 34/64 (53%), Gaps = 10/64 (15%)
Query: 242 LYKSGKALGG-------HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD- 293
LY+SG H + ++G+G+ ++YWL+ NSW ++G+ G ++ R K
Sbjct: 245 LYRSGVYYEPSCTQNVNHGVLVVGYGDLNG--KEYWLVKNSWGHNFGEEGYIRMARNKGN 302
Query: 294 ECGI 297
CGI
Sbjct: 303 HCGI 306
Score = 41.1 bits (97), Expect = 2e-04
Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 4/50 (8%)
Query: 64 YNLPANRLPELIGYSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+P+ + S + LP + D R K C T E++ QGSCG+ W
Sbjct: 79 LRVPSQWQRNITYKSNPNRILPDSVDWREK--GCVT--EVKYQGSCGAAW 124
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 52.9 bits (128), Expect = 2e-08
Identities = 18/47 (38%), Positives = 28/47 (59%), Gaps = 3/47 (6%)
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD-ECGI 297
HA+ +G+G + K+W+I NSW +WG+ G + R K+ CGI
Sbjct: 162 HAVLAVGYGIQKG--NKHWIIKNSWGENWGNKGYILMARNKNNACGI 206
Score = 34.8 bits (81), Expect = 0.021
Identities = 12/29 (41%), Positives = 16/29 (55%), Gaps = 4/29 (13%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
P + D R K T +++QG CGSCW
Sbjct: 2 PDSVDYRKK--GYVT--PVKNQGQCGSCW 26
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 52.8 bits (128), Expect = 2e-08
Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 10/50 (20%)
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE----CGI 297
HA+ +G+G+ Y L+ NSW +WG+ G +I R CG+
Sbjct: 159 HAVTAVGYGKT------YLLLKNSWGPNWGEKGYIRIKRASGRSKGTCGV 202
Score = 34.0 bits (79), Expect = 0.034
Identities = 11/29 (37%), Positives = 15/29 (51%), Gaps = 4/29 (13%)
Query: 85 PANFDSRTKWPNCPTIREIRDQGSCGSCW 113
P + D R K T +++Q CGSCW
Sbjct: 2 PESIDWREK--GAVT--PVKNQNPCGSCW 26
Score = 29.4 bits (67), Expect = 1.1
Identities = 8/26 (30%), Positives = 15/26 (57%)
Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
GC+GG+ + +Y V +G+ + Y
Sbjct: 61 HGCDGGYQTTSLQYVVDNGVHTEREY 86
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 53.8 bits (130), Expect = 2e-08
Identities = 17/50 (34%), Positives = 25/50 (50%), Gaps = 6/50 (12%)
Query: 252 HAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD----ECGI 297
A+ +G+G+ + Y LI NSW T WG+ G +I R CG+
Sbjct: 265 GAVTAVGYGKS--GGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGL 312
Score = 38.0 bits (89), Expect = 0.002
Identities = 17/37 (45%), Positives = 20/37 (54%), Gaps = 4/37 (10%)
Query: 77 YSEVDEDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+E +LP N D R K T +R QGSCGSCW
Sbjct: 100 INEDIVNLPENVDWRKK--GAVT--PVRHQGSCGSCW 132
Score = 29.9 bits (68), Expect = 1.0
Identities = 11/26 (42%), Positives = 13/26 (50%)
Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
GC GG+P A Y K+GI Y
Sbjct: 167 HGCKGGYPPYALEYVAKNGIHLRSKY 192
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 52.3 bits (126), Expect = 3e-08
Identities = 22/99 (22%), Positives = 35/99 (35%), Gaps = 8/99 (8%)
Query: 201 SGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWG 260
G VP A+ + + F + I H + I+G+
Sbjct: 106 DGYNGVPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQ 165
Query: 261 EDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE--CGI 297
+ YW++ NSW WG+ G ++LR CGI
Sbjct: 166 AN------YWIVRNSWGRYWGEKGYIRMLRVGGCGLCGI 198
Score = 36.1 bits (84), Expect = 0.006
Identities = 14/30 (46%), Positives = 17/30 (56%), Gaps = 4/30 (13%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
LP D R K T +++QGSCGSCW
Sbjct: 1 LPEQIDWRKK--GAVT--PVKNQGSCGSCW 26
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 51.8 bits (125), Expect = 4e-08
Identities = 19/67 (28%), Positives = 36/67 (53%), Gaps = 14/67 (20%)
Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
LY+SG + HA+ ++G+G + + + +W++ NSW +WG++G + R +
Sbjct: 145 LYRSGIFTGSCNISAN-HALTVVGYGTE--NDKDFWIVKNSWGKNWGESGYIRAERNIEN 201
Query: 295 ----CGI 297
CGI
Sbjct: 202 PDGKCGI 208
Score = 39.4 bits (93), Expect = 6e-04
Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 4/32 (12%)
Query: 82 EDLPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+DLP + D R +++QG CGSCW
Sbjct: 1 DDLPDSIDWREN--GAVV--PVKNQGGCGSCW 28
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 51.7 bits (125), Expect = 4e-08
Identities = 21/67 (31%), Positives = 30/67 (44%), Gaps = 14/67 (20%)
Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD- 293
Y SG A H + I+G+G S + YW++ NSW +WG+ G + R
Sbjct: 142 HYSSGIFTGPCGTAQN-HGVVIVGYGTQ--SGKNYWIVRNSWGQNWGNQGYIWMERNVAS 198
Query: 294 ---ECGI 297
CGI
Sbjct: 199 SAGLCGI 205
Score = 35.6 bits (83), Expect = 0.010
Identities = 12/30 (40%), Positives = 16/30 (53%), Gaps = 4/30 (13%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
LP+ D R+K I++Q CGSCW
Sbjct: 1 LPSFVDWRSK--GAVN--SIKNQKQCGSCW 26
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 51.8 bits (125), Expect = 5e-08
Identities = 21/67 (31%), Positives = 30/67 (44%), Gaps = 13/67 (19%)
Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKDE 294
Y G L H + I+G+G KYW + NSW +WG+ G ++ RG +
Sbjct: 146 FYSEGVFTGSCGTELD-HGVAIVGYGTTIDGT-KYWTVKNSWGPEWGEKGYIRMERGISD 203
Query: 295 ----CGI 297
CGI
Sbjct: 204 KEGLCGI 210
Score = 35.6 bits (83), Expect = 0.010
Identities = 14/30 (46%), Positives = 18/30 (60%), Gaps = 4/30 (13%)
Query: 84 LPANFDSRTKWPNCPTIREIRDQGSCGSCW 113
+PA+ D R K T ++DQG CGSCW
Sbjct: 2 VPASVDWRKK--GAVT--SVKDQGQCGSCW 27
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 51.0 bits (123), Expect = 7e-08
Identities = 21/67 (31%), Positives = 31/67 (46%), Gaps = 14/67 (20%)
Query: 242 LYKSG-------KALGGHAIRILGWGEDEKSKEKYWLIANSWNTDWGDNGLFKILRGKD- 293
LYKSG L HA+ +G+G + Y +I NSW +WG+ G ++ R
Sbjct: 143 LYKSGVFDGPCGTKLD-HAVTAVGYGTS--DGKNYIIIKNSWGPNWGEKGYMRLKRQSGN 199
Query: 294 ---ECGI 297
CG+
Sbjct: 200 SQGTCGV 206
Score = 34.0 bits (79), Expect = 0.039
Identities = 7/12 (58%), Positives = 11/12 (91%)
Query: 102 EIRDQGSCGSCW 113
+++QG+CGSCW
Sbjct: 15 PVKNQGACGSCW 26
Score = 29.0 bits (66), Expect = 1.7
Identities = 7/26 (26%), Positives = 14/26 (53%)
Query: 11 FGCNGGFPGMAWRYWVKSGIVSGGAY 36
+GC GG+ + +Y +G+ + Y
Sbjct: 61 YGCKGGYQTTSLQYVANNGVHTSKVY 86
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 42.9 bits (100), Expect = 7e-05
Identities = 15/83 (18%), Positives = 39/83 (46%), Gaps = 1/83 (1%)
Query: 206 VPGNETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKS 265
+ G++ +K + ++ + T + + Y + + H ++I G +D++
Sbjct: 272 LSGSDMAHWLKLKPEEKKLNTKPQPQKWCTQAERQLAYDNYETTDDHGMQIYGIAKDQE- 330
Query: 266 KEKYWLIANSWNTDWGDNGLFKI 288
+Y+++ NSW T+ NG++
Sbjct: 331 GNEYYMVKNSWGTNSKYNGIWYA 353
Score = 27.1 bits (59), Expect = 7.8
Identities = 4/11 (36%), Positives = 8/11 (72%)
Query: 103 IRDQGSCGSCW 113
+++Q G+CW
Sbjct: 25 VKNQNRAGTCW 35
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
programmed cell death; HET: DTP; 6.90A {Drosophila
melanogaster} PDB: 3iz8_A*
Length = 1221
Score = 36.8 bits (84), Expect = 0.009
Identities = 34/251 (13%), Positives = 61/251 (24%), Gaps = 89/251 (35%)
Query: 22 WRYWVKSGIVSGGAYGSKQAE--KNSLSNIP----RAHLKSWMGVHP-DYNLPANRLPEL 74
W W K ++SL+ + R + V P ++P L +
Sbjct: 344 WDNWKHVNC-------DKLTTIIESSLNVLEPAEYRKMFDR-LSVFPPSAHIPTILLSLI 395
Query: 75 IGYS--EVDEDLPANFDSRT---KWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVN- 128
+ + K P T I +
Sbjct: 396 WFDVIKSDVMVVVNKLHKYSLVEKQPKEST------------------ISI----PSIYL 433
Query: 129 GTRPSCDASKG-HTPKCVRECQENYDVPYKKDLNFGAK----SYSVSSNEKSIMKEIYEH 183
+ + H R ++Y++P D + Y Y H
Sbjct: 434 ELKVKLENEYALH-----RSIVDHYNIPKTFDSDDLIPPYLDQY------------FYSH 476
Query: 184 --------GPVEGAFTVFDDLILYKSGRFFVPGNETTAMSLIKWTIRDNTSQLGAEGAFT 235
E T+F + + RF ++ IR +++ A G+
Sbjct: 477 IGHHLKNIEHPE-RMTLFRMV--FLDFRF------------LEQKIRHDSTAWNASGSIL 521
Query: 236 -VFDDLILYKS 245
L YK
Sbjct: 522 NTLQQLKFYKP 532
Score = 32.1 bits (72), Expect = 0.23
Identities = 20/144 (13%), Positives = 42/144 (29%), Gaps = 40/144 (27%)
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIYE--HGPVEGAFTVFDDLILYKSGRFFVPGNETTAM 214
+ + F + S+M +Y LY + F N +
Sbjct: 88 RINYKFLMSPIKTEQRQPSMMTRMYIEQRDR------------LYNDNQVFAKYNVSRLQ 135
Query: 215 SLIKWTIRDNTSQLGAEGAFTVFDDLILY---KSGK-ALGGHAIRILGWGEDEKSKEK-- 268
+K +R +L ++++ SGK + K + K
Sbjct: 136 PYLK--LRQALLELRPA------KNVLIDGVLGSGKTWVALDVCL------SYKVQCKMD 181
Query: 269 ---YWLIANSWNTDWGDNGLFKIL 289
+WL + N+ + ++L
Sbjct: 182 FKIFWLNLKNCNS---PETVLEML 202
>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
SCOP: d.3.1.1 PDB: 1cb5_A
Length = 453
Score = 33.2 bits (75), Expect = 0.11
Identities = 10/37 (27%), Positives = 13/37 (35%), Gaps = 2/37 (5%)
Query: 252 HAIRILGWGED--EKSKEKYWLIANSWNTDWGDNGLF 286
HA+ E + W + NSW D G G
Sbjct: 371 HAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYL 407
>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
1gcb_A
Length = 457
Score = 32.0 bits (72), Expect = 0.26
Identities = 14/36 (38%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Query: 252 HAIRILGWGEDEKSKE-KYWLIANSWNTDWGDNGLF 286
A+ I G DE SK + + NSW D G +GL+
Sbjct: 373 AAMLITGCHVDETSKLPLRYRVENSWGKDSGKDGLY 408
>2e1b_A PH0108, 216AA long hypothetical alanyl-tRNA synthetase;
zinc-binding motif, trans-editing enzyme, structural
genomics, NPPSFA; 2.70A {Pyrococcus horikoshii} SCOP:
b.43.3.6 d.67.1.2
Length = 216
Score = 28.7 bits (65), Expect = 1.9
Identities = 7/29 (24%), Positives = 10/29 (34%), Gaps = 9/29 (31%)
Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGV 305
+ + G K L+ SSI G
Sbjct: 189 DI--KEIGHIKKLK-------RSSIGRGK 208
>2dtg_E Insulin receptor; IR ectodomain, X-RAY crystallography, hormone
receptor/immune system complex; 3.80A {Homo sapiens}
SCOP: b.1.2.1 b.1.2.1 b.1.2.1 c.10.2.5 c.10.2.5 g.3.9.1
PDB: 3loh_E
Length = 897
Score = 28.7 bits (63), Expect = 2.9
Identities = 13/58 (22%), Positives = 16/58 (27%), Gaps = 4/58 (6%)
Query: 95 PNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENY 152
CP C + C+ N R C H KC+ EC Y
Sbjct: 239 ETCPPPYYHFQDWRCVNFSFCQ----DLHHKCKNSRRQGCHQYVIHNNKCIPECPSGY 292
Score = 27.2 bits (59), Expect = 9.3
Identities = 9/64 (14%), Positives = 12/64 (18%), Gaps = 2/64 (3%)
Query: 87 NFDSRTKWPNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSC--DASKGHTPKC 144
D+ CP + + G H C C
Sbjct: 149 KDDNEECGDICPGTAKGKTNCPATVINGQFVERCWTHSHCQKVCPTICKSHGCTAEGLCC 208
Query: 145 VREC 148
EC
Sbjct: 209 HSEC 212
>1v4p_A Alanyl-tRNA synthetase; alanine-tRNA ligase, riken structural
genomics/proteomics initiative RSGI, structural
genomics; 1.45A {Pyrococcus horikoshii} SCOP: d.67.1.2
PDB: 1wxo_A 1v7o_A 1wnu_A 3rhu_A 3rfn_A
Length = 157
Score = 27.6 bits (62), Expect = 3.4
Identities = 6/29 (20%), Positives = 11/29 (37%), Gaps = 8/29 (27%)
Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGV 305
T G+ G KI + + + G+
Sbjct: 123 TT--GEIGPIKIRK------VRFRKSKGL 143
>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
Length = 2006
Score = 28.5 bits (63), Expect = 4.0
Identities = 35/207 (16%), Positives = 60/207 (28%), Gaps = 74/207 (35%)
Query: 139 GHTPKCVRECQENYDV--PYKKDL-NFGAKSYSVSSNEKSIMKEIYEHGPVEGAFTVFDD 195
G+T E ++ Y DL F A++ S ++++ G +
Sbjct: 164 GNTDDYFEELRDLYQTYHVLVGDLIKFSAETLSELIRTTLDAEKVFTQG-----L----N 214
Query: 196 LILYKSGRFFVPGNE---TTAMS--LIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALG 250
++ + P + + +S LI QL + V K LG
Sbjct: 215 ILEWLENPSNTPDKDYLLSIPISCPLIGVI------QL---AHYVVT--------AKLLG 257
Query: 251 ---GHAIRIL----GWGEDEKSKEKYWLI-------ANSWNTDWGDNG------LFKI-L 289
G L G + L+ +SW + + LF I +
Sbjct: 258 FTPGELRSYLKGATGHSQG--------LVTAVAIAETDSWE-SFFVSVRKAITVLFFIGV 308
Query: 290 RGKD---ECGIESSITA-------GVP 306
R + + SI GVP
Sbjct: 309 RCYEAYPNTSLPPSILEDSLENNEGVP 335
>2ztg_A Alanyl-tRNA synthetase; class-II aminoacyl-tRNA synthetase,
aminoacyl-tRNA synthetase, ATP-binding, cytoplasm,
ligase; HET: A5A; 2.20A {Archaeoglobus fulgidus}
Length = 739
Score = 28.0 bits (63), Expect = 5.0
Identities = 10/29 (34%), Positives = 14/29 (48%), Gaps = 9/29 (31%)
Query: 277 NTDWGDNGLFKILRGKDECGIESSITAGV 305
+T G+ G+ KIL+ SI GV
Sbjct: 710 ST--GEIGMLKILK-------VESIQDGV 729
>3fe4_A Carbonic anhydrase 6; secretion, metal binding, structural GEN
structural genomics consortium, SGC, glycoprotein,
lyase, M binding, secreted; 1.90A {Homo sapiens}
Length = 278
Score = 27.4 bits (61), Expect = 5.9
Identities = 9/19 (47%), Positives = 11/19 (57%)
Query: 113 WGCRPYEIAPCEHHVNGTR 131
WG EI+ EH V+G R
Sbjct: 95 WGGASSEISGSEHTVDGIR 113
>3hh2_C Follistatin; protein-protein complex, TB domain, cystine knot
motif, TGF- fold, disulfide linked dimer, CLE PAIR of
basic residues, cytokine; HET: CIT; 2.15A {Homo sapiens}
PDB: 2b0u_C* 2p6a_D
Length = 288
Score = 27.3 bits (59), Expect = 6.0
Identities = 13/65 (20%), Positives = 23/65 (35%), Gaps = 6/65 (9%)
Query: 95 PNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENYDV 154
+ + + G +C C+ CE+ G C +K + P+CV C +
Sbjct: 42 NDNTLFKWMIFNGGAPNCIPCK----ETCENVDCGPGKKCRMNKKNKPRCV--CAPDCSN 95
Query: 155 PYKKD 159
K
Sbjct: 96 ITWKG 100
>3k1f_M Transcription initiation factor IIB; RNA polymerase II, TFIIB,
transcription factor, DNA-binding, DNA-directed RNA
polymerase; 4.30A {Saccharomyces cerevisiae}
Length = 197
Score = 26.9 bits (59), Expect = 6.9
Identities = 9/42 (21%), Positives = 12/42 (28%), Gaps = 6/42 (14%)
Query: 77 YSEVDEDLPANFDSRTKWPNC----PTIREIRDQGS--CGSC 112
+ N + P C P I E +G C C
Sbjct: 7 IDKRAGRRGPNLNIVLTCPECKVYPPKIVERFSEGDVVCALC 48
>2hr7_A Insulin receptor; hormone receptor, leucine rich repeat,
transferase; HET: NAG BMA MAN FUC P33; 2.32A {Homo
sapiens}
Length = 486
Score = 27.1 bits (59), Expect = 8.2
Identities = 13/58 (22%), Positives = 16/58 (27%), Gaps = 4/58 (6%)
Query: 95 PNCPTIREIRDQGSCGSCWGCRPYEIAPCEHHVNGTRPSCDASKGHTPKCVRECQENY 152
CP C + C+ N R C H KC+ EC Y
Sbjct: 239 ETCPPPYYHFQDWRCVNFSFCQD----LHHKCKNSRRQGCHQYVIHNNKCIPECPSGY 292
>2f68_X Collagen adhesin; beta barrel, domain SWAP, cell adhesion; 1.95A
{Staphylococcus aureus} PDB: 2f6a_A 1amx_A
Length = 313
Score = 26.7 bits (58), Expect = 9.1
Identities = 18/130 (13%), Positives = 38/130 (29%), Gaps = 15/130 (11%)
Query: 157 KKDLNFGAKSYSVSSNEKSIMKEIY-------EHGPVEGAFTVFDDLILYKSGRFFVPGN 209
F + +++ S K + V + + YK+G +P +
Sbjct: 105 SGFAEFEVQGRNLTQTNTSDDKVATITSGNKSTNVTVHKSEAGTSSVFYYKTGDM-LPED 163
Query: 210 ETTAMSLIKWTIRDNTSQLGAEGAFTVFDDLILYKSGKALGGHAIRILGWGEDEKSKEKY 269
T ++W + N + T+ D + + G+ L + I G
Sbjct: 164 TTH----VRWFLNINNEKSYVSKDITIKDQI---QGGQQLDLSTLNINVTGTHSNYYSGQ 216
Query: 270 WLIANSWNTD 279
I +
Sbjct: 217 SAITDFEKAF 226
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.317 0.137 0.447
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 4,958,293
Number of extensions: 291432
Number of successful extensions: 834
Number of sequences better than 10.0: 1
Number of HSP's gapped: 798
Number of HSP's successfully gapped: 163
Length of query: 309
Length of database: 6,701,793
Length adjustment: 93
Effective length of query: 216
Effective length of database: 4,105,140
Effective search space: 886710240
Effective search space used: 886710240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (25.5 bits)