RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy5095
(192 letters)
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 160 bits (407), Expect = 3e-50
Identities = 60/178 (33%), Positives = 96/178 (53%), Gaps = 8/178 (4%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA-GLEAEADYPFRNQN 60
+E Q+ + GTLL LS+ +L++C+ ++ C GG + A +K+ GLE E DY ++
Sbjct: 34 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAIKNLGGLETEDDYSYQ--- 90
Query: 61 GVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDY-NGKLIRK 118
G C + A K KV + D + + L GP+ +N +Q Y +G
Sbjct: 91 GHMQSCQFSAEKAKVYIQDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPL 150
Query: 119 NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
+C ++HAV++VGYG R VP W ++NSWG WG + GY+ + RG+ ACG+ +
Sbjct: 151 RPLCSPWLIDHAVLLVGYGQRSDVPFWAIKNSWGTDWG-EKGYYYLHRGSGACGVNTM 207
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 151 bits (385), Expect = 8e-47
Identities = 47/179 (26%), Positives = 85/179 (47%), Gaps = 9/179 (5%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHA---GLEAEADYPFRN 58
+E Q+ + L LS+ L+ C+ + GC GG N A +++ + E YP+ +
Sbjct: 34 VECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYAS 93
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIR 117
G++ C V ++ + + L GP+ ++ + Y G ++
Sbjct: 94 GEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVM- 152
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIESY 175
C SE L+H V++VGY VP WI++NSW +WG ++GY + +G+N C ++
Sbjct: 153 --TSCVSEQLDHGVLLVGYNDSAAVPYWIIKNSWTTQWG-EEGYIRIAKGSNQCLVKEE 208
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 145 bits (368), Expect = 3e-43
Identities = 46/181 (25%), Positives = 73/181 (40%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
ES Y L L++ +L++C GC G + I+Y++H G+ E+ Y +
Sbjct: 123 TESAYLAYRDQSLDLAEQELVDCA-SQHGCHGDTIPRGIEYIQHNGVVQESYYRYV---A 178
Query: 62 VTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYH-YGPLVAGMNGALLQD---YNGKL 115
C + +S++ + ++ R L + + + L Y+G+
Sbjct: 179 REQSCRRPNAQR-FGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGR- 236
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ HAV IVGY V WIVRNSW WG D+GY + IE
Sbjct: 237 TIIQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWG-DNGYGYFAANIDLMMIEE 295
Query: 175 Y 175
Y
Sbjct: 296 Y 296
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 143 bits (362), Expect = 3e-43
Identities = 46/181 (25%), Positives = 73/181 (40%), Gaps = 14/181 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
ES Y L L++ +L++C GC G + I+Y++H G+ E+ Y +
Sbjct: 43 TESAYLAYRQQSLDLAEQELVDCA-SQHGCHGDTIPRGIEYIQHNGVVQESYYRYV---A 98
Query: 62 VTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYH-YGPLVAGMNGALLQD---YNGKL 115
C + +S++ + ++ R L + + + L Y+G+
Sbjct: 99 REQSCRRPNAQR-FGISNYCQIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGR- 156
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIES 174
+ HAV IVGY V WIVRNSW WG D+GY + IE
Sbjct: 157 TIIQRDNGYQPNYHAVNIVGYSNAQGVDYWIVRNSWDTNWG-DNGYGYFAANIDLMMIEE 215
Query: 175 Y 175
Y
Sbjct: 216 Y 216
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 140 bits (354), Expect = 4e-40
Identities = 57/192 (29%), Positives = 85/192 (44%), Gaps = 21/192 (10%)
Query: 1 MLESQYAIKHGTLL--PLSKSQLIECNIYNQGCQGGG-FNKAIQYLKHAGLEAEADYPFR 57
MLE++ I LS +++ C+ Y QGC+GG + A +Y + GL EA +P+
Sbjct: 242 MLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYT 301
Query: 58 NQNGVTGRCAYDARKVKVRVSDF------LVFNGSDTFRRMLYHYGPLVAGMN-GALLQD 110
G C + S++ + L H+GP+
Sbjct: 302 ---GTDSPCKMKEDCFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLH 358
Query: 111 YNG----KLIRKNDVCPSENLNHAVVIVGYGMRHQ--VPVWIVRNSWG-RWGPDDGYFTV 163
Y ++ P E NHAV++VGYG + WIV+NSWG WG ++GYF +
Sbjct: 359 YKKGIYHHTGLRDPFNPFELTNHAVLLVGYGTDSASGMDYWIVKNSWGTGWG-ENGYFRI 417
Query: 164 ERGTNACGIESY 175
RGT+ C IES
Sbjct: 418 RRGTDECAIESI 429
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 135 bits (342), Expect = 6e-40
Identities = 53/188 (28%), Positives = 94/188 (50%), Gaps = 18/188 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
+E+ +AI G L+ LS+ +LI+C ++GC G ++ ++ +KH G+ +EADYP++
Sbjct: 35 IEAAHAIATGNLVSLSEQELIDCVDESEGCYNGWHYQSFEWVVKHGGIASEADYPYK--- 91
Query: 61 GVTGRCAYDARKVKVRVSDFLVFN--------GSDTFRRMLYHYGPLVAGMNGALLQDYN 112
G+C + + KV + ++ V +++ + P+ ++ Y+
Sbjct: 92 ARDGKCKANEIQDKVTIDNYGVQILSNESTESEAESSLQSFVLEQPISVSIDAKDFHFYS 151
Query: 113 GKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNA-- 169
G + + +NH V+IVGYG V WI +NSWG WG DGY ++R T
Sbjct: 152 GGIYDGGNCSSPYGINHFVLIVGYGSEDGVDYWIAKNSWGEDWG-IDGYIRIQRNTGNLL 210
Query: 170 --CGIESY 175
CG+ +
Sbjct: 211 GVCGMNYF 218
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 134 bits (340), Expect = 6e-40
Identities = 61/182 (33%), Positives = 92/182 (50%), Gaps = 12/182 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LES AI G +L L++ QL++C N N GCQGG ++A +Y++ + G+ E YP++
Sbjct: 35 LESAVAIATGKMLSLAEQQLVDCAQNFNNHGCQGGLPSQAFEYIRYNKGIMGEDTYPYK- 93
Query: 59 QNGVTGRCAYDARKVKVRVSDF--LVFNGSDTFRRMLYHYGPLVAGMN-GALLQDY-NGK 114
G C + K V D + N + + Y P+ Y G
Sbjct: 94 --GQDDHCKFQPDKAIAFVKDVANITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGI 151
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNACGIE 173
+ + +NHAV+ VGYG + +P WIV+NSWG +WG +GYF +ERG N CG+
Sbjct: 152 YSSTSCHKTPDKVNHAVLAVGYGEENGIPYWIVKNSWGPQWG-MNGYFLIERGKNMCGLA 210
Query: 174 SY 175
+
Sbjct: 211 AC 212
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 137 bits (347), Expect = 6e-40
Identities = 61/182 (33%), Positives = 89/182 (48%), Gaps = 13/182 (7%)
Query: 2 LESQYAIKHGTLL--PLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
+ESQ I +G +S+ QL++C GC GG N A Y+ + G+++E YP+
Sbjct: 149 IESQMKIANGAGYDSSVSEQQLVDCVPNALGCSGGWMNDAFTYVAQNGGIDSEGAYPYE- 207
Query: 59 QNGVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMN-GALLQDYNGKL 115
G C YD +V R+S ++ +G D M+ GP+ + Y+G
Sbjct: 208 --MADGNCHYDPNQVAARLSGYVYLSGPDENMLADMVATKGPVAVAFDADDPFGSYSGG- 264
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIE 173
+ N C + HAV+IVGYG + W+V+NSWG WG DGYF + R N CGI
Sbjct: 265 VYYNPTCETNKFTHAVLIVGYGNENGQDYWLVKNSWGDGWG-LDGYFKIARNANNHCGIA 323
Query: 174 SY 175
Sbjct: 324 GV 325
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 136 bits (345), Expect = 1e-39
Identities = 61/182 (33%), Positives = 92/182 (50%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY++ + G+++E YP+
Sbjct: 133 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV--- 189
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSD--TFRRMLYHYGPLVAGMNGAL--LQDY-NGKL 115
G C Y+ + + + +R + GP+ ++ +L Q Y G
Sbjct: 190 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKG-- 247
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIE 173
+ ++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 248 VYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGIA 306
Query: 174 SY 175
+
Sbjct: 307 NL 308
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 135 bits (342), Expect = 3e-39
Identities = 55/181 (30%), Positives = 92/181 (50%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E Q A++ G L LS+ LI+C + N GC GG + A Y+ G+ +E+ YP+
Sbjct: 148 VEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGGWMDSAFSYIHDYGIMSESAYPYE-- 205
Query: 60 NGVTGRCAYDARKVKVRVSDFL-VFNGS-DTFRRMLYHYGPLVAGMN-GALLQDYNGKLI 116
C +D+ + +S + + +G ++ + GP+ ++ LQ Y+G +
Sbjct: 206 -AQGDYCRFDSSQSVTTLSGYYDLPSGDENSLADAVGQAGPVAVAIDATDELQFYSGG-L 263
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIES 174
+ C +LNH V++VGYG + WI++NSWG WG + GY+ R N CGI +
Sbjct: 264 FYDQTCNQSDLNHGVLVVGYGSDNGQDYWILKNSWGSGWG-ESGYWRQVRNYGNNCGIAT 322
Query: 175 Y 175
Sbjct: 323 A 323
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 134 bits (340), Expect = 5e-39
Identities = 63/181 (34%), Positives = 90/181 (49%), Gaps = 12/181 (6%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQ 59
+E QY T + S+ QL++C N GC GG A QYLK GLE E+ YP+
Sbjct: 125 MEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGGLMENAYQYLKQFGLETESSYPYT-- 182
Query: 60 NGVTGRCAYDARKVKVRVSDFL-VFNGS-DTFRRMLYHYGPLVAGMN-GALLQDYNGKLI 116
V G+C Y+ + +V+ F V +GS + ++ GP ++ + Y I
Sbjct: 183 -AVEGQCRYNKQLGVAKVTGFYTVHSGSEVELKNLVGAEGPAAVAVDVESDFMMYRSG-I 240
Query: 117 RKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIES 174
++ C +NHAV+ VGYG + WIV+NSWG WG + GY + R N CGI S
Sbjct: 241 YQSQTCSPLRVNHAVLAVGYGTQGGTDYWIVKNSWGLSWG-ERGYIRMVRNRGNMCGIAS 299
Query: 175 Y 175
Sbjct: 300 L 300
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 134 bits (339), Expect = 7e-39
Identities = 54/186 (29%), Positives = 85/186 (45%), Gaps = 17/186 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q K G L+ LS+ L++C N+GC GG + A QY++ + GL++E YP+
Sbjct: 130 LEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYE- 188
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGAL--LQDYNGKL 115
C Y+ + + F+ + + + GP+ ++ Y
Sbjct: 189 --ATEESCKYNPKYSVANDAGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEG- 245
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPV----WIVRNSWG-RWGPDDGYFTVERGT-NA 169
I C SE+++H V++VGYG W+V+NSWG WG GY + + N
Sbjct: 246 IYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWG-MGGYVKMAKDRRNH 304
Query: 170 CGIESY 175
CGI S
Sbjct: 305 CGIASA 310
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 131 bits (331), Expect = 1e-38
Identities = 62/182 (34%), Positives = 94/182 (51%), Gaps = 14/182 (7%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
LE Q K G LL LS L++C N GC GG A QY++ + G+++E YP+
Sbjct: 34 LEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYV--- 90
Query: 61 GVTGRCAYDARKVKVRVSDFL-VFNGS-DTFRRMLYHYGPLVAGMNGAL--LQDY-NGKL 115
G C Y+ + + + G+ +R + GP+ ++ +L Q Y G
Sbjct: 91 GQEESCMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKG-- 148
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACGIE 173
+ ++ C S+NLNHAV+ VGYG++ WI++NSWG WG + GY + R NACGI
Sbjct: 149 VYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWG-NKGYILMARNKNNACGIA 207
Query: 174 SY 175
+
Sbjct: 208 NL 209
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 129 bits (326), Expect = 1e-37
Identities = 53/191 (27%), Positives = 82/191 (42%), Gaps = 24/191 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
+ESQYAI+ L S+ +L++C++ N GC GG A + GL ++ DYP+ +
Sbjct: 53 VESQYAIRKKALFLFSEQELVDCSVKNNGCYGGYITNAFDDMIDLGGLCSQDDYPYVSNL 112
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRKN 119
C + + + V D F+ L + GP+ + Y G
Sbjct: 113 P--ETCNLKRCNERYTIKSY-VSIPDDKFKEALRYLGPISISIAASDDFAFYRGGFYDGE 169
Query: 120 DVCPSENLNHAVVIVGYGM----------RHQVPVWIVRNSWG-RWGPDDGYFTVERGTN 168
C NHAV++VGYGM + +I++NSWG WG + GY +E N
Sbjct: 170 --C-GAAPNHAVILVGYGMKDIYNEDTGRMEKFYYYIIKNSWGSDWG-EGGYINLETDEN 225
Query: 169 A----CGIESY 175
C I +
Sbjct: 226 GYKKTCSIGTE 236
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 128 bits (324), Expect = 1e-37
Identities = 55/187 (29%), Positives = 86/187 (45%), Gaps = 19/187 (10%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
LE Q K G L+ LS+ L++C N+GC GG + A QY++ + GL++E YP+
Sbjct: 34 LEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYE- 92
Query: 59 QNGVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMNGAL--LQDY-NGK 114
C Y+ + + F+ + + + GP+ ++ Y G
Sbjct: 93 --ATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEG- 149
Query: 115 LIRKNDVCPSENLNHAVVIVGYGMRHQ----VPVWIVRNSWG-RWGPDDGYFTVERGT-N 168
I C SE+++H V++VGYG W+V+NSWG WG GY + + N
Sbjct: 150 -IYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWG-MGGYVKMAKDRRN 207
Query: 169 ACGIESY 175
CGI S
Sbjct: 208 HCGIASA 214
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 127 bits (322), Expect = 4e-37
Identities = 51/188 (27%), Positives = 90/188 (47%), Gaps = 24/188 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
+ESQYAI+ L+ LS+ +L++C+ N GC GG N A + ++ G+ + DYP+ +
Sbjct: 51 VESQYAIRKNKLITLSEQELVDCSFKNYGCNGGLINNAFEDMIELGGICPDGDYPYVSDA 110
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRKN 119
C D K + ++ + + + L GP+ + Y + +
Sbjct: 111 P--NLCNIDRCTEKYGIKNY-LSVPDNKLKEALRFLGPISISVAVSDDFAFYKEGIF--D 165
Query: 120 DVCPSENLNHAVVIVGYGMRHQVPV----------WIVRNSWG-RWGPDDGYFTVERGTN 168
C + LNHAV++VG+GM+ V +I++NSWG +WG + G+ +E +
Sbjct: 166 GEC-GDQLNHAVMLVGFGMKEIVNPLTKKGEKHYYYIIKNSWGQQWG-ERGFINIETDES 223
Query: 169 A----CGI 172
CG+
Sbjct: 224 GLMRKCGL 231
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 129 bits (327), Expect = 4e-37
Identities = 56/184 (30%), Positives = 96/184 (52%), Gaps = 16/184 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC---NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C N+GC GG A QY+ + G++++A YP++
Sbjct: 132 LEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYK 191
Query: 58 NQNGVTGRCAYDARKVKVRVSDFL-VFNGS-DTFRRMLYHYGPLVAGMNGAL--LQDYNG 113
+ +C YD++ S + + G D + + + GP+ G++ Y
Sbjct: 192 ---AMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRS 248
Query: 114 KLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NACG 171
+ C ++N+NH V++VGYG + W+V+NSWG +G ++GY + R N CG
Sbjct: 249 G-VYYEPSC-TQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFG-EEGYIRMARNKGNHCG 305
Query: 172 IESY 175
I S+
Sbjct: 306 IASF 309
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 124 bits (314), Expect = 4e-36
Identities = 57/185 (30%), Positives = 98/185 (52%), Gaps = 18/185 (9%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC---NIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFR 57
LE+Q +K G L+ LS L++C N+GC GG A QY + + G++++A YP++
Sbjct: 35 LEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYK 94
Query: 58 NQNGVTGRCAYDARKVKVRVSDFL-VFNGS-DTFRRMLYHYGPLVAGMNGAL--LQDY-N 112
+ +C YD++ S + + G D + + + GP+ G++ Y +
Sbjct: 95 ---AMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRS 151
Query: 113 GKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGT-NAC 170
G + C ++N+NH V++VGYG + W+V+NSWG +G ++GY + R N C
Sbjct: 152 G--VYYEPSC-TQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFG-EEGYIRMARNKGNHC 207
Query: 171 GIESY 175
GI S+
Sbjct: 208 GIASF 212
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 123 bits (310), Expect = 1e-35
Identities = 56/181 (30%), Positives = 86/181 (47%), Gaps = 20/181 (11%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
+ES I+ G L+ LS+ +L++C+ N GC GG F A QY + + G++ +A+YP++
Sbjct: 34 VESINQIRTGNLISLSEQELVDCDKKNHGCLGGAFVFAYQYIINNGGIDTQANYPYK--- 90
Query: 61 GVTGRCAYDARKVKVRVSDFL-VFNGSDTFRRMLYHYGPLVAGMN--GALLQDYNGKLIR 117
V G C ++ V + + V ++ + P ++ A Q Y+ +
Sbjct: 91 AVQGPCQAASKVVS--IDGYNGVPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIF- 147
Query: 118 KNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVER--GTNACGIES 174
LNH V IVGY + WIVRNSWG WG + GY + R G CGI
Sbjct: 148 --SGPCGTKLNHGVTIVGYQANY----WIVRNSWGRYWG-EKGYIRMLRVGGCGLCGIAR 200
Query: 175 Y 175
Sbjct: 201 L 201
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 123 bits (311), Expect = 1e-35
Identities = 44/202 (21%), Positives = 74/202 (36%), Gaps = 54/202 (26%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E I+ G L+ LS+ +L++C + GC+GG A++Y+ G+ + YP++
Sbjct: 34 VEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKNGIHLRSKYPYK---A 90
Query: 62 VTGRCAYDARKVKV-RVSDF-------------LVFNG---------SDTFRRMLYHYGP 98
G C + + S + F LY G
Sbjct: 91 KQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPF--QLYKGG- 147
Query: 99 LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
+ + G C ++HAV VGYG +++NSWG WG +
Sbjct: 148 -I----------FEGP-------C-GTKVDHAVTAVGYGKSGGKGYILIKNSWGTAWG-E 187
Query: 158 DGYFTVERGTNA----CGIESY 175
GY ++R CG+
Sbjct: 188 KGYIRIKRAPGNSPGVCGLYKS 209
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 122 bits (310), Expect = 2e-35
Identities = 52/185 (28%), Positives = 85/185 (45%), Gaps = 24/185 (12%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E I G LL LS+ +L++C + GC+GG A+QY+ ++G+ YP+ G
Sbjct: 34 VEGINKIVTGQLLSLSEQELLDCERRSYGCRGGFPLYALQYVANSGIHLRQYYPYE---G 90
Query: 62 VTGRCAYDARK-VKVRVSDFLVFNGSDTFRRMLYH---YGPLVAGMN--GALLQDYNGKL 115
V +C K KV+ ++ + L P+ + G Q+Y G +
Sbjct: 91 VQRQCRASQAKGPKVKTDGVGRVPRNNE--QALIQRIAIQPVSIVVEAKGRAFQNYRGGI 148
Query: 116 IRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPDDGYFTVERGTNA----C 170
C +++HAV VGYG +++NSWG WG + GY ++RG+ C
Sbjct: 149 F--AGPC-GTSIDHAVAAVGYG----NDYILIKNSWGTGWG-EGGYIRIKRGSGNPQGAC 200
Query: 171 GIESY 175
G+ S
Sbjct: 201 GVLSD 205
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 122 bits (309), Expect = 2e-35
Identities = 57/202 (28%), Positives = 81/202 (40%), Gaps = 54/202 (26%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQY-LKHAGLEAEADYPFRNQN 60
+E I G L+ LS+ QL++C N GC+GG N A Q+ + + G+ +E YP+R
Sbjct: 36 VEGINQIVTGDLISLSEQQLVDCTTANHGCRGGWMNPAFQFIVNNGGINSEETYPYR--- 92
Query: 61 GVTGRCAYDARKVKVRVSDF-------------LVFNG---------SDTFRRMLYHYGP 98
G G C V + + V N F LY G
Sbjct: 93 GQDGICNSTVNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDF--QLYRSG- 149
Query: 99 LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
+ + G C + + NHA+ +VGYG + WIV+NSWG WG +
Sbjct: 150 -I----------FTGS-------C-NISANHALTVVGYGTENDKDFWIVKNSWGKNWG-E 189
Query: 158 DGYFTVERGTNA----CGIESY 175
GY ER CGI +
Sbjct: 190 SGYIRAERNIENPDGKCGITRF 211
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 124 bits (314), Expect = 4e-35
Identities = 50/206 (24%), Positives = 78/206 (37%), Gaps = 38/206 (18%)
Query: 1 MLESQYAIKHGT-LLPLSKSQLIEC-NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFR- 57
+ ++ G + +S L+ C + GC GG ++A Y GL ++ P+
Sbjct: 108 AMSDRFCTMGGVQDVHISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLVSDYCQPYPF 167
Query: 58 -----------------NQNGVTGRCAYDARKVKVRVSDFLVF-----NGSDTFRRMLYH 95
N T +C Y + V ++ + G D + R L+
Sbjct: 168 PHCSHHSKSKNGYPPCSQFNFDTPKCDYTCDDPTIPVVNYRSWTSYALQGEDDYMRELFF 227
Query: 96 YGPLVAGMN-GALLQDYNGKLIRK----NDVCPSENLNHAVVIVGYGMRHQVPVWIVRNS 150
GP + Y + V HAV +VG+G + VP W + NS
Sbjct: 228 RGPFEVAFDVYEDFIAY------NSGVYHHVSGQYLGGHAVRLVGWGTSNGVPYWKIANS 281
Query: 151 WGR-WGPDDGYFTVERGTNACGIESY 175
W WG DGYF + RG++ CGIE
Sbjct: 282 WNTEWG-MDGYFLIRRGSSECGIEDG 306
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 120 bits (304), Expect = 1e-34
Identities = 50/202 (24%), Positives = 80/202 (39%), Gaps = 58/202 (28%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E I G L+ LS+ +L++C + GC GG ++QY+ G+ E +YP+
Sbjct: 34 IEGINKIITGQLISLSEQELLDCERRSHGCDGGYQTTSLQYVVDNGVHTEREYPYE---K 90
Query: 62 VTGRCAYDARK-VKVRVSDF-------------LVFNG---------SDTFRRMLYHYGP 98
GRC +K KV ++ + + N F Y G
Sbjct: 91 KQGRCRAKDKKGPKVYITGYKYVPANDEISLIQAIANQPVSVVTDSRGRGF--QFYKGG- 147
Query: 99 LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
+ Y G C N +HAV VGYG + +++NSWG WG +
Sbjct: 148 -I----------YEGP-------C-GTNTDHAVTAVGYGKTY----LLLKNSWGPNWG-E 183
Query: 158 DGYFTVERGT----NACGIESY 175
GY ++R + CG+ +
Sbjct: 184 KGYIRIKRASGRSKGTCGVYTS 205
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 123 bits (311), Expect = 1e-34
Identities = 43/202 (21%), Positives = 73/202 (36%), Gaps = 54/202 (26%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E I+ G L+ LS+ +L++C + GC+GG A++Y+ G+ + YP++
Sbjct: 140 VEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKNGIHLRSKYPYK---A 196
Query: 62 VTGRCAYDARKVKV-RVSDFL-------------VFNG---------SDTFRRMLYHYGP 98
G C + + S + F LY G
Sbjct: 197 KQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPF--QLYKGG- 253
Query: 99 LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
+ + G C ++ AV VGYG +++NSWG WG +
Sbjct: 254 -I----------FEGP-------C-GTKVDGAVTAVGYGKSGGKGYILIKNSWGTAWG-E 293
Query: 158 DGYFTVERGTNA----CGIESY 175
GY ++R CG+
Sbjct: 294 KGYIRIKRAPGNSPGVCGLYKS 315
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 120 bits (304), Expect = 1e-34
Identities = 51/202 (25%), Positives = 83/202 (41%), Gaps = 54/202 (26%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E I G LL LS+ +L++C+ ++ GC+GG ++QY+ + G+ YP++
Sbjct: 34 VEGINKIVTGNLLELSEQELVDCDKHSYGCKGGYQTTSLQYVANNGVHTSKVYPYQ---A 90
Query: 62 VTGRC-AYDARKVKVRVSDF-------------LVFNG---------SDTFRRMLYHYGP 98
+C A D KV+++ + + N F LY G
Sbjct: 91 KQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPF--QLYKSG- 147
Query: 99 LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
V ++G C L+HAV VGYG I++NSWG WG +
Sbjct: 148 -V----------FDGP-------C-GTKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWG-E 187
Query: 158 DGYFTVERGTNA----CGIESY 175
GY ++R + CG+
Sbjct: 188 KGYMRLKRQSGNSQGTCGVYKS 209
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 120 bits (303), Expect = 1e-34
Identities = 46/202 (22%), Positives = 74/202 (36%), Gaps = 58/202 (28%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNG 61
+E I+ G L S+ +L++C+ + GC GG A+Q + G+ YP+ G
Sbjct: 34 IEGIIKIRTGNLNQYSEQELLDCDRRSYGCNGGYPWSALQLVAQYGIHYRNTYPYE---G 90
Query: 62 VTGRCAY-DARKVKVRVSDF-------------LVFNG---------SDTFRRMLYHYGP 98
V C + + + N F LY G
Sbjct: 91 VQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDF--QLYRGG- 147
Query: 99 LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
+ + G C ++HAV VGYG +++NSWG WG +
Sbjct: 148 -I----------FVGP-------C-GNKVDHAVAAVGYG----PNYILIKNSWGTGWG-E 183
Query: 158 DGYFTVERGTNA----CGIESY 175
+GY ++RGT CG+ +
Sbjct: 184 NGYIRIKRGTGNSYGVCGLYTS 205
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 121 bits (306), Expect = 2e-34
Identities = 47/212 (22%), Positives = 79/212 (37%), Gaps = 42/212 (19%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKA-IQYLKHAG-LEAEADYP-- 55
LE+ +K +S + C + C G +Q ++ G L AE++YP
Sbjct: 43 LETIRCMKGYEPTKISALYVANCYKGEHKDRCDEGSSPMEFLQIIEDYGFLPAESNYPYN 102
Query: 56 -------------FRNQNGVTGRCAYDARKVKVRVSDFLVFNGSDTFR-----------R 91
G+ ++ + S+ F
Sbjct: 103 YVKVGEQCPKVEDHWMNLWDNGKILHNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKT 162
Query: 92 MLYHYGPLVAGMN--GALLQDYNGKLIRKNDVCPSENLNHAVVIVGYG-----MRHQVPV 144
+ + G ++A + + +++GK + ++C + +HAV IVGYG +
Sbjct: 163 EVMNKGSVIAYIKAENVMGYEFSGKKV--KNLCGDDTADHAVNIVGYGNYVNSEGEKKSY 220
Query: 145 WIVRNSWG-RWGPDDGYFTVER-GTNACGIES 174
WIVRNSWG WG D+GYF V+ G C
Sbjct: 221 WIVRNSWGPYWG-DEGYFKVDMYGPTHCHFNF 251
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 117 bits (296), Expect = 2e-33
Identities = 58/202 (28%), Positives = 84/202 (41%), Gaps = 55/202 (27%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQN 60
+ES I+ G L+ LS+ +L++C+ + GC GG N A QY+ + G++ + +YP+
Sbjct: 34 VESINKIRTGQLISLSEQELVDCDTASHGCNGGWMNNAFQYIITNGGIDTQQNYPYS--- 90
Query: 61 GVTGRCAYDARKVKVRVSDF-------------LVFNG---------SDTFRRMLYHYGP 98
V G C +V V ++ F V + F Y G
Sbjct: 91 AVQGSCKPYRLRV-VSINGFQRVTRNNESALQSAVASQPVSVTVEAAGAPF--QHYSSG- 146
Query: 99 LVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RWGPD 157
+ + G C NH VVIVGYG + WIVRNSWG WG +
Sbjct: 147 -I----------FTGP-------C-GTAQNHGVVIVGYGTQSGKNYWIVRNSWGQNWG-N 186
Query: 158 DGYFTVERGTNA----CGIESY 175
GY +ER + CGI
Sbjct: 187 QGYIWMERNVASSAGLCGIAQL 208
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 118 bits (297), Expect = 2e-33
Identities = 52/205 (25%), Positives = 78/205 (38%), Gaps = 57/205 (27%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY--NQGCQGGGFNKAIQY-LKHAGLEAEADYPFRN 58
LE + K G L+ LS+ +L++C+ NQ C GG N A QY L G+ +E YP+
Sbjct: 40 LEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYL- 98
Query: 59 QNGVTGRCAYDARKVKVRVSDF-------------LVFNG---------SDTFRRMLYHY 96
C + + V++ F + F YH
Sbjct: 99 --ARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPF--QFYHE 154
Query: 97 GPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGM--RHQVPVWIVRNSWG-R 153
G V ++ C +L+H V++VGYG + WI++NSWG
Sbjct: 155 G--V----------FDAS-------C-GTDLDHGVLLVGYGTDKESKKDFWIMKNSWGTG 194
Query: 154 WGPDDGYFTVERGT---NACGIESY 175
WG DGY + CG+
Sbjct: 195 WG-RDGYMYMAMHKGEEGQCGLLLD 218
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 118 bits (299), Expect = 3e-33
Identities = 44/197 (22%), Positives = 72/197 (36%), Gaps = 23/197 (11%)
Query: 1 MLESQYAIKHGTLLP---LSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFR 57
+ + IK P LS +I+C C+GG Y G+ E ++
Sbjct: 74 AMADRINIKRKGAWPSTLLSVQNVIDCG-NAGSCEGGNDLSVWDYAHQHGIPDETCNNYQ 132
Query: 58 NQNG------------VTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMN- 104
++ C RV D+ +G + +Y GP+ G+
Sbjct: 133 AKDQECDKFNQCGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMA 192
Query: 105 GALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGR-WGPDDGYFTV 163
L +Y G + + + +NH V + G+G+ WIVRNSWG WG + G+ +
Sbjct: 193 TERLANYTGGIYA--EYQDTTYINHVVSVAGWGISDGTEYWIVRNSWGEPWG-ERGWLRI 249
Query: 164 ERGTNACGIESYG--GI 178
T G + I
Sbjct: 250 VTSTYKDGKGARYNLAI 266
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 114 bits (287), Expect = 5e-32
Identities = 51/204 (25%), Positives = 79/204 (38%), Gaps = 56/204 (27%)
Query: 2 LESQYAIKHGTLLPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRN 58
+E I G L+ LS+ +L++C +GC GG Q++ + G+ EA+YP+
Sbjct: 34 VEGINKIATGDLISLSEQELVDCGRTQNTRGCDGGFMTDGFQFIINNGGINTEANYPYT- 92
Query: 59 QNGVTGRCAYDARKVKV-RVSDF-------------LVFNG---------SDTFRRMLYH 95
G+C D ++ K + + V F Y
Sbjct: 93 --AEEGQCNLDLQQEKYVSIDTYENVPYNNEWALQTAVAYQPVSVALEAAGYNF--QHYS 148
Query: 96 YGPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG-RW 154
G + + G C ++HAV IVGYG + WIV+NSWG W
Sbjct: 149 SG--I----------FTGP-------C-GTAVDHAVTIVGYGTEGGIDYWIVKNSWGTTW 188
Query: 155 GPDDGYFTVERG---TNACGIESY 175
G ++GY ++R CGI
Sbjct: 189 G-EEGYMRIQRNVGGVGQCGIAKK 211
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 114 bits (287), Expect = 6e-32
Identities = 55/205 (26%), Positives = 75/205 (36%), Gaps = 57/205 (27%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNI-YNQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQ 59
+E IK L+ LS+ +L++C+ NQGC GG + A +++K G+ EA+YP+
Sbjct: 35 VEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPYE-- 92
Query: 60 NGVTGRCAYDARKVKV-RVSDF-------------LVFNG---------SDTFRRMLYHY 96
G C + V N F Y
Sbjct: 93 -AYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDF--QFYSE 149
Query: 97 GPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGM-RHQVPVWIVRNSWG-RW 154
G V + G C L+H V IVGYG W V+NSWG W
Sbjct: 150 G--V----------FTGS-------C-GTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEW 189
Query: 155 GPDDGYFTVERGT----NACGIESY 175
G + GY +ERG CGI
Sbjct: 190 G-EKGYIRMERGISDKEGLCGIAME 213
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 113 bits (285), Expect = 8e-32
Identities = 51/185 (27%), Positives = 87/185 (47%), Gaps = 16/185 (8%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYL-KHAGLEAEADYPFRNQN 60
+E AI G L+ +S+ Q+++C+ GG + A +++ + G+ ++A+YP+
Sbjct: 34 IEGIDAITTGRLISVSEQQIVDCDTXXXXXXGGDADDAFRWVITNGGIASDANYPYT--- 90
Query: 61 GVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGAL--LQDYNGKLIRK 118
GV G C + + + R+ + S + P+ + + Q Y G I
Sbjct: 91 GVDGTCDLN-KPIAARIDGYTNVPNSSSALLDAVAKQPVSVNIYTSSTSFQLYTGPGIFA 149
Query: 119 NDVCPSE--NLNHAVVIVGYGMRHQ-VPVWIVRNSWG-RWGPDDGYFTVERGTNA----C 170
C + ++H V+IVGYG WIV+NSWG WG DGY + R TN C
Sbjct: 150 GSSCSDDPATVDHTVLIVGYGSNGTNADYWIVKNSWGTEWG-IDGYILIRRNTNRPDGVC 208
Query: 171 GIESY 175
I+++
Sbjct: 209 AIDAW 213
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 111 bits (281), Expect = 7e-31
Identities = 62/208 (29%), Positives = 85/208 (40%), Gaps = 60/208 (28%)
Query: 2 LESQYAIKHGTLLPLSKSQLIECNIY-NQGCQGGGFNKAIQYLK-HAGLEAEADYPFRNQ 59
+E AI+ G+L+ LS+ +LI+C+ N GCQGG + A +Y+K + GL EA YP+R
Sbjct: 37 VEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYR-- 94
Query: 60 NGVTGRCAYD----ARKVKVRVSDF-------------LVFNG---------SDTFRRML 93
G C V V + V N F M
Sbjct: 95 -AARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAF--MF 151
Query: 94 YHYGPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQ-VPVWIVRNSWG 152
Y G V + G+ C L+H V +VGYG+ W V+NSWG
Sbjct: 152 YSEG--V----------FTGE-------C-GTELDHGVAVVGYGVAEDGKAYWTVKNSWG 191
Query: 153 -RWGPDDGYFTVERGTNA----CGIESY 175
WG + GY VE+ + A CGI
Sbjct: 192 PSWG-EQGYIRVEKDSGASGGLCGIAME 218
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 111 bits (280), Expect = 1e-30
Identities = 48/217 (22%), Positives = 75/217 (34%), Gaps = 50/217 (23%)
Query: 1 MLESQYAIKHGTL--LPLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPF 56
+ + I + +S L+ C ++ GC GG +A + GL + Y
Sbjct: 43 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 102
Query: 57 R-------------NQNGVTGRC-------------------AYDARKVKVRVSDFLVFN 84
+ NG C Y K + + V N
Sbjct: 103 HVGCRPYSIPPCEAHVNGARPPCTGEGDTPKCSKICEPGYSPTYKQDKHYG-YNSYSVSN 161
Query: 85 GSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRK----NDVCPSENLNHAVVIVGYGMR 139
+Y GP+ + + Y K V HA+ I+G+G+
Sbjct: 162 SEKDIMAEIYKNGPVEGAFSVYSDFLLY------KSGVYQHVTGEMMGGHAIRILGWGVE 215
Query: 140 HQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
+ P W+V NSW WG D+G+F + RG + CGIES
Sbjct: 216 NGTPYWLVANSWNTDWG-DNGFFKILRGQDHCGIESE 251
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 111 bits (279), Expect = 2e-30
Identities = 49/217 (22%), Positives = 79/217 (36%), Gaps = 50/217 (23%)
Query: 1 MLESQYAIKHG--TLLPLSKSQLIEC-NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFR 57
+ + I+ G + LS L+ C GC+GG A Y G+ +
Sbjct: 39 AMSDRSCIQSGGKQNVELSAVDLLSCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENH 98
Query: 58 -----------------------NQNGVTGRC----------AYDARKVKVRVSDFLVFN 84
++ T RC Y K + S + V N
Sbjct: 99 AGCEPYPFPKCEHHTKGKYPPCGSKIYKTPRCKQTCQKKYKTPYTQDKHRG-KSSYNVKN 157
Query: 85 GSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRK----NDVCPSENLNHAVVIVGYGMR 139
++ + YGP+ AG +Y K + HA+ I+G+G+
Sbjct: 158 DEKAIQKEIMKYGPVEAGFTVYEDFLNY------KSGIYKHITGETLGGHAIRIIGWGVE 211
Query: 140 HQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
++ P W++ NSW WG ++GYF + RG + C IES
Sbjct: 212 NKAPYWLIANSWNEDWG-ENGYFRIVRGRDECSIESE 247
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 112 bits (281), Expect = 3e-30
Identities = 48/217 (22%), Positives = 75/217 (34%), Gaps = 50/217 (23%)
Query: 1 MLESQYAIKHGTLL--PLSKSQLIEC--NIYNQGCQGGGFNKAIQYLKHAGLEAEADYPF 56
+ + I + +S L+ C ++ GC GG +A + GL + Y
Sbjct: 100 AISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYES 159
Query: 57 R-------------NQNGVTGRC-------------------AYDARKVKVRVSDFLVFN 84
+ NG C Y K + + V N
Sbjct: 160 HVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYG-YNSYSVSN 218
Query: 85 GSDTFRRMLYHYGPLVAGMN-GALLQDYNGKLIRK----NDVCPSENLNHAVVIVGYGMR 139
+Y GP+ + + Y K V HA+ I+G+G+
Sbjct: 219 SEKDIMAEIYKNGPVEGAFSVYSDFLLY------KSGVYQHVTGEMMGGHAIRILGWGVE 272
Query: 140 HQVPVWIVRNSWGR-WGPDDGYFTVERGTNACGIESY 175
+ P W+V NSW WG D+G+F + RG + CGIES
Sbjct: 273 NGTPYWLVANSWNTDWG-DNGFFKILRGQDHCGIESE 308
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 98.8 bits (246), Expect = 2e-25
Identities = 36/200 (18%), Positives = 66/200 (33%), Gaps = 30/200 (15%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNI----YNQGCQGGGF-NKAIQYLKHAGLEAEADYP 55
++ + + + I N + G I+ L G+ E ++P
Sbjct: 86 AIQFERIHDKQSPEFIPSRLFIYYNERKIEGHVNYDSGAMIRDGIKVLHKLGVCPEKEWP 145
Query: 56 FRNQNG----------------VTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPL 99
+ + + +C DA+ K+ V D + L P
Sbjct: 146 YGDTPADPRTEEFPPGAPASKKPSDQCYKDAQNYKI-TEYSRVAQDIDHLKACLAVGSPF 204
Query: 100 VAGMN-GALLQDYNGKLIRKNDVCPSEN--LNHAVVIVGYGMRHQVPVWIVRNSWG-RWG 155
V G + N +R ++ HAV+ VGY ++ + +RNSWG G
Sbjct: 205 VFGFSVYNSWVGNNSLPVRIPLPTKNDTLEGGHAVLCVGYDD--EIRHFRIRNSWGNNVG 262
Query: 156 PDDGYFTVERG-TNACGIES 174
+DGYF + + +
Sbjct: 263 -EDGYFWMPYEYISNTQLAD 281
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 37.5 bits (86), Expect = 0.001
Identities = 9/42 (21%), Positives = 17/42 (40%), Gaps = 1/42 (2%)
Query: 122 CPSENLNHAVVIVGYGM-RHQVPVWIVRNSWGRWGPDDGYFT 162
+H + I G + ++V+NSWG +G +
Sbjct: 311 NYETTDDHGMQIYGIAKDQEGNEYYMVKNSWGTNSKYNGIWY 352
Score = 27.5 bits (60), Expect = 3.3
Identities = 13/73 (17%), Positives = 22/73 (30%), Gaps = 12/73 (16%)
Query: 1 MLESQYAIKHGTLLPLSKSQLIECNIYNQG------------CQGGGFNKAIQYLKHAGL 48
LES+ LS+ + ++ QGG F A+ ++ GL
Sbjct: 42 FLESELLRMGKGEYDLSEMFTVYNTYLDRADAAVRTHGDVSFSQGGSFYDALYGMETFGL 101
Query: 49 EAEADYPFRNQNG 61
E +
Sbjct: 102 VPEEEMRPGMMYA 114
>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
1gcb_A
Length = 457
Score = 37.0 bits (85), Expect = 0.002
Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 4/90 (4%)
Query: 78 SDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKNDVCPSENL-NHAVVIVGY 136
++ VF GS T + M G + + YN + + + E+L A++I G
Sbjct: 321 NNKAVFFGSHTPKFMDKKTGVMDIELWNYPAIGYNLPQQKASRIRYHESLMTAAMLITGC 380
Query: 137 GM---RHQVPVWIVRNSWGRWGPDDGYFTV 163
+ + V NSWG+ DG + +
Sbjct: 381 HVDETSKLPLRYRVENSWGKDSGKDGLYVM 410
>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
SCOP: d.3.1.1 PDB: 1cb5_A
Length = 453
Score = 35.5 bits (81), Expect = 0.008
Identities = 10/40 (25%), Positives = 13/40 (32%), Gaps = 4/40 (10%)
Query: 128 NHAVVIVGYGMRHQVP----VWIVRNSWGRWGPDDGYFTV 163
HA+ + W V NSWG GY +
Sbjct: 370 THAMTFTAVSEKDDQDGAFTKWRVENSWGEDHGHKGYLCM 409
>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
Length = 2006
Score = 31.9 bits (72), Expect = 0.11
Identities = 21/119 (17%), Positives = 36/119 (30%), Gaps = 37/119 (31%)
Query: 38 KAIQYLKHAGLEAEADYPFRNQNGVTGRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYG 97
KAI L G+ YP + ++ + N ML
Sbjct: 298 KAITVLFFIGVRCYEAYP---------NTSLPPSILEDSLE-----NNEGVPSPML---- 339
Query: 98 PLVAGMNGALLQDYNGKLIRK-NDVCPSE--------NLNHAVVIVG-----YGMRHQV 142
++ + +QDY + K N P+ N +V+ G YG+ +
Sbjct: 340 -SISNLTQEQVQDY----VNKTNSHLPAGKQVEISLVNGAKNLVVSGPPQSLYGLNLTL 393
Score = 31.2 bits (70), Expect = 0.26
Identities = 18/98 (18%), Positives = 31/98 (31%), Gaps = 20/98 (20%)
Query: 12 TLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADY--------PFRNQNGVT 63
TL L ++ L ++ QG +++L++ + DY P GV
Sbjct: 194 TLSELIRTTLDAEKVFTQGLN------ILEWLENPSNTPDKDYLLSIPISCPL---IGVI 244
Query: 64 GRCAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVA 101
Y V ++ F + H LV
Sbjct: 245 QLAHY---VVTAKLLGFTPGELRSYLKGATGHSQGLVT 279
>1nz9_A Transcription antitermination protein NUSG; transcription
elongation, riken structural genomics/proteomics
initiative, RSGI; NMR {Thermus thermophilus} SCOP:
b.34.5.4
Length = 58
Score = 27.5 bits (62), Expect = 0.48
Identities = 9/28 (32%), Positives = 12/28 (42%)
Query: 53 DYPFRNQNGVTGRCAYDARKVKVRVSDF 80
PF + G + KVKV V+ F
Sbjct: 15 SGPFADFTGTVTEINPERGKVKVMVTIF 42
>2z0t_A Putative uncharacterized protein PH0355; alpha/beta protein, RNA
binding protein, structural genomics, NPPSFA; 1.80A
{Pyrococcus horikoshii} PDB: 1s04_A
Length = 109
Score = 27.8 bits (62), Expect = 1.00
Identities = 10/61 (16%), Positives = 23/61 (37%), Gaps = 13/61 (21%)
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGS------------DTFRRMLYHYGPLVAGMNGALLQ 109
+ GR + R+ +++ D ++F G +F+ ML G ++
Sbjct: 22 IEGRLYDEKRR-QIKPGDIIIFEGGKLKVKVKGIRVYSSFKEMLEKEGIENVLPGVKSIE 80
Query: 110 D 110
+
Sbjct: 81 E 81
>3r4c_A Hydrolase, haloacid dehalogenase-like hydrolase; haloalkanoate
dehalogenase enzyme superfamily, phosphohydrol
hydrolase; 1.82A {Bacteroides thetaiotaomicron}
Length = 268
Score = 27.5 bits (62), Expect = 2.8
Identities = 22/142 (15%), Positives = 41/142 (28%), Gaps = 53/142 (37%)
Query: 11 GTLLPLSKSQLIECNIYNQGCQGGGFNKAIQYLKHAGLEAEADYPFRNQNGV-----TGR 65
GTLL ++ + +I A++ + +G+ TGR
Sbjct: 21 GTLLSFETHKVSQSSI-----------DALKKVH--------------DSGIKIVIATGR 55
Query: 66 CAYDARKVKVRVSDFLVFNGSDTFRRMLYHYGPLVAGMNGALLQDYNGKLIRKNDVCPSE 125
A D ++ Y ++A +NGA +G +IRK +
Sbjct: 56 AASDLHEID------------------AVPYDGVIA-LNGAECVLRDGSVIRKVAI---- 92
Query: 126 NLNHAVVIVGYGMRHQVPVWIV 147
+ V +
Sbjct: 93 PAQDFRKSMELAREFDFAVALE 114
>3mpo_A Predicted hydrolase of the HAD superfamily; SGX, PSI, structural
genomics, protein structure initiative; 2.90A
{Lactobacillus brevis}
Length = 279
Score = 27.1 bits (61), Expect = 3.8
Identities = 8/76 (10%), Positives = 24/76 (31%), Gaps = 12/76 (15%)
Query: 93 LYHYGPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWG 152
+ NG++ Q +GK++ + + + + + + + I
Sbjct: 61 IDGDDQYAITFNGSVAQTISGKVLTNHSL----TYEDYIDLEAWARKVRAHFQIET---- 112
Query: 153 RWGPDDGYFTVERGTN 168
D +T + +
Sbjct: 113 ----PDYIYTANKDIS 124
>2b30_A Pvivax hypothetical protein; SGPP, structural genomics, PSI,
protein structure initiative; 2.70A {Plasmodium vivax}
SCOP: c.108.1.10
Length = 301
Score = 26.8 bits (60), Expect = 5.1
Identities = 8/71 (11%), Positives = 22/71 (30%), Gaps = 10/71 (14%)
Query: 95 HYGPLVAGMNGALLQDYNGKLIRKNDVCPSENLNHAVVIVGYGMRHQVPVWIVRNSWGRW 154
YG +NG ++ D G + + + ++ Y + + + +
Sbjct: 89 FYGMPGVYINGTIVYDQIGYTLLDETI----ETDVYAELISYLVEKNLVNQTIFHR---- 140
Query: 155 GPDDGYFTVER 165
+ + E
Sbjct: 141 --GESNYVTED 149
>2jvv_A Transcription antitermination protein NUSG; transcription factor,
transcription regulation, transcription termination; NMR
{Escherichia coli} PDB: 2k06_A 2kvq_G
Length = 181
Score = 26.3 bits (59), Expect = 5.1
Identities = 11/26 (42%), Positives = 15/26 (57%)
Query: 55 PFRNQNGVTGRCAYDARKVKVRVSDF 80
PF + NGV Y+ ++KV VS F
Sbjct: 140 PFADFNGVVEEVDYEKSRLKVSVSIF 165
>3n6r_B Propionyl-COA carboxylase, beta subunit; protein complex,
biotin-dependent carboxylase, ligase; HET: BTI; 3.20A
{Roseobacter denitrificans}
Length = 531
Score = 26.4 bits (59), Expect = 6.2
Identities = 11/25 (44%), Positives = 12/25 (48%)
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGS 86
VTG + R V V DF V GS
Sbjct: 96 VTGWGTINGRVVYVFSQDFTVLGGS 120
>1nrw_A Hypothetical protein, haloacid dehalogenase-like hydrolase;
structural genomics, PSI, protein structure initiative;
1.70A {Bacillus subtilis} SCOP: c.108.1.10
Length = 288
Score = 26.5 bits (59), Expect = 6.6
Identities = 6/16 (37%), Positives = 9/16 (56%)
Query: 104 NGALLQDYNGKLIRKN 119
NGA++ D G+L
Sbjct: 68 NGAVIHDPEGRLYHHE 83
>1vrg_A Propionyl-COA carboxylase, beta subunit; TM0716, structural joint
center for structural genomics, JCSG, protein structu
initiative, PSI; HET: MSE; 2.30A {Thermotoga maritima}
SCOP: c.14.1.4 c.14.1.4
Length = 527
Score = 26.4 bits (59), Expect = 7.8
Identities = 11/25 (44%), Positives = 13/25 (52%)
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGS 86
+TG + RKV V DF V GS
Sbjct: 89 ITGVGEINGRKVAVFSQDFTVMGGS 113
>1m1h_A Transcription antitermination protein NUSG; transcription
termination, RNP motif, immunoglobulin fold, nucleic
acid interaction; 1.95A {Aquifex aeolicus} SCOP:
b.114.1.1 d.58.42.1 PDB: 1m1g_A 1npp_A 1npr_A
Length = 248
Score = 25.8 bits (57), Expect = 8.1
Identities = 9/26 (34%), Positives = 12/26 (46%)
Query: 55 PFRNQNGVTGRCAYDARKVKVRVSDF 80
PF N G + RK+ V +S F
Sbjct: 207 PFMNFTGTVEEVHPEKRKLTVMISIF 232
>1x0u_A Hypothetical methylmalonyl-COA decarboxylase ALPH; lyase; 2.20A
{Sulfolobus tokodaii}
Length = 522
Score = 26.0 bits (58), Expect = 9.7
Identities = 11/25 (44%), Positives = 11/25 (44%)
Query: 62 VTGRCAYDARKVKVRVSDFLVFNGS 86
VTG D R V DF V GS
Sbjct: 82 VTGWGKVDGRTVFAYAQDFTVLGGS 106
>2xhc_A Transcription antitermination protein NUSG; 2.45A {Thermotoga
maritima}
Length = 352
Score = 25.8 bits (57), Expect = 9.9
Identities = 8/26 (30%), Positives = 14/26 (53%)
Query: 55 PFRNQNGVTGRCAYDARKVKVRVSDF 80
PF + GV + +++KV V+ F
Sbjct: 311 PFEDFAGVIKEIDPERQELKVNVTIF 336
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.322 0.141 0.450
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 3,116,359
Number of extensions: 184801
Number of successful extensions: 496
Number of sequences better than 10.0: 1
Number of HSP's gapped: 398
Number of HSP's successfully gapped: 75
Length of query: 192
Length of database: 6,701,793
Length adjustment: 88
Effective length of query: 104
Effective length of database: 4,244,745
Effective search space: 441453480
Effective search space used: 441453480
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 54 (24.3 bits)