RPS-BLAST 2.2.26 [Sep-21-2011]
Database: pdb70
27,921 sequences; 6,701,793 total letters
Searching..................................................done
Query= psy12185
(317 letters)
>2o6x_A Procathepsin L1, secreted cathepsin L 1; hydrolase, thiol protease,
cysteine protease, zymogen, hydro; 1.40A {Fasciola
hepatica}
Length = 310
Score = 207 bits (530), Expect = 1e-65
Identities = 74/285 (25%), Positives = 129/285 (45%), Gaps = 37/285 (12%)
Query: 35 ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSEE 93
+L+ +++ Y K Y+ ++ R +EK++ I+E N ++ + G+ +F+D++ E
Sbjct: 3 DLWHQWKRMYNKEYNGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQFTDMTFE 62
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
EFK ++L ++SH ++ ++ +P K DWRE+G + +V
Sbjct: 63 EFKAKYLTEMSRASDILSHGVPYEANN---------------RAVPDKIDWRESGYVTEV 107
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDW 212
++Q CG+ WAFST T E + T S Q+++DC+ GN GC GG
Sbjct: 108 KDQGNCGSGWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSRPWGNNGCGGG-------L 160
Query: 213 MD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
M+ + + LE ES YP + C+ K+ + + S E +
Sbjct: 161 MENAYQYLKQFGLETESSYPYTAVEGQCRYNK-QLGVAKVTGF---YTVHSGSEVELKNL 216
Query: 266 IATHGPVIAAVNAL-TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ GP AV+ + Y G+ + S +NHAV VGY
Sbjct: 217 VGAEGPAAVAVDVESDFMMYRSGIY-QSQTCSPLRVNHAVLAVGY 260
>1cs8_A Human procathepsin L; prosegment, propeptide, inhibition,
hydrolase; HET: OCS; 1.80A {Homo sapiens} SCOP: d.3.1.1
PDB: 1cjl_A 3hwn_A*
Length = 316
Score = 200 bits (510), Expect = 2e-62
Identities = 73/281 (25%), Positives = 121/281 (43%), Gaps = 30/281 (10%)
Query: 35 ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSEE 93
++ ++ + + Y +E R +EK++ +IE N + R+ S + F D++ E
Sbjct: 10 AQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSE 69
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKV 153
EF+ K + P DWRE G + V
Sbjct: 70 EFRQVMNGFQNRKPRKGKVFQEPLF-----------------YEAPRSVDWREKGYVTPV 112
Query: 154 RNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDW 212
+NQ CG+CWAFS E K G L LS Q ++DC+G GN GC+GG +
Sbjct: 113 KNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQY 172
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SESSILTDIATHGP 271
+ N L+ E YP + +CK + + IP E +++ +AT GP
Sbjct: 173 VQDNG-GLDSEESYPYEATEESCKYNP-KYSVANDAGF---VDIPKQEKALMKAVATVGP 227
Query: 272 VIAAVNA--LTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
+ A++A ++ +Y G+ +C S +++H V +VGY
Sbjct: 228 ISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGY 266
>1by8_A Protein (procathepsin K); hydrolase(sulfhydryl proteinase), papain;
2.60A {Homo sapiens} SCOP: d.3.1.1 PDB: 7pck_A
Length = 314
Score = 198 bits (506), Expect = 5e-62
Identities = 74/284 (26%), Positives = 121/284 (42%), Gaps = 33/284 (11%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSE 92
+ +++ ++K Y +K + R +EK+L I N + + + D++
Sbjct: 9 THWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTS 68
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EE + V S+ + P D+R+ G +
Sbjct: 69 EEVVQKMTGLKVPLSHSRSNDTLYIPEWE--------------GRAPDSVDYRKKGYVTP 114
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+NQ CG+CWAFS+V E K G L LS Q ++DC N GC GG +
Sbjct: 115 VKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS-ENDGCGGGYMTNAFQY 173
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHG 270
+ N+ ++ E YP + ++ +C T K + Y IP +E ++ +A G
Sbjct: 174 VQKNR-GIDSEDAYPYVGQEESCMYNPTGK-AAKCRGY---REIPEGNEKALKRAVARVG 228
Query: 271 PVIAAVNA--LTWQYYLGGVIQYN---CDGSLANINHAVQIVGY 309
PV A++A ++Q+Y GV Y C + N+NHAV VGY
Sbjct: 229 PVSVAIDASLTSFQFYSKGV--YYDESC--NSDNLNHAVLAVGY 268
>3qj3_A Cathepsin L-like protein; hydrolase, proteinase, larVal midgut;
1.85A {Tenebrio molitor}
Length = 331
Score = 199 bits (507), Expect = 7e-62
Identities = 77/283 (27%), Positives = 122/283 (43%), Gaps = 25/283 (8%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSE 92
E + +F+ Y +SY + E R + F+K L+ EE N K RQ S G+ F+D++
Sbjct: 20 EKWENFKTTYARSYVNAKEETFRKQIFQKKLETFEEHNEKYRQGLVSYTLGVNLFTDMTP 79
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EE K + + + + + + P DWR+ G++
Sbjct: 80 EEMKAYTHGLIMP-----ADLHKNGIPIKTREDLGLNASVRYPASF----DWRDQGMVSP 130
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSL--LSVQEVIDCAGNGNMGCSGGDFCALL 210
V+NQ +CG+ WAFS+ ES + NG +S Q+++DC +GCSGG
Sbjct: 131 VKNQGSCGSSWAFSSTGAIESQMKIANGAGYDSSVSEQQLVDCVP-NALGCSGGWMNDAF 189
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTDIAT 268
++ N ++ E YP + D C + ++ Y + E+ + +AT
Sbjct: 190 TYVAQNG-GIDSEGAYPYEMADGNCHYDP-NQVAARLSGY---VYLSGPDENMLADMVAT 244
Query: 269 HGPVIAAVNAL-TWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
GPV A +A + Y GGV C HAV IVGY
Sbjct: 245 KGPVAVAFDADDPFGSYSGGVYYNPTC--ETNKFTHAVLIVGY 285
>1pci_A Procaricain; zymogen, hydrolase, thiol protease; 3.20A {Carica
papaya} SCOP: d.3.1.1
Length = 322
Score = 197 bits (502), Expect = 2e-61
Identities = 83/298 (27%), Positives = 136/298 (45%), Gaps = 33/298 (11%)
Query: 18 FLAIPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQ 75
F + ++L +LF+S+ + K Y + E RF+ F+ +L+ I+E NK
Sbjct: 2 FSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKN- 60
Query: 76 SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIP 135
S G+ EF+DLS +EF +++ ++ + S+ + +
Sbjct: 61 --NSYWLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEEFINEDI-------------- 104
Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
+P DWR+ G + VR+Q +CG+CWAFS V T E ++ ++ G L LS QE++DC
Sbjct: 105 VNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCER 164
Query: 196 NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTL 255
+ GC GG L+++ N + S+YP K C+ K VK
Sbjct: 165 -RSHGCKGGYPPYALEYVAKNG--IHLRSKYPYKAKQGTCRAKQVGGPIVKTSGV---GR 218
Query: 256 IPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ E ++L IA PV V + +Q Y GG+ + C + + AV VGY
Sbjct: 219 VQPNNEGNLLNAIA-KQPVSVVVESKGRPFQLYKGGIFEGPCGTKV---DGAVTAVGY 272
>1xkg_A DER P I, major mite fecal allergen DER P 1; major allergen,
cysteine protease, house DUST mite, dermatop
pteronyssinus; 1.61A {Dermatophagoides pteronyssinus}
SCOP: d.3.1.1
Length = 312
Score = 194 bits (495), Expect = 2e-60
Identities = 68/293 (23%), Positives = 114/293 (38%), Gaps = 46/293 (15%)
Query: 27 KPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGIT 85
+P+ + F +++ + KSY + + + KNF +S+ ++ I
Sbjct: 1 RPSSI---KTFEEYKKAFNKSYATFEDEEAARKNFLESVKYVQSNGG----------AIN 47
Query: 86 EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWR 145
SDLS +EFK R L + L + + + + + P I D R
Sbjct: 48 HLSDLSLDEFKNRFLMSAEAFEHLKTQFDLNA------ETNACSINGNAPAEI----DLR 97
Query: 146 EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGD 205
+ + +R Q CG+ WAFS V ES + L+ QE++DCA GC G
Sbjct: 98 QMRTVTPIRMQGGCGSAWAFSGVAATESAYLAYRDQSLDLAEQELVDCA--SQHGCHGD- 154
Query: 206 FCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
+ + + ES Y + ++ +C+R I +Y P+ +
Sbjct: 155 ------TIPRGIEYIQHNGVVQESYYRYVAREQSCRRPNA--QRFGISNYC-QIYPPNAN 205
Query: 261 SILTDIA-THGPV---IAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
I +A TH + I + +++Y G I D HAV IVGY
Sbjct: 206 KIREALAQTHSAIAVIIGIKDLDAFRHYDGRTI-IQRDNGYQPNYHAVNIVGY 257
>2c0y_A Procathepsin S; proenzyme, proteinase, hydrolase, thiol protease,
prosegment binding loop, glycoprotein, lysosome,
protease, zymogen; 2.1A {Homo sapiens}
Length = 315
Score = 194 bits (495), Expect = 3e-60
Identities = 77/284 (27%), Positives = 122/284 (42%), Gaps = 33/284 (11%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSE 92
+ +++ Y K Y K+E +R +EK+L + N ++ S G+ D++
Sbjct: 10 HHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTS 69
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EE + V +R+IT +P DWRE G + +
Sbjct: 70 EEVMSLMSSLRVPSQ----------------WQRNITYKSNPNRILPDSVDWREKGCVTE 113
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN--GNMGCSGGDFCALL 210
V+ Q +CGA WAFS V E+ LK G L LS Q ++DC+ GN GC+GG
Sbjct: 114 VKYQGSCGAAWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAF 173
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTDIAT 268
++ NK ++ ++ YP D C+ + Y T +P E + +A
Sbjct: 174 QYIIDNK-GIDSDASYPYKAMDQKCQYDS-KYRAATCSKY---TELPYGREDVLKEAVAN 228
Query: 269 HGPVIAAVNA--LTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
GPV V+A ++ Y GV +C ++ NH V +VGY
Sbjct: 229 KGPVSVGVDARHPSFFLYRSGVYYEPSCTQNV---NHGVLVVGY 269
>3qt4_A Cathepsin-L-like midgut cysteine proteinase; hydrolase, zymogen,
intramolecular DISS bonds, insect larVal midgut; HET:
PG4 PG6; 2.11A {Tenebrio molitor}
Length = 329
Score = 190 bits (486), Expect = 7e-59
Identities = 80/289 (27%), Positives = 127/289 (43%), Gaps = 44/289 (15%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSE 92
E +S F+ +KKSY S E R F+ ++ I E N K + + + +F D+S+
Sbjct: 25 EQWSQFKLTHKKSYSSPIEEIRRQLIFKDNVAKIAEHNAKFEKGEVTYSKAMNQFGDMSK 84
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEF R K + + + DWR +
Sbjct: 85 EEFLAYVNRGKAQKPKHPENLRMPYVSSK--------------KPLAASVDWRSNAVSE- 129
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLD 211
V++Q CG+ W+FST E AL+ G L+ LS Q +IDC+ + GN GC GG
Sbjct: 130 VKDQGQCGSSWSFSTTGAVEGQLALQRGRLTSLSEQNLIDCSSSYGNAGCDGG------- 182
Query: 212 WMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILT 264
WMD ++ + ES YP + C+ + S + + Y +P E+S+
Sbjct: 183 WMDSAFSYIHDYGIMSESAYPYEAQGDYCRFDS-SQSVTTLSGY---YDLPSGDENSLAD 238
Query: 265 DIATHGPVIAAVNAL-TWQYYLGGVIQYN---CDGSLANINHAVQIVGY 309
+ GPV A++A Q+Y GG+ + C + +++NH V +VGY
Sbjct: 239 AVGQAGPVAVAIDATDELQFYSGGL--FYDQTC--NQSDLNHGVLVVGY 283
>3pdf_A Cathepsin C, dipeptidyl peptidase 1; two domains, cystein protease,
hydrolase-hydrolase inhibitor; HET: LXV NAG; 1.85A {Homo
sapiens} PDB: 1jqp_A* 2djf_B* 1k3b_B* 2djg_B* 2djf_A*
1k3b_A* 2djg_A* 2djf_C* 1k3b_C* 2djg_C*
Length = 441
Score = 159 bits (404), Expect = 1e-45
Identities = 57/297 (19%), Positives = 103/297 (34%), Gaps = 43/297 (14%)
Query: 32 QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLS 91
+K+ S S+ + ++ + ++ +N ++S + Y E+ L+
Sbjct: 116 KKVGTASENVYVNTAHLKNSQEKYSNRLYKYDHNFVKAINAIQKSWTATTY--MEYETLT 173
Query: 92 EEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWRE---AG 148
+ R HS + + +P DWR
Sbjct: 174 LGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILF-------------LPTSWDWRNVHGIN 220
Query: 149 IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSL--LSVQEVIDCAGNGNMGCSGGDF 206
+ VRNQ +CG+C++F+++ E+ + LS QEV+ C+ GC GG
Sbjct: 221 FVSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQ-YAQGCEGG-- 277
Query: 207 CALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDT---LIP 257
+ L E+ +P D+ CK K Y
Sbjct: 278 -----FPYLIAGKYAQDFGLVEEACFPYTGTDSPCKMKEDCF-RYYSSEYHYVGGFYGGC 331
Query: 258 SESSILTDIATHGPVIAAVNALT-WQYYLGGVIQYNCDGSLAN----INHAVQIVGY 309
+E+ + ++ HGP+ A + +Y G+ + N NHAV +VGY
Sbjct: 332 NEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELTNHAVLLVGY 388
>1m6d_A Cathepsin F, catsf; papain family cysteine protease, hydrolase;
HET: MYP; 1.70A {Homo sapiens} SCOP: d.3.1.1
Length = 214
Score = 151 bits (383), Expect = 7e-45
Identities = 56/179 (31%), Positives = 79/179 (44%), Gaps = 20/179 (11%)
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
P + DWR G + KV++Q CG+CWAFS E L GTL LS QE++DC +
Sbjct: 2 PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDK-MD 60
Query: 199 MGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
C GG N LE E +Y +C+ A V I+
Sbjct: 61 KACMGG-------LPSNAYSAIKNLGGLETEDDYSYQGHMQSCQFSAEKA-KVYIQDS-- 110
Query: 253 DTLIP-SESSILTDIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
+ +E + +A GP+ A+NA Q+Y G+ + S I+HAV +VGY
Sbjct: 111 -VELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCSPWLIDHAVLLVGY 168
>1s4v_A Cysteine endopeptidase; KDEL ER retention signal, endosperm,
ricinosomes, SEED germi senescence, hydrolase-hydrolase
inhibitor complex; 2.00A {Ricinus communis} SCOP:
d.3.1.1
Length = 229
Score = 149 bits (378), Expect = 5e-44
Identities = 58/182 (31%), Positives = 89/182 (48%), Gaps = 24/182 (13%)
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
+P DWR+ G + V++Q CG+CWAFST+ E ++ +K L LS QE++DC +
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61
Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
N GC+GG MD + + E+ YP D C + V I +
Sbjct: 62 NQGCNGG-------LMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGH- 113
Query: 252 CDTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
+P E+++L +A + PV A++A +Q+Y GV +C L +H V IV
Sbjct: 114 --ENVPENDENALLKAVA-NQPVSVAIDAGGSDFQFYSEGVFTGSCGTEL---DHGVAIV 167
Query: 308 GY 309
GY
Sbjct: 168 GY 169
>2xu3_A Cathepsin L1; hydrolase, drug design, thiol protease; HET: XU3 BTB;
0.90A {Homo sapiens} PDB: 2xu4_A* 2xu5_A* 2yj2_A*
2yj8_A* 2yj9_A* 2yjb_A* 2yjc_A* 3bc3_A* 3h89_A* 3h8b_A*
3h8c_A* 3of9_A* 3of8_A* 3hha_A* 2xu1_A* 3iv2_A* 3k24_A*
2nqd_B* 3kse_A* 2vhs_A ...
Length = 220
Score = 145 bits (369), Expect = 1e-42
Identities = 58/182 (31%), Positives = 86/182 (47%), Gaps = 24/182 (13%)
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-G 197
P DWRE G + V+NQ CG+CWAFS E K G L LS Q ++DC+G G
Sbjct: 2 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQG 61
Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
N GC+GG MD + L+ E YP + +CK +
Sbjct: 62 NEGCNGG-------LMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYS-VANDTGFV 113
Query: 252 CDTLIPS-ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQ-YNCDGSLANINHAVQIV 307
IP E +++ +AT GP+ A++A ++ +Y G+ +C S +++H V +V
Sbjct: 114 D---IPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVV 168
Query: 308 GY 309
GY
Sbjct: 169 GY 170
>3ioq_A CMS1MS2; caricaceae, cysteine protease, papain family, hydrolase;
HET: E64 SO4; 1.87A {Carica candamarcensis}
Length = 213
Score = 144 bits (367), Expect = 2e-42
Identities = 56/181 (30%), Positives = 77/181 (42%), Gaps = 24/181 (13%)
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
IP DWR+ G + VRNQ CG+CW FS+V E ++ + G L LS QE++DC
Sbjct: 1 IPTSIDWRQKGAVTPVRNQGGCGSCWTFSSVAAVEGINKIVTGQLLSLSEQELLDCER-R 59
Query: 198 NMGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
+ GC GG + V + YP C+ VK
Sbjct: 60 SYGCRGG-------FPLYALQYVANSGIHLRQYYPYEGVQRQCRASQAKGPKVKTDGV-- 110
Query: 253 DTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
+P E +++ IA PV V A +Q Y GG+ C S+ +HAV VG
Sbjct: 111 -GRVPRNNEQALIQRIA-IQPVSIVVEAKGRAFQNYRGGIFAGPCGTSI---DHAVAAVG 165
Query: 309 Y 309
Y
Sbjct: 166 Y 166
>2cio_A Papain; hydrolase/inhibitor, complex hydrolase/inhibitor, ICP,
cysteine protease, allergen, protease, thiol protease;
1.5A {Carica papaya} PDB: 1khq_A 1khp_A 1ppn_A 3e1z_B
3ima_A 3lfy_A 9pap_A 1bqi_A* 1bp4_A* 1pad_A 1pe6_A*
1pip_A* 1pop_A* 1ppd_A 1ppp_A* 1stf_E* 2pad_A 4pad_A*
5pad_A* 6pad_A* ...
Length = 212
Score = 144 bits (367), Expect = 2e-42
Identities = 54/181 (29%), Positives = 82/181 (45%), Gaps = 24/181 (13%)
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
IP DWR+ G + V+NQ +CG+CWAFS V T E + ++ G L+ S QE++DC
Sbjct: 1 IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCDR-R 59
Query: 198 NMGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
+ GC+GG + V + + + YP C+ + P K
Sbjct: 60 SYGCNGG-------YPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQ 112
Query: 253 DTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
+ E ++L IA + PV + A +Q Y GG+ C + +HAV VG
Sbjct: 113 ---VQPYNEGALLYSIA-NQPVSVVLEAAGKDFQLYRGGIFVGPCGNKV---DHAVAAVG 165
Query: 309 Y 309
Y
Sbjct: 166 Y 166
>2bdz_A Mexicain; cysteine protease, peptidase_C1, papain-like, HYDR; HET:
E64; 2.10A {Jacaratia mexicana}
Length = 214
Score = 144 bits (367), Expect = 2e-42
Identities = 61/180 (33%), Positives = 86/180 (47%), Gaps = 24/180 (13%)
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
P DWRE G + V+NQ CG+CWAFSTV T E ++ + G L LS QE++DC +
Sbjct: 2 PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCER-RS 60
Query: 199 MGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
GC GG + V + E EYP K C+ K V I Y
Sbjct: 61 HGCDGG-------YQTTSLQYVVDNGVHTEREYPYEKKQGRCRAKDKKGPKVYITGY--- 110
Query: 254 TLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+P+ E S++ IA + PV ++ +Q+Y GG+ + C + +HAV VGY
Sbjct: 111 KYVPANDEISLIQAIA-NQPVSVVTDSRGRGFQFYKGGIYEGPCGTNT---DHAVTAVGY 166
>8pch_A Cathepsin H; hydrolase, protease, cysteine proteinase,
aminopeptidase; HET: NAG BMA; 2.10A {Sus scrofa} SCOP:
d.3.1.1 PDB: 1nb3_A* 1nb5_A*
Length = 220
Score = 144 bits (367), Expect = 2e-42
Identities = 53/183 (28%), Positives = 78/183 (42%), Gaps = 23/183 (12%)
Query: 139 PVKKDWREAG-IIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN- 196
P DWR+ G + V+NQ +CG+CW FST ES A+ G + L+ Q+++DCA N
Sbjct: 2 PPSMDWRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQQLVDCAQNF 61
Query: 197 GNMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
N GC GG + E YP +D CK + +K
Sbjct: 62 NNHGCQGG-------LPSQAFEYIRYNKGIMGEDTYPYKGQDDHCKFQPDKA-IAFVKDV 113
Query: 251 TCDTLIPS--ESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQ-YNCDGSLANINHAVQI 306
I E +++ +A + PV A + Y G+ +C + +NHAV
Sbjct: 114 ---ANITMNDEEAMVEAVALYNPVSFAFEVTNDFLMYRKGIYSSTSCHKTPDKVNHAVLA 170
Query: 307 VGY 309
VGY
Sbjct: 171 VGY 173
>3i06_A Cruzipain; autocatalytic cleavage, glycoprotein, protease, thiol
protease, zymogen; HET: QL2; 1.10A {Trypanosoma cruzi}
PDB: 1ewm_A* 1ewo_A* 1ewl_A* 1f29_A* 1ewp_A* 1f2b_A*
1f2c_A* 1f2a_A* 1me4_A* 1u9q_X* 2aim_A* 2efm_A* 2oz2_A*
1me3_A* 3kku_A* 3lxs_A* 1aim_A* 3iut_A* 3hd3_A* 2p86_A*
...
Length = 215
Score = 144 bits (366), Expect = 2e-42
Identities = 53/181 (29%), Positives = 79/181 (43%), Gaps = 23/181 (12%)
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
P DWR G + V++Q CG+CWAFS + E L L+ LS Q ++ C +
Sbjct: 2 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCD-KTD 60
Query: 199 MGCSGGDFCALLDWMD-----VNKVV---LEPESEYPLLLKDAACK--RKATSPNGVKIK 248
GCSGG M+ + + + E YP + + G I
Sbjct: 61 SGCSGG-------LMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATIT 113
Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
+ L E+ I +A +GPV AV+A +W Y GGV+ +C ++H V +VG
Sbjct: 114 GHV--ELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT-SCVSE--QLDHGVLLVG 168
Query: 309 Y 309
Y
Sbjct: 169 Y 169
>1ppo_A Protease omega; hydrolase(thiol protease); 1.80A {Carica papaya}
SCOP: d.3.1.1 PDB: 1meg_A*
Length = 216
Score = 144 bits (366), Expect = 3e-42
Identities = 58/181 (32%), Positives = 85/181 (46%), Gaps = 24/181 (13%)
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
+P DWR+ G + VR+Q +CG+CWAFS V T E ++ ++ G L LS QE++DC
Sbjct: 1 LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCER-R 59
Query: 198 NMGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
+ GC GG + V K + S+YP K C+ K VK
Sbjct: 60 SHGCKGG-------YPPYALEYVAKNGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGR 112
Query: 253 DTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
+ +E ++L IA PV V + +Q Y GG+ + C + +HAV VG
Sbjct: 113 ---VQPNNEGNLLNAIA-KQPVSVVVESKGRPFQLYKGGIFEGPCGTKV---DHAVTAVG 165
Query: 309 Y 309
Y
Sbjct: 166 Y 166
>3p5u_A Actinidin; SAD, cysteine proteinases, hydrolase; 1.50A {Actinidia
arguta} PDB: 3p5v_A 3p5w_A 3p5x_A 1aec_A* 2act_A
Length = 220
Score = 144 bits (366), Expect = 3e-42
Identities = 52/182 (28%), Positives = 82/182 (45%), Gaps = 23/182 (12%)
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN- 196
+P DWR +G + +++Q CG+ WAFST+ E ++ + G L LS QE++DC
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSAWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 197 GNMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
GC GG +M +N + E+ YP ++ C V I +Y
Sbjct: 61 NTRGCDGG-------FMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTY 113
Query: 251 TCDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
+P + L + PV A+ A +Q+Y G+ C ++ +HAV IV
Sbjct: 114 ---ENVPYNNEWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAV---DHAVTIV 167
Query: 308 GY 309
GY
Sbjct: 168 GY 169
>1yal_A Chymopapain; hydrolase, thiol protease; 1.70A {Carica papaya} SCOP:
d.3.1.1 PDB: 1gec_E*
Length = 218
Score = 144 bits (365), Expect = 3e-42
Identities = 60/180 (33%), Positives = 80/180 (44%), Gaps = 24/180 (13%)
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
P DWR G + V+NQ CG+CWAFST+ T E ++ + G L LS QE++DC +
Sbjct: 2 PQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDK-HS 60
Query: 199 MGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
GC GG + V + YP K C+ VKI Y
Sbjct: 61 YGCKGG-------YQTTSLQYVANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGY--- 110
Query: 254 TLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+PS E+S L +A + P+ V A +Q Y GV C L +HAV VGY
Sbjct: 111 KRVPSNCETSFLGALA-NQPLSVLVEAGGKPFQLYKSGVFDGPCGTKL---DHAVTAVGY 166
>3ovx_A Cathepsin S; hydrolase, covalent inhibitor, aldehyde warhead is
covalently bound to Cys25, lysosomeal protein; HET: O64;
1.49A {Homo sapiens} PDB: 2h7j_A* 2f1g_A* 2hh5_B*
2hhn_A* 2hxz_A* 2op3_A* 2frq_A* 2fra_A* 2fq9_A* 2ft2_A*
2fud_A* 2g7y_A* 1ms6_A* 2r9m_A* 2r9n_A* 2r9o_A* 3n3g_A*
3n4c_A* 3mpe_A* 1nqc_A* ...
Length = 218
Score = 144 bits (365), Expect = 4e-42
Identities = 59/184 (32%), Positives = 86/184 (46%), Gaps = 25/184 (13%)
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN- 196
+P DWRE G + +V+ Q +CGACWAFS V E+ LK G L LS Q ++DC+
Sbjct: 2 LPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEK 61
Query: 197 -GNMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
GN GC+GG +M ++ ++ ++ YP D C+ +
Sbjct: 62 YGNKGCNGG-------FMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYR-AATCSK 113
Query: 250 YTCDTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
Y T +P E + +A GPV V+A ++ Y GV Y N+NH V
Sbjct: 114 Y---TELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGV--YYEPSCTQNVNHGVL 168
Query: 306 IVGY 309
+VGY
Sbjct: 169 VVGY 172
>2fo5_A Cysteine proteinase EP-B 2; EP-B2, EPB2, EPB, cysteine
endoprotease, endopeptidase, LEUP hydrolase; HET: AR7;
2.20A {Hordeum vulgare}
Length = 262
Score = 145 bits (368), Expect = 4e-42
Identities = 62/186 (33%), Positives = 86/186 (46%), Gaps = 25/186 (13%)
Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
+ +P DWR+ G + V++Q CG+CWAFSTV + E ++A++ G+L LS QE+IDC
Sbjct: 2 SDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT 61
Query: 196 NGNMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAAC---KRKATSPNGVK 246
N GC GG MD N L E+ YP C + SP V
Sbjct: 62 ADNDGCQGG-------LMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVH 114
Query: 247 IKSYTCDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
I + +P+ S L + PV AV A + +Y GV C L +H
Sbjct: 115 IDGH---QDVPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTEL---DHG 168
Query: 304 VQIVGY 309
V +VGY
Sbjct: 169 VAVVGY 174
>3f75_A Toxopain-2, cathepsin L protease; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 224
Score = 144 bits (365), Expect = 5e-42
Identities = 60/185 (32%), Positives = 89/185 (48%), Gaps = 24/185 (12%)
Query: 135 PTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA 194
P+ +P DWR G + V++Q+ CG+CWAFST E H K G L LS QE++DC+
Sbjct: 4 PSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCS 63
Query: 195 GN-GNMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
GN CSGG M+ ++ + E YP L +D C+ ++ VKI
Sbjct: 64 RAEGNQSCSGG-------EMNDAFQYVLDSGGICSEDAYPYLARDEECRAQSCEK-VVKI 115
Query: 248 KSYTCDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAV 304
+ +P S + + PV A+ A + +Q+Y GV +C L +H V
Sbjct: 116 LGF---KDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDASCGTDL---DHGV 169
Query: 305 QIVGY 309
+VGY
Sbjct: 170 LLVGY 174
>3kwz_A Cathepsin K; enzyme inhibitor, covalent reversible inhibitor,
disease mutation, disulfide bond, glycoprotein,
hydrolase, lysosome, protease; HET: KWZ; 1.49A {Homo
sapiens} PDB: 1au0_A* 1au2_A* 1au3_A* 1au4_A* 1ayu_A*
1ayv_A* 1ayw_A* 1bgo_A* 1atk_A* 1nl6_A* 1nlj_A* 1q6k_A*
1mem_A* 1yk7_A* 1yk8_A* 1yt7_A* 2ato_A* 2aux_A* 2auz_A*
2bdl_A* ...
Length = 215
Score = 143 bits (363), Expect = 6e-42
Identities = 59/181 (32%), Positives = 86/181 (47%), Gaps = 23/181 (12%)
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
P D+R+ G + V+NQ CG+CWAFS+V E K G L LS Q ++DC N
Sbjct: 2 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVS-EN 60
Query: 199 MGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTC 252
GC GG +M ++ E YP + ++ +C T K + Y
Sbjct: 61 DGCGGG-------YMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGK-AAKCRGYRE 112
Query: 253 DTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
IP E ++ +A GPV A++A ++Q+Y GV Y+ + N+NHAV VG
Sbjct: 113 ---IPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVY-YDESCNSDNLNHAVLAVG 168
Query: 309 Y 309
Y
Sbjct: 169 Y 169
>2b1m_A SPE31; papain-like, sugar binding protein; HET: NAG FUC PG4; 2.00A
{Pachyrhizus erosus} PDB: 2b1n_A*
Length = 246
Score = 144 bits (365), Expect = 9e-42
Identities = 61/184 (33%), Positives = 83/184 (45%), Gaps = 22/184 (11%)
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
P DW + G+I KV+ Q CG+ WAFS E+ HA+ G L LS QE+IDC
Sbjct: 2 APESWDWSKKGVITKVKFQGQCGSGWAFSATGAIEAAHAIATGNLVSLSEQELIDCVD-E 60
Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
+ GC G W V + E++YP +D CK + V I +Y
Sbjct: 61 SEGCYNG-------WHYQSFEWVVKHGGIASEADYPYKARDGKCKANE-IQDKVTIDNYG 112
Query: 252 CDTL----IPSES-SILTDIATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQ 305
L SE+ S L P+ +++A + +Y GG+ NC S INH V
Sbjct: 113 VQILSNESTESEAESSLQSFVLEQPISVSIDAKDFHFYSGGIYDGGNC-SSPYGINHFVL 171
Query: 306 IVGY 309
IVGY
Sbjct: 172 IVGY 175
>1cqd_A Protein (protease II); cysteine protease, glycoprotein, proline
specificity, carboh papain family, hydrolase; HET: NAG
FUL FUC; 2.10A {Zingiber officinale} SCOP: d.3.1.1
Length = 221
Score = 142 bits (360), Expect = 2e-41
Identities = 64/182 (35%), Positives = 90/182 (49%), Gaps = 26/182 (14%)
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
+P DWRE G + V+NQ CG+CWAFSTV E ++ + G L LS Q+++DC
Sbjct: 3 LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT-A 61
Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
N GC GG WM+ VN + E YP +D C +P V I SY
Sbjct: 62 NHGCRGG-------WMNPAFQFIVNNGGINSEETYPYRGQDGICNSTVNAP-VVSIDSY- 112
Query: 252 CDTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
+PS E S+ +A + PV ++A +Q Y G+ +C+ S NHA+ +V
Sbjct: 113 --ENVPSHNEQSLQKAVA-NQPVSVTMDAAGRDFQLYRSGIFTGSCNISA---NHALTVV 166
Query: 308 GY 309
GY
Sbjct: 167 GY 168
>2wbf_X Serine-repeat antigen protein; SERA, malaria, vacuole, protease,
cathepsin, hydrolase, glycoprotein, thiol protease; HET:
DMS; 1.60A {Plasmodium falciparum} PDB: 3ch3_X 3ch2_X
Length = 265
Score = 143 bits (362), Expect = 4e-41
Identities = 44/209 (21%), Positives = 66/209 (31%), Gaps = 44/209 (21%)
Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMG 200
KD +V +Q C W F++ E++ +K + +S V +C
Sbjct: 14 KDENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANCYKGEHKDR 73
Query: 201 CSGGDFCALLDWMD-------VNKVVLEPESEYP------------------LLLKDAAC 235
C G + L ES YP L +
Sbjct: 74 CDEG-------SSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKI 126
Query: 236 KRKATSPNGVKIKSYT-------CDTLIPSESSILTDIATHGPVIAAVNALTW--QYYLG 286
PN + K YT D + I T++ G VIA + A + G
Sbjct: 127 LHNKNEPNSLDGKGYTAYESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEFSG 186
Query: 287 GVIQYNCDGSLANINHAVQIVGYDNYSRT 315
++ C +HAV IVGY NY +
Sbjct: 187 KKVKNLCGDD--TADHAVNIVGYGNYVNS 213
>1iwd_A Ervatamin B; cysteine protease, alpha-beta protein, catalytic DYAD,
L-DOM domain., hydrolase; 1.63A {Tabernaemontana
divaricata} SCOP: d.3.1.1
Length = 215
Score = 139 bits (354), Expect = 1e-40
Identities = 55/182 (30%), Positives = 85/182 (46%), Gaps = 27/182 (14%)
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
+P DWR G + ++NQ+ CG+CWAFS V ES++ ++ G L LS QE++DC
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDT-A 59
Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
+ GC+GG WM+ + ++ + YP +CK V I +
Sbjct: 60 SHGCNGG-------WMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYRL--RVVSINGF- 109
Query: 252 CDTLIPS--ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
+ ES++ + +A PV V A +Q+Y G+ C + NH V IV
Sbjct: 110 --QRVTRNNESALQSAVA-SQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQ---NHGVVIV 163
Query: 308 GY 309
GY
Sbjct: 164 GY 165
>3bwk_A Cysteine protease falcipain-3; malaria, hydrolase; HET: C1P; 2.42A
{Plasmodium falciparum} PDB: 3bpm_A*
Length = 243
Score = 140 bits (356), Expect = 2e-40
Identities = 54/182 (29%), Positives = 84/182 (46%), Gaps = 24/182 (13%)
Query: 136 TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG 195
+ DWR G + V++Q CG+CWAFS+V + ES +A++ L L S QE++DC+
Sbjct: 18 KLDRIAYDWRLHGGVTPVKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQELVDCSV 77
Query: 196 NGNMGCSGGDFCALLDWMD------VNKVVLEPESEYP-LLLKDAACKRKATSPNGVKIK 248
N GC GG ++ ++ L + +YP + C K + IK
Sbjct: 78 -KNNGCYGG-------YITNAFDDMIDLGGLCSQDDYPYVSNLPETCNLKRCNE-RYTIK 128
Query: 249 SYTCDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
SY IP + + GP+ ++ A + +Y GG C + NHAV +V
Sbjct: 129 SY---VSIP-DDKFKEALRYLGPISISIAASDDFAFYRGGFYDGECGAAP---NHAVILV 181
Query: 308 GY 309
GY
Sbjct: 182 GY 183
>3f5v_A DER P 1 allergen; allergy, asthma, DUST mites, glycoprotein,
hydrola protease, secreted, thiol protease; HET: P6G;
1.36A {Dermatophagoides pteronyssinus} PDB: 2as8_A
3rvw_A* 3rvx_A 3rvv_A* 3d6s_A*
Length = 222
Score = 139 bits (352), Expect = 3e-40
Identities = 49/190 (25%), Positives = 76/190 (40%), Gaps = 22/190 (11%)
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
T +I P + D R+ + +R Q CG+ WAFS V ES + L+ Q
Sbjct: 1 TNACSINGNAPAEIDLRQMRTVTPIRMQGGCGSAWAFSGVAATESAYLAYRQQSLDLAEQ 60
Query: 189 EVIDCAGNGNMGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKATSPN 243
E++DCA GC G + + + ES Y + ++ +C+R
Sbjct: 61 ELVDCA--SQHGCHGD-------TIPRGIEYIQHNGVVQESYYRYVAREQSCRRPNA--Q 109
Query: 244 GVKIKSYTCDTLIPSESSILTDIA-THGPV---IAAVNALTWQYYLGGVIQYNCDGSLAN 299
I +Y P+ + I +A TH + I + +++Y G I D
Sbjct: 110 RFGISNYC-QIYPPNANKIREALAQTHSAIAVIIGIKDLDAFRHYDGRTI-IQRDNGYQP 167
Query: 300 INHAVQIVGY 309
HAV IVGY
Sbjct: 168 NYHAVNIVGY 177
>2oul_A Falcipain 2; cysteine protease, inhibitor, macromolecular
interaction, HY hydrolase inhibitor complex; 2.20A
{Plasmodium falciparum} SCOP: d.3.1.1 PDB: 2ghu_A 1yvb_A
3bpf_A* 3pnr_A
Length = 241
Score = 139 bits (352), Expect = 6e-40
Identities = 47/179 (26%), Positives = 82/179 (45%), Gaps = 22/179 (12%)
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
DWR + V++Q+ CG+CWAFS++ + ES +A++ L LS QE++DC+
Sbjct: 18 DHAAYDWRLHSGVTPVKDQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQELVDCS-FK 76
Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
N GC+GG ++ + + P+ +YP + IK+Y
Sbjct: 77 NYGCNGG-------LINNAFEDMIELGGICPDGDYPYVSDAPNLCNIDRCTEKYGIKNY- 128
Query: 252 CDTLIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+P ++ + + GP+ +V + +Y G+ C L NHAV +VG+
Sbjct: 129 --LSVP-DNKLKEALRFLGPISISVAVSDDFAFYKEGIFDGECGDQL---NHAVMLVGF 181
>3hhi_A Cathepsin B-like cysteine protease; occluding loop, hydrolase, THIO
protease; HET: 074; 1.60A {Trypanosoma brucei} PDB:
3mor_A*
Length = 325
Score = 139 bits (352), Expect = 4e-39
Identities = 42/270 (15%), Positives = 84/270 (31%), Gaps = 40/270 (14%)
Query: 65 DIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVK 124
++ +N+ + A+Y +++ E K + N + + +
Sbjct: 13 AFVDRVNRLNRGIWKAKYD-GVMQNITLREAKRLNGVIKKNNNASILPKRRFTEEEARAP 71
Query: 125 KRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT-LS 183
+P+ + W I ++ +Q CG+CWA + G
Sbjct: 72 ---------LPSSFDSAEAWPNCPTIPQIADQSACGSCWAVAAASAMSDRFCTMGGVQDV 122
Query: 184 LLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYP---------------- 227
+S +++ C + GC+GGD + +V + P
Sbjct: 123 HISAGDLLACCSDCGDGCNGGDPDRAWAYFSSTGLV--SDYCQPYPFPHCSHHSKSKNGY 180
Query: 228 -----LLLKDAACKRKATSPNGVKIKSYTCDTLIP--SESSILTDIATHGPVIAAVNALT 280
C P + + +Y T E + ++ GP A +
Sbjct: 181 PPCSQFNFDTPKCDYTCDDPT-IPVVNYRSWTSYALQGEDDYMRELFFRGPFEVAFDVYE 239
Query: 281 -WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ Y GV + L HAV++VG+
Sbjct: 240 DFIAYNSGVYHHVSGQYL--GGHAVRLVGW 267
>1o0e_A Ervatamin C; plant cysteine protease, two domain, stable at PH
2-12, HYDR; 1.90A {Tabernaemontana divaricata} SCOP:
d.3.1.1 PDB: 2pns_A* 2pre_A* 3bcn_A*
Length = 208
Score = 130 bits (330), Expect = 4e-37
Identities = 56/181 (30%), Positives = 83/181 (45%), Gaps = 26/181 (14%)
Query: 138 IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNG 197
+P + DWR+ G + V+NQ +CG+CWAFSTV T ES++ ++ G L LS QE++DC
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDK-K 59
Query: 198 NMGCSGGDFCALLDWMD------VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYT 251
N GC GG +N ++ ++ YP C+ + V I Y
Sbjct: 60 NHGCLGG-------AFVFAYQYIINNGGIDTQANYPYKAVQGPCQ---AASKVVSIDGY- 108
Query: 252 CDTLIPSES-SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
+P + L P A++A +Q Y G+ C +NH V IVG
Sbjct: 109 --NGVPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCG---TKLNHGVTIVG 163
Query: 309 Y 309
Y
Sbjct: 164 Y 164
>3pbh_A Procathepsin B; thiol protease, cysteine protease, proenzyme,
papain; 2.50A {Homo sapiens} SCOP: d.3.1.1 PDB: 2pbh_A
1pbh_A 1mir_A
Length = 317
Score = 124 bits (313), Expect = 1e-33
Identities = 53/290 (18%), Positives = 100/290 (34%), Gaps = 62/290 (21%)
Query: 56 RFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKH 115
F +++ +NK + ++ F ++ K R +
Sbjct: 6 SFHPLSD--ELVNYVNKRNTTWQAGHN----FYNVDMSYLK-RLCGTFLGGPKPPQRVMF 58
Query: 116 HDHHHNHVKKRSITTGITIPTGIPVKKDWREA----GIIGKVRNQQTCGACWAFSTVETA 171
+ +P D RE I ++R+Q +CG+CWAF VE
Sbjct: 59 TE-----------------DLKLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAI 101
Query: 172 ESMHALK-NGTLS-LLSVQEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLE------- 221
+ N +S +S ++++ C G+ GC+GG ++ +V
Sbjct: 102 SDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 161
Query: 222 -------PESEYPLLLKDAACKRKATSPN--------------GVKIKSYTCDTLIPSES 260
P E+ + C + +P K Y ++ SE
Sbjct: 162 GCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEK 221
Query: 261 SILTDIATHGPVIAAVNALT-WQYYLGGVIQYNCDGSLANINHAVQIVGY 309
I+ +I +GPV A + + + Y GV Q+ + HA++I+G+
Sbjct: 222 DIMAEIYKNGPVEGAFSVYSDFLLYKSGVYQHVTGEMM--GGHAIRILGW 269
>3u8e_A Papain-like cysteine protease; papain-like cysteine peptidase,
peptidase_C1A, hydrolase, in form; 1.31A {Crocus
sativus}
Length = 222
Score = 118 bits (298), Expect = 2e-32
Identities = 53/175 (30%), Positives = 76/175 (43%), Gaps = 11/175 (6%)
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
P DWR+ G + V++Q CG CWAF E + A+ G L +S Q+++DC
Sbjct: 2 PASIDWRKKGAVTSVKDQGACGMCWAFGATGAIEGIDAITTGRLISVSEQQIVDCD-TXX 60
Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS 258
GGD W+ N + ++ YP D C P +I Y T +P+
Sbjct: 61 XXXXGGDADDAFRWVITNG-GIASDANYPYTGVDGTCDLN--KPIAARIDGY---TNVPN 114
Query: 259 ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQY--NCDGSLANINHAVQIVGY 309
SS L D PV + ++Q Y G I +C A ++H V IVGY
Sbjct: 115 SSSALLDAVAKQPVSVNIYTSSTSFQLYTGPGIFAGSSCSDDPATVDHTVLIVGY 169
>3qsd_A Cathepsin B-like peptidase (C01 family); cysteine peptidase,
digestive tract, hydrolase-hydrolase INH complex; HET:
074; 1.30A {Schistosoma mansoni} PDB: 3s3q_A* 3s3r_A*
Length = 254
Score = 117 bits (296), Expect = 1e-31
Identities = 45/208 (21%), Positives = 74/208 (35%), Gaps = 38/208 (18%)
Query: 138 IPVKKDWREA----GIIGKVRNQQTCGACWAFSTVETAESMHALKNG--TLSLLSVQEVI 191
IP D R+ I +R+Q CG+CWAF VE +++G LS +++
Sbjct: 3 IPSSFDSRKKWPRCKSIATIRDQSRCGSCWAFGAVEAMSDRSCIQSGGKQNVELSAVDLL 62
Query: 192 DCAGNGNMGCSGGDFCALLDWMDVNKVVLE-------PESEYPLLLKDAACKRKATSPNG 244
C + +GC GG D+ +V YP + K K
Sbjct: 63 SCCESCGLGCEGGILGPAWDYWVKEGIVTGSSKENHAGCEPYPFPKCEHHTKGKYPPCGS 122
Query: 245 VKIKSYTCDT----------------------LIPSESSILTDIATHGPVIAAVNALT-W 281
K+ C + E +I +I +GPV A +
Sbjct: 123 KIYKTPRCKQTCQKKYKTPYTQDKHRGKSSYNVKNDEKAIQKEIMKYGPVEAGFTVYEDF 182
Query: 282 QYYLGGVIQYNCDGSLANINHAVQIVGY 309
Y G+ ++ +L HA++I+G+
Sbjct: 183 LNYKSGIYKHITGETL--GGHAIRIIGW 208
>3ois_A Cysteine protease; alpha and beta, hydrolase; HET: UDP; 1.65A
{Xylella fastidiosa}
Length = 291
Score = 118 bits (298), Expect = 1e-31
Identities = 32/202 (15%), Positives = 55/202 (27%), Gaps = 30/202 (14%)
Query: 133 TIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVID 192
++ +P K D +V +Q G+C A + + + + + I
Sbjct: 52 SVIAALPPKVDLTPPF---QVYDQGRIGSCTANALAAAIQFERIHDKQSPEFIPSRLFIY 108
Query: 193 CA----GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP------ 242
SG + + V PE E+P A + + P
Sbjct: 109 YNERKIEGHVNYDSGAMIRDGIKVLHKLGVC--PEKEWPYGDTPADPRTEEFPPGAPASK 166
Query: 243 ----------NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-WQYYLGGV--I 289
KI Y+ + + +A P + + W I
Sbjct: 167 KPSDQCYKDAQNYKITEYS--RVAQDIDHLKACLAVGSPFVFGFSVYNSWVGNNSLPVRI 224
Query: 290 QYNCDGSLANINHAVQIVGYDN 311
HAV VGYD+
Sbjct: 225 PLPTKNDTLEGGHAVLCVGYDD 246
>3cbj_A Cathepsin B; cathepsin B, occluding loop, chagas disease, glyco
hydrolase, lysosome, protease, thiol protease, zymogen,
CYT vesicle; 1.80A {Homo sapiens} PDB: 3cbk_A 1gmy_A*
3ai8_B* 3k9m_A 1the_A* 1cpj_A* 1cte_A 2dcc_A* 2dc6_A*
1ito_A* 2dc8_A* 2dc9_A* 2dca_A* 2dcb_A* 2dc7_A* 2dcd_A*
1qdq_A* 1csb_B* 1huc_B 2ipp_B ...
Length = 266
Score = 117 bits (294), Expect = 4e-31
Identities = 44/208 (21%), Positives = 77/208 (37%), Gaps = 38/208 (18%)
Query: 138 IPVKKDWREA----GIIGKVRNQQTCGACWAFSTVETAESMHALK-NGTLS-LLSVQEVI 191
+P D RE I ++R+Q +CG+ WAF VE + N +S +S ++++
Sbjct: 7 LPASFDAREQWPQCPTIKEIRDQGSCGSAWAFGAVEAISDRICIHTNAHVSVEVSAEDLL 66
Query: 192 DCAGN-GNMGCSGGDFCALLDWMDVNKVVLE-----PESEYPLLLKDAACKRKATSP--- 242
C G+ GC+GG ++ +V P + P
Sbjct: 67 TCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEAHVNGARPPCT 126
Query: 243 --------------------NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALT-W 281
K Y ++ SE I+ +I +GPV A + + +
Sbjct: 127 GEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYSDF 186
Query: 282 QYYLGGVIQYNCDGSLANINHAVQIVGY 309
Y GV Q+ + HA++I+G+
Sbjct: 187 LLYKSGVYQHVTGEMM--GGHAIRILGW 212
>1deu_A Procathepsin X; cysteine protease, proregion, prosegment, HY; 1.70A
{Homo sapiens} SCOP: d.3.1.1 PDB: 1ef7_A
Length = 277
Score = 115 bits (289), Expect = 2e-30
Identities = 46/201 (22%), Positives = 69/201 (34%), Gaps = 37/201 (18%)
Query: 135 PTGIPVKKDWREAG---IIGKVRNQ---QTCGACWAFSTVETAESMHALKNG---TLSLL 185
P +P DWR RNQ Q CG+CWA ++ +K +LL
Sbjct: 33 PADLPKSWDWRNVDGVNYASITRNQHIPQYCGSCWAHASTSAMADRINIKRKGAWPSTLL 92
Query: 186 SVQEVIDCAGNGNMGCSGGDFCALLDWMD-----VNKVVLEPESEYPLLLKDAACKRKAT 240
SVQ VIDC G C GG ++ + E+ KD C +
Sbjct: 93 SVQNVIDCGNAG--SCEGG-------NDLSVWDYAHQHGIPDETCNNYQAKDQECDKFNQ 143
Query: 241 SPNGVKIKSYTCDT-----------LIPSESSILTDIATHGPVIAAVNA-LTWQYYLGGV 288
+ K + ++ +I +GP+ + A Y GG+
Sbjct: 144 CGTCNEFKECHAIRNYTLWRVGDYGSLSGREKMMAEIYANGPISCGIMATERLANYTGGI 203
Query: 289 IQYNCDGSLANINHAVQIVGY 309
D + INH V + G+
Sbjct: 204 YAEYQDTTY--INHVVSVAGW 222
>3f75_P Toxopain-2, cathepsin L propeptide; medical structural genomics of
pathogenic protozoa, MSGPP, C protease, parasite,
protozoa, hydrolase; 1.99A {Toxoplasma gondii}
Length = 106
Score = 67.5 bits (165), Expect = 2e-14
Identities = 23/74 (31%), Positives = 38/74 (51%), Gaps = 4/74 (5%)
Query: 35 ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
+ FSSFQ Y KSY ++ E R+ F+ +L I N+ S + F DLS +
Sbjct: 23 DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYS---YSLKMNHFGDLSRD 79
Query: 94 EFKTRHLRHSVNKH 107
EF+ ++L +++
Sbjct: 80 EFRRKYLGFKKSRN 93
>2l95_A Crammer, LP06209P; cysteine proteinase inhibitor, intrinsic
disorder P like protein, hydrolase; NMR {Drosophila
melanogaster}
Length = 80
Score = 65.8 bits (161), Expect = 4e-14
Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 1/67 (1%)
Query: 35 ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN-KNRQSPESARYGITEFSDLSEE 93
E + ++ ++ K+Y E +R + + +S IEE N K + + + GI +DL+ E
Sbjct: 8 EEWVEYKSKFDKNYEAEEDLMRRRIYAESKARIEEHNRKFEKGEVTWKMGINHLADLTPE 67
Query: 94 EFKTRHL 100
EF R
Sbjct: 68 EFAQRSG 74
>2pff_B Fatty acid synthase subunit beta; fatty acid synthase,
acyl-carrier-protein, beta-ketoacyl RED beta-ketoacyl
synthase, dehydratase; 4.00A {Saccharomyces cerevisiae}
Length = 2006
Score = 52.7 bits (126), Expect = 8e-08
Identities = 51/347 (14%), Positives = 95/347 (27%), Gaps = 119/347 (34%)
Query: 24 KVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPE----- 78
+ + +L + ++ D K F + L+I+E L +P+
Sbjct: 178 QTYHVLVG---DLIKFSAETLS-ELIRTTLDAE-KVFTQGLNILEWLENPSNTPDKDYLL 232
Query: 79 SARY-----GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK-RSITT-- 130
S G+ + + H +++ + T
Sbjct: 233 SIPISCPLIGVIQLA------------------HYVVTAKLLGFTPGELRSYLKGATGHS 274
Query: 131 -GITIPTGIPVKKDWREAG-----------IIGKVRNQQTCGACWAFSTVETAESMH--A 176
G+ I W IG VR + A+ S+ +
Sbjct: 275 QGLVTAVAIAETDSWESFFVSVRKAITVLFFIG-VRCYE------AYPNTSLPPSILEDS 327
Query: 177 LKNGT-----------LSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVN---KVVL-- 220
L+N L+ VQ+ ++ N ++ +L VN +V+
Sbjct: 328 LENNEGVPSPMLSISNLTQEQVQDYVN-KTNSHLPAGKQVEISL-----VNGAKNLVVSG 381
Query: 221 EPESEYPL---LLKDAA-----------CKRKA---------TSPNGVKIKSYTCDTLIP 257
P+S Y L L K A +RK SP + L+P
Sbjct: 382 PPQSLYGLNLTLRKAKAPSGLDQSRIPFSERKLKFSNRFLPVASP-------FHSHLLVP 434
Query: 258 SESSILTDIATHG----------PVIAAVNALTWQYYLGGVIQYNCD 294
+ I D+ + PV + + G + + D
Sbjct: 435 ASDLINKDLVKNNVSFNAKDIQIPVYDTFDGSDLRVLSGSISERIVD 481
Score = 38.5 bits (89), Expect = 0.003
Identities = 53/345 (15%), Positives = 93/345 (26%), Gaps = 125/345 (36%)
Query: 22 PVKVSKPNLEQKL----ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSP 77
P+ +S +LE L F Q ++ F K L E P
Sbjct: 8 PLTLSHGSLEHVLLVPTASFFIASQLQEQ-------------FNKILPEPTEGFAADDEP 54
Query: 78 ES-----ARY-----------GITEFSDLSE---EEFKTRHLR----HSVNKHVLMSHHK 114
+ ++ + +F + EF+ +L H++ +L +
Sbjct: 55 TTPAELVGKFLGYVSSLVEPSKVGQFDQVLNLCLTEFENCYLEGNDIHALAAKLLQENDT 114
Query: 115 HHDHHHNHVK---KRSITTGITIPTGIP------VKKDWREAGII----GKVRNQQTCGA 161
+K I V + A ++ G Q
Sbjct: 115 TLVKTKELIKNYITARIMAKRPFDKKSNSALFRAVGEG--NAQLVAIFGG----QGNTDD 168
Query: 162 CWA-----FST----VETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
+ + T V A TLS L + +D G + +L+W
Sbjct: 169 YFEELRDLYQTYHVLVGDLIKFSA---ETLSELI-RTTLDAEKVFTQGLN------ILEW 218
Query: 213 MDVNKVVLEPESEY--------PLL-LKDAACKRKATSPNGVKIKSY--TCDTL--IPSE 259
++ P+ +Y PL+ + A Y T L P E
Sbjct: 219 LE--NPSNTPDKDYLLSIPISCPLIGVIQLAH--------------YVVTAKLLGFTPGE 262
Query: 260 -SSILTDIATHGP-VIAAV----------------NALTWQYYLG 286
S L H ++ AV A+T +++G
Sbjct: 263 LRSYLKGATGHSQGLVTAVAIAETDSWESFFVSVRKAITVLFFIG 307
Score = 38.5 bits (89), Expect = 0.003
Identities = 51/310 (16%), Positives = 98/310 (31%), Gaps = 105/310 (33%)
Query: 34 LELFSSFQ----------QRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
++L+ + + +K +Y S DI N +L I K ++ E Y
Sbjct: 1633 MDLYKTSKAAQDVWNRADNHFKDTYGFSILDI-VINNPVNLTIHFGGEKGKRIRE--NYS 1689
Query: 84 ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKD 143
F + + + KT + +N+H +T T
Sbjct: 1690 AMIFETIVDGKLKTEKIFKEINEH---------------------STSYT---------- 1718
Query: 144 WR-EAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCS 202
+R E G++ + Q A + +E A + LK+ L + AG+ S
Sbjct: 1719 FRSEKGLLSATQFTQP-----ALTLMEKA-AFEDLKSKGL----IPADATFAGH-----S 1763
Query: 203 GGDFCAL------LDWMDVNKVV---------LEPESEYPLLLKDAACKRKATSPNGVKI 247
G++ AL + + +VV P E A +P V
Sbjct: 1764 LGEYAALASLADVMSIESLVEVVFYRGMTMQVAVPRDELGRSNYGMI----AINPGRV-A 1818
Query: 248 KSYTCDTLIPSESSILTDIATH-GPVIAAVNALTWQYYLGGVIQYNCD-------GSLAN 299
S++ + L ++ + G ++ VN YN + G L
Sbjct: 1819 ASFSQEAL----QYVVERVGKRTGWLVEIVN-------------YNVENQQYVAAGDLRA 1861
Query: 300 INHAVQIVGY 309
++ ++ +
Sbjct: 1862 LDTVTNVLNF 1871
>3pw3_A Aminopeptidase C; bleomycin, cysteine proteinase fold, structural
genomics, JO center for structural genomics, JCSG; HET:
MSE; 2.23A {Parabacteroides distasonis}
Length = 383
Score = 42.1 bits (98), Expect = 1e-04
Identities = 21/89 (23%), Positives = 27/89 (30%), Gaps = 13/89 (14%)
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-----------AGNGN 198
I V+NQ G CW +S+ ES LS +
Sbjct: 22 ITSVKNQNRAGTCWCYSSYSFLESELLRMGKGEYDLSEMFTVYNTYLDRADAAVRTHGDV 81
Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYP 227
GG F L M+ +V PE E
Sbjct: 82 SFSQGGSFYDALYGMETFGLV--PEEEMR 108
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis,
programmed cell death; HET: DTP; 6.90A {Drosophila
melanogaster} PDB: 3iz8_A*
Length = 1221
Score = 39.1 bits (90), Expect = 0.002
Identities = 53/346 (15%), Positives = 96/346 (27%), Gaps = 106/346 (30%)
Query: 26 SKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGIT 85
SK + L LF + + ++ K ++ N++ + I+ + S + Y I
Sbjct: 57 SKDAVSGTLRLFWTLLSKQEEMVQKFVEEVLRINYKFLMSPIKTEQRQP-SMMTRMY-IE 114
Query: 86 EFSDL--SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKD 143
+ L + F ++ S + + + + + I G+
Sbjct: 115 QRDRLYNDNQVFAKYNV----------SRLQPYLKLRQALLELRPAKNVLI-DGV----- 158
Query: 144 WREAGIIGK------------VRNQQTCGACWA-FSTVETAESMHALKNGTLSLLSVQEV 190
G GK V+ + W + E++ L+ L L Q
Sbjct: 159 ---LG-SGKTWVALDVCLSYKVQCKMDFKIFWLNLKNCNSPETV--LEM--LQKLLYQ-- 208
Query: 191 IDCAGNGNMGCSGGDFCA----LLDWMDVNKVVLEPESEYP--LL----------LK--D 232
ID S D + + + L Y LL +
Sbjct: 209 IDP-----NWTSRSDHSSNIKLRIHSIQAELRRLLKSKPYENCLLVLLNVQNAKAWNAFN 263
Query: 233 AACK-----RKATSPNGVKIKSYT-------CDTLIPSES-SILTDIA------------ 267
+CK R + + + T TL P E S+L
Sbjct: 264 LSCKILLTTRFKQVTDFLSAATTTHISLDHHSMTLTPDEVKSLLLKYLDCRPQDLPREVL 323
Query: 268 THGP----VIAAV---NALTWQYYLGGVIQYNCDGSLANINHAVQI 306
T P +IA TW + NCD + ++
Sbjct: 324 TTNPRRLSIIAESIRDGLATWDNWK----HVNCD----KLTTIIES 361
Score = 29.8 bits (66), Expect = 1.3
Identities = 16/125 (12%), Positives = 37/125 (29%), Gaps = 32/125 (25%)
Query: 21 IPVKV--------SKPNLEQKLELF--SSFQQRYKKSYSKSEHDIRFKNFEKSLD----- 65
IP + K ++ + S ++ K + S I + K +
Sbjct: 387 IPTILLSLIWFDVIKSDVMVVVNKLHKYSLVEKQPKESTISIPSIYLELKVKLENEYALH 446
Query: 66 --IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHV 123
I++ N + + + +++ H+ H H K+ +H
Sbjct: 447 RSIVDHYNIPK------TFDSDDLIPPYLDQYFYSHIGH---------HLKNIEHPERMT 491
Query: 124 KKRSI 128
R +
Sbjct: 492 LFRMV 496
>3t8b_A 1,4-dihydroxy-2-naphthoyl-COA synthase; crotonase superfamily,
lyase; 1.65A {Mycobacterium tuberculosis} PDB: 3t8a_A
1rjm_A* 1rjn_A* 1q52_A 1q51_A
Length = 334
Score = 31.7 bits (72), Expect = 0.25
Identities = 11/36 (30%), Positives = 16/36 (44%), Gaps = 6/36 (16%)
Query: 271 PVIAAVNALTWQYYLGG--VIQYNCDGSLANINHAV 304
VI VN + GG + CD +LA+ +A
Sbjct: 169 VVICLVNG----WAAGGGHSLHVVCDLTLASREYAR 200
>2bec_A Calcineurin B homologous protein 2; calcineurin-homologous protein,
calcium-binding protein, NHE1 regulating protein; 2.70A
{Homo sapiens}
Length = 202
Score = 30.9 bits (70), Expect = 0.40
Identities = 10/31 (32%), Positives = 15/31 (48%)
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+S EF + V + + + KHH HHH
Sbjct: 172 VSFVEFTKSLEKMDVEQKMSIRILKHHHHHH 202
>2cb5_A Protein (bleomycin hydrolase); aminopeptidase, cysteine protease,
SELF- compartmentalizing, cylinase; 1.85A {Homo sapiens}
SCOP: d.3.1.1 PDB: 1cb5_A
Length = 453
Score = 30.5 bits (68), Expect = 0.76
Identities = 7/31 (22%), Positives = 11/31 (35%)
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+ NQ++ G W FS + K
Sbjct: 60 GKPITNQKSSGRSWIFSCLNVMRLPFMKKLN 90
>1q0p_A Complement factor B; VON willebrand factor, MAC-1, I domain, A
domain, hydrolase; 1.80A {Homo sapiens} SCOP: c.62.1.1
Length = 223
Score = 29.9 bits (67), Expect = 0.96
Identities = 11/56 (19%), Positives = 17/56 (30%), Gaps = 1/56 (1%)
Query: 55 IRFKNFEKSLDIIEEL-NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVL 109
I NF + + L K RYG+ ++ + K S V
Sbjct: 28 IGASNFTGAKKSLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEADSSNADWVT 83
>3cio_A ETK, tyrosine-protein kinase ETK; WZC, escherichia coli tyrosine
kinase domain, signaling protein, transferase, inner
membrane, membrane; 2.50A {Escherichia coli}
Length = 299
Score = 29.6 bits (67), Expect = 1.3
Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 8/39 (20%)
Query: 110 MSHHKHHDHHHNH---VKKRSITTGITIP-----TGIPV 140
M HH HH HHH+ ++ R I +G+ P GI V
Sbjct: 1 MGHHHHHHHHHHSSGHIEGRHIGSGVEAPEQLEEHGISV 39
>1pq4_A Periplasmic binding protein component of AN ABC T uptake
transporter; ZNUA, loop, metal-binding, metal binding
protein; 1.90A {Synechocystis SP} SCOP: c.92.2.2 PDB:
2ov3_A 2ov1_A
Length = 291
Score = 29.3 bits (66), Expect = 1.6
Identities = 15/68 (22%), Positives = 25/68 (36%), Gaps = 5/68 (7%)
Query: 59 NFEKSL-DIIEELNKNRQSPESARYGIT--EFSDLSEEEF-KTRHLRHSVNKHVLMSHHK 114
FE+ + ++ N N + +SA+ GIT E + H HS + H S +
Sbjct: 61 GFEQPWLEKLKAANANMKLIDSAQ-GITPLEMEKHDHSHGEEEGHDDHSHDGHDHGSESE 119
Query: 115 HHDHHHNH 122
Sbjct: 120 KEKAKGAL 127
>2prs_A High-affinity zinc uptake system protein ZNUA; protein consists of
two (beta/ALFA)4 domains, metal transport; 1.70A
{Escherichia coli} PDB: 2osv_A 2ps0_A 2ps3_A 2ps9_A
2ogw_A 2xy4_A* 2xqv_A* 2xh8_A
Length = 284
Score = 28.9 bits (65), Expect = 2.0
Identities = 9/66 (13%), Positives = 17/66 (25%), Gaps = 12/66 (18%)
Query: 58 KNFEKSLD-IIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
E + + +L +Q + + S H H +
Sbjct: 57 PEMEAFMQKPVSKLPGAKQVTIAQLEDVKPLLMKSIHGDDDDH-----------DHAEKS 105
Query: 117 DHHHNH 122
D H+H
Sbjct: 106 DEDHHH 111
>1s7o_A Hypothetical UPF0122 protein
SPY1201/SPYM3_0842/SPS1042/SPYM18_1152; putative DNA
binding protein, structural genomics; 2.31A
{Streptococcus pyogenes serotype M3} SCOP: a.4.13.3
Length = 113
Score = 27.5 bits (61), Expect = 2.3
Identities = 7/45 (15%), Positives = 18/45 (40%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKN 73
E KL ++S + R + H + ++ + I+ ++
Sbjct: 68 TYEMKLHMYSDYVVRSEIFDDMIAHYPHDEYLQEKISILTSIDNR 112
>2e01_A Cysteine proteinase 1; bleomycin hydrolase, thiol protease, C1
protease, hydrolase; 1.73A {Saccharomyces cerevisiae}
PDB: 2e02_A 2e03_A 2dzy_A 1a6r_A 2e00_A 2dzz_A 3gcb_A
1gcb_A
Length = 457
Score = 28.9 bits (64), Expect = 2.3
Identities = 7/33 (21%), Positives = 11/33 (33%)
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLS 183
V NQ++ G CW F+ +
Sbjct: 66 TPVTNQKSSGRCWLFAATNQLRLNVLSELNLKE 98
>3zqk_A VON willebrand factor; blood clotting, adamts-13, force sensor, VON
willebrand DISE domain, haemostasis; HET: NAG; 1.70A
{Homo sapiens} PDB: 3ppv_A 3ppx_A 3ppw_A 3ppy_A 3gxb_A*
Length = 199
Score = 28.4 bits (64), Expect = 2.6
Identities = 9/59 (15%), Positives = 25/59 (42%), Gaps = 4/59 (6%)
Query: 55 IRFKNFEKSLDIIEELNKN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
I +F +S + +EE+ + +S + ++S + E+ + +K ++
Sbjct: 34 IGEADFNRSKEFMEEVIQRMDVGQDSIHVTVLQYSYMVTVEY---PFSEAQSKGDILQR 89
>1mjn_A Integrin alpha-L; rossmann fold, immune system; 1.30A {Homo
sapiens} SCOP: c.62.1.1 PDB: 3hi6_A 1mq8_B* 3eoa_I
3eob_I 1rd4_A* 1lfa_A 1zon_A 1zoo_A 1zop_A 1dgq_A
1xdd_A* 1xdg_A* 1xuo_A* 3e2m_A* 3bqn_B* 1cqp_A* 3bqm_B*
2ica_A* 2o7n_A* 3m6f_A* ...
Length = 179
Score = 28.1 bits (63), Expect = 2.8
Identities = 10/41 (24%), Positives = 22/41 (53%), Gaps = 1/41 (2%)
Query: 55 IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
++ F+K LD ++++ + S S ++ +FS + EF
Sbjct: 15 LQPDEFQKILDFMKDV-MKKCSNTSYQFAAVQFSTSYKTEF 54
>3nb2_A Secreted effector protein; pentapeptide, HECT domain, HECT E
ubiquitin ligase, ligase; HET: MES; 2.10A {Escherichia
coli} PDB: 3naw_A* 3sqv_A
Length = 613
Score = 28.7 bits (64), Expect = 2.8
Identities = 10/82 (12%), Positives = 21/82 (25%)
Query: 35 ELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LFS + Y K+ L EL + ++ + ++
Sbjct: 417 NLFSESFPIFSIPYHKAFSQNFVSGILDILISDNELKERFIEALNSNKSDYKMIADDQQR 476
Query: 95 FKTRHLRHSVNKHVLMSHHKHH 116
++ L + H
Sbjct: 477 KLACVWNPFLDGWELNAQHVDM 498
>1wuf_A Hypothetical protein LIN2664; structural genomics, unknown
function, nysgxrc target T2186, superfamily, protein
structure initiative, PSI; 2.90A {Listeria innocua}
SCOP: c.1.11.2 d.54.1.1
Length = 393
Score = 28.4 bits (64), Expect = 2.9
Identities = 27/133 (20%), Positives = 45/133 (33%), Gaps = 34/133 (25%)
Query: 112 HHKHHDHHHNHVKKRS----ITTGITIPTGIPVKKDWREAGIIGKVRNQQTC-------- 159
HH HH HHH+ + R I +P+ ++ + G+++++
Sbjct: 2 HHHHHHHHHHGLVPRGSHMYFQKARLIHAELPLLAPFKTS--YGELKSKDFYIIELINEE 59
Query: 160 -----GACWAFS----TVETAESM-HALKNGTLSLL---------SVQEVIDCAGNGNMG 200
G AF T ET S +K L LL +QE+ M
Sbjct: 60 GIHGYGELEAFPLPDYTEETLSSAILIIKEQLLPLLAQRKIRKPEEIQELFSWIQGNEMA 119
Query: 201 CSGGDFCALLDWM 213
+ + A+ D
Sbjct: 120 KAAVE-LAVWDAF 131
>1svj_A Potassium-transporting ATPase B chain; alpha-beta sandwich,
hydrolase; NMR {Escherichia coli} SCOP: d.220.1.1 PDB:
1u7q_A 2a00_A* 2a29_A*
Length = 156
Score = 27.9 bits (62), Expect = 3.0
Identities = 9/26 (34%), Positives = 12/26 (46%), Gaps = 8/26 (30%)
Query: 110 MSHHKHHDHHHNHVKKRSITTG-ITI 134
M HH HH HHH+ ++G
Sbjct: 1 MGHHHHHHHHHH-------SSGHGGR 19
>3oka_C N-terminal His-affinity TAG; GT-B fold, alpha-mannosyltransferase,
GDP-MAN binding, trans; HET: GDD; 2.20A {Escherichia
coli}
Length = 26
Score = 25.5 bits (55), Expect = 3.1
Identities = 10/20 (50%), Positives = 13/20 (65%), Gaps = 3/20 (15%)
Query: 110 MSHHKHHDHHHN---HVKKR 126
M HH HH HHH+ H++ R
Sbjct: 1 MGHHHHHHHHHHSSGHIEGR 20
>3t89_A 1,4-dihydroxy-2-naphthoyl-COA synthase; crotonase superfamily,
lyase; 1.95A {Escherichia coli} PDB: 3t88_A 3h02_A
2iex_A
Length = 289
Score = 28.3 bits (64), Expect = 3.2
Identities = 11/30 (36%), Positives = 16/30 (53%), Gaps = 6/30 (20%)
Query: 271 PVIAAVNALTWQYYLGG--VIQYNCDGSLA 298
PV+A V Y +GG V+ CD ++A
Sbjct: 125 PVVAMVAG----YSIGGGHVLHMMCDLTIA 150
>2uzf_A Naphthoate synthase; lyase, menaquinone biosynthesis; HET: CAA;
2.9A {Staphylococcus aureus}
Length = 273
Score = 28.3 bits (64), Expect = 3.3
Identities = 12/30 (40%), Positives = 16/30 (53%), Gaps = 6/30 (20%)
Query: 271 PVIAAVNALTWQYYLGG--VIQYNCDGSLA 298
PVIA V Y +GG V+ CD ++A
Sbjct: 109 PVIAMVKG----YAVGGGNVLNVVCDLTIA 134
>1zx5_A Mannosephosphate isomerase, putative; STRU genomics, PSI, protein
structure initiative, midwest center structural
genomics, MCSG; HET: LFR; 2.30A {Archaeoglobus fulgidus}
SCOP: b.82.1.3
Length = 300
Score = 28.3 bits (63), Expect = 3.6
Identities = 12/59 (20%), Positives = 21/59 (35%), Gaps = 6/59 (10%)
Query: 162 CWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCS---GGDFCALLDWMDVNK 217
W FS + S +K LS+ E+ + +G + F L+ +D
Sbjct: 39 SWEFSAHTSRPSTVLVKGQQLSM---IELFSKHRDELLGRAAEKFSKFPILVRLIDAAS 94
>1wue_A Mandelate racemase/muconate lactonizing enzyme FA protein;
structural genomics, unknown function, nysgxrc target
T2185; 2.10A {Enterococcus faecalis} SCOP: c.1.11.2
d.54.1.1
Length = 386
Score = 28.0 bits (63), Expect = 3.8
Identities = 26/132 (19%), Positives = 44/132 (33%), Gaps = 33/132 (25%)
Query: 112 HHKHHDHHHNHVKKRS---ITTGITIPTGIPVKKDWREAGIIGKVRNQQTC--------- 159
HH HH HHH V + S I + T +P+K + + G++ +
Sbjct: 3 HHHHHHHHHGLVPRGSHMNIQSIETYQVRLPLKTPFVTS--YGRLEEKAFDLFVITDEQG 60
Query: 160 ----GACWAFS----TVETAESM-HALKNGTLSLL---------SVQEVIDCAGNGNMGC 201
G AF ET + ++ + LL V + + MG
Sbjct: 61 NQGFGELVAFEQPDYVQETLVTERFIIQQHLIPLLLTEAIEQPQEVSTIFEEVKGHWMGK 120
Query: 202 SGGDFCALLDWM 213
+ + A+ D
Sbjct: 121 AALE-TAIWDLY 131
>3ebl_A Gibberellin receptor GID1; alpha/beta hydrolase, lipase,
gibberellin signaling pathway, hydrolase, nucleus,
hydrolase receptor; HET: GA4; 1.90A {Oryza sativa subsp}
PDB: 3ed1_A*
Length = 365
Score = 28.1 bits (63), Expect = 4.0
Identities = 8/19 (42%), Positives = 12/19 (63%)
Query: 104 VNKHVLMSHHKHHDHHHNH 122
+N ++ H HH HHH+H
Sbjct: 347 LNANLYYGSHHHHHHHHHH 365
>2xgg_A Microneme protein 2; A/I domain, cell adhesion, hydrolase; 2.05A
{Toxoplasma gondii}
Length = 178
Score = 27.7 bits (62), Expect = 4.3
Identities = 6/42 (14%), Positives = 12/42 (28%), Gaps = 1/42 (2%)
Query: 55 IRFKNFEKSLDIIEELNKNRQ-SPESARYGITEFSDLSEEEF 95
I +NF + PE + +S ++
Sbjct: 30 IGIQNFRLVKQFLHTFLMVLPIGPEEVNNAVVTYSTDVHLQW 71
>4eml_A Naphthoate synthase; 1,4-dihydroxy-2-naphthoyl-coenzyme A, lyase;
2.04A {Synechocystis SP}
Length = 275
Score = 27.9 bits (63), Expect = 4.4
Identities = 11/30 (36%), Positives = 15/30 (50%), Gaps = 6/30 (20%)
Query: 271 PVIAAVNALTWQYYLGG--VIQYNCDGSLA 298
VIA V Y +GG V+ CD ++A
Sbjct: 111 VVIALVAG----YAIGGGHVLHLVCDLTIA 136
>3i71_A Ethanolamine utilization protein EUTK; helix-turn-helix, unknown
function; HET: FLC; 2.10A {Escherichia coli}
Length = 68
Score = 25.9 bits (56), Expect = 4.4
Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 6/66 (9%)
Query: 61 EKSLDIIEELNKNRQSPE----SARYG--ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
E + +++ L RQ +A +G + + + E+ F LR +++ L H +
Sbjct: 3 ESADELLALLTSVRQGMTAGEVAAHFGWPLEKARNALEQLFSAGTLRKRSSRYRLKPHLE 62
Query: 115 HHDHHH 120
HH HHH
Sbjct: 63 HHHHHH 68
>3kd6_A Carbohydrate kinase, PFKB family; nucleoside kinase, AMP, PSI-II,
NYSGXRC, struc genomics, protein structure initiative;
HET: AMP; 1.88A {Chlorobaculum tepidum}
Length = 313
Score = 27.9 bits (62), Expect = 4.7
Identities = 11/42 (26%), Positives = 17/42 (40%), Gaps = 7/42 (16%)
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH--HKHHDHHH 120
++G ++DL E R + +S HH HHH
Sbjct: 277 QFGPYRYNDLDLLEVDDR-----YQSFLELSRIEEGHHHHHH 313
>3pqa_A Lactaldehyde dehydrogenase; structural genomics, protein structure
initiative, nysgrc, P biology, oxidoreductase; 1.50A
{Methanocaldococcus jannaschii} PDB: 3rhd_A*
Length = 486
Score = 27.9 bits (63), Expect = 4.9
Identities = 13/48 (27%), Positives = 18/48 (37%), Gaps = 6/48 (12%)
Query: 78 ESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKK 125
E +Y + E S KT + + SHH HH H +K
Sbjct: 445 EGVKYAMEEMS-----NIKTIIISKA-ENLYFQSHHHHHHWSHPQFEK 486
>3erv_A Putative C39-like peptidase; structural genomics, unknown function,
PSI-2, protein structure initiative; 2.10A {Bacillus
anthracis}
Length = 236
Score = 27.4 bits (60), Expect = 6.0
Identities = 12/62 (19%), Positives = 20/62 (32%), Gaps = 3/62 (4%)
Query: 253 DTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI---NHAVQIVGY 309
D S + + PV+ NA + + +I H V ++GY
Sbjct: 126 DLTGKSIEELYKSVKAGQPVVIITNATFAPLDEDEFTTWETNNGDVSITYNEHCVVLIGY 185
Query: 310 DN 311
D
Sbjct: 186 DQ 187
>1ijb_A VON willebrand factor; dinucleotide-binding fold, blood clotting;
1.80A {Homo sapiens} SCOP: c.62.1.1 PDB: 1ijk_A 1auq_A
1u0n_A 3hxo_A 1uex_C 3hxq_A 1sq0_A 1m10_A 1fns_A 1oak_A
1u0o_C
Length = 202
Score = 27.3 bits (61), Expect = 6.0
Identities = 9/45 (20%), Positives = 15/45 (33%), Gaps = 7/45 (15%)
Query: 55 IRFKNFEKSLD----IIEELNKNRQSPESARYGITEFSDLSEEEF 95
+ FE ++E L S + R + E+ D S
Sbjct: 26 LSEAEFEVLKAFVVDMMERLRV---SQKWVRVAVVEYHDGSHAYI 67
>1xsv_A Hypothetical UPF0122 protein SAV1236; helix-turn-helix, putative
DNA-binding protein, signal recognition particle,
unknown function; 1.70A {Staphylococcus aureus subsp}
SCOP: a.4.13.3
Length = 113
Score = 26.3 bits (58), Expect = 6.3
Identities = 11/42 (26%), Positives = 23/42 (54%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
+ E+KLEL+ F+QR + +H + ++ + +E+L
Sbjct: 71 DYEKKLELYQKFEQRREIYDEMKQHLSNPEQIQRYIQQLEDL 112
>2k8i_A SLYD, peptidyl-prolyl CIS-trans isomerase; ppiase, chaperone,
rotamase; NMR {Escherichia coli}
Length = 171
Score = 26.9 bits (60), Expect = 6.7
Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 6/40 (15%)
Query: 84 ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKH-HDHHHNH 122
+ + +EEE H+ H + H HH H HDHHH+H
Sbjct: 136 VVAIREATEEELAHGHV-HGAHDH----HHDHDHDHHHHH 170
>3odm_A Pepcase, PEPC, phosphoenolpyruvate carboxylase; beta-barrel, lyase;
2.95A {Clostridium perfringens}
Length = 560
Score = 27.4 bits (60), Expect = 7.1
Identities = 9/25 (36%), Positives = 11/25 (44%)
Query: 112 HHKHHDHHHNHVKKRSITTGITIPT 136
HH HH HHH+ + IP
Sbjct: 4 HHHHHHHHHSSGHIDDDDKHMKIPC 28
>1uw4_B UPF2, regulator of nonsense transcripts 2; nonsense mediated mRNA
decay protein, RNA-binding protein, N domain, MIF4G
domain; 1.95A {Homo sapiens} SCOP: a.118.1.14
Length = 248
Score = 27.2 bits (60), Expect = 7.3
Identities = 21/100 (21%), Positives = 37/100 (37%), Gaps = 9/100 (9%)
Query: 1 MFDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNF 60
D LF + L+ + + ++KL+ F + QRY + K ++ K+
Sbjct: 142 SLDPPEHLFRIRLVCTILDTCGQYFDRGSSKRKLDCFLVYFQRY--VWWKKSLEVWTKDH 199
Query: 61 EKSLDI-------IEELNKNRQSPESARYGITEFSDLSEE 93
+DI +E L + S I + DL E
Sbjct: 200 PFPIDIDYMISDTLELLRPKIKLCNSLEESIRQVQDLERE 239
>1atz_A VON willebrand factor; collagen-binding, hemostasis, dinucleotide
binding fold; 1.80A {Homo sapiens} SCOP: c.62.1.1 PDB:
4dmu_B 2adf_A 1fe8_A 1ao3_A
Length = 189
Score = 26.9 bits (60), Expect = 7.5
Identities = 5/59 (8%), Positives = 16/59 (27%), Gaps = 4/59 (6%)
Query: 55 IRFKNFEKSLDIIEELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
F++ + P + + ++ ++ + K L+S
Sbjct: 18 FPASYFDEMKSFAKAFISKANIGPRLTQVSVLQYGSITTIDV---PWNVVPEKAHLLSL 73
>1pt6_A Integrin alpha-1; cell adhesion; 1.87A {Homo sapiens} SCOP:
c.62.1.1 PDB: 4a0q_A 1qcy_A 1qc5_A 1qc5_B 1ck4_A 1mhp_A
Length = 213
Score = 27.0 bits (60), Expect = 7.8
Identities = 8/55 (14%), Positives = 21/55 (38%), Gaps = 4/55 (7%)
Query: 59 NFEKSLDIIEELNKNRQ-SPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
++ + +L K P+ + GI ++ + EF +L + ++
Sbjct: 22 PWDSVTAFLNDLLKRMDIGPKQTQVGIVQYGENVTHEF---NLNKYSSTEEVLVA 73
>2d7u_A Adenylosuccinate synthetase; structural genomics, conserved
hypothetical protein, NPPSFA; 2.50A {Pyrococcus
horikoshii}
Length = 339
Score = 27.2 bits (61), Expect = 7.9
Identities = 10/48 (20%), Positives = 18/48 (37%), Gaps = 1/48 (2%)
Query: 84 ITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTG 131
E L + K R + ++ HK D + ++ + TTG
Sbjct: 82 FHELEQLKDFNVKDR-VGIDYRCAIIEEKHKQLDRTNGYLHGKIGTTG 128
>2hza_A Nickel-responsive regulator; nickel-binding, ribbon-helix-helix,
transcription factor, ME binding protein; HET: 3CM;
2.10A {Escherichia coli} SCOP: a.43.1.3 d.58.18.4 PDB:
1q5v_A* 2hzv_A 3od2_A*
Length = 133
Score = 26.4 bits (58), Expect = 8.2
Identities = 4/25 (16%), Positives = 10/25 (40%)
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHH 120
+ +H + + + H H +H
Sbjct: 70 RIVSTQHHHHDLSVATLHVHINHDD 94
>1mc0_A 3',5'-cyclic nucleotide phosphodiesterase 2A; GAF domain, 3',5'
guanosine monophosphate, hydrolase; HET: PCG; 2.86A {Mus
musculus} SCOP: d.110.2.1 d.110.2.1
Length = 368
Score = 27.3 bits (60), Expect = 8.3
Identities = 7/34 (20%), Positives = 17/34 (50%), Gaps = 6/34 (17%)
Query: 87 FSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+ ++E ++++ +M + +HH HHH
Sbjct: 341 YKKVNEAQYRSHLANE------MMMYLEHHHHHH 368
>1n3y_A Integrin alpha-X; alpha/beta rossmann fold, cell adhesion; 1.65A
{Homo sapiens} SCOP: c.62.1.1
Length = 198
Score = 26.6 bits (59), Expect = 8.9
Identities = 8/58 (13%), Positives = 21/58 (36%), Gaps = 4/58 (6%)
Query: 55 IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSH 112
I +NF ++ + + + S ++ + +FS+ + F +S
Sbjct: 22 ISSRNFATMMNFVRAVIS-QFQRPSTQFSLMQFSNKFQTHF---TFEEFRRSSNPLSL 75
Database: pdb70
Posted date: Sep 4, 2012 3:40 AM
Number of letters in database: 6,701,793
Number of sequences in database: 27,921
Lambda K H
0.319 0.133 0.406
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 27921
Number of Hits to DB: 4,964,434
Number of extensions: 303477
Number of successful extensions: 1769
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1512
Number of HSP's successfully gapped: 107
Length of query: 317
Length of database: 6,701,793
Length adjustment: 94
Effective length of query: 223
Effective length of database: 4,077,219
Effective search space: 909219837
Effective search space used: 909219837
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 57 (25.5 bits)