RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5221
(436 letters)
>gnl|CDD|235478 PRK05457, PRK05457, heat shock protein HtpX; Provisional.
Length = 284
Score = 345 bits (887), Expect = e-118
Identities = 137/276 (49%), Positives = 183/276 (66%), Gaps = 18/276 (6%)
Query: 2 LSLLGINYFFYKIGLNIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTE 61
LSLLG+ + LN+ L + + +FGF GS ISLL+SK +AK S VI+ P N TE
Sbjct: 22 LSLLGVQSY-----LNLGGLLVFAAVFGFGGSFISLLMSKWMAKRSTGAEVIEQPRNETE 76
Query: 62 DWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKA 120
WL++TV + ++ A I PEVAIY+ EINAFATGA KN+SL+A+S LL++M++ E++A
Sbjct: 77 RWLVETVARQARQAGIGMPEVAIYHSPEINAFATGASKNNSLVAVSTGLLQNMSRDEVEA 136
Query: 121 IIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKI 180
++AHEISHI+NGDMVTM LIQGV+NTF+I +SRI + I+D F+ N N
Sbjct: 137 VLAHEISHIANGDMVTMTLIQGVVNTFVIFLSRIIAQIVDRFVSG-------NEEGNGIG 189
Query: 181 YGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVND 240
Y +++ +L+I G ASI+V WFSR+REFRAD G AKL G + MI+AL+ L
Sbjct: 190 YFIVSIVLEIVFGILASIIVMWFSRHREFRADAGGAKLAG-REKMIAALQRLKTSYE-PQ 247
Query: 241 LPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAIL 276
LP S+A GIN K +LF +HP EKRIA L
Sbjct: 248 LPGSMAAFGING---KSGLSELFMSHPPLEKRIAAL 280
>gnl|CDD|182691 PRK10743, PRK10743, heat shock protein IbpA; Provisional.
Length = 137
Score = 161 bits (408), Expect = 3e-48
Identities = 66/132 (50%), Positives = 93/132 (70%)
Query: 296 FAPFYRSTIGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIET 355
+P YRS IGFDRL + +N ++++ YPPYN+ELVDEN YRI +A+AGF S+LEI
Sbjct: 6 LSPLYRSAIGFDRLFNLLENNQSQSNGGYPPYNVELVDENHYRIAIAVAGFAESELEITA 65
Query: 356 ERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIREV 415
+ + L + G + ++R +L++GIA RNFE +FQLA+++HV A G+L IDL R +
Sbjct: 66 QDNLLVVKGAHADEQKERTYLYQGIAERNFERKFQLAENIHVRGANLVNGLLYIDLERVI 125
Query: 416 PEALKPRKIYIN 427
PEA KPR+I IN
Sbjct: 126 PEAKKPRRIEIN 137
>gnl|CDD|107227 cd06470, ACD_IbpA-B_like, Alpha-crystallin domain (ACD) found in
Escherichia coli inclusion body-associated proteins IbpA
and IbpB, and similar proteins. IbpA and IbpB are 16
kDa small heat shock proteins (sHsps). sHsps are
molecular chaperones that suppress protein aggregation
and protect against cell stress, and are generally
active as large oligomers consisting of multiple
subunits. IbpA and IbpB are produced during high-level
production of various heterologous proteins,
specifically human prorenin, renin and bovine
insulin-like growth factor 2 (bIGF-2), and are strongly
associated with inclusion bodies containing these
heterologous proteins. IbpA and IbpB work as an
integrated system to stabilize thermally aggregated
proteins in a disaggregation competent state. The
chaperone activity of IbpB is also significantly
elevated as the temperature increases from normal to
heat shock. The high temperature results in the
disassociation of 2-3-MDa IbpB oligomers into smaller
approximately 600-kDa structures. This elevated activity
seen under heat shock conditions is retained for an
extended period of time after the temperature is
returned to normal. IbpA also forms multimers.
Length = 90
Score = 141 bits (357), Expect = 3e-41
Identities = 50/90 (55%), Positives = 61/90 (67%), Gaps = 1/90 (1%)
Query: 325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQ-QRNFLHRGIATR 383
PPYNIE EN YRIT+A+AGF+ DLEIE E + L +TG+K + +R +LHRGIA R
Sbjct: 1 PPYNIEKTGENNYRITLAVAGFSEDDLEIEVENNQLTVTGKKADEENEEREYLHRGIAKR 60
Query: 384 NFEHRFQLADHVHVVAAFFDQGILNIDLIR 413
FE F LADHV V A + G+L IDL R
Sbjct: 61 AFERSFNLADHVKVKGAELENGLLTIDLER 90
>gnl|CDD|223575 COG0501, HtpX, Zn-dependent protease with chaperone function
[Posttranslational modification, protein turnover,
chaperones].
Length = 302
Score = 146 bits (370), Expect = 1e-40
Identities = 74/287 (25%), Positives = 130/287 (45%), Gaps = 33/287 (11%)
Query: 1 MLSLLGINYFFYKIGLN----IKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNP 56
+L LG + G+ + L + +++ F +ISLL SK + S+ + + P
Sbjct: 31 LLLALGPSLLLLLGGIAAASALGRLLLIALLLAFAALLISLLFSKALVLKSLGALTLSEP 90
Query: 57 ---SNFTEDWLLKTVNKISKNANI-STPEVAIYNG-EINAFATGAFKNSSLIAISANLLE 111
+ LL V ++++ A I PEV I + NAFA G + + ++ LL+
Sbjct: 91 ILLAPRLYAVLLLKVAELARQAGIPHMPEVYILETPQPNAFALGGGPKNGRVVVTTGLLD 150
Query: 112 SMTKLEIKAIIAHEISHISNGDMVT-MLLIQGVINTFIILISRIFSYILDCFLLNSYDDS 170
+ E++A++AHE+ HI N + + L + + F++L + + L
Sbjct: 151 LLNDDELEAVLAHELGHIKNRHTLVRLTLRGLLASAFVLLATLALAAGLL---------- 200
Query: 171 NYNSNYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALR 230
+ + +L + F A+++V FSR RE+ AD +AKL G P+ + SAL+
Sbjct: 201 -------GEAALALLLLLLLLALFLATLLVLAFSRKREYEADRFAAKLTG-PEKLASALQ 252
Query: 231 HLSNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAILS 277
L+ +S + I + F LF+THP +RIA L
Sbjct: 253 KLARLSGRANSK-----AFIASGFSGGRLQALFSTHPPLAERIAALR 294
>gnl|CDD|183223 PRK11597, PRK11597, heat shock chaperone IbpB; Provisional.
Length = 142
Score = 139 bits (353), Expect = 5e-40
Identities = 56/130 (43%), Positives = 81/130 (62%), Gaps = 2/130 (1%)
Query: 297 APFYRSTIGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIETE 356
+P R IGFD+LA N QS+PPYNIE D+N YRIT+A+AGF + DL+I+ E
Sbjct: 7 SPLLRQWIGFDKLANALQN--AGESQSFPPYNIEKSDDNHYRITLALAGFRQEDLDIQLE 64
Query: 357 RDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIREVP 416
L + G + ++ +LH+G+ + F F LA+++ V A F G+L+IDLIR P
Sbjct: 65 GTRLTVKGTPEQPEKEVKWLHQGLVNQPFSLSFTLAENMEVSGATFVNGLLHIDLIRNEP 124
Query: 417 EALKPRKIYI 426
EA+ P++I I
Sbjct: 125 EAIAPQRIAI 134
>gnl|CDD|223149 COG0071, IbpA, Molecular chaperone (small heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 146
Score = 125 bits (315), Expect = 2e-34
Identities = 47/136 (34%), Positives = 72/136 (52%), Gaps = 5/136 (3%)
Query: 296 FAPFYRSTIGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIET 355
F P R + GFDRL + F N+ PP +IE D+ YRIT + G ++ D+EI
Sbjct: 12 FFPLLRDSPGFDRLFREFGNLPESRPTGTPPVDIEETDDE-YRITAELPGVDKEDIEITV 70
Query: 356 ERDTLRITGRK--QADTQQRNFLHRGIATRNFEHRFQLADHV--HVVAAFFDQGILNIDL 411
E +TL I G + + + ++ +L R A FE F+L + V V+ A + G+L + L
Sbjct: 71 EGNTLTIRGEREEEEEEEEEGYLRRERAYGEFERTFRLPEKVDPEVIKAKYKNGLLTVTL 130
Query: 412 IREVPEALKPRKIYIN 427
+ PE KP++I I
Sbjct: 131 PKAEPEEKKPKRIEIE 146
>gnl|CDD|235186 PRK03982, PRK03982, heat shock protein HtpX; Provisional.
Length = 288
Score = 122 bits (307), Expect = 9e-32
Identities = 87/290 (30%), Positives = 140/290 (48%), Gaps = 39/290 (13%)
Query: 4 LLGINYFFYKIGLNIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTED- 62
L I Y L I I +I+ ++IS S KI S N ++ +E+
Sbjct: 18 LYAIGYL-----LGGSIGPIIAILLALIPNLISYYYSDKIVLASYNARIV------SEEE 66
Query: 63 --WLLKTVNKISKNANISTPEVAIYNGEI-NAFATGAFKNSSLIAISANLLESMTKLEIK 119
L + V ++++ ANI P+VAI + NAFATG +++A++ +L + + E++
Sbjct: 67 APELYRIVERLAERANIPKPKVAIVPTQTPNAFATGRDPKHAVVAVTEGILNLLNEDELE 126
Query: 120 AIIAHEISHISNGDMVTMLLIQGVINTF---IILISRIFSYILDCFLLNSYDDSNYNSNY 176
+IAHE++HI N D LIQ + T I+ +++ S+ L + D N
Sbjct: 127 GVIAHELTHIKNRD----TLIQTIAATLAGAIMYLAQWLSWGL--WFGGGGRDDRNGGN- 179
Query: 177 NHKIYGLITTILQIFLG-FGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHL--- 232
I ++L I L A+++ SR REF AD G A+L G P + +AL+ L
Sbjct: 180 ------PIGSLLLIILAPIAATLIQFAISRQREFSADEGGARLTGNPLALANALQKLEKG 233
Query: 233 -SNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAILSLLKL 281
I + N P + A + I N F+ + LF+THP E+RI LL++
Sbjct: 234 VRYIPLKNGNP-ATAHMFIINPFRGQFLANLFSTHPPTEERIE--RLLEM 280
>gnl|CDD|235318 PRK04897, PRK04897, heat shock protein HtpX; Provisional.
Length = 298
Score = 120 bits (304), Expect = 4e-31
Identities = 68/215 (31%), Positives = 109/215 (50%), Gaps = 3/215 (1%)
Query: 64 LLKTVNKISKNANISTPEVAIYNGE-INAFATGAFKNSSLIAISANLLESMTKLEIKAII 122
L V ++ A I P V I + NAFATG+ ++ +A++ LL M + E++ +I
Sbjct: 82 LWHIVEDMAMVAQIPMPRVFIIDDPSPNAFATGSSPKNAAVAVTTGLLAIMNREELEGVI 141
Query: 123 AHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYG 182
HEISHI N D + + I + + I L+S I ++ + D + + I
Sbjct: 142 GHEISHIRNYD-IRLSTIAVALASAITLLSDIAGRMMWWGGGSRRRDDDRDGGGLQIILL 200
Query: 183 LITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNIS-IVNDL 241
+++ +L I A+++ SR RE+ AD S +L P +ISAL +SN + + +
Sbjct: 201 IVSLLLLILAPLAATLIQLAISRQREYLADASSVELTRNPQGLISALEKISNSQPMKHPV 260
Query: 242 PKSIAILGINNNFKKKSFLKLFATHPSFEKRIAIL 276
+ A L I++ KKK KLF THP E+RI L
Sbjct: 261 DDASAALYISDPLKKKGLSKLFDTHPPIEERIERL 295
>gnl|CDD|179418 PRK02391, PRK02391, heat shock protein HtpX; Provisional.
Length = 296
Score = 111 bits (279), Expect = 1e-27
Identities = 77/293 (26%), Positives = 126/293 (43%), Gaps = 44/293 (15%)
Query: 4 LLGINYFFYKIGLNIKILTISSIIFGFTG--SIISLLLSKKIAKWSMNLMVIKNPSNFTE 61
LL Y + + + I + +I G + S K+A WSM ++ + + E
Sbjct: 20 LLFALYLVF-VAVLIALGVSLVLIVVIAGGFLLAQYFFSDKLALWSMGARIV-SEDEYPE 77
Query: 62 DWLLKTVNKISKNANISTPEVAIYNGEI-NAFATGAFKNSSLIAISANLLESMTKLEIKA 120
L V ++ A++ P VA+ + ++ NAFATG ++++ ++ L+ + E++A
Sbjct: 78 --LHAMVERLCALADLPKPRVAVADSDVPNAFATGRSPKNAVVCVTTGLMRRLDPDELEA 135
Query: 121 IIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKI 180
++AHE+SH+ N D+ M I +F +S I I+
Sbjct: 136 VLAHELSHVKNRDVAVM-----TIASF---LSTIAFLIVRWGFYFGGFGGRGGGG----- 182
Query: 181 YGLITTILQIFLGFGASIVVAW---------FSRNREFRADHGSAKLLGTPDPMISALRH 231
I + S+VV W SR REF AD G+A + G P + SAL
Sbjct: 183 -----GGGGILVVILVSLVV-WAISFLLIRALSRYREFAADRGAAIITGRPSALASALMK 236
Query: 232 LSNISIVNDLPKS--IAILGINNNF-----KKKSFLKLFATHPSFEKRIAILS 277
+S ++ +P G+N F S +LF+THP EKRIA L
Sbjct: 237 ISG--RMDRVPTEDLREAEGMNAFFIIPALSGGSLGRLFSTHPPLEKRIAQLE 287
>gnl|CDD|235081 PRK02870, PRK02870, heat shock protein HtpX; Provisional.
Length = 336
Score = 102 bits (257), Expect = 2e-24
Identities = 71/274 (25%), Positives = 130/274 (47%), Gaps = 30/274 (10%)
Query: 22 TISSIIFGFTGSIISLLLSK--KIAKWSMNLMVI--KNPSNFTEDWLLKTVNKISKNANI 77
T + I+ I + KI I +N + E L V ++ A +
Sbjct: 71 TATLIMSLVAVISILVTFQNFDKIMLSGTEYKEITPENALSLQERQLYNVVEELLVAAGL 130
Query: 78 S-TPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMV 135
P+V I + +NAFA+G + S+++AI+ LLE + + E++A++AHE+SHI +GD +
Sbjct: 131 RFMPKVYIIDAPYMNAFASGYSEKSAMVAITTGLLEKLDRDELQAVMAHELSHIRHGD-I 189
Query: 136 TMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFG 195
+ L GV++ +++++ Y + + +S N +I IL+ L
Sbjct: 190 RLTLCVGVLSNIMLIVADFLFY----SFMGNRRNSGANRAR------MIILILRYVLPIL 239
Query: 196 ASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGINNNFK 255
+++ + SR RE+ AD G+ +L+ +PM AL+ +SN ND + + + +
Sbjct: 240 TVLLMLFLSRTREYMADAGAVELMRDNEPMARALQKISNDHAQNDEQYAYKHTD-HESTR 298
Query: 256 KKSFL------------KLFATHPSFEKRIAILS 277
+ ++L F+THPS E R+A L
Sbjct: 299 RAAYLFDPAGISPGSLSDAFSTHPSIENRLAALG 332
>gnl|CDD|179524 PRK03001, PRK03001, M48 family peptidase; Provisional.
Length = 283
Score = 101 bits (254), Expect = 2e-24
Identities = 75/280 (26%), Positives = 127/280 (45%), Gaps = 33/280 (11%)
Query: 23 ISSIIFGFTGSIISLLLSKKI---AKWSMNLMVIK-------NPSNFTEDWLLKTVNKIS 72
I +I G G +I+LL + + + W + MV+K + + + + V +++
Sbjct: 20 IGGMIGGSQGMLIALLFALGMNFFSYWFSDKMVLKMYNAQEVDENTAPQ--FYRMVRELA 77
Query: 73 KNANISTPEVAIYNGEI-NAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN 131
+ A + P+V + N + NAFATG + +A + +L +++ EI+ ++AHE++H+ +
Sbjct: 78 QRAGLPMPKVYLINEDQPNAFATGRNPEHAAVAATTGILRVLSEREIRGVMAHELAHVKH 137
Query: 132 GDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIF 191
D+ LI + T IS L F + N + I G+ IL
Sbjct: 138 RDI----LISTISATMAGAIS-----ALANFAMFFGGRDE-NGRPVNPIAGIAVAIL--- 184
Query: 192 LGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNIS------IVNDLPKSI 245
AS++ SR REF AD G A++ G P + SAL + + P +
Sbjct: 185 APLAASLIQMAISRAREFEADRGGARISGDPQALASALDKIHRYASGIPFQAAEAHPAT- 243
Query: 246 AILGINNNFKKKSFLKLFATHPSFEKRIAILSLLKLSLYF 285
A + I N LF+THPS E+RIA L + + F
Sbjct: 244 AQMMIINPLSGGGLANLFSTHPSTEERIARLMAMARTGRF 283
>gnl|CDD|216503 pfam01435, Peptidase_M48, Peptidase family M48.
Length = 223
Score = 94.8 bits (236), Expect = 2e-22
Identities = 45/202 (22%), Positives = 79/202 (39%), Gaps = 26/202 (12%)
Query: 74 NANISTPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNG 132
A EV + + + NAFA G KN + ++ LL ++T+ E+ A++ HEI HI
Sbjct: 46 GAGFPPWEVYVVDSPQPNAFALGGGKNK-RVVVTTGLLLALTEDELAAVLGHEIGHIKAR 104
Query: 133 DMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFL 192
V + QG++ ++L+ + L
Sbjct: 105 HSVES-MSQGLLLNLLLLLGAAALGGRALGFNAN-------------------GFLTALG 144
Query: 193 GFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGINN 252
F +++ +SR +E+ AD A+ LG + A V L +A ++
Sbjct: 145 IFLLQLLLLPYSRKQEYEADEAGAR-LGGDKDLARAGYKPRA--AVKFL-AKLAAENLSR 200
Query: 253 NFKKKSFLKLFATHPSFEKRIA 274
K + +L +THP +RI
Sbjct: 201 VSGGKLYPELLSTHPPLVERIQ 222
>gnl|CDD|234944 PRK01345, PRK01345, heat shock protein HtpX; Provisional.
Length = 317
Score = 92.8 bits (231), Expect = 6e-21
Identities = 70/268 (26%), Positives = 129/268 (48%), Gaps = 32/268 (11%)
Query: 26 IIFGFTGSIISLLLSKKI---AKWSMNLMVIK-------NPSNFTEDWLLKTVNKISKNA 75
+I G G +I+L+++ + + W+ + MV++ + + E L + V +++ A
Sbjct: 23 LIGGAGGMMIALVIAAGMNLFSYWNSDKMVLRMYGAQEVDERSAPE--LYRMVRDLARRA 80
Query: 76 NISTPEVAIY-NGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDM 134
+ P+V I N + NAFATG ++ +A + LL+ ++ E+ ++AHE++H+ N D
Sbjct: 81 GLPMPKVYIIDNPQPNAFATGRNPENAAVAATTGLLQRLSPEEVAGVMAHELAHVKNRDT 140
Query: 135 VTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLG- 193
+TM + T IS + ++ F ++N GL+ T+ + +
Sbjct: 141 LTMT----ITATLAGAISMLANF---AFFFGGNRENNNGP------LGLVGTLAAMIVAP 187
Query: 194 FGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNI--SIVNDLPK---SIAIL 248
A +V SR RE+ AD A++ G P + SAL + + N+ + + A +
Sbjct: 188 LAAMLVQMAISRTREYAADRRGAEICGNPLWLASALGKIERGAHGVPNEEAERNPATAHM 247
Query: 249 GINNNFKKKSFLKLFATHPSFEKRIAIL 276
I N + LF+THP+ E RIA L
Sbjct: 248 FIINPLSGEGMDNLFSTHPATENRIAAL 275
>gnl|CDD|235102 PRK03072, PRK03072, heat shock protein HtpX; Provisional.
Length = 288
Score = 86.6 bits (215), Expect = 4e-19
Identities = 68/280 (24%), Positives = 122/280 (43%), Gaps = 28/280 (10%)
Query: 4 LLGINYFFYKIGLNIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDW 63
++ I F + GL I +L + S K+A +M+ + S
Sbjct: 21 IVFIGALFGRTGLGIAVL------IAVGMNAYVYWNSDKLALRAMHAQPV---SEVQAPA 71
Query: 64 LLKTVNKISKNANISTPEVAIYNGEI-NAFATGAFKNSSLIAISANLLESMTKLEIKAII 122
+ + V ++S A P + I NAFATG ++ + + +L+ + + E++ ++
Sbjct: 72 MYRIVRELSTAARQPMPRLYISPTAAPNAFATGRNPRNAAVCCTEGILQILNERELRGVL 131
Query: 123 AHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYG 182
HE+SH+ N D++ + G + + I ++ + + + ++
Sbjct: 132 GHELSHVYNRDILISS-VAGALASVITYLANMAMFA-----------GMFGGRRDNDGPN 179
Query: 183 LITTILQIFLG-FGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDL 241
+ +L LG A+++ SR+RE++AD A+L G P + SALR +S L
Sbjct: 180 PLALLLVSLLGPIAATVIQLAISRSREYQADESGAELTGDPLALASALRKISGGVQAAPL 239
Query: 242 PK-----SIAILGINNNFKKKSFLKLFATHPSFEKRIAIL 276
P S A L I N F+ +LF+THP RIA L
Sbjct: 240 PPEPQLASQAHLMIANPFRAGGIGRLFSTHPPMADRIARL 279
>gnl|CDD|107221 cd06464, ACD_sHsps-like, Alpha-crystallin domain (ACD) of
alpha-crystallin-type small(s) heat shock proteins
(Hsps). sHsps are small stress induced proteins with
monomeric masses between 12 -43 kDa, whose common
feature is the Alpha-crystallin domain (ACD). sHsps are
generally active as large oligomers consisting of
multiple subunits, and are believed to be
ATP-independent chaperones that prevent aggregation and
are important in refolding in combination with other
Hsps.
Length = 88
Score = 72.6 bits (179), Expect = 5e-16
Identities = 22/82 (26%), Positives = 44/82 (53%), Gaps = 3/82 (3%)
Query: 333 DENKYRITMAIAGFNRSDLEIETERDTLRITG-RKQADTQQRNFLHRGIATRNFEHRFQL 391
++ Y + + GF + D+++E E L I+G R++ + ++ N+L R + +F F+L
Sbjct: 5 TDDAYVVEADLPGFKKEDIKVEVEDGVLTISGEREEEEEEEENYLRRERSYGSFSRSFRL 64
Query: 392 ADHVHV--VAAFFDQGILNIDL 411
+ V + A + G+L I L
Sbjct: 65 PEDVDPDKIKASLENGVLTITL 86
>gnl|CDD|234931 PRK01265, PRK01265, heat shock protein HtpX; Provisional.
Length = 324
Score = 72.9 bits (179), Expect = 4e-14
Identities = 65/307 (21%), Positives = 120/307 (39%), Gaps = 39/307 (12%)
Query: 5 LGINYFFYKIGLNIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWL 64
+ Y+ + + ++ I F +II L + + + + P++ WL
Sbjct: 28 YAVAYYAFGAQFGVGLILGILIFVFFL-NIIQWLFGPYMINAAYRTVEVT-PTDPVYGWL 85
Query: 65 LKTVNKISKNANISTPEVAIYNGEI-NAFATGAFKNSSLIAISANLLESMTKLEIKAIIA 123
V +++K I P+V I + NAFA G+ IAI+ LL+ + + EIKA+
Sbjct: 86 YSIVAEVAKYNGIRVPKVYIADVPFPNAFAYGSPIAGKRIAITLPLLKILNRDEIKAVAG 145
Query: 124 HEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGL 183
HE+ H+ + D V +L+ G+I T I + +S N
Sbjct: 146 HELGHLKHRD-VELLMAIGLIPTLIYYLG--YSLFWGGMFGGGGGGRGNNGGL-----LF 197
Query: 184 ITTILQIFLGFGASIVVAWFSRNREFRADHGSA---------------KLLGTPDPMI-- 226
+ I + + F +++V +R RE AD SA K+ + DP
Sbjct: 198 LIGIALMAVSFVFNLLVLSINRMREAYADVNSALTVPGGAENLQTALAKITLSMDPGALE 257
Query: 227 -----SALRHLSNISIVNDLPKSIA------ILGINNNFKKKSFLKLFATHPSFEKRIAI 275
S ++++ ++ + + ++ K + +F+ HP KRI +
Sbjct: 258 RFKKKSTTNQMASMLFFSNAIEEVPTWDARELVEYWKTTKVPWYADIFSDHPHPAKRIQL 317
Query: 276 LSLLKLS 282
L L S
Sbjct: 318 LEKLSKS 324
>gnl|CDD|215655 pfam00011, HSP20, Hsp20/alpha crystallin family.
Length = 101
Score = 67.6 bits (166), Expect = 4e-14
Identities = 21/98 (21%), Positives = 49/98 (50%), Gaps = 2/98 (2%)
Query: 332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQL 391
D++ + + + + GF +L+++ E + + + G+ + + + + L + R+F +F L
Sbjct: 4 EDKDAFVVKLDVPGFKPEELKVKVEDNRVLVKGKHEKEEEDDHGLRSERSYRSFSRKFVL 63
Query: 392 ADHVHV--VAAFFDQGILNIDLIREVPEALKPRKIYIN 427
++ V A G+L + + + P KPR+I I
Sbjct: 64 PENADPDKVKASLKDGVLTVTVPKLEPPEKKPRRIQIQ 101
>gnl|CDD|107219 cd00298, ACD_sHsps_p23-like, This domain family includes the
alpha-crystallin domain (ACD) of alpha-crystallin-type
small heat shock proteins (sHsps) and a similar domain
found in p23-like proteins. sHsps are small stress
induced proteins with monomeric masses between 12 -43
kDa, whose common feature is this ACD. sHsps are
generally active as large oligomers consisting of
multiple subunits, and are believed to be
ATP-independent chaperones that prevent aggregation and
are important in refolding in combination with other
Hsps. p23 is a cochaperone of the Hsp90 chaperoning
pathway. It binds Hsp90 and participates in the folding
of a number of Hsp90 clients including the progesterone
receptor. p23 also has a passive chaperoning activity.
p23 in addition may act as the cytosolic prostaglandin
E2 synthase. Included in this family is the p23-like
C-terminal CHORD-SGT1 (CS) domain of suppressor of G2
allele of Skp1 (Sgt1) and the p23-like domains of human
butyrate-induced transcript 1 (hB-ind1), NUD (nuclear
distribution) C, Melusin, and NAD(P)H cytochrome b5
(NCB5) oxidoreductase (OR).
Length = 80
Score = 53.0 bits (128), Expect = 4e-09
Identities = 19/81 (23%), Positives = 41/81 (50%), Gaps = 8/81 (9%)
Query: 333 DENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLA 392
+++ +T+ + G + D+++E E + L I+G+++ + + R + FE F+L
Sbjct: 4 TDDEVVVTVDLPGVKKEDIKVEVEDNVLTISGKREEEEE------RERSYGEFERSFELP 57
Query: 393 DHVHV--VAAFFDQGILNIDL 411
+ V A + G+L I L
Sbjct: 58 EDVDPEKSKASLENGVLEITL 78
>gnl|CDD|218642 pfam05569, Peptidase_M56, BlaR1 peptidase M56. Production of
beta-Lactamase and penicillin-binding protein 2a (which
mediate staphylococcal resistance to beta-lactam
antibiotics) is regulated by a signal-transducing
integral membrane protein and a transcriptional
repressor. The signal transducer is a fusion protein
with penicillin-binding and zinc metalloprotease
domains. The signal for protein expression is
transmitted by site-specific proteolytic cleavage of
both the transducer, which auto-activates, and the
repressor, which is inactivated, unblocking gene
transcription. Homologues to this peptidase domain,
which corresponds to Merops family M56, are also found
in a number of other bacterial genome sequences.
Length = 299
Score = 43.2 bits (102), Expect = 1e-04
Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 13/84 (15%)
Query: 60 TEDWLLKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIK 119
ED K I ++ I +P V G +K I + A+ ++ E+
Sbjct: 151 KEDMGYKRNIVIRLSSTIDSPAV-----------FGLWKP--RIVLPADFDTRLSDEELD 197
Query: 120 AIIAHEISHISNGDMVTMLLIQGV 143
I+ HE++H+ GD++ LL+ +
Sbjct: 198 YILLHELAHLKRGDLIINLLVAVL 221
>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
repeats [General function prediction only].
Length = 484
Score = 39.7 bits (93), Expect = 0.003
Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 12/78 (15%)
Query: 61 EDWLLKTVNKISKNANISTPEVAIY---NGEINAFAT-GA--FKNSSLIAISANLLESMT 114
E+++ +++ A++ + + INAFAT G N+ L+ + N ES
Sbjct: 73 EEYVNSLGQRLAAAADLVKTPFTFFLVNDDSINAFATPGGYVVVNTGLLLTAEN--ES-- 128
Query: 115 KLEIKAIIAHEISHISNG 132
E+ +IAHEI H++
Sbjct: 129 --ELAGVIAHEIGHVAQR 144
>gnl|CDD|226672 COG4219, MecR1, Antirepressor regulating drug resistance, predicted
signal transduction N-terminal membrane component
[Transcription / Signal transduction mechanisms].
Length = 337
Score = 39.0 bits (91), Expect = 0.003
Identities = 17/55 (30%), Positives = 31/55 (56%), Gaps = 8/55 (14%)
Query: 103 IAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSY 157
I + A+ +E +T E+K II HE+SH+ GD +IN ++++ +F +
Sbjct: 175 IVLPADFVERLTDEELKYIILHELSHLKRGDA--------IINLIVVVLGVLFWF 221
>gnl|CDD|107247 cd06526, metazoan_ACD, Alpha-crystallin domain (ACD) of metazoan
alpha-crystallin-type small(s) heat shock proteins
(Hsps). sHsps are small stress induced proteins with
monomeric masses between 12 -43 kDa, whose common
feature is the Alpha-crystallin domain (ACD). sHsps are
generally active as large oligomers consisting of
multiple subunits, and are believed to be
ATP-independent chaperones that prevent aggregation and
are important in refolding in combination with other
Hsps.
Length = 83
Score = 36.0 bits (84), Expect = 0.004
Identities = 18/80 (22%), Positives = 40/80 (50%), Gaps = 8/80 (10%)
Query: 333 DENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLA 392
D+ K+++T+ + GF +L+++ + L + G+ + + ++ R F R+QL
Sbjct: 5 DDEKFQVTLDVKGFKPEELKVKVSDNKLVVEGKHEEREDEHGYVS-----REFTRRYQLP 59
Query: 393 DHV---HVVAAFFDQGILNI 409
+ V V ++ G+L I
Sbjct: 60 EGVDPDSVTSSLSSDGVLTI 79
>gnl|CDD|107228 cd06471, ACD_LpsHSP_like, Group of bacterial proteins containing an
alpha crystallin domain (ACD) similar to Lactobacillus
plantarum (Lp) small heat shock proteins (sHsp) HSP
18.5, HSP 18.55 and HSP 19.3. sHsps are molecular
chaperones that suppress protein aggregation and protect
against cell stress, and are generally active as large
oligomers consisting of multiple subunits. Transcription
of the genes encoding Lp HSP 18.5, 18.55 and 19.3 is
regulated by a variety of stresses including heat, cold
and ethanol. Early growing L. plantarum cells contain
elevated levels of these mRNAs which rapidly fall of as
the cells enter stationary phase. Also belonging to this
group is Bifidobacterium breve (Bb) HSP20 and Oenococcus
oenis (syn. Leuconostoc oenos) (Oo) HSP18.
Transcription of the gene encoding BbHSP20 is strongly
induced following heat or osmotic shock, and that of the
gene encoding OoHSP18 following heat, ethanol or acid
shock. OoHSP18 is peripherally associated with the
cytoplasmic membrane.
Length = 93
Score = 34.4 bits (80), Expect = 0.017
Identities = 17/87 (19%), Positives = 40/87 (45%), Gaps = 11/87 (12%)
Query: 333 DENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQR----NFLHRGIATRNFEHR 388
+++Y + + GF + D++++ + L I+ ++ ++ N++ R +F
Sbjct: 8 TDDEYIVEADLPGFKKEDIKLDYKDGYLTISAKRDESKDEKDKKGNYIRRERYYGSFSRS 67
Query: 389 FQLADHVHVVA----AFFDQGILNIDL 411
F L + V A ++ G+L I L
Sbjct: 68 FYLPN---VDEEEIKAKYENGVLKITL 91
>gnl|CDD|107236 cd06481, ACD_HspB9_like, Alpha crystallin domain (ACD) found in
mammalian small heat shock protein (sHsp) HspB9 and
similar proteins. sHsps are molecular chaperones that
suppress protein aggregation and protect against cell
stress, and are generally active as large oligomers
consisting of multiple subunits. Human (h) HspB9 is
expressed exclusively in the normal testis and in
various tumor samples and is a cancer/testis antigen.
hHspB9 interacts with TCTEL1 (T-complex testis
expressed protein -1), a subunit of dynein. hHspB9 and
TCTEL1 are co-expressed in similar cells within the
testis and in tumor cells. Included in this group is
Xenopus Hsp30, a developmentally-regulated
heat-inducible molecular chaperone.
Length = 87
Score = 31.2 bits (71), Expect = 0.18
Identities = 13/61 (21%), Positives = 31/61 (50%), Gaps = 5/61 (8%)
Query: 337 YRITMAIAGFNRSDLEIETERDTLRITGRKQ--ADTQQRNFLHRGIATRNFEHRFQLADH 394
+ + + + GF+ DL + + L +TG+++ + ++ +F + + F QL +H
Sbjct: 9 FSLKLDVRGFSPEDLSVRVDGRKLVVTGKREKKNEDEKGSFSYE---YQEFVREAQLPEH 65
Query: 395 V 395
V
Sbjct: 66 V 66
>gnl|CDD|189002 cd09595, M1, Peptidase M1 family contains aminopeptidase N and
leukotriene A4 hydrolase. M1 Peptidase family includes
aminopeptidase N (APN) and leukotriene A4 hydrolase
(LTA4H). All peptidases in this family bind a single
catalytic zinc ion which is tetrahedrally co-ordinated
by three amino acid ligands and a water molecule that
forms the nucleophile on activation during catalysis.
APN consists of a small N-terminal cytoplasmic domain, a
single transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and is present in a variety
of human tissues and cell types. APN expression is
dysregulated in many inflammatory diseases and is
enhanced in numerous tumor cells, making it a lead
target in the development of anti-cancer and
anti-inflammatory drugs. LTA4H is a bifunctional enzyme,
possessing an aminopeptidase as well as an epoxide
hydrolase activity. The two activities occupy different,
but overlapping sites. The activity and physiological
relevance of the aminopeptidase in LTA4H is as yet
unknown while the epoxide hydrolase converts leukotriene
A4 (LTA4) into leukotriene B4 (LTB4), a potent
chemotaxin that is fundamental to the inflammatory
response of mammals.
Length = 407
Score = 33.2 bits (76), Expect = 0.25
Identities = 18/53 (33%), Positives = 29/53 (54%), Gaps = 3/53 (5%)
Query: 88 EINAFATGAFKNSSLIAISANLL---ESMTKLEIKAIIAHEISHISNGDMVTM 137
E+ F +GA +N LI S +LL ++ +IAHE++H G++VT
Sbjct: 235 EVPEFPSGAMENPGLIFFSQSLLLAMIDAGDELLENVIAHELAHQWFGNLVTG 287
>gnl|CDD|223385 COG0308, PepN, Aminopeptidase N [Amino acid transport and
metabolism].
Length = 859
Score = 32.4 bits (74), Expect = 0.46
Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 7/53 (13%)
Query: 92 FATGAFKNSSLIAISANLL-------ESMTKLEIKAIIAHEISHISNGDMVTM 137
F+ GA +N L+ L ++ +IAHE++H G++VTM
Sbjct: 274 FSAGAMENWGLVTFREKYLLADPETATDSDYENVEEVIAHELAHQWFGNLVTM 326
>gnl|CDD|153290 cd07606, BAR_SFC_plant, The Bin/Amphiphysin/Rvs (BAR) domain of the
plant protein SCARFACE (SFC). BAR domains are
dimerization, lipid binding and curvature sensing
modules found in many different proteins with diverse
functions including organelle biogenesis, membrane
trafficking or remodeling, and cell division and
migration. The plant protein SCARFACE (SFC), also called
VAscular Network 3 (VAN3), is a plant ACAP (ArfGAP with
Coiled-coil, ANK repeat and PH domain containing
protein), an Arf GTPase Activating Protein (GAP) that
plays a role in the trafficking of auxin efflux
regulators from the plasma membrane to the endosome. It
is required for the normal vein patterning in leaves.
SCF contains an N-terminal BAR domain, followed by a
Pleckstrin Homology (PH) domain, an Arf GAP domain, and
C-terminal ankyrin (ANK) repeats. BAR domains form
dimers that bind to membranes, induce membrane bending
and curvature, and may also be involved in
protein-protein interactions.
Length = 202
Score = 31.3 bits (71), Expect = 0.74
Identities = 22/101 (21%), Positives = 40/101 (39%), Gaps = 2/101 (1%)
Query: 307 DRLAQIFDNVRTEAQQSYPPYNIELVDENKYRIT-MAIAGFNRSDLEIETERDTLRI-TG 364
DRLAQ D E + + ++ +D + R +++ + ++ E D +
Sbjct: 92 DRLAQFADTDLQEVKDARRRFDKASLDYEQARSKFLSLTKDAKPEILAAAEEDLGTTRSA 151
Query: 365 RKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQG 405
+ A N LH A + E +L+ + AFF G
Sbjct: 152 FETARFDLMNRLHAADARKRVEFLERLSGSMDAHLAFFKSG 192
>gnl|CDD|233857 TIGR02412, pepN_strep_liv, aminopeptidase N, Streptomyces lividans
type. This family is a subset of the members of the
zinc metallopeptidase family M1 (pfam01433), with a
single member characterized in Streptomyces lividans 66
and designated aminopeptidase N. The spectrum of
activity may differ somewhat from the aminopeptidase N
clade of E. coli and most other Proteobacteria, well
separated phylogenetically within the M1 family. The M1
family also includes leukotriene A-4
hydrolase/aminopeptidase (with a bifunctional active
site).
Length = 831
Score = 31.7 bits (72), Expect = 0.87
Identities = 17/51 (33%), Positives = 28/51 (54%), Gaps = 5/51 (9%)
Query: 92 FATGAFKNSSLIAISANLL--ESMTKLEIK---AIIAHEISHISNGDMVTM 137
F GA +N+ + + N L T+ E + +I HE++H+ GD+VTM
Sbjct: 256 FNAGAMENAGCVTFAENFLHRAEATRAEKENRAGVILHEMAHMWFGDLVTM 306
>gnl|CDD|107230 cd06475, ACD_HspB1_like, Alpha crystallin domain (ACD) found in
mammalian small (s)heat shock protein (Hsp)-27 (also
denoted HspB1 in human) and similar proteins. sHsps are
molecular chaperones that suppress protein aggregation
and protect against cell stress, and are generally
active as large oligomers consisting of multiple
subunits. Hsp27 shows enhanced synthesis in response to
stress. It is a molecular chaperone which interacts with
a large number of different proteins. It is found in
many types of human cells including breast, uterus,
cervix, platelets and cancer cells. Hsp27 has diverse
cellular functions including, chaperoning, regulation of
actin polymerization, keratinocyte differentiation,
regulation of inflammatory pathways in keratinocytes,
and protection from oxidative stress through modulating
glutathione levels. It is also a subunit of
AUF1-containing protein complexes. It has been linked to
several transduction pathways regulating cellular
functions including differentiation, cell growth,
development, and apoptosis. Its activity can be
regulated by phosphorylation. Its unphosphorylated state
is a high molecular weight aggregated form (100-800kDa)
composed of up to 24 subunits, which forms as a result
of multiple interactions within the ACD, and is required
for chaperone function and resistance to oxidative
stress. Upon phosphorylation these large aggregates
rapidly disassociate to smaller oligomers and chaperone
activity is modified. High constitutive levels of Hsp27
have been detected in various cancer cells, in
particular those of carcinoma origin. Over-expression of
Hsp27 has a protective effect against various
diseases-processes, including Huntington's disease.
Mutations in Hsp27 have been associated with a form of
distal hereditary motor neuropathy type II and
Charcot-Marie-Tooth disease type 2.
Length = 86
Score = 29.4 bits (66), Expect = 0.94
Identities = 16/81 (19%), Positives = 40/81 (49%), Gaps = 8/81 (9%)
Query: 333 DENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQL- 391
+++++++ + F +L ++T+ + ITG+ +++ H G +R F ++ L
Sbjct: 8 TADRWKVSLDVNHFAPEELVVKTKDGVVEITGKH----EEKQDEH-GFVSRCFTRKYTLP 62
Query: 392 --ADHVHVVAAFFDQGILNID 410
D V ++ GIL ++
Sbjct: 63 PGVDPTAVTSSLSPDGILTVE 83
>gnl|CDD|189008 cd09601, M1_APN_2, Peptidase M1 Aminopeptidase N family incudes
tricorn interacting factor F3, Endoplasmic reticulum
aminopeptidase 1 (ERAP1), Aminopeptidase Q (APQ). This
M1 peptidase family includes eukaryotic and bacterial
members: aminopeptidase N (APN), aminopeptidase Q (APQ,
laeverin), endoplasmic reticulum aminopeptidase 1
(ERAP1) as well as tricorn interacting factor F3.
Aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
3.4.11.2), a Type II integral membrane protease,
consists of a small N-terminal cytoplasmic domain, a
single transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and is present in a variety
of human tissues and cell types (leukocyte, fibroblast,
endothelial and epithelial cells). APN expression is
dysregulated in inflammatory diseases such as chronic
pain, rheumatoid arthritis, multiple sclerosis, systemic
sclerosis, systemic lupus erythematosus,
polymyositis/dermatomyosytis and pulmonary sarcoidosis,
and is enhanced in tumor cells such as melanoma, renal,
prostate, pancreas, colon, gastric and thyroid cancers.
It is considered a marker of differentiation since it is
predominantly expressed on stem cells and on cells of
the granulocytic and monocytic lineages at distinct
stages of differentiation. Thus, APN inhibition may lead
to the development of anti-cancer and anti-inflammatory
drugs. ERAP1 also known as endoplasmic reticulum
aminopeptidase associated with antigen processing
(ERAAP), adipocyte derived leucine aminopeptidase
(A-LAP) or aminopeptidase regulating tumor necrosis
factor receptor I (THFRI) shedding (ARTS-1), associates
with the closely related ER aminopeptidase ERAP2, for
the final trimming of peptides within the ER for
presentation by MHC class I molecules. ERAP1 is
associated with ankylosing spondylitis (AS), an
inflammatory arthritis that predominantly affects the
spine. ERAP1 also aids in the shedding of membrane-bound
cytokine receptors. The tricorn interacting factor F3,
together with factors F1 and F2, degrades the tricorn
protease products, producing free amino acids, thus
completing the proteasomal degradation pathway. F3 is
homologous to F2, but not F1, and shows a strong
preference for glutamate in the P1' position. APQ, also
known as laeverin, is specifically expressed in human
embryo-derived extravillous trophoblasts (EVTs) that
invade the uterus during early placentation. It cleaves
the N-terminal amino acid of various peptides such as
angiotensin III, endokinin C, and kisspeptin-10, all
expressed in the placenta in large quantities. APN is a
receptor for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs are also putative Cry toxin
receptors. Cry1 proteins are pore-forming toxins that
bind to the midgut epithelial cell membrane of
susceptible insect larvae, causing extensive damage.
Several different toxins, including Cry1Aa, Cry1Ab,
Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to
bind to APNs; however, a direct role of APN in
cytotoxicity has been yet to be firmly established.
Length = 446
Score = 31.4 bits (72), Expect = 1.0
Identities = 20/63 (31%), Positives = 30/63 (47%), Gaps = 12/63 (19%)
Query: 82 VAIYNGEINAFATGAFKNSSLIAISANLL-------ESMTKLEIKAIIAHEISHISNGDM 134
VAI + FA GA +N LI L + K + ++AHE++H G++
Sbjct: 248 VAIPD-----FAAGAMENWGLITYRETALLYDPKTSSASNKQRVATVVAHELAHQWFGNL 302
Query: 135 VTM 137
VTM
Sbjct: 303 VTM 305
>gnl|CDD|227123 COG4784, COG4784, Putative Zn-dependent protease [General function
prediction only].
Length = 479
Score = 31.4 bits (71), Expect = 1.1
Identities = 20/81 (24%), Positives = 32/81 (39%), Gaps = 21/81 (25%)
Query: 200 VAWFSRNREFRADHGSAKLLGT----PDPMISALRHLSN---ISIVNDLPKSIAILGINN 252
+A FSRN+E +AD K+LG P L+ ++ V+ + L
Sbjct: 178 LAQFSRNQELQADAIGIKMLGEAGYDPYAAARFLQSMAAYTDFRSVS--GAADQSLD--- 232
Query: 253 NFKKKSFLKLFATHPSFEKRI 273
A+HP+ +RI
Sbjct: 233 ---------FLASHPNTPQRI 244
>gnl|CDD|216501 pfam01433, Peptidase_M1, Peptidase family M1. Members of this
family are aminopeptidases. The members differ widely in
specificity, hydrolysing acidic, basic or neutral
N-terminal residues. This family includes leukotriene-A4
hydrolase, this enzyme also has an aminopeptidase
activity.
Length = 390
Score = 30.7 bits (70), Expect = 1.5
Identities = 18/53 (33%), Positives = 26/53 (49%), Gaps = 7/53 (13%)
Query: 92 FATGAFKNSSLIAISANLL-------ESMTKLEIKAIIAHEISHISNGDMVTM 137
F+ GA +N LI L S K + +IAHE++H G++VTM
Sbjct: 262 FSAGAMENWGLITYREPALLYDPGNSTSSDKQRVAEVIAHELAHQWFGNLVTM 314
>gnl|CDD|220661 pfam10263, SprT-like, SprT-like family. This family represents a
domain found in eukaryotes and prokaryotes. The domain
contains a characteristic motif of the zinc
metallopeptidases. This family includes the bacterial
SprT protein.
Length = 153
Score = 29.7 bits (67), Expect = 1.7
Identities = 9/31 (29%), Positives = 18/31 (58%)
Query: 98 KNSSLIAISANLLESMTKLEIKAIIAHEISH 128
+NS+ I ++ LL ++ E+ + HE+ H
Sbjct: 38 RNSAEIRLNPKLLAENSEEELIETLLHEMCH 68
>gnl|CDD|189009 cd09602, M1_APN_3, Peptidase M1 family containing Aminopeptidase N.
This family contains bacterial and eukaryotic
aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
3.4.11.2), a Type II integral membrane protease
belonging to the M1 gluzincin family. APN consists of a
small N-terminal cytoplasmic domain, a single
transmembrane domain and a large extracellular
ectodomain that contains the active site. It
preferentially cleaves neutral amino acids from the
N-terminus of oligopeptides and, in higher eukaryotes,
is present in a variety of human tissues and cell types
(leukocyte, fibroblast, endothelial and epithelial
cells). APN expression is dysregulated in inflammatory
diseases such as chronic pain, rheumatoid arthritis,
multiple sclerosis, systemic sclerosis, systemic lupus
erythematosus, polymyositis/dermatomyosytis and
pulmonary sarcoidosis, and is enhanced in tumor cells
such as melanoma, renal, prostate, pancreas, colon,
gastric and thyroid cancers. It is predominantly
expressed on stem cells and on cells of the granulocytic
and monocytic lineages at distinct stages of
differentiation, thus considered a marker of
differentiation. Thus, APN inhibition may lead to the
development of anti-cancer and anti-inflammatory drugs.
APNs are also present in many pathogenic bacteria and
represent potential drug targets, Some APNs have been
used commercially, such as one from Lactococcus lactis
used in the food industry. APN also serves as a receptor
for coronaviruses, although the virus receptor
interaction site seems to be distinct from the enzymatic
site and aminopeptidase activity is not necessary for
viral infection. APNs have also been extensively studied
as putative Cry toxin receptors. Cry1 proteins are
pore-forming toxins that bind to the midgut epithelial
cell membrane of susceptible insect larvae, causing
extensive damage. Several different toxins, including
Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
been shown to bind to APNs; however, a direct role of
APN in cytotoxicity has been yet to be firmly
established.
Length = 438
Score = 30.7 bits (70), Expect = 1.7
Identities = 10/17 (58%), Positives = 14/17 (82%)
Query: 121 IIAHEISHISNGDMVTM 137
IAHE++H+ GD+VTM
Sbjct: 285 TIAHEMAHMWFGDLVTM 301
>gnl|CDD|227270 COG4934, COG4934, Predicted protease [Posttranslational modification,
protein turnover, chaperones].
Length = 1174
Score = 30.9 bits (70), Expect = 1.7
Identities = 30/176 (17%), Positives = 63/176 (35%), Gaps = 14/176 (7%)
Query: 17 NIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNAN 76
N K T+S++ F S+ + L+ K + NF D L+ T N IS N
Sbjct: 948 NSKSGTVSNLQIYFLSSVPTSGLTGKSSDG--------GIKNFVIDVLVNT-NGISAINN 998
Query: 77 ISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVT 136
+ I + + + + ++T + + +++ I +S+ +
Sbjct: 999 GTGNYYVIASVSNGTISFSSQIYGKDVYNITVAEGNIT--SVNSALSNLIVSLSSTTVPI 1056
Query: 137 MLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNY-NSNYNHKIYGLITTILQIF 191
+ + + I SY + F + + SN + +Y +Y T +
Sbjct: 1057 IKNVLPS--LVYGEYNIINSYTGNDFGVITIVISNSPSGSYPSTLYNTDQTQTSSY 1110
>gnl|CDD|214368 CHL00117, rpoC2, RNA polymerase beta'' subunit; Reviewed.
Length = 1364
Score = 30.3 bits (69), Expect = 2.7
Identities = 40/202 (19%), Positives = 76/202 (37%), Gaps = 31/202 (15%)
Query: 7 INYFFYKIGLNIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTE---DW 63
+NY Y G I+ ISSI ++ L + + + + ++F E +
Sbjct: 808 VNYILYGNGKPIRG--ISSI------QLVRTCLVLNWDQDKKSSSIEEARASFVEVRTNG 859
Query: 64 LLKTVNKISK-NANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAII 122
L++ +I+ + IS Y + N + +S L+ S N L+S I
Sbjct: 860 LIRDFLRINLVKSPIS------YIRKRNDPS-----SSGLLVQSNNFLDSTNIYSKAEIQ 908
Query: 123 AHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYG 182
+ S N + LL + + ++++S DCF + ++ + N K
Sbjct: 909 SQ--SLSQNQGTIRTLLNRNKESQSLLILSS-----SDCFRIGPFNGKKSKYH-NIKESN 960
Query: 183 LITTILQIFLGFGASIVVAWFS 204
+ I G + +A FS
Sbjct: 961 PLIPIRNSLGPLGTVLQIANFS 982
>gnl|CDD|226341 COG3820, COG3820, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 230
Score = 29.5 bits (66), Expect = 2.8
Identities = 14/51 (27%), Positives = 22/51 (43%)
Query: 200 VAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGI 250
+ W RN D +KL+GT I +R ++ + N P LG+
Sbjct: 109 ILWLLRNHPELKDAQISKLIGTTKSTIEQIRTRTHWNSANITPMDPVTLGL 159
>gnl|CDD|189003 cd09596, M36, Peptidase M36 family also known as fungalysin family.
Peptidase M36 protease family, also known as fungalysin
(elastinolytic metalloproteinase) family, includes
endopeptidases from pathogenic fungi. Fungalysin can
hydrolyze extracellular matrix proteins such as elastin
and keratin, with a preference for cleavage on the amino
side of hydrophobic residues with bulky side-chains.
This family is similar to the M4 (thermolysin) family
due to the presence of the active site residues in HEXXH
and EXXXD motifs, as well as its fold prediction. Some
of these enzymes also contain a protease-associated (PA)
domain insert. The eukaryotic M36 and bacterial M4
families of metalloproteases also share a conserved
domain in their propeptides called FTP
(fungalysin/thermolysin propeptide). Aspergillus
fumigatus causes the pulmonary disease aspergillosis by
invading the lungs of immuno-compromised animals. It
secretes fungalysin that possibly breaks down
proteinaceous structural barriers. A solid lesion known
as an aspergilloma can grow in a lung cavity,
particularly following recovery from tuberculosis.
Length = 271
Score = 29.3 bits (66), Expect = 4.3
Identities = 20/75 (26%), Positives = 27/75 (36%), Gaps = 22/75 (29%)
Query: 74 NANISTPEVAIYNGEINA--FATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISH-IS 130
NAN STP I + + F G ++ + A +IAHE H IS
Sbjct: 68 NANFSTPPDGI-SPRMQMYLFGGGGCRDGAEDA---------------DVIAHEYGHGIS 111
Query: 131 N---GDMVTMLLIQG 142
N G + Q
Sbjct: 112 NRLVGGGAGLSSGQS 126
>gnl|CDD|220310 pfam09608, Alph_Pro_TM, Putative transmembrane protein
(Alph_Pro_TM). This family consists of predicted
transmembrane proteins of about 270 amino acids. Members
are found, so far, only among the Alphaproteobacteria
and only once in each genome.
Length = 236
Score = 28.7 bits (65), Expect = 4.6
Identities = 12/33 (36%), Positives = 15/33 (45%), Gaps = 9/33 (27%)
Query: 180 IYGLITTILQIFLGFGASIVVAWFSRNREFRAD 212
+YGL+ L I G+ AS V FR D
Sbjct: 213 LYGLLAVALAIAAGWLASAV---------FRRD 236
>gnl|CDD|239654 cd03682, ClC_sycA_like, ClC sycA-like chloride channel proteins.
This ClC family presents in bacteria, where it
facilitates acid resistance in acidic soil. Mutation of
this gene (sycA) in Rhizobium tropici CIAT899 causes
serious deficiencies in nodule development, nodulation
competitiveness, and N2 fixation on Phaseolus vulgaris
plants, due to its reduced ability for acid resistance.
This family is part of the ClC chloride channel
superfamiy. These proteins catalyse the selective flow
of Cl- ions across cell membranes and Cl-/H+ exchange
transport. These proteins share two characteristics that
are apparently inherent to the entire ClC chloride
channel superfamily: a unique double-barreled
architecture and voltage-dependent gating mechanism. The
gating is conferred by the permeating anion itself,
acting as the gating charge.
Length = 378
Score = 29.1 bits (66), Expect = 4.9
Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 10/61 (16%)
Query: 3 SLLGINYFFYKIGLNIKI-------LTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKN 55
LG+ + Y I + + ++ IIFG G + + LL + K IKN
Sbjct: 178 HALGLEHTHYHIVFIPTLDPLLFVKVILAGIIFGLAGRLFAELLH-FLKKLLKKR--IKN 234
Query: 56 P 56
P
Sbjct: 235 P 235
>gnl|CDD|148070 pfam06242, DUF1013, Protein of unknown function (DUF1013). Family
of uncharacterized proteins found in Proteobacteria.
Length = 141
Score = 28.0 bits (63), Expect = 5.8
Identities = 17/51 (33%), Positives = 26/51 (50%)
Query: 200 VAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGI 250
+AW RN +D +KL+GT I A+R S+ + N P+ LG+
Sbjct: 74 IAWLLRNHPELSDAQISKLIGTTKTTIEAIRDRSHWNSANIKPRDPVTLGL 124
>gnl|CDD|179937 PRK05101, PRK05101, galactokinase; Provisional.
Length = 382
Score = 28.4 bits (64), Expect = 7.5
Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 13/48 (27%)
Query: 220 GTPDPMISAL-----------RHLSNISIVNDLPKSIAILGINNNFKK 256
G D +ISAL R L ++ +P+ +A++ IN+N K+
Sbjct: 171 GIMDQLISALGKKDHALLIDCRSLETKAVP--MPEGVAVVIINSNVKR 216
>gnl|CDD|215729 pfam00117, GATase, Glutamine amidotransferase class-I.
Length = 186
Score = 27.6 bits (62), Expect = 9.3
Identities = 13/66 (19%), Positives = 22/66 (33%), Gaps = 3/66 (4%)
Query: 190 IFLGFGASIVVAWFSRNR--EFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAI 247
+ L FG ++ A + H G P+ I H S + LP + +
Sbjct: 83 LALAFGGKVIKAKKEPHHGKNSPVGHDKGLFYGLPNVFIVRRYH-SYAVDPDTLPDGLEV 141
Query: 248 LGINNN 253
+ N
Sbjct: 142 TAASEN 147
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.324 0.138 0.392
Gapped
Lambda K H
0.267 0.0736 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,579,792
Number of extensions: 2276173
Number of successful extensions: 3718
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3667
Number of HSP's successfully gapped: 114
Length of query: 436
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 336
Effective length of database: 6,502,202
Effective search space: 2184739872
Effective search space used: 2184739872
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.5 bits)
S2: 60 (26.9 bits)