RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy5221
         (436 letters)



>gnl|CDD|235478 PRK05457, PRK05457, heat shock protein HtpX; Provisional.
          Length = 284

 Score =  345 bits (887), Expect = e-118
 Identities = 137/276 (49%), Positives = 183/276 (66%), Gaps = 18/276 (6%)

Query: 2   LSLLGINYFFYKIGLNIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTE 61
           LSLLG+  +     LN+  L + + +FGF GS ISLL+SK +AK S    VI+ P N TE
Sbjct: 22  LSLLGVQSY-----LNLGGLLVFAAVFGFGGSFISLLMSKWMAKRSTGAEVIEQPRNETE 76

Query: 62  DWLLKTVNKISKNANISTPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKA 120
            WL++TV + ++ A I  PEVAIY+  EINAFATGA KN+SL+A+S  LL++M++ E++A
Sbjct: 77  RWLVETVARQARQAGIGMPEVAIYHSPEINAFATGASKNNSLVAVSTGLLQNMSRDEVEA 136

Query: 121 IIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKI 180
           ++AHEISHI+NGDMVTM LIQGV+NTF+I +SRI + I+D F+         N   N   
Sbjct: 137 VLAHEISHIANGDMVTMTLIQGVVNTFVIFLSRIIAQIVDRFVSG-------NEEGNGIG 189

Query: 181 YGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVND 240
           Y +++ +L+I  G  ASI+V WFSR+REFRAD G AKL G  + MI+AL+ L        
Sbjct: 190 YFIVSIVLEIVFGILASIIVMWFSRHREFRADAGGAKLAG-REKMIAALQRLKTSYE-PQ 247

Query: 241 LPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAIL 276
           LP S+A  GIN    K    +LF +HP  EKRIA L
Sbjct: 248 LPGSMAAFGING---KSGLSELFMSHPPLEKRIAAL 280


>gnl|CDD|182691 PRK10743, PRK10743, heat shock protein IbpA; Provisional.
          Length = 137

 Score =  161 bits (408), Expect = 3e-48
 Identities = 66/132 (50%), Positives = 93/132 (70%)

Query: 296 FAPFYRSTIGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIET 355
            +P YRS IGFDRL  + +N ++++   YPPYN+ELVDEN YRI +A+AGF  S+LEI  
Sbjct: 6   LSPLYRSAIGFDRLFNLLENNQSQSNGGYPPYNVELVDENHYRIAIAVAGFAESELEITA 65

Query: 356 ERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIREV 415
           + + L + G    + ++R +L++GIA RNFE +FQLA+++HV  A    G+L IDL R +
Sbjct: 66  QDNLLVVKGAHADEQKERTYLYQGIAERNFERKFQLAENIHVRGANLVNGLLYIDLERVI 125

Query: 416 PEALKPRKIYIN 427
           PEA KPR+I IN
Sbjct: 126 PEAKKPRRIEIN 137


>gnl|CDD|107227 cd06470, ACD_IbpA-B_like, Alpha-crystallin domain (ACD) found in
           Escherichia coli inclusion body-associated proteins IbpA
           and IbpB, and similar proteins.  IbpA and IbpB are 16
           kDa small heat shock proteins (sHsps). sHsps are
           molecular chaperones that suppress protein aggregation
           and protect against cell stress, and are generally
           active as large oligomers consisting of multiple
           subunits. IbpA and IbpB are produced during high-level
           production of various heterologous proteins,
           specifically human prorenin, renin and bovine
           insulin-like growth factor 2 (bIGF-2), and are strongly
           associated with inclusion bodies containing these
           heterologous proteins. IbpA and IbpB work as an
           integrated system to stabilize thermally aggregated
           proteins in a disaggregation competent state.  The
           chaperone activity of IbpB is also significantly
           elevated as the temperature increases from normal to
           heat shock. The high temperature results in the
           disassociation of 2-3-MDa IbpB oligomers into smaller
           approximately 600-kDa structures. This elevated activity
           seen under heat shock conditions is retained for an
           extended period of time after the temperature is
           returned to normal. IbpA also forms multimers.
          Length = 90

 Score =  141 bits (357), Expect = 3e-41
 Identities = 50/90 (55%), Positives = 61/90 (67%), Gaps = 1/90 (1%)

Query: 325 PPYNIELVDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQ-QRNFLHRGIATR 383
           PPYNIE   EN YRIT+A+AGF+  DLEIE E + L +TG+K  +   +R +LHRGIA R
Sbjct: 1   PPYNIEKTGENNYRITLAVAGFSEDDLEIEVENNQLTVTGKKADEENEEREYLHRGIAKR 60

Query: 384 NFEHRFQLADHVHVVAAFFDQGILNIDLIR 413
            FE  F LADHV V  A  + G+L IDL R
Sbjct: 61  AFERSFNLADHVKVKGAELENGLLTIDLER 90


>gnl|CDD|223575 COG0501, HtpX, Zn-dependent protease with chaperone function
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 302

 Score =  146 bits (370), Expect = 1e-40
 Identities = 74/287 (25%), Positives = 130/287 (45%), Gaps = 33/287 (11%)

Query: 1   MLSLLGINYFFYKIGLN----IKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNP 56
           +L  LG +      G+     +  L + +++  F   +ISLL SK +   S+  + +  P
Sbjct: 31  LLLALGPSLLLLLGGIAAASALGRLLLIALLLAFAALLISLLFSKALVLKSLGALTLSEP 90

Query: 57  ---SNFTEDWLLKTVNKISKNANI-STPEVAIYNG-EINAFATGAFKNSSLIAISANLLE 111
              +      LL  V ++++ A I   PEV I    + NAFA G    +  + ++  LL+
Sbjct: 91  ILLAPRLYAVLLLKVAELARQAGIPHMPEVYILETPQPNAFALGGGPKNGRVVVTTGLLD 150

Query: 112 SMTKLEIKAIIAHEISHISNGDMVT-MLLIQGVINTFIILISRIFSYILDCFLLNSYDDS 170
            +   E++A++AHE+ HI N   +  + L   + + F++L +   +  L           
Sbjct: 151 LLNDDELEAVLAHELGHIKNRHTLVRLTLRGLLASAFVLLATLALAAGLL---------- 200

Query: 171 NYNSNYNHKIYGLITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALR 230
                   +    +  +L +   F A+++V  FSR RE+ AD  +AKL G P+ + SAL+
Sbjct: 201 -------GEAALALLLLLLLLALFLATLLVLAFSRKREYEADRFAAKLTG-PEKLASALQ 252

Query: 231 HLSNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAILS 277
            L+ +S   +         I + F       LF+THP   +RIA L 
Sbjct: 253 KLARLSGRANSK-----AFIASGFSGGRLQALFSTHPPLAERIAALR 294


>gnl|CDD|183223 PRK11597, PRK11597, heat shock chaperone IbpB; Provisional.
          Length = 142

 Score =  139 bits (353), Expect = 5e-40
 Identities = 56/130 (43%), Positives = 81/130 (62%), Gaps = 2/130 (1%)

Query: 297 APFYRSTIGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIETE 356
           +P  R  IGFD+LA    N      QS+PPYNIE  D+N YRIT+A+AGF + DL+I+ E
Sbjct: 7   SPLLRQWIGFDKLANALQN--AGESQSFPPYNIEKSDDNHYRITLALAGFRQEDLDIQLE 64

Query: 357 RDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQGILNIDLIREVP 416
              L + G  +   ++  +LH+G+  + F   F LA+++ V  A F  G+L+IDLIR  P
Sbjct: 65  GTRLTVKGTPEQPEKEVKWLHQGLVNQPFSLSFTLAENMEVSGATFVNGLLHIDLIRNEP 124

Query: 417 EALKPRKIYI 426
           EA+ P++I I
Sbjct: 125 EAIAPQRIAI 134


>gnl|CDD|223149 COG0071, IbpA, Molecular chaperone (small heat shock protein)
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 146

 Score =  125 bits (315), Expect = 2e-34
 Identities = 47/136 (34%), Positives = 72/136 (52%), Gaps = 5/136 (3%)

Query: 296 FAPFYRSTIGFDRLAQIFDNVRTEAQQSYPPYNIELVDENKYRITMAIAGFNRSDLEIET 355
           F P  R + GFDRL + F N+        PP +IE  D+  YRIT  + G ++ D+EI  
Sbjct: 12  FFPLLRDSPGFDRLFREFGNLPESRPTGTPPVDIEETDDE-YRITAELPGVDKEDIEITV 70

Query: 356 ERDTLRITGRK--QADTQQRNFLHRGIATRNFEHRFQLADHV--HVVAAFFDQGILNIDL 411
           E +TL I G +  + + ++  +L R  A   FE  F+L + V   V+ A +  G+L + L
Sbjct: 71  EGNTLTIRGEREEEEEEEEEGYLRRERAYGEFERTFRLPEKVDPEVIKAKYKNGLLTVTL 130

Query: 412 IREVPEALKPRKIYIN 427
            +  PE  KP++I I 
Sbjct: 131 PKAEPEEKKPKRIEIE 146


>gnl|CDD|235186 PRK03982, PRK03982, heat shock protein HtpX; Provisional.
          Length = 288

 Score =  122 bits (307), Expect = 9e-32
 Identities = 87/290 (30%), Positives = 140/290 (48%), Gaps = 39/290 (13%)

Query: 4   LLGINYFFYKIGLNIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTED- 62
           L  I Y      L   I  I +I+     ++IS   S KI   S N  ++      +E+ 
Sbjct: 18  LYAIGYL-----LGGSIGPIIAILLALIPNLISYYYSDKIVLASYNARIV------SEEE 66

Query: 63  --WLLKTVNKISKNANISTPEVAIYNGEI-NAFATGAFKNSSLIAISANLLESMTKLEIK 119
              L + V ++++ ANI  P+VAI   +  NAFATG     +++A++  +L  + + E++
Sbjct: 67  APELYRIVERLAERANIPKPKVAIVPTQTPNAFATGRDPKHAVVAVTEGILNLLNEDELE 126

Query: 120 AIIAHEISHISNGDMVTMLLIQGVINTF---IILISRIFSYILDCFLLNSYDDSNYNSNY 176
            +IAHE++HI N D     LIQ +  T    I+ +++  S+ L  +      D     N 
Sbjct: 127 GVIAHELTHIKNRD----TLIQTIAATLAGAIMYLAQWLSWGL--WFGGGGRDDRNGGN- 179

Query: 177 NHKIYGLITTILQIFLG-FGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHL--- 232
                  I ++L I L    A+++    SR REF AD G A+L G P  + +AL+ L   
Sbjct: 180 ------PIGSLLLIILAPIAATLIQFAISRQREFSADEGGARLTGNPLALANALQKLEKG 233

Query: 233 -SNISIVNDLPKSIAILGINNNFKKKSFLKLFATHPSFEKRIAILSLLKL 281
              I + N  P + A + I N F+ +    LF+THP  E+RI    LL++
Sbjct: 234 VRYIPLKNGNP-ATAHMFIINPFRGQFLANLFSTHPPTEERIE--RLLEM 280


>gnl|CDD|235318 PRK04897, PRK04897, heat shock protein HtpX; Provisional.
          Length = 298

 Score =  120 bits (304), Expect = 4e-31
 Identities = 68/215 (31%), Positives = 109/215 (50%), Gaps = 3/215 (1%)

Query: 64  LLKTVNKISKNANISTPEVAIYNGE-INAFATGAFKNSSLIAISANLLESMTKLEIKAII 122
           L   V  ++  A I  P V I +    NAFATG+   ++ +A++  LL  M + E++ +I
Sbjct: 82  LWHIVEDMAMVAQIPMPRVFIIDDPSPNAFATGSSPKNAAVAVTTGLLAIMNREELEGVI 141

Query: 123 AHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYG 182
            HEISHI N D + +  I   + + I L+S I   ++     +   D + +      I  
Sbjct: 142 GHEISHIRNYD-IRLSTIAVALASAITLLSDIAGRMMWWGGGSRRRDDDRDGGGLQIILL 200

Query: 183 LITTILQIFLGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNIS-IVNDL 241
           +++ +L I     A+++    SR RE+ AD  S +L   P  +ISAL  +SN   + + +
Sbjct: 201 IVSLLLLILAPLAATLIQLAISRQREYLADASSVELTRNPQGLISALEKISNSQPMKHPV 260

Query: 242 PKSIAILGINNNFKKKSFLKLFATHPSFEKRIAIL 276
             + A L I++  KKK   KLF THP  E+RI  L
Sbjct: 261 DDASAALYISDPLKKKGLSKLFDTHPPIEERIERL 295


>gnl|CDD|179418 PRK02391, PRK02391, heat shock protein HtpX; Provisional.
          Length = 296

 Score =  111 bits (279), Expect = 1e-27
 Identities = 77/293 (26%), Positives = 126/293 (43%), Gaps = 44/293 (15%)

Query: 4   LLGINYFFYKIGLNIKILTISSIIFGFTG--SIISLLLSKKIAKWSMNLMVIKNPSNFTE 61
           LL   Y  + + + I +     +I    G   +     S K+A WSM   ++ +   + E
Sbjct: 20  LLFALYLVF-VAVLIALGVSLVLIVVIAGGFLLAQYFFSDKLALWSMGARIV-SEDEYPE 77

Query: 62  DWLLKTVNKISKNANISTPEVAIYNGEI-NAFATGAFKNSSLIAISANLLESMTKLEIKA 120
             L   V ++   A++  P VA+ + ++ NAFATG    ++++ ++  L+  +   E++A
Sbjct: 78  --LHAMVERLCALADLPKPRVAVADSDVPNAFATGRSPKNAVVCVTTGLMRRLDPDELEA 135

Query: 121 IIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKI 180
           ++AHE+SH+ N D+  M      I +F   +S I   I+                     
Sbjct: 136 VLAHELSHVKNRDVAVM-----TIASF---LSTIAFLIVRWGFYFGGFGGRGGGG----- 182

Query: 181 YGLITTILQIFLGFGASIVVAW---------FSRNREFRADHGSAKLLGTPDPMISALRH 231
                    I +    S+VV W          SR REF AD G+A + G P  + SAL  
Sbjct: 183 -----GGGGILVVILVSLVV-WAISFLLIRALSRYREFAADRGAAIITGRPSALASALMK 236

Query: 232 LSNISIVNDLPKS--IAILGINNNF-----KKKSFLKLFATHPSFEKRIAILS 277
           +S    ++ +P        G+N  F        S  +LF+THP  EKRIA L 
Sbjct: 237 ISG--RMDRVPTEDLREAEGMNAFFIIPALSGGSLGRLFSTHPPLEKRIAQLE 287


>gnl|CDD|235081 PRK02870, PRK02870, heat shock protein HtpX; Provisional.
          Length = 336

 Score =  102 bits (257), Expect = 2e-24
 Identities = 71/274 (25%), Positives = 130/274 (47%), Gaps = 30/274 (10%)

Query: 22  TISSIIFGFTGSIISLLLSK--KIAKWSMNLMVI--KNPSNFTEDWLLKTVNKISKNANI 77
           T + I+       I +      KI         I  +N  +  E  L   V ++   A +
Sbjct: 71  TATLIMSLVAVISILVTFQNFDKIMLSGTEYKEITPENALSLQERQLYNVVEELLVAAGL 130

Query: 78  S-TPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMV 135
              P+V I +   +NAFA+G  + S+++AI+  LLE + + E++A++AHE+SHI +GD +
Sbjct: 131 RFMPKVYIIDAPYMNAFASGYSEKSAMVAITTGLLEKLDRDELQAVMAHELSHIRHGD-I 189

Query: 136 TMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLGFG 195
            + L  GV++  +++++    Y      + +  +S  N         +I  IL+  L   
Sbjct: 190 RLTLCVGVLSNIMLIVADFLFY----SFMGNRRNSGANRAR------MIILILRYVLPIL 239

Query: 196 ASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGINNNFK 255
             +++ + SR RE+ AD G+ +L+   +PM  AL+ +SN    ND   +      + + +
Sbjct: 240 TVLLMLFLSRTREYMADAGAVELMRDNEPMARALQKISNDHAQNDEQYAYKHTD-HESTR 298

Query: 256 KKSFL------------KLFATHPSFEKRIAILS 277
           + ++L              F+THPS E R+A L 
Sbjct: 299 RAAYLFDPAGISPGSLSDAFSTHPSIENRLAALG 332


>gnl|CDD|179524 PRK03001, PRK03001, M48 family peptidase; Provisional.
          Length = 283

 Score =  101 bits (254), Expect = 2e-24
 Identities = 75/280 (26%), Positives = 127/280 (45%), Gaps = 33/280 (11%)

Query: 23  ISSIIFGFTGSIISLLLSKKI---AKWSMNLMVIK-------NPSNFTEDWLLKTVNKIS 72
           I  +I G  G +I+LL +  +   + W  + MV+K       + +   +    + V +++
Sbjct: 20  IGGMIGGSQGMLIALLFALGMNFFSYWFSDKMVLKMYNAQEVDENTAPQ--FYRMVRELA 77

Query: 73  KNANISTPEVAIYNGEI-NAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISN 131
           + A +  P+V + N +  NAFATG     + +A +  +L  +++ EI+ ++AHE++H+ +
Sbjct: 78  QRAGLPMPKVYLINEDQPNAFATGRNPEHAAVAATTGILRVLSEREIRGVMAHELAHVKH 137

Query: 132 GDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIF 191
            D+    LI  +  T    IS      L  F +        N    + I G+   IL   
Sbjct: 138 RDI----LISTISATMAGAIS-----ALANFAMFFGGRDE-NGRPVNPIAGIAVAIL--- 184

Query: 192 LGFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNIS------IVNDLPKSI 245
               AS++    SR REF AD G A++ G P  + SAL  +   +           P + 
Sbjct: 185 APLAASLIQMAISRAREFEADRGGARISGDPQALASALDKIHRYASGIPFQAAEAHPAT- 243

Query: 246 AILGINNNFKKKSFLKLFATHPSFEKRIAILSLLKLSLYF 285
           A + I N         LF+THPS E+RIA L  +  +  F
Sbjct: 244 AQMMIINPLSGGGLANLFSTHPSTEERIARLMAMARTGRF 283


>gnl|CDD|216503 pfam01435, Peptidase_M48, Peptidase family M48. 
          Length = 223

 Score = 94.8 bits (236), Expect = 2e-22
 Identities = 45/202 (22%), Positives = 79/202 (39%), Gaps = 26/202 (12%)

Query: 74  NANISTPEVAIYNG-EINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNG 132
            A     EV + +  + NAFA G  KN   + ++  LL ++T+ E+ A++ HEI HI   
Sbjct: 46  GAGFPPWEVYVVDSPQPNAFALGGGKNK-RVVVTTGLLLALTEDELAAVLGHEIGHIKAR 104

Query: 133 DMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFL 192
             V   + QG++   ++L+              +                     L    
Sbjct: 105 HSVES-MSQGLLLNLLLLLGAAALGGRALGFNAN-------------------GFLTALG 144

Query: 193 GFGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGINN 252
            F   +++  +SR +E+ AD   A+ LG    +  A         V  L   +A   ++ 
Sbjct: 145 IFLLQLLLLPYSRKQEYEADEAGAR-LGGDKDLARAGYKPRA--AVKFL-AKLAAENLSR 200

Query: 253 NFKKKSFLKLFATHPSFEKRIA 274
               K + +L +THP   +RI 
Sbjct: 201 VSGGKLYPELLSTHPPLVERIQ 222


>gnl|CDD|234944 PRK01345, PRK01345, heat shock protein HtpX; Provisional.
          Length = 317

 Score = 92.8 bits (231), Expect = 6e-21
 Identities = 70/268 (26%), Positives = 129/268 (48%), Gaps = 32/268 (11%)

Query: 26  IIFGFTGSIISLLLSKKI---AKWSMNLMVIK-------NPSNFTEDWLLKTVNKISKNA 75
           +I G  G +I+L+++  +   + W+ + MV++       +  +  E  L + V  +++ A
Sbjct: 23  LIGGAGGMMIALVIAAGMNLFSYWNSDKMVLRMYGAQEVDERSAPE--LYRMVRDLARRA 80

Query: 76  NISTPEVAIY-NGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDM 134
            +  P+V I  N + NAFATG    ++ +A +  LL+ ++  E+  ++AHE++H+ N D 
Sbjct: 81  GLPMPKVYIIDNPQPNAFATGRNPENAAVAATTGLLQRLSPEEVAGVMAHELAHVKNRDT 140

Query: 135 VTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGLITTILQIFLG- 193
           +TM     +  T    IS + ++    F      ++N          GL+ T+  + +  
Sbjct: 141 LTMT----ITATLAGAISMLANF---AFFFGGNRENNNGP------LGLVGTLAAMIVAP 187

Query: 194 FGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNI--SIVNDLPK---SIAIL 248
             A +V    SR RE+ AD   A++ G P  + SAL  +      + N+  +   + A +
Sbjct: 188 LAAMLVQMAISRTREYAADRRGAEICGNPLWLASALGKIERGAHGVPNEEAERNPATAHM 247

Query: 249 GINNNFKKKSFLKLFATHPSFEKRIAIL 276
            I N    +    LF+THP+ E RIA L
Sbjct: 248 FIINPLSGEGMDNLFSTHPATENRIAAL 275


>gnl|CDD|235102 PRK03072, PRK03072, heat shock protein HtpX; Provisional.
          Length = 288

 Score = 86.6 bits (215), Expect = 4e-19
 Identities = 68/280 (24%), Positives = 122/280 (43%), Gaps = 28/280 (10%)

Query: 4   LLGINYFFYKIGLNIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDW 63
           ++ I   F + GL I +L           +      S K+A  +M+   +   S      
Sbjct: 21  IVFIGALFGRTGLGIAVL------IAVGMNAYVYWNSDKLALRAMHAQPV---SEVQAPA 71

Query: 64  LLKTVNKISKNANISTPEVAIYNGEI-NAFATGAFKNSSLIAISANLLESMTKLEIKAII 122
           + + V ++S  A    P + I      NAFATG    ++ +  +  +L+ + + E++ ++
Sbjct: 72  MYRIVRELSTAARQPMPRLYISPTAAPNAFATGRNPRNAAVCCTEGILQILNERELRGVL 131

Query: 123 AHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYG 182
            HE+SH+ N D++    + G + + I  ++ +  +              +    ++    
Sbjct: 132 GHELSHVYNRDILISS-VAGALASVITYLANMAMFA-----------GMFGGRRDNDGPN 179

Query: 183 LITTILQIFLG-FGASIVVAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDL 241
            +  +L   LG   A+++    SR+RE++AD   A+L G P  + SALR +S       L
Sbjct: 180 PLALLLVSLLGPIAATVIQLAISRSREYQADESGAELTGDPLALASALRKISGGVQAAPL 239

Query: 242 PK-----SIAILGINNNFKKKSFLKLFATHPSFEKRIAIL 276
           P      S A L I N F+     +LF+THP    RIA L
Sbjct: 240 PPEPQLASQAHLMIANPFRAGGIGRLFSTHPPMADRIARL 279


>gnl|CDD|107221 cd06464, ACD_sHsps-like, Alpha-crystallin domain (ACD) of
           alpha-crystallin-type small(s) heat shock proteins
           (Hsps). sHsps are small stress induced proteins with
           monomeric masses between 12 -43 kDa, whose common
           feature is the Alpha-crystallin domain  (ACD). sHsps are
           generally active as large oligomers consisting of
           multiple subunits, and are believed to be
           ATP-independent chaperones that prevent aggregation and
           are important in refolding in combination with other
           Hsps.
          Length = 88

 Score = 72.6 bits (179), Expect = 5e-16
 Identities = 22/82 (26%), Positives = 44/82 (53%), Gaps = 3/82 (3%)

Query: 333 DENKYRITMAIAGFNRSDLEIETERDTLRITG-RKQADTQQRNFLHRGIATRNFEHRFQL 391
            ++ Y +   + GF + D+++E E   L I+G R++ + ++ N+L R  +  +F   F+L
Sbjct: 5   TDDAYVVEADLPGFKKEDIKVEVEDGVLTISGEREEEEEEEENYLRRERSYGSFSRSFRL 64

Query: 392 ADHVHV--VAAFFDQGILNIDL 411
            + V    + A  + G+L I L
Sbjct: 65  PEDVDPDKIKASLENGVLTITL 86


>gnl|CDD|234931 PRK01265, PRK01265, heat shock protein HtpX; Provisional.
          Length = 324

 Score = 72.9 bits (179), Expect = 4e-14
 Identities = 65/307 (21%), Positives = 120/307 (39%), Gaps = 39/307 (12%)

Query: 5   LGINYFFYKIGLNIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWL 64
             + Y+ +     + ++    I   F  +II  L    +   +   + +  P++    WL
Sbjct: 28  YAVAYYAFGAQFGVGLILGILIFVFFL-NIIQWLFGPYMINAAYRTVEVT-PTDPVYGWL 85

Query: 65  LKTVNKISKNANISTPEVAIYNGEI-NAFATGAFKNSSLIAISANLLESMTKLEIKAIIA 123
              V +++K   I  P+V I +    NAFA G+      IAI+  LL+ + + EIKA+  
Sbjct: 86  YSIVAEVAKYNGIRVPKVYIADVPFPNAFAYGSPIAGKRIAITLPLLKILNRDEIKAVAG 145

Query: 124 HEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYGL 183
           HE+ H+ + D V +L+  G+I T I  +   +S                N          
Sbjct: 146 HELGHLKHRD-VELLMAIGLIPTLIYYLG--YSLFWGGMFGGGGGGRGNNGGL-----LF 197

Query: 184 ITTILQIFLGFGASIVVAWFSRNREFRADHGSA---------------KLLGTPDPMI-- 226
           +  I  + + F  +++V   +R RE  AD  SA               K+  + DP    
Sbjct: 198 LIGIALMAVSFVFNLLVLSINRMREAYADVNSALTVPGGAENLQTALAKITLSMDPGALE 257

Query: 227 -----SALRHLSNISIVNDLPKSIA------ILGINNNFKKKSFLKLFATHPSFEKRIAI 275
                S    ++++   ++  + +       ++      K   +  +F+ HP   KRI +
Sbjct: 258 RFKKKSTTNQMASMLFFSNAIEEVPTWDARELVEYWKTTKVPWYADIFSDHPHPAKRIQL 317

Query: 276 LSLLKLS 282
           L  L  S
Sbjct: 318 LEKLSKS 324


>gnl|CDD|215655 pfam00011, HSP20, Hsp20/alpha crystallin family. 
          Length = 101

 Score = 67.6 bits (166), Expect = 4e-14
 Identities = 21/98 (21%), Positives = 49/98 (50%), Gaps = 2/98 (2%)

Query: 332 VDENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQL 391
            D++ + + + + GF   +L+++ E + + + G+ + + +  + L    + R+F  +F L
Sbjct: 4   EDKDAFVVKLDVPGFKPEELKVKVEDNRVLVKGKHEKEEEDDHGLRSERSYRSFSRKFVL 63

Query: 392 ADHVHV--VAAFFDQGILNIDLIREVPEALKPRKIYIN 427
            ++     V A    G+L + + +  P   KPR+I I 
Sbjct: 64  PENADPDKVKASLKDGVLTVTVPKLEPPEKKPRRIQIQ 101


>gnl|CDD|107219 cd00298, ACD_sHsps_p23-like, This domain family includes the
           alpha-crystallin domain (ACD) of alpha-crystallin-type
           small heat shock proteins (sHsps) and a similar domain
           found in p23-like proteins.  sHsps are small stress
           induced proteins with monomeric masses between 12 -43
           kDa, whose common feature is this ACD. sHsps are
           generally active as large oligomers consisting of
           multiple subunits, and are believed to be
           ATP-independent chaperones that prevent aggregation and
           are important in refolding in combination with other
           Hsps. p23 is a cochaperone of the Hsp90 chaperoning
           pathway. It binds Hsp90 and participates in the folding
           of a number of Hsp90 clients including the progesterone
           receptor. p23 also has a passive chaperoning activity.
           p23 in addition may act as the cytosolic prostaglandin
           E2 synthase. Included in this family is the p23-like
           C-terminal CHORD-SGT1 (CS) domain of suppressor of G2
           allele of Skp1 (Sgt1) and  the p23-like domains of human
           butyrate-induced transcript 1 (hB-ind1), NUD (nuclear
           distribution) C, Melusin, and NAD(P)H cytochrome b5
           (NCB5) oxidoreductase (OR).
          Length = 80

 Score = 53.0 bits (128), Expect = 4e-09
 Identities = 19/81 (23%), Positives = 41/81 (50%), Gaps = 8/81 (9%)

Query: 333 DENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLA 392
            +++  +T+ + G  + D+++E E + L I+G+++ + +      R  +   FE  F+L 
Sbjct: 4   TDDEVVVTVDLPGVKKEDIKVEVEDNVLTISGKREEEEE------RERSYGEFERSFELP 57

Query: 393 DHVHV--VAAFFDQGILNIDL 411
           + V      A  + G+L I L
Sbjct: 58  EDVDPEKSKASLENGVLEITL 78


>gnl|CDD|218642 pfam05569, Peptidase_M56, BlaR1 peptidase M56.  Production of
           beta-Lactamase and penicillin-binding protein 2a (which
           mediate staphylococcal resistance to beta-lactam
           antibiotics) is regulated by a signal-transducing
           integral membrane protein and a transcriptional
           repressor. The signal transducer is a fusion protein
           with penicillin-binding and zinc metalloprotease
           domains. The signal for protein expression is
           transmitted by site-specific proteolytic cleavage of
           both the transducer, which auto-activates, and the
           repressor, which is inactivated, unblocking gene
           transcription. Homologues to this peptidase domain,
           which corresponds to Merops family M56, are also found
           in a number of other bacterial genome sequences.
          Length = 299

 Score = 43.2 bits (102), Expect = 1e-04
 Identities = 20/84 (23%), Positives = 37/84 (44%), Gaps = 13/84 (15%)

Query: 60  TEDWLLKTVNKISKNANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIK 119
            ED   K    I  ++ I +P V            G +K    I + A+    ++  E+ 
Sbjct: 151 KEDMGYKRNIVIRLSSTIDSPAV-----------FGLWKP--RIVLPADFDTRLSDEELD 197

Query: 120 AIIAHEISHISNGDMVTMLLIQGV 143
            I+ HE++H+  GD++  LL+  +
Sbjct: 198 YILLHELAHLKRGDLIINLLVAVL 221


>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
           repeats [General function prediction only].
          Length = 484

 Score = 39.7 bits (93), Expect = 0.003
 Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 12/78 (15%)

Query: 61  EDWLLKTVNKISKNANISTPEVAIY---NGEINAFAT-GA--FKNSSLIAISANLLESMT 114
           E+++     +++  A++       +   +  INAFAT G     N+ L+  + N  ES  
Sbjct: 73  EEYVNSLGQRLAAAADLVKTPFTFFLVNDDSINAFATPGGYVVVNTGLLLTAEN--ES-- 128

Query: 115 KLEIKAIIAHEISHISNG 132
             E+  +IAHEI H++  
Sbjct: 129 --ELAGVIAHEIGHVAQR 144


>gnl|CDD|226672 COG4219, MecR1, Antirepressor regulating drug resistance, predicted
           signal transduction N-terminal membrane component
           [Transcription / Signal transduction mechanisms].
          Length = 337

 Score = 39.0 bits (91), Expect = 0.003
 Identities = 17/55 (30%), Positives = 31/55 (56%), Gaps = 8/55 (14%)

Query: 103 IAISANLLESMTKLEIKAIIAHEISHISNGDMVTMLLIQGVINTFIILISRIFSY 157
           I + A+ +E +T  E+K II HE+SH+  GD         +IN  ++++  +F +
Sbjct: 175 IVLPADFVERLTDEELKYIILHELSHLKRGDA--------IINLIVVVLGVLFWF 221


>gnl|CDD|107247 cd06526, metazoan_ACD, Alpha-crystallin domain (ACD) of metazoan
           alpha-crystallin-type small(s) heat shock proteins
           (Hsps). sHsps are small stress induced proteins with
           monomeric masses between 12 -43 kDa, whose common
           feature is the Alpha-crystallin domain  (ACD). sHsps are
           generally active as large oligomers consisting of
           multiple subunits, and are believed to be
           ATP-independent chaperones that prevent aggregation and
           are important in refolding in combination with other
           Hsps.
          Length = 83

 Score = 36.0 bits (84), Expect = 0.004
 Identities = 18/80 (22%), Positives = 40/80 (50%), Gaps = 8/80 (10%)

Query: 333 DENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQLA 392
           D+ K+++T+ + GF   +L+++   + L + G+ +    +  ++      R F  R+QL 
Sbjct: 5   DDEKFQVTLDVKGFKPEELKVKVSDNKLVVEGKHEEREDEHGYVS-----REFTRRYQLP 59

Query: 393 DHV---HVVAAFFDQGILNI 409
           + V    V ++    G+L I
Sbjct: 60  EGVDPDSVTSSLSSDGVLTI 79


>gnl|CDD|107228 cd06471, ACD_LpsHSP_like, Group of bacterial proteins containing an
           alpha crystallin domain (ACD) similar to Lactobacillus
           plantarum (Lp) small heat shock proteins (sHsp) HSP
           18.5, HSP 18.55 and HSP 19.3. sHsps are molecular
           chaperones that suppress protein aggregation and protect
           against cell stress, and are generally active as large
           oligomers consisting of multiple subunits. Transcription
           of the genes encoding Lp HSP 18.5, 18.55 and 19.3 is
           regulated by a variety of stresses including heat, cold
           and ethanol. Early growing L. plantarum cells contain
           elevated levels of these mRNAs which rapidly fall of as
           the cells enter stationary phase. Also belonging to this
           group is Bifidobacterium breve (Bb) HSP20 and Oenococcus
           oenis (syn. Leuconostoc oenos) (Oo) HSP18.
           Transcription of the gene encoding BbHSP20 is strongly
           induced following heat or osmotic shock, and that of the
           gene encoding OoHSP18 following heat, ethanol or acid
           shock. OoHSP18 is peripherally associated with the
           cytoplasmic membrane.
          Length = 93

 Score = 34.4 bits (80), Expect = 0.017
 Identities = 17/87 (19%), Positives = 40/87 (45%), Gaps = 11/87 (12%)

Query: 333 DENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQR----NFLHRGIATRNFEHR 388
            +++Y +   + GF + D++++ +   L I+ ++     ++    N++ R     +F   
Sbjct: 8   TDDEYIVEADLPGFKKEDIKLDYKDGYLTISAKRDESKDEKDKKGNYIRRERYYGSFSRS 67

Query: 389 FQLADHVHVVA----AFFDQGILNIDL 411
           F L +   V      A ++ G+L I L
Sbjct: 68  FYLPN---VDEEEIKAKYENGVLKITL 91


>gnl|CDD|107236 cd06481, ACD_HspB9_like, Alpha crystallin domain (ACD) found in
           mammalian small heat shock protein (sHsp) HspB9 and
           similar proteins. sHsps are molecular chaperones that
           suppress protein aggregation and protect against cell
           stress, and are generally active as large oligomers
           consisting of multiple subunits. Human (h) HspB9 is
           expressed exclusively in the normal testis and in
           various tumor samples and is a cancer/testis antigen.
           hHspB9  interacts with TCTEL1 (T-complex testis
           expressed protein -1), a subunit of dynein. hHspB9 and
           TCTEL1 are co-expressed in similar cells within the
           testis and in tumor cells. Included in this group is
           Xenopus Hsp30, a developmentally-regulated
           heat-inducible molecular chaperone.
          Length = 87

 Score = 31.2 bits (71), Expect = 0.18
 Identities = 13/61 (21%), Positives = 31/61 (50%), Gaps = 5/61 (8%)

Query: 337 YRITMAIAGFNRSDLEIETERDTLRITGRKQ--ADTQQRNFLHRGIATRNFEHRFQLADH 394
           + + + + GF+  DL +  +   L +TG+++   + ++ +F +     + F    QL +H
Sbjct: 9   FSLKLDVRGFSPEDLSVRVDGRKLVVTGKREKKNEDEKGSFSYE---YQEFVREAQLPEH 65

Query: 395 V 395
           V
Sbjct: 66  V 66


>gnl|CDD|189002 cd09595, M1, Peptidase M1 family contains aminopeptidase N and
           leukotriene A4 hydrolase.  M1 Peptidase family includes
           aminopeptidase N (APN) and leukotriene A4 hydrolase
           (LTA4H).  All peptidases in this family bind a single
           catalytic zinc ion which is tetrahedrally co-ordinated
           by three amino acid ligands and a water molecule that
           forms the nucleophile on activation during catalysis.
           APN consists of a small N-terminal cytoplasmic domain, a
           single transmembrane domain and a large extracellular
           ectodomain that contains the active site. It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and is present in a variety
           of human tissues and cell types. APN expression is
           dysregulated in many inflammatory diseases and is
           enhanced in numerous tumor cells, making it a lead
           target in the development of anti-cancer and
           anti-inflammatory drugs. LTA4H is a bifunctional enzyme,
           possessing an aminopeptidase as well as an epoxide
           hydrolase activity. The two activities occupy different,
           but overlapping sites. The activity and physiological
           relevance of the aminopeptidase in LTA4H is as yet
           unknown while the epoxide hydrolase converts leukotriene
           A4 (LTA4) into leukotriene B4 (LTB4), a potent
           chemotaxin that is fundamental to the inflammatory
           response of mammals.
          Length = 407

 Score = 33.2 bits (76), Expect = 0.25
 Identities = 18/53 (33%), Positives = 29/53 (54%), Gaps = 3/53 (5%)

Query: 88  EINAFATGAFKNSSLIAISANLL---ESMTKLEIKAIIAHEISHISNGDMVTM 137
           E+  F +GA +N  LI  S +LL          ++ +IAHE++H   G++VT 
Sbjct: 235 EVPEFPSGAMENPGLIFFSQSLLLAMIDAGDELLENVIAHELAHQWFGNLVTG 287


>gnl|CDD|223385 COG0308, PepN, Aminopeptidase N [Amino acid transport and
           metabolism].
          Length = 859

 Score = 32.4 bits (74), Expect = 0.46
 Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 7/53 (13%)

Query: 92  FATGAFKNSSLIAISANLL-------ESMTKLEIKAIIAHEISHISNGDMVTM 137
           F+ GA +N  L+      L              ++ +IAHE++H   G++VTM
Sbjct: 274 FSAGAMENWGLVTFREKYLLADPETATDSDYENVEEVIAHELAHQWFGNLVTM 326


>gnl|CDD|153290 cd07606, BAR_SFC_plant, The Bin/Amphiphysin/Rvs (BAR) domain of the
           plant protein SCARFACE (SFC).  BAR domains are
           dimerization, lipid binding and curvature sensing
           modules found in many different proteins with diverse
           functions including organelle biogenesis, membrane
           trafficking or remodeling, and cell division and
           migration. The plant protein SCARFACE (SFC), also called
           VAscular Network 3 (VAN3), is a plant ACAP (ArfGAP with
           Coiled-coil, ANK repeat and PH domain containing
           protein), an Arf GTPase Activating Protein (GAP) that
           plays a role in the trafficking of auxin efflux
           regulators from the plasma membrane to the endosome. It
           is required for the normal vein patterning in leaves.
           SCF contains an N-terminal BAR domain, followed by a
           Pleckstrin Homology (PH) domain, an Arf GAP domain, and
           C-terminal ankyrin (ANK) repeats. BAR domains form
           dimers that bind to membranes, induce membrane bending
           and curvature, and may also be involved in
           protein-protein interactions.
          Length = 202

 Score = 31.3 bits (71), Expect = 0.74
 Identities = 22/101 (21%), Positives = 40/101 (39%), Gaps = 2/101 (1%)

Query: 307 DRLAQIFDNVRTEAQQSYPPYNIELVDENKYRIT-MAIAGFNRSDLEIETERDTLRI-TG 364
           DRLAQ  D    E + +   ++   +D  + R   +++    + ++    E D     + 
Sbjct: 92  DRLAQFADTDLQEVKDARRRFDKASLDYEQARSKFLSLTKDAKPEILAAAEEDLGTTRSA 151

Query: 365 RKQADTQQRNFLHRGIATRNFEHRFQLADHVHVVAAFFDQG 405
            + A     N LH   A +  E   +L+  +    AFF  G
Sbjct: 152 FETARFDLMNRLHAADARKRVEFLERLSGSMDAHLAFFKSG 192


>gnl|CDD|233857 TIGR02412, pepN_strep_liv, aminopeptidase N, Streptomyces lividans
           type.  This family is a subset of the members of the
           zinc metallopeptidase family M1 (pfam01433), with a
           single member characterized in Streptomyces lividans 66
           and designated aminopeptidase N. The spectrum of
           activity may differ somewhat from the aminopeptidase N
           clade of E. coli and most other Proteobacteria, well
           separated phylogenetically within the M1 family. The M1
           family also includes leukotriene A-4
           hydrolase/aminopeptidase (with a bifunctional active
           site).
          Length = 831

 Score = 31.7 bits (72), Expect = 0.87
 Identities = 17/51 (33%), Positives = 28/51 (54%), Gaps = 5/51 (9%)

Query: 92  FATGAFKNSSLIAISANLL--ESMTKLEIK---AIIAHEISHISNGDMVTM 137
           F  GA +N+  +  + N L     T+ E +    +I HE++H+  GD+VTM
Sbjct: 256 FNAGAMENAGCVTFAENFLHRAEATRAEKENRAGVILHEMAHMWFGDLVTM 306


>gnl|CDD|107230 cd06475, ACD_HspB1_like, Alpha crystallin domain (ACD) found in
           mammalian small (s)heat shock protein (Hsp)-27 (also
           denoted HspB1 in human) and similar proteins. sHsps are
           molecular chaperones that suppress protein aggregation
           and protect against cell stress, and are generally
           active as large oligomers consisting of multiple
           subunits. Hsp27 shows enhanced synthesis in response to
           stress. It is a molecular chaperone which interacts with
           a large number of different proteins. It is found in
           many types of human cells including breast, uterus,
           cervix, platelets and cancer cells. Hsp27 has diverse
           cellular functions including, chaperoning, regulation of
           actin polymerization, keratinocyte differentiation,
           regulation of inflammatory pathways in keratinocytes,
           and protection from oxidative stress through modulating
           glutathione levels. It is also a subunit of
           AUF1-containing protein complexes. It has been linked to
           several transduction pathways regulating cellular
           functions including differentiation, cell growth,
           development, and apoptosis. Its activity can be
           regulated by phosphorylation. Its unphosphorylated state
           is a high molecular weight aggregated form (100-800kDa)
           composed of up to 24 subunits, which forms as a result
           of multiple interactions within the ACD, and is required
           for chaperone function and resistance to oxidative
           stress. Upon phosphorylation these large aggregates
           rapidly disassociate to smaller oligomers and chaperone
           activity is modified.  High constitutive levels of Hsp27
           have been detected in various cancer cells, in
           particular those of carcinoma origin. Over-expression of
           Hsp27 has a protective effect against various
           diseases-processes, including Huntington's disease.
           Mutations in Hsp27 have been associated with a form of
           distal hereditary motor neuropathy type II and
           Charcot-Marie-Tooth disease type 2.
          Length = 86

 Score = 29.4 bits (66), Expect = 0.94
 Identities = 16/81 (19%), Positives = 40/81 (49%), Gaps = 8/81 (9%)

Query: 333 DENKYRITMAIAGFNRSDLEIETERDTLRITGRKQADTQQRNFLHRGIATRNFEHRFQL- 391
             +++++++ +  F   +L ++T+   + ITG+     +++   H G  +R F  ++ L 
Sbjct: 8   TADRWKVSLDVNHFAPEELVVKTKDGVVEITGKH----EEKQDEH-GFVSRCFTRKYTLP 62

Query: 392 --ADHVHVVAAFFDQGILNID 410
              D   V ++    GIL ++
Sbjct: 63  PGVDPTAVTSSLSPDGILTVE 83


>gnl|CDD|189008 cd09601, M1_APN_2, Peptidase M1 Aminopeptidase N family incudes
           tricorn interacting factor F3, Endoplasmic reticulum
           aminopeptidase 1 (ERAP1), Aminopeptidase Q (APQ).  This
           M1 peptidase family includes eukaryotic and bacterial
           members: aminopeptidase N (APN), aminopeptidase Q (APQ,
           laeverin), endoplasmic reticulum aminopeptidase 1
           (ERAP1) as well as tricorn interacting factor F3.
           Aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease,
           consists of a small N-terminal cytoplasmic domain, a
           single transmembrane domain and a large extracellular
           ectodomain that contains the active site. It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and is present in a variety
           of human tissues and cell types (leukocyte, fibroblast,
           endothelial and epithelial cells). APN expression is
           dysregulated in inflammatory diseases such as chronic
           pain, rheumatoid arthritis, multiple sclerosis, systemic
           sclerosis, systemic lupus erythematosus,
           polymyositis/dermatomyosytis and pulmonary sarcoidosis,
           and is enhanced in tumor cells such as melanoma, renal,
           prostate, pancreas, colon, gastric and thyroid cancers.
           It is considered a marker of differentiation since it is
           predominantly expressed on stem cells and on cells of
           the granulocytic and monocytic lineages at distinct
           stages of differentiation. Thus, APN inhibition may lead
           to the development of anti-cancer and anti-inflammatory
           drugs. ERAP1 also known as endoplasmic reticulum
           aminopeptidase associated with antigen processing
           (ERAAP), adipocyte derived leucine aminopeptidase
           (A-LAP) or aminopeptidase regulating tumor necrosis
           factor receptor I (THFRI) shedding (ARTS-1), associates
           with the closely related ER aminopeptidase ERAP2, for
           the final trimming of peptides within the ER for
           presentation by MHC class I molecules. ERAP1 is
           associated with ankylosing spondylitis (AS), an
           inflammatory arthritis that predominantly affects the
           spine. ERAP1 also aids in the shedding of membrane-bound
           cytokine receptors. The tricorn interacting factor F3,
           together with factors F1 and F2, degrades the tricorn
           protease products, producing free amino acids, thus
           completing the proteasomal degradation pathway. F3 is
           homologous to F2, but not F1, and shows a strong
           preference for glutamate in the P1' position. APQ, also
           known as laeverin, is specifically expressed in human
           embryo-derived extravillous trophoblasts (EVTs) that
           invade the uterus during early placentation. It cleaves
           the N-terminal amino acid of various peptides such as
           angiotensin III, endokinin C, and kisspeptin-10, all
           expressed in the placenta in large quantities. APN is a
           receptor for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs are also putative Cry toxin
           receptors. Cry1 proteins are pore-forming toxins that
           bind to the midgut epithelial cell membrane of
           susceptible insect larvae, causing extensive damage.
           Several different toxins, including Cry1Aa, Cry1Ab,
           Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have been shown to
           bind to APNs; however, a direct role of APN in
           cytotoxicity has been yet to be firmly established.
          Length = 446

 Score = 31.4 bits (72), Expect = 1.0
 Identities = 20/63 (31%), Positives = 30/63 (47%), Gaps = 12/63 (19%)

Query: 82  VAIYNGEINAFATGAFKNSSLIAISANLL-------ESMTKLEIKAIIAHEISHISNGDM 134
           VAI +     FA GA +N  LI      L        +  K  +  ++AHE++H   G++
Sbjct: 248 VAIPD-----FAAGAMENWGLITYRETALLYDPKTSSASNKQRVATVVAHELAHQWFGNL 302

Query: 135 VTM 137
           VTM
Sbjct: 303 VTM 305


>gnl|CDD|227123 COG4784, COG4784, Putative Zn-dependent protease [General function
           prediction only].
          Length = 479

 Score = 31.4 bits (71), Expect = 1.1
 Identities = 20/81 (24%), Positives = 32/81 (39%), Gaps = 21/81 (25%)

Query: 200 VAWFSRNREFRADHGSAKLLGT----PDPMISALRHLSN---ISIVNDLPKSIAILGINN 252
           +A FSRN+E +AD    K+LG     P      L+ ++       V+    +   L    
Sbjct: 178 LAQFSRNQELQADAIGIKMLGEAGYDPYAAARFLQSMAAYTDFRSVS--GAADQSLD--- 232

Query: 253 NFKKKSFLKLFATHPSFEKRI 273
                      A+HP+  +RI
Sbjct: 233 ---------FLASHPNTPQRI 244


>gnl|CDD|216501 pfam01433, Peptidase_M1, Peptidase family M1.  Members of this
           family are aminopeptidases. The members differ widely in
           specificity, hydrolysing acidic, basic or neutral
           N-terminal residues. This family includes leukotriene-A4
           hydrolase, this enzyme also has an aminopeptidase
           activity.
          Length = 390

 Score = 30.7 bits (70), Expect = 1.5
 Identities = 18/53 (33%), Positives = 26/53 (49%), Gaps = 7/53 (13%)

Query: 92  FATGAFKNSSLIAISANLL-------ESMTKLEIKAIIAHEISHISNGDMVTM 137
           F+ GA +N  LI      L        S  K  +  +IAHE++H   G++VTM
Sbjct: 262 FSAGAMENWGLITYREPALLYDPGNSTSSDKQRVAEVIAHELAHQWFGNLVTM 314


>gnl|CDD|220661 pfam10263, SprT-like, SprT-like family.  This family represents a
           domain found in eukaryotes and prokaryotes. The domain
           contains a characteristic motif of the zinc
           metallopeptidases. This family includes the bacterial
           SprT protein.
          Length = 153

 Score = 29.7 bits (67), Expect = 1.7
 Identities = 9/31 (29%), Positives = 18/31 (58%)

Query: 98  KNSSLIAISANLLESMTKLEIKAIIAHEISH 128
           +NS+ I ++  LL   ++ E+   + HE+ H
Sbjct: 38  RNSAEIRLNPKLLAENSEEELIETLLHEMCH 68


>gnl|CDD|189009 cd09602, M1_APN_3, Peptidase M1 family containing Aminopeptidase N.
            This family contains bacterial and eukaryotic
           aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease
           belonging to the M1 gluzincin family. APN consists of a
           small N-terminal cytoplasmic domain, a single
           transmembrane domain and a large extracellular
           ectodomain that contains the active site.  It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and, in higher eukaryotes,
           is present in a variety of human tissues and cell types
           (leukocyte, fibroblast, endothelial and epithelial
           cells). APN expression is dysregulated in inflammatory
           diseases such as chronic pain, rheumatoid arthritis,
           multiple sclerosis, systemic sclerosis, systemic lupus
           erythematosus, polymyositis/dermatomyosytis and
           pulmonary sarcoidosis, and is enhanced in tumor cells
           such as melanoma, renal, prostate, pancreas, colon,
           gastric and thyroid cancers. It is predominantly
           expressed on stem cells and on cells of the granulocytic
           and monocytic lineages at distinct stages of
           differentiation, thus considered a marker of
           differentiation. Thus, APN inhibition may lead to the
           development of anti-cancer and anti-inflammatory drugs.
           APNs are also present in many pathogenic bacteria and
           represent potential drug targets, Some APNs have been
           used commercially, such as one from Lactococcus lactis
           used in the food industry. APN also serves as a receptor
           for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs have also been extensively studied
           as putative Cry toxin receptors. Cry1 proteins are
           pore-forming toxins that bind to the midgut epithelial
           cell membrane of susceptible insect larvae, causing
           extensive damage. Several different toxins, including
           Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
           been shown to bind to APNs; however, a direct role of
           APN in cytotoxicity has been yet to be firmly
           established.
          Length = 438

 Score = 30.7 bits (70), Expect = 1.7
 Identities = 10/17 (58%), Positives = 14/17 (82%)

Query: 121 IIAHEISHISNGDMVTM 137
            IAHE++H+  GD+VTM
Sbjct: 285 TIAHEMAHMWFGDLVTM 301


>gnl|CDD|227270 COG4934, COG4934, Predicted protease [Posttranslational modification,
            protein turnover, chaperones].
          Length = 1174

 Score = 30.9 bits (70), Expect = 1.7
 Identities = 30/176 (17%), Positives = 63/176 (35%), Gaps = 14/176 (7%)

Query: 17   NIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTEDWLLKTVNKISKNAN 76
            N K  T+S++   F  S+ +  L+ K +             NF  D L+ T N IS   N
Sbjct: 948  NSKSGTVSNLQIYFLSSVPTSGLTGKSSDG--------GIKNFVIDVLVNT-NGISAINN 998

Query: 77   ISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISHISNGDMVT 136
             +     I +      +  +      +        ++T   + + +++ I  +S+  +  
Sbjct: 999  GTGNYYVIASVSNGTISFSSQIYGKDVYNITVAEGNIT--SVNSALSNLIVSLSSTTVPI 1056

Query: 137  MLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNY-NSNYNHKIYGLITTILQIF 191
            +  +           + I SY  + F + +   SN  + +Y   +Y    T    +
Sbjct: 1057 IKNVLPS--LVYGEYNIINSYTGNDFGVITIVISNSPSGSYPSTLYNTDQTQTSSY 1110


>gnl|CDD|214368 CHL00117, rpoC2, RNA polymerase beta'' subunit; Reviewed.
          Length = 1364

 Score = 30.3 bits (69), Expect = 2.7
 Identities = 40/202 (19%), Positives = 76/202 (37%), Gaps = 31/202 (15%)

Query: 7   INYFFYKIGLNIKILTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKNPSNFTE---DW 63
           +NY  Y  G  I+   ISSI       ++   L     +   +  + +  ++F E   + 
Sbjct: 808 VNYILYGNGKPIRG--ISSI------QLVRTCLVLNWDQDKKSSSIEEARASFVEVRTNG 859

Query: 64  LLKTVNKISK-NANISTPEVAIYNGEINAFATGAFKNSSLIAISANLLESMTKLEIKAII 122
           L++   +I+   + IS      Y  + N  +     +S L+  S N L+S        I 
Sbjct: 860 LIRDFLRINLVKSPIS------YIRKRNDPS-----SSGLLVQSNNFLDSTNIYSKAEIQ 908

Query: 123 AHEISHISNGDMVTMLLIQGVINTFIILISRIFSYILDCFLLNSYDDSNYNSNYNHKIYG 182
           +   S   N   +  LL +   +  ++++S       DCF +  ++      + N K   
Sbjct: 909 SQ--SLSQNQGTIRTLLNRNKESQSLLILSS-----SDCFRIGPFNGKKSKYH-NIKESN 960

Query: 183 LITTILQIFLGFGASIVVAWFS 204
            +  I       G  + +A FS
Sbjct: 961 PLIPIRNSLGPLGTVLQIANFS 982


>gnl|CDD|226341 COG3820, COG3820, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 230

 Score = 29.5 bits (66), Expect = 2.8
 Identities = 14/51 (27%), Positives = 22/51 (43%)

Query: 200 VAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGI 250
           + W  RN     D   +KL+GT    I  +R  ++ +  N  P     LG+
Sbjct: 109 ILWLLRNHPELKDAQISKLIGTTKSTIEQIRTRTHWNSANITPMDPVTLGL 159


>gnl|CDD|189003 cd09596, M36, Peptidase M36 family also known as fungalysin family.
            Peptidase M36 protease family, also known as fungalysin
           (elastinolytic metalloproteinase) family, includes
           endopeptidases from pathogenic fungi. Fungalysin can
           hydrolyze extracellular matrix proteins such as elastin
           and keratin, with a preference for cleavage on the amino
           side of hydrophobic residues with bulky side-chains.
           This family is similar to the M4 (thermolysin) family
           due to the presence of the active site residues in HEXXH
           and EXXXD motifs, as well as its fold prediction. Some
           of these enzymes also contain a protease-associated (PA)
           domain insert. The eukaryotic M36 and bacterial M4
           families of metalloproteases also share a conserved
           domain in their propeptides called FTP
           (fungalysin/thermolysin propeptide). Aspergillus
           fumigatus causes the pulmonary disease aspergillosis by
           invading the lungs of immuno-compromised animals. It
           secretes fungalysin that possibly breaks down
           proteinaceous structural barriers. A solid lesion known
           as an aspergilloma can grow in a lung cavity,
           particularly following recovery from tuberculosis.
          Length = 271

 Score = 29.3 bits (66), Expect = 4.3
 Identities = 20/75 (26%), Positives = 27/75 (36%), Gaps = 22/75 (29%)

Query: 74  NANISTPEVAIYNGEINA--FATGAFKNSSLIAISANLLESMTKLEIKAIIAHEISH-IS 130
           NAN STP   I +  +    F  G  ++ +  A                +IAHE  H IS
Sbjct: 68  NANFSTPPDGI-SPRMQMYLFGGGGCRDGAEDA---------------DVIAHEYGHGIS 111

Query: 131 N---GDMVTMLLIQG 142
           N   G    +   Q 
Sbjct: 112 NRLVGGGAGLSSGQS 126


>gnl|CDD|220310 pfam09608, Alph_Pro_TM, Putative transmembrane protein
           (Alph_Pro_TM).  This family consists of predicted
           transmembrane proteins of about 270 amino acids. Members
           are found, so far, only among the Alphaproteobacteria
           and only once in each genome.
          Length = 236

 Score = 28.7 bits (65), Expect = 4.6
 Identities = 12/33 (36%), Positives = 15/33 (45%), Gaps = 9/33 (27%)

Query: 180 IYGLITTILQIFLGFGASIVVAWFSRNREFRAD 212
           +YGL+   L I  G+ AS V         FR D
Sbjct: 213 LYGLLAVALAIAAGWLASAV---------FRRD 236


>gnl|CDD|239654 cd03682, ClC_sycA_like, ClC sycA-like chloride channel proteins.
           This ClC family presents in bacteria, where it
           facilitates acid resistance in acidic soil. Mutation of
           this gene (sycA) in Rhizobium tropici CIAT899 causes
           serious deficiencies in nodule development, nodulation
           competitiveness, and N2 fixation on Phaseolus vulgaris
           plants, due to its reduced ability for acid resistance. 
           This family is part of the ClC chloride channel
           superfamiy. These proteins catalyse the selective flow
           of Cl- ions across cell membranes and Cl-/H+ exchange
           transport. These proteins share two characteristics that
           are apparently inherent to the entire ClC chloride
           channel superfamily: a unique double-barreled
           architecture and voltage-dependent gating mechanism. The
           gating is conferred by the permeating anion itself,
           acting as the gating charge.
          Length = 378

 Score = 29.1 bits (66), Expect = 4.9
 Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 10/61 (16%)

Query: 3   SLLGINYFFYKIGLNIKI-------LTISSIIFGFTGSIISLLLSKKIAKWSMNLMVIKN 55
             LG+ +  Y I     +       + ++ IIFG  G + + LL   + K       IKN
Sbjct: 178 HALGLEHTHYHIVFIPTLDPLLFVKVILAGIIFGLAGRLFAELLH-FLKKLLKKR--IKN 234

Query: 56  P 56
           P
Sbjct: 235 P 235


>gnl|CDD|148070 pfam06242, DUF1013, Protein of unknown function (DUF1013).  Family
           of uncharacterized proteins found in Proteobacteria.
          Length = 141

 Score = 28.0 bits (63), Expect = 5.8
 Identities = 17/51 (33%), Positives = 26/51 (50%)

Query: 200 VAWFSRNREFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAILGI 250
           +AW  RN    +D   +KL+GT    I A+R  S+ +  N  P+    LG+
Sbjct: 74  IAWLLRNHPELSDAQISKLIGTTKTTIEAIRDRSHWNSANIKPRDPVTLGL 124


>gnl|CDD|179937 PRK05101, PRK05101, galactokinase; Provisional.
          Length = 382

 Score = 28.4 bits (64), Expect = 7.5
 Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 13/48 (27%)

Query: 220 GTPDPMISAL-----------RHLSNISIVNDLPKSIAILGINNNFKK 256
           G  D +ISAL           R L   ++   +P+ +A++ IN+N K+
Sbjct: 171 GIMDQLISALGKKDHALLIDCRSLETKAVP--MPEGVAVVIINSNVKR 216


>gnl|CDD|215729 pfam00117, GATase, Glutamine amidotransferase class-I. 
          Length = 186

 Score = 27.6 bits (62), Expect = 9.3
 Identities = 13/66 (19%), Positives = 22/66 (33%), Gaps = 3/66 (4%)

Query: 190 IFLGFGASIVVAWFSRNR--EFRADHGSAKLLGTPDPMISALRHLSNISIVNDLPKSIAI 247
           + L FG  ++ A    +        H      G P+  I    H S     + LP  + +
Sbjct: 83  LALAFGGKVIKAKKEPHHGKNSPVGHDKGLFYGLPNVFIVRRYH-SYAVDPDTLPDGLEV 141

Query: 248 LGINNN 253
              + N
Sbjct: 142 TAASEN 147


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.324    0.138    0.392 

Gapped
Lambda     K      H
   0.267   0.0736    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,579,792
Number of extensions: 2276173
Number of successful extensions: 3718
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3667
Number of HSP's successfully gapped: 114
Length of query: 436
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 336
Effective length of database: 6,502,202
Effective search space: 2184739872
Effective search space used: 2184739872
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.5 bits)
S2: 60 (26.9 bits)