RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy768
         (475 letters)



>gnl|CDD|216503 pfam01435, Peptidase_M48, Peptidase family M48. 
          Length = 223

 Score =  107 bits (270), Expect = 6e-27
 Identities = 61/218 (27%), Positives = 86/218 (39%), Gaps = 44/218 (20%)

Query: 227 KRVGAVVKRLIDANKVYMEHNNFKYP-ITIIDDPLINAFVFPDG---RIFMFTGMFQLCQ 282
            R+  VV+RL D+         F    + ++D P  NAF    G   R+ + TG+     
Sbjct: 32  ARLQRVVERLADS-----AGAGFPPWEVYVVDSPQPNAFALGGGKNKRVVVTTGLLL-AL 85

Query: 283 TDDELATVLSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLLPDLGAIVTQVLLDD 342
           T+DELA VL HE+ H   +H  E +S                         ++  +LL  
Sbjct: 86  TEDELAAVLGHEIGHIKARHSVESMS-----------------------QGLLLNLLLLL 122

Query: 343 LKSIIFELPFEREMETEADESIIF-----ELPFEREMETEADEVGLKL-----MARACYD 392
             + +                 IF      LP+ R+ E EADE G +L     +ARA Y 
Sbjct: 123 GAAALGGRALGFNANGFLTALGIFLLQLLLLPYSRKQEYEADEAGARLGGDKDLARAGYK 182

Query: 393 VRVAPLFWQKMALKETQDQVGPKM-EEYLSTHPSHENR 429
            R A  F  K+A +      G K+  E LSTHP    R
Sbjct: 183 PRAAVKFLAKLAAENLSRVSGGKLYPELLSTHPPLVER 220


>gnl|CDD|227122 COG4783, COG4783, Putative Zn-dependent protease, contains TPR
           repeats [General function prediction only].
          Length = 484

 Score = 89.4 bits (222), Expect = 5e-19
 Identities = 49/194 (25%), Positives = 84/194 (43%), Gaps = 32/194 (16%)

Query: 251 YPITIIDDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVAEKLSNK 310
           +   +++D  INAF  P G + + TG+    + + ELA V++HE+ H   +H+A      
Sbjct: 94  FTFFLVNDDSINAFATPGGYVVVNTGLLLTAENESELAGVIAHEIGHVAQRHLA------ 147

Query: 311 TFLEILYIVPLMIIWFLLPDLGAIV-----TQVLLDDLKSIIFELPFEREMETEADESII 365
             +E       M I  +L  + A +         +                   A     
Sbjct: 148 RSMEQQQRAAPMAIAGMLLGILAALAGADAGMAGI-----------------AGALAGAA 190

Query: 366 -FELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALKETQDQVGPKMEEYLSTHP 424
             +L F R+ E EAD +G+  + RA YD +  P F++++A +      G +  EYL THP
Sbjct: 191 QGQLNFSRQNEQEADRIGITTLVRAGYDPQGMPEFFERLADQLRY---GGQPPEYLLTHP 247

Query: 425 SHENRANNLESKMK 438
             E R  +L ++ +
Sbjct: 248 LPEERIADLRNRAE 261


>gnl|CDD|227123 COG4784, COG4784, Putative Zn-dependent protease [General function
           prediction only].
          Length = 479

 Score = 85.7 bits (212), Expect = 6e-18
 Identities = 51/206 (24%), Positives = 83/206 (40%), Gaps = 37/206 (17%)

Query: 231 AVVKRLIDANKVYMEHNNFKYPITIIDDPLINAFVFPDGRIFMFTGMFQLCQTDDELATV 290
            +V R++ A     E+    Y ITI++ P INAF  P G +++  G+  L     E+A V
Sbjct: 68  RMVARIVGALTAVSENPQQTYRITILNSPNINAFALPGGYLYITRGLLALANDSSEVAAV 127

Query: 291 LSHELSHTLLKHVAEKLSNKTFLEILYIVPLMIIWFLLPDLGAIVTQVLLDDLKSIIFEL 350
           L+HE+ H    H  ++   +    I                  +V +VL  D        
Sbjct: 128 LAHEMGHVTANHGIQRQQREAAEVI---------------ASRVVAEVLGSDAA------ 166

Query: 351 PFEREMETEADESII----FELPFEREMETEADEVGLKLMARACYDVRVAPLFWQKMALK 406
                      +++I        F R  E +AD +G+K++  A YD   A  F Q MA  
Sbjct: 167 ---------GKQALIRGKLRLAQFSRNQELQADAIGIKMLGEAGYDPYAAARFLQSMAAY 217

Query: 407 ETQDQVGPKMEE---YLSTHPSHENR 429
                V    ++   +L++HP+   R
Sbjct: 218 TDFRSVSGAADQSLDFLASHPNTPQR 243


>gnl|CDD|223575 COG0501, HtpX, Zn-dependent protease with chaperone function
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 302

 Score = 82.7 bits (204), Expect = 2e-17
 Identities = 47/201 (23%), Positives = 72/201 (35%), Gaps = 27/201 (13%)

Query: 249 FKYPITIIDDPLINAFVFP----DGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA 304
               + I++ P  NAF       +GR+ + TG+  L   DDEL  VL+HEL H   +H  
Sbjct: 116 HMPEVYILETPQPNAFALGGGPKNGRVVVTTGLLDL-LNDDELEAVLAHELGHIKNRHT- 173

Query: 305 EKLSNKTFLEILYIVPLMIIWFLLPDLGAIVTQVLLDDLKSIIFELPFEREMETEADESI 364
             L   T   +L    +++    L         + L  L  ++                 
Sbjct: 174 --LVRLTLRGLLASAFVLLATLALAAGLLGEAALALLLLLLLLALFLAT----------- 220

Query: 365 IFELPFEREMETEADEVGLKL-----MARACYD-VRVAPLFWQKMALKETQDQVGPKMEE 418
           +  L F R+ E EAD    KL     +A A     R++     K  +       G +++ 
Sbjct: 221 LLVLAFSRKREYEADRFAAKLTGPEKLASALQKLARLSGRANSKAFI--ASGFSGGRLQA 278

Query: 419 YLSTHPSHENRANNLESKMKE 439
             STHP    R   L      
Sbjct: 279 LFSTHPPLAERIAALRQLALT 299


>gnl|CDD|234931 PRK01265, PRK01265, heat shock protein HtpX; Provisional.
          Length = 324

 Score = 34.7 bits (80), Expect = 0.086
 Identities = 23/86 (26%), Positives = 36/86 (41%), Gaps = 11/86 (12%)

Query: 245 EHNNFKYP-ITIIDDPLINAFVFPD---GRIFMFTGMFQLCQTDDELATVLSHELSHTLL 300
           ++N  + P + I D P  NAF +     G+    T         DE+  V  HEL H  L
Sbjct: 94  KYNGIRVPKVYIADVPFPNAFAYGSPIAGKRIAITLPLLKILNRDEIKAVAGHELGH--L 151

Query: 301 KHVAEKLSNKTFLEILYIVPLMIIWF 326
           KH      +   L  + ++P +I + 
Sbjct: 152 KH-----RDVELLMAIGLIPTLIYYL 172


>gnl|CDD|234944 PRK01345, PRK01345, heat shock protein HtpX; Provisional.
          Length = 317

 Score = 32.3 bits (74), Expect = 0.51
 Identities = 19/51 (37%), Positives = 29/51 (56%), Gaps = 9/51 (17%)

Query: 253 ITIIDDPLINAFVFPDGR------IFMFTGMFQLCQTDDELATVLSHELSH 297
           + IID+P  NAF    GR      +   TG+ Q   + +E+A V++HEL+H
Sbjct: 87  VYIIDNPQPNAFA--TGRNPENAAVAATTGLLQRL-SPEEVAGVMAHELAH 134


>gnl|CDD|235186 PRK03982, PRK03982, heat shock protein HtpX; Provisional.
          Length = 288

 Score = 31.9 bits (73), Expect = 0.71
 Identities = 25/105 (23%), Positives = 49/105 (46%), Gaps = 22/105 (20%)

Query: 275 TGMFQLCQTDDELATVLSHELSH-----TLLKHVAEKLSNKTFLEILYIVPLM--IIWFL 327
            G+  L   +DEL  V++HEL+H     TL++ +A  L+      I+Y+   +   +WF 
Sbjct: 114 EGILNLL-NEDELEGVIAHELTHIKNRDTLIQTIAATLAGA----IMYLAQWLSWGLWFG 168

Query: 328 LPD---------LGAIVTQVLLDDLKSIIFELPFEREMETEADES 363
                       +G+++  ++L  + + + +    R+ E  ADE 
Sbjct: 169 GGGRDDRNGGNPIGSLLL-IILAPIAATLIQFAISRQREFSADEG 212


>gnl|CDD|183649 PRK12649, PRK12649, putative monovalent cation/H+ antiporter
           subunit A; Reviewed.
          Length = 789

 Score = 31.7 bits (72), Expect = 0.88
 Identities = 15/34 (44%), Positives = 23/34 (67%), Gaps = 2/34 (5%)

Query: 317 YIVPLMIIWFLLPDLGAIVTQVLLDDLKSIIFEL 350
           Y+V L+ I+   PDL   +TQ+L++ L +IIF L
Sbjct: 653 YLVSLLFIYLKAPDLA--LTQILVETLSTIIFLL 684


>gnl|CDD|199913 cd11007, M35_like_1, Peptidase M35-like domain of uncharacterized
           proteins.  This family contains proteins similar to the
           M35 Zn2+-metallopeptidases, also known as the
           deuterolysin family, presumably these are bacterial
           metalloendopeptidases that have yet to be characterized.
           Typically, members of this family of extracellular
           peptidases contain a unique zinc-binding motif (the
           aspzincin motif), defined by the HExxH + D motif where
           an aspartic acid is the third zinc ligand; however,
           members of this family do not contain the GTXDXXYG motif
           C-terminal to the His zinc ligands that is typical for
           the M35 proteases. Deuterolysins are highly active
           towards basic nuclear proteins such as histones and
           protamines, with a preference for a Lys or Arg residue
           in the P1' subsite. MEPs specifically cleave
           peptidyl-lysine bonds (-X-Lys-) in proteins and
           peptides. Many members of the M35 peptidases display
           unusual thermostabilities.
          Length = 183

 Score = 30.5 bits (69), Expect = 1.2
 Identities = 19/81 (23%), Positives = 30/81 (37%), Gaps = 11/81 (13%)

Query: 226 YKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLIN--AFVFPDGRI---FMFTGMFQL 280
            K++ AV+  L   N V    ++ K     + +      AFV     I    +F  +   
Sbjct: 62  LKKILAVLNSLTPENFVR--SDSEKDTCGCVPNDGDGEAAFVCGPDTISIHVIFCELGAF 119

Query: 281 C----QTDDELATVLSHELSH 297
                   D  A+ + HELSH
Sbjct: 120 FDAASNGLDSQASTIIHELSH 140


>gnl|CDD|189011 cd09604, M1_APN_5, Peptidase M1 family containing bacterial
           Aminopeptidase N.  This family contains bacterial
           aminopeptidase N (APN; CD13; Alanyl aminopeptidase; EC
           3.4.11.2), a Type II integral membrane protease
           belonging to the M1 gluzincin family. APN consists of a
           small N-terminal cytoplasmic domain, a single
           transmembrane domain and a large extracellular
           ectodomain that contains the active site.  It
           preferentially cleaves neutral amino acids from the
           N-terminus of oligopeptides and, in higher eukaryotes,
           is present in a variety of human tissues and cell types
           (leukocyte, fibroblast, endothelial and epithelial
           cells). APN expression is dysregulated in inflammatory
           diseases such as chronic pain, rheumatoid arthritis,
           multiple sclerosis, systemic sclerosis, systemic lupus
           erythematosus, polymyositis/dermatomyosytis and
           pulmonary sarcoidosis, and is enhanced in tumor cells
           such as melanoma, renal, prostate, pancreas, colon,
           gastric and thyroid cancers. It is predominantly
           expressed on stem cells and on cells of the granulocytic
           and monocytic lineages at distinct stages of
           differentiation, thus considered a marker of
           differentiation. Thus, APN inhibition may lead to the
           development of anti-cancer and anti-inflammatory drugs.
           APNs are also present in many pathogenic bacteria and
           represent potential drug targets, Some APNs have been
           used commercially, such as one from Lactococcus lactis
           used in the food industry. APN also serves as a receptor
           for coronaviruses, although the virus receptor
           interaction site seems to be distinct from the enzymatic
           site and aminopeptidase activity is not necessary for
           viral infection. APNs have also been extensively studied
           as putative Cry toxin receptors. Cry1 proteins are
           pore-forming toxins that bind to the midgut epithelial
           cell membrane of susceptible insect larvae, causing
           extensive damage. Several different toxins, including
           Cry1Aa, Cry1Ab, Cry1Ac, Cry1Ba, Cry1Ca and Cry1Fa, have
           been shown to bind to APNs; however, a direct role of
           APN in cytotoxicity has been yet to be firmly
           established.
          Length = 435

 Score = 31.2 bits (71), Expect = 1.4
 Identities = 28/122 (22%), Positives = 48/122 (39%), Gaps = 18/122 (14%)

Query: 186 TGRQKFIIVKPNQLNDVTQIAYDNFIEEH----GNQVLPLGHPAYKRVGAVVKRLIDANK 241
            G + +     N   D    A   FI +     G +V     P       + KR +DA K
Sbjct: 185 GGTKTWEFEAENV-RDFALAASPKFIVDEATVDGIKVRAYYFPEDA---ELAKRYLDAAK 240

Query: 242 VYMEHNN---FKYP---ITIIDDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHEL 295
             +E  +     YP    +++++P      +P G   +   + +L      L TVL+HE+
Sbjct: 241 KALEFYSELFGPYPYKEFSVVENPFPGGMEYP-GLTLIGGRVLRL---PFILETVLAHEI 296

Query: 296 SH 297
           +H
Sbjct: 297 AH 298


>gnl|CDD|179418 PRK02391, PRK02391, heat shock protein HtpX; Provisional.
          Length = 296

 Score = 30.7 bits (70), Expect = 1.4
 Identities = 18/51 (35%), Positives = 26/51 (50%), Gaps = 9/51 (17%)

Query: 253 ITIIDDPLINAFVFPDGR------IFMFTGMFQLCQTDDELATVLSHELSH 297
           + + D  + NAF    GR      + + TG+ +     DEL  VL+HELSH
Sbjct: 96  VAVADSDVPNAFA--TGRSPKNAVVCVTTGLMRRL-DPDELEAVLAHELSH 143


>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
           translational repressor [Translation, ribosomal
           structure and biogenesis].
          Length = 777

 Score = 30.9 bits (70), Expect = 2.0
 Identities = 19/144 (13%), Positives = 43/144 (29%), Gaps = 33/144 (22%)

Query: 49  QDTIGCILSCDKKYSTCLNFQITGHQDKHLTRVLFHQSANHQLQVFNIRVLRCFHTSQPK 108
           Q  I  + +  +         +      +  +++  Q+ NH +Q       +C      K
Sbjct: 529 QKAIDIVSTDIQ------ISLLVEELRPYCLQLIKDQNGNHVIQ-------KCIEK-FNK 574

Query: 109 NALPGFIYLIFKPILR--------VVGAIVIARISRKWWSKLSPDQKKIILSQIKKHQDK 160
                    IF  I            G+ V+ R         + + K+ ++ +I  +   
Sbjct: 575 EKNQF----IFDSINENLYDLSTHRYGSRVVQRCLENC----NSEDKENLVEEIISNSKY 626

Query: 161 IAGITLTSIGLAYIYYFLHLETCP 184
           ++       G   + + L     P
Sbjct: 627 LS---QDQYGNYVVQHILDNGAEP 647


>gnl|CDD|235478 PRK05457, PRK05457, heat shock protein HtpX; Provisional.
          Length = 284

 Score = 30.1 bits (69), Expect = 2.3
 Identities = 54/232 (23%), Positives = 76/232 (32%), Gaps = 97/232 (41%)

Query: 255 IIDDPLINAFVFPDGRIFMFTGMFQ----------LCQ--TDDELATVLSHELSH----- 297
           I   P INAF          TG  +          L Q  + DE+  VL+HE+SH     
Sbjct: 99  IYHSPEINAFA---------TGASKNNSLVAVSTGLLQNMSRDEVEAVLAHEISHIANGD 149

Query: 298 ----TLLKHVAEKLSNKTFLEILYIVPLMIIWFLLPDLGAIVTQVLLDD---------LK 344
               TL++ V     N TF           + FL   +  IV + +  +         + 
Sbjct: 150 MVTMTLIQGVV----N-TF-----------VIFLSRIIAQIVDRFVSGNEEGNGIGYFIV 193

Query: 345 SIIFELPFEREMETEADESI---IFELPFEREMETEADEVGLKLMARACYDVRVAPLFWQ 401
           SI+ E+ F           I   I  + F R  E  AD  G KL  R            +
Sbjct: 194 SIVLEIVF----------GILASIIVMWFSRHREFRADAGGAKLAGR------------E 231

Query: 402 KM--ALK--------ETQDQV-------GPKMEEYLSTHPSHENRANNLESK 436
           KM  AL+        +    +          + E   +HP  E R   L S 
Sbjct: 232 KMIAALQRLKTSYEPQLPGSMAAFGINGKSGLSELFMSHPPLEKRIAALRSG 283


>gnl|CDD|143397 cd07078, ALDH, NAD(P)+ dependent aldehyde dehydrogenase family.
           The aldehyde dehydrogenase family (ALDH) of NAD(P)+
           dependent enzymes, in general, oxidize a wide range of
           endogenous and exogenous aliphatic and aromatic
           aldehydes to their corresponding carboxylic acids and
           play an  important role in detoxification. Besides
           aldehyde detoxification, many ALDH isozymes possess
           multiple additional catalytic and non-catalytic
           functions such as participating in  metabolic pathways,
           or as  binding proteins, or as osmoregulants, to mention
           a few. The enzyme has three domains, a NAD(P)+
           cofactor-binding domain, a catalytic domain, and a
           bridging domain; and the active enzyme  is generally
           either homodimeric or homotetrameric. The catalytic
           mechanism is proposed to involve cofactor binding,
           resulting in a conformational change and activation of
           an invariant catalytic cysteine nucleophile. The
           cysteine and aldehyde substrate form an oxyanion
           thiohemiacetal intermediate resulting in hydride
           transfer to the cofactor and formation of a
           thioacylenzyme intermediate. Hydrolysis of the
           thioacylenzyme and release of the carboxylic acid
           product occurs, and in most cases, the reduced cofactor
           dissociates from the enzyme. The evolutionary
           phylogenetic tree of ALDHs appears to have an initial
           bifurcation between what has been characterized as the
           classical aldehyde dehydrogenases, the ALDH family
           (ALDH) and extended family members or aldehyde
           dehydrogenase-like (ALDH-like) proteins. The ALDH
           proteins are represented by enzymes which share a number
           of highly conserved residues necessary for catalysis and
           cofactor binding and they include such proteins as
           retinal dehydrogenase, 10-formyltetrahydrofolate
           dehydrogenase, non-phosphorylating glyceraldehyde
           3-phosphate dehydrogenase,
           delta(1)-pyrroline-5-carboxylate dehydrogenases,
           alpha-ketoglutaric semialdehyde dehydrogenase,
           alpha-aminoadipic semialdehyde dehydrogenase, coniferyl
           aldehyde dehydrogenase and succinate-semialdehyde
           dehydrogenase.  Included in this larger group are all
           human, Arabidopsis, Tortula, fungal, protozoan, and
           Drosophila ALDHs identified in families ALDH1 through
           ALDH22 with the exception of families ALDH18, ALDH19,
           and ALDH20 which are present in the ALDH-like group.
          Length = 432

 Score = 29.9 bits (68), Expect = 3.0
 Identities = 10/41 (24%), Positives = 23/41 (56%), Gaps = 4/41 (9%)

Query: 128 AIVIARISRKWWSKLSPDQKKIIL----SQIKKHQDKIAGI 164
           A+  AR + K W+ L P ++  IL      +++ ++++A +
Sbjct: 3   AVAAARAAFKAWAALPPAERAAILRKLADLLEERREELAAL 43


>gnl|CDD|221619 pfam12520, DUF3723, Protein of unknown function (DUF3723).  This
           family of proteins is found in eukaryotes. Proteins in
           this family are typically between 374 and 1069 amino
           acids in length. There is a conserved LGF sequence
           motif.
          Length = 506

 Score = 30.0 bits (68), Expect = 3.1
 Identities = 7/29 (24%), Positives = 14/29 (48%)

Query: 136 RKWWSKLSPDQKKIILSQIKKHQDKIAGI 164
           R+WW++LS   K      ++K++      
Sbjct: 122 RRWWARLSSSNKARRRRLLQKNRRLRNAF 150


>gnl|CDD|204379 pfam10023, DUF2265, Predicted aminopeptidase (DUF2265).  Members of
           this family of bacterial proteins comprise various
           hypothetical proteins and putative aminopeptidases.
           Their exact function, has not, as yet, been defined.
          Length = 338

 Score = 29.1 bits (66), Expect = 4.6
 Identities = 12/41 (29%), Positives = 19/41 (46%), Gaps = 15/41 (36%)

Query: 257 DDPLINAFVFPDGRIFMFTGMFQLCQTDDELATVLSHELSH 297
           +DPL++ F+                  D ELA ++ HEL+H
Sbjct: 151 NDPLLSTFIRYP---------------DGELARLIFHELAH 176


>gnl|CDD|234514 TIGR04231, seadorna_VP5, seadornavirus VP5 protein.  This protein
           family occurs in the seadornavirus virus group, with
           designations VP5 in Banna virus, and VP6 in Kadipiro
           virus and Liao ning virus. The function is unassigned.
          Length = 505

 Score = 29.5 bits (66), Expect = 4.7
 Identities = 25/98 (25%), Positives = 41/98 (41%), Gaps = 17/98 (17%)

Query: 209 NFIEEHGNQ----VLPLGHPAYKRVGAVVKRLIDANKVYMEHNNFKYPITIIDDPLINAF 264
           + ++ HGN     V PL      R   +++ L D+N +      FK      D+ L  A+
Sbjct: 259 SRVKMHGNSRFGFVKPLDDAQKSRSTGLIRALDDSNSIR-----FKSIGVGRDEKLSYAY 313

Query: 265 VFPDGRIFMF-----TGMFQLCQTDDELATVLSHELSH 297
              DG +  +      G   LC T D +  ++ H+L H
Sbjct: 314 ---DGCLLYYIKKIQNGNRGLCVTIDNINKLIEHKLLH 348


>gnl|CDD|179524 PRK03001, PRK03001, M48 family peptidase; Provisional.
          Length = 283

 Score = 29.2 bits (66), Expect = 4.9
 Identities = 16/54 (29%), Positives = 30/54 (55%), Gaps = 11/54 (20%)

Query: 255 IIDDPLINAFVFPDGR------IFMFTGMFQLCQTDDELATVLSHELSHTLLKH 302
           +I++   NAF    GR      +   TG+ ++  ++ E+  V++HEL+H  +KH
Sbjct: 89  LINEDQPNAFA--TGRNPEHAAVAATTGILRVL-SEREIRGVMAHELAH--VKH 137


>gnl|CDD|213809 TIGR03406, FeS_long_SufT, probable FeS assembly SUF system protein
           SufT.  The function is unknown for this protein family,
           but members are found almost always in operons for the
           the SUF system of iron-sulfur cluster biosynthesis. The
           SUF system is present elsewhere on the chromosome for
           those few species where SUF genes are not adjacent. This
           family shares this property of association with the SUF
           system with a related family, TIGR02945. TIGR02945
           consists largely of a DUF59 domain (see pfamam family
           pfam01883), while this protein is about double the
           length, with a unique N-terminal domain and DUF59
           C-terminal domain. A location immediately downstream of
           the cysteine desulfurase gene sufS in many contexts
           suggests the gene symbol sufT. Note that some other
           homologs of this family and of TIGR02945, but no actual
           members of this family, are found in operons associated
           with phenylacetic acid (or other ring-hydroxylating)
           degradation pathways [Biosynthesis of cofactors,
           prosthetic groups, and carriers, Other].
          Length = 174

 Score = 28.5 bits (64), Expect = 5.1
 Identities = 14/41 (34%), Positives = 26/41 (63%), Gaps = 4/41 (9%)

Query: 338 VLLDDLKSIIFELPFEREMETEADESIIFELPFEREMETEA 378
           VL++D++  +  +P   E+E E    ++F+ P+ REM +EA
Sbjct: 131 VLVEDVEDKVLAVPNVDEVEVE----LVFDPPWSREMMSEA 167


>gnl|CDD|188992 cd02699, M4, Peptidase M4 family includes thermolysin, pseudolysin,
           aureolysin, neutral protease and bacillolysin.  M4
           Peptidase family belong to the Glu-zincins family and
           includes numerous zinc-dependent metallopeptidases that
           hydrolyze peptide bonds such as thermolysin (EC
           3.4.24.27), pseudolysin (the extracellullar elastase of
           Pseudomonas aeruginosa), aureolysin (the extracellular
           metalloproteinase from Staphylococcus aureus), neutral
           protease from Bacillus cereus as well as fungalysin
           (from Aspergillus fumigates) and bacillolysin (EC
           3.4.24.28). Typically, the M4 peptidases consist of a
           presequence (signal sequence), a propeptide sequence and
           a peptidase unit. The presequence is cleaved off during
           export while the propeptide has inhibitory and chaperone
           functions and facilitates folding. The propeptide
           remains attached until the peptidase is secreted and can
           be safely activated. All peptidases in this family bind
           a single catalytic zinc ion which is tetrahedrally
           co-ordinated by three amino acid ligands and a water
           molecule that forms the nucleophile on activation during
           catalysis. These peptidases are secreted eubacterial
           endopeptidases from Gram-positive or Gram-negative
           sources that degrade extracellular proteins and peptides
           for bacterial nutrition. They contain HEXXH and EXXXD
           motifs as part of their active site and are selectively
           inhibited by Steptomyces metalloproteinase inhibitor
           (SMPI) as well as by phosphoramidon from Streptomyces
           tanashiensis. A large number of these enzymes are
           implicated as key factors in the pathogenesis of various
           diseases, including  gastritis, peptic ulcer, gastric
           carcinoma, cholera and several types of bacterial
           infections, and are therefore important drug targets.
           Some enzymes of the family can function at extremes of
           temperatures, while some function in organic solvents,
           thus rendering them novel targets for biotechnological
           applications.
          Length = 230

 Score = 28.8 bits (65), Expect = 5.5
 Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 13/49 (26%)

Query: 262 NAF------VFPDGRIFMFTGMFQLCQTDDELATVLSHELSHTLLKHVA 304
           NAF      VF DG    FT       + D    V++HEL+H +    +
Sbjct: 81  NAFWDGSQMVFGDGDGSTFTP---FAVSLD----VVAHELTHGVTDSTS 122


>gnl|CDD|162041 TIGR00795, lctP, L-lactate transport.  The Lactate Permease (LctP)
           Family (TC 2.A.14) The only characterized member of this
           family, from E. coli, appears to catalyze lactate:H+
           uptake. Members of this family have 12 probable TMS
           [Transport and binding proteins, Carbohydrates, organic
           alcohols, and acids].
          Length = 530

 Score = 29.3 bits (66), Expect = 5.6
 Identities = 12/51 (23%), Positives = 24/51 (47%), Gaps = 9/51 (17%)

Query: 110 ALPGFIYLIFKPILRVVGAIVIARIS---------RKWWSKLSPDQKKIIL 151
            L GF+Y ++     ++ AI + ++S         R     +SPD++  +L
Sbjct: 60  FLYGFLYGLWPIAWIIIAAIFLYKLSVKSGAFEIIRSSILSISPDRRIQVL 110


>gnl|CDD|239796 cd04268, ZnMc_MMP_like, Zinc-dependent metalloprotease, MMP_like
           subfamily. This group contains matrix metalloproteinases
           (MMPs), serralysins, and the astacin_like family of
           proteases.
          Length = 165

 Score = 28.2 bits (63), Expect = 6.3
 Identities = 9/36 (25%), Positives = 17/36 (47%), Gaps = 3/36 (8%)

Query: 268 DGRIFMFTGMFQLCQTDDELATVLSHELSHTL-LKH 302
             R+++++   +   +   L     HEL H L L+H
Sbjct: 77  LARVYLYSSFVEY--SGARLRNTAEHELGHALGLRH 110


>gnl|CDD|233203 TIGR00946, 2a69, he Auxin Efflux Carrier (AEC) Family.  [Transport
           and binding proteins, Other].
          Length = 321

 Score = 28.5 bits (64), Expect = 8.2
 Identities = 8/37 (21%), Positives = 17/37 (45%)

Query: 114 FIYLIFKPILRVVGAIVIARISRKWWSKLSPDQKKII 150
             Y+I + +L ++  I++  I  K +  L  +    I
Sbjct: 2   ITYVILETVLPILVVILLGYILGKRFGILDEEHASGI 38


>gnl|CDD|219022 pfam06426, SATase_N, Serine acetyltransferase, N-terminal.  The
           N-terminal domain of serine acetyltransferase has a
           sequence that is conserved in plants and bacteria.
          Length = 105

 Score = 27.0 bits (61), Expect = 8.4
 Identities = 10/31 (32%), Positives = 16/31 (51%), Gaps = 1/31 (3%)

Query: 288 ATVLSHE-LSHTLLKHVAEKLSNKTFLEILY 317
           AT+L+H+ L   L   +A KL++ T      
Sbjct: 25  ATILNHDSLESALAYRLANKLASPTMSADTL 55


>gnl|CDD|224553 COG1638, DctP, TRAP-type C4-dicarboxylate transport system,
           periplasmic component [Carbohydrate transport and
           metabolism].
          Length = 332

 Score = 28.4 bits (64), Expect = 9.0
 Identities = 6/23 (26%), Positives = 13/23 (56%)

Query: 134 ISRKWWSKLSPDQKKIILSQIKK 156
           +S+ +W  L  + + I+L   K+
Sbjct: 242 VSKAFWDSLPEEDQTILLEAAKE 264


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.323    0.139    0.418 

Gapped
Lambda     K      H
   0.267   0.0766    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,460,594
Number of extensions: 2425202
Number of successful extensions: 2330
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2313
Number of HSP's successfully gapped: 47
Length of query: 475
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 374
Effective length of database: 6,457,848
Effective search space: 2415235152
Effective search space used: 2415235152
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 61 (27.2 bits)