RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6405
         (548 letters)



>gnl|CDD|238492 cd00992, PDZ_signaling, PDZ domain found in a variety of Eumetazoan
           signaling molecules, often in tandem arrangements. May
           be responsible for specific protein-protein
           interactions, as most PDZ domains bind C-terminal
           polypeptides, and binding to internal (non-C-terminal)
           polypeptides and even to lipids has been demonstrated.
           In this subfamily of PDZ domains an N-terminal
           beta-strand forms the peptide-binding groove base, a
           circular permutation with respect to PDZ domains found
           in proteases.
          Length = 82

 Score = 83.4 bits (207), Expect = 8e-20
 Identities = 37/83 (44%), Positives = 52/83 (62%), Gaps = 4/83 (4%)

Query: 53  VVVLHRDLPGNSVGITLAGGSDYESKEITVHKVISGTPADRDGRLQKGDRILSINGKSMK 112
            V L +D PG  +G +L GG D     I V +V  G PA+R G L+ GDRIL +NG S++
Sbjct: 3   TVTLRKD-PGGGLGFSLRGGKDSGGG-IFVSRVEPGGPAER-GGLRVGDRILEVNGVSVE 59

Query: 113 GLTHKESLAILKR-GPPVEITLI 134
           GLTH+E++ +LK  G  V +T+ 
Sbjct: 60  GLTHEEAVELLKNSGDEVTLTVR 82



 Score = 71.4 bits (176), Expect = 1e-15
 Identities = 35/76 (46%), Positives = 44/76 (57%), Gaps = 5/76 (6%)

Query: 128 PVEITLIKD-GAGLGFSLEGGKDSPFGDQPLTVKKIFTGGCAEKNGQLKAGDEILTINNV 186
              +TL KD G GLGFSL GGKDS      + V ++  GG AE+ G L+ GD IL +N V
Sbjct: 1   VRTVTLRKDPGGGLGFSLRGGKDS---GGGIFVSRVEPGGPAERGG-LRVGDRILEVNGV 56

Query: 187 PVTEMSRIEAWSLLKK 202
            V  ++  EA  LLK 
Sbjct: 57  SVEGLTHEEAVELLKN 72


>gnl|CDD|214570 smart00228, PDZ, Domain present in PSD-95, Dlg, and ZO-1/2.  Also
           called DHR (Dlg homologous region) or GLGF (relatively
           well conserved tetrapeptide in these domains). Some PDZs
           have been shown to bind C-terminal polypeptides; others
           appear to bind internal (non-C-terminal) polypeptides.
           Different PDZs possess different binding specificities.
          Length = 85

 Score = 76.6 bits (189), Expect = 2e-17
 Identities = 35/89 (39%), Positives = 55/89 (61%), Gaps = 5/89 (5%)

Query: 50  EVSVVVLHRDLPGNSVGITLAGGSDYESKEITVHKVISGTPADRDGRLQKGDRILSINGK 109
           E  +V L +   G  +G +L GG D E   + V  V+ G+PA + G L+ GD IL +NG 
Sbjct: 1   EPRLVELEKG--GGGLGFSLVGGKD-EGGGVVVSSVVPGSPAAKAG-LRVGDVILEVNGT 56

Query: 110 SMKGLTHKESLAILKR-GPPVEITLIKDG 137
           S++GLTH E++ +LK+ G  V +T+++ G
Sbjct: 57  SVEGLTHLEAVDLLKKAGGKVTLTVLRGG 85



 Score = 68.2 bits (167), Expect = 2e-14
 Identities = 32/78 (41%), Positives = 40/78 (51%), Gaps = 4/78 (5%)

Query: 128 PVEITLIKDGAGLGFSLEGGKDSPFGDQPLTVKKIFTGGCAEKNGQLKAGDEILTINNVP 187
           P  + L K G GLGFSL GGKD       + V  +  G  A K G L+ GD IL +N   
Sbjct: 2   PRLVELEKGGGGLGFSLVGGKDE---GGGVVVSSVVPGSPAAKAG-LRVGDVILEVNGTS 57

Query: 188 VTEMSRIEAWSLLKKLND 205
           V  ++ +EA  LLKK   
Sbjct: 58  VEGLTHLEAVDLLKKAGG 75


>gnl|CDD|201332 pfam00595, PDZ, PDZ domain (Also known as DHR or GLGF).  PDZ
           domains are found in diverse signaling proteins.
          Length = 80

 Score = 63.0 bits (154), Expect = 1e-12
 Identities = 32/80 (40%), Positives = 48/80 (60%), Gaps = 4/80 (5%)

Query: 54  VVLHRDLPGNSVGITLAGGSDYESKEITVHKVISGTPADRDGRLQKGDRILSINGKSMKG 113
           V L +   G  +G +L GGSD +   I V +V+ G  A+  G LQ+GDRILSING+ ++ 
Sbjct: 2   VTLEKSGRG-GLGFSLVGGSDGD-PGIFVSEVLPGGAAEAGG-LQEGDRILSINGQDLEN 58

Query: 114 LTHKESLAILKRGPPVEITL 133
           L+H +   +  +G   E+TL
Sbjct: 59  LSH-DEAVLALKGSGGEVTL 77



 Score = 53.0 bits (128), Expect = 5e-09
 Identities = 34/77 (44%), Positives = 42/77 (54%), Gaps = 5/77 (6%)

Query: 130 EITLIKDG-AGLGFSLEGGKDSPFGDQPLTVKKIFTGGCAEKNGQLKAGDEILTINNVPV 188
           E+TL K G  GLGFSL GG D   GD  + V ++  GG AE  G L+ GD IL+IN   +
Sbjct: 1   EVTLEKSGRGGLGFSLVGGSD---GDPGIFVSEVLPGGAAEAGG-LQEGDRILSINGQDL 56

Query: 189 TEMSRIEAWSLLKKLND 205
             +S  EA   LK    
Sbjct: 57  ENLSHDEAVLALKGSGG 73


>gnl|CDD|238080 cd00136, PDZ, PDZ domain, also called DHR (Dlg homologous region)
           or GLGF (after a conserved sequence motif). Many PDZ
           domains bind C-terminal polypeptides, though binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. Heterodimerization through
           PDZ-PDZ domain interactions adds to the domain's
           versatility, and PDZ domain-mediated interactions may be
           modulated dynamically through target phosphorylation.
           Some PDZ domains play a role in scaffolding
           supramolecular complexes. PDZ domains are found in
           diverse signaling proteins in bacteria, archebacteria,
           and eurkayotes. This CD contains two distinct structural
           subgroups with either a N- or C-terminal beta-strand
           forming the peptide-binding groove base. The circular
           permutation placing the strand on the N-terminus appears
           to be found in Eumetazoa only, while the C-terminal
           variant is found in all three kingdoms of life, and
           seems to co-occur with protease domains. PDZ domains
           have been named after PSD95(post synaptic density
           protein), DlgA (Drosophila disc large tumor suppressor),
           and ZO1, a mammalian tight junction protein.
          Length = 70

 Score = 59.6 bits (145), Expect = 2e-11
 Identities = 25/72 (34%), Positives = 40/72 (55%), Gaps = 4/72 (5%)

Query: 63  NSVGITLAGGSDYESKEITVHKVISGTPADRDGRLQKGDRILSINGKSMKGLTHKESLAI 122
             +G ++ GG       + V  V  G+PA+R G LQ GD IL++NG  +K LT ++   +
Sbjct: 1   GGLGFSIRGG---TEGGVVVLSVEPGSPAERAG-LQAGDVILAVNGTDVKNLTLEDVAEL 56

Query: 123 LKRGPPVEITLI 134
           LK+    ++TL 
Sbjct: 57  LKKEVGEKVTLT 68



 Score = 44.6 bits (106), Expect = 3e-06
 Identities = 24/71 (33%), Positives = 35/71 (49%), Gaps = 6/71 (8%)

Query: 139 GLGFSLEGGKDSPFGDQPLTVKKIFTGGCAEKNGQLKAGDEILTINNVPVTEMSRIEAWS 198
           GLGFS+ GG      +  + V  +  G  AE+ G L+AGD IL +N   V  ++  +   
Sbjct: 2   GLGFSIRGG-----TEGGVVVLSVEPGSPAERAG-LQAGDVILAVNGTDVKNLTLEDVAE 55

Query: 199 LLKKLNDGTIQ 209
           LLKK     + 
Sbjct: 56  LLKKEVGEKVT 66


>gnl|CDD|238488 cd00988, PDZ_CTP_protease, PDZ domain of C-terminal processing-,
           tail-specific-, and tricorn proteases, which function in
           posttranslational protein processing, maturation, and
           disassembly or degradation, in Bacteria, Archaea, and
           plant chloroplasts. May be responsible for substrate
           recognition and/or binding, as most PDZ domains bind
           C-terminal polypeptides, and binding to internal
           (non-C-terminal) polypeptides and even to lipids has
           been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 85

 Score = 52.6 bits (127), Expect = 6e-09
 Identities = 21/78 (26%), Positives = 43/78 (55%), Gaps = 9/78 (11%)

Query: 65  VGITLAGGSDYESKEITVHKVISGTPADRDGRLQKGDRILSINGKSMKGLTHKESLAILK 124
           +G+ L     Y+   + +  V+ G+PA + G ++ GD I++I+G+ + GL+ ++ + +L 
Sbjct: 4   IGLEL----KYDDGGLVITSVLPGSPAAKAG-IKAGDIIVAIDGEPVDGLSLEDVVKLL- 57

Query: 125 RGPP---VEITLIKDGAG 139
           RG     V +TL +    
Sbjct: 58  RGKAGTKVRLTLKRGDGE 75



 Score = 42.2 bits (100), Expect = 3e-05
 Identities = 20/65 (30%), Positives = 29/65 (44%), Gaps = 9/65 (13%)

Query: 137 GAGLGFSLEGGKDSPFGDQPLTVKKIFTGGCAEKNGQLKAGDEILTINNVPVTEMSRIEA 196
           G GL    + G         L +  +  G  A K G +KAGD I+ I+  PV  +S  + 
Sbjct: 3   GIGLELKYDDGG--------LVITSVLPGSPAAKAG-IKAGDIIVAIDGEPVDGLSLEDV 53

Query: 197 WSLLK 201
             LL+
Sbjct: 54  VKLLR 58


>gnl|CDD|233695 TIGR02037, degP_htrA_DO, periplasmic serine protease, Do/DeqQ
           family.  This family consists of a set proteins various
           designated DegP, heat shock protein HtrA, and protease
           DO. The ortholog in Pseudomonas aeruginosa is designated
           MucD and is found in an operon that controls mucoid
           phenotype. This family also includes the DegQ (HhoA)
           paralog in E. coli which can rescue a DegP mutant, but
           not the smaller DegS paralog, which cannot. Members of
           this family are located in the periplasm and have
           separable functions as both protease and chaperone.
           Members have a trypsin domain and two copies of a PDZ
           domain. This protein protects bacteria from thermal and
           other stresses and may be important for the survival of
           bacterial pathogens.// The chaperone function is
           dominant at low temperatures, whereas the proteolytic
           activity is turned on at elevated temperatures [Protein
           fate, Protein folding and stabilization, Protein fate,
           Degradation of proteins, peptides, and glycopeptides].
          Length = 428

 Score = 52.2 bits (126), Expect = 4e-07
 Identities = 43/167 (25%), Positives = 70/167 (41%), Gaps = 41/167 (24%)

Query: 82  VHKVISGTPADRDGRLQKGDRILSINGK---SMKGLTH---------KESLAILKRGPPV 129
           V +V+ G+PA++ G L+ GD I S+NGK   S   L           K +L IL++G   
Sbjct: 261 VAQVLPGSPAEKAG-LKAGDVITSVNGKPISSFADLRRAIGTLKPGKKVTLGILRKGKEK 319

Query: 130 EITLI----KDGAG------LGFSLE-------GGKDSPFGDQPLTVKKIFTGGCAEKNG 172
            IT+      +         LG ++                 + + V K+ +G  A + G
Sbjct: 320 TITVTLGASPEEQASSSNPFLGLTVANLSPEIRKELRLKGDVKGVVVTKVVSGSPAARAG 379

Query: 173 QLKAGDEILTINNVPVT---EMSRI-------EAWSLLKKLNDGTIQ 209
            L+ GD IL++N  PV+   E+ ++          +LL      TI 
Sbjct: 380 -LQPGDVILSVNQQPVSSVAELRKVLARAKKGGRVALLILRGGATIF 425



 Score = 36.8 bits (86), Expect = 0.028
 Identities = 25/64 (39%), Positives = 40/64 (62%), Gaps = 2/64 (3%)

Query: 76  ESKEITVHKVISGTPADRDGRLQKGDRILSINGKSMKGLTH-KESLAILKRGPPVEITLI 134
           + K + V KV+SG+PA R G LQ GD ILS+N + +  +   ++ LA  K+G  V + ++
Sbjct: 360 DVKGVVVTKVVSGSPAARAG-LQPGDVILSVNQQPVSSVAELRKVLARAKKGGRVALLIL 418

Query: 135 KDGA 138
           + GA
Sbjct: 419 RGGA 422


>gnl|CDD|221961 pfam13180, PDZ_2, PDZ domain. 
          Length = 81

 Score = 46.9 bits (112), Expect = 6e-07
 Identities = 27/73 (36%), Positives = 40/73 (54%), Gaps = 5/73 (6%)

Query: 66  GITLAGGSDYESKEITVHKVISGTPADRDGRLQKGDRILSINGKSMKGLTH-KESLAILK 124
           G+ +      E   +TV  V  G+PA + G L+ GD ILSI+GK +  LT   E +   K
Sbjct: 4   GVRV---VQNEGTGVTVVSVKEGSPAAKAG-LKPGDIILSIDGKKVNSLTELIEVILNGK 59

Query: 125 RGPPVEITLIKDG 137
            G  V++T+ +DG
Sbjct: 60  PGDTVKLTVYRDG 72



 Score = 30.7 bits (70), Expect = 0.35
 Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 6/52 (11%)

Query: 138 AGLGFSLEGGKDSPFGDQPLTVKKIFTGGCAEKNGQLKAGDEILTINNVPVT 189
             +G  +   + +  G   ++VK+   G  A K G LK GD IL+I+   V 
Sbjct: 1   GDIGVRVVQNEGT--GVTVVSVKE---GSPAAKAG-LKPGDIILSIDGKKVN 46


>gnl|CDD|238487 cd00987, PDZ_serine_protease, PDZ domain of tryspin-like serine
           proteases, such as DegP/HtrA, which are oligomeric
           proteins involved in heat-shock response, chaperone
           function, and apoptosis. May be responsible for
           substrate recognition and/or binding, as most PDZ
           domains bind C-terminal polypeptides, though binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 90

 Score = 43.0 bits (102), Expect = 2e-05
 Identities = 23/57 (40%), Positives = 36/57 (63%), Gaps = 2/57 (3%)

Query: 82  VHKVISGTPADRDGRLQKGDRILSINGKSMKGLTH-KESLAILKRGPPVEITLIKDG 137
           V  V  G+PA + G L+ GD IL++NGK +K +   + +LA LK G  V +T+++ G
Sbjct: 28  VASVDPGSPAAKAG-LKPGDVILAVNGKPVKSVADLRRALAELKPGDKVTLTVLRGG 83



 Score = 28.4 bits (64), Expect = 2.4
 Identities = 14/31 (45%), Positives = 16/31 (51%), Gaps = 1/31 (3%)

Query: 159 VKKIFTGGCAEKNGQLKAGDEILTINNVPVT 189
           V  +  G  A K G LK GD IL +N  PV 
Sbjct: 28  VASVDPGSPAAKAG-LKPGDVILAVNGKPVK 57


>gnl|CDD|234386 TIGR03900, prc_long_Delta, putative carboxyl-terminal-processing
           protease, deltaproteobacterial.  This model describes a
           multidomain protein of about 1070 residues, restricted
           to the order Myxococcales in the Deltaproteobacteria.
           Members contain a PDZ domain (pfam00595), an S41 family
           peptidase domain (pfam03572), and an SH3 domain
           (pfam06347). A core region of this family, including PDZ
           and S41 regions, is described by TIGR00225, C-terminal
           processing peptidase, which recognizes the Prc protease.
           The species distribution of this family approximates
           that of largely Deltaproteobacterial C-terminal putative
           protein-sorting domain, TIGR03901, analogous to LPXTG
           and PEP-CTERM, but the co-occurrence may reflect shared
           restriction to the Myxococcales rather than a
           substrate/target relationship.
          Length = 973

 Score = 46.3 bits (110), Expect = 4e-05
 Identities = 25/62 (40%), Positives = 38/62 (61%), Gaps = 5/62 (8%)

Query: 79  EITVHKVISGTPADRDGRLQKGDRILSINGKSMKGLTHKESLAILKRGPP---VEITLIK 135
            +TV +VI GTPA R G LQ+ D I+ I+ +S   +T  +++  L RGPP   V I + +
Sbjct: 152 NLTVVRVIPGTPAARAG-LQRNDVIVKIDDESTVNMTLNDAVGRL-RGPPDTKVTIWVRR 209

Query: 136 DG 137
           +G
Sbjct: 210 EG 211



 Score = 30.1 bits (68), Expect = 3.6
 Identities = 14/48 (29%), Positives = 25/48 (52%), Gaps = 1/48 (2%)

Query: 154 DQPLTVKKIFTGGCAEKNGQLKAGDEILTINNVPVTEMSRIEAWSLLK 201
           D+ LTV ++  G  A + G L+  D I+ I++     M+  +A   L+
Sbjct: 150 DRNLTVVRVIPGTPAARAG-LQRNDVIVKIDDESTVNMTLNDAVGRLR 196


>gnl|CDD|223864 COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer
           membrane].
          Length = 406

 Score = 45.8 bits (109), Expect = 4e-05
 Identities = 23/77 (29%), Positives = 44/77 (57%), Gaps = 6/77 (7%)

Query: 65  VGITLAGGSDYESKEITVHKVISGTPADRDGRLQKGDRILSINGKSMKGLTHKESLAILK 124
           +GI L      +   + V   I G+PA + G ++ GD I+ I+GKS+ G++  E++ +++
Sbjct: 102 IGIELQ---MEDIGGVKVVSPIDGSPAAKAG-IKPGDVIIKIDGKSVGGVSLDEAVKLIR 157

Query: 125 R--GPPVEITLIKDGAG 139
              G  V +T+++ G G
Sbjct: 158 GKPGTKVTLTILRAGGG 174



 Score = 32.3 bits (74), Expect = 0.71
 Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 1/45 (2%)

Query: 157 LTVKKIFTGGCAEKNGQLKAGDEILTINNVPVTEMSRIEAWSLLK 201
           + V     G  A K G +K GD I+ I+   V  +S  EA  L++
Sbjct: 114 VKVVSPIDGSPAAKAG-IKPGDVIIKIDGKSVGGVSLDEAVKLIR 157


>gnl|CDD|238489 cd00989, PDZ_metalloprotease, PDZ domain of bacterial and plant
           zinc metalloprotases, presumably membrane-associated or
           integral membrane proteases, which may be involved in
           signalling and regulatory mechanisms. May be responsible
           for substrate recognition and/or binding, as most PDZ
           domains bind C-terminal polypeptides, and binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 79

 Score = 41.8 bits (99), Expect = 4e-05
 Identities = 20/61 (32%), Positives = 38/61 (62%), Gaps = 5/61 (8%)

Query: 79  EITVHKVISGTPADRDGRLQKGDRILSINGKSMKGLTHKESLAILKRGP--PVEITLIKD 136
           E  + +V+ G+PA + G L+ GDRIL+ING+ +K     + +  ++  P  P+ +T+ ++
Sbjct: 13  EPVIGEVVPGSPAAKAG-LKAGDRILAINGQKIKSWE--DLVDAVQENPGKPLTLTVERN 69

Query: 137 G 137
           G
Sbjct: 70  G 70



 Score = 36.1 bits (84), Expect = 0.004
 Identities = 19/57 (33%), Positives = 24/57 (42%), Gaps = 7/57 (12%)

Query: 138 AGLGFSLEGGKDSPFGDQPLTVKKIFTGGCAEKNGQLKAGDEILTINNVPVTEMSRI 194
           A LGF   G    P       + ++  G  A K G LKAGD IL IN   +     +
Sbjct: 1   AILGFVPGGPPIEP------VIGEVVPGSPAAKAG-LKAGDRILAINGQKIKSWEDL 50


>gnl|CDD|232883 TIGR00225, prc, C-terminal peptidase (prc).  A C-terminal peptidase
           with different substrates in different species including
           processing of D1 protein of the photosystem II reaction
           center in higher plants and cleavage of a peptide of 11
           residues from the precursor form of penicillin-binding
           protein in E.coli E.coli and H influenza have the most
           distal branch of the tree and their proteins have an
           N-terminal 200 amino acids that show no homology to
           other proteins in the database [Protein fate,
           Degradation of proteins, peptides, and glycopeptides,
           Protein fate, Protein modification and repair].
          Length = 334

 Score = 43.5 bits (103), Expect = 2e-04
 Identities = 22/83 (26%), Positives = 47/83 (56%), Gaps = 7/83 (8%)

Query: 65  VGITLAGGSDYESKEITVHKVISGTPADRDGRLQKGDRILSINGKSMKGLTHKESLAIL- 123
           +GI +      +  EI +     G+PA++ G ++ GD+I+ INGKS+ G++  +++A++ 
Sbjct: 53  IGIQV----GMDDGEIVIVSPFEGSPAEKAG-IKPGDKIIKINGKSVAGMSLDDAVALIR 107

Query: 124 -KRGPPVEITLIKDGAGLGFSLE 145
            K+G  V + +++ G     +  
Sbjct: 108 GKKGTKVSLEILRAGKSKPLTFT 130



 Score = 33.9 bits (78), Expect = 0.17
 Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 1/48 (2%)

Query: 154 DQPLTVKKIFTGGCAEKNGQLKAGDEILTINNVPVTEMSRIEAWSLLK 201
           D  + +   F G  AEK G +K GD+I+ IN   V  MS  +A +L++
Sbjct: 61  DGEIVIVSPFEGSPAEKAG-IKPGDKIIKINGKSVAGMSLDDAVALIR 107


>gnl|CDD|238491 cd00991, PDZ_archaeal_metalloprotease, PDZ domain of archaeal zinc
           metalloprotases, presumably membrane-associated or
           integral membrane proteases, which may be involved in
           signalling and regulatory mechanisms. May be responsible
           for substrate recognition and/or binding, as most PDZ
           domains bind C-terminal polypeptides, and binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 79

 Score = 39.7 bits (93), Expect = 2e-04
 Identities = 19/54 (35%), Positives = 28/54 (51%), Gaps = 2/54 (3%)

Query: 85  VISGTPADRDGRLQKGDRILSINGKSMKGLTH-KESLAILKRGPPVEITLIKDG 137
           VI G+PA+ +  L  GD I SING  +  L    E+L   K G  + +T++   
Sbjct: 17  VIVGSPAE-NAVLHTGDVIYSINGTPITTLEDFMEALKPTKPGEVITVTVLPST 69


>gnl|CDD|232801 TIGR00054, TIGR00054, RIP metalloprotease RseP.  Members of this
           nearly universal bacterial protein family are regulated
           intramembrane proteolysis (RIP) proteases. Older and
           synonymous gene symbols include yaeL in E. coli, mmpA in
           Caulobacter crescentus, etc. This family includes a
           region that hits the PDZ domain, found in a number of
           proteins targeted to the membrane by binding to a
           peptide ligand. The N-terminal region of this family
           contains a perfectly conserved motif HEXGH as found in a
           number of metalloproteinases, where the Glu is the
           active site and the His residues coordinate the metal
           cation [Protein fate, Degradation of proteins, peptides,
           and glycopeptides].
          Length = 419

 Score = 41.0 bits (96), Expect = 0.001
 Identities = 40/191 (20%), Positives = 59/191 (30%), Gaps = 20/191 (10%)

Query: 73  SDYESKEITVHKVI---SGTPADRDGRLQKGDRILSINGKSMKGLTHKESLAILKRGPP- 128
           S        V  VI          +  ++ GD ILS+NG  + G            G P 
Sbjct: 118 SLIGVPGYEVGPVIELLDKNSIALEAGIEPGDEILSVNGNKIPGFKDVRQQIADIAGEPM 177

Query: 129 VEITLIKDGAGLGFSLEGGKDSPFGDQPLTVKKIFTGGCAEKNGQLKAGDEILTINNVPV 188
           VEI   ++        E     P  +    +  +     AEK G LK GD I +IN   +
Sbjct: 178 VEILAERENWTFEVMKELIPRGPKIE--PVLSDVTPNSPAEKAG-LKEGDYIQSINGEKL 234

Query: 189 TEMSRIEAWSLLKKLN----------DGTIQDIKALGYDLNSLGYPRTQDLNALGYPKTR 238
              +     S +K+            +G    I         +G   +  L  L      
Sbjct: 235 RSWTDFV--SAVKENPGKSMDIKVERNGETLSISLTPEAKGKIGIGISPSLAPLEVSYGI 292

Query: 239 DLNALEYIKTR 249
            LNA     + 
Sbjct: 293 -LNAFAKGASA 302


>gnl|CDD|177681 PLN00049, PLN00049, carboxyl-terminal processing protease;
           Provisional.
          Length = 389

 Score = 40.1 bits (94), Expect = 0.003
 Identities = 27/82 (32%), Positives = 41/82 (50%), Gaps = 5/82 (6%)

Query: 59  DLPGNSVGITLAGGSDYESKEITVHKVISGTPADRDGRLQKGDRILSINGKSMKGLTHKE 118
            + G  + +    GSD     + V     G PA R G ++ GD IL+I+G S +GL+  E
Sbjct: 83  AVTGVGLEVGYPTGSDGPPAGLVVVAPAPGGPAARAG-IRPGDVILAIDGTSTEGLSLYE 141

Query: 119 SLAILKRGP---PVEITLIKDG 137
           + A   +GP    VE+TL +  
Sbjct: 142 A-ADRLQGPEGSSVELTLRRGP 162



 Score = 30.5 bits (69), Expect = 2.2
 Identities = 22/84 (26%), Positives = 32/84 (38%), Gaps = 7/84 (8%)

Query: 118 ESLAILKRGPPVEITLIKDGAGLGFSLEGGKDSPFGDQPLTVKKIFTGGCAEKNGQLKAG 177
           E    L+ G    +T      G+G  +     S      L V     GG A + G ++ G
Sbjct: 71  EKFKSLRSGTKGAVT------GVGLEVGYPTGSDGPPAGLVVVAPAPGGPAARAG-IRPG 123

Query: 178 DEILTINNVPVTEMSRIEAWSLLK 201
           D IL I+      +S  EA   L+
Sbjct: 124 DVILAIDGTSTEGLSLYEAADRLQ 147


>gnl|CDD|238486 cd00986, PDZ_LON_protease, PDZ domain of ATP-dependent LON serine
           proteases. Most PDZ domains bind C-terminal
           polypeptides, though binding to internal
           (non-C-terminal) polypeptides and even to lipids has
           been demonstrated. In this bacterial subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
            is thought to form the peptide-binding groove base, a
           circular permutation with respect to PDZ domains found
           in Eumetazoan signaling proteins.
          Length = 79

 Score = 35.9 bits (83), Expect = 0.005
 Identities = 20/73 (27%), Positives = 36/73 (49%), Gaps = 5/73 (6%)

Query: 74  DYESKEITVHKVISGTPADRDGRLQKGDRILSINGKSMKGLTHK--ESLAILKRGPPVEI 131
           DY    + V  V+ G PA   G+L+ GD I++++GK  K    +  + +   K G  V++
Sbjct: 4   DYTYHGVYVTSVVEGMPAA--GKLKAGDHIIAVDGKPFKE-AEELIDYIQSKKEGDTVKL 60

Query: 132 TLIKDGAGLGFSL 144
            + ++   L   L
Sbjct: 61  KVKREEKELPEDL 73


>gnl|CDD|223343 COG0265, DegQ, Trypsin-like serine proteases, typically
           periplasmic, contain C-terminal PDZ domain
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 347

 Score = 38.0 bits (88), Expect = 0.010
 Identities = 24/92 (26%), Positives = 45/92 (48%), Gaps = 9/92 (9%)

Query: 61  PGNSVGITLAGGSDYESKEITVHKVISGTPADRDGRLQKGDRILSINGKSMKGLT-HKES 119
              ++G+ +A G         V  V+ G+PA + G ++ GD I ++NGK +  L+    +
Sbjct: 260 ADIALGLPVAAG-------AVVLGVLPGSPAAKAG-IKAGDIITAVNGKPVASLSDLVAA 311

Query: 120 LAILKRGPPVEITLIKDGAGLGFSLEGGKDSP 151
           +A  + G  V + L++ G     ++  G  SP
Sbjct: 312 VASNRPGDEVALKLLRGGKERELAVTLGDRSP 343



 Score = 30.2 bits (68), Expect = 2.6
 Identities = 19/68 (27%), Positives = 25/68 (36%), Gaps = 11/68 (16%)

Query: 123 LKRGPP-VEITLIKDGAGLGFSLEGGKDSPFGDQPLTVKKIFTGGCAEKNGQLKAGDEIL 181
           + RG   V    +     LG  +  G           V  +  G  A K G +KAGD I 
Sbjct: 246 VVRGYLGVIGEPLTADIALGLPVAAG---------AVVLGVLPGSPAAKAG-IKAGDIIT 295

Query: 182 TINNVPVT 189
            +N  PV 
Sbjct: 296 AVNGKPVA 303


>gnl|CDD|234035 TIGR02860, spore_IV_B, stage IV sporulation protein B.  SpoIVB, the
           stage IV sporulation protein B of endospore-forming
           bacteria such as Bacillus subtilis, is a serine
           proteinase, expressed in the spore (rather than mother
           cell) compartment, that participates in a proteolytic
           activation cascade for Sigma-K. It appears to be
           universal among endospore-forming bacteria and occurs
           nowhere else [Cellular processes, Sporulation and
           germination].
          Length = 402

 Score = 36.9 bits (86), Expect = 0.023
 Identities = 26/95 (27%), Positives = 47/95 (49%), Gaps = 13/95 (13%)

Query: 44  LIPGTSEVSVVVLHRDLPGNSVGITLAGGSDYESKEITVHKVISGTPADRDGRLQKGDRI 103
           +IPG   + V +       N+ G+ + G SD E+++  +H     +P   +  +Q GDRI
Sbjct: 91  VIPGGQSIGVKL-------NTKGVLVVGFSDIETEKGKIH-----SPG-EEAGIQIGDRI 137

Query: 104 LSINGKSMKGLTHKESLAILKRGPPVEITLIKDGA 138
           L ING+ +K +    +L     G  + +T+ + G 
Sbjct: 138 LKINGEKIKNMDDLANLINKAGGEKLTLTIERGGK 172


>gnl|CDD|236802 PRK10942, PRK10942, serine endoprotease; Provisional.
          Length = 473

 Score = 35.5 bits (82), Expect = 0.073
 Identities = 34/129 (26%), Positives = 55/129 (42%), Gaps = 23/129 (17%)

Query: 82  VHKVISGTPADRDGRLQKGDRILSINGKSMKGLT------------HKESLAILKRGPPV 129
           V +V+  + A + G ++ GD I S+NGK +                 K +L +L+ G PV
Sbjct: 315 VSQVLPNSSAAKAG-IKAGDVITSLNGKPISSFAALRAQVGTMPVGSKLTLGLLRDGKPV 373

Query: 130 EITLI--------KDGAGLGFSLEGGKDSPFGDQP-LTVKKIFTGGCAEKNGQLKAGDEI 180
            + +          D + +   +EG + S  G    + V  +  G  A + G LK GD I
Sbjct: 374 NVNVELQQSSQNQVDSSNIFNGIEGAELSNKGGDKGVVVDNVKPGTPAAQIG-LKKGDVI 432

Query: 181 LTINNVPVT 189
           +  N  PV 
Sbjct: 433 IGANQQPVK 441


>gnl|CDD|182723 PRK10779, PRK10779, zinc metallopeptidase RseP; Provisional.
          Length = 449

 Score = 33.1 bits (76), Expect = 0.36
 Identities = 26/99 (26%), Positives = 44/99 (44%), Gaps = 17/99 (17%)

Query: 79  EITVHKVISGTPADRDGRLQKGDRILSINGKSM-----------KGLTHKESLAILKRGP 127
           E  + +V   + A + G LQ GDRI+ ++G+ +                  +L I ++G 
Sbjct: 222 EPVLAEVQPNSAASKAG-LQAGDRIVKVDGQPLTQWQTFVTLVRDNPGKPLALEIERQGS 280

Query: 128 PVEITLIKD-----GAGLGFSLEGGKDSPFGDQPLTVKK 161
           P+ +TL  D     G   GF+    K  P  D+  TV++
Sbjct: 281 PLSLTLTPDSKPGNGKAEGFAGVVPKVIPLPDEYKTVRQ 319


>gnl|CDD|223821 COG0750, COG0750, Predicted membrane-associated Zn-dependent
           proteases 1 [Cell envelope biogenesis, outer membrane].
          Length = 375

 Score = 32.5 bits (74), Expect = 0.51
 Identities = 22/121 (18%), Positives = 40/121 (33%), Gaps = 17/121 (14%)

Query: 81  TVHKVISGTPADRDGRLQKGDRILSINGKSMKGLTHKESLAILKRGPPVEITLIKDGAGL 140
            V +V   + A   G L+ GDRI++++G+ +        L +   G    +  I      
Sbjct: 132 VVGEVAPKSAAALAG-LRPGDRIVAVDGEKVASWDDVRRLLVAAAGDVFNLLTI------ 184

Query: 141 GFSLEGGKDSPFGDQPLTVKKIFTGGCAEKNGQLKAGDEIL-----TINNVPVTEMSRIE 195
                 G+      + +    +           LK GD+I+      I   P  E    +
Sbjct: 185 LVIRLDGEAHAVAAEIIKSLGLTP-----VVIPLKPGDKIVAVDVGAIGLSPNGEPDVGK 239

Query: 196 A 196
            
Sbjct: 240 V 240


>gnl|CDD|143576 cd07467, CRD_TK_ROR1, Cysteine-rich domain of tyrosine kinase-like
           orphan receptor 1.  The cysteine-rich domain (CRD) is an
           essential part of the tyrosine kinase-like orphan
           receptor 1 (Ror1), a conserved family of tyrosine
           kinases that function in various processes, including
           neuronal and skeletal development, cell polarity, and
           cell movement. Ror proteins are receptors of Wnt
           proteins, which are key players in a number of
           fundamental cellular processes in embryogenesis and
           postnatal development. In different cellular contexts,
           Ror proteins can either activate or repress
           transcription of Wnt target genes, and can modulate Wnt
           signaling by sequestering Wnt ligands. In addition, a
           number of Wnt-independent functions have been proposed
           for both Ror1 and Ror2.
          Length = 142

 Score = 31.5 bits (71), Expect = 0.53
 Identities = 21/63 (33%), Positives = 27/63 (42%), Gaps = 5/63 (7%)

Query: 233 GYPKTRDLNALE-YIKTRDLNALEYIKTRD----LNALRYPKTQDLNALEYPKTQDLNAL 287
           G PK RDL   E  I    L   EYI  R     L  L+ P  +DL   + P+  +   +
Sbjct: 75  GMPKPRDLCRDECEILENVLCQTEYIFARSNPMILMRLKLPNCEDLAQPDSPEAANCIRI 134

Query: 288 GHP 290
           G P
Sbjct: 135 GIP 137


>gnl|CDD|215829 pfam00266, Aminotran_5, Aminotransferase class-V.  This domain is
           found in amino transferases, and other enzymes including
           cysteine desulphurase EC:4.4.1.-.
          Length = 370

 Score = 32.6 bits (75), Expect = 0.54
 Identities = 26/88 (29%), Positives = 34/88 (38%), Gaps = 22/88 (25%)

Query: 162 IFTGGCAEKN--------GQLKAGDEILTINNVPVTEM---SRIEAWSLLKKLNDGTIQD 210
           IFT G  E            LK GDEIL      VTEM   + +  W  L K    T++ 
Sbjct: 65  IFTSGTTEAINLVAISLGRSLKPGDEIL------VTEMEHHANLVPWQELAKRTGATVRV 118

Query: 211 IKALGYDLNSLGYPRTQDLNALGYPKTR 238
           I      ++  G      L  L  P+T+
Sbjct: 119 IP-----VDPNGLLDLDALEKLLTPRTK 141


>gnl|CDD|226483 COG3975, COG3975, Predicted protease with the C-terminal PDZ domain
           [General function prediction only].
          Length = 558

 Score = 32.0 bits (73), Expect = 0.84
 Identities = 23/105 (21%), Positives = 40/105 (38%), Gaps = 8/105 (7%)

Query: 37  EGGCPSLLIPGTSEVSVVVLHRDLPGNSVGITLAGGSDYESKEITVHKVISGTPADRDGR 96
           EG  P  L P      +    +      +G+ +      E     +  V  G PA + G 
Sbjct: 425 EGTEPPPLNPLLERFGLTFTPKPREAYYLGLKV----KSEGGHEKITFVFPGGPAYKAG- 479

Query: 97  LQKGDRILSING--KSMKGLTHKESLAI-LKRGPPVEITLIKDGA 138
           L  GD+I++ING    +      + + + + R   +   L+K G 
Sbjct: 480 LSPGDKIVAINGISDQLDRYKVNDKIQVHVFREGRLREFLVKLGG 524


>gnl|CDD|182262 PRK10139, PRK10139, serine endoprotease; Provisional.
          Length = 455

 Score = 31.5 bits (71), Expect = 1.3
 Identities = 25/106 (23%), Positives = 48/106 (45%), Gaps = 3/106 (2%)

Query: 82  VHKVISGTPADRDGRLQKGDRILSINGKSMKGLTHKES-LAILKRGPPVEITLIKDGAGL 140
           V +V+  + + + G ++ GD I S+NGK +       S +A  + G  V++ L+++G  L
Sbjct: 294 VSEVLPNSGSAKAG-VKAGDIITSLNGKPLNSFAELRSRIATTEPGTKVKLGLLRNGKPL 352

Query: 141 GFSLEGGKDSPFGDQPLTVKKIFTGGCAEKNGQLKAGDEILTINNV 186
              +     +        +     G     +GQLK G + + I+ V
Sbjct: 353 EVEVTLDTSTSSSASAEMITPALQGA-TLSDGQLKDGTKGIKIDEV 397


>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein.  This
            model represents a group of paralogous families in
            plasmodium species alternately annotated as reticulocyte
            binding protein, 235-kDa family protein and rhoptry
            protein. Rhoptry protein is localized on the cell surface
            and is extremely large (although apparently lacking in
            repeat structure) and is important for the process of
            invasion of the RBCs by the parasite. These proteins are
            found in P. falciparum, P. vivax and P. yoelii.
          Length = 2757

 Score = 31.6 bits (71), Expect = 1.7
 Identities = 36/114 (31%), Positives = 53/114 (46%), Gaps = 14/114 (12%)

Query: 338  ILNALEYPKSQDLNALGYPKTQDLNVLEYPKTRD-LNALR--YPKTQDLNALGHPKTQD- 393
            IL++L+  K   L  +  P+ +      Y K RD  + L   + K+QDL    H K QD 
Sbjct: 1927 ILSSLDSEKEDTLKFIPSPEKEPEI---YTKIRDSYDTLLDIFKKSQDL----HKKEQDT 1979

Query: 394  ---LNDTRFEYPKIQDLNALGYDLNALGHPKTQDLNDTRFEYPKIQDLNALGYD 444
               + + +  Y KIQ  N L   L+ L + K + LND +    K  +LN L  D
Sbjct: 1980 LNIIFENQQLYEKIQASNELKDTLSDLKYKKEKILNDVKLLLHKFDELNKLSCD 2033


>gnl|CDD|182820 PRK10898, PRK10898, serine endoprotease; Provisional.
          Length = 353

 Score = 30.7 bits (70), Expect = 2.1
 Identities = 12/30 (40%), Positives = 18/30 (60%), Gaps = 1/30 (3%)

Query: 80  ITVHKVISGTPADRDGRLQKGDRILSINGK 109
           I V++V    PA + G +Q  D I+S+N K
Sbjct: 281 IVVNEVSPDGPAAKAG-IQVNDLIISVNNK 309


>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
          Length = 697

 Score = 29.7 bits (68), Expect = 5.4
 Identities = 13/34 (38%), Positives = 19/34 (55%), Gaps = 3/34 (8%)

Query: 18  LISQQELQQLID---ESNQSLDEGGCPSLLIPGT 48
           L+ +QE+QQL+D   +    L E   P +L  GT
Sbjct: 503 LLGRQEVQQLLDRLAKEYPKLVEELVPKVLSLGT 536


>gnl|CDD|233085 TIGR00675, dcm, DNA-methyltransferase (dcm).  All proteins in this
           family for which functions are known are DNA-cytosine
           methyltransferases. This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 315

 Score = 29.2 bits (66), Expect = 6.0
 Identities = 12/49 (24%), Positives = 20/49 (40%), Gaps = 14/49 (28%)

Query: 296 NALGHPKTQDLNALIQDLNALGYDLNALGHPKTQDLNALGYKILNALEY 344
             + H K +    +I+ L  LGY +               YK+LNA ++
Sbjct: 111 GLVSHDKGRTFKVIIETLEELGYKVY--------------YKVLNAKDF 145


>gnl|CDD|132850 cd07211, Pat_PNPLA8, Patatin-like phospholipase domain containing
           protein 8.  PNPLA8 is a Ca-independent myocardial
           phospholipase which maintains mitochondrial integrity.
           PNPLA8 is also known as iPLA2-gamma. In humans, it is
           predominantly expressed in heart tissue. iPLA2-gamma can
           catalyze both phospholipase A1 and A2 reactions (PLA1
           and PLA2 respectively). This family includes PNPLA8
           (iPLA2-gamma) from Homo sapiens and iPLA2-2 from Mus
           musculus.
          Length = 308

 Score = 28.8 bits (65), Expect = 7.4
 Identities = 11/27 (40%), Positives = 18/27 (66%), Gaps = 3/27 (11%)

Query: 99  KGDRILSINGKSMKGLTHKESLAILKR 125
           +G RILSI+G   +G+    +L IL++
Sbjct: 6   RGIRILSIDGGGTRGVV---ALEILRK 29


>gnl|CDD|130585 TIGR01522, ATPase-IIA2_Ca, golgi membrane calcium-translocating
           P-type ATPase.  This model describes the P-type ATPase
           responsible for translocating calcium ions across the
           golgi membrane of fungi and animals , and is of
           particular importance in the sarcoplasmic reticulum of
           skeletal and cardiac muscle in vertebrates. The calcium
           P-type ATPases have been characterized as Type IIA based
           on a phylogenetic analysis which distinguishes this
           group from the Type IIB PMCA calcium pump modelled by
           TIGR01517. A separate analysis divides Type IIA into
           sub-types, SERCA and PMR1 the former of which is
           modelled by TIGR01116.
          Length = 884

 Score = 29.0 bits (65), Expect = 8.5
 Identities = 20/90 (22%), Positives = 34/90 (37%), Gaps = 25/90 (27%)

Query: 97  LQKGDRI-----------LSINGKSMKGLT---HKESLAILKRGPPVEI----------T 132
           L  GDR+           LSI+  ++ G T    K +  I       ++          T
Sbjct: 143 LSVGDRVPADLRIVEAVDLSIDESNLTGETTPVSKVTAPIPA-ATNGDLAERSNIAFMGT 201

Query: 133 LIKDGAGLGFSLEGGKDSPFGDQPLTVKKI 162
           L++ G G G  +  G ++ FG     ++ I
Sbjct: 202 LVRCGHGKGIVVGTGSNTEFGAVFKMMQAI 231


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.135    0.390 

Gapped
Lambda     K      H
   0.267   0.0695    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 27,901,729
Number of extensions: 2777871
Number of successful extensions: 1835
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1817
Number of HSP's successfully gapped: 61
Length of query: 548
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 446
Effective length of database: 6,413,494
Effective search space: 2860418324
Effective search space used: 2860418324
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 61 (27.3 bits)