RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy1022
         (245 letters)



>gnl|CDD|239885 cd04438, DEP_dishevelled, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain found in dishevelled-like proteins.
           Dishevelled-like proteins play a key role in the
           transduction of the Wnt signal from the cell surface to
           the nucleus, which in turn is an important regulatory
           pathway for cellular development and growth. They
           contain an N-terminal DIX domain, a central PDZ domain,
           and a C-terminal DEP domain.
          Length = 84

 Score =  114 bits (288), Expect = 5e-33
 Identities = 43/52 (82%), Positives = 47/52 (90%)

Query: 189 ADVVDWLDKHVEGFTDRREARKYASQMLKFGYIRHTVNKITFSEQCYYIFGD 240
           +D+VDWL  HVEG TDRREARKYAS +LK GYIRHTVNKITFSEQCYY+FGD
Sbjct: 33  SDLVDWLLSHVEGLTDRREARKYASSLLKLGYIRHTVNKITFSEQCYYVFGD 84


>gnl|CDD|238492 cd00992, PDZ_signaling, PDZ domain found in a variety of Eumetazoan
           signaling molecules, often in tandem arrangements. May
           be responsible for specific protein-protein
           interactions, as most PDZ domains bind C-terminal
           polypeptides, and binding to internal (non-C-terminal)
           polypeptides and even to lipids has been demonstrated.
           In this subfamily of PDZ domains an N-terminal
           beta-strand forms the peptide-binding groove base, a
           circular permutation with respect to PDZ domains found
           in proteases.
          Length = 82

 Score = 81.8 bits (203), Expect = 2e-20
 Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 6/88 (6%)

Query: 36  IITVTLNMDTVNFLGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDIN 95
           + TVTL  D    LG S+ G   K   GGI+V  +  GG     G +  GD IL+VN ++
Sbjct: 1   VRTVTLRKDPGGGLGFSLRG--GKDSGGGIFVSRVEPGGPAER-GGLRVGDRILEVNGVS 57

Query: 96  FENMSNDEAVRVLREVVQKPGPIKLVVA 123
            E ++++EAV +L+        + L V 
Sbjct: 58  VEGLTHEEAVELLKNS---GDEVTLTVR 82


>gnl|CDD|201332 pfam00595, PDZ, PDZ domain (Also known as DHR or GLGF).  PDZ
           domains are found in diverse signaling proteins.
          Length = 80

 Score = 79.2 bits (196), Expect = 2e-19
 Identities = 33/86 (38%), Positives = 47/86 (54%), Gaps = 6/86 (6%)

Query: 38  TVTLNMDTVNFLGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFE 97
            VTL       LG S+VG S+  GD GI+V  ++ GGA    G ++ GD IL +N  + E
Sbjct: 1   EVTLEKSGRGGLGFSLVGGSD--GDPGIFVSEVLPGGAAEAGG-LQEGDRILSINGQDLE 57

Query: 98  NMSNDEAVRVLREVVQKPGPIKLVVA 123
           N+S+DEAV  L+      G + L + 
Sbjct: 58  NLSHDEAVLALKG---SGGEVTLTIL 80


>gnl|CDD|214570 smart00228, PDZ, Domain present in PSD-95, Dlg, and ZO-1/2.  Also
           called DHR (Dlg homologous region) or GLGF (relatively
           well conserved tetrapeptide in these domains). Some PDZs
           have been shown to bind C-terminal polypeptides; others
           appear to bind internal (non-C-terminal) polypeptides.
           Different PDZs possess different binding specificities.
          Length = 85

 Score = 72.8 bits (179), Expect = 6e-17
 Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 7/88 (7%)

Query: 35  NIITVTLNMDTVNFLGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDI 94
               V L       LG S+VG   K   GG+ V S++ G   A  G +  GD+IL+VN  
Sbjct: 1   EPRLVELEKGG-GGLGFSLVG--GKDEGGGVVVSSVVPGSPAAKAG-LRVGDVILEVNGT 56

Query: 95  NFENMSNDEAVRVLREVVQKPGPIKLVV 122
           + E +++ EAV +L++     G + L V
Sbjct: 57  SVEGLTHLEAVDLLKKA---GGKVTLTV 81


>gnl|CDD|238080 cd00136, PDZ, PDZ domain, also called DHR (Dlg homologous region)
           or GLGF (after a conserved sequence motif). Many PDZ
           domains bind C-terminal polypeptides, though binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. Heterodimerization through
           PDZ-PDZ domain interactions adds to the domain's
           versatility, and PDZ domain-mediated interactions may be
           modulated dynamically through target phosphorylation.
           Some PDZ domains play a role in scaffolding
           supramolecular complexes. PDZ domains are found in
           diverse signaling proteins in bacteria, archaebacteria,
           and eukaryotes. This CD contains two distinct structural
           subgroups with either an N- or C-terminal beta-strand
           forming the peptide-binding groove base. The circular
           permutation placing the strand on the N-terminus appears
           to be found in Eumetazoa only, while the C-terminal
           variant is found in all three kingdoms of life, and
           seems to co-occur with protease domains. PDZ domains
           have been named after PSD95 (postsynaptic density
           protein), DlgA (Drosophila disc large tumor suppressor),
           and ZO1, a mammalian tight junction protein.
          Length = 70

 Score = 54.6 bits (132), Expect = 2e-10
 Identities = 23/74 (31%), Positives = 40/74 (54%), Gaps = 7/74 (9%)

Query: 49  LGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRVL 108
           LG SI G    G +GG+ V S+  G   A    ++ GD+IL VN  + +N++ ++   +L
Sbjct: 3   LGFSIRG----GTEGGVVVLSVEPGSP-AERAGLQAGDVILAVNGTDVKNLTLEDVAELL 57

Query: 109 REVVQKPGPIKLVV 122
           ++ V +   + L V
Sbjct: 58  KKEVGE--KVTLTV 69


>gnl|CDD|239836 cd04371, DEP, DEP domain, named after Dishevelled, Egl-10, and
           Pleckstrin, where this domain was first discovered. The
           function of this domain is still not clear, but it is
           believed to be important for the membrane association of
           the signaling proteins in which it is present. New
           studies show that the DEP domain of Sst2, a yeast RGS
           protein is necessary and sufficient for receptor
           interaction.
          Length = 81

 Score = 54.7 bits (132), Expect = 3e-10
 Identities = 16/51 (31%), Positives = 25/51 (49%), Gaps = 2/51 (3%)

Query: 189 ADVVDWLDKHVEGFTDRREARKYASQMLKFGYIRHTVN-KITFSEQCYYIF 238
           +++VDWL  ++E    R EA +    +LK G I H  + K TF +      
Sbjct: 32  SELVDWLLDNLEA-ITREEAVELGQALLKHGLIHHVSDDKHTFRDSYALYR 81


>gnl|CDD|216020 pfam00610, DEP, Domain found in Dishevelled, Egl-10, and Pleckstrin
           (DEP).  The DEP domain is responsible for mediating
           intracellular protein targeting and regulation of
           protein stability in the cell. The DEP domain is present
           in a number of signaling molecules, including Regulator
           of G protein Signaling (RGS) proteins, and has been
           implicated in membrane targeting. New findings in yeast,
           however, demonstrate a major role for a DEP domain in
           mediating the interaction of an RGS protein to the
           C-terminal tail of a GPCR, thus placing RGS in close
           proximity with its substrate G protein alpha subunit.
          Length = 74

 Score = 53.8 bits (130), Expect = 5e-10
 Identities = 18/54 (33%), Positives = 26/54 (48%), Gaps = 4/54 (7%)

Query: 189 ADVVDWLDKHVEGFT-DRREARKYASQMLKFGYIRHTVNKITF---SEQCYYIF 238
           ++ VDWL  + EG   DR EA +    +L  G I H  +K      S+  +Y F
Sbjct: 21  SEAVDWLMDNFEGLVIDREEAVELGQLLLDHGLIHHVGDKHRGFLDSKYSFYRF 74


>gnl|CDD|214489 smart00049, DEP, Domain found in Dishevelled, Egl-10, and
           Pleckstrin.  Domain of unknown function present in
           signalling proteins that contain PH, rasGEF, rhoGEF,
           rhoGAP, RGS, PDZ domains. DEP domain in Drosophila
           dishevelled is essential to rescue planar polarity
           defects and induce JNK signalling (Cell 94, 109-118).
          Length = 77

 Score = 50.7 bits (122), Expect = 9e-09
 Identities = 19/55 (34%), Positives = 26/55 (47%), Gaps = 4/55 (7%)

Query: 189 ADVVDWLDKHVEGFTDRREARKYASQMLKFGYIRHTV--NKITFS-EQCYYIFGD 240
           +++VDWL  ++E   DR EA      +L  G I H    NK TF   +  Y F  
Sbjct: 24  SELVDWLMDNLE-IIDREEAVHLGQLLLDEGLIHHVNGPNKHTFKDSKALYRFTT 77


>gnl|CDD|239896 cd04449, DEP_DEPDC5-like, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain found in DEPDC5-like proteins. DEPDC5, in human
           also known as KIAA0645, is a DEP domain containing
           protein of unknown function.
          Length = 83

 Score = 44.2 bits (105), Expect = 3e-06
 Identities = 11/43 (25%), Positives = 19/43 (44%)

Query: 188 SADVVDWLDKHVEGFTDRREARKYASQMLKFGYIRHTVNKITF 230
            ++ V WL  + E    R EA +   +++  G I H   +  F
Sbjct: 32  GSEAVSWLINNFEDVDTREEAVELGQELMNEGLIEHVSGRHPF 74


>gnl|CDD|223864 COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer
           membrane].
          Length = 406

 Score = 43.1 bits (102), Expect = 6e-05
 Identities = 31/103 (30%), Positives = 50/103 (48%), Gaps = 10/103 (9%)

Query: 50  GISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRVLR 109
           GI I  Q      GG+ V S + G   A  G I+PGD+I++++  +   +S DEAV+++R
Sbjct: 101 GIGIELQMED--IGGVKVVSPIDGSPAAKAG-IKPGDVIIKIDGKSVGGVSLDEAVKLIR 157

Query: 110 EVVQKPG-PIKLVVAKCWDPNPKGYFTIPRTEPVRPIDPGAWV 151
               KPG  + L + +     P    T+ R E    ++  A  
Sbjct: 158 G---KPGTKVTLTILRAGGGKPF-TVTLTREE--IELEDVAAK 194


>gnl|CDD|238488 cd00988, PDZ_CTP_protease, PDZ domain of C-terminal processing-,
           tail-specific-, and tricorn proteases, which function in
           posttranslational protein processing, maturation, and
           disassembly or degradation, in Bacteria, Archaea, and
           plant chloroplasts. May be responsible for substrate
           recognition and/or binding, as most PDZ domains bind
           C-terminal polypeptides, and binding to internal
           (non-C-terminal) polypeptides and even to lipids has
           been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 85

 Score = 37.6 bits (88), Expect = 5e-04
 Identities = 14/48 (29%), Positives = 29/48 (60%), Gaps = 1/48 (2%)

Query: 62  DGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRVLR 109
           DGG+ + S++ G   A    I+ GD+I+ ++    + +S ++ V++LR
Sbjct: 12  DGGLVITSVLPGS-PAAKAGIKAGDIIVAIDGEPVDGLSLEDVVKLLR 58


>gnl|CDD|239890 cd04443, DEP_GPR155, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain found in GPR155-like proteins. GPR155-like
           proteins, also known as PGR22, contain an N-terminal
           permease domain, a central transmembrane region and a
           C-terminal DEP domain. They are orphan receptors of the
           class B G protein-coupled receptors. Their function is
           unknown.
          Length = 83

 Score = 37.3 bits (87), Expect = 6e-04
 Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 6/52 (11%)

Query: 190 DVVDWLDKHVE-GFT-DRREARKYASQMLKFGYIRHTVNKITFSEQ-CYYIF 238
           D+V WL   +E G   DR EA  Y  ++L+ G ++H  N+  F ++   Y F
Sbjct: 35  DLVSWL---IEVGLAQDRGEAVLYGRRLLQGGVLQHITNEHHFRDENLLYRF 83


>gnl|CDD|238487 cd00987, PDZ_serine_protease, PDZ domain of trypsin-like serine
           proteases, such as DegP/HtrA, which are oligomeric
           proteins involved in heat-shock response, chaperone
           function, and apoptosis. May be responsible for
           substrate recognition and/or binding, as most PDZ
           domains bind C-terminal polypeptides, though binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 90

 Score = 35.3 bits (82), Expect = 0.004
 Identities = 20/82 (24%), Positives = 37/82 (45%), Gaps = 11/82 (13%)

Query: 48  FLGISIV-------GQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMS 100
           +LG+++         +       G+ V S+  G   A  G ++PGD+IL VN    +++ 
Sbjct: 2   WLGVTVQDLTPDLAEELGLKDTKGVLVASVDPGSPAAKAG-LKPGDVILAVNGKPVKSV- 59

Query: 101 NDEAVRVLREVVQKPGPIKLVV 122
             +  R L E ++    + L V
Sbjct: 60  -ADLRRALAE-LKPGDKVTLTV 79


>gnl|CDD|238489 cd00989, PDZ_metalloprotease, PDZ domain of bacterial and plant
           zinc metalloproteases, presumably membrane-associated or
           integral membrane proteases, which may be involved in
           signalling and regulatory mechanisms. May be responsible
           for substrate recognition and/or binding, as most PDZ
           domains bind C-terminal polypeptides, and binding to
           internal (non-C-terminal) polypeptides and even to
           lipids has been demonstrated. In this subfamily of
           protease-associated PDZ domains a C-terminal beta-strand
           forms the peptide-binding groove base, a circular
           permutation with respect to PDZ domains found in
           Eumetazoan signaling proteins.
          Length = 79

 Score = 34.9 bits (81), Expect = 0.004
 Identities = 18/76 (23%), Positives = 32/76 (42%), Gaps = 12/76 (15%)

Query: 48  FLGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRV 107
            LG    G   +       +G ++ G   A  G ++ GD IL +N    ++  +     +
Sbjct: 2   ILGFVPGGPPIE-----PVIGEVVPGSPAAKAG-LKAGDRILAINGQKIKSWED-----L 50

Query: 108 LREVVQKPG-PIKLVV 122
           +  V + PG P+ L V
Sbjct: 51  VDAVQENPGKPLTLTV 66


>gnl|CDD|239888 cd04441, DEP_2_DEP6, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain 2 found in DEP6-like proteins. DEP6 proteins
           contain two DEP and a PDZ domain. Their function is
           unknown.
          Length = 85

 Score = 32.4 bits (74), Expect = 0.033
 Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 7/53 (13%)

Query: 192 VDWLDKHVEGFTDRREARKYASQMLKFGYIRHTVNKITFSEQCYYIFGDLLQQ 244
           +DWL +  E    RREA +   ++L+ G I+H  NK  F +       +LL Q
Sbjct: 39  IDWLLQEGE-AESRREAVQLCRRLLEHGIIQHVSNKHHFFD------SNLLYQ 84


>gnl|CDD|232883 TIGR00225, prc, C-terminal peptidase (prc).  A C-terminal peptidase
           with different substrates in different species including
           processing of D1 protein of the photosystem II reaction
           center in higher plants and cleavage of a peptide of 11
           residues from the precursor form of penicillin-binding
           protein in E. coli. E. coli and H. influenzae have the
           most distal branch of the tree and their proteins have an
           N-terminal 200 amino acids that show no homology to
           other proteins in the database [Protein fate,
           Degradation of proteins, peptides, and glycopeptides,
           Protein fate, Protein modification and repair].
          Length = 334

 Score = 34.3 bits (79), Expect = 0.042
 Identities = 25/101 (24%), Positives = 40/101 (39%), Gaps = 11/101 (10%)

Query: 48  FLGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRV 107
             GI I    +   DG I + S  +G   A    I+PGD I+++N  +   MS D+AV +
Sbjct: 50  LEGIGIQVGMD---DGEIVIVSPFEGSP-AEKAGIKPGDKIIKINGKSVAGMSLDDAVAL 105

Query: 108 LREVVQKPG-PIKLVVAKCWDPNPKGYFTIPRTEPVRPIDP 147
           +R    K G  + L + +          T         +  
Sbjct: 106 IR---GKKGTKVSLEILR---AGKSKPLTFTLKRDRIELQT 140


>gnl|CDD|239897 cd04450, DEP_RGS7-like, DEP (Dishevelled, Egl-10, and Pleckstrin)
           domain found in RGS (regulator of G-protein signaling)
           proteins of the subfamily R7. This subgroup contains
           RGS7, RGS6, RGS9 and RGS11. They share a common domain
           architecture, containing, besides the RGS domain, a DEP
           domain and a GGL (G-protein gamma subunit-like) domain.
           RGS proteins are GTPase-activating (GAP) proteins of
           heterotrimeric G proteins by increasing the rate of GTP
           hydrolysis of the alpha subunit. The fungal homologs,
           like yeast Sst2, share a related common domain
           architecture, containing RGS and DEP domains. Sst2 has
           been identified as the principal regulator of mating
           pheromone signaling and recently the DEP domain of Sst2
           has been shown to be necessary and sufficient to mediate
           receptor interaction.
          Length = 88

 Score = 32.3 bits (74), Expect = 0.049
 Identities = 10/32 (31%), Positives = 16/32 (50%), Gaps = 1/32 (3%)

Query: 190 DVVDWLDKHVEGFTDRREARKYASQMLKFGYI 221
            +V WL    +   D  EA + A+  +K+G I
Sbjct: 33  AIVQWLMDCTD-VVDPSEALEIAALFVKYGLI 63


>gnl|CDD|236802 PRK10942, PRK10942, serine endoprotease; Provisional.
          Length = 473

 Score = 33.6 bits (77), Expect = 0.074
 Identities = 28/101 (27%), Positives = 46/101 (45%), Gaps = 24/101 (23%)

Query: 22  SSFSSI--------TDSSMSLNII--------TVTL------NMDTVN-FLGISIVGQSN 58
           SSF+++          S ++L ++         V L       +D+ N F GI     SN
Sbjct: 344 SSFAALRAQVGTMPVGSKLTLGLLRDGKPVNVNVELQQSSQNQVDSSNIFNGIEGAELSN 403

Query: 59  KGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENM 99
           KGGD G+ V ++  G   A  G ++ GD+I+  N    +N+
Sbjct: 404 KGGDKGVVVDNVKPGTPAAQIG-LKKGDVIIGANQQPVKNI 443


>gnl|CDD|234386 TIGR03900, prc_long_Delta, putative carboxyl-terminal-processing
           protease, deltaproteobacterial.  This model describes a
           multidomain protein of about 1070 residues, restricted
           to the order Myxococcales in the Deltaproteobacteria.
           Members contain a PDZ domain (pfam00595), an S41 family
           peptidase domain (pfam03572), and an SH3 domain
           (pfam06347). A core region of this family, including PDZ
           and S41 regions, is described by TIGR00225, C-terminal
           processing peptidase, which recognizes the Prc protease.
           The species distribution of this family approximates
           that of largely Deltaproteobacterial C-terminal putative
           protein-sorting domain, TIGR03901, analogous to LPXTG
           and PEP-CTERM, but the co-occurrence may reflect shared
           restriction to the Myxococcales rather than a
           substrate/target relationship.
          Length = 973

 Score = 32.1 bits (73), Expect = 0.33
 Identities = 18/61 (29%), Positives = 33/61 (54%), Gaps = 6/61 (9%)

Query: 49  LGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRVL 108
           LGI I  +     D  + V  ++ G   A  G ++  D+I++++D +  NM+ ++AV  L
Sbjct: 142 LGIVIGMR-----DRNLTVVRVIPGTPAARAG-LQRNDVIVKIDDESTVNMTLNDAVGRL 195

Query: 109 R 109
           R
Sbjct: 196 R 196


>gnl|CDD|201816 pfam01472, PUA, PUA domain.  The PUA domain named after
           Pseudouridine synthase and Archaeosine transglycosylase,
           was detected in archaeal and eukaryotic pseudouridine
           synthases, archaeal archaeosine synthases, a family of
           predicted ATPases that may be involved in RNA
           modification, a family of predicted archaeal and
           bacterial rRNA methylases. Additionally, the PUA domain
           was detected in a family of eukaryotic proteins that
           also contain a domain homologous to the translation
           initiation factor eIF1/SUI1; these proteins may comprise
           a novel type of translation factors. Unexpectedly, the
           PUA domain was detected also in bacterial and yeast
           glutamate kinases; this is compatible with the
           demonstrated role of these enzymes in the regulation of
           the expression of other genes. It is predicted that the
           PUA domain is an RNA binding domain.
          Length = 74

 Score = 29.0 bits (66), Expect = 0.51
 Identities = 12/53 (22%), Positives = 22/53 (41%), Gaps = 6/53 (11%)

Query: 69  SIMKGGAVALDGRIEPGDMILQVNDINFE------NMSNDEAVRVLREVVQKP 115
           S++  G V +DG    GD ++ V +          N S++E  ++      K 
Sbjct: 18  SLLAPGVVEVDGDFRRGDEVVVVTEKGELVAVGLANYSSEEMAKIKGGKAVKV 70


>gnl|CDD|234035 TIGR02860, spore_IV_B, stage IV sporulation protein B.  SpoIVB, the
           stage IV sporulation protein B of endospore-forming
           bacteria such as Bacillus subtilis, is a serine
           proteinase, expressed in the spore (rather than mother
           cell) compartment, that participates in a proteolytic
           activation cascade for Sigma-K. It appears to be
           universal among endospore-forming bacteria and occurs
           nowhere else [Cellular processes, Sporulation and
           germination].
          Length = 402

 Score = 31.2 bits (71), Expect = 0.54
 Identities = 22/89 (24%), Positives = 38/89 (42%), Gaps = 22/89 (24%)

Query: 37  ITVTLNMDTVNFLGISIVGQSN-KGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDIN 95
           I V LN       G+ +VG S+ +   G I+        +   +  I+ GD IL++N   
Sbjct: 98  IGVKLNTK-----GVLVVGFSDIETEKGKIH--------SPGEEAGIQIGDRILKINGEK 144

Query: 96  FENMSNDEAVRVLREVVQKPG--PIKLVV 122
            +NM +      L  ++ K G   + L +
Sbjct: 145 IKNMDD------LANLINKAGGEKLTLTI 167


>gnl|CDD|233695 TIGR02037, degP_htrA_DO, periplasmic serine protease, Do/DeqQ
           family.  This family consists of a set of proteins variously
           designated DegP, heat shock protein HtrA, and protease
           DO. The ortholog in Pseudomonas aeruginosa is designated
           MucD and is found in an operon that controls mucoid
           phenotype. This family also includes the DegQ (HhoA)
           paralog in E. coli which can rescue a DegP mutant, but
           not the smaller DegS paralog, which cannot. Members of
           this family are located in the periplasm and have
           separable functions as both protease and chaperone.
           Members have a trypsin domain and two copies of a PDZ
           domain. This protein protects bacteria from thermal and
           other stresses and may be important for the survival of
           bacterial pathogens. The chaperone function is
           dominant at low temperatures, whereas the proteolytic
           activity is turned on at elevated temperatures [Protein
           fate, Protein folding and stabilization, Protein fate,
           Degradation of proteins, peptides, and glycopeptides].
          Length = 428

 Score = 30.6 bits (70), Expect = 0.63
 Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 4/67 (5%)

Query: 56  QSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRVLREVVQKP 115
              KG   G+ V  ++ G   A  G ++PGD+IL VN    + +S+   +R +    +K 
Sbjct: 355 LRLKGDVKGVVVTKVVSGSPAARAG-LQPGDVILSVNQ---QPVSSVAELRKVLARAKKG 410

Query: 116 GPIKLVV 122
           G + L++
Sbjct: 411 GRVALLI 417



 Score = 29.9 bits (68), Expect = 1.2
 Identities = 9/38 (23%), Positives = 16/38 (42%), Gaps = 1/38 (2%)

Query: 63  GGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMS 100
            G  V  ++ G      G ++ GD+I  VN     + +
Sbjct: 257 RGALVAQVLPGSPAEKAG-LKAGDVITSVNGKPISSFA 293


>gnl|CDD|70841 pfam07390, P30, Mycoplasma P30 protein.  This family consists of
           several P30 proteins which seem to be specific to
           Mycoplasma agalactiae. P30 is a 30-kDa immunodominant
           antigen and is known to be a transmembrane protein.
          Length = 266

 Score = 29.8 bits (66), Expect = 1.3
 Identities = 12/29 (41%), Positives = 17/29 (58%)

Query: 118 IKLVVAKCWDPNPKGYFTIPRTEPVRPID 146
           I  V AKC + + K   T P+ EP +P+D
Sbjct: 19  IPFVAAKCSEDDKKEKVTKPKNEPTKPVD 47


>gnl|CDD|214635 smart00359, PUA, Putative RNA-binding Domain in PseudoUridine
           synthase and Archaeosine transglycosylase. 
          Length = 76

 Score = 27.6 bits (62), Expect = 1.4
 Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 8/52 (15%)

Query: 68  GSIMKGGAVALDGRIEPGDMILQVNDINFE-------NMSNDEAVRVLREVV 112
            S++  G V +DG I+ GD ++ + D   E       NMS++E  R+  + +
Sbjct: 17  ASLLAPGVVRVDGDIKEGD-VVVIVDEKGEPLGIGLANMSSEEIARIKGKGL 67


>gnl|CDD|113681 pfam04917, Shufflon_N, Bacterial shufflon protein, N-terminal
           constant region.  This family represents the
           high-similarity N-terminal 'constant region' shared by
           shufflon proteins.
          Length = 356

 Score = 29.6 bits (66), Expect = 1.4
 Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 1/32 (3%)

Query: 64  GIYVGSIMKGGAVALDGRIEPGDMILQVNDIN 95
           GIY G  +KGG V  DGR+  G+  LQ+    
Sbjct: 300 GIYTGGQVKGGTVRADGRLYTGE-YLQLEKTA 330


>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1.  TT virus (TTV), isolated
          initially from a Japanese patient with hepatitis of
          unknown aetiology, has since been found to infect both
          healthy and diseased individuals and numerous
          prevalence studies have raised questions about its role
          in unexplained hepatitis. ORF1 is a large 750 residue
          protein. The N-terminal half of this protein
          corresponds to the capsid protein.
          Length = 525

 Score = 29.9 bits (68), Expect = 1.5
 Identities = 10/18 (55%), Positives = 10/18 (55%)

Query: 3  RRRRPQRRRRHRPPALSR 20
          RRRR  RRRR R     R
Sbjct: 20 RRRRRARRRRRRRRVRRR 37



 Score = 27.2 bits (61), Expect = 9.4
 Identities = 9/18 (50%), Positives = 10/18 (55%)

Query: 3  RRRRPQRRRRHRPPALSR 20
          RRRR + RRR R     R
Sbjct: 28 RRRRRRVRRRRRGRRRRR 45


>gnl|CDD|132166 TIGR03122, one_C_dehyd_C, formylmethanofuran dehydrogenase subunit
           C.  Members of this largely archaeal protein family are
           subunit C of the formylmethanofuran dehydrogenase.
           Nomenclature in some bacteria may reflect inclusion of
           the formyltransferase described by TIGR03119 as part of
           the complex, and therefore call this protein
           formyltransferase/hydrolase complex Fhc subunit C. Note
           that this model does not distinguish tungsten (FwdC)
           from molybdenum-containing (FmdC) forms of this enzyme.
          Length = 260

 Score = 29.2 bits (66), Expect = 1.5
 Identities = 11/30 (36%), Positives = 17/30 (56%), Gaps = 3/30 (10%)

Query: 61  GDGGIYVGSIMKGGAVALDGRIEP---GDM 87
           G+ GI+ G  M GG + +DG +     G+M
Sbjct: 170 GNAGIFAGIHMNGGTIIIDGDVGRRPGGEM 199



 Score = 26.9 bits (60), Expect = 8.5
 Identities = 10/20 (50%), Positives = 16/20 (80%)

Query: 61  GDGGIYVGSIMKGGAVALDG 80
           GD G++VG+ MKGG + ++G
Sbjct: 87  GDVGMHVGAEMKGGKIVVNG 106


>gnl|CDD|226011 COG3480, SdrC, Predicted secreted protein containing a PDZ domain
           [Signal transduction mechanisms].
          Length = 342

 Score = 29.3 bits (66), Expect = 1.7
 Identities = 17/53 (32%), Positives = 28/53 (52%), Gaps = 6/53 (11%)

Query: 64  GIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRVLREVVQKPG 116
           G+YV S++        G++E GD I+ V+   F   S+DE +  +    +KPG
Sbjct: 131 GVYVLSVIDNSPFK--GKLEAGDTIIAVDGEPFT--SSDELIDYVSS--KKPG 177


>gnl|CDD|238486 cd00986, PDZ_LON_protease, PDZ domain of ATP-dependent LON serine
          proteases. Most PDZ domains bind C-terminal
          polypeptides, though binding to internal
          (non-C-terminal) polypeptides and even to lipids has
          been demonstrated. In this bacterial subfamily of
          protease-associated PDZ domains a C-terminal
          beta-strand  is thought to form the peptide-binding
          groove base, a circular permutation with respect to PDZ
          domains found in Eumetazoan signaling proteins.
          Length = 79

 Score = 27.0 bits (60), Expect = 2.8
 Identities = 11/34 (32%), Positives = 22/34 (64%), Gaps = 2/34 (5%)

Query: 64 GIYVGSIMKGGAVALDGRIEPGDMILQVNDINFE 97
          G+YV S+++G  +   G+++ GD I+ V+   F+
Sbjct: 9  GVYVTSVVEG--MPAAGKLKAGDHIIAVDGKPFK 40


>gnl|CDD|183172 PRK11517, PRK11517, transcriptional regulatory protein YedW;
           Provisional.
          Length = 223

 Score = 28.3 bits (63), Expect = 3.0
 Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 3/45 (6%)

Query: 80  GRIEPGDMIL-QVNDINFENMSN--DEAVRVLREVVQKPGPIKLV 121
           G I P  +I  ++  INF++ +N  D A+R LR  V  P P KL+
Sbjct: 164 GEIIPRTVIASEIWGINFDSDTNTVDVAIRRLRAKVDDPFPEKLI 208


>gnl|CDD|225128 COG2218, FwdC, Formylmethanofuran dehydrogenase subunit C [Energy
           production and conversion].
          Length = 264

 Score = 28.5 bits (64), Expect = 3.4
 Identities = 11/23 (47%), Positives = 15/23 (65%)

Query: 61  GDGGIYVGSIMKGGAVALDGRIE 83
           GD G +VG  MKGG + +DG+  
Sbjct: 194 GDAGDFVGGEMKGGTIVVDGKAG 216


>gnl|CDD|240550 cd13145, MATE_like_5, Uncharacterized subfamily of the multidrug
           and toxic compound extrusion (MATE) proteins.  The
           integral membrane proteins from the MATE family are
           involved in exporting metabolites across the cell
           membrane and are responsible for multidrug resistance
           (MDR) in many bacteria and animals. A number of family
           members are involved in the synthesis of peptidoglycan
           components in bacteria.
          Length = 440

 Score = 28.3 bits (64), Expect = 4.0
 Identities = 12/18 (66%), Positives = 15/18 (83%)

Query: 204 DRREARKYASQMLKFGYI 221
           DRR+AR+YA+Q L FG I
Sbjct: 80  DRRKARRYAAQGLSFGII 97


>gnl|CDD|238480 cd00980, FwdC/FmdC, FwdC/FmdC. This domain of unknown function is
           found in the subunit C of formylmethanofuran
           dehydrogenase, an enzyme that catalyzes the first step
           in methane formation from CO2 in methanogenic archaea,
           hyperthermophiles and bacteria. There are two
           isoenzymes, a tungsten-containing isoenzyme (Fwd) and a
           molybdenum-containing isoenzyme (Fmd). The subunits C of
           both isoenzymes (FwdC/FmdC) are characterized by a
           repeated GXXGXXXG motif.
          Length = 203

 Score = 27.7 bits (62), Expect = 4.9
 Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 60  GGDGGIYVGSIMKGGAVALDGRI 82
            GD GI+ G  M GG + + G  
Sbjct: 128 KGDAGIFAGIRMNGGTIIVRGDA 150


>gnl|CDD|238481 cd00981, arch_gltB, Archaeal-type gltB domain. This domain shares
           sequence similarity with a region of unknown function
           found in the large subunit of glutamate synthase, which
           is encoded by gltB and found in most bacteria and
           eukaryotes.  It is predicted to be homologous to the
           C-terminal domain of glutamate synthase based upon
           sequence similarity coupled with genome organization
           data, showing that this domain is found in a gene
           cluster with other domains of Glts, which are annotated.
           This domain is found primarily in archaea, but is also
           present in a few bacteria, likely as a result of lateral
           gene transfer.
          Length = 232

 Score = 27.3 bits (61), Expect = 7.0
 Identities = 16/56 (28%), Positives = 25/56 (44%), Gaps = 3/56 (5%)

Query: 50  GISIV-GQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEA 104
           G+ IV G        G Y+G+ M GG + + G++E   +  +V    FE    D  
Sbjct: 147 GVIIVLGLGTDEEPVGRYIGTGMHGGVIYIRGKVERSKLGKEV--PKFELTEEDLE 200


>gnl|CDD|215143 PLN02255, PLN02255, H(+) -translocating inorganic pyrophosphatase.
          Length = 765

 Score = 27.5 bits (61), Expect = 8.0
 Identities = 18/49 (36%), Positives = 23/49 (46%), Gaps = 3/49 (6%)

Query: 126 WDPNPKGYFTIPRTEPVRPIDPGAWVAHTAAIRGD--GFPLRPPSVSTL 172
           WD N K Y     +E  R + P     H AA+ GD  G PL+  S  +L
Sbjct: 689 WD-NAKKYIEAGASEHARSLGPKGSDPHKAAVIGDTIGDPLKDTSGPSL 736


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 27.2 bits (60), Expect = 8.7
 Identities = 10/31 (32%), Positives = 20/31 (64%), Gaps = 1/31 (3%)

Query: 3   RRRRPQRRRRHRPPALSRTSSFSSITDSSMS 33
           RRRR +  R +RPP+     S S++ +++++
Sbjct: 315 RRRRRRPARIYRPPSPV-APSISAVNEAALA 344


>gnl|CDD|222616 pfam14239, RRXRR, RRXRR protein.  This domain is found in bacteria,
           eukaryotes and viruses, and is approximately 180 amino
           acids in length. It contains a conserved RRXRR motif. It
           is often found in association with pfam01844.
          Length = 174

 Score = 26.8 bits (60), Expect = 8.8
 Identities = 8/16 (50%), Positives = 10/16 (62%)

Query: 1   MSRRRRPQRRRRHRPP 16
             RRRR  R+ R+R P
Sbjct: 95  RLRRRRRNRKTRYRKP 110


>gnl|CDD|133178 cd05046, PTK_CCK4, Pseudokinase domain of the Protein Tyrosine
           Kinase, Colon Carcinoma Kinase 4.  Protein Tyrosine
           Kinase (PTK) family; Colon Carcinoma Kinase 4 (CCK4);
           pseudokinase domain. The PTKc (catalytic domain) family,
           to which this subfamily belongs, includes the catalytic
           domains of other kinases such as protein
           serine/threonine kinases, RIO kinases, and
           phosphoinositide 3-kinase (PI3K). PTKs catalyze the
           transfer of the gamma-phosphoryl group from ATP to
           tyrosine (tyr) residues in protein substrates. CCK4,
           also called protein tyrosine kinase 7 (PTK7), is an
           orphan receptor tyr kinase (RTK) containing an
           extracellular region with seven immunoglobulin domains,
           a transmembrane segment, and an intracellular inactive
           pseudokinase domain. Studies in mice reveal that CCK4 is
           essential for neural development. Mouse embryos
           containing a truncated CCK4 die perinatally and display
           craniorachischisis, a severe form of neural tube defect.
           The mechanism of action of the CCK4 pseudokinase is
           still unknown. Other pseudokinases such as HER3 rely on
           the activity of partner RTKs.
          Length = 275

 Score = 27.0 bits (60), Expect = 9.7
 Identities = 10/43 (23%), Positives = 20/43 (46%), Gaps = 7/43 (16%)

Query: 96  FENMSNDEAVRVLREVVQK-------PGPIKLVVAKCWDPNPK 131
           F  +S++E +  L+    +       P  +  ++ +CW  NPK
Sbjct: 219 FYGLSDEEVLNRLQAGKLELPVPEGCPSRLYKLMTRCWAVNPK 261


>gnl|CDD|187769 cd09638, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2. 
           CRISPR (Clustered Regularly Interspaced Short
           Palindromic Repeats) and associated Cas proteins
           comprise a system for heritable host defense by
           prokaryotic cells against phage and other foreign DNA;
           Cas2 is present in majority of CRISPR/Cas systems along
           with Cas1; RNAse specific to U-rich regions; Possesses
           an RRM/ferredoxin fold.
          Length = 90

 Score = 25.8 bits (57), Expect = 9.9
 Identities = 7/21 (33%), Positives = 11/21 (52%)

Query: 203 TDRREARKYASQMLKFGYIRH 223
           T+R+  RK    + K+G  R 
Sbjct: 13  TERKRRRKLRKLLEKYGLFRV 33


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.319    0.135    0.399 

Gapped
Lambda     K      H
   0.267   0.0757    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,573,417
Number of extensions: 1174227
Number of successful extensions: 1665
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1638
Number of HSP's successfully gapped: 60
Length of query: 245
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 151
Effective length of database: 6,768,326
Effective search space: 1022017226
Effective search space used: 1022017226
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 58 (26.1 bits)