RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1022
(245 letters)
>gnl|CDD|239885 cd04438, DEP_dishevelled, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain found in dishevelled-like proteins.
Dishevelled-like proteins play a key role in the
transduction of the Wnt signal from the cell surface to
the nucleus, which in turn is an important regulatory
pathway for cellular development and growth. They
contain an N-terminal DIX domain, a central PDZ domain,
and a C-terminal DEP domain.
Length = 84
Score = 114 bits (288), Expect = 5e-33
Identities = 43/52 (82%), Positives = 47/52 (90%)
Query: 189 ADVVDWLDKHVEGFTDRREARKYASQMLKFGYIRHTVNKITFSEQCYYIFGD 240
+D+VDWL HVEG TDRREARKYAS +LK GYIRHTVNKITFSEQCYY+FGD
Sbjct: 33 SDLVDWLLSHVEGLTDRREARKYASSLLKLGYIRHTVNKITFSEQCYYVFGD 84
>gnl|CDD|238492 cd00992, PDZ_signaling, PDZ domain found in a variety of Eumetazoan
signaling molecules, often in tandem arrangements. May
be responsible for specific protein-protein
interactions, as most PDZ domains bind C-terminal
polypeptides, and binding to internal (non-C-terminal)
polypeptides and even to lipids has been demonstrated.
In this subfamily of PDZ domains an N-terminal
beta-strand forms the peptide-binding groove base, a
circular permutation with respect to PDZ domains found
in proteases.
Length = 82
Score = 81.8 bits (203), Expect = 2e-20
Identities = 30/88 (34%), Positives = 45/88 (51%), Gaps = 6/88 (6%)
Query: 36 IITVTLNMDTVNFLGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDIN 95
+ TVTL D LG S+ G K GGI+V + GG G + GD IL+VN ++
Sbjct: 1 VRTVTLRKDPGGGLGFSLRG--GKDSGGGIFVSRVEPGGPAER-GGLRVGDRILEVNGVS 57
Query: 96 FENMSNDEAVRVLREVVQKPGPIKLVVA 123
E ++++EAV +L+ + L V
Sbjct: 58 VEGLTHEEAVELLKNS---GDEVTLTVR 82
>gnl|CDD|201332 pfam00595, PDZ, PDZ domain (Also known as DHR or GLGF). PDZ
domains are found in diverse signaling proteins.
Length = 80
Score = 79.2 bits (196), Expect = 2e-19
Identities = 33/86 (38%), Positives = 47/86 (54%), Gaps = 6/86 (6%)
Query: 38 TVTLNMDTVNFLGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFE 97
VTL LG S+VG S+ GD GI+V ++ GGA G ++ GD IL +N + E
Sbjct: 1 EVTLEKSGRGGLGFSLVGGSD--GDPGIFVSEVLPGGAAEAGG-LQEGDRILSINGQDLE 57
Query: 98 NMSNDEAVRVLREVVQKPGPIKLVVA 123
N+S+DEAV L+ G + L +
Sbjct: 58 NLSHDEAVLALKG---SGGEVTLTIL 80
>gnl|CDD|214570 smart00228, PDZ, Domain present in PSD-95, Dlg, and ZO-1/2. Also
called DHR (Dlg homologous region) or GLGF (relatively
well conserved tetrapeptide in these domains). Some PDZs
have been shown to bind C-terminal polypeptides; others
appear to bind internal (non-C-terminal) polypeptides.
Different PDZs possess different binding specificities.
Length = 85
Score = 72.8 bits (179), Expect = 6e-17
Identities = 29/88 (32%), Positives = 44/88 (50%), Gaps = 7/88 (7%)
Query: 35 NIITVTLNMDTVNFLGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDI 94
V L LG S+VG K GG+ V S++ G A G + GD+IL+VN
Sbjct: 1 EPRLVELEKGG-GGLGFSLVG--GKDEGGGVVVSSVVPGSPAAKAG-LRVGDVILEVNGT 56
Query: 95 NFENMSNDEAVRVLREVVQKPGPIKLVV 122
+ E +++ EAV +L++ G + L V
Sbjct: 57 SVEGLTHLEAVDLLKKA---GGKVTLTV 81
>gnl|CDD|238080 cd00136, PDZ, PDZ domain, also called DHR (Dlg homologous region)
or GLGF (after a conserved sequence motif). Many PDZ
domains bind C-terminal polypeptides, though binding to
internal (non-C-terminal) polypeptides and even to
lipids has been demonstrated. Heterodimerization through
PDZ-PDZ domain interactions adds to the domain's
versatility, and PDZ domain-mediated interactions may be
modulated dynamically through target phosphorylation.
Some PDZ domains play a role in scaffolding
supramolecular complexes. PDZ domains are found in
diverse signaling proteins in bacteria, archebacteria,
and eurkayotes. This CD contains two distinct structural
subgroups with either a N- or C-terminal beta-strand
forming the peptide-binding groove base. The circular
permutation placing the strand on the N-terminus appears
to be found in Eumetazoa only, while the C-terminal
variant is found in all three kingdoms of life, and
seems to co-occur with protease domains. PDZ domains
have been named after PSD95(post synaptic density
protein), DlgA (Drosophila disc large tumor suppressor),
and ZO1, a mammalian tight junction protein.
Length = 70
Score = 54.6 bits (132), Expect = 2e-10
Identities = 23/74 (31%), Positives = 40/74 (54%), Gaps = 7/74 (9%)
Query: 49 LGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRVL 108
LG SI G G +GG+ V S+ G A ++ GD+IL VN + +N++ ++ +L
Sbjct: 3 LGFSIRG----GTEGGVVVLSVEPGSP-AERAGLQAGDVILAVNGTDVKNLTLEDVAELL 57
Query: 109 REVVQKPGPIKLVV 122
++ V + + L V
Sbjct: 58 KKEVGE--KVTLTV 69
>gnl|CDD|239836 cd04371, DEP, DEP domain, named after Dishevelled, Egl-10, and
Pleckstrin, where this domain was first discovered. The
function of this domain is still not clear, but it is
believed to be important for the membrane association of
the signaling proteins in which it is present. New
studies show that the DEP domain of Sst2, a yeast RGS
protein is necessary and sufficient for receptor
interaction.
Length = 81
Score = 54.7 bits (132), Expect = 3e-10
Identities = 16/51 (31%), Positives = 25/51 (49%), Gaps = 2/51 (3%)
Query: 189 ADVVDWLDKHVEGFTDRREARKYASQMLKFGYIRHTVN-KITFSEQCYYIF 238
+++VDWL ++E R EA + +LK G I H + K TF +
Sbjct: 32 SELVDWLLDNLEA-ITREEAVELGQALLKHGLIHHVSDDKHTFRDSYALYR 81
>gnl|CDD|216020 pfam00610, DEP, Domain found in Dishevelled, Egl-10, and Pleckstrin
(DEP). The DEP domain is responsible for mediating
intracellular protein targeting and regulation of
protein stability in the cell. The DEP domain is present
in a number of signaling molecules, including Regulator
of G protein Signaling (RGS) proteins, and has been
implicated in membrane targeting. New findings in yeast,
however, demonstrate a major role for a DEP domain in
mediating the interaction of an RGS protein to the
C-terminal tail of a GPCR, thus placing RGS in close
proximity with its substrate G protein alpha subunit.
Length = 74
Score = 53.8 bits (130), Expect = 5e-10
Identities = 18/54 (33%), Positives = 26/54 (48%), Gaps = 4/54 (7%)
Query: 189 ADVVDWLDKHVEGFT-DRREARKYASQMLKFGYIRHTVNKITF---SEQCYYIF 238
++ VDWL + EG DR EA + +L G I H +K S+ +Y F
Sbjct: 21 SEAVDWLMDNFEGLVIDREEAVELGQLLLDHGLIHHVGDKHRGFLDSKYSFYRF 74
>gnl|CDD|214489 smart00049, DEP, Domain found in Dishevelled, Egl-10, and
Pleckstrin. Domain of unknown function present in
signalling proteins that contain PH, rasGEF, rhoGEF,
rhoGAP, RGS, PDZ domains. DEP domain in Drosophila
dishevelled is essential to rescue planar polarity
defects and induce JNK signalling (Cell 94, 109-118).
Length = 77
Score = 50.7 bits (122), Expect = 9e-09
Identities = 19/55 (34%), Positives = 26/55 (47%), Gaps = 4/55 (7%)
Query: 189 ADVVDWLDKHVEGFTDRREARKYASQMLKFGYIRHTV--NKITFS-EQCYYIFGD 240
+++VDWL ++E DR EA +L G I H NK TF + Y F
Sbjct: 24 SELVDWLMDNLE-IIDREEAVHLGQLLLDEGLIHHVNGPNKHTFKDSKALYRFTT 77
>gnl|CDD|239896 cd04449, DEP_DEPDC5-like, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain found in DEPDC5-like proteins. DEPDC5, in human
also known as KIAA0645, is a DEP domain containing
protein of unknown function.
Length = 83
Score = 44.2 bits (105), Expect = 3e-06
Identities = 11/43 (25%), Positives = 19/43 (44%)
Query: 188 SADVVDWLDKHVEGFTDRREARKYASQMLKFGYIRHTVNKITF 230
++ V WL + E R EA + +++ G I H + F
Sbjct: 32 GSEAVSWLINNFEDVDTREEAVELGQELMNEGLIEHVSGRHPF 74
>gnl|CDD|223864 COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer
membrane].
Length = 406
Score = 43.1 bits (102), Expect = 6e-05
Identities = 31/103 (30%), Positives = 50/103 (48%), Gaps = 10/103 (9%)
Query: 50 GISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRVLR 109
GI I Q GG+ V S + G A G I+PGD+I++++ + +S DEAV+++R
Sbjct: 101 GIGIELQMED--IGGVKVVSPIDGSPAAKAG-IKPGDVIIKIDGKSVGGVSLDEAVKLIR 157
Query: 110 EVVQKPG-PIKLVVAKCWDPNPKGYFTIPRTEPVRPIDPGAWV 151
KPG + L + + P T+ R E ++ A
Sbjct: 158 G---KPGTKVTLTILRAGGGKPF-TVTLTREE--IELEDVAAK 194
>gnl|CDD|238488 cd00988, PDZ_CTP_protease, PDZ domain of C-terminal processing-,
tail-specific-, and tricorn proteases, which function in
posttranslational protein processing, maturation, and
disassembly or degradation, in Bacteria, Archaea, and
plant chloroplasts. May be responsible for substrate
recognition and/or binding, as most PDZ domains bind
C-terminal polypeptides, and binding to internal
(non-C-terminal) polypeptides and even to lipids has
been demonstrated. In this subfamily of
protease-associated PDZ domains a C-terminal beta-strand
forms the peptide-binding groove base, a circular
permutation with respect to PDZ domains found in
Eumetazoan signaling proteins.
Length = 85
Score = 37.6 bits (88), Expect = 5e-04
Identities = 14/48 (29%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 62 DGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRVLR 109
DGG+ + S++ G A I+ GD+I+ ++ + +S ++ V++LR
Sbjct: 12 DGGLVITSVLPGS-PAAKAGIKAGDIIVAIDGEPVDGLSLEDVVKLLR 58
>gnl|CDD|239890 cd04443, DEP_GPR155, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain found in GPR155-like proteins. GRP155-like
proteins, also known as PGR22, contain an N-terminal
permease domain, a central transmembrane region and a
C-terminal DEP domain. They are orphan receptors of the
class B G protein-coupled receptors. Their function is
unknown.
Length = 83
Score = 37.3 bits (87), Expect = 6e-04
Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 6/52 (11%)
Query: 190 DVVDWLDKHVE-GFT-DRREARKYASQMLKFGYIRHTVNKITFSEQ-CYYIF 238
D+V WL +E G DR EA Y ++L+ G ++H N+ F ++ Y F
Sbjct: 35 DLVSWL---IEVGLAQDRGEAVLYGRRLLQGGVLQHITNEHHFRDENLLYRF 83
>gnl|CDD|238487 cd00987, PDZ_serine_protease, PDZ domain of tryspin-like serine
proteases, such as DegP/HtrA, which are oligomeric
proteins involved in heat-shock response, chaperone
function, and apoptosis. May be responsible for
substrate recognition and/or binding, as most PDZ
domains bind C-terminal polypeptides, though binding to
internal (non-C-terminal) polypeptides and even to
lipids has been demonstrated. In this subfamily of
protease-associated PDZ domains a C-terminal beta-strand
forms the peptide-binding groove base, a circular
permutation with respect to PDZ domains found in
Eumetazoan signaling proteins.
Length = 90
Score = 35.3 bits (82), Expect = 0.004
Identities = 20/82 (24%), Positives = 37/82 (45%), Gaps = 11/82 (13%)
Query: 48 FLGISIV-------GQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMS 100
+LG+++ + G+ V S+ G A G ++PGD+IL VN +++
Sbjct: 2 WLGVTVQDLTPDLAEELGLKDTKGVLVASVDPGSPAAKAG-LKPGDVILAVNGKPVKSV- 59
Query: 101 NDEAVRVLREVVQKPGPIKLVV 122
+ R L E ++ + L V
Sbjct: 60 -ADLRRALAE-LKPGDKVTLTV 79
>gnl|CDD|238489 cd00989, PDZ_metalloprotease, PDZ domain of bacterial and plant
zinc metalloprotases, presumably membrane-associated or
integral membrane proteases, which may be involved in
signalling and regulatory mechanisms. May be responsible
for substrate recognition and/or binding, as most PDZ
domains bind C-terminal polypeptides, and binding to
internal (non-C-terminal) polypeptides and even to
lipids has been demonstrated. In this subfamily of
protease-associated PDZ domains a C-terminal beta-strand
forms the peptide-binding groove base, a circular
permutation with respect to PDZ domains found in
Eumetazoan signaling proteins.
Length = 79
Score = 34.9 bits (81), Expect = 0.004
Identities = 18/76 (23%), Positives = 32/76 (42%), Gaps = 12/76 (15%)
Query: 48 FLGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRV 107
LG G + +G ++ G A G ++ GD IL +N ++ + +
Sbjct: 2 ILGFVPGGPPIE-----PVIGEVVPGSPAAKAG-LKAGDRILAINGQKIKSWED-----L 50
Query: 108 LREVVQKPG-PIKLVV 122
+ V + PG P+ L V
Sbjct: 51 VDAVQENPGKPLTLTV 66
>gnl|CDD|239888 cd04441, DEP_2_DEP6, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain 2 found in DEP6-like proteins. DEP6 proteins
contain two DEP and a PDZ domain. Their function is
unknown.
Length = 85
Score = 32.4 bits (74), Expect = 0.033
Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 7/53 (13%)
Query: 192 VDWLDKHVEGFTDRREARKYASQMLKFGYIRHTVNKITFSEQCYYIFGDLLQQ 244
+DWL + E RREA + ++L+ G I+H NK F + +LL Q
Sbjct: 39 IDWLLQEGE-AESRREAVQLCRRLLEHGIIQHVSNKHHFFD------SNLLYQ 84
>gnl|CDD|232883 TIGR00225, prc, C-terminal peptidase (prc). A C-terminal peptidase
with different substrates in different species including
processing of D1 protein of the photosystem II reaction
center in higher plants and cleavage of a peptide of 11
residues from the precursor form of penicillin-binding
protein in E.coli E.coli and H influenza have the most
distal branch of the tree and their proteins have an
N-terminal 200 amino acids that show no homology to
other proteins in the database [Protein fate,
Degradation of proteins, peptides, and glycopeptides,
Protein fate, Protein modification and repair].
Length = 334
Score = 34.3 bits (79), Expect = 0.042
Identities = 25/101 (24%), Positives = 40/101 (39%), Gaps = 11/101 (10%)
Query: 48 FLGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRV 107
GI I + DG I + S +G A I+PGD I+++N + MS D+AV +
Sbjct: 50 LEGIGIQVGMD---DGEIVIVSPFEGSP-AEKAGIKPGDKIIKINGKSVAGMSLDDAVAL 105
Query: 108 LREVVQKPG-PIKLVVAKCWDPNPKGYFTIPRTEPVRPIDP 147
+R K G + L + + T +
Sbjct: 106 IR---GKKGTKVSLEILR---AGKSKPLTFTLKRDRIELQT 140
>gnl|CDD|239897 cd04450, DEP_RGS7-like, DEP (Dishevelled, Egl-10, and Pleckstrin)
domain found in RGS (regulator of G-protein signaling)
proteins of the subfamily R7. This subgroup contains
RGS7, RGS6, RGS9 and RGS11. They share a common domain
architecture, containing, beside the RGS domain, a DEP
domain and a GGL (G-protein gamma subunit-like ) domain.
RGS proteins are GTPase-activating (GAP) proteins of
heterotrimeric G proteins by increasing the rate of GTP
hydrolysis of the alpha subunit. The fungal homologs,
like yeast Sst2, share a related common domain
architecture, containing RGS and DEP domains. Sst2 has
been identified as the principal regulator of mating
pheromone signaling and recently the DEP domain of Sst2
has been shown to be necessary and sufficient to mediate
receptor interaction.
Length = 88
Score = 32.3 bits (74), Expect = 0.049
Identities = 10/32 (31%), Positives = 16/32 (50%), Gaps = 1/32 (3%)
Query: 190 DVVDWLDKHVEGFTDRREARKYASQMLKFGYI 221
+V WL + D EA + A+ +K+G I
Sbjct: 33 AIVQWLMDCTD-VVDPSEALEIAALFVKYGLI 63
>gnl|CDD|236802 PRK10942, PRK10942, serine endoprotease; Provisional.
Length = 473
Score = 33.6 bits (77), Expect = 0.074
Identities = 28/101 (27%), Positives = 46/101 (45%), Gaps = 24/101 (23%)
Query: 22 SSFSSI--------TDSSMSLNII--------TVTL------NMDTVN-FLGISIVGQSN 58
SSF+++ S ++L ++ V L +D+ N F GI SN
Sbjct: 344 SSFAALRAQVGTMPVGSKLTLGLLRDGKPVNVNVELQQSSQNQVDSSNIFNGIEGAELSN 403
Query: 59 KGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENM 99
KGGD G+ V ++ G A G ++ GD+I+ N +N+
Sbjct: 404 KGGDKGVVVDNVKPGTPAAQIG-LKKGDVIIGANQQPVKNI 443
>gnl|CDD|234386 TIGR03900, prc_long_Delta, putative carboxyl-terminal-processing
protease, deltaproteobacterial. This model describes a
multidomain protein of about 1070 residues, restricted
to the order Myxococcales in the Deltaproteobacteria.
Members contain a PDZ domain (pfam00595), an S41 family
peptidase domain (pfam03572), and an SH3 domain
(pfam06347). A core region of this family, including PDZ
and S41 regions, is described by TIGR00225, C-terminal
processing peptidase, which recognizes the Prc protease.
The species distribution of this family approximates
that of largely Deltaproteobacterial C-terminal putative
protein-sorting domain, TIGR03901, analogous to LPXTG
and PEP-CTERM, but the co-occurrence may reflect shared
restriction to the Myxococcales rather than a
substrate/target relationship.
Length = 973
Score = 32.1 bits (73), Expect = 0.33
Identities = 18/61 (29%), Positives = 33/61 (54%), Gaps = 6/61 (9%)
Query: 49 LGISIVGQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRVL 108
LGI I + D + V ++ G A G ++ D+I++++D + NM+ ++AV L
Sbjct: 142 LGIVIGMR-----DRNLTVVRVIPGTPAARAG-LQRNDVIVKIDDESTVNMTLNDAVGRL 195
Query: 109 R 109
R
Sbjct: 196 R 196
>gnl|CDD|201816 pfam01472, PUA, PUA domain. The PUA domain named after
Pseudouridine synthase and Archaeosine transglycosylase,
was detected in archaeal and eukaryotic pseudouridine
synthases, archaeal archaeosine synthases, a family of
predicted ATPases that may be involved in RNA
modification, a family of predicted archaeal and
bacterial rRNA methylases. Additionally, the PUA domain
was detected in a family of eukaryotic proteins that
also contain a domain homologous to the translation
initiation factor eIF1/SUI1; these proteins may comprise
a novel type of translation factors. Unexpectedly, the
PUA domain was detected also in bacterial and yeast
glutamate kinases; this is compatible with the
demonstrated role of these enzymes in the regulation of
the expression of other genes. It is predicted that the
PUA domain is an RNA binding domain.
Length = 74
Score = 29.0 bits (66), Expect = 0.51
Identities = 12/53 (22%), Positives = 22/53 (41%), Gaps = 6/53 (11%)
Query: 69 SIMKGGAVALDGRIEPGDMILQVNDINFE------NMSNDEAVRVLREVVQKP 115
S++ G V +DG GD ++ V + N S++E ++ K
Sbjct: 18 SLLAPGVVEVDGDFRRGDEVVVVTEKGELVAVGLANYSSEEMAKIKGGKAVKV 70
>gnl|CDD|234035 TIGR02860, spore_IV_B, stage IV sporulation protein B. SpoIVB, the
stage IV sporulation protein B of endospore-forming
bacteria such as Bacillus subtilis, is a serine
proteinase, expressed in the spore (rather than mother
cell) compartment, that participates in a proteolytic
activation cascade for Sigma-K. It appears to be
universal among endospore-forming bacteria and occurs
nowhere else [Cellular processes, Sporulation and
germination].
Length = 402
Score = 31.2 bits (71), Expect = 0.54
Identities = 22/89 (24%), Positives = 38/89 (42%), Gaps = 22/89 (24%)
Query: 37 ITVTLNMDTVNFLGISIVGQSN-KGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDIN 95
I V LN G+ +VG S+ + G I+ + + I+ GD IL++N
Sbjct: 98 IGVKLNTK-----GVLVVGFSDIETEKGKIH--------SPGEEAGIQIGDRILKINGEK 144
Query: 96 FENMSNDEAVRVLREVVQKPG--PIKLVV 122
+NM + L ++ K G + L +
Sbjct: 145 IKNMDD------LANLINKAGGEKLTLTI 167
>gnl|CDD|233695 TIGR02037, degP_htrA_DO, periplasmic serine protease, Do/DeqQ
family. This family consists of a set proteins various
designated DegP, heat shock protein HtrA, and protease
DO. The ortholog in Pseudomonas aeruginosa is designated
MucD and is found in an operon that controls mucoid
phenotype. This family also includes the DegQ (HhoA)
paralog in E. coli which can rescue a DegP mutant, but
not the smaller DegS paralog, which cannot. Members of
this family are located in the periplasm and have
separable functions as both protease and chaperone.
Members have a trypsin domain and two copies of a PDZ
domain. This protein protects bacteria from thermal and
other stresses and may be important for the survival of
bacterial pathogens.// The chaperone function is
dominant at low temperatures, whereas the proteolytic
activity is turned on at elevated temperatures [Protein
fate, Protein folding and stabilization, Protein fate,
Degradation of proteins, peptides, and glycopeptides].
Length = 428
Score = 30.6 bits (70), Expect = 0.63
Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 4/67 (5%)
Query: 56 QSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRVLREVVQKP 115
KG G+ V ++ G A G ++PGD+IL VN + +S+ +R + +K
Sbjct: 355 LRLKGDVKGVVVTKVVSGSPAARAG-LQPGDVILSVNQ---QPVSSVAELRKVLARAKKG 410
Query: 116 GPIKLVV 122
G + L++
Sbjct: 411 GRVALLI 417
Score = 29.9 bits (68), Expect = 1.2
Identities = 9/38 (23%), Positives = 16/38 (42%), Gaps = 1/38 (2%)
Query: 63 GGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMS 100
G V ++ G G ++ GD+I VN + +
Sbjct: 257 RGALVAQVLPGSPAEKAG-LKAGDVITSVNGKPISSFA 293
>gnl|CDD|70841 pfam07390, P30, Mycoplasma P30 protein. This family consists of
several P30 proteins which seem to be specific to
Mycoplasma agalactiae. P30 is a 30-kDa immunodominant
antigen and is known to be a transmembrane protein.
Length = 266
Score = 29.8 bits (66), Expect = 1.3
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 118 IKLVVAKCWDPNPKGYFTIPRTEPVRPID 146
I V AKC + + K T P+ EP +P+D
Sbjct: 19 IPFVAAKCSEDDKKEKVTKPKNEPTKPVD 47
>gnl|CDD|214635 smart00359, PUA, Putative RNA-binding Domain in PseudoUridine
synthase and Archaeosine transglycosylase.
Length = 76
Score = 27.6 bits (62), Expect = 1.4
Identities = 15/52 (28%), Positives = 27/52 (51%), Gaps = 8/52 (15%)
Query: 68 GSIMKGGAVALDGRIEPGDMILQVNDINFE-------NMSNDEAVRVLREVV 112
S++ G V +DG I+ GD ++ + D E NMS++E R+ + +
Sbjct: 17 ASLLAPGVVRVDGDIKEGD-VVVIVDEKGEPLGIGLANMSSEEIARIKGKGL 67
>gnl|CDD|113681 pfam04917, Shufflon_N, Bacterial shufflon protein, N-terminal
constant region. This family represents the
high-similarity N-terminal 'constant region' shared by
shufflon proteins.
Length = 356
Score = 29.6 bits (66), Expect = 1.4
Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 1/32 (3%)
Query: 64 GIYVGSIMKGGAVALDGRIEPGDMILQVNDIN 95
GIY G +KGG V DGR+ G+ LQ+
Sbjct: 300 GIYTGGQVKGGTVRADGRLYTGE-YLQLEKTA 330
>gnl|CDD|217301 pfam02956, TT_ORF1, TT viral orf 1. TT virus (TTV), isolated
initially from a Japanese patient with hepatitis of
unknown aetiology, has since been found to infect both
healthy and diseased individuals and numerous
prevalence studies have raised questions about its role
in unexplained hepatitis. ORF1 is a large 750 residue
protein. The N-terminal half of this protein
corresponds to the capsid protein.
Length = 525
Score = 29.9 bits (68), Expect = 1.5
Identities = 10/18 (55%), Positives = 10/18 (55%)
Query: 3 RRRRPQRRRRHRPPALSR 20
RRRR RRRR R R
Sbjct: 20 RRRRRARRRRRRRRVRRR 37
Score = 27.2 bits (61), Expect = 9.4
Identities = 9/18 (50%), Positives = 10/18 (55%)
Query: 3 RRRRPQRRRRHRPPALSR 20
RRRR + RRR R R
Sbjct: 28 RRRRRRVRRRRRGRRRRR 45
>gnl|CDD|132166 TIGR03122, one_C_dehyd_C, formylmethanofuran dehydrogenase subunit
C. Members of this largely archaeal protein family are
subunit C of the formylmethanofuran dehydrogenase.
Nomenclature in some bacteria may reflect inclusion of
the formyltransferase described by TIGR03119 as part of
the complex, and therefore call this protein
formyltransferase/hydrolase complex Fhc subunit C. Note
that this model does not distinguish tungsten (FwdC)
from molybdenum-containing (FmdC) forms of this enzyme.
Length = 260
Score = 29.2 bits (66), Expect = 1.5
Identities = 11/30 (36%), Positives = 17/30 (56%), Gaps = 3/30 (10%)
Query: 61 GDGGIYVGSIMKGGAVALDGRIEP---GDM 87
G+ GI+ G M GG + +DG + G+M
Sbjct: 170 GNAGIFAGIHMNGGTIIIDGDVGRRPGGEM 199
Score = 26.9 bits (60), Expect = 8.5
Identities = 10/20 (50%), Positives = 16/20 (80%)
Query: 61 GDGGIYVGSIMKGGAVALDG 80
GD G++VG+ MKGG + ++G
Sbjct: 87 GDVGMHVGAEMKGGKIVVNG 106
>gnl|CDD|226011 COG3480, SdrC, Predicted secreted protein containing a PDZ domain
[Signal transduction mechanisms].
Length = 342
Score = 29.3 bits (66), Expect = 1.7
Identities = 17/53 (32%), Positives = 28/53 (52%), Gaps = 6/53 (11%)
Query: 64 GIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEAVRVLREVVQKPG 116
G+YV S++ G++E GD I+ V+ F S+DE + + +KPG
Sbjct: 131 GVYVLSVIDNSPFK--GKLEAGDTIIAVDGEPFT--SSDELIDYVSS--KKPG 177
>gnl|CDD|238486 cd00986, PDZ_LON_protease, PDZ domain of ATP-dependent LON serine
proteases. Most PDZ domains bind C-terminal
polypeptides, though binding to internal
(non-C-terminal) polypeptides and even to lipids has
been demonstrated. In this bacterial subfamily of
protease-associated PDZ domains a C-terminal
beta-strand is thought to form the peptide-binding
groove base, a circular permutation with respect to PDZ
domains found in Eumetazoan signaling proteins.
Length = 79
Score = 27.0 bits (60), Expect = 2.8
Identities = 11/34 (32%), Positives = 22/34 (64%), Gaps = 2/34 (5%)
Query: 64 GIYVGSIMKGGAVALDGRIEPGDMILQVNDINFE 97
G+YV S+++G + G+++ GD I+ V+ F+
Sbjct: 9 GVYVTSVVEG--MPAAGKLKAGDHIIAVDGKPFK 40
>gnl|CDD|183172 PRK11517, PRK11517, transcriptional regulatory protein YedW;
Provisional.
Length = 223
Score = 28.3 bits (63), Expect = 3.0
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 3/45 (6%)
Query: 80 GRIEPGDMIL-QVNDINFENMSN--DEAVRVLREVVQKPGPIKLV 121
G I P +I ++ INF++ +N D A+R LR V P P KL+
Sbjct: 164 GEIIPRTVIASEIWGINFDSDTNTVDVAIRRLRAKVDDPFPEKLI 208
>gnl|CDD|225128 COG2218, FwdC, Formylmethanofuran dehydrogenase subunit C [Energy
production and conversion].
Length = 264
Score = 28.5 bits (64), Expect = 3.4
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 61 GDGGIYVGSIMKGGAVALDGRIE 83
GD G +VG MKGG + +DG+
Sbjct: 194 GDAGDFVGGEMKGGTIVVDGKAG 216
>gnl|CDD|240550 cd13145, MATE_like_5, Uncharacterized subfamily of the multidrug
and toxic compound extrusion (MATE) proteins. The
integral membrane proteins from the MATE family are
involved in exporting metabolites across the cell
membrane and are responsible for multidrug resistance
(MDR) in many bacteria and animals. A number of family
members are involved in the synthesis of peptidoglycan
components in bacteria.
Length = 440
Score = 28.3 bits (64), Expect = 4.0
Identities = 12/18 (66%), Positives = 15/18 (83%)
Query: 204 DRREARKYASQMLKFGYI 221
DRR+AR+YA+Q L FG I
Sbjct: 80 DRRKARRYAAQGLSFGII 97
>gnl|CDD|238480 cd00980, FwdC/FmdC, FwdC/FmdC. This domain of unknown function is
found in the subunit C of formylmethanofuran
dehydrogenase, an enzyme that catalyzes the first step
in methane formation from CO2 in methanogenic archaea,
hyperthermophiles and bacteria. There are two
isoenzymes, a tungsten-containing isoenzyme (Fwd) and a
molybdenum-containing isoenzyme (Fmd). The subunits C of
both isoenzymes (FwdC/FmdC) are characterized by a
repeated GXXGXXXG motif.
Length = 203
Score = 27.7 bits (62), Expect = 4.9
Identities = 9/23 (39%), Positives = 12/23 (52%)
Query: 60 GGDGGIYVGSIMKGGAVALDGRI 82
GD GI+ G M GG + + G
Sbjct: 128 KGDAGIFAGIRMNGGTIIVRGDA 150
>gnl|CDD|238481 cd00981, arch_gltB, Archaeal-type gltB domain. This domain shares
sequence similarity with a region of unknown function
found in the large subunit of glutamate synthase, which
is encoded by gltB and found in most bacteria and
eukaryotes. It is predicted to be homologous to the
C-terminal domain of glutamate synthase based upon
sequence similarity coupled with genome organization
data, showing that this domain is found in a gene
cluster with other domains of Glts, which are annotated.
This domain is found primarily in archaea, but is also
present in a few bacteria, likely as a result of lateral
gene transfer.
Length = 232
Score = 27.3 bits (61), Expect = 7.0
Identities = 16/56 (28%), Positives = 25/56 (44%), Gaps = 3/56 (5%)
Query: 50 GISIV-GQSNKGGDGGIYVGSIMKGGAVALDGRIEPGDMILQVNDINFENMSNDEA 104
G+ IV G G Y+G+ M GG + + G++E + +V FE D
Sbjct: 147 GVIIVLGLGTDEEPVGRYIGTGMHGGVIYIRGKVERSKLGKEV--PKFELTEEDLE 200
>gnl|CDD|215143 PLN02255, PLN02255, H(+) -translocating inorganic pyrophosphatase.
Length = 765
Score = 27.5 bits (61), Expect = 8.0
Identities = 18/49 (36%), Positives = 23/49 (46%), Gaps = 3/49 (6%)
Query: 126 WDPNPKGYFTIPRTEPVRPIDPGAWVAHTAAIRGD--GFPLRPPSVSTL 172
WD N K Y +E R + P H AA+ GD G PL+ S +L
Sbjct: 689 WD-NAKKYIEAGASEHARSLGPKGSDPHKAAVIGDTIGDPLKDTSGPSL 736
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 27.2 bits (60), Expect = 8.7
Identities = 10/31 (32%), Positives = 20/31 (64%), Gaps = 1/31 (3%)
Query: 3 RRRRPQRRRRHRPPALSRTSSFSSITDSSMS 33
RRRR + R +RPP+ S S++ +++++
Sbjct: 315 RRRRRRPARIYRPPSPV-APSISAVNEAALA 344
>gnl|CDD|222616 pfam14239, RRXRR, RRXRR protein. This domain is found in bacteria,
eukaryotes and viruses, and is approximately 180 amino
acids in length. It contains a conserved RRXRR motif. It
is often found in association with pfam01844.
Length = 174
Score = 26.8 bits (60), Expect = 8.8
Identities = 8/16 (50%), Positives = 10/16 (62%)
Query: 1 MSRRRRPQRRRRHRPP 16
RRRR R+ R+R P
Sbjct: 95 RLRRRRRNRKTRYRKP 110
>gnl|CDD|133178 cd05046, PTK_CCK4, Pseudokinase domain of the Protein Tyrosine
Kinase, Colon Carcinoma Kinase 4. Protein Tyrosine
Kinase (PTK) family; Colon Carcinoma Kinase 4 (CCK4);
pseudokinase domain. The PTKc (catalytic domain) family,
to which this subfamily belongs, includes the catalytic
domains of other kinases such as protein
serine/threonine kinases, RIO kinases, and
phosphoinositide 3-kinase (PI3K). PTKs catalyze the
transfer of the gamma-phosphoryl group from ATP to
tyrosine (tyr) residues in protein substrates. CCK4,
also called protein tyrosine kinase 7 (PTK7), is an
orphan receptor tyr kinase (RTK) containing an
extracellular region with seven immunoglobulin domains,
a transmembrane segment, and an intracellular inactive
pseudokinase domain. Studies in mice reveal that CCK4 is
essential for neural development. Mouse embryos
containing a truncated CCK4 die perinatally and display
craniorachischisis, a severe form of neural tube defect.
The mechanism of action of the CCK4 pseudokinase is
still unknown. Other pseudokinases such as HER3 rely on
the activity of partner RTKs.
Length = 275
Score = 27.0 bits (60), Expect = 9.7
Identities = 10/43 (23%), Positives = 20/43 (46%), Gaps = 7/43 (16%)
Query: 96 FENMSNDEAVRVLREVVQK-------PGPIKLVVAKCWDPNPK 131
F +S++E + L+ + P + ++ +CW NPK
Sbjct: 219 FYGLSDEEVLNRLQAGKLELPVPEGCPSRLYKLMTRCWAVNPK 261
>gnl|CDD|187769 cd09638, Cas2_I_II_III, CRISPR/Cas system-associated protein Cas2.
CRISPR (Clustered Regularly Interspaced Short
Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Cas2 is present in majority of CRISPR/Cas systems along
with Cas1; RNAse specific to U-rich regions; Possesses
an RRM/ferredoxin fold.
Length = 90
Score = 25.8 bits (57), Expect = 9.9
Identities = 7/21 (33%), Positives = 11/21 (52%)
Query: 203 TDRREARKYASQMLKFGYIRH 223
T+R+ RK + K+G R
Sbjct: 13 TERKRRRKLRKLLEKYGLFRV 33
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.319 0.135 0.399
Gapped
Lambda K H
0.267 0.0757 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,573,417
Number of extensions: 1174227
Number of successful extensions: 1665
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1638
Number of HSP's successfully gapped: 60
Length of query: 245
Length of database: 10,937,602
Length adjustment: 94
Effective length of query: 151
Effective length of database: 6,768,326
Effective search space: 1022017226
Effective search space used: 1022017226
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 58 (26.1 bits)