RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6546
(294 letters)
>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
only].
Length = 467
Score = 34.7 bits (79), Expect = 0.050
Identities = 38/199 (19%), Positives = 63/199 (31%), Gaps = 36/199 (18%)
Query: 94 LTRHPCPQCSRDFATVKILKTHLKT-----HEGIIYPCPE--CDKTFLSTSGLREHRKVH 146
QC+ F+ L HL++ + CP C K F L+ H +H
Sbjct: 287 SLPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLH 346
Query: 147 AGVRYICH---ACPREFSSEATMKAHVASHNGVQHACDQCDRTFSCMPYLRQHQKLHSGI 203
+ +FS + L+Q++ L +
Sbjct: 347 TSISPAKEKLLNSSSKFSPLLNNEPPQ---------------------SLQQYKDLKNDK 385
Query: 204 TYSC--PSCSKSFNRKDRYRMHLKLH---QGVRHTCKECSKEFSNEYNLKNHMVLHDVRA 258
SC ++F R +H+ H + CSK F+ YNL H +H A
Sbjct: 386 KSETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHA 445
Query: 259 YVCELCARQFTRVDHFNHH 277
+ + F R ++H
Sbjct: 446 PLLCSILKSFRRDLDLSNH 464
>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
Members of this family are the S-layer-forming
halobacterial major cell surface glycoprotein. The
highest scores below model cutoffs are fragmentary
paralogs to actual members of the family. Modifications
include at N-linked and O-linked glycosylation, a
C-terminal diphytanylglyceryl modification, and probable
cleavage of the PGF-CTERM tail.
Length = 782
Score = 34.9 bits (80), Expect = 0.052
Identities = 13/30 (43%), Positives = 14/30 (46%), Gaps = 1/30 (3%)
Query: 7 PTTNNPSTTNTTPTPTTQTSSTTPTRHPCP 36
TT+ TT TTPT T TT T P
Sbjct: 716 TTTSEDPTTTTTPTTTG-PEETTETAEPTT 744
Score = 34.5 bits (79), Expect = 0.067
Identities = 9/26 (34%), Positives = 12/26 (46%)
Query: 6 SPTTNNPSTTNTTPTPTTQTSSTTPT 31
T + T T P TT+T+ T T
Sbjct: 720 EDPTTTTTPTTTGPEETTETAEPTTT 745
Score = 31.8 bits (72), Expect = 0.51
Identities = 11/26 (42%), Positives = 12/26 (46%)
Query: 6 SPTTNNPSTTNTTPTPTTQTSSTTPT 31
+PTT P T T PTT T T
Sbjct: 727 TPTTTGPEETTETAEPTTTTEEPTEE 752
Score = 31.4 bits (71), Expect = 0.66
Identities = 7/26 (26%), Positives = 11/26 (42%)
Query: 6 SPTTNNPSTTNTTPTPTTQTSSTTPT 31
+ T +T PT TT+ + T
Sbjct: 729 TTTGPEETTETAEPTTTTEEPTEETT 754
>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
Length = 128
Score = 31.4 bits (71), Expect = 0.18
Identities = 23/80 (28%), Positives = 36/80 (45%), Gaps = 8/80 (10%)
Query: 67 KTFLSTLTGTPQGARGRAVYLSRMSTTLTRHP--CPQCSRDFATVKILKTHLK--THEGI 122
+ + TL PQ + YL ++ T+ P CP C F++ LK H++ H +
Sbjct: 43 RAVVKTLIYNPQ-LLDESSYLYKLLTSKAVSPYVCPLCLMPFSSSVSLKQHIRYTEHSKV 101
Query: 123 IYPCPECDKTFLSTSGLREH 142
CP C K F +T +H
Sbjct: 102 ---CPVCGKEFRNTDSTLDH 118
Score = 26.8 bits (59), Expect = 7.2
Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 5/40 (12%)
Query: 35 CPQCSRDFATVKILKTHLK--THEGIIYPCPECDKTFLST 72
CP C F++ LK H++ H + CP C K F +T
Sbjct: 76 CPLCLMPFSSSVSLKQHIRYTEHSKV---CPVCGKEFRNT 112
>gnl|CDD|223609 COG0535, COG0535, Predicted Fe-S oxidoreductases [General function
prediction only].
Length = 347
Score = 32.7 bits (74), Expect = 0.21
Identities = 19/99 (19%), Positives = 32/99 (32%), Gaps = 18/99 (18%)
Query: 118 THEGIIYPCPECDKTFLSTSGLREHRKVHAGVRYICHACPREFSSEATMKAHVASHNGVQ 177
+G +YPCP FL G+RE + + RE ++
Sbjct: 266 DPDGEVYPCP-----FLPELGVREESFKEIWEESLLNKL-RERD------------ELLE 307
Query: 178 HACDQCDRTFSCMPYLRQHQKLHSGITYSCPSCSKSFNR 216
+C +C+ C + +L I + P C
Sbjct: 308 GSCGKCEYREYCGGCRARAYELTGDIYGADPFCPLVKKS 346
Score = 27.3 bits (60), Expect = 10.0
Identities = 18/87 (20%), Positives = 24/87 (27%), Gaps = 11/87 (12%)
Query: 54 THEGIIYPCPECDKTFLSTLTGTPQGARGRAVYLS-----RMSTTLTRHPCPQCSRDFAT 108
+G +YPCP FL L G + + S R L C +C
Sbjct: 266 DPDGEVYPCP-----FLPEL-GVREESFKEIWEESLLNKLRERDELLEGSCGKCEYREYC 319
Query: 109 VKILKTHLKTHEGIIYPCPECDKTFLS 135
+ I P C S
Sbjct: 320 GGCRARAYELTGDIYGADPFCPLVKKS 346
>gnl|CDD|204841 pfam12171, zf-C2H2_jaz, Zinc-finger double-stranded RNA-binding.
This domain family is found in archaea and eukaryotes,
and is approximately 30 amino acids in length. The
mammalian members of this group occur multiple times
along the protein, joined by flexible linkers, and are
referred to as JAZ - dsRNA-binding ZF protein -
zinc-fingers. The JAZ proteins are expressed in all
tissues tested and localise in the nucleus, particularly
the nucleolus. JAZ preferentially binds to
double-stranded (ds) RNA or RNA/DNA hybrids rather than
DNA. In addition to binding double-stranded RNA, these
zinc-fingers are required for nucleolar localisation.
Length = 27
Score = 29.1 bits (66), Expect = 0.23
Identities = 8/21 (38%), Positives = 12/21 (57%)
Query: 231 RHTCKECSKEFSNEYNLKNHM 251
+ C C K F +E L+NH+
Sbjct: 1 QFYCVACDKYFKSENALENHL 21
Score = 27.2 bits (61), Expect = 1.1
Identities = 8/23 (34%), Positives = 14/23 (60%)
Query: 150 RYICHACPREFSSEATMKAHVAS 172
++ C AC + F SE ++ H+ S
Sbjct: 1 QFYCVACDKYFKSENALENHLKS 23
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
function prediction only].
Length = 872
Score = 32.5 bits (74), Expect = 0.25
Identities = 13/29 (44%), Positives = 18/29 (62%), Gaps = 1/29 (3%)
Query: 6 SPTTNNPSTTNTTPTPTTQTSSTTPTRHP 34
TT++PS T T +PT ++STT T P
Sbjct: 814 QTTTSSPSPT-QTTSPTQTSTSTTTTTSP 841
Score = 31.8 bits (72), Expect = 0.43
Identities = 10/33 (30%), Positives = 14/33 (42%)
Query: 5 LSPTTNNPSTTNTTPTPTTQTSSTTPTRHPCPQ 37
+ T + T T T TSS +PT+ P
Sbjct: 797 ILIETTSVVITKTVTQTQTTTSSPSPTQTTSPT 829
Score = 30.2 bits (68), Expect = 1.6
Identities = 9/29 (31%), Positives = 16/29 (55%)
Query: 3 FLLSPTTNNPSTTNTTPTPTTQTSSTTPT 31
+++ T TT ++P+PT TS T +
Sbjct: 804 VVITKTVTQTQTTTSSPSPTQTTSPTQTS 832
>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger.
Length = 23
Score = 28.2 bits (63), Expect = 0.39
Identities = 14/23 (60%), Positives = 15/23 (65%)
Query: 124 YPCPECDKTFLSTSGLREHRKVH 146
Y CPEC K F S S LREH + H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 27.8 bits (62), Expect = 0.51
Identities = 9/23 (39%), Positives = 13/23 (56%)
Query: 232 HTCKECSKEFSNEYNLKNHMVLH 254
+ C EC K F ++ L+ HM H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
Score = 26.7 bits (59), Expect = 1.5
Identities = 10/23 (43%), Positives = 12/23 (52%)
Query: 205 YSCPSCSKSFNRKDRYRMHLKLH 227
Y CP C K F K R H++ H
Sbjct: 1 YRCPECGKVFKSKSALREHMRTH 23
>gnl|CDD|132198 TIGR03154, sulfolob_CbsA, cytochrome b558/566, subunit A. Members
of this protein family are CbsA, one subunit of a highly
glycosylated, heterodimeric, mono-heme cytochrome
b558/566, found in Sulfolobus acidocaldarius and several
other members of the Sulfolobales, a branch of the
Crenarchaeota.
Length = 465
Score = 31.5 bits (71), Expect = 0.55
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 4 LLSPTTNNPSTTNTTPTPTTQTSSTTPT 31
L++ P++T T+ +P+T TSS P+
Sbjct: 412 LVTTPPTPPTSTTTSTSPSTTTSSAIPS 439
>gnl|CDD|206083 pfam13912, zf-C2H2_6, C2H2-type zinc finger.
Length = 27
Score = 27.6 bits (62), Expect = 0.68
Identities = 10/24 (41%), Positives = 13/24 (54%)
Query: 123 IYPCPECDKTFLSTSGLREHRKVH 146
++ C C KTF S L H+K H
Sbjct: 1 VHTCGVCGKTFSSLQALGGHKKSH 24
>gnl|CDD|222011 pfam13257, DUF4048, Domain of unknown function (DUF4048). This
presumed domain is functionally uncharacterized. This
domain family is found in eukaryotes, and is typically
between 228 and 257 amino acids in length.
Length = 242
Score = 30.5 bits (69), Expect = 0.76
Identities = 10/31 (32%), Positives = 17/31 (54%)
Query: 4 LLSPTTNNPSTTNTTPTPTTQTSSTTPTRHP 34
LLSP + P + + +T + + P+RHP
Sbjct: 7 LLSPNSGPPFSPLASSGGSTDSDGSRPSRHP 37
>gnl|CDD|216786 pfam01921, tRNA-synt_1f, tRNA synthetases class I (K). This family
includes only lysyl tRNA synthetases from prokaryotes.
Length = 357
Score = 30.7 bits (70), Expect = 0.80
Identities = 12/42 (28%), Positives = 18/42 (42%), Gaps = 2/42 (4%)
Query: 35 CPQCSRDFATVKILKTHLKTHEGIIYPCPECDKTFLSTLTGT 76
CP+C R T +++ + I Y C EC +TG
Sbjct: 174 CPKCGRV-LTTPVVEYDAE-GGTITYKCDECGHEGEVDITGG 213
>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
Length = 1467
Score = 31.3 bits (70), Expect = 0.83
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 5 LSPTTNNPSTTNTTPTPTTQTSSTTPTRHPCP 36
LS ++ P+ T TP P++ +SS+ + P
Sbjct: 1228 LSAASSPPAATTPTPPPSSSSSSSAQSISTSP 1259
>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type. The C2H2 zinc finger
is the classical zinc finger domain. The two conserved
cysteines and histidines co-ordinate a zinc ion. The
following pattern describes the zinc finger.
#-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
be any amino acid, and numbers in brackets indicate the
number of residues. The positions marked # are those
that are important for the stable fold of the zinc
finger. The final position can be either his or cys. The
C2H2 zinc finger is composed of two short beta strands
followed by an alpha helix. The amino terminal part of
the helix binds the major groove in DNA binding zinc
fingers. The accepted consensus binding sequence for Sp1
is usually defined by the asymmetric hexanucleotide core
GGGCGG but this sequence does not include, among others,
the GAG (=CTC) repeat that constitutes a high-affinity
site for Sp1 binding to the wt1 promoter.
Length = 22
Score = 26.9 bits (60), Expect = 0.98
Identities = 10/22 (45%), Positives = 13/22 (59%)
Query: 233 TCKECSKEFSNEYNLKNHMVLH 254
C +C K FS + NLK H+ H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 25.4 bits (56), Expect = 3.9
Identities = 11/22 (50%), Positives = 14/22 (63%)
Query: 206 SCPSCSKSFNRKDRYRMHLKLH 227
CP C KSF+RK + HL+ H
Sbjct: 1 KCPDCGKSFSRKSNLKRHLRTH 22
Score = 25.0 bits (55), Expect = 4.9
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 126 CPECDKTFLSTSGLREHRKVH 146
CP+C K+F S L+ H + H
Sbjct: 2 CPDCGKSFSRKSNLKRHLRTH 22
>gnl|CDD|221755 pfam12756, zf-C2H2_2, C2H2 type zinc-finger (2 copies). This
family contains two copies of a C2H2-like zinc finger
domain.
Length = 100
Score = 28.8 bits (65), Expect = 1.2
Identities = 18/79 (22%), Positives = 30/79 (37%), Gaps = 9/79 (11%)
Query: 99 CPQCSRDFATVKILKTHLKTHEGIIYPCPECDKTFLSTS-GLRE--HRKVHAGVRYICHA 155
C C+ TV+ H+ G P ++ +L GL K+H G C
Sbjct: 2 CLFCNHTSDTVEENLEHMFKSHGFFIP----EREYLVDLEGLLNYLREKIHEG--NECLY 55
Query: 156 CPREFSSEATMKAHVASHN 174
C ++F S ++ H+
Sbjct: 56 CGKQFKSLEALRQHMRDKG 74
Score = 26.5 bits (59), Expect = 7.7
Identities = 10/30 (33%), Positives = 17/30 (56%), Gaps = 2/30 (6%)
Query: 222 MHLKLHQGVRHTCKECSKEFSNEYNLKNHM 251
+ K+H+G + C C K+F + L+ HM
Sbjct: 43 LREKIHEG--NECLYCGKQFKSLEALRQHM 70
>gnl|CDD|214426 MTH00123, ATP8, ATP synthase F0 subunit 8; Provisional.
Length = 54
Score = 27.5 bits (62), Expect = 1.4
Identities = 11/22 (50%), Positives = 12/22 (54%)
Query: 4 LLSPTTNNPSTTNTTPTPTTQT 25
L TNNP+ TNTT T T
Sbjct: 29 LSFTPTNNPTKTNTTTNTTPWT 50
Score = 27.2 bits (61), Expect = 1.8
Identities = 7/29 (24%), Positives = 13/29 (44%)
Query: 3 FLLSPTTNNPSTTNTTPTPTTQTSSTTPT 31
++ P + + TN T T++T T
Sbjct: 22 LIIQPKILSFTPTNNPTKTNTTTNTTPWT 50
>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
transition [Transcription / Cell division and chromosome
partitioning].
Length = 423
Score = 30.1 bits (67), Expect = 1.4
Identities = 25/88 (28%), Positives = 38/88 (43%), Gaps = 10/88 (11%)
Query: 116 LKTHEGIIYPCP--ECDKTFLSTSGLREHRKVHAGVRYICHACPREFSSEATMKAHVASH 173
LK +G Y CP C+K + + +GL+ H H H P S M A
Sbjct: 342 LKVKDGKPYKCPVEGCNKKYKNQNGLKYHML-HGHQNQKLHENP----SPEKMNIFSAKD 396
Query: 174 NGVQHACDQCDRTFSCMPYLRQHQKLHS 201
+ C+ CD+ + + L+ H+K HS
Sbjct: 397 K--PYRCEVCDKRYKNLNGLKYHRK-HS 421
Score = 29.7 bits (66), Expect = 1.9
Identities = 20/89 (22%), Positives = 41/89 (46%), Gaps = 9/89 (10%)
Query: 197 QKLHSGITYSCP--SCSKSFNRKDRYRMHLKLHQGVRHTCKECSKEFSNEYNLKNHMVLH 254
K+ G Y CP C+K + ++ + H+ + + S E N ++ K+
Sbjct: 342 LKVKDGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKD----- 396
Query: 255 DVRAYVCELCARQFTRVDHFNHHRRLQHD 283
+ Y CE+C +++ ++ +HR+ HD
Sbjct: 397 --KPYRCEVCDKRYKNLNGLKYHRKHSHD 423
>gnl|CDD|213276 cd05908, A_NRPS_MycA_like, The adenylation domain of nonribosomal
peptide synthetases (NRPS) similar to mycosubtilin
synthase subunit A (MycA). The adenylation (A) domain
of NRPS recognizes a specific amino acid or hydroxy acid
and activates it as (amino)-acyl adenylate by hydrolysis
of ATP. The activated acyl moiety then forms thioester
to the enzyme-bound cofactor phosphopantetheine of a
peptidyl carrier protein domain. This family includes
NRPS similar to mycosubtilin synthase subunit A (MycA).
Mycosubtilin, which is characterized by a beta-amino
fatty acid moiety linked to the circular heptapeptide
Asn-Tyr-Asn-Gln-Pro-Ser-Asn, belongs to the iturin
family of lipopeptide antibiotics. The mycosubtilin
synthase subunit A (MycA) combines functional domains
derived from peptide synthetases, amino transferases,
and fatty acid synthases. Nonribosomal peptide
synthetases are large multifunction enzymes that
synthesize many therapeutically useful peptides. NRPS
has a distinct modular structure in which each module is
responsible for the recognition, activation, and, in
some cases, modification of a single amino acid residue
of the final peptide product. The modules can be
subdivided into domains that catalyze specific
biochemical reactions.
Length = 499
Score = 29.9 bits (68), Expect = 1.5
Identities = 12/43 (27%), Positives = 17/43 (39%)
Query: 35 CPQCSRDFATVKILKTHLKTHEGIIYPCPECDKTFLSTLTGTP 77
P+ F TV + + HLK II P+ G+P
Sbjct: 277 FPKPGEPFKTVCLDRNHLKVGVKIIEIDPKNSDAISFVEVGSP 319
Score = 29.5 bits (67), Expect = 2.3
Identities = 10/30 (33%), Positives = 14/30 (46%)
Query: 99 CPQCSRDFATVKILKTHLKTHEGIIYPCPE 128
P+ F TV + + HLK II P+
Sbjct: 277 FPKPGEPFKTVCLDRNHLKVGVKIIEIDPK 306
>gnl|CDD|212564 cd11674, lambda-1, inner capsid protein lambda-1 or VP3. The
reovirus inner capsid protein lambda-1 displays
nucleoside triphosphate phosphohydrolase (NTPase),
RNA-5'-triphosphatase (RTPase), and RNA helicase
activity and may play a role in the transcription of
the virus genome, the unwinding or reannealing of
double-stranded RNA during RNA synthesis. The RTPase
activity constitutes the first step in the capping of
RNA, resulting in a 5'-diphosphorylated RNA
plus-strand. lambda1 is an Orthoreovirus core protein,
VP3 is the homologous core protein in Aquareoviruses.
Length = 1166
Score = 30.1 bits (68), Expect = 1.8
Identities = 8/37 (21%), Positives = 19/37 (51%)
Query: 18 TPTPTTQTSSTTPTRHPCPQCSRDFATVKILKTHLKT 54
+P + + + P+ + C C +F+++ L H +T
Sbjct: 59 SPPQSKKATPINPSSYVCNVCMAEFSSMDQLAEHQRT 95
Score = 29.7 bits (67), Expect = 2.1
Identities = 9/27 (33%), Positives = 15/27 (55%)
Query: 256 VRAYVCELCARQFTRVDHFNHHRRLQH 282
+YVC +C +F+ +D H+R H
Sbjct: 71 PSSYVCNVCMAEFSSMDQLAEHQRTTH 97
Score = 28.2 bits (63), Expect = 7.0
Identities = 11/33 (33%), Positives = 15/33 (45%), Gaps = 1/33 (3%)
Query: 170 VASHNGVQHACDQCDRTFSCMPYLRQHQKL-HS 201
N + C+ C FS M L +HQ+ HS
Sbjct: 66 ATPINPSSYVCNVCMAEFSSMDQLAEHQRTTHS 98
>gnl|CDD|239922 cd04476, RPA1_DBD_C, RPA1_DBD_C: A subfamily of OB folds
corresponding to the C-terminal OB fold, the
ssDNA-binding domain (DBD)-C, of human RPA1 (also called
RPA70). RPA1 is the large subunit of Replication protein
A (RPA). RPA is a nuclear ssDNA-binding protein (SSB)
which appears to be involved in all aspects of DNA
metabolism including replication, recombination, and
repair. RPA also mediates specific interactions of
various nuclear proteins. In animals, plants, and fungi,
RPA is a heterotrimer with subunits of 70KDa (RPA1),
32kDa (RPA2), and 14 KDa (RPA3). In addition to DBD-C,
RPA1 contains three other OB folds: DBD-A, DBD-B, and
RPA1N. The major DNA binding activity of RPA is
associated with RPA1 DBD-A and DBD-B. RPA1 DBD-C is
involved in DNA binding and trimerization. It contains
two structural insertions not found to date in other
OB-folds: a zinc ribbon and a three-helix bundle. RPA1
DBD-C also contains a Cys4-type zinc-binding motif,
which plays a role in the ssDNA binding function of this
domain. It appears that zinc itself may not be required
for ssDNA binding.
Length = 166
Score = 28.8 bits (65), Expect = 1.9
Identities = 12/49 (24%), Positives = 18/49 (36%), Gaps = 11/49 (22%)
Query: 179 ACDQCDRTFSCMPYLRQHQKLHSGITYSCPSCSKSFNR-KDRYRMHLKL 226
AC C++ TY C C+KS + RY + L +
Sbjct: 36 ACPGCNKKV----------VEEGNGTYRCEKCNKSVPNPEYRYILSLNV 74
>gnl|CDD|225380 COG2824, PhnA, Uncharacterized Zn-ribbon-containing protein
involved in phosphonate metabolism [Inorganic ion
transport and metabolism].
Length = 112
Score = 28.1 bits (63), Expect = 2.0
Identities = 9/35 (25%), Positives = 15/35 (42%), Gaps = 10/35 (28%)
Query: 31 TRHPCPQCSRDFATVKILKTHLKTHEGIIYPCPEC 65
+ PCP+C+ ++ +G CPEC
Sbjct: 2 SLPPCPKCNSEYTY----------EDGGQLICPEC 26
Score = 28.1 bits (63), Expect = 2.0
Identities = 9/35 (25%), Positives = 15/35 (42%), Gaps = 10/35 (28%)
Query: 95 TRHPCPQCSRDFATVKILKTHLKTHEGIIYPCPEC 129
+ PCP+C+ ++ +G CPEC
Sbjct: 2 SLPPCPKCNSEYTY----------EDGGQLICPEC 26
>gnl|CDD|203307 pfam05693, Glycogen_syn, Glycogen synthase. This family consists
of the eukaryotic glycogen synthase proteins GYS1, GYS2
and GYS3. Glycogen synthase (GS) is the enzyme
responsible for the synthesis of -1,4-linked glucose
chains in glycogen. It is the rate limiting enzyme in
the synthesis of the polysaccharide, and its activity is
highly regulated through phosphorylation at multiple
sites and also by allosteric effectors, mainly glucose
6-phosphate (G6P).
Length = 633
Score = 29.3 bits (66), Expect = 2.5
Identities = 9/41 (21%), Positives = 12/41 (29%)
Query: 1 DHFLLSPTTNNPSTTNTTPTPTTQTSSTTPTRHPCPQCSRD 41
D F P P + S +P+ P P S
Sbjct: 589 DQFKQEVEETISDNEFKIPRPASVPPSPSPSDAPSPHSSDA 629
>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain.
Length = 26
Score = 25.8 bits (57), Expect = 2.7
Identities = 12/25 (48%), Positives = 16/25 (64%), Gaps = 1/25 (4%)
Query: 48 LKTHLKTHEGI-IYPCPECDKTFLS 71
L+ H++TH G Y CP C K+F S
Sbjct: 2 LRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 25.8 bits (57), Expect = 2.7
Identities = 12/25 (48%), Positives = 16/25 (64%), Gaps = 1/25 (4%)
Query: 112 LKTHLKTHEGI-IYPCPECDKTFLS 135
L+ H++TH G Y CP C K+F S
Sbjct: 2 LRRHMRTHTGEKPYKCPVCGKSFSS 26
Score = 25.8 bits (57), Expect = 3.2
Identities = 12/26 (46%), Positives = 16/26 (61%), Gaps = 1/26 (3%)
Query: 192 YLRQHQKLHSGIT-YSCPSCSKSFNR 216
LR+H + H+G Y CP C KSF+
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain. This region of the APC
family of proteins is known as the basic domain. It
contains a high proportion of positively charged amino
acids and interacts with microtubules.
Length = 359
Score = 28.9 bits (64), Expect = 2.8
Identities = 20/89 (22%), Positives = 34/89 (38%), Gaps = 9/89 (10%)
Query: 7 PTTNNPSTTNTTPTPTTQTSSTTPTRHPCPQCSRDFATVKILKTHLKTHEGIIYPCPECD 66
P T++P+ + + + +P R P Q K + P PE
Sbjct: 122 PRTSSPARALLASSGSQHKTQKSPVRIPFMQ--------NPAKPPPLSKNASSRPRPEPG 173
Query: 67 KTFLSTLTGTPQGARGRAVYLSRMSTTLT 95
+ + G P GARG + L RM++ +
Sbjct: 174 SRGRAGMNGGP-GARGSRLELVRMASAKS 201
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are transmembrane
heparin sulfate proteoglycans which are implicated in
the binding of extracellular matrix components and
growth factors.
Length = 207
Score = 28.6 bits (64), Expect = 2.8
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 5 LSPTTNNPSTTNTTPTPTTQTSSTTPTRHP 34
L+ T+++PS TT + +T+TS T T
Sbjct: 80 LTTTSSSPSNDTTTASTSTKTSPTVSTTVT 109
>gnl|CDD|221315 pfam11921, DUF3439, Domain of unknown function (DUF3439). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 46 to 94 amino acids in length. This domain is
found associated with pfam01462, pfam00560.
Length = 122
Score = 27.6 bits (61), Expect = 3.8
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 4 LLSPTTNNPSTTNTTPTPTTQTSSTT 29
++ PTT +TT TT PTT T TT
Sbjct: 43 IICPTTTTTTTTTTTTMPTTTTLPTT 68
>gnl|CDD|234471 TIGR04104, cxxc_20_cxxc, cxxc_20_cxxc protein. This small,
uncommon, poorly conserved protein is found primarily in
the Firmicutes. It features are pair of CxxC motifs
separated by about 20 amino acids, followed by a highly
hydrophobic region of about 45 amino acids. It has no
conserved gene neighborhood, and its function is
unknown.
Length = 94
Score = 26.9 bits (60), Expect = 4.0
Identities = 8/45 (17%), Positives = 21/45 (46%), Gaps = 4/45 (8%)
Query: 206 SCPSCSKSFNRKDRYRMHLKLHQGVRHTCKECSKE--FSNEYNLK 248
C +C++ F+ K+ + L++ ++ C C + + + K
Sbjct: 2 ICKNCNEKFSYKELLKSLFSLYRPIK--CPNCGTKQYLTAKSRKK 44
>gnl|CDD|221093 pfam11359, gpUL132, Glycoprotein UL132. Glycoprotein UL132 is a
low-abundance structural component of Human
cytomegalovirus (HCMV). The function of this protein is
not fully understood.
Length = 235
Score = 27.7 bits (61), Expect = 6.1
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 5 LSPTTNNPSTTNTTPTPTTQTSSTTPT 31
++ +T P+ T T T T TS+TT
Sbjct: 1 MTSSTTTPANTTATVTVTVATSNTTSV 27
>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
Length = 234
Score = 27.6 bits (60), Expect = 6.8
Identities = 11/25 (44%), Positives = 13/25 (52%)
Query: 7 PTTNNPSTTNTTPTPTTQTSSTTPT 31
PTT+N ST N T T Q + T
Sbjct: 92 PTTSNASTINVTTKVTAQNITATEA 116
>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger. This family contains
a number of divergent C2H2 type zinc fingers.
Length = 24
Score = 24.9 bits (54), Expect = 7.0
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 205 YSCPSCSKSFNRKDRYRMHLKLH 227
+ CP C KSF+ KD + HL+ H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 24.5 bits (53), Expect = 7.3
Identities = 9/23 (39%), Positives = 13/23 (56%)
Query: 124 YPCPECDKTFLSTSGLREHRKVH 146
+ CP C K+F S L+ H + H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
Score = 24.1 bits (52), Expect = 9.9
Identities = 9/23 (39%), Positives = 12/23 (52%)
Query: 232 HTCKECSKEFSNEYNLKNHMVLH 254
C C K FS++ LK H+ H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKH 23
>gnl|CDD|227515 COG5188, PRP9, Splicing factor 3a, subunit 3 [RNA processing and
modification].
Length = 470
Score = 28.1 bits (62), Expect = 7.3
Identities = 19/67 (28%), Positives = 28/67 (41%), Gaps = 8/67 (11%)
Query: 207 CPSCSKSFNRKDRYRMHLKLHQGVRHTCKECSKEFS---NEYNLKNHM-VLHDVRAYVCE 262
C C + F+R + HL+ G RH CKE + +EY L ++ L D +
Sbjct: 241 CVKCGREFSRSKVFEYHLE---GKRH-CKEGQGKEEFVYSEYVLHRYLKYLGDPVSETRS 296
Query: 263 LCARQFT 269
L R
Sbjct: 297 LVLRSLA 303
Score = 28.1 bits (62), Expect = 7.4
Identities = 7/19 (36%), Positives = 13/19 (68%)
Query: 35 CPQCSRDFATVKILKTHLK 53
C +C R+F+ K+ + HL+
Sbjct: 241 CVKCGREFSRSKVFEYHLE 259
Score = 28.1 bits (62), Expect = 7.4
Identities = 7/19 (36%), Positives = 13/19 (68%)
Query: 99 CPQCSRDFATVKILKTHLK 117
C +C R+F+ K+ + HL+
Sbjct: 241 CVKCGREFSRSKVFEYHLE 259
>gnl|CDD|226202 COG3677, COG3677, Transposase and inactivated derivatives [DNA
replication, recombination, and repair].
Length = 129
Score = 27.0 bits (60), Expect = 7.9
Identities = 14/40 (35%), Positives = 18/40 (45%), Gaps = 4/40 (10%)
Query: 207 CPSCSKSFNRKDRYRMHLKLHQGVRHTCKECSKEFSNEYN 246
CP C KS N + + HQ R+ CK C F+ E
Sbjct: 33 CPRC-KSSNVV-KIGGIRRGHQ--RYKCKSCGSTFTVETG 68
>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160. This
family consists of several special lobe-specific silk
protein SSP160 sequences which appear to be specific to
Chironomus (Midge) species.
Length = 758
Score = 27.8 bits (61), Expect = 7.9
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 12 PSTTNTTPTPTTQTSSTTPTRHP 34
P+ NTT + TT T++TT T P
Sbjct: 676 PAAANTTTSSTTTTTTTTTTAAP 698
>gnl|CDD|129701 TIGR00614, recQ_fam, ATP-dependent DNA helicase, RecQ family. All
proteins in this family for which functions are known
are 3'-5' DNA-DNA helicases. These proteins are used for
recombination, recombinational repair, and possibly
maintenance of chromosome stability. This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 470
Score = 27.8 bits (62), Expect = 7.9
Identities = 11/27 (40%), Positives = 15/27 (55%), Gaps = 1/27 (3%)
Query: 187 FSCMPYLRQHQKLHSGITYSCPSCSKS 213
+ ++R+ K SGI Y CPS KS
Sbjct: 214 EDLLRFIRKEFKGKSGIIY-CPSRKKS 239
>gnl|CDD|227443 COG5112, UFD2, U1-like Zn-finger-containing protein [General
function prediction only].
Length = 126
Score = 26.6 bits (58), Expect = 9.1
Identities = 11/24 (45%), Positives = 14/24 (58%)
Query: 94 LTRHPCPQCSRDFATVKILKTHLK 117
L +H C +C+R F T K L H K
Sbjct: 53 LGQHYCIECARYFITEKALMEHKK 76
>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
structure and dynamics].
Length = 395
Score = 27.4 bits (61), Expect = 9.5
Identities = 10/29 (34%), Positives = 14/29 (48%), Gaps = 3/29 (10%)
Query: 259 YVCELCARQFTRVDHFNHHRR---LQHDP 284
Y+CE C + + HR+ LQH P
Sbjct: 159 YICEFCLKYYGSQTSLVRHRKKCSLQHPP 187
>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B
N-terminal. This is the eukaryotic DNA polymerase alpha
subunit B N-terminal domain which is involved in complex
formation. Also see pfam04058.
Length = 239
Score = 27.4 bits (61), Expect = 9.5
Identities = 8/32 (25%), Positives = 14/32 (43%)
Query: 2 HFLLSPTTNNPSTTNTTPTPTTQTSSTTPTRH 33
SP + +PS +TP+P + S+
Sbjct: 126 SLFFSPASFSPSAAPSTPSPNSAKFSSRSNPG 157
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.322 0.132 0.431
Gapped
Lambda K H
0.267 0.0720 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,948,626
Number of extensions: 1200566
Number of successful extensions: 1934
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1864
Number of HSP's successfully gapped: 147
Length of query: 294
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 198
Effective length of database: 6,679,618
Effective search space: 1322564364
Effective search space used: 1322564364
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 59 (26.5 bits)