RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12708
(695 letters)
>gnl|CDD|201301 pfam00554, RHD, Rel homology domain (RHD). Proteins containing the
Rel homology domain (RHD) are eukaryotic transcription
factors. The RHD is composed of two structural domains.
This is the N-terminal domain that is similar to that
found in P53. The C-terminal domain has an
immunoglobulin-like fold (See pfam01833) that binds to
DNA.
Length = 169
Score = 196 bits (500), Expect = 4e-59
Identities = 85/178 (47%), Positives = 111/178 (62%), Gaps = 13/178 (7%)
Query: 41 IEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVSKD 99
+EI+EQPK + +RFRY+CEGRSAGSI G +ST K++P + I NY G A + VS V+KD
Sbjct: 1 LEIVEQPKQRGMRFRYKCEGRSAGSIPGESSTRSKKTFPTVQICNYDGPAVIRVSLVTKD 60
Query: 100 SPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKKDI-QAFRLYLQFCQR 158
P+R HPH LV KD C +G+ L DM SF NLGIQC+KKKD+ +A + ++
Sbjct: 61 EPHRPHPHSLVGKD-CKDGVCEVELGPEDMVASFQNLGIQCVKKKDVEEALKERIELNID 119
Query: 159 PFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDN-VIKLEPVVSDIIYD 215
PFN + ++DLN VRLCFQA+LP + N L PVVS+ IYD
Sbjct: 120 PFNVGFEAL---------RQIKDMDLNVVRLCFQAFLPDTRGNFTTPLPPVVSNPIYD 168
>gnl|CDD|143647 cd07887, RHD-n_Dorsal_Dif, N-terminal sub-domain of the Rel
homology domain (RHD) of the arthropod protein Dorsal.
Proteins containing the Rel homology domain (RHD) are
metazoan transcription factors. The RHD is composed of
two structural sub-domains; this model characterizes the
N-terminal RHD sub-domain of the arthropod Dorsal and
Dif (Dorsal-related immunity factor), and similar
proteins. Dorsal and Dif are Rel-like transcription
factors, which play roles in mediating innate immunity
in Drosophila. They are activated via the Toll pathway.
Cytoplasmic Dorsal/Dif are inactivated via forming a
complex with Cactus, the Drosophila homologue of
mammalian I-kappa B proteins. In response to signals,
Cactus is degraded and Dorsal/Dif can be transported
into the nucleus, where they act as transcription
factors. Dorsal is also an essential gene in
establishing the proper dorsal/ventral polarity in the
developing embryo.
Length = 173
Score = 189 bits (481), Expect = 2e-56
Identities = 88/185 (47%), Positives = 117/185 (63%), Gaps = 18/185 (9%)
Query: 39 PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
PY+ I+EQP S+ +RFRYECEGRSAGSI GANST E K++P I + NY G A +VVSCV+
Sbjct: 1 PYVRIVEQPTSRALRFRYECEGRSAGSIPGANSTSEGKTFPTIQVVNYDGRAVVVVSCVT 60
Query: 98 KDSPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKKDIQ-AFRLYLQFC 156
KD P+R HPH LV K+ C G+ + +M F LGIQC+KKKD++ + +L +
Sbjct: 61 KDEPFRPHPHNLVGKEGCKKGVCTKKINPTEMRIVFQKLGIQCVKKKDVEESLKLREEIN 120
Query: 157 QRPFNSEKNSAWTLLGVMGFNNRD---NLDLNAVRLCFQAYLPKDKDNV-IKLEPVVSDI 212
PF + GF+++D ++DLN VRLCFQ +L + + L PVVSD
Sbjct: 121 VDPFRT------------GFDHKDQINSIDLNVVRLCFQVFLEDENGRFTVPLPPVVSDP 168
Query: 213 IYDAK 217
IYD K
Sbjct: 169 IYDKK 173
>gnl|CDD|143640 cd07827, RHD-n, N-terminal sub-domain of the Rel homology domain
(RHD). Proteins containing the Rel homology domain
(RHD) are metazoan transcription factors. The RHD is
composed of two structural sub-domains; this model
characterizes the N-terminal sub-domain, which may be
distantly related to the DNA-binding domain found in
P53. The C-terminal sub-domain has an
immunoglobulin-like fold and serves as a dimerization
module that also binds DNA (see cd00102). The RHD is
found in NF-kappa B, nuclear factor of activated T-cells
(NFAT), the tonicity-responsive enhancer binding protein
(TonEBP), and the arthropod proteins Dorsal and Relish
(Rel).
Length = 174
Score = 187 bits (476), Expect = 1e-55
Identities = 87/184 (47%), Positives = 114/184 (61%), Gaps = 15/184 (8%)
Query: 39 PYIEILEQPKSKV-RFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
PY+EI EQPK + RFRYECEGRSAGSI G NST + K++P + +RNY G A +VVS V+
Sbjct: 1 PYLEITEQPKQRGHRFRYECEGRSAGSIPGENSTADRKTFPTVKLRNYNGPAKIVVSLVT 60
Query: 98 KDSPYRSHPHKLVSKDNCTNGIFCATLL--EGDMTYSFTNLGIQCMKKKDI-QAFRLYLQ 154
KD P + HPH+LV K +C +G+ C L + +MT SF NLGIQC++KKD+ +A +Q
Sbjct: 61 KDDPPKPHPHQLVGKTDCRDGV-CEVRLGPKNNMTASFNNLGIQCVRKKDVEEALGQRIQ 119
Query: 155 FCQRPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDNVIK-LEPVVSDII 213
PF K N ++DLN VRLCFQA++ L PV+S+ I
Sbjct: 120 LGIDPFMVHKGPE---------GNASDIDLNRVRLCFQAFIEDSDGGFTLPLPPVLSNPI 170
Query: 214 YDAK 217
YD K
Sbjct: 171 YDKK 174
>gnl|CDD|143645 cd07885, RHD-n_RelA, N-terminal sub-domain of the Rel homology
domain (RHD) of RelA. Proteins containing the Rel
homology domain (RHD) are metazoan transcription
factors. The RHD is composed of two structural
sub-domains; this model characterizes the N-terminal RHD
domain of the RelA family of transcription factors,
categorized as a class II member of the NF-kappa B
family. In class II NF-kappa Bs, the RHD domain
co-occurs with a C-terminal transactivation domain
(TAD). NF-kappa B proteins are part of a protein complex
that acts as a transcription factor, which is
responsible for regulating a host of cellular responses
to a variety of stimuli. This complex tightly regulates
the expression of a large number of genes, and is
involved in processes such as adaptive and innate
immunity, stress response, inflammation, cell adhesion,
proliferation and apoptosis. The cytosolic NF-kappa B
complex is activated via phosphorylation of the
ankyrin-repeat containing inhibitory protein I-kappa B,
which dissociates from the complex and exposes the
nuclear localization signal of the heterodimer (NF-kappa
B and Rel). RelA (also called p65) forms heterodimers
with NF-kappa B1 (p50) and B2 (p52). RelA also forms
homodimers.
Length = 169
Score = 154 bits (391), Expect = 8e-44
Identities = 83/179 (46%), Positives = 108/179 (60%), Gaps = 14/179 (7%)
Query: 39 PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
PY+EI+EQPK + +RFRY+CEGRSAGSI G ST K++P I I NY G + +S V+
Sbjct: 1 PYVEIIEQPKQRGMRFRYKCEGRSAGSIPGERSTDTTKTHPTIKINNYTGPGRVRISLVT 60
Query: 98 KDSPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKKDI-QAFRLYLQFC 156
KD P++ HPH+LV KD C +G + A L +SF NLGIQC+KK+D+ QA +Q
Sbjct: 61 KDPPHKPHPHELVGKD-CKDGYYEAELSPDRCIHSFQNLGIQCVKKRDLEQAVSQRIQTN 119
Query: 157 QRPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDNVIKLEPVVSDIIYD 215
PFN V R + DLNAVRLCFQ + ++ L PV+S IYD
Sbjct: 120 NNPFN-----------VPIEEQRADYDLNAVRLCFQVTVRDPSGRLLPLPPVLSQPIYD 167
>gnl|CDD|143643 cd07883, RHD-n_NFkB, N-terminal sub-domain of the Rel homology
domain (RHD) of nuclear factor of kappa light
polypeptide gene enhancer in B-cells (NF-kappa B).
Proteins containing the Rel homology domain (RHD) are
metazoan transcription factors. The RHD is composed of
two structural sub-domains; this model characterizes the
N-terminal RHD sub-domain of the NF-kappa B1 and B2
families of transcription factors, also referred to as
class I members of the NF-kappa B family. In class I
NF-kappa Bs, the RHD domain co-occurs with C-terminal
ankyrin repeats. Family members include NF-kappa B1 and
NF-kappa B2. NF-kappa B1 is commonly referred to as p105
or p50 (proteolytically processed form), while NF-kappa
B2 is called p100 or p52 (proteolytically processed
form). NF-kappa B proteins are part of a protein complex
that acts as a transcription factor, which is
responsible for regulating a host of cellular responses
to a variety of stimuli. This complex tightly regulates
the expression of a large number of genes, and is
involved in processes such as adaptive and innate
immunity, stress response, inflammation, cell adhesion,
proliferation and apoptosis. The cytosolic NF-kappa B
complex is activated via phosphorylation of the
ankyrin-repeat containing inhibitory protein I-kappa B,
which dissociates from the complex and exposes the
nuclear localization signal of the heterodimer (NF-kappa
B and REL). p105 and p100 may also act as I-kappa Bs due
to their C-terminal ankyrin repeats.
Length = 197
Score = 152 bits (386), Expect = 1e-42
Identities = 78/204 (38%), Positives = 102/204 (50%), Gaps = 32/204 (15%)
Query: 39 PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
PY+EILEQPK + RFRY CEG S G + GA+S KSYP + I NY G A +VV V+
Sbjct: 1 PYLEILEQPKQRGFRFRYGCEGPSHGGLPGASSEKNKKSYPTVKICNYQGPARIVVQLVT 60
Query: 98 KDSPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKKDI----------Q 147
P R H H LV K +C +GI + DMT F NLGI + KK++ Q
Sbjct: 61 NSEPPRLHAHSLVGK-HCEDGICTVQVGPKDMTAQFPNLGILHVTKKNVVETLEARLLAQ 119
Query: 148 AFRLYLQFCQRPFNSEKNSAWTLLGVMGFNNRDN-------------LDLNAVRLCFQAY 194
R Y ++E G + + +DL+ VRLCFQA+
Sbjct: 120 CTRGYNPGDLVHVDAEGG------GDRQLTDEEQAEIRQKAKQQAKSMDLSVVRLCFQAF 173
Query: 195 LPKDKDN-VIKLEPVVSDIIYDAK 217
LP + L+PV+SD IYD+K
Sbjct: 174 LPDSNGSFTRPLKPVISDAIYDSK 197
>gnl|CDD|143649 cd07933, RHD-n_c-Rel, N-terminal sub-domain of the Rel homology
domain (RHD) of c-Rel. Proteins containing the Rel
homology domain (RHD) are metazoan transcription
factors. The RHD is composed of two structural
sub-domains; this model characterizes the N-terminal RHD
sub-domain of the c-Rel family of transcription factors,
categorized as a class II member of the NF-kappa B
family. In class II NF-kappa Bs, the RHD domain
co-occurs with a C-terminal transactivation domain
(TAD). NF-kappa B proteins are part of a protein complex
that acts as a transcription factor, which is
responsible for regulating a host of cellular responses
to a variety of stimuli. This complex tightly regulates
the expression of a large number of genes, and is
involved in processes such as adaptive and innate
immunity, stress response, inflammation, cell adhesion,
proliferation and apoptosis. The cytosolic NF-kappa B
complex is activated via phosphorylation of the
ankyrin-repeat containing inhibitory protein I-kappa B,
which dissociates from the complex and exposes the
nuclear localization signal of the heterodimer (NF-kappa
B and Rel). c-Rel plays an important role in B cell
proliferation and survival.
Length = 172
Score = 145 bits (367), Expect = 2e-40
Identities = 75/180 (41%), Positives = 106/180 (58%), Gaps = 13/180 (7%)
Query: 39 PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
PY+EI EQP+ + +RFRY+CEGRSAGSI G ST ++YP I I NY G + ++ V+
Sbjct: 1 PYVEIFEQPRQRGMRFRYKCEGRSAGSIPGERSTDNNRTYPSIQILNYTGKGKVRITLVT 60
Query: 98 KDSPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKKDIQ-AFRLYLQFC 156
K+ PY+ HPH LV KD C +G + A +F NLGIQC+++++++ A L +
Sbjct: 61 KNEPYKPHPHDLVGKD-CRDGYYEAEFGPERRVLAFQNLGIQCVRRREVKEAIMLRISRG 119
Query: 157 QRPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDN-VIKLEPVVSDIIYD 215
PFN + + + DLN VRLCFQ +LP + N L P+VS+ IYD
Sbjct: 120 INPFNVPEEQLLQI---------EEYDLNVVRLCFQIFLPDEHGNYTTALPPIVSNPIYD 170
>gnl|CDD|143646 cd07886, RHD-n_RelB, N-terminal sub-domain of the Rel homology
domain (RHD) of the reticuloendotheliosis viral oncogene
homolog B (RelB) protein. Proteins containing the Rel
homology domain (RHD) are metazoan transcription
factors. The RHD is composed of two structural
sub-domains; this model characterizes the N-terminal RHD
sub-domain of the RelB family of transcription factors,
categorized as class II NF-kappa B family members. In
class II NF-kappa Bs, the RHD domain co-occurs with a
C-terminal transactivation domain (TAD). NF-kappa B
proteins are part of a protein complex that acts as a
transcription factor, which is responsible for
regulating a host of cellular responses to a variety of
stimuli. This complex tightly regulates the expression
of a large number of genes, and is involved in processes
such as adaptive and innate immunity, stress response,
inflammation, cell adhesion, proliferation and
apoptosis. The cytosolic NF-kappa B complex is activated
via phosphorylation of the ankyrin-repeat containing
inhibitory protein I-kappa B, which dissociates from the
complex and exposes the nuclear localization signal of
the heterodimer (NF-kappa B and Rel). RelB, is unable to
homodimerize but is a potent transactivator in a
heterodimer with NF-kappa B1 (p50) or B2 (p52). It is
involved in the regulation of genes that play roles in
inflammatory processes and the immune response.
Length = 172
Score = 144 bits (364), Expect = 6e-40
Identities = 82/184 (44%), Positives = 106/184 (57%), Gaps = 17/184 (9%)
Query: 39 PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVG--DAHLVVSC 95
P + I EQPK + +RFRYECEGRSAGSI+G +ST K+ P I I+N +G + + V
Sbjct: 1 PRLLITEQPKQRGMRFRYECEGRSAGSILGESSTEANKTQPAIEIQNCIGLKEVTVTVCL 60
Query: 96 VSKDSPYRSHPHKLVSKDNCTNGIFCATLL-EGDMTYSFTNLGIQCMKKKDIQ-AFRLYL 153
V KD P+R HPH LV KD C NGI TL +SF+NLGIQC++K++I+ A L
Sbjct: 61 VWKDPPHRVHPHGLVGKD-CPNGICQVTLNPHSSPRHSFSNLGIQCVRKREIEAAIETRL 119
Query: 154 QFCQRPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDNVIKLEPVVSDII 213
Q PF + N + +D+N VRLCFQA D L PV+S+ I
Sbjct: 120 QLNIDPFKAGS-----------LKNHEEVDMNVVRLCFQASYRDDDGRKDCLSPVLSEPI 168
Query: 214 YDAK 217
YD K
Sbjct: 169 YDKK 172
>gnl|CDD|143651 cd07935, RHD-n_NFkB1, N-terminal sub-domain of the Rel homology
domain (RHD) of nuclear factor of kappa B1 (NF-kappa
B1). Proteins containing the Rel homology domain (RHD)
are metazoan transcription factors. The RHD is composed
of two structural sub-domains; this model characterizes
the N-terminal RHD sub-domain of the NF-kappa B1 family
of transcription factors, a class I member of the
NF-kappa B family. In class I NF-kappa Bs, the RHD
domain co-occurs with C-terminal ankyrin repeats.
NF-kappa B1 is commonly referred to as p105 or p50
(proteolytically processed form). NF-kappa B proteins
are part of a protein complex that acts as a
transcription factor, which is responsible for
regulating a host of cellular responses to a variety of
stimuli. This complex tightly regulates the expression
of a large number of genes, and is involved in processes
such as adaptive and innate immunity, stress response,
inflammation, cell adhesion, proliferation and
apoptosis. The cytosolic NF-kappa B complex is activated
via phosphorylation of the ankyrin-repeat containing
inhibitory protein I-kappa B, which dissociates from the
complex and exposes the nuclear localization signal of
the heterodimer (NF-kappa B and REL). NF-kappa B1 is
involved in the canonical NF-kappa B signaling pathway
which is activated by many agonists and is essential in
immune and inflammatory responses, as well as cell
survival. p105 is involved in its own specific NF-kappa
B signaling pathway which is also implicated in immune
and inflammatory responses. p105 may also act as an
I-kappa B due to its C-terminal ankyrin repeats. It is
also involved in mitogen-activated protein kinase (MAPK)
signaling as its degradation leads to the activation of
TPL-2, a MAPK kinase kinase which activates ERK
pathways.
Length = 202
Score = 117 bits (295), Expect = 4e-30
Identities = 76/206 (36%), Positives = 97/206 (47%), Gaps = 31/206 (15%)
Query: 39 PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
PY++ILEQPK + RFRY CEG S G + GA+S KSYP++ I NYVG A ++V V+
Sbjct: 1 PYLQILEQPKQRGFRFRYVCEGPSHGGLPGASSEKNKKSYPQVKICNYVGPAKVIVQLVT 60
Query: 98 KDSPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKKD---------IQA 148
H H LV K +C +GI T DM F NLGI + KK +A
Sbjct: 61 NGKNIHLHAHSLVGK-HCEDGICTVTAGPKDMVVGFANLGILHVTKKKVFETLEARMTEA 119
Query: 149 FRL-------------YLQ---FCQRPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQ 192
+ YLQ R + V +DL+ VRL F
Sbjct: 120 CKKGYNPGLLVHPELAYLQAEGGGDRQLTEREKEIIRQAAVQ---QTKEMDLSVVRLMFT 176
Query: 193 AYLPKDKDNVIK-LEPVVSDIIYDAK 217
A+LP + LEPVVSD IYD+K
Sbjct: 177 AFLPDSTGGFTRRLEPVVSDAIYDSK 202
>gnl|CDD|143650 cd07934, RHD-n_NFkB2, N-terminal sub-domain of the Rel homology
domain (RHD) of nuclear factor kappa B2 (NF-kappa B2).
Proteins containing the Rel homology domain (RHD) are
metazoan transcription factors. The RHD is composed of
two structural sub-domains; this model characterizes the
N-terminal RHD sub-domain of the NF-kappa B2 family of
transcription factors, a class I member of the NF-kappa
B family. In class I NF-kappa Bs, the RHD domain
co-occurs with C-terminal ankyrin repeats. NF-kappa B2
is commonly referred to as p100 or p52 (proteolytically
processed form). NF-kappa B proteins are part of a
protein complex that acts as a transcription factor,
which is responsible for regulating a host of cellular
responses to a variety of stimuli. This complex tightly
regulates the expression of a large number of genes, and
is involved in processes such as adaptive and innate
immunity, stress response, inflammation, cell adhesion,
proliferation and apoptosis. The cytosolic NF-kappa B
complex is activated via phosphorylation of the
ankyrin-repeat containing inhibitory protein I-kappa B,
which dissociates from the complex and exposes the
nuclear localization signal of the heterodimer (NF-kappa
B and REL). NF-kappa B2 is involved in the alternative
NF-kappa B signaling pathway which is activated by few
agonists and plays an important role in secondary
lymphoid organogenesis, maturation of B-cells, and
adaptive humoral immunity. p100 may also act as an
I-kappa B due to its C-terminal ankyrin repeats.
Length = 185
Score = 111 bits (278), Expect = 4e-28
Identities = 69/185 (37%), Positives = 95/185 (51%), Gaps = 6/185 (3%)
Query: 39 PYIEILEQPKSK-VRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
PY+ I+EQPK + RFRY CEG S G + GA+S K+YP + I NYVG A + V V+
Sbjct: 1 PYLVIIEQPKQRGFRFRYVCEGPSHGGLPGASSEKGRKTYPTVKICNYVGMARIEVDLVT 60
Query: 98 KDSPYRSHPHKLVSKDNCTNGIFCATLLEGDMTYSFTNLGIQCMKKK---DIQAFRLYLQ 154
P R H H LV K +G + DMT F+NLGI + KK +I +L Q
Sbjct: 61 HTDPPRVHAHSLVGKHCNESGNCSVDVGPKDMTAQFSNLGILHVTKKNMMEILKEKLKRQ 120
Query: 155 FCQ-RPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDN-VIKLEPVVSDI 212
+ + L + +DL+ VRL F AYL + + L+PV+SD
Sbjct: 121 KLRNTGPYKLTEAEERELEQEAKELKKVMDLSIVRLKFTAYLRDSNGSYTLALKPVISDP 180
Query: 213 IYDAK 217
I+D+K
Sbjct: 181 IHDSK 185
>gnl|CDD|238582 cd01177, IPT_NFkappaB, IPT domain of the transcription factor
NFkappaB and related transcription factors. NFkappaB is
considered a central regulator of stress responses,
activated by different stressful conditions, including
physical stress, oxidative stress, and exposure to
certain chemicals. NFkappaB blocking cell apoptosis in
several cell types, gives it an important role in cell
proliferation and differentiation.
Length = 102
Score = 98.9 bits (247), Expect = 8e-25
Identities = 44/102 (43%), Positives = 57/102 (55%), Gaps = 4/102 (3%)
Query: 222 LTIHTLSHVSAPVVGDMKMIILCDKVNKDDIEVRFYEEQDGVVVWDER--VKTLEVHKQY 279
L I L S V G ++ +LCDKV K+DI+VRF+EE + VW+ +VH+QY
Sbjct: 1 LKICRLDKTSGSVKGGDEVYLLCDKVQKEDIQVRFFEEDEEETVWEAFGDFSQTDVHRQY 60
Query: 280 AIVLKTPAYKSFEINDPVHVKIQLVTKKD--ISEPYNFMLTP 319
AIV +TP Y +I +PV VKIQL D SE F P
Sbjct: 61 AIVFRTPPYHDPDITEPVKVKIQLKRPSDGERSESVPFTYVP 102
>gnl|CDD|143644 cd07884, RHD-n_Relish, N-terminal sub-domain of the Rel homology
domain (RHD) of the arthropod protein Relish. Proteins
containing the Rel homology domain (RHD) are metazoan
transcription factors. The RHD is composed of two
structural sub-domains; this model characterizes the
N-terminal RHD sub-domain of the arthropod Relish
protein, in which the RHD domain co-occurs with
C-terminal ankyrin repeats. Family members are sometimes
referred to as p110 or p68 (proteolytically processed
form). Relish is an NF-kappa B-like transcription
factor, which plays a role in mediating innate immunity
in Drosophila. It is activated via the Imd (immune
deficiency) pathway, which triggers phosphorylation of
Relish. IKK-dependent proteolytic cleavage of Relish
(which involves Dredd) results in a smaller active form
(without the C-terminal ankyrin repeats), which is
transported into the nucleus and functions as a
transactivator.
Length = 159
Score = 93.6 bits (233), Expect = 3e-22
Identities = 54/186 (29%), Positives = 81/186 (43%), Gaps = 34/186 (18%)
Query: 39 PYIEILEQPKSKVRFRYECEGR-SAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVS 97
P++ I+EQP K RFRY+ E + GS++G ST K++P + + NY G A + S
Sbjct: 1 PFLRIVEQPVDKFRFRYKSEMHGTHGSLLGERSTSSKKTFPTVKLCNYRGQAVIRCSLYQ 60
Query: 98 KDSPYRS-HPHKLVSKDNCTNGIFCATLL----EGDMTYSFTNLGIQCMKKKDIQAFRLY 152
D R H HKLV K + + + EGD F N+GI KK+I
Sbjct: 61 ADDNRRKPHVHKLVGKQ-GDDDVCDPHDIEVSPEGDYVAMFQNMGIIHTAKKNI--PEEL 117
Query: 153 LQFCQRPFNSEKNSAWTLLGVMGFNNRDNLDLNAVRLCFQAYLPKDKDNVIKL-EPVVSD 211
+ + N++LN V L FQA+ ++ + PV S+
Sbjct: 118 YK------------------------KKNMNLNQVVLRFQAFAVSANGHLRPICPPVYSN 153
Query: 212 IIYDAK 217
I + K
Sbjct: 154 PINNLK 159
>gnl|CDD|238336 cd00602, IPT_TF, IPT domain of eukaryotic transcription factors
NF-kappaB/Rel, nuclear factor of activated Tcells
(NFAT), and recombination signal J-kappa binding
protein (RBP-Jkappa). The IPT domains in these proteins
are involved in DNA binding. Most NF-kappaB/Rel proteins
form homo- and heterodimers, while NFAT proteins are
largely monomeric (with TonEBP being an exception).
While the majority of sequence-specific DNA binding
elements are found in the N-terminal domain, several are
found in the IPT domain in loops adjacent to, and
including, the linker region.
Length = 101
Score = 82.3 bits (204), Expect = 4e-19
Identities = 45/102 (44%), Positives = 56/102 (54%), Gaps = 5/102 (4%)
Query: 222 LTIHTLSHVSAPVVGDMKMIILCDKVNKDDIEVRFYEEQDGVVVWDER--VKTLEVHKQY 279
L I +S +S V G ++ +LCDKVNK DI+V F E+ G VW+ + +V Q
Sbjct: 1 LPICRVSSLSGSVNGGDEVFLLCDKVNKPDIKVWFGEKGPGETVWEAEAMFRQEDVR-QV 59
Query: 280 AIVLKTPAYKSFEINDPVHVKIQLVTKKDI--SEPYNFMLTP 319
AIV KTP Y + I PV V IQLV D SEP F TP
Sbjct: 60 AIVFKTPPYHNKWITRPVQVPIQLVRPDDRKRSEPLTFTYTP 101
>gnl|CDD|214657 smart00429, IPT, ig-like, plexins, transcription factors.
Length = 90
Score = 40.9 bits (96), Expect = 1e-04
Identities = 28/96 (29%), Positives = 31/96 (32%), Gaps = 10/96 (10%)
Query: 221 DLTIHTLSHVSAPVVGDMKMIILCDKVNKDDIEVRFYEEQDGVVVWDERVKTLEVHKQYA 280
D I +S S PV G + I LC K K V T A
Sbjct: 1 DPVITRISPTSGPVSGGTE-ITLCGKNLKSISVVFVEVGVGEAPC------TFSPSSSTA 53
Query: 281 IVLKTPAYKSFEINDPVHV-KIQLVTKKDISEPYNF 315
IV KTP Y I V V + L S P F
Sbjct: 54 IVCKTPPYH--NIPGSVPVRTVGLRNGGVPSSPQPF 87
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 37.0 bits (86), Expect = 0.005
Identities = 19/46 (41%), Positives = 20/46 (43%), Gaps = 7/46 (15%)
Query: 516 EDVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRP 561
E PP D P P A PP P+A PA K PPPRP
Sbjct: 68 EPAAPPPDPEPPVPGPAGPPS-PLAPP------APARKPPLPPPRP 106
Score = 32.3 bits (74), Expect = 0.20
Identities = 26/88 (29%), Positives = 30/88 (34%), Gaps = 16/88 (18%)
Query: 514 CYEDVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRP-PSHTKPNLATL 572
+ D PP P PP PPV V V+PP S+ PPP P P P
Sbjct: 35 AHPDDPPPVGDPRPPVVDTPPPVSAVW------VLPPPSEPAAPPPDPEPPVPGP----- 83
Query: 573 QRKSTSDKFDSVSTHDDNIPPLPPKRSR 600
PPLPP R +
Sbjct: 84 ----AGPPSPLAPPAPARKPPLPPPRPQ 107
>gnl|CDD|238050 cd00102, IPT, Immunoglobulin-like fold, Plexins, Transcription
factors (IPT). IPTs are also known as Transcription
factor ImmunoGlobin (TIG) domains. They are present in
intracellular transcription factors, cell surface
receptors (such as plexins and scatter factor
receptors), as well as, cyclodextrin glycosyltransferase
and similar enzymes. Although they are involved in DNA
binding in transcription factors, their function in
other proteins is unknown. In these transcription
factors, IPTs form homo- or heterodimers with the
exception of the nuclear factor of activated Tcells
(NFAT) transcription factors which are mainly monomers.
Length = 89
Score = 35.9 bits (83), Expect = 0.008
Identities = 23/95 (24%), Positives = 31/95 (32%), Gaps = 11/95 (11%)
Query: 222 LTIHTLSHVSAPVVGDMKMIILCDKVNKD-DIEVRFYEEQDGVVVWDERVKTLEVHKQYA 280
I ++S S PV G ++ I ++ V F L V A
Sbjct: 1 PVITSISPSSGPVSGGTEVTITGSNFGSGSNLRVTFGGGVP--------CSVLSVS-STA 51
Query: 281 IVLKTPAYKSFEINDPVHVKIQLVTKKDISEPYNF 315
IV TP Y + PV V + S P F
Sbjct: 52 IVCTTPPYAN-PGPGPVEVTVDRGNGGITSSPLTF 85
>gnl|CDD|238583 cd01178, IPT_NFAT, IPT domain of the NFAT family of transcription
factors. NFAT transcription complexes are a target of
calcineurin, a calcium dependent phosphatase, and
activate genes mainly involved in cell-cell-interaction.
Length = 101
Score = 35.9 bits (83), Expect = 0.009
Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 4/92 (4%)
Query: 231 SAPVVGDMKMIILCDKVNKDDIEVRFYEE-QDGVVVWDERVKT-LEVHKQYAIVLKTPAY 288
S V G ++ + KD +V F E+ QDG W+ E Q +V++ P Y
Sbjct: 11 SCSVNGGEELFLTGKNFLKDS-KVVFQEKGQDGEAQWEAEATIDKEKSHQNHLVVEVPPY 69
Query: 289 KSFEINDPVHVKIQLVT-KKDISEPYNFMLTP 319
+ + PV V+ +V K+ S+P F TP
Sbjct: 70 HNKHVAAPVQVQFYVVNGKRKRSQPQTFTYTP 101
>gnl|CDD|237782 PRK14666, uvrC, excinuclease ABC subunit C; Provisional.
Length = 694
Score = 36.8 bits (85), Expect = 0.047
Identities = 22/102 (21%), Positives = 32/102 (31%), Gaps = 16/102 (15%)
Query: 510 SGGECYEDVLPPQDIPTPP-PEVATPPVKPVALSNVKVVIPPASKVTTPP--PRPPSHTK 566
C + L P P PE ++ PV PVA + P + T P SH
Sbjct: 329 PTAVCTDAGLLPDTPLLPDAPEGSSDPVVPVAAATPVDASLPDVRTGTAPTSLANVSHAD 388
Query: 567 PNLATLQRKSTSDKFDSVSTHDDNIPPLPPKRSRKSGLDKNL 608
P + T + PK + L++ L
Sbjct: 389 PA-------------VAQPTQAATLAGAAPKGATHLMLEETL 417
>gnl|CDD|143648 cd07927, RHD-n_NFAT_like, N-terminal sub-domain of the Rel homology
domain (RHD) of nuclear factor of activated T-cells
(NFAT) proteins and similar proteins. Proteins
containing the Rel homology domain (RHD) are metazoan
transcription factors. The RHD is composed of two
structural sub-domains; this model characterizes the
N-terminal RHD sub-domain of the NFAT family of
transcription factors. NFAT transcription complexes are
a target of calcineurin, a calcium dependent
phosphatase, and activate genes that are mainly involved
in cell-cell interaction. Upon de-phosphorylation of the
nuclear localization signal, NFAT enters the nucleus and
acts as a transcription factor; its export from the
nucleus is triggered by phosphorylation via export
kinases. NFATs play important roles in mediating the
immune response, and are found in T cells, B Cells, NK
cells, mast cells, and monocytes. NFATs are also found
in various non-hematopoietic cell types, where they play
roles in development. This group also contains the
N-terminal RHD sub-domain of the non-calcium regulated
tonicity-responsive enhancer binding protein (TonEBP),
also called NFAT5. Mammalian TonEBP regulates the
expression of genes in response to tonicity. It plays a
pivotal role in urinary concentrating mechanisms in
kidney medulla, by triggering the accumulation of
osmolytes that enable renal medullary cells to tolerate
high levels of urea and salt.
Length = 161
Score = 34.9 bits (80), Expect = 0.051
Identities = 26/123 (21%), Positives = 45/123 (36%), Gaps = 18/123 (14%)
Query: 39 PYIEILEQPKSKVRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVSK 98
+ I QP+ R RYE EG S G++ ++ +P + + Y+ L V +
Sbjct: 1 YELRIEVQPEPHHRARYETEG-SRGAVKAPST----GGFPTVKLHGYMEPVGLQVFIGTA 55
Query: 99 DSPYRSHPHKLVSKDNCTNGIFC-------ATLLE------GDMTYSFTNLGIQCMKKKD 145
+ H V + C +LE +MT + GI ++ D
Sbjct: 56 SGRLKPHAFYQVHRITGKTTTPCKEKIIGNTKVLEIPLEPKNNMTATIDCAGILKLRNAD 115
Query: 146 IQA 148
I+
Sbjct: 116 IEL 118
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 36.5 bits (84), Expect = 0.063
Identities = 14/51 (27%), Positives = 17/51 (33%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLA 570
+ PP + PP PP PPP PP +P LA
Sbjct: 2894 STESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLA 2944
Score = 34.5 bits (79), Expect = 0.25
Identities = 23/99 (23%), Positives = 33/99 (33%), Gaps = 5/99 (5%)
Query: 519 LPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPP-----SHTKPNLATLQ 573
LPP P P P +L V P PP R P + +P + L
Sbjct: 2828 LPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA 2887
Query: 574 RKSTSDKFDSVSTHDDNIPPLPPKRSRKSGLDKNLPPVP 612
R + S +S + D P ++ + PP P
Sbjct: 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPP 2926
Score = 33.4 bits (76), Expect = 0.60
Identities = 18/78 (23%), Positives = 27/78 (34%), Gaps = 5/78 (6%)
Query: 521 PQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPP--RPPSHTKPNLATLQRKSTS 578
P +PTP P P + + P + PPP +PP+ +T
Sbjct: 412 PASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATR 471
Query: 579 DKFDSVSTHDDNIPPLPP 596
D++ PP PP
Sbjct: 472 KALDALRERR---PPEPP 486
Score = 33.0 bits (75), Expect = 0.76
Identities = 19/55 (34%), Positives = 21/55 (38%), Gaps = 2/55 (3%)
Query: 519 LPPQDIPTPPPEVATPP--VKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLAT 571
LP P PP P P A S + PP S T PP PP P+L
Sbjct: 2798 LPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPL 2852
Score = 32.2 bits (73), Expect = 1.3
Identities = 28/141 (19%), Positives = 36/141 (25%), Gaps = 24/141 (17%)
Query: 520 PPQDIPTPPPEVATPPVKPV---------ALSNVKVVIPPASKVTTPPPR--PPSHTKPN 568
P PP V P P + PPA+ PP R P+ +
Sbjct: 2733 PALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLS 2792
Query: 569 LATLQRKSTSDKFDSVSTHDDNIPPLPPKRSRKSGLDKNLPPVPKYKSFFHKLFFKSKSK 628
+ S D D + LPP S L P +
Sbjct: 2793 ESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQP-------------TAP 2839
Query: 629 SKPATPRERSLSIDSRKVGGG 649
P P SL + GG
Sbjct: 2840 PPPPGPPPPSLPLGGSVAPGG 2860
Score = 31.8 bits (72), Expect = 1.7
Identities = 18/92 (19%), Positives = 26/92 (28%), Gaps = 7/92 (7%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSD 579
P P PP A P P P PP RP P + R++
Sbjct: 2611 PAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRL 2670
Query: 580 KFDSVSTHDDNIPPLPPKRSRKSGLDKNLPPV 611
+ PP+R R+ + +
Sbjct: 2671 GRAAQ-------ASSPPQRPRRRAARPTVGSL 2695
Score = 29.9 bits (67), Expect = 5.9
Identities = 19/81 (23%), Positives = 29/81 (35%), Gaps = 7/81 (8%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIP---PASKVTTPPPRP----PSHTKPNLATL 572
P PT P A P V + ++P + P P P P+ + P L
Sbjct: 2940 QPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
Query: 573 QRKSTSDKFDSVSTHDDNIPP 593
S S++ H++ PP
Sbjct: 3000 SLSRVSSWASSLALHEETDPP 3020
Score = 29.5 bits (66), Expect = 7.4
Identities = 14/77 (18%), Positives = 20/77 (25%), Gaps = 3/77 (3%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSD 579
P + +P + A P PV V PP +P +P +
Sbjct: 2867 PSR---SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQP 2923
Query: 580 KFDSVSTHDDNIPPLPP 596
PP P
Sbjct: 2924 PPPPQPQPPPPPPPRPQ 2940
Score = 29.5 bits (66), Expect = 7.9
Identities = 10/72 (13%), Positives = 20/72 (27%), Gaps = 2/72 (2%)
Query: 525 PTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSDKFDSV 584
+ PP+ + ++ + P TP P P + + ++
Sbjct: 2676 ASSPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL 2735
Query: 585 STHDDNIPPLPP 596
PP P
Sbjct: 2736 PA--APAPPAVP 2745
Score = 29.1 bits (65), Expect = 9.9
Identities = 11/49 (22%), Positives = 13/49 (26%), Gaps = 1/49 (2%)
Query: 520 PPQDIPTPPPEVATPPVK-PVALSNVKVVIPPASKVTTPPPRPPSHTKP 567
PP P P P P + + P PP P P
Sbjct: 2703 PPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATP 2751
>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator. In
eukaryotes, this family of proteins induces
mitochondrial fission.
Length = 248
Score = 35.1 bits (81), Expect = 0.093
Identities = 17/86 (19%), Positives = 28/86 (32%), Gaps = 13/86 (15%)
Query: 517 DVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPP-------SHTKPNL 569
D+L + + P P ++V PP PPP PP S + +L
Sbjct: 154 DLLSSDESVPSSSTTSFPISPPTEEPVLEVPPPP------PPPPPPPPPSLQQSTSAIDL 207
Query: 570 ATLQRKSTSDKFDSVSTHDDNIPPLP 595
++ S ++ P P
Sbjct: 208 IKERKGQRSAAGKTLVLSKPKSPEFP 233
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 33.8 bits (78), Expect = 0.18
Identities = 21/65 (32%), Positives = 28/65 (43%), Gaps = 15/65 (23%)
Query: 517 DVLPP--QDIPTPPPE----------VATPPVKPVALS--NVKVVIPPASKVTTPPPRPP 562
D++P Q +PT PPE A P + P ++ N V PA V P P+P
Sbjct: 54 DMMPAATQALPTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPEPA-PVEPPKPKPV 112
Query: 563 SHTKP 567
KP
Sbjct: 113 EKPKP 117
Score = 33.4 bits (77), Expect = 0.29
Identities = 12/51 (23%), Positives = 15/51 (29%), Gaps = 3/51 (5%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVT---TPPPRPPSHTKP 567
P + P V P KPV K + P P+P K
Sbjct: 93 PNTPVEPEPAPVEPPKPKPVEKPKPKPKPQQKVEAPPAPKPEPKPVVEEKA 143
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as a
single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through DAZ
repeats.
Length = 136
Score = 32.5 bits (74), Expect = 0.23
Identities = 19/81 (23%), Positives = 30/81 (37%), Gaps = 9/81 (11%)
Query: 519 LPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTS 578
+P P P P + P + +A+ P A PP P +T+ +
Sbjct: 24 MPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPPV----YPPGSTVLVQGGF 79
Query: 579 D---KFDSVSTHDDNIPPLPP 596
D +F + +IPP PP
Sbjct: 80 DAGARFGPGTG--SSIPPPPP 98
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 33.6 bits (77), Expect = 0.35
Identities = 16/83 (19%), Positives = 21/83 (25%), Gaps = 14/83 (16%)
Query: 521 PQDIPTPPPEVATPPVKPVALSNVKVVIP--PASKVTTPPPRPPSHTKPNLATLQRKSTS 578
P P PP +PV + +P P + P + K
Sbjct: 385 PSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPESAPKLTRAAIPVDEKPKY 444
Query: 579 DKFDSVSTHDDNIPPLPPKRSRK 601
PP PPK K
Sbjct: 445 T------------PPAPPKEEEK 455
Score = 29.0 bits (65), Expect = 9.0
Identities = 13/77 (16%), Positives = 22/77 (28%), Gaps = 2/77 (2%)
Query: 526 TPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSDKFDSVS 585
P P A S V+ P+++ +P T V+
Sbjct: 361 VPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPP--RPVA 418
Query: 586 THDDNIPPLPPKRSRKS 602
+ P PK +R +
Sbjct: 419 PPVPHTPESAPKLTRAA 435
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 33.6 bits (77), Expect = 0.39
Identities = 17/93 (18%), Positives = 27/93 (29%), Gaps = 6/93 (6%)
Query: 521 PQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPR-PPSHTKPNLATLQRKSTSD 579
P A A + V + P + V PP P +P ++K
Sbjct: 402 PSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVS 461
Query: 580 KFDSVSTHDDNIPPLPPKRSRKSGLDKNLPPVP 612
K S+ L P + + N+ P
Sbjct: 462 KVSSLG-----PSTLRPIQEKAEQATGNIKEAP 489
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 32.8 bits (75), Expect = 0.43
Identities = 16/48 (33%), Positives = 19/48 (39%), Gaps = 2/48 (4%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKP 567
PP++ P PP E TPP K K P P+P KP
Sbjct: 62 PPEEQPKPPTEPETPPEPTPPKPKEKPKPEK--KPKKPKPKPKPKPKP 107
>gnl|CDD|239125 cd02660, Peptidase_C19D, A subfamily of Peptidase C19. Peptidase
C19 contains ubiquitinyl hydrolases. They are
intracellular peptidases that remove ubiquitin molecules
from polyubiquinated peptides by cleavage of isopeptide
bonds. They hydrolyze bonds involving the carboxyl group
of the C-terminal Gly residue of ubiquitin. The purpose
of the de-ubiquitination is thought to be editing of the
ubiquitin conjugates, which could rescue them from
degradation, as well as recycling of the ubiquitin. The
ubiquitin/proteasome system is responsible for most
protein turnover in the mammalian cell, and with over 50
members, family C19 is one of the largest families of
peptidases in the human genome.
Length = 328
Score = 33.1 bits (76), Expect = 0.43
Identities = 10/34 (29%), Positives = 16/34 (47%)
Query: 380 RVDKSVNINMKPHVTFSQHDTDMETNQVPDVLFD 413
V + +NM P+ + S DT + PD +D
Sbjct: 241 YVQFPLELNMTPYTSSSIGDTQDSNSLDPDYTYD 274
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 32.9 bits (75), Expect = 0.62
Identities = 37/195 (18%), Positives = 56/195 (28%), Gaps = 19/195 (9%)
Query: 433 KDTLEEKNDFLLIEDSNNAVITLDELDDLNNNVNESSVTIEEVHEFPKEHLGQRTEGYGS 492
D + + D +I NA +L NES I +F + ++ Q +
Sbjct: 178 ADDVVNRADPNIITRIKNAEPK-TIATELVQESNESEPVITNEQDFDRLNIDQLVDYINK 236
Query: 493 YSSLQLAMKNQYFID--RNSGGECYEDVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPP 550
+ Q +D RN + P P P TP P + +
Sbjct: 237 NNDGQFNFDAHDSVDDVRNFAKTVWRQKNTPAQTPMPERSWQTPAQTPARRISTP--MTE 294
Query: 551 ASKVTTPPPRPPSHTKPNLAT------------LQRKSTSDKFDSVST--HDDNIPPLPP 596
K P + P+ + L+ + SD F S D P
Sbjct: 295 EIKSWQTPLQTPAMYSSDYQAPKPEPIYTWEELLRERFPSDLFAISSLPDSDSEASDSGP 354
Query: 597 KRSRKSGLDKNLPPV 611
R RK LP
Sbjct: 355 TRKRKRRRVPPLPEY 369
>gnl|CDD|177697 PLN00067, PLN00067, PsbP domain-containing protein 6; Provisional.
Length = 263
Score = 32.2 bits (73), Expect = 0.70
Identities = 25/103 (24%), Positives = 38/103 (36%), Gaps = 8/103 (7%)
Query: 532 ATPPVKPVALSNVKVVIPPASKVTTPPP--RPPSHTKPNLATLQRKSTSDKFDSVSTHDD 589
AT + P++LS AS + P + + +R+
Sbjct: 2 ATASLSPLSLSFSVSSSTSASSSASSPLAVASSVSPRAAVVIHRRELL---LGLALAPLI 58
Query: 590 NIPPLPPKRSRKSGLDKNLPPVPKYKSFFHKLFFKSKSKSKPA 632
I P PP +R+ + LPP P SF + FK+ K PA
Sbjct: 59 LIAPEPPAEAREVEVGSYLPPSPSDPSF---VLFKASPKDTPA 98
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 32.6 bits (75), Expect = 0.72
Identities = 17/83 (20%), Positives = 25/83 (30%), Gaps = 4/83 (4%)
Query: 525 PTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPN-LATLQRKSTSDKFDS 583
P P A + + TPPP PP+ T+ + A Q + S
Sbjct: 520 TAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPP 579
Query: 584 VSTHDDNIPPLPPKRSRKSGLDK 606
+ P S +DK
Sbjct: 580 IPEEPTPSPTKD---SSPEEIDK 599
Score = 30.7 bits (70), Expect = 3.1
Identities = 13/53 (24%), Positives = 18/53 (33%), Gaps = 3/53 (5%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATL 572
P TP P S IP ++ T P P PP++ +L
Sbjct: 390 IQPSAPKTKQAATTPSPPPAKASPP---IPVPAEPTEPSPTPPANAANAPPSL 439
>gnl|CDD|173133 PRK14670, uvrC, excinuclease ABC subunit C; Provisional.
Length = 574
Score = 32.2 bits (73), Expect = 0.93
Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 4/78 (5%)
Query: 407 VPDVLFDTKEKIDERFEELLNEVSNLKDTL--EEKNDFLLIED--SNNAVITLDELDDLN 462
VPD + K+ + +L+NE+ N K + +E + + I + +NA + L E ++
Sbjct: 282 VPDKIHIFKKIDTKNITKLINELKNTKTEIIYKETKEIIKIMEMAISNAKLALREYENEK 341
Query: 463 NNVNESSVTIEEVHEFPK 480
N ES I E+ + PK
Sbjct: 342 NKALESLKIILEMDKLPK 359
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 32.4 bits (73), Expect = 1.1
Identities = 25/105 (23%), Positives = 32/105 (30%), Gaps = 19/105 (18%)
Query: 521 PQDIPTPP---PEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKST 577
P I PP + PP P A V P + P P+ PS +L +
Sbjct: 178 PPSIQVPPGAALAPSAPPPTPSA-QAVPPQGSPIAAQPAPQPQQPSPL-----SLISAPS 231
Query: 578 SDKFDSVSTHDDNIP----------PLPPKRSRKSGLDKNLPPVP 612
S H P P P R +S PP+P
Sbjct: 232 LHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMP 276
Score = 30.8 bits (69), Expect = 3.5
Identities = 23/89 (25%), Positives = 34/89 (38%), Gaps = 5/89 (5%)
Query: 525 PTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSDKFDSV 584
P PP + PP AL+ P+++ PP P +P Q S +
Sbjct: 175 PQGPPSIQVPPG--AALAPSAPPPTPSAQ-AVPPQGSPIAAQPAPQPQQPSPLSLI-SAP 230
Query: 585 STHDDNIP-PLPPKRSRKSGLDKNLPPVP 612
S H +P P PP + + + PP P
Sbjct: 231 SLHPQRLPSPHPPLQPQTASQQSPQPPAP 259
Score = 30.4 bits (68), Expect = 4.3
Identities = 23/102 (22%), Positives = 35/102 (34%), Gaps = 12/102 (11%)
Query: 519 LPPQDIPTPPPEVATPPVKPVALSN--VKVVIPPASKVTTPPPRPPSHTKPNLATLQRKS 576
LPP+ P + + P + + +P PPP P+ T S
Sbjct: 431 LPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAI----GPPPSLPTSTPAAPPRASSGS 486
Query: 577 TSDKFDSVSTHDD--NIPPLPPKRSRKSGLDK----NLPPVP 612
S+ PPLPP + ++ LD+ PP P
Sbjct: 487 QPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAEEPESPPPP 528
>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI. This minor capsid
protein may act as a link between the external capsid
and the internal DNA-protein core. The C-terminal 11
residues may function as a protease cofactor leading to
enzyme activation.
Length = 238
Score = 31.7 bits (72), Expect = 1.2
Identities = 14/77 (18%), Positives = 22/77 (28%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSD 579
++ P P E P++ + V PP KP A ++ S
Sbjct: 110 LGEEEPAPQEETVADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEPVDSM 169
Query: 580 KFDSVSTHDDNIPPLPP 596
+ LPP
Sbjct: 170 AIAVPAIDTPVTLELPP 186
Score = 29.4 bits (66), Expect = 6.3
Identities = 14/64 (21%), Positives = 19/64 (29%), Gaps = 6/64 (9%)
Query: 520 PPQDIPT--PPPEVATPPVKPVA----LSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQ 573
PP T P P PV +A + V + PPP P + +
Sbjct: 148 PPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELPPAPQPPPPVVPQPSTMVVHRRS 207
Query: 574 RKST 577
R
Sbjct: 208 RIKR 211
Score = 29.0 bits (65), Expect = 7.7
Identities = 15/54 (27%), Positives = 18/54 (33%), Gaps = 2/54 (3%)
Query: 516 EDVLPPQDIPTPPPEVATPPVK-PVALSNVKVVIP-PASKVTTPPPRPPSHTKP 567
E+VL P P E P PV + I PA PP+ P
Sbjct: 138 EEVLVPAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELPPAPQPP 191
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 32.2 bits (73), Expect = 1.2
Identities = 16/73 (21%), Positives = 25/73 (34%), Gaps = 5/73 (6%)
Query: 519 LPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTS 578
P D+P P P KP A +P + TP +PP A L +
Sbjct: 394 APEPDLPQPD--RHPGPAKPEAPGARPAELPSPASAPTPEQQPPV---ARSAPLPPSPQA 448
Query: 579 DKFDSVSTHDDNI 591
+V++ +
Sbjct: 449 SAPRNVASGKPGV 461
>gnl|CDD|236327 PRK08661, PRK08661, prolyl-tRNA synthetase; Provisional.
Length = 477
Score = 31.6 bits (73), Expect = 1.3
Identities = 27/86 (31%), Positives = 36/86 (41%), Gaps = 18/86 (20%)
Query: 401 DMETNQVPDVLF--DTKEKIDERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDEL 458
D+E N V VL DT EK +EL+ +V + LEE + L A
Sbjct: 355 DLENNTV--VLVRRDTLEKETVPLDELVEKV---PELLEEIQENLY----EKA------K 399
Query: 459 DDLNNNVNESSVTIEEVHEFPKEHLG 484
+ L N E T+EE E +E G
Sbjct: 400 EFLEENTVEVD-TLEEFKEAIEEKGG 424
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 31.0 bits (70), Expect = 1.4
Identities = 33/173 (19%), Positives = 58/173 (33%), Gaps = 23/173 (13%)
Query: 409 DVLFDTKEKIDERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDELDDLNNNVNES 468
D L K++I E L +E +LK+ +E + LL E + +EL ++ ++N+
Sbjct: 11 DELEKLKDEIKAEEEALESEEKHLKEYDKEM-EELLEEKMQHV----EELRQIHADINDM 65
Query: 469 SVTIEEVHEFPKEHLGQRTEGYGSYSSLQLAMKNQYFIDRNSGGECYEDVLP-------- 520
I++ + +G+ +G Y + I+ E + LP
Sbjct: 66 ETEIKQSKSELERRMGKIARMHGEY------NPLKESINEMRKLELGLEELPQLHEEERQ 119
Query: 521 PQDIPTPP-PEVATPPVKP---VALSNVKVVIPPASKVTTPPPRPPSHTKPNL 569
T P P V P +N + P H P
Sbjct: 120 KISPATSPVPPVPLPDPPAFPSTLPANAAAAAAAQQQRDVPQNLINRHRMPPS 172
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 31.6 bits (72), Expect = 1.5
Identities = 17/70 (24%), Positives = 25/70 (35%), Gaps = 11/70 (15%)
Query: 525 PTPPPEVATPPVKPVALSNVKVVIPPASKV-----TTPPPRPPSHT------KPNLATLQ 573
P + +P LS+ PP + TPPP SH+ +P + +
Sbjct: 297 GVEPEPTGSVSDRPRHLSSDSSPSPPDTSDSDPSTETPPPASLSHSPPAAFERPLALSPK 356
Query: 574 RKSTSDKFDS 583
RK DK
Sbjct: 357 RKREGDKKQK 366
>gnl|CDD|237455 PRK13637, cbiO, cobalt transporter ATP-binding subunit;
Provisional.
Length = 287
Score = 31.2 bits (71), Expect = 1.6
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 261 DGVVVWDERVKTLEVHKQYAIVLKTPAYKSFE 292
DGV + D++VK ++ K+ +V + P Y+ FE
Sbjct: 67 DGVDITDKKVKLSDIRKKVGLVFQYPEYQLFE 98
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 31.8 bits (72), Expect = 1.8
Identities = 17/58 (29%), Positives = 21/58 (36%), Gaps = 9/58 (15%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVT--TPPPRPPSHTKPNLATLQRK 575
P D + PP A PP P A +PP PPR P +A L+
Sbjct: 906 PGGDAASAPPPGAGPPAPPQA-------VPPPRTTQPPAAPPRGPDVPPAAVAELRET 956
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 31.4 bits (71), Expect = 1.8
Identities = 15/67 (22%), Positives = 24/67 (35%), Gaps = 4/67 (5%)
Query: 548 IPPASKVTTPPPRPPSHTKPNLATLQRKSTSDKFDSVSTHDDN--IPPLPPKRSRKSGLD 605
IP A PPP P ++L ++S + + N +P P +
Sbjct: 182 IPGAI--QPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPAS 239
Query: 606 KNLPPVP 612
PP+P
Sbjct: 240 IPAPPIP 246
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 30.6 bits (69), Expect = 2.2
Identities = 12/43 (27%), Positives = 13/43 (30%)
Query: 521 PQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPS 563
P P PP A P PPA + P PP
Sbjct: 41 PAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPP 83
Score = 30.2 bits (68), Expect = 3.6
Identities = 22/88 (25%), Positives = 26/88 (29%), Gaps = 24/88 (27%)
Query: 525 PTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSDKFDSV 584
P PPP + A PP + P P P+ P
Sbjct: 41 PAPPPPPPSTAAAAPA-PAAPPPPPPPAAPPAPQPDDPNAAPPPPPA------------- 86
Query: 585 STHDDNIPPLPPKRSRKSGLDKNLPPVP 612
D N PP PP +D N PP P
Sbjct: 87 ---DPNAPPPPP-------VDPNAPPPP 104
Score = 29.5 bits (66), Expect = 5.7
Identities = 10/42 (23%), Positives = 10/42 (23%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRP 561
PP T A P P PPP P
Sbjct: 44 PPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPP 85
Score = 28.7 bits (64), Expect = 9.6
Identities = 11/48 (22%), Positives = 12/48 (25%)
Query: 515 YEDVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPP 562
D PP P+ P P P PP PP
Sbjct: 38 NADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPP 85
>gnl|CDD|220880 pfam10812, DUF2561, Protein of unknown function (DUF2561). This
family of proteins with unknown function appears to be
restricted to Mycobacterium spp.
Length = 207
Score = 30.2 bits (68), Expect = 2.6
Identities = 9/41 (21%), Positives = 11/41 (26%)
Query: 527 PPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKP 567
PPP +P K + P PS T
Sbjct: 95 PPPRSQGLRRRPGQPVRAGYAAARTEKESVPATHAPSRTDA 135
>gnl|CDD|216325 pfam01140, Gag_MA, Matrix protein (MA), p15. The matrix protein,
p15, is encoded by the gag gene. MA is involved in
pathogenicity.
Length = 129
Score = 29.4 bits (66), Expect = 2.8
Identities = 13/42 (30%), Positives = 17/42 (40%)
Query: 522 QDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPS 563
+D+ PP P V P + P S + PPRP S
Sbjct: 86 EDLAFEPPPWVKPFVDPPKVLLPSSTPKPVSPSPSAPPRPSS 127
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 29.4 bits (66), Expect = 3.0
Identities = 15/51 (29%), Positives = 17/51 (33%), Gaps = 3/51 (5%)
Query: 520 PPQDIPTPPPEVATP-PVKPVALSNVKVVIPPASKVTTPPPR--PPSHTKP 567
P P PP +P P P +PPA P P PP P
Sbjct: 85 PAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPP 135
>gnl|CDD|234063 TIGR02928, TIGR02928, orc1/cdc6 family replication initiation
protein. Members of this protein family are found
exclusively in the archaea. This set of DNA binding
proteins shows homology to the origin recognition
complex subunit 1/cell division control protein 6 family
in eukaryotes. Several members may be found in genome
and interact with each other [DNA metabolism, DNA
replication, recombination, and repair].
Length = 365
Score = 30.3 bits (69), Expect = 3.2
Identities = 14/61 (22%), Positives = 24/61 (39%), Gaps = 14/61 (22%)
Query: 411 LFDTKEKIDER---FEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDELDDLNNNVNE 467
L + E++ E+ L L E+ D L +I LDE+D L + ++
Sbjct: 99 LRGSGEEVPTTGLSTSEVFRR---LYKELNERGDSL--------IIVLDEIDYLVGDDDD 147
Query: 468 S 468
Sbjct: 148 L 148
>gnl|CDD|238439 cd00862, ProRS_anticodon_zinc, ProRS Prolyl-anticodon binding
domain, long version found predominantly in eukaryotes
and archaea. ProRS belongs to class II aminoacyl-tRNA
synthetases (aaRS). This alignment contains the
anticodon binding domain, which is responsible for
specificity in tRNA-binding, so that the activated amino
acid is transferred to a ribose 3' OH group of the
appropriate tRNA only, and an additional C-terminal
zinc-binding domain specific to this subfamily of aaRSs.
Length = 202
Score = 30.0 bits (68), Expect = 3.3
Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 3/44 (6%)
Query: 401 DMETNQVPDVLFDTKEKIDERFEELLNEVSNLKDTLEEKNDFLL 444
D+E N V V DT EK EL+ +V + L+E + L
Sbjct: 78 DLEKNTVVIVRRDTGEKKTVPLAELVEKV---PELLDEIQEDLY 118
>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein. This is is a family of
Papillomavirus proteins, E4, coded for by ORF4. A splice
variant, E1--E4, exists but neither the function of E4
or E1--E4 is known.
Length = 95
Score = 28.4 bits (64), Expect = 3.3
Identities = 16/50 (32%), Positives = 21/50 (42%), Gaps = 8/50 (16%)
Query: 555 TTPPPRPPSHTKPNLATLQRKST----SDKFDSVSTHDDNIPPLPPKRSR 600
TTPP PP P +++ SD DS ST + PP P +
Sbjct: 20 TTPPRPPP---PPPPWAPKKRCRRRLESD-SDSSSTTSPSSPPTPQPTTE 65
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 30.5 bits (69), Expect = 3.5
Identities = 8/48 (16%), Positives = 13/48 (27%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKP 567
P P P +P A + + A+ P P +
Sbjct: 366 PAAAAEAAAPAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASA 413
Score = 30.1 bits (68), Expect = 4.8
Identities = 10/52 (19%), Positives = 13/52 (25%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLAT 571
P P A P A + PPA+ P P +
Sbjct: 388 APAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPA 439
Score = 29.7 bits (67), Expect = 6.0
Identities = 11/41 (26%), Positives = 15/41 (36%), Gaps = 1/41 (2%)
Query: 528 PPEVATPPVKPVALSNV-KVVIPPASKVTTPPPRPPSHTKP 567
PE A P PVA + + + P PP+ P
Sbjct: 383 RPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPP 423
>gnl|CDD|216597 pfam01601, Corona_S2, Coronavirus S2 glycoprotein. The coronavirus
spike glycoprotein forms the characteristic 'corona'
after which the group is named. The Spike glycoprotein
is translated as a large polypeptide that is
subsequently cleaved to S1 pfam01600 and S2.
Length = 609
Score = 30.3 bits (69), Expect = 3.5
Identities = 17/87 (19%), Positives = 38/87 (43%), Gaps = 16/87 (18%)
Query: 393 VTFSQHDTDMETNQVPDVLFDTKEKIDERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAV 452
V + + +PD D +++D+ F+ L + V +L ++ N +
Sbjct: 462 VNYVNITNTVLPPIIPD-YIDFNKELDKWFKNLNSTVPDLD-----------LDKFNATI 509
Query: 453 ITL----DELDDLNNNVNESSVTIEEV 475
+ L D L ++ N+N+S + +E +
Sbjct: 510 LNLTGEIDRLQEVIKNLNDSLIDLEWL 536
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 30.4 bits (68), Expect = 4.2
Identities = 12/62 (19%), Positives = 18/62 (29%), Gaps = 4/62 (6%)
Query: 506 IDRNSGGECYEDVLPPQDIPTPPPEVATPPVKPVALSNVKVVIP----PASKVTTPPPRP 561
R + +P + T P V P +S P V PP+P
Sbjct: 381 RQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQP 440
Query: 562 PS 563
+
Sbjct: 441 TN 442
>gnl|CDD|223030 PHA03271, PHA03271, envelope glycoprotein C; Provisional.
Length = 490
Score = 30.0 bits (67), Expect = 4.8
Identities = 19/56 (33%), Positives = 26/56 (46%), Gaps = 3/56 (5%)
Query: 528 PPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHT---KPNLATLQRKSTSDK 580
P++ATPPV S++ P ++V PP S T K + T KS S K
Sbjct: 31 NPDLATPPVLAFNPSSISADGGPLAEVPASPPTELSTTVATKTAVPTTGEKSESHK 86
>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 504
Score = 29.8 bits (67), Expect = 4.9
Identities = 16/78 (20%), Positives = 23/78 (29%), Gaps = 1/78 (1%)
Query: 495 SLQLAMKNQYFIDRNSGGECYEDVLPPQDIPTPPPEVATPPVKPVA-LSNVKVVIPPASK 553
SL+LA+ + + E V PP P + K V L + A+
Sbjct: 330 SLELALLHALLALGGAPSEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAG 389
Query: 554 VTTPPPRPPSHTKPNLAT 571
P P P
Sbjct: 390 APLPDFDPRPRGPPAPEP 407
>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
Length = 540
Score = 29.9 bits (67), Expect = 5.1
Identities = 17/53 (32%), Positives = 22/53 (41%), Gaps = 3/53 (5%)
Query: 518 VLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLA 570
V+ P+ +PTPP TPP PA + PPP P H +A
Sbjct: 159 VVGPRPVPTPPG--GTPPPDDDEGDEAGAPATPAPPLH-PPPAPHPHPIAEVA 208
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 29.9 bits (66), Expect = 5.1
Identities = 14/38 (36%), Positives = 15/38 (39%), Gaps = 1/38 (2%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTP 557
PP PP + A PP K A K PPA P
Sbjct: 263 PPAKAAAPPAKAAAPPAKAAA-PPAKAAAPPAKAAAAP 299
Score = 29.1 bits (64), Expect = 8.9
Identities = 17/47 (36%), Positives = 20/47 (42%), Gaps = 5/47 (10%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPR---PPS 563
P PP + A PP K A K PPA K PP + PP+
Sbjct: 249 APAKAAAPPAKAAAPPAKAAA-PPAKAAAPPA-KAAAPPAKAAAPPA 293
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 29.9 bits (67), Expect = 5.4
Identities = 12/52 (23%), Positives = 13/52 (25%)
Query: 520 PPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLAT 571
P T PP P + P A PP P AT
Sbjct: 222 PTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPANAT 273
Score = 29.5 bits (66), Expect = 6.7
Identities = 16/79 (20%), Positives = 24/79 (30%), Gaps = 17/79 (21%)
Query: 517 DVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKS 576
DV P P P P P + P++ + P P+ +A Q +
Sbjct: 201 DVFVPA-TPRPTPRTTASP---------ETTPTPSTTTSPPSTTIPA-PSTTIAAPQAGT 249
Query: 577 TSDKFDSVSTHDDNIPPLP 595
T + PP P
Sbjct: 250 TPE------AEGTPAPPTP 262
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 29.4 bits (66), Expect = 5.7
Identities = 13/47 (27%), Positives = 16/47 (34%)
Query: 527 PPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQ 573
PPP A PP P A + T P P K + L+
Sbjct: 155 PPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPPLK 201
>gnl|CDD|179379 PRK02155, ppnK, NAD(+)/NADH kinase family protein; Provisional.
Length = 291
Score = 29.5 bits (67), Expect = 6.4
Identities = 9/21 (42%), Positives = 13/21 (61%)
Query: 535 PVKPVALSNVKVVIPPASKVT 555
P+ P LSN +V+P S+V
Sbjct: 210 PIAPHTLSNRPIVLPDDSEVA 230
>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
Length = 1463
Score = 29.8 bits (66), Expect = 6.9
Identities = 20/67 (29%), Positives = 28/67 (41%), Gaps = 15/67 (22%)
Query: 558 PPRPPSHTKPNLATLQRKSTSDKFDSVSTHD------------DNIPPLPPKRSRKSGLD 605
P RPP L+RKS+ D D + + D++PP P+ R + L
Sbjct: 783 PDRPPDDDFETRKGLKRKSSEDHADPIPEGNATKKTCGLQGLPDSLPPAVPETDRDNPL- 841
Query: 606 KNLPPVP 612
LPP P
Sbjct: 842 --LPPCP 846
>gnl|CDD|173403 PTZ00111, PTZ00111, DNA replication licensing factor MCM4;
Provisional.
Length = 915
Score = 29.6 bits (66), Expect = 7.2
Identities = 24/108 (22%), Positives = 42/108 (38%), Gaps = 18/108 (16%)
Query: 401 DMETNQVPDVLFDTKEKIDERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDELDD 460
+ ++P +L + E++ ERF L + D LE K++ + S L L
Sbjct: 71 QRDVARLPFLLDNRLEELSERFTNFLKNFTEFSDVLEFKDE----QQSKPEYTELYYLWK 126
Query: 461 LNNNVNESSVTIEEVHEFPKEHLGQRTEGYGSYSSLQLAMKNQYFIDR 508
L N F KE+L + GY ++ + + Y D+
Sbjct: 127 LMN--------------FIKENLRDHSTGYSRILPFEVDLMHVYSFDK 160
>gnl|CDD|143642 cd07882, RHD-n_TonEBP, N-terminal sub-domain of the Rel homology
domain (RHD) of tonicity-responsive enhancer binding
protein (TonEBP). Proteins containing the Rel homology
domain (RHD) are metazoan transcription factors. The RHD
is composed of two structural sub-domains; this model
characterizes the N-terminal RHD sub-domain of the
tonicity-responsive enhancer binding protein (TonEBP),
also called NFAT5. Mammalian TonEBP regulates the
expression of genes in response to tonicity. It plays a
pivotal role in urinary concentrating mechanisms in
kidney medulla, by triggering the accumulation of
osmolytes that enable renal medullary cells to tolerate
high levels of urea and salt.
Length = 161
Score = 28.6 bits (64), Expect = 7.2
Identities = 29/121 (23%), Positives = 50/121 (41%), Gaps = 18/121 (14%)
Query: 41 IEILEQPKSKVRFRYECEGRSAGSIMGANSTLECKSYPKIMIRNYVGDAHLVVSCVSKDS 100
++IL QP+++ R RY EG S GS+ + + +P + + Y L V +
Sbjct: 3 LKILVQPETQHRARYLTEG-SRGSVKDRSQ----QGFPTVKLEGYNKPVVLQVFVGTDSG 57
Query: 101 PYRSHPHKLVSKDNCTNGIFC-------ATLLE------GDMTYSFTNLGIQCMKKKDIQ 147
+ H K N C T++E +MT S +GI ++ D++
Sbjct: 58 RVKPHGFYQACKVTGRNTTPCEEVDVEGTTVIEVPLDPTNNMTISVDCVGILKLRNADVE 117
Query: 148 A 148
A
Sbjct: 118 A 118
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 28.9 bits (65), Expect = 7.2
Identities = 17/66 (25%), Positives = 21/66 (31%), Gaps = 9/66 (13%)
Query: 520 PPQDIPTPPPEVATPPV-----KPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQR 574
+ P PE P KPV KV P +V PRP S +
Sbjct: 86 KEAPVVIPKPEPKPKPKPKPKPKPVK----KVEEQPKREVKPVEPRPASPFENTAPARPT 141
Query: 575 KSTSDK 580
ST+
Sbjct: 142 SSTATA 147
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 29.6 bits (66), Expect = 7.3
Identities = 22/98 (22%), Positives = 30/98 (30%), Gaps = 5/98 (5%)
Query: 520 PPQDIPTP--PPEVATPPVKPVALSNVKVVIPPASKVTTPPPR--PPSHTKPNLA-TLQR 574
PP P P AT +P A + + P A+ PP P P A R
Sbjct: 706 PPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR 765
Query: 575 KSTSDKFDSVSTHDDNIPPLPPKRSRKSGLDKNLPPVP 612
+ PP P +R R + + P
Sbjct: 766 PPAAAPGAPTPQPPPQAPPAPQQRPRGAPTPQPPPQAG 803
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 28.4 bits (63), Expect = 7.6
Identities = 11/47 (23%), Positives = 16/47 (34%)
Query: 521 PQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKP 567
P + P VA P K + + + P V P P+ K
Sbjct: 79 PVPVAVVAPVVAPKPKKSQPVMSQEKTASPQKSVPAPSPKEKKKKKV 125
>gnl|CDD|236507 PRK09424, pntA, NAD(P) transhydrogenase subunit alpha; Provisional.
Length = 509
Score = 29.4 bits (67), Expect = 7.7
Identities = 8/48 (16%), Positives = 14/48 (29%), Gaps = 11/48 (22%)
Query: 525 PTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATL 572
P PP +V+ P A+ + P+ A +
Sbjct: 369 PPPPIQVSAAPAA-----------AAAAPAAKEEEKKPASPWRKYALM 405
>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
Length = 590
Score = 29.3 bits (66), Expect = 7.7
Identities = 21/92 (22%), Positives = 36/92 (39%), Gaps = 4/92 (4%)
Query: 524 IPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTS--DKF 581
P A+ + ++ + + P+ T PP + P L LQR + + D
Sbjct: 33 RPRRRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAA-TTQTPQLNPLQRAAAAALDAV 91
Query: 582 DS-VSTHDDNIPPLPPKRSRKSGLDKNLPPVP 612
+S + +H + PLP + N PVP
Sbjct: 92 ESALVSHLERQHPLPKTADPAVQIAGNFAPVP 123
>gnl|CDD|183955 PRK13292, PRK13292, trifunctional NADH dehydrogenase I subunit
B/C/D; Provisional.
Length = 788
Score = 29.4 bits (66), Expect = 7.7
Identities = 12/38 (31%), Positives = 16/38 (42%), Gaps = 5/38 (13%)
Query: 393 VTFSQHDTDMET-----NQVPDVLFDTKEKIDERFEEL 425
V + TDM T VP+VL K++ F L
Sbjct: 236 VRLDEGATDMLTYRCPPELVPEVLRHLKKRASSPFRRL 273
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 29.6 bits (67), Expect = 7.8
Identities = 16/62 (25%), Positives = 24/62 (38%), Gaps = 8/62 (12%)
Query: 416 EKIDERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDELDDLNNNVNESSVTIEEV 475
E+++E EEL + L DFL E+ +E D+L E T+
Sbjct: 387 EELEEEIEELRERFGDAPVDLGNAEDFL--EE------LREERDELREREAELEATLRTA 438
Query: 476 HE 477
E
Sbjct: 439 RE 440
>gnl|CDD|241369 cd13215, PH-GRAM1_AGT26, Autophagy-related protein 26/Sterol
3-beta-glucosyltransferase Pleckstrin homology (PH)
domain, repeat 1. ATG26 (also called
UGT51/UDP-glycosyltransferase 51), a member of the
glycosyltransferase 28 family, resulting in the
biosynthesis of sterol glucoside. ATG26 in decane
metabolism and autophagy. There are 32 known
autophagy-related (ATG) proteins, 17 are components of
the core autophagic machinery essential for all
autophagy-related pathways and 15 are the additional
components required only for certain pathways or
species. The core autophagic machinery includes 1) the
ATG9 cycling system (ATG1, ATG2, ATG9, ATG13, ATG18, and
ATG27), 2) the phosphatidylinositol 3-kinase complex
(ATG6/VPS30, ATG14, VPS15, and ATG34), and 3) the
ubiquitin-like protein system (ATG3, ATG4, ATG5, ATG7,
ATG8, ATG10, ATG12, and ATG16). Less is known about how
the core machinery is adapted or modulated with
additional components to accommodate the nonselective
sequestration of bulk cytosol (autophagosome formation)
or selective sequestration of specific cargos (Cvt
vesicle, pexophagosome, or bacteria-containing
autophagosome formation). The pexophagosome-specific
additions include the ATG30-ATG11-ATG17
receptor-adaptors complex, the coiled-coil protein
ATG25, and the sterol glucosyltransferase ATG26. ATG26
is necessary for the degradation of medium peroxisomes.
It contains 2 GRAM domains and a single PH domain. PH
domains are only found in eukaryotes. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. PH domains also have
diverse functions. They are often involved in targeting
proteins to the plasma membrane, but few display strong
specificity in lipid binding. Any specificity is usually
determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 130
Score = 27.9 bits (63), Expect = 8.0
Identities = 10/16 (62%), Positives = 12/16 (75%)
Query: 189 LCFQAYLPKDKDNVIK 204
LCF AYLPK + V+K
Sbjct: 16 LCFFAYLPKKEGEVVK 31
>gnl|CDD|184920 PRK14956, PRK14956, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 484
Score = 29.1 bits (65), Expect = 8.2
Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 1/69 (1%)
Query: 542 SNVKVVIPPASKVTTPPPRPPSHTKPNLATLQRKSTSDKFDSVSTHDDNIPPLPPKRSRK 601
N+ + P K++TPPP +K T + + +F+S + D N+ P +
Sbjct: 393 KNIPEDVEPVKKISTPPPLQQEASKKKDPTTSDQKLNSQFES-NQQDSNLDNNPLPSKSE 451
Query: 602 SGLDKNLPP 610
S +
Sbjct: 452 SQSEPPSSK 460
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 29.3 bits (66), Expect = 8.2
Identities = 15/54 (27%), Positives = 30/54 (55%), Gaps = 3/54 (5%)
Query: 415 KEKIDERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDE-LDDLNNNVNE 467
+++E+ EEL E+ +L+ LEE +E+ + + L+E L+ L + V +
Sbjct: 339 LAELEEKLEELKEELESLEAELEELEA--ELEELESRLEELEEQLETLRSKVAQ 390
>gnl|CDD|220554 pfam10076, DUF2313, Uncharacterized protein conserved in bacteria
(DUF2313). Members of this family of proteins comprise
various hypothetical and putative bacteriophage tail
proteins.
Length = 161
Score = 28.4 bits (64), Expect = 8.3
Identities = 16/59 (27%), Positives = 27/59 (45%)
Query: 420 ERFEELLNEVSNLKDTLEEKNDFLLIEDSNNAVITLDELDDLNNNVNESSVTIEEVHEF 478
+E +L NL DTLEE+ +L + + I++ + L + +TI E F
Sbjct: 52 PDWERVLGLPDNLTDTLEERRARVLAKLNETGGISIAYFERLAAALGYGEITITENRPF 110
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 28.8 bits (65), Expect = 8.8
Identities = 12/55 (21%), Positives = 16/55 (29%)
Query: 517 DVLPPQDIPTPPPEVATPPVKPVALSNVKVVIPPASKVTTPPPRPPSHTKPNLAT 571
+ + P VAT P V VV P + V T P+
Sbjct: 180 APVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGA 234
>gnl|CDD|220716 pfam10359, Fmp27_WPPW, RNA pol II promoter Fmp27 protein domain.
Fmp27_WPPW is a conserved domain of a family of proteins
involved in RNA polymerase II transcription initiation.
It contains characteristic HQR and WPPW sequence motifs.
and is towards the C-terminal in members which contain
Fmp27_SW pfam10305.
Length = 470
Score = 29.2 bits (66), Expect = 9.5
Identities = 16/64 (25%), Positives = 27/64 (42%), Gaps = 1/64 (1%)
Query: 383 KSVNINMKPHVTFSQHDTDMETNQVPDVLFDTKEKIDERFEELLNEVSNLKD-TLEEKND 441
K + K + S + + + + E++ +RFEELL EV D ++E
Sbjct: 312 KERLKSDKTSESESGSENISQEKDPSEKEEKSDEELLDRFEELLREVKKSDDFDVQESYL 371
Query: 442 FLLI 445
LI
Sbjct: 372 VRLI 375
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.132 0.384
Gapped
Lambda K H
0.267 0.0705 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 34,681,660
Number of extensions: 3379736
Number of successful extensions: 3444
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3250
Number of HSP's successfully gapped: 155
Length of query: 695
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 591
Effective length of database: 6,324,786
Effective search space: 3737948526
Effective search space used: 3737948526
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 62 (27.7 bits)