RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy1763
(365 letters)
>gnl|CDD|212559 cd11661, SANT_MTA3_like, Myb-Like Dna-Binding Domain of MTA3 and
related proteins. Members in this SANT/myb family
include domains found in mouse metastasis-associated
protein 3 (MTA3) proteins and arginine-glutamic
dipeptide (RERE) repeats proteins. SANT (SWI3, ADA2,
N-CoR and TFIIIB) DNA-binding domains are a diverse set
of proteins that share a common 3 alpha-helix bundle.
MTA3 has been shown to interact with nucleosome
remodeling and deacetylase (NuRD) proteins CHD4 and
HDAC1, and the core cohesin complex protein RAD21 in the
ovary, and regulate G2/M progression in proliferating
granulosa cells. RERE belongs to the atrophin family and
has been identified as a nuclear receptor corepressor;
altered expression levels of RERE are associated with
cancer in humans while mutations of Rere in mice cause
failure in closing the anterior neural tube and fusion
of the telencephalic and optic vesicles during
embryogenesis.
Length = 46
Score = 90.0 bits (224), Expect = 5e-23
Identities = 26/45 (57%), Positives = 35/45 (77%)
Query: 211 WSEEECKNFESGLRMYGKNFYQIHQNKVKTRNVGEIVQFYYLWKK 255
WSE E K FE GLR YGK+F+ I Q+ + ++VGE+V+FYY+WKK
Sbjct: 2 WSESEAKLFEEGLRKYGKDFHDIRQDFLPWKSVGELVEFYYMWKK 46
>gnl|CDD|216509 pfam01448, ELM2, ELM2 domain. The ELM2 (Egl-27 and MTA1 homology
2) domain is a small domain of unknown function. It is
found in the MTA1 protein that is part of the NuRD
complex. The domain is usually found to the N terminus
of a myb-like DNA binding domain pfam00249. ELM2 is also
found associated with an ARID DNA binding domain
pfam01388 in a member from Arabidopsis thaliana. This
suggests that ELM2 may also be involved in DNA binding,
or perhaps is a protein-protein interaction domain.
Length = 55
Score = 43.8 bits (104), Expect = 3e-06
Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 2/54 (3%)
Query: 105 IMVGSEYQAWIPEGMCKY-GDILPYENDDKLLWDPNHNLVTVDLVETYLKKSHE 157
I VG +YQA IPE + + D E++D+L+W P++ L + +E YL +
Sbjct: 1 IRVGPDYQAEIPELLPESEKDQDKEEDEDQLVWSPDNEL-SEQKLEEYLLVAKS 53
>gnl|CDD|238096 cd00167, SANT, 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains.
Tandem copies of the domain bind telomeric DNA tandem
repeatsas part of the capping complex. Binding is
sequence dependent for repeats which contain the G/C
rich motif [C2-3 A (CA)1-6]. The domain is also found in
regulatory transcriptional repressor complexes where it
also binds DNA.
Length = 45
Score = 36.0 bits (84), Expect = 0.001
Identities = 9/44 (20%), Positives = 20/44 (45%)
Query: 211 WSEEECKNFESGLRMYGKNFYQIHQNKVKTRNVGEIVQFYYLWK 254
W+EEE + ++ YGKN ++ ++ R + + +
Sbjct: 2 WTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWRNLL 45
>gnl|CDD|197842 smart00717, SANT, SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding
domains.
Length = 49
Score = 36.4 bits (85), Expect = 0.001
Identities = 10/46 (21%), Positives = 21/46 (45%)
Query: 211 WSEEECKNFESGLRMYGKNFYQIHQNKVKTRNVGEIVQFYYLWKKT 256
W+EEE + ++ YGKN ++ ++ R + + + K
Sbjct: 4 WTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWRNLLKP 49
>gnl|CDD|215818 pfam00249, Myb_DNA-binding, Myb-like DNA-binding domain. This
family contains the DNA binding domains from Myb
proteins, as well as the SANT domain family.
Length = 47
Score = 34.8 bits (81), Expect = 0.004
Identities = 5/44 (11%), Positives = 15/44 (34%)
Query: 211 WSEEECKNFESGLRMYGKNFYQIHQNKVKTRNVGEIVQFYYLWK 254
W+ EE + ++ +G + + R + + +
Sbjct: 4 WTPEEDELLIEAVKKHGNGNWSKIAKHLPGRTDNQCKNRWNNYL 47
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 34.3 bits (79), Expect = 0.072
Identities = 19/80 (23%), Positives = 36/80 (45%), Gaps = 6/80 (7%)
Query: 24 GNPLAPPVSDPLMLDEEIEDDDEEEE--EEEKQPVSQLAKLYEEGCSTSSKETALSKALE 81
P+A SD L++E + EE K+ ++ KL E+ S+S T + E
Sbjct: 104 SVPIASGESDLENLNDEDTSSESSYGFKEESKKGSAETLKLEEKDSSSSEGSTVDLEPPE 163
Query: 82 EDDDEYNFFDEHPDDDDDDP 101
E+++E +++ +P
Sbjct: 164 EEEEEI----AEEEEEVKEP 179
>gnl|CDD|234681 PRK00182, tatB, sec-independent translocase; Provisional.
Length = 160
Score = 32.1 bits (73), Expect = 0.25
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 13/75 (17%)
Query: 39 EEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDDD 98
++++ D EE +E ++P++Q+A +TA++KAL + DD +F DD
Sbjct: 49 QQLDGDFGEEFDEFRKPLNQIASY-----RRMGPKTAITKALFDGDD--SFL------DD 95
Query: 99 DDPKKTIMVGSEYQA 113
DPKK + G+E +A
Sbjct: 96 FDPKKIMADGTEGEA 110
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 31.6 bits (72), Expect = 0.45
Identities = 21/65 (32%), Positives = 32/65 (49%), Gaps = 6/65 (9%)
Query: 36 MLDEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPD 95
+ EE ED DE+ EK+ A+ +E + + EE+++E FD+ D
Sbjct: 141 LFTEEEEDIDEKLSMLEKKLKELEAEDVDEE------DEKDEEEEEEEEEEDEDFDDDDD 194
Query: 96 DDDDD 100
DDDDD
Sbjct: 195 DDDDD 199
Score = 30.5 bits (69), Expect = 1.2
Identities = 19/67 (28%), Positives = 35/67 (52%), Gaps = 17/67 (25%)
Query: 37 LDEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYN---FFDEH 93
E+++++DE++EEEE++ + ++ ++DDD+YN +FD
Sbjct: 164 EAEDVDEEDEKDEEEEEEEEEEDEDFDDDD--------------DDDDDDYNAENYFDNG 209
Query: 94 PDDDDDD 100
DDD DD
Sbjct: 210 EDDDYDD 216
>gnl|CDD|215973 pfam00527, E7, E7 protein, Early protein.
Length = 92
Score = 29.9 bits (68), Expect = 0.47
Identities = 12/37 (32%), Positives = 19/37 (51%)
Query: 27 LAPPVSDPLMLDEEIEDDDEEEEEEEKQPVSQLAKLY 63
L P L E+++D +EEEE+EE + + Y
Sbjct: 13 LELPEPVDLHCYEQLDDSEEEEEDEEDRGQPPAQQPY 49
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 31.6 bits (71), Expect = 0.54
Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 10/80 (12%)
Query: 289 EQEGSINPPASPNVYLMSESSKRQRNSSLGNKT--TPPEETKTDKEGATSTPAAQQPANT 346
QE + +PN +++ + ++S K EE K+ AT P +Q T
Sbjct: 69 RQERTPKLEKAPN------TNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTT 122
Query: 347 SQSL-PQTQ-TTPTSSAEEK 364
++S P+T+ TTP S+ +
Sbjct: 123 TESTTPKTKVTTPPSTNTPQ 142
>gnl|CDD|235224 PRK04140, PRK04140, hypothetical protein; Provisional.
Length = 317
Score = 31.8 bits (73), Expect = 0.56
Identities = 18/73 (24%), Positives = 36/73 (49%), Gaps = 4/73 (5%)
Query: 38 DEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKAL-EEDDDEYNFFDEHPDD 96
D+E+ +D EE+ EK+ ++ L +L G + A A+ E++ DE +
Sbjct: 194 DDEVTPEDPEEDPFEKEILNVLTRL---GFDVLPTKKAPFDAVSEDEKDENILLTGISEF 250
Query: 97 DDDDPKKTIMVGS 109
+++ K+ +V S
Sbjct: 251 NEETEKRARIVSS 263
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 31.9 bits (72), Expect = 0.68
Identities = 16/43 (37%), Positives = 26/43 (60%)
Query: 38 DEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKAL 80
+EE E+++EEEEEEE++ + + EE S ET +A+
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQAI 907
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 31.1 bits (71), Expect = 0.94
Identities = 18/67 (26%), Positives = 31/67 (46%), Gaps = 23/67 (34%)
Query: 38 DEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDD 97
DEE +DDD EEEEE+ + + EE+D+E D+ D++
Sbjct: 334 DEEDDDDDLEEEEED-----------------------VDLSDEEEDEEDEDSDDEDDEE 370
Query: 98 DDDPKKT 104
+++ +K
Sbjct: 371 EEEEEKE 377
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 30.7 bits (70), Expect = 1.5
Identities = 17/66 (25%), Positives = 33/66 (50%), Gaps = 8/66 (12%)
Query: 38 DEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDD 97
D++I+DDD++E+++E + EE KE +K LE+ D+ +F + D +
Sbjct: 144 DDDIDDDDDDEDDDEDDDDDDVDDEDEEK-----KE---AKELEKLSDDDDFVWDEDDSE 195
Query: 98 DDDPKK 103
+
Sbjct: 196 ALRQAR 201
>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
Length = 486
Score = 30.4 bits (68), Expect = 1.5
Identities = 15/67 (22%), Positives = 27/67 (40%), Gaps = 3/67 (4%)
Query: 293 SINPPASPNVYLMSESSKRQRNSSLGNKTTP---PEETKTDKEGATSTPAAQQPANTSQS 349
S + S N +S+ + N T+P + T+ G ST A QP + +
Sbjct: 31 SSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPHSHETT 90
Query: 350 LPQTQTT 356
+ T++
Sbjct: 91 ITCTKSL 97
>gnl|CDD|226709 COG4258, COG4258, Predicted exporter [General function prediction
only].
Length = 788
Score = 30.2 bits (68), Expect = 1.8
Identities = 16/61 (26%), Positives = 22/61 (36%)
Query: 299 SPNVYLMSESSKRQRNSSLGNKTTPPEETKTDKEGATSTPAAQQPANTSQSLPQTQTTPT 358
S +L S S +RQR L T P T + T P + + +PT
Sbjct: 521 SNARFLPSNSLQRQREDLLPLPTALPRLTAALVNASGLTAVVFDPFAAPVEVDRWLASPT 580
Query: 359 S 359
S
Sbjct: 581 S 581
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 29.8 bits (68), Expect = 2.0
Identities = 17/66 (25%), Positives = 29/66 (43%), Gaps = 12/66 (18%)
Query: 37 LDEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDD 96
++E ++ EE E EK V + Y +E E+DDD + D+ +D
Sbjct: 332 IEENPREEAEEAEAPEK--VEFMWDDYH-------REQLEEVEEEDDDD---WDDDWDED 379
Query: 97 DDDDPK 102
DD+ +
Sbjct: 380 DDEGVE 385
>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
N-terminal domain. Autophagocytosis is a
starvation-induced process responsible for transport of
cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
ubiquitin like modifier that is topologically similar to
the canonical E2 enzyme. It catalyzes the conjugation of
Atg8 and phosphatidylethanolamine.
Length = 146
Score = 28.8 bits (65), Expect = 2.7
Identities = 17/65 (26%), Positives = 28/65 (43%), Gaps = 6/65 (9%)
Query: 37 LDEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDD 96
+E +ED+DE++ K + E + + ++DDD + D DD
Sbjct: 88 AEEIVEDEDEDDGWVTTHGNRDKQK------DDIADEEDIPEIGDDDDDVVDSSDADEDD 141
Query: 97 DDDDP 101
DDD P
Sbjct: 142 DDDIP 146
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 29.6 bits (66), Expect = 2.8
Identities = 20/57 (35%), Positives = 25/57 (43%), Gaps = 3/57 (5%)
Query: 306 SESSKRQRNSSLGNKTTPPEETKTDKEGATSTPAAQQPANTSQSLPQTQTTPTSSAE 362
S+K + S N TTPP T E TP A P++T PQ+ TT E
Sbjct: 45 PPSTKVEAPQSKPNATTPPS---TKVEAPQQTPNATTPSSTKVETPQSPTTKQVPTE 98
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 29.6 bits (66), Expect = 3.1
Identities = 18/61 (29%), Positives = 29/61 (47%)
Query: 38 DEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDD 97
DE+ +DDDEE++EEE++ ++ +E E + E D + F E D
Sbjct: 159 DEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTDCFKFIEAGAGD 218
Query: 98 D 98
D
Sbjct: 219 D 219
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 29.6 bits (67), Expect = 3.4
Identities = 17/60 (28%), Positives = 26/60 (43%), Gaps = 7/60 (11%)
Query: 43 DDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDDDDDPK 102
DD + EEE + +P A E +E +++DE + E +DDDDD
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEE-------SDEEDEESSKSEDDEDDDDDDD 154
>gnl|CDD|234407 TIGR03941, tRNA_deam_assoc, putative tRNA adenosine
deaminase-associated protein. This model describes a
protein family about 200 amino acids in length with only
five invariant residues, including an Arg, a Ser-Asp
pair, and two Gly residues. Members always are found
exclusively in Actinobacteria, and always adjacent to
homologs of TadA, a tRNA-specific adenosine deaminase
from Escherichia coli. Homology, phyletic pattern, and
gene neighborhood together suggest a housekeeping
function in tRNA metabolism [Unknown function, General].
Length = 154
Score = 28.3 bits (64), Expect = 3.7
Identities = 24/71 (33%), Positives = 32/71 (45%), Gaps = 12/71 (16%)
Query: 26 PLAPPVSDPLMLDEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDD 85
LA V D L D +DDDEE++ + P L L + G S E L L +D+D
Sbjct: 78 DLAADVLDELDADVPDDDDDEEDDADP-WPEGDLDILADLGLS----EEELG-VLCDDED 131
Query: 86 EYNFFDEHPDD 96
Y PD+
Sbjct: 132 LY------PDE 136
>gnl|CDD|178320 PLN02718, PLN02718, Probable galacturonosyltransferase.
Length = 603
Score = 29.1 bits (65), Expect = 3.8
Identities = 21/79 (26%), Positives = 30/79 (37%), Gaps = 7/79 (8%)
Query: 290 QEGSINPPASPNVYLMSESSK--RQRNSSLGNKTTPPEETK--TDKEGATSTPAAQQPAN 345
Q+G N S Y S+ + + GN P EE + K +S+ Q A
Sbjct: 78 QDGDFN---SVVSYNSSDKNNDSLESEVDGGNNHKPKEEQAQVSQKTTVSSSEEVQISAR 134
Query: 346 TSQSLPQTQTTPTSSAEEK 364
Q +TQ P + EK
Sbjct: 135 DIQLNHKTQFNPPTVKHEK 153
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 29.1 bits (65), Expect = 3.9
Identities = 21/104 (20%), Positives = 40/104 (38%), Gaps = 10/104 (9%)
Query: 17 NERSPSHGNPLAPP------VSDPLMLDEEIE----DDDEEEEEEEKQPVSQLAKLYEEG 66
ER + P P M +E E ++EEEEEEE++ Q ++ E
Sbjct: 409 RERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGE 468
Query: 67 CSTSSKETALSKALEEDDDEYNFFDEHPDDDDDDPKKTIMVGSE 110
+E EE+ + + D ++ ++D ++ +
Sbjct: 469 DEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAG 512
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 28.5 bits (64), Expect = 4.9
Identities = 10/23 (43%), Positives = 17/23 (73%)
Query: 79 ALEEDDDEYNFFDEHPDDDDDDP 101
A ++D DEY++ DE ++DD+ P
Sbjct: 190 APDDDLDEYDYGDEDEEEDDEPP 212
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 29.0 bits (65), Expect = 5.0
Identities = 15/78 (19%), Positives = 25/78 (32%), Gaps = 1/78 (1%)
Query: 286 FLDEQEGSINPPASPNVYLMSESSKRQRNSSLGNKTTPPEETKTDKEGATSTPAAQQPAN 345
FLDE E S P + RN +K T ++++ A +A+ +
Sbjct: 37 FLDELEQS-EKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRS 95
Query: 346 TSQSLPQTQTTPTSSAEE 363
+ T E
Sbjct: 96 SHPKSKAPSTESEEEEEP 113
>gnl|CDD|218003 pfam04281, Tom22, Mitochondrial import receptor subunit Tom22. The
mitochondrial protein translocase family, which is
responsible for movement of nuclear encoded pre-proteins
into mitochondria, is very complex with at least 19
components. These proteins include several chaperone
proteins, four proteins of the outer membrane
translocase (Tom) import receptor, five proteins of the
Tom channel complex, five proteins of the inner membrane
translocase (Tim) and three "motor" proteins. This
family represents the Tom22 proteins. The N terminal
region of Tom22 has been shown to have chaperone-like
activity, and the C terminal region faces the
intermembrane face.
Length = 136
Score = 27.6 bits (62), Expect = 5.2
Identities = 12/67 (17%), Positives = 25/67 (37%), Gaps = 19/67 (28%)
Query: 39 EEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDDD 98
E+ED+ +E+ + ++Q ++DD++ D DD
Sbjct: 5 TEVEDETFQEKPAAPKNLAQEES-------------------DDDDEDDTDTDSDISDDS 45
Query: 99 DDPKKTI 105
D +T+
Sbjct: 46 DFENETL 52
>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727). This
domain family is found in bacteria and eukaryotes, and
is approximately 100 amino acids in length.
Length = 100
Score = 27.2 bits (61), Expect = 5.4
Identities = 16/38 (42%), Positives = 22/38 (57%), Gaps = 3/38 (7%)
Query: 29 PPVSDPLMLDEEIEDDDEEEEEEEKQPVSQLAKLYEEG 66
P + +L+ E EDDD+E+EEEE + LA Y E
Sbjct: 19 PEPEEDEILELEEEDDDDEDEEEEYEL---LASFYHED 53
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 27.7 bits (62), Expect = 5.5
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 38 DEEIEDDDEEEEEEEKQPVSQ 58
+++ EDDDE+E EEE+ PV +
Sbjct: 124 EDDEEDDDEDESEEEESPVKK 144
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 28.7 bits (64), Expect = 5.7
Identities = 10/45 (22%), Positives = 18/45 (40%)
Query: 321 TTPPEETKTDKEGATSTPAAQQPANTSQSLPQTQTTPTSSAEEKT 365
TTP T+ ++ + A ++ NT + T S+ E
Sbjct: 34 TTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETA 78
>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein. This is a viral
attachment glycoprotein from region G of metaviruses. It
is high in serine and threonine suggesting it is highly
glycosylated.
Length = 183
Score = 28.1 bits (62), Expect = 6.0
Identities = 19/75 (25%), Positives = 32/75 (42%), Gaps = 10/75 (13%)
Query: 291 EGSINPPASPNVYLMSESSKR----QRNSSLGNKTTPPEETKTDKEGATSTPAAQQPANT 346
E S P A+ +L +E + R ++ TTPP ++T A
Sbjct: 66 ESSTLPAATSESHLETEPTSTPDTTNRQQTVDRHTTPPSSSRTQTTQAVHE------KKN 119
Query: 347 SQSLPQTQTTPTSSA 361
+++ +TQT PT+S
Sbjct: 120 TRTTSRTQTPPTTST 134
>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
Length = 234
Score = 28.3 bits (62), Expect = 6.4
Identities = 10/52 (19%), Positives = 21/52 (40%), Gaps = 1/52 (1%)
Query: 314 NSSLGNKTTPPEETKTDKEGATSTPAAQQPANTSQSLPQTQTTPTSSAEEKT 365
+ G T+ + +++T A + N + P T ++ +S KT
Sbjct: 113 ATEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAP-TLSSKGTSNATKT 163
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 27.1 bits (60), Expect = 6.6
Identities = 11/36 (30%), Positives = 20/36 (55%)
Query: 28 APPVSDPLMLDEEIEDDDEEEEEEEKQPVSQLAKLY 63
A + EE + +EE+EEEE++ ++ L L+
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEEALAGLGALF 106
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 28.3 bits (63), Expect = 7.1
Identities = 17/69 (24%), Positives = 26/69 (37%), Gaps = 1/69 (1%)
Query: 38 DEEIEDDDEEEEEEEKQPVSQLAKLYEEGCSTSSKETALSKALEEDDDEYNFFDEHPDDD 97
E + DDEEEE+ + + SQ + + E A S ED E DE +
Sbjct: 186 SAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTSES-ESDESDSEI 244
Query: 98 DDDPKKTIM 106
+ +
Sbjct: 245 SESRSVSDS 253
>gnl|CDD|187787 cd09656, Cmr3_III-B, CRISPR/Cas system-associated RAMP superfamily
protein Cmr3. CRISPR (Clustered Regularly Interspaced
Short Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
RAMP superfamily protein; This protein is a subunit of
Cmr complex.
Length = 318
Score = 28.2 bits (63), Expect = 7.3
Identities = 16/90 (17%), Positives = 29/90 (32%), Gaps = 11/90 (12%)
Query: 79 ALEEDDDEYNFFDEHPDDDDDDPK-----KTIMVGSEYQAWIPEGMCKYGDILPYENDDK 133
LE D+D + F+ + + K I + + + G + E
Sbjct: 200 KLEVDEDNEDTFNPEIEPEGKYVKFYLLTPAIFIEGGVLSDLK------GLLCEVEIFAV 253
Query: 134 LLWDPNHNLVTVDLVETYLKKSHEMFINGA 163
L+ + D VE K + E G+
Sbjct: 254 LIDKRKQIVSGWDYVEKKPKPTLEAVPPGS 283
>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
Length = 325
Score = 28.0 bits (63), Expect = 7.3
Identities = 9/28 (32%), Positives = 15/28 (53%)
Query: 33 DPLMLDEEIEDDDEEEEEEEKQPVSQLA 60
+ +++EE D+D EEE Q +Q
Sbjct: 24 EKALVEEEPSDNDLAEEELLSQGATQRV 51
>gnl|CDD|233619 TIGR01888, cas_cmr3, CRISPR type III-B/RAMP module-associated
protein Cmr3. CRISPR is a term for Clustered Regularly
Interspaced Short Palidromic Repeats. A number of
protein families appear only in association with these
repeats and are designated Cas (CRISPR associated)
proteins. This highly divergent family is found in at
least ten different archaeal and bacterial species as
part of the CRISPR RAMP modulue but is not a member of
the RAMP superfamily itself. A typical example is TM1793
from Thermotoga maritima [Mobile and extrachromosomal
element functions, Other].
Length = 337
Score = 28.2 bits (63), Expect = 8.0
Identities = 18/124 (14%), Positives = 33/124 (26%), Gaps = 19/124 (15%)
Query: 79 ALEEDDDEYNFFDEHPDDDDDDPK-----KTIMVGSEYQAWIPEGMCKYGDILPYENDDK 133
LE D+D + F+ + K I + + + +C L +
Sbjct: 215 KLEVDEDNEDTFNPEILPETKYVKFYLLTPAIFIEGGVLSDLKGLLC--EAKLVEVFNFG 272
Query: 134 LLWDPNHNLVTVDLVETYLKKSHEMFINGA-------GDCLPMGSHSRDDEQALYLLLQC 186
D VE K + E G+ + R + + +
Sbjct: 273 KRKRTVSG---WDYVEKKPKPTLEAVPPGSVYFFKAKEEGPLELLFPRLIKLGKFHKI-- 327
Query: 187 GYNT 190
GY
Sbjct: 328 GYGL 331
>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
Provisional.
Length = 481
Score = 28.2 bits (62), Expect = 8.3
Identities = 15/55 (27%), Positives = 26/55 (47%), Gaps = 6/55 (10%)
Query: 312 QRNSSLGNKTTPPEETKTDKEGATSTPA-AQQPA-----NTSQSLPQTQTTPTSS 360
+ N++ TT +ET T ++ A P A +PA NT+ + T T ++
Sbjct: 272 KENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTN 326
>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
Length = 592
Score = 28.1 bits (62), Expect = 8.5
Identities = 11/52 (21%), Positives = 17/52 (32%)
Query: 314 NSSLGNKTTPPEETKTDKEGATSTPAAQQPANTSQSLPQTQTTPTSSAEEKT 365
S+ N T T T + T +Q Q+ + P + EK
Sbjct: 63 KSAEANTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQVEKA 114
>gnl|CDD|224179 COG1259, COG1259, Uncharacterized conserved protein [Function
unknown].
Length = 151
Score = 27.3 bits (61), Expect = 8.7
Identities = 13/35 (37%), Positives = 18/35 (51%)
Query: 21 PSHGNPLAPPVSDPLMLDEEIEDDDEEEEEEEKQP 55
PS LA V P+ + EE+ D+ E E E+E
Sbjct: 107 PSDAIALALRVGAPIYVAEEVLDEAEIEIEDEDDE 141
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 28.0 bits (62), Expect = 9.1
Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 3/54 (5%)
Query: 23 HGNPLAPPVSDPLMLDEEIEDDDEEEEEEEKQPVSQLAKLYEEG-CSTSSKETA 75
N + SD EE E+++EEEEEEE+ V + K S+ E+
Sbjct: 215 PPNSSSSSGSDSE--SEEDEEEEEEEEEEEEIDVVTVEKRRSSSNRKASTSESI 266
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.308 0.128 0.372
Gapped
Lambda K H
0.267 0.0794 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 18,865,143
Number of extensions: 1818355
Number of successful extensions: 3585
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3105
Number of HSP's successfully gapped: 179
Length of query: 365
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 267
Effective length of database: 6,590,910
Effective search space: 1759772970
Effective search space used: 1759772970
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 60 (26.8 bits)