RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3485
(304 letters)
>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2. This
domain family is found in eukaryotes, and is typically
between 138 and 153 amino acids in length. The family is
found in association with pfam00493. Mini-chromosome
maintenance (MCM) proteins are essential for DNA
replication. These proteins use ATPase activity to
perform this function.
Length = 145
Score = 37.3 bits (87), Expect = 0.002
Identities = 22/83 (26%), Positives = 44/83 (53%), Gaps = 12/83 (14%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEIL--DN-EDDTKSQGSVET-----IDDDI---EMV 109
+E+ + +E++ + + E +E EDGE L DN E D ++ ++ +DD+ E+
Sbjct: 5 DEEEE-LEDDIDDLDEEAEEEEDGEDLFGDNMERDYRANPELDRYDEDGLDDESDYEELS 63
Query: 110 FAAEKKADPKRDMEKVRLRKQLM 132
AA + A+ + + LR++ M
Sbjct: 64 AAARRAAEARLNRRDRLLRRRRM 86
>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain.
Length = 26
Score = 34.3 bits (79), Expect = 0.003
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 200 SMAIHLRTHTKEKPFACEMCNYRAAS 225
++ H+RTHT EKP+ C +C +S
Sbjct: 1 NLRRHMRTHTGEKPYKCPVCGKSFSS 26
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 37.3 bits (87), Expect = 0.008
Identities = 20/69 (28%), Positives = 29/69 (42%), Gaps = 11/69 (15%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKS-QGSVETIDDDIEMVFAAEKKADPK 119
EE PP P P ED+ + E D E+ + ++E E+V A + A P
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPE--DEEEQDDAEDRALE------EIVLEAVRAALPP 311
Query: 120 RDMEKVRLR 128
D+ RL
Sbjct: 312 -DLLA-RLA 318
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or more
acidic residues. The function of members of this family
is unknown.
Length = 104
Score = 33.1 bits (76), Expect = 0.039
Identities = 11/47 (23%), Positives = 20/47 (42%), Gaps = 2/47 (4%)
Query: 61 EEDVDKVEEE--PPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
ED K +E+ + +D+ +D + D D ++ DDD
Sbjct: 50 AEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDDDDD 96
Score = 30.4 bits (69), Expect = 0.40
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 68 EEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
+ E + E++E ED +LD++DD + +DDD
Sbjct: 49 DAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDD 86
Score = 29.6 bits (67), Expect = 0.66
Identities = 9/41 (21%), Positives = 23/41 (56%)
Query: 68 EEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEM 108
E+ + + +E++ +D + D++DD + DDD+++
Sbjct: 50 AEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDL 90
Score = 27.7 bits (62), Expect = 3.4
Identities = 10/50 (20%), Positives = 19/50 (38%)
Query: 58 TGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIE 107
G E + + P + E+ +D + D +D + DDD+
Sbjct: 32 CGEEVPPEVAKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLP 81
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 34.1 bits (78), Expect = 0.096
Identities = 22/103 (21%), Positives = 38/103 (36%), Gaps = 10/103 (9%)
Query: 38 KKNHQNRNQK--------KMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILD- 88
K+ QN K + + + +E++ + E EE E ED ++
Sbjct: 25 KRIQQNNGDKEDSSTSTSSLSVSAVEKTSNAKEEIQVDFQHNSESAVEEVEAEDEIEVEQ 84
Query: 89 -NEDDTKSQGSVETIDDDIEMVFAAEKKADPKRDMEKVRLRKQ 130
D KS V+ +M + D K +E+ LRK+
Sbjct: 85 NQSDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLERENLRKR 127
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 33.8 bits (77), Expect = 0.12
Identities = 19/64 (29%), Positives = 33/64 (51%), Gaps = 6/64 (9%)
Query: 65 DKVEEEPPEPEPEEDE----VEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPKR 120
D V EE E E EEDE E G+++D + + + + ++T+ D I+ ++ K +
Sbjct: 137 DLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFID--LESQTKKPSVK 194
Query: 121 DMEK 124
D K
Sbjct: 195 DNGK 198
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 33.7 bits (77), Expect = 0.13
Identities = 20/85 (23%), Positives = 36/85 (42%), Gaps = 7/85 (8%)
Query: 49 MKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEM 108
M S+ ++ EE+ ++ EEE E + E+E + E + E + + + EM
Sbjct: 434 MASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADN-------GSEEEM 486
Query: 109 VFAAEKKADPKRDMEKVRLRKQLMA 133
++E D + E R MA
Sbjct: 487 EGSSEGDGDGEEPEEDAERRNSEMA 511
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 33.1 bits (76), Expect = 0.13
Identities = 16/48 (33%), Positives = 23/48 (47%), Gaps = 5/48 (10%)
Query: 49 MKSKTAKYWTGGEEDVDK-----VEEEPPEPEPEEDEVEDGEILDNED 91
K +A+ E+D V+ EPPE E EE E+ E+ + ED
Sbjct: 134 SKKGSAETLKLEEKDSSSSEGSTVDLEPPEEEEEEIAEEEEEVKEPED 181
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 33.6 bits (77), Expect = 0.13
Identities = 20/91 (21%), Positives = 37/91 (40%), Gaps = 1/91 (1%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPKR 120
EE+ + E E E + V D++DD + ++D E + K KR
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60
Query: 121 DMEKVR-LRKQLMAIEEKIKAARHPLFSPDD 150
+ +++ L+KQ +KI ++ D
Sbjct: 61 EKARLKELKKQKKQEIQKILEQQNAAIDADM 91
>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306). This
family of proteobacterial species proteins has no known
function.
Length = 115
Score = 31.1 bits (71), Expect = 0.23
Identities = 17/62 (27%), Positives = 23/62 (37%), Gaps = 6/62 (9%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETI--DDDIEMVFAAEKKADP 118
EE+ P E EE E+ED E+L+ D +T+ D A
Sbjct: 26 EEEAAAAAPAPAPEEEEEAELEDEELLEELDLP----DPDTLTPGSDFSAFLAKGVPEAL 81
Query: 119 KR 120
KR
Sbjct: 82 KR 83
>gnl|CDD|222450 pfam13909, zf-H2C2_5, C2H2-type zinc-finger domain.
Length = 24
Score = 28.7 bits (65), Expect = 0.24
Identities = 9/25 (36%), Positives = 13/25 (52%), Gaps = 1/25 (4%)
Query: 248 YHCAFCNFTSPTQWAINRHTKYTHM 272
Y C+ C ++S T + RH K H
Sbjct: 1 YKCSHCPYSS-TPKKLERHLKLHHG 24
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 32.7 bits (74), Expect = 0.27
Identities = 24/71 (33%), Positives = 37/71 (52%), Gaps = 8/71 (11%)
Query: 59 GGEEDVDKVE-EEPPEPEPEEDEVE-----DGEILDNEDDTKSQG--SVETIDDDIEMVF 110
GE + ++VE E E E EDE E +GE +++E + +++G VET D E
Sbjct: 707 KGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEH 766
Query: 111 AAEKKADPKRD 121
E +A+ K D
Sbjct: 767 EGETEAEGKED 777
Score = 28.8 bits (64), Expect = 4.0
Identities = 14/33 (42%), Positives = 22/33 (66%)
Query: 59 GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNED 91
GG+ + ++ EEE E E EE+E E+ E +NE+
Sbjct: 860 GGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 32.6 bits (75), Expect = 0.27
Identities = 15/80 (18%), Positives = 37/80 (46%), Gaps = 1/80 (1%)
Query: 60 GEEDVD-KVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADP 118
+ D++ ++++EP + D ED + + D+ + S D+D + + A
Sbjct: 102 DDGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATR 161
Query: 119 KRDMEKVRLRKQLMAIEEKI 138
+R +E+ R R++ ++
Sbjct: 162 ERSLERRRRRREWEEKRAEL 181
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 32.5 bits (74), Expect = 0.31
Identities = 15/47 (31%), Positives = 20/47 (42%)
Query: 68 EEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEK 114
E +PP P P + E E D D T ET E++F A +
Sbjct: 270 EPQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIPEELMFDAVE 316
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 31.7 bits (72), Expect = 0.36
Identities = 18/80 (22%), Positives = 31/80 (38%), Gaps = 4/80 (5%)
Query: 35 TKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVE-DGEILDNEDDT 93
T K +++KK K K + E D DK EE +P+ E+ + N
Sbjct: 58 TPRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSD 117
Query: 94 KSQGSVETIDD-DIEMVFAA 112
+ +DD D+++
Sbjct: 118 IDN--EDDMDDSDLKLATEN 135
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 31.6 bits (71), Expect = 0.43
Identities = 18/55 (32%), Positives = 23/55 (41%), Gaps = 5/55 (9%)
Query: 41 HQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKS 95
+ K K E++ D+ EEE E E EE EVE+ E N T S
Sbjct: 117 QLPKKAGNAGKKATK-----EDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNS 166
Score = 27.4 bits (60), Expect = 8.8
Identities = 11/44 (25%), Positives = 19/44 (43%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDD 104
EE+ D E E +E E+ E D +D +++ T+
Sbjct: 52 EENGDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAENTTLST 95
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 32.0 bits (73), Expect = 0.44
Identities = 15/72 (20%), Positives = 24/72 (33%), Gaps = 11/72 (15%)
Query: 59 GGEEDVDKVEEEPPEPEPEEDEVEDGE-----------ILDNEDDTKSQGSVETIDDDIE 107
+D +EE+P EP P+ ++ ++GE DD K E D
Sbjct: 317 PQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDDEKEPDPQEEADGQGS 376
Query: 108 MVFAAEKKADPK 119
A +
Sbjct: 377 STDPAGDIFRIR 388
>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
Length = 130
Score = 30.2 bits (68), Expect = 0.52
Identities = 15/51 (29%), Positives = 22/51 (43%), Gaps = 2/51 (3%)
Query: 68 EEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADP 118
E+EP EPE EE E+ +N D+ E D + + +K P
Sbjct: 10 EDEPSEPEIEE-GAEEDSESNNNDE-VKGSIGENEDKEEQEPVQRPRKTTP 58
>gnl|CDD|240256 PTZ00071, PTZ00071, 40S ribosomal protein S24; Provisional.
Length = 132
Score = 30.4 bits (69), Expect = 0.58
Identities = 13/47 (27%), Positives = 23/47 (48%)
Query: 9 NLQQIQKTNPKLKKVNYPIIVARKMWTKLKKNHQNRNQKKMKSKTAK 55
NL ++K PK + + +I +K K +K +NR +K + K
Sbjct: 80 NLAALKKFEPKYRLIRLGLIKKKKAGRKARKELKNRRKKVRGKEKTK 126
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 31.3 bits (71), Expect = 0.73
Identities = 21/75 (28%), Positives = 34/75 (45%), Gaps = 3/75 (4%)
Query: 31 RKMWTKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNE 90
++ WTK K+ Q+ + +A + D EEE E E EEDE E ++
Sbjct: 358 KQKWTKAKETEQDYILD---AFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSD 414
Query: 91 DDTKSQGSVETIDDD 105
D+ + VE+ +D
Sbjct: 415 DEEFEEDDVESKYED 429
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 30.5 bits (69), Expect = 0.74
Identities = 13/32 (40%), Positives = 19/32 (59%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDD 92
E D EE+ + E EE+E E+ E D++DD
Sbjct: 163 LEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194
Score = 30.1 bits (68), Expect = 1.1
Identities = 15/45 (33%), Positives = 24/45 (53%)
Query: 48 KMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDD 92
K+ K EDVD+ +E+ E E EE+E ++ D++DD
Sbjct: 152 KLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196
Score = 29.3 bits (66), Expect = 2.2
Identities = 17/102 (16%), Positives = 43/102 (42%), Gaps = 4/102 (3%)
Query: 11 QQIQKTNPKLKKVNY-PIIVARKMWTKLKKNHQNR---NQKKMKSKTAKYWTGGEEDVDK 66
Q+ +K P +K+ + + +++++ + N + K K K + E+ +K
Sbjct: 93 QKKRKIGPSIKEHPFDLELFPKELYSVMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEK 152
Query: 67 VEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEM 108
+ + + E E D E +E++ + + + DD +
Sbjct: 153 LSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194
>gnl|CDD|219236 pfam06949, DUF1292, Protein of unknown function (DUF1292). This
family consists of several hypothetical bacterial
proteins of around 90 residues in length. The function
of this family is unknown.
Length = 74
Score = 28.7 bits (65), Expect = 0.76
Identities = 14/42 (33%), Positives = 20/42 (47%), Gaps = 4/42 (9%)
Query: 75 EPEEDEVEDGE--ILDNEDDTKSQGSVETIDDD--IEMVFAA 112
E+E ED E I E+D + + I+DD +MV A
Sbjct: 26 PVGEEEDEDIEVYIFKIEEDEDGEEELLPIEDDEEWDMVEEA 67
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 30.9 bits (70), Expect = 0.80
Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 5/103 (4%)
Query: 49 MKSKTAKYWTGGEEDVDKVEEEPP---EPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
+K K K E+D+ + E+P E E E+++ +D D ++D G+ E +
Sbjct: 113 LKQKVKKNNKQFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAAS 172
Query: 106 IEMVFAAEKKAD--PKRDMEKVRLRKQLMAIEEKIKAARHPLF 146
E K+ D + + + + +K +
Sbjct: 173 SESGVDRVKEDDEEDEDADLSKKDVLEEPKMFKKPEEITWDDV 215
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 31.2 bits (71), Expect = 0.80
Identities = 27/123 (21%), Positives = 46/123 (37%), Gaps = 12/123 (9%)
Query: 8 PNLQQIQKTNPKLKKVNYPIIVARKMWTKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKV 67
P L+ KK + + K+ K ++ K+K K K + +D
Sbjct: 495 PWLKTTSSVGKSAKKQDSKKKSSSKLDKAANK--ISKAAVKVKKKKKKEKSIDLDDDLID 552
Query: 68 EEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETI-----DDDIEMVFAAEKKADPKRDM 122
EE+ + + +++E ED E L K + I DD+ F EKK + +
Sbjct: 553 EEDSIKLDVDDEEDEDDEELPFLFKQK-----DLIKEAFAGDDVVAEFEKEKKEVIEEED 607
Query: 123 EKV 125
K
Sbjct: 608 PKE 610
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 31.1 bits (70), Expect = 0.83
Identities = 23/102 (22%), Positives = 45/102 (44%), Gaps = 6/102 (5%)
Query: 48 KMKSKTAKYWTGGEEDVDKVE--EEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
K K K A E+D E ++ + +PEE++ ED E D++D+ + +
Sbjct: 263 KDKKKDAGDDAELEDDEPDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKL 322
Query: 106 IEMVFAAEKKADPKR----DMEKVRLRKQLMAIEEKIKAARH 143
E V PK + + +L++Q+ +E++ A +
Sbjct: 323 DEPVLEGVDLESPKELSSFEKRQAKLKQQIEQLEKENLAPKS 364
Score = 27.6 bits (61), Expect = 9.8
Identities = 23/98 (23%), Positives = 33/98 (33%), Gaps = 14/98 (14%)
Query: 59 GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADP 118
ED EEE E E EDE+ D E + + + E +
Sbjct: 109 MDSEDSADDEEEEEEDESLEDEMIDDEDEADLFN------ESESSLEDLSDDETEDDEEK 162
Query: 119 KRDMEKVRLRKQLMAIEEKIKAARHPLFSP---DDQFF 153
K + E+ K+ +A R F DD+FF
Sbjct: 163 KMEEEEAGEEKE-----SVEQATREKKFDKSGVDDKFF 195
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
Length = 193
Score = 30.5 bits (68), Expect = 0.86
Identities = 15/64 (23%), Positives = 32/64 (50%), Gaps = 3/64 (4%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILD---NEDDTKSQGSVETIDDDIEMVFAAEKKAD 117
E+++ EEPPE E E +E + E +D E+ + + +++ ++ +++ K D
Sbjct: 43 VEELEDQPEEPPEQEEENEEQKPKEEIDYPIQENKSFDEKNLDDLERSNSDIYSESHKYD 102
Query: 118 PKRD 121
D
Sbjct: 103 NASD 106
>gnl|CDD|218970 pfam06278, DUF1032, Protein of unknown function (DUF1032). This
family consists of several conserved eukaryotic proteins
of unknown function.
Length = 565
Score = 31.1 bits (70), Expect = 0.87
Identities = 16/79 (20%), Positives = 32/79 (40%), Gaps = 4/79 (5%)
Query: 34 WTKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDT 93
W +K+ + + Q + ++ E+ + +EE+ E E+D D + D ED
Sbjct: 342 WKHVKERLETQRQMLRRRGAERWLPDEEQKLWPLEEDRLEDSVEDDGDAD-DFSDPEDYL 400
Query: 94 KSQGSV---ETIDDDIEMV 109
+ + E D+E
Sbjct: 401 EPPEGLDPEEQAFQDLEAE 419
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 30.4 bits (69), Expect = 1.0
Identities = 20/95 (21%), Positives = 39/95 (41%), Gaps = 7/95 (7%)
Query: 47 KKMKSKTAKYWTG-GEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
KK K K +K G +E+ D+ E E + P+E D ++ +
Sbjct: 2 KKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKI------GKNPS 55
Query: 106 IEMVFAAEKKADPKRDMEKVRLRKQLMAIEEKIKA 140
++ F +K + K + LR++ + +E +K
Sbjct: 56 VDTSFLPDKAREEKEAELREELREEFLKKQEAVKE 90
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 30.9 bits (69), Expect = 1.1
Identities = 23/111 (20%), Positives = 49/111 (44%), Gaps = 4/111 (3%)
Query: 30 ARKMWTKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDN 89
A+K K + + ++ K K + E+ K EE E + + +
Sbjct: 1612 AKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE 1671
Query: 90 EDDTKSQGSVETIDDDIEMVFAAEKKADPKRDMEKVRLRKQLMAIEEKIKA 140
ED K++ + + +D+ + A +K+A+ + E+++ ++ EEK KA
Sbjct: 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEA----EEKKKA 1718
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 30.5 bits (69), Expect = 1.1
Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 7/68 (10%)
Query: 46 QKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
Q K ++K++K T + D ++E + ++DE LD+ DD + D+D
Sbjct: 228 QPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDD-------DVSDED 280
Query: 106 IEMVFAAE 113
E +F +
Sbjct: 281 GEDLFDTD 288
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 30.7 bits (69), Expect = 1.1
Identities = 16/80 (20%), Positives = 31/80 (38%), Gaps = 16/80 (20%)
Query: 60 GEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPK 119
+E+ ++ + E EP +ED +E+ LD + DD ++ EK
Sbjct: 4016 ADENKEEADAEKDEPMQDEDPLEENNTLDED---------IQQDDFSDLAEDDEK----- 4061
Query: 120 RDMEKVRLRKQLMAIEEKIK 139
M + + + EE +
Sbjct: 4062 --MNEDGFEENVQENEESTE 4079
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 29.7 bits (67), Expect = 1.1
Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 1/75 (1%)
Query: 59 GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADP 118
G E + D EE+ PE +ED+ E + D + ++ +++ V E++ DP
Sbjct: 2 GSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDS-EEEQIFVTRQEEEVDP 60
Query: 119 KRDMEKVRLRKQLMA 133
+ + E R +++MA
Sbjct: 61 EAEAEFDREFEKMMA 75
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 30.0 bits (68), Expect = 1.2
Identities = 21/101 (20%), Positives = 41/101 (40%), Gaps = 21/101 (20%)
Query: 62 EDVDKVEEEPPEPEPE-------EDEVEDGEILDN---EDDTKSQGSVETIDDDIEMVFA 111
D D E EPP P + +DE ED ++ D+ E+D + +
Sbjct: 2 ADWDDEEFEPPAPPAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEEKA----------K 51
Query: 112 AEKKADPKRDME-KVRLRKQLMAIEEKIKAARHPLFSPDDQ 151
KA K+ ++ K+ +++ +E+ +P+D+
Sbjct: 52 VAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDE 92
>gnl|CDD|131353 TIGR02300, FYDLN_acid, TIGR02300 family protein. Members of this
family are bacterial proteins with a conserved motif
[KR]FYDLN, sometimes flanked by a pair of CXXC motifs,
followed by a long region of low complexity sequence in
which roughly half the residues are Asp and Glu,
including multiple runs of five or more acidic residues.
The function of members of this family is unknown.
Length = 129
Score = 29.2 bits (65), Expect = 1.2
Identities = 16/47 (34%), Positives = 23/47 (48%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIE 107
EE ++V+ P P +D ED +I D++DD E D D E
Sbjct: 75 EEADEEVDPGAPTPAGGDDLAEDEDIADDDDDVTFLEDEEDDDIDDE 121
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 29.8 bits (67), Expect = 1.3
Identities = 20/96 (20%), Positives = 38/96 (39%), Gaps = 3/96 (3%)
Query: 46 QKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
QK K A +E+ D+ + E + E E + E E + + + ++
Sbjct: 28 QKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEI 87
Query: 106 IEMVFAAEKKADPKRD-MEKVRLRKQL--MAIEEKI 138
+ E+ D K+ + + +K M +EEKI
Sbjct: 88 AQEEEKEEEAEDVKQQEVFSFKRKKPFKEMNLEEKI 123
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 30.0 bits (68), Expect = 1.3
Identities = 17/95 (17%), Positives = 41/95 (43%), Gaps = 3/95 (3%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAA---EKKAD 117
EE+ K E E ++D+ E+ ++ ++D+ +G ++ D E+ + E++ D
Sbjct: 80 EEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKD 139
Query: 118 PKRDMEKVRLRKQLMAIEEKIKAARHPLFSPDDQF 152
K ++L +E+ A + ++
Sbjct: 140 EAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKA 174
Score = 27.7 bits (62), Expect = 7.8
Identities = 24/85 (28%), Positives = 39/85 (45%), Gaps = 1/85 (1%)
Query: 59 GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSV-ETIDDDIEMVFAAEKKAD 117
G E D D EEE E E +ED ++GE +D E D + + S E ++ E A++ +D
Sbjct: 91 GLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSD 150
Query: 118 PKRDMEKVRLRKQLMAIEEKIKAAR 142
+ E + E + + A
Sbjct: 151 EELSEEDEEEAAEEEEAEAEKEKAS 175
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 30.0 bits (67), Expect = 1.3
Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 4/64 (6%)
Query: 60 GEEDVDKVEEEPPEPEPEEDE-VEDGE-ILD--NEDDTKSQGSVETIDDDIEMVFAAEKK 115
G + + E E E EE E +DGE ++D E K QG E +++++ +F E +
Sbjct: 197 GSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWE 256
Query: 116 ADPK 119
+
Sbjct: 257 EESP 260
Score = 28.4 bits (63), Expect = 4.4
Identities = 20/82 (24%), Positives = 35/82 (42%), Gaps = 6/82 (7%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILD------NEDDTKSQGSVETIDDDIEMVFAAEK 114
+ DVD EEE E +E ED E+ N + + + DD E V E
Sbjct: 171 QPDVDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEG 230
Query: 115 KADPKRDMEKVRLRKQLMAIEE 136
+ K+ E+ + ++++ + E
Sbjct: 231 ERIDKKQGEEEEMEEEVINLFE 252
>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6. The VP6 protein a
minor protein in the core of the virion is probably the
viral helicase.
Length = 322
Score = 30.0 bits (67), Expect = 1.4
Identities = 13/52 (25%), Positives = 23/52 (44%)
Query: 62 EDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAE 113
E + E+EP + + +E + DGE + K + ET D ++ E
Sbjct: 29 ESGESKEKEPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVDRRTHTE 80
>gnl|CDD|221757 pfam12761, End3, Actin cytoskeleton-regulatory complex protein
END3. Endocytosis is accomplished through the
sequential recruitment at endocytic sites of proteins
that drive cargo sorting, membrane invagination and
vesicle release. End3p is part of the coat module
protein complex Pan1, along with Pan1p, Sla1p, and
Sla2p. The proteins in this complex are regulated by
phosphorylation events. End3p also regulates the
cortical actin cytoskeleton. The subunits of the Pan1
complex are homologous to mammalian intersectin.
Length = 193
Score = 29.6 bits (67), Expect = 1.4
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 121 DMEKVRLRKQLMAIEEKIKAA 141
D E+VRLR++L +E K+ A
Sbjct: 95 DWEEVRLRRELAELESKLSKA 115
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 28.8 bits (64), Expect = 1.4
Identities = 13/56 (23%), Positives = 27/56 (48%)
Query: 62 EDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKAD 117
+D + EEE E + EE+ + E + E++T+ +D+ + ++K D
Sbjct: 48 DDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAETKKQKTD 103
Score = 28.4 bits (63), Expect = 2.0
Identities = 17/78 (21%), Positives = 34/78 (43%), Gaps = 5/78 (6%)
Query: 46 QKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
++K K A G E+ + +E + EE+EV++ + + + + +G E +
Sbjct: 24 EEKENGKNAP--ANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETEG 81
Query: 106 IEMVFAAEK---KADPKR 120
AAE A+ K+
Sbjct: 82 ATGKRAAEDEEDDAETKK 99
>gnl|CDD|240235 PTZ00032, PTZ00032, 60S ribosomal protein L18; Provisional.
Length = 211
Score = 29.8 bits (67), Expect = 1.5
Identities = 14/78 (17%), Positives = 26/78 (33%), Gaps = 1/78 (1%)
Query: 5 PKIPNLQQIQKTNPKLKKVNYPIIVARKMWTKLKKNHQNRNQKKMKSKTAKYWTGGEEDV 64
P P K P + + TK K + + K + EE V
Sbjct: 25 PPNPLSLFSSPNRKKSAPEQVPTGKNKLLLTKRSKLK-GIPKPRKLHKHGFWAEIFEEKV 83
Query: 65 DKVEEEPPEPEPEEDEVE 82
++ E P + ++D ++
Sbjct: 84 EREELGNPCKDLDKDIIK 101
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 30.0 bits (68), Expect = 1.8
Identities = 19/76 (25%), Positives = 36/76 (47%), Gaps = 4/76 (5%)
Query: 55 KYWTGGEEDVDKVEEEPPEPEPE----EDEVEDGEILDNEDDTKSQGSVETIDDDIEMVF 110
+ G E+ ++ EE+ + E E +D E+ E +D D+ + + ++ D+D E
Sbjct: 313 NFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEE 372
Query: 111 AAEKKADPKRDMEKVR 126
EK+ K+ E R
Sbjct: 373 EEEKEKKKKKSAESTR 388
>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger. This family contains
a number of divergent C2H2 type zinc fingers.
Length = 24
Score = 26.4 bits (58), Expect = 1.8
Identities = 9/24 (37%), Positives = 13/24 (54%)
Query: 214 FACEMCNYRAASKIAIYAHMRTTH 237
F C +C +SK A+ H+R H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKHH 24
Score = 24.5 bits (53), Expect = 8.6
Identities = 6/24 (25%), Positives = 12/24 (50%)
Query: 248 YHCAFCNFTSPTQWAINRHTKYTH 271
+ C C + ++ A+ RH + H
Sbjct: 1 FKCPLCGKSFSSKDALKRHLRKHH 24
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 29.8 bits (67), Expect = 2.0
Identities = 10/15 (66%), Positives = 11/15 (73%), Gaps = 1/15 (6%)
Query: 259 TQWAINRHTKYTHMN 273
T W IN+HT YTH N
Sbjct: 428 TDW-INKHTVYTHGN 441
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 29.8 bits (67), Expect = 2.3
Identities = 20/73 (27%), Positives = 32/73 (43%), Gaps = 10/73 (13%)
Query: 45 NQKKMKSKTAKYWTGGEEDVDKVEEEPPEPE----------PEEDEVEDGEILDNEDDTK 94
+ KS+ + + G EED D +EE + + E + EDGE + EDD +
Sbjct: 633 KADENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAE 692
Query: 95 SQGSVETIDDDIE 107
+ V ID +
Sbjct: 693 ANEGVVPIDKAVR 705
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 29.6 bits (66), Expect = 2.7
Identities = 14/69 (20%), Positives = 34/69 (49%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPKR 120
EED+ + + + E+++ +D + D+E++ + + ++ DD+ E E K
Sbjct: 142 EEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKS 201
Query: 121 DMEKVRLRK 129
+++K K
Sbjct: 202 EVDKTDCFK 210
>gnl|CDD|177300 PHA00732, PHA00732, hypothetical protein.
Length = 79
Score = 27.4 bits (61), Expect = 2.8
Identities = 10/39 (25%), Positives = 18/39 (46%)
Query: 248 YHCAFCNFTSPTQWAINRHTKYTHMNTSGPTIQPECIKI 286
+ C C FT+ T +A+ +H + H T P ++
Sbjct: 2 FKCPICGFTTVTLFALKQHARRNHTLTKCPVCNKSYRRL 40
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel,
four-helix bundle structure with a large hydrophobic
patch in which 23 residues of one monomer form van der
Waals contacts with the other monomer. This CENP-B
dimer configuration may be suitable for capturing two
distant CENP-B boxes during centromeric heterochromatin
formation.
Length = 101
Score = 27.8 bits (61), Expect = 2.8
Identities = 18/46 (39%), Positives = 24/46 (52%), Gaps = 1/46 (2%)
Query: 53 TAKYWTGGEE-DVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQG 97
T + G E+ D D EEE + E EED+ ED + D+E S G
Sbjct: 2 TLHFLEGEEDSDSDSDEEEDDDDEDEEDDDEDDDEDDDEVPVPSFG 47
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 29.2 bits (65), Expect = 2.9
Identities = 7/40 (17%), Positives = 9/40 (22%)
Query: 59 GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGS 98
G + PP P E + Q S
Sbjct: 125 GSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPS 164
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 28.4 bits (62), Expect = 2.9
Identities = 23/82 (28%), Positives = 37/82 (45%), Gaps = 7/82 (8%)
Query: 47 KKMKSKTAKYWTGGEEDVDKVE-----EEPPEPEPEEDEVEDGEILDNEDDTKSQGSVET 101
K K K K G +D + + EE EPEPEE+ E+ E E+ + ET
Sbjct: 38 KGGKGKGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPE--PEEEGEEEPEPEET 95
Query: 102 IDDDIEMVFAAEKKADPKRDME 123
+++ E E + +P+ + E
Sbjct: 96 GEEEPEPEPEPEPEPEPEPEPE 117
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 28.9 bits (65), Expect = 3.3
Identities = 26/107 (24%), Positives = 42/107 (39%), Gaps = 11/107 (10%)
Query: 21 KKVNYPI--IVARKMWTKLKKNHQNRN---QKKMKSKTAKYWTGGEEDVDKVEEE----- 70
++VN I +V +L +N + K + A+ E+VD+ E+E
Sbjct: 322 RRVNDVIRPLVREHNNDQLNVKLRNPSTKESKMRDKRRARLDPIDFEEVDEDEDEEEEQR 381
Query: 71 PPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKAD 117
E E EE E + E + +D S S + D E E +D
Sbjct: 382 SDEHEEEEGEDSEEEGSQSRED-GSSESSSDVGSDSESKADKESASD 427
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 28.9 bits (64), Expect = 3.6
Identities = 17/81 (20%), Positives = 40/81 (49%), Gaps = 4/81 (4%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGS----VETIDDDIEMVFAAEKKA 116
E+ V++ E+ E E++ +D E L + + + + ET + D ++ + KK
Sbjct: 474 EDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKR 533
Query: 117 DPKRDMEKVRLRKQLMAIEEK 137
+ E+ +L+ +M+ ++K
Sbjct: 534 KVDEEEEEKKLKMIMMSNKQK 554
>gnl|CDD|223976 COG1047, SlpA, FKBP-type peptidyl-prolyl cis-trans isomerases 2
[Posttranslational modification, protein turnover,
chaperones].
Length = 174
Score = 28.0 bits (63), Expect = 4.0
Identities = 13/47 (27%), Positives = 20/47 (42%), Gaps = 1/47 (2%)
Query: 60 GEEDVDKVEEEPPEPEPEEDEVEDGEILD-NEDDTKSQGSVETIDDD 105
GE D D V+ P + E+E G ++ D + G V + D
Sbjct: 70 GEYDPDLVQRVPRDEFQGVGELEVGMEVEAEGGDGEIPGVVTEVSGD 116
>gnl|CDD|217678 pfam03699, UPF0182, Uncharacterized protein family (UPF0182). This
family contains uncharacterized integral membrane
proteins.
Length = 771
Score = 29.1 bits (66), Expect = 4.0
Identities = 11/30 (36%), Positives = 12/30 (40%), Gaps = 9/30 (30%)
Query: 261 WAINRHTKYTH--------MNTSGPTIQPE 282
W INRH YTH +N PE
Sbjct: 413 W-INRHLVYTHGYGFVMSPVNQVTSEGLPE 441
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 29.0 bits (66), Expect = 4.0
Identities = 18/69 (26%), Positives = 29/69 (42%), Gaps = 12/69 (17%)
Query: 63 DVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPKRDM 122
VD EE P E E D + + E+ +D+ + AA++ P++ +
Sbjct: 173 FVDPNAEEDPAHVGSELEELDDDEDEEEE-----------EDENDDSLAADESELPEKVL 221
Query: 123 EKV-RLRKQ 130
EK L KQ
Sbjct: 222 EKFKALAKQ 230
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 28.4 bits (64), Expect = 4.0
Identities = 12/70 (17%), Positives = 29/70 (41%), Gaps = 5/70 (7%)
Query: 72 PEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPKRDMEKVRLRK-- 129
P P E E+ + + + + D+++ + + R +E+ R ++
Sbjct: 14 PPKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEE---LDEEEDDRFLEEYRRKRLA 70
Query: 130 QLMAIEEKIK 139
++ A+ EK K
Sbjct: 71 EMKALAEKSK 80
>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
Length = 869
Score = 28.8 bits (65), Expect = 4.1
Identities = 12/30 (40%), Positives = 14/30 (46%)
Query: 66 KVEEEPPEPEPEEDEVEDGEILDNEDDTKS 95
EEP E EE E D E + +EDD
Sbjct: 831 LPYEEPEEETAEEPEEVDPEDVKSEDDIDD 860
Score = 28.8 bits (65), Expect = 4.5
Identities = 14/49 (28%), Positives = 21/49 (42%), Gaps = 2/49 (4%)
Query: 62 EDVDKVEE-EP-PEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEM 108
V + E EP P EPEE+ E+ E +D ED + + +
Sbjct: 820 YKVKTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQLGL 868
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 28.7 bits (65), Expect = 4.1
Identities = 16/61 (26%), Positives = 27/61 (44%), Gaps = 1/61 (1%)
Query: 35 TKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTK 94
K+KK + +KK + + + K EEE E E +E+E E+ E E+ +
Sbjct: 410 KKIKKIVEKA-EKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468
Query: 95 S 95
Sbjct: 469 E 469
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 28.3 bits (64), Expect = 4.3
Identities = 14/72 (19%), Positives = 27/72 (37%), Gaps = 12/72 (16%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKKADPKR 120
+ + E E DE + + D+++ D+D + A+ DP+
Sbjct: 31 DPNAAAAAATAAAIESELDEEDLEDDDDDDE-----------DEDEDDEEEADLGPDPEE 79
Query: 121 DMEKV-RLRKQL 131
E+ LR+Q
Sbjct: 80 ARERFAELREQY 91
>gnl|CDD|224881 COG1970, MscL, Large-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 130
Score = 27.6 bits (62), Expect = 4.5
Identities = 11/28 (39%), Positives = 14/28 (50%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILD 88
++K E E P P P +EV EI D
Sbjct: 98 RRKLEKEEPEAPAPAPPAEEVLLTEIRD 125
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 28.8 bits (65), Expect = 4.5
Identities = 5/47 (10%), Positives = 20/47 (42%), Gaps = 1/47 (2%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETID-DDI 106
+++ ++ +E + +D+ + D+E +++ + D
Sbjct: 166 DDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLTATADP 212
Score = 28.0 bits (63), Expect = 6.7
Identities = 11/68 (16%), Positives = 27/68 (39%), Gaps = 7/68 (10%)
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKK----- 115
++D D +++ + + + D+ ++ + E + S D+D +K
Sbjct: 148 DDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLT 207
Query: 116 --ADPKRD 121
ADP +
Sbjct: 208 ATADPVKA 215
>gnl|CDD|219038 pfam06459, RR_TM4-6, Ryanodine Receptor TM 4-6. This region covers
TM regions 4-6 of the ryanodine receptor 1 family.
Length = 217
Score = 28.1 bits (63), Expect = 4.9
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 66 KVEEEPPEPEPEEDEVEDGE 85
K E E P+ EPE+ ++EDGE
Sbjct: 92 KEEGEEPKSEPEKADLEDGE 111
>gnl|CDD|225880 COG3343, RpoE, DNA-directed RNA polymerase, delta subunit
[Transcription].
Length = 175
Score = 27.8 bits (62), Expect = 5.2
Identities = 14/47 (29%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
Query: 59 GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDD 105
G E+++D E++ E + E D ++D D+ED+ + I+DD
Sbjct: 113 GDEDELDYDEDKEEEEDDEVDSLDDEN--DDEDEDDDEIVEILIEDD 157
>gnl|CDD|234609 PRK00068, PRK00068, hypothetical protein; Validated.
Length = 970
Score = 28.4 bits (64), Expect = 5.3
Identities = 9/13 (69%), Positives = 9/13 (69%), Gaps = 1/13 (7%)
Query: 261 WAINRHTKYTHMN 273
W INRH YTH N
Sbjct: 430 W-INRHLVYTHGN 441
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 26.6 bits (59), Expect = 5.4
Identities = 20/63 (31%), Positives = 29/63 (46%), Gaps = 3/63 (4%)
Query: 36 KLK-KNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTK 94
KLK K + +KK K K K + +E+V +EE + E D E E DNE +
Sbjct: 9 KLKLKGKKIDVKKKKKKKKKKNKS--KEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQ 66
Query: 95 SQG 97
+
Sbjct: 67 EED 69
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 27.3 bits (61), Expect = 5.7
Identities = 20/58 (34%), Positives = 22/58 (37%), Gaps = 3/58 (5%)
Query: 31 RKMWTKLKKNH---QNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGE 85
RK K K Q + QKK K K AK EE E E EE E + E
Sbjct: 77 RKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQE 134
>gnl|CDD|163564 TIGR03852, sucrose_gtfA, sucrose phosphorylase. In the forward
direction, this enzyme uses phosphate to cleave sucrose
into D-fructose + alpha-D-glucose 1-phosphate.
Characterized representatives from Streptococcus mutans
and Bifidobacterium adolescentis represent
well-separated branches of a molecular phylogenetic
tree. In S. mutans, the region including this gene has
been associated with neighboring transporter genes and
multiple sugar metabolism.
Length = 470
Score = 28.2 bits (63), Expect = 5.8
Identities = 11/57 (19%), Positives = 21/57 (36%), Gaps = 3/57 (5%)
Query: 32 KMWTKLKKNHQNRNQKKMKSKTAKYWTGG---EEDVDKVEEEPPEPEPEEDEVEDGE 85
+ + + N K + + +W G +EDVD + + +E DG
Sbjct: 89 EYYQDFLEKKDNSKYKDLFIRYKDFWPNGRPTQEDVDLIYKRKDRAPYQEVTFADGS 145
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 28.1 bits (63), Expect = 6.4
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 59 GGEEDVDKVEEEPPEPEPEEDEVEDG 84
EE+ + EEE E EPEE E E+G
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEG 373
>gnl|CDD|220888 pfam10846, DUF2722, Protein of unknown function (DUF2722). This
eukaryotic family of proteins has no known function.
Length = 373
Score = 27.9 bits (62), Expect = 6.6
Identities = 13/75 (17%), Positives = 28/75 (37%), Gaps = 10/75 (13%)
Query: 38 KKNHQNRNQKKMKSKTA----KYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDT 93
K Q + ++ KS ++ + E +++PP+ + +E + E D
Sbjct: 278 KPESQQKKHRRTKSSSSFGVIDLNSISEASQVNEDDDPPDSDSKERKNE------ENSDP 331
Query: 94 KSQGSVETIDDDIEM 108
+S S + D
Sbjct: 332 ESTPSDDNDDKTCSE 346
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family.
This family includes the HABP4 family of
hyaluronan-binding proteins, and the PAI-1 mRNA-binding
protein, PAI-RBP1. HABP4 has been observed to bind
hyaluronan (a glucosaminoglycan), but it is not known
whether this is its primary role in vivo. It has also
been observed to bind RNA, but with a lower affinity
than that for hyaluronan. PAI-1 mRNA-binding protein
specifically binds the mRNA of type-1 plasminogen
activator inhibitor (PAI-1), and is thought to be
involved in regulation of mRNA stability. However, in
both cases, the sequence motifs predicted to be
important for ligand binding are not conserved
throughout the family, so it is not known whether
members of this family share a common function.
Length = 106
Score = 26.6 bits (59), Expect = 7.3
Identities = 11/28 (39%), Positives = 13/28 (46%)
Query: 58 TGGEEDVDKVEEEPPEPEPEEDEVEDGE 85
EE EE E EE+EVE+ E
Sbjct: 48 NEVEEKQAVEEEANKEGVVEEEEVEEEE 75
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 28.1 bits (63), Expect = 7.5
Identities = 23/111 (20%), Positives = 45/111 (40%), Gaps = 13/111 (11%)
Query: 42 QNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVET 101
+ R + ++ A+ ED D +EE E E E+E E + +E
Sbjct: 332 ECRVAAQAHNEEAE---SLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEE 388
Query: 102 IDDDIEMV-----FAAEK--KADPKRDM---EKVRLRKQLMAIEEKIKAAR 142
++++IE + A A+ + E+ LR++ +E ++ AR
Sbjct: 389 LEEEIEELRERFGDAPVDLGNAEDFLEELREERDELREREAELEATLRTAR 439
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 26.8 bits (59), Expect = 7.7
Identities = 14/63 (22%), Positives = 24/63 (38%), Gaps = 4/63 (6%)
Query: 57 WT--GGEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEK 114
WT G E EE E D + ++ D+ + + D++ E A ++
Sbjct: 52 WTEAGRAEKGPAAAATAAAEAAEEAEAADAD--EDADEAAEADAADEADEEEETDEAVDE 109
Query: 115 KAD 117
AD
Sbjct: 110 TAD 112
>gnl|CDD|236876 PRK11191, PRK11191, RNase E inhibitor protein; Provisional.
Length = 138
Score = 26.8 bits (60), Expect = 7.7
Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 2/42 (4%)
Query: 75 EPEEDEVEDGEILDNEDDTKSQG-SVETIDDDIEMVFA-AEK 114
+ EE E+EDG+++ D + E ID +E + A AEK
Sbjct: 62 DAEELELEDGDVIFCCDAVSEVALNAELIDAQVEQLLALAEK 103
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 27.7 bits (62), Expect = 9.1
Identities = 11/23 (47%), Positives = 13/23 (56%)
Query: 68 EEEPPEPEPEEDEVEDGEILDNE 90
EEE PE EE+E E+ E E
Sbjct: 310 EEEEEVPEEEEEEEEEEERTFEE 332
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 25.9 bits (57), Expect = 9.3
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 60 GEEDVDKVEEEPPEPEPEEDEVEDGEILDNEDD 92
GEE D ++E + + ++D+ +D + D+EDD
Sbjct: 47 GEEPEDDDDDEDDDDDDDKDDKDDDDDDDDEDD 79
>gnl|CDD|234586 PRK00037, hisS, histidyl-tRNA synthetase; Reviewed.
Length = 412
Score = 27.4 bits (62), Expect = 9.4
Identities = 7/23 (30%), Positives = 10/23 (43%)
Query: 161 VLYADLLDHCKLCLRRKRKNPMR 183
L L + + RR NP+R
Sbjct: 179 FLEKGLDELDEDSKRRLETNPLR 201
>gnl|CDD|235861 PRK06782, PRK06782, flagellar motor switch protein; Reviewed.
Length = 528
Score = 27.5 bits (61), Expect = 9.8
Identities = 19/115 (16%), Positives = 41/115 (35%), Gaps = 18/115 (15%)
Query: 10 LQQIQKTNPKLKKVNYPIIVARKMWTKLKKNHQNRNQKKMKSKTAKYWTG---------G 60
L + + + L ++ + + V + TK+ + + AK
Sbjct: 370 LGESIEVDE-LVQITFNLEVGDLLQTKM--------YQILPISEAKEMVRLLLYPMVKEE 420
Query: 61 EEDVDKVEEEPPEPEPEEDEVEDGEILDNEDDTKSQGSVETIDDDIEMVFAAEKK 115
EE + E EP +E E+ E ++ + ++++VF + KK
Sbjct: 421 EEIATEEIEPEKVVEPVVQPIEFKEVKQMEPVYMDTSILQDVPMNVKVVFGSTKK 475
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 27.7 bits (62), Expect = 9.9
Identities = 22/95 (23%), Positives = 38/95 (40%), Gaps = 19/95 (20%)
Query: 61 EEDVDKV-EEEPPEPEPEEDEVEDGEILDNEDDTKSQGS---------------VETIDD 104
E + KV EEE P + +E E EI E K +E +
Sbjct: 391 AEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKR 450
Query: 105 DIEMVFAAEKKADPKRDME-KVRLRKQLMAIEEKI 138
+IE + + +R++ KVR +++ A + +I
Sbjct: 451 EIEKL--ESELERFRREVRDKVRKDREIRARDRRI 483
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 27.5 bits (61), Expect = 9.9
Identities = 20/111 (18%), Positives = 39/111 (35%), Gaps = 10/111 (9%)
Query: 15 KTNPKLKKVNYPIIVARKMWTKLKKNHQNRNQKKMKSKTAKYWTGGEEDVDKVEEEPPEP 74
LK+ + + + + K+ K K + +E+ + EE
Sbjct: 202 VKAASLKRNPPK-------KSNIMSSFFKKKTKEKKEKKEASESTVKEESE--EESGKRD 252
Query: 75 EPEEDEVEDGEILDNEDDTKS-QGSVETIDDDIEMVFAAEKKADPKRDMEK 124
EDE + LD ++D + S E D + E ++K + M +
Sbjct: 253 VILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMME 303
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.134 0.423
Gapped
Lambda K H
0.267 0.0710 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,519,745
Number of extensions: 1453288
Number of successful extensions: 2891
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2540
Number of HSP's successfully gapped: 170
Length of query: 304
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 208
Effective length of database: 6,679,618
Effective search space: 1389360544
Effective search space used: 1389360544
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 59 (26.5 bits)