RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4596
(439 letters)
>gnl|CDD|238492 cd00992, PDZ_signaling, PDZ domain found in a variety of Eumetazoan
signaling molecules, often in tandem arrangements. May
be responsible for specific protein-protein
interactions, as most PDZ domains bind C-terminal
polypeptides, and binding to internal (non-C-terminal)
polypeptides and even to lipids has been demonstrated.
In this subfamily of PDZ domains an N-terminal
beta-strand forms the peptide-binding groove base, a
circular permutation with respect to PDZ domains found
in proteases.
Length = 82
Score = 84.9 bits (211), Expect = 2e-20
Identities = 40/91 (43%), Positives = 50/91 (54%), Gaps = 11/91 (12%)
Query: 160 LQDVVLPK--SGPLGFSIIGGTDHSCIPFGQHKPGIFISHIVPGGVAALSGKLRMGDRIL 217
++ V L K G LGFS+ GG D GIF+S + PGG A G LR+GDRIL
Sbjct: 1 VRTVTLRKDPGGGLGFSLRGGKDS--------GGGIFVSRVEPGGPAER-GGLRVGDRIL 51
Query: 218 AVNGRDVSKVTHEEAVLALLEPCPEILLKIQ 248
VNG V +THEEAV L E+ L ++
Sbjct: 52 EVNGVSVEGLTHEEAVELLKNSGDEVTLTVR 82
Score = 81.8 bits (203), Expect = 2e-19
Identities = 35/90 (38%), Positives = 56/90 (62%), Gaps = 10/90 (11%)
Query: 258 EITLIRQEGEKLGMHIKGGLRGQRGNPLDKNDEGVFITKINSGGAAKRDGRLKVGMRLLE 317
+TL + G LG ++GG + G+F++++ GG A+R G L+VG R+LE
Sbjct: 3 TVTLRKDPGGGLGFSLRGG---------KDSGGGIFVSRVEPGGPAER-GGLRVGDRILE 52
Query: 318 VNGMSLLGASHQEAVNTLRYSGHQISLTVC 347
VNG+S+ G +H+EAV L+ SG +++LTV
Sbjct: 53 VNGVSVEGLTHEEAVELLKNSGDEVTLTVR 82
Score = 47.9 bits (115), Expect = 2e-07
Identities = 20/37 (54%), Positives = 29/37 (78%)
Query: 29 VGMRLLEVNGMSLLGASHQEAVNTLRYSGHQISLTVC 65
VG R+LEVNG+S+ G +H+EAV L+ SG +++LTV
Sbjct: 46 VGDRILEVNGVSVEGLTHEEAVELLKNSGDEVTLTVR 82
>gnl|CDD|214570 smart00228, PDZ, Domain present in PSD-95, Dlg, and ZO-1/2. Also
called DHR (Dlg homologous region) or GLGF (relatively
well conserved tetrapeptide in these domains). Some PDZs
have been shown to bind C-terminal polypeptides; others
appear to bind internal (non-C-terminal) polypeptides.
Different PDZs possess different binding specificities.
Length = 85
Score = 80.1 bits (198), Expect = 1e-18
Identities = 36/93 (38%), Positives = 49/93 (52%), Gaps = 10/93 (10%)
Query: 160 LQDVVLPK-SGPLGFSIIGGTDHSCIPFGQHKPGIFISHIVPGGVAALSGKLRMGDRILA 218
+ V L K G LGFS++GG D G+ +S +VPG AA +G LR+GD IL
Sbjct: 2 PRLVELEKGGGGLGFSLVGGKDEG--------GGVVVSSVVPGSPAAKAG-LRVGDVILE 52
Query: 219 VNGRDVSKVTHEEAVLALLEPCPEILLKIQHDP 251
VNG V +TH EAV L + ++ L +
Sbjct: 53 VNGTSVEGLTHLEAVDLLKKAGGKVTLTVLRGG 85
Score = 67.4 bits (165), Expect = 4e-14
Identities = 30/88 (34%), Positives = 48/88 (54%), Gaps = 10/88 (11%)
Query: 262 IRQEGEKLGMHIKGGLRGQRGNPLDKNDEGVFITKINSGGAAKRDGRLKVGMRLLEVNGM 321
+ + G LG + GG GV ++ + G A + G L+VG +LEVNG
Sbjct: 7 LEKGGGGLGFSLVGG---------KDEGGGVVVSSVVPGSPAAKAG-LRVGDVILEVNGT 56
Query: 322 SLLGASHQEAVNTLRYSGHQISLTVCKG 349
S+ G +H EAV+ L+ +G +++LTV +G
Sbjct: 57 SVEGLTHLEAVDLLKKAGGKVTLTVLRG 84
Score = 40.8 bits (96), Expect = 7e-05
Identities = 19/39 (48%), Positives = 29/39 (74%)
Query: 29 VGMRLLEVNGMSLLGASHQEAVNTLRYSGHQISLTVCKG 67
VG +LEVNG S+ G +H EAV+ L+ +G +++LTV +G
Sbjct: 46 VGDVILEVNGTSVEGLTHLEAVDLLKKAGGKVTLTVLRG 84
>gnl|CDD|201332 pfam00595, PDZ, PDZ domain (Also known as DHR or GLGF). PDZ
domains are found in diverse signaling proteins.
Length = 80
Score = 75.0 bits (185), Expect = 5e-17
Identities = 42/89 (47%), Positives = 59/89 (66%), Gaps = 11/89 (12%)
Query: 162 DVVLPKS--GPLGFSIIGGTDHSCIPFGQHKPGIFISHIVPGGVAALSGKLRMGDRILAV 219
+V L KS G LGFS++GG+D PGIF+S ++PGG AA +G L+ GDRIL++
Sbjct: 1 EVTLEKSGRGGLGFSLVGGSDG--------DPGIFVSEVLPGG-AAEAGGLQEGDRILSI 51
Query: 220 NGRDVSKVTHEEAVLALLEPCPEILLKIQ 248
NG+D+ ++H+EAVLAL E+ L I
Sbjct: 52 NGQDLENLSHDEAVLALKGSGGEVTLTIL 80
Score = 63.0 bits (154), Expect = 9e-13
Identities = 32/89 (35%), Positives = 49/89 (55%), Gaps = 10/89 (11%)
Query: 258 EITLIRQEGEKLGMHIKGGLRGQRGNPLDKNDEGVFITKINSGGAAKRDGRLKVGMRLLE 317
E+TL + LG + GG D G+F++++ GGAA+ G L+ G R+L
Sbjct: 1 EVTLEKSGRGGLGFSLVGG---------SDGDPGIFVSEVLPGGAAEAGG-LQEGDRILS 50
Query: 318 VNGMSLLGASHQEAVNTLRYSGHQISLTV 346
+NG L SH EAV L+ SG +++LT+
Sbjct: 51 INGQDLENLSHDEAVLALKGSGGEVTLTI 79
Score = 31.8 bits (73), Expect = 0.10
Identities = 16/36 (44%), Positives = 23/36 (63%)
Query: 29 VGMRLLEVNGMSLLGASHQEAVNTLRYSGHQISLTV 64
G R+L +NG L SH EAV L+ SG +++LT+
Sbjct: 44 EGDRILSINGQDLENLSHDEAVLALKGSGGEVTLTI 79
>gnl|CDD|238080 cd00136, PDZ, PDZ domain, also called DHR (Dlg homologous region)
or GLGF (after a conserved sequence motif). Many PDZ
domains bind C-terminal polypeptides, though binding to
internal (non-C-terminal) polypeptides and even to
lipids has been demonstrated. Heterodimerization through
PDZ-PDZ domain interactions adds to the domain's
versatility, and PDZ domain-mediated interactions may be
modulated dynamically through target phosphorylation.
Some PDZ domains play a role in scaffolding
supramolecular complexes. PDZ domains are found in
diverse signaling proteins in bacteria, archebacteria,
and eurkayotes. This CD contains two distinct structural
subgroups with either a N- or C-terminal beta-strand
forming the peptide-binding groove base. The circular
permutation placing the strand on the N-terminus appears
to be found in Eumetazoa only, while the C-terminal
variant is found in all three kingdoms of life, and
seems to co-occur with protease domains. PDZ domains
have been named after PSD95(post synaptic density
protein), DlgA (Drosophila disc large tumor suppressor),
and ZO1, a mammalian tight junction protein.
Length = 70
Score = 51.2 bits (123), Expect = 1e-08
Identities = 28/68 (41%), Positives = 36/68 (52%), Gaps = 11/68 (16%)
Query: 169 GPLGFSIIGGTDHSCIPFGQHKPGIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVT 228
G LGFSI GGT+ G+ + + PG A +G L+ GD ILAVNG DV +T
Sbjct: 1 GGLGFSIRGGTE----------GGVVVLSVEPGSPAERAG-LQAGDVILAVNGTDVKNLT 49
Query: 229 HEEAVLAL 236
E+ L
Sbjct: 50 LEDVAELL 57
Score = 43.1 bits (102), Expect = 8e-06
Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 2/62 (3%)
Query: 286 DKNDEGVFITKINSGGAAKRDGRLKVGMRLLEVNGMSLLGASHQEAVNTLRYS-GHQISL 344
+ GV + + G A+R G L+ G +L VNG + + ++ L+ G +++L
Sbjct: 9 GGTEGGVVVLSVEPGSPAERAG-LQAGDVILAVNGTDVKNLTLEDVAELLKKEVGEKVTL 67
Query: 345 TV 346
TV
Sbjct: 68 TV 69
Score = 29.6 bits (67), Expect = 0.47
Identities = 10/37 (27%), Positives = 19/37 (51%), Gaps = 1/37 (2%)
Query: 29 VGMRLLEVNGMSLLGASHQEAVNTLRYS-GHQISLTV 64
G +L VNG + + ++ L+ G +++LTV
Sbjct: 33 AGDVILAVNGTDVKNLTLEDVAELLKKEVGEKVTLTV 69
>gnl|CDD|238488 cd00988, PDZ_CTP_protease, PDZ domain of C-terminal processing-,
tail-specific-, and tricorn proteases, which function in
posttranslational protein processing, maturation, and
disassembly or degradation, in Bacteria, Archaea, and
plant chloroplasts. May be responsible for substrate
recognition and/or binding, as most PDZ domains bind
C-terminal polypeptides, and binding to internal
(non-C-terminal) polypeptides and even to lipids has
been demonstrated. In this subfamily of
protease-associated PDZ domains a C-terminal beta-strand
forms the peptide-binding groove base, a circular
permutation with respect to PDZ domains found in
Eumetazoan signaling proteins.
Length = 85
Score = 49.9 bits (120), Expect = 5e-08
Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 4/80 (5%)
Query: 186 FGQHKPGIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALL-EPCPEIL 244
G+ I+ ++PG AA +G ++ GD I+A++G V ++ E+ V L + ++
Sbjct: 8 LKYDDGGLVITSVLPGSPAAKAG-IKAGDIIVAIDGEPVDGLSLEDVVKLLRGKAGTKVR 66
Query: 245 LKIQHDPHPEGFQEITLIRQ 264
L ++ +E+TL R
Sbjct: 67 LTLKRG--DGEPREVTLTRL 84
Score = 32.6 bits (75), Expect = 0.061
Identities = 25/76 (32%), Positives = 41/76 (53%), Gaps = 8/76 (10%)
Query: 285 LDKNDEGVFITKINSGG-AAKRDGRLKVGMRLLEVNGMSLLGASHQEAVNTLR-YSGHQI 342
L +D G+ IT + G AAK +K G ++ ++G + G S ++ V LR +G ++
Sbjct: 8 LKYDDGGLVITSVLPGSPAAKAG--IKAGDIIVAIDGEPVDGLSLEDVVKLLRGKAGTKV 65
Query: 343 SLTVCKGFERGDIERR 358
LT+ +RGD E R
Sbjct: 66 RLTL----KRGDGEPR 77
>gnl|CDD|233695 TIGR02037, degP_htrA_DO, periplasmic serine protease, Do/DeqQ
family. This family consists of a set proteins various
designated DegP, heat shock protein HtrA, and protease
DO. The ortholog in Pseudomonas aeruginosa is designated
MucD and is found in an operon that controls mucoid
phenotype. This family also includes the DegQ (HhoA)
paralog in E. coli which can rescue a DegP mutant, but
not the smaller DegS paralog, which cannot. Members of
this family are located in the periplasm and have
separable functions as both protease and chaperone.
Members have a trypsin domain and two copies of a PDZ
domain. This protein protects bacteria from thermal and
other stresses and may be important for the survival of
bacterial pathogens.// The chaperone function is
dominant at low temperatures, whereas the proteolytic
activity is turned on at elevated temperatures [Protein
fate, Protein folding and stabilization, Protein fate,
Degradation of proteins, peptides, and glycopeptides].
Length = 428
Score = 53.0 bits (128), Expect = 2e-07
Identities = 37/148 (25%), Positives = 61/148 (41%), Gaps = 20/148 (13%)
Query: 186 FGQHKP-GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEIL 244
G K G ++ ++PG A +G L+ GD I +VNG+ +S + L+P ++
Sbjct: 251 LGLEKQRGALVAQVLPGSPAEKAG-LKAGDVITSVNGKPISSFADLRRAIGTLKPGKKVT 309
Query: 245 LKIQHDPHPEGFQEITL-------IRQEGEKL-----GMHIKGGLRGQRGNPLDKNDEGV 292
L I + IT+ + ++ +R + D GV
Sbjct: 310 LGILRKGKE---KTITVTLGASPEEQASSSNPFLGLTVANLSPEIRKELRLKGDVK--GV 364
Query: 293 FITKINSGGAAKRDGRLKVGMRLLEVNG 320
+TK+ SG A R G L+ G +L VN
Sbjct: 365 VVTKVVSGSPAARAG-LQPGDVILSVNQ 391
Score = 39.9 bits (94), Expect = 0.002
Identities = 21/59 (35%), Positives = 31/59 (52%), Gaps = 1/59 (1%)
Query: 192 GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLKIQHD 250
G+ ++ +V G AA +G L+ GD IL+VN + VS V VLA + + L I
Sbjct: 363 GVVVTKVVSGSPAARAG-LQPGDVILSVNQQPVSSVAELRKVLARAKKGGRVALLILRG 420
>gnl|CDD|238487 cd00987, PDZ_serine_protease, PDZ domain of tryspin-like serine
proteases, such as DegP/HtrA, which are oligomeric
proteins involved in heat-shock response, chaperone
function, and apoptosis. May be responsible for
substrate recognition and/or binding, as most PDZ
domains bind C-terminal polypeptides, though binding to
internal (non-C-terminal) polypeptides and even to
lipids has been demonstrated. In this subfamily of
protease-associated PDZ domains a C-terminal beta-strand
forms the peptide-binding groove base, a circular
permutation with respect to PDZ domains found in
Eumetazoan signaling proteins.
Length = 90
Score = 43.4 bits (103), Expect = 9e-06
Identities = 22/57 (38%), Positives = 33/57 (57%), Gaps = 1/57 (1%)
Query: 192 GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLKIQ 248
G+ ++ + PG AA +G L+ GD ILAVNG+ V V LA L+P ++ L +
Sbjct: 25 GVLVASVDPGSPAAKAG-LKPGDVILAVNGKPVKSVADLRRALAELKPGDKVTLTVL 80
Score = 33.4 bits (77), Expect = 0.038
Identities = 25/90 (27%), Positives = 41/90 (45%), Gaps = 13/90 (14%)
Query: 276 GLRGQRGNP------LDKNDEGVFITKINSGGAAKRDGRLKVGMRLLEVNGMSLLGASH- 328
G+ Q P K+ +GV + ++ G A + G LK G +L VNG + +
Sbjct: 4 GVTVQDLTPDLAEELGLKDTKGVLVASVDPGSPAAKAG-LKPGDVILAVNGKPVKSVADL 62
Query: 329 QEAVNTLRYSGHQISLTVCKGFERGDIERR 358
+ A+ L+ G +++LTV RG E
Sbjct: 63 RRALAELK-PGDKVTLTV----LRGGKELT 87
>gnl|CDD|238489 cd00989, PDZ_metalloprotease, PDZ domain of bacterial and plant
zinc metalloprotases, presumably membrane-associated or
integral membrane proteases, which may be involved in
signalling and regulatory mechanisms. May be responsible
for substrate recognition and/or binding, as most PDZ
domains bind C-terminal polypeptides, and binding to
internal (non-C-terminal) polypeptides and even to
lipids has been demonstrated. In this subfamily of
protease-associated PDZ domains a C-terminal beta-strand
forms the peptide-binding groove base, a circular
permutation with respect to PDZ domains found in
Eumetazoan signaling proteins.
Length = 79
Score = 41.1 bits (97), Expect = 5e-05
Identities = 23/68 (33%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 184 IPFGQHKPGIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLE-PCPE 242
G I +VPG AA +G L+ GDRILA+NG+ + + E+ V A+ E P
Sbjct: 5 FVPGGPPIEPVIGEVVPGSPAAKAG-LKAGDRILAINGQKIK--SWEDLVDAVQENPGKP 61
Query: 243 ILLKIQHD 250
+ L ++ +
Sbjct: 62 LTLTVERN 69
>gnl|CDD|223864 COG0793, Prc, Periplasmic protease [Cell envelope biogenesis, outer
membrane].
Length = 406
Score = 41.2 bits (97), Expect = 0.001
Identities = 22/78 (28%), Positives = 40/78 (51%), Gaps = 9/78 (11%)
Query: 192 GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLAL-LEPCPEILLKIQHD 250
G+ + + G AA +G ++ GD I+ ++G+ V V+ +EAV + +P ++ L I
Sbjct: 113 GVKVVSPIDGSPAAKAG-IKPGDVIIKIDGKSVGGVSLDEAVKLIRGKPGTKVTLTI--- 168
Query: 251 PHPEG---FQEITLIRQE 265
G +TL R+E
Sbjct: 169 -LRAGGGKPFTVTLTREE 185
Score = 31.1 bits (71), Expect = 1.1
Identities = 26/104 (25%), Positives = 46/104 (44%), Gaps = 19/104 (18%)
Query: 250 DPH-----PEGFQEITL-IRQEGEKLGMHIKGGLRGQRGNPLDKNDEGVFITKINSGGAA 303
DPH PE E E +G+ ++ ++ GV + G A
Sbjct: 77 DPHSTYLDPEDAAEFRTDTSGEFGGIGIELQ-----------MEDIGGVKVVSPIDGSPA 125
Query: 304 KRDGRLKVGMRLLEVNGMSLLGASHQEAVNTLR-YSGHQISLTV 346
+ G +K G +++++G S+ G S EAV +R G +++LT+
Sbjct: 126 AKAG-IKPGDVIIKIDGKSVGGVSLDEAVKLIRGKPGTKVTLTI 168
>gnl|CDD|223343 COG0265, DegQ, Trypsin-like serine proteases, typically
periplasmic, contain C-terminal PDZ domain
[Posttranslational modification, protein turnover,
chaperones].
Length = 347
Score = 38.3 bits (89), Expect = 0.005
Identities = 20/58 (34%), Positives = 33/58 (56%), Gaps = 1/58 (1%)
Query: 191 PGIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLKIQ 248
G + ++PG AA +G ++ GD I AVNG+ V+ ++ A +A P E+ LK+
Sbjct: 270 AGAVVLGVLPGSPAAKAG-IKAGDIITAVNGKPVASLSDLVAAVASNRPGDEVALKLL 326
Score = 31.0 bits (70), Expect = 1.1
Identities = 14/68 (20%), Positives = 25/68 (36%), Gaps = 3/68 (4%)
Query: 280 QRGNPLDKNDEGVFITKINSGGAAKRDGRLKVGMRLLEVNGMSLLGA-SHQEAVNTLRYS 338
G + + G A + G +K G + VNG + AV + R
Sbjct: 260 ADIALGLPVAAGAVVLGVLPGSPAAKAG-IKAGDIITAVNGKPVASLSDLVAAVASNR-P 317
Query: 339 GHQISLTV 346
G +++L +
Sbjct: 318 GDEVALKL 325
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 36.4 bits (84), Expect = 0.009
Identities = 16/58 (27%), Positives = 40/58 (68%), Gaps = 2/58 (3%)
Query: 376 DREDLDTDVFKQEEEM--KKELVEWEKEEQEERKENLELDEVREKSTPEMPHYFQDAN 431
+ E+ D + ++ E++ ++E+VE E+EE+E+ ++N++L ++ +K+ ++ + QD N
Sbjct: 53 EEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKKNINDIFNSTQDDN 110
Score = 28.3 bits (63), Expect = 4.9
Identities = 16/54 (29%), Positives = 32/54 (59%), Gaps = 3/54 (5%)
Query: 372 MSSLDREDL---DTDVFKQEEEMKKELVEWEKEEQEERKENLELDEVREKSTPE 422
++S D+ED+ + DV +++E +E E ++EE EE ++ + +E+ E E
Sbjct: 28 ITSWDKEDIIKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEE 81
>gnl|CDD|221961 pfam13180, PDZ_2, PDZ domain.
Length = 81
Score = 33.4 bits (77), Expect = 0.032
Identities = 17/64 (26%), Positives = 32/64 (50%), Gaps = 1/64 (1%)
Query: 187 GQHKPGIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLK 246
G+ + + G AA +G L+ GD IL+++G+ V+ +T V+ +P + L
Sbjct: 9 QNEGTGVTVVSVKEGSPAAKAG-LKPGDIILSIDGKKVNSLTELIEVILNGKPGDTVKLT 67
Query: 247 IQHD 250
+ D
Sbjct: 68 VYRD 71
>gnl|CDD|236873 PRK11186, PRK11186, carboxy-terminal protease; Provisional.
Length = 667
Score = 34.1 bits (79), Expect = 0.14
Identities = 13/25 (52%), Positives = 18/25 (72%)
Query: 195 ISHIVPGGVAALSGKLRMGDRILAV 219
I+ +V GG AA S KL +GD+I+ V
Sbjct: 259 INSLVAGGPAAKSKKLSVGDKIVGV 283
>gnl|CDD|227169 COG4832, COG4832, Uncharacterized conserved protein [Function
unknown].
Length = 207
Score = 33.3 bits (76), Expect = 0.18
Identities = 27/142 (19%), Positives = 55/142 (38%), Gaps = 19/142 (13%)
Query: 310 KVGMRLLEVNGMSLL---------GASHQEAVNTLRYSGHQISLTVCKGFERGDIERRAS 360
K +++V M+ L Q AV L + I ++ E+ D +
Sbjct: 18 KKKASIIKVPEMNFLMIDGKGDPNSTEFQNAVQVLYSLAYSIKMSYKMKEEKDDKIKDFV 77
Query: 361 TS--EGSRSVTQSMSSLDREDLD-TDVFKQEEEMKKELVEWEKEEQEERKENLELDEVRE 417
EG LD+ T + +Q + + +E+ E KE E++K +++L V+
Sbjct: 78 VPPLEGLWWSEPEFDKLDKNKWKWTIMIRQPDFITQEIFEKAKEVVEKKKPSIDLSRVKF 137
Query: 418 KSTPEMPHYFQDANCVCFVNFG 439
+S ++ + ++ G
Sbjct: 138 ES-------LEEGLSIQIMHVG 152
>gnl|CDD|232883 TIGR00225, prc, C-terminal peptidase (prc). A C-terminal peptidase
with different substrates in different species including
processing of D1 protein of the photosystem II reaction
center in higher plants and cleavage of a peptide of 11
residues from the precursor form of penicillin-binding
protein in E.coli E.coli and H influenza have the most
distal branch of the tree and their proteins have an
N-terminal 200 amino acids that show no homology to
other proteins in the database [Protein fate,
Degradation of proteins, peptides, and glycopeptides,
Protein fate, Protein modification and repair].
Length = 334
Score = 33.1 bits (76), Expect = 0.22
Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 3/74 (4%)
Query: 193 IFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALL-EPCPEILLKIQHDP 251
I I G A +G ++ GD+I+ +NG+ V+ ++ ++AV + + ++ L+I
Sbjct: 64 IVIVSPFEGSPAEKAG-IKPGDKIIKINGKSVAGMSLDDAVALIRGKKGTKVSLEILR-A 121
Query: 252 HPEGFQEITLIRQE 265
TL R
Sbjct: 122 GKSKPLTFTLKRDR 135
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 32.5 bits (74), Expect = 0.38
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 373 SSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKENLELDEVREKSTPEMPHYFQDAN 431
+LD E LD ++ K+EE + EL E EKE+ + E +EL E +E+ E Y ++ N
Sbjct: 66 EALDSE-LD-ELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQLENEELQYLREYN 122
>gnl|CDD|182262 PRK10139, PRK10139, serine endoprotease; Provisional.
Length = 455
Score = 32.6 bits (74), Expect = 0.40
Identities = 19/70 (27%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 192 GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLKIQHDP 251
G F+S ++P +A +G ++ GD I ++NG+ ++ + +A EP ++ L + +
Sbjct: 291 GAFVSEVLPNSGSAKAG-VKAGDIITSLNGKPLNSFAELRSRIATTEPGTKVKLGLLRNG 349
Query: 252 HPEGFQEITL 261
P E+TL
Sbjct: 350 KPLEV-EVTL 358
>gnl|CDD|177679 PLN00047, PLN00047, photosystem II biogenesis protein Psb29;
Provisional.
Length = 283
Score = 32.1 bits (73), Expect = 0.44
Identities = 20/51 (39%), Positives = 29/51 (56%), Gaps = 7/51 (13%)
Query: 379 DLDTDVFK-------QEEEMKKELVEWEKEEQEERKENLELDEVREKSTPE 422
D D DV++ Q +E+ KE VE EK++QEER E+ + +E K E
Sbjct: 228 DRDLDVYRGLLSKLVQAKELLKEYVEREKKKQEERAESQKANEAVTKCLGE 278
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 32.5 bits (74), Expect = 0.48
Identities = 12/37 (32%), Positives = 23/37 (62%)
Query: 393 KELVEWEKEEQEERKENLELDEVREKSTPEMPHYFQD 429
KEL E ++E+ EE+ E ++ +EV E+ E ++ +
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYE 313
>gnl|CDD|182820 PRK10898, PRK10898, serine endoprotease; Provisional.
Length = 353
Score = 32.3 bits (74), Expect = 0.50
Identities = 11/34 (32%), Positives = 21/34 (61%), Gaps = 1/34 (2%)
Query: 189 HKPGIFISHIVPGGVAALSGKLRMGDRILAVNGR 222
GI ++ + P G AA +G +++ D I++VN +
Sbjct: 277 QLQGIVVNEVSPDGPAAKAG-IQVNDLIISVNNK 309
>gnl|CDD|216447 pfam01346, FKBP_N, Domain amino terminal to FKBP-type
peptidyl-prolyl isomerase. This family is only found at
the amino terminus of pfam00254. This domain is of
unknown function.
Length = 124
Score = 30.6 bits (70), Expect = 0.56
Identities = 13/59 (22%), Positives = 25/59 (42%), Gaps = 14/59 (23%)
Query: 367 SVTQSMSSLDREDLDTDVFKQ--------------EEEMKKELVEWEKEEQEERKENLE 411
S+ + + E LD D Q +EE++K L ++K+ Q +++ E
Sbjct: 32 SLGRYLKQQGEEQLDKDALLQGLQDALAGKELKLTDEEIQKALQAFQKKLQAKQQAKAE 90
>gnl|CDD|238486 cd00986, PDZ_LON_protease, PDZ domain of ATP-dependent LON serine
proteases. Most PDZ domains bind C-terminal
polypeptides, though binding to internal
(non-C-terminal) polypeptides and even to lipids has
been demonstrated. In this bacterial subfamily of
protease-associated PDZ domains a C-terminal beta-strand
is thought to form the peptide-binding groove base, a
circular permutation with respect to PDZ domains found
in Eumetazoan signaling proteins.
Length = 79
Score = 29.3 bits (66), Expect = 0.75
Identities = 14/35 (40%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 192 GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSK 226
G++++ +V G AA GKL+ GD I+AV+G+ +
Sbjct: 9 GVYVTSVVEGMPAA--GKLKAGDHIIAVDGKPFKE 41
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 31.6 bits (71), Expect = 0.90
Identities = 17/74 (22%), Positives = 28/74 (37%), Gaps = 1/74 (1%)
Query: 365 SRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKENLELDEVREKSTPEMP 424
S + S++ + + DV K+E K + E EE+ + E D + K EM
Sbjct: 282 SLDSSTLKSAVCNDPGNIDVSKEELSEKIPELMVECRLVEEKLDTFE-DNNKNKDIMEMV 340
Query: 425 HYFQDANCVCFVNF 438
+ F F
Sbjct: 341 SKPCSSLKSLFSGF 354
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 30.0 bits (68), Expect = 1.1
Identities = 12/32 (37%), Positives = 17/32 (53%)
Query: 386 KQEEEMKKELVEWEKEEQEERKENLELDEVRE 417
++EE K L E EKE E K+ E ++E
Sbjct: 89 EKEERFMKALAEAEKERAELEKKKAEAKLMKE 120
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 28.5 bits (64), Expect = 1.2
Identities = 16/42 (38%), Positives = 23/42 (54%), Gaps = 1/42 (2%)
Query: 378 EDLDTDVFKQEEEMKKELV-EWEKEEQEERKENLELDEVREK 418
+ L+ D+ + + KKEL E EK E+EE E + REK
Sbjct: 12 DMLNADLLAKLKAKKKELKAEEEKREEEEEARKREERKEREK 53
>gnl|CDD|131391 TIGR02338, gimC_beta, prefoldin, beta subunit, archaeal.
Chaperonins are cytosolic, ATP-dependent molecular
chaperones, with a conserved toroidal architecture, that
assist in the folding of nascent and/or denatured
polypeptide chains. The group I chaperonin system
consists of GroEL and GroES, and is found (usually) in
bacteria and organelles of bacterial origin. The group
II chaperonin system, called the thermosome in Archaea
and TRiC or CCT in the Eukaryota, is structurally
similar but only distantly related. Prefoldin, also
called GimC, is a complex in Archaea and Eukaryota, that
works with group II chaperonins. Members of this protein
family are the archaeal clade of the beta class of
prefoldin subunit. Closely related, but outside the
scope of this family are the eukaryotic beta-class
prefoldin subunits, Gim-1,3,4 and 6. The alpha class
prefoldin subunits are more distantly related.
Length = 110
Score = 29.2 bits (66), Expect = 1.3
Identities = 18/55 (32%), Positives = 30/55 (54%), Gaps = 4/55 (7%)
Query: 366 RSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEE-QEERKENLELDEVREKS 419
+ +++ L+R DT V+K + LV+ +KEE +E KE E E+R K+
Sbjct: 34 KEAEKALEELERLPDDTPVYKSVGNL---LVKTDKEEAIQELKEKKETLELRVKT 85
>gnl|CDD|177681 PLN00049, PLN00049, carboxyl-terminal processing protease;
Provisional.
Length = 389
Score = 30.9 bits (70), Expect = 1.4
Identities = 22/67 (32%), Positives = 34/67 (50%), Gaps = 5/67 (7%)
Query: 199 VPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCP-EILLKIQHDPHPEGFQ 257
PGG AA +G +R GD ILA++G ++ EA L P + L ++ P +
Sbjct: 110 APGGPAARAG-IRPGDVILAIDGTSTEGLSLYEAADRLQGPEGSSVELTLRRGPET---R 165
Query: 258 EITLIRQ 264
+TL R+
Sbjct: 166 LVTLTRE 172
>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases.and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain.
Expression of a construct consisting of only the
N-terminal domain of Saccharomyces pombe Cdc37 results
in cellular viability. This indicates that interactions
with the cochaperone Hsp90 may not be essential for
Cdc37 function.
Length = 154
Score = 29.7 bits (67), Expect = 1.5
Identities = 12/33 (36%), Positives = 18/33 (54%)
Query: 376 DREDLDTDVFKQEEEMKKELVEWEKEEQEERKE 408
D E L ++ K +++KKE E K+ E KE
Sbjct: 105 DSEGLLEELKKHRDKLKKEQKELRKKLDELEKE 137
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 30.5 bits (69), Expect = 1.6
Identities = 18/84 (21%), Positives = 32/84 (38%), Gaps = 11/84 (13%)
Query: 355 IERRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEMKK---ELVEWEKEEQE----ERK 407
+E A+ SE R + + + ++ +V K EEE +K E + ++E +E +
Sbjct: 66 LEEEAAASEEER---ERLE-AEVDEATAEVAKLEEEREKKEAETRQLQQELREAQEAHER 121
Query: 408 ENLELDEVREKSTPEMPHYFQDAN 431
EL E T
Sbjct: 122 ARQELLEAAAAPTAPPHVAAPVNG 145
>gnl|CDD|226483 COG3975, COG3975, Predicted protease with the C-terminal PDZ domain
[General function prediction only].
Length = 558
Score = 30.5 bits (69), Expect = 2.0
Identities = 13/36 (36%), Positives = 19/36 (52%), Gaps = 1/36 (2%)
Query: 186 FGQHKPGIFISHIVPGGVAALSGKLRMGDRILAVNG 221
I+ + PGG A +G L GD+I+A+NG
Sbjct: 457 VKSEGGHEKITFVFPGGPAYKAG-LSPGDKIVAING 491
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 30.5 bits (68), Expect = 2.1
Identities = 17/75 (22%), Positives = 30/75 (40%)
Query: 351 ERGDIERRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKENL 410
E+ E +E ++ + E + K+ EE K E K+E EE K+
Sbjct: 1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKA 1746
Query: 411 ELDEVREKSTPEMPH 425
E + E+ ++ H
Sbjct: 1747 EEAKKDEEEKKKIAH 1761
Score = 28.6 bits (63), Expect = 8.0
Identities = 14/47 (29%), Positives = 25/47 (53%), Gaps = 10/47 (21%)
Query: 386 KQEEEMKKELVEWEKEEQEERK----------ENLELDEVREKSTPE 422
K+ EE KK+ E +K E++E+K E + +E+++K E
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEE 1714
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 29.1 bits (66), Expect = 2.2
Identities = 10/33 (30%), Positives = 17/33 (51%), Gaps = 1/33 (3%)
Query: 386 KQEEEMKKELVEWEKEEQEERKENLELDEVREK 418
K E E++K E E+ E E + L+ E ++
Sbjct: 46 KAEYELEKLEKELEELEAELARRELK-AEAKKM 77
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 30.3 bits (68), Expect = 2.3
Identities = 16/43 (37%), Positives = 27/43 (62%)
Query: 376 DREDLDTDVFKQEEEMKKELVEWEKEEQEERKENLELDEVREK 418
D E+ + + ++EEE ++E E E+EE+ E +LE E R+K
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQK 904
>gnl|CDD|204467 pfam10376, Mei5, Double-strand recombination repair protein. Mei5
is one of a pair of meiosis-specific proteins which
facilitate the loading of Dmc1 on to Rad51 on DNA at
double-strand breaks during recombination. Recombination
is carried out by a large protein complex based around
the two RecA homologues, Rad51 and Dmc1. This complex
may play both a catalytic and a structural role in the
interaction between homologous chromosomes during
meiosis. Mei5 is seen to contain a coiled-coli region.
Length = 212
Score = 29.8 bits (67), Expect = 2.4
Identities = 15/62 (24%), Positives = 28/62 (45%)
Query: 356 ERRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKENLELDEV 415
R SE + S ++R D ++E E +K +E++ +E+ + L+L E
Sbjct: 94 WRTICQSESRSILNSSSPKINRMGGYKDFKRKELEAEKRKLEYQVDEESDDLRRLKLVEK 153
Query: 416 RE 417
E
Sbjct: 154 YE 155
>gnl|CDD|223821 COG0750, COG0750, Predicted membrane-associated Zn-dependent
proteases 1 [Cell envelope biogenesis, outer membrane].
Length = 375
Score = 29.8 bits (67), Expect = 2.8
Identities = 29/121 (23%), Positives = 41/121 (33%), Gaps = 15/121 (12%)
Query: 194 FISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLKIQ----- 248
+ + P AAL+G LR GDRI+AV+G V+ +L LL I
Sbjct: 132 VVGEVAPKSAAALAG-LRPGDRIVAVDGEKVASWDDVRRLLVAAAGDVFNLLTILVIRLD 190
Query: 249 ---HDPHPEGFQEITLIRQEGEKL----GMHIKGGLRGQRGNPLDKNDEGVFITKINSGG 301
H E + + L + + G G N D G + K
Sbjct: 191 GEAHAVAAEIIKSLGLTPVVIPLKPGDKIVAVDVGAIGLSPNGEP--DVGKVLVKYGPLE 248
Query: 302 A 302
A
Sbjct: 249 A 249
>gnl|CDD|236802 PRK10942, PRK10942, serine endoprotease; Provisional.
Length = 473
Score = 29.7 bits (67), Expect = 3.1
Identities = 30/130 (23%), Positives = 55/130 (42%), Gaps = 7/130 (5%)
Query: 192 GIFISHIVPGGVAALSGKLRMGDRILAVNGRDVSKVTHEEAVLALLEPCPEILLKIQHDP 251
G F+S ++P AA +G ++ GD I ++NG+ +S A + + ++ L + D
Sbjct: 312 GAFVSQVLPNSSAAKAG-IKAGDVITSLNGKPISSFAALRAQVGTMPVGSKLTLGLLRDG 370
Query: 252 HPEGFQEITLIRQEGEKLGMHIKGGLRGQRGNPLDKNDE--GVFITKINSGGAAKRDGRL 309
P + + Q+ + + G G L GV + + G A + G L
Sbjct: 371 KP---VNVNVELQQSSQNQVDSSNIFNGIEGAELSNKGGDKGVVVDNVKPGTPAAQIG-L 426
Query: 310 KVGMRLLEVN 319
K G ++ N
Sbjct: 427 KKGDVIIGAN 436
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 29.3 bits (66), Expect = 3.5
Identities = 21/73 (28%), Positives = 35/73 (47%), Gaps = 8/73 (10%)
Query: 351 ERGDIERRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEM------KKELVEWEKEEQE 404
+R + + R S ++ S S + S SS D D D + + KKE E ++ E+E
Sbjct: 108 KRIEEDARNSDADDSDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEE 167
Query: 405 ERKENLELDEVRE 417
E+ E ++ RE
Sbjct: 168 EKAA--EEEKARE 178
>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078). This
family is found from fungi to humans, but its exact
function is not known.
Length = 88
Score = 27.6 bits (62), Expect = 3.6
Identities = 14/33 (42%), Positives = 21/33 (63%)
Query: 386 KQEEEMKKELVEWEKEEQEERKENLELDEVREK 418
K EEE K+++ E EK +E +E E +E +EK
Sbjct: 40 KDEEERKEQMEELEKAREETERERKEREERKEK 72
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 29.7 bits (67), Expect = 3.6
Identities = 12/33 (36%), Positives = 22/33 (66%)
Query: 386 KQEEEMKKELVEWEKEEQEERKENLELDEVREK 418
++ EE+++E+ E E+E + R EL+E+ EK
Sbjct: 291 ERLEELEREIEELEEELEGLRALLEELEELLEK 323
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 28.5 bits (64), Expect = 3.6
Identities = 10/35 (28%), Positives = 17/35 (48%), Gaps = 6/35 (17%)
Query: 381 DTDVF------KQEEEMKKELVEWEKEEQEERKEN 409
D + F KQ EE +K E ++ E+ E + +
Sbjct: 86 DKEKFVTSAYKKQLEENRKLEEEEKEREELEEEND 120
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 29.2 bits (66), Expect = 3.8
Identities = 15/59 (25%), Positives = 27/59 (45%), Gaps = 5/59 (8%)
Query: 365 SRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEE-RKENLELDEVREKSTPE 422
++ Q+ +L + + E K E E E+E E +KE ++ E +E+S E
Sbjct: 189 EEAILQTDQALT----AKEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQE 243
>gnl|CDD|235426 PRK05352, PRK05352, Na(+)-translocating NADH-quinone reductase
subunit A; Provisional.
Length = 448
Score = 29.4 bits (67), Expect = 4.0
Identities = 16/50 (32%), Positives = 23/50 (46%), Gaps = 12/50 (24%)
Query: 266 GEKLGMHIKGGLRGQRGNPL--DKNDEGVFIT--------KINSGGAAKR 305
G + M +K G + ++G PL DK + GV T IN G +R
Sbjct: 39 GLRPKMKVKEGDKVKKGQPLFEDKKNPGVKFTSPASGTVVAINRG--ERR 86
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 29.4 bits (66), Expect = 4.0
Identities = 16/54 (29%), Positives = 25/54 (46%), Gaps = 6/54 (11%)
Query: 355 IERRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKE 408
+R+ E + V Q + D K +EE K++L + KEE+E KE
Sbjct: 297 EQRKKKKKEMAPKVKQRFEAND------PAQKLQEERKEQLAKLRKEEKEREKE 344
>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80. Spc24 is a component of
the evolutionarily conserved kinetochore-associated
Ndc80 complex and is involved in chromosome segregation.
Length = 117
Score = 28.1 bits (63), Expect = 4.5
Identities = 15/58 (25%), Positives = 28/58 (48%), Gaps = 10/58 (17%)
Query: 377 REDLDTDVFK-------QEEE---MKKELVEWEKEEQEERKENLELDEVREKSTPEMP 424
++L+ + F+ E E ++ EL E ++ +E ++ +E DEV E TP
Sbjct: 1 IQELENEKFRLAKELNDLESELNNLQSELEELKERLEELEEQEVEGDEVDEDETPVDD 58
>gnl|CDD|219951 pfam08644, SPT16, FACT complex subunit (SPT16/CDC68). Proteins in
this family are subunits the FACT complex. The FACT
complex plays a role in transcription initiation and
promotes binding of TATA-binding protein (TBP) to a TATA
box in chromatin.
Length = 152
Score = 28.4 bits (64), Expect = 4.7
Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 5/64 (7%)
Query: 366 RSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEER--KENLELDEVREKSTPEM 423
+ +T S D + L DVFK +E++K + + E E +E+ E +L R K P +
Sbjct: 68 KELT--FRSKDSKHLQ-DVFKAIKELQKRVKKRETERKEKADVVEQEKLILNRNKRPPRL 124
Query: 424 PHYF 427
+
Sbjct: 125 KDVY 128
>gnl|CDD|217206 pfam02731, SKIP_SNW, SKIP/SNW domain. This domain is found in
chromatin proteins.
Length = 158
Score = 28.4 bits (64), Expect = 4.9
Identities = 11/30 (36%), Positives = 21/30 (70%), Gaps = 5/30 (16%)
Query: 377 REDLDTDVFKQEEEMKKELVEWEKEEQEER 406
RE++ +Q E++++L E EK+E+EE+
Sbjct: 122 REEV-----RQRAELQRQLAEKEKQEKEEK 146
>gnl|CDD|132281 TIGR03237, dnd_assoc_2, DNA phosphorothioation-dependent
restriction protein DptH. A DNA sulfur modification
(phosphorothioation) system, dnd (degradation during
electrophoresis), is sparsely and sporadically
distributed among the bacteria. This protein is one
member of a three-gene restriction enzyme cassette that
depends on DNA phosphorothioation [DNA metabolism,
Restriction/modification].
Length = 1256
Score = 29.5 bits (66), Expect = 4.9
Identities = 13/38 (34%), Positives = 18/38 (47%), Gaps = 1/38 (2%)
Query: 393 KELVEWEKEEQEER-KENLELDEVREKSTPEMPHYFQD 429
KEL W+ + +ER EN EL + + T P D
Sbjct: 193 KELAGWKGNQIKERLGENRELSDKLDFITYHFPRELHD 230
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 28.9 bits (65), Expect = 5.0
Identities = 10/34 (29%), Positives = 21/34 (61%), Gaps = 3/34 (8%)
Query: 388 EEEMKKELVEWEK---EEQEERKENLELDEVREK 418
+EE KK L EWE ++Q++++E + +++
Sbjct: 177 QEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQ 210
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 29.1 bits (66), Expect = 5.3
Identities = 9/33 (27%), Positives = 20/33 (60%)
Query: 386 KQEEEMKKELVEWEKEEQEERKENLELDEVREK 418
++ E ++EL E E+ +EE + ++E R++
Sbjct: 303 RRAAEREEELEEGERLREEEAERQARIEEERQR 335
>gnl|CDD|227448 COG5118, BDP1, Transcription initiation factor TFIIIB, Bdp1 subunit
[Transcription].
Length = 507
Score = 28.9 bits (64), Expect = 5.4
Identities = 8/39 (20%), Positives = 19/39 (48%)
Query: 380 LDTDVFKQEEEMKKELVEWEKEEQEERKENLELDEVREK 418
L + + ++ E++ E KE +E + E D+ ++
Sbjct: 435 LRSYLLEKLIELQNEHKHHMKEIEEAKNTAKEEDQTAQR 473
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 28.5 bits (64), Expect = 6.0
Identities = 16/54 (29%), Positives = 29/54 (53%), Gaps = 1/54 (1%)
Query: 386 KQEEEMKKELVEWEKEEQEERKENLELDEVREKSTPEMPHYFQDANCV-CFVNF 438
K+EEE +KE E ++EQEE ++ V E+ T ++ + + F+N+
Sbjct: 55 KKEEEERKEREEQARKEQEEYEKLKSSFVVEEEGTDKLSADEESNELLEDFINY 108
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 28.3 bits (64), Expect = 7.0
Identities = 14/41 (34%), Positives = 23/41 (56%), Gaps = 7/41 (17%)
Query: 385 FKQEEEMKKELVEWEKEEQEERKENLE-------LDEVREK 418
F+ E+E KK + + EK+ +E K+ LE +D +EK
Sbjct: 95 FEAEKEKKKAMSKEEKKAIKEEKDKLEEPYGYCLVDGRKEK 135
>gnl|CDD|236896 PRK11301, livM, leucine/isoleucine/valine transporter permease
subunit; Provisional.
Length = 419
Score = 28.4 bits (64), Expect = 7.4
Identities = 14/33 (42%), Positives = 17/33 (51%)
Query: 12 GEQAGVILAAGEVVIHPVGMRLLEVNGMSLLGA 44
G Q GVILAA +V+ P MR M + G
Sbjct: 360 GSQFGVILAAILLVVLPELMRDFNEYRMLMFGL 392
>gnl|CDD|237975 cd00001, PTS_IIB_man, PTS_IIB, PTS system, Mannose/sorbose specific
IIB subunit. The bacterial phosphoenolpyruvate: sugar
phosphotransferase system (PTS) is a multi-protein
system involved in the regulation of a variety of
metabolic and transcriptional processes. This family is
one of four structurally and functionally distinct group
IIB PTS system cytoplasmic enzymes, necessary for the
uptake of carbohydrates across the cytoplasmic membrane
and their phosphorylation. The active site histidine
receives a phosphate group from the IIA subunit and
transfers it to the substrate.
Length = 151
Score = 27.6 bits (62), Expect = 7.8
Identities = 9/24 (37%), Positives = 14/24 (58%)
Query: 289 DEGVFITKINSGGAAKRDGRLKVG 312
+ GV I IN G A R G++++
Sbjct: 92 EGGVPIKTINVGNMAFRPGKVQIT 115
>gnl|CDD|152077 pfam11641, Antigen_Bd37, Glycosylphosphatidylinositol-anchored
merozoite surface protein. This family of proteins
represents the core region of Bd37, a surface antigen of
B.divergens which is GPI-anchored at the surface of the
merozoite. The structure of the protein consists of
mainly alpha folds and has three sub domains.
Length = 224
Score = 28.3 bits (63), Expect = 7.9
Identities = 18/64 (28%), Positives = 35/64 (54%), Gaps = 3/64 (4%)
Query: 367 SVTQSMSSLDREDLDTDVFKQEEEMK-KELVEWEKEEQEERKENLEL--DEVREKSTPEM 423
+V S S D E+L T + K E++K ++ + +K+ ++ NL++ DE+ +K T
Sbjct: 93 TVESSAQSTDSEELKTLLLKFSEDLKAEQELHSDKKGSKDLLSNLKVQRDELVKKFTALT 152
Query: 424 PHYF 427
P +
Sbjct: 153 PSFL 156
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 28.1 bits (62), Expect = 8.0
Identities = 15/39 (38%), Positives = 25/39 (64%), Gaps = 2/39 (5%)
Query: 386 KQEEEMKKELVEWEKEEQ--EERKENLELDEVREKSTPE 422
+QE E +E +EWE+ E+ +ER++ E + RE+ T E
Sbjct: 132 RQEREELEERMEWERREEKIDEREDQEEQEREREEQTIE 170
>gnl|CDD|217751 pfam03830, PTSIIB_sorb, PTS system sorbose subfamily IIB component.
Length = 151
Score = 27.4 bits (62), Expect = 8.1
Identities = 6/24 (25%), Positives = 14/24 (58%)
Query: 289 DEGVFITKINSGGAAKRDGRLKVG 312
+ GV I ++N G ++G+ ++
Sbjct: 93 EGGVPIKELNVGNMHFKEGKKQIT 116
>gnl|CDD|177876 PLN02231, PLN02231, alanine transaminase.
Length = 534
Score = 28.4 bits (63), Expect = 8.3
Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 3/62 (4%)
Query: 357 RRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKEN---LELD 413
R +STSE S S + S + + ++ V K E ++ E+V + QEE K N D
Sbjct: 39 RFSSTSEMSASDSTSSLPVTLDTINPKVLKCEYAVRGEIVTIAQRLQEELKTNPGSYPFD 98
Query: 414 EV 415
E+
Sbjct: 99 EI 100
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 28.0 bits (63), Expect = 8.4
Identities = 15/41 (36%), Positives = 24/41 (58%), Gaps = 7/41 (17%)
Query: 385 FKQEEEMKKELVEWEKEEQEERKENLE-------LDEVREK 418
F++E+E KK + + EK+ +E KE LE +D +EK
Sbjct: 94 FEEEKEKKKAMSKEEKKAIKEEKEKLEEPYGYCLVDGHKEK 134
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 27.5 bits (62), Expect = 8.5
Identities = 12/36 (33%), Positives = 18/36 (50%)
Query: 392 KKELVEWEKEEQEERKENLELDEVREKSTPEMPHYF 427
KKEL WEK E++ K + ++ + K P F
Sbjct: 65 KKELKAWEKAEKKAEKAKAKAEKKKAKKEEPKPRLF 100
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 28.2 bits (63), Expect = 8.9
Identities = 10/34 (29%), Positives = 17/34 (50%), Gaps = 2/34 (5%)
Query: 387 QEEEMKKELVEWEKEEQ--EERKENLELDEVREK 418
+ E K+E + E+ + E+K L+ REK
Sbjct: 139 EIIEKKRENNKNEERLKFENEKKLEESLELEREK 172
>gnl|CDD|235083 PRK02889, tolB, translocation protein TolB; Provisional.
Length = 427
Score = 28.4 bits (64), Expect = 9.2
Identities = 16/39 (41%), Positives = 21/39 (53%), Gaps = 1/39 (2%)
Query: 306 DGRLKVGMRLLE-VNGMSLLGASHQEAVNTLRYSGHQIS 343
DGR V RL + V G SL G S + LR + H+I+
Sbjct: 110 DGRYDVRFRLYDTVKGQSLGGLSLTVTEDQLRLAAHRIA 148
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 28.5 bits (64), Expect = 9.4
Identities = 12/63 (19%), Positives = 22/63 (34%)
Query: 356 ERRASTSEGSRSVTQSMSSLDREDLDTDVFKQEEEMKKELVEWEKEEQEERKENLELDEV 415
E + + + +L E + + +E E +E KEE EE+ L +
Sbjct: 316 NELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELE 375
Query: 416 REK 418
Sbjct: 376 ELF 378
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 27.6 bits (62), Expect = 9.6
Identities = 14/35 (40%), Positives = 21/35 (60%), Gaps = 3/35 (8%)
Query: 386 KQEEEMKKELVEWEKEEQEERKENL--ELDEVREK 418
K+ E+ K L +WE E E+R E + E DE+ +K
Sbjct: 96 KELEKELKNL-KWESEVLEQRFEKVERERDELYDK 129
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.134 0.385
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,111,145
Number of extensions: 2172780
Number of successful extensions: 3178
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3060
Number of HSP's successfully gapped: 130
Length of query: 439
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 339
Effective length of database: 6,502,202
Effective search space: 2204246478
Effective search space used: 2204246478
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (27.0 bits)