RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4283
(210 letters)
>gnl|CDD|191619 pfam06840, DUF1241, Protein of unknown function (DUF1241). This
family consists of several programmed cell death 10
protein (PDCD10 or TFAR15) sequences. The function of
this family is unknown.
Length = 154
Score = 215 bits (550), Expect = 3e-72
Identities = 98/158 (62%), Positives = 122/158 (77%), Gaps = 5/158 (3%)
Query: 3 MGEESP-VTSLILPVLFRPILSQLEKRDVQASQTLRAALSRTETANPGFAYDFVVGIMRQ 61
MGEE+ V S+ L V+ RP+ S+LE+RDV A+Q LRAA ++ E NPGF YD V+GI+R+
Sbjct: 1 MGEETTSVVSMALQVIIRPVFSKLERRDVSAAQKLRAAFTKAEKENPGFTYDLVMGILRK 60
Query: 62 GELSVNMNESILRLQGAASDVDAVEYKLNRSEDAFQELNKKSCALKKILSRIPDEITDRK 121
+L VN+NES+LR+ G ++ EY+L R+E FQELNKK+ ALK ILSRIPDEI DRK
Sbjct: 61 KDLEVNLNESLLRMAGDDTE----EYRLTRTEPEFQELNKKARALKTILSRIPDEINDRK 116
Query: 122 TFLETIKEIASAIKKLLDAVNEVSGFIPGPSNKQALEQ 159
FLETIK+IASAIKKLLDAVNEV FIPG + K+ALE
Sbjct: 117 AFLETIKDIASAIKKLLDAVNEVYKFIPGYTGKKALEH 154
>gnl|CDD|217892 pfam04100, Vps53_N, Vps53-like, N-terminal. Vps53 complexes with
Vps52 and Vps54 to form a multi- subunit complex
involved in regulating membrane trafficking events.
Length = 375
Score = 34.7 bits (80), Expect = 0.021
Identities = 18/76 (23%), Positives = 38/76 (50%), Gaps = 5/76 (6%)
Query: 67 NMNESILRLQGAASDVDA-----VEYKLNRSEDAFQELNKKSCALKKILSRIPDEITDRK 121
N++ I ++Q +DA V + N E + L++ A++++ +I + + +
Sbjct: 22 NIDSLIQKIQEEIRRLDANIKTVVRGQANVGEKGKEALDEAQKAIEELFQKIKEIKSKAE 81
Query: 122 TFLETIKEIASAIKKL 137
+ E +KEI IK+L
Sbjct: 82 STEEMVKEITRDIKQL 97
>gnl|CDD|219368 pfam07299, FBP, Fibronectin-binding protein (FBP). This family
consists of several bacterial fibronectin-binding
proteins which are thought to be involved in virulence
in Listeria species.
Length = 208
Score = 30.3 bits (69), Expect = 0.54
Identities = 15/58 (25%), Positives = 30/58 (51%), Gaps = 4/58 (6%)
Query: 102 KSCALKKILSRIPDEITDRKTFLETIKEIASA--IKKLLDAVNE-VSGFIPGPSNKQA 156
K+ A++KIL P+ ++K L+ + + + +K L+ + V F P P+ +Q
Sbjct: 34 KALAIEKILDLFPELSPEQKELLDKLLTVRTRAEAEKFLEELKPYVIPF-PSPTTQQL 90
>gnl|CDD|180828 PRK07081, PRK07081, acyl carrier protein; Provisional.
Length = 83
Score = 28.4 bits (64), Expect = 0.88
Identities = 13/28 (46%), Positives = 19/28 (67%), Gaps = 1/28 (3%)
Query: 113 IPDEITDRKTFLETIKEIASAIKKLLDA 140
IPDE+ +RK F +I +A A+ +L DA
Sbjct: 53 IPDEMLNRKLF-ASIDALAGAVTQLQDA 79
>gnl|CDD|237459 PRK13655, PRK13655, phosphoenolpyruvate carboxylase; Provisional.
Length = 494
Score = 29.9 bits (68), Expect = 0.89
Identities = 26/86 (30%), Positives = 34/86 (39%), Gaps = 21/86 (24%)
Query: 84 AVEYKLNRSE--DAFQELNKKSCALKKILSRIPDEITDRKTFLETIKEIASA-----IKK 136
A Y E A +E+N+K A +ILS E L I E S I+K
Sbjct: 275 AFRYDYPYEEVKKAIEEINEKLIAPPRILSEEDKEE------LLEIIEKYSERYQSQIEK 328
Query: 137 LLDAVNEVSGFIPGPSNKQALEQRKR 162
L D +N V+ IP +R R
Sbjct: 329 LADTINRVAKLIPK--------RRDR 346
>gnl|CDD|224170 COG1250, FadB, 3-hydroxyacyl-CoA dehydrogenase [Lipid metabolism].
Length = 307
Score = 29.5 bits (67), Expect = 1.1
Identities = 14/36 (38%), Positives = 18/36 (50%), Gaps = 1/36 (2%)
Query: 113 IPDEITDRKTFLETIKEIASAIKKLLDAVNEVSGFI 148
I E T +T +E + E A I K V +V GFI
Sbjct: 154 IRGEKTSDET-VERVVEFAKKIGKTPVVVKDVPGFI 188
>gnl|CDD|218489 pfam05192, MutS_III, MutS domain III. This domain is found in
proteins of the MutS family (DNA mismatch repair
proteins) and is found associated with pfam00488,
pfam05188, pfam01624 and pfam05190. The MutS family of
proteins is named after the Salmonella typhimurium MutS
protein involved in mismatch repair; other members of
the family included the eukaryotic MSH 1,2,3, 4,5 and 6
proteins. These have various roles in DNA repair and
recombination. Human MSH has been implicated in
non-polyposis colorectal carcinoma (HNPCC) and is a
mismatch binding protein. The aligned region corresponds
with domain III, which is central to the structure of
Thermus aquaticus MutS as characterized in.
Length = 290
Score = 29.3 bits (66), Expect = 1.5
Identities = 18/83 (21%), Positives = 38/83 (45%), Gaps = 7/83 (8%)
Query: 83 DAVEYKLNRSEDAFQELNKKSCALKKILSRI------PDEITDRKTFLETIKEIASAIKK 136
DAVE L + +E K L+++LSR+ P ++ ++ LE + + +
Sbjct: 56 DAVEELLEDLREELREALKGIPDLERLLSRLALGRASPRDLLALRSSLEALPALRKLLAS 115
Query: 137 LLDAVNEVSGFIPGPSNKQALEQ 159
+ +++ +P P + LE+
Sbjct: 116 -APLLADLADQLPLPELLELLER 137
>gnl|CDD|165303 PHA03002, PHA03002, Hypothetical protein; Provisional.
Length = 679
Score = 29.3 bits (66), Expect = 1.7
Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 3/57 (5%)
Query: 87 YKLNRSEDAFQELNKKSCALKKILSRIPDEITDRKTFLETIKEIASA-IKKLLDAVN 142
KL E A EL KK I++ I D+I KTF+E I EI + +L +N
Sbjct: 196 LKLTDEEKAI-ELVKKY-PNDNIINYISDDIKYNKTFIEKIHEIVNENFPNMLTGLN 250
>gnl|CDD|221232 pfam11802, CENP-K, Centromere-associated protein K. CENP-K is one
of seven new CENP-A-nucleosome distal (CAD) centromere
components (the others being CENP-L, CENP-O, CENP-P,
CENP-Q, CENP-R and CENP-S) that are identified as
assembling on the CENP-A nucleosome associated complex,
NAC. The CENP-A NAC is essential, as disruption of the
complex causes errors of chromosome alignment and
segregation that preclude cell survival despite
continued centromere-derived mitotic checkpoint
signalling. CENP-K is centromere-associated through its
interaction with one or more components of the CENP-A
NAC.
Length = 273
Score = 28.7 bits (64), Expect = 1.9
Identities = 17/92 (18%), Positives = 40/92 (43%), Gaps = 7/92 (7%)
Query: 97 QELNKKSCALKKILSRIPDEITDRKTFLETIKEIASAIKKLLDAVNEVSGFIPGPSNK-- 154
+EL K L+ +LS + + K LE ++ +++LD++NE +
Sbjct: 97 EELQKLRQDLEMVLSSVQSKNEKLKEDLEREQQWLDEQQQILDSLNERQSELKNQVVTFS 156
Query: 155 -----QALEQRKREFVKFSKRFSNTLKDYFKE 181
Q L+ + ++ ++ + L ++ +E
Sbjct: 157 ESRIFQELKTKMLRIKEYKEKLLSALGEFLEE 188
>gnl|CDD|233920 TIGR02548, casB_cse2, CRISPR type I-E/ECOLI-associated protein
CasB/Cse2. CRISPR is a term for Clustered, Regularly
Interspaced Short Palidromic Repeats. A number of
protein families appear only in association with these
repeats and are designated Cas (CRISPR-Associated)
proteins. This model family is found in Ecoli subtype
CRISPR/Cas regions of many bacteria, most of which are
mesophiles, and not in Archaea. It was designated Cse2
originally, and renamed CasB based on its
characterization in the CASCADE complex [Mobile and
extrachromosomal element functions, Other].
Length = 159
Score = 28.2 bits (63), Expect = 2.2
Identities = 12/45 (26%), Positives = 19/45 (42%), Gaps = 3/45 (6%)
Query: 22 LSQLEKRDVQASQTLRAALSRTETANP---GFAYDFVVGIMRQGE 63
L +L++RD A LRA L R +P F + ++
Sbjct: 5 LERLDERDPGARAALRAGLRRCAGPDPELLSFVFPRLLKGDWPTP 49
>gnl|CDD|222493 pfam14010, PEPcase_2, Phosphoenolpyruvate carboxylase. This family
of phosphoenolpyruvate carboxylases is based on
seqeunces not picked up by the model for PEPcase,
PF00311. Most of the family members are from Archaea.
Length = 476
Score = 28.8 bits (65), Expect = 2.3
Identities = 18/65 (27%), Positives = 29/65 (44%), Gaps = 15/65 (23%)
Query: 105 ALKKILSRI---PDEITDRKTFLETI----KEIASAIKKLLDAVNEVSGFIPGPSNKQAL 157
A++KI P +++ + L I + S I+KL D +N V+ FIP
Sbjct: 284 AIEKINEAERKKPRILSEEEEVLNIIEKYSEHYRSQIEKLADTINRVASFIPR------- 336
Query: 158 EQRKR 162
+R R
Sbjct: 337 -RRDR 340
>gnl|CDD|214710 smart00533, MUTSd, DNA-binding domain of DNA mismatch repair MUTS
family.
Length = 308
Score = 28.4 bits (64), Expect = 2.4
Identities = 22/81 (27%), Positives = 37/81 (45%), Gaps = 15/81 (18%)
Query: 65 SVNMNESILRLQGAASDVDAVEYKLNRSE--DAFQELNKKSCALKKILSRI------PDE 116
+++ E RL DAVE + E ++L K+ L+++LSRI P +
Sbjct: 28 LLDLKEINERL-------DAVEELVENPELRQKLRQLLKRIPDLERLLSRIERGRASPRD 80
Query: 117 ITDRKTFLETIKEIASAIKKL 137
+ LE +KEI ++ L
Sbjct: 81 LLRLYDSLEGLKEIRQLLESL 101
>gnl|CDD|177499 PHA02951, PHA02951, Hypothetical protein; Provisional.
Length = 337
Score = 28.3 bits (63), Expect = 2.7
Identities = 12/50 (24%), Positives = 22/50 (44%)
Query: 94 DAFQELNKKSCALKKILSRIPDEITDRKTFLETIKEIASAIKKLLDAVNE 143
D++ + K++ I I + T+ T ++ IK LLD +E
Sbjct: 250 DSYADTIKENLLKYTIQKDINIKHTNITTLWNETTKLIKEIKSLLDKNHE 299
>gnl|CDD|185272 PRK15374, PRK15374, pathogenicity island 1 effector protein SipB;
Provisional.
Length = 593
Score = 28.4 bits (63), Expect = 3.3
Identities = 28/123 (22%), Positives = 48/123 (39%), Gaps = 9/123 (7%)
Query: 22 LSQLEKRDVQASQTLRAALSRTETANPGFAYDFVVGIMRQGELSVNMNESILRLQGAASD 81
LSQLE R +A + + + +F + E + SI + A S
Sbjct: 101 LSQLESR----LAVWQAMIESQKEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSV 156
Query: 82 VDAVEYKLNRSEDAFQELNKKSCALKK---ILSRIPDEITDRKTFLETIKEIASAIKKLL 138
DA E KL ++++ Q L+ + + + E T+ K L+ K + +K
Sbjct: 157 YDAAEKKLTQAQNKLQSLDPADPGYAQAEAAVEQAGKEATEAKEALD--KATDATVKAGT 214
Query: 139 DAV 141
DA
Sbjct: 215 DAK 217
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 28.1 bits (63), Expect = 3.8
Identities = 17/104 (16%), Positives = 41/104 (39%), Gaps = 2/104 (1%)
Query: 74 RLQGAASDVDAVEYKLNRSEDAFQELNKKSCALKKILSRIPDEITDRKTFLETIKEIASA 133
R+ + +++ L + + A EL + + ++ + T +++ +
Sbjct: 300 RITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDK 359
Query: 134 IKKLLDAVNEVSGFIPGPSNKQALEQRKREFVKFSKRFSNTLKD 177
KK+ A+ E+ N + L + + E K K S +K+
Sbjct: 360 AKKVKAAIEELQA--EFVDNAEELAKLQDELDKIVKTKSELVKE 401
>gnl|CDD|240111 cd04794, euk_LANCL, eukaryotic Lanthionine synthetase C-like
protein. This family contains the lanthionine synthetase
C-like proteins 1 and 2 which are related to the
bacterial lanthionine synthetase components C (LanC).
LANCL1 and LANCL2 (testes-specific adriamycin
sensitivity protein) are thought to be peptide-modifying
enzyme components in eukaryotic cells. Both proteins are
produced in large quantities in the brain and testes and
may have role in the immune surveillance of these
organs.
Length = 343
Score = 27.6 bits (62), Expect = 4.5
Identities = 17/105 (16%), Positives = 35/105 (33%), Gaps = 18/105 (17%)
Query: 110 LSRIPDEITDRKTFLETIKEIASAIKKLLDAVNEVSGFIPGPSNKQALEQRKREFVKFSK 169
L +P +T+ +T+L E A + + +S G + A+ ++K
Sbjct: 14 LYEVPPLLTNDQTYLSWALEYIKASLRAIPDDRRISFLC-GNAGVYAVGAAVYHYLKDED 72
Query: 170 RFSNTLKDYFKEGQANSV----------------YVSALYLIHQT 198
L F + + V Y+ AL +++
Sbjct: 73 EVDEFLSK-FLQLLESVVSPDSGLPDELLYGRAGYLYALLFLNKK 116
>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
Provisional.
Length = 693
Score = 27.7 bits (63), Expect = 5.3
Identities = 14/47 (29%), Positives = 23/47 (48%), Gaps = 2/47 (4%)
Query: 93 EDAFQELNKK--SCALKKILSRIPDEITDRKTFLETIKEIASAIKKL 137
++A+Q +K+ AL I + + + + E KEI A KKL
Sbjct: 254 KEAYQITDKQEREAALDAIKEEVLEALAAEEEEEEDEKEIKEAFKKL 300
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 27.7 bits (62), Expect = 5.4
Identities = 20/96 (20%), Positives = 45/96 (46%), Gaps = 3/96 (3%)
Query: 88 KLNRSEDAFQELNKKSCALKKILSRIPDEITDRKTFLETIKEIASAIKKLLDAVNEVSGF 147
+L + + +++L +K LK + + E+ + + + E+ + +L + + E+
Sbjct: 519 ELEKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKE 578
Query: 148 IP--GPSNKQALEQRKREFVKFSKRFSNTLKDYFKE 181
+ G + + LE+R +E F + LKD KE
Sbjct: 579 LEELGFESVEELEERLKELEPFYNEY-LELKDAEKE 613
>gnl|CDD|200466 cd11327, AmyAc_Glg_debranch_2, Alpha amylase catalytic domain found
in glycogen debranching enzymes. Debranching enzymes
facilitate the breakdown of glycogen through
glucosyltransferase and glucosidase activity. These
activities are performed by a single enzyme in mammals,
yeast, and some bacteria, but by two distinct enzymes in
Escherichia coli and other bacteria. Debranching enzymes
perform two activities, 4-alpha-D-glucanotransferase (EC
2.4.1.25) and amylo-1,6-glucosidase (EC 3.2.1.33).
4-alpha-D-glucanotransferase catalyzes the
endohydrolysis of 1,6-alpha-D-glucoside linkages at
points of branching in chains of 1,4-linked
alpha-D-glucose residues. Amylo-alpha-1,6-glucosidase
catalyzes the endohydrolysis of 1,6-alpha-D-glucoside
linkages at points of branching in chains of 1,4-linked
alpha-D-glucose residues. The catalytic triad (DED),
which is highly conserved in other debranching enzymes,
is not present in this group. The Alpha-amylase family
comprises the largest family of glycoside hydrolases
(GH), with the majority of enzymes acting on starch,
glycogen, and related oligo- and polysaccharides. These
proteins catalyze the transformation of alpha-1,4 and
alpha-1,6 glucosidic linkages with retention of the
anomeric center. The protein is described as having 3
domains: A, B, C. A is a (beta/alpha) 8-barrel; B is a
loop between the beta 3 strand and alpha 3 helix of A; C
is the C-terminal extension characterized by a Greek
key. The majority of the enzymes have an active site
cleft found between domains A and B where a triad of
catalytic residues (Asp, Glu and Asp) performs
catalysis. Other members of this family have lost the
catalytic activity as in the case of the human 4F2hc, or
only have 2 residues that serve as the catalytic
nucleophile and the acid/base, such as Thermus A4
beta-galactosidase with 2 Glu residues (GH42) and human
alpha-galactosidase with 2 Asp residues (GH31). The
family members are quite extensive and include: alpha
amylase, maltosyltransferase, cyclodextrin
glycotransferase, maltogenic amylase, neopullulanase,
isoamylase, 1,4-alpha-D-glucan maltotetrahydrolase,
4-alpha-glucotransferase, oligo-1,6-glucosidase,
amylosucrase, sucrose phosphorylase, and amylomaltase.
Length = 478
Score = 27.6 bits (62), Expect = 5.7
Identities = 10/36 (27%), Positives = 16/36 (44%)
Query: 65 SVNMNESILRLQGAASDVDAVEYKLNRSEDAFQELN 100
+V+M ++ D + +E L R A ELN
Sbjct: 249 TVDMEKAAEIFNSHRGDEERIEECLERFRKALDELN 284
>gnl|CDD|147776 pfam05804, KAP, Kinesin-associated protein (KAP). This family
consists of several eukaryotic kinesin-associated (KAP)
proteins. Kinesins are intracellular multimeric
transport motor proteins that move cellular cargo on
microtubule tracks. It has been shown that the sea
urchin KRP85/95 holoenzyme associates with a KAP115
non-motor protein, forming a heterotrimeric complex in
vitro, called the Kinesin-II.
Length = 708
Score = 27.5 bits (61), Expect = 5.8
Identities = 18/55 (32%), Positives = 31/55 (56%), Gaps = 7/55 (12%)
Query: 82 VDAVEYKLNRSEDAFQELNKKSCALKKILSRIPDEITDRKTFLETIKEIASAIKK 136
++ +E++L R E +EL+KK KK+ P +K + ET+K++ IKK
Sbjct: 210 MNVIEHELKRYEFWIEELDKK----KKMNEEKP---ILKKDYEETLKKLKGLIKK 257
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 27.3 bits (61), Expect = 8.0
Identities = 23/125 (18%), Positives = 54/125 (43%), Gaps = 7/125 (5%)
Query: 24 QLEKRDVQASQT-----LRAALSRTETANPGFAYDFVVGIMRQGELSVNMNESILRLQGA 78
QL+ + QA + L+A L E A + + + + + + E L+
Sbjct: 201 QLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKEAEE--ELEEL 258
Query: 79 ASDVDAVEYKLNRSEDAFQELNKKSCALKKILSRIPDEITDRKTFLETIKEIASAIKKLL 138
+++ +E KL EL ++ L+K L + +EI+ + + ++E + +++ L
Sbjct: 259 TAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQL 318
Query: 139 DAVNE 143
+ +
Sbjct: 319 EELEA 323
>gnl|CDD|218128 pfam04523, Herpes_U30, Herpes virus tegument protein U30. This
family is named after the human herpesvirus protein, but
has been characterized in cytomegalovirus as UL47.
Cytomegalovirus UL47 is a component of the tegument,
which is a protein layer surrounding the viral capsid.
UL47 co-precipitates with UL48 and UL69 tegument
proteins, and the major capsid protein UL86. A
UL47-containing complex is thought to be involved in the
release of viral DNA from the disassembling virus
particle.
Length = 883
Score = 26.9 bits (60), Expect = 9.3
Identities = 14/59 (23%), Positives = 25/59 (42%), Gaps = 6/59 (10%)
Query: 58 IMRQGELSVNMNESILRLQGAASDVDAV---EYKLNRSEDAFQELNKKSCALKKILSRI 113
+R ELS + + +RL D++ + ++ S FQE+ + K SR
Sbjct: 713 EVRLNELSGELQTAYIRLLQILEDIEKIKTHPIQITDSTPNFQEIKE---TYLKCFSRY 768
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 26.8 bits (59), Expect = 9.8
Identities = 13/50 (26%), Positives = 23/50 (46%)
Query: 68 MNESILRLQGAASDVDAVEYKLNRSEDAFQELNKKSCALKKILSRIPDEI 117
+ + I L+ S++D +E KL S + + K+ +K S EI
Sbjct: 171 LKDVIDMLRAEISNIDYLEEKLKSSNLELENIKKQIADDEKSHSITLKEI 220
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.131 0.349
Gapped
Lambda K H
0.267 0.0700 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,322,687
Number of extensions: 966248
Number of successful extensions: 1422
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1415
Number of HSP's successfully gapped: 85
Length of query: 210
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 117
Effective length of database: 6,812,680
Effective search space: 797083560
Effective search space used: 797083560
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 57 (25.8 bits)