RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy7416
(381 letters)
>gnl|CDD|221323 pfam11931, DUF3449, Domain of unknown function (DUF3449). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 181 to 207 amino acids in length. This domain
has two conserved sequence motifs: PIP and CEICG. The
domain carries a zinc-finger domain of the C2H2-type.
Length = 187
Score = 282 bits (723), Expect = 2e-95
Identities = 109/184 (59%), Positives = 136/184 (73%), Gaps = 9/184 (4%)
Query: 176 LEAQIYKLAEQVSEQRSATKDNVQRKQARGEGER---------GDSEESEAGDTSDEDDE 226
LE +I KLAE + ++R T +NV+RKQA E ER SE+ D+ D+++E
Sbjct: 3 LEFKIEKLAELLKKEREDTIENVERKQALTEEERQASADESSEDASEDGSDDDSDDDEEE 62
Query: 227 VPYNPKNLPLGWDGKPIPYWLYKLHGLNISYTCEICGNFVYKGPKAFQRHFAEWRHAHGM 286
YNP NLPLGWDGKPIPYWLYKLHGL + CEICGN YKG KAF++HF+EWRHA+G+
Sbjct: 63 PIYNPLNLPLGWDGKPIPYWLYKLHGLGKEFKCEICGNTSYKGRKAFEKHFSEWRHAYGL 122
Query: 287 RCLGIPNTAHFANVTQIEDALALWEKLKAQKQEERWQPEQEEEFEDSLGNVVNRKTYEDL 346
+CLGIPNT HF +T IE+AL LWEKLK Q + +R++PE EEE ED GNV+++KTYEDL
Sbjct: 123 KCLGIPNTRHFKEITSIEEALELWEKLKKQTKRKRFEPENEEEVEDEEGNVMSKKTYEDL 182
Query: 347 KRQG 350
KRQG
Sbjct: 183 KRQG 186
Score = 50.3 bits (121), Expect = 2e-07
Identities = 21/56 (37%), Positives = 34/56 (60%)
Query: 46 LEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDEDDEKEAGS 101
LE +I KLAE + ++R T +NV+RKQA E ER S + + D S++ + ++
Sbjct: 3 LEFKIEKLAELLKKEREDTIENVERKQALTEEERQASADESSEDASEDGSDDDSDD 58
>gnl|CDD|227515 COG5188, PRP9, Splicing factor 3a, subunit 3 [RNA processing and
modification].
Length = 470
Score = 161 bits (408), Expect = 4e-45
Identities = 75/204 (36%), Positives = 108/204 (52%), Gaps = 18/204 (8%)
Query: 165 KQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDS------------ 212
K+ + ++E E +++ + + + S T+ V R A ER
Sbjct: 265 KEGQGKEEFVYSEYVLHRYLKYLGDPVSETRSLVLRSLAITAKERKAEISLLSRRKKQPA 324
Query: 213 ------EESEAGDTSDEDDEVPYNPKNLPLGWDGKPIPYWLYKLHGLNISYTCEICGNFV 266
E +E D D+ V ++PLG DG P+P WL KLHGL+I + CEICGN+V
Sbjct: 325 TKSSEKEGAEQVDGEQRDEHVSGKSFDMPLGPDGLPMPRWLCKLHGLDIEFECEICGNYV 384
Query: 267 YKGPKAFQRHFAEWRHAHGMRCLGIPNTAHFANVTQIEDALALWEKLKAQKQEERWQPEQ 326
Y G F RHF E RH +G+ CLGI + F +T+I +A+ LW +++ + E
Sbjct: 385 YYGRDRFDRHFEEDRHIYGLECLGIKPSRVFKGITRIGEAMKLWNRMEESSSSLKVPTEY 444
Query: 327 EEEFEDSLGNVVNRKTYEDLKRQG 350
EEFED GNV+++K YEDLKRQG
Sbjct: 445 SEEFEDEEGNVMSKKVYEDLKRQG 468
>gnl|CDD|205477 pfam13297, Telomere_Sde2_2, Telomere stability C-terminal. This
short C-terminal domain is found in higher eukaryotes
further downstream from the Sde2 family, pfam13019. It
is found in all Sde2-related proteins except those from
fission yeast, fly, and mosquito. Its exact function in
telomere formation and maintenance has not yet been
established.
Length = 60
Score = 67.2 bits (165), Expect = 2e-14
Identities = 33/60 (55%), Positives = 35/60 (58%), Gaps = 14/60 (23%)
Query: 110 LDLSAFSSWEELASLGLDRLK------------TLEERAQRLFSTKGKTA--LDPSLLAK 155
LDL AFSS EEL +LGLDRLK TL+ERA RLFS KG LD L AK
Sbjct: 1 LDLDAFSSAEELEALGLDRLKAALMALGLKCGGTLQERAARLFSLKGLPLEELDKKLFAK 60
Score = 40.6 bits (96), Expect = 4e-05
Identities = 16/27 (59%), Positives = 17/27 (62%), Gaps = 2/27 (7%)
Query: 1 TLEERAQRLFSTKGKTA--LDPSLLAK 25
TL+ERA RLFS KG LD L AK
Sbjct: 34 TLQERAARLFSLKGLPLEELDKKLFAK 60
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 39.6 bits (92), Expect = 0.003
Identities = 38/194 (19%), Positives = 72/194 (37%), Gaps = 5/194 (2%)
Query: 36 QQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDEDD 95
ER+ E G K A+ E + ++ +A G + G+ E E G+ +++
Sbjct: 686 PAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEG 745
Query: 96 EKEA-GSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLEERAQRLFSTKGKTALDPSLLA 154
E EA G + E + E +A KG + +
Sbjct: 746 EGEAEGKHEVETEGDRKETEHEGETEAEGKEDE--DEGEIQAGEDGEMKGDEGAEGKVEH 803
Query: 155 KSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQ--VSEQRSATKDNVQRKQARGEGERGDS 212
+ + + G K + + + + A + EQ +E + K + + G + GDS
Sbjct: 804 EGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDS 863
Query: 213 EESEAGDTSDEDDE 226
EE E + +E++E
Sbjct: 864 EEEEEEEEEEEEEE 877
Score = 34.2 bits (78), Expect = 0.12
Identities = 19/88 (21%), Positives = 39/88 (44%), Gaps = 2/88 (2%)
Query: 13 KGKTALDPSLLAKSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQ--VSEQRSATKDNVQR 70
KG + + + + + G K + + + + A + EQ +E + K + +
Sbjct: 792 KGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKG 851
Query: 71 KQARGEGERGDSEESEAGDTSDEDDEKE 98
G + GDSEE E + +E++E+E
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEEEE 879
>gnl|CDD|131925 TIGR02878, spore_ypjB, sporulation protein YpjB. Members of this
protein, YpjB, family are restricted to a subset of
endospore-forming bacteria, including Bacillus species
but not CLostridium or some others. In Bacillus
subtilis, ypjB was found to be part of the sigma-E
regulon, where sigma-E is a sporulation sigma factor
that regulates expression in the mother cell
compartment. Null mutants of ypjB show a sporulation
defect. This protein family is not, however, a part of
the endospore formation minimal gene set [Cellular
processes, Sporulation and germination].
Length = 233
Score = 34.4 bits (79), Expect = 0.061
Identities = 13/52 (25%), Positives = 21/52 (40%), Gaps = 1/52 (1%)
Query: 103 LTHVGAHLD-LSAFSSWEELASLGLDRLKTLEERAQRLFSTKGKTALDPSLL 153
+ V +HL L F + ++L + + LF + DPSLL
Sbjct: 150 VQRVDSHLSYLENFRFQQRSEDEKEEQLSLMRGDLKALFDGVKEDEADPSLL 201
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 33.9 bits (78), Expect = 0.15
Identities = 32/186 (17%), Positives = 65/186 (34%), Gaps = 21/186 (11%)
Query: 34 SKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDE 93
S ER++EI LE +I +L E+++E + R E E + E + +E
Sbjct: 670 SSILERRREIEELEEKIEELEEKIAELE------KALAELRKELEELEEELEQLRKELEE 723
Query: 94 DDEKEAGSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLEERAQRLFSTKGKTALDPSLL 153
+ DL+ + E + +L + L+ L
Sbjct: 724 LSRQ-------ISALRKDLARLEAEVEQLEERIAQLSKELTELE-----AEIEELEERLE 771
Query: 154 AKSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQVSEQR---SATKDNVQRKQARGEGERG 210
+ +++ +E + +I L+ ++ L E + E R + + + R E
Sbjct: 772 EAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLER 831
Query: 211 DSEESE 216
+E
Sbjct: 832 RIAATE 837
Score = 32.7 bits (75), Expect = 0.39
Identities = 38/197 (19%), Positives = 81/197 (41%), Gaps = 12/197 (6%)
Query: 30 KGLKSKQQERQKEIAGLEAQIYKLAEQVSEQR---SATKDNVQRKQARGEGE--RGDSEE 84
+++ +E + +I L+ ++ L E + E R + + + R E R + E
Sbjct: 778 AEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATE 837
Query: 85 SEAGDTSDE-DDEKEAGSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLEERAQRLFSTK 143
D ++ ++ E L L+ EL +L L+ +LEE L S
Sbjct: 838 RRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEAL-LNERASLEEALALLRSEL 896
Query: 144 GK--TALDPSLLAKSKPDKGLKSKQQERQK---EIAGLEAQIYKLAEQVSEQRSATKDNV 198
+ L +S+ + L+ +++ + + GLE +I L E++SE+ S T +
Sbjct: 897 EELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQERLSEEYSLTLEEA 956
Query: 199 QRKQARGEGERGDSEES 215
+ + + E + ++
Sbjct: 957 EALENKIEDDEEEARRR 973
Score = 30.4 bits (69), Expect = 1.8
Identities = 44/214 (20%), Positives = 92/214 (42%), Gaps = 17/214 (7%)
Query: 2 LEERAQRLFSTKGKTA-LDPSLLAKSKPDKGLKSKQQERQKEIAGLEAQIYKLAE---QV 57
+EE + L++ + + L+ + L+ + +E + ++ LE+++ +LAE ++
Sbjct: 283 IEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAEL 342
Query: 58 SEQRSATKDNVQRKQARGEGERGDSEESEAG-DTSDEDDEKEAGSVLTHVGAHLDLSAFS 116
E+ K+ ++ +A E + EE E+ + +E E V L+
Sbjct: 343 EEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNN-- 400
Query: 117 SWEELASLGLDRLKTLEERAQRLFSTKGKTALDPSLLAKSKPDKGLKSKQQER---QKEI 173
E+ L RL+ LE+R +RL + + L+ ++E Q+E+
Sbjct: 401 ---EIERLE-ARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEEL 456
Query: 174 AGLEAQIYKLAEQVSEQRSA---TKDNVQRKQAR 204
LE + +L E++ E A + + + QAR
Sbjct: 457 ERLEEALEELREELEEAEQALDAAERELAQLQAR 490
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 33.8 bits (78), Expect = 0.17
Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 7/52 (13%)
Query: 29 DKGLKSKQQERQKEIAGLEAQIYKLAEQV-------SEQRSATKDNVQRKQA 73
DK K K +E ++I L+ +I + +Q+ EQR +N+ RKQ
Sbjct: 169 DKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQN 220
Score = 33.8 bits (78), Expect = 0.17
Identities = 16/52 (30%), Positives = 26/52 (50%), Gaps = 7/52 (13%)
Query: 159 DKGLKSKQQERQKEIAGLEAQIYKLAEQV-------SEQRSATKDNVQRKQA 203
DK K K +E ++I L+ +I + +Q+ EQR +N+ RKQ
Sbjct: 169 DKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQN 220
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 32.7 bits (75), Expect = 0.33
Identities = 41/201 (20%), Positives = 75/201 (37%), Gaps = 16/201 (7%)
Query: 32 LKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSE--ESEAGD 89
L +Q+ERQ+ L Q+ L +Q+ E R + A+ +R + E E + G
Sbjct: 279 LAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQKGA 338
Query: 90 TSDEDDEKEAGSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLEERAQRLFST--KGKTA 147
D D E+ DL S RL L + Q + + K
Sbjct: 339 FEDADIEQ----------LQADLDQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQK 388
Query: 148 LDPSLLAK-SKPDKGLKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGE 206
+ L K ++ L + ++E+ ++ A +E + L Q+ +Q A K ++ E
Sbjct: 389 IKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELE 448
Query: 207 GERGDSEESEAGDTSDEDDEV 227
R + + +E+
Sbjct: 449 -LRLGRLKQRLDSATATPEEL 468
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 32.7 bits (75), Expect = 0.40
Identities = 35/166 (21%), Positives = 68/166 (40%), Gaps = 27/166 (16%)
Query: 37 QERQKEIAGLEAQIYKLAEQVSE---QRSATKDNVQ------------RKQARGEGERGD 81
+ER++E+ LEA+I L E ++E +R + V+ R E D
Sbjct: 247 EERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDD 306
Query: 82 SEESEAGDTSDEDDEKEAGSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLEERAQRLFS 141
++ +E ++++ + A + E + LEERA+ L
Sbjct: 307 ADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESL---REDADDLEERAEEL-- 361
Query: 142 TKGKTALDPSLLAKSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQV 187
+ + A S L ++ + ++R++EI LE +I +L E+
Sbjct: 362 -REEAAELESELEEA------REAVEDRREEIEELEEEIEELRERF 400
>gnl|CDD|150291 pfam09577, Spore_YpjB, Sporulation protein YpjB (SpoYpjB). These
proteins are found in the endospore-forming bacteria
which include Bacillus species. In Bacillus subtilis,
ypjB was found to be part of the sigma-E regulon.
Sigma-E is a sporulation sigma factor that regulates
expression in the mother cell compartment. Null mutants
of ypjB show a sporulation defect, but this gene is not,
however, a part of the endospore formation minimal gene
set.
Length = 232
Score = 31.2 bits (71), Expect = 0.72
Identities = 18/52 (34%), Positives = 23/52 (44%), Gaps = 1/52 (1%)
Query: 103 LTHVGAHLD-LSAFSSWEELASLGLDRLKTLEERAQRLFSTKGKTALDPSLL 153
L V +HL L F + L+RL LE + LF + DPSLL
Sbjct: 149 LQRVDSHLSVLENFRFQQLSKKSKLERLTLLEGDLKALFDGVKEDEADPSLL 200
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 30.8 bits (70), Expect = 1.0
Identities = 42/224 (18%), Positives = 79/224 (35%), Gaps = 26/224 (11%)
Query: 17 ALDPSLLAKSKPDKGLKSKQQERQKEIAGLEAQIYKLAE---QVSEQRSATKDNVQRKQA 73
A D L K L+ K +E+Q + A LE Q+ L + Q T D++++ +
Sbjct: 35 ADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRK 94
Query: 74 RGEGERGDSEESEAGDTSDEDDEKEAGSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLE 133
+ E + E + L G + + S E+ + RL
Sbjct: 95 QIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPED--AQRSVRLAIYY 152
Query: 134 -----ERAQRLFSTKG---------------KTALDPSLLAKSKPDKGLKSKQQERQKEI 173
RA+R+ + K + L L + L +ER+K +
Sbjct: 153 GALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTL 212
Query: 174 AGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEA 217
A L ++ ++ E+ A + ++ + A E + E+ A
Sbjct: 213 AQLNSE-LSADQKKLEELRANESRLKNEIASAEAAAAKAREAAA 255
>gnl|CDD|129911 TIGR00831, a_cpa1, Na+/H+ antiporter, bacterial form. The
Monovalent Cation:Proton Antiporter-1 (CPA1) Family (TC
2.A.36) The CPA1 family is a large family of proteins
derived from Gram-positive and Gram-negative bacteria,
blue green bacteria, yeast, plants and animals.
Transporters from eukaryotes have been functionally
characterized, and all of these catalyze Na+:H+
exchange. Their primary physiological functions may be
in (1) cytoplasmic pH regulation, extruding the H+
generated during metabolism, and (2) salt tolerance (in
plants), due to Na+ uptake into vacuoles. This model is
specific for the bacterial members of this family
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 525
Score = 30.6 bits (69), Expect = 1.5
Identities = 16/70 (22%), Positives = 31/70 (44%), Gaps = 6/70 (8%)
Query: 36 QQERQKEIA-GLEAQIYKLAEQVSEQRSATKDNVQRK-----QARGEGERGDSEESEAGD 89
++E ++ IA + A+ K A + ++ + V R+ AR E R D EE
Sbjct: 414 ERELEEIIARYIAARSAKFALMKAVEQLRIVEPVARELLPELDARIEELRADGEEKIRSG 473
Query: 90 TSDEDDEKEA 99
+++ + A
Sbjct: 474 MGEKNLRRRA 483
>gnl|CDD|173689 cd05598, STKc_LATS, Catalytic domain of the Protein
Serine/Threonine Kinase, Large Tumor Suppressor.
Serine/Threonine Kinases (STKs), Large Tumor Suppressor
(LATS) subfamily, catalytic (c) domain. STKs catalyze
the transfer of the gamma-phosphoryl group from ATP to
serine/threonine residues on protein substrates. The
LATS subfamily is part of a larger superfamily that
includes the catalytic domains of other protein STKs,
protein tyrosine kinases, RIO kinases, aminoglycoside
phosphotransferase, choline kinase, and phosphoinositide
3-kinase. LATS was originally identified in Drosophila
using a screen for genes whose inactivation led to
overproliferation of cells. In tetrapods, there are two
LATS isoforms, LATS1 and LATS2. Inactivation of LATS1 in
mice results in the development of various tumors,
including sarcomas and ovarian cancer. LATS functions as
a tumor suppressor and is implicated in cell cycle
regulation.
Length = 376
Score = 29.8 bits (67), Expect = 2.4
Identities = 11/32 (34%), Positives = 15/32 (46%)
Query: 107 GAHLDLSAFSSWEELASLGLDRLKTLEERAQR 138
G H + EE + + RLK LE R +R
Sbjct: 163 GDHHRQDSMEPSEEWSEIDRCRLKPLERRRKR 194
>gnl|CDD|235131 PRK03562, PRK03562, glutathione-regulated potassium-efflux system
protein KefC; Provisional.
Length = 621
Score = 30.0 bits (68), Expect = 2.4
Identities = 13/54 (24%), Positives = 23/54 (42%), Gaps = 2/54 (3%)
Query: 48 AQIYKLAEQVSEQRSA--TKDNVQRKQARGEGERGDSEESEAGDTSDEDDEKEA 99
+ +A+Q S S T+D + G +G E G+ +DE + K +
Sbjct: 567 KKRAAVAKQTSAMLSEIITEDREHLSLIQRHGWQGTEEGKHTGNMADEPETKPS 620
Score = 29.2 bits (66), Expect = 4.0
Identities = 13/53 (24%), Positives = 22/53 (41%), Gaps = 2/53 (3%)
Query: 178 AQIYKLAEQVSEQRSA--TKDNVQRKQARGEGERGDSEESEAGDTSDEDDEVP 228
+ +A+Q S S T+D + G +G E G+ +DE + P
Sbjct: 567 KKRAAVAKQTSAMLSEIITEDREHLSLIQRHGWQGTEEGKHTGNMADEPETKP 619
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 30.1 bits (67), Expect = 2.5
Identities = 37/189 (19%), Positives = 74/189 (39%), Gaps = 9/189 (4%)
Query: 30 KGLKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGD 89
K ++K+ E K +A +A+ K AE+ + ++K E ++ + + +A +
Sbjct: 1565 KAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEE 1624
Query: 90 TSDEDDEKEAGSVLTHVGAHLDLSA--FSSWEELASLGLDRLKTLEERAQRLFSTKGKTA 147
++EK+ L A A EE + E ++ K
Sbjct: 1625 LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAE 1684
Query: 148 LDPSLLAKSKPDKGLKSKQQERQK--EIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARG 205
D + K + LK + +E +K E+ EA+ K AE++ + K + +
Sbjct: 1685 ED-----EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEA 1739
Query: 206 EGERGDSEE 214
E ++ +EE
Sbjct: 1740 EEDKKKAEE 1748
Score = 29.7 bits (66), Expect = 3.2
Identities = 33/169 (19%), Positives = 68/169 (40%), Gaps = 1/169 (0%)
Query: 35 KQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDED 94
K +E +K++ L+ + + ++ E + A ++N + + D +++E ++ED
Sbjct: 1627 KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED 1686
Query: 95 DEKEAGSVLTHVGAHLDLSAFSSWEELASLGLDRLKTLEE-RAQRLFSTKGKTALDPSLL 153
++K A ++ E + LK EE + K + D
Sbjct: 1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKA 1746
Query: 154 AKSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQ 202
++K D+ K K +KE +I K E V E+ +D +R +
Sbjct: 1747 EEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRME 1795
>gnl|CDD|221084 pfam11336, DUF3138, Protein of unknown function (DUF3138). This
family of proteins with unknown function appear to be
restricted to Proteobacteria.
Length = 514
Score = 29.5 bits (66), Expect = 3.3
Identities = 19/64 (29%), Positives = 30/64 (46%), Gaps = 7/64 (10%)
Query: 42 EIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDEDDEKEAGS 101
+I L+AQ+ L +QV+E R+A + A G G + S + A D A +
Sbjct: 26 QIKALQAQLTALQQQVNELRAALA---AKPAAAGGGAKIQSAAAAAAAAPSSD----AAA 78
Query: 102 VLTH 105
LT+
Sbjct: 79 ALTN 82
Score = 28.3 bits (63), Expect = 8.1
Identities = 16/53 (30%), Positives = 25/53 (47%), Gaps = 3/53 (5%)
Query: 172 EIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDED 224
+I L+AQ+ L +QV+E R+A + A G G + S + A D
Sbjct: 26 QIKALQAQLTALQQQVNELRAALA---AKPAAAGGGAKIQSAAAAAAAAPSSD 75
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.4 bits (66), Expect = 3.8
Identities = 12/61 (19%), Positives = 30/61 (49%), Gaps = 1/61 (1%)
Query: 38 ERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSDEDDEK 97
+RQ+ + ++ +++ E + T + + + A+ + E+ EE+E D++K
Sbjct: 366 KRQELLKEYNKKLQDYTKKLGEVKDETDASEEAE-AKAKEEKLKQEENEKKQKEQADEDK 424
Query: 98 E 98
E
Sbjct: 425 E 425
>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein. This family includes FlxA from
E. coli. The expression of FlxA is regulated by the FliA
sigma factor, a transcription factor specific for class
3 flagellar operons. However FlxA is not required for
flagellar function or formation.
Length = 106
Score = 27.8 bits (62), Expect = 4.0
Identities = 15/65 (23%), Positives = 31/65 (47%), Gaps = 3/65 (4%)
Query: 25 KSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRK---QARGEGERGD 81
+ + + +QQ Q++I L+AQI +L +Q SE+ + + Q EG
Sbjct: 42 EDLTAEEKQKQQQLIQQQIQMLQAQIAQLQQQQSEEAQQQQQSSIEDSPSQTSAEGVNRP 101
Query: 82 SEESE 86
+ +++
Sbjct: 102 TADNQ 106
Score = 27.8 bits (62), Expect = 4.0
Identities = 15/65 (23%), Positives = 31/65 (47%), Gaps = 3/65 (4%)
Query: 155 KSKPDKGLKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRK---QARGEGERGD 211
+ + + +QQ Q++I L+AQI +L +Q SE+ + + Q EG
Sbjct: 42 EDLTAEEKQKQQQLIQQQIQMLQAQIAQLQQQQSEEAQQQQQSSIEDSPSQTSAEGVNRP 101
Query: 212 SEESE 216
+ +++
Sbjct: 102 TADNQ 106
>gnl|CDD|223426 COG0349, Rnd, Ribonuclease D [Translation, ribosomal structure and
biogenesis].
Length = 361
Score = 28.8 bits (65), Expect = 5.2
Identities = 13/46 (28%), Positives = 21/46 (45%), Gaps = 4/46 (8%)
Query: 307 LALWEKLKAQKQEERWQPEQEEEFEDSLGNVVNRKTYEDLKRQGWQ 352
L L++KL + E E+EF + R+TY+ L W+
Sbjct: 156 LPLYDKLTEELAREGRLEWAEDEFR----LLPTRRTYKVLPEDAWR 197
>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal. The family represents
the N-terminal region of RXT2-like proteins. In S.
cerevisiae, RXT2 has been demonstrated to be involved in
conjugation with cellular fusion (mating) and invasive
growth. A high throughput localisation study has
localised RXT2 to the nucleus.
Length = 141
Score = 27.7 bits (62), Expect = 5.8
Identities = 10/45 (22%), Positives = 17/45 (37%)
Query: 190 QRSATKDNVQRKQARGEGERGDSEESEAGDTSDEDDEVPYNPKNL 234
R K N + G+ D +E + D + ++ NP L
Sbjct: 35 SRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDDENPYKL 79
>gnl|CDD|235591 PRK05755, PRK05755, DNA polymerase I; Provisional.
Length = 880
Score = 28.5 bits (65), Expect = 7.9
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 32 LKSKQQERQKEIAGLEAQIYKLAEQ 56
LK E + +A LE +IY+LA +
Sbjct: 505 LKELSAELAQRLAELEQEIYELAGE 529
Score = 28.5 bits (65), Expect = 7.9
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 162 LKSKQQERQKEIAGLEAQIYKLAEQ 186
LK E + +A LE +IY+LA +
Sbjct: 505 LKELSAELAQRLAELEQEIYELAGE 529
>gnl|CDD|234856 PRK00885, PRK00885, phosphoribosylamine--glycine ligase;
Provisional.
Length = 420
Score = 28.1 bits (64), Expect = 8.3
Identities = 17/47 (36%), Positives = 23/47 (48%), Gaps = 9/47 (19%)
Query: 269 GPKAFQRHFAEWRHAHGMRCLGIPNTAHFANVTQIEDALALWEKLKA 315
G KAF + F R+ GIP TA + T E+ALA ++ A
Sbjct: 101 GSKAFAKDFMA-RY-------GIP-TAAYETFTDAEEALAYLDEKGA 138
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 27.9 bits (62), Expect = 9.2
Identities = 16/72 (22%), Positives = 37/72 (51%)
Query: 33 KSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGDTSD 92
SKQ ++++ A +E ++ +L E ++ + +++++Q E DS++ +
Sbjct: 207 TSKQDQKEERRAAVERRLAELVEMINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDS 266
Query: 93 EDDEKEAGSVLT 104
ED E + S +T
Sbjct: 267 EDGETKPESYIT 278
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 28.2 bits (63), Expect = 9.5
Identities = 16/54 (29%), Positives = 25/54 (46%), Gaps = 1/54 (1%)
Query: 308 ALWEKLKAQKQEERWQPEQEEEFEDSLGNVVNRKTYEDLKRQGWQPEQEEEFED 361
ALWE L QK + + EQ+ D+ + + + + E+EEE ED
Sbjct: 351 ALWETLNKQKWTKAKETEQDY-ILDAFSALEIEDANTERDDEEEEDEEEEEEED 403
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 28.1 bits (63), Expect = 9.7
Identities = 27/89 (30%), Positives = 46/89 (51%), Gaps = 2/89 (2%)
Query: 130 KTLEERAQRLFSTKGKTALDPSLLAKSKPDKG-LKSKQQERQKEIAGLEAQIYKLAEQVS 188
+T EE A + + K + +LL + + + L+ + ER+K + GLE+ + K +Q+S
Sbjct: 177 QTREELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLTGLESSLQKDQQQLS 236
Query: 189 EQRSATKDNVQRKQARGEGERGDSEESEA 217
E R A + ++ AR E E E EA
Sbjct: 237 ELR-ANESRLRDSIARAEREAKARAEREA 264
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 27.3 bits (61), Expect = 9.7
Identities = 15/70 (21%), Positives = 33/70 (47%), Gaps = 9/70 (12%)
Query: 30 KGLKSKQQERQKEIAGLEAQIYKLAEQVSEQRSATKDNVQRKQARGEGERGDSEESEAGD 89
K K++E +EI ++ + + + +++ + K + K D ++ + D
Sbjct: 58 TEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDK---------DKKDDKKDD 108
Query: 90 TSDEDDEKEA 99
S++ DEKEA
Sbjct: 109 KSEKKDEKEA 118
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.308 0.128 0.362
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,201,493
Number of extensions: 1842098
Number of successful extensions: 2791
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2628
Number of HSP's successfully gapped: 187
Length of query: 381
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 282
Effective length of database: 6,546,556
Effective search space: 1846128792
Effective search space used: 1846128792
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 60 (26.8 bits)