RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy3460
(153 letters)
>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper.
Length = 65
Score = 57.2 bits (139), Expect = 5e-12
Identities = 26/65 (40%), Positives = 40/65 (61%)
Query: 37 EKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLV 96
E+ K+ RR+ +N+ +A+ SR +KK ++ LERKVE L++EN KK I L L
Sbjct: 1 EEDEKRRRRRERNREAARRSRERKKAEIEELERKVEQLEAENERLKKEIERLRRELEKLK 60
Query: 97 SQLNK 101
S+L +
Sbjct: 61 SELEE 65
>gnl|CDD|201054 pfam00170, bZIP_1, bZIP transcription factor. The Pfam entry
includes the basic region and the leucine zipper
region.
Length = 64
Score = 55.4 bits (134), Expect = 2e-11
Identities = 24/63 (38%), Positives = 39/63 (61%)
Query: 37 EKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLV 96
EK LK+ RR+ KN+ +A+ SR +KK Y++ LE KV+ L++EN + + L+ L
Sbjct: 1 EKELKRERRRQKNREAARRSRLRKKAYIEELEEKVKELEAENKTLRSELERLKKECAKLK 60
Query: 97 SQL 99
S+
Sbjct: 61 SEN 63
>gnl|CDD|213400 cd12193, B_zip1, basic leucine zipper DNA-binding and
multimerization region of GCN4 and related proteins.
Basic leucine zipper (bZIP) transcription factors act
in networks of homo- and hetero-dimers in the
regulation in a diverse set of cellular pathways.
Classical leucine zippers have alpha helices with
leucine residues 7 amino acids apart, which stabilize
dimerization with a parallel leucine zipper domain.
Dimerization creates a pair of basic regions that bind
DNA and undergo conformational change. GCN4 was
identified in Saccharomyces cerevisiae from mutations
in a deficiency in activation with the general amino
acid control pathway. GCN4 encodes a trans-activator of
amino acid biosynthetic genes containing 2 acidic
activation domains and a C-terminal bZIP domain,
comprised of a basic alpha-helical DNA-binding region
and a coiled-coil dimerization region.
Length = 54
Score = 44.2 bits (105), Expect = 3e-07
Identities = 19/51 (37%), Positives = 30/51 (58%)
Query: 40 LKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLED 90
+ R++ +N+ +A+ SR +K E M LE KVE L SEN D + + L+
Sbjct: 1 DPRARKRARNREAARRSRARKMERMKQLEDKVEELTSENSDLEDEVDRLKK 51
>gnl|CDD|203738 pfam07716, bZIP_2, Basic region leucine zipper.
Length = 54
Score = 36.4 bits (85), Expect = 3e-04
Identities = 17/53 (32%), Positives = 31/53 (58%)
Query: 38 KSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLED 90
K + R+ +N +A+ SR KKK+ + LE +V+ L+ EN ++++ LE
Sbjct: 1 KDDEYRDRRRRNNEAARRSREKKKQREEELEERVKELEEENAQLRQKVEQLEK 53
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 33.4 bits (77), Expect = 0.039
Identities = 20/76 (26%), Positives = 39/76 (51%), Gaps = 5/76 (6%)
Query: 42 KIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLVSQLNK 101
K RR N++ E R ++E + L+RK+E L + + +K+ L + +L + +
Sbjct: 72 KERR---NELQRLERRLLQRE--ETLDRKMESLDKKEENLEKKEKELSNKEKNLDEKEEE 126
Query: 102 LQAIIARHTAQLSNVS 117
L+ +IA +L +S
Sbjct: 127 LEELIAEQREELERIS 142
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 32.0 bits (73), Expect = 0.11
Identities = 29/127 (22%), Positives = 48/127 (37%), Gaps = 19/127 (14%)
Query: 36 EEKSLKKIRRKIKNKISAQESRRKKK--EYMDALERKVEILKSENCDYKKRISNLEDTNY 93
E S++ ++ NK+ ES+ K+ EY D L+ K IL E K S L++
Sbjct: 944 EGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLK-KSTILVRE---GNKANSELKNFKK 999
Query: 94 SL-------------VSQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKLQAIIARHTAQL 140
L QL +L +A + +S + +S L LQ + +
Sbjct: 1000 ELAELSKQYGALQESTKQLKELPVEVAELQSASKIISSESTELSILKPLQKLKGLLLLEN 1059
Query: 141 SNNQNNL 147
+ Q
Sbjct: 1060 NQLQARY 1066
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 31.2 bits (71), Expect = 0.18
Identities = 24/103 (23%), Positives = 53/103 (51%), Gaps = 1/103 (0%)
Query: 12 TEEEKRTLQAEGYPIPTRLPLTKTEEKSLKKIRRKIKNKISAQESRRKK-KEYMDALERK 70
EEE ++L+ E + L + + + L++ ++K +++A E ++ + ++ LE +
Sbjct: 686 LEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEE 745
Query: 71 VEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQL 113
+E L+ E + ++R+ LE+ SL L KL+ I +
Sbjct: 746 LEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKR 788
Score = 27.8 bits (62), Expect = 2.7
Identities = 21/134 (15%), Positives = 50/134 (37%), Gaps = 11/134 (8%)
Query: 13 EEEKRTLQAEGYPIPTRLPLTKTEEKSLKKIRRKIKNKISAQES-----RRKKKEYMDAL 67
E E L+ + L + + LK+ +K ++ +E+ + E +A
Sbjct: 301 EGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAK 360
Query: 68 ERKVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLN 127
E E L + + ++ L + L ++L +++ + ++ SL +L
Sbjct: 361 EELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIE------SLEERLE 414
Query: 128 KLQAIIARHTAQLS 141
+L + +L
Sbjct: 415 RLSERLEDLKEELK 428
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 30.8 bits (70), Expect = 0.24
Identities = 17/77 (22%), Positives = 32/77 (41%), Gaps = 10/77 (12%)
Query: 38 KSLKKIRRKIKNKISAQESRRKKKEY----------MDALERKVEILKSENCDYKKRISN 87
L + K+K + +E ++E + LE VE L+ EN + K+ +
Sbjct: 388 YPLAEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEE 447
Query: 88 LEDTNYSLVSQLNKLQA 104
L+ L S+L + +
Sbjct: 448 LKREIEKLESELERFRR 464
Score = 29.3 bits (66), Expect = 0.88
Identities = 23/71 (32%), Positives = 33/71 (46%), Gaps = 3/71 (4%)
Query: 23 GYPIPTRLPLTKTEEKSLKKIR--RKIKNKISAQESRRKKKEYMD-ALERKVEILKSENC 79
GYP+ L K EE+ +K + + +I+ E R KK E LE + LK E
Sbjct: 387 GYPLAEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELE 446
Query: 80 DYKKRISNLED 90
+ K+ I LE
Sbjct: 447 ELKREIEKLES 457
>gnl|CDD|224520 COG1604, COG1604, CRISPR system related protein, RAMP superfamily
[Defense mechanisms].
Length = 257
Score = 30.5 bits (69), Expect = 0.27
Identities = 13/60 (21%), Positives = 23/60 (38%)
Query: 36 EEKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSL 95
E L K + +K +A E K + ++LK Y++ + N + T S
Sbjct: 19 EYNPLLKEQIINLDKKNAFEEEIIKDFVNEIQSYLSKVLKRLGRSYEQYVKNTKITLNSF 78
>gnl|CDD|185098 PRK15176, PRK15176, Vi polysaccharide export inner membrane protein
VexB; Provisional.
Length = 264
Score = 29.2 bits (65), Expect = 0.77
Identities = 16/49 (32%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 91 TNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKLQAIIARHTAQ 139
T+ ++VSQ+ K I + + N I+Y ++S +N L I +R AQ
Sbjct: 100 TSLAVVSQVKKRHVIFSLAAIEFVNAVIIYIIISLINFL--IFSRWEAQ 146
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 29.3 bits (66), Expect = 0.81
Identities = 27/111 (24%), Positives = 59/111 (53%), Gaps = 14/111 (12%)
Query: 36 EEKSLKKIRRKIKNKISAQESRRK-----KKEYMDALERKVEILKSENCDYKKRISNLED 90
+ + +I ++I ++ +E + K +E + +LE+++E +KSE + + RI LE+
Sbjct: 714 ASRKIGEIEKEI-EQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEE 772
Query: 91 TNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKLQAIIARHTAQLS 141
+ L LN L+A ++ S + + ++L+KL+ ++R A+L
Sbjct: 773 DLHKLEEALNDLEARLSH--------SRIPEIQAELSKLEEEVSRIEARLR 815
Score = 28.9 bits (65), Expect = 1.1
Identities = 30/149 (20%), Positives = 55/149 (36%), Gaps = 23/149 (15%)
Query: 8 VLHLTEEEKRTLQAEGYPIPTRLPLTKTEEKSLKKIRRKIKNKISAQESRRKKKEY---- 63
LH EE L+A +R+P + E L++ +I+ ++ E + +
Sbjct: 773 DLHKLEEALNDLEAR--LSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEY 830
Query: 64 ----MDALERKVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIV 119
+ L+ + LK + +K I NL L +L +L+A +
Sbjct: 831 LEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEA-------------AL 877
Query: 120 YSLVSQLNKLQAIIARHTAQLSNNQNNLR 148
L S+L L+ AQL + +
Sbjct: 878 RDLESRLGDLKKERDELEAQLRELERKIE 906
Score = 28.9 bits (65), Expect = 1.1
Identities = 20/109 (18%), Positives = 48/109 (44%), Gaps = 16/109 (14%)
Query: 37 EKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVE-------ILKSENCDYKKRISNLE 89
E+ ++I + K + E + KE ++ L ++E + E DY++++ L+
Sbjct: 339 EELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLK 398
Query: 90 DTNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKLQAIIARHTA 138
L +L++LQ + R + +L++++ + I A+
Sbjct: 399 REINELKRELDRLQEELQRLSEELADLN---------AAIAGIEAKINE 438
>gnl|CDD|233922 TIGR02550, flagell_flgL, flagellar hook-associated protein 3. This
protein family consists of flagellar hook-associated
proteins designated FlgL (or HAP3) encoded in bacterial
flagellar operons. A N-terminal region of about 150
residues and a C-terminal region of about 85 residues
are conserved. Members show considerable length
heterogeneity between these two well-conserved terminal
regions; the seed alignment 486 columns, while members
of this family are from 287 to over 500 residues in
length. This model distinguishes FlgL from the flagellin
gene product FliC [Cellular processes, Chemotaxis and
motility].
Length = 267
Score = 28.4 bits (64), Expect = 1.3
Identities = 11/42 (26%), Positives = 19/42 (45%), Gaps = 1/42 (2%)
Query: 64 MDALERKVEILKSENCDYKKRISNLEDTNY-SLVSQLNKLQA 104
++ LE L + + +S+LED + +S LN Q
Sbjct: 205 LNRLENLENRLSEQKLQLTEVLSDLEDVDLAEAISDLNMQQT 246
>gnl|CDD|240334 PTZ00258, PTZ00258, GTP-binding protein; Provisional.
Length = 390
Score = 28.4 bits (64), Expect = 1.4
Identities = 20/81 (24%), Positives = 41/81 (50%), Gaps = 10/81 (12%)
Query: 37 EKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLV 96
EK L ++ +K K K ++++K +D L++ +E L+ K + + + T+ +
Sbjct: 160 EKRLDELTKKRKKKKK----KKEEKVELDVLKKVLEWLEE-----GKPVRDGDWTD-KEI 209
Query: 97 SQLNKLQAIIARHTAQLSNVS 117
LN+ Q + A+ L N+S
Sbjct: 210 EILNEYQLLTAKPMIYLVNMS 230
>gnl|CDD|99913 cd05530, POLBc_B1, DNA polymerase type-B B1 subfamily catalytic
domain. Archaeal proteins that are involved in DNA
replication are similar to those from eukaryotes. Some
archaeal members also possess multiple family B DNA
polymerases (B1, B2 and B3). So far there is no specific
function(s) has been assigned for different members of
the archaea type B DNA polymerases. Phylogenetic
analyses of eubacterial, archaeal, and eukaryotic family
B DNA polymerases are support independent gene
duplications during the evolution of archaeal and
eukaryotic family B DNA polymerases.
Length = 372
Score = 28.5 bits (64), Expect = 1.4
Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 2/44 (4%)
Query: 41 KKIRRKIKNKISAQESRRKKKEY-MDALERKVEILKSENCDYKK 83
+K R KI++ + R KKKEY +D L KV + K +Y K
Sbjct: 264 EKAREKIRDIVKGVYKRLKKKEYTLDQLAFKVMLSKPPE-EYTK 306
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 28.1 bits (63), Expect = 1.6
Identities = 18/103 (17%), Positives = 39/103 (37%), Gaps = 5/103 (4%)
Query: 15 EKRTLQAEGYPIPTRLPLTKTEEKSLKKIRRKIKNKISAQESR-RKKKEYMD----ALER 69
E+ A L K E K+ +I+ +IS + +K+E ++ AL +
Sbjct: 44 EEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEELKRALAQ 103
Query: 70 KVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQ 112
+ L S + +KR ++ + + + + A+
Sbjct: 104 RRSDLSSASYQLEKRRASQLEKLQDEIKRTRSKLNALHSLLAE 146
Score = 26.9 bits (60), Expect = 5.1
Identities = 25/123 (20%), Positives = 48/123 (39%), Gaps = 5/123 (4%)
Query: 28 TRLPLTKTEEKSLKKIRRKIKNKISA---QESRRKKKEYMDALERKVEILKSENCDYKKR 84
L + + L ++K K+ + K D L+ +V K + R
Sbjct: 20 NSLYELRLDLARLLLENEELKQKVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRAR 79
Query: 85 ISNLEDTNYSLVSQLNKLQAIIA--RHTAQLSNVSIVYSLVSQLNKLQAIIARHTAQLSN 142
IS L++ ++ +L+ +A R ++ + SQL KLQ I R ++L+
Sbjct: 80 ISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRTRSKLNA 139
Query: 143 NQN 145
+
Sbjct: 140 LHS 142
>gnl|CDD|216597 pfam01601, Corona_S2, Coronavirus S2 glycoprotein. The coronavirus
spike glycoprotein forms the characteristic 'corona'
after which the group is named. The Spike glycoprotein
is translated as a large polypeptide that is
subsequently cleaved to S1 pfam01600 and S2.
Length = 609
Score = 28.0 bits (63), Expect = 2.1
Identities = 17/70 (24%), Positives = 33/70 (47%), Gaps = 10/70 (14%)
Query: 64 MDALERKVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLV 123
D L+ +++ + + K + N++D + S L+K+Q ++ + LS L
Sbjct: 243 TDVLQENQKLIANS---FNKALGNIQDGFTTTASALSKIQDVVNQQGQALSQ------LT 293
Query: 124 SQL-NKLQAI 132
+QL N AI
Sbjct: 294 NQLSNNFGAI 303
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 28.0 bits (63), Expect = 2.2
Identities = 17/93 (18%), Positives = 35/93 (37%), Gaps = 8/93 (8%)
Query: 61 KEYMDALERKVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVY 120
KE + LE +++ L+ E KK + L + + Q T +
Sbjct: 333 KEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKE-------- 384
Query: 121 SLVSQLNKLQAIIARHTAQLSNNQNNLRVGVAK 153
L +L +L+ + +L + + ++ V K
Sbjct: 385 KLSEELEELEEELKELKEELESLYSEGKISVNK 417
>gnl|CDD|235613 PRK05802, PRK05802, hypothetical protein; Provisional.
Length = 320
Score = 27.6 bits (62), Expect = 2.3
Identities = 13/37 (35%), Positives = 17/37 (45%)
Query: 97 SQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKLQAII 133
S N +IAR Q V ++ L S NK+ II
Sbjct: 169 STKNGKSLVIARGIGQAPGVPVIKKLYSNGNKIIVII 205
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 27.9 bits (63), Expect = 2.4
Identities = 13/48 (27%), Positives = 20/48 (41%)
Query: 55 ESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLVSQLNKL 102
E +E LE ++ L+ + D + LE L S+L KL
Sbjct: 1 ERLEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEKL 48
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 28.0 bits (63), Expect = 2.5
Identities = 15/95 (15%), Positives = 38/95 (40%), Gaps = 8/95 (8%)
Query: 36 EEKSLKKIRRKIKNKISAQESRRKKKEYMDA------LERKVEILKSENC--DYKKRISN 87
EE S +++R+ E+ K + Y+ ++KV + E D ++ +
Sbjct: 38 EELSNERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEK 97
Query: 88 LEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVYSL 122
+E L ++++L+ I ++ + +
Sbjct: 98 IEKEIKELEEEISELENEIKELEQEIERLEPWGNF 132
>gnl|CDD|219955 pfam08657, DASH_Spc34, DASH complex subunit Spc34. The DASH
complex is a ~10 subunit microtubule-binding complex
that is transferred to the kinetochore prior to mitosis.
In Saccharomyces cerevisiae DASH forms both rings and
spiral structures on microtubules in vitro. Components
of the DASH complex, including Dam1, Duo1, Spc34, Dad1
and Ask1, are essential and connect the centromere to
the plus end of spindle microtubules.
Length = 255
Score = 27.5 bits (61), Expect = 2.8
Identities = 12/42 (28%), Positives = 20/42 (47%)
Query: 80 DYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVYS 121
+++I L D L + +L+A IA QL ++I S
Sbjct: 176 GAREKIETLRDRYNELSESIAELEAEIAEQKRQLEIMNISSS 217
>gnl|CDD|190534 pfam03131, bZIP_Maf, bZIP Maf transcription factor. Maf
transcription factors contain a conserved basic region
leucine zipper (bZIP) domain, which mediates their
dimerisation and DNA binding property. Thus, this
family is probably related to pfam00170.
Length = 93
Score = 26.6 bits (59), Expect = 3.1
Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 7/54 (12%)
Query: 32 LTKTEEKSLKKIRRKIKNKISAQESRRKKKEYMDALE-------RKVEILKSEN 78
L+K E LK+ RR +KN+ AQ R K+ + LE ++VE LK E
Sbjct: 21 LSKEEVIRLKQRRRTLKNRGYAQSCRSKRVQQRHELEKEKSQLQQQVEQLKQEV 74
>gnl|CDD|236345 PRK08870, flgL, flagellar hook-associated protein FlgL; Reviewed.
Length = 404
Score = 27.2 bits (61), Expect = 3.4
Identities = 14/65 (21%), Positives = 26/65 (40%), Gaps = 8/65 (12%)
Query: 64 MDALERKVEILKSENCDYKKRISNLEDTNY-SLVSQLNKLQAIIARHTAQLSNVSIVYSL 122
++ L+ + + +S+LED +Y S +S+ QA L +S
Sbjct: 342 LNELDSAEAVHEDNKLQNTSALSDLEDLDYASAISEFELQQAA-------LQAAQQAFSK 394
Query: 123 VSQLN 127
V L+
Sbjct: 395 VQSLS 399
>gnl|CDD|133212 cd05081, PTKc_Jak2_Jak3_rpt2, Catalytic (repeat 2) domain of the
Protein Tyrosine Kinases, Janus kinases 2 and 3.
Protein Tyrosine Kinase (PTK) family; Janus kinase 2
(Jak2) and Jak3; catalytic (c) domain (repeat 2). The
PTKc family is part of a larger superfamily that
includes the catalytic domains of other kinases such as
protein serine/threonine kinases, RIO kinases, and
phosphoinositide 3-kinase (PI3K). PTKs catalyze the
transfer of the gamma-phosphoryl group from ATP to
tyrosine (tyr) residues in protein substrates. Jak2 and
Jak3 are members of the Janus kinase (Jak) subfamily of
proteins, which are cytoplasmic (or nonreceptor) tyr
kinases containing an N-terminal FERM domain, followed
by a Src homology 2 (SH2) domain, a pseudokinase
domain, and a C-terminal catalytic tyr kinase domain.
Jaks are crucial for cytokine receptor signaling. They
are activated by autophosphorylation upon
cytokine-induced receptor aggregation, and subsequently
trigger downstream signaling events such as the
phosphorylation of signal transducers and activators of
transcription (STATs). Jak2 is widely expressed in many
tissues while Jak3 is expressed only in hematopoietic
cells. Jak2 is essential for the signaling of
hormone-like cytokines such as growth hormone,
erythropoietin, thrombopoietin, and prolactin, as well
as some IFNs and cytokines that signal through the IL-3
and gp130 receptors. Jak3 binds the shared receptor
subunit common gamma chain and thus, is essential in
the signaling of cytokines that use it such as IL-2,
IL-4, IL-7, IL-9, IL-15, and IL-21. Disruption of Jak2
in mice results in an embryonic lethal phenotype with
multiple defects including erythropoietic and cardiac
abnormalities. It is the only Jak gene that results in
a lethal phenotype when disrupted in mice. A mutation
in the pseudokinase domain of Jak2, V617F, is present
in many myeloproliferative diseases, including almost
all patients with polycythemia vera, and 50% of
patients with essential thrombocytosis and
myelofibrosis. Jak3 is important in lymphoid
development and myeloid cell differentiation.
Inactivating mutations in Jak3 have been reported in
humans with severe combined immunodeficiency (SCID).
Length = 284
Score = 27.0 bits (60), Expect = 4.6
Identities = 9/22 (40%), Positives = 13/22 (59%)
Query: 60 KKEYMDALERKVEILKSENCDY 81
E++ ER++EILKS D
Sbjct: 45 TAEHLRDFEREIEILKSLQHDN 66
>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
Length = 1047
Score = 27.1 bits (60), Expect = 4.9
Identities = 16/80 (20%), Positives = 32/80 (40%), Gaps = 2/80 (2%)
Query: 27 PTRLPLTKTEEKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYK--KR 84
P R L + + + +R + +++ Q +++ + AL + K + K
Sbjct: 423 PLRQRLVALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKT 482
Query: 85 ISNLEDTNYSLVSQLNKLQA 104
I E L +Q +LQA
Sbjct: 483 ICEQEARIKDLEAQRAQLQA 502
>gnl|CDD|221268 pfam11853, DUF3373, Protein of unknown function (DUF3373). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria. Proteins in this
family are typically between 472 to 574 amino acids in
length.
Length = 485
Score = 27.1 bits (60), Expect = 5.0
Identities = 10/42 (23%), Positives = 22/42 (52%), Gaps = 3/42 (7%)
Query: 53 AQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYS 94
A +K ++AL++++ LK++ D +KR+ E +
Sbjct: 25 ADIDLLQK---IEALKKELAELKAQLKDLQKRVDKTEKKSAG 63
>gnl|CDD|131884 TIGR02837, spore_II_R, stage II sporulation protein R. A
comparative genome analysis of all sequenced genomes of
shows a number of proteins conserved strictly among the
endospore-forming subset of the Firmicutes. This
protein, a member of this panel, is designated stage II
sporulation protein R [Cellular processes, Sporulation
and germination].
Length = 168
Score = 26.6 bits (59), Expect = 5.1
Identities = 18/48 (37%), Positives = 29/48 (60%), Gaps = 1/48 (2%)
Query: 40 LKKIRRKIKNKISAQESRRKKKEYMDALERKVE-ILKSENCDYKKRIS 86
LK+IR + S +E+RR +E + +ER E ++K+E DYK R+
Sbjct: 65 LKEIRPWLSGLKSLEEARRVIRENLPEIERIAESVIKAEGADYKVRVE 112
>gnl|CDD|220388 pfam09766, FimP, Fms-interacting protein. This entry carries part
of the crucial 144 N-terminal residues of the FmiP
protein, which is essential for the binding of the
protein to the cytoplasmic domain of activated
Fms-molecules in M-CSF induced haematopoietic
differentiation of macrophages. The C-terminus contains
a putative nuclear localisation sequence and a leucine
zipper which suggest further, as yet unknown, nuclear
functions. The level of FMIP expression might form a
threshold that determines whether cells differentiate
into macrophages or into granulocytes.
Length = 352
Score = 26.9 bits (60), Expect = 5.3
Identities = 20/96 (20%), Positives = 37/96 (38%), Gaps = 12/96 (12%)
Query: 8 VLHLTEEEKRTLQAE-GYPIPTRLPLTKTEEKSLKKIRRKIKNKISAQ-----------E 55
HL +E R L + +PL + ++ + I R K + E
Sbjct: 42 KSHLEKEINRCLDFKSKDEEIELVPLEEFYAQAPEDISRPEKTPLDEHNLMLARLEWELE 101
Query: 56 SRRKKKEYMDALERKVEILKSENCDYKKRISNLEDT 91
R++ E + LE + E L+ + K+ +S+L
Sbjct: 102 QRKELAEKLKELESEKEKLQQDIEKKKEFLSSLPPK 137
>gnl|CDD|219638 pfam07914, DUF1679, Protein of unknown function (DUF1679). The
region featured in this family is found in a number of
C. elegans proteins, in one case as a repeat. In many of
the family members, this region is associated with the
CHK region described by SMART as being found in ZnF_C4
and HLH domain-containing kinases. In fact, one member
of this family is annotated as being a member of the
nuclear hormone receptor family, and contains regions
typical of such proteins (Interpro:IPR000536,
Interpro:IPR008946, and Interpro:IPR001628).
Length = 413
Score = 26.5 bits (59), Expect = 6.4
Identities = 33/135 (24%), Positives = 62/135 (45%), Gaps = 19/135 (14%)
Query: 9 LHLTEEEKRTLQAEGYPIPTRLPLTKTEEKSLKKIRRKIKNKISAQESRRKKKEYMDALE 68
L+EEEK++ A G ++ T E+ LK I +++N A+ + +E
Sbjct: 189 ESLSEEEKKS--ANGADFLEQMFETFMSEEGLKGIFDQLRNIFGAEYPEK--------VE 238
Query: 69 RKVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLN- 127
V+I + Y I + Y+ +S++ ++ ++ H L +I++SL +
Sbjct: 239 ELVDIFE----HYGPEILIFK--KYTNLSKVLGIKPVL-VH-GDLWQSNILWSLDNDGKL 290
Query: 128 KLQAIIARHTAQLSN 142
KL+AII T + N
Sbjct: 291 KLKAIIDYQTVHMGN 305
>gnl|CDD|169386 PRK08334, PRK08334, translation initiation factor IF-2B subunit
beta; Validated.
Length = 356
Score = 26.5 bits (58), Expect = 6.9
Identities = 20/95 (21%), Positives = 40/95 (42%), Gaps = 25/95 (26%)
Query: 58 RKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVS 117
+ K E+MD + E LK N T +L LN+++ ++ H
Sbjct: 80 KTKDEFMDGFYKAYETLK-----------NTRPTAVNLFWALNRIKKLVEEH-------- 120
Query: 118 IVYSLVSQLNKLQAIIARHTAQLSNN--QNNLRVG 150
L L++++ +I ++++ + NLR+G
Sbjct: 121 ----LEDPLDEIKRLIVEEAQKIADEDVEANLRMG 151
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 26.5 bits (59), Expect = 7.1
Identities = 20/98 (20%), Positives = 42/98 (42%), Gaps = 5/98 (5%)
Query: 34 KTEEKSLKKIRRKIKNKISAQESRRKK--KEYMDALERKVEILKSENCDYKKRISNLEDT 91
+ +++ KI N + +KK + L+ ++ + E +K I++LE
Sbjct: 371 DAINELIREHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKE 430
Query: 92 NYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKL 129
L +++ L+ I QL+N+ ++NKL
Sbjct: 431 IKQLEAEIKALEKEIKELEKQLTNIEP---TADEINKL 465
>gnl|CDD|222818 PHA01075, PHA01075, major capsid protein.
Length = 408
Score = 26.1 bits (57), Expect = 7.9
Identities = 26/134 (19%), Positives = 57/134 (42%), Gaps = 30/134 (22%)
Query: 26 IPTRLPLTKTEEKSLKK-------------IRRKIKNKISAQESRRKKKEYMDALERKVE 72
+P + L+K EE LK+ I +K K ++ ++R + DA +
Sbjct: 95 VPINMDLSKNEELMLKRNYPRMATKLYGNGIVKKQKFTLNNNDTRFNFQTLADATNYALG 154
Query: 73 ILKSENCDYKKRISNLEDTNYSLVSQLNKLQAIIARHTAQLSNVSIVYSLVSQLNKLQAI 132
+ YKK+IS++ V + +++A++ ++ + + V S+ + +
Sbjct: 155 V-------YKKKISDIN------VLEEKEMRAMLVDYSLNQLSETNVRKATSK----EDL 197
Query: 133 IARHTAQLSNNQNN 146
++ + N QNN
Sbjct: 198 ASKVFEAILNLQNN 211
>gnl|CDD|206687 cd01900, YchF, YchF GTPase. YchF is a member of the Obg family,
which includes four other subfamilies of GTPases: Obg,
DRG, Ygr210, and NOG1. Obg is an essential gene that is
involved in DNA replication in C. crescentus and
Streptomyces griseus and is associated with the
ribosome. Several members of the family, including YchF,
possess the TGS domain related to the RNA-binding
proteins. Experimental data and genomic analysis suggest
that YchF may be part of a nucleoprotein complex and may
function as a GTP-dependent translational factor.
Length = 274
Score = 26.3 bits (59), Expect = 8.0
Identities = 12/53 (22%), Positives = 26/53 (49%), Gaps = 6/53 (11%)
Query: 37 EKSLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLE 89
EK L+++ +K K+ ++ KE ++ LE+ E L+ +++ E
Sbjct: 137 EKRLERLEKKAKSG------DKEAKEELELLEKIKEHLEEGKPARTLELTDEE 183
>gnl|CDD|132845 cd07206, Pat_TGL3-4-5_SDP1, Triacylglycerol lipase 3, 4, and 5 and
Sugar-Dependent 1 lipase. Triacylglycerol lipases are
involved in triacylglycerol mobilization and
degradation; they are found in lipid particles. TGL4 is
30% homologus to TGL3, whereas TGL5 is 26% homologus to
TGL3. Sugar-Dependent 1 (SDP1) lipase has a patatin-like
acyl-hydrolase domain that initiates the breakdown of
storage oil in germinating Arabidopsis seeds. This
family includes subfamilies of proteins: TGL3, TGL4,
TGL5, and SDP1.
Length = 298
Score = 26.0 bits (58), Expect = 8.2
Identities = 8/18 (44%), Positives = 13/18 (72%)
Query: 83 KRISNLEDTNYSLVSQLN 100
KR++ L + N+ +VSQ N
Sbjct: 214 KRLARLYNVNHFIVSQTN 231
>gnl|CDD|200298 TIGR03603, cyclo_dehy_ocin, thiazole/oxazole-forming peptide
maturase, SagC family component. Members of this
protein family include enzymes related to SagC, a
protein involved in thiazole/oxazole cyclodehydration
modifications during biosynthesis of streptolysin S in
Streptococcus pyogenes from the protoxin polypeptide
(product of the sagA gene). Recent evidence suggests
that the YcaO/SagD-like component, not this component,
performs an ATP-dependent cyclodehydration. This protein
family serves as a marker for widely distributed
prokaryotic systems for making a general class of
heterocycle-containing bacteriocins. Note that this
model does not find all possible examples of bacteriocin
biosynthesis cyclodehydratases, an in particular misses
the E. coli plasmid protein McbB of microcin B17
biosynthesis [Cellular processes, Pathogenesis].
Length = 319
Score = 26.4 bits (58), Expect = 8.2
Identities = 17/67 (25%), Positives = 32/67 (47%), Gaps = 7/67 (10%)
Query: 43 IRRKIKNKISAQESRRKKK-----EYMDALERKVEI--LKSENCDYKKRISNLEDTNYSL 95
+R ++ + +++ + K+ +D LE K + K DY+ I E N SL
Sbjct: 108 LRETAESDLYSKDDIQLKRDRLLTSNLDILESKSLVDDKKDTLKDYEMIIVCAEHGNISL 167
Query: 96 VSQLNKL 102
+ +LN+L
Sbjct: 168 LRELNRL 174
>gnl|CDD|149675 pfam08694, UFC1, Ubiquitin-fold modifier-conjugating enzyme 1.
Ubiquitin-like (UBL) post-translational modifiers are
covalently linked to most, if not all, target
protein(s) through an enzymatic cascade analogous to
ubiquitylation, consisting of E1 (activating), E2
(conjugating), and E3 (ligating) enzymes.
Ubiquitin-fold modifier 1 (Ufm1) a ubiquitin-like
protein is activated by a novel E1-like enzyme, Uba5,
by forming a high-energy thioester bond. Activated Ufm1
is then transferred to its cognate E2-like enzyme,
Ufc1, in a similar thioester linkage. This family
represents the E2-like enzyme.
Length = 161
Score = 25.5 bits (56), Expect = 9.4
Identities = 15/36 (41%), Positives = 21/36 (58%), Gaps = 1/36 (2%)
Query: 58 RKKKEYMDALERKVEILKSENCDYKKRISNLEDTNY 93
R K+EY AL + VE KS + D+ + SN E T +
Sbjct: 28 RLKEEYQ-ALIKYVENNKSADNDWFRLESNKEGTRW 62
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 26.2 bits (58), Expect = 9.6
Identities = 12/77 (15%), Positives = 32/77 (41%), Gaps = 3/77 (3%)
Query: 34 KTEEKSLKKIRRKI---KNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLED 90
+ E+ ++++ +I K +I E + K+ + + + L +Y + +E
Sbjct: 255 RKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEK 314
Query: 91 TNYSLVSQLNKLQAIIA 107
L ++N ++ I
Sbjct: 315 RLSRLEEEINGIEERIK 331
>gnl|CDD|100796 PRK01156, PRK01156, chromosome segregation protein; Provisional.
Length = 895
Score = 26.0 bits (57), Expect = 10.0
Identities = 14/67 (20%), Positives = 25/67 (37%)
Query: 39 SLKKIRRKIKNKISAQESRRKKKEYMDALERKVEILKSENCDYKKRISNLEDTNYSLVSQ 98
+ K+I I +I+ + KK + KS D +I LE S
Sbjct: 309 NKKQILSNIDAEINKYHAIIKKLSVLQKDYNDYIKKKSRYDDLNNQILELEGYEMDYNSY 368
Query: 99 LNKLQAI 105
L ++++
Sbjct: 369 LKSIESL 375
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.126 0.327
Gapped
Lambda K H
0.267 0.0856 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,292,850
Number of extensions: 648110
Number of successful extensions: 1346
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1304
Number of HSP's successfully gapped: 248
Length of query: 153
Length of database: 10,937,602
Length adjustment: 89
Effective length of query: 64
Effective length of database: 6,990,096
Effective search space: 447366144
Effective search space used: 447366144
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 55 (24.7 bits)