RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14051
(256 letters)
>gnl|CDD|189427 pfam00157, Pou, Pou domain - N-terminal to homeobox domain.
Length = 75
Score = 101 bits (253), Expect = 9e-28
Identities = 38/109 (34%), Positives = 53/109 (48%), Gaps = 34/109 (31%)
Query: 2 VDGINLDEIKEFAKAFKLRRLSLGLTQTQVGQALSVTEGPAYSQSAICSALATQMLCAAQ 61
D +LDE+++FAK FK RR+ LG TQ VG AL GP +SQ+ IC
Sbjct: 1 DDTTDLDELEQFAKEFKQRRIKLGYTQADVGLALGALYGPVFSQTTICR----------- 49
Query: 62 IASQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAE 110
FE L ++ K+ K+KP+LE+W++EAE
Sbjct: 50 -----------------------FEALQLSFKNMCKLKPLLEKWLEEAE 75
>gnl|CDD|197673 smart00352, POU, Found in Pit-Oct-Unc transcription factors.
Length = 75
Score = 95.6 bits (238), Expect = 1e-25
Identities = 36/108 (33%), Positives = 51/108 (47%), Gaps = 34/108 (31%)
Query: 3 DGINLDEIKEFAKAFKLRRLSLGLTQTQVGQALSVTEGPAYSQSAICSALATQMLCAAQI 62
D + E++ FAK FK RR+ LG TQ VG AL GPA+SQ+ IC
Sbjct: 2 DDTDPRELEAFAKQFKQRRIKLGFTQADVGLALGALYGPAFSQTTICR------------ 49
Query: 63 ASQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAE 110
FE L ++ K+ K+KP+L++W++EAE
Sbjct: 50 ----------------------FEALQLSFKNMCKLKPLLQKWLEEAE 75
>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
Reviewed.
Length = 830
Score = 33.6 bits (77), Expect = 0.094
Identities = 13/27 (48%), Positives = 14/27 (51%)
Query: 55 QMLCAAQIASQQQQQQQQQHQQQQQQQ 81
Q LC QQ Q QQQ QQQ +Q
Sbjct: 788 QSLCQQSEMQQQPSQPQQQPQQQPAEQ 814
>gnl|CDD|222221 pfam13560, HTH_31, Helix-turn-helix domain. This domain is a
helix-turn-helix domain that probably binds to DNA.
Length = 63
Score = 30.2 bits (69), Expect = 0.13
Identities = 8/28 (28%), Positives = 13/28 (46%)
Query: 12 EFAKAFKLRRLSLGLTQTQVGQALSVTE 39
E + R GLTQ ++ + L V+
Sbjct: 1 ELGARLRRLRERAGLTQAELAERLGVSR 28
>gnl|CDD|222374 pfam13779, DUF4175, Domain of unknown function (DUF4175).
Length = 820
Score = 33.0 bits (76), Expect = 0.15
Identities = 17/65 (26%), Positives = 33/65 (50%), Gaps = 14/65 (21%)
Query: 64 SQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMK-----EAEE-----RQ 113
++Q Q+ QQ Q Q N+M ++T + Q++ +E M+ EA++ +Q
Sbjct: 513 AEQAQRNPQQGDQPPDQGNSM----ELTQQDLQRMMDRIEELMESGRRAEAQQLLEQLQQ 568
Query: 114 VMSNM 118
+M N+
Sbjct: 569 MMENL 573
Score = 31.4 bits (72), Expect = 0.47
Identities = 9/17 (52%), Positives = 9/17 (52%)
Query: 65 QQQQQQQQQHQQQQQQQ 81
Q Q Q Q QQ QQ Q
Sbjct: 628 QGGQGQGQPGQQGQQGQ 644
Score = 31.4 bits (72), Expect = 0.55
Identities = 10/18 (55%), Positives = 10/18 (55%)
Query: 65 QQQQQQQQQHQQQQQQQN 82
Q QQ QQ Q QQQ Q
Sbjct: 634 GQPGQQGQQGQGQQQGQQ 651
Score = 31.0 bits (71), Expect = 0.63
Identities = 9/19 (47%), Positives = 9/19 (47%)
Query: 64 SQQQQQQQQQHQQQQQQQN 82
Q QQ Q Q Q Q QQ
Sbjct: 622 QQGQQGQGGQGQGQPGQQG 640
Score = 31.0 bits (71), Expect = 0.67
Identities = 10/20 (50%), Positives = 10/20 (50%)
Query: 65 QQQQQQQQQHQQQQQQQNTM 84
Q Q Q QQ QQ Q QQ
Sbjct: 631 QGQGQPGQQGQQGQGQQQGQ 650
Score = 30.6 bits (70), Expect = 0.98
Identities = 10/25 (40%), Positives = 10/25 (40%)
Query: 65 QQQQQQQQQHQQQQQQQNTMFEKLD 89
Q QQ Q QQ QQ Q L
Sbjct: 639 QGQQGQGQQQGQQGQGGQGGQGSLA 663
Score = 29.9 bits (68), Expect = 1.4
Identities = 10/17 (58%), Positives = 10/17 (58%)
Query: 65 QQQQQQQQQHQQQQQQQ 81
QQ QQ Q QQQ QQ
Sbjct: 636 PGQQGQQGQGQQQGQQG 652
Score = 29.9 bits (68), Expect = 1.4
Identities = 10/24 (41%), Positives = 11/24 (45%)
Query: 65 QQQQQQQQQHQQQQQQQNTMFEKL 88
Q QQ QQ QQQ QQ +
Sbjct: 635 QPGQQGQQGQGQQQGQQGQGGQGG 658
Score = 29.9 bits (68), Expect = 1.5
Identities = 9/17 (52%), Positives = 9/17 (52%)
Query: 65 QQQQQQQQQHQQQQQQQ 81
QQ Q Q Q Q QQ Q
Sbjct: 625 QQGQGGQGQGQPGQQGQ 641
Score = 29.9 bits (68), Expect = 1.6
Identities = 9/19 (47%), Positives = 9/19 (47%)
Query: 65 QQQQQQQQQHQQQQQQQNT 83
Q Q QQ Q Q QQQ
Sbjct: 633 QGQPGQQGQQGQGQQQGQQ 651
Score = 29.5 bits (67), Expect = 1.9
Identities = 7/17 (41%), Positives = 7/17 (41%)
Query: 65 QQQQQQQQQHQQQQQQQ 81
Q Q Q Q QQ Q
Sbjct: 629 GGQGQGQPGQQGQQGQG 645
Score = 29.5 bits (67), Expect = 2.2
Identities = 10/22 (45%), Positives = 10/22 (45%)
Query: 60 AQIASQQQQQQQQQHQQQQQQQ 81
AQ QQ QQ Q Q Q Q
Sbjct: 617 AQRGEQQGQQGQGGQGQGQPGQ 638
Score = 29.1 bits (66), Expect = 3.1
Identities = 9/18 (50%), Positives = 9/18 (50%)
Query: 64 SQQQQQQQQQHQQQQQQQ 81
QQ Q QQQ Q Q Q
Sbjct: 640 GQQGQGQQQGQQGQGGQG 657
Score = 28.7 bits (65), Expect = 3.3
Identities = 8/17 (47%), Positives = 8/17 (47%)
Query: 65 QQQQQQQQQHQQQQQQQ 81
Q Q Q Q QQ Q Q
Sbjct: 630 GQGQGQPGQQGQQGQGQ 646
Score = 28.7 bits (65), Expect = 3.4
Identities = 11/21 (52%), Positives = 13/21 (61%)
Query: 61 QIASQQQQQQQQQHQQQQQQQ 81
+A QQ QQQ Q QQQ +Q
Sbjct: 738 AMAQQQGQQQGGQGQQQGRQG 758
Score = 28.7 bits (65), Expect = 3.6
Identities = 8/17 (47%), Positives = 8/17 (47%)
Query: 65 QQQQQQQQQHQQQQQQQ 81
Q Q Q Q QQ QQ
Sbjct: 626 QGQGGQGQGQPGQQGQQ 642
Score = 28.7 bits (65), Expect = 3.7
Identities = 10/25 (40%), Positives = 11/25 (44%)
Query: 59 AAQIASQQQQQQQQQHQQQQQQQNT 83
A + QQQ QQ Q QQQ
Sbjct: 732 ARALGEAMAQQQGQQQGGQGQQQGR 756
Score = 28.7 bits (65), Expect = 3.8
Identities = 8/17 (47%), Positives = 8/17 (47%)
Query: 65 QQQQQQQQQHQQQQQQQ 81
Q Q Q Q QQ QQ
Sbjct: 627 GQGGQGQGQPGQQGQQG 643
Score = 27.9 bits (63), Expect = 5.5
Identities = 8/18 (44%), Positives = 10/18 (55%)
Query: 65 QQQQQQQQQHQQQQQQQN 82
Q+Q Q+ QQ QQ Q
Sbjct: 612 QEQFNAQRGEQQGQQGQG 629
Score = 27.9 bits (63), Expect = 6.0
Identities = 7/30 (23%), Positives = 12/30 (40%)
Query: 55 QMLCAAQIASQQQQQQQQQHQQQQQQQNTM 84
QM+ Q+ Q Q + Q + T+
Sbjct: 568 QMMENLQVTQGGQGGQSEMQQAMEGLGETL 597
Score = 27.9 bits (63), Expect = 6.6
Identities = 11/36 (30%), Positives = 13/36 (36%), Gaps = 5/36 (13%)
Query: 59 AAQIASQQQQ-----QQQQQHQQQQQQQNTMFEKLD 89
A Q+ Q QQ Q Q Q Q + E L
Sbjct: 559 AQQLLEQLQQMMENLQVTQGGQGGQSEMQQAMEGLG 594
>gnl|CDD|237354 PRK13341, PRK13341, recombination factor protein RarA/unknown
domain fusion protein; Reviewed.
Length = 725
Score = 31.6 bits (72), Expect = 0.45
Identities = 19/89 (21%), Positives = 34/89 (38%), Gaps = 6/89 (6%)
Query: 21 RLSLGLTQTQVGQALSVTEGPAYSQSAICSALATQMLCAAQIASQQQQQQQQQHQQQQQQ 80
+L L + VG A ++ A+ SAL L A ++ + Q Q QQ +
Sbjct: 595 KLKLLYSIPAVGPAWALLSEDELLYKALYSAL----LEAEELWLKLQWDQSLL-QQSLEM 649
Query: 81 QNTMFEKLDITPKSAQKI-KPVLERWMKE 108
+ A I +++RW+ +
Sbjct: 650 PGWSTGIEQWPEELALGIDSKLIKRWLAQ 678
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 31.1 bits (71), Expect = 0.52
Identities = 10/49 (20%), Positives = 24/49 (48%), Gaps = 1/49 (2%)
Query: 65 QQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEERQ 113
++ +Q+ +++QQ++ E + +A K K R +K E+ +
Sbjct: 244 ERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSR-IKRLEKLE 291
>gnl|CDD|218813 pfam05929, Phage_GPO, Phage capsid scaffolding protein (GPO) serine
peptidase. This family consists of several
bacteriophage capsid scaffolding proteins (GPO) and some
related bacterial sequences. GPO is thought to function
in both the assembly of proheads and the cleavage of
GPN. The family is found to function as a serine
peptidase, with a conserved Asp, His and Ser catalytic
triad, as in subtilisin, and as represented in
MEROPS:S73. The family includes GpO from Enterobacteria
phage P2 which cleaves itself and then becomes the
scaffold protein upon which the bacteriophage prohead is
built - a mechanism quite common amongst phages.
Length = 277
Score = 30.6 bits (69), Expect = 0.59
Identities = 9/38 (23%), Positives = 16/38 (42%)
Query: 64 SQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPV 101
S +Q Q ++ + N + L T AQ+ +P
Sbjct: 227 SDLEQALNSQGDERDKAFNALKASLSHTDDYAQQARPP 264
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 30.8 bits (70), Expect = 0.59
Identities = 11/28 (39%), Positives = 17/28 (60%), Gaps = 2/28 (7%)
Query: 59 AAQIASQQQQQQQQQH--QQQQQQQNTM 84
AAQ A +++Q QQ+ +QQ QQ +
Sbjct: 183 AAQKAELEEKQSQQKTLLYEQQAQQQKL 210
>gnl|CDD|217451 pfam03249, TSA, Type specific antigen. There are several antigenic
variants in Rickettsia tsutsugamushi, and a
type-specific antigen (TSA) of 56-kilodaltons located on
the rickettsial surface is responsible for the
variation. TSA proteins are probably integral membrane
proteins.
Length = 502
Score = 30.6 bits (69), Expect = 0.75
Identities = 14/31 (45%), Positives = 16/31 (51%)
Query: 48 ICSALATQMLCAAQIASQQQQQQQQQHQQQQ 78
I +A A Q+ I Q QQQ Q Q QQ Q
Sbjct: 292 IGNAFANQIQLNFVIPQQAQQQGQGQQQQAQ 322
Score = 29.8 bits (67), Expect = 1.4
Identities = 12/20 (60%), Positives = 12/20 (60%)
Query: 67 QQQQQQQHQQQQQQQNTMFE 86
QQ QQQ QQQQ Q T E
Sbjct: 308 QQAQQQGQGQQQQAQATAQE 327
Score = 28.3 bits (63), Expect = 5.0
Identities = 11/20 (55%), Positives = 12/20 (60%)
Query: 61 QIASQQQQQQQQQHQQQQQQ 80
Q A QQ Q QQQQ Q Q+
Sbjct: 308 QQAQQQGQGQQQQAQATAQE 327
>gnl|CDD|225387 COG2831, FhaC, Hemolysin activation/secretion protein
[Intracellular trafficking and secretion].
Length = 554
Score = 30.4 bits (69), Expect = 0.90
Identities = 7/34 (20%), Positives = 16/34 (47%)
Query: 49 CSALATQMLCAAQIASQQQQQQQQQHQQQQQQQN 82
+AL AA + +QQQ ++ ++++
Sbjct: 13 SAALPPLSSAAALENPAEGNIRQQQDERARREEL 46
>gnl|CDD|220844 pfam10675, DUF2489, Protein of unknown function (DUF2489). This
is a bacterial family of uncharacterized proteins.
Length = 131
Score = 29.1 bits (66), Expect = 0.98
Identities = 6/30 (20%), Positives = 18/30 (60%)
Query: 61 QIASQQQQQQQQQHQQQQQQQNTMFEKLDI 90
++ Q++ Q++ Q + +Q ++ + E + I
Sbjct: 14 KLKKQKELQEKAQAEAKQARRARILESIRI 43
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 30.4 bits (68), Expect = 1.1
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 55 QMLCAAQIASQQQQQQQQQHQQQQQQ 80
Q+L A QQQQ + QQ QQQ+QQ
Sbjct: 77 QVLLKQTKAHQQQQLENQQRQQQRQQ 102
>gnl|CDD|240373 PTZ00345, PTZ00345, glycerol-3-phosphate dehydrogenase;
Provisional.
Length = 365
Score = 29.6 bits (67), Expect = 1.4
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 170 LPATNMPHFILASGQLVQGIQGAQLLI 196
LP +P I+A L + ++ A LLI
Sbjct: 72 LPGIKLPDNIVAVSDLKEAVEDADLLI 98
>gnl|CDD|236512 PRK09430, djlA, Dna-J like membrane chaperone protein; Provisional.
Length = 267
Score = 29.4 bits (67), Expect = 1.6
Identities = 15/49 (30%), Positives = 24/49 (48%), Gaps = 4/49 (8%)
Query: 55 QMLCAAQIASQQQQQQQQQHQQQQQQQNTM---FEKLDITPK-SAQKIK 99
Q+L Q + QQQQ QQ Q+ T+ ++ L ++ Q+IK
Sbjct: 170 QLLRMMQAGFRFQQQQGGGGYQQAQRGPTLEDAYKVLGVSESDDDQEIK 218
>gnl|CDD|238045 cd00093, HTH_XRE, Helix-turn-helix XRE-family like proteins.
Prokaryotic DNA binding proteins belonging to the
xenobiotic response element family of transcriptional
regulators.
Length = 58
Score = 27.1 bits (61), Expect = 1.7
Identities = 9/32 (28%), Positives = 15/32 (46%), Gaps = 1/32 (3%)
Query: 14 AKAFKLRRLSLGLTQTQVGQALSVTEGPAYSQ 45
+ K R GLTQ ++ + L V+ S+
Sbjct: 1 GERLKELRKEKGLTQEELAEKLGVSRS-TISR 31
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 28.9 bits (65), Expect = 2.2
Identities = 18/101 (17%), Positives = 34/101 (33%)
Query: 33 QALSVTEGPAYSQSAICSALATQMLCAAQIASQQQQQQQQQHQQQQQQQNTMFEKLDITP 92
A S + PA ++ + Q+ SQQQ Q+ Q QQQ N
Sbjct: 132 PANSPLDQPAQQRALQQLQQRYGAPASGQLPSQQQSAQKNDESQLQQQPNGETPPQQTDG 191
Query: 93 KSAQKIKPVLERWMKEAEERQVMSNMPGIISPPLISQNQNM 133
+ + ++ + Q + G + ++ Q +
Sbjct: 192 AGDDESEALVRLREADGTLEQRIKGAEGGGAMKVLKQPKKQ 232
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 29.3 bits (65), Expect = 2.3
Identities = 18/70 (25%), Positives = 25/70 (35%), Gaps = 7/70 (10%)
Query: 62 IASQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEERQVMSNMPGI 121
+A Q Q QQ QQ Q Q + + P+ Q +PV A + Q +
Sbjct: 772 VAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV-------APQPQYQQPQQPV 824
Query: 122 ISPPLISQNQ 131
P Q Q
Sbjct: 825 APQPQYQQPQ 834
>gnl|CDD|173487 PTZ00236, PTZ00236, mitochondrial import inner membrane translocase
subunit tim17; Provisional.
Length = 164
Score = 28.5 bits (64), Expect = 2.4
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 61 QIASQQQQQQQQQHQQQQQQQNTMFEK 87
Q+ + +QQ QQQQ ++Q ++ E+
Sbjct: 137 QMRTPRQQFQQQQEMEKQLEEEDKKEQ 163
Score = 27.7 bits (62), Expect = 3.9
Identities = 9/45 (20%), Positives = 20/45 (44%)
Query: 44 SQSAICSALATQMLCAAQIASQQQQQQQQQHQQQQQQQNTMFEKL 88
++AI + ++ I ++Q + + Q QQQQ+ +
Sbjct: 113 VRNAIFGGILLGIIELVSIGMNRRQMRTPRQQFQQQQEMEKQLEE 157
Score = 27.3 bits (61), Expect = 6.5
Identities = 7/25 (28%), Positives = 13/25 (52%)
Query: 60 AQIASQQQQQQQQQHQQQQQQQNTM 84
+ QQ QQQQ+ +Q +++
Sbjct: 138 MRTPRQQFQQQQEMEKQLEEEDKKE 162
>gnl|CDD|133900 PHA00455, PHA00455, hypothetical protein.
Length = 85
Score = 27.0 bits (60), Expect = 2.8
Identities = 14/65 (21%), Positives = 27/65 (41%), Gaps = 7/65 (10%)
Query: 140 LSLNSNAVSANFHPNVNPRTDLKN-------PSAGSTLPATNMPHFILASGQLVQGIQGA 192
+ L+ N + F N R+ +K P G T+P H ++ G+ + Q +
Sbjct: 1 MRLHFNKSNGIFSVRRNDRSTVKASEKHAKLPLIGDTVPLAPRVHALITRGKFEKATQKS 60
Query: 193 QLLIP 197
+ +P
Sbjct: 61 RPFVP 65
>gnl|CDD|222719 pfam14377, DUF4414, Domain of unknown function (DUF4414). This
family is frequently found on DNA binding proteins of
the URE-B1 type and on ligases.
Length = 105
Score = 27.7 bits (62), Expect = 2.9
Identities = 17/85 (20%), Positives = 29/85 (34%), Gaps = 9/85 (10%)
Query: 62 IASQQQQQQQQQHQQQQQQQNTMF-EKLDITPKSAQKIKPVLERWMKEAEERQVMSNMPG 120
IA Q + +Q ++ + E L P + E +E +ERQ +
Sbjct: 18 IAQQLRASVRQSTSASGEEPTEINPEFLAALPPEIR-----EEVLQQERQERQRRARE-- 70
Query: 121 IISPPLISQNQNMVNAPLNLSLNSN 145
I+ + Q M A +L
Sbjct: 71 -IAAAGAATPQEMDPASFLATLPPE 94
>gnl|CDD|179798 PRK04239, PRK04239, hypothetical protein; Provisional.
Length = 110
Score = 27.5 bits (62), Expect = 2.9
Identities = 8/22 (36%), Positives = 14/22 (63%)
Query: 60 AQIASQQQQQQQQQHQQQQQQQ 81
++ Q Q+QQQ Q +Q++ Q
Sbjct: 12 EELQKQAQEQQQAQEEQEEAQA 33
Score = 26.4 bits (59), Expect = 7.0
Identities = 8/17 (47%), Positives = 12/17 (70%)
Query: 65 QQQQQQQQQHQQQQQQQ 81
Q+QQQ Q+ Q++ Q Q
Sbjct: 18 AQEQQQAQEEQEEAQAQ 34
>gnl|CDD|113452 pfam04684, BAF1_ABF1, BAF1 / ABF1 chromatin reorganising factor.
ABF1 is a sequence-specific DNA binding protein involved
in transcription activation, gene silencing and
initiation of DNA replication. ABF1 is known to remodel
chromatin, and it is proposed that it mediates its
effects on transcription and gene expression by
modifying local chromatin architecture. These functions
require a conserved stretch of 20 amino acids in the
C-terminal region of ABF1 (amino acids 639 to 662 S.
cerevisiae). The N-terminal two thirds of the protein
are necessary for DNA binding, and the N-terminus (amino
acids 9 to 91 in S. cerevisiae) is thought to contain a
novel zinc-finger motif which may stabilise the protein
structure.
Length = 508
Score = 28.9 bits (64), Expect = 2.9
Identities = 8/20 (40%), Positives = 9/20 (45%)
Query: 61 QIASQQQQQQQQQHQQQQQQ 80
Q+ Q Q HQ QQQ
Sbjct: 309 QLPEGHQHDHQHHHQHQQQD 328
Score = 28.5 bits (63), Expect = 4.0
Identities = 10/17 (58%), Positives = 11/17 (64%)
Query: 65 QQQQQQQQQHQQQQQQQ 81
Q QQQ QHQ+Q QQ
Sbjct: 322 HQHQQQDLQHQEQHQQH 338
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 28.9 bits (65), Expect = 3.0
Identities = 15/91 (16%), Positives = 31/91 (34%), Gaps = 5/91 (5%)
Query: 9 EIKEFAKAFKLRRLSLGLTQTQVGQALSVTEGPAYSQSAICSALATQMLCAAQIASQQQQ 68
E E +LR L E + A+ A Q + A +
Sbjct: 439 EFNEEEYELELRLGRLKQRLDSATATPEELEQLEINDEALEKAQEEQ-----EQAEANVE 493
Query: 69 QQQQQHQQQQQQQNTMFEKLDITPKSAQKIK 99
Q Q + +Q +++++ E L + +++
Sbjct: 494 QLQSELRQLRKRRDEALEALQRAERRLLQLR 524
>gnl|CDD|206093 pfam13922, PHD_3, PHD domain of transcriptional enhancer, Asx.
This is the DNA-binding domain on the additional sex
combs-like 1 proteins. The Asx protein acts as an
enhancer of trithorax and polycomb in displaying
bidirectional homoeotic phenotypes in Drosophila,
suggesting that it is required for maintenance of both
activation and silencing of Hox genes. Asx is required
for normal adult haematopoiesis and its function
depends on its cellular context.
Length = 64
Score = 26.7 bits (59), Expect = 3.1
Identities = 5/19 (26%), Positives = 5/19 (26%)
Query: 64 SQQQQQQQQQHQQQQQQQN 82
Q QH QQ
Sbjct: 2 QAQNGSGHSQHLPQQNFSG 20
Score = 26.3 bits (58), Expect = 3.7
Identities = 8/17 (47%), Positives = 8/17 (47%)
Query: 66 QQQQQQQQHQQQQQQQN 82
QQ Q H Q QQN
Sbjct: 1 QQAQNGSGHSQHLPQQN 17
Score = 25.9 bits (57), Expect = 5.4
Identities = 4/19 (21%), Positives = 5/19 (26%)
Query: 64 SQQQQQQQQQHQQQQQQQN 82
Q Q QQ +
Sbjct: 1 QQAQNGSGHSQHLPQQNFS 19
>gnl|CDD|235310 PRK04750, ubiB, putative ubiquinone biosynthesis protein UbiB;
Reviewed.
Length = 537
Score = 28.7 bits (65), Expect = 3.2
Identities = 9/10 (90%), Positives = 9/10 (90%)
Query: 99 KPVLERWMKE 108
KP LERWMKE
Sbjct: 427 KPFLERWMKE 436
>gnl|CDD|197775 smart00530, HTH_XRE, Helix-turn-helix XRE-family like proteins.
Length = 56
Score = 26.3 bits (59), Expect = 3.3
Identities = 10/31 (32%), Positives = 15/31 (48%), Gaps = 6/31 (19%)
Query: 18 KLRRLSLGLTQTQVGQALSVTEGPAYSQSAI 48
K R GLTQ ++ + L V S+S +
Sbjct: 3 KELREEKGLTQEELAEKLGV------SRSTL 27
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 28.2 bits (63), Expect = 3.6
Identities = 16/67 (23%), Positives = 38/67 (56%), Gaps = 12/67 (17%)
Query: 65 QQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEER-------QVMSN 117
+QQQQ+++ ++++Q++ ++ + + A++ ++WMK ++R Q M +
Sbjct: 192 LKQQQQKREEERRKQRKKQ--QEEEERKQKAEEA---WQKWMKNVKQRPKPVPLNQGMDS 246
Query: 118 MPGIISP 124
+ G ISP
Sbjct: 247 LRGTISP 253
>gnl|CDD|227361 COG5028, COG5028, Vesicle coat complex COPII, subunit
SEC24/subunit SFB2/subunit SFB3 [Intracellular
trafficking and secretion].
Length = 861
Score = 28.6 bits (64), Expect = 3.8
Identities = 15/62 (24%), Positives = 22/62 (35%), Gaps = 1/62 (1%)
Query: 26 LTQTQVGQALSVTEGPAYSQSAICSALATQMLCAAQIASQQQQQ-QQQQHQQQQQQQNTM 84
Q+QV + ++ A A + A QM QQQ ++Q Q NT
Sbjct: 11 QAQSQVHTGAASSKKSARPHRAYANFSAGQMGMPPYTTPPLQQQSRRQIDQAATAMHNTG 70
Query: 85 FE 86
Sbjct: 71 AN 72
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 28.4 bits (63), Expect = 4.1
Identities = 13/52 (25%), Positives = 25/52 (48%)
Query: 67 QQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEERQVMSNM 118
Q+ H++++++ + E+L T K +K+K E KE E S +
Sbjct: 457 GHPQEHGHERKRKRGGELKEELIETLKLVKKLKEEQESLAKELEATAHKSEI 508
>gnl|CDD|131979 TIGR02933, nifM_nitrog, nitrogen fixation protein NifM. Members of
this protein family, found in a subset of
nitrogen-fixing bacteria, are the nitrogen fixation
protein NifM. NifM, homologous to peptidyl-prolyl
cis-trans isomerases, appears to be an accessory protein
for NifH, the Fe protein, also called component II or
dinitrogenase reductase, of nitrogenase [Central
intermediary metabolism, Nitrogen fixation].
Length = 256
Score = 27.9 bits (62), Expect = 4.5
Identities = 10/57 (17%), Positives = 22/57 (38%), Gaps = 5/57 (8%)
Query: 59 AAQIASQQQQQQQQQHQQQQQQQNTMFEK-----LDITPKSAQKIKPVLERWMKEAE 110
+++ Q QQ Q Q+Q+ + + + I P ++ L + + E
Sbjct: 16 PGELSPDQLQQFDQAWQRQRHIEQAVVRAADEIGVVIPPSLLEEAPQALAQALDEQA 72
>gnl|CDD|227911 COG5624, TAF61, Transcription initiation factor TFIID, subunit
TAF12 (also component of histone acetyltransferase SAGA)
[Transcription].
Length = 505
Score = 28.1 bits (62), Expect = 4.6
Identities = 6/20 (30%), Positives = 8/20 (40%)
Query: 61 QIASQQQQQQQQQHQQQQQQ 80
+ Q QQ Q+ QQ
Sbjct: 230 LVDRYPQFQQGQKQVLSPQQ 249
Score = 28.1 bits (62), Expect = 4.9
Identities = 5/23 (21%), Positives = 10/23 (43%)
Query: 60 AQIASQQQQQQQQQHQQQQQQQN 82
+ ++ + Q QQ Q+Q
Sbjct: 223 DGRSKRRLVDRYPQFQQGQKQVL 245
>gnl|CDD|181215 PRK08059, PRK08059, general stress protein 13; Validated.
Length = 123
Score = 27.3 bits (61), Expect = 4.7
Identities = 7/49 (14%), Positives = 20/49 (40%), Gaps = 14/49 (28%)
Query: 65 QQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEERQ 113
++++ + +Q NT+ +K LE W++++ +
Sbjct: 86 KRKKGKILIPNPSEQGFNTLRDK--------------LEEWIEQSNKTD 120
>gnl|CDD|152475 pfam12040, DUF3526, Domain of unknown function (DUF3526). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria. This domain is typically
between 149 to 170 amino acids in length. This domain
has a single completely conserved residue P that may be
functionally important.
Length = 158
Score = 27.3 bits (61), Expect = 4.9
Identities = 9/41 (21%), Positives = 19/41 (46%)
Query: 93 KSAQKIKPVLERWMKEAEERQVMSNMPGIISPPLISQNQNM 133
+ + P+ E + +Q ++ G++SP L Q +M
Sbjct: 59 YGDKVLAPLAEALDQALLAQQRLARRFGLLSPTLALQRASM 99
>gnl|CDD|176958 CHL00015, ndhE, NADH dehydrogenase subunit 4L.
Length = 101
Score = 26.9 bits (60), Expect = 5.1
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 131 QNMVNAPLNLSLNSNAVSANF 151
+NMV A + L L NAV+ NF
Sbjct: 25 RNMVRALMCLELILNAVNINF 45
>gnl|CDD|220815 pfam10577, UPF0560, Uncharacterized protein family UPF0560. This
family of proteins has no known function.
Length = 805
Score = 28.3 bits (63), Expect = 5.2
Identities = 13/47 (27%), Positives = 15/47 (31%), Gaps = 3/47 (6%)
Query: 38 TEGPAYSQSAICSA---LATQMLCAAQIASQQQQQQQQQHQQQQQQQ 81
E P Q + A L A S Q Q Q+ QQ Q
Sbjct: 523 GEHPLVGQQLMEPADQQSEIGTLQAELSHSHAQPQGQRLSAQQISGQ 569
>gnl|CDD|237256 PRK12901, secA, preprotein translocase subunit SecA; Reviewed.
Length = 1112
Score = 28.1 bits (63), Expect = 5.2
Identities = 4/31 (12%), Positives = 14/31 (45%)
Query: 60 AQIASQQQQQQQQQHQQQQQQQNTMFEKLDI 90
+I Q+ + ++ +++ +K +I
Sbjct: 1017 GEIPVQEAPEIREAAPERRLDPKYRTQKEEI 1047
>gnl|CDD|197277 cd09180, PLDc_N_DEXD_b, N-terminal putative catalytic domain of
uncharacterized prokaryotic and archeal HKD family
nucleases fused to a DEAD/DEAH box helicase domain.
N-terminal putative catalytic domain of uncharacterized
prokaryotic and archeal HKD family nucleases fused to a
DEAD/DEAH box helicase domain. All members of this
subfamily are uncharacterized. Other characterized
members of the superfamily that have a related domain
architecture ( containing a DEAD/DEAH box helicase
domain), include the DNA/RNA helicase superfamily II
(SF2) and Res-subunit of type III restriction
endonucleases. In addition to the helicase-like region,
members of this subfamily also contain one copy of the
conserved HKD motif (H-x-K-x(4)-D, where x represents
any amino acid residue) in the N-terminal putative
catalytic domain. The HKD motif characterizes the
phospholipase D (PLD, EC 3.1.4.4) superfamily. A few
family members contain additional domains, like a
C-terminal peptidase S24-like domain.
Length = 142
Score = 27.3 bits (61), Expect = 5.6
Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 3/65 (4%)
Query: 90 ITPKSAQKIKPVLER-WMKEAEERQVMSNMPGIISPPLISQNQNMVNAPLNLSLNSNAVS 148
I+ Q + P LE K R + S GI P + + + N + + + +N S
Sbjct: 24 ISESGIQLLIPELEELINKGVPIRIITSTYLGITEPKALRELADYDN--VEVKIYNNIKS 81
Query: 149 ANFHP 153
+FH
Sbjct: 82 ESFHA 86
>gnl|CDD|182380 PRK10328, PRK10328, DNA binding protein, nucleoid-associated;
Provisional.
Length = 134
Score = 27.1 bits (60), Expect = 5.7
Identities = 7/24 (29%), Positives = 18/24 (75%)
Query: 65 QQQQQQQQQHQQQQQQQNTMFEKL 88
++++QQQ++ ++Q++ NT E +
Sbjct: 42 EEEEQQQRELAERQEKINTWLELM 65
>gnl|CDD|148108 pfam06300, Tsp45I, Tsp45I type II restriction enzyme. This family
consists of several type II restriction enzymes.
Length = 261
Score = 27.6 bits (61), Expect = 5.9
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 77 QQQQQNTMFEKLDITPKSAQKIKPVLERWMKE 108
+ N + EKL ++ ++I P+ +RW KE
Sbjct: 98 YEMGLNKIIEKLSEPKETNRQIGPMFKRWYKE 129
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 27.9 bits (62), Expect = 6.0
Identities = 10/54 (18%), Positives = 23/54 (42%), Gaps = 1/54 (1%)
Query: 59 AAQIASQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEER 112
+ +++ ++Q+ Q +Q++ K A++ E K+AEE
Sbjct: 72 KLEQQAEEAEKQRAAEQARQKELEQRAAAEKAA-KQAEQAAKQAEEKQKQAEEA 124
>gnl|CDD|182559 PRK10573, PRK10573, type IV pilin biogenesis protein; Provisional.
Length = 399
Score = 27.5 bits (62), Expect = 6.7
Identities = 8/18 (44%), Positives = 11/18 (61%)
Query: 58 CAAQIASQQQQQQQQQHQ 75
C Q+A QQ+ QQQ +
Sbjct: 142 CCFQLARQQEAQQQLTKK 159
>gnl|CDD|226660 COG4197, COG4197, Uncharacterized protein conserved in bacteria,
prophage-related [Function unknown].
Length = 96
Score = 26.4 bits (58), Expect = 7.0
Identities = 12/46 (26%), Positives = 17/46 (36%), Gaps = 2/46 (4%)
Query: 16 AFKLRRLSLGLTQTQVGQ-ALSVTEGPAYSQSAICSALATQMLCAA 60
L RL LG++ V Q + A AI A + Q+
Sbjct: 15 QKALARL-LGVSPPSVNQWIKGRRQVAAERALAIERATSGQVSREE 59
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 27.7 bits (61), Expect = 7.2
Identities = 6/38 (15%), Positives = 16/38 (42%)
Query: 60 AQIASQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQK 97
AQ ++ ++Q + QQ+ + + D ++
Sbjct: 215 AQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQ 252
Score = 27.7 bits (61), Expect = 7.9
Identities = 10/51 (19%), Positives = 27/51 (52%)
Query: 60 AQIASQQQQQQQQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAE 110
+Q +++ QQ +++ ++Q + +K D +A K + + + +EA+
Sbjct: 208 SQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAK 258
>gnl|CDD|184536 PRK14145, PRK14145, heat shock protein GrpE; Provisional.
Length = 196
Score = 27.2 bits (60), Expect = 7.3
Identities = 19/62 (30%), Positives = 28/62 (45%), Gaps = 16/62 (25%)
Query: 64 SQQQQQQQQQHQQQ------------QQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEE 111
+ +Q QQ Q QQQ QQ++ E LDI AQ++K E + K E+
Sbjct: 29 PEDEQAQQNQPQQQTVDEIEELKQKLQQKEVEAQEYLDI----AQRLKAEFENYRKRTEK 84
Query: 112 RQ 113
+
Sbjct: 85 EK 86
>gnl|CDD|236795 PRK10920, PRK10920, putative uroporphyrinogen III
C-methyltransferase; Provisional.
Length = 390
Score = 27.4 bits (61), Expect = 8.0
Identities = 11/42 (26%), Positives = 21/42 (50%)
Query: 42 AYSQSAICSALATQMLCAAQIASQQQQQQQQQHQQQQQQQNT 83
A +Q+A ALA Q+ + Q+Q+ + +QQ + +
Sbjct: 62 AQNQTATNDALANQLTALQKAQESQKQELEGILKQQAKALDQ 103
Score = 27.4 bits (61), Expect = 9.0
Identities = 7/54 (12%), Positives = 22/54 (40%), Gaps = 4/54 (7%)
Query: 63 ASQQQQQQ----QQQHQQQQQQQNTMFEKLDITPKSAQKIKPVLERWMKEAEER 112
+ Q Q+ + Q+Q+ + ++ A + + L + + E +++
Sbjct: 68 TNDALANQLTALQKAQESQKQELEGILKQQAKALDQANRQQAALAKQLDELQQK 121
>gnl|CDD|188064 TIGR00559, pdxJ, pyridoxine 5'-phosphate synthase. PdxJ is
required in the biosynthesis of pyridoxine (vitamin B6),
a precursor to the enzyme cofactor pyridoxal phosphate.
ECOCYC describes the predicted reaction equation as
1-amino-propan-2-one-3-phosphate +
deoxyxylulose-5-phosphate = pyridoxine-5'-phosphate. The
product of that reaction is oxidized by PdxH to
pyridoxal 5'-phosphate [Biosynthesis of cofactors,
prosthetic groups, and carriers, Pyridoxine].
Length = 236
Score = 27.1 bits (60), Expect = 8.0
Identities = 8/22 (36%), Positives = 13/22 (59%)
Query: 4 GINLDEIKEFAKAFKLRRLSLG 25
G+N +K FA +L L++G
Sbjct: 190 GLNYHNVKAFAAIPELHELNIG 211
>gnl|CDD|140213 PTZ00186, PTZ00186, heat shock 70 kDa precursor protein;
Provisional.
Length = 657
Score = 27.7 bits (61), Expect = 8.1
Identities = 16/34 (47%), Positives = 17/34 (50%)
Query: 49 CSALATQMLCAAQIASQQQQQQQQQHQQQQQQQN 82
C Q AA S +QQQ QQQQQQQN
Sbjct: 619 CGRTEYQQAAAANSGSSSNSGEQQQQQQQQQQQN 652
>gnl|CDD|215443 PLN02829, PLN02829, Probable galacturonosyltransferase.
Length = 639
Score = 27.5 bits (61), Expect = 8.6
Identities = 24/117 (20%), Positives = 39/117 (33%), Gaps = 6/117 (5%)
Query: 4 GINLDEIKEFAKAFKLRRLSLGLTQTQVGQALSVTEGPAYSQS-----AICSALATQMLC 58
GI + + L L + LS T+ +SQ+ +
Sbjct: 72 GIVYSDNSSKTIEPDSQDLLLDKRGEHKARVLSATDDDTHSQTDDIIKQVTQKAGQDDSD 131
Query: 59 AAQIASQQQQQQQQQHQQQQQQQNTMFEKLDI-TPKSAQKIKPVLERWMKEAEERQV 114
+ SQ Q Q + + QQ EK+D P + K + M +A RQ+
Sbjct: 132 QQEKNSQSQSASQAESLEHVQQSAQTSEKVDEKEPLLTKTDKQTDQTVMPDARVRQL 188
>gnl|CDD|226115 COG3587, COG3587, Restriction endonuclease [Defense mechanisms].
Length = 985
Score = 27.4 bits (61), Expect = 8.7
Identities = 12/70 (17%), Positives = 30/70 (42%), Gaps = 1/70 (1%)
Query: 74 HQQQQQQQNTM-FEKLDITPKSAQKIKPVLERWMKEAEERQVMSNMPGIISPPLISQNQN 132
+Q+Q + F+ D+ + ++ A++ ++ N+ I S I+Q
Sbjct: 10 YQEQARAAILAVFKGQDLAEPKNYNQRSANPQFEIGADKDLLLENINNIQSKNGITQGSV 69
Query: 133 MVNAPLNLSL 142
++ LN+ +
Sbjct: 70 RIDDKLNIDI 79
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.125 0.342
Gapped
Lambda K H
0.267 0.0788 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,112,799
Number of extensions: 1071288
Number of successful extensions: 4406
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3212
Number of HSP's successfully gapped: 352
Length of query: 256
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 161
Effective length of database: 6,723,972
Effective search space: 1082559492
Effective search space used: 1082559492
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 58 (26.0 bits)