RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8548
(102 letters)
>gnl|CDD|222120 pfam13426, PAS_9, PAS domain.
Length = 101
Score = 58.2 bits (141), Expect = 1e-12
Identities = 19/65 (29%), Positives = 37/65 (56%), Gaps = 2/65 (3%)
Query: 16 YPIVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELKLEVIF 75
IVY +D L G+ R +++ K + + L+GP EE ++ +AL + E+++E+
Sbjct: 8 GRIVYANDAALRLLGYTREELLGK--SIRDLFGPGDDEEAVARLREALRNGGEVEVELEL 65
Query: 76 YKKNG 80
+K+G
Sbjct: 66 RRKDG 70
>gnl|CDD|237427 PRK13559, PRK13559, hypothetical protein; Provisional.
Length = 361
Score = 58.7 bits (142), Expect = 1e-11
Identities = 22/75 (29%), Positives = 42/75 (56%), Gaps = 3/75 (4%)
Query: 6 VLGNAQVPSLYPIVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALES 65
+ + P L PIV + F +L+G+A +++ G C+FL G T +I A+ +
Sbjct: 57 CITDPHQPDL-PIVLANQAFLDLTGYAAEEVV--GRNCRFLQGAATDPIAVAKIRAAIAA 113
Query: 66 KTELKLEVIFYKKNG 80
+ E+ +E++ Y+K+G
Sbjct: 114 EREIVVELLNYRKDG 128
>gnl|CDD|237425 PRK13557, PRK13557, histidine kinase; Provisional.
Length = 540
Score = 58.5 bits (142), Expect = 2e-11
Identities = 21/64 (32%), Positives = 39/64 (60%), Gaps = 2/64 (3%)
Query: 17 PIVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELKLEVIFY 76
PIV+ + F E++G+A +I+ G C+FL GP+T ++ A+ + E+ E++ Y
Sbjct: 54 PIVFANRAFLEMTGYAAEEII--GNNCRFLQGPETDRATVAEVRDAIAERREIATEILNY 111
Query: 77 KKNG 80
+K+G
Sbjct: 112 RKDG 115
>gnl|CDD|237426 PRK13558, PRK13558, bacterio-opsin activator; Provisional.
Length = 665
Score = 47.9 bits (114), Expect = 9e-08
Identities = 19/64 (29%), Positives = 41/64 (64%), Gaps = 2/64 (3%)
Query: 17 PIVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELKLEVIFY 76
P++Y +D F ++G++ +++ G C+FL G DT+EE ++ +A++ + +E+ Y
Sbjct: 172 PLIYINDAFERITGYSPDEVL--GRNCRFLQGEDTNEERVAELREAIDEERPTSVELRNY 229
Query: 77 KKNG 80
+K+G
Sbjct: 230 RKDG 233
>gnl|CDD|238075 cd00130, PAS, PAS domain; PAS motifs appear in archaea,
eubacteria and eukarya. Probably the most surprising
identification of a PAS domain was that in EAG-like
K+-channels. PAS domains have been found to bind
ligands, and to act as sensors for light and oxygen in
signal transduction.
Length = 103
Score = 41.1 bits (96), Expect = 5e-06
Identities = 16/63 (25%), Positives = 31/63 (49%), Gaps = 2/63 (3%)
Query: 18 IVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELKLEVIFYK 77
I+Y + +L G++ +++ K L P+ EE + ++E L + LEV +
Sbjct: 14 ILYANPAAEQLLGYSPEELIGKSLL--DLIHPEDREELRERLENLLSGGEPVTLEVRLRR 71
Query: 78 KNG 80
K+G
Sbjct: 72 KDG 74
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 34.2 bits (78), Expect = 0.007
Identities = 14/17 (82%), Positives = 17/17 (100%)
Query: 83 EEEEEEEEEEEKEEDKN 99
EEEEEEEEEEE+EE++N
Sbjct: 874 EEEEEEEEEEEEEEEEN 890
Score = 32.3 bits (73), Expect = 0.033
Identities = 14/19 (73%), Positives = 17/19 (89%)
Query: 83 EEEEEEEEEEEKEEDKNLL 101
EEEEEEEEEEE+EE++ L
Sbjct: 876 EEEEEEEEEEEEEENEEPL 894
Score = 31.9 bits (72), Expect = 0.040
Identities = 13/16 (81%), Positives = 16/16 (100%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEEEE+EE++
Sbjct: 870 EEEEEEEEEEEEEEEE 885
Score = 31.9 bits (72), Expect = 0.041
Identities = 13/16 (81%), Positives = 16/16 (100%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEEEE+EE++
Sbjct: 872 EEEEEEEEEEEEEEEE 887
Score = 31.9 bits (72), Expect = 0.044
Identities = 14/19 (73%), Positives = 17/19 (89%)
Query: 80 GYHEEEEEEEEEEEKEEDK 98
G EEEEEEEEEEE+EE++
Sbjct: 861 GDSEEEEEEEEEEEEEEEE 879
Score = 31.9 bits (72), Expect = 0.045
Identities = 13/16 (81%), Positives = 16/16 (100%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEEEE+EE++
Sbjct: 873 EEEEEEEEEEEEEEEE 888
Score = 31.5 bits (71), Expect = 0.048
Identities = 13/16 (81%), Positives = 16/16 (100%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEEEE+EE++
Sbjct: 871 EEEEEEEEEEEEEEEE 886
Score = 31.5 bits (71), Expect = 0.049
Identities = 13/16 (81%), Positives = 16/16 (100%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEEEE+EE++
Sbjct: 867 EEEEEEEEEEEEEEEE 882
Score = 31.5 bits (71), Expect = 0.053
Identities = 13/16 (81%), Positives = 16/16 (100%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEEEE+EE++
Sbjct: 866 EEEEEEEEEEEEEEEE 881
Score = 31.5 bits (71), Expect = 0.053
Identities = 13/16 (81%), Positives = 16/16 (100%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEEEE+EE++
Sbjct: 869 EEEEEEEEEEEEEEEE 884
Score = 31.1 bits (70), Expect = 0.066
Identities = 13/16 (81%), Positives = 16/16 (100%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEEEE+EE++
Sbjct: 868 EEEEEEEEEEEEEEEE 883
Score = 31.1 bits (70), Expect = 0.084
Identities = 13/16 (81%), Positives = 16/16 (100%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEEEE+EE++
Sbjct: 865 EEEEEEEEEEEEEEEE 880
Score = 29.6 bits (66), Expect = 0.23
Identities = 13/20 (65%), Positives = 17/20 (85%)
Query: 79 NGYHEEEEEEEEEEEKEEDK 98
+G EEEEEEEEEE+EE++
Sbjct: 859 DGGDSEEEEEEEEEEEEEEE 878
>gnl|CDD|232930 TIGR00342, TIGR00342, tRNA sulfurtransferase ThiI. Members of this
protein family are "ThiI", a sulfurtransferase involved
in 4-thiouridine modification of tRNA. This protein
often is bifunctional, with genetically separable
activities, where the C-terminal rhodanese-like domain
(residues 385 to 482 in E. coli ThiI), a domain not
included in this model, is sufficient to synthesize the
thiazole moiety of thiamine (see TIGR04271). Note that
ThiI, because of its role in tRNA modification, may
occur in species (such as Mycoplasma genitalium) that
lack de novo thiamine biosynthesis [Biosynthesis of
cofactors, prosthetic groups, and carriers, Thiamine,
Protein synthesis, tRNA and rRNA base modification].
Length = 371
Score = 31.2 bits (71), Expect = 0.055
Identities = 17/61 (27%), Positives = 26/61 (42%), Gaps = 5/61 (8%)
Query: 22 SDGFCELSGFARAQIMQKGC---ACKFLYGPDTSEEHKTQIEKALESKTELKLEVIFYKK 78
S G S A +M++GC A F P SE+ + ++E+ S E V Y
Sbjct: 180 SGGID--SPVAAFMMMKRGCRVVAVHFFNEPAASEKAREKVERLANSLNETGGSVKLYVF 237
Query: 79 N 79
+
Sbjct: 238 D 238
>gnl|CDD|225112 COG2202, AtoS, FOG: PAS/PAC domain [Signal transduction
mechanisms].
Length = 232
Score = 30.6 bits (67), Expect = 0.11
Identities = 12/67 (17%), Positives = 26/67 (38%)
Query: 18 IVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELKLEVIFYK 77
I+Y + EL G++ + + +G + + E + A L++E +
Sbjct: 134 ILYANPAAEELLGYSPEEELGRGLSDLIHPEDEERRELELARALAEGRGGPLEIEYRVRR 193
Query: 78 KNGYHEE 84
K+G
Sbjct: 194 KDGERVR 200
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 29.8 bits (68), Expect = 0.17
Identities = 12/15 (80%), Positives = 13/15 (86%)
Query: 83 EEEEEEEEEEEKEED 97
EEEEEEEEEE EE+
Sbjct: 306 EEEEEEEEEEPSEEE 320
Score = 29.1 bits (66), Expect = 0.30
Identities = 13/16 (81%), Positives = 14/16 (87%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEEEE+EE
Sbjct: 302 EEEEEEEEEEEEEEPS 317
Score = 29.1 bits (66), Expect = 0.30
Identities = 12/16 (75%), Positives = 14/16 (87%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEEE+EE+
Sbjct: 301 AEEEEEEEEEEEEEEP 316
Score = 29.1 bits (66), Expect = 0.33
Identities = 12/15 (80%), Positives = 13/15 (86%)
Query: 83 EEEEEEEEEEEKEED 97
EEEEEEEEEEE E+
Sbjct: 305 EEEEEEEEEEEPSEE 319
Score = 27.9 bits (63), Expect = 0.95
Identities = 11/15 (73%), Positives = 13/15 (86%)
Query: 83 EEEEEEEEEEEKEED 97
EEEEEEEEEEE+ +
Sbjct: 304 EEEEEEEEEEEEPSE 318
Score = 26.8 bits (60), Expect = 2.0
Identities = 11/16 (68%), Positives = 14/16 (87%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEE+EE++
Sbjct: 300 AAEEEEEEEEEEEEEE 315
>gnl|CDD|214512 smart00091, PAS, PAS domain. PAS motifs appear in archaea,
eubacteria and eukarya. Probably the most surprising
identification of a PAS domain was that in EAG-like
K+-channels.
Length = 67
Score = 27.7 bits (62), Expect = 0.32
Identities = 10/47 (21%), Positives = 23/47 (48%), Gaps = 2/47 (4%)
Query: 18 IVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALE 64
I+Y + EL G++ +++ K + L P+ E + +++ L
Sbjct: 23 ILYANPAAEELLGYSPEELIGK--SLLELIHPEDRERVQEALQRLLS 67
>gnl|CDD|232884 TIGR00229, sensory_box, PAS domain S-box. The PAS domain was
previously described. This sensory box, or S-box domain
occupies the central portion of the PAS domain but is
more widely distributed. It is often tandemly repeated.
Known prosthetic groups bound in the S-box domain
include heme in the oxygen sensor FixL, FAD in the
redox potential sensor NifL, and a 4-hydroxycinnamyl
chromophore in photoactive yellow protein. Proteins
containing the domain often contain other regulatory
domains such as response regulator or sensor histidine
kinase domains. Other S-box proteins include
phytochromes and the aryl hydrocarbon receptor nuclear
translocator [Regulatory functions, Small molecule
interactions].
Length = 124
Score = 28.8 bits (65), Expect = 0.32
Identities = 16/64 (25%), Positives = 32/64 (50%), Gaps = 3/64 (4%)
Query: 18 IVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELK-LEVIFY 76
I+Y + F E+ G++ +++ + L + EE + +IE+ LE + E E
Sbjct: 25 ILYVNPAFEEIFGYSAEELIGRNVL--ELIPEEDREEVRERIERRLEGEREPVSEERRVR 82
Query: 77 KKNG 80
+K+G
Sbjct: 83 RKDG 86
>gnl|CDD|233035 TIGR00585, mutl, DNA mismatch repair protein MutL. All proteins in
this family for which the functions are known are
involved in the process of generalized mismatch repair.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 312
Score = 28.4 bits (64), Expect = 0.54
Identities = 9/26 (34%), Positives = 12/26 (46%)
Query: 6 VLGNAQVPSLYPIVYCSDGFCELSGF 31
V G A + L P+ D +L GF
Sbjct: 215 VFGTAVLRKLIPLDEWEDLDLQLEGF 240
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 27.2 bits (61), Expect = 0.70
Identities = 10/15 (66%), Positives = 15/15 (100%)
Query: 83 EEEEEEEEEEEKEED 97
EE++EEEEEEE+++D
Sbjct: 68 EEKKEEEEEEEEDDD 82
Score = 26.8 bits (60), Expect = 0.85
Identities = 11/15 (73%), Positives = 14/15 (93%)
Query: 83 EEEEEEEEEEEKEED 97
EEE++EEEEE+EED
Sbjct: 66 AEEEKKEEEEEEEED 80
Score = 26.8 bits (60), Expect = 1.2
Identities = 11/15 (73%), Positives = 15/15 (100%)
Query: 83 EEEEEEEEEEEKEED 97
EEE++EEEEEE+E+D
Sbjct: 67 EEEKKEEEEEEEEDD 81
Score = 25.3 bits (56), Expect = 3.7
Identities = 9/15 (60%), Positives = 13/15 (86%)
Query: 83 EEEEEEEEEEEKEED 97
EEE++EEEE+EE+
Sbjct: 65 AAEEEKKEEEEEEEE 79
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 27.9 bits (62), Expect = 0.72
Identities = 10/14 (71%), Positives = 11/14 (78%)
Query: 82 HEEEEEEEEEEEKE 95
H +EEEEEEEE E
Sbjct: 93 HTRQEEEEEEEENE 106
Score = 27.5 bits (61), Expect = 0.89
Identities = 10/14 (71%), Positives = 13/14 (92%)
Query: 83 EEEEEEEEEEEKEE 96
+EEEEEEEE EK++
Sbjct: 96 QEEEEEEEENEKQQ 109
Score = 25.2 bits (55), Expect = 5.2
Identities = 9/15 (60%), Positives = 11/15 (73%)
Query: 79 NGYHEEEEEEEEEEE 93
G+ +EEEEEEEE
Sbjct: 91 TGHTRQEEEEEEEEN 105
Score = 24.8 bits (54), Expect = 8.3
Identities = 9/14 (64%), Positives = 13/14 (92%)
Query: 83 EEEEEEEEEEEKEE 96
EEEEEEEE E++++
Sbjct: 97 EEEEEEEENEKQQQ 110
Score = 24.8 bits (54), Expect = 8.5
Identities = 9/17 (52%), Positives = 14/17 (82%)
Query: 83 EEEEEEEEEEEKEEDKN 99
EEEEEEE E++++ D+
Sbjct: 98 EEEEEEENEKQQQSDEA 114
>gnl|CDD|235206 PRK04031, PRK04031, DNA primase; Provisional.
Length = 408
Score = 27.9 bits (63), Expect = 0.76
Identities = 17/46 (36%), Positives = 26/46 (56%), Gaps = 2/46 (4%)
Query: 56 KTQIEKALESKTELK--LEVIFYKKNGYHEEEEEEEEEEEKEEDKN 99
K +I KAL +K ++ LE + K E+ +EEEE+ EKE +
Sbjct: 253 KKEIAKALRNKVPVEQYLEELGKKAQKAAEKVKEEEEKPEKEPAEQ 298
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 27.8 bits (62), Expect = 0.92
Identities = 12/15 (80%), Positives = 15/15 (100%)
Query: 83 EEEEEEEEEEEKEED 97
EEEE+EEEEEE++ED
Sbjct: 391 EEEEDEEEEEEEDED 405
Score = 27.4 bits (61), Expect = 1.4
Identities = 11/15 (73%), Positives = 14/15 (93%)
Query: 83 EEEEEEEEEEEKEED 97
EEE+EEEEEEE E++
Sbjct: 392 EEEDEEEEEEEDEDE 406
Score = 26.6 bits (59), Expect = 2.3
Identities = 15/48 (31%), Positives = 26/48 (54%), Gaps = 3/48 (6%)
Query: 50 DTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEED 97
E + I A + L++E +++ EE+EEEEEEE+++E
Sbjct: 363 KAKETEQDYILDAFSA---LEIEDANTERDDEEEEDEEEEEEEDEDEG 407
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 27.2 bits (61), Expect = 0.95
Identities = 12/15 (80%), Positives = 14/15 (93%)
Query: 83 EEEEEEEEEEEKEED 97
EEEEEEEE+EE EE+
Sbjct: 82 EEEEEEEEKEESEEE 96
Score = 26.8 bits (60), Expect = 1.1
Identities = 10/16 (62%), Positives = 15/16 (93%)
Query: 83 EEEEEEEEEEEKEEDK 98
EE++EEEEEEE++E+
Sbjct: 78 EEKKEEEEEEEEKEES 93
Score = 26.1 bits (58), Expect = 2.6
Identities = 9/16 (56%), Positives = 15/16 (93%)
Query: 83 EEEEEEEEEEEKEEDK 98
EE++EEEEEE+E+++
Sbjct: 77 AEEKKEEEEEEEEKEE 92
Score = 25.7 bits (57), Expect = 3.3
Identities = 10/15 (66%), Positives = 14/15 (93%)
Query: 83 EEEEEEEEEEEKEED 97
+EEEEEEEE+E+ E+
Sbjct: 81 KEEEEEEEEKEESEE 95
Score = 24.9 bits (55), Expect = 5.2
Identities = 9/15 (60%), Positives = 14/15 (93%)
Query: 83 EEEEEEEEEEEKEED 97
++EEEEEEEE++E +
Sbjct: 80 KKEEEEEEEEKEESE 94
Score = 24.9 bits (55), Expect = 5.9
Identities = 9/16 (56%), Positives = 13/16 (81%)
Query: 83 EEEEEEEEEEEKEEDK 98
EE++EEEE+EE+K
Sbjct: 75 AAAEEKKEEEEEEEEK 90
Score = 24.5 bits (54), Expect = 8.3
Identities = 11/15 (73%), Positives = 13/15 (86%)
Query: 83 EEEEEEEEEEEKEED 97
EEEEEEE+EE +EE
Sbjct: 83 EEEEEEEKEESEEEA 97
Score = 24.5 bits (54), Expect = 9.5
Identities = 9/16 (56%), Positives = 13/16 (81%)
Query: 83 EEEEEEEEEEEKEEDK 98
EE++EEEEE+EE +
Sbjct: 76 AAEEKKEEEEEEEEKE 91
>gnl|CDD|205914 pfam13740, ACT_6, ACT domain. ACT domains bind to amino acids
and regulate associated enzyme domains.
Length = 76
Score = 26.3 bits (59), Expect = 1.2
Identities = 9/26 (34%), Positives = 13/26 (50%), Gaps = 1/26 (3%)
Query: 52 SEEHKTQIEKALESK-TELKLEVIFY 76
+ ++EKAL EL L+V F
Sbjct: 51 PWDAIARLEKALLFLAQELGLQVRFK 76
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 27.3 bits (60), Expect = 1.3
Identities = 13/45 (28%), Positives = 29/45 (64%)
Query: 56 KTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKNL 100
+ QIEKA + + EL+ + + ++ +E E++EE+E + E++ +
Sbjct: 125 QEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTI 169
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 27.6 bits (61), Expect = 1.3
Identities = 12/14 (85%), Positives = 14/14 (100%)
Query: 83 EEEEEEEEEEEKEE 96
E+EEEEEEEEE+EE
Sbjct: 230 EDEEEEEEEEEEEE 243
Score = 26.8 bits (59), Expect = 1.9
Identities = 12/15 (80%), Positives = 14/15 (93%)
Query: 83 EEEEEEEEEEEKEED 97
+EEEEEEEEEE+E D
Sbjct: 231 DEEEEEEEEEEEEID 245
Score = 26.8 bits (59), Expect = 2.0
Identities = 12/15 (80%), Positives = 15/15 (100%)
Query: 83 EEEEEEEEEEEKEED 97
EE+EEEEEEEE+EE+
Sbjct: 229 EEDEEEEEEEEEEEE 243
Score = 25.3 bits (55), Expect = 7.1
Identities = 11/15 (73%), Positives = 14/15 (93%)
Query: 83 EEEEEEEEEEEKEED 97
E EE+EEEEEE+EE+
Sbjct: 227 ESEEDEEEEEEEEEE 241
Score = 24.9 bits (54), Expect = 8.3
Identities = 11/15 (73%), Positives = 14/15 (93%)
Query: 83 EEEEEEEEEEEKEED 97
EE+EEEEEEE+EE+
Sbjct: 228 SEEDEEEEEEEEEEE 242
>gnl|CDD|224221 COG1302, COG1302, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 131
Score = 26.9 bits (60), Expect = 1.3
Identities = 14/54 (25%), Positives = 32/54 (59%), Gaps = 3/54 (5%)
Query: 47 YG---PDTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEED 97
YG P+ +E + ++++ +E+ T LK+ + G ++E++EEEE +++
Sbjct: 77 YGVKIPEVAENIQERVKEEVENMTGLKVVEVNVHVVGVKVKKEQKEEEESFDDE 130
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 27.3 bits (61), Expect = 1.4
Identities = 12/19 (63%), Positives = 15/19 (78%)
Query: 79 NGYHEEEEEEEEEEEKEED 97
+ EEEEEEEEEE+EE+
Sbjct: 400 ERWAAEEEEEEEEEEEEEE 418
Score = 26.2 bits (58), Expect = 3.3
Identities = 13/14 (92%), Positives = 14/14 (100%)
Query: 83 EEEEEEEEEEEKEE 96
EEEEEEEEEEE+EE
Sbjct: 406 EEEEEEEEEEEEEE 419
>gnl|CDD|153040 pfam12606, RELT, Tumour necrosis factor receptor superfamily
member 19. This family of proteins is found in
bacteria and eukaryotes. Proteins in this family are
typically between 49 and 288 amino acids in length.
There are two completely conserved residues (K and Y)
that may be functionally important. The members of
tumor necrosis factor receptor (TNFR) superfamily have
been designated as the "guardians of the immune system"
due to their roles in immune cell proliferation,
differentiation, activation, and death (apoptosis). The
messenger RNA of RELT is especially abundant in
hematologic tissues such as spleen, lymph node, and
peripheral blood leukocytes as well as in leukemias and
lymphomas. RELT is able to activate the NF-kappaB
pathway and selectively binds tumor necrosis factor
receptor-associated factor 1.
Length = 50
Score = 25.7 bits (57), Expect = 1.4
Identities = 10/26 (38%), Positives = 16/26 (61%), Gaps = 3/26 (11%)
Query: 77 KKNGYH---EEEEEEEEEEEKEEDKN 99
KK GY + E+E EE+++EE +
Sbjct: 25 KKKGYRCTVDPLEDEPEEKKEEEKEE 50
>gnl|CDD|234575 PRK00016, PRK00016, metal-binding heat shock protein; Provisional.
Length = 159
Score = 27.1 bits (61), Expect = 1.4
Identities = 10/20 (50%), Positives = 12/20 (60%), Gaps = 1/20 (5%)
Query: 79 NGY-HEEEEEEEEEEEKEED 97
GY H E+EE EE EE+
Sbjct: 125 LGYDHIEDEEAEEMFGLEEE 144
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 27.1 bits (60), Expect = 1.5
Identities = 14/50 (28%), Positives = 24/50 (48%), Gaps = 3/50 (6%)
Query: 50 DTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKN 99
D EE EK+L +E + ++ E++EE +EE EK++
Sbjct: 315 DIHEEIPKLDEKSL---SEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVK 361
>gnl|CDD|219822 pfam08401, DUF1738, Domain of unknown function (DUF1738). This
region is found in a number of bacterial hypothetical
proteins. Some members are annotated as being similar to
replication primases, and in fact this region is often
found together with the Toprim domain (pfam01751).
Length = 123
Score = 26.8 bits (60), Expect = 1.5
Identities = 9/26 (34%), Positives = 19/26 (73%), Gaps = 2/26 (7%)
Query: 73 VIFYKKNGYHEEEEEEEEEEEKEEDK 98
V+++K + +EEE+E+ EE + E++
Sbjct: 89 VVYWKP--FEKEEEDEDGEEGEPEER 112
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 27.3 bits (61), Expect = 1.6
Identities = 13/20 (65%), Positives = 13/20 (65%)
Query: 83 EEEEEEEEEEEKEEDKNLLK 102
EEEEEE EE E EE LL
Sbjct: 359 EEEEEEPEEPEPEEGPPLLT 378
Score = 26.2 bits (58), Expect = 3.5
Identities = 11/15 (73%), Positives = 15/15 (100%)
Query: 83 EEEEEEEEEEEKEED 97
EEE+E+EEEEE+EE+
Sbjct: 350 EEEQEDEEEEEEEEE 364
Score = 25.0 bits (55), Expect = 7.8
Identities = 11/16 (68%), Positives = 15/16 (93%)
Query: 83 EEEEEEEEEEEKEEDK 98
EE+E+EEEEEE+EE +
Sbjct: 351 EEQEDEEEEEEEEEPE 366
Score = 25.0 bits (55), Expect = 8.5
Identities = 11/23 (47%), Positives = 18/23 (78%)
Query: 77 KKNGYHEEEEEEEEEEEKEEDKN 99
+K+ E+E+EEEEEEE+E ++
Sbjct: 346 QKDEEEEQEDEEEEEEEEEPEEP 368
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 26.9 bits (60), Expect = 1.7
Identities = 11/15 (73%), Positives = 13/15 (86%)
Query: 83 EEEEEEEEEEEKEED 97
EEEE EEEEE+EE+
Sbjct: 311 EEEEVPEEEEEEEEE 325
Score = 26.1 bits (58), Expect = 3.7
Identities = 11/15 (73%), Positives = 13/15 (86%)
Query: 83 EEEEEEEEEEEKEED 97
EEEEE EEEE+EE+
Sbjct: 310 EEEEEVPEEEEEEEE 324
Score = 25.8 bits (57), Expect = 4.4
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 83 EEEEEEEEEEEKEEDKNLL 101
EE EEEEEEE+EE++
Sbjct: 313 EEVPEEEEEEEEEEERTFE 331
Score = 25.4 bits (56), Expect = 7.6
Identities = 11/14 (78%), Positives = 12/14 (85%)
Query: 83 EEEEEEEEEEEKEE 96
EEEEEE EEE+EE
Sbjct: 309 EEEEEEVPEEEEEE 322
Score = 25.0 bits (55), Expect = 9.6
Identities = 11/15 (73%), Positives = 13/15 (86%)
Query: 83 EEEEEEEEEEEKEED 97
EEEEEEEEE +EE+
Sbjct: 306 EEEEEEEEEVPEEEE 320
>gnl|CDD|235359 PRK05182, PRK05182, DNA-directed RNA polymerase subunit alpha;
Provisional.
Length = 310
Score = 27.0 bits (61), Expect = 1.7
Identities = 13/20 (65%), Positives = 14/20 (70%)
Query: 83 EEEEEEEEEEEKEEDKNLLK 102
EE EEE EEEE E D LL+
Sbjct: 230 EEAEEEVEEEEPEFDPILLR 249
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 26.9 bits (59), Expect = 2.0
Identities = 10/19 (52%), Positives = 12/19 (63%)
Query: 81 YHEEEEEEEEEEEKEEDKN 99
+ EE E EEE E ED+N
Sbjct: 378 RNPEESEHEEEVEDYEDEN 396
>gnl|CDD|165423 PHA03152, PHA03152, hypothetical protein; Provisional.
Length = 138
Score = 26.1 bits (57), Expect = 2.6
Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 3/40 (7%)
Query: 5 FVLGNAQVPSLYPI--VYCSDGFCELSGFARAQIMQKGCA 42
F L +PS+Y I +Y +D GF+ AQ+ Q+ A
Sbjct: 30 FALNETLIPSIYDINWIYITDPH-TCGGFSIAQVFQRRSA 68
>gnl|CDD|182533 PRK10542, PRK10542, glutathionine S-transferase; Provisional.
Length = 201
Score = 26.2 bits (58), Expect = 2.8
Identities = 10/21 (47%), Positives = 13/21 (61%)
Query: 46 LYGPDTSEEHKTQIEKALESK 66
L+ PDT EE+K + LE K
Sbjct: 112 LFRPDTPEEYKPTVRAQLEKK 132
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 26.4 bits (59), Expect = 2.8
Identities = 14/51 (27%), Positives = 21/51 (41%), Gaps = 4/51 (7%)
Query: 49 PDTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKN 99
P +E + + E S E K +K E++ E+EE EK K
Sbjct: 391 PLLAEGEEEEGENGNLSPAERKK----LRKKQRKAEKKAEKEEAEKAAAKK 437
>gnl|CDD|223396 COG0319, COG0319, Predicted metal-dependent hydrolase [General
function prediction only].
Length = 153
Score = 26.1 bits (58), Expect = 2.9
Identities = 14/19 (73%), Positives = 16/19 (84%), Gaps = 1/19 (5%)
Query: 80 GY-HEEEEEEEEEEEKEED 97
GY HEE+EEEEE EE EE+
Sbjct: 121 GYDHEEDEEEEEMEELEEE 139
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 26.3 bits (58), Expect = 2.9
Identities = 21/53 (39%), Positives = 28/53 (52%), Gaps = 2/53 (3%)
Query: 46 LYGPDTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDK 98
LY +EE T I++ E E V+ ++ EEEEEEEEEEE E +
Sbjct: 370 LYDKIATEEDATTIDELREFLKEKGHPVV--ERWAEEEEEEEEEEEEEAAEAE 420
>gnl|CDD|216228 pfam00989, PAS, PAS fold. The PAS fold corresponds to the
structural domain that has previously been defined as
PAS and PAC motifs. The PAS fold appears in archaea,
eubacteria and eukarya.
Length = 113
Score = 25.8 bits (57), Expect = 3.3
Identities = 13/53 (24%), Positives = 21/53 (39%), Gaps = 6/53 (11%)
Query: 18 IVYCSDGFCELSGFARAQIMQKGCACKF--LYGPDTSEEHKTQIEKALESKTE 68
I+Y + EL G +R +++ K L D E + +AL E
Sbjct: 23 ILYVNAAAEELLGLSREEVIGK----SLLDLIPEDDDAEVAELLRQALLQGEE 71
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more
internally with limited reactivity in the C-terminal
domains, while P2 proteins seem to be more externally
located and are more likely to interact with other
cellular components. In lower eukaryotes, P1 and P2 are
further subdivided into P1A, P1B, P2A, and P2B, which
form P1A/P2B and P1B/P2A heterodimers. Some plant
species have a third P-protein, called P3, which is not
homologous to P1 and P2. In humans, P1 and P2 are
strongly autoimmunogenic. They play a significant role
in the etiology and pathogenesis of systemic lupus
erythema (SLE). In addition, the ribosome-inactivating
protein trichosanthin (TCS) interacts with human P0,
P1, and P2, with its primary binding site located in
the C-terminal region of P2. TCS inactivates the
ribosome by depurinating a specific adenine in the
sarcin-ricin loop of 28S rRNA.
Length = 103
Score = 25.7 bits (57), Expect = 3.4
Identities = 9/15 (60%), Positives = 14/15 (93%)
Query: 83 EEEEEEEEEEEKEED 97
+EE++EEEEEE ++D
Sbjct: 83 KEEKKEEEEEESDDD 97
>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
Length = 1176
Score = 26.0 bits (58), Expect = 3.6
Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 7/54 (12%)
Query: 53 EEHKTQIEKA----LESKTELKLEVIFYKKNG---YHEEEEEEEEEEEKEEDKN 99
+E+K LE+ L+LE F +K ++ E E+ EEKEE+ N
Sbjct: 827 KEYKKSKGYVLVIKLENGGGLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEELN 880
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 25.4 bits (56), Expect = 3.9
Identities = 10/15 (66%), Positives = 12/15 (80%)
Query: 83 EEEEEEEEEEEKEED 97
E EEEE+EEE +EE
Sbjct: 85 EAEEEEKEEEAEEES 99
>gnl|CDD|239210 cd02844, PAZ_CAF_like, PAZ domain, CAF_like subfamily. CAF (for
carpel factory) is a plant homolog of Dicer. CAF has
been implicated in flower morphogenesis and in early
Arabidopsis development and might function through
posttranscriptional regulation of specific mRNA
molecules. PAZ domains are named after the proteins
Piwi, Argonaut, and Zwille. PAZ is found in two families
of proteins that are essential components of
RNA-mediated gene-silencing pathways, including RNA
interference, the Piwi and Dicer families. PAZ functions
as a nucleic-acid binding domain, with a strong
preference for single-stranded nucleic acids (RNA or
DNA) or RNA duplexes with single-stranded 3' overhangs.
It has been suggested that the PAZ domain provides a
unique mode for the recognition of the two 3'-terminal
nucleotides in single-stranded nucleic acids and buries
the 3' OH group, and that it might recognize
characteristic 3' overhangs in siRNAs within RISC
(RNA-induced silencing) and other complexes.
Length = 135
Score = 25.8 bits (57), Expect = 3.9
Identities = 20/82 (24%), Positives = 32/82 (39%), Gaps = 4/82 (4%)
Query: 18 IVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEKALESKTELKLEVIFYK 77
Y G +L+ + + + +E K I ++ LK + IF
Sbjct: 43 RFYVISGILDLNANSS---FPGKEGLGYATYAEYFKE-KYGIVLNHPNQPLLKGKQIFNL 98
Query: 78 KNGYHEEEEEEEEEEEKEEDKN 99
N H EE+ E EEKE+D+
Sbjct: 99 HNLLHNRFEEKGESEEKEKDRY 120
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 25.4 bits (56), Expect = 4.0
Identities = 13/15 (86%), Positives = 14/15 (93%)
Query: 83 EEEEEEEEEEEKEED 97
EEEEEEEEEEE EE+
Sbjct: 81 EEEEEEEEEEESEEE 95
Score = 25.4 bits (56), Expect = 4.3
Identities = 13/15 (86%), Positives = 14/15 (93%)
Query: 83 EEEEEEEEEEEKEED 97
EEEEEEEEEEE+EE
Sbjct: 78 EEEEEEEEEEEEEES 92
Score = 24.6 bits (54), Expect = 6.8
Identities = 13/16 (81%), Positives = 16/16 (100%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEEEE+EE++
Sbjct: 76 EEEEEEEEEEEEEEEE 91
Score = 24.6 bits (54), Expect = 7.0
Identities = 13/16 (81%), Positives = 15/16 (93%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEEEE+EE+
Sbjct: 77 EEEEEEEEEEEEEEES 92
Score = 24.6 bits (54), Expect = 8.0
Identities = 11/16 (68%), Positives = 14/16 (87%)
Query: 83 EEEEEEEEEEEKEEDK 98
EEEEEEEEE+EE++
Sbjct: 74 AAEEEEEEEEEEEEEE 89
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 25.6 bits (57), Expect = 4.1
Identities = 16/57 (28%), Positives = 26/57 (45%), Gaps = 3/57 (5%)
Query: 44 KFLYGPDTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKNL 100
K L E + +I+K E K EL+ V + E E+ EEEE++ ++
Sbjct: 114 KALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAK---LEAIEKREEEERQIEEKR 167
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 25.8 bits (56), Expect = 4.5
Identities = 10/50 (20%), Positives = 28/50 (56%), Gaps = 2/50 (4%)
Query: 52 SEEHKTQ-IEKALESKTELKLEV-IFYKKNGYHEEEEEEEEEEEKEEDKN 99
SE + T+ ++ + ++E++ G + EE++++ E+E++K+
Sbjct: 81 SEPNITRSFQEPVSQESEVQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKS 130
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 26.0 bits (57), Expect = 4.5
Identities = 10/28 (35%), Positives = 17/28 (60%), Gaps = 4/28 (14%)
Query: 79 NGYHEEEEEEEEEEE----KEEDKNLLK 102
+ +E+E + E+E +EEDKN L+
Sbjct: 374 DEDDSDEDEVDYEKERKRRREEDKNFLR 401
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 25.6 bits (57), Expect = 4.9
Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 9/48 (18%)
Query: 53 EEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKNL 100
++ K +E AL K ELK + + E++ E+ + K E K
Sbjct: 52 KDLKESLEAALLDKKELKA---------WEKAEKKAEKAKAKAEKKKA 90
>gnl|CDD|182840 PRK10921, PRK10921, twin-arginine protein translocation system
subunit TatC; Provisional.
Length = 258
Score = 25.5 bits (56), Expect = 5.0
Identities = 8/24 (33%), Positives = 13/24 (54%)
Query: 75 FYKKNGYHEEEEEEEEEEEKEEDK 98
FY G EEE + E E ++ ++
Sbjct: 235 FYVGKGRRREEENDAEAESEKTEE 258
>gnl|CDD|119297 pfam10777, YlaC, sigma70 family sigma factor YlaC. Members of the
sigma70 family of sigma factors are components of the
RNA polymerase holoenzyme that direct bacterial or
plastid core RNA polymerase to specific promoter
elements. This domain is an inner membrane protein of
unknown function.
Length = 156
Score = 25.4 bits (56), Expect = 5.4
Identities = 8/32 (25%), Positives = 18/32 (56%)
Query: 44 KFLYGPDTSEEHKTQIEKALESKTELKLEVIF 75
+ L P+ +E K Q++K + +K E+ ++
Sbjct: 117 EILNSPNVPDEQKAQLKKIIATKGEIDFYDVY 148
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 25.4 bits (56), Expect = 5.7
Identities = 7/15 (46%), Positives = 8/15 (53%)
Query: 83 EEEEEEEEEEEKEED 97
EEEE+EED
Sbjct: 288 AAAAAPAEEEEEEED 302
Score = 25.4 bits (56), Expect = 6.9
Identities = 6/15 (40%), Positives = 9/15 (60%)
Query: 83 EEEEEEEEEEEKEED 97
EEEEEE+++
Sbjct: 290 AAAPAEEEEEEEDDM 304
Score = 25.0 bits (55), Expect = 9.1
Identities = 7/15 (46%), Positives = 9/15 (60%)
Query: 83 EEEEEEEEEEEKEED 97
EEEEE+E+D
Sbjct: 289 AAAAPAEEEEEEEDD 303
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 25.5 bits (56), Expect = 5.8
Identities = 13/50 (26%), Positives = 24/50 (48%)
Query: 50 DTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKN 99
E + EK + + + + K++ EEE EEE+EE +E++
Sbjct: 59 TAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEK 108
>gnl|CDD|219844 pfam08447, PAS_3, PAS fold. The PAS fold corresponds to the
structural domain that has previously been defined as
PAS and PAC motifs. The PAS fold appears in archaea,
eubacteria and eukarya.
Length = 90
Score = 24.7 bits (54), Expect = 5.9
Identities = 13/64 (20%), Positives = 27/64 (42%), Gaps = 1/64 (1%)
Query: 18 IVYCSDGFCELSGFARAQIMQKGCACKFLYGPDTSEEHKTQIEK-ALESKTELKLEVIFY 76
I+Y S F E+ G+ ++ L P+ E + +++ +L+ E
Sbjct: 1 IIYWSPRFEEILGYTPEELKSSYEGWLDLVHPEDRERVRRALQEFSLKKGEPYSGEYRIR 60
Query: 77 KKNG 80
+K+G
Sbjct: 61 RKDG 64
>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
Length = 613
Score = 25.5 bits (57), Expect = 6.9
Identities = 12/20 (60%), Positives = 17/20 (85%)
Query: 83 EEEEEEEEEEEKEEDKNLLK 102
+EEE+EE+EE +EE K LL+
Sbjct: 480 DEEEKEEKEEAEEEFKPLLE 499
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 25.4 bits (56), Expect = 7.1
Identities = 11/47 (23%), Positives = 21/47 (44%)
Query: 52 SEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDK 98
S+ H+ +EK E LK ++ KK E + + + K + +
Sbjct: 272 SKTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFE 318
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This
is a family of uncharacterized proteins.
Length = 449
Score = 25.3 bits (55), Expect = 7.1
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 77 KKNGYHEEEEEEEEEEEKEEDKN 99
+K G EEE EEE++ EED +
Sbjct: 39 RKLGKEAEEEAMEEEDDDEEDDD 61
>gnl|CDD|221096 pfam11368, DUF3169, Protein of unknown function (DUF3169). Some
members in this family of proteins are annotated as
membrane proteins however this cannot be confirmed.
Currently there is no known function.
Length = 248
Score = 25.4 bits (56), Expect = 7.1
Identities = 8/26 (30%), Positives = 16/26 (61%), Gaps = 1/26 (3%)
Query: 78 KNGYHEEEEEEEEEE-EKEEDKNLLK 102
K Y +EE+E+ +E ++ ++NL
Sbjct: 76 KALYDKEEDEDAADELYRQANRNLEY 101
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 25.0 bits (55), Expect = 7.2
Identities = 7/17 (41%), Positives = 16/17 (94%)
Query: 83 EEEEEEEEEEEKEEDKN 99
E++EE+++E+E EE+++
Sbjct: 124 EDDEEDDDEDESEEEES 140
Score = 24.6 bits (54), Expect = 8.5
Identities = 9/15 (60%), Positives = 14/15 (93%)
Query: 83 EEEEEEEEEEEKEED 97
E+EEEE++EE+ +ED
Sbjct: 119 EDEEEEDDEEDDDED 133
Score = 24.6 bits (54), Expect = 9.6
Identities = 9/22 (40%), Positives = 15/22 (68%)
Query: 76 YKKNGYHEEEEEEEEEEEKEED 97
+ +++EE+EEEE+ EED
Sbjct: 108 ASEEDESDDDEEDEEEEDDEED 129
Score = 24.6 bits (54), Expect = 9.8
Identities = 8/15 (53%), Positives = 14/15 (93%)
Query: 83 EEEEEEEEEEEKEED 97
EE+EEEE++EE +++
Sbjct: 118 EEDEEEEDDEEDDDE 132
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 25.4 bits (55), Expect = 7.9
Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 8/58 (13%)
Query: 50 DTSEEHKTQIEKALESKTELKLEVIFYKKNGYH--------EEEEEEEEEEEKEEDKN 99
D ++ + E + ES E L+ IFY +G EE+E EEEKEE++N
Sbjct: 729 DAIDDEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDSFAESSEEDESSEEEKEEEEN 786
>gnl|CDD|220001 pfam08748, DUF1789, Domain of unknown function (DUF1789).
Putative uncharacterized domain found in phage-related
conserved hypothetical protein from Bordetella.
Length = 125
Score = 24.7 bits (54), Expect = 7.9
Identities = 8/55 (14%), Positives = 21/55 (38%)
Query: 44 KFLYGPDTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDK 98
KF+ G + + +I + +++ + E ++ E + K +K
Sbjct: 5 KFVLGQLPTFKLPVKIPRLGGEDAKVEFTFKHMDRKELAELYDKWGESDAKLGEK 59
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 25.1 bits (55), Expect = 8.4
Identities = 13/55 (23%), Positives = 23/55 (41%), Gaps = 1/55 (1%)
Query: 45 FLYGPDTSEEHKTQIEKALESKTELK-LEVIFYKKNGYHEEEEEEEEEEEKEEDK 98
F + D S K E K + + K ++E+++EEE ++EE
Sbjct: 711 FFFSFDISGLMKKIFRNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The
CAF-1 or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression
in addition to DNA-based checkpoints.
Length = 76
Score = 24.1 bits (53), Expect = 8.5
Identities = 7/15 (46%), Positives = 13/15 (86%)
Query: 83 EEEEEEEEEEEKEED 97
E+EE+EEE+++ + D
Sbjct: 60 EDEEDEEEDDDDDMD 74
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 25.1 bits (55), Expect = 8.6
Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 11/48 (22%)
Query: 51 TSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDK 98
+++E K + K +L+ I + E +E+E+EEEE+ D+
Sbjct: 348 STKESKMR------DKRRARLDPI-----DFEEVDEDEDEEEEQRSDE 384
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 25.0 bits (55), Expect = 8.7
Identities = 11/53 (20%), Positives = 26/53 (49%)
Query: 50 DTSEEHKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKNLLK 102
+ EE +IE E+K + + +++ EE+E+ ++ ++ DK +
Sbjct: 234 EREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEIL 286
>gnl|CDD|218778 pfam05852, DUF848, Gammaherpesvirus protein of unknown function
(DUF848). This family consists of several
uncharacterized proteins from the Gammaherpesvirinae.
Length = 146
Score = 24.6 bits (54), Expect = 8.8
Identities = 12/37 (32%), Positives = 20/37 (54%)
Query: 59 IEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKE 95
++ E K EL+ E+ + G +EE EEEE ++
Sbjct: 95 TDRVEELKEELEFELERLEAAGGEQEELSGEEEEPED 131
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 25.0 bits (54), Expect = 9.2
Identities = 13/17 (76%), Positives = 15/17 (88%)
Query: 83 EEEEEEEEEEEKEEDKN 99
EEEEEEEEEEE E ++N
Sbjct: 139 EEEEEEEEEEEAEVEEN 155
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 24.8 bits (54), Expect = 9.2
Identities = 12/17 (70%), Positives = 16/17 (94%)
Query: 83 EEEEEEEEEEEKEEDKN 99
+EEEEE EEEE+EED++
Sbjct: 268 DEEEEETEEEEEEEDED 284
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 24.9 bits (55), Expect = 9.3
Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 2/58 (3%)
Query: 44 KFLYGPDTSEE--HKTQIEKALESKTELKLEVIFYKKNGYHEEEEEEEEEEEKEEDKN 99
+FL G + + K + + + E K + EEEEEEE+E+++EE +
Sbjct: 400 EFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEE 457
>gnl|CDD|204616 pfam11240, DUF3042, Protein of unknown function (DUF3042). This
family of proteins with unknown function appears to be
restricted to Firmicutes.
Length = 54
Score = 23.5 bits (51), Expect = 9.8
Identities = 9/22 (40%), Positives = 11/22 (50%)
Query: 73 VIFYKKNGYHEEEEEEEEEEEK 94
V +KK EE+EE EE
Sbjct: 23 VFAFKKKVIEPIEEKEEFIEEN 44
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 25.0 bits (54), Expect = 9.9
Identities = 5/15 (33%), Positives = 15/15 (100%)
Query: 83 EEEEEEEEEEEKEED 97
++E+E+E+++++E+D
Sbjct: 156 DDEDEDEDDDDEEDD 170
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.131 0.371
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,066,640
Number of extensions: 425579
Number of successful extensions: 3250
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2542
Number of HSP's successfully gapped: 480
Length of query: 102
Length of database: 10,937,602
Length adjustment: 68
Effective length of query: 34
Effective length of database: 7,921,530
Effective search space: 269332020
Effective search space used: 269332020
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 53 (24.0 bits)