RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6875
(201 letters)
>gnl|CDD|215754 pfam00153, Mito_carr, Mitochondrial carrier protein.
Length = 96
Score = 75.4 bits (186), Expect = 4e-18
Identities = 23/67 (34%), Positives = 33/67 (49%), Gaps = 4/67 (5%)
Query: 136 WWRHLVAGGVAGAVSRTCTAPLDRLKVYLQ----VHGNKTTGIKKCLLHLLHEGGFLSLW 191
+ L+AGG+AGA++ T T PLD +K LQ K GI C + E G L+
Sbjct: 5 FLASLLAGGIAGAIAATVTYPLDVVKTRLQSSAAGGSRKYKGILDCFKKIYKEEGIRGLY 64
Query: 192 RGNGSKI 198
+G +
Sbjct: 65 KGLLPNL 71
Score = 67.7 bits (166), Expect = 4e-15
Identities = 18/59 (30%), Positives = 28/59 (47%)
Query: 41 GVHGNKTTGIKKCLLHLLHEGGFLSLWRGNGINVLKIAPESALKFMAYEQAKRLIRGNY 99
K GI C + E G L++G N+L++AP +A+ F YE K+L+
Sbjct: 38 AGGSRKYKGILDCFKKIYKEEGIRGLYKGLLPNLLRVAPAAAIYFGTYETLKKLLLKKL 96
>gnl|CDD|240302 PTZ00169, PTZ00169, ADP/ATP transporter on adenylate translocase;
Provisional.
Length = 300
Score = 54.0 bits (130), Expect = 6e-09
Identities = 42/151 (27%), Positives = 68/151 (45%), Gaps = 33/151 (21%)
Query: 48 TGIKKCLLHLLHEGGFLSLWRGNGINVLKIAPESALKFMAYEQAKRLIRGNYTRELSIYE 107
+GI C + E G LSLWRGN NV++ P A F A++ + + Y ++ ++
Sbjct: 53 SGIVNCFRRVSKEQGVLSLWRGNTANVIRYFPTQAFNF-AFKDYFKNMFPKYNQKTDFWK 111
Query: 108 RFVAGSLAGGVSLNVPDDFTAKEMLTGMWWRHLVAGGVAGAVSRTCTAPLD----RLKVY 163
F ++++GG+AGA S PLD RL
Sbjct: 112 FFGV---------------------------NILSGGLAGASSLLIVYPLDFARTRLASD 144
Query: 164 LQVHGNKT-TGIKKCLLHLLHEGGFLSLWRG 193
+ G++ TG+ CL+ + + GFLSL++G
Sbjct: 145 IGKGGDREFTGLFDCLMKISKQTGFLSLYQG 175
Score = 50.9 bits (122), Expect = 7e-08
Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 10/64 (15%)
Query: 141 VAGGVAGAVSRTCTAPLDRLKVYLQVH--------GN--KTTGIKKCLLHLLHEGGFLSL 190
+ GG++ A+S+T AP++R+K+ +Q G + +GI C + E G LSL
Sbjct: 12 LMGGISAAISKTAVAPIERVKMLIQTQDSIPEIKSGKVPRYSGIVNCFRRVSKEQGVLSL 71
Query: 191 WRGN 194
WRGN
Sbjct: 72 WRGN 75
Score = 37.4 bits (87), Expect = 0.002
Identities = 30/88 (34%), Positives = 46/88 (52%), Gaps = 5/88 (5%)
Query: 48 TGIKKCLLHLLHEGGFLSLWRGNGINVLKIAPESALKFMAYEQAKRLIRGNYTRELSIYE 107
TG+ CL+ + + GFLSL++G G++V I F Y+ AK L+ GN +Y+
Sbjct: 154 TGLFDCLMKISKQTGFLSLYQGFGVSVQGIIVYRGAYFGLYDSAKALLFGNDKNTNILYK 213
Query: 108 RFVAGS---LAGGVSLNVPDDFTAKEML 132
VA + LAG +S P D + M+
Sbjct: 214 WAVAQTVTILAGLISY--PFDTVRRRMM 239
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 35.4 bits (81), Expect = 0.016
Identities = 16/28 (57%), Positives = 24/28 (85%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
EEEEE++EEE+E+EEE++++E E EE
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 35.4 bits (81), Expect = 0.017
Identities = 16/29 (55%), Positives = 24/29 (82%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
SEEEEE++EEE+E+EEE++++E E E
Sbjct: 863 SEEEEEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 34.2 bits (78), Expect = 0.038
Identities = 19/47 (40%), Positives = 30/47 (63%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKKCLLHLL 58
EEEEE++EEE+E+EEE++++E E L + +T + L LL
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQAIYLFLL 912
Score = 33.0 bits (75), Expect = 0.10
Identities = 14/26 (53%), Positives = 23/26 (88%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKE 32
E + EEEEE++EEE+E+EEE++++E
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 32.7 bits (74), Expect = 0.12
Identities = 14/26 (53%), Positives = 22/26 (84%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKE 32
E + EEEEE++EEE+E+EEE++ +E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 32.3 bits (73), Expect = 0.15
Identities = 14/26 (53%), Positives = 22/26 (84%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKE 32
E + EEEEE++EEE+E+EEE+++ E
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 31.9 bits (72), Expect = 0.21
Identities = 14/28 (50%), Positives = 23/28 (82%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+ EEE++EEE+E+EEE++++E E EE
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEE 889
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 35.1 bits (81), Expect = 0.018
Identities = 14/34 (41%), Positives = 25/34 (73%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+E+++ +EEEE+++EEK++EEEK + EE
Sbjct: 31 VEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEE 64
Score = 32.0 bits (73), Expect = 0.16
Identities = 17/39 (43%), Positives = 25/39 (64%), Gaps = 4/39 (10%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGI 50
EE +E++E+E++K++ KK KE E E L NKT I
Sbjct: 59 EEVDEEEEKEEKKKKTKKVKETTTEWELL----NKTKPI 93
Score = 31.3 bits (71), Expect = 0.30
Identities = 18/46 (39%), Positives = 27/46 (58%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKK 52
E EEEEEK+E+++E+E+ K+E E EE KT +K+
Sbjct: 34 EVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKE 79
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 32.6 bits (75), Expect = 0.031
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREG 37
EEKKEEE+E+EE+++ +E G
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEEAAAG 100
Score = 30.7 bits (70), Expect = 0.15
Identities = 10/21 (47%), Positives = 16/21 (76%)
Query: 12 EEEEEKKEEEKEKEEEKKKKE 32
E+K+EE+E+EEEK++ E
Sbjct: 74 AAAAEEKKEEEEEEEEKEESE 94
Score = 29.9 bits (68), Expect = 0.34
Identities = 8/30 (26%), Positives = 17/30 (56%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
++++E+E+EEE+K++ LG
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEEEAAAGLG 102
Score = 28.0 bits (63), Expect = 1.6
Identities = 8/28 (28%), Positives = 16/28 (57%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+E+++E+EEE++K+E E
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 27.2 bits (61), Expect = 2.8
Identities = 8/29 (27%), Positives = 15/29 (51%)
Query: 13 EEEEKKEEEKEKEEEKKKKEGGREGEELG 41
EEK++EEE+++++ E E
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 26.8 bits (60), Expect = 3.2
Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 3/33 (9%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEELGVHG 44
EE+KE+EEE+++KE E EE G
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKE---ESEEEAAAG 100
Score = 25.7 bits (57), Expect = 8.7
Identities = 12/21 (57%), Positives = 16/21 (76%)
Query: 5 AIERKISEEEEEKKEEEKEKE 25
A E K EEEEE+++EE E+E
Sbjct: 76 AAEEKKEEEEEEEEKEESEEE 96
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 32.3 bits (74), Expect = 0.041
Identities = 10/30 (33%), Positives = 18/30 (60%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
E++EEE+E+EEE++++ LG
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEEEAMAGLG 101
Score = 29.2 bits (66), Expect = 0.51
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREG 37
+ ++EEE+E+EEE++++E E
Sbjct: 70 AAAAAAEEEEEEEEEEEEEEEESEEEA 96
Score = 26.9 bits (60), Expect = 3.2
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 18 KEEEKEKEEEKKKKEGGREGEELGVHG 44
E+E+EEE++++E E EE + G
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEAMAG 99
Score = 26.5 bits (59), Expect = 5.0
Identities = 9/25 (36%), Positives = 15/25 (60%)
Query: 15 EEKKEEEKEKEEEKKKKEGGREGEE 39
EE+E+EEE++++E EE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is
functionally equivalent to L7/L12 in bacteria and the
P1 and P2 proteins in eukaryotes. L12p is homologous to
P1 and P2 but is not homologous to bacterial L7/L12. It
is located in the L12 stalk, with proteins L10, L11,
and 23S rRNA. L12p is the only protein in the ribosome
to occur as multimers, always appearing as sets of
dimers. Recent data indicate that most archaeal species
contain six copies of L12p (three homodimers), while
eukaryotes have four copies (two heterodimers), and
bacteria may have four or six copies (two or three
homodimers), depending on the species. The organization
of proteins within the stalk has been characterized
primarily in bacteria, where L7/L12 forms either two or
three homodimers and each homodimer binds to the
extended C-terminal helix of L10. L7/L12 is attached to
the ribosome through L10 and is the only ribosomal
protein that does not directly interact with rRNA.
Archaeal L12p is believed to function in a similar
fashion. However, hybrid ribosomes containing the large
subunit from E. coli with an archaeal stalk are able to
bind archaeal and eukaryotic elongation factors but not
bacterial elongation factors. In several mesophilic and
thermophilic archaeal species, the binding of 23S rRNA
to protein L11 and to the L10/L12p pentameric complex
was found to be temperature-dependent and cooperative.
Length = 106
Score = 31.7 bits (72), Expect = 0.071
Identities = 15/23 (65%), Positives = 19/23 (82%)
Query: 5 AIERKISEEEEEKKEEEKEKEEE 27
A E+ +EEE+KKEEEKE+EEE
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEEEE 96
Score = 31.7 bits (72), Expect = 0.077
Identities = 15/31 (48%), Positives = 22/31 (70%)
Query: 3 PIAIERKISEEEEEKKEEEKEKEEEKKKKEG 33
A E K E+EEEKK+EE+++EEE++ G
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEEALAG 101
Score = 31.7 bits (72), Expect = 0.080
Identities = 16/30 (53%), Positives = 23/30 (76%)
Query: 3 PIAIERKISEEEEEKKEEEKEKEEEKKKKE 32
P A +EE+ E+KEEEK+KEEEK+++E
Sbjct: 66 PAAAAAAAAEEKAEEKEEEKKKEEEKEEEE 95
Score = 29.4 bits (66), Expect = 0.44
Identities = 11/27 (40%), Positives = 21/27 (77%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREG 37
E+ EEK+EE+K++EE+++++E G
Sbjct: 75 EEKAEEKEEEKKKEEEKEEEEEEALAG 101
Score = 28.6 bits (64), Expect = 0.79
Identities = 11/31 (35%), Positives = 22/31 (70%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
+ EE+ +++EE++K+EE+K++E LG
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEEALAGLG 103
Score = 28.6 bits (64), Expect = 0.89
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 14 EEEKKEEEKEKEEEKKKKEGGREGEELGVHG 44
+E+ +EKEEEKKK+E E EE + G
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEEALAG 101
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 33.2 bits (76), Expect = 0.082
Identities = 22/79 (27%), Positives = 34/79 (43%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKKCLLHLLHEGGFLSL 66
EE+EE++EEE E E K+ E EE V L + ++ +
Sbjct: 388 RDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNSSLAVGYKNDRSYV 447
Query: 67 WRGNGINVLKIAPESALKF 85
RG+ I V K + +L+F
Sbjct: 448 VRGDKIGVFKHTDDDSLEF 466
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 33.1 bits (75), Expect = 0.084
Identities = 12/25 (48%), Positives = 20/25 (80%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKK 31
E+K EE E +KE+EKE+E E++++
Sbjct: 593 EQKAREEREREKEKEKERERERERE 617
Score = 31.6 bits (71), Expect = 0.26
Identities = 17/35 (48%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEK-KKKEGGREGE 38
A+E K E E+K EE+E+E+EK K++E RE E
Sbjct: 584 AVE-KAKREAEQKAREEREREKEKEKERERERERE 617
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 33.2 bits (75), Expect = 0.091
Identities = 13/36 (36%), Positives = 20/36 (55%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
A E K +E +K E K+K +E KK E ++ +E
Sbjct: 1492 AEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEA 1527
Score = 32.8 bits (74), Expect = 0.12
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+K +EE ++K +E K+ E KKK + ++ EE
Sbjct: 1489 KKKAEEAKKKADEAKKAAEAKKKADEAKKAEE 1520
Score = 32.0 bits (72), Expect = 0.24
Identities = 13/36 (36%), Positives = 24/36 (66%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
A +K ++E ++K EE+K+ +E KKK E ++ +E
Sbjct: 1414 AAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEA 1449
Score = 32.0 bits (72), Expect = 0.25
Identities = 15/36 (41%), Positives = 23/36 (63%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
A E K +EE ++K EE K+ +E KKK E ++ +E
Sbjct: 1453 AEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEA 1488
Score = 31.6 bits (71), Expect = 0.25
Identities = 15/36 (41%), Positives = 24/36 (66%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
A E+K ++E ++K EE K+ +E KKK E ++ EE
Sbjct: 1427 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEA 1462
Score = 31.6 bits (71), Expect = 0.32
Identities = 15/35 (42%), Positives = 23/35 (65%), Gaps = 2/35 (5%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKE--GGREGEE 39
E K +EE+++K EE K+ EE++KK +E EE
Sbjct: 1666 EAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEE 1700
Score = 31.3 bits (70), Expect = 0.36
Identities = 15/36 (41%), Positives = 23/36 (63%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
A E K ++E ++K EE K+ EE KKK E ++ +E
Sbjct: 1440 AEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEA 1475
Score = 31.3 bits (70), Expect = 0.44
Identities = 15/36 (41%), Positives = 22/36 (61%), Gaps = 1/36 (2%)
Query: 5 AIERKISEEEEEKKEEE-KEKEEEKKKKEGGREGEE 39
A E K +EE+E+K E K++ EE KK E ++ E
Sbjct: 1677 AEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEA 1712
Score = 30.9 bits (69), Expect = 0.59
Identities = 15/33 (45%), Positives = 20/33 (60%), Gaps = 1/33 (3%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKK-EGGREGEE 39
+K EE + K EE +K EE KKK E ++ EE
Sbjct: 1653 KKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEE 1685
Score = 30.5 bits (68), Expect = 0.66
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 13 EEEEKKEEEKEKEEEKKKKEGGREGEE 39
E +KK +E +K EE KK + ++ EE
Sbjct: 1506 AEAKKKADEAKKAEEAKKADEAKKAEE 1532
Score = 30.1 bits (67), Expect = 0.85
Identities = 18/40 (45%), Positives = 25/40 (62%), Gaps = 4/40 (10%)
Query: 5 AIERKISEE----EEEKKEEEKEKEEEKKKKEGGREGEEL 40
A E K ++E EE KK +E +K EEKKK + ++ EEL
Sbjct: 1518 AEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEEL 1557
Score = 30.1 bits (67), Expect = 0.95
Identities = 13/37 (35%), Positives = 20/37 (54%), Gaps = 1/37 (2%)
Query: 5 AIERKISEEEEEKKEEEKEKEEE-KKKKEGGREGEEL 40
A E K +E +K K+K +E KKK E ++ +E
Sbjct: 1400 AEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEA 1436
Score = 29.7 bits (66), Expect = 1.1
Identities = 16/33 (48%), Positives = 22/33 (66%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
E K EE++KK EE +K+EE+KKK + EE
Sbjct: 1734 EAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEE 1766
Score = 29.7 bits (66), Expect = 1.3
Identities = 16/42 (38%), Positives = 20/42 (47%), Gaps = 9/42 (21%)
Query: 7 ERKISEEEEEKKEEE---------KEKEEEKKKKEGGREGEE 39
E KI EE KK EE K +E+EKK E ++ E
Sbjct: 1658 ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAE 1699
Score = 29.7 bits (66), Expect = 1.4
Identities = 18/37 (48%), Positives = 22/37 (59%), Gaps = 2/37 (5%)
Query: 5 AIERKISEEEEEKKEEE--KEKEEEKKKKEGGREGEE 39
A E E EE KK EE K++ EEKKK E ++ EE
Sbjct: 1690 AAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEE 1726
Score = 29.3 bits (65), Expect = 1.4
Identities = 13/37 (35%), Positives = 21/37 (56%), Gaps = 4/37 (10%)
Query: 8 RKISEE----EEEKKEEEKEKEEEKKKKEGGREGEEL 40
+K +E +E KK EE +K +E KK E ++ +E
Sbjct: 1503 KKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEA 1539
Score = 29.3 bits (65), Expect = 1.5
Identities = 13/34 (38%), Positives = 21/34 (61%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
E K +E +K EE+K+ +E KKK E ++ +E
Sbjct: 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEA 1320
Score = 29.3 bits (65), Expect = 1.8
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
A E K +EE + K EE K+ EEEKKK E ++ E
Sbjct: 1609 AEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEA 1643
Score = 29.0 bits (64), Expect = 2.1
Identities = 14/39 (35%), Positives = 22/39 (56%)
Query: 1 MTPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
M A + + EE +K +E +K EEKKK + ++ EE
Sbjct: 1262 MAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEE 1300
Score = 29.0 bits (64), Expect = 2.1
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+K +EE ++K + K+K EE KK + E
Sbjct: 1321 KKKAEEAKKKADAAKKKAEEAKKAAEAAKAEA 1352
Score = 29.0 bits (64), Expect = 2.2
Identities = 12/35 (34%), Positives = 23/35 (65%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
A E+K ++E ++K EE+K+K +E KK ++ +
Sbjct: 1387 AEEKKKADEAKKKAEEDKKKADELKKAAAAKKKAD 1421
Score = 29.0 bits (64), Expect = 2.3
Identities = 15/37 (40%), Positives = 20/37 (54%), Gaps = 4/37 (10%)
Query: 7 ERKISE----EEEEKKEEEKEKEEEKKKKEGGREGEE 39
ERK E E+ +K E K+ EE KK E ++ EE
Sbjct: 1211 ERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEE 1247
Score = 29.0 bits (64), Expect = 2.4
Identities = 10/32 (31%), Positives = 20/32 (62%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+K +EE+++K +E K+ KKK + ++ E
Sbjct: 1397 KKKAEEDKKKADELKKAAAAKKKADEAKKKAE 1428
Score = 28.6 bits (63), Expect = 2.6
Identities = 14/37 (37%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 5 AIERKISEEEEEKKEEEKEKEEE-KKKKEGGREGEEL 40
A E K ++E ++K EE K+K +E KK E ++ +E
Sbjct: 1479 AEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEA 1515
Score = 28.2 bits (62), Expect = 3.4
Identities = 12/36 (33%), Positives = 21/36 (58%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
A E K E++K+E +K+ + KKK E ++ +E
Sbjct: 1361 AAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEA 1396
Score = 28.2 bits (62), Expect = 3.5
Identities = 13/48 (27%), Positives = 16/48 (33%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKK 52
A K EE +K E + E E E E+ K KK
Sbjct: 1331 ADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKK 1378
Score = 28.2 bits (62), Expect = 3.5
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 13 EEEEKKEEEKEKEEEKKKKEGGREGEE 39
EE K EE K+K E+ +K E R+ E+
Sbjct: 1112 EEARKAEEAKKKAEDARKAEEARKAED 1138
Score = 28.2 bits (62), Expect = 3.7
Identities = 19/39 (48%), Positives = 24/39 (61%), Gaps = 4/39 (10%)
Query: 5 AIERKISEE----EEEKKEEEKEKEEEKKKKEGGREGEE 39
A E K +EE +E KK EE +K EEKKK E ++ EE
Sbjct: 1536 ADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEE 1574
Score = 28.2 bits (62), Expect = 4.0
Identities = 15/35 (42%), Positives = 20/35 (57%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
A E K EE +K +E K+K EE KK E ++ E
Sbjct: 1433 ADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAE 1467
Score = 28.2 bits (62), Expect = 4.0
Identities = 14/48 (29%), Positives = 19/48 (39%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKK 52
A E K + +KK EE +K E K E +E K +K
Sbjct: 1324 AEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK 1371
Score = 27.8 bits (61), Expect = 4.4
Identities = 16/34 (47%), Positives = 21/34 (61%), Gaps = 2/34 (5%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKK--EGGREGEE 39
+K EE++K EE K+ EEE K K E +E EE
Sbjct: 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEE 1741
Score = 27.8 bits (61), Expect = 4.6
Identities = 16/39 (41%), Positives = 21/39 (53%)
Query: 1 MTPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
M E+K+ EE +K EE K K EE KK E ++ E
Sbjct: 1598 MKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVE 1636
Score = 27.8 bits (61), Expect = 4.9
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 14 EEEKKEEEKEKEEEKKKKEGGREGEEL 40
+E KK EEK+K +E KK E ++ +E
Sbjct: 1281 DELKKAEEKKKADEAKKAEEKKKADEA 1307
Score = 27.4 bits (60), Expect = 6.0
Identities = 17/46 (36%), Positives = 25/46 (54%), Gaps = 3/46 (6%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKK 52
E KI EE +K+ EE +K+ E+ KK+ E E+ + K KK
Sbjct: 1727 ENKIKAEEAKKEAEEDKKKAEEAKKD---EEEKKKIAHLKKEEEKK 1769
Score = 27.4 bits (60), Expect = 6.1
Identities = 19/41 (46%), Positives = 23/41 (56%), Gaps = 8/41 (19%)
Query: 7 ERKISEEEEEKKEEEKEK--------EEEKKKKEGGREGEE 39
E KI EE +K EEEK+K EEKKK E ++ EE
Sbjct: 1617 EAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEE 1657
Score = 27.4 bits (60), Expect = 6.5
Identities = 14/37 (37%), Positives = 24/37 (64%), Gaps = 1/37 (2%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKK-KEGGREGEEL 40
A E K ++E ++K EE K+ +E KKK +E ++ +E
Sbjct: 1466 AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEA 1502
Score = 27.4 bits (60), Expect = 6.7
Identities = 13/36 (36%), Positives = 20/36 (55%), Gaps = 2/36 (5%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKK--EGGREGEEL 40
E K +E +KK EE +K +E KKK E ++ +
Sbjct: 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAA 1334
Score = 27.4 bits (60), Expect = 6.9
Identities = 14/35 (40%), Positives = 21/35 (60%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
A E K EE++K +E K+K EE KK + ++ E
Sbjct: 1420 ADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAE 1454
Score = 27.4 bits (60), Expect = 7.1
Identities = 15/36 (41%), Positives = 21/36 (58%), Gaps = 1/36 (2%)
Query: 5 AIERKISEEEEEKKEEEKEK-EEEKKKKEGGREGEE 39
A E K EE +K +E K+K EE KKK + ++ E
Sbjct: 1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAE 1507
Score = 27.0 bits (59), Expect = 8.1
Identities = 12/24 (50%), Positives = 16/24 (66%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKK 31
+K EE + K EE K++ EE KKK
Sbjct: 1722 KKAEEENKIKAEEAKKEAEEDKKK 1745
Score = 27.0 bits (59), Expect = 8.6
Identities = 15/37 (40%), Positives = 19/37 (51%), Gaps = 4/37 (10%)
Query: 7 ERKISEE----EEEKKEEEKEKEEEKKKKEGGREGEE 39
+EE E K EE ++ EE KKK E R+ EE
Sbjct: 1096 AFGKAEEAKKTETGKAEEARKAEEAKKKAEDARKAEE 1132
Score = 27.0 bits (59), Expect = 8.7
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKK 31
A E K ++E ++K EE K+K + KKK
Sbjct: 1311 AEEAKKADEAKKKAEEAKKKADAAKKK 1337
Score = 27.0 bits (59), Expect = 8.7
Identities = 12/35 (34%), Positives = 19/35 (54%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
A + EE+ + EK+KEE KKK + ++ E
Sbjct: 1354 AAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAE 1388
Score = 27.0 bits (59), Expect = 9.3
Identities = 12/33 (36%), Positives = 19/33 (57%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
K EE ++K + K+K EEKKK + ++ E
Sbjct: 1369 AEKKKEEAKKKADAAKKKAEEKKKADEAKKKAE 1401
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 32.9 bits (76), Expect = 0.10
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 1 MTPIAIERKISEEEEEKKEEEKEKEEEKKK 30
++ ++ E E EEE+EK+EEKK
Sbjct: 61 LSLEEQRERLEELAPELLEEEEEKKEEKKG 90
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 32.6 bits (75), Expect = 0.11
Identities = 14/28 (50%), Positives = 21/28 (75%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKE 32
E+ + EEE++EE +EK+EEKKK+E
Sbjct: 268 EEEKILKAAEEERQEEAQEKKEEKKKEE 295
Score = 28.0 bits (63), Expect = 3.0
Identities = 11/26 (42%), Positives = 18/26 (69%), Gaps = 1/26 (3%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKK 30
A K+S EE+ K EEKE++++ +K
Sbjct: 298 AKLAKLSPEEQRKL-EEKERKKQARK 322
Score = 28.0 bits (63), Expect = 3.7
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 12 EEEEEKKEEEKEKEEEKKKKE 32
EE +EKKEE+K++E E K +
Sbjct: 282 EEAQEKKEEKKKEEREAKLAK 302
Score = 27.2 bits (61), Expect = 6.1
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKE 32
R+ EE+ K EE+ +EE ++KKE
Sbjct: 264 TREEEEEKILKAAEEERQEEAQEKKE 289
Score = 26.8 bits (60), Expect = 8.7
Identities = 10/30 (33%), Positives = 18/30 (60%), Gaps = 4/30 (13%)
Query: 7 ERKISEEEEEKKEEEKEKE----EEKKKKE 32
E + +EE++K+E E + EE++K E
Sbjct: 283 EAQEKKEEKKKEEREAKLAKLSPEEQRKLE 312
Score = 26.8 bits (60), Expect = 9.3
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKK 31
E + E +E+K+E++KE+ E K K
Sbjct: 278 EERQEEAQEKKEEKKKEEREAKLAK 302
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 32.5 bits (75), Expect = 0.11
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 13 EEEEKKEEEKEKEEEKKKKEGGREG 37
EE++EEE+E+EEE+ +E G
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEEAAAG 324
Score = 30.6 bits (70), Expect = 0.45
Identities = 9/31 (29%), Positives = 18/31 (58%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
++ ++EEE+E+EEE+++ LG
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEPSEEEAAAGLG 326
Score = 30.2 bits (69), Expect = 0.56
Identities = 10/40 (25%), Positives = 21/40 (52%)
Query: 2 TPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
++ +S + + EE+E+EEE++++E E E
Sbjct: 284 LDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEAAA 323
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 32.0 bits (73), Expect = 0.13
Identities = 19/48 (39%), Positives = 32/48 (66%), Gaps = 1/48 (2%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEG-GREGEELGVHGNKTTGIKK 52
+E++ EEEEKKE++K+KE +K+KKE ++ + + G+K KK
Sbjct: 143 VEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKK 190
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 32.6 bits (75), Expect = 0.14
Identities = 14/34 (41%), Positives = 21/34 (61%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
I K ++ EE+K+E+K+K KKKE E E+
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEK 448
Score = 31.8 bits (73), Expect = 0.23
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 9 KISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
K E+ EKK EE++KE++KK G ++ EE
Sbjct: 413 KKIVEKAEKKREEEKKEKKKKAFAGKKKEEE 443
Score = 31.4 bits (72), Expect = 0.30
Identities = 13/34 (38%), Positives = 23/34 (67%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+KI E+ E+K+EEEK+++++K +E EE
Sbjct: 411 KIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEE 444
Score = 30.7 bits (70), Expect = 0.51
Identities = 14/33 (42%), Positives = 26/33 (78%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+++ EEEE++K+EE+++EEE++ +E E EE
Sbjct: 439 KKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Score = 30.7 bits (70), Expect = 0.52
Identities = 11/36 (30%), Positives = 22/36 (61%)
Query: 4 IAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
I + +E++ E++++EK+K+ KK+ E EE
Sbjct: 412 IKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEE 447
Score = 30.3 bits (69), Expect = 0.73
Identities = 19/43 (44%), Positives = 29/43 (67%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKT 47
A ++K EEEEEK+++E+EKEEE+++ E +E EE T
Sbjct: 436 AGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQAT 478
Score = 29.9 bits (68), Expect = 0.84
Identities = 14/34 (41%), Positives = 22/34 (64%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+E+ + EEEKKE++K+ KKK+E E +E
Sbjct: 416 VEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE 449
Score = 29.9 bits (68), Expect = 0.89
Identities = 16/33 (48%), Positives = 23/33 (69%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
E+K +KKEEE+E+E+EKK++E E EE
Sbjct: 429 EKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Score = 29.9 bits (68), Expect = 0.99
Identities = 13/33 (39%), Positives = 24/33 (72%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+++ ++ K++E+E+EEEK+KKE +E EE
Sbjct: 427 KKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459
Score = 29.9 bits (68), Expect = 1.0
Identities = 9/46 (19%), Positives = 24/46 (52%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKK 52
+ +E+++K K+KEEE+++++ +E E+ ++
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468
Score = 29.5 bits (67), Expect = 1.3
Identities = 7/33 (21%), Positives = 21/33 (63%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
K EEE+++K+++ ++K+++E + ++
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKK 451
Score = 28.7 bits (65), Expect = 2.1
Identities = 16/35 (45%), Positives = 26/35 (74%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
A K EEEEE+++E+KE+E+E++++E E EE
Sbjct: 434 AFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468
Score = 28.7 bits (65), Expect = 2.1
Identities = 13/37 (35%), Positives = 23/37 (62%)
Query: 3 PIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
A ++ E+ +K+ E+EK+E+KKK G++ EE
Sbjct: 406 KKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEE 442
Score = 28.3 bits (64), Expect = 3.0
Identities = 14/35 (40%), Positives = 24/35 (68%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+ +++EE++EEEKEK+EE+K++E EE
Sbjct: 431 KKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Score = 27.6 bits (62), Expect = 5.4
Identities = 10/33 (30%), Positives = 24/33 (72%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+R+ ++E++KK +K+EE++++E ++ EE
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Score = 27.2 bits (61), Expect = 6.7
Identities = 9/32 (28%), Positives = 19/32 (59%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
K + ++ +K E+ EK+ E++KKE ++
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFA 436
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 31.1 bits (71), Expect = 0.16
Identities = 14/30 (46%), Positives = 23/30 (76%)
Query: 4 IAIERKISEEEEEKKEEEKEKEEEKKKKEG 33
I IE K E+E EK+E+E+++E+EK+ +G
Sbjct: 12 IDIEEKREEKEREKEEKERKEEKEKEWGKG 41
Score = 30.7 bits (70), Expect = 0.25
Identities = 8/25 (32%), Positives = 20/25 (80%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKKE 32
R I EE+ +++E +++E+E+K+++
Sbjct: 10 RIIDIEEKREEKEREKEEKERKEEK 34
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 31.6 bits (72), Expect = 0.18
Identities = 12/21 (57%), Positives = 17/21 (80%)
Query: 12 EEEEEKKEEEKEKEEEKKKKE 32
EE+E++EEEK EEEK ++E
Sbjct: 159 AEEKEREEEEKAAEEEKAREE 179
Score = 27.0 bits (60), Expect = 6.7
Identities = 13/21 (61%), Positives = 15/21 (71%)
Query: 7 ERKISEEEEEKKEEEKEKEEE 27
E K EEEE+ EEEK +EEE
Sbjct: 160 EEKEREEEEKAAEEEKAREEE 180
>gnl|CDD|227084 COG4741, COG4741, Predicted secreted endonuclease distantly
related to archaeal Holliday junction resolvase
[Nucleotide transport and metabolism].
Length = 175
Score = 31.3 bits (71), Expect = 0.19
Identities = 14/32 (43%), Positives = 19/32 (59%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGRE 36
ER ++E + K+EE K KE +KK E RE
Sbjct: 46 ERERLVNEAQARKEEEWKLKEWIEKKIEEARE 77
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 31.9 bits (72), Expect = 0.21
Identities = 14/46 (30%), Positives = 23/46 (50%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKK 52
E K+S E K E E++++ E+ ++E E L G K +K
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKG 346
Score = 28.4 bits (63), Expect = 3.2
Identities = 10/30 (33%), Positives = 16/30 (53%)
Query: 3 PIAIERKISEEEEEKKEEEKEKEEEKKKKE 32
IE+ EE E+++ E+E KK K+
Sbjct: 311 KPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 31.5 bits (72), Expect = 0.25
Identities = 16/38 (42%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 4 IAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
++ +I E E+E+ EEE E+ E+ K EG EG E G
Sbjct: 95 EKLQMQI-EREQEEWEEELERLIEEAKAEGYEEGYEKG 131
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 31.7 bits (72), Expect = 0.26
Identities = 15/36 (41%), Positives = 21/36 (58%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
+ER EEEEE++EEE+E E + E G E+
Sbjct: 398 VERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFEVP 433
>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
component YidC; Provisional.
Length = 304
Score = 31.2 bits (71), Expect = 0.27
Identities = 10/28 (35%), Positives = 15/28 (53%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKKEGGR 35
KI +EEE KK+ + E+ K G +
Sbjct: 268 GKIDKEEEAKKQAKAERRAANAPKPGAK 295
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized.
This family of proteins is found in bacteria. Proteins
in this family are approximately 90 amino acids in
length. There are two completely conserved L residues
that may be functionally important.
Length = 70
Score = 28.8 bits (65), Expect = 0.36
Identities = 14/29 (48%), Positives = 21/29 (72%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
EEE+ ++EEE K EE+K++E + EEL
Sbjct: 32 EEEKREEEEEARKREERKEREKNKSFEEL 60
>gnl|CDD|206063 pfam13892, DBINO, DNA-binding domain. DBINO is a DNA-binding
domain found on global transcription activator SNF2L1
proteins and chromatin re-modelling proteins.
Length = 140
Score = 30.3 bits (69), Expect = 0.36
Identities = 13/25 (52%), Positives = 16/25 (64%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGRE 36
EE E +K EKE E+ KK+E RE
Sbjct: 84 EERELRKRAEKEALEQAKKEEELRE 108
>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
Length = 356
Score = 31.1 bits (70), Expect = 0.36
Identities = 12/26 (46%), Positives = 21/26 (80%)
Query: 3 PIAIERKISEEEEEKKEEEKEKEEEK 28
P AI+ ++ EE +E++EEE+ +EEE+
Sbjct: 324 PFAIDEEMLEETQEQQEEEENEEEEE 349
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 31.0 bits (70), Expect = 0.37
Identities = 16/34 (47%), Positives = 23/34 (67%), Gaps = 1/34 (2%)
Query: 7 ERKISEE-EEEKKEEEKEKEEEKKKKEGGREGEE 39
+RK EE +E++ +EKEKE+EKK +E EE
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEE 163
Score = 30.6 bits (69), Expect = 0.57
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+ E KEEEKEKE+ K++K+ +E +
Sbjct: 94 PAKEPKNESGKEEEKEKEQVKEEKKKKKEKPK 125
Score = 29.1 bits (65), Expect = 1.5
Identities = 18/39 (46%), Positives = 25/39 (64%)
Query: 2 TPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
T A E K +EE+KE+E+ KEE+KKKKE +E +
Sbjct: 92 TKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKD 130
Score = 27.5 bits (61), Expect = 5.0
Identities = 11/31 (35%), Positives = 23/31 (74%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREG 37
+ ++ EE+++KKE+ KE+ +++K KE +E
Sbjct: 110 KEQVKEEKKKKKEKPKEEPKDRKPKEEAKEK 140
>gnl|CDD|239572 cd03490, Topoisomer_IB_N_1, Topoisomer_IB_N_1: A subgroup of the
N-terminal DNA binding fragment found in eukaryotic DNA
topoisomerase (topo) IB. Topo IB proteins include the
monomeric yeast and human topo I and heterodimeric topo
I from Leishmania donvanni. Topo I enzymes are divided
into: topo type IA (bacterial) and type IB
(eukaryotic). Topo I relaxes superhelical tension in
duplex DNA by creating a single-strand nick, the broken
strand can then rotate around the unbroken strand to
remove DNA supercoils and, the nick is religated,
liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit religation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topos I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I have putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 217
Score = 30.6 bits (69), Expect = 0.38
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGRE 36
I+ + EE+E+KK KE++E KKK+ RE
Sbjct: 89 IKNHLEEEKEKKKNLNKEEKEAKKKERAKRE 119
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 30.9 bits (70), Expect = 0.38
Identities = 11/33 (33%), Positives = 20/33 (60%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+ +EEEE++ +E E+EE + +E G + E
Sbjct: 370 VDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSRE 402
Score = 28.5 bits (64), Expect = 2.5
Identities = 8/35 (22%), Positives = 18/35 (51%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNK 46
+E+E +EEE+ +E ++++ E E +
Sbjct: 370 VDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDG 404
Score = 27.8 bits (62), Expect = 4.8
Identities = 10/29 (34%), Positives = 20/29 (68%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
E +E++ EEE+++ +E +++EG EE
Sbjct: 368 EEVDEDEDEEEEQRSDEHEEEEGEDSEEE 396
Score = 26.6 bits (59), Expect = 8.9
Identities = 7/29 (24%), Positives = 13/29 (44%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
EE E+ +EE + E+ + G +
Sbjct: 387 EEEGEDSEEEGSQSREDGSSESSSDVGSD 415
Score = 26.6 bits (59), Expect = 9.2
Identities = 7/34 (20%), Positives = 18/34 (52%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
E + E+ ++ EEE+ ++ E++ + +G
Sbjct: 373 DEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSS 406
Score = 26.6 bits (59), Expect = 9.2
Identities = 9/30 (30%), Positives = 17/30 (56%)
Query: 10 ISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
I EE ++ E+E+E++ + +E E E
Sbjct: 365 IDFEEVDEDEDEEEEQRSDEHEEEEGEDSE 394
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 29.1 bits (66), Expect = 0.47
Identities = 11/16 (68%), Positives = 14/16 (87%)
Query: 12 EEEEEKKEEEKEKEEE 27
EEEKKEEE+E+EE+
Sbjct: 65 AAEEEKKEEEEEEEED 80
Score = 29.1 bits (66), Expect = 0.49
Identities = 9/16 (56%), Positives = 13/16 (81%)
Query: 12 EEEEEKKEEEKEKEEE 27
EE+K+EE+E+EEE
Sbjct: 64 AAAEEEKKEEEEEEEE 79
Score = 27.6 bits (62), Expect = 1.6
Identities = 9/16 (56%), Positives = 15/16 (93%)
Query: 12 EEEEEKKEEEKEKEEE 27
EEE+K+EEE+E+E++
Sbjct: 66 AEEEKKEEEEEEEEDD 81
Score = 26.4 bits (59), Expect = 3.8
Identities = 6/17 (35%), Positives = 13/17 (76%)
Query: 11 SEEEEEKKEEEKEKEEE 27
+ E++++E+E+EEE
Sbjct: 62 AAAAAEEEKKEEEEEEE 78
Score = 26.0 bits (58), Expect = 5.8
Identities = 7/21 (33%), Positives = 14/21 (66%)
Query: 13 EEEEKKEEEKEKEEEKKKKEG 33
EEEK++EEE+++++
Sbjct: 61 AAAAAAEEEKKEEEEEEEEDD 81
Score = 25.7 bits (57), Expect = 7.9
Identities = 7/17 (41%), Positives = 11/17 (64%)
Query: 11 SEEEEEKKEEEKEKEEE 27
+ +EE+KE+EEE
Sbjct: 60 AAAAAAAEEEKKEEEEE 76
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 30.8 bits (69), Expect = 0.48
Identities = 15/38 (39%), Positives = 22/38 (57%)
Query: 9 KISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNK 46
K++EEEE + +EE E+ EE KK+ + G E K
Sbjct: 239 KVAEEEEFELDEEHEEAEEDKKEALEKIGAEGDEEKFK 276
Score = 29.3 bits (65), Expect = 1.8
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGRE 36
E +EE E+ EE+K++ EK EG E
Sbjct: 244 EEFELDEEHEEAEEDKKEALEKIGAEGDEE 273
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 30.1 bits (68), Expect = 0.66
Identities = 9/26 (34%), Positives = 21/26 (80%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKE 32
E K +++++K+EEE+ K+ +K+++E
Sbjct: 188 ELKKLKQQQQKREEERRKQRKKQQEE 213
Score = 27.8 bits (62), Expect = 4.3
Identities = 9/38 (23%), Positives = 26/38 (68%)
Query: 2 TPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+ ++++ E E +K +++++K EE+++K+ ++ EE
Sbjct: 176 SQEEAKKRLQEWELKKLKQQQQKREEERRKQRKKQQEE 213
Score = 26.6 bits (59), Expect = 8.3
Identities = 9/27 (33%), Positives = 20/27 (74%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEG 33
++K EE +++++++E+EE K+K E
Sbjct: 196 QQKREEERRKQRKKQQEEEERKQKAEE 222
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 30.3 bits (69), Expect = 0.70
Identities = 13/33 (39%), Positives = 18/33 (54%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
K +E+EE +K K+K E KK G +GE
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGET 453
Score = 27.2 bits (61), Expect = 7.5
Identities = 7/35 (20%), Positives = 17/35 (48%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNK 46
E++ +KEE ++ +KK + ++ + K
Sbjct: 421 AEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKK 455
Score = 26.8 bits (60), Expect = 9.1
Identities = 10/29 (34%), Positives = 17/29 (58%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEG 33
+ +K + E++ ++EE EK KKK E
Sbjct: 413 KLRKKQRKAEKKAEKEEAEKAAAKKKAEA 441
>gnl|CDD|235425 PRK05349, PRK05349, Na(+)-translocating NADH-quinone reductase
subunit B; Provisional.
Length = 405
Score = 30.2 bits (69), Expect = 0.73
Identities = 17/49 (34%), Positives = 23/49 (46%), Gaps = 10/49 (20%)
Query: 108 RFVAGSLAGGVS----LN-VPDDFTAKEMLTGM-WWRHLVAGGVA-GAV 149
R VAG + G ++ N + D + M W+ HLV GG A G V
Sbjct: 287 RIVAGVMIGMIATSLLFNLIGSDTNP---MFAMPWYWHLVLGGFAFGMV 332
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 30.0 bits (68), Expect = 0.81
Identities = 13/42 (30%), Positives = 23/42 (54%), Gaps = 2/42 (4%)
Query: 7 ERKISEEEEEKKEEE--KEKEEEKKKKEGGREGEELGVHGNK 46
+R +EEE K+E E K+ E E+ ++ G E ++ +K
Sbjct: 255 DRTKTEEELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSK 296
Score = 28.0 bits (63), Expect = 4.1
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
++E+ +EEE+E++E+KKKK EL
Sbjct: 362 DSDDEDDEEEEEEEKEKKKKKSAESTRSEL 391
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more
internally with limited reactivity in the C-terminal
domains, while P2 proteins seem to be more externally
located and are more likely to interact with other
cellular components. In lower eukaryotes, P1 and P2 are
further subdivided into P1A, P1B, P2A, and P2B, which
form P1A/P2B and P1B/P2A heterodimers. Some plant
species have a third P-protein, called P3, which is not
homologous to P1 and P2. In humans, P1 and P2 are
strongly autoimmunogenic. They play a significant role
in the etiology and pathogenesis of systemic lupus
erythema (SLE). In addition, the ribosome-inactivating
protein trichosanthin (TCS) interacts with human P0,
P1, and P2, with its primary binding site located in
the C-terminal region of P2. TCS inactivates the
ribosome by depurinating a specific adenine in the
sarcin-ricin loop of 28S rRNA.
Length = 103
Score = 28.4 bits (64), Expect = 0.84
Identities = 8/16 (50%), Positives = 14/16 (87%)
Query: 12 EEEEEKKEEEKEKEEE 27
++EEKKEEE+E+ ++
Sbjct: 81 AKKEEKKEEEEEESDD 96
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 30.0 bits (68), Expect = 0.92
Identities = 12/33 (36%), Positives = 21/33 (63%), Gaps = 3/33 (9%)
Query: 3 PIAIERKISEEEEEKKEEE---KEKEEEKKKKE 32
++R+ E+E K+++ +K+EEKKKKE
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKE 582
Score = 28.5 bits (64), Expect = 2.7
Identities = 10/28 (35%), Positives = 19/28 (67%)
Query: 13 EEEEKKEEEKEKEEEKKKKEGGREGEEL 40
E+EEK+ +++K K KK+ ++ +EL
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKEL 583
Score = 26.9 bits (60), Expect = 8.9
Identities = 9/27 (33%), Positives = 19/27 (70%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKE 32
E K + +E+++ + K++EE+KKK+
Sbjct: 557 KEEKEALKEQKRLRKLKKQEEKKKKEL 583
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 28.2 bits (63), Expect = 0.92
Identities = 7/39 (17%), Positives = 20/39 (51%), Gaps = 3/39 (7%)
Query: 4 IAIERKI---SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+ I R + EE E+ ++E + +++ K + + ++
Sbjct: 37 VEIARSVMIEGEEPEDDDDDEDDDDDDDKDDKDDDDDDD 75
>gnl|CDD|204614 pfam11221, Med21, Subunit 21 of Mediator complex. Med21 has been
known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in
Drosophila. The heterodimer of the two subunits Med7 and
Med21 appears to act as a hinge between the middle and
the tail regions of Mediator.
Length = 132
Score = 28.8 bits (65), Expect = 0.93
Identities = 14/26 (53%), Positives = 18/26 (69%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKE 32
R+I E EEE +E E E+EE K+KE
Sbjct: 91 LRRIKELEEELREVEAEREEAVKEKE 116
Score = 27.2 bits (61), Expect = 3.1
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKK 30
E ++ E E E++E KEKE+ KK
Sbjct: 98 EEELREVEAEREEAVKEKEKLLKK 121
Score = 26.5 bits (59), Expect = 5.8
Identities = 13/33 (39%), Positives = 22/33 (66%), Gaps = 3/33 (9%)
Query: 11 SEEEEE---KKEEEKEKEEEKKKKEGGREGEEL 40
SEEE+ K+ EE+ +E E +++E +E E+L
Sbjct: 86 SEEEQLRRIKELEEELREVEAEREEAVKEKEKL 118
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 29.3 bits (66), Expect = 0.93
Identities = 13/39 (33%), Positives = 21/39 (53%)
Query: 1 MTPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
I+ K+S E++ KE E E +E+ +K+ E EE
Sbjct: 144 EEEEDIDEKLSMLEKKLKELEAEDVDEEDEKDEEEEEEE 182
Score = 27.8 bits (62), Expect = 3.5
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
E++ K+ E ++ +EE +K E E EE
Sbjct: 156 LEKKLKELEAEDVDEEDEKDEEEEEEEE 183
Score = 26.3 bits (58), Expect = 9.8
Identities = 10/16 (62%), Positives = 15/16 (93%)
Query: 12 EEEEEKKEEEKEKEEE 27
EE+E+ +EEE+E+EEE
Sbjct: 170 EEDEKDEEEEEEEEEE 185
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 29.9 bits (67), Expect = 0.94
Identities = 17/33 (51%), Positives = 21/33 (63%), Gaps = 3/33 (9%)
Query: 1 MTPIAIERKISEEEE---EKKEEEKEKEEEKKK 30
T E+KI EEE +KK+EEK KE+E KK
Sbjct: 3 RTESEAEKKILTEEELERKKKKEEKAKEKELKK 35
Score = 29.5 bits (66), Expect = 1.3
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+ E E+K E+E E +KKK+E +E E
Sbjct: 2 SRTESEAEKKILTEEELERKKKKEEKAKEKEL 33
Score = 27.6 bits (61), Expect = 5.5
Identities = 13/32 (40%), Positives = 20/32 (62%), Gaps = 1/32 (3%)
Query: 1 MTPIAIERKISEEEEEK-KEEEKEKEEEKKKK 31
+T +ERK +EE+ K KE +K K +K+ K
Sbjct: 13 LTEEELERKKKKEEKAKEKELKKLKAAQKEAK 44
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.8 bits (67), Expect = 0.97
Identities = 14/29 (48%), Positives = 20/29 (68%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
SEE E K +EEK K+EE +KK+ + E+
Sbjct: 395 SEEAEAKAKEEKLKQEENEKKQKEQADED 423
Score = 29.4 bits (66), Expect = 1.2
Identities = 12/33 (36%), Positives = 24/33 (72%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREG 37
A E K+ +EE EKK++E+ E+++K+++ R+
Sbjct: 402 AKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 29.6 bits (66), Expect = 0.98
Identities = 13/33 (39%), Positives = 24/33 (72%), Gaps = 2/33 (6%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
E K E++ ++++EE+E EEE K++EG +G +
Sbjct: 212 EHKTEEKQPQEEQEEEEVEEEAKQEEG--QGTD 242
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 29.9 bits (68), Expect = 0.98
Identities = 10/24 (41%), Positives = 17/24 (70%), Gaps = 2/24 (8%)
Query: 11 SEEEEEK--KEEEKEKEEEKKKKE 32
S++E E+ K+ E+ E+KK+KE
Sbjct: 502 SDDEIERMVKDAEEYAAEDKKRKE 525
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 28.8 bits (65), Expect = 1.1
Identities = 10/21 (47%), Positives = 17/21 (80%)
Query: 12 EEEEEKKEEEKEKEEEKKKKE 32
+EEE K++ E+ KE E++KK+
Sbjct: 106 DEEEAKRQHEEAKEREREKKK 126
Score = 28.0 bits (63), Expect = 1.7
Identities = 8/35 (22%), Positives = 19/35 (54%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
I+ + + +E++ ++EE K++ E +E E
Sbjct: 89 IDERAEAQRARDEEKKLDEEEAKRQHEEAKERERE 123
Score = 27.2 bits (61), Expect = 3.5
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKK 30
++ EE + + EE KE+E EKKK
Sbjct: 101 EEKKLDEEEAKRQHEEAKEREREKKK 126
>gnl|CDD|185494 PTZ00168, PTZ00168, mitochondrial carrier protein; Provisional.
Length = 259
Score = 29.5 bits (66), Expect = 1.1
Identities = 13/54 (24%), Positives = 28/54 (51%), Gaps = 6/54 (11%)
Query: 141 VAGGVAGAVSRTCTAPLDRLKVYLQVHGNK-TTGIKKCLLHLLHEGGFLSLWRG 193
+ GG+AG ++ T P+D +K ++G + + + E G+L+ ++G
Sbjct: 184 ICGGLAGGIAGFLTTPVDVIKSRQIIYGKSYIETVTE-----IAEEGYLTFYKG 232
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 29.3 bits (66), Expect = 1.1
Identities = 16/33 (48%), Positives = 26/33 (78%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
ERK EE EE++E++KE+EE K+++E R+ +E
Sbjct: 41 ERKEEEELEEEREKKKEEEERKEREEQARKEQE 73
Score = 27.4 bits (61), Expect = 3.8
Identities = 15/35 (42%), Positives = 23/35 (65%), Gaps = 3/35 (8%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
E ++ EE E+KKEEE+ KE E++ + +E EE
Sbjct: 44 EEEELEEEREKKKEEEERKEREEQAR---KEQEEY 75
Score = 27.4 bits (61), Expect = 4.5
Identities = 15/28 (53%), Positives = 19/28 (67%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
E E K+EEE E+E EKKK+E R+ E
Sbjct: 38 REGERKEEEELEEEREKKKEEEERKERE 65
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 29.6 bits (67), Expect = 1.2
Identities = 14/38 (36%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
Query: 4 IAIERKISE-EEEEKKEEEKEKEEEKKKKEGGREGEEL 40
++RK+ E E E+K+EE +EK +K ++ G + EE+
Sbjct: 87 QGLQRKLKELEREQKEEEVREKHNKKIIEKFGEDLEEV 124
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal.
This domain is found to the N-terminus of bacterial
signal peptidases of the S49 family (pfam01343).
Length = 154
Score = 28.6 bits (65), Expect = 1.2
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKE 32
E K E+ E+K E+ K K E+KK K+
Sbjct: 67 ELKAWEKAEKKAEKAKAKAEKKKAKK 92
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 29.7 bits (67), Expect = 1.2
Identities = 16/33 (48%), Positives = 22/33 (66%), Gaps = 4/33 (12%)
Query: 7 ERKISEEEEEKKEE----EKEKEEEKKKKEGGR 35
E KIS+ EEK ++ EK+KEEE K++E R
Sbjct: 734 EFKISDSVEEKTKKKKKKEKKKEEEYKREEKAR 766
Score = 28.5 bits (64), Expect = 3.0
Identities = 11/32 (34%), Positives = 21/32 (65%)
Query: 1 MTPIAIERKISEEEEEKKEEEKEKEEEKKKKE 32
++ E+ ++++EKK+EE+ K EEK + E
Sbjct: 737 ISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 29.2 bits (65), Expect = 1.3
Identities = 12/40 (30%), Positives = 19/40 (47%)
Query: 10 ISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTG 49
+ EEEE+ EE +EEE+ ++ G E N+
Sbjct: 174 VDNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEE 213
Score = 27.3 bits (60), Expect = 6.3
Identities = 12/33 (36%), Positives = 17/33 (51%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
E + EE + ++EEE E+ EG RE E
Sbjct: 178 EEERLEESDGREEEEDEEVGSDSYGEGNRELNE 210
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 29.3 bits (65), Expect = 1.3
Identities = 12/34 (35%), Positives = 24/34 (70%)
Query: 3 PIAIERKISEEEEEKKEEEKEKEEEKKKKEGGRE 36
P ++ +++EKKE+EKE++++KKK+ G +
Sbjct: 193 PAVEKKSKKPKKKEKKEKEKERDKDKKKEVEGFK 226
Score = 27.7 bits (61), Expect = 4.6
Identities = 14/36 (38%), Positives = 26/36 (72%), Gaps = 1/36 (2%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
A+E+K S++ ++K+++EKEKE +K KK+ + L
Sbjct: 194 AVEKK-SKKPKKKEKKEKEKERDKDKKKEVEGFKSL 228
>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210). This is
a family of proteins conserved in yeasts. The function
is not known. The Schizosaccharomyces pombe member is
SPBC18E5.07 and the Saccharomyces cerevisiae member is
AIM21.
Length = 671
Score = 29.5 bits (66), Expect = 1.4
Identities = 13/46 (28%), Positives = 23/46 (50%), Gaps = 2/46 (4%)
Query: 3 PIAIERKISEEEEEKKEEEKEKEEEKKKKEGG--REGEELGVHGNK 46
P A ++ + +E KE+++E E+ K+K R+G G K
Sbjct: 598 PQAPKKVLESPKEPSKEKKEEDEDTKEKAPLSDARKGRARGPARRK 643
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 29.5 bits (66), Expect = 1.4
Identities = 17/33 (51%), Positives = 26/33 (78%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
E + EEEEE++EEE+E+E+E +++EG E EE
Sbjct: 441 EEESVEEEEEEEEEEEEEEQESEEEEGEDEEEE 473
Score = 29.1 bits (65), Expect = 2.0
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
EEE++EEE+E+EEE++ +E E EE
Sbjct: 443 ESVEEEEEEEEEEEEEEQESEEEEGEDEEE 472
Score = 28.3 bits (63), Expect = 2.9
Identities = 15/34 (44%), Positives = 21/34 (61%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
E + EEE+E +EEE E EEE+++ E EE
Sbjct: 451 EEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEE 484
>gnl|CDD|226040 COG3509, LpqC, Poly(3-hydroxybutyrate) depolymerase [Secondary
metabolites biosynthesis, transport, and catabolism].
Length = 312
Score = 29.3 bits (66), Expect = 1.4
Identities = 21/90 (23%), Positives = 31/90 (34%), Gaps = 25/90 (27%)
Query: 109 FVAGSLAGG-----VSLNVPDDFTAKEMLTGMWWRHLVAGGVAGAVSRTCTAPLDRLKVY 163
+V G GG ++ PD F A VAG +A V+ T P+ +
Sbjct: 147 YVTGLSNGGRMANRLACEYPDIFAAIA---------PVAGLLALGVACTPPRPVSVMA-- 195
Query: 164 LQVHGNKTTGIKKCLLHLLHEGGFLSLWRG 193
HG + GG + + RG
Sbjct: 196 --FHGTADPLNP-------YHGGGVPIGRG 216
>gnl|CDD|234715 PRK00290, dnaK, molecular chaperone DnaK; Provisional.
Length = 627
Score = 29.3 bits (67), Expect = 1.5
Identities = 12/24 (50%), Positives = 17/24 (70%), Gaps = 2/24 (8%)
Query: 11 SEEEEEK--KEEEKEKEEEKKKKE 32
S+EE E+ K+ E EE+KK+KE
Sbjct: 501 SDEEIERMVKDAEANAEEDKKRKE 524
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 29.0 bits (65), Expect = 1.5
Identities = 17/37 (45%), Positives = 26/37 (70%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGV 42
IE +ISE+ EE++ EE E++EE++ +E E E LG
Sbjct: 71 IEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGD 107
>gnl|CDD|187630 cd05372, ENR_SDR, Enoyl acyl carrier protein (ACP) reductase (ENR),
divergent SDR. This bacterial subgroup of ENRs includes
Escherichia coli ENR. ENR catalyzes the
NAD(P)H-dependent reduction of enoyl-ACP in the last
step of fatty acid biosynthesis. De novo fatty acid
biosynthesis is catalyzed by the fatty acid synthetase
complex, through the serial addition of 2-carbon
subunits. In bacteria and plants,ENR catalyzes one of
six synthetic steps in this process. Oilseed rape ENR,
and also apparently the NADH-specific form of
Escherichia coli ENR, is tetrameric. Although similar
to the classical SDRs, this group does not have the
canonical catalytic tetrad, nor does it have the typical
Gly-rich NAD-binding pattern. Such so-called divergent
SDRs have a GXXXXXSXA NAD-binding motif and a YXXMXXXK
(or YXXXMXXXK) active site motif. SDRs are a
functionally diverse family of oxidoreductases that have
a single domain with a structurally conserved Rossmann
fold (alpha/beta folding pattern with a central
beta-sheet), an NAD(P)(H)-binding region, and a
structurally diverse C-terminal region. Classical SDRs
are typically about 250 residues long, while extended
SDRs are approximately 350 residues. Sequence identity
between different SDR enzymes are typically in the
15-30% range, but the enzymes share the Rossmann fold
NAD-binding motif and characteristic NAD-binding and
catalytic sequence patterns. These enzymes catalyze a
wide range of activities including the metabolism of
steroids, cofactors, carbohydrates, lipids, aromatic
compounds, and amino acids, and act in redox sensing.
Classical SDRs have an TGXXX[AG]XG cofactor binding
motif and a YXXXK active site motif, with the Tyr
residue of the active site motif serving as a critical
catalytic residue (Tyr-151, human
15-hydroxyprostaglandin dehydrogenase (15-PGDH)
numbering). In addition to the Tyr and Lys, there is
often an upstream Ser (Ser-138, 15-PGDH numbering)
and/or an Asn (Asn-107, 15-PGDH numbering) contributing
to the active site; while substrate binding is in the
C-terminal region, which determines specificity. The
standard reaction mechanism is a 4-pro-S hydride
transfer and proton relay involving the conserved Tyr
and Lys, a water molecule stabilized by Asn, and
nicotinamide. Extended SDRs have additional elements in
the C-terminal region, and typically have a TGXXGXXG
cofactor binding motif. Complex (multidomain) SDRs such
as ketoreductase domains of fatty acid synthase have a
GGXGXXG NAD(P)-binding motif and an altered active site
motif (YXXXN). Fungal type ketoacyl reductases have a
TGXXXGX(1-2)G NAD(P)-binding motif. Some atypical SDRs
have lost catalytic activity and/or have an unusual
NAD(P)-binding motif and missing or unusual active site
residues. Reactions catalyzed within the SDR family
include isomerization, decarboxylation, epimerization,
C=N bond reduction, dehydratase activity,
dehalogenation, Enoyl-CoA reduction, and
carbonyl-alcohol oxidoreduction.
Length = 250
Score = 28.7 bits (65), Expect = 1.7
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 70 NGINVLKIAPESALKFMAYEQAKRLIRGN 98
N + V K A ES+++++AYE ++ IR N
Sbjct: 152 NVMGVAKAALESSVRYLAYELGRKGIRVN 180
>gnl|CDD|167649 PRK03963, PRK03963, V-type ATP synthase subunit E; Provisional.
Length = 198
Score = 28.6 bits (64), Expect = 1.7
Identities = 15/37 (40%), Positives = 22/37 (59%), Gaps = 3/37 (8%)
Query: 6 IERKISEEEEEKKE---EEKEKEEEKKKKEGGREGEE 39
I ++I+ E E+K E EE +KE EK K+E + E
Sbjct: 7 IIQEINREAEQKIEYILEEAQKEAEKIKEEARKRAES 43
>gnl|CDD|181163 PRK07906, PRK07906, hypothetical protein; Provisional.
Length = 426
Score = 29.0 bits (66), Expect = 1.7
Identities = 19/55 (34%), Positives = 23/55 (41%), Gaps = 18/55 (32%)
Query: 116 GGVSLNVPDDF------TAKEMLTGMWWRHLVAGGVAG---------AVSRTCTA 155
GG SL VP TA++ G+ W L A G AG AV+R A
Sbjct: 169 GGFSLTVPGRDRLYLIETAEK---GLAWMRLTARGRAGHGSMVNDDNAVTRLAEA 220
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation
factor 3 (eIF3). In yeast it is called Hcr1. The
Saccharomyces cerevisiae protein eIF3j (HCR1) has been
shown to be required for processing of 20S pre-rRNA and
binds to 18S rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 28.9 bits (65), Expect = 1.7
Identities = 12/20 (60%), Positives = 14/20 (70%)
Query: 12 EEEEEKKEEEKEKEEEKKKK 31
EEE+E+KEEEK K K K
Sbjct: 39 EEEDEEKEEEKAKVAAKAKA 58
Score = 27.3 bits (61), Expect = 4.9
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 11 SEEEEEKKEEEKEKEEEKKKK 31
EE+EEK+EE+ + + K K
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAK 59
Score = 26.9 bits (60), Expect = 6.8
Identities = 12/25 (48%), Positives = 18/25 (72%)
Query: 16 EKKEEEKEKEEEKKKKEGGREGEEL 40
+ K EEKEK + +K+++G RE EE
Sbjct: 63 KAKIEEKEKAKREKEEKGLRELEED 87
>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal. This is the N terminal of
the DNA repair protein Rad54.
Length = 191
Score = 28.5 bits (64), Expect = 1.7
Identities = 15/45 (33%), Positives = 22/45 (48%), Gaps = 11/45 (24%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKT----TGIKK 52
+ EEEK E+++E EE + K G +K+ GIKK
Sbjct: 146 KIEEEKAEKDQEPEESETK-------LSNGPKVHKSLAEILGIKK 183
>gnl|CDD|234366 TIGR03825, FliH_bacil, flagellar assembly protein FliH. This
bacillus clade of FliH proteins is not found by the Pfam
FliH model pfam02108, but is closely related to the
sequences identified by that model. Sequences identified
by this model are observed in flagellar operons in an
analogous position relative to other flagellar operon
genes.
Length = 255
Score = 28.6 bits (64), Expect = 1.8
Identities = 14/37 (37%), Positives = 23/37 (62%), Gaps = 1/37 (2%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
AI +I E+E + EEE+E+ ++ K+EG G + G
Sbjct: 66 AIREQI-EQERAQWEEERERLIQEAKQEGYEAGFQAG 101
>gnl|CDD|218688 pfam05680, ATP-synt_E, ATP synthase E chain. This family
consists of several ATP synthase E chain sequences
which are components of the CF(0) subunit.
Length = 83
Score = 27.4 bits (61), Expect = 1.8
Identities = 14/35 (40%), Positives = 19/35 (54%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
E KI E E ++K EK K KKK+ +E + L
Sbjct: 35 KEEKIREYEAQEKLIEKAKAAYAKKKQAKQEAKSL 69
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 28.5 bits (64), Expect = 1.9
Identities = 7/33 (21%), Positives = 23/33 (69%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
++ ++++ KK+++K+K+++ KK+ E ++
Sbjct: 82 QKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114
Score = 27.3 bits (61), Expect = 4.2
Identities = 7/29 (24%), Positives = 20/29 (68%)
Query: 4 IAIERKISEEEEEKKEEEKEKEEEKKKKE 32
+K +++++KK+++K+ + EKK ++
Sbjct: 88 KKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116
Score = 26.6 bits (59), Expect = 8.5
Identities = 7/28 (25%), Positives = 22/28 (78%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
++ ++KK+++K+K+++KK + ++ E+
Sbjct: 89 KKSKKKKDKDKDKKDDKKDDKSEKKDEK 116
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 28.5 bits (63), Expect = 2.0
Identities = 18/49 (36%), Positives = 29/49 (59%), Gaps = 4/49 (8%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGV----HGNKTTG 49
A + S+E+EE++EEE+E+E E ++ E G G HGN ++G
Sbjct: 129 ATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSG 177
Score = 27.7 bits (61), Expect = 4.0
Identities = 15/43 (34%), Positives = 27/43 (62%), Gaps = 2/43 (4%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKT 47
+K ++E+E ++EE+E+EEE+++ E E E G +G T
Sbjct: 124 NAGKKATKEDESDEDEEEEEEEEEEEAE--VEENEQGTNGTST 164
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 27.3 bits (61), Expect = 2.2
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 12 EEEEEKKEEEKEKEEEKKK 30
EEE +KEEE+EK K+K
Sbjct: 88 EEERLEKEEEREKRARKRK 106
Score = 27.0 bits (60), Expect = 3.3
Identities = 7/20 (35%), Positives = 16/20 (80%)
Query: 12 EEEEEKKEEEKEKEEEKKKK 31
E+ EE++ E++E+ E++ +K
Sbjct: 85 EDREEERLEKEEEREKRARK 104
Score = 25.8 bits (57), Expect = 8.5
Identities = 8/19 (42%), Positives = 15/19 (78%)
Query: 13 EEEEKKEEEKEKEEEKKKK 31
EEE+ E+E+E+E+ +K+
Sbjct: 87 REEERLEKEEEREKRARKR 105
Score = 25.8 bits (57), Expect = 8.7
Identities = 9/20 (45%), Positives = 15/20 (75%)
Query: 13 EEEEKKEEEKEKEEEKKKKE 32
E+++EE EKEEE++K+
Sbjct: 83 RLEDREEERLEKEEEREKRA 102
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 28.0 bits (63), Expect = 2.2
Identities = 10/24 (41%), Positives = 17/24 (70%)
Query: 9 KISEEEEEKKEEEKEKEEEKKKKE 32
++ E+E K + EK +E E+K+KE
Sbjct: 122 ELLEKELAKLKREKRRENERKQKE 145
Score = 27.3 bits (61), Expect = 4.6
Identities = 13/27 (48%), Positives = 17/27 (62%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGE 38
+EEE+ +E EKE K K+E RE E
Sbjct: 114 LDEEEQIDELLEKELAKLKREKRRENE 140
Score = 26.1 bits (58), Expect = 8.9
Identities = 7/27 (25%), Positives = 16/27 (59%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKE 32
I+ + +E + K E++ + E K+K+
Sbjct: 120 IDELLEKELAKLKREKRRENERKQKEI 146
>gnl|CDD|215645 PLN03246, PLN03246, 26S proteasome regulatory subunit; Provisional.
Length = 303
Score = 28.6 bits (64), Expect = 2.3
Identities = 11/29 (37%), Positives = 14/29 (48%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGG 34
I KI +E EK E+ K + KK G
Sbjct: 275 INNKILNKEHEKAEDAKPAAKPAKKGSGK 303
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 27.4 bits (61), Expect = 2.3
Identities = 9/23 (39%), Positives = 15/23 (65%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGG 34
E +E +EEEKE+E E++ +
Sbjct: 81 AEADEAEEEEKEEEAEEESDDDM 103
Score = 27.4 bits (61), Expect = 2.7
Identities = 10/37 (27%), Positives = 16/37 (43%)
Query: 3 PIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
A + E E E EEE+K++E E ++
Sbjct: 65 AAAGAAAAAAAGAEAAAEADEAEEEEKEEEAEEESDD 101
Score = 25.8 bits (57), Expect = 8.3
Identities = 8/33 (24%), Positives = 17/33 (51%)
Query: 2 TPIAIERKISEEEEEKKEEEKEKEEEKKKKEGG 34
A +E +E ++EE++E+ EE+ +
Sbjct: 72 AAAAGAEAAAEADEAEEEEKEEEAEEESDDDML 104
Score = 25.8 bits (57), Expect = 8.5
Identities = 6/29 (20%), Positives = 15/29 (51%)
Query: 13 EEEEKKEEEKEKEEEKKKKEGGREGEELG 41
E E ++ +EEEK+++ +++
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEEESDDDML 104
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel,
four-helix bundle structure with a large hydrophobic
patch in which 23 residues of one monomer form van der
Waals contacts with the other monomer. This CENP-B
dimer configuration may be suitable for capturing two
distant CENP-B boxes during centromeric heterochromatin
formation.
Length = 101
Score = 27.4 bits (60), Expect = 2.4
Identities = 7/23 (30%), Positives = 16/23 (69%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEG 33
S+EEE+ +E++E ++E ++
Sbjct: 16 SDEEEDDDDEDEEDDDEDDDEDD 38
Score = 26.3 bits (57), Expect = 5.6
Identities = 6/32 (18%), Positives = 21/32 (65%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEELGV 42
S+ + +++E++ +++EE ++ + +E+ V
Sbjct: 12 SDSDSDEEEDDDDEDEEDDDEDDDEDDDEVPV 43
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 27.8 bits (62), Expect = 2.4
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKE 32
E+K +E E+KE K K+E+K+KKE
Sbjct: 98 EQKEVSKETEEKEAIKAKKEKKEKKE 123
Score = 27.8 bits (62), Expect = 2.9
Identities = 9/23 (39%), Positives = 15/23 (65%)
Query: 10 ISEEEEEKKEEEKEKEEEKKKKE 32
+ EE EK+++E KE E+K+
Sbjct: 90 LFPEESEKEQKEVSKETEEKEAI 112
Score = 27.4 bits (61), Expect = 3.9
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKK 31
E EE+E + +KEK+E+K+KK
Sbjct: 101 EVSKETEEKEAIKAKKEKKEKKEKK 125
Score = 26.3 bits (58), Expect = 8.5
Identities = 9/31 (29%), Positives = 19/31 (61%)
Query: 2 TPIAIERKISEEEEEKKEEEKEKEEEKKKKE 32
P E++ E +E +E+E K +++KK++
Sbjct: 91 FPEESEKEQKEVSKETEEKEAIKAKKEKKEK 121
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap
is involved in recombination. Rap (recombination adept
with plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 28.1 bits (63), Expect = 2.5
Identities = 10/29 (34%), Positives = 21/29 (72%)
Query: 4 IAIERKISEEEEEKKEEEKEKEEEKKKKE 32
+A++R+ ++E++ K E + E+ E K +KE
Sbjct: 32 LALKREKAQEKKRKAEAQAERRELKARKE 60
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 28.1 bits (63), Expect = 2.5
Identities = 11/25 (44%), Positives = 13/25 (52%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKK 31
ERK + KK +K E KKKK
Sbjct: 172 ERKEALAAAAKKSATPQKVETKKKK 196
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 28.5 bits (64), Expect = 2.8
Identities = 12/32 (37%), Positives = 20/32 (62%)
Query: 9 KISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
+ + EEEE++EEE+E+EEE + E+
Sbjct: 401 RWAAEEEEEEEEEEEEEEEPVAEVMMMPAPEM 432
>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
Length = 103
Score = 27.1 bits (60), Expect = 2.8
Identities = 19/39 (48%), Positives = 25/39 (64%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNK 46
+KI EE +K EEE +K EKKKKEG E +++ G K
Sbjct: 38 KKIEEEIIKKAEEEAQKLIEKKKKEGEEEAKKILEEGEK 76
>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812). This
family consists of several eukaryotic proteins of
unknown function.
Length = 536
Score = 28.6 bits (64), Expect = 2.8
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKKE 32
++++EE ++E +E EEE K K+
Sbjct: 301 KQVNEELTTVRQENEELEEEYKIKK 325
>gnl|CDD|187737 cd09616, Peptidase_C12_UCH_L1_L3, Cysteine peptidase C12 containing
ubiquitin carboxyl-terminal hydrolase (UCH) families L1
and L3. This ubiquitin C-terminal hydrolase (UCH)
family includes UCH-L1 and UCH-L3, the two members
sharing around 53% sequence identity as well as
conserved catalytic residues. Both enzymes hydrolyze
carboxyl terminal esters and amides of ubiquitin (Ub).
UCH-L1, in dimeric form, has additional enzymatic
activity as a ubiquitin ligase. It is highly abundant in
the brain, constituting up to 2% of total protein, and
is expressed exclusively in neurons and testes. Abnormal
expression of UCH-L1 has been shown to correlate with
several forms of cancer, including several primary lung
tumors, lung tumor cell lines, and colorectal cancers.
Mutations in the UCH-L1 gene have been linked to
susceptibility to and protection from Parkinson's
disease (PD); dysfunction of the hydrolase activity can
lead to an accumulation of alpha-synuclein, which is
linked to Parkinson's disease (PD), while accumulation
of neurofibrillary tangles is linked to Alzheimer's
disease (AD). UCH-L3 hydrolyzes isopeptide bonds at the
C-terminal glycine of either Ub or Nedd8, a
ubiquitin-like protein. It can also interact with
Lys48-linked Ub dimers to protect them from degradation
while inhibiting its hydrolase activity at the same
time. Unlike UCH-L1, neither dimerization nor ligase
activity have been observed for UCH-L3. It has been
shown that levels of Nedd8 and the apoptotic protein p53
and Bax are elevated in UCH-L3 knockout mice upon
cryptorchid injury, possibly contributing to profound
germ cell loss via apoptosis.
Length = 222
Score = 28.0 bits (63), Expect = 3.0
Identities = 18/70 (25%), Positives = 33/70 (47%), Gaps = 10/70 (14%)
Query: 16 EKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKKC-LLHLLHEGGFLSLWRGNGINV 74
+ EE +++EEE+ K++G E + +T G C + L+H ++ N +
Sbjct: 55 KAYEEFRKEEEEEIKEKGQEVSESV-YFMKQTIG-NACGTIALIH-----AV--ANNEDR 105
Query: 75 LKIAPESALK 84
+ I S LK
Sbjct: 106 INILEGSFLK 115
>gnl|CDD|223696 COG0623, FabI, Enoyl-[acyl-carrier-protein].
Length = 259
Score = 27.9 bits (63), Expect = 3.1
Identities = 10/29 (34%), Positives = 18/29 (62%)
Query: 70 NGINVLKIAPESALKFMAYEQAKRLIRGN 98
N + V K A E++++++A + K IR N
Sbjct: 156 NVMGVAKAALEASVRYLAADLGKEGIRVN 184
>gnl|CDD|165026 PHA02644, PHA02644, hypothetical protein; Provisional.
Length = 112
Score = 27.3 bits (59), Expect = 3.1
Identities = 14/33 (42%), Positives = 20/33 (60%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
E E E+EKK E+ E E+EKK ++ E E+
Sbjct: 57 EHIKKENEDEKKPEKPENEDEKKPEKPENEDEK 89
Score = 27.3 bits (59), Expect = 3.3
Identities = 13/28 (46%), Positives = 19/28 (67%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
E E+EKK E+ E E+EKK ++ E E+
Sbjct: 73 ENEDEKKPEKPENEDEKKPEKPENEDEK 100
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 28.2 bits (62), Expect = 3.3
Identities = 16/34 (47%), Positives = 20/34 (58%), Gaps = 1/34 (2%)
Query: 4 IAIER-KISEEEEEKKEEEKEKEEEKKKKEGGRE 36
I ER K EE+E + ++ KEEE KKE RE
Sbjct: 1021 IVKERIKDQEEKERMESLQRAKEEEIGKKEKERE 1054
Score = 28.2 bits (62), Expect = 4.0
Identities = 9/28 (32%), Positives = 15/28 (53%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKE 32
++ EEE KKE+E+E+ K +
Sbjct: 1036 ESLQRAKEEEIGKKEKEREQRIRKTIHD 1063
>gnl|CDD|212050 cd11480, SLC5sbd_u4, Uncharacterized bacterial solute carrier 5
subfamily; putative solute-binding domain. SLC5 (also
called the sodium/glucose cotransporter family or solute
sodium symporter family) is a family of proteins that
co-transports Na+ with sugars, amino acids, inorganic
ions or vitamins. Prokaryotic members of this family
include Vibrio parahaemolyticus glucose/galactose
(vSGLT), and Escherichia coli proline (PutP) and
pantothenate (PutF) cotransporters. One member of the
SLC5 family, human SGLT3, has been characterized as a
glucose sensor and not a transporter. This subfamily
belongs to the solute carrier 5 (SLC5) transporter
family.
Length = 484
Score = 27.9 bits (63), Expect = 3.6
Identities = 14/63 (22%), Positives = 23/63 (36%), Gaps = 22/63 (34%)
Query: 109 FVAGSLAGGVSLNVPDDFTAKEM----------------LTGMWWRHL-----VAGGVAG 147
V G +A ++L P A + L G++WR +AG + G
Sbjct: 377 VVIGVVAILLALLAPPQNVA-FLVALAFAIAASAFFPVLLLGIFWRRFTTAGAIAGMLVG 435
Query: 148 AVS 150
+S
Sbjct: 436 LLS 438
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 27.0 bits (60), Expect = 3.6
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEG 33
E + EEEE+++EE +E+ + K K+
Sbjct: 101 ENRKLEEEEKEREELEEENDVTKGKDL 127
>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein. This family includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 116
Score = 27.2 bits (60), Expect = 3.7
Identities = 13/36 (36%), Positives = 21/36 (58%), Gaps = 9/36 (25%)
Query: 5 AIERKISEEE---------EEKKEEEKEKEEEKKKK 31
A +I +EE E+++E +E+EEEK+KK
Sbjct: 76 ATFMEIQKEERRSARMARLEDERELAREEEEEKRKK 111
Score = 26.4 bits (58), Expect = 5.8
Identities = 10/23 (43%), Positives = 17/23 (73%)
Query: 9 KISEEEEEKKEEEKEKEEEKKKK 31
++ +E E +EEE+EK ++KKK
Sbjct: 93 RLEDERELAREEEEEKRKKKKKN 115
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 27.8 bits (62), Expect = 3.8
Identities = 13/33 (39%), Positives = 23/33 (69%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
++ +E+E+E+ EEE E+E+E+ E +E EE
Sbjct: 80 DKGDAEKEDEESEEENEEEDEESSDENEKETEE 112
Score = 27.0 bits (60), Expect = 5.8
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 13 EEEEKKEEEKEKEEEKKKKEGGREGEE 39
E EE KEEEKE + K++ G +E
Sbjct: 61 EIEEVKEEEKEAANSEDKEDKGDAEKE 87
Score = 27.0 bits (60), Expect = 6.8
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
SE++E+K + EKE EE +++ E E
Sbjct: 75 SEDKEDKGDAEKEDEESEEENEEEDEESS 103
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 27.2 bits (61), Expect = 4.4
Identities = 11/37 (29%), Positives = 18/37 (48%), Gaps = 1/37 (2%)
Query: 5 AIERKISEEEEEKKEEEKEKEE-EKKKKEGGREGEEL 40
++ KI+ ++EK E +E E K KE E +
Sbjct: 71 ELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHA 107
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 27.5 bits (61), Expect = 4.4
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKE 32
A+E + EEE+ + +KE+EE++ K
Sbjct: 150 ALEFEKEEEEQRRLLLQKEEEEQQMNKR 177
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 25.4 bits (56), Expect = 4.6
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKK 29
E++ EEE E +EE +E+ E K+
Sbjct: 24 EKREDEEENEDEEEGEEQSEVKR 46
Score = 24.6 bits (54), Expect = 8.4
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 4 IAIERKISEEEEEKKEEEKEKEEEKKK 30
+ E K +EEE + EEE E++ E K+
Sbjct: 20 LCEEEKREDEEENEDEEEGEEQSEVKR 46
Score = 24.2 bits (53), Expect = 9.6
Identities = 10/20 (50%), Positives = 17/20 (85%)
Query: 14 EEEKKEEEKEKEEEKKKKEG 33
EEEK+E+E+E E+E++ +E
Sbjct: 22 EEEKREDEEENEDEEEGEEQ 41
>gnl|CDD|181605 PRK08993, PRK08993, 2-deoxy-D-gluconate 3-dehydrogenase; Validated.
Length = 253
Score = 27.5 bits (61), Expect = 4.7
Identities = 16/46 (34%), Positives = 22/46 (47%), Gaps = 5/46 (10%)
Query: 67 WRGNGINVLKIAPESALKFMAYEQAKRLIRGNYTRELSIYERFVAG 112
W + INV IAP +MA ++L R + R I +R AG
Sbjct: 176 WAKHNINVNAIAP----GYMATNNTQQL-RADEQRSAEILDRIPAG 216
>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
represents the eukaryotic large ribosomal protein P2.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P2 is located in the L12 stalk, with proteins
P1, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers). Bacteria may have four or six copies
of L7/L12 (two or three homodimers) depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plants have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site in the C-terminal region
of P2. TCS inactivates the ribosome by depurinating a
specific adenine in the sarcin-ricin loop of 28S rRNA.
Length = 109
Score = 26.5 bits (59), Expect = 4.7
Identities = 7/17 (41%), Positives = 14/17 (82%)
Query: 11 SEEEEEKKEEEKEKEEE 27
+ ++EEKKEE +E+ ++
Sbjct: 85 AAKKEEKKEESEEESDD 101
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 26.8 bits (59), Expect = 4.8
Identities = 11/19 (57%), Positives = 15/19 (78%)
Query: 15 EEKKEEEKEKEEEKKKKEG 33
E KKEE+KE+EEE++ G
Sbjct: 89 EAKKEEKKEEEEEEEDDLG 107
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 27.6 bits (61), Expect = 4.9
Identities = 13/35 (37%), Positives = 19/35 (54%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
+ I + EE K +E K KE+ KK E + E+L
Sbjct: 186 LAELIIDLEELKLQELKLKEQAKKALEYYQLKEKL 220
>gnl|CDD|204918 pfam12436, USP7, Ubiquitin-specific protease 7. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam00443, pfam00917. USP7 regulates
the turnover of p53.
Length = 35
Score = 24.8 bits (55), Expect = 5.0
Identities = 8/22 (36%), Positives = 14/22 (63%)
Query: 6 IERKISEEEEEKKEEEKEKEEE 27
+ ++ EE EE++ KE+EE
Sbjct: 9 LPFRLEEEREERERRRKEREEA 30
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 27.3 bits (61), Expect = 5.1
Identities = 6/29 (20%), Positives = 16/29 (55%)
Query: 4 IAIERKISEEEEEKKEEEKEKEEEKKKKE 32
+E ++ + K+ E+ KE+E+ ++
Sbjct: 162 EELEEDAADRDARKRAAEEAKEQEELRRR 190
>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
production and conversion].
Length = 108
Score = 26.6 bits (59), Expect = 5.2
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
R+I EE EE+ E+ ++ E+ ++E E EE
Sbjct: 41 AREIIEEAEEEAEKLAQEILEEAREEAEEEAEE 73
>gnl|CDD|220379 pfam09750, DRY_EERY, Alternative splicing regulator. This entry
represents the conserved N-terminal region of SWAP
(suppressor-of-white-apricot protein) proteins. This
region contains two highly conserved motifs, viz: DRY
and EERY, which appear to be the sites for alternative
splicing of exons 2 and 3 of the SWAP mRNA. These
proteins are thus thought to be involved in
auto-regulation of pre-mRNA splicing. Most family
members are associated with two Surp domains pfam01805
and an Arginine- serine-rich binding region towards the
C-terminus.
Length = 121
Score = 26.6 bits (59), Expect = 5.5
Identities = 14/27 (51%), Positives = 18/27 (66%)
Query: 9 KISEEEEEKKEEEKEKEEEKKKKEGGR 35
K EE EK+EEEK + +KKK +GG
Sbjct: 84 KNDFEEAEKEEEEKRELAQKKKSKGGY 110
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 27.5 bits (61), Expect = 5.5
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKE 32
+ S+ EEE +E+EKEK + KK
Sbjct: 277 SGERSDSEEETEEKEKEKRKRLKKMM 302
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 27.4 bits (61), Expect = 5.6
Identities = 14/33 (42%), Positives = 23/33 (69%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
++K + E KK+ EK+K+++K+KKE EGE
Sbjct: 57 DKKEDKNNESKKKSEKKKKKKKEKKEPKSEGET 89
>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK. Members of this
family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
chaperone system. All members of the seed alignment were
taken from completely sequenced bacterial or archaeal
genomes and (except for Mycoplasma sequence) found
clustered with other genes of this systems. This model
excludes DnaK homologs that are not DnaK itself, such as
the heat shock cognate protein HscA (TIGR01991).
However, it is not designed to distinguish among DnaK
paralogs in eukaryotes. Note that a number of dnaK genes
have shadow ORFs in the same reverse (relative to dnaK)
reading frame, a few of which have been assigned
glutamate dehydrogenase activity. The significance of
this observation is unclear; lengths of such shadow ORFs
are highly variable as if the presumptive protein
product is not conserved [Protein fate, Protein folding
and stabilization].
Length = 595
Score = 27.7 bits (62), Expect = 5.7
Identities = 14/26 (53%), Positives = 18/26 (69%), Gaps = 2/26 (7%)
Query: 9 KISEEEEEK--KEEEKEKEEEKKKKE 32
+SEEE E+ KE E EE+KK+KE
Sbjct: 497 GLSEEEIERMVKEAEANAEEDKKRKE 522
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell
cycle progression and biogenesis of 60S ribosomal
subunits. The Schistosoma mansoni Mak16 has been shown
to target protein transport to the nucleolus.
Length = 97
Score = 26.0 bits (57), Expect = 5.7
Identities = 14/40 (35%), Positives = 24/40 (60%), Gaps = 1/40 (2%)
Query: 5 AIERKISEEE-EEKKEEEKEKEEEKKKKEGGREGEELGVH 43
A+E + SEE EE++EEE+E++E + + E E +
Sbjct: 54 ALEAEESEENDEEEEEEEEEEDEGEIEYVSDDEELEEEIE 93
>gnl|CDD|217206 pfam02731, SKIP_SNW, SKIP/SNW domain. This domain is found in
chromatin proteins.
Length = 158
Score = 26.9 bits (60), Expect = 5.8
Identities = 11/27 (40%), Positives = 19/27 (70%), Gaps = 1/27 (3%)
Query: 7 ERKISEEEEEKKEEEKE-KEEEKKKKE 32
ERK EE ++ E +++ E+EK++KE
Sbjct: 118 ERKAREEVRQRAELQRQLAEKEKQEKE 144
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to
be DNA-binding and may be a transcription factor.
Length = 238
Score = 27.0 bits (60), Expect = 6.0
Identities = 9/21 (42%), Positives = 16/21 (76%)
Query: 11 SEEEEEKKEEEKEKEEEKKKK 31
+EEE +KE ++E+ +KKK+
Sbjct: 69 DDEEEGEKELQREERLKKKKR 89
>gnl|CDD|217553 pfam03430, TATR, Trans-activating transcriptional regulator. This
family of trans-activating transcriptional regulator
(TATR), also known as intermediate early protein 1, are
common to the Nucleopolyhedroviruses.
Length = 571
Score = 27.6 bits (61), Expect = 6.0
Identities = 8/46 (17%), Positives = 19/46 (41%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKKC 53
RK SE + + + K ++ K R+ + + T+ ++
Sbjct: 127 RKGSELDSDSDSSDSSKGKKLVNKPKKRQKYKKATIQSPTSLTEEE 172
>gnl|CDD|192481 pfam10197, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved
region at the N-terminal end of a family of proteins
referred to as CIRs (CBF1-interacting co-repressors).
CBF1 (centromere-binding factor 1) acts as a
transcription factor that causes repression by binding
specifically to GTGGGAA motifs in responsive promoters,
and it requires CIR as a co-repressor. CIR binds to
histone deacetylase and to SAP30 and serves as a linker
between CBF1 and the histone deacetylase complex.
Length = 37
Score = 24.5 bits (54), Expect = 6.2
Identities = 9/19 (47%), Positives = 14/19 (73%)
Query: 13 EEEEKKEEEKEKEEEKKKK 31
E E+K EE++K EE +K+
Sbjct: 15 EAEQKALEEQKKIEELRKE 33
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase;
Provisional.
Length = 330
Score = 27.1 bits (61), Expect = 6.4
Identities = 8/25 (32%), Positives = 15/25 (60%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKK 31
E K + ++KKE+++ K + K K
Sbjct: 65 ELKAWHKAQKKKEKQEAKAAKAKSK 89
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 27.3 bits (61), Expect = 6.4
Identities = 13/29 (44%), Positives = 22/29 (75%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
+EEEE+++EE+E+EEE+ ++ EG L
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEGPPL 376
>gnl|CDD|191022 pfam04538, BEX, Brain expressed X-linked like family. This is a
family of transcription elongation factors which
includes those referred to as Bex proteins as well as
those named TCEAL7. Bex1 was shown to be a novel link
between neurotrophin signalling, the cell cycle, and
neuronal differentiation, suggesting it might function
by coordinating internal cellular states with the
ability of cells to respond to external signals. TCEAL7
has been shown negatively to regulate the NF-kappaB
pathway, hence being important in ovarian cancer as it
one of the genes frequently downregulated in this
cancer. A closely related protein, TFIIS/TCEA, found in
pfam07500 is involved in transcription elongation and
transcript fidelity. TFIIS/TCEA promotes 3'
endoribonuclease activity of RNA polymerase II (pol II)
and allows pol II to bypass transcript pause or
'arrest' during elongation process. It is thus possible
that BEX is also acting in this way.
Length = 97
Score = 26.2 bits (58), Expect = 6.5
Identities = 11/31 (35%), Positives = 17/31 (54%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
+E E K E + KEEE+K+ EG++
Sbjct: 4 PCKENEGKPESEPKEEEEKRPLEEGEGKKPE 34
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 27.3 bits (61), Expect = 6.5
Identities = 13/33 (39%), Positives = 17/33 (51%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
K E + EEK K+EEKK+K E E +
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKERI 680
Score = 26.9 bits (60), Expect = 8.3
Identities = 10/29 (34%), Positives = 15/29 (51%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
+EE EE+ + E E E K + +EG
Sbjct: 232 TEEREEETDVEIETTSETKGTKQEQEGST 260
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this
family (Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 26.1 bits (58), Expect = 6.7
Identities = 11/26 (42%), Positives = 19/26 (73%), Gaps = 2/26 (7%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKE 32
ERK EEE+ ++EEE ++ + K++E
Sbjct: 2 ERK--EEEKAQREEELKRLKNLKREE 25
Score = 26.1 bits (58), Expect = 6.9
Identities = 8/18 (44%), Positives = 14/18 (77%)
Query: 14 EEEKKEEEKEKEEEKKKK 31
+E K+EE+ ++EEE K+
Sbjct: 1 KERKEEEKAQREEELKRL 18
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 27.3 bits (60), Expect = 6.7
Identities = 11/35 (31%), Positives = 19/35 (54%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELG 41
++ + EE ++EEKE E E+++K EE
Sbjct: 137 QKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENN 171
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 27.3 bits (60), Expect = 6.7
Identities = 20/85 (23%), Positives = 38/85 (44%), Gaps = 6/85 (7%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVHGNKTTGIKKCLLHLL----HEGG 62
+ +I E+ E E+E E+ + + + + + N + HL+ +E
Sbjct: 372 KMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELENHFRAADEKNSHLVVGFRNERS 431
Query: 63 FLSLWRGNGINVLKIAPESALKFMA 87
+++ RGN I V K E +L+F A
Sbjct: 432 YVT--RGNSIGVFKNTDEGSLEFKA 454
>gnl|CDD|235963 PRK07203, PRK07203, putative chlorohydrolase/aminohydrolase;
Validated.
Length = 442
Score = 27.2 bits (61), Expect = 7.0
Identities = 11/28 (39%), Positives = 15/28 (53%), Gaps = 4/28 (14%)
Query: 113 SLAGGVSLNV--PDDFTAKEMLTGMWWR 138
LA G+ N+ P DF +L +WWR
Sbjct: 71 GLARGMMANIPPPPDFI--SILKNLWWR 96
>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal. The C-terminal domain of
FCP-1 is required for interaction with the carboxy
terminal domain of RAP74. Interaction relies extensively
on van der Waals contacts between hydrophobic residues
situated within alpha-helices in both domains.
Length = 263
Score = 26.9 bits (59), Expect = 7.2
Identities = 12/31 (38%), Positives = 18/31 (58%)
Query: 3 PIAIERKISEEEEEKKEEEKEKEEEKKKKEG 33
P +RK++EE+EE E E E + +EG
Sbjct: 213 PRGHKRKLNEEDEEDAESESSFESSNEDEEG 243
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 27.3 bits (60), Expect = 7.2
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKK 31
+ + EE K EE KE +E KKKK
Sbjct: 225 KADEDGEEPKVEEVKEGDEGKKKK 248
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 27.1 bits (60), Expect = 7.3
Identities = 9/28 (32%), Positives = 19/28 (67%)
Query: 8 RKISEEEEEKKEEEKEKEEEKKKKEGGR 35
+ +E+E KE++++KE E++KK+
Sbjct: 278 LRTERKEKEAKEQQEKKELEQRKKKKKE 305
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 26.1 bits (58), Expect = 7.3
Identities = 9/26 (34%), Positives = 15/26 (57%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKE 32
E+K E++ E ++ K EEK + E
Sbjct: 25 EKKQEEKKSEAEKLAKMSAEEKAEYE 50
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 26.6 bits (59), Expect = 7.4
Identities = 10/22 (45%), Positives = 14/22 (63%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKE 32
E E+E E K + +EKKK+E
Sbjct: 107 DETEQEDPPETKTESKEKKKRE 128
Score = 26.3 bits (58), Expect = 8.5
Identities = 8/22 (36%), Positives = 15/22 (68%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKE 32
S +E E+++ + K E K+KK+
Sbjct: 105 STDETEQEDPPETKTESKEKKK 126
>gnl|CDD|162945 TIGR02604, Piru_Ver_Nterm, putative membrane-bound dehydrogenase
domain. All proteins that score above the trusted
cutoff score of 45 to this model are large proteins of
either Pirellula sp. 1 or Verrucomicrobium spinosum.
These proteins all contain, in addition to this domain,
several hundred residues of highly variable sequence,
and then a well-conserved C-terminal domain (TIGR02603)
that features a putative cytochrome c-type heme binding
motif CXXCH. The membrane-bound L-sorbosone
dehydrogenase from Acetobacter liquefaciens
(Gluconacetobacter liquefaciens) (SP|Q44091) is
homologous to this domain but lacks additional sequence
regions shared by members of this family and belongs to
a different clade of the larger family of homologs. It
and its closely related homologs are excluded from the
this model by scoring between the trusted (45) and noise
(18) cutoffs.
Length = 367
Score = 27.0 bits (60), Expect = 7.5
Identities = 12/59 (20%), Positives = 17/59 (28%), Gaps = 1/59 (1%)
Query: 105 IYERFVAGSLAGGVSLNVPDDFTAKEMLTGMWWRHLVAGGVAGAVSRTCTAPLDRLKVY 163
G + D E+ TG W + G V+ TAP + Y
Sbjct: 214 AEGGRNGYQSFNGRRYDHADRGADHEVPTGEWRQDDRGVETVGDVAGGGTAPCG-IAFY 271
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 26.9 bits (60), Expect = 7.5
Identities = 7/29 (24%), Positives = 17/29 (58%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
++E+E+K+E KE E+ + + ++
Sbjct: 165 VDDEDEEKKEAKELEKLSDDDDFVWDEDD 193
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 26.8 bits (60), Expect = 7.6
Identities = 12/28 (42%), Positives = 18/28 (64%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEG 33
+E++I + EEEK+E EK E + K E
Sbjct: 125 LEQEIKKLEEEKEELEKRVAELEAKLEA 152
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 27.0 bits (59), Expect = 7.7
Identities = 9/25 (36%), Positives = 15/25 (60%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGRE 36
+E E+KEE+ E EE ++ E +
Sbjct: 317 DEIGEEKEEDDENEENERHTELLAD 341
>gnl|CDD|224236 COG1317, FliH, Flagellar biosynthesis/type III secretory pathway
protein [Cell motility and secretion / Intracellular
trafficking and secretion].
Length = 234
Score = 26.6 bits (59), Expect = 7.8
Identities = 11/38 (28%), Positives = 19/38 (50%)
Query: 5 AIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGV 42
+ +++ E EE E E+ + +EG EG+E G
Sbjct: 55 SAAQELQEGIEEGAREGYEEGFQLGYEEGFEEGQEEGR 92
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 26.1 bits (58), Expect = 8.2
Identities = 9/35 (25%), Positives = 17/35 (48%), Gaps = 2/35 (5%)
Query: 4 IAIERKISEEEEEKKEEEKEKEEEKK--KKEGGRE 36
R+ +E +KE+ K EKK +++ +E
Sbjct: 95 ALRLRRERTKERAEKEKRTRKNREKKFKRRQKEKE 129
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 26.5 bits (59), Expect = 8.3
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKK 31
E + + +E++ E EK+E KKKK
Sbjct: 172 EAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 26.9 bits (60), Expect = 8.5
Identities = 10/34 (29%), Positives = 21/34 (61%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
I+ K + ++K+EE+K ++ +KKKK + +
Sbjct: 375 IKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKR 408
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 26.5 bits (59), Expect = 8.9
Identities = 12/25 (48%), Positives = 19/25 (76%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKK 30
+ ++SEE+EE+ EE+E E EK+K
Sbjct: 149 SDEELSEEDEEEAAEEEEAEAEKEK 173
Score = 26.5 bits (59), Expect = 8.9
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 11 SEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
SE+EEEK E K+ +E+ ++ + EE
Sbjct: 132 SEDEEEKDEAAKKAKEDSDEELSEEDEEE 160
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM).
This family consists of several Plasmodium falciparum
SPAM (secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the
alanine heptad-repeat domain. Heptad repeats in which
the a and d position contain hydrophobic residues
generate amphipathic alpha-helices which give rise to
helical bundles or coiled-coil structures in proteins.
SPAM is an example of a P. falciparum antigen in which
a repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 26.4 bits (58), Expect = 8.9
Identities = 8/19 (42%), Positives = 14/19 (73%)
Query: 12 EEEEEKKEEEKEKEEEKKK 30
EEEEE +E+ + ++ +KK
Sbjct: 79 EEEEEDEEDNVDLKDIEKK 97
Score = 26.4 bits (58), Expect = 9.8
Identities = 9/26 (34%), Positives = 15/26 (57%)
Query: 7 ERKISEEEEEKKEEEKEKEEEKKKKE 32
+ + EE E +EEE+E EE+ +
Sbjct: 67 DIEDEEEIVEDEEEEEEDEEDNVDLK 92
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 26.9 bits (59), Expect = 8.9
Identities = 10/28 (35%), Positives = 22/28 (78%)
Query: 12 EEEEEKKEEEKEKEEEKKKKEGGREGEE 39
E++++++++E+E+EEE++ K E EE
Sbjct: 162 EDDDDEEDDEEEEEEEEEIKGFDDEDEE 189
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45;
Provisional.
Length = 193
Score = 26.6 bits (58), Expect = 8.9
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 3 PIAIERKISEEEEEKKEEEKEKEEEKKKKE 32
P I ++ ++ EE E+E+E EE+K K+E
Sbjct: 39 PEDIVEELEDQPEEPPEQEEENEEQKPKEE 68
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 27.0 bits (60), Expect = 9.0
Identities = 10/28 (35%), Positives = 16/28 (57%), Gaps = 1/28 (3%)
Query: 6 IERKIS-EEEEEKKEEEKEKEEEKKKKE 32
E K + EE E+ EE + +KKK++
Sbjct: 406 AEGKKAIEEIREELIEEGLLKSKKKKRK 433
Score = 26.6 bits (59), Expect = 9.7
Identities = 10/25 (40%), Positives = 18/25 (72%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKK 30
+E K+ ++E+E +E EK EE ++K
Sbjct: 298 LENKLEKQEDELEELEKAAEELRQK 322
>gnl|CDD|222222 pfam13561, adh_short_C2, Enoyl-(Acyl carrier protein) reductase.
Length = 239
Score = 26.4 bits (59), Expect = 9.1
Identities = 12/29 (41%), Positives = 18/29 (62%)
Query: 70 NGINVLKIAPESALKFMAYEQAKRLIRGN 98
G+ V K A ES +++AYE ++ IR N
Sbjct: 144 GGMGVAKAALESLARYLAYELGRKGIRVN 172
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 26.7 bits (59), Expect = 9.4
Identities = 9/35 (25%), Positives = 21/35 (60%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKKEGGREGEEL 40
+++K + E+E K+ EKE+ +++K+ E +
Sbjct: 92 LQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQ 126
>gnl|CDD|241121 cd12677, RRM4_Nop4p, RNA recognition motif 4 in yeast nucleolar
protein 4 (Nop4p) and similar proteins. This subgroup
corresponds to the RRM4 of Nop4p (also known as
Nop77p), encoded by YPL043W from Saccharomyces
cerevisiae. It is an essential nucleolar protein
involved in processing and maturation of 27S pre-rRNA
and biogenesis of 60S ribosomal subunits. Nop4p has
four RNA recognition motifs (RRMs), also termed RBDs
(RNA binding domains) or RNPs (ribonucleoprotein
domains). .
Length = 156
Score = 26.1 bits (57), Expect = 9.5
Identities = 9/25 (36%), Positives = 16/25 (64%)
Query: 10 ISEEEEEKKEEEKEKEEEKKKKEGG 34
+S+EEE + + + KE + KK + G
Sbjct: 44 LSKEEENRDKGHRYKEAQLKKGKSG 68
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 26.8 bits (59), Expect = 9.6
Identities = 9/26 (34%), Positives = 15/26 (57%)
Query: 6 IERKISEEEEEKKEEEKEKEEEKKKK 31
I+ + E++ E E E +EE+KK
Sbjct: 545 IDLDADDWTEDEDENEMETDEERKKP 570
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 25.6 bits (56), Expect = 9.7
Identities = 9/31 (29%), Positives = 16/31 (51%)
Query: 9 KISEEEEEKKEEEKEKEEEKKKKEGGREGEE 39
K +E+K E+ K + KKK ++ +E
Sbjct: 4 KSQLSKEKKAEKAKAGTAKDKKKWSKKKKKE 34
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 26.8 bits (59), Expect = 10.0
Identities = 8/43 (18%), Positives = 19/43 (44%)
Query: 1 MTPIAIERKISEEEEEKKEEEKEKEEEKKKKEGGREGEELGVH 43
M +E + + +E++ EE + + + E G + E+
Sbjct: 230 MINWNLEERRRDLRKEQELEENVERDSDDEDEHGEDSEDGETK 272
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.135 0.399
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,545,201
Number of extensions: 1004688
Number of successful extensions: 6336
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5054
Number of HSP's successfully gapped: 726
Length of query: 201
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 109
Effective length of database: 6,857,034
Effective search space: 747416706
Effective search space used: 747416706
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 56 (25.2 bits)