RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5944
(309 letters)
>gnl|CDD|215999 pfam00573, Ribosomal_L4, Ribosomal protein L4/L1 family. This
family includes Ribosomal L4/L1 from eukaryotes and
archaebacteria and L4 from eubacteria. L4 from yeast has
been shown to bind rRNA.
Length = 190
Score = 57.9 bits (141), Expect = 4e-10
Identities = 25/100 (25%), Positives = 46/100 (46%), Gaps = 5/100 (5%)
Query: 191 GLTSTLSIKLMQDDLHIVKDLVLPVDDPKYLENLVEE-RCWGPSVLFVDNTDYMERNISI 249
L S LS K + L +V D L + K L++ VL V + + N+ +
Sbjct: 93 ALRSALSAKAREGKLVVVDDFELEIPKTKDAVKLLKNLGLKNKKVLIVVDEK--DENLYL 150
Query: 250 VTDQIKHYNIMPAYGEN--NMLKHDTLVITVDALKQIEDK 287
+ +++ G N ++L D +VIT AL+++E++
Sbjct: 151 SARNLPGVDVVTVDGLNVYDLLPADKVVITKSALEKLEER 190
Score = 50.2 bits (121), Expect = 2e-07
Identities = 19/53 (35%), Positives = 26/53 (49%)
Query: 72 VNLHPDIFGLVPRLDVITMNIEWQKKYKWVRFDHTKVTNEVERTGRKPWPQKG 124
V L P +FG R D++ + Q + TK +EV G+KPW QKG
Sbjct: 1 VELPPAVFGAEIRPDLLHRAVVAQLANRRQGTASTKTRSEVSGGGKKPWRQKG 53
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 57.3 bits (138), Expect = 4e-09
Identities = 26/29 (89%), Positives = 27/29 (93%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G EE+EEEEEEEEEEEEEEEEEEEEEEE
Sbjct: 861 GDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 55.4 bits (133), Expect = 2e-08
Identities = 26/39 (66%), Positives = 27/39 (69%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
G + EEEEEEEEEEEEEEEEEEEEEEE P L
Sbjct: 858 SDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSL 896
Score = 50.0 bits (119), Expect = 9e-07
Identities = 22/29 (75%), Positives = 24/29 (82%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G + + EEEEEEEEEEEEEEEEEEEEE
Sbjct: 856 GGSDGGDSEEEEEEEEEEEEEEEEEEEEE 884
Score = 47.3 bits (112), Expect = 6e-06
Identities = 22/29 (75%), Positives = 24/29 (82%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G G + + EEEEEEEEEEEEEEEEEEEE
Sbjct: 855 GGGSDGGDSEEEEEEEEEEEEEEEEEEEE 883
Score = 45.4 bits (107), Expect = 3e-05
Identities = 21/29 (72%), Positives = 22/29 (75%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G G + EEEEEEEEEEEEEEEEEEE
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEEEE 882
Score = 38.1 bits (88), Expect = 0.006
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
KG + + + EEEEEEEEEEEEEE
Sbjct: 850 KGVDGGGGSDGGDSEEEEEEEEEEEEEE 877
Score = 31.9 bits (72), Expect = 0.52
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ E+ + + + EEEEEEEEEEE
Sbjct: 845 AKQDEKGVDGGGGSDGGDSEEEEEEEEEEE 874
Score = 28.4 bits (63), Expect = 6.5
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
KGE + E E E + E+E E E E +E
Sbjct: 675 TKGENESEGEIPAERKGEQEGEGEIEAKE 703
Score = 28.0 bits (62), Expect = 8.3
Identities = 12/28 (42%), Positives = 17/28 (60%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+GEE E+E E E E + E E E + +E
Sbjct: 736 EEGEEVEDEGEGEAEGKHEVETEGDRKE 763
>gnl|CDD|234414 TIGR03953, rplD_bact, 50S ribosomal protein L4,
bacterial/organelle. Members of this protein family are
ribosomal protein L4. This model recognizes bacterial
and most organellar forms, but excludes homologs from
the eukaryotic cytoplasm and from archaea [Protein
synthesis, Ribosomal proteins: synthesis and
modification].
Length = 188
Score = 51.7 bits (125), Expect = 5e-08
Identities = 30/104 (28%), Positives = 53/104 (50%), Gaps = 16/104 (15%)
Query: 191 GLTSTLSIKLMQDDLHIVKDLVLPVDDPK------YLENLVEERCWGPSVLFVDNTDYME 244
L S LS K + L +V D L ++PK L+NL VL V TD ++
Sbjct: 93 ALRSALSEKAREGKLVVVDDFDL--EEPKTKDLAKILKNLG----LDKKVLIV--TDEVD 144
Query: 245 RNISIVTDQIKHYNIMPAYGEN--NMLKHDTLVITVDALKQIED 286
N+ + + + ++PA G N ++L+ D +V+T A++++E+
Sbjct: 145 ENLYLSARNLPNVKVLPANGLNVYDLLRADKVVLTEAAVEKLEE 188
Score = 42.9 bits (102), Expect = 5e-05
Identities = 16/55 (29%), Positives = 28/55 (50%), Gaps = 4/55 (7%)
Query: 72 VNLHPDIFGLVPRLDVITMNIEWQ--KKYKWVRFDHTKVTNEVERTGRKPWPQKG 124
+ L+ +FG+ P ++ ++ Q + + TK +EV G+KPW QKG
Sbjct: 1 IELNDAVFGIEPNEHLLHQVVKAQLANRRQGTH--KTKTRSEVSGGGKKPWRQKG 53
>gnl|CDD|235404 PRK05319, rplD, 50S ribosomal protein L4; Provisional.
Length = 205
Score = 48.5 bits (117), Expect = 7e-07
Identities = 33/106 (31%), Positives = 58/106 (54%), Gaps = 17/106 (16%)
Query: 191 GLTSTLSIKLMQDDLHIVKDLVLPVDDPK------YLENLVEERCWGPSVLFVDNTDYME 244
L S LS K + L +V DL L + PK L+NL ++ VL V TD ++
Sbjct: 108 ALRSALSEKAREGRLVVVDDLSL--EAPKTKELAAKLKNLGLKK-----VLIV--TDEVD 158
Query: 245 RNISIVTDQIKHYNIMPAYGEN--NMLKHDTLVITVDALKQIEDKL 288
N+ + + + +++PA G N ++L+ D +V+T A+K++E++L
Sbjct: 159 ENLYLSARNLPNVDVLPAQGLNVYDLLRADKVVLTKAAVKKLEERL 204
Score = 43.2 bits (103), Expect = 5e-05
Identities = 18/57 (31%), Positives = 26/57 (45%), Gaps = 4/57 (7%)
Query: 70 GFVNLHPDIFGLVPRLDVITMNIEWQ--KKYKWVRFDHTKVTNEVERTGRKPWPQKG 124
G V L +FG+ P ++ + Q + TK +EV G+KPW QKG
Sbjct: 14 GEVELSDAVFGVEPNEALLHQVVVAQLANARQGTH--ATKTRSEVSGGGKKPWRQKG 68
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 45.3 bits (108), Expect = 2e-06
Identities = 17/28 (60%), Positives = 20/28 (71%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
EE++EEEEEEEE+EE EEE A L
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEEAAAGL 101
Score = 43.0 bits (102), Expect = 2e-05
Identities = 15/25 (60%), Positives = 18/25 (72%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
EE++EEEEEEEE+EE EEE
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 42.2 bits (100), Expect = 2e-05
Identities = 15/26 (57%), Positives = 18/26 (69%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
EE++EEEEEEEE+EE EEE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEESEEE 96
Score = 42.2 bits (100), Expect = 2e-05
Identities = 15/26 (57%), Positives = 18/26 (69%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE++EEEEEEEE+EE EEE
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 41.5 bits (98), Expect = 5e-05
Identities = 14/23 (60%), Positives = 17/23 (73%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
EE++EEEEEEEE+EE EE
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEE 95
Score = 40.3 bits (95), Expect = 1e-04
Identities = 14/25 (56%), Positives = 18/25 (72%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEE 165
E+++EEEEEEEE+EE EEE
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 39.2 bits (92), Expect = 3e-04
Identities = 13/23 (56%), Positives = 16/23 (69%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
EE++EEEEEEEE+EE E
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESE 94
Score = 36.9 bits (86), Expect = 0.002
Identities = 12/23 (52%), Positives = 15/23 (65%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
EE++EEEEEEEE+EE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEES 93
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 44.6 bits (106), Expect = 3e-06
Identities = 21/28 (75%), Positives = 21/28 (75%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
EEEEEEEEEEEEEEEE EEE A L
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEAMAGL 100
Score = 43.1 bits (102), Expect = 1e-05
Identities = 19/26 (73%), Positives = 19/26 (73%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
EEEEEEEEEEEEEEEE EEE
Sbjct: 70 AAAAAAEEEEEEEEEEEEEEEESEEE 95
Score = 41.6 bits (98), Expect = 4e-05
Identities = 18/23 (78%), Positives = 18/23 (78%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
EEEEEEEEEEEEEEEE EE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEE 94
Score = 40.8 bits (96), Expect = 8e-05
Identities = 18/24 (75%), Positives = 19/24 (79%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEE 163
E+EEEEEEEEEEEEEE EEE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEEE 95
Score = 40.8 bits (96), Expect = 8e-05
Identities = 18/25 (72%), Positives = 19/25 (76%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEE 164
+EEEEEEEEEEEEEEE EEE
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESEEE 95
Score = 39.3 bits (92), Expect = 2e-04
Identities = 17/23 (73%), Positives = 17/23 (73%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
EEEEEEEEEEEEEEEE E
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESE 93
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 47.3 bits (113), Expect = 6e-06
Identities = 27/45 (60%), Positives = 30/45 (66%)
Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
I+RL +RR R + EEEEEEEEE EEEEEEEEEEE T
Sbjct: 285 IDRLPLRRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERT 329
Score = 41.2 bits (97), Expect = 5e-04
Identities = 22/30 (73%), Positives = 24/30 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
EE+EEEE EEEEEEEEEEE EEE+ AT
Sbjct: 308 EEEEEEEVPEEEEEEEEEEERTFEEEVRAT 337
Score = 37.7 bits (88), Expect = 0.007
Identities = 19/32 (59%), Positives = 20/32 (62%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
EE+EEE EEEEEEEEEEE EEE
Sbjct: 309 EEEEEEVPEEEEEEEEEEERTFEEEVRATVAE 340
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 43.3 bits (102), Expect = 1e-05
Identities = 18/31 (58%), Positives = 24/31 (77%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
EE+ EE+EEE+++EEE+EEEEEE A L
Sbjct: 72 AAAEEKAEEKEEEKKKEEEKEEEEEEALAGL 102
Score = 40.9 bits (96), Expect = 8e-05
Identities = 17/25 (68%), Positives = 22/25 (88%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEE 165
EEK EE+EEE+++EEE+EEEEEE
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEE 97
Score = 39.0 bits (91), Expect = 3e-04
Identities = 15/29 (51%), Positives = 21/29 (72%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
EE+ EE+EEE+++EEE+EEEEE
Sbjct: 71 AAAAEEKAEEKEEEKKKEEEKEEEEEEAL 99
Score = 36.3 bits (84), Expect = 0.003
Identities = 13/24 (54%), Positives = 21/24 (87%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEE 163
+ + +E+EEE+++EEE+EEEEEE
Sbjct: 74 AEEKAEEKEEEKKKEEEKEEEEEE 97
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 45.4 bits (108), Expect = 2e-05
Identities = 20/39 (51%), Positives = 26/39 (66%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPC 176
+ +++EEE+E+EEEEEEEEE EE E EE L P
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPPLLTPI 380
Score = 43.9 bits (104), Expect = 6e-05
Identities = 20/33 (60%), Positives = 24/33 (72%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
R E+K+EEEE+E+EEEEEEEEE EE E
Sbjct: 338 RCTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEP 370
Score = 43.1 bits (102), Expect = 1e-04
Identities = 17/34 (50%), Positives = 23/34 (67%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
EE++E+EEEEEEEEE EE E EE + ++
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEGPPLLTPIS 381
Score = 33.9 bits (78), Expect = 0.098
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
EE+EEEEEEEE E +E EE + E +
Sbjct: 288 EEEEEEEEEEEPAERDELEENPDFEGLEV 316
Score = 33.9 bits (78), Expect = 0.10
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
+GEE+EEEEE E +EE+E EEEEEEE
Sbjct: 169 EGEEEEEEEEVGEADEEDEGEEEEEEEPEE 198
Score = 33.9 bits (78), Expect = 0.11
Identities = 20/39 (51%), Positives = 26/39 (66%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
G+ EE+EEE E +EE+E EEEEEEE EE+ G+
Sbjct: 170 GEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGTGV 208
Score = 32.3 bits (74), Expect = 0.33
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+G + EEEEEEEEEEE E +E EE
Sbjct: 282 RGLYQLEEEEEEEEEEEPAERDELEENP 309
Score = 32.0 bits (73), Expect = 0.39
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
Y + E +E EEEEEEEE E +EE+E EEE
Sbjct: 161 YVAEVELREGEEEEEEEEVGEADEEDEGEEE 191
Score = 32.0 bits (73), Expect = 0.43
Identities = 15/26 (57%), Positives = 18/26 (69%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+EEEEEEEEE E +E EE + E
Sbjct: 287 LEEEEEEEEEEEPAERDELEENPDFE 312
Score = 31.6 bits (72), Expect = 0.49
Identities = 15/38 (39%), Positives = 17/38 (44%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPV 184
E E E EEEEEEEE E +EE +P
Sbjct: 164 EVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPK 201
Score = 31.6 bits (72), Expect = 0.51
Identities = 18/26 (69%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E EEEEEEEE E +EE+E EEEEE
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEE 193
Score = 31.2 bits (71), Expect = 0.76
Identities = 16/33 (48%), Positives = 19/33 (57%)
Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
PR EE+EEEEEEE E +E EE + E
Sbjct: 281 PRGLYQLEEEEEEEEEEEPAERDELEENPDFEG 313
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 42.4 bits (100), Expect = 3e-05
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G E E +E EEEE+EEE EEE +++
Sbjct: 75 AGAEAAAEADEAEEEEKEEEAEEESDDD 102
Score = 41.2 bits (97), Expect = 6e-05
Identities = 13/30 (43%), Positives = 19/30 (63%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E E +E EEEE+EEE EEE +++ +
Sbjct: 77 AEAAAEADEAEEEEKEEEAEEESDDDMLFG 106
Score = 39.3 bits (92), Expect = 3e-04
Identities = 14/28 (50%), Positives = 16/28 (57%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
E E +E EEEE+EEE EEE L
Sbjct: 77 AEAAAEADEAEEEEKEEEAEEESDDDML 104
Score = 38.5 bits (90), Expect = 5e-04
Identities = 13/29 (44%), Positives = 15/29 (51%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E +E EEEE+EEE EEE
Sbjct: 70 AAAAAAGAEAAAEADEAEEEEKEEEAEEE 98
Score = 38.5 bits (90), Expect = 6e-04
Identities = 13/27 (48%), Positives = 18/27 (66%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEE 166
G E +E EEEE+EEE EEE +++
Sbjct: 76 GAEAAAEADEAEEEEKEEEAEEESDDD 102
Score = 34.7 bits (80), Expect = 0.013
Identities = 12/29 (41%), Positives = 14/29 (48%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G E E +E EEEE+EEE E
Sbjct: 68 GAAAAAAAGAEAAAEADEAEEEEKEEEAE 96
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 42.7 bits (101), Expect = 3e-05
Identities = 14/34 (41%), Positives = 27/34 (79%)
Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
+ + + E ++EE+EEEE++EE+++E+E EEE +
Sbjct: 107 VASEEDESDDDEEDEEEEDDEEDDDEDESEEEES 140
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 42.9 bits (102), Expect = 9e-05
Identities = 19/28 (67%), Positives = 19/28 (67%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
EEEEEEEEEEEEEE EEE A L
Sbjct: 298 AAAAEEEEEEEEEEEEEEPSEEEAAAGL 325
Score = 41.4 bits (98), Expect = 3e-04
Identities = 17/24 (70%), Positives = 18/24 (75%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEE 166
+ EEEEEEEEEEEEEE EEE
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 40.2 bits (95), Expect = 7e-04
Identities = 17/25 (68%), Positives = 18/25 (72%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
+ EEEEEEEEEEEEEE EEE
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEPSEEE 320
Score = 39.1 bits (92), Expect = 0.002
Identities = 16/23 (69%), Positives = 17/23 (73%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
+ EEEEEEEEEEEEEE EE
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPSEE 319
Score = 34.1 bits (79), Expect = 0.068
Identities = 14/19 (73%), Positives = 16/19 (84%)
Query: 141 KGEEKEEEEEEEEEEEEEE 159
+ EE+EEEEEEEEE EEE
Sbjct: 302 EEEEEEEEEEEEEEPSEEE 320
Score = 34.1 bits (79), Expect = 0.068
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E + + EEEEEEEEEEEEE
Sbjct: 289 KEVLSAQAQAAAAEEEEEEEEEEEEE 314
Score = 33.7 bits (78), Expect = 0.11
Identities = 14/21 (66%), Positives = 15/21 (71%)
Query: 148 EEEEEEEEEEEEEEEEEEEEE 168
+ EEEEEEEEEEEEEE
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEP 316
Score = 30.2 bits (69), Expect = 1.4
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E KE + + EEEEEEEEEEE
Sbjct: 287 ELKEVLSAQAQAAAAEEEEEEEEEEE 312
Score = 28.7 bits (65), Expect = 3.7
Identities = 13/27 (48%), Positives = 16/27 (59%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE +E + + EEEEEEEEEE
Sbjct: 285 DEELKEVLSAQAQAAAAEEEEEEEEEE 311
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 43.2 bits (102), Expect = 1e-04
Identities = 19/45 (42%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 125 LIERLDGKRRPRLYT-GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ KRR RL E E+E+EEEE+ +E EEEE E+ E
Sbjct: 350 KESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSE 394
Score = 29.3 bits (66), Expect = 3.0
Identities = 10/27 (37%), Positives = 14/27 (51%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E +EEE E+ EEE + E+ E
Sbjct: 383 DEHEEEEGEDSEEEGSQSREDGSSESS 409
Score = 28.5 bits (64), Expect = 4.9
Identities = 9/29 (31%), Positives = 15/29 (51%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EE+E E+ EEE + E+ E +
Sbjct: 383 DEHEEEEGEDSEEEGSQSREDGSSESSSD 411
Score = 27.8 bits (62), Expect = 9.4
Identities = 8/30 (26%), Positives = 13/30 (43%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ EE E+ EEE + E+ E +
Sbjct: 385 HEEEEGEDSEEEGSQSREDGSSESSSDVGS 414
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 42.4 bits (99), Expect = 1e-04
Identities = 19/32 (59%), Positives = 23/32 (71%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
K +E +E+EEEEEEEEEEE E EE E+ T
Sbjct: 131 KEDESDEDEEEEEEEEEEEAEVEENEQGTNGT 162
Score = 40.4 bits (94), Expect = 6e-04
Identities = 21/36 (58%), Positives = 24/36 (66%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
GK KE+E +E+EEEEEEEEEEE E EE N
Sbjct: 125 AGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTN 160
Score = 39.3 bits (91), Expect = 0.001
Identities = 17/30 (56%), Positives = 23/30 (76%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
G +K +E+E +E+EEEEEEEEEEE E+
Sbjct: 123 GNAGKKATKEDESDEDEEEEEEEEEEEAEV 152
Score = 35.8 bits (82), Expect = 0.018
Identities = 17/33 (51%), Positives = 24/33 (72%)
Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
P+ G++ +E+E +E+EEEEEEEEEEE E
Sbjct: 119 PKKAGNAGKKATKEDESDEDEEEEEEEEEEEAE 151
Score = 35.4 bits (81), Expect = 0.029
Identities = 17/33 (51%), Positives = 22/33 (66%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
+++EEEEEEEEEEE E EE E+ T+T
Sbjct: 132 EDESDEDEEEEEEEEEEEAEVEENEQGTNGTST 164
Score = 32.7 bits (74), Expect = 0.21
Identities = 21/72 (29%), Positives = 25/72 (34%), Gaps = 30/72 (41%)
Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEE------------------------------EEEE 160
GK+ + +E+EEEEEEEEE E EE E
Sbjct: 126 GKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGE 185
Query: 161 EEEEEEEEITAT 172
EE E E T
Sbjct: 186 EESVTEAEAEGT 197
Score = 32.0 bits (72), Expect = 0.36
Identities = 14/28 (50%), Positives = 20/28 (71%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K ++ +E+E +E+EEEEEEEEEE
Sbjct: 121 KAGNAGKKATKEDESDEDEEEEEEEEEE 148
Score = 31.6 bits (71), Expect = 0.47
Identities = 13/29 (44%), Positives = 17/29 (58%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G G+ EEE EEE EEE E+ + E+
Sbjct: 55 GDGDSSEEEGEEETSNEEENNEDSDGNED 83
Score = 30.0 bits (67), Expect = 1.3
Identities = 14/32 (43%), Positives = 18/32 (56%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
G E+E EEE EEE E+ + E+EE A
Sbjct: 57 GDSSEEEGEEETSNEEENNEDSDGNEDEEAEA 88
Score = 30.0 bits (67), Expect = 1.3
Identities = 13/33 (39%), Positives = 16/33 (48%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
R G + EE + + EEE EEE EEE
Sbjct: 41 RFSVQSGSDSSEENGDGDSSEEEGEEETSNEEE 73
Score = 28.5 bits (63), Expect = 4.3
Identities = 18/59 (30%), Positives = 19/59 (32%), Gaps = 30/59 (50%)
Query: 143 EEKEEEEEEEE------------------------------EEEEEEEEEEEEEEEITA 171
EE+EEEEEE E EE EEE E E E T
Sbjct: 141 EEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESVTEAEAEGTTV 199
Score = 28.1 bits (62), Expect = 5.2
Identities = 12/32 (37%), Positives = 16/32 (50%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
+EE EEE EEE E+ + E+E N
Sbjct: 59 SSEEEGEEETSNEEENNEDSDGNEDEEAEAEN 90
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 41.8 bits (98), Expect = 3e-04
Identities = 26/58 (44%), Positives = 31/58 (53%)
Query: 111 EVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E ER G P + + + EE+E EEEEEEEEEEEEEE+E EEE
Sbjct: 408 ERERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEE 465
Score = 39.9 bits (93), Expect = 0.001
Identities = 21/29 (72%), Positives = 25/29 (86%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EE+EEEEEEEEE+E EEEE E+EEEE
Sbjct: 445 VEEEEEEEEEEEEEEQESEEEEGEDEEEE 473
Score = 39.9 bits (93), Expect = 0.001
Identities = 22/32 (68%), Positives = 25/32 (78%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
EE+EEEEEEEE+E EEEE E+EEEEE N
Sbjct: 449 EEEEEEEEEEEQESEEEEGEDEEEEEEVEADN 480
Score = 36.8 bits (85), Expect = 0.013
Identities = 26/70 (37%), Positives = 34/70 (48%), Gaps = 8/70 (11%)
Query: 109 TNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGE------EKEEEEEEEEEEEEEEEEE 162
T E ER R+ ++ + + GE ++ EEEE EEEEEEEEEE
Sbjct: 398 TEEEERRKRQE--RERQGTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEE 455
Query: 163 EEEEEEITAT 172
EEEE+E
Sbjct: 456 EEEEQESEEE 465
Score = 36.4 bits (84), Expect = 0.018
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
+E EEEE E+EEEEEE E + EEE+
Sbjct: 460 QESEEEEGEDEEEEEEVEADNGSEEEMEG 488
Score = 34.9 bits (80), Expect = 0.048
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E++ EEEE E+EEEEEE E + EE
Sbjct: 459 EQESEEEEGEDEEEEEEVEADNGSEE 484
Score = 34.5 bits (79), Expect = 0.081
Identities = 16/28 (57%), Positives = 20/28 (71%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EE+E EEEE E+EEEEEE E + E
Sbjct: 456 EEEEQESEEEEGEDEEEEEEVEADNGSE 483
Score = 34.1 bits (78), Expect = 0.094
Identities = 16/27 (59%), Positives = 20/27 (74%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EE+E+E EEEE E+EEEEEE E +
Sbjct: 456 EEEEQESEEEEGEDEEEEEEVEADNGS 482
Score = 33.0 bits (75), Expect = 0.20
Identities = 15/27 (55%), Positives = 19/27 (70%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+ EE+E E+EEEEEE E + EEE E
Sbjct: 461 ESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 28.7 bits (64), Expect = 4.4
Identities = 17/52 (32%), Positives = 24/52 (46%), Gaps = 10/52 (19%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEE----------EEEEEEITATLNPCGLCRI 181
+GE++EEEEE E + EEE E E EE+ G+ R+
Sbjct: 465 EEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRM 516
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 38.8 bits (91), Expect = 3e-04
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEE 161
EEE++EEEEEEEE++
Sbjct: 60 AAAAAAAEEEKKEEEEEEEEDD 81
Score = 38.8 bits (91), Expect = 3e-04
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
EEE++EEEEEEEE++
Sbjct: 57 AAAAAAAAAAEEEKKEEEEEEEEDD 81
Score = 38.4 bits (90), Expect = 4e-04
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEE 162
EEE++EEEEEEEE++
Sbjct: 58 AAAAAAAAAEEEKKEEEEEEEEDD 81
Score = 38.4 bits (90), Expect = 4e-04
Identities = 11/23 (47%), Positives = 17/23 (73%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEI 169
EEE++EEEEEEEE++++
Sbjct: 61 AAAAAAEEEKKEEEEEEEEDDDM 83
Score = 37.6 bits (88), Expect = 6e-04
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEE 164
EEE++EEEEEEEE++
Sbjct: 57 AAAAAAAAAAEEEKKEEEEEEEEDD 81
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 41.1 bits (96), Expect = 4e-04
Identities = 19/37 (51%), Positives = 21/37 (56%)
Query: 133 RRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
R L T + E EE+EEEEEEEEEEEEI
Sbjct: 208 PRLGLRTPPNSSSSSGSDSESEEDEEEEEEEEEEEEI 244
Score = 36.1 bits (83), Expect = 0.019
Identities = 16/25 (64%), Positives = 20/25 (80%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEIT 170
+ E EE+EEEEEEEEEEEE + +T
Sbjct: 224 SDSESEEDEEEEEEEEEEEEIDVVT 248
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 41.6 bits (97), Expect = 4e-04
Identities = 15/27 (55%), Positives = 26/27 (96%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
++ +E+E+E++++EE++EEEEEEEEEI
Sbjct: 154 DDDDEDEDEDDDDEEDDEEEEEEEEEI 180
Score = 41.2 bits (96), Expect = 7e-04
Identities = 13/25 (52%), Positives = 24/25 (96%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
++++E+E++++EE++EEEEEEEEE
Sbjct: 155 DDDEDEDEDDDDEEDDEEEEEEEEE 179
Score = 40.8 bits (95), Expect = 8e-04
Identities = 16/33 (48%), Positives = 26/33 (78%)
Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
PR ++ E+E+E++++EE++EEEEEEEEE
Sbjct: 147 PRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179
Score = 38.9 bits (90), Expect = 0.004
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEE 165
D R +E E+E++++EE++EEEEEEEEE
Sbjct: 144 DMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEE 179
Score = 38.1 bits (88), Expect = 0.006
Identities = 19/52 (36%), Positives = 31/52 (59%), Gaps = 6/52 (11%)
Query: 123 KGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEE------EEEE 168
+ L E R + E+++E++++EE++EEEEEEEEE E+EE
Sbjct: 138 RHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEE 189
Score = 36.6 bits (84), Expect = 0.018
Identities = 10/31 (32%), Positives = 23/31 (74%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
++++E+E+E++++EE++EEEEE +
Sbjct: 154 DDDDEDEDEDDDDEEDDEEEEEEEEEIKGFD 184
Score = 29.2 bits (65), Expect = 3.4
Identities = 14/26 (53%), Positives = 23/26 (88%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E+++EEEEEEEEE + ++E+EE+E
Sbjct: 166 DEEDDEEEEEEEEEIKGFDDEDEEDE 191
Score = 28.9 bits (64), Expect = 5.1
Identities = 8/37 (21%), Positives = 23/37 (62%)
Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+RR R + + ++++E+E+E++++EE++
Sbjct: 134 RRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDD 170
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 39.4 bits (92), Expect = 0.001
Identities = 13/28 (46%), Positives = 26/28 (92%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ +EK+EEEEEEEEEE+E+ ++++++++
Sbjct: 170 EEDEKDEEEEEEEEEEDEDFDDDDDDDD 197
Score = 39.0 bits (91), Expect = 0.001
Identities = 12/28 (42%), Positives = 25/28 (89%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E++++EEEEEEEEEE+E+ +++++++
Sbjct: 169 DEEDEKDEEEEEEEEEEDEDFDDDDDDD 196
Score = 37.8 bits (88), Expect = 0.003
Identities = 15/27 (55%), Positives = 24/27 (88%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ +EE+E++EEEEEEEEEE+E+ ++
Sbjct: 165 AEDVDEEDEKDEEEEEEEEEEDEDFDD 191
Score = 37.4 bits (87), Expect = 0.004
Identities = 12/28 (42%), Positives = 24/28 (85%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+ E+ EEEEEEEEEE+E+ ++++++++
Sbjct: 170 EEDEKDEEEEEEEEEEDEDFDDDDDDDD 197
Score = 37.0 bits (86), Expect = 0.006
Identities = 17/34 (50%), Positives = 25/34 (73%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+L + + KE E E+ +EE+E++EEEEEEEEE
Sbjct: 152 KLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEE 185
Score = 36.7 bits (85), Expect = 0.007
Identities = 13/26 (50%), Positives = 24/26 (92%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
++E+E++EEEEEEEEEE+E+ ++++
Sbjct: 168 VDEEDEKDEEEEEEEEEEDEDFDDDD 193
Score = 36.7 bits (85), Expect = 0.007
Identities = 14/26 (53%), Positives = 23/26 (88%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EE+E++EEEEEEEEEE+E+ +++
Sbjct: 167 DVDEEDEKDEEEEEEEEEEDEDFDDD 192
Score = 36.7 bits (85), Expect = 0.009
Identities = 17/28 (60%), Positives = 24/28 (85%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K E E+ +EE+E++EEEEEEEEEE+E+
Sbjct: 161 KELEAEDVDEEDEKDEEEEEEEEEEDED 188
Score = 35.1 bits (81), Expect = 0.023
Identities = 17/28 (60%), Positives = 24/28 (85%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K +E E E+ +EE+E++EEEEEEEEEE+
Sbjct: 159 KLKELEAEDVDEEDEKDEEEEEEEEEED 186
Score = 34.0 bits (78), Expect = 0.072
Identities = 9/25 (36%), Positives = 20/25 (80%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEE 163
EE+EEEEEE+E+ ++++++++
Sbjct: 173 EKDEEEEEEEEEEDEDFDDDDDDDD 197
Score = 31.3 bits (71), Expect = 0.42
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K E++ +E E E+ +EE+E++EEEE
Sbjct: 150 DEKLSMLEKKLKELEAEDVDEEDEKDEEEE 179
Score = 30.5 bits (69), Expect = 0.88
Identities = 12/38 (31%), Positives = 24/38 (63%)
Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
KR+ L+T + E+ +E+ E++ +E E E+ +EE+
Sbjct: 135 FKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEED 172
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 38.7 bits (90), Expect = 0.001
Identities = 19/38 (50%), Positives = 23/38 (60%), Gaps = 4/38 (10%)
Query: 142 GEEKEEEEEEEEEEE----EEEEEEEEEEEEITATLNP 175
EE EE+EEE E+ E EE+EE EE EEE A+
Sbjct: 28 AEEVEEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKA 65
Score = 31.8 bits (72), Expect = 0.26
Identities = 16/46 (34%), Positives = 21/46 (45%)
Query: 119 PWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
P P K E D + + E E+ +EE+EE EE EEE
Sbjct: 14 PPPTKDEEEDWDSQAEEVEEDEEEMEDWEDSLDEEDEEAEEVEEET 59
Score = 29.5 bits (66), Expect = 1.7
Identities = 12/25 (48%), Positives = 18/25 (72%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EEE+ + + EE EE+EEE E+ E
Sbjct: 18 KDEEEDWDSQAEEVEEDEEEMEDWE 42
Score = 29.1 bits (65), Expect = 2.2
Identities = 11/26 (42%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+++EE+ + + EE EE+EEE E+ E+
Sbjct: 18 KDEEEDWDSQAEEVEEDEEEMEDWED 43
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 39.4 bits (92), Expect = 0.002
Identities = 20/45 (44%), Positives = 22/45 (48%)
Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
L E L K P + EE+EEEEEEEEE E E EE
Sbjct: 386 LREFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGF 430
Score = 38.6 bits (90), Expect = 0.003
Identities = 17/28 (60%), Positives = 17/28 (60%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
EEEEEEEEEEEEE E E EE
Sbjct: 403 EEEEEEEEEEEEEAAEAEAPMEEPVPGF 430
Score = 34.4 bits (79), Expect = 0.076
Identities = 16/25 (64%), Positives = 16/25 (64%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEITA 171
E EEEEEEEEEEEEE E E
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPM 423
Score = 32.9 bits (75), Expect = 0.20
Identities = 16/25 (64%), Positives = 17/25 (68%)
Query: 149 EEEEEEEEEEEEEEEEEEEEITATL 173
E EEEEEEEEEEEEE E A +
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPM 423
Score = 30.6 bits (69), Expect = 1.3
Identities = 14/35 (40%), Positives = 16/35 (45%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
EE+EEEEEE E E EE E + G
Sbjct: 407 EEEEEEEEEAAEAEAPMEEPVPGFEVPEMPMPAAG 441
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 36.5 bits (84), Expect = 0.002
Identities = 23/58 (39%), Positives = 30/58 (51%)
Query: 111 EVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ K P G +G + + EE +E++EEEE E EEEE EEEEE E
Sbjct: 23 VEEKENGKNAPANGNENEENGAQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEEETE 80
Score = 33.4 bits (76), Expect = 0.032
Identities = 19/31 (61%), Positives = 23/31 (74%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
EE+ +E++EEEE E EEEE EEEEE AT
Sbjct: 54 EEEVDEDDEEEEGEGEEEEGEEEEETEGATG 84
Score = 27.2 bits (60), Expect = 5.4
Identities = 18/38 (47%), Positives = 21/38 (55%), Gaps = 6/38 (15%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEE------EEEEEITAT 172
EE+E E EEEE EEEEE E E+EE+ T
Sbjct: 60 DDEEEEGEGEEEEGEEEEETEGATGKRAAEDEEDDAET 97
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 37.3 bits (87), Expect = 0.003
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
R G ++++EEEEEEE E EE +EEE+ +E +
Sbjct: 91 RKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEK 126
Score = 35.3 bits (82), Expect = 0.012
Identities = 17/34 (50%), Positives = 23/34 (67%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
K +E+EEEEE E EE +EEE+ +E E+ A L
Sbjct: 99 KEKEEEEEEEVEVEELDEEEQIDELLEKELAKLK 132
Score = 34.2 bits (79), Expect = 0.037
Identities = 15/31 (48%), Positives = 22/31 (70%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ EE+EEEE E EE +EEE+ +E E+E+
Sbjct: 98 KKEKEEEEEEEVEVEELDEEEQIDELLEKEL 128
Score = 30.7 bits (70), Expect = 0.57
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+ EE E EE +EEE+ +E E+E +
Sbjct: 103 EEEEEEVEVEELDEEEQIDELLEKELAKL 131
Score = 29.2 bits (66), Expect = 1.8
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 129 LDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
LD K + + E +E +EEE+ +E E+E + + E+
Sbjct: 96 LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREK 135
Score = 26.9 bits (60), Expect = 8.6
Identities = 8/26 (30%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ +EEE+ +E E+E + + E+
Sbjct: 111 VEELDEEEQIDELLEKELAKLKREKR 136
Score = 26.9 bits (60), Expect = 9.3
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE +EEE+ +E E+E + + E+
Sbjct: 112 EELDEEEQIDELLEKELAKLKREKRR 137
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 38.1 bits (89), Expect = 0.005
Identities = 17/27 (62%), Positives = 18/27 (66%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEITA 171
EEEEEEEEEEEEEEE E + A
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPA 429
Score = 37.4 bits (87), Expect = 0.008
Identities = 17/24 (70%), Positives = 18/24 (75%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEE 167
E+ EEEEEEEEEEEEEEE E
Sbjct: 400 ERWAAEEEEEEEEEEEEEEEPVAE 423
Score = 36.2 bits (84), Expect = 0.016
Identities = 20/42 (47%), Positives = 23/42 (54%), Gaps = 3/42 (7%)
Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
L + L K P + + EEEEEEEEEEEEEEE E
Sbjct: 387 LKKFLKEKEHPVV---ERWAAEEEEEEEEEEEEEEEPVAEVM 425
Score = 35.4 bits (82), Expect = 0.031
Identities = 15/22 (68%), Positives = 16/22 (72%)
Query: 151 EEEEEEEEEEEEEEEEEEITAT 172
EEEEEEEEEEEEEEE +
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEV 424
Score = 34.7 bits (80), Expect = 0.053
Identities = 15/27 (55%), Positives = 16/27 (59%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+EEEEEEEEEEEEEE E
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPA 429
Score = 33.9 bits (78), Expect = 0.11
Identities = 14/23 (60%), Positives = 14/23 (60%)
Query: 153 EEEEEEEEEEEEEEEEITATLNP 175
EEEEEEEEEEEEEE
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVM 425
Score = 31.6 bits (72), Expect = 0.50
Identities = 17/33 (51%), Positives = 18/33 (54%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+EKE E EEEEEEEEEEEEE
Sbjct: 392 KEKEHPVVERWAAEEEEEEEEEEEEEEEPVAEV 424
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 37.4 bits (87), Expect = 0.005
Identities = 17/31 (54%), Positives = 20/31 (64%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
KE EE+E EEEE+ EEE+ EE T NP
Sbjct: 156 KERAEEKEREEEEKAAEEEKAREEEILTGNP 186
Score = 27.8 bits (62), Expect = 7.8
Identities = 3/31 (9%), Positives = 15/31 (48%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
+ + + +++ ++++ E+E + L
Sbjct: 121 DSDSSSDSDSSDDDSDDDDSEDETAALLREL 151
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 35.9 bits (83), Expect = 0.006
Identities = 12/26 (46%), Positives = 20/26 (76%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEE 165
G G ++EEEEEEEE E++++ +E +
Sbjct: 90 GTGHTRQEEEEEEEENEKQQQSDEAQ 115
Score = 34.4 bits (79), Expect = 0.023
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
T G +EEEEEEEE E++++ +E +
Sbjct: 87 TTSGTGHTRQEEEEEEEENEKQQQSDEAQ 115
Score = 32.1 bits (73), Expect = 0.14
Identities = 10/23 (43%), Positives = 18/23 (78%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEI 169
+EEEEEEEE E++++ +E ++
Sbjct: 94 TRQEEEEEEEENEKQQQSDEAQV 116
Score = 32.1 bits (73), Expect = 0.15
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
+EEEEEEEE E++++ +E
Sbjct: 92 GHTRQEEEEEEEENEKQQQSDEA 114
Score = 29.4 bits (66), Expect = 1.1
Identities = 11/33 (33%), Positives = 23/33 (69%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
T + EE+EEEE E++++ +E + ++ ++ E A
Sbjct: 94 TRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPA 126
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 35.5 bits (81), Expect = 0.006
Identities = 12/34 (35%), Positives = 27/34 (79%)
Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
P L+ +GEE + + +EEE++++E+EE+++E++
Sbjct: 1 PTLHFLEGEEDSDSDSDEEEDDDDEDEEDDDEDD 34
Score = 33.2 bits (75), Expect = 0.032
Identities = 8/27 (29%), Positives = 23/27 (85%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ + + +EEE++++E+EE+++E+++
Sbjct: 9 EEDSDSDSDEEEDDDDEDEEDDDEDDD 35
Score = 32.0 bits (72), Expect = 0.091
Identities = 8/24 (33%), Positives = 24/24 (100%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEE 166
+E+E++++E+EE+++E+++E+++E
Sbjct: 17 DEEEDDDDEDEEDDDEDDDEDDDE 40
Score = 32.0 bits (72), Expect = 0.091
Identities = 8/24 (33%), Positives = 24/24 (100%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEE 167
++EE++++E+EE+++E+++E+++E
Sbjct: 17 DEEEDDDDEDEEDDDEDDDEDDDE 40
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 37.1 bits (87), Expect = 0.006
Identities = 19/29 (65%), Positives = 20/29 (68%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G+ E EEEEEE EEEEEEEEEEE
Sbjct: 224 GRQGRLAEAAEEEEEEAEEEEEEEEEEEA 252
Score = 35.5 bits (83), Expect = 0.023
Identities = 20/37 (54%), Positives = 25/37 (67%)
Query: 129 LDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEE 165
L+G++ + EE+E EEEEEEEEEEE EE E E
Sbjct: 222 LEGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 34.8 bits (81), Expect = 0.040
Identities = 20/27 (74%), Positives = 21/27 (77%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
EE+EEE EEEEEEEEEEE EE E E
Sbjct: 232 AAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 34.4 bits (80), Expect = 0.050
Identities = 21/29 (72%), Positives = 22/29 (75%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G+ E EEEEEE EEEEEEEEEEE EE
Sbjct: 227 GRLAEAAEEEEEEAEEEEEEEEEEEAEEA 255
Score = 33.6 bits (78), Expect = 0.083
Identities = 19/26 (73%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+EEEEE EEEEEEEEEEE EE E
Sbjct: 232 AAEEEEEEAEEEEEEEEEEEAEEAEA 257
>gnl|CDD|223166 COG0088, RplD, Ribosomal protein L4 [Translation, ribosomal
structure and biogenesis].
Length = 214
Score = 36.8 bits (86), Expect = 0.006
Identities = 22/65 (33%), Positives = 28/65 (43%), Gaps = 1/65 (1%)
Query: 72 VNLHPDIFGLVPRLDVITMNIEWQKKYKWVRFDHTKVTNEVERTGRKPWPQKGL-IERLD 130
V L P++FG D+I + Q + TK EV G+KPW QKG R
Sbjct: 17 VELLPEVFGPERNEDLIHRAVLAQLANRRQGTHSTKTRGEVSGGGKKPWGQKGTGRARQG 76
Query: 131 GKRRP 135
R P
Sbjct: 77 SIRSP 81
Score = 34.1 bits (79), Expect = 0.052
Identities = 28/107 (26%), Positives = 45/107 (42%), Gaps = 15/107 (14%)
Query: 191 GLTSTLSIKLMQDDLHIVKDLVLPVDDPK------YLENLVEERCWGPSVLFVDNTDYME 244
L S LS K L +V+ V D PK +L+ L + +L V +
Sbjct: 110 ALRSALSAKARAGKLVVVRGHVF-EDAPKTKELVEFLKKLGLDV---KRLLIV--KGERD 163
Query: 245 RNISIVTDQIKHYNIMPAYGENN---MLKHDTLVITVDALKQIEDKL 288
N + +K+ ++ G +L+ D LVIT A +IE++L
Sbjct: 164 GNGKLSARNLKNVKVVLVVGGLPVVDVLRADKLVITKLAPGKIEERL 210
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 36.6 bits (85), Expect = 0.006
Identities = 18/43 (41%), Positives = 28/43 (65%)
Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
ER + K+ G+ +E+EE EEE E+++EEEE +E EE+
Sbjct: 27 EREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQAR 69
Score = 36.2 bits (84), Expect = 0.010
Identities = 21/52 (40%), Positives = 30/52 (57%)
Query: 117 RKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ Q+ E +R+ +GE KEEEE EEE E+++EEEE +E EE
Sbjct: 15 QARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREE 66
Score = 33.5 bits (77), Expect = 0.065
Identities = 19/60 (31%), Positives = 30/60 (50%)
Query: 111 EVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
E ER RK +K ER + + K EE+E +E EE+ +E+EE E+ + +
Sbjct: 25 EEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVV 84
>gnl|CDD|182840 PRK10921, PRK10921, twin-arginine protein translocation system
subunit TatC; Provisional.
Length = 258
Score = 37.0 bits (86), Expect = 0.008
Identities = 13/25 (52%), Positives = 16/25 (64%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEE 160
R Y GKG +EEE + E E E+ EE
Sbjct: 234 RFYVGKGRRREEENDAEAESEKTEE 258
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 34.5 bits (80), Expect = 0.008
Identities = 14/27 (51%), Positives = 21/27 (77%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E EEEEE E+ E E+EE+EEE++++
Sbjct: 45 DAEWEEEEEGEDLESEDEEDEEEDDDD 71
Score = 32.2 bits (74), Expect = 0.048
Identities = 12/26 (46%), Positives = 22/26 (84%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+EE E+ E E+EE+EEE+++++ +
Sbjct: 49 EEEEEGEDLESEDEEDEEEDDDDDMD 74
Score = 28.3 bits (64), Expect = 0.95
Identities = 9/21 (42%), Positives = 18/21 (85%)
Query: 142 GEEKEEEEEEEEEEEEEEEEE 162
GE+ E E+EE+EEE+++++ +
Sbjct: 54 GEDLESEDEEDEEEDDDDDMD 74
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 36.9 bits (86), Expect = 0.008
Identities = 15/39 (38%), Positives = 26/39 (66%)
Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ L + EE+E++ E E+E+EE+E+EEEEE++
Sbjct: 276 EEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDD 314
Score = 35.0 bits (81), Expect = 0.037
Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 8/47 (17%)
Query: 132 KRRPRLYTGKGEE--------KEEEEEEEEEEEEEEEEEEEEEEEIT 170
K P + G E+ +EEEEE++ E E+E+EE+E+EEE
Sbjct: 266 KGEPEVVGGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEE 312
Score = 34.6 bits (80), Expect = 0.044
Identities = 17/39 (43%), Positives = 26/39 (66%)
Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
D + L EE+EE++ E E+E+EE+E+EEEEE+
Sbjct: 275 DEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEED 313
Score = 34.6 bits (80), Expect = 0.055
Identities = 14/26 (53%), Positives = 22/26 (84%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE + E E+E+EE+E+EEEEE+++E
Sbjct: 291 EEDDYSESEDEDEEDEDEEEEEDDDE 316
Score = 34.3 bits (79), Expect = 0.060
Identities = 13/27 (48%), Positives = 21/27 (77%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ E E+E+EE+E+EEEEE+++E
Sbjct: 291 EEDDYSESEDEDEEDEDEEEEEDDDEG 317
Score = 33.1 bits (76), Expect = 0.15
Identities = 12/29 (41%), Positives = 23/29 (79%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ + E E+E+EE+E+EEEEE+++E ++
Sbjct: 291 EEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 28.5 bits (64), Expect = 4.1
Identities = 10/25 (40%), Positives = 21/25 (84%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEE 163
+ E+++EE+E+EEEEE+++E ++
Sbjct: 295 YSESEDEDEEDEDEEEEEDDDEGDK 319
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 33.5 bits (77), Expect = 0.009
Identities = 14/23 (60%), Positives = 18/23 (78%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEE 165
EEK E+EEE E+EEE EE+ E +
Sbjct: 23 EEKREDEEENEDEEEGEEQSEVK 45
Score = 32.3 bits (74), Expect = 0.024
Identities = 14/23 (60%), Positives = 18/23 (78%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
EEE+ E+EEE E+EEE EE+ E
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEV 44
Score = 31.9 bits (73), Expect = 0.034
Identities = 13/24 (54%), Positives = 19/24 (79%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEE 166
EE++ E+EEE E+EEE EE+ E +
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVK 45
Score = 31.5 bits (72), Expect = 0.044
Identities = 13/24 (54%), Positives = 19/24 (79%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEE 168
+EE+ E+EEE E+EEE EE+ E +
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVK 45
Score = 31.2 bits (71), Expect = 0.056
Identities = 13/24 (54%), Positives = 19/24 (79%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEE 167
E+E+ E+EEE E+EEE EE+ E +
Sbjct: 22 EEEKREDEEENEDEEEGEEQSEVK 45
Score = 26.5 bits (59), Expect = 2.6
Identities = 12/19 (63%), Positives = 15/19 (78%)
Query: 151 EEEEEEEEEEEEEEEEEEI 169
EEE+ E+EEE E+EEE E
Sbjct: 22 EEEKREDEEENEDEEEGEE 40
Score = 26.5 bits (59), Expect = 2.6
Identities = 11/21 (52%), Positives = 16/21 (76%)
Query: 141 KGEEKEEEEEEEEEEEEEEEE 161
K E++EE E+EEE EE+ E +
Sbjct: 25 KREDEEENEDEEEGEEQSEVK 45
Score = 25.4 bits (56), Expect = 5.9
Identities = 11/19 (57%), Positives = 14/19 (73%)
Query: 152 EEEEEEEEEEEEEEEEEIT 170
EEE+ E+EEE E+EEE
Sbjct: 22 EEEKREDEEENEDEEEGEE 40
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 36.7 bits (85), Expect = 0.009
Identities = 18/29 (62%), Positives = 19/29 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E EEEE EE EE+EEEE EEE EE
Sbjct: 76 SEDGEEEEVEEGEEDEEEEGEEESEEFEP 104
Score = 36.0 bits (83), Expect = 0.021
Identities = 18/26 (69%), Positives = 20/26 (76%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
E+ EEEE EE EE+EEEE EEE EE
Sbjct: 76 SEDGEEEEVEEGEEDEEEEGEEESEE 101
Score = 35.6 bits (82), Expect = 0.027
Identities = 17/30 (56%), Positives = 21/30 (70%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ E+EE EE EE+EEEE EEE EE E +
Sbjct: 76 SEDGEEEEVEEGEEDEEEEGEEESEEFEPL 105
Score = 34.8 bits (80), Expect = 0.047
Identities = 22/50 (44%), Positives = 28/50 (56%), Gaps = 5/50 (10%)
Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE-----EEITA 171
E +G+ R+ EE+ EE EE+EEEE EEE EE E EE+T
Sbjct: 65 EDDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTE 114
Score = 33.3 bits (76), Expect = 0.13
Identities = 19/38 (50%), Positives = 24/38 (63%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
G+ E + E+ EEEE EE EE+EEEE EE + P G
Sbjct: 69 GRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLG 106
Score = 32.1 bits (73), Expect = 0.32
Identities = 18/36 (50%), Positives = 21/36 (58%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
G+ EE EE EE+EEEE EEE EE E + L
Sbjct: 78 DGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELT 113
Score = 28.6 bits (64), Expect = 3.6
Identities = 14/31 (45%), Positives = 18/31 (58%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
G+ +E E E+ EEEE EE EE+EE
Sbjct: 64 GEDDEGRIEVRISEDGEEEEVEEGEEDEEEE 94
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 37.1 bits (86), Expect = 0.010
Identities = 16/27 (59%), Positives = 23/27 (85%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+ EEK+EEEE+ ++EEE +EEEE+EE
Sbjct: 43 EKEEKKEEEEKTTDKEEEVDEEEEKEE 69
Score = 36.3 bits (84), Expect = 0.018
Identities = 16/26 (61%), Positives = 23/26 (88%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EEKEE++EEEE+ ++EEE +EEEE+
Sbjct: 42 EEKEEKKEEEEKTTDKEEEVDEEEEK 67
Score = 35.9 bits (83), Expect = 0.021
Identities = 17/35 (48%), Positives = 24/35 (68%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
L+ K EKE +EEEEEE+EE++EEEE+ +
Sbjct: 24 YLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKE 58
Score = 35.9 bits (83), Expect = 0.026
Identities = 15/26 (57%), Positives = 23/26 (88%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE++EE++EEEE+ ++EEE +EEEE
Sbjct: 41 EEEKEEKKEEEEKTTDKEEEVDEEEE 66
Score = 35.5 bits (82), Expect = 0.034
Identities = 14/26 (53%), Positives = 23/26 (88%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E++E++EEEE+ ++EEE +EEEE+E
Sbjct: 43 EKEEKKEEEEKTTDKEEEVDEEEEKE 68
Score = 35.1 bits (81), Expect = 0.037
Identities = 15/26 (57%), Positives = 23/26 (88%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+E+EE++EEEE+ ++EEE +EEE
Sbjct: 40 EEEEKEEKKEEEEKTTDKEEEVDEEE 65
Score = 35.1 bits (81), Expect = 0.043
Identities = 14/26 (53%), Positives = 23/26 (88%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E+++EEEE+ ++EEE +EEEE+EE
Sbjct: 44 KEEKKEEEEKTTDKEEEVDEEEEKEE 69
Score = 35.1 bits (81), Expect = 0.046
Identities = 18/28 (64%), Positives = 23/28 (82%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITA 171
EKE E+E +EEEEEE+EE++EEEE T
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTT 55
Score = 34.7 bits (80), Expect = 0.058
Identities = 15/26 (57%), Positives = 23/26 (88%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+EE+EE++EEEE+ ++EEE +EE
Sbjct: 39 EEEEEKEEKKEEEEKTTDKEEEVDEE 64
Score = 34.4 bits (79), Expect = 0.064
Identities = 15/26 (57%), Positives = 23/26 (88%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+EEE+EE++EEEE+ ++EEE +E
Sbjct: 38 EEEEEEKEEKKEEEEKTTDKEEEVDE 63
Score = 32.8 bits (75), Expect = 0.21
Identities = 14/26 (53%), Positives = 23/26 (88%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E+EEEE+EE++EEEE+ ++EEE +
Sbjct: 37 DEEEEEEKEEKKEEEEKTTDKEEEVD 62
Score = 32.0 bits (73), Expect = 0.39
Identities = 13/33 (39%), Positives = 26/33 (78%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
+ +++EEE+ ++EEE +EEEE+EE+++ T +
Sbjct: 45 EEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKV 77
Score = 30.1 bits (68), Expect = 1.8
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEE 162
K EE++ ++EEE +EEEE+EE
Sbjct: 47 KKEEEEKTTDKEEEVDEEEEKEE 69
Score = 29.7 bits (67), Expect = 2.1
Identities = 13/30 (43%), Positives = 24/30 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
EEK ++EEE +EEEE+EE++++ +++ T
Sbjct: 51 EEKTTDKEEEVDEEEEKEEKKKKTKKVKET 80
Score = 29.4 bits (66), Expect = 2.6
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ EE++EE E E E EE+ E+ + EE+
Sbjct: 500 EVEEEDEEAEVETTEPAEEDAEDSKMEEV 528
Score = 29.0 bits (65), Expect = 3.3
Identities = 13/27 (48%), Positives = 16/27 (59%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
+E+EE EEE+EE E E E EE
Sbjct: 496 DEDEEVEEEDEEAEVETTEPAEEDAED 522
Score = 29.0 bits (65), Expect = 3.4
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E++E EEE+EE E E E EE+ E+
Sbjct: 497 EDEEVEEEDEEAEVETTEPAEEDAED 522
Score = 28.6 bits (64), Expect = 4.8
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E EE EEE+EE E E E EE+ E
Sbjct: 496 DEDEEVEEEDEEAEVETTEPAEEDAE 521
Score = 27.8 bits (62), Expect = 9.6
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE EEE+EE E E E EE+ E+ +
Sbjct: 499 EEVEEEDEEAEVETTEPAEEDAEDSK 524
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 37.0 bits (86), Expect = 0.012
Identities = 15/29 (51%), Positives = 23/29 (79%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
E+++EEEE+EEEEEEE+E+E +E +
Sbjct: 385 NTERDDEEEEDEEEEEEEDEDEGPSKEHS 413
Score = 36.3 bits (84), Expect = 0.019
Identities = 15/26 (57%), Positives = 21/26 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ E ++EEEE+EEEEEEE+E+E
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEG 407
Score = 36.3 bits (84), Expect = 0.019
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+ EEEE+EEEEEEE+E+E +E
Sbjct: 385 NTERDDEEEEDEEEEEEEDEDEGPSKEH 412
Score = 36.3 bits (84), Expect = 0.020
Identities = 15/26 (57%), Positives = 20/26 (76%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEI 169
E E ++EEEE+EEEEEEE+E+E
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEG 407
Score = 35.9 bits (83), Expect = 0.027
Identities = 15/27 (55%), Positives = 20/27 (74%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E+ E ++EEEE+EEEEEEE+E
Sbjct: 378 ALEIEDANTERDDEEEEDEEEEEEEDE 404
Score = 35.5 bits (82), Expect = 0.033
Identities = 14/27 (51%), Positives = 20/27 (74%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ E ++EEEE+EEEEEEE+E+E
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEGP 408
Score = 35.1 bits (81), Expect = 0.046
Identities = 16/40 (40%), Positives = 24/40 (60%), Gaps = 5/40 (12%)
Query: 140 GKGEEKEEEEEEEEEEEEEEE-----EEEEEEEEITATLN 174
EE+E+EEEEEEE+E+E ++EE EE+ +
Sbjct: 388 RDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKY 427
Score = 34.7 bits (80), Expect = 0.058
Identities = 14/31 (45%), Positives = 20/31 (64%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
L + ++EEEE+EEEEEEE+E+E
Sbjct: 378 ALEIEDANTERDDEEEEDEEEEEEEDEDEGP 408
Score = 33.2 bits (76), Expect = 0.20
Identities = 13/33 (39%), Positives = 20/33 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+E+EEEEE+E+E +E ++EE EE
Sbjct: 395 DEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKY 427
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 36.2 bits (83), Expect = 0.014
Identities = 19/42 (45%), Positives = 27/42 (64%)
Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+L K + L K EEK+ +EE+EEEE EEE ++EE +
Sbjct: 200 EQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGT 241
Score = 33.1 bits (75), Expect = 0.12
Identities = 12/29 (41%), Positives = 21/29 (72%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
T + + +EE+EEEE EEE ++EE + ++
Sbjct: 215 TEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
Score = 32.7 bits (74), Expect = 0.18
Identities = 17/60 (28%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 107 KVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
+ V+ KP Q+ L+ +L + K ++E+EEEE EEE ++EE + ++
Sbjct: 185 SLGGSVQVKAPKP-KQEQLLSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEEGQGTDD 243
Score = 31.9 bits (72), Expect = 0.34
Identities = 21/78 (26%), Positives = 37/78 (47%), Gaps = 17/78 (21%)
Query: 105 HTKVTNEVERTGRKPWPQK------GLIE---RLDGK-----RRPRLYTGKGEEKEEEEE 150
H +T E +PWP++ L++ L G +P+ + +E +E
Sbjct: 156 HPTIT---ENQSFQPWPERLHNNVEELLQSSLSLGGSVQVKAPKPKQEQLLSKLQEYLQE 212
Query: 151 EEEEEEEEEEEEEEEEEE 168
+ EE++ +EE+EEEE E
Sbjct: 213 HKTEEKQPQEEQEEEEVE 230
>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
Length = 499
Score = 36.6 bits (84), Expect = 0.014
Identities = 28/84 (33%), Positives = 40/84 (47%), Gaps = 10/84 (11%)
Query: 100 WVRFDHTKVTNEVERTGRKPWPQKGLIE-RLDGKRRPR-----LYTGKGEEKEEEEEEEE 153
W+R + + R + Q G+I+ RPR L+ GEE EEE+ + E
Sbjct: 37 WLRKEKACALRQQRRHRLQR--QHGVIDGENSETERPRDLTAALFAEAGEEAEEEDNDRE 94
Query: 154 --EEEEEEEEEEEEEEEITATLNP 175
+ E EEE+EEEE E +NP
Sbjct: 95 CPDTEAEEEDEEEEIEAPDPEVNP 118
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 36.5 bits (85), Expect = 0.015
Identities = 14/30 (46%), Positives = 23/30 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
+ +EEE+EE+E+ ++E++EEEEEEE
Sbjct: 350 DLSDEEEDEEDEDSDDEDDEEEEEEEKEKK 379
Score = 35.7 bits (83), Expect = 0.029
Identities = 13/32 (40%), Positives = 23/32 (71%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
+ + +EEE+EE+E+ ++E++EEEEEE
Sbjct: 345 EEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEK 376
Score = 35.7 bits (83), Expect = 0.031
Identities = 14/30 (46%), Positives = 24/30 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
++EE+EE+E+ ++E++EEEEEEE+E
Sbjct: 352 SDEEEDEEDEDSDDEDDEEEEEEEKEKKKK 381
Score = 35.4 bits (82), Expect = 0.035
Identities = 15/32 (46%), Positives = 25/32 (78%)
Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G+GEE EEEEE+ ++E+EE+++++ EEE
Sbjct: 314 FGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEE 345
Score = 34.2 bits (79), Expect = 0.090
Identities = 14/30 (46%), Positives = 25/30 (83%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
+E+E+EE+E+ ++E++EEEEEEE+E+
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKK 382
Score = 33.8 bits (78), Expect = 0.11
Identities = 14/26 (53%), Positives = 24/26 (92%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
EE++EE+E+ ++E++EEEEEEE+E+
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEK 378
Score = 33.4 bits (77), Expect = 0.14
Identities = 12/29 (41%), Positives = 22/29 (75%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EE ++++ EEEEE+ + +EEE+EE+
Sbjct: 332 DEDEEDDDDDLEEEEEDVDLSDEEEDEED 360
Score = 33.4 bits (77), Expect = 0.14
Identities = 12/29 (41%), Positives = 25/29 (86%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ EE+EE+ ++E+EE+++++ EEEEE++
Sbjct: 321 EDEEEEEDGVDDEDEEDDDDDLEEEEEDV 349
Score = 33.4 bits (77), Expect = 0.16
Identities = 15/52 (28%), Positives = 29/52 (55%)
Query: 121 PQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
K + LD + P G + EE+EEEEE+ ++E+EE++++++
Sbjct: 294 DSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEE 345
Score = 33.0 bits (76), Expect = 0.19
Identities = 11/28 (39%), Positives = 23/28 (82%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EE++ + +EEE+EE+E+ ++E++EE
Sbjct: 343 EEEEEDVDLSDEEEDEEDEDSDDEDDEE 370
Score = 33.0 bits (76), Expect = 0.19
Identities = 12/30 (40%), Positives = 22/30 (73%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
G +E EE+++++ EEEEE+ + +EEE+
Sbjct: 329 GVDDEDEEDDDDDLEEEEEDVDLSDEEEDE 358
Score = 32.7 bits (75), Expect = 0.26
Identities = 11/26 (42%), Positives = 21/26 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+EEE+ + +EEE+EE+E+ ++E+
Sbjct: 342 LEEEEEDVDLSDEEEDEEDEDSDDED 367
Score = 32.7 bits (75), Expect = 0.29
Identities = 13/29 (44%), Positives = 25/29 (86%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G+ +E+EEE+ ++E+EE+++++ EEEEE
Sbjct: 319 GEEDEEEEEDGVDDEDEEDDDDDLEEEEE 347
Score = 32.7 bits (75), Expect = 0.31
Identities = 11/28 (39%), Positives = 21/28 (75%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G + E+EE+++++ EEEEE+ + +EE
Sbjct: 328 DGVDDEDEEDDDDDLEEEEEDVDLSDEE 355
Score = 32.3 bits (74), Expect = 0.35
Identities = 14/30 (46%), Positives = 25/30 (83%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
EE EE+E+ ++E++EEEEEEE+E+++ +
Sbjct: 355 EEDEEDEDSDDEDDEEEEEEEKEKKKKKSA 384
Score = 32.3 bits (74), Expect = 0.38
Identities = 10/26 (38%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE ++E+EE+++++ EEEEE+ +
Sbjct: 326 EEDGVDDEDEEDDDDDLEEEEEDVDL 351
Score = 32.3 bits (74), Expect = 0.38
Identities = 11/27 (40%), Positives = 21/27 (77%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ EEEEE+ + +EEE+EE+E+ ++
Sbjct: 339 DDDLEEEEEDVDLSDEEEDEEDEDSDD 365
Score = 32.3 bits (74), Expect = 0.39
Identities = 11/26 (42%), Positives = 21/26 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ +EEEE+ + +EEE+EE+E+ ++E
Sbjct: 341 DLEEEEEDVDLSDEEEDEEDEDSDDE 366
Score = 31.5 bits (72), Expect = 0.67
Identities = 9/31 (29%), Positives = 21/31 (67%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ E+ ++E+EE+++++ EEEEE+ +
Sbjct: 322 DEEEEEDGVDDEDEEDDDDDLEEEEEDVDLS 352
Score = 31.1 bits (71), Expect = 0.83
Identities = 12/27 (44%), Positives = 21/27 (77%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ +++ EEEEE+ + +EEE+EE+E
Sbjct: 335 EEDDDDDLEEEEEDVDLSDEEEDEEDE 361
Score = 30.7 bits (70), Expect = 1.0
Identities = 11/26 (42%), Positives = 21/26 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ ++ EEEEE+ + +EEE+EE+E+
Sbjct: 337 DDDDDLEEEEEDVDLSDEEEDEEDED 362
Score = 30.4 bits (69), Expect = 1.5
Identities = 13/33 (39%), Positives = 28/33 (84%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
+ E++E+E+ ++E++EEEEEEE+E++++ +A
Sbjct: 353 DEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAE 385
Score = 30.4 bits (69), Expect = 1.6
Identities = 11/32 (34%), Positives = 26/32 (81%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
+E++E+ ++E++EEEEEEE+E+++++ +
Sbjct: 357 DEEDEDSDDEDDEEEEEEEKEKKKKKSAESTR 388
Score = 29.6 bits (67), Expect = 2.2
Identities = 11/26 (42%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ + EEEEE+ + +EEE+EE+E+
Sbjct: 338 DDDDLEEEEEDVDLSDEEEDEEDEDS 363
Score = 28.4 bits (64), Expect = 5.3
Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 10/39 (25%)
Query: 143 EEKEEEEEE-----EEE-----EEEEEEEEEEEEEEITA 171
EE +EE E E E EEE++EEEE+ + +A
Sbjct: 261 EELAKEEAERLKKLEAERLRRMRGEEEDDEEEEDSKESA 299
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 36.5 bits (84), Expect = 0.015
Identities = 12/28 (42%), Positives = 24/28 (85%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G+E EEE EEE+++EE+++++++E+E
Sbjct: 41 LGKEAEEEAMEEEDDDEEDDDDDDDEDE 68
Score = 28.8 bits (64), Expect = 3.6
Identities = 6/26 (23%), Positives = 25/26 (96%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+E+++EE+++++++E+E+++++++
Sbjct: 50 MEEEDDDEEDDDDDDDEDEDDDDDDD 75
Score = 27.7 bits (61), Expect = 9.0
Identities = 6/29 (20%), Positives = 26/29 (89%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
++ ++E+E++++++++E++E+E++++ T
Sbjct: 60 DDDDDDEDEDDDDDDDDEDDEDEDDDDST 88
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 35.5 bits (82), Expect = 0.015
Identities = 12/36 (33%), Positives = 23/36 (63%)
Query: 133 RRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
R T K ++K++E+ EEE++ E ++++ EE E
Sbjct: 59 PRKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENE 94
Score = 33.2 bits (76), Expect = 0.11
Identities = 10/35 (28%), Positives = 21/35 (60%)
Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
++ K ++KE+ EEE++ E ++++ EE E
Sbjct: 60 RKPATTKKSKKKDKEKLTEEEKKPESDDDKTEENE 94
Score = 32.8 bits (75), Expect = 0.13
Identities = 15/62 (24%), Positives = 29/62 (46%), Gaps = 6/62 (9%)
Query: 115 TGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
T RKP K + K + +L + E+K E ++++ EE E + + EE + + +
Sbjct: 58 TPRKPATTK----KSKKKDKEKL--TEEEKKPESDDDKTEENENDPDNNEESGDSQESAS 111
Query: 175 PC 176
Sbjct: 112 AN 113
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 36.7 bits (86), Expect = 0.016
Identities = 13/32 (40%), Positives = 19/32 (59%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
EE E EE +++E+EEEEE+E +L
Sbjct: 178 AEEDPAHVGSELEELDDDEDEEEEEDENDDSL 209
Score = 34.8 bits (81), Expect = 0.061
Identities = 16/52 (30%), Positives = 27/52 (51%), Gaps = 5/52 (9%)
Query: 125 LIERL-DGKRRPR----LYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
+RL +G+RR R + E++ E EE +++E+EEEEE+
Sbjct: 155 WYDRLENGERRLRELIDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDEND 206
Score = 31.3 bits (72), Expect = 0.63
Identities = 11/33 (33%), Positives = 20/33 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E EE +++E+EEEEE+E ++ + A +
Sbjct: 183 AHVGSELEELDDDEDEEEEEDENDDSLAADESE 215
Score = 29.4 bits (67), Expect = 2.9
Identities = 12/40 (30%), Positives = 24/40 (60%)
Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ + P + EE +++E+EEEEE+E ++ +E E+
Sbjct: 177 NAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESEL 216
Score = 29.4 bits (67), Expect = 2.9
Identities = 10/35 (28%), Positives = 19/35 (54%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+ EE +++E+EEEEE+E ++ + L
Sbjct: 184 HVGSELEELDDDEDEEEEEDENDDSLAADESELPE 218
Score = 27.5 bits (62), Expect = 9.8
Identities = 7/26 (26%), Positives = 14/26 (53%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE + ++ E E + + +E+ EE
Sbjct: 61 EEAPDADDLLLAENEADAQTDEDAEE 86
>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
synthase complex, beta subunit. Nomenclature follows
the description for Methanosarcina thermophila. The
CO-methylating acetyl-CoA synthase is considered the
defining enzyme of the Wood-Ljungdahl pathway, used for
acetate catabolism by sulfate reducing bacteria but for
acetate biosynthesis by acetogenic bacteria such as
oorella thermoacetica (f. Clostridium thermoaceticum)
[Energy metabolism, Chemoautotrophy].
Length = 458
Score = 36.4 bits (84), Expect = 0.017
Identities = 20/53 (37%), Positives = 25/53 (47%), Gaps = 1/53 (1%)
Query: 123 KGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
L + L K P + E +EEE EEEEE + EE E E E+ A P
Sbjct: 382 DELRKFLKEKGHPVV-KRVVREVDEEEIEEEEEAMQPEEMEMEGFEVPALQMP 433
Score = 29.1 bits (65), Expect = 3.4
Identities = 14/48 (29%), Positives = 21/48 (43%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPVMVHIVV 190
+E+E EEEEE + EE E E E + I V+ + +
Sbjct: 404 DEEEIEEEEEAMQPEEMEMEGFEVPALQMPAASAAPAGIKIVLKNAKI 451
>gnl|CDD|218258 pfam04774, HABP4_PAI-RBP1, Hyaluronan / mRNA binding family. This
family includes the HABP4 family of hyaluronan-binding
proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1.
HABP4 has been observed to bind hyaluronan (a
glucosaminoglycan), but it is not known whether this is
its primary role in vivo. It has also been observed to
bind RNA, but with a lower affinity than that for
hyaluronan. PAI-1 mRNA-binding protein specifically
binds the mRNA of type-1 plasminogen activator inhibitor
(PAI-1), and is thought to be involved in regulation of
mRNA stability. However, in both cases, the sequence
motifs predicted to be important for ligand binding are
not conserved throughout the family, so it is not known
whether members of this family share a common function.
Length = 106
Score = 34.3 bits (79), Expect = 0.017
Identities = 26/75 (34%), Positives = 34/75 (45%), Gaps = 3/75 (4%)
Query: 102 RFDHTKVTNEVER--TGRKPW-PQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEE 158
R + E +R GR W K I L ++ + EEK+ EEE +E
Sbjct: 7 RHSGSGRGGEDKREGAGRGNWGSVKDEIAELTEEQGEEEEKNEVEEKQAVEEEANKEGVV 66
Query: 159 EEEEEEEEEEITATL 173
EEEE EEEE+ TL
Sbjct: 67 EEEEVEEEEDKEMTL 81
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 36.4 bits (84), Expect = 0.018
Identities = 16/28 (57%), Positives = 21/28 (75%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K EEK+EE+ EE + EE +EE EEEE+
Sbjct: 282 KKEEKDEEKSEEVKTEEVDEEFEEEEKG 309
Score = 34.4 bits (79), Expect = 0.075
Identities = 15/27 (55%), Positives = 21/27 (77%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E+KEE++EE+ EE + EE +EE EEE
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFEEEE 307
Score = 33.7 bits (77), Expect = 0.14
Identities = 16/34 (47%), Positives = 21/34 (61%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
K EEK EE + EE +EE EEEE+ E+ +N
Sbjct: 286 KDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVN 319
Score = 33.7 bits (77), Expect = 0.16
Identities = 14/28 (50%), Positives = 22/28 (78%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K +++E++EE+ EE + EE +EE EEEE
Sbjct: 280 KEKKEEKDEEKSEEVKTEEVDEEFEEEE 307
Score = 30.6 bits (69), Expect = 1.2
Identities = 13/26 (50%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E KE++EE++EE+ EE + EE +EE
Sbjct: 278 ELKEKKEEKDEEKSEEVKTEEVDEEF 303
Score = 28.7 bits (64), Expect = 5.0
Identities = 11/24 (45%), Positives = 18/24 (75%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEIT 170
E++EE++EE+ EE + EE +EE
Sbjct: 281 EKKEEKDEEKSEEVKTEEVDEEFE 304
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 35.7 bits (82), Expect = 0.019
Identities = 17/33 (51%), Positives = 23/33 (69%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E+E+E EEEEEE EEEEE + E+++ L P
Sbjct: 7 LEEEDESGEEEEEESEEEEETDSEDDMEPRLKP 39
Score = 31.0 bits (70), Expect = 0.59
Identities = 18/27 (66%), Positives = 19/27 (70%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEIT 170
E E E EEE+E EEEEEE EEEE T
Sbjct: 1 ETEVLELEEEDESGEEEEEESEEEEET 27
Score = 29.5 bits (66), Expect = 2.3
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEE 163
+ E+EEEE EEEEE + E++ E
Sbjct: 6 ELEEEDESGEEEEEESEEEEETDSEDDME 34
Score = 28.7 bits (64), Expect = 3.5
Identities = 14/25 (56%), Positives = 18/25 (72%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEE 165
+ E EEEEEE EEEEE + E++ E
Sbjct: 10 EDESGEEEEEESEEEEETDSEDDME 34
>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid).
Members of this family are bacterial proteins with a
conserved motif [KR]FYDLN, sometimes flanked by a pair
of CXXC motifs, followed by a long region of low
complexity sequence in which roughly half the residues
are Asp and Glu, including multiple runs of five or more
acidic residues. The function of members of this family
is unknown.
Length = 104
Score = 33.8 bits (78), Expect = 0.021
Identities = 7/38 (18%), Positives = 25/38 (65%)
Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
R P +K+E+EE+E++ ++++++++++++
Sbjct: 43 SRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDL 80
Score = 30.7 bits (70), Expect = 0.28
Identities = 6/35 (17%), Positives = 23/35 (65%)
Query: 134 RPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ R E+ +++E+EE+E++ ++++++++
Sbjct: 42 KSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDD 76
Score = 29.6 bits (67), Expect = 0.76
Identities = 5/27 (18%), Positives = 19/27 (70%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEIT 170
+ E+ +++E+EE+E++ +++++
Sbjct: 49 DAEDAAKKDEDEEDEDDVVLDDDDDDD 75
>gnl|CDD|184735 PRK14547, rplD, 50S ribosomal protein L4; Provisional.
Length = 298
Score = 35.7 bits (82), Expect = 0.022
Identities = 16/38 (42%), Positives = 18/38 (47%)
Query: 105 HTKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKG 142
HTK EV G+KPW QKG G R + G G
Sbjct: 50 HTKTRAEVRGGGKKPWRQKGTGRARQGSIRAPQWRGGG 87
>gnl|CDD|234456 TIGR04074, bacter_Hen1, 3' terminal RNA ribose
2'-O-methyltransferase Hen1. Members of this protein
family are bacterial Hen1, a 3' terminal RNA ribose
2'-O-methyltransferase that acts in bacterial RNA
repair. All members of the seed alignment belong to a
cassette with the RNA repair enzyme polynucleotide
kinase-phosphatase (Pnkp). Chemically similar Hen1 in
eukaryotes acts instead on small regulatory RNAs
[Transcription, RNA processing, Protein synthesis, tRNA
and rRNA base modification].
Length = 462
Score = 35.7 bits (83), Expect = 0.025
Identities = 34/100 (34%), Positives = 43/100 (43%), Gaps = 21/100 (21%)
Query: 81 LVPRLDVITMNIEWQKKYKWVRFDHTKVTNEVERTGRK--PW----PQKGLIERLDGKRR 134
L+P LD K+ WV D EVE+ RK W P++ LI R K R
Sbjct: 183 LIPVLD--------DAKHYWVGDD------EVEKLLRKGEGWLAEHPERELITRRYLKHR 228
Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
R + E +E E EE E EE +EE E +LN
Sbjct: 229 -RSLARDALARLAEADEAEPEEAETEEAQEEAAEKPPSLN 267
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 35.6 bits (82), Expect = 0.026
Identities = 18/30 (60%), Positives = 23/30 (76%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G+ EE+E +E+EEEEE EEEEEEE+E
Sbjct: 254 DQDGDYVEEKELKEDEEEEETEEEEEEEDE 283
Score = 34.4 bits (79), Expect = 0.058
Identities = 19/25 (76%), Positives = 23/25 (92%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
EEKE +E+EEEEE EEEEEEE+E+E
Sbjct: 261 EEKELKEDEEEEETEEEEEEEDEDE 285
Score = 33.6 bits (77), Expect = 0.094
Identities = 15/29 (51%), Positives = 21/29 (72%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ + EE+E +E+EEEEE EEEEEE
Sbjct: 252 DDDQDGDYVEEKELKEDEEEEETEEEEEE 280
Score = 33.3 bits (76), Expect = 0.11
Identities = 16/31 (51%), Positives = 20/31 (64%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
+ EE+E +E+EEEEE EEEEEEE
Sbjct: 255 QDGDYVEEKELKEDEEEEETEEEEEEEDEDE 285
Score = 31.7 bits (72), Expect = 0.39
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
++ + EE+E +E+EEEEE EEEEE
Sbjct: 254 DQDGDYVEEKELKEDEEEEETEEEEEEEDE 283
Score = 30.6 bits (69), Expect = 1.0
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E+ +++ + EE+E +E+EEEEE EE
Sbjct: 251 EDDDQDGDYVEEKELKEDEEEEETEEE 277
Score = 29.4 bits (66), Expect = 2.3
Identities = 11/28 (39%), Positives = 21/28 (75%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K + EE+++++ + EE+E +E+EEEE
Sbjct: 245 KKSDDEEDDDQDGDYVEEKELKEDEEEE 272
Score = 28.6 bits (64), Expect = 4.1
Identities = 12/26 (46%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE ++++ + EE+E +E+EEEEE E
Sbjct: 250 EEDDDQDGDYVEEKELKEDEEEEETE 275
Score = 28.3 bits (63), Expect = 4.7
Identities = 15/23 (65%), Positives = 19/23 (82%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEE 162
+ +E EEEEE EEEEEEE+E+E
Sbjct: 263 KELKEDEEEEETEEEEEEEDEDE 285
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 35.7 bits (83), Expect = 0.029
Identities = 17/33 (51%), Positives = 24/33 (72%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
K +++EEEEEEE+E++EEE+EEEE A
Sbjct: 430 KKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Score = 35.3 bits (82), Expect = 0.032
Identities = 20/30 (66%), Positives = 26/30 (86%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
G++KEEEEEEE+E++EEE+EEEEEE E
Sbjct: 436 AGKKKEEEEEEEKEKKEEEKEEEEEEAEEE 465
Score = 34.9 bits (81), Expect = 0.049
Identities = 15/40 (37%), Positives = 25/40 (62%)
Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
K + K E+K++ +++EEEEEEE+E++EEE
Sbjct: 417 EKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456
Score = 34.9 bits (81), Expect = 0.054
Identities = 20/28 (71%), Positives = 26/28 (92%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K EE+EEE+E++EEE+EEEEEE EEE+E
Sbjct: 440 KEEEEEEEKEKKEEEKEEEEEEAEEEKE 467
Score = 34.5 bits (80), Expect = 0.060
Identities = 21/30 (70%), Positives = 26/30 (86%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EEKE++EEE+EEEEEE EEE+EEEEE
Sbjct: 442 EEEEEEKEKKEEEKEEEEEEAEEEKEEEEE 471
Score = 34.1 bits (79), Expect = 0.075
Identities = 17/45 (37%), Positives = 27/45 (60%)
Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
I K+R K ++ +++EEEEEEE+E++EEE+EE
Sbjct: 415 IVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEE 459
Score = 34.1 bits (79), Expect = 0.079
Identities = 18/37 (48%), Positives = 28/37 (75%)
Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ + + E++++EEE+EEEEEE EEE+EEEEE+
Sbjct: 436 AGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
Score = 34.1 bits (79), Expect = 0.081
Identities = 18/44 (40%), Positives = 31/44 (70%)
Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
++ + +++ + +K+EEEEEEE+E++EEE+EEEEEE
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Score = 34.1 bits (79), Expect = 0.091
Identities = 22/33 (66%), Positives = 29/33 (87%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
+ E+KEEE+EEEEEE EEE+EEEEE+++ ATL
Sbjct: 447 EKEKKEEEKEEEEEEAEEEKEEEEEKKKKQATL 479
Score = 33.0 bits (76), Expect = 0.22
Identities = 19/60 (31%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
Query: 110 NEVER-TGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+E TG K +K +++ + ++K +++EEEEEEE+E++EEE+E
Sbjct: 397 EEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKE 456
Score = 32.6 bits (75), Expect = 0.23
Identities = 19/30 (63%), Positives = 26/30 (86%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ +EK+EEE+EEEEEE EEE+EEEEE++
Sbjct: 444 EEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
Score = 32.2 bits (74), Expect = 0.34
Identities = 12/32 (37%), Positives = 23/32 (71%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
+ EEK+E++++ +++EEEEEEE+E+
Sbjct: 423 REEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Score = 31.8 bits (73), Expect = 0.44
Identities = 10/30 (33%), Positives = 20/30 (66%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
EK+ EEE++E++++ +++EEEE
Sbjct: 419 AEKKREEEKKEKKKKAFAGKKKEEEEEEEK 448
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 34.9 bits (81), Expect = 0.030
Identities = 10/27 (37%), Positives = 22/27 (81%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E +EE+ E++++++E+E+E++EEE
Sbjct: 45 ESELDEEDLEDDDDDDEDEDEDDEEEA 71
Score = 34.9 bits (81), Expect = 0.033
Identities = 10/26 (38%), Positives = 22/26 (84%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E +EE+ E++++++E+E+E++EE
Sbjct: 44 IESELDEEDLEDDDDDDEDEDEDDEE 69
Score = 34.5 bits (80), Expect = 0.039
Identities = 8/27 (29%), Positives = 22/27 (81%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+E++ E++++++E+E+E++EEE +
Sbjct: 49 DEEDLEDDDDDDEDEDEDDEEEADLGP 75
Score = 34.5 bits (80), Expect = 0.045
Identities = 10/26 (38%), Positives = 22/26 (84%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E EE+ E++++++E+E+E++EEE +
Sbjct: 47 ELDEEDLEDDDDDDEDEDEDDEEEAD 72
Score = 34.1 bits (79), Expect = 0.052
Identities = 8/29 (27%), Positives = 20/29 (68%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E E +EE+ E++++++E+E+E++
Sbjct: 42 AAIESELDEEDLEDDDDDDEDEDEDDEEE 70
Score = 32.5 bits (75), Expect = 0.19
Identities = 8/30 (26%), Positives = 18/30 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
E E +EE+ E++++++E+E+E
Sbjct: 40 TAAAIESELDEEDLEDDDDDDEDEDEDDEE 69
Score = 31.8 bits (73), Expect = 0.30
Identities = 7/26 (26%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E +EE+ E++++++E+E+
Sbjct: 39 ATAAAIESELDEEDLEDDDDDDEDED 64
Score = 30.6 bits (70), Expect = 0.75
Identities = 6/26 (23%), Positives = 20/26 (76%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEE 166
+ + +++++++E+E+E++EEE +
Sbjct: 50 EEDLEDDDDDDEDEDEDDEEEADLGP 75
Score = 30.6 bits (70), Expect = 0.79
Identities = 7/26 (26%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E +EE+ E++++++E+E
Sbjct: 38 AATAAAIESELDEEDLEDDDDDDEDE 63
Score = 29.8 bits (68), Expect = 1.4
Identities = 6/26 (23%), Positives = 15/26 (57%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E +EE+ E++++++E+
Sbjct: 37 AAATAAAIESELDEEDLEDDDDDDED 62
Score = 29.8 bits (68), Expect = 1.5
Identities = 8/29 (27%), Positives = 19/29 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E +++++E+E+E++EEE + + E
Sbjct: 53 LEDDDDDDEDEDEDDEEEADLGPDPEEAR 81
Score = 28.3 bits (64), Expect = 3.6
Identities = 6/26 (23%), Positives = 14/26 (53%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E +EE+ E++++++E
Sbjct: 36 AAAATAAAIESELDEEDLEDDDDDDE 61
Score = 28.3 bits (64), Expect = 3.7
Identities = 5/26 (19%), Positives = 13/26 (50%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E +EE+ E++++++
Sbjct: 35 AAAAATAAAIESELDEEDLEDDDDDD 60
Score = 27.9 bits (63), Expect = 6.2
Identities = 5/26 (19%), Positives = 12/26 (46%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E +EE+ E+++++
Sbjct: 34 AAAAAATAAAIESELDEEDLEDDDDD 59
>gnl|CDD|221620 pfam12527, DUF3727, Protein of unknown function (DUF3727). This
domain family is found in bacteria and eukaryotes, and
is approximately 100 amino acids in length.
Length = 100
Score = 33.3 bits (77), Expect = 0.035
Identities = 20/43 (46%), Positives = 29/43 (67%), Gaps = 2/43 (4%)
Query: 132 KRRPRLYTGKGE--EKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
KR T +GE E EE+E E EEE++++E+EEEE E+ A+
Sbjct: 6 KRSAFTLTVRGELPEPEEDEILELEEEDDDDEDEEEEYELLAS 48
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 34.0 bits (78), Expect = 0.037
Identities = 11/33 (33%), Positives = 19/33 (57%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+++ +EEEE EEE +E E+E+ E +
Sbjct: 86 KIFRELSSSDDEEEETEEESTDETEQEDPPETK 118
Score = 33.2 bits (76), Expect = 0.082
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
L + EE+E EEE +E E+E+ E + E +
Sbjct: 91 LSSSDDEEEETEEESTDETEQEDPPETKTESK 122
Score = 31.6 bits (72), Expect = 0.21
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
++EEEE EEE +E E+E+ E T +
Sbjct: 95 DDEEEETEEESTDETEQEDPPETKTES 121
Score = 30.9 bits (70), Expect = 0.41
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE EEE +E E+E+ E + E +E+
Sbjct: 99 EETEEESTDETEQEDPPETKTESKEK 124
Score = 29.3 bits (66), Expect = 1.7
Identities = 12/28 (42%), Positives = 17/28 (60%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEITAT 172
++EEEE EEE +E E+E+ T T
Sbjct: 92 SSSDDEEEETEEESTDETEQEDPPETKT 119
>gnl|CDD|145361 pfam02161, Prog_receptor, Progesterone receptor.
Length = 456
Score = 35.0 bits (80), Expect = 0.042
Identities = 12/40 (30%), Positives = 19/40 (47%)
Query: 129 LDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
LDG PR G+ E E+ ++++ + E E E E
Sbjct: 44 LDGLLYPRSCQGQEEPDEKTQDQQSLSDVERAEPEVEASR 83
>gnl|CDD|214375 CHL00147, rpl4, ribosomal protein L4; Validated.
Length = 215
Score = 34.6 bits (80), Expect = 0.043
Identities = 17/38 (44%), Positives = 19/38 (50%)
Query: 105 HTKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKG 142
TK +EV GRKPW QKG G R L+ G G
Sbjct: 55 STKTRSEVRGGGRKPWKQKGTGRARAGSNRSPLWKGGG 92
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 34.4 bits (79), Expect = 0.049
Identities = 15/32 (46%), Positives = 23/32 (71%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
E++E+E EE+E E EEEE+EE E+ A+ +
Sbjct: 47 EDEEDEAVVEEDENELTEEEEDEEGEVKASPD 78
Score = 32.4 bits (74), Expect = 0.24
Identities = 14/33 (42%), Positives = 21/33 (63%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+++EE+E EE+E E EEEE+EE +P
Sbjct: 45 VDEDEEDEAVVEEDENELTEEEEDEEGEVKASP 77
Score = 29.4 bits (66), Expect = 2.2
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
+E++E EE+E E EEEE+EE E
Sbjct: 47 EDEEDEAVVEEDENELTEEEEDEEGE 72
Score = 29.0 bits (65), Expect = 3.0
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
+EE E++ +E+EE+E EE+E T
Sbjct: 37 DEEAEDDVVDEDEEDEAVVEEDENELT 63
>gnl|CDD|183731 PRK12766, PRK12766, 50S ribosomal protein L32e; Provisional.
Length = 232
Score = 34.4 bits (79), Expect = 0.050
Identities = 18/42 (42%), Positives = 23/42 (54%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILP 183
G + EE E E E+E EEEEEE+ ++ L P GL P
Sbjct: 57 GGLEVSEETEAEVEDEGGEEEEEEDADVETELRPRGLTEKTP 98
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 34.8 bits (80), Expect = 0.053
Identities = 9/39 (23%), Positives = 20/39 (51%)
Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
KR K +++ EE ++++ E+E ++ E+E
Sbjct: 73 SAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERL 111
Score = 33.6 bits (77), Expect = 0.11
Identities = 9/31 (29%), Positives = 21/31 (67%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
K E+++ EE ++++ E+E ++ E+E + A
Sbjct: 83 KKEQQQAEELQQKQAAEQERLKQLEKERLAA 113
Score = 32.1 bits (73), Expect = 0.32
Identities = 8/38 (21%), Positives = 21/38 (55%)
Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
++R + + EE ++++ E+E ++ E+E +E
Sbjct: 79 EQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQ 116
Score = 30.2 bits (68), Expect = 1.6
Identities = 6/26 (23%), Positives = 20/26 (76%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEI 169
E++ +++E+++ EE ++++ E+E +
Sbjct: 78 EEQRKKKEQQQAEELQQKQAAEQERL 103
Score = 29.0 bits (65), Expect = 3.3
Identities = 6/26 (23%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
++K+ E+E ++ E+E +E+++
Sbjct: 93 QQKQAAEQERLKQLEKERLAAQEQKK 118
Score = 28.6 bits (64), Expect = 4.4
Identities = 7/28 (25%), Positives = 16/28 (57%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K EKE +E++++ EE ++ ++
Sbjct: 104 KQLEKERLAAQEQKKQAEEAAKQAALKQ 131
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 32.7 bits (75), Expect = 0.057
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEE 165
E ++EE++EEEEEE +++
Sbjct: 71 AAAAAAAAAEAKKEEKKEEEEEESDDD 97
Score = 31.9 bits (73), Expect = 0.096
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
E ++EE++EEEEEE ++++
Sbjct: 74 AAAAAAEAKKEEKKEEEEEESDDDM 98
>gnl|CDD|224759 COG1846, MarR, Transcriptional regulators [Transcription].
Length = 126
Score = 33.0 bits (75), Expect = 0.059
Identities = 19/57 (33%), Positives = 25/57 (43%), Gaps = 4/57 (7%)
Query: 122 QKGLIERL---DGKRRPRLY-TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
KGLIERL + +R + T KG E E+ +E E EEE+ L
Sbjct: 62 DKGLIERLRDPEDRRAVLVRLTEKGRELLEQLLPAAQELLAEILAGLSEEELRKLLR 118
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 34.2 bits (79), Expect = 0.064
Identities = 15/44 (34%), Positives = 29/44 (65%)
Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
L+E+ + R + +G E +++++EEEE E EE+E+ ++E E
Sbjct: 74 LLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGE 117
Score = 34.2 bits (79), Expect = 0.070
Identities = 16/34 (47%), Positives = 21/34 (61%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
E+ +EE EE+EEE EEEE E E+E + L
Sbjct: 144 KAKEDSDEELSEEDEEEAAEEEEAEAEKEKASEL 177
Score = 33.5 bits (77), Expect = 0.11
Identities = 11/30 (36%), Positives = 20/30 (66%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
K E+ E +++++EEEE E EE+E+ +
Sbjct: 85 KKEAEQGLESDDDDDEEEEWEVEEDEDSDD 114
Score = 32.3 bits (74), Expect = 0.25
Identities = 20/56 (35%), Positives = 26/56 (46%), Gaps = 7/56 (12%)
Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRIL 182
E + + EE EE+EEE EEEE E E+E+ E+ T RIL
Sbjct: 135 EEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEKASELATT-------RIL 183
Score = 31.2 bits (71), Expect = 0.58
Identities = 12/43 (27%), Positives = 26/43 (60%)
Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E L+ + + E+ E +++++EEEE E EE+E+ ++
Sbjct: 72 LELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDD 114
Score = 30.4 bits (69), Expect = 1.1
Identities = 11/42 (26%), Positives = 24/42 (57%)
Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ + +++ G + +++EEEE E EE+E+ ++E E
Sbjct: 78 WKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWI 119
Score = 30.0 bits (68), Expect = 1.4
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
GE + E ++E E + E+EEE++E
Sbjct: 116 GEWIDVESDKEIESSDSEDEEEKDEAAK 143
Score = 29.2 bits (66), Expect = 2.3
Identities = 10/34 (29%), Positives = 20/34 (58%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPC 176
+KE E + E+EEE++E ++ +E+ L+
Sbjct: 123 SDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEE 156
Score = 29.2 bits (66), Expect = 2.7
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EE+ E EE+E+ ++E E + E ++EI
Sbjct: 100 EEEEWEVEEDEDSDDEGEWIDVESDKEI 127
Score = 28.1 bits (63), Expect = 6.7
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E +E E + E+EEE++E ++ +E
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKE 147
Score = 28.1 bits (63), Expect = 6.9
Identities = 8/29 (27%), Positives = 15/29 (51%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
+ +E E + E ++E E + E+EE
Sbjct: 111 DSDDEGEWIDVESDKEIESSDSEDEEEKD 139
Score = 28.1 bits (63), Expect = 7.0
Identities = 9/31 (29%), Positives = 19/31 (61%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
+ E + EE+E+ ++E E + E ++E E +
Sbjct: 100 EEEEWEVEEDEDSDDEGEWIDVESDKEIESS 130
Score = 27.7 bits (62), Expect = 7.7
Identities = 10/32 (31%), Positives = 21/32 (65%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
K E + E+EEE++E ++ +E+ +EE++
Sbjct: 124 DKEIESSDSEDEEEKDEAAKKAKEDSDEELSE 155
Score = 27.7 bits (62), Expect = 8.1
Identities = 10/26 (38%), Positives = 18/26 (69%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E EEE++E ++ +E+ +EE EE+
Sbjct: 132 SEDEEEKDEAAKKAKEDSDEELSEED 157
Score = 27.7 bits (62), Expect = 8.7
Identities = 11/27 (40%), Positives = 18/27 (66%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ EEEE E EE+E+ ++E E + E
Sbjct: 96 DDDDEEEEWEVEEDEDSDDEGEWIDVE 122
>gnl|CDD|185582 PTZ00373, PTZ00373, 60S Acidic ribosomal protein P2; Provisional.
Length = 112
Score = 32.6 bits (74), Expect = 0.070
Identities = 12/19 (63%), Positives = 17/19 (89%)
Query: 141 KGEEKEEEEEEEEEEEEEE 159
K E K+EE++EEEEEEE++
Sbjct: 87 KAEAKKEEKKEEEEEEEDD 105
Score = 31.0 bits (70), Expect = 0.21
Identities = 10/19 (52%), Positives = 15/19 (78%)
Query: 140 GKGEEKEEEEEEEEEEEEE 158
+KEE++EEEEEEE++
Sbjct: 87 KAEAKKEEKKEEEEEEEDD 105
Score = 31.0 bits (70), Expect = 0.22
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEE 163
K E ++EE++EEEEEEE++
Sbjct: 85 GAKAEAKKEEKKEEEEEEEDD 105
Score = 30.6 bits (69), Expect = 0.30
Identities = 11/21 (52%), Positives = 18/21 (85%)
Query: 142 GEEKEEEEEEEEEEEEEEEEE 162
G + E ++EE++EEEEEEE++
Sbjct: 85 GAKAEAKKEEKKEEEEEEEDD 105
Score = 30.3 bits (68), Expect = 0.42
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
G + E ++EE++EEEEEEE++
Sbjct: 80 GAATAGAKAEAKKEEKKEEEEEEEDD 105
Score = 30.3 bits (68), Expect = 0.50
Identities = 11/26 (42%), Positives = 20/26 (76%)
Query: 148 EEEEEEEEEEEEEEEEEEEEEITATL 173
+ E ++EE++EEEEEEE+++ +L
Sbjct: 85 GAKAEAKKEEKKEEEEEEEDDLGFSL 110
Score = 29.5 bits (66), Expect = 0.78
Identities = 11/21 (52%), Positives = 17/21 (80%)
Query: 140 GKGEEKEEEEEEEEEEEEEEE 160
G E ++EE++EEEEEEE++
Sbjct: 85 GAKAEAKKEEKKEEEEEEEDD 105
Score = 29.5 bits (66), Expect = 0.80
Identities = 10/23 (43%), Positives = 16/23 (69%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
+ E ++EE++EEEEEEE+
Sbjct: 82 ATAGAKAEAKKEEKKEEEEEEED 104
Score = 28.0 bits (62), Expect = 2.8
Identities = 10/23 (43%), Positives = 18/23 (78%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEE 161
T + + ++EE++EEEEEEE++
Sbjct: 83 TAGAKAEAKKEEKKEEEEEEEDD 105
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 33.3 bits (76), Expect = 0.070
Identities = 18/27 (66%), Positives = 23/27 (85%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+E +EEEEEE+EEE EE E+ E+EEEI
Sbjct: 48 QEDDEEEEEEDEEEIEEPEDIEDEEEI 74
Score = 32.9 bits (75), Expect = 0.11
Identities = 16/30 (53%), Positives = 22/30 (73%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+ EE E+ E+EEE E+EEEEEE+EE+
Sbjct: 58 DEEEIEEPEDIEDEEEIVEDEEEEEEDEED 87
Score = 32.5 bits (74), Expect = 0.15
Identities = 16/25 (64%), Positives = 22/25 (88%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
+EK+E++EEEEEE+EEE EE E+ E
Sbjct: 45 DEKQEDDEEEEEEDEEEIEEPEDIE 69
Score = 31.8 bits (72), Expect = 0.26
Identities = 18/27 (66%), Positives = 22/27 (81%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
K E+ EEEEEE+EEE EE E+ E+EEE
Sbjct: 47 KQEDDEEEEEEDEEEIEEPEDIEDEEE 73
Score = 31.0 bits (70), Expect = 0.41
Identities = 17/28 (60%), Positives = 24/28 (85%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K E++E++EEEEEE+EEE EE E+ E+E
Sbjct: 44 KDEKQEDDEEEEEEDEEEIEEPEDIEDE 71
Score = 31.0 bits (70), Expect = 0.51
Identities = 16/25 (64%), Positives = 23/25 (92%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
K+E++E++EEEEEE+EEE EE E+I
Sbjct: 44 KDEKQEDDEEEEEEDEEEIEEPEDI 68
Score = 30.6 bits (69), Expect = 0.58
Identities = 18/28 (64%), Positives = 23/28 (82%), Gaps = 2/28 (7%)
Query: 143 EEKEEEEEEEEEEE--EEEEEEEEEEEE 168
EE EE E+ E+EEE E+EEEEEE+EE+
Sbjct: 60 EEIEEPEDIEDEEEIVEDEEEEEEDEED 87
Score = 29.8 bits (67), Expect = 1.1
Identities = 14/26 (53%), Positives = 22/26 (84%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEI 169
++++E++EEEEEE+EEE EE E+ E
Sbjct: 45 DEKQEDDEEEEEEDEEEIEEPEDIED 70
Score = 29.4 bits (66), Expect = 1.5
Identities = 16/27 (59%), Positives = 21/27 (77%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EE+EE+EEE EE E+ E+EEE E+E
Sbjct: 53 EEEEEDEEEIEEPEDIEDEEEIVEDEE 79
Score = 29.1 bits (65), Expect = 1.8
Identities = 14/27 (51%), Positives = 20/27 (74%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E EEE E+EEEEEE+EE+ + ++I
Sbjct: 68 IEDEEEIVEDEEEEEEDEEDNVDLKDI 94
Score = 29.1 bits (65), Expect = 2.0
Identities = 10/28 (35%), Positives = 20/28 (71%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
E++EEEEE+EE+ + ++ E++ +I
Sbjct: 76 EDEEEEEEDEEDNVDLKDIEKKNINDIF 103
Score = 29.1 bits (65), Expect = 2.0
Identities = 15/25 (60%), Positives = 23/25 (92%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
KE E+ ++E++E++EEEEEE+EEEI
Sbjct: 38 KENEDVKDEKQEDDEEEEEEDEEEI 62
Score = 29.1 bits (65), Expect = 2.2
Identities = 14/25 (56%), Positives = 20/25 (80%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEIT 170
E++E++EEEEEE+EEE EE E+
Sbjct: 46 EKQEDDEEEEEEDEEEIEEPEDIED 70
Score = 28.7 bits (64), Expect = 2.9
Identities = 12/26 (46%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE E+EEEEEE+EE+ + ++ E++
Sbjct: 72 EEIVEDEEEEEEDEEDNVDLKDIEKK 97
Score = 27.9 bits (62), Expect = 5.3
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E E+EEEEEE+EE+ + ++ E++ I
Sbjct: 73 EIVEDEEEEEEDEEDNVDLKDIEKKNI 99
Score = 27.5 bits (61), Expect = 6.5
Identities = 12/26 (46%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+ E+EEEEEE+EE+ + ++ E+
Sbjct: 71 EEEIVEDEEEEEEDEEDNVDLKDIEK 96
>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA. This model
describes flagellar biosynthesis protein FlhA, one of a
large number of genes associated with the biosynthesis
of functional bacterial flagella. Homologs of many such
proteins, including FlhA, function in type III protein
secretion systems. A separate model describes InvA
(Salmonella enterica), LcrD (Yersinia enterocolitica),
HrcV (Xanthomonas), etc., all of which score below the
noise cutoff for this model [Cellular processes,
Chemotaxis and motility].
Length = 678
Score = 34.2 bits (79), Expect = 0.077
Identities = 14/36 (38%), Positives = 19/36 (52%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
+ + K+EEE E + +EE EEEEE I L
Sbjct: 307 WYLRRRSKQEEEAAAEAAKAQEEAAEEEEESINDIL 342
Score = 31.1 bits (71), Expect = 0.83
Identities = 12/39 (30%), Positives = 19/39 (48%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLC 179
+ +++EE E + +EE EEEEE L+ L
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEEEESINDILALDDLELE 350
Score = 29.9 bits (68), Expect = 2.0
Identities = 13/37 (35%), Positives = 18/37 (48%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
L +E+E E + +EE EEEEE +I A
Sbjct: 308 YLRRRSKQEEEAAAEAAKAQEEAAEEEEESINDILAL 344
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 33.8 bits (78), Expect = 0.081
Identities = 7/24 (29%), Positives = 9/24 (37%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEE 162
EEEEEEE++
Sbjct: 280 AAAAAAAAAAAAAPAEEEEEEEDD 303
Score = 32.7 bits (75), Expect = 0.23
Identities = 7/23 (30%), Positives = 9/23 (39%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEE 166
EEEEEEE++
Sbjct: 281 AAAAAAAAAAAAPAEEEEEEEDD 303
Score = 31.9 bits (73), Expect = 0.33
Identities = 7/21 (33%), Positives = 9/21 (42%)
Query: 147 EEEEEEEEEEEEEEEEEEEEE 167
EEEEEEE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303
Score = 31.9 bits (73), Expect = 0.33
Identities = 7/21 (33%), Positives = 9/21 (42%)
Query: 148 EEEEEEEEEEEEEEEEEEEEE 168
EEEEEEE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303
Score = 31.9 bits (73), Expect = 0.35
Identities = 7/21 (33%), Positives = 9/21 (42%)
Query: 143 EEKEEEEEEEEEEEEEEEEEE 163
EEEEEEE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303
Score = 30.0 bits (68), Expect = 1.3
Identities = 7/20 (35%), Positives = 7/20 (35%)
Query: 150 EEEEEEEEEEEEEEEEEEEI 169
EEEEEEE
Sbjct: 283 AAAAAAAAAAPAEEEEEEED 302
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 33.1 bits (76), Expect = 0.085
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
GEE EE EE+E++E +EEE + ++
Sbjct: 10 GEEDEELPEEDEDDESSDEEEVDLPDDE 37
Score = 31.6 bits (72), Expect = 0.30
Identities = 11/29 (37%), Positives = 21/29 (72%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ +E +EEE + ++E++EE + EEE+I
Sbjct: 21 EDDESSDEEEVDLPDDEQDEESDSEEEQI 49
Score = 31.2 bits (71), Expect = 0.44
Identities = 13/42 (30%), Positives = 24/42 (57%)
Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E DG+ L +++ +EEE + ++E++EE + EEE
Sbjct: 6 ESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEE 47
Score = 30.4 bits (69), Expect = 0.64
Identities = 10/34 (29%), Positives = 22/34 (64%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
G+ +E+ EE+E++E +EEE + ++E+ +
Sbjct: 10 GEEDEELPEEDEDDESSDEEEVDLPDDEQDEESD 43
Score = 30.4 bits (69), Expect = 0.77
Identities = 13/33 (39%), Positives = 21/33 (63%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
G E ++ EE+EE EE+E++E +EEE+
Sbjct: 2 GSESESDDGEEDEELPEEDEDDESSDEEEVDLP 34
Score = 30.0 bits (68), Expect = 0.83
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
EEE + ++E++EE + EEE+ +T
Sbjct: 22 DDESSDEEEVDLPDDEQDEESDSEEEQIFVT 52
Score = 29.7 bits (67), Expect = 1.3
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
G E E ++ EE+EE EE+E++E +E L
Sbjct: 2 GSESESDDGEEDEELPEEDEDDESSDEEEVDL 33
Score = 28.9 bits (65), Expect = 2.4
Identities = 10/30 (33%), Positives = 18/30 (60%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E +EEE + ++E++EE + EEE+
Sbjct: 21 EDDESSDEEEVDLPDDEQDEESDSEEEQIF 50
>gnl|CDD|100108 cd04411, Ribosomal_P1_P2_L12p, Ribosomal protein P1, P2, and L12p.
Ribosomal proteins P1 and P2 are the eukaryotic proteins
that are functionally equivalent to bacterial L7/L12.
L12p is the archaeal homolog. Unlike other ribosomal
proteins, the archaeal L12p and eukaryotic P1 and P2 do
not share sequence similarity with their bacterial
counterparts. They are part of the ribosomal stalk
(called the L7/L12 stalk in bacteria), along with 28S
rRNA and the proteins L11 and P0 in eukaryotes (23S
rRNA, L11, and L10e in archaea). In bacterial ribosomes,
L7/L12 homodimers bind the extended C-terminal helix of
L10 to anchor the L7/L12 molecules to the ribosome.
Eukaryotic P1/P2 heterodimers and archaeal L12p
homodimers are believed to bind the L10 equivalent
proteins, eukaryotic P0 and archaeal L10e, in a similar
fashion. P1 and P2 (L12p, L7/L12) are the only proteins
in the ribosome to occur as multimers, always appearing
as sets of dimers. Recent data indicate that most
archaeal species contain six copies of L12p (three
homodimers), while eukaryotes have two copies each of P1
and P2 (two heterodimers). Bacteria may have four or six
copies (two or three homodimers), depending on the
species. As in bacteria, the stalk is crucial for
binding of initiation, elongation, and release factors
in eukaryotes and archaea.
Length = 105
Score = 32.2 bits (73), Expect = 0.095
Identities = 12/28 (42%), Positives = 16/28 (57%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E E+ EE +EEEEEEE+E+
Sbjct: 75 AAATAEPAEKAEEAKEEEEEEEDEDFGF 102
Score = 31.5 bits (71), Expect = 0.14
Identities = 13/23 (56%), Positives = 17/23 (73%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEE 161
T + EK EE +EEEEEEE+E+
Sbjct: 78 TAEPAEKAEEAKEEEEEEEDEDF 100
Score = 31.5 bits (71), Expect = 0.16
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E+ EE +EEEEEEE+E+
Sbjct: 74 TAAATAEPAEKAEEAKEEEEEEEDEDF 100
Score = 30.7 bits (69), Expect = 0.29
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
P E E+ EE +EEEEEEE+E+
Sbjct: 71 PAATAAATAEPAEKAEEAKEEEEEEEDEDF 100
Score = 29.9 bits (67), Expect = 0.55
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEE 165
T + E+ EE +EEEEEEE+E+
Sbjct: 74 TAAATAEPAEKAEEAKEEEEEEEDEDF 100
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 33.5 bits (76), Expect = 0.11
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
KEE + + +E EE+EE++E EE
Sbjct: 306 AHKEEVQSDRPDEIGEEKEEDDENEEN 332
Score = 32.4 bits (73), Expect = 0.33
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E +EE + + +E EE+EE++E E
Sbjct: 304 AEAHKEEVQSDRPDEIGEEKEEDDENE 330
Score = 32.0 bits (72), Expect = 0.40
Identities = 13/32 (40%), Positives = 19/32 (59%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
EE + + +E EE+EE++E EE E T L
Sbjct: 308 KEEVQSDRPDEIGEEKEEDDENEENERHTELL 339
Score = 31.6 bits (71), Expect = 0.48
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K E + + +E EE+EE++E EE E
Sbjct: 308 KEEVQSDRPDEIGEEKEEDDENEENERH 335
Score = 31.2 bits (70), Expect = 0.59
Identities = 11/43 (25%), Positives = 20/43 (46%)
Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
IE + + + E E +EE + + +E EE+EE+
Sbjct: 284 IENKEVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEED 326
Score = 29.7 bits (66), Expect = 1.8
Identities = 9/28 (32%), Positives = 18/28 (64%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+ ++E + + +E EE+EE++E EE
Sbjct: 304 AEAHKEEVQSDRPDEIGEEKEEDDENEE 331
Score = 28.5 bits (63), Expect = 4.8
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E +EE + + +E EE+EE++
Sbjct: 302 ENAEAHKEEVQSDRPDEIGEEKEEDD 327
Score = 28.5 bits (63), Expect = 5.1
Identities = 9/29 (31%), Positives = 16/29 (55%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+ + + +E EE+EE++E EE E
Sbjct: 307 HKEEVQSDRPDEIGEEKEEDDENEENERH 335
Score = 28.5 bits (63), Expect = 5.1
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
+ + +E EE+EE++E EE E E+ A
Sbjct: 309 EEVQSDRPDEIGEEKEEDDENEENERHTELLA 340
>gnl|CDD|218303 pfam04874, Mak16, Mak16 protein C-terminal region. The precise
function of this eukaryotic protein family is unknown.
The yeast orthologues have been implicated in cell cycle
progression and biogenesis of 60S ribosomal subunits.
The Schistosoma mansoni Mak16 has been shown to target
protein transport to the nucleolus.
Length = 97
Score = 31.8 bits (72), Expect = 0.11
Identities = 19/28 (67%), Positives = 21/28 (75%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K E EE EE +EEEEEEEEEE+E E E
Sbjct: 53 KALEAEESEENDEEEEEEEEEEDEGEIE 80
Score = 31.4 bits (71), Expect = 0.12
Identities = 17/27 (62%), Positives = 19/27 (70%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EE EE +EEEEEEEEEE+E E E
Sbjct: 58 EESEENDEEEEEEEEEEDEGEIEYVSD 84
Score = 30.2 bits (68), Expect = 0.33
Identities = 17/27 (62%), Positives = 21/27 (77%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EE+EEEEEE+E E E ++EE EEEI
Sbjct: 66 EEEEEEEEEDEGEIEYVSDDEELEEEI 92
Score = 30.2 bits (68), Expect = 0.36
Identities = 17/26 (65%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E +E EE +EEEEEEEEEE+E E E
Sbjct: 56 EAEESEENDEEEEEEEEEEDEGEIEY 81
Score = 29.8 bits (67), Expect = 0.50
Identities = 16/28 (57%), Positives = 19/28 (67%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E EEEEEEEEEE+E E E ++EE
Sbjct: 60 SEENDEEEEEEEEEEDEGEIEYVSDDEE 87
Score = 29.5 bits (66), Expect = 0.61
Identities = 17/25 (68%), Positives = 20/25 (80%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
+K E EE EE +EEEEEEEEEE+E
Sbjct: 52 KKALEAEESEENDEEEEEEEEEEDE 76
Score = 29.1 bits (65), Expect = 0.87
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
EE +EEEEEEEEEE+E E E ++E
Sbjct: 61 EENDEEEEEEEEEEDEGEIEYVSDDEELE 89
Score = 28.3 bits (63), Expect = 1.8
Identities = 16/26 (61%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+EEEEEEE+E E E ++EE EE
Sbjct: 65 EEEEEEEEEEDEGEIEYVSDDEELEE 90
Score = 28.3 bits (63), Expect = 1.9
Identities = 16/24 (66%), Positives = 18/24 (75%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEIT 170
E EE EE +EEEEEEEEEE+E
Sbjct: 56 EAEESEENDEEEEEEEEEEDEGEI 79
Score = 27.5 bits (61), Expect = 3.4
Identities = 16/29 (55%), Positives = 19/29 (65%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEITATL 173
K+ E EE EE +EEEEEEEEEE+
Sbjct: 52 KKALEAEESEENDEEEEEEEEEEDEGEIE 80
Score = 27.1 bits (60), Expect = 3.8
Identities = 15/27 (55%), Positives = 19/27 (70%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+EEEEE+E E E ++EE EEE
Sbjct: 66 EEEEEEEEEDEGEIEYVSDDEELEEEI 92
Score = 27.1 bits (60), Expect = 4.9
Identities = 14/27 (51%), Positives = 20/27 (74%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EE+EEE+E E E ++EE EEE E++
Sbjct: 69 EEEEEEDEGEIEYVSDDEELEEEIEDL 95
Score = 26.4 bits (58), Expect = 8.7
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+EEEE+E E E ++EE EEE E
Sbjct: 68 EEEEEEEDEGEIEYVSDDEELEEEIE 93
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 33.6 bits (77), Expect = 0.12
Identities = 13/33 (39%), Positives = 17/33 (51%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
++EEEE + E EE E E+ E E A L
Sbjct: 390 SQREEEEALQREAEERLEAEQAERAEEDARLRE 422
Score = 32.4 bits (74), Expect = 0.30
Identities = 13/33 (39%), Positives = 17/33 (51%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
EE+E + E EE E E+ E EE+ L P
Sbjct: 393 EEEEALQREAEERLEAEQAERAEEDARLRELYP 425
Score = 32.4 bits (74), Expect = 0.31
Identities = 14/45 (31%), Positives = 20/45 (44%)
Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E L + RL + E EE+ E E+E E+E+E A
Sbjct: 396 EALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEA 440
Score = 30.5 bits (69), Expect = 1.1
Identities = 12/30 (40%), Positives = 17/30 (56%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
+ E E+E E+E+E EE + EEEE
Sbjct: 419 RLRELYPLPEDEFEDEDELEEAQPEEEEEA 448
Score = 29.8 bits (67), Expect = 1.8
Identities = 7/33 (21%), Positives = 11/33 (33%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E+ E E+ E EE+ E +
Sbjct: 402 AEERLEAEQAERAEEDARLRELYPLPEDEFEDE 434
Score = 29.8 bits (67), Expect = 1.9
Identities = 11/26 (42%), Positives = 14/26 (53%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E + EEEE + E EE E E+ E
Sbjct: 387 ELISQREEEEALQREAEERLEAEQAE 412
Score = 29.4 bits (66), Expect = 2.3
Identities = 10/27 (37%), Positives = 13/27 (48%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EE E E+E E+E+E EE
Sbjct: 415 EEDARLRELYPLPEDEFEDEDELEEAQ 441
Score = 29.4 bits (66), Expect = 2.4
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ E+E E+E+E EE + EEEEE
Sbjct: 420 LRELYPLPEDEFEDEDELEEAQPEEEEEA 448
Score = 29.4 bits (66), Expect = 2.5
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+E E+E+E EE + EEEEE
Sbjct: 424 YPLPEDEFEDEDELEEAQPEEEEEAR 449
Score = 29.0 bits (65), Expect = 3.4
Identities = 9/29 (31%), Positives = 14/29 (48%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E+ E E+E E+E+E EE +
Sbjct: 416 EDARLRELYPLPEDEFEDEDELEEAQPEE 444
Score = 29.0 bits (65), Expect = 3.6
Identities = 9/33 (27%), Positives = 13/33 (39%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E ++ E EE+ E E+E E L
Sbjct: 407 EAEQAERAEEDARLRELYPLPEDEFEDEDELEE 439
Score = 29.0 bits (65), Expect = 4.0
Identities = 8/33 (24%), Positives = 14/33 (42%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+ + EE+ E E+E E+E+ P
Sbjct: 410 QAERAEEDARLRELYPLPEDEFEDEDELEEAQP 442
Score = 28.2 bits (63), Expect = 7.0
Identities = 10/31 (32%), Positives = 15/31 (48%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
Y + E+ + E + EEEE + E EE
Sbjct: 376 YDQEAEDAKVAELISQREEEEALQREAEERL 406
Score = 27.8 bits (62), Expect = 7.2
Identities = 11/30 (36%), Positives = 14/30 (46%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
E + EEEE + E EE E E+ A
Sbjct: 386 AELISQREEEEALQREAEERLEAEQAERAE 415
>gnl|CDD|191634 pfam06886, TPX2, Targeting protein for Xklp2 (TPX2). This family
represents a conserved region approximately 60 residues
long within the eukaryotic targeting protein for Xklp2
(TPX2). Xklp2 is a kinesin-like protein localised on
centrosomes throughout the cell cycle and on spindle
pole microtubules during metaphase. In Xenopus, it has
been shown that Xklp2 protein is required for centrosome
separation and maintenance of spindle bi-polarity. TPX2
is a microtubule-associated protein that mediates the
binding of the C-terminal domain of Xklp2 to
microtubules. It is phosphorylated during mitosis in a
microtubule-dependent way.
Length = 57
Score = 30.4 bits (69), Expect = 0.12
Identities = 16/36 (44%), Positives = 22/36 (61%), Gaps = 1/36 (2%)
Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
++R K EEKE+ E E+EE E ++EEEEE
Sbjct: 8 EKRAEFDK-KLEEKEKALEAEKEEAEARQKEEEEEA 42
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 32.2 bits (73), Expect = 0.12
Identities = 8/33 (24%), Positives = 12/33 (36%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
KG E EE E + +E+ +E
Sbjct: 59 EKGPAAAATAAAEAAEEAEAADADEDADEAAEA 91
Score = 31.5 bits (71), Expect = 0.22
Identities = 9/31 (29%), Positives = 16/31 (51%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
EE E + +E+ +E E + +E +E T
Sbjct: 73 AEEAEAADADEDADEAAEADAADEADEEEET 103
Score = 31.5 bits (71), Expect = 0.25
Identities = 10/38 (26%), Positives = 16/38 (42%)
Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K T E EE E + +E+ +E E + +E
Sbjct: 59 EKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADE 96
Score = 29.5 bits (66), Expect = 1.1
Identities = 9/47 (19%), Positives = 17/47 (36%)
Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
+E + R + E EE E + +E+ +E E +
Sbjct: 50 LEWTEAGRAEKGPAAAATAAAEAAEEAEAADADEDADEAAEADAADE 96
Score = 29.1 bits (65), Expect = 1.4
Identities = 9/28 (32%), Positives = 16/28 (57%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E + +E+ +E E + +E +EEE T
Sbjct: 77 EAADADEDADEAAEADAADEADEEEETD 104
Score = 29.1 bits (65), Expect = 1.4
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
+E +E +E + E EE +EEE+EE
Sbjct: 103 TDEAVDETADEADAEAEEADEEEDEE 128
Score = 28.8 bits (64), Expect = 1.7
Identities = 12/26 (46%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E +E +E + E EE +EEE+EE E
Sbjct: 105 EAVDETADEADAEAEEADEEEDEEAE 130
Score = 28.4 bits (63), Expect = 2.2
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
E +E + E EE +EEE+EE E
Sbjct: 107 VDETADEADAEAEEADEEEDEEAEA 131
Score = 28.4 bits (63), Expect = 2.9
Identities = 7/38 (18%), Positives = 16/38 (42%)
Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G + E+ E + +E+ +E E + +E +
Sbjct: 61 GPAAAATAAAEAAEEAEAADADEDADEAAEADAADEAD 98
Score = 28.0 bits (62), Expect = 3.0
Identities = 11/29 (37%), Positives = 18/29 (62%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
T + ++ +E + E EE +EEE+EE E
Sbjct: 103 TDEAVDETADEADAEAEEADEEEDEEAEA 131
Score = 27.6 bits (61), Expect = 4.1
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
+++ +E E + +E +EEEE +E + T
Sbjct: 81 ADEDADEAAEADAADEADEEEETDEAVDET 110
Score = 27.6 bits (61), Expect = 4.1
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEITA 171
+E +E + E EE +EEE+EE A
Sbjct: 107 VDETADEADAEAEEADEEEDEEAEA 131
Score = 27.6 bits (61), Expect = 4.5
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ E+ +E E + +E +EEEE +E
Sbjct: 80 DADEDADEAAEADAADEADEEEETDE 105
Score = 27.2 bits (60), Expect = 5.6
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E +E +E + E EE +EEE+E
Sbjct: 102 ETDEAVDETADEADAEAEEADEEEDE 127
Score = 27.2 bits (60), Expect = 5.6
Identities = 8/33 (24%), Positives = 16/33 (48%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+ +E+ +E E + +E +EEEE +
Sbjct: 79 ADADEDADEAAEADAADEADEEEETDEAVDETA 111
Score = 27.2 bits (60), Expect = 5.7
Identities = 9/29 (31%), Positives = 15/29 (51%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
+E+ +E E + +E +EEEE A
Sbjct: 78 AADADEDADEAAEADAADEADEEEETDEA 106
Score = 26.8 bits (59), Expect = 7.7
Identities = 7/28 (25%), Positives = 13/28 (46%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITA 171
+ EE E + +E+ +E E + A
Sbjct: 70 AEAAEEAEAADADEDADEAAEADAADEA 97
Score = 26.8 bits (59), Expect = 9.4
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E+ +E E + +E +EEEE +E + TA
Sbjct: 82 DEDADEAAEADAADEADEEEETDEAVDETA 111
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 32.7 bits (75), Expect = 0.13
Identities = 16/34 (47%), Positives = 20/34 (58%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
+ EE+ EE E EE E+E EEE EEE L+
Sbjct: 7 QVEEEVEETEVEEAVEDEVEEETVEEESEAELLD 40
Score = 32.7 bits (75), Expect = 0.14
Identities = 16/28 (57%), Positives = 18/28 (64%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K E+ EEE EE E EE E+E EEE E
Sbjct: 4 KNEQVEEEVEETEVEEAVEDEVEEETVE 31
Score = 32.7 bits (75), Expect = 0.14
Identities = 15/29 (51%), Positives = 17/29 (58%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
EE E EE E+E EEE EEE E E +
Sbjct: 13 EETEVEEAVEDEVEEETVEEESEAELLDE 41
Score = 32.3 bits (74), Expect = 0.17
Identities = 14/30 (46%), Positives = 17/30 (56%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ E +E E EE E+E EEE EEE E
Sbjct: 7 QVEEEVEETEVEEAVEDEVEEETVEEESEA 36
Score = 31.9 bits (73), Expect = 0.23
Identities = 14/26 (53%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE E+E EEE EEE E E +EE+
Sbjct: 18 EEAVEDEVEEETVEEESEAELLDEEQ 43
Score = 31.9 bits (73), Expect = 0.29
Identities = 17/28 (60%), Positives = 19/28 (67%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
EEK E+ EEE EE E EE E+E EE T
Sbjct: 2 EEKNEQVEEEVEETEVEEAVEDEVEEET 29
Score = 31.5 bits (72), Expect = 0.39
Identities = 15/29 (51%), Positives = 18/29 (62%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E+ E+ EEE EE E EE E+E EEE
Sbjct: 3 EKNEQVEEEVEETEVEEAVEDEVEEETVE 31
Score = 30.0 bits (68), Expect = 1.1
Identities = 13/29 (44%), Positives = 15/29 (51%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E + EE E+E EEE EEE E E
Sbjct: 14 ETEVEEAVEDEVEEETVEEESEAELLDEE 42
Score = 30.0 bits (68), Expect = 1.3
Identities = 14/32 (43%), Positives = 20/32 (62%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
+E EEE EEE E E +EE+ + E+ A L+
Sbjct: 23 DEVEEETVEEESEAELLDEEQAKIAELEAKLD 54
Score = 29.6 bits (67), Expect = 1.6
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+ E E EEE EEE E E +EE+ +
Sbjct: 19 EAVEDEVEEETVEEESEAELLDEEQAK 45
Score = 29.2 bits (66), Expect = 2.2
Identities = 13/26 (50%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E+E EEE EEE E E +EE+
Sbjct: 19 EAVEDEVEEETVEEESEAELLDEEQA 44
Score = 28.8 bits (65), Expect = 2.9
Identities = 13/33 (39%), Positives = 17/33 (51%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E++ E+ EEE EE E EE E+E E
Sbjct: 1 MEEKNEQVEEEVEETEVEEAVEDEVEEETVEEE 33
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 30.8 bits (70), Expect = 0.14
Identities = 18/44 (40%), Positives = 25/44 (56%)
Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
L + L+ +L K E K EEE+ EEEEE + EE +E E+
Sbjct: 10 LKDMLNADLLAKLKAKKKELKAEEEKREEEEEARKREERKEREK 53
Score = 29.2 bits (66), Expect = 0.52
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
+ +EE+ EEEEE + EE +E E+
Sbjct: 30 KAEEEKREEEEEARKREERKEREKN 54
Score = 29.2 bits (66), Expect = 0.54
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EEE+ EEEEE + EE +E E+
Sbjct: 30 KAEEEKREEEEEARKREERKEREKN 54
Score = 28.1 bits (63), Expect = 1.1
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ E++EEEEE + EE +E E+ + EE
Sbjct: 32 EEEKREEEEEARKREERKEREKNKSFEE 59
Score = 27.7 bits (62), Expect = 1.8
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
K ++KE + EEE+ EEEEE + EE +E
Sbjct: 23 KAKKKELKAEEEKREEEEEARKREERKERE 52
Score = 26.1 bits (58), Expect = 5.5
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEE 165
K EE++ EEEEE + EE +E E+
Sbjct: 30 KAEEEKREEEEEARKREERKEREKN 54
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 32.4 bits (74), Expect = 0.14
Identities = 6/27 (22%), Positives = 6/27 (22%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
E E E EEE
Sbjct: 129 AAAEAAAAEAAAPAAEAAAAAAAEEEA 155
Score = 32.0 bits (73), Expect = 0.17
Identities = 6/28 (21%), Positives = 7/28 (25%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K + E E E EE
Sbjct: 126 KKKAAAEAAAAEAAAPAAEAAAAAAAEE 153
Score = 31.6 bits (72), Expect = 0.26
Identities = 6/28 (21%), Positives = 7/28 (25%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K + E E EEE
Sbjct: 128 KAAAEAAAAEAAAPAAEAAAAAAAEEEA 155
Score = 31.2 bits (71), Expect = 0.32
Identities = 7/28 (25%), Positives = 7/28 (25%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K E E E EEE
Sbjct: 127 KKAAAEAAAAEAAAPAAEAAAAAAAEEE 154
Score = 30.0 bits (68), Expect = 0.74
Identities = 6/30 (20%), Positives = 8/30 (26%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
T K ++ E E E E
Sbjct: 123 TPKKKKAAAEAAAAEAAAPAAEAAAAAAAE 152
Score = 28.1 bits (63), Expect = 4.2
Identities = 4/29 (13%), Positives = 7/29 (24%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
+K++ E E E
Sbjct: 124 PKKKKAAAEAAAAEAAAPAAEAAAAAAAE 152
Score = 27.0 bits (60), Expect = 9.9
Identities = 6/31 (19%), Positives = 8/31 (25%)
Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEE 162
K++ E E EEE
Sbjct: 125 KKKKAAAEAAAAEAAAPAAEAAAAAAAEEEA 155
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 33.4 bits (76), Expect = 0.14
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K E +++E+ EE EEE+ EEE ++
Sbjct: 311 KPEIEQDEDSEESEEEKNEEEGGLSKKG 338
Score = 33.0 bits (75), Expect = 0.19
Identities = 12/28 (42%), Positives = 17/28 (60%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E + E E++E+ EE EEE+ EEE
Sbjct: 304 LSPEIPAKPEIEQDEDSEESEEEKNEEE 331
Score = 31.9 bits (72), Expect = 0.49
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
K E E++E+ EE EEE+ EEE
Sbjct: 309 PAKPEIEQDEDSEESEEEKNEEEGGL 334
Score = 31.1 bits (70), Expect = 0.91
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ E E++E+ EE EEE+ EEE
Sbjct: 308 IPAKPEIEQDEDSEESEEEKNEEEGGLS 335
Score = 30.7 bits (69), Expect = 1.00
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+K E + E E++E+ EE EEE
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEE 326
Score = 30.3 bits (68), Expect = 1.5
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ E+ EE EEE+ EEE ++ ++
Sbjct: 315 EQDEDSEESEEEKNEEEGGLSKKGKK 340
Score = 29.9 bits (67), Expect = 1.7
Identities = 9/29 (31%), Positives = 17/29 (58%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ ++ E + E E++E+ EE EEE+
Sbjct: 299 EREDKLSPEIPAKPEIEQDEDSEESEEEK 327
Score = 29.2 bits (65), Expect = 3.5
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEE 166
+ +E EE EEE+ EEE ++ ++
Sbjct: 315 EQDEDSEESEEEKNEEEGGLSKKGKK 340
Score = 28.8 bits (64), Expect = 4.5
Identities = 10/40 (25%), Positives = 21/40 (52%)
Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEE 165
++L + + + E+ EE EEE+ EEE ++ ++
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340
Score = 28.4 bits (63), Expect = 5.8
Identities = 9/26 (34%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+E++ E + E E++E+ EE
Sbjct: 298 EEREDKLSPEIPAKPEIEQDEDSEES 323
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 32.8 bits (75), Expect = 0.14
Identities = 14/34 (41%), Positives = 20/34 (58%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
K E+K +EE+ E EE E+EE E+ EE +
Sbjct: 24 KEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIE 57
Score = 31.3 bits (71), Expect = 0.45
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E KEE++ +EE+ E EE E+EE E+
Sbjct: 22 ENKEEDKGKEEDLEFEEIEKEEIIED 47
Score = 31.3 bits (71), Expect = 0.46
Identities = 14/30 (46%), Positives = 18/30 (60%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
GK E+ E EE E+EE E+ EE E + E
Sbjct: 28 KGKEEDLEFEEIEKEEIIEDSEESNEVKIE 57
Score = 31.3 bits (71), Expect = 0.47
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K E EE+ +E E +EE++ +EE+ E
Sbjct: 9 KHENMEEDCCKENENKEEDKGKEEDLEF 36
Score = 30.9 bits (70), Expect = 0.60
Identities = 11/26 (42%), Positives = 18/26 (69%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ +E E +EE++ +EE+ E EE E
Sbjct: 15 EDCCKENENKEEDKGKEEDLEFEEIE 40
Score = 30.9 bits (70), Expect = 0.63
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
++ E +EE++ +EE+ E EE E+EEI
Sbjct: 18 CKENENKEEDKGKEEDLEFEEIEKEEI 44
Score = 30.5 bits (69), Expect = 0.84
Identities = 13/27 (48%), Positives = 20/27 (74%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+E E +EE++ +EE+ E EE E+EE I
Sbjct: 19 KENENKEEDKGKEEDLEFEEIEKEEII 45
Score = 30.5 bits (69), Expect = 0.85
Identities = 12/28 (42%), Positives = 17/28 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
EE E+EE E+ EE E + EE ++E
Sbjct: 37 EEIEKEEIIEDSEESNEVKIEELKDENN 64
Score = 30.5 bits (69), Expect = 0.92
Identities = 13/28 (46%), Positives = 19/28 (67%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K E +EE++ +EE+ E EE E+EE E
Sbjct: 19 KENENKEEDKGKEEDLEFEEIEKEEIIE 46
Score = 29.3 bits (66), Expect = 2.0
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ + EE E+EE E+ EE E + EE+
Sbjct: 33 DLEFEEIEKEEIIEDSEESNEVKIEEL 59
Score = 28.2 bits (63), Expect = 4.8
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ K E EE+ +E E +EE++ +EE
Sbjct: 7 DAKHENMEEDCCKENENKEEDKGKEE 32
Score = 27.8 bits (62), Expect = 5.8
Identities = 10/29 (34%), Positives = 19/29 (65%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
K + E EE+ +E E +EE++ +EE++
Sbjct: 6 KDAKHENMEEDCCKENENKEEDKGKEEDL 34
Score = 27.8 bits (62), Expect = 6.6
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
EKEE E+ EE E + EE ++E
Sbjct: 38 EIEKEEIIEDSEESNEVKIEELKDE 62
Score = 27.4 bits (61), Expect = 7.5
Identities = 8/27 (29%), Positives = 17/27 (62%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E++ ++ + E EE+ +E E +EE+
Sbjct: 2 EKECKDAKHENMEEDCCKENENKEEDK 28
Score = 27.0 bits (60), Expect = 9.6
Identities = 10/26 (38%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EKE ++ + E EE+ +E E +EE
Sbjct: 1 MEKECKDAKHENMEEDCCKENENKEE 26
>gnl|CDD|236152 PRK08115, PRK08115, ribonucleotide-diphosphate reductase subunit
alpha; Validated.
Length = 858
Score = 33.7 bits (77), Expect = 0.14
Identities = 21/73 (28%), Positives = 32/73 (43%), Gaps = 15/73 (20%)
Query: 123 KGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL--------- 173
KG +DG R ++ T K EE + +E+ EE EEEE ++ + T+
Sbjct: 762 KGGTVYVDGSRDAQVLTLKAEENDLDEQLLLLPEELEEEETKKPVVLVDTIQDVRSTDVT 821
Query: 174 ------NPCGLCR 180
N C +CR
Sbjct: 822 IGSEVGNTCPVCR 834
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 32.7 bits (75), Expect = 0.18
Identities = 11/36 (30%), Positives = 13/36 (36%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
E E E E E EEE E + + P L
Sbjct: 128 PVCSVNMTECTGPEPEPEPEPEEEPEKKSGMGPLLL 163
Score = 31.5 bits (72), Expect = 0.37
Identities = 10/31 (32%), Positives = 20/31 (64%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
Y K +EK +++ +E + +E+EEE++E
Sbjct: 181 YKPKQQEKGAPDDDLDEYDYGDEDEEEDDEP 211
Score = 30.0 bits (68), Expect = 1.2
Identities = 9/34 (26%), Positives = 13/34 (38%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
+ GE + E + E + E E EEE
Sbjct: 21 FAYGGEPEPEVPQTEATDPSEVVVETVTEEEGGP 54
Score = 29.2 bits (66), Expect = 2.4
Identities = 8/26 (30%), Positives = 16/26 (61%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEE 166
KG ++ +E + +E+EEE++E
Sbjct: 188 KGAPDDDLDEYDYGDEDEEEDDEPPW 213
Score = 28.1 bits (63), Expect = 4.9
Identities = 8/25 (32%), Positives = 8/25 (32%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
E E E E E EEE
Sbjct: 127 CPVCSVNMTECTGPEPEPEPEPEEE 151
Score = 28.1 bits (63), Expect = 5.1
Identities = 12/36 (33%), Positives = 14/36 (38%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
G E E E + E + E E EE L P G
Sbjct: 24 GGEPEPEVPQTEATDPSEVVVETVTEEEGGPLTPEG 59
Score = 28.1 bits (63), Expect = 5.5
Identities = 8/26 (30%), Positives = 8/26 (30%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E E E E EEE
Sbjct: 127 CPVCSVNMTECTGPEPEPEPEPEEEP 152
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 32.4 bits (74), Expect = 0.18
Identities = 9/52 (17%), Positives = 31/52 (59%)
Query: 123 KGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
K E + + + K ++K++++++++++++ E+++E+E E+ L
Sbjct: 75 KKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLT 126
Score = 32.0 bits (73), Expect = 0.21
Identities = 14/69 (20%), Positives = 37/69 (53%), Gaps = 10/69 (14%)
Query: 107 KVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
KV E E ++ W K + K ++K++++++++ E+++E+E E++
Sbjct: 73 KVKKEYEE--KQKWKWKKKKSKKK--------KDKDKDKKDDKKDDKSEKKDEKEAEDKL 122
Query: 167 EEITATLNP 175
E++T + +
Sbjct: 123 EDLTKSYSE 131
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 33.2 bits (75), Expect = 0.18
Identities = 24/70 (34%), Positives = 30/70 (42%), Gaps = 5/70 (7%)
Query: 104 DHTKVTN-EVERTGRKPWPQKGL----IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEE 158
DH + E E RK K L IERL+ + R RL + E E E E E E E
Sbjct: 433 DHAERARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERE 492
Query: 159 EEEEEEEEEE 168
E + E +
Sbjct: 493 RLERDRLERD 502
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 32.4 bits (74), Expect = 0.18
Identities = 12/29 (41%), Positives = 13/29 (44%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
EK+ E EE EEEE EE A
Sbjct: 149 EKKAAEAAAVAAEEAAAAEEEEAEEAPAE 177
Score = 29.4 bits (66), Expect = 1.8
Identities = 13/32 (40%), Positives = 13/32 (40%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
K E EE EEEE EE EE A
Sbjct: 151 KAAEAAAVAAEEAAAAEEEEAEEAPAEEAPAE 182
Score = 28.6 bits (64), Expect = 3.5
Identities = 11/25 (44%), Positives = 12/25 (48%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
E+K E EE EEEE EE
Sbjct: 149 EKKAAEAAAVAAEEAAAAEEEEAEE 173
Score = 27.4 bits (61), Expect = 7.1
Identities = 12/25 (48%), Positives = 12/25 (48%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
E EEEE EE EE EE E
Sbjct: 162 EAAAAEEEEAEEAPAEEAPAEESAE 186
Score = 27.4 bits (61), Expect = 8.3
Identities = 11/26 (42%), Positives = 12/26 (46%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E EEEE EE EE EE
Sbjct: 159 AAEEAAAAEEEEAEEAPAEEAPAEES 184
Score = 27.4 bits (61), Expect = 8.3
Identities = 13/26 (50%), Positives = 13/26 (50%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE EEEE EE EE EE E
Sbjct: 161 EEAAAAEEEEAEEAPAEEAPAEESAE 186
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 33.0 bits (75), Expect = 0.19
Identities = 15/80 (18%), Positives = 37/80 (46%), Gaps = 3/80 (3%)
Query: 96 KKYKWVRFDHTKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEE 155
+ YK+ R +++ + GR ++ L G + Y + ++ E + +E
Sbjct: 230 RSYKFGRQTRGFRSDDDQGYGRY---KQNLSSNKYGDFKGVNYGYESDDDEGSSSNDYDE 286
Query: 156 EEEEEEEEEEEEEITATLNP 175
EE+ ++++ E+ + T T +
Sbjct: 287 EEDGDDDDNEDNDDTNTNHG 306
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 33.1 bits (76), Expect = 0.19
Identities = 11/39 (28%), Positives = 23/39 (58%), Gaps = 4/39 (10%)
Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
RRPR K + E++++ E ++ E+E++E++E
Sbjct: 1352 LLRRPR----KKKSDSSSEDDDDSEVDDSEDEDDEDDED 1386
Score = 32.3 bits (74), Expect = 0.42
Identities = 8/38 (21%), Positives = 23/38 (60%)
Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ R K + E++++ E ++ E+E++E++E++
Sbjct: 1350 SRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDD 1387
Score = 30.8 bits (70), Expect = 1.2
Identities = 7/48 (14%), Positives = 24/48 (50%)
Query: 121 PQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ + R E++++ E ++ E+E++E++E+++
Sbjct: 1341 VKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 32.7 bits (74), Expect = 0.20
Identities = 17/29 (58%), Positives = 19/29 (65%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
G E E EEEE+EE EE EE EE E +T
Sbjct: 105 GAENETVEEEEKEESREEREEVEETEGVT 133
Score = 32.3 bits (73), Expect = 0.29
Identities = 16/40 (40%), Positives = 18/40 (45%)
Query: 133 RRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
P + E E EEEE+EE EE EE EE T
Sbjct: 94 SEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVT 133
Score = 31.5 bits (71), Expect = 0.59
Identities = 15/37 (40%), Positives = 18/37 (48%)
Query: 128 RLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
+R + E EEEE+EE EE EE EE E
Sbjct: 94 SEPSRRMQEDSGAENETVEEEEKEESREEREEVEETE 130
Score = 31.2 bits (70), Expect = 0.79
Identities = 13/32 (40%), Positives = 20/32 (62%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
K E+K + + EE ++EE+E E EEEE+
Sbjct: 132 VTKSEQKNDWRDAEECQKEEKEPEPEEEEKPK 163
Score = 30.8 bits (69), Expect = 0.89
Identities = 14/30 (46%), Positives = 18/30 (60%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
T + EEKEE EE EE EE E + E++
Sbjct: 110 TVEEEEKEESREEREEVEETEGVTKSEQKN 139
Score = 30.8 bits (69), Expect = 0.98
Identities = 17/35 (48%), Positives = 20/35 (57%)
Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEE 165
RR + +G E EEEE+EE EE EE EE E
Sbjct: 96 PSRRMQEDSGAENETVEEEEKEESREEREEVEETE 130
Score = 29.6 bits (66), Expect = 2.4
Identities = 13/36 (36%), Positives = 18/36 (50%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPC 176
EE+E+EE EE EE EE E + E+ +
Sbjct: 110 TVEEEEKEESREEREEVEETEGVTKSEQKNDWRDAE 145
Score = 27.7 bits (61), Expect = 9.8
Identities = 10/26 (38%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+EE EE E + E++ + + EE
Sbjct: 121 EEREEVEETEGVTKSEQKNDWRDAEE 146
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 32.3 bits (74), Expect = 0.22
Identities = 13/30 (43%), Positives = 20/30 (66%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
K EEKE+ + E+EE+ E EE+ E+E+
Sbjct: 65 KIEEKEKAKREKEEKGLRELEEDTPEDELA 94
Score = 29.6 bits (67), Expect = 1.6
Identities = 10/26 (38%), Positives = 19/26 (73%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEITAT 172
E+E+++ ++ +EEE+EE+EEE
Sbjct: 27 EDEDDDVKDSWDEEEDEEKEEEKAKV 52
Score = 29.6 bits (67), Expect = 1.8
Identities = 12/30 (40%), Positives = 18/30 (60%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
K K + EE+E+ + E+EE+ E EE T
Sbjct: 59 KKALKAKIEEKEKAKREKEEKGLRELEEDT 88
Score = 29.2 bits (66), Expect = 2.1
Identities = 10/32 (31%), Positives = 21/32 (65%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+ E+E+++ ++ +EEE+EE+EEE+
Sbjct: 25 DDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKA 56
Score = 28.9 bits (65), Expect = 3.2
Identities = 9/27 (33%), Positives = 21/27 (77%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+++++ ++ +EEE+EE+EEE+ +
Sbjct: 26 DEDEDDDVKDSWDEEEDEEKEEEKAKV 52
Score = 28.5 bits (64), Expect = 4.5
Identities = 11/32 (34%), Positives = 22/32 (68%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E E+++ ++ +EEE+EE+EEE+ ++ A
Sbjct: 27 EDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKA 58
Score = 28.1 bits (63), Expect = 6.0
Identities = 11/40 (27%), Positives = 21/40 (52%)
Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
K + + E++E+ + E+EE+ E EE+ E+ A
Sbjct: 55 KAKAKKALKAKIEEKEKAKREKEEKGLRELEEDTPEDELA 94
Score = 27.7 bits (62), Expect = 7.3
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K + K E+EE+ E EE+ E+E E+
Sbjct: 69 KEKAKREKEEKGLRELEEDTPEDELAEK 96
>gnl|CDD|220112 pfam09110, HAND, HAND. The HAND domain adopts a secondary
structure consisting of four alpha helices, three of
which (H2, H3, H4) form an L-like configuration. Helix
H2 runs antiparallel to helices H3 and H4, packing
closely against helix H4, whilst helix H1 reposes in the
concave surface formed by these three helices and runs
perpendicular to them. The domain confers DNA and
nucleosome binding properties to the protein.
Length = 109
Score = 31.1 bits (71), Expect = 0.23
Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 8/42 (19%)
Query: 134 RPRLYTGKGEEKEEEEEEEEEEEEEEEE--------EEEEEE 167
+ L ++E+EEE E + + E+E+ EEEEEE
Sbjct: 60 KVPLDDVGDGDEEDEEEREAKRKLEQEKIDNAEPLTEEEEEE 101
Score = 27.2 bits (61), Expect = 4.5
Identities = 13/35 (37%), Positives = 20/35 (57%), Gaps = 8/35 (22%)
Query: 142 GEEKEEEEEEEEEE--------EEEEEEEEEEEEE 168
EE+E + + E+E+ EEEEEE++ EE
Sbjct: 74 EEEREAKRKLEQEKIDNAEPLTEEEEEEKQRLLEE 108
Score = 26.5 bits (59), Expect = 8.8
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
+ + +EE+EEE E + + E+E+I
Sbjct: 64 DDVGDGDEEDEEEREAKRKLEQEKI 88
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 32.7 bits (75), Expect = 0.24
Identities = 19/54 (35%), Positives = 31/54 (57%), Gaps = 3/54 (5%)
Query: 122 QKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+K L E+L R+ K E K EE+E+E+ + EEE+E+++E+ NP
Sbjct: 301 RKALEEQL---RQGEELRRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENP 351
Score = 30.4 bits (69), Expect = 1.4
Identities = 15/39 (38%), Positives = 22/39 (56%), Gaps = 5/39 (12%)
Query: 141 KGEEKEEEEEEEEEEE-----EEEEEEEEEEEEITATLN 174
+ E +++EE + E EE E EEE +EEE E + N
Sbjct: 381 RAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKN 419
Score = 28.5 bits (64), Expect = 6.3
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
K + KEE +E EE + + E EE E++
Sbjct: 237 KEKLKEELKEFEELVKADPEAALEELEKL 265
>gnl|CDD|220172 pfam09309, FCP1_C, FCP1, C-terminal. The C-terminal domain of
FCP-1 is required for interaction with the carboxy
terminal domain of RAP74. Interaction relies extensively
on van der Waals contacts between hydrophobic residues
situated within alpha-helices in both domains.
Length = 263
Score = 32.3 bits (73), Expect = 0.24
Identities = 20/65 (30%), Positives = 28/65 (43%), Gaps = 7/65 (10%)
Query: 113 ERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEE----EEEE 168
E P + G R PR + K E++EE+ E E E E+EE E +E
Sbjct: 194 EPDEGTPSSSE---RSNIGGRGPRGHKRKLNEEDEEDAESESSFESSNEDEEGSSSEADE 250
Query: 169 ITATL 173
+ A L
Sbjct: 251 MAAAL 255
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 32.5 bits (74), Expect = 0.25
Identities = 9/31 (29%), Positives = 20/31 (64%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ E++ + EE E++ ++EEEE+ + E+
Sbjct: 174 ESESEDESKSEESAEDDSDDEEEEDSDSEDY 204
Score = 30.2 bits (68), Expect = 1.4
Identities = 12/39 (30%), Positives = 21/39 (53%)
Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ + EE E E+E + EE E++ ++EEEE+
Sbjct: 160 KKEAKESSDKDDEEESESEDESKSEESAEDDSDDEEEED 198
Score = 29.4 bits (66), Expect = 2.3
Identities = 12/33 (36%), Positives = 20/33 (60%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
EE+ E E+E + EE E++ ++EEEE +
Sbjct: 167 SDKDDEEESESEDESKSEESAEDDSDDEEEEDS 199
Score = 29.4 bits (66), Expect = 2.6
Identities = 14/54 (25%), Positives = 28/54 (51%), Gaps = 4/54 (7%)
Query: 116 GRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
G + +KG ++ K + + +E ++++EEE E E+E + EE E+
Sbjct: 141 GIETKAKKGKAKKKTKKSK----KKEAKESSDKDDEEESESEDESKSEESAEDD 190
Score = 28.6 bits (64), Expect = 4.3
Identities = 13/33 (39%), Positives = 23/33 (69%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
T K ++KE +E ++++EEE E E+E + E +A
Sbjct: 155 TKKSKKKEAKESSDKDDEEESESEDESKSEESA 187
>gnl|CDD|221203 pfam11748, DUF3306, Protein of unknown function (DUF3306). This
family of proteobacterial species proteins has no known
function.
Length = 115
Score = 30.7 bits (70), Expect = 0.33
Identities = 13/46 (28%), Positives = 19/46 (41%)
Query: 128 RLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
+L + + E+E EEEEE E E+EE+ L
Sbjct: 10 KLAVRAEEPAEPAETAEEEAAAAAPAPAPEEEEEAELEDEELLEEL 55
Score = 28.8 bits (65), Expect = 1.6
Identities = 13/36 (36%), Positives = 13/36 (36%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
E EEE EEEEE E E L L
Sbjct: 22 AETAEEEAAAAAPAPAPEEEEEAELEDEELLEELDL 57
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 31.1 bits (71), Expect = 0.36
Identities = 14/30 (46%), Positives = 24/30 (80%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+G+ + EE+ EE+E E+EE+E +EE+E+E
Sbjct: 8 SGRIIDIEEKREEKEREKEEKERKEEKEKE 37
Score = 28.0 bits (63), Expect = 4.1
Identities = 12/23 (52%), Positives = 19/23 (82%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEI 169
EE+ EE+E E+EE+E +EE+E+
Sbjct: 15 EEKREEKEREKEEKERKEEKEKE 37
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 32.2 bits (73), Expect = 0.36
Identities = 6/31 (19%), Positives = 10/31 (32%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
++ E E+E E E E +
Sbjct: 222 APSPNTQQAVEHEDEPTEPEREGPPFPGHRS 252
Score = 31.5 bits (71), Expect = 0.52
Identities = 7/35 (20%), Positives = 11/35 (31%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
T + ++ E E+E E E E
Sbjct: 218 TPQQAPSPNTQQAVEHEDEPTEPEREGPPFPGHRS 252
Score = 31.5 bits (71), Expect = 0.63
Identities = 11/60 (18%), Positives = 14/60 (23%), Gaps = 3/60 (5%)
Query: 109 TNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
T E R G K +G G + + EE E E
Sbjct: 75 TAEESRHGEKEERGQGGPS---GSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESP 131
Score = 31.1 bits (70), Expect = 0.66
Identities = 6/36 (16%), Positives = 12/36 (33%), Gaps = 1/36 (2%)
Query: 142 GEEKEEEEEEEE-EEEEEEEEEEEEEEEITATLNPC 176
++ ++ E E+E E E E +
Sbjct: 219 PQQAPSPNTQQAVEHEDEPTEPEREGPPFPGHRSHS 254
Score = 30.7 bits (69), Expect = 1.0
Identities = 4/29 (13%), Positives = 8/29 (27%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G E E ++ + E+
Sbjct: 145 GPHEPAPPESHNPSPNQQPSSFLQPSHED 173
Score = 30.3 bits (68), Expect = 1.4
Identities = 6/29 (20%), Positives = 11/29 (37%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ ++ E E+E E E E
Sbjct: 216 SPTPQQAPSPNTQQAVEHEDEPTEPEREG 244
Score = 30.3 bits (68), Expect = 1.4
Identities = 4/29 (13%), Positives = 4/29 (13%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G E E E
Sbjct: 125 GSSPESPASHSPPPSPPSHPGPHEPAPPE 153
Score = 29.9 bits (67), Expect = 1.8
Identities = 6/29 (20%), Positives = 10/29 (34%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ + E+ EE E E E +
Sbjct: 160 NQQPSSFLQPSHEDSPEEPEPPTSEPEPD 188
Score = 29.2 bits (65), Expect = 2.8
Identities = 5/28 (17%), Positives = 5/28 (17%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
G EE E E
Sbjct: 109 GSAEELASGLSPENTSGSSPESPASHSP 136
Score = 28.8 bits (64), Expect = 3.7
Identities = 5/27 (18%), Positives = 11/27 (40%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
GE + ++ ++ E E+E
Sbjct: 212 GEPQSPTPQQAPSPNTQQAVEHEDEPT 238
Score = 28.8 bits (64), Expect = 3.8
Identities = 5/28 (17%), Positives = 9/28 (32%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
++ + E+ EE E E
Sbjct: 157 PSPNQQPSSFLQPSHEDSPEEPEPPTSE 184
Score = 28.8 bits (64), Expect = 4.4
Identities = 4/29 (13%), Positives = 4/29 (13%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE E E
Sbjct: 106 SPSGSAEELASGLSPENTSGSSPESPASH 134
Score = 28.8 bits (64), Expect = 4.4
Identities = 5/27 (18%), Positives = 10/27 (37%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ ++ E E+E E E
Sbjct: 215 QSPTPQQAPSPNTQQAVEHEDEPTEPE 241
Score = 28.4 bits (63), Expect = 5.2
Identities = 6/29 (20%), Positives = 9/29 (31%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ EE E E E + +
Sbjct: 166 FLQPSHEDSPEEPEPPTSEPEPDSPGPPQ 194
Score = 28.4 bits (63), Expect = 5.8
Identities = 8/31 (25%), Positives = 11/31 (35%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
E+ EE E E E + + E T
Sbjct: 169 PSHEDSPEEPEPPTSEPEPDSPGPPQSETPT 199
Score = 28.0 bits (62), Expect = 7.2
Identities = 6/29 (20%), Positives = 9/29 (31%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ E+ EE E E E +
Sbjct: 163 PSSFLQPSHEDSPEEPEPPTSEPEPDSPG 191
Score = 27.6 bits (61), Expect = 8.4
Identities = 7/26 (26%), Positives = 12/26 (46%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ + E + E EE E+EE +
Sbjct: 64 QGHRQTESDTETAEESRHGEKEERGQ 89
Score = 27.6 bits (61), Expect = 8.8
Identities = 2/27 (7%), Positives = 7/27 (25%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ ++ + E+ E
Sbjct: 150 APPESHNPSPNQQPSSFLQPSHEDSPE 176
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 30.7 bits (70), Expect = 0.37
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EE++ +EEE + + EE +E E E
Sbjct: 98 ARDEEKKLDEEEAKRQHEEAKEREREK 124
Score = 29.6 bits (67), Expect = 0.94
Identities = 11/26 (42%), Positives = 19/26 (73%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EEK+ +EEE + + EE +E E E+++
Sbjct: 101 EEKKLDEEEAKRQHEEAKEREREKKK 126
Score = 29.2 bits (66), Expect = 1.3
Identities = 8/26 (30%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ +EE++ +EEE + + EE +E
Sbjct: 95 AQRARDEEKKLDEEEAKRQHEEAKER 120
Score = 28.4 bits (64), Expect = 2.0
Identities = 9/31 (29%), Positives = 18/31 (58%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ E + +EE++ +EEE + + EE +E
Sbjct: 90 DERAEAQRARDEEKKLDEEEAKRQHEEAKER 120
Score = 26.5 bits (59), Expect = 9.4
Identities = 9/26 (34%), Positives = 19/26 (73%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E+++ +EEE + + EE +E E E++
Sbjct: 100 DEEKKLDEEEAKRQHEEAKEREREKK 125
>gnl|CDD|223130 COG0052, RpsB, Ribosomal protein S2 [Translation, ribosomal
structure and biogenesis].
Length = 252
Score = 31.5 bits (72), Expect = 0.38
Identities = 17/48 (35%), Positives = 21/48 (43%), Gaps = 9/48 (18%)
Query: 130 DGKRRPRLYT---------GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
D R L G+G +EEE EE+EE EE E +EE
Sbjct: 204 DAIRSIALIYWLLARAILEGRGGALDEEEAAIEEDEEVEEFEAKEEAA 251
>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
Length = 586
Score = 32.1 bits (73), Expect = 0.39
Identities = 14/26 (53%), Positives = 21/26 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+KEE+EE+ EE +E+EEE+ E+E
Sbjct: 53 EQKEEKEEDAGEEGDEDEEEQGEDEH 78
Score = 30.2 bits (68), Expect = 1.5
Identities = 12/29 (41%), Positives = 20/29 (68%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
+L K E++E+ EE +E+EEE+ E+E
Sbjct: 50 QLLEQKEEKEEDAGEEGDEDEEEQGEDEH 78
>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal. This is the N terminal of
the DNA repair protein Rad54.
Length = 191
Score = 31.2 bits (71), Expect = 0.40
Identities = 12/36 (33%), Positives = 18/36 (50%), Gaps = 2/36 (5%)
Query: 137 LY--TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
LY T + K EEE+ E+++E EE E +
Sbjct: 136 LYDPTVDDKPKIEEEKAEKDQEPEESETKLSNGPKV 171
Score = 30.0 bits (68), Expect = 1.2
Identities = 9/30 (30%), Positives = 17/30 (56%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
++K + EEE+ E+++E EE E +
Sbjct: 142 DDKPKIEEEKAEKDQEPEESETKLSNGPKV 171
Score = 27.3 bits (61), Expect = 8.9
Identities = 9/28 (32%), Positives = 17/28 (60%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
EEE+ E+++E EE E + ++ +L
Sbjct: 148 EEEKAEKDQEPEESETKLSNGPKVHKSL 175
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 31.8 bits (72), Expect = 0.45
Identities = 12/29 (41%), Positives = 24/29 (82%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
GK EEKE+E+ +EE+++++E+ +EE ++
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKDR 131
Score = 29.9 bits (67), Expect = 2.1
Identities = 12/36 (33%), Positives = 24/36 (66%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
K E +EEE+E+E+ +EE+++++E+ +E P
Sbjct: 98 PKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKP 133
Score = 29.1 bits (65), Expect = 3.6
Identities = 11/40 (27%), Positives = 25/40 (62%)
Query: 129 LDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K G+E+E+E+E+ +EE+++++E+ +EE +
Sbjct: 90 AKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPK 129
Score = 28.3 bits (63), Expect = 6.7
Identities = 13/43 (30%), Positives = 25/43 (58%)
Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E+ K E K E +EEE+E+E+ +EE+++++E
Sbjct: 80 VEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKE 122
Score = 27.9 bits (62), Expect = 8.9
Identities = 11/36 (30%), Positives = 18/36 (50%)
Query: 133 RRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+RP K +EK+ EE + EEE++ E +
Sbjct: 140 KRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRP 175
Score = 27.5 bits (61), Expect = 9.1
Identities = 9/30 (30%), Positives = 23/30 (76%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE++E+E+ +EE+++++E+ +EE ++
Sbjct: 101 ESGKEEEKEKEQVKEEKKKKKEKPKEEPKD 130
>gnl|CDD|221668 pfam12619, MCM2_N, Mini-chromosome maintenance protein 2. This
domain family is found in eukaryotes, and is typically
between 138 and 153 amino acids in length. The family is
found in association with pfam00493. Mini-chromosome
maintenance (MCM) proteins are essential for DNA
replication. These proteins use ATPase activity to
perform this function.
Length = 145
Score = 30.8 bits (70), Expect = 0.48
Identities = 12/27 (44%), Positives = 20/27 (74%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+E +EEEE E++ ++ +EE EEEE+
Sbjct: 2 DEPDEEEELEDDIDDLDEEAEEEEDGE 28
Score = 30.4 bits (69), Expect = 0.55
Identities = 12/26 (46%), Positives = 21/26 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E++ +EEEE E++ ++ +EE EEEE+
Sbjct: 1 EDEPDEEEELEDDIDDLDEEAEEEED 26
Score = 30.0 bits (68), Expect = 0.91
Identities = 11/26 (42%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
++EEE E++ ++ +EE EEEE+ E+
Sbjct: 4 PDEEEELEDDIDDLDEEAEEEEDGED 29
Score = 28.4 bits (64), Expect = 2.8
Identities = 11/25 (44%), Positives = 19/25 (76%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
EE+E E++ ++ +EE EEEE+ E+
Sbjct: 5 DEEEELEDDIDDLDEEAEEEEDGED 29
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 31.8 bits (72), Expect = 0.51
Identities = 13/42 (30%), Positives = 21/42 (50%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILP 183
E+ E E E + E+ EE+E+E++ E + R LP
Sbjct: 668 IEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALP 709
Score = 31.0 bits (70), Expect = 0.92
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E E E E+ E E E + E+ EE+E
Sbjct: 661 DEDECEAIEDSESESESDGEDGEEDE 686
Score = 30.6 bits (69), Expect = 1.3
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E + E+ E E E + E+ EE+E+E+
Sbjct: 664 ECEAIEDSESESESDGEDGEEDEQED 689
Score = 29.8 bits (67), Expect = 2.3
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ E E E + E+ EE+E+E++ E E +
Sbjct: 667 AIEDSESESESDGEDGEEDEQEDDAEANEGV 697
Score = 29.5 bits (66), Expect = 2.7
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
E E+ E E E + E+ EE+E+E++
Sbjct: 666 EAIEDSESESESDGEDGEEDEQEDD 690
Score = 29.5 bits (66), Expect = 3.0
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E++ E E+ E E E + E+ EE+E+
Sbjct: 662 EDECEAIEDSESESESDGEDGEEDEQ 687
Score = 28.7 bits (64), Expect = 5.2
Identities = 10/29 (34%), Positives = 17/29 (58%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
EE +++E+E E E+ E E E + E+
Sbjct: 656 EETDDDEDECEAIEDSESESESDGEDGEE 684
Score = 28.7 bits (64), Expect = 5.4
Identities = 10/29 (34%), Positives = 17/29 (58%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ ++ E+E E E+ E E E + E+ EE
Sbjct: 656 EETDDDEDECEAIEDSESESESDGEDGEE 684
Score = 28.3 bits (63), Expect = 6.9
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
GEE++E++ EE +++E+E E E+ E
Sbjct: 647 GEEEDEDDLEETDDDEDECEAIEDSES 673
Score = 27.9 bits (62), Expect = 8.5
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
++ E EEE+E++ EE +++E+E E I
Sbjct: 642 QQLFEGEEEDEDDLEETDDDEDECEAI 668
Score = 27.9 bits (62), Expect = 9.2
Identities = 9/28 (32%), Positives = 15/28 (53%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E ++E+E E E+ E E E + E+
Sbjct: 655 LEETDDDEDECEAIEDSESESESDGEDG 682
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 31.5 bits (72), Expect = 0.51
Identities = 8/33 (24%), Positives = 15/33 (45%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+ EEE E++++ E+EEE +
Sbjct: 263 QPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDR 295
Score = 30.8 bits (70), Expect = 0.92
Identities = 7/26 (26%), Positives = 13/26 (50%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EEE E++++ E+EE
Sbjct: 262 PQPPEEEPPPPPPPPEDDDDPPEDEE 287
Score = 29.2 bits (66), Expect = 2.7
Identities = 6/26 (23%), Positives = 12/26 (46%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EEE E++++ E+E
Sbjct: 261 PPQPPEEEPPPPPPPPEDDDDPPEDE 286
Score = 28.8 bits (65), Expect = 3.9
Identities = 5/26 (19%), Positives = 11/26 (42%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EEE E++++ E+
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPED 285
Score = 28.5 bits (64), Expect = 5.4
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E++++ E+EEE+++ E+ EEI
Sbjct: 274 PPPEDDDDPPEDEEEQDDAEDRALEEI 300
>gnl|CDD|151173 pfam10669, Phage_Gp23, Protein gp23 (Bacteriophage A118). This is
the highly conserved family of the major tail subunit
protein.
Length = 121
Score = 30.1 bits (67), Expect = 0.55
Identities = 12/32 (37%), Positives = 22/32 (68%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
E KEE ++ E E E+ ++E +EE ++ +T+N
Sbjct: 47 EMKEERDKMETEREKRDKESKEERDKFISTMN 78
>gnl|CDD|217530 pfam03390, 2HCT, 2-hydroxycarboxylate transporter family. The
2-hydroxycarboxylate transporter family is a family of
secondary transporters found exclusively in the
bacterial kingdom. They function in the metabolism of
the di- and tricarboxylates malate and citrate, mostly
in fermentative pathways involving decarboxylation of
malate or oxaloacetate.
Length = 414
Score = 31.3 bits (72), Expect = 0.55
Identities = 16/36 (44%), Positives = 24/36 (66%), Gaps = 2/36 (5%)
Query: 124 GLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEE 159
GL+ +L GK++P L TG G+ +E + EEEE+E
Sbjct: 207 GLLNKL-GKKKPSL-TGNGQLVRKESDLLVEEEEKE 240
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 31.4 bits (71), Expect = 0.55
Identities = 11/31 (35%), Positives = 17/31 (54%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
T + ++E E+E E +E EE E E E+
Sbjct: 344 TEESTDEESEDEVEIDESVIEEVAEMELLEV 374
Score = 30.7 bits (69), Expect = 1.1
Identities = 11/30 (36%), Positives = 15/30 (50%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EE E+E E +E EE E E E ++
Sbjct: 347 STDEESEDEVEIDESVIEEVAEMELLEVQV 376
Score = 30.3 bits (68), Expect = 1.5
Identities = 10/30 (33%), Positives = 17/30 (56%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E++E + EE +EE E+E E +E +
Sbjct: 333 EVVEKQEGDVVTEESTDEESEDEVEIDESV 362
Score = 29.9 bits (67), Expect = 1.8
Identities = 11/29 (37%), Positives = 15/29 (51%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G +E +EE E+E E +E EE E
Sbjct: 340 GDVVTEESTDEESEDEVEIDESVIEEVAE 368
Score = 29.1 bits (65), Expect = 3.0
Identities = 10/27 (37%), Positives = 14/27 (51%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
EE +EE E+E E +E EE+
Sbjct: 343 VTEESTDEESEDEVEIDESVIEEVAEM 369
Score = 28.0 bits (62), Expect = 7.4
Identities = 10/30 (33%), Positives = 14/30 (46%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
T + E E E +E EE E E E + +
Sbjct: 348 TDEESEDEVEIDESVIEEVAEMELLEVQVD 377
Score = 28.0 bits (62), Expect = 7.7
Identities = 9/26 (34%), Positives = 14/26 (53%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E + EE +EE E+E E +E
Sbjct: 338 QEGDVVTEESTDEESEDEVEIDESVI 363
Score = 27.6 bits (61), Expect = 8.8
Identities = 12/32 (37%), Positives = 16/32 (50%), Gaps = 2/32 (6%)
Query: 144 EKEE--EEEEEEEEEEEEEEEEEEEEEITATL 173
EK+E EE +EE E+E E +E I
Sbjct: 336 EKQEGDVVTEESTDEESEDEVEIDESVIEEVA 367
Score = 27.6 bits (61), Expect = 9.9
Identities = 9/31 (29%), Positives = 16/31 (51%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
E++ E +E EE E E E + +++ L
Sbjct: 353 EDEVEIDESVIEEVAEMELLEVQVDDLAERL 383
>gnl|CDD|129854 TIGR00771, DcuC, c4-dicarboxylate anaerobic carrier family protein.
These proteins are members of the C4-dicarboxylate
Uptake C (DcuC) Family (TC 2.A.61). The only
functionally characterized member of this family is the
anaerobic C4-dicarboxylate transporter (DcuC) of
Escherichia coli. DcuC has 12 GES predicted
transmembrane regions, is induced only under anaerobic
conditions, and is not repressed by glucose. It may
therefore function as a succinate efflux system during
anaerobic glucose fermentation. However, when
overexpressed, it can replace either DcuA or DcuB in
catalyzing fumarate-succinate exchange and fumarate
uptake [Transport and binding proteins, Carbohydrates,
organic alcohols, and acids].
Length = 388
Score = 31.4 bits (71), Expect = 0.56
Identities = 15/63 (23%), Positives = 29/63 (46%), Gaps = 3/63 (4%)
Query: 152 EEEEEEEEEEEEEEEEEITATLNPCGLCRILPVMVHIVVGLTSTLSIKLMQDDLHIVKDL 211
E++++ E+E E ++ + L ILP++ I++ L L Q LH +
Sbjct: 165 EDKKQAYIAHEKESEIKVIEGVP--VLYAILPLIPLIIL-LLFIGGTGLQQVPLHEWTKM 221
Query: 212 VLP 214
+P
Sbjct: 222 GVP 224
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 30.3 bits (69), Expect = 0.56
Identities = 14/30 (46%), Positives = 17/30 (56%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
EEK E E E+ E+E EE E E E+ A
Sbjct: 44 EEKAEYELEKLEKELEELEAELARRELKAE 73
Score = 26.8 bits (60), Expect = 7.5
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEIT 170
EE+ E E E+ E+E EE E E+
Sbjct: 43 AEEKAEYELEKLEKELEELEAELA 66
>gnl|CDD|219905 pfam08564, CDC37_C, Cdc37 C terminal domain. Cdc37 is a protein
required for the activity of numerous eukaryotic protein
kinases. This domains corresponds to the C terminal
domain whose function is unclear. It is found C terminal
to the Hsp90 chaperone (Heat shocked protein 90) binding
domain pfam08565 and the N terminal kinase binding
domain of Cdc37 pfam03234.
Length = 89
Score = 29.6 bits (67), Expect = 0.58
Identities = 16/27 (59%), Positives = 21/27 (77%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
GE++ +E EEE EEE E+EE EEE+E
Sbjct: 63 GEKEFKELEEEYNEEEAEKEEIEEEDE 89
Score = 28.5 bits (64), Expect = 1.2
Identities = 17/27 (62%), Positives = 20/27 (74%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEE 166
G+ E KE EEE EEE E+EE EEE+E
Sbjct: 63 GEKEFKELEEEYNEEEAEKEEIEEEDE 89
Score = 28.5 bits (64), Expect = 1.5
Identities = 16/29 (55%), Positives = 19/29 (65%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
EKE +E EEE EEE E+EE EEE+
Sbjct: 60 KIEGEKEFKELEEEYNEEEAEKEEIEEED 88
Score = 27.7 bits (62), Expect = 2.2
Identities = 15/30 (50%), Positives = 18/30 (60%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E E+E +E EEE EEE E+EE EE
Sbjct: 60 KIEGEKEFKELEEEYNEEEAEKEEIEEEDE 89
Score = 27.7 bits (62), Expect = 2.4
Identities = 16/29 (55%), Positives = 20/29 (68%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K E ++E +E EEE EEE E+EE EEE
Sbjct: 59 AKIEGEKEFKELEEEYNEEEAEKEEIEEE 87
Score = 27.3 bits (61), Expect = 2.9
Identities = 15/27 (55%), Positives = 18/27 (66%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
K E E+E +E EEE EEE E+EEI
Sbjct: 58 NAKIEGEKEFKELEEEYNEEEAEKEEI 84
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 31.6 bits (71), Expect = 0.58
Identities = 13/27 (48%), Positives = 18/27 (66%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+ E+K EE E E+E+E+E E E E E
Sbjct: 591 EAEQKAREEREREKEKEKERERERERE 617
Score = 30.4 bits (68), Expect = 1.5
Identities = 15/40 (37%), Positives = 23/40 (57%)
Query: 128 RLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+L KR + K E +++ EE E E+E+E+E E E E
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKERERERE 615
Score = 30.4 bits (68), Expect = 1.6
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEE 164
EE+E E+E+E+E E E E E E
Sbjct: 598 EEREREKEKEKEREREREREAE 619
Score = 29.7 bits (66), Expect = 2.7
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E E++ EE E E+E+E+E E E E A
Sbjct: 591 EAEQKAREEREREKEKEKEREREREREA 618
Score = 29.3 bits (65), Expect = 3.0
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEE 167
+EE E E+E+E+E E E E E E
Sbjct: 597 REEREREKEKEKEREREREREAE 619
Score = 29.3 bits (65), Expect = 3.0
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
++ EE E E+E+E+E E E E E E A
Sbjct: 594 QKAREEREREKEKEKEREREREREAERAA 622
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
(DUF2052). This entry is of sequences of two conserved
domains separated by a region of low complexity,
spanning some 200 residues. The function is unknown.
Length = 178
Score = 30.9 bits (70), Expect = 0.60
Identities = 15/31 (48%), Positives = 20/31 (64%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
+ E + E E++EEEE EEEEE+ EEE
Sbjct: 66 REEARLELLEQQEEEESAEEEEEDPEEENEE 96
Score = 30.1 bits (68), Expect = 0.85
Identities = 19/26 (73%), Positives = 22/26 (84%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E++EEEE EEEEE+ EEE EEEEEE
Sbjct: 75 EQQEEEESAEEEEEDPEEENEEEEEE 100
Score = 30.1 bits (68), Expect = 0.89
Identities = 17/26 (65%), Positives = 21/26 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+++EEE EEEEE+ EEE EEEEEE
Sbjct: 76 QQEEEESAEEEEEDPEEENEEEEEEY 101
Score = 29.7 bits (67), Expect = 1.4
Identities = 18/26 (69%), Positives = 21/26 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E++EEEE EEEEE+ EEE EEEEE
Sbjct: 74 LEQQEEEESAEEEEEDPEEENEEEEE 99
Score = 28.9 bits (65), Expect = 2.3
Identities = 18/34 (52%), Positives = 22/34 (64%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
+++EEEE EEEEE+ EEE EEEE P G
Sbjct: 74 LEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFG 107
Score = 28.2 bits (63), Expect = 4.4
Identities = 21/46 (45%), Positives = 25/46 (54%), Gaps = 2/46 (4%)
Query: 125 LIERLDGK--RRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
L LD + R L + EE EEEEE+ EEE EEEEEE +
Sbjct: 60 LESSLDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGP 105
Score = 27.4 bits (61), Expect = 7.3
Identities = 14/31 (45%), Positives = 17/31 (54%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
E+EEE+ EEE EEEEEE + E
Sbjct: 82 SAEEEEEDPEEENEEEEEEYQRGPFGEEEEE 112
Score = 27.0 bits (60), Expect = 9.1
Identities = 12/28 (42%), Positives = 17/28 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
+ +E E E++EEEE EEEEE+
Sbjct: 65 DREEARLELLEQQEEEESAEEEEEDPEE 92
Score = 27.0 bits (60), Expect = 9.9
Identities = 16/29 (55%), Positives = 17/29 (58%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
EE EEE EEEEEE + EEEEE
Sbjct: 87 EEDPEEENEEEEEEYQRGPFGEEEEEDGD 115
>gnl|CDD|221586 pfam12457, TIP_N, Tuftelin interacting protein N terminal. This
domain family is found in eukaryotes, and is typically
between 99 and 114 amino acids in length. The family is
found in association with pfam08697, pfam01585. There
are two completely conserved residues (G and F) that may
be functionally important. TIP is involved in enamel
assembly by interacting with one of the major proteins
responsible for biomineralisation of enamel - tuftelin.
Length = 106
Score = 29.6 bits (67), Expect = 0.60
Identities = 10/60 (16%), Positives = 30/60 (50%), Gaps = 11/60 (18%)
Query: 120 WPQKGLIERLDGKRRPRLYTG-----------KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ ++ + Y+ +E EE+++E+E E+++E +E+++E++
Sbjct: 47 EGERRRGRGRRRSKKKKDYSKPVAFVSGGIKQAAKEPEEDDKEDESEDDDESDEDDDEDD 106
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 30.9 bits (70), Expect = 0.61
Identities = 15/29 (51%), Positives = 20/29 (68%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EE EEE EEE+EE +E E+E EE+
Sbjct: 86 KEDEESEEENEEEDEESSDENEKETEEKT 114
Score = 30.9 bits (70), Expect = 0.72
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
+ EE++EE +E E+E EE+ E E+EIT
Sbjct: 94 ENEEEDEESSDENEKETEEKTESNVEKEIT 123
Score = 30.5 bits (69), Expect = 0.94
Identities = 16/32 (50%), Positives = 23/32 (71%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
KG+ ++E+EE EEE EEE+EE +E E+ T
Sbjct: 81 KGDAEKEDEESEEENEEEDEESSDENEKETEE 112
Score = 30.1 bits (68), Expect = 1.00
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E +EE EEE+EE +E E+E EE+ E
Sbjct: 88 DEESEEENEEEDEESSDENEKETEEKTE 115
Score = 30.1 bits (68), Expect = 1.3
Identities = 15/35 (42%), Positives = 19/35 (54%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
EE EEE+EE +E E+E EE+ E NP
Sbjct: 91 SEEENEEEDEESSDENEKETEEKTESNVEKEITNP 125
Score = 29.7 bits (67), Expect = 1.4
Identities = 15/30 (50%), Positives = 22/30 (73%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+KE++ + E+E+EE EEE EEE+EE
Sbjct: 72 AANSEDKEDKGDAEKEDEESEEENEEEDEE 101
Score = 29.7 bits (67), Expect = 1.6
Identities = 16/30 (53%), Positives = 20/30 (66%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
EKE+EE EEE EEE+EE +E E+E
Sbjct: 80 DKGDAEKEDEESEEENEEEDEESSDENEKE 109
Score = 28.9 bits (65), Expect = 2.8
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
E+K + E+E+EE EEE EEE+EE +
Sbjct: 79 EDKGDAEKEDEESEEENEEEDEESSDENEK 108
Score = 28.9 bits (65), Expect = 3.0
Identities = 14/30 (46%), Positives = 21/30 (70%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K ++ + E+E+EE EEE EEE+EE +E
Sbjct: 76 EDKEDKGDAEKEDEESEEENEEEDEESSDE 105
Score = 27.8 bits (62), Expect = 7.3
Identities = 13/29 (44%), Positives = 20/29 (68%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K EEKE E++E++ + E+E+EE EE
Sbjct: 65 VKEEEKEAANSEDKEDKGDAEKEDEESEE 93
>gnl|CDD|235562 PRK05687, fliH, flagellar assembly protein H; Validated.
Length = 246
Score = 31.1 bits (71), Expect = 0.62
Identities = 10/31 (32%), Positives = 10/31 (32%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
E E E E EE EEE T
Sbjct: 28 PEPPPPEPAAPPPEPPEPEEVPEEEAAPLTE 58
Score = 30.3 bits (69), Expect = 1.1
Identities = 8/27 (29%), Positives = 9/27 (33%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ E E E EE EEE
Sbjct: 27 PPEPPPPEPAAPPPEPPEPEEVPEEEA 53
Score = 29.1 bits (66), Expect = 2.4
Identities = 11/29 (37%), Positives = 12/29 (41%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E E EE EEE EEE+ A
Sbjct: 35 PAAPPPEPPEPEEVPEEEAAPLTEEELEA 63
Score = 28.7 bits (65), Expect = 2.9
Identities = 12/28 (42%), Positives = 12/28 (42%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E E EE EEE EEE E I
Sbjct: 37 APPPEPPEPEEVPEEEAAPLTEEELEAI 64
Score = 28.7 bits (65), Expect = 3.2
Identities = 8/27 (29%), Positives = 8/27 (29%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E E E EE EE
Sbjct: 25 DPPPEPPPPEPAAPPPEPPEPEEVPEE 51
>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
Provisional.
Length = 606
Score = 31.3 bits (71), Expect = 0.67
Identities = 10/33 (30%), Positives = 19/33 (57%)
Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
PR + E+ E E EE+EE+++E+++
Sbjct: 574 PRANNNEASEETETITVPENNEEDEEDDDEDDD 606
Score = 29.4 bits (66), Expect = 3.0
Identities = 10/28 (35%), Positives = 16/28 (57%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E EE E E EE+EE+++E++
Sbjct: 578 NNEASEETETITVPENNEEDEEDDDEDD 605
Score = 29.0 bits (65), Expect = 3.9
Identities = 8/27 (29%), Positives = 16/27 (59%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E E E EE+EE+++E+++
Sbjct: 580 EASEETETITVPENNEEDEEDDDEDDD 606
>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
represents the eukaryotic large ribosomal protein P2.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P2 is located in the L12 stalk, with proteins
P1, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers). Bacteria may have four or six copies
of L7/L12 (two or three homodimers) depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plants have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site in the C-terminal region
of P2. TCS inactivates the ribosome by depurinating a
specific adenine in the sarcin-ricin loop of 28S rRNA.
Length = 109
Score = 29.5 bits (67), Expect = 0.67
Identities = 7/24 (29%), Positives = 13/24 (54%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEE 163
++EE++EE EEE ++
Sbjct: 78 AAAAAAAAAKKEEKKEESEEESDD 101
Score = 29.5 bits (67), Expect = 0.81
Identities = 8/21 (38%), Positives = 13/21 (61%)
Query: 139 TGKGEEKEEEEEEEEEEEEEE 159
K+EE++EE EEE ++
Sbjct: 81 AAAAAAKKEEKKEESEEESDD 101
Score = 28.4 bits (64), Expect = 1.8
Identities = 7/23 (30%), Positives = 13/23 (56%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEE 161
++EE++EE EEE ++
Sbjct: 79 AAAAAAAAKKEEKKEESEEESDD 101
Score = 28.4 bits (64), Expect = 1.9
Identities = 7/18 (38%), Positives = 13/18 (72%)
Query: 147 EEEEEEEEEEEEEEEEEE 164
++EE++EE EEE ++
Sbjct: 84 AAAKKEEKKEESEEESDD 101
Score = 28.4 bits (64), Expect = 1.9
Identities = 7/18 (38%), Positives = 13/18 (72%)
Query: 148 EEEEEEEEEEEEEEEEEE 165
++EE++EE EEE ++
Sbjct: 84 AAAKKEEKKEESEEESDD 101
Score = 28.4 bits (64), Expect = 1.9
Identities = 7/18 (38%), Positives = 13/18 (72%)
Query: 149 EEEEEEEEEEEEEEEEEE 166
++EE++EE EEE ++
Sbjct: 84 AAAKKEEKKEESEEESDD 101
Score = 28.4 bits (64), Expect = 1.9
Identities = 7/18 (38%), Positives = 13/18 (72%)
Query: 150 EEEEEEEEEEEEEEEEEE 167
++EE++EE EEE ++
Sbjct: 84 AAAKKEEKKEESEEESDD 101
Score = 28.4 bits (64), Expect = 1.9
Identities = 7/18 (38%), Positives = 13/18 (72%)
Query: 151 EEEEEEEEEEEEEEEEEE 168
++EE++EE EEE ++
Sbjct: 84 AAAKKEEKKEESEEESDD 101
>gnl|CDD|215973 pfam00527, E7, E7 protein, Early protein.
Length = 92
Score = 29.1 bits (66), Expect = 0.67
Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 10/68 (14%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEITA--TLNPCGLCRILPVMVHIVVGLTSTLSIKLMQ-- 202
E+ ++ EEEEE+EE+ + + PCG C + +VV L++ I+ +Q
Sbjct: 24 YEQLDDSEEEEEDEEDRGQPPAQQPYRIVTPCGGCDKT---LRLVV-LSTDEDIRTLQQL 79
Query: 203 --DDLHIV 208
L +V
Sbjct: 80 LLGTLSLV 87
>gnl|CDD|184886 PRK14892, PRK14892, putative transcription elongation factor Elf1;
Provisional.
Length = 99
Score = 29.6 bits (67), Expect = 0.68
Identities = 14/28 (50%), Positives = 15/28 (53%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEE 163
GK E KE + EE EEE E E E E
Sbjct: 72 LYLEGKIEIKERKNEETEEENEIEGESE 99
Score = 28.1 bits (63), Expect = 2.2
Identities = 13/25 (52%), Positives = 15/25 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
E K E +E + EE EEE E E E E
Sbjct: 75 EGKIEIKERKNEETEEENEIEGESE 99
Score = 26.9 bits (60), Expect = 4.8
Identities = 12/26 (46%), Positives = 15/26 (57%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E + E +E + EE EEE E E E E
Sbjct: 74 LEGKIEIKERKNEETEEENEIEGESE 99
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase 19
kDa subunit. This family contains several DNA-directed
RNA polymerase 19 kDa polypeptides. The Poxvirus
DNA-directed RNA polymerase (EC: 2.7.7.6) catalyzes
DNA-template-directed extension of the 3'-end of an RNA
strand by one nucleotide at a time.
Length = 167
Score = 30.4 bits (69), Expect = 0.69
Identities = 11/25 (44%), Positives = 18/25 (72%)
Query: 148 EEEEEEEEEEEEEEEEEEEEEITAT 172
E ++++ EE EEEEE+EE+ E +
Sbjct: 11 ESDDDDSEEYEEEEEDEEDAESLES 35
Score = 30.1 bits (68), Expect = 0.93
Identities = 12/24 (50%), Positives = 18/24 (75%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEE 167
E ++++ EE EEEEE+EE+ E E
Sbjct: 11 ESDDDDSEEYEEEEEDEEDAESLE 34
Score = 30.1 bits (68), Expect = 1.0
Identities = 13/33 (39%), Positives = 20/33 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E +++ EE EEEEE+EE+ E E ++L
Sbjct: 11 ESDDDDSEEYEEEEEDEEDAESLESSDVSSLKQ 43
Score = 27.8 bits (62), Expect = 4.8
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
+ E ++++ EE EEEEE+EE+ +
Sbjct: 8 IDYESDDDDSEEYEEEEEDEEDAESLE 34
Score = 27.4 bits (61), Expect = 6.7
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
++ EE EEEEE+EE+ E E +
Sbjct: 13 DDDDSEEYEEEEEDEEDAESLESSD 37
Score = 27.4 bits (61), Expect = 7.3
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
+ + + E ++++ EE EEEEE+EE +L
Sbjct: 3 DSDDIIDYESDDDDSEEYEEEEEDEEDAESL 33
Score = 27.0 bits (60), Expect = 9.2
Identities = 10/26 (38%), Positives = 19/26 (73%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ ++ + E ++++ EE EEEEE+E
Sbjct: 2 EDSDDIIDYESDDDDSEEYEEEEEDE 27
>gnl|CDD|221250 pfam11831, Myb_Cef, pre-mRNA splicing factor component. This
family is a region of the Myb-Related Cdc5p/Cef1
proteins, in fungi, and is part of the pre-mRNA splicing
factor complex.
Length = 363
Score = 31.2 bits (71), Expect = 0.69
Identities = 16/28 (57%), Positives = 18/28 (64%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E E EEEEEE EE EEE EE+ + A
Sbjct: 146 ELELPEEEEEEPEEMEEELEEDAADRDA 173
Score = 28.1 bits (63), Expect = 6.2
Identities = 14/25 (56%), Positives = 17/25 (68%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
E EEEEEE EE EEE EE+ + +
Sbjct: 148 ELPEEEEEEPEEMEEELEEDAADRD 172
Score = 28.1 bits (63), Expect = 7.1
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+ EE+ + + + EE +E+EE
Sbjct: 160 MEEELEEDAADRDARKRAAEEAKEQEE 186
Score = 27.7 bits (62), Expect = 8.0
Identities = 15/25 (60%), Positives = 17/25 (68%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
K + E E E EEEEEE EE EEE+
Sbjct: 140 KPKNEFELELPEEEEEEPEEMEEEL 164
Score = 27.7 bits (62), Expect = 8.1
Identities = 16/26 (61%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ E E E EEEEEE EE EEE EE
Sbjct: 141 PKNEFELELPEEEEEEPEEMEEELEE 166
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 31.2 bits (71), Expect = 0.71
Identities = 12/33 (36%), Positives = 18/33 (54%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
EE +++ EEE ++E E E E E I + P
Sbjct: 243 EESDDDGEEESDDESAWEGFESEYEPINKPVRP 275
Score = 30.1 bits (68), Expect = 1.4
Identities = 10/32 (31%), Positives = 15/32 (46%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEITATLNPC 176
EE +++ EEE ++E E E E P
Sbjct: 242 LEESDDDGEEESDDESAWEGFESEYEPINKPV 273
Score = 29.7 bits (67), Expect = 2.3
Identities = 11/43 (25%), Positives = 22/43 (51%)
Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ER++ K+ ++ E E EE +++ EEE ++E
Sbjct: 216 LERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESA 258
Score = 28.9 bits (65), Expect = 3.1
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+EE E+E + E++ +E E EE+ + P
Sbjct: 199 LQEEYEKEVKAEKKRQELERVEEKKLEKMAP 229
Score = 28.9 bits (65), Expect = 3.4
Identities = 9/26 (34%), Positives = 15/26 (57%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ +++ EEE ++E E E E E
Sbjct: 242 LEESDDDGEEESDDESAWEGFESEYE 267
Score = 28.5 bits (64), Expect = 4.5
Identities = 10/33 (30%), Positives = 16/33 (48%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E EE +++ EEE ++E E E +N
Sbjct: 239 EGLLEESDDDGEEESDDESAWEGFESEYEPINK 271
Score = 28.5 bits (64), Expect = 5.0
Identities = 10/29 (34%), Positives = 19/29 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
EE E+E + E++ +E E EE++ E++
Sbjct: 201 EEYEKEVKAEKKRQELERVEEKKLEKMAP 229
Score = 28.5 bits (64), Expect = 5.1
Identities = 9/28 (32%), Positives = 16/28 (57%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+G +E +++ EEE ++E E E E
Sbjct: 239 EGLLEESDDDGEEESDDESAWEGFESEY 266
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 31.1 bits (70), Expect = 0.76
Identities = 15/37 (40%), Positives = 18/37 (48%)
Query: 106 TKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKG 142
K EV TG+KPW QKG G R ++ G G
Sbjct: 139 VKTRAEVSGTGKKPWRQKGTGRARAGSTRSPIFVGGG 175
>gnl|CDD|184539 PRK14150, PRK14150, heat shock protein GrpE; Provisional.
Length = 193
Score = 30.3 bits (69), Expect = 0.77
Identities = 12/28 (42%), Positives = 19/28 (67%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
+E+ EE E E E++EE +E E E+E+
Sbjct: 10 QEQVSEEIEAEVEQQEEADEAELEDELD 37
Score = 29.9 bits (68), Expect = 1.1
Identities = 13/35 (37%), Positives = 21/35 (60%), Gaps = 1/35 (2%)
Query: 143 EEKEEEEE-EEEEEEEEEEEEEEEEEEITATLNPC 176
E+K +E+ EE E E E++EE +E E+ L+
Sbjct: 5 EQKTPQEQVSEEIEAEVEQQEEADEAELEDELDEA 39
Score = 29.1 bits (66), Expect = 2.0
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
EE++ +E+ EE E E E++EE +E
Sbjct: 4 EEQKTPQEQVSEEIEAEVEQQEEADEAELE 33
Score = 27.2 bits (61), Expect = 8.9
Identities = 11/32 (34%), Positives = 18/32 (56%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
+E++ +E+ EE E E E++EE A L
Sbjct: 2 SNEEQKTPQEQVSEEIEAEVEQQEEADEAELE 33
Score = 27.2 bits (61), Expect = 9.7
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E+ E E E++EE +E E E+E +E
Sbjct: 14 SEEIEAEVEQQEEADEAELEDELDEAD 40
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 31.1 bits (70), Expect = 0.78
Identities = 14/33 (42%), Positives = 17/33 (51%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLNPC 176
EK E E EE E EEE E+ E+E + C
Sbjct: 371 EKMEIENRNPEESEHEEEVEDYEDENDHSKRIC 403
Score = 30.3 bits (68), Expect = 1.6
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
E+ E E EE E EEE E+ E+E
Sbjct: 371 EKMEIENRNPEESEHEEEVEDYEDE 395
Score = 30.0 bits (67), Expect = 2.1
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E + EE E EEE E+ E+E + + I
Sbjct: 376 ENRNPEESEHEEEVEDYEDENDHSKRICD 404
Score = 29.6 bits (66), Expect = 2.6
Identities = 22/76 (28%), Positives = 33/76 (43%), Gaps = 5/76 (6%)
Query: 92 IEWQKKYK---WVRFDHTKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEE 148
+++ KY W ++ K NE RK + L+ + LY K E +
Sbjct: 322 LQFLSKYIGCLWRNLNNEKWGNEEAE--RKDYILDSSSVPLEKQFDDILYFEKMEIENRN 379
Query: 149 EEEEEEEEEEEEEEEE 164
EE E EEE E+ E+E
Sbjct: 380 PEESEHEEEVEDYEDE 395
Score = 29.6 bits (66), Expect = 2.8
Identities = 11/26 (42%), Positives = 13/26 (50%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ E E EE E EEE E+ E
Sbjct: 368 LYFEKMEIENRNPEESEHEEEVEDYE 393
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 30.4 bits (69), Expect = 0.79
Identities = 19/27 (70%), Positives = 21/27 (77%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EK EEEE EEEE EEE E EEE++EI
Sbjct: 19 AEKSEEEEAEEEEPEEENELEEEQQEI 45
Score = 30.0 bits (68), Expect = 1.0
Identities = 18/29 (62%), Positives = 19/29 (65%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EE EE E+ EEEE EEEE EEE E E
Sbjct: 12 DAEETEEAEKSEEEEAEEEEPEEENELEE 40
Score = 29.2 bits (66), Expect = 1.8
Identities = 16/36 (44%), Positives = 19/36 (52%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
+ + EE EE E+ EEEE EEEE EEE
Sbjct: 7 KTEEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQ 42
Score = 28.8 bits (65), Expect = 2.5
Identities = 18/34 (52%), Positives = 20/34 (58%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
K EE + EE EE E+ EEEE EEEE EE
Sbjct: 5 EQKTEEPDAEETEEAEKSEEEEAEEEEPEEENEL 38
Score = 28.8 bits (65), Expect = 2.6
Identities = 18/31 (58%), Positives = 22/31 (70%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
E ++ EEEE EEEE EEE E EEE++ A L
Sbjct: 18 EAEKSEEEEAEEEEPEEENELEEEQQEIAEL 48
Score = 28.1 bits (63), Expect = 4.4
Identities = 17/32 (53%), Positives = 20/32 (62%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
E+K EE + EE EE E+ EEEE EEE N
Sbjct: 5 EQKTEEPDAEETEEAEKSEEEEAEEEEPEEEN 36
Score = 27.7 bits (62), Expect = 6.4
Identities = 17/28 (60%), Positives = 21/28 (75%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E+EE EEEE EEE E EEE++E E+ A
Sbjct: 23 EEEEAEEEEPEEENELEEEQQEIAELEA 50
Score = 27.7 bits (62), Expect = 6.5
Identities = 17/27 (62%), Positives = 20/27 (74%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
K EE+E EEEE EEE E EEE++E E
Sbjct: 21 KSEEEEAEEEEPEEENELEEEQQEIAE 47
Score = 27.3 bits (61), Expect = 8.4
Identities = 17/36 (47%), Positives = 20/36 (55%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
+ EE EEEE EEE E EEE++E E E L
Sbjct: 21 KSEEEEAEEEEPEEENELEEEQQEIAELEAQLEELK 56
>gnl|CDD|129865 TIGR00783, ccs, citrate carrier protein, CCS family. These
proteins are members of the Citrate:Cation Symporter
(CCS) Family (TC 2.A.24). These proteins have 12 GES
predicted transmembrane regions. Most members of the CCS
family catalyze citrate uptake with either Na+ or H+ as
the cotransported cation. However, one member is
specific for L-malate and probably functions by a proton
symport mechanism [Unclassified, Role category not yet
assigned].
Length = 347
Score = 30.8 bits (70), Expect = 0.81
Identities = 16/40 (40%), Positives = 25/40 (62%), Gaps = 2/40 (5%)
Query: 124 GLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEE 163
GL+ R+ GK+RP+L G GE E+ E+ E+ +E E +
Sbjct: 138 GLLSRI-GKKRPKL-NGHGELVRSEKREDAEKAKEITEIK 175
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 30.7 bits (69), Expect = 0.90
Identities = 13/38 (34%), Positives = 17/38 (44%)
Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
D P E+EE EE + EEEE+EE +
Sbjct: 163 DNDEAPPAQPDVDNEEEERLEESDGREEEEDEEVGSDS 200
Score = 29.6 bits (66), Expect = 2.0
Identities = 15/36 (41%), Positives = 18/36 (50%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
R+ +GEE+E EEE E E EEE EE
Sbjct: 232 RIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVPRN 267
Score = 29.2 bits (65), Expect = 2.9
Identities = 12/27 (44%), Positives = 18/27 (66%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ + + EEEE EE + EEEE+EE+
Sbjct: 170 AQPDVDNEEEERLEESDGREEEEDEEV 196
Score = 28.0 bits (62), Expect = 6.0
Identities = 16/38 (42%), Positives = 21/38 (55%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
Y G+ +K++ EEEE EEE E E EEE + P
Sbjct: 228 YEGERIDKKQGEEEEMEEEVINLFEIEWEEESPSEEVP 265
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 30.4 bits (68), Expect = 0.90
Identities = 17/44 (38%), Positives = 29/44 (65%), Gaps = 4/44 (9%)
Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
L ER++ +RR K +E+E++EE+E E EE+ EE+ ++ E
Sbjct: 138 LEERMEWERREE----KIDEREDQEEQEREREEQTIEEQSDDSE 177
Score = 29.3 bits (65), Expect = 2.1
Identities = 18/52 (34%), Positives = 29/52 (55%)
Query: 122 QKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
+K LI R + + R + EE+ E E EE+ +E E++EE+E E T+
Sbjct: 118 KKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTI 169
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 29.6 bits (67), Expect = 0.90
Identities = 12/29 (41%), Positives = 23/29 (79%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
+EK ++E E+EE +++ EE++ ++EE TA
Sbjct: 56 DEKWKKETEDEEFQQKREEKKRKDEEKTA 84
Score = 28.9 bits (65), Expect = 2.0
Identities = 14/37 (37%), Positives = 21/37 (56%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
KG +KEE+E + EE +EEEE EE++ +
Sbjct: 103 AKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEI 139
>gnl|CDD|220379 pfam09750, DRY_EERY, Alternative splicing regulator. This entry
represents the conserved N-terminal region of SWAP
(suppressor-of-white-apricot protein) proteins. This
region contains two highly conserved motifs, viz: DRY
and EERY, which appear to be the sites for alternative
splicing of exons 2 and 3 of the SWAP mRNA. These
proteins are thus thought to be involved in
auto-regulation of pre-mRNA splicing. Most family
members are associated with two Surp domains pfam01805
and an Arginine- serine-rich binding region towards the
C-terminus.
Length = 121
Score = 29.3 bits (66), Expect = 0.94
Identities = 23/66 (34%), Positives = 29/66 (43%), Gaps = 16/66 (24%)
Query: 119 PW--PQKGLIERLDGK----RRPRLYTGKGEEKEEEEEEEEEEEEEEE----------EE 162
PW Q LI+R D + PR + K +EE+E EE EE EE
Sbjct: 30 PWQGDQDNLIDRFDVRALLDYIPRGSRNRRPYKTDEEDEAEERFLNEERYRDLIKNDFEE 89
Query: 163 EEEEEE 168
E+EEE
Sbjct: 90 AEKEEE 95
>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal. The family represents
the N-terminal region of RXT2-like proteins. In S.
cerevisiae, RXT2 has been demonstrated to be involved in
conjugation with cellular fusion (mating) and invasive
growth. A high throughput localisation study has
localised RXT2 to the nucleus.
Length = 141
Score = 29.7 bits (67), Expect = 0.95
Identities = 13/50 (26%), Positives = 26/50 (52%), Gaps = 7/50 (14%)
Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE-------EEITATLN 174
P G + ++++E++E++EE + E+++E EEI A L
Sbjct: 41 FNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDDENPYKLIRLEEILAPLT 90
Score = 28.5 bits (64), Expect = 2.2
Identities = 9/35 (25%), Positives = 22/35 (62%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRI 181
+E+ + ++++E++E++EE + NP L R+
Sbjct: 48 DEDGGDIDDDDEDDEDDEEADAEDDDENPYKLIRL 82
>gnl|CDD|147051 pfam04698, MOBP_C-Myrip, Myelin-associated oligodendrocytic basic
protein (MOBP). MOBP is abundantly expressed in central
nervous system myelin, and shares several
characteristics with myelin basic protein (MBP), in
terms of regional distribution and function. This family
is the middle and C-terminal regions of MOBP which has
been shown to be essential for normal arrangement of the
radial component in central nervous system myelin. Most
member-proteins carry a FVHE-PHD type zinc-finger at
their N-terminus.
Length = 710
Score = 30.9 bits (69), Expect = 0.99
Identities = 23/90 (25%), Positives = 39/90 (43%), Gaps = 21/90 (23%)
Query: 106 TKVTNEVERTGRKPWPQKGLIERLDGKRRPRLY----------TGKGEEKEEEEEEEEEE 155
T ++NE + + L + + K R RLY T GE++E E + E E
Sbjct: 414 TDISNEAQDP-------QSLTDSTEEKLRNRLYELAMKMSEKETSSGEDQESEPKAEPEN 466
Query: 156 EEE----EEEEEEEEEEITATLNPCGLCRI 181
++E E+ + +EE+ + LC I
Sbjct: 467 QKESLSSEDNNQGVQEELKKKCSAVSLCNI 496
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 31.0 bits (70), Expect = 0.99
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+ + + ++ E+E+E E E +EE
Sbjct: 542 EERIDLDADDWTEDEDENEMETDEER 567
Score = 28.3 bits (63), Expect = 6.0
Identities = 7/22 (31%), Positives = 15/22 (68%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEE 165
+ ++ E+E+E E E +EE ++
Sbjct: 548 DADDWTEDEDENEMETDEERKK 569
>gnl|CDD|221333 pfam11942, Spt5_N, Spt5 transcription elongation factor, acidic
N-terminal. This is the very acidic N-terminal region
of the early transcription elongation factor Spt5. The
Spt5-Spt4 complex regulates early transcription
elongation by RNA polymerase II and has an imputed role
in pre-mRNA processing via its physical association with
mRNA capping enzymes. The actual function of this
N-terminal domain is not known although it is
dispensable for binding to Spt4.
Length = 92
Score = 28.9 bits (65), Expect = 1.00
Identities = 12/26 (46%), Positives = 18/26 (69%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITA 171
+ E E ++EEEEEEEEE++ E+
Sbjct: 3 DTEAEVDDEEEEEEEEEDDLEDLSDE 28
Score = 28.9 bits (65), Expect = 1.0
Identities = 13/26 (50%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E E ++EEEEEEEEE++ E+ +E+
Sbjct: 4 TEAEVDDEEEEEEEEEDDLEDLSDED 29
Score = 28.2 bits (63), Expect = 1.5
Identities = 12/28 (42%), Positives = 17/28 (60%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATL 173
E E ++EEEEEEEEE++ E+
Sbjct: 4 TEAEVDDEEEEEEEEEDDLEDLSDEDEF 31
Score = 28.2 bits (63), Expect = 1.9
Identities = 14/26 (53%), Positives = 21/26 (80%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEI 169
E ++EEEEEEEEE++ E+ +E+E I
Sbjct: 7 EVDDEEEEEEEEEDDLEDLSDEDEFI 32
Score = 27.4 bits (61), Expect = 3.1
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E +EEEEEEEEE++ E+ +E+E
Sbjct: 7 EVDDEEEEEEEEEDDLEDLSDEDEFI 32
Score = 27.4 bits (61), Expect = 3.7
Identities = 12/26 (46%), Positives = 20/26 (76%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
+E+EEEEEEE++ E+ +E+E +E
Sbjct: 9 DDEEEEEEEEEDDLEDLSDEDEFIDE 34
Score = 26.6 bits (59), Expect = 6.0
Identities = 12/31 (38%), Positives = 19/31 (61%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATLNPC 176
EEEEEEEEE++ E+ +E+E + +
Sbjct: 11 EEEEEEEEEDDLEDLSDEDEFIDEAEAEDDR 41
Score = 26.6 bits (59), Expect = 6.3
Identities = 13/29 (44%), Positives = 21/29 (72%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E + ++EEEEEEEEE++ E+ +E+E
Sbjct: 5 EAEVDDEEEEEEEEEDDLEDLSDEDEFID 33
Score = 26.3 bits (58), Expect = 8.0
Identities = 12/25 (48%), Positives = 20/25 (80%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
++EEEEEEEE++ E+ +E+E +E
Sbjct: 10 DEEEEEEEEEDDLEDLSDEDEFIDE 34
>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
(DUF2098). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 91
Score = 28.8 bits (65), Expect = 1.0
Identities = 15/32 (46%), Positives = 21/32 (65%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
T K +EK+EE EE++EE E ++EEE E
Sbjct: 48 VTDKVKEKKEEREEDKEELIERIKKEEETFED 79
Score = 27.7 bits (62), Expect = 2.3
Identities = 13/27 (48%), Positives = 20/27 (74%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E+KEE EE++EE E ++EEE E++
Sbjct: 54 EKKEEREEDKEELIERIKKEEETFEDV 80
Score = 26.9 bits (60), Expect = 4.2
Identities = 12/30 (40%), Positives = 19/30 (63%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K EE+EE++EE E ++EEE E+ +
Sbjct: 53 KEKKEEREEDKEELIERIKKEEETFEDVDL 82
>gnl|CDD|219009 pfam06404, PSK, Phytosulfokine precursor protein (PSK). This
family consists of several plant specific phytosulfokine
precursor proteins. Phytosulfokines, are active as
either a pentapeptide or a C-terminally truncated
tetrapeptide. These compounds were first isolated
because of their ability to stimulate cell division in
somatic embryo cultures of Asparagus officinalis.
Length = 74
Score = 28.5 bits (64), Expect = 1.1
Identities = 11/28 (39%), Positives = 15/28 (53%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
+ K E E + +E+E E E E EEE
Sbjct: 26 SSKIEAAESSKVDEDEVVSECEGEGEEE 53
Score = 27.7 bits (62), Expect = 2.0
Identities = 9/27 (33%), Positives = 14/27 (51%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ + E + +E+E E E E EEE
Sbjct: 27 SKIEAAESSKVDEDEVVSECEGEGEEE 53
Score = 26.9 bits (60), Expect = 3.4
Identities = 10/29 (34%), Positives = 13/29 (44%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
E E + +E+E E E E EEE
Sbjct: 25 ESSKIEAAESSKVDEDEVVSECEGEGEEE 53
>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
domain 1. The small mechanosensitive channel, MscS, is
a part of the turgor-driven solute efflux system that
protects bacteria from lysis in the event of osmotic
shock. The MscS protein alone is sufficient to form a
functional mechanosensitive channel gated directly by
tension in the lipid bilayer. The MscS proteins are
heptamers of three transmembrane subunits with seven
converging M3 domains, and this domain is one of the
inner membrane domains.
Length = 339
Score = 30.3 bits (69), Expect = 1.1
Identities = 11/27 (40%), Positives = 13/27 (48%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E + EEEEE E E EE E+
Sbjct: 268 AEILAQRAEEEEESSEGAAETIEEPEL 294
>gnl|CDD|222366 pfam13764, E3_UbLigase_R4, E3 ubiquitin-protein ligase UBR4. This
is a family of E## ubiquitin ligase enzymes.
Length = 794
Score = 30.8 bits (70), Expect = 1.1
Identities = 15/36 (41%), Positives = 20/36 (55%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRIL 182
EE EE E++ EEEE+ E E I + L CG +
Sbjct: 87 EEFIEELEDDREEEEDPEVEFAIASVLQECGGLETM 122
>gnl|CDD|237084 PRK12378, PRK12378, hypothetical protein; Provisional.
Length = 235
Score = 30.2 bits (69), Expect = 1.2
Identities = 10/30 (33%), Positives = 19/30 (63%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
+G++++E E + + + E+ EEEE IT
Sbjct: 140 EGDDEDELLEALIDADVDVEDVEEEEGTIT 169
>gnl|CDD|114603 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP).
This family consists of several Trypanosoma brucei
procyclic acidic repetitive protein (PARP) like
sequences. The procyclic acidic repetitive protein
(parp) genes of Trypanosoma brucei encode a small family
of abundant surface proteins whose expression is
restricted to the procyclic form of the parasite. They
are found at two unlinked loci, parpA and parpB;
transcription of both loci is developmentally regulated.
Length = 145
Score = 29.5 bits (65), Expect = 1.2
Identities = 18/46 (39%), Positives = 24/46 (52%)
Query: 123 KGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
KG+ + GK G + + ++E EEEEE E EEE EEE
Sbjct: 34 KGITKGGKGKGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEE 79
Score = 29.2 bits (64), Expect = 1.5
Identities = 18/27 (66%), Positives = 19/27 (70%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+EEE E EEE EEE E EEE EEE
Sbjct: 63 EPEEEEEPEPEEEGEEEPEPEEEGEEE 89
Score = 28.8 bits (63), Expect = 2.3
Identities = 19/37 (51%), Positives = 19/37 (51%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
EE EEE E EEE EEE E EE EEE P
Sbjct: 70 PEPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEP 106
Score = 28.0 bits (61), Expect = 3.7
Identities = 24/60 (40%), Positives = 26/60 (43%)
Query: 116 GRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
G K +KG D EE+E E EEE EEE E EEE EEE E T
Sbjct: 39 GGKGKGEKGTKVGADDTNGTDPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGEE 98
>gnl|CDD|217940 pfam04177, TAP42, TAP42-like family. The TOR signalling pathway
activates a cell-growth program in response to
nutrients. TIP41 (pfam04176) interacts with TAP42 and
negatively regulates the TOR signaling pathway.
Length = 335
Score = 30.3 bits (69), Expect = 1.2
Identities = 10/28 (35%), Positives = 19/28 (67%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+G + EEEE++E++EE+++EE
Sbjct: 295 QGGGEAAASAEEEEDDEDDEEDDDEETL 322
Score = 30.0 bits (68), Expect = 1.5
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G E EEEE++E++EE+++EE +
Sbjct: 297 GGEAAASAEEEEDDEDDEEDDDEETLK 323
Score = 28.8 bits (65), Expect = 3.6
Identities = 12/28 (42%), Positives = 20/28 (71%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
G GE EEEE++E++EE+++EE +
Sbjct: 296 GGGEAAASAEEEEDDEDDEEDDDEETLK 323
Score = 27.6 bits (62), Expect = 8.0
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G + E EEEE++E++EE+++EE
Sbjct: 292 GVPQGGGEAAASAEEEEDDEDDEEDDDEE 320
Score = 27.6 bits (62), Expect = 8.8
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E EEEE++E++EE+++EE + A
Sbjct: 296 GGGEAAASAEEEEDDEDDEEDDDEETLKA 324
>gnl|CDD|149439 pfam08375, Rpn3_C, Proteasome regulatory subunit C-terminal. This
eukaryotic domain is found at the C-terminus of 26S
proteasome regulatory subunits such as the non-ATPase
Rpn3 subunit which is essential for proteasomal
function. It occurs together with the PCI/PINT domain
(pfam01399).
Length = 68
Score = 28.1 bits (63), Expect = 1.3
Identities = 13/35 (37%), Positives = 19/35 (54%), Gaps = 3/35 (8%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEE---EEEEEEI 169
Y E+KEEE+ EE E ++ E E E E ++
Sbjct: 32 YPPDKEKKEEEKAEEARERDQLELELAKELSEGDL 66
>gnl|CDD|226501 COG4014, COG4014, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 97
Score = 28.7 bits (64), Expect = 1.3
Identities = 15/40 (37%), Positives = 20/40 (50%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
K E KEE EE+ +E E +E+EE E + L C
Sbjct: 55 VVDKSEIKEEREEDIDEVVERIRKEKEEFELVDLGLGVCC 94
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 30.2 bits (68), Expect = 1.4
Identities = 9/26 (34%), Positives = 19/26 (73%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+ E+E++ +E+E++++ EEEE
Sbjct: 222 EDNEDEDDPKEDEDDDQGEEEESGSS 247
Score = 29.4 bits (66), Expect = 2.0
Identities = 8/26 (30%), Positives = 20/26 (76%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEE 167
G+E E + E+ E+E++ +E+E++++
Sbjct: 213 GDEPESADSEDNEDEDDPKEDEDDDQ 238
Score = 29.4 bits (66), Expect = 2.0
Identities = 9/26 (34%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ E+ E+E++ +E+E++++ EEEE
Sbjct: 219 ADSEDNEDEDDPKEDEDDDQGEEEES 244
Score = 29.0 bits (65), Expect = 2.8
Identities = 9/26 (34%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E + E+ E+E++ +E+E++++ EEE
Sbjct: 217 ESADSEDNEDEDDPKEDEDDDQGEEE 242
Score = 29.0 bits (65), Expect = 3.3
Identities = 9/32 (28%), Positives = 24/32 (75%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
+ ++ E+E++ +E+E++++ EEEE + +L+
Sbjct: 220 DSEDNEDEDDPKEDEDDDQGEEEESGSSDSLS 251
Score = 28.6 bits (64), Expect = 4.3
Identities = 9/29 (31%), Positives = 20/29 (68%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G E + E+ E+E++ +E+E++++ EE
Sbjct: 213 GDEPESADSEDNEDEDDPKEDEDDDQGEE 241
Score = 28.2 bits (63), Expect = 5.3
Identities = 8/26 (30%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ KE+E++++ EEEE + E+
Sbjct: 229 DPKEDEDDDQGEEEESGSSDSLSEDS 254
Score = 28.2 bits (63), Expect = 5.9
Identities = 10/30 (33%), Positives = 15/30 (50%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
GEE+E + E+ + EE E E+ A
Sbjct: 239 GEEEESGSSDSLSEDSDASSEEMESGEMEA 268
Score = 27.9 bits (62), Expect = 6.7
Identities = 9/29 (31%), Positives = 13/29 (44%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G+ EE + E+ + EE E E E
Sbjct: 239 GEEEESGSSDSLSEDSDASSEEMESGEME 267
Score = 27.9 bits (62), Expect = 7.7
Identities = 8/26 (30%), Positives = 13/26 (50%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ EEEE + E+ + EE E
Sbjct: 237 DQGEEEESGSSDSLSEDSDASSEEME 262
>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
Length = 869
Score = 30.4 bits (69), Expect = 1.4
Identities = 13/49 (26%), Positives = 22/49 (44%), Gaps = 6/49 (12%)
Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
I L+ E+E EE EE + E+ + E++ ++E T L
Sbjct: 825 INELEPLPYEE------PEEETAEEPEEVDPEDVKSEDDIDDEDTGQLG 867
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 30.3 bits (69), Expect = 1.4
Identities = 9/25 (36%), Positives = 21/25 (84%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
E++++++EE +EE+EE + E++E+
Sbjct: 124 EEDDDDDEESDEEDEESSKSEDDED 148
Score = 29.6 bits (67), Expect = 2.3
Identities = 9/29 (31%), Positives = 20/29 (68%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
+ EE+++++EE +EE+EE + E++
Sbjct: 121 RDLEEDDDDDEESDEEDEESSKSEDDEDD 149
Score = 29.6 bits (67), Expect = 2.7
Identities = 8/27 (29%), Positives = 21/27 (77%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ ++EE +EE+EE + E++E++++
Sbjct: 125 EDDDDDEESDEEDEESSKSEDDEDDDD 151
Score = 29.6 bits (67), Expect = 2.7
Identities = 10/31 (32%), Positives = 24/31 (77%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
EE +EE+EE + E++E+++++++++ AT
Sbjct: 130 DEESDEEDEESSKSEDDEDDDDDDDDDDIAT 160
Score = 28.4 bits (64), Expect = 5.0
Identities = 8/26 (30%), Positives = 21/26 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ +EE +EE+EE + E++E+++++
Sbjct: 127 DDDDEESDEEDEESSKSEDDEDDDDD 152
Score = 28.4 bits (64), Expect = 5.2
Identities = 10/47 (21%), Positives = 26/47 (55%)
Query: 122 QKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ R D R +E+ +EE+EE + E++E++++++++
Sbjct: 109 ELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDD 155
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 30.4 bits (68), Expect = 1.4
Identities = 17/31 (54%), Positives = 21/31 (67%), Gaps = 5/31 (16%)
Query: 143 EEKEEEEEEEEEEEE-----EEEEEEEEEEE 168
EE E EE+EE E+E +E+ EEEEEEE
Sbjct: 144 EEVEMEEDEEYYEKEPGKVVDEKSEEEEEEE 174
Score = 29.6 bits (66), Expect = 2.2
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 9/44 (20%)
Query: 133 RRPRLYTGKGEEKEEEEEEEEEEE---------EEEEEEEEEEE 167
R+P + EE+EE E EE+EE +E+ EEEEEEE
Sbjct: 131 RKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEE 174
Score = 28.8 bits (64), Expect = 4.6
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEI 169
EEEEE E EE+EE E+E ++
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKV 162
Score = 28.5 bits (63), Expect = 6.0
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
+ EEEEE E EE+EE E+E
Sbjct: 136 PDLVLEEEEEVEMEEDEEYYEKEPG 160
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 30.2 bits (68), Expect = 1.4
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E E EEE +E ++ + ++ EE ++I
Sbjct: 280 PEPPEPEEEPDEPDQTDPDDGEETDQI 306
Score = 28.2 bits (63), Expect = 5.7
Identities = 7/34 (20%), Positives = 13/34 (38%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E E EEE +E ++ + ++ P
Sbjct: 274 PPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIP 307
Score = 28.2 bits (63), Expect = 6.3
Identities = 8/29 (27%), Positives = 14/29 (48%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATLN 174
E E EEE +E ++ + ++ EE
Sbjct: 280 PEPPEPEEEPDEPDQTDPDDGEETDQIPE 308
Score = 27.8 bits (62), Expect = 7.5
Identities = 7/25 (28%), Positives = 16/25 (64%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEE 166
EE+ +E ++ + ++ EE ++ EE
Sbjct: 285 PEEEPDEPDQTDPDDGEETDQIPEE 309
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 29.8 bits (67), Expect = 1.4
Identities = 12/29 (41%), Positives = 16/29 (55%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
K +EKEE E EE + EE EE ++
Sbjct: 68 KKKKKEKEEVPELAAEELSDSEENEENDK 96
Score = 29.4 bits (66), Expect = 1.6
Identities = 11/30 (36%), Positives = 18/30 (60%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
T K ++++EE E EE + EE EE ++
Sbjct: 67 TKKKKKEKEEVPELAAEELSDSEENEENDK 96
Score = 29.0 bits (65), Expect = 2.5
Identities = 12/48 (25%), Positives = 20/48 (41%)
Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLC 179
K+ + K E E EE + EE EE +++ + E+ N
Sbjct: 65 KKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAEV 112
Score = 28.6 bits (64), Expect = 3.1
Identities = 12/37 (32%), Positives = 19/37 (51%)
Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
R+ + ++K+E+EE E EE + EE EE
Sbjct: 58 PRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEEN 94
Score = 28.2 bits (63), Expect = 4.7
Identities = 13/31 (41%), Positives = 19/31 (61%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
T K ++K++E+EE E EE + EE EE
Sbjct: 64 TKKTKKKKKEKEEVPELAAEELSDSEENEEN 94
Score = 27.5 bits (61), Expect = 8.0
Identities = 10/40 (25%), Positives = 16/40 (40%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCR 180
K E++E E EE + EE EE +++
Sbjct: 71 KKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAA 110
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 29.9 bits (68), Expect = 1.4
Identities = 22/30 (73%), Positives = 23/30 (76%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
K EEK EEEEEEE E EEEEEEE+EEE
Sbjct: 13 VEKVEEKIEEEEEEEAPEAEEEEEEEDEEE 42
Score = 29.2 bits (66), Expect = 2.8
Identities = 18/25 (72%), Positives = 22/25 (88%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
EE+EEEE E EEEEEEE+EEE++E
Sbjct: 21 EEEEEEEAPEAEEEEEEEDEEEKKE 45
Score = 28.4 bits (64), Expect = 4.4
Identities = 18/25 (72%), Positives = 22/25 (88%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
E+EEEEE E EEEEEEE+EEE++E
Sbjct: 21 EEEEEEEAPEAEEEEEEEDEEEKKE 45
Score = 28.0 bits (63), Expect = 7.5
Identities = 17/28 (60%), Positives = 23/28 (82%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ + +EEEEEE E EEEEEEE+EEE++
Sbjct: 17 EEKIEEEEEEEAPEAEEEEEEEDEEEKK 44
>gnl|CDD|216249 pfam01025, GrpE, GrpE.
Length = 165
Score = 29.5 bits (67), Expect = 1.4
Identities = 21/27 (77%), Positives = 22/27 (81%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EEKEEEEE E+EEE EEE EE EEEI
Sbjct: 1 EEKEEEEELEDEEEALEEELEELEEEI 27
>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
Length = 356
Score = 29.9 bits (67), Expect = 1.4
Identities = 20/48 (41%), Positives = 25/48 (52%), Gaps = 16/48 (33%)
Query: 142 GEEKEEEEE----------------EEEEEEEEEEEEEEEEEEITATL 173
GEEKE ++E EE +E++EEEE EEEEE T L
Sbjct: 308 GEEKEPKQESQEQLFNPFAIDEEMLEETQEQQEEEENEEEEENDTVKL 355
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 30.2 bits (68), Expect = 1.5
Identities = 13/30 (43%), Positives = 18/30 (60%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
G E+ E +E EE+E E + E EE E+ A
Sbjct: 250 GREESEGSDESEEDEAEATDGEGEEGEMDA 279
Score = 29.8 bits (67), Expect = 2.2
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G+ E + +E EE+E E + E EE E +
Sbjct: 250 GREESEGSDESEEDEAEATDGEGEEGEMD 278
Score = 29.0 bits (65), Expect = 3.7
Identities = 10/30 (33%), Positives = 15/30 (50%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+ + E+ E EE E +E EE+E E
Sbjct: 237 DDDQPDNNEDSEAGREESEGSDESEEDEAE 266
Score = 28.7 bits (64), Expect = 4.4
Identities = 11/26 (42%), Positives = 14/26 (53%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E EE E +E EE+E E + E E
Sbjct: 248 EAGREESEGSDESEEDEAEATDGEGE 273
Score = 28.3 bits (63), Expect = 6.5
Identities = 11/25 (44%), Positives = 14/25 (56%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEITA 171
E+ E EE E +E EE+E E T
Sbjct: 245 EDSEAGREESEGSDESEEDEAEATD 269
Score = 28.3 bits (63), Expect = 7.1
Identities = 10/30 (33%), Positives = 14/30 (46%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ E EE E +E EE+E E + E
Sbjct: 243 NNEDSEAGREESEGSDESEEDEAEATDGEG 272
Score = 27.9 bits (62), Expect = 7.4
Identities = 11/27 (40%), Positives = 15/27 (55%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
+ E+ E EE E +E EE+E AT
Sbjct: 242 DNNEDSEAGREESEGSDESEEDEAEAT 268
Score = 27.9 bits (62), Expect = 8.0
Identities = 11/26 (42%), Positives = 15/26 (57%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+EE E +E EE+E E + E EE
Sbjct: 249 AGREESEGSDESEEDEAEATDGEGEE 274
Score = 27.5 bits (61), Expect = 10.0
Identities = 11/28 (39%), Positives = 17/28 (60%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
G+GEE E + E E+ E +E +E+ E
Sbjct: 270 GEGEEGEMDAAEASEDSESDESDEDTET 297
>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
trafficking and secretion].
Length = 340
Score = 29.9 bits (68), Expect = 1.5
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
E+ E E+E+ EEE+EEEE+E E + L+
Sbjct: 12 EELEAEKEKIEEEDEEEEKEGWFERLKQGLS 42
Score = 29.1 bits (66), Expect = 2.6
Identities = 16/28 (57%), Positives = 21/28 (75%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
E+ EEE EE E E+E+ EEE+EEEE+
Sbjct: 5 EKLEEELEELEAEKEKIEEEDEEEEKEG 32
Score = 29.1 bits (66), Expect = 2.6
Identities = 17/26 (65%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EK EEE EE E E+E+ EEE+EEEE
Sbjct: 4 FEKLEEELEELEAEKEKIEEEDEEEE 29
Score = 28.8 bits (65), Expect = 4.2
Identities = 15/27 (55%), Positives = 20/27 (74%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G ++ EEE EE E E+E+ EEE+EEE
Sbjct: 2 GFFEKLEEELEELEAEKEKIEEEDEEE 28
Score = 27.6 bits (62), Expect = 7.7
Identities = 16/32 (50%), Positives = 21/32 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
E+E EE E E+E+ EEE+EEEE+E L
Sbjct: 7 LEEELEELEAEKEKIEEEDEEEEKEGWFERLK 38
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 30.3 bits (69), Expect = 1.5
Identities = 7/28 (25%), Positives = 24/28 (85%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
++ E++++++ ++E+EE++E +E E+++
Sbjct: 155 DDDEDDDDDDVDDEDEEKKEAKELEKLS 182
Score = 29.6 bits (67), Expect = 2.5
Identities = 6/30 (20%), Positives = 25/30 (83%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
++++++E++++++ ++E+EE++E +E+
Sbjct: 151 DDDEDDDEDDDDDDVDDEDEEKKEAKELEK 180
Score = 29.2 bits (66), Expect = 3.5
Identities = 8/33 (24%), Positives = 20/33 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+E EE++E +E E+ ++++ +E+ + L
Sbjct: 167 DEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQ 199
Score = 28.8 bits (65), Expect = 3.6
Identities = 6/30 (20%), Positives = 24/30 (80%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
++ ++E+++E++++++ ++E+EE++E
Sbjct: 146 DIDDDDDDEDDDEDDDDDDVDDEDEEKKEA 175
Score = 28.8 bits (65), Expect = 3.7
Identities = 7/29 (24%), Positives = 22/29 (75%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E ++E++++++ ++E+EE++E +E
Sbjct: 150 DDDDEDDDEDDDDDDVDDEDEEKKEAKEL 178
Score = 28.4 bits (64), Expect = 5.3
Identities = 5/29 (17%), Positives = 23/29 (79%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ ++++E+++E++++++ ++E+EE+
Sbjct: 145 DDIDDDDDDEDDDEDDDDDDVDDEDEEKK 173
Score = 28.0 bits (63), Expect = 7.1
Identities = 6/30 (20%), Positives = 20/30 (66%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ +++ ++E+EE++E +E E+ ++
Sbjct: 155 DDDEDDDDDDVDDEDEEKKEAKELEKLSDD 184
>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein. This family consists of
several eukaryotic synaphin 1 and 2 proteins.
Synaphin/complexin is a cytosolic protein that
preferentially binds to syntaxin within the SNARE
complex. Synaphin promotes SNAREs to form precomplexes
that oligomerise into higher order structures. A peptide
from the central, syntaxin binding domain of synaphin
competitively inhibits these two proteins from
interacting and prevents SNARE complexes from
oligomerising. It is thought that oligomerisation of
SNARE complexes into a higher order structure creates a
SNARE scaffold for efficient, regulated fusion of
synaptic vesicles. Synaphin promotes neuronal exocytosis
by promoting interaction between the complementary
syntaxin and synaptobrevin transmembrane regions that
reside in opposing membranes prior to fusion.
Length = 139
Score = 29.1 bits (65), Expect = 1.5
Identities = 12/29 (41%), Positives = 19/29 (65%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+G+E + EEE+EE +E E EEE + +
Sbjct: 25 DEGDESDAEEEDEEIQEALREAEEERKAK 53
Score = 28.3 bits (63), Expect = 2.7
Identities = 13/27 (48%), Positives = 18/27 (66%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G+E E +E + EEE+EE +E E EE
Sbjct: 22 GKEDEGDESDAEEEDEEIQEALREAEE 48
Score = 28.3 bits (63), Expect = 3.0
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
GK +E +E + EEE+EE +E E EEE
Sbjct: 22 GKEDEGDESDAEEEDEEIQEALREAEEE 49
Score = 27.9 bits (62), Expect = 3.4
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G E++ +E + EEE+EE +E E EEE
Sbjct: 21 GGKEDEGDESDAEEEDEEIQEALREAEEE 49
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is
found in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 29.8 bits (67), Expect = 1.6
Identities = 7/31 (22%), Positives = 21/31 (67%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
GEE+ E+++ E ++ ++++E++++E
Sbjct: 80 EVVGEEEPEKDQAEAGKQLSQKQQEQQQQEE 110
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 29.6 bits (67), Expect = 1.6
Identities = 14/46 (30%), Positives = 26/46 (56%), Gaps = 3/46 (6%)
Query: 132 KRRPRLYTGK---GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
K++ + K G++ EEE+E+E E+E++ +E E +E N
Sbjct: 1 KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPN 46
Score = 28.9 bits (65), Expect = 3.2
Identities = 16/38 (42%), Positives = 20/38 (52%)
Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
L EEKE E EE EE +++E +EEEI T
Sbjct: 59 SFLPDKAREEKEAELREELREEFLKKQEAVKEEEIEIT 96
Score = 27.7 bits (62), Expect = 6.6
Identities = 9/30 (30%), Positives = 18/30 (60%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+++EEE+E+E E+E++ +E N
Sbjct: 14 GDDDEEEDEDEGEDEKKVPKESSEPDEANV 43
Score = 27.7 bits (62), Expect = 7.8
Identities = 12/30 (40%), Positives = 21/30 (70%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
++EEE+E+E E+E++ +E E A +NP
Sbjct: 16 DDEEEDEDEGEDEKKVPKESSEPDEANVNP 45
>gnl|CDD|178493 PLN02905, PLN02905, beta-amylase.
Length = 702
Score = 30.3 bits (68), Expect = 1.6
Identities = 11/33 (33%), Positives = 19/33 (57%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
G EE +EEE + + +EE++ + E+ I T
Sbjct: 10 GTSEEDDEEEMDMDVKEEDDGDRRNREKHIPVT 42
>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
modification].
Length = 429
Score = 30.0 bits (67), Expect = 1.7
Identities = 10/38 (26%), Positives = 16/38 (42%)
Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ P G+ + E EE +EE+ E E+ E
Sbjct: 346 FEVGPETAEGELLQFEAEEIYNDEEQYTRPVYENEDFE 383
Score = 27.6 bits (61), Expect = 9.5
Identities = 11/37 (29%), Positives = 16/37 (43%)
Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
G + P + E E E E + E EE +EE+
Sbjct: 335 GYQPPGWHEPLFEVGPETAEGELLQFEAEEIYNDEEQ 371
>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
motility and secretion / Intracellular trafficking and
secretion].
Length = 696
Score = 29.9 bits (68), Expect = 1.7
Identities = 15/32 (46%), Positives = 23/32 (71%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
K +E++ EE+E+ EE ++EEEEEEEE +
Sbjct: 322 LLSKRKEQQAEEKEKPAEEAKKEEEEEEEESV 353
Score = 28.8 bits (65), Expect = 4.6
Identities = 14/31 (45%), Positives = 22/31 (70%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
Y ++++ EE+E+ EE ++EEEEEEEE
Sbjct: 321 YLLSKRKEQQAEEKEKPAEEAKKEEEEEEEE 351
Score = 28.0 bits (63), Expect = 8.9
Identities = 14/36 (38%), Positives = 21/36 (58%)
Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
+ + EEKE+ EE ++EEEEEEEE ++
Sbjct: 320 AYLLSKRKEQQAEEKEKPAEEAKKEEEEEEEESVDD 355
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 30.2 bits (69), Expect = 1.7
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
+E+ E +E EE+E++ E+ EIT
Sbjct: 517 AYIKEQMEGSAPKEPEEKEKKPEKPEIT 544
>gnl|CDD|214923 smart00937, PCRF, This domain is found in peptide chain release
factors.
Length = 116
Score = 28.6 bits (65), Expect = 1.7
Identities = 16/31 (51%), Positives = 20/31 (64%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
EE+E++E E EEE EE E E EEE+ L
Sbjct: 4 EEEEDDEMRELAEEELEELEAELEEELEELL 34
Score = 27.0 bits (61), Expect = 6.2
Identities = 15/30 (50%), Positives = 18/30 (60%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
E EEEE++E E EEE EE E E+ L
Sbjct: 1 ELLEEEEDDEMRELAEEELEELEAELEEEL 30
Score = 27.0 bits (61), Expect = 6.2
Identities = 14/28 (50%), Positives = 17/28 (60%)
Query: 148 EEEEEEEEEEEEEEEEEEEEEITATLNP 175
E EEEE++E E EEE EE+ A L
Sbjct: 1 ELLEEEEDDEMRELAEEELEELEAELEE 28
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 28.8 bits (65), Expect = 1.7
Identities = 9/26 (34%), Positives = 21/26 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ +++ +EE+ +EE++E+EE+EE
Sbjct: 104 DDILPDDDFDEEDLDEEDDEDEEDEE 129
Score = 28.1 bits (63), Expect = 3.1
Identities = 8/26 (30%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ +++ +EE+ +EE++E+EE+E
Sbjct: 103 DDDILPDDDFDEEDLDEEDDEDEEDE 128
Score = 27.7 bits (62), Expect = 4.4
Identities = 9/43 (20%), Positives = 26/43 (60%)
Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E K++ + G + +++ +++ +EE+ +EE++E+E
Sbjct: 83 LEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDE 125
Score = 27.7 bits (62), Expect = 4.8
Identities = 11/44 (25%), Positives = 26/44 (59%)
Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
L E+ D K++ + ++ +++ +EE+ +EE++E+EE
Sbjct: 83 LEEKFDKKKKKFMDGDDDIIDDDILPDDDFDEEDLDEEDDEDEE 126
>gnl|CDD|215009 smart01069, CDC37_C, Cdc37 C terminal domain. Cdc37 is a protein
required for the activity of numerous eukaryotic protein
kinases. This domains corresponds to the C terminal
domain whose function is unclear. It is found C terminal
to the Hsp90 chaperone (Heat shocked protein 90) binding
domain pfam08565 and the N terminal kinase binding
domain of Cdc37.
Length = 93
Score = 28.1 bits (63), Expect = 1.8
Identities = 14/30 (46%), Positives = 20/30 (66%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+E E +E +E+ E EEE E+E+EEEE
Sbjct: 63 AIEDETEFKELQEQYEVEEEAEKEDEEEED 92
Score = 27.8 bits (62), Expect = 2.3
Identities = 15/33 (45%), Positives = 22/33 (66%)
Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
P + E KE +E+ E EEE E+E+EEEE++
Sbjct: 61 PNAIEDETEFKELQEQYEVEEEAEKEDEEEEDD 93
Score = 27.0 bits (60), Expect = 4.3
Identities = 13/26 (50%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E +E +E+ E EEE E+E+EEEE++
Sbjct: 68 TEFKELQEQYEVEEEAEKEDEEEEDD 93
Score = 26.2 bits (58), Expect = 8.6
Identities = 11/30 (36%), Positives = 17/30 (56%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
E+E E +E +E+ E EEE E+E+
Sbjct: 62 NAIEDETEFKELQEQYEVEEEAEKEDEEEE 91
>gnl|CDD|216403 pfam01271, Granin, Granin (chromogranin or secretogranin).
Length = 585
Score = 30.0 bits (67), Expect = 1.8
Identities = 16/46 (34%), Positives = 21/46 (45%), Gaps = 3/46 (6%)
Query: 133 RRPRLYTGKG-EEKEEEEEEEEEEEEEEEEEE--EEEEEITATLNP 175
R G+G + E E EE EE+E + +E EITA L
Sbjct: 291 LRWVNARGRGRSQNRAERERSEESEEKELDRASPYQEIEITANLQI 336
Score = 28.0 bits (62), Expect = 7.8
Identities = 11/40 (27%), Positives = 18/40 (45%)
Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
P GE+ EE EE++ +EE + +E +L
Sbjct: 215 PYEDVVGGEDWNPIEEGEEDQTQEEVKRSKERTHKGRSLP 254
>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
Length = 222
Score = 29.6 bits (66), Expect = 1.9
Identities = 10/42 (23%), Positives = 25/42 (59%)
Query: 134 RPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+P + K + +++E +E++E+ EE E + E++E+ +
Sbjct: 142 KPHMMKTKSKNQKQENQEKQEKLEESEIQSEKQEQPSQQPQS 183
Score = 28.8 bits (64), Expect = 3.2
Identities = 12/48 (25%), Positives = 27/48 (56%)
Query: 128 RLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+L R + + K + K +++E +E++E+ EE E + E++ + P
Sbjct: 134 QLSDLNRLKPHMMKTKSKNQKQENQEKQEKLEESEIQSEKQEQPSQQP 181
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 29.7 bits (67), Expect = 2.0
Identities = 9/34 (26%), Positives = 11/34 (32%)
Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ E E EE E E E EE+
Sbjct: 139 YPKTDEEKELLREVEELESRLATEPSPAPELEEQ 172
Score = 29.7 bits (67), Expect = 2.1
Identities = 8/31 (25%), Positives = 11/31 (35%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
Y ++E+E E EE E E
Sbjct: 137 YEYPKTDEEKELLREVEELESRLATEPSPAP 167
Score = 29.3 bits (66), Expect = 2.9
Identities = 9/31 (29%), Positives = 13/31 (41%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
K +E++E E EE E E E+
Sbjct: 139 YPKTDEEKELLREVEELESRLATEPSPAPEL 169
Score = 28.9 bits (65), Expect = 3.9
Identities = 10/27 (37%), Positives = 10/27 (37%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EEKE E EE E E E
Sbjct: 144 EEKELLREVEELESRLATEPSPAPELE 170
>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking. Sec14-like
Golgi-trafficking domain The GOLD domain is always found
combined with lipid- or membrane-association domains.
Length = 136
Score = 28.9 bits (65), Expect = 2.0
Identities = 16/37 (43%), Positives = 21/37 (56%), Gaps = 1/37 (2%)
Query: 148 EEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPV 184
+EEEEEE EEEE E + E + P + I+PV
Sbjct: 56 DEEEEEEAEEEEAETGDVEAGSKSQSRPL-VDEIIPV 91
Score = 27.0 bits (60), Expect = 7.7
Identities = 13/23 (56%), Positives = 15/23 (65%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEE 164
E +EEEEEE EEEE E + E
Sbjct: 52 SESSDEEEEEEAEEEEAETGDVE 74
Score = 27.0 bits (60), Expect = 8.7
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEE 168
E +EEEEEE EEEE E + E
Sbjct: 53 ESSDEEEEEEAEEEEAETGDVE 74
Score = 27.0 bits (60), Expect = 8.9
Identities = 13/22 (59%), Positives = 15/22 (68%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEE 165
E +EEEEEE EEEE E + E
Sbjct: 53 ESSDEEEEEEAEEEEAETGDVE 74
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 29.8 bits (68), Expect = 2.0
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+ E EE E EEEEE++EE++
Sbjct: 64 EEQRERLEELAPELLEEEEEKKEEKK 89
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for cell
viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 29.6 bits (67), Expect = 2.1
Identities = 11/28 (39%), Positives = 20/28 (71%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EE +EEE++EE++ ++ EE ++ EE
Sbjct: 68 SEESDEEEDDEEDDNDDSEENKDTVEEF 95
Score = 28.5 bits (64), Expect = 5.7
Identities = 9/35 (25%), Positives = 22/35 (62%)
Query: 132 KRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
K ++ + ++EE++EE++ ++ EE ++ EE
Sbjct: 60 KEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEE 94
Score = 28.1 bits (63), Expect = 6.2
Identities = 12/33 (36%), Positives = 21/33 (63%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
K K + EE +EEE++EE++ ++ EE T+
Sbjct: 60 KEAAKIDFSEESDEEEDDEEDDNDDSEENKDTV 92
>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
Provisional.
Length = 130
Score = 28.6 bits (64), Expect = 2.1
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEE 162
K K+EEE+EEE E +EEE
Sbjct: 89 VFNKLTSKKEEEKEEEIPEPTKEEE 113
Score = 28.2 bits (63), Expect = 2.9
Identities = 11/26 (42%), Positives = 17/26 (65%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEE 161
+++ +KEEE+EEE E +EEE
Sbjct: 88 KVFNKLTSKKEEEKEEEIPEPTKEEE 113
Score = 27.5 bits (61), Expect = 4.5
Identities = 12/25 (48%), Positives = 16/25 (64%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEI 169
++EEE+EEE E +EEE EI
Sbjct: 94 TSKKEEEKEEEIPEPTKEEELLGEI 118
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 29.7 bits (67), Expect = 2.3
Identities = 12/31 (38%), Positives = 16/31 (51%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
K +E+E E+E EEEE E E+ I
Sbjct: 396 KVKEEERPREKEGTEEEERREITVYEKRIKK 426
Score = 29.3 bits (66), Expect = 2.7
Identities = 11/34 (32%), Positives = 14/34 (41%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG 177
K +EEE E+E EEEE E +
Sbjct: 395 SKVKEEERPREKEGTEEEERREITVYEKRIKKLE 428
>gnl|CDD|227277 COG4941, COG4941, Predicted RNA polymerase sigma factor containing
a TPR repeat domain [Transcription].
Length = 415
Score = 29.7 bits (67), Expect = 2.3
Identities = 16/65 (24%), Positives = 25/65 (38%), Gaps = 13/65 (20%)
Query: 118 KPWPQKG--------LIE-----RLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
+ WP+ G LI +D RR E +E+EE EE E ++E
Sbjct: 45 ERWPRAGPPRNPAAWLIAVGRNRAIDRVRRRARRDAAPPELLLSDEDEEMEEAEALDDEH 104
Query: 165 EEEEI 169
++
Sbjct: 105 IRDDR 109
>gnl|CDD|148810 pfam07412, Geminin, Geminin. This family contains the eukaryotic
protein geminin (approximately 200 residues long).
Geminin inhibits DNA replication by preventing the
incorporation of MCM complex into prereplication
complex, and is degraded during the mitotic phase of the
cell cycle. It has been proposed that geminin inhibits
DNA replication during S, G2, and M phases and that
geminin destruction at the metaphase-anaphase transition
permits replication in the succeeding cell cycle.
Length = 200
Score = 29.1 bits (65), Expect = 2.3
Identities = 17/37 (45%), Positives = 23/37 (62%)
Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEE 161
+IERL G+ L + K +E + EEEE E+ EEE E
Sbjct: 164 MIERLTGEPPDNLESLKSQEFDSEEEEAEDSLEEESE 200
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 29.4 bits (66), Expect = 2.3
Identities = 11/72 (15%), Positives = 31/72 (43%), Gaps = 10/72 (13%)
Query: 106 TKVTNEVERTGRKPWPQKGLIER----------LDGKRRPRLYTGKGEEKEEEEEEEEEE 155
++ + + +K ++ +E+ + R+ L EK ++ E+ +
Sbjct: 53 NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAK 112
Query: 156 EEEEEEEEEEEE 167
+ EE++++ EE
Sbjct: 113 QAEEKQKQAEEA 124
>gnl|CDD|237798 PRK14714, PRK14714, DNA polymerase II large subunit; Provisional.
Length = 1337
Score = 29.7 bits (67), Expect = 2.4
Identities = 11/31 (35%), Positives = 16/31 (51%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
T +G+ EE EEEE + +E E+ I
Sbjct: 280 TIEGKSDAGEEGEEEELSTDADEYEKGPPRI 310
>gnl|CDD|198151 smart01083, Cir_N, N-terminal domain of CBF1 interacting
co-repressor CIR. This is a 45 residue conserved region
at the N-terminal end of a family of proteins referred
to as CIRs (CBF1-interacting co-repressors). CBF1
(centromere-binding factor 1) acts as a transcription
factor that causes repression by binding specifically to
GTGGGAA motifs in responsive promoters, and it requires
CIR as a co-repressor. CIR binds to histone deacetylase
and to SAP30 and serves as a linker between CBF1 and the
histone deacetylase complex.
Length = 37
Score = 26.3 bits (59), Expect = 2.4
Identities = 10/24 (41%), Positives = 17/24 (70%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEE 167
K E++ EEE+++ EE +E E+E
Sbjct: 14 WKAEQKAEEEKKKIEERRKEIEKE 37
Score = 26.0 bits (58), Expect = 3.3
Identities = 11/23 (47%), Positives = 17/23 (73%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEE 163
K E+K EEE+++ EE +E E+E
Sbjct: 15 KAEQKAEEEKKKIEERRKEIEKE 37
Score = 25.6 bits (57), Expect = 4.2
Identities = 9/23 (39%), Positives = 17/23 (73%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
+ E++ EEE+++ EE +E E+E
Sbjct: 15 KAEQKAEEEKKKIEERRKEIEKE 37
Score = 25.6 bits (57), Expect = 4.2
Identities = 9/24 (37%), Positives = 17/24 (70%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEE 166
+ E++ EEE+++ EE +E E+E
Sbjct: 14 WKAEQKAEEEKKKIEERRKEIEKE 37
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 29.6 bits (67), Expect = 2.4
Identities = 12/32 (37%), Positives = 19/32 (59%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
T + EE+EEE + E E E + ++E+E T
Sbjct: 229 TSETEEREEETDVEIETTSETKGTKQEQEGST 260
Score = 28.8 bits (65), Expect = 4.9
Identities = 12/43 (27%), Positives = 25/43 (58%)
Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E + K+ R + K +E + + EE+ ++EE++E+ + EE
Sbjct: 635 ELMKIKQIFRNWIEKKKEFKILDYTEEKTKKEEKKEKNKREEK 677
Score = 28.4 bits (64), Expect = 5.5
Identities = 10/32 (31%), Positives = 18/32 (56%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E+ EEE + E E E + ++E+E + +P
Sbjct: 233 EEREEETDVEIETTSETKGTKQEQEGSTEEDP 264
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 29.3 bits (66), Expect = 2.5
Identities = 12/34 (35%), Positives = 17/34 (50%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E + EEE E EE +E + EE+ L+P
Sbjct: 141 AEAEARLEEEVAEIREEGQELSSKREELKEKLDP 174
>gnl|CDD|216641 pfam01673, Herpes_env, Herpesvirus putative major envelope
glycoprotein. This family consists of probable major
envelope glycoproteins from members of the herpesviridae
including herpes simplex virus, human cytomegalovirus
and varicella-zoster virus. Members of the herpesviridae
have a dsDNA genome and do not have a RNA stage during
there replication.
Length = 483
Score = 29.6 bits (67), Expect = 2.6
Identities = 11/39 (28%), Positives = 14/39 (35%), Gaps = 3/39 (7%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEITATLN---PCGLCRIL 182
E+ E E + A L PC +CRIL
Sbjct: 25 GTPEDAELAPFVETALNNVLTHLAAALALDSPCNVCRIL 63
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 29.4 bits (66), Expect = 2.7
Identities = 11/28 (39%), Positives = 20/28 (71%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
EEKE+E+ + ++ E+E+E+EE E +
Sbjct: 288 EEKEKEKRKRLKKMMEDEDEDEEMEIVP 315
Score = 29.4 bits (66), Expect = 2.7
Identities = 13/27 (48%), Positives = 18/27 (66%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITAT 172
EEEE EE E ++EEE+EE+T +
Sbjct: 319 VEEEESEEPEPPPLPKKEEEKEEVTVS 345
Score = 27.9 bits (62), Expect = 7.5
Identities = 11/26 (42%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE EE+E+E+ + ++ E+E+E+EE
Sbjct: 285 EETEEKEKEKRKRLKKMMEDEDEDEE 310
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 29.3 bits (66), Expect = 2.7
Identities = 10/62 (16%), Positives = 25/62 (40%), Gaps = 9/62 (14%)
Query: 122 QKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEE---------EEEEEEITAT 172
Q + +R G + E + +++++E+ E + +E+ E++ T
Sbjct: 228 QPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLDDSDDDVSDEDGEDLFDT 287
Query: 173 LN 174
N
Sbjct: 288 DN 289
>gnl|CDD|145910 pfam03012, PP_M1, Phosphoprotein. This family includes the M1
phosphoprotein non-structural RNA polymerase alpha
subunit, which is thought to be a component of the
active polymerase, and may be involved in template
binding.
Length = 298
Score = 28.9 bits (65), Expect = 2.8
Identities = 11/38 (28%), Positives = 19/38 (50%)
Query: 130 DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+ R ++ K + EEE+E+E EE+ E E+
Sbjct: 51 EDMSRLQISDAKPSQFTEEEDEDEGSSEEDFYMTEGED 88
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 28.9 bits (65), Expect = 2.8
Identities = 7/25 (28%), Positives = 20/25 (80%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
+K ++++++ EEE ++ ++++EEE
Sbjct: 190 KKLKQQQQKREEERRKQRKKQQEEE 214
Score = 28.9 bits (65), Expect = 2.8
Identities = 7/26 (26%), Positives = 21/26 (80%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
++ ++++++ EEE ++ ++++EEEE
Sbjct: 190 KKLKQQQQKREEERRKQRKKQQEEEE 215
Score = 28.5 bits (64), Expect = 4.2
Identities = 13/60 (21%), Positives = 28/60 (46%), Gaps = 5/60 (8%)
Query: 110 NEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
++ER ++ + L K+ R + +E++++E E+E EE E + +E
Sbjct: 79 VKLERQAQEAYE-----NWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEK 133
Score = 27.8 bits (62), Expect = 6.5
Identities = 8/29 (27%), Positives = 22/29 (75%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
K ++++++ EEE ++ ++++EEEE ++
Sbjct: 191 KLKQQQQKREEERRKQRKKQQEEEERKQK 219
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 29.3 bits (66), Expect = 2.8
Identities = 11/27 (40%), Positives = 17/27 (62%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G + E ++ +EE+EE E+E EEE
Sbjct: 103 GATRNYEADKLDEEQEERVEKEREEEL 129
Score = 28.6 bits (64), Expect = 4.2
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEE 165
E ++ +EE+EE E+E EEE
Sbjct: 108 YEADKLDEEQEERVEKEREEELA 130
Score = 28.6 bits (64), Expect = 4.3
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
E ++ +EE+EE E+E EEE
Sbjct: 108 YEADKLDEEQEERVEKEREEELA 130
Score = 28.2 bits (63), Expect = 5.4
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 149 EEEEEEEEEEEEEEEEEEEEI 169
E++EEEEEEEE+E + +
Sbjct: 178 RREKKEEEEEEEEDEALIKSL 198
Score = 28.2 bits (63), Expect = 5.5
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 152 EEEEEEEEEEEEEEEEEITA 171
E++EEEEEEEE+E I +
Sbjct: 178 RREKKEEEEEEEEDEALIKS 197
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 28.6 bits (64), Expect = 2.9
Identities = 9/31 (29%), Positives = 22/31 (70%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
+E +E EE+E + ++E++E++E+++ L
Sbjct: 100 KEVSKETEEKEAIKAKKEKKEKKEKKVAEKL 130
Score = 28.2 bits (63), Expect = 3.3
Identities = 10/30 (33%), Positives = 19/30 (63%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
EE E+E++E +E EE+E + ++E+
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEK 121
Score = 27.8 bits (62), Expect = 4.5
Identities = 8/29 (27%), Positives = 20/29 (68%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E++++E +E EE+E + ++E++E+
Sbjct: 96 EKEQKEVSKETEEKEAIKAKKEKKEKKEK 124
Score = 27.4 bits (61), Expect = 7.3
Identities = 10/31 (32%), Positives = 20/31 (64%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EKE++E +E EE+E + ++E++E
Sbjct: 90 LFPEESEKEQKEVSKETEEKEAIKAKKEKKE 120
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 28.1 bits (63), Expect = 2.9
Identities = 14/26 (53%), Positives = 19/26 (73%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEEIT 170
K+ EE + EEEE+E EE EEE ++T
Sbjct: 97 KQLEENRKLEEEEKEREELEEENDVT 122
Score = 27.4 bits (61), Expect = 4.9
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEEEE 168
K++ EE + EEEE+E EE EEE
Sbjct: 96 KKQLEENRKLEEEEKEREELEEEN 119
Score = 27.4 bits (61), Expect = 6.1
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEE 163
Y + EE + EEEE+E EE EEE +
Sbjct: 95 YKKQLEENRKLEEEEKEREELEEEND 120
Score = 27.0 bits (60), Expect = 8.1
Identities = 13/25 (52%), Positives = 15/25 (60%)
Query: 150 EEEEEEEEEEEEEEEEEEEITATLN 174
EE + EEEE+E EE EEE T
Sbjct: 100 EENRKLEEEEKEREELEEENDVTKG 124
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 29.1 bits (66), Expect = 2.9
Identities = 12/27 (44%), Positives = 14/27 (51%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E + E EEE EE E EEE E +
Sbjct: 302 ERRAAEREEELEEGERLREEEAERQAR 328
Score = 28.7 bits (65), Expect = 3.8
Identities = 14/27 (51%), Positives = 16/27 (59%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EEKEEE +E EEE + EEEE
Sbjct: 35 EEKEEERRIDEMMEEERLKALAEEEER 61
Score = 28.3 bits (64), Expect = 5.0
Identities = 15/26 (57%), Positives = 15/26 (57%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EEKEE E EEE EE E EEE
Sbjct: 298 EEKEERRAAEREEELEEGERLREEEA 323
Score = 28.0 bits (63), Expect = 7.2
Identities = 11/26 (42%), Positives = 15/26 (57%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
EE+ +E EEE + EEEE E +
Sbjct: 39 EERRIDEMMEEERLKALAEEEERERK 64
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 28.1 bits (63), Expect = 2.9
Identities = 10/28 (35%), Positives = 19/28 (67%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ + EEE ++EE+ E E+ E+ E++E
Sbjct: 74 EALKLLEEENDDEEDAETEDTEDVEDDE 101
Score = 28.1 bits (63), Expect = 3.8
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ EEE ++EE+ E E+ E+ E++E
Sbjct: 77 KLLEEENDDEEDAETEDTEDVEDDEWE 103
Score = 27.7 bits (62), Expect = 4.7
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+EE ++EE+ E E+ E+ E++E E
Sbjct: 77 KLLEEENDDEEDAETEDTEDVEDDEWE 103
Score = 27.3 bits (61), Expect = 6.0
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E + EEE ++EE+ E E+ E+ E
Sbjct: 73 KEALKLLEEENDDEEDAETEDTEDVE 98
Score = 26.9 bits (60), Expect = 8.9
Identities = 11/27 (40%), Positives = 19/27 (70%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
K E+E ++EE+ E E+ E+ E++E E
Sbjct: 77 KLLEEENDDEEDAETEDTEDVEDDEWE 103
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium
(Slime molds). REP protein is likely involved in
transcription regulation and control of DNA replication,
specifically amplification of plasmid at low copy
numbers. The formation of homomultimers may be required
for their regulatory activity.
Length = 910
Score = 29.4 bits (66), Expect = 3.0
Identities = 8/28 (28%), Positives = 21/28 (75%)
Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
+ ++ +E+E+++E+E+E+E+E+E
Sbjct: 883 QFQENEDDDADEDEDQDEDEDEDEDEDE 910
Score = 28.6 bits (64), Expect = 5.4
Identities = 10/31 (32%), Positives = 25/31 (80%)
Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
L + E ++++ +E+E+++E+E+E+E+E+E
Sbjct: 880 LISQFQENEDDDADEDEDQDEDEDEDEDEDE 910
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 29.1 bits (66), Expect = 3.1
Identities = 10/29 (34%), Positives = 22/29 (75%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
EE+ +EE +E++EE+++EE E + +++
Sbjct: 277 EEERQEEAQEKKEEKKKEEREAKLAKLSP 305
Score = 28.4 bits (64), Expect = 5.3
Identities = 14/30 (46%), Positives = 22/30 (73%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
EEE +EE +E++EE+++EE E A L+P
Sbjct: 276 AEEERQEEAQEKKEEKKKEEREAKLAKLSP 305
>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
Length = 382
Score = 29.1 bits (66), Expect = 3.2
Identities = 11/37 (29%), Positives = 15/37 (40%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
G+ EE+EEE EEEE + E +
Sbjct: 224 LGEEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQ 260
>gnl|CDD|222762 pfam14456, alpha-hel2, Alpha-helical domain 2. An alpha-helical
domain found in gene neighborhoods encoding genes
containing bacterial homologs of components of the
ubiquitin modification pathway such as the E1, E2, Ub
and JAB peptidase proteins.
Length = 322
Score = 29.0 bits (65), Expect = 3.3
Identities = 14/48 (29%), Positives = 19/48 (39%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPVMVHIV 189
GE+ EEE E E E E++EE EE + P +
Sbjct: 139 GEDDEEEALEVLEGEGEDDEEAAAEEYFSRVRPALAPDDGHPHLFKWG 186
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 29.1 bits (65), Expect = 3.4
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G E E+EE+ ++++ E E+EE+ E E
Sbjct: 204 GDDTESEDEEDGDDDQPTENEQEEQGEGE 232
>gnl|CDD|191365 pfam05756, S-antigen, S-antigen protein. S-antigens are heat
stable proteins that are found in the blood of
individuals infected with malaria.
Length = 94
Score = 27.6 bits (61), Expect = 3.4
Identities = 12/30 (40%), Positives = 19/30 (63%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
GKG++ E+ +EE E E ++EE+ EE
Sbjct: 52 GKGQKYEDLQEEGEGENDDEEDSNSEESNN 81
>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
Length = 613
Score = 28.9 bits (66), Expect = 3.5
Identities = 14/36 (38%), Positives = 22/36 (61%), Gaps = 2/36 (5%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEE--EEITATLN 174
+G+ +E+EEE+EE+EE EEE + E + L
Sbjct: 471 RGDLDLGKEDEEEKEEKEEAEEEFKPLLERLKEALG 506
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 29.2 bits (66), Expect = 3.5
Identities = 14/59 (23%), Positives = 26/59 (44%), Gaps = 10/59 (16%)
Query: 110 NEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E RT + R + +RR + +EK + ++E + EE+ +E E+E
Sbjct: 662 TEKARTQDEQQQA----PRRERQRR------RNDEKRQAQQEAKALNVEEQSVQETEQE 710
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.0 bits (65), Expect = 3.6
Identities = 9/26 (34%), Positives = 18/26 (69%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E + EE E + +EE+ ++EE E++
Sbjct: 390 DETDASEEAEAKAKEEKLKQEENEKK 415
Score = 28.2 bits (63), Expect = 5.4
Identities = 8/28 (28%), Positives = 19/28 (67%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+E E + +EE+ ++EE E++++E+
Sbjct: 392 TDASEEAEAKAKEEKLKQEENEKKQKEQ 419
>gnl|CDD|237628 PRK14156, PRK14156, heat shock protein GrpE; Provisional.
Length = 177
Score = 28.3 bits (63), Expect = 3.6
Identities = 15/26 (57%), Positives = 15/26 (57%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E K EE EEE E E EE EE EE
Sbjct: 4 EIKNEEVEEEVEATETEETVEEVVEE 29
Score = 28.3 bits (63), Expect = 3.7
Identities = 15/27 (55%), Positives = 15/27 (55%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEE 167
K EE EEE E E EE EE EE E
Sbjct: 6 KNEEVEEEVEATETEETVEEVVEETPE 32
Score = 28.3 bits (63), Expect = 4.4
Identities = 15/32 (46%), Positives = 17/32 (53%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
EE + EE EEE E E EE EE E T +
Sbjct: 3 EEIKNEEVEEEVEATETEETVEEVVEETPEKS 34
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 28.5 bits (64), Expect = 3.6
Identities = 15/24 (62%), Positives = 19/24 (79%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEI 169
EEEE++EE E EEEEEEEE + +
Sbjct: 35 EEEEDDEEFEIEEEEEEEEVDSDF 58
Score = 28.5 bits (64), Expect = 4.0
Identities = 18/35 (51%), Positives = 22/35 (62%), Gaps = 9/35 (25%)
Query: 144 EKEEEEEE---------EEEEEEEEEEEEEEEEEI 169
E+E EE+E EEE++EE E EEEEEEE
Sbjct: 19 EEELEEDEFFWTYLLFEEEEDDEEFEIEEEEEEEE 53
Score = 28.1 bits (63), Expect = 5.1
Identities = 15/34 (44%), Positives = 23/34 (67%), Gaps = 6/34 (17%)
Query: 142 GEEKEEEEEEE------EEEEEEEEEEEEEEEEI 169
EE+EEEEE + E++E E ++EEE E+E+
Sbjct: 45 IEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKEL 78
Score = 28.1 bits (63), Expect = 6.0
Identities = 14/22 (63%), Positives = 18/22 (81%)
Query: 145 KEEEEEEEEEEEEEEEEEEEEE 166
+EEE++EE E EEEEEEEE +
Sbjct: 35 EEEEDDEEFEIEEEEEEEEVDS 56
Score = 27.3 bits (61), Expect = 8.7
Identities = 10/23 (43%), Positives = 17/23 (73%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
+ E++E E ++EEE E+E + EE
Sbjct: 60 DSEDDEPESDDEEEGEKELQREE 82
>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D. SpoVID, the
stage VI sporulation protein D, is restricted to
endospore-forming members of the bacteria, all of which
are found among the Firmicutes. It is widely distributed
but not quite universal in this group. Between
well-conserved N-terminal and C-terminal domains is a
poorly conserved, low-complexity region of variable
length, rich enough in glutamic acid to cause spurious
BLAST search results unless a filter is used. The seed
alignment for this model was trimmed, in effect, by
choosing member sequences in which these regions are
relatively short. SpoVID is involved in spore coat
assembly by the mother cell compartment late in the
process of sporulation [Cellular processes, Sporulation
and germination].
Length = 338
Score = 28.7 bits (64), Expect = 3.6
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
E EEE EE E+E E E E+EE+ +
Sbjct: 215 EASPEEENYEEYEDETELEVEDEEKALDEQ 244
>gnl|CDD|232958 TIGR00399, metG_C_term, methionyl-tRNA synthetase C-terminal
region/beta chain. The methionyl-tRNA synthetase (metG)
is a class I amino acyl-tRNA ligase. This model
describes a region of the methionyl-tRNA synthetase that
is present at the C-terminus of MetG in some species (E.
coli, B. subtilis, Thermotoga maritima, Methanobacterium
thermoautotrophicum), and as a separate beta chain in
Aquifex aeolicus. It is absent in a number of other
species (e.g. Mycoplasma genitalium, Mycobacterium
tuberculosis), while Pyrococcus horikoshii has both a
full length MetG and a second protein homologous to the
beta chain only. Proteins hit by This model should
called methionyl-tRNA synthetase beta chain if and only
if the model metG hits a separate protein not also hit
by This model [Protein synthesis, tRNA aminoacylation].
Length = 137
Score = 27.8 bits (62), Expect = 3.7
Identities = 9/28 (32%), Positives = 19/28 (67%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
E K + +++E+++E E+ E ++E IT
Sbjct: 6 ELKLKGAKKKEKKDEGEKALEPQKETIT 33
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 28.3 bits (63), Expect = 3.8
Identities = 19/75 (25%), Positives = 36/75 (48%)
Query: 95 QKKYKWVRFDHTKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEE 154
QK + + D + E K ++ IE + ++ + E+ E++E+EE
Sbjct: 28 QKTFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEI 87
Query: 155 EEEEEEEEEEEEEEI 169
+EEE+EEE E+ +
Sbjct: 88 AQEEEKEEEAEDVKQ 102
Score = 27.5 bits (61), Expect = 7.4
Identities = 11/35 (31%), Positives = 21/35 (60%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+ + E++E+EE +EEE+EEE E+ ++
Sbjct: 75 REDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFK 109
>gnl|CDD|235378 PRK05256, PRK05256, condesin subunit E; Provisional.
Length = 238
Score = 28.4 bits (64), Expect = 3.8
Identities = 10/27 (37%), Positives = 16/27 (59%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEE 165
+ ++E +E EEE+ +EEEEE
Sbjct: 212 PIENHLQQESSLNDETEEEQPDEEEEE 238
Score = 28.0 bits (63), Expect = 5.2
Identities = 11/25 (44%), Positives = 16/25 (64%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
E ++E +E EEE+ +EEEEE
Sbjct: 214 ENHLQQESSLNDETEEEQPDEEEEE 238
>gnl|CDD|185618 PTZ00438, PTZ00438, gamete antigen 27/25-like protein; Provisional.
Length = 374
Score = 28.9 bits (64), Expect = 3.8
Identities = 17/28 (60%), Positives = 21/28 (75%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+G +KEEEE+E+ EE EE EE E EEE
Sbjct: 103 RGTQKEEEEDEDVEEIEEVEEVEVVEEE 130
>gnl|CDD|218114 pfam04501, Baculo_VP39, Baculovirus major capsid protein VP39.
This family constitutes the 39 kDa major capsid protein
of the Baculoviridae.
Length = 300
Score = 28.5 bits (64), Expect = 3.9
Identities = 9/29 (31%), Positives = 15/29 (51%), Gaps = 1/29 (3%)
Query: 218 PKYLENLVEERCWGPSVLFVDNTDYMERN 246
P +++NL+ + P L +DN RN
Sbjct: 197 PPFIKNLIN-KLVAPETLTIDNETLRLRN 224
>gnl|CDD|221746 pfam12739, TRAPPC-Trs85, ER-Golgi trafficking TRAPP I complex 85
kDa subunit. This family is one of the subunits of the
TRAPP Golgi trafficking complex. TRAPP subunits are
found in two different sized complexes, TRAPP I and
TRAPP II, and this Trs85 is in the smaller complex.
TRAPP I, but Not TRAPP II, functions in ER-Golgi
transport. Trs85p was reported to function in the
cytosol-to-vacuole targeting pathway, suggesting a role
for this subunit in autophagy as well as in secretion.
The overall architecture of TRAPP I shows the other
components to be Bet3p (TRAPPC3), Bet5p (TRAPPC1),
Trs20p (TRAPPC2), Trs23p (TRAPPC4), Trs31p (TRAPPC5),
Trs33p (TRAPPC6a and b) and Trs85p.
Length = 390
Score = 28.9 bits (65), Expect = 4.0
Identities = 22/115 (19%), Positives = 40/115 (34%), Gaps = 21/115 (18%)
Query: 152 EEEEEEEEEEEEEEEEEITATLNPCGLCRILPV--------MVHIVVGLTSTLSIKLMQD 203
+E + ++EE + E++ T C C +L + + S
Sbjct: 40 DESDGDDEERSQALLEQLKRTFGLCN-CHLLRLNSSSSESAALTDDDPPDEWSSALEDLQ 98
Query: 204 DLHIVKDLV--LPVDDPKYLENLVEERCWGPSVLFVDNT--DYMERNISIVTDQI 254
++ L V D + + V E FV + +MER + + DQI
Sbjct: 99 EISSAGPEGVYLSVSDVEAIRAFVRE--------FVTQSLIPFMERKVRFLNDQI 145
>gnl|CDD|216831 pfam01988, VIT1, VIT family. This family includes the vacuolar
Fe2+/Mn2+ uptake transporter, Ccc1 and the vacuolar iron
transporter VIT1.
Length = 209
Score = 28.4 bits (64), Expect = 4.0
Identities = 15/38 (39%), Positives = 18/38 (47%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
K E EE E E E+ E E + E E EE+ GL
Sbjct: 57 KSERDLEEAELEREKRELENDPEGEREELAEIYRARGL 94
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 28.8 bits (65), Expect = 4.0
Identities = 8/33 (24%), Positives = 21/33 (63%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E +E++ + +++EE++++E E+ A + P
Sbjct: 561 EALKEQKRLRKLKKQEEKKKKELEKLEKAKIPP 593
>gnl|CDD|241523 cd13372, PH_CAPRI, Ca2+ promoted Ras inactivator pleckstrin
homology (PH) domain. CAPRI (also called RASA4/RAS p21
protein activator (GTPase activating protein)
4/GAPL/FLJ59070/KIAA0538/MGC131890) is a member of the
GAP1 family of GTPase-activating proteins. CAPRI
contains two fully conserved C2 domains, a PH domain, a
RasGAP domain, and a BTK domain. Its catalytic GAP
domain has dual RasGAP and RapGAP activities, while its
C2 domains bind phospholipids in the presence of Ca2+.
Both CAPRI and RASAL are calcium-activated RasGAPs that
inactivate Ras at the plasma membrane. Thereby enhancing
the weak intrinsic GTPase activity of RAS proteins
resulting in the inactive GDP-bound form of RAS and
allowing control of cellular proliferation and
differentiation. CAPRI and RASAL differ in that CAPRI is
an amplitude sensor while RASAL senses calcium
oscillations. This difference between them resides not
in their C2 domains, but in their PH domains leading to
speculation that this might reflect an association with
either phosphoinositides and/or proteins. PH domains
have diverse functions, but in general are involved in
targeting proteins to the appropriate cellular location
or in the interaction with a binding partner. They share
little sequence conservation, but all have a common
fold, which is electrostatically polarized. Less than
10% of PH domains bind phosphoinositide phosphates
(PIPs) with high affinity and specificity. PH domains
are distinguished from other PIP-binding domains by
their specific high-affinity binding to PIPs with two
vicinal phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2
or PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 140
Score = 27.9 bits (62), Expect = 4.1
Identities = 23/101 (22%), Positives = 42/101 (41%), Gaps = 19/101 (18%)
Query: 162 EEEEEEEITATLNPCG-LCRILPVMVHIVVG----LTST---LSIKLMQDDLHIVKDL-- 211
EEE+E ++T L + + + +H G + S+ L L +D L K
Sbjct: 9 EEEDELDLTRMLLLQAPMVKEGFLFIHRTKGKGPLMASSFKKLYFTLTKDALSFAKTPHS 68
Query: 212 ----VLPVDDPKYLENLVEERCWGPS----VLFVDNTDYME 244
+ + + E VEE+C+G S +++ D+ E
Sbjct: 69 KKSSSISLAKIRAAEK-VEEKCFGSSNVMQIIYTDDAGQQE 108
>gnl|CDD|192445 pfam10018, Med4, Vitamin-D-receptor interacting Mediator subunit 4.
Members of this family function as part of the Mediator
(Med) complex, which links DNA-bound transcriptional
regulators and the general transcription machinery,
particularly the RNA polymerase II enzyme. They play a
role in basal transcription by mediating activation or
repression according to the specific complement of
transcriptional regulators bound to the promoter.
Length = 176
Score = 28.1 bits (63), Expect = 4.2
Identities = 18/64 (28%), Positives = 28/64 (43%), Gaps = 6/64 (9%)
Query: 114 RTGRKPWPQ-----KGLIERL-DGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
+PWP +GL+ + + G E+EEE + E E + EEE+E E +
Sbjct: 113 GDLFRPWPAEDEMRRGLLAQSSLLSKEGIEPPGFDPEEEEERKAREAEAKTEEEDEREAQ 172
Query: 168 EITA 171
E
Sbjct: 173 ERNQ 176
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 28.9 bits (64), Expect = 4.4
Identities = 13/25 (52%), Positives = 18/25 (72%)
Query: 148 EEEEEEEEEEEEEEEEEEEEEITAT 172
E EE+E EEE+EEEE +E++A
Sbjct: 768 AESSEEDESSEEEKEEEENKEVSAK 792
>gnl|CDD|214818 smart00784, SPT2, SPT2 chromatin protein. This entry includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 106
Score = 27.3 bits (61), Expect = 4.4
Identities = 9/20 (45%), Positives = 13/20 (65%)
Query: 145 KEEEEEEEEEEEEEEEEEEE 164
+ E+ EEE E+EEE E+
Sbjct: 83 RLEDREEERLEKEEEREKRA 102
Score = 27.0 bits (60), Expect = 5.6
Identities = 11/31 (35%), Positives = 16/31 (51%), Gaps = 6/31 (19%)
Query: 144 EKEEE------EEEEEEEEEEEEEEEEEEEE 168
++EE E+ EEE E+EEE E+
Sbjct: 72 QEEERRSARLARLEDREEERLEKEEEREKRA 102
Score = 26.2 bits (58), Expect = 10.0
Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 6/33 (18%)
Query: 142 GEEKEEEEE------EEEEEEEEEEEEEEEEEE 168
G + +EEE E+ EEE E+EEE E+
Sbjct: 68 GADIQEEERRSARLARLEDREEERLEKEEEREK 100
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 28.8 bits (64), Expect = 4.4
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E E E+ ++E E++EE++ EEE
Sbjct: 142 NESESSLEDLSDDETEDDEEKKMEEEE 168
Score = 28.4 bits (63), Expect = 5.2
Identities = 12/29 (41%), Positives = 22/29 (75%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+ + K EEE+EE++E+E++++EEE E
Sbjct: 288 EADSKPEEEDEEDDEQEDDQDEEEPPEAA 316
Score = 28.4 bits (63), Expect = 5.9
Identities = 11/29 (37%), Positives = 19/29 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITA 171
E ++E E++EE++ EEEE EE+ +
Sbjct: 148 LEDLSDDETEDDEEKKMEEEEAGEEKESV 176
Score = 28.4 bits (63), Expect = 6.3
Identities = 10/28 (35%), Positives = 17/28 (60%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E E+ ++E E++EE++ EEEE
Sbjct: 144 SESSLEDLSDDETEDDEEKKMEEEEAGE 171
Score = 28.0 bits (62), Expect = 6.6
Identities = 10/29 (34%), Positives = 19/29 (65%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ ++ ++E E++EE++ EEEE EE
Sbjct: 144 SESSLEDLSDDETEDDEEKKMEEEEAGEE 172
Score = 27.6 bits (61), Expect = 9.2
Identities = 10/27 (37%), Positives = 18/27 (66%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
G + + + E+ ++EEEEEE+E E+
Sbjct: 103 GSDGSDMDSEDSADDEEEEEEDESLED 129
>gnl|CDD|217509 pfam03353, Lin-8, Ras-mediated vulval-induction antagonist. LIN-8
is a nuclear protein, present at the sites of
transcriptional repressor complexes, which interacts
with LIN-35 Rb.Lin35 Rb is a product of the class B
synMuv gene lin-35 which silences genes required for
vulval specification through chromatin modification and
remodelling. The biological role of the interaction has
not yet been determined however predictions have been
made. The interaction shows that class A synMuv genes
control vulval induction through the transcriptional
regulation of gene expression. LIN-8 normally functions
as part of a protein complex however when the complex is
absent, other family members can partially replace LIN-8
activity.
Length = 316
Score = 28.5 bits (64), Expect = 4.5
Identities = 11/36 (30%), Positives = 19/36 (52%)
Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
+Y G E++ E ++ EE +E +E E EE +
Sbjct: 145 IYDGIFEDRTRESQDMEENDEVDEVEVEEVPDDYGA 180
>gnl|CDD|217827 pfam03979, Sigma70_r1_1, Sigma-70 factor, region 1.1. Region 1.1
modulates DNA binding by region 2 and 4 when sigma is
unbound by the core RNA polymerase. Region 1.1 is also
involved in promoter binding.
Length = 79
Score = 26.9 bits (60), Expect = 4.5
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
EE + EE +EE +E+ EEE E
Sbjct: 57 EEADSEELDEETSSDEDAEEEAE 79
Score = 26.1 bits (58), Expect = 6.9
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEE 165
EE + EE +EE +E+ EEE E
Sbjct: 57 EEADSEELDEETSSDEDAEEEAE 79
Score = 26.1 bits (58), Expect = 8.0
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEE 166
E+ + EE +EE +E+ EEE E
Sbjct: 57 EEADSEELDEETSSDEDAEEEAE 79
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 28.9 bits (65), Expect = 4.6
Identities = 15/34 (44%), Positives = 21/34 (61%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
G E+ E+E EE E+ +EE EEE +ITA +
Sbjct: 382 TGLTPEKLEKELEELEKAKEEIEEEISKITARIG 415
>gnl|CDD|234064 TIGR02936, fdxN_nitrog, ferredoxin III, nif-specific. Members of
this family are homodimeric ferredoxins from nitrogen
fixation regions of many nitrogen-fixing bacteria. As
characterized in Rhodobacter capsulatus, these proteins
are homodimeric, with two 4Fe-4S clusters bound per
monomer. Although nif-specific, this protein family is
not usiveral, as other nitrogenase systems may
substitute flavodoxins, or different types of ferredoxin
[Central intermediary metabolism, Nitrogen fixation].
Length = 91
Score = 27.0 bits (60), Expect = 4.8
Identities = 12/43 (27%), Positives = 21/43 (48%), Gaps = 6/43 (13%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEEITATLNP-----CGLC-RILP 183
+ EE E +++++E E ++ NP CG C R+ P
Sbjct: 41 KGINEEGELVASDDDDDEIERKVMVVANPGNCIGCGACARVCP 83
>gnl|CDD|215637 PLN03223, PLN03223, Polycystin cation channel protein; Provisional.
Length = 1634
Score = 28.8 bits (64), Expect = 4.8
Identities = 22/67 (32%), Positives = 32/67 (47%), Gaps = 6/67 (8%)
Query: 114 RTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEE----EEEEEEEEEI 169
R+ K W K I + + R +L KGE +EEEEE EE+E+ +E +E +
Sbjct: 1448 RSMFKGWFYKNHIP--EARVRRQLRIWKGENPDEEEEEAFREEKEKVFTYLNKELDEAGL 1505
Query: 170 TATLNPC 176
L C
Sbjct: 1506 KRVLRRC 1512
>gnl|CDD|177755 PLN00152, PLN00152, DNA-directed RNA polymerase; Provisional.
Length = 130
Score = 27.5 bits (61), Expect = 4.9
Identities = 11/28 (39%), Positives = 13/28 (46%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
G E E E E EE EE+ E +E
Sbjct: 8 GYEDEPSEPEIEEGAEEDSESNNNDEVK 35
Score = 27.1 bits (60), Expect = 5.8
Identities = 9/32 (28%), Positives = 16/32 (50%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
++E E E EE EE+ E +E+ ++
Sbjct: 8 GYEDEPSEPEIEEGAEEDSESNNNDEVKGSIG 39
>gnl|CDD|235449 PRK05415, PRK05415, hypothetical protein; Provisional.
Length = 341
Score = 28.3 bits (64), Expect = 5.0
Identities = 13/30 (43%), Positives = 16/30 (53%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E+E +EEEEEE E E + A L P
Sbjct: 32 EQEAFAPAAPDEEEEEEGELEAAVEAALRP 61
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 26.7 bits (59), Expect = 5.2
Identities = 5/27 (18%), Positives = 26/27 (96%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
GEE E+++++E++++++++++++++++
Sbjct: 47 GEEPEDDDDDEDDDDDDDKDDKDDDDD 73
>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
Length = 325
Score = 28.4 bits (64), Expect = 5.3
Identities = 8/32 (25%), Positives = 13/32 (40%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
E +E+ EEE + + EEE +
Sbjct: 20 EVFDEKALVEEEPSDNDLAEEELLSQGATQRV 51
>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
selection and in elongation by RNA polymerase II
[Transcription].
Length = 521
Score = 28.5 bits (63), Expect = 5.3
Identities = 11/32 (34%), Positives = 19/32 (59%)
Query: 133 RRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEE 164
R RLY+ + E + ++ +E EE E+ +EE
Sbjct: 157 REERLYSERHIELQRFKDYKELEESEQGLQEE 188
>gnl|CDD|240413 PTZ00423, PTZ00423, glideosome-associated protein 45; Provisional.
Length = 193
Score = 27.8 bits (61), Expect = 5.4
Identities = 13/27 (48%), Positives = 20/27 (74%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
E+ E++ EE E+EEE EE++ +EEI
Sbjct: 43 VEELEDQPEEPPEQEEENEEQKPKEEI 69
Score = 27.4 bits (60), Expect = 8.4
Identities = 13/26 (50%), Positives = 18/26 (69%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+K E+ EE E++ EE E+EEE E
Sbjct: 36 EQKPEDIVEELEDQPEEPPEQEEENE 61
>gnl|CDD|233117 TIGR00759, aceE, pyruvate dehydrogenase E1 component, homodimeric
type. Most members of this family are pyruvate
dehydrogenase complex, E1 component. Note: this family
was classified as subfamily rather than equivalog
because it includes a counterexample from Pseudomonas
putida, MdeB, that is active as an E1 component of an
alpha-ketoglutarate dehydrogenase complex rather than a
pyruvate dehydrogase complex. The second pyruvate
dehydrogenase complex E1 protein from Alcaligenes
eutrophus, PdhE, complements an aceE mutant of E. coli
but is not part of a pyruvate dehydrogenase complex
operon, is more similar to the Pseudomonas putida MdeB
than to E. coli AceE, and may have also have a different
primary specificity.
Length = 885
Score = 28.6 bits (64), Expect = 5.4
Identities = 16/51 (31%), Positives = 22/51 (43%), Gaps = 1/51 (1%)
Query: 249 IVTDQIKHYNIMPAYGENNMLKHDTLVITVDALKQIEDKLLLALNRSDNER 299
I+ IK Y M E+ H + VDALK D+ L L+ + E
Sbjct: 383 ILAKTIKGYG-MGDAAESRNTAHQVKKLEVDALKNFRDRFELPLSDAQVEE 432
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 28.1 bits (63), Expect = 5.4
Identities = 11/60 (18%), Positives = 25/60 (41%), Gaps = 3/60 (5%)
Query: 120 WPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLC 179
W ++ +L+ KR+ R K++ + + +E ++ + + I A L G
Sbjct: 146 WKYADVVAKLEEKRKER---AAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKKFGYA 202
>gnl|CDD|224559 COG1645, COG1645, Uncharacterized Zn-finger containing protein
[General function prediction only].
Length = 131
Score = 27.4 bits (61), Expect = 5.4
Identities = 12/30 (40%), Positives = 14/30 (46%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
G E EEEEEE E E +E+ E
Sbjct: 51 GYREVVVEEEEEEVEAEVQEQLRRSRPELP 80
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 28.5 bits (64), Expect = 5.5
Identities = 13/45 (28%), Positives = 27/45 (60%), Gaps = 6/45 (13%)
Query: 131 GKRRPRLYTGKGEEK------EEEEEEEEEEEEEEEEEEEEEEEI 169
K+ R + GK E EE+ ++++++E+++EEE + EE+
Sbjct: 721 MKKIFRNWNGKDAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKA 765
Score = 27.8 bits (62), Expect = 9.8
Identities = 9/30 (30%), Positives = 21/30 (70%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
E+ ++++++E+++EEE + EE+ EI
Sbjct: 743 EKTKKKKKKEKKKEEEYKREEKARIEIAEA 772
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 28.5 bits (64), Expect = 5.6
Identities = 19/48 (39%), Positives = 22/48 (45%), Gaps = 6/48 (12%)
Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
ER + R + E E E EE+ E E EEE EE E A LN
Sbjct: 537 ERAEELRE------RAAELEAEAEEKREAAAEAEEEAEEAREEVAELN 578
>gnl|CDD|215369 PLN02685, PLN02685, iron superoxide dismutase.
Length = 299
Score = 28.0 bits (62), Expect = 5.6
Identities = 12/31 (38%), Positives = 17/31 (54%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
RL + K + E+EE E+EEE + E E
Sbjct: 256 RLESAKARAAQREQEETRTEDEEEPDSEAVE 286
>gnl|CDD|217148 pfam02620, DUF177, Uncharacterized ACR, COG1399.
Length = 118
Score = 27.3 bits (61), Expect = 5.7
Identities = 13/24 (54%), Positives = 18/24 (75%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEEI 169
EEEE E+EE E+++EE E +EI
Sbjct: 35 EEEEAEDEELEDDDEEILVEGDEI 58
>gnl|CDD|222662 pfam14300, DUF4375, Domain of unknown function (DUF4375). This
family of proteins is found in bacteria. Proteins in
this family are typically between 156 and 204 amino
acids in length. There is a single completely conserved
residue G that may be functionally important.
Length = 123
Score = 27.2 bits (61), Expect = 5.8
Identities = 13/30 (43%), Positives = 16/30 (53%)
Query: 137 LYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
LY GE E EE +EE E+ EE +E
Sbjct: 67 LYGKHGEPIEREERDEEFMALFEQFEEFDE 96
>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3). This family
consists of several optic atrophy 3 (OPA3) proteins.
OPA3 deficiency causes type III 3-methylglutaconic
aciduria (MGA) in humans. This disease manifests with
early bilateral optic atrophy, spasticity,
extrapyramidal dysfunction, ataxia, and cognitive
deficits, but normal longevity.
Length = 134
Score = 27.2 bits (61), Expect = 5.8
Identities = 14/32 (43%), Positives = 17/32 (53%), Gaps = 1/32 (3%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEE-EEEEEEITA 171
K +KEEE ++E EE E E E E E A
Sbjct: 102 KEAKKEEELQQELEELEARVGELELEIERQKA 133
>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
Length = 470
Score = 28.3 bits (64), Expect = 5.8
Identities = 23/90 (25%), Positives = 36/90 (40%), Gaps = 15/90 (16%)
Query: 139 TGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPVMVHIVVGLTSTLSI 198
G EE EE E +E E E +EE+ L+ + GL L+
Sbjct: 395 EGFDEETVEELRERAKEALETEALAQEEKLADDLLS--------------LEGLDRELAF 440
Query: 199 KLMQDDLHIVKDLV-LPVDDPKYLENLVEE 227
KL + + ++DL VD+ +E E+
Sbjct: 441 KLAEKGIKTLEDLAEQAVDELIDIEGDEEK 470
>gnl|CDD|239583 cd03506, Delta6-FADS-like, The Delta6 Fatty Acid Desaturase
(Delta6-FADS)-like CD includes the integral-membrane
enzymes: delta-4, delta-5, delta-6, delta-8,
delta-8-sphingolipid, and delta-11 desaturases found in
vertebrates, higher plants, fungi, and bacteria. These
desaturases are required for the synthesis of highly
unsaturated fatty acids (HUFAs), which are mainly
esterified into phospholipids and contribute to
maintaining membrane fluidity. While HUFAs may be
required for cold tolerance in bacteria, plants and
fish, the primary role of HUFAs in mammals is cell
signaling. These enzymes are described as front-end
desaturases because they introduce a double bond between
the pre-exiting double bond and the carboxyl (front) end
of the fatty acid. Various substrates are involved, with
both acyl-coenzyme A (CoA) and acyl-lipid desaturases
present in this CD. Acyl-lipid desaturases are localized
in the membranes of cyanobacterial thylakoid, plant
endoplasmic reticulum (ER), and plastid; and acyl-CoA
desaturases are present in ER membrane. ER-bound plant
acyl-lipid desaturases and acyl-CoA desaturases require
cytochrome b5 as an electron donor. Most of the
eukaryotic desaturase domains have an adjacent
N-terminal cytochrome b5-like domain. This domain family
has extensive hydrophobic regions that would be capable
of spanning the membrane bilayer at least twice.
Comparison of sequences also reveals the existence of
three regions of conserved histidine cluster motifs that
contain the residues: HXXXH, HXX(X)HH, and Q/HXXHH.
These histidine residues are reported to be
catalytically essential and proposed to be the ligands
for the iron atoms contained within the homolog,
stearoyl CoA desaturase.
Length = 204
Score = 28.0 bits (63), Expect = 5.9
Identities = 11/42 (26%), Positives = 15/42 (35%), Gaps = 3/42 (7%)
Query: 67 KHFGFVNLH---PDIFGLVPRLDVITMNIEWQKKYKWVRFDH 105
H + N+ PDI L + QKK R+ H
Sbjct: 57 VHHAYTNILGHDPDIDTLPLLARSEPAFGKDQKKRFLHRYQH 98
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 28.3 bits (63), Expect = 6.0
Identities = 12/41 (29%), Positives = 22/41 (53%)
Query: 128 RLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
L+ +RR + EE E + ++E+E E+ E+ E + E
Sbjct: 234 NLEERRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPE 274
>gnl|CDD|224304 COG1386, scpB, Chromosome segregation and condensation protein B
[DNA replication, recombination and repair].
Length = 184
Score = 27.7 bits (62), Expect = 6.0
Identities = 16/55 (29%), Positives = 23/55 (41%), Gaps = 9/55 (16%)
Query: 123 KGLIE---RLDGKRRPRLY--TGKGEE----KEEEEEEEEEEEEEEEEEEEEEEE 168
+GLI R D RP LY T K + +E + EE ++ EE+
Sbjct: 130 RGLIREVGRRDTPGRPYLYGTTEKFLDYFGLDSLDELPDLEELKDAGLLSEEDLL 184
>gnl|CDD|219069 pfam06512, Na_trans_assoc, Sodium ion transport-associated.
Members of this family contain a region found
exclusively in eukaryotic sodium channels or their
subunits, many of which are voltage-gated. Members very
often also contain between one and four copies of
pfam00520 and, less often, one copy of pfam00612.
Length = 230
Score = 27.7 bits (62), Expect = 6.1
Identities = 14/22 (63%), Positives = 16/22 (72%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEE 165
E EEEEEE EEEEE +E E+
Sbjct: 160 EPPEEEEEEIAEEEEEVKEPED 181
Score = 27.7 bits (62), Expect = 7.1
Identities = 13/24 (54%), Positives = 17/24 (70%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEE 166
+ + EEEEEE EEEEE +E E+
Sbjct: 158 DLEPPEEEEEEIAEEEEEVKEPED 181
Score = 27.3 bits (61), Expect = 8.1
Identities = 14/23 (60%), Positives = 16/23 (69%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEEE 168
+ E EEEEEE EEEEE +E E
Sbjct: 158 DLEPPEEEEEEIAEEEEEVKEPE 180
Score = 27.3 bits (61), Expect = 8.9
Identities = 14/22 (63%), Positives = 16/22 (72%)
Query: 146 EEEEEEEEEEEEEEEEEEEEEE 167
E EEEEEE EEEEE +E E+
Sbjct: 160 EPPEEEEEEIAEEEEEVKEPED 181
Score = 27.3 bits (61), Expect = 8.9
Identities = 14/22 (63%), Positives = 16/22 (72%)
Query: 147 EEEEEEEEEEEEEEEEEEEEEE 168
E EEEEEE EEEEE +E E+
Sbjct: 160 EPPEEEEEEIAEEEEEVKEPED 181
>gnl|CDD|224798 COG1886, FliN, Flagellar motor switch/type III secretory pathway
protein [Cell motility and secretion / Intracellular
trafficking and secretion].
Length = 136
Score = 27.4 bits (61), Expect = 6.2
Identities = 12/82 (14%), Positives = 22/82 (26%), Gaps = 7/82 (8%)
Query: 119 PWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
LI + P + +E E E E E +
Sbjct: 12 EVGFDLLIGLGWAQAVPGSEATADSLLYKSVKEVAFAEVELTESTVLESLNESIDL---- 67
Query: 179 CRILPVMVHIVVGL-TSTLSIK 199
+L + V + V L + + +
Sbjct: 68 --LLDIPVRLSVELGRTKMPLG 87
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 28.6 bits (63), Expect = 6.2
Identities = 10/29 (34%), Positives = 19/29 (65%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
K EE ++ EE+E++ E ++E EE ++
Sbjct: 1676 KAEEAKKAEEDEKKAAEALKKEAEEAKKA 1704
Score = 28.6 bits (63), Expect = 6.5
Identities = 12/29 (41%), Positives = 24/29 (82%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
K EE ++E EE++++ EE +++EEE+++I
Sbjct: 1731 KAEEAKKEAEEDKKKAEEAKKDEEEKKKI 1759
Score = 28.2 bits (62), Expect = 8.2
Identities = 15/62 (24%), Positives = 30/62 (48%)
Query: 107 KVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
+ +VE+ +K +K E L E K+ EE++++ EE ++ EE+E++
Sbjct: 1630 EEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689
Query: 167 EE 168
Sbjct: 1690 AA 1691
Score = 28.2 bits (62), Expect = 8.5
Identities = 9/25 (36%), Positives = 19/25 (76%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
+K EE++++ EE ++ EE+E++ E
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAE 1692
>gnl|CDD|225029 COG2118, COG2118, DNA-binding protein [General function prediction
only].
Length = 116
Score = 26.9 bits (60), Expect = 6.2
Identities = 16/49 (32%), Positives = 28/49 (57%)
Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
E L+ RR +L + + K EE+ E +EE+ +EEE +++ + L P
Sbjct: 4 EELEEIRRRKLAELQRQAKLEEQREAQEEQARQEEEAQKQAILRQILTP 52
>gnl|CDD|240228 PTZ00010, PTZ00010, tubulin beta chain; Provisional.
Length = 445
Score = 28.2 bits (63), Expect = 6.3
Identities = 16/48 (33%), Positives = 21/48 (43%), Gaps = 14/48 (29%)
Query: 138 YTGKG-EEKEEEEEE-------------EEEEEEEEEEEEEEEEEITA 171
YTG+G +E E E E ++ EEE E +EEEE
Sbjct: 398 YTGEGMDEMEFTEAESNMNDLVSEYQQYQDATVEEEGEFDEEEEAYEI 445
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 28.2 bits (63), Expect = 6.4
Identities = 7/27 (25%), Positives = 13/27 (48%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+ EEE+EEE E + +++
Sbjct: 6 NTQANEEEDEEELEAVARSAGSDSDDD 32
Score = 28.2 bits (63), Expect = 7.2
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 149 EEEEEEEEEEEEEEEEEEEEITATLNP 175
EEE+ + EEE+EEE E + +
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGS 27
>gnl|CDD|220063 pfam08895, DUF1840, Domain of unknown function (DUF1840). This
family of proteins are functionally uncharacterized.
Length = 105
Score = 26.8 bits (60), Expect = 6.4
Identities = 8/27 (29%), Positives = 16/27 (59%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
+E E+E+++E+EE+EE +
Sbjct: 54 AAEEARRAPPAEDEDDDEDEEDEEPAV 80
Score = 26.4 bits (59), Expect = 9.7
Identities = 10/32 (31%), Positives = 17/32 (53%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
RL E+ E+E+++E+EE+EE
Sbjct: 48 RLEAAIAAEEARRAPPAEDEDDDEDEEDEEPA 79
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 27.8 bits (62), Expect = 6.4
Identities = 12/27 (44%), Positives = 19/27 (70%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EE E E+EEEE+ ++EEEE+++
Sbjct: 148 EEALEFEKEEEEQRRLLLQKEEEEQQM 174
>gnl|CDD|235224 PRK04140, PRK04140, hypothetical protein; Provisional.
Length = 317
Score = 27.9 bits (63), Expect = 6.4
Identities = 10/31 (32%), Positives = 16/31 (51%)
Query: 148 EEEEEEEEEEEEEEEEEEEEEITATLNPCGL 178
E+ ++E E+ EE+ E+EI L G
Sbjct: 190 EKVRDDEVTPEDPEEDPFEKEILNVLTRLGF 220
>gnl|CDD|221805 pfam12848, ABC_tran_2, ABC transporter. This domain is related to
pfam00005.
Length = 85
Score = 26.4 bits (59), Expect = 6.8
Identities = 10/26 (38%), Positives = 20/26 (76%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E+KEE E++E+ E++++E ++ EE
Sbjct: 20 EQKEERLEQQEKAYEKQQKEIKKLEE 45
>gnl|CDD|218590 pfam05438, TRH, Thyrotropin-releasing hormone (TRH). This family
consists of several thyrotropin-releasing hormone (TRH)
proteins. Thyrotropin-Releasing Hormone (TRH;
pyroGlu-His-Pro-NH2), originally isolated as a
hypothalamic neuropeptide hormone, most likely acts also
as a neuromodulator and/or neurotransmitter in the
central nervous system (CNS). This interpretation is
supported by the identification of a peptidase localised
on the surface of neuronal cells which has been termed
TRH-degrading ectoenzyme (TRH-DE) since it selectively
inactivates TRH. TRH has been used clinically for the
treatment of spinocerebellar degeneration and
disturbance of consciousness in humans.
Length = 210
Score = 27.6 bits (61), Expect = 6.8
Identities = 19/85 (22%), Positives = 31/85 (36%)
Query: 93 EWQKKYKWVRFDHTKVTNEVERTGRKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEE 152
EW + H +E PW L +R +R K + E EE
Sbjct: 114 EWDSDAGQQKRQHPGKRLSLEHMLEDPWLLSELSKRQHPGKRYLDPLHKRQHPGRRELEE 173
Query: 153 EEEEEEEEEEEEEEEEITATLNPCG 177
E++ + +E E+ + A +PC
Sbjct: 174 EDDGDSKELEKRQHPGKRALGSPCE 198
>gnl|CDD|222117 pfam13422, DUF4110, Domain of unknown function (DUF4110). This is
a family that is found predominantly at the C-terminus
of Kelch-containing proteins. However, the exact
function of this region is not known.
Length = 99
Score = 26.5 bits (59), Expect = 6.9
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI 169
EE+ E EE E+EEEE+ EE+
Sbjct: 56 EERYWELREELRALEDEEEEQAGIEEV 82
Score = 26.2 bits (58), Expect = 9.3
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEE 165
E +EE E+EEEE+ EE
Sbjct: 61 ELREELRALEDEEEEQAGIEEVV 83
Score = 26.2 bits (58), Expect = 9.6
Identities = 10/25 (40%), Positives = 13/25 (52%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEE 167
EE E+EEEE+ EE +E
Sbjct: 64 EELRALEDEEEEQAGIEEVVSLAKE 88
>gnl|CDD|222508 pfam14033, DUF4246, Protein of unknown function (DUF4246). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and fungi.
Proteins in this family are typically between 392 and
644 amino acids in length.
Length = 496
Score = 28.0 bits (63), Expect = 6.9
Identities = 11/36 (30%), Positives = 18/36 (50%)
Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
G R+ E +E+E EE + E E+E+ E+
Sbjct: 250 GHEPLRIEYYGVEYHPKEKEPEEPKPEPGEDEDYED 285
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 27.7 bits (62), Expect = 6.9
Identities = 20/108 (18%), Positives = 43/108 (39%), Gaps = 13/108 (12%)
Query: 126 IERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPVM 185
L+ +RR R + + +++++ +E +++E+ ++ E EE + + I
Sbjct: 5 EALLEQRRRKR---EQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAE 61
Query: 186 VHIVVGLTSTLSIKLMQD-----DLHIVKDLVLPVDDPKYLENLVEER 228
+IV ++ DL + K D K L +E R
Sbjct: 62 GNIVFS-----KVEFADGEQAKKDLKLKKKKKKKKTDYKQLLKKLEAR 104
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 27.9 bits (62), Expect = 7.0
Identities = 12/60 (20%), Positives = 27/60 (45%), Gaps = 3/60 (5%)
Query: 117 RKPWPQKGLIERLDGKRR---PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATL 173
+ +P +E L+ + R P L E +E+++E+E + E E + ++ +
Sbjct: 78 EELFPSAEKLEELEEEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMI 137
>gnl|CDD|182633 PRK10669, PRK10669, putative cation:proton antiport protein;
Provisional.
Length = 558
Score = 28.1 bits (63), Expect = 7.2
Identities = 19/64 (29%), Positives = 30/64 (46%), Gaps = 15/64 (23%)
Query: 149 EEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPVMVH-IVVG---LTSTLSIKLMQDD 204
+ E EE+ EE EEE++I P +C H ++VG + S L KL+
Sbjct: 392 AKTETLEEQTLEEAIEEEKQI-----PVDICN------HALLVGYGRVGSLLGEKLLAAG 440
Query: 205 LHIV 208
+ +V
Sbjct: 441 IPLV 444
>gnl|CDD|234338 TIGR03742, PRTRC_F, PRTRC system protein F. A novel genetic system
characterized by seven (usually) major proteins,
including a ParB homolog and a ThiF homolog, is commonly
found on plasmids or in bacterial chromosomal regions
near phage, plasmid, or transposon markers. It is most
common among the beta Proteobacteria. We designate the
system PRTRC, or ParB-Related,ThiF-Related Cassette.
This protein family is designated protein F. It is the
most divergent of the families.
Length = 342
Score = 27.8 bits (62), Expect = 7.2
Identities = 11/34 (32%), Positives = 16/34 (47%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
GE EEE EE +E++E+ E + L
Sbjct: 165 GETDEEEALEELCDEDDEDREAYLPSVVEQALLE 198
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 27.3 bits (61), Expect = 7.4
Identities = 11/35 (31%), Positives = 18/35 (51%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITAT 172
Y K E + +EE E ++E EE+++IT
Sbjct: 31 YEKKDIEINTDYLQEETELDKELFTPEEQKKITFQ 65
>gnl|CDD|218049 pfam04373, DUF511, Protein of unknown function (DUF511). Bacterial
protein of unknown function.
Length = 310
Score = 27.7 bits (62), Expect = 7.5
Identities = 13/36 (36%), Positives = 20/36 (55%)
Query: 131 GKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEE 166
++ PR Y K E E E + +E + EE+EE+ E
Sbjct: 64 QEKPPRRYYLKSREDELELKALDEIKSEEDEEQSEA 99
>gnl|CDD|191022 pfam04538, BEX, Brain expressed X-linked like family. This is a
family of transcription elongation factors which
includes those referred to as Bex proteins as well as
those named TCEAL7. Bex1 was shown to be a novel link
between neurotrophin signalling, the cell cycle, and
neuronal differentiation, suggesting it might function
by coordinating internal cellular states with the
ability of cells to respond to external signals. TCEAL7
has been shown negatively to regulate the NF-kappaB
pathway, hence being important in ovarian cancer as it
one of the genes frequently downregulated in this
cancer. A closely related protein, TFIIS/TCEA, found in
pfam07500 is involved in transcription elongation and
transcript fidelity. TFIIS/TCEA promotes 3'
endoribonuclease activity of RNA polymerase II (pol II)
and allows pol II to bypass transcript pause or 'arrest'
during elongation process. It is thus possible that BEX
is also acting in this way.
Length = 97
Score = 26.5 bits (59), Expect = 7.6
Identities = 11/25 (44%), Positives = 15/25 (60%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEE 168
E E + E E +EEEE+ EE E +
Sbjct: 7 ENEGKPESEPKEEEEKRPLEEGEGK 31
Score = 26.2 bits (58), Expect = 8.5
Identities = 12/26 (46%), Positives = 16/26 (61%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEE 168
E K E E +EEEE+ EE E ++ E
Sbjct: 9 EGKPESEPKEEEEKRPLEEGEGKKPE 34
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 28.0 bits (62), Expect = 7.6
Identities = 13/52 (25%), Positives = 21/52 (40%)
Query: 117 RKPWPQKGLIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
+K R + R RL G E + EEE E+E+ + ++ E
Sbjct: 6 DLEDLKKEAARRAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIE 57
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 28.0 bits (62), Expect = 7.6
Identities = 9/34 (26%), Positives = 20/34 (58%), Gaps = 5/34 (14%)
Query: 143 EEKEEEEEEEEEEEE-----EEEEEEEEEEEITA 171
E+K+ +E+E+EEE ++E + + +E +
Sbjct: 3946 EDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQ 3979
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 27.3 bits (60), Expect = 7.8
Identities = 17/63 (26%), Positives = 32/63 (50%)
Query: 127 ERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILPVMV 186
++ D K++ ++ KG+ K++EE+ + E E +E E E+ P + PV V
Sbjct: 23 QKSDKKKKEKVSEKKGKSKKKEEKPNGKIPEHEPNQEVTEVEVIIEKEPVPAVAVAPVPV 82
Query: 187 HIV 189
+V
Sbjct: 83 AVV 85
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 28.1 bits (62), Expect = 7.9
Identities = 11/34 (32%), Positives = 18/34 (52%)
Query: 135 PRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
T + EE +E E+ E+E E + +EE+ E
Sbjct: 948 ESDETDEDEESDESSEDLSEDESENDSSDEEDGE 981
>gnl|CDD|217952 pfam04189, Gcd10p, Gcd10p family. eIF-3 is a multi-subunit complex
that stimulates translation initiation in vitro at
several different steps. This family corresponds to the
gamma subunit if eIF3. The Yeast protein Gcd10p has also
been shown to be part of a complex with the
methyltransferase Gcd14p that is involved in modifying
tRNA.
Length = 299
Score = 27.6 bits (62), Expect = 7.9
Identities = 11/27 (40%), Positives = 12/27 (44%)
Query: 142 GEEKEEEEEEEEEEEEEEEEEEEEEEE 168
E EE EEEE E EEE +
Sbjct: 273 FEPPAEEIEEEEPEVLPEEELNTMKPS 299
>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein. This family consists of mammalian
MRVI1 proteins which are related to the
lymphoid-restricted membrane protein (JAW1) and the IP3
receptor associated cGMP kinase substrates A and B
(IRAGA and IRAGB). The function of MRVI1 is unknown
although mutations in the Mrvi1 gene induces myeloid
leukaemia by altering the expression of a gene important
for myeloid cell growth and/or differentiation so it has
been speculated that Mrvi1 is a tumour suppressor gene.
IRAG is very similar in sequence to MRVI1 and is an
essential NO/cGKI-dependent regulator of IP3-induced
calcium release. Activation of cGKI decreases
IP3-stimulated elevations in intracellular calcium,
induces smooth muscle relaxation and contributes to the
antiproliferative and pro-apoptotic effects of NO/cGMP.
Jaw1 is a member of a class of proteins with
COOH-terminal hydrophobic membrane anchors and is
structurally similar to proteins involved in vesicle
targeting and fusion. This suggests that the function
and/or the structure of the ER in lymphocytes may be
modified by lymphoid-restricted resident ER proteins.
Length = 538
Score = 28.1 bits (62), Expect = 7.9
Identities = 15/67 (22%), Positives = 28/67 (41%), Gaps = 9/67 (13%)
Query: 141 KGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCG---------LCRILPVMVHIVVG 191
+GEE E + EE E EE ++E+ ++ L + L +M ++
Sbjct: 430 EGEEAVERTRKPSLSEEVAETEEWDKEQRSSKLASWVTFLQVSYRKANQALWLMAAAILL 489
Query: 192 LTSTLSI 198
L + +S
Sbjct: 490 LAALMSF 496
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 28.0 bits (62), Expect = 8.0
Identities = 13/44 (29%), Positives = 23/44 (52%)
Query: 125 LIERLDGKRRPRLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
I+ L R + ++E E+EEE + +E +EEE+E+
Sbjct: 239 RIDLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEK 282
>gnl|CDD|236876 PRK11191, PRK11191, RNase E inhibitor protein; Provisional.
Length = 138
Score = 26.8 bits (60), Expect = 8.1
Identities = 7/30 (23%), Positives = 16/30 (53%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
Y G G E+ E+ ++E+ +E+++
Sbjct: 108 YDGWGTYFEDPNAEDGDDEDFVDEDDDGVR 137
>gnl|CDD|225752 COG3211, PhoX, Predicted phosphatase [General function prediction
only].
Length = 616
Score = 27.9 bits (62), Expect = 8.1
Identities = 19/90 (21%), Positives = 28/90 (31%), Gaps = 7/90 (7%)
Query: 29 FNDFTSNKSEILHETKEFQNAFLQPRQAWVENLDTTEEKHFGF-VNLHPDIFGLVPRLDV 87
F+ + + E L R T + F P+ FG V +D
Sbjct: 241 FDGYFGGEDEELEPE------DDLKRYGIPGGDYTWGKFDARFDYVKEPNRFGWVVEIDP 294
Query: 88 ITMNIEWQKKYKWVRFDHTKVTNEVERTGR 117
N K+ RF H +V + GR
Sbjct: 295 NDPNSTPVKRTALGRFKHEAAAVDVNKDGR 324
>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
Length = 1176
Score = 28.0 bits (63), Expect = 8.3
Identities = 16/38 (42%), Positives = 20/38 (52%), Gaps = 5/38 (13%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
G+ EKEE E+ E E+ EE+EEE LNP
Sbjct: 849 LEGEFSEKEEAEKFYNNLIEVEKVEEKEEE-----LNP 881
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 27.3 bits (61), Expect = 8.3
Identities = 11/31 (35%), Positives = 13/31 (41%)
Query: 138 YTGKGEEKEEEEEEEEEEEEEEEEEEEEEEE 168
Y + K EE EE E +EEE E
Sbjct: 156 YAEWKDAKLLEEFAAEEAEAAAAAKEEESAE 186
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 27.4 bits (61), Expect = 8.4
Identities = 20/61 (32%), Positives = 23/61 (37%), Gaps = 10/61 (16%)
Query: 114 RTGRKPWPQ-KGLIERLDGKRRP-----RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEE 167
R P PQ KGL R P E E E+E + E+E E EEE
Sbjct: 97 VVPRPPVPQPKGLKMRF----FPTGYGAPDGPPSELGSESETSEKETTAKVEKEAEVEEE 152
Query: 168 E 168
E
Sbjct: 153 E 153
>gnl|CDD|224035 COG1110, COG1110, Reverse gyrase [DNA replication, recombination,
and repair].
Length = 1187
Score = 28.1 bits (63), Expect = 8.5
Identities = 12/43 (27%), Positives = 17/43 (39%), Gaps = 8/43 (18%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEI--------TATLNPCG 177
+ E+ E EE E E E E++ +AT P G
Sbjct: 248 KLYGEKRAERVREELREVEREREKKRRKLGILVVSSATGKPRG 290
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 27.8 bits (62), Expect = 8.5
Identities = 16/67 (23%), Positives = 19/67 (28%), Gaps = 3/67 (4%)
Query: 106 TKVTNEVERTGRKPWPQKGLIERLDGKRR---PRLYTGKGEEKEEEEEEEEEEEEEEEEE 162
K E R G P+K K E + + E EE
Sbjct: 50 PKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEE 109
Query: 163 EEEEEEI 169
EEE EE
Sbjct: 110 EEEPEET 116
>gnl|CDD|219405 pfam07418, PCEMA1, Acidic phosphoprotein precursor PCEMA1. This
family consists of several acidic phosphoprotein
precursor PCEMA1 sequences which appear to be found
exclusively in Plasmodium chabaudi. PCEMA1 is an antigen
that is associated with the membrane of the infected
erythrocyte throughout the entire intraerythrocytic
cycle. The exact function of this family is unclear.
Length = 286
Score = 27.6 bits (61), Expect = 8.7
Identities = 14/34 (41%), Positives = 17/34 (50%)
Query: 136 RLYTGKGEEKEEEEEEEEEEEEEEEEEEEEEEEI 169
R Y +K E E E+E E+E EEE EE
Sbjct: 245 REYREWKGDKANTNETEIEDESEDEYEEEAGEEQ 278
>gnl|CDD|227225 COG4888, COG4888, Uncharacterized Zn ribbon-containing protein
[General function prediction only].
Length = 104
Score = 26.3 bits (58), Expect = 8.7
Identities = 8/23 (34%), Positives = 13/23 (56%)
Query: 140 GKGEEKEEEEEEEEEEEEEEEEE 162
G+G + +EE ++E E EE
Sbjct: 80 GRGCGESGTDEENDQEIESPGEE 102
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 27.7 bits (61), Expect = 9.1
Identities = 9/41 (21%), Positives = 25/41 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNPCGLCRILP 183
E+ ++ + + + EE++++ EEE+ + T++ + G + P
Sbjct: 104 EQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQP 144
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 26.5 bits (59), Expect = 9.3
Identities = 13/23 (56%), Positives = 20/23 (86%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEE 165
EE ++EEEEE+EE+E+E E +E+
Sbjct: 28 EEYDKEEEEEKEEKEKEAEPDED 50
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 27.5 bits (61), Expect = 9.7
Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 8/45 (17%)
Query: 134 RPRLYTGKG--------EEKEEEEEEEEEEEEEEEEEEEEEEEIT 170
RP+ GKG E ++E + E E EE EEE+
Sbjct: 227 RPQRPVGKGLSELVQEEESIDDELDVLREIEAEEAGIGPIEEEVV 271
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 27.7 bits (62), Expect = 9.8
Identities = 7/31 (22%), Positives = 19/31 (61%)
Query: 144 EKEEEEEEEEEEEEEEEEEEEEEEEITATLN 174
E ++ ++EEEE +E++++E++ +
Sbjct: 221 ELDDIDDEEEERVLADEDDDDEDDMFDMDDD 251
>gnl|CDD|221473 pfam12230, PRP21_like_P, Pre-mRNA splicing factor PRP21 like
protein. This domain family is found in eukaryotes, and
is typically between 212 and 238 amino acids in length.
The family is found in association with pfam01805. There
are two completely conserved residues (W and H) that may
be functionally important. PRP21 is required for
assembly of the prespliceosome and it interacts with U2
snRNP and/or pre-mRNA in the prespliceosome. This family
also contains proteins similar to PRP21, such as the
mammalian SF3a. SF3a also interacts with U2 snRNP from
the prespliceosome, converting it to its active form.
Length = 230
Score = 27.4 bits (61), Expect = 9.9
Identities = 15/33 (45%), Positives = 20/33 (60%)
Query: 143 EEKEEEEEEEEEEEEEEEEEEEEEEEITATLNP 175
+ EEEEE EEEEE+ +EE++EE P
Sbjct: 100 DAAAAPEEEEEMEEEEEDIDEEKKEESAKKSTP 132
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.317 0.137 0.406
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,641,512
Number of extensions: 1677397
Number of successful extensions: 20188
Number of sequences better than 10.0: 1
Number of HSP's gapped: 13607
Number of HSP's successfully gapped: 1935
Length of query: 309
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 212
Effective length of database: 6,635,264
Effective search space: 1406675968
Effective search space used: 1406675968
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 59 (26.3 bits)