RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17824
(272 letters)
>gnl|CDD|238907 cd01926, cyclophilin_ABH_like, cyclophilin_ABH_like: Cyclophilin
A, B and H-like cyclophilin-type peptidylprolyl cis-
trans isomerase (PPIase) domain. This family represents
the archetypal cystolic cyclophilin similar to human
cyclophilins A, B and H. PPIase is an enzyme which
accelerates protein folding by catalyzing the cis-trans
isomerization of the peptide bonds preceding proline
residues. These enzymes have been implicated in protein
folding processes which depend on catalytic
/chaperone-like activities. As cyclophilins, Human
hCyP-A, human cyclophilin-B (hCyP-19), S. cerevisiae
Cpr1 and C. elegans Cyp-3, are inhibited by the
immunosuppressive drug cyclopsporin A (CsA). CsA binds
to the PPIase active site. Cyp-3. S. cerevisiae Cpr1
interacts with the Rpd3 - Sin3 complex and in addition
is a component of the Set3 complex. S. cerevisiae Cpr1
has also been shown to have a role in Zpr1p nuclear
transport. Human cyclophilin H associates with the
[U4/U6.U5] tri-snRNP particles of the splicesome.
Length = 164
Score = 260 bits (667), Expect = 9e-89
Identities = 97/167 (58%), Positives = 113/167 (67%), Gaps = 4/167 (2%)
Query: 6 PRCYFDVEISDIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHR 65
P+ +FD+ I P GRIV EL++D+ P T ENFRALCTGEKG G GK Y FHR
Sbjct: 1 PKVFFDITIGGEPAGRIVMELFADVVPKTAENFRALCTGEKGKG---GKPFGYKGSTFHR 57
Query: 66 VVKGFMIQSGDFSVGNGTGGESIYGGTFEDENFELKHDEPFLLSMANRGKDTNGSQFFIT 125
V+ FMIQ GDF+ GNGTGG+SIYG F DENF+LKH P LLSMAN G +TNGSQFFIT
Sbjct: 58 VIPDFMIQGGDFTRGNGTGGKSIYGEKFPDENFKLKHTGPGLLSMANAGPNTNGSQFFIT 117
Query: 126 TQPAPHLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVKC 172
T P LDG HVVFG VV G +VV IEN+ +P + V + C
Sbjct: 118 TVKTPWLDGKHVVFGKVVEGMDVVKKIENVGSGN-GKPKKKVVIADC 163
>gnl|CDD|240249 PTZ00060, PTZ00060, cyclophilin; Provisional.
Length = 183
Score = 227 bits (581), Expect = 1e-75
Identities = 98/171 (57%), Positives = 118/171 (69%), Gaps = 3/171 (1%)
Query: 5 RPRCYFDVEISDIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFH 64
RP+ +FD+ I + P GRIVFEL+SD+ P T ENFRALC G+K ++GKNLHY +FH
Sbjct: 15 RPKVFFDISIDNAPAGRIVFELFSDVTPKTAENFRALCIGDKV--GSSGKNLHYKGSIFH 72
Query: 65 RVVKGFMIQSGDFSVGNGTGGESIYGGTFEDENFELKHDEPFLLSMANRGKDTNGSQFFI 124
R++ FM Q GD + NGTGGESIYG F DENF+LKHD+P LLSMAN G +TNGSQFFI
Sbjct: 73 RIIPQFMCQGGDITNHNGTGGESIYGRKFTDENFKLKHDQPGLLSMANAGPNTNGSQFFI 132
Query: 125 TTQPAPHLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVKCNEL 175
TT P P LDG HVVFG V+ G VV +E + P + V V C EL
Sbjct: 133 TTVPCPWLDGKHVVFGKVIEGMEVVRAMEKEGT-QSGYPKKPVVVTDCGEL 182
>gnl|CDD|238194 cd00317, cyclophilin, cyclophilin: cyclophilin-type peptidylprolyl
cis- trans isomerases. This family contains eukaryotic,
bacterial and archeal proteins which exhibit a
peptidylprolyl cis- trans isomerases activity (PPIase,
Rotamase) and in addition bind the immunosuppressive
drug cyclosporin (CsA). Immunosuppression in
vertebrates is believed to be the result of the
cyclophilin A-cyclosporin protein drug complex binding
to and inhibiting the protein-phosphatase calcineurin.
PPIase is an enzyme which accelerates protein folding by
catalyzing the cis-trans isomerization of the peptide
bonds preceding proline residues. Cyclophilins are a
diverse family in terms of function and have been
implicated in protein folding processes which depend on
catalytic /chaperone-like activities. This group
contains human cyclophilin 40, a co-chaperone of the
hsp90 chaperone system; human cyclophilin A, a
chaperone in the HIV-1 infectious process and; human
cyclophilin H, a component of the U4/U6 snRNP, whose
isomerization or chaperoning activities may play a role
in RNA splicing. .
Length = 146
Score = 192 bits (491), Expect = 2e-62
Identities = 77/157 (49%), Positives = 86/157 (54%), Gaps = 14/157 (8%)
Query: 16 DIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSG 75
D GRIV ELY D P T ENF +L G Y FHRV+ GFMIQ G
Sbjct: 3 DTTKGRIVIELYGDEAPKTVENFLSLARGG-----------FYDGTTFHRVIPGFMIQGG 51
Query: 76 DFSVGNGTGGESIYGGTFEDENFELK-HDEPFLLSMANRGKDTNGSQFFITTQPAPHLDG 134
D + GG S G F DENF LK H LSMAN G +TNGSQFFITT P PHLDG
Sbjct: 52 DPT--GTGGGGSGPGYKFPDENFPLKYHHRRGTLSMANAGPNTNGSQFFITTAPTPHLDG 109
Query: 135 VHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVK 171
H VFG VV G +VV IE D+ RP++ V +
Sbjct: 110 KHTVFGKVVEGMDVVDKIERGDTDENGRPIKPVTISD 146
>gnl|CDD|215759 pfam00160, Pro_isomerase, Cyclophilin type peptidyl-prolyl
cis-trans isomerase/CLD. The peptidyl-prolyl cis-trans
isomerases, also known as cyclophilins, share this
domain of about 109 amino acids. Cyclophilins have been
found in all organisms studied so far and catalyze
peptidyl-prolyl isomerisation during which the peptide
bond preceding proline (the peptidyl-prolyl bond) is
stabilised in the cis conformation. Mammalian
cyclophilin A (CypA) is a major cellular target for the
immunosuppressive drug cyclosporin A (CsA). Other roles
for cyclophilins may include chaperone and cell
signalling function.
Length = 144
Score = 185 bits (471), Expect = 2e-59
Identities = 71/164 (43%), Positives = 90/164 (54%), Gaps = 20/164 (12%)
Query: 9 YFDVEISDIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVK 68
+ D+ + LGRIV EL+ D P T ENF +LC K Y +FHRV+
Sbjct: 1 FLDITTN---LGRIVIELFGDEAPKTVENFLSLC-----------KKGFYDGTIFHRVIP 46
Query: 69 GFMIQSGDFSVGNGTGGESIYGGTFEDENFELKHDEPFLLSMANRGKDTNGSQFFITTQP 128
GFMIQ GD + G GTGG+ + DE + LSMA G ++ GSQFFIT
Sbjct: 47 GFMIQGGDPT-GPGTGGK----RSIPDEFVSKLKHKRGTLSMARSGPNSAGSQFFITLGD 101
Query: 129 APHLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVKC 172
APHLDG + VFG VV G +VV IE + D RPL+DV++ C
Sbjct: 102 APHLDGGYTVFGRVVEGMDVVEKIEKVETDG-DRPLKDVKITSC 144
>gnl|CDD|178694 PLN03149, PLN03149, peptidyl-prolyl isomerase H (cyclophilin H);
Provisional.
Length = 186
Score = 183 bits (466), Expect = 3e-58
Identities = 88/171 (51%), Positives = 101/171 (59%), Gaps = 4/171 (2%)
Query: 6 PRCYFDVEISDIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHR 65
P +FDV I IP GRI EL++DI P T ENFR CTGE + G Y FHR
Sbjct: 19 PVVFFDVTIGGIPAGRIKMELFADIAPKTAENFRQFCTGEF---RKAGLPQGYKGCQFHR 75
Query: 66 VVKGFMIQSGDFSVGNGTGGESIYGGTFEDENFELKHDEPFLLSMANRGKDTNGSQFFIT 125
V+K FMIQ GDF G+GTG SIYG FEDENF KH P LLSMAN G +TNG QFFIT
Sbjct: 76 VIKDFMIQGGDFLKGDGTGCVSIYGSKFEDENFIAKHTGPGLLSMANSGPNTNGCQFFIT 135
Query: 126 TQPAPHLDGVHVVFGHVV-SGQNVVSHIENLPVDKMSRPLQDVRVVKCNEL 175
LD HVVFG V+ G VV IEN+ +RP + +C E+
Sbjct: 136 CAKCDWLDNKHVVFGRVLGDGLLVVRKIENVATGPNNRPKLACVISECGEM 186
>gnl|CDD|223725 COG0652, PpiB, Peptidyl-prolyl cis-trans isomerase (rotamase) -
cyclophilin family [Posttranslational modification,
protein turnover, chaperones].
Length = 158
Score = 177 bits (452), Expect = 3e-56
Identities = 67/169 (39%), Positives = 82/169 (48%), Gaps = 20/169 (11%)
Query: 14 ISDIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQ 73
I + G I ELY D P T NF L K Y +FHRV+ GFMIQ
Sbjct: 3 ILETNKGDITIELYPDKAPKTVANFLQLV-----------KEGFYDGTIFHRVIPGFMIQ 51
Query: 74 SGDFSVGNGTGGESIYGGTFEDENFELKHD--EPFLLSMANRG-KDTNGSQFFITTQPAP 130
GD + G+GTGG F+DENF L D + LSMA G ++NGSQFFIT P
Sbjct: 52 GGDPTGGDGTGGPGP---PFKDENFALNGDRHKRGTLSMARAGDPNSNGSQFFITVVDNP 108
Query: 131 HLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQ---DVRVVKCNELV 176
LDG + VFG VV G +VV I+N D D + ++V
Sbjct: 109 FLDGKYTVFGQVVEGMDVVDKIKNGDTDDSGYVQDVPADPVKILSVKIV 157
>gnl|CDD|238908 cd01927, cyclophilin_WD40, cyclophilin_WD40: cyclophilin-type
peptidylprolyl cis- trans isomerases (cyclophilins)
having a WD40 domain. This group consists of several
hypothetical and putative eukaryotic and bacterial
proteins which have a cyclophilin domain and a WD40
domain. Function of the protein is not known.
Length = 148
Score = 175 bits (445), Expect = 2e-55
Identities = 77/153 (50%), Positives = 96/153 (62%), Gaps = 13/153 (8%)
Query: 19 LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
G I L+ + P T ENF +N +Y N +FHRV+KGFMIQ+GD
Sbjct: 6 KGDIHIRLFPEEAPKTVENFTTHA-----------RNGYYNNTIFHRVIKGFMIQTGD-P 53
Query: 79 VGNGTGGESIYGGTFEDE-NFELKHDEPFLLSMANRGKDTNGSQFFITTQPAPHLDGVHV 137
G+GTGGESI+G FEDE + LKHD P+ LSMAN G +TNGSQFFITT P LD H
Sbjct: 54 TGDGTGGESIWGKEFEDEFSPSLKHDRPYTLSMANAGPNTNGSQFFITTVATPWLDNKHT 113
Query: 138 VFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVV 170
VFG VV G +VV IEN+ DK RP +D++++
Sbjct: 114 VFGRVVKGMDVVQRIENVKTDKNDRPYEDIKII 146
>gnl|CDD|238909 cd01928, Cyclophilin_PPIL3_like, Cyclophilin_PPIL3_like. Proteins
similar to Human cyclophilin-like peptidylprolyl cis-
trans isomerase (PPIL3). Members of this family lack a
key residue important for cyclosporin binding: the
tryptophan residue corresponding to W121 in human
hCyP-18a; most members have a histidine at this
position. The exact function of the protein is not
known.
Length = 153
Score = 165 bits (420), Expect = 1e-51
Identities = 77/152 (50%), Positives = 95/152 (62%), Gaps = 13/152 (8%)
Query: 19 LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
LG I EL+ D CP CENF ALC +Y +FHR +KGFM+Q+GD
Sbjct: 9 LGDIKIELFCDDCPKACENFLALCASG-----------YYNGCIFHRNIKGFMVQTGD-P 56
Query: 79 VGNGTGGESIYGGTFEDENFE-LKHDEPFLLSMANRGKDTNGSQFFITTQPAPHLDGVHV 137
G G GGESI+G FEDE E LKHD ++SMAN G +TNGSQFFIT PHLDG +
Sbjct: 57 TGTGKGGESIWGKKFEDEFRETLKHDSRGVVSMANNGPNTNGSQFFITYAKQPHLDGKYT 116
Query: 138 VFGHVVSGQNVVSHIENLPVDKMSRPLQDVRV 169
VFG V+ G + +E LPVDK RPL+++R+
Sbjct: 117 VFGKVIDGFETLDTLEKLPVDKKYRPLEEIRI 148
>gnl|CDD|238904 cd01923, cyclophilin_RING, cyclophilin_RING: cyclophilin-type
peptidylprolyl cis- trans isomerases (cyclophilins)
having a modified RING finger domain. This group
includes the nuclear proteins, Human hCyP-60 and
Caenorhabditis elegans MOG-6 which, compared to the
archetypal cyclophilin Human cyclophilin A exhibit
reduced peptidylprolyl cis- trans isomerase activity and
lack a residue important for cyclophilin binding. Human
hCyP-60 has been shown to physically interact with the
proteinase inhibitor peptide eglin c and; C. elegans
MOG-6 to physically interact with MEP-1, a nuclear zinc
finger protein. MOG-6 has been shown to function in
germline sex determination.
Length = 159
Score = 159 bits (403), Expect = 5e-49
Identities = 67/152 (44%), Positives = 87/152 (57%), Gaps = 13/152 (8%)
Query: 19 LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
G + EL+ D P CENF LC K +Y +FHR ++ FMIQ GD +
Sbjct: 8 KGDLNLELHCDKAPKACENFIKLC-----------KKGYYDGTIFHRSIRNFMIQGGDPT 56
Query: 79 VGNGTGGESIYGGTFEDE-NFELKHDEPFLLSMANRGKDTNGSQFFITTQPAPHLDGVHV 137
G G GGESI+G F+DE L HD +LSMAN G +TNGSQFFIT + HLDG H
Sbjct: 57 -GTGRGGESIWGKPFKDEFKPNLSHDGRGVLSMANSGPNTNGSQFFITYRSCKHLDGKHT 115
Query: 138 VFGHVVSGQNVVSHIENLPVDKMSRPLQDVRV 169
VFG VV G + +EN+P RP +++++
Sbjct: 116 VFGRVVGGLETLEAMENVPDPGTDRPKEEIKI 147
>gnl|CDD|238903 cd01922, cyclophilin_SpCYP2_like, cyclophilin_SpCYP2_like:
cyclophilin 2-like peptidylprolyl cis- trans isomerase
(PPIase) domain similar to Schizosaccharomyces pombe
cyp-2. These proteins bind their respective SNW
chromatin binding protein in autologous systems, in a
CsA independent manner indicating interaction with a
surface outside the PPIase active site. SNW proteins
play a basic and broad range role in signaling.
Length = 146
Score = 145 bits (368), Expect = 7e-44
Identities = 69/154 (44%), Positives = 90/154 (58%), Gaps = 14/154 (9%)
Query: 19 LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
+G I ELY + P TC+NF L +G +Y +FHR++K FMIQ GD
Sbjct: 6 MGEITLELYWNHAPKTCKNFYELAK--RG---------YYNGTIFHRLIKDFMIQGGD-P 53
Query: 79 VGNGTGGESIYGGTFEDE-NFELKHDEPFLLSMANRGKDTNGSQFFITTQPAPHLDGVHV 137
G G GG SIYG FEDE + ELKH +LSMAN G +TNGSQFFIT P P LDG H
Sbjct: 54 TGTGRGGASIYGKKFEDEIHPELKHTGAGILSMANAGPNTNGSQFFITLAPTPWLDGKHT 113
Query: 138 VFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVK 171
+FG V G V+ ++ + RP+ +V+++K
Sbjct: 114 IFGRVSKGMKVIENMVEVQTQT-DRPIDEVKILK 146
>gnl|CDD|238906 cd01925, cyclophilin_CeCYP16-like, cyclophilin_CeCYP16-like:
cyclophilin-type peptidylprolyl cis- trans isomerase)
(PPIase) domain similar to Caenorhabditis elegans
cyclophilin 16. C. elegans CeCYP-16, compared to the
archetypal cyclophilin Human cyclophilin A has, a
reduced peptidylprolyl cis- trans isomerase activity, is
cyclosporin insensitive and shows an altered substrate
preference favoring, hydrophobic, acidic or amide amino
acids. Most members of this subfamily have a glutamate
residue in the active site at the position equivalent to
a tryptophan (W121 in Human cyclophilin A), which has
been shown to be important for cyclophilin binding.
Length = 171
Score = 124 bits (314), Expect = 2e-35
Identities = 62/157 (39%), Positives = 83/157 (52%), Gaps = 16/157 (10%)
Query: 19 LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
G I EL+S P C NF LC +Y N +FHRVV GF+IQ GD +
Sbjct: 14 AGDIDIELWSKEAPKACRNFIQLCL-----------EGYYDNTIFHRVVPGFIIQGGDPT 62
Query: 79 VGNGTGGESIYGGTFEDE-NFELKHDEPFLLSMANRGKDTNGSQFFITTQPAPHLDGVHV 137
G GTGGESIYG F+DE + L+ + L+ MAN G D+NGSQFF T A L+ H
Sbjct: 63 -GTGTGGESIYGEPFKDEFHSRLRFNRRGLVGMANAGDDSNGSQFFFTLDKADELNNKHT 121
Query: 138 VFGHVV--SGQNVVSHIENLPVDKMSRPLQDVRVVKC 172
+FG V + N++ + + DK RP+ ++
Sbjct: 122 LFGKVTGDTIYNLLK-LAEVETDKDERPVYPPKITSV 157
>gnl|CDD|238902 cd01921, cyclophilin_RRM, cyclophilin_RRM: cyclophilin-type
peptidylprolyl cis- trans isomerase domain occuring with
a C-terminal RNA recognition motif domain (RRM). This
subfamily of the cyclophilin domain family contains a
number of eukaryotic cyclophilins having the RRM domain
including the nuclear proteins: human hCyP-57,
Arabidopsis thaliana AtCYP59, Caenorhabditis elegans
CeCyP-44 and Paramecium tetrurelia Kin241. The Kin241
protein has been shown to have a role in cell
morphogenesis.
Length = 166
Score = 117 bits (294), Expect = 1e-32
Identities = 65/160 (40%), Positives = 88/160 (55%), Gaps = 21/160 (13%)
Query: 19 LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
LG +V +L++D CP+ C NF LC K +Y +F+ V K F+ Q+GD +
Sbjct: 6 LGDLVIDLFTDECPLACLNFLKLC-----------KLKYYNFCLFYNVQKDFIAQTGDPT 54
Query: 79 VGNGTGGESIYGGT-------FEDE-NFELKHDEPFLLSMANRGKDTNGSQFFIT-TQPA 129
G G GGESIY FE E LKH + +SM N G + NGSQF+IT +
Sbjct: 55 -GTGAGGESIYSQLYGRQARFFEPEILPLLKHSKKGTVSMVNAGDNLNGSQFYITLGENL 113
Query: 130 PHLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRV 169
+LDG H VFG VV G +V+ I + VD RPL+D+R+
Sbjct: 114 DYLDGKHTVFGQVVEGFDVLEKINDAIVDDDGRPLKDIRI 153
>gnl|CDD|140248 PTZ00221, PTZ00221, cyclophilin; Provisional.
Length = 249
Score = 118 bits (296), Expect = 5e-32
Identities = 76/201 (37%), Positives = 104/201 (51%), Gaps = 7/201 (3%)
Query: 7 RCYFDVEISDIPLGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRV 66
R + D+ I D+ GR+VFEL+ D+ P T ENFRAL TG GI TG L YL H V
Sbjct: 54 RAFLDISIGDVLAGRLVFELFEDVVPETVENFRALITGSCGIDTNTGVKLDYLYTPVHHV 113
Query: 67 VKGF-MIQSGDF-SVGNGTGGESIYGGTFEDENFELKHDEPFLLSMANRGKDTNGSQFFI 124
+ +I G+ S + G I DE + +H E LL+M + G T+GS F I
Sbjct: 114 DRNNNIIVLGELDSFNVSSTGTPI-----ADEGYRHRHTERGLLTMISEGPHTSGSVFGI 168
Query: 125 TTQPAPHLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVKCNELVLKSKLKAK 184
T P+P LD VVFG V +++ +E+LP+D + RPL V V C L + +
Sbjct: 169 TLGPSPSLDFKQVVFGKAVDDLSLLEKLESLPLDDVGRPLLPVTVSFCGALTGEKPPGRQ 228
Query: 185 EKRKASPTSASSDESDKEEEE 205
+ A+ SASS+ EE
Sbjct: 229 QLLAAADDSASSEHVACAAEE 249
>gnl|CDD|238901 cd01920, cyclophilin_EcCYP_like, cyclophilin_EcCYP_like:
cyclophilin-type A-like peptidylprolyl cis- trans
isomerase (PPIase) domain similar to the cytosolic E.
coli cyclophilin A and Streptomyces antibioticus
SanCyp18. Compared to the archetypal cyclophilin Human
cyclophilin A, these have reduced affinity for
cyclosporin A. E. coli cyclophilin A has a similar
peptidylprolyl cis- trans isomerase activity to the
human cyclophilin A. Most members of this subfamily
contain a phenylalanine residue at the position
equivalent to Human cyclophilin W121, where a tyrptophan
has been shown to be important for cyclophilin binding.
Length = 155
Score = 80.2 bits (198), Expect = 1e-18
Identities = 53/170 (31%), Positives = 65/170 (38%), Gaps = 47/170 (27%)
Query: 19 LGRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFS 78
LG IV ELY D PIT ENF A KG Y N +FHRV+ GF+IQ
Sbjct: 6 LGDIVVELYDDKAPITVENFLAYV--RKG---------FYDNTIFHRVISGFVIQ----- 49
Query: 79 VGNGTGGESIYGGTFEDENFELKHDEPFLLSMANRGKDTNG--------------SQFFI 124
GG F + + + +P N +T G SQFFI
Sbjct: 50 -----------GGGFTPDLAQKETLKPIKNEAGNGLSNTRGTIAMARTNAPDSATSQFFI 98
Query: 125 TTQPAPHLD-----GVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRV 169
+ LD + VFG V G +VV I + QDV V
Sbjct: 99 NLKDNASLDYQNEQWGYTVFGEVTEGMDVVDKIAGVETYS-FGSYQDVPV 147
>gnl|CDD|182824 PRK10903, PRK10903, peptidyl-prolyl cis-trans isomerase A (rotamase
A); Provisional.
Length = 190
Score = 56.4 bits (136), Expect = 1e-09
Identities = 48/171 (28%), Positives = 64/171 (37%), Gaps = 51/171 (29%)
Query: 20 GRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFSV 79
G I EL S P++ +NF + Y N FHRV+ GFMIQ G F+
Sbjct: 38 GNIELELNSQKAPVSVKNFVDYVN-----------SGFYNNTTFHRVIPGFMIQGGGFT- 85
Query: 80 GNGTGGESIYGGTFEDENFELKHDEPFLLSMANRG---------------KDTNGSQFFI 124
E + K P + + A+ G KD+ SQFFI
Sbjct: 86 ----------------EQMQQKKPNPPIKNEADNGLRNTRGTIAMARTADKDSATSQFFI 129
Query: 125 TTQPAPHLD------GVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRV 169
LD G + VFG VV G +V I +P + P Q+V
Sbjct: 130 NVADNAFLDHGQRDFG-YAVFGKVVKGMDVADKISQVPTHDVG-PYQNVPS 178
>gnl|CDD|182734 PRK10791, PRK10791, peptidyl-prolyl cis-trans isomerase B (rotamase
B); Provisional.
Length = 164
Score = 51.8 bits (124), Expect = 3e-08
Identities = 45/147 (30%), Positives = 63/147 (42%), Gaps = 27/147 (18%)
Query: 20 GRIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGDFSV 79
G IV + + D P T +NF C + Y N +FHRV+ GFMIQ G F
Sbjct: 9 GDIVIKTFDDKAPETVKNFLDYC-----------REGFYNNTIFHRVINGFMIQGGGFEP 57
Query: 80 GNGTGGESIYGGTFEDE-NFELKHDEPFLLSMANRGKDTNG--SQFFITTQP-------A 129
G + ++E N LK+ L+MA R + + +QFFI
Sbjct: 58 GM---KQKATKEPIKNEANNGLKNTRG-TLAMA-RTQAPHSATAQFFINVVDNDFLNFSG 112
Query: 130 PHLDGV-HVVFGHVVSGQNVVSHIENL 155
L G + VF VV G +VV I+ +
Sbjct: 113 ESLQGWGYCVFAEVVEGMDVVDKIKGV 139
>gnl|CDD|238905 cd01924, cyclophilin_TLP40_like, cyclophilin_TLP40_like:
cyclophilin-type peptidylprolyl cis- trans isomerases
(cyclophilins) similar ot the Spinach thylakoid lumen
protein TLP40. Compared to the archetypal cyclophilin
Human cyclophilin A, these proteins have similar
peptidylprolyl cis- trans isomerase activity and reduced
affinity for cyclosporin A. Spinach TLP40 has been shown
to have a dual function as a folding catalyst and
regulator of dephosphorylation.
Length = 176
Score = 51.7 bits (124), Expect = 3e-08
Identities = 47/167 (28%), Positives = 67/167 (40%), Gaps = 47/167 (28%)
Query: 21 RIVFELYSDICPITCENFRALCTGEKGIGKTTGKNLHYLNIVFHRVVKGFMIQSGD-FSV 79
IV + Y+ P+T NF L E+G Y + FHRV GF++Q+GD
Sbjct: 10 TIVLDGYN--APVTAGNFVDLV--ERGF---------YDGMEFHRVEGGFVVQTGDPQGK 56
Query: 80 GNG-----TG--------------GESIYGGTFE-----DENFELKHDEPFLLSMANRGK 115
G TG + +YG T E DE L + ++MA
Sbjct: 57 NPGFPDPETGKSRTIPLEIKPEGQKQPVYGKTLEEAGRYDEQPVLPFNAFGAIAMARTEF 116
Query: 116 DTNG--SQFFI-------TTQPAPHLDGVHVVFGHVVSGQNVVSHIE 153
D N SQFF T LDG + VFG+V G +++ ++
Sbjct: 117 DPNSASSQFFFLLKDNELTPSRNNVLDGRYAVFGYVTDGLDILRELK 163
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 48.0 bits (115), Expect = 2e-06
Identities = 24/74 (32%), Positives = 46/74 (62%)
Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
K+ +K + +EK+ ++ KK KK K KK + EEEEEKE+ E+ E+E +
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 259 KKHKKDKKKRKKKS 272
++ +++++K+KK++
Sbjct: 464 EEKEEEEEKKKKQA 477
Score = 44.9 bits (107), Expect = 2e-05
Identities = 22/73 (30%), Positives = 45/73 (61%)
Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
S ++ K+ ++ E + E+E K++KK KKK+++ + E+E+++EE E+ + E
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Query: 254 EAKKRKKHKKDKK 266
E +K ++ +K KK
Sbjct: 463 EEEKEEEEEKKKK 475
Score = 44.9 bits (107), Expect = 2e-05
Identities = 20/72 (27%), Positives = 42/72 (58%)
Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
K + E++ +EE ++ + A K+K+ +++++K+KK+ + EEEEE+ E
Sbjct: 404 GSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 246 GEQTDSEKEAKK 257
E+ + E++ KK
Sbjct: 464 EEKEEEEEKKKK 475
Score = 44.1 bits (105), Expect = 4e-05
Identities = 20/73 (27%), Positives = 42/73 (57%)
Query: 189 ASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
A+ E+ +++ E+ EK+ K KK+++++++K K EEE+E+EE E
Sbjct: 403 TGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Query: 249 TDSEKEAKKRKKH 261
+ ++E +++KK
Sbjct: 463 EEEKEEEEEKKKK 475
Score = 43.8 bits (104), Expect = 5e-05
Identities = 19/69 (27%), Positives = 42/69 (60%)
Query: 204 EEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKK 263
E+E T S+K KK KK + +KK+++ K E++++ G++ + E+E +K KK ++
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 264 DKKKRKKKS 272
+++ ++
Sbjct: 455 KEEEEEEAE 463
Score = 41.1 bits (97), Expect = 4e-04
Identities = 17/76 (22%), Positives = 47/76 (61%), Gaps = 8/76 (10%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
+ + K KA++KR+ +E +++++ G++ + E+E +K KK ++ ++++++
Sbjct: 410 KKIKKIVEKAEKKRE--------EEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEE 461
Query: 234 KSKYEEEEEKEEGEQT 249
+ +EEEE+++ +Q
Sbjct: 462 AEEEKEEEEEKKKKQA 477
Score = 40.3 bits (95), Expect = 7e-04
Identities = 24/81 (29%), Positives = 47/81 (58%), Gaps = 8/81 (9%)
Query: 200 DKEEEEKEEGEQTDSEKEAK-KRKKHKKDKKKKKKKSKY-------EEEEEKEEGEQTDS 251
+EE E G + ++K K K KK +++KK+K K EEEEE+E+ ++ +
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEE 454
Query: 252 EKEAKKRKKHKKDKKKRKKKS 272
++E ++ + +K++++ KKK
Sbjct: 455 KEEEEEEAEEEKEEEEEKKKK 475
Score = 33.4 bits (77), Expect = 0.11
Identities = 15/58 (25%), Positives = 38/58 (65%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
K + + K+++K + E ++EEE++++ E+ + E+E + +K ++++KKKK+ +
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQAT 478
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 47.2 bits (112), Expect = 5e-06
Identities = 29/106 (27%), Positives = 55/106 (51%), Gaps = 1/106 (0%)
Query: 168 RVVKCNELVLKSKLK-AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKK 226
RV K +K K AKE + S ++ +EE+K++ E+ E + +K K+ K
Sbjct: 79 RVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138
Query: 227 DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+K+ K+K K +E++ +E ++ + +K + R K + K +KK
Sbjct: 139 EKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPP 184
Score = 41.8 bits (98), Expect = 2e-04
Identities = 28/109 (25%), Positives = 52/109 (47%), Gaps = 1/109 (0%)
Query: 165 QDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTD-SEKEAKKRKK 223
D V + + K + K + +E ++E+ KEE ++ KE K +K
Sbjct: 73 SDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRK 132
Query: 224 HKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K++ K+K+ + E+E+EK+ E D E+E K+ + K + K+ K
Sbjct: 133 PKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKK 181
Score = 41.4 bits (97), Expect = 3e-04
Identities = 31/102 (30%), Positives = 53/102 (51%), Gaps = 1/102 (0%)
Query: 171 KCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 230
KC E L S K K ++ +E + E G++ + EKE K +K KK +K
Sbjct: 65 KCAESKLSSDEAVKRVEKGGSKGPAAKTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKP 124
Query: 231 KK-KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
K+ K + +EE KE+ + EKE +K+ + +D+++ KK+
Sbjct: 125 KEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKR 166
Score = 40.3 bits (94), Expect = 7e-04
Identities = 22/101 (21%), Positives = 45/101 (44%), Gaps = 5/101 (4%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSE----KEAKKRKKHKKDKKKKK 232
K + ++ +E+++K+E + + + KE K K+ K+K+K+K
Sbjct: 91 KTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEK 150
Query: 233 KKSKYEE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+K E + E+E+ + K K+ KK K+K+
Sbjct: 151 EKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPP 191
Score = 38.7 bits (90), Expect = 0.002
Identities = 19/94 (20%), Positives = 47/94 (50%), Gaps = 1/94 (1%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
+ + K+K+K P D K EE KE+ + EKE +K+ + +D++++KK+ +
Sbjct: 111 EQVKEEKKKKKEKPKEEPKDRKPK-EEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV 169
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ ++ + + K+ + +K ++ ++
Sbjct: 170 RAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREA 203
Score = 38.7 bits (90), Expect = 0.002
Identities = 23/94 (24%), Positives = 43/94 (45%), Gaps = 2/94 (2%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K K + K K P ++ K EE ++ E+ E+ + K K KKK +K
Sbjct: 130 DRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERV--RAKSRPKKPPKKKPPNKK 187
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+E E+E+ Q E K ++ ++++ K++
Sbjct: 188 KEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEE 221
Score = 34.9 bits (80), Expect = 0.038
Identities = 23/94 (24%), Positives = 41/94 (43%), Gaps = 4/94 (4%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K K K KEK+ P E +K+ E + + KK KK+ +++K+ +
Sbjct: 144 KEKEKEKEKKVEEP---RDREEEKKRERVRAKSRPKKPPK-KKPPNKKKEPPEEEKQRQA 199
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
E K + E+ D +E +K + KD++
Sbjct: 200 AREAVKGKPEEPDVNEEREKEEDDGKDRETTTSP 233
Score = 29.1 bits (65), Expect = 2.7
Identities = 17/97 (17%), Positives = 41/97 (42%), Gaps = 2/97 (2%)
Query: 173 NELVLKSKLKAKEK-RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK 231
+ K + + + K R P KE E+E+ Q E K ++ +++++
Sbjct: 159 DREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEERE 218
Query: 232 KKKSKYEEEEE-KEEGEQTDSEKEAKKRKKHKKDKKK 267
K++ ++ E E+ +S + ++ ++ KK
Sbjct: 219 KEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKK 255
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 46.7 bits (110), Expect = 9e-06
Identities = 27/107 (25%), Positives = 49/107 (45%), Gaps = 3/107 (2%)
Query: 169 VVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR---KKHK 225
K + +A E++ + + K + K++ E+ EAKK+ K K
Sbjct: 1347 AAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKK 1406
Query: 226 KDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
D+ KK +K + +E K++ E+ EAKK+ + K + KKK+
Sbjct: 1407 ADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKA 1453
Score = 46.7 bits (110), Expect = 9e-06
Identities = 27/99 (27%), Positives = 53/99 (53%), Gaps = 5/99 (5%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK-----KHKKDKKKKK 232
+ K+KA+E +KA ++E K EEEK++ EQ ++ +K+K K +++ K K
Sbjct: 1604 EKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKA 1663
Query: 233 KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ + EE+K++ E+ +E +K+ K+ + K
Sbjct: 1664 AEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAK 1702
Score = 46.3 bits (109), Expect = 1e-05
Identities = 33/99 (33%), Positives = 57/99 (57%), Gaps = 5/99 (5%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKK-----RKKHKKDKKKKK 232
++K KA+E +KA ++E+ K +E K++ E+ EAKK +KK + KK +
Sbjct: 1448 EAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAE 1507
Query: 233 KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
K K +E ++ EE ++ D K+A++ KK + KK +KK
Sbjct: 1508 AKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKK 1546
Score = 45.9 bits (108), Expect = 2e-05
Identities = 29/90 (32%), Positives = 51/90 (56%), Gaps = 2/90 (2%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
KA+EK+KA ++E+ K +E K++ E+ ++K+A KK ++ KK + +K E E
Sbjct: 1297 KAEEKKKADEAKKKAEEAKKADEAKKKAEE--AKKKADAAKKKAEEAKKAAEAAKAEAEA 1354
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+E E + + EA ++KK + KK K
Sbjct: 1355 AADEAEAAEEKAEAAEKKKEEAKKKADAAK 1384
Score = 45.5 bits (107), Expect = 2e-05
Identities = 26/95 (27%), Positives = 51/95 (53%), Gaps = 5/95 (5%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK-----KHKKDKKKKKKKSK 236
A+E +KA ++E+ K EE++++ + ++ + +K K + ++KKK ++ K
Sbjct: 1663 AAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELK 1722
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
EEE K + E+ E E K+K + K + +KK
Sbjct: 1723 KAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKK 1757
Score = 45.5 bits (107), Expect = 2e-05
Identities = 28/95 (29%), Positives = 49/95 (51%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
++K KA +K + + + E+ K E E E +E++A+ +K K++ KKK +K
Sbjct: 1326 EAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ EE+K+ E +E KK+ K KKK+
Sbjct: 1386 KAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKA 1420
Score = 44.4 bits (104), Expect = 4e-05
Identities = 29/97 (29%), Positives = 54/97 (55%), Gaps = 2/97 (2%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR--KKHKKDKKKKKKKS 235
++K KA+EK+KA ++E+ K +E K++ E+ +EAKK+ + K D+ KKK +
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEE 1481
Query: 236 KYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ +E K++ E+ + + K+ K K KK+
Sbjct: 1482 AKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKA 1518
Score = 44.0 bits (103), Expect = 5e-05
Identities = 29/92 (31%), Positives = 51/92 (55%), Gaps = 3/92 (3%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
++K KA+E +KA ++E+ K EE K++ E+ EAKK+ + K + KKK+
Sbjct: 1435 EAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKA-- 1492
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
EE +K+ E + + KK + KK ++ +K
Sbjct: 1493 -EEAKKKADEAKKAAEAKKKADEAKKAEEAKK 1523
Score = 43.6 bits (102), Expect = 8e-05
Identities = 26/96 (27%), Positives = 50/96 (52%), Gaps = 5/96 (5%)
Query: 182 KAKEKRKASPTSAS-----SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
KA+EK+KA + +DE+ K+ EE ++ ++ + E K+K KK ++ K
Sbjct: 1285 KAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKA 1344
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
E + + E ++E +K + +K K++ KKK+
Sbjct: 1345 AEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKA 1380
Score = 43.2 bits (101), Expect = 1e-04
Identities = 29/95 (30%), Positives = 49/95 (51%), Gaps = 2/95 (2%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR--KKHKKDKKKKKKKSKY 237
K A K+KA ++E K +E K++ E+ EAKK+ + K ++ KKK +
Sbjct: 1411 KKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK 1470
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ +E K++ E+ EAKK+ + K K KK+
Sbjct: 1471 KADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKA 1505
Score = 42.8 bits (100), Expect = 1e-04
Identities = 25/94 (26%), Positives = 45/94 (47%), Gaps = 1/94 (1%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
+ K KA E +K + + +DE+ K+ EE K++ + + E K+ + + +
Sbjct: 1299 EEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADE 1358
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
E EEK E + E+ KK KK +++KK
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKK 1392
Score = 42.4 bits (99), Expect = 2e-04
Identities = 30/105 (28%), Positives = 55/105 (52%), Gaps = 10/105 (9%)
Query: 178 KSKLKAKEKRKASPTSAS-----SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
++ +KA+E RKA + +DE+ K EE+K+ E +EAKK + KK ++ K
Sbjct: 1269 QAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAK 1328
Query: 233 KKS-----KYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
KK+ K EE ++ E + ++E A + + ++ + +KK
Sbjct: 1329 KKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKK 1373
Score = 42.4 bits (99), Expect = 2e-04
Identities = 31/95 (32%), Positives = 52/95 (54%), Gaps = 4/95 (4%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
+ K KA E +KA+ +DE+ K+ EEK++ ++ + E K K D+ KKK +
Sbjct: 1402 EDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAK----KADEAKKKAEEAK 1457
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ EE K++ E+ EAKK+ + K + KKK+
Sbjct: 1458 KAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKA 1492
Score = 42.1 bits (98), Expect = 2e-04
Identities = 29/96 (30%), Positives = 52/96 (54%), Gaps = 3/96 (3%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK---KHKKDKKKKKKKSK 236
K K + K+KA ++E K +E K++ E+ + + K+ K K D+ KKK + K
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK 1430
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ +E K++ E+ EAKK+ + K ++ KKK+
Sbjct: 1431 KKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKA 1466
Score = 41.7 bits (97), Expect = 3e-04
Identities = 28/100 (28%), Positives = 56/100 (56%), Gaps = 6/100 (6%)
Query: 178 KSKLKAKEKRKASPTSA--SSDESDKEEEEKEEGEQTDSEKEAKKRK----KHKKDKKKK 231
+ K KA+E +KA + +++E+ K EE+K++ E+ +E +K+ K + ++ KK
Sbjct: 1644 EEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKK 1703
Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
++ K +E EEK++ E+ +E K K + K+ + K
Sbjct: 1704 AEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDK 1743
Score = 41.3 bits (96), Expect = 4e-04
Identities = 29/98 (29%), Positives = 58/98 (59%), Gaps = 6/98 (6%)
Query: 178 KSKLKAKEKRKASPTSASSDES-DKEEEEKEEGEQTDSEKEAKKRK-----KHKKDKKKK 231
+ K KA+E +KA + E+ KE EE ++ E+ ++ +K+K K +++ K K
Sbjct: 1672 EDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIK 1731
Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
+++K E EE+K++ E+ ++E KK+ H K ++++K
Sbjct: 1732 AEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKK 1769
Score = 41.3 bits (96), Expect = 5e-04
Identities = 28/87 (32%), Positives = 53/87 (60%), Gaps = 2/87 (2%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
K KA+E +KA ++E+ K +E K++ E+ ++K+A + KK + KKK + K EE
Sbjct: 1463 KKKAEEAKKADEAKKKAEEAKKADEAKKKAEE--AKKKADEAKKAAEAKKKADEAKKAEE 1520
Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKDKK 266
++ +E ++ + K+A + KK ++ KK
Sbjct: 1521 AKKADEAKKAEEAKKADEAKKAEEKKK 1547
Score = 40.5 bits (94), Expect = 8e-04
Identities = 28/95 (29%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 178 KSKLKAKE-KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
+ KA E K+KA +DE+ K E K++ ++ +EAKK + KK ++ KK
Sbjct: 1480 EEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEA 1539
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ EE+K+ E +E+ K +K K ++ K+ ++
Sbjct: 1540 KKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEE 1574
Score = 39.4 bits (91), Expect = 0.002
Identities = 25/98 (25%), Positives = 49/98 (50%), Gaps = 4/98 (4%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR----KKHKKDKKKKKK 233
+++++ K ++E+ K EE K + E+ +E KK+ KK + ++KKK +
Sbjct: 1591 EARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE 1650
Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ K EEE K + + + E K+K + K + +K
Sbjct: 1651 ELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEK 1688
Score = 38.6 bits (89), Expect = 0.003
Identities = 29/93 (31%), Positives = 49/93 (52%), Gaps = 3/93 (3%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK---KHKKDKKKKKKKSKYE 238
KA+E +KA EEE+K + E+ +EAK + K +++KKK ++ K +
Sbjct: 1582 KAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKK 1641
Query: 239 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
E EEK++ E+ +E K K ++ KK + K
Sbjct: 1642 EAEEKKKAEELKKAEEENKIKAAEEAKKAEEDK 1674
Score = 37.4 bits (86), Expect = 0.008
Identities = 25/93 (26%), Positives = 51/93 (54%), Gaps = 1/93 (1%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
+ KA E +K + + +DE+ K+ EE K++ ++ EAKK+ K ++ KK +
Sbjct: 1467 EEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADE 1526
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
++ EE ++ ++ +E KK + KK ++ +K
Sbjct: 1527 AKKAEEAKKADEAKKAEEKKKADELKKAEELKK 1559
Score = 37.0 bits (85), Expect = 0.010
Identities = 30/99 (30%), Positives = 53/99 (53%), Gaps = 4/99 (4%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
++K KA E +KA+ +DE+ K EE K+ E +E+ K + K ++KKK + K
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKK 1553
Query: 238 EEE----EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
EE EEK++ E+ +E K K ++ K+ +++
Sbjct: 1554 AEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEA 1592
Score = 36.7 bits (84), Expect = 0.015
Identities = 25/91 (27%), Positives = 53/91 (58%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
KA+EK+KA + ++ + + EE ++ + + + K ++++KK K +++K EE
Sbjct: 1559 KAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEA 1618
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ + E +E+E KK ++ KK + + KKK+
Sbjct: 1619 KIKAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649
Score = 34.7 bits (79), Expect = 0.060
Identities = 24/90 (26%), Positives = 47/90 (52%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
KA+ +KA ++E+ K EEE+ E E+ ++ K ++ K +E +
Sbjct: 1225 KAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELK 1284
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ EE ++ D K+A+++KK + KKK ++
Sbjct: 1285 KAEEKKKADEAKKAEEKKKADEAKKKAEEA 1314
Score = 33.6 bits (76), Expect = 0.12
Identities = 30/96 (31%), Positives = 52/96 (54%), Gaps = 8/96 (8%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE--- 238
KA+EK+KA + E K+ EEK++ E+ +E K K ++ KK ++++ E
Sbjct: 1541 KAEEKKKADELKKA--EELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVM 1598
Query: 239 ---EEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
EEE+K + E+ +EAK + + K ++ KKK
Sbjct: 1599 KLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKK 1634
Score = 33.6 bits (76), Expect = 0.14
Identities = 30/92 (32%), Positives = 56/92 (60%), Gaps = 10/92 (10%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEE---KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
K +A+EK+KA +E K EEE K E + ++E++ KK ++ KKD+++KKK +
Sbjct: 1709 KKEAEEKKKA-------EELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAH 1761
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
++EEEK+ E ++ + + ++D+K+R
Sbjct: 1762 LKKEEEKKAEEIRKEKEAVIEEELDEEDEKRR 1793
Score = 32.0 bits (72), Expect = 0.34
Identities = 27/133 (20%), Positives = 58/133 (43%), Gaps = 22/133 (16%)
Query: 162 RPLQDVRVVKCNELVLKSKLKAKEKRKA----------------------SPTSASSDES 199
R +D + + + ++K A+E +KA + ++E+
Sbjct: 1218 RKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEA 1277
Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
K +E K+ E+ +++ K +K K D+ KKK + + +E K++ E+ + +A K+K
Sbjct: 1278 RKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKK 1337
Query: 260 KHKKDKKKRKKKS 272
+ K K+
Sbjct: 1338 AEEAKKAAEAAKA 1350
Score = 32.0 bits (72), Expect = 0.36
Identities = 27/94 (28%), Positives = 54/94 (57%), Gaps = 1/94 (1%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
++K KA E +KA + +DE+ K EE K+ E +E++ K + K ++ KK ++ K
Sbjct: 1507 EAKKKADEAKKAE-EAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKK 1565
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
EE +K E ++ + ++A++ KK ++ + + K
Sbjct: 1566 AEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMK 1599
Score = 31.6 bits (71), Expect = 0.53
Identities = 27/92 (29%), Positives = 45/92 (48%), Gaps = 1/92 (1%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEK-EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
KA+E RKA + EEE K EE + + K+A+ KK ++ KK ++ K EEE
Sbjct: 1189 KAEELRKAEDARKAEAARKAEEERKAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEE 1248
Query: 241 EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
EE + + + A ++ K + +K+
Sbjct: 1249 RNNEEIRKFEEARMAHFARRQAAIKAEEARKA 1280
Score = 31.3 bits (70), Expect = 0.61
Identities = 24/95 (25%), Positives = 53/95 (55%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
+ K KA+E +KA + E K+E E+++ + +++K+ +++KK KK+++KK++
Sbjct: 1713 EEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEE 1772
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+E++ E+ E++ K+R + K K
Sbjct: 1773 IRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNF 1807
Score = 30.9 bits (69), Expect = 0.94
Identities = 24/94 (25%), Positives = 51/94 (54%), Gaps = 1/94 (1%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
+ KA E +KA + +DE+ K EE+K+ E +E+ K +K K ++ KK ++ K
Sbjct: 1519 EEAKKADEAKKAE-EAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKN 1577
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ EE ++ + + + K ++++KK + ++
Sbjct: 1578 MALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEE 1611
Score = 30.1 bits (67), Expect = 1.6
Identities = 24/90 (26%), Positives = 45/90 (50%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
KA+E RKA + EE +K+ E +E+E + K ++ + ++ +
Sbjct: 1213 KAEEARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAI 1272
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ EE + D K+A+++KK + KK +KK
Sbjct: 1273 KAEEARKADELKKAEEKKKADEAKKAEEKK 1302
Score = 28.6 bits (63), Expect = 5.2
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 8/103 (7%)
Query: 178 KSKLKAKEKRKAS---PTSASSDESDKEEEEKEEGEQTDSEKEAKKRK-----KHKKDKK 229
+ K KA+E +KA + E K+ EE E +E KK K K ++ K
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620
Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K ++ K EEE++K E + +E KK ++ KK +++ K K+
Sbjct: 1621 KAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKA 1663
Score = 28.2 bits (62), Expect = 5.5
Identities = 22/90 (24%), Positives = 46/90 (51%)
Query: 170 VKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK 229
K E + K++ + K K + + A D+ EE +K+E E+ K+ +K ++ +
Sbjct: 1715 KKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIR 1774
Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
K+K+ EE +E++E + + +K+ K
Sbjct: 1775 KEKEAVIEEELDEEDEKRRMEVDKKIKDIF 1804
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 44.9 bits (107), Expect = 2e-05
Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 7/73 (9%)
Query: 197 DESDKEEEEKEEGEQTD-SEKEAKK-RKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
D+ E E+EEGE + S E KK RKK +K +KK +K EE EK ++
Sbjct: 389 DKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEK-----EEAEKAAAKKKAEAAA 443
Query: 255 AKKRKKHKKDKKK 267
K + + KK
Sbjct: 444 KKAKGPDGETKKV 456
Score = 34.1 bits (79), Expect = 0.078
Identities = 18/87 (20%), Positives = 36/87 (41%)
Query: 170 VKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK 229
++ + L A+ + + S K+ +K+ + +EKE ++ KK +
Sbjct: 381 IEIYLKLHDKPLLAEGEEEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAE 440
Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAK 256
KK+K + E K+ EK A+
Sbjct: 441 AAAKKAKGPDGETKKVDPDPLGEKLAR 467
Score = 28.4 bits (64), Expect = 5.2
Identities = 18/40 (45%), Positives = 21/40 (52%), Gaps = 3/40 (7%)
Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAKK-RKKHKKDKKKRKK 270
K EEEE E G S E KK RKK +K +KK +K
Sbjct: 390 KPLLAEGEEEEGENGNL--SPAERKKLRKKQRKAEKKAEK 427
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 44.4 bits (105), Expect = 3e-05
Identities = 22/95 (23%), Positives = 44/95 (46%), Gaps = 7/95 (7%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK---KKK 234
K+K +A+ ++KA + E + + + E ++ K A+ +KK + + K K K K
Sbjct: 133 KAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKK----KAAEAKKKAEAEAKAKAEAKAK 188
Query: 235 SKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
+K EE + K E + + EA + + +
Sbjct: 189 AKAEEAKAKAEAAKAKAAAEAAAKAEAEAAAAAAA 223
Score = 41.0 bits (96), Expect = 4e-04
Identities = 25/94 (26%), Positives = 47/94 (50%), Gaps = 3/94 (3%)
Query: 180 KLKAKEKRKASPTSASSDESD---KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
+ K E+R A+ +A E EE++K+ E + K K + +KK K+++K
Sbjct: 90 RQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAK 149
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
+ EEE + +++K+A + KK + + K K
Sbjct: 150 KQAEEEAKAKAAAEAKKKAAEAKKKAEAEAKAKA 183
Score = 40.2 bits (94), Expect = 6e-04
Identities = 21/89 (23%), Positives = 42/89 (47%), Gaps = 7/89 (7%)
Query: 188 KASPTSASSDESDKEEEEKEEGEQT-----DSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
K + K E++ EE E+ +KE ++R +K K+ ++ +K EE++
Sbjct: 59 KKPAAKKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQ 118
Query: 243 KEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
K+ E K+A + K + + ++K K
Sbjct: 119 KQAEEA--KAKQAAEAKAKAEAEAEKKAK 145
Score = 37.9 bits (88), Expect = 0.003
Identities = 18/89 (20%), Positives = 40/89 (44%), Gaps = 3/89 (3%)
Query: 183 AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
+ +++ P + E K+ E++ E + E + R+K + + +K +K E+
Sbjct: 53 NRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAE-QARQKELEQRAAAEKAAKQAEQAA 111
Query: 243 KEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
K+ E+ +EAK K + K + +
Sbjct: 112 KQAEEKQKQAEEAK--AKQAAEAKAKAEA 138
Score = 36.7 bits (85), Expect = 0.009
Identities = 22/90 (24%), Positives = 48/90 (53%), Gaps = 1/90 (1%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
++ +A+EK+K + + + ++ + + + E E+ E+ K+ ++ K K + K K
Sbjct: 109 QAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKA 168
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
E ++K E E ++ EAK + K ++ K K
Sbjct: 169 AEAKKKAEAE-AKAKAEAKAKAKAEEAKAK 197
Score = 34.4 bits (79), Expect = 0.056
Identities = 16/70 (22%), Positives = 40/70 (57%), Gaps = 2/70 (2%)
Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKH 261
+ +KE ++ +EK AK+ ++ K ++K+K++ EE + K+ E + ++K
Sbjct: 88 QARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQA--EEAKAKQAAEAKAKAEAEAEKKAK 145
Query: 262 KKDKKKRKKK 271
++ KK+ +++
Sbjct: 146 EEAKKQAEEE 155
Score = 34.0 bits (78), Expect = 0.072
Identities = 23/89 (25%), Positives = 40/89 (44%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
K K + K K + + + E++ E++ KEE ++ E+ K K K + KK
Sbjct: 116 EKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEAKKKA 175
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
E + K E + +EAK + + K K
Sbjct: 176 EAEAKAKAEAKAKAKAEEAKAKAEAAKAK 204
Score = 29.8 bits (67), Expect = 1.3
Identities = 18/91 (19%), Positives = 45/91 (49%), Gaps = 8/91 (8%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
A ++ + KE+E +++ EQ +EA+K++ ++ ++K+ ++ E+
Sbjct: 46 GAVAQQANRIQQQKKPAAKKEQERQKKLEQQ--AEEAEKQRAAEQARQKELEQRAAAEKA 103
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K+ E+ AK+ ++ +K ++ K K
Sbjct: 104 AKQA------EQAAKQAEEKQKQAEEAKAKQ 128
Score = 28.3 bits (63), Expect = 5.0
Identities = 14/72 (19%), Positives = 35/72 (48%), Gaps = 1/72 (1%)
Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
D ++ +K A K+++ ++ KK +++ + E++ E+ Q + E+ A K
Sbjct: 44 DPGAVAQQANRIQQQKKPAAKKEQ-ERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEK 102
Query: 260 KHKKDKKKRKKK 271
K+ ++ K+
Sbjct: 103 AAKQAEQAAKQA 114
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 43.4 bits (103), Expect = 6e-05
Identities = 18/53 (33%), Positives = 26/53 (49%), Gaps = 1/53 (1%)
Query: 221 RKKHKKDK-KKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
R K K KK K SK + E+ E+ + + KKR + K+ KR+K S
Sbjct: 377 RPKTKAPSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRDTKNIGKRRKPS 429
Score = 34.5 bits (80), Expect = 0.048
Identities = 27/90 (30%), Positives = 40/90 (44%), Gaps = 15/90 (16%)
Query: 157 VDKMSRPLQDV---RVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTD 213
+ K+ R +++ RV+ EL K+K EK+ P S K E+KE+
Sbjct: 357 LGKIERYIEEPLKARVID--ELRPKTK-APSEKKTGKP---SKKVLAKRAEKKEK----- 405
Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
EKE K KK +D K K+ K E+
Sbjct: 406 -EKEKPKVKKRHRDTKNIGKRRKPSGTSEE 434
Score = 33.4 bits (77), Expect = 0.12
Identities = 17/72 (23%), Positives = 25/72 (34%), Gaps = 22/72 (30%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
K EK+ G S+K KR + K+ +K+K K K + R
Sbjct: 381 KAPSEKKTG--KPSKKVLAKRAEKKEKEKEKPKVKK-------------------RHRDT 419
Query: 261 HKKDKKKRKKKS 272
K+RK
Sbjct: 420 KNI-GKRRKPSG 430
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 41.6 bits (98), Expect = 1e-04
Identities = 24/70 (34%), Positives = 40/70 (57%)
Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
S + S+ S+KE K E E E+E K++KK K+ KK+KK+K +E+ + +G
Sbjct: 124 PPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGS 183
Query: 248 QTDSEKEAKK 257
+ +K+ KK
Sbjct: 184 KKKKKKKKKK 193
Score = 33.9 bits (78), Expect = 0.040
Identities = 18/68 (26%), Positives = 37/68 (54%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
L S+ + EK + ++ ++E++EK++ ++ EK+ KK KK K + K K
Sbjct: 125 PSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSK 184
Query: 234 KSKYEEEE 241
K K ++++
Sbjct: 185 KKKKKKKK 192
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 42.0 bits (99), Expect = 2e-04
Identities = 27/96 (28%), Positives = 42/96 (43%), Gaps = 9/96 (9%)
Query: 185 EKRKASPTS--------ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
+K K P S S + EE+ + T S K+ KK K+ + KKK +KK K
Sbjct: 17 QKSKLQPISYIYSNVLVLSKEILSTFSEEENK-VATTSTKKDKKEDKNNESKKKSEKKKK 75
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
++E+++ + E +K K KK K K
Sbjct: 76 KKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKP 111
Score = 41.6 bits (98), Expect = 3e-04
Identities = 27/111 (24%), Positives = 48/111 (43%), Gaps = 8/111 (7%)
Query: 158 DKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASP-------TSASSDESDKEEEEKEEGE 210
+S +V V+ L S+ + K ++ + S +S+K++++K+E +
Sbjct: 22 QPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKK 81
Query: 211 QTDSEKEAKK-RKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+ SE E K K KK KK KKK K + E+ + +E K
Sbjct: 82 EPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNAFNKIAELAEKSNVY 132
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 41.2 bits (97), Expect = 2e-04
Identities = 23/84 (27%), Positives = 46/84 (54%), Gaps = 4/84 (4%)
Query: 189 ASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
A+P + K+++++ E +KE ++++K K KKK KKK ++++K++ +
Sbjct: 49 ATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKK----KDKDKDKKDD 104
Query: 249 TDSEKEAKKRKKHKKDKKKRKKKS 272
+K KK +K +DK + KS
Sbjct: 105 KKDDKSEKKDEKEAEDKLEDLTKS 128
Score = 37.0 bits (86), Expect = 0.005
Identities = 27/89 (30%), Positives = 48/89 (53%), Gaps = 6/89 (6%)
Query: 183 AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHK-KDKKKKKKKSKYEEEE 241
AK+K+K +E +K ++E EE ++ +K+ K+KK K KDKK KK K E+++
Sbjct: 60 AKKKKKELA-----EEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKD 114
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
EKE ++ + ++ + K +K
Sbjct: 115 EKEAEDKLEDLTKSYSETLSTLSELKPRK 143
Score = 29.3 bits (66), Expect = 1.5
Identities = 14/64 (21%), Positives = 28/64 (43%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K+K + + K K++++K++ + +K KK +K +DK + KS
Sbjct: 71 IEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAEDKLEDLTKSYS 130
Query: 238 EEEE 241
E
Sbjct: 131 ETLS 134
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 41.3 bits (97), Expect = 2e-04
Identities = 19/87 (21%), Positives = 41/87 (47%), Gaps = 1/87 (1%)
Query: 175 LVLKSKLKAKEKRKASPTSASSDESDKEEEE-KEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
L S + S D+ E EE KEE ++ + ++ + + +K+ ++ ++
Sbjct: 34 LFFPSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEE 93
Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+++ E+EE +E E+ EK +K
Sbjct: 94 ENEEEDEESSDENEKETEEKTESNVEK 120
Score = 37.0 bits (86), Expect = 0.005
Identities = 20/83 (24%), Positives = 41/83 (49%), Gaps = 6/83 (7%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
+A + + S + ++ EE ++EE E +SE + K K+D++ +++ +
Sbjct: 39 SPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEE 98
Query: 238 EEEE------EKEEGEQTDSEKE 254
+EE E EE +++ EKE
Sbjct: 99 DEESSDENEKETEEKTESNVEKE 121
Score = 32.8 bits (75), Expect = 0.11
Identities = 19/83 (22%), Positives = 42/83 (50%), Gaps = 2/83 (2%)
Query: 190 SPTSASSDESDKEEEEKEEGEQTDSE--KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
S S + ++E ++ ++ E + E KE +K + +DK+ K K +EE E+E E
Sbjct: 38 SSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97
Query: 248 QTDSEKEAKKRKKHKKDKKKRKK 270
+ + + +++ +K + +K
Sbjct: 98 EDEESSDENEKETEEKTESNVEK 120
Score = 31.2 bits (71), Expect = 0.34
Identities = 12/74 (16%), Positives = 40/74 (54%)
Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
++ EQ + + ++ + ++ K+++K+ + E++E+K + E+ D E E +
Sbjct: 37 PSSPSDQAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENE 96
Query: 259 KKHKKDKKKRKKKS 272
++ ++ + +K++
Sbjct: 97 EEDEESSDENEKET 110
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 41.9 bits (98), Expect = 2e-04
Identities = 27/97 (27%), Positives = 44/97 (45%), Gaps = 4/97 (4%)
Query: 178 KSKLKAKEKRKASPTSASSD----ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
K + + A +S D E+ K E EK + E +++EA K K HK K++
Sbjct: 253 KQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQES 312
Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
K+ +E E+KE Q E A+ +K K + +
Sbjct: 313 KASEKEAEDKELEAQKKREPVAEDLQKTKPQVEAQPT 349
Score = 40.0 bits (93), Expect = 0.001
Identities = 24/99 (24%), Positives = 44/99 (44%), Gaps = 5/99 (5%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
EL K K ++KA ++D+ E +K++ + + K K + +K
Sbjct: 221 ELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQK 280
Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ E EK + E +++EA K K HK K++ K+
Sbjct: 281 R-----EIEKAQIEIKKNDEEALKAKDHKAFDLKQESKA 314
Score = 32.3 bits (73), Expect = 0.27
Identities = 18/86 (20%), Positives = 45/86 (52%), Gaps = 1/86 (1%)
Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
K + S A + KEE +K++ + ++++A + + DK++ + + K +E + +
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDN-ADKQRDEVRQKQQEAKNLPK 262
Query: 246 GEQTDSEKEAKKRKKHKKDKKKRKKK 271
T S KE K+ +++K + ++ +
Sbjct: 263 PADTSSPKEDKQVAENQKREIEKAQI 288
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 42.0 bits (98), Expect = 3e-04
Identities = 28/146 (19%), Positives = 56/146 (38%), Gaps = 5/146 (3%)
Query: 120 SQFFITTQPAPHLDGVHVVFGHVVSGQNVVSHIENLPVDKMSRPLQDVRVVKCNELVLKS 179
+QF + + + G + SR + +E ++
Sbjct: 7 TQFLSFPRCNADVPCLLQSLGFNKFRGERWRLQNKFLLAAGSRKKI-ESALAVDEEPDEN 65
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
+K+K S A+ + E EEG + ++A K+ KK ++ ++K+
Sbjct: 66 GAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAAS 125
Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKDK 265
+ +EE +EK+ +KR+K KK
Sbjct: 126 SDVEEEK----TEKKVRKRRKVKKMD 147
Score = 33.1 bits (75), Expect = 0.16
Identities = 20/74 (27%), Positives = 36/74 (48%)
Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
S K+ E ++ E A +KK + K+ KK+ E E EEG + ++A
Sbjct: 48 SRKKIESALAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALD 107
Query: 259 KKHKKDKKKRKKKS 272
K+ KK ++ ++K+
Sbjct: 108 KESKKTPRRTRRKA 121
Score = 31.2 bits (70), Expect = 0.63
Identities = 18/77 (23%), Positives = 34/77 (44%), Gaps = 8/77 (10%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K K + + +ASSD +++ E+K +K KK +D + + +S+
Sbjct: 108 KESKKTPRRTRRKAAAASSDVEEEKTEKKVRKR--------RKVKKMDEDVEDQGSESEV 159
Query: 238 EEEEEKEEGEQTDSEKE 254
+ EE E ++E E
Sbjct: 160 SDVEESEFVTSLENESE 176
Score = 31.2 bits (70), Expect = 0.68
Identities = 17/108 (15%), Positives = 41/108 (37%), Gaps = 1/108 (0%)
Query: 161 SRPLQDVR-VVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAK 219
R DV +++ + + + K + S + + EE ++ + + K
Sbjct: 13 PRCNADVPCLLQSLGFNKFRGERWRLQNKFLLAAGSRKKIESALAVDEEPDENGAVSKKK 72
Query: 220 KRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
+ K+ KK + EE E D+ + + +K ++ ++K
Sbjct: 73 PTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRK 120
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 40.0 bits (94), Expect = 3e-04
Identities = 25/102 (24%), Positives = 55/102 (53%), Gaps = 3/102 (2%)
Query: 172 CNELVLKSKLKAKEKRKASPTSASSDES---DKEEEEKEEGEQTDSEKEAKKRKKHKKDK 228
++ K K+K++++ + S+ D+++E E ++ + K+AK++ + K K
Sbjct: 19 KSKRSELKKASKKKKKRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLK 78
Query: 229 KKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
KK++ K +E+EE+ ++EKE + +K K + K K+
Sbjct: 79 AKKEEAEKEKEKEERFMKALAEAEKERAELEKKKAEAKLMKE 120
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 40.8 bits (96), Expect = 3e-04
Identities = 22/67 (32%), Positives = 42/67 (62%), Gaps = 1/67 (1%)
Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
S + +++EE++EE + ++ +AKK K K ++K+K K+ K EE+ +E E T ++ A
Sbjct: 36 SWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREK-EEKGLRELEEDTPEDELA 94
Query: 256 KKRKKHK 262
+K + K
Sbjct: 95 EKLRLRK 101
Score = 31.9 bits (73), Expect = 0.23
Identities = 18/92 (19%), Positives = 42/92 (45%), Gaps = 5/92 (5%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
E + D+ D E+E+ + + D E++ +K ++ K K K K + + E++
Sbjct: 10 EPPAPPAKAVVKDKWDDEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEK 69
Query: 245 EGEQTDSEKEAKKRKKH-----KKDKKKRKKK 271
E + + E++ + + + +K R +K
Sbjct: 70 EKAKREKEEKGLRELEEDTPEDELAEKLRLRK 101
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 41.6 bits (98), Expect = 3e-04
Identities = 21/66 (31%), Positives = 38/66 (57%)
Query: 207 EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKK 266
E ++E + +K++ +K KKK KKSK + +++ + SE + + K++ K KK
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60
Query: 267 KRKKKS 272
K+KKK
Sbjct: 61 KKKKKK 66
Score = 36.2 bits (84), Expect = 0.013
Identities = 21/72 (29%), Positives = 36/72 (50%), Gaps = 2/72 (2%)
Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE--EEEKEEGEQTDSEKEAKKRK 259
E E E +++ K++ KK+KK KK + E E ++ ++++ KK+K
Sbjct: 1 AMNENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKK 60
Query: 260 KHKKDKKKRKKK 271
K KK KKK+
Sbjct: 61 KKKKKKKKKNLG 72
Score = 35.5 bits (82), Expect = 0.023
Identities = 19/88 (21%), Positives = 32/88 (36%), Gaps = 5/88 (5%)
Query: 171 KCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 230
+ K + K K +K + S + D + SE + + K++ K KKK
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELI-----SENQEAENKQNNKKKKK 61
Query: 231 KKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
KKKK K + E + +
Sbjct: 62 KKKKKKKKNLGEAYDLAYDLPVVWSSAA 89
Score = 34.7 bits (80), Expect = 0.049
Identities = 23/66 (34%), Positives = 35/66 (53%)
Query: 207 EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKK 266
E GE +++ + ++K K K KK KK +++ E + E E K+ K KK KK
Sbjct: 4 ENGENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKK 63
Query: 267 KRKKKS 272
K+KKK
Sbjct: 64 KKKKKK 69
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 39.4 bits (92), Expect = 4e-04
Identities = 16/52 (30%), Positives = 27/52 (51%)
Query: 192 TSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
+S+ +E + EEE +E EQ D + + K+ KK + K K K + + E
Sbjct: 92 SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEP 143
Score = 31.3 bits (71), Expect = 0.26
Identities = 14/64 (21%), Positives = 31/64 (48%), Gaps = 4/64 (6%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHK----KDKKKKK 232
+ +L + + + S+DE+++E+ + + E + +K + K + K + KK
Sbjct: 87 IFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKKP 146
Query: 233 KKSK 236
K SK
Sbjct: 147 KPSK 150
Score = 29.0 bits (65), Expect = 1.7
Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 17/74 (22%)
Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
S +EEE+ E E TD + ++D + K +SK E++++E + K
Sbjct: 93 SSDDEEEETEEESTDETE--------QEDPPETKTESK--EKKKRE-------VPKPKTE 135
Query: 259 KKHKKDKKKRKKKS 272
K+ K + K+ K S
Sbjct: 136 KEKPKTEPKKPKPS 149
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 40.5 bits (95), Expect = 6e-04
Identities = 21/97 (21%), Positives = 46/97 (47%), Gaps = 13/97 (13%)
Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
SD+ E + E E + K++ K +++K+K++K+ + E +EEK+ +
Sbjct: 245 SDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKK 304
Query: 248 QT-------------DSEKEAKKRKKHKKDKKKRKKK 271
+ +++A+ RKK ++ ++ KKK
Sbjct: 305 KLAQLARLKEIAKEVAQKEKARARKKEQRKERGEKKK 341
Score = 27.7 bits (62), Expect = 6.4
Identities = 17/66 (25%), Positives = 36/66 (54%), Gaps = 2/66 (3%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
EL ++K + + K+K + + + +E ++E + +++ K+R + KK K++K
Sbjct: 291 ELEREAKEEKQLKKKLA--QLARLKEIAKEVAQKEKARARKKEQRKERGEKKKLKRRKLG 348
Query: 234 KSKYEE 239
K KY E
Sbjct: 349 KHKYPE 354
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 40.7 bits (95), Expect = 7e-04
Identities = 28/88 (31%), Positives = 43/88 (48%), Gaps = 18/88 (20%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
EL +++ +AK+ K SD D EEEE+EE E+ + E+E +
Sbjct: 836 ELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEE-------------- 881
Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKH 261
EEEEE+EE E+ S + + R+K
Sbjct: 882 ----EEEEEEEENEEPLSLEWPETRQKQ 905
Score = 34.6 bits (79), Expect = 0.058
Identities = 22/84 (26%), Positives = 41/84 (48%), Gaps = 3/84 (3%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQ---TDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
K S ++D E + E GEQ +++ EAK+ +K EEEE
Sbjct: 809 AGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEE 868
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDK 265
E+EE E+ + E+E ++ ++ + ++
Sbjct: 869 EEEEEEEEEEEEEEEEEEEEENEE 892
Score = 33.0 bits (75), Expect = 0.18
Identities = 22/85 (25%), Positives = 35/85 (41%), Gaps = 5/85 (5%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKE-EGEQTDSEKEAKKRKKHKKDKKKKK 232
EL +K +L +R + A D S + E E GE+T E E + + ++
Sbjct: 609 ELWVKEQL---SRRPVAKVMALGDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGG 665
Query: 233 KKSKYEEEEEKEEGEQT-DSEKEAK 256
+ + E E K E E + E K
Sbjct: 666 EAEQEGETETKGENESEGEIPAERK 690
Score = 32.7 bits (74), Expect = 0.22
Identities = 24/97 (24%), Positives = 46/97 (47%), Gaps = 7/97 (7%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKE----EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
K + +A+E T A E + E EE +E ++ + E E K + + D+K+ +
Sbjct: 707 KGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEH 766
Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
+ + E E +++E E E +A + + K D+ K
Sbjct: 767 EGETEAEGKEDEDE---GEIQAGEDGEMKGDEGAEGK 800
Score = 32.3 bits (73), Expect = 0.33
Identities = 15/71 (21%), Positives = 33/71 (46%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
E + E E KE + ++E E + + + + + + EE +E ++ + E E K
Sbjct: 693 QEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGK 752
Query: 257 KRKKHKKDKKK 267
+ + D+K+
Sbjct: 753 HEVETEGDRKE 763
Score = 29.2 bits (65), Expect = 3.0
Identities = 17/69 (24%), Positives = 32/69 (46%), Gaps = 3/69 (4%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
+ES+ E + +GEQ + K HK + + ++ + + E E E E E E E
Sbjct: 679 NESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEG---EIETG 735
Query: 257 KRKKHKKDK 265
+ + +D+
Sbjct: 736 EEGEEVEDE 744
Score = 27.7 bits (61), Expect = 9.3
Identities = 15/79 (18%), Positives = 38/79 (48%), Gaps = 1/79 (1%)
Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
+ + + +E E E E + E+ + ++ + + K + +S+ E E++ ++ + E
Sbjct: 640 HTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEI 699
Query: 254 EAKKRKKHKKDKKKRKKKS 272
EA K HK + + + +
Sbjct: 700 EA-KEADHKGETEAEEVEH 717
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 39.9 bits (93), Expect = 8e-04
Identities = 23/91 (25%), Positives = 42/91 (46%), Gaps = 2/91 (2%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K K A+ K+KA + +++E+ + E + +E EAK + K + + K K++
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAKAKAEAAA--AKKKAEAEAKAAAEKAKAEAEAKAKAEK 219
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
+ E EE + +K A K K K +
Sbjct: 220 KAEAAAEEKAAAEKKKAAAKAKADKAAAAAK 250
Score = 37.6 bits (87), Expect = 0.005
Identities = 25/93 (26%), Positives = 42/93 (45%), Gaps = 3/93 (3%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
KAK +A+ A+++ K EE + E+ ++ EA KK + + K + E E
Sbjct: 153 KAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAE 212
Query: 242 EKEEGEQT---DSEKEAKKRKKHKKDKKKRKKK 271
K + E+ +E++A KK K K K
Sbjct: 213 AKAKAEKKAEAAAEEKAAAEKKKAAAKAKADKA 245
Score = 37.2 bits (86), Expect = 0.006
Identities = 25/120 (20%), Positives = 53/120 (44%), Gaps = 7/120 (5%)
Query: 159 KMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEA 218
K + + R+ + + LK++ + K+ +A + + +E+ K EQ + A
Sbjct: 93 KPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAA 152
Query: 219 K-------KRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
K + K + KKK +++ EE K + E ++K+A+ K +K K + +
Sbjct: 153 KAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAAEKAKAEAE 212
Score = 33.0 bits (75), Expect = 0.17
Identities = 15/89 (16%), Positives = 45/89 (50%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
K +++RK + + K+ E+E +Q + E+ + ++ + ++ +K+ + + +++E
Sbjct: 76 KGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQE 135
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
E+ + +K+A+ K + + K
Sbjct: 136 EQARKAAAEQKKKAEAAKAKAAAEAAKLK 164
Score = 32.6 bits (74), Expect = 0.21
Identities = 20/91 (21%), Positives = 47/91 (51%), Gaps = 3/91 (3%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
+ +++S KEE+ EE + + ++ + ++ K+ K ++++ + EE E++
Sbjct: 68 QSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQA 127
Query: 245 EGEQTDSEKEAKK---RKKHKKDKKKRKKKS 272
+ EQ E++A+K +K K + K K +
Sbjct: 128 QLEQKQQEEQARKAAAEQKKKAEAAKAKAAA 158
Score = 32.2 bits (73), Expect = 0.29
Identities = 23/98 (23%), Positives = 44/98 (44%), Gaps = 5/98 (5%)
Query: 180 KLKAKEKRKASPTSASSDESDK-----EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
K A++K+KA A + E +K+ E + +EAK + + KKK + +
Sbjct: 140 KAAAEQKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAE 199
Query: 235 SKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+K E+ K E E ++ + +K ++KK +
Sbjct: 200 AKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAA 237
Score = 30.3 bits (68), Expect = 1.2
Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 13/109 (11%)
Query: 177 LKSKLKAKEKR----KASPTSASSDESDKEEEEKEEGEQTDSEKE------AKKRKKHKK 226
LK K A+++R + A + EE EK+ + ++E A+++KK +
Sbjct: 92 LKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEA 151
Query: 227 DKKKKK---KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K K K K E +K+ E + +EAK + + KKK + ++
Sbjct: 152 AKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEA 200
Score = 28.4 bits (63), Expect = 4.8
Identities = 15/75 (20%), Positives = 35/75 (46%), Gaps = 1/75 (1%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K++ A +K+ + A+++++ E E K + E+ +E A+++ +K K K K+
Sbjct: 186 KAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKK-AEAAAEEKAAAEKKKAAAKAKADK 244
Query: 238 EEEEEKEEGEQTDSE 252
K + +
Sbjct: 245 AAAAAKAAERKAAAA 259
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 39.8 bits (93), Expect = 0.001
Identities = 20/94 (21%), Positives = 43/94 (45%), Gaps = 2/94 (2%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
+K KA+ + K + +A ++ +++ + E + EAKK K + K ++K
Sbjct: 144 AAKAKAEAEAKRAAAAAKKAAAEAKKKAEAE-AAKKAAAEAKK-KAEAEAAAKAAAEAKK 201
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ E E ++ +++K+A K K + K
Sbjct: 202 KAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAK 235
Score = 35.2 bits (81), Expect = 0.031
Identities = 17/85 (20%), Positives = 37/85 (43%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
+ KR + +E ++K+ EQ ++ K+R ++ KK+ ++ +K +
Sbjct: 71 QKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALK 130
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKK 266
+K+ E A K K + K+
Sbjct: 131 QKQAEEAAAKAAAAAKAKAEAEAKR 155
Score = 34.0 bits (78), Expect = 0.064
Identities = 19/83 (22%), Positives = 42/83 (50%), Gaps = 7/83 (8%)
Query: 196 SDESDKEEEEKEEGEQTDSE---KEAKKRKKHKKDKKKK----KKKSKYEEEEEKEEGEQ 248
EE+ K++ +Q E K+A ++++ K+ +K++ ++K + EE ++ +Q
Sbjct: 72 KSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQ 131
Query: 249 TDSEKEAKKRKKHKKDKKKRKKK 271
+E+ A K K K + + K
Sbjct: 132 KQAEEAAAKAAAAAKAKAEAEAK 154
Score = 34.0 bits (78), Expect = 0.074
Identities = 15/74 (20%), Positives = 36/74 (48%), Gaps = 3/74 (4%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK---KSKYEEEEEKEEGEQTDSEK 253
E ++ ++++ + ++ K+A ++K ++ K K+K E E ++ +
Sbjct: 106 LEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAA 165
Query: 254 EAKKRKKHKKDKKK 267
EAKK+ + + KK
Sbjct: 166 EAKKKAEAEAAKKA 179
Score = 32.9 bits (75), Expect = 0.15
Identities = 22/89 (24%), Positives = 43/89 (48%), Gaps = 1/89 (1%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
+K A ++++A +A + + K + E E + K+A K KK + + KK+
Sbjct: 123 AAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAK-KKAEAEAAKKAAA 181
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKK 266
E +++ E + EAKK+ + + KK
Sbjct: 182 EAKKKAEAEAAAKAAAEAKKKAEAEAKKK 210
Score = 32.5 bits (74), Expect = 0.21
Identities = 21/93 (22%), Positives = 43/93 (46%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
++ LK K+ +A+ +A++ ++ E E K K+K + KK ++K
Sbjct: 126 QAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEAKKKAEAEAAKKAAAEAKK 185
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
+ E E +++K+A+ K K + +KK
Sbjct: 186 KAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKK 218
Score = 31.7 bits (72), Expect = 0.37
Identities = 19/83 (22%), Positives = 40/83 (48%), Gaps = 1/83 (1%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
E K+ +AK+K +A + ++ E+ K+ E E ++ +++K+A K K +
Sbjct: 174 EAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAE 233
Query: 233 KKSKYEEEEEKEEGEQTDSEKEA 255
K+ E+ + E+ + K A
Sbjct: 234 AKAAAEKAAAAKAAEKAAAAKAA 256
Score = 31.3 bits (71), Expect = 0.50
Identities = 19/88 (21%), Positives = 35/88 (39%), Gaps = 9/88 (10%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
KA + K + ++ ++ E ++K E E ++K+A K K + K +K
Sbjct: 176 AKKAAAEAKKKAEAEAAAKAAAEAKKKAEAE---AKKKAAAEAKKKAAAEAKAAAAKAAA 232
Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
E + +EK A + K K
Sbjct: 233 EAKA------AAEKAAAAKAAEKAAAAK 254
Score = 30.9 bits (70), Expect = 0.60
Identities = 23/98 (23%), Positives = 42/98 (42%), Gaps = 4/98 (4%)
Query: 178 KSKLKAKEKRK-ASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK---KK 233
K +L A+E++K A + + K+ EE ++ +A+ K KK +
Sbjct: 108 KERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAAAAAKKAAAEA 167
Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
K K E E K+ + + EA+ K + KK+ +
Sbjct: 168 KKKAEAEAAKKAAAEAKKKAEAEAAAKAAAEAKKKAEA 205
Score = 29.4 bits (66), Expect = 2.3
Identities = 16/58 (27%), Positives = 37/58 (63%)
Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
EQ + +++ +K K ++++KKK++ + EE ++K+ EQ ++ K+R ++ KK+
Sbjct: 62 EQYNRQQQQQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQ 119
Score = 28.2 bits (63), Expect = 5.2
Identities = 15/79 (18%), Positives = 36/79 (45%), Gaps = 7/79 (8%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
+ E+E ++ E+ + +++K+ ++ K+ K K EE + ++ EA+
Sbjct: 95 KQAAEQERLKQLEK-ERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEA 153
Query: 258 R------KKHKKDKKKRKK 270
+ KK + KK+ +
Sbjct: 154 KRAAAAAKKAAAEAKKKAE 172
Score = 28.2 bits (63), Expect = 5.4
Identities = 13/75 (17%), Positives = 32/75 (42%), Gaps = 1/75 (1%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQ-TDSEKEAKKRKKHKKDKKKKKKKSK 236
K K +A+ KA+ + E++ +++ E ++ +E +A K + K +K +
Sbjct: 184 KKKAEAEAAAKAAAEAKKKAEAEAKKKAAAEAKKKAAAEAKAAAAKAAAEAKAAAEKAAA 243
Query: 237 YEEEEEKEEGEQTDS 251
+ E+ +
Sbjct: 244 AKAAEKAAAAKAAAE 258
Score = 27.8 bits (62), Expect = 6.3
Identities = 14/66 (21%), Positives = 35/66 (53%), Gaps = 1/66 (1%)
Query: 206 KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
++ Q +K AK R + ++ KK++++ + ++++ E+ EKE ++ KK
Sbjct: 62 EQYNRQQQQQKSAK-RAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQA 120
Query: 266 KKRKKK 271
++ K+
Sbjct: 121 EEAAKQ 126
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 39.9 bits (93), Expect = 0.001
Identities = 20/65 (30%), Positives = 34/65 (52%)
Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
E E E+ T+ E E KK+K+ K +K+ KK ++E + + Q S+ +K K
Sbjct: 5 ESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKKSEK 64
Query: 263 KDKKK 267
K +K+
Sbjct: 65 KSRKR 69
Score = 36.8 bits (85), Expect = 0.011
Identities = 19/90 (21%), Positives = 39/90 (43%), Gaps = 16/90 (17%)
Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKD----------------KKK 230
+ + +++E E K++ E+ EKE KK K +K+ KK
Sbjct: 2 SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKK 61
Query: 231 KKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+KKS+ + E++ + D + ++K+
Sbjct: 62 SEKKSRKRDVEDENPEDFIDPDTPFGQKKR 91
Score = 36.4 bits (84), Expect = 0.015
Identities = 19/73 (26%), Positives = 38/73 (52%)
Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE 252
S + E++K+ +EE E+ ++E K K+ KK K +K+ + ++ +G +
Sbjct: 2 SRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKK 61
Query: 253 KEAKKRKKHKKDK 265
E K RK+ +D+
Sbjct: 62 SEKKSRKRDVEDE 74
Score = 33.3 bits (76), Expect = 0.12
Identities = 21/65 (32%), Positives = 39/65 (60%), Gaps = 8/65 (12%)
Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK------KK 267
+E EA+K+ +++ ++KKKK E+ +EKE + ++KEAK + + ++ KK
Sbjct: 4 TESEAEKKILTEEELERKKKKE--EKAKEKELKKLKAAQKEAKAKLQAQQASDGTNVPKK 61
Query: 268 RKKKS 272
+KKS
Sbjct: 62 SEKKS 66
Score = 32.9 bits (75), Expect = 0.19
Identities = 21/64 (32%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
Query: 209 GEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
G +T+SE E K + + ++KKKK++ E+E +K + Q E +AK + + D
Sbjct: 1 GSRTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQ--KEAKAKLQAQQASDGTNV 58
Query: 269 KKKS 272
KKS
Sbjct: 59 PKKS 62
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 39.6 bits (92), Expect = 0.001
Identities = 21/77 (27%), Positives = 37/77 (48%), Gaps = 7/77 (9%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA- 255
E D++ EE EE + + +KK KK KK K KK K + + ++ + +D + E
Sbjct: 314 IEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSG-DDSDDSDIDGEDS 372
Query: 256 -----KKRKKHKKDKKK 267
K++K K ++
Sbjct: 373 VSLVTAKKQKEPKKEEP 389
Score = 38.4 bits (89), Expect = 0.003
Identities = 22/106 (20%), Positives = 40/106 (37%), Gaps = 11/106 (10%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEE-----------GEQTDSEKEAKKRKKHKK 226
K K+ ++ A +SD ++E E + + E + K
Sbjct: 253 KKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKP 312
Query: 227 DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ ++ + + EEE+ +EEG + K+ KK K K K S
Sbjct: 313 EIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDS 358
Score = 33.8 bits (77), Expect = 0.096
Identities = 23/114 (20%), Positives = 47/114 (41%), Gaps = 20/114 (17%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEK-------------------EA 218
+ K K K+K+ A D+ + ++ ++ DS+ +
Sbjct: 238 EEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDP 297
Query: 219 KKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE-KEAKKRKKHKKDKKKRKKK 271
++R+ + K + + +E+ E+ E E+ + E +KK KK KK K K+
Sbjct: 298 EEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGL 351
Score = 33.4 bits (76), Expect = 0.11
Identities = 17/58 (29%), Positives = 26/58 (44%)
Query: 213 DSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
D E E +K + K K +E++ E + + E K +KK KK K +KK
Sbjct: 197 DEETEGEKGGGGRGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKK 254
Score = 31.1 bits (70), Expect = 0.67
Identities = 19/69 (27%), Positives = 30/69 (43%), Gaps = 2/69 (2%)
Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
K K DESDK E + E+ +K+ K K KK KK K +++ ++
Sbjct: 214 KIKDLEGDDEDDGDESDKGGE--DGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADE 271
Query: 244 EEGEQTDSE 252
+ + D E
Sbjct: 272 YDSDDGDDE 280
Score = 28.4 bits (63), Expect = 4.2
Identities = 16/58 (27%), Positives = 28/58 (48%), Gaps = 2/58 (3%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE--EEKEEGEQTDSEK 253
E D E++ E + + E K +KK KK K KKK ++ ++ ++ DS+
Sbjct: 219 EGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDD 276
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 39.7 bits (93), Expect = 0.001
Identities = 18/77 (23%), Positives = 40/77 (51%)
Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
S +E ++E +K G + + +K + KK KK+ K + K ++E ++EE + + E +
Sbjct: 407 SDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAK 466
Query: 255 AKKRKKHKKDKKKRKKK 271
+K + ++ +K
Sbjct: 467 VEKVANKLLKRSEKAQK 483
Score = 37.7 bits (88), Expect = 0.005
Identities = 22/88 (25%), Positives = 44/88 (50%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
EK S ++++ +E+++ + E+ ++E K +K K+ +K++ EEEEE+
Sbjct: 431 EKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEEL 490
Query: 245 EGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ E + + K KK K+K S
Sbjct: 491 DEENPWLKTTSSVGKSAKKQDSKKKSSS 518
Score = 35.0 bits (81), Expect = 0.034
Identities = 23/94 (24%), Positives = 42/94 (44%), Gaps = 3/94 (3%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
L+ + +AK ++ A+ S+++ KEEEE E + K K KK+ K K
Sbjct: 459 LEDEEEAKVEKVANKLLKRSEKAQKEEEE---EELDEENPWLKTTSSVGKSAKKQDSKKK 515
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
+ +K + + + + KK+KK +K
Sbjct: 516 SSSKLDKAANKISKAAVKVKKKKKKEKSIDLDDD 549
Score = 30.4 bits (69), Expect = 1.0
Identities = 23/106 (21%), Positives = 39/106 (36%), Gaps = 23/106 (21%)
Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDK--------------------- 228
S + DE EEE E+ ++ D + E + K K
Sbjct: 322 SVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSGLLSMKFMQR 381
Query: 229 --KKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+KK+++ E EE + E E + E + + KK+ +RK
Sbjct: 382 AEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGP 427
Score = 28.9 bits (65), Expect = 3.7
Identities = 16/75 (21%), Positives = 41/75 (54%), Gaps = 1/75 (1%)
Query: 197 DESDKEEEE-KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
+E+D E EE + E E + E + + KK+ ++K + E+E + + + +++ E
Sbjct: 388 EENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKENKNEF 447
Query: 256 KKRKKHKKDKKKRKK 270
K++K+ ++++ +
Sbjct: 448 KEKKESDEEEELEDE 462
Score = 28.1 bits (63), Expect = 6.3
Identities = 21/71 (29%), Positives = 36/71 (50%)
Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKH 261
E +KEE + E + + + D+++ ++ SK K E + E E+KK KK
Sbjct: 383 EARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEAESKKLKKE 442
Query: 262 KKDKKKRKKKS 272
K++ K KK+S
Sbjct: 443 NKNEFKEKKES 453
Score = 28.1 bits (63), Expect = 6.6
Identities = 18/89 (20%), Positives = 30/89 (33%), Gaps = 10/89 (11%)
Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
S D+ EE++ +E + + KK +KKK E +E E ++
Sbjct: 1 MSDDDESDEEDDDDEDKHSKLLSAISSLGGEKKKDEKKKADRSEESGKESEFNLPSEGAG 60
Query: 254 EA----------KKRKKHKKDKKKRKKKS 272
+ KK K+ K S
Sbjct: 61 GKLTLSDLLGSIQDSSSLKKSLKQLDKIS 89
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 38.6 bits (90), Expect = 0.002
Identities = 18/58 (31%), Positives = 32/58 (55%)
Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
KE K P +E ++++EE+E+ + E + ++ K+ KK K KK K++ E E
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86
Score = 37.8 bits (88), Expect = 0.004
Identities = 15/46 (32%), Positives = 25/46 (54%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
+E +E++E+EE E+ ++ +K +K KK KK K E E
Sbjct: 41 EEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86
Score = 36.3 bits (84), Expect = 0.016
Identities = 18/51 (35%), Positives = 32/51 (62%)
Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
+E ++E+EEK+E E+ ++KE + ++ +K++KKKK K E E E
Sbjct: 36 PDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86
Score = 33.6 bits (77), Expect = 0.11
Identities = 23/64 (35%), Positives = 37/64 (57%), Gaps = 11/64 (17%)
Query: 204 EEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKK 263
E++ E E D E+E +K +K ++++K K+ + +EEEEKEE K+KK KK
Sbjct: 28 EKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE-----------KKKKTKK 76
Query: 264 DKKK 267
K+
Sbjct: 77 VKET 80
Score = 31.3 bits (71), Expect = 0.49
Identities = 19/61 (31%), Positives = 35/61 (57%), Gaps = 11/61 (18%)
Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
E EKE ++E ++ K+ KK++++K + E +EE+E+ E++ KK KK K
Sbjct: 30 EVEKEV-----PDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEK------EEKKKKTKKVK 78
Query: 263 K 263
+
Sbjct: 79 E 79
Score = 28.6 bits (64), Expect = 4.4
Identities = 17/48 (35%), Positives = 30/48 (62%), Gaps = 4/48 (8%)
Query: 228 KKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR----KKHKKDKKKRKKK 271
K+ +K+ EEEEEKEE ++ + + K+ ++ K++KKK+ KK
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 38.7 bits (91), Expect = 0.002
Identities = 20/97 (20%), Positives = 40/97 (41%), Gaps = 1/97 (1%)
Query: 177 LKSKLKAKEKRKASPTSA-SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
L KLK S +S +E+ KEE ++E E+ +E E + +++ K + +
Sbjct: 20 LLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAA 79
Query: 236 KYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
EE + + + A+ + R+ K+
Sbjct: 80 APAEEAAEAAAAAEAAARPAEDEAARPAEAAARRPKA 116
Score = 37.9 bits (89), Expect = 0.004
Identities = 22/98 (22%), Positives = 40/98 (40%), Gaps = 1/98 (1%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
EL + SK + EK K S S EEEE + E +E K + + + ++
Sbjct: 12 ELGVSSK-ELLEKLKELGIEVKSHSSTVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEE 70
Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
++K E E ++ A+ + +D+ R +
Sbjct: 71 EAKAEAAAAAPAEEAAEAAAAAEAAARPAEDEAARPAE 108
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 38.6 bits (90), Expect = 0.003
Identities = 21/102 (20%), Positives = 50/102 (49%), Gaps = 6/102 (5%)
Query: 157 VDKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGE-----Q 211
D++ + + + + ++ S K+ ++SP +++SD+ K+++ K +
Sbjct: 377 YDQIEKEMDAFSIQASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAE 436
Query: 212 TDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
+ + E KK KK++KK + KS + + + G + +S K
Sbjct: 437 SIPDDEEDAPKKGKKNQKKGRDKSS-KVPSDSKAGGKKESVK 477
Score = 33.6 bits (77), Expect = 0.12
Identities = 17/83 (20%), Positives = 31/83 (37%), Gaps = 1/83 (1%)
Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQT 249
+ ++ S+K E + + K+KK K K ++EE +
Sbjct: 391 ASSAGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGK 450
Query: 250 DSEKEAK-KRKKHKKDKKKRKKK 271
++K+ + K K D K KK
Sbjct: 451 KNQKKGRDKSSKVPSDSKAGGKK 473
Score = 33.3 bits (76), Expect = 0.14
Identities = 24/96 (25%), Positives = 36/96 (37%), Gaps = 8/96 (8%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
+K EK + + +S EK G +E K KKKK K +
Sbjct: 373 IKGIYDQIEKEMDAFSIQASSAGLIGSSEKSLG---SNESSPAASNSDKGSKKKKGKSTS 429
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K ++ + E KK KK++KK + KS
Sbjct: 430 -----TKGGTAESIPDDEEDAPKKGKKNQKKGRDKS 460
Score = 28.6 bits (64), Expect = 4.0
Identities = 20/68 (29%), Positives = 33/68 (48%), Gaps = 16/68 (23%)
Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK---------KKK 234
K+K K++ T + ES ++EE D+ K+ KK +K +DK K KK+
Sbjct: 422 KKKGKSTSTKGGTAESIPDDEE-------DAPKKGKKNQKKGRDKSSKVPSDSKAGGKKE 474
Query: 235 SKYEEEEE 242
S +E+
Sbjct: 475 SVKSQEDN 482
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 38.2 bits (89), Expect = 0.004
Identities = 25/95 (26%), Positives = 49/95 (51%), Gaps = 3/95 (3%)
Query: 173 NELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
N+L L + K E+ K S T S D+ ++E KE ++ + K + D ++
Sbjct: 342 NDLTLLALTKLYEEVK-SNTDLSGDK--RQELLKEYNKKLQDYTKKLGEVKDETDASEEA 398
Query: 233 KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
+ EE+ ++EE E+ E+ + ++K +KD++K
Sbjct: 399 EAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERK 433
Score = 34.4 bits (79), Expect = 0.053
Identities = 16/55 (29%), Positives = 25/55 (45%), Gaps = 1/55 (1%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSE-KEAKKRKKHKKDKKKKKK 233
K T AS + K +EEK + E+ + + KE K K+ K ++KK
Sbjct: 380 DYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 32.8 bits (75), Expect = 0.17
Identities = 16/63 (25%), Positives = 31/63 (49%), Gaps = 4/63 (6%)
Query: 213 DSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK----EAKKRKKHKKDKKKR 268
++ K+ K +D KK + K E + +E + EK E +K++K + D+ K
Sbjct: 366 KRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKE 425
Query: 269 KKK 271
K++
Sbjct: 426 KRQ 428
Score = 31.7 bits (72), Expect = 0.43
Identities = 13/50 (26%), Positives = 26/50 (52%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKK 226
L + + + A ++ +EE EK++ EQ D +KE +++ + KK
Sbjct: 385 LGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
Score = 29.8 bits (67), Expect = 1.8
Identities = 10/47 (21%), Positives = 29/47 (61%), Gaps = 2/47 (4%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
DE+D EE + + + + + + ++ +K +K++ + K+ + ++E +K
Sbjct: 390 DETDASEEAEAKAK--EEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 37.4 bits (87), Expect = 0.004
Identities = 20/77 (25%), Positives = 30/77 (38%), Gaps = 6/77 (7%)
Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE 252
S + +EEE E+ Q + + KKR K K K+ KKK K K+ +
Sbjct: 61 SEDDEPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKK------KDPTAAKSPK 114
Query: 253 KEAKKRKKHKKDKKKRK 269
A + KK +
Sbjct: 115 AAAPRPKKKSERISWAP 131
Score = 32.7 bits (75), Expect = 0.13
Identities = 20/87 (22%), Positives = 39/87 (44%), Gaps = 1/87 (1%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEE-KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
K K+K+K PT+A S ++ + K E R+K + + K++
Sbjct: 94 AYKEPTKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKSSRSSTVQNKEAT 153
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKK 263
+E +E+E + K K+++K K+
Sbjct: 154 HERLKEREIRRKKIQAKARKRKEKKKE 180
Score = 29.7 bits (67), Expect = 1.5
Identities = 23/79 (29%), Positives = 41/79 (51%), Gaps = 5/79 (6%)
Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
+SD ++ E +E E D E+ K+ ++ ++ KKKK+ K+K +E K++ +K
Sbjct: 51 EEEVDSDFDDSEDDEPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKK-----KKK 105
Query: 254 EAKKRKKHKKDKKKRKKKS 272
+ K K + KKKS
Sbjct: 106 DPTAAKSPKAAAPRPKKKS 124
Score = 28.5 bits (64), Expect = 3.1
Identities = 22/96 (22%), Positives = 45/96 (46%), Gaps = 2/96 (2%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
++ SP SS S + +E + E KK + + +K+KKK+ +
Sbjct: 124 SERISWAPTLLDSPRRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKEL 183
Query: 238 EEEEEKEEGEQTDSE--KEAKKRKKHKKDKKKRKKK 271
+EE E ++T+ K ++ ++ +++KKK K +
Sbjct: 184 TQEERLAEAKETERINLKSLERYEEQEEEKKKAKIQ 219
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 36.8 bits (86), Expect = 0.004
Identities = 17/59 (28%), Positives = 37/59 (62%), Gaps = 1/59 (1%)
Query: 214 SEKEAKKRKKHK-KDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+K+AKK KK K K +K+ +K + ++E K+ E+ +EK + R+ +++ + + ++K
Sbjct: 13 DKKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQRQAEAEQK 71
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 38.0 bits (88), Expect = 0.005
Identities = 25/89 (28%), Positives = 42/89 (47%)
Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
++ ES K+E EKEE KE K+ +K KK ++++ K EEEE K
Sbjct: 241 DLLQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELK 300
Query: 244 EEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
E + + K + K + +K+ +K +
Sbjct: 301 SELLKLERRKVDDEEKLKESEKELKKLEK 329
Score = 32.6 bits (74), Expect = 0.23
Identities = 22/99 (22%), Positives = 47/99 (47%), Gaps = 1/99 (1%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
L K +E+++ KEEEE + ++ +K K+ +K+ KK
Sbjct: 267 ILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKK 326
Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K E ++EKEE E+ + E + + K+ +++++ + +
Sbjct: 327 LEK-ELKKEKEEIEELEKELKELEIKREAEEEEEEQLEK 364
Score = 32.2 bits (73), Expect = 0.29
Identities = 18/108 (16%), Positives = 52/108 (48%), Gaps = 2/108 (1%)
Query: 165 QDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKH 224
Q ++ + E + K + +E+ K+ + + +++EE+ ++ + EKE K + +
Sbjct: 744 QKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKELAEEEEKTEKLKVEE--EKEEKLKAQE 801
Query: 225 KKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
++ + +++ + E E+E+ EK ++ + + K ++K
Sbjct: 802 EELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKL 849
Score = 31.9 bits (72), Expect = 0.37
Identities = 21/98 (21%), Positives = 42/98 (42%), Gaps = 3/98 (3%)
Query: 173 NELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
E + + + E+ S E EE E+ ++ D+++E ++R K K++
Sbjct: 914 EEKENEIEERIAEEAIILLKYESEPEELLLEEADEKEKEEDNKEEEEERNKRLLLAKEEL 973
Query: 233 KKSK---YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
E EEKEE D K+ + ++ K+ ++
Sbjct: 974 GNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLRE 1011
Score = 31.9 bits (72), Expect = 0.40
Identities = 18/119 (15%), Positives = 50/119 (42%), Gaps = 1/119 (0%)
Query: 153 ENLPVDKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQT 212
+ ++ + LQ+ + + + K + + E+E K+ ++
Sbjct: 273 KENKEEEKEKKLQEEELKLLAKE-EEELKSELLKLERRKVDDEEKLKESEKELKKLEKEL 331
Query: 213 DSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
EKE + + + + + K++++ EEEE+ E+ ++ + E + K K + ++
Sbjct: 332 KKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKKLESERLSSA 390
Score = 31.5 bits (71), Expect = 0.58
Identities = 26/105 (24%), Positives = 49/105 (46%), Gaps = 7/105 (6%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
L+ +L AK+K ++ S+++ ++E E K E E+ + ++ + K++KK
Sbjct: 369 LEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKL-LLELSEQEEDLLKEEKK 427
Query: 234 KSKYEEEEEKEEGE------QTDSEKEAKKRKKHKKDKKKRKKKS 272
+ EE +E E + E+ K+ K KDK + KK
Sbjct: 428 EELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLELKKSE 472
Score = 31.1 bits (70), Expect = 0.65
Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 2/91 (2%)
Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKK--HKKDKKKKKKKSKYEEEE 241
K K +A +K E E+E D K ++R + + ++++ ++E
Sbjct: 202 KLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEL 261
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
EKEE KE K+ +K KK +++ K
Sbjct: 262 EKEEEILAQVLKENKEEEKEKKLQEEELKLL 292
Score = 30.7 bits (69), Expect = 1.0
Identities = 20/109 (18%), Positives = 51/109 (46%)
Query: 164 LQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKK 223
LQ++ + E+ + KE+ + + E +KE++ +EE + +++E + + +
Sbjct: 243 LQELLRDEQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSE 302
Query: 224 HKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K +++K + +E EKE + K+ K+ + + + K +
Sbjct: 303 LLKLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIK 351
Score = 29.2 bits (65), Expect = 2.9
Identities = 18/97 (18%), Positives = 45/97 (46%)
Query: 176 VLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
+ K + + ++E E++ +E E+ + + KK ++ ++ + K+
Sbjct: 716 LKLEKEELLADKVQEAQDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELSLKEK 775
Query: 236 KYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ EEEEK E + + EKE K + + ++ + ++
Sbjct: 776 ELAEEEEKTEKLKVEEEKEEKLKAQEEELRALEEELK 812
Score = 28.4 bits (63), Expect = 5.2
Identities = 18/102 (17%), Positives = 48/102 (47%), Gaps = 6/102 (5%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
L+ + E++ + ++EKEE E+ + E + + K+ +++++++ +
Sbjct: 306 LERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKL 365
Query: 237 YEEEEEKEE------GEQTDSEKEAKKRKKHKKDKKKRKKKS 272
E+ E+ EE +++ A K K+ + + K ++K
Sbjct: 366 QEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKE 407
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 37.3 bits (87), Expect = 0.005
Identities = 30/101 (29%), Positives = 51/101 (50%), Gaps = 19/101 (18%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQ-------------------TDS 214
EL+ K K + ++K++A S D+ D+EEE + E ++ +DS
Sbjct: 73 ELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDS 132
Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
E E +K + KK K+ ++ E+EEE E E+ ++EKE
Sbjct: 133 EDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 37.8 bits (88), Expect = 0.006
Identities = 17/82 (20%), Positives = 33/82 (40%)
Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
K + + + + + GE+ + + + + H K + + EE+ +E
Sbjct: 860 KHQVDNMVTEYEGNIGDTQNSIRGEENTVKGQYSELQNHHKTEALSQNNKYNEEKSAQER 919
Query: 246 GEQTDSEKEAKKRKKHKKDKKK 267
DS +E KR K +DK K
Sbjct: 920 MPGADSPEELMKRAKEYQDKHK 941
Score = 29.7 bits (67), Expect = 1.8
Identities = 11/31 (35%), Positives = 16/31 (51%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK 231
+E+ +E DS +E KR K +DK K
Sbjct: 912 EEKSAQERMPGADSPEELMKRAKEYQDKHKG 942
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 37.3 bits (87), Expect = 0.006
Identities = 15/91 (16%), Positives = 32/91 (35%), Gaps = 1/91 (1%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
A K + + + + K + + E+ + + KK ++ + +
Sbjct: 3 TASTKAELAAEEEAKKKLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGM 62
Query: 242 EKEEGEQTDSEKEAKKRKK-HKKDKKKRKKK 271
K+ + T+S+ KK K K K K
Sbjct: 63 VKDTDDATESDIPKKKTKTAAKAAAAKAPAK 93
Score = 36.5 bits (85), Expect = 0.013
Identities = 20/89 (22%), Positives = 33/89 (37%), Gaps = 3/89 (3%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK---KHKKDKKKKKKKSKYE 238
+A E +K +P K+ + T+S+ KK K K K KKK K E
Sbjct: 40 EALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDE 99
Query: 239 EEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
+ K+ ++ +K+ D
Sbjct: 100 LDSSKKAEKKNALDKDDDLNYVKDIDVLN 128
Score = 32.7 bits (75), Expect = 0.18
Identities = 18/95 (18%), Positives = 37/95 (38%), Gaps = 1/95 (1%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K++L A+E+ K ++ K KEE + E + K ++ + K
Sbjct: 7 KAELAAEEEAKKKLKKLAAKSKSKGFITKEE-IKEALESKKKTPEQIDQVLIFLSGMVKD 65
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
++ + + + ++ AK KKK K +
Sbjct: 66 TDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDEL 100
Score = 32.7 bits (75), Expect = 0.22
Identities = 17/91 (18%), Positives = 31/91 (34%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
L+SK K E+ S D ++ + + + ++ AK KKK K +
Sbjct: 42 LESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKKKLKDELD 101
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
++ EK+ D + K
Sbjct: 102 SSKKAEKKNALDKDDDLNYVKDIDVLNQADD 132
>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
Length = 1622
Score = 37.4 bits (87), Expect = 0.007
Identities = 18/73 (24%), Positives = 31/73 (42%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
EL L K A +E+ +EE+++ + + +K KK K+
Sbjct: 825 ELSNNGPLFQKLMENAGKMEEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEG 884
Query: 234 KSKYEEEEEKEEG 246
KS ++EE+E G
Sbjct: 885 KSVLIKQEERETG 897
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 37.4 bits (87), Expect = 0.007
Identities = 24/91 (26%), Positives = 44/91 (48%), Gaps = 3/91 (3%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSE-KEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
+ K S S +E + ++++ + DS + +K +++ KKK K + +++
Sbjct: 199 KFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDS 258
Query: 244 EEGEQTD--SEKEAKKRKKHKKDKKKRKKKS 272
E +D E K KK KK KKK+KK+
Sbjct: 259 TETPASDYYDVSEMVKFKKPKKKKKKKKKRR 289
Score = 33.2 bits (76), Expect = 0.13
Identities = 26/97 (26%), Positives = 45/97 (46%), Gaps = 6/97 (6%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K + + K+K K + S + D E + + KK KK KK KKK++K
Sbjct: 238 KKRQEVKKKLKIN---NVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDLDE 294
Query: 238 EEEEEKEEG---EQTDSEKEAKKRKKHKKDKKKRKKK 271
+E E + EG + S K+ ++ +D K++K+
Sbjct: 295 DELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKRKE 331
Score = 31.7 bits (72), Expect = 0.41
Identities = 19/85 (22%), Positives = 43/85 (50%)
Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
K S +A+ + K +EE E + + +E + + K+++ K K E++ ++ +
Sbjct: 31 KESRDAAAYENWKKRQEEAEAKRKREELREKIAKAREKRERNSKLGGIKTLGEDDDDDDD 90
Query: 248 QTDSEKEAKKRKKHKKDKKKRKKKS 272
K++KKR+K K+ ++K+
Sbjct: 91 TKAWLKKSKKRQKKKEAERKKALLL 115
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 36.9 bits (85), Expect = 0.007
Identities = 25/75 (33%), Positives = 37/75 (49%), Gaps = 4/75 (5%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK- 236
K+K K EK S S + ES++EE + E ++ + E K +K K D KK +K
Sbjct: 225 KTKKKDLEKWLGSDQSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRKPHI 284
Query: 237 ---YEEEEEKEEGEQ 248
YE+E E E+
Sbjct: 285 HIEYEQERENEKIPA 299
Score = 35.4 bits (81), Expect = 0.020
Identities = 35/132 (26%), Positives = 56/132 (42%), Gaps = 14/132 (10%)
Query: 147 NVVSHIENLPVDKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEK 206
+IE + ++ + + E + +EKR+ E EEEE+
Sbjct: 156 KFEKNIEKELMLRLKSGIYGDTPLNVREHLWNKAATEREKRQ------DEKERYVEEEEE 209
Query: 207 EEGE----QTDSEKEAKKRKKHKK----DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
+ E DSEKE K+K +K D+ + +S+ EE E E E D + + K R
Sbjct: 210 SDTELEAVTDDSEKEKTKKKDLEKWLGSDQSMETSESEEEESSESESDEDEDEDNKGKIR 269
Query: 259 KKHKKDKKKRKK 270
K+ D KK +K
Sbjct: 270 KRKTDDAKKSRK 281
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 35.8 bits (83), Expect = 0.007
Identities = 24/52 (46%), Positives = 37/52 (71%)
Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
++++EE ++ D EK AKKR K +K K+KKKKK K ++ +KEE E + S +E
Sbjct: 69 QQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEE 120
Score = 31.9 bits (73), Expect = 0.15
Identities = 21/46 (45%), Positives = 32/46 (69%)
Query: 227 DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
D+K KK+ E ++++EE ++ D EK AKKR K +K K+K+KKK
Sbjct: 56 DEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKK 101
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 37.2 bits (86), Expect = 0.009
Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 6/66 (9%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKD--KKKKKKKSKYEE 239
K K +R SSDE + + +E E D EKE K+R++ K+ + K + SKY
Sbjct: 357 KRKRRRVPPLPEYSSDEDEDDSDEDE----VDYEKERKRRREEDKNFLRLKALELSKYAG 412
Query: 240 EEEKEE 245
E+ E
Sbjct: 413 VNERME 418
Score = 28.3 bits (63), Expect = 4.6
Identities = 15/74 (20%), Positives = 32/74 (43%), Gaps = 4/74 (5%)
Query: 191 PTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTD 250
SD + E +DS K++++ + +E+E+ + ++ D
Sbjct: 329 RERFPSDLFAISSLPDSDSEASDSGPTRKRKRRRVP----PLPEYSSDEDEDDSDEDEVD 384
Query: 251 SEKEAKKRKKHKKD 264
EKE K+R++ K+
Sbjct: 385 YEKERKRRREEDKN 398
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 36.2 bits (83), Expect = 0.010
Identities = 17/63 (26%), Positives = 35/63 (55%), Gaps = 3/63 (4%)
Query: 192 TSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
+S S S + + K + EQ S+ + ++HK ++K+ +++ EEEE +EE +Q +
Sbjct: 182 SSLSLGGSVQVKAPKPKQEQLLSKLQEYL-QEHKTEEKQPQEEQ--EEEEVEEEAKQEEG 238
Query: 252 EKE 254
+
Sbjct: 239 QGT 241
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 35.6 bits (82), Expect = 0.013
Identities = 20/64 (31%), Positives = 31/64 (48%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
K K K K+ R P + + + E+ + ++ K+RKK KK+KKKKKK+
Sbjct: 111 KKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKKKKKRHS 170
Query: 237 YEEE 240
E
Sbjct: 171 PEHP 174
Score = 32.9 bits (75), Expect = 0.11
Identities = 23/83 (27%), Positives = 31/83 (37%), Gaps = 18/83 (21%)
Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEG 246
P + K + E+T S+ E K + K KKK +
Sbjct: 105 HIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHED------------- 151
Query: 247 EQTDSEKEAKKRKKHKKDKKKRK 269
+KE KK+KK KK KKKR
Sbjct: 152 -----DKERKKKKKEKKKKKKRH 169
Score = 32.1 bits (73), Expect = 0.18
Identities = 17/50 (34%), Positives = 25/50 (50%)
Query: 222 KKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
KKHK KK + + EE + EK+ KK+K ++K+KKK
Sbjct: 111 KKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKK 160
Score = 31.7 bits (72), Expect = 0.23
Identities = 19/56 (33%), Positives = 30/56 (53%), Gaps = 2/56 (3%)
Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ K+K K KK + + EE EG + +K K+KKH+ DK+++KKK
Sbjct: 106 IQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKK--HKKKKHEDDKERKKKK 159
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 35.0 bits (81), Expect = 0.017
Identities = 18/76 (23%), Positives = 36/76 (47%), Gaps = 3/76 (3%)
Query: 164 LQDVRVVKCNEL--VLKSKLKA-KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKK 220
+D++V+ + +LK + K K +E + E EE +E EQ D E +
Sbjct: 69 CKDLKVLGKKDFKKLLKWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKEL 128
Query: 221 RKKHKKDKKKKKKKSK 236
K ++ +++ ++K K
Sbjct: 129 AKLKREKRRENERKQK 144
Score = 34.2 bits (79), Expect = 0.030
Identities = 12/47 (25%), Positives = 27/47 (57%)
Query: 225 KKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
K+K++++ + E EE +E EQ D E + K ++ +++ ++K
Sbjct: 96 LDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERK 142
Score = 27.3 bits (61), Expect = 5.7
Identities = 11/61 (18%), Positives = 32/61 (52%), Gaps = 7/61 (11%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
+L+ K + +E+ + DE ++ +E E ++ AK +++ +++ ++K+K
Sbjct: 92 KLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLE-------KELAKLKREKRRENERKQK 144
Query: 234 K 234
+
Sbjct: 145 E 145
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 36.1 bits (83), Expect = 0.017
Identities = 29/93 (31%), Positives = 41/93 (44%), Gaps = 3/93 (3%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE-- 239
KAK K KA+ + + + +++ + E T+ EK A K K K K +K +
Sbjct: 90 KAKAKAKAAAAAKAKAAALAKQKREGTEEVTEEEKAAAKAKAAAAAKAKAAALAKQKREG 149
Query: 240 -EEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
EE EE E+TD EK K K K K
Sbjct: 150 TEEVTEEEEETDKEKAKAKAAAAAKAKAAALAK 182
Score = 35.3 bits (81), Expect = 0.028
Identities = 30/90 (33%), Positives = 37/90 (41%), Gaps = 1/90 (1%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
+K KA KA + + + + EE EE E+TD EK K K K K K
Sbjct: 126 AAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQKA 185
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
E E E E T+ EK K K K K
Sbjct: 186 AEAGEGTE-EVTEEEKAKAKAKAAAAAKAK 214
Score = 32.3 bits (73), Expect = 0.29
Identities = 23/83 (27%), Positives = 39/83 (46%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K+ KAK A +A+ + +E E+ E+ +++KE K K K K +K
Sbjct: 124 KAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQ 183
Query: 238 EEEEEKEEGEQTDSEKEAKKRKK 260
+ E E E+ E++AK + K
Sbjct: 184 KAAEAGEGTEEVTEEEKAKAKAK 206
Score = 30.7 bits (69), Expect = 0.72
Identities = 22/91 (24%), Positives = 40/91 (43%), Gaps = 2/91 (2%)
Query: 183 AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK--KKSKYEEE 240
+ KR+A+ + + + +++ + E T+ EK K K K K K K E
Sbjct: 57 EEAKRRAAAAAKAKAAALAKQKREGTEEVTEEEKAKAKAKAAAAAKAKAAALAKQKREGT 116
Query: 241 EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
EE E E+ ++ +A K K ++K+
Sbjct: 117 EEVTEEEKAAAKAKAAAAAKAKAAALAKQKR 147
Score = 28.0 bits (62), Expect = 6.7
Identities = 24/92 (26%), Positives = 38/92 (41%), Gaps = 10/92 (10%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQT------DSEKEAKKRKKHKKDKKKKKKKS 235
+AKE+ + + E K EEE E E+ + +EAK+R K K +
Sbjct: 17 RAKEEARKRLVAKHGAEISKLEEENREKEKALPKNDDMTIEEAKRRAA----AAAKAKAA 72
Query: 236 KYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
+++ + E T+ EK K K K K
Sbjct: 73 ALAKQKREGTEEVTEEEKAKAKAKAAAAAKAK 104
Score = 27.6 bits (61), Expect = 6.9
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 8/67 (11%)
Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE-- 252
E D E+ +KE + +++EA+KR K + K + EE EKE+ + +
Sbjct: 1 MDPEKDLEDLKKEAARR--AKEEARKRLVAKHGAEISKLE---EENREKEKALPKNDDMT 55
Query: 253 -KEAKKR 258
+EAK+R
Sbjct: 56 IEEAKRR 62
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 34.7 bits (80), Expect = 0.022
Identities = 13/65 (20%), Positives = 28/65 (43%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
KEE E EE + D+ +A +K+ ++ ++ + ++ S K+ + +
Sbjct: 72 KEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDPRGGTQ 131
Query: 261 HKKDK 265
DK
Sbjct: 132 DVVDK 136
Score = 29.7 bits (67), Expect = 1.0
Identities = 12/59 (20%), Positives = 29/59 (49%), Gaps = 4/59 (6%)
Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
SE A R++ + K++ +++ E E + + + + RK+ ++ +K+KS
Sbjct: 60 SEIHALLRER----ELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKS 114
Score = 27.0 bits (60), Expect = 7.6
Identities = 14/81 (17%), Positives = 28/81 (34%), Gaps = 5/81 (6%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEE---KEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
+ + E + AS D E ++E EQ + K++ K++K K
Sbjct: 69 RELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAA--QKRKSCRDKERKSAKDP 126
Query: 235 SKYEEEEEKEEGEQTDSEKEA 255
++ + D +E
Sbjct: 127 RGGTQDVVDKSQASLDYGEEE 147
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 35.6 bits (82), Expect = 0.026
Identities = 17/88 (19%), Positives = 36/88 (40%), Gaps = 2/88 (2%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K++ KA++K A A+ E ++E ++E + A++ K K
Sbjct: 340 KNEAKARKKEIAQKRRAAEREINREA--RQERAAAMARARARRAAVKAKKKGLIDASPNE 397
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
+ E EE + + + EA + ++
Sbjct: 398 DTPSENEESKGSPPQVEATTTAEPNREP 425
Score = 33.7 bits (77), Expect = 0.082
Identities = 15/93 (16%), Positives = 35/93 (37%), Gaps = 5/93 (5%)
Query: 178 KSKLKAKEKRKA----SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
K+ L + + +P A ++ E +K + + K KK K+ + +++
Sbjct: 302 KAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKARKKEIAQKR-RAAERE 360
Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKK 266
++ +E + + + A K KK
Sbjct: 361 INREARQERAAAMARARARRAAVKAKKKGLIDA 393
Score = 33.3 bits (76), Expect = 0.13
Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 1/61 (1%)
Query: 210 EQTDSEKEAKKRKKHKK-DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
E + +K K K KK+ +K + E E E Q + A+ R + K K+
Sbjct: 328 ENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKK 387
Query: 269 K 269
K
Sbjct: 388 K 388
Score = 29.8 bits (67), Expect = 1.5
Identities = 14/75 (18%), Positives = 31/75 (41%), Gaps = 10/75 (13%)
Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK----------SKYEEEEEKEEGEQTDSE 252
+E KE + ++ K+++K ++ + + E + + +T +
Sbjct: 281 DEFKEHHAEQRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEK 340
Query: 253 KEAKKRKKHKKDKKK 267
EAK RKK K++
Sbjct: 341 NEAKARKKEIAQKRR 355
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 35.4 bits (81), Expect = 0.027
Identities = 22/82 (26%), Positives = 38/82 (46%), Gaps = 2/82 (2%)
Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQT 249
S D D +E+ K EG Q ++E + D ++ S + E E++E +
Sbjct: 722 SDDEPKLDAID-DEDAKSEGSQESDQEEGLDEIFYSFDGEQDNSDS-FAESSEEDESSEE 779
Query: 250 DSEKEAKKRKKHKKDKKKRKKK 271
+ E+E K K+ KKK++K
Sbjct: 780 EKEEEENKEVSAKRAKKKQRKN 801
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 35.6 bits (82), Expect = 0.028
Identities = 20/85 (23%), Positives = 34/85 (40%), Gaps = 10/85 (11%)
Query: 197 DESD-KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE-------- 247
D+ D K EE +E+ ++ + E+ KK + KK K ++ + E
Sbjct: 1169 DKEDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSA 1228
Query: 248 -QTDSEKEAKKRKKHKKDKKKRKKK 271
+T++ E K K KKK
Sbjct: 1229 METENVAEVVKPKGRAGAKKKAPAA 1253
Score = 31.8 bits (72), Expect = 0.47
Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 4/70 (5%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEG----EQTDSEKEAKKRKKHKKDKKKKKKKS 235
K+ + +K +P + S+ E E+ G E + + K + + KK
Sbjct: 1196 KVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAAK 1255
Query: 236 KYEEEEEKEE 245
+ EEE+E +
Sbjct: 1256 EKEEEDEILD 1265
Score = 31.8 bits (72), Expect = 0.49
Identities = 19/103 (18%), Positives = 31/103 (30%), Gaps = 13/103 (12%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEE-------------GEQTDSEKEAKKRKKHKK 226
+ A K+ + S SD D +++ E ++K AK KK
Sbjct: 1295 RRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERLKKKGGRKPAAANKKAAKPPAAAKK 1354
Query: 227 DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
+ + E + E E K RK K+
Sbjct: 1355 RGPATVQSGQKLLTEMLKPAEAIGISPEKKVRKMRASPFNKKS 1397
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 35.3 bits (81), Expect = 0.029
Identities = 12/74 (16%), Positives = 28/74 (37%), Gaps = 1/74 (1%)
Query: 192 TSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
++ + +E+ + + E+E+ + A + + ++ E +E E
Sbjct: 359 STPAVEETSEADIEREQPGDLAGQAPAAH-QVDAEAASAAPEEPAALASEAHDETEPEVP 417
Query: 252 EKEAKKRKKHKKDK 265
EK A K D+
Sbjct: 418 EKAAPIPDPAKPDE 431
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 35.4 bits (82), Expect = 0.030
Identities = 25/98 (25%), Positives = 38/98 (38%), Gaps = 1/98 (1%)
Query: 175 LVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
L + K K KR S S K + + G S KK KK + KK
Sbjct: 1265 LSKEGKPKNAPKR-VSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKK 1323
Query: 235 SKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K E++ + + K+A + + ++ RKKKS
Sbjct: 1324 KKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKS 1361
Score = 33.5 bits (77), Expect = 0.11
Identities = 12/84 (14%), Positives = 31/84 (36%), Gaps = 11/84 (13%)
Query: 176 VLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
+ S K+K+K+ +A +S + + + + + +KKK
Sbjct: 1314 LEGSLAALKKKKKSEKKTARKKKSKTRVK-----------QASASQSSRLLRRPRKKKSD 1362
Query: 236 KYEEEEEKEEGEQTDSEKEAKKRK 259
E+++ E + ++ E +
Sbjct: 1363 SSSEDDDDSEVDDSEDEDDEDDED 1386
Score = 33.5 bits (77), Expect = 0.12
Identities = 18/76 (23%), Positives = 35/76 (46%), Gaps = 1/76 (1%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
+E + +E++ + + + +K K KK+KKK KK S + ++ G + + K
Sbjct: 1148 EEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKK-KKSSADKSKKASVVGNSKRVDSDEK 1206
Query: 257 KRKKHKKDKKKRKKKS 272
++ K D KK
Sbjct: 1207 RKLDDKPDNKKSNSSG 1222
Score = 31.9 bits (73), Expect = 0.41
Identities = 24/107 (22%), Positives = 49/107 (45%), Gaps = 11/107 (10%)
Query: 177 LKSKLKAKEK-----RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKK- 230
L ++L+ KEK + +P ++ DK EE EE E+ + ++ AK+++ K K K
Sbjct: 1107 LNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKSKTKGKA 1166
Query: 231 -----KKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K K K +++++ + + ++ +K+K K
Sbjct: 1167 SKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKP 1213
Score = 30.8 bits (70), Expect = 0.89
Identities = 14/106 (13%), Positives = 35/106 (33%)
Query: 161 SRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKK 220
D + ++ A + P+ ES+ + ++ ++
Sbjct: 1258 DEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSKPSSPTKKKVKKRLEGS 1317
Query: 221 RKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKK 266
KK KK +KK ++ ++ + + + +R + KK
Sbjct: 1318 LAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDS 1363
Score = 30.4 bits (69), Expect = 1.2
Identities = 23/95 (24%), Positives = 40/95 (42%), Gaps = 6/95 (6%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K KLK KEK+K +DK ++ G + + K++ K D KK
Sbjct: 1171 KPKLKKKEKKK------KKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSD 1224
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+E++E+++ + S + K KK+ K
Sbjct: 1225 QEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDE 1259
Score = 29.6 bits (67), Expect = 1.9
Identities = 19/101 (18%), Positives = 39/101 (38%), Gaps = 1/101 (0%)
Query: 173 NELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
++ K+K K ++ +S +S ++ +E + + K K+ +
Sbjct: 1227 DDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPP 1286
Query: 233 KKSKYEEEEEKEEGEQTDSEKEAK-KRKKHKKDKKKRKKKS 272
SK + E + + K+ KR + K+KKKS
Sbjct: 1287 PPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKS 1327
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 33.4 bits (77), Expect = 0.035
Identities = 13/64 (20%), Positives = 34/64 (53%)
Query: 208 EGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
E E+ + + + K K+K K +KK + ++ E ++ + + EK + +K +K+ ++
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60
Query: 268 RKKK 271
+ +
Sbjct: 61 LEAE 64
Score = 32.2 bits (74), Expect = 0.096
Identities = 18/74 (24%), Positives = 36/74 (48%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
E +K +KE + EK ++K+ +K + +K EE+ + E E+ + E E +
Sbjct: 3 EEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELE 62
Query: 258 RKKHKKDKKKRKKK 271
+ +++ K KK
Sbjct: 63 AELARRELKAEAKK 76
Score = 31.5 bits (72), Expect = 0.16
Identities = 18/65 (27%), Positives = 34/65 (52%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+E+ K E +Q + + EA+K K ++K + + K E+E E+ E E E +A+ +K
Sbjct: 18 AKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELKAEAKKM 77
Query: 261 HKKDK 265
+
Sbjct: 78 LSEKG 82
Score = 28.8 bits (65), Expect = 1.4
Identities = 15/68 (22%), Positives = 32/68 (47%)
Query: 205 EKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKD 264
E EE + ++ K K K +KK+++ K E E+ + + +E E +K +K ++
Sbjct: 1 EPEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEE 60
Query: 265 KKKRKKKS 272
+ +
Sbjct: 61 LEAELARR 68
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 34.5 bits (79), Expect = 0.036
Identities = 23/83 (27%), Positives = 44/83 (53%), Gaps = 3/83 (3%)
Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
+ + + ++EEEE EE E+TDSE + + R K +KK + +E E+E
Sbjct: 1 ETEVLELEEEDESGEEEEEESEEEEETDSEDDMEPRLKPVFTRKKDRITI---QEREREA 57
Query: 246 GEQTDSEKEAKKRKKHKKDKKKR 268
++ E+EAK++ + +K + +
Sbjct: 58 AKEKALEEEAKRKAEERKRETLK 80
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 34.6 bits (80), Expect = 0.036
Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 2/69 (2%)
Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
EEE E + E+ AKK+KK K KKK KK + E +E + + +
Sbjct: 166 EEEVELLKARLEEERAKKKKK--KKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGE 223
Query: 263 KDKKKRKKK 271
K K+K+
Sbjct: 224 AKKLKKKRS 232
Score = 31.9 bits (73), Expect = 0.28
Identities = 19/71 (26%), Positives = 25/71 (35%), Gaps = 6/71 (8%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+EE E + + + KK+KK KK KK S E E
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSA------VPTELSSGAG 219
Query: 261 HKKDKKKRKKK 271
+ KK KKK
Sbjct: 220 QVGEAKKLKKK 230
Score = 30.4 bits (69), Expect = 0.80
Identities = 14/65 (21%), Positives = 27/65 (41%), Gaps = 1/65 (1%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
+ +AK+K+K + + E +E + + + KK KKK+S +
Sbjct: 178 EERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVG-EAKKLKKKRSIAPD 236
Query: 240 EEEKE 244
E+ E
Sbjct: 237 NEKSE 241
Score = 27.3 bits (61), Expect = 8.1
Identities = 16/60 (26%), Positives = 25/60 (41%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K K K K+ +K + T +S++ + E EAKK KK + +K Y
Sbjct: 184 KKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAPDNEKSEVY 243
Score = 27.3 bits (61), Expect = 8.3
Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 5/80 (6%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKK-----DKKKKKKKSKYEEEEEKEEGEQTDSE 252
E + E + E+ +K+ KK+KK KK + S E G+ +++
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAK 225
Query: 253 KEAKKRKKHKKDKKKRKKKS 272
K KKR ++K KS
Sbjct: 226 KLKKKRSIAPDNEKSEVYKS 245
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 35.1 bits (82), Expect = 0.037
Identities = 21/96 (21%), Positives = 48/96 (50%), Gaps = 1/96 (1%)
Query: 175 LVLKSKLKAKEKRKASPTSASSDESDKE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
V K +AK K +E+ KE E K+E E+ K R + +K+ ++++
Sbjct: 23 FVRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRN 82
Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
+ + E+ ++ E D + E ++++ + +KK+++
Sbjct: 83 ELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKE 118
Score = 32.4 bits (75), Expect = 0.27
Identities = 22/103 (21%), Positives = 53/103 (51%), Gaps = 7/103 (6%)
Query: 176 VLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQ--TDSEKEAKKRKKH--KKDKKKK 231
+ +++ +AK + + A + + + E KEE + + EKE ++R+ K +K+
Sbjct: 33 IKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLL 92
Query: 232 KKKSKYEEEEE---KEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+K+ + + E K E E EKE +++++ + K++ ++
Sbjct: 93 QKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEE 135
Score = 27.8 bits (63), Expect = 8.0
Identities = 17/70 (24%), Positives = 39/70 (55%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+ E EKE E+ + ++ +KR K++ +K + + EEE E+ E+ +K+ + KK
Sbjct: 70 RNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKK 129
Query: 261 HKKDKKKRKK 270
++ ++ ++
Sbjct: 130 EEELEELIEE 139
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 34.3 bits (79), Expect = 0.041
Identities = 22/110 (20%), Positives = 45/110 (40%), Gaps = 19/110 (17%)
Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR-----KKHKKDKKKKKKKS 235
L AK+ ++ +E K+E EKE E ++ AK++ ++ + K++
Sbjct: 93 LSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPK 152
Query: 236 KYEEEEEKEEGEQTDSEKEAKK--------------RKKHKKDKKKRKKK 271
+E E + S K + KK K+ ++KR+++
Sbjct: 153 HKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEE 202
Score = 28.2 bits (63), Expect = 4.3
Identities = 25/93 (26%), Positives = 45/93 (48%), Gaps = 8/93 (8%)
Query: 188 KASPTSASSDES---DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
AS S SS S + + E++ E ++ AK+ ++ KK +K ++K K E E+E+E
Sbjct: 62 TASSLSDSSTYSRSLKEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKERE 121
Query: 245 EGEQTDSEKEAK-----KRKKHKKDKKKRKKKS 272
E E + K ++K + K++ K
Sbjct: 122 EAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHK 154
Score = 28.2 bits (63), Expect = 5.0
Identities = 21/83 (25%), Positives = 43/83 (51%)
Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQT 249
S T++S +S +E + +EA + K +++KK K EE++K+E E+
Sbjct: 60 SSTASSLSDSSTYSRSLKEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKE 119
Query: 250 DSEKEAKKRKKHKKDKKKRKKKS 272
E E ++R +K ++ ++K+
Sbjct: 120 REEAELRQRLAKEKYEEWCRQKA 142
Score = 27.8 bits (62), Expect = 6.5
Identities = 20/91 (21%), Positives = 50/91 (54%), Gaps = 5/91 (5%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDS-EKEAKKRKKHKKDKKKKKKKSK 236
K K + AS + + S + ++ ++E ++ E + K+++ K++++++K++ K
Sbjct: 150 TPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQKREEERRKQRKK 209
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
+EEEE+ + +E+ +K K+ K + K
Sbjct: 210 QQEEEER----KQKAEEAWQKWMKNVKQRPK 236
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 34.4 bits (79), Expect = 0.042
Identities = 17/57 (29%), Positives = 26/57 (45%), Gaps = 1/57 (1%)
Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
E E K K K + + K + ++ EK + E K K H ++KK+ KKK
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVK-KLHSQEKKEEKKK 72
Score = 32.5 bits (74), Expect = 0.18
Identities = 15/73 (20%), Positives = 29/73 (39%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
+ K+K + + + E+ K + +E+ K + KK++KKK KK K
Sbjct: 19 EKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKV 78
Query: 238 EEEEEKEEGEQTD 250
+ + D
Sbjct: 79 PLQVNPAQLFVDD 91
Score = 30.1 bits (68), Expect = 0.97
Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 1/61 (1%)
Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKH 261
E+ K ++ + E K + K +K K E K+ Q ++E KK KK
Sbjct: 18 SEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQE-KKEEKKKPKKK 76
Query: 262 K 262
K
Sbjct: 77 K 77
Score = 29.0 bits (65), Expect = 2.2
Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 2/65 (3%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
ES+K + K++G K + K +K K K++ E +K ++ EK+ K
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKA--IEKFKLLIKAQMAERVKKLHSQEKKEEKKKPK 74
Query: 258 RKKHK 262
+KK
Sbjct: 75 KKKVP 79
Score = 29.0 bits (65), Expect = 2.4
Identities = 15/67 (22%), Positives = 26/67 (38%), Gaps = 7/67 (10%)
Query: 205 EKEEGEQTDSEKEAKKRKKHK-KDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKK 263
E E+ + + + K +DK +K K + +K + KK +K
Sbjct: 17 ESEKYKANKDKGNPEIYNKINSQDKAIEKFK------LLIKAQMAERVKKLHSQEKKEEK 70
Query: 264 DKKKRKK 270
K K+KK
Sbjct: 71 KKPKKKK 77
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 35.2 bits (81), Expect = 0.042
Identities = 24/93 (25%), Positives = 40/93 (43%), Gaps = 7/93 (7%)
Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
L ++ P S KEE+ E + +S+K AK++K D K KK K+ +
Sbjct: 171 LSDSDQEAVLPLVKKSPSEAKEEKA--EERKQESKKGAKRKKDASGDDKSKKAKTDRDVS 228
Query: 241 EEKEEGEQTDSEKEAKKRKKHK-----KDKKKR 268
++ S+ E+K + K KD K+
Sbjct: 229 TSTAASQKKSSDLESKLEAQSKELWSLKDDLKK 261
Score = 29.8 bits (67), Expect = 1.6
Identities = 17/40 (42%), Positives = 25/40 (62%)
Query: 233 KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
KKS E +EEK E + +S+K AK++K D K +K K+
Sbjct: 184 KKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKKAKT 223
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 33.9 bits (78), Expect = 0.042
Identities = 15/69 (21%), Positives = 43/69 (62%), Gaps = 2/69 (2%)
Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
+ +++ E + E+E +K+ + K++ ++K+++ EE E+K+E E + ++ ++ +K +
Sbjct: 15 QARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEE--ERKEREEQARKEQ 72
Query: 263 KDKKKRKKK 271
++ +K K
Sbjct: 73 EEYEKLKSS 81
Score = 33.9 bits (78), Expect = 0.047
Identities = 18/67 (26%), Positives = 44/67 (65%), Gaps = 2/67 (2%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
K+ + EE + ++EA++ ++ ++ K ++K++ + +EEEE E E+ + +KE ++RK+
Sbjct: 6 KKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELE--EEREKKKEEEERKE 63
Query: 261 HKKDKKK 267
++ +K
Sbjct: 64 REEQARK 70
Score = 33.5 bits (77), Expect = 0.062
Identities = 17/72 (23%), Positives = 43/72 (59%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+ EEK+ Q +E ++ ++ K ++K++ ++ + EE EE+ E ++ + E++ ++ +
Sbjct: 9 AKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQA 68
Query: 261 HKKDKKKRKKKS 272
K+ ++ K KS
Sbjct: 69 RKEQEEYEKLKS 80
Score = 32.0 bits (73), Expect = 0.20
Identities = 13/52 (25%), Positives = 38/52 (73%)
Query: 220 KRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+K+ K ++K+ +++ + EEEE+EE ++ + ++E +++++ + ++++ KKK
Sbjct: 5 AKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKK 56
Score = 31.6 bits (72), Expect = 0.25
Identities = 21/85 (24%), Positives = 50/85 (58%), Gaps = 13/85 (15%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
KL+ K+ R+ + + EEEE+EE ++ + ++E +++++ + +++++KKK
Sbjct: 10 KLEEKQARRQ--------QREAEEEEREERKKLEEKREGERKEEEELEEEREKKK----- 56
Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKD 264
EEE+ + + + KE ++ +K K
Sbjct: 57 EEEERKEREEQARKEQEEYEKLKSS 81
Score = 30.1 bits (68), Expect = 0.87
Identities = 16/69 (23%), Positives = 37/69 (53%)
Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
+E++K + ++E EE+ E ++ + E++ ++ + K+ ++ +K KS + EEE
Sbjct: 29 EERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEKLKSSFVVEEEG 88
Query: 244 EEGEQTDSE 252
+ D E
Sbjct: 89 TDKLSADEE 97
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 34.6 bits (79), Expect = 0.044
Identities = 23/72 (31%), Positives = 33/72 (45%), Gaps = 1/72 (1%)
Query: 202 EEEEKEEGEQTDSEKEAKKR-KKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+E+EKE + E+EA R K K K + K+K E E E +K + KK
Sbjct: 236 KEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKK 295
Query: 261 HKKDKKKRKKKS 272
K +KK K +
Sbjct: 296 ALKMEKKAIKNA 307
Score = 27.7 bits (61), Expect = 7.7
Identities = 15/63 (23%), Positives = 28/63 (44%), Gaps = 1/63 (1%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
+ K+ RK + + ++ + K E + +E EA+ KKK K+ K +
Sbjct: 241 EMKKIRKWEREAGARLKALAALKGKAE-AKNKAEIEAEALASATAVKKKAKEVMKKALKM 299
Query: 242 EKE 244
EK+
Sbjct: 300 EKK 302
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 34.6 bits (80), Expect = 0.048
Identities = 18/55 (32%), Positives = 33/55 (60%), Gaps = 2/55 (3%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
E + P+ D DKEE ++E+ E+ +++ + RK K+++KKKK+ K E+
Sbjct: 536 EDKPDGPSVWKLD--DKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 34.4 bits (79), Expect = 0.049
Identities = 23/86 (26%), Positives = 43/86 (50%), Gaps = 8/86 (9%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K K K K+++K + S +ES++E +++ + +S + + +D+ K +
Sbjct: 223 KKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSD 282
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKK 263
EEE + EKE +KRK+ KK
Sbjct: 283 SEEETE--------EKEKEKRKRLKK 300
Score = 34.0 bits (78), Expect = 0.062
Identities = 19/99 (19%), Positives = 39/99 (39%), Gaps = 4/99 (4%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEK----EAKKRKKHKKDKKKKKK 233
KS K ++ K + T + ++ + + S KK K+ K+ K+ +
Sbjct: 179 KSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFFKKKTKEKKEKKEASES 238
Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K E EEE + + ++ A+ + + + K S
Sbjct: 239 TVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPS 277
Score = 31.7 bits (72), Expect = 0.37
Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 15/109 (13%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK------ 231
+S + S DE + E+E K GE++DSE+E ++++K K+ + KK
Sbjct: 247 ESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKKMMEDED 306
Query: 232 -------KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK--KKRKKK 271
+S EEEE +E +KE +K + ++R ++
Sbjct: 307 EDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRR 355
Score = 28.6 bits (64), Expect = 3.2
Identities = 19/95 (20%), Positives = 47/95 (49%), Gaps = 4/95 (4%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K +K + +K+E+++ E E + K+ + + + +
Sbjct: 206 SLKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGL 265
Query: 238 EEEEEKEE----GEQTDSEKEAKKRKKHKKDKKKR 268
+E+E+++E GE++DSE+E ++++K K+ + K+
Sbjct: 266 DEDEDEDEPKPSGERSDSEEETEEKEKEKRKRLKK 300
Score = 27.9 bits (62), Expect = 6.1
Identities = 26/111 (23%), Positives = 36/111 (32%), Gaps = 23/111 (20%)
Query: 180 KLKAKEKRKASPTSASSD--------ESDKEEEEKEEGEQTDSEKE----------AKKR 221
K K K KR + ES EEEE EE E K+
Sbjct: 290 KEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGG 349
Query: 222 KKHKKDKKKKKKKSKYEE-----EEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
++ + + KKK K EE ++ E ++ E E K K
Sbjct: 350 RRRGRRRVMKKKTFKDEEGYLVTKKVYEWESFSEDEAEPPPTKPKPKVSTP 400
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 34.1 bits (78), Expect = 0.051
Identities = 21/56 (37%), Positives = 28/56 (50%), Gaps = 4/56 (7%)
Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE----EEEKEEGEQTDSEKE 254
+ K+E E S + K+KK KKDKKKK KS EE K + E + + E
Sbjct: 73 QAGKKETEDWFSPNQENKQKKKKKDKKKKSPKSLLLNFVTVEESKPDDEVHEQKSE 128
Score = 28.7 bits (64), Expect = 2.5
Identities = 15/33 (45%), Positives = 20/33 (60%)
Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ K+E E S + K+KK KKDKKK+ KS
Sbjct: 73 QAGKKETEDWFSPNQENKQKKKKKDKKKKSPKS 105
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 34.2 bits (79), Expect = 0.052
Identities = 16/59 (27%), Positives = 27/59 (45%)
Query: 213 DSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ K K+ KKKK++ + E E ++ S++E KK+K + K KK
Sbjct: 216 LIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKALKK 274
Score = 33.5 bits (77), Expect = 0.10
Identities = 25/94 (26%), Positives = 35/94 (37%), Gaps = 19/94 (20%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
K LK+ + P ES K++ E EE + + KKRK ++ KKKK K+SK
Sbjct: 208 YKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESK 267
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
+ K K KK
Sbjct: 268 -------------------GVKALKKVVAKGMKK 282
Score = 32.7 bits (75), Expect = 0.15
Identities = 15/62 (24%), Positives = 24/62 (38%)
Query: 173 NELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
L++ + K S E D E E ++ S++E KK+K + K
Sbjct: 213 KSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEIKKKKPKESKGVKAL 272
Query: 233 KK 234
KK
Sbjct: 273 KK 274
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 34.2 bits (79), Expect = 0.054
Identities = 30/100 (30%), Positives = 48/100 (48%), Gaps = 11/100 (11%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKD-----KKKK 231
++ KL A+ K + S+E+ K+ E K+ G+Q EK ++R K KKD KK K
Sbjct: 120 VRKKLLAEAAAKKA-----SEEARKQRELKKFGKQVQVEKL-QERAKEKKDMLEKIKKLK 173
Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
KK+ +E + + D+ K K+K K + K
Sbjct: 174 KKRKGGGDEGDLFDFLVDDAAKGGSKKKGRKGGAARGKPN 213
>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 215
Score = 33.8 bits (77), Expect = 0.056
Identities = 15/58 (25%), Positives = 31/58 (53%)
Query: 179 SKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
SK +AK +R A + + E+ + E+ + EQ + +K+ +++K K+ K + K
Sbjct: 57 SKKRAKVQRLAKKSRVDAREAYRIAAEENKAEQLERDKQLSEQQKQAALAKEYKAQVK 114
Score = 29.2 bits (65), Expect = 1.7
Identities = 15/56 (26%), Positives = 29/56 (51%), Gaps = 2/56 (3%)
Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
S+K AK ++ KK + ++ + EE K EQ + +K+ +++K K+ K
Sbjct: 57 SKKRAKVQRLAKKSRVDAREAYRIAAEENKA--EQLERDKQLSEQQKQAALAKEYK 110
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 34.3 bits (77), Expect = 0.059
Identities = 24/93 (25%), Positives = 55/93 (59%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
K+K EK+KA A E++++ K+ E+ EA+ +K+ ++++ K+ K++ E
Sbjct: 69 KVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQWKAEQER 128
Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
++++E ++ D +K ++KK K K ++ +K+
Sbjct: 129 IQKEQEKKEADLKKLQAEKKKEKAVKAEKAEKA 161
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 34.2 bits (79), Expect = 0.063
Identities = 24/94 (25%), Positives = 43/94 (45%), Gaps = 21/94 (22%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
KL + RK T +E + E+E E EA+++K+ KK ++++ K +K
Sbjct: 250 NLKLSPEVLRKVDKTREEEEEKILKAAEEERQE------EAQEKKEEKKKEEREAKLAKL 303
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
EE+ RK +K++KK+ +K
Sbjct: 304 SPEEQ---------------RKLEEKERKKQARK 322
Score = 30.7 bits (70), Expect = 0.76
Identities = 13/54 (24%), Positives = 29/54 (53%), Gaps = 1/54 (1%)
Query: 220 KRKKHKKDKKKKKKKSKYEEEEE-KEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K + +K K++ EEEE+ + E+ E+ +K+++ KK++++ K
Sbjct: 249 ANLKLSPEVLRKVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK 302
Score = 29.1 bits (66), Expect = 2.3
Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 4/67 (5%)
Query: 168 RVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKD 227
+V K E + LKA E+ + +E KEE E + + S +E +K + +K+
Sbjct: 260 KVDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAK--LSPEEQRKLE--EKE 315
Query: 228 KKKKKKK 234
+KK+ +K
Sbjct: 316 RKKQARK 322
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 33.2 bits (76), Expect = 0.064
Identities = 21/66 (31%), Positives = 37/66 (56%)
Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
++ E+E++E + EKEA K KK KK+KK+KK K +++ + + T + K
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTT 152
Query: 260 KHKKDK 265
K ++ K
Sbjct: 153 KKEEGK 158
Score = 29.7 bits (67), Expect = 0.96
Identities = 16/66 (24%), Positives = 31/66 (46%)
Query: 207 EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKK 266
EE E+ E + +K KK+KK+ K ++ EK +++ + +K KK
Sbjct: 93 EESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTTT 152
Query: 267 KRKKKS 272
K+++
Sbjct: 153 KKEEGK 158
Score = 27.8 bits (62), Expect = 4.8
Identities = 15/63 (23%), Positives = 28/63 (44%), Gaps = 1/63 (1%)
Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK-EEGEQTDSEKEAKKRKKHKKDKKKR 268
+ +++ + K+ ++ + K KK K E++E+K E + K K KK
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKKTT 151
Query: 269 KKK 271
KK
Sbjct: 152 TKK 154
Score = 27.0 bits (60), Expect = 7.1
Identities = 12/58 (20%), Positives = 32/58 (55%)
Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+E++K +K + ++K+ K ++E+++++ ++ + KK K K+ K+ K
Sbjct: 92 PEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKATKK 149
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 34.3 bits (78), Expect = 0.066
Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 2/77 (2%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK--KKKKKSKYEEEEEKEEGEQTDSEKE 254
+E D ++EE + ++ + E + K + + K K K K + +EE+EE +
Sbjct: 491 EEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEEEKKLKMIMMS 550
Query: 255 AKKRKKHKKDKKKRKKK 271
K++K +KK K KK
Sbjct: 551 NKQKKLYKKMKYSNAKK 567
Score = 33.1 bits (75), Expect = 0.16
Identities = 26/83 (31%), Positives = 40/83 (48%), Gaps = 10/83 (12%)
Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
K S TS + + +K + +K + D E+E KK K K+KK K + K+
Sbjct: 514 KYSETSEADKDVNKSKNKKRK---VDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKK--- 567
Query: 248 QTDSEKEAKKRKKHKKDKKKRKK 270
E++A+ KK KK K+KK
Sbjct: 568 ----EEQAENLKKKKKQIAKQKK 586
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 33.5 bits (77), Expect = 0.067
Identities = 12/36 (33%), Positives = 16/36 (44%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
D KEE EE ++ + K K + KKKK
Sbjct: 161 DAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 30.4 bits (69), Expect = 0.75
Identities = 12/42 (28%), Positives = 18/42 (42%), Gaps = 4/42 (9%)
Query: 228 KKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
+K K+K ++ EE KE K K + KK+K
Sbjct: 159 RKDAKQKEEFAAEERKEALA----AAAKKSATPQKVETKKKK 196
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 33.5 bits (77), Expect = 0.071
Identities = 18/68 (26%), Positives = 30/68 (44%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
K + +E Q + +AKK+ KK K K+ + +E K E E ++ AKK K
Sbjct: 6 KRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKKPGK 65
Query: 261 HKKDKKKR 268
+ +
Sbjct: 66 FYVPAEHK 73
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 34.0 bits (79), Expect = 0.074
Identities = 30/131 (22%), Positives = 55/131 (41%), Gaps = 32/131 (24%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSE--KEAKKRKKHKKDKKKK 231
+L+ +++ +A++ K + A + + +K + EA+KR +KK+K
Sbjct: 566 KLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEK 625
Query: 232 KKKSKYEEEEEKEEGE--------------QTDSEKEA----------------KKRKKH 261
KKK + E++EE + G+ +KEA +K +K
Sbjct: 626 KKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAIVQAGIMKMKVPLSDLEKIQKP 685
Query: 262 KKDKKKRKKKS 272
KK KKK+ K
Sbjct: 686 KKKKKKKPKTV 696
Score = 30.6 bits (70), Expect = 0.94
Identities = 23/65 (35%), Positives = 37/65 (56%), Gaps = 9/65 (13%)
Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
AS +E ++E E+K E E EA ++ K ++ ++KK K +EEE+K E EK
Sbjct: 523 ASLEELERELEQKAE------EAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEA---EK 573
Query: 254 EAKKR 258
EA++
Sbjct: 574 EAQQA 578
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 33.5 bits (77), Expect = 0.077
Identities = 19/76 (25%), Positives = 38/76 (50%)
Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
++ E E + Q + +++K K KKK+K + +E +E+E+ + ++
Sbjct: 145 TTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRL 204
Query: 255 AKKRKKHKKDKKKRKK 270
+R+ KK K K+KK
Sbjct: 205 ELQRELMKKGKGKKKK 220
Score = 28.9 bits (65), Expect = 2.6
Identities = 20/81 (24%), Positives = 36/81 (44%), Gaps = 1/81 (1%)
Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK-SKYEEEEEKEEGEQTDS 251
+ +E + + S + K++KK K K+K K+ + +E E+K + +
Sbjct: 145 TTPELLDRRENRPRISQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRL 204
Query: 252 EKEAKKRKKHKKDKKKRKKKS 272
E + + KK K KKK K
Sbjct: 205 ELQRELMKKGKGKKKKIVKDK 225
Score = 28.5 bits (64), Expect = 3.5
Identities = 19/70 (27%), Positives = 39/70 (55%)
Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
+ E+ E+ + KKRK +K+ K++K+++ K ++ E++ E ++ +K K+K
Sbjct: 160 SQLEKTSLVDEKQKKKSAKKKRKLYKELKERKEREKKLKKVEQRLELQRELMKKGKGKKK 219
Query: 260 KHKKDKKKRK 269
K KDK +
Sbjct: 220 KIVKDKDGKV 229
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 33.7 bits (77), Expect = 0.083
Identities = 28/106 (26%), Positives = 42/106 (39%), Gaps = 13/106 (12%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKK-----------DK 228
K K K+ S E+E ++ E D ++ KK K K +K
Sbjct: 262 KPKEKKTSSTFLPSLMGGYFSGSEDEDDDDEDIDPDQVVKKPVKRKNRRGQRARQAIWEK 321
Query: 229 K--KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K K K E E+E++E E SE EA++ K+ D K +
Sbjct: 322 KYGSGAKHVKKEREKEQKEREGRQSEWEARQAKREGGDAKAGRAAE 367
Score = 32.9 bits (75), Expect = 0.15
Identities = 17/89 (19%), Positives = 40/89 (44%), Gaps = 8/89 (8%)
Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK--------KSKYEEEE 241
+ ++ S+ + + E E +DSE+ + KK K+ K S E+E+
Sbjct: 229 NEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGSEDED 288
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
+ +E D + ++K+++ ++ R+
Sbjct: 289 DDDEDIDPDQVVKKPVKRKNRRGQRARQA 317
Score = 28.3 bits (63), Expect = 4.3
Identities = 20/78 (25%), Positives = 34/78 (43%), Gaps = 7/78 (8%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+ E+E++E E SE EA++ K+ D K + +++ EK+ K+
Sbjct: 333 EREKEQKEREGRQSEWEARQAKREGGDAKAGRAAEPTGSRTQQKGDRPKRGEKKKPKKPS 392
Query: 261 HKKD-------KKKRKKK 271
K KKK K+K
Sbjct: 393 VDKPLHPSWEAKKKAKEK 410
Score = 28.3 bits (63), Expect = 4.5
Identities = 20/99 (20%), Positives = 42/99 (42%), Gaps = 10/99 (10%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQT----------DSEKEAKKRKKHKKDKKKK 231
+++ + ++ ++ D+SD EEEE + E E+E ++ ++
Sbjct: 174 ESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGEEAPSINYNEDTS 233
Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
+ +S + E E +DSE+ + KK K+ K
Sbjct: 234 ESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTF 272
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 33.1 bits (76), Expect = 0.089
Identities = 14/36 (38%), Positives = 18/36 (50%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
D EE EE E + KE + + KK+ KKKK
Sbjct: 161 DAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 30.0 bits (68), Expect = 0.83
Identities = 10/36 (27%), Positives = 20/36 (55%)
Query: 236 KYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
K + E+ E+ ++ AK+ + + +KK+ KKK
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195
Score = 28.8 bits (65), Expect = 2.0
Identities = 11/41 (26%), Positives = 18/41 (43%), Gaps = 4/41 (9%)
Query: 229 KKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
K K ++ EE + KE + + KK+ KK+K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAA----KEEESAEGEKKESKKKK 196
Score = 27.7 bits (62), Expect = 6.1
Identities = 9/36 (25%), Positives = 19/36 (52%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
K+ + EE ++E A +++ + +KK+ K K
Sbjct: 160 KDAKLLEEFAAEEAEAAAAAKEEESAEGEKKESKKK 195
>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090). This
family consists of several bacterial proteins of unknown
function and is known as YqjC in E. coli.
Length = 115
Score = 31.8 bits (73), Expect = 0.12
Identities = 12/54 (22%), Positives = 31/54 (57%)
Query: 217 EAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
+A R + + ++K+++ E+E KE + D++K KK++K + + + ++
Sbjct: 61 DAGLRAERQAKVEEKEQEVAEREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends
to by Lys/Arg-rich, and is followed by a
membrane-spanning region. This is followed by an acidic
low-complexity region of variable length and a
well-conserved C-terminal domain of two tandem regions
matched by pfam05036 (Sporulation related repeat), found
in several cell division and sporulation proteins. The
role of FtsN as a suppressor for other cell division
mutations is poorly understood; it may involve cell wall
hydrolysis [Cellular processes, Cell division].
Length = 298
Score = 33.1 bits (75), Expect = 0.12
Identities = 22/94 (23%), Positives = 38/94 (40%), Gaps = 4/94 (4%)
Query: 182 KAKEKRKASPTSASSD----ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
+ K T+ S E+ K+ EK+ + +E + + K K K+ K K
Sbjct: 128 DMRAAEKVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQKQ 187
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ ++ Q++S+ K K DK K K K
Sbjct: 188 KALPKQTAETQSNSKPIETAPKADKADKTKPKPK 221
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 33.2 bits (76), Expect = 0.13
Identities = 14/72 (19%), Positives = 30/72 (41%), Gaps = 6/72 (8%)
Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
+ ++ ++D +E K +K + E+ E D E+ ++ +
Sbjct: 302 RAKPSDEPSLPESDIHEE------IPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVE 355
Query: 260 KHKKDKKKRKKK 271
K +K KKK +K+
Sbjct: 356 KKQKVKKKPRKR 367
Score = 30.1 bits (68), Expect = 1.3
Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 14/90 (15%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
++K + R S ESD EE + E++ SE +E
Sbjct: 295 RVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFL--------------GYMGGIDE 340
Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
++E E+ E++ E E K++ K K K+K
Sbjct: 341 DDEDEDDEESKEEVEKKQKVKKKPRKRKVN 370
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 32.1 bits (74), Expect = 0.15
Identities = 12/35 (34%), Positives = 21/35 (60%), Gaps = 2/35 (5%)
Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
+KE K K +KK +K K+K E+++ K+E +
Sbjct: 64 DKKELKAW--EKAEKKAEKAKAKAEKKKAKKEEPK 96
Score = 28.3 bits (64), Expect = 3.0
Identities = 10/28 (35%), Positives = 14/28 (50%)
Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
K KK +K K K +KK +EE +
Sbjct: 70 AWEKAEKKAEKAKAKAEKKKAKKEEPKP 97
Score = 27.5 bits (62), Expect = 4.6
Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 1/54 (1%)
Query: 218 AKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
A+K+K K + + +Y++ +E E +KE K +K +K +K K K
Sbjct: 32 AQKKKGKKGELEITDLNEEYKDLKESLE-AALLDKKELKAWEKAEKKAEKAKAK 84
>gnl|CDD|153340 cd07656, F-BAR_srGAP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Slit-Robo GTPase
Activating Proteins. F-BAR domains are dimerization
modules that bind and bend membranes and are found in
proteins involved in membrane dynamics and actin
reorganization. Slit-Robo GTPase Activating Proteins
(srGAPs) are Rho GAPs that interact with Robo1, the
transmembrane receptor of Slit proteins. Slit proteins
are secreted proteins that control axon guidance and the
migration of neurons and leukocytes. Vertebrates contain
three isoforms of srGAPs, all of which are expressed
during embryonic and early development in the nervous
system but with different localization and timing.
srGAPs contain an N-terminal F-BAR domain, a Rho GAP
domain, and a C-terminal SH3 domain. F-BAR domains form
banana-shaped dimers with a positively-charged concave
surface that binds to negatively-charged lipid
membranes. They can induce membrane deformation in the
form of long tubules.
Length = 241
Score = 32.7 bits (75), Expect = 0.15
Identities = 16/49 (32%), Positives = 32/49 (65%), Gaps = 1/49 (2%)
Query: 192 TSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKK-DKKKKKKKSKYEE 239
S S++ KE E++EE ++ EK+ ++ + KK +K+ +K+++KY E
Sbjct: 146 ESKSAERKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAKYSE 194
Score = 30.4 bits (69), Expect = 0.87
Identities = 12/40 (30%), Positives = 24/40 (60%)
Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+K + E++EEK+E + ++ KK +K+ +KR+ K
Sbjct: 152 RKLKEAEKQEEKQEQSPEKKLERSRSSKKIEKEVEKRQAK 191
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 32.9 bits (75), Expect = 0.15
Identities = 24/94 (25%), Positives = 40/94 (42%), Gaps = 1/94 (1%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
KA K K++ S ++ K ++ +T ++ K +K KK KK K K
Sbjct: 210 SGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKK 269
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
++ + + ++ AK K K KKK KK
Sbjct: 270 AAKKAAKAAAKA-AKGAAKATKGKAKAKKKAGKK 302
Score = 32.2 bits (73), Expect = 0.23
Identities = 25/102 (24%), Positives = 42/102 (41%), Gaps = 8/102 (7%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDS--------EKEAKKRKKHKKDKK 229
K+ A +KA+ T+A + + K+ +K + + K A K K
Sbjct: 230 KAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAVKKAAKKAAKAAAKAAKGAAKAT 289
Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
K K K+K + ++ G + + +A KR K KK KK
Sbjct: 290 KGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331
Score = 27.5 bits (61), Expect = 7.1
Identities = 25/99 (25%), Positives = 39/99 (39%), Gaps = 8/99 (8%)
Query: 176 VLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
VL + A K+ A +A S + K ++ +K AK KK K K +
Sbjct: 190 VLAAVGAANAKKAAKTPAAKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAA 249
Query: 236 KYEEEEEKEEGEQTDSEKEAKK--RKKHKKDKKKRKKKS 272
K ++ K+ K+A K +K KK K K +
Sbjct: 250 KKAKKTAKKAL------KKAAKAVKKAAKKAAKAAAKAA 282
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 33.0 bits (75), Expect = 0.16
Identities = 21/75 (28%), Positives = 37/75 (49%)
Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE 252
++ DE ++EE+E E E D E EA + + + ++EE+K E E+ E
Sbjct: 113 DSADDEEEEEEDESLEDEMIDDEDEADLFNESESSLEDLSDDETEDDEEKKMEEEEAGEE 172
Query: 253 KEAKKRKKHKKDKKK 267
KE+ ++ +K K
Sbjct: 173 KESVEQATREKKFDK 187
Score = 28.4 bits (63), Expect = 4.9
Identities = 21/69 (30%), Positives = 35/69 (50%), Gaps = 5/69 (7%)
Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE-----EEEKEEGE 247
S+ D SD E E+ EE + + E +K + ++KK KS ++ +E E E
Sbjct: 146 SSLEDLSDDETEDDEEKKMEEEEAGEEKESVEQATREKKFDKSGVDDKFFKLDEMNEFLE 205
Query: 248 QTDSEKEAK 256
T++E+EA
Sbjct: 206 ATEAEEEAA 214
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 33.1 bits (76), Expect = 0.16
Identities = 26/88 (29%), Positives = 43/88 (48%), Gaps = 8/88 (9%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
K KE+ + + EE+E E T EK KK ++ + +++ + K E EE
Sbjct: 396 KVKEEERPRE--------KEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEE 447
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
K E E+ +SE E +R+ K +K R+
Sbjct: 448 LKREIEKLESELERFRREVRDKVRKDRE 475
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 32.4 bits (74), Expect = 0.16
Identities = 22/79 (27%), Positives = 39/79 (49%), Gaps = 1/79 (1%)
Query: 177 LKSKLKA-KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
L S + K+ K S + EE+E+ ++ S E K ++ +D ++ +K
Sbjct: 116 LYSVMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLKELEAEDVDEEDEKD 175
Query: 236 KYEEEEEKEEGEQTDSEKE 254
+ EEEEE+EE E D + +
Sbjct: 176 EEEEEEEEEEDEDFDDDDD 194
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 32.1 bits (73), Expect = 0.16
Identities = 19/85 (22%), Positives = 43/85 (50%)
Query: 179 SKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE 238
KA EK + S +S +EE++ E E ++ + ++ ++++K++ ++ E
Sbjct: 32 IIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEE 91
Query: 239 EEEEKEEGEQTDSEKEAKKRKKHKK 263
E+EE+ E + K++K K+
Sbjct: 92 EKEEEAEDVKQQEVFSFKRKKPFKE 116
Score = 27.1 bits (60), Expect = 8.4
Identities = 22/85 (25%), Positives = 44/85 (51%), Gaps = 1/85 (1%)
Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
+KA + SD+ + KEE EQ +EA+K K+ + +++ + + +EE +EE
Sbjct: 33 IKKADEKEEEKENSDEHVKSKEE-EQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQEE 91
Query: 246 GEQTDSEKEAKKRKKHKKDKKKRKK 270
++ ++E ++ K KK K+
Sbjct: 92 EKEEEAEDVKQQEVFSFKRKKPFKE 116
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 32.1 bits (73), Expect = 0.16
Identities = 25/84 (29%), Positives = 38/84 (45%)
Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
+ K++ S D E+ E + S+K+ +K K KK KKKK + EE E
Sbjct: 21 RAHVKSTTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPEL 80
Query: 244 EEGEQTDSEKEAKKRKKHKKDKKK 267
E +DSE+ + KK + K
Sbjct: 81 AAEELSDSEENEENDKKVDYELPK 104
Score = 30.5 bits (69), Expect = 0.63
Identities = 20/91 (21%), Positives = 39/91 (42%)
Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
++A K +D+ E+ E + + +K ++KK K KKKKK+ + E
Sbjct: 20 IRAHVKSTTESVDKVNDDIFPEDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKEEVPE 79
Query: 241 EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
EE ++ +E K+ ++ K +
Sbjct: 80 LAAEELSDSEENEENDKKVDYELPKVQNTAA 110
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 31.5 bits (72), Expect = 0.18
Identities = 18/42 (42%), Positives = 31/42 (73%), Gaps = 3/42 (7%)
Query: 221 RKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
+ +H+++K++KKK+SK ++EEEKE Q K+ KK+ KH+
Sbjct: 92 KLEHERNKQEKKKRSKEKKEEEKERKRQL---KQQKKKAKHR 130
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 33.1 bits (76), Expect = 0.18
Identities = 23/86 (26%), Positives = 42/86 (48%), Gaps = 2/86 (2%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
L+S+L+ E R + SD S K E ++E EQ + E+E K + ++ ++ +
Sbjct: 693 LQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKER--LEELEEDLSSLE 750
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHK 262
E E K E ++ ++ E + HK
Sbjct: 751 QEIENVKSELKELEARIEELEEDLHK 776
Score = 30.8 bits (70), Expect = 0.97
Identities = 17/84 (20%), Positives = 39/84 (46%), Gaps = 4/84 (4%)
Query: 192 TSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
+ + E + E+E +E ++ + + + + K+ + KK + EEE E+ E D
Sbjct: 821 LNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDL 880
Query: 252 EKE----AKKRKKHKKDKKKRKKK 271
E K+R + + ++ ++K
Sbjct: 881 ESRLGDLKKERDELEAQLRELERK 904
>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752). This
domain family is found in eukaryotes, and is typically
between 140 and 163 amino acids in length.
Length = 148
Score = 31.6 bits (72), Expect = 0.18
Identities = 25/95 (26%), Positives = 45/95 (47%), Gaps = 2/95 (2%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
+L+ + + SA + S K + +E + E KKR K ++ +KK+K K +E
Sbjct: 45 RLQDEVLGVEAGASAPAAASAKASKRDKEMARKVKEYNEKKRGKSLVEQHQKKQKKKKKE 104
Query: 240 EEEKEEGEQT-DSEKE-AKKRKKHKKDKKKRKKKS 272
EE + + D EK+ A K K +++ K+
Sbjct: 105 EENDDPSRRPFDREKDLAVGGKISKAQRRELINKA 139
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 33.1 bits (75), Expect = 0.19
Identities = 18/75 (24%), Positives = 38/75 (50%), Gaps = 5/75 (6%)
Query: 195 SSDESDKEEEEKEEGE-----QTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQT 249
S DESD+ EEE E E ++D E ++ + +D + + ++ +EE+ E+ ++
Sbjct: 927 SDDESDESEEEVSEYEASSDDESDETDEDEESDESSEDLSEDESENDSSDEEDGEDWDEL 986
Query: 250 DSEKEAKKRKKHKKD 264
+S+ R ++
Sbjct: 987 ESKAAYDSRPGKRRK 1001
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 33.0 bits (76), Expect = 0.19
Identities = 17/103 (16%), Positives = 46/103 (44%), Gaps = 8/103 (7%)
Query: 178 KSKLKA-KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDK-------K 229
K++++A ++++K + + + E+ + E+E+ + K+A + + K +
Sbjct: 435 KAEIRAIEQEKKKAEEAKARFEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALAR 494
Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K KK+ + + G + D+ R+ K + R+ +
Sbjct: 495 VKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQAEK 537
Score = 28.4 bits (64), Expect = 5.2
Identities = 17/102 (16%), Positives = 40/102 (39%), Gaps = 10/102 (9%)
Query: 180 KLKAKEKRKASPTSASSD----------ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK 229
K +A+ ++ +A++D K ++ ++ ++E+E +K
Sbjct: 527 KAQARARQAEKQAAAAADPKKAAVAAAIARAKAKKAAQQAANAEAEEEVDPKKAAVAAAI 586
Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ K K ++ E E+ +E + KK + + KK
Sbjct: 587 ARAKAKKAAQQAASAEPEEQVAEVDPKKAAVAAAIARAKAKK 628
Score = 28.0 bits (63), Expect = 7.0
Identities = 19/93 (20%), Positives = 32/93 (34%), Gaps = 5/93 (5%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHK-KDKKKKKKKSKYEEE 240
KA E R A A + + + +K Q K + ++ +K +
Sbjct: 475 KAAEARAAKDKDAVAAALARVKAKKAAATQPIVIKAGARPDNSAVIAAREARKAQARARQ 534
Query: 241 EEKEEGEQTDSEKE----AKKRKKHKKDKKKRK 269
EK+ D +K A R K KK ++
Sbjct: 535 AEKQAAAAADPKKAAVAAAIARAKAKKAAQQAA 567
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 32.8 bits (75), Expect = 0.21
Identities = 15/75 (20%), Positives = 40/75 (53%), Gaps = 8/75 (10%)
Query: 205 EKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEG-------EQTDSEKEAKK 257
+K++G +EK++ + + +++K+ + + E ++EK+ ++ + E++ +
Sbjct: 1504 QKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAE 1563
Query: 258 RKKHKKDKKKRKKKS 272
KK K K++ KS
Sbjct: 1564 SDI-KKRKNKKQYKS 1577
Score = 32.8 bits (75), Expect = 0.25
Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 2/62 (3%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKE-EGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE 238
+EK A SD+ K E ++ + E++ + KK K KK+ KS E
Sbjct: 1522 NRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDI-KKRKNKKQYKSNTE 1580
Query: 239 EE 240
E
Sbjct: 1581 AE 1582
Score = 30.8 bits (70), Expect = 0.86
Identities = 17/53 (32%), Positives = 26/53 (49%), Gaps = 1/53 (1%)
Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
EKE + + + DK+KK +EK E+ +E + KKRK K+ K
Sbjct: 1527 EKEPAGQGELESDKEKKGNLESVLSNQEKNI-EEDYAESDIKKRKNKKQYKSN 1578
Score = 30.5 bits (69), Expect = 1.1
Identities = 17/44 (38%), Positives = 25/44 (56%)
Query: 208 EGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
+ E S+ +K KK KK +KKK+++ K EE+ E E DS
Sbjct: 732 DAEFKISDSVEEKTKKKKKKEKKKEEEYKREEKARIEIAEAWDS 775
Score = 29.7 bits (67), Expect = 2.1
Identities = 19/74 (25%), Positives = 32/74 (43%), Gaps = 16/74 (21%)
Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK-------KHKK--- 263
++K +K+K +KK + + +EEKE Q + E + +K+ +K
Sbjct: 1499 NKKINQKKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIE 1558
Query: 264 ------DKKKRKKK 271
D KKRK K
Sbjct: 1559 EDYAESDIKKRKNK 1572
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 32.7 bits (74), Expect = 0.22
Identities = 20/76 (26%), Positives = 37/76 (48%)
Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
SDE DK+ + D +K +K K + + SK E+ + E+ + +
Sbjct: 145 SDEDDKDPNDPYRALDIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKK 204
Query: 256 KKRKKHKKDKKKRKKK 271
K++K+ +K++ K KKK
Sbjct: 205 KEKKEKEKERDKDKKK 220
Score = 32.0 bits (72), Expect = 0.38
Identities = 14/60 (23%), Positives = 30/60 (50%)
Query: 191 PTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTD 250
P SD+ K EEE+ ++ + +K KK+++ +K +++ E +E+ + D
Sbjct: 73 PGLPMSDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVD 132
Score = 30.8 bits (69), Expect = 0.81
Identities = 32/126 (25%), Positives = 53/126 (42%), Gaps = 45/126 (35%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHK-KDKKKKKKKSKY------ 237
+K + + TS S ++ D EK+ S+K KK KK K K++ K KKK
Sbjct: 176 QKHRNAETSKSPEKGDVPAVEKK------SKKPKKKEKKEKEKERDKDKKKEVEGFKSLL 229
Query: 238 -------------------------------EEEEEKEEGEQTDSEKEAK-KRKKHKKDK 265
E +E ++ E +++K K K+KK +K+K
Sbjct: 230 LALDDSPASAASVAEADEASLANTVSGTAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEK 289
Query: 266 KKRKKK 271
+++KKK
Sbjct: 290 EEKKKK 295
Score = 28.9 bits (64), Expect = 3.1
Identities = 16/56 (28%), Positives = 29/56 (51%)
Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
+A E D+ ++ + E + + + KK++K K++KKKKKK + EQ
Sbjct: 257 TAPDSEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQ 312
Score = 28.9 bits (64), Expect = 3.1
Identities = 12/40 (30%), Positives = 26/40 (65%)
Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
E+E + R++ +KDK++KKK+ K + + T+S+++
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDED 124
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 32.4 bits (74), Expect = 0.22
Identities = 16/64 (25%), Positives = 31/64 (48%), Gaps = 6/64 (9%)
Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEK------EEGEQTDSEKEAKKRKKHKKDKKK 267
+ EA K + +K+ KK+S + +E E ++T S++E K + K ++ K
Sbjct: 349 TAYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELK 408
Query: 268 RKKK 271
R +
Sbjct: 409 RLEN 412
Score = 31.2 bits (71), Expect = 0.53
Identities = 18/68 (26%), Positives = 28/68 (41%), Gaps = 3/68 (4%)
Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK---EAKKR 258
E +E + AKK+ K K+ K S E ++ + EQ E E KR
Sbjct: 350 AYEAIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKR 409
Query: 259 KKHKKDKK 266
++ K K+
Sbjct: 410 LENGKQKR 417
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 32.2 bits (73), Expect = 0.23
Identities = 24/95 (25%), Positives = 39/95 (41%), Gaps = 8/95 (8%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K+ + K K + TS + ++ K +T S K AKK K KK K K K
Sbjct: 11 KTTEEKKPAAKKATTSKETAKTKKTA-------KTTSTKAAKKAAKVKKTKSVKTTTKKV 63
Query: 238 EEEEEKEEG-EQTDSEKEAKKRKKHKKDKKKRKKK 271
+ EK E ++ K+ K++ + + K
Sbjct: 64 TVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNK 98
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 32.1 bits (74), Expect = 0.23
Identities = 14/45 (31%), Positives = 27/45 (60%), Gaps = 1/45 (2%)
Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
+++ +E KEE + +K+ K HK KKK+K+++K + + K
Sbjct: 46 NEQYKEMKEELKAALLDKKELK-AWHKAQKKKEKQEAKAAKAKSK 89
Score = 28.6 bits (65), Expect = 3.4
Identities = 14/55 (25%), Positives = 26/55 (47%), Gaps = 1/55 (1%)
Query: 218 AKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK-RKKHKKDKKKRKKK 271
+RKK +K + + ++ +E ++E +KE K K KK +K+ K
Sbjct: 29 LAQRKKSQKGELEVTNLNEQYKEMKEELKAALLDKKELKAWHKAQKKKEKQEAKA 83
Score = 28.3 bits (64), Expect = 4.2
Identities = 16/46 (34%), Positives = 24/46 (52%), Gaps = 7/46 (15%)
Query: 216 KEAKKR-KKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
KE K+ K DKK+ K K ++++EK+E K AK + K
Sbjct: 50 KEMKEELKAALLDKKELKAWHKAQKKKEKQE------AKAAKAKSK 89
>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2. A Saccharomyces
cerevisiae member of this family (PGA2) is an ER protein
which has been implicated in protein trafficking.
Length = 139
Score = 31.3 bits (71), Expect = 0.24
Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 5/77 (6%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K+K+ R + +++D EE+E++ + + KK +K +++K K
Sbjct: 60 KAKISPNALRGGATAGHGEEDTDDEEDEEDFATPSAVPQWGKKARK-----RQRKVIRKL 114
Query: 238 EEEEEKEEGEQTDSEKE 254
E EE+ +Q D E E
Sbjct: 115 LEAEEQLREDQYDDEDE 131
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 32.6 bits (74), Expect = 0.25
Identities = 25/102 (24%), Positives = 48/102 (47%), Gaps = 6/102 (5%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKK-RKKHKKDKKKKK 232
+L L S++ +K P E D K++ +T SEK + K ++KHK D++ K
Sbjct: 703 DLDLLSRIPGHPYKKGVP--PKPAEKDSLSAPKKQTSKTASEKSSSKGKRKHKNDEEADK 760
Query: 233 ---KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
KK + EE+ + S + K+ +K + ++++
Sbjct: 761 IESKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRNKEEE 802
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 32.7 bits (75), Expect = 0.25
Identities = 16/70 (22%), Positives = 34/70 (48%)
Query: 191 PTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTD 250
T + S+ EE E+E + ++ E K K+ ++ ++ EE+E+ ++ E D
Sbjct: 222 FTKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLD 281
Query: 251 SEKEAKKRKK 260
+ K++K
Sbjct: 282 KLEILKEKKD 291
Score = 30.4 bits (69), Expect = 1.3
Identities = 12/32 (37%), Positives = 22/32 (68%)
Query: 227 DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
+KKK+ K Y EE+ K+E ++ +++E K+R
Sbjct: 648 EKKKEFKILDYTEEKTKKEEKKEKNKREEKER 679
Score = 28.4 bits (64), Expect = 4.4
Identities = 14/62 (22%), Positives = 31/62 (50%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
E+++ EEE + +T SE + K+++ ++ S+ +E+ +K E + KK
Sbjct: 231 ETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLDKLEILKEKK 290
Query: 258 RK 259
+
Sbjct: 291 DE 292
Score = 28.4 bits (64), Expect = 4.5
Identities = 18/77 (23%), Positives = 35/77 (45%), Gaps = 1/77 (1%)
Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEG 246
R SP + E EE+EE + E + K K++++ ++ EEKE+
Sbjct: 216 RIPSPFFTKKLKETSETEEREEETDVEIETT-SETKGTKQEQEGSTEEDPSLFSEEKEDP 274
Query: 247 EQTDSEKEAKKRKKHKK 263
++T+ + + K+ K
Sbjct: 275 DKTEDLDKLEILKEKKD 291
>gnl|CDD|178669 PLN03122, PLN03122, Poly [ADP-ribose] polymerase; Provisional.
Length = 815
Score = 32.5 bits (74), Expect = 0.26
Identities = 19/73 (26%), Positives = 31/73 (42%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
K E R + A+ + ++K E ++ + E+ KK KK KK K E+
Sbjct: 1 KVHETRSQAHAPAAEEGKGGTRKQKAENKEHEGEQSPKKAKKEKKQDDSGNGNGKSAEDA 60
Query: 242 EKEEGEQTDSEKE 254
KE E + +E
Sbjct: 61 VKEFEEFCKAIEE 73
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 32.5 bits (75), Expect = 0.26
Identities = 18/85 (21%), Positives = 38/85 (44%), Gaps = 4/85 (4%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAK----KRKKHKKDKKKKKKKSKYEEE 240
+ + + +E D +E+E+EE ++ D A K +K K K+ K +
Sbjct: 178 AEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKA 237
Query: 241 EEKEEGEQTDSEKEAKKRKKHKKDK 265
+EK+ + K+ K ++ K++
Sbjct: 238 QEKKVEGRLAQHKKYAKLREKLKEE 262
Score = 31.7 bits (73), Expect = 0.45
Identities = 17/91 (18%), Positives = 37/91 (40%), Gaps = 3/91 (3%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
+ P S+ + +++E EE E + E + + +K +K K
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEE--EDENDDSLAADESELPEKVLEKFK-AL 227
Query: 240 EEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
++ ++ + +K + +HKK K R+K
Sbjct: 228 AKQYKKLRKAQEKKVEGRLAQHKKYAKLREK 258
>gnl|CDD|187777 cd09646, Cas7_I-E, CRISPR/Cas system-associated RAMP superfamily
protein Cas7. CRISPR (Clustered Regularly Interspaced
Short Palindromic Repeats) and associated Cas proteins
comprise a system for heritable host defense by
prokaryotic cells against phage and other foreign DNA;
Cas7 is a RAMP superfamily protein; Subunit of the
Cascade complex; also known as Cse4/CasC family.
Length = 325
Score = 32.1 bits (73), Expect = 0.27
Identities = 17/79 (21%), Positives = 20/79 (25%), Gaps = 4/79 (5%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE----EGEQTDSE 252
E K E E AK KK KK K E + E
Sbjct: 68 TELAKRLLELGYDEDIAEPDAAKIAYGLKKASGKKSDKLLLLSAPEAAWVARYAAELADE 127
Query: 253 KEAKKRKKHKKDKKKRKKK 271
+A DK+ K
Sbjct: 128 FDAAAAADAVADKELDKNF 146
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 32.4 bits (73), Expect = 0.29
Identities = 14/53 (26%), Positives = 31/53 (58%)
Query: 219 KKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ R+KH+ + K++ K +EE+E+ E Q E+E K++K ++ + ++
Sbjct: 1010 ECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIH 1062
Score = 31.2 bits (70), Expect = 0.62
Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 6/89 (6%)
Query: 155 LPVDKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDS 214
LP+DK S + R+ L + +EK + D+EE+E+ E Q
Sbjct: 989 LPLDKRSIAVVSRRIE------LPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAK 1042
Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
E+E K++K ++ + +K Y+E +K
Sbjct: 1043 EEEIGKKEKEREQRIRKTIHDNYKEMAKK 1071
Score = 29.3 bits (65), Expect = 2.8
Identities = 17/68 (25%), Positives = 35/68 (51%), Gaps = 5/68 (7%)
Query: 209 GEQTDSEKEAKKR--KKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKH---KK 263
+ + E K R K+ KD+++K++ + +E+E G++ ++ ++ H K+
Sbjct: 1008 PPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKEKEREQRIRKTIHDNYKE 1067
Query: 264 DKKKRKKK 271
KKR KK
Sbjct: 1068 MAKKRLKK 1075
Score = 27.4 bits (60), Expect = 9.6
Identities = 16/71 (22%), Positives = 32/71 (45%)
Query: 192 TSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
A+ D + EEE ++ E+ + E + + ++ + S E EE + ++
Sbjct: 582 KDATLDSIEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDYET 641
Query: 252 EKEAKKRKKHK 262
E+E RKK +
Sbjct: 642 EREENARKKEE 652
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 31.8 bits (72), Expect = 0.30
Identities = 20/78 (25%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE 252
A S + +E K G+Q + + + +KHK+D++KK+++ + +E+++ G S
Sbjct: 148 RALSRKKSDDEHRKRSGKQKEKRRV-EDSQKHKEDRRKKQEEKRRNDEDKRPGGGGGSSG 206
Query: 253 KEAKKRKKHKKDKKKRKK 270
++ K + K+KR+K
Sbjct: 207 GQSGLSTKDEPPKEKRQK 224
Score = 29.8 bits (67), Expect = 1.2
Identities = 22/95 (23%), Positives = 46/95 (48%), Gaps = 4/95 (4%)
Query: 182 KAKEKRKASPTSASSDESDK---EEEEKEEGEQTDSEKEAKKRKKHKKDKK-KKKKKSKY 237
K +++ +A S DE K +++EK E + KE +++K+ +K + + K+
Sbjct: 142 KFEKECRALSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRNDEDKRPGGG 201
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ G T E +KR+KH +++ + +S
Sbjct: 202 GGSSGGQSGLSTKDEPPKEKRQKHHDPERRLEPQS 236
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 30.7 bits (70), Expect = 0.31
Identities = 19/71 (26%), Positives = 37/71 (52%), Gaps = 7/71 (9%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
K+ K E E +++ KKRK+ K ++ + + +E E+ EK +K ++
Sbjct: 65 KKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEK------EKRTRKNRE 118
Query: 261 HKKDKKKRKKK 271
KK K+++K+K
Sbjct: 119 -KKFKRRQKEK 128
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 31.5 bits (72), Expect = 0.34
Identities = 17/43 (39%), Positives = 27/43 (62%), Gaps = 1/43 (2%)
Query: 200 DKEEEEK-EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
++EEEEK +E E+ + AK R + + K+K K+ K E+EE
Sbjct: 123 EREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEE 165
Score = 30.0 bits (68), Expect = 0.96
Identities = 19/69 (27%), Positives = 34/69 (49%), Gaps = 9/69 (13%)
Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKD--KKKKKKKSKYEEEEEKEEGEQTDSEKEA---- 255
E+ E+EE E+ D E+ + + K + + K+K+K ++EE EE + A
Sbjct: 120 EQLEREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRHRAANATALAAI 179
Query: 256 ---KKRKKH 261
KK+K+
Sbjct: 180 GGRKKKKRR 188
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 32.1 bits (73), Expect = 0.34
Identities = 21/96 (21%), Positives = 35/96 (36%), Gaps = 6/96 (6%)
Query: 183 AKEKRKASPTSASSDESDKEE---EEKEEGEQTDSEKEAKKRKKHKK---DKKKKKKKSK 236
+K AS D ++ +E E +E EQ++ K KK K+ + + KK
Sbjct: 12 KAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKP 71
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
E E + K A+ + K +S
Sbjct: 72 TESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTES 107
Score = 31.7 bits (72), Expect = 0.39
Identities = 21/82 (25%), Positives = 31/82 (37%), Gaps = 1/82 (1%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
EL K K +K K + + K + E + EK AK RK + +
Sbjct: 40 ELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKS-AESTRSSHP 98
Query: 234 KSKYEEEEEKEEGEQTDSEKEA 255
KSK E +EE E ++
Sbjct: 99 KSKAPSTESEEEEEPEETPDPI 120
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 31.4 bits (72), Expect = 0.34
Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 13/84 (15%)
Query: 179 SKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE 238
+K K + +R E++ K+ + EK ++ +K K +KK+K + +
Sbjct: 106 AKAKVQAQRA-------------EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152
Query: 239 EEEEKEEGEQTDSEKEAKKRKKHK 262
+ E + SE + K K
Sbjct: 153 KPEPQHTPVSDISELTVGQAVKVK 176
Score = 31.4 bits (72), Expect = 0.35
Identities = 12/57 (21%), Positives = 30/57 (52%)
Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+EAK + + ++ +++ KK+ E+E+ E+ K +K++K + K + +
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTP 160
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 31.8 bits (73), Expect = 0.44
Identities = 17/77 (22%), Positives = 35/77 (45%), Gaps = 5/77 (6%)
Query: 181 LKAKEKRKASPTSASSDESDKE-----EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
+K E+ ASP + E + E+E ++ + ++++K K+ KK KKKK
Sbjct: 355 VKLCEQAAASPEYDTELEVLLQRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKY 414
Query: 236 KYEEEEEKEEGEQTDSE 252
K + + ++ +
Sbjct: 415 KVPRGKIYKVLKEATRQ 431
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 31.4 bits (72), Expect = 0.48
Identities = 15/41 (36%), Positives = 24/41 (58%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEA 218
K L + K S + ++ ++EEEE+EE E+ SE+EA
Sbjct: 281 KDALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEA 321
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 31.2 bits (71), Expect = 0.50
Identities = 20/91 (21%), Positives = 38/91 (41%), Gaps = 20/91 (21%)
Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
K ++KR+A +E D++E+E+EE + E+E E E
Sbjct: 352 SKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEE--------------------EGE 391
Query: 241 EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ +EEG Q+ + ++ D + + K
Sbjct: 392 DSEEEGSQSREDGSSESSSDVGSDSESKADK 422
Score = 29.7 bits (67), Expect = 1.7
Identities = 16/72 (22%), Positives = 29/72 (40%), Gaps = 12/72 (16%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
D + +E+E EE EQ E E +++ + EE + E ++S +
Sbjct: 366 DFEEVDEDEDEEEEQRSDEHE------------EEEGEDSEEEGSQSREDGSSESSSDVG 413
Query: 257 KRKKHKKDKKKR 268
+ K DK+
Sbjct: 414 SDSESKADKESA 425
Score = 27.8 bits (62), Expect = 7.2
Identities = 18/59 (30%), Positives = 29/59 (49%), Gaps = 3/59 (5%)
Query: 197 DESDKEEEE---KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSE 252
DE ++EE+ EE E DSE+E + ++ + S E + +KE +DSE
Sbjct: 373 DEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDSE 431
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 31.4 bits (71), Expect = 0.53
Identities = 21/79 (26%), Positives = 39/79 (49%), Gaps = 10/79 (12%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
KA SP+ AS + ++E E+EE E+ + E+E ++ ++ + +
Sbjct: 419 SDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGE---------- 468
Query: 238 EEEEEKEEGEQTDSEKEAK 256
+EEEE+E SE+E +
Sbjct: 469 DEEEEEEVEADNGSEEEME 487
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 31.5 bits (72), Expect = 0.53
Identities = 9/56 (16%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
Query: 193 SASSDESDKEEEEKE-EGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
+ES K E++++ + + D + ++R ++ ++++ ++ + E E +G
Sbjct: 134 DEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEFYYYQGT 189
Score = 31.1 bits (71), Expect = 0.60
Identities = 8/51 (15%), Positives = 27/51 (52%)
Query: 189 ASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
+S S D+ D ++++ ++ T ++R++ + ++K+ + + Y +
Sbjct: 137 DEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEFYYYQ 187
Score = 29.2 bits (66), Expect = 2.4
Identities = 10/78 (12%), Positives = 30/78 (38%), Gaps = 23/78 (29%)
Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
++ D +EE EE E++ E++E ++ + D +
Sbjct: 122 DLEEDDDDDEESDEEDEESSKS-----------------------EDDEDDDDDDDDDDI 158
Query: 254 EAKKRKKHKKDKKKRKKK 271
++R ++ +++ ++
Sbjct: 159 ATRERSLERRRRRREWEE 176
>gnl|CDD|227666 COG5374, COG5374, Uncharacterized conserved protein [Function
unknown].
Length = 192
Score = 30.5 bits (69), Expect = 0.54
Identities = 13/55 (23%), Positives = 28/55 (50%), Gaps = 2/55 (3%)
Query: 205 EKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
+K E + TD + +K + + +K +++ ++ E EQ +KE+ K+K
Sbjct: 139 DKMEADSTDLKARLRKAQILLEGLQKNQEELFKLLDKYNELREQV--QKESSKKK 191
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 29.6 bits (67), Expect = 0.54
Identities = 14/30 (46%), Positives = 24/30 (80%)
Query: 189 ASPTSASSDESDKEEEEKEEGEQTDSEKEA 218
A+ +A++ E ++EEEE+EE E+ +SE+EA
Sbjct: 67 AAAAAAAAAEEEEEEEEEEEEEEEESEEEA 96
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 31.4 bits (70), Expect = 0.56
Identities = 19/80 (23%), Positives = 42/80 (52%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
L+ + + E+ K + ++ + + ++E ++T+ EK+ +++K K K +
Sbjct: 151 LEQEEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIE 210
Query: 237 YEEEEEKEEGEQTDSEKEAK 256
E+E++K E E+ D KE K
Sbjct: 211 LEQEKQKTENEKQDLIKEQK 230
Score = 30.6 bits (68), Expect = 0.88
Identities = 18/75 (24%), Positives = 40/75 (53%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
E K E+EK++ E+ E + K ++ +K +++K K E+E++K + E ++
Sbjct: 154 EEQKTEQEKQKTEKEGIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQ 213
Query: 258 RKKHKKDKKKRKKKS 272
K+ +++K+ K
Sbjct: 214 EKQKTENEKQDLIKE 228
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 30.9 bits (70), Expect = 0.57
Identities = 16/67 (23%), Positives = 32/67 (47%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
+ K E +E+ + + KE K K+ + ++ +K+ E+ EE E + + + E
Sbjct: 5 CKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENN 64
Query: 257 KRKKHKK 263
K K+ K
Sbjct: 65 KLKEENK 71
Score = 28.6 bits (64), Expect = 3.0
Identities = 13/51 (25%), Positives = 26/51 (50%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
++ KEE+ + E + + E + K ++ K + +K +EE +K E E
Sbjct: 26 EDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDENNKLKEENKKLENE 76
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 31.1 bits (71), Expect = 0.58
Identities = 19/57 (33%), Positives = 27/57 (47%), Gaps = 19/57 (33%)
Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
EK K KK +++KK +K+K K KKRKK K +KK+ +K
Sbjct: 377 EKYPKPPKKKREEKKPQKRKKK-------------------KKRKKKGKKRKKKGRK 414
Score = 31.1 bits (71), Expect = 0.69
Identities = 13/46 (28%), Positives = 29/46 (63%)
Query: 227 DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+K+ ++ K KY + +K+ E+ +++ KK++K K K+K+K +
Sbjct: 369 NKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 28.4 bits (64), Expect = 3.9
Identities = 14/42 (33%), Positives = 27/42 (64%)
Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
EE KE+ + +K +K+ + +K KKK+KKK K +++ ++
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKRKKKGRK 414
Score = 28.4 bits (64), Expect = 4.0
Identities = 17/52 (32%), Positives = 25/52 (48%), Gaps = 15/52 (28%)
Query: 220 KRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
K K K KKK+++K +K KK+K+ KK KK++KK
Sbjct: 376 KEKYPKPPKKKREEKKP---------------QKRKKKKKRKKKGKKRKKKG 412
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 31.3 bits (71), Expect = 0.60
Identities = 16/69 (23%), Positives = 28/69 (40%), Gaps = 2/69 (2%)
Query: 204 EEKEEGEQTDSEKEAKKRKKHKKDKKKK--KKKSKYEEEEEKEEGEQTDSEKEAKKRKKH 261
E + ++AKK KK K++ KK +E + ++ + K +K
Sbjct: 728 ERAPRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKA 787
Query: 262 KKDKKKRKK 270
KK K +K
Sbjct: 788 KKPSAKTQK 796
Score = 29.3 bits (66), Expect = 2.2
Identities = 15/75 (20%), Positives = 33/75 (44%), Gaps = 7/75 (9%)
Query: 205 EKEEGEQTDSEKEAKKRKKHKKDKK-------KKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
+E+ ++ D+ K+ KR++ K + +KK+K + ++ + S K K
Sbjct: 738 AREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKI 797
Query: 258 RKKHKKDKKKRKKKS 272
K + +KK +
Sbjct: 798 AAATKAKRAAKKKVA 812
Score = 29.3 bits (66), Expect = 2.6
Identities = 13/62 (20%), Positives = 24/62 (38%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
+ + +K P SA E + + ++ + + AK +K K K+ K K
Sbjct: 752 GKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIAAATKAKRAAKKKV 811
Query: 238 EE 239
E
Sbjct: 812 AE 813
Score = 28.2 bits (63), Expect = 5.4
Identities = 15/74 (20%), Positives = 32/74 (43%), Gaps = 7/74 (9%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKH-------KKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
+E+ +K + + ++ +K + + +KK K K +K + + K+ +T
Sbjct: 739 REKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPSAKTQKIA 798
Query: 254 EAKKRKKHKKDKKK 267
A K K+ K K
Sbjct: 799 AATKAKRAAKKKVA 812
Score = 27.8 bits (62), Expect = 7.0
Identities = 18/89 (20%), Positives = 26/89 (29%), Gaps = 20/89 (22%)
Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
K K+ + ++ E EK+AK + K +K KK
Sbjct: 741 KAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPKAAKKDARKAKKPS--------- 791
Query: 244 EEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
AK +K K KR K
Sbjct: 792 -----------AKTQKIAAATKAKRAAKK 809
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 31.2 bits (71), Expect = 0.62
Identities = 19/91 (20%), Positives = 34/91 (37%), Gaps = 5/91 (5%)
Query: 183 AKEKRKASPTSASSDESDKEEEE-KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
E K+ S ++ K+ ++ K D + K + K E+ E
Sbjct: 352 KIELDKSKTPSENAQRYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTAL----EKAE 407
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K+ E+ E + K KK K+K+K+
Sbjct: 408 GKKAIEEIREELIEEGLLKSKKKKRKKKEWF 438
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 31.1 bits (71), Expect = 0.62
Identities = 7/36 (19%), Positives = 25/36 (69%)
Query: 212 TDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
+++K+K+ KK ++K+ K+ + +++++++E +
Sbjct: 175 PGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDD 210
>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6. In yeast, 15 Apg proteins
coordinate the formation of autophagosomes. Autophagy is
a bulk degradation process induced by starvation in
eukaryotic cells. Apg6/Vps30p has two distinct functions
in the autophagic process, either associated with the
membrane or in a retrieval step of the carboxypeptidase
Y sorting pathway.
Length = 356
Score = 31.0 bits (70), Expect = 0.63
Identities = 21/96 (21%), Positives = 44/96 (45%), Gaps = 6/96 (6%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHK-----KDKKKK 231
L S+L+ EK + + + + + E E DSE + K+++ + ++ +K+
Sbjct: 34 LDSELRDAEKERDT-YKQYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKE 92
Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
E E +EE EQ ++E+ R+ + D+
Sbjct: 93 DDDLDGELVELQEEKEQLENEELQYLREYNLFDRNN 128
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 30.7 bits (70), Expect = 0.64
Identities = 26/56 (46%), Positives = 37/56 (66%)
Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
K+A KRK+ +K K KK+ K + E+ E+K+ Q E+ KKRK KK+KKK+K K
Sbjct: 145 KKALKRKEKQKKKSKKEWKERKEKVEKKKAERQKKREENLKKRKDDKKNKKKKKAK 200
Score = 30.0 bits (68), Expect = 1.1
Identities = 30/67 (44%), Positives = 39/67 (58%), Gaps = 1/67 (1%)
Query: 170 VKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK 229
VK +E +LK LK KEK+K + +K E++K E Q E+ KKRK KK+KK
Sbjct: 137 VKDDEKLLKKALKRKEKQKKKSKKEWKERKEKVEKKKAE-RQKKREENLKKRKDDKKNKK 195
Query: 230 KKKKKSK 236
KKK K K
Sbjct: 196 KKKAKKK 202
Score = 28.0 bits (63), Expect = 4.9
Identities = 16/51 (31%), Positives = 32/51 (62%)
Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
E+ +KR++ K KK+K+K++K +E+ +K E E+ +E+ K+K +
Sbjct: 9 EQRRRKREQRKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIEN 59
>gnl|CDD|218311 pfam04888, SseC, Secretion system effector C (SseC) like family.
SseC is a secreted protein that forms a complex together
with SecB and SecD on the surface of Salmonella. All
these proteins are secreted by the type III secretion
system. Many mucosal pathogens use type III secretion
systems for the injection of effector proteins into
target cells. SecB, SseC and SecD are inserted into the
target cell membrane. where they form a small pore or
translocon. In addition to SseC, this family includes
the bacterial secreted proteins PopB, PepB, YopB and
EspD which are thought to be directly involved in pore
formation, and type III secretion system translocon.
Length = 303
Score = 30.9 bits (70), Expect = 0.64
Identities = 14/52 (26%), Positives = 27/52 (51%), Gaps = 8/52 (15%)
Query: 206 KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
K +Q S+ + +R + K++KK ++ + + ++ EK E EAKK
Sbjct: 11 KLAEKQAKSKLQQLERARDKQEKKAEEYQEQIKKAIEKAE--------EAKK 54
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 30.7 bits (70), Expect = 0.66
Identities = 26/56 (46%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 217 EAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK-RKKHKKDKKKRKKK 271
EAK RKK + K+ +K K K E E E+ + + K+ KK KK KK KKKRKK
Sbjct: 115 EAKARKKMRALKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRKKT 170
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 30.8 bits (70), Expect = 0.66
Identities = 23/96 (23%), Positives = 46/96 (47%), Gaps = 14/96 (14%)
Query: 177 LKSKLKAKEKRKASPTSA---SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
L+ K+KA + + SD + + K + E+ + + +A+ ++K K+ KK++KK
Sbjct: 282 LQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKK 341
Query: 234 KSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
K + E EE+ +K + DK++ K
Sbjct: 342 KKQIERLEER-----------IEKLEVQATDKEENK 366
Score = 28.1 bits (63), Expect = 5.1
Identities = 15/70 (21%), Positives = 37/70 (52%), Gaps = 5/70 (7%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
K + ++ + E + ++K K ++ +K E +E+K+E +KE KK+K+
Sbjct: 290 KYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKE-----KKKEEKKKKQ 344
Query: 261 HKKDKKKRKK 270
++ +++ +K
Sbjct: 345 IERLEERIEK 354
Score = 27.7 bits (62), Expect = 8.2
Identities = 13/58 (22%), Positives = 24/58 (41%)
Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
EK +K K K K+ KK E + + + ++ + +KK++KK
Sbjct: 280 EKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKK 337
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid layers.
The innermost capsid (core) is made of VP2. The genomic
RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 31.1 bits (70), Expect = 0.67
Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 1/78 (1%)
Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
A + + + ++ D EK+ +K + K+K KK+ + + EQ+ E
Sbjct: 2 AYRNRREANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQE- 60
Query: 254 EAKKRKKHKKDKKKRKKK 271
K + KK K+ K+
Sbjct: 61 NLKIADEVKKSTKEESKQ 78
Score = 27.6 bits (61), Expect = 9.7
Identities = 19/84 (22%), Positives = 37/84 (44%), Gaps = 13/84 (15%)
Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKK-----DKKKKKKKSKYEEEEEKEEGEQ 248
A+ + +D+ +E+ +E + + E K++ KK D K + +E K E
Sbjct: 9 ANINNNDRMQEKDDEKQDQKNRMELKEKVLDKKEEVVTDNVDSPVKEQSSQENLKIADEV 68
Query: 249 TDSEKE--------AKKRKKHKKD 264
S KE K +++H+K+
Sbjct: 69 KKSTKEESKQLLEVLKTKEEHQKE 92
>gnl|CDD|222440 pfam13897, GOLD_2, Golgi-dynamics membrane-trafficking. Sec14-like
Golgi-trafficking domain The GOLD domain is always found
combined with lipid- or membrane-association domains.
Length = 136
Score = 30.0 bits (68), Expect = 0.69
Identities = 14/31 (45%), Positives = 20/31 (64%)
Query: 193 SASSDESDKEEEEKEEGEQTDSEKEAKKRKK 223
S SSDE ++EE E+EE E D E +K + +
Sbjct: 52 SESSDEEEEEEAEEEEAETGDVEAGSKSQSR 82
>gnl|CDD|240061 cd04710, BAH_fungalPHD, BAH, or Bromo Adjacent Homology domain, as
present in fungal proteins containing PHD domains. BAH
domains are found in a variety of proteins playing roles
in transcriptional silencing and the remodeling of
chromatin. It is assumed that in most or all of these
instances the BAH domain mediates protein-protein
interactions.
Length = 135
Score = 29.6 bits (67), Expect = 0.72
Identities = 15/42 (35%), Positives = 23/42 (54%), Gaps = 10/42 (23%)
Query: 5 RPRCYFDVEISDIPLG--RIVF-ELYSDICPITCENFRALCT 43
RPR +IS + R+++ ++SDICPI + R CT
Sbjct: 66 RPR-----DISRRVVADSRLLYASMHSDICPI--GSVRGKCT 100
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 31.2 bits (70), Expect = 0.77
Identities = 20/91 (21%), Positives = 42/91 (46%), Gaps = 4/91 (4%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR--KKHKKDKKKKKKKS 235
+ K A +ASPT+ S + DS+ E+ K+ KK K++ K +
Sbjct: 21 RKKQTASPDGRASPTNEDQRSSGRNSPSAASTSSNDSKAESTKKPNKKIKEEATSPLKST 80
Query: 236 KYEEEEEKEEGEQTD--SEKEAKKRKKHKKD 264
K + E+ + E+ + + K++K ++ + +
Sbjct: 81 KRQREKPASDTEEPERVTAKKSKTQELSRPN 111
Score = 30.4 bits (68), Expect = 1.2
Identities = 17/50 (34%), Positives = 33/50 (66%), Gaps = 1/50 (2%)
Query: 214 SEKEAKKRKK-HKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
S K AKKR++ +K K++ ++K++ E E EKE+ ++ + E+E + + K
Sbjct: 574 SSKLAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAAK 623
Score = 29.3 bits (65), Expect = 2.7
Identities = 18/63 (28%), Positives = 37/63 (58%), Gaps = 3/63 (4%)
Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
ASS + K EE E+ ++ E E K R++ +++K+K+K++ + E E + + + S
Sbjct: 573 ASSKLAKKREEAVEKAKR---EAEQKAREEREREKEKEKEREREREREAERAAKASSSSH 629
Query: 254 EAK 256
E++
Sbjct: 630 ESR 632
>gnl|CDD|217838 pfam04004, Leo1, Leo1-like protein. Members of this family are
part of the Paf1/RNA polymerase II complex. The Paf1
complex probably functions during the elongation phase
of transcription. The Leo1 subunit of the yeast
Paf1-complex binds RNA and contributes to complex
recruitment. The subunit acts by co-ordinating
co-transcriptional chromain modifications and helping
recruitment of mRNA 3prime-end processing factors.
Length = 312
Score = 30.6 bits (69), Expect = 0.77
Identities = 19/81 (23%), Positives = 34/81 (41%), Gaps = 14/81 (17%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE----------EKEEGE 247
E +K E EK+E ++ A++R+++++ K K E E
Sbjct: 236 EHEKREREKKEEQKL----RARRRRQNREKMKNKPPNRPGHGSGSDSNVAKAATTYSEDE 291
Query: 248 QTDSEKEAKKRKKHKKDKKKR 268
S+ E +K K+ KK+ R
Sbjct: 292 DEGSDCETRKAKEVKKEGAGR 312
>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
Length = 925
Score = 30.8 bits (70), Expect = 0.77
Identities = 19/121 (15%), Positives = 40/121 (33%), Gaps = 17/121 (14%)
Query: 151 HIENLPVDKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGE 210
++ L + + ++ + NE K + K + E + EEE +
Sbjct: 818 RLKALRIQREEMLMRPEELELINEEQKNLKQEIKLE---------LSEIQEAEEEIQN-- 866
Query: 211 QTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
+E + + + KK K + + ++ + K KK KK +K
Sbjct: 867 --INENKNEFVEFKNDPKKLNKLIIAKDVLIKLVISS----DEIKQDEKTTKKKKKDLEK 920
Query: 271 K 271
Sbjct: 921 T 921
>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
Phosphatase-Interacting Proteins. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Vetebrates contain two
Proline-Serine-Threonine Phosphatase-Interacting
Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
mainly expressed in hematopoietic cells and are involved
in the regulation of cell adhesion and motility.
Mutations in PSTPIPs have been shown to cause
autoinflammatory disorders. PSTPIP1 contains an
N-terminal F-BAR domain, PEST motifs, and a C-terminal
SH3 domain, while PSTPIP2 contains only the N-terminal
F-BAR domain. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules.
Length = 239
Score = 30.5 bits (69), Expect = 0.79
Identities = 18/81 (22%), Positives = 39/81 (48%), Gaps = 4/81 (4%)
Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKD----KKKKKKKSKYEEEEEKEEGEQTDS 251
+E++K EE +E+ ++ + E ++ K KK K K YE++ +++ +
Sbjct: 84 REEAEKLEEFREKQKEERKKTEDIMKRSQKNKKELYKKTMKAKKSYEQKCREKDKAEQAY 143
Query: 252 EKEAKKRKKHKKDKKKRKKKS 272
EK + + + +K K+K
Sbjct: 144 EKSSSGAQPKEAEKLKKKAAQ 164
>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
Toc86/159, G and M domains. The long precursor of the
86K protein originally described is proposed to have
three domains. The N-terminal A-domain is acidic,
repetitive, weakly conserved, readily removed by
proteolysis during chloroplast isolation, and not
required for protein translocation. The other domains
are designated G (GTPase) and M (membrane anchor); this
family includes most of the G domain and all of M
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 763
Score = 31.1 bits (70), Expect = 0.80
Identities = 20/83 (24%), Positives = 36/83 (43%), Gaps = 10/83 (12%)
Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK-------KKKKKSKYEEEEEKEEGEQ 248
++SD E E+ + ++ E E + K K K+++K+ EE + + + Q
Sbjct: 385 EEDSDIELEDSSDSDEESGEDEYDQLPPFKPLTKAQMAKLSKEQRKAYLEEYDYRVKLLQ 444
Query: 249 TDSEKEAKKRKKHKKDKKKRKKK 271
+E KR K KK K+
Sbjct: 445 KKQWREELKR---MKMMKKFGKE 464
Score = 29.2 bits (65), Expect = 2.7
Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 5/68 (7%)
Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK-----YE 238
K KA S ++ EE + + +K+ ++ K K KK K+ Y
Sbjct: 414 KPLTKAQMAKLSKEQRKAYLEEYDYRVKLLQKKQWREELKRMKMMKKFGKEIGELPDGYS 473
Query: 239 EEEEKEEG 246
EE ++E G
Sbjct: 474 EEVDEENG 481
>gnl|CDD|203444 pfam06424, PRP1_N, PRP1 splicing factor, N-terminal. This domain
is specific to the N-terminal part of the prp1 splicing
factor, which is involved in mRNA splicing (and possibly
also poly(A)+ RNA nuclear export and cell cycle
progression). This domain is specific to the N terminus
of the RNA splicing factor encoded by prp1. It is
involved in mRNA splicing and possibly also poly(A)and
RNA nuclear export and cell cycle progression.
Length = 131
Score = 29.6 bits (67), Expect = 0.81
Identities = 16/49 (32%), Positives = 31/49 (63%), Gaps = 1/49 (2%)
Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
S + D E+EE + ++ E+ ++RKK ++++K+K++ KY EE K
Sbjct: 52 SDGKYDDEDEEADRIYESIDERMDERRKK-RREQKEKEEIEKYREENPK 99
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 28.9 bits (65), Expect = 0.85
Identities = 16/61 (26%), Positives = 25/61 (40%)
Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
+ + K K K K DKK+ K K +E + K+ G + + K KK +K
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61
Query: 270 K 270
Sbjct: 62 D 62
Score = 28.2 bits (63), Expect = 1.7
Identities = 14/45 (31%), Positives = 20/45 (44%)
Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
K ++ QT + ++ K KKDKK KK K+E K
Sbjct: 31 EGKAKKNGAPVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNTCK 75
Score = 27.4 bits (61), Expect = 3.2
Identities = 13/63 (20%), Positives = 24/63 (38%)
Query: 205 EKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKD 264
K+ +T+ +++ K K + D K K K+ K ++ K + K K
Sbjct: 2 NKKNKTKTECKEKGCKWDKKEDDGKCKPKEGKAKKNGAPVTQTAGTETTTEKCKGKKDKK 61
Query: 265 KKK 267
K
Sbjct: 62 DCK 64
>gnl|CDD|226341 COG3820, COG3820, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 230
Score = 30.3 bits (68), Expect = 0.87
Identities = 16/84 (19%), Positives = 36/84 (42%), Gaps = 4/84 (4%)
Query: 162 RPLQDVRVVKCNELVLKSKL-KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKK 220
P+ V + C+++ L ++ +A R + + E E+ + E+ + E+E
Sbjct: 150 TPMDPVTLGLCSQIDLDKEVERASRGRPTAKELGPTLLPASETEQLDYEEEREEERELDA 209
Query: 221 RKKHKKDKKKKKKKSKYEEEEEKE 244
K KS ++EE+++
Sbjct: 210 DAVF---AKLSSLKSGNKDEEDED 230
>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
This family represents Cwf15/Cwc15 (from
Schizosaccharomyces pombe and Saccharomyces cerevisiae
respectively) and their homologues. The function of
these proteins is unknown, but they form part of the
spliceosome and are thus thought to be involved in mRNA
splicing.
Length = 241
Score = 30.1 bits (68), Expect = 0.89
Identities = 14/58 (24%), Positives = 35/58 (60%)
Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
+S S SSD+ +++ ++E E E K+++ ++ ++++++K+ EE+ +EE
Sbjct: 123 DSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180
Score = 29.7 bits (67), Expect = 1.3
Identities = 17/104 (16%), Positives = 44/104 (42%), Gaps = 9/104 (8%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDK---KKKKK 233
+++ K K++ +D+S + E E D E E K+++ + +
Sbjct: 64 EEAERAHKSKKENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSD 123
Query: 234 KSKYEEEEEKEEGEQTDSEKEA------KKRKKHKKDKKKRKKK 271
S + + + + ++ A +K KK + ++K+R+++
Sbjct: 124 SSSDSDSSDDDSDDDDSEDETAALLRELEKIKKERAEEKEREEE 167
Score = 28.5 bits (64), Expect = 2.8
Identities = 15/65 (23%), Positives = 34/65 (52%), Gaps = 6/65 (9%)
Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
S SD + + + + ++ A ++ +K KK++ EE+E+EE E+ E++
Sbjct: 122 SDSSSDSDSSDDDSDDDDSEDETAALLRELEKIKKER------AEEKEREEEEKAAEEEK 175
Query: 255 AKKRK 259
A++ +
Sbjct: 176 AREEE 180
>gnl|CDD|227441 COG5110, RPN1, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 881
Score = 30.7 bits (69), Expect = 0.94
Identities = 16/53 (30%), Positives = 29/53 (54%), Gaps = 5/53 (9%)
Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
DESDK+++ +E Q EK+ +K KKK+++ + EE+ +G+
Sbjct: 1 MVDESDKKQQTIDEQSQISPEKQTPNKK-----DKKKEEEEQLSEEDAMLKGD 48
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 29.9 bits (68), Expect = 0.95
Identities = 19/75 (25%), Positives = 38/75 (50%), Gaps = 10/75 (13%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE------ 254
+ E+ K E EQ E + + +K + +K+ + ++K E E++EE E+ EK
Sbjct: 117 QAEQGKSELEQ---EIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIA 173
Query: 255 -AKKRKKHKKDKKKR 268
KK+ + K + ++
Sbjct: 174 FLKKQNQQLKSQLEQ 188
Score = 27.6 bits (62), Expect = 5.2
Identities = 17/63 (26%), Positives = 33/63 (52%), Gaps = 2/63 (3%)
Query: 208 EGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
+ EQ SE E + +K +++K++ +K E E + E E+ + E+ + K+H +
Sbjct: 117 QAEQGKSELEQEIKKL--EEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAF 174
Query: 268 RKK 270
KK
Sbjct: 175 LKK 177
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 30.4 bits (68), Expect = 0.96
Identities = 21/79 (26%), Positives = 34/79 (43%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
L K K+ + DE ++EEEE+EE E ++E+ + + S
Sbjct: 118 LPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVDHGNGSSG 177
Query: 237 YEEEEEKEEGEQTDSEKEA 255
+ EE EE T++E E
Sbjct: 178 GDNGEEGEEESVTEAEAEG 196
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 28.1 bits (63), Expect = 0.98
Identities = 16/37 (43%), Positives = 23/37 (62%)
Query: 226 KDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
K K KKK EEE++EE E+ +E K+R+K+K
Sbjct: 19 LAKLKAKKKELKAEEEKREEEEEARKREERKEREKNK 55
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 30.2 bits (68), Expect = 1.00
Identities = 21/97 (21%), Positives = 37/97 (38%), Gaps = 24/97 (24%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ--------- 248
E KE+E + + + + K K + D + +K+K + + KE E+
Sbjct: 110 EQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKREQKARAAKERKERLVAEAREHF 169
Query: 249 ---------------TDSEKEAKKRKKHKKDKKKRKK 270
EKE KK+ K K ++K +K
Sbjct: 170 GYWVDPRDPRFQEMLQQKEKEEKKKVKEAKRREKEEK 206
Score = 27.5 bits (61), Expect = 6.0
Identities = 16/73 (21%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK-EEGEQT--DSEKEAKKR 258
EE E+EE E S + + + ++ +K+ +++++ E + + Q D + +KR
Sbjct: 88 EELEEEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQKRKR 147
Query: 259 KKHKKDKKKRKKK 271
++ + K+RK++
Sbjct: 148 EQKARAAKERKER 160
Score = 27.1 bits (60), Expect = 8.3
Identities = 13/72 (18%), Positives = 33/72 (45%), Gaps = 1/72 (1%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEG-EQTDSEKEAKKRK 259
E+ E+ E E+ + + + ++++K+K+ + E E + + + +
Sbjct: 84 AEKLEELEEEEREWYPSLNQMLEENREQQKEKEARRQAREAEIAKNMAKMPQMIADWRAQ 143
Query: 260 KHKKDKKKRKKK 271
K K+++K R K
Sbjct: 144 KRKREQKARAAK 155
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 30.3 bits (68), Expect = 1.0
Identities = 14/67 (20%), Positives = 31/67 (46%)
Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
D EEEE+ E E+E ++ + ++ + EEE E + + + E ++
Sbjct: 175 DNEEEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERID 234
Query: 260 KHKKDKK 266
K + +++
Sbjct: 235 KKQGEEE 241
Score = 28.4 bits (63), Expect = 3.7
Identities = 16/72 (22%), Positives = 34/72 (47%), Gaps = 4/72 (5%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
E ++EE E + + + E ++ K + ++++ +++ E E EE +S E
Sbjct: 210 EEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEIEWEE----ESPSEEVP 265
Query: 258 RKKHKKDKKKRK 269
R + KK+K
Sbjct: 266 RNNEESPAKKQK 277
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 29.7 bits (66), Expect = 1.1
Identities = 15/83 (18%), Positives = 48/83 (57%)
Query: 189 ASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
A+ ++ +D EE + E+ +E +++++ ++ +++++ + + EE + + E+
Sbjct: 9 AALGKPTAGTTDAEEATRLLAEKRRQAREQREQEEQERREQEEQDRLEREELKRRAAEER 68
Query: 249 TDSEKEAKKRKKHKKDKKKRKKK 271
E+EA+++++ + +K+ K K
Sbjct: 69 LRREEEARRQEEERAREKEEKAK 91
Score = 28.5 bits (63), Expect = 2.4
Identities = 17/71 (23%), Positives = 46/71 (64%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+E+E +E+ EQ E+E KR+ ++ +++++ + EEE +E+ E+ + E +++++
Sbjct: 42 EEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAEEEEKQE 101
Query: 261 HKKDKKKRKKK 271
++ ++ +K+K
Sbjct: 102 QEEQERIQKQK 112
>gnl|CDD|218889 pfam06088, TLP-20, Nucleopolyhedrovirus telokin-like protein-20
(TLP20). This family consists of several
Nucleopolyhedrovirus telokin-like protein-20 (TLP20)
sequences. The function of this family is unknown but
TLP20 is known to shares some antigenic similarities to
the smooth muscle protein telokin although the amino
acid sequence shows no homologies to telokin.
Length = 162
Score = 29.6 bits (67), Expect = 1.1
Identities = 14/65 (21%), Positives = 27/65 (41%), Gaps = 5/65 (7%)
Query: 169 VVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEK-----EAKKRKK 223
V+K + K +K ++P S E + +E+E++ ++ E KK
Sbjct: 94 VLKKGGCIFKIVYWNSKKFVSAPVPHHSSEDENDEDEEDNADRAGIESGIDDSAPPSPKK 153
Query: 224 HKKDK 228
K D+
Sbjct: 154 QKLDE 158
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 30.2 bits (68), Expect = 1.2
Identities = 20/54 (37%), Positives = 27/54 (50%), Gaps = 4/54 (7%)
Query: 214 SEKEAKKRKKHKKDKKKKKKKSK----YEEEEEKEEGEQTDSEKEAKKRKKHKK 263
S K+ KK+K +K KKK ++ SK E EE EG + E + K KK
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQIHALENIEEGIEGSNNEKSSETPQLKLEKK 736
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 30.0 bits (67), Expect = 1.2
Identities = 18/90 (20%), Positives = 49/90 (54%), Gaps = 2/90 (2%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
+ KE + S + S+ S + +E+ +T E+E ++ ++ +++ ++ + +K E++
Sbjct: 80 RQKEFKPTSTDQSLSEPSRRMQEDSGAENETVEEEEKEESREEREEVEETEGVTKSEQKN 139
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ + E + +KE K+ + +++K KR
Sbjct: 140 DWRDAE--ECQKEEKEPEPEEEEKPKRGSL 167
Score = 29.6 bits (66), Expect = 2.1
Identities = 19/83 (22%), Positives = 44/83 (53%), Gaps = 4/83 (4%)
Query: 184 KEKRKASPTSASSDESDKEEEEK----EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
K K+K + +E K+ EE+ EE EQ ++EA ++ + +++K++ K++ +
Sbjct: 204 KLKQKQQEAALELEELKKKREERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRR 263
Query: 240 EEEKEEGEQTDSEKEAKKRKKHK 262
E E+ ++ + ++ +K K
Sbjct: 264 AEAAEKRQKVPEDGLSEDKKPFK 286
Score = 28.5 bits (63), Expect = 4.5
Identities = 19/86 (22%), Positives = 45/86 (52%)
Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
K K + S E + E KE + ++EA + K K+++++K EEE+ +
Sbjct: 178 KLKHTENTFSRGGAEGAQVEAGKEFEKLKQKQQEAALELEELKKKREERRKVLEEEEQRR 237
Query: 244 EEGEQTDSEKEAKKRKKHKKDKKKRK 269
++ E +E +++++ K++ ++R+
Sbjct: 238 KQEEADRKSREEEEKRRLKEEIERRR 263
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 29.9 bits (68), Expect = 1.2
Identities = 14/72 (19%), Positives = 44/72 (61%), Gaps = 1/72 (1%)
Query: 201 KEEEEKEEGEQTDSEKEAKK-RKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
+EE+E E E+ + +K+ ++ + +++ ++K++ K E EE+ + + EK ++ +
Sbjct: 109 QEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEE 168
Query: 260 KHKKDKKKRKKK 271
+ + ++++++K
Sbjct: 169 REAERRERKEEK 180
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain. This
presumed domain is found at the N-terminus of Ribosomal
L30 proteins and has been termed RL30NT or NUC018.
Length = 71
Score = 28.0 bits (63), Expect = 1.2
Identities = 18/49 (36%), Positives = 26/49 (53%), Gaps = 5/49 (10%)
Query: 215 EKEAKKRKKHKKDKKKKKKK-----SKYEEEEEKEEGEQTDSEKEAKKR 258
K AKKR K +KKK+K KY +E E ++ ++EAKK+
Sbjct: 14 AKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIRLKREAKKK 62
>gnl|CDD|236012 PRK07406, PRK07406, RNA polymerase sigma factor RpoD; Validated.
Length = 373
Score = 30.1 bits (68), Expect = 1.3
Identities = 13/53 (24%), Positives = 23/53 (43%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 230
L +E R+ + T + D+E + E ++ E +AK + K KK
Sbjct: 6 DELLLLEEDRQPAATDDLDSDEDEESSSENADELSEKEAKAKALARIKIGPKK 58
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 30.2 bits (68), Expect = 1.3
Identities = 23/92 (25%), Positives = 34/92 (36%), Gaps = 9/92 (9%)
Query: 171 KCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTD-----SEKEAKKRK--- 222
K ++L K K K+K+K D+ D + K +Q S K K
Sbjct: 65 KTDDLKDSKKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEK 124
Query: 223 -KHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
K K K K K + K+E + D+ K
Sbjct: 125 LKKKITVNKSTNKKKKKVLSSKDELIKYDNNK 156
Score = 29.8 bits (67), Expect = 1.6
Identities = 16/75 (21%), Positives = 35/75 (46%), Gaps = 4/75 (5%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
+K +++ ++ +TD K++KK K K+KKK K K +++ + + + +
Sbjct: 52 LHEKLDKKNKKFNKTDDLKDSKK----TKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFA 107
Query: 258 RKKHKKDKKKRKKKS 272
+K K K+
Sbjct: 108 GPLAISLMRKPKPKT 122
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 29.4 bits (66), Expect = 1.3
Identities = 18/91 (19%), Positives = 41/91 (45%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
K E K +E +++EEE EE E + E+E + ++ +++ ++ K E++
Sbjct: 38 KENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEKK 97
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ + + A+ K ++ KK+
Sbjct: 98 NINDIFNSTQDDNAQNLISKNYKKNEKSKKT 128
Score = 26.7 bits (59), Expect = 9.9
Identities = 14/63 (22%), Positives = 34/63 (53%)
Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
E++ ++EK+E ++ + E++ ++ ++ + + +++ EEEEE EE + E
Sbjct: 37 IKENEDVKDEKQEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEEEEEDEEDNVDLKDIEK 96
Query: 256 KKR 258
K
Sbjct: 97 KNI 99
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 30.0 bits (68), Expect = 1.4
Identities = 21/75 (28%), Positives = 42/75 (56%)
Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
+E +K E+E +E E+ E E +++ + K+K + K E EE+ E + + E+
Sbjct: 220 REELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIRELEERIEELKKEIEELE 279
Query: 256 KKRKKHKKDKKKRKK 270
+K K+ K+ K+K ++
Sbjct: 280 EKVKELKELKEKAEE 294
>gnl|CDD|221466 pfam12220, U1snRNP70_N, U1 small nuclear ribonucleoprotein of 70kDa
MW N terminal. This domain is found in eukaryotes. This
domain is about 90 amino acids in length. This domain is
found associated with pfam00076. This domain is part of
U1 snRNP, which is the pre-mRNA binding protein of the
penta-snRNP spliceosome complex. It extends over a
distance of 180 A from its RNA binding domain, wraps
around the core domain of U1 snRNP consisting of the
seven Sm proteins and finally contacts U1-C, which is
crucial for 5'-splice-site recognition.
Length = 94
Score = 28.4 bits (64), Expect = 1.4
Identities = 20/70 (28%), Positives = 38/70 (54%), Gaps = 9/70 (12%)
Query: 177 LKSKLKAKEKRKASPTSASSD--ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
L KA EKRK +P + + K+ +++ E T++ E K++++K++KK
Sbjct: 22 LPPLDKAPEKRKTNPITGVAQYLSEFKDYKDEPPPEPTETWLE-------KREREKREKK 74
Query: 235 SKYEEEEEKE 244
K E++ E+E
Sbjct: 75 EKLEKKLEEE 84
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 30.2 bits (69), Expect = 1.4
Identities = 16/59 (27%), Positives = 33/59 (55%), Gaps = 2/59 (3%)
Query: 197 DESDKEEEEKEEGEQTDSE--KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
D ++ E+ + + E+ D + KEA + K + +KKK+ K+ +E + G++ D E+
Sbjct: 167 DSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGPVQIGKKIDKEE 225
Score = 27.5 bits (62), Expect = 9.3
Identities = 13/49 (26%), Positives = 26/49 (53%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
D+S +E E+ E ++ + EK AK+ + K + +KKK + ++
Sbjct: 166 DDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPKEGP 214
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 30.0 bits (68), Expect = 1.4
Identities = 15/78 (19%), Positives = 39/78 (50%), Gaps = 3/78 (3%)
Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKK---KKKSKYEEEEE 242
++ T+ + + E EK + + ++R++ + D+K++ + K+ EE+
Sbjct: 644 RQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQRRRNDEKRQAQQEAKALNVEEQS 703
Query: 243 KEEGEQTDSEKEAKKRKK 260
+E EQ + ++ + R+K
Sbjct: 704 VQETEQEERVQQVQPRRK 721
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 28.9 bits (65), Expect = 1.5
Identities = 15/66 (22%), Positives = 30/66 (45%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
++ D E E+T+ +KE ++ KK +K K+ +EE + + + +E
Sbjct: 32 EKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTV 91
Query: 257 KRKKHK 262
K K +
Sbjct: 92 KATKKQ 97
Score = 27.7 bits (62), Expect = 4.3
Identities = 10/48 (20%), Positives = 24/48 (50%), Gaps = 2/48 (4%)
Query: 227 DKKKKKKKSKYEEEEEKEEGE--QTDSEKEAKKRKKHKKDKKKRKKKS 272
+KK + + Y +EE + + E + +K+ +K +K +++ K
Sbjct: 32 EKKDIEINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQ 79
>gnl|CDD|72638 pfam09222, Fim-adh_lectin, Fimbrial adhesin F17-AG, lectin
domain. Members of this family are
carbohydrate-specific lectin domains found in bacterial
fimbrial adhesins. They adopt a compact, elongated
structure consisting of a beta-sandwich with two major
sheets: one consisting of five long strands in mixed
orientations, and a front sheet with four antiparallel
strands, forming an immunoglobin-like fold.
Length = 170
Score = 29.3 bits (65), Expect = 1.5
Identities = 17/46 (36%), Positives = 21/46 (45%), Gaps = 3/46 (6%)
Query: 52 TGKNLHYLNIVFHRVVKGFMI---QSGDFSVGNGTGGESIYGGTFE 94
TG N+ Y N R+ GF + D V G+SIYG T E
Sbjct: 33 TGYNIGYQNANVWRISGGFCVGLDGKVDLPVVGSLDGQSIYGLTEE 78
>gnl|CDD|238558 cd01138, FeuA, Periplasmic binding protein FeuA. These proteins
have predicted to function as initial receptors in ABC
transport of metal ions in some eubacterial species.
They belong to the TroA superfamily of periplasmic metal
binding proteins that share a distinct fold and ligand
binding mechanism. A typical TroA protein is comprised
of two globular subdomains connected by a single helix
and can bind their ligands in the cleft between these
domains.
Length = 248
Score = 29.6 bits (67), Expect = 1.6
Identities = 22/53 (41%), Positives = 32/53 (60%), Gaps = 6/53 (11%)
Query: 188 KASPT-SASSDESDKEEEEKEEGEQTDSEKEAKK-----RKKHKKDKKKKKKK 234
K +PT S + SD EE+ KE G+ + E EA+K ++K K+ K+K KKK
Sbjct: 85 KIAPTVPVSYNSSDWEEQLKEIGKLLNKEDEAEKWLADYKQKAKEAKEKIKKK 137
>gnl|CDD|235089 PRK02946, aceK, bifunctional isocitrate dehydrogenase
kinase/phosphatase protein; Validated.
Length = 575
Score = 29.8 bits (68), Expect = 1.7
Identities = 11/22 (50%), Positives = 15/22 (68%)
Query: 20 GRIVFELYSDICPITCENFRAL 41
GR+VF Y +IC +T NFR +
Sbjct: 470 GRVVFYDYDEICYLTECNFRDI 491
>gnl|CDD|184724 PRK14520, rpsP, 30S ribosomal protein S16; Provisional.
Length = 155
Score = 28.9 bits (65), Expect = 1.7
Identities = 7/48 (14%), Positives = 13/48 (27%)
Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
+E + + KKKK E E + ++
Sbjct: 107 NAALAEADGGPTAEATTPKKKKAAAEAAAAEAAAPAAEAAAAAAAEEE 154
>gnl|CDD|236544 PRK09506, mrcB, bifunctional glycosyl transferase/transpeptidase;
Reviewed.
Length = 830
Score = 29.7 bits (67), Expect = 1.8
Identities = 10/58 (17%), Positives = 26/58 (44%)
Query: 213 DSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
D E +K K + K+K ++ ++++ + + D E +K K + + ++
Sbjct: 5 DREPIGRKGKPSRPVKQKVSRRRYRDDDDYDDYDDYEDEEPMPRKGKGKGRKPRGKRG 62
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 29.5 bits (67), Expect = 1.8
Identities = 17/55 (30%), Positives = 32/55 (58%), Gaps = 6/55 (10%)
Query: 217 EAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
E + R++++++ K+ YE E+E E E+ EK+AKK K KD++ ++
Sbjct: 67 EEELREEYEEELKE------YEAEKEIWEAEKKGLEKKAKKAIKKGKDEEALAEE 115
Score = 29.2 bits (66), Expect = 2.4
Identities = 16/58 (27%), Positives = 31/58 (53%), Gaps = 1/58 (1%)
Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
++E E+ E E + E E + + KK +KK KK ++ +++E + E EA++
Sbjct: 67 EEELREEYEEELKEYEAEKEIWEAEKKGLEKKAKK-AIKKGKDEEALAEELLELEAEE 123
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 29.8 bits (67), Expect = 1.8
Identities = 17/74 (22%), Positives = 34/74 (45%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
E ++ + E E + + + D + + + E+EE+ EE E + +K+
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60
Query: 258 RKKHKKDKKKRKKK 271
K K+ KK+KK+
Sbjct: 61 EKARLKELKKQKKQ 74
Score = 29.8 bits (67), Expect = 2.1
Identities = 16/62 (25%), Positives = 33/62 (53%)
Query: 183 AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
A+ S E + E+EE +E ++ ++ E KR+K + + KK+KK + ++ E
Sbjct: 22 ARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKARLKELKKQKKQEIQKILE 81
Query: 243 KE 244
++
Sbjct: 82 QQ 83
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 28.7 bits (64), Expect = 1.9
Identities = 22/69 (31%), Positives = 42/69 (60%)
Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
E+E E + S+++ K KK K D+KK+ K + E+ EKE ++ ++ + KK +
Sbjct: 51 EKESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQ 110
Query: 263 KDKKKRKKK 271
K++++R+KK
Sbjct: 111 KERERRRKK 119
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 29.3 bits (65), Expect = 1.9
Identities = 18/82 (21%), Positives = 43/82 (52%), Gaps = 3/82 (3%)
Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEK 243
K+K+ +++ +E EE EE + + +E ++ ++++++++++ EE+ +
Sbjct: 116 KKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQSDD 175
Query: 244 EEG---EQTDSEKEAKKRKKHK 262
E EQ +SE E+ K K
Sbjct: 176 SEHEIIEQDESETESDDDKTEK 197
Score = 28.9 bits (64), Expect = 2.1
Identities = 19/74 (25%), Positives = 40/74 (54%), Gaps = 4/74 (5%)
Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
+K + +K ++ +K +E+EE E E+ +R++ K D+++ +++ + E E
Sbjct: 110 VKKDDIKKKKSLIIRQEQIEKARQEREELE----ERMEWERREEKIDEREDQEEQERERE 165
Query: 241 EEKEEGEQTDSEKE 254
E+ E + DSE E
Sbjct: 166 EQTIEEQSDDSEHE 179
>gnl|CDD|115441 pfam06783, UPF0239, Uncharacterized protein family (UPF0239).
Length = 85
Score = 27.7 bits (61), Expect = 1.9
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
S E++ E E E E T+ E A + HK K++KKK+
Sbjct: 47 SHEAEAENSEPESAEVTEKEGAAVPSRLHKIKKEEKKKR 85
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region covers
both the Brf homology II and III regions. This region is
involved in binding TATA binding protein.
Length = 95
Score = 28.0 bits (63), Expect = 2.0
Identities = 15/51 (29%), Positives = 28/51 (54%)
Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
EQ + E + K + + KKKKK+K+K + +E ++ K+ ++KK
Sbjct: 31 EQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAEAVKKMLQKKK 81
Score = 26.1 bits (58), Expect = 8.4
Identities = 15/35 (42%), Positives = 22/35 (62%)
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
EE+EEKE ++ D + K+KK +K KKKR +
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDEAG 64
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 29.5 bits (65), Expect = 2.0
Identities = 15/73 (20%), Positives = 38/73 (52%)
Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
+++++E++ EQ +++ ++ +HK K++ +K+ K E +E+ + K+ K
Sbjct: 147 NEKKKERQLAEQLAAKRLKDEQHRHKARKQELRKREKDRERARREDAAAAAAAKQKAAAK 206
Query: 260 KHKKDKKKRKKKS 272
K K+ K+
Sbjct: 207 KAAAPSGKKSAKA 219
>gnl|CDD|220521 pfam10014, BsmA, Biofilm formation and stress response factor.
YjfO-BsmA (biofilm stress and motility) is a protein
that is expressed at high levels in biofilms. It is
found to be up-regulated during biofilm formation and pH
and oxidative stress.
Length = 194
Score = 29.1 bits (66), Expect = 2.0
Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 10/60 (16%)
Query: 124 ITTQPAP---HLDGVHVVFGHVVSGQNV---VSHIENLPVDKMSRPLQDVRVVKCNELVL 177
+P P H DGV VF H+V N+ + I + L +++ + +L
Sbjct: 108 ELGEPTPEGLHQDGVDFVFVHLVGRHNIEGGETRIYDDD----GEILFRFTLLEPLDTLL 163
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 29.5 bits (66), Expect = 2.1
Identities = 13/58 (22%), Positives = 24/58 (41%), Gaps = 1/58 (1%)
Query: 191 PTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
PT +E + E E +EG E EA R+ +++ + +E ++ E
Sbjct: 220 PTENEQEEQGEGEGEGQEGSAPQ-ESEATDRESESGEEEMVQSDQDDLPDESDDDSET 276
Score = 28.4 bits (63), Expect = 4.4
Identities = 24/105 (22%), Positives = 45/105 (42%), Gaps = 18/105 (17%)
Query: 157 VDKMSRPLQD----VRVVKCNELVLKSKLKAKEKRKASPTSASSDESD---KEEEEKEEG 209
+D++S + D RVV+ +L+S A+E + + D D E E++E+G
Sbjct: 173 LDRLSAAIDDQQAFARVVR---EMLRSMELAEEMGDDTESEDEEDGDDDQPTENEQEEQG 229
Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
E +E ++ + + E E +EE Q+D +
Sbjct: 230 EGEGEGQEGSA--------PQESEATDRESESGEEEMVQSDQDDL 266
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 29.0 bits (65), Expect = 2.2
Identities = 15/79 (18%), Positives = 29/79 (36%), Gaps = 2/79 (2%)
Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKD--KKKKKKKSKYE 238
+ KE + + S ++ E E+ +E A+ + K +++
Sbjct: 107 KEEKEGKVNAKKDKLSKAKKAAKKAALEAEKKVNEARAEAVAEKKAAEAAAVAAEEAAAA 166
Query: 239 EEEEKEEGEQTDSEKEAKK 257
EEEE EE ++ E
Sbjct: 167 EEEEAEEAPAEEAPAEESA 185
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
(DUF2052). This entry is of sequences of two conserved
domains separated by a region of low complexity,
spanning some 200 residues. The function is unknown.
Length = 178
Score = 28.5 bits (64), Expect = 2.3
Identities = 20/72 (27%), Positives = 35/72 (48%)
Query: 176 VLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS 235
VL+S L +E R +ES +EEEE E E + E+E ++ +++++
Sbjct: 59 VLESSLDREEARLELLEQQEEEESAEEEEEDPEEENEEEEEEYQRGPFGEEEEEDGDSYD 118
Query: 236 KYEEEEEKEEGE 247
+ EE+EE
Sbjct: 119 ELPTPEEREELR 130
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 29.7 bits (67), Expect = 2.3
Identities = 20/75 (26%), Positives = 39/75 (52%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
E +E E+E E +E K+ + + +K++ + E EEEKEE E+ E E++
Sbjct: 841 EEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESEL 900
Query: 258 RKKHKKDKKKRKKKS 272
+ ++ +K R++
Sbjct: 901 AELKEEIEKLRERLE 915
Score = 29.7 bits (67), Expect = 2.3
Identities = 17/90 (18%), Positives = 44/90 (48%), Gaps = 1/90 (1%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
+ KEK +A E+ EE E+ E ++++E +++ ++ ++ ++ EE
Sbjct: 327 ELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEA-LREEL 385
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ E E + E ++ K+ + ++R ++
Sbjct: 386 AELEAELAEIRNELEELKREIESLEERLER 415
Score = 28.1 bits (63), Expect = 5.7
Identities = 18/75 (24%), Positives = 45/75 (60%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
+E ++ E+E EE ++ E EA+K + + K+ +++K + EEE + E E + ++E +
Sbjct: 849 EELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIE 908
Query: 257 KRKKHKKDKKKRKKK 271
K ++ ++ + + ++
Sbjct: 909 KLRERLEELEAKLER 923
Score = 27.8 bits (62), Expect = 7.7
Identities = 15/95 (15%), Positives = 38/95 (40%), Gaps = 4/95 (4%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
L+ +L E+ + + EE K E E E + + ++ + K++ +
Sbjct: 244 LEEELSRLEEELEELQEELEEAEKEIEELKSELE----ELREELEELQEELLELKEEIEE 299
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
E E + E E ++ ++ ++ K++ +
Sbjct: 300 LEGEISLLRERLEELENELEELEERLEELKEKIEA 334
Score = 27.8 bits (62), Expect = 7.9
Identities = 23/80 (28%), Positives = 42/80 (52%), Gaps = 4/80 (5%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
LK +L+ E K DE + EEEKEE E+ E E++ + ++ +K +++ +
Sbjct: 861 LKEELEELEAEKEE----LEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916
Query: 237 YEEEEEKEEGEQTDSEKEAK 256
E + E+ E E + E+E +
Sbjct: 917 LEAKLERLEVELPELEEELE 936
>gnl|CDD|165563 PHA03308, PHA03308, transcriptional regulator ICP4; Provisional.
Length = 1463
Score = 29.4 bits (65), Expect = 2.3
Identities = 12/37 (32%), Positives = 23/37 (62%)
Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKK 226
S +S+SSD S EE+ E+ E+ D E+ +++ ++
Sbjct: 1263 SSSSSSSDSSSSEEDGDEKNEKEDRERAGGGKRRGRQ 1299
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 29.6 bits (67), Expect = 2.4
Identities = 9/38 (23%), Positives = 23/38 (60%), Gaps = 1/38 (2%)
Query: 219 KKRKKHKKDKKKKKKK-SKYEEEEEKEEGEQTDSEKEA 255
+++KK+KKD +K+ + Y +++ KE ++ ++
Sbjct: 84 REKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRHV 121
Score = 28.4 bits (64), Expect = 5.1
Identities = 7/16 (43%), Positives = 13/16 (81%)
Query: 256 KKRKKHKKDKKKRKKK 271
+++KK+KKD +KR +
Sbjct: 84 REKKKYKKDVEKRNRS 99
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein is
found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 28.9 bits (65), Expect = 2.4
Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 1/48 (2%)
Query: 225 KKDKKKKKKK-SKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
KK KKKKK K S +++EE++E E D +K K+ + + KK
Sbjct: 1 KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKK 48
Score = 28.1 bits (63), Expect = 4.3
Identities = 20/76 (26%), Positives = 34/76 (44%), Gaps = 11/76 (14%)
Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK---------- 233
K+K+K + S + D+EE+E E ++ KE+ + + + KKK
Sbjct: 1 KKKKKKKKSKLSFGDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSF 60
Query: 234 -KSKYEEEEEKEEGEQ 248
K EE+E E E+
Sbjct: 61 LPDKAREEKEAELREE 76
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 29.2 bits (66), Expect = 2.4
Identities = 11/59 (18%), Positives = 31/59 (52%), Gaps = 1/59 (1%)
Query: 204 EEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
+EK E + E+E R + K+ +K+ + ++E E+E+ ++ + + + ++ +
Sbjct: 195 KEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELR-KKYEEKLRQELERQAE 252
Score = 27.7 bits (62), Expect = 8.4
Identities = 21/113 (18%), Positives = 57/113 (50%), Gaps = 5/113 (4%)
Query: 157 VDKMSRPLQDVRVVKCNELVLKSKLKAK-EKRKASPTSASSDESDKEEEEKEEGEQTDSE 215
+D++S+ L +++ E L+ LK K E+ + + +E E+ + + E
Sbjct: 172 LDQLSKKLAELK--AEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFE 229
Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
+E ++ +K ++K +++ + + E E+K + E + + +++ K+ K++
Sbjct: 230 REKEELRKKYEEKLRQELERQAEAHEQKLKNELA--LQAIELQREFNKEIKEK 280
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 29.3 bits (66), Expect = 2.4
Identities = 17/76 (22%), Positives = 38/76 (50%), Gaps = 2/76 (2%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK--KSKYEEEEEKEEGEQTDSEKE 254
+E +KE E E E + E+E + K+ + +++ ++ K E EEE E + + +
Sbjct: 469 EEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALK 528
Query: 255 AKKRKKHKKDKKKRKK 270
+ +K +K + ++
Sbjct: 529 EELEEKLEKLENLLEE 544
Score = 29.0 bits (65), Expect = 3.5
Identities = 16/95 (16%), Positives = 42/95 (44%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
LK K +E ++ E + EE E E+ E K + + + + +
Sbjct: 212 LKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKI 271
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
EEE + E + E++ ++ ++ +++ ++ +++
Sbjct: 272 REEELRELERLLEELEEKIERLEELEREIEELEEE 306
Score = 28.6 bits (64), Expect = 4.8
Identities = 21/99 (21%), Positives = 39/99 (39%), Gaps = 7/99 (7%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK------- 229
L+ +L+ E+ ++ EE+ E+ E E E K K + K
Sbjct: 506 LEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLE 565
Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
+ ++ K EE + + + +E ++R K K K K
Sbjct: 566 DRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKE 604
Score = 27.8 bits (62), Expect = 8.2
Identities = 18/100 (18%), Positives = 40/100 (40%), Gaps = 2/100 (2%)
Query: 173 NELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
EL L + + + + +K EE+ E+ E +E + K +++
Sbjct: 305 EELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLES--ELEELAEEKNELAKLLEER 362
Query: 233 KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K E EE E+ + E+ + + ++ K++ + S
Sbjct: 363 LKELEERLEELEKELEKALERLKQLEEAIQELKEELAELS 402
Score = 27.4 bits (61), Expect = 8.9
Identities = 15/98 (15%), Positives = 36/98 (36%), Gaps = 3/98 (3%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
L+ E+ + S E E+ E++ + +E + K +++ K
Sbjct: 307 LEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKEL 366
Query: 237 YEEEEEKEEGEQTDSEKE---AKKRKKHKKDKKKRKKK 271
E EE E+ + E+ + ++ K++ +
Sbjct: 367 EERLEELEKELEKALERLKQLEEAIQELKEELAELSAA 404
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 28.8 bits (65), Expect = 2.4
Identities = 18/58 (31%), Positives = 31/58 (53%), Gaps = 9/58 (15%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
L K++ +K++ KE E + +++++ +E KK+KK K KKKK KK
Sbjct: 149 LLKKVERLKKKE---------LDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKK 197
Score = 28.4 bits (64), Expect = 3.0
Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 5/73 (6%)
Query: 204 EEKEEGEQT----DSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
+ KEEG+ EK K K ++ KKK+ E E +++ + ++E+E KK+K
Sbjct: 128 DVKEEGDDKFYKYSDEKLLAWLLK-KVERLKKKELDIKEAEAARDKKKSNNAEEEKKKKK 186
Query: 260 KHKKDKKKRKKKS 272
K KKK KK +
Sbjct: 187 KKSAKKKKLKKVA 199
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 29.3 bits (65), Expect = 2.5
Identities = 31/125 (24%), Positives = 48/125 (38%), Gaps = 23/125 (18%)
Query: 167 VRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKE-----------EGEQTDSE 215
R+ K KL K K+K +P ++SDK +E E G++ E
Sbjct: 474 TRISKIQFTQEIKKLIKKSKKKLAPI--EEEDSDKHDEPPEGPEASGLPPKAPGDKEGEE 531
Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTD----------SEKEAKKRKKHKKDK 265
E + K+ + K+ K E E K+ G + + E K KH KD
Sbjct: 532 GEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDP 591
Query: 266 KKRKK 270
++ KK
Sbjct: 592 EEPKK 596
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 28.6 bits (64), Expect = 2.5
Identities = 13/49 (26%), Positives = 24/49 (48%), Gaps = 1/49 (2%)
Query: 219 KKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
+K KK KKK K+K EE++ + + ++T+ E E + +
Sbjct: 60 RKPATTKKSKKKDKEKLTEEEKKPESDDDKTE-ENENDPDNNEESGDSQ 107
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 27.8 bits (62), Expect = 2.6
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 189 ASPTSASSDESDKEEEEKEEGEQTDSEKEA 218
A+ +A +KEEE+K+E E+ + E+EA
Sbjct: 69 AAAAAAEEKAEEKEEEKKKEEEKEEEEEEA 98
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 29.1 bits (65), Expect = 2.6
Identities = 7/81 (8%), Positives = 28/81 (34%)
Query: 186 KRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
R++ S D ++ + + +R + + ++ + +S + +
Sbjct: 38 HRRSRERSYREDSRPRDRRRYDSRSPRSLRYSSVRRSRDRPRRRSRSVRSIEQHRRRLRD 97
Query: 246 GEQTDSEKEAKKRKKHKKDKK 266
++ ++ K++ K
Sbjct: 98 RSPSNQWRKDDKKRSLWDIKP 118
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 29.1 bits (65), Expect = 2.6
Identities = 19/89 (21%), Positives = 35/89 (39%), Gaps = 1/89 (1%)
Query: 183 AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
+E+ P + S K + + + K A + K+ K+K K+ K E
Sbjct: 760 LEEEEFEYP-RMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPKEKGPNDKEIKIESPSV 818
Query: 243 KEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ EGE+ ++ +K +KKK
Sbjct: 819 ETEGERCTIKQREEKGIDAPAILNVKKKK 847
Score = 28.8 bits (64), Expect = 3.8
Identities = 20/108 (18%), Positives = 41/108 (37%), Gaps = 18/108 (16%)
Query: 148 VVSHIENLPVDKMSRPLQDVR-----VVKCNELVL----KSKLKAKEKRKASPTSASSDE 198
E+L ++ P V K N VL K+ +AK+K K ++
Sbjct: 753 KFCFNEDLEEEEFEYPRMYEEASGKSVRKVNTAVLSTTIKAAARAKQKPK--------EK 804
Query: 199 SDKEEEEKEEGEQTDSEKEA-KKRKKHKKDKKKKKKKSKYEEEEEKEE 245
++E K E ++E E +++ +K + +++ K +
Sbjct: 805 GPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKVD 852
>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
structure and dynamics].
Length = 395
Score = 29.0 bits (65), Expect = 2.7
Identities = 10/62 (16%), Positives = 24/62 (38%), Gaps = 2/62 (3%)
Query: 177 LKSKLKAKEKRKA--SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
+ + K EK K P + + D E + E +E+E ++ + + +
Sbjct: 65 IPKRKKQTEKGKKEKKPKVSDRMDLDNENVQLEMLYSISNEREIRQLRFGGSKVQNPHEG 124
Query: 235 SK 236
++
Sbjct: 125 AR 126
Score = 28.2 bits (63), Expect = 5.0
Identities = 12/59 (20%), Positives = 26/59 (44%), Gaps = 5/59 (8%)
Query: 219 KKRKKHKKDKKKKKKKSK-----YEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K++K+ +K KK+KK K E + E +E+E ++ + + + +
Sbjct: 67 KRKKQTEKGKKEKKPKVSDRMDLDNENVQLEMLYSISNEREIRQLRFGGSKVQNPHEGA 125
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 29.2 bits (65), Expect = 2.7
Identities = 20/65 (30%), Positives = 30/65 (46%), Gaps = 1/65 (1%)
Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEG 246
R S S E + E + +S K+ KK KK K KKK+ K++ E++ EE
Sbjct: 93 RHISSVSTVDSELNDEYGNPIWKNRVESWKD-KKNKKKKSAKKKEAHKAQIPPEQQMEEK 151
Query: 247 EQTDS 251
D+
Sbjct: 152 PSADA 156
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 29.1 bits (66), Expect = 2.7
Identities = 27/132 (20%), Positives = 49/132 (37%), Gaps = 6/132 (4%)
Query: 146 QNVVSHIENLPVDKMSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEE 205
+ ++S I NL + P ++VV L+ +L+A E + E + E
Sbjct: 728 KEIISAIRNLRAEMNLSPSAPLKVVLVGSEELEDRLEANEDDIKGLANLEELEILSPDPE 787
Query: 206 KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE------QTDSEKEAKKRK 259
+ E T A+ + ++ E+E EK E E + +E K
Sbjct: 788 EPPVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEIDRIEKKLSNEGFVAKAP 847
Query: 260 KHKKDKKKRKKK 271
+ +K+K K
Sbjct: 848 EEVVEKEKEKLA 859
>gnl|CDD|227701 COG5414, COG5414, TATA-binding protein-associated factor
[Transcription].
Length = 392
Score = 28.9 bits (64), Expect = 2.7
Identities = 20/98 (20%), Positives = 42/98 (42%), Gaps = 4/98 (4%)
Query: 174 ELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
E+ K + +E+ + + +SD+ +E EE E+ D +E ++ + D+ + +
Sbjct: 288 EVSEGDKEQQQEEVENAEAHKEEVQSDRPDEIGEEKEEDDENEENERHTELLADELNELE 347
Query: 234 KSKYEEEEEKEEGE----QTDSEKEAKKRKKHKKDKKK 267
K E+ + E Q E + K + K+K
Sbjct: 348 KGIEEKRRQMESATNPILQKRFESQLNVLLKELELKRK 385
>gnl|CDD|218985 pfam06315, AceK, Isocitrate dehydrogenase kinase/phosphatase
(AceK). This family consists of several bacterial
isocitrate dehydrogenase kinase/phosphatase (AceK)
proteins (EC:2.7.1.116).
Length = 562
Score = 29.1 bits (66), Expect = 2.7
Identities = 10/22 (45%), Positives = 14/22 (63%)
Query: 20 GRIVFELYSDICPITCENFRAL 41
GR+VF Y +I +T NFR +
Sbjct: 463 GRVVFYDYDEIEYLTDCNFRDI 484
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 28.8 bits (65), Expect = 2.8
Identities = 15/42 (35%), Positives = 17/42 (40%)
Query: 219 KKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+K + K K KKKKK K E Q KK KK
Sbjct: 352 PIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393
Score = 28.8 bits (65), Expect = 3.2
Identities = 14/46 (30%), Positives = 21/46 (45%)
Query: 218 AKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKK 263
A K+ K + K K KK K + +T E ++ +KK KK
Sbjct: 348 ATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAKSSGKKKVKK 393
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 29.3 bits (66), Expect = 2.8
Identities = 21/91 (23%), Positives = 38/91 (41%), Gaps = 8/91 (8%)
Query: 165 QDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKH 224
+DVR ELVL +R+ P + +K+ EEK E D EK +
Sbjct: 295 EDVREAA--ELVLP------HRRRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDA 346
Query: 225 KKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
++ + ++E+E + E+ D + +
Sbjct: 347 EQSGPRGHPTPGNDDEKEPDPQEEADGQGSS 377
>gnl|CDD|219743 pfam08202, Mis12_component, Mis12-Mtw1 protein family. Mis12-Mtw1
is a eukaryotic conserved kinetochore protein that is
involved in chromosome segregation.
Length = 296
Score = 28.7 bits (64), Expect = 2.9
Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 1/67 (1%)
Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
+K E +KEE +++ + + A+ KK K K S E E+EEG+ DSE +
Sbjct: 80 EKLESKKEETDESTARRIARVIKKEFLRDLKAGKVSISWENREEEEGDDNDSELT-NLFE 138
Query: 260 KHKKDKK 266
+ + K
Sbjct: 139 DNDEKKL 145
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 28.1 bits (63), Expect = 3.1
Identities = 10/29 (34%), Positives = 21/29 (72%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHK 225
+E ++EE+++E+ ++ +SE+E KK K
Sbjct: 118 EEDEEEEDDEEDDDEDESEEEESPVKKVK 146
Score = 28.1 bits (63), Expect = 3.5
Identities = 10/26 (38%), Positives = 18/26 (69%)
Query: 237 YEEEEEKEEGEQTDSEKEAKKRKKHK 262
EEE+++E+ ++ +SE+E KK K
Sbjct: 121 EEEEDDEEDDDEDESEEEESPVKKVK 146
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 28.1 bits (63), Expect = 3.1
Identities = 13/31 (41%), Positives = 18/31 (58%)
Query: 206 KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
+E + +K+ KK+KK KK KK KK K
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 28.8 bits (64), Expect = 3.1
Identities = 13/63 (20%), Positives = 28/63 (44%), Gaps = 6/63 (9%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAK------KRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
E E E ++ + + + K +K++ K K+ + + EEE+ +E + D
Sbjct: 4 EFASSELEDDDWVRGSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDD 63
Query: 252 EKE 254
+ E
Sbjct: 64 DDE 66
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 29.1 bits (65), Expect = 3.2
Identities = 24/98 (24%), Positives = 34/98 (34%), Gaps = 15/98 (15%)
Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEA----KKRKKHKKDKKKKKKKSKYEEEEEK 243
+A S S ESD E+ E++E E E K + K + + E
Sbjct: 666 EAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSED 725
Query: 244 EEGEQTDSEK---------EAKKRKKHKKDKKKRKKKS 272
EEG D E+ + K KK + KK
Sbjct: 726 EEG--MDDEQMMRLDTYLAQIFKEKKERIQAGGETKKE 761
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 28.7 bits (64), Expect = 3.2
Identities = 19/51 (37%), Positives = 28/51 (54%), Gaps = 1/51 (1%)
Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
E+ K+ K K+ K K E E +K++ EQ +E KK+KK KK+K
Sbjct: 28 EEAQKQIADGKTLKRLYKVYEKREFELKKQQFEQLKAELS-KKKKKFKKEK 77
>gnl|CDD|220135 pfam09184, PPP4R2, PPP4R2. PPP4R2 (protein phosphatase 4 core
regulatory subunit R2) is the regulatory subunit of the
histone H2A phosphatase complex. It has been shown to
confer resistance to the anticancer drug cisplatin in
yeast, and may confer resistance in higher eukaryotes.
Length = 285
Score = 28.6 bits (64), Expect = 3.2
Identities = 21/82 (25%), Positives = 33/82 (40%), Gaps = 5/82 (6%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEE-EKEEGE----QTDSEKEAKKRKKHKKDKKKKK 232
K L P S+ EE EKE + Q D K + + K D++
Sbjct: 195 KDSLGNGSSTNGLPDSSQDKNKSLEEYYEKESSDAAASQDDGPKGSDVKNKKSDDEEDDD 254
Query: 233 KKSKYEEEEEKEEGEQTDSEKE 254
+ Y EE+E +E E+ + +E
Sbjct: 255 QDGDYVEEKELKEDEEEEETEE 276
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 27.4 bits (61), Expect = 3.3
Identities = 19/71 (26%), Positives = 36/71 (50%), Gaps = 1/71 (1%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+ E+ E+ EKE K K+ ++ ++ + K + +EEKE E+ + K K+ +
Sbjct: 33 RMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEKERYEKM-AAKMHAKKVE 91
Query: 261 HKKDKKKRKKK 271
K ++KR K
Sbjct: 92 RLKRREKRNKL 102
>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
component YidC; Provisional.
Length = 304
Score = 28.5 bits (64), Expect = 3.3
Identities = 12/34 (35%), Positives = 15/34 (44%)
Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKK 233
DKEEE K++ + A K K KKK
Sbjct: 271 DKEEEAKKQAKAERRAANAPKPGAKPKRSKKKAP 304
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 28.9 bits (65), Expect = 3.3
Identities = 20/94 (21%), Positives = 48/94 (51%), Gaps = 4/94 (4%)
Query: 178 KSKLKAKEKR---KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
+ +L+ +E+R K A +++ D E + EE E+ S +E + + K+ + +
Sbjct: 76 REELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRV 135
Query: 235 SKYEEEEEKEE-GEQTDSEKEAKKRKKHKKDKKK 267
+ E+ ++ + D+E E +K ++ KK +++
Sbjct: 136 AGLTPEQARKLLLKLLDAELEEEKAQRVKKIEEE 169
Score = 27.8 bits (62), Expect = 6.6
Identities = 18/88 (20%), Positives = 45/88 (51%), Gaps = 1/88 (1%)
Query: 183 AKEKRKASPTSASSDESDKE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
K +R A E+ KE EE+ E E ++R + +++ ++++++ + EEE
Sbjct: 26 KKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQREEER 85
Query: 242 EKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
++ EQ D+ E +++ +++++
Sbjct: 86 LVQKEEQLDARAEKLDNLENQLEEREKA 113
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 28.9 bits (65), Expect = 3.4
Identities = 15/68 (22%), Positives = 27/68 (39%)
Query: 180 KLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEE 239
KL EK + + ++ KEE +K + E E+ + + + + EE
Sbjct: 17 KLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEE 76
Query: 240 EEEKEEGE 247
E + E E
Sbjct: 77 EAKAEAAE 84
>gnl|CDD|217451 pfam03249, TSA, Type specific antigen. There are several antigenic
variants in Rickettsia tsutsugamushi, and a
type-specific antigen (TSA) of 56-kilodaltons located on
the rickettsial surface is responsible for the
variation. TSA proteins are probably integral membrane
proteins.
Length = 502
Score = 28.7 bits (64), Expect = 3.5
Identities = 13/52 (25%), Positives = 24/52 (46%)
Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
K+ K ++H KK +K + +EE+ +G +K+ K + K K
Sbjct: 348 KDLVKLQRHAGIKKAMEKLAAQQEEDAGNQGGGDCKKKQGASEKSKEGGKGK 399
>gnl|CDD|216860 pfam02063, MARCKS, MARCKS family.
Length = 296
Score = 28.6 bits (63), Expect = 3.5
Identities = 22/87 (25%), Positives = 42/87 (48%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K +A+ ASP A + + E+ ++ + S + KK+KK KK K
Sbjct: 93 KEPAEAEPAEPASPAEAEGEAATSTEKAEDGATPSPSSETPKKKKKRFSFKKSFKLSGFS 152
Query: 238 EEEEEKEEGEQTDSEKEAKKRKKHKKD 264
++ +KE GE ++E A +++ K++
Sbjct: 153 FKKNKKEAGEGAEAEGAAAEKEGAKEE 179
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 28.6 bits (64), Expect = 3.5
Identities = 25/93 (26%), Positives = 49/93 (52%), Gaps = 7/93 (7%)
Query: 184 KEKRKASPTSASSDES-----DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE 238
K R+ P S ES E +EKE EQ + ++ +++KK K+ K K++ +
Sbjct: 258 KPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVKQRFEAN 317
Query: 239 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+ +K + + E+ AK RK+ K+ +K+ +++
Sbjct: 318 DPAQKLQ--EERKEQLAKLRKEEKEREKEYEQE 348
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 27.9 bits (63), Expect = 3.5
Identities = 13/35 (37%), Positives = 21/35 (60%)
Query: 225 KKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
K+ KKKKKKK + E EEK++ E++ ++
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQKA 116
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 27.7 bits (62), Expect = 3.7
Identities = 13/51 (25%), Positives = 31/51 (60%)
Query: 222 KKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ ++K+++K+ + EE+E KEE E+ + +K ++ K+ ++ K K+
Sbjct: 10 RIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKN 60
>gnl|CDD|218517 pfam05236, TAF4, Transcription initiation factor TFIID component
TAF4 family. This region of similarity is found in
Transcription initiation factor TFIID component TAF4.
Length = 255
Score = 28.5 bits (64), Expect = 3.7
Identities = 13/52 (25%), Positives = 24/52 (46%), Gaps = 7/52 (13%)
Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
KEEEE+ + + E ++ + K+K+K E++ E EE +
Sbjct: 122 QKEEEERRVERRRELGLE-------DPEQLRLKQKAKEEQKAESEETRHRAA 166
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 28.2 bits (63), Expect = 3.7
Identities = 18/59 (30%), Positives = 26/59 (44%), Gaps = 2/59 (3%)
Query: 216 KEAKKRKKHKKDKKKKKKKSKY--EEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K+ K++K+ K +KK+ KY EE +K E + E KKK K K
Sbjct: 163 KKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKKKKAKSKY 221
>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein. Hsp70 chaperones help to fold
many proteins. Hsp70 assisted folding involves repeated
cycles of substrate binding and release. Hsp70 activity
is ATP dependent. Hsp70 proteins are made up of two
regions: the amino terminus is the ATPase domain and the
carboxyl terminus is the substrate binding region.
Length = 598
Score = 28.7 bits (65), Expect = 3.8
Identities = 24/71 (33%), Positives = 33/71 (46%), Gaps = 5/71 (7%)
Query: 193 SASSDESDKEEEE-KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE--KEEGEQT 249
A+ D+ KE E K E E+ +K K + DK + K K EE E KEE E
Sbjct: 516 YAAEDKKRKERIEAKNEAEEY--VYSLEKSLKEEGDKLPEADKKKVEEAIEWLKEELEGE 573
Query: 250 DSEKEAKKRKK 260
D E+ K ++
Sbjct: 574 DKEEIEAKTEE 584
Score = 27.6 bits (62), Expect = 8.3
Identities = 20/87 (22%), Positives = 35/87 (40%), Gaps = 13/87 (14%)
Query: 193 SASSDESDKEEEEKEEGEQTDSEKE----AKKRKKHKKDKKKKKKKSKYEEEEEKEEGE- 247
SA + KE++ S+ E K +++ + KK+K E E K E E
Sbjct: 481 SAKDKGTGKEQKITITASSGLSDDEIERMVKDAEEYAAEDKKRK-----ERIEAKNEAEE 535
Query: 248 ---QTDSEKEAKKRKKHKKDKKKRKKK 271
+ + + K + DKKK ++
Sbjct: 536 YVYSLEKSLKEEGDKLPEADKKKVEEA 562
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 28.9 bits (65), Expect = 3.8
Identities = 10/52 (19%), Positives = 22/52 (42%)
Query: 219 KKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
K +K+ + + +E+++G +E E + DK+K +K
Sbjct: 216 ININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEK 267
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 28.6 bits (64), Expect = 3.8
Identities = 16/65 (24%), Positives = 30/65 (46%)
Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQT 249
S ++ + E + E EAKK+K+ ++ K +KK +Y +E++ QT
Sbjct: 70 SDQKPEDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSSSLQT 129
Query: 250 DSEKE 254
+E
Sbjct: 130 KLTEE 134
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 28.4 bits (63), Expect = 3.8
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKE 217
+ P S+SS SD E EE EE E+ + E+E
Sbjct: 212 LRTPPNSSSSSGSDSESEEDEEEEEEEEEEE 242
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 28.3 bits (63), Expect = 3.9
Identities = 13/61 (21%), Positives = 18/61 (29%)
Query: 204 EEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKK 263
E KE KK KK + K K K K +++ + K K
Sbjct: 133 EVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWK 192
Query: 264 D 264
Sbjct: 193 S 193
>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 199 to 238 amino acids in length. This domain is
found associated with pfam06512, pfam00520. This domain
has a conserved ADD sequence motif.
Length = 222
Score = 28.2 bits (63), Expect = 4.2
Identities = 12/56 (21%), Positives = 25/56 (44%)
Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
S+ ++ K+ + K KK+ EE + ++ SE +++ +K RK
Sbjct: 4 SKLSSESAKERRNRNDKNKKEEHSIGSEEGDSEKEPKSESADGRKRCRFLLRKTRK 59
>gnl|CDD|215283 PLN02518, PLN02518, pheophorbide a oxygenase.
Length = 539
Score = 28.7 bits (64), Expect = 4.2
Identities = 9/36 (25%), Positives = 21/36 (58%)
Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK 222
R A+P S S+ + +++E +E+ + + +E+ K
Sbjct: 49 RVAAPPSVPSEAALQQDEGEEQRVEQELGQESSDSK 84
>gnl|CDD|227881 COG5594, COG5594, Uncharacterized integral membrane protein
[Function unknown].
Length = 827
Score = 28.5 bits (64), Expect = 4.2
Identities = 17/53 (32%), Positives = 21/53 (39%), Gaps = 3/53 (5%)
Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
E+ +E K KK K KKS K+E E KKR KH+
Sbjct: 265 EKALNELLNKLLKKSHL---KTNKKSGKLTPSRKKEFEILPEYVPDKKRPKHR 314
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association with
pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 26.9 bits (60), Expect = 4.3
Identities = 13/38 (34%), Positives = 21/38 (55%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
E +E++E+ + + + +RKK KK K KKK K
Sbjct: 17 GEEGEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54
Score = 26.5 bits (59), Expect = 4.6
Identities = 13/35 (37%), Positives = 23/35 (65%)
Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
EE+ E+ + + + +++ ++ KK KKK KKKSK
Sbjct: 20 GEEDDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 28.3 bits (63), Expect = 4.4
Identities = 19/119 (15%), Positives = 39/119 (32%), Gaps = 15/119 (12%)
Query: 160 MSRPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAK 219
M P D +E E K ++ E E + E+
Sbjct: 184 MGGPDSDEDENPLDEEA---PDMTPETSKQDQKEERRAAVERRLAELVEMINWNLEE--- 237
Query: 220 KRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKH------KKDKKKRKKKS 272
+ + +K+++ + E + + E+ DSE K + + ++ + RK K
Sbjct: 238 ---RRRDLRKEQELEENVERDSDDEDEHGEDSEDGETKPESYITSEYIERISEIRKMKD 293
>gnl|CDD|223068 PHA03384, PHA03384, early DNA-binding protein E2A; Provisional.
Length = 445
Score = 28.5 bits (64), Expect = 4.4
Identities = 16/73 (21%), Positives = 23/73 (31%), Gaps = 7/73 (9%)
Query: 199 SDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
SD + E KK K +++ EEEEE+E R
Sbjct: 8 SDSPYSSDDSPSLEPPELPPKK-------KGRRRVSPVEEEEEEEEAEVVAVGFSYPPVR 60
Query: 259 KKHKKDKKKRKKK 271
KD K+ +
Sbjct: 61 ISRGKDGKRPVRP 73
>gnl|CDD|234941 PRK01315, PRK01315, putative inner membrane protein translocase
component YidC; Provisional.
Length = 329
Score = 28.2 bits (63), Expect = 4.4
Identities = 11/59 (18%), Positives = 27/59 (45%), Gaps = 1/59 (1%)
Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK-KDKKKRKKKS 272
+E + KK K + + K E + + ++++ +R++ K + + KR+K
Sbjct: 268 AREERLAKKGKDHGESEGKVVAPEGAVAQTTEVREQTKRQTVQRQQPKRQSRAKRQKGG 326
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 28.5 bits (64), Expect = 4.4
Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 4/51 (7%)
Query: 190 SPTSASSDESDKEE----EEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
+ S S E+ E+ + +KR+ KK KKKK KK K
Sbjct: 324 TSDSDPSTETPPPASLSHSPPAAFERPLALSPKRKREGDKKQKKKKSKKLK 374
>gnl|CDD|191249 pfam05279, Asp-B-Hydro_N, Aspartyl beta-hydroxylase N-terminal
region. This family includes the N-terminal regions of
the junctin, junctate and aspartyl beta-hydroxylase
proteins. Junctate is an integral ER/SR membrane calcium
binding protein, which comes from an alternatively
spliced form of the same gene that generates aspartyl
beta-hydroxylase and junctin. Aspartyl beta-hydroxylase
catalyzes the post-translational hydroxylation of
aspartic acid or asparagine residues contained within
epidermal growth factor (EGF) domains of proteins.
Length = 240
Score = 28.0 bits (62), Expect = 4.5
Identities = 13/63 (20%), Positives = 31/63 (49%)
Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
+S++ ++ +EK ++ + KE ++ + K + + Y+EE+ E + E+
Sbjct: 168 TASEQYKQDMKEKASEQENEDSKEPVEKAERTKAETDDVTEEDYDEEDNPVEDSKAIKEE 227
Query: 254 EAK 256
AK
Sbjct: 228 LAK 230
>gnl|CDD|148511 pfam06936, Selenoprotein_S, Selenoprotein S (SelS). This family
consists of several mammalian selenoprotein S (SelS)
sequences. SelS is a plasma membrane protein and is
present in a variety of tissues and cell types. The
function of this family is unknown.
Length = 190
Score = 27.9 bits (62), Expect = 4.6
Identities = 20/91 (21%), Positives = 36/91 (39%), Gaps = 18/91 (19%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKK------------HKKDKKKKK 232
+K + S S+KE D +K ++++ K K K+
Sbjct: 52 QKLRPRWRSLQKRSSEKEIT------VFDPDKVVRRQEALEAARLRMQEDLDAKAAKFKE 105
Query: 233 KKSKYEEEEEKEEGEQTDSEKEAKKRKKHKK 263
K+ + EEE+ +++ E D +E K K K
Sbjct: 106 KQKQLEEEKRRQKIEMWDRMQEGKSYKVSAK 136
Score = 27.2 bits (60), Expect = 7.7
Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 2/82 (2%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEE 241
+ +E +A+ D K + KE+ Q E+E +++K D+ ++ K K +
Sbjct: 80 RRQEALEAARLRMQEDLDAKAAKFKEK--QKQLEEEKRRQKIEMWDRMQEGKSYKVSAKL 137
Query: 242 EKEEGEQTDSEKEAKKRKKHKK 263
+E E+T S K K KK
Sbjct: 138 AQEATEETSSSSVLPKPKTEKK 159
>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are typically between 187 and 201 amino
acids in length. There is a single completely conserved
residue Q that may be functionally important.
Length = 158
Score = 27.5 bits (62), Expect = 4.6
Identities = 7/38 (18%), Positives = 19/38 (50%)
Query: 231 KKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKR 268
+ K ++Y++E+ + E + E +AK+ + +
Sbjct: 73 QAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQ 110
>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
[DNA replication, recombination, and repair].
Length = 663
Score = 28.3 bits (64), Expect = 4.7
Identities = 24/73 (32%), Positives = 40/73 (54%), Gaps = 1/73 (1%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKH-KKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRK 259
K+ + +GE + E +AK KK K KK+ +K K E+E KE + + E+ A+ R
Sbjct: 591 KKIRDILDGEYEEDEYKAKIEKKASKMSKKELEKLIKKLEKEMKEAAKNLEFEEAARLRD 650
Query: 260 KHKKDKKKRKKKS 272
+ K+ K++ KS
Sbjct: 651 EIKELKEELLGKS 663
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 28.2 bits (63), Expect = 4.7
Identities = 21/105 (20%), Positives = 48/105 (45%), Gaps = 5/105 (4%)
Query: 173 NELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
NE + + +L +++ + +E E E E + KE +K+ ++ + +KK
Sbjct: 85 NEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKK 144
Query: 233 KKSKYEEE-----EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+++ EE EK+ E + E+E + + H+ + K++
Sbjct: 145 RENNKNEERLKFENEKKLEESLELEREKFEEQLHEANLDLEFKEN 189
>gnl|CDD|236559 PRK09548, PRK09548, PTS system ascorbate-specific transporter
subunits IICB; Provisional.
Length = 602
Score = 28.6 bits (64), Expect = 4.8
Identities = 11/48 (22%), Positives = 21/48 (43%), Gaps = 2/48 (4%)
Query: 178 KSKLKAKEKRKA--SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKK 223
+L+AKE A T D + +EE ++ S+++ K +
Sbjct: 461 SKQLRAKEAAAAAGGKTVDQLDGYNLDEEVINAVQKLISKQDGGKPVR 508
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 28.2 bits (63), Expect = 4.9
Identities = 14/69 (20%), Positives = 29/69 (42%), Gaps = 1/69 (1%)
Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
+ D+EEEE E+ + S+ E + ++ S+ + KE+ E+ +
Sbjct: 134 DPESEDEEEEEDEDDDDDGSDDEDEDEDGVGA-TEEVAASSESGVDRVKEDDEEDEDADL 192
Query: 255 AKKRKKHKK 263
+KK +
Sbjct: 193 SKKDVLEEP 201
>gnl|CDD|220535 pfam10037, MRP-S27, Mitochondrial 28S ribosomal protein S27.
Members of this family of small ribosomal proteins
possess one of three conserved blocks of sequence found
in proteins that stimulate the dissociation of guanine
nucleotides from G-proteins, leaving open the
possibility that MRP-S27 might be a functional partner
of GTP-binding ribosomal proteins.
Length = 417
Score = 28.2 bits (63), Expect = 4.9
Identities = 16/71 (22%), Positives = 34/71 (47%), Gaps = 11/71 (15%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK----KHKKDKKKKK 232
+SKL+A K ++ + EE E+ D E ++++ +++K ++KK
Sbjct: 346 WRSKLQALAKVESRSLLSV-------EERLPTIEKEDLELYEQRQQLWFFENRKLWQRKK 398
Query: 233 KKSKYEEEEEK 243
K + +EE +
Sbjct: 399 KLREQADEEYQ 409
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 28.2 bits (63), Expect = 4.9
Identities = 21/81 (25%), Positives = 40/81 (49%), Gaps = 2/81 (2%)
Query: 194 ASSDESDKE-EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ-TDS 251
+KE E++ E E+ E+E K++ ++ ++ K+++ + E EKE E
Sbjct: 518 EKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKAL 577
Query: 252 EKEAKKRKKHKKDKKKRKKKS 272
+KE + + K+KK K K
Sbjct: 578 KKEVESIIRELKEKKIHKAKE 598
Score = 27.9 bits (62), Expect = 7.0
Identities = 20/80 (25%), Positives = 38/80 (47%), Gaps = 13/80 (16%)
Query: 199 SDKEEEEKEEGEQTDS--------EKEAKKRKKHKKDKKKKKKKSKYEEEEE-----KEE 245
S E+E +++ E + +KE ++ + K++++ KK +E +E K+E
Sbjct: 521 SALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKE 580
Query: 246 GEQTDSEKEAKKRKKHKKDK 265
E E + KK K K+ K
Sbjct: 581 VESIIRELKEKKIHKAKEIK 600
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 28.3 bits (63), Expect = 5.0
Identities = 14/84 (16%), Positives = 25/84 (29%), Gaps = 1/84 (1%)
Query: 190 SPTSASSD-ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQ 248
S T S E +E+ + D + + + K+S EK E+
Sbjct: 113 SSTDDDSLLELPDRDEDADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEK 172
Query: 249 TDSEKEAKKRKKHKKDKKKRKKKS 272
+ K D ++ K
Sbjct: 173 KGQTENTFKFYNFGNDGEEAAAKD 196
>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210). This is
a family of proteins conserved in yeasts. The function
is not known. The Schizosaccharomyces pombe member is
SPBC18E5.07 and the Saccharomyces cerevisiae member is
AIM21.
Length = 671
Score = 28.4 bits (63), Expect = 5.0
Identities = 18/88 (20%), Positives = 32/88 (36%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
K A + A S ++E+ EK E +E K + + E+ EE E
Sbjct: 321 AKEPAGESPAVSPSFEREKSEKSRHESDPKSRENSKPASIYGSVPDLIRHTPLEDVEEYE 380
Query: 245 EGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
D + A K + ++ ++K
Sbjct: 381 PLFPEDESEIAVKPPTEESSRRPEEEKH 408
>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
Length = 1052
Score = 28.4 bits (63), Expect = 5.0
Identities = 19/90 (21%), Positives = 35/90 (38%), Gaps = 12/90 (13%)
Query: 183 AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
A+ RK S K + ++ ++K K+ KK KK +
Sbjct: 971 AEAVRKTVRRSVK-----KAAATRAAMKKKVAKKAPAKKAAAKKAAAKKAAA-------K 1018
Query: 243 KEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
K+ ++ ++K A+K K KK +K +
Sbjct: 1019 KKVAKKAPAKKVARKPAAKKAAKKPARKAA 1048
>gnl|CDD|236034 PRK07502, PRK07502, cyclohexadienyl dehydrogenase; Validated.
Length = 307
Score = 28.0 bits (63), Expect = 5.1
Identities = 10/23 (43%), Positives = 15/23 (65%), Gaps = 1/23 (4%)
Query: 124 ITTQPAPHL-DGVHVVFGHVVSG 145
+ APHL +GVH + GH ++G
Sbjct: 106 VIAAMAPHLPEGVHFIPGHPLAG 128
>gnl|CDD|227238 COG4901, COG4901, Ribosomal protein S25 [Translation, ribosomal
structure and biogenesis].
Length = 107
Score = 26.8 bits (59), Expect = 5.2
Identities = 17/51 (33%), Positives = 24/51 (47%)
Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
KE K K K KKK SK +++EE D E K RK+ +++
Sbjct: 8 SKEKKAEKAKAGTAKDKKKWSKKKKKEEARRAVTVDEELLDKIRKEVPRER 58
>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
peptidase. Corresponds to Merops family C37.
Norwalk-like viruses (NLVs), including the Southampton
virus, cause acute non-bacterial gastroenteritis in
humans. The NLV genome encodes three open reading frames
(ORFs). ORF1 encodes a polyprotein, which is processed
by the viral protease into six proteins.
Length = 535
Score = 28.3 bits (63), Expect = 5.4
Identities = 23/114 (20%), Positives = 38/114 (33%), Gaps = 39/114 (34%)
Query: 197 DESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKS--------------------- 235
D E E+++ ++A K K+KK + KK
Sbjct: 207 TYYDGCESEEDDEGWYIEPQDATPEGKKGKNKKGRGKKHNAFSRRGLSDEEYDEYKKIRE 266
Query: 236 ----------------KYEEE--EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKK 271
+YEEE E + E+EAK R++ +K RK++
Sbjct: 267 ERGGKYSIQEYLEDRERYEEELAERQATEADFCEEEEAKIRQRIFGLRKTRKQR 320
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 28.4 bits (63), Expect = 5.5
Identities = 15/61 (24%), Positives = 28/61 (45%), Gaps = 4/61 (6%)
Query: 200 DKEEEEKEEGEQTDSEKEA----KKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEA 255
+ EE+ E EQ +E+ A + D K + K + E+E+E+E + + E
Sbjct: 3910 EPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEI 3969
Query: 256 K 256
+
Sbjct: 3970 Q 3970
Score = 28.0 bits (62), Expect = 6.9
Identities = 12/71 (16%), Positives = 33/71 (46%), Gaps = 4/71 (5%)
Query: 196 SDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE----EEKEEGEQTDS 251
SD ++ +E+ E+G + + ++ + + K ++ ++ E++ K + + T +
Sbjct: 4053 SDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQAIDNHPKMDAKSTFA 4112
Query: 252 EKEAKKRKKHK 262
EA + K
Sbjct: 4113 SAEADEENTDK 4123
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 28.0 bits (62), Expect = 5.5
Identities = 19/84 (22%), Positives = 40/84 (47%), Gaps = 4/84 (4%)
Query: 179 SKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK---S 235
+L+ KE S+ E ++ + EKE + + K ++ + ++ +K+ +
Sbjct: 297 EELREKEGEAEEAASSLEAELNRTKSEKESKAREKMVEIPLKLQQASSEAEEARKEAEAA 356
Query: 236 KYEEEEEKEEGEQT-DSEKEAKKR 258
+ E + KEE EQT + + A+ R
Sbjct: 357 REELRKLKEEAEQTKAALETAELR 380
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 28.2 bits (64), Expect = 5.7
Identities = 11/59 (18%), Positives = 17/59 (28%), Gaps = 17/59 (28%)
Query: 214 SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
EK KD KK K+ ++K K+ +K K +
Sbjct: 818 PEKLRYLADAPAKDPAGKKAAVKF-----------------SRKTKQQYVASEKDGKAT 859
>gnl|CDD|221190 pfam11727, ISG65-75, Invariant surface glycoprotein. This family
is found in Trypanosome species, and appears to be one
of two invariant surface glycoproteins, ISG65 and ISG75.
that are found in the mammalian stage of the parasitic
protozoan. the sequence suggests the two families are
polypeptides with N-terminal signal sequences,
hydrophilic extracellular domains, single trans-membrane
alpha-helices and short cytoplasmic domains. they are
both expressed in the bloodstream form but not in the
midgut stage. Both polypeptides are distributed over the
entire surface of the parasite.
Length = 286
Score = 27.9 bits (62), Expect = 5.8
Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 1/76 (1%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKH-KKDKKKKKKKS 235
+K L+ E R S S S + KE EK + + ++ EAKK + +K ++ K+ +
Sbjct: 78 VKYYLEKLESRSDSKLSDSDKKKIKEICEKAKDKIDEALPEAKKAGREAEKLAEEVKEAA 137
Query: 236 KYEEEEEKEEGEQTDS 251
+EG +
Sbjct: 138 AKALGWALDEGTSNST 153
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 28.0 bits (63), Expect = 5.8
Identities = 15/67 (22%), Positives = 22/67 (32%), Gaps = 19/67 (28%)
Query: 206 KEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDK 265
E + S+ KKR K K+ +++K KE K
Sbjct: 1 AESNGEPLSKNALKKRLKAKQAEEEKA-------------------AKEEAKAAAAAAAA 41
Query: 266 KKRKKKS 272
K R +KS
Sbjct: 42 KGRSRKS 48
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 27.7 bits (62), Expect = 5.9
Identities = 19/81 (23%), Positives = 41/81 (50%), Gaps = 1/81 (1%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
E++ ++DE+ + EE EE + + + A++ K + KK + S E+ + +
Sbjct: 43 EEQVEQLREEANDEAKEIIEEAEE-DAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQI 101
Query: 245 EGEQTDSEKEAKKRKKHKKDK 265
E EQ + E+E ++ + K +
Sbjct: 102 EREQEEWEEELERLIEEAKAE 122
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 28.0 bits (63), Expect = 6.0
Identities = 15/71 (21%), Positives = 38/71 (53%)
Query: 201 KEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+ E E+E E+ + + ++R +++ +K +S ++EE E+ E+ S KE +K
Sbjct: 64 RAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKELSNKEKNLDEK 123
Query: 261 HKKDKKKRKKK 271
++ ++ ++
Sbjct: 124 EEELEELIAEQ 134
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 28.0 bits (63), Expect = 6.0
Identities = 14/43 (32%), Positives = 24/43 (55%)
Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
SDE + EE+E + E + E+E +K KK KK + + + +
Sbjct: 351 LSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELPF 393
Score = 27.7 bits (62), Expect = 9.3
Identities = 11/47 (23%), Positives = 27/47 (57%)
Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
++ D +EE++E ++ +++ ++ ++ +K+KKKKK E
Sbjct: 344 EEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 28.0 bits (62), Expect = 6.1
Identities = 8/77 (10%), Positives = 26/77 (33%), Gaps = 6/77 (7%)
Query: 184 KEKRKASPTSASSDESDKEEEEK------EEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
+ + A+ + + + + + QT+S+ E + +H + +++ +
Sbjct: 35 RRRGTAARAAKPAPPAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSG 94
Query: 238 EEEEEKEEGEQTDSEKE 254
E + S
Sbjct: 95 SGSESVGSPTPSPSGSA 111
>gnl|CDD|240400 PTZ00396, PTZ00396, Casein kinase II subunit beta; Provisional.
Length = 251
Score = 27.7 bits (62), Expect = 6.2
Identities = 10/42 (23%), Positives = 22/42 (52%), Gaps = 1/42 (2%)
Query: 231 KKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
KKK SK + + E G+ + KK K + +++++++
Sbjct: 208 KKKSSKEYKLQNGEFGKSAIEQ-FLKKSPKGNIELQEQEQQA 248
>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN). This family
consists of several eukaryotic survival motor neuron
(SMN) proteins. The Survival of Motor Neurons (SMN)
protein, the product of the spinal muscular
atrophy-determining gene, is part of a large
macromolecular complex (SMN complex) that functions in
the assembly of spliceosomal small nuclear
ribonucleoproteins (snRNPs). The SMN complex functions
as a specificity factor essential for the efficient
assembly of Sm proteins on U snRNAs and likely protects
cells from illicit, and potentially deleterious,
non-specific binding of Sm proteins to RNAs.
Length = 264
Score = 27.6 bits (61), Expect = 6.2
Identities = 13/45 (28%), Positives = 26/45 (57%)
Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKK 232
KA + +S ++ + E+ ++ D + KKRK +KK++ +KK
Sbjct: 16 KAYDKAVASFKNALKNEDCATPQENDEQNPGKKRKNNKKNRSRKK 60
>gnl|CDD|184885 PRK14891, PRK14891, 50S ribosomal protein L24e/unknown domain
fusion protein; Provisional.
Length = 131
Score = 26.8 bits (59), Expect = 6.3
Identities = 11/71 (15%), Positives = 35/71 (49%), Gaps = 1/71 (1%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
A+ + +++E++ + +++ E +++ + ++ + D+ + + + E E+
Sbjct: 62 PAAAATAAAEAAEEAEAADADEDADEAAEADAADEADEEEETDEAVDETADEADAEAEEA 121
Query: 245 EGEQTDSEKEA 255
+ E D E EA
Sbjct: 122 DEE-EDEEAEA 131
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 27.8 bits (62), Expect = 6.3
Identities = 20/94 (21%), Positives = 40/94 (42%), Gaps = 2/94 (2%)
Query: 179 SKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE 238
+LK + R+A S E+ E+KEE E+ E++ K + ++ + + +
Sbjct: 155 EELKELQSRRADVDVNSMLEALFRREKKEEEEE--EEEDEALIKSLSFGPETEEDRRRAD 212
Query: 239 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+E+ +++ E D+ K KKS
Sbjct: 213 DEDSEDDEEDNDNTPSPKSGSSSPAKPTSILKKS 246
>gnl|CDD|147051 pfam04698, MOBP_C-Myrip, Myelin-associated oligodendrocytic basic
protein (MOBP). MOBP is abundantly expressed in central
nervous system myelin, and shares several
characteristics with myelin basic protein (MBP), in
terms of regional distribution and function. This family
is the middle and C-terminal regions of MOBP which has
been shown to be essential for normal arrangement of the
radial component in central nervous system myelin. Most
member-proteins carry a FVHE-PHD type zinc-finger at
their N-terminus.
Length = 710
Score = 28.2 bits (62), Expect = 6.3
Identities = 16/74 (21%), Positives = 39/74 (52%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
E+D E ++ TDS +E + + ++ K +K++ E++E E + +++KE+
Sbjct: 413 ETDISNEAQDPQSLTDSTEEKLRNRLYELAMKMSEKETSSGEDQESEPKAEPENQKESLS 472
Query: 258 RKKHKKDKKKRKKK 271
+ + + ++ KK
Sbjct: 473 SEDNNQGVQEELKK 486
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 27.6 bits (61), Expect = 6.4
Identities = 16/54 (29%), Positives = 31/54 (57%)
Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
++ ++EKEE + E EA+ + ++ K+ + + S+ EE +K GE D +K
Sbjct: 145 EELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKK 198
>gnl|CDD|215349 PLN02647, PLN02647, acyl-CoA thioesterase.
Length = 437
Score = 27.8 bits (62), Expect = 6.4
Identities = 16/59 (27%), Positives = 29/59 (49%), Gaps = 9/59 (15%)
Query: 187 RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
K++P + S E+++E+ EE E R K +K K+ ++K+ E E+ E
Sbjct: 214 GKSAPVNRLSPETEEEKLLFEEAEA---------RNKLRKKKRGEQKREFENGEAERLE 263
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 27.8 bits (62), Expect = 6.5
Identities = 25/104 (24%), Positives = 39/104 (37%), Gaps = 15/104 (14%)
Query: 184 KEKRKASPTSASSDESDKEEEEKEEGEQT-DSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
+E K S A S ++ + EE ++ ++T DS E K ++ K K++KK + E
Sbjct: 95 RELEKESKKQAKSPKAMRTFEESKKSKKTVDSMIERKPKEPGLKRKQRKKAQESATSPES 154
Query: 243 K---------EEGEQTDSEKE-----AKKRKKHKKDKKKRKKKS 272
KE AKK K +KS
Sbjct: 155 SPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKLSSTASSGDEKS 198
>gnl|CDD|218652 pfam05602, CLPTM1, Cleft lip and palate transmembrane protein 1
(CLPTM1). This family consists of several eukaryotic
cleft lip and palate transmembrane protein 1 sequences.
Cleft lip with or without cleft palate is a common birth
defect that is genetically complex. The nonsyndromic
forms have been studied genetically using linkage and
candidate-gene association studies with only partial
success in defining the loci responsible for orofacial
clefting. CLPTM1 encodes a transmembrane protein and has
strong homology to two Caenorhabditis elegans genes,
suggesting that CLPTM1 may belong to a new gene family.
This family also contains the human cisplatin resistance
related protein CRR9p which is associated with
CDDP-induced apoptosis.
Length = 437
Score = 27.6 bits (62), Expect = 6.7
Identities = 11/33 (33%), Positives = 15/33 (45%), Gaps = 3/33 (9%)
Query: 223 KHKKDKKKKKKK---SKYEEEEEKEEGEQTDSE 252
+ KK KKK K E+EE +EE +
Sbjct: 146 TYLPKKKVKKKNLLGGKSEKEEPEEEKTPAPDK 178
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 27.7 bits (62), Expect = 6.8
Identities = 21/92 (22%), Positives = 36/92 (39%), Gaps = 27/92 (29%)
Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKK---------------------------RK 222
+P E ++EEEE+EE + D +E R
Sbjct: 280 APRGLYQLEEEEEEEEEEEPAERDELEENPDFEGLEVRELADSLDNWVHHVQHILPQGRC 339
Query: 223 KHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKE 254
++K +++ + +EEEE+EE E + E E
Sbjct: 340 TWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPE 371
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 27.8 bits (62), Expect = 6.8
Identities = 7/51 (13%), Positives = 21/51 (41%), Gaps = 1/51 (1%)
Query: 195 SSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
+ +++ E EQ ++ + ++ K + S+ +++K E
Sbjct: 178 AEIAAEQAELTTLLSEQRA-QQAKLAQLLEERKKTLAQLNSELSADQKKLE 227
Score = 27.8 bits (62), Expect = 7.1
Identities = 11/69 (15%), Positives = 31/69 (44%), Gaps = 10/69 (14%)
Query: 203 EEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHK 262
E+ E SE+ A++ K + +++KK ++ ++ + KK ++ +
Sbjct: 181 AAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLN----------SELSADQKKLEELR 230
Query: 263 KDKKKRKKK 271
++ + K +
Sbjct: 231 ANESRLKNE 239
>gnl|CDD|238673 cd01377, MYSc_type_II, Myosin motor domain, type II myosins. Myosin
II mediates cortical contraction in cell motility, and
is the motor in smooth and skeletal muscle. This
catalytic (head) domain has ATPase activity and belongs
to the larger group of P-loop NTPases. Myosins are
actin-dependent molecular motors that play important
roles in muscle contraction, cell motility, and
organelle transport. The head domain is a molecular
motor, which utilizes ATP hydrolysis to generate
directed movement toward the plus end along actin
filaments. A cyclical interaction between myosin and
actin provides the driving force. Rates of ATP
hydrolysis and consequently the speed of movement along
actin filaments vary widely, from about 0.04 micrometer
per second for myosin I to 4.5 micrometer per second for
myosin II in skeletal muscle. Myosin II moves in
discrete steps about 5-10 nm long and generates 1-5
piconewtons of force. Upon ATP binding, the myosin head
dissociates from an actin filament. ATP hydrolysis
causes the head to pivot and associate with a new actin
subunit. The release of Pi causes the head to pivot and
move the filament (power stroke). Release of ADP
completes the cycle.
Length = 693
Score = 27.9 bits (63), Expect = 6.8
Identities = 15/29 (51%), Positives = 16/29 (55%), Gaps = 2/29 (6%)
Query: 36 ENFRALCTGEKGIGKT--TGKNLHYLNIV 62
EN L TGE G GKT T K + YL V
Sbjct: 90 ENQSILITGESGAGKTENTKKVIQYLASV 118
>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
This family consists of several brain acid soluble
protein 1 (BASP1) or neuronal axonal membrane protein
NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
calmodulin-binding protein of unknown function.
Length = 233
Score = 27.5 bits (60), Expect = 6.9
Identities = 28/85 (32%), Positives = 45/85 (52%), Gaps = 10/85 (11%)
Query: 182 KAKEK-RKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHK---------KDKKKK 231
KAK+K +KA + + + KE EE + +T KEAK+ K K ++K+ +
Sbjct: 17 KAKDKDKKAEGAATEEEGTPKENEEAQAAAETTEVKEAKEEKPDKDAQDTANKTEEKEGE 76
Query: 232 KKKSKYEEEEEKEEGEQTDSEKEAK 256
K+ + +EE K E E+T+ EAK
Sbjct: 77 KEAAAAKEEAPKAEPEKTEGAAEAK 101
>gnl|CDD|221012 pfam11169, DUF2956, Protein of unknown function (DUF2956). This
family of proteins with unknown function appears to be
restricted to Gammaproteobacteria.
Length = 103
Score = 26.5 bits (59), Expect = 7.0
Identities = 12/44 (27%), Positives = 25/44 (56%), Gaps = 4/44 (9%)
Query: 217 EAKKRKKHKK---DK-KKKKKKSKYEEEEEKEEGEQTDSEKEAK 256
E KK++K K DK +K++ K+K + +E + +S +++
Sbjct: 38 EYKKQQKAKAREADKARKQQLKAKQRQAANDDEEDTIESVTQSQ 81
Score = 26.5 bits (59), Expect = 7.8
Identities = 9/40 (22%), Positives = 23/40 (57%)
Query: 218 AKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
AK R+ K K++ K K + +++E+ ++ ++ ++ K
Sbjct: 45 AKAREADKARKQQLKAKQRQAANDDEEDTIESVTQSQSNK 84
>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
domain [Chromatin structure and dynamics].
Length = 211
Score = 27.5 bits (61), Expect = 7.1
Identities = 15/75 (20%), Positives = 29/75 (38%), Gaps = 2/75 (2%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
+ +KEE K+ + + K + + EE + + SE + K
Sbjct: 132 QREKEEYNKKLPNKAPIGPFIENEPKIRPKVEGPSPDKALVEETKI--ISKAWSELDESK 189
Query: 258 RKKHKKDKKKRKKKS 272
+KK+ KK K++
Sbjct: 190 KKKYIDKYKKLKEEY 204
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 27.5 bits (61), Expect = 7.2
Identities = 19/98 (19%), Positives = 47/98 (47%), Gaps = 6/98 (6%)
Query: 178 KSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKY 237
K K++ +K + ++S +E+EE EE + + E+E ++R +K++++++ +
Sbjct: 119 KKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRK 178
Query: 238 EEEEEKEEGEQTDS------EKEAKKRKKHKKDKKKRK 269
++ +E E + + K K + +K K
Sbjct: 179 NKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKPK 216
Score = 27.5 bits (61), Expect = 8.0
Identities = 19/64 (29%), Positives = 35/64 (54%), Gaps = 4/64 (6%)
Query: 212 TDSEKEAKKRKKHKKDKK----KKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKK 267
D E KK + ++K+ K K K+KS E+EE +E E E+E ++ K+++++
Sbjct: 113 IDLENTKKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQ 172
Query: 268 RKKK 271
+ K
Sbjct: 173 QMNK 176
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 27.9 bits (62), Expect = 7.3
Identities = 11/39 (28%), Positives = 24/39 (61%)
Query: 215 EKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEK 253
E+E ++RK+ K K ++++K+K E +++E E +
Sbjct: 13 EREQRERKQRAKLKLERERKAKEEAAKQREAIEAAQRSR 51
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 27.2 bits (61), Expect = 7.4
Identities = 10/25 (40%), Positives = 17/25 (68%)
Query: 224 HKKDKKKKKKKSKYEEEEEKEEGEQ 248
+ +K+KKK SK E++ KEE ++
Sbjct: 95 FEAEKEKKKAMSKEEKKAIKEEKDK 119
>gnl|CDD|218274 pfam04801, Sin_N, Sin-like protein conserved region. Family of
higher eukaryotic proteins. SIN was identified as a
protein that interacts specifically with SXL (sex
lethal) in a yeast two-hybrid assay. The interaction is
mediated by one of the SXL RNA binding domains.
Length = 422
Score = 27.8 bits (62), Expect = 7.4
Identities = 25/90 (27%), Positives = 41/90 (45%), Gaps = 15/90 (16%)
Query: 179 SKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYE 238
L K+KRK +A DE EE E EE +Q K + + +K++K+++ Y
Sbjct: 137 QYLDKKDKRKKEEDTADEDEDPDEEAE-EELKQVTV-----KFSRPETEKQRKRREQSYN 190
Query: 239 EEEEKEEGEQ---------TDSEKEAKKRK 259
++K E DSE E +++K
Sbjct: 191 FLQKKIAEEPWIELKYHGKKDSESELERQK 220
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim
have been identified in both mice and humans (mTim and
hTIM, respectively). Mammalian TIM is not the true
orthologue of Drosophila TIM, but is the likely
orthologue of a fly gene, timeout (also called tim-2).
mTim has been shown to be essential for embryonic
development, but does not have substantiated circadian
function. Some family members contain a SANT domain in
this region.
Length = 507
Score = 27.8 bits (61), Expect = 7.7
Identities = 17/89 (19%), Positives = 37/89 (41%), Gaps = 4/89 (4%)
Query: 170 VKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKK 229
K + L+ + + + P D+ D E+ ++ E E K A++RK
Sbjct: 364 AKLSSTQLRRRAASLSGEEEEPEDELKDDVDGEQADESEHETLALRKNARQRKA----GL 419
Query: 230 KKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
++ EEE+K ++ ++ K++
Sbjct: 420 ASPEEEALGEEEQKAPPKKKQLNQKNKQQ 448
Score = 27.4 bits (60), Expect = 9.4
Identities = 19/80 (23%), Positives = 39/80 (48%), Gaps = 1/80 (1%)
Query: 183 AKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEE 242
+ +R+A+ S +E + E ++ +GEQ D E + K +++K + EEE
Sbjct: 369 TQLRRRAASLSGEEEEPEDELKDDVDGEQADES-EHETLALRKNARQRKAGLASPEEEAL 427
Query: 243 KEEGEQTDSEKEAKKRKKHK 262
EE ++ +K+ +K +
Sbjct: 428 GEEEQKAPPKKKQLNQKNKQ 447
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 27.6 bits (61), Expect = 7.8
Identities = 15/59 (25%), Positives = 26/59 (44%)
Query: 200 DKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
++E E K+ + S K+ +K + + EE E +EE E + E + KR
Sbjct: 343 NEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKR 401
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 27.7 bits (61), Expect = 8.0
Identities = 22/78 (28%), Positives = 32/78 (41%), Gaps = 6/78 (7%)
Query: 198 ESDKEEEEKEEGEQTD------SEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDS 251
E D+E EKE G+ D E+E K K + + KK S + + S
Sbjct: 149 EEDEEYYEKEPGKVVDEKSEEEEEEELKTMKDFIDLESQTKKPSVKDNGKSFWSAASVFS 208
Query: 252 EKEAKKRKKHKKDKKKRK 269
+K K R+K K K +
Sbjct: 209 KKLQKWRQKQKLKKPRSG 226
>gnl|CDD|227780 COG5493, COG5493, Uncharacterized conserved protein containing a
coiled-coil domain [Function unknown].
Length = 231
Score = 27.1 bits (60), Expect = 8.1
Identities = 13/43 (30%), Positives = 22/43 (51%)
Query: 216 KEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKR 258
KE ++R+K D+K + +K K +E+ K + E A K
Sbjct: 53 KETEQRQKELADEKLEVRKQKATKEDLKLLQRFQEEEFRATKE 95
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 27.4 bits (61), Expect = 8.1
Identities = 12/46 (26%), Positives = 25/46 (54%)
Query: 227 DKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+K+ +K K K + K + E+ E+ + R+K K+ K K +++
Sbjct: 347 EKRIEKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERRG 392
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is
found in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 27.4 bits (61), Expect = 8.2
Identities = 13/63 (20%), Positives = 33/63 (52%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
+K K ++++ S +E +K+ E D E ++ + + + K+ K +E++++E
Sbjct: 50 QKAKLKGQASTAAGSASQELQKQASESNDEEVVGEEEPEKDQAEAGKQLSQKQQEQQQQE 109
Query: 245 EGE 247
E +
Sbjct: 110 EKQ 112
>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
Length = 763
Score = 27.5 bits (61), Expect = 8.4
Identities = 17/33 (51%), Positives = 19/33 (57%), Gaps = 2/33 (6%)
Query: 202 EEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKK 234
E E E Q ++E EA K K KDKK KKKK
Sbjct: 228 EGGELIEKPQAEAENEAGKSDK--KDKKSKKKK 258
>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein. PACS-1 is a
cytosolic sorting protein that directs the localisation
of membrane proteins in the trans-Golgi network
(TGN)/endosomal system. PACS-1 connects the clathrin
adaptor AP-1 to acidic cluster sorting motifs contained
in the cytoplasmic domain of cargo proteins such as
furin, the cation-independent mannose-6-phosphate
receptor and in viral proteins such as human
immunodeficiency virus type 1 Nef.
Length = 413
Score = 27.5 bits (61), Expect = 8.4
Identities = 9/28 (32%), Positives = 11/28 (39%)
Query: 218 AKKRKKHKKDKKKKKKKSKYEEEEEKEE 245
K K K KK K +E E K +
Sbjct: 330 VTKEKNKKVPVMFLGKKPKEKEVESKSQ 357
>gnl|CDD|215343 PLN02638, PLN02638, cellulose synthase A (UDP-forming), catalytic
subunit.
Length = 1079
Score = 27.6 bits (61), Expect = 8.6
Identities = 17/47 (36%), Positives = 20/47 (42%), Gaps = 7/47 (14%)
Query: 214 SEKEAKKRKKHKKDKKKKKKKS-------KYEEEEEKEEGEQTDSEK 253
S K++ K K DKKK K E+ EE EG D EK
Sbjct: 662 SRKKSSKSSKKGSDKKKSGKHVDPTVPVFNLEDIEEGVEGAGFDDEK 708
>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
unknown].
Length = 984
Score = 27.5 bits (61), Expect = 8.6
Identities = 15/74 (20%), Positives = 28/74 (37%), Gaps = 2/74 (2%)
Query: 198 ESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKK 257
+ E EE+ D E+E R + ++K S+ ++EE K+
Sbjct: 397 LFEDEAEEEARQRLADDEEE--VRAGDEAREEKIAANSQVIDKEEVCNLYDRRDTAWQKQ 454
Query: 258 RKKHKKDKKKRKKK 271
R +K ++K
Sbjct: 455 RFLREKQTAFERQK 468
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 27.5 bits (61), Expect = 8.6
Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 2/76 (2%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
+ S + E+ P SA D D E+E+ + ++ D + E ++ + S+
Sbjct: 202 MLSSMDMAEELGDEPESA--DSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSE 259
Query: 237 YEEEEEKEEGEQTDSE 252
E E E E + +
Sbjct: 260 EMESGEMEAAEASADD 275
>gnl|CDD|205150 pfam12925, APP_E2, E2 domain of amyloid precursor protein. The E2
domain is the largest of the conserved domains of the
amyloid precursor protein. The structure of E2 consists
of two coiled-coil sub-structures connected through a
continuous helix, and bears an unexpected resemblance to
the spectrin family of protein structures.E 2 can
reversibly dimerise in solution, and the dimerisation
occurs along the longest dimension of the molecule in an
antiparallel orientation, which enables the N-terminal
substructure of one monomer to pack against the
C-terminal substructure of a second monomer. The high
degree of conservation of residues at the putative dimer
interface suggests that the E2 dimer observed in the
crystal could be physiologically relevant. Heparin
sulfate proteoglycans, the putative ligands for the
precursor present in extracellular matrix, bind to E2 at
a conserved and positively charged site near the dimer
interface.
Length = 193
Score = 27.0 bits (60), Expect = 8.7
Identities = 19/71 (26%), Positives = 33/71 (46%)
Query: 190 SPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQT 249
S +SD D E +E + + KEAKKR + K ++ + ++EE E + +
Sbjct: 3 PTPSPTSDAVDPYFSEPKEDNEHERYKEAKKRLEEKHRERMTQVMKEWEEAESQYKNLPK 62
Query: 250 DSEKEAKKRKK 260
K A+ +K
Sbjct: 63 ADPKAAQLMRK 73
>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
region. Members of this family have very varied
localisations within the eukaryotic cell. pinin is known
to localise at the desmosomes and is implicated in
anchoring intermediate filaments to the desmosomal
plaque. SDK2/3 is a dynamically localised nuclear
protein thought to be involved in modulation of
alternative pre-mRNA splicing. memA is a tumour marker
preferentially expressed in human melanoma cell lines. A
common feature of the members of this family is that
they may all participate in regulating protein-protein
interactions.
Length = 131
Score = 26.7 bits (59), Expect = 8.8
Identities = 17/85 (20%), Positives = 39/85 (45%)
Query: 185 EKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKE 244
+K + +S E + E E++ EQ E+E +++K + +++++K+ + + E+K
Sbjct: 21 QKFSQEESRLTSKEKRRAEIEQKLEEQEKQEREELRKEKRELFEERRRKQLELRKLEQKM 80
Query: 245 EGEQTDSEKEAKKRKKHKKDKKKRK 269
E E+ + K K
Sbjct: 81 EDEKLQETWHEHNLALANFIRTKTK 105
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 27.2 bits (61), Expect = 8.9
Identities = 20/68 (29%), Positives = 31/68 (45%), Gaps = 10/68 (14%)
Query: 177 LKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSK 236
LK KL +K EEE+EE E+E ++ ++ KK+K K+K
Sbjct: 5 LKEKLS----------KFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAK 54
Query: 237 YEEEEEKE 244
E +EK+
Sbjct: 55 ITEIKEKD 62
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 27.4 bits (61), Expect = 8.9
Identities = 11/46 (23%), Positives = 25/46 (54%)
Query: 225 KKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRKK 270
D K+ + + ++ E ++ E +D E++ K+K KK ++K +
Sbjct: 151 DSDNKEAPETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRKPES 196
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 27.5 bits (61), Expect = 9.2
Identities = 18/102 (17%), Positives = 37/102 (36%), Gaps = 11/102 (10%)
Query: 182 KAKEKRKASPTSASSDESDKEEEEKEEG--EQTDSEKEAKKRKKHK---KDKKKKKKKSK 236
KA + K + + D+ ++ T + + K + K K+K SK
Sbjct: 3 KASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLSK 62
Query: 237 YEEE------EEKEEGEQTDSEKEAKKRKKHKKDKKKRKKKS 272
+ EE+E+G ++ S+ +K +K +
Sbjct: 63 LYSDVDSSSSEEEEDGSESISKLNVNSKKISLNQVSTQKWRK 104
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 27.4 bits (60), Expect = 9.3
Identities = 26/89 (29%), Positives = 44/89 (49%), Gaps = 10/89 (11%)
Query: 194 ASSDESDKEEEEKEEGEQTDSEKEAKKR---KKHKKDKKKKKKKSKYEEEEEKEEGEQTD 250
E+ KE + K+ + +D + + KK KK+KK K ++ + +EE E + + D
Sbjct: 442 RERQENRKERQLKKTKDDSDVDLKDKKESINKKNKKGKHAIERTAASKEELELIKADDED 501
Query: 251 SE-------KEAKKRKKHKKDKKKRKKKS 272
E K K +K KK++K +KK S
Sbjct: 502 DEQLDHFDMKSILKAEKFKKNRKLKKKAS 530
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 26.4 bits (58), Expect = 9.3
Identities = 13/23 (56%), Positives = 15/23 (65%)
Query: 214 SEKEAKKRKKHKKDKKKKKKKSK 236
+EK KK+KK KK KK KK K
Sbjct: 97 TEKPKKKKKKSKKTKKPKKSSKK 119
>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
Length = 290
Score = 27.4 bits (61), Expect = 9.3
Identities = 12/60 (20%), Positives = 20/60 (33%), Gaps = 8/60 (13%)
Query: 210 EQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKKHKKDKKKRK 269
E + SE + K + K K K+ E + A K+ +K+K
Sbjct: 237 ENSSSEAKPKAKAKPKTAGIKRPVVKM--------EKTAEKGGRPASNGKRFTSPGRKKK 288
Score = 27.0 bits (60), Expect = 9.8
Identities = 20/71 (28%), Positives = 29/71 (40%), Gaps = 4/71 (5%)
Query: 162 RPLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKR 221
R L D ++ +L+ S +AK K KA P +A K K E + A
Sbjct: 222 RDLTDDELIDLFKLIENSSSEAKPKAKAKPKTAGI----KRPVVKMEKTAEKGGRPASNG 277
Query: 222 KKHKKDKKKKK 232
K+ +KKK
Sbjct: 278 KRFTSPGRKKK 288
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 27.1 bits (60), Expect = 9.4
Identities = 17/84 (20%), Positives = 29/84 (34%), Gaps = 3/84 (3%)
Query: 188 KASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEEEEKEEGE 247
A+ + E + E E E + + + K+K K E++ +K + +
Sbjct: 44 LAAKVLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKP---EKKPKKPKPK 100
Query: 248 QTDSEKEAKKRKKHKKDKKKRKKK 271
K K K K KK K
Sbjct: 101 PKPKPKPKPKVKPQPKPKKPPSKT 124
>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
Length = 250
Score = 27.1 bits (60), Expect = 9.6
Identities = 13/88 (14%), Positives = 38/88 (43%), Gaps = 10/88 (11%)
Query: 181 LKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRKKHKKDKKKKKKKSKYEEE 240
++ +KR+ + D +++E +E E+ ++ ++ + ++ + + +E+
Sbjct: 31 IQVMKKRQQRIANRWQDAEQRQQEAGQEAER---YRQKQQSLEQQRASFMAQAQEAADEQ 87
Query: 241 ------EEKEEGEQTDSE-KEAKKRKKH 261
E +E+ E E +R+K
Sbjct: 88 RQHLLNEAREDVATARDEWLEQLEREKQ 115
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 27.3 bits (60), Expect = 9.8
Identities = 23/98 (23%), Positives = 42/98 (42%), Gaps = 5/98 (5%)
Query: 163 PLQDVRVVKCNELVLKSKLKAKEKRKASPTSASSDESDKEEEEKEEGEQTDSEKEAKKRK 222
P + V + V KS + + + K PT S + + E E E +
Sbjct: 49 PFKTVGALSKATKVWKSAVSSSDDSKTVPTPVS-----EPNITRSFQEPVSQESEVQDNT 103
Query: 223 KHKKDKKKKKKKSKYEEEEEKEEGEQTDSEKEAKKRKK 260
+ +D K K S+ ++++ +EE ++ S K+ K KK
Sbjct: 104 EQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGSKK 141
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.310 0.131 0.365
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,494,132
Number of extensions: 1314821
Number of successful extensions: 13824
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8446
Number of HSP's successfully gapped: 1769
Length of query: 272
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 177
Effective length of database: 6,723,972
Effective search space: 1190143044
Effective search space used: 1190143044
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 58 (26.2 bits)