RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16796
(166 letters)
>gnl|CDD|200956 pfam00046, Homeobox, Homeobox domain.
Length = 57
Score = 56.7 bits (138), Expect = 1e-11
Identities = 24/43 (55%), Positives = 27/43 (62%)
Query: 79 RRLRTAYTNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQ 121
RR RT +T QL ELEKEF N+Y R E+A L LTERQ
Sbjct: 1 RRKRTTFTPEQLEELEKEFEKNRYPSAEEREELAKKLGLTERQ 43
>gnl|CDD|197696 smart00389, HOX, Homeodomain. DNA-binding factors that are
involved in the transcriptional regulation of key
developmental processes.
Length = 57
Score = 52.6 bits (127), Expect = 3e-10
Identities = 24/44 (54%), Positives = 28/44 (63%)
Query: 78 PRRLRTAYTNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQ 121
RR RT++T QL ELEKEF N Y R R E+A L L+ERQ
Sbjct: 1 KRRKRTSFTPEQLEELEKEFQKNPYPSREEREELAKKLGLSERQ 44
>gnl|CDD|238039 cd00086, homeodomain, Homeodomain; DNA binding domains involved in
the transcriptional regulation of key eukaryotic
developmental processes; may bind to DNA as monomers or
as homo- and/or heterodimers, in a sequence-specific
manner.
Length = 59
Score = 49.5 bits (119), Expect = 6e-09
Identities = 25/43 (58%), Positives = 27/43 (62%)
Query: 79 RRLRTAYTNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQ 121
RR RT +T QL ELEKEF N Y R R E+A L LTERQ
Sbjct: 1 RRKRTRFTPEQLEELEKEFEKNPYPSREEREELAKELGLTERQ 43
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 46.5 bits (110), Expect = 2e-06
Identities = 24/29 (82%), Positives = 25/29 (86%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEE 57
EEEEEEEEEEEEE+EEEEEEEEE EE
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 45.0 bits (106), Expect = 6e-06
Identities = 23/31 (74%), Positives = 26/31 (83%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMN 59
EEEEEEEEEEEEE+EEEEEEEEE E ++
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEENEEPLS 895
Score = 43.4 bits (102), Expect = 2e-05
Identities = 23/35 (65%), Positives = 27/35 (77%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKE 63
EEEEEEEEEEEEE+EEEEEEE E + E E ++
Sbjct: 869 EEEEEEEEEEEEEEEEEEEEENEEPLSLEWPETRQ 903
Score = 43.1 bits (101), Expect = 3e-05
Identities = 21/29 (72%), Positives = 23/29 (79%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEE 57
+ EEEEEEEEEEE+EEEEEEEEE E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEEN 890
Score = 42.3 bits (99), Expect = 4e-05
Identities = 22/31 (70%), Positives = 24/31 (77%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
+ EEEEEEEEEE+EEEEEEEEE EE E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 38.8 bits (90), Expect = 7e-04
Identities = 20/31 (64%), Positives = 23/31 (74%)
Query: 33 EEEEEEEEEKEEEEEEEEENTRTEEMNERKE 63
+ EEEEEEE+EEEEEEEEE EE E +E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 36.9 bits (85), Expect = 0.003
Identities = 34/126 (26%), Positives = 55/126 (43%), Gaps = 3/126 (2%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAYTNTQ 89
E E E +E+E+E E + E+ E E + E+ + + EK + A
Sbjct: 769 ETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEV 828
Query: 90 LLELEKEFHFNKYLCRPRRIEIAASLDLTERQSDCKYYVAKEEEEGEDEEEEEEGEEEAK 149
E ++ + ++ E SD +EEEE E+EEEEEE EEE +
Sbjct: 829 KDETGEQELNAENQGEAKQDEKGVD---GGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEE 885
Query: 150 EQEQKQ 155
E+E+ +
Sbjct: 886 EEEENE 891
Score = 36.5 bits (84), Expect = 0.004
Identities = 34/137 (24%), Positives = 56/137 (40%), Gaps = 10/137 (7%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAYTNT 88
E E E + +E E E E E E + E + E+ K +E + + T
Sbjct: 753 HEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEK 812
Query: 89 QLLELEKEFHFNKYLCRPRRIEIAASLDLTERQSDCKYYV----------AKEEEEGEDE 138
E + E + + E + + + V ++EEEE E+E
Sbjct: 813 DEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEE 872
Query: 139 EEEEEGEEEAKEQEQKQ 155
EEEEE EEE +E+E+++
Sbjct: 873 EEEEEEEEEEEEEEEEE 889
Score = 35.7 bits (82), Expect = 0.007
Identities = 18/36 (50%), Positives = 22/36 (61%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKEN 64
+ + EEEE+EEEEEEEEE EE E +EN
Sbjct: 855 GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEEN 890
Score = 33.4 bits (76), Expect = 0.046
Identities = 17/43 (39%), Positives = 25/43 (58%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKK 68
K ++ + + EE+EEEEEEEEE EE E +E N++
Sbjct: 850 KGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 33.4 bits (76), Expect = 0.050
Identities = 32/129 (24%), Positives = 44/129 (34%), Gaps = 3/129 (2%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEE---EKKNGLPRRL 81
N E E+E E E K E E E E + E + I K+ + E +
Sbjct: 659 NGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHE 718
Query: 82 RTAYTNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQSDCKYYVAKEEEEGEDEEEE 141
E E E A E + D K + E E E +E+E
Sbjct: 719 GETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDE 778
Query: 142 EEGEEEAKE 150
+EGE +A E
Sbjct: 779 DEGEIQAGE 787
Score = 32.3 bits (73), Expect = 0.12
Identities = 18/29 (62%), Positives = 20/29 (68%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEE 57
EEEEEEEEEEEEE EE E TR ++
Sbjct: 877 EEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 28.8 bits (64), Expect = 1.4
Identities = 35/139 (25%), Positives = 47/139 (33%), Gaps = 18/139 (12%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKEN-----INKKKEEEKKNGLPRRLRTA 84
E E EE E E E E E E+E T E E E+ K E E +
Sbjct: 709 ETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEG 768
Query: 85 YTNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQSDCKYYVAKEEEEGEDEEEEEEG 144
T + E E E EI A D E + D E E + E++E
Sbjct: 769 ETEAEGKEDEDEG------------EIQAGED-GEMKGDEGAEGKVEHEGETEAGEKDEH 815
Query: 145 EEEAKEQEQKQVGVIEKEK 163
E +++ Q E +
Sbjct: 816 EGQSETQADDTEVKDETGE 834
Score = 28.0 bits (62), Expect = 2.7
Identities = 17/56 (30%), Positives = 30/56 (53%), Gaps = 3/56 (5%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRL 81
+ ++E+ + + + EEEEEE EE E +E +++EEE++N P L
Sbjct: 844 EAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEE---EEEEEEEENEEPLSL 896
Score = 27.3 bits (60), Expect = 4.9
Identities = 31/138 (22%), Positives = 50/138 (36%), Gaps = 7/138 (5%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAYTNT 88
E E E EE + E E E + E++ K E E + +P +
Sbjct: 637 EAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGE 696
Query: 89 QLLELEKEFHFNKYLCRPRRIEIAASLDLTERQSDCKYYVAKEEEEGEDEEEEEEGEEEA 148
+E ++ H + E + TE + + E E EE E+EGE EA
Sbjct: 697 GEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEI-------ETGEEGEEVEDEGEGEA 749
Query: 149 KEQEQKQVGVIEKEKNKE 166
+ + + + KE E
Sbjct: 750 EGKHEVETEGDRKETEHE 767
Score = 26.5 bits (58), Expect = 8.5
Identities = 31/129 (24%), Positives = 51/129 (39%), Gaps = 6/129 (4%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEEN-TRTEEMNERKENINKKKEEEKKNGLPRRLRTAYT 86
+ E E E EE E E E EN + E++ K E E + +P +
Sbjct: 635 VAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQE 694
Query: 87 NTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQSDCKYYVAKEEEEGEDEEEEEEGEE 146
+E ++ H + E + TE + + + EEGE+ E+E EGE
Sbjct: 695 GEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGE-----IETGEEGEEVEDEGEGEA 749
Query: 147 EAKEQEQKQ 155
E K + + +
Sbjct: 750 EGKHEVETE 758
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 37.7 bits (88), Expect = 0.001
Identities = 22/29 (75%), Positives = 23/29 (79%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEE 57
EEEEEEEEE E+EEEEEEEEE T EE
Sbjct: 305 PEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 36.9 bits (86), Expect = 0.003
Identities = 21/29 (72%), Positives = 22/29 (75%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEE 57
E EEEEEEEEE +EEEEEEEEE EE
Sbjct: 304 EPEEEEEEEEEVPEEEEEEEEEEERTFEE 332
Score = 36.5 bits (85), Expect = 0.004
Identities = 21/27 (77%), Positives = 21/27 (77%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRT 55
E EEEEEEEEE EEEEEEEEE RT
Sbjct: 303 PEPEEEEEEEEEVPEEEEEEEEEEERT 329
Score = 36.2 bits (84), Expect = 0.004
Identities = 17/32 (53%), Positives = 18/32 (56%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
E EEEEEEEE+ EEEEEEE E
Sbjct: 301 SPPEPEEEEEEEEEVPEEEEEEEEEEERTFEE 332
Score = 33.5 bits (77), Expect = 0.040
Identities = 18/24 (75%), Positives = 19/24 (79%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEE 51
EEEEEEEE EEE+EEEEEEE
Sbjct: 306 EEEEEEEEEVPEEEEEEEEEEERT 329
Score = 31.9 bits (73), Expect = 0.14
Identities = 15/23 (65%), Positives = 16/23 (69%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEE 51
E EEEEEEE+E EEEEEE
Sbjct: 300 PSPPEPEEEEEEEEEVPEEEEEE 322
Score = 31.2 bits (71), Expect = 0.21
Identities = 18/27 (66%), Positives = 19/27 (70%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRT 55
EEEE EEEEEEE+EEE EEE T
Sbjct: 311 EEEEVPEEEEEEEEEEERTFEEEVRAT 337
Score = 30.8 bits (70), Expect = 0.36
Identities = 18/27 (66%), Positives = 19/27 (70%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRT 55
EEEEE EEEEEE+EEEE EE R
Sbjct: 310 EEEEEVPEEEEEEEEEEERTFEEEVRA 336
Score = 26.9 bits (60), Expect = 6.9
Identities = 13/22 (59%), Positives = 16/22 (72%)
Query: 131 EEEEGEDEEEEEEGEEEAKEQE 152
EEE E+EEE E EEE +E+E
Sbjct: 305 PEEEEEEEEEVPEEEEEEEEEE 326
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 37.4 bits (87), Expect = 0.002
Identities = 24/54 (44%), Positives = 39/54 (72%)
Query: 20 IQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
I L+ K +E+E +EEEEEE++E++EEEE+ + EE++E +E KKK+ +K
Sbjct: 23 IYLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKK 76
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 35.9 bits (83), Expect = 0.003
Identities = 16/58 (27%), Positives = 28/58 (48%)
Query: 18 IMIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKN 75
I +L + EEE EEE +E E+E+ E + E+ ++ K K+K + +
Sbjct: 87 IFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPK 144
Score = 34.7 bits (80), Expect = 0.008
Identities = 17/60 (28%), Positives = 29/60 (48%)
Query: 14 IPSSIMIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
+P+ I + + +EEEE EEE +E E+E+ E + E+ +KE+ K
Sbjct: 81 LPAMWKIFRELSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPK 140
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 36.5 bits (85), Expect = 0.004
Identities = 12/58 (20%), Positives = 33/58 (56%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAYT 86
++EE +EE+EE K E++E+++++ +++ R+ ++ +++ + L Y
Sbjct: 129 DDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEKRAELEFYYY 186
Score = 31.1 bits (71), Expect = 0.25
Identities = 12/60 (20%), Positives = 29/60 (48%), Gaps = 3/60 (5%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAYTNT 88
EE+++++EE +EE EE + E++ ++ ++ ++ E ++ RR
Sbjct: 124 EEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRR---RRREWEEKRAE 180
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 35.8 bits (82), Expect = 0.009
Identities = 13/39 (33%), Positives = 29/39 (74%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKE 63
N I++++E+E+E+++++E++EEEEEE + ++ E
Sbjct: 150 NFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDE 188
Score = 31.9 bits (72), Expect = 0.13
Identities = 18/49 (36%), Positives = 31/49 (63%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGL 77
E+E++++EE++EE+EEEEEE + +E +E E+ +K E K
Sbjct: 160 EDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVDKTDC 208
Score = 31.2 bits (70), Expect = 0.29
Identities = 12/47 (25%), Positives = 30/47 (63%)
Query: 119 ERQSDCKYYVAKEEEEGEDEEEEEEGEEEAKEQEQKQVGVIEKEKNK 165
E S +V +++E EDE++++E ++E +E+E++++ + E +
Sbjct: 143 EDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEE 189
Score = 28.1 bits (62), Expect = 3.0
Identities = 14/64 (21%), Positives = 34/64 (53%), Gaps = 4/64 (6%)
Query: 17 SIMIQLFCNKTIEEEEEEEEE----EEEEKEEEEEEEEENTRTEEMNERKENINKKKEEE 72
I+I+ + + EE+ + ++++++E+E++++E EE E +E E+E
Sbjct: 129 MIIIKRRRARHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDE 188
Query: 73 KKNG 76
+ G
Sbjct: 189 EDEG 192
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 34.9 bits (81), Expect = 0.013
Identities = 16/47 (34%), Positives = 30/47 (63%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKN 75
+ EEE++E++++ +++EEEE E+ E KE ++ EEEK+
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEE 468
Score = 34.9 bits (81), Expect = 0.015
Identities = 15/49 (30%), Positives = 31/49 (63%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
K +E+ E++ EEE++EK+++ ++ EE + K+ K++EEE+
Sbjct: 414 KIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEA 462
Score = 34.1 bits (79), Expect = 0.025
Identities = 12/46 (26%), Positives = 30/46 (65%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
++ EEE++E++++ +++EEE +E E ++ +++ EE+K
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEK 466
Score = 33.4 bits (77), Expect = 0.038
Identities = 20/44 (45%), Positives = 28/44 (63%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKK 69
K +++EEEEEEEKE++EEE+EE E + +E KKK
Sbjct: 432 KKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 33.4 bits (77), Expect = 0.041
Identities = 16/50 (32%), Positives = 32/50 (64%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKN 75
+ E++++ +++E+EEEEE+E++ EE E E +++EE+KK
Sbjct: 426 EKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 33.0 bits (76), Expect = 0.049
Identities = 16/49 (32%), Positives = 32/49 (65%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
+ +E++++ ++++EEEEEE+E+ +E E + K++EEEKK
Sbjct: 425 EEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKK 473
Score = 33.0 bits (76), Expect = 0.059
Identities = 18/51 (35%), Positives = 33/51 (64%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNG 76
+ ++E++++ ++KEEEEEEE+E E+ E +E +K+EEE+K
Sbjct: 424 EEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKK 474
Score = 32.6 bits (75), Expect = 0.087
Identities = 15/52 (28%), Positives = 33/52 (63%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGL 77
K +++ +++EEE+EEE+E++EE EE +E ++++++K+ L
Sbjct: 428 KEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQATL 479
Score = 31.0 bits (71), Expect = 0.25
Identities = 11/59 (18%), Positives = 32/59 (54%)
Query: 16 SSIMIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
+ I+ ++ ++ E+ EK+ EEE++E+ + +++E ++KE++++
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEE 453
Score = 26.4 bits (59), Expect = 9.2
Identities = 16/29 (55%), Positives = 23/29 (79%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENT 53
+ +E++EEE+EEEEE+ EEE+EEEE
Sbjct: 444 EEEEKEKKEEEKEEEEEEAEEEKEEEEEK 472
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 34.6 bits (80), Expect = 0.018
Identities = 24/88 (27%), Positives = 32/88 (36%), Gaps = 5/88 (5%)
Query: 14 IPSSIMIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
IPS F K ++E E EE EEE E E E +E E EEK
Sbjct: 217 IPSP-----FFTKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEK 271
Query: 74 KNGLPRRLRTAYTNTQLLELEKEFHFNK 101
++ + + E+ F F K
Sbjct: 272 EDPDKTEDLDKLEILKEKKDEELFWFEK 299
Score = 31.1 bits (71), Expect = 0.23
Identities = 15/65 (23%), Positives = 28/65 (43%), Gaps = 1/65 (1%)
Query: 11 SYFIPSSIMIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEM-NERKENINKKK 69
S F + + EE + E E E K ++E+E + +E KE+ +K +
Sbjct: 219 SPFFTKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTE 278
Query: 70 EEEKK 74
+ +K
Sbjct: 279 DLDKL 283
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 34.5 bits (79), Expect = 0.018
Identities = 11/56 (19%), Positives = 25/56 (44%), Gaps = 14/56 (25%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEE--------------ENTRTEEMNERKENINKKK 69
++ ++ E+E+E E E +EE ++ + + ++ E +E K K
Sbjct: 547 LDADDWTEDEDENEMETDEERKKPGCCRKAYNWFCGFDQGKAPKLTEEEEAALKMK 602
Score = 32.5 bits (74), Expect = 0.081
Identities = 7/29 (24%), Positives = 15/29 (51%)
Query: 34 EEEEEEEEKEEEEEEEEENTRTEEMNERK 62
EE + + + E+E+E T+E ++
Sbjct: 542 EERIDLDADDWTEDEDENEMETDEERKKP 570
Score = 32.1 bits (73), Expect = 0.13
Identities = 11/39 (28%), Positives = 20/39 (51%), Gaps = 3/39 (7%)
Query: 129 AKEEE---EGEDEEEEEEGEEEAKEQEQKQVGVIEKEKN 164
+ EE + +D E+E+ E ++E+K+ G K N
Sbjct: 540 STEERIDLDADDWTEDEDENEMETDEERKKPGCCRKAYN 578
Score = 30.6 bits (69), Expect = 0.35
Identities = 8/24 (33%), Positives = 16/24 (66%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEE 51
I+ + ++ E+E+E E E +EE +
Sbjct: 545 IDLDADDWTEDEDENEMETDEERK 568
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 34.3 bits (79), Expect = 0.020
Identities = 15/50 (30%), Positives = 24/50 (48%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNG 76
I+ EE +E+E+EEE++ +E EEE E + + E G
Sbjct: 364 PIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVG 413
Score = 29.7 bits (67), Expect = 0.77
Identities = 13/45 (28%), Positives = 24/45 (53%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
+E+EEEE+ +E +EEE E+ EE + E+ + + +
Sbjct: 373 DEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSE 417
Score = 27.8 bits (62), Expect = 2.8
Identities = 15/45 (33%), Positives = 24/45 (53%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
+EEEE+ +E EE+E E+ EEE + E+ + + E K
Sbjct: 375 DEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESK 419
Score = 27.4 bits (61), Expect = 4.5
Identities = 13/47 (27%), Positives = 22/47 (46%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKN 75
EEE+ +E EEEE E+ EEE ++ + + + + K
Sbjct: 377 EEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKE 423
Score = 26.6 bits (59), Expect = 7.0
Identities = 15/48 (31%), Positives = 25/48 (52%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNG 76
EEEE+ +E EEE+ E+ EEE +R + +E ++ E +
Sbjct: 376 EEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKE 423
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 34.1 bits (79), Expect = 0.023
Identities = 16/21 (76%), Positives = 17/21 (80%)
Query: 29 EEEEEEEEEEEEEKEEEEEEE 49
EEEEEEEEEEE+EE EEE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEE 320
Score = 33.7 bits (78), Expect = 0.030
Identities = 16/21 (76%), Positives = 17/21 (80%)
Query: 30 EEEEEEEEEEEEKEEEEEEEE 50
EEEEEEEEEE+EEE EEE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEE 320
Score = 33.3 bits (77), Expect = 0.038
Identities = 16/23 (69%), Positives = 17/23 (73%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEE 51
EEEEEEEEE+EEEE EEE
Sbjct: 298 AAAAEEEEEEEEEEEEEEPSEEE 320
Score = 32.1 bits (74), Expect = 0.099
Identities = 15/23 (65%), Positives = 17/23 (73%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEE 51
+ EEEEEEEE+EEEEE EE
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPSEE 319
Score = 31.0 bits (71), Expect = 0.21
Identities = 14/23 (60%), Positives = 16/23 (69%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEE 51
+ EEEEEEE+EEEEEE E
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEPSE 318
Score = 27.9 bits (63), Expect = 2.6
Identities = 13/26 (50%), Positives = 17/26 (65%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEE 51
K + + + EEE+EEEEEEEEE
Sbjct: 289 KEVLSAQAQAAAAEEEEEEEEEEEEE 314
Score = 27.1 bits (61), Expect = 4.0
Identities = 17/36 (47%), Positives = 19/36 (52%)
Query: 112 AASLDLTERQSDCKYYVAKEEEEGEDEEEEEEGEEE 147
A +L E S A EEEE E+EEEEEE E
Sbjct: 283 ALDEELKEVLSAQAQAAAAEEEEEEEEEEEEEEPSE 318
Score = 26.4 bits (59), Expect = 8.1
Identities = 14/22 (63%), Positives = 16/22 (72%)
Query: 129 AKEEEEGEDEEEEEEGEEEAKE 150
+EEEE E+EEEEE EEEA
Sbjct: 302 EEEEEEEEEEEEEEPSEEEAAA 323
Score = 26.4 bits (59), Expect = 9.4
Identities = 14/20 (70%), Positives = 16/20 (80%)
Query: 129 AKEEEEGEDEEEEEEGEEEA 148
A+EEEE E+EEEEEE EE
Sbjct: 301 AEEEEEEEEEEEEEEPSEEE 320
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 34.0 bits (78), Expect = 0.026
Identities = 19/35 (54%), Positives = 21/35 (60%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
+ EEEEEEEEEEEEE E E EE E+ E
Sbjct: 400 RWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFEVPE 434
Score = 32.1 bits (73), Expect = 0.12
Identities = 15/28 (53%), Positives = 17/28 (60%)
Query: 24 CNKTIEEEEEEEEEEEEEKEEEEEEEEE 51
+ EEEEEEEEEEEE+ E E E
Sbjct: 397 VVERWAEEEEEEEEEEEEEAAEAEAPME 424
Score = 28.2 bits (63), Expect = 2.3
Identities = 16/29 (55%), Positives = 18/29 (62%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEM 58
E EEEEEEEE+EEEE E E E +
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPV 427
Score = 27.9 bits (62), Expect = 3.1
Identities = 16/30 (53%), Positives = 16/30 (53%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEM 58
EEEEEEEEEE E E EE EM
Sbjct: 406 EEEEEEEEEEAAEAEAPMEEPVPGFEVPEM 435
Score = 27.1 bits (60), Expect = 5.3
Identities = 15/28 (53%), Positives = 18/28 (64%)
Query: 129 AKEEEEGEDEEEEEEGEEEAKEQEQKQV 156
A+EEEE E+EEEEE E EA +E
Sbjct: 402 AEEEEEEEEEEEEEAAEAEAPMEEPVPG 429
Score = 26.3 bits (58), Expect = 8.1
Identities = 14/30 (46%), Positives = 18/30 (60%)
Query: 131 EEEEGEDEEEEEEGEEEAKEQEQKQVGVIE 160
EEE E+EEEEEE EA+ ++ V E
Sbjct: 402 AEEEEEEEEEEEEEAAEAEAPMEEPVPGFE 431
Score = 26.3 bits (58), Expect = 9.2
Identities = 15/32 (46%), Positives = 16/32 (50%)
Query: 34 EEEEEEEEKEEEEEEEEENTRTEEMNERKENI 65
E EEEE+EEEEEEEE M E
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGF 430
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 33.6 bits (77), Expect = 0.043
Identities = 18/67 (26%), Positives = 38/67 (56%), Gaps = 1/67 (1%)
Query: 9 QRSYFIPSSIMIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKK 68
++ Y + + +++ T ++EEEE+EEEEE EE+E+E +++ ++++ K
Sbjct: 368 EQDYILDAFSALEIEDANTERDDEEEEDEEEEE-EEDEDEGPSKEHSDDEEFEEDDVESK 426
Query: 69 KEEEKKN 75
E+ N
Sbjct: 427 YEDSDGN 433
Score = 29.7 bits (67), Expect = 0.65
Identities = 13/41 (31%), Positives = 25/41 (60%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKK 69
E+EEEEEEE+E+E +E ++E +++ + E+ +
Sbjct: 394 EDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNS 434
Score = 28.9 bits (65), Expect = 1.2
Identities = 16/50 (32%), Positives = 24/50 (48%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGL 77
E E+ E + EEEE+EEEE E+ KE+ + ++ EE
Sbjct: 376 FSALEIEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVES 425
Score = 27.8 bits (62), Expect = 3.6
Identities = 12/22 (54%), Positives = 17/22 (77%)
Query: 129 AKEEEEGEDEEEEEEGEEEAKE 150
+EEE+ E+EEEE+E E +KE
Sbjct: 390 DEEEEDEEEEEEEDEDEGPSKE 411
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 33.5 bits (77), Expect = 0.044
Identities = 17/23 (73%), Positives = 20/23 (86%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEE 51
EEEE+E+EEEEEE+EE EE E E
Sbjct: 349 EEEEQEDEEEEEEEEEPEEPEPE 371
Score = 33.1 bits (76), Expect = 0.056
Identities = 18/38 (47%), Positives = 24/38 (63%)
Query: 24 CNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNER 61
C E+++EEEE+E+E+EEEEEEE E EE
Sbjct: 339 CTWVNPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPPL 376
Score = 31.6 bits (72), Expect = 0.17
Identities = 17/29 (58%), Positives = 23/29 (79%)
Query: 23 FCNKTIEEEEEEEEEEEEEKEEEEEEEEE 51
+ N ++EEEE+E+EEEE+EEEE EE E
Sbjct: 341 WVNPEQKDEEEEQEDEEEEEEEEEPEEPE 369
Score = 31.2 bits (71), Expect = 0.26
Identities = 17/35 (48%), Positives = 22/35 (62%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKE 63
EE+E+EEEEEEEE+ EE E EE ++E
Sbjct: 351 EEQEDEEEEEEEEEPEEPEPEEGPPLLTPISEDAP 385
Score = 26.9 bits (60), Expect = 6.0
Identities = 15/32 (46%), Positives = 18/32 (56%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
EEEEEEEEE E E EE + E E+ +
Sbjct: 290 EEEEEEEEEPAERDELEENPDFEGLEVRELAD 321
Score = 26.2 bits (58), Expect = 9.1
Identities = 20/48 (41%), Positives = 27/48 (56%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKN 75
+ E E E EEEEE+EE E +EE+ EE E E + K++E N
Sbjct: 162 VAEVELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGTGVN 209
Score = 26.2 bits (58), Expect = 9.9
Identities = 17/37 (45%), Positives = 21/37 (56%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKE 63
+ E EEEEEEEE + +EE+E EE E KE
Sbjct: 166 ELREGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKE 202
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for cell
viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 33.5 bits (77), Expect = 0.044
Identities = 11/45 (24%), Positives = 26/45 (57%)
Query: 24 CNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKK 68
I+ EE +EEE++E+++ ++ EE EE + ++ + ++
Sbjct: 61 EAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDEVLEQ 105
Score = 28.9 bits (65), Expect = 1.5
Identities = 15/44 (34%), Positives = 26/44 (59%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEE 71
I +E + + EE EEE++EE++N +EE + E K ++E
Sbjct: 58 IAKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFPKARDE 101
>gnl|CDD|153128 cd02437, CCC1_like_1, CCC1-related protein family. CCC1_like_1:
This is a protein family closely related to CCC1, a
family of proteins involved in iron and manganese
transport. Yeast CCC1 is a vacuole transmembrane
protein responsible for the iron and manganese
accumulation in vacuole. .
Length = 175
Score = 32.4 bits (74), Expect = 0.054
Identities = 11/40 (27%), Positives = 17/40 (42%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENIN 66
EE EE+EK E+ + EE+ RK ++
Sbjct: 56 VAEEGSLRGMLEEKEKILLRLEQMLTSTREELAIRKADLP 95
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 32.3 bits (74), Expect = 0.087
Identities = 17/35 (48%), Positives = 19/35 (54%)
Query: 24 CNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEM 58
+ EEEEEEEEEEE+EEE E EM
Sbjct: 398 VVERWAAEEEEEEEEEEEEEEEPVAEVMMMPAPEM 432
Score = 30.4 bits (69), Expect = 0.40
Identities = 16/33 (48%), Positives = 18/33 (54%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
+E EEEEEEEE+EEEEEE E
Sbjct: 399 VERWAAEEEEEEEEEEEEEEEPVAEVMMMPAPE 431
Score = 28.1 bits (63), Expect = 2.7
Identities = 16/29 (55%), Positives = 17/29 (58%)
Query: 35 EEEEEEEKEEEEEEEEENTRTEEMNERKE 63
EEEEE+EEEEEEEEE M E
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPAPE 431
Score = 26.2 bits (58), Expect = 9.3
Identities = 13/28 (46%), Positives = 14/28 (50%)
Query: 130 KEEEEGEDEEEEEEGEEEAKEQEQKQVG 157
EEE E+EEEEEE EE E
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPAP 430
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 32.1 bits (74), Expect = 0.091
Identities = 19/48 (39%), Positives = 24/48 (50%), Gaps = 7/48 (14%)
Query: 20 IQLFCNK-------TIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
I+L+ +K + E EEEEE+ EEEEEEEE EE
Sbjct: 210 IKLYTSKIADAILEGRQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEA 257
Score = 27.1 bits (61), Expect = 4.8
Identities = 16/21 (76%), Positives = 17/21 (80%)
Query: 29 EEEEEEEEEEEEEKEEEEEEE 49
EE EEEEEEEEEE+ EE E E
Sbjct: 238 EEAEEEEEEEEEEEAEEAEAE 258
>gnl|CDD|215595 PLN03130, PLN03130, ABC transporter C family member; Provisional.
Length = 1622
Score = 32.4 bits (74), Expect = 0.099
Identities = 14/45 (31%), Positives = 24/45 (53%), Gaps = 1/45 (2%)
Query: 34 EEEEEEEEKEEEEEEEEENTRTEEMNERKEN-INKKKEEEKKNGL 77
EE EE +EE+++ + N K++ +KKK +E K+ L
Sbjct: 844 EEYVEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVL 888
Score = 26.6 bits (59), Expect = 8.5
Identities = 12/52 (23%), Positives = 26/52 (50%), Gaps = 4/52 (7%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKK----KEEEKKNGL 77
EE EEE+++ + N ++ + +K++ K K+EE++ G+
Sbjct: 847 VEENGEEEDDQTSSKPVANGNANNLKKDSSSKKKSKEGKSVLIKQEERETGV 898
>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
Length = 499
Score = 32.0 bits (72), Expect = 0.12
Identities = 14/31 (45%), Positives = 19/31 (61%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMN 59
EEE+ + E + E EEE+EEEE E+N
Sbjct: 87 EEEDNDRECPDTEAEEEDEEEEIEAPDPEVN 117
Score = 29.6 bits (66), Expect = 0.75
Identities = 23/79 (29%), Positives = 34/79 (43%), Gaps = 9/79 (11%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAYTNTQ 89
E EE EEE+ ++E + E EE EE+ +N E GL R A
Sbjct: 81 EAGEEAEEEDNDRECPDTEAEEEDEEEEIEAPDPEVNPLDAE----GLSGLAREACD--- 133
Query: 90 LLELEKEFHFNKYLCRPRR 108
L+K +++L + RR
Sbjct: 134 --ALKKALLRHRFLWQRRR 150
>gnl|CDD|129022 smart00786, SHR3_chaperone, ER membrane protein SH3. This family
of proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). Shr3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of Shr3, AAPs are retained in the ER.
Length = 196
Score = 31.6 bits (72), Expect = 0.12
Identities = 9/39 (23%), Positives = 18/39 (46%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKK 68
E ++ +++EE EE +E + ++ E KK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 29.3 bits (66), Expect = 0.84
Identities = 10/39 (25%), Positives = 18/39 (46%)
Query: 31 EEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKK 69
E ++ +++EE EE +E + +K KKK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 28.9 bits (65), Expect = 0.88
Identities = 12/38 (31%), Positives = 18/38 (47%)
Query: 126 YYVAKEEEEGEDEEEEEEGEEEAKEQEQKQVGVIEKEK 163
AK++EE EE +E AK+ Q +K+K
Sbjct: 159 RKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKKKK 196
Score = 28.1 bits (63), Expect = 1.8
Identities = 11/43 (25%), Positives = 20/43 (46%), Gaps = 5/43 (11%)
Query: 32 EEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
E ++ +++EE EE +E +K +K E +KK
Sbjct: 158 ERKDAKQKEEFAAEERKEALAA-----AAKKSATPQKVETKKK 195
Score = 27.3 bits (61), Expect = 3.1
Identities = 9/43 (20%), Positives = 16/43 (37%), Gaps = 9/43 (20%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKE 70
++EE EE +E ++ + E KKK+
Sbjct: 163 KQKEEFAAEERKEALAAAAKKSATPQKVE---------TKKKK 196
Score = 27.0 bits (60), Expect = 4.7
Identities = 8/40 (20%), Positives = 18/40 (45%)
Query: 126 YYVAKEEEEGEDEEEEEEGEEEAKEQEQKQVGVIEKEKNK 165
+Y +++ + ++E EE +E +K + E K
Sbjct: 155 WYAERKDAKQKEEFAAEERKEALAAAAKKSATPQKVETKK 194
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 31.9 bits (73), Expect = 0.13
Identities = 9/30 (30%), Positives = 14/30 (46%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEM 58
EEE E+ ++ E+EEE E+
Sbjct: 266 EEEPPPPPPPPEDDDDPPEDEEEQDDAEDR 295
Score = 29.2 bits (66), Expect = 1.1
Identities = 7/35 (20%), Positives = 14/35 (40%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKEN 64
+ EEE E++++ EE + E+
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAED 294
Score = 28.8 bits (65), Expect = 1.3
Identities = 8/40 (20%), Positives = 17/40 (42%)
Query: 32 EEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEE 71
+ EEE E+++ E+ E+ + ++ EE
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299
Score = 28.5 bits (64), Expect = 2.0
Identities = 7/29 (24%), Positives = 15/29 (51%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEE 57
E++++ E+EEE+++ R E
Sbjct: 270 PPPPPPPEDDDDPPEDEEEQDDAEDRALE 298
Score = 28.1 bits (63), Expect = 2.8
Identities = 7/29 (24%), Positives = 18/29 (62%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEM 58
E++++ ++EEE+++ E+ EE+
Sbjct: 272 PPPPPEDDDDPPEDEEEQDDAEDRALEEI 300
Score = 27.3 bits (61), Expect = 4.3
Identities = 6/29 (20%), Positives = 11/29 (37%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEE 57
+ EEE E++++ EE
Sbjct: 260 APPQPPEEEPPPPPPPPEDDDDPPEDEEE 288
Score = 27.3 bits (61), Expect = 4.4
Identities = 7/33 (21%), Positives = 15/33 (45%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNER 61
E E+++ E+EEE+++ + E
Sbjct: 268 EPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 31.8 bits (72), Expect = 0.15
Identities = 25/48 (52%), Positives = 30/48 (62%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
+ E EEEEEEEEEE+EEE+E EEE EE E E N +EE +
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 29.5 bits (66), Expect = 0.95
Identities = 23/51 (45%), Positives = 29/51 (56%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPR 79
EEEEEEEEEE+E +EEE E+EEE E N +E + E + P
Sbjct: 450 EEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPE 500
Score = 29.5 bits (66), Expect = 0.98
Identities = 23/49 (46%), Positives = 30/49 (61%)
Query: 24 CNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEE 72
+ EEEEEEEEEEE+EE+E EEEE EE E + + ++E E
Sbjct: 439 SEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 31.9 bits (73), Expect = 0.15
Identities = 20/51 (39%), Positives = 32/51 (62%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLP 78
+EEEEE+ + +EE++EE+E+ ++ EE E KE KK E ++ LP
Sbjct: 342 LEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELP 392
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 31.4 bits (71), Expect = 0.15
Identities = 17/35 (48%), Positives = 21/35 (60%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKEN 64
+ E EE+EEE+EEEEEEEE + T E N
Sbjct: 223 GSDSESEEDEEEEEEEEEEEEIDVVTVEKRRSSSN 257
Score = 30.3 bits (68), Expect = 0.37
Identities = 14/21 (66%), Positives = 17/21 (80%)
Query: 29 EEEEEEEEEEEEEKEEEEEEE 49
+ E EE+EEEE+EEEEEEE
Sbjct: 223 GSDSESEEDEEEEEEEEEEEE 243
Score = 28.4 bits (63), Expect = 1.9
Identities = 13/18 (72%), Positives = 16/18 (88%)
Query: 27 TIEEEEEEEEEEEEEKEE 44
+ EE+EEEEEEEEE+EE
Sbjct: 226 SESEEDEEEEEEEEEEEE 243
Score = 26.8 bits (59), Expect = 6.8
Identities = 13/17 (76%), Positives = 14/17 (82%)
Query: 131 EEEEGEDEEEEEEGEEE 147
E EE E+EEEEEE EEE
Sbjct: 227 ESEEDEEEEEEEEEEEE 243
Score = 26.4 bits (58), Expect = 7.2
Identities = 13/29 (44%), Positives = 21/29 (72%)
Query: 135 GEDEEEEEEGEEEAKEQEQKQVGVIEKEK 163
G D E EE+ EEE +E+E++++ V+ EK
Sbjct: 223 GSDSESEEDEEEEEEEEEEEEIDVVTVEK 251
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97
family. This model family represents the major capsid
protein component of the heads (capsids) of
bacteriophage HK97, phi-105, P27, and related phage.
This model represents one of several analogous families
lacking detectable sequence similarity. The gene
encoding this component is typically located in an
operon encoding the small and large terminase subunits,
the portal protein and the prohead or maturation
protease [Mobile and extrachromosomal element
functions, Prophage functions].
Length = 384
Score = 31.2 bits (71), Expect = 0.20
Identities = 9/45 (20%), Positives = 21/45 (46%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
E +E + E E+ E EEE + + E + ++ + + ++
Sbjct: 9 AELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEE 53
Score = 30.4 bits (69), Expect = 0.39
Identities = 14/94 (14%), Positives = 33/94 (35%), Gaps = 4/94 (4%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAY 85
+ + E+E+ E EEE+ E + +EE ++++ + + + +E +
Sbjct: 16 RKLTEDEKLAEAEEEKAEYDALKEE----IDKLDAEIDRLEELLDELEAKPAASGEGGGG 71
Query: 86 TNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTE 119
+ + F YL L+
Sbjct: 72 EEEEEEAKAEAAEFRAYLRGGDDALAEERKALST 105
Score = 26.5 bits (59), Expect = 8.3
Identities = 10/60 (16%), Positives = 23/60 (38%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTA 84
EEE+ E + +EE ++ + E + + E K + + ++ + A
Sbjct: 24 LAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAASGEGGGGEEEEEEAKAEAA 83
>gnl|CDD|222948 PHA02941, PHA02941, hypothetical protein; Provisional.
Length = 356
Score = 31.1 bits (70), Expect = 0.23
Identities = 18/33 (54%), Positives = 24/33 (72%)
Query: 21 QLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENT 53
QLF I+EE EE +E++E+EE EEEEE +T
Sbjct: 320 QLFNPFAIDEEMLEETQEQQEEEENEEEEENDT 352
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 31.2 bits (70), Expect = 0.24
Identities = 14/33 (42%), Positives = 19/33 (57%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNER 61
E E++ EE E EKE+E+E E E R E +
Sbjct: 591 EAEQKAREEREREKEKEKEREREREREAERAAK 623
Score = 28.5 bits (63), Expect = 2.2
Identities = 12/29 (41%), Positives = 17/29 (58%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTR 54
K EE E E+E+E+E + E E E E +
Sbjct: 595 KAREEREREKEKEKEREREREREAERAAK 623
Score = 27.3 bits (60), Expect = 4.3
Identities = 14/37 (37%), Positives = 19/37 (51%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERK 62
K EE E+ + E E+K EE E E+ E ER+
Sbjct: 579 KKREEAVEKAKREAEQKAREEREREKEKEKERERERE 615
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 30.4 bits (69), Expect = 0.26
Identities = 14/24 (58%), Positives = 22/24 (91%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEEN 52
+EE+EEEE++EE+ +E+E EEEE+
Sbjct: 117 DEEDEEEEDDEEDDDEDESEEEES 140
Score = 30.0 bits (68), Expect = 0.38
Identities = 14/36 (38%), Positives = 28/36 (77%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKE 63
EE+E +++EE+EE+E++EE+++E+ EE + K+
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKK 144
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 31.0 bits (71), Expect = 0.26
Identities = 16/58 (27%), Positives = 23/58 (39%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTA 84
T+EEEE +EE + E EEE + E E E E + ++ A
Sbjct: 37 TVEEEEARKEEAKREAEEEAKAEAEEAAAAEAEEEAKAEAAAAAPAEEAAEAAAAAEA 94
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 30.9 bits (71), Expect = 0.26
Identities = 16/54 (29%), Positives = 26/54 (48%)
Query: 20 IQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
I F + EE+ E EE +++E+EEEE ++ E+ +K EK
Sbjct: 170 IDGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEK 223
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP),
alpha subunit. The alpha-subunit of the TRAP complex
(TRAP alpha) is a single-spanning membrane protein of
the endoplasmic reticulum (ER) which is found in
proximity of nascent polypeptide chains translocating
across the membrane.
Length = 281
Score = 30.5 bits (69), Expect = 0.32
Identities = 14/34 (41%), Positives = 23/34 (67%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
T +EE E++ +E+E++E EE+EN TEE +
Sbjct: 35 TEDEEAEDDVVDEDEEDEAVVEEDENELTEEEED 68
Score = 29.0 bits (65), Expect = 1.1
Identities = 13/25 (52%), Positives = 17/25 (68%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEE 51
E+EE+E EE+E E EEEE+E
Sbjct: 45 VDEDEEDEAVVEEDENELTEEEEDE 69
Score = 28.6 bits (64), Expect = 1.3
Identities = 12/35 (34%), Positives = 21/35 (60%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKE 63
E+EE E++ +E++E+E EE+ E E +E
Sbjct: 36 EDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEE 70
Score = 28.6 bits (64), Expect = 1.5
Identities = 12/27 (44%), Positives = 17/27 (62%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEENT 53
+EE+E EE+E + EEEE+EE
Sbjct: 46 DEDEEDEAVVEEDENELTEEEEDEEGE 72
Score = 28.2 bits (63), Expect = 2.1
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 127 YVAKEEEEGEDEEEEEEGEEEAKEQEQKQVGVIEKE 162
E+EE ED+ +E+ E+EA +E + E+E
Sbjct: 32 QDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEE 67
Score = 27.0 bits (60), Expect = 4.4
Identities = 13/59 (22%), Positives = 28/59 (47%), Gaps = 2/59 (3%)
Query: 15 PSSIMIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
P+ ++ ++ E+EE E + +E+EE+ EE + E ++++EE
Sbjct: 15 PAVLLTFGNGKGPFASAQDLTEDEEAEDDVVDEDEEDEAVVEE--DENELTEEEEDEEG 71
Score = 27.0 bits (60), Expect = 5.1
Identities = 11/28 (39%), Positives = 18/28 (64%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRT 55
++E+EE+E EE++ E EEEE+
Sbjct: 45 VDEDEEDEAVVEEDENELTEEEEDEEGE 72
Score = 26.3 bits (58), Expect = 7.9
Identities = 14/35 (40%), Positives = 20/35 (57%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKE 63
EE E++ +E+EE E EE+E EE +E E
Sbjct: 38 EEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 30.4 bits (69), Expect = 0.32
Identities = 14/40 (35%), Positives = 18/40 (45%), Gaps = 2/40 (5%)
Query: 126 YYVAKEEEEGEDEEEEEEGEEEAKEQEQKQVGVIEKEKNK 165
+ AK EE EE E AKE+E + E +K K
Sbjct: 159 WKDAKLLEEF--AAEEAEAAAAAKEEESAEGEKKESKKKK 196
Score = 28.1 bits (63), Expect = 1.9
Identities = 10/30 (33%), Positives = 12/30 (40%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRTEE 57
EE EE E +EEE E + E
Sbjct: 163 KLLEEFAAEEAEAAAAAKEEESAEGEKKES 192
Score = 26.9 bits (60), Expect = 4.9
Identities = 13/40 (32%), Positives = 19/40 (47%), Gaps = 2/40 (5%)
Query: 35 EEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
E ++ + EE EE E + E E +KKE +KK
Sbjct: 158 EWKDAKLLEEFAAEEAEAAAAAKEEESAE--GEKKESKKK 195
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more
internally with limited reactivity in the C-terminal
domains, while P2 proteins seem to be more externally
located and are more likely to interact with other
cellular components. In lower eukaryotes, P1 and P2 are
further subdivided into P1A, P1B, P2A, and P2B, which
form P1A/P2B and P1B/P2A heterodimers. Some plant
species have a third P-protein, called P3, which is not
homologous to P1 and P2. In humans, P1 and P2 are
strongly autoimmunogenic. They play a significant role
in the etiology and pathogenesis of systemic lupus
erythema (SLE). In addition, the ribosome-inactivating
protein trichosanthin (TCS) interacts with human P0,
P1, and P2, with its primary binding site located in
the C-terminal region of P2. TCS inactivates the
ribosome by depurinating a specific adenine in the
sarcin-ricin loop of 28S rRNA.
Length = 103
Score = 29.6 bits (67), Expect = 0.33
Identities = 10/20 (50%), Positives = 16/20 (80%)
Query: 33 EEEEEEEEEKEEEEEEEEEN 52
E ++EE+KEEEEEE +++
Sbjct: 78 AAEAKKEEKKEEEEEESDDD 97
Score = 26.5 bits (59), Expect = 3.0
Identities = 8/20 (40%), Positives = 16/20 (80%)
Query: 29 EEEEEEEEEEEEEKEEEEEE 48
E ++EE++EEE+EE +++
Sbjct: 78 AAEAKKEEKKEEEEEESDDD 97
Score = 26.1 bits (58), Expect = 4.2
Identities = 8/20 (40%), Positives = 16/20 (80%)
Query: 30 EEEEEEEEEEEEKEEEEEEE 49
E ++EE++EE+EEE +++
Sbjct: 78 AAEAKKEEKKEEEEEESDDD 97
>gnl|CDD|227391 COG5058, LAG1, Protein transporter of the TRAM (translocating
chain-associating membrane) superfamily, longevity
assurance factor [Intracellular trafficking and
secretion].
Length = 395
Score = 30.6 bits (69), Expect = 0.35
Identities = 10/40 (25%), Positives = 19/40 (47%)
Query: 17 SIMIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTE 56
+I + I E E ++E ++E EEE + E + +
Sbjct: 356 LFLIVRVAYRVIWEGELKDERSDDESEEESDLESSEDKND 395
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 30.6 bits (70), Expect = 0.40
Identities = 11/54 (20%), Positives = 24/54 (44%)
Query: 22 LFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKN 75
+ ++ +EE E+ E ++EEE+E+ E M + + K+ +
Sbjct: 158 ILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNFDPK 211
Score = 30.6 bits (70), Expect = 0.46
Identities = 16/53 (30%), Positives = 29/53 (54%)
Query: 23 FCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKN 75
F I+ E ++ +EE E+ E ++EEE+E E + K+ +KK++ K
Sbjct: 156 FGILKIDFEIDDSKEELEKFEAQKEEEDEKLAKEALEAMKKLEAEKKKQSKNF 208
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 30.7 bits (69), Expect = 0.41
Identities = 12/50 (24%), Positives = 21/50 (42%), Gaps = 5/50 (10%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKEN-----INKKKEEEKK 74
+E+ ++E + E E E T + E K +N KK++ K
Sbjct: 801 PKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 29.2 bits (66), Expect = 0.42
Identities = 14/21 (66%), Positives = 18/21 (85%)
Query: 29 EEEEEEEEEEEEEKEEEEEEE 49
EE++EEEEEEE++EE EEE
Sbjct: 76 AAEEKKEEEEEEEEKEESEEE 96
Score = 29.2 bits (66), Expect = 0.48
Identities = 16/23 (69%), Positives = 18/23 (78%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEE 51
EE++EEEEEEEEKEE EEE
Sbjct: 77 AEEKKEEEEEEEEKEESEEEAAA 99
Score = 29.2 bits (66), Expect = 0.49
Identities = 14/22 (63%), Positives = 18/22 (81%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEE 50
EE++EEEEEE+E+EE EEE
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEE 96
Score = 29.2 bits (66), Expect = 0.50
Identities = 14/23 (60%), Positives = 18/23 (78%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEE 51
EE++EEEEE+EE+EE EEE
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEE 96
Score = 28.8 bits (65), Expect = 0.58
Identities = 13/23 (56%), Positives = 17/23 (73%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEE 51
EE++EEEE+EEE+EE EE
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEE 95
Score = 27.6 bits (62), Expect = 1.4
Identities = 12/23 (52%), Positives = 16/23 (69%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEE 51
EE++EEE+EEEE+EE E
Sbjct: 72 AAAAAAEEKKEEEEEEEEKEESE 94
Score = 26.1 bits (58), Expect = 5.5
Identities = 11/23 (47%), Positives = 15/23 (65%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEE 51
EE++EE+EEEEE+EE
Sbjct: 71 AAAAAAAEEKKEEEEEEEEKEES 93
>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
Length = 249
Score = 30.0 bits (68), Expect = 0.44
Identities = 16/53 (30%), Positives = 20/53 (37%), Gaps = 10/53 (18%)
Query: 15 PSSIMIQLFCNKTIEEEEEEEEEEEEEK----------EEEEEEEEENTRTEE 57
+M+ LF + EE EE+EE E E EE EE E
Sbjct: 194 EWDVMVDLFFWRDPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNE 246
>gnl|CDD|219956 pfam08658, Rad54_N, Rad54 N terminal. This is the N terminal of
the DNA repair protein Rad54.
Length = 191
Score = 30.0 bits (68), Expect = 0.44
Identities = 15/48 (31%), Positives = 29/48 (60%), Gaps = 3/48 (6%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNER-KE--NINKKKEE 71
T++++ + EEE+ E+ +E EE E + + ++++ E I KK EE
Sbjct: 140 TVDDKPKIEEEKAEKDQEPEESETKLSNGPKVHKSLAEILGIKKKVEE 187
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 30.6 bits (69), Expect = 0.46
Identities = 15/49 (30%), Positives = 27/49 (55%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
K ++E++EE++EE+ E+ + EE +EE E+ + E KK
Sbjct: 277 KELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKK 325
Score = 27.9 bits (62), Expect = 3.3
Identities = 21/60 (35%), Positives = 32/60 (53%), Gaps = 1/60 (1%)
Query: 20 IQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKE-NINKKKEEEKKNGLP 78
++ K E++EE+ EE + E+ +EE EEEE E+ E+ NKK +E G P
Sbjct: 276 LKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKKFIKEVIAGRP 335
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 30.4 bits (68), Expect = 0.49
Identities = 17/56 (30%), Positives = 27/56 (48%)
Query: 110 EIAASLDLTERQSDCKYYVAKEEEEGEDEEEEEEGEEEAKEQEQKQVGVIEKEKNK 165
+D R+ V +EEEE E EE+EE E+E + ++ E+E+ K
Sbjct: 121 IAPPEIDPEPRKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELK 176
Score = 26.9 bits (59), Expect = 5.5
Identities = 15/41 (36%), Positives = 23/41 (56%)
Query: 32 EEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEE 72
EEEEE E E++EE E+E +E +E +E K ++
Sbjct: 140 LEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMKD 180
>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
domain 1. The small mechanosensitive channel, MscS, is
a part of the turgor-driven solute efflux system that
protects bacteria from lysis in the event of osmotic
shock. The MscS protein alone is sufficient to form a
functional mechanosensitive channel gated directly by
tension in the lipid bilayer. The MscS proteins are
heptamers of three transmembrane subunits with seven
converging M3 domains, and this domain is one of the
inner membrane domains.
Length = 339
Score = 29.9 bits (68), Expect = 0.49
Identities = 12/38 (31%), Positives = 16/38 (42%), Gaps = 5/38 (13%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENIN 66
E + EEEEE E E E E++ E I+
Sbjct: 268 AEILAQRAEEEEESSEGAAETIE---EPELD--LETIS 300
>gnl|CDD|184416 PRK13955, mscL, large-conductance mechanosensitive channel;
Provisional.
Length = 130
Score = 29.0 bits (65), Expect = 0.62
Identities = 15/46 (32%), Positives = 26/46 (56%)
Query: 18 IMIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKE 63
+ +++F T ++EEE+EEE E +EEE E ++ N K+
Sbjct: 85 MFVKVFNKLTSKKEEEKEEEIPEPTKEEELLGEIRDLLKQQNSSKD 130
>gnl|CDD|223626 COG0552, FtsY, Signal recognition particle GTPase [Intracellular
trafficking and secretion].
Length = 340
Score = 29.9 bits (68), Expect = 0.62
Identities = 18/42 (42%), Positives = 26/42 (61%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKE 70
E+ EEE EE E EKE+ EEE+EE + K+ ++K K+
Sbjct: 5 EKLEEELEELEAEKEKIEEEDEEEEKEGWFERLKQGLSKTKK 46
Score = 29.1 bits (66), Expect = 0.93
Identities = 19/48 (39%), Positives = 29/48 (60%), Gaps = 4/48 (8%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEM----NERKENINKK 68
+ +EEE EE E E+E+ EEE+EEEE+ E + ++ K+N K
Sbjct: 4 FEKLEEELEELEAEKEKIEEEDEEEEKEGWFERLKQGLSKTKKNFGKG 51
Score = 26.8 bits (60), Expect = 6.1
Identities = 14/29 (48%), Positives = 19/29 (65%)
Query: 23 FCNKTIEEEEEEEEEEEEEKEEEEEEEEE 51
F E EE E E+E+ E+E+EEEE+E
Sbjct: 4 FEKLEEELEELEAEKEKIEEEDEEEEKEG 32
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 29.6 bits (67), Expect = 0.64
Identities = 6/21 (28%), Positives = 9/21 (42%)
Query: 30 EEEEEEEEEEEEKEEEEEEEE 50
+EEEEEE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303
Score = 29.6 bits (67), Expect = 0.65
Identities = 7/20 (35%), Positives = 8/20 (40%)
Query: 32 EEEEEEEEEEKEEEEEEEEE 51
EEEEEEE+
Sbjct: 283 AAAAAAAAAAPAEEEEEEED 302
Score = 29.6 bits (67), Expect = 0.75
Identities = 7/21 (33%), Positives = 9/21 (42%)
Query: 31 EEEEEEEEEEEKEEEEEEEEE 51
EEEEEEE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303
Score = 28.4 bits (64), Expect = 1.8
Identities = 6/23 (26%), Positives = 9/23 (39%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEE 48
EE+EEEE++
Sbjct: 281 AAAAAAAAAAAAPAEEEEEEEDD 303
Score = 28.1 bits (63), Expect = 2.0
Identities = 6/21 (28%), Positives = 9/21 (42%)
Query: 29 EEEEEEEEEEEEEKEEEEEEE 49
E+EEEEE++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303
>gnl|CDD|224881 COG1970, MscL, Large-conductance mechanosensitive channel [Cell
envelope biogenesis, outer membrane].
Length = 130
Score = 28.8 bits (65), Expect = 0.66
Identities = 10/31 (32%), Positives = 13/31 (41%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTE 56
K I + + E+EE E EE TE
Sbjct: 92 KAINKLRRKLEKEEPEAPAPAPPAEEVLLTE 122
Score = 27.6 bits (62), Expect = 2.1
Identities = 8/31 (25%), Positives = 14/31 (45%), Gaps = 1/31 (3%)
Query: 18 IMIQLFCNKTIEEEEEEEEEEEEEKEEEEEE 48
++++ NK + E+EE E EE
Sbjct: 89 LVVKAI-NKLRRKLEKEEPEAPAPAPPAEEV 118
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 28.8 bits (65), Expect = 0.77
Identities = 18/56 (32%), Positives = 32/56 (57%)
Query: 19 MIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
+ +L E+EEEEEEE E E+ +EEE+ +E E ++E + + ++K+
Sbjct: 90 VRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145
Score = 26.5 bits (59), Expect = 5.2
Identities = 20/48 (41%), Positives = 29/48 (60%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
K EEEEE E EE +E+E+ +E E+ + +R+EN K+KE K
Sbjct: 101 KEEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENERKQKEILK 148
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 29.2 bits (66), Expect = 0.84
Identities = 14/32 (43%), Positives = 22/32 (68%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
EEEEEE++ E E E+EE+E+EE ++ +
Sbjct: 287 EEEEEEDDYSESEDEDEEDEDEEEEEDDDEGD 318
Score = 29.2 bits (66), Expect = 1.0
Identities = 16/45 (35%), Positives = 27/45 (60%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
+EE+ EE E+ E+EEEE++ E+ +E +E +E E +K
Sbjct: 275 DEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 28.1 bits (63), Expect = 2.0
Identities = 15/45 (33%), Positives = 32/45 (71%)
Query: 31 EEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKN 75
+EE+ EE EK EEEEEE++ + +E+ +E E+ +++++++ +
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGD 318
Score = 27.7 bits (62), Expect = 2.7
Identities = 14/37 (37%), Positives = 26/37 (70%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKEN 64
+E+ EEEEEE++ + E+E+EE+E+ EE ++ +
Sbjct: 283 LEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 28.0 bits (63), Expect = 0.86
Identities = 12/20 (60%), Positives = 14/20 (70%)
Query: 32 EEEEEEEEEEKEEEEEEEEE 51
EEE+KEEEEEEEE+
Sbjct: 61 AAAAAAEEEKKEEEEEEEED 80
Score = 28.0 bits (63), Expect = 0.88
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 31 EEEEEEEEEEEKEEEEEEEEE 51
EEE+++EEEEEEE++
Sbjct: 61 AAAAAAEEEKKEEEEEEEEDD 81
Score = 27.6 bits (62), Expect = 1.0
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 30 EEEEEEEEEEEEKEEEEEEEE 50
EEE++E+EEEEEE++
Sbjct: 61 AAAAAAEEEKKEEEEEEEEDD 81
Score = 27.6 bits (62), Expect = 1.1
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 29 EEEEEEEEEEEEEKEEEEEEE 49
EEE++EE+EEEEE++
Sbjct: 61 AAAAAAEEEKKEEEEEEEEDD 81
>gnl|CDD|182840 PRK10921, PRK10921, twin-arginine protein translocation system
subunit TatC; Provisional.
Length = 258
Score = 29.3 bits (66), Expect = 0.86
Identities = 9/28 (32%), Positives = 13/28 (46%), Gaps = 3/28 (10%)
Query: 125 KYYVAKEEEEGEDEEEEEEGEEEAKEQE 152
++YV K EEE + E K +E
Sbjct: 234 RFYVGKGRR---REEENDAEAESEKTEE 258
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 29.3 bits (66), Expect = 0.89
Identities = 13/58 (22%), Positives = 29/58 (50%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLR 82
+K E +++ E++++++KE++E + E T+ +K KKK + K
Sbjct: 61 DKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKPPKPKPNEDVDNA 118
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 28.1 bits (63), Expect = 1.1
Identities = 17/24 (70%), Positives = 18/24 (75%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEEN 52
EEEEEEEEE+EEEEEE EE
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEEE 95
Score = 27.3 bits (61), Expect = 2.1
Identities = 16/23 (69%), Positives = 17/23 (73%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEE 51
EEEEEEEE+EEEEEEE E
Sbjct: 71 AAAAAEEEEEEEEEEEEEEEESE 93
>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
Length = 586
Score = 29.0 bits (65), Expect = 1.1
Identities = 17/48 (35%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 2 IASQERQQRSYFIPSSIMIQLFCNKTIEEEEEEEEEEEEEKEEEEEEE 49
I S R + P I +QL K +EE+ EE +E+E+E+ E+E
Sbjct: 33 IDSSFRSSSNPMPP--IPLQLLEQKEEKEEDAGEEGDEDEEEQGEDEH 78
>gnl|CDD|221342 pfam11960, DUF3474, Domain of unknown function (DUF3474). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria and eukaryotes. This domain
is typically between 126 to 140 amino acids in length.
This domain is found associated with pfam00487.
Length = 135
Score = 28.5 bits (64), Expect = 1.2
Identities = 11/18 (61%), Positives = 15/18 (83%)
Query: 27 TIEEEEEEEEEEEEEKEE 44
++EEEEEE +EEE+EE
Sbjct: 72 SVEEEEEENGNDEEEEEE 89
Score = 28.1 bits (63), Expect = 1.2
Identities = 12/16 (75%), Positives = 13/16 (81%)
Query: 36 EEEEEEKEEEEEEEEE 51
EEEEEE +EEEEEE
Sbjct: 74 EEEEEENGNDEEEEEE 89
Score = 27.7 bits (62), Expect = 1.9
Identities = 12/16 (75%), Positives = 13/16 (81%)
Query: 34 EEEEEEEEKEEEEEEE 49
EEEEEE +EEEEEE
Sbjct: 74 EEEEEENGNDEEEEEE 89
Score = 27.3 bits (61), Expect = 2.6
Identities = 11/16 (68%), Positives = 13/16 (81%)
Query: 30 EEEEEEEEEEEEKEEE 45
EEEEEE +EE+EEE
Sbjct: 74 EEEEEENGNDEEEEEE 89
Score = 27.3 bits (61), Expect = 2.8
Identities = 12/16 (75%), Positives = 12/16 (75%)
Query: 33 EEEEEEEEEKEEEEEE 48
EEEEEE EEEEEE
Sbjct: 74 EEEEEENGNDEEEEEE 89
>gnl|CDD|216881 pfam02115, Rho_GDI, RHO protein GDP dissociation inhibitor.
Length = 200
Score = 28.5 bits (64), Expect = 1.2
Identities = 15/44 (34%), Positives = 22/44 (50%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEE 72
E+E E+ EE E+EE+E+ N + KE KE+E
Sbjct: 3 EQEPTAEQAEEIAAEDEEDEDSVNYKPPPQKSLKEIQELDKEDE 46
>gnl|CDD|237001 PRK11856, PRK11856, branched-chain alpha-keto acid dehydrogenase
subunit E2; Reviewed.
Length = 411
Score = 29.0 bits (66), Expect = 1.3
Identities = 9/25 (36%), Positives = 9/25 (36%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEE 51
IEEE E E E E E
Sbjct: 75 VIEEEGEAEAAAAAEAAPEAPAPEP 99
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.0 bits (65), Expect = 1.3
Identities = 12/50 (24%), Positives = 25/50 (50%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
NK +++ ++ E ++E + EE E + + E E K++ +E K
Sbjct: 375 NKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDK 424
Score = 27.8 bits (62), Expect = 3.4
Identities = 13/47 (27%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMN-ERKENINKKKEEEKK 74
++ E ++E + +E E + +EE + EE ++KE ++ KE+ +K
Sbjct: 383 KKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQK 429
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 28.8 bits (64), Expect = 1.3
Identities = 16/54 (29%), Positives = 25/54 (46%)
Query: 20 IQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
I F IE EE E EEE E+ E+E + + R + +E + + E+
Sbjct: 367 ILYFEKMEIENRNPEESEHEEEVEDYEDENDHSKRICDDDELENHFRAADEKNS 420
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 28.2 bits (63), Expect = 1.3
Identities = 11/22 (50%), Positives = 17/22 (77%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEE 50
+EEEEEEEE EK+++ +E +
Sbjct: 94 TRQEEEEEEEENEKQQQSDEAQ 115
Score = 26.7 bits (59), Expect = 3.5
Identities = 14/37 (37%), Positives = 25/37 (67%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKE 63
T +EEEEEEEE E++++ +E + +++ + E E E
Sbjct: 94 TRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGE 130
Score = 26.3 bits (58), Expect = 5.3
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
+EEEEEEEE E++++ +E + + +E
Sbjct: 92 GHTRQEEEEEEEENEKQQQSDEAQVQQHQQHE 123
>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
Length = 382
Score = 28.7 bits (65), Expect = 1.4
Identities = 11/56 (19%), Positives = 26/56 (46%)
Query: 19 MIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
++ + + EEEEE EEEE + E + + +++ +++ E++
Sbjct: 219 LVAILLGEEEEEEEEVEEEEAQASPAAEPATAQAAPAPKQEQQQAPPQRQEPEKEA 274
>gnl|CDD|162338 TIGR01398, FlhA, flagellar biosynthesis protein FlhA. This model
describes flagellar biosynthesis protein FlhA, one of a
large number of genes associated with the biosynthesis
of functional bacterial flagella. Homologs of many such
proteins, including FlhA, function in type III protein
secretion systems. A separate model describes InvA
(Salmonella enterica), LcrD (Yersinia enterocolitica),
HrcV (Xanthomonas), etc., all of which score below the
noise cutoff for this model [Cellular processes,
Chemotaxis and motility].
Length = 678
Score = 28.8 bits (65), Expect = 1.4
Identities = 11/25 (44%), Positives = 14/25 (56%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEEN 52
EEE E + +E+ EEEEE N
Sbjct: 315 QEEEAAAEAAKAQEEAAEEEEESIN 339
Score = 28.8 bits (65), Expect = 1.6
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENT 53
++EEE E + +EE EEEE +
Sbjct: 313 SKQEEEAAAEAAKAQEEAAEEEEES 337
Score = 27.2 bits (61), Expect = 4.5
Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 3/32 (9%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
++EEE E K +EE EEE E +N+
Sbjct: 312 RSKQEEEAAAEAAKAQEEAAEEE---EESIND 340
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 28.9 bits (64), Expect = 1.6
Identities = 20/55 (36%), Positives = 33/55 (60%)
Query: 20 IQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
I+L KT E+E +E+EE+ +K +E+ EE + +E +E K+ KK +E K
Sbjct: 203 IELMVEKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKKVKEVTK 257
>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
Length = 697
Score = 28.5 bits (65), Expect = 1.6
Identities = 11/34 (32%), Positives = 18/34 (52%)
Query: 126 YYVAKEEEEGEDEEEEEEGEEEAKEQEQKQVGVI 159
Y + K E++ + EE EEEA E E++ +
Sbjct: 324 YRLRKREKKAAELAAEEAEEEEAAEPEEESWDDV 357
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The
domain is found in the primary vegetative sigma factor.
The function of this domain is unclear and can be
removed without loss of function.
Length = 211
Score = 28.3 bits (64), Expect = 1.7
Identities = 9/34 (26%), Positives = 19/34 (55%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
E E +EE+ E+ ++++E+E+E+ E
Sbjct: 41 AAAIESELDEEDLEDDDDDDEDEDEDDEEEADLG 74
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 28.6 bits (65), Expect = 1.8
Identities = 33/147 (22%), Positives = 66/147 (44%), Gaps = 17/147 (11%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAY 85
+ +EE ++E E ++E E +EE R E E +E N+ ++ EK+ +
Sbjct: 42 RILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRL----LQKEEN 97
Query: 86 TNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQSDCKYYVAKEEEEGED----EEEE 141
+ +L LEK + L + + +L +++ + + + ++ +E E EE
Sbjct: 98 LDRKLELLEKR---EEELEKKEKELEQKQQELEKKEEELEELIEEQLQELERISGLTAEE 154
Query: 142 ------EEGEEEAKEQEQKQVGVIEKE 162
E+ EEEA+ + + IE+E
Sbjct: 155 AKEILLEKVEEEARHEAAVLIKEIEEE 181
>gnl|CDD|180935 PRK07333, PRK07333, 2-octaprenyl-6-methoxyphenyl hydroxylase;
Provisional.
Length = 403
Score = 28.4 bits (64), Expect = 1.8
Identities = 12/32 (37%), Positives = 18/32 (56%)
Query: 113 ASLDLTERQSDCKYYVAKEEEEGEDEEEEEEG 144
+SL TER +D + VA ++ E E E+ G
Sbjct: 221 SSLVWTERTADAERLVALDDLVFEAELEQRFG 252
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 28.3 bits (63), Expect = 1.9
Identities = 11/38 (28%), Positives = 20/38 (52%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKE 63
+ IE+ E E E + E+ EE+E+E++ + K
Sbjct: 666 EAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKA 703
Score = 27.1 bits (60), Expect = 6.0
Identities = 15/44 (34%), Positives = 20/44 (45%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEE 72
E E E+ E E E + E+ EE+E E NE I+K
Sbjct: 664 ECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRA 707
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 28.3 bits (63), Expect = 2.1
Identities = 21/94 (22%), Positives = 43/94 (45%), Gaps = 5/94 (5%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAYTNT 88
E EE E E E E E+ + + ++E+I+ + + L R+L+ N
Sbjct: 65 HNSESAVEEVEAEDEIEVEQNQSDVLKSSSIVKEESISTDMDGIDDDSLDRKLKLERENL 124
Query: 89 QLLELEK--EFHF---NKYLCRPRRIEIAASLDL 117
+ E+E+ E +F NK P+ ++ + +++
Sbjct: 125 RKREIEELAEENFSRGNKLFVYPQVVKPDSDIEV 158
>gnl|CDD|218328 pfam04921, XAP5, XAP5, circadian clock regulator. This protein
is found in a wide range of eukaryotes. It is a nuclear
protein and is suggested to be DNA binding. In plants,
this family is essential for correct circadian clock
functioning by acting as a light-quality regulator
coordinating the activities of blue and red light
signalling pathways during plant growth - inhibiting
growth in red light but promoting growth in blue light.
Length = 233
Score = 28.1 bits (63), Expect = 2.1
Identities = 22/80 (27%), Positives = 35/80 (43%), Gaps = 11/80 (13%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINK-----------KKEEEKKNGL 77
+++EEE+E+E E E++ +E +N K+ I K K EEK+ L
Sbjct: 14 GDDDEEEDEDEGEDEKKVPKESSEPDEANVNPNKKKIGKNPSVDTSFLPDKAREEKEAEL 73
Query: 78 PRRLRTAYTNTQLLELEKEF 97
LR + Q E+E
Sbjct: 74 REELREEFLKKQEAVKEEEI 93
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 28.5 bits (64), Expect = 2.1
Identities = 12/34 (35%), Positives = 22/34 (64%)
Query: 23 FCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTE 56
F EE+ ++++++E+K+EEE + EE R E
Sbjct: 735 FKISDSVEEKTKKKKKKEKKKEEEYKREEKARIE 768
>gnl|CDD|219038 pfam06459, RR_TM4-6, Ryanodine Receptor TM 4-6. This region covers
TM regions 4-6 of the ryanodine receptor 1 family.
Length = 217
Score = 27.7 bits (62), Expect = 2.2
Identities = 15/42 (35%), Positives = 20/42 (47%)
Query: 20 IQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNER 61
I LF + EEEEEE ++ EE+EEE + E
Sbjct: 141 ILLFYKVSTSPAEEEEEEGSGSGDDGEEDEEEGIVYFVLQES 182
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 28.4 bits (63), Expect = 2.3
Identities = 13/47 (27%), Positives = 29/47 (61%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKN 75
+E +EE + E++E ++E+ EEN +E ++ + + +++EK N
Sbjct: 4017 DENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMN 4063
Score = 27.7 bits (61), Expect = 4.2
Identities = 32/148 (21%), Positives = 56/148 (37%), Gaps = 18/148 (12%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTE---------EMNERKENINKKKEEEKK- 74
T+ E+++ EE EE + + +E+ E + + EN N EEE +
Sbjct: 3828 TNTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQD 3887
Query: 75 -----NGLPRRLRTAYTNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQSDCKYYVA 129
N +P L + E +E + DL ++ D K
Sbjct: 3888 LDEEVNDIPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKEDDNKALED 3947
Query: 130 KEEEEGEDEEEEEEG---EEEAKEQEQK 154
K+ +E EDEEE + ++E + Q+
Sbjct: 3948 KDRQEKEDEEEMSDDVGIDDEIQPDIQE 3975
Score = 26.9 bits (59), Expect = 7.0
Identities = 25/129 (19%), Positives = 55/129 (42%), Gaps = 25/129 (19%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTA 84
+K + + E+++ +E+++EEE ++ E + +EN ++ E + LP L+
Sbjct: 3937 SKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLK-- 3994
Query: 85 YTNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQSDCKYYVAKEEEEGEDEEEEEEG 144
L+ ++ +++ DL D A E +E D E++E
Sbjct: 3995 ------LDEKEG-------------DVSKDSDL----EDMDMEAADENKEEADAEKDEPM 4031
Query: 145 EEEAKEQEQ 153
++E +E
Sbjct: 4032 QDEDPLEEN 4040
Score = 26.5 bits (58), Expect = 8.7
Identities = 12/41 (29%), Positives = 23/41 (56%)
Query: 23 FCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKE 63
F + ++E+ E+ EE +E EE E+ +++E E+ E
Sbjct: 4052 FSDLAEDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGE 4092
>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
Provisional.
Length = 606
Score = 28.2 bits (63), Expect = 2.3
Identities = 12/43 (27%), Positives = 19/43 (44%)
Query: 10 RSYFIPSSIMIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEEN 52
F+P+ N EE E E EE+EE+++E+
Sbjct: 562 PPTFVPTQEPYDPRANNNEASEETETITVPENNEEDEEDDDED 604
Score = 27.8 bits (62), Expect = 3.2
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEE 51
N+ EE E E E++EE+++E++
Sbjct: 579 NEASEETETITVPENNEEDEEDDDEDD 605
Score = 27.1 bits (60), Expect = 6.1
Identities = 9/28 (32%), Positives = 16/28 (57%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEEN 52
N EE E E +E+EE+++E++
Sbjct: 578 NNEASEETETITVPENNEEDEEDDDEDD 605
>gnl|CDD|224217 COG1298, FlhA, Flagellar biosynthesis pathway, component FlhA [Cell
motility and secretion / Intracellular trafficking and
secretion].
Length = 696
Score = 28.0 bits (63), Expect = 2.4
Identities = 16/31 (51%), Positives = 20/31 (64%)
Query: 21 QLFCNKTIEEEEEEEEEEEEEKEEEEEEEEE 51
L K + EE+E+ EE +KEEEEEEEE
Sbjct: 322 LLSKRKEQQAEEKEKPAEEAKKEEEEEEEES 352
Score = 26.4 bits (59), Expect = 9.6
Identities = 14/24 (58%), Positives = 20/24 (83%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEEN 52
++ EE+E+ EE K+EEEEEEEE+
Sbjct: 329 QQAEEKEKPAEEAKKEEEEEEEES 352
>gnl|CDD|215637 PLN03223, PLN03223, Polycystin cation channel protein; Provisional.
Length = 1634
Score = 28.4 bits (63), Expect = 2.4
Identities = 28/129 (21%), Positives = 51/129 (39%), Gaps = 25/129 (19%)
Query: 33 EEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAYTNTQLLE 92
E +EEEEE EE+E+ +E++ E K L R + Y T
Sbjct: 1475 ENPDEEEEEAFREEKEKVFTYLNKELD----------EAGLKRVLRRCVIETYQKT---- 1520
Query: 93 LEKEFHFNKYLCRPRRI------EIAASLDLTERQSDCKYYVAKEEEEGEDEEEEEEGEE 146
+ R + E A+ D ++ + + + + +DE+++E+G+
Sbjct: 1521 ---DKSKVLLRLRVGNVGKGKKKETLATADEIDQAA--HMLMDQVGQVPDDEDDDEDGDV 1575
Query: 147 EAKEQEQKQ 155
KE +Q Q
Sbjct: 1576 LEKEVDQLQ 1584
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 27.8 bits (62), Expect = 2.4
Identities = 16/66 (24%), Positives = 33/66 (50%), Gaps = 3/66 (4%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAY 85
+ +E E E+ K++EE E ++ EE+ ++ I++ +E++ + L+T
Sbjct: 75 EDEKELSASSLEAEQAKKKEEAEAKKKKEMEELKAVQKKIDQYIKEKQLSS---SLQTKL 131
Query: 86 TNTQLL 91
T LL
Sbjct: 132 TEEGLL 137
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 28.3 bits (64), Expect = 2.4
Identities = 15/54 (27%), Positives = 29/54 (53%)
Query: 19 MIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEE 72
+I E E++ EE E KE E+ +EE + E++ E ++ + ++ E+E
Sbjct: 521 LIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKE 574
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 28.0 bits (63), Expect = 2.4
Identities = 16/45 (35%), Positives = 27/45 (60%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
EE +EE +E++EEK++EE E + + E + E +KK+ K
Sbjct: 278 EERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQARK 322
Score = 26.4 bits (59), Expect = 8.0
Identities = 18/55 (32%), Positives = 28/55 (50%), Gaps = 4/55 (7%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPR 79
+ + + ++ EEEEE+ + EEE R EE E+KE K++ E K L
Sbjct: 255 PEVLRKVDKTREEEEEKILKAAEEE----RQEEAQEKKEEKKKEEREAKLAKLSP 305
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 28.2 bits (63), Expect = 2.5
Identities = 10/30 (33%), Positives = 18/30 (60%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEM 58
E E EEE +E ++ + ++ EE + EE+
Sbjct: 281 EPPEPEEEPDEPDQTDPDDGEETDQIPEEL 310
Score = 27.1 bits (60), Expect = 5.5
Identities = 9/29 (31%), Positives = 15/29 (51%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEE 57
E E EEE +E + + ++ EE + E
Sbjct: 280 PEPPEPEEEPDEPDQTDPDDGEETDQIPE 308
>gnl|CDD|235359 PRK05182, PRK05182, DNA-directed RNA polymerase subunit alpha;
Provisional.
Length = 310
Score = 27.8 bits (63), Expect = 2.6
Identities = 14/44 (31%), Positives = 24/44 (54%), Gaps = 1/44 (2%)
Query: 16 SSIMIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMN 59
+ I+++ + ++ EE EE EEE E+EE E + EE+
Sbjct: 213 AKILVEQL-SVFVDLEEAEEAEEEVEEEEPEFDPILLRPIEELE 255
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 27.8 bits (62), Expect = 2.8
Identities = 28/129 (21%), Positives = 55/129 (42%), Gaps = 11/129 (8%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAY 85
++I++E + E E E+ EEE ++ N+ + KKK +++ RR++
Sbjct: 244 ESIDDELDVLREIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKKGQKRTT---RRVKMRP 300
Query: 86 TNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQSDCKYYVAKEEEEGEDEEEEEEGE 145
+ + + I + S+ Y+ +E+ EDE++EE E
Sbjct: 301 VRAKPSDEPSLPESD--------IHEEIPKLDEKSLSEFLGYMGGIDEDDEDEDDEESKE 352
Query: 146 EEAKEQEQK 154
E K+Q+ K
Sbjct: 353 EVEKKQKVK 361
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 28.1 bits (63), Expect = 2.9
Identities = 19/79 (24%), Positives = 35/79 (44%), Gaps = 11/79 (13%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKK---KEEEKK------NGLPRR 80
E+EE+E +E+++ + +++EE + E K I K +E K GLP
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLPTH 615
Query: 81 LR--TAYTNTQLLELEKEF 97
+ + +L KE+
Sbjct: 616 DADGEEISKKERKKLSKEY 634
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 27.3 bits (61), Expect = 3.0
Identities = 12/46 (26%), Positives = 27/46 (58%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKN 75
E EEEE+ +EEE+E + + + ++ + + K+++EE ++
Sbjct: 123 EREEEEKRDEEERERLLRAAKSRSEQSRLKQKAKEMQKEEDEEMRH 168
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 27.6 bits (62), Expect = 3.1
Identities = 22/48 (45%), Positives = 29/48 (60%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEE 72
+K +E+ EE+ EEEEEE+ E EEEEE EE E+ +K K E
Sbjct: 10 SKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITE 57
Score = 27.6 bits (62), Expect = 3.1
Identities = 22/47 (46%), Positives = 30/47 (63%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
IEEEEEEE E EE+EEEE+EEE+ + ++ K K+K+ E
Sbjct: 19 KIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKEKDIED 65
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 28.0 bits (62), Expect = 3.1
Identities = 18/49 (36%), Positives = 25/49 (51%)
Query: 32 EEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRR 80
E + E E++E+ EE EE EE K+ KK + KKNGL +
Sbjct: 306 PEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 27.7 bits (62), Expect = 3.1
Identities = 15/51 (29%), Positives = 27/51 (52%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKN 75
+ +++ E+ E+EK EE E+ +E+N + E I KK E+ +K
Sbjct: 314 EERELKKKFTEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKR 364
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 27.7 bits (61), Expect = 3.2
Identities = 24/91 (26%), Positives = 38/91 (41%), Gaps = 7/91 (7%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAYTNT 88
++ E+ +EE E E E EN T++ E + K N +PR + NT
Sbjct: 560 QKYEQSDEESVESSSSENSSENENEVTDKGEEIYSLLKKTINRIDMNKIPRPI----INT 615
Query: 89 QLLELEKE---FHFNKYLCRPRRIEIAASLD 116
Q + +K F KY+ P + S+D
Sbjct: 616 QEKKKKKNLKVFETCKYISPPSYYQPYISID 646
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 27.8 bits (62), Expect = 3.3
Identities = 13/47 (27%), Positives = 23/47 (48%), Gaps = 1/47 (2%)
Query: 32 EEEEEEEEEEKEEEEEEEEENTRTEEM-NERKENINKKKEEEKKNGL 77
+ +EE++ + E E+ + E E+ E I KE+ KK G+
Sbjct: 234 NKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIKDLKEKAKKLGI 280
>gnl|CDD|215212 PLN02372, PLN02372, violaxanthin de-epoxidase.
Length = 455
Score = 27.5 bits (61), Expect = 3.3
Identities = 16/58 (27%), Positives = 27/58 (46%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRT 83
K +EE E+ +E + EEE E+E E EE + K ++ +E + L+
Sbjct: 368 KDVEEGEKTIVKEARQIEEELEKEVEKLGKEEESLFKRVALEEGLKELEQDEENFLKE 425
>gnl|CDD|234456 TIGR04074, bacter_Hen1, 3' terminal RNA ribose
2'-O-methyltransferase Hen1. Members of this protein
family are bacterial Hen1, a 3' terminal RNA ribose
2'-O-methyltransferase that acts in bacterial RNA
repair. All members of the seed alignment belong to a
cassette with the RNA repair enzyme polynucleotide
kinase-phosphatase (Pnkp). Chemically similar Hen1 in
eukaryotes acts instead on small regulatory RNAs
[Transcription, RNA processing, Protein synthesis, tRNA
and rRNA base modification].
Length = 462
Score = 27.7 bits (62), Expect = 3.6
Identities = 10/25 (40%), Positives = 14/25 (56%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTR 54
E +E E EE E +E +EE E+
Sbjct: 241 EADEAEPEEAETEEAQEEAAEKPPS 265
Score = 27.3 bits (61), Expect = 4.9
Identities = 10/23 (43%), Positives = 14/23 (60%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEE 50
E +E E EE E E+ +EE E+
Sbjct: 240 AEADEAEPEEAETEEAQEEAAEK 262
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 27.4 bits (61), Expect = 3.6
Identities = 14/46 (30%), Positives = 24/46 (52%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
EEE + E EE + E+ E EE+ R E+ E+ + ++E +
Sbjct: 393 EEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELE 438
Score = 26.3 bits (58), Expect = 8.4
Identities = 8/46 (17%), Positives = 20/46 (43%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
++ E EE E E+ + EE+ +E ++ ++ + +
Sbjct: 398 LQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPE 443
Score = 26.3 bits (58), Expect = 9.2
Identities = 12/46 (26%), Positives = 20/46 (43%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
EE E E+ E E++ E E +E + + +EEE+
Sbjct: 403 EERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEA 448
Score = 26.3 bits (58), Expect = 9.6
Identities = 14/50 (28%), Positives = 19/50 (38%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLP 78
EEEE + E EE E E+ E E + E+E + P
Sbjct: 393 EEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQP 442
>gnl|CDD|218187 pfam04637, Herpes_pp85, Herpesvirus phosphoprotein 85 (HHV6-7
U14/HCMV UL25). This family includes UL25 proteins from
HCMV, as well as U14 proteins from HHV 6 and HHV7. These
85 kD phosphoproteins appear to act as structural
antigens, but their precise function is otherwise
unknown.
Length = 502
Score = 27.4 bits (61), Expect = 3.9
Identities = 10/65 (15%), Positives = 20/65 (30%), Gaps = 1/65 (1%)
Query: 2 IASQERQQRSYFIPSSIMIQLFC-NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
Q + Y PS + L + + E + + E ++ E +
Sbjct: 302 FLRQVHKADVYICPSYVRFALRLLLNRLSRLRDAEAGLDSDSESDDLELDPTGPYGVSFY 361
Query: 61 RKENI 65
R +N
Sbjct: 362 RAKNP 366
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 27.7 bits (62), Expect = 3.9
Identities = 24/140 (17%), Positives = 52/140 (37%), Gaps = 22/140 (15%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEE------------NTRTEEMNERKENINKKKEEEKKNG 76
EE + +E+ + E + E +++ N R + + R ++ EE +
Sbjct: 583 GEETKPQEQPAPKAEAKPERQQDRRKPRQNNRRDRNERRDTRDNRTRREGRENREENRRN 642
Query: 77 LPRRLRTAYT---NTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQSDCKYYVAKEEE 133
+ + + Q EK ++ PRR + R++D K +E +
Sbjct: 643 RRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRR-------ERQRRRNDEKRQAQQEAK 695
Query: 134 EGEDEEEEEEGEEEAKEQEQ 153
EE+ + E+ + +Q
Sbjct: 696 ALNVEEQSVQETEQEERVQQ 715
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 27.0 bits (60), Expect = 4.0
Identities = 11/28 (39%), Positives = 22/28 (78%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEEN 52
+EEEEEEEEEE+E +++++++++
Sbjct: 171 EDEKDEEEEEEEEEEDEDFDDDDDDDDD 198
Score = 27.0 bits (60), Expect = 4.1
Identities = 14/32 (43%), Positives = 26/32 (81%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
E+ +EE+E++EE+EEEEEEE+E+ ++ ++
Sbjct: 165 AEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDD 196
Score = 26.3 bits (58), Expect = 7.1
Identities = 14/36 (38%), Positives = 27/36 (75%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKEN 64
E E+ +EE+E+++EEEEEEEEE+ ++ ++ ++
Sbjct: 163 LEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDD 198
Score = 26.3 bits (58), Expect = 7.5
Identities = 16/26 (61%), Positives = 20/26 (76%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEE 51
K E E E+ +EE+E+ EEEEEEEEE
Sbjct: 159 KLKELEAEDVDEEDEKDEEEEEEEEE 184
Score = 26.3 bits (58), Expect = 7.7
Identities = 15/39 (38%), Positives = 27/39 (69%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINK 67
E+ +EE+E++EEE+EEEEEE+E+ ++ ++ N
Sbjct: 166 EDVDEEDEKDEEEEEEEEEEDEDFDDDDDDDDDDYNAEN 204
Score = 25.9 bits (57), Expect = 9.8
Identities = 13/25 (52%), Positives = 23/25 (92%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEE 51
++EE+E++EEEEEE+EEE+E+ ++
Sbjct: 167 DVDEEDEKDEEEEEEEEEEDEDFDD 191
>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
insulinase-like [General function prediction only].
Length = 978
Score = 27.3 bits (61), Expect = 4.5
Identities = 16/94 (17%), Positives = 40/94 (42%), Gaps = 2/94 (2%)
Query: 10 RSYFI--PSSIMIQLFCNKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINK 67
R YF+ P + + + + +EE+ E+EE E +K E +E+ + + +++ +
Sbjct: 441 RKYFLDNPHYVTVIVLPSPELEEKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQD 500
Query: 68 KKEEEKKNGLPRRLRTAYTNTQLLELEKEFHFNK 101
+ + E+ L+ + + E +
Sbjct: 501 QPDSEEDLATLPTLKLGDVPDPIEKTSLETEVSN 534
>gnl|CDD|145361 pfam02161, Prog_receptor, Progesterone receptor.
Length = 456
Score = 27.3 bits (60), Expect = 4.5
Identities = 12/55 (21%), Positives = 25/55 (45%), Gaps = 3/55 (5%)
Query: 3 ASQERQQRSYFIPSSIMIQLFCNKTIEEEEEEEEEEEEEKEEE---EEEEEENTR 54
SQ + IP ++ L+ +EE +E+ ++++ + E E E +R
Sbjct: 29 GSQTSDGEASAIPIALDGLLYPRSCQGQEEPDEKTQDQQSLSDVERAEPEVEASR 83
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called
The RES complex, pre-mRNA retention and splicing
complex. Subunits of this complex are not essential for
viability of yeasts but they are required for efficient
splicing in vitro and in vivo. Furthermore,
inactivation of this complex causes pre-mRNA leakage
from the nucleus. Bud13 contains a unique,
phylogenetically conserved C-terminal region of unknown
function.
Length = 141
Score = 26.5 bits (59), Expect = 4.5
Identities = 24/70 (34%), Positives = 38/70 (54%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAYT 86
IEE+ EE+E E+EEKE +EE+E+E + E +E ++ E+ K L R
Sbjct: 13 DIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDY 72
Query: 87 NTQLLELEKE 96
+ +L E E+
Sbjct: 73 DEELKEQERW 82
>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366). This
family of proteins is found in bacteria and eukaryotes.
Proteins in this family are typically between 227 and
387 amino acids in length.
Length = 213
Score = 26.9 bits (60), Expect = 4.6
Identities = 10/30 (33%), Positives = 10/30 (33%)
Query: 24 CNKTIEEEEEEEEEEEEEKEEEEEEEEENT 53
C E E E E E EEE E
Sbjct: 126 CCPVCSVNMTECTGPEPEPEPEPEEEPEKK 155
>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
protein TraG; Provisional.
Length = 942
Score = 27.4 bits (61), Expect = 4.6
Identities = 9/48 (18%), Positives = 20/48 (41%), Gaps = 2/48 (4%)
Query: 27 TIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
++ + E + K EE+ +E + +E + + KE + K
Sbjct: 894 ELQNHHKTEALSQNNKYNEEKSAQERMPGAD--SPEELMKRAKEYQDK 939
>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
Length = 869
Score = 27.3 bits (61), Expect = 4.7
Identities = 13/41 (31%), Positives = 19/41 (46%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENIN 66
KTI E E EE EE+ EE EE + + ++ +
Sbjct: 823 KTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDT 863
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 27.3 bits (61), Expect = 4.9
Identities = 6/27 (22%), Positives = 20/27 (74%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEEN 52
+ E++++ E ++ E E++E++E+++
Sbjct: 1362 DSSSEDDDDSEVDDSEDEDDEDDEDDD 1388
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 26.9 bits (60), Expect = 5.0
Identities = 17/47 (36%), Positives = 25/47 (53%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEE 71
+ EE EE E EE E E EEE EE + E ++E + I + + +
Sbjct: 6 EQVEEEVEETEVEEAVEDEVEEETVEEESEAELLDEEQAKIAELEAK 52
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 27.4 bits (60), Expect = 5.0
Identities = 35/141 (24%), Positives = 60/141 (42%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAY 85
K EEE++ + EE ++ EE + + EE + EE ++ E + KK+ EEKK +
Sbjct: 1599 KLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEE 1658
Query: 86 TNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQSDCKYYVAKEEEEGEDEEEEEEGE 145
+ E K+ +K + E K EE + E EE++
Sbjct: 1659 NKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKA 1718
Query: 146 EEAKEQEQKQVGVIEKEKNKE 166
EE K+ E++ E+ K +
Sbjct: 1719 EELKKAEEENKIKAEEAKKEA 1739
Score = 26.6 bits (58), Expect = 8.9
Identities = 28/128 (21%), Positives = 58/128 (45%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAY 85
K EE++++ EE ++ +E+E++ E + E ++ E + KK+ EEKK +
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEE 1727
Query: 86 TNTQLLELEKEFHFNKYLCRPRRIEIAASLDLTERQSDCKYYVAKEEEEGEDEEEEEEGE 145
+ E +KE +K + + + + + + + +E E EEE E
Sbjct: 1728 NKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDE 1787
Query: 146 EEAKEQEQ 153
E+ K + +
Sbjct: 1788 EDEKRRME 1795
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 27.1 bits (60), Expect = 5.2
Identities = 12/44 (27%), Positives = 19/44 (43%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEE 72
E+ E EE E +E EE+E E E + + + E+
Sbjct: 243 NNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDS 286
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 26.2 bits (58), Expect = 5.4
Identities = 14/23 (60%), Positives = 16/23 (69%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEE 51
E E +E EEEEKEEE EEE +
Sbjct: 78 EAAAEADEAEEEEKEEEAEEESD 100
>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
synthase complex, beta subunit. Nomenclature follows
the description for Methanosarcina thermophila. The
CO-methylating acetyl-CoA synthase is considered the
defining enzyme of the Wood-Ljungdahl pathway, used for
acetate catabolism by sulfate reducing bacteria but for
acetate biosynthesis by acetogenic bacteria such as
oorella thermoacetica (f. Clostridium thermoaceticum)
[Energy metabolism, Chemoautotrophy].
Length = 458
Score = 27.1 bits (60), Expect = 5.4
Identities = 12/27 (44%), Positives = 16/27 (59%)
Query: 24 CNKTIEEEEEEEEEEEEEKEEEEEEEE 50
K + E +EEE EEEE+ + EE E
Sbjct: 395 VVKRVVREVDEEEIEEEEEAMQPEEME 421
Score = 27.1 bits (60), Expect = 5.7
Identities = 13/25 (52%), Positives = 15/25 (60%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEE 50
E +EEE EEEEE + EE E E
Sbjct: 399 VVREVDEEEIEEEEEAMQPEEMEME 423
Score = 26.7 bits (59), Expect = 6.1
Identities = 13/32 (40%), Positives = 16/32 (50%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
E +EEE EEEEE + EE E E +
Sbjct: 401 REVDEEEIEEEEEAMQPEEMEMEGFEVPALQM 432
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 26.7 bits (59), Expect = 5.8
Identities = 18/40 (45%), Positives = 26/40 (65%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINK 67
I E+ EEEE EE E++EEEE EEE+ E + + E + +
Sbjct: 75 ISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGDTPEELTE 114
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 26.6 bits (59), Expect = 5.8
Identities = 19/46 (41%), Positives = 36/46 (78%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
E EEEE++E++++KE ++E++E+ + E+M E K + KKK+++KK
Sbjct: 148 EVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKKKKKK 193
Score = 26.2 bits (58), Expect = 7.5
Identities = 13/51 (25%), Positives = 19/51 (37%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRR 80
+ E E E E+E E E +E KKK++E K +
Sbjct: 120 APDGPPSELGSESETSEKETTAKVEKEAEVEEEEKKEKKKKKEVKKEKKEK 170
>gnl|CDD|237046 PRK12297, obgE, GTPase CgtA; Reviewed.
Length = 424
Score = 27.0 bits (61), Expect = 5.9
Identities = 12/26 (46%), Positives = 15/26 (57%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEE 51
+ E EEEE EEE + EEEE+
Sbjct: 327 EETPEFPLEEEEVEEEVYYKFEEEEK 352
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 26.9 bits (60), Expect = 6.0
Identities = 16/48 (33%), Positives = 24/48 (50%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
K IE E + E E E+E E+++E + E ER + K+ EK
Sbjct: 204 KAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEK 251
>gnl|CDD|100111 cd05833, Ribosomal_P2, Ribosomal protein P2. This subfamily
represents the eukaryotic large ribosomal protein P2.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P2 is located in the L12 stalk, with proteins
P1, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers). Bacteria may have four or six copies
of L7/L12 (two or three homodimers) depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plants have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site in the C-terminal region
of P2. TCS inactivates the ribosome by depurinating a
specific adenine in the sarcin-ricin loop of 28S rRNA.
Length = 109
Score = 26.1 bits (58), Expect = 6.1
Identities = 8/17 (47%), Positives = 12/17 (70%)
Query: 35 EEEEEEEKEEEEEEEEE 51
++EEK+EE EEE +
Sbjct: 84 AAAKKEEKKEESEEESD 100
Score = 25.7 bits (57), Expect = 6.7
Identities = 8/18 (44%), Positives = 13/18 (72%)
Query: 34 EEEEEEEEKEEEEEEEEE 51
++EE+KEE EEE ++
Sbjct: 84 AAAKKEEKKEESEEESDD 101
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 26.6 bits (59), Expect = 6.3
Identities = 19/47 (40%), Positives = 28/47 (59%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKN 75
E+ +E E +EE+K +EE+ E E EE+ E E N+ K EE K+
Sbjct: 15 EDCCKENENKEEDKGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKD 61
>gnl|CDD|224739 COG1826, TatA, Sec-independent protein secretion pathway
components [Intracellular trafficking and secretion].
Length = 94
Score = 25.8 bits (57), Expect = 6.5
Identities = 10/46 (21%), Positives = 24/46 (52%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
+ + E +EE + E+ +++++E E + ++ KE+ KK
Sbjct: 45 DVKNELDEELKLEELDDKKKELTAELQATKEELDQLASELKEDLKK 90
>gnl|CDD|237028 PRK12267, PRK12267, methionyl-tRNA synthetase; Reviewed.
Length = 648
Score = 26.7 bits (60), Expect = 6.5
Identities = 10/33 (30%), Positives = 16/33 (48%)
Query: 28 IEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNE 60
I+ EEE +E+ + +E EE + E E
Sbjct: 510 IDVEEEIAYIKEQMEGSAPKEPEEKEKKPEKPE 542
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 26.4 bits (58), Expect = 6.9
Identities = 15/43 (34%), Positives = 21/43 (48%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEE 72
EE+EEE E+ E+ +EE+EE E E K + K
Sbjct: 32 EEDEEEMEDWEDSLDEEDEEAEEVEEETAASSKAPSSSSKSSS 74
>gnl|CDD|215107 PLN00220, PLN00220, tubulin beta chain; Provisional.
Length = 447
Score = 26.7 bits (59), Expect = 7.1
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 125 KYYVAKEEEEGEDEEEEEEGEEEA 148
+Y A +EEGE E+EEEE E E
Sbjct: 424 QYQDATADEEGEYEDEEEEEEYEE 447
Score = 26.3 bits (58), Expect = 9.1
Identities = 13/25 (52%), Positives = 18/25 (72%)
Query: 21 QLFCNKTIEEEEEEEEEEEEEKEEE 45
Q + + T +EE E E+EEEEE+ EE
Sbjct: 423 QQYQDATADEEGEYEDEEEEEEYEE 447
>gnl|CDD|236049 PRK07562, PRK07562, ribonucleotide-diphosphate reductase subunit
alpha; Validated.
Length = 1220
Score = 26.9 bits (60), Expect = 7.1
Identities = 14/55 (25%), Positives = 25/55 (45%), Gaps = 7/55 (12%)
Query: 35 EEEEEEEKEEEEEEEEENTRTEEMNER---KENINKKKEEEKKNGLPRRLRTAYT 86
E++++++ E E R ++ E+ + +E EK LP R R YT
Sbjct: 929 EDDDDDDALEALIEAPAAARAVQVAEKIVERVVERIIREREK---LPDR-RKGYT 979
>gnl|CDD|221937 pfam13148, DUF3987, Protein of unknown function (DUF3987). A
family of uncharacterized proteins found by clustering
human gut metagenomic sequences.
Length = 379
Score = 26.5 bits (59), Expect = 7.1
Identities = 22/62 (35%), Positives = 33/62 (53%), Gaps = 3/62 (4%)
Query: 35 EEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTAYTNTQLLELE 94
EE EEE +EE EEE +E +E+ E ++ K E++ K + + +LLELE
Sbjct: 64 EEIEEELREEYEEELKEYEAEKEIWEAEK---KGLEKKAKKAIKKGKDEEALAEELLELE 120
Query: 95 KE 96
E
Sbjct: 121 AE 122
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 26.9 bits (59), Expect = 7.2
Identities = 14/34 (41%), Positives = 19/34 (55%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERK 62
E EE+E EEEKEEEE +E R ++ +
Sbjct: 768 AESSEEDESSEEEKEEEENKEVSAKRAKKKQRKN 801
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 26.6 bits (59), Expect = 8.2
Identities = 10/40 (25%), Positives = 13/40 (32%)
Query: 26 KTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENI 65
+EE+E E EE + E EE E
Sbjct: 140 PKTDEEKELLREVEELESRLATEPSPAPELEEQLALMEKS 179
>gnl|CDD|218967 pfam06273, eIF-4B, Plant specific eukaryotic initiation factor 4B.
This family consists of several plant specific
eukaryotic initiation factor 4B proteins.
Length = 496
Score = 26.6 bits (58), Expect = 8.4
Identities = 14/70 (20%), Positives = 25/70 (35%)
Query: 25 NKTIEEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKKNGLPRRLRTA 84
+ +E + E EEEK +EE ++ +E+ GL +R
Sbjct: 353 DLELEHRRVDRPETEEEKNLKEEINLLRVDLKKEEAIAPESKGSGQEQSHKGLSELIRQK 412
Query: 85 YTNTQLLELE 94
+ L +E
Sbjct: 413 EKELERLTVE 422
>gnl|CDD|222912 PHA02624, PHA02624, large T antigen; Provisional.
Length = 647
Score = 26.5 bits (59), Expect = 8.6
Identities = 8/32 (25%), Positives = 18/32 (56%)
Query: 22 LFCNKTIEEEEEEEEEEEEEKEEEEEEEEENT 53
LFC++ + ++E+E +EEE +++
Sbjct: 104 LFCDEELSSSDDEDEPPPPSPPPSQEEESQSS 135
>gnl|CDD|216050 pfam00665, rve, Integrase core domain. Integrase mediates
integration of a DNA copy of the viral genome into the
host chromosome. Integrase is composed of three domains.
The amino-terminal domain is a zinc binding domain
pfam02022. This domain is the central catalytic domain.
The carboxyl terminal domain that is a non-specific DNA
binding domain pfam00552. The catalytic domain acts as
an endonuclease when two nucleotides are removed from
the 3' ends of the blunt-ended viral DNA made by reverse
transcription. This domain also catalyzes the DNA strand
transfer reaction of the 3' ends of the viral DNA to the
5' ends of the integration site.
Length = 113
Score = 25.7 bits (57), Expect = 8.6
Identities = 9/37 (24%), Positives = 13/37 (35%), Gaps = 4/37 (10%)
Query: 76 GLPRRLRT----AYTNTQLLELEKEFHFNKYLCRPRR 108
G P + + +T+ EL KE RP
Sbjct: 59 GPPVTIHSDNGSEFTSKAFQELLKELGIKHSFSRPGN 95
>gnl|CDD|113290 pfam04514, BTV_NS2, Bluetongue virus non-structural protein NS2.
This family includes NS2 proteins from other members of
the Orbivirus genus. NS2 is a non-specific
single-stranded RNA-binding protein that forms large
homomultimers and accumulates in viral inclusion bodies
of infected cells. Three RNA binding regions have been
identified in Bluetongue virus serotype 17 at residues
2-11, 153-166 and 274-286. NS2 multimers also possess
nucleotidyl phosphatase activity. The precise function
of NS2 is not known, but it may be involved in the
transport and condensation of viral mRNAs.
Length = 363
Score = 26.4 bits (58), Expect = 8.6
Identities = 15/43 (34%), Positives = 23/43 (53%)
Query: 30 EEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEE 72
E + ++E+E E E+ E + E+ T E ER I K K+E
Sbjct: 253 ERDSDDEDEHGEDSEDGETKPESYITSEYIERISEIRKMKDER 295
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 26.5 bits (59), Expect = 8.8
Identities = 12/48 (25%), Positives = 26/48 (54%), Gaps = 2/48 (4%)
Query: 28 IEEEEEEE--EEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEK 73
I++EEEE +E+++ E++ + +++ E E + K+ EK
Sbjct: 225 IDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKEVSEK 272
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 26.3 bits (58), Expect = 8.9
Identities = 10/46 (21%), Positives = 24/46 (52%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEEKK 74
E+++ EE ++++ E+E ++ E R ++K+ K+ K
Sbjct: 85 EQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALK 130
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 26.3 bits (58), Expect = 9.3
Identities = 15/44 (34%), Positives = 26/44 (59%)
Query: 29 EEEEEEEEEEEEEKEEEEEEEEENTRTEEMNERKENINKKKEEE 72
E E++ E E+ KE+E+ ++E EE+ ER+ N + E+E
Sbjct: 526 ELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKE 569
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 26.4 bits (58), Expect = 9.8
Identities = 10/51 (19%), Positives = 22/51 (43%), Gaps = 3/51 (5%)
Query: 35 EEEEEEEKEEEEEEEEENTRTEEMNERKE---NINKKKEEEKKNGLPRRLR 82
E E ++ E+ E + + E +R+ IN++ +E+ + R
Sbjct: 328 ENAEIKKTRTAEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARA 378
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.299 0.122 0.308
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 8,780,881
Number of extensions: 895289
Number of successful extensions: 19474
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9396
Number of HSP's successfully gapped: 2777
Length of query: 166
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 76
Effective length of database: 6,945,742
Effective search space: 527876392
Effective search space used: 527876392
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 44 (22.0 bits)
S2: 55 (24.8 bits)