RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy6666
(175 letters)
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 129 bits (325), Expect = 5e-36
Identities = 59/188 (31%), Positives = 96/188 (51%), Gaps = 16/188 (8%)
Query: 1 MDESGEQFVAYFLP-------TAETLDKRKLDQLSNLEYTDEQEYEYKMAREYNWNVKSK 53
++ E+FV+Y+LP +TL+KR D +Y +++EY++K REY+ VKSK
Sbjct: 233 VELEEEEFVSYYLPDEKDSEILEDTLEKRSDDLH---DYDEDEEYKFKRVREYDMKVKSK 289
Query: 54 SCKGYEENYFLIVRD-DAVYYNELETRVRLSKRR-----QKVGAAPNNTRLVVTHRPLNA 107
+ K E F + + VYY L +RV L +RR + + NN +L V R +
Sbjct: 290 ATKLNELALFFVSDENGVVYYKPLRSRVELRRRRVNDVIRPLVREHNNDQLNVKLRNPST 349
Query: 108 TEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
E +++ R L+P + EE +EDE+EEEE+ + + E E +EG +G E
Sbjct: 350 KESKMRDKRRARLDPIDFEEVDEDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESS 409
Query: 168 ISECSNED 175
S+ +
Sbjct: 410 SDVGSDSE 417
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 38.5 bits (89), Expect = 0.001
Identities = 17/47 (36%), Positives = 32/47 (68%), Gaps = 3/47 (6%)
Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEA---ERQEGEGGE 159
R+ F+ ++E+E+ED+++EE++ EE ++ E + E +E EGGE
Sbjct: 148 RDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGE 194
Score = 33.1 bits (75), Expect = 0.073
Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 1/52 (1%)
Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ-EGEGGEREGGGEGE 167
E N +++DE+E+E++ +E E E E + +G E E GE
Sbjct: 143 EDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGE 194
Score = 32.7 bits (74), Expect = 0.084
Identities = 12/51 (23%), Positives = 30/51 (58%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
++++E+++E++++E ++ ++ E E + + E + GGE E S D
Sbjct: 154 DDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFTYEKSEVD 204
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 38.1 bits (88), Expect = 0.002
Identities = 18/31 (58%), Positives = 23/31 (74%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
+ EEEEE+EEEEEEE EE ++ E E E +E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 37.3 bits (86), Expect = 0.003
Identities = 17/31 (54%), Positives = 21/31 (67%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAE 151
+ EEEEEE+EEEEEEE EE ++ E E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 36.1 bits (83), Expect = 0.007
Identities = 19/34 (55%), Positives = 21/34 (61%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
E EEEEEE+EEEEEEE EE + E E E
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWPE 900
Score = 35.7 bits (82), Expect = 0.008
Identities = 25/70 (35%), Positives = 31/70 (44%), Gaps = 1/70 (1%)
Query: 106 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEG-GEREGGG 164
N +E I R+ E E E +E + + E EAEE + TEAE E EG E G
Sbjct: 679 NESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEG 738
Query: 165 EGEISECSNE 174
E E E
Sbjct: 739 EEVEDEGEGE 748
Score = 35.4 bits (81), Expect = 0.014
Identities = 18/34 (52%), Positives = 22/34 (64%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
E EEEEEE+EEEEEEE EE E E +++
Sbjct: 872 EEEEEEEEEEEEEEEEEENEEPLSLEWPETRQKQ 905
Score = 34.2 bits (78), Expect = 0.026
Identities = 16/31 (51%), Positives = 21/31 (67%)
Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
+ EEE+EEEEEEE EE ++ E E E + E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 32.3 bits (73), Expect = 0.13
Identities = 16/45 (35%), Positives = 22/45 (48%), Gaps = 1/45 (2%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGG-GEGEI 168
EE E+E E E E + E +G+ E E + E + EGEI
Sbjct: 739 EEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEI 783
Score = 31.1 bits (70), Expect = 0.34
Identities = 17/47 (36%), Positives = 21/47 (44%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
E P E E E EE E +E + E E + ER+G EGE
Sbjct: 650 ERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGE 696
Score = 30.7 bits (69), Expect = 0.36
Identities = 15/31 (48%), Positives = 20/31 (64%)
Query: 129 EEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
+ +EEEEEEE EE ++ E E E +E E E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 30.7 bits (69), Expect = 0.38
Identities = 22/67 (32%), Positives = 27/67 (40%), Gaps = 9/67 (13%)
Query: 116 RERFLEPPNEEEEE---EDEEEEEEEAEETQQGETTEAERQ-----EGEGGEREGGGEGE 167
R E N EE E E E E + E +GE AER+ EGE +E +GE
Sbjct: 651 RPTEAEGENGEESGGEAEQEGETETKGENESEGE-IPAERKGEQEGEGEIEAKEADHKGE 709
Query: 168 ISECSNE 174
E
Sbjct: 710 TEAEEVE 716
Score = 29.6 bits (66), Expect = 0.86
Identities = 20/48 (41%), Positives = 22/48 (45%), Gaps = 1/48 (2%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEI 168
E E E E E E E+ E + GE E EGE GE EG E E
Sbjct: 711 EAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGE-GEAEGKHEVET 757
Score = 29.6 bits (66), Expect = 1.1
Identities = 15/31 (48%), Positives = 18/31 (58%)
Query: 131 DEEEEEEEAEETQQGETTEAERQEGEGGERE 161
D EEEEEE EE ++ E E E +E E E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 29.6 bits (66), Expect = 1.1
Identities = 19/47 (40%), Positives = 24/47 (51%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISE 170
E E E E +E+E+ E Q GE E + EG G+ E GE E E
Sbjct: 765 EHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGE 811
Score = 29.2 bits (65), Expect = 1.2
Identities = 19/52 (36%), Positives = 24/52 (46%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
E+E E + EE EE E+ +GE E EG +E EGE ED
Sbjct: 726 TEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKED 777
Score = 29.2 bits (65), Expect = 1.3
Identities = 16/51 (31%), Positives = 21/51 (41%), Gaps = 4/51 (7%)
Query: 121 EPPNEEE---EEEDEEEEEEEAEETQQGET-TEAERQEGEGGEREGGGEGE 167
E E E E E E E E + Q+GE EA+ + +G E E
Sbjct: 668 EQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHE 718
Score = 29.2 bits (65), Expect = 1.3
Identities = 14/52 (26%), Positives = 20/52 (38%)
Query: 123 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNE 174
E E + E E E E ++GE E + + +G E E E E
Sbjct: 669 QEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGE 720
Score = 29.2 bits (65), Expect = 1.3
Identities = 14/42 (33%), Positives = 17/42 (40%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 162
E E E EDE E E E + + E E + E EG
Sbjct: 718 EGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEG 759
Score = 29.2 bits (65), Expect = 1.5
Identities = 17/54 (31%), Positives = 24/54 (44%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNE 174
E E E E+ EE E+E E +G+ + + E EG E E E +E
Sbjct: 727 EDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDE 780
Score = 28.8 bits (64), Expect = 1.8
Identities = 17/60 (28%), Positives = 26/60 (43%)
Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
+E E E E +EDE+E E +A E + + E + E GE + E +E
Sbjct: 762 KETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSET 821
Score = 28.8 bits (64), Expect = 1.9
Identities = 21/66 (31%), Positives = 29/66 (43%), Gaps = 15/66 (22%)
Query: 125 EEEEEEDEEEEEEEAEETQQ------GETTEAE---------RQEGEGGEREGGGEGEIS 169
E E E E++E E ETQ ET E E +Q+ +G + GG +G S
Sbjct: 804 EGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDS 863
Query: 170 ECSNED 175
E E+
Sbjct: 864 EEEEEE 869
Score = 28.4 bits (63), Expect = 2.1
Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEG-GEREGGGEGEI 168
E E E + + E E + E +GET EAE +E E GE + G +GE+
Sbjct: 744 EGEGEAEGKHEVETEGDRKETEHEGET-EAEGKEDEDEGEIQAGEDGEM 791
Score = 28.4 bits (63), Expect = 2.6
Identities = 14/37 (37%), Positives = 19/37 (51%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
+ + EEEEE EE ++ E E E +E E E E
Sbjct: 855 GGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 28.0 bits (62), Expect = 3.0
Identities = 13/37 (35%), Positives = 18/37 (48%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
+ D + EEE EE ++ E E E +E E E E
Sbjct: 853 DGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 27.7 bits (61), Expect = 4.7
Identities = 16/43 (37%), Positives = 20/43 (46%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
E E + E E E E E + + E E Q GE GE +G E
Sbjct: 756 ETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAE 798
Score = 27.3 bits (60), Expect = 5.0
Identities = 12/40 (30%), Positives = 17/40 (42%)
Query: 128 EEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
E EE E E + GE + E ++ E +G E E
Sbjct: 643 ERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESE 682
Score = 27.3 bits (60), Expect = 6.0
Identities = 15/50 (30%), Positives = 21/50 (42%)
Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
E E E + E E E E + + + E E E + GE E E +E
Sbjct: 670 EGETETKGENESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEG 719
Score = 27.3 bits (60), Expect = 6.4
Identities = 16/48 (33%), Positives = 21/48 (43%)
Query: 120 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
L ++ + E E E EE ++ E E E GGE E GE E
Sbjct: 627 LGDLSKGDVAEAEHTGERTGEEGERPTEAEGENGEESGGEAEQEGETE 674
Score = 26.9 bits (59), Expect = 6.5
Identities = 19/46 (41%), Positives = 23/46 (50%), Gaps = 13/46 (28%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEIS 169
+EEEEEE+EEEEEEE +E E E E E +S
Sbjct: 863 SEEEEEEEEEEEEEE-------------EEEEEEEEEEEENEEPLS 895
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 35.8 bits (83), Expect = 0.008
Identities = 14/57 (24%), Positives = 21/57 (36%), Gaps = 3/57 (5%)
Query: 96 TRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAER 152
RLV+ R AT PP E+D++ E+E E+ + E
Sbjct: 247 ARLVLAPR---ATRLPAPPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEEI 300
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 35.4 bits (82), Expect = 0.011
Identities = 20/43 (46%), Positives = 24/43 (55%)
Query: 114 RLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
R R R PP+ E EE+EEEEEE EE ++ E E E E
Sbjct: 291 RRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 32.3 bits (74), Expect = 0.12
Identities = 19/43 (44%), Positives = 24/43 (55%)
Query: 111 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ 153
R +R R PP EEEEE+EEE EE EE ++ E E +
Sbjct: 291 RRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 31.5 bits (72), Expect = 0.22
Identities = 20/46 (43%), Positives = 24/46 (52%)
Query: 111 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
R+ R R PP E E+EEEEEEE E ++ E E ER E
Sbjct: 287 RLPLRRRRRRRPPPSPPEPEEEEEEEEEVPEEEEEEEEEEERTFEE 332
>gnl|CDD|240331 PTZ00254, PTZ00254, 40S ribosomal protein SA; Provisional.
Length = 249
Score = 34.3 bits (79), Expect = 0.019
Identities = 12/41 (29%), Positives = 17/41 (41%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 165
+ EE E++EE E + AE +EGE E
Sbjct: 206 DPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNE 246
Score = 32.3 bits (74), Expect = 0.084
Identities = 13/41 (31%), Positives = 17/41 (41%)
Query: 127 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
+ EE EE+EE AE + A + EG E E
Sbjct: 206 DPEEAEEKEEAAAETAGVEDAAAAAAEAEEGEEWVAAANNE 246
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 34.1 bits (78), Expect = 0.027
Identities = 34/103 (33%), Positives = 43/103 (41%), Gaps = 14/103 (13%)
Query: 75 ELETRVRLSKRRQKV---GAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEED 131
E E R R + RQ + P+ P A++ E EE EE+
Sbjct: 400 EEERRKRQERERQGTSSRSSDPSKASSTSGESPSMASQ-----------ESEEEESVEEE 448
Query: 132 EEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNE 174
EEEEEEE EE Q+ E E E +E E G E E S+E
Sbjct: 449 EEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSE 491
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 32.6 bits (75), Expect = 0.075
Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 1/52 (1%)
Query: 109 EFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 160
E R R LE ++++EE DEE+EE E + + + + + ER
Sbjct: 113 EPRYDDA-YRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRER 163
Score = 30.0 bits (68), Expect = 0.62
Identities = 9/52 (17%), Positives = 28/52 (53%)
Query: 110 FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
R + + + +++++E+ +EE+EE+ +++ E + + + + RE
Sbjct: 111 QDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRE 162
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 32.8 bits (75), Expect = 0.085
Identities = 15/42 (35%), Positives = 25/42 (59%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEG 166
EEEE+E+EEEEE+E E + + + E +E + + +G
Sbjct: 391 EEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432
Score = 32.0 bits (73), Expect = 0.15
Identities = 14/49 (28%), Positives = 23/49 (46%)
Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGG 164
+ EE+EEE+EEE+E+E + + E E + E + G
Sbjct: 384 ANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432
Score = 27.0 bits (60), Expect = 6.3
Identities = 15/48 (31%), Positives = 26/48 (54%)
Query: 120 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
+E N E ++E+EE+EEEE EE + ++ + E E + + E
Sbjct: 381 IEDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYE 428
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 32.5 bits (74), Expect = 0.090
Identities = 15/55 (27%), Positives = 30/55 (54%), Gaps = 3/55 (5%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
E ++ EED +EE+ + ++ E +EA R+E EG + E + +E ++ +
Sbjct: 220 EETGDDGIEEDADEEDGDDDQPDNNEDSEAGREESEGSD---ESEEDEAEATDGE 271
Score = 32.1 bits (73), Expect = 0.13
Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 1/52 (1%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
NE+ E EE E + E + E T+ E +EGE E + E S+ S+ED
Sbjct: 244 NEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSE-SDESDED 294
Score = 30.6 bits (69), Expect = 0.42
Identities = 18/52 (34%), Positives = 28/52 (53%), Gaps = 4/52 (7%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
++ + ED E EE+E + + E EAE +GEG E GE + +E S +
Sbjct: 239 DQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEE----GEMDAAEASEDS 286
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 32.4 bits (74), Expect = 0.097
Identities = 18/41 (43%), Positives = 27/41 (65%)
Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
+E E P+EEEEEE EE++EEE + T + E + E ++ E
Sbjct: 29 KEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEE 69
Score = 26.7 bits (59), Expect = 8.5
Identities = 15/38 (39%), Positives = 22/38 (57%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 162
E E+E +EEEEEE EE ++ E +++E E E
Sbjct: 30 EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEK 67
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 32.1 bits (74), Expect = 0.12
Identities = 14/21 (66%), Positives = 15/21 (71%)
Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
EEEEEE+EEEEEEE E
Sbjct: 298 AAAAEEEEEEEEEEEEEEPSE 318
Score = 31.0 bits (71), Expect = 0.31
Identities = 14/20 (70%), Positives = 16/20 (80%)
Query: 122 PPNEEEEEEDEEEEEEEAEE 141
EEEEEE+EEEEEE +EE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEE 319
Score = 30.6 bits (70), Expect = 0.32
Identities = 13/19 (68%), Positives = 14/19 (73%)
Query: 121 EPPNEEEEEEDEEEEEEEA 139
E EEEEEE+EE EEEA
Sbjct: 303 EEEEEEEEEEEEEPSEEEA 321
Score = 30.2 bits (69), Expect = 0.56
Identities = 13/21 (61%), Positives = 14/21 (66%)
Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
EEEEEE+EEEEE EE
Sbjct: 300 AAEEEEEEEEEEEEEEPSEEE 320
Score = 29.1 bits (66), Expect = 1.2
Identities = 12/20 (60%), Positives = 13/20 (65%)
Query: 121 EPPNEEEEEEDEEEEEEEAE 140
E EEEEEE+E EEE A
Sbjct: 304 EEEEEEEEEEEEPSEEEAAA 323
Score = 27.9 bits (63), Expect = 2.6
Identities = 12/20 (60%), Positives = 13/20 (65%)
Query: 122 PPNEEEEEEDEEEEEEEAEE 141
EEE+EEEEEEE EE
Sbjct: 296 AQAAAAEEEEEEEEEEEEEE 315
Score = 27.9 bits (63), Expect = 3.2
Identities = 12/21 (57%), Positives = 14/21 (66%)
Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
+ EEEEE+EEEEEEE
Sbjct: 297 QAAAAEEEEEEEEEEEEEEPS 317
Score = 27.5 bits (62), Expect = 4.3
Identities = 12/21 (57%), Positives = 13/21 (61%)
Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
EEEE+EEEEEEE E
Sbjct: 296 AQAAAAEEEEEEEEEEEEEEP 316
>gnl|CDD|220776 pfam10483, Hap2_elong, Histone acetylation protein 2. Hap2 is one
of three histone acetyltransferases proteins that, in
yeasts, are found associated with elongating forms of
RNA polymerase II (Elongator). The Haps can be isolated
in two forms, as a six-subunit complex with Elongator
and as a complex of the three proteins on their own. The
role of the Hap complex in transcription is still
speculative, being possibly to keep the HAT activity of
free Elongator in check, allowing histone acetylation
only in the presence of a transcribing polymerase, or
the interaction with Haps might render Elongator
susceptible to modifications thereby altering its
activity.
Length = 280
Score = 31.5 bits (72), Expect = 0.17
Identities = 29/116 (25%), Positives = 43/116 (37%), Gaps = 30/116 (25%)
Query: 75 ELETRVR-LSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQR---LRERFLEPPNE----- 125
+ E R LS+ + N +V+ E R + L E F+ P
Sbjct: 165 DKEALDRSLSEPVFPLNEGLNGVGIVLE------LENRRKSGRSLTETFILSPATHEYES 218
Query: 126 -EEEEEDEEEEEEEAEETQQGETT------EAERQEGEG--------GEREGGGEG 166
+E+ E + E EE E+ + TT E +R E E+ GGGEG
Sbjct: 219 LKEKPEGDAEAAEEDPESLEDLTTFNLGLSEKQRLAREQVELPFFDAQEKLGGGEG 274
>gnl|CDD|233683 TIGR02005, PTS-IIBC-alpha, PTS system, alpha-glucoside-specific
IIBC component. This model represents a family of fused
PTS enzyme II B and C domains. A gene from Clostridium
has been partially characterized as a maltose
transporter, while genes from Fusobacterium and
Klebsiella have been proposed to transport the five
non-standard isomers of sucrose [Transport and binding
proteins, Carbohydrates, organic alcohols, and acids].
Length = 524
Score = 31.3 bits (71), Expect = 0.20
Identities = 15/69 (21%), Positives = 31/69 (44%), Gaps = 6/69 (8%)
Query: 110 FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG-----GG 164
FR L+ P E+ EEE + + + + ++G++ E++R + GG
Sbjct: 402 FRFLILKFNIPTPGREDTEEEVKLYSKADYK-AKKGDSVESKRASSGNDKAAYFLQALGG 460
Query: 165 EGEISECSN 173
+ I + +N
Sbjct: 461 KENIKDVTN 469
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 31.2 bits (71), Expect = 0.24
Identities = 18/82 (21%), Positives = 30/82 (36%), Gaps = 8/82 (9%)
Query: 97 RLVVTHR----PLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEE----AEETQQGETT 148
LV+ HR P + + ++ E E P + E+ DE E++ E G
Sbjct: 302 ELVLPHRRRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEGEDDAEQSGPRGHPTPGNDD 361
Query: 149 EAERQEGEGGEREGGGEGEISE 170
E E E + +G +
Sbjct: 362 EKEPDPQEEADGQGSSTDPAGD 383
>gnl|CDD|224530 COG1614, CdhC, CO dehydrogenase/acetyl-CoA synthase beta subunit
[Energy production and conversion].
Length = 470
Score = 30.9 bits (70), Expect = 0.28
Identities = 18/42 (42%), Positives = 21/42 (50%), Gaps = 4/42 (9%)
Query: 118 RFLE----PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG 155
FL+ P E EE+EEEEEEE EE + E E G
Sbjct: 388 EFLKEKGHPVVERWAEEEEEEEEEEEEEAAEAEAPMEEPVPG 429
Score = 30.2 bits (68), Expect = 0.61
Identities = 15/34 (44%), Positives = 18/34 (52%)
Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEA 150
ER+ E EEEEEE+EE E EA + E
Sbjct: 399 ERWAEEEEEEEEEEEEEAAEAEAPMEEPVPGFEV 432
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 30.9 bits (70), Expect = 0.30
Identities = 15/55 (27%), Positives = 27/55 (49%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
NE+E++ E+E++++ EE + G + E GE E +E S +D
Sbjct: 221 SEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADD 275
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 30.8 bits (70), Expect = 0.39
Identities = 19/40 (47%), Positives = 25/40 (62%), Gaps = 4/40 (10%)
Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEG 155
R ++ P ++EEEE E+EEEEE EE E E E +EG
Sbjct: 338 RCTWVNPEQKDEEEEQEDEEEEEEEE----EPEEPEPEEG 373
Score = 26.6 bits (59), Expect = 8.1
Identities = 17/40 (42%), Positives = 22/40 (55%)
Query: 127 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEG 166
E E+EEEEEE E ++ E E E +E E +E G G
Sbjct: 168 REGEEEEEEEEVGEADEEDEGEEEEEEEPEEVPKEEEGTG 207
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 30.5 bits (69), Expect = 0.44
Identities = 8/29 (27%), Positives = 14/29 (48%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTE 149
P E EEE DE ++ + + + + E
Sbjct: 280 PEPPEPEEEPDEPDQTDPDDGEETDQIPE 308
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 30.3 bits (68), Expect = 0.45
Identities = 12/18 (66%), Positives = 16/18 (88%)
Query: 121 EPPNEEEEEEDEEEEEEE 138
+ +EE+EEE+EEEEEEE
Sbjct: 225 DSESEEDEEEEEEEEEEE 242
Score = 29.5 bits (66), Expect = 0.88
Identities = 13/17 (76%), Positives = 15/17 (88%)
Query: 125 EEEEEEDEEEEEEEAEE 141
E EE+E+EEEEEEE EE
Sbjct: 227 ESEEDEEEEEEEEEEEE 243
Score = 29.1 bits (65), Expect = 1.1
Identities = 12/17 (70%), Positives = 14/17 (82%)
Query: 122 PPNEEEEEEDEEEEEEE 138
E+EEEE+EEEEEEE
Sbjct: 227 ESEEDEEEEEEEEEEEE 243
Score = 28.0 bits (62), Expect = 2.6
Identities = 12/20 (60%), Positives = 14/20 (70%)
Query: 121 EPPNEEEEEEDEEEEEEEAE 140
EE+EE+EEEEEEE E
Sbjct: 224 SDSESEEDEEEEEEEEEEEE 243
Score = 26.4 bits (58), Expect = 7.9
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQ 153
+ E EEDEEEEEEE EE + T +R+
Sbjct: 225 DSESEEDEEEEEEEEEEEEIDVVTVEKRR 253
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 29.2 bits (66), Expect = 0.53
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGE 146
EE++EE+EEEEE+E E +
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 26.5 bits (59), Expect = 4.5
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQG 145
EE++E+EEEEEE+ E ++
Sbjct: 73 AAAAAEEKKEEEEEEEEKEESEEEA 97
Score = 26.1 bits (58), Expect = 5.6
Identities = 12/25 (48%), Positives = 16/25 (64%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQG 145
++EEEE+EEE+EE EE G
Sbjct: 76 AAEEKKEEEEEEEEKEESEEEAAAG 100
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 29.3 bits (66), Expect = 0.61
Identities = 13/35 (37%), Positives = 22/35 (62%)
Query: 127 EEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
++E+EE EEE +ET+Q + E + + E +RE
Sbjct: 94 SDDEEEETEEESTDETEQEDPPETKTESKEKKKRE 128
Score = 27.4 bits (61), Expect = 2.8
Identities = 12/33 (36%), Positives = 21/33 (63%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
+EEEE E+E +E E E+ + +T E+++ E
Sbjct: 96 DEEEETEEESTDETEQEDPPETKTESKEKKKRE 128
Score = 26.6 bits (59), Expect = 5.2
Identities = 11/32 (34%), Positives = 19/32 (59%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
++EEEE EEE +E E+ ET +++ +
Sbjct: 95 DDEEEETEEESTDETEQEDPPETKTESKEKKK 126
Score = 26.6 bits (59), Expect = 6.1
Identities = 12/35 (34%), Positives = 19/35 (54%)
Query: 118 RFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAER 152
R L ++EEEE +EE +E +E TE++
Sbjct: 89 RELSSSDDEEEETEEESTDETEQEDPPETKTESKE 123
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 30.2 bits (68), Expect = 0.63
Identities = 10/45 (22%), Positives = 16/45 (35%)
Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
+ + E E E + E+ EE E+ E E + R
Sbjct: 663 DECEAIEDSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRA 707
Score = 27.9 bits (62), Expect = 3.3
Identities = 15/36 (41%), Positives = 25/36 (69%), Gaps = 1/36 (2%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQE-GEGGE 159
EEE+E+D EE +++ +E + E +E+E + GE GE
Sbjct: 648 EEEDEDDLEETDDDEDECEAIEDSESESESDGEDGE 683
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 30.2 bits (68), Expect = 0.66
Identities = 14/54 (25%), Positives = 22/54 (40%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNE 174
E N + EE++EEE E + ++ + E + E E E S E
Sbjct: 3 EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAE 56
Score = 29.0 bits (65), Expect = 1.7
Identities = 12/51 (23%), Positives = 20/51 (39%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
EEE+ + EEE+ EE + + + + E E E + E
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAES 51
Score = 27.8 bits (62), Expect = 3.2
Identities = 11/51 (21%), Positives = 18/51 (35%)
Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
E + EEE+E+E E + + + E E E + E
Sbjct: 2 EEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESP 52
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 29.9 bits (67), Expect = 0.70
Identities = 14/51 (27%), Positives = 25/51 (49%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNE 174
NE+EE+ + E E +E Q+ E T+ E + GE + + E ++
Sbjct: 223 NEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDD 273
Score = 29.5 bits (66), Expect = 0.91
Identities = 14/38 (36%), Positives = 21/38 (55%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 162
E E+E++ E E E +E + +EA +E E GE E
Sbjct: 221 TENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEM 258
Score = 28.8 bits (64), Expect = 1.9
Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 7/53 (13%)
Query: 123 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
P E E+EE E E E E + E+ +R+ GE E+ + +D
Sbjct: 220 PTENEQEEQGEGEGEGQEGSAPQESEATDRESES-------GEEEMVQSDQDD 265
Score = 26.8 bits (59), Expect = 8.3
Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 1/51 (1%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
EE ++ E E+EE+ ++ Q E + E+ EGEG +EG E SE ++ +
Sbjct: 201 EEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQE-SEATDRE 250
Score = 26.4 bits (58), Expect = 10.0
Identities = 12/45 (26%), Positives = 20/45 (44%)
Query: 131 DEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
+ EE + E++ E + ++ E +G GEGE E S
Sbjct: 198 ELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQ 242
>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
Tim54. Mitochondrial function depends on the import of
hundreds of different proteins synthesised in the
cytosol. Protein import is a multi-step pathway which
includes the binding of precursor proteins to surface
receptors, translocation of the precursor across one or
both mitochondrial membranes, and folding and assembly
of the imported protein inside the mitochondrion. Most
precursor proteins carry amino-terminal targeting
signals, called pre-sequences, and are imported into
mitochondria via import complexes located in both the
outer and the inner membrane (IM). The IM complex, TIM,
is made up of at least two proteins which mediate
translocation of proteins into the matrix by removing
their signal peptide and another pair of proteins, Tim54
and Tim22, that insert the polytopic proteins, that
carry internal targetting information, into the inner
membrane.
Length = 377
Score = 29.7 bits (67), Expect = 0.71
Identities = 17/50 (34%), Positives = 28/50 (56%), Gaps = 2/50 (4%)
Query: 111 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQ--GETTEAERQEGEGG 158
RI++LR + E P EE E+E +E++ + +Q G +A+ G GG
Sbjct: 122 RIRKLRRKAGEEPEEELPLEEEATKEDDLADPRQVLGVYYKAKDPPGAGG 171
Score = 28.9 bits (65), Expect = 1.4
Identities = 13/48 (27%), Positives = 17/48 (35%)
Query: 120 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
L+PP E DE E E E T E+ +E E +
Sbjct: 195 LDPPEPPEPTVDEAAPETEVEATPAAESPAEPAEETAETTPEETEDAP 242
>gnl|CDD|235302 PRK04456, PRK04456, acetyl-CoA decarbonylase/synthase complex
subunit beta; Reviewed.
Length = 463
Score = 29.7 bits (67), Expect = 0.73
Identities = 14/29 (48%), Positives = 15/29 (51%)
Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQE 154
EEE+EEEEEEE EE E E
Sbjct: 403 AAEEEEEEEEEEEEEEEPVAEVMMMPAPE 431
Score = 29.7 bits (67), Expect = 0.76
Identities = 13/18 (72%), Positives = 14/18 (77%)
Query: 125 EEEEEEDEEEEEEEAEET 142
EEEE+EEEEEEE EE
Sbjct: 403 AAEEEEEEEEEEEEEEEP 420
Score = 29.7 bits (67), Expect = 0.92
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEA 150
EEEEE+EEEEEEE E + A
Sbjct: 404 AEEEEEEEEEEEEEEEPVAEVMMMPA 429
Score = 28.5 bits (64), Expect = 1.9
Identities = 15/36 (41%), Positives = 18/36 (50%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 160
EEEEEE+EEEEEEE + GG +
Sbjct: 406 EEEEEEEEEEEEEEPVAEVMMMPAPEMQLPASGGIK 441
Score = 27.7 bits (62), Expect = 3.2
Identities = 15/28 (53%), Positives = 17/28 (60%)
Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQ 144
ER+ EEEEEE+EEEEE AE
Sbjct: 400 ERWAAEEEEEEEEEEEEEEEPVAEVMMM 427
>gnl|CDD|212055 cd11486, SLC5sbd_SGLT1, Na(+)/glucose cotransporter SGLT1;solute
binding domain. Human SGLT1 (hSGLT1) is a
high-affinity/low-capacity glucose transporter, which
can also transport galactose. In the transport
mechanism, two Na+ ions first bind to the extracellular
side of the transporter and induce a conformational
change in the glucose binding site. This results in an
increased affinity for glucose. A second conformational
change in the transporter follows, bringing the Na+ and
glucose binding sites to the inner surface of the
membrane. Glucose is then released, followed by the Na+
ions. In the process, hSGLT1 is also able to transport
water and urea and may be a major pathway for transport
of these across the intestinal brush-border membrane.
hSGLT1 is encoded by the SLC5A1 gene and expressed
mostly in the intestine, but also in the trachea,
kidney, heart, brain, testis, and prostate. The
WHO/UNICEF oral rehydration solution (ORS) for the
treatment of secretory diarrhea contains salt and
glucose. The glucose, along with sodium ions, is
transported by hSGLT1 and water is either co-transported
along with these or follows by osmosis. Mutations in
SGLT1 are associated with intestinal glucose galactose
malabsorption (GGM). Up-regulation of intestinal SGLT1
may protect against enteric infections. SGLT1 is
expressed in colorectal, head and neck, and prostate
tumors. Epidermal growth factor receptor (EGFR)
functions in cell survival by stabilizing SGLT1, and
thereby maintaining intracellular glucose levels. SGLT1
is predicted to have 14 membrane-spanning regions. This
subgroup belongs to the solute carrier 5
(SLC5)transporter family.
Length = 635
Score = 29.5 bits (66), Expect = 0.94
Identities = 8/31 (25%), Positives = 16/31 (51%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
N EE D + ++ +E + T+ ER++
Sbjct: 539 NSTEERIDLDADDWTEDEDENEMETDEERKK 569
Score = 29.1 bits (65), Expect = 1.3
Identities = 14/40 (35%), Positives = 21/40 (52%), Gaps = 7/40 (17%)
Query: 106 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQG 145
N+TE RI L+ ++ E+EDE E E + E + G
Sbjct: 539 NSTEERID------LDA-DDWTEDEDENEMETDEERKKPG 571
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 29.6 bits (66), Expect = 0.99
Identities = 8/47 (17%), Positives = 15/47 (31%), Gaps = 1/47 (2%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
PN++ + E+ EE + +E E + E
Sbjct: 157 PSPNQQPSSFLQPSHEDSPEEPEPPT-SEPEPDSPGPPQSETPTSSP 202
Score = 28.0 bits (62), Expect = 3.2
Identities = 13/57 (22%), Positives = 22/57 (38%), Gaps = 3/57 (5%)
Query: 117 ERFLEPPNEEEEEEDE---EEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISE 170
FL+P +E+ EE E E E ++ Q ET + + G + +
Sbjct: 164 SSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQ 220
Score = 27.6 bits (61), Expect = 3.6
Identities = 13/45 (28%), Positives = 18/45 (40%)
Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISE 170
E+ E + E AEE++ GE E + G E G S
Sbjct: 63 EQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSP 107
Score = 27.6 bits (61), Expect = 3.8
Identities = 7/47 (14%), Positives = 13/47 (27%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
P+E E + + + E +E E E +
Sbjct: 144 PGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSP 190
Score = 27.2 bits (60), Expect = 4.7
Identities = 2/47 (4%), Positives = 4/47 (8%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
P + + E
Sbjct: 126 SSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHE 172
Score = 26.9 bits (59), Expect = 6.7
Identities = 4/51 (7%), Positives = 6/51 (11%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
EE ++ G E
Sbjct: 104 TPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPES 154
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 29.6 bits (66), Expect = 1.1
Identities = 12/42 (28%), Positives = 23/42 (54%), Gaps = 4/42 (9%)
Query: 125 EEEEEEDEEEEEEEA----EETQQGETTEAERQEGEGGEREG 162
E++E+ +E+ EE E T+ G ++ E ++GE E +
Sbjct: 4057 EDDEKMNEDGFEENVQENEESTEDGVKSDEELEQGEVPEDQA 4098
Score = 27.3 bits (60), Expect = 5.7
Identities = 8/37 (21%), Positives = 15/37 (40%)
Query: 123 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
E+++ +E+E+EE G E + E
Sbjct: 3942 NKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNS 3978
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 29.2 bits (65), Expect = 1.1
Identities = 6/34 (17%), Positives = 9/34 (26%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGG 158
E EE +E + E E+
Sbjct: 392 EAASAAPEEPAALASEAHDETEPEVPEKAAPIPD 425
Score = 28.4 bits (63), Expect = 2.4
Identities = 3/42 (7%), Positives = 11/42 (26%)
Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
+ + + E + ++ +E + E
Sbjct: 382 QAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPI 423
>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
[Transcription / Chromatin structure and dynamics].
Length = 279
Score = 29.2 bits (65), Expect = 1.1
Identities = 17/52 (32%), Positives = 21/52 (40%), Gaps = 1/52 (1%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGE-TTEAERQEGEGGEREGGGEGEISEC 171
E EE + EEEE+EE GE E +E E E GE +
Sbjct: 178 EEERLEESDGREEEEDEEVGSDSYGEGNRELNEEEEEEAEGSDDGEDVVDYE 229
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 28.4 bits (64), Expect = 1.2
Identities = 9/24 (37%), Positives = 22/24 (91%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGET 147
+EE+EEE+++EE+++ +E+++ E+
Sbjct: 117 DEEDEEEEDDEEDDDEDESEEEES 140
Score = 27.7 bits (62), Expect = 2.0
Identities = 9/25 (36%), Positives = 20/25 (80%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGETT 148
++EE+EE+E++EE++ E+ + E +
Sbjct: 116 DDEEDEEEEDDEEDDDEDESEEEES 140
Score = 26.9 bits (60), Expect = 4.2
Identities = 10/20 (50%), Positives = 16/20 (80%)
Query: 125 EEEEEEDEEEEEEEAEETQQ 144
EE++EED++E+E E EE+
Sbjct: 123 EEDDEEDDDEDESEEEESPV 142
>gnl|CDD|221775 pfam12794, MscS_TM, Mechanosensitive ion channel inner membrane
domain 1. The small mechanosensitive channel, MscS, is
a part of the turgor-driven solute efflux system that
protects bacteria from lysis in the event of osmotic
shock. The MscS protein alone is sufficient to form a
functional mechanosensitive channel gated directly by
tension in the lipid bilayer. The MscS proteins are
heptamers of three transmembrane subunits with seven
converging M3 domains, and this domain is one of the
inner membrane domains.
Length = 339
Score = 28.7 bits (65), Expect = 1.3
Identities = 11/37 (29%), Positives = 17/37 (45%), Gaps = 4/37 (10%)
Query: 111 RI--QRLRER--FLEPPNEEEEEEDEEEEEEEAEETQ 143
R+ +R + + + EEEEE E E EE +
Sbjct: 257 RLAYERAKAKRAEILAQRAEEEEESSEGAAETIEEPE 293
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 29.2 bits (65), Expect = 1.4
Identities = 16/44 (36%), Positives = 24/44 (54%)
Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
R+ + EEEEE E EE+EE E + G+ + + +E E E
Sbjct: 131 RKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEE 174
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 28.2 bits (63), Expect = 1.4
Identities = 13/42 (30%), Positives = 21/42 (50%)
Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
++EEEEEEE E+ QQ + + ++ + E G E
Sbjct: 92 GHTRQEEEEEEEENEKQQQSDEAQVQQHQQHEHPAENGEESA 133
Score = 27.5 bits (61), Expect = 2.8
Identities = 10/19 (52%), Positives = 17/19 (89%)
Query: 125 EEEEEEDEEEEEEEAEETQ 143
EEEEEE+E E++++++E Q
Sbjct: 97 EEEEEEEENEKQQQSDEAQ 115
>gnl|CDD|235562 PRK05687, fliH, flagellar assembly protein H; Validated.
Length = 246
Score = 28.4 bits (64), Expect = 1.7
Identities = 12/47 (25%), Positives = 14/47 (29%)
Query: 120 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEG 166
PP + E EE EE T E + EG G
Sbjct: 30 PPPPEPAAPPPEPPEPEEVPEEEAAPLTEEELEAIRQQAHEEGFAAG 76
>gnl|CDD|227563 COG5238, RNA1, Ran GTPase-activating protein (RanGAP) involved in
mRNA processing and transport [Signal transduction
mechanisms / RNA processing and modification].
Length = 388
Score = 28.7 bits (64), Expect = 1.8
Identities = 10/35 (28%), Positives = 17/35 (48%)
Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 160
+EE ++E E +E+ + E E Q + ER
Sbjct: 348 TDEESEDEVEIDESVIEEVAEMELLEVQVDDLAER 382
>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
Length = 869
Score = 28.8 bits (65), Expect = 1.8
Identities = 17/40 (42%), Positives = 20/40 (50%), Gaps = 1/40 (2%)
Query: 120 LEP-PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGG 158
LEP P EE EEE EE EE E + E + G+ G
Sbjct: 828 LEPLPYEEPEEETAEEPEEVDPEDVKSEDDIDDEDTGQLG 867
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 28.3 bits (64), Expect = 2.0
Identities = 15/29 (51%), Positives = 19/29 (65%)
Query: 113 QRLRERFLEPPNEEEEEEDEEEEEEEAEE 141
Q++RE FL+ EEEEE+EEEE E
Sbjct: 517 QKIREEFLQKHKNEEEEEEEEEELPLIPE 545
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP), alpha
subunit. The alpha-subunit of the TRAP complex (TRAP
alpha) is a single-spanning membrane protein of the
endoplasmic reticulum (ER) which is found in proximity
of nascent polypeptide chains translocating across the
membrane.
Length = 281
Score = 28.2 bits (63), Expect = 2.1
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 122 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
+EE E++ +E+EE+ ++ E E +E E GE
Sbjct: 35 TEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
Score = 27.0 bits (60), Expect = 5.2
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 119 FLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
+ E++ +++EE+E EE + T E E +EGE
Sbjct: 35 TEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 28.4 bits (63), Expect = 2.1
Identities = 15/57 (26%), Positives = 23/57 (40%)
Query: 84 KRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAE 140
++ A + L + PL I + +E N EE E +EE E+ E E
Sbjct: 339 EKWGNEEAERKDYILDSSSVPLEKQFDDILYFEKMEIENRNPEESEHEEEVEDYEDE 395
>gnl|CDD|215914 pfam00428, Ribosomal_60s, 60s Acidic ribosomal protein. This
family includes archaebacterial L12, eukaryotic P0, P1
and P2.
Length = 88
Score = 26.8 bits (60), Expect = 2.2
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 121 EPPNEEEEEEDEEEEEEEA 139
EEE++E+EEEEEE+
Sbjct: 63 AAAAEEEKKEEEEEEEEDD 81
Score = 26.4 bits (59), Expect = 3.9
Identities = 10/17 (58%), Positives = 13/17 (76%)
Query: 125 EEEEEEDEEEEEEEAEE 141
EEE+ EEEEEEE ++
Sbjct: 65 AAEEEKKEEEEEEEEDD 81
Score = 26.0 bits (58), Expect = 4.4
Identities = 8/20 (40%), Positives = 11/20 (55%)
Query: 122 PPNEEEEEEDEEEEEEEAEE 141
E+E++EEEE EE
Sbjct: 59 AAAAAAAAEEEKKEEEEEEE 78
Score = 25.7 bits (57), Expect = 6.6
Identities = 10/18 (55%), Positives = 13/18 (72%)
Query: 121 EPPNEEEEEEDEEEEEEE 138
EEE+++EEEEEEE
Sbjct: 62 AAAAAEEEKKEEEEEEEE 79
>gnl|CDD|233055 TIGR00617, rpa1, replication factor-a protein 1 (rpa1). All
proteins in this family for which functions are known
are part of a multiprotein complex made up of homologs
of RPA1, RPA2 and RPA3 that bind ssDNA and function in
the recognition of DNA damage for nucleotide excision
repairThis family is based on the phylogenomic analysis
of JA Eisen (1999, Ph.D. Thesis, Stanford University)
[DNA metabolism, DNA replication, recombination, and
repair].
Length = 608
Score = 28.2 bits (63), Expect = 2.4
Identities = 15/65 (23%), Positives = 28/65 (43%), Gaps = 9/65 (13%)
Query: 2 DESGEQFVAYFLPTAETLDKRKLDQLSNLEYTDEQEYEYKMAR----EYNWNVKSKSCKG 57
DE+G+ +V F AE + + +L L+ D E+E Y + ++ K
Sbjct: 518 DETGQLWVTAFNDQAEQILGKSAAELGELKEEDPDEFEAIFQEAQFVPYIFRLRVK---- 573
Query: 58 YEENY 62
++ Y
Sbjct: 574 -QDTY 577
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 28.2 bits (63), Expect = 2.4
Identities = 16/57 (28%), Positives = 21/57 (36%), Gaps = 3/57 (5%)
Query: 107 ATEFRIQRLRERFLEPPNEEEEEED---EEEEEEEAEETQQGETTEAERQEGEGGER 160
E +QR E LE E EED E +E + + E + E E R
Sbjct: 393 EEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEEAR 449
Score = 27.8 bits (62), Expect = 3.4
Identities = 14/51 (27%), Positives = 25/51 (49%)
Query: 106 NATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
++ E+ + + E ++ EEEE + E EE E +Q E E + + E
Sbjct: 372 DSAEYDQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRE 422
Score = 26.3 bits (58), Expect = 10.0
Identities = 14/49 (28%), Positives = 15/49 (30%)
Query: 122 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISE 170
EEEE + E EE E Q E R E E E
Sbjct: 389 ISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDEL 437
>gnl|CDD|240327 PTZ00243, PTZ00243, ABC transporter; Provisional.
Length = 1560
Score = 28.2 bits (63), Expect = 2.6
Identities = 7/43 (16%), Positives = 16/43 (37%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEG 166
++ +E + + E + G + E + GG+G
Sbjct: 883 ENKDSKEGDADAEVAEVDAAPGGAVDHEPPVAKQEGNAEGGDG 925
Score = 27.1 bits (60), Expect = 6.6
Identities = 5/35 (14%), Positives = 11/35 (31%)
Query: 126 EEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 160
+ E + + A + + + EG G
Sbjct: 893 DAEVAEVDAAPGGAVDHEPPVAKQEGNAEGGDGAA 927
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 27.8 bits (62), Expect = 3.0
Identities = 16/59 (27%), Positives = 26/59 (44%), Gaps = 6/59 (10%)
Query: 118 RFLEPPNEEEEEEDEEEEEEEAEETQQGET------TEAERQEGEGGEREGGGEGEISE 170
R+ E P E+EEE+E+E++++ + E TE E G + E E
Sbjct: 130 RYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDE 188
>gnl|CDD|215518 PLN02959, PLN02959, aminoacyl-tRNA ligase.
Length = 1084
Score = 28.1 bits (63), Expect = 3.0
Identities = 12/46 (26%), Positives = 22/46 (47%)
Query: 101 THRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGE 146
T P+ A+ ++ R +++ PP EE+EDE A+ +
Sbjct: 94 TGMPIKASADKLAREIQQYGNPPVFPEEDEDEAAAVAAAKAEAEAA 139
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin
binding protein sp32 sequences. sp32 is a sperm specific
protein which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved
in packaging the acrosin zymogen into the acrosomal
matrix.
Length = 243
Score = 27.7 bits (61), Expect = 3.1
Identities = 14/35 (40%), Positives = 23/35 (65%)
Query: 112 IQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGE 146
+ +L+E E EE++ ++E+EEEE EE +Q E
Sbjct: 203 LSKLQEYLQEHKTEEKQPQEEQEEEEVEEEAKQEE 237
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 28.0 bits (63), Expect = 3.2
Identities = 13/42 (30%), Positives = 27/42 (64%)
Query: 120 LEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
++ +EEE+EEDE+ ++E+ EE ++ E + +++ E E
Sbjct: 349 VDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSE 390
Score = 26.5 bits (59), Expect = 9.9
Identities = 16/59 (27%), Positives = 31/59 (52%), Gaps = 1/59 (1%)
Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSNED 175
+ F EE+EEE+E+ ++E EE + E E ++ + + E E E S+ +++
Sbjct: 312 DNFGLGQGEEDEEEEEDGVDDEDEE-DDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDE 369
>gnl|CDD|235396 PRK05299, rpsB, 30S ribosomal protein S2; Provisional.
Length = 258
Score = 27.4 bits (62), Expect = 3.7
Identities = 13/30 (43%), Positives = 16/30 (53%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQE 154
E E+EEEE EE EE ++ E E E
Sbjct: 229 LAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
Score = 26.7 bits (60), Expect = 7.4
Identities = 18/34 (52%), Positives = 20/34 (58%)
Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTE 149
R+ L EEEEEE EEEEEEE EE + E
Sbjct: 225 RQGRLAEAAEEEEEEAEEEEEEEEEEEAEEAEAE 258
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 26.7 bits (59), Expect = 3.9
Identities = 12/21 (57%), Positives = 16/21 (76%)
Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
E E+EEE+ +EEE+EE EE
Sbjct: 76 EKAEEKEEEKKKEEEKEEEEE 96
Score = 25.5 bits (56), Expect = 8.3
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQG 145
+ +EEE++ EEE+EEE EE G
Sbjct: 77 KAEEKEEEKKKEEEKEEEEEEALAG 101
>gnl|CDD|237875 PRK14974, PRK14974, cell division protein FtsY; Provisional.
Length = 336
Score = 27.6 bits (62), Expect = 4.0
Identities = 17/57 (29%), Positives = 31/57 (54%)
Query: 114 RLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISE 170
+L+E+ + + EE+ +EEEEEE E ++ E + E ++ + G + EI E
Sbjct: 4 KLKEKLSKFVEKVEEKIEEEEEEEAPEAEEEEEEEDEEEKKEKPGFFDKAKITEIKE 60
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 26.5 bits (59), Expect = 4.1
Identities = 9/21 (42%), Positives = 15/21 (71%)
Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
E ++EE +EEEEEE+++
Sbjct: 76 AAAAEAKKEEKKEEEEEESDD 96
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 27.0 bits (60), Expect = 4.1
Identities = 15/48 (31%), Positives = 18/48 (37%)
Query: 107 ATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
E R + + E+ EE EEEEAEE E E E
Sbjct: 139 VNEARAEAVAEKKAAEAAAVAAEEAAAAEEEEAEEAPAEEAPAEESAE 186
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 27.2 bits (60), Expect = 4.1
Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 10/57 (17%)
Query: 122 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE----------GGGEGEI 168
PP +++E+ ++E E+ A+ ++ + + EGEG E E GGEGE+
Sbjct: 89 PPGGKKKEKKKKETEKPAQGGEKPDQGPEAKGEGEGHEPEDPPPEDTPPPPGGEGEV 145
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 26.5 bits (59), Expect = 4.2
Identities = 14/26 (53%), Positives = 16/26 (61%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGE 146
EEEEEE+EEEEEEE E +
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEAMA 98
Score = 25.4 bits (56), Expect = 9.2
Identities = 14/21 (66%), Positives = 15/21 (71%)
Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
EEEEEE+EEEEEEE E
Sbjct: 72 AAAAEEEEEEEEEEEEEEEES 92
>gnl|CDD|202496 pfam02981, FokI_N, Restriction endonuclease FokI, recognition
domain.
Length = 145
Score = 26.8 bits (59), Expect = 4.3
Identities = 11/42 (26%), Positives = 18/42 (42%)
Query: 76 LETRVRLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRE 117
+ V R ++ + N T L VT++ L T I+ E
Sbjct: 46 IPELVTELNLRYELKSVINQTPLSVTYKQLVGTGTSIRSKAE 87
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 26.9 bits (60), Expect = 4.3
Identities = 13/37 (35%), Positives = 21/37 (56%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
E EE DEEE+ +E E + + +R+E E ++E
Sbjct: 109 VEVEELDEEEQIDELLEKELAKLKREKRRENERKQKE 145
Score = 26.5 bits (59), Expect = 6.2
Identities = 14/40 (35%), Positives = 23/40 (57%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGER 160
E EEE E +E +EEE+ +E + E + +R++ ER
Sbjct: 102 EEEEEEEVEVEELDEEEQIDELLEKELAKLKREKRRENER 141
Score = 26.5 bits (59), Expect = 6.5
Identities = 17/45 (37%), Positives = 24/45 (53%)
Query: 110 FRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
+R + + L+ +EEEEE+E E EE EE Q E E E +
Sbjct: 86 WRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAK 130
>gnl|CDD|240285 PTZ00135, PTZ00135, 60S acidic ribosomal protein P0; Provisional.
Length = 310
Score = 27.3 bits (61), Expect = 4.9
Identities = 6/21 (28%), Positives = 8/21 (38%)
Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
EEEEEE ++
Sbjct: 283 AAAAAAAAAAPAEEEEEEEDD 303
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 26.2 bits (58), Expect = 4.9
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
E E +E E+EE+EEE EE
Sbjct: 78 EAAAEADEAEEEEKEEEAEEE 98
Score = 25.8 bits (57), Expect = 8.2
Identities = 11/21 (52%), Positives = 14/21 (66%)
Query: 121 EPPNEEEEEEDEEEEEEEAEE 141
E +E +EEE+EEEAEE
Sbjct: 77 AEAAAEADEAEEEEKEEEAEE 97
>gnl|CDD|129416 TIGR00316, cdhC, CO dehydrogenase/CO-methylating acetyl-CoA
synthase complex, beta subunit. Nomenclature follows
the description for Methanosarcina thermophila. The
CO-methylating acetyl-CoA synthase is considered the
defining enzyme of the Wood-Ljungdahl pathway, used for
acetate catabolism by sulfate reducing bacteria but for
acetate biosynthesis by acetogenic bacteria such as
oorella thermoacetica (f. Clostridium thermoaceticum)
[Energy metabolism, Chemoautotrophy].
Length = 458
Score = 27.1 bits (60), Expect = 5.2
Identities = 12/25 (48%), Positives = 14/25 (56%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTE 149
E +EEE EEEEE E + E E
Sbjct: 402 EVDEEEIEEEEEAMQPEEMEMEGFE 426
>gnl|CDD|182341 PRK10260, PRK10260, L,D-transpeptidase; Provisional.
Length = 306
Score = 26.9 bits (59), Expect = 5.3
Identities = 11/36 (30%), Positives = 18/36 (50%)
Query: 78 TRVRLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQ 113
TRV+ K P+ +R + H PL+ TE + +
Sbjct: 229 TRVQFIDEPVKATTEPDGSRYIEVHNPLSTTEAQFE 264
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 27.2 bits (61), Expect = 5.5
Identities = 16/43 (37%), Positives = 26/43 (60%)
Query: 117 ERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
+ F EEEEEE++E++EEE EE ++ E E +E + +
Sbjct: 433 KAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
Score = 26.8 bits (60), Expect = 7.6
Identities = 15/55 (27%), Positives = 26/55 (47%)
Query: 113 QRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGE 167
++ E E + + +EEEEEE +E ++ E E E + E E E + +
Sbjct: 421 KKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKK 475
>gnl|CDD|215637 PLN03223, PLN03223, Polycystin cation channel protein; Provisional.
Length = 1634
Score = 27.2 bits (60), Expect = 5.6
Identities = 15/52 (28%), Positives = 28/52 (53%), Gaps = 4/52 (7%)
Query: 107 ATEFRIQR----LRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
AT I + L ++ + P++E+++ED + E+E ++ QQ AE Q
Sbjct: 1544 ATADEIDQAAHMLMDQVGQVPDDEDDDEDGDVLEKEVDQLQQSLERLAEVQR 1595
>gnl|CDD|234352 TIGR03779, Bac_Flav_CT_M, Bacteroides conjugative transposon TraM
protein. Members of this protein family are designated
TraM and are found in a proposed transfer region of a
class of conjugative transposon found in the Bacteroides
lineage [Cellular processes, DNA transformation].
Length = 410
Score = 27.0 bits (60), Expect = 5.9
Identities = 18/59 (30%), Positives = 22/59 (37%), Gaps = 1/59 (1%)
Query: 103 RPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
R A I R F E P +EE+E E EE E E + A E + E
Sbjct: 120 RSSAAAYRDINRELGSFYEYPKTDEEKE-LLREVEELESRLATEPSPAPELEEQLALME 177
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The domain
is found in the primary vegetative sigma factor. The
function of this domain is unclear and can be removed
without loss of function.
Length = 211
Score = 26.8 bits (60), Expect = 6.0
Identities = 11/33 (33%), Positives = 21/33 (63%)
Query: 121 EPPNEEEEEEDEEEEEEEAEETQQGETTEAERQ 153
E E+++++DE+E+E++ EE G E R+
Sbjct: 50 EEDLEDDDDDDEDEDEDDEEEADLGPDPEEARE 82
Score = 26.0 bits (58), Expect = 8.9
Identities = 9/40 (22%), Positives = 13/40 (32%)
Query: 123 PNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 162
PN E E +E + + + E E E E
Sbjct: 32 PNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEA 71
>gnl|CDD|236154 PRK08119, PRK08119, flagellar motor switch protein; Validated.
Length = 382
Score = 27.1 bits (61), Expect = 6.0
Identities = 12/31 (38%), Positives = 13/31 (41%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
EEEEEE+E EEEE T
Sbjct: 225 GEEEEEEEEVEEEEAQASPAAEPATAQAAPA 255
>gnl|CDD|240415 PTZ00429, PTZ00429, beta-adaptin; Provisional.
Length = 746
Score = 26.8 bits (59), Expect = 6.6
Identities = 16/61 (26%), Positives = 25/61 (40%), Gaps = 12/61 (19%)
Query: 95 NTRLVVTHRPLNATEFRIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQE 154
NT +V RP + FL P + E DEE+ E++ + + T Q+
Sbjct: 584 NTAAIVFARPYQS-----------FLPPYGLADVELDEEDTEDD-DAVELPSTPSMGTQD 631
Query: 155 G 155
G
Sbjct: 632 G 632
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 26.5 bits (59), Expect = 6.8
Identities = 11/35 (31%), Positives = 23/35 (65%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
E+ EEE+EE++ E+E+ + + E E ++ + G+
Sbjct: 284 EKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGD 318
Score = 26.5 bits (59), Expect = 7.9
Identities = 16/41 (39%), Positives = 22/41 (53%)
Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGE 156
E LE EEEEE+D E E+E EE + E E + + +
Sbjct: 279 LEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 26.9 bits (60), Expect = 7.3
Identities = 12/44 (27%), Positives = 18/44 (40%)
Query: 122 PPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 165
P E+EE E++A + +AE E E E G +
Sbjct: 773 APPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVAMELLEEELGAK 816
>gnl|CDD|219922 pfam08595, RXT2_N, RXT2-like, N-terminal. The family represents
the N-terminal region of RXT2-like proteins. In S.
cerevisiae, RXT2 has been demonstrated to be involved in
conjugation with cellular fusion (mating) and invasive
growth. A high throughput localisation study has
localised RXT2 to the nucleus.
Length = 141
Score = 26.2 bits (58), Expect = 7.5
Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 2/40 (5%)
Query: 112 IQRLRERFLEPP--NEEEEEEDEEEEEEEAEETQQGETTE 149
+ R R +F PP +E+ + D+++E++E +E E +
Sbjct: 34 LSRKRIKFNNPPRIDEDGGDIDDDDEDDEDDEEADAEDDD 73
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 26.3 bits (58), Expect = 7.7
Identities = 10/33 (30%), Positives = 20/33 (60%)
Query: 119 FLEPPNEEEEEEDEEEEEEEAEETQQGETTEAE 151
F E +E++E +E EE+EA + ++ + + E
Sbjct: 91 FPEESEKEQKEVSKETEEKEAIKAKKEKKEKKE 123
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 26.6 bits (59), Expect = 7.7
Identities = 15/51 (29%), Positives = 25/51 (49%), Gaps = 5/51 (9%)
Query: 111 RIQRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGERE 161
+++L + E++E+E EE E+ E Q+GE A Q E G +
Sbjct: 294 ELKKLENKL-----EKQEDELEELEKAAEELRQKGELLYANLQLIEEGLKS 339
>gnl|CDD|225962 COG3428, COG3428, Predicted membrane protein [Function unknown].
Length = 494
Score = 26.6 bits (59), Expect = 7.8
Identities = 11/47 (23%), Positives = 17/47 (36%), Gaps = 1/47 (2%)
Query: 113 QRLRERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGE 159
+RE ++ + E E + EEE E + R GE
Sbjct: 140 ALVREARVKKLDALELAEADTPEEEV-AEVLARSQSSVLRYPMNKGE 185
>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2. This model
describes a tightly conserved subfamily of the larger
family of sugar (and other) transporters described by
PFAM model pfam00083. Members of this subfamily include
closely related forms SV2A and SV2B of synaptic vesicle
protein from vertebrates and a more distantly related
homolog (below trusted cutoff) from Drosophila
melanogaster. Members are predicted to have two sets of
six transmembrane helices.
Length = 742
Score = 26.9 bits (59), Expect = 7.9
Identities = 23/70 (32%), Positives = 31/70 (44%), Gaps = 14/70 (20%)
Query: 113 QRLRERFLE--------PPNE---EEEEEDEEEEEEEAEETQQGETTEAERQEGE--GGE 159
QR RF + P + E D+EEE E + + +G + E EGE G
Sbjct: 43 QRSYSRFEDEDDDDDFPAPADGYSRGEAADDEEEGEASSDATEGHDEDDEIYEGEYQGIP 102
Query: 160 R-EGGGEGEI 168
R E GG+ EI
Sbjct: 103 RAENGGKDEI 112
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 26.7 bits (59), Expect = 8.0
Identities = 15/42 (35%), Positives = 21/42 (50%)
Query: 124 NEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGE 165
++E E E+ E EE ++GE E E E E E E G+
Sbjct: 66 DDEGRIEVRISEDGEEEEVEEGEEDEEEEGEEESEEFEPLGD 107
>gnl|CDD|145361 pfam02161, Prog_receptor, Progesterone receptor.
Length = 456
Score = 26.5 bits (58), Expect = 8.8
Identities = 10/38 (26%), Positives = 19/38 (50%)
Query: 125 EEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREG 162
+EE +E ++++ ++ + EA R GEG R
Sbjct: 56 QEEPDEKTQDQQSLSDVERAEPEVEASRGRGEGSSRPP 93
>gnl|CDD|182388 PRK10337, PRK10337, sensor protein QseC; Provisional.
Length = 449
Score = 26.5 bits (59), Expect = 9.1
Identities = 18/64 (28%), Positives = 25/64 (39%), Gaps = 6/64 (9%)
Query: 64 LIVRDDAVYYNELETRVRLSKRRQKVGAAPNNTRLVVTHRPLNATEFRIQRLRERFLEPP 123
L+VR N L+ +R S + V N V T + R+ ERF PP
Sbjct: 355 LLVR------NLLDNAIRYSPQGSVVDVTLNARNFTVRDNGPGVTPEALARIGERFYRPP 408
Query: 124 NEEE 127
+E
Sbjct: 409 GQEA 412
>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
Length = 613
Score = 26.4 bits (59), Expect = 9.3
Identities = 12/58 (20%), Positives = 18/58 (31%)
Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEETQQGETTEAERQEGEGGEREGGGEGEISECSN 173
+ R E + EEE + EE + E G G ++ SN
Sbjct: 108 KRRLFEEQDSGLGNSLEEESTDVVEEEGVESSGGGEGGSETGQGGGNGLVLDLLRSSN 165
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 26.3 bits (58), Expect = 9.9
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 116 RERFLEPPNEEEEEEDEEEEEEEAEE 141
+ + LE + +EE+E +EEEEEE EE
Sbjct: 159 KLKELEAEDVDEEDEKDEEEEEEEEE 184
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.305 0.128 0.349
Gapped
Lambda K H
0.267 0.0812 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,113,579
Number of extensions: 870070
Number of successful extensions: 6548
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4330
Number of HSP's successfully gapped: 894
Length of query: 175
Length of database: 10,937,602
Length adjustment: 90
Effective length of query: 85
Effective length of database: 6,945,742
Effective search space: 590388070
Effective search space used: 590388070
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.9 bits)
S2: 56 (25.2 bits)