RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8459
(106 letters)
>gnl|CDD|218556 pfam05327, RRN3, RNA polymerase I specific transcription initiation
factor RRN3. This family consists of several eukaryotic
proteins which are homologous to the yeast RRN3 protein.
RRN3 is one of the RRN genes specifically required for
the transcription of rDNA by RNA polymerase I (Pol I) in
Saccharomyces cerevisiae.
Length = 554
Score = 40.4 bits (95), Expect = 4e-05
Identities = 20/54 (37%), Positives = 28/54 (51%), Gaps = 1/54 (1%)
Query: 43 NNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDT-EGKEVRGK 95
NE D +D+EE E +D+E + + D DEE+ + E E T KEV K
Sbjct: 219 QNELDDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKEVSEK 272
Score = 33.0 bits (76), Expect = 0.017
Identities = 11/47 (23%), Positives = 23/47 (48%), Gaps = 2/47 (4%)
Query: 50 EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
+ D+E E ED++ E++ D ++D E+E + + K+
Sbjct: 224 DIDDEEEERVLADEDDDDEDDMFD--MDDDDEEESDPEVERTSTIKE 268
Score = 27.7 bits (62), Expect = 1.3
Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 1/54 (1%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDE-EDGEDEEEDTEGKEVRGKKRKHD 100
D E E + D E E+ L +E+DD D+ D +D++E+ EV +
Sbjct: 215 DVEIQNELDDIDDEEEERVLADEDDDDEDDMFDMDDDDEEESDPEVERTSTIKE 268
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 40.4 bits (94), Expect = 5e-05
Identities = 21/49 (42%), Positives = 32/49 (65%), Gaps = 1/49 (2%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
+AK +E + S+G D E E+EE EEEE++ +EE+ E+EEE+ E
Sbjct: 844 EAKQDEK-GVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 39.2 bits (91), Expect = 1e-04
Identities = 20/49 (40%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 45 EAPDSED--DEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
EA E D D + E+EE EEEE++ +EE+ E+EEE+ E +E
Sbjct: 844 EAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 36.9 bits (85), Expect = 7e-04
Identities = 19/63 (30%), Positives = 34/63 (53%)
Query: 44 NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
+E + E + E++G+ + E DG D E+ E+EEE+ E +E ++ + +EEE
Sbjct: 830 DETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889
Query: 104 EGE 106
E
Sbjct: 890 NEE 892
Score = 33.8 bits (77), Expect = 0.009
Identities = 19/68 (27%), Positives = 34/68 (50%), Gaps = 5/68 (7%)
Query: 44 NEAPDSEDDEESEGDDYEGEDEELEEE-----EDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
E + E E++ DD E +DE E+E + + +E G D ++G + ++ +
Sbjct: 810 GEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDEKGVDGGGGSDGGDSEEEEEE 869
Query: 99 HDEEEEGE 106
+EEEE E
Sbjct: 870 EEEEEEEE 877
Score = 33.8 bits (77), Expect = 0.010
Identities = 19/69 (27%), Positives = 31/69 (44%), Gaps = 1/69 (1%)
Query: 38 GFDAKNNEAPDSEDDEESEGD-DYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
G + + + E+ EES G+ + EGE E E E +G + + E+E E +
Sbjct: 646 GEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEAD 705
Query: 97 RKHDEEEEG 105
K + E E
Sbjct: 706 HKGETEAEE 714
Score = 31.9 bits (72), Expect = 0.049
Identities = 18/45 (40%), Positives = 28/45 (62%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEE 84
D K + D +SE ++ E E+EE EEEE++ +EE+ E+EE
Sbjct: 848 DEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 31.5 bits (71), Expect = 0.054
Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 1/69 (1%)
Query: 38 GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKR 97
G +A+ +++ + ESEG+ E + +E E + +E E + E E G+
Sbjct: 664 GGEAEQEGETETKGENESEGE-IPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETE 722
Query: 98 KHDEEEEGE 106
E+EGE
Sbjct: 723 AEGTEDEGE 731
Score = 31.5 bits (71), Expect = 0.063
Identities = 13/67 (19%), Positives = 28/67 (41%)
Query: 38 GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKR 97
+A+ E D + + E + +G++ + E +G E +DE E + +
Sbjct: 769 ETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEV 828
Query: 98 KHDEEEE 104
K + E+
Sbjct: 829 KDETGEQ 835
Score = 31.1 bits (70), Expect = 0.090
Identities = 16/41 (39%), Positives = 26/41 (63%)
Query: 38 GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEE 78
G D ++E + E++EE E ++ E E+EE EEE ++ L E
Sbjct: 857 GSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897
Score = 30.3 bits (68), Expect = 0.13
Identities = 21/63 (33%), Positives = 34/63 (53%), Gaps = 2/63 (3%)
Query: 45 EAPDSEDDEESEGDDYEGEDE-ELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
+ D + + E+E ++EGE E E E+E + E+GE+ E++ EG E GK E +
Sbjct: 702 KEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEG-EAEGKHEVETEGD 760
Query: 104 EGE 106
E
Sbjct: 761 RKE 763
Score = 30.3 bits (68), Expect = 0.17
Identities = 25/68 (36%), Positives = 36/68 (52%), Gaps = 6/68 (8%)
Query: 45 EAPDSEDDEESEGDDYEGE---DEELEEEEDDGLDEEDGEDEEE---DTEGKEVRGKKRK 98
E + E + E+EG + EGE EE EE ED+G E +G+ E E D + E G+
Sbjct: 713 EEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEA 772
Query: 99 HDEEEEGE 106
+E+E E
Sbjct: 773 EGKEDEDE 780
Score = 30.0 bits (67), Expect = 0.21
Identities = 21/71 (29%), Positives = 32/71 (45%), Gaps = 4/71 (5%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE---DTEGK-EVRGK 95
+ E + E G++ GE E+ E E G +E +GE E + EG+ E+ K
Sbjct: 643 ERTGEEGERPTEAEGENGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAK 702
Query: 96 KRKHDEEEEGE 106
+ H E E E
Sbjct: 703 EADHKGETEAE 713
Score = 29.6 bits (66), Expect = 0.25
Identities = 16/61 (26%), Positives = 33/61 (54%), Gaps = 2/61 (3%)
Query: 48 DSEDDEESEGD-DYEGEDEELEEEEDDGLDEEDGEDEEEDTEGK-EVRGKKRKHDEEEEG 105
+ + ++E EG+ + + D + E E ++ E + E E + EG+ E + + ++E EG
Sbjct: 688 ERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEG 747
Query: 106 E 106
E
Sbjct: 748 E 748
Score = 29.2 bits (65), Expect = 0.33
Identities = 13/37 (35%), Positives = 24/37 (64%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDD 73
G + ++ + E++EE E ++ E E+EE EEEE++
Sbjct: 854 GGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEEN 890
Score = 28.8 bits (64), Expect = 0.52
Identities = 14/34 (41%), Positives = 22/34 (64%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEE 70
DG D++ E + E++EE E ++ E E+EE EE
Sbjct: 859 DGGDSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 28.8 bits (64), Expect = 0.57
Identities = 17/61 (27%), Positives = 36/61 (59%), Gaps = 1/61 (1%)
Query: 45 EAPDSEDDEESEGDDYEGEDE-ELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
+ ++E + E+E + E EDE E++ ED + ++G + + + EG+ G+K +H+ +
Sbjct: 760 DRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQS 819
Query: 104 E 104
E
Sbjct: 820 E 820
Score = 28.0 bits (62), Expect = 0.91
Identities = 18/60 (30%), Positives = 27/60 (45%), Gaps = 1/60 (1%)
Query: 48 DSEDDEESEG-DDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
+SE + +E + EGE E +E D + E E E E E + + + EEGE
Sbjct: 680 ESEGEIPAERKGEQEGEGEIEAKEADHKGETEAEEVEHEGETEAEGTEDEGEIETGEEGE 739
Score = 28.0 bits (62), Expect = 0.91
Identities = 16/56 (28%), Positives = 27/56 (48%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
++ ++ E D+ EGE E E E +G +E + E + EGKE + E+
Sbjct: 733 ETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGED 788
Score = 27.3 bits (60), Expect = 2.0
Identities = 15/56 (26%), Positives = 24/56 (42%)
Query: 51 DDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
D+ ++EGE E E++E +G E +D E E E E ++ E
Sbjct: 794 DEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQDE 849
Score = 26.9 bits (59), Expect = 2.6
Identities = 22/67 (32%), Positives = 30/67 (44%), Gaps = 1/67 (1%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
D EA + + + E+EGD E E E E E ++E ED E K G + K
Sbjct: 743 DEGEGEA-EGKHEVETEGDRKETEHEGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKV 801
Query: 100 DEEEEGE 106
+ E E E
Sbjct: 802 EHEGETE 808
Score = 26.5 bits (58), Expect = 3.3
Identities = 19/81 (23%), Positives = 28/81 (34%), Gaps = 11/81 (13%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE----------- 85
DG + A + E + E E E + D + +D E+E
Sbjct: 788 DGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQGEAKQ 847
Query: 86 DTEGKEVRGKKRKHDEEEEGE 106
D +G + G D EEE E
Sbjct: 848 DEKGVDGGGGSDGGDSEEEEE 868
Score = 26.1 bits (57), Expect = 4.8
Identities = 17/59 (28%), Positives = 29/59 (49%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
++E+ E + EG ++E E E + +E + E E E EV + + + E EGE
Sbjct: 711 EAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGE 769
>gnl|CDD|197729 smart00446, LRRcap, occurring C-terminal to leucine-rich repeats.
A motif occurring C-terminal to leucine-rich repeats
in "sds22-like" and "typical" LRR-containing proteins.
Length = 19
Score = 36.9 bits (87), Expect = 6e-05
Identities = 12/18 (66%), Positives = 14/18 (77%)
Query: 20 ENYREKVFELLPNLKYLD 37
+YREKV ELLP L+ LD
Sbjct: 2 AHYREKVIELLPQLRKLD 19
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 38.8 bits (91), Expect = 2e-04
Identities = 20/63 (31%), Positives = 32/63 (50%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
DA + D +DDEES+ +D E E +E++DD D++D E E + R + +
Sbjct: 118 DAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRRRRREWEEK 177
Query: 100 DEE 102
E
Sbjct: 178 RAE 180
Score = 36.9 bits (86), Expect = 8e-04
Identities = 17/71 (23%), Positives = 37/71 (52%)
Query: 36 LDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGK 95
L ++ D E+D++ + + E ++E + E+D+ D++D +D+ E R +
Sbjct: 110 LQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSLERRR 169
Query: 96 KRKHDEEEEGE 106
+R+ EE+ E
Sbjct: 170 RRREWEEKRAE 180
Score = 32.6 bits (75), Expect = 0.023
Identities = 15/62 (24%), Positives = 34/62 (54%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
DG + + DD + ++ + +DEE +EE+++ ED ED+++D + ++ ++
Sbjct: 103 DGDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRE 162
Query: 97 RK 98
R
Sbjct: 163 RS 164
Score = 30.0 bits (68), Expect = 0.18
Identities = 14/57 (24%), Positives = 25/57 (43%), Gaps = 1/57 (1%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
++E + Y+ +LEE++DD +E D EDEE + D++
Sbjct: 104 GDIEEELQDEPRYDDAYRDLEEDDDDD-EESDEEDEESSKSEDDEDDDDDDDDDDIA 159
Score = 30.0 bits (68), Expect = 0.20
Identities = 13/56 (23%), Positives = 27/56 (48%)
Query: 51 DDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
D E E D D+ + E+D D+E+ ++E+E++ E D++++
Sbjct: 104 GDIEEELQDEPRYDDAYRDLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIA 159
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 38.1 bits (88), Expect = 3e-04
Identities = 19/59 (32%), Positives = 33/59 (55%), Gaps = 8/59 (13%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
ED + + +DE+ +E++DD D+E+ E+EEE+ +G + DE+EE E
Sbjct: 141 AEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFD--------DEDEEDE 191
Score = 30.0 bits (67), Expect = 0.19
Identities = 22/53 (41%), Positives = 31/53 (58%), Gaps = 1/53 (1%)
Query: 32 NLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEED-DGLDEEDGEDE 83
+L D N D ++DE+ + DD E ++EE EEEE+ G D+ED EDE
Sbjct: 139 HLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDE 191
Score = 29.6 bits (66), Expect = 0.32
Identities = 18/70 (25%), Positives = 40/70 (57%)
Query: 34 KYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVR 93
++L D + +DD+E E +D + E+++ EEEE++ + +++EED G++
Sbjct: 138 RHLAEEDMSPRDNFVIDDDDEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEEDEGGEDFT 197
Query: 94 GKKRKHDEEE 103
+K + D+ +
Sbjct: 198 YEKSEVDKTD 207
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 35.9 bits (83), Expect = 0.002
Identities = 21/54 (38%), Positives = 26/54 (48%), Gaps = 5/54 (9%)
Query: 51 DDEESEGDDYEGEDEELEEEEDDGLDE-----EDGEDEEEDTEGKEVRGKKRKH 99
+D +E DD E EDEE EEEED+ +D E EE+D E K
Sbjct: 382 EDANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDGNSS 435
Score = 30.1 bits (68), Expect = 0.20
Identities = 17/50 (34%), Positives = 28/50 (56%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
DA + E+DEE E ++ E E E +D+ +E+D E + ED++G
Sbjct: 383 DANTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEEDDVESKYEDSDG 432
Score = 27.8 bits (62), Expect = 1.2
Identities = 17/44 (38%), Positives = 25/44 (56%), Gaps = 1/44 (2%)
Query: 63 EDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
E+E+ + DEE EDEEE+ E E G ++H ++EE E
Sbjct: 377 SALEIEDANTERDDEE-EEDEEEEEEEDEDEGPSKEHSDDEEFE 419
Score = 26.6 bits (59), Expect = 2.7
Identities = 18/58 (31%), Positives = 28/58 (48%), Gaps = 2/58 (3%)
Query: 49 SEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
E +++ D + + E E D +EED E+EEE+ E + K D+EE E
Sbjct: 365 KETEQDYILDAFSALEIEDANTERDDEEEEDEEEEEEEDEDEGPS--KEHSDDEEFEE 420
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 35.7 bits (83), Expect = 0.002
Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 2/72 (2%)
Query: 35 YLDGFDAKNNEAPDSEDDEESE--GDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
L + E D DDE+ E DD E E+E+++ +++ +E++ D+E+D E +E
Sbjct: 315 GLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEE 374
Query: 93 RGKKRKHDEEEE 104
+K+K E
Sbjct: 375 EKEKKKKKSAES 386
Score = 35.0 bits (81), Expect = 0.003
Identities = 22/71 (30%), Positives = 38/71 (53%), Gaps = 2/71 (2%)
Query: 38 GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGE--DEEEDTEGKEVRGK 95
G + + E +D++E + DD E+EE + D+ DEED + DE+++ E +E + K
Sbjct: 319 GEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEK 378
Query: 96 KRKHDEEEEGE 106
K+K E
Sbjct: 379 KKKKSAESTRS 389
Score = 34.2 bits (79), Expect = 0.006
Identities = 27/72 (37%), Positives = 42/72 (58%), Gaps = 2/72 (2%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDD--YEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRG 94
+ + A D +D+ E + DD G+ EE EEEE+DG+D+ED ED+++D E +E
Sbjct: 290 EEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDV 349
Query: 95 KKRKHDEEEEGE 106
+E+EE E
Sbjct: 350 DLSDEEEDEEDE 361
Score = 32.7 bits (75), Expect = 0.024
Identities = 23/70 (32%), Positives = 38/70 (54%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
D F+ +++ EE E ++ +G D+E EE++DD L+EE+ + + D E E
Sbjct: 304 DEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDS 363
Query: 97 RKHDEEEEGE 106
D+EEE E
Sbjct: 364 DDEDDEEEEE 373
Score = 26.9 bits (60), Expect = 2.3
Identities = 15/74 (20%), Positives = 36/74 (48%)
Query: 33 LKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
L+ + G + + E DS++ + D++E +D++ DEE+ ED +D + ++
Sbjct: 279 LRRMRGEEEDDEEEEDSKESADDLDDEFEPDDDDNFGLGQGEEDEEEEEDGVDDEDEEDD 338
Query: 93 RGKKRKHDEEEEGE 106
+ +E+ +
Sbjct: 339 DDDLEEEEEDVDLS 352
Score = 25.7 bits (57), Expect = 6.3
Identities = 11/48 (22%), Positives = 21/48 (43%), Gaps = 1/48 (2%)
Query: 38 GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
K+++ D +D ++ + D D+ DEED E++ E
Sbjct: 124 SEIDKDDDVRDDDDFDDDDLGD-LASDDRAAHFGGGEDDEEDEEEQPE 170
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 34.6 bits (79), Expect = 0.005
Identities = 22/62 (35%), Positives = 31/62 (50%), Gaps = 3/62 (4%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGED--EEEDTEGKEVRGKKR 97
+++ E DE+ E D+ ED +E E+D DEEDGED E E + R KR
Sbjct: 941 YEASSDDESDETDEDEESDESS-EDLSEDESENDSSDEEDGEDWDELESKAAYDSRPGKR 999
Query: 98 KH 99
+
Sbjct: 1000 RK 1001
>gnl|CDD|220972 pfam11081, DUF2890, Protein of unknown function (DUF2890). This
family is conserved in dsDNA adenoviruses of
vertebrates. The function is not known.
Length = 172
Score = 33.7 bits (77), Expect = 0.008
Identities = 20/42 (47%), Positives = 28/42 (66%), Gaps = 1/42 (2%)
Query: 47 PDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
D E+D +S+ ++ E EDEE E+ +D LDEED E EE + E
Sbjct: 18 KDEEEDWDSQAEEVE-EDEEEMEDWEDSLDEEDEEAEEVEEE 58
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 33.0 bits (76), Expect = 0.015
Identities = 16/57 (28%), Positives = 31/57 (54%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVR 93
D D +++ D EDD++ + DD + E +E +E E D++ DE++ ++ R
Sbjct: 145 DDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQAR 201
Score = 32.7 bits (75), Expect = 0.019
Identities = 16/71 (22%), Positives = 39/71 (54%)
Query: 34 KYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVR 93
K +D + +++ D +DD+ + D + +D+E ++E+DD D +D ++E+++ + E
Sbjct: 122 KDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKL 181
Query: 94 GKKRKHDEEEE 104
+E+
Sbjct: 182 SDDDDFVWDED 192
Score = 32.7 bits (75), Expect = 0.022
Identities = 14/68 (20%), Positives = 34/68 (50%)
Query: 36 LDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGK 95
D D + + D ++D++ + DD + +DE+ E++E L++ +D+ E +
Sbjct: 139 DDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALR 198
Query: 96 KRKHDEEE 103
+ + D +
Sbjct: 199 QARKDAKL 206
Score = 32.7 bits (75), Expect = 0.024
Identities = 15/81 (18%), Positives = 41/81 (50%)
Query: 24 EKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDE 83
+ + ++ L+ D +++ D + D++ DD + ED++ ++++DD DE++ + E
Sbjct: 115 DDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKE 174
Query: 84 EEDTEGKEVRGKKRKHDEEEE 104
++ E +++ E
Sbjct: 175 AKELEKLSDDDDFVWDEDDSE 195
Score = 31.9 bits (73), Expect = 0.042
Identities = 16/70 (22%), Positives = 35/70 (50%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
D + D +DD++ DD +D++ E++++D D++ +++EE E KE+
Sbjct: 123 DIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLS 182
Query: 97 RKHDEEEEGE 106
D + +
Sbjct: 183 DDDDFVWDED 192
Score = 31.1 bits (71), Expect = 0.081
Identities = 16/77 (20%), Positives = 38/77 (49%)
Query: 15 DVTTIENYREKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDG 74
D NY + + L D D + + D +DD++ E DD + +D+++++E+++
Sbjct: 113 DKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEK 172
Query: 75 LDEEDGEDEEEDTEGKE 91
+ ++ E +D +
Sbjct: 173 KEAKELEKLSDDDDFVW 189
Score = 31.1 bits (71), Expect = 0.083
Identities = 15/67 (22%), Positives = 39/67 (58%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
A +++ D +DD + + D + +DE+ +E++DD +++ E+++E E +++
Sbjct: 129 QADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFV 188
Query: 100 DEEEEGE 106
+E++ E
Sbjct: 189 WDEDDSE 195
Score = 29.2 bits (66), Expect = 0.31
Identities = 18/70 (25%), Positives = 39/70 (55%), Gaps = 2/70 (2%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
D N D + +++ DD + +D++L++++ D D++D ED++ED + +V +
Sbjct: 112 LDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDID--DDDDDEDDDEDDDDDDVDDED 169
Query: 97 RKHDEEEEGE 106
+ E +E E
Sbjct: 170 EEKKEAKELE 179
Score = 28.8 bits (65), Expect = 0.46
Identities = 17/67 (25%), Positives = 36/67 (53%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
+ KN D + + + D D++ ++++DD LD++D +D+++D + E
Sbjct: 107 EKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVD 166
Query: 100 DEEEEGE 106
DE+EE +
Sbjct: 167 DEDEEKK 173
Score = 28.8 bits (65), Expect = 0.48
Identities = 16/68 (23%), Positives = 36/68 (52%), Gaps = 1/68 (1%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGED-EEEDTEGKEVRGKKRK 98
D + D + + + DD + +D + ++ +DD DE+D ED +++D + ++ K+ K
Sbjct: 117 DLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAK 176
Query: 99 HDEEEEGE 106
E+ +
Sbjct: 177 ELEKLSDD 184
Score = 27.3 bits (61), Expect = 1.7
Identities = 11/84 (13%), Positives = 33/84 (39%)
Query: 3 LTNLKHVDLFNNDVTTIENYREKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEG 62
L +K +D+ N ++ + + D D ++ D DDE+ E + +
Sbjct: 118 LNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDDDEDDDEDDDDDDVDDEDEEKKEAKE 177
Query: 63 EDEELEEEEDDGLDEEDGEDEEED 86
++ ++++ +++ +
Sbjct: 178 LEKLSDDDDFVWDEDDSEALRQAR 201
>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
Length = 852
Score = 33.2 bits (76), Expect = 0.017
Identities = 18/74 (24%), Positives = 27/74 (36%), Gaps = 19/74 (25%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE-------------------DTE 88
DS + +G + + ED +++DD D G EEE D E
Sbjct: 23 DSGESRWVDGSEDDSEDPASLDDDDDDRDSGGGMLEEEEDGNMRRRLIRTPPRVDSFDVE 82
Query: 89 GKEVRGKKRKHDEE 102
E+ G R E+
Sbjct: 83 AMEIPGAHRHDSED 96
Score = 27.4 bits (61), Expect = 1.4
Identities = 8/66 (12%), Positives = 24/66 (36%), Gaps = 2/66 (3%)
Query: 41 AKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHD 100
+++ S + + + + +D D ED ++D + ++ G + +
Sbjct: 3 EESSGGIGSSSNGSGGLVGTDSGESRWVDGSED--DSEDPASLDDDDDDRDSGGGMLEEE 60
Query: 101 EEEEGE 106
E+
Sbjct: 61 EDGNMR 66
>gnl|CDD|217783 pfam03896, TRAP_alpha, Translocon-associated protein (TRAP),
alpha subunit. The alpha-subunit of the TRAP complex
(TRAP alpha) is a single-spanning membrane protein of
the endoplasmic reticulum (ER) which is found in
proximity of nascent polypeptide chains translocating
across the membrane.
Length = 281
Score = 32.8 bits (75), Expect = 0.019
Identities = 13/42 (30%), Positives = 24/42 (57%)
Query: 46 APDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDT 87
A D +DEE+E D + ++E+ E+D + + E++EE
Sbjct: 31 AQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEGE 72
Score = 28.2 bits (63), Expect = 0.62
Identities = 16/44 (36%), Positives = 25/44 (56%), Gaps = 7/44 (15%)
Query: 63 EDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
EDEE E++ D +E++ EE++ E E +E+EEGE
Sbjct: 36 EDEEAEDDVVDEDEEDEAVVEEDENELTEE-------EEDEEGE 72
Score = 27.0 bits (60), Expect = 1.7
Identities = 15/41 (36%), Positives = 24/41 (58%), Gaps = 4/41 (9%)
Query: 53 EESEGDDYEGEDEELEEEEDD----GLDEEDGEDEEEDTEG 89
+ +D E ED+ ++E+E+D DE + +EEED EG
Sbjct: 31 AQDLTEDEEAEDDVVDEDEEDEAVVEEDENELTEEEEDEEG 71
Score = 26.7 bits (59), Expect = 2.8
Identities = 15/41 (36%), Positives = 25/41 (60%), Gaps = 4/41 (9%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
E ++EDD E + EDE + EE+++ L EE+ ++E E
Sbjct: 36 EDEEAEDDVVDE----DEEDEAVVEEDENELTEEEEDEEGE 72
>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region. The
domain is found in the primary vegetative sigma factor.
The function of this domain is unclear and can be
removed without loss of function.
Length = 211
Score = 32.5 bits (75), Expect = 0.022
Identities = 14/51 (27%), Positives = 26/51 (50%)
Query: 35 YLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
++D A + + E E D + +D++ +E+EDD + + G D EE
Sbjct: 29 FIDPNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLGPDPEE 79
Score = 28.3 bits (64), Expect = 0.69
Identities = 14/43 (32%), Positives = 22/43 (51%)
Query: 47 PDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
P++ + DEE E++DD ++ED +DEEE G
Sbjct: 32 PNAAAAAATAAAIESELDEEDLEDDDDDDEDEDEDDEEEADLG 74
>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat.
Length = 60
Score = 30.6 bits (70), Expect = 0.022
Identities = 16/37 (43%), Positives = 23/37 (62%), Gaps = 3/37 (8%)
Query: 1 KQLTNLKHVDLFNNDVTTIENYREKVFELLPNLKYLD 37
K L NLK +DL N++T+I + F LP+L+ LD
Sbjct: 21 KGLPNLKVLDLSGNNLTSIS---PEAFSGLPSLRSLD 54
Score = 30.6 bits (70), Expect = 0.030
Identities = 17/33 (51%), Positives = 21/33 (63%), Gaps = 3/33 (9%)
Query: 5 NLKHVDLFNNDVTTIENYREKVFELLPNLKYLD 37
NLK +DL NN +T I + F+ LPNLK LD
Sbjct: 1 NLKSLDLSNNRLTVIP---DGAFKGLPNLKVLD 30
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 32.7 bits (74), Expect = 0.026
Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 2/69 (2%)
Query: 40 DAKNNEAPDSED--DEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKR 97
DA+ +E ED +E + D+ +D+ + EDD EDG +E + +
Sbjct: 4024 DAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEENVQENEESTEDGVK 4083
Query: 98 KHDEEEEGE 106
+E E+GE
Sbjct: 4084 SDEELEQGE 4092
Score = 28.8 bits (64), Expect = 0.53
Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 3/51 (5%)
Query: 41 AKNNEAP--DSEDDEES-EGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
A NNE+ EDD ++ E D + +++E E +D G+D+E D +E+
Sbjct: 3928 AANNESDLVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPDIQENNS 3978
Score = 28.0 bits (62), Expect = 0.94
Identities = 13/71 (18%), Positives = 29/71 (40%), Gaps = 1/71 (1%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLD-EEDGEDEEEDTEGKEVRGK 95
G A++ +EDD+ E + E + + +E + + E D +D+ E
Sbjct: 3820 SGVGAEDITNTLNEDDDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENS 3879
Query: 96 KRKHDEEEEGE 106
+ + ++ E
Sbjct: 3880 DSEEENQDLDE 3890
Score = 28.0 bits (62), Expect = 1.1
Identities = 12/37 (32%), Positives = 22/37 (59%)
Query: 50 EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEED 86
E+ EE++ + E +E EE++ LDE+ +D+ D
Sbjct: 4018 ENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSD 4054
Score = 25.7 bits (56), Expect = 5.7
Identities = 12/48 (25%), Positives = 23/48 (47%)
Query: 44 NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
NE E +++S + +L +EDD ED + +E++ E +
Sbjct: 3912 NEEDLLETEQKSNEQSAANNESDLVSKEDDNKALEDKDRQEKEDEEEM 3959
>gnl|CDD|177447 PHA02664, PHA02664, hypothetical protein; Provisional.
Length = 534
Score = 32.3 bits (73), Expect = 0.029
Identities = 19/63 (30%), Positives = 30/63 (47%), Gaps = 12/63 (19%)
Query: 44 NEAPDSEDDEESEG--------DDYEGEDEELEEEEDDGLDEEDG----EDEEEDTEGKE 91
+E D EDD +S D+ + D+ +E D G++ +DG EEE+ E +
Sbjct: 464 DERADGEDDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGGVGQAIEEEEEEERA 523
Query: 92 VRG 94
V G
Sbjct: 524 VLG 526
Score = 28.8 bits (64), Expect = 0.52
Identities = 16/50 (32%), Positives = 26/50 (52%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEED 86
D D+ + + S +DE DD G++ + E DDG + E+EEE+
Sbjct: 471 DDSDSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGGVGQAIEEEEEE 520
Score = 28.0 bits (62), Expect = 0.79
Identities = 18/77 (23%), Positives = 33/77 (42%), Gaps = 12/77 (15%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELE----------EEEDDGLDEEDGEDEEEDTEG 89
A ++ + + DE+ GD+ +++ + E+E D D D +E D+
Sbjct: 446 HADRADSDEDDMDEQESGDERADGEDDSDSSYSYSTTSSEDESDSAD--DSWGDESDSGI 503
Query: 90 KEVRGKKRKHDEEEEGE 106
+ G + EEEE E
Sbjct: 504 EHDDGGVGQAIEEEEEE 520
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 32.0 bits (73), Expect = 0.029
Identities = 12/54 (22%), Positives = 22/54 (40%), Gaps = 1/54 (1%)
Query: 39 FDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
+ A D + EGD + +D+E E D D +D +E+ + +
Sbjct: 235 SSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDL-DDSDDDVSDEDGEDLFDT 287
Score = 31.3 bits (71), Expect = 0.069
Identities = 11/42 (26%), Positives = 21/42 (50%), Gaps = 2/42 (4%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEE 78
+ ++ D ED ES+ DD +D+ +E+ +D D +
Sbjct: 249 SDDEGDGSDDDDDEDAIESDLDD--SDDDVSDEDGEDLFDTD 288
Score = 29.7 bits (67), Expect = 0.19
Identities = 11/45 (24%), Positives = 20/45 (44%), Gaps = 1/45 (2%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
+ DS+D+ + DD + ED + +D D D + E+
Sbjct: 245 DGIDSDDEGDGSDDDDD-EDAIESDLDDSDDDVSDEDGEDLFDTD 288
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 32.2 bits (73), Expect = 0.035
Identities = 18/59 (30%), Positives = 32/59 (54%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
+ +EE E ++ E E+E+ EEE+ +EE+ E E ++ +E+ G + EE E
Sbjct: 442 EESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPE 500
Score = 31.8 bits (72), Expect = 0.044
Identities = 24/58 (41%), Positives = 37/58 (63%)
Query: 49 SEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
S++ EE E + E E+EE EEEE+ +EE+GEDEEE+ E + G + + + EG+
Sbjct: 436 SQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGD 493
Score = 31.0 bits (70), Expect = 0.074
Identities = 17/67 (25%), Positives = 36/67 (53%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
+++ E+ + E++EE E ++ E E EE E E+++ +E + ++ E+ G
Sbjct: 438 ESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGE 497
Query: 100 DEEEEGE 106
+ EE+ E
Sbjct: 498 EPEEDAE 504
Score = 30.7 bits (69), Expect = 0.10
Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 2/67 (2%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
+ E + E++EE E ++ EGEDEE EEE + D E+ E +EG + +
Sbjct: 445 VEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEA--DNGSEEEMEGSSEGDGDGEEPEED 502
Query: 100 DEEEEGE 106
E E
Sbjct: 503 AERRNSE 509
Score = 28.7 bits (64), Expect = 0.57
Identities = 20/50 (40%), Positives = 31/50 (62%)
Query: 42 KNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
+ E+ + E EE E ++ E E+EE E EE++G DEE+ E+ E D +E
Sbjct: 435 ASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEE 484
Score = 26.4 bits (58), Expect = 2.9
Identities = 14/53 (26%), Positives = 25/53 (47%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKR 97
E + ED+EE E + + EE E +G + + +E+ + E+ G R
Sbjct: 463 EEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISR 515
Score = 25.3 bits (55), Expect = 9.1
Identities = 13/65 (20%), Positives = 29/65 (44%)
Query: 42 KNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDE 101
++ E +++EE E + G +EE+E + D E+ E++ E + + +
Sbjct: 461 ESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEEPEEDAERRNSEMAGISRMSEGQ 520
Query: 102 EEEGE 106
+ G
Sbjct: 521 QPRGS 525
Score = 25.3 bits (55), Expect = 9.7
Identities = 18/61 (29%), Positives = 31/61 (50%)
Query: 46 APDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEG 105
+ S D ++ E +E E++ EE+ E+EEE+ E ++ ++ DEEEE
Sbjct: 415 SSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEE 474
Query: 106 E 106
E
Sbjct: 475 E 475
>gnl|CDD|130728 TIGR01667, YCCS_YHJK, integral membrane protein, YccS/YhfK family.
This model represents two clades of putative
transmembrane proteins including the E. coli YccS and
YhfK proteins. The YccS hypothetical equivalog
(TIGR01666) is found in beta and gamma proteobacteria,
while the smaller YhfK group is only found in E. coli,
Salmonella and Yersinia. TMHMM on the 19 hits to this
model shows a consensus of 11 transmembrane helices
separated into two clusters, an N-terminal cluster of 6
and a central cluster of 5. This would indicate two
non-membrane domains one on each side of the membrane.
Length = 701
Score = 31.0 bits (70), Expect = 0.083
Identities = 14/57 (24%), Positives = 22/57 (38%), Gaps = 5/57 (8%)
Query: 22 YREKVFELLPNLKYLDGF----DAKNNEAPDSEDDEES-EGDDYEGEDEELEEEEDD 73
+RE++ + L A E DEE+ GD E +E++ ED
Sbjct: 619 HRERLLNPELAAELLQACEIVAKAIQRCQARLEYDEEALAGDANIMEAQEMQPHEDA 675
>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies). Leucine rich
repeats are short sequence motifs present in a number
of proteins with diverse functions and cellular
locations. These repeats are usually involved in
protein-protein interactions. Each Leucine Rich Repeat
is composed of a beta-alpha unit. These units form
elongated non-globular structures. Leucine Rich Repeats
are often flanked by cysteine rich domains.
Length = 43
Score = 28.6 bits (65), Expect = 0.089
Identities = 14/34 (41%), Positives = 19/34 (55%), Gaps = 5/34 (14%)
Query: 4 TNLKHVDLFNNDVTTIENYREKVFELLPNLKYLD 37
TNL+ +DL NN +T + LPNL+ LD
Sbjct: 1 TNLETLDLSNNQITDLPP-----LSNLPNLETLD 29
Score = 27.5 bits (62), Expect = 0.32
Identities = 7/24 (29%), Positives = 11/24 (45%)
Query: 1 KQLTNLKHVDLFNNDVTTIENYRE 24
L NL+ +DL N +T +
Sbjct: 20 SNLPNLETLDLSGNKITDLSPLSN 43
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 30.7 bits (69), Expect = 0.090
Identities = 10/70 (14%), Positives = 20/70 (28%), Gaps = 2/70 (2%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
DG E D E + D G+ + + + E +E + +
Sbjct: 356 DGESTPAVEETSEADIEREQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHD--ETE 413
Query: 97 RKHDEEEEGE 106
+ E+
Sbjct: 414 PEVPEKAAPI 423
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 31.0 bits (70), Expect = 0.097
Identities = 18/52 (34%), Positives = 26/52 (50%), Gaps = 1/52 (1%)
Query: 39 FDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEED-GEDEEEDTEG 89
F+ + + D E+ ++ E + ED E E E D EED ED+ E EG
Sbjct: 645 FEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDAEANEG 696
Score = 29.5 bits (66), Expect = 0.30
Identities = 18/48 (37%), Positives = 27/48 (56%), Gaps = 2/48 (4%)
Query: 41 AKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
+ E + ++D+ E DD E E E +E+ E + E DGED EED +
Sbjct: 642 QQLFEGEEEDEDDLEETDDDEDECEAIEDSESE--SESDGEDGEEDEQ 687
Score = 27.5 bits (61), Expect = 1.3
Identities = 15/52 (28%), Positives = 23/52 (44%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
D + + EE + DD E D++ +E E E + E + ED E E
Sbjct: 635 DENKSRHQQLFEGEEEDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDE 686
Score = 26.4 bits (58), Expect = 4.0
Identities = 15/57 (26%), Positives = 28/57 (49%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
DSE + ES+G+D E +++E + E ++G+ D + + D E+E
Sbjct: 670 DSESESESDGEDGEEDEQEDDAEANEGVVPIDKAVRRALPKVLNLPDALDGGDSEDE 726
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 30.5 bits (69), Expect = 0.12
Identities = 21/57 (36%), Positives = 32/57 (56%)
Query: 50 EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
E ++E ++ E E EE +EEE+ D+E+ DEEE+ E K+ + KK K E
Sbjct: 30 EVEKEVPDEEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKKTKKVKETTTEWEL 86
>gnl|CDD|217829 pfam03985, Paf1, Paf1. Members of this family are components of
the RNA polymerase II associated Paf1 complex. The Paf1
complex functions during the elongation phase of
transcription in conjunction with Spt4-Spt5 and
Spt16-Pob3i.
Length = 431
Score = 30.5 bits (69), Expect = 0.12
Identities = 16/59 (27%), Positives = 24/59 (40%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
+ ED+EE + D E+E + EE+ EDG E G + K K +
Sbjct: 372 EDEDEEEEQRSDEHEEEEGEDSEEEGSQSREDGSSESSSDVGSDSESKADKESASDSDS 430
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 30.3 bits (68), Expect = 0.13
Identities = 12/68 (17%), Positives = 16/68 (23%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
A S + + S ED E E E D + +
Sbjct: 146 PHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQ 205
Query: 97 RKHDEEEE 104
DE E
Sbjct: 206 SPPDEPGE 213
Score = 29.9 bits (67), Expect = 0.20
Identities = 6/70 (8%), Positives = 15/70 (21%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
+E E S + E+ + + E E + +
Sbjct: 140 PPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPT 199
Query: 97 RKHDEEEEGE 106
+ +
Sbjct: 200 SSPPPQSPPD 209
Score = 27.2 bits (60), Expect = 1.6
Identities = 6/60 (10%), Positives = 10/60 (16%)
Query: 47 PDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
P++ E + + D EE E
Sbjct: 120 PENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPE 179
Score = 25.7 bits (56), Expect = 5.8
Identities = 6/52 (11%), Positives = 12/52 (23%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
A ++ P ++ + ED E+ E
Sbjct: 131 PASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPT 182
Score = 25.3 bits (55), Expect = 7.1
Identities = 5/70 (7%), Positives = 13/70 (18%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
G ++ + P E E+ E + +
Sbjct: 95 SGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPES 154
Query: 97 RKHDEEEEGE 106
++
Sbjct: 155 HNPSPNQQPS 164
>gnl|CDD|130712 TIGR01651, CobT, cobaltochelatase, CobT subunit. This model
describes Pseudomonas denitrificans CobT gene product,
which is a cobalt chelatase subunit that functions in
cobalamin biosynthesis. Cobalamin (vitamin B12) can be
synthesized via several pathways, including an aerobic
pathway (found in Pseudomonas denitrificans) and an
anaerobic pathway (found in P. shermanii and Salmonella
typhimurium). These pathways differ in the point of
cobalt insertion during corrin ring formation. There are
apparently a number of variations on these two pathways,
where the major differences seem to be concerned with
the process of ring contraction. Confusion regarding the
functions of enzymes found in the aerobic vs. anaerobic
pathways has arisen because nonhomologous genes in these
different pathways were given the same gene symbols.
Thus, cobT in the aerobic pathway (P. denitrificans) is
not a homolog of cobT in the anaerobic pathway (S.
typhimurium). It should be noted that E. coli
synthesizes cobalamin only when it is supplied with the
precursor cobinamide, which is a complex intermediate.
Additionally, all E. coli cobalamin synthesis genes
(cobU, cobS and cobT) were named after their Salmonella
typhimurium homologs which function in the anaerobic
cobalamin synthesis pathway. This model describes the
aerobic cobalamin pathway Pseudomonas denitrificans CobT
gene product, which is a cobalt chelatase subunit, with
a MW ~70 kDa. The aerobic pathway cobalt chelatase is a
heterotrimeric, ATP-dependent enzyme that catalyzes
cobalt insertion during cobalamin biosynthesis. The
other two subunits are the P. denitrificans CobS
(TIGR01650) and CobN (pfam02514 CobN/Magnesium
Chelatase) proteins. To avoid potential confusion with
the nonhomologous Salmonella typhimurium/E.coli cobT
gene product, the P. denitrificans gene symbol is not
used in the name of this model [Biosynthesis of
cofactors, prosthetic groups, and carriers, Heme,
porphyrin, and cobalamin].
Length = 600
Score = 30.3 bits (68), Expect = 0.15
Identities = 22/60 (36%), Positives = 29/60 (48%), Gaps = 2/60 (3%)
Query: 49 SEDDEESEGDDYEGEDEELEEEED--DGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
S + E GDD E EDEE +++ + EE GE E E EG + + E E GE
Sbjct: 196 SMELAEEMGDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGE 255
Score = 29.1 bits (65), Expect = 0.35
Identities = 11/54 (20%), Positives = 22/54 (40%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDE 101
+ E E S + E D E E E++ + + + +E + E G+ +
Sbjct: 232 EGEGQEGSAPQESEATDRESESGEEEMVQSDQDDLPDESDDDSETPGEGARPAR 285
Score = 26.4 bits (58), Expect = 3.3
Identities = 13/65 (20%), Positives = 29/65 (44%), Gaps = 1/65 (1%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
D +E + DD++ ++ E + E E E +G ++ E + ++E E +
Sbjct: 205 DDTESEDEEDGDDDQPTENEQEEQGEG-EGEGQEGSAPQESEATDRESESGEEEMVQSDQ 263
Query: 100 DEEEE 104
D+ +
Sbjct: 264 DDLPD 268
Score = 26.1 bits (57), Expect = 4.8
Identities = 14/72 (19%), Positives = 30/72 (41%), Gaps = 3/72 (4%)
Query: 38 GFDAKNNEAPDSEDDE-ESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE--DTEGKEVRG 94
G D ++ + D +DD+ + +GE E +E + E + E E + E +
Sbjct: 204 GDDTESEDEEDGDDDQPTENEQEEQGEGEGEGQEGSAPQESEATDRESESGEEEMVQSDQ 263
Query: 95 KKRKHDEEEEGE 106
+ +++ E
Sbjct: 264 DDLPDESDDDSE 275
>gnl|CDD|233914 TIGR02538, type_IV_pilB, type IV-A pilus assembly ATPase PilB.
This model describes a protein of type IV pilus
biogenesis designated PilB in Pseudomonas aeruginosa but
PilF in Neisseria gonorrhoeae; the more common usage,
reflected here, is PilB. This protein is an ATPase
involved in protein export for pilin assembly and is
closely related to GspE (TIGR02533) of type II
secretion, also called the main terminal branch of the
general secretion pathway. Note that type IV pilus
systems are often divided into type IV-A and IV-B, with
the latter group including bundle-forming pilus,
mannose-sensitive hemagglutinin, etc. Members of this
family are found in type IV-A systems [Cell envelope,
Surface structures, Protein fate, Protein and peptide
secretion and trafficking].
Length = 564
Score = 30.0 bits (68), Expect = 0.20
Identities = 12/42 (28%), Positives = 19/42 (45%), Gaps = 1/42 (2%)
Query: 51 DDEESEGDDYEGEDEELEEEED-DGLDEEDGEDEEEDTEGKE 91
+S + ED EE D D +D+E +D E+D +
Sbjct: 137 GGSDSLAKELGDEDIGDLEELDVDAIDDEGPDDIEQDAVDDD 178
Score = 29.6 bits (67), Expect = 0.24
Identities = 9/40 (22%), Positives = 22/40 (55%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDT 87
DS E + D + E+ +++ +D+G D+ + + ++D
Sbjct: 140 DSLAKELGDEDIGDLEELDVDAIDDEGPDDIEQDAVDDDA 179
>gnl|CDD|216289 pfam01080, Presenilin, Presenilin. Mutations in presenilin-1 are a
major cause of early onset Alzheimer's disease. It has
been found that presenilin-1 binds to beta-catenin
in-vivo. This family also contains SPE proteins from
C.elegans.
Length = 403
Score = 29.8 bits (67), Expect = 0.24
Identities = 15/61 (24%), Positives = 27/61 (44%), Gaps = 2/61 (3%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
E P+ E D E + E E + D L E+ E+ + + + +++ +EEE
Sbjct: 267 ELPEKSSTPELSDD--ESDSSETESQSDSSLAPEEDAAEQPEVQSNSLPSNEKREEEEER 324
Query: 105 G 105
G
Sbjct: 325 G 325
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 29.6 bits (67), Expect = 0.24
Identities = 15/63 (23%), Positives = 32/63 (50%)
Query: 44 NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
++ + + EE E D EGE ++E +++ + + E+E+++ K + EE+
Sbjct: 98 DDEEEEWEVEEDEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKKAKEDSDEELSEED 157
Query: 104 EGE 106
E E
Sbjct: 158 EEE 160
Score = 26.2 bits (58), Expect = 4.2
Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 6/90 (6%)
Query: 23 REKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLD------ 76
E+V + +P L+ L+ + + + ++E ES+ DD E E+ E+EE+ED +
Sbjct: 62 EEEVVDGIPGLELLEKWKEEERKKKEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWIDV 121
Query: 77 EEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
E D E E D+E +E + + K +E+ E
Sbjct: 122 ESDKEIESSDSEDEEEKDEAAKKAKEDSDE 151
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 29.8 bits (67), Expect = 0.25
Identities = 17/53 (32%), Positives = 26/53 (49%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHD 100
+SE +ESE D+ E D E EE E D + + + +E E E G+ +
Sbjct: 253 ESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSESDESDEDTETPGEDARPA 305
Score = 29.0 bits (65), Expect = 0.41
Identities = 17/68 (25%), Positives = 29/68 (42%), Gaps = 8/68 (11%)
Query: 47 PDSEDDEESEGDDYEGEDEELEEEEDDGLDE--------EDGEDEEEDTEGKEVRGKKRK 98
D+++++ + ED E EE +G DE DGE EE + + E
Sbjct: 229 EDADEEDGDDDQPDNNEDSEAGREESEGSDESEEDEAEATDGEGEEGEMDAAEASEDSES 288
Query: 99 HDEEEEGE 106
+ +E+ E
Sbjct: 289 DESDEDTE 296
>gnl|CDD|219240 pfam06957, COPI_C, Coatomer (COPI) alpha subunit C-terminus. This
family represents the C-terminus (approximately 500
residues) of the eukaryotic coatomer alpha subunit.
Coatomer (COPI) is a large cytosolic protein complex
which forms a coat around vesicles budding from the
Golgi apparatus. Such coatomer-coated vesicles have been
proposed to play a role in many distinct steps of
intracellular transport. Note that many family members
also contain the pfam04053 domain.
Length = 421
Score = 29.4 bits (66), Expect = 0.33
Identities = 18/62 (29%), Positives = 29/62 (46%), Gaps = 10/62 (16%)
Query: 44 NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
+ + D GDD+ GED +L +ED L+ EDG ++E+ + +EE
Sbjct: 25 GDKAAAADFAVMGGDDW-GEDADLGLDEDGYLEGEDGLLDDEEGP---------EGGDEE 74
Query: 104 EG 105
G
Sbjct: 75 GG 76
Score = 26.4 bits (58), Expect = 2.9
Identities = 15/39 (38%), Positives = 20/39 (51%), Gaps = 4/39 (10%)
Query: 50 EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
++D EG+D +DEE E D EE G D E+D
Sbjct: 50 DEDGYLEGEDGLLDDEEGPEGGD----EEGGWDVEDDLV 84
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 29.2 bits (66), Expect = 0.36
Identities = 9/43 (20%), Positives = 14/43 (32%), Gaps = 6/43 (13%)
Query: 47 PDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
P E E EDD ED E++++ +
Sbjct: 259 PAPPQPPEEE------PPPPPPPPEDDDDPPEDEEEQDDAEDR 295
Score = 26.9 bits (60), Expect = 2.5
Identities = 10/37 (27%), Positives = 16/37 (43%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGE 81
+ P+ E + + + E EEE+DD D E
Sbjct: 263 QPPEEEPPPPPPPPEDDDDPPEDEEEQDDAEDRALEE 299
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 29.0 bits (66), Expect = 0.40
Identities = 10/39 (25%), Positives = 19/39 (48%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEED 86
D D +E D E E ++D+ +EE+ E+++
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSL 209
Score = 28.6 bits (65), Expect = 0.50
Identities = 10/39 (25%), Positives = 23/39 (58%)
Query: 47 PDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
P++E+D G + E D++ +EEE++ +++ +E
Sbjct: 176 PNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADES 214
Score = 26.3 bits (59), Expect = 3.4
Identities = 13/38 (34%), Positives = 19/38 (50%)
Query: 44 NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGE 81
EAPD++D +E + DE+ EEE L + E
Sbjct: 61 EEAPDADDLLLAENEADAQTDEDAEEEAAAALSSVESE 98
Score = 25.5 bits (57), Expect = 6.4
Identities = 10/45 (22%), Positives = 20/45 (44%)
Query: 59 DYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
D + E+ G + E+ +D+E++ E ++ DE E
Sbjct: 171 DGFVDPNAEEDPAHVGSELEELDDDEDEEEEEDENDDSLAADESE 215
>gnl|CDD|140234 PTZ00207, PTZ00207, hypothetical protein; Provisional.
Length = 591
Score = 29.0 bits (65), Expect = 0.41
Identities = 11/49 (22%), Positives = 21/49 (42%), Gaps = 6/49 (12%)
Query: 29 LLPNLKY-LDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLD 76
+ P + + L FD K DD + + ++ EE+ ED ++
Sbjct: 289 IFPFMAFPLTTFDGKR-----PHDDSDGKAKEHVEAGEEVSAAEDKVVE 332
>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
Length = 390
Score = 29.1 bits (66), Expect = 0.42
Identities = 20/60 (33%), Positives = 32/60 (53%), Gaps = 2/60 (3%)
Query: 33 LKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
+ +++ + E ++ + E DDY E E EEEDD D+ D + +E+D EG EV
Sbjct: 329 MTFIEENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDD--DDWDDDWDEDDDEGVEV 386
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This
is a family of uncharacterized proteins.
Length = 449
Score = 28.8 bits (64), Expect = 0.46
Identities = 16/43 (37%), Positives = 32/43 (74%), Gaps = 2/43 (4%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDT 87
EA + EDD+E + DD + EDE+ ++++DD ++++ ED+++ T
Sbjct: 48 EAMEEEDDDEEDDDDDDDEDEDDDDDDDD--EDDEDEDDDDST 88
Score = 28.0 bits (62), Expect = 0.94
Identities = 20/61 (32%), Positives = 31/61 (50%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
E D +DD+E E DD + +DE+ E+E+DD D ++ E G DE ++
Sbjct: 57 EEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNEAGFADSDDESDD 116
Query: 105 G 105
G
Sbjct: 117 G 117
Score = 26.9 bits (59), Expect = 2.3
Identities = 17/43 (39%), Positives = 29/43 (67%)
Query: 50 EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
E+ E E DD E +D++ +E+EDD D++D +DE+ED + +
Sbjct: 47 EEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTL 89
Score = 26.1 bits (57), Expect = 4.8
Identities = 14/51 (27%), Positives = 30/51 (58%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGK 90
D + D +D++E + DD + ED+E E+++D L ++ D+ +T+ +
Sbjct: 54 DDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDGNETDNE 104
Score = 25.3 bits (55), Expect = 8.9
Identities = 10/56 (17%), Positives = 32/56 (57%)
Query: 50 EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEG 105
++ EE ++ + ++E+ ++++D+ D++D +D+E+D + + ++G
Sbjct: 43 KEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDEDEDDDDSTLHDDSSADDG 98
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1.
Members of this family are necessary for accurate
chromosome transmission during cell division.
Length = 804
Score = 29.0 bits (65), Expect = 0.46
Identities = 12/53 (22%), Positives = 23/53 (43%), Gaps = 4/53 (7%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
+ D+ D E++ Y+G ++L+EE+D DE G + +
Sbjct: 14 RAPAGGGGGSFDAIDFEDT----YDGLGDQLDEEDDALNDETFGGVADVGKDF 62
Score = 27.4 bits (61), Expect = 1.4
Identities = 14/49 (28%), Positives = 18/49 (36%), Gaps = 8/49 (16%)
Query: 38 GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEED 86
GFD D G D E+ DGL ++ DEE+D
Sbjct: 5 GFDTSLPP------DRAPAGGGGGSFDAIDFEDTYDGLGDQ--LDEEDD 45
>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
subunit 8 N-terminus. The largest of the mammalian
translation initiation factors, eIF3, consists of at
least eight subunits ranging in mass from 35 to 170 kDa.
eIF3 binds to the 40 S ribosome in an early step of
translation initiation and promotes the binding of
methionyl-tRNAi and mRNA.
Length = 593
Score = 28.6 bits (64), Expect = 0.56
Identities = 28/69 (40%), Positives = 35/69 (50%), Gaps = 6/69 (8%)
Query: 42 KNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEED----TEGKEVRGKKR 97
KNN+ EDD +D E EDEE EE+EDD D D EDE+ED TE +
Sbjct: 119 KNNK--QFEDDITRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESG 176
Query: 98 KHDEEEEGE 106
+E+ E
Sbjct: 177 VDRVKEDDE 185
Score = 27.8 bits (62), Expect = 0.94
Identities = 20/80 (25%), Positives = 40/80 (50%), Gaps = 16/80 (20%)
Query: 41 AKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGL----------------DEEDGEDEE 84
+ E P+SED+EE E +D + + + E+E++DG+ +ED E++E
Sbjct: 129 TRYREDPESEDEEEEEDEDDDDDGSDDEDEDEDGVGATEEVAASSESGVDRVKEDDEEDE 188
Query: 85 EDTEGKEVRGKKRKHDEEEE 104
+ K+ ++ K ++ E
Sbjct: 189 DADLSKKDVLEEPKMFKKPE 208
>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex. This Mediator
complex subunit was formerly known as Srb4 in yeasts or
Trap80 in Drosophila and human. The Med17 subunit is
located within the head domain and is essential for
cell viability to the extent that a mutant strain of
cerevisiae lacking it shows all RNA polymerase
II-dependent transcription ceasing at non-permissive
temperatures.
Length = 454
Score = 28.5 bits (64), Expect = 0.58
Identities = 14/38 (36%), Positives = 21/38 (55%)
Query: 51 DDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
E ++ D E DEE ++EEDD D E+ +D E+
Sbjct: 59 AKEAAKIDFSEESDEEEDDEEDDNDDSEENKDTVEEFP 96
>gnl|CDD|223052 PHA03346, PHA03346, US22 family homolog; Provisional.
Length = 520
Score = 28.6 bits (64), Expect = 0.60
Identities = 11/63 (17%), Positives = 27/63 (42%), Gaps = 13/63 (20%)
Query: 40 DAKNNEAPDSEDDEESEGDD-------------YEGEDEELEEEEDDGLDEEDGEDEEED 86
+ P++ED+ + G + +D+E ++E+DD ++ + +E+D
Sbjct: 392 RRRPRGPPENEDEADGGGGLAPPPEPPRPPPPPFPDDDDEGDDEDDDDWEDLGFDLDEDD 451
Query: 87 TEG 89
Sbjct: 452 VYD 454
Score = 25.1 bits (55), Expect = 9.7
Identities = 13/40 (32%), Positives = 20/40 (50%)
Query: 46 APDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
P D++ EGDD + +D E + D D D +D +E
Sbjct: 421 PPPPFPDDDDEGDDEDDDDWEDLGFDLDEDDVYDLKDVDE 460
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 28.1 bits (63), Expect = 0.60
Identities = 17/38 (44%), Positives = 26/38 (68%)
Query: 56 EGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVR 93
+D +DEE EEEEDD D+++ E EEE++ K+V+
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEESPVKKVK 146
Score = 25.7 bits (57), Expect = 4.0
Identities = 14/31 (45%), Positives = 23/31 (74%)
Query: 55 SEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
SE D+ + ++E+ EEE+D+ D+ED +EEE
Sbjct: 109 SEEDESDDDEEDEEEEDDEEDDDEDESEEEE 139
>gnl|CDD|130366 TIGR01299, synapt_SV2, synaptic vesicle protein SV2. This model
describes a tightly conserved subfamily of the larger
family of sugar (and other) transporters described by
PFAM model pfam00083. Members of this subfamily include
closely related forms SV2A and SV2B of synaptic vesicle
protein from vertebrates and a more distantly related
homolog (below trusted cutoff) from Drosophila
melanogaster. Members are predicted to have two sets of
six transmembrane helices.
Length = 742
Score = 28.4 bits (63), Expect = 0.61
Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 1/54 (1%)
Query: 45 EAPDSEDDEESEGDDY-EGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKR 97
E D +DD + D Y GE + EEE + D +G DE+++ E +G R
Sbjct: 50 EDEDDDDDFPAPADGYSRGEAADDEEEGEASSDATEGHDEDDEIYEGEYQGIPR 103
>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984). This
family of proteins is functionally uncharacterized. This
family of proteins is found in eukaryotes. Proteins in
this family are typically between 393 and 442 amino
acids in length.
Length = 301
Score = 28.1 bits (63), Expect = 0.74
Identities = 13/57 (22%), Positives = 18/57 (31%), Gaps = 8/57 (14%)
Query: 47 PDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEE 103
+ D G D+ ED E D D E ++E + DE E
Sbjct: 224 AEDSRDTVLLGPDFVDEDIRAEMASIDDESFSSLSDSESESEDEI--------DEAE 272
>gnl|CDD|223041 PHA03321, PHA03321, tegument protein VP11/12; Provisional.
Length = 694
Score = 28.4 bits (63), Expect = 0.77
Identities = 13/72 (18%), Positives = 22/72 (30%), Gaps = 20/72 (27%)
Query: 46 APDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGED------------EEEDTEGKEVR 93
P + E D Y E +DD D DG ++D + V
Sbjct: 527 GPPAAAPPEQMEDPYL-------EPDDDRFDRRDGAAAAATSHPREAPAPDDDPIYEGVS 579
Query: 94 -GKKRKHDEEEE 104
++ ++E
Sbjct: 580 DSEEPVYEEIPT 591
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 28.1 bits (63), Expect = 0.78
Identities = 15/50 (30%), Positives = 29/50 (58%)
Query: 53 EESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEE 102
EE + +++E E+EE EEE D D+ + ++ E D E + + +R+ +
Sbjct: 36 EEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREERLK 85
Score = 25.8 bits (57), Expect = 4.8
Identities = 16/55 (29%), Positives = 31/55 (56%)
Query: 44 NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
E D E+ E E ++ E D + ++ EDD + +D E+ E++ + +E KK++
Sbjct: 35 EEEEDDEEFEIEEEEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREERLKKKKR 89
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 28.2 bits (63), Expect = 0.82
Identities = 18/60 (30%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 40 DAKNNEAPDSEDDEESE-GDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
+ N +A + ED+EE E G D + +E + DE++ +DEE ++ K K+ K
Sbjct: 3 EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREK 62
Score = 26.7 bits (59), Expect = 3.1
Identities = 15/47 (31%), Positives = 17/47 (36%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
E E D+ E E D DE EDE+ED E E
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDE 47
Score = 25.5 bits (56), Expect = 6.5
Identities = 13/46 (28%), Positives = 21/46 (45%)
Query: 61 EGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
E + + EEED+ E D++ EV + DEE++ E
Sbjct: 3 EQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEE 48
Score = 25.1 bits (55), Expect = 8.4
Identities = 15/63 (23%), Positives = 31/63 (49%), Gaps = 9/63 (14%)
Query: 50 EDDEESEGDDYEGEDEELE---------EEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHD 100
E+++ + + E ++EELE ++D+ E++ EDEE+D E + +
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKR 60
Query: 101 EEE 103
E+
Sbjct: 61 EKA 63
Score = 25.1 bits (55), Expect = 9.9
Identities = 9/52 (17%), Positives = 22/52 (42%)
Query: 54 ESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEG 105
E E + + +EE EEE + + ++++ ++ + +E E
Sbjct: 1 EEEQVNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESP 52
>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
Toc86/159, G and M domains. The long precursor of the
86K protein originally described is proposed to have
three domains. The N-terminal A-domain is acidic,
repetitive, weakly conserved, readily removed by
proteolysis during chloroplast isolation, and not
required for protein translocation. The other domains
are designated G (GTPase) and M (membrane anchor); this
family includes most of the G domain and all of M
[Transport and binding proteins, Amino acids, peptides
and amines].
Length = 763
Score = 28.4 bits (63), Expect = 0.82
Identities = 16/30 (53%), Positives = 18/30 (60%), Gaps = 2/30 (6%)
Query: 54 ESEGDDYEGEDEELEEEEDDGLDEEDGEDE 83
E +G D E D ELE+ D DEE GEDE
Sbjct: 379 EQQGGDEEDSDIELEDSSDS--DEESGEDE 406
>gnl|CDD|240226 PTZ00007, PTZ00007, (NAP-L) nucleosome assembly protein -L;
Provisional.
Length = 337
Score = 27.8 bits (62), Expect = 0.90
Identities = 14/55 (25%), Positives = 24/55 (43%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
EA D + D S+ DD + + + + + + +EE+D KE G K
Sbjct: 278 EAIDEDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEKESNGAKSNE 332
Score = 25.1 bits (55), Expect = 9.2
Identities = 8/53 (15%), Positives = 20/53 (37%)
Query: 50 EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEE 102
++D + D+ + + + + + D D E+ + E K +E
Sbjct: 281 DEDSDYSSDEDDDDYDSYDSSDSASSDSNSDVDTNEEDDRGEKESNGAKSNEL 333
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family
belongs to the basic helix-loop-helix leucine zipper
class of transcription factors, see pfam00010. Myc forms
a heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20
residues of this domain cause unique changes in the
induction of apoptosis, transformation, and G2 arrest.
Length = 329
Score = 27.6 bits (61), Expect = 1.1
Identities = 16/35 (45%), Positives = 19/35 (54%)
Query: 38 GFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEED 72
G N + S D ESE D+ E E+EE EEE D
Sbjct: 211 GLRTPPNSSSSSGSDSESEEDEEEEEEEEEEEEID 245
Score = 27.6 bits (61), Expect = 1.2
Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 2/40 (5%)
Query: 62 GEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDE 101
G D E EE+E++ +EE+ E+EE D E R
Sbjct: 223 GSDSESEEDEEE--EEEEEEEEEIDVVTVEKRRSSSNRKA 260
>gnl|CDD|115196 pfam06524, NOA36, NOA36 protein. This family consists of several
NOA36 proteins which contain 29 highly conserved
cysteine residues. The function of this protein is
unknown.
Length = 314
Score = 27.6 bits (61), Expect = 1.1
Identities = 12/47 (25%), Positives = 23/47 (48%), Gaps = 2/47 (4%)
Query: 42 KNNEAPDSEDDEESEGDDYEGEDEELEE--EEDDGLDEEDGEDEEED 86
K N + + D + YE +D+E + D+ D +D ++E+ D
Sbjct: 253 KQNLSSNKYGDFKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNEDND 299
Score = 27.6 bits (61), Expect = 1.2
Identities = 17/52 (32%), Positives = 25/52 (48%), Gaps = 3/52 (5%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
G N +S+DDE S +DY DEE + ++DD D +D D +
Sbjct: 261 YGDFKGVNYGYESDDDEGSSSNDY---DEEEDGDDDDNEDNDDTNTNHGDED 309
Score = 27.2 bits (60), Expect = 1.6
Identities = 8/46 (17%), Positives = 20/46 (43%)
Query: 43 NNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
+N+ D + D + + +E++ D++D ED ++
Sbjct: 258 SNKYGDFKGVNYGYESDDDEGSSSNDYDEEEDGDDDDNEDNDDTNT 303
>gnl|CDD|240329 PTZ00248, PTZ00248, eukaryotic translation initiation factor 2
subunit 1; Provisional.
Length = 319
Score = 27.7 bits (62), Expect = 1.2
Identities = 13/44 (29%), Positives = 26/44 (59%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDE 83
+ E + ++EE E D E EDE+ E+E+++ +++D D+
Sbjct: 276 EEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 26.9 bits (60), Expect = 1.7
Identities = 15/44 (34%), Positives = 28/44 (63%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
+ + +E E + E E+++ E ED+ ++ED E+EE+D EG +
Sbjct: 276 EEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDEGDK 319
Score = 26.2 bits (58), Expect = 3.3
Identities = 12/43 (27%), Positives = 26/43 (60%)
Query: 49 SEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
D+E+ E + E+EE E++ + DE++ +++EE+ E +
Sbjct: 273 GGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDD 315
Score = 26.2 bits (58), Expect = 3.9
Identities = 14/43 (32%), Positives = 27/43 (62%)
Query: 43 NNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEE 85
+E E E++E ++ E + E E+E+++ DEE+ ED++E
Sbjct: 274 GDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDDE 316
Score = 25.4 bits (56), Expect = 7.5
Identities = 12/43 (27%), Positives = 26/43 (60%)
Query: 62 GEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
G DEE EE + +EE+ ED+ ++E ++ + + +E+++
Sbjct: 273 GGDEEDLEELLEKAEEEEEEDDYSESEDEDEEDEDEEEEEDDD 315
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 27.4 bits (61), Expect = 1.2
Identities = 13/51 (25%), Positives = 25/51 (49%)
Query: 55 SEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEEG 105
S DD E E EE +E + D + + E ++ + +EV K + ++ +
Sbjct: 92 SSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTE 142
Score = 26.3 bits (58), Expect = 2.7
Identities = 10/31 (32%), Positives = 15/31 (48%)
Query: 76 DEEDGEDEEEDTEGKEVRGKKRKHDEEEEGE 106
D+E+ E EEE T+ E E +E +
Sbjct: 95 DDEEEETEEESTDETEQEDPPETKTESKEKK 125
>gnl|CDD|236987 PRK11798, PRK11798, ClpXP protease specificity-enhancing factor;
Provisional.
Length = 138
Score = 27.1 bits (61), Expect = 1.2
Identities = 8/31 (25%), Positives = 12/31 (38%)
Query: 69 EEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
E E ++E +D + D E R H
Sbjct: 103 EPEAAYDEDEHDDDTDPDDEPPPPPRGGRPH 133
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 27.3 bits (61), Expect = 1.2
Identities = 15/53 (28%), Positives = 29/53 (54%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHD 100
S++D+E E E ++++++++EDD D+ D E K + KK + D
Sbjct: 3 KSDEDDEFEVVPAEEDEKKVDKDEDDDKDDIDDLTAEALALAKLLARKKTRED 55
>gnl|CDD|233870 TIGR02442, Cob-chelat-sub, cobaltochelatase subunit.
Cobaltochelatase is responsible for the insertion of
cobalt into the corrin ring of coenzyme B12 during its
biosynthesis. Two versions have been well described.
CbiK/CbiX is a monomeric, anaerobic version which acts
early in the biosynthesis (pfam06180). CobNST is a
trimeric, ATP-dependent, aerobic version which acts late
in the biosynthesis (TIGR02257/TIGR01650/TIGR01651). A
number of genomes (actinobacteria, cyanobacteria,
betaproteobacteria and pseudomonads) which apparently
biosynthesize B12, encode a cobN gene but are
demonstrably lacking cobS and cobT. These genomes do,
however contain a homolog (modelled here) of the
magnesium chelatase subunits BchI/BchD family. Aside
from the cyanobacteria (which have a separate magnesium
chelatase trimer), these species do not make chlorins,
so do not have any use for a magnesium chelatase.
Furthermore, in nearly all cases the members of this
family are proximal to either CobN itself or other genes
involved in cobalt transport or B12 biosynthesis.
Length = 633
Score = 27.7 bits (62), Expect = 1.2
Identities = 20/87 (22%), Positives = 36/87 (41%), Gaps = 3/87 (3%)
Query: 8 HVDLFNNDVTTIENYREKVFELLPNLKYLDGFDAKNNEAPDSEDDE-ESEGDDYEGEDEE 66
L T E+ RE +LP+ + F+ + +++ E G D E DE
Sbjct: 283 LAALDGRRRVTAEDVREAAELVLPHRRRRKPFEQPQGKDEKDLEEKPEEPGPDPEKPDEG 342
Query: 67 LEEEEDDG--LDEEDGEDEEEDTEGKE 91
++ E G G D+E++ + +E
Sbjct: 343 EDDAEQSGPRGHPTPGNDDEKEPDPQE 369
Score = 26.6 bits (59), Expect = 2.9
Identities = 14/48 (29%), Positives = 20/48 (41%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGK 95
+ EDD E G +E+E D +E DG+ D G R +
Sbjct: 341 EGEDDAEQSGPRGHPTPGNDDEKEPDPQEEADGQGSSTDPAGDIFRIR 388
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 27.5 bits (61), Expect = 1.2
Identities = 12/46 (26%), Positives = 18/46 (39%)
Query: 53 EESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
EE DD E+E EE E+ + + + R K+K
Sbjct: 242 EEESIDDELDVLREIEAEEAGIGPIEEEVVDSQAANDEPRRVFKKK 287
>gnl|CDD|217861 pfam04050, Upf2, Up-frameshift suppressor 2. Transcripts
harbouring premature signals for translation termination
are recognised and rapidly degraded by eukaryotic cells
through a pathway known as nonsense-mediated mRNA decay.
In Saccharomyces cerevisiae, three trans-acting factors
(Upf1 to Upf3) are required for nonsense-mediated mRNA
decay.
Length = 171
Score = 27.3 bits (61), Expect = 1.3
Identities = 18/64 (28%), Positives = 34/64 (53%)
Query: 43 NNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEE 102
+E+ + +E+ E + + +DE +EEE D D+E E+ + + E V ++ + D E
Sbjct: 2 GSESESDDGEEDEELPEEDEDDESSDEEEVDLPDDEQDEESDSEEEQIFVTRQEEEVDPE 61
Query: 103 EEGE 106
E E
Sbjct: 62 AEAE 65
>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740). This
family consists of several uncharacterized plant
proteins of unknown function.
Length = 565
Score = 27.7 bits (61), Expect = 1.4
Identities = 16/54 (29%), Positives = 26/54 (48%), Gaps = 7/54 (12%)
Query: 47 PDSEDDEESEGDDYEGEDEELEEEEDDGLDEED-------GEDEEEDTEGKEVR 93
D E + D E+EE+E EED+ E++ +EEE+ E K ++
Sbjct: 126 IDPEPRKPIVPDLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEEEELKTMK 179
Score = 25.4 bits (55), Expect = 7.9
Identities = 12/36 (33%), Positives = 18/36 (50%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEED 72
D + E ED+E E + + DE+ EEEE+
Sbjct: 137 DLVLEEEEEVEMEEDEEYYEKEPGKVVDEKSEEEEE 172
>gnl|CDD|222792 PHA00435, PHA00435, capsid assembly protein.
Length = 306
Score = 27.5 bits (61), Expect = 1.4
Identities = 20/58 (34%), Positives = 29/58 (50%), Gaps = 3/58 (5%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEE--LEEEEDDGLDEEDGEDEEEDTEGKEV 92
DG DA P++ DD D GED+E +E + +EE+ E+ EED E +
Sbjct: 40 DGDDAIELAEPETSDDPYG-NPDPFGEDDEGRIEVRISEDGEEEEVEEGEEDEEEEGE 96
>gnl|CDD|217059 pfam02480, Herpes_gE, Alphaherpesvirus glycoprotein E.
Glycoprotein E (gE) of Alphaherpesvirus forms a complex
with glycoprotein I (gI) (pfam01688), functioning as an
immunoglobulin G (IgG) Fc binding protein. gE is
involved in virus spread but is not essential for
propagation.
Length = 437
Score = 27.4 bits (61), Expect = 1.6
Identities = 5/27 (18%), Positives = 12/27 (44%)
Query: 41 AKNNEAPDSEDDEESEGDDYEGEDEEL 67
N D D ES+ ++++ + +
Sbjct: 399 PTNELFEDVSSDSESDDEEFDSDSDRE 425
>gnl|CDD|217373 pfam03115, Astro_capsid, Astrovirus capsid protein precursor. This
product is encoded by astrovirus ORF2, one of the three
astrovirus ORFs (1a, 1b, 2). The 87kD precursor protein
undergoes an intracellular cleavage to form a 79kD
protein. Subsequently, extracellular trypsin cleavage
yields the three proteins forming the infectious virion.
Length = 787
Score = 27.4 bits (61), Expect = 1.7
Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 4/51 (7%)
Query: 45 EAPDSEDD----EESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
+APD +D EE++ +D EDE+ E + D D E E++D +
Sbjct: 661 DAPDPYEDLISLEETDTEDESTEDEDDELDRFDLHDSSGSEPEDDDENNRV 711
>gnl|CDD|227504 COG5177, COG5177, Uncharacterized conserved protein [Function
unknown].
Length = 769
Score = 27.4 bits (60), Expect = 1.7
Identities = 16/59 (27%), Positives = 23/59 (38%), Gaps = 4/59 (6%)
Query: 52 DEESEGDDYEGEDEELEEEEDDGLDEEDGE----DEEEDTEGKEVRGKKRKHDEEEEGE 106
DEE E E+ + +DD E D E DEE + E + +EE +
Sbjct: 395 DEEEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLR 453
Score = 25.8 bits (56), Expect = 5.8
Identities = 20/57 (35%), Positives = 28/57 (49%)
Query: 36 LDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
+D D K N+ + DEES DD EG +E EEE+ L E ++E+ E
Sbjct: 413 IDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLREFRDMEKEDREFPDEA 469
Score = 25.4 bits (55), Expect = 7.7
Identities = 16/65 (24%), Positives = 29/65 (44%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
+ ++ + D E + DD E++ E D+ +D E EE + +E R +
Sbjct: 397 EEEDGQCNDEESTMSAIDDDDPKENDNEEVAGDEESAIDDNEGFEELSPEEEERQLREFR 456
Query: 100 DEEEE 104
D E+E
Sbjct: 457 DMEKE 461
Score = 25.0 bits (54), Expect = 9.6
Identities = 19/81 (23%), Positives = 31/81 (38%), Gaps = 1/81 (1%)
Query: 24 EKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGL-DEEDGED 82
+ ++ L ++ D E + Y+ + E EEEED DEE
Sbjct: 352 DFEYDGLTTVRIDDHGFLPGREQTSKKAAVPKGTSFYQAKWAEDEEEEDGQCNDEESTMS 411
Query: 83 EEEDTEGKEVRGKKRKHDEEE 103
+D + KE ++ DEE
Sbjct: 412 AIDDDDPKENDNEEVAGDEES 432
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are
transmembrane heparin sulfate proteoglycans which are
implicated in the binding of extracellular matrix
components and growth factors.
Length = 207
Score = 27.0 bits (60), Expect = 1.8
Identities = 15/48 (31%), Positives = 21/48 (43%), Gaps = 6/48 (12%)
Query: 41 AKNNEAPDSEDDEESEGDDYEGEDEELEEEED------DGLDEEDGED 82
A + DEE GDD E D+E+++E D+ED E
Sbjct: 23 AAAEYPDERYLDEEGSGDDDEFIDDEMDDEYSGSGSGATPSDDEDSEP 70
>gnl|CDD|223059 PHA03367, PHA03367, single-stranded DNA binding protein; Provisional.
Length = 1115
Score = 27.3 bits (61), Expect = 2.0
Identities = 11/48 (22%), Positives = 18/48 (37%), Gaps = 4/48 (8%)
Query: 54 ESEGDDYEGEDEELEEEEDD---GLDEEDGEDEEEDTEGKEVRGKKRK 98
E+ D G++ E E+ G+ D + GKKR+
Sbjct: 1068 EAVADAPGGDELEFVAVEEPQSVGVQAGDEF-LLPGGGSGLLAGKKRR 1114
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 27.1 bits (60), Expect = 2.1
Identities = 16/59 (27%), Positives = 25/59 (42%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
K + ED+E + E+EE EE E L +++ E EE R + R+
Sbjct: 297 RLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLPKKEEEKEEVTVSPDGGRRRGRR 355
>gnl|CDD|179525 PRK03003, PRK03003, GTP-binding protein Der; Reviewed.
Length = 472
Score = 26.9 bits (60), Expect = 2.2
Identities = 9/34 (26%), Positives = 14/34 (41%)
Query: 56 EGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
+ + D +E D LD+ED + E G
Sbjct: 3 DDEAEMRADGTWADESDWELDDEDLAELEAAEGG 36
>gnl|CDD|215628 PLN03196, PLN03196, MOC1-like protein; Provisional.
Length = 487
Score = 27.0 bits (60), Expect = 2.4
Identities = 14/45 (31%), Positives = 20/45 (44%), Gaps = 1/45 (2%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEV 92
D + EE E G E+ E DEED +D++E + V
Sbjct: 442 DFIEGEEMEPLFSMGGKLEMPGSESVS-DEEDDDDDDEVLYRRTV 485
Score = 25.1 bits (55), Expect = 9.6
Identities = 10/35 (28%), Positives = 14/35 (40%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEED 79
E + E G E + +EEDD D+E
Sbjct: 445 EGEEMEPLFSMGGKLEMPGSESVSDEEDDDDDDEV 479
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 26.8 bits (59), Expect = 2.4
Identities = 13/35 (37%), Positives = 21/35 (60%)
Query: 49 SEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDE 83
S+DDE++ + + DE E++DD D ED +E
Sbjct: 284 SDDDEDAIETEEDDVDESAIEDDDDDSDWEDSVEE 318
>gnl|CDD|218055 pfam04386, SspB, Stringent starvation protein B. Escherichia coli
stringent starvation protein B (SspB), is thought to
enhance the specificity of degradation of tmRNA-tagged
proteins by the ClpXP protease. The tmRNA tag, also
known as ssrA, is an 11-aa peptide added to the C
terminus of proteins stalled during translation, targets
proteins for degradation by ClpXP and ClpAP. SspB a
cytoplasmic protein that specifically binds to residues
1-4 and 7 of the tag. Binding of SspB enhances
degradation of tagged proteins by ClpX, and masks
sequence elements important for ClpA interactions,
inhibiting degradation by ClpA. However, more recent
work has cast doubt on the importance of SspB in
wild-type cells. SspB is encoded in an operon whose
synthesis is stimulated by carbon, amino acid, and
phosphate starvation. SspB may play a special role
during nutrient stress, for example by ensuring rapid
degradation of the products of stalled translation,
without causing a global increase in degradation of all
ClpXP substrates.
Length = 153
Score = 26.4 bits (59), Expect = 2.6
Identities = 10/39 (25%), Positives = 19/39 (48%)
Query: 63 EDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDE 101
+E EEEE++ D +D ++E+ + K R +
Sbjct: 112 PEEADEEEEEEDDDADDSPEDEDPEPKDPPKTKGRPSLK 150
Score = 24.9 bits (55), Expect = 9.5
Identities = 9/33 (27%), Positives = 18/33 (54%)
Query: 58 DDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGK 90
+ E E+EE ++ +D DE+ + T+G+
Sbjct: 114 EADEEEEEEDDDADDSPEDEDPEPKDPPKTKGR 146
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 26.7 bits (59), Expect = 2.6
Identities = 17/62 (27%), Positives = 31/62 (50%), Gaps = 2/62 (3%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEE--DDGLDEEDGEDEEEDTEGKEVRGKKRKHDEE 102
E+ DSED+E+ + + +D++ EEEE ED + E+ E E+ + D+
Sbjct: 217 ESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMESGEMEAAEASADDT 276
Query: 103 EE 104
+
Sbjct: 277 PD 278
Score = 25.2 bits (55), Expect = 7.6
Identities = 18/66 (27%), Positives = 32/66 (48%), Gaps = 4/66 (6%)
Query: 41 AKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHD 100
+ + A + D+ ES D + EDE+ +E++ D++ GE+EE + +
Sbjct: 204 SSMDMAEELGDEPESA-DSEDNEDEDDPKEDE---DDDQGEEEESGSSDSLSEDSDASSE 259
Query: 101 EEEEGE 106
E E GE
Sbjct: 260 EMESGE 265
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 26.6 bits (59), Expect = 2.6
Identities = 15/64 (23%), Positives = 30/64 (46%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
+ E E+ E + +D E + + +E+E+ + E+ ++E D KE K +
Sbjct: 58 ETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKETEEKTESN 117
Query: 100 DEEE 103
E+E
Sbjct: 118 VEKE 121
Score = 25.1 bits (55), Expect = 8.0
Identities = 16/62 (25%), Positives = 32/62 (51%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
+A E + + D E EE++EEE + + ED ED+ + + E ++ + ++EE
Sbjct: 43 QAAADEQEAKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEES 102
Query: 105 GE 106
+
Sbjct: 103 SD 104
>gnl|CDD|132741 cd06943, NR_LBD_RXR_like, The ligand binding domain of the retinoid
X receptor and Ultraspiracle, members of nuclear
receptor superfamily. The ligand binding domain of the
retinoid X receptor (RXR) and Ultraspiracle (USP): This
family includes two evolutionary related nuclear
receptors: retinoid X receptor (RXR) and Ultraspiracle
(USP). RXR is a nuclear receptor in mammalian and USP is
its counterpart in invertebrates. The native ligand of
retinoid X receptor is 9-cis retinoic acid (RA). RXR
functions as a DNA binding partner by forming
heterodimers with other nuclear receptors including CAR,
FXR, LXR, PPAR, PXR, RAR, TR, and VDR. RXRs can play
different roles in these heterodimers. It acts either
as a structural component of the heterodimer complex,
required for DNA binding but not acting as a receptor or
as both a structural and a functional component of the
heterodimer, allowing 9-cis RA to signal through the
corresponding heterodimer. In addition, RXR can also
form homodimers, functioning as a receptor for 9-cis RA,
independently of other nuclear receptors. Ultraspiracle
(USP) plays similar roles as DNA binding partner of
other nuclear rec eptors in invertebrates. USP has no
known high-affinity ligand and is thought to be a silent
component in the heterodimeric complex with partner
receptors. Like other members of the nuclear receptor
(NR) superfamily of ligand-activated transcription
factors, RXR and USP have a central well conserved DNA
binding domain (DBD), a variable N-terminal domain, a
flexible hinge and a C-terminal ligand binding domain
(LBD).
Length = 207
Score = 26.5 bits (59), Expect = 2.6
Identities = 12/31 (38%), Positives = 17/31 (54%), Gaps = 6/31 (19%)
Query: 6 LKHVDLFNNDVTT------IENYREKVFELL 30
L+ + LFN DV +E+ REKV+ L
Sbjct: 137 LRAIILFNPDVKGLKSRQEVESLREKVYASL 167
>gnl|CDD|218555 pfam05320, Pox_RNA_Pol_19, Poxvirus DNA-directed RNA polymerase
19 kDa subunit. This family contains several
DNA-directed RNA polymerase 19 kDa polypeptides. The
Poxvirus DNA-directed RNA polymerase (EC: 2.7.7.6)
catalyzes DNA-template-directed extension of the 3'-end
of an RNA strand by one nucleotide at a time.
Length = 167
Score = 26.2 bits (58), Expect = 2.7
Identities = 13/35 (37%), Positives = 21/35 (60%)
Query: 52 DEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEED 86
++ + DYE +D++ EE E++ DEED E E
Sbjct: 2 EDSDDIIDYESDDDDSEEYEEEEEDEEDAESLESS 36
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 26.5 bits (59), Expect = 2.7
Identities = 9/32 (28%), Positives = 17/32 (53%)
Query: 47 PDSEDDEESEGDDYEGEDEELEEEEDDGLDEE 78
+ D SE DD D+ +E+++D D++
Sbjct: 1357 RKKKSDSSSEDDDDSEVDDSEDEDDEDDEDDD 1388
>gnl|CDD|220777 pfam10486, PI3K_1B_p101, Phosphoinositide 3-kinase gamma adapter
protein p101 subunit. Class I PI3Ks are dual-specific
lipid and protein kinases involved in numerous
intracellular signaling pathways. Class IB PI3K,
p110gamma, is mainly activated by seven-transmembrane
G-protein-coupled receptors (GPCRs), through its
regulatory subunit p101 and G-protein beta-gamma
subunits.
Length = 856
Score = 26.9 bits (59), Expect = 2.7
Identities = 13/32 (40%), Positives = 19/32 (59%)
Query: 47 PDSEDDEESEGDDYEGEDEELEEEEDDGLDEE 78
+S+ E +D E E+EE E++E DGL E
Sbjct: 303 KESQLLEPVLSEDEEVEEEEEEDDETDGLSPE 334
Score = 26.5 bits (58), Expect = 3.3
Identities = 14/26 (53%), Positives = 18/26 (69%), Gaps = 1/26 (3%)
Query: 62 GEDEELEEEEDDGLDEEDGEDEEEDT 87
EDEE+EEEE++ DE DG E D+
Sbjct: 313 SEDEEVEEEEEED-DETDGLSPERDS 337
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 26.5 bits (59), Expect = 3.0
Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 1/61 (1%)
Query: 42 KNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDE 101
+ E D E + SE + E E EED L E+ ED ++ + ++ K K DE
Sbjct: 234 EREEETDVEIETTSETKGTKQEQEG-STEEDPSLFSEEKEDPDKTEDLDKLEILKEKKDE 292
Query: 102 E 102
E
Sbjct: 293 E 293
Score = 26.1 bits (58), Expect = 4.8
Identities = 17/70 (24%), Positives = 29/70 (41%), Gaps = 4/70 (5%)
Query: 39 FDAKNNEAPDSEDDEESEGDDYEGEDE----ELEEEEDDGLDEEDGEDEEEDTEGKEVRG 94
F K E ++E+ EE + E E + E+E D +E+ED + E
Sbjct: 222 FTKKLKETSETEEREEETDVEIETTSETKGTKQEQEGSTEEDPSLFSEEKEDPDKTEDLD 281
Query: 95 KKRKHDEEEE 104
K E+++
Sbjct: 282 KLEILKEKKD 291
>gnl|CDD|221855 pfam12921, ATP13, Mitochondrial ATPase expression. ATP13 is
necessary for the expression of subunit 9 of
mitochondrial ATPase. The protein has a basic amino
terminal signal sequence that is cleaved upon import
into mitochondria.
Length = 125
Score = 26.1 bits (58), Expect = 3.1
Identities = 8/39 (20%), Positives = 15/39 (38%)
Query: 12 FNNDVTTIENYREKVFELLPNLKYLDGFDAKNNEAPDSE 50
+ + +E Y E V+ + K + D + P S
Sbjct: 14 YEGQLELLEKYIETVWGIDSQGKSKERVDKGSPLYPSSR 52
>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
Length = 499
Score = 26.6 bits (58), Expect = 3.2
Identities = 10/31 (32%), Positives = 16/31 (51%)
Query: 41 AKNNEAPDSEDDEESEGDDYEGEDEELEEEE 71
A+ E + ED++ D E++E EE E
Sbjct: 80 AEAGEEAEEEDNDRECPDTEAEEEDEEEEIE 110
Score = 26.2 bits (57), Expect = 3.5
Identities = 19/49 (38%), Positives = 26/49 (53%), Gaps = 3/49 (6%)
Query: 61 EGEDEELEEEED--DGLDEEDGED-EEEDTEGKEVRGKKRKHDEEEEGE 106
+GE+ E E D L E GE+ EEED + + + + DEEEE E
Sbjct: 62 DGENSETERPRDLTAALFAEAGEEAEEEDNDRECPDTEAEEEDEEEEIE 110
Score = 25.0 bits (54), Expect = 9.2
Identities = 15/40 (37%), Positives = 21/40 (52%), Gaps = 4/40 (10%)
Query: 50 EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEG 89
E+D + E D E E+E+ EEE ++ D E D EG
Sbjct: 88 EEDNDRECPDTEAEEEDEEEE----IEAPDPEVNPLDAEG 123
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 26.3 bits (58), Expect = 3.3
Identities = 14/44 (31%), Positives = 24/44 (54%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
++ED + +E E+E L+ E ++ L+ E E EED +E
Sbjct: 379 EAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRE 422
Score = 25.9 bits (57), Expect = 5.1
Identities = 14/44 (31%), Positives = 21/44 (47%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
E ++E E +E D E L E+E + DE + EE+ E
Sbjct: 404 ERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEEEEE 447
Score = 25.5 bits (56), Expect = 6.0
Identities = 14/53 (26%), Positives = 25/53 (47%)
Query: 39 FDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
+ + ++E+ E+E + ED L E DE + EDE E+ + +E
Sbjct: 392 REEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDELEEAQPEE 444
>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
This model family represents the major capsid protein
component of the heads (capsids) of bacteriophage HK97,
phi-105, P27, and related phage. This model represents
one of several analogous families lacking detectable
sequence similarity. The gene encoding this component is
typically located in an operon encoding the small and
large terminase subunits, the portal protein and the
prohead or maturation protease [Mobile and
extrachromosomal element functions, Prophage functions].
Length = 384
Score = 26.2 bits (58), Expect = 3.4
Identities = 16/70 (22%), Positives = 25/70 (35%)
Query: 37 DGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
+ E +DE+ + E + + +EE D LD E EE E +
Sbjct: 6 EALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAAS 65
Query: 97 RKHDEEEEGE 106
+ EE E
Sbjct: 66 GEGGGGEEEE 75
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 26.3 bits (58), Expect = 3.4
Identities = 12/69 (17%), Positives = 30/69 (43%)
Query: 25 KVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEE 84
+V L + KY D N E + + ++ + ++ + E L ++ ++E+
Sbjct: 11 RVVRALESEKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEK 70
Query: 85 EDTEGKEVR 93
+ + K+V
Sbjct: 71 KKPKKKKVP 79
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 26.5 bits (58), Expect = 3.4
Identities = 15/52 (28%), Positives = 21/52 (40%)
Query: 51 DDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEE 102
D S D E ED+ E E+D + EE + E E G K ++
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKK 340
Score = 25.3 bits (55), Expect = 7.6
Identities = 13/51 (25%), Positives = 20/51 (39%)
Query: 48 DSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
DS + + E+ + + DE+ E EEE E + KK K
Sbjct: 289 DSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGK 339
>gnl|CDD|227496 COG5167, VID27, Protein involved in vacuole import and degradation
[Intracellular trafficking and secretion].
Length = 776
Score = 26.5 bits (58), Expect = 3.5
Identities = 10/46 (21%), Positives = 20/46 (43%)
Query: 54 ESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
E+ + +EE+E+ ED+ + D++E +K H
Sbjct: 376 ENRNPEESEHEEEVEDYEDENDHSKRICDDDELENHFRAADEKNSH 421
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 26.1 bits (58), Expect = 3.8
Identities = 18/33 (54%), Positives = 22/33 (66%), Gaps = 1/33 (3%)
Query: 61 EGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVR 93
E E+EE EEEE +EE E+EEE T +EVR
Sbjct: 304 EPEEEEEEEEEVPEEEEE-EEEEEERTFEEEVR 335
Score = 25.4 bits (56), Expect = 8.4
Identities = 14/33 (42%), Positives = 20/33 (60%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDE 77
P+ E++EE E + E E+EE EEEE +E
Sbjct: 301 SPPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
Score = 25.0 bits (55), Expect = 9.6
Identities = 14/38 (36%), Positives = 20/38 (52%), Gaps = 2/38 (5%)
Query: 46 APDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDE 83
P S + E E ++ E EE EEEE++ E E+E
Sbjct: 298 PPPSPPEPEEEEEEEEEVPEEEEEEEEEE--ERTFEEE 333
>gnl|CDD|218379 pfam05009, EBV-NA3, Epstein-Barr virus nuclear antigen 3 (EBNA-3).
This family contains EBNA-3A, -3B, and -3C which are
latent infection nuclear proteins important for
Epstein-Barr virus (EBV)-induced B-cell immortalisation
and the immune response to EBV infection.
Length = 254
Score = 26.2 bits (58), Expect = 3.9
Identities = 9/19 (47%), Positives = 11/19 (57%)
Query: 49 SEDDEESEGDDYEGEDEEL 67
E ESE + E +DEEL
Sbjct: 216 VETSSESEDSESESDDEEL 234
>gnl|CDD|240433 PTZ00482, PTZ00482, membrane-attack complex/perforin (MACPF)
Superfamily; Provisional.
Length = 844
Score = 26.0 bits (57), Expect = 4.2
Identities = 6/52 (11%), Positives = 21/52 (40%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
D ++E +D+E + + + +++ L + D + + + +
Sbjct: 88 DDDDDEFDFLYEDDEDDAGNATSGESSTDDDSLLELPDRDEDADTQANNDQT 139
Score = 25.2 bits (55), Expect = 9.0
Identities = 12/67 (17%), Positives = 25/67 (37%), Gaps = 2/67 (2%)
Query: 40 DAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKH 99
DA D +D + + D+ L++ + E+ E+++ TE
Sbjct: 129 DADTQANNDQTNDFDQDDSSNSQTDQGLKQSVNLSSAEKLIEEKKGQTENT--FKFYNFG 186
Query: 100 DEEEEGE 106
++ EE
Sbjct: 187 NDGEEAA 193
>gnl|CDD|235279 PRK04319, PRK04319, acetyl-CoA synthetase; Provisional.
Length = 570
Score = 26.0 bits (58), Expect = 4.5
Identities = 15/69 (21%), Positives = 26/69 (37%), Gaps = 15/69 (21%)
Query: 16 VTTIENYREKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDD-- 73
+TT K + LP+LK++ ++ EG + +E+ D+
Sbjct: 150 ITTPALLERKPADDLPSLKHVLLV-----------GEDVEEGPGTLDFNALMEQASDEFD 198
Query: 74 --GLDEEDG 80
D EDG
Sbjct: 199 IEWTDREDG 207
>gnl|CDD|235548 PRK05657, PRK05657, RNA polymerase sigma factor RpoS; Validated.
Length = 325
Score = 25.7 bits (57), Expect = 4.7
Identities = 6/40 (15%), Positives = 16/40 (40%)
Query: 49 SEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTE 88
+ + D E+ +E ++EE +++ + E
Sbjct: 2 TLKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEE 41
Score = 25.3 bits (56), Expect = 7.1
Identities = 7/47 (14%), Positives = 21/47 (44%)
Query: 44 NEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGK 90
+ +E+++ D+ E + + ++ + D +EE ++G
Sbjct: 1 MTLKVHDLNEDADFDENGVEVFDEKALVEEEPSDNDLAEEELLSQGA 47
>gnl|CDD|222927 PHA02774, PHA02774, E1; Provisional.
Length = 613
Score = 26.0 bits (58), Expect = 4.9
Identities = 16/62 (25%), Positives = 27/62 (43%), Gaps = 15/62 (24%)
Query: 52 DEESEGDDYEGEDEELEEEE-------DDGLDEEDGED-------EEEDTEGKEVRGKKR 97
+ E+E D E E E+L + DD + G E E+ E ++++ KR
Sbjct: 20 EAEAECSDGEDELEDLFDTGSDISDLIDDAEVVQGGNSLALFHQQEAEEDE-QQIQALKR 78
Query: 98 KH 99
K+
Sbjct: 79 KY 80
Score = 25.2 bits (56), Expect = 7.7
Identities = 11/39 (28%), Positives = 16/39 (41%), Gaps = 2/39 (5%)
Query: 52 DEESEGDDYEGEDEE--LEEEEDDGLDEEDGEDEEEDTE 88
+ +G D E L E E + D ED ++ DT
Sbjct: 1 MADDKGTDPEEGGSGWFLVEAEAECSDGEDELEDLFDTG 39
>gnl|CDD|237549 PRK13897, PRK13897, type IV secretion system component VirD4;
Provisional.
Length = 606
Score = 25.9 bits (57), Expect = 5.0
Identities = 12/35 (34%), Positives = 19/35 (54%)
Query: 39 FDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDD 73
+D + N SE+ E + EDEE ++E+DD
Sbjct: 572 YDPRANNNEASEETETITVPENNEEDEEDDDEDDD 606
>gnl|CDD|185764 cd09241, BRO1_ScRim20-like, Protein-interacting, N-terminal,
Bro1-like domain of Saccharomyces cerevisiae Rim20 and
related proteins. This family contains the N-terminal,
Bro1-like domain of Saccharomyces cerevisiae Rim20
(also known as PalA) and related proteins. It belongs
to the BRO1_Alix_like superfamily which also includes
the Bro1-like domains of mammalian Alix
(apoptosis-linked gene-2 interacting protein X),
His-Domain type N23 protein tyrosine phosphatase
(HD-PTP, also known as PTPN23), RhoA-binding proteins
Rhophilin-1 and -2, Brox, Saccharomyces cerevisiae
Bro1, Ustilago maydis Rim23 (also known as PalC), and
related domains. Alix, HD-PTP, Brox, Bro1, Rim20, and
Rim23, interact with the ESCRT (Endosomal Sorting
Complexes Required for Transport) system. Rim20 and
Rim23 participate in the response to the external pH
via the Rim101 pathway. Bro1-like domains are
boomerang-shaped, and part of the domain is a
tetratricopeptide repeat (TPR)-like structure.
Bro1-like domains bind components of the ESCRT-III
complex: Snf7 in the case of Rim20. RIM20, and some
other members of the BRO1_Alix_like superfamily
including Alix, also have a V-shaped (V) domain. In the
case of Alix, the V-domain is a dimerization domain
that also contains a binding site for the retroviral
late assembly (L) domain YPXnL motif, which is
partially conserved in the V-domain superfamily. Rim20
localizes to endosomes under alkaline pH conditions. By
binding Snf7, it may bring the protease Rim13 (a
YPxL-containing transcription factor) into proximity
with Rim101, and thus aid in the proteolytic activation
of the latter. Rim20 and other intermediates in the
Rim101 pathway play roles in the pathogenesis of fungal
corneal infection during Candida albicans keratitis.
Length = 355
Score = 25.7 bits (57), Expect = 5.0
Identities = 6/32 (18%), Positives = 12/32 (37%)
Query: 6 LKHVDLFNNDVTTIENYREKVFELLPNLKYLD 37
+ F +D+ I+ R P++ L
Sbjct: 28 FQTPSSFEDDLAEIDKLRNDAINPEPSVNGLS 59
>gnl|CDD|147601 pfam05505, Ebola_NP, Ebola nucleoprotein. This family consists of
Ebola and Marburg virus nucleoproteins. These proteins
are responsible for encapsidation of genomic RNA. It has
been found that nucleoprotein DNA vaccines can offer
protection from the virus.
Length = 717
Score = 26.1 bits (57), Expect = 5.1
Identities = 13/33 (39%), Positives = 18/33 (54%)
Query: 50 EDDEESEGDDYEGEDEELEEEEDDGLDEEDGED 82
ED+ E + DD + LE ++D G D E G D
Sbjct: 547 EDEPEDQTDDDDSSLPPLESDDDPGSDNEQGVD 579
>gnl|CDD|131585 TIGR02533, type_II_gspE, type II secretion system protein E. This
family describes GspE, the E protein of the type II
secretion system, also called the main terminal branch
of the general secretion pathway. This model separates
GspE from the PilB protein of type IV pilin biosynthesis
[Protein fate, Protein and peptide secretion and
trafficking, Cellular processes, Pathogenesis].
Length = 486
Score = 25.8 bits (57), Expect = 5.3
Identities = 12/28 (42%), Positives = 15/28 (53%)
Query: 53 EESEGDDYEGEDEELEEEEDDGLDEEDG 80
EE EG+D E E + +D LD ED
Sbjct: 78 EEIEGEDDLSALELDEPKIEDLLDLEDD 105
>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
transition [Transcription / Cell division and chromosome
partitioning].
Length = 423
Score = 25.8 bits (56), Expect = 5.5
Identities = 18/97 (18%), Positives = 34/97 (35%), Gaps = 11/97 (11%)
Query: 6 LKHVDLFNNDVTTIENYREKVFELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDE 65
L+ VDL ++D+ N + P+ + F+ +++ D E +S G+
Sbjct: 247 LEDVDLMDDDILGPSNEEMLYKYISPSQGSAELFE-ESSLGFDYEFIHKSVGNK------ 299
Query: 66 ELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEE 102
E G+ + D + K H E
Sbjct: 300 ----EIRGGISTGEMIDVRKLPCTNSSSNGKLAHGGE 332
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 25.8 bits (57), Expect = 5.7
Identities = 13/42 (30%), Positives = 24/42 (57%)
Query: 50 EDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKE 91
E+DEE D+ + E E EE+ ++E D ++ ++T+ K
Sbjct: 129 EEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKR 170
Score = 25.4 bits (56), Expect = 6.7
Identities = 14/60 (23%), Positives = 31/60 (51%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDEEEE 104
+ P ++E+ E + E + E + EE ++ ++ E+ +D+E E K RG + ++
Sbjct: 122 KRPKYPNEEDEEENVDEDDAEIILEEVEEEVEIEEVDDDEGTQETKYKRGDTSLTPQAKD 181
>gnl|CDD|222997 PHA03132, PHA03132, thymidine kinase; Provisional.
Length = 580
Score = 25.9 bits (57), Expect = 5.7
Identities = 10/41 (24%), Positives = 15/41 (36%), Gaps = 1/41 (2%)
Query: 48 DSEDDEESEGDDYEGEDEE-LEEEEDDGLDEEDGEDEEEDT 87
DE EG E D E + G E ++++D
Sbjct: 14 YDSSDESPEGSRDENFDAERDDFLTPLGSTSEATSEDDDDL 54
>gnl|CDD|235052 PRK02597, rpoC2, DNA-directed RNA polymerase subunit beta';
Provisional.
Length = 1331
Score = 25.7 bits (57), Expect = 5.7
Identities = 10/53 (18%), Positives = 21/53 (39%)
Query: 27 FELLPNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEED 79
E + +D + + D+E DD + LE +++GL ++
Sbjct: 1279 LEATRSRHGIDPSSSNFAAFARPDADDELLIDDQLPDPAALEGLQEEGLLSDE 1331
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 25.7 bits (57), Expect = 6.0
Identities = 15/42 (35%), Positives = 24/42 (57%), Gaps = 6/42 (14%)
Query: 61 EGEDEELEEEEDDGLDEEDGEDEEED-----TEGKEV-RGKK 96
E +D++ +E++D E GE+ + + TE EV RGKK
Sbjct: 200 EDDDDDEDEDDDGEGGGEGGEERQREHPFIVTEPGEVARGKK 241
>gnl|CDD|201520 pfam00945, Rhabdo_ncap, Rhabdovirus nucleocapsid protein. The
Nucleocapsid (N) Protein is said to have a "tight"
structure. The carboxyl end of the N-terminal domain
possesses an RNA binding domain. Sequence alignments
show 2 regions of reasonable conservation, approx.
64-103 and 201-329. A whole functional protein is
required for encapsidation to take place.
Length = 406
Score = 25.4 bits (56), Expect = 6.7
Identities = 7/38 (18%), Positives = 17/38 (44%), Gaps = 3/38 (7%)
Query: 60 YEGEDEELEEEEDDGLDEEDGEDE---EEDTEGKEVRG 94
+ E + E +DDG + + + +E + ++V
Sbjct: 341 FRDEKKYTEYRDDDGDNTDAPLADDGDDEAPKSRDVVE 378
>gnl|CDD|184899 PRK14906, PRK14906, DNA-directed RNA polymerase subunit beta'/alpha
domain fusion protein; Provisional.
Length = 1460
Score = 25.6 bits (56), Expect = 6.9
Identities = 10/42 (23%), Positives = 18/42 (42%)
Query: 52 DEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVR 93
D E D++ + + E + + E + EE E K +R
Sbjct: 181 DYVPEDDEFVDDIGDDERLTAEEVRAEVADIYEEYNERKALR 222
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 25.4 bits (55), Expect = 7.0
Identities = 19/58 (32%), Positives = 32/58 (55%), Gaps = 3/58 (5%)
Query: 50 EDDEESEGDDYEGEDEELEEEEDDGLDEED---GEDEEEDTEGKEVRGKKRKHDEEEE 104
E G++ + +DEEL+ +++ L+ + E E D + + + KKRK DEEEE
Sbjct: 483 VRYEHVAGEEDDDDDEELQAQKELELEAQGIKYSETSEADKDVNKSKNKKRKVDEEEE 540
>gnl|CDD|237084 PRK12378, PRK12378, hypothetical protein; Provisional.
Length = 235
Score = 25.1 bits (56), Expect = 7.2
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 61 EGEDEELEEEEDDGLDEEDGEDEEE 85
+ EDE LE D +D ED E+EE
Sbjct: 142 DDEDELLEALIDADVDVEDVEEEEG 166
>gnl|CDD|215153 PLN02271, PLN02271, serine hydroxymethyltransferase.
Length = 586
Score = 25.5 bits (56), Expect = 7.4
Identities = 14/39 (35%), Positives = 19/39 (48%)
Query: 63 EDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRKHDE 101
E EE EE D +EE GEDE G + K+ + +
Sbjct: 57 EKEEDAGEEGDEDEEEQGEDEHFSILGHPMCLKRPRDGD 95
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 25.4 bits (56), Expect = 8.0
Identities = 10/27 (37%), Positives = 17/27 (62%)
Query: 60 YEGEDEELEEEEDDGLDEEDGEDEEED 86
E ++ EEE++D +EE+ E+ EE
Sbjct: 342 VNPEQKDEEEEQEDEEEEEEEEEPEEP 368
Score = 25.0 bits (55), Expect = 8.7
Identities = 11/25 (44%), Positives = 17/25 (68%)
Query: 61 EGEDEELEEEEDDGLDEEDGEDEEE 85
+ E+EE E+EE++ +EE E E E
Sbjct: 347 KDEEEEQEDEEEEEEEEEPEEPEPE 371
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 25.4 bits (56), Expect = 8.6
Identities = 9/27 (33%), Positives = 16/27 (59%)
Query: 73 DGLDEEDGEDEEEDTEGKEVRGKKRKH 99
D L++ +GE+EE ++R +K K
Sbjct: 606 DELEQLEGENEENVPSDHQIRSRKAKR 632
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 25.3 bits (56), Expect = 8.7
Identities = 12/47 (25%), Positives = 18/47 (38%), Gaps = 3/47 (6%)
Query: 11 LFNNDVTTIENYREKVFELLPN-LKYLDGFDAKNNEAPDSEDDEESE 56
V +I Y + E LP + +D +DA E +D E
Sbjct: 894 FSEYGVASISMYNKASGEKLPQIVIIIDNYDAVKEEP--FYEDFEEL 938
>gnl|CDD|227880 COG5593, COG5593, Nucleic-acid-binding protein possibly involved in
ribosomal biogenesis [Translation, ribosomal structure
and biogenesis].
Length = 821
Score = 25.0 bits (54), Expect = 9.2
Identities = 23/76 (30%), Positives = 36/76 (47%), Gaps = 9/76 (11%)
Query: 31 PNLKYLDGFDAKNNEAPDSEDDEESEGDDYEGEDEEL--------EEEEDDGLDEEDGED 82
P L +D DAK+ + +S D EE + + D E EED+ +EE E+
Sbjct: 726 PKLDAIDDEDAKSEGSQES-DQEEGLDEIFYSFDGEQDNSDSFAESSEEDESSEEEKEEE 784
Query: 83 EEEDTEGKEVRGKKRK 98
E ++ K + K+RK
Sbjct: 785 ENKEVSAKRAKKKQRK 800
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 24.9 bits (55), Expect = 9.5
Identities = 18/52 (34%), Positives = 29/52 (55%), Gaps = 1/52 (1%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKK 96
+P E++E E E + LE++ D LDEE ++EE+D +E R K+
Sbjct: 18 PSPKEEEEEALELAIQEAHENALEKKLLDELDEEL-DEEEDDRFLEEYRRKR 68
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 25.1 bits (55), Expect = 9.7
Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 45 EAPDSEDDEESEGDDYEGEDEELEEEEDDGLDEEDGEDEEEDTEGKEVRGKKRK 98
S DE SEG E +D+ EE +D+ +G + E + K VR K++
Sbjct: 228 APEASRLDEMSEGLLEESDDDGEEESDDE--SAWEGFESEYEPINKPVRPKRKT 279
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.297 0.129 0.345
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,840,208
Number of extensions: 561981
Number of successful extensions: 5357
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2991
Number of HSP's successfully gapped: 1108
Length of query: 106
Length of database: 10,937,602
Length adjustment: 71
Effective length of query: 35
Effective length of database: 7,788,468
Effective search space: 272596380
Effective search space used: 272596380
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 44 (21.8 bits)
S2: 53 (24.3 bits)